TOP ACTIVATIONS
MAX = 22.761

clown s juggling and ac rob ats doing flips . But
of fun watching the ac rob ats and clown s .
thought they looked like ac rob ats . She enjoyed the
elephants , lions and ac rob ats . They are amazed
clown s and the ac rob ats and the lions and
the animals , the ac rob ats and the clown s
saw clown s and ac rob ats and lions and elephants
him from getting a sun burn . Tim my
made him an even y umm ier snack and they both
want Lily to get sun burn ed . L
she knew the word ' burn '. Her friends were so
don 't get a sun burn ." Lily yawn ed and
ice cream taste even y umm ier ! And every time
to make food look y umm ier than before . She
bakery to buy the y umm iest pastry he could find
and it was the y umm iest ice cream she ever
. It was the y umm iest thing he ever tasted
magic apple was the y umm iest thing they ever tasted
leaves to find the y umm iest one !" Lily smiled
her hair in a pony tail so she could look pretty

MIN ACTIVATIONS
MIN = -15.158

swing together . <|endoftext|> Once upon a time , there was
with snow . <|endoftext|> Once upon a time , there was
to her . <|endoftext|> Once upon a time there was a
to them . <|endoftext|> Once upon a time , there was
and dad . <|endoftext|> Once upon a time , there was
with them . <|endoftext|> Once upon a time , there was
much chalk ! <|endoftext|> Once upon a time , there was
with them . <|endoftext|> Once upon a time , there was
its creatures . <|endoftext|> Once upon a time , there was
, Lily ." <|endoftext|> Once upon a time , there was
to it . <|endoftext|> Once upon a time there was a
his toys . <|endoftext|> Once upon a time there was a
the rain ." <|endoftext|> Once upon a time , there was
very much . <|endoftext|> Once upon a time , there was
He said , " Once upon a time , there was
it forever . <|endoftext|> Once upon a time there were two
bad ending . <|endoftext|> Once upon a time , there was
any yogurt . <|endoftext|> Once upon a time , there was
toy box . <|endoftext|> Once upon a time , there was
friends forever . <|endoftext|> Once upon a time , a little

INTERVAL 13.281 - 22.761
CONTAINS 0.005%

he wanted some ice - cream but his mom said no
Alex for some ice - cream . He smiled and was
she could have the ash tr ay to put her rocks
mom for some ice - cream after her bike rides in
store and got the y umm iest ice cream they could

INTERVAL 3.802 - 13.281
CONTAINS 7.214%

little girl named Lily . She loved to play outside on
friendship can be found in many places . <|endoftext|> Once upon
little girl felt so much better . She still had her
The dove was so grateful for Grace 's help
medicine to make her feel better . After taking it ,

INTERVAL -5.678 - 3.802
CONTAINS 91.448%

around , laughing and having a lot of fun . It
a little bird saw a big log . The bird did
took Lily to the living room and gave her some milk
her shiny bracelet every day . One day , she showed
says . He pulls harder . " Stop it

INTERVAL -15.158 - -5.678
CONTAINS 1.331%

time , there was a little girl named Lily . She
, but a very scary dog . I 'm sorry it
ie felt very scared and didn 't know what to do
time , there was a little girl named Lily . She
time , there was a little girl named Lily . She
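
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The source does not state the binning rule, so the sketch below treats it as an assumption and simply recomputes the boundaries from the reported MIN and MAX. The names `bin_edges`, `N_BINS`, `reported`, and `shares` are hypothetical and used only for illustration.

```python
# Values copied from the MIN, MAX, INTERVAL and CONTAINS lines above.
MIN_ACT = -15.158
MAX_ACT = 22.761
N_BINS = 4  # assumption: four equal-width bins, inferred from the numbers


def bin_edges(lo, hi, n_bins):
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


edges = bin_edges(MIN_ACT, MAX_ACT, N_BINS)

# Boundaries as printed in the INTERVAL lines (rounded to 3 decimals there).
reported = [-15.158, -5.678, 3.802, 13.281, 22.761]
max_gap = max(abs(e - r) for e, r in zip(edges, reported))

# CONTAINS percentages, ordered from the lowest interval to the highest.
shares = [1.331, 91.448, 7.214, 0.005]
total_share = sum(shares)

print("recomputed edges:", edges)
print("largest gap to reported edges:", max_gap)
print("sum of CONTAINS shares:", total_share)
```

Under that assumption the recomputed edges agree with the printed ones to within display rounding (less than 0.001), and the four CONTAINS shares sum to 99.998%, so the intervals cover essentially all sampled activations.
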

TOP ACTIVATIONS: PER-TOKEN DETAILS

Token clown
Token ID25573
Feature activation-2.77e+00

Pos logit contributions
 alone+0.103
ges+0.097
apping+0.096
 War+0.095
umpy+0.094
Neg logit contributions
 flavors-0.109
 scar-0.102
mm-0.101
 fries-0.098
 cones-0.095
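
Each per-token record fuses a label or token string directly onto its signed value (for example ` alone+0.103` or `Feature activation-2.77e+00`), and a leading space belongs to the token itself. A minimal sketch, assuming every such line ends in a signed decimal with an optional exponent, splits a line into its text and its number without dropping that space. The function name `split_contribution` is hypothetical.

```python
import re

# Text part (non-greedy) followed by a signed number at the end of the line.
# Assumption: values always look like +0.103, -0.109 or -2.77e+00.
_FUSED = re.compile(r"(.*?)([+-]\d+(?:\.\d+)?(?:e[+-]?\d+)?)")


def split_contribution(line):
    """Split a fused 'text+value' line into (text, float value).

    Leading whitespace in the text is preserved, because in this dump a
    leading space is part of the token (' alone' and 'alone' differ).
    """
    match = _FUSED.fullmatch(line)
    if match is None:
        raise ValueError(f"no trailing signed number in {line!r}")
    text, number = match.groups()
    return text, float(number)


# Lines copied from the first record above.
for raw in [" alone+0.103", "ges+0.097", " flavors-0.109",
            "Feature activation-2.77e+00"]:
    print(repr(raw), "->", split_contribution(raw))
```

The non-greedy text group combined with `fullmatch` makes the split land on the first sign that begins a complete trailing number, so the `+` inside an exponent such as `e+00` is not mistaken for the separator.
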
Tokens
Token ID82
Feature activation-4.09e+00

Pos logit contributions
 alone+0.113
 War+0.109
umpy+0.108
ges+0.105
 nowhere+0.102
Neg logit contributions
 flavors-0.078
 scar-0.075
 fries-0.071
 cones-0.070
mm-0.068
Token juggling
Token ID48097
Feature activation-1.30e+00

Pos logit contributions
 alone+0.123
ges+0.116
 War+0.115
umpy+0.113
apping+0.113
Neg logit contributions
 flavors-0.160
 scar-0.151
mm-0.149
 fries-0.146
 cones-0.144
Token and
Token ID290
Feature activation+2.19e-01

Pos logit contributions
 alone+0.044
ges+0.043
apping+0.043
umpy+0.042
 War+0.042
Neg logit contributions
 flavors-0.046
 scar-0.044
 cones-0.042
 chips-0.041
 fries-0.041
Token ac
Token ID936
Feature activation+7.92e+00

Pos logit contributions
mm+0.009
 flavors+0.009
 scar+0.009
 extract+0.008
 fries+0.008
Neg logit contributions
 alone-0.006
ges-0.006
 War-0.005
apping-0.005
 crying-0.005
Tokenrob
Token ID22609
Feature activation+2.28e+01

Pos logit contributions
 flavors+0.093
 scar+0.082
mm+0.078
 fries+0.075
 french+0.066
Neg logit contributions
 alone-0.458
ges-0.442
umpy-0.433
apping-0.432
 War-0.430
Tokenats
Token ID1381
Feature activation+2.45e+00

Pos logit contributions
 flavors+0.600
mm+0.548
 scar+0.543
 fries+0.510
 cones+0.497
Neg logit contributions
 alone-0.990
ges-0.946
 War-0.937
apping-0.929
umpy-0.921
Token doing
Token ID1804
Feature activation+1.04e+00

Pos logit contributions
 flavors+0.093
 scar+0.087
 fries+0.085
mm+0.084
 chips+0.084
Neg logit contributions
 alone-0.075
 War-0.071
umpy-0.071
apping-0.070
ges-0.070
Token flips
Token ID45971
Feature activation-4.54e+00

Pos logit contributions
 flavors+0.038
mm+0.038
 scar+0.035
 fries+0.035
 extract+0.034
Neg logit contributions
 alone-0.034
 War-0.031
ges-0.031
apping-0.030
umpy-0.029
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
apping+0.133
 alone+0.133
ges+0.131
umpy+0.131
 War+0.130
Neg logit contributions
 flavors-0.171
 scar-0.165
 chips-0.155
 cones-0.155
 fries-0.149
Token But
Token ID887
Feature activation+4.91e-01

Pos logit contributions
 flavors+0.008
mm+0.008
 scar+0.007
 fries+0.007
 cones+0.007
Neg logit contributions
 alone-0.008
ges-0.007
umpy-0.007
apping-0.007
 War-0.007
Token of
Token ID286
Feature activation-4.37e+00

Pos logit contributions
 flavors+0.120
 scar+0.112
 cones+0.111
mm+0.110
 fries+0.109
Neg logit contributions
 alone-0.042
ges-0.040
 War-0.038
apping-0.037
umpy-0.036
Token fun
Token ID1257
Feature activation+2.70e+00

Pos logit contributions
 alone+0.153
 War+0.144
 by+0.140
 outside+0.135
 here+0.126
Neg logit contributions
comb-0.145
mm-0.142
 extract-0.140
burn-0.135
 chip-0.126
Token watching
Token ID4964
Feature activation-3.37e+00

Pos logit contributions
 flavors+0.113
 scar+0.110
mm+0.110
 fries+0.106
 extract+0.104
Neg logit contributions
 alone-0.074
apping-0.069
ges-0.069
 War-0.066
 by-0.063
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
 alone+0.108
apping+0.104
ges+0.103
umpy+0.103
 War+0.099
Neg logit contributions
 flavors-0.124
 scar-0.118
 fries-0.114
mm-0.114
 chips-0.112
Token ac
Token ID936
Feature activation+7.16e+00

Pos logit contributions
 flavors+0.035
mm+0.033
 scar+0.033
 extract+0.032
 fries+0.031
Neg logit contributions
 alone-0.041
 War-0.039
ges-0.038
umpy-0.037
apping-0.037
Tokenrob
Token ID22609
Feature activation+2.11e+01

Pos logit contributions
 flavors+0.057
mm+0.042
 scar+0.040
 fries+0.030
 cones+0.028
Neg logit contributions
 alone-0.442
ges-0.431
apping-0.427
 War-0.426
umpy-0.420
Tokenats
Token ID1381
Feature activation+1.67e+00

Pos logit contributions
 flavors+0.543
mm+0.496
 scar+0.484
 fries+0.464
 cones+0.451
Neg logit contributions
 alone-0.935
ges-0.899
 War-0.887
apping-0.878
umpy-0.870
Token and
Token ID290
Feature activation-8.36e-01

Pos logit contributions
 flavors+0.066
mm+0.063
 scar+0.062
 fries+0.059
 extract+0.058
Neg logit contributions
 alone-0.052
ges-0.048
apping-0.047
 War-0.047
umpy-0.045
Token clown
Token ID25573
Feature activation-3.79e+00

Pos logit contributions
 alone+0.027
ges+0.024
 War+0.024
 crying+0.023
 by+0.023
Neg logit contributions
mm-0.032
 flavors-0.029
 extract-0.028
 scar-0.027
rob-0.027
Tokens
Token ID82
Feature activation-5.18e+00

Pos logit contributions
 alone+0.130
ges+0.124
apping+0.123
 War+0.119
umpy+0.117
Neg logit contributions
 flavors-0.135
mm-0.128
 scar-0.126
 fries-0.119
 extract-0.118
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
 alone+0.150
apping+0.134
ges+0.133
 War+0.130
 by+0.126
Neg logit contributions
 flavors-0.212
mm-0.208
 scar-0.206
 fries-0.197
 extract-0.196
Token thought
Token ID1807
Feature activation-6.06e-01

Pos logit contributions
 alone+0.063
ges+0.057
apping+0.057
 War+0.056
umpy+0.056
Neg logit contributions
 flavors-0.066
mm-0.064
 scar-0.061
 fries-0.061
 french-0.059
Token they
Token ID484
Feature activation+2.37e+00

Pos logit contributions
 alone+0.020
umpy+0.020
ges+0.020
 War+0.019
apping+0.019
Neg logit contributions
 flavors-0.021
 scar-0.020
mm-0.019
 chips-0.019
 fries-0.019
Token looked
Token ID3114
Feature activation+2.08e+00

Pos logit contributions
mm+0.082
 fries+0.075
 flavors+0.074
 french+0.073
rob+0.070
Neg logit contributions
 alone-0.092
 nowhere-0.081
ges-0.080
 War-0.080
 outside-0.079
Token like
Token ID588
Feature activation-1.36e+00

Pos logit contributions
 scar+0.066
 flavors+0.063
 fries+0.054
 french+0.053
 tast+0.052
Neg logit contributions
ges-0.081
apping-0.078
unk-0.077
umpy-0.076
 War-0.075
Token ac
Token ID936
Feature activation+7.08e+00

Pos logit contributions
 alone+0.046
ges+0.043
 War+0.042
apping+0.042
umpy+0.041
Neg logit contributions
 flavors-0.049
mm-0.046
 scar-0.046
 fries-0.044
 cones-0.043
Tokenrob
Token ID22609
Feature activation+2.10e+01

Pos logit contributions
 flavors+0.130
 scar+0.117
 fries+0.115
mm+0.110
 french+0.107
Neg logit contributions
 alone-0.366
umpy-0.346
ges-0.346
apping-0.339
 War-0.337
Tokenats
Token ID1381
Feature activation+1.71e+00

Pos logit contributions
 flavors+0.548
mm+0.514
 scar+0.485
 fries+0.457
 chips+0.452
Neg logit contributions
 alone-0.931
 War-0.890
ges-0.884
umpy-0.862
apping-0.858
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
 flavors+0.074
mm+0.071
 scar+0.069
 fries+0.067
 extract+0.067
Neg logit contributions
 alone-0.046
ges-0.042
 War-0.041
apping-0.041
umpy-0.040
Token She
Token ID1375
Feature activation+3.12e+00

Pos logit contributions
 alone+0.010
ges+0.010
apping+0.009
 War+0.009
umpy+0.009
Neg logit contributions
 flavors-0.013
 scar-0.013
mm-0.012
 fries-0.012
 cones-0.012
Token enjoyed
Token ID8359
Feature activation+2.08e+00

Pos logit contributions
mm+0.118
 flavors+0.111
 extract+0.106
 fare+0.106
 fries+0.105
Neg logit contributions
 alone-0.103
 nowhere-0.094
apping-0.089
 by-0.089
 War-0.088
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
 flavors+0.096
 scar+0.092
mm+0.091
 fries+0.088
 cones+0.087
Neg logit contributions
 alone-0.049
ges-0.045
umpy-0.045
apping-0.045
 War-0.044
Token elephants
Token ID25368
Feature activation+4.94e+00

Pos logit contributions
 alone+0.038
apping+0.036
ges+0.036
umpy+0.035
 War+0.035
Neg logit contributions
 flavors-0.042
 scar-0.040
mm-0.038
 fries-0.038
 cones-0.037
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
 flavors+0.180
 scar+0.169
mm+0.167
 fries+0.162
 cones+0.159
Neg logit contributions
 alone-0.164
ges-0.154
apping-0.152
 War-0.152
umpy-0.150
Token lions
Token ID31420
Feature activation+2.76e+00

Pos logit contributions
 alone+0.032
ges+0.030
 War+0.029
apping+0.029
umpy+0.028
Neg logit contributions
 flavors-0.049
mm-0.046
 scar-0.046
 fries-0.044
 extract-0.044
Token and
Token ID290
Feature activation-6.78e-01

Pos logit contributions
 flavors+0.089
 scar+0.084
 cones+0.082
mm+0.082
 fries+0.080
Neg logit contributions
 alone-0.101
umpy-0.097
 War-0.097
apping-0.096
ges-0.095
Token ac
Token ID936
Feature activation+7.13e+00

Pos logit contributions
 alone+0.023
 War+0.021
ges+0.021
 crying+0.020
umpy+0.019
Neg logit contributions
mm-0.024
 flavors-0.023
 extract-0.023
 scar-0.021
 fries-0.021
Tokenrob
Token ID22609
Feature activation+2.09e+01

Pos logit contributions
 flavors+0.097
 scar+0.082
mm+0.082
 fries+0.078
 french+0.075
Neg logit contributions
 alone-0.390
ges-0.383
 War-0.368
apping-0.365
umpy-0.365
Tokenats
Token ID1381
Feature activation+1.53e+00

Pos logit contributions
 flavors+0.526
 scar+0.477
mm+0.476
 fries+0.447
 cones+0.433
Neg logit contributions
 alone-0.933
ges-0.893
 War-0.882
apping-0.880
umpy-0.871
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
 flavors+0.055
 scar+0.051
 fries+0.050
mm+0.050
 cones+0.049
Neg logit contributions
 alone-0.051
apping-0.048
umpy-0.048
 War-0.048
ges-0.047
Token They
Token ID1119
Feature activation+3.21e+00

Pos logit contributions
 alone+0.011
ges+0.010
apping+0.010
 War+0.010
umpy+0.010
Neg logit contributions
 flavors-0.009
 scar-0.008
mm-0.008
 fries-0.008
 cones-0.008
Token are
Token ID389
Feature activation-5.97e-01

Pos logit contributions
mm+0.114
 flavors+0.110
 extract+0.102
 scar+0.099
 chips+0.098
Neg logit contributions
 alone-0.117
ges-0.105
 War-0.102
apping-0.098
 by-0.096
Token amazed
Token ID24562
Feature activation-3.00e+00

Pos logit contributions
 alone+0.017
ges+0.016
apping+0.016
 War+0.016
umpy+0.016
Neg logit contributions
 flavors-0.024
 scar-0.023
mm-0.023
 fries-0.022
 cones-0.022
Token clown
Token ID25573
Feature activation-3.65e+00

Pos logit contributions
 flavors+0.040
 scar+0.037
mm+0.036
 fries+0.035
 cones+0.034
Neg logit contributions
 alone-0.044
ges-0.043
apping-0.043
 War-0.042
umpy-0.041
Tokens
Token ID82
Feature activation-5.25e+00

Pos logit contributions
 alone+0.108
ges+0.102
 War+0.102
umpy+0.101
apping+0.100
Neg logit contributions
 flavors-0.143
 scar-0.136
mm-0.133
 fries-0.131
 cones-0.130
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
umpy+0.146
 War+0.144
ges+0.144
 alone+0.144
apping+0.138
Neg logit contributions
 flavors-0.210
 scar-0.199
 cones-0.194
mm-0.188
 chips-0.187
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
 alone+0.037
ges+0.032
 crying+0.031
 War+0.030
 nowhere+0.030
Neg logit contributions
mm-0.053
 flavors-0.049
 extract-0.047
 scar-0.045
 french-0.045
Token ac
Token ID936
Feature activation+7.24e+00

Pos logit contributions
 flavors+0.039
 scar+0.037
mm+0.036
 fries+0.035
 cones+0.034
Neg logit contributions
 alone-0.041
ges-0.039
apping-0.038
 War-0.038
umpy-0.038
Tokenrob
Token ID22609
Feature activation+2.08e+01

Pos logit contributions
 flavors+0.088
 scar+0.071
mm+0.071
 fries+0.060
 cones+0.057
Neg logit contributions
 alone-0.418
ges-0.405
apping-0.402
 War-0.401
umpy-0.397
Tokenats
Token ID1381
Feature activation+1.33e+00

Pos logit contributions
 flavors+0.501
mm+0.451
 scar+0.426
 fries+0.409
 cones+0.406
Neg logit contributions
 alone-0.963
 War-0.920
ges-0.920
apping-0.900
umpy-0.895
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 flavors+0.049
 scar+0.046
mm+0.045
 fries+0.045
 cones+0.044
Neg logit contributions
 alone-0.042
apping-0.039
ges-0.039
 War-0.039
umpy-0.039
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
 alone+0.032
ges+0.025
 crying+0.025
 War+0.024
 nowhere+0.023
Neg logit contributions
mm-0.063
 flavors-0.056
 extract-0.056
rob-0.053
moon-0.053
Token lions
Token ID31420
Feature activation+2.77e+00

Pos logit contributions
 flavors+0.037
mm+0.037
 scar+0.035
 fries+0.033
 extract+0.033
Neg logit contributions
 alone-0.039
 War-0.036
ges-0.035
apping-0.035
umpy-0.034
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
 flavors+0.103
 scar+0.097
mm+0.096
 fries+0.093
 cones+0.092
Neg logit contributions
 alone-0.088
ges-0.083
apping-0.083
 War-0.082
umpy-0.082
Token the
Token ID262
Feature activation+6.88e-01

Pos logit contributions
mm+0.028
 extract+0.027
 flavors+0.026
 feast+0.025
 fiber+0.025
Neg logit contributions
 alone-0.018
 by-0.016
ges-0.015
 War-0.015
 outside-0.014
Token animals
Token ID4695
Feature activation-1.80e+00

Pos logit contributions
 flavors+0.024
mm+0.023
 scar+0.023
 fries+0.022
 extract+0.022
Neg logit contributions
 alone-0.023
 War-0.022
ges-0.022
apping-0.021
umpy-0.021
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
 alone+0.045
ges+0.041
apping+0.040
 War+0.040
umpy+0.039
Neg logit contributions
 flavors-0.080
 scar-0.077
mm-0.077
 fries-0.074
 french-0.073
Token the
Token ID262
Feature activation+6.67e-01

Pos logit contributions
 alone+0.047
ges+0.042
 War+0.042
apping+0.040
umpy+0.040
Neg logit contributions
 flavors-0.064
mm-0.062
 scar-0.059
 extract-0.058
 fries-0.057
Token ac
Token ID936
Feature activation+6.54e+00

Pos logit contributions
 flavors+0.024
 scar+0.022
mm+0.022
 fries+0.021
 cones+0.021
Neg logit contributions
 alone-0.023
ges-0.022
apping-0.021
 War-0.021
umpy-0.021
Tokenrob
Token ID22609
Feature activation+2.00e+01

Pos logit contributions
 flavors+0.063
 scar+0.050
mm+0.046
 fries+0.041
 french+0.036
Neg logit contributions
 alone-0.392
ges-0.379
apping-0.374
umpy-0.373
 War-0.373
Tokenats
Token ID1381
Feature activation+1.05e+00

Pos logit contributions
 flavors+0.488
mm+0.442
 scar+0.427
 fries+0.407
 cones+0.397
Neg logit contributions
 alone-0.919
ges-0.879
 War-0.874
apping-0.865
umpy-0.856
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
 flavors+0.043
mm+0.041
 scar+0.041
 french+0.039
 fries+0.038
Neg logit contributions
 alone-0.031
ges-0.029
apping-0.028
 War-0.027
umpy-0.026
Token the
Token ID262
Feature activation+6.43e-01

Pos logit contributions
 alone+0.026
ges+0.022
 by+0.021
 War+0.021
 crying+0.019
Neg logit contributions
mm-0.049
 flavors-0.044
 extract-0.042
rob-0.042
 feast-0.040
Token clown
Token ID25573
Feature activation-4.38e+00

Pos logit contributions
mm+0.024
 flavors+0.023
 extract+0.021
 scar+0.021
 fries+0.021
Neg logit contributions
 alone-0.021
 War-0.019
ges-0.018
umpy-0.018
apping-0.018
Tokens
Token ID82
Feature activation-5.78e+00

Pos logit contributions
 alone+0.141
ges+0.137
apping+0.134
 War+0.128
 by+0.127
Neg logit contributions
 flavors-0.160
mm-0.157
 scar-0.154
 extract-0.144
 french-0.143
Token saw
Token ID2497
Feature activation-4.41e+00

Pos logit contributions
 flavors+0.034
 scar+0.032
mm+0.032
 fries+0.031
 cones+0.030
Neg logit contributions
 alone-0.037
ges-0.035
apping-0.034
 War-0.034
umpy-0.034
Token clown
Token ID25573
Feature activation-4.26e+00

Pos logit contributions
 alone+0.150
apping+0.149
umpy+0.148
ges+0.146
 War+0.141
Neg logit contributions
 flavors-0.154
 scar-0.146
 fries-0.143
 cones-0.139
 chips-0.138
Tokens
Token ID82
Feature activation-5.89e+00

Pos logit contributions
 alone+0.172
 War+0.165
umpy+0.164
ges+0.160
 nowhere+0.159
Neg logit contributions
 flavors-0.122
 scar-0.118
 cones-0.112
 fries-0.111
mm-0.108
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
 alone+0.164
umpy+0.161
 War+0.161
ges+0.158
apping+0.154
Neg logit contributions
 flavors-0.240
 scar-0.224
 cones-0.221
mm-0.217
 fries-0.216
Token ac
Token ID936
Feature activation+6.68e+00

Pos logit contributions
 alone+0.059
ges+0.055
 War+0.053
 crying+0.051
 nowhere+0.050
Neg logit contributions
mm-0.065
 flavors-0.062
 extract-0.058
 scar-0.057
 fare-0.055
Tokenrob
Token ID22609
Feature activation+1.99e+01

Pos logit contributions
 flavors+0.102
 scar+0.087
mm+0.086
 fries+0.080
 french+0.074
Neg logit contributions
 alone-0.364
ges-0.353
apping-0.346
 War-0.345
umpy-0.344
Tokenats
Token ID1381
Feature activation+1.07e+00

Pos logit contributions
 flavors+0.498
mm+0.451
 scar+0.446
 fries+0.418
 cones+0.409
Neg logit contributions
 alone-0.898
ges-0.857
 War-0.851
apping-0.844
umpy-0.837
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
 flavors+0.040
 scar+0.037
 cones+0.037
 fries+0.037
 chips+0.036
Neg logit contributions
 alone-0.032
apping-0.031
umpy-0.031
 War-0.031
ges-0.030
Token lions
Token ID31420
Feature activation+2.17e+00

Pos logit contributions
 alone+0.062
 War+0.056
ges+0.056
 crying+0.054
 nowhere+0.051
Neg logit contributions
mm-0.064
 flavors-0.059
 extract-0.058
 fare-0.053
moon-0.051
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
 flavors+0.081
 scar+0.077
mm+0.075
 cones+0.074
 fries+0.074
Neg logit contributions
 alone-0.068
ges-0.065
apping-0.065
 War-0.064
umpy-0.064
Token elephants
Token ID25368
Feature activation+4.30e+00

Pos logit contributions
 alone+0.065
ges+0.060
 War+0.059
apping+0.056
umpy+0.056
Neg logit contributions
mm-0.061
 flavors-0.061
 extract-0.056
 scar-0.055
 fries-0.053
Token him
Token ID683
Feature activation-3.25e+00

Pos logit contributions
 alone+0.034
ges+0.030
 War+0.030
apping+0.028
umpy+0.028
Neg logit contributions
 flavors-0.056
mm-0.053
 scar-0.053
 fries-0.051
 french-0.050
Token from
Token ID422
Feature activation+9.75e-02

Pos logit contributions
 alone+0.102
ges+0.091
apping+0.089
umpy+0.088
 War+0.086
Neg logit contributions
 flavors-0.131
mm-0.125
 scar-0.122
 fries-0.119
 extract-0.117
Token getting
Token ID1972
Feature activation+3.79e-01

Pos logit contributions
 flavors+0.004
 scar+0.004
mm+0.004
 fries+0.004
 cones+0.003
Neg logit contributions
 alone-0.003
ges-0.003
apping-0.003
 War-0.003
umpy-0.003
Token a
Token ID257
Feature activation+2.10e+00

Pos logit contributions
 flavors+0.015
mm+0.014
 scar+0.014
 fries+0.013
 extract+0.013
Neg logit contributions
 alone-0.012
ges-0.011
 War-0.011
apping-0.011
umpy-0.010
Token sun
Token ID4252
Feature activation-3.35e-01

Pos logit contributions
 flavors+0.072
mm+0.070
 scar+0.065
 fries+0.064
 cones+0.062
Neg logit contributions
 alone-0.075
ges-0.069
apping-0.069
 War-0.068
umpy-0.068
Tokenburn
Token ID10899
Feature activation+1.72e+01

Pos logit contributions
 alone+0.015
 by+0.014
apping+0.013
ges+0.013
 outside+0.013
Neg logit contributions
 flavors-0.010
mm-0.009
 fries-0.009
 fiber-0.008
 french-0.008
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 flavors+0.750
 scar+0.702
mm+0.699
 fries+0.678
 french+0.669
Neg logit contributions
 alone-0.501
ges-0.458
apping-0.427
umpy-0.425
 by-0.414
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
 flavors+0.011
 scar+0.010
mm+0.010
 fries+0.010
 cones+0.009
Neg logit contributions
 alone-0.008
ges-0.008
apping-0.008
 War-0.008
umpy-0.008
Token
Token ID198
Feature activation-1.63e-01

Pos logit contributions
 alone+0.005
ges+0.004
apping+0.004
 War+0.004
umpy+0.004
Neg logit contributions
 flavors-0.005
mm-0.005
 scar-0.005
 cones-0.005
 fries-0.005
TokenTim
Token ID14967
Feature activation-1.89e+00

Pos logit contributions
 alone+0.006
ges+0.006
apping+0.006
unk+0.006
 War+0.006
Neg logit contributions
 flavors-0.005
mm-0.005
 scar-0.005
 chips-0.005
 cones-0.005
Tokenmy
Token ID1820
Feature activation+2.24e-01

Pos logit contributions
 alone+0.048
umpy+0.042
 by+0.041
ges+0.040
apping+0.039
Neg logit contributions
 flavors-0.086
mm-0.084
 fries-0.082
 chips-0.080
 french-0.080
Token made
Token ID925
Feature activation+3.05e+00

Pos logit contributions
mm+0.041
 flavors+0.041
 extract+0.038
 scar+0.037
 cones+0.037
Neg logit contributions
 alone-0.042
 War-0.037
ges-0.036
umpy-0.035
 crying-0.035
Token him
Token ID683
Feature activation-4.80e+00

Pos logit contributions
 flavors+0.104
 scar+0.102
 chips+0.095
 fries+0.094
 french+0.090
Neg logit contributions
umpy-0.104
apping-0.103
 alone-0.103
ges-0.103
 War-0.097
Token an
Token ID281
Feature activation-8.10e-02

Pos logit contributions
 alone+0.172
ges+0.156
apping+0.155
 War+0.154
umpy+0.151
Neg logit contributions
 flavors-0.167
mm-0.160
 scar-0.154
 fries-0.149
 extract-0.147
Token even
Token ID772
Feature activation+5.81e+00

Pos logit contributions
ges+0.003
 alone+0.003
apping+0.003
 War+0.003
umpy+0.002
Neg logit contributions
 flavors-0.003
 scar-0.003
mm-0.003
 fries-0.003
 french-0.003
Token y
Token ID331
Feature activation+4.02e+00

Pos logit contributions
 flavors+0.192
mm+0.187
 scar+0.176
 fries+0.174
 extract+0.172
Neg logit contributions
 alone-0.217
 War-0.197
apping-0.193
umpy-0.193
ges-0.192
Tokenumm
Token ID13929
Feature activation+1.66e+01

Pos logit contributions
 flavors+0.154
 scar+0.145
mm+0.144
 fries+0.139
 cones+0.136
Neg logit contributions
 alone-0.126
ges-0.119
apping-0.117
 War-0.116
umpy-0.115
Tokenier
Token ID959
Feature activation+7.98e+00

Pos logit contributions
 scar+0.631
 flavors+0.606
 fries+0.594
mm+0.565
 chips+0.555
Neg logit contributions
 alone-0.529
ges-0.528
apping-0.505
 by-0.490
umpy-0.485
Token snack
Token ID26906
Feature activation+4.36e+00

Pos logit contributions
mm+0.297
 scar+0.288
 flavors+0.287
 fries+0.267
 cones+0.266
Neg logit contributions
 alone-0.257
ges-0.229
 by-0.222
umpy-0.221
 War-0.214
Token and
Token ID290
Feature activation-7.15e-01

Pos logit contributions
 scar+0.183
 flavors+0.182
mm+0.179
 fries+0.171
rob+0.168
Neg logit contributions
 alone-0.119
ges-0.106
 by-0.106
apping-0.101
umpy-0.096
Token they
Token ID484
Feature activation+2.56e+00

Pos logit contributions
 alone+0.025
 by+0.023
ges+0.022
 crying+0.022
 War+0.021
Neg logit contributions
mm-0.023
 scar-0.022
 fare-0.022
 flavors-0.022
rob-0.022
Token both
Token ID1111
Feature activation+2.16e+00

Pos logit contributions
mm+0.094
 flavors+0.087
rob+0.084
 scar+0.083
 extract+0.083
Neg logit contributions
 alone-0.096
ges-0.078
 by-0.077
 nowhere-0.073
 outside-0.072
Token want
Token ID765
Feature activation-3.19e+00

Pos logit contributions
 alone+0.171
umpy+0.152
ges+0.152
 War+0.148
apping+0.144
Neg logit contributions
 flavors-0.170
mm-0.168
 scar-0.153
 fries-0.152
 cones-0.152
Token Lily
Token ID20037
Feature activation-1.48e+00

Pos logit contributions
 alone+0.096
ges+0.090
 War+0.088
apping+0.085
umpy+0.084
Neg logit contributions
 flavors-0.126
mm-0.121
 scar-0.119
 fries-0.114
 cones-0.112
Token to
Token ID284
Feature activation+2.13e+00

Pos logit contributions
 alone+0.046
ges+0.043
apping+0.041
umpy+0.041
 War+0.041
Neg logit contributions
 flavors-0.058
mm-0.055
 scar-0.055
 fries-0.053
 french-0.051
Token get
Token ID651
Feature activation-5.56e-02

Pos logit contributions
 flavors+0.076
rob+0.074
mm+0.072
 scar+0.070
 french+0.067
Neg logit contributions
 alone-0.082
ges-0.070
 by-0.068
 crying-0.068
 War-0.068
Token sun
Token ID4252
Feature activation-6.78e-01

Pos logit contributions
apping+0.002
 alone+0.002
ges+0.002
umpy+0.002
 War+0.002
Neg logit contributions
 flavors-0.002
 scar-0.002
 fries-0.002
mm-0.002
 chips-0.002
Tokenburn
Token ID10899
Feature activation+1.65e+01

Pos logit contributions
 alone+0.030
 by+0.030
 outside+0.028
ges+0.028
apping+0.027
Neg logit contributions
 flavors-0.019
mm-0.017
 fiber-0.016
 fries-0.016
 french-0.016
Tokened
Token ID276
Feature activation-4.53e-01

Pos logit contributions
 flavors+0.759
 scar+0.722
mm+0.712
 fries+0.693
 french+0.688
Neg logit contributions
 alone-0.405
ges-0.390
apping-0.362
umpy-0.350
 War-0.348
Token.
Token ID13
Feature activation-6.33e-02

Pos logit contributions
 alone+0.013
ges+0.012
apping+0.012
 War+0.012
umpy+0.011
Neg logit contributions
 flavors-0.019
 scar-0.018
mm-0.018
 fries-0.017
 cones-0.017
Token
Token ID198
Feature activation-4.96e-01

Pos logit contributions
ges+0.002
 alone+0.002
apping+0.002
umpy+0.002
udge+0.002
Neg logit contributions
mm-0.002
 flavors-0.002
 scar-0.002
 extract-0.002
 chips-0.002
Token
Token ID198
Feature activation-4.88e-01

Pos logit contributions
 alone+0.019
apping+0.018
ges+0.018
 War+0.017
AR+0.017
Neg logit contributions
mm-0.013
 flavors-0.013
 scar-0.013
Together-0.012
 cones-0.012
TokenL
Token ID43
Feature activation-2.08e+00

Pos logit contributions
ges+0.021
unk+0.021
 alone+0.020
apping+0.020
udge+0.019
Neg logit contributions
Together-0.013
mm-0.012
 flavors-0.010
First-0.010
 spend-0.010
Token she
Token ID673
Feature activation+1.54e+00

Pos logit contributions
 flavors+0.146
mm+0.134
 scar+0.133
 fries+0.129
 chips+0.126
Neg logit contributions
 alone-0.164
ges-0.162
apping-0.157
 War-0.157
umpy-0.153
Token knew
Token ID2993
Feature activation+9.00e-02

Pos logit contributions
 flavors+0.052
 scar+0.052
mm+0.052
 fare+0.049
 extract+0.048
Neg logit contributions
 alone-0.054
umpy-0.048
 War-0.048
ges-0.048
 by-0.047
Token the
Token ID262
Feature activation+1.71e+00

Pos logit contributions
 flavors+0.003
 scar+0.003
mm+0.003
 fries+0.003
 extract+0.003
Neg logit contributions
 alone-0.003
ges-0.003
 War-0.003
apping-0.003
umpy-0.002
Token word
Token ID1573
Feature activation+1.49e+00

Pos logit contributions
 flavors+0.060
 scar+0.056
mm+0.055
 fries+0.054
 chips+0.052
Neg logit contributions
 alone-0.060
ges-0.057
apping-0.056
 War-0.056
umpy-0.055
Token '
Token ID705
Feature activation-3.12e-01

Pos logit contributions
 flavors+0.052
 scar+0.051
mm+0.051
 fries+0.049
 french+0.048
Neg logit contributions
 alone-0.050
 War-0.046
ges-0.046
apping-0.045
 by-0.045
Tokenburn
Token ID10899
Feature activation+1.63e+01

Pos logit contributions
 alone+0.010
ges+0.009
apping+0.009
 War+0.009
umpy+0.009
Neg logit contributions
 flavors-0.012
 scar-0.011
mm-0.011
 fries-0.011
 french-0.011
Token'.
Token ID4458
Feature activation+2.56e+00

Pos logit contributions
 flavors+0.606
 scar+0.589
mm+0.566
 fries+0.544
 cones+0.534
Neg logit contributions
 alone-0.531
ges-0.516
apping-0.496
 War-0.493
umpy-0.490
Token Her
Token ID2332
Feature activation+1.54e+00

Pos logit contributions
 flavors+0.104
 scar+0.099
mm+0.098
 fries+0.095
 cones+0.093
Neg logit contributions
 alone-0.073
ges-0.069
apping-0.068
 War-0.067
umpy-0.066
Token friends
Token ID2460
Feature activation+1.42e+00

Pos logit contributions
 flavors+0.045
mm+0.042
 scar+0.042
 fries+0.040
 french+0.039
Neg logit contributions
 alone-0.061
apping-0.058
 War-0.058
ges-0.057
umpy-0.057
Token were
Token ID547
Feature activation+7.21e-01

Pos logit contributions
 flavors+0.044
mm+0.042
 scar+0.041
 extract+0.039
 chips+0.039
Neg logit contributions
 alone-0.057
ges-0.053
apping-0.053
 War-0.051
umpy-0.050
Token so
Token ID523
Feature activation+7.92e+00

Pos logit contributions
 flavors+0.027
 scar+0.026
mm+0.025
 fries+0.025
 cones+0.024
Neg logit contributions
 alone-0.023
ges-0.022
apping-0.021
 War-0.021
umpy-0.021
Token don
Token ID836
Feature activation-4.45e+00

Pos logit contributions
 alone+0.092
umpy+0.078
 by+0.078
ges+0.077
 War+0.075
Neg logit contributions
 flavors-0.087
mm-0.085
 french-0.078
 extract-0.078
rob-0.078
Token't
Token ID470
Feature activation-4.79e+00

Pos logit contributions
 alone+0.085
umpy+0.077
ges+0.077
apping+0.077
 War+0.074
Neg logit contributions
 flavors-0.228
 scar-0.222
 fries-0.213
mm-0.211
 french-0.207
Token get
Token ID651
Feature activation-3.35e-01

Pos logit contributions
 alone+0.178
umpy+0.158
ges+0.156
apping+0.152
 War+0.150
Neg logit contributions
 flavors-0.179
mm-0.174
 scar-0.164
 fries-0.159
 french-0.159
Token a
Token ID257
Feature activation+1.90e+00

Pos logit contributions
 alone+0.010
ges+0.009
apping+0.009
 War+0.009
umpy+0.009
Neg logit contributions
 flavors-0.013
 scar-0.013
mm-0.012
 fries-0.012
 cones-0.012
Token sun
Token ID4252
Feature activation-8.05e-01

Pos logit contributions
 flavors+0.061
mm+0.060
 scar+0.053
 cones+0.052
 extract+0.052
Neg logit contributions
 alone-0.071
 War-0.065
ges-0.065
apping-0.065
umpy-0.064
Tokenburn
Token ID10899
Feature activation+1.63e+01

Pos logit contributions
 alone+0.036
 by+0.035
 outside+0.034
ges+0.034
apping+0.033
Neg logit contributions
 flavors-0.022
mm-0.019
 fiber-0.018
 fries-0.018
 french-0.017
Token."
Token ID526
Feature activation+4.41e+00

Pos logit contributions
 flavors+0.630
 scar+0.587
mm+0.577
 french+0.558
 fries+0.556
Neg logit contributions
 alone-0.547
ges-0.522
umpy-0.484
apping-0.476
 by-0.475
Token Lily
Token ID20037
Feature activation-1.57e+00

Pos logit contributions
 flavors+0.165
mm+0.162
 scar+0.160
 fries+0.154
 extract+0.153
Neg logit contributions
 alone-0.133
ges-0.130
apping-0.126
umpy-0.126
 War-0.124
Token yawn
Token ID40908
Feature activation-1.35e+00

Pos logit contributions
 alone+0.058
apping+0.054
 by+0.052
 War+0.051
 nowhere+0.051
Neg logit contributions
 flavors-0.049
mm-0.048
 scar-0.045
 extract-0.045
 fare-0.045
Tokened
Token ID276
Feature activation-7.22e-01

Pos logit contributions
 alone+0.025
apping+0.023
ges+0.022
umpy+0.022
 War+0.020
Neg logit contributions
 flavors-0.071
 scar-0.067
mm-0.066
 fries-0.065
 french-0.065
Token and
Token ID290
Feature activation-2.23e-01

Pos logit contributions
 alone+0.017
ges+0.014
apping+0.014
 War+0.014
umpy+0.013
Neg logit contributions
 flavors-0.035
mm-0.033
 scar-0.032
 fries-0.032
 french-0.031
Token ice
Token ID4771
Feature activation+4.42e+00

Pos logit contributions
 alone+0.046
 crying+0.039
 by+0.038
 sad+0.037
apping+0.036
Neg logit contributions
rob-0.047
mm-0.046
 flavors-0.045
 scar-0.043
 fare-0.041
Token cream
Token ID8566
Feature activation+4.99e+00

Pos logit contributions
 scar+0.153
 Sale+0.151
 fries+0.151
 spend+0.149
mm+0.145
Neg logit contributions
 by-0.145
 alone-0.144
it-0.133
apping-0.127
 outside-0.125
Token taste
Token ID6938
Feature activation+1.76e+00

Pos logit contributions
rob+0.224
 scar+0.218
 fiber+0.214
mm+0.210
 Sale+0.210
Neg logit contributions
 by-0.121
 alone-0.115
 outside-0.100
apping-0.095
it-0.091
Token even
Token ID772
Feature activation+6.63e+00

Pos logit contributions
 scar+0.074
mm+0.073
rob+0.072
 flavors+0.072
 fare+0.070
Neg logit contributions
 alone-0.049
 by-0.046
umpy-0.040
 outside-0.039
ges-0.039
Token y
Token ID331
Feature activation+4.75e+00

Pos logit contributions
mm+0.234
 vows+0.215
tails+0.210
moon+0.210
 feast+0.209
Neg logit contributions
 alone-0.247
 by-0.230
 sad-0.192
 outside-0.191
umpy-0.191
Tokenumm
Token ID13929
Feature activation+1.63e+01

Pos logit contributions
 scar+0.197
 flavors+0.194
 fries+0.181
mm+0.181
 fare+0.176
Neg logit contributions
 alone-0.127
ges-0.123
apping-0.121
umpy-0.119
 War-0.110
Tokenier
Token ID959
Feature activation+8.02e+00

Pos logit contributions
 scar+0.691
 fries+0.612
 flavors+0.611
 spend+0.602
 brid+0.598
Neg logit contributions
ges-0.466
 alone-0.452
 by-0.450
apping-0.444
umpy-0.430
Token!
Token ID0
Feature activation-6.51e-01

Pos logit contributions
 scar+0.360
mm+0.351
 flavors+0.344
 fries+0.337
rob+0.333
Neg logit contributions
 alone-0.194
 by-0.179
ges-0.172
umpy-0.158
apping-0.155
Token And
Token ID843
Feature activation+4.19e-01

Pos logit contributions
 alone+0.019
ges+0.018
apping+0.017
 War+0.017
umpy+0.017
Neg logit contributions
 flavors-0.026
 scar-0.025
mm-0.025
 fries-0.024
 chips-0.024
Token every
Token ID790
Feature activation+6.25e+00

Pos logit contributions
 flavors+0.016
 scar+0.015
rob+0.015
 fries+0.015
mm+0.015
Neg logit contributions
 alone-0.013
 by-0.012
ges-0.012
apping-0.011
 crying-0.011
Token time
Token ID640
Feature activation-1.76e+00

Pos logit contributions
 flavors+0.281
mm+0.276
 scar+0.272
 fries+0.262
 cones+0.259
Neg logit contributions
 alone-0.155
ges-0.134
apping-0.132
 nowhere-0.131
umpy-0.128
Token to
Token ID284
Feature activation+2.53e+00

Pos logit contributions
mm+0.213
 flavors+0.206
rob+0.192
 scar+0.188
 french+0.188
Neg logit contributions
 alone-0.117
ges-0.108
 by-0.104
apping-0.097
 outside-0.095
Token make
Token ID787
Feature activation+5.05e+00

Pos logit contributions
mm+0.082
 flavors+0.078
rob+0.076
 cones+0.068
 scar+0.068
Neg logit contributions
 alone-0.094
ges-0.090
 by-0.085
 outside-0.084
udge-0.083
Token food
Token ID2057
Feature activation+1.63e+00

Pos logit contributions
mm+0.212
 flavors+0.210
rob+0.196
 cones+0.194
 scar+0.194
Neg logit contributions
 alone-0.129
ges-0.123
 by-0.119
 War-0.116
apping-0.113
Token look
Token ID804
Feature activation+3.50e+00

Pos logit contributions
mm+0.078
rob+0.077
 flavors+0.076
 french+0.070
 fries+0.069
Neg logit contributions
 alone-0.040
 by-0.037
 outside-0.034
apping-0.030
ges-0.030
Token y
Token ID331
Feature activation+5.01e+00

Pos logit contributions
mm+0.150
 flavors+0.147
rob+0.144
 cones+0.140
 extract+0.136
Neg logit contributions
 alone-0.100
 by-0.091
 outside-0.083
ges-0.081
umpy-0.078
Tokenumm
Token ID13929
Feature activation+1.62e+01

Pos logit contributions
 flavors+0.204
 scar+0.199
mm+0.191
 fries+0.187
 extract+0.182
Neg logit contributions
 alone-0.139
ges-0.134
apping-0.134
umpy-0.130
 War-0.125
Tokenier
Token ID959
Feature activation+7.95e+00

Pos logit contributions
 scar+0.682
 flavors+0.657
 fries+0.625
 spend+0.621
 brid+0.609
Neg logit contributions
ges-0.453
 alone-0.427
 by-0.425
apping-0.424
umpy-0.398
Token than
Token ID621
Feature activation+2.82e+00

Pos logit contributions
 flavors+0.361
mm+0.361
 scar+0.354
 fries+0.341
rob+0.339
Neg logit contributions
 alone-0.190
 by-0.174
ges-0.173
umpy-0.155
apping-0.154
Token before
Token ID878
Feature activation-1.98e+00

Pos logit contributions
 flavors+0.107
mm+0.101
 scar+0.101
 fries+0.096
 cones+0.094
Neg logit contributions
 alone-0.090
ges-0.085
 War-0.081
apping-0.081
umpy-0.079
Token.
Token ID13
Feature activation+5.98e-01

Pos logit contributions
 alone+0.051
ges+0.047
 crying+0.043
 by+0.043
apping+0.042
Neg logit contributions
 flavors-0.089
mm-0.085
 scar-0.085
rob-0.083
 fries-0.083
Token She
Token ID1375
Feature activation+4.15e+00

Pos logit contributions
 flavors+0.019
mm+0.018
 scar+0.017
 fries+0.017
 cones+0.016
Neg logit contributions
 alone-0.023
ges-0.022
apping-0.021
 War-0.021
umpy-0.021
Token bakery
Token ID38164
Feature activation+2.26e+00

Pos logit contributions
 flavors+0.001
 scar+0.001
mm+0.001
 fries+0.001
 cones+0.001
Neg logit contributions
 alone-0.001
ges-0.001
apping-0.001
 War-0.001
umpy-0.001
Token to
Token ID284
Feature activation+6.72e-01

Pos logit contributions
 scar+0.096
 flavors+0.096
mm+0.095
 fries+0.090
 french+0.088
Neg logit contributions
 alone-0.064
 by-0.055
apping-0.055
ges-0.052
umpy-0.050
Token buy
Token ID2822
Feature activation+1.97e+00

Pos logit contributions
 flavors+0.022
 scar+0.022
mm+0.022
 cones+0.020
 extract+0.020
Neg logit contributions
 alone-0.026
ges-0.022
 War-0.022
 by-0.021
 nowhere-0.021
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
 scar+0.077
mm+0.074
 extract+0.073
 spend+0.071
 flavors+0.071
Neg logit contributions
 alone-0.058
 by-0.055
 outside-0.048
ges-0.048
 War-0.047
Token y
Token ID331
Feature activation+4.25e+00

Pos logit contributions
mm+0.032
 flavors+0.031
 scar+0.031
 extract+0.029
 fries+0.028
Neg logit contributions
 alone-0.046
 by-0.040
umpy-0.040
 War-0.040
ges-0.040
Tokenumm
Token ID13929
Feature activation+1.62e+01

Pos logit contributions
 scar+0.182
 flavors+0.174
mm+0.165
rob+0.163
 fries+0.162
Neg logit contributions
 alone-0.107
ges-0.102
apping-0.102
umpy-0.098
 by-0.093
Tokeniest
Token ID6386
Feature activation+7.25e+00

Pos logit contributions
 scar+0.652
 flavors+0.586
 fries+0.572
 wrists+0.562
 french+0.554
Neg logit contributions
ges-0.494
 alone-0.489
 by-0.476
apping-0.456
umpy-0.447
Token pastry
Token ID41738
Feature activation+1.92e+00

Pos logit contributions
mm+0.178
 scar+0.173
 flavors+0.167
rob+0.163
 extract+0.152
Neg logit contributions
 alone-0.306
 by-0.282
ges-0.282
umpy-0.274
apping-0.271
Token he
Token ID339
Feature activation+2.47e-01

Pos logit contributions
 scar+0.088
mm+0.082
 flavors+0.080
 fiber+0.078
rob+0.078
Neg logit contributions
 alone-0.049
 by-0.045
 outside-0.040
ges-0.040
apping-0.037
Token could
Token ID714
Feature activation+2.34e+00

Pos logit contributions
mm+0.009
 scar+0.009
 flavors+0.009
 extract+0.008
rob+0.008
Neg logit contributions
 alone-0.009
umpy-0.007
 nowhere-0.007
 by-0.007
ges-0.007
Token find
Token ID1064
Feature activation-9.78e-01

Pos logit contributions
 scar+0.089
 flavors+0.082
rob+0.082
mm+0.079
 fries+0.076
Neg logit contributions
 alone-0.084
umpy-0.071
ges-0.071
 by-0.070
apping-0.069
Token and
Token ID290
Feature activation-2.99e-01

Pos logit contributions
 flavors+0.244
mm+0.237
 scar+0.234
 fries+0.228
 fiber+0.226
Neg logit contributions
 alone-0.120
 by-0.106
ges-0.100
apping-0.095
 outside-0.094
Token it
Token ID340
Feature activation-1.46e+00

Pos logit contributions
 alone+0.009
 crying+0.008
 by+0.008
 unconscious+0.008
 nowhere+0.007
Neg logit contributions
mm-0.012
 fare-0.011
 nineteen-0.010
rob-0.010
 feast-0.010
Token was
Token ID373
Feature activation-1.09e+00

Pos logit contributions
 alone+0.044
 by+0.038
 nowhere+0.035
 crying+0.035
 outside+0.034
Neg logit contributions
mm-0.062
 fare-0.058
 fries-0.056
 fiber-0.056
 flavors-0.056
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 alone+0.036
 nowhere+0.030
 by+0.030
 outside+0.029
 crying+0.029
Neg logit contributions
 flavors-0.042
mm-0.041
rob-0.041
 fries-0.039
 scar-0.039
Token y
Token ID331
Feature activation+4.88e+00

Pos logit contributions
 flavors+0.052
mm+0.051
 scar+0.049
 fries+0.046
 extract+0.045
Neg logit contributions
 alone-0.069
 War-0.062
 by-0.061
umpy-0.061
apping-0.060
Tokenumm
Token ID13929
Feature activation+1.59e+01

Pos logit contributions
 flavors+0.164
 scar+0.162
mm+0.149
 fries+0.148
 french+0.144
Neg logit contributions
 alone-0.171
ges-0.163
apping-0.163
umpy-0.161
 War-0.155
Tokeniest
Token ID6386
Feature activation+6.88e+00

Pos logit contributions
 scar+0.578
 flavors+0.557
 fries+0.529
mm+0.507
 french+0.506
Neg logit contributions
 alone-0.527
ges-0.521
apping-0.496
umpy-0.488
 by-0.485
Token ice
Token ID4771
Feature activation+5.01e+00

Pos logit contributions
 scar+0.194
mm+0.188
 flavors+0.184
 fries+0.174
rob+0.169
Neg logit contributions
 alone-0.270
ges-0.249
 by-0.248
apping-0.248
umpy-0.244
Token cream
Token ID8566
Feature activation+5.31e+00

Pos logit contributions
 scar+0.149
 flavors+0.134
 fries+0.133
mm+0.129
 spend+0.129
Neg logit contributions
 alone-0.200
 by-0.191
apping-0.183
it-0.182
ges-0.180
Token she
Token ID673
Feature activation+1.67e+00

Pos logit contributions
 scar+0.219
 flavors+0.208
mm+0.203
 fiber+0.197
 fries+0.195
Neg logit contributions
 alone-0.157
 by-0.144
ges-0.136
 outside-0.132
it-0.131
Token ever
Token ID1683
Feature activation+2.63e+00

Pos logit contributions
 flavors+0.053
 scar+0.051
mm+0.050
 cones+0.047
 chips+0.046
Neg logit contributions
 alone-0.065
umpy-0.059
ges-0.059
apping-0.057
 War-0.057
Token.
Token ID13
Feature activation-4.01e-02

Pos logit contributions
 flavors+0.183
mm+0.181
 scar+0.177
rob+0.167
 fries+0.166
Neg logit contributions
 alone-0.103
ges-0.093
apping-0.088
 by-0.086
umpy-0.084
Token It
Token ID632
Feature activation-1.11e+00

Pos logit contributions
 alone+0.001
 War+0.001
ges+0.001
apping+0.001
umpy+0.001
Neg logit contributions
 flavors-0.002
 scar-0.002
mm-0.002
 fries-0.002
 french-0.002
Token was
Token ID373
Feature activation-8.89e-01

Pos logit contributions
 alone+0.031
 by+0.028
 nowhere+0.026
haired+0.025
 crying+0.025
Neg logit contributions
 fare-0.046
mm-0.045
 scar-0.042
 extract-0.041
 passports-0.041
Token the
Token ID262
Feature activation+1.84e+00

Pos logit contributions
 alone+0.031
 nowhere+0.027
 by+0.027
ges+0.027
 War+0.027
Neg logit contributions
 flavors-0.030
mm-0.030
rob-0.029
 scar-0.028
 extract-0.028
Token y
Token ID331
Feature activation+4.71e+00

Pos logit contributions
 flavors+0.051
mm+0.051
 scar+0.047
 extract+0.045
 fries+0.045
Neg logit contributions
 alone-0.076
 War-0.068
 by-0.067
umpy-0.066
apping-0.066
Tokenumm
Token ID13929
Feature activation+1.54e+01

Pos logit contributions
 flavors+0.178
 scar+0.173
mm+0.164
 fries+0.162
 french+0.157
Neg logit contributions
 alone-0.147
ges-0.140
apping-0.139
umpy-0.136
 War-0.133
Tokeniest
Token ID6386
Feature activation+6.59e+00

Pos logit contributions
 scar+0.546
 flavors+0.523
 fries+0.494
mm+0.478
 french+0.470
Neg logit contributions
 alone-0.526
ges-0.526
apping-0.498
umpy-0.487
 by-0.484
Token thing
Token ID1517
Feature activation-5.67e+00

Pos logit contributions
 flavors+0.192
 scar+0.177
mm+0.176
 fries+0.168
 chips+0.163
Neg logit contributions
 alone-0.267
ges-0.256
apping-0.253
 War-0.252
umpy-0.249
Token he
Token ID339
Feature activation+6.61e-01

Pos logit contributions
 alone+0.197
ges+0.185
apping+0.182
 War+0.181
umpy+0.177
Neg logit contributions
 flavors-0.197
 scar-0.187
mm-0.185
 fries-0.177
 extract-0.172
Token ever
Token ID1683
Feature activation+2.40e+00

Pos logit contributions
 flavors+0.021
 scar+0.021
mm+0.020
 extract+0.019
 cones+0.019
Neg logit contributions
 alone-0.025
umpy-0.023
ges-0.023
 War-0.022
 nowhere-0.022
Token tasted
Token ID29187
Feature activation+2.39e+00

Pos logit contributions
 flavors+0.076
 scar+0.075
mm+0.074
 chips+0.069
 fries+0.068
Neg logit contributions
 alone-0.091
ges-0.086
umpy-0.082
 War-0.081
apping-0.080
Token magic
Token ID5536
Feature activation-5.22e-01

Pos logit contributions
 flavors+0.068
mm+0.064
 scar+0.064
 fries+0.061
 french+0.060
Neg logit contributions
 alone-0.071
 War-0.065
ges-0.065
apping-0.064
umpy-0.063
Token apple
Token ID17180
Feature activation+3.85e+00

Pos logit contributions
 alone+0.019
ges+0.017
 War+0.017
apping+0.017
umpy+0.017
Neg logit contributions
 flavors-0.017
 scar-0.017
mm-0.017
 fries-0.016
 french-0.016
Token was
Token ID373
Feature activation-8.24e-01

Pos logit contributions
 flavors+0.124
mm+0.123
 scar+0.121
 fries+0.116
 french+0.116
Neg logit contributions
 alone-0.144
ges-0.127
 War-0.127
 by-0.126
apping-0.125
Token the
Token ID262
Feature activation+2.00e+00

Pos logit contributions
 alone+0.027
 nowhere+0.024
 War+0.024
 by+0.024
apping+0.023
Neg logit contributions
mm-0.031
 flavors-0.031
 scar-0.030
 fries-0.029
rob-0.029
Token y
Token ID331
Feature activation+4.91e+00

Pos logit contributions
mm+0.053
 flavors+0.053
 fries+0.049
 scar+0.049
 extract+0.046
Neg logit contributions
 alone-0.087
 by-0.078
 War-0.078
apping-0.076
umpy-0.076
Tokenumm
Token ID13929
Feature activation+1.54e+01

Pos logit contributions
 flavors+0.159
 scar+0.152
mm+0.145
 fries+0.142
 french+0.137
Neg logit contributions
 alone-0.181
ges-0.174
apping-0.173
umpy-0.169
 War-0.168
Tokeniest
Token ID6386
Feature activation+6.68e+00

Pos logit contributions
 scar+0.566
 flavors+0.537
 fries+0.509
mm+0.494
 french+0.490
Neg logit contributions
ges-0.511
 alone-0.506
apping-0.482
 by-0.472
umpy-0.469
Token thing
Token ID1517
Feature activation-5.69e+00

Pos logit contributions
 flavors+0.253
 scar+0.251
mm+0.249
 fries+0.240
 french+0.234
Neg logit contributions
 alone-0.205
ges-0.184
apping-0.181
 by-0.177
umpy-0.176
Token they
Token ID484
Feature activation+2.83e+00

Pos logit contributions
 alone+0.212
ges+0.202
apping+0.200
 War+0.199
umpy+0.198
Neg logit contributions
 flavors-0.181
 scar-0.168
mm-0.166
 fries-0.160
 cones-0.157
Token ever
Token ID1683
Feature activation+2.58e+00

Pos logit contributions
 flavors+0.084
mm+0.078
 scar+0.078
 fries+0.073
 cones+0.072
Neg logit contributions
 alone-0.116
ges-0.109
 War-0.106
apping-0.106
umpy-0.105
Token tasted
Token ID29187
Feature activation+2.70e+00

Pos logit contributions
 flavors+0.086
mm+0.084
 scar+0.083
 chips+0.078
 fries+0.078
Neg logit contributions
 alone-0.094
ges-0.089
 War-0.085
umpy-0.085
apping-0.084
Token leaves
Token ID5667
Feature activation+3.49e+00

Pos logit contributions
 alone+0.099
apping+0.096
ges+0.095
umpy+0.095
 War+0.091
Neg logit contributions
 flavors-0.134
 scar-0.127
mm-0.122
 fries-0.121
 chips-0.119
Token to
Token ID284
Feature activation+2.15e+00

Pos logit contributions
 flavors+0.140
mm+0.132
 scar+0.131
 fries+0.127
 french+0.124
Neg logit contributions
 alone-0.105
ges-0.098
apping-0.096
 War-0.095
umpy-0.094
Token find
Token ID1064
Feature activation-1.11e+00

Pos logit contributions
 flavors+0.068
mm+0.066
 fries+0.063
 scar+0.062
 french+0.060
Neg logit contributions
 alone-0.088
ges-0.077
 by-0.075
 War-0.074
apping-0.073
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
 alone+0.037
ges+0.035
 War+0.034
apping+0.034
umpy+0.033
Neg logit contributions
 flavors-0.040
mm-0.038
 scar-0.038
 fries-0.037
 extract-0.036
Token y
Token ID331
Feature activation+4.80e+00

Pos logit contributions
mm+0.066
 flavors+0.066
 fries+0.064
 french+0.061
rob+0.060
Neg logit contributions
 alone-0.073
 by-0.064
umpy-0.063
 War-0.061
ges-0.061
Tokenumm
Token ID13929
Feature activation+1.54e+01

Pos logit contributions
 flavors+0.194
 scar+0.191
 fries+0.183
mm+0.183
 french+0.177
Neg logit contributions
 alone-0.134
ges-0.126
apping-0.125
umpy-0.124
 by-0.115
Tokeniest
Token ID6386
Feature activation+6.73e+00

Pos logit contributions
 scar+0.600
 flavors+0.553
 fries+0.545
 french+0.524
 wrists+0.522
Neg logit contributions
ges-0.482
 alone-0.481
 by-0.457
umpy-0.439
apping-0.433
Token one
Token ID530
Feature activation+2.72e+00

Pos logit contributions
rob+0.225
mm+0.223
 fries+0.223
 scar+0.222
 flavors+0.212
Neg logit contributions
 alone-0.241
 by-0.221
ges-0.209
umpy-0.204
 outside-0.202
Token!"
Token ID2474
Feature activation+4.27e+00

Pos logit contributions
 flavors+0.100
 scar+0.098
mm+0.095
 fries+0.092
 extract+0.090
Neg logit contributions
 alone-0.091
ges-0.083
apping-0.080
umpy-0.080
 War-0.080
Token Lily
Token ID20037
Feature activation-1.26e+00

Pos logit contributions
 flavors+0.209
 scar+0.195
mm+0.195
 french+0.189
 fries+0.189
Neg logit contributions
 alone-0.096
 War-0.083
ges-0.082
apping-0.082
 by-0.080
Token smiled
Token ID13541
Feature activation-3.66e+00

Pos logit contributions
 alone+0.046
apping+0.043
 by+0.042
 nowhere+0.041
ges+0.041
Neg logit contributions
mm-0.040
 flavors-0.040
 scar-0.038
 fare-0.038
 fries-0.037
Token her
Token ID607
Feature activation-5.22e-01

Pos logit contributions
 flavors+0.219
mm+0.209
 scar+0.203
 fries+0.200
 french+0.195
Neg logit contributions
 alone-0.137
ges-0.125
 War-0.117
apping-0.116
 by-0.114
Token hair
Token ID4190
Feature activation+4.51e+00

Pos logit contributions
 alone+0.020
ges+0.018
apping+0.018
 War+0.017
umpy+0.017
Neg logit contributions
 flavors-0.017
mm-0.016
 scar-0.015
 fries-0.015
 french-0.015
Token in
Token ID287
Feature activation-1.66e+00

Pos logit contributions
 flavors+0.217
mm+0.210
 fries+0.204
 french+0.200
 scar+0.199
Neg logit contributions
 alone-0.105
ges-0.090
 by-0.085
apping-0.085
umpy-0.080
Token a
Token ID257
Feature activation+2.26e+00

Pos logit contributions
 alone+0.045
ges+0.041
 War+0.039
apping+0.039
 nowhere+0.038
Neg logit contributions
 flavors-0.071
mm-0.069
 scar-0.067
 fries-0.066
 cones-0.064
Token pony
Token ID26902
Feature activation+3.08e+00

Pos logit contributions
 flavors+0.074
 scar+0.069
mm+0.067
 fries+0.065
 cones+0.064
Neg logit contributions
 alone-0.083
ges-0.080
apping-0.078
 War-0.078
umpy-0.077
Tokentail
Token ID13199
Feature activation+1.53e+01

Pos logit contributions
 flavors+0.123
mm+0.116
 scar+0.113
 fries+0.111
 french+0.108
Neg logit contributions
 alone-0.096
ges-0.087
apping-0.086
 War-0.083
umpy-0.082
Token so
Token ID523
Feature activation+8.19e+00

Pos logit contributions
 flavors+0.608
 scar+0.575
mm+0.572
 fries+0.553
 cones+0.541
Neg logit contributions
 alone-0.458
ges-0.429
apping-0.422
 War-0.419
umpy-0.412
Token she
Token ID673
Feature activation+1.94e+00

Pos logit contributions
 flavors+0.328
 scar+0.315
rob+0.313
mm+0.303
 fries+0.302
Neg logit contributions
 alone-0.255
apping-0.225
ges-0.222
 by-0.219
 War-0.209
Token could
Token ID714
Feature activation+2.34e+00

Pos logit contributions
 flavors+0.064
mm+0.064
rob+0.062
 cones+0.058
 extract+0.058
Neg logit contributions
 alone-0.070
 by-0.064
 nowhere-0.062
apping-0.062
it-0.061
Token look
Token ID804
Feature activation+3.09e+00

Pos logit contributions
 flavors+0.084
rob+0.083
mm+0.080
 scar+0.080
 fries+0.075
Neg logit contributions
 alone-0.082
 outside-0.072
ges-0.071
apping-0.071
 by-0.071
Token pretty
Token ID2495
Feature activation-5.19e-01

Pos logit contributions
 flavors+0.147
 scar+0.142
mm+0.138
 fries+0.137
 french+0.134
Neg logit contributions
 alone-0.064
ges-0.062
 War-0.061
apping-0.061
umpy-0.059
Token swing
Token ID9628
Feature activation-1.96e+00

Pos logit contributions
umpy+0.100
apping+0.100
 alone+0.099
 War+0.097
ges+0.097
Neg logit contributions
 flavors-0.087
 scar-0.084
mm-0.081
 chips-0.079
 cones-0.079
Token together
Token ID1978
Feature activation+8.23e-01

Pos logit contributions
 alone+0.056
ges+0.050
apping+0.049
 War+0.048
umpy+0.047
Neg logit contributions
 flavors-0.082
 scar-0.079
mm-0.078
 fries-0.077
 french-0.076
Token.
Token ID13
Feature activation-4.56e+00

Pos logit contributions
 flavors+0.034
mm+0.034
 scar+0.033
 fries+0.032
 french+0.032
Neg logit contributions
 alone-0.024
ges-0.021
apping-0.020
 by-0.020
 War-0.020
Token<|endoftext|>
Token ID50256
Feature activation-5.81e+00

Pos logit contributions
 alone+0.160
ges+0.152
apping+0.149
umpy+0.148
 War+0.146
Neg logit contributions
 flavors-0.152
mm-0.146
 scar-0.142
 fries-0.138
 cones-0.135
TokenOnce
Token ID7454
Feature activation-1.13e+00

Pos logit contributions
apping+0.226
uttering+0.222
 alone+0.219
 War+0.218
umpy+0.217
Neg logit contributions
 flavors-0.149
mm-0.145
 cones-0.140
rob-0.137
�-0.130
Token upon
Token ID2402
Feature activation-1.21e+01

Pos logit contributions
AR+0.028
 War+0.027
ges+0.024
udge+0.024
ucking+0.024
Neg logit contributions
 would-0.045
 scar-0.045
mm-0.044
 spend-0.042
 collection-0.042
Token a
Token ID257
Feature activation+9.09e-01

Pos logit contributions
ges+0.505
apping+0.498
umpy+0.483
 War+0.482
 alone+0.476
Neg logit contributions
 flavors-0.323
 scar-0.308
mm-0.291
 cones-0.276
 fries-0.272
Token time
Token ID640
Feature activation-2.59e+00

Pos logit contributions
 flavors+0.034
 scar+0.033
mm+0.033
 cones+0.031
 feast+0.031
Neg logit contributions
ges-0.028
 alone-0.027
 War-0.027
apping-0.025
umpy-0.024
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
 War+0.133
umpy+0.128
udge+0.126
 unconscious+0.124
ges+0.123
Neg logit contributions
mm-0.048
 scar-0.028
 feast-0.028
 cones-0.027
 Wil-0.025
Token there
Token ID612
Feature activation-3.86e+00

Pos logit contributions
 War+0.040
umpy+0.038
and+0.035
AR+0.034
udge+0.034
Neg logit contributions
 scar-0.041
mm-0.037
 would-0.037
 nineteen-0.037
 will-0.036
Token was
Token ID373
Feature activation-1.48e+00

Pos logit contributions
umpy+0.164
 alone+0.163
 War+0.162
apping+0.162
ges+0.158
Neg logit contributions
 flavors-0.098
mm-0.088
 would-0.083
 scar-0.078
 extract-0.077
Token with
Token ID351
Feature activation-8.37e+00

Pos logit contributions
 alone+0.064
ges+0.062
 by+0.061
 War+0.059
apping+0.059
Neg logit contributions
 flavors-0.061
mm-0.061
 scar-0.059
 french-0.057
 fries-0.057
Token snow
Token ID6729
Feature activation+1.30e+00

Pos logit contributions
 alone+0.258
ges+0.234
 War+0.228
apping+0.219
 nowhere+0.216
Neg logit contributions
 flavors-0.326
mm-0.317
 scar-0.308
 extract-0.300
 fries-0.298
Token.
Token ID13
Feature activation-4.45e+00

Pos logit contributions
 flavors+0.055
 french+0.052
 fries+0.052
 scar+0.051
mm+0.051
Neg logit contributions
 alone-0.037
 by-0.032
ges-0.032
apping-0.031
 outside-0.030
Token<|endoftext|>
Token ID50256
Feature activation-5.53e+00

Pos logit contributions
 alone+0.153
ges+0.145
apping+0.143
umpy+0.141
 War+0.140
Neg logit contributions
 flavors-0.151
mm-0.146
 scar-0.143
 fries-0.139
 cones-0.136
TokenOnce
Token ID7454
Feature activation-1.11e+00

Pos logit contributions
apping+0.212
uttering+0.207
 alone+0.206
 War+0.204
umpy+0.203
Neg logit contributions
 flavors-0.148
mm-0.143
 cones-0.139
rob-0.135
�-0.127
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
AR+0.026
 War+0.026
ges+0.023
udge+0.023
ucking+0.023
Neg logit contributions
 would-0.045
 scar-0.045
mm-0.044
 flavors-0.042
 spend-0.042
Token a
Token ID257
Feature activation+9.99e-01

Pos logit contributions
ges+0.492
apping+0.486
 War+0.470
umpy+0.470
 alone+0.467
Neg logit contributions
 flavors-0.337
 scar-0.321
mm-0.304
 cones-0.290
 fries-0.286
Token time
Token ID640
Feature activation-2.58e+00

Pos logit contributions
 flavors+0.040
 scar+0.039
mm+0.038
 cones+0.036
 fries+0.036
Neg logit contributions
 alone-0.029
ges-0.028
 War-0.028
apping-0.026
umpy-0.025
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 War+0.130
umpy+0.127
udge+0.124
 unconscious+0.122
ges+0.120
Neg logit contributions
mm-0.050
 scar-0.033
 cones-0.031
 feast-0.031
 flavors-0.030
Token there
Token ID612
Feature activation-3.94e+00

Pos logit contributions
 War+0.039
umpy+0.036
and+0.033
udge+0.033
AR+0.032
Neg logit contributions
 scar-0.041
mm-0.038
 would-0.036
 nineteen-0.036
 will-0.035
Token was
Token ID373
Feature activation-1.44e+00

Pos logit contributions
umpy+0.164
 alone+0.164
apping+0.163
 War+0.162
ges+0.159
Neg logit contributions
 flavors-0.104
mm-0.094
 would-0.086
 scar-0.085
 extract-0.083
Token to
Token ID284
Feature activation-4.90e+00

Pos logit contributions
 flavors+0.035
mm+0.032
 scar+0.032
 fries+0.030
 cones+0.030
Neg logit contributions
 alone-0.043
ges-0.041
 War-0.041
apping-0.040
umpy-0.040
Token her
Token ID607
Feature activation-5.44e+00

Pos logit contributions
 alone+0.185
ges+0.165
 War+0.163
apping+0.158
 nowhere+0.155
Neg logit contributions
 flavors-0.163
mm-0.159
 scar-0.152
 extract-0.144
 cones-0.144
Token.
Token ID13
Feature activation-4.57e+00

Pos logit contributions
 alone+0.192
 by+0.161
 crying+0.159
 nowhere+0.157
apping+0.156
Neg logit contributions
 flavors-0.198
mm-0.197
rob-0.183
 scar-0.181
 extract-0.179
Token<|endoftext|>
Token ID50256
Feature activation-5.66e+00

Pos logit contributions
 alone+0.145
ges+0.137
apping+0.134
 War+0.133
umpy+0.132
Neg logit contributions
 flavors-0.169
mm-0.161
 scar-0.159
 fries-0.154
 cones-0.151
TokenOnce
Token ID7454
Feature activation-1.14e+00

Pos logit contributions
 alone+0.203
apping+0.202
umpy+0.198
 War+0.197
uttering+0.192
Neg logit contributions
 flavors-0.169
mm-0.164
 cones-0.158
 scar-0.151
rob-0.149
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
AR+0.026
 War+0.025
ges+0.022
udge+0.022
umpy+0.021
Neg logit contributions
 would-0.048
 scar-0.047
mm-0.047
 flavors-0.045
 spend-0.045
Token a
Token ID257
Feature activation+8.80e-01

Pos logit contributions
ges+0.484
apping+0.479
 War+0.466
umpy+0.464
 alone+0.461
Neg logit contributions
 flavors-0.343
 scar-0.324
mm-0.307
 cones-0.294
 fries-0.292
Token time
Token ID640
Feature activation-2.56e+00

Pos logit contributions
 flavors+0.034
 scar+0.033
mm+0.033
 cones+0.031
 chips+0.031
Neg logit contributions
ges-0.026
 alone-0.026
 War-0.025
apping-0.024
umpy-0.023
Token there
Token ID612
Feature activation-3.98e+00

Pos logit contributions
 War+0.131
umpy+0.128
udge+0.125
 unconscious+0.122
 alone+0.120
Neg logit contributions
mm-0.047
 scar-0.030
 feast-0.029
 cones-0.028
 Wil-0.028
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
 alone+0.156
apping+0.155
umpy+0.154
 War+0.154
ges+0.151
Neg logit contributions
 flavors-0.115
mm-0.107
 scar-0.099
 extract-0.095
 cones-0.094
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
ges+0.061
 War+0.060
 alone+0.060
umpy+0.060
apping+0.059
Neg logit contributions
mm-0.043
 scar-0.042
 flavors-0.041
 breakfast-0.040
 chips-0.039
Token to
Token ID284
Feature activation-4.90e+00

Pos logit contributions
 alone+0.124
ges+0.112
 War+0.107
 by+0.106
 crying+0.104
Neg logit contributions
 flavors-0.172
mm-0.171
 scar-0.158
 extract-0.156
 fries-0.154
Token them
Token ID606
Feature activation-5.45e+00

Pos logit contributions
 alone+0.185
ges+0.165
 War+0.163
apping+0.158
 nowhere+0.155
Neg logit contributions
 flavors-0.163
mm-0.159
 scar-0.152
 extract-0.144
 cones-0.144
Token.
Token ID13
Feature activation-4.44e+00

Pos logit contributions
 alone+0.152
apping+0.128
ges+0.127
 by+0.125
 War+0.119
Neg logit contributions
 flavors-0.237
mm-0.235
 scar-0.229
 extract-0.220
 fries-0.219
Token<|endoftext|>
Token ID50256
Feature activation-5.76e+00

Pos logit contributions
 alone+0.153
ges+0.145
apping+0.142
umpy+0.140
 War+0.139
Neg logit contributions
 flavors-0.152
mm-0.145
 scar-0.141
 fries-0.138
 cones-0.135
TokenOnce
Token ID7454
Feature activation-1.17e+00

Pos logit contributions
 alone+0.214
apping+0.213
umpy+0.209
 War+0.207
uttering+0.206
Neg logit contributions
 flavors-0.162
mm-0.158
 cones-0.154
rob-0.145
 scar-0.141
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
AR+0.025
 War+0.024
ges+0.021
umpy+0.021
udge+0.021
Neg logit contributions
 would-0.050
 scar-0.050
mm-0.050
 flavors-0.049
 spend-0.047
Token a
Token ID257
Feature activation+8.34e-01

Pos logit contributions
ges+0.486
apping+0.482
umpy+0.467
 War+0.466
 alone+0.464
Neg logit contributions
 flavors-0.342
 scar-0.321
mm-0.305
 cones-0.293
 fries-0.291
Token time
Token ID640
Feature activation-2.57e+00

Pos logit contributions
 flavors+0.033
 scar+0.032
mm+0.032
 cones+0.030
 fries+0.030
Neg logit contributions
 alone-0.024
ges-0.024
 War-0.023
apping-0.022
umpy-0.021
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 War+0.132
umpy+0.130
udge+0.126
 unconscious+0.122
 alone+0.121
Neg logit contributions
mm-0.047
 scar-0.030
 cones-0.029
 feast-0.029
 Wil-0.029
Token there
Token ID612
Feature activation-3.96e+00

Pos logit contributions
 War+0.042
umpy+0.040
and+0.036
udge+0.036
AR+0.035
Neg logit contributions
 scar-0.045
mm-0.042
 would-0.040
 nineteen-0.040
 flavors-0.039
Token was
Token ID373
Feature activation-1.51e+00

Pos logit contributions
 alone+0.162
umpy+0.161
apping+0.160
 War+0.159
ges+0.156
Neg logit contributions
 flavors-0.110
mm-0.101
 scar-0.090
 extract-0.089
 would-0.089
Token and
Token ID290
Feature activation-6.65e+00

Pos logit contributions
 War+0.139
oon+0.132
AR+0.131
otion+0.130
ges+0.130
Neg logit contributions
 flavors-0.095
 scar-0.094
 spend-0.090
 cones-0.089
 roast-0.081
Token dad
Token ID9955
Feature activation-1.43e+00

Pos logit contributions
 alone+0.251
ges+0.214
 War+0.213
 by+0.207
 crying+0.206
Neg logit contributions
mm-0.226
 flavors-0.221
 extract-0.205
 scar-0.201
 fries-0.196
Token.
Token ID13
Feature activation-4.91e+00

Pos logit contributions
 alone+0.054
ges+0.049
 by+0.048
apping+0.048
 War+0.047
Neg logit contributions
mm-0.048
 flavors-0.046
 scar-0.046
 extract-0.044
rob-0.042
Token<|endoftext|>
Token ID50256
Feature activation-5.66e+00

Pos logit contributions
 alone+0.171
ges+0.162
apping+0.159
 War+0.157
umpy+0.157
Neg logit contributions
 flavors-0.167
mm-0.158
 scar-0.156
 fries-0.152
 cones-0.148
TokenOnce
Token ID7454
Feature activation-1.08e+00

Pos logit contributions
apping+0.216
 alone+0.214
uttering+0.211
umpy+0.210
 War+0.206
Neg logit contributions
 flavors-0.151
mm-0.146
 cones-0.144
rob-0.139
 feast-0.131
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
AR+0.024
 War+0.023
ges+0.020
 Fighting+0.020
umpy+0.020
Neg logit contributions
 would-0.045
 scar-0.045
mm-0.044
 spend-0.043
 flavors-0.043
Token a
Token ID257
Feature activation+9.70e-01

Pos logit contributions
ges+0.497
apping+0.491
umpy+0.476
 War+0.476
 alone+0.469
Neg logit contributions
 flavors-0.329
 scar-0.309
mm-0.289
 fries-0.279
 cones-0.278
Token time
Token ID640
Feature activation-2.64e+00

Pos logit contributions
 flavors+0.037
 scar+0.036
mm+0.036
 feast+0.033
 cones+0.033
Neg logit contributions
ges-0.030
 alone-0.029
 War-0.028
apping-0.027
umpy-0.026
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
 War+0.137
umpy+0.134
udge+0.131
 unconscious+0.129
ucking+0.125
Neg logit contributions
mm-0.048
 feast-0.030
 scar-0.029
 cones-0.028
 Wil-0.027
Token there
Token ID612
Feature activation-3.84e+00

Pos logit contributions
 War+0.040
umpy+0.038
udge+0.035
and+0.034
AR+0.034
Neg logit contributions
 scar-0.041
mm-0.039
 would-0.038
 nineteen-0.037
 will-0.036
Token was
Token ID373
Feature activation-1.47e+00

Pos logit contributions
umpy+0.160
 alone+0.159
apping+0.159
 War+0.158
ges+0.154
Neg logit contributions
 flavors-0.103
mm-0.094
 would-0.084
 scar-0.083
 extract-0.082
Token with
Token ID351
Feature activation-8.37e+00

Pos logit contributions
 alone+0.068
ges+0.066
apping+0.065
 War+0.065
umpy+0.064
Neg logit contributions
 flavors-0.065
 scar-0.061
mm-0.058
 fries-0.057
 cones-0.057
Token them
Token ID606
Feature activation-5.47e+00

Pos logit contributions
 alone+0.258
ges+0.234
 War+0.228
apping+0.219
 nowhere+0.216
Neg logit contributions
 flavors-0.326
mm-0.317
 scar-0.308
 extract-0.300
 fries-0.298
Token.
Token ID13
Feature activation-4.53e+00

Pos logit contributions
 alone+0.162
ges+0.138
apping+0.136
 War+0.133
 by+0.132
Neg logit contributions
 flavors-0.226
mm-0.223
 scar-0.215
 extract-0.213
 fries-0.208
Token<|endoftext|>
Token ID50256
Feature activation-5.76e+00

Pos logit contributions
 alone+0.159
ges+0.150
apping+0.148
umpy+0.146
 War+0.144
Neg logit contributions
 flavors-0.151
mm-0.145
 scar-0.141
 fries-0.138
 cones-0.135
TokenOnce
Token ID7454
Feature activation-1.14e+00

Pos logit contributions
apping+0.215
 alone+0.213
umpy+0.208
uttering+0.208
 War+0.207
Neg logit contributions
 flavors-0.161
mm-0.154
 cones-0.151
rob-0.145
 scar-0.137
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
AR+0.024
 War+0.024
ges+0.022
umpy+0.021
udge+0.021
Neg logit contributions
 scar-0.049
 would-0.048
mm-0.048
 flavors-0.047
 spend-0.046
Token a
Token ID257
Feature activation+7.99e-01

Pos logit contributions
ges+0.479
apping+0.474
 War+0.459
umpy+0.459
 alone+0.458
Neg logit contributions
 flavors-0.348
 scar-0.328
mm-0.311
 cones-0.298
 fries-0.296
Token time
Token ID640
Feature activation-2.61e+00

Pos logit contributions
 flavors+0.032
 scar+0.031
mm+0.031
 cones+0.029
 fries+0.029
Neg logit contributions
 alone-0.023
ges-0.023
 War-0.022
apping-0.021
umpy-0.020
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
 War+0.133
umpy+0.131
udge+0.127
 unconscious+0.125
 alone+0.122
Neg logit contributions
mm-0.049
 scar-0.031
 cones-0.030
 Wil-0.030
 feast-0.030
Token there
Token ID612
Feature activation-3.99e+00

Pos logit contributions
 War+0.041
umpy+0.039
and+0.036
udge+0.035
AR+0.035
Neg logit contributions
 scar-0.044
mm-0.042
 would-0.039
 nineteen-0.039
 flavors-0.039
Token was
Token ID373
Feature activation-1.52e+00

Pos logit contributions
 alone+0.163
umpy+0.162
apping+0.161
 War+0.161
ges+0.157
Neg logit contributions
 flavors-0.111
mm-0.101
 scar-0.091
 would-0.089
 extract-0.089
Token much
Token ID881
Feature activation-2.73e+00

Pos logit contributions
 flavors+0.169
 scar+0.161
mm+0.158
 fries+0.153
 cones+0.152
Neg logit contributions
 alone-0.129
apping-0.123
ges-0.122
umpy-0.121
 War-0.120
Token chalk
Token ID30860
Feature activation-5.43e+00

Pos logit contributions
 alone+0.096
 by+0.083
umpy+0.083
 nowhere+0.079
ges+0.078
Neg logit contributions
mm-0.097
 flavors-0.096
 scar-0.094
 extract-0.090
 fries-0.090
Token!
Token ID0
Feature activation-4.90e+00

Pos logit contributions
 alone+0.157
ges+0.136
 by+0.132
umpy+0.130
apping+0.126
Neg logit contributions
 scar-0.225
 flavors-0.221
mm-0.214
 fries-0.213
 french-0.208
Token<|endoftext|>
Token ID50256
Feature activation-5.58e+00

Pos logit contributions
 alone+0.150
ges+0.141
apping+0.139
 War+0.137
umpy+0.136
Neg logit contributions
 flavors-0.188
mm-0.179
 scar-0.176
 fries-0.171
 cones-0.167
TokenOnce
Token ID7454
Feature activation-8.80e-01

Pos logit contributions
apping+0.204
 alone+0.201
 War+0.200
uttering+0.199
umpy+0.197
Neg logit contributions
 flavors-0.158
mm-0.152
 cones-0.146
rob-0.140
 scar-0.136
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
 War+0.020
AR+0.020
ges+0.018
udge+0.017
 Fighting+0.017
Neg logit contributions
 would-0.036
 scar-0.036
mm-0.035
 spend-0.034
 flavors-0.034
Token a
Token ID257
Feature activation+6.36e-01

Pos logit contributions
ges+0.486
apping+0.481
 War+0.468
umpy+0.467
 alone+0.464
Neg logit contributions
 flavors-0.337
 scar-0.321
mm-0.300
 cones-0.288
 fries-0.286
Token time
Token ID640
Feature activation-2.47e+00

Pos logit contributions
 flavors+0.024
 scar+0.024
mm+0.023
 cones+0.022
 feast+0.022
Neg logit contributions
ges-0.019
 alone-0.019
 War-0.019
apping-0.018
umpy-0.017
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
 War+0.127
umpy+0.124
udge+0.122
 unconscious+0.119
haired+0.117
Neg logit contributions
mm-0.044
 Wil-0.027
 scar-0.026
 feast-0.025
 cones-0.024
Token there
Token ID612
Feature activation-3.82e+00

Pos logit contributions
 War+0.040
umpy+0.037
and+0.035
AR+0.034
udge+0.034
Neg logit contributions
 scar-0.040
mm-0.037
 would-0.036
 nineteen-0.036
 will-0.035
Token was
Token ID373
Feature activation-1.52e+00

Pos logit contributions
 alone+0.161
umpy+0.160
apping+0.159
 War+0.158
ges+0.155
Neg logit contributions
 flavors-0.101
mm-0.091
 would-0.083
 scar-0.080
 extract-0.079
Token with
Token ID351
Feature activation-6.87e+00

Pos logit contributions
 alone+0.118
ges+0.107
apping+0.107
umpy+0.103
 War+0.103
Neg logit contributions
 flavors-0.145
 scar-0.139
mm-0.138
 fries-0.133
 french-0.131
Token them
Token ID606
Feature activation-4.45e+00

Pos logit contributions
 alone+0.217
ges+0.202
 War+0.198
apping+0.194
umpy+0.189
Neg logit contributions
 flavors-0.260
mm-0.250
 scar-0.244
 fries-0.236
 extract-0.235
Token.
Token ID13
Feature activation-3.97e+00

Pos logit contributions
 alone+0.127
ges+0.108
apping+0.108
 War+0.105
 by+0.105
Neg logit contributions
 flavors-0.188
mm-0.185
 extract-0.179
 scar-0.178
 fries-0.173
Token<|endoftext|>
Token ID50256
Feature activation-5.36e+00

Pos logit contributions
 alone+0.139
ges+0.131
apping+0.129
umpy+0.129
 War+0.125
Neg logit contributions
 flavors-0.132
mm-0.127
 scar-0.123
 fries-0.121
 cones-0.119
TokenOnce
Token ID7454
Feature activation-8.67e-01

Pos logit contributions
apping+0.203
 alone+0.201
uttering+0.198
umpy+0.196
 War+0.195
Neg logit contributions
 flavors-0.146
mm-0.138
 cones-0.137
rob-0.132
�-0.122
Token upon
Token ID2402
Feature activation-1.20e+01

Pos logit contributions
AR+0.020
 War+0.019
ges+0.017
umpy+0.017
ivering+0.016
Neg logit contributions
mm-0.036
 would-0.036
 scar-0.035
 flavors-0.035
 spend-0.034
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
ges+0.490
apping+0.486
umpy+0.470
 War+0.468
 alone+0.466
Neg logit contributions
 flavors-0.336
 scar-0.317
mm-0.298
 cones-0.288
 fries-0.284
Token time
Token ID640
Feature activation-2.60e+00

Pos logit contributions
 flavors+0.042
 scar+0.041
mm+0.040
 cones+0.038
 chips+0.038
Neg logit contributions
ges-0.032
 alone-0.032
 War-0.031
apping-0.029
umpy-0.028
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 War+0.135
umpy+0.132
udge+0.130
 unconscious+0.126
ucking+0.124
Neg logit contributions
mm-0.047
 Wil-0.029
 cones-0.028
 scar-0.027
 feast-0.027
Token there
Token ID612
Feature activation-3.85e+00

Pos logit contributions
 War+0.039
umpy+0.037
and+0.034
udge+0.033
AR+0.032
Neg logit contributions
 scar-0.039
mm-0.037
 would-0.035
 nineteen-0.035
 ninety-0.034
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
 alone+0.160
umpy+0.160
apping+0.159
 War+0.157
ges+0.155
Neg logit contributions
 flavors-0.104
mm-0.095
 would-0.084
 extract-0.083
 scar-0.083
Token its
Token ID663
Feature activation-6.72e+00

Pos logit contributions
 alone+0.078
umpy+0.071
ges+0.069
 War+0.069
apping+0.069
Neg logit contributions
 flavors-0.082
 scar-0.077
mm-0.077
 fries-0.076
 chips-0.074
Token creatures
Token ID8109
Feature activation-7.58e+00

Pos logit contributions
 alone+0.239
ges+0.225
apping+0.221
 War+0.221
umpy+0.218
Neg logit contributions
 flavors-0.228
mm-0.213
 scar-0.213
 fries-0.204
 cones-0.198
Token.
Token ID13
Feature activation-4.18e+00

Pos logit contributions
 alone+0.226
ges+0.212
apping+0.208
 War+0.207
umpy+0.203
Neg logit contributions
 flavors-0.299
 scar-0.283
mm-0.281
 fries-0.272
 cones-0.266
Token<|endoftext|>
Token ID50256
Feature activation-5.55e+00

Pos logit contributions
 alone+0.138
ges+0.131
apping+0.128
umpy+0.127
 War+0.126
Neg logit contributions
 flavors-0.147
mm-0.141
 scar-0.137
 fries-0.134
 cones-0.131
TokenOnce
Token ID7454
Feature activation-1.05e+00

Pos logit contributions
apping+0.204
 alone+0.202
 War+0.199
umpy+0.198
uttering+0.197
Neg logit contributions
 flavors-0.157
mm-0.150
 cones-0.147
rob-0.140
 scar-0.136
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.025
 War+0.024
ges+0.022
udge+0.021
ucking+0.021
Neg logit contributions
 would-0.042
 scar-0.041
mm-0.041
 flavors-0.040
 spend-0.039
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
ges+0.479
apping+0.472
 War+0.459
umpy+0.458
 alone+0.456
Neg logit contributions
 flavors-0.345
 scar-0.327
mm-0.307
 cones-0.297
 fries-0.293
Token time
Token ID640
Feature activation-2.47e+00

Pos logit contributions
 flavors+0.039
 scar+0.038
mm+0.037
 feast+0.035
 cones+0.035
Neg logit contributions
ges-0.031
 alone-0.031
 War-0.030
apping-0.029
umpy-0.027
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 War+0.127
umpy+0.125
udge+0.123
 unconscious+0.119
ucking+0.117
Neg logit contributions
mm-0.044
 scar-0.026
 Wil-0.026
 feast-0.025
 cones-0.025
Token there
Token ID612
Feature activation-3.82e+00

Pos logit contributions
 War+0.037
umpy+0.034
and+0.031
udge+0.031
AR+0.030
Neg logit contributions
 scar-0.038
mm-0.036
 nineteen-0.034
 would-0.034
 flavors-0.033
Token was
Token ID373
Feature activation-1.52e+00

Pos logit contributions
umpy+0.161
apping+0.160
 alone+0.160
 War+0.158
ges+0.156
Neg logit contributions
 flavors-0.100
mm-0.090
 would-0.082
 scar-0.078
 extract-0.078
Token,
Token ID11
Feature activation-6.10e+00

Pos logit contributions
 alone+0.297
apping+0.264
ges+0.262
 War+0.261
umpy+0.260
Neg logit contributions
 flavors-0.264
mm-0.255
 scar-0.251
 extract-0.241
 cones-0.235
Token Lily
Token ID20037
Feature activation-4.08e+00

Pos logit contributions
 alone+0.219
ges+0.203
 War+0.202
apping+0.202
umpy+0.196
Neg logit contributions
 flavors-0.206
mm-0.193
 scar-0.191
 fries-0.181
 cones-0.180
Token."
Token ID526
Feature activation+5.96e-01

Pos logit contributions
 alone+0.139
 by+0.117
umpy+0.115
apping+0.112
ges+0.111
Neg logit contributions
 flavors-0.161
mm-0.157
 scar-0.151
 fries-0.147
 cones-0.146
Token<|endoftext|>
Token ID50256
Feature activation-5.38e+00

Pos logit contributions
 flavors+0.024
 scar+0.022
mm+0.022
 fries+0.022
 cones+0.021
Neg logit contributions
 alone-0.018
ges-0.017
apping-0.016
 War-0.016
umpy-0.016
TokenOnce
Token ID7454
Feature activation-1.05e+00

Pos logit contributions
apping+0.202
 alone+0.198
uttering+0.198
ges+0.194
umpy+0.193
Neg logit contributions
 flavors-0.150
mm-0.145
 cones-0.140
rob-0.136
 scar-0.132
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.021
 War+0.021
ges+0.018
umpy+0.018
udge+0.017
Neg logit contributions
 scar-0.047
mm-0.047
 flavors-0.046
 would-0.045
 spend-0.044
Token a
Token ID257
Feature activation+9.93e-01

Pos logit contributions
ges+0.493
apping+0.486
umpy+0.468
 War+0.466
 alone+0.462
Neg logit contributions
 flavors-0.333
 scar-0.314
mm-0.303
 cones-0.283
 fries-0.280
Token time
Token ID640
Feature activation-2.72e+00

Pos logit contributions
 flavors+0.041
 scar+0.039
mm+0.039
 fries+0.037
 cones+0.037
Neg logit contributions
 alone-0.028
ges-0.027
 War-0.026
apping-0.025
umpy-0.024
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
 War+0.137
umpy+0.135
udge+0.133
haired+0.128
ges+0.127
Neg logit contributions
mm-0.054
 scar-0.034
 feast-0.034
 cones-0.033
 flavors-0.032
Token there
Token ID612
Feature activation-3.79e+00

Pos logit contributions
 War+0.036
umpy+0.035
udge+0.032
ges+0.031
and+0.031
Neg logit contributions
 scar-0.038
mm-0.036
 would-0.035
 nineteen-0.034
 flavors-0.034
Token was
Token ID373
Feature activation-1.38e+00

Pos logit contributions
umpy+0.154
 alone+0.153
 War+0.152
apping+0.152
ges+0.149
Neg logit contributions
 flavors-0.107
mm-0.100
 scar-0.088
 extract-0.087
 would-0.087
Token to
Token ID284
Feature activation-4.90e+00

Pos logit contributions
 alone+0.113
apping+0.111
 War+0.109
umpy+0.107
ges+0.106
Neg logit contributions
 flavors-0.105
mm-0.097
 cones-0.096
 scar-0.092
rob-0.092
Token it
Token ID340
Feature activation-5.71e+00

Pos logit contributions
 alone+0.185
ges+0.165
 War+0.163
apping+0.158
 nowhere+0.155
Neg logit contributions
 flavors-0.163
mm-0.159
 scar-0.152
 extract-0.144
 cones-0.144
Token.
Token ID13
Feature activation-4.31e+00

Pos logit contributions
 alone+0.167
 by+0.136
apping+0.135
ges+0.131
umpy+0.128
Neg logit contributions
 flavors-0.247
mm-0.242
 scar-0.233
 fries-0.232
 extract-0.229
Token<|endoftext|>
Token ID50256
Feature activation-5.46e+00

Pos logit contributions
 alone+0.144
ges+0.137
apping+0.134
umpy+0.132
 War+0.132
Neg logit contributions
 flavors-0.152
mm-0.144
 scar-0.141
 fries-0.138
 cones-0.135
TokenOnce
Token ID7454
Feature activation-1.11e+00

Pos logit contributions
 alone+0.199
apping+0.198
umpy+0.193
 War+0.192
uttering+0.190
Neg logit contributions
 flavors-0.159
mm-0.154
 cones-0.149
 scar-0.140
rob-0.140
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.024
 War+0.023
ges+0.021
umpy+0.020
udge+0.020
Neg logit contributions
 would-0.048
 scar-0.047
mm-0.047
 flavors-0.046
 spend-0.045
Token a
Token ID257
Feature activation+9.43e-01

Pos logit contributions
ges+0.480
apping+0.475
 War+0.461
umpy+0.461
 alone+0.458
Neg logit contributions
 flavors-0.344
 scar-0.324
mm-0.306
 cones-0.295
 fries-0.293
Token time
Token ID640
Feature activation-2.49e+00

Pos logit contributions
 flavors+0.037
 scar+0.036
mm+0.036
 cones+0.033
 fries+0.033
Neg logit contributions
 alone-0.027
ges-0.027
 War-0.027
apping-0.025
umpy-0.024
Token there
Token ID612
Feature activation-3.96e+00

Pos logit contributions
 War+0.128
umpy+0.126
udge+0.123
 unconscious+0.120
 alone+0.118
Neg logit contributions
mm-0.044
 scar-0.028
 Wil-0.028
 cones-0.027
 feast-0.027
Token was
Token ID373
Feature activation-1.47e+00

Pos logit contributions
 alone+0.157
umpy+0.156
apping+0.156
 War+0.154
ges+0.151
Neg logit contributions
 flavors-0.113
mm-0.104
 scar-0.096
 extract-0.093
 would-0.092
Token a
Token ID257
Feature activation+1.48e+00

Pos logit contributions
ges+0.058
 War+0.058
umpy+0.058
 alone+0.058
apping+0.057
Neg logit contributions
mm-0.040
 scar-0.039
 flavors-0.038
 breakfast-0.037
 chips-0.036
Token his
Token ID465
Feature activation-5.51e+00

Pos logit contributions
 alone+0.258
ges+0.234
 War+0.228
apping+0.219
 nowhere+0.216
Neg logit contributions
 flavors-0.326
mm-0.317
 scar-0.308
 extract-0.300
 fries-0.298
Token toys
Token ID14958
Feature activation-3.38e+00

Pos logit contributions
 alone+0.203
ges+0.182
 War+0.182
apping+0.178
umpy+0.176
Neg logit contributions
 flavors-0.182
mm-0.175
 scar-0.167
 extract-0.163
 fries-0.162
Token.
Token ID13
Feature activation-3.64e+00

Pos logit contributions
 alone+0.099
ges+0.083
apping+0.081
umpy+0.080
 by+0.079
Neg logit contributions
 flavors-0.143
mm-0.139
 scar-0.136
 fries-0.132
 extract-0.131
Token<|endoftext|>
Token ID50256
Feature activation-5.05e+00

Pos logit contributions
 alone+0.127
ges+0.121
apping+0.119
umpy+0.117
 War+0.115
Neg logit contributions
 flavors-0.121
mm-0.116
 scar-0.113
 fries-0.111
 cones-0.109
TokenOnce
Token ID7454
Feature activation-8.83e-01

Pos logit contributions
apping+0.184
 alone+0.182
umpy+0.178
 War+0.178
uttering+0.176
Neg logit contributions
 flavors-0.146
mm-0.139
 cones-0.136
rob-0.130
 scar-0.128
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.021
 War+0.020
ges+0.018
 Fighting+0.018
udge+0.018
Neg logit contributions
 would-0.036
 scar-0.035
mm-0.035
 flavors-0.034
 spend-0.034
Token a
Token ID257
Feature activation+1.12e+00

Pos logit contributions
ges+0.485
apping+0.480
 War+0.466
umpy+0.464
 alone+0.459
Neg logit contributions
 flavors-0.335
 scar-0.317
mm-0.298
 cones-0.288
 fries-0.284
Token time
Token ID640
Feature activation-2.56e+00

Pos logit contributions
 flavors+0.044
 scar+0.043
mm+0.043
 cones+0.040
 fries+0.040
Neg logit contributions
 alone-0.032
ges-0.032
 War-0.031
apping-0.030
umpy-0.029
Token there
Token ID612
Feature activation-3.88e+00

Pos logit contributions
 War+0.129
umpy+0.125
udge+0.123
 unconscious+0.121
ges+0.119
Neg logit contributions
mm-0.050
 scar-0.032
 feast-0.031
 cones-0.031
 flavors-0.030
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
apping+0.154
 alone+0.154
umpy+0.154
 War+0.153
ges+0.149
Neg logit contributions
 flavors-0.109
mm-0.101
 scar-0.093
 would-0.090
 extract-0.090
Token a
Token ID257
Feature activation+1.53e+00

Pos logit contributions
ges+0.058
 War+0.058
umpy+0.058
 alone+0.057
apping+0.057
Neg logit contributions
mm-0.039
 scar-0.039
 flavors-0.037
 breakfast-0.037
 chips-0.035
Token the
Token ID262
Feature activation-4.51e+00

Pos logit contributions
 alone+0.075
ges+0.067
 War+0.065
 nowhere+0.062
apping+0.062
Neg logit contributions
 flavors-0.109
mm-0.106
 scar-0.103
 fries-0.100
 extract-0.099
Token rain
Token ID6290
Feature activation-5.65e+00

Pos logit contributions
 alone+0.153
ges+0.148
apping+0.145
 War+0.144
umpy+0.142
Neg logit contributions
 flavors-0.158
 scar-0.149
mm-0.146
 fries-0.142
 cones-0.139
Token."
Token ID526
Feature activation+2.77e-01

Pos logit contributions
 alone+0.178
apping+0.159
ges+0.158
umpy+0.158
 War+0.154
Neg logit contributions
 flavors-0.221
mm-0.214
 scar-0.210
 fries-0.208
 french-0.203
Token<|endoftext|>
Token ID50256
Feature activation-5.28e+00

Pos logit contributions
 flavors+0.011
mm+0.010
 scar+0.010
 fries+0.010
 cones+0.009
Neg logit contributions
 alone-0.009
ges-0.008
apping-0.008
 War-0.008
umpy-0.008
TokenOnce
Token ID7454
Feature activation-8.66e-01

Pos logit contributions
apping+0.197
 alone+0.195
uttering+0.195
 War+0.192
umpy+0.191
Neg logit contributions
 flavors-0.142
mm-0.137
 cones-0.132
rob-0.128
�-0.125
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.021
 War+0.019
ges+0.017
udge+0.017
ucking+0.017
Neg logit contributions
 would-0.035
 scar-0.034
mm-0.034
 flavors-0.033
 spend-0.033
Token a
Token ID257
Feature activation+8.12e-01

Pos logit contributions
ges+0.489
apping+0.482
umpy+0.469
 War+0.468
 alone+0.465
Neg logit contributions
 flavors-0.327
 scar-0.312
mm-0.293
 cones-0.281
 fries-0.278
Token time
Token ID640
Feature activation-2.56e+00

Pos logit contributions
 flavors+0.032
 scar+0.031
mm+0.031
 cones+0.029
 fries+0.029
Neg logit contributions
 alone-0.024
ges-0.024
 War-0.023
apping-0.022
umpy-0.021
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
 War+0.136
umpy+0.133
udge+0.131
 unconscious+0.127
ucking+0.126
Neg logit contributions
mm-0.041
 Wil-0.022
,"-0.020
 feast-0.020
 scar-0.020
Token there
Token ID612
Feature activation-3.89e+00

Pos logit contributions
 War+0.037
umpy+0.035
and+0.032
udge+0.032
 Fighting+0.031
Neg logit contributions
 scar-0.036
mm-0.033
 would-0.032
 nineteen-0.032
 Wil-0.031
Token was
Token ID373
Feature activation-1.40e+00

Pos logit contributions
umpy+0.170
apping+0.168
 alone+0.166
 War+0.166
ges+0.163
Neg logit contributions
 flavors-0.096
mm-0.088
 would-0.081
 scar-0.074
 extract-0.074
Token very
Token ID845
Feature activation-4.68e+00

Pos logit contributions
 flavors+0.151
mm+0.138
 scar+0.134
 spend+0.126
 chips+0.122
Neg logit contributions
 War-0.270
ges-0.268
AR-0.262
apping-0.261
 alone-0.261
Token much
Token ID881
Feature activation-8.04e-01

Pos logit contributions
 alone+0.126
ges+0.117
apping+0.115
 War+0.114
umpy+0.112
Neg logit contributions
 flavors-0.200
 scar-0.189
mm-0.188
 fries-0.183
 cones-0.179
Token.
Token ID13
Feature activation-4.25e+00

Pos logit contributions
 alone+0.026
 by+0.022
umpy+0.021
 outside+0.020
 nowhere+0.020
Neg logit contributions
mm-0.033
 flavors-0.031
 extract-0.031
 scar-0.030
rob-0.030
Token<|endoftext|>
Token ID50256
Feature activation-5.74e+00

Pos logit contributions
 alone+0.141
ges+0.134
apping+0.132
umpy+0.130
 War+0.129
Neg logit contributions
 flavors-0.148
mm-0.143
 scar-0.138
 fries-0.136
 cones-0.133
TokenOnce
Token ID7454
Feature activation-1.01e+00

Pos logit contributions
apping+0.210
 alone+0.206
 War+0.204
uttering+0.202
umpy+0.202
Neg logit contributions
 flavors-0.164
mm-0.158
 cones-0.154
rob-0.148
 scar-0.144
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.023
 War+0.023
ges+0.019
 Fighting+0.019
udge+0.019
Neg logit contributions
 scar-0.042
 would-0.042
mm-0.041
 spend-0.039
 flavors-0.039
Token a
Token ID257
Feature activation+9.00e-01

Pos logit contributions
ges+0.484
apping+0.478
 War+0.466
umpy+0.463
 alone+0.459
Neg logit contributions
 flavors-0.333
 scar-0.317
mm-0.301
 cones-0.286
 fries-0.285
Token time
Token ID640
Feature activation-2.41e+00

Pos logit contributions
 flavors+0.034
 scar+0.034
mm+0.033
 cones+0.031
 fries+0.031
Neg logit contributions
ges-0.027
 alone-0.027
 War-0.026
apping-0.025
umpy-0.024
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 War+0.121
umpy+0.116
udge+0.113
 unconscious+0.113
 alone+0.111
Neg logit contributions
mm-0.047
 scar-0.031
 feast-0.030
 cones-0.029
 flavors-0.028
Token there
Token ID612
Feature activation-3.92e+00

Pos logit contributions
 War+0.041
umpy+0.038
and+0.036
AR+0.035
udge+0.035
Neg logit contributions
 scar-0.045
mm-0.043
 nineteen-0.040
 would-0.040
 flavors-0.039
Token was
Token ID373
Feature activation-1.50e+00

Pos logit contributions
 alone+0.160
umpy+0.159
apping+0.159
 War+0.158
ges+0.155
Neg logit contributions
 flavors-0.108
mm-0.099
 scar-0.087
 would-0.087
 extract-0.086
TokenHe
Token ID1544
Feature activation-2.65e+00

Pos logit contributions
 alone+0.229
ges+0.229
unk+0.214
umpy+0.213
apping+0.210
Neg logit contributions
mm-0.192
 flavors-0.185
 scar-0.182
Together-0.179
 spend-0.169
Token said
Token ID531
Feature activation+5.64e-01

Pos logit contributions
 alone+0.100
 War+0.086
 by+0.085
 nowhere+0.085
ges+0.084
Neg logit contributions
mm-0.090
 flavors-0.085
 extract-0.081
 scar-0.076
 chips-0.075
Token,
Token ID11
Feature activation-3.41e+00

Pos logit contributions
 flavors+0.020
 scar+0.019
mm+0.019
 fries+0.018
 chips+0.018
Neg logit contributions
 alone-0.019
ges-0.018
apping-0.018
 War-0.018
umpy-0.018
Token "
Token ID366
Feature activation-4.63e+00

Pos logit contributions
 alone+0.102
umpy+0.097
ges+0.094
 War+0.093
apping+0.092
Neg logit contributions
 flavors-0.129
 scar-0.128
 fries-0.124
mm-0.121
 cones-0.119
TokenOnce
Token ID7454
Feature activation-8.99e-01

Pos logit contributions
 alone+0.156
apping+0.145
ges+0.144
 War+0.140
umpy+0.139
Neg logit contributions
 flavors-0.171
mm-0.160
 scar-0.156
 fries-0.150
 french-0.149
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
 alone+0.029
ges+0.028
 War+0.028
umpy+0.028
apping+0.028
Neg logit contributions
 flavors-0.032
mm-0.030
 scar-0.030
 cones-0.029
 fries-0.029
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
 alone+0.369
ges+0.349
apping+0.343
 War+0.341
umpy+0.336
Neg logit contributions
 flavors-0.456
 scar-0.430
mm-0.428
 fries-0.413
 cones-0.404
Token time
Token ID640
Feature activation-2.50e+00

Pos logit contributions
 flavors+0.049
mm+0.047
 scar+0.046
 fries+0.045
 cones+0.044
Neg logit contributions
 alone-0.024
ges-0.022
apping-0.022
 War-0.022
umpy-0.021
Token,
Token ID11
Feature activation-9.13e-01

Pos logit contributions
 alone+0.089
umpy+0.087
ges+0.086
 War+0.086
apping+0.085
Neg logit contributions
 flavors-0.079
 scar-0.074
mm-0.073
 fries-0.070
 cones-0.070
Token there
Token ID612
Feature activation-3.81e+00

Pos logit contributions
 alone+0.027
ges+0.026
apping+0.025
umpy+0.025
 War+0.025
Neg logit contributions
 flavors-0.036
 scar-0.035
mm-0.034
 fries-0.033
 french-0.032
Token was
Token ID373
Feature activation-1.41e+00

Pos logit contributions
 alone+0.132
ges+0.125
apping+0.123
 War+0.123
umpy+0.121
Neg logit contributions
 flavors-0.132
 scar-0.124
mm-0.123
 fries-0.118
 cones-0.116
Token it
Token ID340
Feature activation-7.49e+00

Pos logit contributions
 alone+0.105
ges+0.098
apping+0.097
 War+0.095
umpy+0.093
Neg logit contributions
 flavors-0.149
 scar-0.141
mm-0.141
 fries-0.136
 extract-0.133
Token forever
Token ID8097
Feature activation+2.67e+00

Pos logit contributions
 alone+0.266
ges+0.232
umpy+0.228
apping+0.227
 War+0.227
Neg logit contributions
 flavors-0.265
mm-0.258
 scar-0.257
 fries-0.245
 extract-0.240
Token.
Token ID13
Feature activation-3.99e+00

Pos logit contributions
 flavors+0.101
mm+0.099
 scar+0.099
 fries+0.096
 french+0.095
Neg logit contributions
 alone-0.089
ges-0.075
apping-0.073
 by-0.073
umpy-0.073
Token<|endoftext|>
Token ID50256
Feature activation-5.61e+00

Pos logit contributions
 alone+0.127
ges+0.121
apping+0.118
 War+0.117
umpy+0.116
Neg logit contributions
 flavors-0.146
mm-0.140
 scar-0.137
 fries-0.133
 cones-0.130
TokenOnce
Token ID7454
Feature activation-9.98e-01

Pos logit contributions
apping+0.198
 alone+0.197
 War+0.192
umpy+0.191
uttering+0.189
Neg logit contributions
 flavors-0.168
mm-0.163
 cones-0.157
 scar-0.151
rob-0.149
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.022
 War+0.022
ges+0.019
umpy+0.019
 Fighting+0.018
Neg logit contributions
 scar-0.042
 would-0.042
mm-0.041
 flavors-0.040
 spend-0.040
Token a
Token ID257
Feature activation+8.03e-01

Pos logit contributions
ges+0.473
apping+0.465
 War+0.455
umpy+0.454
 alone+0.453
Neg logit contributions
 flavors-0.345
 scar-0.331
mm-0.308
 fries-0.297
 cones-0.297
Token time
Token ID640
Feature activation-2.48e+00

Pos logit contributions
 flavors+0.030
 scar+0.030
mm+0.029
 feast+0.028
 cones+0.027
Neg logit contributions
ges-0.025
 alone-0.024
 War-0.024
apping-0.022
umpy-0.021
Token there
Token ID612
Feature activation-3.99e+00

Pos logit contributions
 War+0.129
umpy+0.126
udge+0.124
 unconscious+0.120
haired+0.118
Neg logit contributions
mm-0.044
 scar-0.028
 Wil-0.026
 feast-0.026
 cones-0.025
Token were
Token ID547
Feature activation-2.62e-01

Pos logit contributions
umpy+0.161
apping+0.160
 alone+0.160
 War+0.159
ges+0.155
Neg logit contributions
 flavors-0.109
mm-0.101
 scar-0.093
 would-0.092
 extract-0.089
Token two
Token ID734
Feature activation+3.80e+00

Pos logit contributions
 alone+0.010
ges+0.010
 War+0.010
apping+0.010
umpy+0.010
Neg logit contributions
 flavors-0.008
 scar-0.007
mm-0.007
 fries-0.007
 cones-0.007
Token bad
Token ID2089
Feature activation-9.41e-01

Pos logit contributions
 flavors+0.036
mm+0.034
 scar+0.032
 fries+0.032
 cones+0.032
Neg logit contributions
 alone-0.029
 War-0.027
ges-0.026
apping-0.026
umpy-0.025
Token ending
Token ID7464
Feature activation-1.76e+00

Pos logit contributions
 alone+0.034
 War+0.031
apping+0.031
umpy+0.030
ges+0.030
Neg logit contributions
 flavors-0.033
mm-0.031
 scar-0.030
 fries-0.029
 extract-0.029
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 alone+0.059
ges+0.058
 War+0.058
umpy+0.057
apping+0.056
Neg logit contributions
 flavors-0.060
 scar-0.056
mm-0.055
 cones-0.053
 fries-0.053
Token<|endoftext|>
Token ID50256
Feature activation-3.80e+00

Pos logit contributions
 alone+0.049
ges+0.046
apping+0.046
umpy+0.044
uttering+0.043
Neg logit contributions
 flavors-0.048
mm-0.048
 scar-0.045
 fries-0.045
 french-0.043
TokenOnce
Token ID7454
Feature activation-3.71e-01

Pos logit contributions
 alone+0.148
haired+0.145
 War+0.144
umpy+0.144
apping+0.142
Neg logit contributions
 flavors-0.098
 cones-0.098
rob-0.093
mm-0.093
 feast-0.083
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.007
 War+0.007
 alone+0.006
umpy+0.006
ges+0.006
Neg logit contributions
mm-0.017
 flavors-0.017
 scar-0.017
 would-0.017
 cones-0.017
Token a
Token ID257
Feature activation+1.20e+00

Pos logit contributions
apping+0.527
umpy+0.523
ges+0.518
unk+0.509
ubby+0.502
Neg logit contributions
 flavors-0.281
 scar-0.245
mm-0.240
 breakfast-0.238
 spend-0.233
Token time
Token ID640
Feature activation-3.47e+00

Pos logit contributions
 flavors+0.048
 scar+0.046
mm+0.045
 fries+0.043
 chips+0.043
Neg logit contributions
 alone-0.035
ges-0.034
 War-0.033
apping-0.032
umpy-0.031
Token,
Token ID11
Feature activation-1.73e+00

Pos logit contributions
 War+0.172
umpy+0.170
 unconscious+0.161
udge+0.160
 alone+0.160
Neg logit contributions
mm-0.069
 flavors-0.044
 cones-0.044
 feast-0.043
 scar-0.039
Token there
Token ID612
Feature activation-4.14e+00

Pos logit contributions
umpy+0.057
 War+0.056
AR+0.055
and+0.053
ucking+0.053
Neg logit contributions
 would-0.053
mm-0.052
 will-0.050
 spend-0.049
 scar-0.048
Token was
Token ID373
Feature activation-1.84e+00

Pos logit contributions
umpy+0.172
 alone+0.170
apping+0.169
 War+0.166
ges+0.163
Neg logit contributions
 flavors-0.116
mm-0.105
 extract-0.093
 would-0.093
 scar-0.090
Token any
Token ID597
Feature activation-1.42e+00

Pos logit contributions
 flavors+0.124
 scar+0.117
mm+0.116
 fries+0.114
 extract+0.113
Neg logit contributions
 alone-0.067
apping-0.057
ges-0.057
 by-0.054
umpy-0.054
Token yogurt
Token ID32132
Feature activation+1.50e-01

Pos logit contributions
 alone+0.046
ges+0.039
umpy+0.039
apping+0.038
 by+0.038
Neg logit contributions
mm-0.054
 flavors-0.053
 scar-0.052
 fries-0.051
 extract-0.050
Token.
Token ID13
Feature activation-4.52e+00

Pos logit contributions
 scar+0.006
 flavors+0.006
 fare+0.006
 fries+0.006
mm+0.006
Neg logit contributions
 alone-0.004
 by-0.004
 outside-0.004
it-0.004
ges-0.004
Token<|endoftext|>
Token ID50256
Feature activation-5.62e+00

Pos logit contributions
 alone+0.152
ges+0.144
apping+0.141
umpy+0.140
 War+0.139
Neg logit contributions
 flavors-0.157
mm-0.151
 scar-0.146
 fries-0.144
 cones-0.140
TokenOnce
Token ID7454
Feature activation-8.24e-01

Pos logit contributions
apping+0.210
 alone+0.208
uttering+0.207
 War+0.206
umpy+0.203
Neg logit contributions
 flavors-0.155
mm-0.149
 cones-0.144
rob-0.138
 feast-0.132
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.019
 War+0.019
ges+0.017
udge+0.016
 Fighting+0.016
Neg logit contributions
 scar-0.034
 would-0.033
mm-0.033
 flavors-0.032
 spend-0.032
Token a
Token ID257
Feature activation+1.01e+00

Pos logit contributions
ges+0.476
apping+0.470
 War+0.458
umpy+0.457
 alone+0.456
Neg logit contributions
 flavors-0.340
 scar-0.322
mm-0.307
 cones-0.293
 fries-0.292
Token time
Token ID640
Feature activation-2.42e+00

Pos logit contributions
 flavors+0.040
 scar+0.039
mm+0.039
 fries+0.036
 cones+0.036
Neg logit contributions
 alone-0.029
ges-0.028
 War-0.028
apping-0.027
umpy-0.025
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
 War+0.125
umpy+0.122
udge+0.120
 unconscious+0.117
haired+0.115
Neg logit contributions
mm-0.044
 scar-0.026
 cones-0.026
 Wil-0.026
 feast-0.025
Token there
Token ID612
Feature activation-3.84e+00

Pos logit contributions
 War+0.042
umpy+0.040
and+0.036
udge+0.035
AR+0.035
Neg logit contributions
 scar-0.042
mm-0.039
 would-0.038
 nineteen-0.038
 will-0.037
Token was
Token ID373
Feature activation-1.49e+00

Pos logit contributions
 alone+0.160
umpy+0.159
apping+0.158
 War+0.157
ges+0.154
Neg logit contributions
 flavors-0.104
mm-0.094
 would-0.084
 scar-0.083
 extract-0.082
Token toy
Token ID13373
Feature activation-4.13e+00

Pos logit contributions
 alone+0.279
ges+0.253
 War+0.251
apping+0.250
umpy+0.246
Neg logit contributions
 flavors-0.245
mm-0.235
 scar-0.226
 fries-0.218
 extract-0.216
Token box
Token ID3091
Feature activation-1.64e+00

Pos logit contributions
 alone+0.129
ges+0.115
 War+0.114
apping+0.112
umpy+0.108
Neg logit contributions
 flavors-0.161
mm-0.155
 scar-0.153
 fries-0.146
 extract-0.145
Token.
Token ID13
Feature activation-3.77e+00

Pos logit contributions
 alone+0.048
ges+0.043
apping+0.042
umpy+0.041
 War+0.041
Neg logit contributions
 flavors-0.068
mm-0.065
 scar-0.065
 fries-0.062
 extract-0.061
Token<|endoftext|>
Token ID50256
Feature activation-5.06e+00

Pos logit contributions
 alone+0.130
ges+0.124
apping+0.122
umpy+0.121
 War+0.117
Neg logit contributions
 flavors-0.125
mm-0.121
 scar-0.117
 fries-0.115
 cones-0.114
TokenOnce
Token ID7454
Feature activation-6.59e-01

Pos logit contributions
apping+0.190
uttering+0.187
 alone+0.186
umpy+0.183
 War+0.183
Neg logit contributions
 flavors-0.139
mm-0.130
 cones-0.130
rob-0.123
�-0.117
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.016
 War+0.016
udge+0.014
 Fighting+0.014
ges+0.014
Neg logit contributions
 would-0.026
 scar-0.025
mm-0.025
 spend-0.024
 collection-0.024
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
ges+0.495
apping+0.491
umpy+0.474
 War+0.472
 alone+0.466
Neg logit contributions
 flavors-0.321
 scar-0.304
mm-0.282
 cones-0.275
 fries-0.269
Token time
Token ID640
Feature activation-2.50e+00

Pos logit contributions
 flavors+0.047
 scar+0.046
mm+0.045
 cones+0.042
 feast+0.042
Neg logit contributions
ges-0.039
 alone-0.038
 War-0.038
apping-0.036
 Fighting-0.034
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 War+0.129
umpy+0.126
udge+0.124
 unconscious+0.122
ges+0.119
Neg logit contributions
mm-0.045
 feast-0.027
 scar-0.026
 cones-0.026
 Wil-0.026
Token there
Token ID612
Feature activation-3.69e+00

Pos logit contributions
 War+0.032
umpy+0.031
and+0.028
udge+0.027
AR+0.027
Neg logit contributions
 scar-0.033
mm-0.030
 would-0.030
 nineteen-0.030
 will-0.029
Token was
Token ID373
Feature activation-1.45e+00

Pos logit contributions
umpy+0.156
 alone+0.156
apping+0.156
 War+0.154
ges+0.151
Neg logit contributions
 flavors-0.094
mm-0.085
 would-0.079
 extract-0.074
 scar-0.073
Token friends
Token ID2460
Feature activation-3.23e+00

Pos logit contributions
 flavors+0.057
 scar+0.055
mm+0.053
 fries+0.051
 chips+0.050
Neg logit contributions
 alone-0.061
ges-0.060
apping-0.059
umpy-0.059
 War-0.058
Token forever
Token ID8097
Feature activation+2.55e+00

Pos logit contributions
 alone+0.103
apping+0.090
ges+0.088
 by+0.086
 War+0.085
Neg logit contributions
 flavors-0.131
mm-0.126
 scar-0.123
 extract-0.122
 fries-0.118
Token.
Token ID13
Feature activation-3.93e+00

Pos logit contributions
 flavors+0.108
mm+0.104
 scar+0.104
 fries+0.102
 extract+0.102
Neg logit contributions
 alone-0.078
apping-0.065
ges-0.064
 by-0.063
umpy-0.062
Token<|endoftext|>
Token ID50256
Feature activation-5.44e+00

Pos logit contributions
 alone+0.132
ges+0.125
apping+0.123
umpy+0.121
 War+0.121
Neg logit contributions
 flavors-0.137
mm-0.132
 scar-0.127
 fries-0.124
 cones-0.121
TokenOnce
Token ID7454
Feature activation-8.24e-01

Pos logit contributions
apping+0.204
 alone+0.199
umpy+0.197
 War+0.196
uttering+0.195
Neg logit contributions
 flavors-0.149
mm-0.148
 cones-0.139
rob-0.137
 feast-0.131
Token upon
Token ID2402
Feature activation-1.19e+01

Pos logit contributions
AR+0.020
 War+0.019
ges+0.017
udge+0.017
 Fighting+0.017
Neg logit contributions
 would-0.033
 scar-0.033
mm-0.032
 spend-0.031
 collection-0.031
Token a
Token ID257
Feature activation+9.70e-01

Pos logit contributions
ges+0.495
apping+0.486
umpy+0.474
 War+0.474
 alone+0.464
Neg logit contributions
 flavors-0.317
 scar-0.305
mm-0.284
 fries-0.270
 cones-0.270
Token time
Token ID640
Feature activation-2.43e+00

Pos logit contributions
 flavors+0.036
 scar+0.035
mm+0.035
 feast+0.033
 cones+0.032
Neg logit contributions
ges-0.031
 alone-0.030
 War-0.029
apping-0.028
 Fighting-0.027
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 War+0.126
umpy+0.123
udge+0.121
 unconscious+0.118
haired+0.116
Neg logit contributions
mm-0.043
 scar-0.026
 feast-0.025
 Wil-0.024
 cones-0.023
Token a
Token ID257
Feature activation+1.39e+00

Pos logit contributions
 War+0.040
umpy+0.037
udge+0.033
 Fighting+0.033
and+0.033
Neg logit contributions
 scar-0.039
mm-0.036
 would-0.036
 nineteen-0.035
 will-0.034
Token little
Token ID1310
Feature activation-6.16e+00

Pos logit contributions
 scar+0.050
mm+0.047
 flavors+0.047
 fries+0.046
 cones+0.045
Neg logit contributions
ges-0.044
 alone-0.043
 Fighting-0.041
 War-0.041
 unconscious-0.041
Token he
Token ID339
Feature activation+1.53e-01

Pos logit contributions
 flavors+0.057
mm+0.052
 scar+0.051
 fries+0.049
 cones+0.049
Neg logit contributions
apping-0.075
ges-0.075
umpy-0.075
 alone-0.075
 War-0.073
Token wanted
Token ID2227
Feature activation-2.21e+00

Pos logit contributions
 flavors+0.003
 would+0.003
 scar+0.003
mm+0.003
 spend+0.002
Neg logit contributions
apping-0.007
ges-0.007
uttering-0.007
 War-0.007
ucking-0.007
Token some
Token ID617
Feature activation+5.70e+00

Pos logit contributions
 alone+0.071
ges+0.066
apping+0.065
 War+0.065
umpy+0.064
Neg logit contributions
 flavors-0.083
 scar-0.079
mm-0.078
 fries-0.075
 cones-0.074
Token ice
Token ID4771
Feature activation+4.70e+00

Pos logit contributions
 flavors+0.152
 scar+0.139
mm+0.130
 fries+0.129
 cones+0.129
Neg logit contributions
 alone-0.243
ges-0.243
apping-0.238
umpy-0.237
 War-0.237
Token-
Token ID12
Feature activation-3.53e+00

Pos logit contributions
mm+0.144
 scar+0.144
 flavors+0.142
 fries+0.140
 spend+0.138
Neg logit contributions
 alone-0.187
 by-0.171
 outside-0.159
ges-0.155
umpy-0.154
Tokencream
Token ID36277
Feature activation+1.40e+01

Pos logit contributions
 alone+0.205
ges+0.199
 War+0.195
apping+0.195
umpy+0.194
Neg logit contributions
 flavors-0.039
 scar-0.033
mm-0.032
 fries-0.027
 french-0.024
Token but
Token ID475
Feature activation+6.21e-01

Pos logit contributions
 scar+0.608
 flavors+0.603
mm+0.575
 spend+0.558
 fiber+0.556
Neg logit contributions
 alone-0.397
 by-0.352
ges-0.313
 outside-0.311
apping-0.311
Token his
Token ID465
Feature activation-8.53e-01

Pos logit contributions
 flavors+0.022
mm+0.021
 scar+0.021
 fries+0.020
 french+0.020
Neg logit contributions
 alone-0.021
ges-0.019
 War-0.019
apping-0.019
umpy-0.019
Token mom
Token ID1995
Feature activation+2.08e+00

Pos logit contributions
 alone+0.032
 crying+0.028
 nowhere+0.028
 by+0.027
unk+0.026
Neg logit contributions
 flavors-0.028
mm-0.027
rob-0.027
 scar-0.025
 extract-0.025
Token said
Token ID531
Feature activation+4.25e+00

Pos logit contributions
mm+0.065
 flavors+0.064
 scar+0.060
 cones+0.059
rob+0.059
Neg logit contributions
 alone-0.082
 by-0.077
 crying-0.075
 nowhere-0.072
ges-0.071
Token no
Token ID645
Feature activation+4.99e+00

Pos logit contributions
 flavors+0.132
mm+0.123
 scar+0.122
 fries+0.119
 chips+0.116
Neg logit contributions
 alone-0.161
ges-0.155
umpy-0.154
apping-0.154
 War-0.152
Token Alex
Token ID4422
Feature activation-3.37e-01

Pos logit contributions
 flavors+0.105
mm+0.100
 scar+0.098
 fries+0.095
 cones+0.093
Neg logit contributions
 alone-0.069
ges-0.066
 War-0.062
apping-0.062
umpy-0.059
Token for
Token ID329
Feature activation+3.89e+00

Pos logit contributions
 War+0.011
umpy+0.011
 alone+0.011
apping+0.011
ges+0.010
Neg logit contributions
 flavors-0.011
 scar-0.011
 cones-0.011
 chips-0.010
 french-0.010
Token some
Token ID617
Feature activation+6.07e+00

Pos logit contributions
 flavors+0.169
mm+0.164
 scar+0.155
 fries+0.154
 extract+0.153
Neg logit contributions
 alone-0.104
ges-0.094
 War-0.092
 nowhere-0.091
apping-0.087
Token ice
Token ID4771
Feature activation+4.61e+00

Pos logit contributions
 flavors+0.198
 scar+0.184
mm+0.183
 fries+0.176
 cones+0.171
Neg logit contributions
 alone-0.226
ges-0.215
apping-0.212
 War-0.211
umpy-0.208
Token-
Token ID12
Feature activation-4.33e+00

Pos logit contributions
 fries+0.168
 flavors+0.166
mm+0.166
 scar+0.164
 french+0.164
Neg logit contributions
 alone-0.149
 by-0.142
ges-0.133
and-0.130
 outside-0.129
Tokencream
Token ID36277
Feature activation+1.33e+01

Pos logit contributions
 alone+0.233
ges+0.230
 War+0.222
apping+0.220
umpy+0.220
Neg logit contributions
 flavors-0.063
 scar-0.059
mm-0.055
 fries-0.050
 french-0.045
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
 flavors+0.577
 scar+0.565
mm+0.548
 fries+0.533
 spend+0.518
Neg logit contributions
 alone-0.365
ges-0.325
 by-0.322
apping-0.316
umpy-0.300
Token He
Token ID679
Feature activation+1.71e+00

Pos logit contributions
 alone+0.010
ges+0.009
umpy+0.009
 War+0.009
apping+0.009
Neg logit contributions
 flavors-0.009
mm-0.009
 scar-0.009
 fries-0.009
 cones-0.008
Token smiled
Token ID13541
Feature activation-3.95e+00

Pos logit contributions
mm+0.063
 fare+0.058
rob+0.055
 extract+0.055
 nineteen+0.054
Neg logit contributions
 alone-0.054
 by-0.050
 nowhere-0.048
haired-0.047
 crying-0.046
Token and
Token ID290
Feature activation-4.47e-01

Pos logit contributions
 alone+0.099
apping+0.090
ges+0.089
 War+0.086
umpy+0.085
Neg logit contributions
 flavors-0.179
mm-0.171
 scar-0.169
 fries-0.165
 chips-0.162
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
 by+0.014
 alone+0.014
 crying+0.013
 outside+0.013
ges+0.012
Neg logit contributions
 fare-0.017
mm-0.015
 flavors-0.015
 fiber-0.014
 destination-0.014
Token she
Token ID673
Feature activation+8.53e-01

Pos logit contributions
 flavors+0.060
 scar+0.055
mm+0.055
 extract+0.053
 cones+0.052
Neg logit contributions
 alone-0.052
ges-0.046
 nowhere-0.046
 War-0.045
 by-0.045
Token could
Token ID714
Feature activation+1.66e+00

Pos logit contributions
rob+0.037
 scar+0.037
 flavors+0.036
mm+0.036
 extract+0.035
Neg logit contributions
 alone-0.022
 nowhere-0.019
 by-0.018
umpy-0.018
ges-0.018
Token have
Token ID423
Feature activation-1.14e-01

Pos logit contributions
 flavors+0.058
 scar+0.055
mm+0.054
rob+0.052
 fries+0.051
Neg logit contributions
 alone-0.061
ges-0.056
apping-0.056
 War-0.054
 by-0.054
Token the
Token ID262
Feature activation+8.46e-01

Pos logit contributions
 alone+0.003
 nowhere+0.003
 by+0.002
 outside+0.002
 crying+0.002
Neg logit contributions
mm-0.005
 flavors-0.005
comb-0.005
 french-0.005
 extract-0.005
Token ash
Token ID12530
Feature activation-3.74e+00

Pos logit contributions
 flavors+0.027
mm+0.025
 scar+0.025
 fries+0.024
 french+0.023
Neg logit contributions
 alone-0.032
ges-0.030
 War-0.029
apping-0.029
umpy-0.029
Tokentr
Token ID2213
Feature activation+1.45e+01

Pos logit contributions
 alone+0.141
 by+0.126
ges+0.124
apping+0.121
 outside+0.120
Neg logit contributions
 flavors-0.127
mm-0.120
 scar-0.116
 fries-0.116
 french-0.114
Tokenay
Token ID323
Feature activation+3.25e+00

Pos logit contributions
 flavors+0.672
 scar+0.638
 fries+0.634
 french+0.615
mm+0.608
Neg logit contributions
 alone-0.349
apping-0.322
ges-0.319
umpy-0.307
 War-0.301
Token to
Token ID284
Feature activation+1.89e+00

Pos logit contributions
 flavors+0.124
 scar+0.116
mm+0.114
 fries+0.113
 french+0.110
Neg logit contributions
 alone-0.104
ges-0.098
 War-0.095
apping-0.095
umpy-0.092
Token put
Token ID1234
Feature activation+4.88e+00

Pos logit contributions
 flavors+0.060
 scar+0.054
mm+0.054
 cones+0.052
rob+0.052
Neg logit contributions
 alone-0.075
ges-0.068
 War-0.065
AR-0.065
 by-0.065
Token her
Token ID607
Feature activation-1.02e+00

Pos logit contributions
 flavors+0.199
 scar+0.189
mm+0.187
 fries+0.181
 cones+0.178
Neg logit contributions
 alone-0.139
ges-0.131
apping-0.130
 War-0.129
umpy-0.127
Token rocks
Token ID12586
Feature activation+3.58e+00

Pos logit contributions
 alone+0.033
ges+0.029
 War+0.028
apping+0.028
umpy+0.028
Neg logit contributions
 flavors-0.039
mm-0.038
 scar-0.036
 fries-0.036
 french-0.035
Token mom
Token ID1995
Feature activation+1.91e+00

Pos logit contributions
 alone+0.030
apping+0.027
ges+0.027
 by+0.027
 War+0.027
Neg logit contributions
 flavors-0.025
mm-0.024
rob-0.023
 scar-0.022
 extract-0.022
Token for
Token ID329
Feature activation+4.25e+00

Pos logit contributions
 flavors+0.079
mm+0.075
 scar+0.074
 fries+0.072
 french+0.070
Neg logit contributions
 alone-0.055
ges-0.051
apping-0.049
 by-0.048
 War-0.048
Token some
Token ID617
Feature activation+6.22e+00

Pos logit contributions
 flavors+0.156
mm+0.147
 scar+0.145
 fries+0.138
 extract+0.138
Neg logit contributions
 alone-0.140
ges-0.130
 War-0.127
apping-0.126
 nowhere-0.123
Token ice
Token ID4771
Feature activation+4.81e+00

Pos logit contributions
 flavors+0.169
 scar+0.155
mm+0.155
 fries+0.146
 cones+0.141
Neg logit contributions
 alone-0.264
ges-0.252
apping-0.249
 War-0.248
umpy-0.245
Token-
Token ID12
Feature activation-3.77e+00

Pos logit contributions
 scar+0.162
 spend+0.159
 flavors+0.155
mm+0.154
 feast+0.150
Neg logit contributions
 by-0.156
 alone-0.155
apping-0.145
ges-0.144
and-0.142
Tokencream
Token ID36277
Feature activation+1.40e+01

Pos logit contributions
 alone+0.201
ges+0.200
 War+0.194
apping+0.192
umpy+0.191
Neg logit contributions
 flavors-0.053
 scar-0.049
mm-0.045
 fries-0.040
 feast-0.038
Token after
Token ID706
Feature activation-5.98e-01

Pos logit contributions
 scar+0.577
 flavors+0.577
mm+0.540
 fries+0.523
 spend+0.521
Neg logit contributions
 alone-0.394
 by-0.367
ges-0.353
apping-0.343
it-0.325
Token her
Token ID607
Feature activation-7.63e-01

Pos logit contributions
 alone+0.020
ges+0.019
apping+0.018
 War+0.018
 nowhere+0.018
Neg logit contributions
 flavors-0.022
 scar-0.021
mm-0.020
 fries-0.019
 french-0.019
Token bike
Token ID7161
Feature activation+3.07e+00

Pos logit contributions
 alone+0.027
 by+0.024
 crying+0.024
apping+0.024
ges+0.023
Neg logit contributions
rob-0.026
 scar-0.024
 flavors-0.024
mm-0.024
 extract-0.022
Token rides
Token ID17445
Feature activation+3.49e+00

Pos logit contributions
 flavors+0.140
mm+0.136
 scar+0.136
 fries+0.133
 french+0.132
Neg logit contributions
 alone-0.082
 by-0.071
apping-0.061
ges-0.061
 outside-0.060
Token in
Token ID287
Feature activation-1.63e+00

Pos logit contributions
 flavors+0.147
mm+0.141
 scar+0.140
 fries+0.136
 french+0.133
Neg logit contributions
 alone-0.100
ges-0.090
apping-0.088
 War-0.086
umpy-0.085
Token store
Token ID3650
Feature activation-6.43e-01

Pos logit contributions
 flavors+0.029
 scar+0.027
mm+0.026
 fries+0.026
 cones+0.026
Neg logit contributions
 alone-0.029
ges-0.029
apping-0.028
 War-0.028
umpy-0.027
Token and
Token ID290
Feature activation-8.75e-01

Pos logit contributions
 alone+0.018
ges+0.017
 War+0.017
umpy+0.016
apping+0.016
Neg logit contributions
 flavors-0.027
 scar-0.025
mm-0.025
 cones-0.024
 fries-0.024
Token got
Token ID1392
Feature activation-6.29e-01

Pos logit contributions
 alone+0.035
ges+0.033
apping+0.033
 War+0.033
umpy+0.032
Neg logit contributions
 flavors-0.026
mm-0.025
 scar-0.024
 fries-0.023
 chips-0.022
Token the
Token ID262
Feature activation+8.49e-01

Pos logit contributions
 alone+0.020
ges+0.018
 War+0.018
apping+0.017
 by+0.017
Neg logit contributions
 flavors-0.024
mm-0.024
 scar-0.023
 fries-0.022
 extract-0.022
Token y
Token ID331
Feature activation+3.81e+00

Pos logit contributions
mm+0.024
 flavors+0.023
 scar+0.023
 fries+0.020
rob+0.020
Neg logit contributions
 alone-0.034
 by-0.031
ges-0.031
umpy-0.030
 War-0.030
Tokenumm
Token ID13929
Feature activation+1.34e+01

Pos logit contributions
 flavors+0.154
 scar+0.154
mm+0.145
 fries+0.143
rob+0.140
Neg logit contributions
 alone-0.104
apping-0.100
ges-0.098
umpy-0.096
 by-0.089
Tokeniest
Token ID6386
Feature activation+5.33e+00

Pos logit contributions
 scar+0.497
 flavors+0.480
 fries+0.452
mm+0.447
 french+0.437
Neg logit contributions
ges-0.434
 alone-0.432
apping-0.407
 by-0.405
umpy-0.393
Token ice
Token ID4771
Feature activation+4.14e+00

Pos logit contributions
mm+0.134
 scar+0.133
 flavors+0.131
 fries+0.118
 extract+0.117
Neg logit contributions
 alone-0.229
ges-0.210
apping-0.208
 by-0.206
umpy-0.206
Token cream
Token ID8566
Feature activation+4.17e+00

Pos logit contributions
 scar+0.125
mm+0.110
 flavors+0.109
 spend+0.108
 fries+0.104
Neg logit contributions
 alone-0.162
 by-0.156
ges-0.151
apping-0.149
it-0.147
Token they
Token ID484
Feature activation+2.11e+00

Pos logit contributions
 scar+0.168
mm+0.162
 flavors+0.160
 fiber+0.149
rob+0.147
Neg logit contributions
 alone-0.123
 by-0.115
ges-0.114
 outside-0.107
apping-0.106
Token could
Token ID714
Feature activation+1.34e+00

Pos logit contributions
 flavors+0.070
mm+0.069
 scar+0.068
 extract+0.063
 cones+0.062
Neg logit contributions
 alone-0.081
ges-0.073
 nowhere-0.071
 War-0.070
umpy-0.070
Token little
Token ID1310
Feature activation-5.37e+00

Pos logit contributions
 scar+0.092
 flavors+0.092
 fries+0.087
mm+0.087
 chips+0.084
Neg logit contributions
ges-0.062
 alone-0.060
 War-0.057
apping-0.056
umpy-0.055
Token girl
Token ID2576
Feature activation-2.60e+00

Pos logit contributions
 alone+0.170
ges+0.165
 War+0.161
apping+0.160
umpy+0.156
Neg logit contributions
 flavors-0.199
 scar-0.193
mm-0.186
 fries-0.182
 cones-0.180
Token named
Token ID3706
Feature activation-6.63e-01

Pos logit contributions
 War+0.085
 alone+0.083
umpy+0.083
ges+0.082
apping+0.081
Neg logit contributions
 flavors-0.088
 scar-0.086
mm-0.082
 fries-0.079
 french-0.078
Token Lily
Token ID20037
Feature activation-1.21e+00

Pos logit contributions
 War+0.021
apping+0.020
 alone+0.020
umpy+0.020
ges+0.020
Neg logit contributions
mm-0.023
 scar-0.023
 flavors-0.023
 chips-0.021
 spend-0.021
Token.
Token ID13
Feature activation+4.21e-01

Pos logit contributions
apping+0.042
 War+0.042
 alone+0.042
ges+0.041
umpy+0.041
Neg logit contributions
 flavors-0.039
 scar-0.036
mm-0.034
 cones-0.033
 fries-0.033
Token She
Token ID1375
Feature activation+3.88e+00

Pos logit contributions
mm+0.013
 flavors+0.013
 scar+0.013
 extract+0.012
 spend+0.012
Neg logit contributions
 alone-0.014
unk-0.014
 War-0.014
apping-0.014
ges-0.014
Token loved
Token ID6151
Feature activation-3.46e+00

Pos logit contributions
 flavors+0.127
 scar+0.120
mm+0.118
 fries+0.112
 chips+0.109
Neg logit contributions
ges-0.137
apping-0.137
 alone-0.134
 War-0.131
umpy-0.128
Token to
Token ID284
Feature activation+2.29e+00

Pos logit contributions
 alone+0.098
ges+0.089
 War+0.084
 by+0.084
 crying+0.083
Neg logit contributions
 flavors-0.141
mm-0.140
 scar-0.130
 extract-0.128
 fries-0.127
Token play
Token ID711
Feature activation-1.03e+00

Pos logit contributions
 flavors+0.095
mm+0.093
rob+0.093
 fare+0.089
 extract+0.088
Neg logit contributions
 alone-0.059
 outside-0.056
ges-0.050
 crying-0.050
 basketball-0.048
Token outside
Token ID2354
Feature activation-4.19e+00

Pos logit contributions
umpy+0.024
ges+0.023
 War+0.022
apping+0.021
 alone+0.020
Neg logit contributions
 scar-0.047
 flavors-0.046
 chips-0.046
mm-0.045
 fries-0.044
Token on
Token ID319
Feature activation+4.50e+00

Pos logit contributions
 War+0.093
umpy+0.089
ges+0.087
apping+0.085
 alone+0.084
Neg logit contributions
 flavors-0.192
 scar-0.184
mm-0.181
 cones-0.179
 chips-0.177
Token friendship
Token ID14738
Feature activation+1.27e+00

Pos logit contributions
 flavors+0.095
 scar+0.092
mm+0.090
 fries+0.086
 extract+0.085
Neg logit contributions
 alone-0.099
 War-0.089
ges-0.089
apping-0.085
 nowhere-0.085
Token can
Token ID460
Feature activation-1.05e-01

Pos logit contributions
 flavors+0.046
 scar+0.046
mm+0.046
 fries+0.045
 french+0.044
Neg logit contributions
 alone-0.042
ges-0.037
 by-0.037
 War-0.036
apping-0.036
Token be
Token ID307
Feature activation+2.73e+00

Pos logit contributions
 alone+0.004
 by+0.003
umpy+0.003
 crying+0.003
 War+0.003
Neg logit contributions
 flavors-0.004
 scar-0.004
 fries-0.004
 french-0.004
 cones-0.003
Token found
Token ID1043
Feature activation-9.77e-01

Pos logit contributions
 flavors+0.103
mm+0.100
 scar+0.097
 fries+0.095
 cones+0.094
Neg logit contributions
 alone-0.093
 War-0.080
ges-0.080
umpy-0.080
apping-0.079
Token in
Token ID287
Feature activation-1.23e+00

Pos logit contributions
 alone+0.028
ges+0.027
apping+0.026
 War+0.026
umpy+0.025
Neg logit contributions
 flavors-0.039
 scar-0.037
mm-0.037
 fries-0.036
 cones-0.035
Token many
Token ID867
Feature activation+6.12e+00

Pos logit contributions
 alone+0.038
ges+0.035
 War+0.034
apping+0.034
umpy+0.033
Neg logit contributions
 flavors-0.048
mm-0.046
 scar-0.046
 fries-0.044
 cones-0.043
Token places
Token ID4113
Feature activation+2.02e+00

Pos logit contributions
 flavors+0.217
 scar+0.202
mm+0.199
 fries+0.192
 cones+0.189
Neg logit contributions
 alone-0.206
ges-0.200
apping-0.197
 War-0.195
umpy-0.191
Token.
Token ID13
Feature activation+3.08e-01

Pos logit contributions
 flavors+0.085
 scar+0.082
mm+0.081
 fries+0.078
 extract+0.077
Neg logit contributions
 alone-0.060
ges-0.052
apping-0.051
umpy-0.049
 by-0.049
Token<|endoftext|>
Token ID50256
Feature activation-2.48e+00

Pos logit contributions
 flavors+0.013
 scar+0.012
mm+0.012
 fries+0.012
 chips+0.012
Neg logit contributions
 alone-0.009
ges-0.008
 War-0.008
apping-0.008
umpy-0.008
TokenOnce
Token ID7454
Feature activation+1.28e+00

Pos logit contributions
apping+0.082
 alone+0.082
 War+0.080
ges+0.079
umpy+0.078
Neg logit contributions
 flavors-0.083
mm-0.078
 cones-0.075
 scar-0.073
rob-0.071
Token upon
Token ID2402
Feature activation-9.93e+00

Pos logit contributions
 would+0.056
mm+0.055
 flavors+0.055
 scar+0.055
 spend+0.053
Neg logit contributions
AR-0.027
 War-0.026
ges-0.023
udge-0.022
umpy-0.022
Token little
Token ID1310
Feature activation-5.56e+00

Pos logit contributions
 flavors+0.093
 scar+0.089
mm+0.087
 fries+0.085
 chips+0.085
Neg logit contributions
ges-0.050
 alone-0.050
apping-0.048
 War-0.047
umpy-0.045
Token girl
Token ID2576
Feature activation-3.01e+00

Pos logit contributions
 alone+0.155
ges+0.142
apping+0.140
 War+0.137
umpy+0.136
Neg logit contributions
 flavors-0.234
mm-0.224
 scar-0.221
 fries-0.217
 french-0.213
Token felt
Token ID2936
Feature activation+3.95e+00

Pos logit contributions
 alone+0.096
 by+0.086
apping+0.085
ges+0.081
haired+0.081
Neg logit contributions
mm-0.113
 flavors-0.112
 scar-0.106
 fries-0.106
rob-0.105
Token so
Token ID523
Feature activation+8.02e+00

Pos logit contributions
 flavors+0.102
 scar+0.094
mm+0.092
 fries+0.088
 french+0.085
Neg logit contributions
ges-0.169
apping-0.164
 War-0.159
AR-0.159
 alone-0.158
Token much
Token ID881
Feature activation+3.69e+00

Pos logit contributions
 flavors+0.300
mm+0.293
 scar+0.275
 fries+0.264
 spend+0.262
Neg logit contributions
ges-0.247
 War-0.234
 alone-0.229
apping-0.229
AR-0.226
Token better
Token ID1365
Feature activation+4.20e+00

Pos logit contributions
 flavors+0.105
mm+0.095
 scar+0.093
 fries+0.092
 french+0.090
Neg logit contributions
 alone-0.157
 War-0.145
apping-0.143
umpy-0.142
ges-0.141
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
 flavors+0.168
 scar+0.159
mm+0.158
 fries+0.153
 chips+0.149
Neg logit contributions
 alone-0.126
ges-0.118
apping-0.116
 War-0.114
umpy-0.112
Token She
Token ID1375
Feature activation+3.61e+00

Pos logit contributions
 flavors+0.005
 scar+0.005
mm+0.005
 fries+0.004
 cones+0.004
Neg logit contributions
 alone-0.003
ges-0.003
apping-0.003
 War-0.003
umpy-0.003
Token still
Token ID991
Feature activation-1.96e+00

Pos logit contributions
mm+0.136
 flavors+0.131
 fare+0.129
 extract+0.126
rob+0.125
Neg logit contributions
 alone-0.117
 nowhere-0.106
 by-0.105
 crying-0.099
 outside-0.098
Token had
Token ID550
Feature activation+6.03e-01

Pos logit contributions
 alone+0.066
 War+0.056
umpy+0.056
ges+0.056
 crying+0.055
Neg logit contributions
 flavors-0.074
mm-0.074
 scar-0.070
 chips-0.070
 fries-0.069
Token her
Token ID607
Feature activation-1.07e+00

Pos logit contributions
mm+0.028
rob+0.027
 extract+0.026
 flavors+0.026
 fries+0.025
Neg logit contributions
 alone-0.016
 nowhere-0.014
 by-0.013
 crying-0.012
 War-0.012
Token\n
Token ID198
Feature activation-4.69e-02

Pos logit contributions
mm+0.109
 flavors+0.108
 cones+0.101
 scar+0.097
 chips+0.094
Neg logit contributions
 alone-0.130
ges-0.124
 War-0.123
umpy-0.120
apping-0.120
Token\n
Token ID198
Feature activation-5.13e-02

Pos logit contributions
 alone+0.001
ges+0.001
apping+0.001
 War+0.001
umpy+0.001
Neg logit contributions
 flavors-0.002
 scar-0.002
mm-0.002
 fries-0.002
 cones-0.002
TokenThe
Token ID464
Feature activation+2.24e+00

Pos logit contributions
 alone+0.002
ges+0.002
apping+0.001
 War+0.001
umpy+0.001
Neg logit contributions
 flavors-0.002
 scar-0.002
mm-0.002
 fries-0.002
 french-0.002
Token dove
Token ID38862
Feature activation-3.66e+00

Pos logit contributions
 scar+0.085
 flavors+0.084
mm+0.079
 chips+0.077
 cones+0.076
Neg logit contributions
ges-0.073
apping-0.070
 alone-0.069
umpy-0.065
 War-0.065
Token was
Token ID373
Feature activation-7.33e-01

Pos logit contributions
 alone+0.135
ges+0.126
apping+0.124
 War+0.123
umpy+0.122
Neg logit contributions
 flavors-0.122
mm-0.114
 scar-0.113
 fries-0.108
 french-0.106
Token so
Token ID523
Feature activation+8.35e+00

Pos logit contributions
ges+0.023
 War+0.022
apping+0.022
umpy+0.021
 alone+0.021
Neg logit contributions
 scar-0.026
 flavors-0.026
 fries-0.024
mm-0.024
 cones-0.023
Token grateful
Token ID14066
Feature activation+1.75e+00

Pos logit contributions
 flavors+0.328
 scar+0.321
mm+0.317
 fries+0.302
 cones+0.300
Neg logit contributions
ges-0.222
 War-0.208
apping-0.204
udge-0.203
AR-0.201
Token for
Token ID329
Feature activation+4.49e+00

Pos logit contributions
 flavors+0.071
mm+0.067
 scar+0.066
 fries+0.064
 extract+0.063
Neg logit contributions
 alone-0.053
ges-0.049
apping-0.048
 War-0.047
umpy-0.047
Token Grace
Token ID16156
Feature activation+3.71e+00

Pos logit contributions
 flavors+0.193
 scar+0.180
mm+0.179
 fries+0.175
 cones+0.172
Neg logit contributions
 alone-0.126
ges-0.116
 War-0.112
apping-0.111
umpy-0.105
Token's
Token ID338
Feature activation-1.86e+00

Pos logit contributions
 flavors+0.154
 scar+0.146
mm+0.145
 fries+0.141
 cones+0.138
Neg logit contributions
 alone-0.103
ges-0.097
apping-0.095
 War-0.095
umpy-0.093
Token help
Token ID1037
Feature activation+3.54e+00

Pos logit contributions
 alone+0.067
 War+0.062
apping+0.061
ges+0.061
umpy+0.060
Neg logit contributions
 flavors-0.063
mm-0.060
 scar-0.058
 fries-0.056
 extract-0.055
Token medicine
Token ID9007
Feature activation+2.37e+00

Pos logit contributions
 flavors+0.165
mm+0.153
 scar+0.149
 fries+0.141
 extract+0.139
Neg logit contributions
 alone-0.267
ges-0.252
 War-0.248
apping-0.248
umpy-0.247
Token to
Token ID284
Feature activation+1.92e+00

Pos logit contributions
 flavors+0.107
mm+0.104
 scar+0.100
 fries+0.097
 extract+0.097
Neg logit contributions
 alone-0.063
ges-0.054
apping-0.052
 by-0.052
umpy-0.051
Token make
Token ID787
Feature activation+4.65e+00

Pos logit contributions
 flavors+0.062
 cones+0.055
rob+0.055
mm+0.054
 scar+0.053
Neg logit contributions
 alone-0.077
 by-0.067
 crying-0.066
ges-0.066
 War-0.063
Token her
Token ID607
Feature activation-1.13e+00

Pos logit contributions
 flavors+0.181
 scar+0.171
mm+0.171
 chips+0.165
 fries+0.165
Neg logit contributions
 alone-0.138
ges-0.133
apping-0.132
umpy-0.131
 War-0.131
Token feel
Token ID1254
Feature activation+1.57e+00

Pos logit contributions
 alone+0.041
 crying+0.035
 sad+0.033
 by+0.033
 upset+0.032
Neg logit contributions
 flavors-0.044
mm-0.044
rob-0.042
 extract-0.039
 cones-0.038
Token better
Token ID1365
Feature activation+3.96e+00

Pos logit contributions
 flavors+0.038
 scar+0.035
mm+0.035
 fries+0.033
 french+0.032
Neg logit contributions
 alone-0.070
ges-0.068
apping-0.067
 War-0.067
umpy-0.066
Token.
Token ID13
Feature activation+6.49e-02

Pos logit contributions
 flavors+0.203
 scar+0.194
 fries+0.192
mm+0.191
rob+0.191
Neg logit contributions
 alone-0.097
 by-0.082
apping-0.073
umpy-0.070
ges-0.068
Token After
Token ID2293
Feature activation+1.83e+00

Pos logit contributions
 flavors+0.002
mm+0.002
 scar+0.002
 chips+0.002
 fries+0.002
Neg logit contributions
 alone-0.002
ges-0.002
apping-0.002
umpy-0.002
 War-0.002
Token taking
Token ID2263
Feature activation-7.02e-01

Pos logit contributions
 flavors+0.070
 scar+0.062
 fare+0.061
mm+0.059
 extract+0.059
Neg logit contributions
 alone-0.060
ges-0.056
 crying-0.055
 nowhere-0.054
 War-0.053
Token it
Token ID340
Feature activation-1.96e+00

Pos logit contributions
 alone+0.021
ges+0.019
 War+0.018
 by+0.018
umpy+0.017
Neg logit contributions
 flavors-0.029
mm-0.027
 scar-0.027
 fries-0.026
 cones-0.026
Token,
Token ID11
Feature activation-8.39e-01

Pos logit contributions
 alone+0.063
ges+0.057
apping+0.056
 War+0.055
umpy+0.055
Neg logit contributions
 flavors-0.078
mm-0.072
 scar-0.071
 fries-0.071
 french-0.069
Token around
Token ID1088
Feature activation-3.80e+00

Pos logit contributions
 alone+0.125
ges+0.117
apping+0.114
 War+0.114
umpy+0.112
Neg logit contributions
 flavors-0.230
 scar-0.218
mm-0.217
 fries-0.211
 cones-0.208
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
umpy+0.128
apping+0.121
 alone+0.120
 War+0.118
ges+0.117
Neg logit contributions
 flavors-0.136
 scar-0.126
 cones-0.123
 chips-0.123
 fries-0.122
Token laughing
Token ID14376
Feature activation-4.55e+00

Pos logit contributions
umpy+0.040
ges+0.039
 War+0.039
 alone+0.038
apping+0.038
Neg logit contributions
 flavors-0.053
 scar-0.050
 fries-0.049
mm-0.048
 chips-0.048
Token and
Token ID290
Feature activation-8.27e-01

Pos logit contributions
 alone+0.113
ges+0.104
apping+0.102
 War+0.101
umpy+0.099
Neg logit contributions
 flavors-0.204
 scar-0.194
mm-0.194
 fries-0.188
 extract-0.184
Token having
Token ID1719
Feature activation-1.49e+00

Pos logit contributions
 alone+0.019
ges+0.017
apping+0.017
 War+0.017
umpy+0.016
Neg logit contributions
 flavors-0.039
 scar-0.037
mm-0.037
 fries-0.036
 chips-0.035
Token a
Token ID257
Feature activation+1.37e+00

Pos logit contributions
 alone+0.048
 nowhere+0.042
 War+0.042
 by+0.041
and+0.039
Neg logit contributions
mm-0.054
 fare-0.053
 extract-0.052
 flavors-0.051
 scar-0.050
Token lot
Token ID1256
Feature activation+2.11e+00

Pos logit contributions
 flavors+0.046
mm+0.044
 scar+0.042
 extract+0.041
rob+0.041
Neg logit contributions
 alone-0.048
apping-0.044
 War-0.043
umpy-0.042
 by-0.042
Token of
Token ID286
Feature activation-4.69e+00

Pos logit contributions
 flavors+0.113
mm+0.106
 scar+0.106
 fries+0.104
 cones+0.103
Neg logit contributions
 alone-0.034
ges-0.031
 War-0.030
apping-0.029
umpy-0.028
Token fun
Token ID1257
Feature activation+2.35e+00

Pos logit contributions
 alone+0.197
apping+0.174
 War+0.174
 by+0.173
umpy+0.170
Neg logit contributions
 scar-0.125
 flavors-0.123
mm-0.122
 extract-0.120
 fries-0.116
Token.
Token ID13
Feature activation-4.86e-01

Pos logit contributions
 flavors+0.096
 scar+0.094
mm+0.092
 fries+0.090
 french+0.088
Neg logit contributions
 alone-0.068
apping-0.062
ges-0.062
 War-0.059
umpy-0.057
Token It
Token ID632
Feature activation-2.03e+00

Pos logit contributions
 alone+0.015
ges+0.014
apping+0.014
umpy+0.014
 War+0.014
Neg logit contributions
 flavors-0.018
mm-0.018
 scar-0.017
 fries-0.017
 cones-0.016
Token a
Token ID257
Feature activation+2.22e+00

Pos logit contributions
ivering+0.023
umpy+0.022
ucking+0.022
apping+0.021
 ey+0.021
Neg logit contributions
 flavors-0.007
 would-0.006
 flavor-0.006
 cones-0.006
 spend-0.005
Token little
Token ID1310
Feature activation-5.43e+00

Pos logit contributions
 flavors+0.089
 scar+0.088
 fries+0.085
mm+0.084
 chips+0.082
Neg logit contributions
ges-0.063
 alone-0.063
apping-0.057
 War-0.055
umpy-0.055
Token bird
Token ID6512
Feature activation-2.56e+00

Pos logit contributions
ges+0.189
 alone+0.188
 War+0.187
apping+0.182
AR+0.181
Neg logit contributions
 scar-0.172
 flavors-0.166
 cones-0.158
 chips-0.156
 fries-0.155
Token saw
Token ID2497
Feature activation-3.33e+00

Pos logit contributions
apping+0.104
 War+0.102
uttering+0.098
AR+0.096
umpy+0.096
Neg logit contributions
 flavors-0.065
 would-0.062
 spend-0.059
 scar-0.059
mm-0.057
Token a
Token ID257
Feature activation+2.25e+00

Pos logit contributions
umpy+0.156
apping+0.150
udge+0.140
and+0.138
ges+0.138
Neg logit contributions
 flavors-0.077
 scar-0.071
 fries-0.069
mm-0.069
 chips-0.068
Token big
Token ID1263
Feature activation+4.05e-01

Pos logit contributions
 flavors+0.083
 scar+0.083
 fries+0.076
 chips+0.075
mm+0.074
Neg logit contributions
ges-0.073
 alone-0.070
apping-0.068
 War-0.067
umpy-0.066
Token log
Token ID2604
Feature activation-1.67e-01

Pos logit contributions
 flavors+0.014
 scar+0.013
mm+0.013
 fries+0.012
 cones+0.012
Neg logit contributions
 alone-0.014
ges-0.014
apping-0.013
 War-0.013
umpy-0.013
Token.
Token ID13
Feature activation+3.31e-01

Pos logit contributions
 War+0.006
umpy+0.006
ges+0.005
apping+0.005
AR+0.005
Neg logit contributions
 flavors-0.005
 cones-0.005
 chips-0.005
 feast-0.005
 scar-0.005
Token The
Token ID383
Feature activation+2.93e+00

Pos logit contributions
 flavors+0.014
mm+0.014
 scar+0.013
 fries+0.013
 cones+0.013
Neg logit contributions
 alone-0.008
ges-0.008
apping-0.008
 War-0.008
umpy-0.008
Token bird
Token ID6512
Feature activation-2.64e+00

Pos logit contributions
 scar+0.100
 flavors+0.098
 cones+0.090
mm+0.090
 chips+0.088
Neg logit contributions
ges-0.105
 alone-0.103
apping-0.100
 War-0.098
umpy-0.095
Token did
Token ID750
Feature activation+7.32e-01

Pos logit contributions
 alone+0.096
ges+0.095
apping+0.095
 War+0.094
umpy+0.091
Neg logit contributions
 flavors-0.085
 scar-0.078
mm-0.076
 cones-0.072
 fries-0.072
Token took
Token ID1718
Feature activation+1.71e+00

Pos logit contributions
mm+0.107
 flavors+0.099
 extract+0.096
 fiber+0.091
rob+0.090
Neg logit contributions
 alone-0.098
 nowhere-0.091
 by-0.079
 crying-0.077
 sad-0.075
Token Lily
Token ID20037
Feature activation-2.81e+00

Pos logit contributions
 flavors+0.072
mm+0.069
 scar+0.067
 fries+0.065
 french+0.064
Neg logit contributions
 alone-0.049
ges-0.045
 War-0.043
apping-0.042
 by-0.040
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 War+0.104
umpy+0.099
apping+0.098
AR+0.094
ges+0.093
Neg logit contributions
 scar-0.089
 flavors-0.083
 cones-0.082
 chips-0.080
 extract-0.076
Token the
Token ID262
Feature activation+2.33e-01

Pos logit contributions
 flavors+0.048
mm+0.047
rob+0.044
 scar+0.043
 extract+0.043
Neg logit contributions
 alone-0.029
ges-0.024
 by-0.023
 crying-0.023
 nowhere-0.023
Token living
Token ID2877
Feature activation-2.96e+00

Pos logit contributions
 flavors+0.009
 scar+0.008
mm+0.008
 fries+0.008
 cones+0.008
Neg logit contributions
 alone-0.007
ges-0.007
apping-0.007
 War-0.007
umpy-0.007
Token room
Token ID2119
Feature activation-1.27e+00

Pos logit contributions
 War+0.090
umpy+0.089
ges+0.087
apping+0.086
 alone+0.083
Neg logit contributions
 flavors-0.113
 scar-0.105
 extract-0.098
 cones-0.098
mm-0.097
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
 alone+0.032
apping+0.028
ges+0.027
umpy+0.025
 War+0.025
Neg logit contributions
 flavors-0.059
mm-0.056
 scar-0.055
 fries-0.054
 french-0.053
Token gave
Token ID2921
Feature activation+1.64e+00

Pos logit contributions
 alone+0.074
 crying+0.063
 nowhere+0.063
 by+0.062
ges+0.061
Neg logit contributions
mm-0.071
 flavors-0.068
rob-0.061
 fiber-0.060
 french-0.059
Token her
Token ID607
Feature activation-2.29e+00

Pos logit contributions
 flavors+0.071
mm+0.068
 scar+0.067
 fries+0.065
 cones+0.064
Neg logit contributions
 alone-0.043
ges-0.040
 War-0.039
apping-0.039
umpy-0.037
Token some
Token ID617
Feature activation+5.52e+00

Pos logit contributions
 alone+0.064
ges+0.057
 War+0.055
 nowhere+0.054
 by+0.054
Neg logit contributions
 flavors-0.096
mm-0.095
 extract-0.088
 scar-0.088
 fries-0.087
Token milk
Token ID7545
Feature activation+6.74e+00

Pos logit contributions
 flavors+0.150
mm+0.139
 scar+0.138
 fries+0.130
 extract+0.126
Neg logit contributions
 alone-0.233
ges-0.222
apping-0.219
 War-0.218
umpy-0.215
Token her
Token ID607
Feature activation-8.42e-01

Pos logit contributions
 alone+0.015
ges+0.014
 War+0.013
apping+0.013
umpy+0.013
Neg logit contributions
 flavors-0.019
mm-0.018
 scar-0.018
 fries-0.018
 cones-0.017
Token shiny
Token ID22441
Feature activation-7.61e-01

Pos logit contributions
 alone+0.030
ges+0.027
 War+0.026
apping+0.026
 nowhere+0.026
Neg logit contributions
 flavors-0.029
mm-0.028
 scar-0.026
 fries-0.026
 french-0.026
Token bracelet
Token ID42893
Feature activation+1.58e+00

Pos logit contributions
 alone+0.031
ges+0.029
apping+0.029
 War+0.029
umpy+0.029
Neg logit contributions
 flavors-0.022
 scar-0.021
mm-0.020
 fries-0.019
 chips-0.019
Token every
Token ID790
Feature activation+6.40e+00

Pos logit contributions
 flavors+0.071
 fries+0.069
 french+0.068
mm+0.068
 extract+0.065
Neg logit contributions
 alone-0.046
 by-0.041
 outside-0.038
 nowhere-0.035
apping-0.034
Token day
Token ID1110
Feature activation+1.16e+00

Pos logit contributions
 flavors+0.217
 scar+0.203
mm+0.200
 fries+0.193
 chips+0.189
Neg logit contributions
 alone-0.225
ges-0.217
apping-0.215
 War-0.214
umpy-0.210
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
 flavors+0.045
 scar+0.043
mm+0.043
 fries+0.041
 cones+0.040
Neg logit contributions
 alone-0.036
ges-0.034
apping-0.033
 War-0.033
umpy-0.032
Token One
Token ID1881
Feature activation+7.25e+00

Pos logit contributions
 flavors+0.014
 scar+0.013
mm+0.013
 fries+0.012
 cones+0.012
Neg logit contributions
 alone-0.017
ges-0.016
apping-0.016
 War-0.016
umpy-0.016
Token day
Token ID1110
Feature activation+1.11e+00

Pos logit contributions
 flavors+0.185
mm+0.168
 scar+0.165
 spend+0.150
 flavor+0.147
Neg logit contributions
ges-0.314
 War-0.312
apping-0.306
AR-0.306
 alone-0.299
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
mm+0.025
 flavors+0.024
 scar+0.023
 chips+0.023
 extract+0.022
Neg logit contributions
umpy-0.051
 alone-0.051
 War-0.050
AR-0.048
apping-0.047
Token she
Token ID673
Feature activation+1.75e+00

Pos logit contributions
umpy+0.014
apping+0.013
 War+0.013
 alone+0.013
unk+0.013
Neg logit contributions
 scar-0.008
 chips-0.008
 flavors-0.008
mm-0.008
 extract-0.007
Token showed
Token ID3751
Feature activation-9.80e-03

Pos logit contributions
 flavors+0.047
 scar+0.044
mm+0.041
 fries+0.040
 french+0.038
Neg logit contributions
ges-0.073
apping-0.073
 alone-0.073
 War-0.072
umpy-0.070
Token says
Token ID1139
Feature activation+2.00e+00

Pos logit contributions
 alone+0.079
 by+0.072
umpy+0.072
 War+0.072
apping+0.070
Neg logit contributions
mm-0.045
 flavors-0.044
 fries-0.043
 feast-0.041
 scar-0.040
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
 flavors+0.056
 scar+0.052
 cones+0.051
 chips+0.051
mm+0.051
Neg logit contributions
 War-0.073
umpy-0.073
ges-0.073
 alone-0.072
AR-0.072
Token He
Token ID679
Feature activation+1.20e+00

Pos logit contributions
 alone+0.023
ges+0.022
apping+0.021
umpy+0.021
 War+0.021
Neg logit contributions
 flavors-0.027
mm-0.025
 scar-0.025
 cones-0.025
 chips-0.024
Token pulls
Token ID16194
Feature activation+2.04e+00

Pos logit contributions
 flavors+0.035
 scar+0.032
mm+0.031
 fries+0.030
 cones+0.030
Neg logit contributions
 alone-0.049
ges-0.048
apping-0.047
umpy-0.046
 War-0.046
Token harder
Token ID7069
Feature activation+3.24e+00

Pos logit contributions
 flavors+0.062
 scar+0.062
 cones+0.060
 chips+0.058
 extract+0.054
Neg logit contributions
apping-0.075
umpy-0.072
AR-0.072
uttering-0.071
oon-0.070
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
 flavors+0.138
 scar+0.131
mm+0.130
 fries+0.126
 cones+0.125
Neg logit contributions
 alone-0.087
ges-0.082
apping-0.080
 War-0.080
umpy-0.078
Token\n
Token ID198
Feature activation-1.01e+00

Pos logit contributions
 alone+0.024
ges+0.022
apping+0.022
umpy+0.022
 War+0.021
Neg logit contributions
 flavors-0.029
mm-0.028
 scar-0.027
 cones-0.027
 fries-0.027
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 alone+0.033
ges+0.031
apping+0.030
umpy+0.029
 War+0.029
Neg logit contributions
 flavors-0.036
mm-0.035
 scar-0.033
 cones-0.032
 chips-0.032
Token"
Token ID1
Feature activation-3.05e+00

Pos logit contributions
 alone+0.035
ges+0.033
apping+0.032
 crying+0.030
 nowhere+0.030
Neg logit contributions
 flavors-0.035
mm-0.033
 scar-0.032
 cones-0.032
 chips-0.031
TokenStop
Token ID19485
Feature activation-3.29e+00

Pos logit contributions
 alone+0.098
apping+0.093
ges+0.091
umpy+0.087
 War+0.086
Neg logit contributions
 flavors-0.118
 scar-0.108
mm-0.108
 chips-0.105
 cones-0.104
Token it
Token ID340
Feature activation-2.85e+00

Pos logit contributions
 alone+0.120
umpy+0.119
apping+0.115
ges+0.114
uttering+0.111
Neg logit contributions
 flavors-0.104
 scar-0.098
mm-0.096
 fries-0.092
 chips-0.091
Token time
Token ID640
Feature activation-2.44e+00

Pos logit contributions
 flavors+0.077
mm+0.074
 scar+0.074
 fries+0.069
 feast+0.069
Neg logit contributions
 alone-0.057
ges-0.057
 War-0.055
apping-0.052
umpy-0.051
Token,
Token ID11
Feature activation-5.44e-01

Pos logit contributions
 War+0.125
umpy+0.125
udge+0.122
 unconscious+0.116
ucking+0.115
Neg logit contributions
mm-0.043
 flavors-0.026
 feast-0.026
 spend-0.026
 cones-0.025
Token there
Token ID612
Feature activation-3.27e+00

Pos logit contributions
 War+0.019
umpy+0.019
udge+0.018
AR+0.018
ucking+0.018
Neg logit contributions
 would-0.016
mm-0.016
 spend-0.015
 will-0.015
,"-0.015
Token was
Token ID373
Feature activation-8.41e-01

Pos logit contributions
apping+0.140
umpy+0.139
 alone+0.138
 War+0.136
ges+0.135
Neg logit contributions
 flavors-0.088
mm-0.078
 would-0.071
 extract-0.068
 cones-0.067
Token a
Token ID257
Feature activation+1.88e+00

Pos logit contributions
umpy+0.037
apping+0.036
ges+0.036
 War+0.036
 alone+0.035
Neg logit contributions
 flavors-0.020
mm-0.020
 breakfast-0.020
 scar-0.020
 chips-0.018
Token little
Token ID1310
Feature activation-5.70e+00

Pos logit contributions
 scar+0.079
 flavors+0.079
 fries+0.075
mm+0.075
 chips+0.073
Neg logit contributions
ges-0.050
 alone-0.049
 War-0.046
apping-0.045
umpy-0.044
Token girl
Token ID2576
Feature activation-3.33e+00

Pos logit contributions
 alone+0.181
ges+0.175
 War+0.172
apping+0.170
umpy+0.167
Neg logit contributions
 flavors-0.211
 scar-0.205
mm-0.197
 fries-0.194
 cones-0.191
Token named
Token ID3706
Feature activation-1.32e+00

Pos logit contributions
 War+0.115
umpy+0.113
 alone+0.111
ges+0.110
apping+0.109
Neg logit contributions
 flavors-0.105
 scar-0.104
mm-0.098
 spend-0.094
 fries-0.093
Token Lily
Token ID20037
Feature activation-1.55e+00

Pos logit contributions
 War+0.043
apping+0.041
umpy+0.041
 alone+0.040
ges+0.040
Neg logit contributions
mm-0.045
 scar-0.044
 flavors-0.044
 spend-0.042
 chips-0.041
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
 War+0.057
apping+0.056
 alone+0.055
AR+0.055
umpy+0.055
Neg logit contributions
 flavors-0.047
 scar-0.044
mm-0.041
 spend-0.040
 cones-0.040
Token She
Token ID1375
Feature activation+3.38e+00

Pos logit contributions
mm+0.001
 flavors+0.001
 scar+0.001
 extract+0.001
 spend+0.001
Neg logit contributions
unk-0.001
 War-0.001
apping-0.001
 alone-0.001
ges-0.001
Token,
Token ID11
Feature activation-2.18e+00

Pos logit contributions
 alone+0.059
apping+0.052
ges+0.052
 War+0.050
 by+0.050
Neg logit contributions
 flavors-0.076
 scar-0.074
mm-0.073
 fries-0.071
 extract-0.070
Token but
Token ID475
Feature activation-5.45e-01

Pos logit contributions
 alone+0.060
ges+0.054
apping+0.054
 War+0.054
umpy+0.053
Neg logit contributions
 flavors-0.093
 scar-0.087
mm-0.087
 fries-0.085
 french-0.083
Token a
Token ID257
Feature activation+7.15e-01

Pos logit contributions
 alone+0.019
ges+0.018
apping+0.017
 War+0.017
umpy+0.017
Neg logit contributions
 flavors-0.019
 scar-0.018
mm-0.018
 fries-0.017
 cones-0.017
Token very
Token ID845
Feature activation-2.31e-01

Pos logit contributions
 flavors+0.026
mm+0.025
 scar+0.024
 fries+0.023
 cones+0.023
Neg logit contributions
 alone-0.024
ges-0.022
 War-0.022
apping-0.022
umpy-0.021
Token scary
Token ID14343
Feature activation-2.46e+00

Pos logit contributions
 alone+0.008
ges+0.007
apping+0.007
 War+0.007
umpy+0.007
Neg logit contributions
 flavors-0.008
 scar-0.008
mm-0.008
 fries-0.008
 cones-0.007
Token dog
Token ID3290
Feature activation-5.76e+00

Pos logit contributions
 alone+0.068
ges+0.066
apping+0.065
umpy+0.062
 War+0.062
Neg logit contributions
 flavors-0.101
 scar-0.096
mm-0.093
 fries-0.092
 chips-0.092
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
 alone+0.188
ges+0.181
apping+0.179
 War+0.177
umpy+0.175
Neg logit contributions
 flavors-0.208
 scar-0.196
mm-0.194
 fries-0.188
 chips-0.184
Token I
Token ID314
Feature activation-1.81e+00

Pos logit contributions
 alone+0.045
ges+0.041
apping+0.041
 nowhere+0.040
uttering+0.039
Neg logit contributions
mm-0.055
 flavors-0.054
 fries-0.052
 scar-0.051
 cones-0.050
Token'm
Token ID1101
Feature activation-1.06e+00

Pos logit contributions
 alone+0.067
ges+0.063
 War+0.062
umpy+0.062
apping+0.061
Neg logit contributions
 flavors-0.059
mm-0.057
 scar-0.055
 fries-0.053
 cones-0.053
Token sorry
Token ID7926
Feature activation-3.73e+00

Pos logit contributions
ges+0.034
umpy+0.032
apping+0.031
 alone+0.031
 War+0.031
Neg logit contributions
 scar-0.037
 flavors-0.036
 fries-0.036
mm-0.034
 french-0.034
Token it
Token ID340
Feature activation-3.06e+00

Pos logit contributions
 alone+0.123
apping+0.111
 War+0.109
ges+0.108
 by+0.106
Neg logit contributions
 flavors-0.145
 scar-0.133
mm-0.132
 fries-0.131
 extract-0.130
Tokenie
Token ID494
Feature activation-1.77e+00

Pos logit contributions
 War+0.134
apping+0.133
ges+0.129
 alone+0.127
umpy+0.125
Neg logit contributions
 scar-0.094
 flavors-0.092
mm-0.080
 would-0.080
 spend-0.076
Token felt
Token ID2936
Feature activation+3.54e+00

Pos logit contributions
 alone+0.066
ges+0.064
apping+0.063
 War+0.063
umpy+0.062
Neg logit contributions
 flavors-0.056
 scar-0.053
mm-0.051
 fries-0.049
 cones-0.048
Token very
Token ID845
Feature activation+6.86e-01

Pos logit contributions
 scar+0.110
 flavors+0.105
 thank+0.102
 fries+0.102
 sa+0.101
Neg logit contributions
ges-0.127
apping-0.124
AR-0.121
udge-0.116
unk-0.115
Token scared
Token ID12008
Feature activation-3.17e+00

Pos logit contributions
 flavors+0.023
mm+0.023
 thank+0.023
 wait+0.023
 scar+0.022
Neg logit contributions
ges-0.021
AR-0.020
udge-0.019
oon-0.018
 War-0.018
Token and
Token ID290
Feature activation-5.04e-01

Pos logit contributions
AR+0.125
unk+0.117
 War+0.117
haired+0.115
oon+0.112
Neg logit contributions
 scar-0.089
 enough-0.083
 cones-0.077
 flavors-0.075
 would-0.072
Token didn
Token ID1422
Feature activation-5.93e+00

Pos logit contributions
apping+0.019
ges+0.019
umpy+0.019
 War+0.019
udge+0.018
Neg logit contributions
 flavors-0.014
 scar-0.014
 fries-0.013
mm-0.013
 cones-0.013
Token't
Token ID470
Feature activation-5.47e+00

Pos logit contributions
 alone+0.145
ges+0.135
apping+0.133
 War+0.131
umpy+0.128
Neg logit contributions
 flavors-0.264
mm-0.253
 scar-0.252
 fries-0.243
 chips-0.240
Token know
Token ID760
Feature activation+3.62e+00

Pos logit contributions
 alone+0.246
ges+0.232
 War+0.229
umpy+0.228
apping+0.225
Neg logit contributions
 flavors-0.144
mm-0.135
 scar-0.126
 fries-0.125
 extract-0.121
Token what
Token ID644
Feature activation+1.38e+00

Pos logit contributions
 flavors+0.134
 scar+0.129
mm+0.126
 cones+0.120
 fries+0.120
Neg logit contributions
 alone-0.114
umpy-0.112
apping-0.109
 War-0.109
ges-0.109
Token to
Token ID284
Feature activation+1.44e+00

Pos logit contributions
 flavors+0.049
 scar+0.046
mm+0.045
 fries+0.044
 cones+0.044
Neg logit contributions
 alone-0.046
ges-0.044
apping-0.044
umpy-0.043
 War-0.043
Token do
Token ID466
Feature activation-2.84e-01

Pos logit contributions
 flavors+0.046
 scar+0.043
mm+0.043
 fries+0.042
 french+0.041
Neg logit contributions
 alone-0.054
ges-0.051
 War-0.049
apping-0.049
umpy-0.049
Token time
Token ID640
Feature activation-3.29e+00

Pos logit contributions
 flavors+0.050
mm+0.048
 scar+0.047
 chips+0.044
 fries+0.044
Neg logit contributions
 alone-0.038
ges-0.038
 War-0.036
apping-0.035
umpy-0.034
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
umpy+0.161
 War+0.159
udge+0.154
 alone+0.152
 unconscious+0.151
Neg logit contributions
mm-0.067
 flavors-0.049
 cones-0.047
 feast-0.046
 spend-0.042
Token there
Token ID612
Feature activation-3.97e+00

Pos logit contributions
umpy+0.045
 War+0.044
AR+0.042
apping+0.041
udge+0.041
Neg logit contributions
mm-0.051
 would-0.048
 scar-0.047
 spend-0.046
 flavors-0.046
Token was
Token ID373
Feature activation-1.58e+00

Pos logit contributions
umpy+0.168
 alone+0.168
apping+0.166
 War+0.163
ges+0.162
Neg logit contributions
 flavors-0.107
mm-0.097
 would-0.086
 extract-0.085
 cones-0.083
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
umpy+0.070
apping+0.068
ges+0.068
 alone+0.067
 War+0.067
Neg logit contributions
mm-0.038
 breakfast-0.037
 flavors-0.036
 scar-0.036
 chips-0.034
Token little
Token ID1310
Feature activation-6.43e+00

Pos logit contributions
 flavors+0.049
 scar+0.049
mm+0.046
 fries+0.046
 chips+0.045
Neg logit contributions
ges-0.035
 alone-0.034
 War-0.032
apping-0.032
umpy-0.031
Token girl
Token ID2576
Feature activation-4.24e+00

Pos logit contributions
 alone+0.198
ges+0.190
 War+0.186
apping+0.185
umpy+0.181
Neg logit contributions
 flavors-0.246
 scar-0.236
mm-0.231
 fries-0.225
 cones-0.221
Token named
Token ID3706
Feature activation-2.28e+00

Pos logit contributions
 War+0.141
umpy+0.139
 alone+0.137
ges+0.135
apping+0.135
Neg logit contributions
 flavors-0.137
 scar-0.135
mm-0.128
 fries-0.123
 spend-0.123
Token Lily
Token ID20037
Feature activation-2.21e+00

Pos logit contributions
 War+0.067
 alone+0.067
apping+0.066
umpy+0.065
ges+0.064
Neg logit contributions
 flavors-0.084
mm-0.084
 scar-0.082
 chips-0.078
 extract-0.077
Token.
Token ID13
Feature activation-8.41e-01

Pos logit contributions
apping+0.079
 War+0.079
 alone+0.078
umpy+0.077
ges+0.077
Neg logit contributions
 flavors-0.068
 scar-0.063
mm-0.059
 cones-0.058
 spend-0.057
Token She
Token ID1375
Feature activation+2.63e+00

Pos logit contributions
 alone+0.028
 War+0.027
ges+0.027
apping+0.027
unk+0.027
Neg logit contributions
 flavors-0.028
mm-0.028
 scar-0.026
 extract-0.025
 chips-0.025
Token time
Token ID640
Feature activation-2.83e+00

Pos logit contributions
 flavors+0.067
 scar+0.064
mm+0.064
 fries+0.060
 chips+0.060
Neg logit contributions
 alone-0.048
ges-0.047
 War-0.046
apping-0.044
umpy-0.043
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
umpy+0.142
 War+0.140
udge+0.135
ivering+0.133
 unconscious+0.133
Neg logit contributions
mm-0.051
 flavors-0.037
 feast-0.036
 cones-0.035
 would-0.034
Token there
Token ID612
Feature activation-3.59e+00

Pos logit contributions
umpy+0.036
 War+0.036
AR+0.033
ucking+0.033
udge+0.033
Neg logit contributions
 would-0.032
mm-0.031
 spend-0.030
 will-0.030
 scar-0.030
Token was
Token ID373
Feature activation-1.24e+00

Pos logit contributions
 alone+0.147
umpy+0.145
apping+0.145
 War+0.142
ges+0.140
Neg logit contributions
 flavors-0.104
mm-0.093
 extract-0.084
 scar-0.083
 cones-0.082
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
umpy+0.052
apping+0.051
ges+0.051
 alone+0.051
 War+0.050
Neg logit contributions
 flavors-0.033
 scar-0.031
mm-0.031
 breakfast-0.030
 chips-0.029
Token little
Token ID1310
Feature activation-6.21e+00

Pos logit contributions
 scar+0.067
 flavors+0.066
 fries+0.063
mm+0.063
 chips+0.062
Neg logit contributions
ges-0.043
 alone-0.041
 War-0.040
apping-0.039
umpy-0.038
Token girl
Token ID2576
Feature activation-3.73e+00

Pos logit contributions
 alone+0.197
ges+0.192
 War+0.188
apping+0.186
umpy+0.182
Neg logit contributions
 flavors-0.229
 scar-0.223
mm-0.214
 fries-0.210
 cones-0.207
Token named
Token ID3706
Feature activation-1.84e+00

Pos logit contributions
 War+0.128
umpy+0.125
 alone+0.123
ges+0.122
apping+0.121
Neg logit contributions
 flavors-0.118
 scar-0.117
mm-0.109
 french-0.106
 spend-0.106
Token Lily
Token ID20037
Feature activation-1.97e+00

Pos logit contributions
 War+0.058
apping+0.057
umpy+0.056
 alone+0.056
ges+0.054
Neg logit contributions
mm-0.063
 flavors-0.063
 scar-0.062
 spend-0.059
 chips-0.058
Token.
Token ID13
Feature activation-3.28e-01

Pos logit contributions
 War+0.071
apping+0.070
 alone+0.069
umpy+0.069
ges+0.069
Neg logit contributions
 flavors-0.061
 scar-0.056
 spend-0.052
mm-0.052
 cones-0.051
Token She
Token ID1375
Feature activation+3.12e+00

Pos logit contributions
 alone+0.011
unk+0.011
 War+0.011
apping+0.011
ges+0.011
Neg logit contributions
 flavors-0.010
mm-0.010
 scar-0.010
 extract-0.009
 spend-0.009