TOP ACTIVATIONS
MAX = 9.480

and enjoyed the beautiful sights they found . At the end
win in all the games they played .
about all the fun things they did on their holiday .
was the funn iest thing they had ever done . <|endoftext|>
prevent accidents and save things we love . Lily nodded .
share the news with everyone they knew . But then one
muff ins were the best they ever had ! <|endoftext|> Once
the y umm iest dinner they ever had . <|endoftext|> Once
was the best mel on they â d ever
of her song with everyone they met : ' Be patient
house was the best decision they had ever made . <|endoftext|>
all the other fun things they could do in the forest
collecting all kinds of things they could use to build the
good friends and share everything they found in the park from
of all the wonderful things they had done . It was
about all the delicious things they would make with the sa
make new memories the best we can ," she said .
to remember all the things we saw . I think you
it would take them anywhere they wanted to go .
the y umm iest thing they ever tasted . <|endoftext|> Once

MIN ACTIVATIONS
MIN = -11.401

The rh in oc eros nodded its head .
Billy . The rh in oc eros looked at Billy with
. The av oc ados were shared with everybody
The rh in oc eros walked to the fence
The rh in oc eros looked up , hopeful
happy ." The rh in oc eros did what the bird
here ?" The rh in oc eros replied , " I
wings !" The rh in oc eros laughed and said ,
The rh in oc eros couldn 't believe his
The rh in oc eros was so excited that
help . The rh in oc eros was very happy and
The rh in oc eros was so excited to
away . The rh in oc eros felt che ery and
Finally , the rh in oc eros remembered that animals often
The rh in oc eros was so delighted with
The rh in oc eros smiled and flew away
he ." The rh in oc eros felt a little better
The bad rh in oc eros was so happy .
now !" The rh in oc eros felt happy too .
doing ?" The rh in oc eros replied , " I

INTERVAL 4.260 - 9.480
CONTAINS 1.898%

ll have to go back and get a new
going on a trip ! We need to order tickets for
log in the park had brought them even closer together .
that he was not having fun . <|endoftext|> L ily and
it was dirty and said they should not take it .

INTERVAL -0.960 - 4.260
CONTAINS 68.895%

to explore so he went outside . There , the moon
boss of the forest ." The bird thought for
to play in the forest with his friends . One evening
, " That sounds like a plan . You are a
and his mom my bought it for him .

INTERVAL -6.180 - -0.960
CONTAINS 29.132%

rainbow in the sky . She wondered how it got there
you for an ice - cream !" Jack smiled
and learn something new . They shared stories , laughed
a special surprise . It was a bright pink butterfly .
en vious of him . He did not like to shoot

INTERVAL -11.401 - -6.180
CONTAINS 0.075%

hang it . But the wall had no hook to hang
and sm oky . The fire alarm went off . Sara
job . The fire man was so happy that
Every day , the knob made sure that the box
Every morning , the bell would ring and the village
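
NOTE ON INTERVALS
The four INTERVAL rows above look like equal-width bins spanning MIN to MAX, and the CONTAINS shares look like they partition all sampled activations. The dashboard does not state either point; both are inferred from the listed numbers. The sketch below checks that reading. The helper name `interval_edges` and the bin count of 4 are assumptions made for illustration.

```python
# Minimal sketch, assuming the INTERVAL rows are equal-width bins over [MIN, MAX].
# `interval_edges` is a hypothetical helper, not part of any dashboard tooling.

def interval_edges(lo: float, hi: float, bins: int = 4) -> list[float]:
    """Return the bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

# Values copied from the MIN / MAX lines and the INTERVAL / CONTAINS rows.
edges = interval_edges(-11.401, 9.480)
listed = [-11.401, -6.180, -0.960, 4.260, 9.480]
shares = [1.898, 68.895, 29.132, 0.075]

# Largest disagreement between computed and listed edges (display rounding expected).
max_gap = max(abs(a - b) for a, b in zip(edges, listed))
# Sum of the per-interval percentages.
total_share = sum(shares)

print([round(e, 3) for e in edges])
print(max_gap, total_share)
```

If the equal-width reading holds, `max_gap` should stay at the level of display rounding and `total_share` should come out at about 100.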
Token and
Token ID290
Feature activation+8.57e-01

Pos logit contributions
 parties+0.003
text+0.002
P+0.002
 recess+0.002
 tea+0.002
Neg logit contributions
 ringing-0.004
 phrase-0.004
 guardian-0.003
cut-0.003
 claw-0.003
Token enjoyed
Token ID8359
Feature activation+1.68e+00

Pos logit contributions
 parties+0.016
text+0.012
 recess+0.011
 outside+0.011
 tea+0.010
Neg logit contributions
 phrase-0.038
cut-0.036
 ringing-0.035
 guardian-0.034
 claw-0.034
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
 parties+0.032
text+0.029
 recess+0.024
ashed+0.024
 outside+0.023
Neg logit contributions
 phrase-0.071
 ringing-0.068
cut-0.066
 guardian-0.063
 drill-0.063
Token beautiful
Token ID4950
Feature activation+2.14e-01

Pos logit contributions
 parties+0.065
text+0.054
 recess+0.053
P+0.049
 outside+0.049
Neg logit contributions
 phrase-0.088
 ringing-0.083
cut-0.083
 drill-0.078
 guardian-0.078
Token sights
Token ID21343
Feature activation+1.66e+00

Pos logit contributions
 parties+0.006
 outside+0.005
 recess+0.005
text+0.005
 tea+0.005
Neg logit contributions
 phrase-0.008
cut-0.007
fly-0.007
rum-0.007
 guardian-0.007
Token they
Token ID484
Feature activation+9.48e+00

Pos logit contributions
 parties+0.035
text+0.032
P+0.030
ackers+0.030
 recess+0.030
Neg logit contributions
 phrase-0.065
 ringing-0.064
cut-0.061
fly-0.058
 word-0.058
Token found
Token ID1043
Feature activation+2.85e+00

Pos logit contributions
 parties+0.264
text+0.241
 tea+0.226
aws+0.212
ashes+0.209
Neg logit contributions
 phrase-0.308
 ringing-0.301
cut-0.285
fly-0.281
 guardian-0.278
Token.
Token ID13
Feature activation+2.52e-01

Pos logit contributions
 parties+0.057
text+0.054
 recess+0.047
 tea+0.044
ashed+0.044
Neg logit contributions
 phrase-0.115
cut-0.110
 ringing-0.109
 guardian-0.105
 word-0.103
Token At
Token ID1629
Feature activation-3.74e+00

Pos logit contributions
 parties+0.007
 recess+0.006
P+0.006
text+0.005
 outside+0.005
Neg logit contributions
 phrase-0.008
ric-0.008
ller-0.007
cut-0.007
 ringing-0.007
Token the
Token ID262
Feature activation-3.47e-01

Pos logit contributions
 phrase+0.144
 ringing+0.142
cut+0.138
 guardian+0.134
 drill+0.132
Neg logit contributions
 parties-0.082
text-0.071
ashed-0.063
aws-0.061
ashes-0.059
Token end
Token ID886
Feature activation-1.75e+00

Pos logit contributions
 phrase+0.012
cut+0.011
 ringing+0.011
 guardian+0.010
 claw+0.010
Neg logit contributions
 parties-0.010
text-0.008
 recess-0.008
 outside-0.008
 together-0.008
Token win
Token ID1592
Feature activation+1.42e+00

Pos logit contributions
 phrase+0.074
cut+0.070
 ringing+0.066
rum+0.066
fly+0.065
Neg logit contributions
 parties-0.057
 recess-0.046
 outside-0.045
text-0.045
 together-0.043
Token in
Token ID287
Feature activation+1.06e+00

Pos logit contributions
 parties+0.027
text+0.022
 recess+0.020
 tea+0.019
 outside+0.019
Neg logit contributions
 phrase-0.062
cut-0.057
 ringing-0.057
rum-0.056
 drill-0.055
Token all
Token ID477
Feature activation+2.93e+00

Pos logit contributions
 parties+0.022
text+0.018
ashed+0.017
 outside+0.016
 tea+0.016
Neg logit contributions
 phrase-0.044
 ringing-0.041
cut-0.040
rum-0.039
 claw-0.039
Token the
Token ID262
Feature activation+3.59e+00

Pos logit contributions
 parties+0.064
text+0.052
 recess+0.047
 tea+0.047
 outside+0.045
Neg logit contributions
 phrase-0.121
cut-0.115
 ringing-0.111
rum-0.110
 guardian-0.108
Token games
Token ID1830
Feature activation+4.42e+00

Pos logit contributions
 parties+0.087
text+0.080
ashed+0.072
 recess+0.070
 tea+0.067
Neg logit contributions
 phrase-0.137
 ringing-0.122
cut-0.122
 word-0.121
 drill-0.118
Token they
Token ID484
Feature activation+9.05e+00

Pos logit contributions
 parties+0.086
text+0.083
|+0.067
 recess+0.065
ashed+0.065
Neg logit contributions
 phrase-0.181
cut-0.175
 ringing-0.167
 word-0.165
 drill-0.165
Token played
Token ID2826
Feature activation+7.40e+00

Pos logit contributions
 parties+0.234
text+0.212
 tea+0.191
 recess+0.186
 outside+0.182
Neg logit contributions
 phrase-0.320
cut-0.300
 ringing-0.297
 claw-0.282
 drill-0.281
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
text+0.182
ashed+0.158
 parties+0.153
ackers+0.150
ens+0.141
Neg logit contributions
 phrase-0.266
pe-0.247
 word-0.244
cut-0.236
 ringing-0.229
Token
Token ID220
Feature activation-1.31e+00

Pos logit contributions
 parties+0.012
text+0.009
 recess+0.009
 outside+0.008
 tea+0.008
Neg logit contributions
 phrase-0.025
cut-0.023
 ringing-0.022
 drill-0.022
rum-0.022
Token
Token ID198
Feature activation+8.02e-01

Pos logit contributions
 phrase+0.049
cut+0.047
rum+0.046
 drill+0.045
ric+0.044
Neg logit contributions
 parties-0.033
 recess-0.027
 tea-0.026
 party-0.025
 picnic-0.024
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
 parties+0.017
text+0.013
 recess+0.013
 tea+0.012
ashed+0.012
Neg logit contributions
 phrase-0.034
cut-0.032
 ringing-0.030
rum-0.030
 drill-0.030
Token about
Token ID546
Feature activation-2.16e+00

Pos logit contributions
 parties+0.100
 recess+0.083
text+0.083
P+0.076
aws+0.076
Neg logit contributions
 phrase-0.156
cut-0.143
 ringing-0.139
 guardian-0.137
rum-0.133
Token all
Token ID477
Feature activation+1.58e+00

Pos logit contributions
 phrase+0.100
cut+0.094
rum+0.090
 ringing+0.089
 claw+0.089
Neg logit contributions
 parties-0.040
 outside-0.028
 recess-0.027
text-0.026
 together-0.026
Token the
Token ID262
Feature activation+2.65e+00

Pos logit contributions
 parties+0.029
text+0.023
 recess+0.022
 outside+0.020
aws+0.019
Neg logit contributions
 phrase-0.070
cut-0.066
 ringing-0.065
 guardian-0.063
rum-0.063
Token fun
Token ID1257
Feature activation+4.01e+00

Pos logit contributions
 parties+0.055
text+0.041
 recess+0.040
 outside+0.040
 tea+0.037
Neg logit contributions
 phrase-0.114
cut-0.107
 ringing-0.103
rum-0.102
 guardian-0.102
Token things
Token ID1243
Feature activation+1.88e+00

Pos logit contributions
 parties+0.069
text+0.065
 recess+0.059
P+0.058
ashed+0.058
Neg logit contributions
 phrase-0.163
 ringing-0.157
 word-0.149
cut-0.146
 prayer-0.144
Token they
Token ID484
Feature activation+8.96e+00

Pos logit contributions
 parties+0.030
text+0.027
 recess+0.026
P+0.022
aws+0.022
Neg logit contributions
 phrase-0.082
 ringing-0.080
cut-0.079
fly-0.076
 guardian-0.075
Token did
Token ID750
Feature activation+1.14e+00

Pos logit contributions
 parties+0.240
text+0.217
 recess+0.198
 tea+0.191
aws+0.191
Neg logit contributions
 phrase-0.308
cut-0.289
 ringing-0.287
fly-0.279
 guardian-0.278
Token on
Token ID319
Feature activation+1.82e+00

Pos logit contributions
text+0.023
 parties+0.022
 recess+0.020
ashed+0.020
 Woo+0.019
Neg logit contributions
 phrase-0.045
 ringing-0.042
cut-0.041
rum-0.040
pe-0.040
Token their
Token ID511
Feature activation+2.69e+00

Pos logit contributions
 parties+0.032
text+0.028
 recess+0.025
ashed+0.025
 outside+0.022
Neg logit contributions
 phrase-0.079
cut-0.074
 ringing-0.074
rum-0.072
 drill-0.071
Token holiday
Token ID9912
Feature activation+4.62e+00

Pos logit contributions
 parties+0.062
text+0.054
 recess+0.050
he+0.049
 Woo+0.048
Neg logit contributions
 phrase-0.103
 ringing-0.095
cut-0.094
 guardian-0.091
 claw-0.091
Token.
Token ID13
Feature activation+6.61e-01

Pos logit contributions
text+0.096
ashed+0.083
 Woo+0.078
ashes+0.076
 parties+0.075
Neg logit contributions
 phrase-0.174
 ringing-0.172
 word-0.160
ller-0.158
 guardian-0.158
Token was
Token ID373
Feature activation+7.38e-01

Pos logit contributions
 phrase+0.066
cut+0.063
 ringing+0.061
 claw+0.060
fly+0.060
Neg logit contributions
 parties-0.036
text-0.030
 recess-0.029
 outside-0.026
 tea-0.025
Token the
Token ID262
Feature activation+2.34e+00

Pos logit contributions
 parties+0.017
text+0.013
 recess+0.013
 outside+0.013
 tea+0.012
Neg logit contributions
 phrase-0.030
cut-0.028
 ringing-0.027
 guardian-0.027
rum-0.027
Token funn
Token ID36090
Feature activation+1.11e+00

Pos logit contributions
 parties+0.051
text+0.040
 recess+0.039
 outside+0.036
 together+0.035
Neg logit contributions
 phrase-0.098
cut-0.092
 ringing-0.089
 guardian-0.089
 claw-0.088
Tokeniest
Token ID6386
Feature activation+2.42e+00

Pos logit contributions
 parties+0.039
 recess+0.033
text+0.032
 outside+0.031
|+0.031
Neg logit contributions
 phrase-0.032
cut-0.028
rum-0.028
 claw-0.027
 ringing-0.027
Token thing
Token ID1517
Feature activation-1.01e+00

Pos logit contributions
 parties+0.065
text+0.053
 recess+0.052
 outside+0.051
 tea+0.049
Neg logit contributions
 phrase-0.089
cut-0.082
 ringing-0.080
rum-0.078
 guardian-0.078
Token they
Token ID484
Feature activation+8.91e+00

Pos logit contributions
 phrase+0.046
cut+0.043
 ringing+0.042
fly+0.042
 guardian+0.041
Neg logit contributions
 parties-0.019
 recess-0.014
text-0.014
aws-0.011
 tea-0.011
Token had
Token ID550
Feature activation+5.47e+00

Pos logit contributions
 parties+0.273
text+0.247
aws+0.226
 recess+0.225
 tea+0.223
Neg logit contributions
 phrase-0.268
cut-0.249
 ringing-0.242
fly-0.239
 claw-0.238
Token ever
Token ID1683
Feature activation+3.47e+00

Pos logit contributions
 parties+0.131
text+0.127
 tea+0.114
 recess+0.113
 Holly+0.113
Neg logit contributions
 phrase-0.188
 ringing-0.168
rum-0.164
pe-0.164
 meaning-0.162
Token done
Token ID1760
Feature activation+3.23e+00

Pos logit contributions
 parties+0.090
text+0.075
 recess+0.071
 outside+0.070
 tea+0.066
Neg logit contributions
 phrase-0.127
 ringing-0.117
 claw-0.115
cut-0.114
fly-0.112
Token.
Token ID13
Feature activation+7.69e-01

Pos logit contributions
 parties+0.063
text+0.056
 recess+0.051
ashed+0.045
 picnic+0.044
Neg logit contributions
 phrase-0.138
 ringing-0.126
rum-0.125
cut-0.124
 word-0.120
Token<|endoftext|>
Token ID50256
Feature activation+1.64e+00

Pos logit contributions
 parties+0.018
 recess+0.014
 outside+0.013
text+0.013
 tea+0.012
Neg logit contributions
 phrase-0.031
cut-0.029
rum-0.028
 ringing-0.028
ller-0.027
Token prevent
Token ID2948
Feature activation-2.81e+00

Pos logit contributions
 phrase+0.065
cut+0.061
 ringing+0.059
 guardian+0.057
 drill+0.057
Neg logit contributions
 parties-0.047
text-0.039
 recess-0.038
 outside-0.037
 tea-0.035
Token accidents
Token ID17390
Feature activation+7.08e-01

Pos logit contributions
 phrase+0.119
cut+0.111
 ringing+0.110
rum+0.107
 claw+0.106
Neg logit contributions
 parties-0.059
text-0.045
 recess-0.044
 outside-0.042
 tea-0.040
Token and
Token ID290
Feature activation+3.67e+00

Pos logit contributions
 parties+0.016
 recess+0.013
text+0.013
 outside+0.011
aws+0.011
Neg logit contributions
 phrase-0.029
cut-0.027
 ringing-0.027
rum-0.026
 drill-0.026
Token save
Token ID3613
Feature activation+7.36e-01

Pos logit contributions
 parties+0.090
 outside+0.074
 together+0.073
 they+0.071
 each+0.068
Neg logit contributions
 phrase-0.149
rum-0.133
cut-0.132
 guardian-0.131
omm-0.130
Token things
Token ID1243
Feature activation+3.38e+00

Pos logit contributions
 parties+0.016
text+0.013
 recess+0.012
ashed+0.011
 picnic+0.011
Neg logit contributions
 phrase-0.030
 ringing-0.028
cut-0.028
rum-0.027
 drill-0.027
Token we
Token ID356
Feature activation+8.91e+00

Pos logit contributions
 parties+0.065
text+0.053
 recess+0.051
ashed+0.044
 tea+0.043
Neg logit contributions
 phrase-0.144
cut-0.141
 ringing-0.137
rum-0.132
 drill-0.132
Token love
Token ID1842
Feature activation+2.69e+00

Pos logit contributions
 parties+0.204
 outside+0.155
 recess+0.152
text+0.150
 together+0.144
Neg logit contributions
 phrase-0.369
cut-0.342
rum-0.329
 ringing-0.329
 guardian-0.327
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 parties+0.051
text+0.043
 recess+0.043
ashed+0.038
|+0.035
Neg logit contributions
 phrase-0.111
cut-0.106
 word-0.103
rum-0.103
 ringing-0.103
Token Lily
Token ID20037
Feature activation+1.87e+00

Pos logit contributions
 parties+0.019
text+0.013
 outside+0.012
 recess+0.012
 together+0.011
Neg logit contributions
 phrase-0.071
cut-0.067
 ringing-0.065
 claw-0.064
 guardian-0.063
Token nodded
Token ID14464
Feature activation-2.35e+00

Pos logit contributions
 parties+0.038
text+0.032
 outside+0.029
 together+0.029
 played+0.028
Neg logit contributions
 phrase-0.078
cut-0.073
 ringing-0.073
rum-0.072
 drill-0.071
Token.
Token ID13
Feature activation+5.31e-01

Pos logit contributions
 phrase+0.096
 ringing+0.087
cut+0.087
 guardian+0.084
 word+0.084
Neg logit contributions
 parties-0.055
text-0.044
 recess-0.043
 picnic-0.038
aws-0.038
Token share
Token ID2648
Feature activation+2.55e+00

Pos logit contributions
 phrase+0.093
cut+0.087
 ringing+0.086
 guardian+0.084
 claw+0.083
Neg logit contributions
 parties-0.042
text-0.033
 recess-0.031
 outside-0.030
 tea-0.028
Token the
Token ID262
Feature activation+3.04e+00

Pos logit contributions
 parties+0.041
text+0.034
 recess+0.030
ashed+0.027
 outside+0.027
Neg logit contributions
 phrase-0.117
cut-0.109
 ringing-0.108
 word-0.105
rum-0.105
Token news
Token ID1705
Feature activation+1.04e+00

Pos logit contributions
 parties+0.072
text+0.066
 recess+0.062
ashed+0.058
aws+0.057
Neg logit contributions
 phrase-0.117
 ringing-0.107
cut-0.104
 guardian-0.102
 word-0.102
Token with
Token ID351
Feature activation+2.05e+00

Pos logit contributions
 parties+0.018
text+0.017
 recess+0.015
ashed+0.013
aws+0.013
Neg logit contributions
 phrase-0.047
cut-0.043
 ringing-0.043
 word-0.042
 guardian-0.041
Token everyone
Token ID2506
Feature activation+4.36e+00

Pos logit contributions
 parties+0.036
text+0.026
 recess+0.025
 outside+0.024
 tea+0.022
Neg logit contributions
 phrase-0.094
cut-0.088
 ringing-0.087
rum-0.085
 guardian-0.085
Token they
Token ID484
Feature activation+8.90e+00

Pos logit contributions
 parties+0.084
 recess+0.069
text+0.067
aws+0.055
ashed+0.054
Neg logit contributions
 phrase-0.188
cut-0.179
 ringing-0.176
rum-0.174
 guardian-0.173
Token knew
Token ID2993
Feature activation-2.71e-01

Pos logit contributions
 parties+0.225
text+0.212
 recess+0.184
 tea+0.183
aws+0.181
Neg logit contributions
 phrase-0.304
 ringing-0.294
cut-0.286
 guardian-0.281
fly-0.274
Token.
Token ID13
Feature activation+4.57e-01

Pos logit contributions
 phrase+0.013
cut+0.012
 ringing+0.012
rum+0.012
 guardian+0.012
Neg logit contributions
 parties-0.004
text-0.003
 recess-0.003
 outside-0.003
 together-0.002
Token But
Token ID887
Feature activation-3.44e+00

Pos logit contributions
 parties+0.009
text+0.007
 recess+0.007
 outside+0.006
 tea+0.006
Neg logit contributions
 phrase-0.020
cut-0.019
 ringing-0.018
 guardian-0.018
 claw-0.018
Token then
Token ID788
Feature activation+1.10e+00

Pos logit contributions
 phrase+0.172
rum+0.159
cut+0.157
 drill+0.155
 claw+0.151
Neg logit contributions
 parties-0.052
 together-0.041
 outside-0.037
 they-0.037
 recess-0.033
Token one
Token ID530
Feature activation+5.21e-01

Pos logit contributions
 parties+0.022
text+0.017
 recess+0.016
 outside+0.015
ashes+0.014
Neg logit contributions
 phrase-0.047
cut-0.044
 ringing-0.044
 guardian-0.043
 claw-0.043
Token muff
Token ID27563
Feature activation-1.96e+00

Pos logit contributions
 parties+0.025
 recess+0.020
text+0.020
 outside+0.020
 tea+0.019
Neg logit contributions
 phrase-0.033
cut-0.031
rum-0.030
 ringing-0.029
fly-0.029
Tokenins
Token ID1040
Feature activation-1.12e+00

Pos logit contributions
 phrase+0.074
cut+0.068
 ringing+0.066
rum+0.065
 guardian+0.065
Neg logit contributions
 parties-0.051
text-0.041
 recess-0.041
 outside-0.040
 tea-0.038
Token were
Token ID547
Feature activation+2.23e+00

Pos logit contributions
 phrase+0.048
cut+0.045
 ringing+0.044
rum+0.043
 guardian+0.043
Neg logit contributions
 parties-0.023
text-0.017
 recess-0.017
 outside-0.016
 tea-0.015
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
 parties+0.048
text+0.036
 recess+0.036
 outside+0.035
 tea+0.032
Neg logit contributions
 phrase-0.095
cut-0.089
 ringing-0.086
 guardian-0.085
rum-0.085
Token best
Token ID1266
Feature activation+5.10e+00

Pos logit contributions
 parties+0.070
text+0.059
 recess+0.055
 outside+0.051
ashed+0.049
Neg logit contributions
 phrase-0.137
cut-0.127
 ringing-0.125
 guardian-0.124
 claw-0.121
Token they
Token ID484
Feature activation+8.89e+00

Pos logit contributions
 parties+0.116
text+0.100
 recess+0.096
aws+0.087
P+0.086
Neg logit contributions
 phrase-0.202
 ringing-0.184
 guardian-0.180
 word-0.179
cut-0.179
Token ever
Token ID1683
Feature activation+3.25e+00

Pos logit contributions
 parties+0.264
text+0.251
aws+0.233
 recess+0.232
 tea+0.222
Neg logit contributions
 phrase-0.263
cut-0.246
 ringing-0.240
 guardian-0.232
fly-0.227
Token had
Token ID550
Feature activation+5.49e+00

Pos logit contributions
 parties+0.088
text+0.077
 recess+0.069
aws+0.068
 outside+0.067
Neg logit contributions
 phrase-0.110
 ringing-0.106
fly-0.100
 claw-0.098
rum-0.098
Token!
Token ID0
Feature activation+1.07e+00

Pos logit contributions
text+0.132
 parties+0.130
 Woo+0.118
ens+0.117
ackers+0.113
Neg logit contributions
 phrase-0.184
 ringing-0.168
 meaning-0.159
 word-0.157
 guardian-0.155
Token<|endoftext|>
Token ID50256
Feature activation+1.72e+00

Pos logit contributions
 parties+0.031
 recess+0.026
 outside+0.025
 tea+0.024
 camping+0.024
Neg logit contributions
 phrase-0.035
 word-0.030
omm-0.030
 guardian-0.029
rum-0.029
TokenOnce
Token ID7454
Feature activation+1.29e+00

Pos logit contributions
 parties+0.056
ashing+0.046
ashes+0.046
ashed+0.046
 recess+0.044
Neg logit contributions
 phrase-0.048
 word-0.043
fly-0.042
pe-0.041
 beast-0.041
Token the
Token ID262
Feature activation+1.89e+00

Pos logit contributions
 parties+0.012
text+0.009
 recess+0.009
 outside+0.009
 tea+0.008
Neg logit contributions
 phrase-0.026
cut-0.025
 ringing-0.024
 guardian-0.023
rum-0.023
Token y
Token ID331
Feature activation-1.08e+00

Pos logit contributions
 parties+0.040
text+0.031
 recess+0.030
 outside+0.029
 together+0.027
Neg logit contributions
 phrase-0.081
cut-0.075
 ringing-0.073
 guardian-0.072
 claw-0.071
Tokenumm
Token ID13929
Feature activation-1.97e+00

Pos logit contributions
 phrase+0.031
cut+0.028
 ringing+0.027
rum+0.026
 claw+0.026
Neg logit contributions
 parties-0.037
text-0.032
 recess-0.032
 outside-0.031
 together-0.030
Tokeniest
Token ID6386
Feature activation+2.33e+00

Pos logit contributions
 phrase+0.065
cut+0.059
 ringing+0.057
 guardian+0.056
 claw+0.056
Neg logit contributions
 parties-0.060
 outside-0.050
text-0.050
 recess-0.049
 tea-0.048
Token dinner
Token ID8073
Feature activation+2.54e+00

Pos logit contributions
 parties+0.061
 outside+0.048
 recess+0.047
text+0.047
 tea+0.047
Neg logit contributions
 phrase-0.088
cut-0.082
 ringing-0.079
rum-0.079
fly-0.078
Token they
Token ID484
Feature activation+8.88e+00

Pos logit contributions
 parties+0.046
text+0.039
 recess+0.035
aws+0.032
ashed+0.031
Neg logit contributions
 phrase-0.112
cut-0.105
 ringing-0.102
 word-0.100
 guardian-0.099
Token ever
Token ID1683
Feature activation+3.15e+00

Pos logit contributions
 parties+0.250
text+0.225
 recess+0.210
aws+0.208
ashes+0.200
Neg logit contributions
 phrase-0.291
cut-0.274
 ringing-0.265
fly-0.261
 guardian-0.257
Token had
Token ID550
Feature activation+5.64e+00

Pos logit contributions
 parties+0.085
text+0.072
 recess+0.067
 outside+0.066
aws+0.064
Neg logit contributions
 phrase-0.111
 ringing-0.104
fly-0.100
cut-0.100
 claw-0.098
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
text+0.120
 parties+0.120
 Woo+0.109
 recess+0.104
 Holly+0.102
Neg logit contributions
 phrase-0.203
 ringing-0.186
rum-0.180
 meaning-0.177
cut-0.177
Token<|endoftext|>
Token ID50256
Feature activation+1.68e+00

Pos logit contributions
 parties+0.020
text+0.015
 recess+0.015
 outside+0.014
 tea+0.013
Neg logit contributions
 phrase-0.046
cut-0.043
 ringing-0.041
rum-0.041
 guardian-0.041
TokenOnce
Token ID7454
Feature activation+1.26e+00

Pos logit contributions
 parties+0.052
ashing+0.041
ashed+0.041
ashes+0.041
 recess+0.041
Neg logit contributions
 phrase-0.052
 word-0.046
fly-0.046
cut-0.045
pe-0.045
Token was
Token ID373
Feature activation+1.22e+00

Pos logit contributions
 phrase+0.004
cut+0.004
 ringing+0.004
 guardian+0.004
rum+0.004
Neg logit contributions
 parties-0.003
text-0.003
 recess-0.003
 outside-0.002
 tea-0.002
Token the
Token ID262
Feature activation+2.20e+00

Pos logit contributions
 parties+0.026
text+0.020
 recess+0.020
 outside+0.018
ashed+0.017
Neg logit contributions
 phrase-0.052
cut-0.048
 ringing-0.048
 guardian-0.046
rum-0.046
Token best
Token ID1266
Feature activation+4.70e+00

Pos logit contributions
 parties+0.038
 outside+0.026
 recess+0.024
 tea+0.024
 party+0.024
Neg logit contributions
 phrase-0.102
rum-0.098
cut-0.097
fly-0.093
ller-0.093
Token mel
Token ID7758
Feature activation-1.34e+00

Pos logit contributions
 parties+0.142
 outside+0.121
 tea+0.115
 recess+0.114
 party+0.113
Neg logit contributions
 phrase-0.157
cut-0.149
fly-0.142
rum-0.140
 swollen-0.136
Tokenon
Token ID261
Feature activation+1.85e-01

Pos logit contributions
 phrase+0.041
cut+0.037
 ringing+0.035
rum+0.035
 guardian+0.035
Neg logit contributions
 parties-0.044
text-0.038
 outside-0.037
 recess-0.037
 together-0.036
Token they
Token ID484
Feature activation+8.85e+00

Pos logit contributions
 parties+0.005
 recess+0.004
text+0.004
aws+0.004
 outside+0.004
Neg logit contributions
 phrase-0.007
cut-0.006
fly-0.006
 ringing-0.006
 guardian-0.006
Tokenâ
Token ID22940
Feature activation+2.49e+00

Pos logit contributions
 parties+0.267
text+0.244
 recess+0.232
aws+0.223
 tea+0.222
Neg logit contributions
 phrase-0.266
cut-0.250
 ringing-0.237
fly-0.235
 guardian-0.231
Token
Token ID26391
Feature activation+3.61e+00

Pos logit contributions
 parties+0.078
text+0.064
 tea+0.063
 recess+0.062
 outside+0.062
Neg logit contributions
 phrase-0.082
cut-0.072
 ringing-0.070
rum-0.069
 guardian-0.068
Token
Token ID8151
Feature activation+4.07e+00

Pos logit contributions
 parties+0.108
 recess+0.091
 tea+0.090
 Holly+0.090
 outside+0.089
Neg logit contributions
ller-0.106
 phrase-0.106
rum-0.099
 ringing-0.099
 drill-0.096
Tokend
Token ID67
Feature activation+1.04e+00

Pos logit contributions
 parties+0.133
 recess+0.111
 tea+0.105
 picnic+0.103
 outside+0.100
Neg logit contributions
rum-0.118
 phrase-0.116
cut-0.114
pe-0.106
 ringing-0.104
Token ever
Token ID1683
Feature activation+3.04e+00

Pos logit contributions
 parties+0.026
text+0.021
 recess+0.021
 outside+0.020
 tea+0.019
Neg logit contributions
 phrase-0.040
cut-0.037
 ringing-0.036
rum-0.035
 guardian-0.035
Token of
Token ID286
Feature activation-4.94e-01

Pos logit contributions
 phrase+0.013
cut+0.012
 ringing+0.012
 guardian+0.012
fly+0.012
Neg logit contributions
 parties-0.006
text-0.005
 recess-0.005
aws-0.004
 outside-0.004
Token her
Token ID607
Feature activation+1.82e+00

Pos logit contributions
 phrase+0.024
cut+0.023
rum+0.023
fly+0.022
ller+0.022
Neg logit contributions
 parties-0.008
 recess-0.005
 outside-0.005
 tea-0.005
 together-0.004
Token song
Token ID3496
Feature activation+4.91e-01

Pos logit contributions
 parties+0.053
text+0.044
 recess+0.044
 outside+0.042
 tea+0.041
Neg logit contributions
 phrase-0.063
cut-0.056
 ringing-0.056
 guardian-0.054
 claw-0.054
Token with
Token ID351
Feature activation+1.30e+00

Pos logit contributions
 parties+0.009
text+0.007
 recess+0.006
 outside+0.005
 tea+0.005
Neg logit contributions
 phrase-0.022
cut-0.021
 ringing-0.021
 guardian-0.020
fly-0.020
Token everyone
Token ID2506
Feature activation+4.18e+00

Pos logit contributions
 parties+0.024
text+0.017
 recess+0.017
 outside+0.016
 tea+0.015
Neg logit contributions
 phrase-0.059
cut-0.056
 ringing-0.054
rum-0.054
 drill-0.053
Token they
Token ID484
Feature activation+8.83e+00

Pos logit contributions
 parties+0.080
text+0.063
 recess+0.063
aws+0.054
ashed+0.052
Neg logit contributions
 phrase-0.184
 ringing-0.172
cut-0.170
 guardian-0.167
fly-0.166
Token met
Token ID1138
Feature activation+3.48e+00

Pos logit contributions
 parties+0.222
text+0.184
 recess+0.173
 tea+0.168
 outside+0.166
Neg logit contributions
 phrase-0.332
 ringing-0.308
cut-0.307
 guardian-0.296
fly-0.294
Token:
Token ID25
Feature activation-1.23e-02

Pos logit contributions
 parties+0.063
text+0.053
 recess+0.049
ashed+0.043
ashes+0.042
Neg logit contributions
 phrase-0.155
 ringing-0.143
cut-0.142
 guardian-0.139
fly-0.137
Token '
Token ID705
Feature activation+9.92e-01

Pos logit contributions
 phrase+0.000
cut+0.000
rum+0.000
 claw+0.000
 drill+0.000
Neg logit contributions
 parties-0.000
 outside-0.000
 together-0.000
text-0.000
 recess-0.000
TokenBe
Token ID3856
Feature activation-4.16e+00

Pos logit contributions
 parties+0.026
text+0.021
 recess+0.020
 outside+0.020
 tea+0.019
Neg logit contributions
 phrase-0.037
cut-0.034
 ringing-0.033
 guardian-0.033
fly-0.032
Token patient
Token ID5827
Feature activation-1.81e+00

Pos logit contributions
 phrase+0.137
cut+0.126
 ringing+0.121
rum+0.118
 guardian+0.118
Neg logit contributions
 parties-0.127
text-0.106
 recess-0.105
 outside-0.104
 tea-0.099
Token house
Token ID2156
Feature activation-2.87e+00

Pos logit contributions
 parties+0.029
text+0.023
 recess+0.023
 outside+0.022
 tea+0.020
Neg logit contributions
 phrase-0.045
cut-0.042
 ringing-0.041
 guardian-0.040
 drill-0.039
Token was
Token ID373
Feature activation+4.53e-01

Pos logit contributions
 phrase+0.107
cut+0.099
 ringing+0.095
rum+0.095
 guardian+0.094
Neg logit contributions
 parties-0.076
text-0.060
 outside-0.060
 recess-0.059
 together-0.056
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 parties+0.010
 outside+0.008
 together+0.007
 tea+0.007
 recess+0.007
Neg logit contributions
 phrase-0.019
cut-0.018
rum-0.018
 claw-0.017
fly-0.017
Token best
Token ID1266
Feature activation+4.25e+00

Pos logit contributions
 parties+0.037
text+0.028
 recess+0.028
 outside+0.027
 tea+0.025
Neg logit contributions
 phrase-0.074
cut-0.069
 ringing-0.067
 guardian-0.066
rum-0.066
Token decision
Token ID2551
Feature activation-3.88e-01

Pos logit contributions
 parties+0.120
text+0.103
 recess+0.102
aws+0.096
P+0.095
Neg logit contributions
 phrase-0.147
 ringing-0.132
cut-0.130
 guardian-0.128
 word-0.127
Token they
Token ID484
Feature activation+8.83e+00

Pos logit contributions
 phrase+0.015
cut+0.014
 ringing+0.014
 guardian+0.014
fly+0.014
Neg logit contributions
 parties-0.009
 recess-0.007
text-0.007
aws-0.006
 tea-0.006
Token had
Token ID550
Feature activation+5.25e+00

Pos logit contributions
 parties+0.263
text+0.234
 recess+0.230
aws+0.220
 tea+0.219
Neg logit contributions
 phrase-0.273
cut-0.258
 ringing-0.246
 guardian-0.244
fly-0.238
Token ever
Token ID1683
Feature activation+3.59e+00

Pos logit contributions
 parties+0.129
text+0.117
 recess+0.110
 Woo+0.106
 tea+0.106
Neg logit contributions
 phrase-0.183
 ringing-0.166
 meaning-0.161
 guardian-0.159
 word-0.159
Token made
Token ID925
Feature activation+2.97e+00

Pos logit contributions
 parties+0.091
text+0.075
 recess+0.073
 outside+0.071
 tea+0.067
Neg logit contributions
 phrase-0.134
 ringing-0.124
cut-0.122
 claw-0.119
 guardian-0.118
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
 parties+0.060
text+0.052
 recess+0.051
ashed+0.048
ashes+0.042
Neg logit contributions
 phrase-0.124
 ringing-0.117
cut-0.112
 word-0.111
rum-0.110
Token<|endoftext|>
Token ID50256
Feature activation+1.20e+00

Pos logit contributions
 parties+0.009
 recess+0.006
 outside+0.006
text+0.006
 tea+0.006
Neg logit contributions
 phrase-0.018
cut-0.016
 ringing-0.016
rum-0.016
 guardian-0.016
Token all
Token ID477
Feature activation+1.56e+00

Pos logit contributions
 phrase+0.106
cut+0.099
 ringing+0.096
 guardian+0.095
rum+0.095
Neg logit contributions
 parties-0.055
 outside-0.041
 recess-0.040
text-0.040
 together-0.039
Token the
Token ID262
Feature activation+2.21e+00

Pos logit contributions
 parties+0.027
 outside+0.018
text+0.018
 recess+0.018
 tea+0.017
Neg logit contributions
 phrase-0.073
cut-0.069
 ringing-0.067
 guardian-0.066
 claw-0.066
Token other
Token ID584
Feature activation-6.60e-01

Pos logit contributions
 parties+0.051
 outside+0.039
 recess+0.038
text+0.037
 tea+0.037
Neg logit contributions
 phrase-0.090
cut-0.085
rum-0.082
 ringing-0.082
 guardian-0.081
Token fun
Token ID1257
Feature activation+4.10e+00

Pos logit contributions
 phrase+0.018
 word+0.016
 ringing+0.016
cut+0.015
fly+0.015
Neg logit contributions
 parties-0.022
text-0.021
 recess-0.020
 Woo-0.020
nes-0.020
Token things
Token ID1243
Feature activation+1.81e+00

Pos logit contributions
 parties+0.119
text+0.110
 recess+0.103
 Woo+0.101
he+0.099
Neg logit contributions
 phrase-0.129
 ringing-0.116
cut-0.116
 word-0.116
fly-0.114
Token they
Token ID484
Feature activation+8.83e+00

Pos logit contributions
 parties+0.033
text+0.032
 recess+0.028
P+0.025
ashed+0.023
Neg logit contributions
 phrase-0.075
cut-0.073
 ringing-0.072
fly-0.070
 word-0.069
Token could
Token ID714
Feature activation+4.61e-01

Pos logit contributions
 parties+0.230
text+0.226
 recess+0.188
 tea+0.186
aws+0.182
Neg logit contributions
 phrase-0.296
 ringing-0.280
cut-0.277
fly-0.274
 guardian-0.265
Token do
Token ID466
Feature activation+1.16e+00

Pos logit contributions
text+0.010
 parties+0.010
 recess+0.008
ashed+0.008
 Woo+0.008
Neg logit contributions
 phrase-0.018
 ringing-0.017
cut-0.017
fly-0.016
 claw-0.016
Token in
Token ID287
Feature activation-1.18e-02

Pos logit contributions
text+0.029
 parties+0.024
 Woo+0.024
ens+0.024
 recess+0.023
Neg logit contributions
 phrase-0.042
pe-0.039
cut-0.038
 word-0.037
 ringing-0.037
Token the
Token ID262
Feature activation+3.06e+00

Pos logit contributions
 phrase+0.001
cut+0.001
 ringing+0.001
rum+0.001
 guardian+0.001
Neg logit contributions
 parties-0.000
 recess-0.000
 outside-0.000
text-0.000
 tea-0.000
Token forest
Token ID8222
Feature activation+2.48e-01

Pos logit contributions
 parties+0.088
text+0.077
 recess+0.071
 outside+0.068
ashed+0.067
Neg logit contributions
 phrase-0.106
cut-0.096
 ringing-0.094
 guardian-0.093
 word-0.091
Token collecting
Token ID13157
Feature activation+2.20e+00

Pos logit contributions
 phrase+0.007
cut+0.007
 guardian+0.006
rum+0.006
fly+0.006
Neg logit contributions
 parties-0.005
 outside-0.004
 together-0.004
 recess-0.004
 tea-0.004
Token all
Token ID477
Feature activation+1.60e+00

Pos logit contributions
 parties+0.060
text+0.053
 recess+0.049
|+0.047
ashed+0.045
Neg logit contributions
 phrase-0.078
cut-0.071
 ringing-0.069
 claw-0.068
 drill-0.068
Token kinds
Token ID6982
Feature activation+3.09e-02

Pos logit contributions
 parties+0.048
text+0.040
 recess+0.039
 outside+0.038
 tea+0.037
Neg logit contributions
 phrase-0.054
cut-0.050
 ringing-0.048
rum-0.047
 guardian-0.047
Token of
Token ID286
Feature activation+1.21e+00

Pos logit contributions
 parties+0.001
text+0.001
 recess+0.001
 outside+0.000
 tea+0.000
Neg logit contributions
 phrase-0.001
cut-0.001
 ringing-0.001
rum-0.001
 claw-0.001
Token things
Token ID1243
Feature activation+2.58e+00

Pos logit contributions
 parties+0.032
 outside+0.026
 recess+0.025
 tea+0.024
 together+0.024
Neg logit contributions
 phrase-0.046
cut-0.043
fly-0.041
 guardian-0.041
 ringing-0.041
Token they
Token ID484
Feature activation+8.80e+00

Pos logit contributions
 parties+0.042
text+0.038
 recess+0.032
|+0.028
ashed+0.026
Neg logit contributions
 phrase-0.117
cut-0.111
 ringing-0.110
fly-0.106
 word-0.105
Token could
Token ID714
Feature activation+2.30e-01

Pos logit contributions
 parties+0.233
text+0.213
 recess+0.191
 tea+0.182
aws+0.179
Neg logit contributions
 phrase-0.313
cut-0.292
 ringing-0.283
 claw-0.278
 drill-0.274
Token use
Token ID779
Feature activation+2.71e+00

Pos logit contributions
 parties+0.006
text+0.005
 recess+0.005
 tea+0.004
ashed+0.004
Neg logit contributions
 phrase-0.009
cut-0.008
 ringing-0.008
 claw-0.008
 drill-0.008
Token to
Token ID284
Feature activation-9.58e-01

Pos logit contributions
 parties+0.048
text+0.042
 recess+0.036
ashed+0.031
|+0.031
Neg logit contributions
 phrase-0.119
cut-0.113
 ringing-0.110
 drill-0.107
fly-0.106
Token build
Token ID1382
Feature activation+2.01e+00

Pos logit contributions
 phrase+0.037
cut+0.035
 ringing+0.033
rum+0.033
 guardian+0.033
Neg logit contributions
 parties-0.024
 recess-0.019
 outside-0.019
text-0.018
 together-0.018
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
 parties+0.040
text+0.038
ashed+0.034
 recess+0.032
|+0.031
Neg logit contributions
 phrase-0.082
 ringing-0.076
 drill-0.074
 word-0.074
cut-0.074
Token good
Token ID922
Feature activation+1.93e+00

Pos logit contributions
 parties+0.022
text+0.020
|+0.018
he+0.017
 recess+0.017
Neg logit contributions
 phrase-0.027
 guardian-0.025
 ringing-0.025
cut-0.024
 claw-0.024
Token friends
Token ID2460
Feature activation+5.80e+00

Pos logit contributions
 parties+0.046
text+0.044
|+0.043
ackers+0.037
 recess+0.037
Neg logit contributions
 phrase-0.067
 ringing-0.066
 guardian-0.062
cut-0.061
 claw-0.061
Token and
Token ID290
Feature activation+1.84e+00

Pos logit contributions
 parties+0.144
 recess+0.132
text+0.129
ashed+0.119
ashes+0.117
Neg logit contributions
 phrase-0.207
cut-0.193
 guardian-0.186
 ringing-0.183
fly-0.182
Token share
Token ID2648
Feature activation+2.37e+00

Pos logit contributions
 parties+0.039
 recess+0.028
 together+0.027
 outside+0.027
 tea+0.026
Neg logit contributions
 phrase-0.081
cut-0.076
rum-0.075
 guardian-0.073
 ringing-0.073
Token everything
Token ID2279
Feature activation+1.80e+00

Pos logit contributions
 parties+0.042
text+0.037
 recess+0.032
ashed+0.030
 outside+0.028
Neg logit contributions
 phrase-0.104
cut-0.097
 ringing-0.097
 word-0.095
rum-0.093
Token they
Token ID484
Feature activation+8.77e+00

Pos logit contributions
 parties+0.040
text+0.036
 recess+0.035
ashed+0.032
aws+0.030
Neg logit contributions
 phrase-0.065
cut-0.064
 ringing-0.061
 guardian-0.061
 word-0.059
Token found
Token ID1043
Feature activation+2.50e+00

Pos logit contributions
text+0.254
 parties+0.240
 recess+0.209
 tea+0.207
aws+0.203
Neg logit contributions
 phrase-0.260
 ringing-0.251
cut-0.243
 meaning-0.238
fly-0.236
Token in
Token ID287
Feature activation-7.45e-01

Pos logit contributions
 parties+0.051
text+0.047
 recess+0.044
|+0.038
 bath+0.038
Neg logit contributions
 phrase-0.099
cut-0.098
 ringing-0.093
rum-0.091
 word-0.090
Token the
Token ID262
Feature activation+2.84e+00

Pos logit contributions
 phrase+0.033
cut+0.031
 ringing+0.030
 guardian+0.030
rum+0.029
Neg logit contributions
 parties-0.014
text-0.011
 outside-0.010
 recess-0.010
 tea-0.009
Token park
Token ID3952
Feature activation+1.48e+00

Pos logit contributions
 parties+0.063
 outside+0.049
 recess+0.048
text+0.047
 tea+0.046
Neg logit contributions
 phrase-0.119
cut-0.111
rum-0.109
 ringing-0.107
fly-0.105
Token from
Token ID422
Feature activation-1.15e+00

Pos logit contributions
 parties+0.025
text+0.022
 recess+0.019
P+0.016
 outside+0.016
Neg logit contributions
 phrase-0.066
cut-0.063
 ringing-0.062
fly-0.060
 guardian-0.060
Token of
Token ID286
Feature activation-9.90e-01

Pos logit contributions
 phrase+0.041
cut+0.038
 ringing+0.037
rum+0.036
 guardian+0.036
Neg logit contributions
 parties-0.023
text-0.018
 recess-0.018
 outside-0.017
 tea-0.016
Token all
Token ID477
Feature activation+1.58e+00

Pos logit contributions
 phrase+0.051
cut+0.048
 claw+0.047
 ringing+0.047
 drill+0.046
Neg logit contributions
 parties-0.013
 outside-0.007
 recess-0.007
 together-0.007
text-0.007
Token the
Token ID262
Feature activation+2.41e+00

Pos logit contributions
 parties+0.030
 outside+0.020
text+0.020
 together+0.020
 tea+0.019
Neg logit contributions
 phrase-0.072
cut-0.068
 claw-0.065
 ringing-0.065
 drill-0.065
Token wonderful
Token ID7932
Feature activation+2.21e+00

Pos logit contributions
 parties+0.065
text+0.053
 recess+0.052
 outside+0.052
 tea+0.049
Neg logit contributions
 phrase-0.089
cut-0.082
 ringing-0.079
rum-0.077
 guardian-0.077
Token things
Token ID1243
Feature activation+1.75e+00

Pos logit contributions
 parties+0.065
text+0.056
 recess+0.054
 outside+0.053
aws+0.051
Neg logit contributions
 phrase-0.076
cut-0.067
 ringing-0.066
 guardian-0.064
 word-0.064
Token they
Token ID484
Feature activation+8.75e+00

Pos logit contributions
 parties+0.032
text+0.025
 recess+0.025
 outside+0.021
 tea+0.021
Neg logit contributions
 phrase-0.078
cut-0.073
 ringing-0.072
 guardian-0.070
fly-0.069
Token had
Token ID550
Feature activation+5.30e+00

Pos logit contributions
 parties+0.229
text+0.208
 recess+0.189
 tea+0.184
aws+0.181
Neg logit contributions
 phrase-0.309
cut-0.287
 ringing-0.282
 guardian-0.277
fly-0.272
Token done
Token ID1760
Feature activation+2.94e+00

Pos logit contributions
text+0.125
 parties+0.123
 recess+0.107
 tea+0.106
 Woo+0.104
Neg logit contributions
 phrase-0.185
 ringing-0.170
 meaning-0.164
cut-0.162
 guardian-0.162
Token.
Token ID13
Feature activation+7.01e-01

Pos logit contributions
 parties+0.054
text+0.048
 recess+0.045
ashed+0.042
 picnic+0.040
Neg logit contributions
 phrase-0.127
 ringing-0.115
cut-0.114
rum-0.113
 word-0.112
Token It
Token ID632
Feature activation-4.53e+00

Pos logit contributions
 parties+0.013
 recess+0.010
text+0.010
 outside+0.009
 tea+0.009
Neg logit contributions
 phrase-0.031
cut-0.029
 ringing-0.028
rum-0.028
 guardian-0.028
Token was
Token ID373
Feature activation-1.59e+00

Pos logit contributions
 phrase+0.181
rum+0.173
cut+0.172
 drill+0.169
 claw+0.166
Neg logit contributions
 parties-0.101
 together-0.082
text-0.080
 outside-0.079
 they-0.074
Token about
Token ID546
Feature activation-2.22e+00

Pos logit contributions
 parties+0.036
text+0.032
 recess+0.030
P+0.027
|+0.027
Neg logit contributions
 phrase-0.061
 ringing-0.056
cut-0.054
 word-0.053
 guardian-0.051
Token all
Token ID477
Feature activation+1.63e+00

Pos logit contributions
 phrase+0.096
cut+0.090
 ringing+0.087
rum+0.086
 guardian+0.086
Neg logit contributions
 parties-0.046
text-0.034
 recess-0.033
 outside-0.033
 tea-0.031
Token the
Token ID262
Feature activation+2.52e+00

Pos logit contributions
 parties+0.031
text+0.022
 recess+0.022
 outside+0.021
 tea+0.019
Neg logit contributions
 phrase-0.074
cut-0.069
 ringing-0.067
 guardian-0.066
rum-0.066
Token delicious
Token ID12625
Feature activation-7.05e-02

Pos logit contributions
 parties+0.065
text+0.052
 recess+0.051
 outside+0.051
 tea+0.048
Neg logit contributions
 phrase-0.096
cut-0.089
 ringing-0.086
rum-0.084
 guardian-0.084
Token things
Token ID1243
Feature activation+1.78e+00

Pos logit contributions
 phrase+0.002
cut+0.002
 ringing+0.002
rum+0.002
 claw+0.002
Neg logit contributions
 parties-0.002
text-0.002
 recess-0.002
 outside-0.002
 together-0.002
Token they
Token ID484
Feature activation+8.75e+00

Pos logit contributions
 parties+0.030
text+0.026
 recess+0.024
|+0.019
P+0.019
Neg logit contributions
 phrase-0.079
 ringing-0.075
cut-0.075
 guardian-0.072
 word-0.071
Token would
Token ID561
Feature activation+2.22e+00

Pos logit contributions
 parties+0.219
text+0.188
 recess+0.174
 outside+0.168
 tea+0.166
Neg logit contributions
 phrase-0.329
cut-0.309
 ringing-0.301
 guardian-0.291
 claw-0.290
Token make
Token ID787
Feature activation+1.67e+00

Pos logit contributions
 parties+0.049
 recess+0.035
 outside+0.035
text+0.033
 party+0.033
Neg logit contributions
 phrase-0.094
cut-0.088
rum-0.085
 ringing-0.084
 drill-0.084
Token with
Token ID351
Feature activation+9.02e-01

Pos logit contributions
 parties+0.032
text+0.030
 recess+0.030
ashed+0.028
|+0.026
Neg logit contributions
 phrase-0.069
 ringing-0.065
cut-0.064
 word-0.061
rum-0.061
Token the
Token ID262
Feature activation+1.84e+00

Pos logit contributions
 parties+0.016
text+0.012
 recess+0.011
 outside+0.011
 tea+0.010
Neg logit contributions
 phrase-0.041
cut-0.039
 ringing-0.038
 guardian-0.037
rum-0.037
Token sa
Token ID473
Feature activation-1.13e+00

Pos logit contributions
 parties+0.052
text+0.043
 recess+0.042
 outside+0.042
 tea+0.040
Neg logit contributions
 phrase-0.065
cut-0.060
 ringing-0.058
rum-0.057
 guardian-0.057
Token make
Token ID787
Feature activation+5.11e-01

Pos logit contributions
 phrase+0.097
cut+0.089
rum+0.086
 ringing+0.085
 drill+0.085
Neg logit contributions
 parties-0.065
 outside-0.050
 recess-0.049
text-0.049
 tea-0.047
Token new
Token ID649
Feature activation+1.27e-01

Pos logit contributions
 parties+0.013
text+0.012
ashed+0.012
 recess+0.012
ashing+0.011
Neg logit contributions
 phrase-0.018
 ringing-0.017
cut-0.016
ller-0.016
 word-0.016
Token memories
Token ID9846
Feature activation+4.14e+00

Pos logit contributions
 parties+0.004
text+0.003
 recess+0.003
ashed+0.003
 outside+0.003
Neg logit contributions
 phrase-0.004
 ringing-0.004
cut-0.004
 guardian-0.004
 claw-0.004
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
 parties+0.074
text+0.066
 recess+0.063
aws+0.056
ashed+0.055
Neg logit contributions
 phrase-0.178
cut-0.169
 ringing-0.168
fly-0.160
 word-0.160
Token best
Token ID1266
Feature activation+4.04e+00

Pos logit contributions
 parties+0.047
 outside+0.037
 recess+0.037
text+0.036
 tea+0.036
Neg logit contributions
 phrase-0.068
cut-0.063
rum-0.062
 ringing-0.060
fly-0.060
Token we
Token ID356
Feature activation+8.75e+00

Pos logit contributions
 parties+0.109
text+0.091
 recess+0.089
 outside+0.084
aws+0.083
Neg logit contributions
 phrase-0.148
cut-0.134
 ringing-0.133
 guardian-0.130
 claw-0.127
Token can
Token ID460
Feature activation-2.23e-02

Pos logit contributions
 parties+0.214
text+0.182
 recess+0.170
 tea+0.160
aws+0.160
Neg logit contributions
 phrase-0.332
cut-0.311
 ringing-0.301
fly-0.296
 guardian-0.295
Token,"
Token ID553
Feature activation+1.53e+00

Pos logit contributions
 phrase+0.001
cut+0.001
 ringing+0.001
 guardian+0.001
fly+0.001
Neg logit contributions
 parties-0.001
text-0.000
 recess-0.000
 outside-0.000
 tea-0.000
Token she
Token ID673
Feature activation+1.00e+00

Pos logit contributions
 parties+0.031
text+0.024
 recess+0.023
 outside+0.021
 tea+0.020
Neg logit contributions
 phrase-0.066
cut-0.062
 ringing-0.062
 guardian-0.059
 claw-0.059
Token said
Token ID531
Feature activation+6.01e-01

Pos logit contributions
 parties+0.018
 together+0.016
 outside+0.014
 they+0.013
text+0.013
Neg logit contributions
ric-0.043
 phrase-0.043
rum-0.041
cut-0.041
 beast-0.040
Token.
Token ID13
Feature activation-5.68e-01

Pos logit contributions
 parties+0.010
text+0.007
 recess+0.007
 outside+0.006
 tea+0.006
Neg logit contributions
 phrase-0.029
cut-0.027
 ringing-0.027
rum-0.026
 guardian-0.026
Token to
Token ID284
Feature activation-1.58e+00

Pos logit contributions
 parties+0.010
text+0.008
 recess+0.008
 outside+0.007
 tea+0.007
Neg logit contributions
 phrase-0.018
cut-0.016
 ringing-0.016
 guardian-0.016
rum-0.016
Token remember
Token ID3505
Feature activation+1.50e+00

Pos logit contributions
 phrase+0.064
cut+0.060
rum+0.058
 ringing+0.058
 drill+0.057
Neg logit contributions
 parties-0.038
 recess-0.029
 outside-0.028
text-0.028
 tea-0.026
Token all
Token ID477
Feature activation+1.94e+00

Pos logit contributions
 parties+0.032
 outside+0.024
 together+0.023
 tea+0.022
 recess+0.021
Neg logit contributions
 phrase-0.066
cut-0.063
 claw-0.060
rum-0.060
 drill-0.060
Token the
Token ID262
Feature activation+2.91e+00

Pos logit contributions
 parties+0.039
 outside+0.028
 recess+0.027
text+0.027
 together+0.026
Neg logit contributions
 phrase-0.086
cut-0.080
 claw-0.078
 drill-0.077
 ringing-0.077
Token things
Token ID1243
Feature activation+1.81e+00

Pos logit contributions
 parties+0.070
text+0.055
 recess+0.054
 outside+0.053
 tea+0.050
Neg logit contributions
 phrase-0.115
cut-0.108
 ringing-0.104
rum-0.102
 guardian-0.102
Token we
Token ID356
Feature activation+8.73e+00

Pos logit contributions
 parties+0.032
text+0.027
 recess+0.026
ashed+0.022
aws+0.022
Neg logit contributions
 phrase-0.079
cut-0.074
 ringing-0.074
 guardian-0.072
fly-0.071
Token saw
Token ID2497
Feature activation+7.81e-01

Pos logit contributions
 parties+0.198
text+0.157
 recess+0.152
 outside+0.149
 tea+0.141
Neg logit contributions
 phrase-0.355
cut-0.331
 ringing-0.321
 guardian-0.315
rum-0.314
Token.
Token ID13
Feature activation+9.58e-01

Pos logit contributions
 parties+0.016
 recess+0.014
text+0.013
ashed+0.012
 tea+0.012
Neg logit contributions
 phrase-0.031
cut-0.029
 ringing-0.029
 guardian-0.028
fly-0.028
Token I
Token ID314
Feature activation+3.30e+00

Pos logit contributions
 parties+0.022
text+0.019
 together+0.018
 outside+0.017
 Lily+0.017
Neg logit contributions
 phrase-0.038
cut-0.037
 ringing-0.035
 claw-0.035
 swollen-0.034
Token think
Token ID892
Feature activation+1.15e-01

Pos logit contributions
 parties+0.077
 outside+0.059
 together+0.058
text+0.057
 play+0.057
Neg logit contributions
 phrase-0.130
cut-0.124
 ringing-0.122
rum-0.120
ller-0.120
Token you
Token ID345
Feature activation+2.91e+00

Pos logit contributions
 parties+0.002
text+0.002
 recess+0.002
 outside+0.002
 tea+0.002
Neg logit contributions
 phrase-0.005
cut-0.005
 ringing-0.005
 guardian-0.004
rum-0.004
Token it
Token ID340
Feature activation-2.32e+00

Pos logit contributions
 phrase+0.033
cut+0.031
rum+0.030
 ringing+0.029
 guardian+0.029
Neg logit contributions
 parties-0.013
 outside-0.009
 recess-0.009
text-0.008
 together-0.008
Token would
Token ID561
Feature activation-5.92e-01

Pos logit contributions
 phrase+0.090
rum+0.084
cut+0.084
 ringing+0.080
 drill+0.080
Neg logit contributions
 parties-0.055
 outside-0.045
 together-0.044
text-0.044
 around-0.040
Token take
Token ID1011
Feature activation+3.41e+00

Pos logit contributions
 phrase+0.024
rum+0.022
cut+0.022
ller+0.021
omm+0.021
Neg logit contributions
 parties-0.014
 outside-0.011
 play-0.011
 recess-0.011
 together-0.010
Token them
Token ID606
Feature activation+3.97e+00

Pos logit contributions
 parties+0.070
 recess+0.054
text+0.051
ashing+0.050
ashed+0.049
Neg logit contributions
 phrase-0.143
 ringing-0.129
cut-0.126
 word-0.126
 guardian-0.124
Token anywhere
Token ID6609
Feature activation+3.72e+00

Pos logit contributions
 parties+0.089
text+0.081
 recess+0.078
ashed+0.074
aws+0.070
Neg logit contributions
 phrase-0.146
 ringing-0.143
 guardian-0.141
cut-0.136
 claw-0.131
Token they
Token ID484
Feature activation+8.72e+00

Pos logit contributions
 parties+0.069
text+0.058
 recess+0.054
ashed+0.052
|+0.050
Neg logit contributions
 phrase-0.153
 ringing-0.148
cut-0.147
 guardian-0.143
 word-0.139
Token wanted
Token ID2227
Feature activation+2.80e+00

Pos logit contributions
 parties+0.241
text+0.207
 recess+0.199
 tea+0.195
 outside+0.187
Neg logit contributions
 phrase-0.303
cut-0.284
 ringing-0.274
 guardian-0.269
 claw-0.267
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
 parties+0.062
text+0.054
 recess+0.052
ashed+0.049
P+0.048
Neg logit contributions
 phrase-0.110
 ringing-0.100
 word-0.098
cut-0.098
rum-0.097
Token go
Token ID467
Feature activation+5.33e+00

Pos logit contributions
 phrase+0.046
cut+0.043
 ringing+0.042
 guardian+0.041
rum+0.041
Neg logit contributions
 parties-0.026
text-0.021
 recess-0.020
 outside-0.020
 tea-0.019
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
text+0.125
 parties+0.123
ashed+0.105
ackers+0.099
 recess+0.097
Neg logit contributions
 ringing-0.200
 phrase-0.191
 guardian-0.187
cut-0.181
pe-0.179
Token
Token ID220
Feature activation-1.60e+00

Pos logit contributions
 parties+0.013
 recess+0.010
text+0.010
 outside+0.010
 tea+0.009
Neg logit contributions
 phrase-0.021
cut-0.019
rum-0.018
 drill-0.018
 ringing-0.018
Token the
Token ID262
Feature activation+2.04e+00

Pos logit contributions
 parties+0.016
 outside+0.012
 recess+0.012
text+0.012
 together+0.011
Neg logit contributions
 phrase-0.030
cut-0.028
rum-0.027
 ringing-0.027
 claw-0.027
Token y
Token ID331
Feature activation-1.29e+00

Pos logit contributions
 parties+0.032
 outside+0.021
 recess+0.020
 tea+0.019
text+0.019
Neg logit contributions
 phrase-0.097
rum-0.092
cut-0.092
 ringing-0.090
ller-0.089
Tokenumm
Token ID13929
Feature activation-2.35e+00

Pos logit contributions
 phrase+0.030
cut+0.025
 ringing+0.025
 claw+0.024
rum+0.024
Neg logit contributions
 parties-0.051
text-0.045
 recess-0.044
 outside-0.043
 together-0.042
Tokeniest
Token ID6386
Feature activation+2.21e+00

Pos logit contributions
 phrase+0.074
cut+0.067
 ringing+0.065
 guardian+0.063
rum+0.063
Neg logit contributions
 parties-0.076
text-0.064
 recess-0.063
 outside-0.063
 tea-0.060
Token thing
Token ID1517
Feature activation-1.34e+00

Pos logit contributions
 parties+0.055
 outside+0.043
 tea+0.042
 recess+0.042
 together+0.041
Neg logit contributions
 phrase-0.087
cut-0.081
rum-0.078
 ringing-0.078
 drill-0.076
Token they
Token ID484
Feature activation+8.72e+00

Pos logit contributions
 phrase+0.061
cut+0.058
 ringing+0.056
fly+0.056
 guardian+0.055
Neg logit contributions
 parties-0.024
text-0.018
 recess-0.017
aws-0.014
 tea-0.014
Token ever
Token ID1683
Feature activation+3.14e+00

Pos logit contributions
 parties+0.241
text+0.216
 recess+0.196
 tea+0.195
aws+0.191
Neg logit contributions
 phrase-0.293
cut-0.277
fly-0.266
 ringing-0.264
 guardian-0.262
Token tasted
Token ID29187
Feature activation+6.64e-01

Pos logit contributions
 parties+0.080
text+0.065
 outside+0.062
 recess+0.060
 tea+0.060
Neg logit contributions
 phrase-0.117
 ringing-0.108
cut-0.106
 claw-0.106
fly-0.105
Token.
Token ID13
Feature activation+9.77e-01

Pos logit contributions
 parties+0.014
text+0.014
 recess+0.014
|+0.013
ashed+0.011
Neg logit contributions
 phrase-0.025
 word-0.023
cut-0.022
 ringing-0.022
 meaning-0.022
Token<|endoftext|>
Token ID50256
Feature activation+1.68e+00

Pos logit contributions
 parties+0.023
 recess+0.017
 outside+0.017
 party+0.016
 picnic+0.016
Neg logit contributions
 phrase-0.038
cut-0.035
rum-0.035
ller-0.035
fly-0.034
TokenOnce
Token ID7454
Feature activation+1.17e+00

Pos logit contributions
 parties+0.050
ashed+0.042
ashing+0.042
ashes+0.040
 recess+0.039
Neg logit contributions
 phrase-0.052
 word-0.047
fly-0.046
pe-0.046
cut-0.046
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
 phrase+0.016
cut+0.015
 ringing+0.014
fly+0.014
 guardian+0.014
Neg logit contributions
 parties-0.012
text-0.010
 recess-0.010
 outside-0.010
 tea-0.009
Token
Token ID198
Feature activation-5.64e-01

Pos logit contributions
 phrase+0.008
cut+0.007
 ringing+0.007
 claw+0.007
ller+0.007
Neg logit contributions
 parties-0.004
 recess-0.003
text-0.003
 tea-0.003
ashed-0.003
TokenThe
Token ID464
Feature activation-4.31e+00

Pos logit contributions
 phrase+0.020
cut+0.019
 ringing+0.018
 guardian+0.018
 claw+0.018
Neg logit contributions
 parties-0.016
text-0.013
 recess-0.012
 outside-0.012
 tea-0.012
Token rh
Token ID9529
Feature activation-4.33e+00

Pos logit contributions
 phrase+0.141
rum+0.125
cut+0.115
 drill+0.114
 guardian+0.110
Neg logit contributions
 parties-0.134
text-0.112
 recess-0.111
 outside-0.111
 picnic-0.111
Tokenin
Token ID259
Feature activation-4.73e+00

Pos logit contributions
 phrase+0.215
 ringing+0.190
 prayer+0.188
rum+0.185
 guardian+0.185
Neg logit contributions
 parties-0.054
he-0.052
 together-0.046
 outside-0.045
P-0.043
Tokenoc
Token ID420
Feature activation-1.14e+01

Pos logit contributions
 phrase+0.171
osity+0.165
rum+0.164
isel+0.149
omm+0.147
Neg logit contributions
 together-0.103
 play-0.102
 played-0.102
 they-0.101
 in-0.099
Tokeneros
Token ID27498
Feature activation-3.75e+00

Pos logit contributions
 phrase+0.398
 outage+0.369
 drill+0.354
 definition+0.347
Sad+0.345
Neg logit contributions
 parties-0.239
 outside-0.231
 each-0.222
aws-0.215
 in-0.206
Token nodded
Token ID14464
Feature activation-3.49e+00

Pos logit contributions
rum+0.146
 phrase+0.143
ric+0.138
cut+0.138
 drill+0.132
Neg logit contributions
 parties-0.081
 outside-0.072
 together-0.072
 played-0.070
 party-0.067
Token its
Token ID663
Feature activation-4.28e+00

Pos logit contributions
 phrase+0.157
cut+0.144
 ringing+0.143
 claw+0.140
fly+0.140
Neg logit contributions
 parties-0.071
 recess-0.052
text-0.049
 tea-0.043
 picnic-0.041
Token head
Token ID1182
Feature activation-3.97e+00

Pos logit contributions
cut+0.144
bul+0.132
 phrase+0.130
bek+0.129
rum+0.125
Neg logit contributions
 back-0.107
 both-0.100
 outside-0.099
 each-0.098
 play-0.097
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 phrase+0.194
cut+0.183
 ringing+0.179
rum+0.176
 guardian+0.175
Neg logit contributions
 parties-0.059
text-0.039
 recess-0.038
 outside-0.035
 tea-0.032
Token Billy
Token ID15890
Feature activation-3.85e-01

Pos logit contributions
 parties+0.003
text+0.002
 outside+0.002
 recess+0.002
 together+0.002
Neg logit contributions
 phrase-0.006
cut-0.006
 ringing-0.005
rum-0.005
 guardian-0.005
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
 phrase+0.016
cut+0.014
 ringing+0.014
 drill+0.014
 claw+0.014
Neg logit contributions
 parties-0.009
 outside-0.007
text-0.007
 together-0.006
 recess-0.006
Token The
Token ID383
Feature activation-3.81e+00

Pos logit contributions
 phrase+0.016
cut+0.015
 ringing+0.015
fly+0.015
 claw+0.015
Neg logit contributions
 parties-0.010
text-0.008
 outside-0.008
 recess-0.008
 together-0.008
Token rh
Token ID9529
Feature activation-4.51e+00

Pos logit contributions
 phrase+0.136
rum+0.125
bul+0.111
 drill+0.110
osity+0.109
Neg logit contributions
 parties-0.103
 picnic-0.092
 play-0.091
 party-0.090
 rides-0.088
Tokenin
Token ID259
Feature activation-4.88e+00

Pos logit contributions
 phrase+0.218
 prayer+0.195
 guardian+0.191
 ringing+0.190
 claw+0.189
Neg logit contributions
 parties-0.064
he-0.060
P-0.051
 together-0.051
aws-0.048
Tokenoc
Token ID420
Feature activation-1.13e+01

Pos logit contributions
 phrase+0.169
osity+0.167
isel+0.155
 appearances+0.155
 outage+0.155
Neg logit contributions
 played-0.103
 together-0.101
 play-0.101
 went-0.096
 in-0.095
Tokeneros
Token ID27498
Feature activation-3.85e+00

Pos logit contributions
 outage+0.380
 phrase+0.379
 drill+0.354
 prayer+0.337
 pounding+0.335
Neg logit contributions
 parties-0.264
aws-0.238
 rides-0.224
 outside-0.222
 each-0.220
Token looked
Token ID3114
Feature activation-1.04e+00

Pos logit contributions
 phrase+0.150
rum+0.141
ric+0.138
 drill+0.138
cut+0.136
Neg logit contributions
 parties-0.085
 together-0.071
 outside-0.070
text-0.069
 played-0.067
Token at
Token ID379
Feature activation-3.19e+00

Pos logit contributions
 phrase+0.044
 ringing+0.040
cut+0.040
 guardian+0.039
 claw+0.039
Neg logit contributions
 parties-0.023
 recess-0.019
text-0.018
|-0.015
 tea-0.014
Token Billy
Token ID15890
Feature activation+1.25e+00

Pos logit contributions
 phrase+0.131
cut+0.123
 ringing+0.120
 claw+0.117
rum+0.117
Neg logit contributions
 parties-0.071
text-0.056
 recess-0.053
 outside-0.051
 tea-0.050
Token with
Token ID351
Feature activation-4.10e-02

Pos logit contributions
 parties+0.020
 outside+0.015
text+0.014
 together+0.013
 recess+0.012
Neg logit contributions
 phrase-0.059
cut-0.055
 guardian-0.053
 drill-0.053
rum-0.053
Token.
Token ID13
Feature activation+4.31e-01

Pos logit contributions
 parties+0.023
text+0.016
 recess+0.015
 outside+0.014
 tea+0.013
Neg logit contributions
 phrase-0.072
cut-0.068
 ringing-0.067
rum-0.065
 guardian-0.065
Token
Token ID198
Feature activation+4.86e-01

Pos logit contributions
 parties+0.008
text+0.006
 recess+0.006
 outside+0.006
 together+0.005
Neg logit contributions
 phrase-0.019
cut-0.018
 ringing-0.018
 guardian-0.017
rum-0.017
Token
Token ID198
Feature activation-5.50e-02

Pos logit contributions
 parties+0.012
 recess+0.009
text+0.009
 outside+0.009
 tea+0.008
Neg logit contributions
 phrase-0.019
cut-0.018
rum-0.017
 ringing-0.017
 guardian-0.017
TokenThe
Token ID464
Feature activation-3.79e+00

Pos logit contributions
 phrase+0.002
cut+0.002
 ringing+0.002
 guardian+0.002
rum+0.002
Neg logit contributions
 parties-0.001
text-0.001
 recess-0.001
 outside-0.001
 tea-0.001
Token av
Token ID1196
Feature activation-6.43e+00

Pos logit contributions
 phrase+0.132
rum+0.121
fly+0.117
ric+0.115
ller+0.113
Neg logit contributions
 parties-0.111
 outside-0.095
 party-0.091
 recess-0.091
 picnic-0.088
Tokenoc
Token ID420
Feature activation-1.10e+01

Pos logit contributions
 phrase+0.219
 drill+0.189
 outage+0.188
 alarm+0.188
 guardian+0.187
Neg logit contributions
P-0.169
 together-0.168
 parties-0.163
he-0.156
text-0.151
Tokenados
Token ID22484
Feature activation-3.95e+00

Pos logit contributions
 outage+0.470
 phrase+0.468
 drill+0.441
 prayer+0.421
 pounding+0.420
Neg logit contributions
 parties-0.194
 outside-0.156
 together-0.146
 each-0.134
aws-0.131
Token were
Token ID547
Feature activation-8.17e-01

Pos logit contributions
 phrase+0.187
rum+0.173
 drill+0.169
 ringing+0.166
 claw+0.165
Neg logit contributions
 parties-0.067
 together-0.048
 outside-0.046
 party-0.043
he-0.043
Token shared
Token ID4888
Feature activation+2.37e+00

Pos logit contributions
 phrase+0.038
cut+0.035
 drill+0.035
rum+0.034
 ringing+0.034
Neg logit contributions
 parties-0.014
 outside-0.010
 together-0.010
text-0.010
 tea-0.009
Token with
Token ID351
Feature activation+1.46e+00

Pos logit contributions
 parties+0.045
text+0.035
 recess+0.034
 outside+0.031
 tea+0.029
Neg logit contributions
 phrase-0.104
cut-0.098
 ringing-0.095
 guardian-0.093
rum-0.093
Token everybody
Token ID7288
Feature activation+3.93e+00

Pos logit contributions
 parties+0.024
text+0.017
 together+0.016
 outside+0.016
 recess+0.016
Neg logit contributions
 phrase-0.069
cut-0.065
 drill-0.063
 ringing-0.063
rum-0.063
Token
Token ID198
Feature activation-2.28e-01

Pos logit contributions
 parties+0.025
 recess+0.020
text+0.019
|+0.018
ashed+0.018
Neg logit contributions
 phrase-0.036
 word-0.032
 ringing-0.032
 claw-0.031
cut-0.031
Token
Token ID198
Feature activation-4.87e-01

Pos logit contributions
 phrase+0.009
cut+0.008
rum+0.008
ller+0.008
 ringing+0.008
Neg logit contributions
 parties-0.006
 recess-0.004
ashed-0.004
text-0.004
 party-0.004
TokenThe
Token ID464
Feature activation-4.15e+00

Pos logit contributions
 phrase+0.017
cut+0.016
 claw+0.015
 guardian+0.015
 ringing+0.015
Neg logit contributions
 parties-0.014
text-0.011
 recess-0.011
 tea-0.011
 outside-0.011
Token rh
Token ID9529
Feature activation-4.07e+00

Pos logit contributions
 phrase+0.143
rum+0.130
 drill+0.118
osity+0.117
fly+0.116
Neg logit contributions
 parties-0.117
 picnic-0.100
 party-0.100
 outside-0.098
 play-0.098
Tokenin
Token ID259
Feature activation-4.61e+00

Pos logit contributions
 phrase+0.195
 ringing+0.177
rum+0.171
 definition+0.170
 prayer+0.170
Neg logit contributions
 parties-0.059
he-0.050
 together-0.046
 outside-0.045
P-0.044
Tokenoc
Token ID420
Feature activation-1.10e+01

Pos logit contributions
osity+0.162
rum+0.155
 phrase+0.153
 outage+0.140
omm+0.139
Neg logit contributions
 played-0.103
 in-0.101
 play-0.101
 went-0.101
 together-0.098
Tokeneros
Token ID27498
Feature activation-3.63e+00

Pos logit contributions
 phrase+0.370
 outage+0.342
 drill+0.336
 definition+0.331
 swollen+0.331
Neg logit contributions
 parties-0.250
 outside-0.239
 each-0.222
aws-0.222
 in-0.216
Token walked
Token ID6807
Feature activation+2.86e+00

Pos logit contributions
rum+0.139
 phrase+0.135
cut+0.133
ric+0.133
 guardian+0.128
Neg logit contributions
 parties-0.077
 outside-0.070
 played-0.069
 together-0.069
 at-0.065
Token to
Token ID284
Feature activation-3.55e+00

Pos logit contributions
 parties+0.069
text+0.054
 recess+0.053
 outside+0.052
 tea+0.050
Neg logit contributions
 phrase-0.114
cut-0.106
 ringing-0.103
rum-0.101
 guardian-0.101
Token the
Token ID262
Feature activation+2.26e-01

Pos logit contributions
 phrase+0.149
 ringing+0.137
 claw+0.135
cut+0.134
 guardian+0.133
Neg logit contributions
 parties-0.074
text-0.064
ashed-0.056
aws-0.053
ashing-0.053
Token fence
Token ID13990
Feature activation-3.01e+00

Pos logit contributions
 parties+0.006
 outside+0.006
 picnic+0.005
 party+0.005
 tea+0.005
Neg logit contributions
 phrase-0.008
rum-0.007
cut-0.007
fly-0.007
ller-0.007
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 parties+0.008
 outside+0.006
text+0.005
 together+0.005
 recess+0.005
Neg logit contributions
 phrase-0.030
cut-0.028
 guardian-0.027
 ringing-0.027
rum-0.027
Token
Token ID198
Feature activation-5.78e-01

Pos logit contributions
 phrase+0.010
cut+0.009
 ringing+0.009
rum+0.009
 drill+0.009
Neg logit contributions
 parties-0.005
 recess-0.004
text-0.004
 tea-0.004
ashed-0.004
TokenThe
Token ID464
Feature activation-4.14e+00

Pos logit contributions
 phrase+0.025
rum+0.023
 ringing+0.023
cut+0.022
 claw+0.022
Neg logit contributions
 parties-0.012
text-0.009
 together-0.009
 outside-0.008
P-0.008
Token rh
Token ID9529
Feature activation-4.40e+00

Pos logit contributions
 phrase+0.112
cut+0.098
rum+0.095
 drill+0.092
 ringing+0.092
Neg logit contributions
 parties-0.152
text-0.132
 recess-0.129
 outside-0.129
 tea-0.125
Tokenin
Token ID259
Feature activation-4.89e+00

Pos logit contributions
 phrase+0.212
 prayer+0.190
 ringing+0.187
 claw+0.184
 guardian+0.184
Neg logit contributions
 parties-0.062
he-0.058
 together-0.053
P-0.051
 outside-0.048
Tokenoc
Token ID420
Feature activation-1.10e+01

Pos logit contributions
 phrase+0.174
isel+0.162
 ey+0.159
rum+0.157
osity+0.156
Neg logit contributions
 together-0.102
 played-0.101
 play-0.100
text-0.096
 outside-0.096
Tokeneros
Token ID27498
Feature activation-4.03e+00

Pos logit contributions
 phrase+0.359
 outage+0.331
 drill+0.312
 prayer+0.307
 swollen+0.304
Neg logit contributions
 parties-0.279
 outside-0.243
 rides-0.235
 recess-0.230
 each-0.228
Token looked
Token ID3114
Feature activation-1.57e+00

Pos logit contributions
 phrase+0.168
ric+0.165
rum+0.164
 ringing+0.158
 drill+0.157
Neg logit contributions
 parties-0.069
 together-0.067
 outside-0.065
 played-0.064
ashed-0.059
Token up
Token ID510
Feature activation-2.36e+00

Pos logit contributions
 phrase+0.068
cut+0.064
 ringing+0.062
 guardian+0.061
 claw+0.061
Neg logit contributions
 parties-0.033
 recess-0.027
text-0.027
|-0.020
 tea-0.020
Token,
Token ID11
Feature activation+4.67e-01

Pos logit contributions
 phrase+0.105
cut+0.098
 ringing+0.096
 guardian+0.094
 claw+0.094
Neg logit contributions
 parties-0.046
text-0.035
 recess-0.033
 outside-0.030
 tea-0.030
Token hopeful
Token ID17836
Feature activation-2.99e+00

Pos logit contributions
 outside+0.007
 together+0.006
 parties+0.006
 around+0.005
 they+0.005
Neg logit contributions
 phrase-0.023
rum-0.022
cut-0.021
 drill-0.021
omm-0.021
Token happy
Token ID3772
Feature activation+4.28e-02

Pos logit contributions
 phrase+0.010
cut+0.010
 ringing+0.009
 guardian+0.009
rum+0.009
Neg logit contributions
 parties-0.006
 recess-0.005
text-0.005
ashed-0.004
 outside-0.004
Token."
Token ID526
Feature activation+1.48e-01

Pos logit contributions
 parties+0.001
text+0.001
 recess+0.001
aws+0.001
 tea+0.001
Neg logit contributions
 phrase-0.002
cut-0.002
 ringing-0.002
rum-0.002
fly-0.002
Token The
Token ID383
Feature activation-4.06e+00

Pos logit contributions
 parties+0.003
text+0.003
 recess+0.002
 outside+0.002
 tea+0.002
Neg logit contributions
 phrase-0.006
cut-0.006
 ringing-0.006
rum-0.006
 guardian-0.006
Token rh
Token ID9529
Feature activation-4.57e+00

Pos logit contributions
 phrase+0.144
rum+0.133
cut+0.118
 drill+0.118
 guardian+0.117
Neg logit contributions
 parties-0.113
 picnic-0.097
 party-0.097
 outside-0.096
text-0.095
Tokenin
Token ID259
Feature activation-5.09e+00

Pos logit contributions
 phrase+0.221
 prayer+0.198
 guardian+0.193
 ringing+0.192
 drill+0.187
Neg logit contributions
 parties-0.062
he-0.057
 together-0.057
 outside-0.051
P-0.050
Tokenoc
Token ID420
Feature activation-1.09e+01

Pos logit contributions
 phrase+0.183
 ey+0.178
osity+0.177
 angles+0.170
rum+0.167
Neg logit contributions
 together-0.111
 played-0.108
 went-0.104
 in-0.101
's-0.101
Tokeneros
Token ID27498
Feature activation-4.22e+00

Pos logit contributions
 phrase+0.368
 outage+0.352
 drill+0.334
 prayer+0.319
 definition+0.315
Neg logit contributions
 parties-0.258
aws-0.233
 outside-0.227
 together-0.225
 each-0.220
Token did
Token ID750
Feature activation-3.54e+00

Pos logit contributions
 phrase+0.167
rum+0.157
cut+0.157
 drill+0.155
ric+0.154
Neg logit contributions
 parties-0.090
 together-0.078
text-0.077
 outside-0.077
 played-0.072
Token what
Token ID644
Feature activation+4.73e-01

Pos logit contributions
 phrase+0.129
cut+0.121
 ringing+0.116
 guardian+0.113
rum+0.113
Neg logit contributions
 parties-0.096
text-0.077
 outside-0.077
 recess-0.076
 together-0.073
Token the
Token ID262
Feature activation-7.73e-02

Pos logit contributions
 parties+0.007
 outside+0.005
 together+0.005
text+0.005
 recess+0.004
Neg logit contributions
 phrase-0.023
cut-0.021
 guardian-0.021
 claw-0.021
 ringing-0.021
Token bird
Token ID6512
Feature activation-2.60e+00

Pos logit contributions
 phrase+0.003
rum+0.003
cut+0.002
 drill+0.002
ric+0.002
Neg logit contributions
 parties-0.002
 picnic-0.002
 party-0.002
 tea-0.002
 outside-0.002
Token here
Token ID994
Feature activation-1.71e-01

Pos logit contributions
 parties+0.020
 outside+0.015
text+0.014
 recess+0.014
 together+0.013
Neg logit contributions
 phrase-0.044
cut-0.041
 ringing-0.040
rum-0.039
 claw-0.039
Token?"
Token ID1701
Feature activation-6.49e-01

Pos logit contributions
 phrase+0.007
cut+0.006
 ringing+0.006
 guardian+0.006
fly+0.006
Neg logit contributions
 parties-0.004
text-0.003
 recess-0.003
 outside-0.003
 tea-0.003
Token The
Token ID383
Feature activation-4.14e+00

Pos logit contributions
 phrase+0.029
cut+0.028
 ringing+0.028
rum+0.028
 guardian+0.028
Neg logit contributions
 parties-0.010
 outside-0.008
 together-0.008
 play-0.007
text-0.007
Token rh
Token ID9529
Feature activation-4.59e+00

Pos logit contributions
 phrase+0.131
rum+0.114
cut+0.108
 drill+0.108
bul+0.104
Neg logit contributions
 parties-0.131
 outside-0.113
text-0.112
 play-0.110
 recess-0.110
Tokenin
Token ID259
Feature activation-5.22e+00

Pos logit contributions
 phrase+0.227
 prayer+0.206
 guardian+0.198
 ringing+0.197
 claw+0.194
Neg logit contributions
he-0.064
 parties-0.052
 together-0.051
P-0.046
aws-0.044
Tokenoc
Token ID420
Feature activation-1.09e+01

Pos logit contributions
osity+0.193
 phrase+0.190
 ey+0.184
rum+0.182
 outage+0.182
Neg logit contributions
-0.102
 together-0.101
s-0.097
's-0.094
 played-0.093
Tokeneros
Token ID27498
Feature activation-4.34e+00

Pos logit contributions
 phrase+0.368
 outage+0.358
 drill+0.329
 prayer+0.318
 swollen+0.312
Neg logit contributions
 parties-0.262
 rides-0.231
aws-0.230
 outside-0.228
 each-0.214
Token replied
Token ID8712
Feature activation-3.01e-01

Pos logit contributions
rum+0.173
ric+0.170
 phrase+0.167
 drill+0.162
osity+0.158
Neg logit contributions
 together-0.081
 at-0.079
 played-0.076
 parties-0.074
 in-0.073
Token,
Token ID11
Feature activation-7.54e-03

Pos logit contributions
 phrase+0.015
cut+0.014
 guardian+0.014
ric+0.014
 claw+0.014
Neg logit contributions
 parties-0.003
 outside-0.003
 together-0.002
 tea-0.002
text-0.002
Token "
Token ID366
Feature activation-2.13e+00

Pos logit contributions
 phrase+0.000
rum+0.000
 guardian+0.000
 outage+0.000
 drill+0.000
Neg logit contributions
 outside-0.000
 they-0.000
 together-0.000
 Lily-0.000
 in-0.000
TokenI
Token ID40
Feature activation+2.14e+00

Pos logit contributions
 phrase+0.101
 ringing+0.093
cut+0.092
rum+0.092
 drill+0.092
Neg logit contributions
 parties-0.032
text-0.025
 outside-0.023
 together-0.022
P-0.021
Token wings
Token ID12098
Feature activation+4.25e-01

Pos logit contributions
 parties+0.047
 recess+0.032
 outside+0.032
text+0.032
 tea+0.030
Neg logit contributions
 phrase-0.118
cut-0.114
rum-0.107
 guardian-0.107
 ringing-0.106
Token!"
Token ID2474
Feature activation+5.89e-01

Pos logit contributions
 parties+0.007
text+0.005
 outside+0.005
 recess+0.004
 together+0.004
Neg logit contributions
 phrase-0.020
cut-0.019
 ringing-0.018
 guardian-0.018
rum-0.018
Token The
Token ID383
Feature activation-3.90e+00

Pos logit contributions
 parties+0.012
text+0.010
 recess+0.010
 outside+0.009
 tea+0.009
Neg logit contributions
 phrase-0.025
cut-0.023
 ringing-0.023
 drill-0.022
rum-0.022
Token rh
Token ID9529
Feature activation-4.50e+00

Pos logit contributions
 phrase+0.143
rum+0.131
cut+0.124
 drill+0.122
fly+0.120
Neg logit contributions
 parties-0.105
text-0.087
 outside-0.085
 party-0.084
 tea-0.084
Tokenin
Token ID259
Feature activation-5.12e+00

Pos logit contributions
 phrase+0.205
 ringing+0.183
 prayer+0.182
 guardian+0.182
 drill+0.178
Neg logit contributions
 parties-0.076
he-0.064
 together-0.063
 outside-0.060
aws-0.058
Tokenoc
Token ID420
Feature activation-1.09e+01

Pos logit contributions
 phrase+0.160
 ey+0.156
osity+0.155
 outage+0.147
rum+0.146
Neg logit contributions
 played-0.132
 together-0.129
 went-0.124
 play-0.124
 in-0.121
Tokeneros
Token ID27498
Feature activation-4.15e+00

Pos logit contributions
 phrase+0.356
 outage+0.337
 drill+0.322
 prayer+0.312
 definition+0.299
Neg logit contributions
 parties-0.278
aws-0.248
 outside-0.236
 each-0.231
uggle-0.229
Token laughed
Token ID13818
Feature activation+1.48e-01

Pos logit contributions
 phrase+0.159
cut+0.149
rum+0.148
 drill+0.148
 guardian+0.148
Neg logit contributions
 parties-0.094
 together-0.082
 outside-0.078
text-0.077
 played-0.076
Token and
Token ID290
Feature activation+8.87e-02

Pos logit contributions
 parties+0.005
 recess+0.004
text+0.004
|+0.004
 pic+0.003
Neg logit contributions
 phrase-0.005
 ringing-0.005
cut-0.004
ller-0.004
 word-0.004
Token said
Token ID531
Feature activation+8.09e-01

Pos logit contributions
 parties+0.002
 outside+0.002
text+0.002
 recess+0.002
 together+0.002
Neg logit contributions
 phrase-0.003
cut-0.003
rum-0.003
 guardian-0.003
 ringing-0.003
Token,
Token ID11
Feature activation+5.79e-01

Pos logit contributions
 parties+0.012
text+0.008
 outside+0.008
 recess+0.007
 together+0.007
Neg logit contributions
 phrase-0.039
cut-0.037
 ringing-0.036
 guardian-0.036
rum-0.036
Token\n
Token ID198
Feature activation-8.85e-02

Pos logit contributions
 phrase+0.090
cut+0.088
rum+0.085
 drill+0.081
 ringing+0.081
Neg logit contributions
 parties-0.054
 recess-0.044
 tea-0.041
 party-0.039
text-0.039
Token\n
Token ID198
Feature activation-4.14e-01

Pos logit contributions
 phrase+0.003
cut+0.003
rum+0.003
 ringing+0.003
 drill+0.003
Neg logit contributions
 parties-0.002
 recess-0.002
 tea-0.002
text-0.002
ashed-0.002
TokenThe
Token ID464
Feature activation-4.13e+00

Pos logit contributions
 phrase+0.017
cut+0.015
 ringing+0.015
rum+0.015
 guardian+0.015
Neg logit contributions
 parties-0.010
text-0.008
 recess-0.007
 outside-0.007
 together-0.007
Token rh
Token ID9529
Feature activation-4.10e+00

Pos logit contributions
 phrase+0.141
rum+0.130
 drill+0.115
bul+0.114
cut+0.113
Neg logit contributions
 parties-0.116
 picnic-0.103
 party-0.102
 tea-0.100
 play-0.100
Tokenin
Token ID259
Feature activation-4.66e+00

Pos logit contributions
 phrase+0.199
 prayer+0.182
 ringing+0.179
 guardian+0.175
rum+0.171
Neg logit contributions
 parties-0.049
 together-0.048
P-0.047
he-0.045
aws-0.040
Tokenoc
Token ID420
Feature activation-1.08e+01

Pos logit contributions
 phrase+0.162
 outage+0.155
 ey+0.154
osity+0.153
 ringing+0.150
Neg logit contributions
 together-0.104
 played-0.097
 play-0.094
's-0.092
 went-0.091
Tokeneros
Token ID27498
Feature activation-3.81e+00

Pos logit contributions
 outage+0.371
 phrase+0.366
 prayer+0.335
 drill+0.331
 pounding+0.324
Neg logit contributions
 parties-0.222
 each-0.217
aws-0.210
 outside-0.205
 together-0.202
Token couldn
Token ID3521
Feature activation-2.10e+00

Pos logit contributions
 phrase+0.147
cut+0.136
rum+0.135
 ringing+0.135
 drill+0.133
Neg logit contributions
 parties-0.089
 outside-0.073
 together-0.073
text-0.072
 played-0.067
Token't
Token ID470
Feature activation-2.55e+00

Pos logit contributions
 phrase+0.056
cut+0.050
 ringing+0.049
rum+0.047
 drill+0.046
Neg logit contributions
 parties-0.078
 recess-0.067
text-0.066
 outside-0.065
 tea-0.064
Token believe
Token ID1975
Feature activation-8.17e-01

Pos logit contributions
 phrase+0.089
cut+0.081
 ringing+0.078
 guardian+0.078
 claw+0.077
Neg logit contributions
 parties-0.075
text-0.064
 recess-0.061
aws-0.059
 tea-0.058
Token his
Token ID465
Feature activation-1.13e+00

Pos logit contributions
 phrase+0.031
cut+0.028
 ringing+0.027
 guardian+0.027
rum+0.027
Neg logit contributions
 parties-0.021
text-0.017
 recess-0.017
 outside-0.017
 together-0.016
Token\n
Token ID198
Feature activation+2.14e-01

Pos logit contributions
 parties+0.004
text+0.003
 recess+0.003
 outside+0.003
 tea+0.003
Neg logit contributions
 phrase-0.010
cut-0.009
 ringing-0.009
rum-0.009
 guardian-0.009
Token\n
Token ID198
Feature activation-3.08e-01

Pos logit contributions
 parties+0.005
 recess+0.004
ashed+0.004
 tea+0.004
 party+0.004
Neg logit contributions
 phrase-0.008
cut-0.008
rum-0.007
 drill-0.007
 ringing-0.007
TokenThe
Token ID464
Feature activation-3.76e+00

Pos logit contributions
 phrase+0.011
cut+0.011
 ringing+0.010
rum+0.010
 guardian+0.010
Neg logit contributions
 parties-0.008
text-0.007
 recess-0.007
 outside-0.007
 tea-0.006
Token rh
Token ID9529
Feature activation-4.08e+00

Pos logit contributions
 phrase+0.128
rum+0.117
 drill+0.106
ric+0.104
fly+0.103
Neg logit contributions
 parties-0.107
 party-0.094
 play-0.093
 outside-0.093
 picnic-0.093
Tokenin
Token ID259
Feature activation-4.52e+00

Pos logit contributions
 phrase+0.205
 prayer+0.187
 ringing+0.177
 guardian+0.177
 definition+0.174
Neg logit contributions
he-0.049
 together-0.044
 parties-0.042
P-0.041
 outside-0.037
Tokenoc
Token ID420
Feature activation-1.08e+01

Pos logit contributions
 phrase+0.166
isel+0.153
osity+0.151
 outage+0.147
 ey+0.146
Neg logit contributions
 together-0.088
 in-0.088
he-0.087
text-0.086
 outside-0.085
Tokeneros
Token ID27498
Feature activation-3.54e+00

Pos logit contributions
 phrase+0.372
 outage+0.365
 drill+0.342
 swollen+0.326
 prayer+0.325
Neg logit contributions
 parties-0.228
 outside-0.214
 each-0.209
 rides-0.204
aws-0.201
Token was
Token ID373
Feature activation-1.74e+00

Pos logit contributions
 phrase+0.141
rum+0.130
cut+0.129
 drill+0.129
 ringing+0.127
Neg logit contributions
 parties-0.078
text-0.065
 outside-0.064
 together-0.064
 party-0.058
Token so
Token ID523
Feature activation-6.84e-01

Pos logit contributions
 phrase+0.079
cut+0.074
 drill+0.072
rum+0.071
fly+0.070
Neg logit contributions
 parties-0.031
 outside-0.025
 together-0.023
text-0.022
 recess-0.021
Token excited
Token ID6568
Feature activation-1.64e+00

Pos logit contributions
 phrase+0.031
cut+0.029
 drill+0.029
rum+0.028
fly+0.028
Neg logit contributions
 parties-0.012
 outside-0.009
 together-0.009
text-0.008
 recess-0.008
Token that
Token ID326
Feature activation+9.04e-01

Pos logit contributions
 phrase+0.069
cut+0.065
 ringing+0.064
 guardian+0.061
rum+0.061
Neg logit contributions
 parties-0.035
text-0.028
 recess-0.027
 tea-0.024
 outside-0.024
Token help
Token ID1037
Feature activation-9.36e-02

Pos logit contributions
 phrase+0.149
cut+0.133
rum+0.132
fly+0.129
 guardian+0.129
Neg logit contributions
 parties-0.080
 outside-0.060
 recess-0.059
 play-0.057
 tea-0.056
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
 phrase+0.005
cut+0.004
 ringing+0.004
 guardian+0.004
rum+0.004
Neg logit contributions
 parties-0.001
 outside-0.001
text-0.001
 recess-0.001
 together-0.001
Token The
Token ID383
Feature activation-3.60e+00

Pos logit contributions
 phrase+0.016
cut+0.015
rum+0.015
 guardian+0.014
 ringing+0.014
Neg logit contributions
 parties-0.005
 outside-0.004
 together-0.004
text-0.003
 Lily-0.003
Token rh
Token ID9529
Feature activation-4.33e+00

Pos logit contributions
 phrase+0.096
rum+0.082
cut+0.081
fly+0.075
 ringing+0.075
Neg logit contributions
 parties-0.135
 recess-0.116
text-0.116
 outside-0.116
 tea-0.113
Tokenin
Token ID259
Feature activation-4.68e+00

Pos logit contributions
 phrase+0.199
 prayer+0.178
 ringing+0.176
 claw+0.173
rum+0.172
Neg logit contributions
 parties-0.073
 together-0.063
he-0.061
 outside-0.057
P-0.054
Tokenoc
Token ID420
Feature activation-1.08e+01

Pos logit contributions
 phrase+0.177
 outage+0.152
osity+0.151
rum+0.150
isel+0.148
Neg logit contributions
 together-0.098
 played-0.093
s-0.093
's-0.092
 play-0.090
Tokeneros
Token ID27498
Feature activation-3.86e+00

Pos logit contributions
 phrase+0.344
 outage+0.338
 prayer+0.310
 swollen+0.304
 drill+0.300
Neg logit contributions
 parties-0.293
 outside-0.250
 recess-0.248
aws-0.241
 each-0.229
Token was
Token ID373
Feature activation-2.19e+00

Pos logit contributions
 phrase+0.171
ric+0.163
rum+0.161
 guardian+0.154
 ey+0.154
Neg logit contributions
 together-0.060
 played-0.059
 parties-0.056
 outside-0.055
ashed-0.052
Token very
Token ID845
Feature activation-2.70e+00

Pos logit contributions
 phrase+0.107
cut+0.098
 drill+0.097
rum+0.096
ric+0.096
Neg logit contributions
 outside-0.030
 parties-0.029
 together-0.027
 around-0.023
 everywhere-0.023
Token happy
Token ID3772
Feature activation-2.04e-01

Pos logit contributions
 phrase+0.127
ric+0.117
 drill+0.115
bul+0.114
cut+0.113
Neg logit contributions
 parties-0.040
 together-0.037
 outside-0.032
 play-0.032
 their-0.031
Token and
Token ID290
Feature activation+2.84e-01

Pos logit contributions
 phrase+0.010
cut+0.009
rum+0.009
 ringing+0.009
 guardian+0.009
Neg logit contributions
 parties-0.003
 outside-0.003
 together-0.002
 recess-0.002
text-0.002
Token\n
Token ID198
Feature activation+3.77e-01

Pos logit contributions
 phrase+0.069
cut+0.067
rum+0.065
 drill+0.064
ric+0.064
Neg logit contributions
 parties-0.043
 recess-0.035
 tea-0.032
 party-0.031
text-0.030
Token\n
Token ID198
Feature activation-1.32e-01

Pos logit contributions
 parties+0.009
 recess+0.007
 tea+0.006
 outside+0.006
text+0.006
Neg logit contributions
 phrase-0.015
cut-0.015
rum-0.014
 ringing-0.014
 drill-0.014
TokenThe
Token ID464
Feature activation-3.92e+00

Pos logit contributions
 phrase+0.005
cut+0.005
 ringing+0.004
rum+0.004
 guardian+0.004
Neg logit contributions
 parties-0.003
text-0.003
 recess-0.003
 outside-0.003
 tea-0.003
Token rh
Token ID9529
Feature activation-4.10e+00

Pos logit contributions
 phrase+0.136
rum+0.127
 outage+0.111
 drill+0.110
bul+0.110
Neg logit contributions
 parties-0.105
 picnic-0.097
 party-0.095
 play-0.095
 tea-0.093
Tokenin
Token ID259
Feature activation-4.61e+00

Pos logit contributions
 phrase+0.198
 prayer+0.181
 ringing+0.176
 guardian+0.173
rum+0.170
Neg logit contributions
 parties-0.051
 together-0.051
P-0.049
he-0.048
 outside-0.042
Tokenoc
Token ID420
Feature activation-1.08e+01

Pos logit contributions
 phrase+0.171
 outage+0.152
isel+0.152
osity+0.152
 ey+0.149
Neg logit contributions
 together-0.096
P-0.089
 played-0.089
 play-0.086
's-0.085
Tokeneros
Token ID27498
Feature activation-3.58e+00

Pos logit contributions
 outage+0.369
 phrase+0.365
 drill+0.336
 prayer+0.332
 definition+0.316
Neg logit contributions
 parties-0.226
 each-0.217
aws-0.215
 together-0.212
 outside-0.209
Token was
Token ID373
Feature activation-1.81e+00

Pos logit contributions
 phrase+0.143
cut+0.132
rum+0.130
 ringing+0.129
 drill+0.129
Neg logit contributions
 parties-0.081
text-0.067
 outside-0.064
 together-0.063
 recess-0.059
Token so
Token ID523
Feature activation-7.68e-01

Pos logit contributions
 phrase+0.083
cut+0.078
 drill+0.074
rum+0.074
 ringing+0.074
Neg logit contributions
 parties-0.032
 outside-0.024
text-0.023
 recess-0.022
 together-0.022
Token excited
Token ID6568
Feature activation-1.69e+00

Pos logit contributions
 phrase+0.033
cut+0.031
 ringing+0.030
rum+0.029
 drill+0.029
Neg logit contributions
 parties-0.016
text-0.012
 recess-0.012
 outside-0.012
 together-0.011
Token to
Token ID284
Feature activation-3.56e+00

Pos logit contributions
 phrase+0.068
 ringing+0.063
cut+0.062
ller+0.059
fly+0.059
Neg logit contributions
 parties-0.039
text-0.033
 recess-0.032
 tea-0.028
 outside-0.027
Token away
Token ID1497
Feature activation-3.57e+00

Pos logit contributions
 parties+0.110
text+0.090
 recess+0.087
 tea+0.078
 party+0.074
Neg logit contributions
 phrase-0.178
 ringing-0.162
cut-0.162
fly-0.158
 guardian-0.156
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 phrase+0.173
cut+0.163
 ringing+0.160
 guardian+0.158
rum+0.157
Neg logit contributions
 parties-0.054
text-0.036
 recess-0.035
 outside-0.032
 tea-0.029
Token The
Token ID383
Feature activation-3.57e+00

Pos logit contributions
 phrase+0.008
cut+0.008
 ringing+0.008
rum+0.008
 guardian+0.008
Neg logit contributions
 parties-0.005
text-0.004
 outside-0.004
 recess-0.004
 together-0.004
Token rh
Token ID9529
Feature activation-4.22e+00

Pos logit contributions
 phrase+0.119
rum+0.111
cut+0.098
 drill+0.097
fly+0.096
Neg logit contributions
 parties-0.108
 outside-0.093
 party-0.093
 picnic-0.092
 tea-0.091
Tokenin
Token ID259
Feature activation-4.62e+00

Pos logit contributions
 phrase+0.203
 prayer+0.181
 guardian+0.175
 ringing+0.175
rum+0.173
Neg logit contributions
 parties-0.061
he-0.057
 together-0.054
 outside-0.049
P-0.048
Tokenoc
Token ID420
Feature activation-1.08e+01

Pos logit contributions
 phrase+0.169
isel+0.150
rum+0.147
osity+0.146
 outage+0.146
Neg logit contributions
 played-0.101
text-0.100
 play-0.099
he-0.097
 outside-0.097
Tokeneros
Token ID27498
Feature activation-3.58e+00

Pos logit contributions
 phrase+0.349
 outage+0.336
 drill+0.322
 prayer+0.303
 swollen+0.301
Neg logit contributions
 parties-0.282
 outside-0.253
aws-0.236
 each-0.235
 recess-0.235
Token felt
Token ID2936
Feature activation-3.15e-01

Pos logit contributions
 phrase+0.142
cut+0.131
rum+0.130
 drill+0.129
 ringing+0.127
Neg logit contributions
 parties-0.082
text-0.069
 outside-0.067
 together-0.064
 party-0.060
Token che
Token ID1125
Feature activation-1.65e+00

Pos logit contributions
 phrase+0.012
cut+0.011
 ringing+0.011
 guardian+0.011
 claw+0.010
Neg logit contributions
 parties-0.009
 recess-0.007
text-0.006
dates-0.006
 tea-0.006
Tokenery
Token ID1924
Feature activation-1.58e+00

Pos logit contributions
 phrase+0.056
cut+0.052
rum+0.049
 ringing+0.049
 claw+0.048
Neg logit contributions
 parties-0.050
 recess-0.040
text-0.040
 outside-0.040
 tea-0.038
Token and
Token ID290
Feature activation-1.20e-01

Pos logit contributions
 phrase+0.070
cut+0.065
 ringing+0.063
rum+0.063
 drill+0.062
Neg logit contributions
 parties-0.030
 outside-0.023
text-0.023
 recess-0.021
 together-0.021
TokenFinally
Token ID11158
Feature activation-3.40e+00

Pos logit contributions
 phrase+0.011
cut+0.010
 ringing+0.009
 guardian+0.009
rum+0.009
Neg logit contributions
 parties-0.008
text-0.007
 recess-0.007
 outside-0.007
 tea-0.006
Token,
Token ID11
Feature activation+1.55e-01

Pos logit contributions
 ringing+0.109
 phrase+0.109
 guardian+0.100
 word+0.099
 prayer+0.099
Neg logit contributions
 parties-0.090
ashes-0.085
ashing-0.081
text-0.080
aws-0.079
Token the
Token ID262
Feature activation-2.31e+00

Pos logit contributions
 parties+0.003
text+0.002
 recess+0.002
 outside+0.002
aws+0.002
Neg logit contributions
 phrase-0.007
cut-0.006
 ringing-0.006
 guardian-0.006
fly-0.006
Token rh
Token ID9529
Feature activation-3.92e+00

Pos logit contributions
 phrase+0.094
rum+0.092
omm+0.081
bek+0.080
cut+0.079
Neg logit contributions
 parties-0.051
 outside-0.048
 picnic-0.047
 play-0.046
 party-0.045
Tokenin
Token ID259
Feature activation-3.91e+00

Pos logit contributions
 phrase+0.190
 prayer+0.169
 ringing+0.167
rum+0.165
 claw+0.165
Neg logit contributions
 parties-0.056
he-0.050
 together-0.046
 outside-0.044
P-0.044
Tokenoc
Token ID420
Feature activation-1.07e+01

Pos logit contributions
 phrase+0.132
rum+0.123
osity+0.119
 swollen+0.119
 ey+0.119
Neg logit contributions
 played-0.097
 play-0.097
 outside-0.095
 together-0.095
 went-0.092
Tokeneros
Token ID27498
Feature activation-2.74e+00

Pos logit contributions
 phrase+0.339
 outage+0.333
 swollen+0.328
 drill+0.312
 pounding+0.302
Neg logit contributions
 parties-0.263
 outside-0.253
 each-0.236
 back-0.222
 in-0.215
Token remembered
Token ID12086
Feature activation-2.06e+00

Pos logit contributions
 phrase+0.105
cut+0.098
 ringing+0.095
 guardian+0.093
 claw+0.093
Neg logit contributions
 parties-0.069
 recess-0.055
text-0.055
 outside-0.052
 tea-0.050
Token that
Token ID326
Feature activation+9.16e-01

Pos logit contributions
 phrase+0.097
cut+0.092
rum+0.088
 drill+0.087
 claw+0.087
Neg logit contributions
 parties-0.037
 outside-0.026
 recess-0.024
 tea-0.024
 together-0.023
Token animals
Token ID4695
Feature activation+1.29e+00

Pos logit contributions
 parties+0.014
 outside+0.010
 together+0.010
 they+0.009
 tea+0.009
Neg logit contributions
 phrase-0.045
rum-0.042
cut-0.041
ric-0.041
ller-0.041
Token often
Token ID1690
Feature activation-2.81e+00

Pos logit contributions
 parties+0.026
text+0.019
 recess+0.019
 outside+0.018
 tea+0.017
Neg logit contributions
 phrase-0.056
cut-0.053
 ringing-0.051
rum-0.051
 guardian-0.050
Token\n
Token ID198
Feature activation+3.91e-01

Pos logit contributions
 parties+0.001
text+0.001
 outside+0.001
 together+0.000
 Lily+0.000
Neg logit contributions
 phrase-0.001
cut-0.001
 ringing-0.001
fly-0.001
 drill-0.001
Token\n
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 parties+0.007
 recess+0.005
text+0.005
 outside+0.004
 tea+0.004
Neg logit contributions
 phrase-0.018
cut-0.017
 ringing-0.017
rum-0.016
 guardian-0.016
TokenThe
Token ID464
Feature activation-3.71e+00

Pos logit contributions
 phrase+0.004
fly+0.004
 ringing+0.004
 drill+0.004
cut+0.004
Neg logit contributions
 parties-0.002
text-0.002
 outside-0.002
P-0.002
 together-0.002
Token rh
Token ID9529
Feature activation-3.97e+00

Pos logit contributions
 phrase+0.123
rum+0.113
 drill+0.100
fly+0.099
cut+0.099
Neg logit contributions
 parties-0.111
 picnic-0.099
 outside-0.098
 play-0.096
 party-0.096
Tokenin
Token ID259
Feature activation-4.41e+00

Pos logit contributions
 phrase+0.191
 prayer+0.171
rum+0.165
 guardian+0.164
 ringing+0.164
Neg logit contributions
 parties-0.054
he-0.050
 together-0.049
P-0.045
 outside-0.044
Tokenoc
Token ID420
Feature activation-1.07e+01

Pos logit contributions
 phrase+0.167
 outage+0.153
osity+0.152
rum+0.149
 appearances+0.144
Neg logit contributions
 played-0.088
 went-0.085
 play-0.085
 together-0.084
 met-0.081
Tokeneros
Token ID27498
Feature activation-3.32e+00

Pos logit contributions
 phrase+0.362
 outage+0.362
 drill+0.332
 prayer+0.314
Sad+0.306
Neg logit contributions
 parties-0.242
aws-0.223
 each-0.215
 outside-0.214
 rides-0.213
Token was
Token ID373
Feature activation-1.90e+00

Pos logit contributions
 phrase+0.134
rum+0.124
cut+0.123
 drill+0.123
 ringing+0.119
Neg logit contributions
 parties-0.071
 outside-0.060
text-0.059
 together-0.058
 played-0.054
Token so
Token ID523
Feature activation-8.58e-01

Pos logit contributions
 phrase+0.088
cut+0.083
 drill+0.081
rum+0.080
fly+0.078
Neg logit contributions
 parties-0.032
 outside-0.026
 together-0.023
text-0.022
 tea-0.021
Token delighted
Token ID20707
Feature activation-1.37e+00

Pos logit contributions
 phrase+0.038
cut+0.035
 drill+0.034
 ringing+0.034
rum+0.034
Neg logit contributions
 parties-0.017
text-0.013
 outside-0.013
 recess-0.012
 together-0.012
Token with
Token ID351
Feature activation-5.70e-01

Pos logit contributions
 phrase+0.063
cut+0.059
rum+0.058
 drill+0.057
 claw+0.057
Neg logit contributions
 parties-0.024
 outside-0.019
 together-0.017
text-0.016
 recess-0.015
Token\n
Token ID198
Feature activation+3.81e-01

Pos logit contributions
 parties+0.004
 recess+0.003
text+0.003
 outside+0.003
 tea+0.003
Neg logit contributions
 phrase-0.007
cut-0.007
 ringing-0.006
rum-0.006
 guardian-0.006
Token\n
Token ID198
Feature activation-1.88e-01

Pos logit contributions
 parties+0.010
 recess+0.007
 tea+0.007
 party+0.007
ashed+0.007
Neg logit contributions
 phrase-0.015
cut-0.014
rum-0.014
 ringing-0.013
pe-0.013
TokenThe
Token ID464
Feature activation-3.83e+00

Pos logit contributions
 phrase+0.007
cut+0.006
 ringing+0.006
rum+0.006
 guardian+0.006
Neg logit contributions
 parties-0.005
 recess-0.004
text-0.004
 outside-0.004
 tea-0.004
Token rh
Token ID9529
Feature activation-4.01e+00

Pos logit contributions
 phrase+0.127
rum+0.115
 drill+0.102
fly+0.101
cut+0.101
Neg logit contributions
 parties-0.112
 picnic-0.098
 outside-0.097
text-0.097
 play-0.096
Tokenin
Token ID259
Feature activation-4.56e+00

Pos logit contributions
 phrase+0.192
 prayer+0.172
 ringing+0.166
 guardian+0.166
rum+0.164
Neg logit contributions
 parties-0.054
he-0.052
 together-0.051
P-0.045
aws-0.044
Tokenoc
Token ID420
Feature activation-1.07e+01

Pos logit contributions
 phrase+0.169
 outage+0.155
osity+0.152
isel+0.152
 ey+0.147
Neg logit contributions
 together-0.092
 played-0.089
text-0.085
 play-0.084
 were-0.083
Tokeneros
Token ID27498
Feature activation-3.59e+00

Pos logit contributions
 outage+0.363
 phrase+0.360
 drill+0.336
 prayer+0.313
 swollen+0.302
Neg logit contributions
 parties-0.245
aws-0.231
 each-0.218
 rides-0.215
 outside-0.213
Token smiled
Token ID13541
Feature activation-1.33e+00

Pos logit contributions
 phrase+0.143
cut+0.131
 drill+0.131
rum+0.130
 ringing+0.128
Neg logit contributions
 parties-0.081
text-0.067
 outside-0.065
 together-0.064
 tea-0.059
Token and
Token ID290
Feature activation-1.63e-01

Pos logit contributions
 phrase+0.059
cut+0.056
 ringing+0.054
 guardian+0.053
rum+0.053
Neg logit contributions
 parties-0.026
text-0.019
 recess-0.018
 outside-0.017
 tea-0.016
Token flew
Token ID13112
Feature activation+3.71e+00

Pos logit contributions
 phrase+0.006
cut+0.006
 drill+0.006
rum+0.006
 ringing+0.006
Neg logit contributions
 parties-0.004
 together-0.003
text-0.003
 outside-0.003
 recess-0.003
Token away
Token ID1497
Feature activation-3.88e+00

Pos logit contributions
 parties+0.078
text+0.061
 recess+0.058
 tea+0.052
 party+0.049
Neg logit contributions
 phrase-0.161
cut-0.150
 ringing-0.146
 guardian-0.143
 claw-0.142
Tokenhe
Token ID258
Feature activation-7.85e-01

Pos logit contributions
 parties+0.074
 recess+0.070
 picnic+0.068
text+0.067
 pic+0.063
Neg logit contributions
cut-0.150
 phrase-0.134
pe-0.132
rum-0.128
icle-0.126
Token."
Token ID526
Feature activation+2.47e-01

Pos logit contributions
 phrase+0.033
cut+0.031
rum+0.030
 ringing+0.030
 drill+0.029
Neg logit contributions
 parties-0.016
text-0.013
 recess-0.013
aws-0.011
ashed-0.011
Token The
Token ID383
Feature activation-3.89e+00

Pos logit contributions
 parties+0.005
 outside+0.004
text+0.004
 recess+0.003
 together+0.003
Neg logit contributions
 phrase-0.011
cut-0.010
 ringing-0.010
 guardian-0.010
rum-0.010
Token rh
Token ID9529
Feature activation-4.21e+00

Pos logit contributions
 phrase+0.126
rum+0.113
cut+0.104
 drill+0.104
 guardian+0.100
Neg logit contributions
 parties-0.121
 outside-0.104
text-0.103
 party-0.102
 picnic-0.102
Tokenin
Token ID259
Feature activation-4.95e+00

Pos logit contributions
 phrase+0.194
 ringing+0.171
 prayer+0.171
 guardian+0.169
 drill+0.166
Neg logit contributions
 parties-0.069
he-0.059
 together-0.059
 outside-0.054
aws-0.052
Tokenoc
Token ID420
Feature activation-1.06e+01

Pos logit contributions
 phrase+0.171
osity+0.170
 ey+0.167
rum+0.158
 outage+0.157
Neg logit contributions
 played-0.109
 together-0.106
 went-0.106
 play-0.103
 in-0.102
Tokeneros
Token ID27498
Feature activation-4.00e+00

Pos logit contributions
 phrase+0.366
 outage+0.342
 drill+0.317
 pounding+0.315
 swollen+0.312
Neg logit contributions
 parties-0.251
aws-0.222
 outside-0.216
 each-0.214
 together-0.213
Token felt
Token ID2936
Feature activation+9.89e-02

Pos logit contributions
 phrase+0.155
 ringing+0.145
cut+0.144
rum+0.143
 guardian+0.142
Neg logit contributions
 parties-0.089
 together-0.078
 outside-0.076
text-0.075
ashed-0.074
Token a
Token ID257
Feature activation+2.17e+00

Pos logit contributions
 parties+0.003
 recess+0.002
text+0.002
aws+0.002
 outside+0.002
Neg logit contributions
 phrase-0.004
cut-0.003
 ringing-0.003
 claw-0.003
rum-0.003
Token little
Token ID1310
Feature activation-1.98e-01

Pos logit contributions
 parties+0.049
text+0.041
 outside+0.041
 tea+0.038
 together+0.037
Neg logit contributions
 phrase-0.085
cut-0.081
 ringing-0.077
rum-0.077
fly-0.077
Token better
Token ID1365
Feature activation+3.22e-02

Pos logit contributions
 phrase+0.007
cut+0.007
 ringing+0.006
rum+0.006
 guardian+0.006
Neg logit contributions
 parties-0.005
text-0.004
 outside-0.004
 recess-0.004
 tea-0.004
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
 phrase+0.008
cut+0.008
 ringing+0.007
rum+0.007
ller+0.007
Neg logit contributions
 parties-0.005
 recess-0.004
text-0.004
 tea-0.004
ashed-0.004
TokenThe
Token ID464
Feature activation-4.17e+00

Pos logit contributions
 phrase+0.024
fly+0.022
 ringing+0.022
rum+0.022
 guardian+0.022
Neg logit contributions
 parties-0.010
text-0.007
 outside-0.007
 together-0.007
P-0.007
Token bad
Token ID2089
Feature activation-3.51e+00

Pos logit contributions
 phrase+0.138
rum+0.122
cut+0.115
ller+0.112
ric+0.112
Neg logit contributions
 parties-0.127
 outside-0.107
 tea-0.105
text-0.105
 party-0.104
Token rh
Token ID9529
Feature activation-3.99e+00

Pos logit contributions
 phrase+0.117
rum+0.100
ric+0.099
bul+0.095
 claw+0.093
Neg logit contributions
 parties-0.104
 outside-0.089
 tea-0.086
 party-0.085
 play-0.085
Tokenin
Token ID259
Feature activation-4.19e+00

Pos logit contributions
 phrase+0.191
 prayer+0.170
 ringing+0.169
 claw+0.165
 guardian+0.165
Neg logit contributions
 parties-0.056
he-0.053
 together-0.049
P-0.046
 outside-0.045
Tokenoc
Token ID420
Feature activation-1.06e+01

Pos logit contributions
 phrase+0.161
 ey+0.156
osity+0.155
 angles+0.146
isel+0.146
Neg logit contributions
's-0.074
 together-0.074
 in-0.071
he-0.069
 played-0.069
Tokeneros
Token ID27498
Feature activation-3.31e+00

Pos logit contributions
 phrase+0.365
 outage+0.347
 drill+0.318
 prayer+0.316
 pounding+0.309
Neg logit contributions
 parties-0.239
 outside-0.212
uggle-0.206
aws-0.204
 each-0.203
Token was
Token ID373
Feature activation-1.44e+00

Pos logit contributions
 phrase+0.131
cut+0.122
rum+0.121
 ringing+0.119
 drill+0.118
Neg logit contributions
 parties-0.073
 outside-0.061
 together-0.060
text-0.059
 played-0.054
Token so
Token ID523
Feature activation-7.63e-01

Pos logit contributions
 phrase+0.066
cut+0.060
 drill+0.060
rum+0.059
inct+0.058
Neg logit contributions
 parties-0.025
 outside-0.022
 together-0.021
 around-0.019
 their-0.018
Token happy
Token ID3772
Feature activation-6.62e-01

Pos logit contributions
 phrase+0.034
 drill+0.031
ric+0.031
cut+0.030
rum+0.030
Neg logit contributions
 parties-0.013
 together-0.012
 outside-0.012
 they-0.011
 at-0.011
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
 phrase+0.030
cut+0.028
 ringing+0.028
 guardian+0.027
rum+0.027
Neg logit contributions
 parties-0.012
 outside-0.009
text-0.009
 recess-0.008
 together-0.008
Token now
Token ID783
Feature activation+3.02e+00

Pos logit contributions
 parties+0.065
 recess+0.057
text+0.051
ashed+0.050
ashes+0.050
Neg logit contributions
 phrase-0.094
cut-0.090
 guardian-0.088
 ringing-0.088
fly-0.087
Token!"
Token ID2474
Feature activation+1.15e+00

Pos logit contributions
 parties+0.086
aws+0.080
text+0.078
 recess+0.073
ashes+0.071
Neg logit contributions
 ringing-0.097
 phrase-0.095
rum-0.092
 guardian-0.091
cut-0.089
Token The
Token ID383
Feature activation-3.68e+00

Pos logit contributions
 parties+0.029
 recess+0.024
text+0.023
 outside+0.021
 tea+0.021
Neg logit contributions
 phrase-0.042
cut-0.040
ller-0.038
fly-0.038
 ringing-0.038
Token rh
Token ID9529
Feature activation-4.03e+00

Pos logit contributions
 phrase+0.126
rum+0.113
 drill+0.103
cut+0.102
fly+0.097
Neg logit contributions
 parties-0.107
 party-0.092
 outside-0.092
 picnic-0.091
 tea-0.091
Tokenin
Token ID259
Feature activation-4.63e+00

Pos logit contributions
 phrase+0.193
 prayer+0.173
 ringing+0.169
 guardian+0.168
 drill+0.166
Neg logit contributions
 parties-0.055
he-0.052
 together-0.051
 outside-0.046
P-0.045
Tokenoc
Token ID420
Feature activation-1.06e+01

Pos logit contributions
 phrase+0.153
osity+0.140
 ey+0.138
 outage+0.137
isel+0.136
Neg logit contributions
 together-0.115
 played-0.112
 in-0.108
-0.108
 went-0.108
Tokeneros
Token ID27498
Feature activation-3.67e+00

Pos logit contributions
 phrase+0.359
 outage+0.341
 drill+0.331
 prayer+0.316
 definition+0.301
Neg logit contributions
 parties-0.247
aws-0.229
 outside-0.222
 each-0.216
 together-0.215
Token felt
Token ID2936
Feature activation+8.87e-02

Pos logit contributions
 phrase+0.141
cut+0.132
rum+0.129
 ringing+0.128
 drill+0.128
Neg logit contributions
 parties-0.088
text-0.073
 outside-0.071
 together-0.070
 tea-0.066
Token happy
Token ID3772
Feature activation-1.39e-01

Pos logit contributions
 parties+0.003
 recess+0.002
text+0.002
ashing+0.002
suits+0.002
Neg logit contributions
 phrase-0.003
cut-0.003
 guardian-0.003
 ringing-0.003
 gen-0.003
Token too
Token ID1165
Feature activation-1.09e+00

Pos logit contributions
 phrase+0.006
cut+0.005
 ringing+0.005
 guardian+0.005
fly+0.005
Neg logit contributions
 parties-0.003
text-0.003
 recess-0.003
 outside-0.002
 tea-0.002
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
 phrase+0.046
 ringing+0.044
cut+0.043
 guardian+0.042
ller+0.042
Neg logit contributions
 parties-0.023
 recess-0.018
text-0.018
 tea-0.015
aws-0.015
Token doing
Token ID1804
Feature activation+1.09e+00

Pos logit contributions
 parties+0.128
 outside+0.108
 recess+0.096
 party+0.096
 tea+0.095
Neg logit contributions
 phrase-0.181
cut-0.166
rum-0.160
 drill-0.151
ller-0.151
Token?"
Token ID1701
Feature activation-3.26e-01

Pos logit contributions
 parties+0.023
text+0.019
 recess+0.018
 outside+0.017
 tea+0.016
Neg logit contributions
 phrase-0.046
cut-0.043
 ringing-0.042
 guardian-0.041
 drill-0.041
Token The
Token ID383
Feature activation-3.75e+00

Pos logit contributions
 phrase+0.016
cut+0.015
 guardian+0.015
rum+0.015
 ringing+0.015
Neg logit contributions
 parties-0.004
 outside-0.003
 together-0.003
text-0.003
 recess-0.003
Token rh
Token ID9529
Feature activation-4.28e+00

Pos logit contributions
 phrase+0.120
rum+0.106
cut+0.102
 drill+0.100
 guardian+0.098
Neg logit contributions
 parties-0.120
text-0.102
 outside-0.101
 tea-0.098
 recess-0.097
Tokenin
Token ID259
Feature activation-4.98e+00

Pos logit contributions
 phrase+0.197
 ringing+0.174
 prayer+0.174
 guardian+0.173
 claw+0.170
Neg logit contributions
 parties-0.072
he-0.064
 together-0.060
 outside-0.057
P-0.054
Tokenoc
Token ID420
Feature activation-1.06e+01

Pos logit contributions
 phrase+0.167
osity+0.160
 ey+0.153
rum+0.150
 width+0.149
Neg logit contributions
-0.123
 together-0.120
 played-0.117
.-0.115
-0.114
Tokeneros
Token ID27498
Feature activation-4.04e+00

Pos logit contributions
 phrase+0.360
 outage+0.332
 drill+0.319
 prayer+0.313
 pounding+0.299
Neg logit contributions
 parties-0.256
aws-0.226
 outside-0.222
uggle-0.213
 each-0.212
Token replied
Token ID8712
Feature activation-1.71e-02

Pos logit contributions
 phrase+0.164
rum+0.158
cut+0.154
 drill+0.154
 guardian+0.152
Neg logit contributions
 parties-0.080
 together-0.072
 outside-0.069
 played-0.068
 at-0.065
Token,
Token ID11
Feature activation+2.68e-01

Pos logit contributions
 phrase+0.001
cut+0.001
 guardian+0.001
 ringing+0.001
rum+0.001
Neg logit contributions
 parties-0.000
 outside-0.000
text-0.000
 together-0.000
 recess-0.000
Token "
Token ID366
Feature activation-1.79e+00

Pos logit contributions
 parties+0.006
 outside+0.006
 together+0.006
 they+0.006
 Lily+0.006
Neg logit contributions
 phrase-0.010
rum-0.009
 guardian-0.009
cut-0.009
 claw-0.009
TokenI
Token ID40
Feature activation+2.38e+00

Pos logit contributions
 phrase+0.084
cut+0.077
rum+0.077
 ringing+0.076
 drill+0.076
Neg logit contributions
 parties-0.029
text-0.022
 outside-0.021
 recess-0.019
 together-0.019
Token
Token ID26391
Feature activation+1.21e+00

Pos logit contributions
 parties+0.011
text+0.009
 outside+0.009
 recess+0.009
 together+0.008
Neg logit contributions
 phrase-0.015
cut-0.014
 ringing-0.013
 guardian-0.013
fly-0.013
Token
Token ID8151
Feature activation+2.09e+00

Pos logit contributions
 parties+0.035
 recess+0.030
 tea+0.028
 outside+0.028
text+0.027
Neg logit contributions
 phrase-0.039
ller-0.036
 ringing-0.036
cut-0.036
rum-0.036
Tokenll
Token ID297
Feature activation-1.18e+00

Pos logit contributions
 parties+0.067
 recess+0.056
 tea+0.054
text+0.054
 outside+0.053
Neg logit contributions
 phrase-0.064
cut-0.061
rum-0.059
 ringing-0.058
 guardian-0.055
Token have
Token ID423
Feature activation+2.02e+00

Pos logit contributions
 phrase+0.046
cut+0.042
 ringing+0.041
 guardian+0.041
fly+0.040
Neg logit contributions
 parties-0.029
 outside-0.023
text-0.022
 recess-0.022
 together-0.021
Token to
Token ID284
Feature activation-2.54e+00

Pos logit contributions
 parties+0.042
text+0.031
 outside+0.031
 recess+0.031
 tea+0.030
Neg logit contributions
 phrase-0.087
cut-0.083
 ringing-0.079
fly-0.079
rum-0.079
Token go
Token ID467
Feature activation+4.55e+00

Pos logit contributions
 phrase+0.104
cut+0.094
fly+0.093
ller+0.093
rum+0.092
Neg logit contributions
 parties-0.060
 play-0.047
 outside-0.047
 party-0.044
 tea-0.043
Token back
Token ID736
Feature activation+8.57e-01

Pos logit contributions
 parties+0.096
text+0.084
 recess+0.073
aws+0.069
ashed+0.066
Neg logit contributions
 phrase-0.186
cut-0.176
 ringing-0.175
 guardian-0.170
 drill-0.170
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
 parties+0.019
text+0.016
 recess+0.014
 tea+0.013
aws+0.013
Neg logit contributions
 phrase-0.034
cut-0.032
 ringing-0.032
 guardian-0.031
rum-0.031
Token get
Token ID651
Feature activation+9.31e-01

Pos logit contributions
 parties+0.032
 outside+0.026
 together+0.024
 recess+0.023
 play+0.023
Neg logit contributions
 phrase-0.065
cut-0.059
rum-0.058
 ringing-0.057
fly-0.056
Token a
Token ID257
Feature activation+3.44e+00

Pos logit contributions
 parties+0.021
text+0.016
 recess+0.016
 outside+0.015
 tea+0.014
Neg logit contributions
 phrase-0.038
cut-0.036
 ringing-0.035
 claw-0.034
 guardian-0.034
Token new
Token ID649
Feature activation+1.23e+00

Pos logit contributions
 parties+0.078
 tea+0.072
 bath+0.072
 picnic+0.071
 party+0.070
Neg logit contributions
rum-0.132
 phrase-0.130
cut-0.128
ller-0.124
ric-0.122
Token going
Token ID1016
Feature activation+2.48e+00

Pos logit contributions
 parties+0.022
text+0.021
 recess+0.019
ashed+0.018
aws+0.018
Neg logit contributions
 ringing-0.028
 phrase-0.027
cut-0.025
 guardian-0.025
 drill-0.024
Token on
Token ID319
Feature activation+1.24e+00

Pos logit contributions
text+0.070
 parties+0.068
ackers+0.064
aws+0.060
he+0.059
Neg logit contributions
 ringing-0.079
 phrase-0.072
 drill-0.071
 guardian-0.069
ller-0.066
Token a
Token ID257
Feature activation+4.67e+00

Pos logit contributions
ashed+0.032
ashes+0.031
ashing+0.031
ackers+0.029
 parties+0.029
Neg logit contributions
 ringing-0.040
 phrase-0.037
ller-0.036
 beast-0.036
 drill-0.035
Token trip
Token ID5296
Feature activation+2.67e+00

Pos logit contributions
text+0.112
aws+0.107
ashed+0.107
 parties+0.106
ashes+0.106
Neg logit contributions
 ringing-0.163
 phrase-0.151
 drill-0.145
ller-0.143
 alarm-0.141
Token!
Token ID0
Feature activation+1.19e+00

Pos logit contributions
text+0.079
 parties+0.069
ashed+0.067
aws+0.066
ackers+0.065
Neg logit contributions
 ringing-0.078
 phrase-0.072
ller-0.072
 word-0.069
 drill-0.069
Token We
Token ID775
Feature activation+5.19e+00

Pos logit contributions
 parties+0.034
text+0.028
 recess+0.028
 outside+0.027
 tea+0.026
Neg logit contributions
 phrase-0.042
cut-0.038
 ringing-0.037
 guardian-0.036
 drill-0.036
Token need
Token ID761
Feature activation+1.42e+00

Pos logit contributions
 parties+0.123
text+0.098
 recess+0.096
 outside+0.093
 tea+0.089
Neg logit contributions
 phrase-0.205
cut-0.193
 ringing-0.186
 guardian-0.183
 claw-0.182
Token to
Token ID284
Feature activation-1.81e+00

Pos logit contributions
 parties+0.031
text+0.024
 recess+0.023
 outside+0.022
 tea+0.021
Neg logit contributions
 phrase-0.059
cut-0.055
 ringing-0.054
 guardian-0.053
rum-0.053
Token order
Token ID1502
Feature activation+3.91e-01

Pos logit contributions
 phrase+0.072
cut+0.068
 ringing+0.067
 guardian+0.066
 claw+0.065
Neg logit contributions
 parties-0.040
text-0.033
 recess-0.031
 outside-0.030
ashed-0.030
Token tickets
Token ID8587
Feature activation+1.76e+00

Pos logit contributions
 parties+0.008
text+0.006
 recess+0.006
 outside+0.006
 tea+0.006
Neg logit contributions
 phrase-0.016
cut-0.015
 ringing-0.015
 guardian-0.015
rum-0.015
Token for
Token ID329
Feature activation+1.88e+00

Pos logit contributions
 parties+0.037
text+0.035
 recess+0.032
|+0.030
 tea+0.029
Neg logit contributions
 phrase-0.068
 ringing-0.064
cut-0.063
 word-0.061
 drill-0.061
Token log
Token ID2604
Feature activation-2.76e+00

Pos logit contributions
 phrase+0.011
cut+0.010
 ringing+0.009
 guardian+0.009
 claw+0.009
Neg logit contributions
 parties-0.009
text-0.007
 recess-0.007
 outside-0.007
 together-0.007
Token in
Token ID287
Feature activation-1.32e+00

Pos logit contributions
 phrase+0.105
cut+0.101
 ringing+0.096
fly+0.095
 guardian+0.094
Neg logit contributions
 parties-0.068
 recess-0.056
text-0.055
 outside-0.050
 tea-0.048
Token the
Token ID262
Feature activation+2.07e+00

Pos logit contributions
 phrase+0.058
cut+0.054
 ringing+0.053
 guardian+0.053
 claw+0.053
Neg logit contributions
 parties-0.025
text-0.019
 outside-0.018
 recess-0.017
 tea-0.016
Token park
Token ID3952
Feature activation+7.10e-01

Pos logit contributions
 parties+0.052
text+0.042
 recess+0.041
 outside+0.040
 together+0.038
Neg logit contributions
 phrase-0.079
cut-0.074
 ringing-0.071
 guardian-0.070
 claw-0.069
Token had
Token ID550
Feature activation+3.52e+00

Pos logit contributions
 parties+0.013
text+0.011
 recess+0.010
 outside+0.009
 tea+0.009
Neg logit contributions
 phrase-0.031
cut-0.029
 ringing-0.029
fly-0.028
 guardian-0.028
Token brought
Token ID3181
Feature activation+4.53e+00

Pos logit contributions
 parties+0.091
text+0.075
 recess+0.074
 outside+0.072
 tea+0.067
Neg logit contributions
 phrase-0.130
cut-0.118
 ringing-0.118
 guardian-0.114
 claw-0.113
Token them
Token ID606
Feature activation+4.50e+00

Pos logit contributions
 parties+0.092
text+0.075
 recess+0.072
|+0.060
ashed+0.060
Neg logit contributions
 phrase-0.189
 ringing-0.172
cut-0.170
 word-0.168
fly-0.166
Token even
Token ID772
Feature activation+2.92e+00

Pos logit contributions
text+0.090
 parties+0.089
 recess+0.088
P+0.077
 Woo+0.076
Neg logit contributions
 phrase-0.162
 guardian-0.156
 ringing-0.155
fly-0.149
 word-0.146
Token closer
Token ID5699
Feature activation-1.86e+00

Pos logit contributions
 parties+0.065
text+0.050
 recess+0.049
 outside+0.049
 tea+0.046
Neg logit contributions
 phrase-0.121
cut-0.113
 ringing-0.110
rum-0.108
 guardian-0.108
Token together
Token ID1978
Feature activation+2.97e+00

Pos logit contributions
 phrase+0.085
cut+0.080
 ringing+0.078
rum+0.077
 guardian+0.077
Neg logit contributions
 parties-0.033
text-0.024
 recess-0.023
 outside-0.022
 tea-0.020
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 parties+0.051
text+0.039
 recess+0.037
 outside+0.031
 tea+0.031
Neg logit contributions
 phrase-0.137
cut-0.128
 ringing-0.125
rum-0.123
 guardian-0.123
Token that
Token ID326
Feature activation+9.36e-01

Pos logit contributions
 parties+0.008
text+0.007
 recess+0.006
ashed+0.006
 picnic+0.006
Neg logit contributions
 phrase-0.014
cut-0.014
 ringing-0.013
rum-0.013
 drill-0.013
Token he
Token ID339
Feature activation+2.71e+00

Pos logit contributions
 parties+0.016
text+0.013
 recess+0.011
 outside+0.011
ashed+0.010
Neg logit contributions
 phrase-0.041
cut-0.040
 ringing-0.039
 drill-0.038
 guardian-0.038
Token was
Token ID373
Feature activation-5.41e-01

Pos logit contributions
 parties+0.080
 recess+0.065
text+0.064
 outside+0.061
 tea+0.060
Neg logit contributions
 phrase-0.094
cut-0.086
 ringing-0.085
 claw-0.082
fly-0.082
Token not
Token ID407
Feature activation-2.58e+00

Pos logit contributions
 phrase+0.021
cut+0.020
rum+0.019
 ringing+0.018
fly+0.018
Neg logit contributions
 parties-0.014
 outside-0.011
 recess-0.011
text-0.010
 tea-0.010
Token having
Token ID1719
Feature activation+3.14e+00

Pos logit contributions
 phrase+0.093
cut+0.087
 ringing+0.086
rum+0.083
 guardian+0.082
Neg logit contributions
 parties-0.070
text-0.059
 recess-0.056
 outside-0.053
 tea-0.052
Token fun
Token ID1257
Feature activation+5.57e+00

Pos logit contributions
 parties+0.060
text+0.057
he+0.051
ashed+0.050
ashing+0.050
Neg logit contributions
 phrase-0.122
 ringing-0.118
 drill-0.111
 claw-0.109
 meaning-0.109
Token.
Token ID13
Feature activation+8.74e-01

Pos logit contributions
 parties+0.097
text+0.084
 recess+0.075
P+0.068
aws+0.068
Neg logit contributions
 phrase-0.241
 ringing-0.231
cut-0.228
rum-0.225
 drill-0.221
Token<|endoftext|>
Token ID50256
Feature activation+1.91e+00

Pos logit contributions
 parties+0.023
text+0.019
 recess+0.018
 outside+0.018
 tea+0.017
Neg logit contributions
 phrase-0.033
cut-0.030
 ringing-0.029
 guardian-0.029
rum-0.029
TokenL
Token ID43
Feature activation+3.12e+00

Pos logit contributions
 parties+0.056
ashed+0.049
ashing+0.049
ashes+0.045
text+0.044
Neg logit contributions
 phrase-0.058
 word-0.055
cut-0.051
fly-0.051
 drill-0.051
Tokenily
Token ID813
Feature activation+2.00e+00

Pos logit contributions
 parties+0.070
ashing+0.067
ashes+0.067
dates+0.066
ashed+0.064
Neg logit contributions
rum-0.100
omm-0.096
 adult-0.095
 phrase-0.093
 drill-0.090
Token and
Token ID290
Feature activation+9.76e-01

Pos logit contributions
 parties+0.050
 recess+0.041
text+0.040
 tea+0.037
 together+0.036
Neg logit contributions
 phrase-0.075
cut-0.071
 ringing-0.067
 claw-0.067
 guardian-0.066
Token it
Token ID340
Feature activation-4.30e-01

Pos logit contributions
 phrase+0.001
cut+0.001
 ringing+0.001
rum+0.001
 drill+0.001
Neg logit contributions
 parties-0.001
 recess-0.001
text-0.001
ashes-0.001
aws-0.001
Token was
Token ID373
Feature activation+6.95e-01

Pos logit contributions
 phrase+0.015
cut+0.014
 ringing+0.013
rum+0.013
 guardian+0.013
Neg logit contributions
 parties-0.013
text-0.010
 recess-0.010
 outside-0.010
 tea-0.010
Token dirty
Token ID11841
Feature activation-4.82e-01

Pos logit contributions
 parties+0.017
 outside+0.013
text+0.013
 recess+0.013
 tea+0.013
Neg logit contributions
 phrase-0.027
cut-0.026
 ringing-0.024
rum-0.024
 guardian-0.024
Token and
Token ID290
Feature activation+8.36e-01

Pos logit contributions
 phrase+0.018
 ringing+0.017
cut+0.017
rum+0.016
 word+0.016
Neg logit contributions
 parties-0.012
text-0.011
 recess-0.010
|-0.009
P-0.009
Token said
Token ID531
Feature activation+1.81e+00

Pos logit contributions
 parties+0.020
 outside+0.016
 together+0.016
 they+0.015
 tea+0.015
Neg logit contributions
 phrase-0.034
cut-0.030
 guardian-0.030
rum-0.030
 ringing-0.029
Token they
Token ID484
Feature activation+5.30e+00

Pos logit contributions
 parties+0.032
 outside+0.024
 together+0.024
text+0.024
 they+0.024
Neg logit contributions
 phrase-0.081
cut-0.076
 ringing-0.074
 guardian-0.074
rum-0.073
Token should
Token ID815
Feature activation-3.96e+00

Pos logit contributions
 parties+0.147
 recess+0.121
text+0.119
 outside+0.113
 tea+0.109
Neg logit contributions
 phrase-0.190
cut-0.177
 ringing-0.171
 claw-0.169
 drill-0.167
Token not
Token ID407
Feature activation-2.09e+00

Pos logit contributions
 phrase+0.166
cut+0.154
 ringing+0.150
 guardian+0.147
fly+0.147
Neg logit contributions
 parties-0.088
text-0.066
 outside-0.065
 recess-0.064
 tea-0.061
Token take
Token ID1011
Feature activation+3.98e+00

Pos logit contributions
 phrase+0.096
cut+0.091
 ringing+0.087
 guardian+0.086
rum+0.086
Neg logit contributions
 parties-0.038
 outside-0.028
 recess-0.026
text-0.025
 together-0.025
Token it
Token ID340
Feature activation+2.75e+00

Pos logit contributions
 parties+0.111
text+0.091
 recess+0.088
he+0.085
ashed+0.083
Neg logit contributions
 phrase-0.140
 ringing-0.128
 claw-0.126
cut-0.125
 drill-0.123
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
 parties+0.068
text+0.066
 recess+0.059
P+0.055
dates+0.055
Neg logit contributions
 ringing-0.087
cut-0.087
rum-0.087
 phrase-0.086
 claw-0.082
Token to
Token ID284
Feature activation-3.07e+00

Pos logit contributions
 parties+0.004
 outside+0.003
 tea+0.003
 recess+0.003
 together+0.003
Neg logit contributions
 phrase-0.011
cut-0.011
rum-0.011
fly-0.011
 guardian-0.010
Token explore
Token ID7301
Feature activation+2.06e+00

Pos logit contributions
 phrase+0.136
cut+0.129
 ringing+0.124
rum+0.123
fly+0.122
Neg logit contributions
 parties-0.063
 outside-0.045
 recess-0.043
text-0.043
 tea-0.043
Token so
Token ID523
Feature activation+4.39e-01

Pos logit contributions
 parties+0.046
text+0.040
ashed+0.037
 recess+0.036
ashes+0.034
Neg logit contributions
 phrase-0.081
cut-0.076
 ringing-0.075
 drill-0.073
 guardian-0.072
Token he
Token ID339
Feature activation+3.71e+00

Pos logit contributions
 parties+0.012
text+0.010
 recess+0.009
 outside+0.009
 tea+0.009
Neg logit contributions
 phrase-0.016
cut-0.015
 ringing-0.014
rum-0.014
 drill-0.014
Token went
Token ID1816
Feature activation+4.76e+00

Pos logit contributions
 parties+0.093
text+0.075
 recess+0.074
 outside+0.071
 tea+0.068
Neg logit contributions
 phrase-0.143
cut-0.132
 ringing-0.128
 claw-0.126
 guardian-0.125
Token outside
Token ID2354
Feature activation+1.32e+00

Pos logit contributions
 parties+0.093
text+0.079
 recess+0.067
aws+0.062
 tea+0.060
Neg logit contributions
 phrase-0.207
 ringing-0.191
cut-0.189
 guardian-0.187
 drill-0.185
Token.
Token ID13
Feature activation-9.70e-02

Pos logit contributions
 parties+0.032
text+0.031
aws+0.028
 recess+0.026
ashed+0.026
Neg logit contributions
 phrase-0.048
 ringing-0.046
cut-0.044
 drill-0.043
 claw-0.043
Token There
Token ID1318
Feature activation-2.38e+00

Pos logit contributions
 phrase+0.003
cut+0.003
 ringing+0.003
fly+0.003
 guardian+0.003
Neg logit contributions
 parties-0.003
text-0.002
 outside-0.002
 recess-0.002
 tea-0.002
Token,
Token ID11
Feature activation-8.55e-02

Pos logit contributions
 phrase+0.093
cut+0.088
 ringing+0.085
 drill+0.083
 claw+0.083
Neg logit contributions
 parties-0.057
 recess-0.046
text-0.045
 outside-0.042
 tea-0.041
Token the
Token ID262
Feature activation-1.87e+00

Pos logit contributions
 phrase+0.003
cut+0.003
 ringing+0.003
 drill+0.003
 claw+0.003
Neg logit contributions
 parties-0.002
text-0.002
 recess-0.002
 outside-0.002
 tea-0.002
Token moon
Token ID8824
Feature activation-4.12e+00

Pos logit contributions
rum+0.069
 phrase+0.068
fly+0.065
ric+0.064
ller+0.063
Neg logit contributions
 outside-0.050
 parties-0.046
 play-0.042
 tea-0.042
 party-0.041
Token boss
Token ID6478
Feature activation-2.86e+00

Pos logit contributions
 parties+0.039
 outside+0.031
 recess+0.031
text+0.030
 tea+0.030
Neg logit contributions
 phrase-0.054
cut-0.050
rum-0.049
 ringing-0.048
 claw-0.047
Token of
Token ID286
Feature activation-1.07e+00

Pos logit contributions
 phrase+0.112
cut+0.107
 ringing+0.102
rum+0.102
fly+0.100
Neg logit contributions
 parties-0.068
text-0.055
 recess-0.054
 tea-0.050
 outside-0.050
Token the
Token ID262
Feature activation+1.67e+00

Pos logit contributions
 phrase+0.044
cut+0.041
fly+0.040
rum+0.040
 guardian+0.040
Neg logit contributions
 parties-0.025
 outside-0.019
 recess-0.019
 tea-0.018
 together-0.017
Token forest
Token ID8222
Feature activation-6.11e-01

Pos logit contributions
 parties+0.051
 outside+0.042
 recess+0.042
 tea+0.041
 party+0.041
Neg logit contributions
 phrase-0.055
rum-0.051
cut-0.049
fly-0.048
ller-0.048
Token."
Token ID526
Feature activation-5.82e-01

Pos logit contributions
 phrase+0.028
cut+0.027
 ringing+0.026
 guardian+0.025
fly+0.025
Neg logit contributions
 parties-0.010
text-0.008
 recess-0.008
 outside-0.007
 tea-0.006
Token
Token ID198
Feature activation-6.44e-01

Pos logit contributions
 phrase+0.023
cut+0.022
 ringing+0.021
rum+0.021
 claw+0.021
Neg logit contributions
 parties-0.013
text-0.010
 recess-0.010
 outside-0.009
 tea-0.009
Token
Token ID198
Feature activation-9.27e-01

Pos logit contributions
 phrase+0.028
cut+0.026
rum+0.025
 guardian+0.025
 ringing+0.025
Neg logit contributions
 parties-0.014
 recess-0.010
text-0.010
 outside-0.009
 tea-0.009
TokenThe
Token ID464
Feature activation-4.41e+00

Pos logit contributions
 phrase+0.036
cut+0.033
 ringing+0.032
rum+0.032
 guardian+0.032
Neg logit contributions
 parties-0.023
text-0.018
 recess-0.018
 outside-0.018
 tea-0.017
Token bird
Token ID6512
Feature activation-4.77e+00

Pos logit contributions
rum+0.146
 phrase+0.144
ller+0.128
ric+0.125
cut+0.124
Neg logit contributions
 parties-0.126
 outside-0.111
 party-0.109
 tea-0.108
 recess-0.107
Token thought
Token ID1807
Feature activation-1.56e+00

Pos logit contributions
 phrase+0.187
omm+0.174
 drill+0.172
 guardian+0.172
rum+0.171
Neg logit contributions
 parties-0.100
 together-0.092
 outside-0.088
 around-0.084
 played-0.084
Token for
Token ID329
Feature activation-1.32e+00

Pos logit contributions
 phrase+0.067
cut+0.063
 ringing+0.061
rum+0.060
 guardian+0.060
Neg logit contributions
 parties-0.032
text-0.024
 recess-0.024
 outside-0.023
 tea-0.022
Token to
Token ID284
Feature activation-1.77e+00

Pos logit contributions
 parties+0.012
 outside+0.009
 recess+0.008
 tea+0.008
text+0.008
Neg logit contributions
 phrase-0.029
cut-0.027
 guardian-0.026
rum-0.026
fly-0.026
Token play
Token ID711
Feature activation+5.84e+00

Pos logit contributions
 phrase+0.089
cut+0.084
 ringing+0.082
rum+0.081
 guardian+0.081
Neg logit contributions
 parties-0.025
text-0.015
 outside-0.015
 recess-0.015
 tea-0.013
Token in
Token ID287
Feature activation+2.23e-01

Pos logit contributions
text+0.124
 parties+0.109
 recess+0.103
aws+0.100
ashed+0.098
Neg logit contributions
cut-0.222
 drill-0.220
 phrase-0.213
 ringing-0.212
pe-0.203
Token the
Token ID262
Feature activation+3.12e+00

Pos logit contributions
ashed+0.006
 parties+0.006
ashing+0.005
text+0.005
ashes+0.005
Neg logit contributions
 ringing-0.007
 drill-0.007
 phrase-0.007
cut-0.007
 beast-0.007
Token forest
Token ID8222
Feature activation+2.62e-01

Pos logit contributions
 parties+0.102
text+0.093
 together+0.089
 recess+0.088
ashed+0.088
Neg logit contributions
 phrase-0.078
cut-0.077
 drill-0.076
 guardian-0.075
 ringing-0.074
Token with
Token ID351
Feature activation+1.92e+00

Pos logit contributions
 parties+0.005
text+0.005
 recess+0.005
ashed+0.004
aws+0.004
Neg logit contributions
 phrase-0.011
cut-0.011
 ringing-0.010
fly-0.010
 drill-0.010
Token his
Token ID465
Feature activation+8.93e-01

Pos logit contributions
 parties+0.066
text+0.060
ashes+0.057
ashing+0.057
 recess+0.057
Neg logit contributions
cut-0.047
 phrase-0.047
 claw-0.045
 drill-0.045
 ringing-0.044
Token friends
Token ID2460
Feature activation+4.90e+00

Pos logit contributions
 parties+0.020
text+0.015
 outside+0.015
 recess+0.015
 tea+0.014
Neg logit contributions
 phrase-0.037
cut-0.035
 ringing-0.034
rum-0.033
 guardian-0.033
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
 parties+0.089
text+0.079
 recess+0.074
|+0.058
ashes+0.057
Neg logit contributions
 phrase-0.212
cut-0.211
 ringing-0.199
fly-0.196
 drill-0.195
Token One
Token ID1881
Feature activation-1.76e-01

Pos logit contributions
 parties+0.015
 recess+0.013
text+0.013
ashed+0.012
 outside+0.012
Neg logit contributions
 phrase-0.012
 drill-0.012
ric-0.012
cut-0.012
fly-0.011
Token evening
Token ID6180
Feature activation+1.67e-02

Pos logit contributions
 phrase+0.008
cut+0.007
 ringing+0.007
 guardian+0.007
fly+0.007
Neg logit contributions
 parties-0.004
 recess-0.003
text-0.003
aws-0.002
 outside-0.002
Token,
Token ID11
Feature activation+5.19e-01

Pos logit contributions
 parties+0.005
text+0.004
 outside+0.004
 recess+0.004
 together+0.003
Neg logit contributions
 phrase-0.014
cut-0.013
 ringing-0.013
 guardian-0.012
rum-0.012
Token "
Token ID366
Feature activation-1.96e+00

Pos logit contributions
 they+0.012
 together+0.011
 parties+0.011
 outside+0.010
 Lily+0.010
Neg logit contributions
 phrase-0.020
rum-0.019
 claw-0.019
 beast-0.019
ric-0.019
TokenThat
Token ID2504
Feature activation-2.11e+00

Pos logit contributions
 phrase+0.084
 drill+0.079
rum+0.079
 claw+0.079
cut+0.078
Neg logit contributions
 parties-0.036
We-0.030
text-0.029
 together-0.028
P-0.028
Token sounds
Token ID5238
Feature activation-9.48e-01

Pos logit contributions
 phrase+0.097
rum+0.094
cut+0.093
 claw+0.089
ric+0.088
Neg logit contributions
 parties-0.032
 outside-0.026
 tea-0.023
text-0.023
 party-0.022
Token like
Token ID588
Feature activation+1.03e+00

Pos logit contributions
 phrase+0.037
 ringing+0.033
 word+0.033
cut+0.032
 meaning+0.032
Neg logit contributions
 parties-0.023
 recess-0.020
text-0.019
aws-0.017
P-0.017
Token a
Token ID257
Feature activation+2.48e+00

Pos logit contributions
 parties+0.017
 outside+0.011
 recess+0.011
 together+0.011
 tea+0.011
Neg logit contributions
 phrase-0.049
cut-0.047
rum-0.045
fly-0.045
 guardian-0.045
Token plan
Token ID1410
Feature activation-1.30e+00

Pos logit contributions
 parties+0.054
 tea+0.044
 party+0.042
 outside+0.042
 together+0.041
Neg logit contributions
 phrase-0.100
cut-0.098
rum-0.097
ric-0.092
ller-0.090
Token.
Token ID13
Feature activation+7.77e-01

Pos logit contributions
 phrase+0.056
cut+0.053
 ringing+0.052
rum+0.051
 guardian+0.051
Neg logit contributions
 parties-0.025
text-0.020
 recess-0.019
 outside-0.017
 tea-0.017
Token You
Token ID921
Feature activation+1.36e+00

Pos logit contributions
 parties+0.012
 Lily+0.012
text+0.011
 together+0.011
 Ben+0.010
Neg logit contributions
 phrase-0.034
cut-0.033
 ringing-0.033
 swollen-0.032
 cracking-0.031
Token are
Token ID389
Feature activation-5.81e-01

Pos logit contributions
 parties+0.030
text+0.023
 outside+0.022
 together+0.022
 party+0.021
Neg logit contributions
 phrase-0.057
cut-0.054
rum-0.052
 ringing-0.052
 drill-0.050
Token a
Token ID257
Feature activation+2.85e+00

Pos logit contributions
 phrase+0.028
cut+0.026
rum+0.025
 drill+0.025
 claw+0.024
Neg logit contributions
 parties-0.010
 outside-0.008
 together-0.008
 tea-0.007
 both-0.006
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
 parties+0.044
text+0.034
 recess+0.033
 outside+0.031
 tea+0.030
Neg logit contributions
 phrase-0.090
cut-0.084
 ringing-0.083
rum-0.081
 drill-0.080
Token his
Token ID465
Feature activation-1.60e-01

Pos logit contributions
 parties+0.044
 outside+0.033
 together+0.032
 recess+0.032
 they+0.032
Neg logit contributions
 phrase-0.084
cut-0.077
fly-0.075
 guardian-0.075
rum-0.074
Token mom
Token ID1995
Feature activation+6.49e-01

Pos logit contributions
 phrase+0.006
cut+0.006
rum+0.005
fly+0.005
 ringing+0.005
Neg logit contributions
 parties-0.004
 outside-0.003
 recess-0.003
text-0.003
 tea-0.003
Tokenmy
Token ID1820
Feature activation+1.92e+00

Pos logit contributions
 parties+0.014
 outside+0.012
 together+0.012
 everywhere+0.010
 play+0.010
Neg logit contributions
 phrase-0.028
 beast-0.025
rum-0.025
 guardian-0.025
cut-0.025
Token bought
Token ID5839
Feature activation+1.75e+00

Pos logit contributions
 parties+0.047
 outside+0.038
text+0.036
 together+0.036
 recess+0.035
Neg logit contributions
 phrase-0.075
cut-0.068
rum-0.067
 ringing-0.066
 guardian-0.066
Token it
Token ID340
Feature activation+1.48e+00

Pos logit contributions
 parties+0.043
text+0.034
 recess+0.033
 outside+0.033
 tea+0.031
Neg logit contributions
 phrase-0.069
cut-0.064
 ringing-0.062
 guardian-0.061
rum-0.061
Token for
Token ID329
Feature activation+1.54e+00

Pos logit contributions
text+0.035
 parties+0.034
 recess+0.031
|+0.028
dates+0.026
Neg logit contributions
 phrase-0.054
 ringing-0.052
cut-0.051
 word-0.050
 drill-0.049
Token him
Token ID683
Feature activation+3.00e+00

Pos logit contributions
 parties+0.029
text+0.023
 recess+0.021
 outside+0.021
 tea+0.019
Neg logit contributions
 phrase-0.068
cut-0.064
 ringing-0.063
rum-0.061
 drill-0.061
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
 parties+0.057
text+0.049
 recess+0.047
P+0.039
|+0.037
Neg logit contributions
 phrase-0.128
cut-0.121
 ringing-0.121
rum-0.119
 claw-0.117
Token
Token ID198
Feature activation+4.87e-01

Pos logit contributions
 parties+0.003
 outside+0.002
text+0.002
 together+0.002
 recess+0.002
Neg logit contributions
 phrase-0.009
cut-0.008
 ringing-0.008
 guardian-0.008
fly-0.008
Token
Token ID198
Feature activation-2.15e-02

Pos logit contributions
 parties+0.014
 recess+0.012
 tea+0.011
text+0.011
ashed+0.011
Neg logit contributions
 phrase-0.016
cut-0.015
rum-0.015
ller-0.015
 drill-0.015
Token rainbow
Token ID27223
Feature activation+1.02e-01

Pos logit contributions
 parties+0.021
 outside+0.019
 tea+0.018
 party+0.017
 picnic+0.017
Neg logit contributions
 phrase-0.032
rum-0.029
cut-0.029
inct-0.027
ric-0.026
Token in
Token ID287
Feature activation-9.80e-01

Pos logit contributions
 parties+0.002
text+0.001
 recess+0.001
ashed+0.001
aws+0.001
Neg logit contributions
 phrase-0.005
 ringing-0.005
cut-0.005
fly-0.004
 drill-0.004
Token the
Token ID262
Feature activation+2.61e+00

Pos logit contributions
 phrase+0.045
cut+0.042
 ringing+0.041
 guardian+0.040
rum+0.040
Neg logit contributions
 parties-0.018
text-0.013
 recess-0.012
 outside-0.012
 tea-0.011
Token sky
Token ID6766
Feature activation-9.42e-01

Pos logit contributions
 parties+0.082
text+0.069
 recess+0.068
 outside+0.067
 tea+0.065
Neg logit contributions
 phrase-0.084
cut-0.077
 ringing-0.074
 guardian-0.072
rum-0.072
Token.
Token ID13
Feature activation-9.19e-02

Pos logit contributions
 phrase+0.052
cut+0.049
rum+0.048
 guardian+0.047
 drill+0.047
Neg logit contributions
 parties-0.009
 outside-0.005
 together-0.004
 recess-0.003
 tea-0.003
Token She
Token ID1375
Feature activation-1.80e+00

Pos logit contributions
 phrase+0.003
cut+0.003
 ringing+0.003
 drill+0.003
 claw+0.003
Neg logit contributions
 parties-0.002
text-0.002
 recess-0.002
 together-0.002
 outside-0.002
Token wondered
Token ID14028
Feature activation-3.57e+00

Pos logit contributions
 phrase+0.081
cut+0.079
rum+0.079
 drill+0.075
 guardian+0.074
Neg logit contributions
 played-0.030
 parties-0.026
 both-0.026
 outside-0.026
 went-0.025
Token how
Token ID703
Feature activation+6.24e-01

Pos logit contributions
 phrase+0.123
cut+0.113
 ringing+0.109
 guardian+0.106
rum+0.106
Neg logit contributions
 parties-0.105
text-0.087
 recess-0.086
 outside-0.084
 tea-0.081
Token it
Token ID340
Feature activation-8.24e-01

Pos logit contributions
 parties+0.019
 recess+0.015
text+0.015
 party+0.014
aws+0.014
Neg logit contributions
 phrase-0.021
 ringing-0.019
cut-0.019
 guardian-0.018
 claw-0.018
Token got
Token ID1392
Feature activation+1.29e+00

Pos logit contributions
 phrase+0.032
cut+0.029
rum+0.028
 ringing+0.028
 guardian+0.028
Neg logit contributions
 parties-0.021
 outside-0.017
text-0.017
 recess-0.016
 together-0.016
Token there
Token ID612
Feature activation+1.93e+00

Pos logit contributions
 parties+0.031
text+0.027
 recess+0.025
dates+0.022
aws+0.021
Neg logit contributions
 ringing-0.049
 phrase-0.048
cut-0.047
 claw-0.045
fly-0.045
Token you
Token ID345
Feature activation+4.04e+00

Pos logit contributions
 phrase+0.024
 ringing+0.023
fly+0.022
 word+0.022
cut+0.022
Neg logit contributions
 parties-0.013
text-0.011
ashed-0.010
 recess-0.010
ashing-0.009
Token for
Token ID329
Feature activation+1.53e+00

Pos logit contributions
 parties+0.086
text+0.070
 recess+0.069
 outside+0.060
 tea+0.060
Neg logit contributions
 phrase-0.169
cut-0.158
 ringing-0.156
rum-0.152
fly-0.151
Token an
Token ID281
Feature activation+1.70e+00

Pos logit contributions
 parties+0.032
 recess+0.024
 outside+0.024
 together+0.023
text+0.022
Neg logit contributions
 phrase-0.066
cut-0.063
 guardian-0.060
fly-0.060
rum-0.059
Token ice
Token ID4771
Feature activation-1.46e-01

Pos logit contributions
 parties+0.049
text+0.041
 recess+0.040
 outside+0.039
 together+0.038
Neg logit contributions
 phrase-0.059
cut-0.054
 ringing-0.053
rum-0.051
 guardian-0.051
Token-
Token ID12
Feature activation-1.52e+00

Pos logit contributions
 phrase+0.005
cut+0.005
 ringing+0.004
rum+0.004
 drill+0.004
Neg logit contributions
 parties-0.004
text-0.004
 recess-0.004
 outside-0.003
aws-0.003
Tokencream
Token ID36277
Feature activation-1.76e+00

Pos logit contributions
 phrase+0.053
 guardian+0.047
 claw+0.047
rum+0.047
 ringing+0.046
Neg logit contributions
 parties-0.043
 outside-0.038
 together-0.036
 recess-0.036
text-0.035
Token!"
Token ID2474
Feature activation+9.21e-01

Pos logit contributions
 phrase+0.075
cut+0.069
rum+0.067
 ringing+0.067
 guardian+0.067
Neg logit contributions
 parties-0.038
 outside-0.029
 recess-0.027
text-0.027
 together-0.027
Token
Token ID198
Feature activation+5.38e-03

Pos logit contributions
 parties+0.023
text+0.019
 recess+0.019
 tea+0.017
 picnic+0.017
Neg logit contributions
 phrase-0.035
cut-0.031
 ringing-0.030
 word-0.030
 drill-0.030
Token
Token ID198
Feature activation-3.41e-01

Pos logit contributions
 parties+0.000
 recess+0.000
text+0.000
ashed+0.000
 tea+0.000
Neg logit contributions
 phrase-0.000
cut-0.000
 ringing-0.000
rum-0.000
 drill-0.000
TokenJack
Token ID14295
Feature activation-8.87e-01

Pos logit contributions
 phrase+0.013
cut+0.012
 ringing+0.012
 guardian+0.011
rum+0.011
Neg logit contributions
 parties-0.009
text-0.007
 recess-0.007
 outside-0.007
 tea-0.007
Token smiled
Token ID13541
Feature activation-1.22e+00

Pos logit contributions
 phrase+0.033
cut+0.031
rum+0.031
 drill+0.031
 ringing+0.031
Neg logit contributions
 parties-0.021
 together-0.018
 outside-0.018
text-0.017
ashed-0.017
Token and
Token ID290
Feature activation+1.42e+00

Pos logit contributions
 phrase+0.184
cut+0.173
 ringing+0.167
rum+0.166
 claw+0.166
Neg logit contributions
 parties-0.062
 outside-0.041
text-0.041
 recess-0.040
 together-0.038
Token learn
Token ID2193
Feature activation-1.00e+00

Pos logit contributions
 parties+0.030
 together+0.023
 they+0.022
 outside+0.022
 play+0.022
Neg logit contributions
 phrase-0.062
rum-0.059
omm-0.057
cut-0.057
 drill-0.056
Token something
Token ID1223
Feature activation+7.89e-01

Pos logit contributions
 phrase+0.041
cut+0.039
 ringing+0.038
 claw+0.037
rum+0.037
Neg logit contributions
 parties-0.023
 outside-0.017
text-0.017
 together-0.016
 recess-0.016
Token new
Token ID649
Feature activation+1.12e+00

Pos logit contributions
 parties+0.019
text+0.015
 recess+0.015
 outside+0.015
 tea+0.014
Neg logit contributions
 phrase-0.031
cut-0.029
 ringing-0.028
rum-0.028
 guardian-0.028
Token.
Token ID13
Feature activation+3.38e-01

Pos logit contributions
 parties+0.016
text+0.010
 outside+0.009
 recess+0.009
 together+0.008
Neg logit contributions
 phrase-0.056
cut-0.053
 ringing-0.051
 guardian-0.051
 claw-0.051
Token
Token ID220
Feature activation-1.72e+00

Pos logit contributions
 parties+0.007
text+0.006
 recess+0.006
 outside+0.005
 tea+0.005
Neg logit contributions
 phrase-0.014
cut-0.013
 ringing-0.013
rum-0.013
 guardian-0.013
Token They
Token ID1119
Feature activation+1.34e+00

Pos logit contributions
 phrase+0.071
cut+0.068
rum+0.066
 drill+0.065
 ringing+0.064
Neg logit contributions
 parties-0.037
 recess-0.028
text-0.026
 tea-0.026
 party-0.025
Token shared
Token ID4888
Feature activation+1.71e+00

Pos logit contributions
 parties+0.026
 outside+0.017
 recess+0.017
text+0.017
 together+0.016
Neg logit contributions
 phrase-0.061
rum-0.056
cut-0.056
 ringing-0.055
 drill-0.055
Token stories
Token ID3923
Feature activation+1.02e+00

Pos logit contributions
 parties+0.031
text+0.023
 recess+0.022
 outside+0.022
 tea+0.020
Neg logit contributions
 phrase-0.077
cut-0.073
 ringing-0.071
rum-0.070
 guardian-0.070
Token,
Token ID11
Feature activation+2.06e+00

Pos logit contributions
 parties+0.019
text+0.015
 recess+0.014
 outside+0.014
 tea+0.013
Neg logit contributions
 phrase-0.045
cut-0.042
 ringing-0.041
 guardian-0.040
rum-0.040
Token laughed
Token ID13818
Feature activation+1.01e+00

Pos logit contributions
 parties+0.042
 together+0.028
 outside+0.028
 tea+0.028
 they+0.026
Neg logit contributions
 phrase-0.095
rum-0.089
inct-0.088
cut-0.088
 guardian-0.086
Token a
Token ID257
Feature activation+1.85e+00

Pos logit contributions
 phrase+0.025
cut+0.023
rum+0.023
 ringing+0.022
fly+0.022
Neg logit contributions
 parties-0.008
 outside-0.005
 recess-0.005
text-0.005
 tea-0.005
Token special
Token ID2041
Feature activation-2.04e-01

Pos logit contributions
 parties+0.038
 picnic+0.036
 tea+0.036
 party+0.035
 outside+0.035
Neg logit contributions
rum-0.080
 phrase-0.075
cut-0.072
inct-0.069
ric-0.068
Token surprise
Token ID5975
Feature activation-1.24e+00

Pos logit contributions
rum+0.007
 phrase+0.007
ric+0.007
cut+0.006
inct+0.006
Neg logit contributions
 parties-0.006
 outside-0.005
 tea-0.005
 party-0.005
 picnic-0.005
Token.
Token ID13
Feature activation-5.18e-01

Pos logit contributions
 phrase+0.054
cut+0.051
 ringing+0.050
 guardian+0.048
 drill+0.048
Neg logit contributions
 parties-0.025
text-0.019
 recess-0.018
 outside-0.016
aws-0.016
Token It
Token ID632
Feature activation-4.71e+00

Pos logit contributions
cut+0.022
 phrase+0.022
fly+0.020
 ringing+0.020
 claw+0.020
Neg logit contributions
 parties-0.010
 outside-0.009
 Lily-0.008
text-0.008
 recess-0.008
Token was
Token ID373
Feature activation-1.72e+00

Pos logit contributions
 phrase+0.174
cut+0.162
 ringing+0.156
rum+0.154
 guardian+0.152
Neg logit contributions
 parties-0.126
text-0.102
 outside-0.101
 recess-0.099
 together-0.095
Token a
Token ID257
Feature activation+2.15e+00

Pos logit contributions
 phrase+0.086
cut+0.082
rum+0.081
fly+0.077
 claw+0.077
Neg logit contributions
 parties-0.026
 outside-0.018
 tea-0.016
 together-0.015
 recess-0.015
Token bright
Token ID6016
Feature activation-1.30e+00

Pos logit contributions
 parties+0.049
 tea+0.044
 party+0.044
 picnic+0.044
 outside+0.044
Neg logit contributions
rum-0.086
 phrase-0.086
cut-0.080
ric-0.077
inct-0.075
Token pink
Token ID11398
Feature activation-9.89e-01

Pos logit contributions
 phrase+0.051
rum+0.048
cut+0.048
ric+0.046
inct+0.045
Neg logit contributions
 parties-0.030
 outside-0.028
 tea-0.026
 party-0.025
 picnic-0.024
Token butterfly
Token ID35113
Feature activation+4.37e-02

Pos logit contributions
 phrase+0.035
cut+0.033
rum+0.032
 ringing+0.030
 guardian+0.030
Neg logit contributions
 parties-0.028
 outside-0.024
 tea-0.023
 party-0.022
 picnic-0.022
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
 parties+0.001
text+0.001
 recess+0.001
aws+0.001
ashed+0.001
Neg logit contributions
 phrase-0.002
cut-0.002
 ringing-0.002
fly-0.002
 guardian-0.002
Token en
Token ID551
Feature activation-4.94e-01

Pos logit contributions
 phrase+0.016
cut+0.015
 ringing+0.014
rum+0.014
 guardian+0.014
Neg logit contributions
 parties-0.010
text-0.008
 outside-0.008
 recess-0.008
 tea-0.008
Tokenvious
Token ID1442
Feature activation-1.89e+00

Pos logit contributions
 phrase+0.017
cut+0.016
 ringing+0.015
 guardian+0.015
 claw+0.015
Neg logit contributions
 parties-0.014
text-0.012
 recess-0.011
 tea-0.011
 outside-0.011
Token of
Token ID286
Feature activation-7.66e-01

Pos logit contributions
 phrase+0.078
cut+0.072
 ringing+0.072
 drill+0.070
rum+0.070
Neg logit contributions
 parties-0.042
text-0.032
 recess-0.032
 tea-0.029
 picnic-0.028
Token him
Token ID683
Feature activation+1.56e+00

Pos logit contributions
 phrase+0.034
cut+0.032
 ringing+0.031
rum+0.031
 guardian+0.031
Neg logit contributions
 parties-0.015
text-0.011
 recess-0.011
 outside-0.010
 tea-0.010
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
 parties+0.027
text+0.021
 recess+0.021
ashed+0.017
 tea+0.017
Neg logit contributions
 phrase-0.071
cut-0.067
 ringing-0.066
rum-0.065
fly-0.064
Token He
Token ID679
Feature activation-2.14e+00

Pos logit contributions
 phrase+0.007
cut+0.007
 ringing+0.007
rum+0.007
 guardian+0.007
Neg logit contributions
 parties-0.006
text-0.005
 outside-0.004
 recess-0.004
 tea-0.004
Token did
Token ID750
Feature activation-3.68e+00

Pos logit contributions
 phrase+0.085
cut+0.079
rum+0.078
 ringing+0.076
 guardian+0.075
Neg logit contributions
 parties-0.051
 outside-0.040
text-0.039
 recess-0.037
 together-0.037
Token not
Token ID407
Feature activation-3.52e+00

Pos logit contributions
cut+0.124
 phrase+0.124
 sprite+0.116
fly+0.114
bul+0.114
Neg logit contributions
 parties-0.112
 outside-0.091
 play-0.089
 together-0.088
 their-0.088
Token like
Token ID588
Feature activation+1.13e+00

Pos logit contributions
 phrase+0.146
cut+0.136
 ringing+0.133
 guardian+0.130
 claw+0.130
Neg logit contributions
 parties-0.077
text-0.062
 recess-0.058
 outside-0.057
 tea-0.054
Token to
Token ID284
Feature activation-3.07e+00

Pos logit contributions
 parties+0.026
text+0.020
 recess+0.020
 outside+0.019
 tea+0.018
Neg logit contributions
 phrase-0.046
cut-0.043
 ringing-0.042
 guardian-0.041
rum-0.041
Token shoot
Token ID2686
Feature activation+1.70e+00

Pos logit contributions
 phrase+0.108
cut+0.101
 ringing+0.100
 claw+0.098
 guardian+0.095
Neg logit contributions
 parties-0.083
text-0.073
 outside-0.068
 recess-0.066
ashed-0.065
Token hang
Token ID8181
Feature activation+2.04e+00

Pos logit contributions
 phrase+0.108
cut+0.105
rum+0.102
 beast+0.100
ller+0.099
Neg logit contributions
 parties-0.065
 outside-0.056
 play-0.054
 together-0.049
 recess-0.048
Token it
Token ID340
Feature activation+1.59e+00

Pos logit contributions
 parties+0.043
 outside+0.034
 recess+0.033
 together+0.032
text+0.032
Neg logit contributions
 phrase-0.086
cut-0.082
rum-0.078
 ringing-0.077
 guardian-0.077
Token.
Token ID13
Feature activation+2.94e-01

Pos logit contributions
 parties+0.032
text+0.027
 recess+0.025
aws+0.022
ashed+0.021
Neg logit contributions
 phrase-0.069
 ringing-0.064
cut-0.062
 word-0.061
 claw-0.061
Token But
Token ID887
Feature activation-3.55e+00

Pos logit contributions
 parties+0.008
text+0.006
 outside+0.006
 recess+0.006
 together+0.006
Neg logit contributions
 phrase-0.011
cut-0.010
 ringing-0.010
 guardian-0.010
rum-0.010
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 phrase+0.160
cut+0.148
 ringing+0.144
 drill+0.144
rum+0.143
Neg logit contributions
 parties-0.069
 outside-0.051
 together-0.049
text-0.049
 recess-0.048
Token wall
Token ID3355
Feature activation-7.34e+00

Pos logit contributions
rum+0.044
 phrase+0.041
cut+0.040
ller+0.037
ric+0.037
Neg logit contributions
 parties-0.033
 outside-0.029
 tea-0.029
 recess-0.029
 party-0.028
Token had
Token ID550
Feature activation+1.50e+00

Pos logit contributions
 phrase+0.314
rum+0.305
ric+0.294
 guardian+0.288
cut+0.287
Neg logit contributions
 parties-0.126
 outside-0.120
 together-0.118
 around-0.117
 they-0.111
Token no
Token ID645
Feature activation+1.36e+00

Pos logit contributions
 parties+0.032
 outside+0.023
text+0.023
 tea+0.023
 everywhere+0.022
Neg logit contributions
cut-0.066
 phrase-0.065
rum-0.062
fly-0.059
 sprite-0.058
Token hook
Token ID8011
Feature activation-1.64e+00

Pos logit contributions
 parties+0.042
 tea+0.036
 recess+0.035
 together+0.035
 outside+0.035
Neg logit contributions
cut-0.044
 phrase-0.043
rum-0.043
fly-0.040
omm-0.040
Token to
Token ID284
Feature activation-2.23e+00

Pos logit contributions
 phrase+0.072
cut+0.068
 ringing+0.067
 claw+0.065
 guardian+0.065
Neg logit contributions
 parties-0.031
text-0.024
 recess-0.023
 outside-0.021
 tea-0.020
Token hang
Token ID8181
Feature activation+1.93e+00

Pos logit contributions
 phrase+0.083
cut+0.077
rum+0.073
 guardian+0.072
 ringing+0.072
Neg logit contributions
 parties-0.060
 recess-0.048
 outside-0.048
text-0.046
 tea-0.045
Token and
Token ID290
Feature activation+1.47e-02

Pos logit contributions
 phrase+0.114
cut+0.107
 ringing+0.104
rum+0.102
 guardian+0.102
Neg logit contributions
 parties-0.048
text-0.035
 recess-0.035
 outside-0.033
 tea-0.031
Token sm
Token ID895
Feature activation-1.74e+00

Pos logit contributions
 Lily+0.000
 they+0.000
 their+0.000
 tea+0.000
 together+0.000
Neg logit contributions
 phrase-0.001
rum-0.001
 guardian-0.001
cut-0.001
ric-0.001
Tokenoky
Token ID31375
Feature activation-3.79e+00

Pos logit contributions
 phrase+0.055
rum+0.050
cut+0.049
 claw+0.049
fly+0.048
Neg logit contributions
 parties-0.054
text-0.044
 picnic-0.044
 party-0.042
 tea-0.042
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
 phrase+0.188
cut+0.178
 ringing+0.173
rum+0.171
 guardian+0.170
Neg logit contributions
 parties-0.053
text-0.034
 recess-0.033
 outside-0.031
 tea-0.027
Token The
Token ID383
Feature activation-3.21e+00

Pos logit contributions
 phrase+0.000
cut+0.000
 ringing+0.000
 guardian+0.000
rum+0.000
Neg logit contributions
 parties-0.000
text-0.000
 recess-0.000
 outside-0.000
 tea-0.000
Token fire
Token ID2046
Feature activation-6.27e+00

Pos logit contributions
 phrase+0.102
rum+0.099
cut+0.090
 guardian+0.088
ller+0.087
Neg logit contributions
 parties-0.096
 tea-0.091
 outside-0.088
 party-0.084
 recess-0.083
Token alarm
Token ID10436
Feature activation-6.34e+00

Pos logit contributions
 phrase+0.234
rum+0.219
cut+0.212
 guardian+0.211
 definition+0.205
Neg logit contributions
 parties-0.147
 outside-0.143
text-0.132
 together-0.129
 tea-0.128
Token went
Token ID1816
Feature activation+4.08e+00

Pos logit contributions
 phrase+0.248
cut+0.231
rum+0.229
 guardian+0.227
ric+0.216
Neg logit contributions
 parties-0.150
 outside-0.132
text-0.125
 tea-0.119
 together-0.119
Token off
Token ID572
Feature activation-6.39e-01

Pos logit contributions
 parties+0.067
 outside+0.052
text+0.045
 recess+0.045
 together+0.044
Neg logit contributions
 phrase-0.193
cut-0.181
rum-0.177
 guardian-0.175
 ringing-0.172
Token.
Token ID13
Feature activation-2.76e-02

Pos logit contributions
 phrase+0.029
cut+0.027
 ringing+0.027
rum+0.026
 drill+0.026
Neg logit contributions
 parties-0.011
text-0.008
 recess-0.008
 outside-0.007
aws-0.007
Token Sara
Token ID24799
Feature activation+3.78e-01

Pos logit contributions
 phrase+0.001
cut+0.001
 ringing+0.001
 guardian+0.001
fly+0.001
Neg logit contributions
 parties-0.000
text-0.000
 recess-0.000
 outside-0.000
 together-0.000
Token job
Token ID1693
Feature activation-1.32e+00

Pos logit contributions
 phrase+0.031
cut+0.029
fly+0.029
 claw+0.028
 drill+0.028
Neg logit contributions
 parties-0.016
 outside-0.012
 together-0.011
 recess-0.011
text-0.011
Token.
Token ID13
Feature activation+3.57e-01

Pos logit contributions
 phrase+0.063
cut+0.059
 ringing+0.058
 guardian+0.057
rum+0.057
Neg logit contributions
 parties-0.021
text-0.014
 recess-0.013
 outside-0.013
 together-0.012
Token
Token ID198
Feature activation+6.13e-01

Pos logit contributions
 parties+0.008
 recess+0.006
text+0.006
 outside+0.006
 tea+0.006
Neg logit contributions
 phrase-0.014
cut-0.013
rum-0.013
 ringing-0.013
 drill-0.013
Token
Token ID198
Feature activation+5.90e-02

Pos logit contributions
 parties+0.017
 recess+0.014
 party+0.013
ashed+0.013
ashes+0.013
Neg logit contributions
 phrase-0.021
cut-0.020
rum-0.019
 drill-0.018
 ringing-0.018
TokenThe
Token ID464
Feature activation-3.59e+00

Pos logit contributions
 parties+0.002
 recess+0.001
text+0.001
 outside+0.001
 tea+0.001
Neg logit contributions
 phrase-0.002
cut-0.002
 ringing-0.002
rum-0.002
 drill-0.002
Token fire
Token ID2046
Feature activation-6.76e+00

Pos logit contributions
 phrase+0.139
rum+0.136
fly+0.136
ller+0.129
ous+0.127
Neg logit contributions
 outside-0.080
 parties-0.078
 party-0.073
 play-0.071
 tea-0.070
Tokenman
Token ID805
Feature activation-3.50e+00

Pos logit contributions
 phrase+0.170
rum+0.161
 ey+0.154
 swollen+0.154
 definition+0.147
Neg logit contributions
 together-0.233
 outside-0.224
 they-0.216
 in-0.215
 were-0.209
Token was
Token ID373
Feature activation-9.65e-01

Pos logit contributions
 phrase+0.136
cut+0.125
rum+0.122
 ringing+0.121
 guardian+0.119
Neg logit contributions
 parties-0.085
text-0.069
 outside-0.067
 together-0.066
 recess-0.064
Token so
Token ID523
Feature activation-3.14e-01

Pos logit contributions
 phrase+0.043
cut+0.041
rum+0.039
fly+0.039
 drill+0.039
Neg logit contributions
 parties-0.018
 outside-0.014
text-0.013
 recess-0.012
 together-0.012
Token happy
Token ID3772
Feature activation-2.47e-01

Pos logit contributions
 phrase+0.013
cut+0.013
fly+0.012
 drill+0.012
rum+0.012
Neg logit contributions
 parties-0.007
 outside-0.005
text-0.005
 recess-0.005
 together-0.005
Token that
Token ID326
Feature activation+1.14e+00

Pos logit contributions
 phrase+0.010
cut+0.010
 ringing+0.010
rum+0.009
 guardian+0.009
Neg logit contributions
 parties-0.005
text-0.004
 recess-0.004
 outside-0.004
 tea-0.004
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 phrase+0.059
cut+0.056
rum+0.055
 drill+0.053
 ringing+0.052
Neg logit contributions
 parties-0.040
 recess-0.032
 tea-0.030
text-0.030
 party-0.029
TokenEvery
Token ID6109
Feature activation+7.87e-01

Pos logit contributions
 parties+0.032
ashed+0.027
 recess+0.027
text+0.027
 tea+0.026
Neg logit contributions
 phrase-0.044
cut-0.040
pe-0.039
 word-0.039
 ringing-0.038
Token day
Token ID1110
Feature activation+7.48e-01

Pos logit contributions
 parties+0.014
text+0.010
 recess+0.010
 outside+0.009
 together+0.009
Neg logit contributions
 phrase-0.036
cut-0.034
 ringing-0.033
 guardian-0.032
 drill-0.032
Token,
Token ID11
Feature activation+2.89e-01

Pos logit contributions
 parties+0.014
text+0.010
 recess+0.010
 outside+0.010
aws+0.009
Neg logit contributions
 phrase-0.033
cut-0.031
 ringing-0.030
 claw-0.030
 drill-0.030
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
 parties+0.004
 outside+0.003
 recess+0.003
 together+0.003
 tea+0.002
Neg logit contributions
 phrase-0.015
cut-0.013
rum-0.013
 guardian-0.013
fly-0.013
Token knob
Token ID36842
Feature activation-7.91e+00

Pos logit contributions
fly+0.050
rum+0.046
 phrase+0.046
ric+0.044
cut+0.042
Neg logit contributions
 outside-0.048
 parties-0.041
 everywhere-0.040
 outdoors-0.038
 in-0.038
Token made
Token ID925
Feature activation-8.41e-02

Pos logit contributions
 phrase+0.324
cut+0.296
rum+0.295
 ringing+0.292
 guardian+0.287
Neg logit contributions
 parties-0.181
 outside-0.143
text-0.138
 recess-0.134
 together-0.131
Token sure
Token ID1654
Feature activation-5.63e-01

Pos logit contributions
 phrase+0.004
cut+0.004
rum+0.004
fly+0.004
 guardian+0.004
Neg logit contributions
 parties-0.001
 outside-0.001
 together-0.001
 tea-0.001
text-0.001
Token that
Token ID326
Feature activation+2.10e-02

Pos logit contributions
 phrase+0.027
rum+0.025
cut+0.025
fly+0.024
 grizz+0.024
Neg logit contributions
 parties-0.010
 they-0.008
 outside-0.008
 together-0.007
 their-0.007
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 parties+0.000
 outside+0.000
 they+0.000
 together+0.000
 their+0.000
Neg logit contributions
 phrase-0.001
rum-0.001
cut-0.001
ric-0.001
fly-0.001
Token box
Token ID3091
Feature activation-5.71e+00

Pos logit contributions
fly+0.040
rum+0.037
 phrase+0.036
ric+0.034
cut+0.033
Neg logit contributions
 outside-0.041
 parties-0.035
 everywhere-0.034
 outdoors-0.033
 in-0.032
Token
Token ID198
Feature activation+3.12e-01

Pos logit contributions
 parties+0.023
 recess+0.019
ashed+0.019
text+0.018
 tea+0.017
Neg logit contributions
 phrase-0.033
cut-0.030
 word-0.029
 ringing-0.029
 drill-0.029
TokenEvery
Token ID6109
Feature activation+6.55e-01

Pos logit contributions
 parties+0.008
text+0.006
 recess+0.006
 outside+0.006
 tea+0.006
Neg logit contributions
 phrase-0.012
cut-0.011
 ringing-0.011
rum-0.011
 guardian-0.010
Token morning
Token ID3329
Feature activation-2.21e-01

Pos logit contributions
 parties+0.009
 outside+0.006
 recess+0.005
text+0.005
 tea+0.005
Neg logit contributions
 phrase-0.033
cut-0.031
rum-0.031
 ringing-0.030
 claw-0.030
Token,
Token ID11
Feature activation+8.44e-02

Pos logit contributions
 phrase+0.011
cut+0.010
rum+0.010
 ringing+0.010
 guardian+0.010
Neg logit contributions
 parties-0.004
text-0.002
 outside-0.002
 recess-0.002
 tea-0.002
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 parties+0.001
 outside+0.001
 recess+0.001
 together+0.001
text+0.001
Neg logit contributions
 phrase-0.004
cut-0.004
rum-0.004
 guardian-0.004
 ringing-0.004
Token bell
Token ID8966
Feature activation-6.95e+00

Pos logit contributions
rum+0.058
 phrase+0.058
fly+0.058
ric+0.053
 swollen+0.050
Neg logit contributions
 outside-0.045
 parties-0.044
 outdoors-0.037
 party-0.037
 in-0.036
Token would
Token ID561
Feature activation-1.03e+00

Pos logit contributions
 phrase+0.297
rum+0.271
 guardian+0.263
cut+0.263
 ringing+0.261
Neg logit contributions
 parties-0.147
 outside-0.119
 together-0.111
text-0.105
 play-0.102
Token ring
Token ID5858
Feature activation-4.18e+00

Pos logit contributions
 phrase+0.044
rum+0.041
omm+0.039
ller+0.039
cut+0.039
Neg logit contributions
 parties-0.023
 play-0.019
 outside-0.018
 jog-0.016
 party-0.016
Token and
Token ID290
Feature activation+4.97e-01

Pos logit contributions
 phrase+0.198
cut+0.186
rum+0.184
 claw+0.181
 guardian+0.180
Neg logit contributions
 parties-0.070
 outside-0.051
 together-0.049
 recess-0.040
 tea-0.039
Token the
Token ID262
Feature activation-5.07e-01

Pos logit contributions
 parties+0.011
 play+0.009
 outside+0.008
 they+0.008
 together+0.008
Neg logit contributions
 phrase-0.022
rum-0.021
omm-0.021
cut-0.019
 swollen-0.019
Token village
Token ID7404
Feature activation-3.86e+00

Pos logit contributions
rum+0.020
fly+0.019
 phrase+0.019
omm+0.018
ller+0.017
Neg logit contributions
 parties-0.012
 outside-0.011
 outdoors-0.010
 games-0.009
 party-0.009
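
The per-token records above follow a regular layout: a `Token` line, a `Token ID` line, a `Feature activation` line, then `Pos logit contributions` and `Neg logit contributions` lists. A minimal sketch of reading such a dump into structured records is given below. The layout is inferred from the records themselves rather than from any documented format, and the names `TokenRecord` and `parse_records` are hypothetical helpers introduced only for illustration.

```python
import re
from dataclasses import dataclass, field

# Patterns inferred from the dump layout (an assumption, not a documented spec).
ID_RE = re.compile(r"^Token ID(\d+)$")
ACT_RE = re.compile(r"^Feature activation([+-]\d+\.\d+e[+-]\d+)$")
CONTRIB_RE = re.compile(r"^(.*)([+-]\d+\.\d+)$")


@dataclass
class TokenRecord:
    """One token's entry; hypothetical container for illustration."""
    token: str          # text after "Token", leading space preserved
    token_id: int
    activation: float
    pos: list = field(default_factory=list)  # (token, contribution) pairs
    neg: list = field(default_factory=list)


def parse_records(text):
    """Parse a dump into TokenRecord objects, skipping any leading partial record."""
    lines = text.splitlines()
    records, current, bucket = [], None, None
    for i, line in enumerate(lines):
        nxt = lines[i + 1] if i + 1 < len(lines) else ""
        id_match = ID_RE.match(nxt)
        if line.startswith("Token") and id_match:
            # A header is a "Token..." line immediately followed by "Token ID<n>".
            current = TokenRecord(line[len("Token"):], int(id_match.group(1)), float("nan"))
            records.append(current)
            bucket = None
        elif current is None or ID_RE.match(line):
            continue
        elif (m := ACT_RE.match(line)):
            current.activation = float(m.group(1))
        elif line == "Pos logit contributions":
            bucket = current.pos
        elif line == "Neg logit contributions":
            bucket = current.neg
        elif bucket is not None and (m := CONTRIB_RE.match(line)):
            # Greedy first group keeps everything before the final signed number.
            bucket.append((m.group(1), float(m.group(2))))
    return records


SAMPLE = (
    "Token a\nToken ID257\nFeature activation+2.85e+00\n\n"
    "Pos logit contributions\n phrase+0.028\ncut+0.026\n"
    "Neg logit contributions\n parties-0.010\n"
    "Token\nToken ID198\nFeature activation-2.15e-02\n"
)
records = parse_records(SAMPLE)
```

In this sketch a token whose text is empty after `Token` (as with ID 198, which appears to be a line break) is stored as an empty string, and leading spaces in token strings are kept because they distinguish word-initial tokens from sub-word pieces.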