TOP ACTIVATIONS
MAX = 8.049

I go near the k ang aroo ?" Anna asked .
ran over to the k ang aroo and said , "
ran away from the k ang aroo . The
else . In the k ang aroo 's pouch was a
They play with the k ang aroo until it is time
cl ap for the k ang aro os again . <|endoftext|>
He runs to the k ang aroo and tries to open
and play with the k ang aroo . They have a
They run to the k ang aroo . They want to
tried to catch the k ang aroo . It was not
go closer to the k ang aroo . The k ang
mean to make the k ang aroo mad . We will
should not bother the k ang aroo . She is a
give them to the k ang aroo . The k ang
They run after the k ang aroo . The k ang
they went to the k ang aroo enclosure . There were
Sam sit with the k ang aroo . They listen to
wanted to catch the k ang aro os and take them
says . " The k ang aroo is not from the
He runs after the k ang aroo , but it is

MIN ACTIVATIONS
MIN = -9.781

something new ! She thought av oc ados were the most
something new ! She thought av oc ados were the most
time there was a little av ac ado . It was
if she could request some av oc ados from the store
happy that it produced more av oc ados than ever before
the tree if they were av oc ados , and they
says , " They are av oc ados . My mom
, there was a loud av ac ado . It was
said , " I love av oc ados ! They are
" I will buy more av oc ados tomorrow . You
away . She learned that av oc ados are good for
. Mom says they are av oc ados . Sara wants
tree was sad that the av oc ados were being taken
Everyday he would take the av oc ados and scatter them
. Did you know that av oc ados are very healthy
the tree knew that the av oc ados would be back
girl who didn 't like av oc ados . She always
It turned out that the av oc ados were bad and
B ongo loved to eat av oc ados . One day
recommendation and picked the right av oc ados .

INTERVAL 3.592 - 8.049
CONTAINS 0.345%

mom in the kitchen and called her . " Mom ,
good night hugs and the goat would sn uggle close ,
She went to the man who was selling it , but
He loved playing with the black cat in his backyard .
and moved together in different directions . When

INTERVAL -0.866 - 3.592
CONTAINS 78.198%

that day on , Lily always made sure to put her
own gun from the distant hill ! <|endoftext|> Once upon a
boy does not move . Tom tries to climb up the
forgot all about the rude boy . She was excited to
't worry , Tim my . It will be fun ."

INTERVAL -5.324 - -0.866
CONTAINS 21.433%

ir ped a thank you and flew off , back into
home but she got lost . She saw a big ,
he thought about what type of nut he had found .
. She was feeling happy and relaxed . Then
close her eyes and imagine a palace made out of clouds

INTERVAL -9.781 - -5.324
CONTAINS 0.024%

wanted to make a perfect cake , so he followed all
of the old man 's stories and adventures . <|endoftext|> Will
a moment . She liked stories about princess es , fair
his favourite snack and some books . When he was ready
minutes , a delicious chocolate cake was done . Everyone was
Token I
Token ID314
Feature activation+1.47e+00

Pos logit contributions
aver+0.002
 pipe+0.002
 afraid+0.002
 warned+0.001
 suppose+0.001
Neg logit contributions
 novels-0.002
 av-0.002
edly-0.002
der-0.002
ados-0.002
Token go
Token ID467
Feature activation-3.73e-01

Pos logit contributions
aver+0.042
 pipe+0.037
 afraid+0.033
 suppose+0.033
 bet+0.031
Neg logit contributions
 novels-0.074
 av-0.072
ados-0.065
 broccoli-0.064
 brace-0.063
Token near
Token ID1474
Feature activation+2.95e+00

Pos logit contributions
 novels+0.013
 av+0.013
 Sunny+0.011
 broccoli+0.011
 brace+0.011
Neg logit contributions
aver-0.015
 pipe-0.014
 afraid-0.013
 suppose-0.013
avers-0.012
Token the
Token ID262
Feature activation+1.73e+00

Pos logit contributions
aver+0.114
 pipe+0.110
 afraid+0.103
 suppose+0.096
 warned+0.095
Neg logit contributions
 novels-0.107
 av-0.106
 vows-0.093
 brace-0.091
 broccoli-0.089
Token k
Token ID479
Feature activation+3.96e+00

Pos logit contributions
aver+0.061
 pipe+0.057
 afraid+0.053
 suppose+0.050
 worried+0.047
Neg logit contributions
 novels-0.071
 av-0.070
 broccoli-0.061
 brace-0.060
ados-0.060
Tokenang
Token ID648
Feature activation+8.05e+00

Pos logit contributions
aver+0.087
 pipe+0.074
 suppose+0.066
 afraid+0.065
 bet+0.053
Neg logit contributions
 novels-0.212
 av-0.207
 Sunny-0.195
 Ivy-0.193
 Elsa-0.190
Tokenaroo
Token ID38049
Feature activation+5.29e+00

Pos logit contributions
aver+0.252
 pipe+0.226
 suppose+0.211
 afraid+0.209
eping+0.188
Neg logit contributions
 novels-0.350
 av-0.346
 Sunny-0.319
 Ivy-0.312
 broccoli-0.308
Token?"
Token ID1701
Feature activation-5.73e-01

Pos logit contributions
aver+0.170
 pipe+0.156
 afraid+0.141
 suppose+0.136
 bet+0.128
Neg logit contributions
 novels-0.238
 av-0.233
 brace-0.209
 broccoli-0.208
ados-0.206
Token Anna
Token ID11735
Feature activation-1.21e+00

Pos logit contributions
 av+0.017
 novels+0.017
ados+0.014
 brace+0.013
 vows+0.013
Neg logit contributions
aver-0.026
 afraid-0.025
 pipe-0.024
 worried-0.023
 suppose-0.023
Token asked
Token ID1965
Feature activation+3.39e+00

Pos logit contributions
 av+0.051
 Elsa+0.051
 novels+0.050
 brace+0.049
 Sunny+0.049
Neg logit contributions
aver-0.038
 pipe-0.036
 suppose-0.030
inter-0.029
avers-0.028
Token.
Token ID13
Feature activation-7.28e-01

Pos logit contributions
aver+0.102
 afraid+0.099
 scared+0.096
 worried+0.095
 pipe+0.092
Neg logit contributions
 novels-0.146
 av-0.142
 vows-0.136
ils-0.132
ados-0.131
Token ran
Token ID4966
Feature activation-4.60e-02

Pos logit contributions
 novels+0.076
 av+0.075
ados+0.067
 broccoli+0.067
 brace+0.065
Neg logit contributions
aver-0.041
 pipe-0.035
 afraid-0.032
 suppose-0.032
 warned-0.029
Token over
Token ID625
Feature activation+5.05e-01

Pos logit contributions
 av+0.002
 novels+0.002
 broccoli+0.002
ados+0.002
 brace+0.002
Neg logit contributions
aver-0.002
 pipe-0.001
 afraid-0.001
 suppose-0.001
 bet-0.001
Token to
Token ID284
Feature activation+6.86e-01

Pos logit contributions
 pipe+0.015
aver+0.014
 afraid+0.013
 suppose+0.013
 bet+0.013
Neg logit contributions
 novels-0.023
 av-0.022
ados-0.020
 vows-0.020
 broccoli-0.020
Token the
Token ID262
Feature activation+8.86e-01

Pos logit contributions
aver+0.025
 pipe+0.024
 suppose+0.023
 afraid+0.023
 bet+0.022
Neg logit contributions
 novels-0.027
 av-0.026
ados-0.023
 cand-0.023
 brace-0.023
Token k
Token ID479
Feature activation+3.12e+00

Pos logit contributions
aver+0.030
 pipe+0.027
 afraid+0.025
 suppose+0.024
 bet+0.023
Neg logit contributions
 novels-0.038
 av-0.038
 broccoli-0.033
 brace-0.033
ados-0.033
Tokenang
Token ID648
Feature activation+7.50e+00

Pos logit contributions
aver+0.071
 pipe+0.062
 suppose+0.057
 afraid+0.057
 bet+0.047
Neg logit contributions
 novels-0.160
 av-0.158
 Ivy-0.146
 Sunny-0.146
 brace-0.145
Tokenaroo
Token ID38049
Feature activation+4.58e+00

Pos logit contributions
aver+0.225
 pipe+0.202
 afraid+0.187
 suppose+0.184
 bet+0.164
Neg logit contributions
 novels-0.339
 av-0.334
 Sunny-0.305
 Ivy-0.302
 broccoli-0.299
Token and
Token ID290
Feature activation-1.12e-01

Pos logit contributions
aver+0.157
 pipe+0.150
 afraid+0.134
 suppose+0.131
 warned+0.125
Neg logit contributions
 novels-0.195
 av-0.190
 broccoli-0.165
 brace-0.165
ados-0.164
Token said
Token ID531
Feature activation+2.02e+00

Pos logit contributions
 novels+0.006
 av+0.006
 broccoli+0.005
ados+0.005
 brace+0.005
Neg logit contributions
aver-0.003
 pipe-0.003
 afraid-0.002
 suppose-0.002
 warned-0.002
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
aver+0.044
 afraid+0.041
 careful+0.040
 suppose+0.038
 worried+0.037
Neg logit contributions
 novels-0.105
 av-0.100
 magazines-0.092
 cere-0.092
 broccoli-0.091
Token "
Token ID366
Feature activation-4.13e-01

Pos logit contributions
 novels+0.050
 av+0.049
 broccoli+0.043
 brace+0.043
ados+0.042
Neg logit contributions
aver-0.043
 pipe-0.038
 afraid-0.036
 suppose-0.035
 bet-0.033
Token ran
Token ID4966
Feature activation+5.09e-01

Pos logit contributions
aver+0.023
 pipe+0.020
 suppose+0.017
 afraid+0.017
 bet+0.015
Neg logit contributions
 novels-0.042
 av-0.042
 broccoli-0.038
 brace-0.038
 Sunny-0.037
Token away
Token ID1497
Feature activation+1.30e-01

Pos logit contributions
aver+0.021
 pipe+0.019
 afraid+0.018
 suppose+0.018
 bet+0.016
Neg logit contributions
 novels-0.018
 av-0.018
 broccoli-0.016
 brace-0.015
 Sunny-0.015
Token from
Token ID422
Feature activation-2.07e+00

Pos logit contributions
aver+0.005
 afraid+0.005
 pipe+0.005
 scared+0.005
 worried+0.005
Neg logit contributions
 av-0.005
 novels-0.005
ados-0.004
 broccoli-0.004
 brace-0.004
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 novels+0.074
 av+0.073
 brace+0.062
 broccoli+0.062
 vows+0.062
Neg logit contributions
aver-0.084
 pipe-0.080
 afraid-0.075
 suppose-0.072
 worried-0.069
Token k
Token ID479
Feature activation+3.43e+00

Pos logit contributions
aver+0.033
 pipe+0.031
 afraid+0.027
 suppose+0.025
 worried+0.024
Neg logit contributions
 novels-0.051
 av-0.051
ados-0.045
 brace-0.045
 broccoli-0.045
Tokenang
Token ID648
Feature activation+7.47e+00

Pos logit contributions
aver+0.091
 pipe+0.078
 afraid+0.070
 suppose+0.069
 bet+0.061
Neg logit contributions
 novels-0.174
 av-0.173
 broccoli-0.156
 brace-0.155
 Sunny-0.154
Tokenaroo
Token ID38049
Feature activation+5.04e+00

Pos logit contributions
aver+0.228
 pipe+0.206
 suppose+0.191
 afraid+0.190
 bet+0.173
Neg logit contributions
 novels-0.333
 av-0.329
 Sunny-0.297
 broccoli-0.292
 Ivy-0.291
Token.
Token ID13
Feature activation+3.65e-01

Pos logit contributions
aver+0.207
 pipe+0.196
 afraid+0.182
 suppose+0.173
 warned+0.166
Neg logit contributions
 novels-0.180
 av-0.178
 broccoli-0.152
 brace-0.150
 vows-0.148
Token
Token ID198
Feature activation+6.29e-01

Pos logit contributions
aver+0.016
 pipe+0.015
b+0.013
aking+0.013
eping+0.013
Neg logit contributions
 novels-0.012
 av-0.011
 Sunny-0.011
 Ivy-0.011
 Coco-0.010
Token
Token ID198
Feature activation+4.51e-01

Pos logit contributions
aver+0.027
 pipe+0.026
inter+0.024
 afraid+0.024
 suppose+0.023
Neg logit contributions
 novels-0.019
 Sunny-0.018
 av-0.017
 Ivy-0.016
Sun-0.016
TokenThe
Token ID464
Feature activation+6.03e-01

Pos logit contributions
 pipe+0.020
aver+0.020
inter+0.019
 suppose+0.019
 afraid+0.018
Neg logit contributions
 novels-0.012
 av-0.011
 Sunny-0.011
Sun-0.011
 Ivy-0.010
Token else
Token ID2073
Feature activation-7.73e-01

Pos logit contributions
 novels+0.115
 av+0.113
 broccoli+0.100
ados+0.100
 brace+0.099
Neg logit contributions
aver-0.090
 pipe-0.082
 afraid-0.076
 suppose-0.073
 worried-0.068
Token.
Token ID13
Feature activation-6.63e-01

Pos logit contributions
 novels+0.030
 av+0.029
ados+0.025
 broccoli+0.025
 brace+0.025
Neg logit contributions
aver-0.030
 pipe-0.028
 afraid-0.026
 suppose-0.025
 worried-0.024
Token In
Token ID554
Feature activation-1.39e+00

Pos logit contributions
 novels+0.028
 av+0.027
 brace+0.025
 vows+0.024
 broccoli+0.024
Neg logit contributions
aver-0.023
 pipe-0.021
 afraid-0.019
 suppose-0.019
 bet-0.018
Token the
Token ID262
Feature activation+9.09e-01

Pos logit contributions
 av+0.052
 novels+0.051
 brace+0.044
cakes+0.044
 vows+0.044
Neg logit contributions
aver-0.055
 pipe-0.051
 afraid-0.047
 suppose-0.046
 worried-0.043
Token k
Token ID479
Feature activation+3.17e+00

Pos logit contributions
aver+0.033
 pipe+0.029
 afraid+0.027
 suppose+0.027
avers+0.025
Neg logit contributions
 novels-0.037
 av-0.037
 broccoli-0.033
 Sunny-0.032
 Ivy-0.032
Tokenang
Token ID648
Feature activation+7.30e+00

Pos logit contributions
aver+0.080
 pipe+0.070
 afraid+0.067
 suppose+0.065
 bet+0.054
Neg logit contributions
 novels-0.153
 av-0.151
 Ivy-0.136
es-0.135
 Sunny-0.135
Tokenaroo
Token ID38049
Feature activation+4.81e+00

Pos logit contributions
aver+0.249
 pipe+0.238
 suppose+0.220
 afraid+0.220
eping+0.204
Neg logit contributions
 av-0.261
 novels-0.254
 Sunny-0.245
 Ivy-0.241
es-0.231
Token's
Token ID338
Feature activation+2.66e+00

Pos logit contributions
aver+0.133
 pipe+0.113
 afraid+0.102
 suppose+0.100
avers+0.090
Neg logit contributions
 av-0.235
 novels-0.232
 broccoli-0.211
 Sunny-0.210
 brace-0.208
Token pouch
Token ID36594
Feature activation+2.93e+00

Pos logit contributions
aver+0.074
 pipe+0.070
 afraid+0.062
 suppose+0.058
 worried+0.054
Neg logit contributions
 novels-0.129
 av-0.128
 broccoli-0.113
ados-0.112
 brace-0.112
Token was
Token ID373
Feature activation+3.95e-01

Pos logit contributions
aver+0.102
 pipe+0.092
 afraid+0.084
 suppose+0.082
 bet+0.076
Neg logit contributions
 novels-0.125
 av-0.122
 broccoli-0.109
 brace-0.107
 Sunny-0.106
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
aver+0.013
 pipe+0.011
 afraid+0.011
 suppose+0.010
 worried+0.010
Neg logit contributions
 novels-0.018
 av-0.018
ados-0.016
 brace-0.016
cakes-0.015
Token They
Token ID1119
Feature activation+1.40e+00

Pos logit contributions
aver+0.011
 pipe+0.010
 afraid+0.009
 suppose+0.009
 bet+0.009
Neg logit contributions
 av-0.017
 novels-0.017
 brace-0.015
ados-0.015
ils-0.015
Token play
Token ID711
Feature activation+6.18e-01

Pos logit contributions
aver+0.029
 bet+0.028
 know+0.027
 afraid+0.027
 suppose+0.026
Neg logit contributions
 novels-0.080
 av-0.076
ados-0.075
cakes-0.071
rily-0.069
Token with
Token ID351
Feature activation+5.48e-01

Pos logit contributions
aver+0.021
 pipe+0.019
 afraid+0.018
 bet+0.017
 scared+0.017
Neg logit contributions
 av-0.026
 novels-0.026
 vows-0.024
ados-0.024
 cere-0.023
Token the
Token ID262
Feature activation+7.91e-01

Pos logit contributions
 pipe+0.017
aver+0.016
 afraid+0.015
 scared+0.015
 worried+0.014
Neg logit contributions
 av-0.023
 novels-0.023
quet-0.022
cakes-0.022
 vows-0.022
Token k
Token ID479
Feature activation+3.06e+00

Pos logit contributions
aver+0.036
 pipe+0.032
 suppose+0.030
 afraid+0.030
avers+0.029
Neg logit contributions
 novels-0.025
 av-0.024
 brace-0.022
 Sunny-0.022
 broccoli-0.022
Tokenang
Token ID648
Feature activation+7.28e+00

Pos logit contributions
aver+0.066
 pipe+0.056
 suppose+0.051
 afraid+0.049
 bet+0.042
Neg logit contributions
 novels-0.165
 av-0.161
 Sunny-0.150
 brace-0.149
 Ivy-0.148
Tokenaroo
Token ID38049
Feature activation+4.64e+00

Pos logit contributions
aver+0.231
 pipe+0.211
 suppose+0.198
 afraid+0.196
 bet+0.179
Neg logit contributions
 novels-0.311
 av-0.302
 Sunny-0.280
 Ivy-0.273
 broccoli-0.270
Token until
Token ID1566
Feature activation+1.02e+00

Pos logit contributions
aver+0.162
 pipe+0.159
 afraid+0.150
 scared+0.136
 suppose+0.135
Neg logit contributions
 av-0.184
 novels-0.182
 vows-0.161
cakes-0.159
ados-0.159
Token it
Token ID340
Feature activation+1.06e-01

Pos logit contributions
aver+0.029
 pipe+0.028
 afraid+0.026
 worried+0.025
 saw+0.025
Neg logit contributions
 novels-0.048
 av-0.047
ados-0.043
 brace-0.042
 cere-0.042
Token is
Token ID318
Feature activation+8.66e-01

Pos logit contributions
aver+0.003
 pipe+0.003
 afraid+0.003
 bet+0.003
 suppose+0.002
Neg logit contributions
 av-0.005
 novels-0.005
ados-0.005
 sweets-0.004
 broccoli-0.004
Token time
Token ID640
Feature activation+2.19e+00

Pos logit contributions
aver+0.027
 pipe+0.025
 suppose+0.023
 afraid+0.022
 bet+0.021
Neg logit contributions
 novels-0.038
 av-0.038
 broccoli-0.034
 Sunny-0.034
 brace-0.034
Token cl
Token ID537
Feature activation+8.37e-01

Pos logit contributions
aver+0.019
 pipe+0.017
 suppose+0.016
 know+0.015
 afraid+0.015
Neg logit contributions
 novels-0.051
 av-0.048
 broccoli-0.045
ados-0.045
 Elsa-0.043
Tokenap
Token ID499
Feature activation+2.22e-01

Pos logit contributions
aver+0.027
 pipe+0.025
 afraid+0.023
 suppose+0.022
 worried+0.020
Neg logit contributions
 novels-0.037
 av-0.037
 broccoli-0.033
ados-0.032
 brace-0.032
Token for
Token ID329
Feature activation-1.54e+00

Pos logit contributions
aver+0.008
 pipe+0.007
 afraid+0.007
 suppose+0.006
 bet+0.006
Neg logit contributions
 av-0.010
 novels-0.009
ados-0.008
 Ivy-0.008
 brace-0.008
Token the
Token ID262
Feature activation+8.27e-01

Pos logit contributions
 novels+0.059
 av+0.058
cakes+0.055
ados+0.051
 vows+0.051
Neg logit contributions
aver-0.054
 pipe-0.054
 afraid-0.052
 suppose-0.050
 worried-0.047
Token k
Token ID479
Feature activation+2.99e+00

Pos logit contributions
aver+0.033
 pipe+0.028
aking+0.026
 suppose+0.026
 afraid+0.026
Neg logit contributions
 novels-0.031
 av-0.029
 Sunny-0.027
 broccoli-0.026
 Ivy-0.026
Tokenang
Token ID648
Feature activation+7.25e+00

Pos logit contributions
aver+0.060
 pipe+0.054
 suppose+0.046
 afraid+0.044
 bet+0.039
Neg logit contributions
 novels-0.165
 av-0.162
 Sunny-0.149
 brace-0.147
 Ivy-0.147
Tokenaro
Token ID12022
Feature activation+2.03e+00

Pos logit contributions
aver+0.237
 pipe+0.227
 suppose+0.211
 afraid+0.206
 bet+0.194
Neg logit contributions
 novels-0.290
 av-0.282
 Sunny-0.262
 Ivy-0.255
ados-0.248
Tokenos
Token ID418
Feature activation+8.20e-01

Pos logit contributions
aver+0.066
 pipe+0.059
 afraid+0.053
 suppose+0.053
 bet+0.048
Neg logit contributions
 novels-0.090
 av-0.090
 broccoli-0.079
 Sunny-0.079
 brace-0.078
Token again
Token ID757
Feature activation-2.65e+00

Pos logit contributions
aver+0.027
 pipe+0.025
 afraid+0.025
 scared+0.023
 worried+0.022
Neg logit contributions
 av-0.035
 novels-0.034
 jew-0.031
ados-0.031
 brace-0.030
Token.
Token ID13
Feature activation+3.65e-01

Pos logit contributions
 novels+0.114
 av+0.113
cakes+0.104
ados+0.103
 vows+0.099
Neg logit contributions
aver-0.089
 pipe-0.084
 afraid-0.081
 warned-0.074
 suppose-0.073
Token<|endoftext|>
Token ID50256
Feature activation-1.19e+00

Pos logit contributions
aver+0.016
 pipe+0.014
 afraid+0.013
b+0.013
 suppose+0.013
Neg logit contributions
 novels-0.013
 av-0.012
 Sunny-0.011
 broccoli-0.011
 Elsa-0.011
Token He
Token ID679
Feature activation+1.36e-01

Pos logit contributions
 novels+0.020
 av+0.019
 brace+0.017
 broccoli+0.017
 Elsa+0.017
Neg logit contributions
aver-0.017
 pipe-0.015
 afraid-0.015
 suppose-0.014
 bet-0.013
Token runs
Token ID4539
Feature activation-2.03e-03

Pos logit contributions
aver+0.004
 pipe+0.003
 afraid+0.003
 suppose+0.003
 bet+0.003
Neg logit contributions
 novels-0.007
 av-0.007
ados-0.006
 broccoli-0.006
 Ivy-0.006
Token to
Token ID284
Feature activation+5.35e-01

Pos logit contributions
 novels+0.000
 av+0.000
 broccoli+0.000
 brace+0.000
ados+0.000
Neg logit contributions
aver-0.000
 pipe-0.000
 afraid-0.000
 suppose-0.000
 bet-0.000
Token the
Token ID262
Feature activation+6.14e-01

Pos logit contributions
aver+0.024
 pipe+0.022
 afraid+0.021
 suppose+0.021
 bet+0.020
Neg logit contributions
 novels-0.017
 av-0.017
 broccoli-0.014
 brace-0.014
 Sunny-0.014
Token k
Token ID479
Feature activation+2.98e+00

Pos logit contributions
aver+0.022
 pipe+0.020
 afraid+0.019
 suppose+0.018
 bet+0.017
Neg logit contributions
 novels-0.025
 av-0.025
 broccoli-0.022
 brace-0.022
 Sunny-0.022
Tokenang
Token ID648
Feature activation+7.24e+00

Pos logit contributions
aver+0.075
 pipe+0.065
 suppose+0.061
 afraid+0.060
 bet+0.050
Neg logit contributions
 novels-0.149
 av-0.144
 Sunny-0.134
 Ivy-0.134
ados-0.132
Tokenaroo
Token ID38049
Feature activation+4.30e+00

Pos logit contributions
aver+0.226
 pipe+0.204
 afraid+0.192
 suppose+0.191
eping+0.169
Neg logit contributions
 novels-0.314
 av-0.307
 Sunny-0.284
 Ivy-0.279
 broccoli-0.276
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
aver+0.175
 pipe+0.159
 afraid+0.149
 suppose+0.146
 bet+0.137
Neg logit contributions
 novels-0.157
 av-0.156
 broccoli-0.135
 brace-0.132
 Sunny-0.131
Token tries
Token ID8404
Feature activation+2.69e+00

Pos logit contributions
aver+0.011
 pipe+0.010
 afraid+0.009
 suppose+0.009
 bet+0.008
Neg logit contributions
 novels-0.014
 av-0.014
 broccoli-0.013
 brace-0.012
ados-0.012
Token to
Token ID284
Feature activation-4.57e-01

Pos logit contributions
aver+0.105
 pipe+0.098
 afraid+0.092
 suppose+0.090
 bet+0.083
Neg logit contributions
 novels-0.102
 av-0.100
 broccoli-0.087
 brace-0.085
ados-0.085
Token open
Token ID1280
Feature activation+1.40e-01

Pos logit contributions
 av+0.019
 novels+0.018
 brace+0.017
 Sunny+0.017
 Ivy+0.017
Neg logit contributions
aver-0.016
 pipe-0.014
 afraid-0.013
 suppose-0.013
avers-0.013
Token and
Token ID290
Feature activation+6.31e-01

Pos logit contributions
 av+0.109
 novels+0.101
ados+0.098
 jew+0.098
cakes+0.097
Neg logit contributions
 afraid-0.052
 scared-0.049
aver-0.048
 worried-0.045
 pipe-0.042
Token play
Token ID711
Feature activation+6.25e-01

Pos logit contributions
 pipe+0.012
 say+0.012
aver+0.012
 know+0.012
 think+0.011
Neg logit contributions
 novels-0.035
 av-0.034
lor-0.033
ados-0.032
der-0.031
Token with
Token ID351
Feature activation+6.90e-01

Pos logit contributions
aver+0.020
 pipe+0.017
 afraid+0.016
 scared+0.016
avers+0.016
Neg logit contributions
 novels-0.027
 av-0.026
 masse-0.026
 Elsa-0.025
 vows-0.025
Token the
Token ID262
Feature activation+6.87e-01

Pos logit contributions
aver+0.024
 pipe+0.022
 afraid+0.021
 scared+0.019
 suppose+0.019
Neg logit contributions
 novels-0.029
 av-0.029
ados-0.026
 brace-0.025
cakes-0.025
Token k
Token ID479
Feature activation+2.93e+00

Pos logit contributions
aver+0.034
 pipe+0.030
 afraid+0.028
 suppose+0.028
eping+0.028
Neg logit contributions
 novels-0.018
 av-0.018
 Sunny-0.017
 brace-0.016
 Ivy-0.016
Tokenang
Token ID648
Feature activation+7.23e+00

Pos logit contributions
aver+0.060
 pipe+0.050
 suppose+0.044
 afraid+0.044
 bet+0.036
Neg logit contributions
 novels-0.162
 av-0.160
 Sunny-0.148
 brace-0.147
 Ivy-0.146
Tokenaroo
Token ID38049
Feature activation+4.58e+00

Pos logit contributions
aver+0.232
 pipe+0.211
 afraid+0.197
 suppose+0.197
 bet+0.178
Neg logit contributions
 novels-0.309
 av-0.304
 Sunny-0.278
 Ivy-0.272
 broccoli-0.270
Token.
Token ID13
Feature activation+4.42e-01

Pos logit contributions
aver+0.159
 pipe+0.157
 afraid+0.147
 scared+0.135
 worried+0.133
Neg logit contributions
 novels-0.181
 av-0.178
 vows-0.154
 jew-0.154
 Ivy-0.154
Token They
Token ID1119
Feature activation+1.30e+00

Pos logit contributions
aver+0.017
 pipe+0.015
 afraid+0.014
 suppose+0.014
 bet+0.013
Neg logit contributions
 av-0.017
 novels-0.017
 brace-0.015
 broccoli-0.015
ados-0.015
Token have
Token ID423
Feature activation+8.19e-02

Pos logit contributions
 think+0.028
 bet+0.028
 know+0.027
 say+0.026
 want+0.026
Neg logit contributions
 novels-0.078
ados-0.071
 av-0.070
edly-0.066
cakes-0.066
Token a
Token ID257
Feature activation+3.43e-01

Pos logit contributions
aver+0.002
 pipe+0.002
 afraid+0.002
 suppose+0.002
 warned+0.002
Neg logit contributions
 novels-0.004
 av-0.004
ados-0.003
 broccoli-0.003
 Ivy-0.003
TokenThey
Token ID2990
Feature activation+3.90e-01

Pos logit contributions
aver+0.010
 pipe+0.010
 suppose+0.010
inter+0.010
 afraid+0.010
Neg logit contributions
 novels-0.006
 av-0.006
Bon-0.005
Sun-0.005
 Sunny-0.005
Token run
Token ID1057
Feature activation+2.40e-01

Pos logit contributions
aver+0.008
 pipe+0.008
 bet+0.008
 afraid+0.007
 know+0.007
Neg logit contributions
 novels-0.022
 av-0.021
ados-0.020
cakes-0.019
edly-0.019
Token to
Token ID284
Feature activation+7.54e-01

Pos logit contributions
aver+0.010
 pipe+0.009
 afraid+0.008
 suppose+0.008
 bet+0.008
Neg logit contributions
 novels-0.009
 av-0.009
 broccoli-0.007
 brace-0.007
 Sunny-0.007
Token the
Token ID262
Feature activation+6.42e-01

Pos logit contributions
aver+0.043
 pipe+0.037
avers+0.036
b+0.035
 afraid+0.035
Neg logit contributions
 av-0.017
 novels-0.017
 Sunny-0.016
 broccoli-0.014
 Ivy-0.014
Token k
Token ID479
Feature activation+2.91e+00

Pos logit contributions
aver+0.026
 pipe+0.023
 afraid+0.022
 suppose+0.021
 bet+0.020
Neg logit contributions
 novels-0.024
 av-0.024
 broccoli-0.021
 brace-0.021
 Sunny-0.020
Tokenang
Token ID648
Feature activation+7.21e+00

Pos logit contributions
aver+0.067
 pipe+0.059
 suppose+0.054
 afraid+0.054
 bet+0.045
Neg logit contributions
 novels-0.150
 av-0.146
 Sunny-0.135
 Ivy-0.135
 brace-0.135
Tokenaroo
Token ID38049
Feature activation+4.42e+00

Pos logit contributions
aver+0.230
 pipe+0.209
 afraid+0.200
 suppose+0.198
 bet+0.177
Neg logit contributions
 novels-0.303
 av-0.297
 Sunny-0.275
 Ivy-0.268
ados-0.265
Token.
Token ID13
Feature activation-6.10e-02

Pos logit contributions
aver+0.178
 pipe+0.162
 afraid+0.151
 suppose+0.149
 bet+0.140
Neg logit contributions
 novels-0.163
 av-0.161
 broccoli-0.140
 brace-0.138
 Sunny-0.136
Token They
Token ID1119
Feature activation+6.83e-01

Pos logit contributions
 novels+0.002
 av+0.002
 Sunny+0.002
 broccoli+0.001
 Ivy+0.001
Neg logit contributions
aver-0.003
 pipe-0.003
eping-0.002
 afraid-0.002
b-0.002
Token want
Token ID765
Feature activation+1.29e+00

Pos logit contributions
aver+0.014
 pipe+0.013
 bet+0.013
 afraid+0.013
 suppose+0.012
Neg logit contributions
 novels-0.039
 av-0.037
ados-0.035
edly-0.034
 broccoli-0.033
Token to
Token ID284
Feature activation-2.77e-01

Pos logit contributions
aver+0.052
 pipe+0.048
 afraid+0.045
 suppose+0.044
 bet+0.041
Neg logit contributions
 novels-0.048
 av-0.047
 broccoli-0.041
 brace-0.040
ados-0.040
Token tried
Token ID3088
Feature activation+3.09e+00

Pos logit contributions
aver+0.017
 pipe+0.016
 suppose+0.013
 afraid+0.013
 bet+0.013
Neg logit contributions
 novels-0.026
 av-0.026
 broccoli-0.023
 brace-0.023
 Sunny-0.023
Token to
Token ID284
Feature activation+1.97e-01

Pos logit contributions
aver+0.101
 pipe+0.100
 afraid+0.098
 suppose+0.089
 scared+0.086
Neg logit contributions
 av-0.127
 novels-0.126
 vows-0.114
ados-0.110
 Ivy-0.110
Token catch
Token ID4929
Feature activation+1.19e+00

Pos logit contributions
aver+0.006
 pipe+0.005
 afraid+0.005
 suppose+0.004
 bet+0.004
Neg logit contributions
 novels-0.010
 av-0.009
 broccoli-0.009
 brace-0.008
ados-0.008
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
 pipe+0.036
 afraid+0.034
 scared+0.033
aver+0.032
b+0.030
Neg logit contributions
 vows-0.052
 novels-0.052
 av-0.050
cakes-0.047
edly-0.046
Token k
Token ID479
Feature activation+2.97e+00

Pos logit contributions
aver+0.020
 pipe+0.017
 suppose+0.016
 afraid+0.016
avers+0.015
Neg logit contributions
 novels-0.017
 av-0.017
 broccoli-0.015
 Sunny-0.014
 brace-0.014
Tokenang
Token ID648
Feature activation+7.19e+00

Pos logit contributions
aver+0.054
 pipe+0.043
 afraid+0.036
 suppose+0.036
 bet+0.028
Neg logit contributions
 novels-0.174
 av-0.171
 Sunny-0.158
 Ivy-0.156
 brace-0.156
Tokenaroo
Token ID38049
Feature activation+4.47e+00

Pos logit contributions
aver+0.219
 pipe+0.194
 suppose+0.179
 afraid+0.178
 bet+0.159
Neg logit contributions
 novels-0.325
 av-0.321
 Sunny-0.291
 broccoli-0.288
 Ivy-0.285
Token.
Token ID13
Feature activation+8.32e-01

Pos logit contributions
aver+0.149
 pipe+0.148
 afraid+0.144
 scared+0.131
 worried+0.125
Neg logit contributions
 novels-0.186
 av-0.182
 vows-0.169
 brace-0.162
 jew-0.161
Token It
Token ID632
Feature activation+2.14e+00

Pos logit contributions
aver+0.031
 pipe+0.028
 afraid+0.026
 suppose+0.025
 bet+0.024
Neg logit contributions
 novels-0.033
 av-0.033
 broccoli-0.029
 brace-0.029
ados-0.028
Token was
Token ID373
Feature activation+1.24e+00

Pos logit contributions
aver+0.084
 pipe+0.075
 suppose+0.069
 afraid+0.067
avers+0.066
Neg logit contributions
 novels-0.079
 av-0.079
 Sunny-0.069
 broccoli-0.069
 brace-0.069
Token not
Token ID407
Feature activation+1.48e+00

Pos logit contributions
aver+0.043
 pipe+0.038
 afraid+0.035
 suppose+0.035
 bet+0.032
Neg logit contributions
 novels-0.053
 av-0.052
 broccoli-0.046
 brace-0.046
 Sunny-0.045
Token go
Token ID467
Feature activation-5.36e-01

Pos logit contributions
aver+0.013
 pipe+0.012
 afraid+0.012
 bet+0.011
 suppose+0.011
Neg logit contributions
 novels-0.028
 av-0.027
ados-0.025
 broccoli-0.024
cakes-0.024
Token closer
Token ID5699
Feature activation+9.62e-01

Pos logit contributions
 novels+0.015
 Sunny+0.015
 av+0.015
 Ivy+0.015
 Coco+0.014
Neg logit contributions
aver-0.023
 pipe-0.022
eping-0.020
 War-0.019
inter-0.019
Token to
Token ID284
Feature activation+4.37e-01

Pos logit contributions
aver+0.036
 pipe+0.034
 afraid+0.032
 suppose+0.031
 bet+0.029
Neg logit contributions
 novels-0.037
 av-0.037
 broccoli-0.032
 brace-0.032
ados-0.031
Token the
Token ID262
Feature activation+5.28e-01

Pos logit contributions
aver+0.021
 pipe+0.019
 afraid+0.018
 suppose+0.017
avers+0.017
Neg logit contributions
 novels-0.013
 av-0.013
 broccoli-0.011
 Sunny-0.011
 brace-0.011
Token k
Token ID479
Feature activation+2.95e+00

Pos logit contributions
aver+0.021
 pipe+0.019
 afraid+0.018
 suppose+0.018
 bet+0.017
Neg logit contributions
 novels-0.020
 av-0.019
 broccoli-0.017
 brace-0.017
 Sunny-0.016
Tokenang
Token ID648
Feature activation+7.19e+00

Pos logit contributions
aver+0.063
 pipe+0.053
 afraid+0.048
 suppose+0.048
 bet+0.039
Neg logit contributions
 novels-0.162
 av-0.159
 Sunny-0.146
 brace-0.145
 Ivy-0.145
Tokenaroo
Token ID38049
Feature activation+4.42e+00

Pos logit contributions
aver+0.225
 pipe+0.203
 afraid+0.191
 suppose+0.189
 bet+0.167
Neg logit contributions
 novels-0.313
 av-0.307
 Sunny-0.284
 Ivy-0.277
 broccoli-0.274
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
aver+0.164
 pipe+0.155
 afraid+0.142
 suppose+0.137
 bet+0.130
Neg logit contributions
 novels-0.175
 av-0.171
 broccoli-0.148
ados-0.147
 brace-0.146
Token The
Token ID383
Feature activation+1.06e+00

Pos logit contributions
aver+0.010
 pipe+0.009
 afraid+0.008
 suppose+0.008
 bet+0.008
Neg logit contributions
 novels-0.010
 av-0.010
 broccoli-0.008
 brace-0.008
 Sunny-0.008
Token k
Token ID479
Feature activation+2.34e+00

Pos logit contributions
aver+0.051
 pipe+0.045
 suppose+0.044
 afraid+0.043
eping+0.042
Neg logit contributions
 novels-0.031
 av-0.029
 Sunny-0.027
 broccoli-0.026
 Ivy-0.026
Tokenang
Token ID648
Feature activation+6.58e+00

Pos logit contributions
aver+0.053
 pipe+0.046
 suppose+0.042
 afraid+0.040
 dared+0.034
Neg logit contributions
 novels-0.124
 av-0.118
 Sunny-0.112
 brace-0.111
 Ivy-0.110
Token mean
Token ID1612
Feature activation+1.89e+00

Pos logit contributions
aver+0.053
 pipe+0.049
ounded+0.046
 suppose+0.042
avers+0.041
Neg logit contributions
 av-0.093
 novels-0.090
 brace-0.083
 broccoli-0.082
 Sunny-0.082
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
 afraid+0.046
 pipe+0.043
 scared+0.040
aver+0.038
 worried+0.038
Neg logit contributions
 av-0.094
 novels-0.087
 Elsa-0.084
 Ivy-0.083
quet-0.083
Token make
Token ID787
Feature activation+1.53e+00

Pos logit contributions
aver+0.030
 pipe+0.026
 afraid+0.024
 suppose+0.023
 bet+0.021
Neg logit contributions
 novels-0.048
 av-0.048
 broccoli-0.043
 brace-0.042
 Sunny-0.042
Token the
Token ID262
Feature activation+8.64e-01

Pos logit contributions
 pipe+0.048
aver+0.048
 afraid+0.046
 scared+0.042
 worried+0.041
Neg logit contributions
 av-0.062
 novels-0.061
 vows-0.055
edly-0.055
cakes-0.054
Token k
Token ID479
Feature activation+2.88e+00

Pos logit contributions
aver+0.038
 pipe+0.033
 suppose+0.031
 afraid+0.031
avers+0.030
Neg logit contributions
 novels-0.030
 av-0.029
 broccoli-0.026
 Sunny-0.026
 brace-0.025
Tokenang
Token ID648
Feature activation+7.14e+00

Pos logit contributions
aver+0.064
 pipe+0.054
 suppose+0.052
 afraid+0.045
 bet+0.041
Neg logit contributions
 novels-0.151
 av-0.150
 Sunny-0.137
 Ivy-0.135
 brace-0.135
Tokenaroo
Token ID38049
Feature activation+4.47e+00

Pos logit contributions
aver+0.232
 pipe+0.212
 suppose+0.204
 afraid+0.191
 bet+0.181
Neg logit contributions
 novels-0.295
 av-0.291
 Sunny-0.265
ados-0.258
 Ivy-0.258
Token mad
Token ID8805
Feature activation+4.47e-01

Pos logit contributions
aver+0.152
 pipe+0.143
 afraid+0.137
 scared+0.124
 worried+0.123
Neg logit contributions
 novels-0.190
 av-0.186
 broccoli-0.164
 Elsa-0.163
 vows-0.162
Token.
Token ID13
Feature activation+7.49e-01

Pos logit contributions
aver+0.015
 afraid+0.015
 pipe+0.014
 scared+0.013
 worried+0.013
Neg logit contributions
 av-0.019
 novels-0.018
 brace-0.016
 jew-0.016
 Ivy-0.016
Token We
Token ID775
Feature activation+1.63e+00

Pos logit contributions
aver+0.032
 pipe+0.028
 suppose+0.026
 afraid+0.026
b+0.025
Neg logit contributions
 novels-0.027
 av-0.026
 Sunny-0.023
 broccoli-0.023
 Elsa-0.022
Token will
Token ID481
Feature activation-1.72e-01

Pos logit contributions
aver+0.060
 pipe+0.055
 afraid+0.048
avers+0.047
 suppose+0.047
Neg logit contributions
 av-0.060
 novels-0.059
 vows-0.054
 brace-0.053
 broccoli-0.053
Token should
Token ID815
Feature activation-3.65e-01

Pos logit contributions
aver+0.006
 pipe+0.006
 suppose+0.005
b+0.005
avers+0.005
Neg logit contributions
 av-0.004
 novels-0.004
 cand-0.004
 broccoli-0.004
 sweets-0.004
Token not
Token ID407
Feature activation+2.03e+00

Pos logit contributions
 novels+0.017
 av+0.017
 broccoli+0.015
 brace+0.014
ados+0.014
Neg logit contributions
aver-0.012
 pipe-0.010
 afraid-0.010
 suppose-0.009
 bet-0.009
Token bother
Token ID11393
Feature activation+1.89e+00

Pos logit contributions
aver+0.079
 pipe+0.066
 afraid+0.064
avers+0.063
 suppose+0.062
Neg logit contributions
 av-0.075
 novels-0.073
 broccoli-0.065
 brace-0.065
 cand-0.063
Token the
Token ID262
Feature activation+1.03e+00

Pos logit contributions
 pipe+0.061
aver+0.057
 afraid+0.055
 suppose+0.049
 scared+0.049
Neg logit contributions
 novels-0.079
 av-0.077
 vows-0.075
edly-0.074
 jew-0.069
Token k
Token ID479
Feature activation+3.02e+00

Pos logit contributions
aver+0.031
 pipe+0.029
 afraid+0.026
 suppose+0.024
 worried+0.023
Neg logit contributions
 novels-0.048
 av-0.047
 brace-0.042
ados-0.042
 broccoli-0.042
Tokenang
Token ID648
Feature activation+7.11e+00

Pos logit contributions
aver+0.068
 pipe+0.057
 afraid+0.050
 suppose+0.046
avers+0.042
Neg logit contributions
 novels-0.166
 av-0.165
 broccoli-0.151
 brace-0.149
 vows-0.148
Tokenaroo
Token ID38049
Feature activation+4.69e+00

Pos logit contributions
aver+0.207
 pipe+0.182
 afraid+0.166
 suppose+0.165
 bet+0.148
Neg logit contributions
 novels-0.336
 av-0.333
 broccoli-0.299
 Sunny-0.296
 brace-0.293
Token.
Token ID13
Feature activation+8.34e-01

Pos logit contributions
 pipe+0.159
aver+0.156
 afraid+0.150
 scared+0.138
 worried+0.132
Neg logit contributions
 novels-0.197
 av-0.190
 vows-0.180
 brace-0.173
 jew-0.171
Token She
Token ID1375
Feature activation+7.63e-02

Pos logit contributions
aver+0.031
 pipe+0.028
 afraid+0.026
 suppose+0.025
 bet+0.024
Neg logit contributions
 novels-0.034
 av-0.033
 broccoli-0.029
 brace-0.029
 Sunny-0.028
Token is
Token ID318
Feature activation+7.28e-01

Pos logit contributions
aver+0.003
 pipe+0.003
 suppose+0.002
 afraid+0.002
avers+0.002
Neg logit contributions
 av-0.003
 novels-0.003
 brace-0.002
 broccoli-0.002
 cand-0.002
Token a
Token ID257
Feature activation-1.35e-01

Pos logit contributions
aver+0.023
 pipe+0.021
 suppose+0.019
 bet+0.019
 afraid+0.018
Neg logit contributions
 novels-0.032
 av-0.032
 broccoli-0.029
 brace-0.028
 Sunny-0.028
Token give
Token ID1577
Feature activation-1.08e+00

Pos logit contributions
 bet+0.018
 think+0.018
 say+0.017
 saw+0.017
 know+0.017
Neg logit contributions
 novels-0.063
ados-0.057
 av-0.055
edly-0.055
lor-0.055
Token them
Token ID606
Feature activation-3.97e-01

Pos logit contributions
 novels+0.041
 av+0.041
der+0.035
ados+0.035
 Ivy+0.035
Neg logit contributions
aver-0.039
 pipe-0.037
 afraid-0.034
 suppose-0.032
 bet-0.032
Token to
Token ID284
Feature activation-1.86e-01

Pos logit contributions
 av+0.016
 novels+0.016
der+0.015
 Elsa+0.014
ados+0.014
Neg logit contributions
aver-0.012
 pipe-0.012
 careful-0.010
 afraid-0.010
 sprayed-0.010
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
 novels+0.008
 av+0.008
ados+0.007
cakes+0.007
paste+0.007
Neg logit contributions
aver-0.006
 pipe-0.006
 afraid-0.005
 suppose-0.005
 say-0.005
Token k
Token ID479
Feature activation+2.73e+00

Pos logit contributions
aver+0.005
 pipe+0.004
 afraid+0.004
 suppose+0.004
 bet+0.004
Neg logit contributions
 novels-0.006
 av-0.006
ados-0.005
 broccoli-0.005
 brace-0.005
Tokenang
Token ID648
Feature activation+7.07e+00

Pos logit contributions
aver+0.055
 pipe+0.044
 afraid+0.039
 suppose+0.039
 bet+0.031
Neg logit contributions
 novels-0.154
 av-0.152
 Sunny-0.140
 brace-0.139
 Ivy-0.138
Tokenaroo
Token ID38049
Feature activation+4.37e+00

Pos logit contributions
aver+0.219
 pipe+0.196
 afraid+0.185
 suppose+0.182
 bet+0.163
Neg logit contributions
 novels-0.312
 av-0.308
 Sunny-0.283
 broccoli-0.275
 Ivy-0.275
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
aver+0.156
 pipe+0.151
 afraid+0.133
 suppose+0.126
 bet+0.124
Neg logit contributions
 novels-0.176
 av-0.172
ados-0.151
 Ivy-0.149
 vows-0.148
Token The
Token ID383
Feature activation+1.15e+00

Pos logit contributions
aver+0.008
 pipe+0.007
 afraid+0.007
 suppose+0.007
 bet+0.007
Neg logit contributions
 av-0.015
 novels-0.014
ados-0.013
 brace-0.013
 sweets-0.013
Token k
Token ID479
Feature activation+2.47e+00

Pos logit contributions
aver+0.054
 pipe+0.048
 afraid+0.046
 suppose+0.046
eping+0.044
Neg logit contributions
 novels-0.035
 av-0.034
 Sunny-0.031
 Ivy-0.030
 broccoli-0.030
Tokenang
Token ID648
Feature activation+6.61e+00

Pos logit contributions
aver+0.053
 pipe+0.044
 suppose+0.039
 afraid+0.038
 dared+0.031
Neg logit contributions
 novels-0.134
 av-0.130
 Sunny-0.123
 Ivy-0.121
 brace-0.121
TokenThey
Token ID2990
Feature activation+5.64e-01

Pos logit contributions
aver+0.021
 pipe+0.019
 afraid+0.018
 suppose+0.018
 bet+0.017
Neg logit contributions
 novels-0.017
 av-0.017
 brace-0.015
 Sunny-0.014
 broccoli-0.014
Token run
Token ID1057
Feature activation+4.21e-01

Pos logit contributions
aver+0.013
 pipe+0.012
 bet+0.011
 afraid+0.011
 suppose+0.011
Neg logit contributions
 novels-0.031
 av-0.030
ados-0.028
 broccoli-0.027
cakes-0.027
Token after
Token ID706
Feature activation-4.44e-01

Pos logit contributions
aver+0.015
 pipe+0.014
 afraid+0.013
 suppose+0.013
 bet+0.012
Neg logit contributions
 av-0.017
 novels-0.017
 broccoli-0.015
ados-0.015
 brace-0.014
Token the
Token ID262
Feature activation+5.80e-01

Pos logit contributions
 novels+0.016
 av+0.016
 brace+0.014
 broccoli+0.013
 vows+0.013
Neg logit contributions
aver-0.017
 pipe-0.017
 afraid-0.015
 suppose-0.015
 worried-0.014
Token k
Token ID479
Feature activation+2.83e+00

Pos logit contributions
aver+0.031
 pipe+0.027
 suppose+0.026
 afraid+0.026
 seen+0.026
Neg logit contributions
 novels-0.014
 av-0.013
 Sunny-0.013
 Ivy-0.013
 brace-0.013
Tokenang
Token ID648
Feature activation+7.05e+00

Pos logit contributions
aver+0.063
 pipe+0.054
 afraid+0.049
 suppose+0.048
 bet+0.040
Neg logit contributions
 novels-0.150
 av-0.148
 Sunny-0.137
 brace-0.137
 Ivy-0.136
Tokenaroo
Token ID38049
Feature activation+4.12e+00

Pos logit contributions
aver+0.226
 pipe+0.209
 suppose+0.193
 afraid+0.192
 bet+0.173
Neg logit contributions
 novels-0.297
 av-0.292
 Sunny-0.271
 Ivy-0.265
ados-0.259
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
aver+0.174
 pipe+0.162
 afraid+0.152
 suppose+0.149
 bet+0.141
Neg logit contributions
 novels-0.145
 av-0.142
 broccoli-0.121
ados-0.118
 brace-0.118
Token The
Token ID383
Feature activation+1.09e+00

Pos logit contributions
aver+0.013
 pipe+0.012
 afraid+0.011
 suppose+0.011
b+0.010
Neg logit contributions
 novels-0.012
 av-0.012
 broccoli-0.010
 Sunny-0.010
 brace-0.010
Token k
Token ID479
Feature activation+2.41e+00

Pos logit contributions
aver+0.053
 pipe+0.047
 suppose+0.045
 afraid+0.045
eping+0.044
Neg logit contributions
 novels-0.030
 av-0.029
 Sunny-0.027
 Ivy-0.026
 brace-0.026
Tokenang
Token ID648
Feature activation+6.59e+00

Pos logit contributions
aver+0.056
 pipe+0.049
 suppose+0.044
 afraid+0.043
 dared+0.036
Neg logit contributions
 novels-0.125
 av-0.119
 Sunny-0.114
 brace-0.113
 Ivy-0.112
Token they
Token ID484
Feature activation-8.48e-01

Pos logit contributions
 novels+0.037
 av+0.036
 brace+0.033
ados+0.032
 vows+0.031
Neg logit contributions
aver-0.030
 pipe-0.027
 warned-0.025
 afraid-0.025
 suppose-0.025
Token went
Token ID1816
Feature activation-3.95e-01

Pos logit contributions
 novels+0.046
 av+0.044
 broccoli+0.041
ados+0.040
 brace+0.040
Neg logit contributions
aver-0.021
 pipe-0.018
 afraid-0.016
 suppose-0.015
 warned-0.014
Token to
Token ID284
Feature activation+2.71e-01

Pos logit contributions
 novels+0.016
 av+0.016
 broccoli+0.014
 brace+0.013
ados+0.013
Neg logit contributions
aver-0.015
 pipe-0.014
 afraid-0.013
 suppose-0.012
 worried-0.012
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
aver+0.011
 pipe+0.010
 afraid+0.010
 suppose+0.010
 bet+0.009
Neg logit contributions
 novels-0.010
 av-0.009
 broccoli-0.008
 brace-0.008
ados-0.008
Token k
Token ID479
Feature activation+2.63e+00

Pos logit contributions
aver+0.008
 pipe+0.007
 afraid+0.006
 suppose+0.006
 bet+0.006
Neg logit contributions
 novels-0.010
 av-0.009
 broccoli-0.008
 Sunny-0.008
 brace-0.008
Tokenang
Token ID648
Feature activation+7.01e+00

Pos logit contributions
aver+0.076
 pipe+0.071
 afraid+0.067
 suppose+0.065
inter+0.058
Neg logit contributions
 novels-0.122
 av-0.121
ados-0.106
cakes-0.106
 Sunny-0.106
Tokenaroo
Token ID38049
Feature activation+4.21e+00

Pos logit contributions
aver+0.222
 pipe+0.203
 afraid+0.191
 suppose+0.187
 worried+0.167
Neg logit contributions
 novels-0.301
 av-0.296
 Sunny-0.266
 Ivy-0.262
ados-0.262
Token enclosure
Token ID30685
Feature activation+2.60e+00

Pos logit contributions
aver+0.148
 pipe+0.137
 afraid+0.124
 suppose+0.121
 bet+0.115
Neg logit contributions
 novels-0.175
 av-0.173
 broccoli-0.152
 brace-0.152
ados-0.149
Token.
Token ID13
Feature activation-4.14e-01

Pos logit contributions
aver+0.102
 pipe+0.095
 afraid+0.088
 suppose+0.085
 warned+0.081
Neg logit contributions
 novels-0.099
 av-0.098
 brace-0.086
 broccoli-0.084
 Elsa-0.084
Token There
Token ID1318
Feature activation+1.68e+00

Pos logit contributions
 novels+0.018
 av+0.018
 broccoli+0.016
 brace+0.015
ados+0.015
Neg logit contributions
aver-0.014
 pipe-0.013
 afraid-0.012
 suppose-0.012
 bet-0.011
Token were
Token ID547
Feature activation+2.15e-01

Pos logit contributions
aver+0.067
 pipe+0.061
 afraid+0.057
 suppose+0.056
 bet+0.052
Neg logit contributions
 novels-0.063
 av-0.062
 broccoli-0.054
 brace-0.053
 Sunny-0.053
Token Sam
Token ID3409
Feature activation+6.01e-01

Pos logit contributions
 novels+0.033
 av+0.032
 vows+0.029
 broccoli+0.029
 brace+0.029
Neg logit contributions
aver-0.023
 pipe-0.021
 afraid-0.019
 suppose-0.019
 bet-0.017
Token sit
Token ID1650
Feature activation-1.51e+00

Pos logit contributions
aver+0.013
 bet+0.012
 think+0.012
 know+0.012
 say+0.012
Neg logit contributions
 novels-0.033
 av-0.031
ados-0.030
 broccoli-0.028
cakes-0.028
Token with
Token ID351
Feature activation+2.49e-01

Pos logit contributions
 av+0.061
 novels+0.060
 broccoli+0.056
 Ivy+0.055
 brace+0.054
Neg logit contributions
aver-0.056
 pipe-0.050
 afraid-0.048
aking-0.046
 worried-0.045
Token the
Token ID262
Feature activation+4.94e-01

Pos logit contributions
aver+0.010
 pipe+0.009
 afraid+0.009
 suppose+0.008
 worried+0.008
Neg logit contributions
 novels-0.009
 av-0.009
 brace-0.008
 broccoli-0.008
ados-0.008
Token k
Token ID479
Feature activation+2.75e+00

Pos logit contributions
aver+0.019
 suppose+0.016
 afraid+0.015
 pipe+0.015
aking+0.015
Neg logit contributions
 av-0.019
 novels-0.019
 caul-0.017
 broccoli-0.016
 Sunny-0.016
Tokenang
Token ID648
Feature activation+6.98e+00

Pos logit contributions
aver+0.054
 pipe+0.044
 suppose+0.040
 afraid+0.039
 bet+0.032
Neg logit contributions
 novels-0.154
 av-0.154
ados-0.140
 Sunny-0.139
 brace-0.138
Tokenaroo
Token ID38049
Feature activation+4.50e+00

Pos logit contributions
aver+0.219
 pipe+0.200
 suppose+0.190
 afraid+0.189
 bet+0.171
Neg logit contributions
 novels-0.298
 av-0.292
 Sunny-0.265
ados-0.259
 broccoli-0.257
Token.
Token ID13
Feature activation-1.62e-01

Pos logit contributions
aver+0.164
 pipe+0.159
 afraid+0.150
 suppose+0.138
 worried+0.136
Neg logit contributions
 av-0.172
 novels-0.168
 Ivy-0.150
 broccoli-0.146
 brace-0.146
Token They
Token ID1119
Feature activation+8.22e-01

Pos logit contributions
 novels+0.006
 av+0.006
 broccoli+0.005
 brace+0.005
 Sunny+0.005
Neg logit contributions
aver-0.007
 pipe-0.006
 afraid-0.006
 suppose-0.006
 bet-0.005
Token listen
Token ID6004
Feature activation-1.73e+00

Pos logit contributions
 know+0.015
 think+0.015
 bet+0.015
 say+0.015
 afraid+0.015
Neg logit contributions
 novels-0.047
 av-0.045
ados-0.045
cakes-0.043
edly-0.041
Token to
Token ID284
Feature activation-1.89e-01

Pos logit contributions
 av+0.077
 novels+0.075
ados+0.068
 brace+0.065
 Ivy+0.065
Neg logit contributions
aver-0.054
 afraid-0.051
 pipe-0.050
 suppose-0.047
 worried-0.046
Token wanted
Token ID2227
Feature activation+2.57e+00

Pos logit contributions
aver+0.057
 pipe+0.052
 suppose+0.049
inter+0.048
 kidding+0.046
Neg logit contributions
 av-0.049
 novels-0.046
es-0.041
 cand-0.039
 Sunny-0.039
Token to
Token ID284
Feature activation-6.11e-02

Pos logit contributions
aver+0.086
 pipe+0.082
 afraid+0.077
 suppose+0.073
 warned+0.070
Neg logit contributions
 av-0.107
 novels-0.107
 brace-0.094
ados-0.093
der-0.093
Token catch
Token ID4929
Feature activation+8.08e-01

Pos logit contributions
 novels+0.003
 av+0.003
 broccoli+0.002
 brace+0.002
 Sunny+0.002
Neg logit contributions
aver-0.002
 pipe-0.002
 afraid-0.002
 suppose-0.002
 bet-0.002
Token the
Token ID262
Feature activation+1.64e-01

Pos logit contributions
aver+0.032
 pipe+0.030
 afraid+0.029
 suppose+0.027
 scared+0.027
Neg logit contributions
 novels-0.030
 av-0.029
 vows-0.025
 brace-0.025
 broccoli-0.025
Token k
Token ID479
Feature activation+2.73e+00

Pos logit contributions
aver+0.006
 pipe+0.005
 suppose+0.005
b+0.005
 bet+0.005
Neg logit contributions
 novels-0.006
 av-0.006
 broccoli-0.006
 Sunny-0.006
 vows-0.005
Tokenang
Token ID648
Feature activation+6.93e+00

Pos logit contributions
aver+0.056
 pipe+0.047
 suppose+0.041
 afraid+0.041
 bet+0.034
Neg logit contributions
 novels-0.153
 av-0.152
 Sunny-0.137
ados-0.137
 broccoli-0.137
Tokenaro
Token ID12022
Feature activation+1.87e+00

Pos logit contributions
aver+0.218
 pipe+0.198
 suppose+0.189
 afraid+0.185
 bet+0.171
Neg logit contributions
 novels-0.289
 av-0.288
 Sunny-0.261
ados-0.255
 broccoli-0.254
Tokenos
Token ID418
Feature activation+5.11e-01

Pos logit contributions
aver+0.064
 pipe+0.056
 suppose+0.052
 afraid+0.052
 bet+0.048
Neg logit contributions
 av-0.078
 novels-0.078
 Sunny-0.069
 broccoli-0.068
ados-0.067
Token and
Token ID290
Feature activation+5.85e-01

Pos logit contributions
aver+0.015
 pipe+0.014
 afraid+0.014
 scared+0.013
 worried+0.012
Neg logit contributions
 av-0.023
 novels-0.023
 jew-0.022
 vows-0.022
oph-0.021
Token take
Token ID1011
Feature activation-1.31e+00

Pos logit contributions
aver+0.018
 pipe+0.016
 afraid+0.015
 suppose+0.014
 bet+0.013
Neg logit contributions
 novels-0.027
 av-0.027
 broccoli-0.024
 brace-0.024
ados-0.023
Token them
Token ID606
Feature activation+3.65e-01

Pos logit contributions
 av+0.047
 novels+0.047
 broccoli+0.040
 brace+0.039
der+0.039
Neg logit contributions
aver-0.053
 pipe-0.049
 afraid-0.046
 suppose-0.044
 bet-0.043
Token says
Token ID1139
Feature activation+2.69e+00

Pos logit contributions
 novels+0.028
 av+0.027
 broccoli+0.025
 brace+0.025
 Sunny+0.025
Neg logit contributions
aver-0.013
 pipe-0.011
 afraid-0.009
 suppose-0.009
 bet-0.008
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
aver+0.101
 pipe+0.095
 afraid+0.093
 suppose+0.087
 worried+0.086
Neg logit contributions
 novels-0.101
 av-0.097
 broccoli-0.087
 vows-0.085
 sweets-0.084
Token "
Token ID366
Feature activation+1.51e-02

Pos logit contributions
aver+0.004
 pipe+0.004
 afraid+0.004
 suppose+0.004
 worried+0.004
Neg logit contributions
 novels-0.007
 av-0.007
 broccoli-0.006
 vows-0.006
 sweets-0.006
TokenThe
Token ID464
Feature activation+1.50e+00

Pos logit contributions
aver+0.001
 pipe+0.001
 afraid+0.000
 suppose+0.000
 bet+0.000
Neg logit contributions
 novels-0.001
 av-0.001
 broccoli-0.001
 brace-0.001
ados-0.001
Token k
Token ID479
Feature activation+2.73e+00

Pos logit contributions
aver+0.057
 pipe+0.051
 suppose+0.047
 afraid+0.047
aking+0.044
Neg logit contributions
 novels-0.058
 av-0.056
 Sunny-0.052
 broccoli-0.050
 brace-0.050
Tokenang
Token ID648
Feature activation+6.89e+00

Pos logit contributions
aver+0.063
 pipe+0.055
 suppose+0.049
 afraid+0.048
 worried+0.040
Neg logit contributions
 novels-0.140
 av-0.135
 Sunny-0.128
 Ivy-0.127
 brace-0.126
Tokenaroo
Token ID38049
Feature activation+4.37e+00

Pos logit contributions
aver+0.234
 pipe+0.221
 suppose+0.207
 afraid+0.200
eping+0.190
Neg logit contributions
 novels-0.257
 av-0.251
 Sunny-0.238
 Ivy-0.229
ados-0.223
Token is
Token ID318
Feature activation+7.11e-01

Pos logit contributions
aver+0.158
 pipe+0.142
 afraid+0.134
 suppose+0.129
 bet+0.122
Neg logit contributions
 novels-0.181
 av-0.178
 broccoli-0.157
 brace-0.154
ados-0.153
Token not
Token ID407
Feature activation+1.47e+00

Pos logit contributions
aver+0.022
 pipe+0.020
 afraid+0.018
 suppose+0.018
 bet+0.016
Neg logit contributions
 novels-0.032
 av-0.032
 broccoli-0.029
 brace-0.028
 Sunny-0.028
Token from
Token ID422
Feature activation-1.87e+00

Pos logit contributions
aver+0.047
 pipe+0.042
 suppose+0.037
 afraid+0.036
 bet+0.035
Neg logit contributions
 av-0.066
 novels-0.066
 broccoli-0.059
 Sunny-0.059
 brace-0.058
Token the
Token ID262
Feature activation+1.52e+00

Pos logit contributions
 novels+0.077
 av+0.077
 brace+0.068
 broccoli+0.067
ados+0.066
Neg logit contributions
aver-0.066
 pipe-0.060
 afraid-0.057
 suppose-0.054
 worried-0.050
Token He
Token ID679
Feature activation-1.47e-01

Pos logit contributions
 novels+0.024
 av+0.024
 brace+0.021
 broccoli+0.021
ados+0.021
Neg logit contributions
aver-0.021
 pipe-0.019
 afraid-0.017
 suppose-0.017
 bet-0.016
Token runs
Token ID4539
Feature activation-1.30e-01

Pos logit contributions
 novels+0.007
 av+0.007
 broccoli+0.006
ados+0.006
 brace+0.006
Neg logit contributions
aver-0.005
 pipe-0.004
 afraid-0.004
 suppose-0.004
 bet-0.004
Token after
Token ID706
Feature activation-5.72e-01

Pos logit contributions
 novels+0.005
 av+0.005
 broccoli+0.004
 Sunny+0.004
 brace+0.004
Neg logit contributions
aver-0.005
 pipe-0.005
 afraid-0.005
 suppose-0.004
avers-0.004
Token the
Token ID262
Feature activation+5.25e-01

Pos logit contributions
 novels+0.023
 av+0.023
 brace+0.020
 vows+0.020
ados+0.019
Neg logit contributions
aver-0.020
 pipe-0.019
 afraid-0.017
 suppose-0.017
 worried-0.016
Token k
Token ID479
Feature activation+2.75e+00

Pos logit contributions
aver+0.028
 pipe+0.024
 suppose+0.024
eping+0.023
 afraid+0.023
Neg logit contributions
 novels-0.013
 av-0.013
 Sunny-0.012
 Ivy-0.012
 brace-0.012
Tokenang
Token ID648
Feature activation+6.88e+00

Pos logit contributions
aver+0.063
 pipe+0.054
 suppose+0.049
 afraid+0.048
 bet+0.040
Neg logit contributions
 novels-0.145
 av-0.142
 Sunny-0.131
 Ivy-0.131
 brace-0.130
Tokenaroo
Token ID38049
Feature activation+3.93e+00

Pos logit contributions
aver+0.206
 pipe+0.184
 afraid+0.168
 suppose+0.167
 bet+0.149
Neg logit contributions
 novels-0.316
 av-0.312
 Sunny-0.282
 broccoli-0.280
 Ivy-0.279
Token,
Token ID11
Feature activation-8.12e-02

Pos logit contributions
aver+0.157
 pipe+0.144
 afraid+0.134
 suppose+0.132
 bet+0.124
Neg logit contributions
 novels-0.146
 av-0.144
 broccoli-0.125
 brace-0.123
 Sunny-0.121
Token but
Token ID475
Feature activation+2.25e+00

Pos logit contributions
 av+0.003
 Sunny+0.003
 Ivy+0.003
 novels+0.003
 broccoli+0.003
Neg logit contributions
aver-0.003
 pipe-0.003
avers-0.002
eping-0.002
iz-0.002
Token it
Token ID340
Feature activation-6.54e-01

Pos logit contributions
aver+0.091
 pipe+0.081
 afraid+0.078
 suppose+0.076
 worried+0.071
Neg logit contributions
 novels-0.086
 av-0.085
 brace-0.072
ados-0.072
 broccoli-0.071
Token is
Token ID318
Feature activation-1.41e-01

Pos logit contributions
 novels+0.029
 av+0.029
ados+0.025
 broccoli+0.025
 Elsa+0.025
Neg logit contributions
aver-0.021
 pipe-0.019
 afraid-0.018
 suppose-0.017
 bet-0.016
Token something
Token ID1223
Feature activation-2.90e+00

Pos logit contributions
 av+0.142
 novels+0.136
 vows+0.136
cakes+0.135
umm+0.134
Neg logit contributions
 pipe-0.064
 afraid-0.063
 warned-0.060
 suppose-0.057
 worried-0.055
Token new
Token ID649
Feature activation-2.38e+00

Pos logit contributions
 novels+0.153
 av+0.146
edly+0.145
 Sunny+0.144
ados+0.142
Neg logit contributions
 pipe-0.062
 afraid-0.058
 worried-0.053
 scared-0.052
aver-0.050
Token!
Token ID0
Feature activation-3.36e-01

Pos logit contributions
 av+0.106
 novels+0.100
 Elsa+0.093
 Sunny+0.092
 Ivy+0.092
Neg logit contributions
 pipe-0.075
aver-0.075
 afraid-0.068
 suppose-0.067
 warned-0.064
Token She
Token ID1375
Feature activation-1.41e+00

Pos logit contributions
 av+0.013
 novels+0.013
 broccoli+0.011
 brace+0.011
 sweets+0.011
Neg logit contributions
aver-0.012
 pipe-0.012
 suppose-0.011
 afraid-0.011
 bet-0.011
Token thought
Token ID1807
Feature activation-5.06e-02

Pos logit contributions
 novels+0.083
 av+0.076
ados+0.075
rier+0.070
 broccoli+0.070
Neg logit contributions
aver-0.032
 pipe-0.026
 warned-0.025
 dared-0.024
 wanted-0.024
Token av
Token ID1196
Feature activation-9.78e+00

Pos logit contributions
 novels+0.002
Bon+0.002
edly+0.002
 av+0.002
 vows+0.002
Neg logit contributions
 pipe-0.001
 afraid-0.001
aver-0.001
 worried-0.001
 suppose-0.001
Tokenoc
Token ID420
Feature activation-5.44e-01

Pos logit contributions
 novels+0.391
 av+0.387
 broccoli+0.340
 brace+0.334
 Sunny+0.332
Neg logit contributions
aver-0.363
 pipe-0.327
 afraid-0.304
 suppose-0.298
 bet-0.279
Tokenados
Token ID22484
Feature activation-5.69e+00

Pos logit contributions
 novels+0.010
 Elsa+0.010
 Sunny+0.009
Elsa+0.008
 Ivy+0.008
Neg logit contributions
 pipe-0.028
aver-0.027
b-0.025
 suppose-0.025
 claw-0.025
Token were
Token ID547
Feature activation-1.59e+00

Pos logit contributions
 novels+0.285
 av+0.272
 Elsa+0.253
ados+0.251
 masse+0.250
Neg logit contributions
 pipe-0.140
 warned-0.135
aver-0.135
 afraid-0.128
b-0.124
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 novels+0.082
edly+0.081
lor+0.078
 vows+0.076
der+0.076
Neg logit contributions
 afraid-0.033
aver-0.033
 pipe-0.030
 scared-0.030
 careful-0.029
Token most
Token ID749
Feature activation-9.84e-01

Pos logit contributions
 novels+0.065
 av+0.063
ados+0.058
 Sunny+0.058
 Ivy+0.057
Neg logit contributions
aver-0.030
 pipe-0.027
 afraid-0.025
 suppose-0.024
 worried-0.022
Token something
Token ID1223
Feature activation-3.00e+00

Pos logit contributions
 av+0.143
 novels+0.137
 vows+0.136
cakes+0.136
umm+0.136
Neg logit contributions
 afraid-0.062
 pipe-0.062
 warned-0.060
 suppose-0.057
 careful-0.055
Token new
Token ID649
Feature activation-2.38e+00

Pos logit contributions
 novels+0.160
 av+0.152
edly+0.151
ados+0.149
 Sunny+0.149
Neg logit contributions
 pipe-0.061
 afraid-0.058
 worried-0.052
 scared-0.052
 careful-0.049
Token!
Token ID0
Feature activation-2.79e-01

Pos logit contributions
 av+0.105
 novels+0.100
 Elsa+0.092
 Sunny+0.092
 Ivy+0.091
Neg logit contributions
aver-0.075
 pipe-0.075
 afraid-0.068
 suppose-0.067
 warned-0.064
Token She
Token ID1375
Feature activation-1.33e+00

Pos logit contributions
 av+0.012
 novels+0.011
 broccoli+0.010
 brace+0.010
 sweets+0.010
Neg logit contributions
aver-0.010
 pipe-0.009
 suppose-0.009
 afraid-0.008
 bet-0.008
Token thought
Token ID1807
Feature activation-9.94e-02

Pos logit contributions
 novels+0.079
 av+0.072
ados+0.070
 Ivy+0.066
 broccoli+0.066
Neg logit contributions
aver-0.030
 pipe-0.024
 warned-0.023
 dared-0.022
 wanted-0.021
Token av
Token ID1196
Feature activation-9.52e+00

Pos logit contributions
 novels+0.004
Bon+0.004
edly+0.004
 av+0.004
 vows+0.004
Neg logit contributions
 pipe-0.003
 afraid-0.002
aver-0.002
 suppose-0.002
 worried-0.002
Tokenoc
Token ID420
Feature activation-5.79e-01

Pos logit contributions
 novels+0.372
 av+0.369
 broccoli+0.322
 brace+0.318
 Sunny+0.313
Neg logit contributions
aver-0.362
 pipe-0.327
 afraid-0.304
 suppose-0.299
 bet-0.278
Tokenados
Token ID22484
Feature activation-5.55e+00

Pos logit contributions
 novels+0.011
 Elsa+0.010
 Sunny+0.009
Bon+0.008
Elsa+0.008
Neg logit contributions
 pipe-0.029
aver-0.029
b-0.027
 suppose-0.027
 claw-0.026
Token were
Token ID547
Feature activation-1.60e+00

Pos logit contributions
 novels+0.282
 av+0.268
 Elsa+0.248
ados+0.248
 masse+0.247
Neg logit contributions
 pipe-0.133
 warned-0.130
aver-0.127
 afraid-0.121
 scared-0.117
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 novels+0.083
edly+0.082
der+0.078
lor+0.078
 vows+0.077
Neg logit contributions
aver-0.033
 afraid-0.032
 pipe-0.030
 scared-0.030
 careful-0.029
Token most
Token ID749
Feature activation-1.02e+00

Pos logit contributions
 novels+0.062
 av+0.060
ados+0.056
 Sunny+0.055
 Elsa+0.055
Neg logit contributions
aver-0.028
 pipe-0.025
 afraid-0.024
 suppose-0.022
 bet-0.020
Token time
Token ID640
Feature activation+1.26e+00

Pos logit contributions
aver+0.022
 pipe+0.021
 suppose+0.018
 afraid+0.018
b+0.018
Neg logit contributions
 novels-0.027
 av-0.026
 broccoli-0.024
 Sunny-0.024
Bon-0.023
Token there
Token ID612
Feature activation+2.14e+00

Pos logit contributions
aver+0.053
 pipe+0.048
 afraid+0.045
 suppose+0.044
 bet+0.042
Neg logit contributions
 novels-0.044
 av-0.043
 broccoli-0.037
 Sunny-0.037
 Ivy-0.036
Token was
Token ID373
Feature activation+1.13e+00

Pos logit contributions
aver+0.077
 pipe+0.069
 afraid+0.064
 suppose+0.062
avers+0.059
Neg logit contributions
 novels-0.087
 av-0.086
 broccoli-0.077
 Sunny-0.076
 Ivy-0.075
Token a
Token ID257
Feature activation-1.67e-01

Pos logit contributions
aver+0.044
 pipe+0.041
 afraid+0.038
 suppose+0.037
 bet+0.035
Neg logit contributions
 novels-0.043
 av-0.042
 broccoli-0.037
 Sunny-0.036
 brace-0.036
Token little
Token ID1310
Feature activation+2.15e+00

Pos logit contributions
 av+0.008
 novels+0.008
 broccoli+0.007
 Sunny+0.007
 Elsa+0.007
Neg logit contributions
aver-0.005
 pipe-0.004
 suppose-0.004
 afraid-0.004
b-0.004
Token av
Token ID1196
Feature activation-8.99e+00

Pos logit contributions
aver+0.056
 pipe+0.048
 afraid+0.043
 suppose+0.042
 bet+0.038
Neg logit contributions
 novels-0.110
 av-0.109
 broccoli-0.098
 brace-0.097
 Sunny-0.096
Tokenac
Token ID330
Feature activation-3.47e+00

Pos logit contributions
 novels+0.324
 av+0.324
 brace+0.275
 broccoli+0.271
ados+0.267
Neg logit contributions
aver-0.363
 pipe-0.337
 suppose-0.327
 afraid-0.316
 bet-0.297
Tokenado
Token ID4533
Feature activation+3.02e+00

Pos logit contributions
 novels+0.130
 av+0.130
 brace+0.111
 cand+0.108
ados+0.108
Neg logit contributions
aver-0.131
 pipe-0.126
 suppose-0.125
 afraid-0.123
 bet-0.121
Token.
Token ID13
Feature activation-6.60e-01

Pos logit contributions
aver+0.101
 pipe+0.091
 suppose+0.086
 afraid+0.085
 bet+0.079
Neg logit contributions
 novels-0.128
 av-0.128
 broccoli-0.113
 Sunny-0.111
 brace-0.110
Token It
Token ID632
Feature activation+7.55e-01

Pos logit contributions
 av+0.026
 novels+0.026
 brace+0.023
 broccoli+0.023
ados+0.022
Neg logit contributions
aver-0.024
 pipe-0.022
 afraid-0.020
 suppose-0.020
 bet-0.019
Token was
Token ID373
Feature activation+1.86e-01

Pos logit contributions
aver+0.025
 pipe+0.022
 afraid+0.020
 suppose+0.019
 bet+0.018
Neg logit contributions
 novels-0.034
 av-0.034
 broccoli-0.030
 brace-0.030
 Sunny-0.029
Token if
Token ID611
Feature activation+3.22e-01

Pos logit contributions
 novels+0.034
 av+0.033
 brace+0.031
 vows+0.030
ados+0.030
Neg logit contributions
aver-0.022
 pipe-0.020
 suppose-0.019
 afraid-0.019
 worried-0.017
Token she
Token ID673
Feature activation-2.02e+00

Pos logit contributions
 warned+0.010
 pipe+0.010
 worried+0.010
aver+0.010
 afraid+0.010
Neg logit contributions
 novels-0.013
edly-0.012
rily-0.012
 av-0.012
der-0.011
Token could
Token ID714
Feature activation+8.71e-01

Pos logit contributions
 novels+0.100
 av+0.096
ados+0.090
 brace+0.088
 broccoli+0.087
Neg logit contributions
aver-0.057
 pipe-0.048
 afraid-0.048
 suppose-0.047
 dared-0.044
Token request
Token ID2581
Feature activation+7.21e-01

Pos logit contributions
aver+0.019
 pipe+0.018
 know+0.017
 suppose+0.017
 dare+0.016
Neg logit contributions
 novels-0.047
 av-0.043
ados-0.042
 Elsa-0.042
edly-0.042
Token some
Token ID617
Feature activation-1.63e+00

Pos logit contributions
 warned+0.015
aver+0.015
 pipe+0.014
 afraid+0.014
 suppose+0.013
Neg logit contributions
edly-0.035
lor-0.034
Bon-0.033
comb-0.032
rily-0.031
Token av
Token ID1196
Feature activation-8.83e+00

Pos logit contributions
 Elsa+0.042
edly+0.041
 vows+0.040
Bon+0.040
comb+0.039
Neg logit contributions
aver-0.073
 pipe-0.069
 warned-0.068
 afraid-0.068
 worried-0.066
Tokenoc
Token ID420
Feature activation+3.71e-01

Pos logit contributions
 novels+0.362
 av+0.354
 broccoli+0.319
 Sunny+0.313
 Ivy+0.313
Neg logit contributions
aver-0.315
 pipe-0.280
 afraid-0.259
 suppose-0.252
b-0.238
Tokenados
Token ID22484
Feature activation-4.69e+00

Pos logit contributions
aver+0.019
 pipe+0.018
avers+0.017
b+0.017
 warned+0.016
Neg logit contributions
 Elsa-0.007
 Sunny-0.007
 Ivy-0.007
Bon-0.006
 novels-0.006
Token from
Token ID422
Feature activation-2.70e+00

Pos logit contributions
 novels+0.235
 vows+0.225
 av+0.213
 cere+0.210
 unic+0.209
Neg logit contributions
 warned-0.109
 afraid-0.097
aver-0.097
 pipe-0.096
 said-0.095
Token the
Token ID262
Feature activation+5.74e-01

Pos logit contributions
 novels+0.099
cakes+0.098
paste+0.097
 vows+0.095
lor+0.092
Neg logit contributions
 pipe-0.090
aver-0.088
 afraid-0.084
 worried-0.084
 warned-0.082
Token store
Token ID3650
Feature activation+2.79e-01

Pos logit contributions
 pipe+0.018
aver+0.016
 worried+0.015
 afraid+0.015
 saw+0.014
Neg logit contributions
es-0.024
 novels-0.024
edly-0.024
der-0.024
rily-0.023
Token happy
Token ID3772
Feature activation-1.56e+00

Pos logit contributions
aver+0.009
 afraid+0.007
 pipe+0.007
 worried+0.006
 suppose+0.006
Neg logit contributions
 novels-0.019
 av-0.018
 brace-0.017
ados-0.017
 Elsa-0.017
Token that
Token ID326
Feature activation+7.59e-01

Pos logit contributions
 novels+0.075
 av+0.072
 brace+0.066
Elsa+0.066
 Elsa+0.065
Neg logit contributions
 pipe-0.043
aver-0.042
 afraid-0.040
 warned-0.038
 suppose-0.037
Token it
Token ID340
Feature activation-1.80e+00

Pos logit contributions
 pipe+0.020
 afraid+0.020
aver+0.019
 warned+0.019
 suppose+0.019
Neg logit contributions
 novels-0.035
edly-0.033
cakes-0.032
 av-0.032
 vows-0.032
Token produced
Token ID4635
Feature activation-1.19e+00

Pos logit contributions
 novels+0.109
 av+0.099
ados+0.096
 brace+0.092
 broccoli+0.091
Neg logit contributions
aver-0.037
 dared-0.030
 wanted-0.030
 afraid-0.030
 warned-0.029
Token more
Token ID517
Feature activation-1.73e+00

Pos logit contributions
 novels+0.057
 Elsa+0.053
 vows+0.050
cakes+0.050
paste+0.050
Neg logit contributions
aver-0.032
 pipe-0.029
 afraid-0.027
 warned-0.027
 scared-0.027
Token av
Token ID1196
Feature activation-8.80e+00

Pos logit contributions
 Elsa+0.058
 novels+0.055
 Midnight+0.054
 Sunny+0.053
 vows+0.053
Neg logit contributions
aver-0.066
 pipe-0.065
 afraid-0.065
 scared-0.061
 worried-0.061
Tokenoc
Token ID420
Feature activation+5.24e-02

Pos logit contributions
 novels+0.414
 vows+0.382
 av+0.381
 Ivy+0.380
 Elsa+0.378
Neg logit contributions
aver-0.258
 pipe-0.224
b-0.205
 afraid-0.201
aking-0.200
Tokenados
Token ID22484
Feature activation-4.55e+00

Pos logit contributions
aver+0.003
 pipe+0.002
b+0.002
avers+0.002
 claw+0.002
Neg logit contributions
 Elsa-0.001
 novels-0.001
 Sunny-0.001
 Ivy-0.001
Elsa-0.001
Token than
Token ID621
Feature activation+1.05e+00

Pos logit contributions
 novels+0.226
 vows+0.197
Elsa+0.193
 Elsa+0.192
 Ivy+0.191
Neg logit contributions
 pipe-0.117
 warned-0.110
 afraid-0.109
 scared-0.105
b-0.105
Token ever
Token ID1683
Feature activation-5.91e-01

Pos logit contributions
 afraid+0.035
 pipe+0.033
aver+0.033
 warned+0.032
 scared+0.031
Neg logit contributions
 novels-0.042
cakes-0.038
 vows-0.038
Elsa-0.037
 Elsa-0.037
Token before
Token ID878
Feature activation+4.70e-01

Pos logit contributions
 novels+0.024
 av+0.022
cakes+0.021
 Ivy+0.020
 vows+0.020
Neg logit contributions
aver-0.021
 afraid-0.020
 pipe-0.019
 suppose-0.018
 seen-0.018
Token the
Token ID262
Feature activation-2.04e-01

Pos logit contributions
aver+0.093
 afraid+0.092
 suppose+0.091
 pipe+0.090
 warned+0.088
Neg logit contributions
cakes-0.111
 novels-0.110
quet-0.108
 vows-0.108
edly-0.106
Token tree
Token ID5509
Feature activation-3.29e-01

Pos logit contributions
 novels+0.009
rily+0.009
 vows+0.008
 Elsa+0.008
es+0.008
Neg logit contributions
aver-0.006
 pipe-0.006
 afraid-0.005
 worried-0.005
 scared-0.005
Token if
Token ID611
Feature activation+5.02e-01

Pos logit contributions
 novels+0.014
 av+0.013
 broccoli+0.012
 brace+0.012
 vows+0.012
Neg logit contributions
aver-0.012
 pipe-0.011
 afraid-0.010
 suppose-0.010
 worried-0.009
Token they
Token ID484
Feature activation-7.20e-01

Pos logit contributions
 afraid+0.017
 pipe+0.017
 warned+0.017
aver+0.017
 worried+0.016
Neg logit contributions
 novels-0.020
 av-0.017
 cere-0.017
edly-0.017
der-0.017
Token were
Token ID547
Feature activation-9.03e-01

Pos logit contributions
 novels+0.038
 av+0.036
ados+0.034
 broccoli+0.034
 Elsa+0.033
Neg logit contributions
aver-0.019
 pipe-0.016
 afraid-0.016
 suppose-0.015
 dared-0.014
Token av
Token ID1196
Feature activation-8.74e+00

Pos logit contributions
 novels+0.046
 av+0.043
 vows+0.040
ados+0.040
 broccoli+0.040
Neg logit contributions
aver-0.026
 afraid-0.022
 pipe-0.020
 worried-0.019
 suppose-0.019
Tokenoc
Token ID420
Feature activation+5.38e-01

Pos logit contributions
 novels+0.362
 av+0.354
 broccoli+0.320
 Sunny+0.314
 Ivy+0.312
Neg logit contributions
aver-0.312
 pipe-0.277
 afraid-0.257
 suppose-0.248
 bet-0.235
Tokenados
Token ID22484
Feature activation-4.51e+00

Pos logit contributions
aver+0.028
 pipe+0.027
avers+0.025
 claw+0.025
b+0.025
Neg logit contributions
 Elsa-0.010
 Sunny-0.010
 novels-0.009
 Ivy-0.009
Elsa-0.009
Token,
Token ID11
Feature activation-8.10e-01

Pos logit contributions
 novels+0.235
 vows+0.213
 av+0.207
 cere+0.198
 journals+0.197
Neg logit contributions
 pipe-0.107
 warned-0.106
aver-0.104
 afraid-0.102
 scared-0.093
Token and
Token ID290
Feature activation-6.44e-01

Pos logit contributions
 novels+0.045
 av+0.043
 vows+0.041
 brace+0.039
Bon+0.039
Neg logit contributions
aver-0.017
 pipe-0.016
 afraid-0.014
 suppose-0.013
 warned-0.013
Token they
Token ID484
Feature activation-5.03e-02

Pos logit contributions
 novels+0.025
 av+0.023
Bon+0.021
 vows+0.021
 brace+0.021
Neg logit contributions
aver-0.024
 pipe-0.022
 afraid-0.022
 warned-0.021
 suppose-0.020
Token says
Token ID1139
Feature activation+1.50e+00

Pos logit contributions
 novels+0.039
 av+0.039
 broccoli+0.035
 brace+0.035
ados+0.035
Neg logit contributions
aver-0.014
 pipe-0.012
 afraid-0.010
 suppose-0.010
 bet-0.008
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
aver+0.043
 pipe+0.040
 afraid+0.038
 suppose+0.035
 worried+0.034
Neg logit contributions
 novels-0.071
 av-0.068
 broccoli-0.062
Bon-0.061
 Elsa-0.061
Token "
Token ID366
Feature activation-8.55e-01

Pos logit contributions
 novels+0.071
 av+0.070
 brace+0.063
ados+0.062
 broccoli+0.061
Neg logit contributions
aver-0.043
 pipe-0.038
 afraid-0.036
 suppose-0.035
 worried-0.033
TokenThey
Token ID2990
Feature activation+4.42e-01

Pos logit contributions
 novels+0.029
 av+0.028
 broccoli+0.024
 brace+0.024
 Sunny+0.024
Neg logit contributions
aver-0.038
 pipe-0.034
 afraid-0.033
 suppose-0.032
 worried-0.030
Token are
Token ID389
Feature activation-1.74e+00

Pos logit contributions
aver+0.016
 pipe+0.015
 afraid+0.014
 suppose+0.013
 bet+0.013
Neg logit contributions
 novels-0.018
 av-0.017
 broccoli-0.015
 Elsa-0.015
ados-0.015
Token av
Token ID1196
Feature activation-8.66e+00

Pos logit contributions
 novels+0.075
 av+0.073
edly+0.068
 vows+0.068
der+0.067
Neg logit contributions
aver-0.058
 pipe-0.053
 afraid-0.051
 suppose-0.045
 worried-0.045
Tokenoc
Token ID420
Feature activation+1.87e-01

Pos logit contributions
 novels+0.342
 av+0.339
 broccoli+0.296
 brace+0.292
 Sunny+0.289
Neg logit contributions
aver-0.326
 pipe-0.294
 afraid-0.273
 suppose-0.269
 bet-0.251
Tokenados
Token ID22484
Feature activation-4.29e+00

Pos logit contributions
aver+0.010
 pipe+0.010
avers+0.009
 claw+0.009
idd+0.009
Neg logit contributions
 novels-0.003
 Elsa-0.003
 Sunny-0.003
Elsa-0.003
 Ivy-0.003
Token.
Token ID13
Feature activation+3.94e-01

Pos logit contributions
 novels+0.186
 av+0.174
 vows+0.173
 Ivy+0.164
 brace+0.160
Neg logit contributions
aver-0.140
 pipe-0.134
 afraid-0.127
 warned-0.118
 scared-0.116
Token My
Token ID2011
Feature activation+9.55e-01

Pos logit contributions
aver+0.013
 pipe+0.012
 afraid+0.011
 suppose+0.011
 bet+0.011
Neg logit contributions
 novels-0.017
 av-0.016
 brace-0.015
 broccoli-0.015
 vows-0.014
Token mom
Token ID1995
Feature activation-5.20e-01

Pos logit contributions
 pipe+0.026
aver+0.026
 afraid+0.023
 suppose+0.021
 bet+0.020
Neg logit contributions
 novels-0.043
 av-0.041
 Ivy-0.041
Bon-0.040
 Elsa-0.039
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
aver+0.049
 pipe+0.045
 afraid+0.041
 suppose+0.041
 bet+0.039
Neg logit contributions
 novels-0.042
 av-0.042
 broccoli-0.036
 Sunny-0.036
 brace-0.035
Token there
Token ID612
Feature activation+2.00e+00

Pos logit contributions
 novels+0.042
 av+0.041
 Sunny+0.039
 broccoli+0.038
 Ivy+0.036
Neg logit contributions
aver-0.041
 pipe-0.038
 suppose-0.035
inter-0.032
eping-0.032
Token was
Token ID373
Feature activation+1.01e+00

Pos logit contributions
aver+0.072
 pipe+0.064
 afraid+0.059
 suppose+0.058
avers+0.055
Neg logit contributions
 novels-0.082
 av-0.081
 broccoli-0.073
 Sunny-0.072
 Ivy-0.071
Token a
Token ID257
Feature activation-2.56e-01

Pos logit contributions
aver+0.041
 pipe+0.038
 suppose+0.035
 afraid+0.035
 bet+0.033
Neg logit contributions
 novels-0.037
 av-0.037
 broccoli-0.033
 Sunny-0.032
 brace-0.031
Token loud
Token ID7812
Feature activation-1.89e-01

Pos logit contributions
 av+0.012
 novels+0.012
 broccoli+0.011
 Sunny+0.011
 Elsa+0.010
Neg logit contributions
aver-0.008
 pipe-0.007
 suppose-0.006
 afraid-0.006
b-0.006
Token av
Token ID1196
Feature activation-8.66e+00

Pos logit contributions
 novels+0.008
 av+0.008
 broccoli+0.007
 brace+0.007
 Sunny+0.007
Neg logit contributions
aver-0.006
 pipe-0.005
 afraid-0.005
 suppose-0.005
 bet-0.005
Tokenac
Token ID330
Feature activation-3.24e+00

Pos logit contributions
 novels+0.301
 av+0.297
 brace+0.254
 broccoli+0.245
ados+0.244
Neg logit contributions
aver-0.358
 pipe-0.333
 suppose-0.332
 afraid-0.317
 bet-0.300
Tokenado
Token ID4533
Feature activation+3.16e+00

Pos logit contributions
 novels+0.120
 av+0.118
 brace+0.100
ados+0.099
 cand+0.099
Neg logit contributions
aver-0.121
 suppose-0.121
 pipe-0.119
 afraid-0.117
 bet-0.117
Token.
Token ID13
Feature activation-6.24e-01

Pos logit contributions
aver+0.120
 pipe+0.108
 afraid+0.101
 suppose+0.100
 bet+0.093
Neg logit contributions
 novels-0.124
 av-0.122
 broccoli-0.107
 brace-0.105
 Sunny-0.104
Token It
Token ID632
Feature activation+8.34e-01

Pos logit contributions
 novels+0.024
 av+0.024
 broccoli+0.021
 Sunny+0.020
ados+0.020
Neg logit contributions
aver-0.024
 pipe-0.022
 afraid-0.021
 suppose-0.020
b-0.019
Token was
Token ID373
Feature activation+3.28e-01

Pos logit contributions
aver+0.028
 pipe+0.025
 suppose+0.023
 afraid+0.023
 bet+0.021
Neg logit contributions
 novels-0.036
 av-0.035
 broccoli-0.032
 Sunny-0.031
 brace-0.031
Token said
Token ID531
Feature activation+1.13e+00

Pos logit contributions
aver+0.005
 pipe+0.004
 afraid+0.004
 suppose+0.004
 warned+0.003
Neg logit contributions
 novels-0.014
 av-0.014
ados-0.013
 broccoli-0.013
 brace-0.013
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
aver+0.032
 afraid+0.030
 worried+0.030
 pipe+0.029
 scared+0.029
Neg logit contributions
 novels-0.054
 av-0.049
 cere-0.049
Bon-0.046
 parties-0.046
Token "
Token ID366
Feature activation-7.20e-01

Pos logit contributions
 novels+0.082
 av+0.081
 brace+0.072
 broccoli+0.071
 vows+0.071
Neg logit contributions
aver-0.058
 pipe-0.053
 afraid-0.050
 suppose-0.048
 worried-0.045
TokenI
Token ID40
Feature activation+8.36e-01

Pos logit contributions
 av+0.022
 novels+0.021
 broccoli+0.019
 brace+0.019
Bon+0.018
Neg logit contributions
aver-0.032
 pipe-0.030
 afraid-0.028
 suppose-0.027
avers-0.026
Token love
Token ID1842
Feature activation-2.59e+00

Pos logit contributions
aver+0.026
 pipe+0.022
avers+0.020
eping+0.019
 afraid+0.018
Neg logit contributions
 av-0.037
 novels-0.034
 broccoli-0.033
 Ivy-0.033
 vows-0.032
Token av
Token ID1196
Feature activation-8.64e+00

Pos logit contributions
 novels+0.101
 vows+0.099
edly+0.098
Bon+0.096
Elsa+0.090
Neg logit contributions
 pipe-0.086
 afraid-0.077
 warned-0.077
aver-0.075
 worried-0.073
Tokenoc
Token ID420
Feature activation+5.35e-01

Pos logit contributions
 novels+0.362
 av+0.353
 broccoli+0.319
 vows+0.317
 Sunny+0.317
Neg logit contributions
aver-0.299
 pipe-0.265
 afraid-0.246
 suppose-0.237
b-0.227
Tokenados
Token ID22484
Feature activation-4.31e+00

Pos logit contributions
aver+0.027
 pipe+0.026
 claw+0.024
 warned+0.024
b+0.024
Neg logit contributions
 Elsa-0.010
 novels-0.010
 Sunny-0.010
Bon-0.009
 Ivy-0.009
Token!
Token ID0
Feature activation+7.44e-01

Pos logit contributions
 vows+0.212
 novels+0.205
growth+0.196
 jew+0.193
Elsa+0.193
Neg logit contributions
aver-0.089
 warned-0.088
 afraid-0.088
 pipe-0.087
 careful-0.079
Token They
Token ID1119
Feature activation+4.69e-01

Pos logit contributions
aver+0.025
 pipe+0.023
 suppose+0.022
 afraid+0.021
 bet+0.020
Neg logit contributions
 novels-0.032
 av-0.031
 brace-0.028
 broccoli-0.027
 vows-0.027
Token are
Token ID389
Feature activation-1.94e+00

Pos logit contributions
aver+0.013
 pipe+0.013
 afraid+0.012
 warned+0.012
 bet+0.011
Neg logit contributions
 novels-0.023
 av-0.020
edly-0.020
 Elsa-0.019
 Sunny-0.019
Token "
Token ID366
Feature activation-3.19e-01

Pos logit contributions
 novels+0.040
 av+0.040
 broccoli+0.035
 brace+0.035
ados+0.034
Neg logit contributions
aver-0.031
 pipe-0.028
 afraid-0.026
 suppose-0.026
 worried-0.024
TokenI
Token ID40
Feature activation+1.13e+00

Pos logit contributions
 av+0.011
 novels+0.011
 Ivy+0.010
 Sunny+0.010
Bon+0.010
Neg logit contributions
aver-0.013
 pipe-0.012
 afraid-0.011
 suppose-0.010
 engine-0.010
Token will
Token ID481
Feature activation-1.24e+00

Pos logit contributions
aver+0.035
 pipe+0.033
avers+0.027
eping+0.026
 engine+0.025
Neg logit contributions
 av-0.047
 novels-0.045
 Ivy-0.044
 Sunny-0.042
 broccoli-0.042
Token buy
Token ID2822
Feature activation-1.74e+00

Pos logit contributions
 novels+0.057
 av+0.056
 broccoli+0.049
 brace+0.049
ados+0.049
Neg logit contributions
aver-0.040
 pipe-0.035
 afraid-0.033
 suppose-0.032
 bet-0.030
Token more
Token ID517
Feature activation-1.54e+00

Pos logit contributions
 novels+0.076
edly+0.074
 vows+0.074
comb+0.073
Bon+0.071
Neg logit contributions
 pipe-0.044
aver-0.042
 warned-0.040
 afraid-0.039
 careful-0.038
Token av
Token ID1196
Feature activation-8.63e+00

Pos logit contributions
 novels+0.057
 av+0.057
 Ivy+0.053
 Sunny+0.053
 Elsa+0.053
Neg logit contributions
aver-0.056
 pipe-0.054
 afraid-0.048
 suppose-0.046
b-0.045
Tokenoc
Token ID420
Feature activation+9.56e-01

Pos logit contributions
 novels+0.380
 av+0.367
 vows+0.349
 Ivy+0.346
 Elsa+0.345
Neg logit contributions
aver-0.277
 pipe-0.237
b-0.216
 afraid-0.214
aking-0.210
Tokenados
Token ID22484
Feature activation-4.56e+00

Pos logit contributions
aver+0.050
 pipe+0.047
b+0.045
avers+0.044
 claw+0.044
Neg logit contributions
 Elsa-0.018
 Sunny-0.017
 Ivy-0.017
 novels-0.016
Bon-0.015
Token tomorrow
Token ID9439
Feature activation-1.93e+00

Pos logit contributions
 novels+0.219
 vows+0.212
 av+0.209
 unic+0.199
 brace+0.198
Neg logit contributions
aver-0.105
 warned-0.100
 pipe-0.097
b-0.093
 afraid-0.093
Token.
Token ID13
Feature activation+4.87e-01

Pos logit contributions
 av+0.090
 novels+0.087
 brace+0.080
 unic+0.078
 Ivy+0.078
Neg logit contributions
 pipe-0.051
aver-0.049
 afraid-0.043
 warned-0.042
 suppose-0.042
Token You
Token ID921
Feature activation+8.76e-02

Pos logit contributions
aver+0.016
 pipe+0.014
 suppose+0.013
 afraid+0.013
 bet+0.012
Neg logit contributions
 av-0.022
 novels-0.021
 brace-0.019
 broccoli-0.019
ados-0.019
Token away
Token ID1497
Feature activation-5.16e-01

Pos logit contributions
aver+0.006
 pipe+0.005
 afraid+0.005
 worried+0.005
 scared+0.005
Neg logit contributions
 novels-0.008
 av-0.008
 vows-0.008
der-0.007
Bon-0.007
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
 novels+0.026
 av+0.024
 journals+0.023
 vows+0.023
ados+0.022
Neg logit contributions
 afraid-0.015
 scared-0.015
 worried-0.014
 frightened-0.013
 careful-0.012
Token She
Token ID1375
Feature activation-1.56e+00

Pos logit contributions
 av+0.009
 novels+0.009
 brace+0.008
 broccoli+0.008
 vows+0.008
Neg logit contributions
aver-0.007
 pipe-0.006
 afraid-0.006
 suppose-0.006
 bet-0.006
Token learned
Token ID4499
Feature activation-3.16e+00

Pos logit contributions
 novels+0.087
 av+0.084
ados+0.081
 broccoli+0.074
 Ivy+0.073
Neg logit contributions
aver-0.038
 pipe-0.031
 afraid-0.029
 warned-0.029
 dared-0.028
Token that
Token ID326
Feature activation+1.39e+00

Pos logit contributions
 av+0.147
 novels+0.146
 vows+0.138
 cere+0.137
ados+0.132
Neg logit contributions
 pipe-0.080
 afraid-0.077
aver-0.076
 warned-0.074
 worried-0.071
Token av
Token ID1196
Feature activation-8.63e+00

Pos logit contributions
 pipe+0.046
aver+0.044
 afraid+0.042
 suppose+0.040
 warned+0.039
Neg logit contributions
 novels-0.056
 av-0.055
 vows-0.052
cakes-0.050
edly-0.049
Tokenoc
Token ID420
Feature activation+5.62e-02

Pos logit contributions
 novels+0.334
 av+0.331
 broccoli+0.289
 brace+0.285
ados+0.281
Neg logit contributions
aver-0.332
 pipe-0.300
 afraid-0.280
 suppose-0.276
 bet-0.256
Tokenados
Token ID22484
Feature activation-4.58e+00

Pos logit contributions
 pipe+0.003
aver+0.003
b+0.002
 claw+0.002
 suppose+0.002
Neg logit contributions
 Elsa-0.001
 novels-0.001
Bon-0.001
 Sunny-0.001
Elsa-0.001
Token are
Token ID389
Feature activation-2.05e+00

Pos logit contributions
 novels+0.210
 av+0.203
 vows+0.180
 Sunny+0.179
 Elsa+0.179
Neg logit contributions
 pipe-0.137
aver-0.135
 afraid-0.128
 suppose-0.119
 warned-0.119
Token good
Token ID922
Feature activation-3.34e+00

Pos logit contributions
 novels+0.101
 av+0.097
 vows+0.094
edly+0.091
ados+0.091
Neg logit contributions
aver-0.057
 afraid-0.052
 pipe-0.051
 suppose-0.047
 careful-0.046
Token for
Token ID329
Feature activation-2.38e+00

Pos logit contributions
 novels+0.131
 av+0.129
 Sunny+0.112
 brace+0.112
 broccoli+0.111
Neg logit contributions
aver-0.126
 pipe-0.117
 afraid-0.111
 suppose-0.107
 warned-0.100
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
 novels+0.166
 vows+0.158
 av+0.152
Rober+0.148
Bon+0.148
Neg logit contributions
 pipe-0.086
aver-0.080
 afraid-0.071
 warned-0.070
 scared-0.067
Token Mom
Token ID11254
Feature activation-6.63e-01

Pos logit contributions
 av+0.014
 novels+0.014
ils+0.013
quet+0.013
 brace+0.012
Neg logit contributions
aver-0.011
 afraid-0.011
 pipe-0.010
 bet-0.010
 worried-0.010
Token says
Token ID1139
Feature activation+1.51e+00

Pos logit contributions
 novels+0.034
 av+0.034
 broccoli+0.031
 brace+0.030
 Sunny+0.030
Neg logit contributions
aver-0.017
 pipe-0.015
 afraid-0.013
 suppose-0.013
 bet-0.011
Token they
Token ID484
Feature activation-8.10e-01

Pos logit contributions
aver+0.033
 pipe+0.031
 afraid+0.031
 warned+0.030
 worried+0.028
Neg logit contributions
 novels-0.074
edly-0.072
rily-0.068
Bon-0.067
 masse-0.067
Token are
Token ID389
Feature activation-1.81e+00

Pos logit contributions
 novels+0.039
 av+0.039
 brace+0.035
 broccoli+0.035
ados+0.035
Neg logit contributions
aver-0.022
 pipe-0.020
 afraid-0.019
 suppose-0.018
 bet-0.017
Token av
Token ID1196
Feature activation-8.61e+00

Pos logit contributions
 novels+0.086
der+0.082
 av+0.082
edly+0.080
Bon+0.077
Neg logit contributions
aver-0.054
 afraid-0.047
 pipe-0.044
 worried-0.043
 scared-0.041
Tokenoc
Token ID420
Feature activation+8.58e-02

Pos logit contributions
 novels+0.354
 av+0.350
 broccoli+0.312
 Sunny+0.305
 Ivy+0.304
Neg logit contributions
aver-0.308
 pipe-0.274
 afraid-0.254
 suppose-0.245
 bet-0.232
Tokenados
Token ID22484
Feature activation-4.38e+00

Pos logit contributions
aver+0.004
 pipe+0.004
idd+0.004
avers+0.004
 claw+0.004
Neg logit contributions
 novels-0.002
 Elsa-0.002
Elsa-0.001
Bon-0.001
 Sunny-0.001
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
 novels+0.199
 av+0.192
 vows+0.180
Rober+0.176
ados+0.174
Neg logit contributions
 pipe-0.128
aver-0.124
 afraid-0.115
 warned-0.111
 scared-0.104
Token Sara
Token ID24799
Feature activation-6.14e-01

Pos logit contributions
 novels+0.010
 av+0.010
 brace+0.009
ils+0.009
 unic+0.009
Neg logit contributions
aver-0.009
 afraid-0.008
 pipe-0.008
 suppose-0.008
 worried-0.008
Token wants
Token ID3382
Feature activation+1.75e+00

Pos logit contributions
 novels+0.033
 av+0.033
ados+0.030
 Ivy+0.029
 Sunny+0.029
Neg logit contributions
aver-0.013
 pipe-0.012
 afraid-0.011
 suppose-0.011
 bet-0.010
Token tree
Token ID5509
Feature activation-4.42e-01

Pos logit contributions
 novels+0.019
 av+0.018
 brace+0.018
 Sunny+0.017
 Elsa+0.017
Neg logit contributions
aver-0.013
 pipe-0.012
 afraid-0.011
 worried-0.010
 suppose-0.010
Token was
Token ID373
Feature activation-9.45e-02

Pos logit contributions
 novels+0.022
 av+0.021
 Elsa+0.019
 broccoli+0.019
 Ivy+0.019
Neg logit contributions
aver-0.013
 pipe-0.011
 afraid-0.010
 suppose-0.010
 warned-0.009
Token sad
Token ID6507
Feature activation+1.20e+00

Pos logit contributions
 novels+0.004
 av+0.004
 brace+0.004
ados+0.004
 broccoli+0.004
Neg logit contributions
aver-0.003
 pipe-0.003
 afraid-0.003
 suppose-0.002
 worried-0.002
Token that
Token ID326
Feature activation+1.53e+00

Pos logit contributions
 pipe+0.034
 afraid+0.032
aver+0.031
 scared+0.029
 worried+0.028
Neg logit contributions
 av-0.057
 novels-0.056
 brace-0.052
 vows-0.051
ados-0.050
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 pipe+0.045
aver+0.042
 afraid+0.042
 warned+0.039
 suppose+0.039
Neg logit contributions
 novels-0.065
 av-0.063
 brace-0.062
cakes-0.061
edly-0.060
Token av
Token ID1196
Feature activation-8.59e+00

Pos logit contributions
 novels+0.043
 Elsa+0.040
 av+0.040
es+0.039
 brace+0.039
Neg logit contributions
aver-0.031
 pipe-0.031
 afraid-0.028
 worried-0.026
 scared-0.025
Tokenoc
Token ID420
Feature activation+1.22e+00

Pos logit contributions
 novels+0.378
 av+0.359
 vows+0.336
 Elsa+0.335
 Ivy+0.335
Neg logit contributions
aver-0.284
 pipe-0.250
 afraid-0.226
b-0.218
 suppose-0.212
Tokenados
Token ID22484
Feature activation-4.48e+00

Pos logit contributions
 pipe+0.062
aver+0.061
 claw+0.056
avers+0.054
b+0.054
Neg logit contributions
 Elsa-0.027
 Sunny-0.023
 Ivy-0.023
Bon-0.022
 Midnight-0.022
Token were
Token ID547
Feature activation-7.29e-01

Pos logit contributions
 novels+0.216
 av+0.204
ados+0.186
 Elsa+0.185
 brace+0.185
Neg logit contributions
aver-0.131
 pipe-0.126
 afraid-0.115
 warned-0.111
 scared-0.107
Token being
Token ID852
Feature activation-5.74e-01

Pos logit contributions
 novels+0.033
 av+0.030
 Elsa+0.029
cakes+0.029
Bon+0.028
Neg logit contributions
aver-0.024
 afraid-0.022
 pipe-0.021
 worried-0.020
 scared-0.020
Token taken
Token ID2077
Feature activation-5.67e-01

Pos logit contributions
 novels+0.030
 av+0.029
 vows+0.027
 brace+0.027
 broccoli+0.026
Neg logit contributions
aver-0.015
 afraid-0.012
 pipe-0.012
 worried-0.011
 suppose-0.010
Token Everyday
Token ID48154
Feature activation+1.38e-01

Pos logit contributions
 novels+0.027
 av+0.026
ados+0.024
 broccoli+0.024
 brace+0.023
Neg logit contributions
aver-0.018
 pipe-0.016
 afraid-0.015
 suppose-0.015
 warned-0.015
Token he
Token ID339
Feature activation-8.88e-01

Pos logit contributions
aver+0.004
 pipe+0.004
 afraid+0.004
 warned+0.004
 scared+0.004
Neg logit contributions
 av-0.006
 novels-0.005
 brace-0.005
ados-0.005
 vows-0.005
Token would
Token ID561
Feature activation-9.66e-01

Pos logit contributions
 novels+0.045
 av+0.045
ados+0.041
 broccoli+0.040
 brace+0.040
Neg logit contributions
aver-0.024
 pipe-0.020
 afraid-0.019
 suppose-0.019
 bet-0.018
Token take
Token ID1011
Feature activation-1.86e+00

Pos logit contributions
 novels+0.049
 av+0.049
ados+0.046
cakes+0.044
 broccoli+0.043
Neg logit contributions
aver-0.025
 pipe-0.022
 afraid-0.021
 suppose-0.020
 bet-0.020
Token the
Token ID262
Feature activation-8.00e-01

Pos logit contributions
 av+0.079
der+0.075
 novels+0.073
cakes+0.073
cake+0.071
Neg logit contributions
aver-0.056
 pipe-0.052
 afraid-0.049
b-0.046
 scared-0.045
Token av
Token ID1196
Feature activation-8.57e+00

Pos logit contributions
 novels+0.032
 av+0.032
 Ivy+0.028
ados+0.028
 Sunny+0.028
Neg logit contributions
aver-0.028
 pipe-0.027
 afraid-0.024
 suppose-0.023
 bet-0.022
Tokenoc
Token ID420
Feature activation+1.89e+00

Pos logit contributions
 novels+0.373
 av+0.355
 vows+0.342
 Ivy+0.341
 Elsa+0.338
Neg logit contributions
aver-0.276
 pipe-0.242
b-0.219
aking-0.217
 afraid-0.211
Tokenados
Token ID22484
Feature activation-5.25e+00

Pos logit contributions
aver+0.101
 pipe+0.101
b+0.092
 claw+0.091
 engine+0.089
Neg logit contributions
 Ivy-0.034
 Elsa-0.034
 Sunny-0.033
 novels-0.032
 Midnight-0.030
Token and
Token ID290
Feature activation+1.89e-01

Pos logit contributions
 novels+0.245
 vows+0.232
 av+0.229
 brace+0.222
 Ivy+0.220
Neg logit contributions
 pipe-0.146
aver-0.137
b-0.129
 warned-0.124
 afraid-0.120
Token scatter
Token ID41058
Feature activation-6.76e-02

Pos logit contributions
 pipe+0.006
aver+0.006
 warned+0.005
 afraid+0.005
 claw+0.005
Neg logit contributions
 novels-0.008
Bon-0.007
 Elsa-0.007
 Ivy-0.007
der-0.007
Token them
Token ID606
Feature activation-5.41e-01

Pos logit contributions
 novels+0.003
edly+0.003
 vows+0.003
comb+0.003
 Elsa+0.003
Neg logit contributions
 pipe-0.002
aver-0.002
 bet-0.002
 afraid-0.002
b-0.002
Token.
Token ID13
Feature activation+6.71e-01

Pos logit contributions
 novels+0.009
 av+0.009
 brace+0.008
 vows+0.008
 broccoli+0.008
Neg logit contributions
aver-0.009
 pipe-0.008
 afraid-0.008
 warned-0.007
 worried-0.007
Token Did
Token ID7731
Feature activation-8.78e-01

Pos logit contributions
aver+0.025
 pipe+0.023
 afraid+0.021
 suppose+0.021
b+0.019
Neg logit contributions
 novels-0.026
 av-0.026
 broccoli-0.023
 Sunny-0.022
 Elsa-0.022
Token you
Token ID345
Feature activation-5.34e-01

Pos logit contributions
 novels+0.036
 av+0.036
edly+0.033
 vows+0.033
cakes+0.032
Neg logit contributions
 pipe-0.027
aver-0.027
 afraid-0.025
 suppose-0.024
 warned-0.024
Token know
Token ID760
Feature activation+5.37e-01

Pos logit contributions
 av+0.025
 novels+0.025
 broccoli+0.022
 brace+0.022
 Sunny+0.022
Neg logit contributions
aver-0.016
 pipe-0.014
 afraid-0.013
 suppose-0.013
avers-0.012
Token that
Token ID326
Feature activation+1.25e+00

Pos logit contributions
 pipe+0.014
aver+0.014
 afraid+0.013
 suppose+0.012
 warned+0.012
Neg logit contributions
edly-0.024
 av-0.024
 novels-0.024
 vows-0.023
Bon-0.023
Token av
Token ID1196
Feature activation-8.53e+00

Pos logit contributions
 pipe+0.042
 afraid+0.039
aver+0.039
 scared+0.036
 warned+0.036
Neg logit contributions
 novels-0.050
edly-0.049
 av-0.048
 vows-0.047
ados-0.046
Tokenoc
Token ID420
Feature activation+8.19e-01

Pos logit contributions
 novels+0.341
 av+0.337
 broccoli+0.297
 brace+0.292
 Sunny+0.290
Neg logit contributions
aver-0.317
 pipe-0.285
 afraid-0.265
 suppose-0.259
 bet-0.243
Tokenados
Token ID22484
Feature activation-4.57e+00

Pos logit contributions
aver+0.045
 pipe+0.044
b+0.041
 warned+0.040
 claw+0.040
Neg logit contributions
 Elsa-0.014
 Sunny-0.013
 Ivy-0.013
 novels-0.012
Bon-0.012
Token are
Token ID389
Feature activation-1.83e+00

Pos logit contributions
 novels+0.227
 av+0.213
 Elsa+0.197
 Ivy+0.197
edly+0.196
Neg logit contributions
aver-0.119
 pipe-0.117
 warned-0.109
 afraid-0.109
 suppose-0.102
Token very
Token ID845
Feature activation-6.82e-02

Pos logit contributions
 novels+0.090
edly+0.089
 Ivy+0.084
 vows+0.083
Bon+0.083
Neg logit contributions
aver-0.049
 afraid-0.041
 pipe-0.040
 suppose-0.036
 warned-0.035
Token healthy
Token ID5448
Feature activation-2.50e+00

Pos logit contributions
 novels+0.003
 Elsa+0.003
 av+0.003
 brace+0.003
 Ivy+0.003
Neg logit contributions
aver-0.002
 pipe-0.002
 afraid-0.002
 worried-0.001
b-0.001
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
aver+0.013
 pipe+0.012
 warned+0.011
 afraid+0.011
 suppose+0.011
Neg logit contributions
 novels-0.019
 av-0.018
 brace-0.017
cakes-0.016
 vows-0.016
Token tree
Token ID5509
Feature activation-1.32e+00

Pos logit contributions
 novels+0.060
 av+0.055
 brace+0.055
 Elsa+0.054
 vows+0.054
Neg logit contributions
aver-0.042
 pipe-0.038
 afraid-0.035
 worried-0.033
 suppose-0.031
Token knew
Token ID2993
Feature activation-1.48e+00

Pos logit contributions
 novels+0.065
 av+0.062
 Elsa+0.056
 broccoli+0.056
 Ivy+0.056
Neg logit contributions
aver-0.040
 pipe-0.035
 afraid-0.031
 suppose-0.029
 warned-0.029
Token that
Token ID326
Feature activation+9.53e-01

Pos logit contributions
 novels+0.066
 av+0.062
 vows+0.057
 Elsa+0.057
ados+0.057
Neg logit contributions
aver-0.050
 pipe-0.047
 suppose-0.042
 warned-0.042
 afraid-0.042
Token the
Token ID262
Feature activation-9.27e-01

Pos logit contributions
 pipe+0.030
 afraid+0.028
aver+0.028
 warned+0.027
 suppose+0.026
Neg logit contributions
 novels-0.041
 vows-0.038
cakes-0.037
edly-0.037
 av-0.036
Token av
Token ID1196
Feature activation-8.50e+00

Pos logit contributions
 novels+0.029
 Elsa+0.026
 vows+0.026
es+0.026
 Sunny+0.025
Neg logit contributions
 pipe-0.039
aver-0.039
 afraid-0.037
 worried-0.036
 scared-0.035
Tokenoc
Token ID420
Feature activation+1.14e+00

Pos logit contributions
 novels+0.369
 av+0.351
 vows+0.325
 Elsa+0.324
 broccoli+0.324
Neg logit contributions
aver-0.285
 pipe-0.254
 afraid-0.231
b-0.220
 suppose-0.220
Tokenados
Token ID22484
Feature activation-4.88e+00

Pos logit contributions
 pipe+0.057
aver+0.056
 claw+0.052
b+0.050
idd+0.049
Neg logit contributions
 Elsa-0.026
 Sunny-0.022
 Ivy-0.022
 novels-0.022
Bon-0.022
Token would
Token ID561
Feature activation-7.29e-01

Pos logit contributions
 novels+0.246
 av+0.226
 Elsa+0.216
ados+0.210
 brace+0.209
Neg logit contributions
aver-0.130
 pipe-0.127
 afraid-0.117
 warned-0.116
 scared-0.111
Token be
Token ID307
Feature activation+1.73e-01

Pos logit contributions
 novels+0.041
 av+0.037
ados+0.035
 Elsa+0.035
cakes+0.034
Neg logit contributions
aver-0.016
 pipe-0.016
 know-0.014
 suppose-0.014
 afraid-0.014
Token back
Token ID736
Feature activation-2.12e+00

Pos logit contributions
aver+0.006
 afraid+0.005
 pipe+0.005
 worried+0.005
 suppose+0.005
Neg logit contributions
 novels-0.008
 av-0.007
 Elsa-0.007
 vows-0.007
cakes-0.007
Token girl
Token ID2576
Feature activation-4.59e-02

Pos logit contributions
aver+0.053
 pipe+0.046
 afraid+0.041
 suppose+0.041
 bet+0.036
Neg logit contributions
 novels-0.101
 av-0.101
 broccoli-0.091
 brace-0.089
 Sunny-0.089
Token who
Token ID508
Feature activation+3.37e+00

Pos logit contributions
 novels+0.002
 av+0.002
 Elsa+0.002
 Sunny+0.002
 broccoli+0.002
Neg logit contributions
aver-0.001
 suppose-0.001
 pipe-0.001
 afraid-0.001
 bet-0.001
Token didn
Token ID1422
Feature activation-1.20e+00

Pos logit contributions
aver+0.098
 pipe+0.085
 afraid+0.079
 suppose+0.075
 bet+0.069
Neg logit contributions
 av-0.164
 novels-0.164
 broccoli-0.146
ados-0.146
 brace-0.145
Token't
Token ID470
Feature activation+2.18e+00

Pos logit contributions
 av+0.055
 novels+0.054
ados+0.049
 brace+0.049
 cand+0.047
Neg logit contributions
aver-0.038
 pipe-0.035
 afraid-0.031
 suppose-0.030
 scared-0.028
Token like
Token ID588
Feature activation+5.18e-01

Pos logit contributions
aver+0.073
 pipe+0.067
 suppose+0.060
 afraid+0.059
avers+0.055
Neg logit contributions
 novels-0.094
 av-0.091
 broccoli-0.083
 brace-0.080
 Sunny-0.080
Token av
Token ID1196
Feature activation-8.47e+00

Pos logit contributions
 pipe+0.015
 afraid+0.014
 worried+0.013
 scared+0.013
 shouting+0.012
Neg logit contributions
 novels-0.022
 av-0.022
edly-0.021
 vows-0.021
Bon-0.021
Tokenoc
Token ID420
Feature activation+2.07e-01

Pos logit contributions
 novels+0.334
 av+0.329
 broccoli+0.293
 Ivy+0.291
 vows+0.289
Neg logit contributions
aver-0.312
 pipe-0.280
 afraid-0.263
 suppose-0.250
b-0.240
Tokenados
Token ID22484
Feature activation-4.17e+00

Pos logit contributions
 pipe+0.009
aver+0.009
 dared+0.009
 warned+0.009
 claw+0.009
Neg logit contributions
 Elsa-0.004
Bon-0.004
Elsa-0.004
 novels-0.004
 Sunny-0.004
Token.
Token ID13
Feature activation-8.66e-01

Pos logit contributions
 novels+0.179
 vows+0.172
 av+0.171
ados+0.161
rily+0.158
Neg logit contributions
 pipe-0.126
 afraid-0.123
aver-0.121
 warned-0.117
 scared-0.115
Token She
Token ID1375
Feature activation-1.78e+00

Pos logit contributions
 av+0.030
 novels+0.029
 vows+0.026
ils+0.026
 brace+0.025
Neg logit contributions
aver-0.033
 pipe-0.032
 afraid-0.030
 suppose-0.030
 bet-0.029
Token always
Token ID1464
Feature activation+2.12e-01

Pos logit contributions
 novels+0.099
 av+0.098
ados+0.093
 brace+0.089
 broccoli+0.089
Neg logit contributions
aver-0.039
 pipe-0.031
 afraid-0.028
 suppose-0.025
 dared-0.025
Token It
Token ID632
Feature activation+7.04e-01

Pos logit contributions
 novels+0.011
 av+0.011
 brace+0.010
 broccoli+0.010
 vows+0.010
Neg logit contributions
aver-0.009
 pipe-0.008
 afraid-0.008
 suppose-0.007
 bet-0.007
Token turned
Token ID2900
Feature activation+5.62e-01

Pos logit contributions
aver+0.021
 pipe+0.018
 afraid+0.017
 suppose+0.016
 worried+0.015
Neg logit contributions
 novels-0.034
 av-0.034
 broccoli-0.030
ados-0.030
 brace-0.030
Token out
Token ID503
Feature activation-1.50e+00

Pos logit contributions
aver+0.017
 pipe+0.017
 scared+0.017
 afraid+0.017
 worried+0.017
Neg logit contributions
 novels-0.025
 av-0.023
 vows-0.022
 brace-0.021
quet-0.021
Token that
Token ID326
Feature activation+1.44e+00

Pos logit contributions
 novels+0.071
 av+0.068
 cand+0.063
 vows+0.063
 brace+0.062
Neg logit contributions
 pipe-0.043
aver-0.043
 afraid-0.043
 scared-0.041
 worried-0.040
Token the
Token ID262
Feature activation-9.37e-01

Pos logit contributions
 pipe+0.046
aver+0.044
 afraid+0.044
 scared+0.042
 warned+0.041
Neg logit contributions
 novels-0.061
 av-0.059
ados-0.055
 vows-0.055
edly-0.054
Token av
Token ID1196
Feature activation-8.27e+00

Pos logit contributions
 novels+0.045
 av+0.044
 broccoli+0.039
 brace+0.039
ados+0.039
Neg logit contributions
aver-0.028
 pipe-0.024
 afraid-0.022
 suppose-0.021
 bet-0.019
Tokenoc
Token ID420
Feature activation+1.82e+00

Pos logit contributions
 novels+0.334
 av+0.329
 broccoli+0.291
 brace+0.285
 Sunny+0.284
Neg logit contributions
aver-0.305
 pipe-0.275
 afraid-0.254
 suppose-0.248
 bet-0.233
Tokenados
Token ID22484
Feature activation-4.57e+00

Pos logit contributions
 pipe+0.093
aver+0.093
 claw+0.085
 engine+0.082
b+0.082
Neg logit contributions
 Elsa-0.038
 Ivy-0.034
 Sunny-0.034
 novels-0.033
Bon-0.031
Token were
Token ID547
Feature activation-8.51e-01

Pos logit contributions
 novels+0.225
 av+0.210
ados+0.189
 Elsa+0.189
 broccoli+0.189
Neg logit contributions
aver-0.134
 pipe-0.129
 afraid-0.116
 warned-0.112
 scared-0.108
Token bad
Token ID2089
Feature activation-1.35e+00

Pos logit contributions
 novels+0.042
 av+0.039
 vows+0.037
 brace+0.036
 Elsa+0.036
Neg logit contributions
aver-0.025
 afraid-0.023
 pipe-0.022
 worried-0.021
 scared-0.020
Token and
Token ID290
Feature activation-1.59e-01

Pos logit contributions
 av+0.058
 novels+0.057
 Ivy+0.054
Rober+0.053
Elsa+0.052
Neg logit contributions
 pipe-0.041
aver-0.039
 afraid-0.039
 scared-0.035
 worried-0.034
Token B
Token ID347
Feature activation+1.51e+00

Pos logit contributions
 av+0.052
 novels+0.052
 brace+0.045
 broccoli+0.045
 Sunny+0.045
Neg logit contributions
aver-0.039
 pipe-0.034
 afraid-0.032
 suppose-0.031
 bet-0.029
Tokenongo
Token ID25162
Feature activation+7.83e-01

Pos logit contributions
aver+0.061
 pipe+0.055
 afraid+0.052
 suppose+0.051
 bet+0.048
Neg logit contributions
 novels-0.056
 av-0.055
 broccoli-0.048
 brace-0.047
 Sunny-0.046
Token loved
Token ID6151
Feature activation-9.96e-01

Pos logit contributions
aver+0.025
 pipe+0.022
 afraid+0.020
 suppose+0.019
 warned+0.017
Neg logit contributions
 novels-0.036
 av-0.036
 broccoli-0.032
ados-0.032
 Ivy-0.031
Token to
Token ID284
Feature activation-5.13e-01

Pos logit contributions
 novels+0.038
 av+0.037
 Ivy+0.034
 vows+0.034
 Elsa+0.034
Neg logit contributions
aver-0.036
 pipe-0.034
 afraid-0.031
 suppose-0.029
 warned-0.028
Token eat
Token ID4483
Feature activation-2.40e+00

Pos logit contributions
 novels+0.025
 av+0.025
der+0.023
ados+0.023
 Ivy+0.023
Neg logit contributions
aver-0.014
 pipe-0.013
 afraid-0.010
 know-0.010
 suppose-0.009
Token av
Token ID1196
Feature activation-8.22e+00

Pos logit contributions
 vows+0.094
edly+0.089
Elsa+0.087
 Elsa+0.085
lor+0.084
Neg logit contributions
 pipe-0.082
aver-0.073
 scared-0.071
 afraid-0.070
b-0.069
Tokenoc
Token ID420
Feature activation+6.33e-01

Pos logit contributions
 novels+0.332
 av+0.326
 vows+0.294
 broccoli+0.294
 Ivy+0.294
Neg logit contributions
aver-0.295
 pipe-0.261
 afraid-0.241
 suppose-0.230
b-0.227
Tokenados
Token ID22484
Feature activation-4.42e+00

Pos logit contributions
aver+0.032
 pipe+0.031
 claw+0.028
 warned+0.028
 dared+0.028
Neg logit contributions
 Elsa-0.012
 Ivy-0.011
Bon-0.011
 Sunny-0.010
 Claire-0.010
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 vows+0.207
 novels+0.193
 Merry+0.187
 Ivy+0.179
Rober+0.177
Neg logit contributions
 pipe-0.117
 warned-0.112
aver-0.112
b-0.112
 afraid-0.108
Token One
Token ID1881
Feature activation+1.36e+00

Pos logit contributions
 novels+0.040
 av+0.039
 broccoli+0.033
 Sunny+0.033
 brace+0.033
Neg logit contributions
aver-0.045
 pipe-0.040
 afraid-0.038
 suppose-0.036
eping-0.035
Token day
Token ID1110
Feature activation+5.14e-01

Pos logit contributions
aver+0.038
 suppose+0.035
 pipe+0.033
 afraid+0.032
 bet+0.031
Neg logit contributions
 novels-0.065
 av-0.063
 broccoli-0.060
 Sunny-0.057
 Christmas-0.056
Token recommendation
Token ID15602
Feature activation-1.41e+00

Pos logit contributions
aver+0.045
 pipe+0.040
 afraid+0.037
 suppose+0.036
 bet+0.033
Neg logit contributions
 novels-0.061
 av-0.061
 broccoli-0.054
 brace-0.054
 Sunny-0.053
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
 novels+0.067
 av+0.067
 brace+0.060
ados+0.059
 Ivy+0.058
Neg logit contributions
 pipe-0.040
aver-0.039
 warned-0.035
 afraid-0.034
 suppose-0.033
Token picked
Token ID6497
Feature activation-8.43e-01

Pos logit contributions
 novels+0.059
 av+0.054
ados+0.050
Bon+0.050
 brace+0.050
Neg logit contributions
aver-0.033
 pipe-0.030
 afraid-0.028
 warned-0.027
 suppose-0.027
Token the
Token ID262
Feature activation-7.57e-01

Pos logit contributions
 novels+0.039
 vows+0.037
edly+0.036
quet+0.035
cakes+0.035
Neg logit contributions
aver-0.020
 pipe-0.020
 afraid-0.019
 scared-0.018
 worried-0.018
Token right
Token ID826
Feature activation-2.31e+00

Pos logit contributions
 novels+0.017
 vows+0.017
edly+0.016
es+0.016
 Elsa+0.016
Neg logit contributions
 pipe-0.035
 worried-0.035
 afraid-0.035
aver-0.034
 scared-0.033
Token av
Token ID1196
Feature activation-8.21e+00

Pos logit contributions
 novels+0.105
 Elsa+0.099
 av+0.098
 Sunny+0.097
Bon+0.097
Neg logit contributions
aver-0.069
 pipe-0.064
 afraid-0.056
 suppose-0.053
 bet-0.052
Tokenoc
Token ID420
Feature activation+1.73e+00

Pos logit contributions
 novels+0.361
 av+0.347
 vows+0.319
 Ivy+0.318
 broccoli+0.317
Neg logit contributions
aver-0.271
 pipe-0.241
 afraid-0.215
b-0.209
 suppose-0.209
Tokenados
Token ID22484
Feature activation-4.27e+00

Pos logit contributions
 pipe+0.088
aver+0.088
 claw+0.080
b+0.078
idd+0.078
Neg logit contributions
 Elsa-0.035
 novels-0.032
 Sunny-0.032
 Ivy-0.030
Bon-0.029
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 novels+0.218
 vows+0.200
 av+0.186
 unic+0.183
 masse+0.181
Neg logit contributions
 pipe-0.105
 warned-0.104
b-0.095
 careful-0.095
 afraid-0.094
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
 novels+0.006
 av+0.006
 brace+0.006
 vows+0.006
 broccoli+0.006
Neg logit contributions
aver-0.005
 pipe-0.004
 suppose-0.004
 afraid-0.004
 bet-0.004
Token
Token ID198
Feature activation-9.02e-02

Pos logit contributions
aver+0.001
 pipe+0.001
 afraid+0.001
 suppose+0.001
 bet+0.001
Neg logit contributions
 novels-0.001
 av-0.001
 Sunny-0.001
 broccoli-0.001
 Ivy-0.001
Token mom
Token ID1995
Feature activation-6.88e-01

Pos logit contributions
 novels+0.025
 av+0.025
 broccoli+0.022
 brace+0.022
 Sunny+0.022
Neg logit contributions
aver-0.021
 pipe-0.019
 afraid-0.017
 suppose-0.017
 bet-0.016
Token in
Token ID287
Feature activation+5.46e-01

Pos logit contributions
 novels+0.029
 av+0.028
 broccoli+0.025
 brace+0.025
ados+0.024
Neg logit contributions
aver-0.024
 pipe-0.022
 afraid-0.020
 suppose-0.020
 worried-0.018
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
aver+0.016
 afraid+0.015
 pipe+0.015
 scared+0.015
 worried+0.015
Neg logit contributions
 av-0.024
cakes-0.024
 novels-0.023
 vows-0.022
der-0.022
Token kitchen
Token ID9592
Feature activation-8.31e-01

Pos logit contributions
aver+0.040
 pipe+0.034
 suppose+0.032
 afraid+0.032
 bet+0.029
Neg logit contributions
 novels-0.055
 av-0.054
 broccoli-0.048
 Sunny-0.047
 brace-0.047
Token and
Token ID290
Feature activation+3.90e-01

Pos logit contributions
 av+0.038
 novels+0.038
Bon+0.034
 Ivy+0.034
ados+0.033
Neg logit contributions
aver-0.024
 pipe-0.024
 afraid-0.022
 worried-0.022
 scared-0.021
Token called
Token ID1444
Feature activation+4.01e+00

Pos logit contributions
aver+0.012
 pipe+0.010
 afraid+0.010
 suppose+0.009
 worried+0.009
Neg logit contributions
 novels-0.019
 av-0.018
 broccoli-0.016
ados-0.016
 vows-0.016
Token her
Token ID607
Feature activation-8.53e-01

Pos logit contributions
aver+0.128
 pipe+0.127
 suppose+0.124
 afraid+0.123
 worried+0.113
Neg logit contributions
 av-0.177
 novels-0.164
 brace-0.149
cakes-0.147
 cand-0.146
Token.
Token ID13
Feature activation-5.59e-02

Pos logit contributions
 av+0.039
 novels+0.037
ados+0.034
 broccoli+0.034
 brace+0.033
Neg logit contributions
aver-0.028
 pipe-0.025
 afraid-0.024
 suppose-0.023
 worried-0.022
Token "
Token ID366
Feature activation+2.06e-01

Pos logit contributions
 av+0.002
 novels+0.002
 broccoli+0.002
 brace+0.002
ados+0.002
Neg logit contributions
aver-0.002
 pipe-0.002
 afraid-0.002
 suppose-0.002
 worried-0.002
TokenMom
Token ID29252
Feature activation+1.52e+00

Pos logit contributions
aver+0.007
 pipe+0.006
 afraid+0.006
 suppose+0.006
 bet+0.005
Neg logit contributions
 av-0.009
 novels-0.009
 broccoli-0.008
ados-0.008
 brace-0.008
Token,
Token ID11
Feature activation+6.74e-01

Pos logit contributions
aver+0.064
 pipe+0.058
 suppose+0.054
 afraid+0.053
 bet+0.052
Neg logit contributions
 av-0.052
 novels-0.052
 Sunny-0.044
 broccoli-0.044
 brace-0.043
Token good
Token ID922
Feature activation-1.94e+00

Pos logit contributions
 novels+0.096
 av+0.093
 brace+0.085
cakes+0.085
ados+0.082
Neg logit contributions
aver-0.083
 pipe-0.079
 afraid-0.071
 suppose-0.069
 warned-0.069
Tokennight
Token ID3847
Feature activation-1.23e+00

Pos logit contributions
 av+0.072
 novels+0.072
 brace+0.063
 Ivy+0.063
 Elsa+0.062
Neg logit contributions
aver-0.074
 pipe-0.068
 afraid-0.065
 suppose-0.062
 worried-0.059
Token hugs
Token ID40657
Feature activation-1.67e+00

Pos logit contributions
 av+0.056
 novels+0.056
 brace+0.050
ados+0.049
 Ivy+0.049
Neg logit contributions
aver-0.038
 pipe-0.036
 afraid-0.031
 warned-0.030
 scared-0.029
Token and
Token ID290
Feature activation+3.72e-01

Pos logit contributions
 av+0.089
 novels+0.085
 brace+0.080
 Ivy+0.077
 cere+0.077
Neg logit contributions
 pipe-0.036
aver-0.034
 warned-0.030
 engine-0.029
 said-0.027
Token the
Token ID262
Feature activation-5.30e-01

Pos logit contributions
aver+0.011
 pipe+0.011
 afraid+0.010
 warned+0.010
 suppose+0.010
Neg logit contributions
 novels-0.017
 av-0.017
ados-0.015
 brace-0.014
 Ivy-0.014
Token goat
Token ID26917
Feature activation+3.70e+00

Pos logit contributions
 av+0.024
 novels+0.023
 broccoli+0.022
 brace+0.021
 sweets+0.021
Neg logit contributions
aver-0.017
 suppose-0.014
 pipe-0.013
 afraid-0.013
 bet-0.013
Token would
Token ID561
Feature activation+1.24e-01

Pos logit contributions
aver+0.109
 pipe+0.094
 afraid+0.092
 warned+0.087
 scared+0.086
Neg logit contributions
 av-0.179
 novels-0.176
ados-0.161
 brace-0.153
 broccoli-0.153
Token sn
Token ID3013
Feature activation-6.06e-01

Pos logit contributions
aver+0.003
 know+0.003
 think+0.002
 suppose+0.002
 pipe+0.002
Neg logit contributions
 novels-0.007
 av-0.007
ados-0.007
cakes-0.006
 broccoli-0.006
Tokenuggle
Token ID33498
Feature activation-3.06e+00

Pos logit contributions
 novels+0.023
 av+0.022
 brace+0.020
 broccoli+0.019
 Elsa+0.018
Neg logit contributions
aver-0.026
 pipe-0.023
aking-0.021
 suppose-0.021
avers-0.020
Token close
Token ID1969
Feature activation+9.74e-01

Pos logit contributions
 novels+0.145
 av+0.143
ados+0.132
 broccoli+0.121
pill+0.119
Neg logit contributions
aver-0.089
 pipe-0.080
 worried-0.078
b-0.078
 afraid-0.078
Token,
Token ID11
Feature activation+2.96e-01

Pos logit contributions
aver+0.025
 pipe+0.025
 afraid+0.023
 seen+0.022
 warned+0.021
Neg logit contributions
 novels-0.046
 av-0.045
 brace-0.041
ados-0.041
 Ivy-0.040
TokenShe
Token ID3347
Feature activation-1.77e+00

Pos logit contributions
 av+0.015
 novels+0.015
ados+0.013
 vows+0.013
 brace+0.013
Neg logit contributions
aver-0.015
 pipe-0.013
 afraid-0.013
 worried-0.012
 suppose-0.012
Token went
Token ID1816
Feature activation-6.57e-01

Pos logit contributions
 av+0.086
 novels+0.086
 brace+0.077
 broccoli+0.077
 Sunny+0.076
Neg logit contributions
aver-0.049
 pipe-0.043
 afraid-0.039
 suppose-0.038
 bet-0.034
Token to
Token ID284
Feature activation+2.19e-01

Pos logit contributions
 novels+0.023
 av+0.023
 broccoli+0.020
 Sunny+0.020
 brace+0.020
Neg logit contributions
aver-0.027
 pipe-0.025
 afraid-0.023
 suppose-0.023
 bet-0.021
Token the
Token ID262
Feature activation+3.33e-01

Pos logit contributions
aver+0.009
 pipe+0.009
 afraid+0.008
 suppose+0.008
 bet+0.007
Neg logit contributions
 novels-0.008
 av-0.007
 brace-0.006
 broccoli-0.006
ados-0.006
Token man
Token ID582
Feature activation+1.95e+00

Pos logit contributions
aver+0.012
 pipe+0.012
 afraid+0.011
 suppose+0.010
 worried+0.010
Neg logit contributions
 novels-0.013
 av-0.013
 broccoli-0.011
 brace-0.011
ados-0.011
Token who
Token ID508
Feature activation+3.87e+00

Pos logit contributions
aver+0.077
 pipe+0.070
 afraid+0.065
 suppose+0.064
 bet+0.060
Neg logit contributions
 novels-0.073
 av-0.072
 broccoli-0.063
 brace-0.062
 Sunny-0.061
Token was
Token ID373
Feature activation+2.05e-03

Pos logit contributions
aver+0.097
 pipe+0.083
 afraid+0.078
 warned+0.074
 suppose+0.074
Neg logit contributions
 novels-0.202
 av-0.194
ados-0.184
 broccoli-0.180
 vows-0.178
Token selling
Token ID6301
Feature activation-1.90e+00

Pos logit contributions
aver+0.000
 pipe+0.000
 afraid+0.000
 suppose+0.000
 bet+0.000
Neg logit contributions
 novels-0.000
 av-0.000
 broccoli-0.000
 brace-0.000
 Sunny-0.000
Token it
Token ID340
Feature activation-1.48e+00

Pos logit contributions
Bon+0.074
 vows+0.074
 novels+0.073
edly+0.072
 Elsa+0.071
Neg logit contributions
 pipe-0.059
aver-0.056
 warned-0.054
 afraid-0.053
 scared-0.052
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
 novels+0.067
 vows+0.063
 av+0.062
Elsa+0.061
Bon+0.061
Neg logit contributions
 pipe-0.043
aver-0.041
 afraid-0.038
 warned-0.038
 scared-0.036
Token but
Token ID475
Feature activation+1.44e+00

Pos logit contributions
 novels+0.049
 av+0.049
 broccoli+0.044
 brace+0.043
 Sunny+0.043
Neg logit contributions
aver-0.029
 pipe-0.026
 afraid-0.023
 suppose-0.023
 bet-0.020
TokenHe
Token ID1544
Feature activation+4.41e-02

Pos logit contributions
aver+0.001
 pipe+0.001
 afraid+0.001
 suppose+0.000
 bet+0.000
Neg logit contributions
 novels-0.001
 av-0.001
 broccoli-0.001
 Sunny-0.001
 brace-0.001
Token loved
Token ID6151
Feature activation-2.42e-01

Pos logit contributions
aver+0.001
 pipe+0.001
 suppose+0.001
 warned+0.001
 afraid+0.001
Neg logit contributions
 novels-0.002
 av-0.002
ados-0.002
 broccoli-0.002
 brace-0.002
Token playing
Token ID2712
Feature activation+5.81e-01

Pos logit contributions
 novels+0.010
 av+0.010
 vows+0.009
 Ivy+0.009
Bon+0.009
Neg logit contributions
aver-0.008
 pipe-0.008
 afraid-0.007
 suppose-0.007
 warned-0.007
Token with
Token ID351
Feature activation-9.04e-02

Pos logit contributions
aver+0.020
 pipe+0.020
 afraid+0.016
 scared+0.016
avers+0.015
Neg logit contributions
 novels-0.024
 vows-0.024
 av-0.024
 Ivy-0.022
 Elsa-0.022
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
 novels+0.003
 av+0.003
 vows+0.003
edly+0.003
 Ivy+0.003
Neg logit contributions
aver-0.003
 pipe-0.003
 afraid-0.003
 suppose-0.003
 scared-0.003
Token black
Token ID2042
Feature activation+3.89e+00

Pos logit contributions
aver+0.007
 suppose+0.006
 pipe+0.006
 bet+0.006
b+0.006
Neg logit contributions
 novels-0.008
 av-0.008
 broccoli-0.007
 brace-0.007
 Sunny-0.007
Token cat
Token ID3797
Feature activation+1.48e+00

Pos logit contributions
aver+0.103
 pipe+0.094
 afraid+0.075
 suppose+0.068
avers+0.068
Neg logit contributions
 av-0.199
 novels-0.198
 brace-0.179
 Ivy-0.178
 Sunny-0.177
Token in
Token ID287
Feature activation+3.60e-01

Pos logit contributions
 pipe+0.047
aver+0.047
 afraid+0.042
 scared+0.039
 warned+0.038
Neg logit contributions
 novels-0.063
 av-0.063
 vows-0.059
 Ivy-0.056
Rober-0.055
Token his
Token ID465
Feature activation+1.47e+00

Pos logit contributions
aver+0.015
 pipe+0.014
 afraid+0.013
 suppose+0.012
 bet+0.012
Neg logit contributions
 av-0.013
 novels-0.013
 vows-0.011
 brace-0.011
 broccoli-0.011
Token backyard
Token ID24296
Feature activation+1.11e+00

Pos logit contributions
aver+0.042
 pipe+0.036
 suppose+0.034
 afraid+0.033
 bet+0.030
Neg logit contributions
 novels-0.071
 av-0.070
 broccoli-0.064
 brace-0.062
 Sunny-0.062
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
aver+0.040
 pipe+0.038
 afraid+0.034
 suppose+0.033
 warned+0.032
Neg logit contributions
 novels-0.045
 av-0.044
 Ivy-0.041
 Elsa-0.040
 brace-0.039
Token and
Token ID290
Feature activation-4.47e-02

Pos logit contributions
 novels+0.005
 av+0.005
 brace+0.005
 broccoli+0.005
 Sunny+0.005
Neg logit contributions
aver-0.004
 pipe-0.003
 afraid-0.003
 suppose-0.003
avers-0.003
Token moved
Token ID3888
Feature activation+1.34e+00

Pos logit contributions
 novels+0.002
 av+0.002
 broccoli+0.002
ados+0.002
 vows+0.001
Neg logit contributions
aver-0.002
 pipe-0.001
 afraid-0.001
 suppose-0.001
 scared-0.001
Token together
Token ID1978
Feature activation+7.00e-01

Pos logit contributions
aver+0.044
 pipe+0.041
 afraid+0.040
 scared+0.037
 suppose+0.037
Neg logit contributions
 av-0.060
 novels-0.059
 broccoli-0.052
ados-0.051
aul-0.051
Token in
Token ID287
Feature activation+5.15e-01

Pos logit contributions
 pipe+0.020
aver+0.019
 afraid+0.019
 shouting+0.018
 scared+0.018
Neg logit contributions
 av-0.033
 novels-0.033
ados-0.029
 cand-0.028
 broccoli-0.028
Token different
Token ID1180
Feature activation+3.00e+00

Pos logit contributions
aver+0.018
 pipe+0.018
 afraid+0.017
 suppose+0.016
 bet+0.015
Neg logit contributions
 av-0.021
 novels-0.020
ados-0.018
cakes-0.018
 broccoli-0.018
Token directions
Token ID11678
Feature activation+3.66e+00

Pos logit contributions
aver+0.085
 pipe+0.077
 afraid+0.069
 suppose+0.067
 bet+0.062
Neg logit contributions
 av-0.144
 novels-0.144
 broccoli-0.129
 Sunny-0.128
ados-0.127
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
aver+0.092
 pipe+0.089
 afraid+0.087
 scared+0.079
 shouting+0.078
Neg logit contributions
 av-0.182
 novels-0.172
 broccoli-0.159
 cand-0.153
ados-0.152
Token
Token ID220
Feature activation+2.06e+00

Pos logit contributions
aver+0.001
 pipe+0.000
 afraid+0.000
 suppose+0.000
 bet+0.000
Neg logit contributions
 novels-0.001
 av-0.001
 broccoli-0.001
 brace-0.001
ados-0.001
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
aver+0.057
 pipe+0.049
 suppose+0.046
 afraid+0.046
 bet+0.041
Neg logit contributions
 novels-0.101
 av-0.101
 broccoli-0.091
 brace-0.090
 vows-0.089
Token
Token ID198
Feature activation+2.61e-02

Pos logit contributions
aver+0.006
 pipe+0.005
 afraid+0.005
 suppose+0.004
 claw+0.004
Neg logit contributions
 novels-0.006
 av-0.006
 Sunny-0.005
 Ivy-0.005
 broccoli-0.005
TokenWhen
Token ID2215
Feature activation-2.40e-01

Pos logit contributions
aver+0.001
 pipe+0.001
 afraid+0.001
 suppose+0.001
 bet+0.001
Neg logit contributions
 novels-0.001
 av-0.001
 broccoli-0.001
 brace-0.001
 Sunny-0.001
Token that
Token ID326
Feature activation+2.46e+00

Pos logit contributions
 novels+0.136
 av+0.135
 broccoli+0.116
 brace+0.114
 Sunny+0.113
Neg logit contributions
aver-0.158
 pipe-0.144
 afraid-0.135
 suppose-0.133
 bet-0.125
Token day
Token ID1110
Feature activation+8.54e-01

Pos logit contributions
aver+0.072
 pipe+0.056
 afraid+0.054
 suppose+0.053
avers+0.051
Neg logit contributions
 novels-0.121
 av-0.117
 broccoli-0.111
 Elsa-0.106
 Sunny-0.106
Token on
Token ID319
Feature activation-1.45e+00

Pos logit contributions
aver+0.036
 pipe+0.033
 afraid+0.031
 suppose+0.030
 worried+0.029
Neg logit contributions
 av-0.030
 novels-0.030
 broccoli-0.026
 brace-0.026
 Sunny-0.025
Token,
Token ID11
Feature activation-2.32e-01

Pos logit contributions
 novels+0.085
edly+0.085
 av+0.084
der+0.081
 vows+0.081
Neg logit contributions
 pipe-0.022
 warned-0.016
 worried-0.014
aver-0.014
 scared-0.013
Token Lily
Token ID20037
Feature activation-8.67e-01

Pos logit contributions
 novels+0.012
edly+0.012
lor+0.011
 av+0.011
 vows+0.011
Neg logit contributions
 afraid-0.004
 worried-0.004
 scared-0.004
 suppose-0.004
aver-0.004
Token always
Token ID1464
Feature activation+5.01e-01

Pos logit contributions
 novels+0.044
 av+0.044
 Ivy+0.040
 broccoli+0.039
ados+0.039
Neg logit contributions
aver-0.023
 pipe-0.021
 afraid-0.018
 suppose-0.018
 warned-0.017
Token made
Token ID925
Feature activation+1.60e+00

Pos logit contributions
aver+0.017
 pipe+0.015
avers+0.013
 afraid+0.013
 suppose+0.012
Neg logit contributions
 av-0.021
 novels-0.021
 Elsa-0.019
 brace-0.019
 broccoli-0.019
Token sure
Token ID1654
Feature activation+1.46e+00

Pos logit contributions
aver+0.035
 pipe+0.030
 afraid+0.026
 suppose+0.025
 bet+0.022
Neg logit contributions
 novels-0.088
 av-0.087
 broccoli-0.080
 brace-0.079
 Sunny-0.078
Token to
Token ID284
Feature activation-9.01e-01

Pos logit contributions
 pipe+0.034
 afraid+0.033
 warned+0.032
 worried+0.032
 suppose+0.031
Neg logit contributions
 novels-0.070
 av-0.069
Bon-0.068
rily-0.066
Sun-0.065
Token put
Token ID1234
Feature activation+1.60e-01

Pos logit contributions
 novels+0.041
 av+0.040
 broccoli+0.036
 brace+0.035
 Sunny+0.035
Neg logit contributions
aver-0.029
 pipe-0.026
 afraid-0.024
 suppose-0.023
 bet-0.021
Token her
Token ID607
Feature activation-1.08e+00

Pos logit contributions
 pipe+0.005
aver+0.005
b+0.004
 careful+0.004
 afraid+0.004
Neg logit contributions
der-0.007
 av-0.006
 novels-0.006
 vows-0.006
edly-0.006
Token own
Token ID898
Feature activation+2.34e+00

Pos logit contributions
aver+0.006
 suppose+0.005
 afraid+0.005
aking+0.005
iz+0.005
Neg logit contributions
 av-0.006
 novels-0.006
 broccoli-0.006
 sweets-0.005
 brace-0.005
Token gun
Token ID2485
Feature activation+1.86e+00

Pos logit contributions
aver+0.083
 suppose+0.068
 afraid+0.066
 pipe+0.065
aking+0.064
Neg logit contributions
 av-0.098
 novels-0.095
 broccoli-0.088
 brace-0.083
ados-0.082
Token from
Token ID422
Feature activation-2.61e+00

Pos logit contributions
aver+0.056
 pipe+0.055
 afraid+0.050
 scared+0.048
 warned+0.047
Neg logit contributions
 novels-0.085
 av-0.084
 Ivy-0.075
 broccoli-0.074
 Elsa-0.073
Token the
Token ID262
Feature activation+7.15e-01

Pos logit contributions
 av+0.104
 novels+0.104
cakes+0.097
 brace+0.094
 vows+0.093
Neg logit contributions
aver-0.092
 pipe-0.090
 afraid-0.083
 suppose-0.077
 worried-0.075
Token distant
Token ID12899
Feature activation+2.99e+00

Pos logit contributions
aver+0.021
 pipe+0.021
 afraid+0.018
 suppose+0.017
 worried+0.016
Neg logit contributions
 novels-0.034
 av-0.033
 brace-0.030
ados-0.029
 Elsa-0.029
Token hill
Token ID12788
Feature activation+1.68e+00

Pos logit contributions
aver+0.096
 pipe+0.088
 afraid+0.078
 suppose+0.076
 bet+0.071
Neg logit contributions
 novels-0.135
 av-0.133
 broccoli-0.119
 brace-0.118
 Sunny-0.117
Token!
Token ID0
Feature activation+5.41e-02

Pos logit contributions
aver+0.060
 pipe+0.057
 afraid+0.050
 suppose+0.048
 warned+0.048
Neg logit contributions
 novels-0.071
 av-0.068
 broccoli-0.062
 brace-0.062
 Elsa-0.061
Token<|endoftext|>
Token ID50256
Feature activation-1.46e+00

Pos logit contributions
aver+0.002
 pipe+0.002
 afraid+0.002
 suppose+0.002
 worried+0.002
Neg logit contributions
 novels-0.002
 av-0.002
 Sunny-0.002
 broccoli-0.002
ados-0.002
TokenOnce
Token ID7454
Feature activation+5.75e-01

Pos logit contributions
 novels+0.066
 av+0.066
 brace+0.059
 broccoli+0.059
ados+0.058
Neg logit contributions
aver-0.046
 pipe-0.041
 afraid-0.037
 suppose-0.037
 bet-0.034
Token upon
Token ID2402
Feature activation-1.19e+00

Pos logit contributions
aver+0.019
 pipe+0.016
eping+0.014
 afraid+0.014
 suppose+0.014
Neg logit contributions
 novels-0.025
 av-0.025
 Sunny-0.023
 broccoli-0.022
 Elsa-0.022
Token a
Token ID257
Feature activation+5.35e-01

Pos logit contributions
 novels+0.047
 av+0.047
 broccoli+0.041
 brace+0.040
ados+0.040
Neg logit contributions
aver-0.045
 pipe-0.040
 afraid-0.038
 suppose-0.037
 bet-0.034
Token boy
Token ID2933
Feature activation+7.04e-01

Pos logit contributions
aver+0.029
 pipe+0.027
 afraid+0.024
 suppose+0.024
b+0.024
Neg logit contributions
 novels-0.020
 av-0.019
 Sunny-0.017
 broccoli-0.017
 Ivy-0.016
Token does
Token ID857
Feature activation-5.69e-01

Pos logit contributions
aver+0.021
 pipe+0.018
 afraid+0.016
 suppose+0.016
 bet+0.015
Neg logit contributions
 novels-0.033
 av-0.033
 broccoli-0.030
 brace-0.029
 Sunny-0.029
Token not
Token ID407
Feature activation+1.64e+00

Pos logit contributions
 av+0.026
 novels+0.026
ados+0.023
 brace+0.022
 cand+0.022
Neg logit contributions
aver-0.018
 pipe-0.016
 afraid-0.016
 suppose-0.015
 bet-0.014
Token move
Token ID1445
Feature activation+1.17e+00

Pos logit contributions
aver+0.043
 afraid+0.035
 pipe+0.035
 suppose+0.032
 bet+0.030
Neg logit contributions
 novels-0.086
 av-0.085
ados-0.077
 broccoli-0.076
 brace-0.075
Token.
Token ID13
Feature activation+5.05e-01

Pos logit contributions
 afraid+0.038
 scared+0.036
 worried+0.033
 pipe+0.032
aver+0.032
Neg logit contributions
 av-0.053
cakes-0.049
 vows-0.049
 novels-0.049
quet-0.048
Token Tom
Token ID4186
Feature activation+1.13e+00

Pos logit contributions
aver+0.019
 pipe+0.018
 afraid+0.017
 suppose+0.016
 bet+0.015
Neg logit contributions
 av-0.019
 novels-0.019
 broccoli-0.017
 brace-0.017
ados-0.016
Token tries
Token ID8404
Feature activation+2.11e+00

Pos logit contributions
aver+0.032
 afraid+0.028
 pipe+0.028
 suppose+0.026
 warned+0.025
Neg logit contributions
 novels-0.056
 av-0.055
ados-0.050
 broccoli-0.048
 brace-0.048
Token to
Token ID284
Feature activation-8.24e-01

Pos logit contributions
 pipe+0.066
aver+0.065
 afraid+0.063
 suppose+0.060
 warned+0.057
Neg logit contributions
 av-0.093
 novels-0.092
 vows-0.081
ados-0.081
 brace-0.078
Token climb
Token ID12080
Feature activation-9.43e-01

Pos logit contributions
 novels+0.043
 av+0.042
ados+0.038
 broccoli+0.037
 vows+0.036
Neg logit contributions
aver-0.022
 pipe-0.020
 afraid-0.018
 suppose-0.017
 bet-0.016
Token up
Token ID510
Feature activation-1.87e-01

Pos logit contributions
 av+0.046
 vows+0.044
 novels+0.044
quet+0.042
 brace+0.042
Neg logit contributions
 pipe-0.026
aver-0.022
 afraid-0.022
 scared-0.021
 worried-0.019
Token the
Token ID262
Feature activation+3.79e-01

Pos logit contributions
 vows+0.009
 av+0.009
ils+0.009
 novels+0.009
aul+0.009
Neg logit contributions
 pipe-0.005
 scared-0.005
 afraid-0.004
aver-0.004
 worried-0.004
Token forgot
Token ID16453
Feature activation-3.54e-01

Pos logit contributions
 novels+0.061
 av+0.060
ados+0.054
 broccoli+0.054
 Sunny+0.053
Neg logit contributions
aver-0.034
 pipe-0.029
 afraid-0.027
 suppose-0.026
 warned-0.024
Token all
Token ID477
Feature activation+1.47e+00

Pos logit contributions
Elsa+0.017
Bon+0.017
 novels+0.017
avored+0.016
oco+0.016
Neg logit contributions
 afraid-0.008
 worried-0.008
 scared-0.008
 warned-0.008
 pipe-0.008
Token about
Token ID546
Feature activation-7.38e-01

Pos logit contributions
aver+0.062
 pipe+0.057
 afraid+0.054
 suppose+0.052
 bet+0.049
Neg logit contributions
 novels-0.051
 av-0.051
 broccoli-0.044
 brace-0.043
ados-0.043
Token the
Token ID262
Feature activation-9.28e-03

Pos logit contributions
 novels+0.029
Bon+0.028
cakes+0.028
edly+0.028
Sun+0.027
Neg logit contributions
 pipe-0.023
 afraid-0.022
 worried-0.022
 shouting-0.021
 yelling-0.021
Token rude
Token ID22066
Feature activation+2.54e+00

Pos logit contributions
 novels+0.000
 av+0.000
 broccoli+0.000
 Sunny+0.000
ados+0.000
Neg logit contributions
aver-0.000
 pipe-0.000
 afraid-0.000
 suppose-0.000
 worried-0.000
Token boy
Token ID2933
Feature activation+3.05e+00

Pos logit contributions
aver+0.073
 pipe+0.068
 afraid+0.062
 worried+0.057
 warned+0.056
Neg logit contributions
 novels-0.114
 av-0.110
 Sunny-0.103
 Ivy-0.102
Bon-0.102
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
aver+0.096
 pipe+0.089
 afraid+0.088
 warned+0.080
 worried+0.080
Neg logit contributions
 novels-0.133
 av-0.132
ados-0.118
 vows-0.118
 broccoli-0.116
Token She
Token ID1375
Feature activation-1.35e+00

Pos logit contributions
 novels+0.009
 av+0.009
 brace+0.008
 broccoli+0.008
ados+0.008
Neg logit contributions
aver-0.007
 pipe-0.006
 afraid-0.006
 suppose-0.006
 bet-0.005
Token was
Token ID373
Feature activation+5.92e-01

Pos logit contributions
 novels+0.075
 av+0.072
ados+0.070
 broccoli+0.065
 Sunny+0.064
Neg logit contributions
aver-0.033
 pipe-0.026
 warned-0.025
 afraid-0.024
 suppose-0.024
Token excited
Token ID6568
Feature activation+6.13e-01

Pos logit contributions
aver+0.019
 pipe+0.017
 suppose+0.015
 afraid+0.015
 bet+0.014
Neg logit contributions
 novels-0.026
 av-0.026
 broccoli-0.023
 Sunny-0.023
 brace-0.023
Token to
Token ID284
Feature activation-4.83e-01

Pos logit contributions
aver+0.020
 pipe+0.019
 afraid+0.018
 suppose+0.017
 warned+0.017
Neg logit contributions
 novels-0.026
 av-0.026
 vows-0.022
 Ivy-0.022
 Sunny-0.022
Token't
Token ID470
Feature activation+2.18e+00

Pos logit contributions
 novels+0.087
 av+0.087
 brace+0.079
 broccoli+0.078
 cand+0.076
Neg logit contributions
aver-0.049
 pipe-0.042
 suppose-0.040
 afraid-0.038
 warned-0.036
Token worry
Token ID5490
Feature activation+1.84e+00

Pos logit contributions
aver+0.053
 pipe+0.041
 suppose+0.036
 afraid+0.036
avers+0.034
Neg logit contributions
 av-0.116
 novels-0.116
 broccoli-0.106
 Sunny-0.106
 brace-0.105
Token,
Token ID11
Feature activation-2.98e-01

Pos logit contributions
aver+0.044
 pipe+0.043
 afraid+0.039
 scared+0.035
 worried+0.033
Neg logit contributions
 av-0.091
 novels-0.087
edly-0.082
 brace-0.082
 vows-0.080
Token Tim
Token ID5045
Feature activation+2.11e+00

Pos logit contributions
 novels+0.013
 av+0.013
 brace+0.011
 vows+0.011
ados+0.011
Neg logit contributions
aver-0.010
 pipe-0.009
 afraid-0.008
 suppose-0.008
 warned-0.008
Tokenmy
Token ID1820
Feature activation+1.13e+00

Pos logit contributions
aver+0.062
 pipe+0.055
 afraid+0.050
 suppose+0.050
 bet+0.046
Neg logit contributions
 av-0.098
 novels-0.097
 broccoli-0.087
 brace-0.086
 Sunny-0.086
Token.
Token ID13
Feature activation+3.14e-01

Pos logit contributions
aver+0.046
 pipe+0.043
 afraid+0.041
 suppose+0.039
 worried+0.037
Neg logit contributions
 novels-0.042
 av-0.041
 broccoli-0.035
 brace-0.035
 Ivy-0.034
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
aver+0.013
 pipe+0.012
 afraid+0.011
 suppose+0.011
b+0.010
Neg logit contributions
 novels-0.011
 av-0.011
 Sunny-0.010
 broccoli-0.009
 Elsa-0.009
Token will
Token ID481
Feature activation-1.28e+00

Pos logit contributions
aver+0.034
 pipe+0.030
 afraid+0.027
 suppose+0.027
avers+0.025
Neg logit contributions
 novels-0.043
 av-0.043
 broccoli-0.038
 brace-0.038
 Sunny-0.038
Token be
Token ID307
Feature activation+2.08e-01

Pos logit contributions
 novels+0.068
 av+0.067
ados+0.060
 broccoli+0.060
 brace+0.059
Neg logit contributions
aver-0.032
 pipe-0.029
 afraid-0.026
 suppose-0.025
 bet-0.023
Token fun
Token ID1257
Feature activation-1.06e+00

Pos logit contributions
aver+0.007
 pipe+0.006
 afraid+0.006
 suppose+0.006
 bet+0.005
Neg logit contributions
 novels-0.009
 av-0.009
 broccoli-0.008
 brace-0.008
 Sunny-0.008
Token."
Token ID526
Feature activation-6.61e-01

Pos logit contributions
 av+0.042
 novels+0.041
 brace+0.036
 broccoli+0.036
ados+0.036
Neg logit contributions
aver-0.039
 pipe-0.036
 afraid-0.035
 suppose-0.033
 bet-0.032
Tokenir
Token ID343
Feature activation-6.41e-01

Pos logit contributions
 novels+0.037
 av+0.037
 broccoli+0.033
 Sunny+0.032
 brace+0.032
Neg logit contributions
aver-0.029
 pipe-0.026
 afraid-0.024
 suppose-0.023
 bet-0.022
Tokenped
Token ID9124
Feature activation+9.80e-01

Pos logit contributions
 novels+0.030
 av+0.030
 broccoli+0.027
 brace+0.027
 Sunny+0.026
Neg logit contributions
aver-0.019
 pipe-0.017
 afraid-0.015
 suppose-0.015
 bet-0.013
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
aver+0.033
 pipe+0.029
 afraid+0.028
 scared+0.027
 worried+0.026
Neg logit contributions
 novels-0.043
 av-0.041
cakes-0.037
 cand-0.037
 desserts-0.036
Token thank
Token ID5875
Feature activation-3.71e+00

Pos logit contributions
 novels+0.057
 av+0.057
 broccoli+0.050
 brace+0.050
ados+0.050
Neg logit contributions
aver-0.042
 pipe-0.038
 afraid-0.035
 suppose-0.034
 bet-0.031
Token you
Token ID345
Feature activation-9.92e-01

Pos logit contributions
 novels+0.162
 av+0.152
cakes+0.139
 cere+0.138
 broccoli+0.137
Neg logit contributions
aver-0.123
 pipe-0.113
 afraid-0.103
b-0.102
 warned-0.098
Token and
Token ID290
Feature activation-8.76e-01

Pos logit contributions
 novels+0.043
 av+0.042
 brace+0.038
ados+0.037
 cand+0.037
Neg logit contributions
aver-0.031
 afraid-0.030
 pipe-0.029
 suppose-0.028
 bet-0.027
Token flew
Token ID13112
Feature activation+1.41e+00

Pos logit contributions
 novels+0.047
 av+0.043
ados+0.041
 broccoli+0.040
Bon+0.039
Neg logit contributions
aver-0.022
 pipe-0.019
 afraid-0.018
 warned-0.017
 scared-0.017
Token off
Token ID572
Feature activation-1.81e+00

Pos logit contributions
aver+0.046
 pipe+0.039
 afraid+0.038
b+0.035
 scared+0.035
Neg logit contributions
 novels-0.065
 av-0.063
 cand-0.056
 broccoli-0.055
ados-0.055
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
 novels+0.085
 av+0.079
 cand+0.074
ados+0.074
aul+0.074
Neg logit contributions
 afraid-0.048
aver-0.047
 scared-0.047
 pipe-0.047
 shouting-0.044
Token back
Token ID736
Feature activation-2.00e+00

Pos logit contributions
 novels+0.008
 av+0.007
 broccoli+0.007
ados+0.006
 brace+0.006
Neg logit contributions
aver-0.006
 pipe-0.005
 afraid-0.005
 suppose-0.004
 worried-0.004
Token into
Token ID656
Feature activation-3.68e-01

Pos logit contributions
 novels+0.086
 av+0.086
ados+0.076
 broccoli+0.076
 brace+0.075
Neg logit contributions
aver-0.064
 pipe-0.061
 afraid-0.059
 suppose-0.054
 scared-0.053
Token home
Token ID1363
Feature activation-5.77e-01

Pos logit contributions
aver+0.012
 pipe+0.012
 afraid+0.012
 scared+0.012
 worried+0.011
Neg logit contributions
 av-0.023
 novels-0.022
ados-0.020
 vows-0.020
 broccoli-0.020
Token but
Token ID475
Feature activation+1.31e+00

Pos logit contributions
 novels+0.024
 av+0.024
 broccoli+0.021
ados+0.021
 vows+0.020
Neg logit contributions
aver-0.020
 pipe-0.019
 afraid-0.019
 suppose-0.017
 worried-0.017
Token she
Token ID673
Feature activation-2.05e+00

Pos logit contributions
aver+0.061
 pipe+0.057
 afraid+0.056
 suppose+0.054
 worried+0.052
Neg logit contributions
 novels-0.040
 av-0.040
ados-0.032
 broccoli-0.032
 brace-0.032
Token got
Token ID1392
Feature activation-4.68e-01

Pos logit contributions
 novels+0.087
 av+0.087
 broccoli+0.079
 Elsa+0.078
 Sunny+0.078
Neg logit contributions
aver-0.065
 pipe-0.060
 suppose-0.052
 afraid-0.051
avers-0.050
Token lost
Token ID2626
Feature activation+2.14e+00

Pos logit contributions
 av+0.023
 novels+0.023
 broccoli+0.021
 Elsa+0.020
 Sunny+0.020
Neg logit contributions
aver-0.013
 pipe-0.011
 suppose-0.010
 afraid-0.009
 bet-0.009
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 afraid+0.050
 scared+0.050
 pipe+0.049
 worried+0.048
aver+0.045
Neg logit contributions
 vows-0.117
cakes-0.112
 av-0.108
 novels-0.108
mm-0.105
Token She
Token ID1375
Feature activation-1.80e+00

Pos logit contributions
 av+0.031
 novels+0.031
 brace+0.026
 broccoli+0.026
 Sunny+0.025
Neg logit contributions
aver-0.045
 pipe-0.043
 afraid-0.041
 suppose-0.040
 worried-0.038
Token saw
Token ID2497
Feature activation+9.92e-01

Pos logit contributions
 novels+0.077
 av+0.075
 Elsa+0.070
 brace+0.069
 broccoli+0.068
Neg logit contributions
aver-0.059
 pipe-0.053
 suppose-0.047
aking-0.046
avers-0.045
Token a
Token ID257
Feature activation-1.93e-01

Pos logit contributions
aver+0.046
 pipe+0.042
 afraid+0.040
 suppose+0.040
 bet+0.038
Neg logit contributions
 novels-0.031
 av-0.030
 broccoli-0.026
 brace-0.025
 Sunny-0.025
Token big
Token ID1263
Feature activation-5.14e-01

Pos logit contributions
 av+0.008
 novels+0.008
 broccoli+0.008
 brace+0.007
 Sunny+0.007
Neg logit contributions
aver-0.006
 pipe-0.005
 suppose-0.005
 afraid-0.005
 bet-0.005
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 novels+0.023
 av+0.023
 brace+0.020
 broccoli+0.020
 vows+0.020
Neg logit contributions
aver-0.016
 pipe-0.015
 afraid-0.014
 suppose-0.013
 worried-0.012
Token he
Token ID339
Feature activation-2.21e-02

Pos logit contributions
 novels+0.019
 av+0.018
 brace+0.017
 vows+0.016
ados+0.016
Neg logit contributions
aver-0.018
 pipe-0.017
 afraid-0.016
 suppose-0.015
 worried-0.015
Token thought
Token ID1807
Feature activation-1.82e-01

Pos logit contributions
 av+0.001
 novels+0.001
 brace+0.001
 broccoli+0.001
 Sunny+0.001
Neg logit contributions
aver-0.000
 pipe-0.000
 suppose-0.000
 afraid-0.000
avers-0.000
Token about
Token ID546
Feature activation-2.49e+00

Pos logit contributions
 av+0.008
 novels+0.008
ados+0.008
 Ivy+0.008
Bon+0.007
Neg logit contributions
aver-0.006
 pipe-0.005
 afraid-0.005
 suppose-0.005
 worried-0.004
Token what
Token ID644
Feature activation-2.33e+00

Pos logit contributions
 novels+0.099
 av+0.098
cakes+0.090
 vows+0.089
 Elsa+0.088
Neg logit contributions
aver-0.084
 pipe-0.083
 afraid-0.077
 suppose-0.076
 warned-0.070
Token type
Token ID2099
Feature activation+4.65e-01

Pos logit contributions
 novels+0.090
 av+0.089
ados+0.083
 vows+0.082
 brace+0.082
Neg logit contributions
aver-0.085
 pipe-0.081
 afraid-0.075
 suppose-0.072
 warned-0.070
Token of
Token ID286
Feature activation-2.32e+00

Pos logit contributions
aver+0.014
 pipe+0.013
 afraid+0.011
 warned+0.010
 suppose+0.010
Neg logit contributions
 novels-0.022
 av-0.021
 brace-0.020
 Elsa-0.020
 vows-0.020
Token nut
Token ID6701
Feature activation-3.44e-01

Pos logit contributions
 novels+0.072
 vows+0.070
 Elsa+0.069
cakes+0.068
 av+0.066
Neg logit contributions
 pipe-0.093
aver-0.091
 afraid-0.091
 scared-0.086
 warned-0.085
Token he
Token ID339
Feature activation-1.09e+00

Pos logit contributions
 novels+0.016
 av+0.014
 vows+0.014
Bon+0.014
Elsa+0.014
Neg logit contributions
 pipe-0.010
aver-0.010
 afraid-0.009
b-0.009
 warned-0.009
Token had
Token ID550
Feature activation-2.66e-01

Pos logit contributions
 novels+0.053
 av+0.053
 broccoli+0.047
 brace+0.047
 Sunny+0.046
Neg logit contributions
aver-0.031
 pipe-0.027
 afraid-0.024
 suppose-0.024
avers-0.022
Token found
Token ID1043
Feature activation-1.85e-01

Pos logit contributions
 novels+0.012
 av+0.012
 broccoli+0.011
 brace+0.011
 Sunny+0.011
Neg logit contributions
aver-0.008
 pipe-0.007
 afraid-0.006
 suppose-0.006
 bet-0.006
Token.
Token ID13
Feature activation-5.31e-01

Pos logit contributions
 novels+0.008
 vows+0.008
 Ivy+0.008
 av+0.008
 Elsa+0.008
Neg logit contributions
aver-0.005
 pipe-0.005
 seen-0.005
 afraid-0.005
 warned-0.005
Token.
Token ID13
Feature activation-6.68e-01

Pos logit contributions
 novels+0.234
Rober+0.219
 Merry+0.216
 av+0.216
 vows+0.215
Neg logit contributions
aver-0.125
 pipe-0.120
 called-0.102
 warned-0.096
 said-0.091
Token She
Token ID1375
Feature activation-1.81e+00

Pos logit contributions
 av+0.025
 novels+0.025
 brace+0.022
ados+0.021
 broccoli+0.021
Neg logit contributions
aver-0.025
 pipe-0.023
 afraid-0.022
 suppose-0.022
 worried-0.020
Token was
Token ID373
Feature activation+1.10e-01

Pos logit contributions
 novels+0.100
 av+0.098
ados+0.090
 broccoli+0.089
 brace+0.088
Neg logit contributions
aver-0.041
 pipe-0.034
 afraid-0.031
 suppose-0.029
 warned-0.027
Token feeling
Token ID4203
Feature activation-1.55e+00

Pos logit contributions
aver+0.004
 pipe+0.004
 bet+0.004
 suppose+0.004
inter+0.003
Neg logit contributions
 novels-0.004
 av-0.004
 broccoli-0.004
 Sunny-0.004
 Elsa-0.003
Token happy
Token ID3772
Feature activation-1.58e+00

Pos logit contributions
 novels+0.087
 av+0.084
 vows+0.082
 brace+0.080
ados+0.080
Neg logit contributions
 afraid-0.032
aver-0.031
 worried-0.029
 scared-0.028
 frightened-0.026
Token and
Token ID290
Feature activation-1.63e+00

Pos logit contributions
 av+0.070
 novels+0.069
 vows+0.062
 brace+0.062
 Ivy+0.061
Neg logit contributions
aver-0.051
 pipe-0.048
 afraid-0.046
 worried-0.042
 suppose-0.042
Token relaxed
Token ID18397
Feature activation-1.05e+00

Pos logit contributions
 novels+0.075
 av+0.075
 broccoli+0.068
 brace+0.067
 Sunny+0.066
Neg logit contributions
aver-0.050
 pipe-0.044
 suppose-0.039
 afraid-0.039
 bet-0.036
Token.
Token ID13
Feature activation-6.66e-01

Pos logit contributions
 novels+0.053
 av+0.052
 brace+0.050
 vows+0.049
ados+0.048
Neg logit contributions
aver-0.026
 afraid-0.024
 worried-0.024
 scared-0.024
 pipe-0.023
Token
Token ID198
Feature activation-3.19e-01

Pos logit contributions
 novels+0.029
 av+0.028
 broccoli+0.025
 Sunny+0.025
 Elsa+0.025
Neg logit contributions
aver-0.024
 pipe-0.021
 afraid-0.019
 suppose-0.019
avers-0.017
Token
Token ID198
Feature activation-4.54e-01

Pos logit contributions
 novels+0.010
 av+0.008
 Sunny+0.008
 Elsa+0.008
 Ivy+0.008
Neg logit contributions
aver-0.015
 pipe-0.014
 claw-0.013
 kidding-0.013
 afraid-0.013
TokenThen
Token ID6423
Feature activation+8.82e-01

Pos logit contributions
 novels+0.020
 av+0.020
 broccoli+0.018
 brace+0.018
 Sunny+0.017
Neg logit contributions
aver-0.015
 pipe-0.013
 afraid-0.012
 suppose-0.012
 bet-0.011
Token close
Token ID1969
Feature activation+9.30e-02

Pos logit contributions
 av+0.048
 novels+0.047
ados+0.043
 broccoli+0.042
 Ivy+0.042
Neg logit contributions
aver-0.031
 pipe-0.028
 afraid-0.025
 suppose-0.025
 bet-0.023
Token her
Token ID607
Feature activation-4.52e-01

Pos logit contributions
 pipe+0.003
 afraid+0.003
 worried+0.003
 scared+0.003
aver+0.003
Neg logit contributions
 av-0.003
Bon-0.003
 novels-0.003
cakes-0.003
quet-0.003
Token eyes
Token ID2951
Feature activation-1.09e+00

Pos logit contributions
 av+0.021
 Ivy+0.020
 novels+0.020
 Midnight+0.019
Sun+0.019
Neg logit contributions
 pipe-0.013
aver-0.011
 afraid-0.010
 eye-0.010
 claw-0.010
Token and
Token ID290
Feature activation-6.15e-01

Pos logit contributions
 novels+0.052
 av+0.051
 Ivy+0.046
ados+0.046
 vows+0.045
Neg logit contributions
 pipe-0.029
aver-0.029
 afraid-0.026
 scared-0.024
 worried-0.024
Token imagine
Token ID5967
Feature activation-6.57e-01

Pos logit contributions
 novels+0.030
 av+0.030
ados+0.027
 broccoli+0.027
 Ivy+0.026
Neg logit contributions
aver-0.017
 pipe-0.015
 afraid-0.014
 suppose-0.013
 bet-0.013
Token a
Token ID257
Feature activation-9.80e-01

Pos logit contributions
 av+0.028
Bon+0.027
 Ivy+0.026
Elsa+0.026
aul+0.026
Neg logit contributions
 afraid-0.020
 scared-0.019
 pipe-0.018
 worried-0.018
aver-0.017
Token palace
Token ID20562
Feature activation-3.77e+00

Pos logit contributions
 av+0.037
 novels+0.037
ados+0.033
 broccoli+0.031
 Ivy+0.031
Neg logit contributions
aver-0.038
 pipe-0.036
 afraid-0.034
 suppose-0.032
 bet-0.031
Token made
Token ID925
Feature activation+1.28e+00

Pos logit contributions
 av+0.177
 Ivy+0.173
Bon+0.167
 novels+0.167
 Midnight+0.167
Neg logit contributions
aver-0.100
 pipe-0.098
 afraid-0.095
 scared-0.090
 called-0.085
Token out
Token ID503
Feature activation+1.83e-01

Pos logit contributions
aver+0.036
 afraid+0.033
 pipe+0.033
 scared+0.031
 worried+0.029
Neg logit contributions
 av-0.058
 novels-0.057
ados-0.053
 Ivy-0.052
Bon-0.052
Token of
Token ID286
Feature activation-1.22e+00

Pos logit contributions
 afraid+0.005
aver+0.005
 pipe+0.005
 scared+0.005
 worried+0.005
Neg logit contributions
 av-0.008
 Ivy-0.008
 novels-0.008
ados-0.007
 Midnight-0.007
Token clouds
Token ID15114
Feature activation+9.14e-01

Pos logit contributions
cakes+0.048
cake+0.046
edly+0.046
Bon+0.046
 av+0.045
Neg logit contributions
 pipe-0.039
 afraid-0.039
 scared-0.036
 worried-0.036
aver-0.034
Token wanted
Token ID2227
Feature activation+2.06e+00

Pos logit contributions
aver+0.007
 pipe+0.006
 warned+0.005
 thought+0.005
 suppose+0.005
Neg logit contributions
 novels-0.015
 av-0.015
ados-0.014
 Elsa-0.013
 brace-0.013
Token to
Token ID284
Feature activation-5.25e-01

Pos logit contributions
 pipe+0.057
aver+0.054
 warned+0.053
 afraid+0.052
 suppose+0.052
Neg logit contributions
 novels-0.094
 av-0.090
 brace-0.086
 vows-0.084
ados-0.084
Token make
Token ID787
Feature activation+5.50e-01

Pos logit contributions
 novels+0.029
 av+0.027
ados+0.026
 Elsa+0.025
edly+0.025
Neg logit contributions
aver-0.011
 pipe-0.011
 know-0.010
 suppose-0.009
 think-0.009
Token a
Token ID257
Feature activation+2.53e-01

Pos logit contributions
aver+0.016
 pipe+0.015
 afraid+0.014
 worried+0.013
 scared+0.013
Neg logit contributions
 novels-0.025
 av-0.024
Bon-0.023
 Ivy-0.023
edly-0.022
Token perfect
Token ID2818
Feature activation-5.85e-01

Pos logit contributions
aver+0.011
 pipe+0.010
 afraid+0.010
 suppose+0.009
 bet+0.009
Neg logit contributions
 novels-0.008
 av-0.008
ados-0.007
 brace-0.007
 Sunny-0.007
Token cake
Token ID12187
Feature activation-5.41e+00

Pos logit contributions
 novels+0.020
 av+0.020
 Elsa+0.018
ados+0.018
 brace+0.018
Neg logit contributions
aver-0.023
 pipe-0.023
 afraid-0.020
 suppose-0.020
 bet-0.020
Token,
Token ID11
Feature activation-8.79e-01

Pos logit contributions
 novels+0.260
 av+0.249
Bon+0.242
Sun+0.238
Elsa+0.233
Neg logit contributions
 pipe-0.137
aver-0.126
 warned-0.116
 guess-0.114
 say-0.113
Token so
Token ID523
Feature activation+1.35e+00

Pos logit contributions
 novels+0.044
 av+0.043
 vows+0.039
ados+0.038
 brace+0.038
Neg logit contributions
aver-0.023
 pipe-0.021
 afraid-0.020
 suppose-0.019
 warned-0.018
Token he
Token ID339
Feature activation-1.28e+00

Pos logit contributions
aver+0.057
 pipe+0.052
 afraid+0.049
 suppose+0.048
 bet+0.045
Neg logit contributions
 novels-0.047
 av-0.047
 broccoli-0.040
 brace-0.040
 Sunny-0.039
Token followed
Token ID3940
Feature activation+2.56e-01

Pos logit contributions
 novels+0.071
 av+0.068
ados+0.063
 brace+0.061
 broccoli+0.060
Neg logit contributions
aver-0.033
 pipe-0.025
 warned-0.023
 afraid-0.022
 dared-0.022
Token all
Token ID477
Feature activation+1.62e+00

Pos logit contributions
aver+0.009
 pipe+0.009
 afraid+0.008
 suppose+0.008
 claw+0.007
Neg logit contributions
 novels-0.010
 av-0.010
 brace-0.009
 cand-0.009
 vows-0.009
Token of
Token ID286
Feature activation-1.29e+00

Pos logit contributions
 av+0.026
 Ivy+0.025
 novels+0.025
ados+0.024
 Midnight+0.024
Neg logit contributions
 afraid-0.024
aver-0.024
 pipe-0.024
 scared-0.022
 worried-0.022
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
 av+0.050
 novels+0.050
cakes+0.048
Bon+0.047
 vows+0.046
Neg logit contributions
 pipe-0.043
aver-0.041
 afraid-0.040
 shouting-0.037
 suppose-0.037
Token old
Token ID1468
Feature activation+2.70e+00

Pos logit contributions
aver+0.005
 pipe+0.005
 afraid+0.004
 suppose+0.004
 bet+0.004
Neg logit contributions
 novels-0.006
 av-0.006
 broccoli-0.005
 brace-0.005
 Sunny-0.005
Token man
Token ID582
Feature activation+1.72e+00

Pos logit contributions
aver+0.078
 pipe+0.072
 afraid+0.067
 worried+0.061
 suppose+0.060
Neg logit contributions
 novels-0.124
 av-0.123
 Ivy-0.112
ados-0.111
 broccoli-0.111
Token's
Token ID338
Feature activation+1.92e+00

Pos logit contributions
aver+0.057
 pipe+0.052
 afraid+0.049
 suppose+0.047
 worried+0.044
Neg logit contributions
 novels-0.074
 av-0.073
 broccoli-0.065
ados-0.064
 brace-0.064
Token stories
Token ID3923
Feature activation-5.44e+00

Pos logit contributions
aver+0.075
 pipe+0.068
 afraid+0.064
 suppose+0.063
 bet+0.059
Neg logit contributions
 novels-0.073
 av-0.072
 broccoli-0.063
 brace-0.062
 Sunny-0.061
Token and
Token ID290
Feature activation-1.82e-01

Pos logit contributions
 av+0.252
 novels+0.245
ados+0.240
 Ivy+0.221
 cere+0.220
Neg logit contributions
aver-0.145
 warned-0.134
 pipe-0.132
 afraid-0.131
 scared-0.126
Token adventures
Token ID17545
Feature activation-1.49e+00

Pos logit contributions
 av+0.008
 novels+0.008
ados+0.008
 Ivy+0.007
Bon+0.007
Neg logit contributions
aver-0.005
 pipe-0.005
 afraid-0.005
 suppose-0.005
 know-0.004
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
 novels+0.066
 av+0.065
Elsa+0.063
 Elsa+0.063
ados+0.063
Neg logit contributions
 pipe-0.037
aver-0.036
 scared-0.035
b-0.035
 afraid-0.034
Token<|endoftext|>
Token ID50256
Feature activation-1.51e+00

Pos logit contributions
 av+0.010
 novels+0.010
 brace+0.009
 broccoli+0.009
ados+0.009
Neg logit contributions
aver-0.005
 pipe-0.004
 suppose-0.004
 afraid-0.004
 bet-0.004
TokenWill
Token ID8743
Feature activation+1.60e+00

Pos logit contributions
 novels+0.075
 av+0.075
 brace+0.070
ados+0.068
 vows+0.068
Neg logit contributions
aver-0.040
 pipe-0.035
 suppose-0.032
 afraid-0.031
 bet-0.028
Token a
Token ID257
Feature activation-4.90e-01

Pos logit contributions
 av+0.175
 novels+0.174
ados+0.158
 Ivy+0.154
 brace+0.154
Neg logit contributions
aver-0.136
 pipe-0.123
 afraid-0.116
 suppose-0.111
 worried-0.103
Token moment
Token ID2589
Feature activation-1.86e+00

Pos logit contributions
 novels+0.021
 av+0.020
 broccoli+0.019
 brace+0.018
 Sunny+0.018
Neg logit contributions
aver-0.018
 pipe-0.015
aking-0.014
avers-0.014
eping-0.013
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
 novels+0.072
 av+0.072
 brace+0.063
 broccoli+0.062
ados+0.062
Neg logit contributions
aver-0.071
 pipe-0.065
 afraid-0.061
 suppose-0.059
 warned-0.055
Token She
Token ID1375
Feature activation-2.01e+00

Pos logit contributions
 novels+0.044
 av+0.043
 broccoli+0.037
 brace+0.037
 vows+0.036
Neg logit contributions
aver-0.049
 pipe-0.046
 afraid-0.043
 suppose-0.043
 bet-0.040
Token liked
Token ID8288
Feature activation+3.56e-01

Pos logit contributions
 novels+0.108
 av+0.107
 broccoli+0.097
 brace+0.096
 Sunny+0.096
Neg logit contributions
aver-0.047
 pipe-0.040
 afraid-0.035
 suppose-0.034
 bet-0.030
Token stories
Token ID3923
Feature activation-5.35e+00

Pos logit contributions
 pipe+0.011
 afraid+0.011
aver+0.010
 worried+0.010
 scared+0.010
Neg logit contributions
 novels-0.014
edly-0.013
 av-0.013
Bon-0.013
rily-0.013
Token about
Token ID546
Feature activation-1.01e+00

Pos logit contributions
 av+0.262
ados+0.240
 novels+0.238
Bon+0.234
 Ivy+0.233
Neg logit contributions
aver-0.140
 afraid-0.126
 scared-0.118
 pipe-0.115
 warned-0.114
Token princess
Token ID21752
Feature activation-1.12e+00

Pos logit contributions
edly+0.044
cakes+0.042
rily+0.042
 novels+0.041
cake+0.041
Neg logit contributions
 pipe-0.027
 afraid-0.026
 scared-0.023
 shouting-0.022
aver-0.022
Tokenes
Token ID274
Feature activation-2.51e+00

Pos logit contributions
 av+0.029
 Ivy+0.026
 masse+0.026
 novels+0.026
der+0.024
Neg logit contributions
aver-0.053
 afraid-0.051
 scared-0.049
 pipe-0.048
 worried-0.047
Token,
Token ID11
Feature activation-4.33e-01

Pos logit contributions
 av+0.129
 novels+0.126
 masse+0.121
 vows+0.121
 cere+0.120
Neg logit contributions
 afraid-0.050
 scared-0.048
 warned-0.047
aver-0.046
 pipe-0.046
Token fair
Token ID3148
Feature activation-5.10e-01

Pos logit contributions
 novels+0.019
 av+0.019
 vows+0.018
Bon+0.018
 Ivy+0.018
Neg logit contributions
aver-0.012
 pipe-0.012
 afraid-0.012
 suppose-0.010
 warned-0.010
Token his
Token ID465
Feature activation-7.36e-01

Pos logit contributions
 novels+0.042
 av+0.041
ados+0.037
 Ivy+0.037
 vows+0.036
Neg logit contributions
aver-0.032
 pipe-0.029
 afraid-0.027
 warned-0.025
 suppose-0.025
Token favourite
Token ID12507
Feature activation-3.48e-01

Pos logit contributions
 av+0.032
 novels+0.032
 Ivy+0.029
ados+0.029
 brace+0.028
Neg logit contributions
aver-0.024
 pipe-0.022
 afraid-0.020
 suppose-0.019
 worried-0.018
Token snack
Token ID26906
Feature activation-5.01e+00

Pos logit contributions
 av+0.015
 novels+0.015
ados+0.014
 Sunny+0.014
 Ivy+0.013
Neg logit contributions
aver-0.011
 pipe-0.010
 afraid-0.009
 suppose-0.009
 bet-0.008
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
 av+0.234
 novels+0.228
Bon+0.212
 Ivy+0.212
Rober+0.211
Neg logit contributions
aver-0.150
 pipe-0.144
 warned-0.114
 called-0.113
 said-0.106
Token some
Token ID617
Feature activation-2.02e+00

Pos logit contributions
 av+0.052
 novels+0.052
ados+0.048
 Ivy+0.047
der+0.047
Neg logit contributions
aver-0.029
 pipe-0.025
 afraid-0.021
 warned-0.021
 dared-0.021
Token books
Token ID3835
Feature activation-5.38e+00

Pos logit contributions
 novels+0.075
 av+0.075
ados+0.073
 Ivy+0.071
 Sunny+0.071
Neg logit contributions
aver-0.073
 pipe-0.068
 afraid-0.060
 worried-0.057
 warned-0.057
Token.
Token ID13
Feature activation-8.65e-01

Pos logit contributions
 novels+0.230
 av+0.226
ados+0.207
 broccoli+0.204
 Ivy+0.202
Neg logit contributions
aver-0.184
 pipe-0.162
 afraid-0.152
 warned-0.148
b-0.145
Token When
Token ID1649
Feature activation-1.11e-01

Pos logit contributions
 av+0.037
 novels+0.037
 brace+0.034
 Ivy+0.032
 broccoli+0.032
Neg logit contributions
aver-0.028
 pipe-0.026
 suppose-0.023
 afraid-0.023
 bet-0.022
Token he
Token ID339
Feature activation-1.58e-01

Pos logit contributions
 av+0.005
 novels+0.005
 brace+0.005
 Ivy+0.004
 vows+0.004
Neg logit contributions
 pipe-0.003
aver-0.003
 afraid-0.003
 worried-0.003
 scared-0.003
Token was
Token ID373
Feature activation+5.85e-01

Pos logit contributions
 novels+0.007
 av+0.007
 broccoli+0.006
 Sunny+0.006
 Elsa+0.006
Neg logit contributions
aver-0.005
 pipe-0.005
 afraid-0.004
 suppose-0.004
avers-0.004
Token ready
Token ID3492
Feature activation-1.47e+00

Pos logit contributions
aver+0.020
 pipe+0.018
 afraid+0.016
 suppose+0.016
 bet+0.015
Neg logit contributions
 novels-0.025
 av-0.025
 broccoli-0.022
 Sunny-0.022
 Elsa-0.022
Token minutes
Token ID2431
Feature activation+2.40e+00

Pos logit contributions
aver+0.013
 pipe+0.011
 afraid+0.011
 suppose+0.011
 bet+0.010
Neg logit contributions
 av-0.016
 novels-0.016
 brace-0.014
ados-0.014
 broccoli-0.014
Token,
Token ID11
Feature activation-1.72e-01

Pos logit contributions
aver+0.079
 pipe+0.072
 suppose+0.066
 warned+0.064
 afraid+0.062
Neg logit contributions
 novels-0.108
 av-0.103
 brace-0.093
 broccoli-0.093
 vows-0.090
Token a
Token ID257
Feature activation+6.35e-02

Pos logit contributions
 novels+0.007
 av+0.007
 brace+0.006
 vows+0.006
ados+0.006
Neg logit contributions
aver-0.006
 pipe-0.005
 afraid-0.005
 warned-0.005
 suppose-0.005
Token delicious
Token ID12625
Feature activation-3.72e+00

Pos logit contributions
aver+0.002
 pipe+0.002
 afraid+0.002
 suppose+0.002
 bet+0.001
Neg logit contributions
 novels-0.003
 av-0.003
 brace-0.003
ados-0.003
 broccoli-0.003
Token chocolate
Token ID11311
Feature activation-2.97e+00

Pos logit contributions
 novels+0.153
 av+0.143
 brace+0.133
ados+0.129
 Elsa+0.129
Neg logit contributions
aver-0.136
 pipe-0.130
 afraid-0.109
 worried-0.105
 bet-0.103
Token cake
Token ID12187
Feature activation-5.33e+00

Pos logit contributions
 novels+0.122
 av+0.120
Bon+0.114
Rober+0.113
 Elsa+0.109
Neg logit contributions
 pipe-0.100
aver-0.092
 saw-0.082
 kid-0.080
 hose-0.079
Token was
Token ID373
Feature activation+1.85e-01

Pos logit contributions
 novels+0.271
 av+0.257
Bon+0.239
Rober+0.235
ados+0.232
Neg logit contributions
aver-0.127
 pipe-0.122
 said-0.119
 saw-0.113
 warned-0.110
Token done
Token ID1760
Feature activation-2.34e+00

Pos logit contributions
aver+0.006
 pipe+0.005
 warned+0.004
 afraid+0.004
 worried+0.004
Neg logit contributions
 novels-0.009
 av-0.008
ados-0.008
Bon-0.008
edly-0.008
Token.
Token ID13
Feature activation-5.98e-03

Pos logit contributions
 novels+0.119
 av+0.113
 Ivy+0.106
ados+0.105
 Sunny+0.105
Neg logit contributions
 pipe-0.057
aver-0.057
 warned-0.048
b-0.046
 scared-0.045
Token Everyone
Token ID11075
Feature activation+1.01e+00

Pos logit contributions
 novels+0.000
 av+0.000
 brace+0.000
 vows+0.000
 broccoli+0.000
Neg logit contributions
aver-0.000
 pipe-0.000
 suppose-0.000
 afraid-0.000
 bet-0.000
Token was
Token ID373
Feature activation+5.37e-01

Pos logit contributions
aver+0.028
 pipe+0.023
 afraid+0.022
 suppose+0.021
 warned+0.020
Neg logit contributions
 novels-0.051
 av-0.051
ados-0.046
 broccoli-0.045
 brace-0.045