TOP ACTIVATIONS
MAX = 0.159

velvet box all those years ago . They were glad that
she bowed , the town spe ople stopped to admire her
imagined the tart was long ago , when she was a
The town spe ople had to find a
bell stopped . The town spe ople cheered and hugged each
! The town spe ople were sad and scared
. The town spe ople soon discovered where the
he had seen a year ago . He went home ,
. The town spe ople decided to ask their
. He asked the town spe ople what was wrong and
found the truck many years ago and worked hard to fix
. The town spe ople were sad because they
my had told him long ago . The bird spread out
ones are yellow and the un ri pe ones are green
rope bouncing . The town spe ople gathered around to watch
. It is the door bell . Mom my is home
her that a long time ago , someone made the mark
The town spe ople thought and thought until
! The town spe ople thanked the hero for
. The town spe ople were so happy they

MIN ACTIVATIONS
MIN = -0.131

know you did not like le mons . We wanted to
on , Sue always had le mons in her organized fruit
to climb safely and use lot ion when she got a
? I want to spell snake ." Lily says , "
she still loved to eat pepper on her food , just
. He wants to spell snake with his blocks . He
. On a and her friends had so
But we don 't like le mons . They are too
a little girl named On a . She wanted to be
more careful when she ate pepper and she always remembered what
ready to go now little onion !" The onion
never asked for too much pepper again . <|endoftext|> Once upon
lemon tree harvested lots of le mons and made a big
. But we need more le mons ." Jane and her
the sack and saw many le mons . They smelled sour
made sure to put on lot ion every time she went
do not need to spell snake . You need to say
. They have so many le mons . They won 't
kept putting way too much pepper on it and Ben kept
butterfly as it flew from flower to flower .

INTERVAL 0.087 - 0.159
CONTAINS 0.020%

Mom my , the f auc et in the bathroom is
Lily . She loved the holiday because she got to play
very special attic . The attic was full of fun things
moving , and the book case said , � � €
felt his tum my ac he starting to feel a bit

INTERVAL 0.014 - 0.087
CONTAINS 22.638%

they see a big cake on the table . It is
to rest . Max was not happy . He started to
of the animals along the way . When they arrived at
before he returned to his cozy cave in the forest .
new toy . Lily said yes and they went to Emma

INTERVAL -0.058 - 0.014
CONTAINS 76.579%

ground . He picked it up and showed it to his
to her own warning . From then on , she made
She wanted to make it soft and easy to work with
. They had a castle , a king , a queen
the things in our home . <|endoftext|> Once there was a

INTERVAL -0.131 - -0.058
CONTAINS 0.763%

finally , in a nearby bush , they saw a beautiful
" You 're welcome , sweet ie . Be careful ,
count them together ; let 's go outside and look !"
the checkout . In the end , Mom my was happy
of the knight and thankful for his courage . <|endoftext|> The
Token velvet
Token ID47750
Feature activation+2.30e-02

Pos logit contributions
aw+0.010
iles+0.009
 me+0.008
ups+0.008
 us+0.008
Neg logit contributions
 spoiler-0.010
 sponge-0.009
 ramp-0.009
ro-0.008
 widened-0.008
Token box
Token ID3091
Feature activation+2.29e-02

Pos logit contributions
 spoiler+0.042
 sponge+0.035
 ramp+0.034
 widened+0.032
ro+0.032
Neg logit contributions
aw-0.043
iles-0.037
ups-0.035
ing-0.035
ils-0.034
Token all
Token ID477
Feature activation+1.57e-03

Pos logit contributions
 spoiler+0.053
 sponge+0.046
 ramp+0.046
ro+0.044
 prope+0.044
Neg logit contributions
aw-0.032
iles-0.026
ups-0.024
ing-0.024
ils-0.023
Token those
Token ID883
Feature activation+4.15e-03

Pos logit contributions
 spoiler+0.004
 sponge+0.003
 ramp+0.003
ro+0.003
 widened+0.003
Neg logit contributions
aw-0.002
iles-0.002
ups-0.002
una-0.001
ese-0.001
Token years
Token ID812
Feature activation+6.14e-02

Pos logit contributions
 spoiler+0.008
 sponge+0.007
 ramp+0.007
 widened+0.007
ro+0.007
Neg logit contributions
aw-0.007
iles-0.006
ups-0.005
una-0.005
ing-0.005
Token ago
Token ID2084
Feature activation+1.59e-01

Pos logit contributions
 spoiler+0.129
 sponge+0.112
 ramp+0.110
ro+0.104
 widened+0.104
Neg logit contributions
aw-0.100
iles-0.083
ups-0.077
ing-0.076
ils-0.074
Token.
Token ID13
Feature activation-5.35e-03

Pos logit contributions
 spoiler+0.399
 ramp+0.347
 sponge+0.345
 prope+0.330
 widened+0.329
Neg logit contributions
aw-0.204
iles-0.153
ups-0.150
ing-0.143
ils-0.143
Token They
Token ID1119
Feature activation-1.03e-02

Pos logit contributions
aw+0.010
iles+0.009
ups+0.008
ing+0.008
ow+0.008
Neg logit contributions
 spoiler-0.010
 sponge-0.008
 ramp-0.008
 prope-0.008
 widened-0.008
Token were
Token ID547
Feature activation-7.29e-03

Pos logit contributions
aw+0.019
iles+0.017
ups+0.016
ing+0.015
una+0.015
Neg logit contributions
 spoiler-0.019
 sponge-0.016
 ramp-0.016
 widened-0.015
 prope-0.015
Token glad
Token ID9675
Feature activation-1.16e-02

Pos logit contributions
aw+0.012
iles+0.011
ups+0.010
ing+0.010
una+0.010
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 ramp-0.012
 prope-0.012
 widened-0.012
Token that
Token ID326
Feature activation-1.57e-02

Pos logit contributions
aw+0.019
iles+0.017
ils+0.016
ese+0.016
una+0.015
Neg logit contributions
 spoiler-0.021
 sponge-0.020
ro-0.018
 allowance-0.018
 ramp-0.018
Token she
Token ID673
Feature activation-1.63e-03

Pos logit contributions
 spoiler+0.031
 sponge+0.026
 ramp+0.026
 allowance+0.024
ro+0.024
Neg logit contributions
aw-0.036
iles-0.031
ups-0.030
ils-0.029
ese-0.029
Token bowed
Token ID35737
Feature activation+3.69e-03

Pos logit contributions
aw+0.003
iles+0.002
ils+0.002
ups+0.002
 me+0.002
Neg logit contributions
 spoiler-0.003
 sponge-0.003
 ramp-0.003
 prope-0.003
ro-0.003
Token,
Token ID11
Feature activation+9.99e-05

Pos logit contributions
 spoiler+0.008
 sponge+0.007
 ramp+0.007
ro+0.007
 widened+0.007
Neg logit contributions
aw-0.005
iles-0.004
ing-0.004
ups-0.004
ow-0.004
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
 spoiler+0.000
 sponge+0.000
 ramp+0.000
ro+0.000
 prope+0.000
Neg logit contributions
aw-0.000
iles-0.000
ups-0.000
ing-0.000
ils-0.000
Token town
Token ID3240
Feature activation+8.45e-02

Pos logit contributions
 spoiler+0.032
 sponge+0.027
 ramp+0.027
ro+0.025
 prope+0.025
Neg logit contributions
aw-0.032
iles-0.028
ups-0.026
ing-0.026
ils-0.026
Tokenspe
Token ID4125
Feature activation+1.54e-01

Pos logit contributions
 spoiler+0.159
 sponge+0.135
 ramp+0.133
ro+0.126
 widened+0.124
Neg logit contributions
aw-0.153
iles-0.133
ups-0.122
una-0.121
ils-0.120
Tokenople
Token ID643
Feature activation+5.63e-02

Pos logit contributions
 spoiler+0.334
 sponge+0.292
 ramp+0.287
ro+0.272
 widened+0.271
Neg logit contributions
aw-0.240
iles-0.200
ups-0.184
ing-0.179
ils-0.177
Token stopped
Token ID5025
Feature activation-1.22e-02

Pos logit contributions
 spoiler+0.110
 sponge+0.095
 ramp+0.093
 widened+0.089
ro+0.089
Neg logit contributions
aw-0.096
iles-0.083
ups-0.077
una-0.075
ing-0.074
Token to
Token ID284
Feature activation-7.27e-03

Pos logit contributions
aw+0.019
iles+0.016
ils+0.015
una+0.015
ups+0.015
Neg logit contributions
 spoiler-0.026
 sponge-0.023
 ramp-0.022
 prope-0.022
 widened-0.021
Token admire
Token ID21099
Feature activation-1.29e-02

Pos logit contributions
aw+0.012
 me+0.010
iles+0.010
ow+0.009
 us+0.009
Neg logit contributions
 spoiler-0.015
 sponge-0.013
 ramp-0.013
bes-0.012
 allowance-0.012
Token her
Token ID607
Feature activation-4.76e-03

Pos logit contributions
aw+0.019
iles+0.016
ups+0.015
ese+0.015
ing+0.014
Neg logit contributions
 spoiler-0.028
 sponge-0.024
 ramp-0.024
 prope-0.023
 widened-0.023
Token imagined
Token ID15758
Feature activation+5.37e-02

Pos logit contributions
 spoiler+0.019
 sponge+0.016
 ramp+0.016
ro+0.015
 widened+0.015
Neg logit contributions
aw-0.020
iles-0.018
ups-0.017
ing-0.016
ils-0.016
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
 spoiler+0.115
 sponge+0.102
 ramp+0.098
 widened+0.097
 allowance+0.095
Neg logit contributions
aw-0.080
iles-0.068
ese-0.061
ups-0.061
una-0.059
Token tart
Token ID35842
Feature activation-1.67e-02

Pos logit contributions
 spoiler+0.020
 ramp+0.017
 sponge+0.016
 prope+0.016
ro+0.016
Neg logit contributions
aw-0.019
iles-0.016
ups-0.016
 me-0.015
ing-0.015
Token was
Token ID373
Feature activation+3.42e-02

Pos logit contributions
aw+0.031
iles+0.026
ing+0.025
ups+0.024
ils+0.024
Neg logit contributions
 spoiler-0.033
 ramp-0.027
 sponge-0.026
 prope-0.026
 widened-0.025
Token long
Token ID890
Feature activation+4.46e-02

Pos logit contributions
 spoiler+0.070
 sponge+0.062
 ramp+0.059
ro+0.057
 widened+0.057
Neg logit contributions
aw-0.057
iles-0.048
ups-0.044
ing-0.043
ils-0.042
Token ago
Token ID2084
Feature activation+1.50e-01

Pos logit contributions
 spoiler+0.105
 ramp+0.087
 sponge+0.085
 prope+0.082
ge+0.082
Neg logit contributions
aw-0.062
 me-0.049
ing-0.047
iles-0.047
ils-0.046
Token,
Token ID11
Feature activation+3.60e-02

Pos logit contributions
 spoiler+0.375
 ramp+0.318
 sponge+0.315
 prope+0.304
 widened+0.302
Neg logit contributions
aw-0.199
ups-0.145
ow-0.144
ing-0.143
iles-0.143
Token when
Token ID618
Feature activation-2.51e-02

Pos logit contributions
 spoiler+0.078
 ramp+0.070
ge+0.065
 sponge+0.060
igsaw+0.060
Neg logit contributions
aw-0.059
 me-0.050
 that-0.047
 by-0.046
 for-0.046
Token she
Token ID673
Feature activation-1.75e-02

Pos logit contributions
aw+0.043
iles+0.038
ese+0.034
ups+0.034
ils+0.034
Neg logit contributions
 spoiler-0.049
 sponge-0.043
 ramp-0.041
 widened-0.040
 allowance-0.039
Token was
Token ID373
Feature activation-9.19e-03

Pos logit contributions
aw+0.032
iles+0.028
ups+0.026
ing+0.026
una+0.025
Neg logit contributions
 spoiler-0.032
 sponge-0.027
 ramp-0.026
 widened-0.026
ro-0.026
Token a
Token ID257
Feature activation+1.97e-02

Pos logit contributions
aw+0.016
iles+0.014
ups+0.012
ing+0.012
ils+0.012
Neg logit contributions
 spoiler-0.018
 sponge-0.016
 ramp-0.015
 widened-0.015
 prope-0.015
Token
Token ID220
Feature activation+3.14e-02

Pos logit contributions
 spoiler+0.021
 sponge+0.018
 prope+0.018
 ramp+0.017
ro+0.017
Neg logit contributions
aw-0.011
iles-0.010
ow-0.009
uck-0.009
 me-0.009
Token
Token ID198
Feature activation-6.85e-03

Pos logit contributions
 spoiler+0.056
 sponge+0.048
 ramp+0.046
ro+0.045
 widened+0.044
Neg logit contributions
aw-0.058
iles-0.052
ils-0.048
ups-0.047
ese-0.047
Token
Token ID198
Feature activation-8.63e-03

Pos logit contributions
aw+0.012
iles+0.011
ils+0.010
ups+0.010
ing+0.010
Neg logit contributions
 spoiler-0.013
 sponge-0.011
 ramp-0.011
ro-0.010
 widened-0.010
TokenThe
Token ID464
Feature activation-4.63e-03

Pos logit contributions
aw+0.016
iles+0.014
ups+0.012
ing+0.012
ils+0.012
Neg logit contributions
 spoiler-0.016
 sponge-0.014
 ramp-0.014
ro-0.013
 widened-0.013
Token town
Token ID3240
Feature activation+7.30e-02

Pos logit contributions
aw+0.009
iles+0.007
ups+0.007
ils+0.007
ow+0.007
Neg logit contributions
 spoiler-0.009
 sponge-0.008
 ramp-0.007
 prope-0.007
ro-0.007
Tokenspe
Token ID4125
Feature activation+1.48e-01

Pos logit contributions
 spoiler+0.115
 sponge+0.095
 ramp+0.093
ro+0.086
 widened+0.086
Neg logit contributions
aw-0.153
iles-0.137
ups-0.127
una-0.127
ils-0.125
Tokenople
Token ID643
Feature activation+5.12e-02

Pos logit contributions
 spoiler+0.306
 sponge+0.263
 ramp+0.259
ro+0.246
 widened+0.244
Neg logit contributions
aw-0.242
iles-0.205
ups-0.189
ing-0.185
ils-0.182
Token had
Token ID550
Feature activation+2.87e-03

Pos logit contributions
 spoiler+0.093
 sponge+0.079
 ramp+0.077
ro+0.073
 widened+0.073
Neg logit contributions
aw-0.096
iles-0.083
ups-0.078
ing-0.076
ils-0.075
Token to
Token ID284
Feature activation+8.69e-04

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
 widened+0.005
 prope+0.005
Neg logit contributions
aw-0.004
iles-0.004
ing-0.003
ups-0.003
una-0.003
Token find
Token ID1064
Feature activation-5.48e-04

Pos logit contributions
 spoiler+0.002
 sponge+0.002
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
 me-0.001
ow-0.001
 us-0.001
Token a
Token ID257
Feature activation+9.98e-03

Pos logit contributions
aw+0.001
iles+0.001
ese+0.001
una+0.001
ups+0.001
Neg logit contributions
 spoiler-0.001
 sponge-0.001
 ramp-0.001
 cloth-0.001
 prope-0.001
Token bell
Token ID8966
Feature activation+3.04e-02

Pos logit contributions
 spoiler+0.040
 sponge+0.034
 ramp+0.034
ro+0.032
 widened+0.032
Neg logit contributions
aw-0.040
iles-0.034
ups-0.032
ing-0.031
 me-0.031
Token stopped
Token ID5025
Feature activation-7.06e-03

Pos logit contributions
 spoiler+0.057
 sponge+0.049
 ramp+0.048
ro+0.045
 widened+0.045
Neg logit contributions
aw-0.055
iles-0.047
ups-0.045
ing-0.044
ils-0.043
Token.
Token ID13
Feature activation-6.69e-03

Pos logit contributions
aw+0.011
iles+0.009
ups+0.009
ils+0.009
una+0.008
Neg logit contributions
 spoiler-0.015
 sponge-0.013
 ramp-0.013
 prope-0.012
 widened-0.012
Token The
Token ID383
Feature activation-4.06e-03

Pos logit contributions
aw+0.012
iles+0.010
ups+0.010
ing+0.010
ow+0.010
Neg logit contributions
 spoiler-0.012
 sponge-0.011
 ramp-0.010
 prope-0.010
 widened-0.010
Token town
Token ID3240
Feature activation+8.12e-02

Pos logit contributions
aw+0.007
iles+0.006
ups+0.006
ing+0.006
 me+0.006
Neg logit contributions
 spoiler-0.008
 sponge-0.007
 ramp-0.007
ro-0.006
 widened-0.006
Tokenspe
Token ID4125
Feature activation+1.40e-01

Pos logit contributions
 spoiler+0.146
 sponge+0.123
 ramp+0.121
ro+0.113
 widened+0.112
Neg logit contributions
aw-0.156
iles-0.134
ups-0.126
ing-0.124
ils-0.122
Tokenople
Token ID643
Feature activation+5.17e-02

Pos logit contributions
 spoiler+0.301
 sponge+0.260
 ramp+0.256
ro+0.245
 widened+0.243
Neg logit contributions
aw-0.217
iles-0.183
ups-0.168
ing-0.163
ils-0.161
Token cheered
Token ID34767
Feature activation+1.77e-02

Pos logit contributions
 spoiler+0.094
 sponge+0.079
 ramp+0.078
ro+0.073
 widened+0.072
Neg logit contributions
aw-0.098
iles-0.085
ups-0.079
ing-0.078
ils-0.077
Token and
Token ID290
Feature activation-7.38e-03

Pos logit contributions
 spoiler+0.032
 sponge+0.027
 ramp+0.026
 widened+0.026
ro+0.025
Neg logit contributions
aw-0.033
iles-0.030
ups-0.027
ils-0.027
ese-0.026
Token hugged
Token ID41130
Feature activation+1.18e-02

Pos logit contributions
aw+0.014
iles+0.012
ups+0.011
una+0.011
ese+0.011
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 ramp-0.011
 widened-0.011
 prope-0.011
Token each
Token ID1123
Feature activation-1.67e-02

Pos logit contributions
 spoiler+0.028
 sponge+0.024
 ramp+0.024
 widened+0.023
ro+0.023
Neg logit contributions
aw-0.016
iles-0.014
ups-0.012
ese-0.011
una-0.011
Token!
Token ID0
Feature activation-2.12e-02

Pos logit contributions
 spoiler+0.049
 ramp+0.043
 sponge+0.042
bes+0.040
 prope+0.040
Neg logit contributions
aw-0.020
ing-0.015
iles-0.014
 me-0.014
 by-0.014
Token
Token ID198
Feature activation-1.24e-02

Pos logit contributions
aw+0.038
iles+0.032
ups+0.030
ing+0.030
ow+0.029
Neg logit contributions
 spoiler-0.041
 sponge-0.035
 ramp-0.034
 prope-0.032
 widened-0.032
Token
Token ID198
Feature activation-9.64e-03

Pos logit contributions
aw+0.021
iles+0.018
ups+0.016
ing+0.016
ils+0.016
Neg logit contributions
 spoiler-0.025
 sponge-0.022
 ramp-0.021
ro-0.020
 prope-0.020
TokenThe
Token ID464
Feature activation-1.18e-02

Pos logit contributions
aw+0.017
iles+0.015
ups+0.014
ing+0.013
ils+0.013
Neg logit contributions
 spoiler-0.019
 sponge-0.016
 ramp-0.016
ro-0.015
 widened-0.015
Token town
Token ID3240
Feature activation+7.65e-02

Pos logit contributions
aw+0.021
iles+0.018
ups+0.017
ing+0.017
ils+0.017
Neg logit contributions
 spoiler-0.023
 sponge-0.019
 ramp-0.019
ro-0.018
 widened-0.018
Tokenspe
Token ID4125
Feature activation+1.40e-01

Pos logit contributions
 spoiler+0.135
 sponge+0.115
 ramp+0.113
ro+0.107
 widened+0.106
Neg logit contributions
aw-0.145
iles-0.129
una-0.117
ups-0.117
ils-0.116
Tokenople
Token ID643
Feature activation+5.30e-02

Pos logit contributions
 spoiler+0.297
 sponge+0.251
 ramp+0.249
ro+0.240
 widened+0.238
Neg logit contributions
aw-0.216
iles-0.187
ups-0.171
una-0.170
ing-0.166
Token were
Token ID547
Feature activation-1.91e-03

Pos logit contributions
 spoiler+0.091
 sponge+0.077
 ramp+0.074
 widened+0.071
 prope+0.070
Neg logit contributions
aw-0.104
iles-0.093
ups-0.087
una-0.084
ing-0.084
Token sad
Token ID6507
Feature activation+1.58e-02

Pos logit contributions
aw+0.004
iles+0.003
una+0.003
ils+0.003
ups+0.003
Neg logit contributions
 spoiler-0.003
 sponge-0.003
 ramp-0.003
 widened-0.003
 prope-0.003
Token and
Token ID290
Feature activation-7.08e-03

Pos logit contributions
 spoiler+0.028
 sponge+0.025
 ramp+0.024
 widened+0.023
ro+0.023
Neg logit contributions
aw-0.030
iles-0.026
ese-0.024
ils-0.024
ups-0.024
Token scared
Token ID12008
Feature activation+2.08e-03

Pos logit contributions
aw+0.014
iles+0.012
una+0.011
ese+0.011
ups+0.011
Neg logit contributions
 spoiler-0.013
 sponge-0.011
 widened-0.010
 ramp-0.010
 prope-0.010
Token.
Token ID13
Feature activation+7.61e-03

Pos logit contributions
 spoiler+0.021
 sponge+0.018
 ramp+0.018
ro+0.017
 widened+0.017
Neg logit contributions
aw-0.016
iles-0.014
ups-0.012
ese-0.012
ils-0.012
Token
Token ID198
Feature activation+7.99e-03

Pos logit contributions
 spoiler+0.013
 sponge+0.011
 ramp+0.011
ro+0.010
 widened+0.010
Neg logit contributions
aw-0.015
iles-0.013
ups-0.012
ing-0.012
ils-0.012
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
 spoiler+0.016
 sponge+0.014
 ramp+0.014
ro+0.013
 prope+0.013
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ing-0.011
ils-0.010
TokenThe
Token ID464
Feature activation-1.33e-05

Pos logit contributions
 spoiler+0.021
 sponge+0.018
 ramp+0.018
ro+0.017
 widened+0.017
Neg logit contributions
aw-0.018
iles-0.015
ups-0.014
ing-0.013
ils-0.013
Token town
Token ID3240
Feature activation+8.36e-02

Pos logit contributions
aw+0.000
iles+0.000
ups+0.000
ils+0.000
ing+0.000
Neg logit contributions
 spoiler-0.000
 sponge-0.000
 ramp-0.000
 prope-0.000
ro-0.000
Tokenspe
Token ID4125
Feature activation+1.38e-01

Pos logit contributions
 spoiler+0.130
 sponge+0.107
 ramp+0.105
ro+0.099
 widened+0.098
Neg logit contributions
aw-0.176
iles-0.158
una-0.147
ups-0.144
ese-0.144
Tokenople
Token ID643
Feature activation+5.61e-02

Pos logit contributions
 spoiler+0.290
 sponge+0.243
 ramp+0.242
ro+0.233
 widened+0.231
Neg logit contributions
aw-0.215
iles-0.187
una-0.171
ups-0.169
ing-0.165
Token soon
Token ID2582
Feature activation+3.65e-02

Pos logit contributions
 spoiler+0.103
 sponge+0.089
 ramp+0.086
 widened+0.084
ro+0.082
Neg logit contributions
aw-0.102
iles-0.089
ups-0.083
una-0.081
ing-0.080
Token discovered
Token ID5071
Feature activation-3.21e-05

Pos logit contributions
 spoiler+0.060
 widened+0.056
 sponge+0.054
ro+0.051
 ramp+0.050
Neg logit contributions
aw-0.063
iles-0.060
ow-0.055
una-0.054
ese-0.054
Token where
Token ID810
Feature activation+2.36e-02

Pos logit contributions
aw+0.000
iles+0.000
ese+0.000
una+0.000
ups+0.000
Neg logit contributions
 spoiler-0.000
 sponge-0.000
 ramp-0.000
 widened-0.000
 prope-0.000
Token the
Token ID262
Feature activation+1.78e-03

Pos logit contributions
 spoiler+0.044
 sponge+0.039
 ramp+0.037
 prope+0.035
 widened+0.035
Neg logit contributions
aw-0.042
iles-0.037
ese-0.034
una-0.033
ils-0.033
Token he
Token ID339
Feature activation+4.44e-02

Pos logit contributions
 spoiler+0.038
 sponge+0.034
 ramp+0.033
ro+0.032
 allowance+0.031
Neg logit contributions
aw-0.024
iles-0.020
ups-0.018
ese-0.018
ils-0.017
Token had
Token ID550
Feature activation+3.92e-02

Pos logit contributions
 spoiler+0.080
 sponge+0.067
 ramp+0.066
 prope+0.062
ro+0.061
Neg logit contributions
aw-0.086
iles-0.074
ups-0.070
ils-0.069
ing-0.068
Token seen
Token ID1775
Feature activation+4.43e-02

Pos logit contributions
 spoiler+0.083
 sponge+0.072
 ramp+0.071
ro+0.069
 widened+0.068
Neg logit contributions
aw-0.061
iles-0.052
ups-0.046
ing-0.046
ese-0.045
Token a
Token ID257
Feature activation+2.51e-02

Pos logit contributions
 spoiler+0.090
 sponge+0.078
 ramp+0.077
ro+0.075
 widened+0.073
Neg logit contributions
aw-0.071
iles-0.061
ups-0.056
ils-0.054
ese-0.054
Token year
Token ID614
Feature activation+4.57e-02

Pos logit contributions
 spoiler+0.046
 sponge+0.039
 ramp+0.038
 prope+0.036
ro+0.036
Neg logit contributions
aw-0.048
iles-0.041
ups-0.039
ing-0.038
ils-0.038
Token ago
Token ID2084
Feature activation+1.38e-01

Pos logit contributions
 spoiler+0.099
 sponge+0.086
 ramp+0.085
ro+0.080
 prope+0.080
Neg logit contributions
aw-0.072
iles-0.059
ups-0.056
ing-0.055
ils-0.053
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
 spoiler+0.355
 sponge+0.308
 ramp+0.307
 prope+0.295
 widened+0.293
Neg logit contributions
aw-0.167
iles-0.117
ups-0.116
ing-0.115
ow-0.111
Token He
Token ID679
Feature activation+1.54e-02

Pos logit contributions
aw+0.021
iles+0.018
ups+0.017
 me+0.017
ing+0.016
Neg logit contributions
 spoiler-0.020
 sponge-0.017
 ramp-0.016
 prope-0.016
 widened-0.015
Token went
Token ID1816
Feature activation+9.83e-03

Pos logit contributions
 spoiler+0.030
 sponge+0.026
 ramp+0.025
ro+0.024
 prope+0.024
Neg logit contributions
aw-0.028
iles-0.023
ups-0.022
 me-0.021
ils-0.021
Token home
Token ID1363
Feature activation-1.46e-02

Pos logit contributions
 spoiler+0.023
 sponge+0.021
 ramp+0.021
ro+0.020
 allowance+0.019
Neg logit contributions
aw-0.014
iles-0.012
ese-0.010
ils-0.010
una-0.009
Token,
Token ID11
Feature activation-9.03e-03

Pos logit contributions
aw+0.018
 for+0.016
ow+0.016
ups+0.016
 by+0.016
Neg logit contributions
 spoiler-0.036
 sponge-0.030
 ramp-0.029
 prope-0.028
 widened-0.027
Token.
Token ID13
Feature activation+2.48e-03

Pos logit contributions
 spoiler+0.022
 sponge+0.019
 ramp+0.018
ro+0.018
 widened+0.018
Neg logit contributions
aw-0.016
iles-0.014
ups-0.012
ese-0.012
ils-0.012
Token
Token ID198
Feature activation+9.50e-03

Pos logit contributions
 spoiler+0.005
 sponge+0.004
 ramp+0.004
ro+0.004
 widened+0.004
Neg logit contributions
aw-0.005
iles-0.004
ups-0.004
ing-0.004
ils-0.004
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
 spoiler+0.019
 sponge+0.016
 ramp+0.016
ro+0.015
 widened+0.015
Neg logit contributions
aw-0.017
iles-0.014
ups-0.013
ing-0.013
ils-0.013
TokenThe
Token ID464
Feature activation-5.86e-03

Pos logit contributions
 spoiler+0.024
 sponge+0.021
 ramp+0.021
ro+0.020
 widened+0.020
Neg logit contributions
aw-0.020
iles-0.017
ups-0.016
ing-0.015
ils-0.015
Token town
Token ID3240
Feature activation+8.07e-02

Pos logit contributions
aw+0.011
iles+0.010
 me+0.009
ups+0.009
ing+0.009
Neg logit contributions
 spoiler-0.011
 ramp-0.009
 sponge-0.009
 widened-0.008
 prope-0.008
Tokenspe
Token ID4125
Feature activation+1.37e-01

Pos logit contributions
 spoiler+0.134
 sponge+0.113
 ramp+0.111
ro+0.104
 widened+0.103
Neg logit contributions
aw-0.161
iles-0.144
una-0.133
ese-0.131
ils-0.130
Tokenople
Token ID643
Feature activation+5.67e-02

Pos logit contributions
 spoiler+0.297
 sponge+0.255
 ramp+0.252
ro+0.241
 widened+0.239
Neg logit contributions
aw-0.210
iles-0.179
ups-0.163
ing-0.159
una-0.158
Token decided
Token ID3066
Feature activation+1.66e-02

Pos logit contributions
 spoiler+0.100
 sponge+0.086
 ramp+0.083
 widened+0.082
 prope+0.078
Neg logit contributions
aw-0.107
iles-0.095
ups-0.088
una-0.087
ese-0.085
Token to
Token ID284
Feature activation-1.57e-02

Pos logit contributions
 spoiler+0.039
 sponge+0.035
 ramp+0.034
 widened+0.034
 prope+0.033
Neg logit contributions
aw-0.021
iles-0.019
una-0.015
ese-0.015
ils-0.015
Token ask
Token ID1265
Feature activation-9.91e-03

Pos logit contributions
aw+0.026
iles+0.023
 me+0.022
ow+0.022
 us+0.022
Neg logit contributions
 spoiler-0.031
 sponge-0.026
 ramp-0.026
ro-0.024
 widened-0.024
Token their
Token ID511
Feature activation-1.27e-02

Pos logit contributions
aw+0.016
iles+0.013
ese+0.013
ups+0.012
ing+0.011
Neg logit contributions
 spoiler-0.020
 sponge-0.018
 ramp-0.017
 allowance-0.017
 prope-0.017
Token.
Token ID13
Feature activation-4.66e-03

Pos logit contributions
 spoiler+0.035
 sponge+0.030
 ramp+0.029
 widened+0.028
ro+0.028
Neg logit contributions
aw-0.031
iles-0.027
ups-0.025
ils-0.025
ese-0.024
Token He
Token ID679
Feature activation+1.66e-03

Pos logit contributions
aw+0.010
iles+0.009
ups+0.008
ing+0.008
ow+0.008
Neg logit contributions
 spoiler-0.007
 sponge-0.006
 ramp-0.006
 prope-0.005
 widened-0.005
Token asked
Token ID1965
Feature activation+2.89e-02

Pos logit contributions
 spoiler+0.003
 sponge+0.002
 ramp+0.002
 widened+0.002
ro+0.002
Neg logit contributions
aw-0.003
iles-0.003
ups-0.003
ing-0.003
ese-0.003
Token the
Token ID262
Feature activation-8.18e-03

Pos logit contributions
 spoiler+0.052
 sponge+0.044
 ramp+0.043
ro+0.041
 allowance+0.041
Neg logit contributions
aw-0.055
iles-0.047
ups-0.044
ese-0.044
ils-0.042
Token town
Token ID3240
Feature activation+6.37e-02

Pos logit contributions
aw+0.016
iles+0.014
ils+0.013
ow+0.013
ups+0.013
Neg logit contributions
 spoiler-0.015
 ramp-0.013
 sponge-0.013
ro-0.012
 prope-0.012
Tokenspe
Token ID4125
Feature activation+1.35e-01

Pos logit contributions
 spoiler+0.138
 sponge+0.121
 ramp+0.120
ro+0.115
 allowance+0.113
Neg logit contributions
aw-0.097
iles-0.080
ils-0.072
ese-0.071
una-0.071
Tokenople
Token ID643
Feature activation+3.59e-02

Pos logit contributions
 spoiler+0.284
 ramp+0.238
 sponge+0.234
ro+0.228
 prope+0.224
Neg logit contributions
aw-0.215
iles-0.186
una-0.172
ups-0.168
ing-0.164
Token what
Token ID644
Feature activation+2.07e-03

Pos logit contributions
 spoiler+0.080
 sponge+0.072
 ramp+0.069
ro+0.067
ge+0.067
Neg logit contributions
aw-0.052
iles-0.042
ups-0.037
ese-0.037
una-0.036
Token was
Token ID373
Feature activation+5.22e-02

Pos logit contributions
 spoiler+0.003
 sponge+0.003
 ramp+0.003
 widened+0.003
 allowance+0.003
Neg logit contributions
aw-0.004
iles-0.004
ese-0.004
ups-0.004
ing-0.003
Token wrong
Token ID2642
Feature activation+1.18e-02

Pos logit contributions
 spoiler+0.134
 ramp+0.121
 sponge+0.118
 prope+0.116
 widened+0.116
Neg logit contributions
aw-0.057
iles-0.044
ups-0.041
una-0.038
ese-0.037
Token and
Token ID290
Feature activation+1.07e-02

Pos logit contributions
 spoiler+0.026
 sponge+0.023
 ramp+0.022
 prope+0.021
ro+0.021
Neg logit contributions
aw-0.018
iles-0.015
ups-0.014
ing-0.014
ow-0.013
Token found
Token ID1043
Feature activation+6.94e-03

Pos logit contributions
 spoiler+0.036
 sponge+0.031
 widened+0.030
 prope+0.030
 allowance+0.029
Neg logit contributions
aw-0.033
iles-0.030
ing-0.028
una-0.026
ups-0.026
Token the
Token ID262
Feature activation+8.37e-03

Pos logit contributions
 spoiler+0.013
 sponge+0.011
 ramp+0.010
 cloth+0.010
 allowance+0.010
Neg logit contributions
aw-0.012
iles-0.011
ups-0.010
ese-0.010
ils-0.009
Token truck
Token ID7779
Feature activation+9.50e-03

Pos logit contributions
 spoiler+0.016
 sponge+0.013
 ramp+0.013
ro+0.012
 widened+0.012
Neg logit contributions
aw-0.015
iles-0.013
ups-0.012
ing-0.012
ils-0.012
Token many
Token ID867
Feature activation+1.02e-02

Pos logit contributions
 spoiler+0.020
 sponge+0.017
 ramp+0.017
 prope+0.016
ro+0.016
Neg logit contributions
aw-0.015
iles-0.013
ups-0.012
ing-0.012
ow-0.011
Token years
Token ID812
Feature activation+2.56e-02

Pos logit contributions
 spoiler+0.022
 sponge+0.019
 ramp+0.019
ro+0.018
 prope+0.018
Neg logit contributions
aw-0.017
iles-0.014
ups-0.013
ils-0.013
ing-0.012
Token ago
Token ID2084
Feature activation+1.35e-01

Pos logit contributions
 spoiler+0.049
 sponge+0.042
 ramp+0.041
 widened+0.038
 prope+0.038
Neg logit contributions
aw-0.047
iles-0.040
ups-0.038
ing-0.037
ils-0.036
Token and
Token ID290
Feature activation-6.44e-03

Pos logit contributions
 spoiler+0.337
 ramp+0.292
 sponge+0.291
 prope+0.279
 widened+0.275
Neg logit contributions
aw-0.175
iles-0.130
ups-0.126
ing-0.123
ow-0.121
Token worked
Token ID3111
Feature activation+1.88e-03

Pos logit contributions
aw+0.011
iles+0.009
ups+0.009
ing+0.009
ils+0.009
Neg logit contributions
 spoiler-0.013
 sponge-0.011
 ramp-0.011
ro-0.010
 prope-0.010
Token hard
Token ID1327
Feature activation-3.94e-02

Pos logit contributions
 spoiler+0.004
 sponge+0.004
 ramp+0.004
 widened+0.003
ro+0.003
Neg logit contributions
aw-0.003
iles-0.002
ups-0.002
ese-0.002
ing-0.002
Token to
Token ID284
Feature activation-2.26e-02

Pos logit contributions
aw+0.049
iles+0.039
ups+0.034
ing+0.033
ils+0.033
Neg logit contributions
 spoiler-0.098
 sponge-0.087
 ramp-0.086
ro-0.082
 widened-0.082
Token fix
Token ID4259
Feature activation+1.19e-02

Pos logit contributions
aw+0.038
iles+0.033
 me+0.033
 us+0.032
ow+0.032
Neg logit contributions
 spoiler-0.044
 sponge-0.037
 ramp-0.035
 prope-0.034
bes-0.034
Token.
Token ID13
Feature activation-1.15e-03

Pos logit contributions
 spoiler+0.029
 sponge+0.025
 ramp+0.025
ro+0.024
 widened+0.023
Neg logit contributions
aw-0.024
iles-0.020
ups-0.019
ese-0.018
ils-0.018
Token
Token ID198
Feature activation+5.54e-03

Pos logit contributions
aw+0.002
iles+0.002
ils+0.002
ups+0.002
ese+0.002
Neg logit contributions
 spoiler-0.002
 sponge-0.001
 ramp-0.001
ro-0.001
 widened-0.001
Token
Token ID198
Feature activation+9.63e-03

Pos logit contributions
 spoiler+0.009
 ramp+0.008
 sponge+0.008
ro+0.008
 widened+0.007
Neg logit contributions
aw-0.011
iles-0.009
ils-0.009
ups-0.009
ese-0.009
TokenThe
Token ID464
Feature activation-2.85e-03

Pos logit contributions
 spoiler+0.016
 ramp+0.013
 sponge+0.013
ro+0.013
 widened+0.013
Neg logit contributions
aw-0.019
iles-0.017
ow-0.016
ing-0.016
ils-0.016
Token town
Token ID3240
Feature activation+7.96e-02

Pos logit contributions
aw+0.006
iles+0.005
 me+0.005
ups+0.005
ing+0.005
Neg logit contributions
 spoiler-0.005
 sponge-0.004
 ramp-0.004
ro-0.004
 widened-0.004
Tokenspe
Token ID4125
Feature activation+1.34e-01

Pos logit contributions
 spoiler+0.133
 sponge+0.111
 ramp+0.110
ro+0.103
 widened+0.101
Neg logit contributions
aw-0.157
iles-0.143
una-0.131
ese-0.129
ils-0.128
Tokenople
Token ID643
Feature activation+5.44e-02

Pos logit contributions
 spoiler+0.287
 sponge+0.244
 ramp+0.242
ro+0.232
 widened+0.230
Neg logit contributions
aw-0.208
iles-0.178
ups-0.163
una-0.159
ing-0.158
Token were
Token ID547
Feature activation+1.84e-04

Pos logit contributions
 spoiler+0.088
 sponge+0.075
 widened+0.074
 ramp+0.071
ro+0.068
Neg logit contributions
aw-0.106
iles-0.097
ups-0.091
una-0.091
ese-0.088
Token sad
Token ID6507
Feature activation+1.93e-02

Pos logit contributions
 spoiler+0.000
 prope+0.000
 sponge+0.000
 ramp+0.000
 widened+0.000
Neg logit contributions
aw-0.000
iles-0.000
una-0.000
ups-0.000
ils-0.000
Token because
Token ID780
Feature activation-2.82e-02

Pos logit contributions
 spoiler+0.034
 sponge+0.031
 ramp+0.029
ro+0.028
 widened+0.027
Neg logit contributions
aw-0.037
iles-0.031
ese-0.031
ups-0.030
ils-0.030
Token they
Token ID484
Feature activation+1.18e-02

Pos logit contributions
aw+0.044
iles+0.037
ups+0.034
ing+0.034
ils+0.032
Neg logit contributions
 spoiler-0.061
 sponge-0.053
 ramp-0.052
ro-0.051
 widened-0.049
Tokenmy
Token ID1820
Feature activation+3.20e-02

Pos logit contributions
 spoiler+0.045
 sponge+0.038
 ramp+0.036
 widened+0.035
ro+0.034
Neg logit contributions
aw-0.059
iles-0.053
ups-0.049
una-0.048
ing-0.048
Token had
Token ID550
Feature activation+4.28e-02

Pos logit contributions
 spoiler+0.057
 sponge+0.049
 ramp+0.047
 widened+0.045
ro+0.045
Neg logit contributions
aw-0.061
iles-0.053
ups-0.049
una-0.048
ing-0.048
Token told
Token ID1297
Feature activation+1.30e-02

Pos logit contributions
 spoiler+0.089
 sponge+0.078
 ramp+0.075
 widened+0.073
ro+0.073
Neg logit contributions
aw-0.068
iles-0.058
ups-0.052
ing-0.052
una-0.051
Token him
Token ID683
Feature activation-1.39e-03

Pos logit contributions
 spoiler+0.028
 sponge+0.026
ro+0.025
 cloth+0.025
 ramp+0.025
Neg logit contributions
aw-0.018
iles-0.014
ese-0.013
ups-0.013
ils-0.012
Token long
Token ID890
Feature activation+3.80e-02

Pos logit contributions
aw+0.002
iles+0.001
ese+0.001
ups+0.001
una+0.001
Neg logit contributions
 spoiler-0.003
 sponge-0.003
 ramp-0.003
ro-0.003
 allowance-0.003
Token ago
Token ID2084
Feature activation+1.34e-01

Pos logit contributions
 spoiler+0.072
 sponge+0.061
 ramp+0.060
 widened+0.057
 prope+0.056
Neg logit contributions
aw-0.069
iles-0.059
ups-0.055
ing-0.054
ils-0.054
Token.
Token ID13
Feature activation+7.43e-04

Pos logit contributions
 spoiler+0.335
 ramp+0.286
 sponge+0.285
 prope+0.277
 widened+0.275
Neg logit contributions
aw-0.170
ups-0.132
iles-0.127
ing-0.123
ow-0.120
Token The
Token ID383
Feature activation+7.68e-03

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
 prope+0.001
ro+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ow-0.001
Token bird
Token ID6512
Feature activation-1.06e-02

Pos logit contributions
 spoiler+0.015
 sponge+0.012
 ramp+0.012
ro+0.011
 prope+0.011
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ing-0.011
ils-0.011
Token spread
Token ID4104
Feature activation+4.42e-03

Pos logit contributions
aw+0.020
iles+0.017
ups+0.016
ing+0.015
una+0.015
Neg logit contributions
 spoiler-0.019
 sponge-0.017
 ramp-0.016
 widened-0.016
ro-0.016
Token out
Token ID503
Feature activation-1.30e-02

Pos logit contributions
 spoiler+0.009
 sponge+0.008
 ramp+0.008
 widened+0.007
ro+0.007
Neg logit contributions
aw-0.007
iles-0.006
ups-0.006
ese-0.006
ils-0.006
Token ones
Token ID3392
Feature activation-4.72e-03

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
 prope+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
 me-0.001
ing-0.001
ups-0.001
Token are
Token ID389
Feature activation+1.82e-02

Pos logit contributions
aw+0.008
iles+0.007
ups+0.006
ese+0.006
una+0.006
Neg logit contributions
 spoiler-0.009
 sponge-0.008
 ramp-0.008
ro-0.008
 cloth-0.007
Token yellow
Token ID7872
Feature activation+8.83e-03

Pos logit contributions
 spoiler+0.033
 sponge+0.030
 ramp+0.028
 allowance+0.027
 cloth+0.026
Neg logit contributions
aw-0.034
iles-0.028
ing-0.027
ups-0.027
ow-0.026
Token and
Token ID290
Feature activation+4.64e-02

Pos logit contributions
 spoiler+0.020
 ramp+0.017
 widened+0.017
 sponge+0.017
 prope+0.017
Neg logit contributions
aw-0.014
iles-0.011
ing-0.011
ups-0.010
 me-0.010
Token the
Token ID262
Feature activation+3.98e-02

Pos logit contributions
 spoiler+0.085
 ramp+0.074
ge+0.074
ro+0.071
 tant+0.070
Neg logit contributions
aw-0.081
 me-0.073
iles-0.070
 us-0.069
 that-0.065
Token un
Token ID555
Feature activation+1.32e-01

Pos logit contributions
 spoiler+0.074
 sponge+0.063
 ramp+0.061
ro+0.058
 prope+0.057
Neg logit contributions
aw-0.074
iles-0.064
ups-0.060
ing-0.059
ils-0.058
Tokenri
Token ID380
Feature activation+6.48e-02

Pos logit contributions
 spoiler+0.243
 sponge+0.212
 ramp+0.201
 widened+0.201
 prope+0.197
Neg logit contributions
aw-0.253
iles-0.212
ups-0.204
ow-0.197
ese-0.195
Tokenpe
Token ID431
Feature activation+2.91e-02

Pos logit contributions
 spoiler+0.119
 sponge+0.101
 ramp+0.099
 widened+0.093
 prope+0.092
Neg logit contributions
aw-0.123
iles-0.106
ups-0.099
ing-0.097
ils-0.096
Token ones
Token ID3392
Feature activation+1.43e-02

Pos logit contributions
 spoiler+0.058
 sponge+0.049
 ramp+0.049
 prope+0.046
ro+0.046
Neg logit contributions
aw-0.050
iles-0.043
ups-0.040
ing-0.039
ils-0.039
Token are
Token ID389
Feature activation+2.66e-02

Pos logit contributions
 spoiler+0.028
 sponge+0.025
 ramp+0.024
ro+0.023
 widened+0.023
Neg logit contributions
aw-0.024
iles-0.020
ups-0.018
ing-0.018
una-0.018
Token green
Token ID4077
Feature activation+2.24e-02

Pos logit contributions
 spoiler+0.051
 sponge+0.044
 ramp+0.043
ro+0.041
 prope+0.041
Neg logit contributions
aw-0.047
iles-0.040
ups-0.037
ing-0.037
ils-0.036
Token rope
Token ID17182
Feature activation-9.05e-03

Pos logit contributions
aw+0.034
iles+0.032
ils+0.031
ow+0.031
ups+0.030
Neg logit contributions
 spoiler-0.030
 ramp-0.028
 sponge-0.026
 cloth-0.025
 prope-0.024
Token bouncing
Token ID30713
Feature activation+5.82e-03

Pos logit contributions
aw+0.014
iles+0.012
ups+0.011
ing+0.011
ils+0.011
Neg logit contributions
 spoiler-0.019
 sponge-0.017
 ramp-0.016
ro-0.016
 widened-0.016
Token.
Token ID13
Feature activation-5.83e-03

Pos logit contributions
 spoiler+0.014
 sponge+0.012
 ramp+0.012
 prope+0.011
 widened+0.011
Neg logit contributions
aw-0.008
iles-0.006
ing-0.006
ups-0.006
ils-0.005
Token The
Token ID383
Feature activation-1.22e-03

Pos logit contributions
aw+0.013
iles+0.011
ups+0.011
ing+0.011
ils+0.011
Neg logit contributions
 spoiler-0.009
 sponge-0.007
 ramp-0.007
ro-0.007
 widened-0.007
Token town
Token ID3240
Feature activation+7.62e-02

Pos logit contributions
aw+0.003
iles+0.002
ups+0.002
ow+0.002
ils+0.002
Neg logit contributions
 spoiler-0.002
 sponge-0.002
 ramp-0.002
ro-0.001
 prope-0.001
Tokenspe
Token ID4125
Feature activation+1.31e-01

Pos logit contributions
 spoiler+0.140
 sponge+0.119
 ramp+0.116
ro+0.109
 widened+0.109
Neg logit contributions
aw-0.143
iles-0.124
ups-0.116
ing-0.114
ils-0.112
Tokenople
Token ID643
Feature activation+5.16e-02

Pos logit contributions
 spoiler+0.279
 sponge+0.235
 ramp+0.234
ro+0.226
 widened+0.223
Neg logit contributions
aw-0.201
iles-0.176
ups-0.160
una-0.158
ing-0.155
Token gathered
Token ID9272
Feature activation-1.61e-02

Pos logit contributions
 spoiler+0.092
 sponge+0.078
 ramp+0.076
 widened+0.072
ro+0.072
Neg logit contributions
aw-0.098
iles-0.086
ups-0.081
ing-0.078
una-0.078
Token around
Token ID1088
Feature activation-1.54e-02

Pos logit contributions
aw+0.022
iles+0.020
ups+0.018
una+0.017
ese+0.017
Neg logit contributions
 spoiler-0.036
 sponge-0.032
 ramp-0.031
 widened-0.030
 prope-0.030
Token to
Token ID284
Feature activation-6.84e-03

Pos logit contributions
aw+0.028
iles+0.025
ups+0.023
ese+0.022
una+0.022
Neg logit contributions
 spoiler-0.029
 sponge-0.025
 ramp-0.024
ro-0.023
 widened-0.023
Token watch
Token ID2342
Feature activation-4.14e-02

Pos logit contributions
aw+0.010
 me+0.009
iles+0.008
 us+0.008
ow+0.008
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 allowance-0.012
 tant-0.012
 ramp-0.012
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
aw+0.022
iles+0.017
ups+0.016
ing+0.016
ow+0.015
Neg logit contributions
 spoiler-0.040
 sponge-0.035
 ramp-0.034
ro-0.033
 prope-0.033
Token It
Token ID632
Feature activation+4.27e-03

Pos logit contributions
 spoiler+0.020
ge+0.018
 ramp+0.017
 sponge+0.017
 prope+0.016
Neg logit contributions
aw-0.020
iles-0.016
 me-0.015
una-0.015
ow-0.015
Token is
Token ID318
Feature activation+2.18e-03

Pos logit contributions
 spoiler+0.008
 ramp+0.007
 sponge+0.006
 widened+0.006
ro+0.006
Neg logit contributions
aw-0.008
iles-0.007
ups-0.007
ing-0.007
ils-0.006
Token the
Token ID262
Feature activation+4.28e-02

Pos logit contributions
 spoiler+0.004
 sponge+0.004
 ramp+0.003
 prope+0.003
 flat+0.003
Neg logit contributions
aw-0.004
iles-0.003
ils-0.003
ups-0.003
ing-0.003
Token door
Token ID3420
Feature activation+7.35e-02

Pos logit contributions
 spoiler+0.078
 ramp+0.068
 sponge+0.067
 prope+0.062
 cloth+0.060
Neg logit contributions
aw-0.084
ils-0.072
iles-0.070
ups-0.069
ing-0.069
Tokenbell
Token ID7923
Feature activation+1.31e-01

Pos logit contributions
 spoiler+0.138
 sponge+0.116
 ramp+0.112
 prope+0.109
ro+0.109
Neg logit contributions
aw-0.135
iles-0.112
ups-0.111
ing-0.109
ow-0.107
Token.
Token ID13
Feature activation+2.43e-03

Pos logit contributions
 spoiler+0.277
 sponge+0.241
 ramp+0.236
ro+0.224
 widened+0.222
Neg logit contributions
aw-0.212
iles-0.176
ups-0.163
ing-0.160
ow-0.156
Token Mom
Token ID11254
Feature activation+9.84e-04

Pos logit contributions
 spoiler+0.005
ge+0.004
 ramp+0.004
 sponge+0.004
 prope+0.004
Neg logit contributions
aw-0.004
iles-0.003
 me-0.003
una-0.003
ow-0.003
Tokenmy
Token ID1820
Feature activation+9.12e-03

Pos logit contributions
 spoiler+0.002
 ramp+0.001
 sponge+0.001
 widened+0.001
 pret+0.001
Neg logit contributions
aw-0.002
iles-0.002
una-0.002
ils-0.002
ups-0.002
Token is
Token ID318
Feature activation+6.60e-03

Pos logit contributions
 spoiler+0.015
 ramp+0.014
 sponge+0.013
 widened+0.013
 pret+0.013
Neg logit contributions
aw-0.017
iles-0.015
ups-0.014
ing-0.014
una-0.014
Token home
Token ID1363
Feature activation+3.98e-02

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 widened+0.010
 prope+0.009
Neg logit contributions
aw-0.012
iles-0.011
ils-0.010
una-0.010
ing-0.010
Token her
Token ID607
Feature activation+9.31e-03

Pos logit contributions
 spoiler+0.049
 sponge+0.044
 ramp+0.042
 widened+0.041
ro+0.041
Neg logit contributions
aw-0.028
iles-0.023
ups-0.021
ing-0.020
ese-0.020
Token that
Token ID326
Feature activation+8.15e-03

Pos logit contributions
 spoiler+0.025
 sponge+0.022
 ramp+0.022
ro+0.021
 widened+0.021
Neg logit contributions
aw-0.010
iles-0.008
ups-0.007
ing-0.007
ils-0.006
Token a
Token ID257
Feature activation+3.12e-02

Pos logit contributions
 spoiler+0.016
 sponge+0.014
 ramp+0.014
ro+0.013
 widened+0.013
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ing-0.011
ils-0.011
Token long
Token ID890
Feature activation+4.49e-02

Pos logit contributions
 spoiler+0.053
 sponge+0.044
 ramp+0.043
 widened+0.041
ro+0.040
Neg logit contributions
aw-0.063
iles-0.055
ups-0.052
ing-0.051
una-0.050
Token time
Token ID640
Feature activation+2.54e-02

Pos logit contributions
 spoiler+0.086
 widened+0.071
 sponge+0.071
 ramp+0.071
rily+0.067
Neg logit contributions
aw-0.080
iles-0.067
 me-0.065
ow-0.064
una-0.064
Token ago
Token ID2084
Feature activation+1.31e-01

Pos logit contributions
 spoiler+0.055
 sponge+0.048
 ramp+0.047
ro+0.045
 widened+0.045
Neg logit contributions
aw-0.039
iles-0.033
ups-0.030
ing-0.029
una-0.029
Token,
Token ID11
Feature activation+1.62e-03

Pos logit contributions
 spoiler+0.319
 ramp+0.274
 sponge+0.266
 prope+0.262
 widened+0.259
Neg logit contributions
aw-0.182
ups-0.134
iles-0.133
ow-0.132
ing-0.130
Token someone
Token ID2130
Feature activation+2.60e-02

Pos logit contributions
 spoiler+0.004
 ramp+0.003
 sponge+0.003
ro+0.003
ge+0.003
Neg logit contributions
aw-0.003
 me-0.002
iles-0.002
 at-0.002
 us-0.002
Token made
Token ID925
Feature activation+2.85e-02

Pos logit contributions
 spoiler+0.038
 sponge+0.038
 allowance+0.036
 cloth+0.035
 ramp+0.035
Neg logit contributions
aw-0.049
iles-0.044
una-0.040
ese-0.040
ani-0.039
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
 spoiler+0.043
 sponge+0.040
 cloth+0.039
 allowance+0.036
 glass+0.036
Neg logit contributions
aw-0.052
iles-0.050
wl-0.045
ese-0.045
ani-0.043
Token mark
Token ID1317
Feature activation+6.67e-03

Pos logit contributions
 spoiler+0.036
 ramp+0.031
 sponge+0.030
ro+0.028
 prope+0.028
Neg logit contributions
aw-0.041
iles-0.037
ups-0.035
ils-0.035
ing-0.034
Token
Token ID220
Feature activation-7.99e-03

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.004
 widened+0.004
Neg logit contributions
aw-0.009
iles-0.009
ils-0.008
ese-0.008
una-0.008
Token
Token ID198
Feature activation+3.76e-03

Pos logit contributions
aw+0.015
iles+0.013
ils+0.012
ups+0.012
ese+0.012
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 ramp-0.012
ro-0.011
 widened-0.011
Token
Token ID198
Feature activation+6.58e-03

Pos logit contributions
 spoiler+0.007
 ramp+0.006
 sponge+0.006
ro+0.005
 widened+0.005
Neg logit contributions
aw-0.007
iles-0.006
ils-0.006
ups-0.006
ese-0.006
TokenThe
Token ID464
Feature activation-6.05e-03

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 widened+0.009
ro+0.009
Neg logit contributions
aw-0.012
iles-0.011
ow-0.010
ing-0.010
ese-0.010
Token town
Token ID3240
Feature activation+7.93e-02

Pos logit contributions
aw+0.010
iles+0.008
ups+0.008
ing+0.008
ils+0.007
Neg logit contributions
 spoiler-0.013
 sponge-0.011
 ramp-0.011
ro-0.010
 widened-0.010
Tokenspe
Token ID4125
Feature activation+1.31e-01

Pos logit contributions
 spoiler+0.128
 sponge+0.106
 ramp+0.105
ro+0.098
 widened+0.097
Neg logit contributions
aw-0.161
iles-0.146
una-0.136
ese-0.132
ils-0.131
Tokenople
Token ID643
Feature activation+5.19e-02

Pos logit contributions
 spoiler+0.280
 sponge+0.238
 ramp+0.236
ro+0.226
 widened+0.225
Neg logit contributions
aw-0.202
iles-0.173
ups-0.158
una-0.155
ing-0.153
Token thought
Token ID1807
Feature activation+5.70e-03

Pos logit contributions
 spoiler+0.089
 sponge+0.077
 widened+0.075
 ramp+0.073
 prope+0.070
Neg logit contributions
aw-0.098
iles-0.088
una-0.082
ups-0.082
ese-0.079
Token and
Token ID290
Feature activation+4.80e-03

Pos logit contributions
 spoiler+0.014
 sponge+0.012
 ramp+0.012
 widened+0.012
ro+0.012
Neg logit contributions
aw-0.007
iles-0.006
ups-0.005
ese-0.005
ing-0.005
Token thought
Token ID1807
Feature activation-4.06e-03

Pos logit contributions
 spoiler+0.007
 sponge+0.006
 widened+0.006
 ramp+0.006
 prope+0.006
Neg logit contributions
aw-0.010
iles-0.009
ups-0.009
una-0.008
ing-0.008
Token until
Token ID1566
Feature activation-3.33e-02

Pos logit contributions
aw+0.005
iles+0.004
ing+0.003
ups+0.003
ils+0.003
Neg logit contributions
 spoiler-0.010
 sponge-0.009
 ramp-0.009
ro-0.009
 prope-0.009
Token!
Token ID0
Feature activation-2.74e-02

Pos logit contributions
aw+0.048
iles+0.041
ups+0.037
ing+0.036
ils+0.036
Neg logit contributions
 spoiler-0.061
 sponge-0.053
 ramp-0.052
ro-0.050
 widened-0.049
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
aw+0.049
iles+0.042
ups+0.039
ing+0.039
ils+0.038
Neg logit contributions
 spoiler-0.053
 sponge-0.045
 ramp-0.044
ro-0.042
 widened-0.042
Token
Token ID198
Feature activation-4.72e-03

Pos logit contributions
aw+0.021
iles+0.018
ups+0.016
ils+0.016
ing+0.016
Neg logit contributions
 spoiler-0.022
 sponge-0.019
 ramp-0.019
ro-0.018
 widened-0.018
TokenThe
Token ID464
Feature activation-8.71e-03

Pos logit contributions
aw+0.009
iles+0.008
ups+0.007
ing+0.007
ils+0.007
Neg logit contributions
 spoiler-0.009
 sponge-0.007
 ramp-0.007
ro-0.007
 widened-0.007
Token town
Token ID3240
Feature activation+8.01e-02

Pos logit contributions
aw+0.015
ups+0.012
iles+0.012
ils+0.012
ow+0.012
Neg logit contributions
 spoiler-0.018
 ramp-0.016
 prope-0.015
 sponge-0.015
 allowance-0.014
Tokenspe
Token ID4125
Feature activation+1.31e-01

Pos logit contributions
 spoiler+0.123
 sponge+0.100
 ramp+0.099
ro+0.091
 widened+0.091
Neg logit contributions
aw-0.172
iles-0.154
ups-0.143
una-0.142
ils-0.141
Tokenople
Token ID643
Feature activation+4.95e-02

Pos logit contributions
 spoiler+0.276
 sponge+0.232
 ramp+0.232
ro+0.223
 widened+0.221
Neg logit contributions
aw-0.202
iles-0.175
una-0.161
ups-0.161
ing-0.155
Token thanked
Token ID26280
Feature activation+1.24e-02

Pos logit contributions
 spoiler+0.090
 sponge+0.076
 ramp+0.074
ro+0.070
 widened+0.069
Neg logit contributions
aw-0.094
iles-0.081
ups-0.076
ing-0.075
ils-0.074
Token the
Token ID262
Feature activation+1.81e-03

Pos logit contributions
 spoiler+0.025
 ramp+0.021
 sponge+0.020
 widened+0.020
 prope+0.020
Neg logit contributions
aw-0.021
iles-0.018
ups-0.018
ing-0.016
ils-0.016
Token hero
Token ID4293
Feature activation+8.84e-03

Pos logit contributions
 spoiler+0.003
 ramp+0.003
 prope+0.003
 sponge+0.003
 magnet+0.003
Neg logit contributions
aw-0.003
iles-0.003
ils-0.003
ups-0.003
ow-0.003
Token for
Token ID329
Feature activation-4.62e-02

Pos logit contributions
 spoiler+0.018
 ramp+0.016
 sponge+0.016
ro+0.016
 allowance+0.015
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ese-0.010
ils-0.010
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
aw+0.056
iles+0.049
ups+0.044
ese+0.043
ils+0.043
Neg logit contributions
 spoiler-0.062
 sponge-0.054
ro-0.052
 ramp-0.052
 widened-0.051
Token
Token ID198
Feature activation-3.95e-04

Pos logit contributions
 spoiler+0.003
 sponge+0.003
 ramp+0.003
ro+0.003
 widened+0.003
Neg logit contributions
aw-0.004
iles-0.003
ups-0.003
ils-0.003
ing-0.003
Token
Token ID198
Feature activation+2.76e-03

Pos logit contributions
aw+0.001
iles+0.001
ups+0.001
ils+0.001
ese+0.001
Neg logit contributions
 spoiler-0.001
 ramp-0.001
 sponge-0.001
 widened-0.001
ro-0.001
TokenThe
Token ID464
Feature activation-7.19e-03

Pos logit contributions
 spoiler+0.005
 sponge+0.004
 ramp+0.004
 widened+0.004
 prope+0.004
Neg logit contributions
aw-0.005
iles-0.004
ow-0.004
ing-0.004
ups-0.004
Token town
Token ID3240
Feature activation+7.98e-02

Pos logit contributions
aw+0.011
iles+0.009
ups+0.008
ing+0.008
ils+0.008
Neg logit contributions
 spoiler-0.016
 sponge-0.014
 ramp-0.014
ro-0.013
 widened-0.013
Tokenspe
Token ID4125
Feature activation+1.30e-01

Pos logit contributions
 spoiler+0.126
 sponge+0.102
 ramp+0.101
 prope+0.095
 widened+0.095
Neg logit contributions
aw-0.166
iles-0.149
una-0.140
ups-0.137
ils-0.136
Tokenople
Token ID643
Feature activation+5.29e-02

Pos logit contributions
 spoiler+0.276
 sponge+0.233
 ramp+0.231
ro+0.222
 widened+0.221
Neg logit contributions
aw-0.202
iles-0.175
una-0.160
ups-0.159
ing-0.154
Token were
Token ID547
Feature activation-2.37e-03

Pos logit contributions
 spoiler+0.100
 sponge+0.087
 widened+0.083
 ramp+0.083
 prope+0.082
Neg logit contributions
aw-0.092
iles-0.080
ups-0.076
una-0.073
ing-0.072
Token so
Token ID523
Feature activation+2.09e-02

Pos logit contributions
aw+0.004
iles+0.004
una+0.004
ils+0.004
ups+0.004
Neg logit contributions
 spoiler-0.004
 prope-0.003
 widened-0.003
 sponge-0.003
 ramp-0.003
Token happy
Token ID3772
Feature activation-2.05e-02

Pos logit contributions
 spoiler+0.031
 sponge+0.025
 widened+0.024
 ramp+0.024
 prope+0.023
Neg logit contributions
aw-0.046
iles-0.041
ils-0.039
ups-0.038
una-0.038
Token they
Token ID484
Feature activation-8.31e-03

Pos logit contributions
aw+0.035
iles+0.032
ese+0.030
una+0.030
ils+0.030
Neg logit contributions
 spoiler-0.033
 sponge-0.030
 ramp-0.030
ro-0.030
 widened-0.029
Token know
Token ID760
Feature activation-2.11e-02

Pos logit contributions
aw+0.024
ing+0.021
iles+0.020
ocked+0.020
ups+0.020
Neg logit contributions
 spoiler-0.020
 sponge-0.016
ro-0.016
 ramp-0.015
 cloth-0.015
Token you
Token ID345
Feature activation-2.08e-02

Pos logit contributions
aw+0.034
iles+0.031
ups+0.028
ils+0.028
una+0.027
Neg logit contributions
 spoiler-0.042
 sponge-0.036
 ramp-0.035
 prope-0.033
 widened-0.033
Token did
Token ID750
Feature activation-2.66e-02

Pos logit contributions
aw+0.037
iles+0.032
ups+0.030
ing+0.030
una+0.029
Neg logit contributions
 spoiler-0.039
 sponge-0.033
 ramp-0.032
 widened-0.031
ro-0.031
Token not
Token ID407
Feature activation-2.52e-02

Pos logit contributions
aw+0.046
iles+0.041
ing+0.037
ups+0.036
una+0.035
Neg logit contributions
 spoiler-0.051
 sponge-0.044
 ramp-0.043
 allowance-0.041
 widened-0.041
Token like
Token ID588
Feature activation-6.05e-02

Pos logit contributions
aw+0.055
ing+0.046
iles+0.046
ocked+0.044
ils+0.044
Neg logit contributions
 spoiler-0.035
ro-0.030
 sponge-0.030
ge-0.028
 cloth-0.027
Token le
Token ID443
Feature activation-1.31e-01

Pos logit contributions
aw+0.104
iles+0.090
ups+0.083
ing+0.082
ow+0.081
Neg logit contributions
 spoiler-0.116
 sponge-0.103
 ramp-0.097
 allowance-0.094
 cloth-0.094
Tokenmons
Token ID11567
Feature activation-8.58e-02

Pos logit contributions
aw+0.233
 me+0.229
iles+0.221
 us+0.212
 opera+0.211
Neg logit contributions
ro-0.184
 spoiler-0.170
 sponge-0.156
 allowance-0.152
ge-0.150
Token.
Token ID13
Feature activation-6.09e-04

Pos logit contributions
aw+0.151
iles+0.135
ups+0.122
ow+0.122
ani+0.120
Neg logit contributions
 spoiler-0.151
 sponge-0.135
ro-0.130
 ramp-0.128
 allowance-0.122
Token We
Token ID775
Feature activation+3.05e-03

Pos logit contributions
aw+0.001
iles+0.001
ups+0.001
ow+0.001
ing+0.001
Neg logit contributions
 spoiler-0.001
 ramp-0.001
 sponge-0.001
ge-0.001
 widened-0.001
Token wanted
Token ID2227
Feature activation+1.79e-02

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.005
 widened+0.005
Neg logit contributions
aw-0.005
iles-0.004
ups-0.004
ing-0.004
ils-0.004
Token to
Token ID284
Feature activation-1.43e-02

Pos logit contributions
 spoiler+0.036
 sponge+0.034
 allowance+0.032
 cloth+0.031
 flat+0.030
Neg logit contributions
aw-0.027
iles-0.024
ani-0.022
una-0.021
ils-0.021
Token on
Token ID319
Feature activation-4.84e-04

Pos logit contributions
aw+0.038
iles+0.037
ups+0.036
ils+0.034
ese+0.034
Neg logit contributions
 spoiler-0.025
kin-0.022
 sponge-0.021
 flat-0.020
driver-0.019
Token,
Token ID11
Feature activation-2.76e-02

Pos logit contributions
aw+0.001
iles+0.001
ups+0.001
ing+0.001
ils+0.001
Neg logit contributions
 spoiler-0.001
 sponge-0.001
 ramp-0.001
ro-0.001
 widened-0.001
Token Sue
Token ID26089
Feature activation-2.77e-02

Pos logit contributions
aw+0.057
iles+0.053
ils+0.050
ing+0.049
ani+0.049
Neg logit contributions
 spoiler-0.035
 sponge-0.031
 allowance-0.030
ro-0.030
 cloth-0.028
Token always
Token ID1464
Feature activation-2.62e-02

Pos logit contributions
aw+0.050
iles+0.045
ing+0.041
ups+0.040
ow+0.039
Neg logit contributions
 spoiler-0.047
 sponge-0.041
ro-0.040
 ramp-0.040
 widened-0.039
Token had
Token ID550
Feature activation-2.85e-02

Pos logit contributions
aw+0.047
iles+0.040
ups+0.037
ing+0.036
una+0.036
Neg logit contributions
 spoiler-0.051
 sponge-0.044
 ramp-0.042
ro-0.041
 widened-0.040
Token le
Token ID443
Feature activation-1.26e-01

Pos logit contributions
aw+0.050
iles+0.049
ing+0.044
ting+0.043
ani+0.042
Neg logit contributions
 spoiler-0.046
 sponge-0.045
 cloth-0.043
 allowance-0.039
 flat-0.039
Tokenmons
Token ID11567
Feature activation-9.67e-02

Pos logit contributions
aw+0.210
 me+0.203
iles+0.202
 us+0.198
 opera+0.194
Neg logit contributions
 spoiler-0.194
ro-0.194
 sponge-0.182
 allowance-0.162
driver-0.160
Token in
Token ID287
Feature activation+2.67e-02

Pos logit contributions
aw+0.157
iles+0.142
ani+0.121
una+0.120
ow+0.119
Neg logit contributions
 spoiler-0.178
 sponge-0.169
ro-0.162
 cloth-0.153
 ramp-0.152
Token her
Token ID607
Feature activation+3.59e-03

Pos logit contributions
 spoiler+0.070
 ramp+0.062
 sponge+0.061
 widened+0.059
 prope+0.059
Neg logit contributions
aw-0.029
iles-0.024
 me-0.021
ups-0.021
ow-0.020
Token organized
Token ID8389
Feature activation+5.94e-02

Pos logit contributions
 spoiler+0.005
 sponge+0.004
 ramp+0.004
 widened+0.004
ro+0.004
Neg logit contributions
aw-0.008
iles-0.007
ups-0.007
ing-0.007
ils-0.007
Token fruit
Token ID8234
Feature activation-5.86e-02

Pos logit contributions
 spoiler+0.110
 sponge+0.091
 ramp+0.090
 widened+0.085
 prope+0.085
Neg logit contributions
aw-0.113
iles-0.096
ups-0.092
ils-0.089
ing-0.089
Token to
Token ID284
Feature activation-2.69e-03

Pos logit contributions
 spoiler+0.026
 sponge+0.025
 cloth+0.023
 ramp+0.023
 allowance+0.023
Neg logit contributions
aw-0.021
iles-0.019
ese-0.017
una-0.016
ils-0.016
Token climb
Token ID12080
Feature activation-3.76e-02

Pos logit contributions
aw+0.004
iles+0.004
ow+0.004
 me+0.004
 us+0.004
Neg logit contributions
 spoiler-0.006
 sponge-0.005
 tant-0.004
 ramp-0.004
 widened-0.004
Token safely
Token ID11512
Feature activation-8.73e-03

Pos logit contributions
aw+0.053
iles+0.042
ing+0.039
ups+0.039
 me+0.037
Neg logit contributions
 spoiler-0.089
 sponge-0.078
 ramp-0.075
ro-0.072
 widened-0.072
Token and
Token ID290
Feature activation-2.84e-02

Pos logit contributions
aw+0.014
iles+0.012
ups+0.011
ing+0.011
ils+0.010
Neg logit contributions
 spoiler-0.018
 sponge-0.016
 ramp-0.016
 prope-0.015
ro-0.015
Token use
Token ID779
Feature activation-3.94e-02

Pos logit contributions
aw+0.054
iles+0.047
ups+0.044
ing+0.043
ils+0.043
Neg logit contributions
 spoiler-0.051
 sponge-0.043
 ramp-0.043
ro-0.040
 widened-0.040
Token lot
Token ID1256
Feature activation-1.26e-01

Pos logit contributions
aw+0.052
iles+0.041
ups+0.038
ing+0.036
ils+0.036
Neg logit contributions
 spoiler-0.095
 sponge-0.083
 ramp-0.083
ro-0.079
 widened-0.079
Tokenion
Token ID295
Feature activation-7.23e-02

Pos logit contributions
aw+0.187
iles+0.158
ese+0.151
ils+0.144
ing+0.138
Neg logit contributions
 spoiler-0.260
 sponge-0.237
ro-0.230
 ramp-0.225
 cloth-0.221
Token when
Token ID618
Feature activation-4.10e-02

Pos logit contributions
aw+0.105
iles+0.092
una+0.080
ese+0.079
 lions+0.072
Neg logit contributions
 spoiler-0.145
 sponge-0.140
ro-0.136
 ramp-0.130
 cloth-0.130
Token she
Token ID673
Feature activation-2.67e-02

Pos logit contributions
aw+0.070
iles+0.060
ese+0.054
una+0.053
ups+0.053
Neg logit contributions
 spoiler-0.080
 sponge-0.071
 ramp-0.069
 allowance-0.066
ro-0.066
Token got
Token ID1392
Feature activation-8.98e-03

Pos logit contributions
aw+0.048
iles+0.041
ups+0.039
ing+0.038
ils+0.037
Neg logit contributions
 spoiler-0.051
 sponge-0.044
 ramp-0.043
ro-0.040
 widened-0.040
Token a
Token ID257
Feature activation-1.50e-03

Pos logit contributions
aw+0.014
iles+0.012
una+0.010
ils+0.010
ups+0.010
Neg logit contributions
 spoiler-0.019
 sponge-0.017
 ramp-0.017
 widened-0.016
ro-0.016
Token?
Token ID30
Feature activation+1.28e-02

Pos logit contributions
 spoiler+0.000
 sponge+0.000
 ramp+0.000
 widened+0.000
 prope+0.000
Neg logit contributions
aw-0.000
iles-0.000
ups-0.000
ing-0.000
ow-0.000
Token I
Token ID314
Feature activation+2.44e-02

Pos logit contributions
 spoiler+0.024
 widened+0.021
 sponge+0.021
 ramp+0.021
ge+0.021
Neg logit contributions
aw-0.022
ow-0.018
 me-0.018
iles-0.017
ing-0.017
Token want
Token ID765
Feature activation+3.16e-03

Pos logit contributions
 spoiler+0.054
 ramp+0.044
 prope+0.044
bes+0.043
 sponge+0.043
Neg logit contributions
aw-0.037
ow-0.032
iles-0.032
 at-0.030
 me-0.030
Token to
Token ID284
Feature activation-1.15e-02

Pos logit contributions
 spoiler+0.006
 sponge+0.006
 cloth+0.005
 allowance+0.005
 ramp+0.005
Neg logit contributions
aw-0.005
iles-0.005
ese-0.004
ils-0.004
ani-0.004
Token spell
Token ID4822
Feature activation-8.06e-02

Pos logit contributions
aw+0.019
 me+0.017
iles+0.016
ow+0.015
 us+0.015
Neg logit contributions
 spoiler-0.023
 sponge-0.019
 ramp-0.019
 widened-0.019
bes-0.018
Token snake
Token ID17522
Feature activation-1.24e-01

Pos logit contributions
aw+0.167
iles+0.151
ups+0.141
ese+0.140
ils+0.138
Neg logit contributions
 spoiler-0.127
 sponge-0.101
 ramp-0.099
 prope-0.095
ro-0.094
Token."
Token ID526
Feature activation+3.00e-02

Pos logit contributions
aw+0.209
iles+0.180
ups+0.176
ocked+0.171
ils+0.164
Neg logit contributions
 spoiler-0.224
 ramp-0.202
ro-0.195
 sponge-0.191
 pret-0.182
Token Lily
Token ID20037
Feature activation+1.46e-02

Pos logit contributions
 spoiler+0.054
 sponge+0.045
 ramp+0.043
 prope+0.043
 widened+0.041
Neg logit contributions
aw-0.059
iles-0.049
ups-0.048
 me-0.047
una-0.046
Token says
Token ID1139
Feature activation-2.63e-02

Pos logit contributions
 spoiler+0.028
 ramp+0.024
 sponge+0.024
 widened+0.023
 pret+0.023
Neg logit contributions
aw-0.024
iles-0.021
ups-0.020
ing-0.020
ils-0.020
Token,
Token ID11
Feature activation+1.54e-03

Pos logit contributions
aw+0.054
iles+0.050
ups+0.045
ese+0.045
ils+0.044
Neg logit contributions
 spoiler-0.042
 sponge-0.037
 ramp-0.033
driver-0.033
 cloth-0.033
Token "
Token ID366
Feature activation-1.07e-02

Pos logit contributions
 spoiler+0.003
 sponge+0.003
 ramp+0.003
ro+0.002
 widened+0.002
Neg logit contributions
aw-0.003
iles-0.002
ups-0.002
ing-0.002
ils-0.002
Token she
Token ID673
Feature activation-1.51e-02

Pos logit contributions
aw+0.073
iles+0.063
ups+0.056
ese+0.055
ils+0.054
Neg logit contributions
 spoiler-0.095
 sponge-0.083
 ramp-0.081
ro-0.078
 allowance-0.078
Token still
Token ID991
Feature activation-2.03e-02

Pos logit contributions
aw+0.025
iles+0.022
ups+0.020
ing+0.019
una+0.019
Neg logit contributions
 spoiler-0.029
 sponge-0.025
 ramp-0.025
 widened-0.024
 prope-0.024
Token loved
Token ID6151
Feature activation-1.23e-02

Pos logit contributions
aw+0.032
iles+0.026
ow+0.025
ing+0.025
 me+0.024
Neg logit contributions
 spoiler-0.043
 sponge-0.037
 ramp-0.037
ro-0.035
 allowance-0.034
Token to
Token ID284
Feature activation-1.79e-02

Pos logit contributions
aw+0.021
iles+0.017
ese+0.016
ups+0.016
uck+0.016
Neg logit contributions
 spoiler-0.021
 sponge-0.020
 prope-0.020
 cloth-0.019
 pret-0.019
Token eat
Token ID4483
Feature activation-5.14e-02

Pos logit contributions
aw+0.033
iles+0.028
ups+0.026
ing+0.026
ils+0.025
Neg logit contributions
 spoiler-0.034
 sponge-0.029
 ramp-0.028
ro-0.027
 widened-0.027
Token pepper
Token ID13385
Feature activation-1.23e-01

Pos logit contributions
aw+0.085
iles+0.073
ups+0.068
ing+0.065
ow+0.065
Neg logit contributions
 spoiler-0.103
 sponge-0.090
 ramp-0.088
 widened-0.083
ro-0.083
Token on
Token ID319
Feature activation+1.12e-02

Pos logit contributions
aw+0.217
iles+0.189
ups+0.172
ow+0.171
una+0.171
Neg logit contributions
 spoiler-0.222
 sponge-0.198
 ramp-0.191
ro-0.188
 widened-0.183
Token her
Token ID607
Feature activation-5.20e-03

Pos logit contributions
 spoiler+0.025
 sponge+0.021
 ramp+0.021
ro+0.020
 prope+0.020
Neg logit contributions
aw-0.017
iles-0.014
ups-0.013
ing-0.013
ils-0.012
Token food
Token ID2057
Feature activation-2.95e-02

Pos logit contributions
aw+0.010
iles+0.008
ups+0.008
ese+0.008
 me+0.007
Neg logit contributions
 spoiler-0.010
 sponge-0.008
 ramp-0.008
 widened-0.008
ro-0.008
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
aw+0.050
iles+0.043
ups+0.039
ing+0.039
ils+0.038
Neg logit contributions
 spoiler-0.059
 sponge-0.051
 ramp-0.050
ro-0.048
 widened-0.047
Token just
Token ID655
Feature activation-5.51e-02

Pos logit contributions
 spoiler+0.023
 ramp+0.019
 sponge+0.019
ro+0.018
 widened+0.018
Neg logit contributions
aw-0.019
iles-0.016
ups-0.015
ing-0.014
 me-0.014
Token.
Token ID13
Feature activation-1.37e-03

Pos logit contributions
 spoiler+0.043
 sponge+0.038
 ramp+0.037
 widened+0.035
ro+0.035
Neg logit contributions
aw-0.027
iles-0.022
ups-0.020
ing-0.020
ow-0.019
Token He
Token ID679
Feature activation+8.53e-03

Pos logit contributions
aw+0.002
iles+0.002
 me+0.002
ow+0.002
una+0.002
Neg logit contributions
 spoiler-0.003
 sponge-0.003
ge-0.002
 prope-0.002
 ramp-0.002
Token wants
Token ID3382
Feature activation+1.86e-02

Pos logit contributions
 spoiler+0.016
 ramp+0.014
 sponge+0.013
 widened+0.013
ro+0.013
Neg logit contributions
aw-0.015
iles-0.013
ing-0.013
ups-0.013
ese-0.012
Token to
Token ID284
Feature activation-1.38e-02

Pos logit contributions
 spoiler+0.029
 sponge+0.027
 glass+0.027
 cloth+0.027
 allowance+0.026
Neg logit contributions
aw-0.034
ani-0.030
itting-0.030
arling-0.029
iles-0.029
Token spell
Token ID4822
Feature activation-7.37e-02

Pos logit contributions
aw+0.027
iles+0.024
ing+0.023
ups+0.022
ils+0.022
Neg logit contributions
 spoiler-0.024
 sponge-0.020
 ramp-0.020
ro-0.019
 prope-0.018
Token snake
Token ID17522
Feature activation-1.22e-01

Pos logit contributions
aw+0.133
iles+0.115
ups+0.109
ese+0.106
ils+0.106
Neg logit contributions
 spoiler-0.139
 sponge-0.115
 ramp-0.115
 prope-0.109
ro-0.107
Token with
Token ID351
Feature activation-8.65e-03

Pos logit contributions
aw+0.210
iles+0.176
ocked+0.176
ups+0.173
ani+0.164
Neg logit contributions
 spoiler-0.214
 ramp-0.194
ro-0.189
 sponge-0.182
 pret-0.175
Token his
Token ID465
Feature activation+2.95e-02

Pos logit contributions
aw+0.016
iles+0.014
ups+0.013
ing+0.013
ils+0.013
Neg logit contributions
 spoiler-0.016
 sponge-0.013
 ramp-0.013
ro-0.012
 prope-0.012
Token blocks
Token ID7021
Feature activation+1.71e-02

Pos logit contributions
 spoiler+0.046
 sponge+0.039
 ramp+0.038
 allowance+0.035
 prope+0.034
Neg logit contributions
aw-0.065
ups-0.056
iles-0.055
ing-0.054
ils-0.053
Token.
Token ID13
Feature activation-5.05e-04

Pos logit contributions
 spoiler+0.042
 sponge+0.036
 ramp+0.036
 prope+0.035
 widened+0.035
Neg logit contributions
aw-0.022
iles-0.018
ups-0.016
ow-0.016
ing-0.016
Token He
Token ID679
Feature activation+1.26e-02

Pos logit contributions
aw+0.001
iles+0.001
ups+0.001
ing+0.001
ow+0.001
Neg logit contributions
 spoiler-0.001
 sponge-0.001
 ramp-0.001
 prope-0.001
 widened-0.001
Token.
Token ID13
Feature activation+6.38e-03

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.005
 prope+0.005
Neg logit contributions
aw-0.007
iles-0.006
ups-0.005
ing-0.005
una-0.005
Token
Token ID220
Feature activation+4.40e-04

Pos logit contributions
 spoiler+0.011
 sponge+0.010
 ramp+0.009
 widened+0.009
ro+0.009
Neg logit contributions
aw-0.012
iles-0.011
ups-0.010
ing-0.010
ow-0.010
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
 spoiler+0.024
 sponge+0.020
 ramp+0.020
ro+0.019
 widened+0.019
Neg logit contributions
aw-0.024
iles-0.021
ups-0.019
ils-0.019
ing-0.019
TokenOn
Token ID2202
Feature activation-6.42e-03

Pos logit contributions
 spoiler+0.025
 ramp+0.021
 sponge+0.021
 widened+0.021
ro+0.021
Neg logit contributions
aw-0.029
iles-0.027
ow-0.026
ing-0.025
ese-0.024
Tokena
Token ID64
Feature activation-1.21e-01

Pos logit contributions
aw+0.010
iles+0.008
ups+0.008
ing+0.008
ils+0.007
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 ramp-0.012
ro-0.011
 prope-0.011
Token and
Token ID290
Feature activation-1.59e-02

Pos logit contributions
aw+0.193
iles+0.169
ups+0.152
una+0.149
ing+0.149
Neg logit contributions
 spoiler-0.245
 sponge-0.212
 ramp-0.211
ro-0.204
 widened-0.200
Token her
Token ID607
Feature activation-7.22e-03

Pos logit contributions
aw+0.025
iles+0.022
ups+0.019
ese+0.019
ing+0.019
Neg logit contributions
 spoiler-0.033
 sponge-0.029
ro-0.028
 ramp-0.028
 allowance-0.027
Token friends
Token ID2460
Feature activation-8.73e-03

Pos logit contributions
aw+0.013
iles+0.011
ups+0.010
ing+0.010
una+0.009
Neg logit contributions
 spoiler-0.014
 ramp-0.012
 sponge-0.012
 allowance-0.012
ro-0.012
Token had
Token ID550
Feature activation+4.27e-03

Pos logit contributions
aw+0.016
iles+0.014
una+0.013
ups+0.013
ing+0.013
Neg logit contributions
 spoiler-0.015
 sponge-0.013
 ramp-0.013
ro-0.013
 widened-0.012
Token so
Token ID523
Feature activation+7.51e-03

Pos logit contributions
 spoiler+0.009
 sponge+0.008
 ramp+0.007
 widened+0.007
ro+0.007
Neg logit contributions
aw-0.007
iles-0.006
ing-0.005
ups-0.005
ow-0.005
Token But
Token ID887
Feature activation-5.29e-03

Pos logit contributions
 spoiler+0.004
 sponge+0.004
 ramp+0.004
 widened+0.003
ro+0.003
Neg logit contributions
aw-0.006
iles-0.005
ups-0.005
ing-0.005
ow-0.005
Token we
Token ID356
Feature activation+5.95e-04

Pos logit contributions
aw+0.009
iles+0.008
ups+0.008
ing+0.007
ils+0.007
Neg logit contributions
 spoiler-0.010
 sponge-0.009
 ramp-0.009
ro-0.008
 widened-0.008
Token don
Token ID836
Feature activation-2.01e-03

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token't
Token ID470
Feature activation-4.67e-03

Pos logit contributions
aw+0.003
iles+0.003
ups+0.003
ing+0.003
ow+0.003
Neg logit contributions
 spoiler-0.004
 sponge-0.003
 ramp-0.003
ro-0.003
 prope-0.003
Token like
Token ID588
Feature activation-5.27e-02

Pos logit contributions
aw+0.011
iles+0.009
ups+0.009
ing+0.009
ils+0.009
Neg logit contributions
 spoiler-0.006
 sponge-0.006
ro-0.005
 ramp-0.005
 cloth-0.005
Token le
Token ID443
Feature activation-1.21e-01

Pos logit contributions
aw+0.095
iles+0.083
ing+0.074
ups+0.074
ow+0.074
Neg logit contributions
 spoiler-0.094
 sponge-0.088
 cloth-0.081
 ramp-0.080
 allowance-0.079
Tokenmons
Token ID11567
Feature activation-9.23e-02

Pos logit contributions
 me+0.224
father+0.220
aw+0.218
mother+0.211
 introducing+0.205
Neg logit contributions
ro-0.168
 sponge-0.140
 spoiler-0.135
bes-0.133
�-0.124
Token.
Token ID13
Feature activation+6.88e-03

Pos logit contributions
aw+0.161
iles+0.141
ups+0.128
ow+0.127
ils+0.125
Neg logit contributions
 spoiler-0.169
 sponge-0.151
 ramp-0.144
ro-0.141
 widened-0.136
Token They
Token ID1119
Feature activation+8.33e-03

Pos logit contributions
 spoiler+0.013
 ramp+0.011
ge+0.011
 sponge+0.011
 widened+0.011
Neg logit contributions
aw-0.013
iles-0.010
ups-0.010
ese-0.010
ow-0.010
Token are
Token ID389
Feature activation-1.07e-02

Pos logit contributions
 spoiler+0.015
 sponge+0.013
 ramp+0.012
ro+0.012
 widened+0.012
Neg logit contributions
aw-0.016
iles-0.014
ups-0.013
ing-0.013
ils-0.013
Token too
Token ID1165
Feature activation-1.41e-02

Pos logit contributions
aw+0.021
iles+0.018
ing+0.018
ups+0.017
ils+0.016
Neg logit contributions
 spoiler-0.018
 sponge-0.016
 ramp-0.015
 allowance-0.014
 cloth-0.014
Token a
Token ID257
Feature activation+1.68e-03

Pos logit contributions
aw+0.048
iles+0.037
una+0.033
 me+0.032
ing+0.032
Neg logit contributions
 spoiler-0.081
 ramp-0.070
 sponge-0.069
ro-0.067
ge-0.066
Token little
Token ID1310
Feature activation+2.23e-04

Pos logit contributions
 spoiler+0.003
 sponge+0.002
 ramp+0.002
ro+0.002
 widened+0.002
Neg logit contributions
aw-0.003
iles-0.003
ing-0.003
ups-0.003
una-0.003
Token girl
Token ID2576
Feature activation-1.16e-02

Pos logit contributions
 spoiler+0.000
 sponge+0.000
 ramp+0.000
ro+0.000
 widened+0.000
Neg logit contributions
aw-0.000
iles-0.000
ups-0.000
ing-0.000
ils-0.000
Token named
Token ID3706
Feature activation+4.34e-03

Pos logit contributions
aw+0.019
iles+0.018
ups+0.016
ing+0.016
ese+0.015
Neg logit contributions
 spoiler-0.021
 sponge-0.019
 ramp-0.018
ro-0.018
 cloth-0.018
Token On
Token ID1550
Feature activation-9.91e-03

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.005
 cloth+0.004
Neg logit contributions
aw-0.010
iles-0.009
ups-0.008
ese-0.008
ing-0.008
Tokena
Token ID64
Feature activation-1.19e-01

Pos logit contributions
aw+0.014
iles+0.011
 me+0.010
ups+0.010
una+0.010
Neg logit contributions
 spoiler-0.023
 sponge-0.020
 ramp-0.020
ro-0.019
 widened-0.019
Token.
Token ID13
Feature activation-9.93e-03

Pos logit contributions
aw+0.189
iles+0.170
ing+0.151
ups+0.149
ese+0.146
Neg logit contributions
 spoiler-0.229
 sponge-0.205
 ramp-0.198
ro-0.194
bes-0.193
Token She
Token ID1375
Feature activation-1.84e-03

Pos logit contributions
aw+0.015
iles+0.012
ow+0.012
una+0.011
 me+0.011
Neg logit contributions
 spoiler-0.022
 ramp-0.019
 sponge-0.019
ge-0.018
 prope-0.018
Token wanted
Token ID2227
Feature activation+2.62e-02

Pos logit contributions
aw+0.003
iles+0.002
 me+0.002
ils+0.002
ups+0.002
Neg logit contributions
 spoiler-0.004
 sponge-0.004
 ramp-0.004
 prope-0.003
ge-0.003
Token to
Token ID284
Feature activation-1.18e-02

Pos logit contributions
 spoiler+0.053
 sponge+0.051
 allowance+0.049
 cloth+0.049
 ramp+0.047
Neg logit contributions
aw-0.040
iles-0.035
ese-0.031
ils-0.030
ani-0.028
Token be
Token ID307
Feature activation-1.69e-02

Pos logit contributions
aw+0.018
 me+0.016
ow+0.015
iles+0.015
ups+0.015
Neg logit contributions
 spoiler-0.025
 sponge-0.021
 ramp-0.021
 prope-0.021
 widened-0.020
Token more
Token ID517
Feature activation-1.25e-02

Pos logit contributions
aw+0.017
iles+0.015
ing+0.013
ils+0.013
ups+0.012
Neg logit contributions
 spoiler-0.021
 sponge-0.019
 ramp-0.018
 prope-0.018
 allowance-0.018
Token careful
Token ID8161
Feature activation+1.66e-02

Pos logit contributions
aw+0.019
iles+0.016
ups+0.015
ils+0.014
ing+0.014
Neg logit contributions
 spoiler-0.027
 sponge-0.024
 ramp-0.023
ro-0.022
 prope-0.022
Token when
Token ID618
Feature activation-3.25e-02

Pos logit contributions
 spoiler+0.035
 sponge+0.032
ro+0.032
 ramp+0.031
ge+0.030
Neg logit contributions
aw-0.024
iles-0.021
ils-0.018
ese-0.017
una-0.017
Token she
Token ID673
Feature activation-2.65e-02

Pos logit contributions
aw+0.052
iles+0.043
ups+0.038
ese+0.038
ils+0.038
Neg logit contributions
 spoiler-0.068
 sponge-0.061
 ramp-0.058
 allowance-0.057
ro-0.056
Token ate
Token ID15063
Feature activation-3.36e-02

Pos logit contributions
aw+0.047
iles+0.041
ing+0.038
ups+0.038
una+0.037
Neg logit contributions
 spoiler-0.049
 sponge-0.043
 ramp-0.041
ro-0.041
 widened-0.040
Token pepper
Token ID13385
Feature activation-1.19e-01

Pos logit contributions
aw+0.052
iles+0.042
ups+0.039
ing+0.039
ils+0.038
Neg logit contributions
 spoiler-0.075
 ramp-0.064
 sponge-0.063
 prope-0.060
ro-0.060
Token and
Token ID290
Feature activation+1.86e-03

Pos logit contributions
aw+0.200
iles+0.173
ups+0.155
ese+0.153
ow+0.153
Neg logit contributions
 spoiler-0.224
 sponge-0.204
ro-0.196
 ramp-0.193
 widened-0.186
Token she
Token ID673
Feature activation+3.24e-03

Pos logit contributions
 spoiler+0.004
 sponge+0.003
 ramp+0.003
ro+0.003
 widened+0.003
Neg logit contributions
aw-0.003
iles-0.003
ups-0.002
ing-0.002
ils-0.002
Token always
Token ID1464
Feature activation-8.07e-03

Pos logit contributions
 spoiler+0.007
 sponge+0.006
 ramp+0.006
ro+0.006
 widened+0.006
Neg logit contributions
aw-0.005
iles-0.004
ups-0.004
ing-0.004
ils-0.004
Token remembered
Token ID12086
Feature activation-1.60e-02

Pos logit contributions
aw+0.013
iles+0.011
ils+0.010
ups+0.010
ing+0.010
Neg logit contributions
 spoiler-0.017
 ramp-0.015
 sponge-0.015
 prope-0.014
ro-0.014
Token what
Token ID644
Feature activation-7.17e-04

Pos logit contributions
aw+0.025
iles+0.022
ese+0.019
una+0.018
ow+0.018
Neg logit contributions
 sponge-0.030
 spoiler-0.030
 cloth-0.029
 allowance-0.028
ro-0.027
Token ready
Token ID3492
Feature activation+1.45e-02

Pos logit contributions
aw+0.038
ups+0.032
iles+0.032
ing+0.031
ani+0.030
Neg logit contributions
 spoiler-0.029
 flat-0.024
 ramp-0.024
 sponge-0.024
 magnet-0.023
Token to
Token ID284
Feature activation-2.99e-02

Pos logit contributions
 spoiler+0.034
 sponge+0.031
 ramp+0.030
ro+0.029
 allowance+0.028
Neg logit contributions
aw-0.019
iles-0.015
ups-0.014
ils-0.013
ese-0.013
Token go
Token ID467
Feature activation-3.32e-02

Pos logit contributions
aw+0.057
iles+0.048
ups+0.046
ing+0.046
ils+0.044
Neg logit contributions
 spoiler-0.055
 sponge-0.047
 ramp-0.046
ro-0.043
 prope-0.043
Token now
Token ID783
Feature activation+5.09e-03

Pos logit contributions
aw+0.047
iles+0.038
ups+0.035
ing+0.035
 me+0.034
Neg logit contributions
 spoiler-0.078
 sponge-0.068
 ramp-0.066
 widened-0.064
 prope-0.063
Token little
Token ID1310
Feature activation-5.61e-02

Pos logit contributions
 spoiler+0.010
 sponge+0.009
 ramp+0.009
 widened+0.008
 prope+0.008
Neg logit contributions
aw-0.009
iles-0.007
ups-0.007
ing-0.007
ils-0.007
Token onion
Token ID21670
Feature activation-1.19e-01

Pos logit contributions
aw+0.096
iles+0.082
ups+0.076
ils+0.074
una+0.073
Neg logit contributions
 spoiler-0.110
 sponge-0.096
 ramp-0.094
ro-0.090
 prope-0.088
Token!"
Token ID2474
Feature activation+2.65e-02

Pos logit contributions
aw+0.205
iles+0.175
ups+0.163
ils+0.156
una+0.155
Neg logit contributions
 spoiler-0.225
 sponge-0.198
 ramp-0.197
ro-0.190
 widened-0.179
Token
Token ID198
Feature activation+6.35e-03

Pos logit contributions
 spoiler+0.043
 sponge+0.036
 ramp+0.035
ro+0.034
 widened+0.033
Neg logit contributions
aw-0.054
iles-0.050
ups-0.046
ils-0.045
ing-0.045
Token
Token ID198
Feature activation+1.00e-02

Pos logit contributions
 spoiler+0.010
 ramp+0.009
 sponge+0.009
ro+0.008
 widened+0.008
Neg logit contributions
aw-0.013
iles-0.012
ils-0.011
ups-0.011
ese-0.010
TokenThe
Token ID464
Feature activation-5.37e-03

Pos logit contributions
 spoiler+0.016
 ramp+0.014
 sponge+0.013
 widened+0.013
ro+0.013
Neg logit contributions
aw-0.020
iles-0.018
ing-0.017
ow-0.017
ils-0.017
Token onion
Token ID21670
Feature activation-3.26e-02

Pos logit contributions
aw+0.011
iles+0.009
ups+0.009
ow+0.008
ing+0.008
Neg logit contributions
 spoiler-0.009
 sponge-0.008
 ramp-0.008
 cloth-0.008
 prope-0.007
Token never
Token ID1239
Feature activation+1.54e-02

Pos logit contributions
aw+0.001
iles+0.001
ups+0.000
ing+0.000
ese+0.000
Neg logit contributions
 spoiler-0.001
 sponge-0.001
 ramp-0.001
ro-0.001
 widened-0.001
Token asked
Token ID1965
Feature activation+2.07e-03

Pos logit contributions
 spoiler+0.022
 sponge+0.020
ro+0.019
ge+0.018
 widened+0.018
Neg logit contributions
aw-0.033
iles-0.030
una-0.028
ani-0.027
ils-0.027
Token for
Token ID329
Feature activation-7.07e-02

Pos logit contributions
 spoiler+0.004
 sponge+0.004
 allowance+0.004
 ramp+0.004
ro+0.003
Neg logit contributions
aw-0.004
iles-0.003
ups-0.003
ing-0.003
ese-0.003
Token too
Token ID1165
Feature activation-4.86e-02

Pos logit contributions
aw+0.110
iles+0.092
ups+0.085
ing+0.083
ils+0.082
Neg logit contributions
 spoiler-0.153
 sponge-0.133
 ramp-0.131
ro-0.124
 widened-0.124
Token much
Token ID881
Feature activation-4.76e-02

Pos logit contributions
aw+0.087
iles+0.074
ups+0.069
ing+0.068
ils+0.067
Neg logit contributions
 spoiler-0.094
 sponge-0.081
 ramp-0.079
ro-0.075
 widened-0.075
Token pepper
Token ID13385
Feature activation-1.17e-01

Pos logit contributions
aw+0.089
iles+0.076
ils+0.073
ow+0.072
ups+0.071
Neg logit contributions
 spoiler-0.083
 sponge-0.075
 ramp-0.069
 allowance-0.068
ro-0.067
Token again
Token ID757
Feature activation+1.07e-02

Pos logit contributions
aw+0.189
iles+0.159
ani+0.151
ow+0.150
 me+0.143
Neg logit contributions
 spoiler-0.208
 sponge-0.199
ro-0.197
kin-0.184
 ramp-0.183
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
 spoiler+0.023
 sponge+0.020
 ramp+0.020
ro+0.019
 widened+0.019
Neg logit contributions
aw-0.016
iles-0.014
ups-0.012
ing-0.012
ils-0.012
Token<|endoftext|>
Token ID50256
Feature activation-3.74e-02

Pos logit contributions
aw+0.024
iles+0.021
ils+0.020
ups+0.020
ese+0.019
Neg logit contributions
 spoiler-0.021
 sponge-0.018
 ramp-0.018
ro-0.017
 allowance-0.016
TokenOnce
Token ID7454
Feature activation+8.39e-03

Pos logit contributions
aw+0.061
iles+0.051
ups+0.047
ing+0.047
una+0.046
Neg logit contributions
 spoiler-0.079
 sponge-0.068
 ramp-0.066
 prope-0.063
 widened-0.062
Token upon
Token ID2402
Feature activation-4.26e-02

Pos logit contributions
 spoiler+0.015
 sponge+0.013
 ramp+0.012
ro+0.012
 widened+0.012
Neg logit contributions
aw-0.016
iles-0.014
ups-0.013
ing-0.013
ils-0.013
Token lemon
Token ID18873
Feature activation-4.15e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 allowance+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token tree
Token ID5509
Feature activation-1.12e-02

Pos logit contributions
aw+0.076
iles+0.067
ow+0.061
ups+0.060
ing+0.060
Neg logit contributions
 spoiler-0.073
 sponge-0.066
 ramp-0.063
ro-0.061
 widened-0.060
Token harvested
Token ID34262
Feature activation-1.95e-02

Pos logit contributions
aw+0.020
iles+0.018
ups+0.016
ing+0.016
ils+0.016
Neg logit contributions
 spoiler-0.021
 sponge-0.018
 ramp-0.018
ro-0.017
 widened-0.017
Token lots
Token ID6041
Feature activation-5.26e-03

Pos logit contributions
aw+0.034
iles+0.030
ups+0.028
ow+0.027
ils+0.027
Neg logit contributions
 spoiler-0.036
 sponge-0.032
 ramp-0.031
ro-0.030
 cloth-0.030
Token of
Token ID286
Feature activation+9.79e-04

Pos logit contributions
aw+0.007
iles+0.006
ups+0.006
ow+0.005
ils+0.005
Neg logit contributions
 spoiler-0.012
 sponge-0.011
 ramp-0.010
ro-0.010
 widened-0.010
Token le
Token ID443
Feature activation-1.17e-01

Pos logit contributions
 spoiler+0.002
 sponge+0.002
 ramp+0.002
ro+0.001
 prope+0.001
Neg logit contributions
aw-0.002
iles-0.002
ups-0.001
ing-0.001
ils-0.001
Tokenmons
Token ID11567
Feature activation-8.70e-02

Pos logit contributions
 me+0.199
aw+0.191
 opera+0.191
 us+0.190
iles+0.186
Neg logit contributions
ro-0.177
 spoiler-0.168
 sponge-0.160
driver-0.149
 ramp-0.148
Token and
Token ID290
Feature activation+8.81e-04

Pos logit contributions
aw+0.162
iles+0.144
ups+0.130
una+0.129
ow+0.128
Neg logit contributions
 spoiler-0.151
 sponge-0.135
 ramp-0.127
ro-0.125
 widened-0.123
Token made
Token ID925
Feature activation+1.64e-02

Pos logit contributions
 spoiler+0.002
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.002
iles-0.001
ups-0.001
ing-0.001
ow-0.001
Token a
Token ID257
Feature activation+1.97e-02

Pos logit contributions
 spoiler+0.029
 sponge+0.026
 ramp+0.024
 widened+0.024
 cloth+0.023
Neg logit contributions
aw-0.028
iles-0.027
ups-0.023
ese-0.023
ow-0.023
Token big
Token ID1263
Feature activation+8.24e-03

Pos logit contributions
 spoiler+0.034
 sponge+0.030
 ramp+0.029
 widened+0.027
 prope+0.026
Neg logit contributions
aw-0.038
iles-0.035
ups-0.032
ils-0.032
ow-0.032
Token.
Token ID13
Feature activation+7.84e-03

Pos logit contributions
aw+0.019
iles+0.016
ups+0.015
ing+0.014
ese+0.014
Neg logit contributions
 spoiler-0.020
 sponge-0.017
 ramp-0.017
 cloth-0.016
 allowance-0.016
Token But
Token ID887
Feature activation-3.67e-03

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 widened+0.010
 prope+0.009
Neg logit contributions
aw-0.017
iles-0.014
ups-0.014
ing-0.013
ow-0.013
Token we
Token ID356
Feature activation-1.77e-04

Pos logit contributions
aw+0.006
iles+0.005
ups+0.005
ing+0.005
ils+0.005
Neg logit contributions
 spoiler-0.008
 sponge-0.006
 ramp-0.006
 widened-0.006
ro-0.006
Token need
Token ID761
Feature activation-2.07e-02

Pos logit contributions
aw+0.000
iles+0.000
ups+0.000
ing+0.000
ils+0.000
Neg logit contributions
 spoiler-0.000
 sponge-0.000
 ramp-0.000
ro-0.000
 widened-0.000
Token more
Token ID517
Feature activation-3.74e-02

Pos logit contributions
aw+0.032
iles+0.029
ing+0.025
ani+0.025
ese+0.025
Neg logit contributions
 spoiler-0.041
 sponge-0.039
 cloth-0.038
 allowance-0.037
 glass-0.034
Token le
Token ID443
Feature activation-1.17e-01

Pos logit contributions
aw+0.064
iles+0.054
ups+0.050
ing+0.049
ils+0.048
Neg logit contributions
 spoiler-0.075
 sponge-0.065
 ramp-0.064
ro-0.061
 widened-0.060
Tokenmons
Token ID11567
Feature activation-9.62e-02

Pos logit contributions
aw+0.206
 me+0.196
iles+0.190
 us+0.188
 opera+0.185
Neg logit contributions
ro-0.175
 spoiler-0.169
 sponge-0.158
 allowance-0.155
driver-0.149
Token."
Token ID526
Feature activation+3.76e-02

Pos logit contributions
aw+0.140
iles+0.120
una+0.100
ing+0.099
ani+0.098
Neg logit contributions
 spoiler-0.197
 sponge-0.187
ro-0.179
 ramp-0.173
 allowance-0.173
Token Jane
Token ID12091
Feature activation+2.34e-02

Pos logit contributions
 spoiler+0.060
 sponge+0.051
 ramp+0.048
 widened+0.046
ro+0.046
Neg logit contributions
aw-0.078
iles-0.070
ups-0.065
ing-0.064
ils-0.064
Token and
Token ID290
Feature activation+8.93e-03

Pos logit contributions
 spoiler+0.042
 sponge+0.036
 ramp+0.035
ro+0.034
 widened+0.034
Neg logit contributions
aw-0.044
iles-0.038
ups-0.036
ing-0.035
ils-0.034
Token her
Token ID607
Feature activation+3.43e-02

Pos logit contributions
 spoiler+0.019
 sponge+0.018
 cloth+0.017
 ramp+0.016
 widened+0.016
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ing-0.010
ese-0.010
Token the
Token ID262
Feature activation-4.42e-02

Pos logit contributions
 spoiler+0.052
 ramp+0.046
ro+0.045
 sponge+0.045
bes+0.043
Neg logit contributions
aw-0.025
iles-0.018
ing-0.017
 me-0.017
 us-0.016
Token sack
Token ID23704
Feature activation-3.89e-04

Pos logit contributions
aw+0.088
 me+0.074
ing+0.073
ups+0.072
iles+0.072
Neg logit contributions
 spoiler-0.075
 ramp-0.060
ge-0.059
 sponge-0.059
 widened-0.059
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
aw+0.001
iles+0.001
ing+0.001
ups+0.001
ow+0.001
Neg logit contributions
 spoiler-0.001
 sponge-0.001
 ramp-0.001
 prope-0.001
 widened-0.001
Token saw
Token ID2497
Feature activation-8.28e-03

Pos logit contributions
aw+0.018
iles+0.015
ups+0.014
ing+0.013
ils+0.013
Neg logit contributions
 spoiler-0.022
 sponge-0.018
 ramp-0.018
 prope-0.017
ro-0.017
Token many
Token ID867
Feature activation-1.33e-02

Pos logit contributions
aw+0.014
iles+0.013
ups+0.012
ese+0.012
ing+0.012
Neg logit contributions
 spoiler-0.016
 sponge-0.013
 ramp-0.013
 widened-0.012
 cloth-0.012
Token le
Token ID443
Feature activation-1.17e-01

Pos logit contributions
aw+0.022
iles+0.018
ils+0.017
ups+0.017
ing+0.017
Neg logit contributions
 spoiler-0.029
 ramp-0.024
ro-0.024
 sponge-0.024
 prope-0.024
Tokenmons
Token ID11567
Feature activation-8.58e-02

Pos logit contributions
 me+0.215
aw+0.205
 us+0.200
 opera+0.191
iles+0.190
Neg logit contributions
 spoiler-0.170
ro-0.169
driver-0.164
 sponge-0.157
 ramp-0.153
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
aw+0.145
iles+0.129
una+0.115
ow+0.114
ese+0.112
Neg logit contributions
 spoiler-0.158
 sponge-0.148
 ramp-0.137
ro-0.133
 cloth-0.133
Token They
Token ID1119
Feature activation+5.17e-03

Pos logit contributions
 spoiler+0.027
 sponge+0.022
 ramp+0.022
ro+0.020
 widened+0.020
Neg logit contributions
aw-0.034
iles-0.030
ups-0.028
ing-0.028
ils-0.027
Token smelled
Token ID37609
Feature activation-2.92e-02

Pos logit contributions
 spoiler+0.010
 ramp+0.009
 sponge+0.008
ro+0.008
 prope+0.008
Neg logit contributions
aw-0.009
iles-0.008
ups-0.007
ils-0.007
ing-0.007
Token sour
Token ID11348
Feature activation-5.30e-02

Pos logit contributions
aw+0.050
iles+0.044
ups+0.040
ow+0.039
ing+0.039
Neg logit contributions
 spoiler-0.057
 sponge-0.050
 ramp-0.048
 widened-0.046
ro-0.045
Token made
Token ID925
Feature activation+9.76e-03

Pos logit contributions
aw+0.013
iles+0.011
ups+0.010
ow+0.010
ing+0.010
Neg logit contributions
 spoiler-0.015
 sponge-0.013
 ramp-0.013
ro-0.012
 widened-0.012
Token sure
Token ID1654
Feature activation-5.70e-03

Pos logit contributions
 spoiler+0.018
 sponge+0.017
 cloth+0.017
 ramp+0.016
bes+0.016
Neg logit contributions
aw-0.015
iles-0.015
ow-0.012
ese-0.012
wl-0.011
Token to
Token ID284
Feature activation-1.36e-02

Pos logit contributions
aw+0.009
iles+0.008
ese+0.007
una+0.006
ils+0.006
Neg logit contributions
 spoiler-0.011
 sponge-0.011
 allowance-0.011
 cloth-0.011
ro-0.010
Token put
Token ID1234
Feature activation-7.03e-03

Pos logit contributions
aw+0.025
iles+0.021
ups+0.020
ing+0.019
ils+0.019
Neg logit contributions
 spoiler-0.026
 sponge-0.022
 ramp-0.022
 widened-0.020
ro-0.020
Token on
Token ID319
Feature activation+7.07e-03

Pos logit contributions
aw+0.010
iles+0.007
ing+0.007
ups+0.007
 me+0.007
Neg logit contributions
 spoiler-0.017
 ramp-0.015
 sponge-0.014
 widened-0.014
 prope-0.014
Token lot
Token ID1256
Feature activation-1.17e-01

Pos logit contributions
 spoiler+0.017
 ramp+0.015
 sponge+0.015
 widened+0.014
ro+0.014
Neg logit contributions
aw-0.010
iles-0.007
ups-0.007
ing-0.007
ils-0.007
Tokenion
Token ID295
Feature activation-7.52e-02

Pos logit contributions
aw+0.160
iles+0.128
ups+0.120
ing+0.117
ils+0.112
Neg logit contributions
 spoiler-0.279
 sponge-0.242
 ramp-0.241
 widened-0.228
 prope-0.228
Token every
Token ID790
Feature activation+1.34e-02

Pos logit contributions
aw+0.129
iles+0.121
ese+0.103
una+0.102
ups+0.100
Neg logit contributions
 spoiler-0.134
 sponge-0.124
ro-0.124
 ramp-0.118
 cloth-0.114
Token time
Token ID640
Feature activation+1.80e-03

Pos logit contributions
 spoiler+0.032
 sponge+0.028
 ramp+0.028
 widened+0.026
ro+0.026
Neg logit contributions
aw-0.018
iles-0.014
ups-0.013
ing-0.013
ils-0.013
Token she
Token ID673
Feature activation-2.05e-02

Pos logit contributions
 spoiler+0.004
 sponge+0.003
 ramp+0.003
ro+0.003
 allowance+0.003
Neg logit contributions
aw-0.003
iles-0.003
ese-0.002
ups-0.002
una-0.002
Token went
Token ID1816
Feature activation-4.56e-03

Pos logit contributions
aw+0.037
iles+0.032
ups+0.030
ing+0.030
una+0.029
Neg logit contributions
 spoiler-0.038
 sponge-0.033
 ramp-0.032
ro-0.031
 widened-0.030
Token do
Token ID466
Feature activation-5.78e-03

Pos logit contributions
aw+0.031
iles+0.026
ups+0.024
ing+0.024
ils+0.023
Neg logit contributions
 spoiler-0.037
 sponge-0.032
 ramp-0.031
ro-0.029
 widened-0.029
Token not
Token ID407
Feature activation-3.72e-03

Pos logit contributions
aw+0.010
iles+0.009
ing+0.008
ils+0.008
ups+0.008
Neg logit contributions
 spoiler-0.011
 ramp-0.010
 sponge-0.009
ro-0.009
bes-0.009
Token need
Token ID761
Feature activation-2.65e-02

Pos logit contributions
aw+0.008
ing+0.007
iles+0.007
ups+0.007
ils+0.007
Neg logit contributions
 spoiler-0.005
ro-0.004
 ramp-0.004
 sponge-0.004
ge-0.004
Token to
Token ID284
Feature activation-1.92e-02

Pos logit contributions
aw+0.052
iles+0.043
ing+0.042
ese+0.042
ils+0.041
Neg logit contributions
 cloth-0.040
 sponge-0.039
 spoiler-0.039
 allowance-0.037
 plastic-0.035
Token spell
Token ID4822
Feature activation-7.88e-02

Pos logit contributions
aw+0.039
iles+0.034
ing+0.033
ups+0.032
ils+0.031
Neg logit contributions
 spoiler-0.032
 sponge-0.027
 ramp-0.027
ro-0.025
 prope-0.025
Token snake
Token ID17522
Feature activation-1.17e-01

Pos logit contributions
aw+0.125
iles+0.106
ups+0.095
ese+0.095
ils+0.094
Neg logit contributions
 spoiler-0.164
 sponge-0.142
 ramp-0.139
 prope-0.134
ro-0.134
Token.
Token ID13
Feature activation-7.55e-03

Pos logit contributions
aw+0.196
iles+0.167
ocked+0.156
una+0.154
ups+0.152
Neg logit contributions
 spoiler-0.211
 ramp-0.189
 sponge-0.188
ro-0.186
 allowance-0.175
Token You
Token ID921
Feature activation-1.47e-02

Pos logit contributions
aw+0.012
iles+0.010
ow+0.009
una+0.009
ups+0.009
Neg logit contributions
 spoiler-0.016
 sponge-0.014
 ramp-0.014
 widened-0.013
ge-0.013
Token need
Token ID761
Feature activation-1.63e-02

Pos logit contributions
aw+0.028
iles+0.024
ups+0.023
ing+0.022
ils+0.022
Neg logit contributions
 spoiler-0.027
 sponge-0.022
 ramp-0.022
ro-0.021
 widened-0.020
Token to
Token ID284
Feature activation-9.20e-03

Pos logit contributions
aw+0.026
iles+0.022
ese+0.020
ing+0.020
ils+0.020
Neg logit contributions
 spoiler-0.031
 sponge-0.031
 cloth-0.030
 prope-0.028
ro-0.028
Token say
Token ID910
Feature activation-5.72e-02

Pos logit contributions
aw+0.018
iles+0.015
ing+0.015
ups+0.014
ils+0.014
Neg logit contributions
 spoiler-0.016
 sponge-0.014
 ramp-0.014
ro-0.013
 prope-0.013
Token.
Token ID13
Feature activation+2.32e-03

Pos logit contributions
aw+0.029
iles+0.025
ups+0.023
ils+0.022
ese+0.022
Neg logit contributions
 spoiler-0.034
 sponge-0.030
 ramp-0.029
ro-0.028
 widened-0.027
Token They
Token ID1119
Feature activation+4.54e-03

Pos logit contributions
 spoiler+0.004
ge+0.004
 sponge+0.004
 ramp+0.004
 widened+0.004
Neg logit contributions
aw-0.004
iles-0.003
ow-0.003
ups-0.003
ing-0.003
Token have
Token ID423
Feature activation-1.14e-02

Pos logit contributions
 spoiler+0.008
 sponge+0.007
 ramp+0.007
 widened+0.006
ro+0.006
Neg logit contributions
aw-0.009
iles-0.008
ups-0.007
ing-0.007
una-0.007
Token so
Token ID523
Feature activation+5.22e-03

Pos logit contributions
aw+0.019
iles+0.016
ups+0.015
ing+0.015
ils+0.015
Neg logit contributions
 spoiler-0.023
 sponge-0.020
 ramp-0.020
ro-0.019
 widened-0.019
Token many
Token ID867
Feature activation-2.24e-02

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
ro+0.010
 widened+0.009
Neg logit contributions
aw-0.008
iles-0.007
ups-0.006
ing-0.006
ils-0.006
Token le
Token ID443
Feature activation-1.17e-01

Pos logit contributions
aw+0.037
iles+0.030
ups+0.029
ils+0.027
ing+0.027
Neg logit contributions
 spoiler-0.048
 ramp-0.041
 sponge-0.041
ro-0.039
 widened-0.038
Tokenmons
Token ID11567
Feature activation-8.52e-02

Pos logit contributions
 me+0.214
aw+0.202
 us+0.200
iles+0.196
 opera+0.191
Neg logit contributions
ro-0.169
 spoiler-0.164
 sponge-0.148
driver-0.146
 allowance-0.139
Token.
Token ID13
Feature activation+4.70e-04

Pos logit contributions
aw+0.136
iles+0.117
una+0.102
ups+0.102
ese+0.101
Neg logit contributions
 spoiler-0.168
 sponge-0.155
ro-0.146
 ramp-0.146
 allowance-0.139
Token They
Token ID1119
Feature activation+4.96e-03

Pos logit contributions
 spoiler+0.001
 ramp+0.001
ge+0.001
 sponge+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ow-0.001
 me-0.001
Token won
Token ID1839
Feature activation+1.13e-02

Pos logit contributions
 spoiler+0.009
 sponge+0.007
 ramp+0.007
ro+0.007
 widened+0.007
Neg logit contributions
aw-0.010
iles-0.008
ups-0.008
ing-0.008
ils-0.008
Token't
Token ID470
Feature activation-8.67e-03

Pos logit contributions
 spoiler+0.022
 sponge+0.019
 ramp+0.018
ro+0.017
 widened+0.017
Neg logit contributions
aw-0.020
iles-0.018
ups-0.016
ing-0.016
ils-0.016
Token kept
Token ID4030
Feature activation+2.89e-02

Pos logit contributions
 spoiler+0.049
 sponge+0.042
 ramp+0.041
ro+0.040
 widened+0.040
Neg logit contributions
aw-0.051
iles-0.045
ups-0.041
ing-0.041
una-0.040
Token putting
Token ID5137
Feature activation+2.01e-02

Pos logit contributions
 spoiler+0.064
 sponge+0.056
 ramp+0.055
 prope+0.052
ro+0.052
Neg logit contributions
aw-0.044
iles-0.037
ups-0.033
ing-0.032
ils-0.032
Token way
Token ID835
Feature activation-1.57e-02

Pos logit contributions
 spoiler+0.046
 ramp+0.040
 sponge+0.039
ro+0.038
 prope+0.038
Neg logit contributions
aw-0.030
iles-0.024
ing-0.022
ils-0.022
 me-0.022
Token too
Token ID1165
Feature activation-5.12e-02

Pos logit contributions
aw+0.021
iles+0.017
ils+0.015
ese+0.014
una+0.014
Neg logit contributions
 spoiler-0.037
 sponge-0.034
 ramp-0.032
ro-0.031
 cloth-0.031
Token much
Token ID881
Feature activation-4.67e-02

Pos logit contributions
aw+0.088
iles+0.075
ups+0.070
ing+0.070
ils+0.067
Neg logit contributions
 spoiler-0.103
 sponge-0.087
 ramp-0.087
ro-0.081
 prope-0.081
Token pepper
Token ID13385
Feature activation-1.16e-01

Pos logit contributions
aw+0.079
iles+0.066
ups+0.062
ing+0.061
ils+0.059
Neg logit contributions
 spoiler-0.097
 sponge-0.082
 ramp-0.081
 prope-0.077
ro-0.076
Token on
Token ID319
Feature activation+1.59e-02

Pos logit contributions
aw+0.179
iles+0.150
ups+0.135
ese+0.135
una+0.132
Neg logit contributions
 spoiler-0.231
 sponge-0.215
ro-0.212
 ramp-0.205
 widened-0.195
Token it
Token ID340
Feature activation-1.71e-02

Pos logit contributions
 spoiler+0.036
 sponge+0.030
 ramp+0.030
 prope+0.029
 widened+0.029
Neg logit contributions
aw-0.024
iles-0.020
 me-0.018
ing-0.018
ups-0.018
Token and
Token ID290
Feature activation+3.29e-03

Pos logit contributions
aw+0.028
iles+0.024
ups+0.022
ing+0.021
ils+0.021
Neg logit contributions
 spoiler-0.035
 sponge-0.031
 ramp-0.030
ro-0.029
 widened-0.028
Token Ben
Token ID3932
Feature activation-7.95e-03

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.005
 widened+0.005
Neg logit contributions
aw-0.006
iles-0.005
ups-0.005
ing-0.005
ese-0.005
Token kept
Token ID4030
Feature activation+1.91e-02

Pos logit contributions
aw+0.015
iles+0.013
ups+0.012
ing+0.011
ils+0.011
Neg logit contributions
 spoiler-0.015
 sponge-0.013
 ramp-0.012
ro-0.012
 widened-0.012
Token butterfly
Token ID35113
Feature activation-4.78e-02

Pos logit contributions
aw+0.057
iles+0.051
ow+0.048
ups+0.047
ils+0.046
Neg logit contributions
 spoiler-0.053
 sponge-0.047
 cloth-0.044
 ramp-0.044
 prope-0.043
Token as
Token ID355
Feature activation-4.66e-02

Pos logit contributions
aw+0.090
iles+0.080
ups+0.070
ese+0.068
una+0.067
Neg logit contributions
 spoiler-0.082
 ramp-0.076
 sponge-0.076
ro-0.075
 cloth-0.071
Token it
Token ID340
Feature activation-3.27e-02

Pos logit contributions
aw+0.087
iles+0.075
ups+0.071
ing+0.069
ils+0.068
Neg logit contributions
 spoiler-0.086
 sponge-0.073
 ramp-0.072
ro-0.067
 widened-0.067
Token flew
Token ID13112
Feature activation-1.91e-02

Pos logit contributions
aw+0.057
iles+0.053
ups+0.048
ing+0.047
una+0.047
Neg logit contributions
 spoiler-0.056
ro-0.051
 sponge-0.050
 widened-0.050
 ramp-0.049
Token from
Token ID422
Feature activation+1.48e-02

Pos logit contributions
aw+0.032
iles+0.030
ups+0.027
ils+0.027
ese+0.027
Neg logit contributions
 spoiler-0.036
ro-0.032
 sponge-0.031
 prope-0.030
 ramp-0.030
Token flower
Token ID15061
Feature activation-1.16e-01

Pos logit contributions
 spoiler+0.028
 sponge+0.024
 ramp+0.023
 prope+0.023
ro+0.023
Neg logit contributions
aw-0.026
ups-0.023
iles-0.023
ing-0.022
ese-0.021
Token to
Token ID284
Feature activation+6.38e-03

Pos logit contributions
aw+0.190
iles+0.186
ups+0.158
 hey+0.158
una+0.158
Neg logit contributions
 spoiler-0.195
ro-0.189
 sponge-0.188
 cloth-0.182
 prope-0.178
Token flower
Token ID15061
Feature activation-1.05e-01

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 prope+0.009
ro+0.009
Neg logit contributions
aw-0.012
iles-0.011
ups-0.010
ing-0.010
ils-0.009
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
aw+0.187
iles+0.179
ups+0.157
 hey+0.153
ese+0.153
Neg logit contributions
 spoiler-0.165
 sponge-0.154
 cloth-0.152
ro-0.150
 prope-0.142
Token
Token ID198
Feature activation+1.96e-03

Pos logit contributions
aw+0.014
iles+0.013
ups+0.012
ils+0.011
ing+0.011
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 ramp-0.012
ro-0.011
 widened-0.011
Token
Token ID198
Feature activation+5.37e-03

Pos logit contributions
 spoiler+0.004
 sponge+0.003
 ramp+0.003
ro+0.003
 widened+0.003
Neg logit contributions
aw-0.003
iles-0.003
ups-0.003
ing-0.003
ils-0.003
TokenMom
Token ID29252
Feature activation+3.07e-02

Pos logit contributions
aw+0.001
iles+0.000
ups+0.000
ing+0.000
ils+0.000
Neg logit contributions
 spoiler-0.001
 sponge-0.001
 ramp-0.001
ro-0.001
 widened-0.001
Tokenmy
Token ID1820
Feature activation+2.36e-02

Pos logit contributions
 spoiler+0.077
 ramp+0.068
 sponge+0.067
 prope+0.066
 widened+0.065
Neg logit contributions
aw-0.036
iles-0.029
ing-0.028
ups-0.027
 me-0.026
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
 spoiler+0.046
 sponge+0.040
 ramp+0.039
ro+0.037
 widened+0.037
Neg logit contributions
aw-0.041
iles-0.035
ups-0.033
ils-0.032
una-0.031
Token the
Token ID262
Feature activation+3.66e-02

Pos logit contributions
 spoiler+0.079
 widened+0.067
 ramp+0.067
bes+0.066
ro+0.066
Neg logit contributions
aw-0.054
iles-0.045
 me-0.045
ing-0.043
 us-0.042
Token f
Token ID277
Feature activation+3.12e-02

Pos logit contributions
 spoiler+0.060
 sponge+0.050
 ramp+0.049
ro+0.045
 widened+0.045
Neg logit contributions
aw-0.076
iles-0.067
ups-0.063
ing-0.062
ils-0.062
Tokenauc
Token ID14272
Feature activation+9.39e-02

Pos logit contributions
 spoiler+0.047
 sponge+0.040
 ramp+0.037
ro+0.036
 widened+0.033
Neg logit contributions
aw-0.067
iles-0.059
ese-0.056
una-0.056
ils-0.055
Tokenet
Token ID316
Feature activation+1.81e-02

Pos logit contributions
 spoiler+0.199
 sponge+0.174
 ramp+0.169
ro+0.164
 widened+0.160
Neg logit contributions
aw-0.145
iles-0.123
ups-0.113
ing-0.109
una-0.108
Token in
Token ID287
Feature activation+3.99e-02

Pos logit contributions
 spoiler+0.038
 sponge+0.033
 ramp+0.032
 prope+0.031
 widened+0.031
Neg logit contributions
aw-0.029
iles-0.025
ups-0.023
ing-0.023
ow-0.022
Token the
Token ID262
Feature activation+6.52e-04

Pos logit contributions
 spoiler+0.104
 ramp+0.092
 widened+0.090
ro+0.088
bes+0.087
Neg logit contributions
aw-0.043
 me-0.038
 us-0.035
iles-0.031
 that-0.030
Token bathroom
Token ID12436
Feature activation+7.11e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token is
Token ID318
Feature activation+6.32e-03

Pos logit contributions
 spoiler+0.154
 ramp+0.129
 sponge+0.129
 prope+0.127
 widened+0.124
Neg logit contributions
aw-0.113
iles-0.089
ups-0.089
ing-0.088
 me-0.086
Token Lily
Token ID20037
Feature activation+2.06e-02

Pos logit contributions
 spoiler+0.010
 sponge+0.008
 ramp+0.008
ro+0.007
 cloth+0.007
Neg logit contributions
aw-0.018
iles-0.017
ups-0.015
ese-0.015
ing-0.015
Token.
Token ID13
Feature activation+4.92e-03

Pos logit contributions
 spoiler+0.043
 sponge+0.037
 ramp+0.036
ro+0.035
 widened+0.035
Neg logit contributions
aw-0.033
iles-0.029
ups-0.026
ils-0.025
ing-0.025
Token She
Token ID1375
Feature activation+1.16e-02

Pos logit contributions
 spoiler+0.012
 ramp+0.010
ge+0.010
 prope+0.010
 sponge+0.010
Neg logit contributions
aw-0.006
iles-0.005
 me-0.005
una-0.004
ing-0.004
Token loved
Token ID6151
Feature activation+5.87e-03

Pos logit contributions
 spoiler+0.025
 sponge+0.022
 ramp+0.021
ro+0.020
 widened+0.020
Neg logit contributions
aw-0.018
iles-0.015
ups-0.014
ing-0.014
ils-0.014
Token the
Token ID262
Feature activation+7.09e-03

Pos logit contributions
 sponge+0.011
 spoiler+0.011
 cloth+0.011
 allowance+0.010
 plastic+0.010
Neg logit contributions
aw-0.010
iles-0.008
arling-0.008
ani-0.007
ing-0.007
Token holiday
Token ID9912
Feature activation+1.01e-01

Pos logit contributions
 spoiler+0.013
 sponge+0.011
 ramp+0.011
ro+0.010
 widened+0.010
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ing-0.011
ils-0.011
Token because
Token ID780
Feature activation-6.37e-03

Pos logit contributions
 spoiler+0.195
 sponge+0.172
 ramp+0.170
ro+0.161
 allowance+0.159
Neg logit contributions
aw-0.171
iles-0.151
una-0.134
ese-0.131
ups-0.131
Token she
Token ID673
Feature activation+2.96e-02

Pos logit contributions
aw+0.008
 that+0.007
 me+0.007
 us+0.007
 at+0.006
Neg logit contributions
 spoiler-0.016
 ramp-0.014
ro-0.014
bes-0.013
 widened-0.013
Token got
Token ID1392
Feature activation+1.77e-02

Pos logit contributions
 spoiler+0.064
 sponge+0.054
 ramp+0.052
 prope+0.051
ro+0.049
Neg logit contributions
aw-0.048
iles-0.040
ups-0.038
 me-0.038
ing-0.038
Token to
Token ID284
Feature activation+2.10e-02

Pos logit contributions
 spoiler+0.041
 sponge+0.036
 ramp+0.035
 allowance+0.034
 prope+0.034
Neg logit contributions
aw-0.024
iles-0.021
una-0.018
ils-0.018
ups-0.018
Token play
Token ID711
Feature activation+3.04e-02

Pos logit contributions
 spoiler+0.052
driver+0.047
 ramp+0.046
ro+0.045
bes+0.045
Neg logit contributions
aw-0.025
 me-0.023
 us-0.021
 for-0.019
 at-0.018
Token very
Token ID845
Feature activation+6.20e-03

Pos logit contributions
 spoiler+0.013
 sponge+0.011
 ramp+0.011
ro+0.010
 widened+0.010
Neg logit contributions
aw-0.011
iles-0.009
ups-0.009
ing-0.008
ils-0.008
Token special
Token ID2041
Feature activation-2.03e-02

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 widened+0.010
 prope+0.010
Neg logit contributions
aw-0.011
iles-0.009
ups-0.008
ils-0.008
ing-0.008
Token attic
Token ID46113
Feature activation+6.64e-02

Pos logit contributions
aw+0.037
iles+0.032
ing+0.029
ups+0.029
 me+0.029
Neg logit contributions
 spoiler-0.038
 sponge-0.031
 ramp-0.031
 widened-0.030
ro-0.029
Token.
Token ID13
Feature activation+6.26e-03

Pos logit contributions
 spoiler+0.149
 sponge+0.130
 ramp+0.128
ro+0.123
 widened+0.122
Neg logit contributions
aw-0.098
iles-0.081
ups-0.074
ing-0.072
ils-0.071
Token The
Token ID383
Feature activation+1.33e-02

Pos logit contributions
 spoiler+0.012
ge+0.011
 sponge+0.010
 tant+0.010
 prope+0.010
Neg logit contributions
aw-0.011
iles-0.009
 me-0.008
 lions-0.008
ow-0.008
Token attic
Token ID46113
Feature activation+1.12e-01

Pos logit contributions
 spoiler+0.029
 ramp+0.024
ro+0.024
ge+0.024
 sponge+0.024
Neg logit contributions
aw-0.020
 me-0.017
iles-0.016
 us-0.015
ups-0.015
Token was
Token ID373
Feature activation+3.24e-02

Pos logit contributions
 spoiler+0.228
 sponge+0.193
 ramp+0.187
 prope+0.181
ro+0.180
Neg logit contributions
aw-0.192
iles-0.158
ups-0.155
ing-0.152
ils-0.148
Token full
Token ID1336
Feature activation+2.70e-02

Pos logit contributions
 spoiler+0.076
 ramp+0.064
 sponge+0.063
ro+0.063
bes+0.062
Neg logit contributions
aw-0.041
 me-0.036
 for-0.035
 at-0.033
 by-0.033
Token of
Token ID286
Feature activation+7.95e-03

Pos logit contributions
 spoiler+0.057
 sponge+0.052
 ramp+0.050
ro+0.048
 cloth+0.047
Neg logit contributions
aw-0.042
iles-0.036
ils-0.032
ups-0.032
ese-0.032
Token fun
Token ID1257
Feature activation-1.20e-02

Pos logit contributions
 spoiler+0.017
 ramp+0.015
ro+0.015
 sponge+0.014
ge+0.014
Neg logit contributions
aw-0.012
 me-0.010
iles-0.010
 us-0.010
ups-0.009
Token things
Token ID1243
Feature activation-1.93e-02

Pos logit contributions
aw+0.015
ing+0.012
ups+0.012
iles+0.012
ils+0.011
Neg logit contributions
 spoiler-0.030
 sponge-0.025
 ramp-0.025
ro-0.025
 prope-0.024
Token moving
Token ID3867
Feature activation+3.57e-02

Pos logit contributions
 spoiler+0.069
 sponge+0.060
 ramp+0.060
ro+0.057
bes+0.056
Neg logit contributions
aw-0.046
iles-0.039
 me-0.036
ow-0.035
ing-0.035
Token,
Token ID11
Feature activation+2.28e-02

Pos logit contributions
 spoiler+0.088
 sponge+0.079
 ramp+0.078
bes+0.073
 widened+0.073
Neg logit contributions
aw-0.045
iles-0.034
ing-0.033
 me-0.033
ups-0.032
Token and
Token ID290
Feature activation+3.43e-03

Pos logit contributions
 spoiler+0.042
 sponge+0.036
 ramp+0.035
ge+0.034
 allowance+0.034
Neg logit contributions
aw-0.042
 me-0.035
iles-0.033
 that-0.033
 for-0.033
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.005
 widened+0.005
Neg logit contributions
aw-0.007
iles-0.006
ups-0.005
ils-0.005
ing-0.005
Token book
Token ID1492
Feature activation+8.65e-03

Pos logit contributions
 spoiler+0.037
 ramp+0.031
 sponge+0.030
ro+0.028
 allowance+0.028
Neg logit contributions
aw-0.043
iles-0.038
ups-0.036
ing-0.035
ils-0.035
Tokencase
Token ID7442
Feature activation+9.86e-02

Pos logit contributions
 spoiler+0.013
 ramp+0.011
 sponge+0.010
 widened+0.010
ro+0.010
Neg logit contributions
aw-0.019
iles-0.017
ups-0.016
ing-0.016
ese-0.016
Token said
Token ID531
Feature activation-2.00e-02

Pos logit contributions
 spoiler+0.193
 sponge+0.167
 ramp+0.161
 prope+0.154
ro+0.153
Neg logit contributions
aw-0.174
iles-0.146
ups-0.138
ing-0.134
ils-0.134
Token,
Token ID11
Feature activation+1.69e-02

Pos logit contributions
aw+0.035
iles+0.030
ups+0.028
ing+0.028
ils+0.027
Neg logit contributions
 spoiler-0.039
 sponge-0.034
 ramp-0.033
ro-0.031
 widened-0.031
Token �
Token ID6184
Feature activation+1.89e-02

Pos logit contributions
 spoiler+0.033
 sponge+0.028
 ramp+0.028
ro+0.026
 widened+0.026
Neg logit contributions
aw-0.030
iles-0.025
ups-0.023
ing-0.023
ils-0.023
Token�
Token ID95
Feature activation+3.44e-02

Pos logit contributions
 spoiler+0.035
 sponge+0.030
 ramp+0.029
ro+0.027
 widened+0.027
Neg logit contributions
aw-0.035
iles-0.030
ups-0.028
ing-0.028
ils-0.028
Token€
Token ID26391
Feature activation+1.60e-02

Pos logit contributions
 spoiler+0.061
 ramp+0.050
 sponge+0.050
ro+0.047
 widened+0.046
Neg logit contributions
aw-0.067
iles-0.059
ups-0.055
ing-0.054
ils-0.053
Token felt
Token ID2936
Feature activation+1.50e-02

Pos logit contributions
 spoiler+0.008
 sponge+0.007
 ramp+0.007
 widened+0.007
ro+0.007
Neg logit contributions
aw-0.009
iles-0.008
ups-0.007
ing-0.007
ese-0.007
Token his
Token ID465
Feature activation+4.05e-02

Pos logit contributions
 spoiler+0.033
 ramp+0.029
 sponge+0.029
ro+0.027
ge+0.027
Neg logit contributions
aw-0.024
iles-0.018
ups-0.017
ow-0.017
ils-0.017
Token tum
Token ID11814
Feature activation+3.14e-02

Pos logit contributions
 spoiler+0.068
 sponge+0.058
 ramp+0.056
ro+0.053
 prope+0.052
Neg logit contributions
aw-0.083
iles-0.072
ups-0.069
ing-0.067
ils-0.066
Tokenmy
Token ID1820
Feature activation+3.16e-02

Pos logit contributions
 spoiler+0.072
 sponge+0.063
 ramp+0.062
ro+0.059
 prope+0.059
Neg logit contributions
aw-0.045
iles-0.037
ups-0.034
ing-0.033
ils-0.032
Token ac
Token ID936
Feature activation+4.90e-02

Pos logit contributions
 spoiler+0.063
 sponge+0.054
 ramp+0.053
ro+0.050
 prope+0.049
Neg logit contributions
aw-0.055
iles-0.045
ing-0.044
ups-0.043
ow-0.043
Tokenhe
Token ID258
Feature activation+9.88e-02

Pos logit contributions
 spoiler+0.076
 sponge+0.062
 ramp+0.059
 prope+0.056
 widened+0.055
Neg logit contributions
aw-0.108
iles-0.094
ing-0.090
ups-0.090
ow-0.088
Token starting
Token ID3599
Feature activation+1.37e-04

Pos logit contributions
 spoiler+0.226
 sponge+0.195
 ramp+0.184
 prope+0.182
 widened+0.176
Neg logit contributions
aw-0.140
iles-0.116
ing-0.112
 me-0.111
ils-0.108
Token to
Token ID284
Feature activation+2.26e-03

Pos logit contributions
 spoiler+0.000
 sponge+0.000
 ramp+0.000
ro+0.000
 widened+0.000
Neg logit contributions
aw-0.000
iles-0.000
ups-0.000
ing-0.000
ils-0.000
Token feel
Token ID1254
Feature activation+3.09e-03

Pos logit contributions
 spoiler+0.004
 sponge+0.004
 ramp+0.004
bes+0.003
 allowance+0.003
Neg logit contributions
aw-0.004
 me-0.003
iles-0.003
 us-0.003
ow-0.003
Token a
Token ID257
Feature activation+1.36e-02

Pos logit contributions
 spoiler+0.007
 ramp+0.006
 sponge+0.006
ro+0.005
 prope+0.005
Neg logit contributions
aw-0.005
iles-0.004
ups-0.004
 me-0.004
ing-0.004
Token bit
Token ID1643
Feature activation-2.88e-02

Pos logit contributions
 spoiler+0.026
 ramp+0.021
 sponge+0.021
bes+0.020
ro+0.020
Neg logit contributions
aw-0.024
 me-0.020
ing-0.019
iles-0.019
una-0.018
Token they
Token ID484
Feature activation-1.64e-02

Pos logit contributions
aw+0.028
iles+0.024
ups+0.022
ing+0.022
ils+0.022
Neg logit contributions
 spoiler-0.029
 sponge-0.024
 ramp-0.024
ro-0.022
 widened-0.022
Token see
Token ID766
Feature activation+1.38e-02

Pos logit contributions
aw+0.028
iles+0.024
 me+0.022
 us+0.022
ils+0.021
Neg logit contributions
 spoiler-0.033
 ramp-0.028
 sponge-0.028
bes-0.027
 prope-0.027
Token a
Token ID257
Feature activation+4.69e-03

Pos logit contributions
 spoiler+0.025
 sponge+0.022
 ramp+0.020
 cloth+0.020
 widened+0.020
Neg logit contributions
aw-0.024
iles-0.023
ups-0.021
ils-0.021
ese-0.021
Token big
Token ID1263
Feature activation-8.70e-03

Pos logit contributions
 spoiler+0.009
 sponge+0.007
 ramp+0.007
ro+0.007
 prope+0.007
Neg logit contributions
aw-0.009
iles-0.008
ups-0.007
ils-0.007
ow-0.007
Token cake
Token ID12187
Feature activation-3.77e-02

Pos logit contributions
aw+0.017
iles+0.015
ing+0.013
 me+0.013
ups+0.013
Neg logit contributions
 spoiler-0.015
 sponge-0.013
 ramp-0.012
 widened-0.012
ro-0.012
Token on
Token ID319
Feature activation+4.11e-02

Pos logit contributions
aw+0.061
iles+0.054
ups+0.050
ow+0.048
una+0.047
Neg logit contributions
 spoiler-0.074
 sponge-0.067
 ramp-0.067
kin-0.062
ro-0.062
Token the
Token ID262
Feature activation+6.69e-04

Pos logit contributions
 spoiler+0.095
 sponge+0.083
 ramp+0.082
ro+0.078
 prope+0.078
Neg logit contributions
aw-0.058
iles-0.047
ups-0.042
 me-0.042
una-0.041
Token table
Token ID3084
Feature activation+2.62e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 prope+0.001
Neg logit contributions
aw-0.002
iles-0.002
ups-0.001
ing-0.001
ils-0.001
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
 spoiler+0.056
 sponge+0.049
 ramp+0.048
ro+0.046
 widened+0.045
Neg logit contributions
aw-0.040
iles-0.034
ups-0.031
ing-0.030
ils-0.030
Token It
Token ID632
Feature activation+1.22e-02

Pos logit contributions
 spoiler+0.031
ge+0.028
 ramp+0.026
 prope+0.026
 sponge+0.024
Neg logit contributions
aw-0.030
 me-0.024
iles-0.023
uck-0.023
una-0.023
Token is
Token ID318
Feature activation+7.00e-03

Pos logit contributions
 spoiler+0.027
 sponge+0.023
ge+0.022
 ramp+0.022
 prope+0.022
Neg logit contributions
aw-0.019
iles-0.015
 me-0.015
 us-0.014
ils-0.014
Token to
Token ID284
Feature activation-1.12e-02

Pos logit contributions
 spoiler+0.033
 sponge+0.030
 ramp+0.029
 cloth+0.029
 allowance+0.028
Neg logit contributions
aw-0.023
iles-0.019
ani-0.017
ese-0.017
ing-0.017
Token rest
Token ID1334
Feature activation-3.56e-02

Pos logit contributions
aw+0.018
iles+0.015
 me+0.015
ow+0.015
ups+0.014
Neg logit contributions
 spoiler-0.023
 sponge-0.020
 ramp-0.020
 prope-0.019
 widened-0.019
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
aw+0.041
 me+0.031
iles+0.031
ing+0.030
 for+0.029
Neg logit contributions
 spoiler-0.094
 ramp-0.081
 sponge-0.080
 prope-0.078
 widened-0.078
Token Max
Token ID5436
Feature activation+3.26e-02

Pos logit contributions
 spoiler+0.016
 sponge+0.013
 ramp+0.013
ro+0.012
 widened+0.012
Neg logit contributions
aw-0.023
iles-0.020
ups-0.019
ils-0.019
ing-0.019
Token was
Token ID373
Feature activation+2.12e-02

Pos logit contributions
 spoiler+0.049
 sponge+0.043
 ramp+0.042
 widened+0.042
ro+0.042
Neg logit contributions
aw-0.065
iles-0.061
ing-0.055
ups-0.054
ese-0.052
Token not
Token ID407
Feature activation+3.93e-02

Pos logit contributions
 spoiler+0.037
 sponge+0.031
 ramp+0.030
 prope+0.028
ro+0.028
Neg logit contributions
aw-0.042
iles-0.037
ups-0.035
ing-0.034
ils-0.034
Token happy
Token ID3772
Feature activation+8.69e-03

Pos logit contributions
 spoiler+0.084
 ramp+0.072
 sponge+0.072
 widened+0.068
 prope+0.067
Neg logit contributions
aw-0.063
iles-0.054
ow-0.049
una-0.049
ing-0.049
Token.
Token ID13
Feature activation+4.18e-03

Pos logit contributions
 spoiler+0.014
ro+0.013
 sponge+0.012
ge+0.012
 ramp+0.012
Neg logit contributions
aw-0.017
iles-0.015
ups-0.014
ani-0.014
ese-0.013
Token He
Token ID679
Feature activation+7.93e-03

Pos logit contributions
 spoiler+0.006
 sponge+0.005
 ramp+0.005
ro+0.004
 widened+0.004
Neg logit contributions
aw-0.010
iles-0.009
ils-0.008
ups-0.008
ing-0.008
Token started
Token ID2067
Feature activation-1.04e-02

Pos logit contributions
 spoiler+0.013
 sponge+0.011
 ramp+0.011
 widened+0.010
ro+0.010
Neg logit contributions
aw-0.016
iles-0.014
ing-0.014
ups-0.013
ese-0.013
Token to
Token ID284
Feature activation+1.19e-03

Pos logit contributions
aw+0.015
iles+0.012
ils+0.011
ese+0.010
ups+0.010
Neg logit contributions
 spoiler-0.023
 sponge-0.022
ro-0.021
 ramp-0.021
 allowance-0.020
Token of
Token ID286
Feature activation+4.89e-02

Pos logit contributions
 spoiler+0.014
 sponge+0.012
 ramp+0.011
 widened+0.011
ro+0.011
Neg logit contributions
aw-0.012
iles-0.011
ups-0.010
ow-0.010
una-0.010
Token the
Token ID262
Feature activation+5.41e-03

Pos logit contributions
 spoiler+0.099
 sponge+0.085
 ramp+0.083
 prope+0.081
 widened+0.079
Neg logit contributions
aw-0.081
iles-0.068
ups-0.065
ing-0.064
ese-0.063
Token animals
Token ID4695
Feature activation-2.22e-02

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 prope+0.010
 allowance+0.009
Neg logit contributions
aw-0.009
iles-0.007
ups-0.007
ing-0.006
ow-0.006
Token along
Token ID1863
Feature activation+4.75e-02

Pos logit contributions
aw+0.037
iles+0.032
una+0.029
ups+0.029
ese+0.028
Neg logit contributions
 spoiler-0.043
 sponge-0.039
 ramp-0.037
 widened-0.037
ge-0.035
Token the
Token ID262
Feature activation-1.59e-02

Pos logit contributions
 spoiler+0.092
 sponge+0.080
 ramp+0.079
ro+0.077
 widened+0.077
Neg logit contributions
aw-0.080
iles-0.070
ups-0.066
ese-0.064
ing-0.063
Token way
Token ID835
Feature activation+2.51e-02

Pos logit contributions
aw+0.037
ow+0.032
uck+0.031
iles+0.031
ups+0.031
Neg logit contributions
 spoiler-0.022
 ramp-0.021
 prope-0.018
 shelf-0.018
 sponge-0.017
Token.
Token ID13
Feature activation-8.00e-03

Pos logit contributions
 spoiler+0.052
 sponge+0.046
 ramp+0.045
ro+0.043
 widened+0.042
Neg logit contributions
aw-0.040
iles-0.034
ils-0.031
ups-0.031
una-0.030
Token When
Token ID1649
Feature activation-2.26e-02

Pos logit contributions
aw+0.016
iles+0.015
ils+0.014
ups+0.014
ese+0.014
Neg logit contributions
 spoiler-0.013
 sponge-0.011
 ramp-0.011
ro-0.010
 prope-0.010
Token they
Token ID484
Feature activation-3.29e-02

Pos logit contributions
aw+0.044
iles+0.039
ese+0.037
una+0.036
ils+0.036
Neg logit contributions
 spoiler-0.039
 sponge-0.033
 ramp-0.033
 prope-0.031
 allowance-0.031
Token arrived
Token ID5284
Feature activation+1.54e-02

Pos logit contributions
aw+0.070
iles+0.062
ups+0.061
ing+0.059
ese+0.059
Neg logit contributions
 spoiler-0.048
 widened-0.042
 sponge-0.040
ro-0.038
 ramp-0.038
Token at
Token ID379
Feature activation-1.04e-02

Pos logit contributions
 spoiler+0.028
 sponge+0.024
 ramp+0.023
 widened+0.023
 allowance+0.023
Neg logit contributions
aw-0.026
iles-0.025
una-0.023
ese-0.022
ups-0.022
Token before
Token ID878
Feature activation-1.91e-02

Pos logit contributions
aw+0.026
iles+0.022
ing+0.020
ups+0.020
 me+0.020
Neg logit contributions
 spoiler-0.029
 sponge-0.024
 ramp-0.023
 widened-0.022
 prope-0.022
Token he
Token ID339
Feature activation-1.91e-02

Pos logit contributions
aw+0.031
iles+0.027
 me+0.025
ups+0.024
 us+0.024
Neg logit contributions
 spoiler-0.039
 sponge-0.033
 ramp-0.032
ro-0.032
 tant-0.032
Token returned
Token ID4504
Feature activation+1.22e-02

Pos logit contributions
aw+0.035
iles+0.030
ing+0.029
ups+0.028
una+0.028
Neg logit contributions
 spoiler-0.033
 ramp-0.028
ro-0.028
 sponge-0.027
 allowance-0.027
Token to
Token ID284
Feature activation-2.99e-03

Pos logit contributions
 spoiler+0.029
 sponge+0.026
 ramp+0.025
ro+0.024
 widened+0.024
Neg logit contributions
aw-0.016
iles-0.013
ups-0.012
ing-0.012
ils-0.011
Token his
Token ID465
Feature activation+5.75e-04

Pos logit contributions
aw+0.004
iles+0.003
 me+0.003
 us+0.003
ups+0.003
Neg logit contributions
 spoiler-0.007
 sponge-0.006
 ramp-0.006
ro-0.006
 widened-0.006
Token cozy
Token ID37438
Feature activation+4.01e-02

Pos logit contributions
 spoiler+0.001
 ramp+0.001
 sponge+0.001
 allowance+0.001
 vest+0.001
Neg logit contributions
aw-0.001
iles-0.001
ing-0.001
ils-0.001
ups-0.001
Token cave
Token ID11527
Feature activation+2.15e-02

Pos logit contributions
 spoiler+0.089
 sponge+0.076
 ramp+0.074
 widened+0.073
 prope+0.070
Neg logit contributions
aw-0.060
iles-0.050
ups-0.047
 me-0.046
ing-0.046
Token in
Token ID287
Feature activation+2.37e-03

Pos logit contributions
 spoiler+0.046
 sponge+0.041
 ramp+0.040
ro+0.038
 widened+0.038
Neg logit contributions
aw-0.033
iles-0.028
ups-0.025
ing-0.024
ils-0.024
Token the
Token ID262
Feature activation-2.64e-02

Pos logit contributions
 spoiler+0.006
 sponge+0.006
ro+0.005
 ramp+0.005
driver+0.005
Neg logit contributions
aw-0.002
 me-0.002
 us-0.002
 that-0.002
 for-0.002
Token forest
Token ID8222
Feature activation-1.39e-02

Pos logit contributions
aw+0.047
iles+0.042
ils+0.040
ups+0.039
ing+0.039
Neg logit contributions
 spoiler-0.049
 ramp-0.042
 sponge-0.042
 vest-0.039
 cloth-0.039
Token.
Token ID13
Feature activation-3.97e-03

Pos logit contributions
aw+0.022
iles+0.018
ups+0.016
ing+0.016
ils+0.016
Neg logit contributions
 spoiler-0.030
 sponge-0.026
 ramp-0.026
ro-0.025
 widened-0.024
Token new
Token ID649
Feature activation+3.51e-03

Pos logit contributions
 spoiler+0.020
 sponge+0.018
 ramp+0.017
 widened+0.017
 prope+0.016
Neg logit contributions
aw-0.012
iles-0.010
ing-0.009
ups-0.009
 me-0.009
Token toy
Token ID13373
Feature activation+3.59e-03

Pos logit contributions
 spoiler+0.007
 sponge+0.006
 widened+0.005
 ramp+0.005
 prope+0.005
Neg logit contributions
aw-0.006
iles-0.005
 me-0.005
una-0.005
ing-0.005
Token.
Token ID13
Feature activation+1.52e-03

Pos logit contributions
 spoiler+0.007
 ramp+0.006
 sponge+0.006
 allowance+0.005
ro+0.005
Neg logit contributions
aw-0.007
iles-0.006
ups-0.005
ese-0.005
ils-0.005
Token Lily
Token ID20037
Feature activation+2.64e-02

Pos logit contributions
 spoiler+0.003
 sponge+0.002
 ramp+0.002
 prope+0.002
ro+0.002
Neg logit contributions
aw-0.003
iles-0.002
ups-0.002
 me-0.002
ing-0.002
Token said
Token ID531
Feature activation-1.58e-02

Pos logit contributions
 spoiler+0.049
 sponge+0.042
 ramp+0.041
 widened+0.039
ro+0.039
Neg logit contributions
aw-0.048
iles-0.042
ups-0.038
ing-0.038
ils-0.037
Token yes
Token ID3763
Feature activation+3.46e-02

Pos logit contributions
aw+0.039
iles+0.035
ups+0.033
ing+0.033
ils+0.033
Neg logit contributions
 spoiler-0.020
 sponge-0.015
 ramp-0.015
ro-0.014
 widened-0.013
Token and
Token ID290
Feature activation+1.90e-03

Pos logit contributions
 spoiler+0.052
 ramp+0.042
 sponge+0.042
ro+0.041
 widened+0.040
Neg logit contributions
aw-0.072
iles-0.068
ils-0.061
ups-0.060
ese-0.059
Token they
Token ID484
Feature activation+5.80e-03

Pos logit contributions
 spoiler+0.003
 sponge+0.003
 ramp+0.003
ro+0.003
 widened+0.003
Neg logit contributions
aw-0.004
iles-0.003
ups-0.003
ing-0.003
ese-0.003
Token went
Token ID1816
Feature activation+5.75e-03

Pos logit contributions
 spoiler+0.013
 sponge+0.012
 ramp+0.011
 widened+0.011
 prope+0.011
Neg logit contributions
aw-0.008
iles-0.007
ups-0.007
ing-0.006
una-0.006
Token to
Token ID284
Feature activation-8.25e-03

Pos logit contributions
 spoiler+0.014
 sponge+0.013
 ramp+0.013
ro+0.012
 allowance+0.012
Neg logit contributions
aw-0.007
iles-0.006
ups-0.005
una-0.005
ese-0.005
Token Emma
Token ID18966
Feature activation-7.85e-03

Pos logit contributions
aw+0.010
 me+0.008
iles+0.008
 us+0.008
ow+0.008
Neg logit contributions
 spoiler-0.020
 sponge-0.017
ro-0.017
 ramp-0.017
kin-0.016
Token ground
Token ID2323
Feature activation+8.47e-03

Pos logit contributions
 spoiler+0.009
 ramp+0.008
 sponge+0.008
ro+0.007
 prope+0.007
Neg logit contributions
aw-0.013
iles-0.011
ups-0.011
ing-0.010
ils-0.010
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 spoiler+0.017
 sponge+0.015
 ramp+0.015
ro+0.014
 widened+0.014
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ils-0.011
ese-0.011
Token He
Token ID679
Feature activation+1.98e-02

Pos logit contributions
 spoiler+0.033
 prope+0.031
ge+0.030
kin+0.030
bes+0.029
Neg logit contributions
aw-0.027
 me-0.025
uck-0.023
ing-0.023
 by-0.023
Token picked
Token ID6497
Feature activation+7.88e-03

Pos logit contributions
 spoiler+0.039
 sponge+0.033
 ramp+0.032
 prope+0.030
ro+0.030
Neg logit contributions
aw-0.035
iles-0.030
ups-0.028
ils-0.028
 me-0.028
Token it
Token ID340
Feature activation+8.78e-03

Pos logit contributions
 spoiler+0.013
 sponge+0.011
 ramp+0.011
 prope+0.010
ro+0.010
Neg logit contributions
aw-0.016
iles-0.014
ups-0.013
ese-0.013
ils-0.013
Token up
Token ID510
Feature activation+9.92e-03

Pos logit contributions
 spoiler+0.018
 sponge+0.016
 ramp+0.016
ro+0.015
 cloth+0.015
Neg logit contributions
aw-0.014
iles-0.012
ups-0.011
ese-0.011
ils-0.011
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
 spoiler+0.019
 sponge+0.016
 ramp+0.016
 prope+0.015
 widened+0.015
Neg logit contributions
aw-0.018
iles-0.016
ups-0.015
ing-0.015
ow-0.014
Token showed
Token ID3751
Feature activation+2.01e-02

Pos logit contributions
 spoiler+0.031
 sponge+0.027
 ramp+0.026
ro+0.026
 prope+0.025
Neg logit contributions
aw-0.022
iles-0.017
ups-0.017
 me-0.017
ing-0.017
Token it
Token ID340
Feature activation+6.18e-04

Pos logit contributions
 spoiler+0.031
 sponge+0.024
 ramp+0.024
 prope+0.022
 cloth+0.022
Neg logit contributions
aw-0.043
iles-0.040
ups-0.037
ese-0.037
ils-0.036
Token to
Token ID284
Feature activation+3.87e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 flat+0.001
 allowance+0.001
 ramp+0.001
Neg logit contributions
aw-0.001
iles-0.001
ese-0.001
ani-0.001
wl-0.001
Token his
Token ID465
Feature activation+2.95e-02

Pos logit contributions
 spoiler+0.058
 sponge+0.046
 ramp+0.045
 cloth+0.040
 prope+0.040
Neg logit contributions
aw-0.089
iles-0.079
ups-0.076
ing-0.074
ils-0.073
Token to
Token ID284
Feature activation-2.15e-03

Pos logit contributions
aw+0.017
iles+0.013
ups+0.011
ing+0.011
ils+0.010
Neg logit contributions
 spoiler-0.042
 sponge-0.038
 ramp-0.037
ro-0.036
 prope-0.036
Token her
Token ID607
Feature activation-8.01e-03

Pos logit contributions
aw+0.003
iles+0.003
ups+0.002
ing+0.002
ese+0.002
Neg logit contributions
 spoiler-0.005
 sponge-0.004
 ramp-0.004
ro-0.004
 prope-0.004
Token own
Token ID898
Feature activation+5.01e-03

Pos logit contributions
aw+0.013
iles+0.011
ups+0.010
ing+0.010
ils+0.010
Neg logit contributions
 spoiler-0.017
 sponge-0.015
 ramp-0.014
 widened-0.014
ro-0.014
Token warning
Token ID6509
Feature activation-5.20e-03

Pos logit contributions
 spoiler+0.011
 sponge+0.009
 ramp+0.009
 widened+0.009
ro+0.009
Neg logit contributions
aw-0.008
iles-0.006
ups-0.006
ing-0.006
ils-0.006
Token.
Token ID13
Feature activation-9.87e-03

Pos logit contributions
aw+0.009
iles+0.008
ese+0.007
ups+0.007
ils+0.007
Neg logit contributions
 spoiler-0.010
 sponge-0.009
ro-0.009
 ramp-0.008
 allowance-0.008
Token From
Token ID3574
Feature activation+5.79e-03

Pos logit contributions
aw+0.016
iles+0.013
ups+0.013
 me+0.012
ing+0.012
Neg logit contributions
 spoiler-0.021
 sponge-0.018
 ramp-0.018
 prope-0.017
 widened-0.016
Token then
Token ID788
Feature activation-1.90e-02

Pos logit contributions
 spoiler+0.011
 cloth+0.010
 sponge+0.009
 widened+0.009
 prope+0.009
Neg logit contributions
iles-0.009
aw-0.008
ups-0.007
ese-0.007
ils-0.007
Token on
Token ID319
Feature activation+2.08e-03

Pos logit contributions
aw+0.032
iles+0.030
ups+0.027
ese+0.026
una+0.025
Neg logit contributions
 spoiler-0.037
 sponge-0.032
 ramp-0.031
 allowance-0.030
 prope-0.030
Token,
Token ID11
Feature activation-3.30e-02

Pos logit contributions
 spoiler+0.004
 sponge+0.004
 ramp+0.004
ro+0.003
 widened+0.003
Neg logit contributions
aw-0.003
iles-0.003
ups-0.003
ing-0.003
ils-0.003
Token she
Token ID673
Feature activation-1.97e-02

Pos logit contributions
aw+0.053
ups+0.044
ing+0.043
 me+0.043
iles+0.043
Neg logit contributions
 spoiler-0.070
 ramp-0.059
 sponge-0.057
 prope-0.056
 widened-0.054
Token made
Token ID925
Feature activation+1.03e-02

Pos logit contributions
aw+0.027
iles+0.022
ups+0.020
ing+0.019
una+0.019
Neg logit contributions
 spoiler-0.045
 sponge-0.040
 ramp-0.040
ro-0.038
 prope-0.038
Token She
Token ID1375
Feature activation+1.39e-02

Pos logit contributions
 spoiler+0.044
 sponge+0.038
 ramp+0.037
ro+0.035
 widened+0.035
Neg logit contributions
aw-0.038
iles-0.033
ups-0.030
ing-0.030
ils-0.029
Token wanted
Token ID2227
Feature activation+3.10e-02

Pos logit contributions
 spoiler+0.027
 ramp+0.022
 sponge+0.022
ro+0.020
 prope+0.020
Neg logit contributions
aw-0.026
iles-0.022
ups-0.021
ils-0.020
ing-0.020
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
 spoiler+0.064
 sponge+0.062
 cloth+0.059
ro+0.056
 ramp+0.056
Neg logit contributions
aw-0.047
iles-0.041
ese-0.037
ils-0.035
ani-0.034
Token make
Token ID787
Feature activation+6.62e-03

Pos logit contributions
aw+0.018
 me+0.017
ow+0.016
iles+0.015
 that+0.015
Neg logit contributions
 spoiler-0.023
 ramp-0.019
rily-0.019
 sponge-0.019
 widened-0.019
Token it
Token ID340
Feature activation+1.21e-02

Pos logit contributions
 spoiler+0.012
 sponge+0.011
 ramp+0.010
 cloth+0.010
 prope+0.010
Neg logit contributions
aw-0.012
iles-0.011
ese-0.010
ups-0.010
ils-0.010
Token soft
Token ID2705
Feature activation+5.98e-03

Pos logit contributions
 spoiler+0.026
 sponge+0.021
 ramp+0.021
ro+0.021
bes+0.021
Neg logit contributions
aw-0.018
 for-0.015
 me-0.015
ing-0.015
ow-0.015
Token and
Token ID290
Feature activation+3.22e-02

Pos logit contributions
 spoiler+0.014
 sponge+0.012
 ramp+0.012
 allowance+0.012
ge+0.011
Neg logit contributions
aw-0.009
ing-0.007
 me-0.006
iles-0.006
 for-0.006
Token easy
Token ID2562
Feature activation+4.52e-03

Pos logit contributions
 spoiler+0.063
 ramp+0.053
ge+0.052
ro+0.051
rily+0.049
Neg logit contributions
 me-0.050
aw-0.049
 for-0.045
 that-0.045
ups-0.044
Token to
Token ID284
Feature activation+8.35e-03

Pos logit contributions
 spoiler+0.010
 sponge+0.009
 ramp+0.008
ro+0.008
 cloth+0.008
Neg logit contributions
aw-0.007
iles-0.006
ese-0.005
ups-0.005
una-0.005
Token work
Token ID670
Feature activation-1.08e-02

Pos logit contributions
 spoiler+0.015
 sponge+0.013
 ramp+0.013
 widened+0.012
 prope+0.012
Neg logit contributions
aw-0.015
iles-0.013
 me-0.013
ow-0.013
ups-0.012
Token with
Token ID351
Feature activation-2.32e-02

Pos logit contributions
aw+0.015
iles+0.011
ups+0.011
ing+0.011
ils+0.011
Neg logit contributions
 spoiler-0.026
 sponge-0.022
 ramp-0.022
 prope-0.021
 widened-0.021
Token.
Token ID13
Feature activation+9.57e-03

Pos logit contributions
aw+0.049
iles+0.043
ils+0.039
una+0.038
ups+0.038
Neg logit contributions
 spoiler-0.055
 sponge-0.049
 ramp-0.048
ro-0.046
 widened-0.045
Token They
Token ID1119
Feature activation+1.74e-03

Pos logit contributions
 spoiler+0.021
 prope+0.018
 ramp+0.017
ge+0.017
 sponge+0.016
Neg logit contributions
aw-0.015
iles-0.013
 me-0.012
ing-0.011
una-0.011
Token had
Token ID550
Feature activation+1.18e-02

Pos logit contributions
 spoiler+0.004
 ramp+0.003
ro+0.003
 sponge+0.003
rily+0.003
Neg logit contributions
aw-0.003
iles-0.002
ils-0.002
 me-0.002
 us-0.002
Token a
Token ID257
Feature activation+8.68e-03

Pos logit contributions
 spoiler+0.026
 sponge+0.022
 ramp+0.022
ro+0.021
 widened+0.021
Neg logit contributions
aw-0.019
iles-0.016
ups-0.014
ing-0.014
ils-0.014
Token castle
Token ID16669
Feature activation+1.74e-02

Pos logit contributions
 spoiler+0.017
ro+0.014
 sponge+0.014
 widened+0.014
ge+0.014
Neg logit contributions
aw-0.015
iles-0.012
 me-0.012
ing-0.011
ils-0.011
Token,
Token ID11
Feature activation+4.55e-03

Pos logit contributions
 spoiler+0.030
 ramp+0.027
 sponge+0.026
ro+0.026
 widened+0.024
Neg logit contributions
aw-0.033
iles-0.029
ups-0.027
ils-0.026
una-0.026
Token a
Token ID257
Feature activation+3.45e-02

Pos logit contributions
 spoiler+0.012
 tant+0.011
rily+0.011
ge+0.010
 ramp+0.010
Neg logit contributions
aw-0.005
 lions-0.004
iles-0.003
 that-0.003
una-0.003
Token king
Token ID5822
Feature activation+6.05e-04

Pos logit contributions
 spoiler+0.060
ge+0.048
 sponge+0.048
ro+0.047
 widened+0.047
Neg logit contributions
aw-0.065
iles-0.055
 me-0.054
ing-0.051
 lions-0.051
Token,
Token ID11
Feature activation+2.08e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token a
Token ID257
Feature activation+4.29e-02

Pos logit contributions
 spoiler+0.049
 sponge+0.042
 ramp+0.042
ro+0.040
 tant+0.040
Neg logit contributions
aw-0.030
iles-0.024
ow-0.021
una-0.021
 me-0.021
Token queen
Token ID16599
Feature activation+4.51e-03

Pos logit contributions
 spoiler+0.080
 sponge+0.067
 ramp+0.065
 widened+0.064
ro+0.064
Neg logit contributions
aw-0.077
iles-0.065
 me-0.061
ups-0.059
ing-0.059
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
aw+0.008
iles+0.007
ow+0.006
ups+0.006
ese+0.006
Neg logit contributions
 spoiler-0.011
 sponge-0.009
 ramp-0.009
 cloth-0.009
ro-0.009
Token things
Token ID1243
Feature activation+2.10e-02

Pos logit contributions
 spoiler+0.039
 ramp+0.032
 sponge+0.032
 widened+0.032
ro+0.032
Neg logit contributions
aw-0.026
iles-0.021
ups-0.020
 me-0.020
 us-0.019
Token in
Token ID287
Feature activation+3.72e-02

Pos logit contributions
 spoiler+0.054
 sponge+0.048
 ramp+0.047
ro+0.045
 widened+0.045
Neg logit contributions
aw-0.025
iles-0.019
ups-0.017
ing-0.017
ils-0.016
Token our
Token ID674
Feature activation+1.64e-03

Pos logit contributions
 spoiler+0.082
 ramp+0.072
 sponge+0.070
 widened+0.070
ro+0.067
Neg logit contributions
aw-0.054
iles-0.048
 us-0.045
 me-0.045
ups-0.042
Token home
Token ID1363
Feature activation+3.33e-02

Pos logit contributions
 spoiler+0.003
 sponge+0.003
 ramp+0.003
ro+0.003
 prope+0.003
Neg logit contributions
aw-0.003
iles-0.002
ups-0.002
ing-0.002
una-0.002
Token.
Token ID13
Feature activation-5.72e-03

Pos logit contributions
 spoiler+0.077
 sponge+0.067
 ramp+0.066
 prope+0.063
 widened+0.063
Neg logit contributions
aw-0.048
iles-0.039
ups-0.037
ing-0.036
ils-0.035
Token<|endoftext|>
Token ID50256
Feature activation-3.90e-02

Pos logit contributions
aw+0.010
iles+0.008
ups+0.008
ing+0.008
ils+0.008
Neg logit contributions
 spoiler-0.011
 sponge-0.010
 ramp-0.010
ro-0.009
 widened-0.009
TokenOnce
Token ID7454
Feature activation+6.47e-04

Pos logit contributions
aw+0.061
iles+0.051
ups+0.047
ing+0.045
ils+0.045
Neg logit contributions
 spoiler-0.084
 sponge-0.073
 ramp-0.072
ro-0.068
 widened-0.068
Token there
Token ID612
Feature activation+2.27e-02

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token was
Token ID373
Feature activation-4.81e-02

Pos logit contributions
 spoiler+0.052
 sponge+0.045
 ramp+0.044
 prope+0.044
bes+0.042
Neg logit contributions
aw-0.034
 me-0.027
iles-0.026
ow-0.026
ing-0.026
Token a
Token ID257
Feature activation-7.10e-03

Pos logit contributions
aw+0.062
iles+0.043
una+0.040
ow+0.038
 me+0.038
Neg logit contributions
 spoiler-0.126
 ramp-0.112
ro-0.106
 sponge-0.106
ge-0.105
Token finally
Token ID3443
Feature activation-8.70e-04

Pos logit contributions
aw+0.065
iles+0.057
ups+0.053
ese+0.052
una+0.052
Neg logit contributions
 spoiler-0.060
 sponge-0.053
 ramp-0.050
 widened-0.049
 allowance-0.047
Token,
Token ID11
Feature activation-3.30e-02

Pos logit contributions
aw+0.002
iles+0.001
ups+0.001
ese+0.001
ing+0.001
Neg logit contributions
 spoiler-0.002
 sponge-0.001
 ramp-0.001
 widened-0.001
ro-0.001
Token in
Token ID287
Feature activation+3.20e-03

Pos logit contributions
aw+0.060
iles+0.052
ups+0.049
una+0.047
ils+0.047
Neg logit contributions
 spoiler-0.061
 sponge-0.054
 ramp-0.051
 widened-0.050
 allowance-0.048
Token a
Token ID257
Feature activation-7.83e-03

Pos logit contributions
 spoiler+0.008
 ramp+0.007
 sponge+0.007
ro+0.007
 prope+0.006
Neg logit contributions
aw-0.004
iles-0.003
 me-0.003
 us-0.003
ils-0.003
Token nearby
Token ID6716
Feature activation-8.07e-03

Pos logit contributions
aw+0.015
iles+0.013
ups+0.012
ing+0.012
ow+0.012
Neg logit contributions
 spoiler-0.014
 sponge-0.012
 ramp-0.012
 widened-0.011
ro-0.011
Token bush
Token ID24484
Feature activation-5.85e-02

Pos logit contributions
aw+0.014
iles+0.013
ups+0.013
ils+0.013
ing+0.013
Neg logit contributions
 spoiler-0.014
 ramp-0.013
 sponge-0.012
ro-0.012
 cloth-0.012
Token,
Token ID11
Feature activation+7.05e-04

Pos logit contributions
aw+0.103
iles+0.098
una+0.088
ils+0.088
ese+0.086
Neg logit contributions
 spoiler-0.099
 sponge-0.093
 widened-0.089
 ramp-0.087
ro-0.086
Token they
Token ID484
Feature activation+1.18e-04

Pos logit contributions
 spoiler+0.001
 sponge+0.001
 ramp+0.001
ro+0.001
 widened+0.001
Neg logit contributions
aw-0.001
iles-0.001
ups-0.001
ing-0.001
ils-0.001
Token saw
Token ID2497
Feature activation+2.78e-03

Pos logit contributions
 spoiler+0.000
 sponge+0.000
 widened+0.000
 ramp+0.000
 prope+0.000
Neg logit contributions
aw-0.000
iles-0.000
ups-0.000
una-0.000
ing-0.000
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
 spoiler+0.005
 sponge+0.004
 widened+0.004
 cloth+0.004
 prope+0.004
Neg logit contributions
aw-0.005
iles-0.005
ups-0.004
ani-0.004
ow-0.004
Token beautiful
Token ID4950
Feature activation-2.34e-02

Pos logit contributions
 spoiler+0.020
 sponge+0.017
 ramp+0.016
ro+0.015
 widened+0.015
Neg logit contributions
aw-0.019
iles-0.017
ups-0.015
ing-0.015
ils-0.015
Token"
Token ID1
Feature activation+4.29e-03

Pos logit contributions
 spoiler+0.015
 sponge+0.013
 ramp+0.012
ro+0.012
 widened+0.012
Neg logit contributions
aw-0.012
iles-0.010
ups-0.009
ing-0.009
ils-0.009
TokenYou
Token ID1639
Feature activation-1.36e-02

Pos logit contributions
 spoiler+0.010
 sponge+0.009
 ramp+0.009
ro+0.008
 widened+0.008
Neg logit contributions
aw-0.006
iles-0.005
ups-0.005
ing-0.004
ils-0.004
Token're
Token ID821
Feature activation-1.51e-02

Pos logit contributions
aw+0.025
iles+0.022
ups+0.020
ing+0.020
ils+0.020
Neg logit contributions
 spoiler-0.026
 sponge-0.022
 ramp-0.021
ro-0.020
 widened-0.020
Token welcome
Token ID7062
Feature activation-1.79e-02

Pos logit contributions
aw+0.029
ups+0.025
ing+0.024
iles+0.024
ils+0.023
Neg logit contributions
 spoiler-0.027
 sponge-0.024
kin-0.023
 ramp-0.022
 cloth-0.021
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
aw+0.037
iles+0.033
ups+0.031
ils+0.031
ing+0.029
Neg logit contributions
 spoiler-0.028
 sponge-0.024
 ramp-0.022
ro-0.022
 allowance-0.021
Token sweet
Token ID6029
Feature activation-6.24e-02

Pos logit contributions
 spoiler+0.022
 sponge+0.019
 ramp+0.019
ro+0.017
 widened+0.017
Neg logit contributions
aw-0.022
iles-0.019
ups-0.017
ing-0.017
una-0.017
Tokenie
Token ID494
Feature activation-1.38e-02

Pos logit contributions
aw+0.088
ils+0.073
iles+0.072
ow+0.066
ups+0.066
Neg logit contributions
 spoiler-0.124
 sponge-0.115
ro-0.112
 cloth-0.110
 ramp-0.107
Token.
Token ID13
Feature activation+6.11e-03

Pos logit contributions
aw+0.023
iles+0.019
ups+0.018
ing+0.018
ils+0.017
Neg logit contributions
 spoiler-0.029
 sponge-0.025
 ramp-0.024
 widened-0.023
ro-0.023
Token Be
Token ID1355
Feature activation-9.47e-03

Pos logit contributions
 spoiler+0.012
 sponge+0.010
 ramp+0.010
 widened+0.010
 prope+0.010
Neg logit contributions
aw-0.011
iles-0.009
ups-0.008
ing-0.008
ow-0.008
Token careful
Token ID8161
Feature activation+2.83e-02

Pos logit contributions
aw+0.015
iles+0.013
ups+0.012
ing+0.012
ils+0.011
Neg logit contributions
 spoiler-0.019
 sponge-0.017
 ramp-0.016
kin-0.016
ro-0.016
Token,
Token ID11
Feature activation+3.51e-03

Pos logit contributions
 spoiler+0.059
 sponge+0.051
 ramp+0.050
ro+0.048
 widened+0.047
Neg logit contributions
aw-0.046
iles-0.039
ups-0.036
ing-0.035
ils-0.035
Token count
Token ID954
Feature activation-2.20e-02

Pos logit contributions
aw+0.010
iles+0.008
ing+0.008
ups+0.008
ils+0.008
Neg logit contributions
 spoiler-0.009
 sponge-0.007
 ramp-0.007
 prope-0.007
 widened-0.007
Token them
Token ID606
Feature activation-6.95e-03

Pos logit contributions
aw+0.033
iles+0.027
ups+0.025
ils+0.025
ese+0.025
Neg logit contributions
 spoiler-0.047
 sponge-0.041
 ramp-0.040
 prope-0.039
ro-0.038
Token together
Token ID1978
Feature activation-7.11e-03

Pos logit contributions
aw+0.012
iles+0.010
ese+0.009
ups+0.009
una+0.009
Neg logit contributions
 spoiler-0.013
 ramp-0.011
 sponge-0.011
ro-0.011
 allowance-0.011
Token;
Token ID26
Feature activation-5.67e-03

Pos logit contributions
aw+0.013
iles+0.011
ups+0.010
ils+0.010
ing+0.010
Neg logit contributions
 spoiler-0.013
 sponge-0.011
 ramp-0.011
ro-0.011
 prope-0.010
Token let
Token ID1309
Feature activation+4.74e-03

Pos logit contributions
aw+0.009
iles+0.008
ils+0.007
ups+0.007
una+0.007
Neg logit contributions
 spoiler-0.011
 sponge-0.010
 ramp-0.010
 prope-0.009
ro-0.009
Token's
Token ID338
Feature activation-7.42e-02

Pos logit contributions
 spoiler+0.009
 sponge+0.008
 cloth+0.008
 ramp+0.008
 allowance+0.008
Neg logit contributions
aw-0.007
iles-0.006
ocked-0.005
ani-0.005
una-0.005
Token go
Token ID467
Feature activation-2.53e-02

Pos logit contributions
aw+0.150
iles+0.128
ing+0.122
ups+0.121
una+0.119
Neg logit contributions
 spoiler-0.127
 sponge-0.107
 ramp-0.107
 prope-0.097
ro-0.097
Token outside
Token ID2354
Feature activation-2.64e-02

Pos logit contributions
aw+0.039
iles+0.032
ing+0.030
ups+0.030
ow+0.029
Neg logit contributions
 spoiler-0.056
 sponge-0.048
 ramp-0.046
 widened-0.046
 prope-0.045
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
aw+0.041
iles+0.034
ing+0.033
 me+0.032
ups+0.032
Neg logit contributions
 spoiler-0.058
 sponge-0.048
 ramp-0.048
 widened-0.046
 prope-0.045
Token look
Token ID804
Feature activation-3.59e-03

Pos logit contributions
aw+0.041
iles+0.035
ups+0.033
ing+0.033
ils+0.032
Neg logit contributions
 spoiler-0.035
 sponge-0.029
 ramp-0.029
ro-0.027
 prope-0.026
Token!"
Token ID2474
Feature activation+1.73e-02

Pos logit contributions
aw+0.004
iles+0.003
ups+0.003
ese+0.002
una+0.002
Neg logit contributions
 spoiler-0.009
 sponge-0.008
 ramp-0.008
 widened-0.008
 prope-0.008
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
aw+0.021
iles+0.017
 me+0.016
ups+0.016
ow+0.015
Neg logit contributions
 spoiler-0.035
 sponge-0.030
 ramp-0.030
ro-0.029
 widened-0.029
Token checkout
Token ID28006
Feature activation+1.56e-02

Pos logit contributions
aw+0.056
iles+0.052
ow+0.047
ani+0.046
una+0.046
Neg logit contributions
 spoiler-0.038
 sponge-0.034
 ramp-0.034
 shelf-0.031
 cloth-0.030
Token.
Token ID13
Feature activation-5.59e-03

Pos logit contributions
 spoiler+0.036
 sponge+0.030
 prope+0.030
 ramp+0.029
 widened+0.029
Neg logit contributions
aw-0.024
ups-0.019
ing-0.018
iles-0.018
 me-0.017
Token In
Token ID554
Feature activation-3.71e-03

Pos logit contributions
aw+0.008
 me+0.006
ing+0.006
ups+0.006
ow+0.006
Neg logit contributions
 spoiler-0.013
 ramp-0.011
 prope-0.011
 sponge-0.011
ge-0.010
Token the
Token ID262
Feature activation-4.02e-02

Pos logit contributions
aw+0.004
iles+0.003
 me+0.003
ow+0.003
ups+0.003
Neg logit contributions
 spoiler-0.009
 sponge-0.008
 ramp-0.008
ro-0.008
 widened-0.008
Token end
Token ID886
Feature activation-5.86e-02

Pos logit contributions
aw+0.073
iles+0.066
ing+0.058
ow+0.057
ils+0.057
Neg logit contributions
 spoiler-0.076
 sponge-0.068
 ramp-0.066
 allowance-0.061
 cloth-0.061
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
aw+0.110
iles+0.095
ils+0.088
ese+0.088
ups+0.085
Neg logit contributions
 spoiler-0.106
 sponge-0.094
 ramp-0.092
ro-0.086
 widened-0.085
Token Mom
Token ID11254
Feature activation-9.95e-03

Pos logit contributions
aw+0.019
iles+0.016
ups+0.015
ing+0.014
ils+0.014
Neg logit contributions
 spoiler-0.027
 sponge-0.023
 ramp-0.023
ro-0.022
 widened-0.022
Tokenmy
Token ID1820
Feature activation-9.59e-03

Pos logit contributions
aw+0.018
iles+0.015
ils+0.014
una+0.013
ups+0.013
Neg logit contributions
 spoiler-0.019
 sponge-0.017
 ramp-0.016
ro-0.015
 cloth-0.015
Token was
Token ID373
Feature activation-1.66e-03

Pos logit contributions
aw+0.017
iles+0.015
ups+0.014
ing+0.014
ils+0.014
Neg logit contributions
 spoiler-0.018
 sponge-0.015
 ramp-0.015
 widened-0.014
ro-0.014
Token happy
Token ID3772
Feature activation+1.39e-03

Pos logit contributions
aw+0.003
iles+0.002
ups+0.002
ing+0.002
ils+0.002
Neg logit contributions
 spoiler-0.004
 sponge-0.003
 ramp-0.003
ro-0.003
 widened-0.003
Token of
Token ID286
Feature activation+4.54e-03

Pos logit contributions
aw+0.049
ils+0.044
iles+0.044
ese+0.044
ow+0.043
Neg logit contributions
 spoiler-0.037
 sponge-0.036
 ramp-0.033
ro-0.033
 cloth-0.032
Token the
Token ID262
Feature activation-6.87e-03

Pos logit contributions
 spoiler+0.010
 sponge+0.009
 ramp+0.009
 prope+0.009
 widened+0.008
Neg logit contributions
aw-0.007
iles-0.005
ups-0.005
ing-0.005
ese-0.005
Token knight
Token ID22062
Feature activation+1.42e-02

Pos logit contributions
aw+0.014
iles+0.012
ups+0.011
ow+0.011
ing+0.011
Neg logit contributions
 spoiler-0.012
 ramp-0.010
 sponge-0.010
 prope-0.009
 widened-0.009
Token and
Token ID290
Feature activation-5.93e-03

Pos logit contributions
 spoiler+0.027
 ramp+0.023
 sponge+0.022
 widened+0.022
 allowance+0.022
Neg logit contributions
aw-0.024
iles-0.021
ups-0.019
ese-0.018
uck-0.018
Token thankful
Token ID25535
Feature activation-6.56e-03

Pos logit contributions
aw+0.011
iles+0.009
ups+0.008
una+0.008
ese+0.008
Neg logit contributions
 spoiler-0.011
 sponge-0.010
 ramp-0.009
 prope-0.009
 widened-0.009
Token for
Token ID329
Feature activation-6.85e-02

Pos logit contributions
aw+0.009
iles+0.008
ese+0.008
ups+0.007
ils+0.007
Neg logit contributions
 spoiler-0.013
ro-0.012
kin-0.012
 sponge-0.012
 allowance-0.012
Token his
Token ID465
Feature activation+6.44e-03

Pos logit contributions
aw+0.134
ups+0.115
iles+0.115
uck+0.107
ing+0.107
Neg logit contributions
 spoiler-0.120
 ramp-0.098
 sponge-0.096
 prope-0.093
ge-0.089
Token courage
Token ID11917
Feature activation-1.52e-03

Pos logit contributions
 spoiler+0.012
 ramp+0.011
 sponge+0.011
 allowance+0.010
 prope+0.010
Neg logit contributions
aw-0.012
iles-0.010
ups-0.009
ing-0.009
ils-0.009
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
aw+0.002
iles+0.002
ups+0.002
ils+0.002
una+0.002
Neg logit contributions
 spoiler-0.003
 sponge-0.003
 ramp-0.003
ro-0.003
 widened-0.003
Token<|endoftext|>
Token ID50256
Feature activation-4.35e-02

Pos logit contributions
aw+0.055
iles+0.050
ups+0.047
ils+0.046
ese+0.046
Neg logit contributions
 spoiler-0.044
 ramp-0.036
 sponge-0.035
ro-0.033
 prope-0.032
TokenThe
Token ID464
Feature activation-2.40e-02

Pos logit contributions
aw+0.066
iles+0.054
ing+0.050
ups+0.048
una+0.048
Neg logit contributions
 spoiler-0.097
 sponge-0.085
 ramp-0.081
 prope-0.080
 allowance-0.078