TOP ACTIVATIONS
MAX = 4.324

. He likes to get letters from his grandma . She
excited ! She loved getting packages in the mail . She
and going . He crossed rivers , mountains and forests ,
. She asked lots of questions , listened to her teacher
grandma . She loved getting letters and cards in the mail
set off . She climbed mountains and crossed rivers , until
a rocket . They put stickers and drawings on it to
they see . They ask questions and learn more . They
cream . He also puts stickers on the days when he
post office . I deliver letters and packages to people .
curiosity . He asked many questions but the box didn รข
. He asked lots of questions , while the wizard laughed
stories . She asked many questions and learned many new words
new . He kept asking questions about it for the rest
He asked a lot of questions , but his mom and
save money . She put coins in a jar . One
blocks . They built tall houses and skinny houses . They
. Emily asked lots of questions , including " What 's
valleys . He flew over mountains and woods . He rushed
a spaceship . They put stickers and drawings on it to

MIN ACTIVATIONS
MIN = -4.793

it over her and her baby . The gear was a
friends without any more toe stub s . <|endoftext|> Once upon
quick thinking and his trust y flashlight . Tim my felt
next few days until his ankle healed . He felt sad
forward , Tweet and her baby bird lived happily in their
egg was broken and her baby bird was gone .
the gifted frog and the mosquito ended . <|endoftext|> Once upon
mess and then rock his baby sister to sleep so his
Lily and Mr . Sp ud went to the park .
apple , Tom and the mosquito played together in the rain
day , Jimmy and his cricket went to the park to
dinner for her and her baby bear . So , she
, Bob and his nos y neighbor became good friends and
to their mom and their baby brother . They gave him
and peaceful under her watch ful eye . <|endoftext|> Once upon
crown and playing in the freshly cut grass . It looked
and your big sne e ze ." They all hugged Tim
hear that , and her toe stopped hurting once she had
when she stub bed her toe on a rock . It
and Tim my hurt his toe . Their mom

INTERVAL 2.045 - 4.324
CONTAINS 0.657%

park . He saw a tree and wanted to climb up
fine . The moral of this story is that
. They saw many big animals , like lions , elephants
I found a pretty pe b ble !" she said ,
. The game was so fun ! When it

INTERVAL -0.234 - 2.045
CONTAINS 61.532%

Once upon a time , there was a little boy named
he flew . Johnny waved back and said ,
the earth . Ben has a small ball that looks like
room !" Ben said , pointing at another picture .
tree !" Ben says . He likes to climb .

INTERVAL -2.513 - -0.234
CONTAINS 37.592%

She also noticed that this flower was even l itt ler
cat liked the song and started to dance . While the
. The squash lived in a comfortable garden with many other
. The little girl explained that it had shrunk and become
spark ly dress and her friend put on a pink ,

INTERVAL -4.793 - -2.513
CONTAINS 0.218%

!" But the busy friend said , " No
Tim my 's little sister accidentally knocked over her glass of
it was spark ly and clean . The
and daddy had hit his thumb with the hammer . M
towards them . The dog quickly ran away .
Token.
Token ID13
Feature activation+4.86e-01

Pos logit contributions
lock+0.000
 guide+0.000
 X+0.000
 Steve+0.000
 Mark+0.000
Neg logit contributions
arre-0.001
 mosquitoes-0.001
 ow-0.001
Acc-0.001
ilt-0.001
Token He
Token ID679
Feature activation+1.81e+00

Pos logit contributions
 guide+0.010
lock+0.010
 X+0.010
lynn+0.009
 believe+0.009
Neg logit contributions
arre-0.014
 ow-0.014
 mosquitoes-0.014
 allergies-0.013
 miserable-0.013
Token likes
Token ID7832
Feature activation+4.31e-01

Pos logit contributions
 guide+0.045
lock+0.044
 X+0.043
 Steve+0.043
 Mark+0.040
Neg logit contributions
arre-0.050
 ow-0.047
 mosquitoes-0.046
 wil-0.045
 suffering-0.044
Token to
Token ID284
Feature activation+5.70e-01

Pos logit contributions
 guide+0.008
 X+0.007
 Steve+0.007
lock+0.007
 Mark+0.007
Neg logit contributions
arre-0.016
ivery-0.014
 ow-0.014
onel-0.014
ilt-0.014
Token get
Token ID651
Feature activation-8.89e-02

Pos logit contributions
lock+0.011
 Steve+0.011
lot+0.010
 Kyle+0.010
lynn+0.010
Neg logit contributions
 ow-0.018
 wil-0.017
 suffering-0.016
 sick-0.016
 mosquitoes-0.016
Token letters
Token ID7475
Feature activation+4.32e+00

Pos logit contributions
 suffering+0.002
 wil+0.002
 mosquitoes+0.002
 dying+0.002
 ow+0.002
Neg logit contributions
lock-0.002
 guide-0.002
 Steve-0.002
 X-0.002
lynn-0.002
Token from
Token ID422
Feature activation+1.10e+00

Pos logit contributions
lynn+0.111
 Steve+0.108
lock+0.108
 guide+0.105
Zip+0.105
Neg logit contributions
 suffering-0.087
 sick-0.086
 asleep-0.086
der-0.086
 dying-0.085
Token his
Token ID465
Feature activation+5.24e-01

Pos logit contributions
lock+0.025
 guide+0.023
 Steve+0.022
 X+0.022
lot+0.021
Neg logit contributions
 ow-0.032
 mosquitoes-0.031
 suffering-0.031
arre-0.031
 wil-0.031
Token grandma
Token ID49890
Feature activation-1.56e-01

Pos logit contributions
lock+0.011
 guide+0.011
 Steve+0.011
 X+0.011
 Mark+0.010
Neg logit contributions
arre-0.016
 ow-0.015
 mosquitoes-0.015
 wil-0.015
 suffering-0.015
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
arre+0.004
 ow+0.004
 suffering+0.004
 mosquitoes+0.004
 allergies+0.004
Neg logit contributions
lock-0.004
 guide-0.004
 X-0.003
 Steve-0.003
 Kyle-0.003
Token She
Token ID1375
Feature activation+5.67e-01

Pos logit contributions
lock+0.010
 guide+0.010
 X+0.010
 Steve+0.009
 map+0.009
Neg logit contributions
arre-0.013
 ow-0.013
 mosquitoes-0.012
 allergies-0.012
 miserable-0.011
Token excited
Token ID6568
Feature activation+2.82e-01

Pos logit contributions
 mosquitoes+0.023
arre+0.023
pill+0.023
 allergies+0.023
 ow+0.022
Neg logit contributions
lock-0.010
 guide-0.010
 Steve-0.009
 close-0.009
ver-0.009
Token!
Token ID0
Feature activation-1.30e-01

Pos logit contributions
 guide+0.005
 Steve+0.005
lock+0.005
 X+0.005
 Mark+0.005
Neg logit contributions
arre-0.010
 ow-0.009
 wil-0.009
 mosquitoes-0.009
onel-0.009
Token She
Token ID1375
Feature activation+3.87e-01

Pos logit contributions
arre+0.004
 ow+0.003
 mosquitoes+0.003
 wil+0.003
 allergies+0.003
Neg logit contributions
lock-0.003
 guide-0.003
 Steve-0.003
 X-0.003
 Mark-0.003
Token loved
Token ID6151
Feature activation+2.72e-01

Pos logit contributions
 guide+0.008
lock+0.008
 X+0.008
 Steve+0.008
 Mark+0.007
Neg logit contributions
arre-0.012
 ow-0.012
 mosquitoes-0.011
 wil-0.011
 allergies-0.011
Token getting
Token ID1972
Feature activation+1.91e-01

Pos logit contributions
 Steve+0.006
 Mark+0.006
 guide+0.006
 X+0.006
lock+0.005
Neg logit contributions
arre-0.010
ivery-0.009
onel-0.009
ivering-0.009
ilt-0.009
Token packages
Token ID10392
Feature activation+4.32e+00

Pos logit contributions
 guide+0.004
lock+0.004
 X+0.004
 Steve+0.004
lot+0.004
Neg logit contributions
arre-0.006
 ow-0.005
 mosquitoes-0.005
 wil-0.005
 suffering-0.005
Token in
Token ID287
Feature activation+6.33e-01

Pos logit contributions
 X+0.100
lot+0.098
Zip+0.098
lynn+0.097
 Steve+0.097
Neg logit contributions
 suffering-0.103
arre-0.101
 dying-0.101
 wil-0.100
 asleep-0.098
Token the
Token ID262
Feature activation+6.56e-02

Pos logit contributions
lock+0.019
 guide+0.019
 X+0.018
 Steve+0.018
lot+0.018
Neg logit contributions
arre-0.013
 ow-0.012
 wil-0.012
 suffering-0.012
 mosquitoes-0.012
Token mail
Token ID6920
Feature activation+1.49e+00

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 mosquitoes-0.002
 wil-0.002
 suffering-0.002
Token.
Token ID13
Feature activation+2.44e-01

Pos logit contributions
 Steve+0.032
 X+0.032
lock+0.031
 steer+0.031
lot+0.031
Neg logit contributions
arre-0.040
 suffering-0.037
 wil-0.037
 ow-0.037
 dying-0.036
Token She
Token ID1375
Feature activation+3.73e-01

Pos logit contributions
lock+0.005
 guide+0.005
 Steve+0.005
 X+0.005
 Mark+0.005
Neg logit contributions
arre-0.007
 ow-0.007
 wil-0.007
 mosquitoes-0.007
 dying-0.007
Token and
Token ID290
Feature activation-4.82e-01

Pos logit contributions
lock+0.012
 X+0.012
 Steve+0.012
 guide+0.011
 Mark+0.011
Neg logit contributions
arre-0.014
 suffering-0.014
 dying-0.014
 ow-0.013
 wil-0.013
Token going
Token ID1016
Feature activation+1.90e-01

Pos logit contributions
arre+0.015
 ow+0.014
 mosquitoes+0.014
 wil+0.014
 suffering+0.014
Neg logit contributions
 guide-0.010
lock-0.010
 X-0.009
 Steve-0.009
 Mark-0.008
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
 Steve+0.004
lock+0.004
 X+0.004
 guide+0.004
 Mark+0.004
Neg logit contributions
arre-0.005
 ow-0.005
 suffering-0.005
 dying-0.005
onel-0.005
Token He
Token ID679
Feature activation+7.67e-01

Pos logit contributions
arre+0.011
 ow+0.010
 mosquitoes+0.010
 wil+0.010
 suffering+0.010
Neg logit contributions
 guide-0.006
lock-0.005
 X-0.005
 Steve-0.005
 Mark-0.005
Token crossed
Token ID12606
Feature activation+1.65e+00

Pos logit contributions
lock+0.017
 guide+0.017
 X+0.017
 Steve+0.017
 Mark+0.015
Neg logit contributions
arre-0.022
 ow-0.021
 wil-0.021
 mosquitoes-0.020
 suffering-0.020
Token rivers
Token ID18180
Feature activation+4.26e+00

Pos logit contributions
lock+0.039
 guide+0.036
 X+0.035
 Steve+0.034
 Copper+0.033
Neg logit contributions
arre-0.042
 suffering-0.041
 dying-0.041
 wil-0.041
 mosquitoes-0.041
Token,
Token ID11
Feature activation+6.52e-01

Pos logit contributions
lock+0.091
 X+0.086
 guide+0.083
 Steve+0.081
 Kyle+0.079
Neg logit contributions
arre-0.119
der-0.109
 suffering-0.109
 wil-0.109
 asleep-0.109
Token mountains
Token ID12269
Feature activation+3.96e+00

Pos logit contributions
lock+0.017
 X+0.015
 guide+0.015
 opener+0.015
 Steve+0.015
Neg logit contributions
arre-0.016
 suffering-0.016
 wil-0.015
 mosquitoes-0.015
 dying-0.015
Token and
Token ID290
Feature activation+1.94e-01

Pos logit contributions
lock+0.086
 Kyle+0.080
 X+0.079
 Steve+0.078
lynn+0.076
Neg logit contributions
arre-0.102
der-0.100
 ow-0.099
 suffering-0.098
 asleep-0.097
Token forests
Token ID17039
Feature activation+2.37e+00

Pos logit contributions
lock+0.005
lynn+0.005
 X+0.005
 guide+0.005
 Steve+0.005
Neg logit contributions
 mosquitoes-0.004
 ow-0.004
arre-0.004
 suffering-0.004
 wil-0.004
Token,
Token ID11
Feature activation+4.33e-01

Pos logit contributions
lock+0.055
 X+0.052
 Steve+0.052
 guide+0.051
 Kyle+0.050
Neg logit contributions
arre-0.059
 ow-0.059
 wil-0.058
der-0.058
 mosquitoes-0.058
Token.
Token ID13
Feature activation+4.37e-01

Pos logit contributions
 guide+0.023
 Steve+0.021
lock+0.021
 X+0.021
 Mark+0.019
Neg logit contributions
arre-0.057
 ow-0.055
 wil-0.054
 mosquitoes-0.054
Acc-0.053
Token She
Token ID1375
Feature activation+2.82e-01

Pos logit contributions
 guide+0.011
lock+0.011
 X+0.011
 Steve+0.011
 Mark+0.010
Neg logit contributions
arre-0.012
 ow-0.011
 mosquitoes-0.011
 wil-0.011
 suffering-0.010
Token asked
Token ID1965
Feature activation+4.37e-01

Pos logit contributions
 guide+0.006
lock+0.006
 Steve+0.006
 X+0.006
 Mark+0.005
Neg logit contributions
arre-0.009
 ow-0.008
 mosquitoes-0.008
 wil-0.008
 dying-0.008
Token lots
Token ID6041
Feature activation+7.07e-01

Pos logit contributions
 guide+0.013
lock+0.013
 X+0.012
 Steve+0.012
 Mark+0.011
Neg logit contributions
arre-0.010
 ow-0.009
 mosquitoes-0.009
 wil-0.009
 suffering-0.009
Token of
Token ID286
Feature activation+3.53e-01

Pos logit contributions
 guide+0.012
lock+0.012
 X+0.011
 Steve+0.011
 Mark+0.010
Neg logit contributions
arre-0.025
 ow-0.024
 mosquitoes-0.023
 wil-0.023
 suffering-0.023
Token questions
Token ID2683
Feature activation+4.23e+00

Pos logit contributions
 guide+0.007
 Steve+0.007
 X+0.007
lock+0.007
 Mark+0.006
Neg logit contributions
arre-0.011
 ow-0.011
 wil-0.011
 dying-0.010
onel-0.010
Token,
Token ID11
Feature activation-4.01e-02

Pos logit contributions
lock+0.075
 guide+0.074
 X+0.072
 Steve+0.069
lot+0.062
Neg logit contributions
arre-0.143
 ow-0.134
 mosquitoes-0.133
 suffering-0.130
 wil-0.130
Token listened
Token ID16399
Feature activation-3.82e-01

Pos logit contributions
arre+0.001
 ow+0.001
 wil+0.001
ilt+0.001
ivery+0.001
Neg logit contributions
 guide-0.001
 Steve-0.001
lock-0.001
 X-0.001
 Mark-0.001
Token to
Token ID284
Feature activation-2.95e-01

Pos logit contributions
arre+0.013
 ow+0.013
 mosquitoes+0.012
 wil+0.012
 allergies+0.012
Neg logit contributions
 guide-0.007
lock-0.007
 Steve-0.006
 X-0.006
 Mark-0.006
Token her
Token ID607
Feature activation+6.10e-03

Pos logit contributions
arre+0.008
 ow+0.007
 wil+0.007
onel+0.007
ivery+0.007
Neg logit contributions
 guide-0.007
 Steve-0.007
 X-0.007
 Mark-0.007
lock-0.007
Token teacher
Token ID4701
Feature activation-5.00e-01

Pos logit contributions
 guide+0.000
lock+0.000
 X+0.000
 Steve+0.000
 Mark+0.000
Neg logit contributions
arre-0.000
 ow-0.000
 mosquitoes-0.000
 wil-0.000
 dying-0.000
Token grandma
Token ID49890
Feature activation-4.27e-01

Pos logit contributions
arre+0.028
 mosquitoes+0.026
ilt+0.025
ins+0.025
 ow+0.025
Neg logit contributions
 guide-0.015
lock-0.014
 X-0.014
 Steve-0.013
 Mark-0.012
Token.
Token ID13
Feature activation-8.80e-01

Pos logit contributions
arre+0.016
 mosquitoes+0.015
 ow+0.015
 wil+0.015
onel+0.015
Neg logit contributions
 guide-0.007
lock-0.006
 Steve-0.006
 X-0.006
 Mark-0.005
Token She
Token ID1375
Feature activation+4.39e-02

Pos logit contributions
arre+0.031
 mosquitoes+0.030
 wil+0.030
 ow+0.030
ilt+0.030
Neg logit contributions
 Steve-0.015
 Mark-0.014
lock-0.014
 guide-0.014
 X-0.014
Token loved
Token ID6151
Feature activation+2.07e-01

Pos logit contributions
 guide+0.001
 X+0.001
lock+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.001
 mosquitoes-0.001
 ow-0.001
ivery-0.001
 allergies-0.001
Token getting
Token ID1972
Feature activation+1.23e-01

Pos logit contributions
 Mark+0.005
 Steve+0.004
 guide+0.004
 X+0.004
 Copper+0.004
Neg logit contributions
arre-0.007
ivery-0.007
ivering-0.007
ilt-0.007
onel-0.007
Token letters
Token ID7475
Feature activation+4.21e+00

Pos logit contributions
 guide+0.003
lock+0.003
 X+0.002
 Steve+0.002
 Mark+0.002
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.004
 wil-0.004
 allergies-0.003
Token and
Token ID290
Feature activation+3.88e-01

Pos logit contributions
 X+0.082
lock+0.081
 Steve+0.081
 guide+0.079
lynn+0.077
Neg logit contributions
arre-0.124
 suffering-0.119
 ow-0.119
 wil-0.119
 dying-0.117
Token cards
Token ID4116
Feature activation+2.05e+00

Pos logit contributions
lock+0.007
 guide+0.007
 Steve+0.007
 X+0.006
lot+0.006
Neg logit contributions
arre-0.013
 ow-0.012
 wil-0.012
 suffering-0.012
 dying-0.012
Token in
Token ID287
Feature activation+1.68e-01

Pos logit contributions
 guide+0.038
lock+0.037
 X+0.036
 Steve+0.035
 Mark+0.032
Neg logit contributions
arre-0.071
 ow-0.067
 mosquitoes-0.066
 wil-0.064
Acc-0.064
Token the
Token ID262
Feature activation-2.77e-01

Pos logit contributions
lock+0.005
 guide+0.005
 X+0.005
 Steve+0.005
lot+0.004
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.003
 wil-0.003
 suffering-0.003
Token mail
Token ID6920
Feature activation+5.73e-01

Pos logit contributions
arre+0.009
 mosquitoes+0.009
 ow+0.009
 wil+0.009
onel+0.009
Neg logit contributions
 guide-0.005
 X-0.005
lock-0.005
 Steve-0.005
 Mark-0.004
Token set
Token ID900
Feature activation+5.31e-01

Pos logit contributions
 guide+0.029
lock+0.028
 X+0.028
 Steve+0.028
lot+0.026
Neg logit contributions
arre-0.034
 ow-0.032
 wil-0.032
 mosquitoes-0.032
 allergies-0.031
Token off
Token ID572
Feature activation-1.18e+00

Pos logit contributions
 X+0.014
lot+0.014
lock+0.014
 Steve+0.014
 guide+0.014
Neg logit contributions
 itching-0.010
 mosquitoes-0.010
arre-0.010
 ow-0.010
 wil-0.010
Token.
Token ID13
Feature activation+4.49e-01

Pos logit contributions
arre+0.038
 ow+0.036
 mosquitoes+0.035
 wil+0.034
 allergies+0.034
Neg logit contributions
 guide-0.024
lock-0.024
 Steve-0.023
 X-0.023
 Mark-0.021
Token She
Token ID1375
Feature activation+3.85e-01

Pos logit contributions
lock+0.010
 guide+0.010
 Steve+0.010
 X+0.010
 Mark+0.009
Neg logit contributions
arre-0.013
 ow-0.013
 wil-0.012
 mosquitoes-0.012
 dying-0.012
Token climbed
Token ID19952
Feature activation+1.40e+00

Pos logit contributions
lock+0.008
 guide+0.008
 Steve+0.008
 X+0.008
 Mark+0.008
Neg logit contributions
arre-0.011
 wil-0.011
 ow-0.011
 allergies-0.010
 suffering-0.010
Token mountains
Token ID12269
Feature activation+4.21e+00

Pos logit contributions
lock+0.037
 guide+0.036
 X+0.036
 Steve+0.035
 Kyle+0.034
Neg logit contributions
 asleep-0.030
 mosquitoes-0.030
 itching-0.028
 ow-0.028
 wil-0.028
Token and
Token ID290
Feature activation+5.51e-01

Pos logit contributions
lock+0.098
 X+0.094
 Kyle+0.092
lynn+0.088
 Steve+0.088
Neg logit contributions
 asleep-0.100
arre-0.098
 mosquitoes-0.095
der-0.094
 wil-0.092
Token crossed
Token ID12606
Feature activation+1.22e+00

Pos logit contributions
lock+0.015
lynn+0.015
 Steve+0.015
 X+0.015
 Kyle+0.015
Neg logit contributions
 wil-0.010
 ow-0.010
 mosquitoes-0.010
arre-0.010
 bees-0.010
Token rivers
Token ID18180
Feature activation+3.31e+00

Pos logit contributions
lock+0.030
 guide+0.029
 X+0.028
 Steve+0.027
lot+0.026
Neg logit contributions
arre-0.031
 ow-0.031
 wil-0.031
 mosquitoes-0.031
 dying-0.030
Token,
Token ID11
Feature activation+3.64e-01

Pos logit contributions
lock+0.077
 X+0.075
 guide+0.073
 Steve+0.072
 Kyle+0.070
Neg logit contributions
arre-0.085
 wil-0.079
 mosquitoes-0.079
 ow-0.079
der-0.078
Token until
Token ID1566
Feature activation-8.16e-01

Pos logit contributions
lock+0.010
 X+0.009
 Steve+0.009
 guide+0.009
 Kyle+0.009
Neg logit contributions
arre-0.008
 mosquitoes-0.008
 wil-0.008
 suffering-0.008
 ow-0.008
Token a
Token ID257
Feature activation+7.81e-02

Pos logit contributions
 mosquitoes+0.004
arre+0.004
 ow+0.004
 wil+0.004
 allergies+0.004
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.004
 Steve-0.003
 Kyle-0.003
Token rocket
Token ID10701
Feature activation+3.61e-01

Pos logit contributions
lock+0.002
 guide+0.002
 Steve+0.002
 X+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 wil-0.002
 mosquitoes-0.002
 dying-0.002
Token.
Token ID13
Feature activation-6.02e-02

Pos logit contributions
 guide+0.007
lock+0.007
 X+0.007
 Steve+0.007
 Mark+0.006
Neg logit contributions
arre-0.012
 ow-0.011
 mosquitoes-0.011
 wil-0.011
 allergies-0.011
Token They
Token ID1119
Feature activation+4.48e-01

Pos logit contributions
arre+0.002
 ow+0.002
 wil+0.002
 mosquitoes+0.002
 suffering+0.002
Neg logit contributions
 guide-0.001
 Steve-0.001
lock-0.001
 X-0.001
 Mark-0.001
Token put
Token ID1234
Feature activation+8.44e-01

Pos logit contributions
lock+0.012
 Steve+0.011
 guide+0.011
lynn+0.011
lot+0.011
Neg logit contributions
 wil-0.010
arre-0.010
 spilled-0.010
 ow-0.010
 dying-0.010
Token stickers
Token ID28568
Feature activation+4.20e+00

Pos logit contributions
lock+0.026
lot+0.024
 guide+0.023
lynn+0.023
ver+0.023
Neg logit contributions
 mosquitoes-0.018
 ow-0.017
 bubbles-0.016
 wil-0.016
 dying-0.016
Token and
Token ID290
Feature activation+8.80e-01

Pos logit contributions
 Steve+0.113
 guide+0.110
lock+0.109
 Connor+0.107
lynn+0.107
Neg logit contributions
arre-0.084
 asleep-0.082
ins-0.077
 spilled-0.076
 wil-0.075
Token drawings
Token ID23388
Feature activation+1.49e+00

Pos logit contributions
 Steve+0.024
lock+0.023
 guide+0.022
lynn+0.022
 Kyle+0.022
Neg logit contributions
 ow-0.021
 mosquitoes-0.020
arre-0.020
 allergies-0.019
 ger-0.019
Token on
Token ID319
Feature activation-7.42e-01

Pos logit contributions
 Steve+0.034
lock+0.033
 guide+0.032
 X+0.032
lynn+0.031
Neg logit contributions
arre-0.039
 ow-0.037
 wil-0.037
 dying-0.037
onel-0.036
Token it
Token ID340
Feature activation-1.42e+00

Pos logit contributions
 ow+0.017
 mosquitoes+0.017
 wil+0.016
 ger+0.015
 dying+0.015
Neg logit contributions
lock-0.021
 guide-0.019
lot-0.019
 Steve-0.018
lynn-0.018
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
arre+0.049
 ow+0.047
 mosquitoes+0.046
 wil+0.046
 allergies+0.045
Neg logit contributions
 guide-0.025
lock-0.025
 X-0.024
 Steve-0.023
 Mark-0.021
Token they
Token ID484
Feature activation-6.58e-01

Pos logit contributions
 guide+0.019
lock+0.019
 X+0.018
 Steve+0.018
 Mark+0.016
Neg logit contributions
arre-0.030
 ow-0.028
 mosquitoes-0.028
 wil-0.028
 suffering-0.027
Token see
Token ID766
Feature activation+2.79e-01

Pos logit contributions
arre+0.021
 ow+0.020
 mosquitoes+0.020
 wil+0.019
 suffering+0.019
Neg logit contributions
 guide-0.013
lock-0.013
 X-0.013
 Steve-0.013
 Mark-0.012
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
 guide+0.005
 X+0.005
lock+0.005
 Steve+0.004
 Mark+0.004
Neg logit contributions
arre-0.010
 ow-0.009
ivery-0.009
Acc-0.009
 mosquitoes-0.009
Token They
Token ID1119
Feature activation+6.83e-01

Pos logit contributions
 Steve+0.009
 guide+0.009
 X+0.008
lock+0.008
 Mark+0.008
Neg logit contributions
arre-0.023
 ow-0.022
 mosquitoes-0.022
 wil-0.022
onel-0.022
Token ask
Token ID1265
Feature activation+1.18e+00

Pos logit contributions
lock+0.012
 guide+0.012
 X+0.012
 Steve+0.012
 Mark+0.011
Neg logit contributions
arre-0.023
 ow-0.022
 mosquitoes-0.021
 wil-0.021
 suffering-0.021
Token questions
Token ID2683
Feature activation+4.18e+00

Pos logit contributions
lock+0.032
 guide+0.031
 X+0.031
 Steve+0.030
lot+0.029
Neg logit contributions
arre-0.029
 mosquitoes-0.028
 ow-0.027
 wil-0.027
 suffering-0.026
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
lock+0.072
 guide+0.071
 X+0.069
 Steve+0.068
lot+0.060
Neg logit contributions
arre-0.142
 ow-0.134
 mosquitoes-0.133
 wil-0.131
 dying-0.130
Token learn
Token ID2193
Feature activation+1.62e+00

Pos logit contributions
 guide+0.024
lock+0.023
 X+0.022
 Steve+0.022
 Mark+0.020
Neg logit contributions
arre-0.039
 ow-0.037
 mosquitoes-0.036
 wil-0.036
 allergies-0.035
Token more
Token ID517
Feature activation+5.57e-01

Pos logit contributions
 guide+0.035
lock+0.033
 X+0.033
 Steve+0.033
 Mark+0.031
Neg logit contributions
arre-0.051
 ow-0.048
 wil-0.046
 mosquitoes-0.046
onel-0.046
Token.
Token ID13
Feature activation+9.33e-01

Pos logit contributions
 guide+0.011
lock+0.011
 X+0.011
 Steve+0.010
 Mark+0.010
Neg logit contributions
arre-0.018
 ow-0.017
 wil-0.017
 mosquitoes-0.017
onel-0.017
Token They
Token ID1119
Feature activation+8.16e-01

Pos logit contributions
 Steve+0.015
 guide+0.015
lock+0.014
 X+0.014
 Mark+0.014
Neg logit contributions
arre-0.034
 ow-0.033
 wil-0.033
 mosquitoes-0.032
onel-0.032
Token cream
Token ID8566
Feature activation+7.03e-01

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.003
 wil+0.003
ins+0.003
Neg logit contributions
 guide-0.004
lock-0.004
 Steve-0.004
 X-0.004
 Kyle-0.003
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
 guide+0.015
lock+0.015
 X+0.014
 Steve+0.014
 Mark+0.013
Neg logit contributions
arre-0.022
 ow-0.020
 mosquitoes-0.020
 wil-0.020
 allergies-0.019
Token He
Token ID679
Feature activation+1.24e+00

Pos logit contributions
arre+0.011
 ow+0.011
 mosquitoes+0.011
 wil+0.010
 allergies+0.010
Neg logit contributions
 guide-0.006
lock-0.006
 X-0.006
 Steve-0.006
 Mark-0.005
Token also
Token ID635
Feature activation-5.04e-01

Pos logit contributions
 guide+0.027
 X+0.026
lock+0.026
 Steve+0.025
 Mark+0.024
Neg logit contributions
arre-0.038
 ow-0.036
 mosquitoes-0.035
 wil-0.035
 suffering-0.035
Token puts
Token ID7584
Feature activation+1.07e+00

Pos logit contributions
arre+0.017
 ow+0.016
 mosquitoes+0.016
onel+0.016
ivery+0.015
Neg logit contributions
 guide-0.010
 X-0.009
lock-0.009
 Steve-0.009
 point-0.008
Token stickers
Token ID28568
Feature activation+4.17e+00

Pos logit contributions
lock+0.026
 guide+0.026
 Steve+0.024
 X+0.023
lot+0.022
Neg logit contributions
 ow-0.029
arre-0.029
 mosquitoes-0.028
 wil-0.027
 allergies-0.027
Token on
Token ID319
Feature activation+3.52e-01

Pos logit contributions
 guide+0.110
 Steve+0.109
 X+0.107
lock+0.106
 Copper+0.099
Neg logit contributions
arre-0.092
 asleep-0.088
ins-0.086
onel-0.081
 ow-0.080
Token the
Token ID262
Feature activation+4.21e-01

Pos logit contributions
lock+0.008
 guide+0.008
 Steve+0.007
 X+0.007
lot+0.007
Neg logit contributions
 ow-0.010
 mosquitoes-0.010
arre-0.010
 wil-0.009
 allergies-0.009
Token days
Token ID1528
Feature activation+1.20e+00

Pos logit contributions
 guide+0.010
lock+0.010
 X+0.010
 Steve+0.009
 Mark+0.009
Neg logit contributions
arre-0.013
 ow-0.011
onel-0.011
 wil-0.011
Acc-0.011
Token when
Token ID618
Feature activation-7.14e-01

Pos logit contributions
 guide+0.023
lock+0.023
 X+0.023
 Steve+0.022
lot+0.021
Neg logit contributions
arre-0.036
 suffering-0.034
 ow-0.033
 itching-0.033
 mosquitoes-0.033
Token he
Token ID339
Feature activation-1.45e+00

Pos logit contributions
arre+0.026
 ow+0.025
 mosquitoes+0.025
 wil+0.024
 suffering+0.024
Neg logit contributions
 guide-0.011
lock-0.011
 X-0.010
 Steve-0.010
 Mark-0.009
Token post
Token ID1281
Feature activation+7.84e-01

Pos logit contributions
arre+0.025
ivery+0.022
 ow+0.022
ins+0.022
onel+0.021
Neg logit contributions
 guide-0.017
 X-0.017
lock-0.015
 Steve-0.015
 map-0.014
Token office
Token ID2607
Feature activation+8.82e-01

Pos logit contributions
 guide+0.016
lock+0.016
 X+0.016
 Steve+0.016
 Mark+0.015
Neg logit contributions
arre-0.024
 ow-0.023
 mosquitoes-0.022
 wil-0.022
 suffering-0.022
Token.
Token ID13
Feature activation-7.81e-02

Pos logit contributions
 guide+0.020
lock+0.020
 X+0.019
 Steve+0.019
 Mark+0.018
Neg logit contributions
arre-0.026
 ow-0.024
 mosquitoes-0.024
 wil-0.024
 suffering-0.023
Token I
Token ID314
Feature activation-6.84e-01

Pos logit contributions
arre+0.002
 ow+0.002
 mosquitoes+0.002
Acc+0.002
 allergies+0.002
Neg logit contributions
 guide-0.002
lock-0.002
 X-0.002
 Steve-0.002
lot-0.002
Token deliver
Token ID5203
Feature activation+1.88e+00

Pos logit contributions
arre+0.022
 ow+0.021
 mosquitoes+0.021
 wil+0.020
 dying+0.020
Neg logit contributions
 guide-0.014
lock-0.014
 X-0.013
 Steve-0.013
 Mark-0.012
Token letters
Token ID7475
Feature activation+4.17e+00

Pos logit contributions
lock+0.048
 guide+0.047
 Steve+0.045
 X+0.045
lot+0.044
Neg logit contributions
 ow-0.046
arre-0.046
 mosquitoes-0.046
 wil-0.044
 allergies-0.044
Token and
Token ID290
Feature activation+7.90e-02

Pos logit contributions
 guide+0.091
 Steve+0.090
 X+0.089
lock+0.088
lynn+0.082
Neg logit contributions
arre-0.115
 ow-0.109
 mosquitoes-0.107
 allergies-0.106
 dying-0.106
Token packages
Token ID10392
Feature activation+3.40e+00

Pos logit contributions
lock+0.001
 guide+0.001
 Steve+0.001
 X+0.001
 Mark+0.001
Neg logit contributions
arre-0.003
 ow-0.002
 mosquitoes-0.002
 wil-0.002
 allergies-0.002
Token to
Token ID284
Feature activation-1.11e+00

Pos logit contributions
 Steve+0.081
lock+0.080
 guide+0.079
 X+0.079
 Kyle+0.073
Neg logit contributions
arre-0.087
 ow-0.082
 allergies-0.081
 dying-0.081
 suffering-0.080
Token people
Token ID661
Feature activation+4.71e-01

Pos logit contributions
arre+0.032
 ow+0.031
 mosquitoes+0.030
 wil+0.030
 suffering+0.029
Neg logit contributions
 guide-0.025
lock-0.025
 X-0.025
 Steve-0.024
 Mark-0.022
Token.
Token ID13
Feature activation-2.54e-02

Pos logit contributions
 guide+0.011
lock+0.011
 X+0.010
 Steve+0.010
 Mark+0.009
Neg logit contributions
arre-0.014
 ow-0.014
 mosquitoes-0.013
 wil-0.013
Acc-0.013
Token curiosity
Token ID20136
Feature activation+1.14e+00

Pos logit contributions
 guide+0.018
 X+0.017
lock+0.017
 Steve+0.017
 Mark+0.015
Neg logit contributions
arre-0.027
 ow-0.025
 mosquitoes-0.024
Acc-0.024
 wil-0.024
Token.
Token ID13
Feature activation+8.24e-02

Pos logit contributions
 guide+0.019
lock+0.019
 X+0.018
 Steve+0.018
 Mark+0.016
Neg logit contributions
arre-0.040
 ow-0.038
 mosquitoes-0.038
 wil-0.037
Acc-0.037
Token He
Token ID679
Feature activation+9.36e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.003
 ow-0.003
 mosquitoes-0.003
 wil-0.003
 allergies-0.003
Token asked
Token ID1965
Feature activation+4.82e-02

Pos logit contributions
 guide+0.020
lock+0.019
 X+0.019
 Steve+0.019
 Mark+0.017
Neg logit contributions
arre-0.030
 ow-0.028
 mosquitoes-0.028
 wil-0.027
 dying-0.027
Token many
Token ID867
Feature activation+8.37e-01

Pos logit contributions
lock+0.001
 guide+0.001
 X+0.001
 Steve+0.001
lot+0.001
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 allergies-0.001
 suffering-0.001
Token questions
Token ID2683
Feature activation+4.16e+00

Pos logit contributions
 guide+0.019
lock+0.019
 X+0.018
 Steve+0.018
 Mark+0.017
Neg logit contributions
arre-0.025
 ow-0.023
 mosquitoes-0.023
 wil-0.023
 suffering-0.022
Token but
Token ID475
Feature activation+8.27e-01

Pos logit contributions
lock+0.079
 guide+0.078
 X+0.076
 Steve+0.074
lot+0.069
Neg logit contributions
arre-0.129
 ow-0.122
 mosquitoes-0.119
 suffering-0.119
 allergies-0.118
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
 guide+0.016
lock+0.016
 X+0.016
 Steve+0.015
lot+0.014
Neg logit contributions
arre-0.026
 ow-0.025
 mosquitoes-0.024
 wil-0.024
 suffering-0.024
Token box
Token ID3091
Feature activation-1.71e-01

Pos logit contributions
 ow+0.003
arre+0.003
 suffering+0.003
 wil+0.003
 mosquitoes+0.003
Neg logit contributions
lock-0.002
 Steve-0.002
 guide-0.002
 steer-0.002
lot-0.002
Token didn
Token ID1422
Feature activation+1.18e+00

Pos logit contributions
arre+0.005
 ow+0.005
 wil+0.005
 mosquitoes+0.005
 suffering+0.005
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.003
 Steve-0.003
 Mark-0.003
Tokenรข
Token ID22940
Feature activation+6.92e-01

Pos logit contributions
 guide+0.030
lock+0.029
 Steve+0.028
 X+0.028
 Mark+0.026
Neg logit contributions
arre-0.032
 ow-0.030
 mosquitoes-0.030
 wil-0.030
 allergies-0.029
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
 guide+0.007
lock+0.007
 X+0.007
 Steve+0.007
 Mark+0.006
Neg logit contributions
arre-0.009
 ow-0.009
 wil-0.009
 mosquitoes-0.009
 suffering-0.009
Token He
Token ID679
Feature activation+9.29e-01

Pos logit contributions
 guide+0.009
lock+0.009
 X+0.009
 map+0.008
 steer+0.008
Neg logit contributions
arre-0.013
pill-0.011
der-0.011
Acc-0.011
 suffering-0.011
Token asked
Token ID1965
Feature activation+2.00e-01

Pos logit contributions
lock+0.020
 guide+0.020
 X+0.019
 Steve+0.019
 Mark+0.018
Neg logit contributions
arre-0.028
 ow-0.026
 wil-0.026
 suffering-0.026
 mosquitoes-0.025
Token lots
Token ID6041
Feature activation+5.35e-01

Pos logit contributions
lock+0.005
 X+0.004
 guide+0.004
lot+0.004
 Steve+0.004
Neg logit contributions
arre-0.005
 mosquitoes-0.005
 ow-0.005
 suffering-0.005
onel-0.005
Token of
Token ID286
Feature activation+1.79e-01

Pos logit contributions
lock+0.011
 guide+0.010
 X+0.010
 Steve+0.010
lot+0.009
Neg logit contributions
arre-0.017
 mosquitoes-0.016
 ow-0.016
 wil-0.016
 suffering-0.016
Token questions
Token ID2683
Feature activation+4.16e+00

Pos logit contributions
lock+0.004
 guide+0.004
 X+0.004
 Steve+0.004
 Mark+0.004
Neg logit contributions
arre-0.005
 ow-0.005
 mosquitoes-0.005
 wil-0.005
 suffering-0.005
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
lock+0.081
 guide+0.080
 X+0.078
 Steve+0.075
lot+0.070
Neg logit contributions
arre-0.128
 ow-0.119
 suffering-0.118
 dying-0.117
der-0.117
Token while
Token ID981
Feature activation-4.55e-01

Pos logit contributions
arre+0.005
 ow+0.005
 mosquitoes+0.005
 wil+0.005
 suffering+0.005
Neg logit contributions
 guide-0.003
lock-0.003
 X-0.003
 Steve-0.003
 Mark-0.002
Token the
Token ID262
Feature activation-6.34e-01

Pos logit contributions
arre+0.015
 ow+0.014
 mosquitoes+0.014
 suffering+0.014
 dying+0.014
Neg logit contributions
lock-0.008
 guide-0.008
 X-0.008
 Steve-0.008
lot-0.007
Token wizard
Token ID18731
Feature activation-5.11e-01

Pos logit contributions
arre+0.022
 ow+0.020
Acc+0.020
ivery+0.020
 wil+0.020
Neg logit contributions
 guide-0.013
lock-0.012
 X-0.011
 Steve-0.011
 point-0.010
Token laughed
Token ID13818
Feature activation-1.85e+00

Pos logit contributions
arre+0.015
 ow+0.014
 wil+0.014
 mosquitoes+0.014
 suffering+0.014
Neg logit contributions
lock-0.011
 guide-0.011
 X-0.011
 Steve-0.011
 Mark-0.010
Token stories
Token ID3923
Feature activation+1.03e+00

Pos logit contributions
arre+0.011
 ow+0.010
 mosquitoes+0.010
 wil+0.010
 suffering+0.010
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.007
 Mark-0.006
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
 guide+0.022
lock+0.022
 X+0.022
 Steve+0.022
 Mark+0.020
Neg logit contributions
arre-0.030
 ow-0.029
 wil-0.028
 mosquitoes-0.028
 dying-0.027
Token She
Token ID1375
Feature activation+3.24e-01

Pos logit contributions
 Steve+0.003
lock+0.003
 Mark+0.003
 X+0.003
 guide+0.003
Neg logit contributions
arre-0.005
 ow-0.004
 wil-0.004
 mosquitoes-0.004
ilt-0.004
Token asked
Token ID1965
Feature activation+2.97e-01

Pos logit contributions
 guide+0.007
lock+0.007
 X+0.007
 Steve+0.007
 Mark+0.006
Neg logit contributions
arre-0.010
 ow-0.009
 wil-0.009
 mosquitoes-0.009
 suffering-0.009
Token many
Token ID867
Feature activation+7.08e-01

Pos logit contributions
 guide+0.007
lock+0.007
 X+0.007
 Steve+0.007
 Mark+0.006
Neg logit contributions
arre-0.008
 ow-0.008
 mosquitoes-0.008
 wil-0.008
 suffering-0.008
Token questions
Token ID2683
Feature activation+4.15e+00

Pos logit contributions
 guide+0.015
lock+0.015
 X+0.015
 Steve+0.015
 Mark+0.013
Neg logit contributions
arre-0.022
 ow-0.020
 wil-0.020
 mosquitoes-0.020
 dying-0.020
Token and
Token ID290
Feature activation+1.01e+00

Pos logit contributions
lock+0.082
 guide+0.081
 X+0.079
 Steve+0.077
lot+0.073
Neg logit contributions
arre-0.123
 ow-0.119
 mosquitoes-0.117
der-0.117
 dying-0.115
Token learned
Token ID4499
Feature activation+1.10e+00

Pos logit contributions
 guide+0.022
lock+0.022
 X+0.021
 Steve+0.021
 Mark+0.019
Neg logit contributions
arre-0.031
 ow-0.029
 mosquitoes-0.028
 wil-0.028
 allergies-0.028
Token many
Token ID867
Feature activation+4.64e-01

Pos logit contributions
 guide+0.025
lock+0.024
 X+0.024
 Steve+0.024
 Mark+0.022
Neg logit contributions
arre-0.033
 ow-0.031
 wil-0.030
ivery-0.030
 mosquitoes-0.030
Token new
Token ID649
Feature activation+1.19e+00

Pos logit contributions
lock+0.012
 guide+0.011
 X+0.011
 Steve+0.011
lot+0.010
Neg logit contributions
arre-0.013
 ow-0.012
 mosquitoes-0.012
 suffering-0.012
 wil-0.012
Token words
Token ID2456
Feature activation+2.68e+00

Pos logit contributions
lock+0.031
 guide+0.029
 X+0.029
 Steve+0.028
 Kyle+0.026
Neg logit contributions
arre-0.032
 mosquitoes-0.029
 ow-0.029
 allergies-0.028
 suffering-0.028
Token new
Token ID649
Feature activation+1.10e+00

Pos logit contributions
arre+0.020
 ow+0.019
 wil+0.018
ivery+0.018
 mosquitoes+0.018
Neg logit contributions
 guide-0.013
 Steve-0.012
lock-0.012
 X-0.012
 Mark-0.011
Token.
Token ID13
Feature activation+8.10e-02

Pos logit contributions
 guide+0.021
lock+0.021
 Steve+0.020
 X+0.020
 Mark+0.018
Neg logit contributions
arre-0.037
 ow-0.035
 mosquitoes-0.034
 wil-0.034
 dying-0.033
Token He
Token ID679
Feature activation+7.84e-01

Pos logit contributions
 guide+0.001
lock+0.001
 Steve+0.001
 X+0.001
 Mark+0.001
Neg logit contributions
arre-0.003
 ow-0.003
 mosquitoes-0.003
 wil-0.003
 dying-0.003
Token kept
Token ID4030
Feature activation+5.52e-02

Pos logit contributions
 guide+0.017
lock+0.016
 X+0.016
 Steve+0.016
 Mark+0.015
Neg logit contributions
arre-0.024
 ow-0.023
 mosquitoes-0.023
 wil-0.022
 dying-0.022
Token asking
Token ID4737
Feature activation+1.97e+00

Pos logit contributions
 guide+0.002
lock+0.002
 X+0.002
 Steve+0.002
 Mark+0.001
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 wil-0.001
 suffering-0.001
Token questions
Token ID2683
Feature activation+4.15e+00

Pos logit contributions
lock+0.052
 X+0.049
lot+0.048
 guide+0.046
lynn+0.045
Neg logit contributions
arre-0.045
 suffering-0.044
 allergies-0.044
 mosquitoes-0.043
onel-0.043
Token about
Token ID546
Feature activation+8.61e-01

Pos logit contributions
lock+0.077
 guide+0.076
 X+0.075
 Steve+0.072
lynn+0.066
Neg logit contributions
arre-0.132
 ow-0.122
 mosquitoes-0.121
 suffering-0.120
 allergies-0.120
Token it
Token ID340
Feature activation-8.89e-01

Pos logit contributions
 guide+0.018
 Steve+0.017
 X+0.017
lock+0.017
 Mark+0.016
Neg logit contributions
arre-0.028
 ow-0.025
 wil-0.025
ivery-0.025
onel-0.025
Token for
Token ID329
Feature activation-2.30e-01

Pos logit contributions
arre+0.032
 ow+0.030
 mosquitoes+0.030
 wil+0.030
 suffering+0.029
Neg logit contributions
 guide-0.014
lock-0.014
 X-0.014
 Steve-0.013
 Mark-0.012
Token the
Token ID262
Feature activation-2.13e-01

Pos logit contributions
arre+0.007
 ow+0.007
 mosquitoes+0.007
 wil+0.006
 suffering+0.006
Neg logit contributions
lock-0.005
 guide-0.005
 X-0.005
 Steve-0.005
 Mark-0.004
Token rest
Token ID1334
Feature activation-1.43e+00

Pos logit contributions
arre+0.006
ivery+0.006
ins+0.006
 ow+0.006
Acc+0.006
Neg logit contributions
 guide-0.005
 X-0.005
 Steve-0.005
lock-0.005
 point-0.005
Token He
Token ID679
Feature activation+1.10e+00

Pos logit contributions
 guide+0.003
lock+0.003
 X+0.003
 Steve+0.003
 Mark+0.003
Neg logit contributions
arre-0.007
 ow-0.006
 mosquitoes-0.006
 wil-0.006
 suffering-0.006
Token asked
Token ID1965
Feature activation+2.68e-01

Pos logit contributions
 guide+0.025
lock+0.025
 X+0.024
 Steve+0.024
 Mark+0.022
Neg logit contributions
arre-0.032
 ow-0.030
 mosquitoes-0.030
 wil-0.029
 suffering-0.029
Token a
Token ID257
Feature activation+6.09e-03

Pos logit contributions
lock+0.006
 X+0.006
 guide+0.006
lot+0.006
 Steve+0.006
Neg logit contributions
arre-0.007
 mosquitoes-0.007
 ow-0.007
 suffering-0.007
 allergies-0.007
Token lot
Token ID1256
Feature activation+1.34e+00

Pos logit contributions
lock+0.000
 X+0.000
 Steve+0.000
 guide+0.000
 Kyle+0.000
Neg logit contributions
arre-0.000
 ow-0.000
 mosquitoes-0.000
 wil-0.000
 suffering-0.000
Token of
Token ID286
Feature activation+3.62e-01

Pos logit contributions
lock+0.027
 guide+0.026
 X+0.026
 Steve+0.026
lot+0.024
Neg logit contributions
arre-0.041
 mosquitoes-0.040
 ow-0.039
 wil-0.038
ins-0.038
Token questions
Token ID2683
Feature activation+4.15e+00

Pos logit contributions
 guide+0.008
lock+0.008
 X+0.008
 Steve+0.008
 Mark+0.007
Neg logit contributions
arre-0.011
 ow-0.010
 mosquitoes-0.010
 wil-0.010
 dying-0.010
Token,
Token ID11
Feature activation-7.43e-02

Pos logit contributions
lock+0.078
 guide+0.077
 X+0.075
 Steve+0.074
lot+0.067
Neg logit contributions
arre-0.133
 ow-0.125
 mosquitoes-0.122
 suffering-0.121
 wil-0.121
Token but
Token ID475
Feature activation+1.36e+00

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.002
 wil+0.002
 dying+0.002
Neg logit contributions
 guide-0.001
lock-0.001
 X-0.001
 Steve-0.001
 Mark-0.001
Token his
Token ID465
Feature activation+6.76e-01

Pos logit contributions
lock+0.028
 guide+0.028
 X+0.027
 Steve+0.025
lot+0.025
Neg logit contributions
arre-0.041
 ow-0.039
 mosquitoes-0.039
 suffering-0.039
 dying-0.038
Token mom
Token ID1995
Feature activation-2.99e-01

Pos logit contributions
 guide+0.013
 X+0.012
lock+0.012
 Steve+0.012
 Mark+0.011
Neg logit contributions
arre-0.024
 ow-0.022
 mosquitoes-0.022
 wil-0.021
Acc-0.021
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
arre+0.008
 ow+0.007
 wil+0.007
 suffering+0.007
 allergies+0.007
Neg logit contributions
lock-0.008
 guide-0.007
 X-0.007
 Steve-0.007
lot-0.007
Token save
Token ID3613
Feature activation+9.26e-01

Pos logit contributions
 Steve+0.008
 Kyle+0.007
 X+0.007
 guide+0.007
lynn+0.007
Neg logit contributions
arre-0.011
ins-0.010
 spilled-0.009
 suffering-0.009
 sick-0.009
Token money
Token ID1637
Feature activation+3.13e+00

Pos logit contributions
lock+0.022
 guide+0.021
 X+0.021
 Steve+0.020
lot+0.020
Neg logit contributions
 mosquitoes-0.025
arre-0.025
 ow-0.025
 wil-0.024
 suffering-0.023
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
 Steve+0.073
 X+0.072
 guide+0.070
lock+0.069
 Kyle+0.067
Neg logit contributions
arre-0.082
 wil-0.079
 suffering-0.078
 dying-0.078
 allergies-0.077
Token She
Token ID1375
Feature activation+4.38e-01

Pos logit contributions
 Steve+0.003
lock+0.003
 X+0.003
 Mark+0.003
 guide+0.003
Neg logit contributions
arre-0.006
 ow-0.005
onel-0.005
 wil-0.005
 allergies-0.005
Token put
Token ID1234
Feature activation+1.20e+00

Pos logit contributions
 Steve+0.012
lock+0.011
 Kyle+0.011
lynn+0.011
 X+0.011
Neg logit contributions
 spilled-0.010
arre-0.010
 wil-0.010
ins-0.009
 allergies-0.009
Token coins
Token ID10796
Feature activation+4.13e+00

Pos logit contributions
 guide+0.028
lock+0.028
 Steve+0.027
 X+0.027
 Mark+0.025
Neg logit contributions
arre-0.034
 ow-0.033
 mosquitoes-0.032
 wil-0.032
 dying-0.031
Token in
Token ID287
Feature activation+1.18e+00

Pos logit contributions
 Steve+0.115
 Kyle+0.108
 Mark+0.107
 X+0.106
 guide+0.104
Neg logit contributions
 wil-0.086
ins-0.085
 dying-0.083
arre-0.083
 asleep-0.083
Token a
Token ID257
Feature activation+1.46e-01

Pos logit contributions
lock+0.027
 guide+0.027
 Steve+0.026
 X+0.026
lot+0.024
Neg logit contributions
arre-0.033
 ow-0.032
 mosquitoes-0.031
 wil-0.031
 suffering-0.030
Token jar
Token ID17379
Feature activation+4.89e-01

Pos logit contributions
 guide+0.003
lock+0.003
 Steve+0.003
 X+0.003
 Mark+0.003
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.004
 wil-0.004
 dying-0.004
Token.
Token ID13
Feature activation+5.74e-01

Pos logit contributions
 guide+0.009
 Steve+0.009
 X+0.009
lock+0.008
 Mark+0.008
Neg logit contributions
arre-0.016
 wil-0.015
 ow-0.015
 mosquitoes-0.015
 dying-0.015
Token One
Token ID1881
Feature activation+6.57e-01

Pos logit contributions
lock+0.011
 Steve+0.011
 X+0.011
 guide+0.011
 Mark+0.011
Neg logit contributions
arre-0.019
 wil-0.018
 ow-0.018
onel-0.018
 suffering-0.018
Token blocks
Token ID7021
Feature activation+1.86e+00

Pos logit contributions
 X+0.008
 guide+0.008
 Steve+0.008
lock+0.007
 Mark+0.007
Neg logit contributions
arre-0.013
 ow-0.012
onel-0.012
ivery-0.012
ivering-0.012
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
 guide+0.036
lock+0.036
 X+0.035
 Steve+0.034
 Mark+0.031
Neg logit contributions
arre-0.061
 ow-0.058
 mosquitoes-0.057
 wil-0.056
 allergies-0.056
Token They
Token ID1119
Feature activation+6.00e-01

Pos logit contributions
 guide+0.004
 Steve+0.004
lock+0.004
 X+0.004
 Mark+0.003
Neg logit contributions
arre-0.008
 ow-0.008
 wil-0.008
 mosquitoes-0.008
 suffering-0.008
Token built
Token ID3170
Feature activation+9.40e-01

Pos logit contributions
 guide+0.014
lock+0.014
 X+0.014
 Steve+0.013
 Mark+0.012
Neg logit contributions
arre-0.017
 ow-0.016
 mosquitoes-0.016
 wil-0.016
 allergies-0.016
Token tall
Token ID7331
Feature activation+1.04e+00

Pos logit contributions
lock+0.024
 guide+0.023
 Steve+0.022
 X+0.021
lot+0.021
Neg logit contributions
arre-0.025
 ow-0.024
 mosquitoes-0.024
 wil-0.023
 dying-0.023
Token houses
Token ID7777
Feature activation+4.13e+00

Pos logit contributions
lock+0.030
 guide+0.029
 Steve+0.028
lot+0.027
lynn+0.027
Neg logit contributions
arre-0.024
 ow-0.023
 mosquitoes-0.023
 wil-0.022
 suffering-0.021
Token and
Token ID290
Feature activation+2.28e-01

Pos logit contributions
lock+0.092
lynn+0.088
lot+0.085
 Steve+0.084
 Connor+0.083
Neg logit contributions
arre-0.112
 dying-0.103
 wil-0.102
 asleep-0.100
 ow-0.100
Token skinny
Token ID29494
Feature activation+3.26e-01

Pos logit contributions
lock+0.006
 guide+0.006
lynn+0.005
 X+0.005
 Steve+0.005
Neg logit contributions
arre-0.006
 mosquitoes-0.005
 ow-0.005
 wil-0.005
 dying-0.005
Token houses
Token ID7777
Feature activation+2.72e+00

Pos logit contributions
 guide+0.008
lock+0.008
 X+0.007
 Steve+0.007
 Mark+0.007
Neg logit contributions
arre-0.009
 ow-0.009
 mosquitoes-0.009
 wil-0.008
 suffering-0.008
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
lock+0.056
 guide+0.054
 Steve+0.053
 X+0.053
lynn+0.051
Neg logit contributions
arre-0.081
 ow-0.076
 wil-0.076
 dying-0.076
 suffering-0.074
Token They
Token ID1119
Feature activation+5.64e-01

Pos logit contributions
 guide+0.008
lock+0.007
 Steve+0.007
 X+0.007
 Mark+0.007
Neg logit contributions
arre-0.013
 ow-0.013
 wil-0.013
 mosquitoes-0.013
 suffering-0.012
Token.
Token ID13
Feature activation+6.46e-01

Pos logit contributions
arre+0.057
 ow+0.054
 wil+0.054
 mosquitoes+0.053
 allergies+0.052
Neg logit contributions
 guide-0.029
 Steve-0.028
lock-0.028
 X-0.028
 Mark-0.026
Token Emily
Token ID17608
Feature activation-7.32e-01

Pos logit contributions
 guide+0.012
lock+0.012
 X+0.011
 Steve+0.011
 Mark+0.010
Neg logit contributions
arre-0.022
 ow-0.021
 mosquitoes-0.021
 wil-0.021
 suffering-0.020
Token asked
Token ID1965
Feature activation+5.04e-01

Pos logit contributions
arre+0.025
 ow+0.024
 mosquitoes+0.024
onel+0.023
ivery+0.023
Neg logit contributions
 guide-0.013
 Steve-0.012
lock-0.012
 X-0.012
 Mark-0.011
Token lots
Token ID6041
Feature activation+6.54e-01

Pos logit contributions
 guide+0.011
lock+0.011
 Steve+0.011
 X+0.010
 Mark+0.010
Neg logit contributions
arre-0.015
 ow-0.015
 wil-0.014
 mosquitoes-0.014
 dying-0.014
Token of
Token ID286
Feature activation+2.94e-01

Pos logit contributions
 guide+0.010
lock+0.009
 Steve+0.009
 X+0.009
 Mark+0.008
Neg logit contributions
arre-0.024
 ow-0.023
 wil-0.023
 mosquitoes-0.022
 dying-0.022
Token questions
Token ID2683
Feature activation+4.09e+00

Pos logit contributions
 guide+0.006
 Steve+0.006
 X+0.006
lock+0.006
 Mark+0.006
Neg logit contributions
arre-0.009
 ow-0.009
 wil-0.009
ivery-0.008
 dying-0.008
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
 guide+0.071
lock+0.070
 X+0.067
 Steve+0.066
 Mark+0.060
Neg logit contributions
arre-0.142
 ow-0.135
 mosquitoes-0.133
 wil-0.132
 suffering-0.130
Token including
Token ID1390
Feature activation+1.34e+00

Pos logit contributions
arre+0.007
ivery+0.007
ilt+0.007
 ow+0.007
 wil+0.007
Neg logit contributions
 Steve-0.003
 guide-0.003
 Mark-0.002
 X-0.002
lock-0.002
Token "
Token ID366
Feature activation+2.87e-01

Pos logit contributions
 Steve+0.022
 guide+0.022
 Mark+0.021
 X+0.020
 Kyle+0.018
Neg logit contributions
arre-0.050
ivery-0.048
ivering-0.047
Acc-0.046
 wil-0.045
TokenWhat
Token ID2061
Feature activation+3.13e-01

Pos logit contributions
lock+0.008
 guide+0.007
 Steve+0.007
 X+0.007
 steer+0.007
Neg logit contributions
 ow-0.008
arre-0.007
 wil-0.007
 dying-0.007
 suffering-0.007
Token's
Token ID338
Feature activation+6.58e-01

Pos logit contributions
lock+0.007
 guide+0.007
 X+0.007
 Steve+0.007
lot+0.006
Neg logit contributions
arre-0.009
 ow-0.009
 mosquitoes-0.009
 allergies-0.008
 wil-0.008
Token valleys
Token ID43088
Feature activation+1.28e+00

Pos logit contributions
lock+0.005
 X+0.005
 guide+0.004
 Steve+0.004
lynn+0.004
Neg logit contributions
arre-0.005
 mosquitoes-0.004
 wil-0.004
 ow-0.004
 suffering-0.004
Token.
Token ID13
Feature activation+3.07e-01

Pos logit contributions
lock+0.027
 X+0.025
 guide+0.025
 Steve+0.025
lynn+0.023
Neg logit contributions
arre-0.037
 ow-0.037
 mosquitoes-0.036
 wil-0.036
 dying-0.035
Token He
Token ID679
Feature activation+1.20e+00

Pos logit contributions
 guide+0.004
lock+0.004
 X+0.004
 Steve+0.004
 steer+0.003
Neg logit contributions
arre-0.011
 ow-0.011
 mosquitoes-0.011
 wil-0.011
 suffering-0.011
Token flew
Token ID13112
Feature activation+1.17e+00

Pos logit contributions
 guide+0.026
lock+0.026
 X+0.025
 Steve+0.025
 Mark+0.023
Neg logit contributions
arre-0.037
 ow-0.035
 mosquitoes-0.034
 wil-0.034
 suffering-0.033
Token over
Token ID625
Feature activation+1.23e+00

Pos logit contributions
lock+0.029
 guide+0.027
 X+0.027
 Steve+0.025
lynn+0.025
Neg logit contributions
arre-0.030
 mosquitoes-0.029
 ow-0.028
 asleep-0.028
 wil-0.028
Token mountains
Token ID12269
Feature activation+4.09e+00

Pos logit contributions
lock+0.028
 guide+0.028
 Steve+0.026
 X+0.026
lot+0.025
Neg logit contributions
arre-0.035
 mosquitoes-0.034
 ow-0.034
 wil-0.033
 suffering-0.032
Token and
Token ID290
Feature activation+5.54e-01

Pos logit contributions
lock+0.096
 Kyle+0.087
lynn+0.087
 X+0.086
 Steve+0.085
Neg logit contributions
der-0.104
 asleep-0.098
arre-0.098
 wil-0.097
 mosquitoes-0.096
Token woods
Token ID16479
Feature activation+1.55e+00

Pos logit contributions
lock+0.015
 guide+0.014
lynn+0.014
 X+0.014
 Steve+0.014
Neg logit contributions
arre-0.013
 mosquitoes-0.013
 ow-0.012
 wil-0.012
 dying-0.012
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
lock+0.035
 Steve+0.034
 X+0.034
 guide+0.033
lynn+0.032
Neg logit contributions
arre-0.041
 wil-0.040
 ow-0.040
 dying-0.040
 mosquitoes-0.040
Token He
Token ID679
Feature activation+1.21e+00

Pos logit contributions
 guide+0.007
lock+0.007
 X+0.006
 Steve+0.006
 steer+0.005
Neg logit contributions
arre-0.016
 ow-0.015
 mosquitoes-0.015
 allergies-0.014
onel-0.014
Token rushed
Token ID13999
Feature activation+3.28e-01

Pos logit contributions
lock+0.027
 guide+0.027
 X+0.026
 Steve+0.026
 Mark+0.024
Neg logit contributions
arre-0.036
 ow-0.034
 wil-0.033
 mosquitoes-0.033
 suffering-0.032
Token a
Token ID257
Feature activation-1.76e-01

Pos logit contributions
 mosquitoes+0.007
arre+0.007
 ow+0.007
 wil+0.007
 allergies+0.007
Neg logit contributions
 guide-0.006
lock-0.006
 X-0.006
 Steve-0.006
 Kyle-0.005
Token spaceship
Token ID40663
Feature activation-1.85e-01

Pos logit contributions
arre+0.006
 ow+0.005
 mosquitoes+0.005
 wil+0.005
 dying+0.005
Neg logit contributions
lock-0.003
 guide-0.003
 Steve-0.003
 X-0.003
 Mark-0.003
Token.
Token ID13
Feature activation-9.05e-02

Pos logit contributions
arre+0.006
 ow+0.006
 mosquitoes+0.006
 wil+0.006
 allergies+0.005
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.003
 Steve-0.003
 Mark-0.003
Token They
Token ID1119
Feature activation+3.07e-01

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.002
 wil+0.002
 allergies+0.002
Neg logit contributions
 guide-0.002
lock-0.002
 X-0.002
 Steve-0.002
lot-0.002
Token put
Token ID1234
Feature activation+6.34e-01

Pos logit contributions
lock+0.008
 Steve+0.008
 guide+0.007
lynn+0.007
lot+0.007
Neg logit contributions
arre-0.007
 wil-0.007
 spilled-0.007
 ow-0.007
 dying-0.007
Token stickers
Token ID28568
Feature activation+4.09e+00

Pos logit contributions
lock+0.020
lot+0.018
 guide+0.018
ver+0.017
lynn+0.017
Neg logit contributions
 mosquitoes-0.013
 ow-0.013
 wil-0.012
 dying-0.012
 bubbles-0.012
Token and
Token ID290
Feature activation+7.30e-01

Pos logit contributions
 Steve+0.105
 guide+0.105
lock+0.103
 X+0.101
lynn+0.099
Neg logit contributions
arre-0.087
 asleep-0.083
ins-0.078
 spilled-0.077
 wil-0.077
Token drawings
Token ID23388
Feature activation+1.46e+00

Pos logit contributions
 Steve+0.018
lock+0.018
 guide+0.018
lynn+0.017
 Kyle+0.017
Neg logit contributions
 ow-0.018
arre-0.018
 mosquitoes-0.017
 wil-0.017
 allergies-0.017
Token on
Token ID319
Feature activation-8.91e-01

Pos logit contributions
 Steve+0.032
lock+0.032
 guide+0.031
 X+0.031
lynn+0.029
Neg logit contributions
arre-0.039
 ow-0.037
 wil-0.037
onel-0.036
 dying-0.036
Token it
Token ID340
Feature activation-1.57e+00

Pos logit contributions
 ow+0.021
 mosquitoes+0.020
 wil+0.020
 dying+0.019
 ger+0.019
Neg logit contributions
lock-0.024
 guide-0.022
lot-0.022
 Steve-0.021
lynn-0.021
Token to
Token ID284
Feature activation-1.50e+00

Pos logit contributions
arre+0.055
 ow+0.052
 mosquitoes+0.051
 wil+0.051
 allergies+0.050
Neg logit contributions
 guide-0.028
lock-0.027
 X-0.026
 Steve-0.026
 Mark-0.024
Token it
Token ID340
Feature activation-1.09e+00

Pos logit contributions
arre+0.002
 ow+0.002
 wil+0.002
 mosquitoes+0.002
 dying+0.002
Neg logit contributions
lock-0.001
 guide-0.001
 Steve-0.001
 X-0.001
lot-0.001
Token over
Token ID625
Feature activation+4.57e-02

Pos logit contributions
arre+0.042
 ow+0.040
 mosquitoes+0.039
 disl+0.039
pill+0.038
Neg logit contributions
lock-0.017
 guide-0.016
 X-0.015
 Steve-0.014
 unlock-0.013
Token her
Token ID607
Feature activation-2.32e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 wil-0.001
onel-0.001
Token and
Token ID290
Feature activation+6.76e-01

Pos logit contributions
arre+0.007
 ow+0.007
 mosquitoes+0.007
 wil+0.007
 suffering+0.007
Neg logit contributions
 guide-0.005
lock-0.005
 X-0.005
 Steve-0.004
 Mark-0.004
Token her
Token ID607
Feature activation-3.22e-01

Pos logit contributions
 guide+0.015
lock+0.015
 X+0.014
 Steve+0.014
 Mark+0.013
Neg logit contributions
arre-0.020
 ow-0.019
 wil-0.019
 mosquitoes-0.019
 suffering-0.018
Token baby
Token ID5156
Feature activation-4.79e+00

Pos logit contributions
arre+0.010
 ow+0.009
 mosquitoes+0.009
 wil+0.009
ilt+0.009
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.006
 Mark-0.006
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
arre+0.138
 ow+0.131
 mosquitoes+0.127
 wil+0.127
 allergies+0.125
Neg logit contributions
 guide-0.111
lock-0.110
 X-0.107
 Steve-0.107
 Mark-0.098
Token The
Token ID383
Feature activation-2.25e-03

Pos logit contributions
lock+0.002
 Steve+0.002
 guide+0.002
 X+0.002
 Mark+0.002
Neg logit contributions
arre-0.005
 ow-0.005
 wil-0.005
 mosquitoes-0.005
onel-0.005
Token gear
Token ID7733
Feature activation-6.78e-02

Pos logit contributions
arre+0.000
pill+0.000
onel+0.000
ins+0.000
ilt+0.000
Neg logit contributions
 guide-0.000
 X-0.000
lock-0.000
 Steve-0.000
 Mark-0.000
Token was
Token ID373
Feature activation+8.54e-01

Pos logit contributions
arre+0.002
 ow+0.002
 mosquitoes+0.002
 wil+0.002
 suffering+0.002
Neg logit contributions
 guide-0.001
lock-0.001
 X-0.001
 Steve-0.001
 Mark-0.001
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
 guide+0.018
 X+0.018
lock+0.018
 Steve+0.018
 Mark+0.017
Neg logit contributions
arre-0.027
 mosquitoes-0.024
 ow-0.024
onel-0.024
 allergies-0.023
Token friends
Token ID2460
Feature activation-6.48e-01

Pos logit contributions
 guide+0.004
lock+0.004
 X+0.004
 Steve+0.004
 Mark+0.004
Neg logit contributions
arre-0.006
 ow-0.006
 wil-0.006
 mosquitoes-0.006
ilt-0.006
Token without
Token ID1231
Feature activation-1.84e+00

Pos logit contributions
arre+0.022
 ow+0.021
 mosquitoes+0.021
 wil+0.021
 dying+0.020
Neg logit contributions
 guide-0.012
lock-0.012
 X-0.011
 Steve-0.011
 Mark-0.010
Token any
Token ID597
Feature activation-1.27e+00

Pos logit contributions
arre+0.046
ilt+0.041
onel+0.040
ivery+0.040
 ow+0.040
Neg logit contributions
 guide-0.051
 Steve-0.050
 X-0.050
 Mark-0.048
 Kyle-0.048
Token more
Token ID517
Feature activation-1.10e+00

Pos logit contributions
arre+0.038
 ow+0.034
Acc+0.033
ivery+0.033
ilt+0.033
Neg logit contributions
 guide-0.031
lock-0.029
 X-0.029
 Steve-0.027
 point-0.026
Token toe
Token ID21189
Feature activation-3.16e+00

Pos logit contributions
arre+0.035
ivery+0.030
Acc+0.030
 ow+0.030
onel+0.030
Neg logit contributions
 guide-0.025
 X-0.024
lock-0.024
 Steve-0.023
 Mark-0.022
Token stub
Token ID17071
Feature activation-4.60e+00

Pos logit contributions
arre+0.118
Acc+0.108
ivery+0.106
 ow+0.104
ilt+0.101
Neg logit contributions
lock-0.056
 guide-0.052
 Steve-0.045
 X-0.043
 Mark-0.042
Tokens
Token ID82
Feature activation-1.40e+00

Pos logit contributions
arre+0.128
 ow+0.118
 wil+0.117
 disl+0.114
 dying+0.113
Neg logit contributions
lock-0.113
 guide-0.104
 Steve-0.103
lot-0.098
 Mark-0.098
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
arre+0.050
 ow+0.047
Acc+0.046
 mosquitoes+0.046
 wil+0.045
Neg logit contributions
 guide-0.025
lock-0.024
 X-0.023
 Steve-0.021
 Mark-0.020
Token<|endoftext|>
Token ID50256
Feature activation-3.15e-01

Pos logit contributions
arre+0.019
 mosquitoes+0.018
ilt+0.018
 wil+0.018
ins+0.018
Neg logit contributions
 Steve-0.005
 Mark-0.005
 guide-0.005
lock-0.005
 Copper-0.005
TokenOnce
Token ID7454
Feature activation-9.49e-01

Pos logit contributions
arre+0.008
 ow+0.008
 mosquitoes+0.007
 wil+0.007
ilt+0.007
Neg logit contributions
 guide-0.008
lock-0.008
 X-0.008
 Steve-0.008
 steer-0.007
Token upon
Token ID2402
Feature activation-9.54e-01

Pos logit contributions
arre+0.026
 ow+0.024
 mosquitoes+0.024
 wil+0.023
 suffering+0.023
Neg logit contributions
 guide-0.024
lock-0.023
 X-0.023
 Steve-0.023
 Mark-0.021
Token quick
Token ID2068
Feature activation-2.70e+00

Pos logit contributions
arre+0.009
 ow+0.009
 mosquitoes+0.008
 wil+0.008
 allergies+0.008
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.007
 Mark-0.006
Token thinking
Token ID3612
Feature activation-7.03e-01

Pos logit contributions
arre+0.082
 ow+0.076
 wil+0.076
 mosquitoes+0.075
Acc+0.074
Neg logit contributions
 guide-0.060
 X-0.058
lock-0.058
 Steve-0.057
 Mark-0.053
Token and
Token ID290
Feature activation-5.71e-01

Pos logit contributions
arre+0.025
 ow+0.024
 mosquitoes+0.023
 wil+0.023
Acc+0.023
Neg logit contributions
 guide-0.012
lock-0.011
 X-0.011
 Steve-0.011
 Mark-0.010
Token his
Token ID465
Feature activation+1.75e-02

Pos logit contributions
arre+0.016
 ow+0.016
 mosquitoes+0.015
 wil+0.015
 suffering+0.015
Neg logit contributions
lock-0.013
 guide-0.013
 X-0.013
 Steve-0.013
 Mark-0.012
Token trust
Token ID3774
Feature activation-1.27e+00

Pos logit contributions
 guide+0.000
lock+0.000
 X+0.000
 Steve+0.000
 Mark+0.000
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 wil-0.001
 allergies-0.001
Tokeny
Token ID88
Feature activation-4.59e+00

Pos logit contributions
arre+0.035
 ow+0.032
 mosquitoes+0.031
 wil+0.031
Acc+0.030
Neg logit contributions
 guide-0.032
lock-0.032
 Steve-0.031
 X-0.031
 Mark-0.029
Token flashlight
Token ID38371
Feature activation-2.83e+00

Pos logit contributions
arre+0.150
 wil+0.142
 mosquitoes+0.142
ilt+0.140
Acc+0.140
Neg logit contributions
 guide-0.096
lock-0.089
 Steve-0.087
 X-0.087
 Mark-0.085
Token.
Token ID13
Feature activation+8.58e-02

Pos logit contributions
arre+0.102
 mosquitoes+0.096
 ow+0.094
ilt+0.091
Acc+0.091
Neg logit contributions
 guide-0.052
lock-0.049
 X-0.045
 Steve-0.045
 Mark-0.042
Token Tim
Token ID5045
Feature activation+7.60e-01

Pos logit contributions
 guide+0.002
lock+0.001
 Steve+0.001
 X+0.001
 Mark+0.001
Neg logit contributions
arre-0.003
 ow-0.003
 mosquitoes-0.003
 wil-0.003
 allergies-0.003
Tokenmy
Token ID1820
Feature activation+3.74e-02

Pos logit contributions
lock+0.017
 guide+0.017
 X+0.017
 Steve+0.016
 Kyle+0.015
Neg logit contributions
arre-0.024
 ow-0.022
 suffering-0.021
 wil-0.021
 allergies-0.021
Token felt
Token ID2936
Feature activation-1.91e+00

Pos logit contributions
lock+0.001
 guide+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.001
 ow-0.001
 wil-0.001
 mosquitoes-0.001
 suffering-0.001
Token next
Token ID1306
Feature activation+1.11e+00

Pos logit contributions
arre+0.014
 ow+0.013
ins+0.012
 wil+0.012
ilt+0.012
Neg logit contributions
 guide-0.012
lock-0.012
 X-0.012
 Steve-0.011
 Mark-0.011
Token few
Token ID1178
Feature activation-1.73e+00

Pos logit contributions
 guide+0.028
lock+0.028
 X+0.027
 Steve+0.027
 Mark+0.025
Neg logit contributions
arre-0.030
 ow-0.028
 mosquitoes-0.027
 wil-0.027
 suffering-0.026
Token days
Token ID1528
Feature activation-7.29e-01

Pos logit contributions
arre+0.056
ivery+0.050
Acc+0.049
 wil+0.048
ilt+0.047
Neg logit contributions
 guide-0.039
lock-0.035
 X-0.035
 Steve-0.035
 Mark-0.034
Token until
Token ID1566
Feature activation-1.46e+00

Pos logit contributions
arre+0.024
 ow+0.023
 mosquitoes+0.023
 wil+0.022
 allergies+0.022
Neg logit contributions
lock-0.014
 guide-0.014
 Steve-0.013
 X-0.013
 Mark-0.012
Token his
Token ID465
Feature activation+3.23e-02

Pos logit contributions
arre+0.060
ins+0.055
 wil+0.055
pill+0.054
ilt+0.054
Neg logit contributions
 Steve-0.019
 guide-0.019
 X-0.018
lock-0.018
 Mark-0.017
Token ankle
Token ID19847
Feature activation-4.56e+00

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 point+0.001
 Steve+0.001
Neg logit contributions
arre-0.001
Acc-0.001
ilt-0.001
ins-0.001
ivery-0.001
Token healed
Token ID28557
Feature activation-1.19e+00

Pos logit contributions
arre+0.169
Acc+0.162
 mosquitoes+0.162
ins+0.160
 disl+0.156
Neg logit contributions
lock-0.072
 guide-0.068
 point-0.058
 steer-0.056
 pedal-0.056
Token.
Token ID13
Feature activation+8.12e-02

Pos logit contributions
arre+0.037
 ow+0.035
 mosquitoes+0.034
 wil+0.034
 suffering+0.033
Neg logit contributions
 guide-0.026
lock-0.025
 X-0.025
 Steve-0.024
 Mark-0.022
Token He
Token ID679
Feature activation+7.19e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
lot+0.001
Neg logit contributions
arre-0.003
 ow-0.003
 mosquitoes-0.003
 wil-0.003
 suffering-0.003
Token felt
Token ID2936
Feature activation-1.71e+00

Pos logit contributions
 guide+0.013
lock+0.013
 X+0.012
 Steve+0.012
 steer+0.011
Neg logit contributions
arre-0.025
 mosquitoes-0.023
ivery-0.022
 ow-0.022
onel-0.022
Token sad
Token ID6507
Feature activation-6.89e-01

Pos logit contributions
arre+0.058
ins+0.054
 mosquitoes+0.053
pill+0.052
 allergies+0.050
Neg logit contributions
 guide-0.031
lock-0.030
 X-0.029
 Steve-0.028
 steer-0.026
Token forward
Token ID2651
Feature activation+1.12e+00

Pos logit contributions
 X+0.015
 guide+0.015
lock+0.014
 Steve+0.014
 steer+0.013
Neg logit contributions
arre-0.016
 ow-0.015
onel-0.015
 suffering-0.014
 mosquitoes-0.014
Token,
Token ID11
Feature activation-8.22e-01

Pos logit contributions
 guide+0.007
 Steve+0.007
lock+0.006
 Mark+0.005
 X+0.005
Neg logit contributions
arre-0.053
 mosquitoes-0.050
ivery-0.050
ilt-0.050
 wil-0.050
Token Tweet
Token ID18752
Feature activation+6.12e-03

Pos logit contributions
arre+0.026
 ow+0.024
 wil+0.024
 mosquitoes+0.023
 dying+0.023
Neg logit contributions
 guide-0.018
 Steve-0.017
lock-0.017
 X-0.016
 Mark-0.016
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
 guide+0.000
 Steve+0.000
lot+0.000
lock+0.000
 X+0.000
Neg logit contributions
arre-0.000
 mosquitoes-0.000
 ow-0.000
ivery-0.000
onel-0.000
Token her
Token ID607
Feature activation-5.59e-01

Pos logit contributions
arre+0.041
ilt+0.036
 ow+0.036
ivery+0.036
 wil+0.035
Neg logit contributions
 guide-0.031
 Steve-0.030
 X-0.030
lock-0.029
 Mark-0.028
Token baby
Token ID5156
Feature activation-4.52e+00

Pos logit contributions
arre+0.018
ilt+0.016
 mosquitoes+0.016
ivery+0.016
Acc+0.016
Neg logit contributions
 guide-0.012
 X-0.012
lock-0.011
 Steve-0.011
 Mark-0.010
Token bird
Token ID6512
Feature activation-2.11e+00

Pos logit contributions
arre+0.144
 mosquitoes+0.132
 ow+0.130
ilt+0.129
 wil+0.126
Neg logit contributions
 guide-0.100
lock-0.093
 Steve-0.092
 X-0.091
 Mark-0.087
Token lived
Token ID5615
Feature activation-1.14e+00

Pos logit contributions
arre+0.069
 mosquitoes+0.065
 ow+0.064
ilt+0.062
 allergies+0.062
Neg logit contributions
 guide-0.044
lock-0.040
 Steve-0.040
 X-0.039
 Mark-0.037
Token happily
Token ID18177
Feature activation-2.49e-01

Pos logit contributions
arre+0.036
 ow+0.035
 mosquitoes+0.034
 wil+0.034
 allergies+0.033
Neg logit contributions
 guide-0.023
lock-0.022
 X-0.022
 Steve-0.022
 Mark-0.020
Token in
Token ID287
Feature activation-1.18e+00

Pos logit contributions
arre+0.009
 ow+0.009
 mosquitoes+0.009
 wil+0.009
Acc+0.009
Neg logit contributions
 guide-0.004
lock-0.004
 Steve-0.003
 X-0.003
 Mark-0.003
Token their
Token ID511
Feature activation-1.46e+00

Pos logit contributions
arre+0.045
ins+0.041
 ow+0.041
 mosquitoes+0.041
 allergies+0.041
Neg logit contributions
 guide-0.019
 Steve-0.018
 X-0.018
 Mark-0.016
lock-0.016
Token egg
Token ID5935
Feature activation-1.56e+00

Pos logit contributions
 guide+0.004
 X+0.004
lock+0.004
 point+0.003
 map+0.003
Neg logit contributions
arre-0.006
pill-0.005
ins-0.005
ivery-0.005
 Wed-0.005
Token was
Token ID373
Feature activation-4.42e-01

Pos logit contributions
arre+0.056
 mosquitoes+0.052
 ow+0.051
Acc+0.050
 allergies+0.050
Neg logit contributions
 guide-0.027
lock-0.026
 Steve-0.025
 X-0.025
 Mark-0.023
Token broken
Token ID5445
Feature activation-2.77e-01

Pos logit contributions
arre+0.014
pill+0.013
 mosquitoes+0.012
ins+0.012
 disl+0.012
Neg logit contributions
 guide-0.010
 X-0.009
lock-0.009
 unlock-0.009
 Steve-0.009
Token and
Token ID290
Feature activation-8.98e-01

Pos logit contributions
arre+0.010
 ow+0.009
 mosquitoes+0.009
 wil+0.009
 allergies+0.009
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.004
 Steve-0.004
 Mark-0.004
Token her
Token ID607
Feature activation-1.54e-01

Pos logit contributions
arre+0.034
ivery+0.030
ilt+0.030
pill+0.029
ins+0.029
Neg logit contributions
 Steve-0.016
 guide-0.015
 Mark-0.014
lock-0.014
 X-0.014
Token baby
Token ID5156
Feature activation-4.51e+00

Pos logit contributions
arre+0.005
ivery+0.005
Acc+0.005
ins+0.004
pill+0.004
Neg logit contributions
 guide-0.003
 X-0.003
lock-0.003
 chest-0.003
 Steve-0.003
Token bird
Token ID6512
Feature activation-2.10e+00

Pos logit contributions
arre+0.155
 mosquitoes+0.135
ilt+0.134
Acc+0.133
ivery+0.132
Neg logit contributions
 guide-0.092
lock-0.089
 Steve-0.084
 X-0.083
 Mark-0.081
Token was
Token ID373
Feature activation-1.31e+00

Pos logit contributions
arre+0.076
 mosquitoes+0.071
Acc+0.068
 ow+0.066
der+0.066
Neg logit contributions
 guide-0.036
lock-0.034
 Steve-0.033
 Mark-0.032
 X-0.031
Token gone
Token ID3750
Feature activation-1.48e+00

Pos logit contributions
pill+0.044
arre+0.042
 mosquitoes+0.040
affles+0.039
ins+0.039
Neg logit contributions
 guide-0.025
 close-0.025
 unlock-0.024
 open-0.024
lock-0.023
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
arre+0.056
 mosquitoes+0.054
 ow+0.053
pill+0.052
 disl+0.051
Neg logit contributions
 guide-0.021
lock-0.021
 X-0.019
 Steve-0.019
 chest-0.017
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
 Steve+0.004
lock+0.004
 guide+0.004
 X+0.004
 Mark+0.004
Neg logit contributions
arre-0.008
 ow-0.008
 mosquitoes-0.008
 wil-0.008
 allergies-0.008
Token the
Token ID262
Feature activation-5.19e-01

Pos logit contributions
arre+0.033
 ow+0.032
 mosquitoes+0.031
ivery+0.031
Acc+0.030
Neg logit contributions
 guide-0.016
lock-0.015
 Steve-0.014
 X-0.014
lot-0.013
Token gifted
Token ID22527
Feature activation-1.98e+00

Pos logit contributions
arre+0.017
ins+0.017
 wil+0.016
onel+0.016
ivery+0.016
Neg logit contributions
 guide-0.011
 X-0.010
 point-0.010
 map-0.009
 Steve-0.009
Token frog
Token ID21264
Feature activation-1.45e+00

Pos logit contributions
arre+0.061
Acc+0.058
ivery+0.058
 dying+0.056
onel+0.055
Neg logit contributions
 guide-0.045
 Steve-0.044
 X-0.042
 point-0.042
 Mark-0.040
Token and
Token ID290
Feature activation-3.93e-01

Pos logit contributions
arre+0.053
 ow+0.049
ivery+0.049
 mosquitoes+0.048
 wil+0.047
Neg logit contributions
 guide-0.025
lock-0.024
 Steve-0.023
 X-0.022
 Mark-0.021
Token the
Token ID262
Feature activation-5.18e-01

Pos logit contributions
arre+0.013
 ow+0.012
 mosquitoes+0.012
 wil+0.012
 suffering+0.012
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.007
 Mark-0.006
Token mosquito
Token ID35446
Feature activation-4.47e+00

Pos logit contributions
arre+0.016
ivery+0.015
ins+0.015
 wil+0.015
Acc+0.015
Neg logit contributions
 guide-0.012
 X-0.011
 point-0.011
 Steve-0.010
 map-0.010
Token ended
Token ID4444
Feature activation-1.96e+00

Pos logit contributions
arre+0.169
ivery+0.161
Acc+0.156
 ow+0.156
 mosquitoes+0.155
Neg logit contributions
 guide-0.079
 Steve-0.063
 point-0.059
lock-0.059
 Mark-0.058
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
arre+0.066
 ow+0.063
 mosquitoes+0.061
 wil+0.060
 allergies+0.060
Neg logit contributions
 guide-0.037
 Steve-0.036
 X-0.034
lock-0.034
 Mark-0.033
Token<|endoftext|>
Token ID50256
Feature activation-8.41e-01

Pos logit contributions
lock+0.003
 guide+0.003
 X+0.003
 steer+0.003
 Steve+0.002
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.003
 wil-0.003
 suffering-0.003
TokenOnce
Token ID7454
Feature activation-9.54e-01

Pos logit contributions
arre+0.021
 ow+0.020
 mosquitoes+0.019
 wil+0.019
 suffering+0.018
Neg logit contributions
lock-0.022
 guide-0.022
 X-0.021
 Steve-0.020
 steer-0.019
Token upon
Token ID2402
Feature activation-1.23e+00

Pos logit contributions
arre+0.025
 ow+0.024
 wil+0.023
 mosquitoes+0.023
 dying+0.022
Neg logit contributions
lock-0.025
 guide-0.025
 X-0.024
 Steve-0.023
lot-0.022
Token mess
Token ID2085
Feature activation-8.75e-01

Pos logit contributions
 guide+0.006
 X+0.005
lock+0.005
 unlock+0.005
 drive+0.005
Neg logit contributions
arre-0.005
onel-0.005
der-0.005
Acc-0.005
 ow-0.005
Token and
Token ID290
Feature activation-1.79e-01

Pos logit contributions
arre+0.031
 ow+0.030
 mosquitoes+0.029
 wil+0.029
 suffering+0.029
Neg logit contributions
 guide-0.014
lock-0.014
 X-0.014
 Steve-0.014
 Mark-0.012
Token then
Token ID788
Feature activation-1.47e+00

Pos logit contributions
arre+0.006
 ow+0.006
 mosquitoes+0.005
 wil+0.005
 suffering+0.005
Neg logit contributions
 guide-0.003
lock-0.003
 X-0.003
 Steve-0.003
lot-0.003
Token rock
Token ID3881
Feature activation-8.42e-01

Pos logit contributions
arre+0.050
 ow+0.047
 mosquitoes+0.047
 wil+0.046
 allergies+0.045
Neg logit contributions
 guide-0.028
lock-0.027
 X-0.026
 Steve-0.026
 Mark-0.024
Token his
Token ID465
Feature activation+1.91e-01

Pos logit contributions
arre+0.031
 ow+0.029
ilt+0.028
 mosquitoes+0.028
 wil+0.028
Neg logit contributions
 guide-0.014
lock-0.014
 X-0.013
 Steve-0.012
 steer-0.012
Token baby
Token ID5156
Feature activation-4.47e+00

Pos logit contributions
 guide+0.004
 X+0.004
lock+0.003
 drive+0.003
 chest+0.003
Neg logit contributions
arre-0.007
ilt-0.006
ins-0.006
 mosquitoes-0.006
pill-0.006
Token sister
Token ID6621
Feature activation-3.65e+00

Pos logit contributions
arre+0.146
ilt+0.136
 mosquitoes+0.133
Acc+0.130
der+0.127
Neg logit contributions
 guide-0.095
lock-0.092
 Mark-0.084
 X-0.082
 drive-0.080
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
arre+0.135
 mosquitoes+0.134
 ow+0.131
ilt+0.129
ivery+0.126
Neg logit contributions
lock-0.057
 guide-0.056
 Steve-0.049
 Mark-0.048
 X-0.047
Token sleep
Token ID3993
Feature activation-1.67e+00

Pos logit contributions
arre+0.062
 ow+0.058
 mosquitoes+0.057
 wil+0.057
onel+0.056
Neg logit contributions
 guide-0.032
lock-0.031
 X-0.030
 Steve-0.030
 Mark-0.027
Token so
Token ID523
Feature activation-2.00e+00

Pos logit contributions
arre+0.062
pill+0.060
 mosquitoes+0.059
 ow+0.059
 wil+0.057
Neg logit contributions
 guide-0.026
lock-0.024
 X-0.021
 Steve-0.021
 Mark-0.020
Token his
Token ID465
Feature activation-2.09e-02

Pos logit contributions
arre+0.067
 ow+0.063
 mosquitoes+0.062
 wil+0.062
 allergies+0.061
Neg logit contributions
 guide-0.038
lock-0.037
 X-0.036
 Steve-0.035
 Mark-0.033
Token Lily
Token ID20037
Feature activation+3.84e-01

Pos logit contributions
arre+0.008
onel+0.008
 wil+0.008
 ow+0.007
ilt+0.007
Neg logit contributions
 guide-0.005
 Steve-0.005
 X-0.005
lock-0.005
 Mark-0.005
Token and
Token ID290
Feature activation+2.56e-01

Pos logit contributions
lock+0.005
 guide+0.005
 X+0.005
 Steve+0.005
 Mark+0.005
Neg logit contributions
arre-0.014
 ow-0.014
onel-0.014
 mosquitoes-0.014
ivery-0.013
Token Mr
Token ID1770
Feature activation-1.08e+00

Pos logit contributions
 guide+0.006
 Steve+0.006
 X+0.006
lock+0.006
 Mark+0.006
Neg logit contributions
arre-0.008
 wil-0.007
onel-0.007
 dying-0.007
 mosquitoes-0.007
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
 ow+0.030
arre+0.030
 mosquitoes+0.028
 suffering+0.028
 wil+0.028
Neg logit contributions
lock-0.026
 guide-0.026
 X-0.026
 Steve-0.026
 Kyle-0.023
Token Sp
Token ID1338
Feature activation-1.70e+00

Pos logit contributions
arre+0.051
 wil+0.048
ilt+0.048
 ow+0.047
 mosquitoes+0.047
Neg logit contributions
 X-0.039
 guide-0.038
lock-0.038
 Mark-0.037
 Steve-0.037
Tokenud
Token ID463
Feature activation-4.47e+00

Pos logit contributions
 wil+0.042
 dying+0.042
 suffering+0.040
 disl+0.038
 ow+0.038
Neg logit contributions
lock-0.039
ris-0.038
 Mark-0.038
 Steve-0.037
lot-0.036
Token went
Token ID1816
Feature activation-8.63e-01

Pos logit contributions
arre+0.149
Acc+0.146
 ow+0.144
 mosquitoes+0.144
ilt+0.143
Neg logit contributions
lock-0.078
 guide-0.074
 Steve-0.070
 Mark-0.069
 X-0.066
Token to
Token ID284
Feature activation-6.63e-01

Pos logit contributions
arre+0.026
 mosquitoes+0.024
 ow+0.024
 dying+0.024
 suffering+0.024
Neg logit contributions
lock-0.019
 X-0.019
 guide-0.019
 Steve-0.018
 Mark-0.017
Token the
Token ID262
Feature activation-3.49e-01

Pos logit contributions
arre+0.024
 ow+0.023
 wil+0.022
 mosquitoes+0.022
onel+0.022
Neg logit contributions
 guide-0.011
 Steve-0.011
 X-0.011
lock-0.011
 Mark-0.010
Token park
Token ID3952
Feature activation+1.47e+00

Pos logit contributions
arre+0.010
onel+0.009
ins+0.009
 wil+0.009
ivery+0.009
Neg logit contributions
 guide-0.009
 X-0.008
lock-0.008
 Steve-0.008
 map-0.008
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
 X+0.034
lock+0.034
 guide+0.033
 Steve+0.033
lynn+0.032
Neg logit contributions
arre-0.040
 wil-0.036
 suffering-0.036
 mosquitoes-0.036
 dying-0.035
Token apple
Token ID17180
Feature activation+1.25e-02

Pos logit contributions
arre+0.029
onel+0.025
 wil+0.025
 ow+0.025
Acc+0.025
Neg logit contributions
lock-0.028
 guide-0.028
 X-0.028
 Steve-0.027
 Mark-0.025
Token,
Token ID11
Feature activation-5.98e-01

Pos logit contributions
 guide+0.000
lock+0.000
 X+0.000
 Steve+0.000
lot+0.000
Neg logit contributions
arre-0.000
 ow-0.000
 mosquitoes-0.000
 wil-0.000
 suffering-0.000
Token Tom
Token ID4186
Feature activation+2.39e-01

Pos logit contributions
arre+0.021
 ow+0.020
 mosquitoes+0.020
 wil+0.019
 suffering+0.019
Neg logit contributions
lock-0.010
 guide-0.010
 X-0.010
 Steve-0.009
lot-0.008
Token and
Token ID290
Feature activation-9.57e-01

Pos logit contributions
 guide+0.003
lock+0.003
 X+0.003
 Steve+0.003
 Mark+0.003
Neg logit contributions
arre-0.009
 ow-0.009
 mosquitoes-0.009
 wil-0.008
onel-0.008
Token the
Token ID262
Feature activation-7.53e-01

Pos logit contributions
arre+0.032
 ow+0.031
 mosquitoes+0.030
 wil+0.030
 allergies+0.029
Neg logit contributions
 guide-0.018
lock-0.018
 X-0.017
 Steve-0.017
 Mark-0.016
Token mosquito
Token ID35446
Feature activation-4.46e+00

Pos logit contributions
arre+0.020
onel+0.017
ivery+0.017
ins+0.017
ilt+0.017
Neg logit contributions
 guide-0.021
 X-0.020
lock-0.019
 Steve-0.019
 point-0.018
Token played
Token ID2826
Feature activation-1.21e+00

Pos logit contributions
arre+0.164
Acc+0.149
 mosquitoes+0.148
 disl+0.147
ivery+0.147
Neg logit contributions
 guide-0.083
lock-0.068
 Steve-0.066
 X-0.065
 Mark-0.062
Token together
Token ID1978
Feature activation+2.67e-02

Pos logit contributions
arre+0.036
 ow+0.034
 mosquitoes+0.033
 wil+0.033
 suffering+0.032
Neg logit contributions
 guide-0.027
lock-0.027
 X-0.026
 Steve-0.026
 Mark-0.024
Token in
Token ID287
Feature activation-3.45e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.000
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 wil-0.001
 allergies-0.001
Token the
Token ID262
Feature activation-9.96e-01

Pos logit contributions
arre+0.011
 ow+0.011
 mosquitoes+0.011
 wil+0.011
 suffering+0.011
Neg logit contributions
lock-0.007
 guide-0.006
 X-0.006
 Steve-0.006
 Mark-0.006
Token rain
Token ID6290
Feature activation-1.28e+00

Pos logit contributions
arre+0.029
ins+0.028
onel+0.028
der+0.028
 allergies+0.026
Neg logit contributions
 guide-0.023
 X-0.022
 point-0.021
 Steve-0.020
lock-0.020
Token day
Token ID1110
Feature activation+7.68e-01

Pos logit contributions
 guide+0.009
lock+0.009
 X+0.008
 Steve+0.008
lot+0.008
Neg logit contributions
arre-0.007
 ow-0.006
 mosquitoes-0.006
 wil-0.006
 suffering-0.006
Token,
Token ID11
Feature activation-3.32e-01

Pos logit contributions
 guide+0.014
lock+0.013
 X+0.013
 Steve+0.012
lot+0.011
Neg logit contributions
arre-0.027
 ow-0.025
 mosquitoes-0.025
 allergies-0.024
 suffering-0.024
Token Jimmy
Token ID12963
Feature activation+1.20e-01

Pos logit contributions
arre+0.010
 ow+0.010
 mosquitoes+0.009
 wil+0.009
 suffering+0.009
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.007
 Mark-0.006
Token and
Token ID290
Feature activation+1.74e-01

Pos logit contributions
 guide+0.003
lock+0.003
 Steve+0.003
 X+0.002
 Mark+0.002
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.003
 wil-0.003
onel-0.003
Token his
Token ID465
Feature activation-1.35e-01

Pos logit contributions
lock+0.004
 guide+0.004
 X+0.004
 Steve+0.004
lot+0.004
Neg logit contributions
arre-0.005
 ow-0.004
 mosquitoes-0.004
 suffering-0.004
 allergies-0.004
Token cricket
Token ID18836
Feature activation-4.45e+00

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.003
 allergies+0.003
 wil+0.003
Neg logit contributions
lock-0.004
 guide-0.004
 X-0.004
 Steve-0.004
 Mark-0.003
Token went
Token ID1816
Feature activation-1.44e+00

Pos logit contributions
arre+0.143
 ow+0.139
Acc+0.138
 mosquitoes+0.137
ilt+0.137
Neg logit contributions
 guide-0.090
 Steve-0.089
lock-0.084
 X-0.081
 Mark-0.078
Token to
Token ID284
Feature activation-9.11e-01

Pos logit contributions
arre+0.038
 mosquitoes+0.034
 suffering+0.034
 dying+0.034
 allergies+0.033
Neg logit contributions
lock-0.038
 X-0.036
 guide-0.034
 Steve-0.033
lot-0.032
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
arre+0.029
 ow+0.027
 mosquitoes+0.027
 suffering+0.027
 allergies+0.026
Neg logit contributions
lock-0.018
 guide-0.018
 X-0.017
 Steve-0.016
lot-0.015
Token park
Token ID3952
Feature activation+8.16e-01

Pos logit contributions
arre+0.009
 ow+0.008
 wil+0.008
onel+0.008
ins+0.008
Neg logit contributions
 guide-0.009
 X-0.009
 Steve-0.009
lock-0.009
 Kyle-0.008
Token to
Token ID284
Feature activation-1.33e+00

Pos logit contributions
lock+0.019
 X+0.018
 guide+0.018
 Steve+0.017
lynn+0.017
Neg logit contributions
arre-0.023
 mosquitoes-0.021
 suffering-0.021
 wil-0.020
 allergies-0.020
Token dinner
Token ID8073
Feature activation-8.69e-01

Pos logit contributions
arre+0.016
 wil+0.015
Acc+0.015
 ow+0.015
onel+0.015
Neg logit contributions
 guide-0.018
 X-0.018
lock-0.018
 Steve-0.017
 Mark-0.016
Token for
Token ID329
Feature activation-7.77e-01

Pos logit contributions
arre+0.030
 ow+0.028
 wil+0.027
 mosquitoes+0.027
Acc+0.027
Neg logit contributions
 guide-0.016
lock-0.016
 X-0.015
 Steve-0.015
 Mark-0.014
Token her
Token ID607
Feature activation-7.63e-01

Pos logit contributions
arre+0.028
Acc+0.024
vious+0.024
 wil+0.024
ivery+0.023
Neg logit contributions
 guide-0.016
 Steve-0.016
 Mark-0.015
 X-0.014
lock-0.014
Token and
Token ID290
Feature activation+3.25e-01

Pos logit contributions
arre+0.027
ilt+0.024
ivery+0.024
 wil+0.023
Acc+0.023
Neg logit contributions
 guide-0.016
lock-0.015
 X-0.014
 Steve-0.014
 Mark-0.014
Token her
Token ID607
Feature activation-3.68e-01

Pos logit contributions
 guide+0.006
 Steve+0.006
lock+0.006
 X+0.006
 Mark+0.006
Neg logit contributions
arre-0.012
ivery-0.011
ilt-0.011
ivering-0.010
onel-0.010
Token baby
Token ID5156
Feature activation-4.44e+00

Pos logit contributions
arre+0.013
ilt+0.012
ivery+0.012
Acc+0.012
ivering+0.012
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.006
 Mark-0.006
Token bear
Token ID6842
Feature activation-2.11e+00

Pos logit contributions
arre+0.148
ilt+0.136
Acc+0.134
 mosquitoes+0.133
 wil+0.132
Neg logit contributions
 guide-0.092
lock-0.086
 Steve-0.085
 Mark-0.083
 X-0.082
Token.
Token ID13
Feature activation-3.50e-01

Pos logit contributions
arre+0.079
 ow+0.074
 mosquitoes+0.074
Acc+0.074
ilt+0.073
Neg logit contributions
 guide-0.036
lock-0.034
 Steve-0.031
 X-0.029
 Mark-0.029
Token So
Token ID1406
Feature activation+3.14e-02

Pos logit contributions
arre+0.014
 wil+0.013
ilt+0.013
 ow+0.013
 mosquitoes+0.013
Neg logit contributions
 Steve-0.005
 Mark-0.005
lock-0.005
 guide-0.005
 Ray-0.004
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.000
 Steve+0.000
 Mark+0.000
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 wil-0.001
 suffering-0.001
Token she
Token ID673
Feature activation-9.81e-01

Pos logit contributions
arre+0.029
 ow+0.026
ilt+0.026
 wil+0.026
onel+0.026
Neg logit contributions
 guide-0.019
 Steve-0.018
lock-0.018
 Mark-0.017
 X-0.016
Token,
Token ID11
Feature activation-3.01e-01

Pos logit contributions
lock+0.003
 X+0.003
 guide+0.003
 Steve+0.003
lot+0.003
Neg logit contributions
arre-0.007
 ow-0.007
 mosquitoes-0.006
 suffering-0.006
 dying-0.006
Token Bob
Token ID5811
Feature activation+5.20e-01

Pos logit contributions
arre+0.011
 ow+0.010
 mosquitoes+0.010
 wil+0.010
 suffering+0.010
Neg logit contributions
lock-0.005
 guide-0.005
 X-0.005
 Steve-0.005
 Mark-0.004
Token and
Token ID290
Feature activation-4.92e-01

Pos logit contributions
 guide+0.009
lock+0.008
 Steve+0.008
 X+0.008
 Mark+0.007
Neg logit contributions
arre-0.019
 ow-0.018
 mosquitoes-0.018
 wil-0.017
onel-0.017
Token his
Token ID465
Feature activation-1.46e-01

Pos logit contributions
arre+0.015
 mosquitoes+0.014
 ow+0.014
 suffering+0.014
 wil+0.013
Neg logit contributions
lock-0.011
 guide-0.010
 X-0.010
 Steve-0.009
lot-0.009
Token nos
Token ID43630
Feature activation-2.10e+00

Pos logit contributions
arre+0.004
 ow+0.004
 wil+0.004
 mosquitoes+0.004
onel+0.004
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.004
 Steve-0.004
 Mark-0.003
Tokeny
Token ID88
Feature activation-4.43e+00

Pos logit contributions
arre+0.049
 ow+0.046
 mosquitoes+0.044
 wil+0.044
 dying+0.043
Neg logit contributions
 guide-0.060
lock-0.060
 X-0.059
 Steve-0.058
 Mark-0.055
Token neighbor
Token ID4780
Feature activation-2.28e+00

Pos logit contributions
arre+0.137
ilt+0.135
Acc+0.135
onel+0.132
pill+0.130
Neg logit contributions
 guide-0.097
 Steve-0.084
lock-0.082
 X-0.082
 Mark-0.079
Token became
Token ID2627
Feature activation-1.41e+00

Pos logit contributions
arre+0.069
 ow+0.065
 mosquitoes+0.064
 wil+0.064
 suffering+0.063
Neg logit contributions
lock-0.049
 guide-0.049
 X-0.048
 Steve-0.047
 Mark-0.043
Token good
Token ID922
Feature activation-1.18e+00

Pos logit contributions
arre+0.031
 mosquitoes+0.030
 ow+0.028
 suffering+0.027
 allergies+0.027
Neg logit contributions
lock-0.044
 X-0.042
 guide-0.041
 Steve-0.040
lynn-0.038
Token friends
Token ID2460
Feature activation+3.36e-01

Pos logit contributions
arre+0.032
 ow+0.030
 wil+0.030
 mosquitoes+0.029
 dying+0.029
Neg logit contributions
 guide-0.030
lock-0.030
 X-0.029
 Steve-0.029
 Mark-0.027
Token and
Token ID290
Feature activation-9.32e-01

Pos logit contributions
lock+0.007
 X+0.007
 guide+0.007
 Steve+0.007
 steer+0.007
Neg logit contributions
arre-0.010
 ow-0.009
 mosquitoes-0.009
 suffering-0.009
der-0.009
Token to
Token ID284
Feature activation-5.17e-01

Pos logit contributions
 guide+0.003
lock+0.003
 Steve+0.003
 X+0.003
 Mark+0.003
Neg logit contributions
arre-0.008
 wil-0.007
ilt-0.007
 ow-0.007
onel-0.007
Token their
Token ID511
Feature activation-7.88e-01

Pos logit contributions
arre+0.018
 ow+0.016
 mosquitoes+0.016
 wil+0.016
 allergies+0.016
Neg logit contributions
 guide-0.010
lock-0.009
 X-0.009
 Steve-0.009
 Mark-0.008
Token mom
Token ID1995
Feature activation+5.03e-02

Pos logit contributions
arre+0.031
pill+0.029
ilt+0.029
ivery+0.028
Acc+0.028
Neg logit contributions
 guide-0.013
 Mark-0.011
 X-0.011
 keeper-0.010
 guard-0.009
Token and
Token ID290
Feature activation+2.88e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 mosquitoes-0.002
 wil-0.002
 suffering-0.002
Token their
Token ID511
Feature activation-9.86e-01

Pos logit contributions
 guide+0.005
lock+0.005
 X+0.005
 Steve+0.005
 Mark+0.005
Neg logit contributions
arre-0.010
 ow-0.009
 mosquitoes-0.009
 wil-0.009
ilt-0.009
Token baby
Token ID5156
Feature activation-4.40e+00

Pos logit contributions
arre+0.039
ilt+0.037
ivery+0.036
ivering+0.036
pill+0.036
Neg logit contributions
 guide-0.016
 X-0.014
 Mark-0.013
 keeper-0.012
 map-0.012
Token brother
Token ID3956
Feature activation-2.13e+00

Pos logit contributions
arre+0.153
ilt+0.147
Acc+0.138
 wil+0.138
 mosquitoes+0.137
Neg logit contributions
 guide-0.084
 Mark-0.076
 X-0.072
 Steve-0.072
lock-0.072
Token.
Token ID13
Feature activation+7.83e-01

Pos logit contributions
arre+0.079
ilt+0.077
 ow+0.077
 mosquitoes+0.076
 wil+0.076
Neg logit contributions
 guide-0.032
 X-0.028
lock-0.028
 Steve-0.028
 Mark-0.027
Token They
Token ID1119
Feature activation+6.40e-01

Pos logit contributions
 guide+0.013
lock+0.013
 X+0.012
 Steve+0.012
 Mark+0.011
Neg logit contributions
arre-0.028
 ow-0.027
 mosquitoes-0.026
 wil-0.026
onel-0.026
Token gave
Token ID2921
Feature activation+5.89e-01

Pos logit contributions
 guide+0.014
lock+0.013
 X+0.013
 Steve+0.013
 Mark+0.012
Neg logit contributions
arre-0.020
 ow-0.019
 mosquitoes-0.019
 wil-0.018
 allergies-0.018
Token him
Token ID683
Feature activation+1.51e+00

Pos logit contributions
 guide+0.015
lock+0.015
 Steve+0.015
 X+0.015
 Mark+0.014
Neg logit contributions
arre-0.016
 ow-0.015
 wil-0.014
 mosquitoes-0.014
 dying-0.014
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
arre+0.019
 ow+0.018
 mosquitoes+0.017
 wil+0.017
ivery+0.017
Neg logit contributions
 guide-0.006
lock-0.006
 Steve-0.005
 X-0.005
 Mark-0.004
Token peaceful
Token ID12309
Feature activation-1.95e+00

Pos logit contributions
arre+0.033
 ow+0.031
 mosquitoes+0.031
 wil+0.030
 allergies+0.030
Neg logit contributions
 guide-0.028
lock-0.027
 X-0.027
 Steve-0.026
 Mark-0.024
Token under
Token ID739
Feature activation-3.25e+00

Pos logit contributions
arre+0.077
ivery+0.071
 mosquitoes+0.070
 ow+0.070
Acc+0.069
Neg logit contributions
 guide-0.028
lock-0.024
 X-0.024
 Steve-0.023
 point-0.021
Token her
Token ID607
Feature activation-1.50e+00

Pos logit contributions
arre+0.157
ivery+0.143
ivering+0.141
ins+0.140
onel+0.133
Neg logit contributions
 guide-0.028
 Steve-0.027
 Mark-0.026
 point-0.024
 X-0.024
Token watch
Token ID2342
Feature activation-2.12e+00

Pos logit contributions
arre+0.049
ins+0.042
 mosquitoes+0.041
ivery+0.040
Acc+0.040
Neg logit contributions
 guide-0.036
lock-0.033
 X-0.033
 Steve-0.032
 map-0.030
Tokenful
Token ID913
Feature activation-4.37e+00

Pos logit contributions
arre+0.082
 mosquitoes+0.074
 ow+0.073
Acc+0.073
 disl+0.072
Neg logit contributions
 guide-0.033
lock-0.032
 X-0.029
 Steve-0.029
 Mark-0.028
Token eye
Token ID4151
Feature activation-2.11e+00

Pos logit contributions
arre+0.155
Acc+0.141
 mosquitoes+0.134
ivery+0.132
 ow+0.130
Neg logit contributions
 guide-0.093
lock-0.081
 X-0.077
 point-0.072
 Steve-0.069
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
arre+0.077
Acc+0.069
 ow+0.069
 mosquitoes+0.068
ivery+0.068
Neg logit contributions
 guide-0.039
lock-0.036
 Steve-0.033
 X-0.033
 Mark-0.030
Token<|endoftext|>
Token ID50256
Feature activation+4.61e-02

Pos logit contributions
 guide+0.005
 Steve+0.005
lock+0.005
 X+0.005
 Mark+0.005
Neg logit contributions
arre-0.011
 ow-0.010
 mosquitoes-0.010
 wil-0.010
 dying-0.010
TokenOnce
Token ID7454
Feature activation-5.70e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 steer+0.001
Neg logit contributions
arre-0.001
 ow-0.001
 mosquitoes-0.001
 wil-0.001
 suffering-0.001
Token upon
Token ID2402
Feature activation-8.03e-01

Pos logit contributions
arre+0.015
 ow+0.014
 mosquitoes+0.014
 wil+0.014
 suffering+0.013
Neg logit contributions
 guide-0.015
lock-0.015
 X-0.014
 Steve-0.014
 Mark-0.013
Token crown
Token ID12389
Feature activation-3.85e-01

Pos logit contributions
arre+0.061
 mosquitoes+0.055
 ow+0.055
Acc+0.053
 allergies+0.053
Neg logit contributions
 guide-0.047
lock-0.045
 X-0.043
 Steve-0.042
 Mark-0.039
Token and
Token ID290
Feature activation-2.32e-01

Pos logit contributions
arre+0.011
 ow+0.011
 wil+0.011
onel+0.010
 suffering+0.010
Neg logit contributions
 guide-0.008
 X-0.008
 Steve-0.008
lock-0.008
 Mark-0.008
Token playing
Token ID2712
Feature activation-1.32e+00

Pos logit contributions
arre+0.007
ivery+0.007
 ow+0.007
ivering+0.006
ilt+0.006
Neg logit contributions
 guide-0.005
 Steve-0.005
 X-0.005
lock-0.005
 Mark-0.005
Token in
Token ID287
Feature activation-1.98e-01

Pos logit contributions
arre+0.056
 ow+0.053
 allergies+0.052
 wil+0.052
 suffering+0.052
Neg logit contributions
 guide-0.014
lock-0.013
 X-0.013
 Steve-0.012
 Mark-0.010
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
arre+0.008
onel+0.007
 allergies+0.007
ivery+0.007
 ow+0.007
Neg logit contributions
 X-0.003
 guide-0.003
 Steve-0.003
 Mark-0.002
lock-0.002
Token freshly
Token ID29026
Feature activation-4.37e+00

Pos logit contributions
arre+0.031
onel+0.028
ins+0.028
der+0.027
 allergies+0.027
Neg logit contributions
 guide-0.024
 X-0.024
lock-0.023
 point-0.022
 Steve-0.021
Token cut
Token ID2005
Feature activation-7.39e-01

Pos logit contributions
arre+0.130
 mosquitoes+0.119
onel+0.118
 ow+0.115
ivery+0.114
Neg logit contributions
lock-0.104
 guide-0.102
 X-0.100
 Steve-0.092
 point-0.089
Token grass
Token ID8701
Feature activation+4.97e-01

Pos logit contributions
arre+0.025
 ow+0.022
Acc+0.022
 mosquitoes+0.022
 dying+0.022
Neg logit contributions
 guide-0.015
lock-0.015
 X-0.014
 Steve-0.013
 Mark-0.012
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 guide+0.008
lock+0.008
 X+0.008
 Steve+0.007
 Mark+0.007
Neg logit contributions
arre-0.018
 ow-0.017
Acc-0.017
 mosquitoes-0.016
 allergies-0.016
Token It
Token ID632
Feature activation+1.56e-01

Pos logit contributions
lock+0.003
 Steve+0.003
 guide+0.003
 X+0.003
 Mark+0.003
Neg logit contributions
 mosquitoes-0.006
arre-0.006
 ow-0.006
 wil-0.006
 allergies-0.006
Token looked
Token ID3114
Feature activation-6.91e-01

Pos logit contributions
 guide+0.003
lock+0.002
 Steve+0.002
 X+0.002
 Mark+0.002
Neg logit contributions
arre-0.005
 mosquitoes-0.005
 ow-0.005
 allergies-0.005
 suffering-0.005
Token and
Token ID290
Feature activation-6.11e-01

Pos logit contributions
pill+0.018
 wil+0.017
arre+0.017
 mosquitoes+0.017
onel+0.017
Neg logit contributions
 guide-0.007
lock-0.006
 Mark-0.006
 steer-0.006
 point-0.006
Token your
Token ID534
Feature activation-1.06e+00

Pos logit contributions
arre+0.021
 ow+0.019
ilt+0.019
onel+0.019
ivery+0.019
Neg logit contributions
 guide-0.011
 Steve-0.011
lock-0.010
 X-0.010
 Mark-0.010
Token big
Token ID1263
Feature activation-2.19e+00

Pos logit contributions
arre+0.037
Acc+0.034
onel+0.034
ilt+0.034
pill+0.033
Neg logit contributions
 guide-0.019
 point-0.017
 bow-0.017
 pedal-0.016
 smile-0.016
Token sne
Token ID10505
Feature activation-2.68e+00

Pos logit contributions
Acc+0.070
arre+0.069
onel+0.069
ilt+0.065
 dying+0.065
Neg logit contributions
 guide-0.043
 X-0.040
 point-0.040
 pedal-0.038
 Steve-0.038
Tokene
Token ID68
Feature activation-2.66e+00

Pos logit contributions
arre+0.064
 dying+0.063
 wil+0.062
 ow+0.062
 mosquitoes+0.061
Neg logit contributions
 guide-0.067
lock-0.066
 X-0.065
 Steve-0.064
 Mark-0.063
Tokenze
Token ID2736
Feature activation-4.35e+00

Pos logit contributions
arre+0.057
Acc+0.055
pill+0.053
 dying+0.051
onel+0.049
Neg logit contributions
 guide-0.076
 driver-0.074
 point-0.070
 steer-0.070
 drive-0.069
Token."
Token ID526
Feature activation+9.10e-01

Pos logit contributions
arre+0.163
pill+0.161
Acc+0.161
 wil+0.156
 disl+0.151
Neg logit contributions
 guide-0.061
 Mark-0.054
,-0.053
 driver-0.052
lock-0.050
Token They
Token ID1119
Feature activation+2.21e-01

Pos logit contributions
 Steve+0.015
 guide+0.015
lock+0.015
 X+0.015
 Mark+0.014
Neg logit contributions
arre-0.032
 wil-0.031
 ow-0.031
 mosquitoes-0.031
 dying-0.030
Token all
Token ID477
Feature activation-7.03e-01

Pos logit contributions
 guide+0.006
lock+0.005
 X+0.005
 Steve+0.005
 Mark+0.005
Neg logit contributions
arre-0.006
 ow-0.006
 mosquitoes-0.006
 wil-0.005
 suffering-0.005
Token hugged
Token ID41130
Feature activation+8.77e-01

Pos logit contributions
arre+0.023
 mosquitoes+0.022
 ow+0.022
 allergies+0.021
 dying+0.021
Neg logit contributions
 guide-0.013
 X-0.012
 Steve-0.012
lock-0.012
 Mark-0.012
Token Tim
Token ID5045
Feature activation+1.42e+00

Pos logit contributions
lock+0.020
 guide+0.020
lot+0.018
 X+0.018
 Steve+0.017
Neg logit contributions
arre-0.027
 ow-0.026
 mosquitoes-0.025
 suffering-0.024
Acc-0.024
Token hear
Token ID3285
Feature activation-2.18e-01

Pos logit contributions
 Steve+0.025
lock+0.024
 X+0.024
lot+0.023
 guide+0.023
Neg logit contributions
 ow-0.043
arre-0.042
 suffering-0.040
 itching-0.040
ivery-0.040
Token that
Token ID326
Feature activation-1.03e-01

Pos logit contributions
arre+0.007
ivery+0.007
 ow+0.006
onel+0.006
Acc+0.006
Neg logit contributions
 guide-0.005
 Steve-0.005
 Mark-0.004
 X-0.004
 Kyle-0.004
Token,
Token ID11
Feature activation-3.81e-01

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.003
 wil+0.003
onel+0.003
Neg logit contributions
 guide-0.002
lock-0.002
 Steve-0.002
 X-0.002
 Mark-0.002
Token and
Token ID290
Feature activation+3.39e-01

Pos logit contributions
arre+0.015
 ow+0.014
ilt+0.014
 mosquitoes+0.014
onel+0.014
Neg logit contributions
 guide-0.005
 Steve-0.005
lock-0.005
 X-0.005
 Mark-0.004
Token her
Token ID607
Feature activation+1.06e-01

Pos logit contributions
 guide+0.008
 Steve+0.008
lock+0.007
 Mark+0.007
 X+0.007
Neg logit contributions
arre-0.010
 ow-0.010
 mosquitoes-0.010
ilt-0.010
ivery-0.009
Token toe
Token ID21189
Feature activation-4.34e+00

Pos logit contributions
 guide+0.001
 X+0.001
lock+0.001
 Steve+0.001
 point+0.001
Neg logit contributions
arre-0.004
pill-0.004
Acc-0.004
ilt-0.004
ivery-0.004
Token stopped
Token ID5025
Feature activation-4.53e-01

Pos logit contributions
arre+0.147
 mosquitoes+0.139
 ow+0.138
Acc+0.133
 dying+0.133
Neg logit contributions
 guide-0.083
lock-0.080
 X-0.075
 Steve-0.075
 Mark-0.069
Token hurting
Token ID21530
Feature activation-8.22e-01

Pos logit contributions
arre+0.013
ilt+0.012
 ow+0.012
 mosquitoes+0.012
onel+0.012
Neg logit contributions
 guide-0.011
 Steve-0.011
 X-0.010
lock-0.010
 Mark-0.010
Token once
Token ID1752
Feature activation+6.31e-02

Pos logit contributions
arre+0.023
 ow+0.021
 mosquitoes+0.021
 wil+0.021
 dying+0.020
Neg logit contributions
 guide-0.020
lock-0.020
 X-0.019
 Steve-0.019
 Mark-0.018
Token she
Token ID673
Feature activation-2.05e+00

Pos logit contributions
lock+0.002
 guide+0.002
 Steve+0.002
 X+0.002
lot+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 wil-0.002
 mosquitoes-0.002
 suffering-0.002
Token had
Token ID550
Feature activation-8.22e-01

Pos logit contributions
arre+0.063
 ow+0.060
 mosquitoes+0.059
 wil+0.058
 dying+0.057
Neg logit contributions
 guide-0.044
lock-0.044
 X-0.043
 Steve-0.042
 Mark-0.038
Token when
Token ID618
Feature activation-2.61e-01

Pos logit contributions
 guide+0.029
lock+0.029
 X+0.028
 Steve+0.028
 Mark+0.026
Neg logit contributions
arre-0.046
 ow-0.044
 mosquitoes-0.043
 wil-0.042
 allergies-0.041
Token she
Token ID673
Feature activation-1.69e+00

Pos logit contributions
arre+0.006
 ow+0.005
 mosquitoes+0.005
 wil+0.005
 suffering+0.005
Neg logit contributions
 guide-0.008
lock-0.008
 X-0.008
 Steve-0.007
lot-0.007
Token stub
Token ID17071
Feature activation-4.18e+00

Pos logit contributions
arre+0.053
 mosquitoes+0.049
 ow+0.049
 wil+0.048
onel+0.047
Neg logit contributions
 guide-0.036
lock-0.036
 X-0.034
 Steve-0.034
 Mark-0.032
Tokenbed
Token ID3077
Feature activation-1.62e+00

Pos logit contributions
arre+0.111
 wil+0.102
 ow+0.100
 dying+0.098
onel+0.098
Neg logit contributions
lock-0.108
 Steve-0.105
 guide-0.104
 X-0.103
 Mark-0.099
Token her
Token ID607
Feature activation-8.51e-01

Pos logit contributions
arre+0.044
pill+0.037
onel+0.037
Acc+0.037
 wil+0.037
Neg logit contributions
 Steve-0.042
lock-0.041
 Mark-0.041
 guide-0.041
 X-0.040
Token toe
Token ID21189
Feature activation-4.32e+00

Pos logit contributions
arre+0.022
 ow+0.021
 mosquitoes+0.020
 wil+0.020
 suffering+0.020
Neg logit contributions
 guide-0.022
lock-0.022
 X-0.022
 Steve-0.022
 Mark-0.020
Token on
Token ID319
Feature activation-5.28e-01

Pos logit contributions
arre+0.175
 disl+0.174
pill+0.167
Acc+0.159
ins+0.158
Neg logit contributions
lock-0.063
 point-0.048
 close-0.045
 guide-0.044
 pedal-0.044
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
arre+0.019
onel+0.017
ins+0.017
 wil+0.017
 ow+0.017
Neg logit contributions
 guide-0.009
 X-0.009
lock-0.009
 Steve-0.009
 Mark-0.008
Token rock
Token ID3881
Feature activation-8.52e-01

Pos logit contributions
arre+0.040
 ow+0.036
 mosquitoes+0.035
 wil+0.035
onel+0.035
Neg logit contributions
 guide-0.023
lock-0.023
 X-0.022
 Steve-0.021
 Mark-0.020
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
arre+0.037
 disl+0.034
ins+0.034
pill+0.033
 mosquitoes+0.033
Neg logit contributions
lock-0.011
 guide-0.009
 point-0.009
 X-0.008
 close-0.008
Token It
Token ID632
Feature activation+2.06e-01

Pos logit contributions
lock+0.000
 guide+0.000
 Steve+0.000
 X+0.000
 Mark+0.000
Neg logit contributions
arre-0.000
 ow-0.000
 wil-0.000
 mosquitoes-0.000
 dying-0.000
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
 guide+0.002
lock+0.002
 X+0.002
 Steve+0.002
 Mark+0.001
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.004
 wil-0.004
 suffering-0.004
Token Tim
Token ID5045
Feature activation-8.94e-01

Pos logit contributions
arre+0.040
ins+0.037
pill+0.037
ilt+0.036
onel+0.036
Neg logit contributions
 Steve-0.028
 Mark-0.027
 guide-0.027
 X-0.025
lock-0.025
Tokenmy
Token ID1820
Feature activation-1.65e+00

Pos logit contributions
arre+0.034
 mosquitoes+0.033
 ow+0.033
 wil+0.032
 dying+0.032
Neg logit contributions
 Steve-0.011
 guide-0.011
lock-0.011
 Mark-0.011
 X-0.010
Token hurt
Token ID5938
Feature activation-1.69e+00

Pos logit contributions
arre+0.052
 mosquitoes+0.050
 ow+0.050
 wil+0.048
onel+0.048
Neg logit contributions
 guide-0.033
lock-0.032
 Steve-0.031
 X-0.030
 Mark-0.029
Token his
Token ID465
Feature activation-1.66e-01

Pos logit contributions
arre+0.056
 ow+0.052
 wil+0.052
 mosquitoes+0.052
ilt+0.051
Neg logit contributions
 guide-0.032
 Steve-0.032
lock-0.031
 X-0.030
 Mark-0.030
Token toe
Token ID21189
Feature activation-4.31e+00

Pos logit contributions
arre+0.005
 mosquitoes+0.004
 wil+0.004
Acc+0.004
pill+0.004
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.004
 Steve-0.004
 Mark-0.003
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
arre+0.178
pill+0.172
 disl+0.167
ilt+0.166
Acc+0.165
Neg logit contributions
lock-0.047
 guide-0.042
 Mark-0.036
 driver-0.033
 point-0.033
Token
Token ID198
Feature activation-6.90e-01

Pos logit contributions
arre+0.011
 ow+0.010
 mosquitoes+0.010
 wil+0.010
 suffering+0.010
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.006
 Steve-0.006
 steer-0.005
Token
Token ID198
Feature activation-8.87e-01

Pos logit contributions
arre+0.020
 ow+0.018
 mosquitoes+0.017
 wil+0.017
 suffering+0.017
Neg logit contributions
 X-0.016
 guide-0.016
lock-0.016
 Steve-0.015
 Kyle-0.014
TokenTheir
Token ID14574
Feature activation-3.36e-01

Pos logit contributions
arre+0.028
 ow+0.027
 mosquitoes+0.027
 wil+0.026
 allergies+0.026
Neg logit contributions
 guide-0.018
lock-0.018
 Steve-0.017
 X-0.017
 Mark-0.016
Token mom
Token ID1995
Feature activation-2.17e-01

Pos logit contributions
arre+0.014
ilt+0.014
pill+0.013
ins+0.013
Acc+0.013
Neg logit contributions
 guide-0.004
 Mark-0.003
 Steve-0.003
 X-0.003
lock-0.003
Token park
Token ID3952
Feature activation+7.65e-01

Pos logit contributions
onel+0.013
arre+0.013
ins+0.013
der+0.013
 wil+0.013
Neg logit contributions
 guide-0.010
 X-0.010
 Steve-0.010
lock-0.009
 Mark-0.009
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 guide+0.016
lock+0.015
 X+0.015
 Steve+0.015
lot+0.014
Neg logit contributions
arre-0.024
 ow-0.022
 mosquitoes-0.022
 suffering-0.022
 wil-0.022
Token He
Token ID679
Feature activation+1.36e+00

Pos logit contributions
arre+0.004
 ow+0.004
 raining+0.004
 miserable+0.004
 asleep+0.004
Neg logit contributions
 guide-0.005
lock-0.004
lynn-0.004
keeper-0.004
 keeper-0.004
Token saw
Token ID2497
Feature activation-1.08e-01

Pos logit contributions
 guide+0.034
lock+0.034
 X+0.033
 Steve+0.033
 Mark+0.030
Neg logit contributions
arre-0.036
 ow-0.035
 wil-0.034
 mosquitoes-0.033
 suffering-0.033
Token a
Token ID257
Feature activation+1.02e+00

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.003
 wil+0.003
 suffering+0.003
Neg logit contributions
 guide-0.002
lock-0.002
 X-0.002
 Steve-0.002
 Mark-0.002
Token tree
Token ID5509
Feature activation+2.08e+00

Pos logit contributions
 Steve+0.029
lock+0.028
 guide+0.027
 Mark+0.027
 Kyle+0.027
Neg logit contributions
 ow-0.024
arre-0.023
 mosquitoes-0.022
 wil-0.022
 suffering-0.022
Token and
Token ID290
Feature activation+7.08e-01

Pos logit contributions
 guide+0.039
 Steve+0.038
 X+0.037
lock+0.037
 Kyle+0.036
Neg logit contributions
arre-0.066
 wil-0.065
 ow-0.064
 suffering-0.063
 dying-0.062
Token wanted
Token ID2227
Feature activation+6.83e-01

Pos logit contributions
lock+0.017
 guide+0.017
 X+0.017
 Steve+0.016
 Mark+0.015
Neg logit contributions
arre-0.021
 ow-0.019
 mosquitoes-0.019
 wil-0.018
ilt-0.018
Token to
Token ID284
Feature activation+1.43e-01

Pos logit contributions
 guide+0.016
lock+0.016
 X+0.016
 Steve+0.015
 Kyle+0.014
Neg logit contributions
 mosquitoes-0.018
 ow-0.018
arre-0.018
 allergies-0.018
 suffering-0.017
Token climb
Token ID12080
Feature activation-1.90e-01

Pos logit contributions
 guide+0.003
 Steve+0.003
 X+0.003
lock+0.003
lot+0.002
Neg logit contributions
 ow-0.004
arre-0.004
 mosquitoes-0.004
 wil-0.004
 suffering-0.004
Token up
Token ID510
Feature activation+5.97e-01

Pos logit contributions
 mosquitoes+0.005
 ow+0.005
 wil+0.005
arre+0.005
 itching+0.005
Neg logit contributions
 guide-0.005
lock-0.005
 X-0.004
 Steve-0.004
lot-0.004
Token fine
Token ID3734
Feature activation-4.37e-01

Pos logit contributions
pill+0.021
arre+0.020
 mosquitoes+0.019
 ow+0.018
onel+0.017
Neg logit contributions
 guide-0.014
 Steve-0.013
lock-0.013
 unlock-0.012
 X-0.012
Token.
Token ID13
Feature activation+9.07e-02

Pos logit contributions
arre+0.017
 mosquitoes+0.015
 ow+0.015
ivery+0.015
 wil+0.015
Neg logit contributions
 guide-0.007
lock-0.006
 Steve-0.006
 X-0.006
 Mark-0.006
Token
Token ID198
Feature activation-6.60e-01

Pos logit contributions
lock+0.002
 guide+0.002
 X+0.002
 Steve+0.002
 Kyle+0.002
Neg logit contributions
arre-0.003
 ow-0.002
 wil-0.002
 suffering-0.002
 dying-0.002
Token
Token ID198
Feature activation-8.95e-01

Pos logit contributions
arre+0.019
 ow+0.018
 mosquitoes+0.018
 wil+0.017
 allergies+0.017
Neg logit contributions
 guide-0.016
lock-0.016
 Steve-0.015
 X-0.015
 Mark-0.014
TokenThe
Token ID464
Feature activation-1.98e-02

Pos logit contributions
arre+0.030
 ow+0.029
 mosquitoes+0.028
 wil+0.028
 allergies+0.028
Neg logit contributions
 guide-0.018
lock-0.017
 Steve-0.016
 X-0.016
 Mark-0.015
Token moral
Token ID6573
Feature activation+2.31e+00

Pos logit contributions
arre+0.001
 ow+0.001
 mosquitoes+0.000
 wil+0.000
 suffering+0.000
Neg logit contributions
 guide-0.000
lock-0.000
 X-0.000
 Steve-0.000
 Mark-0.000
Token of
Token ID286
Feature activation-4.63e-01

Pos logit contributions
 X+0.064
 Kyle+0.064
 Steve+0.062
acle+0.062
 Roy+0.060
Neg logit contributions
arre-0.049
 mosquitoes-0.047
 suffering-0.046
 mistakes-0.046
ained-0.045
Token this
Token ID428
Feature activation+1.15e-01

Pos logit contributions
arre+0.013
 ow+0.013
 mosquitoes+0.012
 suffering+0.012
 wil+0.012
Neg logit contributions
lock-0.011
 guide-0.011
 X-0.010
 Steve-0.010
lot-0.009
Token story
Token ID1621
Feature activation+6.24e-01

Pos logit contributions
lock+0.003
 guide+0.003
 X+0.003
 Steve+0.003
 steer+0.003
Neg logit contributions
arre-0.003
 ow-0.003
 suffering-0.003
 mosquitoes-0.003
 wil-0.003
Token is
Token ID318
Feature activation-4.13e-01

Pos logit contributions
lock+0.017
 X+0.017
 guide+0.017
 Steve+0.016
 steer+0.016
Neg logit contributions
arre-0.015
der-0.013
 ow-0.013
 wil-0.013
 dying-0.013
Token that
Token ID326
Feature activation-1.08e+00

Pos logit contributions
arre+0.010
 suffering+0.009
 mosquitoes+0.009
 ow+0.009
 wil+0.009
Neg logit contributions
 X-0.011
lock-0.011
 guide-0.011
 Steve-0.011
 Mark-0.010
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
arre+0.020
 suffering+0.018
 ow+0.018
 dying+0.018
 wil+0.018
Neg logit contributions
 guide-0.020
lock-0.020
 Steve-0.020
 X-0.020
 Kyle-0.018
Token They
Token ID1119
Feature activation+3.17e-01

Pos logit contributions
arre+0.029
 ow+0.027
 mosquitoes+0.027
 wil+0.026
 spilled+0.026
Neg logit contributions
lock-0.030
 guide-0.029
lot-0.028
 Steve-0.027
lynn-0.027
Token saw
Token ID2497
Feature activation-1.93e-01

Pos logit contributions
lock+0.010
 X+0.009
 guide+0.009
 Steve+0.009
 Kyle+0.009
Neg logit contributions
 wil-0.005
arre-0.005
 ow-0.005
 spilled-0.005
 suffering-0.005
Token many
Token ID867
Feature activation+2.39e-01

Pos logit contributions
 ow+0.005
 mosquitoes+0.004
arre+0.004
 wil+0.004
 suffering+0.004
Neg logit contributions
lock-0.005
 guide-0.005
 Steve-0.005
 steer-0.005
lynn-0.005
Token big
Token ID1263
Feature activation+3.85e-01

Pos logit contributions
lock+0.009
 Steve+0.008
ver+0.008
lot+0.008
 Kyle+0.008
Neg logit contributions
 ow-0.004
ins-0.004
 mosquitoes-0.004
pill-0.004
arre-0.004
Token animals
Token ID4695
Feature activation+2.06e+00

Pos logit contributions
lock+0.013
ver+0.013
lot+0.012
lynn+0.011
 guide+0.011
Neg logit contributions
 ow-0.008
pill-0.007
 mosquitoes-0.007
arre-0.007
 bees-0.006
Token,
Token ID11
Feature activation-3.89e-01

Pos logit contributions
lock+0.043
 X+0.042
 guide+0.042
 Steve+0.041
 Kyle+0.039
Neg logit contributions
arre-0.060
 suffering-0.057
 mosquitoes-0.057
 ow-0.057
 wil-0.056
Token like
Token ID588
Feature activation+3.16e-01

Pos logit contributions
 mosquitoes+0.009
 ow+0.009
 miserable+0.009
 wil+0.009
 suffering+0.008
Neg logit contributions
lock-0.011
 X-0.010
 guide-0.010
lynn-0.010
 Kyle-0.010
Token lions
Token ID31420
Feature activation+9.72e-01

Pos logit contributions
 guide+0.008
 Steve+0.008
 X+0.008
lock+0.008
 Mark+0.008
Neg logit contributions
arre-0.009
 ow-0.008
 wil-0.008
ilt-0.008
ivery-0.008
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
lock+0.018
 guide+0.016
 X+0.016
 Steve+0.015
lynn+0.015
Neg logit contributions
 ow-0.033
arre-0.032
 mosquitoes-0.031
 suffering-0.030
 wil-0.030
Token elephants
Token ID25368
Feature activation+1.49e+00

Pos logit contributions
 ow+0.026
arre+0.026
 mosquitoes+0.025
pill+0.024
 wil+0.023
Neg logit contributions
lock-0.031
 guide-0.029
 X-0.029
 Steve-0.028
 Kyle-0.027
Token I
Token ID314
Feature activation-8.35e-01

Pos logit contributions
arre+0.085
 mosquitoes+0.082
 wil+0.081
 ow+0.080
ilt+0.080
Neg logit contributions
lock-0.054
 guide-0.052
 Steve-0.052
 X-0.051
 Mark-0.050
Token found
Token ID1043
Feature activation+4.06e-02

Pos logit contributions
 mosquitoes+0.030
 ow+0.028
arre+0.028
der+0.027
Acc+0.027
Neg logit contributions
 guide-0.015
lock-0.014
 X-0.014
 point-0.013
 Steve-0.012
Token a
Token ID257
Feature activation+8.90e-01

Pos logit contributions
 X+0.001
 Steve+0.001
 guide+0.001
 Mark+0.001
lock+0.001
Neg logit contributions
arre-0.002
ivery-0.002
ivering-0.001
Acc-0.001
ilt-0.001
Token pretty
Token ID2495
Feature activation+1.40e-02

Pos logit contributions
 guide+0.022
lock+0.021
 X+0.021
 Steve+0.020
 Mark+0.019
Neg logit contributions
arre-0.025
 ow-0.023
 mosquitoes-0.023
 wil-0.023
onel-0.022
Token pe
Token ID613
Feature activation+9.29e-01

Pos logit contributions
 guide+0.000
 X+0.000
lock+0.000
 Steve+0.000
 Mark+0.000
Neg logit contributions
arre-0.000
 ow-0.000
 mosquitoes-0.000
onel-0.000
Acc-0.000
Tokenb
Token ID65
Feature activation+2.35e+00

Pos logit contributions
 guide+0.022
 Steve+0.022
lock+0.022
 X+0.021
 Mark+0.020
Neg logit contributions
arre-0.025
 ow-0.024
 wil-0.024
 mosquitoes-0.024
 suffering-0.023
Tokenble
Token ID903
Feature activation-5.27e-01

Pos logit contributions
 guide+0.060
 Steve+0.059
lock+0.057
 X+0.057
 Mark+0.055
Neg logit contributions
arre-0.059
 ow-0.058
onel-0.055
 wil-0.055
 mosquitoes-0.054
Token!"
Token ID2474
Feature activation-1.67e-01

Pos logit contributions
arre+0.020
Acc+0.018
ivery+0.018
 ow+0.018
 mosquitoes+0.017
Neg logit contributions
lock-0.010
 guide-0.009
 X-0.009
 Steve-0.008
 point-0.008
Token she
Token ID673
Feature activation-1.67e+00

Pos logit contributions
arre+0.006
 ow+0.005
 mosquitoes+0.005
 wil+0.005
onel+0.005
Neg logit contributions
 guide-0.003
 Steve-0.003
lock-0.003
 X-0.003
 Mark-0.003
Token said
Token ID531
Feature activation-2.11e-01

Pos logit contributions
arre+0.056
 ow+0.053
 mosquitoes+0.052
 wil+0.051
ivery+0.051
Neg logit contributions
 guide-0.032
 Steve-0.032
 X-0.031
lock-0.031
 Mark-0.029
Token,
Token ID11
Feature activation-4.55e-01

Pos logit contributions
arre+0.007
 ow+0.007
 wil+0.007
 mosquitoes+0.007
 suffering+0.007
Neg logit contributions
 guide-0.004
lock-0.003
 X-0.003
 Steve-0.003
 Mark-0.003
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
 guide+0.005
lock+0.005
 X+0.005
 Steve+0.005
 Mark+0.004
Neg logit contributions
arre-0.009
 ow-0.009
 mosquitoes-0.009
 wil-0.009
Acc-0.009
Token The
Token ID383
Feature activation+5.04e-01

Pos logit contributions
 Steve+0.004
 guide+0.004
 Mark+0.004
lock+0.004
 Inside+0.003
Neg logit contributions
 wil-0.015
onel-0.015
 ow-0.014
eling-0.014
 dying-0.014
Token game
Token ID983
Feature activation-9.01e-01

Pos logit contributions
lock+0.014
 Steve+0.014
 guide+0.014
lot+0.013
 X+0.013
Neg logit contributions
arre-0.012
 ow-0.011
 mosquitoes-0.011
 suffering-0.011
 allergies-0.011
Token was
Token ID373
Feature activation+6.54e-01

Pos logit contributions
arre+0.027
 ow+0.025
 mosquitoes+0.025
 wil+0.025
 dying+0.024
Neg logit contributions
lock-0.020
 guide-0.020
 X-0.019
 Steve-0.019
 Mark-0.017
Token so
Token ID523
Feature activation+6.65e-02

Pos logit contributions
 guide+0.014
lock+0.014
 X+0.013
 Steve+0.013
 Mark+0.012
Neg logit contributions
arre-0.020
 ow-0.019
 mosquitoes-0.019
 wil-0.019
 suffering-0.018
Token fun
Token ID1257
Feature activation+2.12e+00

Pos logit contributions
lock+0.002
 guide+0.002
 X+0.002
 Steve+0.002
lot+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 wil-0.002
 mosquitoes-0.002
 dying-0.002
Token!
Token ID0
Feature activation+2.44e-01

Pos logit contributions
 guide+0.037
 Steve+0.036
lock+0.035
 X+0.034
 Mark+0.032
Neg logit contributions
arre-0.075
 ow-0.071
 wil-0.070
ivery-0.070
onel-0.069
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
 Steve+0.003
lock+0.003
 guide+0.003
 Mark+0.003
 X+0.003
Neg logit contributions
 wil-0.009
onel-0.009
 ow-0.009
 mosquitoes-0.009
 dying-0.009
Token
Token ID198
Feature activation-6.16e-01

Pos logit contributions
arre+0.013
 ow+0.012
 wil+0.012
 mosquitoes+0.012
 suffering+0.012
Neg logit contributions
 guide-0.006
lock-0.006
 Steve-0.006
 X-0.006
 Mark-0.005
TokenWhen
Token ID2215
Feature activation+6.66e-01

Pos logit contributions
 ow+0.023
onel+0.023
 fault+0.023
 wil+0.022
 suffering+0.022
Neg logit contributions
lock-0.009
 guide-0.009
 Steve-0.008
 steer-0.007
 X-0.007
Token it
Token ID340
Feature activation+6.59e-02

Pos logit contributions
 guide+0.012
lock+0.012
 X+0.011
 Steve+0.011
 Mark+0.010
Neg logit contributions
arre-0.023
 ow-0.022
 wil-0.021
 mosquitoes-0.021
onel-0.021
TokenOnce
Token ID7454
Feature activation-6.82e-01

Pos logit contributions
arre+0.008
 ow+0.007
 mosquitoes+0.007
 wil+0.007
 suffering+0.007
Neg logit contributions
 guide-0.006
lock-0.006
 X-0.005
 Steve-0.005
 steer-0.005
Token upon
Token ID2402
Feature activation-7.90e-01

Pos logit contributions
arre+0.019
 ow+0.018
 mosquitoes+0.018
 wil+0.018
 allergies+0.017
Neg logit contributions
 guide-0.016
lock-0.016
 Steve-0.016
 X-0.015
 Mark-0.015
Token a
Token ID257
Feature activation-1.92e+00

Pos logit contributions
arre+0.029
 wil+0.026
 ow+0.026
 mosquitoes+0.026
onel+0.026
Neg logit contributions
lock-0.013
 guide-0.013
 X-0.012
 Steve-0.012
 Mark-0.011
Token time
Token ID640
Feature activation+2.00e+00

Pos logit contributions
arre+0.050
 ow+0.047
 mosquitoes+0.047
 wil+0.046
 dying+0.046
Neg logit contributions
 guide-0.047
lock-0.047
 X-0.046
 Steve-0.046
 Mark-0.044
Token,
Token ID11
Feature activation+2.90e-01

Pos logit contributions
 guide+0.030
lock+0.029
 X+0.028
 Steve+0.025
lot+0.023
Neg logit contributions
arre-0.075
 ow-0.073
 mosquitoes-0.070
 wil-0.069
 suffering-0.069
Token there
Token ID612
Feature activation+9.06e-01

Pos logit contributions
lock+0.007
 guide+0.006
 Steve+0.006
 X+0.006
 Mark+0.006
Neg logit contributions
arre-0.009
 ow-0.008
 mosquitoes-0.008
 allergies-0.008
 wil-0.008
Token was
Token ID373
Feature activation-1.45e-01

Pos logit contributions
 guide+0.019
lock+0.019
 X+0.018
 Steve+0.018
 Mark+0.016
Neg logit contributions
arre-0.028
 ow-0.027
 mosquitoes-0.026
 wil-0.026
 suffering-0.026
Token a
Token ID257
Feature activation-1.57e-03

Pos logit contributions
arre+0.006
 wil+0.005
onel+0.005
 ow+0.005
 dying+0.005
Neg logit contributions
 Steve-0.002
 X-0.002
 Mark-0.002
lock-0.002
 guide-0.002
Token little
Token ID1310
Feature activation-3.72e-02

Pos logit contributions
 ow+0.000
arre+0.000
 mosquitoes+0.000
 wil+0.000
 suffering+0.000
Neg logit contributions
lock-0.000
 Steve-0.000
 guide-0.000
 X-0.000
 Mark-0.000
Token boy
Token ID2933
Feature activation+1.13e+00

Pos logit contributions
 ow+0.001
arre+0.001
 mosquitoes+0.001
 wil+0.001
 suffering+0.001
Neg logit contributions
lock-0.001
 guide-0.001
 Steve-0.001
 X-0.001
lot-0.001
Token named
Token ID3706
Feature activation-9.94e-01

Pos logit contributions
 guide+0.029
lock+0.029
 X+0.028
 Steve+0.028
 Kyle+0.027
Neg logit contributions
arre-0.029
 ow-0.028
 suffering-0.028
 wil-0.028
 mosquitoes-0.027
Token he
Token ID339
Feature activation-1.32e+00

Pos logit contributions
 mosquitoes+0.000
 ow+0.000
 wil+0.000
 dying+0.000
 suffering+0.000
Neg logit contributions
lock-0.000
 guide-0.000
 X-0.000
 Steve-0.000
lynn-0.000
Token flew
Token ID13112
Feature activation-7.55e-02

Pos logit contributions
arre+0.040
 ow+0.037
 mosquitoes+0.037
 wil+0.036
 suffering+0.036
Neg logit contributions
 guide-0.029
lock-0.029
 X-0.028
 Steve-0.027
 Mark-0.025
Token.
Token ID13
Feature activation+4.03e-02

Pos logit contributions
arre+0.002
 ow+0.002
 mosquitoes+0.002
 wil+0.002
 suffering+0.002
Neg logit contributions
lock-0.002
 guide-0.002
 X-0.002
 Steve-0.002
 Mark-0.001
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
lot+0.001
Neg logit contributions
arre-0.001
 ow-0.001
Acc-0.001
 suffering-0.001
 allergies-0.001
Token
Token ID198
Feature activation-4.56e-01

Pos logit contributions
arre+0.005
Acc+0.004
ins+0.004
 mosquitoes+0.004
 ow+0.004
Neg logit contributions
 X-0.006
 guide-0.006
lock-0.006
 Steve-0.005
 map-0.005
TokenJohnny
Token ID44960
Feature activation+1.27e+00

Pos logit contributions
arre+0.014
 ow+0.013
 mosquitoes+0.013
 wil+0.013
 allergies+0.012
Neg logit contributions
 guide-0.010
lock-0.010
 X-0.010
 Steve-0.009
 Mark-0.009
Token waved
Token ID26834
Feature activation+9.85e-02

Pos logit contributions
 guide+0.027
lock+0.026
 Steve+0.026
 X+0.025
 Mark+0.024
Neg logit contributions
arre-0.039
 ow-0.037
 mosquitoes-0.037
 wil-0.036
onel-0.036
Token back
Token ID736
Feature activation-8.47e-02

Pos logit contributions
lock+0.003
 guide+0.003
 X+0.002
 Steve+0.002
lynn+0.002
Neg logit contributions
arre-0.002
 ow-0.002
 mosquitoes-0.002
 wil-0.002
 suffering-0.002
Token and
Token ID290
Feature activation+8.27e-01

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.003
 wil+0.003
 suffering+0.003
Neg logit contributions
 guide-0.002
lock-0.002
 X-0.002
 Steve-0.002
 Mark-0.001
Token said
Token ID531
Feature activation-5.67e-01

Pos logit contributions
lock+0.019
 guide+0.019
 X+0.018
 Steve+0.017
lynn+0.017
Neg logit contributions
arre-0.023
 ow-0.022
 wil-0.022
 mosquitoes-0.022
 suffering-0.022
Token,
Token ID11
Feature activation-8.63e-01

Pos logit contributions
arre+0.019
 ow+0.019
 mosquitoes+0.018
onel+0.018
 suffering+0.018
Neg logit contributions
lock-0.009
 guide-0.009
 X-0.009
 Steve-0.008
lynn-0.008
Token the
Token ID262
Feature activation+3.44e-01

Pos logit contributions
 Steve+0.015
 X+0.014
 guide+0.014
 Mark+0.013
 point+0.012
Neg logit contributions
arre-0.047
ivery-0.044
ivering-0.042
onel-0.041
Acc-0.040
Token earth
Token ID4534
Feature activation-2.51e-01

Pos logit contributions
 guide+0.008
 X+0.008
lock+0.008
 Steve+0.008
 Mark+0.007
Neg logit contributions
arre-0.010
Acc-0.009
 ow-0.009
onel-0.009
ivery-0.009
Token.
Token ID13
Feature activation-6.25e-01

Pos logit contributions
arre+0.009
Acc+0.008
 ow+0.008
ivery+0.008
 mosquitoes+0.008
Neg logit contributions
 guide-0.004
lock-0.004
 X-0.004
 Steve-0.004
 point-0.004
Token Ben
Token ID3932
Feature activation+8.05e-01

Pos logit contributions
arre+0.021
 ow+0.020
 wil+0.019
 mosquitoes+0.019
 suffering+0.019
Neg logit contributions
 guide-0.012
lock-0.012
 Steve-0.012
 X-0.012
 Mark-0.011
Token has
Token ID468
Feature activation+8.23e-01

Pos logit contributions
 guide+0.018
lock+0.018
 X+0.018
 Steve+0.018
 Mark+0.016
Neg logit contributions
arre-0.024
 ow-0.022
 mosquitoes-0.022
 wil-0.022
 suffering-0.021
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
 X+0.013
 guide+0.012
 Steve+0.012
lock+0.012
 Mark+0.011
Neg logit contributions
arre-0.033
ivering-0.031
ivery-0.031
ilt-0.031
ins-0.030
Token small
Token ID1402
Feature activation+9.75e-01

Pos logit contributions
lock+0.026
 guide+0.026
 Steve+0.025
 X+0.025
 Mark+0.023
Neg logit contributions
arre-0.029
 ow-0.028
 mosquitoes-0.027
 wil-0.027
 suffering-0.026
Token ball
Token ID2613
Feature activation+6.03e-01

Pos logit contributions
lock+0.022
 guide+0.022
 Steve+0.021
 X+0.020
 Mark+0.020
Neg logit contributions
arre-0.028
 ow-0.027
 mosquitoes-0.026
 wil-0.026
 suffering-0.026
Token that
Token ID326
Feature activation-7.34e-01

Pos logit contributions
 guide+0.010
lock+0.010
 X+0.010
 Steve+0.009
 Mark+0.008
Neg logit contributions
arre-0.023
 ow-0.021
 mosquitoes-0.021
Acc-0.021
ivery-0.020
Token looks
Token ID3073
Feature activation-8.82e-01

Pos logit contributions
arre+0.026
 ow+0.024
 mosquitoes+0.024
 wil+0.023
 suffering+0.023
Neg logit contributions
 guide-0.013
lock-0.013
 X-0.013
 Steve-0.012
 Mark-0.011
Token like
Token ID588
Feature activation+1.37e+00

Pos logit contributions
arre+0.034
 ow+0.031
ivery+0.031
 mosquitoes+0.031
onel+0.030
Neg logit contributions
 guide-0.014
 X-0.013
 Steve-0.012
lock-0.012
 point-0.011
Tokenroom
Token ID3823
Feature activation-8.78e-02

Pos logit contributions
arre+0.004
ins+0.004
onel+0.003
ilt+0.003
 ow+0.003
Neg logit contributions
 guide-0.005
 Steve-0.005
 X-0.005
lock-0.005
 Mark-0.005
Token!"
Token ID2474
Feature activation-1.01e-01

Pos logit contributions
arre+0.003
 mosquitoes+0.003
Acc+0.003
 ow+0.003
 wil+0.003
Neg logit contributions
 guide-0.002
lock-0.002
 X-0.002
 Steve-0.001
 Mark-0.001
Token Ben
Token ID3932
Feature activation-3.91e-01

Pos logit contributions
arre+0.004
 ow+0.003
 wil+0.003
onel+0.003
 mosquitoes+0.003
Neg logit contributions
 guide-0.002
 Steve-0.002
lock-0.002
 X-0.002
 Mark-0.002
Token said
Token ID531
Feature activation-5.70e-01

Pos logit contributions
arre+0.013
 ow+0.012
 wil+0.012
 mosquitoes+0.012
 dying+0.012
Neg logit contributions
 guide-0.008
 X-0.007
lock-0.007
 Steve-0.007
 Mark-0.007
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
arre+0.015
 ow+0.014
 suffering+0.014
 dying+0.014
 wil+0.014
Neg logit contributions
 X-0.014
 guide-0.014
lock-0.014
 Steve-0.013
lynn-0.012
Token pointing
Token ID10609
Feature activation-1.04e-01

Pos logit contributions
arre+0.007
 ow+0.006
ilt+0.006
 mosquitoes+0.006
 wil+0.006
Neg logit contributions
 guide-0.006
lock-0.006
 Steve-0.005
 X-0.005
 Mark-0.005
Token at
Token ID379
Feature activation+8.44e-01

Pos logit contributions
arre+0.003
 ow+0.002
 wil+0.002
 mosquitoes+0.002
 dying+0.002
Neg logit contributions
lock-0.003
 guide-0.003
 X-0.003
 Steve-0.003
 Mark-0.002
Token another
Token ID1194
Feature activation+9.16e-01

Pos logit contributions
 guide+0.015
lock+0.015
 X+0.015
 Steve+0.015
 Mark+0.013
Neg logit contributions
arre-0.029
 ow-0.027
 mosquitoes-0.027
 wil-0.027
 allergies-0.026
Token picture
Token ID4286
Feature activation-1.56e-01

Pos logit contributions
lock+0.022
 guide+0.021
 Steve+0.021
 X+0.020
lot+0.019
Neg logit contributions
arre-0.025
 ow-0.025
 wil-0.024
 suffering-0.023
 mosquitoes-0.023
Token.
Token ID13
Feature activation+5.75e-02

Pos logit contributions
arre+0.005
 ow+0.005
 mosquitoes+0.005
 wil+0.005
 allergies+0.005
Neg logit contributions
 guide-0.003
lock-0.003
 X-0.003
 Steve-0.003
 Mark-0.003
Token
Token ID198
Feature activation-6.80e-02

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 mosquitoes-0.002
 wil-0.002
 suffering-0.002
Token tree
Token ID5509
Feature activation+7.64e-01

Pos logit contributions
 guide+0.003
lock+0.003
 X+0.002
 Steve+0.002
 Mark+0.002
Neg logit contributions
arre-0.005
 wil-0.004
 ow-0.004
 allergies-0.004
Acc-0.004
Token!"
Token ID2474
Feature activation-1.96e-01

Pos logit contributions
 guide+0.015
lock+0.015
 X+0.015
 Steve+0.014
 Mark+0.013
Neg logit contributions
arre-0.025
 ow-0.024
 mosquitoes-0.023
 wil-0.023
 allergies-0.023
Token Ben
Token ID3932
Feature activation-4.78e-02

Pos logit contributions
arre+0.007
 wil+0.007
onel+0.007
Acc+0.006
 ow+0.006
Neg logit contributions
 guide-0.003
 Steve-0.003
 X-0.003
lock-0.003
 Mark-0.003
Token says
Token ID1139
Feature activation-1.10e-01

Pos logit contributions
arre+0.001
 ow+0.001
 mosquitoes+0.001
 allergies+0.001
 wil+0.001
Neg logit contributions
 guide-0.001
lock-0.001
 X-0.001
 Steve-0.001
lot-0.001
Token.
Token ID13
Feature activation+9.03e-02

Pos logit contributions
arre+0.002
 suffering+0.002
 itching+0.002
 ow+0.002
 miserable+0.002
Neg logit contributions
 X-0.003
 guide-0.003
lock-0.003
lynn-0.003
Ray-0.003
Token He
Token ID679
Feature activation+1.26e+00

Pos logit contributions
lock+0.001
 guide+0.001
 X+0.001
 Steve+0.001
lynn+0.001
Neg logit contributions
arre-0.003
 ow-0.003
 mosquitoes-0.003
 itching-0.003
 allergies-0.003
Token likes
Token ID7832
Feature activation+5.36e-02

Pos logit contributions
 guide+0.032
lock+0.032
 X+0.031
 Steve+0.031
 Mark+0.029
Neg logit contributions
arre-0.034
 ow-0.032
 mosquitoes-0.031
 wil-0.031
 allergies-0.030
Token to
Token ID284
Feature activation+2.98e-01

Pos logit contributions
 Mark+0.001
 Steve+0.001
 X+0.001
 guide+0.001
 Ray+0.001
Neg logit contributions
arre-0.002
ivery-0.002
ivering-0.002
onel-0.002
ilt-0.002
Token climb
Token ID12080
Feature activation+5.48e-01

Pos logit contributions
 guide+0.005
lock+0.005
 Steve+0.005
 X+0.005
 Kyle+0.004
Neg logit contributions
arre-0.010
 ow-0.010
 mosquitoes-0.010
 wil-0.010
 suffering-0.009
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
lock+0.012
 guide+0.012
 X+0.011
 Steve+0.011
lot+0.010
Neg logit contributions
arre-0.016
 mosquitoes-0.016
 ow-0.016
 wil-0.016
 dying-0.015
Token
Token ID198
Feature activation-3.12e-01

Pos logit contributions
 guide+0.003
lock+0.003
 X+0.003
 Steve+0.003
 Mark+0.003
Neg logit contributions
arre-0.006
 ow-0.006
 mosquitoes-0.006
 wil-0.006
 allergies-0.006
Token She
Token ID1375
Feature activation+2.48e-01

Pos logit contributions
lock+0.011
 Steve+0.011
 guide+0.010
 X+0.010
 Mark+0.010
Neg logit contributions
arre-0.017
 ow-0.016
 mosquitoes-0.016
 wil-0.016
 dying-0.016
Token also
Token ID635
Feature activation-3.60e-01

Pos logit contributions
 guide+0.005
lock+0.005
 X+0.005
 Steve+0.005
 Mark+0.005
Neg logit contributions
arre-0.008
 ow-0.007
 mosquitoes-0.007
 wil-0.007
 allergies-0.007
Token noticed
Token ID6810
Feature activation-4.45e-01

Pos logit contributions
arre+0.011
 mosquitoes+0.011
 ow+0.011
 allergies+0.011
ivery+0.010
Neg logit contributions
 guide-0.007
 Steve-0.007
lock-0.007
 X-0.007
 Kyle-0.006
Token that
Token ID326
Feature activation+4.39e-02

Pos logit contributions
arre+0.015
ivery+0.014
ivering+0.013
 ow+0.013
onel+0.013
Neg logit contributions
 Steve-0.009
 guide-0.009
 X-0.009
lock-0.008
 Mark-0.008
Token this
Token ID428
Feature activation-2.78e-01

Pos logit contributions
 guide+0.001
 Steve+0.001
lock+0.001
 X+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
 ow-0.001
ivery-0.001
 mosquitoes-0.001
 allergies-0.001
Token flower
Token ID15061
Feature activation-1.56e+00

Pos logit contributions
arre+0.008
ivery+0.007
Acc+0.007
onel+0.007
 allergies+0.007
Neg logit contributions
 guide-0.007
 X-0.007
lock-0.007
 Steve-0.006
 point-0.006
Token was
Token ID373
Feature activation-6.91e-02

Pos logit contributions
arre+0.058
 mosquitoes+0.053
ivery+0.053
Acc+0.052
 ow+0.052
Neg logit contributions
 guide-0.026
lock-0.025
 Steve-0.024
 X-0.023
 point-0.021
Token even
Token ID772
Feature activation-7.04e-02

Pos logit contributions
arre+0.002
ivery+0.002
ins+0.002
 mosquitoes+0.002
onel+0.002
Neg logit contributions
 guide-0.001
 X-0.001
 Steve-0.001
lock-0.001
 point-0.001
Token l
Token ID300
Feature activation+1.16e-01

Pos logit contributions
arre+0.002
 ow+0.002
 mosquitoes+0.002
ivery+0.002
onel+0.002
Neg logit contributions
 guide-0.001
 X-0.001
lock-0.001
 Steve-0.001
 point-0.001
Tokenitt
Token ID715
Feature activation+2.26e-01

Pos logit contributions
lock+0.003
 guide+0.003
 X+0.003
 Steve+0.003
 Kyle+0.003
Neg logit contributions
arre-0.003
 ow-0.002
 mosquitoes-0.002
 allergies-0.002
 suffering-0.002
Tokenler
Token ID1754
Feature activation+5.58e-01

Pos logit contributions
lock+0.006
 guide+0.006
 X+0.005
 Steve+0.005
 steer+0.005
Neg logit contributions
arre-0.006
 ow-0.006
Acc-0.006
 mosquitoes-0.005
ivery-0.005
Token cat
Token ID3797
Feature activation-7.47e-01

Pos logit contributions
 guide+0.007
 X+0.007
lock+0.007
 Steve+0.007
 Mark+0.006
Neg logit contributions
arre-0.008
 ow-0.008
 wil-0.008
onel-0.008
Acc-0.007
Token liked
Token ID8288
Feature activation+2.76e-01

Pos logit contributions
arre+0.026
ivery+0.024
 ow+0.024
 mosquitoes+0.024
Acc+0.024
Neg logit contributions
 guide-0.014
 Steve-0.013
lock-0.013
 X-0.012
 Mark-0.012
Token the
Token ID262
Feature activation+6.11e-01

Pos logit contributions
 Steve+0.004
 Mark+0.004
 guide+0.004
 X+0.004
 Ray+0.003
Neg logit contributions
arre-0.011
ivery-0.011
onel-0.010
ilt-0.010
ivering-0.010
Token song
Token ID3496
Feature activation-3.84e-01

Pos logit contributions
 guide+0.014
 X+0.013
 Steve+0.013
lock+0.013
 Mark+0.012
Neg logit contributions
arre-0.019
 wil-0.017
 ow-0.017
onel-0.017
Acc-0.017
Token and
Token ID290
Feature activation+4.80e-01

Pos logit contributions
arre+0.013
 ow+0.012
 mosquitoes+0.012
 wil+0.012
 suffering+0.012
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.007
 Mark-0.006
Token started
Token ID2067
Feature activation-1.39e+00

Pos logit contributions
 guide+0.009
 Steve+0.009
 Mark+0.008
 X+0.008
 Ray+0.008
Neg logit contributions
arre-0.017
ivery-0.016
onel-0.016
ilt-0.016
ivering-0.016
Token to
Token ID284
Feature activation-3.31e-01

Pos logit contributions
ivery+0.058
ilt+0.056
 ow+0.056
onel+0.056
arre+0.056
Neg logit contributions
 guide-0.016
 Steve-0.016
 Mark-0.015
 point-0.014
 steer-0.013
Token dance
Token ID9280
Feature activation-1.20e+00

Pos logit contributions
arre+0.013
ins+0.012
onel+0.012
 mosquitoes+0.012
 ow+0.012
Neg logit contributions
 guide-0.005
 point-0.005
 Steve-0.005
 X-0.005
 steer-0.004
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
arre+0.042
 ow+0.040
 wil+0.039
 mosquitoes+0.039
 suffering+0.039
Neg logit contributions
 guide-0.022
lock-0.020
 Steve-0.020
 X-0.020
 Mark-0.018
Token While
Token ID2893
Feature activation-7.77e-01

Pos logit contributions
 guide+0.005
lock+0.005
 X+0.005
 Steve+0.004
 Mark+0.004
Neg logit contributions
arre-0.009
 ow-0.008
 mosquitoes-0.008
 wil-0.008
 allergies-0.008
Token the
Token ID262
Feature activation+2.32e-01

Pos logit contributions
arre+0.030
ivery+0.027
onel+0.027
 ow+0.027
ilt+0.027
Neg logit contributions
 guide-0.012
 Steve-0.012
 Mark-0.011
lock-0.011
 X-0.011
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
arre+0.047
 disl+0.042
ivery+0.041
 mosquitoes+0.041
 allergies+0.040
Neg logit contributions
 guide-0.018
lock-0.017
 X-0.015
 point-0.014
 Steve-0.014
Token The
Token ID383
Feature activation+1.39e-01

Pos logit contributions
 ow+0.005
arre+0.005
 mosquitoes+0.004
 wil+0.004
 suffering+0.004
Neg logit contributions
 guide-0.003
lock-0.003
 X-0.003
 Steve-0.003
lot-0.003
Token squash
Token ID34613
Feature activation-1.52e+00

Pos logit contributions
 guide+0.004
 X+0.004
 Steve+0.004
lock+0.004
 point+0.004
Neg logit contributions
arre-0.003
 allergies-0.003
onel-0.003
pill-0.003
 wil-0.003
Token lived
Token ID5615
Feature activation-1.08e-01

Pos logit contributions
arre+0.063
 mosquitoes+0.057
ivery+0.057
 allergies+0.056
onel+0.056
Neg logit contributions
 guide-0.020
lock-0.018
 X-0.017
 point-0.017
 Steve-0.016
Token in
Token ID287
Feature activation+2.85e-01

Pos logit contributions
arre+0.004
 ow+0.004
 mosquitoes+0.004
 allergies+0.004
 disl+0.004
Neg logit contributions
lock-0.002
 guide-0.002
 X-0.002
 Steve-0.001
 point-0.001
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
 X+0.004
 guide+0.004
 Steve+0.004
lock+0.004
 Mark+0.004
Neg logit contributions
arre-0.011
onel-0.010
ins-0.010
ivery-0.010
 allergies-0.010
Token comfortable
Token ID6792
Feature activation-1.93e+00

Pos logit contributions
arre+0.040
der+0.037
onel+0.037
 allergies+0.037
ins+0.036
Neg logit contributions
 guide-0.019
 X-0.019
lock-0.018
 point-0.018
 map-0.016
Token garden
Token ID11376
Feature activation+5.18e-02

Pos logit contributions
arre+0.061
onel+0.057
Acc+0.056
 ow+0.055
ins+0.054
Neg logit contributions
 guide-0.043
 X-0.041
lock-0.040
 Steve-0.038
 point-0.038
Token with
Token ID351
Feature activation+7.28e-02

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 mosquitoes-0.002
 wil-0.002
 allergies-0.002
Token many
Token ID867
Feature activation-6.90e-01

Pos logit contributions
 X+0.002
 Steve+0.001
 guide+0.001
 Mark+0.001
lock+0.001
Neg logit contributions
arre-0.003
ivering-0.002
ivery-0.002
Acc-0.002
onel-0.002
Token other
Token ID584
Feature activation-4.44e-01

Pos logit contributions
arre+0.023
onel+0.021
storm+0.021
Acc+0.020
ivery+0.020
Neg logit contributions
 guide-0.013
 X-0.012
 point-0.012
 Steve-0.012
lock-0.012
Token.
Token ID13
Feature activation+4.31e-01

Pos logit contributions
 guide+0.014
 Steve+0.013
lock+0.013
 X+0.012
 Mark+0.011
Neg logit contributions
arre-0.044
ivery-0.042
 ow-0.042
Acc-0.041
onel-0.041
Token The
Token ID383
Feature activation+4.90e-01

Pos logit contributions
 Steve+0.007
 Mark+0.007
lock+0.006
 guide+0.006
 Kyle+0.006
Neg logit contributions
arre-0.016
 ow-0.016
 mosquitoes-0.016
 wil-0.015
onel-0.015
Token little
Token ID1310
Feature activation+6.00e-01

Pos logit contributions
 guide+0.012
lock+0.012
 X+0.012
 Steve+0.011
 point+0.010
Neg logit contributions
arre-0.014
onel-0.012
ins-0.012
ilt-0.012
 wil-0.012
Token girl
Token ID2576
Feature activation-2.22e-01

Pos logit contributions
 guide+0.014
lock+0.014
 X+0.014
 Steve+0.013
 Mark+0.012
Neg logit contributions
arre-0.018
 ow-0.016
 wil-0.016
 mosquitoes-0.016
onel-0.016
Token explained
Token ID4893
Feature activation+5.82e-01

Pos logit contributions
arre+0.007
 ow+0.007
 mosquitoes+0.007
 wil+0.007
 dying+0.007
Neg logit contributions
 guide-0.004
lock-0.004
 Steve-0.004
 X-0.004
 Mark-0.004
Token that
Token ID326
Feature activation-4.27e-01

Pos logit contributions
 guide+0.014
lock+0.014
 Steve+0.014
 X+0.014
 Mark+0.013
Neg logit contributions
arre-0.016
 ow-0.015
 wil-0.015
 mosquitoes-0.015
 allergies-0.014
Token it
Token ID340
Feature activation-9.32e-01

Pos logit contributions
arre+0.015
ivery+0.013
 ow+0.013
ilt+0.013
ivering+0.013
Neg logit contributions
 guide-0.008
 Steve-0.008
lock-0.008
 X-0.008
 Mark-0.008
Token had
Token ID550
Feature activation+3.68e-01

Pos logit contributions
arre+0.029
 ow+0.028
 mosquitoes+0.027
 wil+0.027
 suffering+0.026
Neg logit contributions
 guide-0.019
lock-0.019
 X-0.019
 Steve-0.018
 Mark-0.017
Token shrunk
Token ID38900
Feature activation-7.77e-01

Pos logit contributions
 guide+0.008
 Steve+0.008
lock+0.007
 X+0.007
 point+0.007
Neg logit contributions
arre-0.012
ivery-0.011
ivering-0.011
onel-0.011
Acc-0.010
Token and
Token ID290
Feature activation-3.38e-01

Pos logit contributions
arre+0.025
 ow+0.024
 mosquitoes+0.023
 wil+0.023
onel+0.023
Neg logit contributions
 guide-0.015
lock-0.015
 Steve-0.015
 X-0.015
 Mark-0.013
Token become
Token ID1716
Feature activation-7.39e-01

Pos logit contributions
arre+0.012
ivery+0.011
ivering+0.011
 ow+0.010
ilt+0.010
Neg logit contributions
 guide-0.007
 Steve-0.006
lock-0.006
 X-0.006
 Mark-0.006
Token spark
Token ID9009
Feature activation-1.38e+00

Pos logit contributions
 guide+0.007
lock+0.007
 Steve+0.007
 X+0.007
 Mark+0.007
Neg logit contributions
 ow-0.005
 wil-0.005
arre-0.005
 itching-0.005
 suffering-0.005
Tokenly
Token ID306
Feature activation+6.88e-02

Pos logit contributions
arre+0.047
 ow+0.044
 wil+0.043
 mosquitoes+0.043
 dying+0.043
Neg logit contributions
 guide-0.026
lock-0.025
 X-0.025
 Steve-0.023
 Mark-0.021
Token dress
Token ID6576
Feature activation+3.67e-01

Pos logit contributions
 guide+0.002
lock+0.002
 X+0.002
 Steve+0.002
 point+0.001
Neg logit contributions
arre-0.002
 mosquitoes-0.002
 ow-0.002
onel-0.002
 wil-0.002
Token and
Token ID290
Feature activation+6.04e-01

Pos logit contributions
 Steve+0.008
 guide+0.008
 X+0.008
lock+0.008
 Kyle+0.007
Neg logit contributions
arre-0.011
 ow-0.010
 wil-0.010
der-0.010
 suffering-0.010
Token her
Token ID607
Feature activation+1.10e-01

Pos logit contributions
lock+0.016
 guide+0.016
 X+0.015
 Steve+0.015
lynn+0.014
Neg logit contributions
 ow-0.015
arre-0.014
 wil-0.014
 mosquitoes-0.014
 spilled-0.014
Token friend
Token ID1545
Feature activation-2.00e+00

Pos logit contributions
 guide+0.003
 X+0.002
lock+0.002
 Steve+0.002
 Mark+0.002
Neg logit contributions
arre-0.003
 mosquitoes-0.003
ilt-0.003
Acc-0.003
 ow-0.003
Token put
Token ID1234
Feature activation-2.52e-01

Pos logit contributions
arre+0.051
 ow+0.048
 allergies+0.047
 wil+0.047
 spilled+0.046
Neg logit contributions
lock-0.051
 guide-0.051
 X-0.050
 Steve-0.047
lot-0.045
Token on
Token ID319
Feature activation-3.75e-01

Pos logit contributions
arre+0.007
 ow+0.007
 wil+0.007
 mosquitoes+0.006
 dying+0.006
Neg logit contributions
lock-0.006
 guide-0.006
 Steve-0.006
 X-0.006
lot-0.005
Token a
Token ID257
Feature activation+4.36e-02

Pos logit contributions
arre+0.009
 ow+0.009
 mosquitoes+0.008
 wil+0.008
 suffering+0.008
Neg logit contributions
 guide-0.010
lock-0.010
 X-0.010
 Steve-0.010
 Mark-0.009
Token pink
Token ID11398
Feature activation-1.18e+00

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.001
 mosquitoes-0.001
 ow-0.001
 wil-0.001
onel-0.001
Token,
Token ID11
Feature activation-1.17e-01

Pos logit contributions
arre+0.034
 mosquitoes+0.030
onel+0.030
 ow+0.029
 wil+0.029
Neg logit contributions
 guide-0.030
 X-0.029
lock-0.029
 Steve-0.027
 point-0.025
Token!"
Token ID2474
Feature activation+5.19e-01

Pos logit contributions
 guide+0.031
lock+0.030
 Steve+0.029
 X+0.029
 Mark+0.027
Neg logit contributions
arre-0.045
 ow-0.043
 wil-0.042
 mosquitoes-0.041
Acc-0.041
Token
Token ID198
Feature activation+1.28e-01

Pos logit contributions
 guide+0.009
lock+0.009
 Steve+0.009
 X+0.009
 Mark+0.008
Neg logit contributions
arre-0.018
 ow-0.017
 mosquitoes-0.017
 wil-0.017
 dying-0.016
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
lock+0.003
 guide+0.003
 X+0.003
 Steve+0.003
 Mark+0.002
Neg logit contributions
arre-0.004
 ow-0.004
 mosquitoes-0.004
 wil-0.004
Acc-0.004
TokenBut
Token ID1537
Feature activation+1.38e+00

Pos logit contributions
arre+0.004
 ow+0.003
 mosquitoes+0.003
 wil+0.003
 allergies+0.003
Neg logit contributions
 guide-0.002
lock-0.002
 X-0.002
 Steve-0.002
 Mark-0.002
Token the
Token ID262
Feature activation-1.61e-02

Pos logit contributions
lock+0.028
 guide+0.028
 X+0.027
 Steve+0.026
lot+0.024
Neg logit contributions
arre-0.043
 ow-0.041
 mosquitoes-0.041
 wil-0.040
 suffering-0.040
Token busy
Token ID8179
Feature activation-2.64e+00

Pos logit contributions
arre+0.000
 ow+0.000
 wil+0.000
ilt+0.000
 mosquitoes+0.000
Neg logit contributions
 guide-0.000
 X-0.000
lock-0.000
 Steve-0.000
 Mark-0.000
Token friend
Token ID1545
Feature activation-5.19e-01

Pos logit contributions
arre+0.082
 ow+0.077
 wil+0.075
 mosquitoes+0.075
Acc+0.074
Neg logit contributions
 guide-0.058
lock-0.055
 Steve-0.054
 X-0.054
 Mark-0.050
Token said
Token ID531
Feature activation-2.60e-01

Pos logit contributions
arre+0.016
 ow+0.015
 mosquitoes+0.014
 wil+0.014
 dying+0.014
Neg logit contributions
 guide-0.011
lock-0.011
 X-0.011
 Steve-0.011
 Mark-0.010
Token,
Token ID11
Feature activation-6.88e-01

Pos logit contributions
arre+0.011
 ow+0.010
 mosquitoes+0.010
 wil+0.010
ilt+0.010
Neg logit contributions
 guide-0.003
lock-0.003
 Steve-0.003
 X-0.003
 Mark-0.002
Token "
Token ID366
Feature activation-9.26e-01

Pos logit contributions
arre+0.025
ilt+0.024
 wil+0.023
 ow+0.023
ivery+0.023
Neg logit contributions
 Steve-0.012
 guide-0.012
lock-0.011
 Mark-0.011
 X-0.011
TokenNo
Token ID2949
Feature activation-2.79e-01

Pos logit contributions
 ow+0.028
arre+0.027
 dying+0.026
 wil+0.026
onel+0.025
Neg logit contributions
lock-0.020
 guide-0.020
 Steve-0.019
 steer-0.018
 Kyle-0.018
Token Tim
Token ID5045
Feature activation+1.07e+00

Pos logit contributions
arre+0.042
 wil+0.038
onel+0.038
ilt+0.038
 ow+0.038
Neg logit contributions
 Steve-0.018
lock-0.017
 guide-0.017
 X-0.017
 Mark-0.017
Tokenmy
Token ID1820
Feature activation+2.61e-02

Pos logit contributions
lock+0.015
 guide+0.015
 X+0.015
 Steve+0.015
 Mark+0.013
Neg logit contributions
arre-0.039
 ow-0.038
 mosquitoes-0.038
 wil-0.037
 dying-0.037
Token's
Token ID338
Feature activation-3.19e-01

Pos logit contributions
lock+0.000
 X+0.000
 guide+0.000
 Steve+0.000
lot+0.000
Neg logit contributions
 ow-0.001
arre-0.001
onel-0.001
 mosquitoes-0.001
ivery-0.001
Token little
Token ID1310
Feature activation-8.51e-02

Pos logit contributions
arre+0.013
pill+0.012
onel+0.012
 wil+0.012
ivery+0.012
Neg logit contributions
 X-0.004
 guide-0.004
lock-0.004
 Steve-0.004
 Mark-0.003
Token sister
Token ID6621
Feature activation-2.41e+00

Pos logit contributions
arre+0.003
ilt+0.002
Acc+0.002
 wil+0.002
onel+0.002
Neg logit contributions
 X-0.002
 guide-0.002
lock-0.002
 Steve-0.002
 Mark-0.002
Token accidentally
Token ID14716
Feature activation-2.71e+00

Pos logit contributions
arre+0.079
 ow+0.078
onel+0.077
ivery+0.077
 mosquitoes+0.077
Neg logit contributions
 guide-0.043
 Steve-0.043
lock-0.040
 Mark-0.040
 X-0.038
Token knocked
Token ID13642
Feature activation-1.03e+00

Pos logit contributions
arre+0.077
ivery+0.072
pill+0.072
onel+0.072
ivering+0.071
Neg logit contributions
lock-0.067
 X-0.062
 drive-0.062
 steer-0.061
 guide-0.061
Token over
Token ID625
Feature activation+6.63e-01

Pos logit contributions
onel+0.042
arre+0.041
pill+0.041
 ow+0.041
 disl+0.041
Neg logit contributions
 Steve-0.010
 Mark-0.010
lock-0.009
 X-0.008
 guide-0.007
Token her
Token ID607
Feature activation+6.24e-02

Pos logit contributions
 guide+0.011
 Steve+0.011
lock+0.011
 X+0.011
 Mark+0.010
Neg logit contributions
arre-0.023
 ow-0.022
 wil-0.022
onel-0.022
ilt-0.021
Token glass
Token ID5405
Feature activation-2.16e+00

Pos logit contributions
 guide+0.001
 X+0.001
lock+0.001
 Steve+0.001
 Mark+0.001
Neg logit contributions
arre-0.002
Acc-0.002
 mosquitoes-0.002
onel-0.002
pill-0.002
Token of
Token ID286
Feature activation+6.86e-01

Pos logit contributions
Acc+0.084
arre+0.083
 disl+0.081
onel+0.080
 ow+0.079
Neg logit contributions
lock-0.028
 guide-0.026
 X-0.024
 opener-0.023
 Steve-0.020
Token it
Token ID340
Feature activation-2.01e+00

Pos logit contributions
arre+0.054
 ow+0.051
 mosquitoes+0.050
 wil+0.049
 allergies+0.049
Neg logit contributions
 guide-0.020
lock-0.020
 X-0.019
 Steve-0.018
 Mark-0.016
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
arre+0.070
 mosquitoes+0.069
 ow+0.067
Acc+0.067
 disl+0.066
Neg logit contributions
 guide-0.034
lock-0.033
 X-0.030
 Steve-0.029
 Mark-0.028
Token spark
Token ID9009
Feature activation-1.39e+00

Pos logit contributions
arre+0.037
pill+0.036
 mosquitoes+0.034
 disl+0.033
ins+0.033
Neg logit contributions
 guide-0.023
 X-0.022
lock-0.022
 Steve-0.022
 point-0.021
Tokenly
Token ID306
Feature activation+6.49e-02

Pos logit contributions
arre+0.045
 ow+0.043
 mosquitoes+0.042
 wil+0.042
 dying+0.041
Neg logit contributions
 guide-0.027
lock-0.027
 X-0.026
 Steve-0.025
 Mark-0.023
Token and
Token ID290
Feature activation-3.18e-01

Pos logit contributions
 guide+0.001
lock+0.001
 X+0.001
 Steve+0.001
 point+0.001
Neg logit contributions
arre-0.002
 ow-0.002
 mosquitoes-0.002
Acc-0.002
ivery-0.002
Token clean
Token ID3424
Feature activation-2.61e+00

Pos logit contributions
arre+0.009
 ow+0.009
 wil+0.008
 mosquitoes+0.008
 dying+0.008
Neg logit contributions
lock-0.008
 guide-0.008
 X-0.007
 Steve-0.007
 Mark-0.007
Token.
Token ID13
Feature activation-7.53e-02

Pos logit contributions
arre+0.087
 ow+0.082
 mosquitoes+0.081
 wil+0.080
 allergies+0.079
Neg logit contributions
 guide-0.050
lock-0.048
 X-0.048
 Steve-0.047
 Mark-0.043
Token
Token ID220
Feature activation-8.14e-01

Pos logit contributions
arre+0.003
 ow+0.003
 mosquitoes+0.003
 wil+0.003
 allergies+0.002
Neg logit contributions
 guide-0.001
lock-0.001
 X-0.001
 Steve-0.001
 Mark-0.001
Token
Token ID198
Feature activation-3.95e-01

Pos logit contributions
arre+0.027
 ow+0.026
 mosquitoes+0.025
 wil+0.025
 suffering+0.025
Neg logit contributions
 guide-0.015
lock-0.015
 X-0.015
 Steve-0.014
 Mark-0.013
Token
Token ID198
Feature activation-7.32e-01

Pos logit contributions
arre+0.014
 ow+0.013
 mosquitoes+0.013
 wil+0.013
 suffering+0.012
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.007
 Mark-0.006
TokenThe
Token ID464
Feature activation-8.43e-02

Pos logit contributions
 ow+0.025
arre+0.024
 wil+0.024
 mosquitoes+0.024
 allergies+0.023
Neg logit contributions
lock-0.014
 guide-0.014
 Steve-0.013
 X-0.012
 steer-0.011
Token and
Token ID290
Feature activation-9.02e-02

Pos logit contributions
 guide+0.009
lock+0.009
 X+0.008
 Steve+0.008
 Mark+0.007
Neg logit contributions
arre-0.024
 ow-0.023
 mosquitoes-0.023
 wil-0.023
 dying-0.022
Token daddy
Token ID35822
Feature activation+1.07e-01

Pos logit contributions
arre+0.004
pill+0.004
 mosquitoes+0.004
ins+0.003
 allergies+0.003
Neg logit contributions
 Steve-0.001
 guide-0.001
 Mark-0.001
lock-0.001
 X-0.001
Token had
Token ID550
Feature activation-3.97e-01

Pos logit contributions
 guide+0.002
lock+0.002
 X+0.002
 Steve+0.002
lynn+0.002
Neg logit contributions
arre-0.003
 ow-0.003
 wil-0.003
 suffering-0.003
 mosquitoes-0.003
Token hit
Token ID2277
Feature activation+1.71e-01

Pos logit contributions
ilt+0.015
ivering+0.014
ivery+0.014
ins+0.014
arre+0.014
Neg logit contributions
 Steve-0.007
 Mark-0.006
 guide-0.006
lock-0.006
 X-0.006
Token his
Token ID465
Feature activation+1.07e+00

Pos logit contributions
 Steve+0.003
 guide+0.003
 Mark+0.003
 X+0.003
lock+0.003
Neg logit contributions
arre-0.006
 wil-0.006
 ow-0.006
onel-0.006
 mosquitoes-0.006
Token thumb
Token ID15683
Feature activation-2.83e+00

Pos logit contributions
 guide+0.023
 X+0.023
lock+0.023
 Steve+0.021
 Mark+0.020
Neg logit contributions
arre-0.033
 mosquitoes-0.031
 wil-0.030
Acc-0.030
 ow-0.030
Token with
Token ID351
Feature activation+6.65e-01

Pos logit contributions
 ow+0.101
arre+0.100
Acc+0.099
 mosquitoes+0.098
pill+0.098
Neg logit contributions
lock-0.046
 guide-0.041
 Steve-0.040
 X-0.038
 Mark-0.037
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
 Steve+0.010
 X+0.010
 Mark+0.010
 guide+0.009
 point+0.008
Neg logit contributions
arre-0.025
ins-0.024
Acc-0.024
ivery-0.023
ivering-0.023
Token hammer
Token ID15554
Feature activation-5.93e-01

Pos logit contributions
 X+0.004
lock+0.004
 guide+0.004
 Steve+0.003
 point+0.003
Neg logit contributions
arre-0.004
ins-0.004
Acc-0.004
pill-0.004
ilt-0.004
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
arre+0.021
 mosquitoes+0.021
 ow+0.021
Acc+0.021
 wil+0.020
Neg logit contributions
lock-0.009
 Steve-0.009
 guide-0.009
 X-0.008
 Mark-0.008
Token M
Token ID337
Feature activation-1.76e-01

Pos logit contributions
 Steve+0.001
 Mark+0.001
 guide+0.001
lock+0.001
 X+0.001
Neg logit contributions
 wil-0.004
 mosquitoes-0.004
arre-0.004
 ow-0.004
onel-0.004
Token towards
Token ID3371
Feature activation-6.22e-01

Pos logit contributions
arre+0.018
 mosquitoes+0.017
 itching+0.016
 miserable+0.016
 suffering+0.016
Neg logit contributions
lock-0.025
 X-0.024
 guide-0.024
ver-0.023
 Steve-0.022
Token them
Token ID606
Feature activation-8.10e-01

Pos logit contributions
 ow+0.019
arre+0.018
 mosquitoes+0.018
 suffering+0.018
 wil+0.018
Neg logit contributions
lock-0.013
 guide-0.013
 X-0.012
 Steve-0.012
lot-0.011
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
arre+0.027
 ow+0.025
 mosquitoes+0.025
 wil+0.025
 allergies+0.024
Neg logit contributions
 guide-0.016
lock-0.016
 X-0.015
 Steve-0.015
 Mark-0.014
Token The
Token ID383
Feature activation+1.72e-01

Pos logit contributions
arre+0.004
 ow+0.004
 mosquitoes+0.004
 suffering+0.004
 wil+0.004
Neg logit contributions
 guide-0.003
lock-0.003
 X-0.003
 Steve-0.003
lynn-0.003
Token dog
Token ID3290
Feature activation-1.12e+00

Pos logit contributions
 guide+0.004
lock+0.004
 X+0.004
 Steve+0.004
 Mark+0.004
Neg logit contributions
arre-0.005
 ow-0.004
 mosquitoes-0.004
 wil-0.004
 suffering-0.004
Token quickly
Token ID2952
Feature activation-2.59e+00

Pos logit contributions
arre+0.036
 ow+0.034
 mosquitoes+0.034
 wil+0.033
ilt+0.033
Neg logit contributions
 guide-0.023
lock-0.022
 Steve-0.022
 X-0.021
 Mark-0.020
Token ran
Token ID4966
Feature activation-4.50e-01

Pos logit contributions
arre+0.080
 ow+0.075
 mosquitoes+0.075
 wil+0.073
 dying+0.072
Neg logit contributions
 guide-0.056
lock-0.055
 X-0.053
 Steve-0.053
 Mark-0.048
Token away
Token ID1497
Feature activation+3.73e-01

Pos logit contributions
arre+0.009
 mosquitoes+0.009
 miserable+0.008
 itching+0.008
 suffering+0.008
Neg logit contributions
lock-0.014
 X-0.013
 guide-0.012
lot-0.012
ver-0.012
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
lock+0.006
 guide+0.006
 X+0.006
 Steve+0.006
 Kyle+0.005
Neg logit contributions
arre-0.013
 ow-0.012
 wil-0.012
 suffering-0.012
 mosquitoes-0.012
Token
Token ID198
Feature activation-7.87e-01

Pos logit contributions
arre+0.009
 ow+0.009
 mosquitoes+0.008
 suffering+0.008
 wil+0.008
Neg logit contributions
 guide-0.007
lock-0.007
 X-0.007
 Steve-0.006
lynn-0.006
Token
Token ID198
Feature activation-9.71e-01

Pos logit contributions
arre+0.019
 ow+0.018
 mosquitoes+0.017
 wil+0.016
Acc+0.016
Neg logit contributions
 X-0.021
 guide-0.021
lock-0.020
 Steve-0.019
 Kyle-0.019