TOP ACTIVATIONS
MAX = 0.093

soft paper from a large pad . Then she picked up
a big box with a pad lock on it . Tom
dust onto the l ily pad and the pond .
bug on a l ily pad . Ho ppy tried to
fish , some l ily pads and even a frog .
's warmth and it was able to move again !
brush under the l ily pad in my nest . You
his lucky stars he was able to observe it . <|endoftext|>
big box , with a pad lock on it . The
was happy that it was able to help out the small
of wood and it was able to fix itself .
and strong that it was able to make it the whole
very happy that it was able to help in the emergency
key . She unlocked the pad lock and opened the door
sitting on a l ily pad . "
was happy that everyone was able to enjoy their meal together
happy that the flower was able to help heal the old
the wood cut ter was able to part the tall trees
top of a l illy pad . One day however a
paints , and a huge pad of paper . She couldn

MIN ACTIVATIONS
MIN = -0.096

John began to make - bel ieve he was a hero
liked to play make - bel ieve . One
loved to play make - bel ieve . Today she was
- up and make - bel ieve . One day ,
red cape and make - bel ieve he had super powers
He loved playing make - bel ieve with his friends .
to take down the light bul b but she was too
and they played make - bel ieve . Finally , they
he was playing make - bel ieve at home , too
was ready for make - bel ieve fun . <|endoftext|> Once
he accidentally touched the light bul b and it was very
Finally , her make - bel ieve adventures were recorded .
He would race make - bel ieve cars around the track
" Please keep your seat bel ts on . We will
a world of make - bel ieve . She
a look at the light bul b . It was very
box next to the light bul b , then climbed up
to cry and threw a tant rum . She didn 't
kept crying and throwing a tant rum . Finally
and started to throw a tant rum . Suddenly , she

INTERVAL 0.045 - 0.093
CONTAINS 0.119%

tree with many branches . Sara wanted to climb the tree
with diced carrots . The b unn ies were very happy
it some water and a bisc uit . The dog smiled
sky . His mother had promised him a reward , and
Jane was able to understand and she nodded

INTERVAL -0.002 - 0.045
CONTAINS 54.249%

Sarah was surprised and happy . She thanked Lily and
for making me happy !" From that day on
different places with her family . One day , they went
, you are very generous ," Lily said , smiling .
started to wr iggle around the hammer vicious ly . Then

INTERVAL -0.049 - -0.002
CONTAINS 45.575%

was happy . <|endoftext|> Once upon a time there were two
tear them ." His friends nodded . They touched his pants
raining outside . <|endoftext|> Once upon a time , there was
he bowed down and gave the cherry to F uzz y
knew it was important . He wanted to make his mom

INTERVAL -0.096 - -0.049
CONTAINS 0.058%

in the pool with her daddy . One day , Lily
. But then , mid - way through the night ,
helicopter and Sally 's face lit up with joy as she
happy to be with his daddy and fell asleep in his
But then he heard a voice coming from outside the house
Token soft
Token ID2705
Feature activation+6.99e-03

Pos logit contributions
 ne+0.027
 subt+0.026
 caring+0.025
 ang+0.023
 forgiving+0.023
Neg logit contributions
 eye-0.023
anger-0.021
 pad-0.021
inger-0.020
ocket-0.020
Token paper
Token ID3348
Feature activation+1.16e-03

Pos logit contributions
 eye+0.010
ocket+0.009
inger+0.009
 Sasha+0.009
anger+0.009
Neg logit contributions
 ne-0.013
 subt-0.013
 caring-0.012
bel-0.011
 ang-0.011
Token from
Token ID422
Feature activation-2.53e-02

Pos logit contributions
 eye+0.002
 Sasha+0.002
ocket+0.002
inger+0.002
iya+0.002
Neg logit contributions
 ne-0.002
 subt-0.002
 caring-0.002
bel-0.002
ths-0.002
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
 ne+0.044
 subt+0.042
 caring+0.039
ths+0.037
Yeah+0.037
Neg logit contributions
 eye-0.043
 gate-0.038
ocket-0.038
 pad-0.037
anger-0.036
Token large
Token ID1588
Feature activation+6.70e-03

Pos logit contributions
 eye+0.018
inger+0.017
anger+0.016
ocket+0.016
 Sasha+0.016
Neg logit contributions
 ne-0.022
 subt-0.020
 caring-0.020
 ang-0.019
bel-0.018
Token pad
Token ID14841
Feature activation+8.57e-02

Pos logit contributions
 eye+0.008
inger+0.008
ocket+0.008
 Sasha+0.007
anger+0.007
Neg logit contributions
 ne-0.014
 subt-0.013
 caring-0.012
 ang-0.012
 fought-0.011
Token.
Token ID13
Feature activation-1.55e-03

Pos logit contributions
 Sasha+0.132
iya+0.126
 eye+0.120
ulp+0.117
inger+0.117
Neg logit contributions
 ne-0.139
bel-0.130
 subt-0.128
penter-0.116
 fought-0.115
Token Then
Token ID3244
Feature activation-5.51e-03

Pos logit contributions
 subt+0.003
 ne+0.003
 caring+0.002
Yeah+0.002
 ang+0.002
Neg logit contributions
 eye-0.003
ocket-0.002
inger-0.002
adow-0.002
hole-0.002
Token she
Token ID673
Feature activation+2.21e-03

Pos logit contributions
 ne+0.009
 subt+0.008
 caring+0.007
ths+0.007
 ang+0.007
Neg logit contributions
 eye-0.010
ocket-0.010
anger-0.009
inger-0.009
 gate-0.009
Token picked
Token ID6497
Feature activation+4.48e-03

Pos logit contributions
 eye+0.004
ocket+0.003
 pad+0.003
 gate+0.003
anger+0.003
Neg logit contributions
 ne-0.004
 subt-0.003
 caring-0.003
 ang-0.003
ths-0.003
Token up
Token ID510
Feature activation-9.54e-03

Pos logit contributions
 eye+0.009
inger+0.008
adow+0.008
ocket+0.008
ipper+0.008
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.006
 ang-0.005
ths-0.005
Token a
Token ID257
Feature activation+4.27e-03

Pos logit contributions
 ne+0.013
 subt+0.013
 caring+0.012
ths+0.011
Yeah+0.011
Neg logit contributions
 eye-0.013
ocket-0.012
 gate-0.012
anger-0.011
 pad-0.011
Token big
Token ID1263
Feature activation-1.13e-02

Pos logit contributions
 eye+0.008
ocket+0.007
 gate+0.007
anger+0.007
 pad+0.006
Neg logit contributions
 ne-0.007
 subt-0.007
 caring-0.006
 ang-0.006
ths-0.006
Token box
Token ID3091
Feature activation-3.95e-03

Pos logit contributions
 ne+0.020
 subt+0.019
 caring+0.017
 ang+0.017
ths+0.016
Neg logit contributions
 eye-0.018
ocket-0.017
anger-0.016
inger-0.016
 pad-0.016
Token with
Token ID351
Feature activation-4.04e-03

Pos logit contributions
 ne+0.007
 subt+0.006
 caring+0.005
bel+0.005
ths+0.005
Neg logit contributions
 eye-0.007
 Sasha-0.006
inger-0.006
ocket-0.006
 pad-0.006
Token a
Token ID257
Feature activation+1.24e-02

Pos logit contributions
 ne+0.006
 subt+0.006
 caring+0.005
 ang+0.005
bel+0.005
Neg logit contributions
 eye-0.008
ocket-0.007
anger-0.007
inger-0.007
 pad-0.007
Token pad
Token ID14841
Feature activation+8.34e-02

Pos logit contributions
 eye+0.020
inger+0.019
 Sasha+0.019
ocket+0.018
anger+0.018
Neg logit contributions
 ne-0.022
 subt-0.020
 caring-0.019
 ang-0.017
 forgiving-0.017
Tokenlock
Token ID5354
Feature activation+1.27e-02

Pos logit contributions
 Sasha+0.108
iya+0.106
 eye+0.094
 Gloria+0.091
inger+0.090
Neg logit contributions
 ne-0.162
bel-0.146
 subt-0.143
 fought-0.133
 tug-0.131
Token on
Token ID319
Feature activation+5.00e-03

Pos logit contributions
 Sasha+0.024
iya+0.024
ulp+0.023
 eye+0.023
ipper+0.022
Neg logit contributions
 ne-0.019
penter-0.014
 subt-0.014
 hon-0.014
cut-0.013
Token it
Token ID340
Feature activation+5.81e-03

Pos logit contributions
 eye+0.008
ocket+0.007
adow+0.007
inger+0.007
ulp+0.007
Neg logit contributions
 ne-0.009
 subt-0.008
 caring-0.008
 ang-0.008
Yeah-0.007
Token.
Token ID13
Feature activation-3.93e-03

Pos logit contributions
 eye+0.011
iya+0.011
inger+0.010
 Sasha+0.010
ipper+0.010
Neg logit contributions
 ne-0.009
 subt-0.007
 caring-0.007
 ang-0.006
ths-0.006
Token Tom
Token ID4186
Feature activation+2.72e-02

Pos logit contributions
 ne+0.006
 subt+0.006
Yeah+0.005
 ang+0.005
 caring+0.005
Neg logit contributions
 eye-0.008
ocket-0.008
inger-0.007
lor-0.007
ulp-0.007
Token dust
Token ID8977
Feature activation-1.28e-02

Pos logit contributions
 ne+0.056
 subt+0.053
 caring+0.049
bel+0.048
ths+0.046
Neg logit contributions
 eye-0.047
ocket-0.041
anger-0.041
 gate-0.040
inger-0.040
Token onto
Token ID4291
Feature activation+1.06e-02

Pos logit contributions
 ne+0.024
 subt+0.022
 caring+0.020
bel+0.020
ths+0.020
Neg logit contributions
 eye-0.021
ocket-0.018
 pad-0.017
 gate-0.017
inger-0.017
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
 eye+0.020
ocket+0.018
anger+0.018
 gate+0.018
 pad+0.018
Neg logit contributions
 ne-0.016
 subt-0.015
 caring-0.014
 ang-0.013
ths-0.013
Token l
Token ID300
Feature activation+5.03e-02

Pos logit contributions
 eye+0.019
ocket+0.018
 Sasha+0.018
inger+0.018
hole+0.017
Neg logit contributions
 ne-0.026
 caring-0.021
 subt-0.021
Yeah-0.020
 ang-0.020
Tokenily
Token ID813
Feature activation+3.76e-02

Pos logit contributions
iya+0.067
 envelope+0.066
 Sandra+0.066
 passports+0.064
 eye+0.064
Neg logit contributions
 ne-0.081
 Sp-0.066
 subt-0.063
 enthusiastically-0.063
bel-0.062
Token pad
Token ID14841
Feature activation+8.34e-02

Pos logit contributions
ipper+0.042
anger+0.042
 eye+0.041
ulp+0.041
ocket+0.040
Neg logit contributions
 ne-0.076
 subt-0.067
bel-0.064
 caring-0.064
cl-0.059
Token and
Token ID290
Feature activation+1.12e-02

Pos logit contributions
 Sasha+0.143
iya+0.141
 eye+0.135
ipper+0.131
 gate+0.129
Neg logit contributions
 ne-0.118
bel-0.113
 subt-0.102
 enthusiastically-0.090
penter-0.090
Token the
Token ID262
Feature activation+3.33e-03

Pos logit contributions
 eye+0.021
ocket+0.019
 gate+0.019
 pad+0.018
anger+0.018
Neg logit contributions
 ne-0.017
 subt-0.016
 caring-0.015
 ang-0.014
ths-0.014
Token pond
Token ID16723
Feature activation+2.54e-02

Pos logit contributions
 eye+0.006
ocket+0.005
anger+0.005
inger+0.005
 gate+0.005
Neg logit contributions
 ne-0.006
 subt-0.005
 caring-0.005
 ang-0.005
Yeah-0.004
Token.
Token ID13
Feature activation+1.45e-03

Pos logit contributions
 eye+0.043
ocket+0.039
anger+0.038
 pad+0.038
inger+0.037
Neg logit contributions
 ne-0.044
 subt-0.040
 caring-0.038
bel-0.037
 ang-0.035
Token
Token ID198
Feature activation+2.47e-03

Pos logit contributions
 eye+0.003
ocket+0.003
 gate+0.002
inger+0.002
anger+0.002
Neg logit contributions
 ne-0.002
 subt-0.002
 caring-0.002
Yeah-0.002
bel-0.002
Token bug
Token ID5434
Feature activation-8.23e-03

Pos logit contributions
ocket+0.015
lor+0.015
 eye+0.015
hole+0.014
inger+0.014
Neg logit contributions
 ne-0.016
 caring-0.013
 subt-0.012
 ang-0.012
 forgiving-0.012
Token on
Token ID319
Feature activation+7.04e-03

Pos logit contributions
 ne+0.014
 subt+0.012
 caring+0.012
 ang+0.011
ths+0.011
Neg logit contributions
 eye-0.014
ocket-0.013
inger-0.013
anger-0.013
 Sasha-0.012
Token a
Token ID257
Feature activation+1.47e-02

Pos logit contributions
 eye+0.012
 gate+0.011
ocket+0.011
anger+0.011
 pad+0.011
Neg logit contributions
 ne-0.012
 subt-0.011
 caring-0.010
ths-0.010
bel-0.010
Token l
Token ID300
Feature activation+4.73e-02

Pos logit contributions
 eye+0.022
ocket+0.022
inger+0.022
anger+0.021
lor+0.021
Neg logit contributions
 ne-0.025
 caring-0.022
 trumpet-0.021
 w-0.020
 ang-0.019
Tokenily
Token ID813
Feature activation+3.28e-02

Pos logit contributions
iya+0.071
 Sandra+0.065
 Sasha+0.064
 doorstep+0.064
hole+0.062
Neg logit contributions
 ne-0.069
years-0.059
 subt-0.056
 Sp-0.055
Deal-0.055
Token pad
Token ID14841
Feature activation+8.30e-02

Pos logit contributions
anger+0.033
inger+0.030
 Sasha+0.029
iya+0.028
lor+0.028
Neg logit contributions
 ne-0.071
 fought-0.065
 subt-0.063
 caring-0.061
 w-0.060
Token.
Token ID13
Feature activation-6.43e-04

Pos logit contributions
 Sasha+0.140
iya+0.140
 eye+0.132
 Sandra+0.127
 Gloria+0.126
Neg logit contributions
 ne-0.119
bel-0.106
 subt-0.102
 fought-0.101
cut-0.092
Token Ho
Token ID9544
Feature activation+1.71e-02

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
 ang+0.001
Yeah+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
inger-0.001
anger-0.001
 gate-0.001
Tokenppy
Token ID14097
Feature activation+1.11e-02

Pos logit contributions
 eye+0.027
 gate+0.023
ocket+0.023
anger+0.023
 pad+0.022
Neg logit contributions
 ne-0.032
 subt-0.030
 caring-0.028
ths-0.027
 ang-0.027
Token tried
Token ID3088
Feature activation+2.57e-02

Pos logit contributions
 eye+0.018
 pad+0.016
 gate+0.016
ocket+0.015
 Sasha+0.015
Neg logit contributions
 subt-0.019
 ne-0.018
 ang-0.017
bel-0.017
 caring-0.016
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
 eye+0.031
 pad+0.025
 gate+0.025
ocket+0.025
 Sasha+0.024
Neg logit contributions
 ne-0.057
 subt-0.056
ths-0.051
 caring-0.051
Yeah-0.050
Token fish
Token ID5916
Feature activation+2.68e-02

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.000
 ang+0.000
ths+0.000
Neg logit contributions
 eye-0.000
ocket-0.000
 gate-0.000
anger-0.000
 pad-0.000
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
 eye+0.049
anger+0.045
ocket+0.044
inger+0.044
ork+0.043
Neg logit contributions
 ne-0.042
 caring-0.036
 subt-0.034
bel-0.032
 forgiving-0.032
Token some
Token ID617
Feature activation+3.03e-02

Pos logit contributions
 eye+0.028
 gate+0.025
ocket+0.025
 pad+0.024
anger+0.024
Neg logit contributions
 ne-0.027
 subt-0.026
 caring-0.024
ths-0.023
 ang-0.023
Token l
Token ID300
Feature activation+4.42e-02

Pos logit contributions
 eye+0.051
ocket+0.044
 gate+0.044
 pad+0.042
anger+0.041
Neg logit contributions
 subt-0.052
 ne-0.052
ths-0.046
 caring-0.046
Yeah-0.045
Tokenily
Token ID813
Feature activation+3.05e-02

Pos logit contributions
 eye+0.051
 doorstep+0.045
 gate+0.045
hole+0.042
 envelope+0.042
Neg logit contributions
 ne-0.094
 subt-0.086
years-0.080
 caring-0.079
bel-0.079
Token pads
Token ID21226
Feature activation+8.20e-02

Pos logit contributions
 eye+0.049
anger+0.045
ocket+0.043
 gate+0.042
inger+0.042
Neg logit contributions
 ne-0.055
 subt-0.049
 caring-0.047
bel-0.044
ths-0.044
Token and
Token ID290
Feature activation+8.32e-03

Pos logit contributions
 eye+0.151
ocket+0.141
iya+0.141
inger+0.141
ulp+0.138
Neg logit contributions
 ne-0.118
bel-0.094
 fought-0.093
 subt-0.093
 caring-0.092
Token even
Token ID772
Feature activation+8.02e-03

Pos logit contributions
 eye+0.013
 pad+0.012
ocket+0.011
 gate+0.011
 Sasha+0.011
Neg logit contributions
 ne-0.015
 subt-0.015
 caring-0.014
ths-0.013
 ang-0.013
Token a
Token ID257
Feature activation+1.07e-02

Pos logit contributions
 eye+0.014
 pad+0.012
 Sasha+0.011
ocket+0.011
 gate+0.011
Neg logit contributions
 subt-0.013
ths-0.013
 ne-0.012
Yeah-0.012
 caring-0.012
Token frog
Token ID21264
Feature activation+9.96e-03

Pos logit contributions
 eye+0.019
ocket+0.016
 gate+0.016
 pad+0.016
anger+0.016
Neg logit contributions
 ne-0.018
 subt-0.017
 caring-0.016
ths-0.015
 ang-0.015
Token.
Token ID13
Feature activation-5.95e-03

Pos logit contributions
 eye+0.017
inger+0.016
ocket+0.015
anger+0.015
 gate+0.015
Neg logit contributions
 ne-0.017
 subt-0.015
 caring-0.014
bel-0.013
Yeah-0.013
Token's
Token ID338
Feature activation+1.63e-02

Pos logit contributions
 eye+0.029
anger+0.027
inger+0.027
 gate+0.027
ocket+0.027
Neg logit contributions
 ne-0.026
 subt-0.024
 caring-0.023
Yeah-0.021
 ang-0.021
Token warmth
Token ID23125
Feature activation+2.54e-02

Pos logit contributions
 eye+0.025
ocket+0.022
 gate+0.022
anger+0.021
 pad+0.021
Neg logit contributions
 ne-0.031
 subt-0.029
 caring-0.028
Yeah-0.026
 ang-0.026
Token and
Token ID290
Feature activation+6.13e-03

Pos logit contributions
inger+0.043
ocket+0.042
 eye+0.041
lor+0.041
 Sasha+0.040
Neg logit contributions
 ne-0.042
 caring-0.034
 subt-0.034
 ang-0.032
Yeah-0.031
Token it
Token ID340
Feature activation-4.14e-03

Pos logit contributions
 eye+0.012
ocket+0.010
 gate+0.010
 pad+0.010
 Sasha+0.010
Neg logit contributions
 ne-0.009
 subt-0.009
 caring-0.008
ths-0.008
 ang-0.008
Token was
Token ID373
Feature activation+7.90e-03

Pos logit contributions
 subt+0.007
 ne+0.007
 caring+0.006
 ang+0.006
ths+0.006
Neg logit contributions
 eye-0.007
ocket-0.006
 gate-0.006
 pad-0.006
anger-0.006
Token able
Token ID1498
Feature activation+8.14e-02

Pos logit contributions
 eye+0.016
ocket+0.014
 gate+0.014
 pad+0.014
anger+0.013
Neg logit contributions
 ne-0.012
 subt-0.011
 caring-0.010
ths-0.009
 ang-0.009
Token to
Token ID284
Feature activation-7.21e-03

Pos logit contributions
 eye+0.143
anger+0.135
ocket+0.134
lor+0.127
inger+0.125
Neg logit contributions
 ne-0.137
 caring-0.128
 subt-0.122
 ang-0.113
bel-0.111
Token move
Token ID1445
Feature activation-1.70e-02

Pos logit contributions
 subt+0.013
 ne+0.012
 caring+0.011
ths+0.011
bel+0.011
Neg logit contributions
 eye-0.012
 gate-0.010
inger-0.010
anger-0.010
ocket-0.009
Token again
Token ID757
Feature activation-7.44e-04

Pos logit contributions
 ne+0.031
 subt+0.029
 caring+0.027
 ang+0.026
ths+0.026
Neg logit contributions
 eye-0.028
ocket-0.025
anger-0.024
 gate-0.024
inger-0.023
Token!
Token ID0
Feature activation-4.33e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
ths+0.001
bel+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
anger-0.001
inger-0.001
lor-0.001
Token
Token ID198
Feature activation+2.21e-03

Pos logit contributions
 subt+0.006
 ne+0.006
Yeah+0.006
 caring+0.005
 Acc+0.005
Neg logit contributions
 eye-0.009
ocket-0.008
 gate-0.008
anger-0.007
 pad-0.007
Token brush
Token ID14093
Feature activation+2.57e-03

Pos logit contributions
 eye+0.026
ocket+0.025
lor+0.024
inger+0.023
anger+0.023
Neg logit contributions
 ne-0.031
 subt-0.029
 caring-0.028
Yeah-0.026
ths-0.025
Token under
Token ID739
Feature activation+9.44e-03

Pos logit contributions
 eye+0.004
ocket+0.004
 Sasha+0.004
inger+0.004
ulp+0.004
Neg logit contributions
 ne-0.004
 subt-0.004
bel-0.003
 caring-0.003
Yeah-0.003
Token the
Token ID262
Feature activation+8.72e-03

Pos logit contributions
 eye+0.020
ocket+0.018
inger+0.018
anger+0.018
adow+0.018
Neg logit contributions
 ne-0.012
 subt-0.011
 caring-0.010
ths-0.009
 forgiving-0.009
Token l
Token ID300
Feature activation+5.14e-02

Pos logit contributions
ocket+0.014
lor+0.014
inger+0.014
adow+0.014
iya+0.013
Neg logit contributions
 ne-0.015
 caring-0.012
 ang-0.012
 trumpet-0.012
 subt-0.012
Tokenily
Token ID813
Feature activation+3.39e-02

Pos logit contributions
iya+0.072
 doorstep+0.069
 eye+0.066
keeping+0.065
 Sandra+0.064
Neg logit contributions
 subt-0.074
 ne-0.069
 Sp-0.068
years-0.059
smart-0.059
Token pad
Token ID14841
Feature activation+8.08e-02

Pos logit contributions
ipper+0.052
ulp+0.050
ocket+0.049
iya+0.049
wen+0.049
Neg logit contributions
 ne-0.047
 subt-0.042
 bit-0.039
 how-0.037
 trucks-0.037
Token in
Token ID287
Feature activation+4.05e-04

Pos logit contributions
iya+0.149
 Sasha+0.148
 eye+0.137
ulp+0.137
ipper+0.136
Neg logit contributions
 ne-0.095
bel-0.086
 subt-0.083
cut-0.075
penter-0.072
Token my
Token ID616
Feature activation+3.48e-02

Pos logit contributions
 eye+0.001
anger+0.001
 pad+0.001
ocket+0.001
inger+0.001
Neg logit contributions
 ne-0.001
 subt-0.000
 caring-0.000
 ang-0.000
 forgiving-0.000
Token nest
Token ID16343
Feature activation+1.94e-02

Pos logit contributions
 eye+0.038
inger+0.038
anger+0.038
ocket+0.037
lor+0.037
Neg logit contributions
 ne-0.074
 subt-0.069
 caring-0.066
 ang-0.061
Yeah-0.061
Token.
Token ID13
Feature activation+5.62e-03

Pos logit contributions
ocket+0.034
 eye+0.034
iya+0.032
 Sasha+0.032
anger+0.031
Neg logit contributions
 ne-0.027
bel-0.024
spe-0.022
 caring-0.022
 subt-0.022
Token You
Token ID921
Feature activation+5.25e-04

Pos logit contributions
 eye+0.009
ocket+0.008
 gate+0.008
anger+0.008
inger+0.008
Neg logit contributions
 ne-0.010
 subt-0.010
 caring-0.009
Yeah-0.008
ths-0.008
Token his
Token ID465
Feature activation-4.06e-02

Pos logit contributions
 ne+0.020
 subt+0.019
 caring+0.017
ths+0.016
 ang+0.016
Neg logit contributions
 eye-0.026
ocket-0.023
 gate-0.022
anger-0.022
 pad-0.022
Token lucky
Token ID9670
Feature activation-2.90e-03

Pos logit contributions
 ne+0.078
 subt+0.076
bel+0.069
 ang+0.068
ths+0.068
Neg logit contributions
 eye-0.064
 gate-0.054
ocket-0.052
 pad-0.050
anger-0.048
Token stars
Token ID5788
Feature activation-2.21e-02

Pos logit contributions
 ne+0.006
 subt+0.005
 caring+0.005
ths+0.005
 ang+0.005
Neg logit contributions
 eye-0.004
ocket-0.004
anger-0.004
 gate-0.004
 pad-0.004
Token he
Token ID339
Feature activation-7.03e-03

Pos logit contributions
 ne+0.039
 subt+0.035
 caring+0.034
bel+0.031
ths+0.031
Neg logit contributions
 eye-0.037
inger-0.033
ocket-0.032
anger-0.032
 gate-0.032
Token was
Token ID373
Feature activation-2.52e-03

Pos logit contributions
 ne+0.011
 subt+0.010
 caring+0.009
ths+0.009
 ang+0.009
Neg logit contributions
 eye-0.013
anger-0.012
inger-0.012
 gate-0.012
ocket-0.011
Token able
Token ID1498
Feature activation+8.07e-02

Pos logit contributions
 ne+0.004
 caring+0.003
 subt+0.003
ths+0.003
bel+0.003
Neg logit contributions
 eye-0.005
anger-0.005
inger-0.004
 gate-0.004
ork-0.004
Token to
Token ID284
Feature activation-5.69e-03

Pos logit contributions
 eye+0.147
anger+0.137
ocket+0.131
inger+0.127
 gate+0.126
Neg logit contributions
 ne-0.130
 caring-0.121
 subt-0.115
bel-0.107
 ang-0.103
Token observe
Token ID12414
Feature activation-4.97e-03

Pos logit contributions
 ne+0.009
 subt+0.008
 caring+0.008
bel+0.007
 ang+0.007
Neg logit contributions
 eye-0.010
anger-0.010
ocket-0.009
 pad-0.009
inger-0.009
Token it
Token ID340
Feature activation-9.34e-03

Pos logit contributions
 ne+0.009
 subt+0.008
 caring+0.008
ths+0.007
 ang+0.007
Neg logit contributions
 eye-0.008
ocket-0.007
inger-0.007
anger-0.007
 pad-0.007
Token.
Token ID13
Feature activation-3.77e-03

Pos logit contributions
 ne+0.016
 subt+0.015
 caring+0.014
ths+0.013
 ang+0.013
Neg logit contributions
 eye-0.016
ocket-0.015
anger-0.014
inger-0.014
 gate-0.014
Token<|endoftext|>
Token ID50256
Feature activation-8.78e-04

Pos logit contributions
 ne+0.006
 subt+0.006
 caring+0.005
Yeah+0.005
 ang+0.005
Neg logit contributions
 eye-0.007
ocket-0.007
 gate-0.006
anger-0.006
inger-0.006
Token big
Token ID1263
Feature activation-1.39e-02

Pos logit contributions
 eye+0.011
 gate+0.010
ocket+0.010
 pad+0.010
anger+0.009
Neg logit contributions
 ne-0.011
 subt-0.011
 caring-0.010
ths-0.010
 ang-0.010
Token box
Token ID3091
Feature activation-9.28e-03

Pos logit contributions
 ne+0.026
 subt+0.024
 caring+0.023
 ang+0.022
ths+0.022
Neg logit contributions
 eye-0.022
ocket-0.019
 gate-0.019
anger-0.019
 pad-0.019
Token,
Token ID11
Feature activation+1.11e-02

Pos logit contributions
 ne+0.016
 subt+0.014
 caring+0.013
bel+0.013
Yeah+0.012
Neg logit contributions
 eye-0.016
inger-0.015
ocket-0.014
 Sasha-0.014
 pad-0.014
Token with
Token ID351
Feature activation-4.39e-03

Pos logit contributions
 eye+0.018
ocket+0.016
anger+0.015
 gate+0.015
inger+0.015
Neg logit contributions
 ne-0.020
 subt-0.019
 caring-0.018
 ang-0.017
Yeah-0.017
Token a
Token ID257
Feature activation+5.10e-03

Pos logit contributions
 ne+0.007
 subt+0.006
 caring+0.006
 ang+0.005
bel+0.005
Neg logit contributions
 eye-0.008
ocket-0.008
anger-0.007
inger-0.007
 pad-0.007
Token pad
Token ID14841
Feature activation+8.05e-02

Pos logit contributions
 eye+0.009
inger+0.008
ocket+0.008
 Sasha+0.008
anger+0.008
Neg logit contributions
 ne-0.008
 subt-0.008
 caring-0.007
 ang-0.007
Yeah-0.007
Tokenlock
Token ID5354
Feature activation+9.45e-03

Pos logit contributions
 Sasha+0.105
iya+0.103
 eye+0.095
inger+0.095
ubs+0.090
Neg logit contributions
 ne-0.155
 subt-0.141
bel-0.139
 fought-0.125
penter-0.124
Token on
Token ID319
Feature activation-1.57e-03

Pos logit contributions
 Sasha+0.019
iya+0.018
ulp+0.017
 eye+0.017
ipper+0.017
Neg logit contributions
 ne-0.014
 subt-0.010
bel-0.010
penter-0.010
cl-0.009
Token it
Token ID340
Feature activation+1.83e-03

Pos logit contributions
 ne+0.003
 subt+0.003
 caring+0.002
ths+0.002
Yeah+0.002
Neg logit contributions
 eye-0.003
ocket-0.002
ipper-0.002
inger-0.002
adow-0.002
Token.
Token ID13
Feature activation-5.20e-03

Pos logit contributions
 eye+0.003
inger+0.003
iya+0.003
ocket+0.003
ipper+0.003
Neg logit contributions
 ne-0.003
 subt-0.003
 caring-0.002
bel-0.002
ths-0.002
Token The
Token ID383
Feature activation-4.68e-03

Pos logit contributions
 ne+0.010
 subt+0.009
 caring+0.009
Yeah+0.008
bel+0.008
Neg logit contributions
 eye-0.008
ocket-0.007
 pad-0.007
 gate-0.007
inger-0.007
Token was
Token ID373
Feature activation+1.07e-02

Pos logit contributions
 subt+0.001
 ne+0.001
 ang+0.000
 caring+0.000
bel+0.000
Neg logit contributions
 eye-0.001
ocket-0.000
 gate-0.000
 pad-0.000
 Sasha-0.000
Token happy
Token ID3772
Feature activation+1.85e-02

Pos logit contributions
 eye+0.020
ocket+0.017
 pad+0.017
 gate+0.017
 Sasha+0.017
Neg logit contributions
 subt-0.017
 ne-0.017
bel-0.014
 caring-0.014
Yeah-0.014
Token that
Token ID326
Feature activation+2.00e-03

Pos logit contributions
 eye+0.030
ocket+0.027
anger+0.026
 gate+0.026
inger+0.026
Neg logit contributions
 ne-0.034
 subt-0.032
 caring-0.030
ths-0.029
Yeah-0.028
Token it
Token ID340
Feature activation+3.36e-03

Pos logit contributions
 eye+0.003
ocket+0.003
 gate+0.003
anger+0.003
 pad+0.003
Neg logit contributions
 ne-0.004
 subt-0.003
 caring-0.003
 ang-0.003
ths-0.003
Token was
Token ID373
Feature activation+4.35e-03

Pos logit contributions
 eye+0.006
ocket+0.005
 pad+0.005
 gate+0.005
anger+0.005
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.005
 ang-0.005
ths-0.005
Token able
Token ID1498
Feature activation+8.02e-02

Pos logit contributions
 eye+0.008
anger+0.008
ocket+0.007
 gate+0.007
 pad+0.007
Neg logit contributions
 ne-0.007
 subt-0.006
 caring-0.006
 ang-0.005
ths-0.005
Token to
Token ID284
Feature activation-4.27e-03

Pos logit contributions
 eye+0.146
anger+0.141
ocket+0.138
lor+0.130
inger+0.128
Neg logit contributions
 ne-0.129
 caring-0.124
 subt-0.114
 ang-0.108
 forgiving-0.104
Token help
Token ID1037
Feature activation-3.60e-03

Pos logit contributions
 ne+0.007
 subt+0.007
 caring+0.006
 ang+0.006
ths+0.006
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
anger-0.007
 pad-0.007
Token out
Token ID503
Feature activation-8.80e-03

Pos logit contributions
 ne+0.005
 caring+0.005
 subt+0.005
 ang+0.004
 forgiving+0.004
Neg logit contributions
 eye-0.007
anger-0.007
ocket-0.006
inger-0.006
 pad-0.006
Token the
Token ID262
Feature activation-4.40e-03

Pos logit contributions
 ne+0.016
 subt+0.015
 caring+0.014
 ang+0.014
ths+0.014
Neg logit contributions
 eye-0.014
ocket-0.012
 gate-0.012
anger-0.012
 pad-0.012
Token small
Token ID1402
Feature activation+1.00e-02

Pos logit contributions
 ne+0.008
 subt+0.008
 caring+0.008
 ang+0.007
ths+0.007
Neg logit contributions
 eye-0.007
ocket-0.006
anger-0.006
 gate-0.006
inger-0.006
Token of
Token ID286
Feature activation-4.75e-03

Pos logit contributions
 ne+0.003
 caring+0.003
 subt+0.003
bel+0.003
 ang+0.003
Neg logit contributions
 eye-0.003
ocket-0.003
anger-0.003
inger-0.003
 pad-0.003
Token wood
Token ID4898
Feature activation-8.89e-03

Pos logit contributions
 ne+0.008
 subt+0.008
 caring+0.007
 ang+0.007
ths+0.007
Neg logit contributions
 eye-0.008
ocket-0.007
anger-0.007
 pad-0.007
inger-0.007
Token and
Token ID290
Feature activation+2.73e-03

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.015
 ang+0.014
ths+0.014
Neg logit contributions
 eye-0.014
ocket-0.012
 gate-0.012
anger-0.012
 pad-0.012
Token it
Token ID340
Feature activation-1.01e-02

Pos logit contributions
 eye+0.005
 gate+0.004
ocket+0.004
 pad+0.004
anger+0.004
Neg logit contributions
 ne-0.004
 subt-0.004
 caring-0.004
 ang-0.004
ths-0.004
Token was
Token ID373
Feature activation+3.70e-03

Pos logit contributions
 subt+0.016
 ne+0.015
 caring+0.014
 ang+0.013
Yeah+0.013
Neg logit contributions
 eye-0.019
 gate-0.016
ocket-0.016
anger-0.016
 pad-0.016
Token able
Token ID1498
Feature activation+7.97e-02

Pos logit contributions
 eye+0.007
ocket+0.006
 pad+0.006
 gate+0.006
inger+0.006
Neg logit contributions
 subt-0.005
 ne-0.005
bel-0.005
ths-0.005
Yeah-0.005
Token to
Token ID284
Feature activation-8.23e-03

Pos logit contributions
 eye+0.144
ocket+0.135
anger+0.134
lor+0.124
inger+0.123
Neg logit contributions
 ne-0.134
 caring-0.124
 subt-0.118
 ang-0.110
bel-0.108
Token fix
Token ID4259
Feature activation-2.81e-02

Pos logit contributions
 subt+0.013
 ne+0.013
 caring+0.011
 ang+0.011
ths+0.011
Neg logit contributions
 eye-0.016
 gate-0.013
anger-0.013
ocket-0.012
ork-0.012
Token itself
Token ID2346
Feature activation-1.90e-02

Pos logit contributions
 ne+0.051
 subt+0.048
 caring+0.045
 ang+0.043
ths+0.043
Neg logit contributions
 eye-0.046
ocket-0.040
anger-0.039
 pad-0.039
 gate-0.039
Token.
Token ID13
Feature activation-6.23e-03

Pos logit contributions
 ne+0.032
 subt+0.029
 caring+0.028
 ang+0.026
bel+0.026
Neg logit contributions
 eye-0.033
ocket-0.030
anger-0.029
 gate-0.029
 Sasha-0.029
Token
Token ID220
Feature activation-1.09e-02

Pos logit contributions
 ne+0.009
 subt+0.009
Yeah+0.008
 caring+0.008
ths+0.007
Neg logit contributions
 eye-0.012
ocket-0.011
 gate-0.011
anger-0.010
inger-0.010
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 ne+0.017
 subt+0.017
 caring+0.016
ths+0.015
 ang+0.015
Neg logit contributions
 eye-0.013
ocket-0.012
 gate-0.011
anger-0.011
 pad-0.011
Token strong
Token ID1913
Feature activation-1.40e-02

Pos logit contributions
 ne+0.026
 subt+0.026
Yeah+0.022
 ang+0.022
bel+0.022
Neg logit contributions
 eye-0.031
 gate-0.028
ocket-0.026
 pad-0.026
hole-0.026
Token that
Token ID326
Feature activation-2.61e-03

Pos logit contributions
 ne+0.026
 subt+0.026
 caring+0.024
 ang+0.023
ths+0.023
Neg logit contributions
 eye-0.021
 gate-0.018
ocket-0.018
anger-0.018
 pad-0.018
Token it
Token ID340
Feature activation-4.86e-04

Pos logit contributions
 ne+0.005
 subt+0.004
 caring+0.004
ths+0.004
 ang+0.004
Neg logit contributions
 eye-0.004
ocket-0.004
anger-0.004
inger-0.004
 gate-0.004
Token was
Token ID373
Feature activation+3.75e-03

Pos logit contributions
 subt+0.001
 ne+0.001
 caring+0.001
 ang+0.001
Yeah+0.001
Neg logit contributions
 eye-0.001
 gate-0.001
ocket-0.001
anger-0.001
 pad-0.001
Token able
Token ID1498
Feature activation+7.97e-02

Pos logit contributions
 eye+0.007
 gate+0.006
ocket+0.006
 pad+0.006
inger+0.006
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.005
Yeah-0.005
bel-0.005
Token to
Token ID284
Feature activation-6.45e-03

Pos logit contributions
 eye+0.138
ocket+0.128
anger+0.125
inger+0.119
lor+0.118
Neg logit contributions
 ne-0.138
 caring-0.128
 subt-0.125
 ang-0.114
bel-0.112
Token make
Token ID787
Feature activation+6.22e-03

Pos logit contributions
 subt+0.011
 ne+0.011
 caring+0.010
ths+0.009
 ang+0.009
Neg logit contributions
 eye-0.011
 gate-0.010
ocket-0.009
anger-0.009
inger-0.009
Token it
Token ID340
Feature activation-1.55e-02

Pos logit contributions
 eye+0.011
ocket+0.010
 pad+0.010
anger+0.010
 gate+0.010
Neg logit contributions
 ne-0.010
 subt-0.010
 caring-0.009
 ang-0.009
ths-0.008
Token the
Token ID262
Feature activation+1.09e-03

Pos logit contributions
 subt+0.025
 ne+0.025
 caring+0.022
Yeah+0.021
ths+0.021
Neg logit contributions
 eye-0.028
 gate-0.024
ocket-0.024
anger-0.023
 pad-0.023
Token whole
Token ID2187
Feature activation+1.36e-02

Pos logit contributions
 eye+0.002
ocket+0.002
 gate+0.002
anger+0.002
 pad+0.001
Neg logit contributions
 ne-0.002
 subt-0.002
 caring-0.002
 ang-0.002
ths-0.002
Token very
Token ID845
Feature activation+8.08e-03

Pos logit contributions
 eye+0.009
ocket+0.008
 gate+0.008
 pad+0.008
inger+0.008
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.005
ths-0.005
Yeah-0.005
Token happy
Token ID3772
Feature activation+1.24e-02

Pos logit contributions
 eye+0.016
 gate+0.014
ocket+0.014
inger+0.014
 Sasha+0.014
Neg logit contributions
 subt-0.012
 ne-0.011
ths-0.010
Yeah-0.009
 cartoons-0.009
Token that
Token ID326
Feature activation-1.61e-03

Pos logit contributions
 eye+0.019
ocket+0.017
anger+0.016
inger+0.016
 gate+0.016
Neg logit contributions
 ne-0.024
 subt-0.022
 caring-0.021
ths-0.020
Yeah-0.020
Token it
Token ID340
Feature activation-2.47e-04

Pos logit contributions
 ne+0.003
 subt+0.002
 caring+0.002
ths+0.002
 ang+0.002
Neg logit contributions
 eye-0.003
ocket-0.003
anger-0.003
 gate-0.003
 pad-0.003
Token was
Token ID373
Feature activation+3.34e-03

Pos logit contributions
 ne+0.000
 subt+0.000
 caring+0.000
 ang+0.000
ths+0.000
Neg logit contributions
 eye-0.000
ocket-0.000
 gate-0.000
 pad-0.000
inger-0.000
Token able
Token ID1498
Feature activation+7.93e-02

Pos logit contributions
 eye+0.006
ocket+0.005
anger+0.005
 gate+0.005
 pad+0.005
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.005
 ang-0.004
ths-0.004
Token to
Token ID284
Feature activation-4.29e-03

Pos logit contributions
 eye+0.141
anger+0.133
ocket+0.133
lor+0.124
inger+0.121
Neg logit contributions
 ne-0.131
 caring-0.125
 subt-0.115
 ang-0.110
bel-0.108
Token help
Token ID1037
Feature activation-1.94e-03

Pos logit contributions
 ne+0.007
 subt+0.006
 caring+0.006
 ang+0.005
ths+0.005
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
anger-0.007
 pad-0.007
Token in
Token ID287
Feature activation-6.97e-03

Pos logit contributions
 ne+0.003
 caring+0.003
 subt+0.002
 forgiving+0.002
 ang+0.002
Neg logit contributions
 eye-0.004
ocket-0.004
anger-0.004
 pad-0.004
lor-0.004
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
 ne+0.010
 caring+0.009
 subt+0.009
 ang+0.009
 forgiving+0.008
Neg logit contributions
 eye-0.013
anger-0.013
ocket-0.012
inger-0.012
 pad-0.012
Token emergency
Token ID6334
Feature activation+2.94e-03

Pos logit contributions
 eye+0.010
anger+0.010
ocket+0.009
inger+0.009
hole+0.009
Neg logit contributions
 ne-0.024
 subt-0.023
 caring-0.023
 ang-0.021
ths-0.021
Token key
Token ID1994
Feature activation+4.22e-03

Pos logit contributions
lor+0.004
 eye+0.004
ipper+0.004
inger+0.004
ulp+0.004
Neg logit contributions
 ne-0.006
 caring-0.005
 ang-0.005
 subt-0.005
nam-0.005
Token.
Token ID13
Feature activation-1.78e-03

Pos logit contributions
iya+0.008
 Sasha+0.008
 eye+0.008
ulp+0.008
inger+0.007
Neg logit contributions
 ne-0.006
 subt-0.005
bel-0.005
spe-0.005
ths-0.004
Token She
Token ID1375
Feature activation-1.34e-02

Pos logit contributions
 subt+0.003
 ne+0.002
 caring+0.002
Yeah+0.002
 subtract+0.002
Neg logit contributions
 eye-0.003
ocket-0.003
inger-0.003
ipper-0.003
anger-0.003
Token unlocked
Token ID14838
Feature activation-1.56e-02

Pos logit contributions
 ne+0.021
 subt+0.020
 caring+0.018
ths+0.017
 ang+0.017
Neg logit contributions
 eye-0.025
ocket-0.022
 gate-0.022
anger-0.022
 pad-0.022
Token the
Token ID262
Feature activation+3.11e-03

Pos logit contributions
 ne+0.016
 how+0.012
 caring+0.011
 subt+0.011
 ang+0.011
Neg logit contributions
 eye-0.037
 pad-0.036
ipper-0.036
ocket-0.036
anger-0.035
Token pad
Token ID14841
Feature activation+7.92e-02

Pos logit contributions
 eye+0.003
ipper+0.003
 Sasha+0.003
inger+0.003
anger+0.003
Neg logit contributions
 ne-0.007
 subt-0.006
 caring-0.006
 ang-0.006
ths-0.006
Tokenlock
Token ID5354
Feature activation+1.72e-02

Pos logit contributions
iya+0.136
 Sasha+0.130
ipper+0.127
ulp+0.127
 eye+0.125
Neg logit contributions
 ne-0.114
bel-0.106
 subt-0.100
ths-0.096
penter-0.095
Token and
Token ID290
Feature activation-3.04e-03

Pos logit contributions
iya+0.036
ipper+0.035
 Sasha+0.034
ulp+0.033
inger+0.033
Neg logit contributions
 ne-0.023
penter-0.016
bel-0.016
 subt-0.015
 enthusiastically-0.015
Token opened
Token ID4721
Feature activation-2.36e-02

Pos logit contributions
 ne+0.005
 subt+0.005
bel+0.004
 caring+0.004
ths+0.004
Neg logit contributions
 eye-0.005
inger-0.005
anger-0.005
 pad-0.005
ocket-0.005
Token the
Token ID262
Feature activation-3.18e-03

Pos logit contributions
 ne+0.030
 how+0.023
 subt+0.022
 caring+0.021
ths+0.021
Neg logit contributions
 eye-0.051
ocket-0.049
inger-0.047
 pad-0.047
ipper-0.047
Token door
Token ID3420
Feature activation-4.00e-03

Pos logit contributions
 ne+0.007
 ang+0.006
 subt+0.006
 caring+0.006
ths+0.006
Neg logit contributions
ipper-0.004
ubs-0.004
 Sasha-0.003
inger-0.003
iya-0.003
Token sitting
Token ID5586
Feature activation-1.68e-02

Pos logit contributions
 eye+0.013
anger+0.012
inger+0.012
ocket+0.012
ork+0.011
Neg logit contributions
 ne-0.012
 subt-0.010
 caring-0.010
 ang-0.009
 forgiving-0.009
Token on
Token ID319
Feature activation+5.29e-03

Pos logit contributions
 ne+0.028
 subt+0.024
 ang+0.024
 caring+0.023
ths+0.023
Neg logit contributions
 eye-0.029
anger-0.027
ocket-0.026
ork-0.026
 gate-0.025
Token a
Token ID257
Feature activation+9.70e-03

Pos logit contributions
 eye+0.011
ocket+0.010
anger+0.009
inger+0.009
 pad+0.009
Neg logit contributions
 ne-0.008
 subt-0.007
 caring-0.007
 ang-0.006
Yeah-0.006
Token l
Token ID300
Feature activation+3.98e-02

Pos logit contributions
 eye+0.014
ocket+0.014
anger+0.013
inger+0.013
hole+0.012
Neg logit contributions
 ne-0.018
 caring-0.016
 ang-0.015
 trumpet-0.014
 subt-0.014
Tokenily
Token ID813
Feature activation+2.28e-02

Pos logit contributions
iya+0.054
 doorstep+0.051
 Sandra+0.050
 passport+0.050
 envelope+0.050
Neg logit contributions
 ne-0.066
 subt-0.054
years-0.054
 enthusiastically-0.051
igator-0.051
Token pad
Token ID14841
Feature activation+7.88e-02

Pos logit contributions
anger+0.019
 eye+0.018
inger+0.017
ocket+0.016
ulp+0.016
Neg logit contributions
 ne-0.056
 subt-0.050
 caring-0.047
 fought-0.047
ths-0.045
Token.
Token ID13
Feature activation-5.84e-04

Pos logit contributions
 eye+0.133
iya+0.133
 Sasha+0.130
 gate+0.123
ocket+0.122
Neg logit contributions
 ne-0.113
 subt-0.095
bel-0.095
 fought-0.086
cut-0.084
Token
Token ID220
Feature activation-9.41e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
Yeah+0.001
 ang+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
anger-0.001
inger-0.001
 gate-0.001
Token
Token ID198
Feature activation+3.63e-03

Pos logit contributions
 ne+0.013
 subt+0.012
 caring+0.011
bel+0.011
 ang+0.011
Neg logit contributions
 eye-0.019
ocket-0.017
 envelope-0.017
 gate-0.017
iya-0.016
Token
Token ID198
Feature activation-2.56e-03

Pos logit contributions
 eye+0.007
ocket+0.006
anger+0.006
 gate+0.006
ulp+0.006
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.005
Yeah-0.005
bel-0.005
Token"
Token ID1
Feature activation-1.11e-02

Pos logit contributions
 ne+0.005
 subt+0.004
 caring+0.004
 ang+0.004
ths+0.004
Neg logit contributions
 eye-0.004
ocket-0.004
 gate-0.004
 pad-0.004
anger-0.004
Token was
Token ID373
Feature activation+2.91e-03

Pos logit contributions
 ne+0.003
 subt+0.003
 ang+0.003
 caring+0.003
bel+0.003
Neg logit contributions
 eye-0.004
 pad-0.003
ocket-0.003
 gate-0.003
 Sasha-0.003
Token happy
Token ID3772
Feature activation+1.53e-02

Pos logit contributions
 eye+0.006
ocket+0.005
 pad+0.005
 gate+0.005
 Sasha+0.005
Neg logit contributions
 ne-0.004
 subt-0.004
ths-0.004
bel-0.003
 caring-0.003
Token that
Token ID326
Feature activation-9.61e-06

Pos logit contributions
 eye+0.025
ocket+0.022
 gate+0.021
anger+0.021
 pad+0.021
Neg logit contributions
 ne-0.028
 subt-0.027
 caring-0.025
 ang-0.024
ths-0.023
Token everyone
Token ID2506
Feature activation+9.08e-03

Feature not active on prev token;
no predictions were affected.
Token was
Token ID373
Feature activation-2.54e-03

Pos logit contributions
 eye+0.016
ocket+0.014
 pad+0.013
 gate+0.013
anger+0.013
Neg logit contributions
 ne-0.015
 subt-0.015
 caring-0.013
 ang-0.013
ths-0.013
Token able
Token ID1498
Feature activation+7.88e-02

Pos logit contributions
 ne+0.003
 subt+0.003
 caring+0.003
 ang+0.003
ths+0.003
Neg logit contributions
 eye-0.006
anger-0.005
ocket-0.005
 gate-0.005
 pad-0.005
Token to
Token ID284
Feature activation-6.16e-03

Pos logit contributions
 eye+0.136
anger+0.126
ocket+0.124
inger+0.118
 gate+0.117
Neg logit contributions
 ne-0.136
 caring-0.127
 subt-0.125
bel-0.114
 ang-0.113
Token enjoy
Token ID2883
Feature activation+1.79e-02

Pos logit contributions
 ne+0.010
 subt+0.010
 caring+0.009
ths+0.008
 ang+0.008
Neg logit contributions
 eye-0.011
ocket-0.010
 gate-0.010
anger-0.009
 pad-0.009
Token their
Token ID511
Feature activation-1.36e-02

Pos logit contributions
 eye+0.031
anger+0.028
ocket+0.027
inger+0.027
 gate+0.027
Neg logit contributions
 ne-0.030
 subt-0.029
 caring-0.027
 ang-0.025
ths-0.025
Token meal
Token ID9799
Feature activation-2.88e-03

Pos logit contributions
 ne+0.027
 subt+0.026
ths+0.024
 ang+0.024
bel+0.024
Neg logit contributions
 eye-0.020
ocket-0.016
ulp-0.015
 gate-0.015
 pad-0.015
Token together
Token ID1978
Feature activation-1.23e-02

Pos logit contributions
 ne+0.005
 subt+0.005
 caring+0.004
 ang+0.004
ths+0.004
Neg logit contributions
 eye-0.005
ocket-0.005
 gate-0.004
anger-0.004
 pad-0.004
Token happy
Token ID3772
Feature activation+1.34e-02

Pos logit contributions
 eye+0.018
ocket+0.016
adow+0.015
 Sasha+0.015
ulp+0.015
Neg logit contributions
 subt-0.016
 ne-0.015
ths-0.014
 trucks-0.013
 robots-0.013
Token that
Token ID326
Feature activation-2.82e-03

Pos logit contributions
 eye+0.023
ocket+0.021
inger+0.020
anger+0.020
 gate+0.020
Neg logit contributions
 ne-0.023
 subt-0.022
 caring-0.020
ths-0.019
Yeah-0.019
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 ne+0.004
 subt+0.003
 caring+0.003
ths+0.003
 ang+0.003
Neg logit contributions
 eye-0.006
ocket-0.005
 gate-0.005
 pad-0.005
anger-0.005
Token flower
Token ID15061
Feature activation+2.71e-02

Pos logit contributions
 eye+0.019
anger+0.018
ocket+0.017
 pad+0.017
inger+0.017
Neg logit contributions
 ne-0.017
 caring-0.016
 subt-0.015
 forgiving-0.014
 ang-0.014
Token was
Token ID373
Feature activation+1.65e-03

Pos logit contributions
 eye+0.050
ocket+0.045
 gate+0.044
anger+0.044
 pad+0.043
Neg logit contributions
 ne-0.043
 subt-0.040
 caring-0.037
ths-0.035
 ang-0.035
Token able
Token ID1498
Feature activation+7.87e-02

Pos logit contributions
 eye+0.003
ocket+0.003
anger+0.003
 gate+0.003
 pad+0.003
Neg logit contributions
 ne-0.002
 subt-0.002
 caring-0.002
 ang-0.002
ths-0.002
Token to
Token ID284
Feature activation-5.32e-03

Pos logit contributions
 eye+0.137
anger+0.129
ocket+0.126
inger+0.119
 gate+0.118
Neg logit contributions
 ne-0.136
 caring-0.127
 subt-0.121
 ang-0.112
 forgiving-0.109
Token help
Token ID1037
Feature activation-1.13e-03

Pos logit contributions
 ne+0.007
 subt+0.007
 caring+0.006
 ang+0.006
ths+0.006
Neg logit contributions
 eye-0.011
ocket-0.010
 gate-0.010
anger-0.009
 pad-0.009
Token heal
Token ID12035
Feature activation-1.79e-02

Pos logit contributions
 ne+0.002
 caring+0.002
 subt+0.002
 forgiving+0.001
 ang+0.001
Neg logit contributions
 eye-0.002
anger-0.002
ocket-0.002
 pad-0.002
inger-0.002
Token the
Token ID262
Feature activation-2.11e-03

Pos logit contributions
 ne+0.033
 subt+0.031
 caring+0.029
ths+0.028
 ang+0.028
Neg logit contributions
 eye-0.029
ocket-0.025
 gate-0.025
anger-0.024
 pad-0.024
Token old
Token ID1468
Feature activation+4.93e-03

Pos logit contributions
 ne+0.003
 caring+0.003
 subt+0.003
 forgiving+0.003
Yeah+0.003
Neg logit contributions
 eye-0.004
anger-0.004
lor-0.004
ocket-0.004
inger-0.004
Token the
Token ID262
Feature activation+1.99e-04

Pos logit contributions
 ne+0.003
 subt+0.003
 caring+0.003
ths+0.003
 ang+0.003
Neg logit contributions
 eye-0.004
 gate-0.003
 Sasha-0.003
ocket-0.003
anger-0.003
Token wood
Token ID4898
Feature activation-8.04e-04

Pos logit contributions
 eye+0.000
 gate+0.000
 pad+0.000
ocket+0.000
 envelope+0.000
Neg logit contributions
 subt-0.000
 ne-0.000
bel-0.000
Yeah-0.000
 caring-0.000
Tokencut
Token ID8968
Feature activation-1.98e-02

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
 ang+0.001
Yeah+0.001
Neg logit contributions
 eye-0.002
ocket-0.001
 gate-0.001
anger-0.001
 pad-0.001
Tokenter
Token ID353
Feature activation+1.64e-02

Pos logit contributions
 ne+0.033
 subt+0.031
 caring+0.029
 ang+0.027
ths+0.027
Neg logit contributions
 eye-0.035
ocket-0.031
 gate-0.031
anger-0.030
 pad-0.030
Token was
Token ID373
Feature activation+3.35e-03

Pos logit contributions
 eye+0.029
 pad+0.026
ocket+0.025
 gate+0.025
anger+0.025
Neg logit contributions
 ne-0.027
 subt-0.026
 caring-0.024
Yeah-0.022
 ang-0.022
Token able
Token ID1498
Feature activation+7.85e-02

Pos logit contributions
 eye+0.006
ocket+0.006
 gate+0.005
 pad+0.005
inger+0.005
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.004
Yeah-0.004
ths-0.004
Token to
Token ID284
Feature activation-6.62e-03

Pos logit contributions
 eye+0.121
ocket+0.108
anger+0.107
 gate+0.101
inger+0.100
Neg logit contributions
 ne-0.152
 subt-0.140
 caring-0.137
 ang-0.130
bel-0.127
Token part
Token ID636
Feature activation+1.17e-02

Pos logit contributions
 subt+0.011
 ne+0.011
 caring+0.010
Yeah+0.009
ths+0.009
Neg logit contributions
 eye-0.012
 gate-0.010
ocket-0.010
inger-0.010
 pad-0.010
Token the
Token ID262
Feature activation+8.20e-03

Pos logit contributions
 eye+0.021
 gate+0.018
ocket+0.018
anger+0.018
 pad+0.017
Neg logit contributions
 ne-0.019
 subt-0.019
 caring-0.017
Yeah-0.016
ths-0.016
Token tall
Token ID7331
Feature activation+5.01e-03

Pos logit contributions
 eye+0.013
 gate+0.011
 pad+0.011
ocket+0.011
inger+0.010
Neg logit contributions
 ne-0.015
 subt-0.015
 caring-0.014
Yeah-0.013
bel-0.013
Token trees
Token ID7150
Feature activation+1.83e-02

Pos logit contributions
 eye+0.007
 gate+0.006
ocket+0.006
 pad+0.006
anger+0.006
Neg logit contributions
 ne-0.010
 subt-0.010
 caring-0.009
Yeah-0.009
 ang-0.009
Token top
Token ID1353
Feature activation-1.25e-02

Pos logit contributions
 ne+0.012
 subt+0.011
 caring+0.010
 ang+0.010
ths+0.010
Neg logit contributions
 eye-0.014
ocket-0.012
 gate-0.012
anger-0.012
 pad-0.012
Token of
Token ID286
Feature activation-8.67e-04

Pos logit contributions
 ne+0.025
 subt+0.022
bel+0.021
 caring+0.020
ths+0.020
Neg logit contributions
 eye-0.017
ocket-0.017
anger-0.015
inger-0.015
 Sasha-0.015
Token a
Token ID257
Feature activation+2.23e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
ths+0.001
bel+0.001
Neg logit contributions
 eye-0.002
ocket-0.001
 gate-0.001
 pad-0.001
anger-0.001
Token l
Token ID300
Feature activation+3.94e-02

Pos logit contributions
 eye+0.004
ocket+0.003
 gate+0.003
anger+0.003
 pad+0.003
Neg logit contributions
 ne-0.004
 subt-0.004
 caring-0.004
 ang-0.004
ths-0.004
Tokenilly
Token ID6548
Feature activation+1.73e-02

Pos logit contributions
 eye+0.055
iya+0.054
 doorstep+0.053
 envelope+0.053
 gate+0.053
Neg logit contributions
 ne-0.062
Deal-0.054
years-0.053
 subt-0.051
igator-0.049
Token pad
Token ID14841
Feature activation+7.83e-02

Pos logit contributions
 eye+0.031
inger+0.029
anger+0.029
ocket+0.028
ipper+0.027
Neg logit contributions
 ne-0.028
 caring-0.023
 subt-0.022
Yeah-0.020
 w-0.020
Token.
Token ID13
Feature activation+2.94e-03

Pos logit contributions
 eye+0.130
 Sasha+0.127
iya+0.123
 gate+0.120
inger+0.118
Neg logit contributions
 ne-0.118
 subt-0.106
bel-0.105
 fought-0.096
 caring-0.094
Token One
Token ID1881
Feature activation+2.29e-02

Pos logit contributions
 eye+0.006
ocket+0.006
anger+0.005
inger+0.005
 gate+0.005
Neg logit contributions
 ne-0.004
 subt-0.004
 caring-0.003
Yeah-0.003
 ang-0.003
Token day
Token ID1110
Feature activation-2.20e-02

Pos logit contributions
 eye+0.039
ocket+0.036
anger+0.036
 gate+0.035
inger+0.034
Neg logit contributions
 ne-0.039
 subt-0.035
 caring-0.034
 ang-0.032
Yeah-0.031
Token however
Token ID2158
Feature activation-2.69e-02

Pos logit contributions
 ne+0.038
 subt+0.038
 caring+0.035
ths+0.033
 ang+0.033
Neg logit contributions
 eye-0.036
ocket-0.031
 gate-0.031
anger-0.031
 pad-0.031
Token a
Token ID257
Feature activation-2.83e-03

Pos logit contributions
 subt+0.063
bel+0.056
Yeah+0.056
 ne+0.055
ths+0.054
Neg logit contributions
 eye-0.029
 Sasha-0.024
 gate-0.024
 net-0.023
 pad-0.023
Token paints
Token ID32281
Feature activation-6.59e-03

Pos logit contributions
 ne+0.044
 subt+0.042
 caring+0.039
 ang+0.037
ths+0.037
Neg logit contributions
 eye-0.036
ocket-0.032
 gate-0.031
anger-0.031
 pad-0.031
Token,
Token ID11
Feature activation+2.29e-02

Pos logit contributions
 ne+0.012
 subt+0.011
 caring+0.010
bel+0.010
ths+0.010
Neg logit contributions
 eye-0.010
ocket-0.009
inger-0.009
 Sasha-0.009
ulp-0.009
Token and
Token ID290
Feature activation+6.99e-03

Pos logit contributions
 eye+0.033
ocket+0.029
 gate+0.028
anger+0.028
 pad+0.028
Neg logit contributions
 ne-0.045
 subt-0.043
 caring-0.040
 ang-0.039
ths-0.039
Token a
Token ID257
Feature activation+2.59e-03

Pos logit contributions
 eye+0.012
ocket+0.011
 gate+0.011
 pad+0.011
anger+0.010
Neg logit contributions
 ne-0.012
 subt-0.011
 caring-0.010
 ang-0.010
ths-0.010
Token huge
Token ID3236
Feature activation-7.96e-03

Pos logit contributions
 eye+0.004
ocket+0.003
 gate+0.003
anger+0.003
 pad+0.003
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.004
 ang-0.004
ths-0.004
Token pad
Token ID14841
Feature activation+7.79e-02

Pos logit contributions
 ne+0.015
 subt+0.015
 caring+0.014
 ang+0.013
ths+0.013
Neg logit contributions
 eye-0.012
ocket-0.010
 gate-0.010
anger-0.010
 pad-0.010
Token of
Token ID286
Feature activation-1.54e-02

Pos logit contributions
 Sasha+0.104
 eye+0.100
iya+0.100
ulp+0.094
inger+0.094
Neg logit contributions
 ne-0.146
bel-0.141
 subt-0.140
penter-0.121
ths-0.121
Token paper
Token ID3348
Feature activation+5.43e-03

Pos logit contributions
 ne+0.027
 subt+0.026
 caring+0.024
ths+0.023
 ang+0.023
Neg logit contributions
 eye-0.026
ocket-0.023
 gate-0.022
 pad-0.022
anger-0.021
Token.
Token ID13
Feature activation+2.33e-03

Pos logit contributions
 eye+0.009
inger+0.008
 Sasha+0.008
ocket+0.008
ulp+0.008
Neg logit contributions
 ne-0.009
 subt-0.009
 caring-0.008
bel-0.008
ths-0.007
Token She
Token ID1375
Feature activation-8.26e-03

Pos logit contributions
 eye+0.004
ocket+0.004
inger+0.004
ipper+0.004
 gate+0.004
Neg logit contributions
 subt-0.004
 ne-0.004
 caring-0.003
Yeah-0.003
 Sp-0.003
Token couldn
Token ID3521
Feature activation-6.29e-04

Pos logit contributions
 ne+0.014
 ang+0.014
 subt+0.014
 caring+0.013
 forgiving+0.012
Neg logit contributions
 eye-0.014
ocket-0.012
 pad-0.012
 gate-0.011
ork-0.011
Token John
Token ID1757
Feature activation+1.30e-02

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.014
Yeah+0.014
ths+0.013
Neg logit contributions
 eye-0.021
ocket-0.019
anger-0.019
 gate-0.018
inger-0.018
Token began
Token ID2540
Feature activation+1.37e-02

Pos logit contributions
 eye+0.025
 pad+0.022
ocket+0.022
 gate+0.021
inger+0.021
Neg logit contributions
 ne-0.019
 subt-0.019
 caring-0.017
 ang-0.017
bel-0.016
Token to
Token ID284
Feature activation-4.63e-03

Pos logit contributions
 eye+0.020
 gate+0.017
ocket+0.017
 pad+0.017
anger+0.016
Neg logit contributions
 ne-0.027
 subt-0.026
 caring-0.024
ths-0.023
Yeah-0.023
Token make
Token ID787
Feature activation+7.73e-03

Pos logit contributions
 subt+0.008
ths+0.007
Yeah+0.007
 ne+0.007
bel+0.007
Neg logit contributions
 eye-0.008
 pad-0.007
 gate-0.007
 Sasha-0.006
ocket-0.006
Token-
Token ID12
Feature activation-5.14e-02

Pos logit contributions
 eye+0.014
ocket+0.013
 gate+0.012
anger+0.012
 pad+0.012
Neg logit contributions
 ne-0.012
 subt-0.012
 caring-0.011
ths-0.010
 ang-0.010
Tokenbel
Token ID6667
Feature activation-9.61e-02

Pos logit contributions
 subt+0.056
 ne+0.055
Yeah+0.044
ths+0.044
 caring+0.044
Neg logit contributions
 eye-0.113
ocket-0.102
 gate-0.101
inger-0.100
 pad-0.100
Tokenieve
Token ID12311
Feature activation-1.13e-02

Pos logit contributions
 ne+0.128
 subt+0.119
spe+0.117
Yeah+0.109
 grizz+0.107
Neg logit contributions
 eye-0.177
ocket-0.172
hole-0.158
lor-0.155
 envelope-0.153
Token he
Token ID339
Feature activation-8.49e-03

Pos logit contributions
 ne+0.021
 subt+0.020
 ang+0.018
 caring+0.018
Yeah+0.018
Neg logit contributions
 eye-0.017
 gate-0.015
ocket-0.015
 pad-0.014
anger-0.014
Token was
Token ID373
Feature activation-4.45e-03

Pos logit contributions
 subt+0.014
 ne+0.014
 caring+0.013
ths+0.013
Yeah+0.012
Neg logit contributions
 eye-0.015
ocket-0.013
 pad-0.013
 gate-0.012
adow-0.012
Token a
Token ID257
Feature activation-3.50e-03

Pos logit contributions
 ne+0.007
 subt+0.007
 caring+0.006
ths+0.006
 ang+0.006
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
 pad-0.007
anger-0.007
Token hero
Token ID4293
Feature activation-2.94e-02

Pos logit contributions
 subt+0.006
 ne+0.005
ths+0.005
bel+0.005
 ang+0.005
Neg logit contributions
 eye-0.006
 gate-0.006
 pad-0.005
 net-0.005
ork-0.005
Token liked
Token ID8288
Feature activation-4.80e-03

Pos logit contributions
 eye+0.007
ocket+0.006
 gate+0.006
 pad+0.006
adow+0.006
Neg logit contributions
 subt-0.006
 ne-0.006
 ang-0.005
ths-0.005
spe-0.005
Token to
Token ID284
Feature activation-8.18e-03

Pos logit contributions
 ne+0.011
 subt+0.010
 ang+0.010
 caring+0.010
Yeah+0.009
Neg logit contributions
 eye-0.005
 gate-0.005
 pad-0.004
ocket-0.004
anger-0.004
Token play
Token ID711
Feature activation-2.33e-02

Pos logit contributions
 ne+0.014
 subt+0.013
ths+0.012
 ang+0.012
 caring+0.012
Neg logit contributions
 eye-0.015
 gate-0.012
ocket-0.012
anger-0.012
inger-0.012
Token make
Token ID787
Feature activation+1.27e-04

Pos logit contributions
 subt+0.043
 ne+0.043
spe+0.039
 ang+0.039
 caring+0.039
Neg logit contributions
 eye-0.035
anger-0.031
 gate-0.031
ork-0.030
ocket-0.029
Token-
Token ID12
Feature activation-5.23e-02

Pos logit contributions
 eye+0.000
ocket+0.000
 gate+0.000
anger+0.000
 pad+0.000
Neg logit contributions
 ne-0.000
 subt-0.000
 caring-0.000
 ang-0.000
ths-0.000
Tokenbel
Token ID6667
Feature activation-9.45e-02

Pos logit contributions
 ne+0.052
 subt+0.048
ths+0.038
 caring+0.035
Yeah+0.035
Neg logit contributions
 eye-0.127
ork-0.116
inger-0.115
 gate-0.115
anger-0.114
Tokenieve
Token ID12311
Feature activation-1.63e-02

Pos logit contributions
 ne+0.122
 subt+0.114
spe+0.107
Yeah+0.103
 grizz+0.100
Neg logit contributions
 eye-0.183
ocket-0.172
ork-0.163
anger-0.163
inger-0.160
Token.
Token ID13
Feature activation+3.38e-03

Pos logit contributions
 ne+0.031
 subt+0.030
 ang+0.028
 caring+0.027
ths+0.027
Neg logit contributions
 eye-0.025
 gate-0.022
ocket-0.021
anger-0.021
ork-0.021
Token
Token ID198
Feature activation+7.68e-03

Pos logit contributions
 eye+0.008
ocket+0.008
inger+0.008
anger+0.008
ipper+0.008
Neg logit contributions
 ne-0.003
 subt-0.003
 caring-0.002
Deal-0.002
Yeah-0.002
Token
Token ID198
Feature activation-1.84e-03

Pos logit contributions
 eye+0.016
ocket+0.014
anger+0.014
 gate+0.013
inger+0.013
Neg logit contributions
 ne-0.011
 subt-0.010
 caring-0.010
bel-0.009
Yeah-0.009
TokenOne
Token ID3198
Feature activation+2.06e-02

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
ths+0.001
Yeah+0.001
Neg logit contributions
 eye-0.004
ocket-0.004
anger-0.004
 gate-0.004
inger-0.004
Token loved
Token ID6151
Feature activation-8.33e-03

Pos logit contributions
 eye+0.012
 pad+0.011
ocket+0.011
 gate+0.011
inger+0.010
Neg logit contributions
 ne-0.010
 subt-0.009
 caring-0.008
 ang-0.008
bel-0.008
Token to
Token ID284
Feature activation-5.21e-03

Pos logit contributions
 ne+0.018
 subt+0.017
 ang+0.016
Yeah+0.016
 caring+0.015
Neg logit contributions
 eye-0.010
 gate-0.009
 pad-0.009
ocket-0.009
 Sasha-0.008
Token play
Token ID711
Feature activation-1.57e-02

Pos logit contributions
 ne+0.008
 subt+0.008
 ang+0.008
ths+0.008
 caring+0.007
Neg logit contributions
 eye-0.009
 gate-0.008
ocket-0.008
 pad-0.008
 Sasha-0.008
Token make
Token ID787
Feature activation+2.23e-03

Pos logit contributions
 subt+0.029
 ne+0.029
 ang+0.028
spe+0.027
 caring+0.027
Neg logit contributions
 eye-0.022
anger-0.020
 gate-0.020
ork-0.019
 pad-0.019
Token-
Token ID12
Feature activation-5.09e-02

Pos logit contributions
 eye+0.004
 gate+0.004
anger+0.004
ocket+0.004
 pad+0.003
Neg logit contributions
 ne-0.004
 subt-0.003
ths-0.003
 caring-0.003
Yeah-0.003
Tokenbel
Token ID6667
Feature activation-9.41e-02

Pos logit contributions
 ne+0.075
 subt+0.068
ths+0.061
 ang+0.061
 caring+0.060
Neg logit contributions
 eye-0.097
ork-0.088
inger-0.086
anger-0.086
ocket-0.086
Tokenieve
Token ID12311
Feature activation-1.61e-02

Pos logit contributions
 ne+0.130
 grizz+0.110
spe+0.109
 subt+0.107
Yeah+0.105
Neg logit contributions
ocket-0.166
 eye-0.165
anger-0.153
ork-0.153
 envelope-0.148
Token.
Token ID13
Feature activation+4.06e-03

Pos logit contributions
 ne+0.027
 subt+0.027
 ang+0.025
 caring+0.025
Yeah+0.024
Neg logit contributions
 eye-0.027
 gate-0.024
anger-0.024
ocket-0.023
ork-0.023
Token Today
Token ID6288
Feature activation-6.73e-03

Pos logit contributions
 eye+0.008
ocket+0.007
 gate+0.007
inger+0.007
anger+0.007
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.005
Yeah-0.005
ths-0.005
Token she
Token ID673
Feature activation-3.19e-04

Pos logit contributions
 ne+0.010
 subt+0.009
 caring+0.008
ths+0.008
bel+0.007
Neg logit contributions
 eye-0.014
ocket-0.013
inger-0.013
 pad-0.012
ipper-0.012
Token was
Token ID373
Feature activation-4.89e-03

Pos logit contributions
 ne+0.000
 subt+0.000
 caring+0.000
 ang+0.000
ths+0.000
Neg logit contributions
 eye-0.001
ocket-0.001
 gate-0.001
anger-0.001
 pad-0.001
Token-
Token ID12
Feature activation-3.68e-02

Pos logit contributions
 ne+0.032
 subt+0.030
 caring+0.028
ths+0.026
 ang+0.026
Neg logit contributions
 eye-0.039
ocket-0.035
 gate-0.034
 pad-0.034
anger-0.034
Tokenup
Token ID929
Feature activation-1.42e-02

Pos logit contributions
 ne+0.068
 subt+0.065
 caring+0.060
 ang+0.058
ths+0.058
Neg logit contributions
 eye-0.059
ocket-0.051
anger-0.050
 gate-0.050
 pad-0.049
Token and
Token ID290
Feature activation+1.50e-02

Pos logit contributions
 ne+0.024
 subt+0.023
 caring+0.021
ths+0.020
 ang+0.020
Neg logit contributions
 eye-0.025
ocket-0.022
 gate-0.022
anger-0.022
 pad-0.021
Token make
Token ID787
Feature activation+6.57e-03

Pos logit contributions
 eye+0.024
 gate+0.020
ocket+0.020
 pad+0.019
 envelope+0.018
Neg logit contributions
 ne-0.028
 subt-0.027
ths-0.026
 ang-0.025
 caring-0.025
Token-
Token ID12
Feature activation-4.67e-02

Pos logit contributions
 eye+0.012
ocket+0.011
 gate+0.010
anger+0.010
 pad+0.010
Neg logit contributions
 ne-0.011
 subt-0.010
 caring-0.009
ths-0.009
 ang-0.009
Tokenbel
Token ID6667
Feature activation-9.19e-02

Pos logit contributions
 ne+0.067
 subt+0.064
 caring+0.056
ths+0.055
 ang+0.054
Neg logit contributions
 eye-0.094
ocket-0.084
anger-0.083
 gate-0.082
ork-0.081
Tokenieve
Token ID12311
Feature activation-1.48e-02

Pos logit contributions
 ne+0.115
 subt+0.108
spe+0.103
 grizz+0.101
Yeah+0.095
Neg logit contributions
 eye-0.177
ocket-0.168
ork-0.159
anger-0.158
 envelope-0.155
Token.
Token ID13
Feature activation+3.17e-03

Pos logit contributions
 ne+0.025
 subt+0.024
 caring+0.022
 ang+0.021
ths+0.021
Neg logit contributions
 eye-0.026
ocket-0.023
 gate-0.023
anger-0.023
 pad-0.023
Token One
Token ID1881
Feature activation+2.10e-02

Pos logit contributions
ocket+0.008
 eye+0.008
inger+0.007
lor+0.007
anger+0.007
Neg logit contributions
 ne-0.003
 subt-0.003
 cartoons-0.002
 caring-0.002
 Sp-0.002
Token day
Token ID1110
Feature activation-2.56e-02

Pos logit contributions
 eye+0.037
ocket+0.036
anger+0.035
 gate+0.035
inger+0.035
Neg logit contributions
 ne-0.033
 subt-0.029
 caring-0.029
bel-0.026
Yeah-0.026
Token,
Token ID11
Feature activation-6.51e-03

Pos logit contributions
 ne+0.040
 subt+0.037
 caring+0.034
 ang+0.032
ths+0.032
Neg logit contributions
 eye-0.049
ocket-0.044
 gate-0.043
inger-0.043
 pad-0.042
Token red
Token ID2266
Feature activation+2.11e-02

Pos logit contributions
 eye+0.004
ocket+0.003
 gate+0.003
 pad+0.003
anger+0.003
Neg logit contributions
 subt-0.004
 ne-0.004
ths-0.004
 caring-0.004
Yeah-0.004
Token cape
Token ID36023
Feature activation-1.34e-02

Pos logit contributions
 eye+0.035
ocket+0.030
 gate+0.030
anger+0.030
 pad+0.029
Neg logit contributions
 ne-0.038
 subt-0.036
 caring-0.033
 ang-0.032
ths-0.032
Token and
Token ID290
Feature activation+8.93e-03

Pos logit contributions
 ne+0.025
 subt+0.024
 caring+0.023
 ang+0.022
ths+0.022
Neg logit contributions
 eye-0.021
ocket-0.018
 gate-0.018
anger-0.017
 pad-0.017
Token make
Token ID787
Feature activation+9.61e-03

Pos logit contributions
 eye+0.017
 pad+0.014
 gate+0.014
ocket+0.014
 Sasha+0.013
Neg logit contributions
 ne-0.014
 subt-0.013
ths-0.012
Yeah-0.012
 caring-0.012
Token-
Token ID12
Feature activation-4.64e-02

Pos logit contributions
 eye+0.017
ocket+0.015
 gate+0.014
anger+0.014
 pad+0.014
Neg logit contributions
 ne-0.016
 subt-0.016
 caring-0.014
ths-0.014
 ang-0.014
Tokenbel
Token ID6667
Feature activation-9.13e-02

Pos logit contributions
 ne+0.036
 subt+0.034
 caring+0.025
Yeah+0.024
ths+0.024
Neg logit contributions
 eye-0.122
ocket-0.112
ork-0.110
 pad-0.110
 gate-0.109
Tokenieve
Token ID12311
Feature activation-1.12e-02

Pos logit contributions
 ne+0.116
 subt+0.109
spe+0.100
Yeah+0.098
 grizz+0.095
Neg logit contributions
 eye-0.178
ocket-0.172
ork-0.159
lor-0.157
anger-0.156
Token he
Token ID339
Feature activation-7.78e-03

Pos logit contributions
 ne+0.020
 subt+0.020
 caring+0.018
 ang+0.017
ths+0.017
Neg logit contributions
 eye-0.018
ocket-0.016
 gate-0.016
anger-0.016
 pad-0.015
Token had
Token ID550
Feature activation+3.81e-02

Pos logit contributions
 subt+0.015
Yeah+0.013
 ne+0.013
ths+0.013
 caring+0.013
Neg logit contributions
 eye-0.013
ocket-0.011
adow-0.010
 pad-0.010
hole-0.009
Token super
Token ID2208
Feature activation+6.74e-03

Pos logit contributions
 eye+0.070
ocket+0.062
 gate+0.061
anger+0.061
 pad+0.060
Neg logit contributions
 ne-0.062
 subt-0.058
 caring-0.053
 ang-0.050
ths-0.050
Tokenpowers
Token ID30132
Feature activation-1.13e-02

Pos logit contributions
 eye+0.012
ocket+0.010
adow+0.010
ork+0.010
 gate+0.010
Neg logit contributions
 subt-0.011
 ne-0.011
ths-0.010
 ang-0.010
Yeah-0.010
Token He
Token ID679
Feature activation-1.20e-02

Pos logit contributions
 eye+0.003
 gate+0.003
 pad+0.003
 Sasha+0.003
hole+0.003
Neg logit contributions
 ne-0.003
 subt-0.003
 caring-0.003
bel-0.003
ths-0.003
Token loved
Token ID6151
Feature activation-8.56e-03

Pos logit contributions
 ne+0.019
 subt+0.019
 caring+0.017
 ang+0.017
ths+0.017
Neg logit contributions
 eye-0.021
ocket-0.019
 pad-0.019
 gate-0.018
ork-0.017
Token playing
Token ID2712
Feature activation-2.57e-02

Pos logit contributions
 ne+0.019
 subt+0.019
Deal+0.017
Yeah+0.017
 ang+0.017
Neg logit contributions
 eye-0.009
 gate-0.008
 pad-0.007
ocket-0.007
 home-0.007
Token make
Token ID787
Feature activation+3.61e-03

Pos logit contributions
 subt+0.048
 ne+0.046
ths+0.045
spe+0.044
 caring+0.044
Neg logit contributions
 eye-0.036
 gate-0.034
 net-0.031
anger-0.031
 garden-0.030
Token-
Token ID12
Feature activation-4.78e-02

Pos logit contributions
 eye+0.006
 gate+0.006
anger+0.006
ocket+0.006
 pad+0.005
Neg logit contributions
 ne-0.006
 subt-0.006
ths-0.005
Yeah-0.005
 caring-0.005
Tokenbel
Token ID6667
Feature activation-9.10e-02

Pos logit contributions
 ne+0.056
 subt+0.050
ths+0.042
 caring+0.041
Yeah+0.040
Neg logit contributions
 eye-0.104
ork-0.098
ubs-0.095
ocket-0.095
inger-0.094
Tokenieve
Token ID12311
Feature activation-1.54e-02

Pos logit contributions
 ne+0.127
 grizz+0.113
 subt+0.109
spe+0.107
Yeah+0.106
Neg logit contributions
 eye-0.156
ocket-0.155
hole-0.143
ork-0.142
 envelope-0.140
Token with
Token ID351
Feature activation-5.73e-03

Pos logit contributions
 ne+0.028
 subt+0.027
 caring+0.025
 ang+0.025
Yeah+0.024
Neg logit contributions
 eye-0.024
 gate-0.022
ocket-0.021
anger-0.021
ork-0.020
Token his
Token ID465
Feature activation-2.64e-03

Pos logit contributions
 ne+0.008
 subt+0.008
Deal+0.007
ths+0.007
Yeah+0.007
Neg logit contributions
 eye-0.011
 gate-0.010
ocket-0.010
anger-0.009
 envelope-0.009
Token friends
Token ID2460
Feature activation-8.69e-03

Pos logit contributions
 ne+0.005
 subt+0.005
 ang+0.004
 caring+0.004
ths+0.004
Neg logit contributions
 eye-0.004
 gate-0.004
ocket-0.003
 pad-0.003
anger-0.003
Token.
Token ID13
Feature activation+2.31e-03

Pos logit contributions
 ne+0.018
 subt+0.017
 caring+0.016
ths+0.016
 ang+0.016
Neg logit contributions
 eye-0.012
 pad-0.010
ocket-0.010
 gate-0.010
hole-0.009
Token to
Token ID284
Feature activation-1.19e-02

Pos logit contributions
 eye+0.038
ocket+0.033
anger+0.031
 gate+0.031
inger+0.031
Neg logit contributions
 ne-0.049
 subt-0.048
 caring-0.045
 ang-0.043
ths-0.042
Token take
Token ID1011
Feature activation-2.25e-03

Pos logit contributions
 ne+0.021
 subt+0.020
 caring+0.018
bel+0.018
Yeah+0.018
Neg logit contributions
 eye-0.020
 gate-0.018
ocket-0.016
 pad-0.016
inger-0.016
Token down
Token ID866
Feature activation+8.44e-03

Pos logit contributions
 ne+0.004
 subt+0.004
 caring+0.004
bel+0.003
Yeah+0.003
Neg logit contributions
 eye-0.004
ocket-0.003
 gate-0.003
anger-0.003
 pad-0.003
Token the
Token ID262
Feature activation+4.60e-04

Pos logit contributions
 eye+0.016
ocket+0.014
 pad+0.014
anger+0.014
 gate+0.014
Neg logit contributions
 ne-0.013
 subt-0.012
 caring-0.011
 ang-0.011
ths-0.011
Token light
Token ID1657
Feature activation-1.45e-02

Pos logit contributions
 eye+0.001
 gate+0.001
anger+0.001
ocket+0.001
 pad+0.001
Neg logit contributions
 ne-0.001
 subt-0.001
 caring-0.001
Yeah-0.001
 ang-0.001
Tokenbul
Token ID15065
Feature activation-9.05e-02

Pos logit contributions
 ne+0.026
 subt+0.025
 caring+0.023
bel+0.022
ths+0.022
Neg logit contributions
 eye-0.023
ocket-0.021
 pad-0.020
inger-0.020
 gate-0.020
Tokenb
Token ID65
Feature activation+1.16e-02

Pos logit contributions
 ne+0.138
 subt+0.129
 caring+0.117
Yeah+0.116
Deal+0.112
Neg logit contributions
 eye-0.171
hole-0.158
anger-0.156
inger-0.149
ocket-0.148
Token but
Token ID475
Feature activation+1.19e-02

Pos logit contributions
 eye+0.018
ocket+0.016
 pad+0.015
 Sasha+0.015
 gate+0.015
Neg logit contributions
 ne-0.021
 subt-0.020
bel-0.019
 caring-0.019
ths-0.018
Token she
Token ID673
Feature activation+6.14e-03

Pos logit contributions
 eye+0.017
inger+0.015
ocket+0.015
 gate+0.015
 pad+0.015
Neg logit contributions
 ne-0.024
 subt-0.022
 caring-0.021
ths-0.020
 ang-0.020
Token was
Token ID373
Feature activation+3.82e-03

Pos logit contributions
 eye+0.012
 gate+0.011
ocket+0.011
anger+0.011
inger+0.011
Neg logit contributions
 ne-0.009
 subt-0.008
 caring-0.007
ths-0.007
Yeah-0.007
Token too
Token ID1165
Feature activation+7.11e-03

Pos logit contributions
 eye+0.008
ocket+0.007
 pad+0.007
 gate+0.007
 Sasha+0.007
Neg logit contributions
 subt-0.005
 ne-0.005
bel-0.004
 caring-0.004
ths-0.004
Token and
Token ID290
Feature activation+9.88e-03

Pos logit contributions
 eye+0.024
inger+0.022
ocket+0.021
anger+0.021
 Sasha+0.021
Neg logit contributions
 ne-0.025
 subt-0.022
 caring-0.021
bel-0.020
ths-0.020
Token they
Token ID484
Feature activation+8.47e-03

Pos logit contributions
 eye+0.018
ocket+0.016
 gate+0.016
 pad+0.016
 Sasha+0.015
Neg logit contributions
 ne-0.015
 subt-0.014
 ang-0.013
ths-0.013
 caring-0.013
Token played
Token ID2826
Feature activation-7.81e-03

Pos logit contributions
 eye+0.015
ocket+0.013
 pad+0.012
 gate+0.012
adow+0.011
Neg logit contributions
 subt-0.015
 ne-0.014
 ang-0.013
ths-0.013
 caring-0.012
Token make
Token ID787
Feature activation+1.28e-02

Pos logit contributions
 ne+0.011
 subt+0.011
ths+0.009
 ang+0.009
 caring+0.009
Neg logit contributions
 eye-0.015
 gate-0.014
ocket-0.014
anger-0.014
 pad-0.013
Token-
Token ID12
Feature activation-5.25e-02

Pos logit contributions
 eye+0.024
ocket+0.021
 gate+0.021
anger+0.021
 pad+0.020
Neg logit contributions
 ne-0.020
 subt-0.019
 caring-0.017
ths-0.017
 ang-0.016
Tokenbel
Token ID6667
Feature activation-8.96e-02

Pos logit contributions
 ne+0.040
 subt+0.039
ths+0.028
 caring+0.025
 ang+0.025
Neg logit contributions
 eye-0.134
ork-0.127
ocket-0.126
anger-0.124
 gate-0.124
Tokenieve
Token ID12311
Feature activation-9.26e-03

Pos logit contributions
 ne+0.118
 subt+0.111
 grizz+0.107
spe+0.100
Yeah+0.097
Neg logit contributions
 eye-0.157
ocket-0.156
ork-0.149
hole-0.144
anger-0.143
Token.
Token ID13
Feature activation-3.92e-03

Pos logit contributions
 ne+0.016
 subt+0.016
 ang+0.014
 caring+0.014
ths+0.014
Neg logit contributions
 eye-0.015
ocket-0.013
 gate-0.013
anger-0.013
 pad-0.013
Token Finally
Token ID9461
Feature activation-1.93e-02

Pos logit contributions
 subt+0.006
 ne+0.006
Yeah+0.006
 Sp+0.005
 cartoons+0.005
Neg logit contributions
hole-0.007
 eye-0.007
ocket-0.007
inger-0.007
 gate-0.006
Token,
Token ID11
Feature activation-6.31e-04

Pos logit contributions
 ne+0.029
 subt+0.026
 caring+0.024
 ang+0.023
ths+0.023
Neg logit contributions
 eye-0.038
ocket-0.035
 pad-0.034
inger-0.034
anger-0.034
Token they
Token ID484
Feature activation+1.19e-02

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
Yeah+0.001
 ang+0.001
Neg logit contributions
 eye-0.001
inger-0.001
hole-0.001
ocket-0.001
anger-0.001
Token he
Token ID339
Feature activation-1.26e-02

Pos logit contributions
 eye+0.027
 gate+0.024
 Sasha+0.024
ocket+0.023
 pad+0.023
Neg logit contributions
 ne-0.019
 subt-0.019
 caring-0.016
bel-0.016
 ang-0.016
Token was
Token ID373
Feature activation-4.14e-03

Pos logit contributions
 ne+0.020
 subt+0.019
 caring+0.018
 ang+0.017
ths+0.017
Neg logit contributions
 eye-0.023
ocket-0.020
 gate-0.020
 pad-0.020
anger-0.020
Token playing
Token ID2712
Feature activation-2.34e-02

Pos logit contributions
 ne+0.006
 subt+0.006
 caring+0.005
 ang+0.005
ths+0.005
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
anger-0.007
 pad-0.007
Token make
Token ID787
Feature activation+1.05e-02

Pos logit contributions
 ne+0.038
 subt+0.038
 caring+0.034
ths+0.033
 ang+0.032
Neg logit contributions
 eye-0.041
 gate-0.036
ocket-0.035
anger-0.035
 pad-0.035
Token-
Token ID12
Feature activation-4.98e-02

Pos logit contributions
 eye+0.019
 gate+0.017
ocket+0.017
anger+0.016
 pad+0.016
Neg logit contributions
 ne-0.017
 subt-0.016
 caring-0.014
ths-0.014
Yeah-0.014
Tokenbel
Token ID6667
Feature activation-8.95e-02

Pos logit contributions
 ne+0.042
 subt+0.039
 caring+0.029
ths+0.029
 ang+0.028
Neg logit contributions
 eye-0.127
ocket-0.118
ork-0.115
inger-0.115
anger-0.115
Tokenieve
Token ID12311
Feature activation-1.25e-02

Pos logit contributions
 ne+0.120
 subt+0.113
 grizz+0.107
Yeah+0.102
spe+0.101
Neg logit contributions
 eye-0.159
ocket-0.154
ork-0.145
hole-0.144
lor-0.143
Token at
Token ID379
Feature activation+2.50e-03

Pos logit contributions
 ne+0.023
 subt+0.022
 caring+0.020
 ang+0.020
ths+0.020
Neg logit contributions
 eye-0.020
ocket-0.017
 gate-0.017
anger-0.017
 pad-0.017
Token home
Token ID1363
Feature activation+7.09e-03

Pos logit contributions
 eye+0.005
ocket+0.004
anger+0.004
 pad+0.004
 gate+0.004
Neg logit contributions
 ne-0.004
 subt-0.004
 caring-0.003
 ang-0.003
ths-0.003
Token,
Token ID11
Feature activation+1.72e-03

Pos logit contributions
 eye+0.010
 gate+0.009
ocket+0.009
anger+0.008
 pad+0.008
Neg logit contributions
 ne-0.014
 subt-0.014
Yeah-0.012
 caring-0.012
ths-0.012
Token too
Token ID1165
Feature activation+1.18e-02

Pos logit contributions
 eye+0.003
ocket+0.002
 gate+0.002
 pad+0.002
anger+0.002
Neg logit contributions
 ne-0.003
 subt-0.003
 caring-0.003
ths-0.003
 ang-0.003
Token was
Token ID373
Feature activation+1.37e-03

Pos logit contributions
 eye+0.031
ocket+0.027
 gate+0.027
anger+0.027
 pad+0.026
Neg logit contributions
 ne-0.028
 subt-0.026
 caring-0.024
 ang-0.023
ths-0.023
Token ready
Token ID3492
Feature activation+2.03e-02

Pos logit contributions
 eye+0.003
ocket+0.002
 gate+0.002
 pad+0.002
anger+0.002
Neg logit contributions
 ne-0.002
 subt-0.002
 caring-0.002
ths-0.002
 ang-0.002
Token for
Token ID329
Feature activation+1.27e-02

Pos logit contributions
 eye+0.035
ocket+0.035
inger+0.034
anger+0.034
 gate+0.032
Neg logit contributions
 ne-0.032
 caring-0.028
 subt-0.028
bel-0.026
 ang-0.025
Token make
Token ID787
Feature activation+5.15e-03

Pos logit contributions
 eye+0.023
ocket+0.021
anger+0.020
 gate+0.020
 pad+0.020
Neg logit contributions
 ne-0.021
 subt-0.020
 caring-0.018
ths-0.017
 ang-0.017
Token-
Token ID12
Feature activation-4.92e-02

Pos logit contributions
 eye+0.009
ocket+0.008
anger+0.008
 pad+0.008
 gate+0.008
Neg logit contributions
 ne-0.009
 subt-0.008
 caring-0.008
 ang-0.007
ths-0.007
Tokenbel
Token ID6667
Feature activation-8.83e-02

Pos logit contributions
 ne+0.082
 subt+0.079
ths+0.070
Yeah+0.068
 caring+0.068
Neg logit contributions
 eye-0.083
ocket-0.074
inger-0.071
ork-0.071
anger-0.069
Tokenieve
Token ID12311
Feature activation-1.09e-02

Pos logit contributions
 ne+0.114
 subt+0.101
Yeah+0.091
 grizz+0.088
spe+0.087
Neg logit contributions
 eye-0.169
ocket-0.166
lor-0.154
ork-0.153
inger-0.150
Token fun
Token ID1257
Feature activation+8.60e-03

Pos logit contributions
 ne+0.021
 subt+0.020
 caring+0.018
 ang+0.018
ths+0.018
Neg logit contributions
 eye-0.016
ocket-0.014
 gate-0.014
anger-0.014
 pad-0.014
Token.
Token ID13
Feature activation-2.38e-03

Pos logit contributions
 eye+0.014
ocket+0.012
 gate+0.012
anger+0.012
 pad+0.011
Neg logit contributions
 ne-0.016
 subt-0.015
 caring-0.014
 ang-0.014
ths-0.013
Token<|endoftext|>
Token ID50256
Feature activation-4.44e-03

Pos logit contributions
 subt+0.003
 ne+0.003
Yeah+0.003
 caring+0.003
ths+0.003
Neg logit contributions
 eye-0.005
ocket-0.004
anger-0.004
inger-0.004
 gate-0.004
TokenOnce
Token ID7454
Feature activation-8.95e-03

Pos logit contributions
 ne+0.007
 subt+0.006
cl+0.005
B+0.005
 Bones+0.005
Neg logit contributions
 eye-0.009
ocket-0.008
 gate-0.008
 envelope-0.008
inger-0.008
Token he
Token ID339
Feature activation-9.23e-04

Pos logit contributions
 subt+0.015
 ne+0.014
 caring+0.013
bel+0.012
kind+0.012
Neg logit contributions
 eye-0.015
 gate-0.014
 Sasha-0.014
anger-0.013
 home-0.013
Token accidentally
Token ID14716
Feature activation+1.13e-02

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.001
 ang+0.001
ths+0.001
Neg logit contributions
 eye-0.002
ocket-0.001
 gate-0.001
anger-0.001
 pad-0.001
Token touched
Token ID12615
Feature activation+2.87e-03

Pos logit contributions
 eye+0.021
ocket+0.019
 gate+0.018
anger+0.018
inger+0.018
Neg logit contributions
 ne-0.018
 subt-0.017
 caring-0.015
 ang-0.015
ths-0.015
Token the
Token ID262
Feature activation+5.20e-03

Pos logit contributions
 eye+0.005
ocket+0.004
 gate+0.004
anger+0.004
 pad+0.004
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.004
 ang-0.004
ths-0.004
Token light
Token ID1657
Feature activation-1.55e-02

Pos logit contributions
 eye+0.009
 gate+0.008
anger+0.008
ocket+0.008
 pad+0.007
Neg logit contributions
 ne-0.009
 subt-0.009
 caring-0.008
ths-0.008
 ang-0.008
Tokenbul
Token ID15065
Feature activation-8.65e-02

Pos logit contributions
 ne+0.029
 subt+0.027
 caring+0.025
 ang+0.024
ths+0.024
Neg logit contributions
 eye-0.024
ocket-0.022
 gate-0.021
inger-0.021
 pad-0.021
Tokenb
Token ID65
Feature activation+1.24e-02

Pos logit contributions
 ne+0.124
 subt+0.119
 caring+0.108
Yeah+0.101
Deal+0.100
Neg logit contributions
 eye-0.170
hole-0.160
anger-0.154
ocket-0.147
inger-0.144
Token and
Token ID290
Feature activation+3.15e-03

Pos logit contributions
 eye+0.019
ocket+0.017
 Sasha+0.017
 pad+0.016
 gate+0.016
Neg logit contributions
 ne-0.023
 subt-0.021
bel-0.021
 ang-0.020
 caring-0.020
Token it
Token ID340
Feature activation-1.04e-02

Pos logit contributions
 eye+0.004
ocket+0.004
 gate+0.003
anger+0.003
 pad+0.003
Neg logit contributions
 ne-0.007
 subt-0.006
 caring-0.006
 ang-0.006
ths-0.006
Token was
Token ID373
Feature activation+5.10e-03

Pos logit contributions
 subt+0.018
 ne+0.018
 caring+0.017
Deal+0.016
Yeah+0.016
Neg logit contributions
 eye-0.017
 gate-0.014
anger-0.014
ocket-0.014
ork-0.014
Token very
Token ID845
Feature activation+4.13e-03

Pos logit contributions
 eye+0.009
 gate+0.008
ocket+0.008
 pad+0.008
anger+0.008
Neg logit contributions
 ne-0.008
 subt-0.008
 caring-0.007
bel-0.007
ths-0.007
Token Finally
Token ID9461
Feature activation-1.89e-02

Pos logit contributions
 subt+0.003
 ne+0.003
 caring+0.003
Yeah+0.003
ths+0.002
Neg logit contributions
 eye-0.004
ocket-0.003
 gate-0.003
anger-0.003
inger-0.003
Token,
Token ID11
Feature activation-2.14e-03

Pos logit contributions
 ne+0.029
 subt+0.027
 caring+0.025
ths+0.024
 ang+0.023
Neg logit contributions
 eye-0.036
ocket-0.033
 pad-0.032
inger-0.032
anger-0.032
Token her
Token ID607
Feature activation-4.83e-03

Pos logit contributions
 ne+0.004
 subt+0.003
 caring+0.003
bel+0.003
ths+0.003
Neg logit contributions
 eye-0.004
ocket-0.003
inger-0.003
 pad-0.003
anger-0.003
Token make
Token ID787
Feature activation+6.45e-03

Pos logit contributions
 ne+0.009
 subt+0.009
 caring+0.008
 ang+0.008
ths+0.008
Neg logit contributions
 eye-0.008
ocket-0.006
 gate-0.006
 pad-0.006
anger-0.006
Token-
Token ID12
Feature activation-4.72e-02

Pos logit contributions
 eye+0.012
ocket+0.011
anger+0.011
 pad+0.011
inger+0.011
Neg logit contributions
 ne-0.010
 subt-0.009
 caring-0.009
 ang-0.008
ths-0.008
Tokenbel
Token ID6667
Feature activation-8.63e-02

Pos logit contributions
 ne+0.067
 subt+0.063
 caring+0.053
 ang+0.053
ths+0.052
Neg logit contributions
 eye-0.093
ocket-0.084
ork-0.083
 pad-0.082
inger-0.081
Tokenieve
Token ID12311
Feature activation-5.18e-03

Pos logit contributions
 ne+0.108
 subt+0.098
Yeah+0.089
spe+0.089
 ang+0.084
Neg logit contributions
 eye-0.173
ocket-0.168
lor-0.154
hole-0.153
ork-0.151
Token adventures
Token ID17545
Feature activation-1.12e-02

Pos logit contributions
 ne+0.009
 subt+0.009
 caring+0.008
bel+0.008
ths+0.008
Neg logit contributions
 eye-0.008
ocket-0.007
inger-0.007
anger-0.007
 pad-0.007
Token were
Token ID547
Feature activation+3.90e-03

Pos logit contributions
 ne+0.021
 subt+0.020
 caring+0.019
 ang+0.018
ths+0.018
Neg logit contributions
 eye-0.017
ocket-0.015
 gate-0.015
anger-0.015
 pad-0.015
Token recorded
Token ID6264
Feature activation-1.51e-02

Pos logit contributions
 eye+0.007
ocket+0.006
anger+0.006
 gate+0.006
 pad+0.006
Neg logit contributions
 ne-0.006
 subt-0.006
 caring-0.006
ths-0.005
 ang-0.005
Token.
Token ID13
Feature activation-4.09e-03

Pos logit contributions
 ne+0.026
 subt+0.025
 caring+0.023
 ang+0.022
ths+0.022
Neg logit contributions
 eye-0.025
ocket-0.022
 gate-0.022
anger-0.022
 pad-0.021
Token He
Token ID679
Feature activation-1.26e-02

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
Yeah+0.001
 ang+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
anger-0.001
 gate-0.001
inger-0.001
Token would
Token ID561
Feature activation+4.03e-03

Pos logit contributions
 ne+0.020
 subt+0.020
 caring+0.019
 ang+0.018
ths+0.018
Neg logit contributions
 eye-0.022
 gate-0.019
 pad-0.019
ocket-0.018
anger-0.017
Token race
Token ID3234
Feature activation-9.50e-03

Pos logit contributions
 eye+0.007
 gate+0.006
 pad+0.006
ocket+0.006
 Sasha+0.006
Neg logit contributions
 ne-0.007
 subt-0.007
 caring-0.006
ths-0.006
 ang-0.006
Token make
Token ID787
Feature activation+1.15e-02

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.015
ths+0.014
 ang+0.014
Neg logit contributions
 eye-0.015
 gate-0.014
anger-0.013
 pad-0.013
ocket-0.013
Token-
Token ID12
Feature activation-4.29e-02

Pos logit contributions
 eye+0.021
 gate+0.019
anger+0.018
ocket+0.018
 pad+0.018
Neg logit contributions
 ne-0.018
 subt-0.018
 caring-0.016
ths-0.015
Yeah-0.015
Tokenbel
Token ID6667
Feature activation-8.57e-02

Pos logit contributions
 ne+0.061
 subt+0.059
 caring+0.050
ths+0.049
 ang+0.049
Neg logit contributions
 eye-0.081
ork-0.075
inger-0.075
 gate-0.075
anger-0.074
Tokenieve
Token ID12311
Feature activation-1.03e-02

Pos logit contributions
 ne+0.112
 subt+0.097
Yeah+0.089
spe+0.086
 ang+0.086
Neg logit contributions
 eye-0.163
ocket-0.160
ork-0.151
anger-0.150
hole-0.147
Token cars
Token ID5006
Feature activation-1.57e-02

Pos logit contributions
 ne+0.019
 subt+0.018
 caring+0.017
 ang+0.017
Yeah+0.016
Neg logit contributions
 eye-0.016
 gate-0.014
anger-0.014
 pad-0.014
ocket-0.014
Token around
Token ID1088
Feature activation-7.59e-04

Pos logit contributions
 ne+0.031
 subt+0.031
 caring+0.029
Yeah+0.028
ths+0.028
Neg logit contributions
 eye-0.021
 gate-0.019
 pad-0.018
anger-0.018
ocket-0.017
Token the
Token ID262
Feature activation+9.85e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
Yeah+0.001
 ang+0.001
Neg logit contributions
 eye-0.001
 gate-0.001
 pad-0.001
anger-0.001
ocket-0.001
Token track
Token ID2610
Feature activation+1.47e-02

Pos logit contributions
 eye+0.013
ocket+0.012
anger+0.011
 Sasha+0.011
inger+0.011
Neg logit contributions
 ne-0.020
 subt-0.019
 caring-0.018
ths-0.017
 ang-0.017
Token "
Token ID366
Feature activation-2.77e-02

Pos logit contributions
 ne+0.010
 subt+0.009
 caring+0.008
ths+0.008
 ang+0.008
Neg logit contributions
 eye-0.009
ocket-0.008
 gate-0.008
 pad-0.008
anger-0.008
TokenPlease
Token ID5492
Feature activation+5.17e-03

Pos logit contributions
 ne+0.044
 subt+0.042
 caring+0.038
 ang+0.037
ths+0.036
Neg logit contributions
 eye-0.051
ocket-0.045
 pad-0.044
 gate-0.044
anger-0.044
Token keep
Token ID1394
Feature activation+2.76e-03

Pos logit contributions
 eye+0.009
ocket+0.008
 gate+0.008
anger+0.008
 pad+0.008
Neg logit contributions
 ne-0.009
 subt-0.008
 caring-0.007
 ang-0.007
ths-0.007
Token your
Token ID534
Feature activation-9.01e-03

Pos logit contributions
 eye+0.005
ocket+0.004
anger+0.004
 pad+0.004
 gate+0.004
Neg logit contributions
 ne-0.005
 subt-0.004
 caring-0.004
ths-0.004
 ang-0.004
Token seat
Token ID5852
Feature activation-2.45e-02

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.015
bel+0.015
ths+0.014
Neg logit contributions
 eye-0.013
ocket-0.012
 Sasha-0.012
anger-0.012
 pad-0.011
Tokenbel
Token ID6667
Feature activation-8.54e-02

Pos logit contributions
 ne+0.040
 subt+0.040
 caring+0.036
 ang+0.034
ths+0.034
Neg logit contributions
 eye-0.044
ocket-0.038
 gate-0.038
anger-0.037
 pad-0.037
Tokents
Token ID912
Feature activation-4.31e-02

Pos logit contributions
 ne+0.104
 subt+0.098
Yeah+0.087
spe+0.082
 caring+0.082
Neg logit contributions
 eye-0.179
ocket-0.170
anger-0.158
lor-0.158
inger-0.158
Token on
Token ID319
Feature activation-4.92e-03

Pos logit contributions
 ne+0.077
 subt+0.070
 caring+0.067
bel+0.064
 ang+0.063
Neg logit contributions
 eye-0.071
ocket-0.063
anger-0.062
 gate-0.061
 pad-0.060
Token.
Token ID13
Feature activation-2.12e-03

Pos logit contributions
 ne+0.008
 subt+0.007
 caring+0.007
ths+0.006
 ang+0.006
Neg logit contributions
 eye-0.009
ocket-0.008
anger-0.008
 pad-0.008
inger-0.008
Token We
Token ID775
Feature activation+2.22e-02

Pos logit contributions
 ne+0.004
 subt+0.004
 caring+0.004
Yeah+0.004
ths+0.004
Neg logit contributions
 eye-0.003
ocket-0.003
 gate-0.003
anger-0.003
inger-0.003
Token will
Token ID481
Feature activation+4.43e-03

Pos logit contributions
 eye+0.038
ocket+0.034
anger+0.034
 gate+0.033
 Sasha+0.033
Neg logit contributions
 ne-0.038
 subt-0.034
 caring-0.032
 fought-0.031
bel-0.031
Token a
Token ID257
Feature activation+2.98e-03

Pos logit contributions
 ne+0.015
 subt+0.014
 caring+0.013
 ang+0.012
bel+0.011
Neg logit contributions
 eye-0.021
anger-0.020
inger-0.019
ocket-0.019
 pad-0.019
Token world
Token ID995
Feature activation-1.35e-02

Pos logit contributions
 eye+0.005
anger+0.005
ocket+0.005
inger+0.005
 Sasha+0.004
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.004
 ang-0.004
ths-0.004
Token of
Token ID286
Feature activation-3.27e-03

Pos logit contributions
 ne+0.024
 subt+0.023
 caring+0.021
bel+0.020
ths+0.019
Neg logit contributions
 eye-0.023
ocket-0.020
inger-0.019
anger-0.019
 pad-0.019
Token make
Token ID787
Feature activation+8.72e-03

Pos logit contributions
 ne+0.005
 subt+0.005
 caring+0.004
 ang+0.004
ths+0.004
Neg logit contributions
 eye-0.006
anger-0.006
ocket-0.006
inger-0.006
 pad-0.005
Token-
Token ID12
Feature activation-4.81e-02

Pos logit contributions
 eye+0.017
ocket+0.015
anger+0.015
 pad+0.015
inger+0.015
Neg logit contributions
 ne-0.013
 subt-0.012
 caring-0.012
 ang-0.011
bel-0.011
Tokenbel
Token ID6667
Feature activation-8.49e-02

Pos logit contributions
 ne+0.049
 subt+0.047
Yeah+0.038
 caring+0.036
ths+0.036
Neg logit contributions
 eye-0.114
ocket-0.106
 gate-0.103
 pad-0.103
ork-0.102
Tokenieve
Token ID12311
Feature activation-6.22e-03

Pos logit contributions
 ne+0.105
 subt+0.095
Yeah+0.088
spe+0.083
 ang+0.083
Neg logit contributions
 eye-0.171
ocket-0.167
hole-0.155
ork-0.152
lor-0.151
Token.
Token ID13
Feature activation-1.13e-03

Pos logit contributions
 ne+0.010
 subt+0.010
 caring+0.009
bel+0.009
ths+0.009
Neg logit contributions
 eye-0.011
anger-0.009
ocket-0.009
inger-0.009
 pad-0.009
Token
Token ID198
Feature activation+5.25e-03

Pos logit contributions
 subt+0.002
 ne+0.002
 caring+0.001
Yeah+0.001
 Sp+0.001
Neg logit contributions
 eye-0.002
ipper-0.002
inger-0.002
ocket-0.002
anger-0.002
Token
Token ID198
Feature activation-2.22e-03

Pos logit contributions
 eye+0.011
ocket+0.010
anger+0.010
 gate+0.010
ipper+0.010
Neg logit contributions
 ne-0.007
 caring-0.006
 subt-0.006
bel-0.005
 inviting-0.005
TokenShe
Token ID3347
Feature activation-1.58e-02

Pos logit contributions
 ne+0.003
 subt+0.003
 caring+0.003
ths+0.003
Yeah+0.003
Neg logit contributions
 eye-0.004
ocket-0.004
anger-0.004
 gate-0.004
 pad-0.004
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
 ne+0.010
 subt+0.009
 caring+0.009
ths+0.008
Yeah+0.008
Neg logit contributions
 eye-0.010
ocket-0.009
 gate-0.009
anger-0.009
 pad-0.009
Token look
Token ID804
Feature activation-4.24e-02

Pos logit contributions
 ne+0.020
 subt+0.019
 caring+0.018
 ang+0.017
bel+0.017
Neg logit contributions
 eye-0.018
ocket-0.016
anger-0.015
 gate-0.015
 pad-0.015
Token at
Token ID379
Feature activation-1.38e-03

Pos logit contributions
 ne+0.085
 subt+0.082
 caring+0.077
ths+0.073
 ang+0.073
Neg logit contributions
 eye-0.061
ocket-0.052
 gate-0.051
anger-0.050
 pad-0.050
Token the
Token ID262
Feature activation+1.00e-02

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
ths+0.002
 ang+0.002
Neg logit contributions
 eye-0.003
ocket-0.002
 pad-0.002
inger-0.002
anger-0.002
Token light
Token ID1657
Feature activation-1.08e-02

Pos logit contributions
 eye+0.016
ocket+0.014
 gate+0.014
anger+0.014
 pad+0.013
Neg logit contributions
 ne-0.019
 subt-0.018
 caring-0.016
 ang-0.016
ths-0.016
Tokenbul
Token ID15065
Feature activation-8.42e-02

Pos logit contributions
 ne+0.020
 subt+0.019
 caring+0.018
bel+0.018
ths+0.017
Neg logit contributions
 eye-0.016
ocket-0.015
inger-0.015
 Sasha-0.015
 pad-0.014
Tokenb
Token ID65
Feature activation+1.50e-02

Pos logit contributions
 ne+0.124
 subt+0.115
 caring+0.103
Yeah+0.102
Deal+0.096
Neg logit contributions
 eye-0.161
hole-0.153
anger-0.147
ocket-0.145
 gate-0.141
Token.
Token ID13
Feature activation+1.03e-03

Pos logit contributions
 eye+0.022
 Sasha+0.021
ocket+0.020
 pad+0.020
 gate+0.019
Neg logit contributions
 ne-0.026
 subt-0.025
bel-0.025
ths-0.024
 ang-0.024
Token It
Token ID632
Feature activation-1.98e-02

Pos logit contributions
 eye+0.002
ocket+0.001
inger+0.001
ipper+0.001
lor+0.001
Neg logit contributions
 subt-0.002
 ne-0.002
 caring-0.002
Yeah-0.002
bel-0.002
Token was
Token ID373
Feature activation+6.62e-03

Pos logit contributions
 subt+0.039
 ne+0.036
 caring+0.036
Yeah+0.035
Deal+0.034
Neg logit contributions
 eye-0.028
 gate-0.023
hole-0.022
ork-0.022
 pad-0.021
Token very
Token ID845
Feature activation+6.35e-03

Pos logit contributions
 eye+0.012
 gate+0.010
ocket+0.010
 pad+0.010
 envelope+0.010
Neg logit contributions
 subt-0.011
 ne-0.011
Yeah-0.009
ths-0.009
Deal-0.009
Token box
Token ID3091
Feature activation-2.84e-02

Pos logit contributions
 ne+0.022
 subt+0.021
 caring+0.019
ths+0.018
Yeah+0.018
Neg logit contributions
 eye-0.022
 gate-0.019
ocket-0.019
anger-0.019
 pad-0.019
Token next
Token ID1306
Feature activation+1.79e-03

Pos logit contributions
 ne+0.051
 subt+0.047
bel+0.045
 caring+0.044
ths+0.042
Neg logit contributions
 eye-0.045
ocket-0.041
 Sasha-0.040
anger-0.040
 pad-0.039
Token to
Token ID284
Feature activation+1.68e-03

Pos logit contributions
 eye+0.002
ocket+0.002
 gate+0.002
anger+0.002
 pad+0.002
Neg logit contributions
 ne-0.004
 subt-0.003
 caring-0.003
ths-0.003
 ang-0.003
Token the
Token ID262
Feature activation-4.72e-04

Pos logit contributions
 eye+0.003
ocket+0.003
 pad+0.003
 gate+0.003
anger+0.003
Neg logit contributions
 ne-0.003
 subt-0.002
 caring-0.002
 ang-0.002
ths-0.002
Token light
Token ID1657
Feature activation-1.06e-02

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
ths+0.001
 ang+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
inger-0.001
 Sasha-0.001
anger-0.001
Tokenbul
Token ID15065
Feature activation-8.40e-02

Pos logit contributions
 ne+0.019
 subt+0.017
bel+0.016
 caring+0.016
ths+0.015
Neg logit contributions
 eye-0.017
ocket-0.016
inger-0.016
 Sasha-0.016
 pad-0.015
Tokenb
Token ID65
Feature activation+1.71e-02

Pos logit contributions
 ne+0.119
 subt+0.112
 caring+0.100
Yeah+0.098
ths+0.092
Neg logit contributions
 eye-0.167
anger-0.154
hole-0.154
ocket-0.150
inger-0.145
Token,
Token ID11
Feature activation+6.79e-03

Pos logit contributions
 eye+0.026
ocket+0.024
 Sasha+0.024
 pad+0.023
 gate+0.023
Neg logit contributions
 ne-0.030
 subt-0.029
bel-0.028
ths-0.027
 caring-0.027
Token then
Token ID788
Feature activation-3.62e-04

Pos logit contributions
 eye+0.012
ocket+0.010
anger+0.010
 gate+0.010
inger+0.010
Neg logit contributions
 ne-0.012
 subt-0.011
 caring-0.010
 ang-0.010
ths-0.010
Token climbed
Token ID19952
Feature activation+1.34e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
ths+0.000
 ang+0.000
Neg logit contributions
 eye-0.001
ocket-0.001
inger-0.001
 pad-0.001
anger-0.001
Token up
Token ID510
Feature activation+7.27e-03

Pos logit contributions
 eye+0.003
ocket+0.002
ulp+0.002
adow+0.002
iya+0.002
Neg logit contributions
 ne-0.002
 subt-0.002
 ang-0.002
ths-0.002
bel-0.002
Token to
Token ID284
Feature activation-6.84e-03

Pos logit contributions
 eye+0.009
 gate+0.007
ocket+0.007
 pad+0.007
anger+0.007
Neg logit contributions
 ne-0.016
 subt-0.015
 caring-0.014
Yeah-0.014
ths-0.014
Token cry
Token ID3960
Feature activation-1.11e-02

Pos logit contributions
 subt+0.012
 ne+0.012
ths+0.012
Yeah+0.011
bel+0.011
Neg logit contributions
 eye-0.012
 gate-0.009
 doorstep-0.009
 garden-0.009
 envelope-0.008
Token and
Token ID290
Feature activation+4.92e-04

Pos logit contributions
 ne+0.022
 subt+0.021
 caring+0.019
 ang+0.019
ths+0.019
Neg logit contributions
 eye-0.017
ocket-0.014
 gate-0.014
anger-0.014
 pad-0.014
Token threw
Token ID9617
Feature activation+7.93e-03

Pos logit contributions
 eye+0.001
 gate+0.001
ocket+0.001
 pad+0.001
 Sasha+0.001
Neg logit contributions
 ne-0.001
 subt-0.001
 caring-0.001
ths-0.001
bel-0.001
Token a
Token ID257
Feature activation-1.02e-02

Pos logit contributions
 eye+0.013
 gate+0.011
ocket+0.011
 pad+0.011
anger+0.011
Neg logit contributions
 ne-0.015
 subt-0.014
 caring-0.013
ths-0.013
bel-0.012
Token tant
Token ID24246
Feature activation-8.33e-02

Pos logit contributions
 ne+0.015
 subt+0.014
 caring+0.013
 ang+0.012
ths+0.012
Neg logit contributions
 eye-0.020
ocket-0.018
 gate-0.017
anger-0.017
 pad-0.017
Tokenrum
Token ID6582
Feature activation+3.39e-03

Pos logit contributions
 ne+0.167
 subt+0.154
Yeah+0.141
 caring+0.139
ths+0.139
Neg logit contributions
 eye-0.125
ocket-0.117
inger-0.109
anger-0.108
ulp-0.103
Token.
Token ID13
Feature activation-1.57e-03

Pos logit contributions
 eye+0.006
ocket+0.005
inger+0.005
anger+0.005
 gate+0.005
Neg logit contributions
 ne-0.006
 subt-0.005
 caring-0.005
ths-0.005
Yeah-0.005
Token She
Token ID1375
Feature activation-1.39e-02

Pos logit contributions
 ne+0.003
 subt+0.003
 caring+0.003
Yeah+0.003
ths+0.002
Neg logit contributions
 eye-0.002
ocket-0.002
inger-0.002
anger-0.002
 gate-0.002
Token didn
Token ID1422
Feature activation-5.10e-03

Pos logit contributions
 subt+0.023
 ne+0.022
bel+0.020
 caring+0.020
ths+0.019
Neg logit contributions
 eye-0.025
 pad-0.021
 gate-0.021
ocket-0.020
ork-0.020
Token't
Token ID470
Feature activation+1.80e-02

Pos logit contributions
 ne+0.008
 subt+0.008
 caring+0.007
ths+0.007
 ang+0.006
Neg logit contributions
 eye-0.010
ocket-0.009
 pad-0.008
 gate-0.008
anger-0.008
Token kept
Token ID4030
Feature activation+3.85e-03

Pos logit contributions
 ne+0.022
 subt+0.021
 caring+0.019
bel+0.019
 ang+0.018
Neg logit contributions
 eye-0.029
 pad-0.025
ocket-0.025
 gate-0.025
anger-0.024
Token crying
Token ID13774
Feature activation-2.72e-02

Pos logit contributions
 eye+0.007
 gate+0.006
ocket+0.006
 Sasha+0.006
 pad+0.006
Neg logit contributions
 ne-0.006
 subt-0.006
bel-0.005
spe-0.005
 caring-0.005
Token and
Token ID290
Feature activation+3.45e-03

Pos logit contributions
 ne+0.055
 subt+0.052
 caring+0.049
 ang+0.047
ths+0.047
Neg logit contributions
 eye-0.039
ocket-0.033
 gate-0.033
anger-0.032
 pad-0.032
Token throwing
Token ID9644
Feature activation+3.61e-03

Pos logit contributions
 eye+0.006
 gate+0.005
 pad+0.005
ocket+0.005
 Sasha+0.005
Neg logit contributions
 ne-0.006
 subt-0.005
 caring-0.005
bel-0.005
Yeah-0.005
Token a
Token ID257
Feature activation-1.25e-02

Pos logit contributions
 eye+0.006
 gate+0.005
 pad+0.005
ocket+0.005
 Sasha+0.005
Neg logit contributions
 ne-0.007
 subt-0.007
ths-0.006
 caring-0.006
bel-0.006
Token tant
Token ID24246
Feature activation-8.32e-02

Pos logit contributions
 ne+0.018
 subt+0.017
 caring+0.015
 ang+0.015
ths+0.015
Neg logit contributions
 eye-0.025
ocket-0.022
 gate-0.022
anger-0.022
 pad-0.022
Tokenrum
Token ID6582
Feature activation+5.18e-03

Pos logit contributions
 ne+0.161
 subt+0.150
 caring+0.136
Yeah+0.135
 ang+0.134
Neg logit contributions
 eye-0.129
ocket-0.120
inger-0.113
anger-0.112
ork-0.107
Token.
Token ID13
Feature activation-7.69e-04

Pos logit contributions
 eye+0.009
ocket+0.008
inger+0.008
 gate+0.008
anger+0.008
Neg logit contributions
 ne-0.009
 subt-0.008
 caring-0.008
ths-0.007
Yeah-0.007
Token
Token ID198
Feature activation+1.80e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
Yeah+0.001
ths+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
 gate-0.001
inger-0.001
anger-0.001
Token
Token ID198
Feature activation-7.70e-03

Pos logit contributions
 eye+0.003
ocket+0.003
 gate+0.003
 pad+0.003
anger+0.003
Neg logit contributions
 ne-0.003
 subt-0.003
 caring-0.003
bel-0.002
Yeah-0.002
TokenFinally
Token ID11158
Feature activation-2.38e-02

Pos logit contributions
 ne+0.012
 subt+0.011
 caring+0.010
 ang+0.009
ths+0.009
Neg logit contributions
 eye-0.015
ocket-0.013
 gate-0.013
 pad-0.013
anger-0.013
Token and
Token ID290
Feature activation-6.31e-03

Pos logit contributions
 ne+0.025
 subt+0.024
 caring+0.023
 ang+0.022
ths+0.022
Neg logit contributions
 eye-0.021
ocket-0.019
inger-0.019
anger-0.018
 gate-0.018
Token started
Token ID2067
Feature activation+9.99e-03

Pos logit contributions
 subt+0.011
 ne+0.010
bel+0.009
spe+0.009
ths+0.009
Neg logit contributions
 eye-0.010
 gate-0.009
 Sasha-0.009
 envelope-0.008
 pad-0.008
Token to
Token ID284
Feature activation-4.62e-03

Pos logit contributions
 eye+0.012
 gate+0.010
ocket+0.010
 pad+0.010
 Sasha+0.010
Neg logit contributions
 ne-0.022
 subt-0.021
 caring-0.020
ths-0.019
Yeah-0.019
Token throw
Token ID3714
Feature activation+4.85e-03

Pos logit contributions
 ne+0.007
 subt+0.007
bel+0.006
ths+0.006
 caring+0.006
Neg logit contributions
 eye-0.009
 gate-0.007
 Sasha-0.007
 pad-0.007
ocket-0.007
Token a
Token ID257
Feature activation-1.04e-02

Pos logit contributions
 eye+0.009
ocket+0.008
anger+0.008
 gate+0.008
 pad+0.007
Neg logit contributions
 ne-0.008
 subt-0.008
 caring-0.007
 ang-0.007
ths-0.007
Token tant
Token ID24246
Feature activation-8.20e-02

Pos logit contributions
 ne+0.017
 subt+0.015
 caring+0.015
 ang+0.014
bel+0.014
Neg logit contributions
 eye-0.018
ocket-0.017
inger-0.016
anger-0.016
 Sasha-0.016
Tokenrum
Token ID6582
Feature activation+7.51e-03

Pos logit contributions
 ne+0.161
 subt+0.149
Yeah+0.135
 caring+0.134
ths+0.132
Neg logit contributions
 eye-0.125
ocket-0.116
inger-0.111
anger-0.109
ork-0.104
Token.
Token ID13
Feature activation-1.74e-03

Pos logit contributions
 eye+0.013
ocket+0.012
inger+0.012
anger+0.012
 gate+0.012
Neg logit contributions
 ne-0.012
 subt-0.011
 caring-0.010
ths-0.010
Yeah-0.010
Token Suddenly
Token ID24975
Feature activation-2.26e-03

Pos logit contributions
 subt+0.003
 ne+0.003
 caring+0.003
Yeah+0.003
ths+0.002
Neg logit contributions
 eye-0.003
ocket-0.003
inger-0.003
ipper-0.002
 gate-0.002
Token,
Token ID11
Feature activation-1.27e-03

Pos logit contributions
 ne+0.003
 subt+0.003
 caring+0.002
 ang+0.002
ths+0.002
Neg logit contributions
 eye-0.005
ocket-0.005
inger-0.004
 pad-0.004
anger-0.004
Token she
Token ID673
Feature activation+1.97e-03

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
ths+0.002
 ang+0.002
Neg logit contributions
 eye-0.002
ocket-0.002
 gate-0.002
anger-0.002
 pad-0.002
Token tree
Token ID5509
Feature activation+2.97e-03

Pos logit contributions
 ne+0.024
 subt+0.022
 caring+0.021
 ang+0.020
ths+0.019
Neg logit contributions
 eye-0.019
ocket-0.017
inger-0.017
anger-0.017
lor-0.016
Token with
Token ID351
Feature activation-1.64e-04

Pos logit contributions
 eye+0.006
ocket+0.006
anger+0.006
inger+0.005
 pad+0.005
Neg logit contributions
 ne-0.004
 subt-0.003
 caring-0.003
bel-0.003
ths-0.003
Token many
Token ID867
Feature activation-1.11e-02

Pos logit contributions
 ne+0.000
 subt+0.000
 caring+0.000
ths+0.000
 ang+0.000
Neg logit contributions
 eye-0.000
 gate-0.000
ocket-0.000
 pad-0.000
anger-0.000
Token branches
Token ID13737
Feature activation+3.45e-02

Pos logit contributions
 ne+0.021
 subt+0.020
 caring+0.019
ths+0.018
 ang+0.018
Neg logit contributions
 eye-0.017
ocket-0.015
anger-0.014
 gate-0.014
 pad-0.014
Token.
Token ID13
Feature activation+2.63e-03

Pos logit contributions
ocket+0.071
iya+0.067
 eye+0.067
mare+0.066
lor+0.065
Neg logit contributions
 ne-0.045
 hon-0.033
 subt-0.033
 how-0.032
 caring-0.032
Token Sara
Token ID24799
Feature activation+4.71e-02

Pos logit contributions
ocket+0.006
 eye+0.005
inger+0.005
anger+0.005
 envelope+0.005
Neg logit contributions
 ne-0.004
Yeah-0.003
 subt-0.003
Really-0.003
 Bones-0.003
Token wanted
Token ID2227
Feature activation+2.33e-02

Pos logit contributions
 eye+0.093
ocket+0.086
anger+0.085
 gate+0.081
inger+0.081
Neg logit contributions
 ne-0.070
 subt-0.064
 caring-0.058
Yeah-0.056
ths-0.055
Token to
Token ID284
Feature activation-3.08e-03

Pos logit contributions
 eye+0.044
ocket+0.041
anger+0.040
inger+0.040
lor+0.040
Neg logit contributions
 ne-0.037
 caring-0.033
 subt-0.032
 ang-0.030
ths-0.029
Token climb
Token ID12080
Feature activation-1.02e-02

Pos logit contributions
 ne+0.005
 subt+0.004
 caring+0.004
bel+0.004
 ang+0.004
Neg logit contributions
 eye-0.006
ocket-0.005
anger-0.005
inger-0.005
lor-0.005
Token the
Token ID262
Feature activation+1.96e-03

Pos logit contributions
 ne+0.018
 subt+0.017
 caring+0.016
 ang+0.015
ths+0.015
Neg logit contributions
 eye-0.017
ocket-0.015
anger-0.014
 pad-0.014
inger-0.014
Token tree
Token ID5509
Feature activation+8.59e-03

Pos logit contributions
 eye+0.003
ocket+0.003
ipper+0.003
ulp+0.002
iya+0.002
Neg logit contributions
 ne-0.004
 subt-0.003
ths-0.003
 caring-0.003
 ang-0.003
Token with
Token ID351
Feature activation+5.96e-03

Pos logit contributions
 eye+0.027
ocket+0.023
anger+0.023
 gate+0.023
inger+0.023
Neg logit contributions
 ne-0.027
 subt-0.024
 caring-0.024
bel-0.022
ths-0.022
Token diced
Token ID48898
Feature activation+1.25e-02

Pos logit contributions
 eye+0.010
ocket+0.009
 gate+0.009
 pad+0.009
anger+0.009
Neg logit contributions
 ne-0.010
 subt-0.010
 caring-0.009
ths-0.009
 ang-0.009
Token carrots
Token ID34397
Feature activation+4.92e-02

Pos logit contributions
 eye+0.021
ocket+0.018
 gate+0.018
anger+0.018
 pad+0.018
Neg logit contributions
 ne-0.022
 subt-0.021
 caring-0.020
 ang-0.019
ths-0.019
Token.
Token ID13
Feature activation+2.68e-03

Pos logit contributions
 eye+0.094
ocket+0.088
anger+0.088
inger+0.086
 gate+0.084
Neg logit contributions
 ne-0.069
 caring-0.060
 subt-0.056
bel-0.054
ths-0.051
Token The
Token ID383
Feature activation-4.53e-03

Pos logit contributions
 eye+0.005
ocket+0.004
anger+0.004
 gate+0.004
inger+0.004
Neg logit contributions
 ne-0.005
 subt-0.004
 caring-0.004
Yeah-0.004
ths-0.004
Token b
Token ID275
Feature activation+4.74e-02

Pos logit contributions
 subt+0.008
 ne+0.008
bel+0.007
 caring+0.007
ths+0.007
Neg logit contributions
 eye-0.007
 gate-0.006
ocket-0.006
 pad-0.006
anger-0.006
Tokenunn
Token ID20935
Feature activation+5.44e-02

Pos logit contributions
 gate+0.088
 eye+0.086
hole+0.080
 doorstep+0.080
lor+0.079
Neg logit contributions
 ne-0.064
 subt-0.058
 caring-0.056
Yeah-0.053
bel-0.052
Tokenies
Token ID444
Feature activation+3.30e-02

Pos logit contributions
 eye+0.093
 gate+0.085
ocket+0.084
anger+0.083
ork+0.082
Neg logit contributions
 ne-0.089
 subt-0.081
 caring-0.080
bel-0.076
ths-0.074
Token were
Token ID547
Feature activation+6.14e-03

Pos logit contributions
 eye+0.057
ocket+0.050
 pad+0.048
 gate+0.048
anger+0.046
Neg logit contributions
 subt-0.056
 ne-0.056
 caring-0.049
 ang-0.048
ths-0.047
Token very
Token ID845
Feature activation+7.49e-03

Pos logit contributions
 eye+0.012
ocket+0.011
 Sasha+0.011
 gate+0.011
 pad+0.011
Neg logit contributions
 subt-0.009
 ne-0.008
ths-0.007
Yeah-0.007
bel-0.007
Token happy
Token ID3772
Feature activation+1.25e-02

Pos logit contributions
 eye+0.015
ocket+0.013
ulp+0.012
 Sasha+0.012
 gate+0.012
Neg logit contributions
 subt-0.011
 ne-0.011
ths-0.010
 ang-0.009
penter-0.009
Token it
Token ID340
Feature activation-1.82e-02

Pos logit contributions
 eye+0.052
ocket+0.047
anger+0.047
inger+0.045
 pad+0.045
Neg logit contributions
 ne-0.056
 subt-0.051
 caring-0.049
 ang-0.048
Yeah-0.046
Token some
Token ID617
Feature activation+2.16e-02

Pos logit contributions
 ne+0.033
 subt+0.033
ths+0.029
 caring+0.029
Yeah+0.028
Neg logit contributions
 eye-0.029
ocket-0.025
 gate-0.025
 pad-0.024
anger-0.024
Token water
Token ID1660
Feature activation+1.34e-02

Pos logit contributions
 eye+0.033
ocket+0.031
inger+0.030
anger+0.030
 gate+0.030
Neg logit contributions
 ne-0.040
 subt-0.036
 caring-0.035
 ang-0.033
bel-0.032
Token and
Token ID290
Feature activation+1.21e-02

Pos logit contributions
 eye+0.023
ocket+0.021
inger+0.020
anger+0.020
 gate+0.020
Neg logit contributions
 ne-0.023
 subt-0.019
 caring-0.019
bel-0.018
 ang-0.018
Token a
Token ID257
Feature activation+7.76e-04

Pos logit contributions
 eye+0.021
 gate+0.018
ocket+0.017
 pad+0.017
 Sasha+0.017
Neg logit contributions
 ne-0.021
 subt-0.020
ths-0.018
 caring-0.018
Yeah-0.018
Token bisc
Token ID36228
Feature activation+7.13e-02

Pos logit contributions
 eye+0.001
ocket+0.001
anger+0.001
inger+0.001
 pad+0.001
Neg logit contributions
 ne-0.001
 subt-0.001
 caring-0.001
 ang-0.001
bel-0.001
Tokenuit
Token ID5013
Feature activation+1.11e-02

Pos logit contributions
 eye+0.110
 gate+0.107
ubs+0.103
 Sasha+0.103
iya+0.101
Neg logit contributions
 ne-0.120
 subt-0.097
 caring-0.094
Deal-0.093
Yeah-0.093
Token.
Token ID13
Feature activation+3.97e-03

Pos logit contributions
 eye+0.019
ocket+0.018
inger+0.017
anger+0.017
 Sasha+0.017
Neg logit contributions
 ne-0.018
 subt-0.015
 caring-0.015
bel-0.015
 ang-0.014
Token The
Token ID383
Feature activation-4.64e-03

Pos logit contributions
 eye+0.008
ocket+0.007
inger+0.007
anger+0.007
lor+0.007
Neg logit contributions
 ne-0.006
 subt-0.006
Yeah-0.005
 caring-0.005
 forgiving-0.005
Token dog
Token ID3290
Feature activation+3.75e-03

Pos logit contributions
 ne+0.009
 caring+0.008
 subt+0.008
 ang+0.008
Yeah+0.008
Neg logit contributions
ocket-0.006
 eye-0.006
inger-0.006
anger-0.006
hole-0.006
Token smiled
Token ID13541
Feature activation-3.60e-03

Pos logit contributions
 eye+0.007
 pad+0.006
 gate+0.006
ocket+0.006
 Sasha+0.006
Neg logit contributions
 subt-0.006
ths-0.005
 caring-0.005
 ne-0.005
spe-0.005
Token sky
Token ID6766
Feature activation-3.68e-03

Pos logit contributions
 eye+0.022
ocket+0.021
anger+0.020
inger+0.019
ulp+0.019
Neg logit contributions
 ne-0.041
 subt-0.038
 caring-0.035
Yeah-0.035
 ang-0.034
Token.
Token ID13
Feature activation-4.86e-03

Pos logit contributions
 ne+0.007
 subt+0.006
 caring+0.006
ths+0.005
 ang+0.005
Neg logit contributions
 eye-0.006
ocket-0.005
anger-0.005
 gate-0.005
inger-0.005
Token His
Token ID2399
Feature activation-1.18e-02

Pos logit contributions
 subt+0.006
 ne+0.006
Yeah+0.005
 caring+0.005
 ang+0.005
Neg logit contributions
 eye-0.010
ocket-0.010
inger-0.010
anger-0.009
 gate-0.009
Token mother
Token ID2802
Feature activation-9.01e-03

Pos logit contributions
 ne+0.023
 subt+0.022
 caring+0.021
ths+0.020
 ang+0.020
Neg logit contributions
 eye-0.017
ocket-0.015
anger-0.015
 gate-0.014
 pad-0.014
Token had
Token ID550
Feature activation+2.25e-02

Pos logit contributions
 ne+0.014
 subt+0.013
 caring+0.012
 ang+0.012
bel+0.011
Neg logit contributions
 eye-0.017
ocket-0.015
 gate-0.015
 pad-0.015
anger-0.014
Token promised
Token ID8072
Feature activation+5.00e-02

Pos logit contributions
 eye+0.043
anger+0.040
ocket+0.039
inger+0.038
 gate+0.037
Neg logit contributions
 ne-0.036
 subt-0.033
 caring-0.031
 ang-0.029
 fought-0.028
Token him
Token ID683
Feature activation-2.82e-02

Pos logit contributions
 eye+0.098
ocket+0.090
 pad+0.089
anger+0.087
 gate+0.087
Neg logit contributions
 ne-0.073
 subt-0.069
 caring-0.067
ths-0.060
 ang-0.059
Token a
Token ID257
Feature activation-1.35e-02

Pos logit contributions
 ne+0.042
 caring+0.040
 subt+0.038
 forgiving+0.033
ths+0.033
Neg logit contributions
 eye-0.052
anger-0.048
inger-0.047
 pad-0.047
 gate-0.047
Token reward
Token ID6721
Feature activation+2.29e-02

Pos logit contributions
 ne+0.023
 subt+0.022
ths+0.020
bel+0.019
 ang+0.019
Neg logit contributions
 eye-0.024
 gate-0.021
 pad-0.020
ocket-0.020
anger-0.020
Token,
Token ID11
Feature activation-1.25e-03

Pos logit contributions
 eye+0.040
inger+0.038
ulp+0.037
ocket+0.036
 gate+0.036
Neg logit contributions
 ne-0.036
 subt-0.032
 caring-0.032
bel-0.029
ths-0.028
Token and
Token ID290
Feature activation-8.42e-03

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
Yeah+0.002
 ang+0.001
Neg logit contributions
 eye-0.002
anger-0.002
inger-0.002
ocket-0.002
ork-0.002
Token
Token ID220
Feature activation-2.35e-02

Pos logit contributions
 eye+0.021
ocket+0.019
 gate+0.018
anger+0.018
inger+0.018
Neg logit contributions
 ne-0.021
 subt-0.021
 caring-0.019
Yeah-0.018
 ang-0.018
Token
Token ID198
Feature activation-3.57e-04

Pos logit contributions
 ne+0.028
 subt+0.027
 caring+0.026
Yeah+0.024
 ang+0.024
Neg logit contributions
 eye-0.050
ocket-0.048
 gate-0.044
 envelope-0.044
 pad-0.044
Token
Token ID198
Feature activation-3.38e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
Yeah+0.001
ths+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
 gate-0.000
anger-0.000
 pad-0.000
TokenJane
Token ID41083
Feature activation+3.35e-03

Pos logit contributions
 ne+0.006
 subt+0.006
 ang+0.005
 caring+0.005
bel+0.005
Neg logit contributions
 eye-0.005
 Sasha-0.005
ocket-0.005
 gate-0.005
ork-0.005
Token was
Token ID373
Feature activation-2.07e-04

Pos logit contributions
 eye+0.006
 pad+0.005
ocket+0.005
 Sasha+0.005
 gate+0.005
Neg logit contributions
 subt-0.005
 ne-0.005
bel-0.005
 caring-0.004
nam-0.004
Token able
Token ID1498
Feature activation+5.72e-02

Pos logit contributions
 subt+0.000
 ne+0.000
bel+0.000
ths+0.000
 caring+0.000
Neg logit contributions
 eye-0.000
ocket-0.000
 gate-0.000
 Sasha-0.000
 pad-0.000
Token to
Token ID284
Feature activation-1.22e-02

Pos logit contributions
 eye+0.095
anger+0.089
ocket+0.087
lor+0.081
inger+0.081
Neg logit contributions
 ne-0.104
 caring-0.097
 subt-0.096
 ang-0.088
 forgiving-0.086
Token understand
Token ID1833
Feature activation-1.12e-02

Pos logit contributions
 ne+0.019
 subt+0.018
 caring+0.017
 ang+0.016
ths+0.015
Neg logit contributions
 eye-0.022
ocket-0.020
anger-0.020
 pad-0.020
 gate-0.020
Token and
Token ID290
Feature activation-8.31e-03

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.015
 ang+0.014
ths+0.014
Neg logit contributions
 eye-0.021
ocket-0.019
anger-0.019
 gate-0.019
 pad-0.019
Token she
Token ID673
Feature activation-5.62e-03

Pos logit contributions
 ne+0.013
 subt+0.013
 caring+0.011
ths+0.011
Yeah+0.011
Neg logit contributions
 eye-0.015
 gate-0.013
ocket-0.013
 Sasha-0.013
 pad-0.013
Token nodded
Token ID14464
Feature activation-1.34e-02

Pos logit contributions
 subt+0.009
 ne+0.008
bel+0.008
 caring+0.007
 ang+0.007
Neg logit contributions
 eye-0.010
ocket-0.009
 gate-0.009
 pad-0.009
 Sasha-0.009
Token
Token ID198
Feature activation-1.96e-03

Pos logit contributions
 eye+0.004
ocket+0.004
 gate+0.003
anger+0.003
 pad+0.003
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.004
Yeah-0.004
bel-0.004
TokenSarah
Token ID29284
Feature activation-6.64e-03

Pos logit contributions
 ne+0.003
 subt+0.003
 ang+0.003
 caring+0.002
bel+0.002
Neg logit contributions
 eye-0.004
 gate-0.003
 pad-0.003
ork-0.003
inger-0.003
Token was
Token ID373
Feature activation-1.27e-03

Pos logit contributions
 ne+0.009
 subt+0.009
 ang+0.009
 caring+0.008
bel+0.008
Neg logit contributions
 eye-0.013
 pad-0.011
ocket-0.011
 gate-0.011
inger-0.011
Token surprised
Token ID6655
Feature activation+4.44e-03

Pos logit contributions
 ne+0.002
 subt+0.002
ths+0.002
Deal+0.002
bel+0.002
Neg logit contributions
 eye-0.002
 gate-0.002
 pad-0.002
ocket-0.002
 Sasha-0.002
Token and
Token ID290
Feature activation-4.52e-03

Pos logit contributions
 eye+0.006
 gate+0.005
ocket+0.005
 pad+0.005
anger+0.005
Neg logit contributions
 ne-0.009
 subt-0.009
 caring-0.008
 ang-0.008
bel-0.008
Token happy
Token ID3772
Feature activation+4.18e-03

Pos logit contributions
 subt+0.008
 ne+0.007
ths+0.007
 Amen+0.007
 cartoons+0.007
Neg logit contributions
 eye-0.007
 gate-0.006
 Sasha-0.006
 pad-0.006
ocket-0.006
Token.
Token ID13
Feature activation-3.76e-03

Pos logit contributions
 eye+0.006
 gate+0.005
ocket+0.005
anger+0.005
 pad+0.005
Neg logit contributions
 ne-0.008
 subt-0.008
 caring-0.007
 ang-0.007
ths-0.007
Token She
Token ID1375
Feature activation-6.30e-03

Pos logit contributions
 ne+0.008
 subt+0.008
 caring+0.007
 ang+0.007
ths+0.007
Neg logit contributions
 eye-0.005
ocket-0.004
 gate-0.004
anger-0.004
 pad-0.004
Token thanked
Token ID26280
Feature activation-1.03e-02

Pos logit contributions
 ne+0.010
 subt+0.009
 caring+0.008
 ang+0.008
ths+0.008
Neg logit contributions
 eye-0.012
ocket-0.011
 gate-0.010
 pad-0.010
anger-0.010
Token Lily
Token ID20037
Feature activation-4.57e-04

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.015
 ang+0.014
bel+0.014
Neg logit contributions
 eye-0.018
ocket-0.016
 gate-0.016
anger-0.015
 Sasha-0.015
Token and
Token ID290
Feature activation-8.49e-04

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
 ang+0.001
ths+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
anger-0.001
inger-0.001
 gate-0.001
Token for
Token ID329
Feature activation+4.20e-03

Pos logit contributions
 ne+0.037
 subt+0.035
 caring+0.033
 ang+0.030
ths+0.030
Neg logit contributions
 eye-0.041
ocket-0.037
anger-0.036
inger-0.036
 pad-0.035
Token making
Token ID1642
Feature activation+2.23e-03

Pos logit contributions
 eye+0.007
ocket+0.007
 gate+0.007
 envelope+0.006
 Sasha+0.006
Neg logit contributions
 ne-0.007
 subt-0.006
bel-0.006
spe-0.006
Yeah-0.006
Token me
Token ID502
Feature activation-1.39e-03

Pos logit contributions
 eye+0.004
ocket+0.003
 gate+0.003
anger+0.003
 pad+0.003
Neg logit contributions
 ne-0.004
 subt-0.004
 caring-0.003
 ang-0.003
ths-0.003
Token happy
Token ID3772
Feature activation-6.22e-03

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
ths+0.002
 ang+0.002
Neg logit contributions
 eye-0.003
ocket-0.003
 gate-0.002
anger-0.002
 pad-0.002
Token!"
Token ID2474
Feature activation+2.04e-02

Pos logit contributions
 ne+0.013
 subt+0.013
 ang+0.012
ths+0.012
 caring+0.012
Neg logit contributions
 eye-0.008
 gate-0.007
ocket-0.006
 pad-0.006
anger-0.006
Token
Token ID198
Feature activation-5.48e-04

Pos logit contributions
 eye+0.035
ocket+0.031
inger+0.030
 gate+0.030
anger+0.030
Neg logit contributions
 ne-0.036
 subt-0.035
 caring-0.032
Yeah-0.031
 ang-0.030
Token
Token ID198
Feature activation-3.74e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
bel+0.001
Yeah+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
 gate-0.001
 pad-0.001
anger-0.001
TokenFrom
Token ID4863
Feature activation-3.00e-02

Pos logit contributions
 ne+0.007
 subt+0.007
 ang+0.006
 caring+0.006
ths+0.006
Neg logit contributions
 eye-0.005
ork-0.005
 gate-0.005
inger-0.005
lor-0.005
Token that
Token ID326
Feature activation-8.48e-03

Pos logit contributions
 ne+0.050
 subt+0.048
 caring+0.043
 ang+0.041
bel+0.041
Neg logit contributions
 eye-0.054
ocket-0.047
 gate-0.047
anger-0.046
 pad-0.046
Token day
Token ID1110
Feature activation-3.32e-02

Pos logit contributions
 ne+0.014
 subt+0.013
 caring+0.012
 ang+0.011
ths+0.011
Neg logit contributions
 eye-0.015
ocket-0.014
anger-0.014
inger-0.014
 gate-0.014
Token on
Token ID319
Feature activation-6.83e-03

Pos logit contributions
 ne+0.051
 subt+0.048
ths+0.042
 caring+0.041
 ang+0.039
Neg logit contributions
 eye-0.063
ocket-0.057
 pad-0.057
inger-0.056
 gate-0.055
Token different
Token ID1180
Feature activation-1.03e-02

Pos logit contributions
 eye+0.012
anger+0.012
ocket+0.012
inger+0.012
 pad+0.011
Neg logit contributions
 ne-0.008
 caring-0.007
 subt-0.007
 ang-0.006
 inviting-0.006
Token places
Token ID4113
Feature activation+1.34e-02

Pos logit contributions
 ne+0.020
 subt+0.019
 caring+0.017
 ang+0.017
ths+0.017
Neg logit contributions
 eye-0.016
ocket-0.014
anger-0.014
 pad-0.014
 gate-0.013
Token with
Token ID351
Feature activation+4.11e-03

Pos logit contributions
 eye+0.027
inger+0.026
anger+0.025
 pad+0.025
ulp+0.025
Neg logit contributions
 ne-0.019
 caring-0.016
 subt-0.015
bel-0.014
nam-0.014
Token her
Token ID607
Feature activation+1.19e-02

Pos logit contributions
 eye+0.008
anger+0.008
inger+0.008
 pad+0.008
ocket+0.008
Neg logit contributions
 ne-0.006
 subt-0.005
 caring-0.005
bel-0.004
nam-0.004
Token family
Token ID1641
Feature activation-1.23e-02

Pos logit contributions
 eye+0.018
ocket+0.016
anger+0.016
 pad+0.016
inger+0.016
Neg logit contributions
 ne-0.023
 subt-0.021
 caring-0.020
bel-0.019
ths-0.019
Token.
Token ID13
Feature activation+5.10e-03

Pos logit contributions
 ne+0.021
 subt+0.020
 caring+0.019
ths+0.017
bel+0.017
Neg logit contributions
 eye-0.020
ocket-0.019
anger-0.018
inger-0.018
 gate-0.018
Token One
Token ID1881
Feature activation+2.12e-02

Pos logit contributions
ocket+0.012
 eye+0.012
inger+0.012
anger+0.011
 gate+0.011
Neg logit contributions
 ne-0.005
 subt-0.004
 cartoons-0.004
 caring-0.004
Yeah-0.003
Token day
Token ID1110
Feature activation-2.50e-02

Pos logit contributions
 eye+0.039
ocket+0.038
anger+0.037
inger+0.037
 pad+0.036
Neg logit contributions
 ne-0.032
 caring-0.028
 subt-0.027
bel-0.025
Yeah-0.025
Token,
Token ID11
Feature activation-5.74e-03

Pos logit contributions
 ne+0.040
 subt+0.037
 caring+0.034
 ang+0.032
ths+0.032
Neg logit contributions
 eye-0.047
ocket-0.042
 gate-0.041
anger-0.041
 pad-0.040
Token they
Token ID484
Feature activation+1.67e-02

Pos logit contributions
 subt+0.011
 ne+0.011
ths+0.010
 caring+0.010
 ang+0.009
Neg logit contributions
 eye-0.009
 gate-0.007
 Sasha-0.007
ocket-0.007
anger-0.007
Token went
Token ID1816
Feature activation-1.07e-02

Pos logit contributions
 eye+0.036
anger+0.035
ocket+0.034
inger+0.034
ulp+0.033
Neg logit contributions
 ne-0.022
 subt-0.017
 fought-0.017
bel-0.016
ths-0.016
Token,
Token ID11
Feature activation-2.33e-02

Pos logit contributions
 ne+0.005
 subt+0.004
 caring+0.004
ths+0.004
Yeah+0.004
Neg logit contributions
 eye-0.004
ocket-0.003
 gate-0.003
 pad-0.003
anger-0.003
Token you
Token ID345
Feature activation-1.42e-02

Pos logit contributions
 ne+0.041
 subt+0.038
 caring+0.036
Yeah+0.033
 ang+0.033
Neg logit contributions
 eye-0.040
inger-0.035
ocket-0.035
anger-0.035
 gate-0.034
Token are
Token ID389
Feature activation-1.29e-02

Pos logit contributions
 ne+0.031
 subt+0.030
 caring+0.028
ths+0.027
 ang+0.027
Neg logit contributions
 eye-0.018
ocket-0.016
 gate-0.015
anger-0.015
 pad-0.015
Token very
Token ID845
Feature activation-1.15e-02

Pos logit contributions
 subt+0.021
 ne+0.021
bel+0.020
ths+0.019
Yeah+0.018
Neg logit contributions
 eye-0.023
ocket-0.020
 gate-0.020
 Sasha-0.020
 pad-0.019
Token generous
Token ID14431
Feature activation+6.36e-03

Pos logit contributions
 subt+0.018
ths+0.016
 ne+0.016
bel+0.015
 cartoons+0.015
Neg logit contributions
 eye-0.021
adow-0.019
ocket-0.018
 Sasha-0.018
inger-0.018
Token,"
Token ID553
Feature activation+1.84e-02

Pos logit contributions
 eye+0.010
 gate+0.008
ocket+0.008
 pad+0.008
anger+0.008
Neg logit contributions
 ne-0.012
 subt-0.012
 caring-0.011
ths-0.011
 ang-0.011
Token Lily
Token ID20037
Feature activation-3.49e-05

Pos logit contributions
 eye+0.027
 Sasha+0.024
 gate+0.023
ocket+0.023
 envelope+0.022
Neg logit contributions
 subt-0.034
 ne-0.033
ths-0.032
bel-0.030
spe-0.030
Token said
Token ID531
Feature activation+2.73e-02

Pos logit contributions
 subt+0.000
 ne+0.000
ths+0.000
 caring+0.000
bel+0.000
Neg logit contributions
 eye-0.000
 pad-0.000
 gate-0.000
ocket-0.000
 Sasha-0.000
Token,
Token ID11
Feature activation-1.75e-02

Pos logit contributions
 eye+0.036
ocket+0.029
 gate+0.028
 Sasha+0.028
anger+0.028
Neg logit contributions
 subt-0.057
 ne-0.056
ths-0.052
 caring-0.052
Yeah-0.051
Token smiling
Token ID16755
Feature activation-2.05e-02

Pos logit contributions
 subt+0.032
 ne+0.028
Deal+0.028
ths+0.028
bel+0.028
Neg logit contributions
 eye-0.027
 pad-0.024
 Sasha-0.023
 gate-0.023
 hand-0.023
Token.
Token ID13
Feature activation-2.61e-03

Pos logit contributions
 subt+0.048
 ne+0.045
ths+0.043
 caring+0.043
Spe+0.041
Neg logit contributions
 eye-0.023
 pad-0.018
 gate-0.018
ocket-0.017
 Sasha-0.016
Token started
Token ID2067
Feature activation+1.58e-02

Pos logit contributions
 subt+0.005
 ne+0.005
 caring+0.005
bel+0.005
 ang+0.005
Neg logit contributions
 eye-0.006
 pad-0.005
 gate-0.005
ocket-0.005
 Sasha-0.005
Token to
Token ID284
Feature activation+6.78e-04

Pos logit contributions
 eye+0.017
 gate+0.014
 pad+0.014
ork+0.013
 Sasha+0.013
Neg logit contributions
 ne-0.036
 subt-0.035
Deal-0.033
ths-0.033
 caring-0.032
Token wr
Token ID1319
Feature activation-6.22e-03

Pos logit contributions
 eye+0.001
 gate+0.001
 Sasha+0.001
 net+0.001
 pad+0.001
Neg logit contributions
 subt-0.001
ths-0.001
 ne-0.001
years-0.001
bel-0.001
Tokeniggle
Token ID24082
Feature activation-1.03e-02

Pos logit contributions
 ne+0.012
 subt+0.011
 caring+0.011
 ang+0.010
ths+0.010
Neg logit contributions
 eye-0.009
ocket-0.008
 gate-0.008
 pad-0.008
anger-0.007
Token around
Token ID1088
Feature activation-1.06e-02

Pos logit contributions
 subt+0.020
 ne+0.020
ths+0.018
 caring+0.018
Yeah+0.018
Neg logit contributions
 eye-0.015
 gate-0.012
 pad-0.012
ocket-0.012
anger-0.012
Token the
Token ID262
Feature activation+4.75e-03

Pos logit contributions
 ne+0.020
 subt+0.020
 caring+0.018
ths+0.017
bel+0.017
Neg logit contributions
 eye-0.016
 gate-0.014
 pad-0.014
ocket-0.013
 Sasha-0.013
Token hammer
Token ID15554
Feature activation-2.52e-02

Pos logit contributions
anger+0.005
ocket+0.005
 eye+0.005
inger+0.005
lor+0.005
Neg logit contributions
 ne-0.010
 subt-0.010
 caring-0.010
 ang-0.009
ths-0.009
Token vicious
Token ID17686
Feature activation+1.37e-03

Pos logit contributions
 ne+0.049
 subt+0.048
 caring+0.044
Yeah+0.042
ths+0.042
Neg logit contributions
 eye-0.037
 gate-0.032
 pad-0.031
ocket-0.031
anger-0.031
Tokenly
Token ID306
Feature activation+1.98e-02

Pos logit contributions
 eye+0.003
ocket+0.002
anger+0.002
 gate+0.002
inger+0.002
Neg logit contributions
 ne-0.002
 subt-0.002
 caring-0.002
 ang-0.002
ths-0.002
Token.
Token ID13
Feature activation-6.57e-03

Pos logit contributions
 eye+0.031
ocket+0.028
anger+0.027
inger+0.027
 gate+0.026
Neg logit contributions
 ne-0.037
 subt-0.034
 caring-0.032
 ang-0.030
bel-0.030
Token Then
Token ID3244
Feature activation-5.29e-03

Pos logit contributions
 ne+0.011
 subt+0.010
 caring+0.009
Yeah+0.009
 ang+0.009
Neg logit contributions
 eye-0.012
ocket-0.011
 gate-0.010
anger-0.010
inger-0.010
Token was
Token ID373
Feature activation-3.09e-03

Pos logit contributions
 eye+0.009
ocket+0.008
 pad+0.008
 gate+0.008
anger+0.008
Neg logit contributions
 ne-0.008
 subt-0.008
 caring-0.007
 ang-0.007
ths-0.007
Token happy
Token ID3772
Feature activation+1.38e-02

Pos logit contributions
 ne+0.004
 subt+0.004
 caring+0.004
ths+0.003
 ang+0.003
Neg logit contributions
 eye-0.006
ocket-0.006
 gate-0.006
 pad-0.006
anger-0.006
Token.
Token ID13
Feature activation-3.81e-03

Pos logit contributions
 eye+0.021
ocket+0.018
 gate+0.018
anger+0.018
 pad+0.018
Neg logit contributions
 ne-0.026
 subt-0.025
 caring-0.023
 ang-0.022
ths-0.022
Token<|endoftext|>
Token ID50256
Feature activation-4.32e-03

Pos logit contributions
 ne+0.006
 subt+0.006
 caring+0.006
Yeah+0.005
 Amen+0.005
Neg logit contributions
 eye-0.007
ocket-0.006
 gate-0.006
anger-0.006
inger-0.006
TokenOnce
Token ID7454
Feature activation-1.25e-02

Pos logit contributions
 ne+0.007
 subt+0.006
B+0.005
 Sp+0.005
 Bones+0.005
Neg logit contributions
 eye-0.009
ocket-0.008
 envelope-0.008
ige-0.008
 gate-0.008
Token upon
Token ID2402
Feature activation-1.08e-02

Pos logit contributions
 ne+0.024
 caring+0.021
 subt+0.021
bel+0.021
 ang+0.020
Neg logit contributions
 eye-0.018
ocket-0.016
inger-0.015
 gate-0.015
lor-0.015
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
 ne+0.017
 subt+0.016
 caring+0.014
 ang+0.013
ths+0.013
Neg logit contributions
 eye-0.020
ocket-0.018
 gate-0.018
anger-0.018
 pad-0.018
Token time
Token ID640
Feature activation+2.59e-03

Pos logit contributions
 ne+0.030
 subt+0.029
 caring+0.027
 ang+0.027
ths+0.026
Neg logit contributions
 eye-0.015
 pad-0.012
ocket-0.011
 gate-0.011
inger-0.011
Token there
Token ID612
Feature activation-1.24e-03

Pos logit contributions
 eye+0.004
 gate+0.003
anger+0.003
 Sasha+0.003
 pad+0.003
Neg logit contributions
 subt-0.005
 ne-0.005
 caring-0.004
Yeah-0.004
 ang-0.004
Token were
Token ID547
Feature activation+7.71e-03

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
ths+0.001
 ang+0.001
Neg logit contributions
 eye-0.002
ocket-0.002
 gate-0.002
anger-0.002
 pad-0.002
Token two
Token ID734
Feature activation+4.69e-03

Pos logit contributions
 eye+0.014
ocket+0.012
anger+0.012
 gate+0.012
inger+0.012
Neg logit contributions
 ne-0.013
 subt-0.012
 caring-0.011
ths-0.011
Yeah-0.010
Token tear
Token ID11626
Feature activation-2.41e-02

Pos logit contributions
 subt+0.017
 ne+0.016
ths+0.015
 caring+0.014
Yeah+0.014
Neg logit contributions
 eye-0.018
 gate-0.016
ocket-0.015
 pad-0.015
 envelope-0.014
Token them
Token ID606
Feature activation-1.89e-02

Pos logit contributions
 ne+0.046
 subt+0.045
 caring+0.041
ths+0.040
 ang+0.039
Neg logit contributions
 eye-0.036
ocket-0.031
 gate-0.031
anger-0.031
 pad-0.030
Token."
Token ID526
Feature activation+1.58e-02

Pos logit contributions
 ne+0.035
 subt+0.032
 caring+0.031
 ang+0.030
ths+0.029
Neg logit contributions
 eye-0.030
ocket-0.026
anger-0.025
 gate-0.025
 pad-0.025
Token His
Token ID2399
Feature activation-2.28e-02

Pos logit contributions
 eye+0.026
 gate+0.021
anger+0.021
ork+0.021
 Sasha+0.021
Neg logit contributions
 ne-0.028
 subt-0.028
 caring-0.027
ths-0.027
nam-0.024
Token friends
Token ID2460
Feature activation-9.89e-03

Pos logit contributions
 subt+0.046
 ne+0.046
ths+0.042
bel+0.042
Yeah+0.041
Neg logit contributions
 eye-0.034
 gate-0.029
anger-0.026
inger-0.026
ocket-0.025
Token nodded
Token ID14464
Feature activation-8.97e-03

Pos logit contributions
 subt+0.015
 ne+0.015
 caring+0.013
bel+0.013
ths+0.013
Neg logit contributions
 eye-0.019
ocket-0.016
 gate-0.016
 pad-0.016
anger-0.015
Token.
Token ID13
Feature activation+1.89e-03

Pos logit contributions
 subt+0.022
 ne+0.021
ths+0.020
 caring+0.019
bel+0.019
Neg logit contributions
 eye-0.010
 gate-0.007
 pad-0.007
ocket-0.006
 Sasha-0.006
Token They
Token ID1119
Feature activation+7.21e-03

Pos logit contributions
 eye+0.003
 gate+0.003
ocket+0.002
anger+0.002
 pad+0.002
Neg logit contributions
 ne-0.004
 subt-0.003
 caring-0.003
ths-0.003
 ang-0.003
Token touched
Token ID12615
Feature activation+1.13e-02

Pos logit contributions
 eye+0.013
ocket+0.011
 gate+0.011
 pad+0.011
 Sasha+0.011
Neg logit contributions
 subt-0.012
 ne-0.011
 caring-0.010
ths-0.010
 ang-0.010
Token his
Token ID465
Feature activation-1.26e-02

Pos logit contributions
 eye+0.018
 gate+0.015
 Sasha+0.014
 pad+0.014
anger+0.014
Neg logit contributions
 subt-0.021
 ne-0.021
ths-0.019
 caring-0.019
bel-0.018
Token pants
Token ID12581
Feature activation-1.20e-03

Pos logit contributions
 ne+0.024
 subt+0.023
 caring+0.022
 ang+0.021
ths+0.021
Neg logit contributions
 eye-0.019
ocket-0.016
 gate-0.016
anger-0.016
 pad-0.016
Token raining
Token ID43079
Feature activation-1.56e-02

Pos logit contributions
 subt+0.015
 ne+0.014
bel+0.013
Yeah+0.013
ths+0.013
Neg logit contributions
 eye-0.015
 gate-0.013
 pad-0.013
 Sasha-0.013
inger-0.012
Token outside
Token ID2354
Feature activation+7.09e-04

Pos logit contributions
 subt+0.033
 ne+0.032
 caring+0.031
 ang+0.030
 forgiving+0.030
Neg logit contributions
 eye-0.019
anger-0.017
 gate-0.017
ork-0.016
 pad-0.016
Token.
Token ID13
Feature activation-2.21e-03

Pos logit contributions
 eye+0.001
ocket+0.001
inger+0.001
 pad+0.001
anger+0.001
Neg logit contributions
 ne-0.001
 subt-0.001
 caring-0.001
bel-0.001
 ang-0.001
Token<|endoftext|>
Token ID50256
Feature activation-3.56e-03

Pos logit contributions
 ne+0.004
 subt+0.004
 caring+0.003
Yeah+0.003
 ang+0.003
Neg logit contributions
 eye-0.004
ocket-0.003
 gate-0.003
anger-0.003
inger-0.003
TokenOnce
Token ID7454
Feature activation-1.65e-02

Pos logit contributions
 ne+0.005
 subt+0.004
B+0.003
cl+0.003
 Sp+0.003
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
 envelope-0.007
 pad-0.007
Token upon
Token ID2402
Feature activation-1.07e-02

Pos logit contributions
 ne+0.032
 caring+0.028
 subt+0.028
bel+0.027
 ang+0.027
Neg logit contributions
 eye-0.023
ocket-0.022
lor-0.020
anger-0.020
 gate-0.020
Token a
Token ID257
Feature activation-1.54e-02

Pos logit contributions
 ne+0.015
 subt+0.013
 caring+0.013
 ang+0.012
bel+0.012
Neg logit contributions
 eye-0.022
anger-0.020
ocket-0.020
 gate-0.020
inger-0.019
Token time
Token ID640
Feature activation-3.58e-04

Pos logit contributions
 ne+0.033
 subt+0.031
 caring+0.029
ths+0.028
 ang+0.028
Neg logit contributions
 eye-0.020
ocket-0.017
anger-0.017
 gate-0.017
 pad-0.016
Token,
Token ID11
Feature activation+3.43e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
 ang+0.001
Yeah+0.001
Neg logit contributions
 eye-0.001
 gate-0.000
anger-0.000
ocket-0.000
 pad-0.000
Token there
Token ID612
Feature activation+3.01e-05

Pos logit contributions
 eye+0.004
ocket+0.004
 gate+0.004
 pad+0.004
anger+0.003
Neg logit contributions
 ne-0.007
 subt-0.007
 caring-0.007
ths-0.006
 ang-0.006
Token was
Token ID373
Feature activation+1.02e-03

Pos logit contributions
 eye+0.000
ocket+0.000
inger+0.000
 gate+0.000
anger+0.000
Neg logit contributions
 ne-0.000
 subt-0.000
 caring-0.000
ths-0.000
Yeah-0.000
Token he
Token ID339
Feature activation-7.12e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
 ang+0.001
bel+0.001
Neg logit contributions
 eye-0.001
 gate-0.001
ocket-0.001
anger-0.001
 pad-0.001
Token bowed
Token ID35737
Feature activation-5.16e-03

Pos logit contributions
 subt+0.011
 ne+0.011
 caring+0.010
 ang+0.010
bel+0.010
Neg logit contributions
 eye-0.013
 pad-0.011
 gate-0.011
ocket-0.011
 Sasha-0.010
Token down
Token ID866
Feature activation+5.72e-03

Pos logit contributions
 subt+0.008
 ne+0.008
 caring+0.007
ths+0.007
 ang+0.007
Neg logit contributions
 eye-0.009
 gate-0.008
 pad-0.008
ocket-0.008
anger-0.008
Token and
Token ID290
Feature activation-7.64e-03

Pos logit contributions
 eye+0.007
 gate+0.006
anger+0.006
 pad+0.005
ocket+0.005
Neg logit contributions
 ne-0.012
 subt-0.012
 caring-0.011
ths-0.011
 ang-0.011
Token gave
Token ID2921
Feature activation+2.17e-02

Pos logit contributions
 subt+0.012
 ne+0.011
 caring+0.011
ths+0.010
bel+0.010
Neg logit contributions
 eye-0.013
 Sasha-0.011
 gate-0.011
 pad-0.011
ocket-0.011
Token the
Token ID262
Feature activation-9.16e-03

Pos logit contributions
 eye+0.040
ocket+0.035
anger+0.034
 gate+0.034
 pad+0.034
Neg logit contributions
 ne-0.035
 subt-0.034
 caring-0.031
ths-0.029
bel-0.029
Token cherry
Token ID23612
Feature activation+7.70e-03

Pos logit contributions
 ne+0.015
 subt+0.015
 caring+0.014
 ang+0.013
ths+0.013
Neg logit contributions
 eye-0.016
ocket-0.014
anger-0.014
 gate-0.014
 pad-0.014
Token to
Token ID284
Feature activation-7.53e-03

Pos logit contributions
 eye+0.011
 gate+0.010
 pad+0.009
anger+0.009
ocket+0.009
Neg logit contributions
 subt-0.015
 ne-0.015
 caring-0.013
ths-0.013
Yeah-0.013
Token F
Token ID376
Feature activation+8.06e-03

Pos logit contributions
 ne+0.011
 subt+0.010
 caring+0.009
 ang+0.009
Yeah+0.009
Neg logit contributions
 eye-0.015
ocket-0.014
 pad-0.013
 gate-0.013
anger-0.013
Tokenuzz
Token ID4715
Feature activation-1.57e-02

Pos logit contributions
 eye+0.014
 gate+0.013
ocket+0.013
 net+0.012
anger+0.012
Neg logit contributions
 ne-0.013
 subt-0.012
Yeah-0.011
 caring-0.011
spe-0.011
Tokeny
Token ID88
Feature activation-2.03e-04

Pos logit contributions
 ne+0.028
 subt+0.026
 caring+0.024
 ang+0.023
bel+0.023
Neg logit contributions
 eye-0.026
ocket-0.023
 gate-0.022
anger-0.022
inger-0.022
Token knew
Token ID2993
Feature activation+8.09e-03

Pos logit contributions
 eye+0.005
ocket+0.005
 pad+0.005
 gate+0.004
ork+0.004
Neg logit contributions
 subt-0.005
 ne-0.005
bel-0.004
 ang-0.004
 caring-0.004
Token it
Token ID340
Feature activation-1.64e-02

Pos logit contributions
 eye+0.017
ocket+0.015
inger+0.015
 pad+0.015
anger+0.015
Neg logit contributions
 ne-0.011
 subt-0.010
 caring-0.010
ths-0.009
 ang-0.009
Token was
Token ID373
Feature activation-9.43e-03

Pos logit contributions
 ne+0.027
 subt+0.027
 caring+0.024
bel+0.024
Yeah+0.024
Neg logit contributions
 eye-0.028
 gate-0.024
anger-0.024
ocket-0.024
 pad-0.024
Token important
Token ID1593
Feature activation+1.30e-02

Pos logit contributions
 ne+0.014
 subt+0.013
 caring+0.011
bel+0.011
Yeah+0.011
Neg logit contributions
 eye-0.019
ocket-0.017
 gate-0.017
 pad-0.016
anger-0.016
Token.
Token ID13
Feature activation-1.51e-03

Pos logit contributions
 eye+0.019
ocket+0.017
inger+0.017
 gate+0.016
anger+0.016
Neg logit contributions
 ne-0.025
 subt-0.024
 caring-0.023
ths-0.021
 ang-0.021
Token He
Token ID679
Feature activation-1.63e-02

Pos logit contributions
 ne+0.002
 subt+0.002
 caring+0.002
Yeah+0.002
 ang+0.002
Neg logit contributions
 eye-0.003
ocket-0.003
 gate-0.003
anger-0.003
inger-0.003
Token wanted
Token ID2227
Feature activation+2.53e-02

Pos logit contributions
 subt+0.025
 ne+0.025
 caring+0.022
bel+0.022
 ang+0.022
Neg logit contributions
 eye-0.031
ocket-0.027
 pad-0.026
 gate-0.026
anger-0.025
Token to
Token ID284
Feature activation-5.12e-03

Pos logit contributions
 eye+0.039
ocket+0.035
inger+0.034
anger+0.034
 pad+0.034
Neg logit contributions
 ne-0.048
 subt-0.045
 caring-0.043
 ang-0.041
ths-0.040
Token make
Token ID787
Feature activation-4.82e-04

Pos logit contributions
 ne+0.008
 subt+0.007
 caring+0.007
bel+0.006
Yeah+0.006
Neg logit contributions
 eye-0.009
 pad-0.009
ocket-0.009
anger-0.009
inger-0.009
Token his
Token ID465
Feature activation-5.65e-03

Pos logit contributions
 ne+0.001
 subt+0.001
 caring+0.001
 ang+0.001
ths+0.001
Neg logit contributions
 eye-0.001
ocket-0.001
 pad-0.001
anger-0.001
 gate-0.001
Token mom
Token ID1995
Feature activation-3.16e-02

Pos logit contributions
 ne+0.011
 subt+0.011
 caring+0.010
ths+0.010
 ang+0.010
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
 pad-0.007
anger-0.007
Token in
Token ID287
Feature activation-3.68e-03

Pos logit contributions
 subt+0.033
 ne+0.033
 caring+0.030
 ang+0.029
spe+0.029
Neg logit contributions
 eye-0.027
 gate-0.024
anger-0.024
ocket-0.023
ork-0.023
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
 ne+0.006
 subt+0.006
 caring+0.006
 ang+0.005
ths+0.005
Neg logit contributions
 eye-0.006
 gate-0.005
ocket-0.005
anger-0.005
 pad-0.005
Token pool
Token ID5933
Feature activation+1.37e-02

Pos logit contributions
ocket+0.018
 eye+0.018
hole+0.016
anger+0.016
inger+0.016
Neg logit contributions
 ne-0.026
 subt-0.023
 caring-0.023
 ang-0.021
ths-0.021
Token with
Token ID351
Feature activation+4.24e-03

Pos logit contributions
 eye+0.027
ocket+0.024
 gate+0.023
anger+0.023
 pad+0.023
Neg logit contributions
 ne-0.021
 subt-0.020
 caring-0.018
 ang-0.017
ths-0.017
Token her
Token ID607
Feature activation+6.16e-03

Pos logit contributions
 eye+0.006
 gate+0.005
ocket+0.005
 Sasha+0.005
anger+0.005
Neg logit contributions
 ne-0.008
 subt-0.008
ths-0.008
 caring-0.008
 ang-0.007
Token daddy
Token ID35822
Feature activation-4.98e-02

Pos logit contributions
 eye+0.010
ocket+0.009
 gate+0.009
anger+0.009
 pad+0.008
Neg logit contributions
 ne-0.011
 subt-0.011
 caring-0.010
 ang-0.010
ths-0.010
Token.
Token ID13
Feature activation+4.81e-03

Pos logit contributions
 ne+0.097
 subt+0.094
 caring+0.086
 ang+0.084
ths+0.083
Neg logit contributions
 eye-0.074
ocket-0.063
 gate-0.062
 pad-0.062
anger-0.061
Token One
Token ID1881
Feature activation+2.11e-02

Pos logit contributions
 eye+0.010
ocket+0.010
inger+0.010
anger+0.009
 gate+0.009
Neg logit contributions
 ne-0.006
 subt-0.005
 caring-0.005
Yeah-0.005
 cartoons-0.004
Token day
Token ID1110
Feature activation-2.59e-02

Pos logit contributions
 eye+0.036
ocket+0.034
anger+0.033
 gate+0.032
 pad+0.032
Neg logit contributions
 ne-0.035
 subt-0.032
 caring-0.031
Yeah-0.029
 ang-0.029
Token,
Token ID11
Feature activation-4.73e-03

Pos logit contributions
 ne+0.042
 subt+0.039
 caring+0.036
 ang+0.034
ths+0.034
Neg logit contributions
 eye-0.047
ocket-0.042
 gate-0.041
anger-0.041
 pad-0.041
Token Lily
Token ID20037
Feature activation+2.75e-02

Pos logit contributions
 subt+0.010
 ne+0.009
ths+0.009
 caring+0.008
spe+0.008
Neg logit contributions
 eye-0.007
 gate-0.006
 Sasha-0.005
anger-0.005
ocket-0.005
Token.
Token ID13
Feature activation-6.68e-04

Pos logit contributions
 ne+0.019
 subt+0.018
 caring+0.017
bel+0.016
 ang+0.016
Neg logit contributions
 eye-0.019
ocket-0.017
 pad-0.017
inger-0.017
 gate-0.016
Token But
Token ID887
Feature activation+3.92e-03

Pos logit contributions
 subt+0.001
 ne+0.001
Yeah+0.001
 caring+0.001
 Acc+0.001
Neg logit contributions
ocket-0.001
 eye-0.001
inger-0.001
ipper-0.001
 gate-0.001
Token then
Token ID788
Feature activation-1.61e-02

Pos logit contributions
 eye+0.008
ocket+0.007
inger+0.007
anger+0.007
 gate+0.007
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.005
ths-0.004
 ang-0.004
Token,
Token ID11
Feature activation-5.23e-03

Pos logit contributions
 ne+0.027
 subt+0.025
 caring+0.023
ths+0.022
 ang+0.022
Neg logit contributions
 eye-0.029
ocket-0.026
anger-0.026
inger-0.026
 pad-0.025
Token mid
Token ID3095
Feature activation-4.44e-02

Pos logit contributions
 ne+0.009
 subt+0.008
 caring+0.008
 ang+0.007
ths+0.007
Neg logit contributions
 eye-0.009
ocket-0.008
inger-0.008
anger-0.008
 pad-0.008
Token-
Token ID12
Feature activation-5.06e-02

Pos logit contributions
 ne+0.077
 subt+0.075
 caring+0.069
Yeah+0.067
bel+0.066
Neg logit contributions
 eye-0.076
anger-0.064
 gate-0.064
inger-0.064
ocket-0.064
Tokenway
Token ID1014
Feature activation+3.78e-03

Pos logit contributions
 ne+0.095
 subt+0.091
 caring+0.084
 ang+0.082
ths+0.081
Neg logit contributions
 eye-0.079
anger-0.068
ocket-0.068
inger-0.067
 pad-0.066
Token through
Token ID832
Feature activation-1.81e-02

Pos logit contributions
ocket+0.007
 eye+0.007
anger+0.006
inger+0.006
lor+0.006
Neg logit contributions
 ne-0.006
 subt-0.005
 ang-0.005
 caring-0.005
ths-0.005
Token the
Token ID262
Feature activation+1.65e-03

Pos logit contributions
 ne+0.028
 subt+0.027
 caring+0.024
ths+0.023
 ang+0.023
Neg logit contributions
 eye-0.034
ocket-0.031
anger-0.030
 pad-0.030
 gate-0.030
Token night
Token ID1755
Feature activation+1.32e-02

Pos logit contributions
 eye+0.002
ocket+0.002
 pad+0.002
inger+0.002
anger+0.002
Neg logit contributions
 ne-0.003
 subt-0.003
 caring-0.003
 ang-0.003
Yeah-0.003
Token,
Token ID11
Feature activation+5.07e-03

Pos logit contributions
ocket+0.018
inger+0.017
ipper+0.016
ige+0.016
 eye+0.015
Neg logit contributions
 ne-0.027
 subt-0.025
bel-0.023
 cartoons-0.021
cl-0.021
Token helicopter
Token ID15392
Feature activation-2.30e-02

Pos logit contributions
 ne+0.014
 subt+0.013
 caring+0.012
 ang+0.012
ths+0.012
Neg logit contributions
 eye-0.006
ocket-0.005
 Sasha-0.005
inger-0.005
anger-0.005
Token and
Token ID290
Feature activation+2.94e-03

Pos logit contributions
 ne+0.042
 subt+0.039
 caring+0.036
ths+0.036
bel+0.035
Neg logit contributions
 eye-0.035
ocket-0.034
inger-0.032
ulp-0.032
anger-0.031
Token Sally
Token ID25737
Feature activation+2.76e-02

Pos logit contributions
 eye+0.005
ocket+0.005
 gate+0.004
anger+0.004
 pad+0.004
Neg logit contributions
 ne-0.005
 subt-0.005
 caring-0.004
 ang-0.004
ths-0.004
Token's
Token ID338
Feature activation+8.12e-03

Pos logit contributions
 eye+0.052
ocket+0.046
 gate+0.046
 pad+0.045
anger+0.045
Neg logit contributions
 ne-0.042
 subt-0.041
 caring-0.037
 ang-0.035
ths-0.035
Token face
Token ID1986
Feature activation-1.61e-02

Pos logit contributions
 eye+0.011
 gate+0.009
ocket+0.008
anger+0.008
 pad+0.008
Neg logit contributions
 subt-0.018
 ne-0.018
 ang-0.016
 caring-0.016
ths-0.016
Token lit
Token ID6578
Feature activation-5.67e-02

Pos logit contributions
 subt+0.023
 ne+0.022
 Amen+0.021
Deal+0.020
 caring+0.020
Neg logit contributions
 eye-0.030
ork-0.026
 gate-0.026
ocket-0.025
anger-0.025
Token up
Token ID510
Feature activation+5.40e-03

Pos logit contributions
 subt+0.117
 ne+0.108
Deal+0.105
ths+0.101
 caring+0.100
Neg logit contributions
 eye-0.075
 gate-0.067
ork-0.062
 home-0.061
hole-0.060
Token with
Token ID351
Feature activation+1.39e-03

Pos logit contributions
 eye+0.008
 gate+0.007
ocket+0.007
 pad+0.006
 Sasha+0.006
Neg logit contributions
 subt-0.010
 ne-0.010
ths-0.009
 caring-0.009
Yeah-0.009
Token joy
Token ID8716
Feature activation+4.43e-03

Pos logit contributions
 eye+0.003
ocket+0.002
 gate+0.002
 pad+0.002
 Sasha+0.002
Neg logit contributions
 ne-0.002
 subt-0.002
ths-0.002
 caring-0.002
bel-0.002
Token as
Token ID355
Feature activation+2.42e-02

Pos logit contributions
 eye+0.007
ocket+0.007
 gate+0.006
inger+0.006
anger+0.006
Neg logit contributions
 ne-0.008
 subt-0.007
 caring-0.007
ths-0.007
bel-0.007
Token she
Token ID673
Feature activation-7.73e-03

Pos logit contributions
 eye+0.041
ocket+0.037
 gate+0.037
anger+0.036
 Sasha+0.036
Neg logit contributions
 ne-0.040
 subt-0.039
 ang-0.034
ths-0.034
 caring-0.034
Token happy
Token ID3772
Feature activation+1.66e-02

Pos logit contributions
 subt+0.006
 ne+0.005
bel+0.005
ths+0.005
Deal+0.004
Neg logit contributions
 eye-0.006
 home-0.005
 gate-0.005
ocket-0.005
 Sasha-0.005
Token to
Token ID284
Feature activation-1.31e-02

Pos logit contributions
 eye+0.024
ocket+0.020
 gate+0.020
 pad+0.020
anger+0.020
Neg logit contributions
 ne-0.032
 subt-0.031
 caring-0.028
bel-0.028
 ang-0.028
Token be
Token ID307
Feature activation-8.29e-03

Pos logit contributions
 ne+0.024
 subt+0.023
ths+0.021
 caring+0.021
 ang+0.021
Neg logit contributions
 eye-0.021
ocket-0.017
 gate-0.017
 pad-0.017
ork-0.017
Token with
Token ID351
Feature activation+6.91e-04

Pos logit contributions
 ne+0.014
 subt+0.013
ths+0.012
Yeah+0.011
bel+0.011
Neg logit contributions
 eye-0.015
 gate-0.013
ocket-0.013
inger-0.012
 pad-0.012
Token his
Token ID465
Feature activation-1.15e-02

Pos logit contributions
 eye+0.001
 envelope+0.001
 home+0.001
ulp+0.001
ocket+0.001
Neg logit contributions
 subt-0.001
 ne-0.001
bel-0.001
Neigh-0.001
Deal-0.001
Token daddy
Token ID35822
Feature activation-5.13e-02

Pos logit contributions
 subt+0.023
 ne+0.023
bel+0.022
spe+0.021
 Amen+0.021
Neg logit contributions
 eye-0.017
 gate-0.013
 home-0.013
 pad-0.013
 bag-0.013
Token and
Token ID290
Feature activation+9.62e-03

Pos logit contributions
 ne+0.099
 subt+0.096
 caring+0.088
bel+0.086
ths+0.086
Neg logit contributions
 eye-0.078
ocket-0.066
 gate-0.065
 pad-0.064
anger-0.062
Token fell
Token ID3214
Feature activation+1.20e-02

Pos logit contributions
 eye+0.018
 gate+0.015
ocket+0.015
 Sasha+0.014
 pad+0.014
Neg logit contributions
 ne-0.015
 subt-0.015
ths-0.013
 ang-0.012
Yeah-0.012
Token asleep
Token ID16039
Feature activation-4.13e-02

Pos logit contributions
 eye+0.015
 gate+0.012
 home+0.012
 pad+0.012
anger+0.012
Neg logit contributions
 subt-0.026
 ne-0.025
 caring-0.024
Deal-0.024
Yeah-0.024
Token in
Token ID287
Feature activation-8.16e-03

Pos logit contributions
 subt+0.076
 ne+0.075
ths+0.069
 caring+0.069
 ang+0.067
Neg logit contributions
 eye-0.062
ocket-0.053
 gate-0.052
 pad-0.052
anger-0.050
Token his
Token ID465
Feature activation+4.93e-03

Pos logit contributions
 ne+0.013
 subt+0.013
 caring+0.012
ths+0.011
Yeah+0.011
Neg logit contributions
 eye-0.015
 gate-0.013
ocket-0.013
anger-0.012
 pad-0.012
TokenBut
Token ID1537
Feature activation-1.06e-02

Pos logit contributions
 ne+0.006
 subt+0.006
 caring+0.005
 ang+0.005
ths+0.005
Neg logit contributions
 eye-0.008
ocket-0.007
 gate-0.007
 pad-0.007
anger-0.007
Token then
Token ID788
Feature activation-2.02e-02

Pos logit contributions
 ne+0.014
 subt+0.013
 caring+0.012
 ang+0.011
ths+0.011
Neg logit contributions
 eye-0.022
ocket-0.020
 gate-0.020
anger-0.020
 pad-0.020
Token he
Token ID339
Feature activation-8.62e-03

Pos logit contributions
 ne+0.033
 subt+0.033
 caring+0.030
Yeah+0.029
bel+0.029
Neg logit contributions
 eye-0.034
 gate-0.030
 Sasha-0.030
ocket-0.030
 pad-0.029
Token heard
Token ID2982
Feature activation-7.64e-03

Pos logit contributions
 ne+0.016
 subt+0.015
 caring+0.014
 ang+0.013
ths+0.013
Neg logit contributions
 eye-0.014
ocket-0.012
anger-0.012
 gate-0.012
inger-0.012
Token a
Token ID257
Feature activation-1.10e-02

Pos logit contributions
 ne+0.013
 subt+0.012
 caring+0.011
ths+0.010
Yeah+0.010
Neg logit contributions
 eye-0.014
ocket-0.012
 gate-0.012
anger-0.012
 pad-0.012
Token voice
Token ID3809
Feature activation-5.34e-02

Pos logit contributions
 subt+0.018
 ne+0.017
ths+0.016
beh+0.016
Yeah+0.016
Neg logit contributions
 eye-0.019
 gate-0.018
ork-0.016
 pad-0.015
 door-0.015
Token coming
Token ID2406
Feature activation-1.61e-02

Pos logit contributions
 ne+0.101
 subt+0.099
 caring+0.091
ths+0.088
Yeah+0.088
Neg logit contributions
 eye-0.082
 gate-0.070
ocket-0.070
anger-0.068
 pad-0.067
Token from
Token ID422
Feature activation-2.60e-02

Pos logit contributions
 ne+0.029
 subt+0.027
 caring+0.026
 ang+0.025
bel+0.024
Neg logit contributions
 eye-0.026
ocket-0.024
anger-0.023
 pad-0.022
inger-0.022
Token outside
Token ID2354
Feature activation+3.03e-03

Pos logit contributions
 ne+0.043
 subt+0.041
 caring+0.037
 ang+0.036
ths+0.035
Neg logit contributions
 eye-0.046
ocket-0.041
anger-0.040
 pad-0.040
 gate-0.040
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
 eye+0.006
ocket+0.006
inger+0.006
 pad+0.006
ige+0.005
Neg logit contributions
 ne-0.004
 subt-0.004
 ang-0.004
bel-0.004
 caring-0.004
Token house
Token ID2156
Feature activation-2.65e-04

Pos logit contributions
inger+0.013
 eye+0.012
ocket+0.012
 Sasha+0.012
ulp+0.012
Neg logit contributions
 ne-0.026
 subt-0.024
 caring-0.022
 ang-0.022
bel-0.022