TOP ACTIVATIONS
MAX = 0.067

Mom and Dad came home and saw the fire .
. Mom came home and saw them crying .
Their mom and dad came home and saw the smoke and
ulously , the wind and rain stopped . Bailey was amazed
Soon , dad came home from work . He saw
delicious ! THE <|endoftext|> Elsa was playing chess
. When mom my came home , she saw the shirt
his tail . They went home and had a fun day
her mom . They went home happy . <|endoftext|> S ara
, and they both walked home slowly , being careful not
and her mom my went home and put away all the
Mom my and Daddy came home , Sarah was so grateful
her back . They walked home together , happy and content
being cold ." They went home and Lily c uddled up
big brother and they went home together . The end .
were happy again and went home with a smile . <|endoftext|>
les . When mom came home , she was surprised and
Soon , Dad came home from work . He was
. Dad came home and saw the mess .
friends said goodbye and went home . Tim my was tired

MIN ACTIVATIONS
MIN = -0.064

' No ,' but J od ie kept going .
G ain is better than greed , child ."
shrimp was pushing against the hull of the vessel , which
had to suffer for her greed . She was very sorry
it . J od ie was amazed by what
difficult to hold the silver ware , but she tried her
dolls around in a wheel bar row she had found in
" I can stretch my wings and fly high in the
" Can you stretch your wings and fly with me ?"
bird . Just flap your wings and fly to the top
, " Always hold your spoon tight , so you won
. So J od ie 's mom bought the
of giving in to his greed . <|endoftext|> Once upon a
knot in his sho el aces and he couldn 't get
it away in a wheel bar row . Once
easier to carry the jelly fish ! â €
He jumped onto the wheel bar row and started rolling around
would cut it into small squares and put it on the
I like swimming in the shallow water near the rocks .
to cut it into small pieces , so it is easy

INTERVAL 0.034 - 0.067
CONTAINS 0.226%

the park and soon the sky grew dark . Big drops
where are you ?" she sob bed . Ben
Daisy saw a nice , clean subway and thought it looked
on dry and cozy p aj amas and sn ugg led
sun . As days passed , the plant started

INTERVAL 0.001 - 0.034
CONTAINS 43.900%

it so it could come back . He said " rocket
â € � � We â € ™ ll never know
and gave it a new name , Fl uffy . Mia
wardrobe . He knew he had done something special and he
a friendly mechanic named Sam . Sam was deep in thought

INTERVAL -0.032 - 0.001
CONTAINS 55.474%

a time , there was a phone . It liked to
. They have fun . They are happy . They have
are you doing , young man ? Are you trying to
but he still wanted to keep the ash tr ay .
S ue smiled and they played together all day .

INTERVAL -0.064 - -0.032
CONTAINS 0.400%

, she always remembered to be honest and reliable . <|endoftext|>
to mark the beetle on his paper . He
dance . She swayed her arms up and down and tw
string . She tied the string around the balloon and started
in and out of the flowers , eating the juicy fruit
Token
Token ID198
Feature activation+2.11e-02

Pos logit contributions
etti+0.028
isions+0.024
icol+0.024
ller+0.023
itor+0.023
Neg logit contributions
 often-0.016
 end-0.016
 forward-0.014
 narrative-0.013
 hop-0.013
TokenMom
Token ID29252
Feature activation+4.80e-02

Pos logit contributions
etti+0.056
icol+0.048
isions+0.047
ller+0.047
 Jenn+0.046
Neg logit contributions
 end-0.022
 often-0.022
 forward-0.020
 narrative-0.019
 also-0.017
Token and
Token ID290
Feature activation+2.52e-02

Pos logit contributions
etti+0.107
isions+0.088
ller+0.088
 Jenn+0.086
icol+0.086
Neg logit contributions
 often-0.068
 end-0.067
 forward-0.064
 narrative-0.061
 hop-0.057
Token Dad
Token ID17415
Feature activation+2.88e-02

Pos logit contributions
etti+0.051
isions+0.044
ller+0.043
igsaw+0.039
icol+0.039
Neg logit contributions
 end-0.046
 often-0.042
 forward-0.041
 also-0.040
 sometimes-0.040
Token came
Token ID1625
Feature activation+3.07e-02

Pos logit contributions
etti+0.060
isions+0.049
 Jenn+0.048
ller+0.048
icol+0.047
Neg logit contributions
 often-0.044
 end-0.042
 forward-0.041
 narrative-0.041
 hop-0.037
Token home
Token ID1363
Feature activation+6.71e-02

Pos logit contributions
etti+0.078
ller+0.067
isions+0.066
icol+0.066
 Pole+0.064
Neg logit contributions
 forward-0.037
 often-0.037
 end-0.037
 hop-0.031
 also-0.028
Token and
Token ID290
Feature activation+4.97e-02

Pos logit contributions
etti+0.196
icol+0.175
isions+0.174
 badges+0.165
ller+0.164
Neg logit contributions
 end-0.058
 often-0.057
 forward-0.050
 also-0.042
 sometimes-0.041
Token saw
Token ID2497
Feature activation+2.09e-02

Pos logit contributions
etti+0.120
isions+0.100
ller+0.099
icol+0.098
MY+0.094
Neg logit contributions
 end-0.070
 often-0.069
 forward-0.064
 also-0.062
 always-0.057
Token the
Token ID262
Feature activation+1.99e-02

Pos logit contributions
etti+0.054
icol+0.048
isions+0.047
ller+0.047
ñ+0.045
Neg logit contributions
 end-0.026
 often-0.021
 forward-0.020
 also-0.020
 sometimes-0.020
Token fire
Token ID2046
Feature activation+4.30e-02

Pos logit contributions
etti+0.038
 Jenn+0.030
isions+0.030
ller+0.030
icol+0.029
Neg logit contributions
 end-0.037
 often-0.035
 narrative-0.033
 forward-0.033
ery-0.030
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
etti+0.100
isions+0.087
icol+0.087
 Jenn+0.086
ller+0.084
Neg logit contributions
 end-0.058
 often-0.053
 forward-0.050
 narrative-0.045
 sometimes-0.044
Token.
Token ID13
Feature activation-5.79e-04

Pos logit contributions
etti+0.005
 Brown+0.004
isions+0.004
 Jenn+0.004
ller+0.004
Neg logit contributions
 narrative-0.004
 often-0.004
 end-0.003
orn-0.003
 brav-0.003
Token
Token ID198
Feature activation-1.09e-03

Pos logit contributions
 often+0.001
 end+0.001
 forward+0.001
 narrative+0.001
 sometimes+0.001
Neg logit contributions
etti-0.001
ller-0.001
isions-0.001
 coaster-0.001
ñ-0.001
Token
Token ID198
Feature activation+2.64e-03

Pos logit contributions
 often+0.002
 end+0.001
 forward+0.001
 narrative+0.001
 hop+0.001
Neg logit contributions
etti-0.003
isions-0.002
icol-0.002
ller-0.002
itor-0.002
TokenMom
Token ID29252
Feature activation+4.34e-02

Pos logit contributions
etti+0.006
icol+0.005
ller+0.005
isions+0.005
 Jenn+0.005
Neg logit contributions
 end-0.003
 often-0.003
 forward-0.003
 narrative-0.003
 hop-0.003
Token came
Token ID1625
Feature activation+2.46e-02

Pos logit contributions
etti+0.091
isions+0.075
 Jenn+0.074
ller+0.074
icol+0.072
Neg logit contributions
 often-0.066
 end-0.064
 forward-0.063
 narrative-0.061
 hop-0.056
Token home
Token ID1363
Feature activation+6.48e-02

Pos logit contributions
etti+0.059
ller+0.050
isions+0.049
icol+0.049
 Jenn+0.047
Neg logit contributions
 often-0.033
 end-0.032
 forward-0.032
 hop-0.027
 narrative-0.026
Token and
Token ID290
Feature activation+4.65e-02

Pos logit contributions
etti+0.181
icol+0.161
isions+0.158
ller+0.150
 badges+0.150
Neg logit contributions
 end-0.064
 often-0.063
 forward-0.056
 also-0.048
 sometimes-0.047
Token saw
Token ID2497
Feature activation+1.30e-02

Pos logit contributions
etti+0.110
isions+0.090
icol+0.088
ller+0.088
MY+0.083
Neg logit contributions
 often-0.068
 end-0.066
 forward-0.062
 also-0.059
 sometimes-0.055
Token them
Token ID606
Feature activation+2.99e-02

Pos logit contributions
etti+0.032
icol+0.029
ller+0.028
isions+0.028
itor+0.027
Neg logit contributions
 end-0.017
 often-0.015
 forward-0.014
 sometimes-0.014
 also-0.013
Token crying
Token ID13774
Feature activation+1.22e-02

Pos logit contributions
etti+0.059
isions+0.048
ller+0.047
 Jenn+0.046
 flashing+0.044
Neg logit contributions
 often-0.048
 narrative-0.046
 forward-0.046
 end-0.045
 brav-0.043
Token.
Token ID13
Feature activation+7.62e-03

Pos logit contributions
etti+0.022
ller+0.018
isions+0.018
 Jenn+0.017
 Brown+0.017
Neg logit contributions
 often-0.021
 brav-0.020
 narrative-0.019
 hop-0.019
 forward-0.018
Token Their
Token ID5334
Feature activation+1.99e-02

Pos logit contributions
 often+0.008
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.011
isions-0.009
ller-0.009
 Jenn-0.009
itor-0.009
Token mom
Token ID1995
Feature activation+3.82e-02

Pos logit contributions
etti+0.046
isions+0.039
icol+0.037
MY+0.036
ller+0.036
Neg logit contributions
 end-0.030
 often-0.030
 narrative-0.025
 moral-0.024
 forward-0.023
Token and
Token ID290
Feature activation+2.28e-02

Pos logit contributions
etti+0.078
isions+0.064
 Jenn+0.063
ller+0.063
icol+0.061
Neg logit contributions
 often-0.060
 end-0.058
 forward-0.057
 narrative-0.056
 hop-0.051
Token dad
Token ID9955
Feature activation+2.22e-02

Pos logit contributions
etti+0.051
isions+0.042
ller+0.040
MY+0.037
 coaster+0.037
Neg logit contributions
 end-0.038
 often-0.036
 forward-0.034
 always-0.032
 also-0.032
Token came
Token ID1625
Feature activation+2.51e-02

Pos logit contributions
etti+0.045
isions+0.036
 Jenn+0.036
ller+0.036
icol+0.035
Neg logit contributions
 often-0.035
 end-0.034
 forward-0.034
 narrative-0.033
 hop-0.030
Token home
Token ID1363
Feature activation+6.29e-02

Pos logit contributions
etti+0.057
ller+0.047
isions+0.047
icol+0.046
 Jenn+0.045
Neg logit contributions
 often-0.036
 end-0.035
 forward-0.034
 narrative-0.031
 hop-0.030
Token and
Token ID290
Feature activation+4.42e-02

Pos logit contributions
etti+0.187
icol+0.166
isions+0.164
ller+0.157
 badges+0.155
Neg logit contributions
 end-0.052
 often-0.051
 forward-0.044
 sometimes-0.037
 also-0.037
Token saw
Token ID2497
Feature activation+9.44e-03

Pos logit contributions
etti+0.112
isions+0.092
ller+0.091
icol+0.089
MY+0.087
Neg logit contributions
 often-0.060
 end-0.059
 also-0.053
 forward-0.053
 always-0.049
Token the
Token ID262
Feature activation+2.97e-02

Pos logit contributions
etti+0.021
isions+0.018
ller+0.018
icol+0.018
 Jenn+0.017
Neg logit contributions
 end-0.014
 often-0.013
 forward-0.013
 narrative-0.012
 hop-0.011
Token smoke
Token ID7523
Feature activation+2.27e-02

Pos logit contributions
etti+0.059
icol+0.047
 Jenn+0.046
isions+0.046
ller+0.045
Neg logit contributions
 end-0.055
 often-0.050
 narrative-0.048
 forward-0.047
 moral-0.044
Token and
Token ID290
Feature activation+3.84e-02

Pos logit contributions
etti+0.053
isions+0.044
ller+0.044
 Jenn+0.043
icol+0.042
Neg logit contributions
 often-0.030
 end-0.028
 forward-0.027
 narrative-0.027
 hop-0.024
Tokenulously
Token ID18117
Feature activation+4.58e-03

Pos logit contributions
 often+0.026
 end+0.025
 forward+0.024
 narrative+0.023
 hop+0.021
Neg logit contributions
etti-0.050
isions-0.043
ller-0.042
 Jenn-0.042
icol-0.041
Token,
Token ID11
Feature activation-3.20e-03

Pos logit contributions
etti+0.011
isions+0.009
ller+0.009
icol+0.009
 Jenn+0.009
Neg logit contributions
 often-0.006
 end-0.006
 forward-0.005
 narrative-0.005
 hop-0.005
Token the
Token ID262
Feature activation-8.72e-03

Pos logit contributions
 often+0.005
 end+0.005
 forward+0.005
 narrative+0.005
 hop+0.004
Neg logit contributions
etti-0.007
isions-0.006
ller-0.005
 Jenn-0.005
icol-0.005
Token wind
Token ID2344
Feature activation+2.41e-02

Pos logit contributions
 often+0.012
 forward+0.011
 narrative+0.011
 end+0.011
ery+0.010
Neg logit contributions
etti-0.019
ller-0.016
 Jenn-0.016
isions-0.016
 flashing-0.015
Token and
Token ID290
Feature activation+8.12e-03

Pos logit contributions
etti+0.046
isions+0.038
ller+0.037
 Jenn+0.036
icol+0.036
Neg logit contributions
 often-0.041
 end-0.041
 forward-0.039
 narrative-0.038
 hop-0.035
Token rain
Token ID6290
Feature activation+6.23e-02

Pos logit contributions
etti+0.017
isions+0.014
ller+0.014
 Jenn+0.014
icol+0.014
Neg logit contributions
 often-0.012
 end-0.012
 forward-0.012
 narrative-0.011
 hop-0.010
Token stopped
Token ID5025
Feature activation+1.82e-02

Pos logit contributions
etti+0.133
isions+0.119
itor+0.115
ller+0.114
icol+0.110
Neg logit contributions
 end-0.087
 often-0.086
 forward-0.084
 always-0.075
 sometimes-0.071
Token.
Token ID13
Feature activation+9.42e-03

Pos logit contributions
etti+0.044
isions+0.038
itor+0.037
ller+0.037
 Jenn+0.036
Neg logit contributions
 often-0.023
 end-0.022
 forward-0.022
 narrative-0.018
 hop-0.018
Token Bailey
Token ID20330
Feature activation+6.08e-03

Pos logit contributions
etti+0.018
isions+0.014
ller+0.014
 Jenn+0.014
icol+0.014
Neg logit contributions
 often-0.016
 end-0.016
 forward-0.016
 narrative-0.015
 hop-0.014
Token was
Token ID373
Feature activation+1.21e-02

Pos logit contributions
etti+0.013
isions+0.011
ller+0.011
 Jenn+0.011
icol+0.010
Neg logit contributions
 often-0.009
 end-0.009
 forward-0.008
 narrative-0.008
 hop-0.008
Token amazed
Token ID24562
Feature activation+4.02e-03

Pos logit contributions
etti+0.030
isions+0.026
ller+0.025
icol+0.025
 Jenn+0.024
Neg logit contributions
 often-0.015
 end-0.014
 forward-0.014
 narrative-0.012
 hop-0.011
Token
Token ID198
Feature activation+7.64e-03

Pos logit contributions
etti+0.004
isions+0.003
ller+0.003
icol+0.003
 Jenn+0.003
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.002
 narrative-0.002
 hop-0.002
TokenSoon
Token ID28093
Feature activation+2.12e-02

Pos logit contributions
etti+0.017
isions+0.015
 Jenn+0.015
ller+0.015
icol+0.014
Neg logit contributions
 often-0.010
 end-0.009
 forward-0.009
 narrative-0.009
 brav-0.008
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
etti+0.057
isions+0.054
ller+0.053
plug+0.050
icol+0.050
Neg logit contributions
 end-0.024
 often-0.021
 forward-0.021
 hop-0.017
 sometimes-0.017
Token dad
Token ID9955
Feature activation+3.61e-02

Pos logit contributions
etti+0.021
isions+0.018
ller+0.018
icol+0.017
itor+0.016
Neg logit contributions
 end-0.018
 often-0.017
 forward-0.016
 narrative-0.015
 sometimes-0.015
Token came
Token ID1625
Feature activation+2.39e-02

Pos logit contributions
etti+0.087
icol+0.075
ller+0.074
isions+0.073
 Jenn+0.070
Neg logit contributions
 end-0.046
 often-0.045
 forward-0.041
 also-0.038
 hop-0.037
Token home
Token ID1363
Feature activation+6.07e-02

Pos logit contributions
etti+0.058
ller+0.049
isions+0.049
icol+0.048
 Jenn+0.047
Neg logit contributions
 forward-0.031
 often-0.031
 end-0.031
 hop-0.026
 narrative-0.024
Token from
Token ID422
Feature activation+2.74e-02

Pos logit contributions
etti+0.168
isions+0.146
icol+0.146
ller+0.140
 badges+0.136
Neg logit contributions
 often-0.061
 end-0.059
 forward-0.055
 also-0.044
 sometimes-0.044
Token work
Token ID670
Feature activation+1.78e-02

Pos logit contributions
etti+0.063
isions+0.055
ller+0.051
icol+0.050
 Jenn+0.049
Neg logit contributions
 end-0.041
 often-0.041
 forward-0.036
 narrative-0.033
 sometimes-0.032
Token.
Token ID13
Feature activation+6.96e-03

Pos logit contributions
etti+0.036
ller+0.030
 Jenn+0.029
isions+0.029
 Paul+0.028
Neg logit contributions
 end-0.027
 narrative-0.026
 often-0.026
 brav-0.025
 hop-0.025
Token He
Token ID679
Feature activation+2.94e-02

Pos logit contributions
etti+0.019
isions+0.016
ller+0.016
icol+0.015
 Jenn+0.015
Neg logit contributions
 often-0.007
 end-0.007
 forward-0.006
 narrative-0.006
 hop-0.005
Token saw
Token ID2497
Feature activation+3.61e-03

Pos logit contributions
etti+0.068
isions+0.056
ller+0.054
icol+0.054
 Jenn+0.052
Neg logit contributions
 often-0.044
 end-0.041
 also-0.037
 narrative-0.036
 forward-0.036
Token delicious
Token ID12625
Feature activation+8.18e-03

Pos logit contributions
etti+0.012
 sprink+0.010
 Sus+0.009
 conf+0.009
 John+0.009
Neg logit contributions
 narrative-0.011
 brav-0.010
 forward-0.010
iddle-0.010
orn-0.010
Token!
Token ID0
Feature activation+2.05e-02

Pos logit contributions
etti+0.017
isions+0.013
ller+0.013
 Jenn+0.013
icol+0.013
Neg logit contributions
 often-0.012
 narrative-0.012
 end-0.012
 forward-0.012
 brav-0.011
Token
Token ID220
Feature activation+2.96e-02

Pos logit contributions
etti+0.039
icol+0.032
isions+0.032
 badges+0.032
ller+0.031
Neg logit contributions
 often-0.036
 end-0.032
 forward-0.031
 narrative-0.029
 sometimes-0.029
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
etti+0.069
icol+0.060
isions+0.059
ller+0.058
itor+0.057
Neg logit contributions
 often-0.037
 end-0.034
 forward-0.034
 narrative-0.030
 hop-0.029
Token
Token ID198
Feature activation+1.54e-02

Pos logit contributions
etti+0.036
isions+0.030
ller+0.029
icol+0.029
 Jenn+0.028
Neg logit contributions
 often-0.024
 end-0.023
 forward-0.022
 narrative-0.021
 hop-0.020
TokenTHE
Token ID10970
Feature activation+6.07e-02

Pos logit contributions
etti+0.029
isions+0.024
 Jenn+0.023
ller+0.023
icol+0.022
Neg logit contributions
 often-0.026
 end-0.026
 forward-0.025
 narrative-0.024
 hop-0.023
Token<|endoftext|>
Token ID50256
Feature activation+9.27e-03

Pos logit contributions
etti+0.165
isions+0.155
 extingu+0.143
 Pole+0.142
 badges+0.141
Neg logit contributions
 often-0.058
 end-0.042
 always-0.042
 hop-0.039
 forward-0.038
TokenElsa
Token ID49050
Feature activation-1.33e-02

Pos logit contributions
etti+0.018
isions+0.014
ller+0.014
icol+0.014
 Jenn+0.014
Neg logit contributions
 often-0.016
 end-0.016
 forward-0.015
 narrative-0.015
 hop-0.014
Token was
Token ID373
Feature activation-6.88e-03

Pos logit contributions
 often+0.018
 forward+0.018
 end+0.018
 narrative+0.018
 brav+0.016
Neg logit contributions
etti-0.029
ller-0.024
isions-0.024
icol-0.024
 Jenn-0.024
Token playing
Token ID2712
Feature activation-1.78e-02

Pos logit contributions
 narrative+0.012
 end+0.012
 often+0.011
orn+0.010
iness+0.010
Neg logit contributions
etti-0.012
 Jenn-0.010
icol-0.009
isions-0.009
 Grand-0.009
Token chess
Token ID19780
Feature activation-2.93e-02

Pos logit contributions
 narrative+0.033
 forward+0.030
 brav+0.029
iness+0.029
 often+0.028
Neg logit contributions
etti-0.029
 conf-0.022
isions-0.022
 James-0.022
 Brown-0.021
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
 often+0.011
 narrative+0.011
 end+0.011
 forward+0.011
 brav+0.010
Neg logit contributions
etti-0.016
ller-0.013
 Jenn-0.013
isions-0.012
icol-0.012
Token When
Token ID1649
Feature activation+1.70e-02

Pos logit contributions
etti+0.027
isions+0.022
icol+0.022
ller+0.022
itor+0.021
Neg logit contributions
 often-0.015
 end-0.014
 forward-0.014
 narrative-0.013
 hop-0.012
Token mom
Token ID1995
Feature activation+3.78e-02

Pos logit contributions
etti+0.043
isions+0.041
ller+0.038
itor+0.037
 Pole+0.036
Neg logit contributions
 end-0.021
 often-0.019
 forward-0.018
 narrative-0.016
 hop-0.016
Tokenmy
Token ID1820
Feature activation+3.39e-02

Pos logit contributions
etti+0.101
isions+0.087
ller+0.087
icol+0.087
 Jenn+0.085
Neg logit contributions
 often-0.037
 end-0.036
 forward-0.032
 narrative-0.030
 hop-0.027
Token came
Token ID1625
Feature activation+2.47e-02

Pos logit contributions
etti+0.078
isions+0.067
icol+0.067
ller+0.067
itor+0.064
Neg logit contributions
 end-0.045
 often-0.044
 hop-0.041
 also-0.039
 forward-0.038
Token home
Token ID1363
Feature activation+5.99e-02

Pos logit contributions
etti+0.059
isions+0.050
ller+0.049
icol+0.048
 Jenn+0.048
Neg logit contributions
 often-0.032
 end-0.031
 forward-0.030
 narrative-0.027
 hop-0.026
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
etti+0.170
isions+0.158
icol+0.150
 badges+0.148
ller+0.145
Neg logit contributions
 often-0.053
 end-0.053
 forward-0.050
 also-0.039
 sometimes-0.038
Token she
Token ID673
Feature activation+3.25e-02

Pos logit contributions
etti+0.035
isions+0.030
icol+0.029
ller+0.029
 Pole+0.028
Neg logit contributions
 end-0.019
 often-0.019
 forward-0.017
 narrative-0.016
 also-0.015
Token saw
Token ID2497
Feature activation+4.95e-03

Pos logit contributions
etti+0.069
isions+0.056
ller+0.055
 Jenn+0.055
icol+0.055
Neg logit contributions
 often-0.051
 end-0.049
 forward-0.046
 narrative-0.045
 hop-0.043
Token the
Token ID262
Feature activation+2.18e-02

Pos logit contributions
etti+0.011
isions+0.010
icol+0.010
ller+0.009
 Jenn+0.009
Neg logit contributions
 end-0.007
 often-0.007
 forward-0.006
 narrative-0.006
 sometimes-0.005
Token shirt
Token ID10147
Feature activation+5.25e-03

Pos logit contributions
etti+0.039
isions+0.031
ller+0.030
 Jenn+0.030
icol+0.029
Neg logit contributions
 often-0.040
 end-0.040
 forward-0.038
 narrative-0.038
 hop-0.035
Token his
Token ID465
Feature activation+5.45e-04

Pos logit contributions
etti+0.019
isions+0.016
ller+0.016
 Jenn+0.016
icol+0.015
Neg logit contributions
 often-0.009
 end-0.009
 forward-0.008
 narrative-0.008
 hop-0.007
Token tail
Token ID7894
Feature activation-1.68e-02

Pos logit contributions
etti+0.001
isions+0.001
 Jenn+0.001
ller+0.001
icol+0.001
Neg logit contributions
 end-0.001
 often-0.001
 forward-0.001
 hop-0.001
 narrative-0.001
Token.
Token ID13
Feature activation-3.37e-03

Pos logit contributions
 narrative+0.031
 often+0.026
ying+0.026
 end+0.025
 forward+0.025
Neg logit contributions
etti-0.031
ller-0.024
 Brown-0.024
 Jenn-0.024
 Rob-0.023
Token They
Token ID1119
Feature activation-5.68e-04

Pos logit contributions
 often+0.005
 end+0.005
 forward+0.005
 narrative+0.005
 hop+0.005
Neg logit contributions
etti-0.007
isions-0.006
ller-0.006
 Jenn-0.006
icol-0.005
Token went
Token ID1816
Feature activation+2.27e-02

Pos logit contributions
 forward+0.001
 narrative+0.001
 end+0.001
 often+0.001
 brav+0.001
Neg logit contributions
etti-0.001
 Jenn-0.001
 Brown-0.001
 Paul-0.001
ller-0.001
Token home
Token ID1363
Feature activation+5.97e-02

Pos logit contributions
etti+0.054
isions+0.046
ller+0.045
icol+0.044
 Jenn+0.044
Neg logit contributions
 often-0.030
 end-0.029
 forward-0.029
 narrative-0.025
 hop-0.024
Token and
Token ID290
Feature activation+2.09e-02

Pos logit contributions
etti+0.168
isions+0.151
icol+0.147
 conf+0.142
 Pole+0.141
Neg logit contributions
 often-0.062
 end-0.051
 forward-0.049
 always-0.043
 also-0.042
Token had
Token ID550
Feature activation+1.97e-02

Pos logit contributions
etti+0.042
isions+0.035
ller+0.033
icol+0.032
 Jenn+0.032
Neg logit contributions
 often-0.036
 end-0.034
 forward-0.033
 narrative-0.031
 hop-0.031
Token a
Token ID257
Feature activation+2.57e-02

Pos logit contributions
etti+0.045
ller+0.038
isions+0.037
 Jenn+0.036
icol+0.036
Neg logit contributions
 often-0.028
 end-0.027
 forward-0.025
 narrative-0.024
 hop-0.023
Token fun
Token ID1257
Feature activation+2.38e-02

Pos logit contributions
etti+0.062
isions+0.053
ller+0.053
 Jenn+0.050
itor+0.050
Neg logit contributions
 end-0.036
 often-0.035
 forward-0.032
 hop-0.030
 narrative-0.029
Token day
Token ID1110
Feature activation+2.57e-02

Pos logit contributions
etti+0.069
isions+0.060
ller+0.060
 Jenn+0.059
icol+0.059
Neg logit contributions
 often-0.018
 end-0.017
 forward-0.015
 narrative-0.014
 hop-0.012
Token her
Token ID607
Feature activation+8.42e-03

Pos logit contributions
 often+0.007
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.008
 Jenn-0.007
ller-0.007
isions-0.007
icol-0.007
Token mom
Token ID1995
Feature activation+1.40e-04

Pos logit contributions
etti+0.015
isions+0.012
ller+0.012
 Jenn+0.012
icol+0.012
Neg logit contributions
 often-0.016
 end-0.015
 forward-0.015
 narrative-0.014
 hop-0.014
Token.
Token ID13
Feature activation-6.74e-03

Pos logit contributions
etti+0.000
ller+0.000
 Jenn+0.000
 Brown+0.000
 Smith+0.000
Neg logit contributions
 narrative-0.000
 often-0.000
 end-0.000
 brav-0.000
 forward-0.000
Token They
Token ID1119
Feature activation+1.38e-02

Pos logit contributions
 often+0.010
 end+0.009
 forward+0.009
 narrative+0.009
 hop+0.008
Neg logit contributions
etti-0.015
isions-0.013
ller-0.012
 Jenn-0.012
icol-0.012
Token went
Token ID1816
Feature activation+2.46e-02

Pos logit contributions
etti+0.027
 Jenn+0.022
isions+0.022
ller+0.022
icol+0.021
Neg logit contributions
 often-0.022
 forward-0.021
 end-0.021
 narrative-0.020
 brav-0.019
Token home
Token ID1363
Feature activation+5.94e-02

Pos logit contributions
etti+0.058
isions+0.049
ller+0.049
icol+0.047
 Jenn+0.046
Neg logit contributions
 often-0.035
 forward-0.034
 end-0.033
 hop-0.027
 narrative-0.027
Token happy
Token ID3772
Feature activation+2.76e-02

Pos logit contributions
etti+0.168
icol+0.152
isions+0.151
 Pole+0.141
MY+0.140
Neg logit contributions
 often-0.060
 end-0.056
 forward-0.048
 also-0.042
 sometimes-0.041
Token.
Token ID13
Feature activation+4.90e-03

Pos logit contributions
etti+0.073
isions+0.063
icol+0.062
 Jenn+0.062
ller+0.061
Neg logit contributions
 often-0.031
 end-0.029
 forward-0.026
 narrative-0.024
 sometimes-0.023
Token<|endoftext|>
Token ID50256
Feature activation+7.78e-03

Pos logit contributions
etti+0.012
isions+0.010
 coaster+0.010
ller+0.010
itor+0.010
Neg logit contributions
 end-0.006
 often-0.006
 narrative-0.006
 sometimes-0.005
 forward-0.005
TokenS
Token ID50
Feature activation+1.37e-02

Pos logit contributions
etti+0.016
isions+0.013
ller+0.013
icol+0.013
 Jenn+0.013
Neg logit contributions
 end-0.012
 often-0.012
 forward-0.012
 narrative-0.012
 hop-0.011
Tokenara
Token ID3301
Feature activation+9.46e-03

Pos logit contributions
etti+0.024
ller+0.020
isions+0.020
icol+0.019
 Jenn+0.018
Neg logit contributions
 end-0.027
 often-0.026
 forward-0.024
 narrative-0.023
 also-0.022
Token,
Token ID11
Feature activation-2.27e-03

Pos logit contributions
etti+0.034
isions+0.029
ller+0.028
 Jenn+0.028
icol+0.028
Neg logit contributions
 often-0.019
 end-0.019
 forward-0.018
 narrative-0.017
 hop-0.016
Token and
Token ID290
Feature activation+4.65e-03

Pos logit contributions
 narrative+0.003
 end+0.003
 often+0.003
 forward+0.003
ery+0.003
Neg logit contributions
etti-0.005
ller-0.004
 Jenn-0.004
isions-0.004
 Paul-0.004
Token they
Token ID484
Feature activation+5.66e-03

Pos logit contributions
etti+0.009
isions+0.008
ller+0.007
icol+0.007
 Jenn+0.007
Neg logit contributions
 often-0.008
 end-0.008
 forward-0.007
 narrative-0.007
 hop-0.007
Token both
Token ID1111
Feature activation+9.51e-03

Pos logit contributions
etti+0.012
 Jenn+0.009
isions+0.009
ller+0.009
icol+0.009
Neg logit contributions
 often-0.008
 forward-0.008
 narrative-0.008
 end-0.008
 brav-0.007
Token walked
Token ID6807
Feature activation+1.46e-02

Pos logit contributions
etti+0.018
 Jenn+0.015
isions+0.014
ller+0.014
icol+0.014
Neg logit contributions
 often-0.016
 end-0.015
 forward-0.015
 narrative-0.015
 brav-0.013
Token home
Token ID1363
Feature activation+5.94e-02

Pos logit contributions
etti+0.036
isions+0.030
ller+0.030
 Jenn+0.029
icol+0.029
Neg logit contributions
 often-0.018
 end-0.018
 forward-0.017
 narrative-0.016
 hop-0.014
Token slowly
Token ID6364
Feature activation+2.44e-02

Pos logit contributions
etti+0.158
icol+0.139
isions+0.137
 Pole+0.131
ller+0.130
Neg logit contributions
 often-0.069
 end-0.061
 forward-0.061
 also-0.050
 always-0.049
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
etti+0.054
isions+0.045
ller+0.044
 Jenn+0.043
icol+0.043
Neg logit contributions
 often-0.035
 end-0.034
 forward-0.033
 narrative-0.031
 hop-0.029
Token being
Token ID852
Feature activation+1.58e-03

Pos logit contributions
etti+0.042
isions+0.035
ller+0.034
 Jenn+0.034
icol+0.034
Neg logit contributions
 often-0.027
 end-0.025
 forward-0.025
 narrative-0.022
 hop-0.021
Token careful
Token ID8161
Feature activation+9.84e-03

Pos logit contributions
etti+0.004
isions+0.004
ller+0.004
icol+0.004
 Jenn+0.003
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.002
 narrative-0.001
 brav-0.001
Token not
Token ID407
Feature activation+1.28e-02

Pos logit contributions
etti+0.021
ller+0.017
isions+0.017
 Jenn+0.017
 Paul+0.016
Neg logit contributions
 narrative-0.015
 often-0.013
 end-0.013
 forward-0.013
orn-0.013
Token and
Token ID290
Feature activation+2.10e-02

Pos logit contributions
etti+0.010
isions+0.009
ller+0.008
icol+0.008
 Jenn+0.008
Neg logit contributions
 often-0.005
 end-0.005
 forward-0.004
 hop-0.004
 also-0.004
Token her
Token ID607
Feature activation+1.72e-02

Pos logit contributions
etti+0.033
isions+0.026
drivers+0.024
ew+0.023
ller+0.023
Neg logit contributions
 end-0.047
 often-0.046
 forward-0.046
 always-0.042
 sometimes-0.042
Token mom
Token ID1995
Feature activation+2.70e-02

Pos logit contributions
etti+0.041
isions+0.034
icol+0.033
ller+0.033
 Pole+0.031
Neg logit contributions
 end-0.027
 often-0.025
 forward-0.021
 narrative-0.021
 moral-0.020
Tokenmy
Token ID1820
Feature activation+1.81e-02

Pos logit contributions
etti+0.073
ller+0.064
icol+0.064
isions+0.064
itor+0.061
Neg logit contributions
 often-0.027
 end-0.026
 forward-0.023
 always-0.019
 narrative-0.019
Token went
Token ID1816
Feature activation+2.76e-02

Pos logit contributions
etti+0.043
ller+0.037
isions+0.037
icol+0.036
 Jenn+0.035
Neg logit contributions
 often-0.024
 end-0.023
 forward-0.021
 narrative-0.019
 hop-0.018
Token home
Token ID1363
Feature activation+5.92e-02

Pos logit contributions
etti+0.066
ller+0.056
isions+0.056
icol+0.054
llers+0.052
Neg logit contributions
 forward-0.038
 often-0.038
 end-0.035
 hop-0.029
 narrative-0.029
Token and
Token ID290
Feature activation+3.01e-02

Pos logit contributions
etti+0.170
icol+0.151
isions+0.150
ller+0.143
 conf+0.143
Neg logit contributions
 often-0.060
 end-0.051
 forward-0.050
 always-0.041
 alone-0.041
Token put
Token ID1234
Feature activation+1.40e-02

Pos logit contributions
etti+0.073
isions+0.064
ller+0.063
MY+0.059
icol+0.059
Neg logit contributions
 often-0.045
 end-0.039
 forward-0.039
 also-0.038
 always-0.038
Token away
Token ID1497
Feature activation+4.18e-02

Pos logit contributions
etti+0.035
isions+0.030
ller+0.030
icol+0.029
 Jenn+0.028
Neg logit contributions
 forward-0.016
 end-0.016
 often-0.016
 narrative-0.015
 hop-0.013
Token all
Token ID477
Feature activation+1.84e-02

Pos logit contributions
etti+0.099
isions+0.086
ller+0.086
icol+0.083
 Pole+0.080
Neg logit contributions
 often-0.057
 end-0.057
 forward-0.056
 narrative-0.049
 all-0.047
Token the
Token ID262
Feature activation+6.49e-03

Pos logit contributions
etti+0.039
isions+0.032
 Jenn+0.032
ller+0.031
icol+0.031
Neg logit contributions
 often-0.028
 end-0.027
 forward-0.026
 narrative-0.025
 hop-0.024
Token Mom
Token ID11254
Feature activation+3.09e-02

Pos logit contributions
etti+0.032
isions+0.026
ller+0.026
 Jenn+0.025
icol+0.025
Neg logit contributions
 end-0.027
 often-0.026
 forward-0.023
 hop-0.023
 always-0.022
Tokenmy
Token ID1820
Feature activation+2.15e-02

Pos logit contributions
etti+0.070
isions+0.059
ller+0.058
icol+0.057
 Jenn+0.056
Neg logit contributions
 often-0.043
 end-0.041
 forward-0.039
 narrative-0.035
 hop-0.035
Token and
Token ID290
Feature activation+1.71e-02

Pos logit contributions
etti+0.045
isions+0.037
ller+0.036
icol+0.036
 Jenn+0.036
Neg logit contributions
 often-0.034
 end-0.032
 forward-0.030
 hop-0.029
 narrative-0.028
Token Daddy
Token ID26926
Feature activation+1.63e-02

Pos logit contributions
etti+0.031
isions+0.025
ller+0.024
icol+0.023
MY+0.022
Neg logit contributions
 often-0.033
 end-0.032
 forward-0.030
 sometimes-0.028
 hop-0.028
Token came
Token ID1625
Feature activation+2.76e-02

Pos logit contributions
etti+0.033
isions+0.027
ller+0.027
 Jenn+0.027
icol+0.026
Neg logit contributions
 often-0.026
 end-0.025
 forward-0.024
 narrative-0.024
 hop-0.022
Token home
Token ID1363
Feature activation+5.92e-02

Pos logit contributions
etti+0.072
ller+0.061
isions+0.061
icol+0.060
 Jenn+0.059
Neg logit contributions
 often-0.030
 forward-0.030
 end-0.028
 narrative-0.023
 hop-0.023
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
etti+0.158
isions+0.143
icol+0.140
ller+0.134
 badges+0.133
Neg logit contributions
 often-0.063
 end-0.057
 forward-0.055
 also-0.045
 always-0.044
Token Sarah
Token ID10490
Feature activation+2.41e-02

Pos logit contributions
etti+0.044
isions+0.035
icol+0.034
ller+0.034
itor+0.033
Neg logit contributions
 often-0.041
 end-0.037
 forward-0.037
 also-0.035
 always-0.034
Token was
Token ID373
Feature activation+1.02e-02

Pos logit contributions
etti+0.052
isions+0.043
ller+0.042
icol+0.041
 Jenn+0.041
Neg logit contributions
 often-0.037
 end-0.035
 forward-0.033
 narrative-0.031
 hop-0.030
Token so
Token ID523
Feature activation+1.05e-02

Pos logit contributions
etti+0.026
isions+0.022
ller+0.022
icol+0.022
 Jenn+0.021
Neg logit contributions
 often-0.012
 end-0.011
 forward-0.011
 narrative-0.009
 hop-0.009
Token grateful
Token ID14066
Feature activation-3.27e-03

Pos logit contributions
etti+0.026
isions+0.022
icol+0.022
ller+0.022
 Jenn+0.022
Neg logit contributions
 often-0.013
 end-0.011
 forward-0.011
 narrative-0.010
 hop-0.010
Token her
Token ID607
Feature activation+8.63e-03

Pos logit contributions
etti+0.030
 Jenn+0.025
ller+0.024
isions+0.024
icol+0.024
Neg logit contributions
 often-0.020
 end-0.019
 narrative-0.019
 forward-0.019
 brav-0.017
Token back
Token ID736
Feature activation+1.41e-02

Pos logit contributions
etti+0.019
ller+0.015
 Jenn+0.015
isions+0.015
 Brown+0.015
Neg logit contributions
 often-0.012
 narrative-0.012
 end-0.011
 forward-0.011
 hop-0.010
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
etti+0.034
isions+0.029
ller+0.028
 Jenn+0.028
icol+0.028
Neg logit contributions
 often-0.018
 end-0.017
 forward-0.017
 narrative-0.016
 hop-0.014
Token They
Token ID1119
Feature activation+1.33e-02

Pos logit contributions
etti+0.024
isions+0.020
ller+0.019
itor+0.019
 Jenn+0.019
Neg logit contributions
 often-0.020
 end-0.019
 forward-0.018
 narrative-0.017
 hop-0.016
Token walked
Token ID6807
Feature activation+1.92e-02

Pos logit contributions
etti+0.029
 Jenn+0.024
isions+0.023
ller+0.023
 Paul+0.023
Neg logit contributions
 often-0.019
 end-0.018
 forward-0.018
 narrative-0.018
 hop-0.016
Token home
Token ID1363
Feature activation+5.91e-02

Pos logit contributions
etti+0.051
isions+0.045
ller+0.045
llers+0.043
icol+0.042
Neg logit contributions
 forward-0.023
 often-0.022
 end-0.020
 hop-0.015
 all-0.015
Token together
Token ID1978
Feature activation+1.86e-02

Pos logit contributions
etti+0.162
isions+0.147
icol+0.142
 Pole+0.137
 conf+0.137
Neg logit contributions
 often-0.065
 forward-0.052
 end-0.051
 always-0.047
 also-0.044
Token,
Token ID11
Feature activation+1.93e-02

Pos logit contributions
etti+0.042
isions+0.035
 Jenn+0.035
ller+0.035
icol+0.034
Neg logit contributions
 often-0.024
 end-0.023
 narrative-0.023
 forward-0.022
 brav-0.021
Token happy
Token ID3772
Feature activation+2.77e-02

Pos logit contributions
etti+0.046
isions+0.039
ller+0.038
icol+0.037
itor+0.037
Neg logit contributions
 often-0.026
 end-0.023
 forward-0.023
 narrative-0.020
 hop-0.020
Token and
Token ID290
Feature activation+1.78e-02

Pos logit contributions
etti+0.074
isions+0.067
drivers+0.065
plug+0.063
MY+0.062
Neg logit contributions
 often-0.033
 end-0.027
 always-0.025
 sometimes-0.025
 forward-0.024
Token content
Token ID2695
Feature activation+3.03e-02

Pos logit contributions
etti+0.043
isions+0.037
ller+0.036
icol+0.035
itor+0.034
Neg logit contributions
 often-0.023
 forward-0.022
 end-0.021
 narrative-0.019
 hop-0.018
Token being
Token ID852
Feature activation+1.58e-03

Pos logit contributions
etti+0.023
isions+0.019
icol+0.019
ller+0.018
 Jenn+0.018
Neg logit contributions
 often-0.016
 end-0.016
 forward-0.014
 hop-0.013
 narrative-0.013
Token cold
Token ID4692
Feature activation+1.31e-02

Pos logit contributions
etti+0.004
isions+0.003
icol+0.003
ller+0.003
 Jenn+0.003
Neg logit contributions
 often-0.002
 forward-0.002
 end-0.002
 narrative-0.002
 brav-0.001
Token."
Token ID526
Feature activation-4.38e-03

Pos logit contributions
etti+0.028
isions+0.024
ller+0.023
 Jenn+0.023
icol+0.022
Neg logit contributions
 often-0.018
 end-0.018
 narrative-0.017
 forward-0.017
 hop-0.016
Token They
Token ID1119
Feature activation+4.03e-03

Pos logit contributions
 often+0.006
 end+0.006
 forward+0.006
 narrative+0.006
 hop+0.005
Neg logit contributions
etti-0.010
isions-0.008
ller-0.008
 Jenn-0.008
icol-0.008
Token went
Token ID1816
Feature activation+2.57e-02

Pos logit contributions
etti+0.008
 Jenn+0.007
isions+0.006
icol+0.006
 Paul+0.006
Neg logit contributions
 often-0.006
 narrative-0.006
 end-0.006
 forward-0.006
 brav-0.005
Token home
Token ID1363
Feature activation+5.90e-02

Pos logit contributions
etti+0.063
ller+0.054
isions+0.054
icol+0.051
 Jenn+0.050
Neg logit contributions
 often-0.034
 forward-0.034
 end-0.032
 hop-0.027
 narrative-0.025
Token and
Token ID290
Feature activation+2.06e-02

Pos logit contributions
etti+0.175
isions+0.155
icol+0.154
 conf+0.148
ller+0.146
Neg logit contributions
 often-0.053
 end-0.044
 forward-0.041
 also-0.037
 always-0.035
Token Lily
Token ID20037
Feature activation+1.56e-02

Pos logit contributions
etti+0.045
isions+0.039
ller+0.038
icol+0.036
itor+0.035
Neg logit contributions
 often-0.032
 end-0.030
 forward-0.029
 hop-0.027
 narrative-0.026
Token c
Token ID269
Feature activation+1.29e-02

Pos logit contributions
etti+0.035
isions+0.029
ller+0.028
 Jenn+0.028
icol+0.028
Neg logit contributions
 often-0.022
 end-0.022
 forward-0.021
 narrative-0.020
 hop-0.019
Tokenuddled
Token ID32745
Feature activation+2.53e-02

Pos logit contributions
etti+0.018
 Jenn+0.014
isions+0.014
 Paul+0.014
icol+0.013
Neg logit contributions
 often-0.028
 end-0.027
 forward-0.027
 narrative-0.025
ery-0.025
Token up
Token ID510
Feature activation+2.49e-02

Pos logit contributions
etti+0.065
isions+0.058
ller+0.055
 Pole+0.054
icol+0.054
Neg logit contributions
 often-0.028
 end-0.026
 forward-0.026
 narrative-0.022
 hop-0.021
Token big
Token ID1263
Feature activation-4.03e-03

Pos logit contributions
etti+0.010
isions+0.008
 Jenn+0.008
icol+0.008
ller+0.007
Neg logit contributions
 end-0.006
 often-0.006
 hop-0.005
 forward-0.005
 moral-0.005
Token brother
Token ID3956
Feature activation-8.75e-03

Pos logit contributions
 often+0.007
 end+0.006
 forward+0.006
 narrative+0.006
 hop+0.006
Neg logit contributions
etti-0.008
isions-0.006
ller-0.006
 Jenn-0.006
icol-0.006
Token and
Token ID290
Feature activation+1.00e-02

Pos logit contributions
 often+0.012
 narrative+0.011
 end+0.011
 forward+0.010
 brav+0.009
Neg logit contributions
etti-0.020
ller-0.016
 Jenn-0.016
isions-0.016
icol-0.016
Token they
Token ID484
Feature activation+1.07e-02

Pos logit contributions
etti+0.021
isions+0.018
ller+0.017
 Jenn+0.016
 Pole+0.016
Neg logit contributions
 often-0.016
 end-0.015
 forward-0.014
 narrative-0.014
 hop-0.014
Token went
Token ID1816
Feature activation+2.48e-02

Pos logit contributions
etti+0.022
 Jenn+0.018
ller+0.018
isions+0.017
icol+0.017
Neg logit contributions
 often-0.016
 end-0.016
 forward-0.015
 narrative-0.015
 brav-0.014
Token home
Token ID1363
Feature activation+5.89e-02

Pos logit contributions
etti+0.061
isions+0.054
ller+0.053
icol+0.051
 Jenn+0.050
Neg logit contributions
 often-0.031
 forward-0.030
 end-0.029
 hop-0.025
 narrative-0.024
Token together
Token ID1978
Feature activation+2.11e-02

Pos logit contributions
etti+0.158
isions+0.141
icol+0.139
 Pole+0.134
 conf+0.133
Neg logit contributions
 often-0.068
 forward-0.056
 end-0.056
 always-0.049
 also-0.047
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
etti+0.046
isions+0.038
 Jenn+0.038
ller+0.038
icol+0.037
Neg logit contributions
 often-0.029
 end-0.029
 narrative-0.028
 forward-0.027
 brav-0.025
Token The
Token ID383
Feature activation-1.63e-02

Pos logit contributions
etti+0.042
isions+0.037
itor+0.035
ller+0.035
 conf+0.035
Neg logit contributions
 often-0.024
 end-0.022
 forward-0.019
 also-0.019
 hop-0.018
Token end
Token ID886
Feature activation+8.63e-03

Pos logit contributions
 often+0.008
 forward+0.006
ying+0.005
 narrative+0.005
ery+0.004
Neg logit contributions
etti-0.048
isions-0.043
 Brown-0.043
ller-0.042
 grandma-0.042
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
etti+0.024
isions+0.021
itor+0.020
icol+0.020
 Jenn+0.020
Neg logit contributions
 end-0.009
 often-0.008
 forward-0.007
 narrative-0.006
 hop-0.006
Token were
Token ID547
Feature activation-2.25e-03

Pos logit contributions
etti+0.018
 Jenn+0.015
isions+0.014
ller+0.014
 Brown+0.014
Neg logit contributions
 forward-0.013
 often-0.013
 end-0.013
 narrative-0.013
 brav-0.012
Token happy
Token ID3772
Feature activation+2.72e-02

Pos logit contributions
 often+0.003
 end+0.003
 forward+0.003
 narrative+0.003
 hop+0.002
Neg logit contributions
etti-0.005
 Jenn-0.004
ller-0.004
isions-0.004
icol-0.004
Token again
Token ID757
Feature activation+1.65e-02

Pos logit contributions
etti+0.070
isions+0.065
icol+0.063
drivers+0.062
itor+0.061
Neg logit contributions
 often-0.033
 end-0.027
 sometimes-0.027
 always-0.025
 all-0.025
Token and
Token ID290
Feature activation+1.35e-02

Pos logit contributions
etti+0.043
isions+0.038
icol+0.037
itor+0.037
 Jenn+0.037
Neg logit contributions
 often-0.018
 end-0.018
 forward-0.015
 narrative-0.014
 sometimes-0.013
Token went
Token ID1816
Feature activation+2.52e-02

Pos logit contributions
etti+0.029
isions+0.026
ller+0.024
icol+0.023
itor+0.023
Neg logit contributions
 often-0.021
 end-0.020
 forward-0.019
 narrative-0.018
 hop-0.018
Token home
Token ID1363
Feature activation+5.87e-02

Pos logit contributions
etti+0.063
isions+0.057
ller+0.055
icol+0.053
 Jenn+0.052
Neg logit contributions
 often-0.031
 forward-0.030
 end-0.029
 hop-0.024
 narrative-0.023
Token with
Token ID351
Feature activation+3.13e-03

Pos logit contributions
etti+0.165
isions+0.153
icol+0.149
 Pole+0.142
 conf+0.141
Neg logit contributions
 often-0.059
 end-0.050
 forward-0.048
 always-0.040
 alone-0.038
Token a
Token ID257
Feature activation+1.92e-02

Pos logit contributions
etti+0.007
isions+0.007
ller+0.006
itor+0.006
 Pole+0.006
Neg logit contributions
 end-0.005
 often-0.004
 forward-0.004
 always-0.004
 sometimes-0.004
Token smile
Token ID8212
Feature activation+1.35e-02

Pos logit contributions
etti+0.049
isions+0.041
ller+0.040
itor+0.039
icol+0.039
Neg logit contributions
 end-0.025
 often-0.023
 forward-0.022
 hop-0.020
 narrative-0.019
Token.
Token ID13
Feature activation+8.26e-03

Pos logit contributions
etti+0.032
ller+0.027
isions+0.027
 Jenn+0.027
icol+0.026
Neg logit contributions
 often-0.017
 end-0.016
 forward-0.015
 narrative-0.015
 hop-0.013
Token<|endoftext|>
Token ID50256
Feature activation+4.00e-03

Pos logit contributions
etti+0.020
isions+0.018
itor+0.017
icol+0.017
 conf+0.016
Neg logit contributions
 often-0.010
 end-0.010
 forward-0.008
 always-0.008
 sometimes-0.008
Tokenles
Token ID829
Feature activation-1.87e-02

Pos logit contributions
 often+0.030
 end+0.028
 forward+0.028
 narrative+0.028
 hop+0.026
Neg logit contributions
etti-0.045
ller-0.037
isions-0.037
 conf-0.035
 sprink-0.035
Token.
Token ID13
Feature activation+5.45e-03

Pos logit contributions
 often+0.027
 end+0.026
 narrative+0.026
 forward+0.025
 brav+0.023
Neg logit contributions
etti-0.041
isions-0.033
ller-0.033
 Jenn-0.032
 conf-0.032
Token When
Token ID1649
Feature activation+1.81e-02

Pos logit contributions
etti+0.012
ller+0.010
isions+0.010
drivers+0.010
itor+0.010
Neg logit contributions
 often-0.008
 end-0.007
 forward-0.007
 sometimes-0.007
 narrative-0.007
Token mom
Token ID1995
Feature activation+3.94e-02

Pos logit contributions
etti+0.042
isions+0.039
ller+0.037
drivers+0.036
icol+0.036
Neg logit contributions
 end-0.028
 often-0.027
 also-0.022
 forward-0.022
 sometimes-0.022
Token came
Token ID1625
Feature activation+2.02e-02

Pos logit contributions
etti+0.097
ller+0.088
icol+0.088
isions+0.087
itor+0.084
Neg logit contributions
 end-0.045
 often-0.041
 also-0.036
 always-0.034
 sometimes-0.034
Token home
Token ID1363
Feature activation+5.86e-02

Pos logit contributions
etti+0.053
isions+0.047
ller+0.046
icol+0.045
 Jenn+0.044
Neg logit contributions
 forward-0.022
 end-0.021
 often-0.021
 hop-0.016
 narrative-0.015
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
etti+0.168
isions+0.159
icol+0.154
 badges+0.148
ller+0.147
Neg logit contributions
 end-0.050
 often-0.049
 forward-0.045
 sometimes-0.036
 also-0.035
Token she
Token ID673
Feature activation+3.09e-02

Pos logit contributions
etti+0.025
icol+0.021
isions+0.020
ller+0.020
drivers+0.019
Neg logit contributions
 often-0.022
 end-0.021
 forward-0.021
 also-0.020
 narrative-0.020
Token was
Token ID373
Feature activation+2.53e-03

Pos logit contributions
etti+0.067
isions+0.056
ller+0.053
icol+0.053
 Jenn+0.053
Neg logit contributions
 often-0.048
 end-0.045
 forward-0.041
 also-0.040
 narrative-0.040
Token surprised
Token ID6655
Feature activation+1.15e-04

Pos logit contributions
etti+0.006
 Jenn+0.005
ller+0.005
isions+0.005
icol+0.005
Neg logit contributions
 often-0.003
 end-0.003
 narrative-0.003
 forward-0.003
 hop-0.003
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
etti+0.000
isions+0.000
ller+0.000
 Jenn+0.000
icol+0.000
Neg logit contributions
 often-0.000
 end-0.000
 forward-0.000
 narrative-0.000
 hop-0.000
Token
Token ID198
Feature activation+1.57e-02

Pos logit contributions
 often+0.003
 end+0.002
 forward+0.002
 narrative+0.002
 hop+0.002
Neg logit contributions
etti-0.003
isions-0.003
ller-0.003
 Jenn-0.003
icol-0.003
TokenSoon
Token ID28093
Feature activation+2.31e-02

Pos logit contributions
etti+0.040
icol+0.034
ller+0.033
isions+0.033
 Jenn+0.032
Neg logit contributions
 often-0.019
 end-0.018
 forward-0.017
 narrative-0.016
 also-0.014
Token,
Token ID11
Feature activation+1.98e-02

Pos logit contributions
etti+0.067
ller+0.064
isions+0.064
icol+0.061
itor+0.060
Neg logit contributions
 end-0.022
 often-0.018
 forward-0.016
 sometimes-0.014
 all-0.012
Token Dad
Token ID17415
Feature activation+4.07e-02

Pos logit contributions
etti+0.045
icol+0.040
ller+0.039
isions+0.039
drivers+0.038
Neg logit contributions
 end-0.032
 often-0.029
 also-0.027
 forward-0.027
 sometimes-0.026
Token came
Token ID1625
Feature activation+2.16e-02

Pos logit contributions
etti+0.103
icol+0.090
ller+0.088
isions+0.086
llers+0.082
Neg logit contributions
 end-0.048
 often-0.047
 also-0.043
 forward-0.041
 always-0.040
Token home
Token ID1363
Feature activation+5.85e-02

Pos logit contributions
etti+0.054
ller+0.047
icol+0.046
isions+0.046
 Pole+0.044
Neg logit contributions
 forward-0.028
 end-0.027
 often-0.026
 hop-0.022
 also-0.020
Token from
Token ID422
Feature activation+2.37e-02

Pos logit contributions
etti+0.161
icol+0.141
isions+0.141
ller+0.134
 badges+0.133
Neg logit contributions
 often-0.060
 end-0.059
 forward-0.054
 also-0.043
 sometimes-0.043
Token work
Token ID670
Feature activation+1.04e-02

Pos logit contributions
etti+0.055
isions+0.049
icol+0.045
ller+0.045
 Pole+0.043
Neg logit contributions
 end-0.038
 often-0.036
 forward-0.031
 sometimes-0.029
 narrative-0.027
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
etti+0.020
ller+0.017
isions+0.017
 Jenn+0.017
 Paul+0.016
Neg logit contributions
 end-0.016
 narrative-0.016
 often-0.015
 brav-0.015
 hop-0.015
Token He
Token ID679
Feature activation+2.85e-02

Pos logit contributions
etti+0.055
ller+0.046
isions+0.046
icol+0.045
MY+0.044
Neg logit contributions
 often-0.014
 end-0.014
 forward-0.012
 narrative-0.010
 also-0.010
Token was
Token ID373
Feature activation+1.40e-02

Pos logit contributions
etti+0.070
isions+0.058
icol+0.057
ller+0.057
llers+0.054
Neg logit contributions
 often-0.039
 end-0.036
 also-0.035
 always-0.032
 forward-0.030
Token.
Token ID13
Feature activation+6.22e-03

Pos logit contributions
 narrative+0.021
 often+0.021
 end+0.021
 brav+0.020
 hop+0.020
Neg logit contributions
etti-0.022
ller-0.018
 Jenn-0.018
isions-0.018
 Paul-0.017
Token
Token ID198
Feature activation+9.78e-03

Pos logit contributions
etti+0.014
isions+0.012
ller+0.012
icol+0.011
itor+0.011
Neg logit contributions
 often-0.009
 end-0.008
 forward-0.008
 narrative-0.007
 hop-0.007
Token
Token ID198
Feature activation+1.76e-02

Pos logit contributions
etti+0.021
isions+0.017
ller+0.017
icol+0.017
 Jenn+0.017
Neg logit contributions
 often-0.015
 end-0.014
 forward-0.014
 narrative-0.013
 hop-0.012
TokenDad
Token ID46270
Feature activation+3.82e-02

Pos logit contributions
etti+0.038
isions+0.031
ller+0.031
 Jenn+0.031
icol+0.030
Neg logit contributions
 often-0.026
 end-0.025
 forward-0.025
 narrative-0.024
 hop-0.022
Token came
Token ID1625
Feature activation+1.99e-02

Pos logit contributions
etti+0.085
isions+0.070
icol+0.069
ller+0.069
 Jenn+0.067
Neg logit contributions
 end-0.055
 often-0.055
 forward-0.050
 narrative-0.049
 also-0.046
Token home
Token ID1363
Feature activation+5.84e-02

Pos logit contributions
etti+0.049
ller+0.041
isions+0.041
icol+0.041
 Jenn+0.040
Neg logit contributions
 often-0.024
 end-0.024
 forward-0.023
 narrative-0.020
 hop-0.020
Token and
Token ID290
Feature activation+4.01e-02

Pos logit contributions
etti+0.162
icol+0.142
isions+0.142
ller+0.135
 badges+0.132
Neg logit contributions
 end-0.057
 often-0.057
 forward-0.050
 sometimes-0.043
 also-0.042
Token saw
Token ID2497
Feature activation+1.04e-02

Pos logit contributions
etti+0.094
isions+0.077
ller+0.075
icol+0.075
MY+0.071
Neg logit contributions
 often-0.058
 end-0.056
 forward-0.052
 also-0.051
 sometimes-0.048
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
etti+0.020
isions+0.017
icol+0.016
ller+0.016
itor+0.015
Neg logit contributions
 end-0.018
 often-0.018
 forward-0.016
 narrative-0.016
 sometimes-0.015
Token mess
Token ID2085
Feature activation+2.77e-02

Pos logit contributions
etti+0.051
icol+0.039
isions+0.039
 Jenn+0.039
ller+0.038
Neg logit contributions
 end-0.049
 often-0.046
 forward-0.043
 narrative-0.043
ery-0.041
Token.
Token ID13
Feature activation+8.77e-03

Pos logit contributions
etti+0.063
isions+0.051
ller+0.051
 Jenn+0.051
 Paul+0.048
Neg logit contributions
 often-0.038
 end-0.036
 forward-0.035
 narrative-0.035
 brav-0.032
Token friends
Token ID2460
Feature activation+5.54e-03

Pos logit contributions
 often+0.021
 end+0.020
 forward+0.019
 narrative+0.019
 hop+0.017
Neg logit contributions
etti-0.033
isions-0.028
ller-0.027
 Jenn-0.027
icol-0.027
Token said
Token ID531
Feature activation+3.15e-03

Pos logit contributions
etti+0.010
 Jenn+0.009
 Paul+0.009
 Brown+0.008
isions+0.008
Neg logit contributions
 narrative-0.009
 end-0.009
 forward-0.009
 often-0.008
ery-0.008
Token goodbye
Token ID24829
Feature activation+1.75e-02

Pos logit contributions
etti+0.007
 Jenn+0.005
isions+0.005
ller+0.005
icol+0.005
Neg logit contributions
 often-0.005
 end-0.005
 narrative-0.004
 forward-0.004
 hop-0.004
Token and
Token ID290
Feature activation+1.05e-02

Pos logit contributions
etti+0.043
isions+0.037
 Jenn+0.035
ller+0.035
 conf+0.034
Neg logit contributions
 often-0.022
 forward-0.021
 end-0.020
 hop-0.016
 sometimes-0.016
Token went
Token ID1816
Feature activation+2.50e-02

Pos logit contributions
etti+0.025
isions+0.022
ller+0.021
 Jenn+0.020
icol+0.020
Neg logit contributions
 often-0.014
 end-0.013
 forward-0.013
 narrative-0.012
 hop-0.011
Token home
Token ID1363
Feature activation+5.83e-02

Pos logit contributions
etti+0.060
isions+0.052
ller+0.051
 Jenn+0.049
icol+0.049
Neg logit contributions
 often-0.032
 forward-0.031
 end-0.030
 hop-0.026
 narrative-0.025
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
etti+0.168
isions+0.152
icol+0.149
 conf+0.145
 Pole+0.143
Neg logit contributions
 often-0.055
 forward-0.043
 end-0.042
 alone-0.035
 always-0.035
Token Tim
Token ID5045
Feature activation-1.26e-02

Pos logit contributions
etti+0.037
ñ+0.030
ller+0.030
isions+0.029
 conf+0.029
Neg logit contributions
 often-0.038
 end-0.032
 forward-0.032
 always-0.031
 sometimes-0.031
Tokenmy
Token ID1820
Feature activation-4.30e-03

Pos logit contributions
 narrative+0.019
 forward+0.019
 often+0.018
 end+0.017
 brav+0.016
Neg logit contributions
etti-0.024
 Brown-0.021
ller-0.021
 Jenn-0.021
 Paul-0.020
Token was
Token ID373
Feature activation-2.08e-03

Pos logit contributions
 forward+0.007
 narrative+0.007
 end+0.007
 often+0.006
ery+0.006
Neg logit contributions
etti-0.008
ller-0.007
 Jenn-0.007
isions-0.007
icol-0.007
Token tired
Token ID10032
Feature activation+7.86e-03

Pos logit contributions
 often+0.002
 end+0.002
 narrative+0.002
 forward+0.002
 hop+0.002
Neg logit contributions
etti-0.005
isions-0.004
ller-0.004
 Jenn-0.004
icol-0.004
Token '
Token ID705
Feature activation-4.74e-03

Pos logit contributions
etti+0.003
isions+0.002
ller+0.002
 Jenn+0.002
 Paul+0.002
Neg logit contributions
 end-0.003
 often-0.003
 narrative-0.003
 forward-0.003
orn-0.003
TokenNo
Token ID2949
Feature activation-6.31e-03

Pos logit contributions
 often+0.008
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.010
isions-0.008
ller-0.008
 Jenn-0.008
icol-0.008
Token,'
Token ID4032
Feature activation-2.39e-02

Pos logit contributions
 often+0.010
 end+0.010
 forward+0.010
 narrative+0.010
 hop+0.009
Neg logit contributions
etti-0.012
isions-0.010
 Jenn-0.010
ller-0.010
icol-0.010
Token but
Token ID475
Feature activation-1.42e-02

Pos logit contributions
 often+0.040
 end+0.038
 forward+0.037
 narrative+0.037
ery+0.035
Neg logit contributions
etti-0.046
 Jenn-0.038
isions-0.036
ller-0.036
 Paul-0.036
Token J
Token ID449
Feature activation-1.35e-02

Pos logit contributions
 often+0.026
 end+0.025
 narrative+0.025
 forward+0.025
ery+0.025
Neg logit contributions
etti-0.024
 Jenn-0.020
 Paul-0.019
ller-0.019
isions-0.018
Tokenod
Token ID375
Feature activation-6.44e-02

Pos logit contributions
 often+0.028
 end+0.028
 forward+0.028
 narrative+0.027
 hop+0.025
Neg logit contributions
etti-0.021
isions-0.016
 Jenn-0.015
ller-0.015
icol-0.014
Tokenie
Token ID494
Feature activation-2.70e-02

Pos logit contributions
 forward+0.095
 hop+0.085
 end+0.081
 brav+0.081
 often+0.077
Neg logit contributions
etti-0.116
ller-0.103
 Jenn-0.103
 Paul-0.098
yl-0.098
Token kept
Token ID4030
Feature activation-2.24e-02

Pos logit contributions
 forward+0.046
 narrative+0.046
 end+0.045
 often+0.043
 brav+0.039
Neg logit contributions
etti-0.046
ller-0.039
 Jenn-0.039
isions-0.038
 Paul-0.037
Token going
Token ID1016
Feature activation-2.38e-02

Pos logit contributions
 narrative+0.041
 end+0.041
 often+0.040
orn+0.038
ery+0.037
Neg logit contributions
etti-0.036
 Jenn-0.029
 Paul-0.029
ller-0.028
 flashing-0.028
Token.
Token ID13
Feature activation-2.28e-03

Pos logit contributions
 narrative+0.043
orn+0.040
ery+0.039
 brav+0.038
py+0.036
Neg logit contributions
etti-0.038
 Smith-0.033
 Paul-0.032
 Johnson-0.032
 Sus-0.031
Token
Token ID198
Feature activation-6.19e-03

Pos logit contributions
 often+0.004
 end+0.003
 forward+0.003
 narrative+0.003
 hop+0.003
Neg logit contributions
etti-0.005
isions-0.004
ller-0.004
 Jenn-0.004
icol-0.004
TokenG
Token ID38
Feature activation-3.48e-03

Pos logit contributions
 often+0.025
 end+0.024
 forward+0.023
 narrative+0.022
 hop+0.021
Neg logit contributions
etti-0.027
isions-0.022
 Jenn-0.021
ller-0.021
icol-0.021
Tokenain
Token ID391
Feature activation-1.88e-02

Pos logit contributions
 often+0.006
 end+0.005
 forward+0.005
 narrative+0.005
 hop+0.005
Neg logit contributions
etti-0.007
isions-0.006
 Jenn-0.006
ller-0.006
icol-0.006
Token is
Token ID318
Feature activation-1.37e-02

Pos logit contributions
 often+0.031
 end+0.030
 narrative+0.030
 forward+0.030
 hop+0.025
Neg logit contributions
etti-0.036
ller-0.029
 Jenn-0.028
isions-0.028
 Paul-0.028
Token better
Token ID1365
Feature activation+1.03e-03

Pos logit contributions
 narrative+0.021
 end+0.021
 often+0.021
 forward+0.020
 hop+0.018
Neg logit contributions
etti-0.027
 Jenn-0.023
ller-0.022
isions-0.022
 Brown-0.022
Token than
Token ID621
Feature activation-9.21e-03

Pos logit contributions
etti+0.002
ller+0.002
isions+0.002
 Jenn+0.002
icol+0.002
Neg logit contributions
 end-0.002
 often-0.002
 narrative-0.002
 forward-0.002
 hop-0.001
Token greed
Token ID25474
Feature activation-6.37e-02

Pos logit contributions
 often+0.015
 end+0.015
 narrative+0.015
 forward+0.015
 hop+0.013
Neg logit contributions
etti-0.017
 Jenn-0.014
ller-0.014
isions-0.014
 Paul-0.013
Token,
Token ID11
Feature activation-7.80e-03

Pos logit contributions
 narrative+0.115
 often+0.109
wered+0.106
 end+0.097
 moral+0.096
Neg logit contributions
etti-0.080
 Tommy-0.072
 Smith-0.068
 Lee-0.067
 Brown-0.067
Token child
Token ID1200
Feature activation-1.96e-02

Pos logit contributions
 narrative+0.015
ery+0.014
 often+0.014
 end+0.014
 forward+0.014
Neg logit contributions
etti-0.012
 Sus-0.010
 Jenn-0.010
 Paul-0.010
icol-0.009
Token."
Token ID526
Feature activation+1.89e-03

Pos logit contributions
 narrative+0.031
 end+0.030
 often+0.030
 forward+0.029
 hop+0.026
Neg logit contributions
etti-0.039
ller-0.032
 Jenn-0.032
isions-0.032
 Paul-0.030
Token
Token ID198
Feature activation-3.69e-03

Pos logit contributions
etti+0.003
isions+0.003
ller+0.002
 Jenn+0.002
icol+0.002
Neg logit contributions
 often-0.004
 end-0.004
 forward-0.003
 narrative-0.003
 hop-0.003
Token
Token ID198
Feature activation-4.65e-03

Pos logit contributions
 often+0.006
 end+0.006
 forward+0.006
 narrative+0.006
 hop+0.005
Neg logit contributions
etti-0.007
isions-0.006
ller-0.006
 Jenn-0.006
icol-0.006
Token shrimp
Token ID30060
Feature activation-1.90e-02

Pos logit contributions
 often+0.037
 narrative+0.034
 forward+0.034
 end+0.032
ery+0.031
Neg logit contributions
etti-0.033
isions-0.027
 Pole-0.026
ller-0.026
 conf-0.026
Token was
Token ID373
Feature activation-3.97e-04

Pos logit contributions
 forward+0.034
 narrative+0.033
 end+0.033
 often+0.031
 brav+0.029
Neg logit contributions
etti-0.032
isions-0.026
ller-0.025
 Rob-0.024
 Jenn-0.024
Token pushing
Token ID7796
Feature activation-1.94e-02

Pos logit contributions
 narrative+0.001
 end+0.001
 often+0.001
 forward+0.001
orn+0.000
Neg logit contributions
etti-0.001
 Jenn-0.001
ller-0.001
isions-0.001
 Paul-0.001
Token against
Token ID1028
Feature activation-7.19e-03

Pos logit contributions
 often+0.027
 end+0.026
 narrative+0.025
 forward+0.025
 hop+0.022
Neg logit contributions
etti-0.043
isions-0.036
ller-0.036
 Jenn-0.036
icol-0.035
Token the
Token ID262
Feature activation-2.40e-02

Pos logit contributions
 often+0.011
 end+0.011
 forward+0.010
 narrative+0.010
 hop+0.009
Neg logit contributions
etti-0.015
isions-0.013
ller-0.013
icol-0.012
 Jenn-0.012
Token hull
Token ID23644
Feature activation-6.01e-02

Pos logit contributions
 often+0.039
 end+0.036
 forward+0.036
 narrative+0.036
 hop+0.032
Neg logit contributions
etti-0.048
isions-0.039
ller-0.039
icol-0.038
 Jenn-0.037
Token of
Token ID286
Feature activation-1.78e-02

Pos logit contributions
 narrative+0.126
orn+0.098
 brav+0.098
 moral+0.096
 often+0.094
Neg logit contributions
etti-0.098
 flashing-0.078
 Paul-0.077
 Jenn-0.077
ller-0.077
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 often+0.032
 narrative+0.031
 forward+0.031
 end+0.030
ery+0.027
Neg logit contributions
etti-0.032
 Jenn-0.026
 Paul-0.025
isions-0.025
 Rob-0.025
Token vessel
Token ID8837
Feature activation-3.42e-02

Pos logit contributions
 often+0.038
 forward+0.035
 end+0.035
 narrative+0.035
 hop+0.031
Neg logit contributions
etti-0.045
isions-0.037
ller-0.036
icol-0.035
 Pole-0.035
Token,
Token ID11
Feature activation+2.28e-03

Pos logit contributions
 narrative+0.059
 often+0.052
 end+0.051
 forward+0.050
orn+0.050
Neg logit contributions
etti-0.064
isions-0.052
ller-0.051
 Paul-0.050
 Jenn-0.050
Token which
Token ID543
Feature activation-9.50e-03

Pos logit contributions
etti+0.004
ller+0.004
isions+0.003
 flashing+0.003
icol+0.003
Neg logit contributions
 narrative-0.004
 end-0.004
 often-0.004
 forward-0.003
orn-0.003
Token had
Token ID550
Feature activation-1.45e-03

Pos logit contributions
etti+0.007
ller+0.006
isions+0.006
 Jenn+0.006
icol+0.006
Neg logit contributions
 often-0.006
 forward-0.006
 end-0.005
 narrative-0.005
 hop-0.005
Token to
Token ID284
Feature activation-1.38e-02

Pos logit contributions
 often+0.002
 end+0.002
 forward+0.002
 narrative+0.002
 hop+0.002
Neg logit contributions
etti-0.003
isions-0.003
ller-0.003
 Jenn-0.003
icol-0.002
Token suffer
Token ID8659
Feature activation-2.53e-03

Pos logit contributions
 forward+0.019
 often+0.018
 narrative+0.017
 brav+0.016
ying+0.016
Neg logit contributions
etti-0.030
 Jenn-0.025
 Brown-0.025
icol-0.025
 Rob-0.024
Token for
Token ID329
Feature activation-6.64e-04

Pos logit contributions
 narrative+0.004
 often+0.004
 end+0.004
 forward+0.004
 brav+0.003
Neg logit contributions
etti-0.005
 Brown-0.004
 Jenn-0.004
ller-0.004
 Paul-0.004
Token her
Token ID607
Feature activation-4.86e-03

Pos logit contributions
 often+0.001
 end+0.001
 forward+0.001
 narrative+0.001
 hop+0.001
Neg logit contributions
etti-0.001
isions-0.001
ller-0.001
icol-0.001
 Jenn-0.001
Token greed
Token ID25474
Feature activation-5.95e-02

Pos logit contributions
 often+0.008
 forward+0.008
 narrative+0.008
 end+0.008
 brav+0.007
Neg logit contributions
etti-0.009
ller-0.007
isions-0.007
 Brown-0.007
 Jenn-0.007
Token.
Token ID13
Feature activation-5.91e-03

Pos logit contributions
 often+0.084
 narrative+0.084
 brav+0.083
wered+0.080
 fitt+0.079
Neg logit contributions
 Tommy-0.076
etti-0.076
 grandma-0.074
keeping-0.072
 Tom-0.072
Token She
Token ID1375
Feature activation+4.97e-03

Pos logit contributions
 often+0.007
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.014
isions-0.012
ller-0.012
icol-0.012
 Jenn-0.011
Token was
Token ID373
Feature activation-1.21e-02

Pos logit contributions
etti+0.009
ller+0.008
 Jenn+0.008
isions+0.007
 Brown+0.007
Neg logit contributions
 forward-0.008
 often-0.008
 end-0.008
 narrative-0.008
 brav-0.007
Token very
Token ID845
Feature activation-4.05e-03

Pos logit contributions
 narrative+0.017
 often+0.016
 end+0.016
 forward+0.015
 hop+0.015
Neg logit contributions
etti-0.024
ller-0.021
 Jenn-0.021
 Brown-0.020
 flashing-0.020
Token sorry
Token ID7926
Feature activation-9.81e-03

Pos logit contributions
 often+0.006
 narrative+0.006
 end+0.006
orn+0.006
 forward+0.005
Neg logit contributions
etti-0.007
ller-0.006
plug-0.006
 Jenn-0.006
 grandma-0.006
Token it
Token ID340
Feature activation+6.31e-03

Pos logit contributions
etti+0.004
isions+0.003
icol+0.003
ller+0.003
itor+0.003
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.002
 hop-0.002
 narrative-0.002
Token.
Token ID13
Feature activation+2.25e-03

Pos logit contributions
etti+0.014
ller+0.012
 Jenn+0.012
isions+0.011
icol+0.011
Neg logit contributions
 narrative-0.008
 often-0.008
 end-0.008
 forward-0.008
 brav-0.007
Token
Token ID198
Feature activation-5.61e-03

Pos logit contributions
etti+0.004
isions+0.004
ller+0.004
itor+0.003
 Jenn+0.003
Neg logit contributions
 often-0.004
 end-0.004
 forward-0.004
 narrative-0.003
 hop-0.003
Token
Token ID198
Feature activation+4.00e-04

Pos logit contributions
 often+0.009
 end+0.009
 forward+0.009
 narrative+0.009
 hop+0.008
Neg logit contributions
etti-0.011
isions-0.009
 Jenn-0.009
ller-0.009
icol-0.008
TokenJ
Token ID41
Feature activation-2.00e-03

Pos logit contributions
etti+0.001
isions+0.001
 Jenn+0.001
ller+0.001
icol+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
ery-0.001
Tokenod
Token ID375
Feature activation-5.94e-02

Pos logit contributions
 often+0.004
 end+0.004
 forward+0.003
 narrative+0.003
 hop+0.003
Neg logit contributions
etti-0.004
isions-0.003
ller-0.003
icol-0.003
 Jenn-0.003
Tokenie
Token ID494
Feature activation-1.54e-02

Pos logit contributions
 forward+0.096
 hop+0.081
 brav+0.079
 often+0.077
 narrative+0.076
Neg logit contributions
etti-0.104
ller-0.090
isions-0.088
 Jenn-0.086
edd-0.085
Token was
Token ID373
Feature activation-7.17e-03

Pos logit contributions
 end+0.023
 often+0.023
 forward+0.023
 narrative+0.023
 brav+0.020
Neg logit contributions
etti-0.032
ller-0.026
isions-0.026
 Jenn-0.026
icol-0.025
Token amazed
Token ID24562
Feature activation-7.18e-03

Pos logit contributions
 often+0.008
 end+0.008
 forward+0.008
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.018
isions-0.015
ller-0.015
icol-0.015
 Jenn-0.015
Token by
Token ID416
Feature activation+6.92e-04

Pos logit contributions
 often+0.009
 end+0.009
 forward+0.008
 narrative+0.008
 hop+0.007
Neg logit contributions
etti-0.017
isions-0.015
ller-0.015
 Jenn-0.014
icol-0.014
Token what
Token ID644
Feature activation-3.36e-03

Pos logit contributions
etti+0.001
 Jenn+0.001
ller+0.001
isions+0.001
 Paul+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 hop-0.001
Token difficult
Token ID2408
Feature activation-1.39e-02

Pos logit contributions
 narrative+0.015
 forward+0.012
 brav+0.012
 end+0.012
 often+0.011
Neg logit contributions
etti-0.013
ller-0.010
 Brown-0.010
 sprink-0.010
 Johnson-0.010
Token to
Token ID284
Feature activation-1.29e-02

Pos logit contributions
 often+0.022
 end+0.022
 narrative+0.022
 forward+0.022
 brav+0.020
Neg logit contributions
etti-0.027
isions-0.022
ller-0.022
 Jenn-0.022
icol-0.021
Token hold
Token ID1745
Feature activation-2.08e-02

Pos logit contributions
 narrative+0.022
 forward+0.022
 often+0.019
 brav+0.018
 end+0.018
Neg logit contributions
etti-0.026
 Jenn-0.021
ller-0.020
 Brown-0.020
 sprink-0.019
Token the
Token ID262
Feature activation-4.02e-03

Pos logit contributions
 narrative+0.036
 often+0.036
 end+0.034
 hop+0.032
 brav+0.031
Neg logit contributions
etti-0.037
 Jenn-0.031
 Brown-0.030
 Sus-0.030
 Paul-0.029
Token silver
Token ID8465
Feature activation-1.43e-02

Pos logit contributions
 often+0.007
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.007
isions-0.006
ller-0.006
 Jenn-0.006
icol-0.006
Tokenware
Token ID1574
Feature activation-5.89e-02

Pos logit contributions
 narrative+0.030
 often+0.030
 forward+0.029
 end+0.028
 brav+0.027
Neg logit contributions
etti-0.022
ller-0.016
isions-0.015
 Jenn-0.015
 Pole-0.014
Token,
Token ID11
Feature activation-1.06e-02

Pos logit contributions
 narrative+0.121
wered+0.104
 End+0.099
 brav+0.098
py+0.098
Neg logit contributions
etti-0.082
 extingu-0.070
 Grand-0.066
 Brown-0.064
isions-0.062
Token but
Token ID475
Feature activation-1.57e-02

Pos logit contributions
 narrative+0.019
 end+0.018
orn+0.018
ery+0.018
 forward+0.017
Neg logit contributions
etti-0.018
ller-0.015
 Jenn-0.015
isions-0.015
 Grand-0.015
Token she
Token ID673
Feature activation+1.91e-03

Pos logit contributions
 often+0.022
 end+0.021
 forward+0.020
 narrative+0.020
 hop+0.018
Neg logit contributions
etti-0.035
 Jenn-0.029
ller-0.029
isions-0.029
icol-0.028
Token tried
Token ID3088
Feature activation-2.02e-02

Pos logit contributions
etti+0.004
ller+0.003
isions+0.003
 Jenn+0.003
itor+0.003
Neg logit contributions
 forward-0.003
 narrative-0.003
 end-0.003
 often-0.003
 brav-0.003
Token her
Token ID607
Feature activation-8.13e-03

Pos logit contributions
 narrative+0.037
 often+0.035
 end+0.034
 forward+0.034
 brav+0.033
Neg logit contributions
etti-0.035
 Jenn-0.029
ller-0.028
isions-0.027
 Brown-0.027
Token dolls
Token ID36062
Feature activation-2.43e-02

Pos logit contributions
etti+0.026
icol+0.019
isions+0.019
ller+0.019
 Jenn+0.018
Neg logit contributions
 often-0.032
 end-0.032
 forward-0.031
 narrative-0.029
 hop-0.028
Token around
Token ID1088
Feature activation-7.71e-03

Pos logit contributions
 narrative+0.045
 brav+0.038
 moral+0.037
ery+0.037
orn+0.037
Neg logit contributions
etti-0.040
 Jenn-0.034
isions-0.033
 flashing-0.032
 Paul-0.032
Token in
Token ID287
Feature activation-9.55e-03

Pos logit contributions
 narrative+0.011
 often+0.011
 forward+0.011
 end+0.011
ery+0.010
Neg logit contributions
etti-0.016
 Jenn-0.013
isions-0.013
ller-0.013
 Paul-0.013
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
 often+0.017
 narrative+0.016
 forward+0.016
 end+0.016
ery+0.015
Neg logit contributions
etti-0.017
 Jenn-0.014
isions-0.014
 Paul-0.013
 conf-0.013
Token wheel
Token ID7825
Feature activation-2.97e-02

Pos logit contributions
 often+0.017
 end+0.017
 forward+0.016
 narrative+0.016
 hop+0.015
Neg logit contributions
etti-0.023
isions-0.019
ller-0.019
 Jenn-0.019
icol-0.018
Tokenbar
Token ID5657
Feature activation-5.89e-02

Pos logit contributions
 narrative+0.051
 often+0.045
 forward+0.045
 brav+0.045
 end+0.044
Neg logit contributions
etti-0.056
 Jenn-0.048
ller-0.047
isions-0.045
 Paul-0.044
Tokenrow
Token ID808
Feature activation-2.51e-02

Pos logit contributions
 often+0.083
 forward+0.080
 narrative+0.079
 brav+0.070
 hop+0.068
Neg logit contributions
etti-0.121
ller-0.105
drivers-0.104
 Jenn-0.102
uc-0.098
Token she
Token ID673
Feature activation+2.04e-03

Pos logit contributions
 narrative+0.052
 moral+0.045
 end+0.043
 brav+0.043
 forward+0.041
Neg logit contributions
etti-0.041
 Jenn-0.034
ller-0.033
 Johnson-0.033
 flashing-0.032
Token had
Token ID550
Feature activation+1.18e-03

Pos logit contributions
etti+0.004
ller+0.004
isions+0.004
 Jenn+0.004
icol+0.003
Neg logit contributions
 end-0.003
 narrative-0.003
 forward-0.003
 often-0.003
 hop-0.002
Token found
Token ID1043
Feature activation-2.48e-03

Pos logit contributions
etti+0.002
 Jenn+0.002
isions+0.002
ller+0.002
icol+0.002
Neg logit contributions
 end-0.002
 often-0.002
 narrative-0.002
 forward-0.002
 hop-0.001
Token in
Token ID287
Feature activation+2.02e-03

Pos logit contributions
 narrative+0.004
 often+0.004
 end+0.003
 forward+0.003
 hop+0.003
Neg logit contributions
etti-0.005
 Jenn-0.004
 Brown-0.004
 Paul-0.004
isions-0.004
Token "
Token ID366
Feature activation-1.59e-02

Pos logit contributions
 narrative+0.001
 end+0.001
 often+0.001
 forward+0.001
ery+0.001
Neg logit contributions
etti-0.001
ller-0.001
isions-0.001
 Jenn-0.001
 Brown-0.001
TokenI
Token ID40
Feature activation-2.16e-02

Pos logit contributions
 end+0.027
 often+0.027
 forward+0.026
 narrative+0.026
ery+0.024
Neg logit contributions
etti-0.028
isions-0.024
ller-0.023
 Jenn-0.022
icol-0.022
Token can
Token ID460
Feature activation-2.59e-02

Pos logit contributions
 narrative+0.033
 forward+0.028
 end+0.028
 often+0.028
wered+0.027
Neg logit contributions
etti-0.039
 turns-0.035
 Jenn-0.032
 Paul-0.032
 Rob-0.032
Token stretch
Token ID7539
Feature activation-4.00e-02

Pos logit contributions
 narrative+0.043
 forward+0.037
 end+0.033
 often+0.033
orn+0.032
Neg logit contributions
etti-0.052
ller-0.043
 Rob-0.043
 Brown-0.040
isions-0.040
Token my
Token ID616
Feature activation-3.43e-02

Pos logit contributions
 narrative+0.076
 end+0.064
orn+0.062
 brav+0.062
 often+0.061
Neg logit contributions
etti-0.064
 Rob-0.051
 Zoom-0.051
 Paul-0.050
 Brown-0.050
Token wings
Token ID12098
Feature activation-5.88e-02

Pos logit contributions
 narrative+0.071
 often+0.068
 forward+0.065
 end+0.064
ying+0.061
Neg logit contributions
etti-0.044
ller-0.037
 Brown-0.034
sed-0.034
icol-0.034
Token and
Token ID290
Feature activation-2.72e-02

Pos logit contributions
 narrative+0.134
 moral+0.106
 End+0.104
wered+0.102
 brav+0.101
Neg logit contributions
etti-0.081
ller-0.062
 Brown-0.059
ding-0.055
MY-0.055
Token fly
Token ID6129
Feature activation-3.24e-02

Pos logit contributions
 narrative+0.044
 forward+0.038
 often+0.037
 end+0.036
ying+0.036
Neg logit contributions
etti-0.053
ller-0.045
 Brown-0.044
 Rob-0.043
icol-0.043
Token high
Token ID1029
Feature activation-1.49e-02

Pos logit contributions
 narrative+0.058
 end+0.050
 often+0.047
ying+0.046
orn+0.046
Neg logit contributions
etti-0.056
ller-0.046
 turns-0.045
 Zoom-0.045
 Jenn-0.045
Token in
Token ID287
Feature activation-1.66e-02

Pos logit contributions
 end+0.024
 narrative+0.024
 often+0.022
 forward+0.022
 brav+0.021
Neg logit contributions
etti-0.029
ller-0.024
icol-0.023
isions-0.023
 Rob-0.022
Token the
Token ID262
Feature activation-2.19e-02

Pos logit contributions
 often+0.032
 narrative+0.031
 forward+0.031
 end+0.029
ery+0.028
Neg logit contributions
etti-0.027
 Jenn-0.021
 conf-0.020
 Paul-0.020
isions-0.020
Token "
Token ID366
Feature activation-2.70e-02

Pos logit contributions
etti+0.001
isions+0.001
ller+0.001
icol+0.001
 Jenn+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 hop-0.001
 narrative-0.001
TokenCan
Token ID6090
Feature activation-1.42e-02

Pos logit contributions
 end+0.047
 forward+0.044
 narrative+0.042
 often+0.042
ery+0.041
Neg logit contributions
etti-0.046
ller-0.038
isions-0.038
 Jenn-0.037
llers-0.034
Token you
Token ID345
Feature activation-1.89e-02

Pos logit contributions
 forward+0.025
 often+0.025
 end+0.025
 narrative+0.024
ery+0.024
Neg logit contributions
etti-0.024
ller-0.020
 Jenn-0.020
isions-0.019
 Brown-0.018
Token stretch
Token ID7539
Feature activation-4.16e-02

Pos logit contributions
 narrative+0.033
 forward+0.031
ery+0.028
 often+0.028
iness+0.028
Neg logit contributions
etti-0.032
 Jenn-0.029
 Brown-0.027
 Paul-0.027
ller-0.026
Token your
Token ID534
Feature activation-3.26e-02

Pos logit contributions
 narrative+0.079
 often+0.068
 end+0.066
ery+0.065
 moral+0.063
Neg logit contributions
etti-0.066
 Brown-0.055
ller-0.055
 Grand-0.054
 Jenn-0.054
Token wings
Token ID12098
Feature activation-5.88e-02

Pos logit contributions
 narrative+0.065
 often+0.062
 forward+0.059
 end+0.058
ying+0.054
Neg logit contributions
etti-0.041
ller-0.038
 Brown-0.035
icol-0.034
llers-0.033
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
 narrative+0.115
 moral+0.088
 often+0.088
 End+0.087
 brav+0.086
Neg logit contributions
etti-0.087
ller-0.080
 Brown-0.071
ding-0.067
 Jenn-0.067
Token fly
Token ID6129
Feature activation-3.87e-02

Pos logit contributions
 narrative+0.037
 forward+0.034
 often+0.034
 end+0.033
ery+0.032
Neg logit contributions
etti-0.048
ller-0.042
icol-0.039
 Brown-0.039
 Jenn-0.039
Token with
Token ID351
Feature activation-2.84e-02

Pos logit contributions
 narrative+0.068
 end+0.058
 often+0.056
ying+0.054
iness+0.054
Neg logit contributions
etti-0.067
ller-0.059
 Brown-0.056
 Jenn-0.055
 Zoom-0.055
Token me
Token ID502
Feature activation-2.46e-02

Pos logit contributions
 narrative+0.047
 often+0.045
 forward+0.043
 end+0.042
ery+0.037
Neg logit contributions
etti-0.055
 Paul-0.045
 Brown-0.045
 Jenn-0.044
ller-0.044
Token?"
Token ID1701
Feature activation-2.99e-03

Pos logit contributions
 narrative+0.042
 end+0.036
 often+0.035
 forward+0.035
 brav+0.032
Neg logit contributions
etti-0.045
ller-0.042
 Brown-0.038
isions-0.037
 Grand-0.036
Token bird
Token ID6512
Feature activation-3.61e-02

Pos logit contributions
 often+0.015
 end+0.014
 forward+0.014
 narrative+0.013
 hop+0.013
Neg logit contributions
etti-0.018
isions-0.015
ller-0.015
 Jenn-0.015
icol-0.015
Token.
Token ID13
Feature activation+9.05e-03

Pos logit contributions
 narrative+0.075
 forward+0.064
wered+0.061
 end+0.060
 brav+0.058
Neg logit contributions
etti-0.051
ller-0.045
 Brown-0.043
 Jenn-0.040
 Johnson-0.039
Token Just
Token ID2329
Feature activation-3.44e-02

Pos logit contributions
etti+0.016
isions+0.012
ller+0.012
 Jenn+0.012
 Paul+0.012
Neg logit contributions
 end-0.017
 often-0.017
 narrative-0.017
 forward-0.016
ery-0.015
Token flap
Token ID37699
Feature activation-5.17e-02

Pos logit contributions
 narrative+0.067
 forward+0.058
ying+0.055
 often+0.055
iness+0.054
Neg logit contributions
etti-0.050
 Brown-0.046
 Grand-0.041
 Paul-0.040
 Jenn-0.040
Token your
Token ID534
Feature activation-3.45e-02

Pos logit contributions
 narrative+0.113
 end+0.099
orn+0.099
 often+0.098
iddle+0.096
Neg logit contributions
etti-0.052
 turns-0.051
 Grand-0.046
 flashing-0.046
ller-0.046
Token wings
Token ID12098
Feature activation-5.85e-02

Pos logit contributions
 often+0.068
 narrative+0.066
 end+0.063
 forward+0.063
ying+0.062
Neg logit contributions
etti-0.041
 turns-0.039
ller-0.038
 badges-0.036
 conf-0.036
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
 narrative+0.128
wered+0.098
orn+0.098
 moral+0.098
 End+0.097
Neg logit contributions
etti-0.074
ller-0.065
 Brown-0.063
 flashing-0.058
 Jenn-0.058
Token fly
Token ID6129
Feature activation-3.80e-02

Pos logit contributions
 narrative+0.038
 forward+0.036
 often+0.035
 end+0.034
ery+0.034
Neg logit contributions
etti-0.043
ller-0.038
 Brown-0.036
 turns-0.035
 Paul-0.035
Token to
Token ID284
Feature activation-2.67e-02

Pos logit contributions
 narrative+0.069
 end+0.055
ying+0.052
iness+0.052
 often+0.052
Neg logit contributions
etti-0.063
 turns-0.056
 Brown-0.056
 Zoom-0.056
 Jenn-0.055
Token the
Token ID262
Feature activation-1.35e-02

Pos logit contributions
 narrative+0.049
 forward+0.047
 often+0.045
iness+0.042
ery+0.042
Neg logit contributions
etti-0.043
 Jenn-0.036
ller-0.036
 Brown-0.036
 Grand-0.035
Token top
Token ID1353
Feature activation-1.82e-02

Pos logit contributions
 often+0.025
 forward+0.024
 narrative+0.023
ying+0.021
ery+0.021
Neg logit contributions
etti-0.022
ller-0.018
isions-0.018
 Smith-0.017
 Jenn-0.017
Token,
Token ID11
Feature activation+6.09e-03

Pos logit contributions
 often+0.013
 end+0.013
 narrative+0.013
 forward+0.013
 hop+0.011
Neg logit contributions
etti-0.018
 Jenn-0.015
ller-0.014
isions-0.014
 Paul-0.014
Token "
Token ID366
Feature activation-1.59e-02

Pos logit contributions
etti+0.010
ller+0.008
 Jenn+0.008
isions+0.008
icol+0.008
Neg logit contributions
 end-0.011
 often-0.011
 narrative-0.011
 forward-0.011
ery-0.010
TokenAlways
Token ID30374
Feature activation-2.69e-02

Pos logit contributions
 end+0.026
 often+0.026
 forward+0.025
 narrative+0.024
 hop+0.023
Neg logit contributions
etti-0.030
isions-0.024
 Jenn-0.024
ller-0.024
 Paul-0.023
Token hold
Token ID1745
Feature activation-2.33e-02

Pos logit contributions
 narrative+0.045
 forward+0.044
orn+0.041
ery+0.040
 often+0.039
Neg logit contributions
etti-0.046
 Paul-0.039
 Jenn-0.038
 John-0.038
 Rob-0.038
Token your
Token ID534
Feature activation-8.63e-03

Pos logit contributions
 often+0.040
 narrative+0.039
 end+0.038
 forward+0.036
 hop+0.034
Neg logit contributions
etti-0.042
 Jenn-0.035
 Brown-0.033
 Paul-0.033
isions-0.033
Token spoon
Token ID24556
Feature activation-5.84e-02

Pos logit contributions
 often+0.014
 end+0.014
 forward+0.014
 narrative+0.013
 hop+0.012
Neg logit contributions
etti-0.017
isions-0.014
ller-0.014
 Jenn-0.013
icol-0.013
Token tight
Token ID5381
Feature activation-2.86e-02

Pos logit contributions
 narrative+0.104
 end+0.086
wered+0.084
 moral+0.081
 often+0.080
Neg logit contributions
etti-0.106
ller-0.089
 Brown-0.087
isions-0.087
 Paul-0.084
Token,
Token ID11
Feature activation-2.16e-03

Pos logit contributions
 narrative+0.046
 end+0.044
 often+0.041
 hop+0.040
 forward+0.038
Neg logit contributions
etti-0.055
ller-0.046
isions-0.046
 Brown-0.045
 Johnson-0.044
Token so
Token ID523
Feature activation-1.23e-02

Pos logit contributions
 narrative+0.003
 forward+0.003
 end+0.003
 often+0.003
ery+0.003
Neg logit contributions
etti-0.004
 Jenn-0.004
 Paul-0.003
isions-0.003
ller-0.003
Token you
Token ID345
Feature activation-5.72e-03

Pos logit contributions
 forward+0.021
 narrative+0.021
 end+0.020
 often+0.020
ery+0.019
Neg logit contributions
etti-0.021
 Jenn-0.018
ller-0.018
 Paul-0.017
 Brown-0.017
Token won
Token ID1839
Feature activation-3.52e-03

Pos logit contributions
 narrative+0.008
 forward+0.007
 often+0.007
 end+0.007
ying+0.006
Neg logit contributions
etti-0.012
 Brown-0.010
isions-0.010
ller-0.010
 Paul-0.010
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
etti+0.007
ller+0.006
isions+0.006
 Jenn+0.006
icol+0.006
Neg logit contributions
 often-0.004
 end-0.004
 narrative-0.004
 forward-0.004
 brav-0.003
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
 often+0.016
 end+0.015
 forward+0.015
 narrative+0.014
 hop+0.013
Neg logit contributions
etti-0.024
isions-0.020
ller-0.020
icol-0.019
 Jenn-0.019
Token
Token ID198
Feature activation-4.14e-03

Pos logit contributions
 often+0.017
 end+0.016
 forward+0.016
 narrative+0.015
 hop+0.014
Neg logit contributions
etti-0.021
isions-0.017
ller-0.017
 Jenn-0.017
icol-0.016
TokenSo
Token ID2396
Feature activation-8.64e-03

Pos logit contributions
 often+0.007
 end+0.007
 narrative+0.007
 forward+0.006
ery+0.006
Neg logit contributions
etti-0.008
 Jenn-0.007
isions-0.006
ller-0.006
 Paul-0.006
Token J
Token ID449
Feature activation-1.59e-02

Pos logit contributions
 often+0.013
 end+0.012
 forward+0.012
 narrative+0.012
 hop+0.011
Neg logit contributions
etti-0.018
 Jenn-0.015
isions-0.015
ller-0.015
icol-0.015
Tokenod
Token ID375
Feature activation-5.84e-02

Pos logit contributions
 often+0.032
 end+0.032
 forward+0.031
 narrative+0.030
 hop+0.028
Neg logit contributions
etti-0.025
isions-0.019
ller-0.019
 Jenn-0.019
icol-0.018
Tokenie
Token ID494
Feature activation-2.57e-02

Pos logit contributions
 forward+0.099
 brav+0.082
 hop+0.081
 narrative+0.080
 often+0.080
Neg logit contributions
etti-0.094
ller-0.085
yl-0.083
 Jenn-0.081
edd-0.079
Token's
Token ID338
Feature activation+7.14e-03

Pos logit contributions
 narrative+0.040
 end+0.040
 forward+0.038
 often+0.037
 brav+0.034
Neg logit contributions
etti-0.050
isions-0.042
ller-0.042
 Jenn-0.041
icol-0.041
Token mom
Token ID1995
Feature activation+8.24e-03

Pos logit contributions
etti+0.016
ller+0.013
icol+0.013
isions+0.012
MY+0.012
Neg logit contributions
 end-0.012
 often-0.011
 forward-0.010
 narrative-0.010
 hop-0.010
Token bought
Token ID5839
Feature activation-2.50e-03

Pos logit contributions
etti+0.017
isions+0.013
ller+0.013
 Jenn+0.013
icol+0.013
Neg logit contributions
 often-0.013
 end-0.013
 forward-0.013
 narrative-0.012
 hop-0.011
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
 end+0.003
 often+0.003
 forward+0.003
 hop+0.002
 narrative+0.002
Neg logit contributions
etti-0.006
ller-0.005
isions-0.005
icol-0.005
itor-0.005
Token of
Token ID286
Feature activation+4.60e-03

Pos logit contributions
 narrative+0.013
 forward+0.012
 often+0.012
 end+0.012
 brav+0.011
Neg logit contributions
etti-0.015
ller-0.012
 Jenn-0.012
 Paul-0.012
isions-0.012
Token giving
Token ID3501
Feature activation-1.02e-02

Pos logit contributions
etti+0.008
ller+0.006
 Jenn+0.006
isions+0.006
 Paul+0.006
Neg logit contributions
 forward-0.008
 narrative-0.008
 often-0.008
 end-0.008
 brav-0.007
Token in
Token ID287
Feature activation-1.00e-02

Pos logit contributions
 often+0.016
 end+0.015
 narrative+0.015
 forward+0.014
 hop+0.013
Neg logit contributions
etti-0.021
isions-0.017
ller-0.017
 Jenn-0.017
icol-0.017
Token to
Token ID284
Feature activation-1.07e-02

Pos logit contributions
 narrative+0.017
 often+0.016
 forward+0.016
 end+0.016
 brav+0.014
Neg logit contributions
etti-0.018
 Paul-0.015
isions-0.015
 Jenn-0.015
 Brown-0.014
Token his
Token ID465
Feature activation-1.59e-02

Pos logit contributions
 narrative+0.017
 forward+0.016
 often+0.016
 end+0.016
ery+0.014
Neg logit contributions
etti-0.021
 Jenn-0.018
 Paul-0.017
ller-0.017
isions-0.017
Token greed
Token ID25474
Feature activation-5.78e-02

Pos logit contributions
 often+0.025
 forward+0.024
 end+0.024
 narrative+0.024
 hop+0.021
Neg logit contributions
etti-0.030
isions-0.025
ller-0.025
icol-0.025
 Jenn-0.025
Token.
Token ID13
Feature activation-5.39e-04

Pos logit contributions
 narrative+0.096
 often+0.094
wered+0.094
 hop+0.088
 brav+0.087
Neg logit contributions
etti-0.064
 turns-0.056
 Tommy-0.056
 license-0.056
 John-0.054
Token<|endoftext|>
Token ID50256
Feature activation+7.59e-03

Pos logit contributions
 often+0.001
 end+0.001
 forward+0.001
 narrative+0.001
 hop+0.001
Neg logit contributions
etti-0.001
isions-0.001
ller-0.001
icol-0.001
 Jenn-0.001
TokenOnce
Token ID7454
Feature activation-2.67e-03

Pos logit contributions
etti+0.015
isions+0.012
ller+0.012
icol+0.012
 Jenn+0.012
Neg logit contributions
 often-0.013
 end-0.012
 forward-0.012
 narrative-0.012
 hop-0.011
Token upon
Token ID2402
Feature activation-8.98e-03

Pos logit contributions
 often+0.004
 narrative+0.004
 end+0.004
 forward+0.004
 hop+0.004
Neg logit contributions
etti-0.005
 Jenn-0.004
ller-0.004
 Paul-0.004
isions-0.004
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
 often+0.015
 end+0.014
 forward+0.014
 narrative+0.013
 hop+0.012
Neg logit contributions
etti-0.018
isions-0.015
ller-0.015
 Jenn-0.014
icol-0.014
Token knot
Token ID29654
Feature activation-3.57e-02

Pos logit contributions
etti+0.007
ller+0.006
icol+0.006
 Jenn+0.006
isions+0.006
Neg logit contributions
 often-0.006
 forward-0.006
 narrative-0.006
 end-0.006
ery-0.005
Token in
Token ID287
Feature activation-8.07e-03

Pos logit contributions
 narrative+0.070
 end+0.057
 moral+0.056
 often+0.053
 brav+0.053
Neg logit contributions
etti-0.061
 flashing-0.049
ller-0.048
 Paul-0.045
 Jenn-0.045
Token his
Token ID465
Feature activation-2.06e-02

Pos logit contributions
 often+0.012
 narrative+0.011
 forward+0.011
 end+0.011
 hop+0.010
Neg logit contributions
etti-0.017
 Jenn-0.015
 Paul-0.014
ller-0.014
isions-0.014
Token sho
Token ID7354
Feature activation-2.77e-02

Pos logit contributions
 often+0.035
 end+0.034
 narrative+0.034
 forward+0.034
 hop+0.030
Neg logit contributions
etti-0.038
isions-0.031
ller-0.031
 Jenn-0.030
icol-0.030
Tokenel
Token ID417
Feature activation-3.82e-02

Pos logit contributions
 often+0.053
 end+0.052
 narrative+0.051
 forward+0.050
 hop+0.047
Neg logit contributions
etti-0.048
 Jenn-0.038
ller-0.038
 Paul-0.036
isions-0.035
Tokenaces
Token ID2114
Feature activation-5.77e-02

Pos logit contributions
 narrative+0.062
 often+0.062
 forward+0.061
 end+0.057
 brav+0.054
Neg logit contributions
etti-0.069
ller-0.060
isions-0.058
 Jenn-0.057
 badges-0.054
Token and
Token ID290
Feature activation-7.85e-03

Pos logit contributions
 narrative+0.123
 moral+0.105
 end+0.101
 brav+0.101
 hop+0.096
Neg logit contributions
etti-0.088
ller-0.068
ding-0.068
 Jenn-0.066
 flashing-0.065
Token he
Token ID339
Feature activation-7.87e-03

Pos logit contributions
 narrative+0.013
 forward+0.013
 end+0.013
 often+0.012
ery+0.011
Neg logit contributions
etti-0.015
ller-0.013
 Jenn-0.013
 Brown-0.012
 Paul-0.012
Token couldn
Token ID3521
Feature activation+9.58e-03

Pos logit contributions
 narrative+0.014
 forward+0.013
 end+0.013
ying+0.012
 often+0.011
Neg logit contributions
etti-0.013
ller-0.012
 Jenn-0.011
isions-0.011
 Paul-0.011
Token't
Token ID470
Feature activation-7.49e-03

Pos logit contributions
etti+0.018
ller+0.014
 Jenn+0.014
isions+0.014
 Paul+0.013
Neg logit contributions
 often-0.017
 end-0.016
 forward-0.016
 narrative-0.016
 hop-0.014
Token get
Token ID651
Feature activation-2.63e-03

Pos logit contributions
 narrative+0.012
 forward+0.011
 often+0.011
 end+0.010
 brav+0.010
Neg logit contributions
etti-0.015
 Jenn-0.012
ller-0.012
 Brown-0.011
 Paul-0.011
Token it
Token ID340
Feature activation-1.17e-02

Pos logit contributions
 often+0.017
 end+0.017
 narrative+0.016
 forward+0.016
 hop+0.015
Neg logit contributions
etti-0.021
 Jenn-0.016
isions-0.016
ller-0.016
icol-0.016
Token away
Token ID1497
Feature activation-7.37e-04

Pos logit contributions
 narrative+0.023
 end+0.020
 often+0.019
 brav+0.019
 moral+0.018
Neg logit contributions
etti-0.020
 Brown-0.016
 Jenn-0.015
isions-0.015
icol-0.014
Token in
Token ID287
Feature activation-4.94e-03

Pos logit contributions
 narrative+0.001
 end+0.001
 often+0.001
 forward+0.001
 brav+0.001
Neg logit contributions
etti-0.001
 Jenn-0.001
isions-0.001
ller-0.001
 Brown-0.001
Token a
Token ID257
Feature activation-1.64e-02

Pos logit contributions
 often+0.008
 narrative+0.008
 forward+0.008
 end+0.008
 brav+0.007
Neg logit contributions
etti-0.009
 Jenn-0.007
 Paul-0.007
 Brown-0.007
isions-0.007
Token wheel
Token ID7825
Feature activation-2.35e-02

Pos logit contributions
 narrative+0.025
 often+0.024
 forward+0.023
 end+0.023
ying+0.021
Neg logit contributions
etti-0.033
isions-0.027
icol-0.026
ller-0.026
 Jenn-0.026
Tokenbar
Token ID5657
Feature activation-5.77e-02

Pos logit contributions
 narrative+0.037
 often+0.035
 end+0.034
 forward+0.033
 brav+0.031
Neg logit contributions
etti-0.049
 Jenn-0.040
isions-0.040
ller-0.040
 Paul-0.038
Tokenrow
Token ID808
Feature activation-1.85e-02

Pos logit contributions
 often+0.078
 narrative+0.075
 forward+0.073
 brav+0.066
 end+0.063
Neg logit contributions
etti-0.127
drivers-0.109
ller-0.105
 extingu-0.104
uc-0.103
Token.
Token ID13
Feature activation-2.40e-03

Pos logit contributions
 narrative+0.036
 end+0.032
 often+0.031
 moral+0.031
 hop+0.030
Neg logit contributions
etti-0.032
 Jenn-0.025
 Johnson-0.024
ller-0.024
isions-0.024
Token
Token ID198
Feature activation-5.88e-03

Pos logit contributions
 often+0.004
 end+0.004
 forward+0.003
 narrative+0.003
 hop+0.003
Neg logit contributions
etti-0.005
isions-0.004
 Jenn-0.004
ller-0.004
icol-0.004
Token
Token ID198
Feature activation-3.06e-03

Pos logit contributions
 end+0.010
 narrative+0.010
 often+0.010
 forward+0.010
 brav+0.009
Neg logit contributions
etti-0.010
 Jenn-0.008
ller-0.008
isions-0.008
 Paul-0.008
TokenOnce
Token ID7454
Feature activation+1.26e-02

Pos logit contributions
 forward+0.005
 narrative+0.005
 end+0.005
iness+0.005
 often+0.005
Neg logit contributions
etti-0.005
isions-0.005
ller-0.004
 Jenn-0.004
 coaster-0.004
Token easier
Token ID4577
Feature activation-1.55e-02

Pos logit contributions
 narrative+0.017
 end+0.016
 often+0.016
 forward+0.015
orn+0.013
Neg logit contributions
etti-0.022
ller-0.018
 Jenn-0.017
isions-0.017
 Paul-0.017
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
 narrative+0.027
 end+0.025
 often+0.024
 forward+0.023
 brav+0.023
Neg logit contributions
etti-0.027
isions-0.022
ller-0.022
 Paul-0.022
 Jenn-0.021
Token carry
Token ID3283
Feature activation-2.76e-02

Pos logit contributions
 forward+0.025
 narrative+0.025
 often+0.022
 brav+0.021
 end+0.021
Neg logit contributions
etti-0.029
ller-0.022
 Jenn-0.022
 Rob-0.022
 Paul-0.022
Token the
Token ID262
Feature activation-1.91e-02

Pos logit contributions
 narrative+0.049
 often+0.047
 end+0.045
 forward+0.042
 brav+0.042
Neg logit contributions
etti-0.047
 Paul-0.037
isions-0.037
 Rob-0.036
 Brown-0.036
Token jelly
Token ID27644
Feature activation-5.47e-02

Pos logit contributions
 often+0.040
 forward+0.037
 narrative+0.036
py+0.033
 end+0.033
Neg logit contributions
etti-0.028
 conf-0.021
 sprink-0.021
 Pole-0.021
isions-0.021
Tokenfish
Token ID11084
Feature activation-5.77e-02

Pos logit contributions
 narrative+0.099
 often+0.087
wered+0.085
 end+0.085
 brav+0.085
Neg logit contributions
etti-0.094
isions-0.069
MY-0.066
ller-0.065
 Pole-0.065
Token!
Token ID0
Feature activation-1.35e-03

Pos logit contributions
 narrative+0.106
 often+0.091
 end+0.089
 brav+0.088
wered+0.087
Neg logit contributions
etti-0.098
isions-0.076
ller-0.071
 Smith-0.070
 Brown-0.068
Tokenâ
Token ID22940
Feature activation+3.50e-03

Pos logit contributions
 often+0.002
 end+0.002
 narrative+0.002
 forward+0.002
ery+0.002
Neg logit contributions
etti-0.003
isions-0.002
 Jenn-0.002
ller-0.002
icol-0.002
Token€
Token ID26391
Feature activation+1.56e-02

Pos logit contributions
etti+0.006
ller+0.005
 Jenn+0.005
isions+0.005
icol+0.005
Neg logit contributions
 often-0.006
 end-0.006
 forward-0.006
 narrative-0.006
 hop-0.005
Token
Token ID198
Feature activation-7.99e-03

Pos logit contributions
etti+0.032
isions+0.026
ller+0.026
 Jenn+0.025
itor+0.025
Neg logit contributions
 often-0.024
 forward-0.023
 end-0.022
 hop-0.021
 narrative-0.020
Token
Token ID198
Feature activation-2.38e-03

Pos logit contributions
 often+0.015
 end+0.015
 narrative+0.015
 forward+0.015
 brav+0.014
Neg logit contributions
etti-0.013
 Jenn-0.011
isions-0.010
 Paul-0.010
 Mike-0.010
Token He
Token ID679
Feature activation-3.22e-03

Pos logit contributions
 often+0.006
 end+0.006
 forward+0.006
 narrative+0.006
 hop+0.005
Neg logit contributions
etti-0.008
isions-0.006
ller-0.006
 Jenn-0.006
icol-0.006
Token jumped
Token ID11687
Feature activation-8.28e-03

Pos logit contributions
 end+0.005
 forward+0.005
 often+0.005
 narrative+0.005
 brav+0.004
Neg logit contributions
etti-0.006
ller-0.005
 Jenn-0.005
isions-0.005
icol-0.005
Token onto
Token ID4291
Feature activation+5.67e-03

Pos logit contributions
 narrative+0.012
 often+0.012
 end+0.012
 brav+0.010
ery+0.010
Neg logit contributions
etti-0.018
 Jenn-0.014
icol-0.014
ller-0.014
 Brown-0.014
Token the
Token ID262
Feature activation-2.14e-02

Pos logit contributions
etti+0.012
isions+0.011
ller+0.010
icol+0.010
 Jenn+0.010
Neg logit contributions
 often-0.008
 end-0.008
 forward-0.008
 hop-0.007
 narrative-0.007
Token wheel
Token ID7825
Feature activation-2.12e-02

Pos logit contributions
 often+0.033
 narrative+0.032
 forward+0.030
 end+0.028
aying+0.026
Neg logit contributions
etti-0.043
ller-0.037
icol-0.036
isions-0.036
 Pole-0.035
Tokenbar
Token ID5657
Feature activation-5.75e-02

Pos logit contributions
 narrative+0.035
 often+0.031
 end+0.030
 brav+0.029
 forward+0.029
Neg logit contributions
etti-0.044
 Jenn-0.036
ller-0.036
 Paul-0.035
isions-0.034
Tokenrow
Token ID808
Feature activation-1.43e-02

Pos logit contributions
 often+0.083
 narrative+0.083
 forward+0.073
 brav+0.070
 hop+0.065
Neg logit contributions
etti-0.116
drivers-0.104
ller-0.099
 extingu-0.098
 Jenn-0.098
Token and
Token ID290
Feature activation+7.85e-03

Pos logit contributions
 narrative+0.026
 end+0.023
 often+0.022
 brav+0.022
 moral+0.022
Neg logit contributions
etti-0.026
 Jenn-0.022
 Johnson-0.021
ller-0.021
 Brown-0.020
Token started
Token ID2067
Feature activation+1.06e-02

Pos logit contributions
etti+0.017
isions+0.014
ller+0.013
 Jenn+0.013
icol+0.013
Neg logit contributions
 often-0.012
 end-0.012
 forward-0.011
 hop-0.011
 narrative-0.010
Token rolling
Token ID10708
Feature activation-1.14e-02

Pos logit contributions
etti+0.023
isions+0.019
ller+0.018
itor+0.018
icol+0.018
Neg logit contributions
 often-0.017
 forward-0.015
 end-0.015
 hop-0.015
 always-0.013
Token around
Token ID1088
Feature activation+2.69e-03

Pos logit contributions
 often+0.015
 end+0.015
 narrative+0.014
 forward+0.014
 hop+0.012
Neg logit contributions
etti-0.026
 Jenn-0.021
isions-0.021
ller-0.021
icol-0.021
Token would
Token ID561
Feature activation-2.20e-02

Pos logit contributions
 forward+0.022
 narrative+0.020
 end+0.019
ying+0.017
orn+0.016
Neg logit contributions
etti-0.038
ller-0.033
 Brown-0.033
 Rob-0.033
 Jenn-0.032
Token cut
Token ID2005
Feature activation-2.31e-02

Pos logit contributions
 narrative+0.042
 forward+0.038
ying+0.038
iness+0.036
orn+0.036
Neg logit contributions
etti-0.035
 Brown-0.030
 sprink-0.030
ller-0.029
 Rob-0.029
Token it
Token ID340
Feature activation-1.61e-02

Pos logit contributions
 often+0.034
 end+0.033
 narrative+0.032
 forward+0.031
 hop+0.028
Neg logit contributions
etti-0.049
ller-0.041
 Jenn-0.041
isions-0.040
icol-0.039
Token into
Token ID656
Feature activation-9.41e-03

Pos logit contributions
 narrative+0.032
 brav+0.027
 end+0.027
 forward+0.025
 End+0.025
Neg logit contributions
etti-0.027
 Brown-0.022
ller-0.022
isions-0.020
 extingu-0.020
Token small
Token ID1402
Feature activation+4.35e-03

Pos logit contributions
 often+0.015
 forward+0.015
 end+0.015
 narrative+0.014
 hop+0.013
Neg logit contributions
etti-0.018
isions-0.015
 Jenn-0.015
ller-0.015
 Paul-0.014
Token squares
Token ID24438
Feature activation-5.75e-02

Pos logit contributions
etti+0.008
ller+0.006
 Jenn+0.006
isions+0.006
icol+0.006
Neg logit contributions
 often-0.008
 end-0.008
 forward-0.007
 narrative-0.007
 hop-0.007
Token and
Token ID290
Feature activation-9.21e-03

Pos logit contributions
 narrative+0.126
 brav+0.111
 moral+0.104
 forward+0.103
 end+0.102
Neg logit contributions
etti-0.087
 sprink-0.063
isions-0.060
 extingu-0.059
 Brown-0.058
Token put
Token ID1234
Feature activation-1.87e-02

Pos logit contributions
 narrative+0.014
 forward+0.014
 end+0.014
 often+0.013
orn+0.012
Neg logit contributions
etti-0.018
ller-0.015
 Jenn-0.015
 Brown-0.015
isions-0.014
Token it
Token ID340
Feature activation-1.23e-02

Pos logit contributions
 often+0.031
 end+0.030
 narrative+0.030
 forward+0.029
 hop+0.027
Neg logit contributions
etti-0.036
 Jenn-0.028
ller-0.028
isions-0.028
 conf-0.028
Token on
Token ID319
Feature activation-2.83e-03

Pos logit contributions
 narrative+0.023
 end+0.021
 often+0.019
 moral+0.019
 meantime+0.018
Neg logit contributions
etti-0.021
 Brown-0.017
 sprink-0.016
 conf-0.016
 flashing-0.015
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
 narrative+0.005
 forward+0.004
 often+0.004
 end+0.004
 brav+0.004
Neg logit contributions
etti-0.006
ller-0.005
 Rob-0.005
 Brown-0.004
isions-0.004
TokenI
Token ID40
Feature activation-1.46e-02

Pos logit contributions
 end+0.008
 often+0.007
 forward+0.007
 narrative+0.007
ery+0.006
Neg logit contributions
etti-0.010
isions-0.008
 Jenn-0.008
ller-0.008
 Paul-0.007
Token like
Token ID588
Feature activation-1.10e-02

Pos logit contributions
 end+0.024
 forward+0.024
 narrative+0.024
 often+0.024
 brav+0.021
Neg logit contributions
etti-0.026
 Jenn-0.021
isions-0.021
 Paul-0.020
icol-0.020
Token swimming
Token ID14899
Feature activation-3.66e-02

Pos logit contributions
 narrative+0.023
 end+0.022
 forward+0.021
 often+0.021
ery+0.020
Neg logit contributions
etti-0.016
ller-0.013
 Brown-0.013
 Rob-0.013
 Paul-0.013
Token in
Token ID287
Feature activation-1.78e-02

Pos logit contributions
 narrative+0.061
 end+0.054
 often+0.053
 forward+0.048
orn+0.047
Neg logit contributions
etti-0.065
isions-0.053
 Mike-0.053
 Paul-0.052
 conf-0.052
Token the
Token ID262
Feature activation-2.67e-02

Pos logit contributions
 narrative+0.034
 often+0.034
 forward+0.033
 end+0.032
ery+0.032
Neg logit contributions
etti-0.027
 Jenn-0.021
 Paul-0.021
 conf-0.021
isions-0.021
Token shallow
Token ID19337
Feature activation-5.71e-02

Pos logit contributions
 often+0.049
ying+0.044
 narrative+0.043
 forward+0.043
iness+0.041
Neg logit contributions
etti-0.038
 conf-0.036
 sprink-0.035
 Fer-0.035
icol-0.034
Token water
Token ID1660
Feature activation-4.98e-02

Pos logit contributions
 narrative+0.104
 often+0.094
 forward+0.091
 brav+0.089
onel+0.086
Neg logit contributions
etti-0.090
icol-0.070
 Pole-0.069
isions-0.068
 Sus-0.067
Token near
Token ID1474
Feature activation-2.57e-02

Pos logit contributions
 narrative+0.099
 brav+0.083
 often+0.082
 end+0.082
 moral+0.081
Neg logit contributions
etti-0.079
isions-0.060
 Pole-0.058
 Mike-0.058
 Smith-0.057
Token the
Token ID262
Feature activation-2.72e-02

Pos logit contributions
 narrative+0.049
 often+0.048
 forward+0.047
 end+0.044
 brav+0.043
Neg logit contributions
etti-0.041
isions-0.032
 Jenn-0.032
 Grand-0.032
 Paul-0.032
Token rocks
Token ID12586
Feature activation-5.01e-02

Pos logit contributions
 often+0.045
 narrative+0.041
 forward+0.041
ying+0.038
ery+0.037
Neg logit contributions
etti-0.051
 Pole-0.042
isions-0.041
 conf-0.041
icol-0.041
Token.
Token ID13
Feature activation+1.15e-03

Pos logit contributions
 narrative+0.098
 end+0.085
 brav+0.082
 moral+0.081
 often+0.081
Neg logit contributions
etti-0.083
isions-0.062
ller-0.062
 Pole-0.060
 Mike-0.058
Token to
Token ID284
Feature activation-2.60e-02

Pos logit contributions
 narrative+0.019
 forward+0.018
 often+0.018
 end+0.017
 brav+0.016
Neg logit contributions
etti-0.012
 sprink-0.009
isions-0.009
 Grand-0.009
 Paul-0.009
Token cut
Token ID2005
Feature activation-2.14e-02

Pos logit contributions
 forward+0.048
 narrative+0.046
 brav+0.041
ying+0.041
 often+0.039
Neg logit contributions
etti-0.047
 Jenn-0.035
 Brown-0.035
 sprink-0.035
 Rob-0.035
Token it
Token ID340
Feature activation-1.68e-02

Pos logit contributions
 often+0.038
 narrative+0.037
 end+0.035
 forward+0.035
 hop+0.032
Neg logit contributions
etti-0.039
ller-0.032
isions-0.031
 Jenn-0.030
 Brown-0.030
Token into
Token ID656
Feature activation-1.04e-02

Pos logit contributions
 narrative+0.030
 end+0.025
 often+0.025
 brav+0.025
 forward+0.025
Neg logit contributions
etti-0.032
ller-0.026
isions-0.026
 Brown-0.025
MY-0.025
Token small
Token ID1402
Feature activation+2.07e-03

Pos logit contributions
 often+0.018
 forward+0.018
 narrative+0.018
 end+0.017
 brav+0.016
Neg logit contributions
etti-0.019
isions-0.015
ller-0.015
 Jenn-0.015
 sprink-0.014
Token pieces
Token ID5207
Feature activation-5.71e-02

Pos logit contributions
etti+0.004
isions+0.003
ller+0.003
 Jenn+0.003
icol+0.003
Neg logit contributions
 often-0.004
 end-0.004
 forward-0.004
 narrative-0.004
 brav-0.003
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
 narrative+0.112
 brav+0.100
 end+0.098
wered+0.095
 often+0.094
Neg logit contributions
etti-0.092
isions-0.070
ller-0.068
 John-0.066
 James-0.066
Token so
Token ID523
Feature activation-1.88e-02

Pos logit contributions
 narrative+0.016
 end+0.015
 often+0.015
 forward+0.014
orn+0.014
Neg logit contributions
etti-0.020
isions-0.017
 Jenn-0.017
ller-0.017
 Grand-0.017
Token it
Token ID340
Feature activation-1.15e-02

Pos logit contributions
 narrative+0.035
 forward+0.033
 end+0.033
 often+0.032
ery+0.030
Neg logit contributions
etti-0.031
ller-0.026
 Grand-0.026
 Paul-0.025
 Jenn-0.025
Token is
Token ID318
Feature activation-7.56e-03

Pos logit contributions
 forward+0.016
 end+0.016
 narrative+0.016
 often+0.015
orn+0.013
Neg logit contributions
etti-0.024
ller-0.020
isions-0.020
 Brown-0.019
 turns-0.019
Token easy
Token ID2562
Feature activation-1.19e-02

Pos logit contributions
 narrative+0.011
 forward+0.010
 often+0.010
 end+0.010
orn+0.009
Neg logit contributions
etti-0.015
ller-0.013
 Jenn-0.013
 Brown-0.012
isions-0.012
Token the
Token ID262
Feature activation-4.08e-04

Pos logit contributions
etti+0.033
isions+0.027
ller+0.027
 Jenn+0.027
icol+0.026
Neg logit contributions
 often-0.025
 end-0.024
 forward-0.024
 narrative-0.023
 hop-0.021
Token park
Token ID3952
Feature activation+3.22e-05

Pos logit contributions
 often+0.001
 end+0.001
 forward+0.001
 narrative+0.001
 hop+0.001
Neg logit contributions
etti-0.001
isions-0.001
 Jenn-0.001
ller-0.001
icol-0.001
Token and
Token ID290
Feature activation+1.06e-03

Pos logit contributions
etti+0.000
 Jenn+0.000
isions+0.000
ller+0.000
 Paul+0.000
Neg logit contributions
 narrative-0.000
 end-0.000
 often-0.000
 forward-0.000
ery-0.000
Token soon
Token ID2582
Feature activation+1.02e-02

Pos logit contributions
etti+0.002
isions+0.002
ller+0.002
 Jenn+0.002
icol+0.002
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.001
 narrative-0.001
 hop-0.001
Token the
Token ID262
Feature activation+7.98e-03

Pos logit contributions
etti+0.021
isions+0.018
ller+0.018
icol+0.017
itor+0.017
Neg logit contributions
 often-0.016
 end-0.016
 forward-0.016
 narrative-0.014
 hop-0.014
Token sky
Token ID6766
Feature activation+3.58e-02

Pos logit contributions
etti+0.017
isions+0.014
ller+0.014
 Jenn+0.013
icol+0.013
Neg logit contributions
 end-0.012
 often-0.012
 forward-0.011
 narrative-0.011
 hop-0.010
Token grew
Token ID6348
Feature activation+2.34e-02

Pos logit contributions
etti+0.083
isions+0.072
icol+0.070
itor+0.070
ller+0.068
Neg logit contributions
 end-0.049
 often-0.049
 forward-0.043
 always-0.038
 hop-0.038
Token dark
Token ID3223
Feature activation+4.10e-02

Pos logit contributions
etti+0.051
isions+0.042
ller+0.042
 Jenn+0.042
icol+0.041
Neg logit contributions
 often-0.033
 end-0.032
 narrative-0.031
 forward-0.030
 hop-0.028
Token.
Token ID13
Feature activation+9.12e-03

Pos logit contributions
etti+0.105
isions+0.092
itor+0.090
ller+0.090
 Jenn+0.090
Neg logit contributions
 often-0.044
 end-0.044
 forward-0.039
 sometimes-0.034
 narrative-0.032
Token Big
Token ID4403
Feature activation+8.85e-03

Pos logit contributions
etti+0.018
isions+0.014
ller+0.014
 Jenn+0.014
icol+0.014
Neg logit contributions
 often-0.015
 end-0.015
 forward-0.015
 narrative-0.014
 hop-0.013
Token drops
Token ID10532
Feature activation+6.03e-03

Pos logit contributions
etti+0.015
isions+0.013
plug+0.012
 Jenn+0.012
 stret+0.012
Neg logit contributions
 end-0.015
 often-0.015
 forward-0.013
 brav-0.013
 always-0.013
Token where
Token ID810
Feature activation+3.76e-03

Pos logit contributions
etti+0.046
ller+0.039
isions+0.039
icol+0.037
llers+0.037
Neg logit contributions
 end-0.026
 often-0.024
 forward-0.023
 hop-0.021
 narrative-0.021
Token are
Token ID389
Feature activation+2.86e-02

Pos logit contributions
etti+0.009
isions+0.008
ller+0.008
 Jenn+0.008
icol+0.008
Neg logit contributions
 often-0.004
 end-0.004
 forward-0.004
 narrative-0.004
 hop-0.003
Token you
Token ID345
Feature activation+1.33e-02

Pos logit contributions
etti+0.075
icol+0.065
isions+0.065
ller+0.063
llers+0.059
Neg logit contributions
 end-0.035
 often-0.033
 sometimes-0.028
 forward-0.028
 brav-0.028
Token?"
Token ID1701
Feature activation+1.53e-02

Pos logit contributions
etti+0.031
ller+0.025
isions+0.025
 Jenn+0.025
 Paul+0.025
Neg logit contributions
 often-0.017
 narrative-0.016
 forward-0.016
 end-0.015
 brav-0.013
Token she
Token ID673
Feature activation+2.76e-02

Pos logit contributions
etti+0.035
isions+0.030
icol+0.027
ller+0.027
MY+0.027
Neg logit contributions
 often-0.024
 end-0.023
 forward-0.021
 hop-0.020
 sometimes-0.020
Token sob
Token ID27355
Feature activation+3.62e-02

Pos logit contributions
etti+0.060
isions+0.049
icol+0.048
ller+0.047
 Jenn+0.047
Neg logit contributions
 often-0.042
 end-0.041
 forward-0.038
 narrative-0.037
 hop-0.037
Tokenbed
Token ID3077
Feature activation+6.39e-03

Pos logit contributions
etti+0.072
isions+0.061
icol+0.059
 Jenn+0.058
 conf+0.058
Neg logit contributions
 often-0.059
 end-0.059
 forward-0.056
 hop-0.051
 narrative-0.050
Token.
Token ID13
Feature activation-5.91e-03

Pos logit contributions
etti+0.014
ller+0.011
 Jenn+0.011
isions+0.011
 Brown+0.011
Neg logit contributions
 often-0.009
 narrative-0.009
 end-0.009
 forward-0.008
 brav-0.008
Token
Token ID198
Feature activation+3.17e-03

Pos logit contributions
 often+0.010
 end+0.010
 forward+0.010
 narrative+0.010
 brav+0.009
Neg logit contributions
etti-0.011
 Jenn-0.009
ller-0.009
isions-0.009
icol-0.009
Token
Token ID198
Feature activation+9.59e-03

Pos logit contributions
etti+0.008
isions+0.007
icol+0.006
ller+0.006
itor+0.006
Neg logit contributions
 often-0.004
 end-0.004
 forward-0.004
 narrative-0.003
 hop-0.003
TokenBen
Token ID11696
Feature activation+8.98e-03

Pos logit contributions
etti+0.026
isions+0.022
icol+0.022
ller+0.022
 Jenn+0.021
Neg logit contributions
 end-0.010
 often-0.010
 forward-0.008
 narrative-0.008
 hop-0.008
Token Daisy
Token ID40355
Feature activation+4.51e-03

Pos logit contributions
 narrative+0.016
 often+0.016
 end+0.015
 forward+0.015
orn+0.015
Neg logit contributions
etti-0.011
 Jenn-0.009
 Sus-0.009
 John-0.009
 Paul-0.009
Token saw
Token ID2497
Feature activation-4.42e-03

Pos logit contributions
etti+0.009
isions+0.008
ller+0.008
 Jenn+0.008
icol+0.007
Neg logit contributions
 often-0.007
 end-0.007
 forward-0.006
 narrative-0.006
 hop-0.006
Token a
Token ID257
Feature activation+2.41e-02

Pos logit contributions
 often+0.007
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.009
isions-0.007
 Jenn-0.007
ller-0.007
icol-0.007
Token nice
Token ID3621
Feature activation+2.53e-02

Pos logit contributions
etti+0.061
isions+0.048
 Jenn+0.047
ller+0.046
llers+0.046
Neg logit contributions
 end-0.033
 often-0.031
 forward-0.029
 brav-0.025
 also-0.025
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
etti+0.057
isions+0.047
ller+0.045
llers+0.044
drivers+0.044
Neg logit contributions
 end-0.040
 often-0.038
 forward-0.036
 always-0.032
 hop-0.032
Token clean
Token ID3424
Feature activation+3.47e-02

Pos logit contributions
etti+0.025
isions+0.020
ller+0.020
 Jenn+0.020
icol+0.020
Neg logit contributions
 often-0.018
 end-0.017
 forward-0.016
 narrative-0.016
 hop-0.015
Token subway
Token ID19612
Feature activation+2.86e-02

Pos logit contributions
etti+0.061
isions+0.047
ller+0.046
 Jenn+0.046
icol+0.045
Neg logit contributions
 often-0.066
 end-0.065
 forward-0.063
 narrative-0.061
 hop-0.057
Token and
Token ID290
Feature activation+4.12e-03

Pos logit contributions
etti+0.065
isions+0.054
icol+0.053
ller+0.053
 Jenn+0.052
Neg logit contributions
 often-0.040
 end-0.039
 forward-0.038
 narrative-0.033
 hop-0.031
Token thought
Token ID1807
Feature activation-5.51e-03

Pos logit contributions
etti+0.009
isions+0.007
ller+0.007
 Jenn+0.007
icol+0.007
Neg logit contributions
 often-0.006
 end-0.006
 forward-0.006
 narrative-0.005
 hop-0.005
Token it
Token ID340
Feature activation+6.13e-03

Pos logit contributions
 narrative+0.010
 end+0.009
 often+0.009
 forward+0.009
orn+0.008
Neg logit contributions
etti-0.010
 Jenn-0.008
ller-0.008
 Paul-0.008
isions-0.008
Token looked
Token ID3114
Feature activation-2.96e-04

Pos logit contributions
etti+0.013
 Jenn+0.011
isions+0.011
ller+0.011
 Paul+0.010
Neg logit contributions
 end-0.009
 often-0.009
 forward-0.009
 narrative-0.009
 hop-0.008
Token on
Token ID319
Feature activation+1.23e-02

Pos logit contributions
 often+0.014
 end+0.013
 narrative+0.013
 forward+0.013
 hop+0.012
Neg logit contributions
etti-0.017
 Jenn-0.014
ller-0.014
isions-0.014
icol-0.013
Token dry
Token ID5894
Feature activation-4.53e-04

Pos logit contributions
etti+0.026
isions+0.021
ller+0.021
 Jenn+0.021
icol+0.021
Neg logit contributions
 often-0.019
 end-0.018
 forward-0.017
 narrative-0.017
 hop-0.016
Token and
Token ID290
Feature activation-4.00e-03

Pos logit contributions
 often+0.001
 end+0.001
 narrative+0.001
 forward+0.001
 brav+0.001
Neg logit contributions
etti-0.001
icol-0.001
ller-0.001
isions-0.001
 Jenn-0.001
Token cozy
Token ID37438
Feature activation+1.86e-02

Pos logit contributions
 often+0.006
 end+0.006
 forward+0.006
 narrative+0.006
 brav+0.005
Neg logit contributions
etti-0.008
icol-0.007
ller-0.007
isions-0.007
 Jenn-0.007
Token p
Token ID279
Feature activation+2.29e-02

Pos logit contributions
etti+0.034
isions+0.027
ller+0.027
 Jenn+0.027
icol+0.026
Neg logit contributions
 often-0.033
 end-0.032
 forward-0.032
 narrative-0.031
 hop-0.029
Tokenaj
Token ID1228
Feature activation+4.46e-02

Pos logit contributions
etti+0.036
isions+0.028
ller+0.028
 Jenn+0.027
 Paul+0.027
Neg logit contributions
 often-0.047
 forward-0.044
 end-0.044
 narrative-0.041
 hop-0.040
Tokenamas
Token ID17485
Feature activation+2.63e-02

Pos logit contributions
etti+0.061
 Jenn+0.050
itor+0.049
isions+0.048
plug+0.047
Neg logit contributions
 often-0.091
 end-0.084
 forward-0.082
ery-0.081
 always-0.080
Token and
Token ID290
Feature activation+7.60e-03

Pos logit contributions
etti+0.058
ller+0.047
 Jenn+0.047
isions+0.047
icol+0.046
Neg logit contributions
 narrative-0.036
 often-0.036
 end-0.036
 forward-0.034
 brav-0.032
Token sn
Token ID3013
Feature activation+6.14e-03

Pos logit contributions
etti+0.015
 Jenn+0.012
icol+0.012
ller+0.012
isions+0.012
Neg logit contributions
 often-0.012
 end-0.012
 forward-0.012
 narrative-0.012
 brav-0.011
Tokenugg
Token ID6837
Feature activation+1.24e-02

Pos logit contributions
etti+0.010
ller+0.007
icol+0.007
isions+0.007
 Jenn+0.007
Neg logit contributions
 often-0.012
 end-0.012
 narrative-0.012
 forward-0.012
 hop-0.011
Tokenled
Token ID992
Feature activation-4.26e-03

Pos logit contributions
etti+0.022
ller+0.018
isions+0.017
icol+0.017
 Jenn+0.017
Neg logit contributions
 often-0.022
 end-0.022
 narrative-0.021
 forward-0.021
 hop-0.020
Token sun
Token ID4252
Feature activation+4.12e-03

Pos logit contributions
 often+0.050
 narrative+0.045
iness+0.045
ying+0.044
 forward+0.043
Neg logit contributions
etti-0.042
icol-0.037
 sprink-0.034
 conf-0.033
 Pole-0.033
Token.
Token ID13
Feature activation+4.18e-03

Pos logit contributions
etti+0.009
ller+0.007
isions+0.007
 Jenn+0.007
 Brown+0.007
Neg logit contributions
 narrative-0.006
 often-0.006
 end-0.006
 forward-0.005
 brav-0.005
Token
Token ID198
Feature activation-1.60e-03

Pos logit contributions
etti+0.009
isions+0.007
ller+0.007
icol+0.007
 Jenn+0.007
Neg logit contributions
 often-0.006
 end-0.006
 forward-0.006
 narrative-0.006
 hop-0.005
Token
Token ID198
Feature activation+5.95e-03

Pos logit contributions
 often+0.003
 end+0.003
 forward+0.003
 narrative+0.002
 brav+0.002
Neg logit contributions
etti-0.003
ller-0.003
 Jenn-0.003
isions-0.003
 Paul-0.002
TokenAs
Token ID1722
Feature activation+3.73e-03

Pos logit contributions
etti+0.011
isions+0.010
ller+0.009
 Jenn+0.009
llers+0.008
Neg logit contributions
 forward-0.010
 narrative-0.009
 brav-0.009
 end-0.009
 often-0.009
Token days
Token ID1528
Feature activation+3.47e-02

Pos logit contributions
etti+0.007
isions+0.007
icol+0.006
itor+0.006
ller+0.006
Neg logit contributions
 often-0.007
 end-0.007
 forward-0.006
 always-0.006
 sometimes-0.006
Token passed
Token ID3804
Feature activation+6.36e-04

Pos logit contributions
etti+0.086
isions+0.077
ller+0.077
itor+0.074
icol+0.073
Neg logit contributions
 end-0.044
 often-0.043
 forward-0.038
 sometimes-0.034
 narrative-0.034
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
etti+0.002
isions+0.001
ller+0.001
icol+0.001
 Jenn+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 hop-0.001
Token the
Token ID262
Feature activation-2.05e-02

Pos logit contributions
etti+0.019
isions+0.016
icol+0.016
itor+0.015
ller+0.014
Neg logit contributions
 often-0.023
 end-0.022
 sometimes-0.021
 forward-0.020
 all-0.020
Token plant
Token ID4618
Feature activation-1.97e-02

Pos logit contributions
 often+0.029
 forward+0.028
 narrative+0.026
 end+0.024
 hop+0.023
Neg logit contributions
etti-0.044
isions-0.037
ller-0.036
 Jenn-0.036
icol-0.036
Token started
Token ID2067
Feature activation+6.11e-03

Pos logit contributions
 forward+0.039
 narrative+0.038
 brav+0.033
 end+0.033
 often+0.032
Neg logit contributions
etti-0.027
ller-0.024
llers-0.022
 sprink-0.021
 Pole-0.020
Token it
Token ID340
Feature activation+1.90e-03

Pos logit contributions
etti+0.008
 Jenn+0.007
 Paul+0.007
ller+0.007
 Brown+0.007
Neg logit contributions
 narrative-0.007
 often-0.007
 end-0.007
 forward-0.006
ery-0.006
Token so
Token ID523
Feature activation+5.27e-03

Pos logit contributions
etti+0.004
 Jenn+0.003
ller+0.003
isions+0.003
 Paul+0.003
Neg logit contributions
 narrative-0.003
 often-0.003
 end-0.003
 forward-0.002
ery-0.002
Token it
Token ID340
Feature activation-2.86e-03

Pos logit contributions
etti+0.011
isions+0.009
ller+0.009
 Jenn+0.009
icol+0.009
Neg logit contributions
 often-0.008
 end-0.008
 forward-0.007
 narrative-0.007
 hop-0.007
Token could
Token ID714
Feature activation-2.12e-03

Pos logit contributions
 often+0.004
 end+0.004
 narrative+0.004
 forward+0.004
 hop+0.003
Neg logit contributions
etti-0.006
ller-0.005
 Jenn-0.005
isions-0.005
 Paul-0.005
Token come
Token ID1282
Feature activation+6.76e-03

Pos logit contributions
 narrative+0.003
 forward+0.003
 often+0.003
 end+0.003
ying+0.003
Neg logit contributions
etti-0.004
 Jenn-0.004
ller-0.004
 Brown-0.003
 Paul-0.003
Token back
Token ID736
Feature activation+6.46e-03

Pos logit contributions
etti+0.015
ller+0.012
 Jenn+0.012
isions+0.012
 Paul+0.012
Neg logit contributions
 narrative-0.009
 often-0.009
 end-0.009
 forward-0.008
ery-0.008
Token.
Token ID13
Feature activation-6.51e-03

Pos logit contributions
etti+0.013
 Jenn+0.011
ller+0.011
isions+0.011
 Paul+0.011
Neg logit contributions
 end-0.010
 narrative-0.009
 often-0.009
 forward-0.009
 hop-0.008
Token He
Token ID679
Feature activation-3.34e-03

Pos logit contributions
 often+0.010
 end+0.010
 narrative+0.010
 forward+0.010
ery+0.009
Neg logit contributions
etti-0.013
 Jenn-0.011
isions-0.011
ller-0.011
 Paul-0.011
Token said
Token ID531
Feature activation-1.74e-02

Pos logit contributions
 often+0.005
 end+0.005
 forward+0.005
 narrative+0.005
 hop+0.004
Neg logit contributions
etti-0.007
isions-0.005
ller-0.005
 Jenn-0.005
icol-0.005
Token "
Token ID366
Feature activation-8.39e-03

Pos logit contributions
 end+0.028
 narrative+0.027
 often+0.027
 forward+0.026
ery+0.024
Neg logit contributions
etti-0.034
 Jenn-0.028
ller-0.028
 Paul-0.027
 Rob-0.027
Tokenrocket
Token ID30431
Feature activation-1.81e-02

Pos logit contributions
 often+0.013
 end+0.013
 forward+0.012
 narrative+0.012
 hop+0.011
Neg logit contributions
etti-0.017
isions-0.014
ller-0.014
 Jenn-0.014
icol-0.014
Tokenâ
Token ID22940
Feature activation+7.65e-03

Pos logit contributions
etti+0.017
isions+0.014
 Jenn+0.014
ller+0.014
icol+0.014
Neg logit contributions
 often-0.010
 end-0.009
 forward-0.009
 narrative-0.009
 hop-0.008
Token€
Token ID26391
Feature activation+2.72e-02

Pos logit contributions
etti+0.015
isions+0.012
ller+0.012
 Jenn+0.012
icol+0.012
Neg logit contributions
 often-0.013
 end-0.012
 forward-0.012
 narrative-0.012
 hop-0.011
Token�
Token ID129
Feature activation+1.07e-02

Pos logit contributions
etti+0.050
 conf+0.049
 extingu+0.046
 flashing+0.045
plug+0.044
Neg logit contributions
 often-0.039
 always-0.032
 hop-0.032
 end-0.031
 never-0.031
Token�
Token ID241
Feature activation+8.05e-04

Pos logit contributions
etti+0.021
isions+0.017
ller+0.016
icol+0.016
 Jenn+0.016
Neg logit contributions
 often-0.018
 end-0.017
 forward-0.017
 narrative-0.016
 hop-0.015
TokenWe
Token ID1135
Feature activation-2.68e-02

Pos logit contributions
etti+0.002
isions+0.001
ller+0.001
 Jenn+0.001
icol+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 hop-0.001
Tokenâ
Token ID22940
Feature activation+3.34e-03

Pos logit contributions
 narrative+0.037
 forward+0.036
 weaker+0.036
 often+0.035
ying+0.034
Neg logit contributions
etti-0.051
 Jenn-0.044
 Paul-0.044
 Sus-0.043
 Mike-0.042
Token€
Token ID26391
Feature activation+2.99e-02

Pos logit contributions
etti+0.006
 Jenn+0.005
isions+0.005
ller+0.005
icol+0.005
Neg logit contributions
 often-0.006
 end-0.005
 forward-0.005
 narrative-0.005
 hop-0.005
Tokenâ„¢
Token ID8151
Feature activation+6.52e-04

Pos logit contributions
etti+0.052
plug+0.046
 conf+0.046
ller+0.046
 flashing+0.046
Neg logit contributions
 often-0.047
 hop-0.041
 forward-0.041
 end-0.038
 always-0.038
Tokenll
Token ID297
Feature activation-1.54e-02

Pos logit contributions
etti+0.001
isions+0.001
 Jenn+0.001
ller+0.001
icol+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 hop-0.001
Token never
Token ID1239
Feature activation-2.30e-03

Pos logit contributions
 narrative+0.026
 forward+0.023
 brav+0.021
orn+0.021
 end+0.020
Neg logit contributions
etti-0.031
 Jenn-0.025
 Sus-0.024
isions-0.024
ller-0.024
Token know
Token ID760
Feature activation+1.22e-03

Pos logit contributions
 forward+0.003
 narrative+0.003
 often+0.003
 end+0.003
ery+0.003
Neg logit contributions
etti-0.005
 Jenn-0.004
isions-0.004
ller-0.004
 Paul-0.004
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
 narrative+0.004
 end+0.004
 forward+0.004
 often+0.004
 brav+0.003
Neg logit contributions
etti-0.006
isions-0.005
 Brown-0.005
 Jenn-0.005
ller-0.005
Token gave
Token ID2921
Feature activation-1.14e-02

Pos logit contributions
etti+0.028
isions+0.024
ller+0.023
icol+0.022
 Jenn+0.022
Neg logit contributions
 often-0.019
 end-0.019
 forward-0.017
 narrative-0.017
 hop-0.016
Token it
Token ID340
Feature activation+6.92e-03

Pos logit contributions
 often+0.019
 end+0.019
 forward+0.018
 narrative+0.018
 hop+0.016
Neg logit contributions
etti-0.022
isions-0.018
 Jenn-0.017
ller-0.017
icol-0.017
Token a
Token ID257
Feature activation+1.64e-02

Pos logit contributions
etti+0.015
isions+0.012
ller+0.012
 Jenn+0.012
icol+0.012
Neg logit contributions
 often-0.010
 end-0.010
 forward-0.010
 narrative-0.009
 hop-0.009
Token new
Token ID649
Feature activation+1.66e-02

Pos logit contributions
etti+0.038
isions+0.032
ller+0.031
 Jenn+0.031
icol+0.030
Neg logit contributions
 often-0.022
 end-0.022
 forward-0.021
 hop-0.020
 narrative-0.019
Token name
Token ID1438
Feature activation+9.04e-03

Pos logit contributions
etti+0.035
isions+0.029
ller+0.029
 Jenn+0.028
icol+0.028
Neg logit contributions
 end-0.026
 often-0.025
 forward-0.023
 hop-0.022
 narrative-0.022
Token,
Token ID11
Feature activation+5.31e-03

Pos logit contributions
etti+0.020
 Jenn+0.016
ller+0.016
isions+0.016
 Brown+0.016
Neg logit contributions
 often-0.013
 end-0.012
 narrative-0.012
 forward-0.011
 hop-0.011
Token Fl
Token ID1610
Feature activation+1.52e-02

Pos logit contributions
etti+0.013
isions+0.011
ller+0.011
 Jenn+0.010
icol+0.010
Neg logit contributions
 often-0.007
 end-0.007
 forward-0.006
 narrative-0.006
 hop-0.005
Tokenuffy
Token ID15352
Feature activation+7.19e-03

Pos logit contributions
etti+0.020
 Jenn+0.016
isions+0.016
ller+0.016
icol+0.016
Neg logit contributions
 end-0.033
 often-0.033
 forward-0.031
 narrative-0.030
ery-0.030
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
etti+0.017
isions+0.014
ller+0.014
 Jenn+0.014
icol+0.014
Neg logit contributions
 often-0.009
 end-0.009
 forward-0.009
 narrative-0.009
 hop-0.008
Token Mia
Token ID32189
Feature activation+4.91e-03

Pos logit contributions
etti+0.002
isions+0.002
ller+0.002
icol+0.002
itor+0.002
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.002
 hop-0.001
 narrative-0.001
Token wardrobe
Token ID35593
Feature activation+6.39e-03

Pos logit contributions
 often+0.007
 forward+0.006
 end+0.006
 narrative+0.006
 hop+0.006
Neg logit contributions
etti-0.008
isions-0.007
ller-0.007
 Jenn-0.007
icol-0.007
Token.
Token ID13
Feature activation+8.93e-03

Pos logit contributions
etti+0.012
 Jenn+0.010
ller+0.010
isions+0.010
itor+0.009
Neg logit contributions
 narrative-0.011
 end-0.010
 often-0.009
 brav-0.009
 forward-0.009
Token He
Token ID679
Feature activation-1.92e-03

Pos logit contributions
etti+0.021
isions+0.018
ller+0.018
icol+0.017
 conf+0.017
Neg logit contributions
 often-0.011
 forward-0.010
 end-0.010
 hop-0.009
 narrative-0.009
Token knew
Token ID2993
Feature activation-3.40e-03

Pos logit contributions
 end+0.003
 forward+0.003
 narrative+0.003
 often+0.003
ery+0.003
Neg logit contributions
etti-0.004
 Jenn-0.003
ller-0.003
isions-0.003
icol-0.003
Token he
Token ID339
Feature activation+4.03e-03

Pos logit contributions
 end+0.006
 often+0.005
 narrative+0.005
 forward+0.005
 hop+0.005
Neg logit contributions
etti-0.006
 Jenn-0.005
 Paul-0.005
ller-0.005
 Rob-0.005
Token had
Token ID550
Feature activation+3.79e-03

Pos logit contributions
etti+0.008
 Jenn+0.007
 Paul+0.007
ller+0.007
isions+0.007
Neg logit contributions
 narrative-0.006
 end-0.006
 forward-0.005
 often-0.005
 hop-0.005
Token done
Token ID1760
Feature activation+1.80e-03

Pos logit contributions
etti+0.007
 Jenn+0.006
isions+0.006
ller+0.006
 Paul+0.006
Neg logit contributions
 end-0.006
 often-0.006
 narrative-0.006
 forward-0.006
 hop-0.005
Token something
Token ID1223
Feature activation-6.16e-03

Pos logit contributions
etti+0.004
isions+0.003
ller+0.003
 Jenn+0.003
icol+0.003
Neg logit contributions
 often-0.003
 end-0.003
 forward-0.003
 narrative-0.003
 hop-0.002
Token special
Token ID2041
Feature activation-1.37e-02

Pos logit contributions
 narrative+0.009
 end+0.009
 often+0.008
 forward+0.008
 hop+0.008
Neg logit contributions
etti-0.012
 Paul-0.010
 Jenn-0.010
isions-0.010
 Rob-0.010
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
 narrative+0.022
 end+0.021
 often+0.021
 forward+0.020
 hop+0.020
Neg logit contributions
etti-0.026
 Paul-0.021
 Jenn-0.021
 Rob-0.020
ller-0.020
Token he
Token ID339
Feature activation-5.05e-03

Pos logit contributions
 end+0.016
 narrative+0.015
 often+0.015
 forward+0.015
 hop+0.013
Neg logit contributions
etti-0.021
 Jenn-0.017
ller-0.017
 Paul-0.017
isions-0.017
Token a
Token ID257
Feature activation+1.38e-02

Pos logit contributions
etti+0.014
isions+0.012
ller+0.012
 Jenn+0.012
icol+0.011
Neg logit contributions
 often-0.011
 end-0.011
 forward-0.011
 narrative-0.011
 hop-0.010
Token friendly
Token ID8030
Feature activation+9.74e-03

Pos logit contributions
etti+0.031
isions+0.025
ller+0.024
icol+0.024
 Jenn+0.024
Neg logit contributions
 end-0.021
 often-0.021
 forward-0.020
 narrative-0.018
 hop-0.018
Token mechanic
Token ID21600
Feature activation+6.69e-03

Pos logit contributions
etti+0.020
isions+0.015
ller+0.015
icol+0.015
 Jenn+0.015
Neg logit contributions
 end-0.017
 often-0.017
 forward-0.016
 hop-0.014
 brav-0.014
Token named
Token ID3706
Feature activation+1.35e-02

Pos logit contributions
etti+0.016
isions+0.013
ller+0.013
 Jenn+0.013
icol+0.013
Neg logit contributions
 often-0.008
 end-0.008
 forward-0.008
 narrative-0.007
 hop-0.007
Token Sam
Token ID3409
Feature activation-8.13e-03

Pos logit contributions
etti+0.019
isions+0.013
icol+0.013
ller+0.013
llers+0.011
Neg logit contributions
 often-0.030
 end-0.030
 forward-0.029
 brav-0.028
 hop-0.027
Token.
Token ID13
Feature activation+6.47e-03

Pos logit contributions
 narrative+0.011
 often+0.011
 end+0.011
 forward+0.011
 brav+0.010
Neg logit contributions
etti-0.017
isions-0.014
 Jenn-0.014
ller-0.014
 Paul-0.014
Token Sam
Token ID3409
Feature activation+4.47e-03

Pos logit contributions
etti+0.012
isions+0.009
ller+0.009
icol+0.009
 conf+0.008
Neg logit contributions
 often-0.012
 forward-0.012
 end-0.011
 hop-0.011
 narrative-0.010
Token was
Token ID373
Feature activation+8.35e-04

Pos logit contributions
etti+0.010
isions+0.008
icol+0.008
 Jenn+0.008
ller+0.008
Neg logit contributions
 often-0.007
 end-0.006
 forward-0.006
 narrative-0.005
 always-0.005
Token deep
Token ID2769
Feature activation+1.04e-02

Pos logit contributions
etti+0.002
isions+0.002
icol+0.002
ller+0.002
 Jenn+0.002
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 hop-0.001
Token in
Token ID287
Feature activation+9.13e-03

Pos logit contributions
etti+0.026
isions+0.022
ller+0.021
 Jenn+0.021
icol+0.021
Neg logit contributions
 often-0.013
 end-0.012
 forward-0.012
 narrative-0.011
 hop-0.010
Token thought
Token ID1807
Feature activation-2.48e-03

Pos logit contributions
etti+0.021
isions+0.018
ller+0.018
icol+0.017
itor+0.017
Neg logit contributions
 often-0.013
 end-0.012
 forward-0.012
 narrative-0.011
 hop-0.010
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
 often+0.006
 end+0.006
 forward+0.005
 narrative+0.005
 hop+0.005
Neg logit contributions
etti-0.008
isions-0.006
ller-0.006
icol-0.006
 Jenn-0.006
Token time
Token ID640
Feature activation+1.03e-02

Pos logit contributions
 often+0.010
 forward+0.010
 end+0.010
 narrative+0.010
 brav+0.009
Neg logit contributions
etti-0.025
isions-0.022
icol-0.021
ller-0.021
 Jenn-0.021
Token,
Token ID11
Feature activation-1.61e-03

Pos logit contributions
etti+0.027
isions+0.023
ller+0.023
icol+0.023
plug+0.022
Neg logit contributions
 often-0.011
 end-0.011
 forward-0.010
 hop-0.009
 narrative-0.009
Token there
Token ID612
Feature activation+9.82e-03

Pos logit contributions
 often+0.002
 end+0.002
 forward+0.002
 narrative+0.002
 hop+0.002
Neg logit contributions
etti-0.003
 Jenn-0.003
isions-0.003
ller-0.003
 Paul-0.003
Token was
Token ID373
Feature activation+6.87e-03

Pos logit contributions
etti+0.021
isions+0.018
ller+0.017
icol+0.017
 Jenn+0.017
Neg logit contributions
 often-0.015
 end-0.015
 forward-0.014
 narrative-0.013
 hop-0.012
Token a
Token ID257
Feature activation-2.08e-03

Pos logit contributions
etti+0.016
ller+0.014
isions+0.014
itor+0.013
icol+0.013
Neg logit contributions
 often-0.009
 end-0.009
 forward-0.008
 hop-0.008
 narrative-0.008
Token phone
Token ID3072
Feature activation+1.20e-02

Pos logit contributions
 often+0.003
 end+0.003
 forward+0.003
 hop+0.003
 narrative+0.003
Neg logit contributions
etti-0.005
isions-0.004
 Jenn-0.004
ller-0.004
icol-0.004
Token.
Token ID13
Feature activation-4.98e-03

Pos logit contributions
etti+0.027
isions+0.023
ller+0.022
 Jenn+0.022
icol+0.022
Neg logit contributions
 often-0.017
 end-0.016
 forward-0.015
 narrative-0.015
 hop-0.013
Token It
Token ID632
Feature activation+7.84e-03

Pos logit contributions
 often+0.007
 end+0.006
 hop+0.006
 forward+0.006
 sometimes+0.005
Neg logit contributions
etti-0.011
isions-0.010
icol-0.010
drivers-0.009
itor-0.009
Token liked
Token ID8288
Feature activation+1.01e-02

Pos logit contributions
etti+0.020
isions+0.017
icol+0.017
itor+0.016
MY+0.016
Neg logit contributions
 often-0.010
 end-0.009
 always-0.009
 also-0.008
 sometimes-0.008
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
etti+0.024
isions+0.019
icol+0.019
MY+0.019
itor+0.018
Neg logit contributions
 often-0.015
 end-0.013
 forward-0.013
 always-0.012
 sometimes-0.012
Token.
Token ID13
Feature activation+4.48e-03

Pos logit contributions
etti+0.003
 Jenn+0.002
isions+0.002
ller+0.002
 Johnson+0.002
Neg logit contributions
 narrative-0.002
orn-0.002
 forward-0.002
 often-0.002
 end-0.002
Token They
Token ID1119
Feature activation-8.20e-03

Pos logit contributions
etti+0.010
ller+0.008
isions+0.008
icol+0.008
itor+0.008
Neg logit contributions
 often-0.007
 end-0.006
 forward-0.006
 narrative-0.006
 hop-0.006
Token have
Token ID423
Feature activation+1.38e-02

Pos logit contributions
 narrative+0.014
ying+0.013
orn+0.013
 forward+0.013
ery+0.013
Neg logit contributions
etti-0.013
 Jenn-0.012
 Zoom-0.011
 Brown-0.011
 Rob-0.011
Token fun
Token ID1257
Feature activation+1.22e-02

Pos logit contributions
etti+0.028
ller+0.023
isions+0.022
icol+0.021
 Jenn+0.021
Neg logit contributions
 often-0.024
 end-0.023
 forward-0.022
 narrative-0.021
 hop-0.021
Token.
Token ID13
Feature activation+7.12e-04

Pos logit contributions
etti+0.027
isions+0.022
ller+0.022
 Jenn+0.022
icol+0.021
Neg logit contributions
 often-0.018
 end-0.017
 forward-0.017
 narrative-0.017
 brav-0.015
Token They
Token ID1119
Feature activation-9.71e-03

Pos logit contributions
etti+0.002
ller+0.001
isions+0.001
icol+0.001
itor+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 sometimes-0.001
Token are
Token ID389
Feature activation+8.57e-04

Pos logit contributions
 narrative+0.015
ying+0.015
 forward+0.015
orn+0.015
ery+0.015
Neg logit contributions
etti-0.017
 Jenn-0.015
 turns-0.014
 Zoom-0.014
 Rob-0.014
Token happy
Token ID3772
Feature activation+1.77e-02

Pos logit contributions
etti+0.002
 Jenn+0.002
isions+0.002
ller+0.002
icol+0.002
Neg logit contributions
 often-0.001
 end-0.001
 narrative-0.001
 forward-0.001
 hop-0.001
Token.
Token ID13
Feature activation-6.29e-03

Pos logit contributions
etti+0.050
isions+0.044
icol+0.043
ller+0.042
plug+0.042
Neg logit contributions
 often-0.017
 end-0.016
 sometimes-0.014
 forward-0.013
 hop-0.012
Token They
Token ID1119
Feature activation-1.14e-02

Pos logit contributions
 often+0.009
 end+0.009
 forward+0.008
 narrative+0.008
 sometimes+0.007
Neg logit contributions
etti-0.014
ller-0.012
isions-0.012
itor-0.011
icol-0.011
Token have
Token ID423
Feature activation+1.02e-02

Pos logit contributions
 forward+0.017
 narrative+0.016
ying+0.016
orn+0.016
 often+0.016
Neg logit contributions
etti-0.021
 Jenn-0.018
 Brown-0.017
 sprink-0.017
 Rob-0.017
Token are
Token ID389
Feature activation+1.45e-02

Pos logit contributions
etti+0.003
isions+0.003
ller+0.003
 Jenn+0.002
icol+0.002
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.001
 narrative-0.001
 hop-0.001
Token you
Token ID345
Feature activation-6.73e-03

Pos logit contributions
etti+0.034
isions+0.029
ller+0.029
icol+0.028
 Jenn+0.028
Neg logit contributions
 often-0.020
 end-0.018
 forward-0.018
 narrative-0.017
 hop-0.016
Token doing
Token ID1804
Feature activation-3.78e-03

Pos logit contributions
 often+0.011
 narrative+0.011
 end+0.011
 forward+0.011
 hop+0.009
Neg logit contributions
etti-0.013
 Jenn-0.010
ller-0.010
isions-0.010
 Paul-0.010
Token,
Token ID11
Feature activation-9.60e-04

Pos logit contributions
 often+0.005
 end+0.005
 narrative+0.005
 forward+0.005
 hop+0.005
Neg logit contributions
etti-0.008
ller-0.007
isions-0.007
 Jenn-0.007
 Paul-0.006
Token young
Token ID1862
Feature activation+4.98e-03

Pos logit contributions
 often+0.002
 end+0.002
 forward+0.002
 narrative+0.002
 hop+0.002
Neg logit contributions
etti-0.002
isions-0.001
ller-0.001
 Jenn-0.001
icol-0.001
Token man
Token ID582
Feature activation-5.16e-03

Pos logit contributions
etti+0.009
isions+0.007
icol+0.007
ller+0.007
 Jenn+0.007
Neg logit contributions
 end-0.009
 often-0.009
 forward-0.009
 narrative-0.009
 brav-0.008
Token?
Token ID30
Feature activation+1.47e-02

Pos logit contributions
 narrative+0.007
 end+0.006
 often+0.006
 forward+0.006
 hop+0.005
Neg logit contributions
etti-0.011
ller-0.010
 Jenn-0.010
isions-0.010
 Paul-0.009
Token Are
Token ID4231
Feature activation-1.07e-02

Pos logit contributions
etti+0.027
isions+0.021
ller+0.021
 Jenn+0.021
icol+0.020
Neg logit contributions
 often-0.027
 end-0.027
 forward-0.025
 narrative-0.025
 hop-0.023
Token you
Token ID345
Feature activation-5.11e-03

Pos logit contributions
 often+0.016
 end+0.015
 forward+0.015
 narrative+0.014
 hop+0.013
Neg logit contributions
etti-0.023
isions-0.019
ller-0.019
icol-0.019
 Jenn-0.019
Token trying
Token ID2111
Feature activation-1.49e-02

Pos logit contributions
 often+0.008
 end+0.008
 narrative+0.007
 forward+0.007
 hop+0.006
Neg logit contributions
etti-0.011
isions-0.009
ller-0.009
 Jenn-0.009
icol-0.008
Token to
Token ID284
Feature activation-1.93e-02

Pos logit contributions
 narrative+0.029
 often+0.028
 end+0.027
 forward+0.027
 brav+0.025
Neg logit contributions
etti-0.025
ller-0.020
isions-0.020
 Jenn-0.019
 Brown-0.018
Token but
Token ID475
Feature activation-7.69e-03

Pos logit contributions
 narrative+0.012
ery+0.012
 forward+0.011
 end+0.011
 often+0.011
Neg logit contributions
etti-0.012
 Jenn-0.011
 Paul-0.011
 Brown-0.011
 John-0.011
Token he
Token ID339
Feature activation-3.69e-03

Pos logit contributions
 end+0.011
 often+0.011
 forward+0.011
 narrative+0.011
ery+0.011
Neg logit contributions
etti-0.015
 Jenn-0.013
 Paul-0.013
ller-0.012
 Brown-0.012
Token still
Token ID991
Feature activation-2.85e-03

Pos logit contributions
 forward+0.006
 narrative+0.005
 end+0.005
 often+0.005
 hop+0.005
Neg logit contributions
etti-0.007
ller-0.006
 Jenn-0.006
itor-0.006
 Paul-0.006
Token wanted
Token ID2227
Feature activation-9.18e-03

Pos logit contributions
 narrative+0.005
 forward+0.005
 end+0.005
 often+0.004
ery+0.004
Neg logit contributions
etti-0.005
ller-0.005
 Jenn-0.004
 Paul-0.004
isions-0.004
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
 often+0.016
 end+0.016
 forward+0.015
 narrative+0.015
 hop+0.014
Neg logit contributions
etti-0.017
isions-0.013
 Jenn-0.013
ller-0.013
icol-0.013
Token keep
Token ID1394
Feature activation+8.00e-04

Pos logit contributions
 forward+0.020
 narrative+0.019
 often+0.017
 end+0.016
ery+0.016
Neg logit contributions
etti-0.025
 Jenn-0.021
 Paul-0.020
ller-0.020
 Brown-0.020
Token the
Token ID262
Feature activation+1.85e-02

Pos logit contributions
etti+0.002
isions+0.002
ller+0.002
icol+0.002
 Jenn+0.001
Neg logit contributions
 often-0.001
 end-0.001
 forward-0.001
 narrative-0.001
 hop-0.001
Token ash
Token ID12530
Feature activation+1.82e-02

Pos logit contributions
etti+0.039
isions+0.032
ller+0.031
icol+0.031
 Jenn+0.030
Neg logit contributions
 end-0.030
 often-0.029
 forward-0.027
 narrative-0.026
 brav-0.024
Tokentr
Token ID2213
Feature activation+2.68e-02

Pos logit contributions
etti+0.029
ller+0.022
 Brown+0.022
 Jenn+0.022
isions+0.021
Neg logit contributions
 often-0.036
 narrative-0.036
 end-0.035
 forward-0.034
 hop-0.033
Tokenay
Token ID323
Feature activation+1.17e-02

Pos logit contributions
etti+0.047
isions+0.038
icol+0.037
ller+0.037
itor+0.036
Neg logit contributions
 often-0.049
 end-0.048
 forward-0.048
 narrative-0.045
ery-0.041
Token.
Token ID13
Feature activation-5.63e-03

Pos logit contributions
etti+0.026
ller+0.022
 Jenn+0.022
isions+0.022
 Paul+0.021
Neg logit contributions
 often-0.016
 end-0.015
 narrative-0.015
 forward-0.014
 hop-0.013
Token
Token ID198
Feature activation+4.14e-03

Pos logit contributions
 often+0.010
 end+0.010
 narrative+0.010
 forward+0.010
 brav+0.009
Neg logit contributions
etti-0.010
 Jenn-0.008
ller-0.008
isions-0.008
icol-0.008
TokenS
Token ID50
Feature activation+1.78e-02

Pos logit contributions
etti+0.006
 Jenn+0.005
isions+0.005
MY+0.005
 Johnson+0.005
Neg logit contributions
 forward-0.007
 narrative-0.007
 end-0.007
 often-0.007
 brav-0.007
Tokenue
Token ID518
Feature activation+6.96e-03

Pos logit contributions
etti+0.027
isions+0.021
ller+0.021
icol+0.021
 Jenn+0.019
Neg logit contributions
 end-0.039
 often-0.038
 forward-0.036
 narrative-0.036
orn-0.033
Token smiled
Token ID13541
Feature activation-8.81e-03

Pos logit contributions
etti+0.014
ller+0.011
isions+0.011
 Jenn+0.011
icol+0.011
Neg logit contributions
 forward-0.011
 end-0.011
 narrative-0.011
 often-0.011
 brav-0.010
Token and
Token ID290
Feature activation-8.44e-04

Pos logit contributions
 narrative+0.014
 often+0.013
 end+0.013
 brav+0.012
 forward+0.012
Neg logit contributions
etti-0.018
ller-0.015
icol-0.014
 Brown-0.014
 Jenn-0.014
Token they
Token ID484
Feature activation+5.85e-05

Pos logit contributions
 often+0.001
 end+0.001
 hop+0.001
 forward+0.001
 always+0.001
Neg logit contributions
etti-0.002
isions-0.002
ller-0.001
 Pole-0.001
itor-0.001
Token played
Token ID2826
Feature activation+4.48e-03

Pos logit contributions
etti+0.000
 Jenn+0.000
isions+0.000
 Brown+0.000
ller+0.000
Neg logit contributions
 often-0.000
 forward-0.000
 narrative-0.000
 end-0.000
 brav-0.000
Token together
Token ID1978
Feature activation+1.20e-02

Pos logit contributions
etti+0.008
isions+0.006
ller+0.006
 Jenn+0.006
icol+0.006
Neg logit contributions
 often-0.008
 end-0.008
 forward-0.008
 narrative-0.008
 hop-0.007
Token all
Token ID477
Feature activation+4.07e-03

Pos logit contributions
etti+0.025
 Jenn+0.021
 Rob+0.020
ller+0.020
 Brown+0.020
Neg logit contributions
 narrative-0.017
 brav-0.015
 end-0.014
orn-0.014
 forward-0.014
Token day
Token ID1110
Feature activation+2.77e-02

Pos logit contributions
etti+0.010
 Jenn+0.009
ller+0.009
isions+0.008
 Paul+0.008
Neg logit contributions
 narrative-0.004
 forward-0.004
 often-0.004
 end-0.003
 hop-0.003
Token.
Token ID13
Feature activation+9.12e-03

Pos logit contributions
etti+0.070
icol+0.060
isions+0.060
 Jenn+0.059
ller+0.059
Neg logit contributions
 often-0.034
 end-0.029
 forward-0.029
 narrative-0.025
 hop-0.024
Token,
Token ID11
Feature activation-8.37e-03

Pos logit contributions
etti+0.007
 Jenn+0.005
ller+0.005
isions+0.005
icol+0.005
Neg logit contributions
 often-0.005
 end-0.005
 forward-0.005
 narrative-0.004
 brav-0.004
Token she
Token ID673
Feature activation-8.04e-03

Pos logit contributions
 often+0.017
 end+0.017
 forward+0.017
 narrative+0.016
 hop+0.015
Neg logit contributions
etti-0.013
 Jenn-0.010
isions-0.010
ller-0.009
icol-0.009
Token always
Token ID1464
Feature activation-1.22e-02

Pos logit contributions
 forward+0.010
 end+0.009
 narrative+0.009
igator+0.008
 brav+0.008
Neg logit contributions
etti-0.017
ller-0.016
 Brown-0.016
llers-0.015
 Jenn-0.015
Token remembered
Token ID12086
Feature activation-1.03e-02

Pos logit contributions
 forward+0.019
 end+0.019
 often+0.019
 narrative+0.019
ery+0.017
Neg logit contributions
etti-0.023
 Jenn-0.019
icol-0.019
 Paul-0.019
ller-0.019
Token to
Token ID284
Feature activation-3.13e-02

Pos logit contributions
 narrative+0.021
 forward+0.020
 end+0.020
 often+0.019
ery+0.019
Neg logit contributions
etti-0.015
 Jenn-0.012
 Brown-0.012
 Paul-0.012
ller-0.012
Token be
Token ID307
Feature activation-3.62e-02

Pos logit contributions
 forward+0.059
 narrative+0.054
ying+0.052
ery+0.050
iness+0.050
Neg logit contributions
etti-0.044
 Jenn-0.041
 Paul-0.040
 Brown-0.040
 John-0.037
Token honest
Token ID5508
Feature activation-2.16e-02

Pos logit contributions
 narrative+0.056
 end+0.052
 often+0.048
iness+0.047
orn+0.046
Neg logit contributions
etti-0.061
plug-0.058
 Jenn-0.058
 badges-0.054
 Brown-0.054
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
 narrative+0.040
wered+0.036
 end+0.036
 forward+0.034
 brav+0.034
Neg logit contributions
etti-0.033
isions-0.029
 Brown-0.028
ller-0.028
 Smith-0.028
Token reliable
Token ID9314
Feature activation-1.22e-02

Pos logit contributions
 narrative+0.035
 end+0.034
 forward+0.033
ery+0.033
orn+0.032
Neg logit contributions
etti-0.037
 Jenn-0.034
 Paul-0.033
ller-0.033
 Grand-0.032
Token.
Token ID13
Feature activation-5.70e-03

Pos logit contributions
 narrative+0.020
 end+0.019
 often+0.019
 forward+0.018
orn+0.018
Neg logit contributions
etti-0.021
 Brown-0.018
 Paul-0.018
isions-0.018
 Jenn-0.017
Token<|endoftext|>
Token ID50256
Feature activation+8.52e-03

Pos logit contributions
 often+0.009
 end+0.009
 forward+0.008
 narrative+0.008
 hop+0.007
Neg logit contributions
etti-0.012
isions-0.010
ller-0.010
 Jenn-0.009
icol-0.009
Token to
Token ID284
Feature activation-8.08e-03

Pos logit contributions
 often+0.018
 end+0.017
 forward+0.017
 narrative+0.017
 hop+0.015
Neg logit contributions
etti-0.018
isions-0.015
ller-0.015
 Jenn-0.015
icol-0.014
Token mark
Token ID1317
Feature activation-2.00e-02

Pos logit contributions
 forward+0.013
 often+0.013
 narrative+0.013
 end+0.012
ery+0.011
Neg logit contributions
etti-0.016
 Jenn-0.013
ller-0.013
isions-0.013
icol-0.012
Token the
Token ID262
Feature activation-1.78e-02

Pos logit contributions
 narrative+0.035
 often+0.035
 end+0.033
 forward+0.032
ery+0.029
Neg logit contributions
etti-0.034
ller-0.029
 Jenn-0.029
 Brown-0.028
 Rob-0.027
Token beetle
Token ID45489
Feature activation-2.24e-02

Pos logit contributions
 often+0.032
 forward+0.030
 narrative+0.029
ying+0.028
aying+0.027
Neg logit contributions
etti-0.025
 conf-0.024
 sprink-0.023
ller-0.022
 Smith-0.021
Token on
Token ID319
Feature activation-9.89e-03

Pos logit contributions
 narrative+0.048
 brav+0.040
 forward+0.039
wered+0.038
umper+0.037
Neg logit contributions
etti-0.030
ller-0.025
 cards-0.022
 Brown-0.022
 prizes-0.021
Token his
Token ID465
Feature activation-3.23e-02

Pos logit contributions
 narrative+0.019
 often+0.018
 forward+0.018
 end+0.017
ying+0.016
Neg logit contributions
etti-0.016
 Jenn-0.013
 Rob-0.013
ller-0.013
 Brown-0.013
Token paper
Token ID3348
Feature activation-3.62e-02

Pos logit contributions
 narrative+0.051
 forward+0.049
 often+0.049
 end+0.042
ying+0.042
Neg logit contributions
etti-0.061
ller-0.054
icol-0.050
itor-0.049
isions-0.048
Token.
Token ID13
Feature activation-1.61e-02

Pos logit contributions
 narrative+0.072
 brav+0.069
 forward+0.065
 End+0.064
 end+0.062
Neg logit contributions
etti-0.055
ller-0.045
 Brown-0.040
isions-0.039
 cards-0.039
Token
Token ID198
Feature activation-9.07e-03

Pos logit contributions
 often+0.021
 end+0.019
 forward+0.019
 hop+0.017
 narrative+0.017
Neg logit contributions
etti-0.039
isions-0.033
ller-0.032
icol-0.032
itor-0.031
Token
Token ID198
Feature activation+1.60e-04

Pos logit contributions
 forward+0.017
 narrative+0.017
 often+0.017
 end+0.017
 brav+0.015
Neg logit contributions
etti-0.015
 Jenn-0.012
ller-0.012
isions-0.011
 Paul-0.011
TokenHe
Token ID1544
Feature activation-3.70e-03

Pos logit contributions
etti+0.000
isions+0.000
 Jenn+0.000
plug+0.000
 Johnson+0.000
Neg logit contributions
 forward-0.000
 narrative-0.000
 often-0.000
 end-0.000
 brav-0.000
Token dance
Token ID9280
Feature activation-1.22e-02

Pos logit contributions
 narrative+0.032
 forward+0.031
 often+0.031
ery+0.031
iness+0.029
Neg logit contributions
etti-0.024
 Jenn-0.019
 Grand-0.018
 Sus-0.017
 Paul-0.017
Token.
Token ID13
Feature activation-5.49e-03

Pos logit contributions
 narrative+0.021
 end+0.019
 often+0.019
 brav+0.018
 forward+0.018
Neg logit contributions
etti-0.023
 Jenn-0.019
ller-0.018
isions-0.018
icol-0.017
Token She
Token ID1375
Feature activation-7.85e-03

Pos logit contributions
 often+0.007
 end+0.007
 forward+0.007
 narrative+0.007
 hop+0.006
Neg logit contributions
etti-0.013
isions-0.011
 Jenn-0.011
ller-0.011
icol-0.010
Token swayed
Token ID47632
Feature activation-2.29e-02

Pos logit contributions
 narrative+0.015
 forward+0.014
 end+0.014
 often+0.013
ery+0.013
Neg logit contributions
etti-0.013
 Jenn-0.010
ller-0.010
icol-0.010
isions-0.010
Token her
Token ID607
Feature activation-1.26e-02

Pos logit contributions
 narrative+0.040
 end+0.034
 often+0.033
ery+0.032
orn+0.032
Neg logit contributions
etti-0.044
 Jenn-0.037
ller-0.035
 Paul-0.035
icol-0.035
Token arms
Token ID5101
Feature activation-3.52e-02

Pos logit contributions
 often+0.020
 narrative+0.020
 end+0.019
 forward+0.019
orn+0.017
Neg logit contributions
etti-0.025
ller-0.020
icol-0.020
isions-0.019
 Jenn-0.019
Token up
Token ID510
Feature activation-8.91e-03

Pos logit contributions
 narrative+0.072
 end+0.057
 moral+0.056
 brav+0.054
orn+0.054
Neg logit contributions
etti-0.063
 flashing-0.051
ller-0.050
ding-0.049
 Jenn-0.049
Token and
Token ID290
Feature activation-3.61e-03

Pos logit contributions
 narrative+0.014
ery+0.013
 end+0.013
 brav+0.013
 often+0.012
Neg logit contributions
etti-0.017
 Jenn-0.014
icol-0.014
ller-0.014
 Brown-0.013
Token down
Token ID866
Feature activation-2.85e-04

Pos logit contributions
 often+0.005
 end+0.005
 narrative+0.005
 forward+0.005
ery+0.004
Neg logit contributions
etti-0.008
ller-0.006
 Jenn-0.006
isions-0.006
icol-0.006
Token and
Token ID290
Feature activation-5.07e-03

Pos logit contributions
 narrative+0.000
 often+0.000
 end+0.000
ery+0.000
orn+0.000
Neg logit contributions
etti-0.001
 Jenn-0.000
ller-0.000
 Paul-0.000
icol-0.000
Token tw
Token ID665
Feature activation-2.00e-02

Pos logit contributions
 often+0.007
 end+0.007
 narrative+0.007
 forward+0.007
ery+0.006
Neg logit contributions
etti-0.011
 Jenn-0.009
ller-0.009
isions-0.009
icol-0.009
Token string
Token ID4731
Feature activation-2.32e-02

Pos logit contributions
 often+0.003
 forward+0.003
 end+0.003
 narrative+0.003
 hop+0.002
Neg logit contributions
etti-0.003
isions-0.002
 conf-0.002
 sprink-0.002
ller-0.002
Token.
Token ID13
Feature activation+2.42e-03

Pos logit contributions
 narrative+0.036
 end+0.034
 often+0.034
 forward+0.033
 brav+0.031
Neg logit contributions
etti-0.049
 Jenn-0.039
ller-0.039
isions-0.038
 Paul-0.037
Token She
Token ID1375
Feature activation+1.82e-03

Pos logit contributions
etti+0.006
isions+0.005
ller+0.005
 Jenn+0.005
icol+0.005
Neg logit contributions
 often-0.003
 end-0.003
 forward-0.003
 narrative-0.003
 hop-0.002
Token tied
Token ID8165
Feature activation-2.25e-02

Pos logit contributions
etti+0.004
isions+0.003
ller+0.003
 Jenn+0.003
icol+0.003
Neg logit contributions
 forward-0.003
 end-0.003
 narrative-0.003
 often-0.003
 brav-0.003
Token the
Token ID262
Feature activation+6.65e-03

Pos logit contributions
 often+0.038
 end+0.037
 narrative+0.036
 forward+0.035
 hop+0.032
Neg logit contributions
etti-0.044
 Jenn-0.035
ller-0.035
isions-0.034
icol-0.033
Token string
Token ID4731
Feature activation-3.63e-02

Pos logit contributions
etti+0.012
isions+0.009
ller+0.009
 Jenn+0.009
icol+0.009
Neg logit contributions
 often-0.013
 end-0.012
 forward-0.012
 narrative-0.012
 hop-0.011
Token around
Token ID1088
Feature activation+1.60e-03

Pos logit contributions
 narrative+0.077
 end+0.066
 moral+0.063
 often+0.063
 brav+0.062
Neg logit contributions
etti-0.057
 Jenn-0.046
 Brown-0.043
ller-0.043
 Paul-0.042
Token the
Token ID262
Feature activation-8.20e-03

Pos logit contributions
etti+0.004
 Jenn+0.003
isions+0.003
ller+0.003
icol+0.003
Neg logit contributions
 often-0.002
 end-0.002
 forward-0.002
 narrative-0.002
 hop-0.002
Token balloon
Token ID21190
Feature activation-2.84e-02

Pos logit contributions
 often+0.016
 end+0.016
 forward+0.015
 narrative+0.015
 hop+0.014
Neg logit contributions
etti-0.014
isions-0.011
ller-0.011
 Jenn-0.010
icol-0.010
Token and
Token ID290
Feature activation-2.03e-03

Pos logit contributions
 narrative+0.050
 end+0.044
 often+0.043
 forward+0.042
 brav+0.041
Neg logit contributions
etti-0.056
ller-0.045
 Jenn-0.044
 Brown-0.044
 Smith-0.043
Token started
Token ID2067
Feature activation+6.43e-03

Pos logit contributions
 often+0.003
 end+0.003
 forward+0.003
 narrative+0.003
 hop+0.003
Neg logit contributions
etti-0.004
isions-0.004
ller-0.004
 Jenn-0.003
icol-0.003
Token in
Token ID287
Feature activation-1.26e-02

Pos logit contributions
 narrative+0.050
 brav+0.039
 end+0.039
 moderation+0.038
 moral+0.038
Neg logit contributions
etti-0.034
 Brown-0.027
icol-0.026
 Sus-0.026
ding-0.026
Token and
Token ID290
Feature activation-7.59e-03

Pos logit contributions
 narrative+0.024
 often+0.023
 forward+0.023
ying+0.023
ery+0.023
Neg logit contributions
etti-0.020
 Jenn-0.016
 Grand-0.015
 Paul-0.015
 conf-0.015
Token out
Token ID503
Feature activation-2.21e-03

Pos logit contributions
 narrative+0.012
 end+0.012
 often+0.011
 forward+0.011
ery+0.011
Neg logit contributions
etti-0.015
 Jenn-0.012
icol-0.012
ller-0.012
 Brown-0.012
Token of
Token ID286
Feature activation-1.02e-02

Pos logit contributions
 narrative+0.004
orn+0.003
 end+0.003
ery+0.003
 often+0.003
Neg logit contributions
etti-0.004
 Jenn-0.004
 Paul-0.003
 flashing-0.003
 Sus-0.003
Token the
Token ID262
Feature activation-2.96e-02

Pos logit contributions
 often+0.018
 narrative+0.018
 forward+0.018
 end+0.017
ery+0.017
Neg logit contributions
etti-0.018
 Jenn-0.014
icol-0.013
 Paul-0.013
 conf-0.013
Token flowers
Token ID12734
Feature activation-3.41e-02

Pos logit contributions
 often+0.051
 forward+0.048
 narrative+0.047
ery+0.047
ying+0.045
Neg logit contributions
etti-0.047
icol-0.039
 conf-0.038
isions-0.038
 Smith-0.036
Token,
Token ID11
Feature activation-6.30e-03

Pos logit contributions
 narrative+0.068
 moral+0.056
 end+0.055
 forward+0.053
 brav+0.052
Neg logit contributions
etti-0.061
 Brown-0.043
 Jenn-0.043
 flashing-0.042
 sprink-0.042
Token eating
Token ID6600
Feature activation-1.14e-02

Pos logit contributions
 narrative+0.011
 end+0.010
ery+0.009
orn+0.009
 often+0.009
Neg logit contributions
etti-0.012
icol-0.010
 Jenn-0.010
 Brown-0.010
ller-0.009
Token the
Token ID262
Feature activation-8.88e-03

Pos logit contributions
 narrative+0.020
 brav+0.019
 forward+0.019
 often+0.018
 end+0.018
Neg logit contributions
etti-0.019
 conf-0.017
 sprink-0.016
 balloons-0.015
 Brown-0.014
Token juicy
Token ID35075
Feature activation+2.58e-03

Pos logit contributions
 often+0.016
 forward+0.015
 narrative+0.014
 end+0.014
orn+0.013
Neg logit contributions
etti-0.015
 conf-0.013
isions-0.012
icol-0.012
 sprink-0.012
Token fruit
Token ID8234
Feature activation-5.70e-03

Pos logit contributions
etti+0.005
icol+0.004
ller+0.004
isions+0.004
 Jenn+0.004
Neg logit contributions
 often-0.004
 narrative-0.004
 forward-0.004
 end-0.004
 brav-0.004