TOP ACTIVATIONS
MAX = 4.944

liked to play and have pic n ics . One day
in the garden and have pic n ics together .
the park , and have pic n ics outside .
Ann thought that hide and seek was too boring . They
near the cliff and have pic n ics . One day
roof . They would have pic n ics and play games
wanted to play hide and seek with its friends . The
wanted to play hide and seek . Billy loved hide and
particularly liked playing hide and seek with her friends .
wanted to play hide and seek with the sun . The
wanted to play hide and seek , but it was getting
wanted to play hide and seek in the garden . She
wanted to play hide and seek and Ann wanted to play
felt like playing hide and seek . She sprang down a
went on adventures , had pic n ics , and even
wanted to play hide and seek . Spark y said yes
play tag and hide and seek , and tell one another
playing tag and hide and seek . They were fast and
see the bird and have pic n ics . <|endoftext|> Once
loved to play hide and seek with Lily . One day

MIN ACTIVATIONS
MIN = -5.451

filled with treasures from long ago ! The adventurers
in the park many years ago . Toby was amazed !
" It was hidden long ago , and only seems fancy
rest of the ice - cream and played together . Lily
pe eking out from the banks . As he watched for
march around the far my ard with his friends . One
was exploring the far my ard when he came across a
march around the far my ard with his friends . One
are from a long time ago , before we were born
zoo workers have stopped the rh in oc eros . He
who had passed away long ago . One day , a
flew right back up the chim ney and disappeared into the
scientist was surprised at the ot ter 's ability . His
The ants came from the junk under his bed . Tim
is from a time long ago . It has been locked
's house was built long ago . L ily
of animals that lived long ago . They saw shiny rocks
started smoking a long time ago and now it 's hard
velvet box all those years ago . They were glad that
was painted a long time ago , â € her mum

INTERVAL 2.345 - 4.944
CONTAINS 0.278%

They wanted to make a big building with many rooms and
toy st eth oscope and band ages to make them feel
. They had a big sheet that they used to make
the pengu ins swim and slide and w addle . They
the backyard . In the middle of playing she heard a

INTERVAL -0.254 - 2.345
CONTAINS 63.665%

apples and cheese . He thanked the doctor and his mom
dog . He wished he had never gone to the park
could make her feel better . His sister smiled and thanked
to keep the glass jug forever as a reminder of their
a time , there was a man named Jack . Jack

INTERVAL -2.852 - -0.254
CONTAINS 35.861%

so much traffic ?" Her mom my said ,
" Why are you sad ?" The snake said
now ?" " Yes , of course ," Mom
walked to the middle of the room and tw irled around
home , so she wr igg led away from her mom

INTERVAL -5.451 - -2.852
CONTAINS 0.196%

go on walks in the woods with her dog , Spot
Tim my went to the museum with his mom my .
there was a mom my who liked to bake . Every
that day on , the little bird ie was not shy
they marched out of the water and onto the banks .
Token liked
Token ID8288
Feature activation-7.12e-01

Pos logit contributions
storms+0.015
aces+0.013
 wrists+0.013
case+0.013
chair+0.012
Neg logit contributions
 check-0.010
 cook-0.010
 help-0.010
 little-0.010
 save-0.010
Token to
Token ID284
Feature activation+4.92e-01

Pos logit contributions
 help+0.013
 cook+0.013
 check+0.013
 little+0.012
 work+0.011
Neg logit contributions
storms-0.036
case-0.034
aces-0.033
 wrists-0.033
chair-0.032
Token play
Token ID711
Feature activation+7.74e-01

Pos logit contributions
storms+0.025
 wrists+0.023
case+0.023
aces+0.023
chair+0.022
Neg logit contributions
 check-0.007
 little-0.007
asting-0.007
 permission-0.007
 cook-0.007
Token and
Token ID290
Feature activation-2.85e-02

Pos logit contributions
storms+0.035
 wrists+0.031
aces+0.031
case+0.031
chair+0.030
Neg logit contributions
 check-0.018
 help-0.018
 cook-0.018
 little-0.017
 end-0.017
Token have
Token ID423
Feature activation+5.41e-01

Pos logit contributions
 check+0.000
 help+0.000
 cook+0.000
 little+0.000
 end+0.000
Neg logit contributions
storms-0.001
 wrists-0.001
case-0.001
aces-0.001
chair-0.001
Token pic
Token ID8301
Feature activation+4.94e+00

Pos logit contributions
storms+0.028
 wrists+0.024
aces+0.024
chair+0.024
case+0.024
Neg logit contributions
 little-0.012
 help-0.011
 cook-0.010
 check-0.010
 two-0.010
Tokenn
Token ID77
Feature activation-6.86e-01

Pos logit contributions
storms+0.163
case+0.154
aces+0.145
 wrists+0.144
uts+0.137
Neg logit contributions
 check-0.164
 cook-0.160
 little-0.155
 adult-0.145
 permission-0.145
Tokenics
Token ID873
Feature activation+1.26e+00

Pos logit contributions
 help+0.033
 check+0.033
 cook+0.033
 little+0.032
 end+0.032
Neg logit contributions
storms-0.014
 wrists-0.010
aces-0.010
case-0.009
chair-0.009
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
storms+0.075
 wrists+0.068
chair+0.068
aces+0.067
case+0.066
Neg logit contributions
 help-0.016
 forgive-0.013
 check-0.013
 cook-0.013
 end-0.012
Token One
Token ID1881
Feature activation-9.01e-02

Pos logit contributions
storms+0.015
 wrists+0.013
case+0.013
aces+0.013
chair+0.012
Neg logit contributions
 check-0.007
 help-0.007
 cook-0.007
 little-0.007
 save-0.007
Token day
Token ID1110
Feature activation+6.11e-02

Pos logit contributions
 check+0.002
 permission+0.002
 help+0.002
 save+0.002
 install+0.002
Neg logit contributions
storms-0.004
case-0.004
chair-0.004
aces-0.003
oline-0.003
Token in
Token ID287
Feature activation-5.42e-01

Pos logit contributions
storms+0.023
case+0.020
aces+0.020
 wrists+0.020
chair+0.019
Neg logit contributions
 check-0.010
 help-0.009
 permission-0.009
 cook-0.009
 end-0.009
Token the
Token ID262
Feature activation-4.05e-01

Pos logit contributions
 check+0.011
 permission+0.011
 forgive+0.011
asting+0.011
 adult+0.010
Neg logit contributions
storms-0.024
 wrists-0.022
case-0.022
chair-0.021
 Caroline-0.020
Token garden
Token ID11376
Feature activation-1.19e+00

Pos logit contributions
 help+0.010
 check+0.010
 permission+0.010
 cook+0.010
 forgive+0.010
Neg logit contributions
storms-0.017
case-0.015
 wrists-0.015
chair-0.014
aces-0.014
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
 check+0.016
 help+0.015
 cook+0.015
 little+0.014
 end+0.013
Neg logit contributions
storms-0.067
 wrists-0.060
aces-0.060
case-0.060
chair-0.058
Token have
Token ID423
Feature activation+7.01e-01

Pos logit contributions
storms+0.016
 wrists+0.015
case+0.014
chair+0.014
aces+0.013
Neg logit contributions
 adult-0.006
asting-0.006
 permission-0.005
 check-0.005
 little-0.005
Token pic
Token ID8301
Feature activation+4.67e+00

Pos logit contributions
storms+0.033
case+0.028
 wrists+0.028
aces+0.028
chair+0.027
Neg logit contributions
 help-0.017
 check-0.016
 cook-0.016
 little-0.016
 end-0.015
Tokenn
Token ID77
Feature activation-7.05e-01

Pos logit contributions
storms+0.149
case+0.137
 wrists+0.133
aces+0.131
uts+0.120
Neg logit contributions
 check-0.156
 cook-0.155
 little-0.150
 adult-0.141
 permission-0.139
Tokenics
Token ID873
Feature activation+1.16e+00

Pos logit contributions
 check+0.036
 help+0.036
 cook+0.036
 little+0.035
 end+0.034
Neg logit contributions
storms-0.013
case-0.009
 wrists-0.009
aces-0.009
chair-0.007
Token together
Token ID1978
Feature activation+1.51e+00

Pos logit contributions
storms+0.072
aces+0.066
 wrists+0.065
case+0.065
chair+0.064
Neg logit contributions
 help-0.011
 cook-0.010
 check-0.010
 little-0.008
 forgive-0.008
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
storms+0.096
 wrists+0.087
case+0.086
aces+0.086
chair+0.085
Neg logit contributions
 help-0.010
 cook-0.010
 check-0.009
 little-0.007
 end-0.007
Token
Token ID220
Feature activation+1.07e+00

Pos logit contributions
storms+0.015
case+0.013
 wrists+0.013
chair+0.013
aces+0.012
Neg logit contributions
 help-0.006
 check-0.005
 cook-0.005
 permission-0.005
 save-0.005
Token the
Token ID262
Feature activation-3.18e-01

Pos logit contributions
 check+0.007
 help+0.007
 cook+0.007
 little+0.006
 end+0.006
Neg logit contributions
storms-0.030
aces-0.027
 wrists-0.027
case-0.026
chair-0.026
Token park
Token ID3952
Feature activation-6.96e-01

Pos logit contributions
 help+0.008
 permission+0.008
 check+0.008
 forgive+0.008
 cook+0.007
Neg logit contributions
storms-0.013
case-0.012
chair-0.011
 wrists-0.011
oline-0.010
Token,
Token ID11
Feature activation-4.78e-01

Pos logit contributions
 help+0.006
 cook+0.006
 little+0.005
 end+0.005
 check+0.005
Neg logit contributions
storms-0.042
 wrists-0.040
aces-0.039
case-0.038
chair-0.037
Token and
Token ID290
Feature activation+5.46e-01

Pos logit contributions
 check+0.008
 help+0.008
 cook+0.008
 little+0.007
 save+0.007
Neg logit contributions
storms-0.026
 wrists-0.023
aces-0.023
case-0.023
chair-0.022
Token have
Token ID423
Feature activation+5.73e-01

Pos logit contributions
storms+0.026
aces+0.023
 wrists+0.023
case+0.023
chair+0.022
Neg logit contributions
 help-0.012
 check-0.012
 cook-0.012
 little-0.011
 save-0.011
Token pic
Token ID8301
Feature activation+4.59e+00

Pos logit contributions
storms+0.031
 wrists+0.028
chair+0.028
case+0.028
alo+0.028
Neg logit contributions
 little-0.011
 help-0.010
 two-0.010
 a-0.009
 cook-0.009
Tokenn
Token ID77
Feature activation-5.33e-01

Pos logit contributions
storms+0.156
case+0.145
aces+0.143
 wrists+0.135
uts+0.128
Neg logit contributions
 check-0.149
 cook-0.147
 little-0.141
 adult-0.133
 chop-0.132
Tokenics
Token ID873
Feature activation+9.66e-01

Pos logit contributions
 check+0.024
 help+0.024
 cook+0.024
 little+0.023
 end+0.023
Neg logit contributions
storms-0.013
 wrists-0.010
aces-0.009
case-0.009
chair-0.009
Token outside
Token ID2354
Feature activation-1.22e+00

Pos logit contributions
storms+0.056
 wrists+0.051
aces+0.050
chair+0.050
case+0.050
Neg logit contributions
 help-0.013
 check-0.011
 forgive-0.011
 little-0.010
 cook-0.010
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 help+0.016
 check+0.015
 cook+0.015
 little+0.014
 end+0.014
Neg logit contributions
storms-0.069
 wrists-0.063
aces-0.062
case-0.062
chair-0.060
Token
Token ID220
Feature activation+6.63e-01

Pos logit contributions
storms+0.007
 wrists+0.007
aces+0.007
case+0.006
chair+0.006
Neg logit contributions
 check-0.003
 little-0.003
 cook-0.003
 help-0.003
 end-0.002
Token Ann
Token ID5506
Feature activation+2.42e+00

Pos logit contributions
 help+0.000
 little+0.000
 check+0.000
 what+0.000
 work+0.000
Neg logit contributions
storms-0.001
aces-0.001
case-0.001
 wrists-0.001
alo-0.001
Token thought
Token ID1807
Feature activation-1.14e+00

Pos logit contributions
storms+0.120
alo+0.109
umm+0.108
 wrists+0.106
case+0.104
Neg logit contributions
 little-0.051
 help-0.045
 two-0.043
 who-0.043
 cook-0.043
Token that
Token ID326
Feature activation+2.37e-01

Pos logit contributions
 help+0.029
 what+0.027
 little+0.027
 work+0.024
 end+0.023
Neg logit contributions
storms-0.055
 wrists-0.050
umm-0.047
case-0.047
oph-0.046
Token hide
Token ID7808
Feature activation+2.47e+00

Pos logit contributions
storms+0.011
 wrists+0.010
aces+0.010
case+0.010
alo+0.010
Neg logit contributions
 help-0.006
 what-0.005
 cook-0.005
 two-0.005
 check-0.005
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
storms+0.119
 wrists+0.107
case+0.107
chair+0.102
aces+0.102
Neg logit contributions
 help-0.051
 little-0.050
 end-0.048
 cook-0.047
 what-0.047
Token seek
Token ID5380
Feature activation+4.57e+00

Pos logit contributions
storms+0.011
case+0.010
aces+0.010
ane+0.009
ays+0.009
Neg logit contributions
 help-0.007
 check-0.006
 little-0.006
 work-0.006
 cook-0.006
Token was
Token ID373
Feature activation+8.35e-01

Pos logit contributions
storms+0.251
 wrists+0.232
oph+0.222
umm+0.219
 Caroline+0.219
Neg logit contributions
 help-0.074
 little-0.066
 what-0.062
 end-0.060
 cook-0.058
Token too
Token ID1165
Feature activation+5.35e-01

Pos logit contributions
storms+0.044
 wrists+0.039
aces+0.039
case+0.039
chair+0.038
Neg logit contributions
 help-0.016
 check-0.016
 little-0.015
 what-0.014
 cook-0.014
Token boring
Token ID14262
Feature activation+5.72e-01

Pos logit contributions
storms+0.028
 wrists+0.025
aces+0.025
case+0.024
chair+0.024
Neg logit contributions
 little-0.010
 check-0.010
 help-0.009
 cook-0.009
 work-0.009
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
storms+0.035
 wrists+0.034
case+0.033
aces+0.032
chair+0.032
Neg logit contributions
 help-0.004
 check-0.003
 work-0.003
 cook-0.003
 little-0.002
Token They
Token ID1119
Feature activation+4.93e-01

Pos logit contributions
storms+0.010
aces+0.009
 wrists+0.009
alo+0.009
case+0.009
Neg logit contributions
 little-0.005
 check-0.005
 help-0.005
 two-0.005
 cook-0.005
Token near
Token ID1474
Feature activation-7.17e-01

Pos logit contributions
 help+0.030
 check+0.029
 little+0.028
 cook+0.028
 end+0.025
Neg logit contributions
storms-0.122
 wrists-0.111
aces-0.110
case-0.107
chair-0.106
Token the
Token ID262
Feature activation-3.56e-01

Pos logit contributions
 check+0.013
 cook+0.012
 help+0.012
asting+0.011
 save+0.011
Neg logit contributions
storms-0.035
 wrists-0.033
case-0.031
aces-0.031
chair-0.030
Token cliff
Token ID19516
Feature activation-2.19e+00

Pos logit contributions
 check+0.012
 help+0.012
 cook+0.012
 little+0.011
 end+0.011
Neg logit contributions
storms-0.013
case-0.011
 wrists-0.011
aces-0.011
chair-0.010
Token and
Token ID290
Feature activation+3.59e-01

Pos logit contributions
 check+0.028
 help+0.028
 cook+0.028
 little+0.026
 end+0.025
Neg logit contributions
storms-0.123
 wrists-0.112
aces-0.111
case-0.109
chair-0.106
Token have
Token ID423
Feature activation+6.86e-01

Pos logit contributions
storms+0.019
aces+0.017
 wrists+0.017
case+0.017
chair+0.016
Neg logit contributions
 check-0.006
 help-0.006
 cook-0.006
 little-0.005
 end-0.005
Token pic
Token ID8301
Feature activation+4.57e+00

Pos logit contributions
storms+0.038
case+0.034
 wrists+0.034
chair+0.033
aces+0.033
Neg logit contributions
 little-0.012
 help-0.012
 two-0.011
 check-0.011
 cook-0.011
Tokenn
Token ID77
Feature activation-4.42e-01

Pos logit contributions
storms+0.162
case+0.147
 wrists+0.143
aces+0.143
uts+0.131
Neg logit contributions
 check-0.142
 cook-0.141
 little-0.135
 adult-0.128
 chop-0.127
Tokenics
Token ID873
Feature activation+1.16e+00

Pos logit contributions
 help+0.021
 check+0.021
 cook+0.021
 little+0.021
 end+0.020
Neg logit contributions
storms-0.009
 wrists-0.007
aces-0.006
case-0.006
chair-0.006
Token.
Token ID13
Feature activation+4.29e-01

Pos logit contributions
storms+0.070
 wrists+0.063
aces+0.063
chair+0.062
case+0.062
Neg logit contributions
 help-0.013
 check-0.012
 cook-0.011
 forgive-0.011
 little-0.010
Token One
Token ID1881
Feature activation-1.61e-01

Pos logit contributions
storms+0.022
 wrists+0.020
case+0.020
aces+0.020
chair+0.019
Neg logit contributions
 check-0.008
 help-0.007
 cook-0.007
 little-0.007
 end-0.007
Token day
Token ID1110
Feature activation+1.38e-01

Pos logit contributions
 permission+0.004
 check+0.004
 help+0.004
 save+0.004
 install+0.004
Neg logit contributions
storms-0.006
chair-0.006
case-0.006
oline-0.005
 Sof-0.005
Token roof
Token ID9753
Feature activation-6.79e-01

Pos logit contributions
 check+0.008
 help+0.008
 cook+0.008
 forgive+0.008
 save+0.007
Neg logit contributions
storms-0.016
case-0.015
 wrists-0.014
aces-0.014
chair-0.014
Token.
Token ID13
Feature activation-3.66e-02

Pos logit contributions
 cook+0.011
 help+0.011
 check+0.011
 little+0.010
 end+0.010
Neg logit contributions
storms-0.036
 wrists-0.033
aces-0.032
case-0.032
chair-0.031
Token They
Token ID1119
Feature activation+2.90e-01

Pos logit contributions
 help+0.001
 cook+0.001
 check+0.001
 little+0.001
 end+0.001
Neg logit contributions
storms-0.002
 wrists-0.002
aces-0.002
case-0.002
chair-0.001
Token would
Token ID561
Feature activation-2.66e-01

Pos logit contributions
storms+0.012
aces+0.010
 wrists+0.010
case+0.010
chair+0.010
Neg logit contributions
 check-0.008
 help-0.008
 cook-0.008
 little-0.008
 end-0.008
Token have
Token ID423
Feature activation-3.95e-01

Pos logit contributions
 check+0.006
 help+0.006
 cook+0.006
 little+0.006
 end+0.006
Neg logit contributions
storms-0.012
 wrists-0.011
case-0.011
aces-0.011
chair-0.010
Token pic
Token ID8301
Feature activation+4.56e+00

Pos logit contributions
 little+0.006
 help+0.006
 cook+0.006
 two+0.006
 a+0.005
Neg logit contributions
storms-0.023
aces-0.021
case-0.021
overed-0.021
chair-0.020
Tokenn
Token ID77
Feature activation-1.10e+00

Pos logit contributions
storms+0.166
case+0.151
 wrists+0.147
aces+0.146
uts+0.136
Neg logit contributions
 check-0.140
 cook-0.136
 little-0.130
 adult-0.125
 permission-0.124
Tokenics
Token ID873
Feature activation+6.45e-01

Pos logit contributions
 help+0.049
 little+0.049
asting+0.048
 cook+0.048
 end+0.047
Neg logit contributions
storms-0.027
 wrists-0.023
aces-0.020
chair-0.018
case-0.018
Token and
Token ID290
Feature activation+2.42e-02

Pos logit contributions
storms+0.039
 wrists+0.036
aces+0.036
chair+0.035
case+0.034
Neg logit contributions
 help-0.008
 cook-0.006
 who-0.006
 forgive-0.006
 little-0.006
Token play
Token ID711
Feature activation+1.18e+00

Pos logit contributions
storms+0.001
aces+0.001
case+0.001
 wrists+0.001
chair+0.001
Neg logit contributions
 cook-0.000
 help-0.000
 check-0.000
 little-0.000
 save-0.000
Token games
Token ID1830
Feature activation+6.91e-01

Pos logit contributions
storms+0.057
 wrists+0.050
aces+0.049
uts+0.048
case+0.048
Neg logit contributions
 cook-0.027
 help-0.027
 check-0.026
 little-0.026
 save-0.024
Token wanted
Token ID2227
Feature activation-1.09e+00

Pos logit contributions
 little+0.018
 cook+0.017
 check+0.017
 help+0.016
 end+0.015
Neg logit contributions
storms-0.044
 wrists-0.042
aces-0.040
uts-0.038
case-0.038
Token to
Token ID284
Feature activation+9.35e-01

Pos logit contributions
 check+0.022
 help+0.022
 cook+0.022
 little+0.021
 end+0.020
Neg logit contributions
storms-0.054
 wrists-0.048
case-0.048
aces-0.048
chair-0.046
Token play
Token ID711
Feature activation+1.13e+00

Pos logit contributions
storms+0.053
aces+0.047
 wrists+0.047
case+0.046
chair+0.045
Neg logit contributions
 help-0.014
 check-0.014
 cook-0.014
 little-0.012
 save-0.012
Token hide
Token ID7808
Feature activation+5.92e-01

Pos logit contributions
storms+0.055
aces+0.048
case+0.048
 wrists+0.048
chair+0.047
Neg logit contributions
 check-0.023
 cook-0.022
 help-0.022
 little-0.021
 end-0.020
Token and
Token ID290
Feature activation-6.14e-01

Pos logit contributions
storms+0.033
 wrists+0.029
case+0.029
aces+0.029
chair+0.028
Neg logit contributions
 check-0.008
 help-0.008
 cook-0.008
 little-0.008
 end-0.007
Token seek
Token ID5380
Feature activation+4.54e+00

Pos logit contributions
 check+0.022
 help+0.022
 cook+0.021
 little+0.021
 end+0.020
Neg logit contributions
storms-0.021
 wrists-0.017
aces-0.017
case-0.017
chair-0.016
Token with
Token ID351
Feature activation+1.08e+00

Pos logit contributions
storms+0.287
 wrists+0.257
case+0.250
aces+0.248
uts+0.246
Neg logit contributions
 help-0.039
 check-0.035
 little-0.033
 end-0.033
 cook-0.031
Token its
Token ID663
Feature activation+1.55e+00

Pos logit contributions
storms+0.069
case+0.061
 wrists+0.061
aces+0.061
uts+0.060
Neg logit contributions
 help-0.013
 little-0.011
 check-0.010
 end-0.008
 two-0.008
Token friends
Token ID2460
Feature activation+1.46e+00

Pos logit contributions
storms+0.091
 wrists+0.078
aces+0.077
chair+0.077
case+0.076
Neg logit contributions
 little-0.025
 help-0.025
 check-0.020
 two-0.019
 end-0.019
Token.
Token ID13
Feature activation+4.54e-01

Pos logit contributions
storms+0.083
 wrists+0.074
aces+0.074
case+0.074
chair+0.072
Neg logit contributions
 check-0.018
 help-0.018
 cook-0.018
 little-0.016
 end-0.015
Token The
Token ID383
Feature activation-2.11e-01

Pos logit contributions
storms+0.024
 wrists+0.021
aces+0.021
case+0.021
chair+0.021
Neg logit contributions
 check-0.008
 help-0.007
 cook-0.007
 little-0.007
 end-0.007
Token wanted
Token ID2227
Feature activation-1.12e+00

Pos logit contributions
storms+0.059
case+0.049
aces+0.049
 wrists+0.049
chair+0.048
Neg logit contributions
 check-0.049
 help-0.049
 cook-0.049
 permission-0.048
 save-0.047
Token to
Token ID284
Feature activation+6.41e-01

Pos logit contributions
 help+0.021
 check+0.020
 cook+0.019
 little+0.019
 save+0.017
Neg logit contributions
storms-0.059
case-0.053
aces-0.053
 wrists-0.053
chair-0.051
Token play
Token ID711
Feature activation+9.92e-01

Pos logit contributions
storms+0.036
 wrists+0.032
aces+0.032
case+0.032
chair+0.031
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 end-0.007
Token hide
Token ID7808
Feature activation+4.67e-01

Pos logit contributions
storms+0.050
 wrists+0.044
case+0.044
aces+0.044
chair+0.043
Neg logit contributions
 check-0.019
 help-0.019
 cook-0.018
 little-0.017
 end-0.017
Token and
Token ID290
Feature activation-6.87e-01

Pos logit contributions
storms+0.026
 wrists+0.023
case+0.023
aces+0.023
chair+0.022
Neg logit contributions
 check-0.007
 help-0.007
 cook-0.007
 little-0.006
 end-0.006
Token seek
Token ID5380
Feature activation+4.48e+00

Pos logit contributions
 check+0.026
 help+0.026
 cook+0.025
 little+0.025
 end+0.024
Neg logit contributions
storms-0.021
 wrists-0.018
case-0.018
aces-0.017
chair-0.016
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
storms+0.286
 wrists+0.253
case+0.251
aces+0.247
uts+0.246
Neg logit contributions
 help-0.032
 little-0.028
 check-0.028
 cook-0.027
 end-0.025
Token Billy
Token ID15890
Feature activation-6.57e-01

Pos logit contributions
storms+0.023
case+0.020
 wrists+0.020
chair+0.019
aces+0.019
Neg logit contributions
 cook-0.010
 check-0.009
 help-0.009
asting-0.009
 permission-0.009
Token loved
Token ID6151
Feature activation-1.57e+00

Pos logit contributions
 little+0.012
 cook+0.010
 check+0.010
 pick+0.010
 help+0.009
Neg logit contributions
storms-0.035
aces-0.032
 wrists-0.032
alo-0.032
umm-0.031
Token hide
Token ID7808
Feature activation+8.75e-01

Pos logit contributions
 help+0.022
 check+0.020
 little+0.020
 cook+0.019
 what+0.019
Neg logit contributions
storms-0.089
case-0.088
aces-0.087
 wrists-0.084
ement-0.083
Token and
Token ID290
Feature activation-8.16e-01

Pos logit contributions
storms+0.050
case+0.045
 wrists+0.045
aces+0.044
chair+0.043
Neg logit contributions
 little-0.011
 help-0.011
 check-0.010
 cook-0.010
 end-0.009
Token particularly
Token ID3573
Feature activation-3.22e-02

Pos logit contributions
storms+0.035
umm+0.032
ement+0.032
uts+0.031
 wrists+0.031
Neg logit contributions
 little-0.010
 cook-0.009
 take-0.009
 check-0.009
 help-0.009
Token liked
Token ID8288
Feature activation-7.45e-01

Pos logit contributions
 check+0.001
 cook+0.001
 save+0.001
 help+0.001
 permission+0.001
Neg logit contributions
storms-0.001
aces-0.001
case-0.001
chair-0.001
 wrists-0.001
Token playing
Token ID2712
Feature activation-1.77e-01

Pos logit contributions
 little+0.008
 help+0.008
 cook+0.007
 work+0.006
 check+0.006
Neg logit contributions
case-0.044
storms-0.044
aces-0.043
ement-0.043
overed-0.041
Token hide
Token ID7808
Feature activation+3.70e-01

Pos logit contributions
 help+0.005
 cook+0.005
 check+0.005
 little+0.005
 end+0.004
Neg logit contributions
storms-0.008
 wrists-0.007
case-0.006
aces-0.006
chair-0.006
Token and
Token ID290
Feature activation-7.58e-01

Pos logit contributions
storms+0.021
 wrists+0.019
case+0.019
aces+0.019
chair+0.018
Neg logit contributions
 little-0.005
 cook-0.004
 help-0.004
 check-0.004
 end-0.004
Token seek
Token ID5380
Feature activation+4.47e+00

Pos logit contributions
 check+0.029
 help+0.029
 cook+0.028
 little+0.028
 end+0.027
Neg logit contributions
storms-0.023
 wrists-0.019
case-0.019
aces-0.019
chair-0.018
Token with
Token ID351
Feature activation+1.12e+00

Pos logit contributions
storms+0.263
 wrists+0.230
case+0.226
uts+0.223
chair+0.223
Neg logit contributions
 little-0.056
 help-0.053
 end-0.050
 cook-0.050
 check-0.049
Token her
Token ID607
Feature activation+1.05e+00

Pos logit contributions
storms+0.065
case+0.058
chair+0.055
overed+0.055
aces+0.055
Neg logit contributions
 help-0.018
 little-0.018
 two-0.016
 check-0.015
 what-0.015
Token friends
Token ID2460
Feature activation+1.61e+00

Pos logit contributions
storms+0.067
case+0.058
rol+0.057
overed+0.057
chair+0.057
Neg logit contributions
 little-0.014
 help-0.012
 two-0.010
 work-0.007
 cook-0.007
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
storms+0.084
aces+0.076
 wrists+0.075
case+0.074
chair+0.072
Neg logit contributions
 check-0.026
 cook-0.025
 help-0.024
 save-0.023
 little-0.023
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
storms+0.008
 wrists+0.007
aces+0.007
case+0.007
chair+0.007
Neg logit contributions
 check-0.001
 help-0.001
 cook-0.001
 little-0.001
 end-0.001
Token wanted
Token ID2227
Feature activation-1.10e+00

Pos logit contributions
 cook+0.026
 little+0.025
 help+0.025
 check+0.024
 end+0.024
Neg logit contributions
storms-0.060
 wrists-0.056
aces-0.055
case-0.052
 Caroline-0.052
Token to
Token ID284
Feature activation+8.81e-01

Pos logit contributions
 check+0.024
 help+0.023
 cook+0.023
 little+0.022
 end+0.021
Neg logit contributions
storms-0.053
 wrists-0.047
case-0.046
aces-0.046
chair-0.045
Token play
Token ID711
Feature activation+1.11e+00

Pos logit contributions
storms+0.048
case+0.043
 wrists+0.043
aces+0.042
chair+0.041
Neg logit contributions
 check-0.012
 help-0.012
 cook-0.012
 little-0.011
 end-0.011
Token hide
Token ID7808
Feature activation+4.68e-01

Pos logit contributions
storms+0.052
aces+0.046
case+0.046
 wrists+0.045
chair+0.044
Neg logit contributions
 check-0.024
 cook-0.023
 help-0.022
 little-0.021
 save-0.021
Token and
Token ID290
Feature activation-5.76e-01

Pos logit contributions
storms+0.026
 wrists+0.024
case+0.024
aces+0.023
chair+0.023
Neg logit contributions
 check-0.006
 help-0.006
 cook-0.006
 little-0.006
 end-0.005
Token seek
Token ID5380
Feature activation+4.43e+00

Pos logit contributions
 check+0.020
 help+0.020
 cook+0.020
 little+0.020
 end+0.019
Neg logit contributions
storms-0.020
aces-0.016
 wrists-0.016
case-0.016
chair-0.015
Token with
Token ID351
Feature activation+1.03e+00

Pos logit contributions
storms+0.281
 wrists+0.251
case+0.245
uts+0.241
aces+0.241
Neg logit contributions
 help-0.038
 check-0.034
 little-0.032
 end-0.032
 cook-0.031
Token the
Token ID262
Feature activation+6.97e-01

Pos logit contributions
storms+0.062
case+0.056
aces+0.055
chair+0.054
 wrists+0.054
Neg logit contributions
 help-0.014
 little-0.012
 check-0.012
 cook-0.011
 end-0.011
Token sun
Token ID4252
Feature activation+6.69e-01

Pos logit contributions
storms+0.035
 wrists+0.031
case+0.031
aces+0.031
chair+0.030
Neg logit contributions
 check-0.013
 cook-0.012
 help-0.012
 little-0.011
 save-0.011
Token.
Token ID13
Feature activation+5.33e-01

Pos logit contributions
storms+0.042
case+0.040
 wrists+0.040
 Caroline+0.039
chair+0.039
Neg logit contributions
 cook-0.004
 help-0.003
 little-0.002
 pick-0.002
 check-0.002
Token The
Token ID383
Feature activation-3.89e-01

Pos logit contributions
storms+0.028
 wrists+0.025
aces+0.025
case+0.024
uts+0.024
Neg logit contributions
 little-0.010
 cook-0.009
 help-0.009
 check-0.009
 pick-0.008
Token wanted
Token ID2227
Feature activation-1.25e+00

Pos logit contributions
storms+0.043
 wrists+0.037
case+0.037
aces+0.037
chair+0.036
Neg logit contributions
 check-0.025
 help-0.025
 cook-0.025
 little-0.024
 end-0.023
Token to
Token ID284
Feature activation+7.00e-01

Pos logit contributions
 help+0.022
 check+0.021
 cook+0.021
 little+0.019
 pick+0.018
Neg logit contributions
storms-0.068
aces-0.062
case-0.061
 wrists-0.061
chair-0.058
Token play
Token ID711
Feature activation+9.32e-01

Pos logit contributions
storms+0.037
case+0.034
 wrists+0.034
aces+0.033
chair+0.033
Neg logit contributions
 check-0.010
 cook-0.009
 help-0.009
 little-0.009
 end-0.009
Token hide
Token ID7808
Feature activation+3.32e-01

Pos logit contributions
storms+0.046
 wrists+0.041
case+0.041
aces+0.041
chair+0.039
Neg logit contributions
 check-0.018
 help-0.018
 cook-0.018
 little-0.017
 end-0.016
Token and
Token ID290
Feature activation-6.89e-01

Pos logit contributions
storms+0.018
 wrists+0.016
case+0.016
aces+0.016
chair+0.016
Neg logit contributions
 check-0.005
 help-0.005
 cook-0.005
 little-0.004
 end-0.004
Token seek
Token ID5380
Feature activation+4.43e+00

Pos logit contributions
 check+0.026
 help+0.025
 cook+0.025
 little+0.024
 end+0.024
Neg logit contributions
storms-0.022
 wrists-0.018
case-0.018
aces-0.018
chair-0.017
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
storms+0.272
 wrists+0.242
case+0.237
aces+0.234
uts+0.233
Neg logit contributions
 help-0.045
 check-0.041
 little-0.039
 cook-0.038
 end-0.038
Token but
Token ID475
Feature activation+2.09e-01

Pos logit contributions
storms+0.016
 wrists+0.015
aces+0.015
case+0.014
chair+0.014
Neg logit contributions
 little-0.006
 help-0.006
 check-0.006
 cook-0.006
 pick-0.006
Token it
Token ID340
Feature activation+1.34e+00

Pos logit contributions
storms+0.011
aces+0.010
case+0.010
uts+0.010
 wrists+0.010
Neg logit contributions
 end-0.004
 help-0.004
 little-0.004
 check-0.004
 what-0.004
Token was
Token ID373
Feature activation+1.08e+00

Pos logit contributions
 wrists+0.077
storms+0.077
aces+0.074
uts+0.071
alo+0.070
Neg logit contributions
 little-0.019
 end-0.017
 help-0.016
 check-0.015
 cook-0.014
Token getting
Token ID1972
Feature activation-9.89e-01

Pos logit contributions
storms+0.054
 wrists+0.050
aces+0.049
case+0.047
chair+0.046
Neg logit contributions
 check-0.021
 help-0.021
 little-0.021
 end-0.021
 cook-0.020
Token wanted
Token ID2227
Feature activation-1.13e+00

Pos logit contributions
storms+0.048
aces+0.044
 wrists+0.043
case+0.042
uts+0.041
Neg logit contributions
 cook-0.019
 little-0.018
 check-0.018
 help-0.016
 pick-0.016
Token to
Token ID284
Feature activation+9.28e-01

Pos logit contributions
 help+0.018
 check+0.016
 cook+0.016
 little+0.015
 pick+0.014
Neg logit contributions
storms-0.065
aces-0.059
case-0.058
 wrists-0.058
overed-0.055
Token play
Token ID711
Feature activation+8.76e-01

Pos logit contributions
storms+0.053
aces+0.047
 wrists+0.047
case+0.046
chair+0.045
Neg logit contributions
 help-0.013
 check-0.013
 cook-0.013
 little-0.011
 save-0.011
Token hide
Token ID7808
Feature activation+4.97e-01

Pos logit contributions
storms+0.050
 wrists+0.045
aces+0.043
case+0.043
uts+0.043
Neg logit contributions
 help-0.014
 cook-0.013
 check-0.013
 little-0.012
 end-0.012
Token and
Token ID290
Feature activation-6.21e-01

Pos logit contributions
storms+0.030
 wrists+0.027
case+0.026
aces+0.026
chair+0.025
Neg logit contributions
 little-0.005
 help-0.005
 cook-0.005
 check-0.005
 end-0.004
Token seek
Token ID5380
Feature activation+4.42e+00

Pos logit contributions
 check+0.022
 help+0.022
 cook+0.021
 little+0.021
 end+0.020
Neg logit contributions
storms-0.022
aces-0.018
 wrists-0.018
case-0.018
chair-0.017
Token in
Token ID287
Feature activation+4.88e-01

Pos logit contributions
storms+0.280
 wrists+0.248
case+0.241
uts+0.240
aces+0.238
Neg logit contributions
 help-0.039
 little-0.034
 check-0.034
 end-0.033
 cook-0.031
Token the
Token ID262
Feature activation+3.64e-01

Pos logit contributions
storms+0.031
aces+0.028
uts+0.027
 wrists+0.027
case+0.027
Neg logit contributions
 help-0.006
 check-0.005
 end-0.005
 little-0.005
 cook-0.004
Token garden
Token ID11376
Feature activation-2.40e-01

Pos logit contributions
storms+0.018
aces+0.016
 wrists+0.015
case+0.015
uts+0.015
Neg logit contributions
 check-0.008
 help-0.008
 cook-0.008
 little-0.008
 end-0.007
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
 help+0.002
 cook+0.002
 check+0.002
 end+0.002
 little+0.002
Neg logit contributions
storms-0.015
 wrists-0.014
aces-0.013
case-0.013
uts-0.013
Token She
Token ID1375
Feature activation+8.10e-01

Pos logit contributions
storms+0.012
alo+0.010
lights+0.010
aces+0.010
rol+0.010
Neg logit contributions
 little-0.006
 pick-0.005
 check-0.005
 ask-0.005
 help-0.005
Token wanted
Token ID2227
Feature activation-1.30e+00

Pos logit contributions
storms+0.039
alo+0.037
aces+0.036
umm+0.035
lights+0.035
Neg logit contributions
 little-0.012
 pick-0.010
 cook-0.010
 take-0.009
 who-0.009
Token to
Token ID284
Feature activation+8.15e-01

Pos logit contributions
 help+0.021
 check+0.018
 cook+0.018
 little+0.017
 what+0.016
Neg logit contributions
storms-0.075
case-0.068
aces-0.068
 wrists-0.067
chair-0.065
Token play
Token ID711
Feature activation+9.87e-01

Pos logit contributions
storms+0.047
aces+0.042
 wrists+0.042
case+0.041
chair+0.041
Neg logit contributions
 help-0.011
 check-0.011
 cook-0.010
 little-0.009
 save-0.009
Token hide
Token ID7808
Feature activation+6.01e-01

Pos logit contributions
storms+0.055
 wrists+0.049
aces+0.047
case+0.047
chair+0.047
Neg logit contributions
 help-0.017
 cook-0.016
 check-0.015
 little-0.015
 end-0.015
Token and
Token ID290
Feature activation-7.22e-01

Pos logit contributions
storms+0.036
 wrists+0.032
case+0.032
aces+0.031
chair+0.031
Neg logit contributions
 little-0.007
 help-0.006
 cook-0.006
 check-0.006
 end-0.006
Token seek
Token ID5380
Feature activation+4.42e+00

Pos logit contributions
 help+0.026
 check+0.026
 cook+0.026
 little+0.025
 end+0.025
Neg logit contributions
storms-0.025
aces-0.020
 wrists-0.020
case-0.020
chair-0.019
Token and
Token ID290
Feature activation+4.59e-01

Pos logit contributions
storms+0.273
 wrists+0.242
case+0.233
chair+0.233
uts+0.232
Neg logit contributions
 help-0.048
 little-0.044
 end-0.043
 what-0.042
 check-0.039
Token Ann
Token ID5506
Feature activation+2.86e+00

Pos logit contributions
storms+0.023
aces+0.020
 wrists+0.019
case+0.019
uts+0.019
Neg logit contributions
 help-0.011
 check-0.010
 cook-0.010
 save-0.010
 little-0.010
Token wanted
Token ID2227
Feature activation-9.45e-01

Pos logit contributions
storms+0.149
alo+0.133
umm+0.133
uts+0.128
lights+0.127
Neg logit contributions
 help-0.058
 little-0.057
 cook-0.051
 take-0.051
 check-0.050
Token to
Token ID284
Feature activation+8.91e-01

Pos logit contributions
 help+0.016
 check+0.013
 what+0.012
 little+0.012
 cook+0.012
Neg logit contributions
storms-0.055
aces-0.050
case-0.050
chair-0.049
 wrists-0.049
Token play
Token ID711
Feature activation+1.36e+00

Pos logit contributions
storms+0.053
aces+0.047
 wrists+0.046
chair+0.046
case+0.044
Neg logit contributions
 help-0.015
 check-0.013
 cook-0.013
 save-0.012
 pick-0.011
Token felt
Token ID2936
Feature activation-6.35e-01

Pos logit contributions
 cook+0.033
 check+0.033
 help+0.032
 little+0.032
 end+0.030
Neg logit contributions
storms-0.066
 wrists-0.059
aces-0.058
case-0.058
chair-0.055
Token like
Token ID588
Feature activation-1.25e+00

Pos logit contributions
ruck+0.019
 check+0.018
 cook+0.018
 adult+0.018
 install+0.018
Neg logit contributions
storms-0.021
aces-0.018
case-0.018
chair-0.017
 none-0.016
Token playing
Token ID2712
Feature activation+1.73e-01

Pos logit contributions
 check+0.019
 cook+0.018
 help+0.018
 end+0.016
 little+0.016
Neg logit contributions
storms-0.068
aces-0.060
 wrists-0.060
case-0.060
chair-0.058
Token hide
Token ID7808
Feature activation+3.15e-01

Pos logit contributions
storms+0.008
aces+0.007
case+0.007
 wrists+0.006
chair+0.006
Neg logit contributions
 check-0.004
 permission-0.004
 cook-0.004
 adult-0.004
 help-0.004
Token and
Token ID290
Feature activation-5.87e-01

Pos logit contributions
storms+0.018
 wrists+0.016
case+0.016
aces+0.016
chair+0.015
Neg logit contributions
 check-0.004
 help-0.004
 cook-0.004
 little-0.004
 end-0.004
Token seek
Token ID5380
Feature activation+4.41e+00

Pos logit contributions
 check+0.023
 help+0.022
 cook+0.022
 little+0.021
 end+0.021
Neg logit contributions
storms-0.017
 wrists-0.014
case-0.014
aces-0.014
chair-0.013
Token.
Token ID13
Feature activation+4.04e-01

Pos logit contributions
storms+0.274
 wrists+0.251
case+0.240
aces+0.237
 Caroline+0.236
Neg logit contributions
 little-0.036
 help-0.034
 cook-0.032
 end-0.031
 check-0.030
Token She
Token ID1375
Feature activation+7.41e-01

Pos logit contributions
storms+0.019
 wrists+0.016
aces+0.016
case+0.016
chair+0.016
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.009
 end-0.008
Token sprang
Token ID43908
Feature activation+3.34e-01

Pos logit contributions
storms+0.041
uts+0.037
 wrists+0.036
aces+0.036
ement+0.035
Neg logit contributions
 little-0.014
 check-0.012
 cook-0.012
 help-0.012
 take-0.011
Token down
Token ID866
Feature activation+1.15e+00

Pos logit contributions
storms+0.018
 wrists+0.017
aces+0.016
chair+0.016
case+0.016
Neg logit contributions
 little-0.006
 check-0.005
 help-0.005
 two-0.005
 pick-0.005
Token a
Token ID257
Feature activation+2.05e-02

Pos logit contributions
storms+0.063
 wrists+0.058
aces+0.057
case+0.056
chair+0.054
Neg logit contributions
 help-0.018
 check-0.017
 little-0.016
 cook-0.016
 end-0.015
Token went
Token ID1816
Feature activation+4.36e-01

Pos logit contributions
storms+0.014
case+0.012
aces+0.012
chair+0.012
 wrists+0.012
Neg logit contributions
 help-0.008
 end-0.008
 cook-0.008
 check-0.008
 little-0.008
Token on
Token ID319
Feature activation-6.37e-02

Pos logit contributions
storms+0.023
 wrists+0.021
aces+0.021
case+0.020
chair+0.020
Neg logit contributions
 help-0.008
 check-0.007
 cook-0.007
 little-0.007
 work-0.006
Token adventures
Token ID17545
Feature activation-1.08e+00

Pos logit contributions
 little+0.002
 help+0.001
 work+0.001
 check+0.001
 end+0.001
Neg logit contributions
storms-0.003
aces-0.003
uts-0.003
 wrists-0.003
case-0.003
Token,
Token ID11
Feature activation-6.63e-02

Pos logit contributions
 help+0.011
 little+0.008
 who+0.008
 work+0.007
 cook+0.007
Neg logit contributions
storms-0.064
 wrists-0.062
aces-0.059
oph-0.059
alo-0.059
Token had
Token ID550
Feature activation-4.54e-01

Pos logit contributions
 little+0.001
 help+0.001
 cook+0.001
 check+0.001
 pick+0.001
Neg logit contributions
storms-0.004
case-0.003
 wrists-0.003
aces-0.003
chair-0.003
Token pic
Token ID8301
Feature activation+4.40e+00

Pos logit contributions
 little+0.008
 two+0.008
 a+0.007
 help+0.007
 other+0.006
Neg logit contributions
storms-0.025
chair-0.024
case-0.024
alo-0.023
 wrists-0.023
Tokenn
Token ID77
Feature activation-8.56e-01

Pos logit contributions
storms+0.170
case+0.152
aces+0.150
 wrists+0.147
uts+0.139
Neg logit contributions
 check-0.130
 cook-0.128
 little-0.119
 help-0.117
 chop-0.114
Tokenics
Token ID873
Feature activation+8.69e-01

Pos logit contributions
 little+0.037
 help+0.037
asting+0.036
 check+0.036
 cook+0.036
Neg logit contributions
storms-0.021
 wrists-0.018
aces-0.015
case-0.015
chair-0.015
Token,
Token ID11
Feature activation-6.44e-02

Pos logit contributions
storms+0.054
 wrists+0.050
chair+0.049
aces+0.049
case+0.048
Neg logit contributions
 help-0.008
 who-0.007
 forgive-0.006
 little-0.006
 and-0.005
Token and
Token ID290
Feature activation+4.97e-01

Pos logit contributions
 little+0.001
 help+0.001
 cook+0.001
 check+0.001
 pick+0.001
Neg logit contributions
storms-0.004
case-0.004
 wrists-0.003
chair-0.003
aces-0.003
Token even
Token ID772
Feature activation+3.76e-01

Pos logit contributions
storms+0.022
case+0.019
 wrists+0.019
aces+0.019
chair+0.019
Neg logit contributions
 help-0.012
 check-0.012
 cook-0.012
 little-0.012
 save-0.011
Token wanted
Token ID2227
Feature activation-1.14e+00

Pos logit contributions
storms+0.014
 wrists+0.014
lights+0.013
case+0.013
aces+0.013
Neg logit contributions
 help-0.006
 little-0.005
 forgive-0.005
 cook-0.005
 take-0.005
Token to
Token ID284
Feature activation+8.72e-01

Pos logit contributions
 help+0.023
 check+0.022
 cook+0.021
 little+0.020
 save+0.020
Neg logit contributions
storms-0.058
case-0.053
 wrists-0.052
aces-0.052
chair-0.050
Token play
Token ID711
Feature activation+9.76e-01

Pos logit contributions
storms+0.049
 wrists+0.044
case+0.044
aces+0.043
chair+0.042
Neg logit contributions
 check-0.012
 help-0.011
 cook-0.011
 little-0.010
 end-0.010
Token hide
Token ID7808
Feature activation+3.69e-01

Pos logit contributions
storms+0.052
 wrists+0.047
case+0.046
aces+0.045
chair+0.044
Neg logit contributions
 help-0.018
 check-0.017
 cook-0.016
 little-0.016
 forgive-0.015
Token and
Token ID290
Feature activation-5.35e-01

Pos logit contributions
storms+0.021
 wrists+0.019
case+0.019
aces+0.018
chair+0.018
Neg logit contributions
 help-0.005
 little-0.005
 check-0.004
 cook-0.004
 end-0.004
Token seek
Token ID5380
Feature activation+4.39e+00

Pos logit contributions
 check+0.020
 help+0.020
 cook+0.020
 little+0.019
 end+0.019
Neg logit contributions
storms-0.017
 wrists-0.014
aces-0.014
case-0.014
chair-0.013
Token.
Token ID13
Feature activation+6.61e-03

Pos logit contributions
storms+0.272
 wrists+0.243
case+0.236
uts+0.234
aces+0.230
Neg logit contributions
 help-0.048
 end-0.040
 little-0.039
 check-0.039
 forgive-0.037
Token Spark
Token ID17732
Feature activation+6.99e-01

Pos logit contributions
storms+0.000
aces+0.000
case+0.000
 wrists+0.000
chair+0.000
Neg logit contributions
 help-0.000
 little-0.000
 check-0.000
 cook-0.000
 pick-0.000
Tokeny
Token ID88
Feature activation+3.67e-01

Pos logit contributions
storms+0.032
 wrists+0.030
case+0.029
aces+0.028
chair+0.028
Neg logit contributions
 little-0.015
 check-0.014
 end-0.014
 cook-0.014
 forgive-0.013
Token said
Token ID531
Feature activation+3.16e-01

Pos logit contributions
storms+0.021
umm+0.019
alo+0.019
rol+0.019
ays+0.019
Neg logit contributions
 little-0.006
 two-0.005
 cook-0.004
 end-0.004
 help-0.004
Token yes
Token ID3763
Feature activation+7.71e-01

Pos logit contributions
storms+0.017
 wrists+0.015
aces+0.015
case+0.015
ays+0.015
Neg logit contributions
 help-0.007
 little-0.007
 forgive-0.006
 what-0.006
 end-0.006
Token play
Token ID711
Feature activation+1.20e+00

Pos logit contributions
storms+0.000
aces+0.000
case+0.000
 wrists+0.000
chair+0.000
Neg logit contributions
 check-0.000
 help-0.000
 cook-0.000
 little-0.000
 pick-0.000
Token tag
Token ID7621
Feature activation+2.33e+00

Pos logit contributions
storms+0.063
 wrists+0.057
aces+0.055
case+0.055
uts+0.054
Neg logit contributions
 help-0.023
 cook-0.022
 little-0.021
 check-0.021
 end-0.019
Token and
Token ID290
Feature activation+2.57e-01

Pos logit contributions
storms+0.149
 wrists+0.139
aces+0.136
umm+0.136
uts+0.135
Neg logit contributions
 little-0.017
 help-0.013
 what-0.012
 end-0.010
 cook-0.009
Token hide
Token ID7808
Feature activation+1.64e+00

Pos logit contributions
storms+0.013
aces+0.011
case+0.011
 wrists+0.011
chair+0.011
Neg logit contributions
 help-0.006
 cook-0.006
 check-0.006
 little-0.005
 save-0.005
Token and
Token ID290
Feature activation-1.94e-01

Pos logit contributions
storms+0.085
case+0.078
chair+0.074
 wrists+0.074
aces+0.074
Neg logit contributions
 little-0.030
 help-0.029
 check-0.028
 cook-0.027
 end-0.026
Token seek
Token ID5380
Feature activation+4.38e+00

Pos logit contributions
 help+0.007
 cook+0.006
 check+0.006
 little+0.006
 save+0.006
Neg logit contributions
storms-0.007
aces-0.006
case-0.006
 wrists-0.006
chair-0.006
Token,
Token ID11
Feature activation+5.32e-01

Pos logit contributions
storms+0.271
uts+0.240
 wrists+0.239
case+0.238
chair+0.237
Neg logit contributions
 little-0.042
 help-0.038
 end-0.033
 who-0.031
 what-0.030
Token and
Token ID290
Feature activation+8.08e-01

Pos logit contributions
storms+0.028
aces+0.026
 wrists+0.026
case+0.026
chair+0.025
Neg logit contributions
 help-0.009
 cook-0.009
 check-0.009
 little-0.008
 pick-0.008
Token tell
Token ID1560
Feature activation+4.84e-01

Pos logit contributions
storms+0.040
aces+0.036
case+0.036
 wrists+0.035
uts+0.035
Neg logit contributions
 help-0.018
 cook-0.017
 check-0.017
 little-0.016
 save-0.016
Token one
Token ID530
Feature activation+1.04e+00

Pos logit contributions
storms+0.022
case+0.020
aces+0.020
 wrists+0.019
chair+0.019
Neg logit contributions
 help-0.012
 little-0.012
 check-0.012
 cook-0.012
 end-0.011
Token another
Token ID1194
Feature activation+1.61e+00

Pos logit contributions
storms+0.055
aces+0.049
 wrists+0.048
case+0.048
chair+0.046
Neg logit contributions
 help-0.018
 check-0.018
 cook-0.018
 little-0.018
 end-0.017
Token playing
Token ID2712
Feature activation-1.00e-01

Pos logit contributions
 check+0.010
 help+0.010
 cook+0.010
 little+0.009
 end+0.009
Neg logit contributions
storms-0.019
 wrists-0.017
case-0.016
aces-0.016
chair-0.016
Token tag
Token ID7621
Feature activation+2.07e+00

Pos logit contributions
 check+0.003
 help+0.002
 cook+0.002
 permission+0.002
 little+0.002
Neg logit contributions
storms-0.004
 wrists-0.004
aces-0.004
case-0.004
chair-0.004
Token and
Token ID290
Feature activation+2.80e-01

Pos logit contributions
storms+0.124
aces+0.109
case+0.109
 wrists+0.109
chair+0.106
Neg logit contributions
 little-0.022
 cook-0.022
 help-0.021
 end-0.021
 check-0.019
Token hide
Token ID7808
Feature activation+1.60e+00

Pos logit contributions
storms+0.011
case+0.010
aces+0.010
 wrists+0.009
chair+0.009
Neg logit contributions
 help-0.008
 cook-0.008
 check-0.008
 little-0.008
 end-0.008
Token and
Token ID290
Feature activation-3.64e-01

Pos logit contributions
storms+0.082
case+0.072
 wrists+0.072
aces+0.072
chair+0.070
Neg logit contributions
 check-0.030
 help-0.029
 cook-0.029
 little-0.028
 end-0.026
Token seek
Token ID5380
Feature activation+4.37e+00

Pos logit contributions
 help+0.013
 check+0.013
 cook+0.013
 little+0.012
 save+0.012
Neg logit contributions
storms-0.013
aces-0.010
case-0.010
 wrists-0.010
chair-0.010
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
storms+0.280
case+0.248
 wrists+0.245
chair+0.245
uts+0.244
Neg logit contributions
 little-0.029
 help-0.028
 end-0.027
 cook-0.025
 check-0.022
Token They
Token ID1119
Feature activation+4.87e-01

Pos logit contributions
storms+0.012
case+0.011
aces+0.011
 wrists+0.010
chair+0.010
Neg logit contributions
 cook-0.007
 little-0.007
 check-0.007
 help-0.007
 wash-0.006
Token were
Token ID547
Feature activation+1.09e+00

Pos logit contributions
storms+0.024
uts+0.021
umm+0.021
aces+0.021
ement+0.021
Neg logit contributions
 help-0.011
 pick-0.011
 cook-0.011
 check-0.011
 take-0.011
Token fast
Token ID3049
Feature activation-1.96e-01

Pos logit contributions
storms+0.067
chair+0.060
ane+0.059
case+0.058
oph+0.058
Neg logit contributions
 little-0.013
 two-0.012
 help-0.011
 cook-0.011
 what-0.010
Token and
Token ID290
Feature activation-2.85e-01

Pos logit contributions
 little+0.002
 help+0.002
 check+0.002
 work+0.002
 cook+0.002
Neg logit contributions
storms-0.011
case-0.011
 wrists-0.010
aces-0.010
chair-0.010
Token see
Token ID766
Feature activation-1.56e+00

Pos logit contributions
 adult+0.011
ruck+0.011
 permission+0.010
 little+0.010
asting+0.010
Neg logit contributions
storms-0.022
 wrists-0.021
case-0.021
aces-0.019
 pic-0.019
Token the
Token ID262
Feature activation+3.78e-02

Pos logit contributions
 check+0.023
 help+0.023
 cook+0.023
 little+0.021
 end+0.020
Neg logit contributions
storms-0.085
 wrists-0.076
case-0.076
aces-0.076
chair-0.073
Token bird
Token ID6512
Feature activation+3.91e-03

Pos logit contributions
storms+0.002
aces+0.001
 wrists+0.001
case+0.001
chair+0.001
Neg logit contributions
 check-0.001
 help-0.001
 cook-0.001
 little-0.001
 end-0.001
Token and
Token ID290
Feature activation+6.45e-01

Pos logit contributions
storms+0.000
case+0.000
 wrists+0.000
aces+0.000
chair+0.000
Neg logit contributions
 check-0.000
 help-0.000
 cook-0.000
 little-0.000
 save-0.000
Token have
Token ID423
Feature activation+3.35e-01

Pos logit contributions
storms+0.033
aces+0.029
case+0.029
 wrists+0.028
chair+0.028
Neg logit contributions
 check-0.013
 help-0.013
 cook-0.013
 little-0.012
 save-0.012
Token pic
Token ID8301
Feature activation+4.37e+00

Pos logit contributions
storms+0.020
case+0.018
chair+0.018
aces+0.018
overed+0.018
Neg logit contributions
 little-0.005
 two-0.005
 help-0.005
 a-0.004
 cook-0.004
Tokenn
Token ID77
Feature activation-6.47e-01

Pos logit contributions
storms+0.149
case+0.137
aces+0.135
 wrists+0.133
uts+0.121
Neg logit contributions
 cook-0.137
 check-0.136
 little-0.131
 adult-0.124
 chop-0.123
Tokenics
Token ID873
Feature activation+9.15e-01

Pos logit contributions
 check+0.032
 help+0.032
 cook+0.032
 little+0.031
 end+0.031
Neg logit contributions
storms-0.012
 wrists-0.009
aces-0.008
case-0.008
chair-0.007
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
storms+0.052
aces+0.048
 wrists+0.047
case+0.047
chair+0.046
Neg logit contributions
 help-0.014
 forgive-0.013
 check-0.012
 cook-0.012
 end-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.13e+00

Pos logit contributions
storms+0.001
case+0.001
chair+0.001
 wrists+0.001
aces+0.001
Neg logit contributions
 check-0.000
 help-0.000
 cook-0.000
 permission-0.000
 save-0.000
TokenOnce
Token ID7454
Feature activation+1.20e+00

Pos logit contributions
storms+0.065
case+0.059
 wrists+0.059
aces+0.059
chair+0.057
Neg logit contributions
 check-0.013
 help-0.013
 cook-0.012
 little-0.011
 end-0.010
Token loved
Token ID6151
Feature activation-1.96e+00

Pos logit contributions
storms+0.012
aces+0.011
 wrists+0.011
case+0.011
chair+0.011
Neg logit contributions
 cook-0.005
 help-0.004
 little-0.004
 check-0.004
 end-0.004
Token to
Token ID284
Feature activation+2.61e-01

Pos logit contributions
 help+0.040
 check+0.040
 cook+0.039
 little+0.037
 save+0.034
Neg logit contributions
storms-0.096
case-0.087
aces-0.086
 wrists-0.086
chair-0.083
Token play
Token ID711
Feature activation+5.16e-01

Pos logit contributions
storms+0.013
 wrists+0.012
case+0.012
aces+0.012
chair+0.011
Neg logit contributions
asting-0.004
 check-0.004
 little-0.004
 end-0.004
 permission-0.004
Token hide
Token ID7808
Feature activation+1.41e-01

Pos logit contributions
storms+0.024
case+0.021
aces+0.021
 wrists+0.021
chair+0.020
Neg logit contributions
 check-0.012
 cook-0.011
 help-0.011
 permission-0.011
 end-0.011
Token and
Token ID290
Feature activation-8.90e-01

Pos logit contributions
storms+0.008
 wrists+0.007
case+0.007
aces+0.007
chair+0.007
Neg logit contributions
 check-0.002
 help-0.002
 cook-0.002
 little-0.002
 end-0.002
Token seek
Token ID5380
Feature activation+4.34e+00

Pos logit contributions
 check+0.036
 cook+0.035
 help+0.034
 permission+0.034
 little+0.034
Neg logit contributions
storms-0.024
 wrists-0.020
case-0.020
aces-0.019
chair-0.018
Token with
Token ID351
Feature activation+8.21e-01

Pos logit contributions
storms+0.252
 wrists+0.224
case+0.219
aces+0.218
chair+0.216
Neg logit contributions
 help-0.056
 little-0.054
 cook-0.050
 check-0.050
 end-0.049
Token Lily
Token ID20037
Feature activation+7.35e-01

Pos logit contributions
storms+0.035
 wrists+0.030
aces+0.030
case+0.030
chair+0.029
Neg logit contributions
 check-0.022
 cook-0.021
 help-0.021
 end-0.020
 little-0.019
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
storms+0.045
 wrists+0.040
case+0.040
chair+0.040
aces+0.039
Neg logit contributions
 help-0.006
 little-0.006
 end-0.006
 check-0.005
 cook-0.005
Token One
Token ID1881
Feature activation-1.41e-01

Pos logit contributions
 help+0.006
 permission+0.006
 check+0.006
 cook+0.005
asting+0.005
Neg logit contributions
storms-0.009
case-0.008
 wrists-0.008
chair-0.008
aces-0.008
Token day
Token ID1110
Feature activation+3.22e-01

Pos logit contributions
 permission+0.004
 check+0.004
 help+0.003
 save+0.003
 forgive+0.003
Neg logit contributions
storms-0.006
chair-0.005
case-0.005
oline-0.005
aces-0.005
Token filled
Token ID5901
Feature activation+7.51e-01

Pos logit contributions
storms+0.015
 wrists+0.014
aces+0.014
case+0.013
chair+0.013
Neg logit contributions
 cook-0.006
 check-0.005
 help-0.005
 little-0.005
 end-0.005
Token with
Token ID351
Feature activation-5.34e-01

Pos logit contributions
storms+0.037
aces+0.033
 wrists+0.033
case+0.033
chair+0.032
Neg logit contributions
 check-0.015
 cook-0.015
 help-0.015
 little-0.014
 end-0.013
Token treasures
Token ID29561
Feature activation-5.62e-02

Pos logit contributions
 help+0.012
 check+0.012
 cook+0.011
 little+0.011
 end+0.011
Neg logit contributions
storms-0.026
case-0.022
aces-0.022
 wrists-0.022
chair-0.021
Token from
Token ID422
Feature activation+5.08e-01

Pos logit contributions
 check+0.001
 help+0.001
 cook+0.001
 little+0.001
 end+0.001
Neg logit contributions
storms-0.003
 wrists-0.003
aces-0.003
case-0.003
chair-0.003
Token long
Token ID890
Feature activation+3.95e-01

Pos logit contributions
storms+0.026
 wrists+0.023
aces+0.023
case+0.023
chair+0.022
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 end-0.008
Token ago
Token ID2084
Feature activation-5.20e+00

Pos logit contributions
storms+0.017
 wrists+0.015
case+0.015
aces+0.014
chair+0.014
Neg logit contributions
 help-0.010
 little-0.010
 check-0.010
 cook-0.010
 work-0.009
Token!
Token ID0
Feature activation+1.79e-01

Pos logit contributions
 check+0.058
 help+0.058
 cook+0.057
 little+0.052
 end+0.048
Neg logit contributions
storms-0.300
 wrists-0.269
case-0.268
aces-0.267
chair-0.261
Token
Token ID198
Feature activation-4.39e-01

Pos logit contributions
storms+0.009
aces+0.008
 wrists+0.008
case+0.007
uts+0.007
Neg logit contributions
 little-0.004
 check-0.004
 cook-0.004
 help-0.004
 pick-0.003
Token
Token ID198
Feature activation-5.71e-01

Pos logit contributions
 check+0.006
 cook+0.006
 help+0.006
 end+0.005
 permission+0.005
Neg logit contributions
storms-0.024
case-0.022
 wrists-0.021
aces-0.021
chair-0.021
TokenThe
Token ID464
Feature activation-5.73e-01

Pos logit contributions
 check+0.011
 help+0.011
 permission+0.010
 cook+0.010
 end+0.010
Neg logit contributions
storms-0.028
case-0.026
 wrists-0.025
chair-0.024
aces-0.024
Token adventurers
Token ID36843
Feature activation-4.31e-01

Pos logit contributions
 check+0.012
 cook+0.011
 help+0.011
 forgive+0.010
asting+0.010
Neg logit contributions
storms-0.028
case-0.025
aces-0.024
 wrists-0.024
chair-0.023
Token in
Token ID287
Feature activation-1.03e+00

Pos logit contributions
 help+0.018
 check+0.017
 cook+0.017
 little+0.016
 end+0.015
Neg logit contributions
storms-0.075
 wrists-0.068
aces-0.067
case-0.066
chair-0.065
Token the
Token ID262
Feature activation-5.51e-01

Pos logit contributions
 help+0.015
 check+0.015
 little+0.013
 cook+0.013
 end+0.013
Neg logit contributions
storms-0.059
aces-0.053
 wrists-0.053
case-0.052
chair-0.051
Token park
Token ID3952
Feature activation-1.25e+00

Pos logit contributions
 cook+0.015
 help+0.014
asting+0.014
 check+0.014
 save+0.014
Neg logit contributions
storms-0.022
case-0.020
chair-0.019
 wrists-0.019
aces-0.019
Token many
Token ID867
Feature activation-6.34e-01

Pos logit contributions
 cook+0.013
 help+0.012
 check+0.012
 little+0.011
 end+0.011
Neg logit contributions
storms-0.073
 wrists-0.068
aces-0.067
case-0.065
chair-0.064
Token years
Token ID812
Feature activation-2.30e+00

Pos logit contributions
 little+0.014
 end+0.014
 help+0.014
 check+0.013
 cook+0.012
Neg logit contributions
storms-0.032
aces-0.028
case-0.028
uts-0.028
 wrists-0.028
Token ago
Token ID2084
Feature activation-5.12e+00

Pos logit contributions
 little+0.031
 check+0.028
 help+0.028
 cook+0.027
 work+0.025
Neg logit contributions
storms-0.129
 wrists-0.118
aces-0.115
case-0.115
chair-0.109
Token.
Token ID13
Feature activation-1.52e-01

Pos logit contributions
 help+0.047
 cook+0.045
 check+0.044
 little+0.042
 end+0.037
Neg logit contributions
storms-0.309
 wrists-0.276
case-0.275
aces-0.270
chair-0.269
Token Toby
Token ID37578
Feature activation-3.81e-01

Pos logit contributions
 check+0.005
 cook+0.005
 help+0.005
 end+0.005
 little+0.005
Neg logit contributions
storms-0.006
 wrists-0.005
case-0.005
aces-0.005
chair-0.004
Token was
Token ID373
Feature activation-3.47e-01

Pos logit contributions
 little+0.008
 cook+0.008
 check+0.008
 help+0.008
 take+0.007
Neg logit contributions
storms-0.019
aces-0.017
case-0.016
chair-0.016
 wrists-0.016
Token amazed
Token ID24562
Feature activation-8.23e-01

Pos logit contributions
 little+0.008
 check+0.008
 help+0.008
 cook+0.007
 two+0.007
Neg logit contributions
storms-0.017
aces-0.015
 wrists-0.015
chair-0.015
case-0.014
Token!
Token ID0
Feature activation-2.95e-01

Pos logit contributions
 check+0.011
 little+0.011
 help+0.011
 cook+0.010
 end+0.009
Neg logit contributions
storms-0.046
 wrists-0.042
aces-0.042
case-0.042
chair-0.040
Token "
Token ID366
Feature activation+7.73e-01

Pos logit contributions
storms+0.011
 wrists+0.010
case+0.010
aces+0.010
chair+0.010
Neg logit contributions
 check-0.005
 help-0.005
 cook-0.005
 little-0.005
 end-0.005
TokenIt
Token ID1026
Feature activation+1.11e+00

Pos logit contributions
storms+0.032
case+0.028
 wrists+0.028
aces+0.028
chair+0.027
Neg logit contributions
 help-0.021
 check-0.021
 cook-0.020
 little-0.020
 end-0.020
Token was
Token ID373
Feature activation+1.17e-01

Pos logit contributions
storms+0.044
 wrists+0.038
aces+0.038
case+0.038
chair+0.036
Neg logit contributions
 check-0.033
 help-0.033
 cook-0.032
 little-0.032
 save-0.030
Token hidden
Token ID7104
Feature activation+5.97e-01

Pos logit contributions
storms+0.006
 wrists+0.006
case+0.006
aces+0.006
chair+0.005
Neg logit contributions
 check-0.002
 help-0.002
 cook-0.002
 little-0.002
 end-0.002
Token long
Token ID890
Feature activation+7.30e-01

Pos logit contributions
storms+0.030
 wrists+0.027
case+0.027
aces+0.027
chair+0.026
Neg logit contributions
 check-0.011
 help-0.011
 cook-0.011
 little-0.011
 end-0.010
Token ago
Token ID2084
Feature activation-5.10e+00

Pos logit contributions
storms+0.039
 wrists+0.035
case+0.035
aces+0.034
chair+0.033
Neg logit contributions
 check-0.011
 help-0.011
 cook-0.011
 little-0.011
 end-0.010
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 help+0.061
 cook+0.059
 check+0.059
 little+0.056
 end+0.051
Neg logit contributions
storms-0.294
 wrists-0.262
case-0.261
aces-0.257
chair-0.254
Token and
Token ID290
Feature activation-2.24e-01

Pos logit contributions
 little+0.028
 cook+0.027
 help+0.026
 check+0.026
 pick+0.023
Neg logit contributions
storms-0.068
 wrists-0.059
aces-0.059
case-0.058
chair-0.056
Token only
Token ID691
Feature activation+6.20e-01

Pos logit contributions
 help+0.006
 little+0.006
 check+0.006
 cook+0.006
 save+0.006
Neg logit contributions
storms-0.010
aces-0.008
case-0.008
 wrists-0.008
chair-0.008
Token seems
Token ID2331
Feature activation-3.76e-01

Pos logit contributions
storms+0.031
case+0.028
 wrists+0.028
aces+0.028
chair+0.027
Neg logit contributions
 check-0.012
 help-0.012
 cook-0.011
 little-0.011
 end-0.010
Token fancy
Token ID14996
Feature activation+2.19e-01

Pos logit contributions
 check+0.007
 help+0.007
 cook+0.007
 little+0.007
 end+0.006
Neg logit contributions
storms-0.019
 wrists-0.017
case-0.017
aces-0.017
chair-0.016
Token rest
Token ID1334
Feature activation-1.45e+00

Pos logit contributions
storms+0.003
chair+0.003
 wrists+0.003
case+0.003
aces+0.003
Neg logit contributions
 check-0.002
 forgive-0.002
 cook-0.002
 permission-0.002
 save-0.002
Token of
Token ID286
Feature activation-1.32e+00

Pos logit contributions
 check+0.025
 help+0.025
 cook+0.025
 little+0.024
 end+0.023
Neg logit contributions
storms-0.075
 wrists-0.067
case-0.066
aces-0.066
chair-0.063
Token the
Token ID262
Feature activation-5.88e-01

Pos logit contributions
 help+0.031
 cook+0.030
 check+0.030
 forgive+0.030
 permission+0.030
Neg logit contributions
 wrists-0.050
storms-0.048
 Ruth-0.045
 Sof-0.045
 Zoe-0.044
Token ice
Token ID4771
Feature activation-2.25e+00

Pos logit contributions
 check+0.014
 forgive+0.014
 help+0.014
 permission+0.014
 cook+0.014
Neg logit contributions
storms-0.022
chair-0.021
case-0.020
 wrists-0.020
ement-0.019
Token-
Token ID12
Feature activation-5.61e-01

Pos logit contributions
 check+0.059
 forgive+0.058
 help+0.057
 permission+0.056
 end+0.053
Neg logit contributions
storms-0.093
chair-0.083
aces-0.082
case-0.082
uts-0.080
Tokencream
Token ID36277
Feature activation-5.02e+00

Pos logit contributions
 check+0.021
 help+0.020
 end+0.020
 cook+0.020
 little+0.020
Neg logit contributions
storms-0.017
chair-0.015
aces-0.015
case-0.014
uts-0.014
Token and
Token ID290
Feature activation-1.58e-01

Pos logit contributions
 check+0.127
 help+0.125
 cook+0.123
 little+0.119
 end+0.115
Neg logit contributions
storms-0.220
 wrists-0.191
case-0.190
aces-0.189
chair-0.184
Token played
Token ID2826
Feature activation+5.29e-01

Pos logit contributions
 check+0.004
 help+0.004
 cook+0.004
 little+0.004
 end+0.004
Neg logit contributions
storms-0.006
case-0.006
aces-0.006
 wrists-0.006
chair-0.005
Token together
Token ID1978
Feature activation+4.72e-01

Pos logit contributions
storms+0.022
 wrists+0.020
aces+0.020
case+0.019
chair+0.019
Neg logit contributions
 check-0.014
 help-0.014
 cook-0.014
 little-0.013
 end-0.013
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
storms+0.025
aces+0.022
case+0.022
 wrists+0.022
chair+0.022
Neg logit contributions
 help-0.008
 cook-0.008
 check-0.008
 end-0.008
 little-0.007
Token Lily
Token ID20037
Feature activation-1.01e-02

Pos logit contributions
 check+0.007
 help+0.006
 cook+0.006
 little+0.006
 save+0.006
Neg logit contributions
storms-0.011
case-0.009
 wrists-0.009
aces-0.009
chair-0.009
Token pe
Token ID613
Feature activation-8.19e-01

Pos logit contributions
 help+0.015
 check+0.015
 cook+0.014
 little+0.013
 end+0.013
Neg logit contributions
storms-0.043
 wrists-0.039
aces-0.038
case-0.038
chair-0.037
Tokeneking
Token ID18754
Feature activation-2.63e+00

Pos logit contributions
 check+0.026
 help+0.026
 cook+0.026
 little+0.025
 end+0.025
Neg logit contributions
storms-0.029
aces-0.028
 wrists-0.026
uts-0.024
overed-0.023
Token out
Token ID503
Feature activation-1.10e+00

Pos logit contributions
 help+0.054
 check+0.054
 little+0.054
 cook+0.053
 end+0.050
Neg logit contributions
storms-0.127
 wrists-0.114
case-0.113
aces-0.111
chair-0.108
Token from
Token ID422
Feature activation-1.68e-01

Pos logit contributions
 check+0.020
 help+0.020
 cook+0.020
 little+0.018
 end+0.018
Neg logit contributions
storms-0.056
 wrists-0.050
case-0.049
aces-0.049
chair-0.048
Token the
Token ID262
Feature activation-3.27e-01

Pos logit contributions
 cook+0.003
 check+0.003
 help+0.003
 little+0.003
 forgive+0.003
Neg logit contributions
storms-0.008
 wrists-0.007
aces-0.007
case-0.007
chair-0.007
Token banks
Token ID6341
Feature activation-5.02e+00

Pos logit contributions
 check+0.009
 forgive+0.008
 cook+0.008
 install+0.008
 listen+0.008
Neg logit contributions
storms-0.013
 wrists-0.011
case-0.011
als-0.010
chair-0.010
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 check+0.111
 help+0.103
 cook+0.101
 save+0.098
 forgive+0.094
Neg logit contributions
storms-0.235
case-0.206
aces-0.205
chair-0.197
uts-0.194
Token As
Token ID1081
Feature activation-1.65e-01

Pos logit contributions
 check+0.004
 help+0.003
 cook+0.003
 end+0.003
 little+0.003
Neg logit contributions
storms-0.007
case-0.006
aces-0.006
 wrists-0.006
chair-0.006
Token he
Token ID339
Feature activation-6.66e-01

Pos logit contributions
 check+0.004
 cook+0.004
 help+0.004
 forgive+0.004
 permission+0.004
Neg logit contributions
storms-0.007
case-0.006
 wrists-0.006
chair-0.006
aces-0.006
Token watched
Token ID7342
Feature activation-4.65e-01

Pos logit contributions
 check+0.018
 help+0.018
 cook+0.018
 little+0.017
 end+0.017
Neg logit contributions
storms-0.028
 wrists-0.024
case-0.024
aces-0.024
chair-0.023
Token for
Token ID329
Feature activation-2.54e-01

Pos logit contributions
 check+0.009
 cook+0.009
 forgive+0.008
 help+0.008
asting+0.008
Neg logit contributions
storms-0.022
 wrists-0.019
aces-0.019
case-0.018
chair-0.018
Token march
Token ID9960
Feature activation+9.70e-01

Pos logit contributions
storms+0.010
 wrists+0.009
case+0.009
aces+0.009
chair+0.008
Neg logit contributions
 adult-0.006
 permission-0.005
asting-0.005
ruck-0.005
 end-0.005
Token around
Token ID1088
Feature activation-1.19e-01

Pos logit contributions
storms+0.048
 wrists+0.043
case+0.041
aces+0.041
chair+0.040
Neg logit contributions
 help-0.022
 check-0.021
 little-0.020
 cook-0.019
 end-0.019
Token the
Token ID262
Feature activation-4.24e-01

Pos logit contributions
 check+0.002
 cook+0.002
 forgive+0.002
asting+0.002
 permission+0.002
Neg logit contributions
storms-0.006
 wrists-0.005
aces-0.005
case-0.005
chair-0.005
Token far
Token ID1290
Feature activation-1.94e+00

Pos logit contributions
 help+0.011
 check+0.011
 permission+0.011
 cook+0.011
 forgive+0.011
Neg logit contributions
storms-0.017
case-0.015
 wrists-0.015
aces-0.014
chair-0.014
Tokenmy
Token ID1820
Feature activation-1.80e+00

Pos logit contributions
 check+0.043
 help+0.043
 cook+0.042
 little+0.041
 end+0.040
Neg logit contributions
storms-0.091
aces-0.081
 wrists-0.081
case-0.080
chair-0.077
Tokenard
Token ID446
Feature activation-4.98e+00

Pos logit contributions
 check+0.031
 help+0.031
 cook+0.031
 little+0.030
 end+0.028
Neg logit contributions
storms-0.093
 wrists-0.084
case-0.083
aces-0.082
chair-0.079
Token with
Token ID351
Feature activation-2.45e-01

Pos logit contributions
 check+0.059
 help+0.059
 cook+0.058
 little+0.053
 end+0.050
Neg logit contributions
storms-0.284
 wrists-0.257
case-0.256
aces-0.255
chair-0.249
Token his
Token ID465
Feature activation+1.03e-01

Pos logit contributions
 help+0.003
 check+0.003
 little+0.002
 cook+0.002
 end+0.002
Neg logit contributions
storms-0.015
case-0.013
 wrists-0.013
aces-0.013
chair-0.013
Token friends
Token ID2460
Feature activation+5.57e-01

Pos logit contributions
storms+0.005
case+0.005
aces+0.005
 wrists+0.005
chair+0.005
Neg logit contributions
 help-0.002
 check-0.002
 cook-0.002
 little-0.002
 end-0.002
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
storms+0.027
aces+0.024
 wrists+0.024
case+0.023
uts+0.022
Neg logit contributions
 check-0.011
 cook-0.011
 save-0.010
 adult-0.010
 help-0.010
Token One
Token ID1881
Feature activation-6.74e-02

Pos logit contributions
storms+0.006
 wrists+0.005
aces+0.005
case+0.005
chair+0.005
Neg logit contributions
 check-0.003
 help-0.003
 cook-0.003
 little-0.003
 end-0.003
Token was
Token ID373
Feature activation-2.48e-01

Pos logit contributions
 help+0.034
 check+0.034
 cook+0.032
 end+0.031
 permission+0.031
Neg logit contributions
storms-0.055
case-0.047
aces-0.047
 wrists-0.047
chair-0.045
Token exploring
Token ID13504
Feature activation-9.56e-01

Pos logit contributions
 forgive+0.007
 end+0.007
 cook+0.007
 check+0.006
 save+0.006
Neg logit contributions
storms-0.009
case-0.008
aces-0.008
 wrists-0.008
chair-0.008
Token the
Token ID262
Feature activation+7.17e-02

Pos logit contributions
arl+0.028
 adult+0.026
 forgive+0.026
 cook+0.026
 permission+0.025
Neg logit contributions
storms-0.032
chair-0.026
 webs-0.026
 lands-0.026
aces-0.025
Token far
Token ID1290
Feature activation-1.79e+00

Pos logit contributions
storms+0.002
case+0.002
 wrists+0.002
chair+0.002
aces+0.002
Neg logit contributions
 help-0.002
 forgive-0.002
 permission-0.002
 check-0.002
 save-0.002
Tokenmy
Token ID1820
Feature activation-1.44e+00

Pos logit contributions
 help+0.043
 check+0.043
 cook+0.042
 forgive+0.040
 permission+0.039
Neg logit contributions
storms-0.080
case-0.070
 wrists-0.069
aces-0.068
chair-0.068
Tokenard
Token ID446
Feature activation-4.98e+00

Pos logit contributions
 check+0.034
 cook+0.034
 help+0.034
 little+0.033
 end+0.032
Neg logit contributions
storms-0.065
 wrists-0.058
aces-0.057
case-0.057
chair-0.054
Token when
Token ID618
Feature activation-1.35e+00

Pos logit contributions
 check+0.086
 help+0.084
 cook+0.081
 little+0.077
 end+0.074
Neg logit contributions
storms-0.260
aces-0.230
case-0.229
 wrists-0.227
chair-0.221
Token he
Token ID339
Feature activation-3.11e-01

Pos logit contributions
 help+0.028
 check+0.028
 little+0.028
 cook+0.027
 end+0.026
Neg logit contributions
storms-0.066
 wrists-0.061
aces-0.060
case-0.058
uts-0.056
Token came
Token ID1625
Feature activation-4.99e-01

Pos logit contributions
 check+0.006
 cook+0.006
 help+0.006
 little+0.006
 end+0.006
Neg logit contributions
storms-0.015
 wrists-0.014
aces-0.014
case-0.013
chair-0.013
Token across
Token ID1973
Feature activation-7.38e-01

Pos logit contributions
 cook+0.019
 install+0.018
 chop+0.018
asting+0.018
 check+0.018
Neg logit contributions
storms-0.012
 Caroline-0.010
aces-0.010
overed-0.010
case-0.010
Token a
Token ID257
Feature activation-1.26e-01

Pos logit contributions
asting+0.016
arl+0.015
 check+0.015
 forgive+0.014
 adult+0.014
Neg logit contributions
storms-0.031
case-0.027
aces-0.026
chair-0.026
 Sof-0.026
Token march
Token ID9960
Feature activation+1.05e+00

Pos logit contributions
storms+0.015
 wrists+0.014
case+0.013
aces+0.013
chair+0.013
Neg logit contributions
 adult-0.008
 permission-0.008
asting-0.007
 end-0.007
ruck-0.007
Token around
Token ID1088
Feature activation-1.62e-01

Pos logit contributions
storms+0.051
 wrists+0.046
case+0.044
aces+0.044
chair+0.042
Neg logit contributions
 help-0.025
 check-0.023
 little-0.023
 cook-0.022
 end-0.022
Token the
Token ID262
Feature activation-5.10e-01

Pos logit contributions
 check+0.003
 cook+0.003
 help+0.003
 forgive+0.003
asting+0.003
Neg logit contributions
storms-0.008
 wrists-0.007
case-0.007
aces-0.007
chair-0.007
Token far
Token ID1290
Feature activation-1.87e+00

Pos logit contributions
 help+0.013
 check+0.013
 cook+0.012
 permission+0.012
 forgive+0.012
Neg logit contributions
storms-0.022
case-0.020
 wrists-0.019
aces-0.019
chair-0.018
Tokenmy
Token ID1820
Feature activation-1.73e+00

Pos logit contributions
 check+0.037
 little+0.037
 help+0.037
 cook+0.036
 end+0.036
Neg logit contributions
storms-0.092
aces-0.083
 wrists-0.082
case-0.081
uts-0.078
Tokenard
Token ID446
Feature activation-4.94e+00

Pos logit contributions
 little+0.030
 cook+0.030
 help+0.029
 check+0.029
 end+0.028
Neg logit contributions
storms-0.090
 wrists-0.081
case-0.079
aces-0.079
chair-0.076
Token with
Token ID351
Feature activation-7.45e-02

Pos logit contributions
 check+0.058
 help+0.058
 cook+0.057
 little+0.053
 end+0.049
Neg logit contributions
storms-0.282
 wrists-0.257
case-0.255
aces-0.253
chair-0.249
Token his
Token ID465
Feature activation+2.15e-01

Pos logit contributions
 help+0.001
 check+0.001
 little+0.001
 cook+0.001
 end+0.001
Neg logit contributions
storms-0.004
case-0.004
 wrists-0.004
aces-0.004
chair-0.004
Token friends
Token ID2460
Feature activation+6.39e-01

Pos logit contributions
storms+0.011
case+0.010
aces+0.010
 wrists+0.010
chair+0.010
Neg logit contributions
 help-0.004
 check-0.004
 cook-0.004
 little-0.004
 end-0.003
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
storms+0.029
aces+0.026
 wrists+0.026
case+0.024
uts+0.024
Neg logit contributions
 check-0.013
 cook-0.013
 save-0.013
 adult-0.013
asting-0.013
Token One
Token ID1881
Feature activation-4.00e-02

Pos logit contributions
storms+0.005
case+0.004
aces+0.004
 wrists+0.004
chair+0.004
Neg logit contributions
 help-0.003
 check-0.003
 permission-0.003
 cook-0.003
 save-0.003
Token are
Token ID389
Feature activation+8.83e-01

Pos logit contributions
 help+0.009
 check+0.009
 cook+0.008
 little+0.008
 end+0.008
Neg logit contributions
storms-0.016
 wrists-0.014
aces-0.014
case-0.014
chair-0.014
Token from
Token ID422
Feature activation-2.08e-02

Pos logit contributions
storms+0.040
aces+0.035
case+0.035
 wrists+0.035
chair+0.033
Neg logit contributions
 check-0.020
 cook-0.020
 help-0.020
 little-0.019
 end-0.018
Token a
Token ID257
Feature activation-5.08e-01

Pos logit contributions
 check+0.000
 help+0.000
 cook+0.000
 little+0.000
 end+0.000
Neg logit contributions
storms-0.001
 wrists-0.001
case-0.001
aces-0.001
chair-0.001
Token long
Token ID890
Feature activation+1.10e+00

Pos logit contributions
 check+0.012
 help+0.012
 cook+0.012
 little+0.011
 end+0.011
Neg logit contributions
storms-0.023
case-0.020
 wrists-0.020
aces-0.019
chair-0.019
Token time
Token ID640
Feature activation-7.51e-01

Pos logit contributions
storms+0.051
 wrists+0.045
aces+0.044
case+0.044
chair+0.043
Neg logit contributions
 check-0.025
 help-0.025
 cook-0.025
 little-0.024
 end-0.023
Token ago
Token ID2084
Feature activation-4.89e+00

Pos logit contributions
 check+0.017
 cook+0.016
 help+0.016
 little+0.016
 end+0.015
Neg logit contributions
storms-0.035
aces-0.030
case-0.030
 wrists-0.030
chair-0.029
Token,
Token ID11
Feature activation-6.94e-01

Pos logit contributions
 help+0.060
 cook+0.059
 check+0.058
 little+0.055
 end+0.051
Neg logit contributions
storms-0.281
 wrists-0.249
case-0.249
aces-0.244
chair-0.243
Token before
Token ID878
Feature activation-9.49e-01

Pos logit contributions
 cook+0.014
 help+0.014
 check+0.014
 little+0.013
 end+0.012
Neg logit contributions
storms-0.035
 wrists-0.031
case-0.031
aces-0.031
chair-0.030
Token we
Token ID356
Feature activation-1.15e+00

Pos logit contributions
 check+0.024
 help+0.024
 cook+0.023
 little+0.022
 end+0.022
Neg logit contributions
storms-0.042
 wrists-0.037
aces-0.037
case-0.037
chair-0.035
Token were
Token ID547
Feature activation+3.19e-01

Pos logit contributions
 check+0.020
 little+0.020
 cook+0.019
 help+0.019
asting+0.018
Neg logit contributions
storms-0.055
case-0.050
aces-0.049
 wrists-0.048
chair-0.047
Token born
Token ID4642
Feature activation-7.54e-01

Pos logit contributions
storms+0.015
 wrists+0.013
case+0.013
aces+0.013
chair+0.013
Neg logit contributions
 check-0.007
 help-0.007
 cook-0.007
 little-0.007
 end-0.006
Token zoo
Token ID26626
Feature activation-2.64e-01

Pos logit contributions
 check+0.010
asting+0.010
 wash+0.010
 cook+0.009
 help+0.009
Neg logit contributions
storms-0.009
 wrists-0.008
case-0.008
chair-0.007
aces-0.007
Token workers
Token ID3259
Feature activation-7.96e-01

Pos logit contributions
 cook+0.007
 help+0.007
 little+0.006
 end+0.006
 check+0.006
Neg logit contributions
storms-0.012
 wrists-0.011
aces-0.011
case-0.010
uts-0.010
Token have
Token ID423
Feature activation-8.22e-01

Pos logit contributions
 check+0.019
 help+0.018
 cook+0.018
 little+0.018
 end+0.017
Neg logit contributions
storms-0.035
case-0.031
aces-0.030
 wrists-0.030
chair-0.029
Token stopped
Token ID5025
Feature activation-1.73e+00

Pos logit contributions
 help+0.016
 little+0.015
 check+0.015
 cook+0.014
 permission+0.014
Neg logit contributions
storms-0.043
case-0.040
 wrists-0.038
aces-0.038
chair-0.038
Token the
Token ID262
Feature activation-3.04e-01

Pos logit contributions
 check+0.043
 forgive+0.041
 cook+0.041
 help+0.041
 end+0.039
Neg logit contributions
storms-0.073
 wrists-0.059
aces-0.057
uts-0.056
 loses-0.054
Token rh
Token ID9529
Feature activation-4.85e+00

Pos logit contributions
 check+0.008
 help+0.008
 cook+0.008
 forgive+0.008
asting+0.008
Neg logit contributions
storms-0.012
case-0.010
 wrists-0.010
chair-0.010
aces-0.010
Tokenin
Token ID259
Feature activation-2.83e+00

Pos logit contributions
 check+0.125
 help+0.124
 wash+0.118
 cook+0.113
 permission+0.113
Neg logit contributions
storms-0.207
case-0.186
chair-0.182
aces-0.176
ane-0.174
Tokenoc
Token ID420
Feature activation-2.92e+00

Pos logit contributions
 check+0.080
 permission+0.079
 help+0.078
 wash+0.078
 forgive+0.077
Neg logit contributions
storms-0.111
chair-0.099
case-0.097
als-0.086
uts-0.083
Tokeneros
Token ID27498
Feature activation-2.64e+00

Pos logit contributions
 check+0.128
 help+0.125
 little+0.124
 cook+0.124
 forgive+0.122
Neg logit contributions
storms-0.074
 wrists-0.059
aces-0.057
case-0.052
chair-0.052
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
 check+0.036
 help+0.035
 cook+0.034
 little+0.031
 end+0.030
Neg logit contributions
storms-0.146
case-0.131
 wrists-0.131
aces-0.131
chair-0.127
Token He
Token ID679
Feature activation-7.25e-01

Pos logit contributions
storms+0.009
case+0.007
aces+0.007
chair+0.007
 wrists+0.007
Neg logit contributions
 check-0.008
 help-0.008
 end-0.008
 cook-0.008
 little-0.008
Token who
Token ID508
Feature activation-2.18e+00

Pos logit contributions
storms+0.008
case+0.008
aces+0.008
 wrists+0.007
chair+0.007
Neg logit contributions
 little-0.003
 check-0.002
 help-0.002
 cook-0.002
 end-0.002
Token had
Token ID550
Feature activation-2.27e-01

Pos logit contributions
 cook+0.057
 check+0.057
 help+0.056
 end+0.053
 little+0.053
Neg logit contributions
storms-0.095
 wrists-0.086
aces-0.084
case-0.081
uts-0.078
Token passed
Token ID3804
Feature activation-2.18e-01

Pos logit contributions
 help+0.004
 little+0.004
 check+0.004
 cook+0.004
 two+0.004
Neg logit contributions
storms-0.012
 wrists-0.011
aces-0.011
case-0.011
chair-0.010
Token away
Token ID1497
Feature activation+1.17e+00

Pos logit contributions
 little+0.003
 check+0.003
 help+0.003
 work+0.002
 two+0.002
Neg logit contributions
storms-0.012
 wrists-0.012
case-0.011
aces-0.011
uts-0.011
Token long
Token ID890
Feature activation+4.26e-01

Pos logit contributions
storms+0.064
 wrists+0.058
aces+0.057
case+0.057
chair+0.055
Neg logit contributions
 check-0.017
 help-0.016
 cook-0.016
 little-0.015
 end-0.014
Token ago
Token ID2084
Feature activation-4.82e+00

Pos logit contributions
storms+0.020
 wrists+0.018
case+0.017
 Caroline+0.017
uts+0.017
Neg logit contributions
 little-0.010
 help-0.010
 cook-0.009
 check-0.009
 end-0.009
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
 help+0.044
 cook+0.042
 little+0.041
 check+0.038
 end+0.034
Neg logit contributions
storms-0.291
 wrists-0.261
case-0.257
chair-0.253
 Caroline-0.251
Token One
Token ID1881
Feature activation-2.01e-01

Pos logit contributions
storms+0.017
 wrists+0.015
aces+0.015
case+0.015
chair+0.014
Neg logit contributions
 check-0.007
 cook-0.007
 help-0.007
 little-0.006
 end-0.006
Token day
Token ID1110
Feature activation+7.63e-02

Pos logit contributions
 check+0.005
 permission+0.005
 help+0.005
 save+0.005
 forgive+0.004
Neg logit contributions
storms-0.008
chair-0.008
case-0.008
aces-0.007
oline-0.007
Token,
Token ID11
Feature activation+9.18e-01

Pos logit contributions
storms+0.003
case+0.003
chair+0.003
aces+0.003
 wrists+0.003
Neg logit contributions
 check-0.002
 cook-0.002
 help-0.002
asting-0.002
 forgive-0.002
Token a
Token ID257
Feature activation+1.63e-01

Pos logit contributions
storms+0.044
 wrists+0.039
case+0.039
aces+0.038
chair+0.038
Neg logit contributions
 check-0.019
 cook-0.018
 help-0.018
asting-0.017
 end-0.017
Token flew
Token ID13112
Feature activation-3.87e-01

Pos logit contributions
 check+0.021
 help+0.021
 cook+0.021
 little+0.020
 end+0.019
Neg logit contributions
storms-0.038
 wrists-0.034
aces-0.034
case-0.033
chair-0.032
Token right
Token ID826
Feature activation+5.82e-01

Pos logit contributions
 little+0.008
 check+0.008
 help+0.007
 cook+0.007
 end+0.007
Neg logit contributions
storms-0.020
 wrists-0.018
aces-0.017
case-0.017
uts-0.017
Token back
Token ID736
Feature activation+2.71e-01

Pos logit contributions
storms+0.028
 wrists+0.025
case+0.024
aces+0.024
chair+0.024
Neg logit contributions
 check-0.013
 help-0.012
 cook-0.012
 little-0.012
 end-0.011
Token up
Token ID510
Feature activation-1.48e+00

Pos logit contributions
storms+0.013
 wrists+0.012
aces+0.012
case+0.012
chair+0.011
Neg logit contributions
 check-0.005
 help-0.005
 cook-0.005
 little-0.005
 end-0.005
Token the
Token ID262
Feature activation-1.47e+00

Pos logit contributions
 check+0.026
 help+0.025
 cook+0.025
 little+0.024
 work+0.022
Neg logit contributions
storms-0.078
 wrists-0.070
aces-0.069
case-0.068
chair-0.066
Token chim
Token ID18205
Feature activation-4.78e+00

Pos logit contributions
 cook+0.043
 check+0.043
 help+0.043
 forgive+0.042
asting+0.042
Neg logit contributions
storms-0.056
case-0.049
 wrists-0.048
chair-0.047
aces-0.047
Tokenney
Token ID1681
Feature activation-3.60e+00

Pos logit contributions
 help+0.186
 check+0.184
 cook+0.179
 save+0.175
 end+0.173
Neg logit contributions
storms-0.148
case-0.129
chair-0.120
aces-0.119
 wrists-0.116
Token and
Token ID290
Feature activation-4.56e-01

Pos logit contributions
 check+0.038
 cook+0.038
 help+0.037
 little+0.035
 end+0.030
Neg logit contributions
storms-0.211
 wrists-0.191
aces-0.189
case-0.185
 Caroline-0.180
Token disappeared
Token ID12120
Feature activation-2.07e-01

Pos logit contributions
 check+0.012
 help+0.011
 cook+0.011
 little+0.011
 save+0.010
Neg logit contributions
storms-0.021
aces-0.019
case-0.019
 wrists-0.018
uts-0.018
Token into
Token ID656
Feature activation-9.93e-01

Pos logit contributions
 check+0.003
 help+0.003
 cook+0.003
 little+0.003
 end+0.003
Neg logit contributions
storms-0.011
 wrists-0.010
case-0.010
aces-0.010
chair-0.010
Token the
Token ID262
Feature activation-6.63e-01

Pos logit contributions
 check+0.015
 help+0.015
 cook+0.014
 little+0.013
 end+0.013
Neg logit contributions
storms-0.054
 wrists-0.048
case-0.048
aces-0.048
chair-0.046
Token scientist
Token ID11444
Feature activation-1.06e+00

Pos logit contributions
arl+0.018
asting+0.018
Their+0.017
 what+0.017
 listen+0.017
Neg logit contributions
storms-0.018
umm-0.015
 character-0.013
 Statue-0.013
 Melody-0.013
Token was
Token ID373
Feature activation-4.46e-01

Pos logit contributions
 cook+0.025
 help+0.025
 little+0.025
 check+0.025
 end+0.023
Neg logit contributions
storms-0.049
aces-0.043
 wrists-0.043
case-0.042
chair-0.041
Token surprised
Token ID6655
Feature activation-7.13e-01

Pos logit contributions
 check+0.011
 cook+0.010
 help+0.010
 little+0.010
 end+0.010
Neg logit contributions
storms-0.020
case-0.018
 wrists-0.017
aces-0.017
chair-0.017
Token at
Token ID379
Feature activation+1.50e+00

Pos logit contributions
 check+0.013
 cook+0.013
 help+0.013
 save+0.012
 end+0.012
Neg logit contributions
storms-0.037
 wrists-0.032
aces-0.032
case-0.032
chair-0.031
Token the
Token ID262
Feature activation-7.67e-01

Pos logit contributions
storms+0.059
 wrists+0.056
aces+0.053
 Jeff+0.052
 Caroline+0.050
Neg logit contributions
 cook-0.035
 check-0.035
 help-0.032
 save-0.032
asting-0.031
Token ot
Token ID30972
Feature activation-4.77e+00

Pos logit contributions
 listen+0.017
 check+0.017
 what+0.017
Some+0.017
Mother+0.017
Neg logit contributions
storms-0.028
case-0.023
umm-0.023
storm-0.022
 Fer-0.021
Tokenter
Token ID353
Feature activation-2.59e+00

Pos logit contributions
 check+0.165
 end+0.163
<|endoftext|>+0.156
 work+0.154
 what+0.152
Neg logit contributions
storms-0.161
aces-0.128
als-0.128
uts-0.121
ane-0.113
Token's
Token ID338
Feature activation-1.44e+00

Pos logit contributions
 check+0.051
 help+0.047
 cook+0.046
 little+0.043
 end+0.043
Neg logit contributions
storms-0.127
case-0.111
aces-0.108
 wrists-0.108
chair-0.106
Token ability
Token ID2694
Feature activation-7.70e-01

Pos logit contributions
 check+0.032
asting+0.030
 chop+0.028
 cook+0.028
ruck+0.027
Neg logit contributions
storms-0.061
case-0.054
 wrists-0.053
aces-0.050
uts-0.049
Token.
Token ID13
Feature activation-1.40e-01

Pos logit contributions
 help+0.012
 check+0.011
 little+0.011
 cook+0.011
 end+0.010
Neg logit contributions
storms-0.042
 wrists-0.039
aces-0.038
case-0.038
chair-0.037
Token His
Token ID2399
Feature activation-2.13e+00

Pos logit contributions
 check+0.003
 cook+0.003
 help+0.003
 little+0.003
 save+0.003
Neg logit contributions
storms-0.006
 wrists-0.006
aces-0.006
case-0.006
chair-0.005
Token The
Token ID383
Feature activation-5.72e-01

Pos logit contributions
storms+0.002
case+0.001
aces+0.001
 wrists+0.001
chair+0.001
Neg logit contributions
 check-0.001
 help-0.001
 cook-0.001
 end-0.001
 little-0.001
Token ants
Token ID27842
Feature activation-7.34e-01

Pos logit contributions
 check+0.016
 help+0.015
 cook+0.015
 little+0.015
 save+0.014
Neg logit contributions
storms-0.024
case-0.021
 wrists-0.020
aces-0.020
chair-0.020
Token came
Token ID1625
Feature activation-1.25e+00

Pos logit contributions
 help+0.021
 cook+0.020
 check+0.020
 little+0.020
 end+0.019
Neg logit contributions
storms-0.031
 wrists-0.027
aces-0.027
case-0.026
chair-0.025
Token from
Token ID422
Feature activation-1.36e-01

Pos logit contributions
 check+0.027
 cook+0.027
 help+0.027
 end+0.025
 save+0.025
Neg logit contributions
storms-0.058
case-0.051
 wrists-0.050
aces-0.050
uts-0.048
Token the
Token ID262
Feature activation-7.33e-01

Pos logit contributions
 cook+0.003
asting+0.003
 check+0.003
 forgive+0.003
 help+0.003
Neg logit contributions
storms-0.006
 wrists-0.005
case-0.005
aces-0.005
chair-0.005
Token junk
Token ID18556
Feature activation-4.77e+00

Pos logit contributions
 check+0.017
 help+0.017
 cook+0.017
 little+0.016
 end+0.016
Neg logit contributions
storms-0.034
aces-0.030
 wrists-0.030
case-0.029
chair-0.028
Token under
Token ID739
Feature activation-1.42e+00

Pos logit contributions
 cook+0.058
 little+0.054
 help+0.053
 end+0.051
 check+0.050
Neg logit contributions
storms-0.271
aces-0.254
 wrists-0.254
 Caroline-0.244
Aud-0.241
Token his
Token ID465
Feature activation-1.65e+00

Pos logit contributions
 check+0.023
 cook+0.022
 help+0.022
 forgive+0.020
 little+0.019
Neg logit contributions
storms-0.075
 wrists-0.067
aces-0.066
case-0.066
chair-0.064
Token bed
Token ID3996
Feature activation-6.87e-01

Pos logit contributions
 help+0.040
 check+0.039
 cook+0.039
 little+0.038
 end+0.036
Neg logit contributions
storms-0.078
aces-0.067
 wrists-0.065
case-0.065
uts-0.063
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 little+0.008
 cook+0.007
 help+0.007
 check+0.006
 work+0.006
Neg logit contributions
storms-0.039
 wrists-0.037
aces-0.036
 Caroline-0.035
Aud-0.035
Token Tim
Token ID5045
Feature activation-8.27e-01

Pos logit contributions
storms+0.010
case+0.008
 wrists+0.008
aces+0.008
chair+0.008
Neg logit contributions
 check-0.006
 help-0.006
 cook-0.006
 little-0.006
 end-0.006
Token is
Token ID318
Feature activation+7.57e-01

Pos logit contributions
 little+0.027
 help+0.027
 cook+0.026
 end+0.025
 check+0.025
Neg logit contributions
storms-0.048
 wrists-0.045
aces-0.043
case-0.041
chair-0.040
Token from
Token ID422
Feature activation+1.10e+00

Pos logit contributions
storms+0.036
 wrists+0.031
aces+0.031
case+0.031
chair+0.030
Neg logit contributions
 check-0.017
 help-0.017
 cook-0.017
 little-0.016
 end-0.015
Token a
Token ID257
Feature activation+3.21e-02

Pos logit contributions
storms+0.053
aces+0.047
case+0.046
 wrists+0.046
chair+0.045
Neg logit contributions
 check-0.024
 help-0.024
 cook-0.023
 little-0.023
 end-0.022
Token time
Token ID640
Feature activation-2.21e-01

Pos logit contributions
storms+0.001
 wrists+0.001
aces+0.001
case+0.001
chair+0.001
Neg logit contributions
 check-0.001
 help-0.001
 cook-0.001
 little-0.001
 end-0.001
Token long
Token ID890
Feature activation+1.01e+00

Pos logit contributions
 check+0.005
 help+0.005
 cook+0.005
 little+0.004
 end+0.004
Neg logit contributions
storms-0.011
 wrists-0.009
aces-0.009
case-0.009
chair-0.009
Token ago
Token ID2084
Feature activation-4.75e+00

Pos logit contributions
storms+0.047
 wrists+0.042
aces+0.042
case+0.041
uts+0.040
Neg logit contributions
 little-0.024
 check-0.023
 help-0.023
 cook-0.023
 end-0.022
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
 cook+0.081
 help+0.081
 little+0.079
 check+0.078
 end+0.073
Neg logit contributions
storms-0.253
 wrists-0.220
case-0.217
chair-0.213
aces-0.212
Token It
Token ID632
Feature activation+7.97e-01

Pos logit contributions
storms+0.012
case+0.010
aces+0.010
 wrists+0.010
chair+0.009
Neg logit contributions
 check-0.012
 help-0.012
 cook-0.012
 end-0.012
 little-0.012
Token has
Token ID468
Feature activation+5.47e-02

Pos logit contributions
storms+0.033
 wrists+0.031
aces+0.030
case+0.029
uts+0.028
Neg logit contributions
 help-0.023
 check-0.022
 cook-0.022
 little-0.021
 save-0.020
Token been
Token ID587
Feature activation-1.30e+00

Pos logit contributions
storms+0.003
 wrists+0.003
case+0.003
aces+0.003
chair+0.002
Neg logit contributions
 help-0.001
 little-0.001
 check-0.001
 cook-0.001
 two-0.001
Token locked
Token ID8970
Feature activation+4.24e-01

Pos logit contributions
 check+0.028
 help+0.028
 cook+0.027
 little+0.026
 end+0.025
Neg logit contributions
storms-0.062
 wrists-0.055
case-0.055
aces-0.055
chair-0.053
Token's
Token ID338
Feature activation-3.99e-01

Pos logit contributions
storms+0.036
 wrists+0.032
aces+0.032
case+0.031
chair+0.030
Neg logit contributions
 check-0.020
 cook-0.019
 help-0.019
 little-0.019
 end-0.018
Token house
Token ID2156
Feature activation+4.36e-01

Pos logit contributions
 check+0.008
 help+0.008
 cook+0.008
 little+0.008
 end+0.007
Neg logit contributions
storms-0.020
aces-0.017
 wrists-0.017
case-0.017
chair-0.016
Token was
Token ID373
Feature activation-4.82e-01

Pos logit contributions
storms+0.024
 wrists+0.021
aces+0.021
Aud+0.020
 Caroline+0.020
Neg logit contributions
 little-0.008
 help-0.008
 check-0.007
 cook-0.007
 end-0.007
Token built
Token ID3170
Feature activation-9.22e-01

Pos logit contributions
 check+0.011
 help+0.010
 cook+0.010
 little+0.010
 end+0.010
Neg logit contributions
storms-0.023
 wrists-0.021
aces-0.020
case-0.019
chair-0.019
Token long
Token ID890
Feature activation+1.12e+00

Pos logit contributions
 help+0.010
 check+0.009
 cook+0.009
 little+0.009
 work+0.008
Neg logit contributions
storms-0.055
 wrists-0.051
aces-0.048
chair-0.047
case-0.047
Token ago
Token ID2084
Feature activation-4.73e+00

Pos logit contributions
storms+0.054
 wrists+0.049
 Caroline+0.047
aces+0.046
case+0.046
Neg logit contributions
 little-0.022
 cook-0.022
 help-0.022
 check-0.022
 work-0.021
Token.
Token ID13
Feature activation+2.88e-01

Pos logit contributions
 help+0.038
 cook+0.037
 check+0.036
 little+0.035
 end+0.029
Neg logit contributions
storms-0.290
 wrists-0.261
case-0.255
aces-0.253
chair-0.252
Token
Token ID198
Feature activation-3.12e-02

Pos logit contributions
storms+0.017
aces+0.016
 wrists+0.016
case+0.015
uts+0.015
Neg logit contributions
 little-0.003
 check-0.003
 cook-0.002
 help-0.002
 pick-0.002
Token
Token ID198
Feature activation-1.38e-01

Pos logit contributions
 check+0.000
 help+0.000
 cook+0.000
 permission+0.000
 save+0.000
Neg logit contributions
storms-0.002
case-0.002
 wrists-0.002
aces-0.002
chair-0.002
TokenL
Token ID43
Feature activation+8.18e-01

Pos logit contributions
 help+0.003
 permission+0.003
 check+0.003
 cook+0.003
 chop+0.003
Neg logit contributions
storms-0.006
case-0.006
chair-0.006
 wrists-0.006
aces-0.006
Tokenily
Token ID813
Feature activation-2.65e-02

Pos logit contributions
storms+0.034
aces+0.030
case+0.030
 wrists+0.029
chair+0.029
Neg logit contributions
 help-0.022
 check-0.022
 cook-0.022
 save-0.021
 little-0.020
Token of
Token ID286
Feature activation-3.02e-01

Pos logit contributions
 help+0.023
 cook+0.023
 check+0.022
 little+0.021
 end+0.021
Neg logit contributions
storms-0.089
 wrists-0.081
aces-0.077
case-0.076
chair-0.075
Token animals
Token ID4695
Feature activation-1.14e+00

Pos logit contributions
 check+0.008
 cook+0.008
 help+0.008
 forgive+0.007
asting+0.007
Neg logit contributions
storms-0.012
 wrists-0.011
aces-0.011
case-0.010
chair-0.010
Token that
Token ID326
Feature activation-1.24e+00

Pos logit contributions
 check+0.018
 help+0.018
 cook+0.018
 little+0.016
 end+0.015
Neg logit contributions
storms-0.061
 wrists-0.055
aces-0.054
case-0.054
chair-0.053
Token lived
Token ID5615
Feature activation+6.46e-02

Pos logit contributions
 cook+0.029
 help+0.029
 little+0.026
 check+0.026
 pick+0.025
Neg logit contributions
storms-0.059
aces-0.053
 wrists-0.053
overed-0.051
case-0.051
Token long
Token ID890
Feature activation+3.76e-01

Pos logit contributions
storms+0.003
 wrists+0.003
aces+0.003
case+0.003
chair+0.003
Neg logit contributions
 check-0.001
 help-0.001
 cook-0.001
 little-0.001
 end-0.001
Token ago
Token ID2084
Feature activation-4.73e+00

Pos logit contributions
storms+0.020
 wrists+0.018
case+0.017
aces+0.017
chair+0.017
Neg logit contributions
 little-0.006
 help-0.006
 cook-0.006
 check-0.006
 end-0.006
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
 cook+0.051
 help+0.050
 check+0.049
 little+0.047
 end+0.043
Neg logit contributions
storms-0.278
 wrists-0.247
case-0.246
aces-0.242
chair-0.241
Token They
Token ID1119
Feature activation+4.19e-01

Pos logit contributions
storms+0.021
aces+0.019
 wrists+0.019
case+0.018
chair+0.018
Neg logit contributions
 check-0.010
 cook-0.010
 help-0.010
 little-0.009
 end-0.009
Token saw
Token ID2497
Feature activation-8.56e-01

Pos logit contributions
storms+0.018
aces+0.016
case+0.016
 wrists+0.016
chair+0.015
Neg logit contributions
 check-0.011
 cook-0.011
 help-0.011
 little-0.010
 end-0.010
Token shiny
Token ID22441
Feature activation-2.75e-01

Pos logit contributions
 little+0.018
 help+0.017
 cook+0.017
 check+0.016
 two+0.016
Neg logit contributions
storms-0.043
case-0.038
 wrists-0.037
aces-0.037
chair-0.037
Token rocks
Token ID12586
Feature activation-4.80e-01

Pos logit contributions
 check+0.007
 help+0.007
 cook+0.007
 little+0.006
 end+0.006
Neg logit contributions
storms-0.012
case-0.011
 wrists-0.011
aces-0.011
chair-0.010
Token started
Token ID2067
Feature activation-8.33e-01

Pos logit contributions
 little+0.030
 permission+0.030
 check+0.029
 cook+0.029
 end+0.028
Neg logit contributions
storms-0.068
case-0.061
aces-0.056
uts-0.056
 wrists-0.056
Token smoking
Token ID9216
Feature activation-1.66e+00

Pos logit contributions
 check+0.020
 end+0.019
 cook+0.019
 help+0.019
 forgive+0.018
Neg logit contributions
storms-0.037
 wrists-0.031
aces-0.030
case-0.030
uts-0.030
Token a
Token ID257
Feature activation+2.03e-01

Pos logit contributions
 check+0.026
 help+0.025
 cook+0.025
 little+0.023
 end+0.022
Neg logit contributions
storms-0.089
 wrists-0.080
aces-0.079
case-0.079
chair-0.077
Token long
Token ID890
Feature activation+1.28e+00

Pos logit contributions
storms+0.009
case+0.008
 wrists+0.008
chair+0.008
uts+0.007
Neg logit contributions
 check-0.005
 help-0.005
 permission-0.005
 forgive-0.004
asting-0.004
Token time
Token ID640
Feature activation-8.50e-01

Pos logit contributions
storms+0.053
case+0.046
 wrists+0.046
chair+0.044
aces+0.044
Neg logit contributions
 check-0.034
 help-0.033
 forgive-0.032
 permission-0.032
 cook-0.032
Token ago
Token ID2084
Feature activation-4.73e+00

Pos logit contributions
 check+0.016
 help+0.015
 cook+0.015
 little+0.014
 save+0.014
Neg logit contributions
storms-0.042
aces-0.037
case-0.037
 wrists-0.036
chair-0.035
Token and
Token ID290
Feature activation-3.37e-01

Pos logit contributions
 help+0.054
 cook+0.053
 check+0.052
 little+0.049
 end+0.045
Neg logit contributions
storms-0.276
 wrists-0.245
case-0.244
aces-0.240
chair-0.238
Token now
Token ID783
Feature activation+2.74e-01

Pos logit contributions
 check+0.008
 help+0.008
 cook+0.008
 end+0.007
 little+0.007
Neg logit contributions
storms-0.015
 wrists-0.013
case-0.013
aces-0.013
chair-0.013
Token it
Token ID340
Feature activation+8.73e-01

Pos logit contributions
storms+0.013
 wrists+0.012
aces+0.012
case+0.012
chair+0.011
Neg logit contributions
 help-0.006
 check-0.006
 cook-0.006
 little-0.006
 work-0.006
Token's
Token ID338
Feature activation+3.18e-01

Pos logit contributions
storms+0.036
 wrists+0.034
aces+0.033
 Caroline+0.031
case+0.031
Neg logit contributions
 cook-0.024
 help-0.024
 check-0.023
 little-0.023
 end-0.022
Token hard
Token ID1327
Feature activation-6.98e-01

Pos logit contributions
storms+0.015
case+0.013
 wrists+0.013
aces+0.013
chair+0.012
Neg logit contributions
 check-0.007
 help-0.007
 cook-0.006
 save-0.006
 little-0.006
Token velvet
Token ID47750
Feature activation-5.25e-01

Pos logit contributions
storms+0.035
aces+0.030
 wrists+0.030
case+0.030
uts+0.030
Neg logit contributions
 help-0.018
 little-0.018
 check-0.018
 end-0.017
 cook-0.017
Token box
Token ID3091
Feature activation+1.98e+00

Pos logit contributions
 help+0.012
 little+0.011
 cook+0.011
 end+0.011
 check+0.011
Neg logit contributions
storms-0.025
aces-0.022
 wrists-0.021
 Caroline-0.021
Aud-0.021
Token all
Token ID477
Feature activation+1.33e+00

Pos logit contributions
storms+0.119
 wrists+0.108
aces+0.105
chair+0.102
case+0.101
Neg logit contributions
 help-0.022
 check-0.020
 cook-0.020
 little-0.020
 end-0.018
Token those
Token ID883
Feature activation+3.22e-01

Pos logit contributions
storms+0.064
 wrists+0.059
aces+0.058
case+0.057
uts+0.055
Neg logit contributions
 check-0.028
 help-0.028
 cook-0.027
 little-0.026
 end-0.026
Token years
Token ID812
Feature activation-2.32e+00

Pos logit contributions
storms+0.016
case+0.014
 wrists+0.014
uts+0.014
aces+0.013
Neg logit contributions
 little-0.007
 help-0.007
 check-0.007
 cook-0.006
 end-0.006
Token ago
Token ID2084
Feature activation-4.72e+00

Pos logit contributions
 little+0.028
 check+0.028
 help+0.027
 cook+0.026
 end+0.023
Neg logit contributions
storms-0.134
 wrists-0.119
aces-0.119
case-0.118
chair-0.114
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
 help+0.043
 cook+0.042
 check+0.039
 little+0.038
 end+0.034
Neg logit contributions
storms-0.289
case-0.255
 wrists-0.254
chair-0.251
aces-0.249
Token They
Token ID1119
Feature activation+1.54e-01

Pos logit contributions
 check+0.006
 help+0.006
 cook+0.006
 little+0.006
 end+0.006
Neg logit contributions
storms-0.012
 wrists-0.011
aces-0.011
case-0.011
chair-0.010
Token were
Token ID547
Feature activation+4.96e-01

Pos logit contributions
storms+0.007
aces+0.006
 wrists+0.006
case+0.006
chair+0.006
Neg logit contributions
 help-0.004
 check-0.004
 cook-0.004
 little-0.004
 end-0.004
Token glad
Token ID9675
Feature activation-1.46e+00

Pos logit contributions
storms+0.027
chair+0.023
aces+0.022
 wrists+0.022
case+0.022
Neg logit contributions
 little-0.010
 help-0.010
 two-0.010
 check-0.009
 cook-0.009
Token that
Token ID326
Feature activation-8.73e-01

Pos logit contributions
 help+0.031
 check+0.030
 little+0.030
 cook+0.030
 end+0.027
Neg logit contributions
storms-0.071
case-0.065
aces-0.065
 wrists-0.063
chair-0.062
Token was
Token ID373
Feature activation-2.12e-01

Pos logit contributions
storms+0.022
 wrists+0.020
aces+0.020
case+0.019
chair+0.019
Neg logit contributions
 help-0.012
 check-0.012
 cook-0.012
 little-0.012
 take-0.011
Token painted
Token ID13055
Feature activation-7.35e-01

Pos logit contributions
 check+0.003
 help+0.003
 cook+0.003
 little+0.002
 end+0.002
Neg logit contributions
storms-0.012
 wrists-0.011
case-0.011
aces-0.011
chair-0.011
Token a
Token ID257
Feature activation+9.27e-01

Pos logit contributions
 little+0.013
 cook+0.012
 help+0.012
 check+0.012
 end+0.011
Neg logit contributions
storms-0.039
 wrists-0.036
case-0.034
aces-0.034
uts-0.033
Token long
Token ID890
Feature activation+2.05e+00

Pos logit contributions
storms+0.037
case+0.032
 wrists+0.032
aces+0.031
chair+0.031
Neg logit contributions
 check-0.026
 help-0.026
 cook-0.025
 save-0.024
 permission-0.024
Token time
Token ID640
Feature activation-3.96e-01

Pos logit contributions
storms+0.085
 wrists+0.073
case+0.073
aces+0.073
chair+0.070
Neg logit contributions
 check-0.057
 help-0.056
 cook-0.056
 little-0.053
 end-0.052
Token ago
Token ID2084
Feature activation-4.72e+00

Pos logit contributions
 check+0.008
 help+0.008
 cook+0.008
 little+0.008
 end+0.007
Neg logit contributions
storms-0.019
 wrists-0.017
case-0.017
aces-0.017
chair-0.016
Token,
Token ID11
Feature activation-6.34e-01

Pos logit contributions
 help+0.060
 cook+0.059
 check+0.058
 little+0.058
 end+0.051
Neg logit contributions
storms-0.270
 wrists-0.237
case-0.236
aces-0.231
chair-0.231
Tokenâ
Token ID22940
Feature activation-6.63e-01

Pos logit contributions
 little+0.014
 check+0.014
 help+0.013
 cook+0.013
 two+0.012
Neg logit contributions
storms-0.031
aces-0.028
 wrists-0.027
case-0.027
uts-0.026
Token€
Token ID26391
Feature activation-4.39e-01

Pos logit contributions
 little+0.028
 help+0.028
 check+0.027
 cook+0.027
 permission+0.027
Neg logit contributions
storms-0.017
case-0.015
aces-0.014
chair-0.014
 wrists-0.012
Token her
Token ID607
Feature activation+4.13e-01

Pos logit contributions
 little+0.014
 check+0.014
 cook+0.014
 help+0.013
 save+0.013
Neg logit contributions
storms-0.015
aces-0.014
 wrists-0.014
oline-0.012
rol-0.012
Token mum
Token ID25682
Feature activation+7.34e-01

Pos logit contributions
storms+0.022
aces+0.019
case+0.018
chair+0.018
uts+0.018
Neg logit contributions
 little-0.009
 check-0.008
 help-0.008
 cook-0.008
 end-0.007
Token They
Token ID1119
Feature activation+3.42e-01

Pos logit contributions
storms+0.015
 wrists+0.013
case+0.013
aces+0.013
chair+0.012
Neg logit contributions
 help-0.010
 check-0.010
 cook-0.009
 save-0.009
asting-0.009
Token wanted
Token ID2227
Feature activation-1.34e+00

Pos logit contributions
storms+0.016
aces+0.014
 wrists+0.014
case+0.014
chair+0.013
Neg logit contributions
 help-0.008
 check-0.008
 cook-0.008
 little-0.008
 end-0.007
Token to
Token ID284
Feature activation+6.38e-01

Pos logit contributions
 help+0.022
 check+0.021
 cook+0.021
 little+0.020
 end+0.018
Neg logit contributions
storms-0.076
aces-0.068
case-0.067
 wrists-0.065
chair-0.065
Token make
Token ID787
Feature activation+3.77e-01

Pos logit contributions
storms+0.036
aces+0.032
 wrists+0.032
case+0.031
chair+0.031
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 save-0.008
Token a
Token ID257
Feature activation+4.37e-01

Pos logit contributions
storms+0.020
 wrists+0.018
case+0.018
aces+0.018
chair+0.018
Neg logit contributions
 check-0.006
 help-0.006
 cook-0.005
 little-0.005
 end-0.005
Token big
Token ID1263
Feature activation+2.55e+00

Pos logit contributions
storms+0.019
case+0.017
 wrists+0.016
aces+0.016
chair+0.016
Neg logit contributions
 check-0.011
 help-0.011
 cook-0.011
 little-0.010
 permission-0.010
Token building
Token ID2615
Feature activation-5.41e-01

Pos logit contributions
storms+0.090
aces+0.076
 wrists+0.074
case+0.073
uts+0.070
Neg logit contributions
 check-0.089
 cook-0.088
 help-0.088
 little-0.085
 end-0.085
Token with
Token ID351
Feature activation+3.23e-01

Pos logit contributions
 cook+0.008
 help+0.008
 check+0.008
 little+0.007
 end+0.007
Neg logit contributions
storms-0.030
 wrists-0.027
aces-0.026
case-0.026
chair-0.026
Token many
Token ID867
Feature activation+7.73e-01

Pos logit contributions
storms+0.016
 wrists+0.014
aces+0.014
case+0.014
chair+0.014
Neg logit contributions
 check-0.006
 help-0.006
 cook-0.006
 little-0.005
 end-0.005
Token rooms
Token ID9519
Feature activation+3.14e-01

Pos logit contributions
storms+0.030
 wrists+0.026
case+0.025
aces+0.025
chair+0.024
Neg logit contributions
 help-0.024
 check-0.024
 cook-0.023
 little-0.023
 end-0.022
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
storms+0.018
 wrists+0.016
case+0.016
aces+0.016
chair+0.015
Neg logit contributions
 check-0.004
 help-0.004
 cook-0.004
 little-0.004
 end-0.004
Token toy
Token ID13373
Feature activation+1.32e+00

Pos logit contributions
storms+0.043
case+0.038
 wrists+0.037
aces+0.037
chair+0.036
Neg logit contributions
 help-0.025
 check-0.025
 cook-0.024
 little-0.024
 end-0.022
Token st
Token ID336
Feature activation+9.77e-01

Pos logit contributions
storms+0.064
case+0.057
 wrists+0.057
aces+0.056
 Caroline+0.054
Neg logit contributions
 help-0.029
 check-0.027
 cook-0.027
 little-0.026
 save-0.025
Tokeneth
Token ID2788
Feature activation+9.34e-01

Pos logit contributions
storms+0.037
 wrists+0.031
aces+0.031
case+0.031
chair+0.029
Neg logit contributions
 check-0.031
 help-0.031
 cook-0.031
 little-0.029
 end-0.029
Tokenoscope
Token ID40326
Feature activation-1.36e+00

Pos logit contributions
storms+0.041
 wrists+0.035
case+0.035
aces+0.034
chair+0.033
Neg logit contributions
 help-0.024
 check-0.024
 cook-0.023
 little-0.022
 end-0.022
Token and
Token ID290
Feature activation+5.40e-01

Pos logit contributions
 help+0.013
 little+0.013
 check+0.011
 cook+0.011
 end+0.011
Neg logit contributions
storms-0.080
 wrists-0.077
aces-0.072
case-0.072
 Caroline-0.070
Token band
Token ID4097
Feature activation+2.50e+00

Pos logit contributions
storms+0.027
 wrists+0.024
case+0.024
aces+0.024
chair+0.023
Neg logit contributions
 check-0.010
 help-0.010
 cook-0.010
 little-0.009
 end-0.009
Tokenages
Token ID1095
Feature activation-2.76e-01

Pos logit contributions
storms+0.091
aces+0.079
 wrists+0.078
case+0.076
uts+0.074
Neg logit contributions
 check-0.081
 cook-0.079
 help-0.077
 little-0.077
 end-0.077
Token to
Token ID284
Feature activation+1.02e-01

Pos logit contributions
 help+0.004
 check+0.004
 cook+0.004
 little+0.004
 end+0.004
Neg logit contributions
storms-0.015
 wrists-0.014
case-0.013
aces-0.013
chair-0.013
Token make
Token ID787
Feature activation-8.22e-02

Pos logit contributions
storms+0.005
 wrists+0.005
case+0.004
chair+0.004
aces+0.004
Neg logit contributions
asting-0.002
 little-0.001
 adult-0.001
 end-0.001
 moral-0.001
Token them
Token ID606
Feature activation+4.78e-02

Pos logit contributions
 check+0.002
 help+0.002
 cook+0.002
 little+0.002
 end+0.002
Neg logit contributions
storms-0.004
 wrists-0.004
case-0.004
aces-0.003
chair-0.003
Token feel
Token ID1254
Feature activation-1.33e+00

Pos logit contributions
storms+0.002
case+0.002
aces+0.002
 wrists+0.002
chair+0.002
Neg logit contributions
 check-0.001
 adult-0.001
 help-0.001
 cook-0.001
 end-0.001
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
 check+0.015
 help+0.014
 save+0.013
 cook+0.013
 forgive+0.013
Neg logit contributions
storms-0.041
case-0.038
 wrists-0.037
aces-0.037
chair-0.036
Token They
Token ID1119
Feature activation+3.13e-01

Pos logit contributions
storms+0.009
aces+0.008
 wrists+0.008
case+0.007
chair+0.007
Neg logit contributions
 cook-0.004
 check-0.004
 help-0.004
 little-0.004
 end-0.004
Token had
Token ID550
Feature activation-1.42e-01

Pos logit contributions
storms+0.015
aces+0.013
 wrists+0.013
case+0.013
uts+0.013
Neg logit contributions
 help-0.007
 check-0.007
 cook-0.007
 little-0.007
 pick-0.007
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
 little+0.001
 help+0.001
 check+0.001
 cook+0.001
 two+0.001
Neg logit contributions
storms-0.010
case-0.008
aces-0.008
chair-0.008
uts-0.008
Token big
Token ID1263
Feature activation+2.39e+00

Pos logit contributions
storms+0.055
case+0.050
 wrists+0.050
aces+0.048
chair+0.047
Neg logit contributions
 check-0.023
 help-0.022
 cook-0.022
asting-0.021
 save-0.020
Token sheet
Token ID9629
Feature activation+2.60e+00

Pos logit contributions
storms+0.111
aces+0.096
uts+0.095
 wrists+0.093
case+0.091
Neg logit contributions
 help-0.061
 cook-0.060
 check-0.060
 end-0.060
 little-0.059
Token that
Token ID326
Feature activation-3.82e-01

Pos logit contributions
storms+0.145
 wrists+0.137
aces+0.128
 Caroline+0.126
yles+0.121
Neg logit contributions
 cook-0.036
 help-0.036
 little-0.032
 check-0.032
 end-0.031
Token they
Token ID484
Feature activation+7.35e-01

Pos logit contributions
 cook+0.008
 help+0.007
 wash+0.007
 little+0.007
 check+0.006
Neg logit contributions
storms-0.020
ement-0.019
 wrists-0.018
aces-0.018
oph-0.018
Token used
Token ID973
Feature activation-4.48e-01

Pos logit contributions
storms+0.036
 wrists+0.030
 Caroline+0.030
aces+0.030
chair+0.029
Neg logit contributions
 help-0.017
 cook-0.017
 check-0.017
 little-0.015
 take-0.015
Token to
Token ID284
Feature activation+2.10e-01

Pos logit contributions
 help+0.008
 check+0.007
 cook+0.007
 little+0.007
 work+0.007
Neg logit contributions
storms-0.024
case-0.022
 wrists-0.021
chair-0.021
aces-0.021
Token make
Token ID787
Feature activation+9.80e-02

Pos logit contributions
storms+0.010
 wrists+0.009
case+0.009
aces+0.009
chair+0.009
Neg logit contributions
 check-0.004
 little-0.004
 adult-0.004
 cook-0.003
 help-0.003
Token the
Token ID262
Feature activation+1.61e-03

Pos logit contributions
 help+0.015
 check+0.015
 cook+0.015
 little+0.015
 end+0.013
Neg logit contributions
storms-0.055
case-0.050
aces-0.049
 wrists-0.049
chair-0.048
Token pengu
Token ID38373
Feature activation-1.17e+00

Pos logit contributions
storms+0.000
 wrists+0.000
aces+0.000
case+0.000
chair+0.000
Neg logit contributions
 check-0.000
 help-0.000
 cook-0.000
 save-0.000
 forgive-0.000
Tokenins
Token ID1040
Feature activation-3.12e-02

Pos logit contributions
 check+0.030
 help+0.030
 cook+0.029
 little+0.028
 save+0.027
Neg logit contributions
storms-0.051
aces-0.044
case-0.044
 wrists-0.044
chair-0.042
Token swim
Token ID9422
Feature activation-6.97e-01

Pos logit contributions
 check+0.001
 help+0.001
 little+0.001
 adult+0.001
 cook+0.001
Neg logit contributions
storms-0.001
aces-0.001
 wrists-0.001
case-0.001
uts-0.001
Token and
Token ID290
Feature activation+3.50e-01

Pos logit contributions
 cook+0.013
 help+0.012
 check+0.012
 little+0.012
 end+0.012
Neg logit contributions
storms-0.036
 wrists-0.032
aces-0.032
case-0.031
chair-0.031
Token slide
Token ID10649
Feature activation+2.66e+00

Pos logit contributions
storms+0.016
case+0.015
aces+0.015
 wrists+0.014
chair+0.014
Neg logit contributions
 help-0.008
 cook-0.008
 check-0.008
 little-0.008
 save-0.007
Token and
Token ID290
Feature activation+8.93e-01

Pos logit contributions
storms+0.155
case+0.144
aces+0.143
 wrists+0.140
chair+0.136
Neg logit contributions
 little-0.029
 help-0.026
 check-0.026
 end-0.026
 cook-0.025
Token w
Token ID266
Feature activation+1.17e+00

Pos logit contributions
storms+0.042
case+0.037
aces+0.037
 wrists+0.036
chair+0.035
Neg logit contributions
 help-0.022
 cook-0.021
 check-0.021
 little-0.020
 save-0.019
Tokenaddle
Token ID37382
Feature activation+9.51e-01

Pos logit contributions
storms+0.043
aces+0.038
 wrists+0.037
case+0.036
uts+0.034
Neg logit contributions
 check-0.039
 cook-0.038
 help-0.037
 little-0.036
 save-0.036
Token.
Token ID13
Feature activation+4.15e-01

Pos logit contributions
storms+0.051
case+0.045
 wrists+0.045
aces+0.045
chair+0.044
Neg logit contributions
 help-0.015
 check-0.015
 cook-0.015
 little-0.014
 end-0.014
Token They
Token ID1119
Feature activation+2.88e-01

Pos logit contributions
storms+0.020
 wrists+0.018
aces+0.018
case+0.018
chair+0.017
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 end-0.008
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
 check+0.011
 help+0.011
 cook+0.011
 little+0.010
 end+0.010
Neg logit contributions
storms-0.033
aces-0.029
 wrists-0.029
case-0.029
chair-0.028
Token backyard
Token ID24296
Feature activation-8.44e-01

Pos logit contributions
 permission+0.000
 help+0.000
 check+0.000
 forgive+0.000
 save+0.000
Neg logit contributions
storms-0.001
case-0.001
chair-0.001
 wrists-0.001
aces-0.001
Token.
Token ID13
Feature activation-4.80e-02

Pos logit contributions
 check+0.014
 help+0.013
 cook+0.012
 save+0.012
asting+0.012
Neg logit contributions
storms-0.044
case-0.040
aces-0.039
 wrists-0.039
chair-0.038
Token In
Token ID554
Feature activation+1.17e+00

Pos logit contributions
 little+0.001
 check+0.001
 cook+0.001
 help+0.001
 pick+0.001
Neg logit contributions
storms-0.002
 wrists-0.002
aces-0.002
case-0.002
uts-0.002
Token the
Token ID262
Feature activation-7.80e-02

Pos logit contributions
storms+0.073
 wrists+0.064
aces+0.064
uts+0.063
case+0.063
Neg logit contributions
 help-0.014
 end-0.013
 check-0.012
 little-0.012
 two-0.011
Token middle
Token ID3504
Feature activation+2.49e+00

Pos logit contributions
 check+0.002
 permission+0.002
 forgive+0.002
 help+0.002
asting+0.002
Neg logit contributions
storms-0.003
case-0.003
chair-0.002
 wrists-0.002
aces-0.002
Token of
Token ID286
Feature activation-9.13e-01

Pos logit contributions
storms+0.156
 wrists+0.144
aces+0.133
uts+0.130
case+0.128
Neg logit contributions
 little-0.029
 end-0.026
 help-0.026
 what-0.021
 check-0.021
Token playing
Token ID2712
Feature activation-4.89e-01

Pos logit contributions
 cook+0.018
 check+0.017
 help+0.017
 forgive+0.016
asting+0.016
Neg logit contributions
storms-0.044
 wrists-0.039
aces-0.038
case-0.038
chair-0.037
Token she
Token ID673
Feature activation+3.56e-02

Pos logit contributions
 check+0.011
 cook+0.011
 help+0.011
 permission+0.010
 end+0.010
Neg logit contributions
storms-0.022
aces-0.020
case-0.020
 wrists-0.019
chair-0.019
Token heard
Token ID2982
Feature activation-8.04e-01

Pos logit contributions
storms+0.002
 wrists+0.002
aces+0.002
case+0.002
uts+0.002
Neg logit contributions
 check-0.001
 cook-0.001
 help-0.001
 little-0.001
 end-0.001
Token a
Token ID257
Feature activation+4.47e-01

Pos logit contributions
 check+0.008
 help+0.008
 little+0.008
 cook+0.008
 end+0.007
Neg logit contributions
storms-0.047
 wrists-0.043
aces-0.043
case-0.042
chair-0.041
Token apples
Token ID22514
Feature activation-2.87e-01

Pos logit contributions
 help+0.017
 little+0.016
 check+0.016
 cook+0.016
 save+0.014
Neg logit contributions
storms-0.044
case-0.040
 wrists-0.039
aces-0.039
chair-0.038
Token and
Token ID290
Feature activation+1.53e-01

Pos logit contributions
 help+0.004
 cook+0.004
 check+0.003
 end+0.003
 little+0.003
Neg logit contributions
storms-0.017
 wrists-0.015
aces-0.015
case-0.015
chair-0.014
Token cheese
Token ID9891
Feature activation+8.43e-01

Pos logit contributions
storms+0.006
case+0.005
 wrists+0.005
aces+0.005
chair+0.005
Neg logit contributions
 check-0.004
 help-0.004
 cook-0.004
 little-0.004
 end-0.004
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
storms+0.052
 wrists+0.048
 Caroline+0.047
chair+0.047
case+0.046
Neg logit contributions
 help-0.007
 cook-0.007
 end-0.005
 check-0.005
 little-0.005
Token He
Token ID679
Feature activation-7.13e-01

Pos logit contributions
storms+0.008
 wrists+0.007
aces+0.007
case+0.007
chair+0.007
Neg logit contributions
 check-0.003
 cook-0.003
 help-0.003
 little-0.003
 end-0.003
Token thanked
Token ID26280
Feature activation+1.23e+00

Pos logit contributions
 cook+0.016
 check+0.016
 help+0.016
 little+0.015
 take+0.015
Neg logit contributions
storms-0.035
uts-0.030
aces-0.030
alo-0.030
chair-0.029
Token the
Token ID262
Feature activation+5.92e-02

Pos logit contributions
storms+0.067
 wrists+0.060
aces+0.060
case+0.060
chair+0.058
Neg logit contributions
 check-0.017
 cook-0.017
 help-0.017
 little-0.015
 end-0.015
Token doctor
Token ID6253
Feature activation-5.79e-02

Pos logit contributions
storms+0.002
 wrists+0.002
case+0.002
aces+0.002
chair+0.002
Neg logit contributions
 check-0.001
 help-0.001
asting-0.001
 install-0.001
 save-0.001
Token and
Token ID290
Feature activation+4.12e-01

Pos logit contributions
 check+0.001
 help+0.001
 cook+0.001
asting+0.001
 little+0.001
Neg logit contributions
storms-0.003
 wrists-0.003
aces-0.003
case-0.003
chair-0.003
Token his
Token ID465
Feature activation+1.68e-01

Pos logit contributions
storms+0.020
aces+0.017
case+0.017
 wrists+0.017
chair+0.016
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.009
 save-0.008
Token mom
Token ID1995
Feature activation+1.51e+00

Pos logit contributions
storms+0.008
 wrists+0.007
aces+0.007
case+0.007
chair+0.007
Neg logit contributions
 check-0.004
 cook-0.003
 help-0.003
 little-0.003
 end-0.003
Token dog
Token ID3290
Feature activation+1.79e-01

Pos logit contributions
storms+0.029
case+0.026
 wrists+0.025
aces+0.025
chair+0.024
Neg logit contributions
 check-0.017
 help-0.016
 cook-0.016
 little-0.015
 save-0.015
Token.
Token ID13
Feature activation+5.80e-01

Pos logit contributions
storms+0.008
 wrists+0.007
aces+0.007
case+0.007
chair+0.006
Neg logit contributions
 check-0.005
 help-0.005
 cook-0.005
 little-0.004
 end-0.004
Token He
Token ID679
Feature activation-2.56e-01

Pos logit contributions
storms+0.031
 wrists+0.028
aces+0.028
case+0.027
chair+0.026
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 end-0.008
Token wished
Token ID16555
Feature activation-1.02e+00

Pos logit contributions
 check+0.006
 help+0.006
 little+0.006
 end+0.006
 cook+0.006
Neg logit contributions
storms-0.012
uts-0.010
aces-0.010
 wrists-0.010
case-0.010
Token he
Token ID339
Feature activation+5.86e-01

Pos logit contributions
 help+0.021
 check+0.019
 little+0.019
 cook+0.019
 end+0.018
Neg logit contributions
storms-0.052
 wrists-0.047
case-0.046
aces-0.045
chair-0.043
Token had
Token ID550
Feature activation+4.28e-01

Pos logit contributions
storms+0.029
 wrists+0.027
ays+0.025
umm+0.025
aces+0.025
Neg logit contributions
 help-0.013
 little-0.013
 check-0.012
 save-0.012
 take-0.012
Token never
Token ID1239
Feature activation+1.65e-01

Pos logit contributions
storms+0.023
case+0.020
chair+0.020
 wrists+0.019
aces+0.019
Neg logit contributions
 help-0.009
 check-0.009
 little-0.008
 permission-0.008
 forgive-0.008
Token gone
Token ID3750
Feature activation+2.74e-02

Pos logit contributions
storms+0.006
aces+0.005
 wrists+0.005
case+0.005
chair+0.005
Neg logit contributions
 check-0.005
 cook-0.005
 help-0.005
 little-0.005
 end-0.005
Token to
Token ID284
Feature activation+1.06e-01

Pos logit contributions
storms+0.001
 wrists+0.001
case+0.001
aces+0.001
chair+0.001
Neg logit contributions
 check-0.001
 help-0.001
 cook-0.000
 little-0.000
 end-0.000
Token the
Token ID262
Feature activation+3.71e-01

Pos logit contributions
storms+0.005
case+0.005
 wrists+0.005
chair+0.004
aces+0.004
Neg logit contributions
 cook-0.002
 check-0.002
asting-0.002
 help-0.002
 little-0.002
Token park
Token ID3952
Feature activation-4.38e-01

Pos logit contributions
storms+0.015
case+0.014
 wrists+0.014
chair+0.013
aces+0.013
Neg logit contributions
 help-0.009
 check-0.009
 cook-0.009
asting-0.009
 permission-0.008
Token could
Token ID714
Feature activation+3.69e-01

Pos logit contributions
storms+0.023
 wrists+0.022
aces+0.020
ays+0.019
case+0.019
Neg logit contributions
 little-0.010
 help-0.010
 check-0.010
 take-0.010
 what-0.009
Token make
Token ID787
Feature activation+4.09e-01

Pos logit contributions
storms+0.020
case+0.018
 wrists+0.017
aces+0.017
chair+0.017
Neg logit contributions
 check-0.005
 cook-0.005
 little-0.005
 help-0.005
 end-0.004
Token her
Token ID607
Feature activation+1.08e-01

Pos logit contributions
storms+0.021
 wrists+0.018
case+0.018
aces+0.018
chair+0.018
Neg logit contributions
 check-0.008
 help-0.008
 cook-0.008
 little-0.007
 end-0.007
Token feel
Token ID1254
Feature activation-1.20e+00

Pos logit contributions
storms+0.006
 wrists+0.005
aces+0.005
case+0.005
chair+0.005
Neg logit contributions
 check-0.002
 help-0.002
 cook-0.002
 little-0.002
 end-0.002
Token better
Token ID1365
Feature activation+9.53e-03

Pos logit contributions
 check+0.020
 cook+0.020
 help+0.018
 save+0.017
 chop+0.017
Neg logit contributions
storms-0.059
case-0.053
aces-0.053
chair-0.052
 wrists-0.051
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
storms+0.001
 wrists+0.000
aces+0.000
case+0.000
chair+0.000
Neg logit contributions
 check-0.000
 help-0.000
 cook-0.000
 little-0.000
 end-0.000
Token His
Token ID2399
Feature activation-2.07e+00

Pos logit contributions
storms+0.006
case+0.005
chair+0.005
aces+0.005
 wrists+0.005
Neg logit contributions
 cook-0.002
 help-0.002
 check-0.002
 end-0.002
asting-0.002
Token sister
Token ID6621
Feature activation+1.30e+00

Pos logit contributions
 check+0.045
 cook+0.043
asting+0.043
 help+0.042
 save+0.041
Neg logit contributions
storms-0.095
case-0.086
 wrists-0.085
aces-0.084
chair-0.083
Token smiled
Token ID13541
Feature activation-1.38e-01

Pos logit contributions
storms+0.067
aces+0.060
 wrists+0.059
uts+0.058
case+0.057
Neg logit contributions
 help-0.028
 little-0.027
 check-0.025
 cook-0.025
 forgive-0.024
Token and
Token ID290
Feature activation-8.15e-02

Pos logit contributions
 check+0.002
 help+0.002
 cook+0.002
 little+0.002
 end+0.002
Neg logit contributions
storms-0.008
 wrists-0.007
aces-0.007
case-0.007
chair-0.007
Token thanked
Token ID26280
Feature activation+1.26e+00

Pos logit contributions
 check+0.002
 help+0.002
 cook+0.002
 little+0.002
 forgive+0.002
Neg logit contributions
storms-0.004
aces-0.004
case-0.003
 wrists-0.003
chair-0.003
Token to
Token ID284
Feature activation+3.07e-01

Pos logit contributions
 help+0.008
 check+0.008
 cook+0.007
 little+0.007
 save+0.007
Neg logit contributions
storms-0.023
aces-0.020
 wrists-0.020
case-0.020
chair-0.019
Token keep
Token ID1394
Feature activation-6.44e-01

Pos logit contributions
storms+0.015
case+0.014
 wrists+0.014
aces+0.013
chair+0.013
Neg logit contributions
 check-0.005
 little-0.005
 cook-0.005
 help-0.005
 permission-0.005
Token the
Token ID262
Feature activation-5.25e-01

Pos logit contributions
 check+0.015
 cook+0.015
 end+0.014
 help+0.014
 little+0.014
Neg logit contributions
storms-0.028
 wrists-0.026
aces-0.024
case-0.024
 Caroline-0.023
Token glass
Token ID5405
Feature activation-1.38e+00

Pos logit contributions
 check+0.016
 help+0.016
 cook+0.016
 little+0.015
 end+0.015
Neg logit contributions
storms-0.020
 wrists-0.018
case-0.018
aces-0.017
chair-0.017
Token jug
Token ID45808
Feature activation-3.60e-01

Pos logit contributions
 help+0.026
 cook+0.024
 little+0.024
 end+0.023
 check+0.022
Neg logit contributions
storms-0.075
 wrists-0.067
aces-0.065
 Caroline-0.064
case-0.062
Token forever
Token ID8097
Feature activation+3.82e-01

Pos logit contributions
 help+0.006
 check+0.006
 cook+0.006
 little+0.006
 end+0.005
Neg logit contributions
storms-0.019
 wrists-0.017
aces-0.017
case-0.017
chair-0.016
Token as
Token ID355
Feature activation-6.90e-01

Pos logit contributions
storms+0.019
 wrists+0.017
aces+0.017
case+0.017
uts+0.016
Neg logit contributions
 check-0.007
 cook-0.006
 help-0.006
 adult-0.006
 save-0.006
Token a
Token ID257
Feature activation+1.49e-01

Pos logit contributions
 check+0.018
 cook+0.017
 help+0.017
 end+0.016
 little+0.016
Neg logit contributions
storms-0.030
 wrists-0.026
aces-0.026
case-0.026
chair-0.025
Token reminder
Token ID15438
Feature activation+1.75e+00

Pos logit contributions
storms+0.006
 wrists+0.006
case+0.006
chair+0.005
 Caroline+0.005
Neg logit contributions
asting-0.003
 cook-0.003
 check-0.003
 chop-0.003
 help-0.003
Token of
Token ID286
Feature activation+3.64e-01

Pos logit contributions
storms+0.092
 wrists+0.082
aces+0.081
case+0.080
chair+0.078
Neg logit contributions
 help-0.031
 check-0.031
 cook-0.030
 little-0.029
 end-0.028
Token their
Token ID511
Feature activation+2.37e-01

Pos logit contributions
storms+0.016
 wrists+0.015
aces+0.014
case+0.014
 Caroline+0.014
Neg logit contributions
 check-0.008
 cook-0.008
 help-0.007
 little-0.007
 save-0.007
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
storms+0.018
case+0.017
aces+0.016
chair+0.016
uts+0.015
Neg logit contributions
 adult-0.007
 permission-0.007
 check-0.007
 help-0.006
 cook-0.006
Token time
Token ID640
Feature activation+6.71e-01

Pos logit contributions
 check+0.029
 help+0.029
 cook+0.029
 save+0.028
 permission+0.028
Neg logit contributions
storms-0.049
case-0.044
aces-0.042
 wrists-0.042
chair-0.041
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
storms+0.037
 wrists+0.034
aces+0.033
case+0.032
chair+0.032
Neg logit contributions
 help-0.010
 little-0.010
 check-0.010
 cook-0.009
 end-0.009
Token there
Token ID612
Feature activation+9.73e-01

Pos logit contributions
storms+0.040
case+0.038
chair+0.034
aces+0.034
 Caroline+0.031
Neg logit contributions
 permission-0.041
 cook-0.041
 check-0.040
 save-0.039
 help-0.039
Token was
Token ID373
Feature activation+5.90e-02

Pos logit contributions
storms+0.056
 wrists+0.052
aces+0.051
uts+0.048
case+0.047
Neg logit contributions
 little-0.015
 end-0.014
 help-0.014
 cook-0.014
 check-0.013
Token a
Token ID257
Feature activation-9.42e-02

Pos logit contributions
storms+0.003
case+0.003
aces+0.003
chair+0.003
 wrists+0.003
Neg logit contributions
 check-0.001
 help-0.001
 save-0.001
 cook-0.001
 permission-0.001
Token man
Token ID582
Feature activation-6.51e-01

Pos logit contributions
 permission+0.002
asting+0.002
ruck+0.002
 help+0.001
 save+0.001
Neg logit contributions
storms-0.004
case-0.004
chair-0.004
 wrists-0.004
 spe-0.004
Token named
Token ID3706
Feature activation+2.69e-01

Pos logit contributions
 permission+0.017
 adult+0.017
 moral+0.016
 end+0.016
arl+0.016
Neg logit contributions
storms-0.025
chair-0.023
case-0.022
aces-0.021
 wrists-0.021
Token Jack
Token ID3619
Feature activation-1.02e+00

Pos logit contributions
storms+0.009
case+0.008
aces+0.008
 wrists+0.008
chair+0.007
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 end-0.009
 forgive-0.009
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 adult+0.027
 permission+0.026
 help+0.026
 save+0.026
 install+0.026
Neg logit contributions
storms-0.040
aces-0.033
chair-0.033
case-0.032
 wrists-0.032
Token Jack
Token ID3619
Feature activation-3.07e-01

Pos logit contributions
 cook+0.004
 help+0.004
 check+0.004
 little+0.004
 end+0.004
Neg logit contributions
storms-0.006
 wrists-0.006
aces-0.006
case-0.005
chair-0.005
Token so
Token ID523
Feature activation+5.12e-01

Pos logit contributions
storms+0.009
 wrists+0.009
aces+0.009
overed+0.008
case+0.008
Neg logit contributions
 little-0.003
 cook-0.003
 end-0.003
 help-0.003
 check-0.003
Token much
Token ID881
Feature activation+1.59e+00

Pos logit contributions
storms+0.027
 wrists+0.024
aces+0.024
case+0.023
chair+0.022
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 end-0.008
Token traffic
Token ID4979
Feature activation-1.67e+00

Pos logit contributions
storms+0.070
 wrists+0.062
aces+0.061
case+0.060
chair+0.057
Neg logit contributions
 check-0.041
 help-0.040
 cook-0.040
 little-0.039
 end-0.038
Token?"
Token ID1701
Feature activation-4.20e-01

Pos logit contributions
 cook+0.028
 check+0.027
 end+0.027
 little+0.026
 help+0.026
Neg logit contributions
storms-0.089
 wrists-0.084
aces-0.081
uts-0.077
 Caroline-0.076
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
 check+0.010
 help+0.010
 cook+0.009
asting+0.009
 end+0.009
Neg logit contributions
storms-0.019
 wrists-0.017
case-0.017
aces-0.017
chair-0.016
Token
Token ID198
Feature activation-3.12e-01

Pos logit contributions
 cook+0.005
 check+0.005
 help+0.005
 permission+0.005
 save+0.005
Neg logit contributions
storms-0.015
case-0.014
chair-0.013
 wrists-0.013
uts-0.013
TokenHer
Token ID9360
Feature activation-1.09e+00

Pos logit contributions
 check+0.008
 help+0.008
 permission+0.008
 cook+0.007
asting+0.007
Neg logit contributions
storms-0.013
case-0.013
 wrists-0.012
chair-0.012
uts-0.011
Token mom
Token ID1995
Feature activation+1.49e+00

Pos logit contributions
 check+0.024
 cook+0.023
 help+0.023
asting+0.023
 save+0.022
Neg logit contributions
storms-0.050
 wrists-0.045
case-0.044
aces-0.044
chair-0.043
Tokenmy
Token ID1820
Feature activation-7.66e-01

Pos logit contributions
storms+0.064
 wrists+0.056
case+0.056
aces+0.055
chair+0.054
Neg logit contributions
 check-0.039
 help-0.038
 cook-0.038
 little-0.036
 save-0.035
Token said
Token ID531
Feature activation+1.10e-01

Pos logit contributions
 check+0.021
 help+0.020
 cook+0.020
 little+0.019
 end+0.019
Neg logit contributions
storms-0.032
 wrists-0.028
case-0.028
aces-0.028
chair-0.027
Token,
Token ID11
Feature activation+7.52e-01

Pos logit contributions
storms+0.006
aces+0.006
 wrists+0.006
case+0.006
chair+0.005
Neg logit contributions
 help-0.002
 little-0.001
 cook-0.001
 check-0.001
 what-0.001
Token "
Token ID366
Feature activation+2.48e-01

Pos logit contributions
storms+0.007
 Sof+0.005
aces+0.005
 Caroline+0.005
 wrists+0.005
Neg logit contributions
asting-0.009
 save-0.008
 end-0.008
 cook-0.008
 check-0.008
TokenWhy
Token ID5195
Feature activation-9.06e-01

Pos logit contributions
storms+0.011
 wrists+0.009
case+0.009
chair+0.009
als+0.009
Neg logit contributions
 help-0.006
 check-0.006
 work-0.006
 permission-0.006
 little-0.006
Token are
Token ID389
Feature activation+1.38e+00

Pos logit contributions
 help+0.021
 cook+0.020
 check+0.020
 little+0.020
 end+0.019
Neg logit contributions
storms-0.044
 wrists-0.040
aces-0.038
case-0.037
chair-0.037
Token you
Token ID345
Feature activation-2.02e+00

Pos logit contributions
storms+0.062
aces+0.054
case+0.054
 wrists+0.054
chair+0.051
Neg logit contributions
 help-0.035
 check-0.034
 cook-0.034
 little-0.033
 end-0.032
Token sad
Token ID6507
Feature activation-2.31e+00

Pos logit contributions
 check+0.055
 help+0.053
 cook+0.053
 save+0.053
asting+0.050
Neg logit contributions
storms-0.081
aces-0.069
 wrists-0.068
case-0.068
uts-0.065
Token?"
Token ID1701
Feature activation-4.54e-01

Pos logit contributions
 cook+0.044
 help+0.044
 check+0.043
 little+0.043
 end+0.040
Neg logit contributions
storms-0.117
 wrists-0.107
aces-0.106
case-0.104
chair-0.100
Token
Token ID198
Feature activation-1.94e-01

Pos logit contributions
 permission+0.008
asting+0.008
 check+0.008
 cook+0.008
 end+0.008
Neg logit contributions
storms-0.022
case-0.020
chair-0.020
 wrists-0.019
aces-0.019
Token
Token ID198
Feature activation-2.23e-01

Pos logit contributions
 check+0.003
 cook+0.003
 help+0.003
 permission+0.003
 save+0.002
Neg logit contributions
storms-0.010
case-0.009
chair-0.009
aces-0.009
 wrists-0.009
TokenThe
Token ID464
Feature activation-3.74e-01

Pos logit contributions
 permission+0.005
 check+0.005
 cook+0.004
 help+0.004
asting+0.004
Neg logit contributions
storms-0.010
case-0.009
chair-0.009
 wrists-0.009
aces-0.009
Token snake
Token ID17522
Feature activation+6.57e-01

Pos logit contributions
 permission+0.016
asting+0.016
 check+0.016
 install+0.015
 save+0.015
Neg logit contributions
storms-0.009
case-0.008
chair-0.007
 wrists-0.006
oline-0.006
Token said
Token ID531
Feature activation+3.41e-01

Pos logit contributions
storms+0.033
aces+0.030
uts+0.029
 wrists+0.029
case+0.028
Neg logit contributions
 little-0.014
 help-0.013
 cook-0.013
 check-0.013
 end-0.012
Token now
Token ID783
Feature activation-7.16e-01

Pos logit contributions
storms+0.055
 wrists+0.050
aces+0.048
uts+0.046
case+0.045
Neg logit contributions
 help-0.017
 check-0.017
 cook-0.016
 little-0.016
 end-0.016
Token?"
Token ID1701
Feature activation-8.53e-01

Pos logit contributions
 check+0.013
 help+0.013
 cook+0.013
 little+0.012
 end+0.011
Neg logit contributions
storms-0.038
 wrists-0.034
aces-0.034
case-0.033
chair-0.032
Token
Token ID198
Feature activation-8.99e-01

Pos logit contributions
 check+0.027
 help+0.026
 cook+0.026
 little+0.025
 end+0.024
Neg logit contributions
storms-0.033
aces-0.028
 wrists-0.028
case-0.028
chair-0.026
Token
Token ID198
Feature activation-8.43e-01

Pos logit contributions
 cook+0.016
 check+0.015
asting+0.015
 permission+0.014
 help+0.014
Neg logit contributions
storms-0.043
case-0.042
chair-0.040
uts-0.039
 wrists-0.039
Token"
Token ID1
Feature activation+6.75e-01

Pos logit contributions
 check+0.018
 help+0.017
 cook+0.017
 permission+0.016
 work+0.016
Neg logit contributions
storms-0.037
case-0.036
chair-0.034
 wrists-0.034
uts-0.034
TokenYes
Token ID5297
Feature activation-4.07e-01

Pos logit contributions
storms+0.032
aces+0.028
case+0.027
 wrists+0.027
chair+0.026
Neg logit contributions
 check-0.016
 help-0.016
 cook-0.015
 little-0.015
 forgive-0.014
Token,
Token ID11
Feature activation+5.26e-01

Pos logit contributions
 help+0.009
 little+0.008
 check+0.008
 cook+0.008
 end+0.008
Neg logit contributions
storms-0.021
aces-0.018
 wrists-0.018
case-0.018
uts-0.017
Token of
Token ID286
Feature activation+1.29e+00

Pos logit contributions
storms+0.025
 wrists+0.021
aces+0.021
case+0.021
chair+0.020
Neg logit contributions
 help-0.013
 check-0.012
 cook-0.012
 little-0.012
 end-0.011
Token course
Token ID1781
Feature activation+1.32e-01

Pos logit contributions
storms+0.054
aces+0.045
case+0.045
 wrists+0.045
uts+0.044
Neg logit contributions
 help-0.040
 check-0.039
 end-0.038
 little-0.038
 cook-0.037
Token,"
Token ID553
Feature activation-8.93e-01

Pos logit contributions
storms+0.007
aces+0.006
case+0.006
 wrists+0.006
uts+0.006
Neg logit contributions
 help-0.003
 little-0.003
 check-0.003
 what-0.002
 cook-0.002
Token Mom
Token ID11254
Feature activation+2.03e+00

Pos logit contributions
 check+0.024
 help+0.023
 cook+0.023
 little+0.022
 end+0.021
Neg logit contributions
storms-0.039
aces-0.034
case-0.034
 wrists-0.034
chair-0.032
Token walked
Token ID6807
Feature activation-9.87e-02

Pos logit contributions
storms+0.032
aces+0.028
case+0.028
uts+0.027
chair+0.027
Neg logit contributions
 little-0.015
 check-0.014
 cook-0.014
 pick-0.013
 help-0.013
Token to
Token ID284
Feature activation-3.96e-01

Pos logit contributions
 check+0.002
 cook+0.002
 help+0.002
 little+0.002
 end+0.002
Neg logit contributions
storms-0.004
 wrists-0.004
case-0.004
aces-0.004
chair-0.004
Token the
Token ID262
Feature activation+3.84e-01

Pos logit contributions
asting+0.009
 cook+0.009
 end+0.008
ruck+0.008
 working+0.008
Neg logit contributions
storms-0.015
 Caroline-0.015
 wrists-0.014
 Sof-0.014
 Ruth-0.014
Token middle
Token ID3504
Feature activation+1.82e+00

Pos logit contributions
storms+0.016
 wrists+0.014
case+0.014
aces+0.014
chair+0.013
Neg logit contributions
 check-0.010
 help-0.010
 cook-0.010
 save-0.009
asting-0.009
Token of
Token ID286
Feature activation-1.15e+00

Pos logit contributions
storms+0.108
 wrists+0.096
aces+0.094
case+0.092
uts+0.090
Neg logit contributions
 help-0.023
 little-0.022
 end-0.021
 cook-0.021
 check-0.021
Token the
Token ID262
Feature activation-7.26e-01

Pos logit contributions
 check+0.019
 cook+0.019
 help+0.019
 little+0.017
 save+0.017
Neg logit contributions
storms-0.059
 wrists-0.053
aces-0.052
case-0.052
chair-0.051
Token room
Token ID2119
Feature activation-1.41e+00

Pos logit contributions
 check+0.019
 help+0.019
 cook+0.019
 forgive+0.018
asting+0.018
Neg logit contributions
storms-0.029
 wrists-0.026
case-0.026
chair-0.025
aces-0.025
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
 help+0.017
 cook+0.017
 check+0.017
 little+0.017
 end+0.014
Neg logit contributions
storms-0.082
aces-0.073
 wrists-0.073
case-0.072
chair-0.070
Token tw
Token ID665
Feature activation+1.41e+00

Pos logit contributions
storms+0.010
aces+0.009
case+0.009
 wrists+0.008
uts+0.008
Neg logit contributions
 check-0.005
 cook-0.005
 help-0.005
 little-0.005
 pick-0.004
Tokenirled
Token ID49376
Feature activation+6.18e-01

Pos logit contributions
storms+0.035
case+0.027
 wrists+0.026
aces+0.026
chair+0.024
Neg logit contributions
 check-0.063
 help-0.062
 cook-0.062
 little-0.061
 end-0.060
Token around
Token ID1088
Feature activation+5.26e-01

Pos logit contributions
storms+0.027
case+0.025
aces+0.023
 wrists+0.023
uts+0.023
Neg logit contributions
 little-0.017
 help-0.016
 check-0.016
 pick-0.015
 two-0.015
Token home
Token ID1363
Feature activation+1.88e-01

Pos logit contributions
storms+0.064
 wrists+0.057
case+0.057
aces+0.057
chair+0.055
Neg logit contributions
 check-0.022
 cook-0.022
 help-0.021
 little-0.020
 end-0.019
Token,
Token ID11
Feature activation-6.81e-01

Pos logit contributions
storms+0.011
 wrists+0.010
aces+0.010
case+0.009
chair+0.009
Neg logit contributions
 check-0.002
 help-0.002
 cook-0.002
 little-0.002
 end-0.002
Token so
Token ID523
Feature activation+4.81e-01

Pos logit contributions
 check+0.015
 help+0.014
 little+0.014
 cook+0.013
 pick+0.013
Neg logit contributions
storms-0.034
aces-0.030
case-0.030
 wrists-0.029
oline-0.029
Token she
Token ID673
Feature activation+1.33e+00

Pos logit contributions
storms+0.018
 wrists+0.015
case+0.015
aces+0.015
chair+0.014
Neg logit contributions
 check-0.016
 help-0.016
 cook-0.016
 little-0.015
 end-0.015
Token wr
Token ID1319
Feature activation+7.21e-01

Pos logit contributions
storms+0.060
aces+0.052
 wrists+0.052
uts+0.050
case+0.050
Neg logit contributions
 check-0.034
 little-0.033
 help-0.033
 cook-0.033
 save-0.031
Tokenigg
Token ID6950
Feature activation-8.87e-01

Pos logit contributions
storms+0.025
 wrists+0.021
aces+0.021
case+0.021
chair+0.020
Neg logit contributions
 check-0.025
 help-0.024
 cook-0.024
 little-0.024
 end-0.023
Tokenled
Token ID992
Feature activation+5.22e-01

Pos logit contributions
 check+0.036
 help+0.035
 cook+0.035
 little+0.034
 end+0.034
Neg logit contributions
storms-0.026
 wrists-0.021
case-0.021
aces-0.020
chair-0.019
Token away
Token ID1497
Feature activation+1.67e+00

Pos logit contributions
storms+0.028
case+0.025
aces+0.025
 wrists+0.024
uts+0.024
Neg logit contributions
 help-0.010
 check-0.010
 little-0.009
 cook-0.008
 work-0.008
Token from
Token ID422
Feature activation+8.83e-01

Pos logit contributions
storms+0.090
 wrists+0.080
case+0.080
aces+0.080
chair+0.077
Neg logit contributions
 check-0.026
 help-0.025
 cook-0.025
 little-0.023
 end-0.022
Token her
Token ID607
Feature activation-4.81e-01

Pos logit contributions
storms+0.045
case+0.040
aces+0.039
 wrists+0.039
chair+0.038
Neg logit contributions
 check-0.018
 help-0.017
 cook-0.016
 little-0.016
 end-0.016
Token mom
Token ID1995
Feature activation+5.88e-01

Pos logit contributions
 check+0.011
 cook+0.011
 help+0.010
asting+0.010
 save+0.010
Neg logit contributions
storms-0.022
 wrists-0.020
case-0.020
aces-0.019
chair-0.019
Token go
Token ID467
Feature activation+6.68e-01

Pos logit contributions
storms+0.004
 wrists+0.004
case+0.004
aces+0.004
chair+0.004
Neg logit contributions
asting-0.002
 end-0.001
 adult-0.001
 permission-0.001
 check-0.001
Token on
Token ID319
Feature activation-1.82e-01

Pos logit contributions
storms+0.021
chair+0.019
case+0.019
aces+0.018
uts+0.017
Neg logit contributions
 permission-0.021
 adult-0.020
 forgive-0.020
 end-0.020
asting-0.020
Token walks
Token ID11114
Feature activation-2.53e+00

Pos logit contributions
 help+0.006
 check+0.006
 little+0.006
 work+0.006
 cook+0.006
Neg logit contributions
storms-0.007
aces-0.006
 wrists-0.006
case-0.006
uts-0.006
Token in
Token ID287
Feature activation-1.17e+00

Pos logit contributions
 check+0.039
 help+0.039
 cook+0.038
 little+0.035
 pick+0.033
Neg logit contributions
storms-0.138
 wrists-0.125
aces-0.123
case-0.121
chair-0.117
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
 check+0.016
 help+0.016
 cook+0.015
 little+0.014
 end+0.014
Neg logit contributions
storms-0.066
aces-0.059
 wrists-0.059
case-0.059
chair-0.057
Token woods
Token ID16479
Feature activation-2.89e+00

Pos logit contributions
 help+0.003
 check+0.003
 cook+0.003
 save+0.003
 permission+0.003
Neg logit contributions
storms-0.006
case-0.005
 wrists-0.005
aces-0.005
chair-0.005
Token with
Token ID351
Feature activation-1.48e-01

Pos logit contributions
 check+0.048
 help+0.046
 cook+0.046
 little+0.043
 end+0.041
Neg logit contributions
storms-0.151
case-0.135
aces-0.134
 wrists-0.134
chair-0.130
Token her
Token ID607
Feature activation+3.97e-01

Pos logit contributions
 help+0.003
 check+0.003
 little+0.003
 cook+0.002
 pick+0.002
Neg logit contributions
storms-0.008
aces-0.007
case-0.007
 wrists-0.007
chair-0.007
Token dog
Token ID3290
Feature activation+4.90e-01

Pos logit contributions
storms+0.022
aces+0.020
case+0.019
 wrists+0.019
chair+0.019
Neg logit contributions
 help-0.006
 little-0.006
 check-0.006
 cook-0.006
 end-0.005
Token,
Token ID11
Feature activation+5.99e-01

Pos logit contributions
storms+0.023
 wrists+0.020
aces+0.020
case+0.020
chair+0.019
Neg logit contributions
 check-0.011
 cook-0.010
 help-0.010
asting-0.010
 end-0.010
Token Spot
Token ID15899
Feature activation-3.19e-01

Pos logit contributions
storms+0.025
aces+0.022
case+0.022
 wrists+0.021
chair+0.020
Neg logit contributions
 help-0.018
 check-0.018
 cook-0.017
 little-0.017
 pick-0.016
Token Tim
Token ID5045
Feature activation-3.58e-01

Pos logit contributions
storms+0.032
chair+0.027
case+0.026
 wrists+0.025
uts+0.024
Neg logit contributions
 permission-0.040
 check-0.039
asting-0.039
 adult-0.037
 cook-0.037
Tokenmy
Token ID1820
Feature activation-1.05e+00

Pos logit contributions
 help+0.007
 check+0.006
 cook+0.006
 save+0.006
 permission+0.006
Neg logit contributions
storms-0.018
aces-0.016
case-0.016
chair-0.015
 wrists-0.015
Token went
Token ID1816
Feature activation+7.64e-01

Pos logit contributions
 check+0.026
 help+0.026
 cook+0.025
 little+0.024
 end+0.024
Neg logit contributions
storms-0.046
case-0.040
 wrists-0.040
aces-0.040
chair-0.038
Token to
Token ID284
Feature activation-6.35e-01

Pos logit contributions
storms+0.024
chair+0.021
case+0.021
aces+0.020
uts+0.019
Neg logit contributions
asting-0.024
 end-0.024
 forgive-0.024
 adult-0.023
 little-0.023
Token the
Token ID262
Feature activation+1.61e-01

Pos logit contributions
asting+0.016
 forgive+0.013
 permission+0.013
 end+0.013
 adult+0.013
Neg logit contributions
storms-0.026
 wrists-0.023
case-0.023
 Caroline-0.023
chair-0.023
Token museum
Token ID13257
Feature activation-3.44e+00

Pos logit contributions
storms+0.006
case+0.006
 wrists+0.005
chair+0.005
aces+0.005
Neg logit contributions
 help-0.004
 check-0.004
 permission-0.004
asting-0.004
 forgive-0.004
Token with
Token ID351
Feature activation-4.60e-01

Pos logit contributions
 check+0.070
 help+0.068
 cook+0.068
 little+0.064
 end+0.062
Neg logit contributions
storms-0.169
 wrists-0.149
aces-0.149
case-0.148
chair-0.142
Token his
Token ID465
Feature activation-1.47e-01

Pos logit contributions
 check+0.005
 help+0.005
 cook+0.005
 little+0.005
 end+0.004
Neg logit contributions
storms-0.026
 wrists-0.024
case-0.024
aces-0.024
chair-0.023
Token mom
Token ID1995
Feature activation+1.53e+00

Pos logit contributions
 check+0.003
 help+0.003
 cook+0.003
 little+0.003
 end+0.002
Neg logit contributions
storms-0.007
aces-0.007
 wrists-0.007
case-0.007
chair-0.006
Tokenmy
Token ID1820
Feature activation-9.74e-01

Pos logit contributions
storms+0.057
chair+0.047
case+0.047
aces+0.045
 wrists+0.044
Neg logit contributions
 adult-0.039
 save-0.039
 apology-0.038
 survive-0.038
 moral-0.037
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
 check+0.021
 help+0.020
 forgive+0.020
asting+0.020
 save+0.020
Neg logit contributions
storms-0.044
case-0.038
aces-0.037
 wrists-0.037
chair-0.037
Token there
Token ID612
Feature activation+9.23e-01

Pos logit contributions
storms+0.041
case+0.038
aces+0.036
chair+0.035
 wrists+0.034
Neg logit contributions
 check-0.033
 cook-0.033
 permission-0.033
 help-0.033
 save-0.032
Token was
Token ID373
Feature activation-8.28e-02

Pos logit contributions
storms+0.052
 wrists+0.048
aces+0.048
uts+0.045
case+0.044
Neg logit contributions
 little-0.015
 end-0.014
 help-0.014
 cook-0.013
 check-0.013
Token a
Token ID257
Feature activation-1.44e-01

Pos logit contributions
 check+0.001
 help+0.001
 save+0.001
 cook+0.001
 permission+0.001
Neg logit contributions
storms-0.004
case-0.004
aces-0.004
chair-0.004
 wrists-0.004
Token mom
Token ID1995
Feature activation+1.52e+00

Pos logit contributions
 permission+0.003
asting+0.002
 help+0.002
ruck+0.002
 save+0.002
Neg logit contributions
storms-0.007
case-0.006
chair-0.006
 wrists-0.006
 spe-0.006
Tokenmy
Token ID1820
Feature activation-1.29e+00

Pos logit contributions
storms+0.065
chair+0.059
case+0.056
 wrists+0.054
aces+0.053
Neg logit contributions
 permission-0.033
 save-0.032
 help-0.032
 adult-0.031
 check-0.030
Token who
Token ID508
Feature activation-3.38e+00

Pos logit contributions
 permission+0.031
 moral+0.030
 forgive+0.030
 help+0.029
 adult+0.029
Neg logit contributions
storms-0.054
chair-0.048
case-0.043
 wrists-0.043
uts-0.041
Token liked
Token ID8288
Feature activation-1.20e+00

Pos logit contributions
 check+0.091
 help+0.090
 permission+0.089
 adult+0.086
 end+0.085
Neg logit contributions
storms-0.140
case-0.122
chair-0.117
 wrists-0.116
aces-0.115
Token to
Token ID284
Feature activation+4.89e-01

Pos logit contributions
 check+0.024
 cook+0.024
 help+0.024
 little+0.023
 save+0.021
Neg logit contributions
storms-0.059
aces-0.053
case-0.053
 wrists-0.053
chair-0.050
Token bake
Token ID28450
Feature activation+3.67e-01

Pos logit contributions
storms+0.022
case+0.022
 wrists+0.021
chair+0.021
aces+0.021
Neg logit contributions
asting-0.008
 adult-0.007
 permission-0.007
 check-0.007
 end-0.007
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
storms+0.016
 wrists+0.014
aces+0.013
case+0.013
chair+0.013
Neg logit contributions
 check-0.010
 help-0.010
 cook-0.009
 little-0.009
 end-0.009
Token Every
Token ID3887
Feature activation-5.30e-01

Pos logit contributions
storms+0.011
 wrists+0.009
case+0.009
aces+0.009
chair+0.009
Neg logit contributions
 check-0.009
 help-0.009
 cook-0.009
 little-0.008
 end-0.008
Token that
Token ID326
Feature activation-9.70e-01

Pos logit contributions
storms+0.028
 wrists+0.024
case+0.024
aces+0.024
chair+0.023
Neg logit contributions
 check-0.012
 help-0.012
 cook-0.012
 little-0.011
 end-0.011
Token day
Token ID1110
Feature activation-3.03e-01

Pos logit contributions
 help+0.014
 check+0.014
 cook+0.013
 little+0.013
 end+0.013
Neg logit contributions
storms-0.054
 wrists-0.049
aces-0.049
case-0.048
uts-0.046
Token on
Token ID319
Feature activation+2.03e+00

Pos logit contributions
 check+0.007
 help+0.007
 cook+0.007
 little+0.007
 end+0.006
Neg logit contributions
storms-0.014
 wrists-0.012
case-0.012
aces-0.012
chair-0.012
Token,
Token ID11
Feature activation+3.45e-01

Pos logit contributions
storms+0.124
 wrists+0.117
aces+0.115
case+0.112
chair+0.109
Neg logit contributions
 help-0.022
 little-0.022
 end-0.016
 work-0.015
 two-0.014
Token the
Token ID262
Feature activation-5.08e-01

Pos logit contributions
storms+0.019
 wrists+0.017
aces+0.017
case+0.017
chair+0.017
Neg logit contributions
 help-0.005
 check-0.005
 cook-0.005
 little-0.005
 end-0.004
Token little
Token ID1310
Feature activation-2.89e+00

Pos logit contributions
 check+0.005
 cook+0.004
 help+0.004
 permission+0.004
 save+0.004
Neg logit contributions
storms-0.029
case-0.027
 wrists-0.026
aces-0.026
chair-0.026
Token bird
Token ID6512
Feature activation-1.63e-01

Pos logit contributions
 little+0.072
 help+0.070
 check+0.069
 cook+0.067
 end+0.066
Neg logit contributions
storms-0.135
case-0.120
aces-0.119
 wrists-0.118
uts-0.115
Tokenie
Token ID494
Feature activation-7.41e-01

Pos logit contributions
 check+0.004
 help+0.004
 cook+0.004
 little+0.004
 save+0.004
Neg logit contributions
storms-0.007
aces-0.006
case-0.006
 wrists-0.006
chair-0.006
Token was
Token ID373
Feature activation-9.86e-01

Pos logit contributions
 help+0.014
 cook+0.014
 little+0.014
 check+0.013
 end+0.013
Neg logit contributions
storms-0.037
 wrists-0.034
case-0.034
aces-0.033
chair-0.032
Token not
Token ID407
Feature activation-1.62e+00

Pos logit contributions
 help+0.020
 check+0.020
 cook+0.019
 little+0.019
 end+0.018
Neg logit contributions
storms-0.050
 wrists-0.045
aces-0.044
case-0.044
chair-0.043
Token shy
Token ID15800
Feature activation-5.76e-01

Pos logit contributions
 check+0.039
 help+0.039
 cook+0.038
 little+0.038
 end+0.035
Neg logit contributions
storms-0.073
case-0.065
 wrists-0.065
aces-0.064
chair-0.062
Token they
Token ID484
Feature activation+5.33e-01

Pos logit contributions
storms+0.011
aces+0.009
 wrists+0.009
case+0.009
chair+0.009
Neg logit contributions
 help-0.006
 check-0.006
 cook-0.006
 little-0.006
 end-0.005
Token marched
Token ID23558
Feature activation+5.40e-01

Pos logit contributions
storms+0.024
aces+0.021
 wrists+0.020
case+0.020
chair+0.020
Neg logit contributions
 check-0.014
 help-0.014
 cook-0.014
 little-0.013
 end-0.013
Token out
Token ID503
Feature activation-7.28e-01

Pos logit contributions
storms+0.026
aces+0.023
 wrists+0.023
case+0.023
chair+0.022
Neg logit contributions
 help-0.012
 check-0.012
 little-0.011
 cook-0.011
 end-0.011
Token of
Token ID286
Feature activation-1.19e+00

Pos logit contributions
 check+0.014
 help+0.014
 cook+0.014
 little+0.013
 end+0.013
Neg logit contributions
storms-0.036
 wrists-0.032
aces-0.032
case-0.032
chair-0.031
Token the
Token ID262
Feature activation-7.62e-01

Pos logit contributions
 check+0.016
 help+0.016
 cook+0.014
 little+0.014
 end+0.013
Neg logit contributions
storms-0.068
aces-0.061
case-0.061
 wrists-0.060
chair-0.059
Token water
Token ID1660
Feature activation-3.00e+00

Pos logit contributions
 cook+0.020
 check+0.020
 help+0.020
 permission+0.019
 chop+0.019
Neg logit contributions
storms-0.027
 wrists-0.025
case-0.025
 lands-0.024
chair-0.023
Token and
Token ID290
Feature activation-1.64e-01

Pos logit contributions
 help+0.047
 check+0.045
 little+0.044
 cook+0.044
 end+0.042
Neg logit contributions
storms-0.162
aces-0.146
 wrists-0.146
case-0.144
chair-0.141
Token onto
Token ID4291
Feature activation+8.06e-01

Pos logit contributions
 help+0.004
 check+0.004
 little+0.004
 cook+0.004
 save+0.003
Neg logit contributions
storms-0.008
aces-0.007
case-0.007
 wrists-0.007
uts-0.007
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
storms+0.045
aces+0.040
case+0.040
 wrists+0.040
chair+0.039
Neg logit contributions
 check-0.011
 help-0.011
 cook-0.011
 little-0.010
 end-0.009
Token banks
Token ID6341
Feature activation-3.50e+00

Pos logit contributions
storms+0.007
 wrists+0.007
case+0.006
aces+0.006
chair+0.006
Neg logit contributions
 check-0.004
 cook-0.004
 forgive-0.004
 help-0.004
 install-0.004
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
 check+0.070
 cook+0.064
 help+0.064
 save+0.059
 little+0.057
Neg logit contributions
storms-0.171
case-0.153
aces-0.152
 wrists-0.147
chair-0.147