TOP ACTIVATIONS
MAX = 0.101

his mom left the ha ird ress er , happy with
the chair and the ha ird ress er began to cut
in and let the ha ird ress er go to work
They put Lily on a stret cher and carried her to
am ready !" The ha ird ress er put a cape
very refreshed and thanked the tort oise for her good nap
different today ." The ha ird ress er smiled and said
all ," replied the ha ird ress er . " It
hair salon . The ha ird ress er was very kind
like it and the ha ird ress er said it was
" Can we keep the re ind eer , please ?"
happy and thanked the ha ird ress er . As they
's turn . The ha ird ress er asked her what
his mom and the ha ird ress er . " I
very still so the ha ird ress er could do a
so he asked the ha ird ress er to give him
friend who was a ha ird ress er and asked for
him to a reliable ha ird ress er , who always
made sure to keep the m ule tidy and clean ,
we go ?" said the younger brother . " Let

MIN ACTIVATIONS
MIN = -0.082

very restless . Billy was amazed by the bird and smiled
!" His dad was so amazed by the fish too .
. But John was too proud . Suddenly , the steak
The old person was amazed and said ," That 's
Tim my was amazed by the bird 's answer
Tim my was amazed that the owl was so
the wind ." Lily was amazed at how something so small
. Jimmy was amazed at how much they had
's shop , John was amazed . The shop had colorful
Lu cy was so amazed that she yelled , "
His mom and dad looked amazed . They asked him where
." But Lucy was too excited and ran across the road
started singing . Billy was amazed . He thought the bird
L ily was amazed . She wanted to hear
L ily was amazed and asked her mom my
, but he was too excited and he slipped over .
apped , but everyone was amazed . The mighty
but the nurse was very surprised and shouted " O uch
ling . Bob was so amazed by the pretty lights ,
the butterfly ?" Jack was amazed and asked , " Why

INTERVAL 0.055 - 0.101
CONTAINS 0.232%

every day . She liked to pretend that it was a
little house , lived a girl named Sue . Sue had
named Lily . She loved to play outside and explore the
things â € � � col ouring books , stickers and
glad they got their own z ig zag . It looked

INTERVAL 0.009 - 0.055
CONTAINS 32.913%

and reached for their fingers . Lily and Tom felt a
a time , there was a little pig named O ink
share and be nice ," Mom says . " P ill
the icy mountain behind her house . So she put on
. <|endoftext|> Once upon a time , in a small town

INTERVAL -0.036 - 0.009
CONTAINS 63.241%

many buttons and levers . They press and pull them .
old . Claire wanted something special to eat , so she
louder . Suddenly , a voice spoke to him : "
the store manager . She couldn 't find him , so
. He wanted to try too . He ran to the

INTERVAL -0.082 - -0.036
CONTAINS 3.614%

! She asked her mom what it was called . Her
knew his way home . As Tim walked ,
slip again . He also learnt that it is important to
face . The frog got annoyed and slapped the fly with
. The tree looked so happy ! B ub

Token his
Token ID465
Feature activation+3.15e-02

Pos logit contributions
 911+0.043
 Amen+0.042
 liar+0.042
 elbow+0.042
 patients+0.041
Neg logit contributions
That-0.018
 was-0.016
-0.015
This-0.014
 said-0.014
Token mom
Token ID1995
Feature activation-8.14e-04

Pos logit contributions
 911+0.052
 Amen+0.051
 whine+0.048
 elbow+0.048
 Acc+0.047
Neg logit contributions
s-0.048
That-0.047
 sky-0.047
 big-0.046
 beautiful-0.046
Token left
Token ID1364
Feature activation+1.99e-02

Pos logit contributions
That+0.001
s+0.001
This+0.001
aw+0.001
It+0.001
Neg logit contributions
 elbow-0.002
 knee-0.002
 sister-0.002
 sorry-0.002
 whine-0.002
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
 elbow+0.050
 knee+0.049
 sister+0.047
 Amen+0.046
 whine+0.046
Neg logit contributions
s-0.017
That-0.016
This-0.015
aw-0.014
 that-0.014
Token ha
Token ID387
Feature activation+6.12e-02

Pos logit contributions
That+0.022
This+0.021
It+0.020
s+0.020
There+0.020
Neg logit contributions
 elbow-0.017
 knee-0.016
 911-0.015
 sister-0.015
 Kim-0.014
Tokenird
Token ID1447
Feature activation+1.01e-01

Pos logit contributions
 elbow+0.080
 knee+0.075
 sister+0.068
 sorry+0.067
 whine+0.066
Neg logit contributions
s-0.138
aw-0.131
That-0.128
This-0.122
There-0.122
Tokenress
Token ID601
Feature activation+5.91e-02

Pos logit contributions
 elbow+0.131
 sorry+0.116
 whine+0.110
 boo+0.108
 knee+0.107
Neg logit contributions
s-0.252
That-0.206
aw-0.205
st-0.201
ard-0.201
Tokener
Token ID263
Feature activation+4.85e-02

Pos logit contributions
 whine+0.146
 elbow+0.146
 Acc+0.143
 911+0.140
 sorry+0.139
Neg logit contributions
s-0.059
aw-0.053
 that-0.048
!-0.041
's-0.040
Token,
Token ID11
Feature activation+1.90e-02

Pos logit contributions
 elbow+0.114
 knee+0.108
 911+0.103
 sister+0.102
 sorry+0.102
Neg logit contributions
s-0.060
That-0.050
aw-0.048
It-0.044
This-0.044
Token happy
Token ID3772
Feature activation-2.25e-02

Pos logit contributions
 elbow+0.050
 knee+0.048
 sister+0.047
 911+0.046
 whine+0.045
Neg logit contributions
That-0.017
s-0.015
It-0.015
This-0.014
There-0.013
Token with
Token ID351
Feature activation+1.16e-02

Pos logit contributions
That+0.019
s+0.019
This+0.017
It+0.016
aw+0.016
Neg logit contributions
 elbow-0.062
 knee-0.059
 sister-0.057
 whine-0.054
 sorry-0.054
Token the
Token ID262
Feature activation-2.25e-04

Pos logit contributions
 elbow+0.032
 knee+0.031
 sister+0.030
 sorry+0.028
 whine+0.028
Neg logit contributions
That-0.013
This-0.012
s-0.012
It-0.011
There-0.011
Token chair
Token ID5118
Feature activation+1.59e-02

Pos logit contributions
That+0.000
This+0.000
s+0.000
A+0.000
It+0.000
Neg logit contributions
 elbow-0.000
 sister-0.000
 sorry-0.000
 knee-0.000
 Amen-0.000
Token and
Token ID290
Feature activation+5.81e-03

Pos logit contributions
 elbow+0.042
 knee+0.041
 sister+0.040
 sorry+0.038
 whine+0.037
Neg logit contributions
That-0.014
s-0.013
This-0.012
aw-0.012
It-0.011
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
 elbow+0.014
 knee+0.014
 sister+0.013
 sorry+0.013
 911+0.012
Neg logit contributions
That-0.007
s-0.006
This-0.006
It-0.006
aw-0.006
Token ha
Token ID387
Feature activation+5.59e-02

Pos logit contributions
 elbow+0.015
 Amen+0.015
 911+0.015
 Acc+0.014
 knee+0.014
Neg logit contributions
That-0.022
It-0.021
This-0.021
A-0.021
 sky-0.020
Tokenird
Token ID1447
Feature activation+9.91e-02

Pos logit contributions
 elbow+0.066
 knee+0.063
 sister+0.056
 sorry+0.054
 whine+0.051
Neg logit contributions
s-0.132
That-0.128
aw-0.126
This-0.123
There-0.121
Tokenress
Token ID601
Feature activation+4.54e-02

Pos logit contributions
 elbow+0.116
 sorry+0.107
 knee+0.101
 whine+0.095
 boo+0.094
Neg logit contributions
s-0.253
aw-0.219
That-0.218
-0.209
st-0.209
Tokener
Token ID263
Feature activation+3.12e-02

Pos logit contributions
 elbow+0.111
 whine+0.108
 sorry+0.107
 Amen+0.107
 911+0.105
Neg logit contributions
aw-0.046
s-0.046
That-0.040
This-0.034
 that-0.033
Token began
Token ID2540
Feature activation+4.70e-03

Pos logit contributions
 elbow+0.073
 knee+0.071
 sister+0.067
 sorry+0.065
 whine+0.063
Neg logit contributions
That-0.037
s-0.037
This-0.034
aw-0.033
It-0.032
Token to
Token ID284
Feature activation+3.30e-02

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.011
 Amen+0.010
 911+0.010
Neg logit contributions
s-0.005
That-0.005
This-0.004
It-0.004
st-0.004
Token cut
Token ID2005
Feature activation+9.45e-03

Pos logit contributions
 elbow+0.062
 knee+0.060
 911+0.058
 sister+0.057
 sorry+0.056
Neg logit contributions
That-0.051
This-0.051
It-0.048
s-0.047
There-0.045
Token in
Token ID287
Feature activation-7.66e-03

Pos logit contributions
 elbow+0.000
 knee+0.000
 sister+0.000
 sorry+0.000
 whine+0.000
Neg logit contributions
That-0.000
s-0.000
This-0.000
aw-0.000
It-0.000
Token and
Token ID290
Feature activation+9.90e-04

Pos logit contributions
That+0.009
s+0.009
This+0.008
aw+0.008
It+0.008
Neg logit contributions
 elbow-0.018
 knee-0.018
 sister-0.017
 sorry-0.016
 whine-0.016
Token let
Token ID1309
Feature activation-1.59e-02

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 sorry+0.002
 911+0.002
Neg logit contributions
s-0.001
That-0.001
This-0.001
It-0.001
There-0.001
Token the
Token ID262
Feature activation+1.16e-02

Pos logit contributions
That+0.022
This+0.021
It+0.020
s+0.020
aw+0.019
Neg logit contributions
 elbow-0.037
 knee-0.034
 sister-0.032
 whine-0.031
 Amen-0.031
Token ha
Token ID387
Feature activation+6.22e-02

Pos logit contributions
 elbow+0.019
 Acc+0.018
 knee+0.018
 Amen+0.018
 911+0.018
Neg logit contributions
That-0.020
This-0.019
s-0.019
It-0.019
A-0.018
Tokenird
Token ID1447
Feature activation+9.89e-02

Pos logit contributions
 elbow+0.077
 knee+0.071
 sorry+0.064
 sister+0.064
 whine+0.060
Neg logit contributions
s-0.147
aw-0.140
That-0.138
This-0.133
There-0.132
Tokenress
Token ID601
Feature activation+5.87e-02

Pos logit contributions
 elbow+0.125
 knee+0.104
 sorry+0.102
 boo+0.097
 whine+0.095
Neg logit contributions
s-0.253
That-0.222
aw-0.215
It-0.214
This-0.212
Tokener
Token ID263
Feature activation+5.20e-02

Pos logit contributions
 elbow+0.146
 whine+0.136
 knee+0.133
 boo+0.131
 Amen+0.131
Neg logit contributions
s-0.063
aw-0.062
That-0.057
This-0.050
 that-0.049
Token go
Token ID467
Feature activation+1.42e-02

Pos logit contributions
 elbow+0.119
 knee+0.112
 sister+0.104
 sorry+0.104
 Amen+0.102
Neg logit contributions
s-0.069
That-0.063
It-0.058
This-0.057
aw-0.056
Token to
Token ID284
Feature activation+1.88e-02

Pos logit contributions
 elbow+0.040
 knee+0.038
 sister+0.037
 sorry+0.035
 whine+0.034
Neg logit contributions
s-0.013
That-0.011
This-0.010
It-0.010
st-0.009
Token work
Token ID670
Feature activation-1.41e-02

Pos logit contributions
 elbow+0.047
 knee+0.045
 sister+0.044
 Amen+0.042
 sorry+0.041
Neg logit contributions
This-0.018
That-0.018
It-0.017
 me-0.015
 that-0.015
Token They
Token ID1119
Feature activation-1.77e-02

Pos logit contributions
 elbow+0.028
 knee+0.027
 sister+0.026
 sorry+0.025
 whine+0.025
Neg logit contributions
That-0.006
s-0.005
This-0.005
aw-0.004
It-0.004
Token put
Token ID1234
Feature activation+1.74e-02

Pos logit contributions
That+0.022
s+0.021
This+0.020
aw+0.019
It+0.019
Neg logit contributions
 elbow-0.041
 knee-0.040
 sister-0.037
 sorry-0.035
 whine-0.035
Token Lily
Token ID20037
Feature activation+2.07e-02

Pos logit contributions
 elbow+0.036
 knee+0.036
 sister+0.034
 Acc+0.033
 whine+0.032
Neg logit contributions
s-0.024
This-0.023
That-0.022
It-0.021
There-0.021
Token on
Token ID319
Feature activation+6.20e-03

Pos logit contributions
 elbow+0.051
 knee+0.047
 sister+0.046
 Acc+0.045
 sorry+0.045
Neg logit contributions
s-0.024
That-0.019
This-0.019
There-0.018
It-0.018
Token a
Token ID257
Feature activation+4.38e-04

Pos logit contributions
 elbow+0.015
 knee+0.014
 sister+0.014
 sorry+0.013
 whine+0.012
Neg logit contributions
That-0.007
s-0.007
This-0.007
It-0.007
aw-0.006
Token stret
Token ID5759
Feature activation+9.68e-02

Pos logit contributions
 elbow+0.001
 knee+0.001
 sister+0.001
 Acc+0.001
 sorry+0.000
Neg logit contributions
s-0.001
That-0.001
It-0.001
This-0.001
A-0.001
Tokencher
Token ID2044
Feature activation+3.98e-02

Pos logit contributions
 elbow+0.161
 knee+0.152
 sister+0.146
 boo+0.137
 whine+0.132
Neg logit contributions
s-0.180
That-0.166
It-0.158
There-0.155
This-0.155
Token and
Token ID290
Feature activation+2.13e-02

Pos logit contributions
 elbow+0.113
 knee+0.108
 sister+0.106
 sorry+0.101
 whine+0.100
Neg logit contributions
That-0.028
s-0.028
This-0.024
aw-0.023
It-0.022
Token carried
Token ID5281
Feature activation+2.35e-02

Pos logit contributions
 knee+0.052
 elbow+0.051
 Acc+0.046
 sister+0.046
 Amen+0.044
Neg logit contributions
s-0.020
 made-0.019
This-0.019
A-0.019
That-0.019
Token her
Token ID607
Feature activation+2.37e-02

Pos logit contributions
 elbow+0.053
 knee+0.050
 sister+0.046
 Acc+0.044
 boo+0.044
Neg logit contributions
s-0.033
This-0.031
That-0.031
It-0.030
There-0.029
Token to
Token ID284
Feature activation+2.92e-02

Pos logit contributions
 elbow+0.054
 knee+0.050
 sister+0.049
 sorry+0.047
 whine+0.047
Neg logit contributions
s-0.030
This-0.025
That-0.024
There-0.024
st-0.024
Token am
Token ID716
Feature activation+2.21e-02

Pos logit contributions
That+0.001
s+0.001
This+0.001
aw+0.001
It+0.001
Neg logit contributions
 elbow-0.001
 knee-0.001
 sister-0.001
 sorry-0.001
 whine-0.001
Token ready
Token ID3492
Feature activation-1.36e-02

Pos logit contributions
 elbow+0.056
 knee+0.053
 whine+0.048
 sister+0.048
 boo+0.046
Neg logit contributions
That-0.023
It-0.022
s-0.021
This-0.021
aw-0.020
Token!"
Token ID2474
Feature activation+1.31e-02

Pos logit contributions
s+0.014
That+0.014
This+0.013
It+0.013
aw+0.012
Neg logit contributions
 elbow-0.034
 knee-0.033
 sister-0.031
 whine-0.029
 sorry-0.029
Token The
Token ID383
Feature activation-3.86e-03

Pos logit contributions
 elbow+0.036
 knee+0.035
 sister+0.033
 whine+0.032
 sorry+0.032
Neg logit contributions
s-0.011
That-0.011
This-0.010
It-0.009
aw-0.009
Token ha
Token ID387
Feature activation+5.66e-02

Pos logit contributions
That+0.005
 sky+0.004
This+0.004
It+0.004
aw+0.004
Neg logit contributions
 elbow-0.009
 knee-0.008
 whine-0.008
 911-0.008
 Kim-0.007
Tokenird
Token ID1447
Feature activation+9.52e-02

Pos logit contributions
 elbow+0.126
 knee+0.122
 sister+0.114
 sorry+0.113
 911+0.111
Neg logit contributions
s-0.076
That-0.070
aw-0.070
This-0.065
It-0.063
Tokenress
Token ID601
Feature activation+3.81e-02

Pos logit contributions
 elbow+0.131
 sorry+0.113
 knee+0.112
 boo+0.109
 911+0.107
Neg logit contributions
s-0.228
That-0.195
aw-0.192
It-0.185
This-0.182
Tokener
Token ID263
Feature activation+2.29e-02

Pos logit contributions
 elbow+0.091
 whine+0.084
 knee+0.084
 911+0.083
 sorry+0.083
Neg logit contributions
s-0.043
aw-0.042
That-0.041
This-0.035
It-0.034
Token put
Token ID1234
Feature activation+3.70e-03

Pos logit contributions
 elbow+0.054
 knee+0.052
 sister+0.049
 sorry+0.047
 whine+0.046
Neg logit contributions
That-0.027
s-0.027
This-0.025
aw-0.024
It-0.024
Token a
Token ID257
Feature activation+6.43e-03

Pos logit contributions
 elbow+0.008
 knee+0.008
 sister+0.007
 whine+0.007
 sorry+0.007
Neg logit contributions
That-0.005
s-0.005
This-0.005
aw-0.005
It-0.005
Token cape
Token ID36023
Feature activation+2.19e-02

Pos logit contributions
 elbow+0.016
 knee+0.015
 911+0.014
 sister+0.014
 whine+0.014
Neg logit contributions
s-0.007
That-0.007
It-0.006
This-0.006
A-0.006
Token very
Token ID845
Feature activation+1.64e-02

Pos logit contributions
 elbow+0.053
 knee+0.049
 sister+0.047
 911+0.045
 whine+0.045
Neg logit contributions
That-0.026
s-0.024
It-0.024
This-0.024
aw-0.023
Token refreshed
Token ID47193
Feature activation-3.12e-02

Pos logit contributions
 elbow+0.034
 knee+0.030
 sister+0.028
 911+0.028
 whine+0.026
Neg logit contributions
s-0.026
That-0.026
It-0.024
This-0.024
aw-0.023
Token and
Token ID290
Feature activation+2.16e-02

Pos logit contributions
That+0.037
This+0.034
s+0.033
There+0.032
rawling+0.032
Neg logit contributions
 knee-0.071
 elbow-0.070
 sister-0.066
 sorry-0.064
 whine-0.061
Token thanked
Token ID26280
Feature activation+2.49e-02

Pos logit contributions
 elbow+0.046
 knee+0.043
 sister+0.041
 911+0.039
 whine+0.038
Neg logit contributions
That-0.033
s-0.032
It-0.030
This-0.030
There-0.029
Token the
Token ID262
Feature activation-8.98e-04

Pos logit contributions
 elbow+0.071
 knee+0.069
 sister+0.065
 whine+0.063
 boo+0.061
Neg logit contributions
That-0.020
This-0.019
It-0.017
 that-0.015
s-0.015
Token tort
Token ID7619
Feature activation+9.50e-02

Pos logit contributions
That+0.002
s+0.001
This+0.001
aw+0.001
It+0.001
Neg logit contributions
 elbow-0.002
 knee-0.002
 sister-0.001
 sorry-0.001
 whine-0.001
Tokenoise
Token ID25678
Feature activation+3.39e-02

Pos logit contributions
 elbow+0.175
 knee+0.168
 sister+0.158
 sorry+0.147
 whine+0.144
Neg logit contributions
That-0.164
s-0.160
This-0.154
aw-0.150
It-0.148
Token for
Token ID329
Feature activation+2.69e-02

Pos logit contributions
 elbow+0.077
 knee+0.070
 whine+0.064
 sister+0.064
 boo+0.064
Neg logit contributions
s-0.049
That-0.045
aw-0.041
This-0.041
rawling-0.038
Token her
Token ID607
Feature activation+1.92e-02

Pos logit contributions
 elbow+0.055
 knee+0.054
 sister+0.051
 whine+0.049
 boo+0.046
Neg logit contributions
That-0.040
This-0.038
It-0.037
 that-0.034
There-0.034
Token good
Token ID922
Feature activation-2.68e-04

Pos logit contributions
 elbow+0.039
 knee+0.035
 sister+0.032
 whine+0.032
 911+0.031
Neg logit contributions
s-0.029
That-0.028
This-0.027
It-0.026
There-0.025
Token nap
Token ID25422
Feature activation+8.01e-03

Pos logit contributions
s+0.000
That+0.000
aw+0.000
This+0.000
st+0.000
Neg logit contributions
 elbow-0.001
 knee-0.000
 911-0.000
 sister-0.000
 sorry-0.000
Token different
Token ID1180
Feature activation-2.27e-02

Pos logit contributions
 elbow+0.028
 knee+0.028
 sister+0.026
 911+0.024
 whine+0.024
Neg logit contributions
That-0.013
s-0.012
This-0.012
It-0.012
There-0.011
Token today
Token ID1909
Feature activation-1.74e-02

Pos logit contributions
s+0.027
That+0.026
This+0.024
aw+0.024
It+0.023
Neg logit contributions
 elbow-0.054
 knee-0.052
 sister-0.049
 sorry-0.047
 whine-0.046
Token."
Token ID526
Feature activation+1.30e-02

Pos logit contributions
That+0.020
s+0.019
This+0.018
aw+0.017
It+0.017
Neg logit contributions
 elbow-0.042
 knee-0.041
 sister-0.039
 sorry-0.037
 whine-0.036
Token The
Token ID383
Feature activation+1.16e-02

Pos logit contributions
 elbow+0.036
 knee+0.035
 sister+0.033
 whine+0.032
 sorry+0.032
Neg logit contributions
That-0.011
s-0.010
This-0.009
It-0.009
aw-0.008
Token ha
Token ID387
Feature activation+5.57e-02

Pos logit contributions
 elbow+0.019
 911+0.017
 knee+0.016
 Acc+0.016
 Amen+0.016
Neg logit contributions
That-0.021
A-0.020
It-0.020
This-0.020
s-0.019
Tokenird
Token ID1447
Feature activation+9.50e-02

Pos logit contributions
 elbow+0.076
 knee+0.072
 sister+0.064
 sorry+0.064
 911+0.062
Neg logit contributions
s-0.122
That-0.116
aw-0.115
This-0.109
It-0.109
Tokenress
Token ID601
Feature activation+3.73e-02

Pos logit contributions
 elbow+0.126
 sorry+0.110
 knee+0.104
 boo+0.103
 whine+0.100
Neg logit contributions
s-0.230
That-0.195
aw-0.191
It-0.187
-0.185
Tokener
Token ID263
Feature activation+1.77e-02

Pos logit contributions
 elbow+0.090
 whine+0.084
 knee+0.083
 sorry+0.082
 911+0.081
Neg logit contributions
s-0.040
aw-0.039
That-0.039
This-0.032
It-0.032
Token smiled
Token ID13541
Feature activation-2.31e-02

Pos logit contributions
 elbow+0.042
 knee+0.041
 sister+0.039
 sorry+0.037
 whine+0.037
Neg logit contributions
That-0.020
s-0.020
This-0.018
aw-0.017
It-0.017
Token and
Token ID290
Feature activation+3.88e-03

Pos logit contributions
That+0.021
s+0.020
This+0.018
aw+0.017
It+0.017
Neg logit contributions
 elbow-0.061
 knee-0.059
 sister-0.057
 sorry-0.054
 whine-0.054
Token said
Token ID531
Feature activation+2.25e-03

Pos logit contributions
 elbow+0.010
 knee+0.010
 sister+0.010
 911+0.009
 whine+0.009
Neg logit contributions
That-0.003
s-0.003
This-0.003
It-0.003
aw-0.003
Token all
Token ID477
Feature activation-7.82e-03

Pos logit contributions
That+0.024
s+0.023
This+0.022
It+0.021
aw+0.021
Neg logit contributions
 elbow-0.057
 knee-0.055
 sister-0.053
 sorry-0.050
 whine-0.049
Token,"
Token ID553
Feature activation+8.71e-03

Pos logit contributions
That+0.010
s+0.010
This+0.009
aw+0.009
It+0.009
Neg logit contributions
 elbow-0.018
 knee-0.017
 sister-0.016
 sorry-0.015
 whine-0.015
Token replied
Token ID8712
Feature activation+2.32e-02

Pos logit contributions
 elbow+0.024
 knee+0.024
 sister+0.023
 whine+0.022
 sorry+0.022
Neg logit contributions
That-0.007
s-0.006
This-0.006
It-0.006
aw-0.005
Token the
Token ID262
Feature activation+5.54e-03

Pos logit contributions
 elbow+0.061
 Amen+0.058
 knee+0.057
 whine+0.057
 liar+0.055
Neg logit contributions
 that-0.016
s-0.016
That-0.015
This-0.014
 me-0.013
Token ha
Token ID387
Feature activation+5.64e-02

Pos logit contributions
 elbow+0.008
 knee+0.007
 911+0.007
 Amen+0.007
 Acc+0.007
Neg logit contributions
That-0.012
This-0.011
A-0.011
It-0.011
There-0.010
Tokenird
Token ID1447
Feature activation+9.49e-02

Pos logit contributions
 elbow+0.080
 knee+0.074
 sorry+0.068
 911+0.068
 sister+0.066
Neg logit contributions
s-0.125
aw-0.114
That-0.111
ard-0.105
st-0.105
Tokenress
Token ID601
Feature activation+4.31e-02

Pos logit contributions
 elbow+0.133
 sorry+0.111
 knee+0.107
 whine+0.106
 boo+0.106
Neg logit contributions
s-0.235
That-0.187
aw-0.183
!-0.182
ard-0.182
Tokener
Token ID263
Feature activation+3.12e-02

Pos logit contributions
 elbow+0.109
 whine+0.101
 sorry+0.098
 knee+0.098
 sister+0.097
Neg logit contributions
s-0.047
aw-0.042
That-0.040
 that-0.033
ball-0.033
Token.
Token ID13
Feature activation-6.85e-03

Pos logit contributions
 elbow+0.085
 knee+0.081
 sister+0.077
 whine+0.076
 sorry+0.076
Neg logit contributions
s-0.025
That-0.023
aw-0.019
It-0.018
This-0.018
Token "
Token ID366
Feature activation-2.46e-02

Pos logit contributions
That+0.009
s+0.009
This+0.008
aw+0.008
It+0.008
Neg logit contributions
 elbow-0.015
 knee-0.015
 sister-0.014
 sorry-0.013
 whine-0.013
TokenIt
Token ID1026
Feature activation-2.55e-02

Pos logit contributions
s+0.022
That+0.020
aw+0.019
rawling+0.018
ures+0.018
Neg logit contributions
 elbow-0.063
 knee-0.062
 sister-0.058
 sorry-0.055
 whine-0.055
Token hair
Token ID4190
Feature activation+3.32e-02

Pos logit contributions
That+0.037
This+0.037
It+0.035
A+0.034
There+0.034
Neg logit contributions
 elbow-0.037
 knee-0.033
 sister-0.031
 whine-0.029
 sorry-0.029
Token salon
Token ID46608
Feature activation+3.79e-02

Pos logit contributions
 elbow+0.076
 knee+0.071
 sister+0.070
 sorry+0.070
 whine+0.067
Neg logit contributions
s-0.041
That-0.037
aw-0.036
This-0.035
It-0.033
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
 elbow+0.102
 knee+0.094
 sister+0.094
 sorry+0.090
 whine+0.090
Neg logit contributions
s-0.034
aw-0.026
That-0.025
This-0.024
 that-0.023
Token The
Token ID383
Feature activation+8.53e-04

Pos logit contributions
 elbow+0.036
 knee+0.033
 sister+0.031
 whine+0.030
 sorry+0.029
Neg logit contributions
That-0.013
This-0.012
s-0.012
It-0.012
There-0.012
Token ha
Token ID387
Feature activation+5.80e-02

Pos logit contributions
 elbow+0.001
 knee+0.001
 sister+0.001
 911+0.001
 Acc+0.001
Neg logit contributions
That-0.002
This-0.002
A-0.002
It-0.002
There-0.002
Tokenird
Token ID1447
Feature activation+9.41e-02

Pos logit contributions
 elbow+0.067
 knee+0.063
 sister+0.056
 sorry+0.055
 whine+0.054
Neg logit contributions
s-0.137
aw-0.133
That-0.132
This-0.127
There-0.127
Tokenress
Token ID601
Feature activation+3.32e-02

Pos logit contributions
 elbow+0.119
 sorry+0.106
 knee+0.103
 whine+0.101
 boo+0.100
Neg logit contributions
s-0.226
That-0.203
aw-0.199
It-0.197
This-0.192
Tokener
Token ID263
Feature activation+2.05e-02

Pos logit contributions
 elbow+0.084
 whine+0.082
 knee+0.077
 sorry+0.076
 Amen+0.076
Neg logit contributions
aw-0.031
s-0.030
That-0.029
 was-0.025
 that-0.025
Token was
Token ID373
Feature activation+7.96e-03

Pos logit contributions
 elbow+0.055
 knee+0.053
 sister+0.051
 sorry+0.049
 whine+0.048
Neg logit contributions
That-0.017
s-0.017
This-0.015
It-0.014
aw-0.014
Token very
Token ID845
Feature activation+1.03e-02

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.018
 whine+0.016
 911+0.016
Neg logit contributions
That-0.009
It-0.008
s-0.008
This-0.008
aw-0.008
Token kind
Token ID1611
Feature activation-3.29e-02

Pos logit contributions
 elbow+0.022
 knee+0.020
 911+0.019
 sister+0.019
 Amen+0.018
Neg logit contributions
s-0.015
That-0.015
It-0.014
This-0.013
A-0.013
Token like
Token ID588
Feature activation-1.36e-02

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.016
 sorry+0.015
 911+0.015
Neg logit contributions
That-0.014
s-0.013
It-0.013
This-0.013
There-0.012
Token it
Token ID340
Feature activation-2.99e-02

Pos logit contributions
That+0.017
s+0.017
This+0.016
aw+0.015
It+0.015
Neg logit contributions
 elbow-0.031
 knee-0.030
 sister-0.028
 sorry-0.027
 whine-0.026
Token and
Token ID290
Feature activation+7.71e-03

Pos logit contributions
That+0.033
s+0.031
This+0.030
aw+0.028
It+0.028
Neg logit contributions
 elbow-0.073
 knee-0.071
 sister-0.068
 sorry-0.064
 whine-0.063
Token the
Token ID262
Feature activation+5.68e-04

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.017
 whine+0.016
 911+0.016
Neg logit contributions
s-0.008
That-0.008
This-0.007
It-0.007
aw-0.007
Token ha
Token ID387
Feature activation+5.27e-02

Pos logit contributions
 elbow+0.001
 knee+0.001
 911+0.001
 Amen+0.001
 Acc+0.001
Neg logit contributions
That-0.001
This-0.001
 sky-0.001
It-0.001
A-0.001
Tokenird
Token ID1447
Feature activation+9.37e-02

Pos logit contributions
 elbow+0.085
 knee+0.081
 sister+0.075
 sorry+0.073
 whine+0.070
Neg logit contributions
s-0.102
That-0.097
aw-0.096
This-0.091
It-0.090
Tokenress
Token ID601
Feature activation+4.33e-02

Pos logit contributions
 elbow+0.121
 sorry+0.105
 knee+0.104
 boo+0.097
 whine+0.096
Neg logit contributions
s-0.226
That-0.195
aw-0.193
st-0.188
aut-0.186
Tokener
Token ID263
Feature activation+2.82e-02

Pos logit contributions
 elbow+0.103
 whine+0.097
 knee+0.095
 sister+0.093
 sorry+0.093
Neg logit contributions
s-0.048
aw-0.045
That-0.042
This-0.036
ball-0.036
Token said
Token ID531
Feature activation-5.45e-03

Pos logit contributions
 elbow+0.067
 knee+0.065
 sister+0.061
 sorry+0.058
 whine+0.057
Neg logit contributions
s-0.033
That-0.032
aw-0.029
This-0.028
It-0.028
Token it
Token ID340
Feature activation-4.24e-02

Pos logit contributions
That+0.006
s+0.006
aw+0.005
It+0.005
This+0.005
Neg logit contributions
 elbow-0.014
 knee-0.013
 sister-0.012
 whine-0.012
 911-0.011
Token was
Token ID373
Feature activation-1.13e-02

Pos logit contributions
That+0.031
s+0.029
This+0.027
aw+0.025
It+0.024
Neg logit contributions
 elbow-0.119
 knee-0.116
 sister-0.112
 sorry-0.106
 whine-0.105
Token "
Token ID366
Feature activation-8.98e-04

Pos logit contributions
 elbow+0.060
 knee+0.056
 sister+0.052
 whine+0.051
 boo+0.048
Neg logit contributions
That-0.026
It-0.023
s-0.022
This-0.022
A-0.019
TokenCan
Token ID6090
Feature activation-4.70e-04

Pos logit contributions
s+0.001
That+0.001
rawling+0.001
aw+0.001
ures+0.001
Neg logit contributions
 elbow-0.002
 knee-0.002
 sister-0.002
 whine-0.002
 Amen-0.002
Token we
Token ID356
Feature activation+3.07e-02

Pos logit contributions
That+0.001
s+0.001
This+0.001
It+0.001
There+0.001
Neg logit contributions
 elbow-0.001
 knee-0.001
 sister-0.001
 whine-0.001
 sorry-0.001
Token keep
Token ID1394
Feature activation+4.58e-02

Pos logit contributions
 elbow+0.055
 knee+0.052
 sister+0.049
 sorry+0.045
 whine+0.044
Neg logit contributions
That-0.055
s-0.053
This-0.052
It-0.050
aw-0.050
Token the
Token ID262
Feature activation+9.52e-03

Pos logit contributions
 elbow+0.095
 knee+0.091
 sister+0.086
 whine+0.080
 sorry+0.079
Neg logit contributions
That-0.067
s-0.066
This-0.064
It-0.061
aw-0.060
Token re
Token ID302
Feature activation+9.36e-02

Pos logit contributions
 knee+0.013
 elbow+0.013
 sister+0.012
 sorry+0.011
 boo+0.011
Neg logit contributions
pill-0.020
That-0.020
aw-0.020
ures-0.020
rawling-0.020
Tokenind
Token ID521
Feature activation+8.33e-02

Pos logit contributions
 elbow+0.236
 knee+0.226
 sister+0.218
 sorry+0.210
 whine+0.204
Neg logit contributions
s-0.098
That-0.093
This-0.084
aw-0.082
It-0.079
Tokeneer
Token ID28153
Feature activation+5.45e-02

Pos logit contributions
 elbow+0.139
 knee+0.132
 sister+0.122
 sorry+0.115
 whine+0.112
Neg logit contributions
s-0.155
That-0.155
This-0.147
aw-0.144
It-0.141
Token,
Token ID11
Feature activation+3.66e-02

Pos logit contributions
 elbow+0.115
 knee+0.102
 sister+0.096
 whine+0.096
 911+0.092
Neg logit contributions
s-0.081
That-0.075
This-0.072
It-0.071
aw-0.069
Token please
Token ID3387
Feature activation+3.53e-02

Pos logit contributions
 elbow+0.059
 knee+0.051
 sister+0.046
 whine+0.044
 911+0.042
Neg logit contributions
That-0.075
It-0.071
s-0.071
This-0.070
A-0.067
Token?"
Token ID1701
Feature activation+1.86e-02

Pos logit contributions
 elbow+0.093
 knee+0.086
 sister+0.078
 whine+0.076
 911+0.075
Neg logit contributions
That-0.033
A-0.033
It-0.032
s-0.032
This-0.031
Token happy
Token ID3772
Feature activation-3.29e-02

Pos logit contributions
 elbow+0.027
 knee+0.025
 911+0.023
 sister+0.023
 whine+0.021
Neg logit contributions
That-0.021
s-0.020
It-0.019
This-0.019
aw-0.018
Token and
Token ID290
Feature activation+9.79e-03

Pos logit contributions
That+0.033
s+0.031
This+0.030
aw+0.028
It+0.027
Neg logit contributions
 elbow-0.084
 knee-0.082
 sister-0.079
 sorry-0.074
 whine-0.073
Token thanked
Token ID26280
Feature activation+1.59e-02

Pos logit contributions
 elbow+0.024
 knee+0.023
 sister+0.022
 911+0.022
 whine+0.021
Neg logit contributions
That-0.011
s-0.011
This-0.010
It-0.009
aw-0.009
Token the
Token ID262
Feature activation+2.48e-03

Pos logit contributions
 elbow+0.041
 knee+0.040
 whine+0.038
 sister+0.038
 sorry+0.037
Neg logit contributions
That-0.014
This-0.014
It-0.012
 that-0.012
 me-0.012
Token ha
Token ID387
Feature activation+5.74e-02

Pos logit contributions
 elbow+0.004
 knee+0.003
 911+0.003
 sister+0.003
 whine+0.003
Neg logit contributions
That-0.005
This-0.005
It-0.005
A-0.005
There-0.005
Tokenird
Token ID1447
Feature activation+9.35e-02

Pos logit contributions
 elbow+0.079
 knee+0.073
 sorry+0.066
 sister+0.065
 whine+0.065
Neg logit contributions
s-0.130
aw-0.124
That-0.120
This-0.114
It-0.113
Tokenress
Token ID601
Feature activation+5.12e-02

Pos logit contributions
 elbow+0.130
 sorry+0.104
 knee+0.104
 boo+0.103
 whine+0.102
Neg logit contributions
s-0.234
That-0.200
aw-0.192
It-0.188
This-0.187
Tokener
Token ID263
Feature activation+4.07e-02

Pos logit contributions
 elbow+0.132
 whine+0.122
 sorry+0.118
 knee+0.116
 sister+0.116
Neg logit contributions
s-0.056
aw-0.051
That-0.047
 that-0.042
ard-0.039
Token.
Token ID13
Feature activation+5.86e-03

Pos logit contributions
 elbow+0.109
 knee+0.102
 sister+0.097
 whine+0.096
 sorry+0.095
Neg logit contributions
s-0.039
That-0.034
aw-0.030
It-0.028
This-0.028
Token As
Token ID1081
Feature activation+9.42e-03

Pos logit contributions
 elbow+0.013
 knee+0.013
 sister+0.012
 sorry+0.012
 whine+0.011
Neg logit contributions
That-0.008
s-0.007
This-0.007
It-0.007
aw-0.007
Token they
Token ID484
Feature activation-1.30e-02

Pos logit contributions
 elbow+0.024
 knee+0.023
 sister+0.021
 whine+0.021
 forehead+0.020
Neg logit contributions
That-0.011
s-0.011
This-0.010
It-0.010
There-0.010
Token's
Token ID338
Feature activation+2.97e-02

Pos logit contributions
 elbow+0.045
 knee+0.044
 sister+0.042
 sorry+0.041
 whine+0.040
Neg logit contributions
s-0.010
That-0.009
This-0.008
aw-0.007
It-0.007
Token turn
Token ID1210
Feature activation-3.06e-02

Pos logit contributions
 Amen+0.050
 911+0.046
 elbow+0.044
 sorry+0.042
 Acc+0.041
Neg logit contributions
s-0.051
 sky-0.049
 might-0.047
That-0.046
This-0.045
Token.
Token ID13
Feature activation+8.70e-04

Pos logit contributions
That+0.038
s+0.035
This+0.035
aw+0.033
It+0.033
Neg logit contributions
 elbow-0.071
 knee-0.069
 sister-0.065
 sorry-0.062
 whine-0.060
Token The
Token ID383
Feature activation+8.84e-03

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 whine+0.002
 sorry+0.002
Neg logit contributions
That-0.001
s-0.001
This-0.001
It-0.001
There-0.001
Token ha
Token ID387
Feature activation+5.56e-02

Pos logit contributions
 elbow+0.020
 Acc+0.018
 Amen+0.018
 knee+0.018
 911+0.018
Neg logit contributions
 sky-0.010
That-0.010
It-0.010
There-0.010
This-0.010
Tokenird
Token ID1447
Feature activation+9.32e-02

Pos logit contributions
 elbow+0.072
 knee+0.068
 sister+0.061
 sorry+0.059
 whine+0.057
Neg logit contributions
s-0.125
That-0.121
aw-0.120
This-0.116
It-0.115
Tokenress
Token ID601
Feature activation+3.31e-02

Pos logit contributions
 elbow+0.122
 sorry+0.109
 knee+0.104
 boo+0.103
 whine+0.103
Neg logit contributions
s-0.227
That-0.197
aw-0.194
It-0.191
This-0.185
Tokener
Token ID263
Feature activation+1.91e-02

Pos logit contributions
 elbow+0.080
 whine+0.075
 knee+0.074
 sorry+0.072
 sister+0.071
Neg logit contributions
s-0.037
aw-0.037
That-0.036
It-0.030
ball-0.030
Token asked
Token ID1965
Feature activation+9.80e-03

Pos logit contributions
 elbow+0.049
 knee+0.048
 sister+0.046
 sorry+0.044
 whine+0.043
Neg logit contributions
That-0.018
s-0.018
This-0.016
aw-0.016
It-0.016
Token her
Token ID607
Feature activation+2.53e-02

Pos logit contributions
 elbow+0.024
 knee+0.022
 Amen+0.020
 whine+0.019
 forehead+0.018
Neg logit contributions
s-0.012
That-0.012
 me-0.012
 that-0.012
This-0.011
Token what
Token ID644
Feature activation-4.29e-02

Pos logit contributions
 elbow+0.060
 knee+0.052
 whine+0.049
 Amen+0.048
 forehead+0.046
Neg logit contributions
s-0.036
 that-0.033
That-0.031
It-0.029
-0.029
Token his
Token ID465
Feature activation+1.62e-02

Pos logit contributions
 elbow+0.049
 knee+0.048
 whine+0.045
 sister+0.044
 liar+0.044
Neg logit contributions
That-0.024
This-0.023
It-0.022
 that-0.022
 me-0.021
Token mom
Token ID1995
Feature activation+3.34e-02

Pos logit contributions
 elbow+0.030
 whine+0.025
 knee+0.025
 911+0.023
burn+0.023
Neg logit contributions
s-0.025
 beautiful-0.025
That-0.024
 huge-0.023
 me-0.023
Token and
Token ID290
Feature activation+1.74e-02

Pos logit contributions
 elbow+0.078
 knee+0.072
 sister+0.066
 sorry+0.065
 whine+0.064
Neg logit contributions
s-0.047
That-0.041
This-0.038
aw-0.037
It-0.035
Token the
Token ID262
Feature activation-8.47e-03

Pos logit contributions
 elbow+0.042
 knee+0.042
 911+0.040
 whine+0.038
 diaper+0.038
Neg logit contributions
s-0.017
That-0.017
 made-0.014
It-0.014
This-0.014
Token ha
Token ID387
Feature activation+5.17e-02

Pos logit contributions
That+0.012
This+0.011
It+0.011
 sky+0.011
There+0.011
Neg logit contributions
 elbow-0.017
 knee-0.016
 911-0.015
 whine-0.015
 sister-0.015
Tokenird
Token ID1447
Feature activation+9.30e-02

Pos logit contributions
 elbow+0.093
 knee+0.088
 sorry+0.083
 sister+0.083
 911+0.082
Neg logit contributions
s-0.095
aw-0.087
That-0.082
st-0.077
This-0.077
Tokenress
Token ID601
Feature activation+4.58e-02

Pos logit contributions
 elbow+0.124
 sorry+0.109
 knee+0.106
 boo+0.105
 whine+0.104
Neg logit contributions
s-0.228
That-0.189
aw-0.185
st-0.177
aut-0.177
Tokener
Token ID263
Feature activation+3.71e-02

Pos logit contributions
 elbow+0.114
 whine+0.111
 sorry+0.105
 sister+0.105
 911+0.105
Neg logit contributions
s-0.049
aw-0.045
 that-0.037
That-0.037
This-0.031
Token.
Token ID13
Feature activation+3.94e-03

Pos logit contributions
 elbow+0.091
 knee+0.087
 sister+0.083
 sorry+0.082
 whine+0.081
Neg logit contributions
s-0.041
That-0.037
aw-0.034
This-0.033
It-0.032
Token "
Token ID366
Feature activation-9.19e-03

Pos logit contributions
 elbow+0.009
 knee+0.009
 sister+0.008
 sorry+0.008
 whine+0.008
Neg logit contributions
That-0.005
s-0.005
This-0.004
aw-0.004
It-0.004
TokenI
Token ID40
Feature activation+8.70e-03

Pos logit contributions
s+0.009
That+0.008
aw+0.008
ures+0.008
rawling+0.008
Neg logit contributions
 elbow-0.023
 knee-0.022
 sister-0.021
 sorry-0.020
 whine-0.020
Token very
Token ID845
Feature activation+7.91e-03

Pos logit contributions
 elbow+0.033
 knee+0.032
 sister+0.031
 whine+0.029
 sorry+0.028
Neg logit contributions
That-0.025
s-0.024
This-0.023
aw-0.023
It-0.023
Token still
Token ID991
Feature activation-1.89e-03

Pos logit contributions
 elbow+0.017
 knee+0.016
 sister+0.016
 sorry+0.015
 whine+0.014
Neg logit contributions
That-0.011
s-0.011
This-0.010
aw-0.010
It-0.010
Token so
Token ID523
Feature activation+1.14e-02

Pos logit contributions
That+0.002
s+0.002
This+0.002
aw+0.002
It+0.002
Neg logit contributions
 elbow-0.004
 knee-0.004
 sister-0.004
 911-0.004
 whine-0.004
Token the
Token ID262
Feature activation-2.14e-03

Pos logit contributions
 elbow+0.024
 knee+0.023
 sister+0.022
 911+0.021
 whine+0.020
Neg logit contributions
That-0.017
s-0.017
This-0.016
It-0.015
aw-0.015
Token ha
Token ID387
Feature activation+5.51e-02

Pos logit contributions
That+0.004
This+0.004
It+0.004
 sky+0.004
A+0.003
Neg logit contributions
 elbow-0.004
 knee-0.003
 911-0.003
 Acc-0.003
 sister-0.003
Tokenird
Token ID1447
Feature activation+9.25e-02

Pos logit contributions
 elbow+0.079
 knee+0.075
 sorry+0.069
 sister+0.069
 whine+0.067
Neg logit contributions
s-0.116
aw-0.111
That-0.109
This-0.104
There-0.103
Tokenress
Token ID601
Feature activation+4.24e-02

Pos logit contributions
 elbow+0.118
 sorry+0.107
 whine+0.104
 boo+0.101
 knee+0.099
Neg logit contributions
s-0.226
aw-0.192
That-0.189
ard-0.185
st-0.185
Tokener
Token ID263
Feature activation+2.94e-02

Pos logit contributions
 elbow+0.104
 whine+0.099
 sorry+0.096
 911+0.095
 sister+0.094
Neg logit contributions
s-0.045
aw-0.043
That-0.039
ard-0.035
 that-0.034
Token could
Token ID714
Feature activation+3.31e-02

Pos logit contributions
 elbow+0.071
 knee+0.068
 sister+0.065
 sorry+0.063
 911+0.063
Neg logit contributions
s-0.033
That-0.032
This-0.028
aw-0.028
It-0.028
Token do
Token ID466
Feature activation+3.73e-03

Pos logit contributions
 elbow+0.063
 knee+0.059
 sister+0.058
 911+0.055
 sorry+0.053
Neg logit contributions
That-0.052
This-0.050
s-0.050
It-0.050
There-0.047
Token a
Token ID257
Feature activation+2.17e-02

Pos logit contributions
 elbow+0.008
 knee+0.008
 sister+0.008
 whine+0.007
 sorry+0.007
Neg logit contributions
That-0.005
s-0.005
This-0.004
aw-0.004
It-0.004
Token so
Token ID523
Feature activation+1.39e-02

Pos logit contributions
 elbow+0.027
 knee+0.026
 sister+0.025
 whine+0.023
 sorry+0.023
Neg logit contributions
That-0.012
This-0.011
s-0.011
It-0.011
aw-0.010
Token he
Token ID339
Feature activation-2.80e-02

Pos logit contributions
 elbow+0.037
 knee+0.036
 whine+0.034
 boo+0.034
 Acc+0.033
Neg logit contributions
s-0.013
That-0.011
 that-0.011
It-0.009
This-0.009
Token asked
Token ID1965
Feature activation+1.43e-02

Pos logit contributions
That+0.036
This+0.035
s+0.034
ures+0.034
There+0.032
Neg logit contributions
 elbow-0.061
 knee-0.060
 sister-0.058
 boo-0.053
 sorry-0.053
Token the
Token ID262
Feature activation-2.56e-03

Pos logit contributions
 elbow+0.041
 knee+0.037
burn+0.034
sing+0.034
 bleed+0.034
Neg logit contributions
 that-0.013
-0.011
 was-0.010
 all-0.010
 me-0.010
Token ha
Token ID387
Feature activation+5.49e-02

Pos logit contributions
That+0.005
This+0.005
A+0.005
It+0.005
s+0.005
Neg logit contributions
 elbow-0.004
 knee-0.003
 sister-0.003
 911-0.003
 Acc-0.003
Tokenird
Token ID1447
Feature activation+9.23e-02

Pos logit contributions
 elbow+0.079
 knee+0.070
 sorry+0.065
 Amen+0.063
 whine+0.062
Neg logit contributions
s-0.125
aw-0.116
That-0.110
st-0.107
ard-0.106
Tokenress
Token ID601
Feature activation+4.60e-02

Pos logit contributions
 elbow+0.145
 sorry+0.112
 boo+0.112
 knee+0.111
 whine+0.110
Neg logit contributions
s-0.223
That-0.184
A-0.178
It-0.176
st-0.175
Tokener
Token ID263
Feature activation+4.83e-02

Pos logit contributions
 elbow+0.121
 whine+0.109
 sorry+0.106
 knee+0.105
 boo+0.102
Neg logit contributions
s-0.048
aw-0.042
That-0.038
 that-0.037
ball-0.036
Token to
Token ID284
Feature activation+2.94e-02

Pos logit contributions
 elbow+0.110
 knee+0.101
 sorry+0.090
 whine+0.090
 boo+0.089
Neg logit contributions
s-0.069
That-0.061
It-0.056
This-0.055
aw-0.055
Token give
Token ID1577
Feature activation+8.46e-03

Pos logit contributions
 elbow+0.056
 knee+0.051
 sister+0.049
 911+0.046
 sorry+0.045
Neg logit contributions
That-0.048
This-0.047
It-0.045
A-0.044
There-0.042
Token him
Token ID683
Feature activation+2.23e-02

Pos logit contributions
 elbow+0.021
 knee+0.020
 sister+0.019
 whine+0.019
 sorry+0.018
Neg logit contributions
s-0.009
This-0.009
That-0.009
It-0.008
aw-0.008
Token friend
Token ID1545
Feature activation+1.71e-02

Pos logit contributions
 elbow+0.062
 knee+0.056
 whine+0.051
 Amen+0.050
 boo+0.049
Neg logit contributions
s-0.028
That-0.026
It-0.024
 that-0.023
This-0.023
Token who
Token ID508
Feature activation-2.51e-02

Pos logit contributions
 elbow+0.045
 knee+0.042
 whine+0.038
 sister+0.037
 Amen+0.037
Neg logit contributions
s-0.019
That-0.016
aw-0.014
ball-0.014
This-0.013
Token was
Token ID373
Feature activation+2.93e-02

Pos logit contributions
That+0.021
s+0.020
This+0.019
aw+0.018
rawling+0.017
Neg logit contributions
 elbow-0.066
 knee-0.064
 sister-0.063
 sorry-0.060
 whine-0.058
Token a
Token ID257
Feature activation+1.34e-02

Pos logit contributions
 elbow+0.063
 knee+0.055
 sister+0.053
 apology+0.051
 911+0.050
Neg logit contributions
It-0.041
That-0.040
This-0.038
her-0.038
aw-0.038
Token ha
Token ID387
Feature activation+5.26e-02

Pos logit contributions
 elbow+0.035
 knee+0.031
 911+0.028
 Kim+0.028
 whine+0.028
Neg logit contributions
That-0.015
It-0.014
A-0.013
s-0.012
ball-0.012
Tokenird
Token ID1447
Feature activation+9.11e-02

Pos logit contributions
 elbow+0.081
 knee+0.076
 Amen+0.073
 911+0.071
 sorry+0.071
Neg logit contributions
s-0.104
aw-0.096
st-0.091
That-0.090
ard-0.089
Tokenress
Token ID601
Feature activation+4.04e-02

Pos logit contributions
 elbow+0.128
 boo+0.114
 sorry+0.113
 whine+0.111
 knee+0.108
Neg logit contributions
s-0.214
st-0.169
!-0.167
ard-0.166
That-0.165
Tokener
Token ID263
Feature activation+3.91e-02

Pos logit contributions
 elbow+0.103
 whine+0.098
 Acc+0.093
 Amen+0.092
 sister+0.092
Neg logit contributions
aw-0.037
s-0.035
 that-0.031
That-0.029
ball-0.028
Token and
Token ID290
Feature activation+2.03e-02

Pos logit contributions
 elbow+0.094
 knee+0.089
 sister+0.085
 sorry+0.083
 whine+0.082
Neg logit contributions
s-0.044
That-0.042
aw-0.039
This-0.037
It-0.036
Token asked
Token ID1965
Feature activation+5.92e-03

Pos logit contributions
 elbow+0.052
 knee+0.051
 Amen+0.046
 whine+0.046
 forehead+0.044
Neg logit contributions
 made-0.019
That-0.018
s-0.018
 that-0.018
It-0.018
Token for
Token ID329
Feature activation+2.65e-02

Pos logit contributions
 elbow+0.013
 knee+0.012
 Amen+0.011
 whine+0.010
 sister+0.010
Neg logit contributions
s-0.009
That-0.008
This-0.008
aw-0.007
 me-0.007
Token him
Token ID683
Feature activation+8.68e-03

Pos logit contributions
That+0.005
This+0.005
s+0.005
aw+0.005
It+0.005
Neg logit contributions
 elbow-0.013
 knee-0.012
 sister-0.011
 whine-0.011
 sorry-0.010
Token to
Token ID284
Feature activation+1.69e-02

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.017
 sorry+0.016
 whine+0.016
Neg logit contributions
That-0.012
s-0.012
This-0.011
aw-0.011
It-0.011
Token a
Token ID257
Feature activation-2.59e-02

Pos logit contributions
 elbow+0.043
 knee+0.042
 sister+0.039
 sorry+0.037
 whine+0.037
Neg logit contributions
That-0.017
This-0.017
It-0.015
There-0.015
s-0.015
Token reliable
Token ID9314
Feature activation+1.69e-02

Pos logit contributions
That+0.029
s+0.027
This+0.026
aw+0.025
It+0.024
Neg logit contributions
 elbow-0.062
 knee-0.061
 sister-0.059
 sorry-0.055
 whine-0.054
Token ha
Token ID387
Feature activation+4.40e-02

Pos logit contributions
 elbow+0.037
 sorry+0.032
 Acc+0.031
 knee+0.031
 Amen+0.031
Neg logit contributions
 me-0.022
 field-0.021
 sky-0.021
That-0.020
A-0.020
Tokenird
Token ID1447
Feature activation+9.10e-02

Pos logit contributions
 elbow+0.058
 knee+0.053
 sister+0.047
 sorry+0.047
 whine+0.044
Neg logit contributions
s-0.100
aw-0.095
That-0.094
This-0.090
st-0.089
Tokenress
Token ID601
Feature activation+4.16e-02

Pos logit contributions
 elbow+0.123
 knee+0.101
 sorry+0.101
 boo+0.099
 whine+0.097
Neg logit contributions
s-0.219
That-0.194
aw-0.187
This-0.185
st-0.185
Tokener
Token ID263
Feature activation+3.95e-02

Pos logit contributions
 elbow+0.103
 whine+0.096
 sister+0.095
 knee+0.094
 sorry+0.093
Neg logit contributions
aw-0.045
s-0.042
That-0.038
 that-0.037
ball-0.034
Token,
Token ID11
Feature activation+7.29e-03

Pos logit contributions
 elbow+0.094
 knee+0.092
 sister+0.087
 sorry+0.082
 whine+0.081
Neg logit contributions
That-0.046
s-0.044
This-0.042
aw-0.040
It-0.040
Token who
Token ID508
Feature activation-2.43e-02

Pos logit contributions
 elbow+0.018
 knee+0.018
 sister+0.017
 whine+0.016
 sorry+0.016
Neg logit contributions
That-0.007
s-0.007
This-0.007
It-0.007
There-0.006
Token always
Token ID1464
Feature activation+8.80e-03

Pos logit contributions
That+0.023
s+0.021
This+0.021
aw+0.020
There+0.019
Neg logit contributions
 elbow-0.062
 knee-0.060
 sister-0.059
 sorry-0.055
 whine-0.054
Token made
Token ID925
Feature activation+1.59e-02

Pos logit contributions
That+0.021
s+0.021
This+0.019
aw+0.019
It+0.018
Neg logit contributions
 elbow-0.042
 knee-0.041
 sister-0.039
 sorry-0.037
 whine-0.036
Token sure
Token ID1654
Feature activation-6.59e-03

Pos logit contributions
 elbow+0.036
 knee+0.035
 sister+0.033
 whine+0.031
 sorry+0.031
Neg logit contributions
That-0.020
s-0.019
This-0.019
aw-0.018
It-0.018
Token to
Token ID284
Feature activation+3.25e-02

Pos logit contributions
That+0.007
s+0.007
This+0.007
It+0.007
aw+0.006
Neg logit contributions
 elbow-0.016
 knee-0.016
 sister-0.014
 sorry-0.014
 whine-0.014
Token keep
Token ID1394
Feature activation+3.72e-02

Pos logit contributions
 elbow+0.062
 knee+0.057
 sister+0.055
 911+0.052
 sorry+0.050
Neg logit contributions
That-0.053
This-0.051
It-0.050
s-0.047
A-0.047
Token the
Token ID262
Feature activation+5.53e-03

Pos logit contributions
 elbow+0.084
 knee+0.082
 sister+0.079
 whine+0.073
 boo+0.071
Neg logit contributions
That-0.047
This-0.045
s-0.043
It-0.043
There-0.040
Token m
Token ID285
Feature activation+9.09e-02

Pos logit contributions
 elbow+0.011
 knee+0.010
 sister+0.010
 sorry+0.009
 whine+0.009
Neg logit contributions
That-0.009
s-0.009
This-0.008
aw-0.008
It-0.008
Tokenule
Token ID2261
Feature activation+6.15e-02

Pos logit contributions
 elbow+0.152
 knee+0.144
 sister+0.137
 sorry+0.126
 whine+0.122
Neg logit contributions
That-0.170
s-0.169
This-0.160
aw-0.157
It-0.155
Token tidy
Token ID43044
Feature activation+1.62e-02

Pos logit contributions
 elbow+0.134
 knee+0.117
 sister+0.117
 whine+0.112
 Amen+0.111
Neg logit contributions
s-0.087
That-0.078
This-0.075
It-0.071
aw-0.071
Token and
Token ID290
Feature activation+3.14e-02

Pos logit contributions
 elbow+0.042
 knee+0.040
 sister+0.039
 sorry+0.038
 whine+0.037
Neg logit contributions
s-0.014
That-0.014
This-0.012
aw-0.012
It-0.011
Token clean
Token ID3424
Feature activation+1.90e-02

Pos logit contributions
 elbow+0.066
 knee+0.063
 sister+0.060
 911+0.057
 babys+0.055
Neg logit contributions
That-0.044
s-0.043
It-0.041
This-0.041
There-0.038
Token,
Token ID11
Feature activation+2.40e-02

Pos logit contributions
 elbow+0.045
 knee+0.043
 sister+0.042
 sorry+0.040
 whine+0.039
Neg logit contributions
s-0.022
That-0.021
This-0.020
aw-0.019
It-0.019
Token we
Token ID356
Feature activation+6.68e-03

Pos logit contributions
 elbow+0.077
 knee+0.072
 whine+0.068
 forehead+0.066
 911+0.066
Neg logit contributions
That-0.035
s-0.034
It-0.033
This-0.032
A-0.031
Token go
Token ID467
Feature activation+1.61e-02

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.010
 sorry+0.009
 whine+0.009
Neg logit contributions
That-0.013
s-0.012
This-0.012
aw-0.012
It-0.011
Token?"
Token ID1701
Feature activation+3.22e-02

Pos logit contributions
 elbow+0.038
 knee+0.035
 sister+0.033
 whine+0.032
 sorry+0.032
Neg logit contributions
s-0.021
That-0.020
This-0.019
It-0.018
There-0.018
Token said
Token ID531
Feature activation+2.99e-02

Pos logit contributions
 elbow+0.081
 knee+0.079
 sister+0.073
 sorry+0.070
 whine+0.069
Neg logit contributions
That-0.034
s-0.034
This-0.032
It-0.030
There-0.029
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
 elbow+0.077
 knee+0.074
 forehead+0.067
 Acc+0.066
 appointment+0.065
Neg logit contributions
s-0.030
That-0.026
This-0.026
It-0.024
A-0.023
Token younger
Token ID7099
Feature activation+9.06e-02

Pos logit contributions
 elbow+0.020
 knee+0.019
 sister+0.017
 sorry+0.016
 whine+0.016
Neg logit contributions
That-0.021
s-0.020
This-0.020
It-0.019
There-0.019
Token brother
Token ID3956
Feature activation+3.23e-02

Pos logit contributions
 elbow+0.108
 knee+0.095
 sorry+0.079
 whine+0.075
 boo+0.073
Neg logit contributions
s-0.212
That-0.202
aw-0.194
This-0.190
ball-0.187
Token.
Token ID13
Feature activation+5.90e-03

Pos logit contributions
 elbow+0.098
 knee+0.091
 sister+0.087
 sorry+0.085
 911+0.085
Neg logit contributions
s-0.019
That-0.013
aw-0.010
!-0.010
ard-0.010
Token
Token ID198
Feature activation-3.84e-02

Pos logit contributions
 elbow+0.016
 knee+0.016
 sister+0.015
 sorry+0.015
 whine+0.014
Neg logit contributions
That-0.005
s-0.004
This-0.004
aw-0.004
rawling-0.004
Token"
Token ID1
Feature activation-3.09e-02

Pos logit contributions
That+0.031
aw+0.030
pill+0.028
rawling+0.027
This+0.027
Neg logit contributions
 knee-0.093
 elbow-0.093
 sister-0.093
 Kim-0.088
 sorry-0.087
TokenLet
Token ID5756
Feature activation-4.00e-02

Pos logit contributions
aw+0.032
rawling+0.032
s+0.032
ures+0.031
ard+0.030
Neg logit contributions
 elbow-0.066
 knee-0.065
 whine-0.062
 sister-0.061
 sorry-0.060
Token very
Token ID845
Feature activation-5.35e-03

Pos logit contributions
That+0.013
s+0.013
This+0.012
aw+0.012
It+0.012
Neg logit contributions
 elbow-0.020
 knee-0.020
 sister-0.019
 sorry-0.018
 whine-0.017
Token restless
Token ID42537
Feature activation-3.56e-02

Pos logit contributions
That+0.009
This+0.009
There+0.008
ures+0.008
s+0.008
Neg logit contributions
 knee-0.009
 sorry-0.009
 elbow-0.009
 sister-0.009
 whine-0.008
Token.
Token ID13
Feature activation-6.84e-03

Pos logit contributions
That+0.044
This+0.041
s+0.040
rawling+0.038
There+0.038
Neg logit contributions
 elbow-0.080
 knee-0.080
 sister-0.076
 sorry-0.073
 whine-0.069
Token Billy
Token ID15890
Feature activation-1.85e-02

Pos logit contributions
st+0.013
pill+0.013
ures+0.013
aw+0.012
ball+0.012
Neg logit contributions
 knee-0.009
 sorry-0.009
 Kim-0.009
 elbow-0.009
 sister-0.008
Token was
Token ID373
Feature activation+1.90e-02

Pos logit contributions
This+0.024
That+0.024
ures+0.024
There+0.022
rawling+0.022
Neg logit contributions
 knee-0.039
 elbow-0.039
 sister-0.038
 boo-0.035
 whine-0.034
Token amazed
Token ID24562
Feature activation-8.23e-02

Pos logit contributions
 elbow+0.041
 knee+0.039
 sister+0.037
 911+0.034
 whine+0.034
Neg logit contributions
That-0.027
s-0.025
It-0.024
This-0.024
aw-0.024
Token by
Token ID416
Feature activation+2.58e-03

Pos logit contributions
pill+0.153
This+0.151
That+0.145
hest+0.143
There+0.141
Neg logit contributions
 sister-0.117
 knee-0.115
 sorry-0.101
 elbow-0.097
 Kim-0.094
Token the
Token ID262
Feature activation-6.62e-03

Pos logit contributions
 elbow+0.006
 knee+0.005
 sister+0.005
 sorry+0.005
 whine+0.005
Neg logit contributions
That-0.003
s-0.003
This-0.003
aw-0.003
It-0.003
Token bird
Token ID6512
Feature activation-9.52e-03

Pos logit contributions
That+0.011
s+0.010
This+0.010
aw+0.010
rawling+0.010
Neg logit contributions
 elbow-0.013
 knee-0.013
 sister-0.012
 sorry-0.011
 whine-0.011
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
That+0.010
This+0.009
There+0.009
aw+0.009
It+0.009
Neg logit contributions
 elbow-0.023
 knee-0.023
 sister-0.022
 sorry-0.021
 whine-0.020
Token smiled
Token ID13541
Feature activation-2.71e-02

Pos logit contributions
That+0.012
s+0.011
This+0.010
aw+0.010
It+0.010
Neg logit contributions
 elbow-0.028
 knee-0.028
 sister-0.026
 sorry-0.025
 whine-0.025
Token!"
Token ID2474
Feature activation+9.42e-03

Pos logit contributions
That+0.013
This+0.012
There+0.012
A+0.011
It+0.011
Neg logit contributions
 sister-0.021
 knee-0.021
 elbow-0.021
 sorry-0.020
 whine-0.018
Token His
Token ID2399
Feature activation+2.58e-02

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.017
 whine+0.016
 sorry+0.016
Neg logit contributions
That-0.015
s-0.015
This-0.014
It-0.013
aw-0.013
Token dad
Token ID9955
Feature activation-7.06e-03

Pos logit contributions
 elbow+0.037
 knee+0.034
 sister+0.031
 sorry+0.029
 whine+0.028
Neg logit contributions
That-0.055
s-0.054
This-0.052
aw-0.051
It-0.051
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
That+0.010
This+0.010
s+0.010
There+0.009
aw+0.009
Neg logit contributions
 elbow-0.015
 knee-0.014
 sister-0.014
 sorry-0.013
 whine-0.012
Token so
Token ID523
Feature activation-3.37e-03

Pos logit contributions
 knee+0.021
 elbow+0.021
 sister+0.020
 sorry+0.020
 whine+0.018
Neg logit contributions
s-0.015
That-0.015
This-0.014
rawling-0.014
aw-0.013
Token amazed
Token ID24562
Feature activation-8.19e-02

Pos logit contributions
ures+0.007
pill+0.007
This+0.007
st+0.007
aw+0.007
Neg logit contributions
 sorry-0.005
 knee-0.004
 sister-0.004
 elbow-0.004
 whine-0.004
Token by
Token ID416
Feature activation+4.98e-03

Pos logit contributions
This+0.138
That+0.135
pill+0.131
hest+0.126
There+0.124
Neg logit contributions
 knee-0.124
 sister-0.121
 Kim-0.115
 elbow-0.112
 sorry-0.111
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
 elbow+0.010
 knee+0.010
 sister+0.009
 whine+0.008
 sorry+0.008
Neg logit contributions
That-0.008
This-0.007
It-0.007
s-0.007
There-0.007
Token fish
Token ID5916
Feature activation-7.33e-03

Pos logit contributions
That+0.026
pill+0.025
ures+0.025
s+0.024
This+0.024
Neg logit contributions
 knee-0.026
 sister-0.026
 elbow-0.025
 sorry-0.024
 boo-0.023
Token too
Token ID1165
Feature activation-6.41e-03

Pos logit contributions
That+0.010
This+0.009
There+0.009
A+0.009
It+0.008
Neg logit contributions
 knee-0.015
 elbow-0.015
 sister-0.015
 sorry-0.014
 whine-0.013
Token.
Token ID13
Feature activation-9.57e-03

Pos logit contributions
This+0.010
pill+0.010
That+0.010
There+0.010
rawling+0.010
Neg logit contributions
 knee-0.011
 sorry-0.011
 elbow-0.011
 sister-0.010
 Kim-0.009
Token.
Token ID13
Feature activation+5.93e-03

Pos logit contributions
s+0.007
That+0.007
This+0.006
aw+0.006
It+0.006
Neg logit contributions
 elbow-0.020
 knee-0.019
 sister-0.018
 whine-0.018
 boo-0.018
Token But
Token ID887
Feature activation+2.04e-02

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.010
 sorry+0.010
 whine+0.010
Neg logit contributions
That-0.009
s-0.009
aw-0.009
This-0.009
ball-0.008
Token John
Token ID1757
Feature activation-2.14e-02

Pos logit contributions
 elbow+0.046
 knee+0.044
 sister+0.040
 boo+0.038
 whine+0.037
Neg logit contributions
That-0.029
s-0.027
This-0.026
It-0.026
There-0.024
Token was
Token ID373
Feature activation-6.00e-03

Pos logit contributions
That+0.023
This+0.021
s+0.020
There+0.019
It+0.019
Neg logit contributions
 elbow-0.053
 knee-0.052
 sister-0.050
 sorry-0.047
 whine-0.046
Token too
Token ID1165
Feature activation-8.73e-03

Pos logit contributions
That+0.008
s+0.007
This+0.007
It+0.007
aw+0.007
Neg logit contributions
 elbow-0.014
 knee-0.013
 sister-0.013
 whine-0.012
 911-0.012
Token proud
Token ID6613
Feature activation-8.15e-02

Pos logit contributions
ures+0.016
That+0.016
This+0.016
pill+0.016
rawling+0.016
Neg logit contributions
 knee-0.014
 sister-0.014
 sorry-0.014
 elbow-0.013
 whine-0.012
Token.
Token ID13
Feature activation-4.07e-04

Pos logit contributions
That+0.136
This+0.131
It+0.122
There+0.122
A+0.117
Neg logit contributions
 knee-0.147
 sister-0.144
 elbow-0.143
 sorry-0.126
 Kim-0.119
Token Suddenly
Token ID24975
Feature activation+9.30e-04

Pos logit contributions
That+0.001
s+0.001
aw+0.001
This+0.001
ball+0.000
Neg logit contributions
 elbow-0.001
 knee-0.001
 sister-0.001
 sorry-0.001
 whine-0.001
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 whine+0.002
 boo+0.002
Neg logit contributions
s-0.001
That-0.001
aw-0.001
It-0.001
There-0.001
Token the
Token ID262
Feature activation-6.66e-04

Pos logit contributions
 elbow+0.036
 knee+0.036
 sister+0.033
 boo+0.032
 whine+0.031
Neg logit contributions
That-0.018
This-0.016
It-0.016
s-0.016
There-0.015
Token steak
Token ID26320
Feature activation-2.56e-02

Pos logit contributions
That+0.001
s+0.001
This+0.001
aw+0.001
rawling+0.001
Neg logit contributions
 elbow-0.001
 knee-0.001
 sister-0.001
 sorry-0.001
 whine-0.001
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
That+0.015
s+0.013
This+0.012
It+0.011
aw+0.010
Neg logit contributions
 elbow-0.090
 knee-0.088
 sister-0.085
 sorry-0.081
 whine-0.080
TokenThe
Token ID464
Feature activation+2.13e-02

Pos logit contributions
s+0.009
That+0.009
aw+0.007
This+0.007
ures+0.007
Neg logit contributions
 elbow-0.045
 knee-0.044
 sister-0.042
 sorry-0.041
 whine-0.040
Token old
Token ID1468
Feature activation+5.07e-02

Pos logit contributions
 elbow+0.059
 knee+0.052
 whine+0.050
 Kim+0.048
 sister+0.048
Neg logit contributions
That-0.020
This-0.018
It-0.017
s-0.017
 sky-0.015
Token person
Token ID1048
Feature activation+1.87e-03

Pos logit contributions
 elbow+0.103
 knee+0.091
 whine+0.084
 sister+0.083
 sorry+0.082
Neg logit contributions
That-0.082
s-0.080
It-0.076
This-0.075
-0.070
Token was
Token ID373
Feature activation+1.54e-02

Pos logit contributions
 elbow+0.005
 knee+0.005
 sister+0.004
 sorry+0.004
 911+0.004
Neg logit contributions
That-0.002
s-0.002
This-0.002
aw-0.002
It-0.002
Token amazed
Token ID24562
Feature activation-8.11e-02

Pos logit contributions
 elbow+0.036
 knee+0.034
 sister+0.032
 whine+0.030
 911+0.030
Neg logit contributions
That-0.019
s-0.018
This-0.018
It-0.017
aw-0.017
Token and
Token ID290
Feature activation-4.72e-03

Pos logit contributions
This+0.144
pill+0.139
That+0.136
There+0.136
ures+0.130
Neg logit contributions
 knee-0.127
 sister-0.126
 sorry-0.111
 elbow-0.107
 Kim-0.099
Token said
Token ID531
Feature activation+8.92e-03

Pos logit contributions
That+0.004
s+0.004
This+0.003
It+0.003
aw+0.003
Neg logit contributions
 elbow-0.013
 knee-0.013
 sister-0.012
 whine-0.011
 sorry-0.011
Token,"
Token ID553
Feature activation-9.67e-03

Pos logit contributions
 elbow+0.025
 knee+0.023
 whine+0.021
 sister+0.021
 appointment+0.021
Neg logit contributions
s-0.009
That-0.008
It-0.008
 that-0.008
aw-0.007
TokenThat
Token ID2504
Feature activation-4.44e-02

Pos logit contributions
s+0.010
That+0.010
This+0.009
aw+0.009
rawling+0.009
Neg logit contributions
 elbow-0.023
 knee-0.023
 sister-0.022
 sorry-0.021
 whine-0.020
Token's
Token ID338
Feature activation+7.48e-03

Pos logit contributions
aws+0.060
Inside+0.060
That+0.056
awed+0.056
rawling+0.055
Neg logit contributions
 sister-0.081
 Amen-0.071
 brother-0.069
 singer-0.065
 knee-0.064
Token
Token ID198
Feature activation-4.60e-02

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.010
 sorry+0.010
 whine+0.009
Neg logit contributions
That-0.004
s-0.004
This-0.004
aw-0.004
It-0.004
Token
Token ID198
Feature activation-2.75e-02

Pos logit contributions
ures+0.045
That+0.044
aw+0.043
pill+0.043
rawling+0.041
Neg logit contributions
 knee-0.104
 elbow-0.103
 sister-0.098
 Kim-0.098
 sorry-0.094
TokenTim
Token ID14967
Feature activation-3.77e-03

Pos logit contributions
ures+0.046
pill+0.045
s+0.043
 sky+0.040
aw+0.040
Neg logit contributions
 knee-0.045
 elbow-0.042
 Kim-0.039
 sister-0.039
 sorry-0.038
Tokenmy
Token ID1820
Feature activation-1.87e-02

Pos logit contributions
That+0.007
This+0.006
s+0.006
ures+0.006
There+0.006
Neg logit contributions
 elbow-0.006
 knee-0.006
 sister-0.006
 whine-0.005
 boo-0.005
Token was
Token ID373
Feature activation+2.00e-02

Pos logit contributions
ures+0.025
pill+0.025
This+0.025
That+0.024
There+0.024
Neg logit contributions
 knee-0.038
 elbow-0.036
 sister-0.036
 boo-0.033
 whine-0.031
Token amazed
Token ID24562
Feature activation-8.10e-02

Pos logit contributions
 elbow+0.045
 knee+0.042
 sister+0.041
 911+0.038
 whine+0.038
Neg logit contributions
That-0.027
s-0.025
This-0.025
It-0.024
aw-0.024
Token by
Token ID416
Feature activation+7.99e-03

Pos logit contributions
pill+0.156
This+0.156
That+0.151
hest+0.145
There+0.143
Neg logit contributions
 knee-0.105
 sister-0.105
 sorry-0.088
 Kim-0.083
 elbow-0.082
Token the
Token ID262
Feature activation-8.91e-03

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.017
 sorry+0.016
 whine+0.016
Neg logit contributions
That-0.010
s-0.009
This-0.009
It-0.008
aw-0.008
Token bird
Token ID6512
Feature activation-1.48e-03

Pos logit contributions
That+0.016
s+0.016
ures+0.016
This+0.015
aw+0.015
Neg logit contributions
 knee-0.015
 elbow-0.015
 sister-0.014
 sorry-0.013
 boo-0.012
Token's
Token ID338
Feature activation+8.15e-03

Pos logit contributions
That+0.002
This+0.001
s+0.001
There+0.001
It+0.001
Neg logit contributions
 elbow-0.004
 knee-0.004
 sister-0.003
 sorry-0.003
 whine-0.003
Token answer
Token ID3280
Feature activation-2.63e-02

Pos logit contributions
 elbow+0.016
 knee+0.015
 sister+0.015
 whine+0.013
 sorry+0.013
Neg logit contributions
That-0.012
s-0.012
This-0.012
aw-0.011
It-0.011
Token
Token ID198
Feature activation-5.56e-02

Pos logit contributions
That+0.008
s+0.007
This+0.007
aw+0.007
ures+0.007
Neg logit contributions
 elbow-0.018
 knee-0.018
 sister-0.017
 sorry-0.016
 Kim-0.015
Token
Token ID198
Feature activation-3.34e-02

Pos logit contributions
ures+0.057
That+0.057
aw+0.056
pill+0.055
rawling+0.054
Neg logit contributions
 knee-0.117
 elbow-0.115
 sister-0.112
 Kim-0.112
 sorry-0.107
TokenTim
Token ID14967
Feature activation-1.03e-02

Pos logit contributions
ures+0.056
pill+0.053
als+0.050
 reefs+0.050
s+0.050
Neg logit contributions
 knee-0.051
 elbow-0.047
 Kim-0.046
 sister-0.044
 sorry-0.044
Tokenmy
Token ID1820
Feature activation-2.10e-02

Pos logit contributions
That+0.019
This+0.018
ures+0.018
There+0.018
s+0.017
Neg logit contributions
 elbow-0.017
 knee-0.017
 sister-0.017
 boo-0.014
 whine-0.014
Token was
Token ID373
Feature activation+2.06e-02

Pos logit contributions
ures+0.028
This+0.027
There+0.027
That+0.027
pill+0.026
Neg logit contributions
 knee-0.044
 sister-0.043
 elbow-0.042
 boo-0.039
 whine-0.037
Token amazed
Token ID24562
Feature activation-8.06e-02

Pos logit contributions
 elbow+0.046
 knee+0.042
 sister+0.040
 911+0.039
 whine+0.038
Neg logit contributions
That-0.028
It-0.026
This-0.026
s-0.025
aw-0.025
Token that
Token ID326
Feature activation+1.10e-02

Pos logit contributions
pill+0.158
This+0.156
That+0.150
hest+0.148
ures+0.144
Neg logit contributions
 sister-0.107
 knee-0.103
 sorry-0.090
 elbow-0.082
 Kim-0.080
Token the
Token ID262
Feature activation-4.20e-03

Pos logit contributions
 elbow+0.031
 knee+0.029
 whine+0.027
 forehead+0.026
 boo+0.026
Neg logit contributions
That-0.009
s-0.009
This-0.008
It-0.008
There-0.008
Token owl
Token ID39610
Feature activation-6.56e-03

Pos logit contributions
That+0.008
s+0.008
This+0.008
aw+0.008
ures+0.007
Neg logit contributions
 elbow-0.007
 knee-0.007
 sister-0.006
 sorry-0.006
 whine-0.005
Token was
Token ID373
Feature activation+6.07e-03

Pos logit contributions
That+0.006
This+0.006
aw+0.005
There+0.005
rawling+0.005
Neg logit contributions
 elbow-0.016
 knee-0.016
 sister-0.016
 sorry-0.015
 boo-0.014
Token so
Token ID523
Feature activation-8.77e-03

Pos logit contributions
 elbow+0.012
 knee+0.012
 sister+0.011
 whine+0.010
 sorry+0.010
Neg logit contributions
That-0.009
s-0.009
This-0.009
It-0.009
aw-0.009
Token the
Token ID262
Feature activation-2.53e-02

Pos logit contributions
s+0.021
pill+0.019
boards+0.019
aw+0.018
That+0.018
Neg logit contributions
 sister-0.017
 knee-0.017
 elbow-0.017
 ear-0.016
 sorry-0.016
Token wind
Token ID2344
Feature activation-2.01e-03

Pos logit contributions
That+0.024
s+0.023
This+0.022
aw+0.020
It+0.020
Neg logit contributions
 elbow-0.066
 knee-0.064
 sister-0.062
 sorry-0.059
 whine-0.058
Token."
Token ID526
Feature activation+2.49e-03

Pos logit contributions
That+0.002
This+0.002
s+0.002
There+0.002
aw+0.002
Neg logit contributions
 elbow-0.005
 knee-0.005
 sister-0.004
 sorry-0.004
 whine-0.004
Token Lily
Token ID20037
Feature activation-6.12e-03

Pos logit contributions
 elbow+0.005
 knee+0.005
 sister+0.004
 sorry+0.004
 whine+0.004
Neg logit contributions
That-0.004
s-0.004
This-0.004
It-0.004
aw-0.004
Token was
Token ID373
Feature activation+2.01e-02

Pos logit contributions
ures+0.008
There+0.008
This+0.008
pill+0.008
That+0.008
Neg logit contributions
 knee-0.013
 elbow-0.013
 sister-0.012
 boo-0.011
 whine-0.011
Token amazed
Token ID24562
Feature activation-8.04e-02

Pos logit contributions
 elbow+0.044
 knee+0.043
 sister+0.041
 sorry+0.038
 whine+0.038
Neg logit contributions
That-0.027
s-0.026
This-0.025
aw-0.024
It-0.024
Token at
Token ID379
Feature activation-1.53e-02

Pos logit contributions
This+0.146
pill+0.145
That+0.141
hest+0.138
There+0.137
Neg logit contributions
 knee-0.116
 sister-0.115
 sorry-0.102
 elbow-0.091
 Kim-0.089
Token how
Token ID703
Feature activation-3.01e-03

Pos logit contributions
That+0.020
s+0.019
This+0.018
aw+0.017
It+0.017
Neg logit contributions
 elbow-0.035
 knee-0.034
 sister-0.032
 sorry-0.030
 whine-0.030
Token something
Token ID1223
Feature activation-4.96e-02

Pos logit contributions
That+0.004
s+0.004
This+0.004
aw+0.004
ures+0.003
Neg logit contributions
 elbow-0.007
 knee-0.007
 sister-0.006
 sorry-0.006
 whine-0.006
Token so
Token ID523
Feature activation-8.83e-03

Pos logit contributions
There+0.052
ures+0.050
That+0.050
This+0.049
aw+0.049
Neg logit contributions
 knee-0.107
 elbow-0.103
 sister-0.102
 sorry-0.101
 boo-0.098
Token small
Token ID1402
Feature activation-2.12e-02

Pos logit contributions
ures+0.011
There+0.011
aw+0.011
This+0.011
pill+0.011
Neg logit contributions
 sorry-0.017
 knee-0.017
 sister-0.017
 elbow-0.016
 boo-0.015
Token.
Token ID13
Feature activation-4.23e-03

Pos logit contributions
 knee+0.009
 elbow+0.009
 sister+0.009
 sorry+0.008
 whine+0.008
Neg logit contributions
That-0.006
This-0.006
There-0.005
pill-0.005
A-0.005
Token\n
Token ID198
Feature activation-5.39e-02

Pos logit contributions
That+0.006
s+0.005
aw+0.005
This+0.005
rawling+0.005
Neg logit contributions
 elbow-0.009
 knee-0.009
 sister-0.008
 sorry-0.008
 whine-0.008
Token\n
Token ID198
Feature activation-3.58e-02

Pos logit contributions
aw+0.061
pill+0.057
rawling+0.054
als+0.054
That+0.053
Neg logit contributions
 knee-0.108
 elbow-0.106
 sorry-0.106
 sister-0.105
 911-0.101
TokenJimmy
Token ID40335
Feature activation-1.09e-02

Pos logit contributions
pill+0.042
s+0.041
aw+0.041
actus+0.040
ures+0.039
Neg logit contributions
 knee-0.068
 elbow-0.066
 sorry-0.065
 sister-0.063
 firefighter-0.062
Token was
Token ID373
Feature activation+1.63e-02

Pos logit contributions
That+0.012
This+0.012
There+0.011
ures+0.011
aw+0.011
Neg logit contributions
 elbow-0.025
 knee-0.025
 sister-0.025
 boo-0.022
 whine-0.022
Token amazed
Token ID24562
Feature activation-8.02e-02

Pos logit contributions
 elbow+0.043
 knee+0.041
 sister+0.039
 911+0.037
 whine+0.037
Neg logit contributions
That-0.015
This-0.014
It-0.013
s-0.013
aw-0.012
Token at
Token ID379
Feature activation-1.43e-02

Pos logit contributions
pill+0.146
This+0.145
That+0.141
There+0.137
hest+0.133
Neg logit contributions
 sister-0.116
 knee-0.110
 sorry-0.099
 elbow-0.092
 Kim-0.088
Token how
Token ID703
Feature activation-1.12e-03

Pos logit contributions
That+0.018
s+0.018
This+0.017
aw+0.016
It+0.016
Neg logit contributions
 elbow-0.033
 knee-0.032
 sister-0.030
 sorry-0.028
 whine-0.028
Token much
Token ID881
Feature activation-2.77e-03

Pos logit contributions
That+0.002
s+0.001
This+0.001
aw+0.001
It+0.001
Neg logit contributions
 elbow-0.002
 knee-0.002
 sister-0.002
 sorry-0.002
 whine-0.002
Token they
Token ID484
Feature activation-3.85e-02

Pos logit contributions
That+0.004
s+0.004
This+0.004
aw+0.004
It+0.003
Neg logit contributions
 elbow-0.006
 knee-0.006
 sister-0.005
 sorry-0.005
 whine-0.005
Token had
Token ID550
Feature activation-2.98e-02

Pos logit contributions
aw+0.052
pill+0.046
That+0.046
s+0.046
ures+0.045
Neg logit contributions
 elbow-0.079
 sister-0.077
 knee-0.073
 whine-0.070
 sorry-0.069
Token's
Token ID338
Feature activation-1.93e-03

Pos logit contributions
 elbow+0.058
 knee+0.056
 sister+0.053
 sorry+0.051
 whine+0.050
Neg logit contributions
s-0.026
That-0.025
This-0.023
aw-0.022
It-0.021
Token shop
Token ID6128
Feature activation+2.72e-05

Pos logit contributions
s+0.003
That+0.003
This+0.003
aw+0.003
It+0.002
Neg logit contributions
 elbow-0.004
 knee-0.004
 sister-0.003
 sorry-0.003
 whine-0.003
Token,
Token ID11
Feature activation+1.36e-02

Pos logit contributions
 knee+0.000
 elbow+0.000
 sister+0.000
 sorry+0.000
 whine+0.000
Neg logit contributions
That-0.000
This-0.000
There-0.000
aw-0.000
It-0.000
Token John
Token ID1757
Feature activation-8.67e-03

Pos logit contributions
 elbow+0.026
 knee+0.025
 sister+0.023
 boo+0.021
 sorry+0.021
Neg logit contributions
That-0.023
s-0.022
This-0.022
It-0.021
There-0.021
Token was
Token ID373
Feature activation+1.15e-02

Pos logit contributions
That+0.010
This+0.009
There+0.008
rawling+0.008
aw+0.008
Neg logit contributions
 elbow-0.020
 knee-0.020
 sister-0.020
 whine-0.018
 sorry-0.018
Token amazed
Token ID24562
Feature activation-8.01e-02

Pos logit contributions
 elbow+0.027
 knee+0.026
 sister+0.025
 911+0.023
 whine+0.023
Neg logit contributions
That-0.014
s-0.013
This-0.013
It-0.013
aw-0.012
Token.
Token ID13
Feature activation-1.67e-03

Pos logit contributions
That+0.138
This+0.138
pill+0.130
There+0.130
All+0.122
Neg logit contributions
 sister-0.123
 knee-0.119
 elbow-0.106
 sorry-0.105
 Kim-0.101
Token The
Token ID383
Feature activation+4.52e-03

Pos logit contributions
That+0.002
s+0.002
aw+0.002
This+0.002
ball+0.002
Neg logit contributions
 elbow-0.003
 knee-0.003
 sister-0.003
 sorry-0.003
 whine-0.003
Token shop
Token ID6128
Feature activation+9.40e-03

Pos logit contributions
 elbow+0.010
 knee+0.009
 sister+0.009
 sorry+0.008
 whine+0.008
Neg logit contributions
That-0.006
s-0.006
This-0.006
aw-0.006
It-0.006
Token had
Token ID550
Feature activation-1.97e-02

Pos logit contributions
 elbow+0.019
 knee+0.019
 sister+0.018
 sorry+0.016
 whine+0.016
Neg logit contributions
That-0.014
This-0.013
s-0.013
aw-0.013
There-0.012
Token colorful
Token ID20239
Feature activation-1.26e-04

Pos logit contributions
s+0.027
That+0.026
rawling+0.025
aut+0.024
ures+0.024
Neg logit contributions
 elbow-0.042
 knee-0.040
 sister-0.038
 Kim-0.036
 sorry-0.036
Token\n
Token ID198
Feature activation-3.50e-02

Pos logit contributions
aw+0.049
That+0.046
rawling+0.046
ures+0.043
pill+0.042
Neg logit contributions
 knee-0.123
 elbow-0.122
 sister-0.118
 sorry-0.116
 Kim-0.113
TokenLu
Token ID25596
Feature activation+3.10e-02

Pos logit contributions
s+0.039
aw+0.037
ures+0.036
rawling+0.036
That+0.036
Neg logit contributions
 knee-0.074
 elbow-0.073
 sister-0.070
 sorry-0.068
 whine-0.067
Tokency
Token ID948
Feature activation-1.11e-02

Pos logit contributions
 elbow+0.062
 knee+0.060
 sister+0.059
 whine+0.053
 911+0.051
Neg logit contributions
That-0.047
This-0.044
s-0.043
It-0.042
There-0.042
Token was
Token ID373
Feature activation+2.35e-02

Pos logit contributions
That+0.011
This+0.010
s+0.010
There+0.009
It+0.009
Neg logit contributions
 elbow-0.028
 knee-0.028
 sister-0.027
 sorry-0.025
 whine-0.024
Token so
Token ID523
Feature activation+7.47e-03

Pos logit contributions
 elbow+0.064
 knee+0.058
 sister+0.056
 911+0.055
 whine+0.053
Neg logit contributions
That-0.021
It-0.019
This-0.018
s-0.018
aw-0.017
Token amazed
Token ID24562
Feature activation-8.00e-02

Pos logit contributions
 elbow+0.014
 knee+0.013
 sister+0.012
 911+0.011
 whine+0.011
Neg logit contributions
That-0.013
s-0.013
This-0.013
It-0.012
aw-0.012
Token that
Token ID326
Feature activation+1.76e-02

Pos logit contributions
This+0.127
That+0.126
There+0.115
hest+0.113
pill+0.111
Neg logit contributions
 sister-0.128
 knee-0.126
 Kim-0.116
 elbow-0.115
 sorry-0.113
Token she
Token ID673
Feature activation-2.84e-02

Pos logit contributions
 elbow+0.044
 knee+0.042
 boo+0.037
 sister+0.036
 forehead+0.036
Neg logit contributions
That-0.022
s-0.021
This-0.020
It-0.019
There-0.018
Token yelled
Token ID22760
Feature activation+1.59e-03

Pos logit contributions
That+0.032
This+0.031
ures+0.030
aw+0.030
s+0.030
Neg logit contributions
 elbow-0.064
 knee-0.063
 sister-0.062
 sorry-0.058
 boo-0.056
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 elbow+0.004
 knee+0.004
 sister+0.004
 whine+0.003
 Amen+0.003
Neg logit contributions
s-0.002
That-0.002
aw-0.002
This-0.002
It-0.002
Token "
Token ID366
Feature activation-1.93e-02

Pos logit contributions
 elbow+0.034
 knee+0.032
 sister+0.031
 whine+0.028
 911+0.028
Neg logit contributions
That-0.019
s-0.018
This-0.017
It-0.017
aw-0.016
Token His
Token ID2399
Feature activation+2.17e-02

Pos logit contributions
 elbow+0.109
 knee+0.107
 sister+0.103
 sorry+0.097
 Kim+0.097
Neg logit contributions
That-0.070
This-0.065
s-0.062
aw-0.060
ures-0.060
Token mom
Token ID1995
Feature activation-2.51e-03

Pos logit contributions
 elbow+0.034
 knee+0.032
 sister+0.029
 whine+0.028
 911+0.027
Neg logit contributions
That-0.042
s-0.041
This-0.040
It-0.039
aw-0.039
Token and
Token ID290
Feature activation+4.02e-03

Pos logit contributions
That+0.003
This+0.003
aw+0.002
There+0.002
s+0.002
Neg logit contributions
 elbow-0.006
 knee-0.006
 sister-0.006
 sorry-0.005
 whine-0.005
Token dad
Token ID9955
Feature activation-1.16e-02

Pos logit contributions
 elbow+0.006
 knee+0.006
 sister+0.005
 whine+0.005
 sorry+0.005
Neg logit contributions
That-0.008
s-0.008
This-0.007
It-0.007
aw-0.007
Token looked
Token ID3114
Feature activation-1.56e-02

Pos logit contributions
That+0.013
This+0.012
s+0.012
aw+0.011
There+0.011
Neg logit contributions
 elbow-0.027
 knee-0.027
 sister-0.026
 sorry-0.025
 whine-0.024
Token amazed
Token ID24562
Feature activation-7.97e-02

Pos logit contributions
That+0.022
This+0.021
s+0.020
There+0.020
rawling+0.020
Neg logit contributions
 elbow-0.032
 knee-0.032
 sister-0.031
 sorry-0.030
 whine-0.028
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
pill+0.134
This+0.129
That+0.127
There+0.123
All+0.118
Neg logit contributions
 sister-0.123
 knee-0.119
 sorry-0.112
 elbow-0.107
 Kim-0.099
Token They
Token ID1119
Feature activation-3.31e-02

Pos logit contributions
That+0.003
s+0.003
aw+0.003
This+0.003
ball+0.003
Neg logit contributions
 elbow-0.005
 knee-0.005
 sister-0.004
 sorry-0.004
 911-0.004
Token asked
Token ID1965
Feature activation+5.17e-03

Pos logit contributions
That+0.039
This+0.038
aw+0.037
pill+0.037
rawling+0.037
Neg logit contributions
 elbow-0.072
 sister-0.071
 knee-0.068
 sorry-0.065
 whine-0.065
Token him
Token ID683
Feature activation+2.56e-02

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.010
 boo+0.009
 whine+0.009
Neg logit contributions
That-0.007
s-0.007
This-0.007
aw-0.007
It-0.007
Token where
Token ID810
Feature activation-2.11e-02

Pos logit contributions
 elbow+0.063
 knee+0.057
 boo+0.051
 whine+0.051
 sister+0.049
Neg logit contributions
s-0.035
That-0.032
This-0.029
It-0.029
 that-0.028
Token."
Token ID526
Feature activation+8.32e-03

Pos logit contributions
 elbow+0.063
 knee+0.059
 sister+0.056
 whine+0.052
 sorry+0.051
Neg logit contributions
s-0.042
That-0.037
This-0.036
There-0.036
It-0.035
Token But
Token ID887
Feature activation+1.72e-02

Pos logit contributions
 elbow+0.011
 knee+0.010
 sister+0.010
 sorry+0.009
 whine+0.008
Neg logit contributions
That-0.018
s-0.018
This-0.017
aw-0.017
It-0.017
Token Lucy
Token ID22162
Feature activation-1.02e-02

Pos logit contributions
 elbow+0.029
 knee+0.028
 sister+0.024
 boo+0.022
 whine+0.022
Neg logit contributions
That-0.033
s-0.032
This-0.031
It-0.030
There-0.029
Token was
Token ID373
Feature activation+2.13e-03

Pos logit contributions
That+0.011
This+0.011
ures+0.011
There+0.011
pill+0.011
Neg logit contributions
 knee-0.024
 elbow-0.024
 sister-0.023
 sorry-0.021
 whine-0.021
Token too
Token ID1165
Feature activation-5.60e-03

Pos logit contributions
 elbow+0.005
 knee+0.004
 sister+0.004
 whine+0.004
 911+0.004
Neg logit contributions
That-0.003
It-0.003
s-0.003
This-0.003
There-0.002
Token excited
Token ID6568
Feature activation-7.96e-02

Pos logit contributions
That+0.009
This+0.009
s+0.009
ures+0.009
aw+0.008
Neg logit contributions
 knee-0.011
 elbow-0.010
 sister-0.010
 sorry-0.010
 whine-0.009
Token and
Token ID290
Feature activation+7.41e-04

Pos logit contributions
That+0.136
This+0.133
There+0.131
It+0.122
ures+0.122
Neg logit contributions
 knee-0.139
 sister-0.136
 elbow-0.130
 whine-0.119
 Kim-0.118
Token ran
Token ID4966
Feature activation+4.09e-03

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 sorry+0.002
 whine+0.002
Neg logit contributions
That-0.001
s-0.001
This-0.001
aw-0.001
It-0.001
Token across
Token ID1973
Feature activation-5.71e-03

Pos logit contributions
 elbow+0.009
 knee+0.009
 sister+0.009
 sorry+0.008
 boo+0.008
Neg logit contributions
s-0.005
That-0.005
This-0.005
It-0.005
There-0.005
Token the
Token ID262
Feature activation-6.71e-03

Pos logit contributions
That+0.005
This+0.004
s+0.004
It+0.004
There+0.004
Neg logit contributions
 elbow-0.016
 knee-0.015
 sister-0.014
 sorry-0.014
 boo-0.014
Token road
Token ID2975
Feature activation+8.17e-03

Pos logit contributions
ures+0.014
pill+0.014
That+0.013
oph+0.013
aw+0.013
Neg logit contributions
 knee-0.009
 elbow-0.008
 sister-0.008
 Kim-0.007
 sorry-0.007
Token started
Token ID2067
Feature activation+2.06e-03

Pos logit contributions
That+0.026
s+0.024
This+0.024
rawling+0.023
aw+0.022
Neg logit contributions
 elbow-0.052
 knee-0.050
 sister-0.048
 sorry-0.046
 whine-0.045
Token singing
Token ID13777
Feature activation-1.29e-02

Pos logit contributions
 elbow+0.005
 knee+0.005
 sister+0.004
 sorry+0.004
 911+0.004
Neg logit contributions
s-0.003
That-0.002
This-0.002
It-0.002
aw-0.002
Token.
Token ID13
Feature activation-2.42e-04

Pos logit contributions
That+0.011
s+0.011
aw+0.010
This+0.010
It+0.010
Neg logit contributions
 elbow-0.035
 knee-0.034
 sister-0.032
 whine-0.030
 sorry-0.030
Token Billy
Token ID15890
Feature activation-1.77e-02

Pos logit contributions
That+0.000
s+0.000
This+0.000
aw+0.000
rawling+0.000
Neg logit contributions
 elbow-0.000
 knee-0.000
 sister-0.000
 sorry-0.000
 whine-0.000
Token was
Token ID373
Feature activation+1.61e-02

Pos logit contributions
That+0.021
This+0.021
ures+0.020
rawling+0.020
There+0.019
Neg logit contributions
 elbow-0.039
 knee-0.038
 sister-0.038
 boo-0.035
 whine-0.034
Token amazed
Token ID24562
Feature activation-7.96e-02

Pos logit contributions
 elbow+0.037
 knee+0.035
 sister+0.034
 whine+0.031
 911+0.031
Neg logit contributions
That-0.020
s-0.019
This-0.019
aw-0.018
It-0.018
Token.
Token ID13
Feature activation-1.46e-03

Pos logit contributions
pill+0.141
This+0.140
That+0.138
There+0.133
hest+0.132
Neg logit contributions
 knee-0.121
 sister-0.117
 sorry-0.107
 elbow-0.102
 Kim-0.097
Token He
Token ID679
Feature activation-2.48e-02

Pos logit contributions
That+0.002
s+0.002
aw+0.002
This+0.002
rawling+0.002
Neg logit contributions
 knee-0.003
 elbow-0.003
 sister-0.003
 sorry-0.003
 whine-0.003
Token thought
Token ID1807
Feature activation-2.14e-02

Pos logit contributions
That+0.030
This+0.030
rawling+0.029
ures+0.028
There+0.028
Neg logit contributions
 elbow-0.052
 sister-0.051
 knee-0.051
 sorry-0.047
 whine-0.047
Token the
Token ID262
Feature activation-3.22e-03

Pos logit contributions
s+0.021
That+0.021
aw+0.019
It+0.019
This+0.018
Neg logit contributions
 elbow-0.055
 knee-0.053
 sister-0.050
 whine-0.049
 Amen-0.048
Token bird
Token ID6512
Feature activation-2.67e-02

Pos logit contributions
That+0.005
s+0.005
This+0.005
aw+0.005
rawling+0.005
Neg logit contributions
 elbow-0.006
 knee-0.006
 sister-0.005
 sorry-0.005
 whine-0.005
Token\n
Token ID198
Feature activation-3.27e-02

Pos logit contributions
 elbow+0.024
 knee+0.023
 sister+0.023
 sorry+0.022
 whine+0.021
Neg logit contributions
That-0.004
s-0.004
This-0.003
It-0.003
aw-0.003
Token\n
Token ID198
Feature activation-1.73e-02

Pos logit contributions
That+0.027
ures+0.026
rawling+0.025
pill+0.025
aw+0.024
Neg logit contributions
 elbow-0.080
 knee-0.080
 sister-0.077
 Kim-0.074
 sorry-0.073
TokenL
Token ID43
Feature activation+2.57e-02

Pos logit contributions
ures+0.022
s+0.022
That+0.021
pill+0.021
rawling+0.021
Neg logit contributions
 knee-0.035
 elbow-0.035
 sister-0.033
 sorry-0.031
 whine-0.030
Tokenily
Token ID813
Feature activation-1.46e-02

Pos logit contributions
 elbow+0.038
 knee+0.038
 sister+0.034
 sorry+0.031
 911+0.029
Neg logit contributions
s-0.052
That-0.052
This-0.049
There-0.047
It-0.047
Token was
Token ID373
Feature activation+2.53e-02

Pos logit contributions
That+0.017
This+0.017
There+0.016
s+0.016
ures+0.016
Neg logit contributions
 elbow-0.033
 knee-0.033
 sister-0.031
 whine-0.029
 boo-0.028
Token amazed
Token ID24562
Feature activation-7.95e-02

Pos logit contributions
 elbow+0.059
 knee+0.055
 sister+0.053
 911+0.051
 whine+0.049
Neg logit contributions
That-0.032
It-0.029
aw-0.029
This-0.029
s-0.028
Token.
Token ID13
Feature activation+9.12e-03

Pos logit contributions
This+0.144
pill+0.143
That+0.142
hest+0.136
There+0.133
Neg logit contributions
 sister-0.116
 knee-0.115
 elbow-0.099
 sorry-0.098
 Kim-0.097
Token She
Token ID1375
Feature activation-2.19e-02

Pos logit contributions
 elbow+0.020
 knee+0.020
 sister+0.019
 sorry+0.018
 whine+0.017
Neg logit contributions
That-0.012
s-0.012
This-0.011
aw-0.011
It-0.010
Token wanted
Token ID2227
Feature activation-2.40e-02

Pos logit contributions
That+0.028
This+0.028
ures+0.027
There+0.027
s+0.026
Neg logit contributions
 elbow-0.045
 sister-0.044
 knee-0.044
 sorry-0.040
 whine-0.040
Token to
Token ID284
Feature activation+3.95e-02

Pos logit contributions
That+0.031
s+0.030
This+0.029
aw+0.028
It+0.027
Neg logit contributions
 elbow-0.055
 knee-0.053
 sister-0.050
 sorry-0.047
 whine-0.047
Token hear
Token ID3285
Feature activation-1.16e-02

Pos logit contributions
 elbow+0.081
 knee+0.075
 911+0.068
 sister+0.067
 sorry+0.066
Neg logit contributions
That-0.060
This-0.058
s-0.058
It-0.057
A-0.055
Token\n
Token ID198
Feature activation-5.55e-02

Pos logit contributions
That+0.014
aw+0.013
This+0.012
pill+0.012
s+0.011
Neg logit contributions
 knee-0.038
 elbow-0.037
 sister-0.036
 sorry-0.036
 Kim-0.035
Token\n
Token ID198
Feature activation-3.21e-02

Pos logit contributions
aw+0.062
ures+0.059
rawling+0.058
pill+0.056
That+0.056
Neg logit contributions
 knee-0.113
 elbow-0.112
 sister-0.112
 Kim-0.108
 sorry-0.107
TokenL
Token ID43
Feature activation+2.50e-02

Pos logit contributions
ures+0.045
aw+0.044
s+0.043
pill+0.042
rawling+0.041
Neg logit contributions
 knee-0.060
 elbow-0.058
 sister-0.058
 sorry-0.054
 whine-0.052
Tokenily
Token ID813
Feature activation-8.18e-03

Pos logit contributions
 elbow+0.043
 knee+0.041
 sister+0.039
 sorry+0.036
 whine+0.035
Neg logit contributions
That-0.046
s-0.045
This-0.043
aw-0.042
It-0.042
Token was
Token ID373
Feature activation+2.13e-02

Pos logit contributions
ures+0.010
This+0.010
That+0.010
There+0.010
pill+0.009
Neg logit contributions
 sister-0.018
 knee-0.017
 elbow-0.017
 boo-0.016
 whine-0.015
Token amazed
Token ID24562
Feature activation-7.95e-02

Pos logit contributions
 elbow+0.060
 knee+0.055
 sister+0.054
 911+0.053
 whine+0.051
Neg logit contributions
That-0.017
It-0.015
This-0.014
s-0.014
aw-0.014
Token and
Token ID290
Feature activation-6.26e-03

Pos logit contributions
pill+0.144
This+0.143
hest+0.137
ures+0.137
That+0.137
Neg logit contributions
 knee-0.111
 sister-0.111
 sorry-0.094
 Kim-0.092
 elbow-0.089
Token asked
Token ID1965
Feature activation+1.32e-02

Pos logit contributions
That+0.006
s+0.006
This+0.006
It+0.006
aw+0.005
Neg logit contributions
 elbow-0.016
 knee-0.016
 sister-0.015
 sorry-0.014
 whine-0.014
Token her
Token ID607
Feature activation+2.67e-02

Pos logit contributions
 elbow+0.038
 knee+0.035
 whine+0.032
 forehead+0.032
belt+0.032
Neg logit contributions
 that-0.012
-0.011
s-0.011
 me-0.010
That-0.010
Token mom
Token ID1995
Feature activation+3.84e-02

Pos logit contributions
 elbow+0.051
 knee+0.044
 whine+0.039
 forehead+0.038
 bru+0.037
Neg logit contributions
s-0.049
That-0.044
 that-0.043
 big-0.041
-0.041
Tokenmy
Token ID1820
Feature activation+2.12e-02

Pos logit contributions
 elbow+0.078
 knee+0.076
 sister+0.073
 sorry+0.067
 whine+0.066
Neg logit contributions
That-0.058
s-0.055
This-0.054
aw-0.052
It-0.051
Token,
Token ID11
Feature activation+6.61e-03

Pos logit contributions
That+0.032
This+0.029
s+0.029
aw+0.028
It+0.027
Neg logit contributions
 elbow-0.062
 knee-0.062
 sister-0.058
 sorry-0.055
 whine-0.054
Token but
Token ID475
Feature activation+9.14e-03

Pos logit contributions
 elbow+0.014
 knee+0.014
 sister+0.013
 sorry+0.012
 911+0.012
Neg logit contributions
That-0.009
s-0.009
This-0.008
It-0.008
aw-0.008
Token he
Token ID339
Feature activation-3.19e-02

Pos logit contributions
 elbow+0.023
 knee+0.022
 sister+0.021
 whine+0.020
 911+0.019
Neg logit contributions
That-0.010
s-0.009
It-0.009
This-0.009
There-0.009
Token was
Token ID373
Feature activation+1.00e-03

Pos logit contributions
That+0.036
s+0.034
This+0.034
ures+0.031
aw+0.031
Neg logit contributions
 elbow-0.077
 knee-0.076
 sister-0.072
 sorry-0.067
 boo-0.066
Token too
Token ID1165
Feature activation-5.91e-03

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 sorry+0.002
 whine+0.002
Neg logit contributions
That-0.001
s-0.001
This-0.001
It-0.001
aw-0.001
Token excited
Token ID6568
Feature activation-7.95e-02

Pos logit contributions
ures+0.011
This+0.010
pill+0.010
That+0.010
aw+0.010
Neg logit contributions
 knee-0.010
 elbow-0.009
 sister-0.009
 sorry-0.009
 boo-0.008
Token and
Token ID290
Feature activation-3.36e-03

Pos logit contributions
That+0.151
This+0.148
There+0.146
pill+0.138
It+0.136
Neg logit contributions
 knee-0.124
 elbow-0.112
 sister-0.111
 Kim-0.101
 appointment-0.098
Token he
Token ID339
Feature activation-3.03e-02

Pos logit contributions
That+0.004
s+0.004
This+0.003
It+0.003
aw+0.003
Neg logit contributions
 elbow-0.008
 knee-0.008
 sister-0.008
 911-0.007
 sorry-0.007
Token slipped
Token ID18859
Feature activation-8.64e-03

Pos logit contributions
That+0.039
This+0.038
s+0.037
ures+0.037
There+0.035
Neg logit contributions
 elbow-0.067
 knee-0.066
 sister-0.062
 boo-0.057
 sorry-0.057
Token over
Token ID625
Feature activation-7.86e-03

Pos logit contributions
That+0.010
s+0.010
This+0.010
It+0.009
aw+0.009
Neg logit contributions
 elbow-0.020
 knee-0.020
 sister-0.019
 sorry-0.018
 whine-0.017
Token.
Token ID13
Feature activation-1.54e-03

Pos logit contributions
That+0.011
s+0.010
This+0.010
rawling+0.010
aw+0.010
Neg logit contributions
 elbow-0.017
 knee-0.016
 sister-0.016
 sorry-0.015
 whine-0.014
Tokenapped
Token ID6320
Feature activation-1.15e-02

Pos logit contributions
 elbow+0.050
 knee+0.048
 sister+0.045
 sorry+0.040
 whine+0.038
Neg logit contributions
That-0.078
s-0.076
This-0.075
It-0.072
aw-0.072
Token,
Token ID11
Feature activation-2.51e-03

Pos logit contributions
s+0.013
That+0.013
This+0.012
aw+0.011
It+0.011
Neg logit contributions
 elbow-0.028
 knee-0.027
 sister-0.025
 whine-0.024
 sorry-0.024
Token but
Token ID475
Feature activation+1.04e-03

Pos logit contributions
That+0.003
s+0.002
This+0.002
aw+0.002
It+0.002
Neg logit contributions
 elbow-0.006
 knee-0.006
 sister-0.006
 sorry-0.006
 whine-0.005
Token everyone
Token ID2506
Feature activation-3.11e-02

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 sorry+0.002
 whine+0.002
Neg logit contributions
That-0.001
s-0.001
This-0.001
It-0.001
aw-0.001
Token was
Token ID373
Feature activation+2.95e-03

Pos logit contributions
That+0.033
This+0.032
s+0.031
There+0.031
pill+0.030
Neg logit contributions
 elbow-0.073
 knee-0.072
 sister-0.070
 sorry-0.066
 whine-0.065
Token amazed
Token ID24562
Feature activation-7.95e-02

Pos logit contributions
 knee+0.006
 elbow+0.006
 sister+0.005
 sorry+0.005
 whine+0.005
Neg logit contributions
That-0.005
s-0.005
This-0.004
aw-0.004
rawling-0.004
Token.
Token ID13
Feature activation-8.24e-03

Pos logit contributions
pill+0.155
This+0.148
That+0.144
There+0.142
Inside+0.140
Neg logit contributions
 sister-0.107
 knee-0.099
 sorry-0.093
 elbow-0.083
 singer-0.082
Token\n
Token ID198
Feature activation-5.03e-02

Pos logit contributions
That+0.011
s+0.011
aw+0.011
This+0.010
rawling+0.010
Neg logit contributions
 elbow-0.017
 knee-0.017
 sister-0.016
 sorry-0.016
 911-0.015
Token\n
Token ID198
Feature activation-3.59e-02

Pos logit contributions
aw+0.050
rawling+0.046
That+0.045
pill+0.045
s+0.044
Neg logit contributions
 knee-0.109
 elbow-0.108
 sorry-0.106
 sister-0.104
 Amen-0.101
TokenThe
Token ID464
Feature activation+6.55e-04

Pos logit contributions
aw+0.037
s+0.036
ures+0.035
pill+0.035
rawling+0.034
Neg logit contributions
 knee-0.075
 sorry-0.073
 elbow-0.073
 Amen-0.069
 sister-0.069
Token mighty
Token ID18680
Feature activation+4.01e-02

Pos logit contributions
 elbow+0.002
 knee+0.001
 sister+0.001
 sorry+0.001
 whine+0.001
Neg logit contributions
That-0.001
s-0.001
This-0.001
It-0.001
aw-0.001
Token but
Token ID475
Feature activation+1.37e-02

Pos logit contributions
s+0.011
That+0.011
This+0.010
It+0.010
aw+0.009
Neg logit contributions
 elbow-0.017
 knee-0.016
 sister-0.016
 Amen-0.015
 sorry-0.015
Token the
Token ID262
Feature activation+1.04e-02

Pos logit contributions
 elbow+0.035
 knee+0.034
 sister+0.032
 Amen+0.032
 Acc+0.031
Neg logit contributions
That-0.014
It-0.013
s-0.013
This-0.012
There-0.012
Token nurse
Token ID15849
Feature activation+4.36e-03

Pos logit contributions
 elbow+0.014
 Acc+0.012
 knee+0.012
 sister+0.010
 Kim+0.010
Neg logit contributions
 sky-0.022
A-0.022
That-0.022
s-0.022
It-0.021
Token was
Token ID373
Feature activation-1.54e-02

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.011
 sorry+0.010
 911+0.010
Neg logit contributions
s-0.004
That-0.004
aw-0.004
This-0.003
It-0.003
Token very
Token ID845
Feature activation-8.58e-03

Pos logit contributions
That+0.019
s+0.019
This+0.018
aw+0.017
It+0.017
Neg logit contributions
 elbow-0.035
 knee-0.034
 sister-0.032
 sorry-0.030
 whine-0.030
Token surprised
Token ID6655
Feature activation-7.94e-02

Pos logit contributions
That+0.012
s+0.012
This+0.011
It+0.011
aw+0.011
Neg logit contributions
 elbow-0.018
 knee-0.018
 sister-0.017
 sorry-0.015
 whine-0.015
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
This+0.131
pill+0.128
That+0.126
There+0.124
ures+0.122
Neg logit contributions
 knee-0.130
 sister-0.124
 Kim-0.124
 elbow-0.122
 sorry-0.113
Token shouted
Token ID17293
Feature activation-6.09e-03

Pos logit contributions
That+0.012
s+0.011
This+0.011
It+0.010
aw+0.010
Neg logit contributions
 elbow-0.025
 knee-0.024
 sister-0.023
 whine-0.021
 sorry-0.021
Token "
Token ID366
Feature activation-2.38e-02

Pos logit contributions
s+0.007
That+0.007
aw+0.006
This+0.006
It+0.006
Neg logit contributions
 elbow-0.015
 knee-0.014
 sister-0.013
 whine-0.013
 Amen-0.012
TokenO
Token ID46
Feature activation+2.86e-02

Pos logit contributions
s+0.029
That+0.029
rawling+0.027
aw+0.027
This+0.027
Neg logit contributions
 elbow-0.052
 knee-0.050
 sister-0.047
 whine-0.045
 sorry-0.044
Tokenuch
Token ID794
Feature activation-4.47e-04

Pos logit contributions
 elbow+0.060
 knee+0.058
 sister+0.053
 whine+0.049
 Kim+0.049
Neg logit contributions
That-0.040
This-0.039
There-0.037
It-0.036
 sky-0.035
Tokenling
Token ID1359
Feature activation-2.82e-02

Pos logit contributions
That+0.055
This+0.052
There+0.048
It+0.046
pill+0.046
Neg logit contributions
 knee-0.080
 elbow-0.080
 sister-0.075
 sorry-0.070
 forehead-0.066
Token.
Token ID13
Feature activation-8.89e-03

Pos logit contributions
That+0.036
This+0.033
s+0.033
There+0.031
It+0.031
Neg logit contributions
 elbow-0.063
 knee-0.061
 sister-0.059
 sorry-0.056
 whine-0.054
Token Bob
Token ID5811
Feature activation-1.30e-02

Pos logit contributions
st+0.013
ball+0.013
That+0.013
ky+0.013
if+0.012
Neg logit contributions
 knee-0.015
 elbow-0.015
 sorry-0.015
 Kim-0.014
 whine-0.014
Token was
Token ID373
Feature activation+2.00e-02

Pos logit contributions
ures+0.017
This+0.016
There+0.016
That+0.015
pill+0.015
Neg logit contributions
 sister-0.027
 knee-0.027
 elbow-0.025
 boo-0.024
 whine-0.023
Token so
Token ID523
Feature activation+3.10e-03

Pos logit contributions
 elbow+0.049
 knee+0.046
 sister+0.044
 911+0.042
 whine+0.042
Neg logit contributions
That-0.023
s-0.021
This-0.021
It-0.020
aw-0.020
Token amazed
Token ID24562
Feature activation-7.94e-02

Pos logit contributions
 knee+0.005
 elbow+0.004
 sister+0.004
 sorry+0.004
 whine+0.004
Neg logit contributions
That-0.006
s-0.006
This-0.006
ures-0.006
aw-0.006
Token by
Token ID416
Feature activation+7.35e-03

Pos logit contributions
This+0.136
That+0.134
There+0.122
These+0.121
pill+0.120
Neg logit contributions
 knee-0.123
 sister-0.117
 Kim-0.112
 elbow-0.110
 sorry-0.104
Token the
Token ID262
Feature activation-1.94e-02

Pos logit contributions
 elbow+0.016
 knee+0.015
 sister+0.015
 sorry+0.014
 whine+0.013
Neg logit contributions
That-0.010
s-0.010
This-0.010
aw-0.009
There-0.009
Token pretty
Token ID2495
Feature activation-6.91e-03

Pos logit contributions
That+0.027
s+0.026
This+0.026
aw+0.025
It+0.024
Neg logit contributions
 elbow-0.041
 knee-0.040
 sister-0.038
 sorry-0.036
 whine-0.035
Token lights
Token ID7588
Feature activation-1.61e-02

Pos logit contributions
That+0.009
s+0.009
This+0.008
aw+0.008
It+0.008
Neg logit contributions
 elbow-0.015
 knee-0.015
 sister-0.014
 sorry-0.013
 whine-0.013
Token,
Token ID11
Feature activation-1.33e-04

Pos logit contributions
That+0.020
This+0.019
rawling+0.018
There+0.017
aw+0.017
Neg logit contributions
 elbow-0.036
 knee-0.036
 sister-0.034
 sorry-0.032
 whine-0.031
Token the
Token ID262
Feature activation-9.36e-03

Pos logit contributions
 elbow+0.007
 knee+0.007
 sister+0.006
 sorry+0.006
 whine+0.006
Neg logit contributions
That-0.004
s-0.004
This-0.004
There-0.004
rawling-0.004
Token butterfly
Token ID35113
Feature activation+9.28e-03

Pos logit contributions
That+0.015
s+0.015
This+0.015
aw+0.014
There+0.014
Neg logit contributions
 elbow-0.018
 knee-0.017
 sister-0.017
 sorry-0.015
 boo-0.014
Token?"
Token ID1701
Feature activation+1.77e-02

Pos logit contributions
 elbow+0.021
 knee+0.020
 sister+0.018
 whine+0.017
 sorry+0.017
Neg logit contributions
s-0.012
That-0.012
This-0.012
aw-0.011
It-0.011
Token Jack
Token ID3619
Feature activation-3.74e-03

Pos logit contributions
 elbow+0.034
 knee+0.034
 sister+0.033
 sorry+0.031
 whine+0.029
Neg logit contributions
That-0.026
s-0.025
aw-0.025
This-0.025
rawling-0.024
Token was
Token ID373
Feature activation+1.42e-02

Pos logit contributions
This+0.005
That+0.005
There+0.005
aw+0.005
ures+0.005
Neg logit contributions
 knee-0.008
 elbow-0.008
 sister-0.007
 sorry-0.007
 whine-0.007
Token amazed
Token ID24562
Feature activation-7.94e-02

Pos logit contributions
 knee+0.027
 elbow+0.026
 sorry+0.025
 sister+0.025
 whine+0.023
Neg logit contributions
s-0.023
That-0.023
This-0.022
There-0.021
rawling-0.021
Token and
Token ID290
Feature activation-7.32e-03

Pos logit contributions
pill+0.154
This+0.149
That+0.142
There+0.142
hest+0.140
Neg logit contributions
 knee-0.119
 sister-0.111
 sorry-0.105
 elbow-0.097
 Kim-0.085
Token asked
Token ID1965
Feature activation+1.24e-02

Pos logit contributions
That+0.008
This+0.008
ures+0.008
s+0.007
There+0.007
Neg logit contributions
 elbow-0.017
 knee-0.017
 sister-0.016
 sorry-0.016
 whine-0.015
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
 elbow+0.036
 knee+0.034
 boo+0.031
 whine+0.031
burn+0.031
Neg logit contributions
s-0.010
That-0.009
-0.008
-0.007
 that-0.007
Token "
Token ID366
Feature activation-1.48e-02

Pos logit contributions
 elbow+0.052
 knee+0.049
 sister+0.046
 whine+0.043
 911+0.042
Neg logit contributions
That-0.026
s-0.024
It-0.024
This-0.023
There-0.021
TokenWhy
Token ID5195
Feature activation-2.23e-03

Pos logit contributions
s+0.019
ures+0.016
aw+0.016
rawling+0.016
 holes+0.015
Neg logit contributions
 knee-0.030
 elbow-0.030
 sister-0.027
 whine-0.026
 Amen-0.026
Token every
Token ID790
Feature activation-9.01e-03

Pos logit contributions
 elbow+0.027
 knee+0.027
 sister+0.026
 sorry+0.024
 whine+0.023
Neg logit contributions
That-0.012
This-0.011
s-0.011
It-0.010
aw-0.010
Token day
Token ID1110
Feature activation-8.36e-03

Pos logit contributions
That+0.010
s+0.009
This+0.009
st+0.009
 sky+0.009
Neg logit contributions
 elbow-0.022
 knee-0.020
 sorry-0.020
 Kim-0.019
 911-0.019
Token.
Token ID13
Feature activation-2.90e-03

Pos logit contributions
That+0.007
s+0.007
This+0.007
aw+0.006
It+0.006
Neg logit contributions
 elbow-0.022
 knee-0.022
 sister-0.021
 sorry-0.020
 whine-0.019
Token She
Token ID1375
Feature activation-1.21e-02

Pos logit contributions
That+0.002
s+0.002
This+0.002
aw+0.002
It+0.002
Neg logit contributions
 elbow-0.008
 knee-0.008
 sister-0.007
 sorry-0.007
 whine-0.007
Token liked
Token ID8288
Feature activation-4.14e-03

Pos logit contributions
That+0.012
s+0.012
This+0.011
aw+0.011
It+0.010
Neg logit contributions
 elbow-0.031
 knee-0.030
 sister-0.029
 sorry-0.027
 whine-0.027
Token to
Token ID284
Feature activation+5.68e-02

Pos logit contributions
That+0.005
s+0.005
This+0.005
It+0.004
There+0.004
Neg logit contributions
 elbow-0.010
 knee-0.009
 sister-0.009
 sorry-0.008
 whine-0.008
Token pretend
Token ID16614
Feature activation-2.23e-02

Pos logit contributions
 elbow+0.108
 911+0.102
 knee+0.101
 sorry+0.097
 sister+0.096
Neg logit contributions
This-0.083
That-0.080
It-0.076
A-0.074
s-0.073
Token that
Token ID326
Feature activation+1.08e-02

Pos logit contributions
That+0.029
s+0.029
This+0.027
aw+0.026
It+0.026
Neg logit contributions
 elbow-0.050
 knee-0.048
 sister-0.046
 sorry-0.043
 whine-0.043
Token it
Token ID340
Feature activation-5.09e-02

Pos logit contributions
 elbow+0.023
 knee+0.023
 sister+0.021
 sorry+0.021
 whine+0.020
Neg logit contributions
That-0.015
s-0.015
This-0.014
It-0.013
aw-0.013
Token was
Token ID373
Feature activation-6.16e-03

Pos logit contributions
aw+0.063
That+0.056
rawling+0.054
oph+0.053
ised+0.052
Neg logit contributions
 sister-0.108
 elbow-0.108
 knee-0.104
 boo-0.096
 Kim-0.092
Token a
Token ID257
Feature activation+1.65e-03

Pos logit contributions
That+0.007
s+0.006
This+0.006
It+0.006
aw+0.006
Neg logit contributions
 elbow-0.015
 knee-0.015
 sister-0.014
 whine-0.013
 sorry-0.013
Token little
Token ID1310
Feature activation+3.78e-03

Pos logit contributions
 elbow+0.009
 knee+0.009
 sister+0.009
 sorry+0.008
 whine+0.008
Neg logit contributions
That-0.005
s-0.005
This-0.005
aw-0.004
It-0.004
Token house
Token ID2156
Feature activation+3.09e-02

Pos logit contributions
 sister+0.007
 knee+0.007
 elbow+0.007
 whine+0.006
 boo+0.006
Neg logit contributions
That-0.006
This-0.006
ures-0.006
rawling-0.005
It-0.005
Token,
Token ID11
Feature activation+2.30e-02

Pos logit contributions
 elbow+0.082
 knee+0.075
 boo+0.070
 whine+0.069
 forehead+0.068
Neg logit contributions
s-0.032
That-0.030
This-0.029
-0.027
 that-0.026
Token lived
Token ID5615
Feature activation+1.66e-02

Pos logit contributions
 elbow+0.062
 knee+0.060
 sister+0.056
 whine+0.053
 sorry+0.053
Neg logit contributions
That-0.022
s-0.020
This-0.020
It-0.019
There-0.018
Token a
Token ID257
Feature activation+3.51e-02

Pos logit contributions
 elbow+0.042
 knee+0.040
 sister+0.038
 whine+0.036
 sorry+0.035
Neg logit contributions
That-0.018
This-0.017
s-0.017
It-0.016
There-0.016
Token girl
Token ID2576
Feature activation+5.84e-02

Pos logit contributions
 elbow+0.098
 knee+0.090
 911+0.085
 whine+0.085
 apology+0.081
Neg logit contributions
That-0.030
It-0.029
s-0.028
A-0.026
This-0.024
Token named
Token ID3706
Feature activation+4.39e-02

Pos logit contributions
 elbow+0.097
 knee+0.087
 sister+0.077
 whine+0.075
 sorry+0.075
Neg logit contributions
s-0.115
That-0.114
aw-0.111
This-0.110
It-0.106
Token Sue
Token ID26089
Feature activation+1.16e-02

Pos logit contributions
 elbow+0.074
 knee+0.069
 sister+0.059
 sorry+0.055
 boo+0.055
Neg logit contributions
That-0.088
s-0.086
This-0.084
It-0.081
There-0.081
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 elbow+0.026
 knee+0.025
 sister+0.025
 sorry+0.022
 whine+0.022
Neg logit contributions
That-0.015
This-0.013
s-0.013
There-0.013
rawling-0.013
Token Sue
Token ID26089
Feature activation-9.21e-03

Pos logit contributions
 elbow+0.014
 knee+0.013
 sister+0.012
 sorry+0.011
 whine+0.010
Neg logit contributions
That-0.025
s-0.025
This-0.024
aw-0.024
It-0.024
Token had
Token ID550
Feature activation-4.25e-03

Pos logit contributions
That+0.010
ures+0.010
aw+0.009
s+0.009
pill+0.009
Neg logit contributions
 knee-0.021
 sister-0.021
 elbow-0.021
 boo-0.018
 sorry-0.018
Token named
Token ID3706
Feature activation+4.18e-02

Pos logit contributions
 elbow+0.095
 knee+0.084
 911+0.076
 sorry+0.074
 sister+0.074
Neg logit contributions
s-0.096
That-0.092
This-0.089
aw-0.089
st-0.085
Token Lily
Token ID20037
Feature activation+2.21e-02

Pos logit contributions
 elbow+0.074
 knee+0.070
 sister+0.062
 whine+0.057
 sorry+0.057
Neg logit contributions
That-0.079
s-0.077
This-0.075
It-0.073
There-0.072
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
 elbow+0.050
 knee+0.049
 sister+0.047
 whine+0.043
 sorry+0.042
Neg logit contributions
That-0.027
s-0.025
This-0.025
pill-0.025
aw-0.025
Token She
Token ID1375
Feature activation+1.65e-03

Pos logit contributions
 elbow+0.033
 knee+0.032
 sister+0.030
 sorry+0.028
 whine+0.028
Neg logit contributions
That-0.018
s-0.017
This-0.016
It-0.015
aw-0.015
Token loved
Token ID6151
Feature activation+1.06e-02

Pos logit contributions
 elbow+0.004
 knee+0.004
 sister+0.004
 sorry+0.003
 boo+0.003
Neg logit contributions
That-0.002
s-0.002
This-0.002
aw-0.002
ures-0.002
Token to
Token ID284
Feature activation+5.60e-02

Pos logit contributions
 elbow+0.025
 knee+0.024
 sister+0.022
 sorry+0.022
 whine+0.021
Neg logit contributions
That-0.013
s-0.013
This-0.012
It-0.012
aw-0.011
Token play
Token ID711
Feature activation+2.63e-02

Pos logit contributions
 elbow+0.116
 knee+0.108
 911+0.103
 sorry+0.100
 sister+0.099
Neg logit contributions
This-0.078
That-0.074
A-0.072
s-0.071
There-0.070
Token outside
Token ID2354
Feature activation+2.25e-02

Pos logit contributions
 elbow+0.055
 knee+0.053
 sister+0.051
 whine+0.047
 sorry+0.047
Neg logit contributions
That-0.039
s-0.037
This-0.037
It-0.035
aw-0.035
Token and
Token ID290
Feature activation+2.24e-02

Pos logit contributions
 elbow+0.056
 knee+0.053
 sister+0.050
 sorry+0.049
 whine+0.048
Neg logit contributions
s-0.025
That-0.025
This-0.023
It-0.021
There-0.021
Token explore
Token ID7301
Feature activation-2.30e-02

Pos logit contributions
 elbow+0.045
 knee+0.044
 sister+0.039
 911+0.038
 sorry+0.038
Neg logit contributions
That-0.036
s-0.035
This-0.034
It-0.033
There-0.031
Token the
Token ID262
Feature activation-4.72e-02

Pos logit contributions
That+0.034
This+0.031
s+0.031
There+0.030
rawling+0.030
Neg logit contributions
 elbow-0.044
 knee-0.043
 sister-0.042
 911-0.039
 sorry-0.038
Token things
Token ID1243
Feature activation-2.36e-02

Pos logit contributions
 elbow+0.009
 knee+0.009
 sister+0.009
 sorry+0.008
 whine+0.008
Neg logit contributions
That-0.008
This-0.008
s-0.007
It-0.007
There-0.007
Tokenâ
Token ID22940
Feature activation+1.46e-02

Pos logit contributions
That+0.035
This+0.033
aw+0.031
urious+0.031
A+0.031
Neg logit contributions
 elbow-0.045
 knee-0.044
 sister-0.042
 sorry-0.041
 whine-0.038
Token€
Token ID26391
Feature activation+5.24e-02

Pos logit contributions
 elbow+0.027
 knee+0.026
 sister+0.025
 sorry+0.023
 whine+0.022
Neg logit contributions
That-0.025
s-0.024
This-0.023
aw-0.022
It-0.022
Token�
Token ID447
Feature activation+1.83e-02

Pos logit contributions
 elbow+0.095
 knee+0.093
 sister+0.093
 Kim+0.085
 sorry+0.080
Neg logit contributions
That-0.081
This-0.074
aw-0.068
ures-0.066
It-0.066
Token�
Token ID251
Feature activation+8.98e-03

Pos logit contributions
 elbow+0.037
 knee+0.035
 sister+0.033
 sorry+0.032
 whine+0.029
Neg logit contributions
That-0.027
This-0.024
s-0.024
aw-0.024
It-0.023
Tokencol
Token ID4033
Feature activation+6.03e-02

Pos logit contributions
 elbow+0.022
 knee+0.021
 sister+0.020
 sorry+0.019
 whine+0.018
Neg logit contributions
That-0.010
s-0.010
This-0.009
aw-0.009
It-0.009
Tokenouring
Token ID21823
Feature activation-2.65e-02

Pos logit contributions
 elbow+0.133
 knee+0.131
 sister+0.129
 sorry+0.125
 whine+0.116
Neg logit contributions
s-0.075
That-0.075
aw-0.070
This-0.068
rawling-0.067
Token books
Token ID3835
Feature activation-9.64e-03

Pos logit contributions
That+0.038
This+0.035
s+0.035
There+0.035
rawling+0.034
Neg logit contributions
 elbow-0.055
 knee-0.054
 sister-0.052
 sorry-0.048
 whine-0.046
Token,
Token ID11
Feature activation+2.54e-02

Pos logit contributions
That+0.013
This+0.013
rawling+0.012
aw+0.012
A+0.012
Neg logit contributions
 elbow-0.020
 knee-0.020
 sister-0.018
 sorry-0.018
 whine-0.017
Token stickers
Token ID28568
Feature activation-2.70e-03

Pos logit contributions
 elbow+0.059
 knee+0.057
 sister+0.054
 911+0.053
 sorry+0.052
Neg logit contributions
That-0.030
s-0.029
This-0.028
It-0.027
aw-0.027
Token and
Token ID290
Feature activation+8.15e-03

Pos logit contributions
That+0.004
This+0.004
rawling+0.003
urious+0.003
A+0.003
Neg logit contributions
 elbow-0.006
 knee-0.006
 sister-0.005
 sorry-0.005
 Kim-0.005
Token glad
Token ID9675
Feature activation-3.47e-02

Pos logit contributions
 elbow+0.034
 knee+0.031
 911+0.028
 sister+0.027
 whine+0.026
Neg logit contributions
s-0.026
That-0.025
It-0.023
This-0.023
aw-0.022
Token they
Token ID484
Feature activation-2.23e-02

Pos logit contributions
That+0.036
s+0.034
This+0.032
aw+0.031
It+0.030
Neg logit contributions
 elbow-0.088
 knee-0.085
 sister-0.081
 sorry-0.077
 whine-0.076
Token got
Token ID1392
Feature activation+1.38e-02

Pos logit contributions
That+0.021
s+0.020
This+0.019
pill+0.019
rawling+0.019
Neg logit contributions
 elbow-0.056
 knee-0.054
 sister-0.053
 sorry-0.050
 whine-0.049
Token their
Token ID511
Feature activation+1.63e-02

Pos logit contributions
 elbow+0.034
 knee+0.032
 sister+0.029
 Amen+0.029
 911+0.029
Neg logit contributions
s-0.017
That-0.015
aw-0.015
This-0.014
It-0.014
Token own
Token ID898
Feature activation+1.92e-02

Pos logit contributions
 elbow+0.035
 knee+0.034
 sister+0.031
 sorry+0.031
 whine+0.030
Neg logit contributions
s-0.023
That-0.022
This-0.021
It-0.020
There-0.020
Token z
Token ID1976
Feature activation+5.60e-02

Pos logit contributions
 elbow+0.042
 knee+0.040
 sister+0.038
 sorry+0.037
 whine+0.037
Neg logit contributions
s-0.026
That-0.025
This-0.024
It-0.023
aw-0.023
Tokenig
Token ID328
Feature activation+3.29e-02

Pos logit contributions
 whine+0.103
 elbow+0.102
 sorry+0.100
 knee+0.099
 sister+0.098
Neg logit contributions
aw-0.085
s-0.085
st-0.081
That-0.079
ures-0.076
Tokenzag
Token ID50183
Feature activation+1.42e-02

Pos logit contributions
 elbow+0.059
 knee+0.055
 sister+0.053
 sorry+0.050
 whine+0.050
Neg logit contributions
s-0.058
That-0.056
This-0.053
aw-0.052
It-0.052
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
 elbow+0.030
 knee+0.029
 sister+0.027
 sorry+0.026
 whine+0.026
Neg logit contributions
s-0.020
That-0.019
This-0.018
aw-0.018
It-0.017
Token It
Token ID632
Feature activation-3.63e-02

Pos logit contributions
 elbow+0.033
 knee+0.033
 sister+0.032
 sorry+0.030
 whine+0.029
Neg logit contributions
That-0.021
s-0.021
This-0.020
aw-0.020
There-0.019
Token looked
Token ID3114
Feature activation-1.57e-02

Pos logit contributions
ures+0.034
That+0.034
aw+0.033
This+0.033
rawling+0.032
Neg logit contributions
 elbow-0.089
 knee-0.086
 sister-0.086
 boo-0.079
 sorry-0.079
Token and
Token ID290
Feature activation+9.72e-04

Pos logit contributions
That+0.014
s+0.014
aw+0.012
This+0.012
It+0.012
Neg logit contributions
 elbow-0.047
 knee-0.045
 sister-0.043
 whine-0.041
 sorry-0.041
Token reached
Token ID4251
Feature activation-1.23e-02

Pos logit contributions
 elbow+0.002
 knee+0.002
 sister+0.002
 sorry+0.002
 whine+0.002
Neg logit contributions
That-0.001
s-0.001
This-0.001
aw-0.001
It-0.001
Token for
Token ID329
Feature activation+2.14e-02

Pos logit contributions
s+0.015
+0.014
This+0.013
 me+0.013
 sky+0.013
Neg logit contributions
 elbow-0.028
 knee-0.027
 Acc-0.025
 Amen-0.024
 whine-0.024
Token their
Token ID511
Feature activation+1.44e-02

Pos logit contributions
 elbow+0.046
 knee+0.045
 sister+0.042
 whine+0.039
 sorry+0.039
Neg logit contributions
That-0.029
s-0.028
This-0.028
There-0.027
It-0.026
Token fingers
Token ID9353
Feature activation-3.01e-03

Pos logit contributions
 elbow+0.025
 knee+0.025
 sorry+0.023
 Amen+0.023
 911+0.023
Neg logit contributions
s-0.024
That-0.022
This-0.021
 might-0.021
 sky-0.020
Token.
Token ID13
Feature activation+2.31e-02

Pos logit contributions
That+0.004
This+0.004
aw+0.003
s+0.003
It+0.003
Neg logit contributions
 elbow-0.007
 knee-0.007
 sister-0.006
 sorry-0.006
 whine-0.006
Token Lily
Token ID20037
Feature activation-6.45e-03

Pos logit contributions
 elbow+0.057
 knee+0.055
 sister+0.052
 whine+0.049
 sorry+0.048
Neg logit contributions
That-0.026
s-0.025
This-0.024
It-0.023
There-0.022
Token and
Token ID290
Feature activation-2.83e-03

Pos logit contributions
That+0.009
This+0.009
ures+0.008
s+0.008
There+0.008
Neg logit contributions
 elbow-0.014
 knee-0.014
 sister-0.013
 boo-0.012
 whine-0.012
Token Tom
Token ID4186
Feature activation+1.92e-04

Pos logit contributions
That+0.004
s+0.004
This+0.004
aw+0.004
It+0.004
Neg logit contributions
 knee-0.006
 elbow-0.006
 sister-0.005
 sorry-0.005
 Amen-0.005
Token felt
Token ID2936
Feature activation+1.02e-02

Pos logit contributions
 elbow+0.000
 knee+0.000
 sister+0.000
 whine+0.000
 sorry+0.000
Neg logit contributions
That-0.000
This-0.000
s-0.000
It-0.000
rawling-0.000
Token a
Token ID257
Feature activation+1.06e-02

Pos logit contributions
 elbow+0.017
 knee+0.017
 sister+0.016
 sorry+0.014
 whine+0.014
Neg logit contributions
That-0.019
s-0.018
This-0.018
aw-0.017
It-0.017
Token a
Token ID257
Feature activation-9.55e-03

Pos logit contributions
That+0.024
s+0.024
rawling+0.022
aw+0.022
ures+0.022
Neg logit contributions
 elbow-0.035
 knee-0.035
 sister-0.032
 sorry-0.031
 boo-0.029
Token time
Token ID640
Feature activation+2.15e-02

Pos logit contributions
That+0.014
ures+0.013
pill+0.013
als+0.013
s+0.013
Neg logit contributions
 knee-0.018
 elbow-0.018
 sister-0.017
 sorry-0.016
 whine-0.016
Token,
Token ID11
Feature activation+1.70e-02

Pos logit contributions
 elbow+0.047
 knee+0.045
 sister+0.042
 sorry+0.040
 whine+0.040
Neg logit contributions
That-0.030
s-0.029
This-0.027
aw-0.026
It-0.026
Token there
Token ID612
Feature activation-1.63e-02

Pos logit contributions
 elbow+0.043
 knee+0.042
 sister+0.040
 sorry+0.038
 whine+0.037
Neg logit contributions
That-0.018
s-0.017
This-0.016
It-0.015
aw-0.015
Token was
Token ID373
Feature activation+1.85e-02

Pos logit contributions
ures+0.016
aw+0.016
pill+0.016
That+0.016
aut+0.015
Neg logit contributions
 knee-0.038
 elbow-0.038
 sister-0.037
 sorry-0.033
 Kim-0.033
Token a
Token ID257
Feature activation+2.22e-02

Pos logit contributions
 elbow+0.043
 knee+0.042
 sister+0.040
 sorry+0.037
 whine+0.036
Neg logit contributions
That-0.023
s-0.022
This-0.021
aw-0.020
It-0.019
Token little
Token ID1310
Feature activation+2.40e-02

Pos logit contributions
 elbow+0.063
 knee+0.059
 911+0.055
 whine+0.055
 sister+0.054
Neg logit contributions
That-0.018
s-0.016
It-0.016
This-0.015
A-0.014
Token pig
Token ID12967
Feature activation+3.92e-02

Pos logit contributions
 elbow+0.054
 knee+0.049
 whine+0.046
 911+0.045
 sorry+0.043
Neg logit contributions
s-0.033
That-0.031
aw-0.029
It-0.029
ball-0.028
Token named
Token ID3706
Feature activation+3.57e-02

Pos logit contributions
 elbow+0.048
 knee+0.047
 sister+0.043
 sorry+0.037
 whine+0.036
Neg logit contributions
That-0.090
s-0.087
This-0.086
aw-0.084
It-0.084
Token O
Token ID440
Feature activation+2.28e-02

Pos logit contributions
 elbow+0.048
 knee+0.048
 sister+0.046
 sorry+0.043
 911+0.040
Neg logit contributions
That-0.074
aw-0.073
s-0.072
rawling-0.071
This-0.069
Tokenink
Token ID676
Feature activation-8.61e-03

Pos logit contributions
 knee+0.045
 elbow+0.044
 sister+0.041
 sorry+0.039
 babys+0.037
Neg logit contributions
That-0.034
This-0.030
ures-0.029
 sky-0.027
pill-0.027
Token share
Token ID2648
Feature activation-1.34e-02

Pos logit contributions
 elbow+0.047
 knee+0.043
 sister+0.040
 911+0.038
 sorry+0.036
Neg logit contributions
That-0.042
This-0.040
It-0.038
s-0.038
rawling-0.037
Token and
Token ID290
Feature activation+3.88e-02

Pos logit contributions
That+0.018
s+0.017
This+0.017
aw+0.016
It+0.016
Neg logit contributions
 elbow-0.029
 knee-0.028
 sister-0.027
 sorry-0.025
 whine-0.025
Token be
Token ID307
Feature activation+3.04e-02

Pos logit contributions
 elbow+0.078
 knee+0.075
 911+0.069
 sister+0.068
 sorry+0.065
Neg logit contributions
That-0.056
s-0.055
This-0.054
It-0.051
A-0.050
Token nice
Token ID3621
Feature activation-1.87e-02

Pos logit contributions
 elbow+0.076
 knee+0.071
 sister+0.065
 whine+0.064
 babys+0.064
Neg logit contributions
That-0.036
It-0.034
This-0.034
aw-0.033
A-0.032
Token,"
Token ID553
Feature activation+2.46e-02

Pos logit contributions
s+0.022
That+0.020
aw+0.020
This+0.019
It+0.018
Neg logit contributions
 elbow-0.044
 knee-0.042
 sister-0.040
 sorry-0.039
 whine-0.038
Token Mom
Token ID11254
Feature activation+1.70e-02

Pos logit contributions
 elbow+0.043
 knee+0.041
 sister+0.036
 911+0.036
 appointment+0.034
Neg logit contributions
That-0.045
s-0.044
This-0.042
It-0.042
There-0.040
Token says
Token ID1139
Feature activation+3.23e-02

Pos logit contributions
 elbow+0.037
 knee+0.037
 sister+0.035
 sorry+0.034
 whine+0.033
Neg logit contributions
That-0.022
This-0.021
s-0.020
There-0.020
It-0.019
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
 elbow+0.098
 knee+0.092
 sister+0.092
 whine+0.088
 babys+0.087
Neg logit contributions
s-0.019
That-0.010
aw-0.009
It-0.008
 that-0.007
Token "
Token ID366
Feature activation-2.51e-02

Pos logit contributions
 knee+0.029
 elbow+0.029
 sister+0.027
 sorry+0.027
 whine+0.024
Neg logit contributions
This-0.023
That-0.023
aut-0.023
ures-0.023
rawling-0.023
TokenP
Token ID47
Feature activation+2.46e-02

Pos logit contributions
s+0.027
aw+0.023
ures+0.023
rawling+0.023
That+0.022
Neg logit contributions
 elbow-0.058
 knee-0.058
 sister-0.053
 sorry-0.051
 boo-0.050
Tokenill
Token ID359
Feature activation-8.94e-03

Pos logit contributions
 elbow+0.053
 knee+0.051
 sister+0.049
 sorry+0.045
 whine+0.044
Neg logit contributions
That-0.035
s-0.033
This-0.033
It-0.031
There-0.030
Token the
Token ID262
Feature activation-3.92e-02

Pos logit contributions
That+0.017
s+0.016
This+0.016
aw+0.015
It+0.015
Neg logit contributions
 elbow-0.026
 knee-0.025
 sister-0.024
 sorry-0.023
 911-0.022
Token icy
Token ID30284
Feature activation+3.26e-03

Pos logit contributions
That+0.048
s+0.047
ures+0.043
This+0.041
aw+0.041
Neg logit contributions
 knee-0.083
 sister-0.081
 elbow-0.077
 911-0.074
 boo-0.073
Token mountain
Token ID8598
Feature activation+5.54e-03

Pos logit contributions
 knee+0.007
 elbow+0.007
 sister+0.006
 whine+0.006
 sorry+0.006
Neg logit contributions
That-0.005
This-0.004
s-0.004
There-0.004
It-0.004
Token behind
Token ID2157
Feature activation-1.66e-02

Pos logit contributions
 knee+0.012
 elbow+0.011
 sister+0.011
 sorry+0.010
 whine+0.010
Neg logit contributions
That-0.008
This-0.007
It-0.007
s-0.007
aw-0.007
Token her
Token ID607
Feature activation+2.56e-03

Pos logit contributions
That+0.029
This+0.028
s+0.028
It+0.027
aw+0.027
Neg logit contributions
 elbow-0.030
 knee-0.029
 sister-0.027
 sorry-0.025
 whine-0.025
Token house
Token ID2156
Feature activation+2.58e-02

Pos logit contributions
 knee+0.005
 elbow+0.005
 sister+0.005
 sorry+0.004
 911+0.004
Neg logit contributions
That-0.004
This-0.004
aw-0.004
It-0.004
There-0.004
Token.
Token ID13
Feature activation-3.24e-03

Pos logit contributions
 elbow+0.066
 knee+0.064
 sister+0.061
 sorry+0.058
 whine+0.057
Neg logit contributions
That-0.026
s-0.024
This-0.023
aw-0.022
It-0.022
Token So
Token ID1406
Feature activation+2.19e-02

Pos logit contributions
That+0.004
s+0.004
aw+0.004
This+0.004
ures+0.004
Neg logit contributions
 elbow-0.007
 knee-0.007
 sister-0.007
 sorry-0.006
 whine-0.006
Token she
Token ID673
Feature activation-1.21e-02

Pos logit contributions
 elbow+0.061
 knee+0.058
 911+0.053
 boo+0.052
 whine+0.052
Neg logit contributions
s-0.021
That-0.019
 that-0.016
This-0.016
A-0.016
Token put
Token ID1234
Feature activation+2.37e-02

Pos logit contributions
That+0.016
s+0.015
This+0.015
aw+0.014
It+0.014
Neg logit contributions
 elbow-0.027
 knee-0.026
 sister-0.025
 sorry-0.023
 whine-0.023
Token on
Token ID319
Feature activation+2.07e-02

Pos logit contributions
 elbow+0.049
 knee+0.049
 sister+0.048
 whine+0.045
 sorry+0.044
Neg logit contributions
That-0.034
This-0.033
It-0.031
There-0.031
s-0.031
Token.
Token ID13
Feature activation+3.22e-03

Pos logit contributions
That+0.002
s+0.002
This+0.002
aw+0.002
There+0.002
Neg logit contributions
 elbow-0.004
 knee-0.004
 sister-0.004
 sorry-0.004
 whine-0.004
Token<|endoftext|>
Token ID50256
Feature activation-3.04e-02

Pos logit contributions
 elbow+0.007
 knee+0.007
 sister+0.007
 sorry+0.007
 911+0.006
Neg logit contributions
That-0.004
s-0.004
aw-0.003
This-0.003
There-0.003
TokenOnce
Token ID7454
Feature activation-1.75e-02

Pos logit contributions
That+0.032
ized+0.030
aw+0.030
rawling+0.030
aut+0.030
Neg logit contributions
 elbow-0.065
 knee-0.065
 sister-0.063
 sorry-0.059
 Kim-0.058
Token upon
Token ID2402
Feature activation-1.09e-02

Pos logit contributions
That+0.022
This+0.021
aw+0.020
ures+0.019
There+0.018
Neg logit contributions
 elbow-0.039
 sister-0.039
 knee-0.039
 sorry-0.036
 911-0.034
Token a
Token ID257
Feature activation-1.57e-03

Pos logit contributions
s+0.015
That+0.015
aw+0.013
rawling+0.013
This+0.013
Neg logit contributions
 elbow-0.023
 knee-0.023
 sister-0.021
 sorry-0.020
 whine-0.019
Token time
Token ID640
Feature activation+2.44e-02

Pos logit contributions
That+0.002
ures+0.002
s+0.002
This+0.002
pill+0.002
Neg logit contributions
 knee-0.003
 elbow-0.003
 sister-0.003
 sorry-0.003
 whine-0.003
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
 elbow+0.053
 knee+0.051
 sister+0.048
 whine+0.045
 sorry+0.045
Neg logit contributions
That-0.033
s-0.033
This-0.031
It-0.030
aw-0.030
Token in
Token ID287
Feature activation+1.33e-02

Pos logit contributions
 elbow+0.063
 knee+0.061
 sister+0.058
 whine+0.055
 sorry+0.055
Neg logit contributions
That-0.025
s-0.024
This-0.022
It-0.021
There-0.021
Token a
Token ID257
Feature activation+4.57e-03

Pos logit contributions
 elbow+0.027
 knee+0.027
 sister+0.026
 sorry+0.024
 whine+0.023
Neg logit contributions
s-0.020
That-0.019
aw-0.017
aut-0.017
rawling-0.017
Token small
Token ID1402
Feature activation+9.07e-03

Pos logit contributions
 knee+0.010
 sister+0.010
 elbow+0.010
 sorry+0.009
 boo+0.009
Neg logit contributions
s-0.005
That-0.005
ures-0.005
pill-0.005
aw-0.005
Token town
Token ID3240
Feature activation+2.73e-02

Pos logit contributions
 elbow+0.018
 knee+0.017
 sister+0.017
 sorry+0.015
 whine+0.015
Neg logit contributions
That-0.014
s-0.013
This-0.013
aw-0.013
It-0.013
Token many
Token ID867
Feature activation-1.44e-02

Pos logit contributions
That+0.018
s+0.018
This+0.017
aw+0.017
It+0.016
Neg logit contributions
 elbow-0.041
 knee-0.040
 sister-0.037
 whine-0.036
 sorry-0.035
Token buttons
Token ID12163
Feature activation+4.75e-03

Pos logit contributions
That+0.020
s+0.019
This+0.018
aw+0.018
It+0.017
Neg logit contributions
 elbow-0.031
 knee-0.030
 sister-0.029
 sorry-0.027
 whine-0.026
Token and
Token ID290
Feature activation+1.29e-03

Pos logit contributions
 elbow+0.011
 knee+0.011
 sister+0.010
 sorry+0.010
 whine+0.009
Neg logit contributions
That-0.006
This-0.005
s-0.005
rawling-0.005
It-0.005
Token levers
Token ID48872
Feature activation-5.90e-03

Pos logit contributions
 elbow+0.003
 knee+0.003
 sister+0.003
 sorry+0.002
 whine+0.002
Neg logit contributions
s-0.002
That-0.002
This-0.002
aw-0.002
It-0.002
Token.
Token ID13
Feature activation+7.23e-03

Pos logit contributions
That+0.007
This+0.006
s+0.006
rawling+0.006
There+0.006
Neg logit contributions
 elbow-0.014
 knee-0.014
 sister-0.013
 sorry-0.012
 whine-0.012
Token They
Token ID1119
Feature activation-1.46e-02

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.017
 sorry+0.016
 whine+0.016
Neg logit contributions
That-0.007
s-0.007
This-0.006
aw-0.006
It-0.006
Token press
Token ID1803
Feature activation-2.23e-02

Pos logit contributions
That+0.026
s+0.025
This+0.024
aw+0.023
It+0.023
Neg logit contributions
 elbow-0.026
 knee-0.025
 sister-0.024
 sorry-0.022
 whine-0.021
Token and
Token ID290
Feature activation-1.94e-04

Pos logit contributions
That+0.031
This+0.028
s+0.028
rawling+0.028
It+0.027
Neg logit contributions
 elbow-0.047
 knee-0.046
 sister-0.044
 sorry-0.042
 911-0.040
Token pull
Token ID2834
Feature activation-1.05e-02

Pos logit contributions
s+0.000
That+0.000
This+0.000
It+0.000
aw+0.000
Neg logit contributions
 elbow-0.000
 knee-0.000
 sister-0.000
 sorry-0.000
 babys-0.000
Token them
Token ID606
Feature activation-8.28e-03

Pos logit contributions
That+0.012
This+0.011
s+0.011
aw+0.011
rawling+0.011
Neg logit contributions
 elbow-0.024
 knee-0.024
 sister-0.023
 sorry-0.022
 911-0.021
Token.
Token ID13
Feature activation+5.80e-03

Pos logit contributions
That+0.011
s+0.011
This+0.010
aw+0.010
It+0.010
Neg logit contributions
 elbow-0.019
 knee-0.018
 sister-0.017
 sorry-0.016
 whine-0.016
Token old
Token ID1468
Feature activation+1.49e-02

Pos logit contributions
s+0.003
That+0.002
aw+0.002
 sky+0.002
This+0.002
Neg logit contributions
 elbow-0.010
 knee-0.009
 sister-0.008
 whine-0.008
 sorry-0.008
Token.
Token ID13
Feature activation-3.88e-03

Pos logit contributions
 elbow+0.040
 knee+0.039
 sister+0.037
 sorry+0.036
 whine+0.035
Neg logit contributions
That-0.012
s-0.012
aw-0.011
This-0.011
It-0.010
Token Claire
Token ID20758
Feature activation-1.63e-03

Pos logit contributions
aw+0.005
s+0.005
ball+0.005
That+0.005
st+0.004
Neg logit contributions
 knee-0.008
 sister-0.008
 sorry-0.008
 elbow-0.008
 whine-0.007
Token wanted
Token ID2227
Feature activation-1.38e-02

Pos logit contributions
That+0.002
This+0.002
There+0.002
ures+0.002
pill+0.002
Neg logit contributions
 knee-0.003
 sister-0.003
 elbow-0.003
 sorry-0.003
 whine-0.003
Token something
Token ID1223
Feature activation-3.36e-02

Pos logit contributions
That+0.013
s+0.013
This+0.013
It+0.012
A+0.011
Neg logit contributions
 elbow-0.037
 knee-0.035
 sister-0.033
 sorry-0.032
 whine-0.031
Token special
Token ID2041
Feature activation-3.00e-02

Pos logit contributions
That+0.043
s+0.040
This+0.039
aw+0.038
There+0.037
Neg logit contributions
 elbow-0.075
 knee-0.074
 sister-0.071
 sorry-0.067
 whine-0.065
Token to
Token ID284
Feature activation+3.79e-03

Pos logit contributions
That+0.038
s+0.037
This+0.035
aw+0.034
It+0.034
Neg logit contributions
 elbow-0.068
 knee-0.066
 sister-0.063
 sorry-0.059
 whine-0.058
Token eat
Token ID4483
Feature activation-1.41e-02

Pos logit contributions
 elbow+0.008
 knee+0.007
 sister+0.007
 sorry+0.006
 whine+0.006
Neg logit contributions
That-0.006
This-0.006
s-0.006
It-0.005
aw-0.005
Token,
Token ID11
Feature activation+5.89e-03

Pos logit contributions
That+0.017
s+0.016
This+0.015
aw+0.015
It+0.014
Neg logit contributions
 elbow-0.033
 knee-0.032
 sister-0.031
 sorry-0.029
 whine-0.028
Token so
Token ID523
Feature activation+1.49e-02

Pos logit contributions
 elbow+0.015
 knee+0.014
 sister+0.014
 sorry+0.013
 whine+0.013
Neg logit contributions
That-0.006
s-0.006
This-0.005
It-0.005
aw-0.005
Token she
Token ID673
Feature activation-2.11e-02

Pos logit contributions
 elbow+0.032
 knee+0.031
 sister+0.028
 911+0.028
 boo+0.028
Neg logit contributions
s-0.021
That-0.021
This-0.019
It-0.019
A-0.018
Token louder
Token ID27089
Feature activation-2.46e-02

Pos logit contributions
That+0.044
s+0.043
This+0.041
It+0.041
aw+0.040
Neg logit contributions
 elbow-0.055
 knee-0.053
 sister-0.050
 sorry-0.045
 whine-0.045
Token.
Token ID13
Feature activation-4.57e-03

Pos logit contributions
That+0.033
This+0.031
rawling+0.030
pill+0.029
There+0.029
Neg logit contributions
 knee-0.054
 elbow-0.053
 sister-0.050
 sorry-0.048
 whine-0.046
Token Suddenly
Token ID24975
Feature activation-4.69e-03

Pos logit contributions
That+0.005
s+0.005
This+0.005
aw+0.005
It+0.005
Neg logit contributions
 elbow-0.011
 knee-0.011
 sister-0.010
 whine-0.009
 sorry-0.009
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
s+0.005
That+0.005
aw+0.005
There+0.005
It+0.005
Neg logit contributions
 elbow-0.011
 knee-0.011
 sister-0.010
 whine-0.010
 boo-0.010
Token a
Token ID257
Feature activation-8.51e-03

Pos logit contributions
 elbow+0.038
 knee+0.038
 sister+0.035
 whine+0.035
 boo+0.034
Neg logit contributions
That-0.012
There-0.011
It-0.011
A-0.011
s-0.011
Token voice
Token ID3809
Feature activation-4.44e-03

Pos logit contributions
That+0.009
It+0.008
s+0.008
This+0.007
A+0.007
Neg logit contributions
 elbow-0.022
 knee-0.021
 sister-0.019
 911-0.019
 whine-0.019
Token spoke
Token ID5158
Feature activation-3.16e-02

Pos logit contributions
That+0.005
s+0.005
This+0.005
rawling+0.005
aw+0.005
Neg logit contributions
 elbow-0.010
 knee-0.010
 sister-0.009
 sorry-0.009
 whine-0.009
Token to
Token ID284
Feature activation+3.85e-03

Pos logit contributions
That+0.036
s+0.036
This+0.033
aw+0.032
It+0.032
Neg logit contributions
 elbow-0.076
 knee-0.073
 sister-0.070
 sorry-0.066
 whine-0.065
Token him
Token ID683
Feature activation+1.04e-03

Pos logit contributions
 elbow+0.009
 knee+0.009
 sister+0.008
 sorry+0.008
 whine+0.007
Neg logit contributions
That-0.005
This-0.005
s-0.005
It-0.004
There-0.004
Token:
Token ID25
Feature activation+9.55e-03

Pos logit contributions
 elbow+0.003
 knee+0.003
 sister+0.003
 sorry+0.002
 whine+0.002
Neg logit contributions
That-0.001
s-0.001
This-0.001
It-0.001
aw-0.001
Token "
Token ID366
Feature activation-3.52e-02

Pos logit contributions
 elbow+0.022
 knee+0.021
 sister+0.020
 sorry+0.019
 whine+0.019
Neg logit contributions
That-0.012
s-0.012
This-0.011
It-0.011
aw-0.011
Token the
Token ID262
Feature activation-1.02e-02

Pos logit contributions
 elbow+0.054
 knee+0.053
 sister+0.050
 whine+0.047
 sorry+0.046
Neg logit contributions
That-0.031
s-0.029
This-0.029
It-0.028
aw-0.027
Token store
Token ID3650
Feature activation+1.78e-02

Pos logit contributions
That+0.015
s+0.014
aw+0.014
This+0.013
ures+0.013
Neg logit contributions
 elbow-0.021
 knee-0.021
 sister-0.021
 sorry-0.019
 whine-0.018
Token manager
Token ID4706
Feature activation+2.96e-02

Pos logit contributions
 elbow+0.042
 knee+0.040
 sister+0.038
 sorry+0.036
 whine+0.035
Neg logit contributions
That-0.022
s-0.022
This-0.020
aw-0.019
It-0.019
Token.
Token ID13
Feature activation+6.98e-03

Pos logit contributions
 elbow+0.078
 knee+0.075
 sister+0.071
 sorry+0.068
 whine+0.067
Neg logit contributions
s-0.027
That-0.027
This-0.024
aw-0.023
It-0.022
Token She
Token ID1375
Feature activation-2.72e-02

Pos logit contributions
 elbow+0.016
 knee+0.016
 sister+0.015
 sorry+0.014
 whine+0.014
Neg logit contributions
That-0.009
s-0.008
This-0.008
It-0.008
aw-0.008
Token couldn
Token ID3521
Feature activation-2.59e-02

Pos logit contributions
That+0.030
s+0.028
This+0.027
aw+0.026
It+0.025
Neg logit contributions
 elbow-0.067
 knee-0.065
 sister-0.062
 sorry-0.059
 whine-0.058
Token't
Token ID470
Feature activation+1.26e-02

Pos logit contributions
That+0.037
s+0.036
aw+0.035
This+0.035
rawling+0.034
Neg logit contributions
 elbow-0.053
 knee-0.052
 sister-0.049
 sorry-0.046
 whine-0.045
Token find
Token ID1064
Feature activation+8.71e-03

Pos logit contributions
 elbow+0.019
 knee+0.018
 sister+0.017
 sorry+0.015
 whine+0.015
Neg logit contributions
That-0.025
s-0.025
This-0.024
It-0.023
There-0.023
Token him
Token ID683
Feature activation+5.36e-03

Pos logit contributions
 elbow+0.017
 knee+0.017
 sister+0.016
 whine+0.015
 sorry+0.014
Neg logit contributions
That-0.013
s-0.013
This-0.013
aw-0.012
It-0.012
Token,
Token ID11
Feature activation+8.38e-03

Pos logit contributions
 elbow+0.013
 knee+0.012
 sister+0.012
 sorry+0.011
 whine+0.011
Neg logit contributions
That-0.006
s-0.006
This-0.006
aw-0.006
It-0.006
Token so
Token ID523
Feature activation+1.06e-02

Pos logit contributions
 elbow+0.020
 knee+0.019
 sister+0.018
 sorry+0.017
 whine+0.017
Neg logit contributions
That-0.010
s-0.010
This-0.009
It-0.009
aw-0.009
Token.
Token ID13
Feature activation+2.53e-02

Pos logit contributions
 elbow+0.111
 seeker+0.104
 Amen+0.099
 liar+0.098
 bru+0.097
Neg logit contributions
s-0.009
.-0.001
!+0.002
+0.003
 and+0.004
Token He
Token ID679
Feature activation-2.39e-02

Pos logit contributions
 elbow+0.069
 knee+0.067
 sister+0.064
 sorry+0.060
 whine+0.060
Neg logit contributions
That-0.022
s-0.021
This-0.019
It-0.018
There-0.017
Token wanted
Token ID2227
Feature activation-1.50e-02

Pos logit contributions
That+0.029
s+0.028
This+0.028
ures+0.027
rawling+0.027
Neg logit contributions
 elbow-0.055
 knee-0.053
 sister-0.051
 sorry-0.047
 boo-0.046
Token to
Token ID284
Feature activation+4.22e-02

Pos logit contributions
s+0.014
That+0.014
This+0.014
It+0.013
A+0.012
Neg logit contributions
 elbow-0.039
 knee-0.037
 sister-0.035
 Amen-0.034
 sorry-0.034
Token try
Token ID1949
Feature activation-1.76e-02

Pos logit contributions
 elbow+0.086
 knee+0.082
 911+0.078
 sister+0.077
 sorry+0.077
Neg logit contributions
This-0.061
That-0.060
It-0.057
There-0.054
A-0.054
Token too
Token ID1165
Feature activation+6.76e-03

Pos logit contributions
s+0.022
That+0.021
This+0.020
It+0.019
aw+0.019
Neg logit contributions
 elbow-0.041
 knee-0.040
 sister-0.038
 sorry-0.036
 whine-0.036
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 elbow+0.015
 knee+0.015
 sister+0.014
 sorry+0.013
 whine+0.013
Neg logit contributions
That-0.009
This-0.008
rawling-0.008
pill-0.008
s-0.007
Token He
Token ID679
Feature activation-2.76e-02

Pos logit contributions
 elbow+0.039
 knee+0.038
 sister+0.036
 sorry+0.034
 whine+0.034
Neg logit contributions
That-0.013
s-0.012
This-0.011
aw-0.011
It-0.010
Token ran
Token ID4966
Feature activation+4.28e-03

Pos logit contributions
That+0.034
s+0.033
This+0.032
aw+0.030
rawling+0.030
Neg logit contributions
 elbow-0.063
 knee-0.061
 sister-0.058
 whine-0.054
 sorry-0.053
Token to
Token ID284
Feature activation+1.94e-02

Pos logit contributions
 elbow+0.010
 knee+0.010
 sister+0.009
 sorry+0.009
 whine+0.008
Neg logit contributions
That-0.005
s-0.005
This-0.005
It-0.005
aw-0.005
Token the
Token ID262
Feature activation-1.64e-02

Pos logit contributions
 elbow+0.052
 knee+0.050
 sister+0.045
 sorry+0.045
 liar+0.044
Neg logit contributions
That-0.017
This-0.017
It-0.016
s-0.015
-0.015
Token!
Token ID0
Feature activation+1.64e-03

Pos logit contributions
That+0.009
s+0.009
This+0.008
There+0.008
It+0.008
Neg logit contributions
 elbow-0.031
 knee-0.031
 sister-0.030
 sorry-0.028
 whine-0.027
Token She
Token ID1375
Feature activation-1.99e-02

Pos logit contributions
 elbow+0.004
 knee+0.004
 sister+0.004
 sorry+0.004
 whine+0.004
Neg logit contributions
That-0.002
s-0.002
This-0.001
It-0.001
aw-0.001
Token asked
Token ID1965
Feature activation+1.70e-02

Pos logit contributions
That+0.024
s+0.023
This+0.022
aw+0.021
There+0.021
Neg logit contributions
 elbow-0.046
 knee-0.045
 sister-0.043
 sorry-0.040
 whine-0.039
Token her
Token ID607
Feature activation+2.69e-02

Pos logit contributions
 elbow+0.041
 knee+0.039
 Amen+0.038
 whine+0.036
belt+0.035
Neg logit contributions
 me-0.018
 that-0.018
s-0.017
That-0.017
-0.016
Token mom
Token ID1995
Feature activation+3.34e-02

Pos logit contributions
 elbow+0.051
 knee+0.043
 whine+0.039
 Amen+0.039
 forehead+0.038
Neg logit contributions
s-0.050
 that-0.046
That-0.045
-0.043
It-0.043
Token what
Token ID644
Feature activation-4.07e-02

Pos logit contributions
 elbow+0.068
 knee+0.064
 sister+0.058
 sorry+0.055
 whine+0.055
Neg logit contributions
s-0.053
That-0.053
This-0.049
aw-0.048
It-0.048
Token it
Token ID340
Feature activation-4.84e-02

Pos logit contributions
That+0.054
s+0.053
This+0.050
aw+0.049
There+0.048
Neg logit contributions
 elbow-0.088
 knee-0.086
 sister-0.082
 sorry-0.077
 whine-0.075
Token was
Token ID373
Feature activation-2.03e-02

Pos logit contributions
aw+0.060
ures+0.056
There+0.056
rawling+0.054
That+0.054
Neg logit contributions
 sister-0.098
 elbow-0.097
 knee-0.095
 Kim-0.088
 boo-0.088
Token called
Token ID1444
Feature activation+5.50e-03

Pos logit contributions
That+0.030
This+0.028
There+0.027
aw+0.027
s+0.027
Neg logit contributions
 knee-0.041
 elbow-0.041
 sister-0.040
 sorry-0.037
 whine-0.035
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 elbow+0.012
 knee+0.012
 sister+0.011
 whine+0.010
 sorry+0.010
Neg logit contributions
That-0.007
s-0.007
This-0.007
aw-0.007
It-0.007
Token Her
Token ID2332
Feature activation+2.99e-02

Pos logit contributions
 elbow+0.030
 knee+0.028
 sister+0.026
 whine+0.025
 911+0.024
Neg logit contributions
s-0.016
That-0.016
This-0.015
It-0.015
-0.014
Token knew
Token ID2993
Feature activation-1.33e-02

Pos logit contributions
That+0.029
s+0.028
This+0.028
ures+0.026
There+0.025
Neg logit contributions
 elbow-0.064
 knee-0.062
 sister-0.060
 whine-0.056
 boo-0.056
Token his
Token ID465
Feature activation+6.81e-03

Pos logit contributions
That+0.012
s+0.011
It+0.011
aw+0.010
 that+0.010
Neg logit contributions
 elbow-0.034
 knee-0.033
 whine-0.032
 sister-0.032
 Amen-0.030
Token way
Token ID835
Feature activation-4.55e-03

Pos logit contributions
 elbow+0.013
 knee+0.011
 911+0.011
 sister+0.011
 whine+0.010
Neg logit contributions
s-0.010
 sky-0.010
That-0.010
 might-0.010
A-0.010
Token home
Token ID1363
Feature activation+1.10e-02

Pos logit contributions
s+0.007
That+0.007
This+0.006
It+0.006
There+0.006
Neg logit contributions
 elbow-0.009
 knee-0.009
 sister-0.008
 sorry-0.008
 whine-0.008
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 elbow+0.026
 knee+0.026
 sister+0.024
 sorry+0.023
 whine+0.023
Neg logit contributions
That-0.013
s-0.012
This-0.011
aw-0.011
It-0.011
Token\n
Token ID198
Feature activation-5.59e-02

Pos logit contributions
That+0.009
aw+0.008
s+0.008
This+0.008
ures+0.008
Neg logit contributions
 knee-0.025
 elbow-0.025
 sister-0.024
 Kim-0.024
 sorry-0.023
Token\n
Token ID198
Feature activation-3.05e-02

Pos logit contributions
That+0.056
aw+0.055
pill+0.054
rawling+0.053
ures+0.053
Neg logit contributions
 knee-0.124
 elbow-0.122
 sister-0.119
 Kim-0.117
 sorry-0.113
TokenAs
Token ID1722
Feature activation+3.88e-03

Pos logit contributions
ures+0.048
pill+0.048
als+0.045
s+0.045
 reefs+0.044
Neg logit contributions
 knee-0.049
 Kim-0.046
 elbow-0.045
 whine-0.044
 sister-0.043
Token Tim
Token ID5045
Feature activation+5.78e-03

Pos logit contributions
 elbow+0.008
 knee+0.008
 sister+0.007
 sorry+0.007
 whine+0.007
Neg logit contributions
That-0.006
s-0.006
This-0.005
It-0.005
There-0.005
Token walked
Token ID6807
Feature activation+8.75e-03

Pos logit contributions
 knee+0.011
 elbow+0.011
 sister+0.010
 whine+0.009
 boo+0.009
Neg logit contributions
That-0.009
ures-0.008
pill-0.008
This-0.008
It-0.008
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
 elbow+0.022
 knee+0.021
 sister+0.019
 boo+0.018
 whine+0.018
Neg logit contributions
s-0.011
That-0.010
This-0.009
There-0.009
It-0.009
Token slip
Token ID13819
Feature activation-1.09e-03

Pos logit contributions
 elbow+0.006
 knee+0.006
 sister+0.006
 Amen+0.005
 whine+0.005
Neg logit contributions
s-0.005
That-0.005
This-0.005
It-0.005
There-0.004
Token again
Token ID757
Feature activation-9.76e-03

Pos logit contributions
That+0.001
s+0.001
This+0.001
aw+0.001
It+0.001
Neg logit contributions
 elbow-0.002
 knee-0.002
 sister-0.002
 sorry-0.002
 whine-0.002
Token.
Token ID13
Feature activation+8.49e-03

Pos logit contributions
That+0.011
s+0.010
This+0.009
aw+0.009
It+0.009
Neg logit contributions
 elbow-0.024
 knee-0.024
 sister-0.023
 sorry-0.021
 whine-0.021
Token He
Token ID679
Feature activation-2.13e-02

Pos logit contributions
 elbow+0.018
 knee+0.018
 sister+0.017
 sorry+0.016
 whine+0.015
Neg logit contributions
That-0.011
s-0.011
aw-0.011
This-0.011
It-0.010
Token also
Token ID635
Feature activation-6.33e-03

Pos logit contributions
That+0.022
s+0.021
This+0.020
aw+0.019
There+0.019
Neg logit contributions
 elbow-0.053
 knee-0.051
 sister-0.049
 sorry-0.046
 whine-0.045
Token learnt
Token ID26338
Feature activation-3.92e-02

Pos logit contributions
That+0.007
s+0.006
This+0.006
aw+0.006
It+0.006
Neg logit contributions
 elbow-0.016
 knee-0.015
 sister-0.015
 sorry-0.014
 whine-0.014
Token that
Token ID326
Feature activation+6.20e-03

Pos logit contributions
That+0.051
s+0.049
This+0.047
aw+0.044
There+0.044
Neg logit contributions
 elbow-0.086
 knee-0.085
 sister-0.080
 sorry-0.076
 911-0.073
Token it
Token ID340
Feature activation-6.21e-02

Pos logit contributions
 elbow+0.015
 knee+0.014
 sister+0.013
 whine+0.013
 forehead+0.012
Neg logit contributions
That-0.008
This-0.007
It-0.007
s-0.007
There-0.006
Token is
Token ID318
Feature activation-2.20e-02

Pos logit contributions
aw+0.072
oph+0.067
That+0.065
This+0.065
There+0.065
Neg logit contributions
 sister-0.141
 knee-0.134
 elbow-0.132
 sorry-0.127
 boo-0.122
Token important
Token ID1593
Feature activation-2.72e-02

Pos logit contributions
That+0.034
s+0.033
This+0.032
aw+0.032
There+0.030
Neg logit contributions
 elbow-0.043
 knee-0.042
 sister-0.040
 sorry-0.038
 whine-0.036
Token to
Token ID284
Feature activation+1.49e-02

Pos logit contributions
That+0.036
s+0.035
This+0.033
aw+0.032
It+0.032
Neg logit contributions
 elbow-0.061
 knee-0.059
 sister-0.056
 sorry-0.052
 whine-0.052
Token face
Token ID1986
Feature activation-1.26e-02

Pos logit contributions
 elbow+0.015
 knee+0.014
 sister+0.013
 911+0.013
 whine+0.012
Neg logit contributions
s-0.015
That-0.014
aw-0.013
This-0.013
 sky-0.013
Token.
Token ID13
Feature activation+4.40e-03

Pos logit contributions
That+0.015
This+0.014
There+0.013
rawling+0.013
It+0.013
Neg logit contributions
 knee-0.028
 elbow-0.028
 sister-0.026
 sorry-0.025
 whine-0.024
Token The
Token ID383
Feature activation+1.21e-03

Pos logit contributions
 elbow+0.009
 knee+0.009
 sister+0.008
 sorry+0.008
 whine+0.008
Neg logit contributions
That-0.006
s-0.006
This-0.006
aw-0.006
It-0.006
Token frog
Token ID21264
Feature activation-8.47e-03

Pos logit contributions
 elbow+0.003
 knee+0.002
 sister+0.002
 whine+0.002
 sorry+0.002
Neg logit contributions
That-0.002
s-0.002
This-0.002
It-0.002
aw-0.002
Token got
Token ID1392
Feature activation+4.78e-03

Pos logit contributions
ures+0.011
This+0.011
That+0.011
There+0.010
rawling+0.010
Neg logit contributions
 sister-0.018
 knee-0.017
 elbow-0.017
 boo-0.015
 sorry-0.015
Token annoyed
Token ID25602
Feature activation-3.75e-02

Pos logit contributions
 elbow+0.011
 knee+0.010
 sister+0.010
 911+0.009
 whine+0.009
Neg logit contributions
s-0.007
That-0.006
It-0.006
This-0.006
aw-0.006
Token and
Token ID290
Feature activation-2.18e-03

Pos logit contributions
That+0.043
This+0.041
s+0.039
There+0.039
It+0.037
Neg logit contributions
 knee-0.087
 elbow-0.087
 sister-0.084
 sorry-0.080
 whine-0.076
Token slapped
Token ID31213
Feature activation+2.95e-03

Pos logit contributions
That+0.003
s+0.002
This+0.002
aw+0.002
It+0.002
Neg logit contributions
 elbow-0.005
 knee-0.005
 sister-0.005
 sorry-0.005
 whine-0.004
Token the
Token ID262
Feature activation-3.83e-03

Pos logit contributions
 elbow+0.007
 knee+0.006
 sister+0.006
 sorry+0.006
 whine+0.006
Neg logit contributions
That-0.004
s-0.004
This-0.004
aw-0.003
It-0.003
Token fly
Token ID6129
Feature activation-5.44e-03

Pos logit contributions
That+0.005
s+0.005
This+0.005
It+0.005
aw+0.005
Neg logit contributions
 elbow-0.008
 knee-0.008
 sister-0.007
 whine-0.007
 sorry-0.007
Token with
Token ID351
Feature activation-1.24e-02

Pos logit contributions
That+0.008
s+0.007
This+0.007
aw+0.007
It+0.007
Neg logit contributions
 elbow-0.012
 knee-0.011
 sister-0.011
 sorry-0.010
 whine-0.010
Token.
Token ID13
Feature activation+6.13e-03

Pos logit contributions
That+0.022
s+0.021
This+0.020
It+0.019
aw+0.018
Neg logit contributions
 elbow-0.052
 knee-0.050
 sister-0.048
 sorry-0.046
 whine-0.045
Token The
Token ID383
Feature activation-3.87e-05

Pos logit contributions
 elbow+0.014
 knee+0.013
 sister+0.013
 sorry+0.012
 whine+0.012
Neg logit contributions
That-0.007
s-0.007
This-0.007
aw-0.007
It-0.006
Token tree
Token ID5509
Feature activation-1.06e-02

Pos logit contributions
That+0.000
This+0.000
s+0.000
It+0.000
 sky+0.000
Neg logit contributions
 elbow-0.000
 knee-0.000
 911-0.000
 whine-0.000
 sister-0.000
Token looked
Token ID3114
Feature activation-9.87e-03

Pos logit contributions
That+0.010
This+0.009
s+0.009
aw+0.008
There+0.008
Neg logit contributions
 elbow-0.027
 knee-0.027
 sister-0.025
 sorry-0.024
 whine-0.024
Token so
Token ID523
Feature activation+4.05e-04

Pos logit contributions
That+0.013
This+0.012
s+0.012
rawling+0.012
ures+0.012
Neg logit contributions
 knee-0.021
 elbow-0.021
 sorry-0.020
 sister-0.020
 boo-0.018
Token happy
Token ID3772
Feature activation-4.29e-02

Pos logit contributions
 sorry+0.001
 knee+0.001
 elbow+0.001
 sister+0.001
 boo+0.001
Neg logit contributions
ures-0.001
That-0.001
This-0.001
pill-0.001
There-0.001
Token!
Token ID0
Feature activation-3.81e-03

Pos logit contributions
That+0.051
This+0.050
There+0.045
s+0.045
rawling+0.044
Neg logit contributions
 elbow-0.097
 knee-0.096
 sister-0.092
 sorry-0.087
 Kim-0.081
Token\n
Token ID198
Feature activation-4.52e-02

Pos logit contributions
That+0.004
s+0.004
aw+0.004
This+0.004
rawling+0.004
Neg logit contributions
 elbow-0.009
 knee-0.009
 sister-0.009
 sorry-0.008
 Kim-0.008
Token\n
Token ID198
Feature activation-2.75e-02

Pos logit contributions
aw+0.040
That+0.037
s+0.037
rawling+0.037
ures+0.035
Neg logit contributions
 elbow-0.106
 knee-0.105
 sister-0.101
 sorry-0.099
 Kim-0.095
TokenB
Token ID33
Feature activation+3.30e-02

Pos logit contributions
s+0.024
aw+0.021
ures+0.020
rawling+0.020
actus+0.019
Neg logit contributions
 elbow-0.066
 knee-0.065
 sister-0.063
 sorry-0.060
 whine-0.058
Tokenub
Token ID549
Feature activation+1.63e-02

Pos logit contributions
 knee+0.059
 elbow+0.059
 sister+0.057
 sorry+0.049
 whine+0.048
Neg logit contributions
That-0.054
This-0.052
s-0.050
aw-0.049
It-0.049