TOP ACTIVATIONS
MAX = 0.101

drove away . Where was the boy going ?
more scared . Where are Tom and Mia ?
everyone . Where ver the old man went
be found . Where had the gr umpy rubber
the house . Where ver she went , the
and went back home . Where ver he looked , he
were never lonely again . Where ver the baby went ,
glad . " Where were you , Ben and
felt sorry and confused . Where was the marble ?
games and sang songs . Where ver they went , they
pay for the damage . Where are your parents ?" Tom
“ Where is my toy , Mom
did the volcano go ? Where did their dinosaurs go ?
Hello ! Where are you going ? ”
olly gasped with surprise . Where did the books he lf
They were very confused . Where did their snow man go
explore the world outside . Where ver he went , Jack
still wears it today . Where ver she goes , it
“ Where ’ s my
very confused and scared . Where did the volcano go ?

MIN ACTIVATIONS
MIN = -0.120

He was a very adventurous sw an .
time , there was a sw an which had a very
, there was a blue sw an . He was swimming
time , there was a sw an swimming in a big
Alex was a very obedient sw an and he always followed
pond , there was a sw an . The sw an
saw a big , graceful sw an swimming in the pond
It was a very beautiful sw an and its feathers were
saw a big , white sw an swimming in the pond
time , there was a sw an named Sam . Sam
, there was a friendly sw an who lived by a
, there was a wee sw an who was very eager
. The girls found a sw an by the river swimming
Suddenly , a big fish sw am over to her .
, there was a little sw an who was different from
p ounce , a graceful sw an named Lily flew down
time there was a fit sw an . She lived near
Suddenly , a friendly dolphin sw am close to her and
a time there was a sw an . His name was
, there was a graceful sw an who liked to spend

INTERVAL 0.046 - 0.101
CONTAINS 4.970%

was still there , waiting for her . She gave him
I watch a video ?" The computer screen suddenly
into the night sky . M olly waved goodbye
they make her happy . One day , Lily
Once upon a time , there was a happy train .

INTERVAL -0.009 - 0.046
CONTAINS 56.946%

with her toys all day long . One day , her
white dress and it fit perfectly ! L ily
out of the oven and put them on a plate .
ink ling , a new star arrived . It spark led
my was happy again and he realized that sometimes conflicts can

INTERVAL -0.064 - -0.009
CONTAINS 37.465%

When they got to Lily 's house , they washed their
As Sam gall oped through the garden , he
'm chewing gum , Tim my ! Want some ?" replied
little girl named Lucy . She was three years old .
to play . Tim my 's turn came and he rolled

INTERVAL -0.120 - -0.064
CONTAINS 0.619%

voice , Don ’
said , Yes ! ”
<|endoftext|> One day , a little rat needed to be healed
time , there was a beautiful princess . She was so
time , there was a little girl . She had a
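
The four INTERVAL ranges above look like an equal-width split of the span from MIN to MAX. The sketch below checks that reading against the displayed boundaries. The equal-width rule and the helper name `equal_width_edges` are assumptions made for illustration; the source does not say how the bins were chosen. Because MIN and MAX are themselves rounded, the computed edges can only be expected to match the displayed ones to roughly three decimal places.

```python
# Sketch: rebuild the INTERVAL boundaries from the rounded MIN and MAX.
# Assumption (not stated in the source): the four bins are equal-width.

def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (hypothetical helper)."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]

MIN_ACT, MAX_ACT = -0.120, 0.101  # MIN and MAX as shown in the dump
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Boundaries as displayed in the INTERVAL headers, lowest first
displayed = [-0.120, -0.064, -0.009, 0.046, 0.101]

# "CONTAINS" percentages, listed from the highest interval to the lowest
shares = [4.970, 56.946, 37.465, 0.619]

for computed, shown in zip(edges, displayed):
    print(f"computed {computed:+.4f}   displayed {shown:+.3f}")
print(f"total share: {sum(shares):.3f}%")
```

If the equal-width reading is right, the four CONTAINS percentages should cover every token exactly once, which is why the sketch also prints their total.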

PER-TOKEN DETAILS

Token drove
Token ID10357
Feature activation-3.33e-02

Pos logit contributions
uch+0.021
aches+0.019
at+0.018
ats+0.018
ade+0.018
Neg logit contributions
 antics-0.021
 infant-0.021
 personality-0.021
 Can-0.020
amb-0.019
Token away
Token ID1497
Feature activation+1.34e-02

Pos logit contributions
 antics+0.052
 infant+0.051
 personality+0.050
amb+0.048
 Can+0.048
Neg logit contributions
uch-0.058
aches-0.051
at-0.048
ade-0.047
ats-0.047
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
uch+0.023
aches+0.020
at+0.020
ats+0.019
ade+0.019
Neg logit contributions
 antics-0.022
 infant-0.021
 personality-0.021
 loudly-0.020
 widely-0.020
Token
Token ID198
Feature activation+5.69e-02

Pos logit contributions
 antics+0.020
 infant+0.019
ening+0.018
rier+0.018
 personality+0.018
Neg logit contributions
uch-0.013
aches-0.012
ons-0.012
-0.012
 am-0.012
Token
Token ID198
Feature activation+3.96e-02

Pos logit contributions
uch+0.085
aches+0.074
at+0.067
asis+0.066
ats+0.065
Neg logit contributions
 antics-0.101
 infant-0.100
 personality-0.099
 Even-0.098
amb-0.097
TokenWhere
Token ID8496
Feature activation+1.01e-01

Pos logit contributions
uch+0.058
aches+0.054
ade+0.049
at+0.049
ats+0.049
Neg logit contributions
 antics-0.071
 infant-0.070
 personality-0.068
 Can-0.064
amb-0.064
Token was
Token ID373
Feature activation-2.61e-02

Pos logit contributions
uch+0.164
aches+0.157
asis+0.152
at+0.143
ats+0.142
Neg logit contributions
 infant-0.147
 loudly-0.143
 personality-0.141
 antics-0.140
 hesitant-0.139
Token the
Token ID262
Feature activation-4.01e-03

Pos logit contributions
 antics+0.048
 infant+0.047
 personality+0.046
 Can+0.044
amb+0.044
Neg logit contributions
uch-0.037
aches-0.034
ade-0.030
at-0.030
ats-0.030
Token boy
Token ID2933
Feature activation+1.37e-02

Pos logit contributions
 infant+0.006
 antics+0.006
 personality+0.006
 Can+0.006
 lear+0.006
Neg logit contributions
uch-0.007
aches-0.007
ats-0.006
at-0.006
ons-0.006
Token going
Token ID1016
Feature activation-2.67e-02

Pos logit contributions
uch+0.023
aches+0.022
ade+0.021
ats+0.021
at+0.020
Neg logit contributions
 infant-0.020
 antics-0.020
 personality-0.019
 Can-0.019
 daughter-0.018
Token?
Token ID30
Feature activation-2.71e-02

Pos logit contributions
 infant+0.043
 antics+0.043
 personality+0.042
amb+0.041
 Can+0.041
Neg logit contributions
uch-0.043
aches-0.040
ade-0.036
ats-0.036
at-0.036
Token more
Token ID517
Feature activation-4.06e-03

Pos logit contributions
 infant+0.035
 antics+0.034
 personality+0.033
 Can+0.030
 loudly+0.029
Neg logit contributions
uch-0.078
aches-0.071
at-0.070
ade-0.068
ats-0.068
Token scared
Token ID12008
Feature activation-2.02e-02

Pos logit contributions
 antics+0.005
 infant+0.005
 personality+0.005
 Can+0.005
amb+0.005
Neg logit contributions
uch-0.008
aches-0.007
ade-0.007
ats-0.007
at-0.007
Token.
Token ID13
Feature activation-2.87e-02

Pos logit contributions
 antics+0.036
 infant+0.035
 personality+0.035
amb+0.033
 Can+0.032
Neg logit contributions
uch-0.029
aches-0.027
ade-0.025
ons-0.025
at-0.025
Token
Token ID198
Feature activation+6.16e-02

Pos logit contributions
 antics+0.041
 infant+0.040
 personality+0.039
 Can+0.037
amb+0.036
Neg logit contributions
uch-0.053
aches-0.050
at-0.046
ats-0.046
ade-0.046
Token
Token ID198
Feature activation+4.47e-02

Pos logit contributions
uch+0.122
at+0.111
aches+0.109
asis+0.105
ade+0.101
Neg logit contributions
 She-0.088
 Even-0.087
 It-0.084
 From-0.077
 As-0.077
TokenWhere
Token ID8496
Feature activation+9.95e-02

Pos logit contributions
uch+0.087
aches+0.083
at+0.082
ade+0.080
asis+0.077
Neg logit contributions
 Can-0.052
 infant-0.051
 loudly-0.050
 antics-0.050
 She-0.050
Token are
Token ID389
Feature activation+7.22e-03

Pos logit contributions
asis+0.176
fire+0.174
uch+0.170
aches+0.167
at+0.156
Neg logit contributions
 It-0.135
 She-0.134
 Even-0.126
 loudly-0.125
 Can-0.122
Token Tom
Token ID4186
Feature activation+1.03e-02

Pos logit contributions
uch+0.010
aches+0.010
ade+0.009
ons+0.008
ats+0.008
Neg logit contributions
 antics-0.013
 personality-0.013
 infant-0.013
amb-0.012
 Can-0.012
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
aches+0.012
uch+0.012
ade+0.011
fire+0.011
bel+0.010
Neg logit contributions
 infant-0.019
 antics-0.018
 Can-0.017
 personality-0.017
 She-0.017
Token Mia
Token ID32189
Feature activation+2.25e-02

Pos logit contributions
 infant+0.021
 antics+0.021
 personality+0.020
 Can+0.019
 lear+0.018
Neg logit contributions
uch-0.029
aches-0.027
at-0.026
ats-0.026
ade-0.025
Token?
Token ID30
Feature activation-2.24e-02

Pos logit contributions
ade+0.035
rob+0.034
uch+0.034
ves+0.033
aches+0.032
Neg logit contributions
 She-0.031
 infant-0.031
 antics-0.029
 Can-0.029
 Please-0.029
Token everyone
Token ID2506
Feature activation+4.57e-02

Pos logit contributions
uch+0.057
aches+0.049
at+0.047
asis+0.047
ats+0.046
Neg logit contributions
 antics-0.045
 infant-0.045
 personality-0.043
 loudly-0.041
 Can-0.040
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
uch+0.065
aches+0.057
at+0.052
ats+0.052
ade+0.051
Neg logit contributions
 infant-0.086
 antics-0.085
 personality-0.083
 Can-0.080
amb-0.078
Token
Token ID220
Feature activation+1.51e-03

Pos logit contributions
 antics+0.017
 infant+0.017
 personality+0.016
 Can+0.015
amb+0.015
Neg logit contributions
uch-0.017
aches-0.015
ons-0.015
at-0.014
ats-0.014
Token
Token ID198
Feature activation+6.64e-02

Pos logit contributions
uch+0.002
aches+0.002
asis+0.002
at+0.002
w+0.002
Neg logit contributions
 Even-0.002
 antics-0.002
 It-0.002
 infant-0.002
 She-0.002
Token
Token ID198
Feature activation+4.49e-02

Pos logit contributions
uch+0.119
aches+0.105
ats+0.100
at+0.099
asis+0.098
Neg logit contributions
 Even-0.102
amb-0.095
 Whoever-0.095
 antics-0.095
 As-0.094
TokenWhere
Token ID8496
Feature activation+9.89e-02

Pos logit contributions
uch+0.064
aches+0.057
ats+0.054
at+0.052
asis+0.051
Neg logit contributions
 antics-0.082
 infant-0.080
 personality-0.078
 Can-0.077
 Even-0.077
Tokenver
Token ID332
Feature activation+2.33e-03

Pos logit contributions
uch+0.159
aches+0.152
asis+0.148
at+0.142
ats+0.140
Neg logit contributions
 loudly-0.141
 hesitant-0.140
 infant-0.139
 personality-0.136
 Even-0.136
Token the
Token ID262
Feature activation-8.27e-03

Pos logit contributions
uch+0.003
aches+0.003
ons+0.002
ats+0.002
ade+0.002
Neg logit contributions
 antics-0.005
 infant-0.005
 personality-0.005
amb-0.004
 Can-0.004
Token old
Token ID1468
Feature activation-1.89e-02

Pos logit contributions
 antics+0.011
 infant+0.011
 personality+0.011
amb+0.010
rily+0.010
Neg logit contributions
uch-0.015
aches-0.014
ade-0.013
ats-0.013
ons-0.013
Token man
Token ID582
Feature activation+3.00e-02

Pos logit contributions
 antics+0.030
 infant+0.030
 personality+0.029
 Can+0.027
amb+0.027
Neg logit contributions
uch-0.032
aches-0.030
at-0.027
ats-0.027
ade-0.027
Token went
Token ID1816
Feature activation-2.07e-02

Pos logit contributions
uch+0.046
aches+0.043
ats+0.039
at+0.039
ade+0.039
Neg logit contributions
 antics-0.050
 infant-0.050
 personality-0.048
 Can-0.047
amb-0.045
Token be
Token ID307
Feature activation-3.66e-03

Pos logit contributions
uch+0.025
ats+0.023
at+0.022
w+0.022
ons+0.022
Neg logit contributions
 antics-0.018
 infant-0.018
 personality-0.017
 alien-0.016
 daughter-0.016
Token found
Token ID1043
Feature activation+1.50e-02

Pos logit contributions
 antics+0.006
 personality+0.006
 hesitant+0.006
 infant+0.006
 loudly+0.005
Neg logit contributions
uch-0.006
aches-0.005
at-0.005
asis-0.005
w-0.005
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
uch+0.026
aches+0.023
at+0.021
ats+0.021
asis+0.021
Neg logit contributions
 infant-0.024
 antics-0.024
 Can-0.023
rily-0.022
 loudly-0.022
Token
Token ID198
Feature activation+6.34e-02

Pos logit contributions
 antics+0.024
 infant+0.024
 personality+0.023
amb+0.022
 Can+0.022
Neg logit contributions
uch-0.020
aches-0.018
ats-0.017
ade-0.017
at-0.017
Token
Token ID198
Feature activation+4.42e-02

Pos logit contributions
uch+0.123
aches+0.110
at+0.105
asis+0.102
w+0.100
Neg logit contributions
 Even-0.094
 It-0.089
 From-0.086
 As-0.083
 He-0.082
TokenWhere
Token ID8496
Feature activation+9.88e-02

Pos logit contributions
uch+0.062
aches+0.056
at+0.052
ats+0.052
asis+0.050
Neg logit contributions
 antics-0.080
 infant-0.078
 personality-0.076
 Can-0.076
 Even-0.075
Token had
Token ID550
Feature activation+1.13e-02

Pos logit contributions
aches+0.160
asis+0.159
uch+0.158
at+0.146
ats+0.140
Neg logit contributions
 loudly-0.144
 hesitant-0.142
 It-0.141
 Even-0.137
 infant-0.133
Token the
Token ID262
Feature activation+4.54e-03

Pos logit contributions
uch+0.018
aches+0.016
ats+0.016
at+0.016
ade+0.015
Neg logit contributions
 infant-0.019
 antics-0.019
 personality-0.018
 Can-0.017
urbed-0.017
Token gr
Token ID1036
Feature activation-4.32e-02

Pos logit contributions
uch+0.008
aches+0.007
ade+0.007
ats+0.007
at+0.007
Neg logit contributions
 antics-0.007
 infant-0.007
 personality-0.007
 Can-0.006
amb-0.006
Tokenumpy
Token ID32152
Feature activation-3.25e-02

Pos logit contributions
 antics+0.042
 infant+0.040
 personality+0.039
 Can+0.037
 lear+0.033
Neg logit contributions
uch-0.099
aches-0.094
ade-0.090
ats-0.089
at-0.088
Token rubber
Token ID14239
Feature activation-7.44e-04

Pos logit contributions
 antics+0.044
amb+0.043
rily+0.043
 personality+0.043
 Can+0.043
Neg logit contributions
uch-0.057
aches-0.052
ade-0.049
 done-0.049
 am-0.049
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
uch+0.003
aches+0.003
at+0.003
ade+0.003
ats+0.003
Neg logit contributions
 infant-0.004
 antics-0.004
 personality-0.004
 Can-0.003
 lear-0.003
Token house
Token ID2156
Feature activation+2.83e-02

Pos logit contributions
uch+0.027
aches+0.026
at+0.024
ats+0.024
ade+0.024
Neg logit contributions
 antics-0.011
 infant-0.011
 personality-0.010
 Can-0.009
amb-0.009
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
uch+0.047
aches+0.042
w+0.039
at+0.039
ats+0.039
Neg logit contributions
 infant-0.046
 antics-0.045
 personality-0.043
 loudly-0.042
rily-0.042
Token
Token ID198
Feature activation+5.85e-02

Pos logit contributions
 antics+0.028
 infant+0.026
 personality+0.026
rily+0.025
urbed+0.025
Neg logit contributions
uch-0.016
aches-0.015
ons-0.014
ade-0.014
 am-0.013
Token
Token ID198
Feature activation+3.49e-02

Pos logit contributions
uch+0.116
aches+0.105
asis+0.099
at+0.096
kins+0.095
Neg logit contributions
 Even-0.081
 She-0.080
 It-0.077
 As-0.074
 curiously-0.072
TokenWhere
Token ID8496
Feature activation+9.43e-02

Pos logit contributions
uch+0.042
aches+0.038
ade+0.034
at+0.034
ats+0.033
Neg logit contributions
 antics-0.072
 infant-0.071
 personality-0.070
 Can-0.067
amb-0.066
Tokenver
Token ID332
Feature activation+2.52e-03

Pos logit contributions
uch+0.148
aches+0.147
asis+0.144
at+0.133
fire+0.131
Neg logit contributions
 infant-0.140
 loudly-0.136
 Can-0.134
 hesitant-0.133
 personality-0.133
Token she
Token ID673
Feature activation+2.04e-02

Pos logit contributions
uch+0.004
aches+0.003
ons+0.003
ats+0.003
ade+0.003
Neg logit contributions
 antics-0.005
 infant-0.004
 personality-0.004
amb-0.004
 Can-0.004
Token went
Token ID1816
Feature activation-2.61e-02

Pos logit contributions
aches+0.035
uch+0.034
ats+0.032
at+0.032
asis+0.031
Neg logit contributions
 infant-0.028
 antics-0.027
 personality-0.027
 Can-0.026
amb-0.026
Token,
Token ID11
Feature activation-7.08e-03

Pos logit contributions
 infant+0.041
 antics+0.039
 personality+0.038
 Can+0.038
amb+0.036
Neg logit contributions
uch-0.044
aches-0.041
at-0.038
ats-0.038
ade-0.038
Token the
Token ID262
Feature activation-3.22e-04

Pos logit contributions
 infant+0.011
 antics+0.011
 personality+0.011
 Can+0.010
 daughter+0.010
Neg logit contributions
uch-0.012
aches-0.011
at-0.011
ats-0.011
ade-0.010
Token and
Token ID290
Feature activation-1.51e-02

Pos logit contributions
 antics+0.032
 infant+0.030
 personality+0.028
amb+0.028
rily+0.027
Neg logit contributions
uch-0.034
aches-0.033
 done-0.033
 last-0.031
aster-0.030
Token went
Token ID1816
Feature activation-2.41e-02

Pos logit contributions
 antics+0.025
 infant+0.025
 personality+0.024
 Can+0.023
amb+0.023
Neg logit contributions
uch-0.024
aches-0.022
at-0.020
ats-0.020
ade-0.020
Token back
Token ID736
Feature activation+8.62e-03

Pos logit contributions
 antics+0.040
 infant+0.039
 personality+0.038
 Can+0.036
amb+0.036
Neg logit contributions
uch-0.039
aches-0.036
at-0.033
ats-0.033
ade-0.033
Token home
Token ID1363
Feature activation+3.13e-02

Pos logit contributions
uch+0.016
aches+0.015
ade+0.014
at+0.014
ons+0.014
Neg logit contributions
 antics-0.012
 infant-0.012
 personality-0.012
amb-0.011
 Can-0.011
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
uch+0.050
aches+0.045
at+0.043
ade+0.042
ats+0.042
Neg logit contributions
 antics-0.053
 infant-0.052
 personality-0.051
 widely-0.048
amb-0.047
Token Where
Token ID6350
Feature activation+9.42e-02

Pos logit contributions
 antics+0.024
 infant+0.024
 personality+0.023
 lear+0.021
amb+0.021
Neg logit contributions
uch-0.016
aches-0.014
ons-0.013
at-0.013
 am-0.013
Tokenver
Token ID332
Feature activation+6.41e-03

Pos logit contributions
uch+0.141
asis+0.137
aches+0.136
at+0.131
fire+0.128
Neg logit contributions
 loudly-0.139
 hesitant-0.138
 infant-0.135
 It-0.134
 personality-0.131
Token he
Token ID339
Feature activation+3.03e-02

Pos logit contributions
uch+0.009
aches+0.008
ats+0.007
ons+0.007
ade+0.007
Neg logit contributions
 antics-0.012
 infant-0.012
 personality-0.012
amb-0.011
 Can-0.011
Token looked
Token ID3114
Feature activation-2.74e-02

Pos logit contributions
uch+0.053
aches+0.052
ats+0.048
ons+0.048
ade+0.047
Neg logit contributions
 antics-0.043
 personality-0.041
 infant-0.041
 Can-0.041
amb-0.039
Token,
Token ID11
Feature activation-1.19e-02

Pos logit contributions
 infant+0.040
 antics+0.038
 personality+0.038
 Can+0.038
 loudly+0.037
Neg logit contributions
uch-0.050
aches-0.046
at-0.043
ade-0.042
ons-0.042
Token he
Token ID339
Feature activation+1.64e-02

Pos logit contributions
 infant+0.019
 antics+0.019
 personality+0.019
 Can+0.018
 loudly+0.017
Neg logit contributions
uch-0.020
aches-0.018
at-0.017
ats-0.017
ade-0.017
Token were
Token ID547
Feature activation-2.19e-02

Pos logit contributions
uch+0.013
aches+0.012
ade+0.011
ats+0.011
asis+0.011
Neg logit contributions
 infant-0.013
 personality-0.013
 antics-0.013
amb-0.012
 lear-0.012
Token never
Token ID1239
Feature activation+1.29e-02

Pos logit contributions
 infant+0.034
 antics+0.034
 personality+0.032
amb+0.030
 Can+0.030
Neg logit contributions
uch-0.037
aches-0.034
ade-0.032
at-0.031
ats-0.031
Token lonely
Token ID21757
Feature activation-9.02e-05

Pos logit contributions
uch+0.024
aches+0.022
ats+0.021
at+0.021
ade+0.021
Neg logit contributions
 antics-0.018
 infant-0.018
 personality-0.017
 Can-0.016
amb-0.016
Token again
Token ID757
Feature activation+3.76e-02

Pos logit contributions
 antics+0.000
 infant+0.000
 personality+0.000
amb+0.000
istent+0.000
Neg logit contributions
uch-0.000
aches-0.000
ade-0.000
 done-0.000
at-0.000
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
uch+0.060
aches+0.054
at+0.051
ade+0.051
ats+0.049
Neg logit contributions
 antics-0.066
 infant-0.062
 personality-0.062
 Can-0.059
 loudly-0.059
Token Where
Token ID6350
Feature activation+9.37e-02

Pos logit contributions
 antics+0.024
 infant+0.024
 personality+0.023
 Can+0.022
amb+0.021
Neg logit contributions
uch-0.041
aches-0.039
at-0.036
ats-0.036
ade-0.036
Tokenver
Token ID332
Feature activation+7.62e-03

Pos logit contributions
uch+0.143
asis+0.140
at+0.133
aches+0.131
aster+0.128
Neg logit contributions
 infant-0.140
 hesitant-0.139
 loudly-0.138
 personality-0.134
 It-0.133
Token the
Token ID262
Feature activation-4.60e-03

Pos logit contributions
uch+0.010
aches+0.009
ats+0.008
ade+0.008
at+0.008
Neg logit contributions
 antics-0.015
 infant-0.015
 personality-0.015
 Can-0.014
amb-0.014
Token baby
Token ID5156
Feature activation+3.28e-03

Pos logit contributions
 antics+0.005
 personality+0.004
 infant+0.004
amb+0.004
 Can+0.004
Neg logit contributions
uch-0.010
aches-0.009
ade-0.009
asis-0.009
ons-0.009
Token went
Token ID1816
Feature activation-1.44e-02

Pos logit contributions
uch+0.005
aches+0.005
ade+0.005
ats+0.005
at+0.005
Neg logit contributions
 infant-0.005
 antics-0.005
 personality-0.005
 Can-0.005
amb-0.005
Token,
Token ID11
Feature activation-5.54e-03

Pos logit contributions
 antics+0.022
 infant+0.022
 personality+0.021
 Can+0.021
amb+0.020
Neg logit contributions
uch-0.025
aches-0.023
ats-0.021
at-0.021
ade-0.021
Token glad
Token ID9675
Feature activation+9.27e-03

Pos logit contributions
 antics+0.016
 infant+0.016
 personality+0.015
 Can+0.014
amb+0.013
Neg logit contributions
uch-0.033
aches-0.031
at-0.030
ade-0.030
ats-0.029
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
uch+0.012
aches+0.011
at+0.010
ats+0.010
ade+0.010
Neg logit contributions
 antics-0.018
 infant-0.018
 personality-0.017
 Can-0.017
amb-0.016
Token
Token ID198
Feature activation+6.96e-02

Pos logit contributions
 antics+0.046
 infant+0.043
istent+0.040
urbed+0.040
 personality+0.039
Neg logit contributions
uch-0.056
 am-0.055
aches-0.052
ade-0.051
ats-0.051
Token
Token ID198
Feature activation+5.06e-02

Pos logit contributions
uch+0.153
aches+0.133
at+0.132
asis+0.125
w+0.121
Neg logit contributions
 Even-0.090
 It-0.090
 From-0.088
 She-0.087
 They-0.085
Token"
Token ID1
Feature activation+2.24e-02

Pos logit contributions
uch+0.115
aches+0.106
at+0.105
asis+0.100
ats+0.096
Neg logit contributions
 Can-0.051
 She-0.047
 Even-0.046
amb-0.044
 loudly-0.043
TokenWhere
Token ID8496
Feature activation+9.33e-02

Pos logit contributions
uch+0.037
aches+0.034
ats+0.032
at+0.032
ade+0.031
Neg logit contributions
 antics-0.036
 infant-0.036
 personality-0.035
 Can-0.033
amb-0.033
Token were
Token ID547
Feature activation-6.26e-03

Pos logit contributions
aches+0.148
asis+0.147
fire+0.143
at+0.139
uch+0.133
Neg logit contributions
 Can-0.137
 It-0.131
 infant-0.131
 personality-0.127
 daughter-0.124
Token you
Token ID345
Feature activation+3.70e-03

Pos logit contributions
 infant+0.011
 antics+0.011
 personality+0.010
 Can+0.010
amb+0.010
Neg logit contributions
uch-0.009
aches-0.009
at-0.008
ade-0.008
ats-0.008
Token,
Token ID11
Feature activation-1.52e-02

Pos logit contributions
uch+0.006
aches+0.006
ade+0.006
ats+0.006
at+0.006
Neg logit contributions
 antics-0.006
 infant-0.005
 personality-0.005
 Can-0.005
amb-0.005
Token Ben
Token ID3932
Feature activation-8.54e-03

Pos logit contributions
 antics+0.024
amb+0.023
istent+0.023
 personality+0.022
urbed+0.021
Neg logit contributions
aches-0.023
 last-0.022
 done-0.022
uch-0.022
kins-0.021
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
 infant+0.016
 antics+0.015
 personality+0.015
 Can+0.015
amb+0.014
Neg logit contributions
uch-0.012
aches-0.011
ade-0.010
ats-0.010
at-0.010
Token felt
Token ID2936
Feature activation-3.71e-02

Pos logit contributions
uch+0.027
aches+0.024
at+0.023
ade+0.023
ats+0.023
Neg logit contributions
 antics-0.031
 personality-0.031
 infant-0.031
 Can-0.029
 lear-0.028
Token sorry
Token ID7926
Feature activation-2.36e-02

Pos logit contributions
 infant+0.040
 antics+0.038
 personality+0.038
 Can+0.035
 loudly+0.033
Neg logit contributions
uch-0.084
aches-0.077
at-0.077
ats-0.075
ade-0.074
Token and
Token ID290
Feature activation-2.03e-02

Pos logit contributions
 antics+0.048
 infant+0.046
 personality+0.046
amb+0.044
arted+0.043
Neg logit contributions
uch-0.028
aches-0.026
ade-0.024
 am-0.024
 done-0.023
Token confused
Token ID10416
Feature activation-2.53e-03

Pos logit contributions
 antics+0.023
 infant+0.022
 personality+0.021
amb+0.021
 Can+0.020
Neg logit contributions
uch-0.042
aches-0.041
ade-0.038
 done-0.038
ats-0.037
Token.
Token ID13
Feature activation-2.28e-02

Pos logit contributions
 antics+0.005
 infant+0.005
 personality+0.005
 Can+0.005
amb+0.005
Neg logit contributions
uch-0.003
aches-0.003
ade-0.002
ats-0.002
at-0.002
Token Where
Token ID6350
Feature activation+9.25e-02

Pos logit contributions
 antics+0.045
 infant+0.042
 personality+0.041
urbed+0.041
istent+0.040
Neg logit contributions
uch-0.028
 am-0.026
aches-0.024
ons-0.023
ade-0.023
Token was
Token ID373
Feature activation-1.10e-02

Pos logit contributions
asis+0.163
fire+0.149
at+0.149
aches+0.148
uch+0.148
Neg logit contributions
 It-0.133
 She-0.124
 Can-0.118
 Even-0.117
 hesitant-0.116
Token the
Token ID262
Feature activation+2.62e-03

Pos logit contributions
 antics+0.020
 personality+0.019
 infant+0.019
amb+0.018
 Can+0.018
Neg logit contributions
uch-0.016
aches-0.015
ade-0.013
ons-0.013
at-0.013
Token marble
Token ID30623
Feature activation+3.78e-02

Pos logit contributions
uch+0.005
aches+0.004
ats+0.004
at+0.004
ons+0.004
Neg logit contributions
 antics-0.004
 infant-0.004
 personality-0.004
 Can-0.004
 lear-0.003
Token?
Token ID30
Feature activation-2.30e-02

Pos logit contributions
uch+0.051
ats+0.050
aches+0.050
at+0.048
str+0.047
Neg logit contributions
 infant-0.062
 Can-0.057
 antics-0.057
 alien-0.054
 She-0.054
Token
Token ID198
Feature activation+6.59e-02

Pos logit contributions
 antics+0.045
 infant+0.044
 personality+0.043
urbed+0.040
amb+0.040
Neg logit contributions
uch-0.030
aches-0.026
 am-0.024
ons-0.024
ade-0.024
Token games
Token ID1830
Feature activation-5.70e-03

Pos logit contributions
 antics+0.063
 infant+0.063
 personality+0.061
 Can+0.058
amb+0.057
Neg logit contributions
uch-0.055
aches-0.053
ade-0.048
ats-0.048
at-0.047
Token and
Token ID290
Feature activation-3.19e-03

Pos logit contributions
 antics+0.006
 infant+0.006
 personality+0.006
 lear+0.006
 loudly+0.005
Neg logit contributions
uch-0.013
aches-0.012
at-0.011
w-0.011
asis-0.011
Token sang
Token ID25889
Feature activation-3.02e-02

Pos logit contributions
 antics+0.006
 infant+0.006
 personality+0.006
 Can+0.005
amb+0.005
Neg logit contributions
uch-0.004
aches-0.004
at-0.004
ade-0.004
ats-0.004
Token songs
Token ID7259
Feature activation+1.12e-02

Pos logit contributions
 infant+0.044
 antics+0.044
 personality+0.042
 Can+0.041
amb+0.040
Neg logit contributions
uch-0.050
aches-0.049
ade-0.046
ats-0.046
ons-0.045
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
uch+0.019
aches+0.017
at+0.016
ats+0.016
ade+0.016
Neg logit contributions
 antics-0.018
 infant-0.017
 personality-0.017
 Can-0.016
amb-0.016
Token Where
Token ID6350
Feature activation+9.14e-02

Pos logit contributions
 antics+0.013
 infant+0.013
 personality+0.012
amb+0.011
rily+0.011
Neg logit contributions
uch-0.023
aches-0.021
ats-0.021
at-0.020
ons-0.020
Tokenver
Token ID332
Feature activation+7.53e-03

Pos logit contributions
uch+0.139
asis+0.133
aches+0.127
at+0.127
aster+0.118
Neg logit contributions
 loudly-0.139
 hesitant-0.136
 infant-0.136
 personality-0.135
 It-0.134
Token they
Token ID484
Feature activation+3.14e-02

Pos logit contributions
uch+0.010
aches+0.010
at+0.009
ade+0.009
ats+0.009
Neg logit contributions
 antics-0.014
 infant-0.014
 personality-0.014
 Can-0.013
amb-0.013
Token went
Token ID1816
Feature activation-1.38e-02

Pos logit contributions
uch+0.057
aches+0.053
ats+0.052
at+0.051
ons+0.049
Neg logit contributions
 infant-0.042
 antics-0.041
 personality-0.041
 Can-0.040
arted-0.038
Token,
Token ID11
Feature activation-1.97e-03

Pos logit contributions
 antics+0.021
 infant+0.021
 personality+0.020
 Can+0.020
amb+0.019
Neg logit contributions
uch-0.023
aches-0.022
at-0.020
ats-0.020
ade-0.020
Token they
Token ID484
Feature activation+1.40e-02

Pos logit contributions
 antics+0.004
 infant+0.004
 personality+0.003
 Can+0.003
amb+0.003
Neg logit contributions
uch-0.003
aches-0.003
at-0.002
ats-0.002
ade-0.002
Token pay
Token ID1414
Feature activation+3.55e-03

Pos logit contributions
uch+0.043
aches+0.038
at+0.038
ons+0.038
ats+0.037
Neg logit contributions
 antics-0.043
 infant-0.042
 personality-0.042
amb-0.038
 lear-0.038
Token for
Token ID329
Feature activation+4.67e-02

Pos logit contributions
uch+0.008
asis+0.007
str+0.007
aches+0.007
at+0.007
Neg logit contributions
 infant-0.004
 loudly-0.004
 daughter-0.004
 Can-0.003
 personality-0.003
Token the
Token ID262
Feature activation+2.12e-02

Pos logit contributions
uch+0.086
at+0.079
ade+0.077
asis+0.076
ats+0.073
Neg logit contributions
 infant-0.070
 antics-0.067
 personality-0.064
 Can-0.062
 daughter-0.060
Token damage
Token ID2465
Feature activation+5.53e-02

Pos logit contributions
uch+0.039
aches+0.035
at+0.035
ons+0.034
ats+0.034
Neg logit contributions
 infant-0.033
 antics-0.032
 personality-0.031
 lear-0.029
 Can-0.029
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
uch+0.109
aches+0.103
asis+0.097
at+0.095
ade+0.095
Neg logit contributions
 infant-0.071
 antics-0.065
 Can-0.065
 personality-0.064
 loudly-0.060
Token Where
Token ID6350
Feature activation+9.13e-02

Pos logit contributions
 antics+0.041
rily+0.034
urbed+0.033
arted+0.033
ening+0.033
Neg logit contributions
uch-0.068
 am-0.062
aster-0.060
w-0.059
ons-0.058
Token are
Token ID389
Feature activation+2.08e-02

Pos logit contributions
asis+0.163
fire+0.157
side+0.148
 distance+0.145
at+0.141
Neg logit contributions
 It-0.121
 Can-0.116
 Give-0.113
 infant-0.112
 Go-0.111
Token your
Token ID534
Feature activation-1.42e-02

Pos logit contributions
uch+0.034
at+0.031
asis+0.030
ats+0.030
ade+0.029
Neg logit contributions
 infant-0.034
 antics-0.031
 Can-0.031
 personality-0.030
 adventurers-0.028
Token parents
Token ID3397
Feature activation+4.37e-02

Pos logit contributions
 infant+0.019
 antics+0.019
 personality+0.019
 daughter+0.017
 lear+0.016
Neg logit contributions
uch-0.028
aches-0.025
at-0.025
ade-0.025
ons-0.024
Token?"
Token ID1701
Feature activation-1.26e-02

Pos logit contributions
ade+0.069
aches+0.067
ves+0.066
at+0.065
uch+0.065
Neg logit contributions
 infant-0.063
 Can-0.058
 personality-0.057
 Please-0.057
 antics-0.056
Token Tom
Token ID4186
Feature activation-1.39e-02

Pos logit contributions
 antics+0.020
 Can+0.020
 infant+0.020
 personality+0.019
 Even+0.019
Neg logit contributions
uch-0.021
aches-0.019
at-0.019
ats-0.018
asis-0.018
Token
Token ID198
Feature activation+4.64e-02

Pos logit contributions
uch+0.099
aches+0.088
at+0.082
asis+0.081
ats+0.080
Neg logit contributions
 antics-0.105
 Even-0.105
 personality-0.105
amb-0.104
 infant-0.102
Tokenâ
Token ID22940
Feature activation-6.19e-02

Pos logit contributions
uch+0.068
aches+0.064
ade+0.058
at+0.057
ats+0.057
Neg logit contributions
 antics-0.083
 infant-0.082
 personality-0.080
 Can-0.076
amb-0.075
Token
Token ID26391
Feature activation-4.78e-02

Pos logit contributions
 antics+0.100
 infant+0.098
 personality+0.095
 Can+0.091
amb+0.089
Neg logit contributions
uch-0.102
aches-0.094
at-0.088
ats-0.087
ade-0.086
Token
Token ID129
Feature activation-6.78e-02

Pos logit contributions
 antics+0.070
 personality+0.067
 infant+0.065
 Can+0.063
amb+0.062
Neg logit contributions
uch-0.083
aches-0.082
ade-0.075
at-0.074
ats-0.071
Token
Token ID241
Feature activation+1.83e-02

Pos logit contributions
 infant+0.105
 antics+0.103
 Can+0.102
 personality+0.101
 adventurers+0.095
Neg logit contributions
uch-0.114
aches-0.110
ats-0.102
at-0.101
 done-0.100
TokenWhere
Token ID8496
Feature activation+9.12e-02

Pos logit contributions
uch+0.027
 am+0.024
aches+0.024
w+0.023
ats+0.022
Neg logit contributions
 antics-0.032
 infant-0.032
 personality-0.029
rily-0.029
 adventurers-0.029
Token is
Token ID318
Feature activation-4.57e-03

Pos logit contributions
uch+0.127
asis+0.125
aches+0.122
at+0.117
fire+0.116
Neg logit contributions
 personality-0.148
 infant-0.147
 Can-0.146
 hesitant-0.143
 It-0.142
Token my
Token ID616
Feature activation-1.48e-02

Pos logit contributions
 antics+0.009
 personality+0.008
amb+0.008
istent+0.008
 infant+0.008
Neg logit contributions
uch-0.006
aches-0.006
ons-0.006
ade-0.005
 done-0.005
Token toy
Token ID13373
Feature activation+2.85e-02

Pos logit contributions
 antics+0.023
 infant+0.022
 personality+0.022
amb+0.021
istent+0.021
Neg logit contributions
uch-0.024
aches-0.022
ade-0.021
ats-0.020
 am-0.020
Token,
Token ID11
Feature activation-4.83e-03

Pos logit contributions
uch+0.038
aches+0.037
ats+0.034
at+0.033
ade+0.032
Neg logit contributions
 infant-0.054
 antics-0.051
 Can-0.051
 personality-0.050
 alien-0.048
Token Mom
Token ID11254
Feature activation+1.96e-02

Pos logit contributions
 antics+0.007
 infant+0.007
 personality+0.007
amb+0.006
urbed+0.006
Neg logit contributions
uch-0.008
aches-0.008
ade-0.007
ats-0.007
ons-0.007
Token did
Token ID750
Feature activation-7.38e-03

Pos logit contributions
asis+0.167
at+0.150
uch+0.148
aches+0.148
fire+0.146
Neg logit contributions
 It-0.129
 She-0.116
 loudly-0.115
 Can-0.111
 Even-0.111
Token the
Token ID262
Feature activation+7.35e-03

Pos logit contributions
 infant+0.013
 antics+0.013
 personality+0.012
 Can+0.012
 loudly+0.011
Neg logit contributions
uch-0.011
aches-0.010
at-0.010
ats-0.009
ade-0.009
Token volcano
Token ID32410
Feature activation+7.01e-02

Pos logit contributions
uch+0.012
aches+0.011
ats+0.011
at+0.010
ade+0.010
Neg logit contributions
 infant-0.013
 antics-0.013
 Can-0.012
 personality-0.012
 loudly-0.011
Token go
Token ID467
Feature activation+3.86e-03

Pos logit contributions
ats+0.116
uch+0.114
aches+0.111
at+0.108
live+0.104
Neg logit contributions
 infant-0.096
 Can-0.095
 loudly-0.091
 hesitant-0.087
 He-0.087
Token?
Token ID30
Feature activation-3.04e-02

Pos logit contributions
uch+0.006
aches+0.005
ats+0.005
at+0.005
ade+0.005
Neg logit contributions
 infant-0.007
 Can-0.007
 She-0.006
 loudly-0.006
 antics-0.006
Token Where
Token ID6350
Feature activation+9.08e-02

Pos logit contributions
 antics+0.041
 infant+0.038
 personality+0.038
urbed+0.035
heart+0.035
Neg logit contributions
uch-0.057
aches-0.051
 am-0.050
ons-0.049
ade-0.049
Token did
Token ID750
Feature activation-6.22e-03

Pos logit contributions
asis+0.171
at+0.156
aches+0.152
uch+0.150
fire+0.148
Neg logit contributions
 It-0.124
 loudly-0.110
 Can-0.109
 She-0.107
 hesitant-0.104
Token their
Token ID511
Feature activation-9.70e-03

Pos logit contributions
 infant+0.010
 antics+0.010
 personality+0.009
 Can+0.009
 loudly+0.009
Neg logit contributions
uch-0.011
aches-0.009
at-0.009
ats-0.009
ade-0.009
Token dinosaurs
Token ID27199
Feature activation+4.94e-02

Pos logit contributions
 infant+0.014
 antics+0.014
 personality+0.013
 Can+0.012
 loudly+0.012
Neg logit contributions
uch-0.018
aches-0.017
at-0.016
ats-0.016
ade-0.016
Token go
Token ID467
Feature activation-1.33e-03

Pos logit contributions
ats+0.070
uch+0.069
aches+0.067
at+0.066
rob+0.065
Neg logit contributions
 infant-0.071
 antics-0.069
 loudly-0.068
 Can-0.067
 She-0.067
Token?
Token ID30
Feature activation-2.65e-02

Pos logit contributions
 infant+0.003
 Can+0.002
 She+0.002
 alien+0.002
 loudly+0.002
Neg logit contributions
uch-0.002
aches-0.002
ats-0.002
at-0.001
ade-0.001
Token
Token ID26391
Feature activation-5.60e-02

Pos logit contributions
 antics+0.087
 infant+0.086
 personality+0.083
amb+0.081
 Can+0.078
Neg logit contributions
uch-0.091
aches-0.089
ade-0.082
ats-0.081
ons-0.081
Token
Token ID129
Feature activation-6.71e-02

Pos logit contributions
 antics+0.091
 personality+0.088
amb+0.086
 infant+0.085
 Even+0.085
Neg logit contributions
aches-0.084
uch-0.083
ade-0.078
at-0.075
ons-0.072
Token
Token ID241
Feature activation+2.24e-02

Pos logit contributions
 antics+0.104
 infant+0.103
 personality+0.100
 Can+0.097
heart+0.092
Neg logit contributions
uch-0.116
aches-0.109
at-0.101
ats-0.099
 done-0.098
TokenHello
Token ID15496
Feature activation+7.90e-03

Pos logit contributions
uch+0.033
aches+0.030
at+0.027
ats+0.027
ade+0.027
Neg logit contributions
 antics-0.040
 infant-0.039
 personality-0.038
 Can-0.036
amb-0.036
Token!
Token ID0
Feature activation-1.50e-02

Pos logit contributions
uch+0.012
aches+0.011
ats+0.010
ade+0.010
at+0.010
Neg logit contributions
 antics-0.014
 infant-0.013
 personality-0.013
amb-0.012
 Can-0.012
Token Where
Token ID6350
Feature activation+9.06e-02

Pos logit contributions
 antics+0.019
 infant+0.017
 personality+0.016
rily+0.016
iously+0.015
Neg logit contributions
uch-0.030
 am-0.027
aches-0.027
at-0.026
ons-0.026
Token are
Token ID389
Feature activation+1.32e-02

Pos logit contributions
asis+0.120
fire+0.118
uch+0.116
at+0.110
aches+0.108
Neg logit contributions
 personality-0.147
 infant-0.146
 It-0.146
 Can-0.145
 hesitant-0.140
Token you
Token ID345
Feature activation+9.85e-03

Pos logit contributions
uch+0.018
aches+0.017
ade+0.015
at+0.015
ats+0.015
Neg logit contributions
 antics-0.025
 infant-0.024
 personality-0.024
 Can-0.023
amb-0.023
Token going
Token ID1016
Feature activation-1.78e-02

Pos logit contributions
uch+0.011
aches+0.010
ats+0.009
ade+0.009
at+0.009
Neg logit contributions
 antics-0.021
 infant-0.021
 personality-0.020
 Can-0.019
amb-0.019
Token?
Token ID30
Feature activation-1.80e-02

Pos logit contributions
 infant+0.033
 antics+0.032
 Can+0.032
 personality+0.031
amb+0.031
Neg logit contributions
uch-0.025
aches-0.023
ats-0.020
ade-0.020
ons-0.020
Tokenâ
Token ID22940
Feature activation-5.45e-02

Pos logit contributions
 antics+0.025
 infant+0.022
 personality+0.022
rily+0.021
ening+0.021
Neg logit contributions
uch-0.033
 am-0.031
ons-0.030
aches-0.029
at-0.028
Tokenolly
Token ID5098
Feature activation-2.78e-02

Pos logit contributions
 antics+0.018
 infant+0.018
 personality+0.017
 Can+0.016
amb+0.016
Neg logit contributions
uch-0.020
aches-0.019
ade-0.018
at-0.017
ats-0.017
Token gasped
Token ID45236
Feature activation-4.52e-02

Pos logit contributions
 antics+0.058
 infant+0.058
 personality+0.056
 Can+0.055
amb+0.054
Neg logit contributions
uch-0.029
aches-0.026
 done-0.025
ade-0.024
at-0.023
Token with
Token ID351
Feature activation+1.81e-02

Pos logit contributions
 personality+0.103
 antics+0.102
 Whoever+0.097
amb+0.096
istent+0.096
Neg logit contributions
aches-0.028
uch-0.028
kins-0.028
he-0.027
 last-0.026
Token surprise
Token ID5975
Feature activation+2.92e-03

Pos logit contributions
uch+0.046
aches+0.042
at+0.042
ats+0.041
asis+0.041
Neg logit contributions
 infant-0.016
 personality-0.015
 antics-0.015
 widely-0.013
 Can-0.013
Token.
Token ID13
Feature activation-2.73e-02

Pos logit contributions
uch+0.004
aches+0.004
ats+0.003
ade+0.003
ons+0.003
Neg logit contributions
 antics-0.005
 infant-0.005
 personality-0.005
 Can-0.005
amb-0.005
Token Where
Token ID6350
Feature activation+9.06e-02

Pos logit contributions
 antics+0.033
 cowboy+0.029
amb+0.028
 infant+0.027
 alien+0.026
Neg logit contributions
-0.049
 am-0.048
uch-0.047
ade-0.047
 done-0.045
Token did
Token ID750
Feature activation-7.91e-03

Pos logit contributions
asis+0.153
uch+0.142
aches+0.138
at+0.138
fire+0.136
Neg logit contributions
 It-0.133
 She-0.127
 hesitant-0.127
 loudly-0.125
 Even-0.123
Token the
Token ID262
Feature activation-7.68e-04

Pos logit contributions
 infant+0.013
 antics+0.013
 personality+0.012
 Can+0.012
urbed+0.011
Neg logit contributions
uch-0.013
aches-0.012
at-0.011
ats-0.011
ade-0.011
Token books
Token ID3835
Feature activation+5.09e-02

Pos logit contributions
 antics+0.001
 infant+0.001
 personality+0.001
 Can+0.001
 lear+0.001
Neg logit contributions
uch-0.001
aches-0.001
ats-0.001
at-0.001
ade-0.001
Tokenhe
Token ID258
Feature activation+9.19e-03

Pos logit contributions
uch+0.057
ats+0.054
aches+0.053
at+0.048
str+0.046
Neg logit contributions
 She-0.095
 infant-0.095
 Can-0.094
 It-0.091
 antics-0.091
Tokenlf
Token ID1652
Feature activation+5.08e-02

Pos logit contributions
uch+0.013
aches+0.011
ats+0.010
ade+0.010
at+0.010
Neg logit contributions
 infant-0.017
 antics-0.017
 personality-0.017
 Can-0.016
 daughter-0.016
TokenThey
Token ID2990
Feature activation+1.60e-02

Pos logit contributions
uch+0.087
aches+0.082
at+0.079
asis+0.076
ats+0.074
Neg logit contributions
 Can-0.065
 antics-0.062
 infant-0.062
 personality-0.062
 Even-0.062
Token were
Token ID547
Feature activation-1.84e-02

Pos logit contributions
uch+0.024
aches+0.022
at+0.020
ats+0.020
ade+0.020
Neg logit contributions
 antics-0.028
 infant-0.028
 personality-0.027
 Can-0.025
amb-0.025
Token very
Token ID845
Feature activation-2.17e-02

Pos logit contributions
 antics+0.024
 infant+0.023
 personality+0.022
 Can+0.021
amb+0.021
Neg logit contributions
uch-0.036
aches-0.034
ats-0.032
at-0.032
ade-0.032
Token confused
Token ID10416
Feature activation-1.49e-02

Pos logit contributions
 antics+0.022
 infant+0.021
 personality+0.020
 Can+0.018
amb+0.018
Neg logit contributions
uch-0.049
aches-0.046
at-0.044
ats-0.044
ade-0.044
Token.
Token ID13
Feature activation-4.40e-02

Pos logit contributions
 antics+0.031
 infant+0.031
 personality+0.030
 Can+0.029
amb+0.029
Neg logit contributions
uch-0.017
aches-0.015
ade-0.014
ats-0.014
at-0.013
Token Where
Token ID6350
Feature activation+9.06e-02

Pos logit contributions
 antics+0.057
heart+0.053
urbed+0.051
istent+0.050
 infant+0.050
Neg logit contributions
 am-0.076
-0.076
uch-0.070
fire-0.070
 soda-0.067
Token did
Token ID750
Feature activation-7.90e-03

Pos logit contributions
asis+0.159
uch+0.145
at+0.145
aches+0.144
fire+0.143
Neg logit contributions
 It-0.137
 She-0.127
 Even-0.121
 loudly-0.119
 He-0.119
Token their
Token ID511
Feature activation-1.86e-02

Pos logit contributions
 infant+0.014
 antics+0.013
 personality+0.013
 Can+0.013
 loudly+0.012
Neg logit contributions
uch-0.012
aches-0.011
at-0.011
ats-0.010
ade-0.010
Token snow
Token ID6729
Feature activation-3.39e-03

Pos logit contributions
 antics+0.028
 infant+0.027
 personality+0.026
 daughter+0.024
 Can+0.023
Neg logit contributions
uch-0.034
aches-0.033
at-0.030
ons-0.030
ade-0.030
Tokenman
Token ID805
Feature activation+3.81e-02

Pos logit contributions
 antics+0.004
 infant+0.004
amb+0.004
 personality+0.004
 loudly+0.003
Neg logit contributions
aches-0.006
 else-0.006
uch-0.006
str-0.005
ats-0.005
Token go
Token ID467
Feature activation-6.01e-03

Pos logit contributions
uch+0.057
ade+0.057
ats+0.056
aches+0.056
w+0.053
Neg logit contributions
 infant-0.055
 Can-0.053
 antics-0.051
 It-0.050
 personality-0.049
Token explore
Token ID7301
Feature activation+2.16e-02

Pos logit contributions
uch+0.017
aches+0.015
at+0.014
ade+0.014
ats+0.014
Neg logit contributions
 antics-0.020
 infant-0.020
 personality-0.020
 Can-0.019
amb-0.018
Token the
Token ID262
Feature activation+1.60e-02

Pos logit contributions
uch+0.043
aches+0.035
at+0.035
w+0.034
ats+0.033
Neg logit contributions
 infant-0.032
 loudly-0.031
 antics-0.031
 wondering-0.031
 personality-0.031
Token world
Token ID995
Feature activation+3.89e-02

Pos logit contributions
aches+0.030
uch+0.029
ats+0.027
at+0.027
ade+0.026
Neg logit contributions
 infant-0.023
 antics-0.021
 personality-0.021
 Can-0.020
 wondering-0.019
Token outside
Token ID2354
Feature activation+2.62e-02

Pos logit contributions
uch+0.065
aches+0.058
ats+0.053
w+0.053
at+0.052
Neg logit contributions
 infant-0.063
 antics-0.063
 Can-0.060
 personality-0.059
 loudly-0.059
Token.
Token ID13
Feature activation-3.81e-03

Pos logit contributions
uch+0.037
aches+0.034
ats+0.030
at+0.030
ade+0.030
Neg logit contributions
 antics-0.049
 infant-0.049
 personality-0.047
 Can-0.046
amb-0.044
Token Where
Token ID6350
Feature activation+9.03e-02

Pos logit contributions
 antics+0.007
 infant+0.007
 personality+0.006
 Can+0.006
amb+0.006
Neg logit contributions
uch-0.006
aches-0.005
at-0.005
ade-0.005
ats-0.005
Tokenver
Token ID332
Feature activation+5.73e-03

Pos logit contributions
uch+0.136
asis+0.133
aches+0.130
at+0.126
fire+0.125
Neg logit contributions
 It-0.132
 hesitant-0.131
 loudly-0.131
 personality-0.128
 infant-0.127
Token he
Token ID339
Feature activation+2.95e-02

Pos logit contributions
uch+0.008
aches+0.008
ons+0.007
ade+0.007
 done+0.007
Neg logit contributions
 antics-0.010
 infant-0.010
 personality-0.010
amb-0.009
 Can-0.009
Token went
Token ID1816
Feature activation-1.54e-02

Pos logit contributions
aches+0.050
uch+0.050
ats+0.046
ade+0.045
at+0.044
Neg logit contributions
 antics-0.042
 Can-0.041
 personality-0.041
 infant-0.041
amb-0.039
Token,
Token ID11
Feature activation-8.18e-03

Pos logit contributions
 antics+0.025
 infant+0.024
 personality+0.024
 Can+0.023
amb+0.022
Neg logit contributions
uch-0.025
aches-0.024
at-0.022
ats-0.022
ade-0.022
Token Jack
Token ID3619
Feature activation+3.62e-03

Pos logit contributions
 infant+0.012
 antics+0.012
 personality+0.012
 Can+0.011
 loudly+0.011
Neg logit contributions
uch-0.015
aches-0.014
at-0.014
ats-0.013
ade-0.013
Token still
Token ID991
Feature activation-1.77e-02

Pos logit contributions
 antics+0.000
 infant+0.000
 personality+0.000
 Can+0.000
amb+0.000
Neg logit contributions
uch-0.000
aches-0.000
at-0.000
ats-0.000
ade-0.000
Token wears
Token ID17326
Feature activation-4.25e-03

Pos logit contributions
 antics+0.032
 infant+0.031
 personality+0.029
 Can+0.029
amb+0.029
Neg logit contributions
uch-0.025
aches-0.023
 done-0.022
ats-0.021
 am-0.021
Token it
Token ID340
Feature activation+3.63e-02

Pos logit contributions
 antics+0.006
 personality+0.006
 infant+0.006
 Can+0.006
 lear+0.005
Neg logit contributions
uch-0.008
aches-0.007
at-0.007
asis-0.007
ats-0.007
Token today
Token ID1909
Feature activation+3.31e-02

Pos logit contributions
uch+0.064
aches+0.060
asis+0.055
at+0.055
ats+0.054
Neg logit contributions
 personality-0.054
 infant-0.054
 antics-0.054
 loudly-0.050
 Can-0.050
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
uch+0.051
aches+0.048
at+0.043
ats+0.043
ade+0.042
Neg logit contributions
 antics-0.057
 infant-0.056
 personality-0.056
 Can-0.053
 lear-0.051
Token Where
Token ID6350
Feature activation+9.01e-02

Pos logit contributions
 antics+0.024
 infant+0.021
rily+0.019
 adventurers+0.019
rier+0.019
Neg logit contributions
uch-0.038
 am-0.037
ons-0.036
aches-0.036
he-0.035
Tokenver
Token ID332
Feature activation+6.45e-03

Pos logit contributions
asis+0.137
uch+0.133
aches+0.128
at+0.127
fire+0.126
Neg logit contributions
 personality-0.133
 infant-0.131
 loudly-0.131
 hesitant-0.131
 It-0.129
Token she
Token ID673
Feature activation+2.81e-02

Pos logit contributions
uch+0.010
aches+0.009
ons+0.008
ats+0.008
ade+0.008
Neg logit contributions
 antics-0.011
 infant-0.011
 personality-0.010
amb-0.010
 Can-0.010
Token goes
Token ID2925
Feature activation-4.77e-03

Pos logit contributions
uch+0.048
aches+0.046
ats+0.043
at+0.042
ade+0.042
Neg logit contributions
 antics-0.041
 infant-0.041
 personality-0.041
 Can-0.039
amb-0.038
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 antics+0.008
 infant+0.008
 personality+0.008
 Can+0.008
amb+0.007
Neg logit contributions
uch-0.007
aches-0.007
at-0.006
ats-0.006
ade-0.006
Token it
Token ID340
Feature activation+3.58e-02

Pos logit contributions
 infant+0.015
 personality+0.014
 antics+0.014
 Can+0.014
 daughter+0.013
Neg logit contributions
uch-0.019
aches-0.017
at-0.017
ats-0.017
ade-0.016
Token
Token ID198
Feature activation+4.60e-02

Pos logit contributions
uch+0.104
aches+0.094
at+0.089
asis+0.088
ats+0.084
Neg logit contributions
 Even-0.105
 antics-0.101
 Can-0.101
 infant-0.101
 personality-0.101
Tokenâ
Token ID22940
Feature activation-6.10e-02

Pos logit contributions
uch+0.071
aches+0.066
at+0.060
ats+0.060
ade+0.060
Neg logit contributions
 antics-0.079
 infant-0.077
 personality-0.075
 Can-0.072
amb-0.071
Token
Token ID26391
Feature activation-5.01e-02

Pos logit contributions
 antics+0.098
 infant+0.096
 personality+0.094
 Can+0.089
amb+0.088
Neg logit contributions
uch-0.100
aches-0.094
ade-0.086
ats-0.086
at-0.086
Token
Token ID129
Feature activation-6.93e-02

Pos logit contributions
 antics+0.075
 personality+0.072
 infant+0.069
 Can+0.068
 Even+0.067
Neg logit contributions
aches-0.084
uch-0.083
ade-0.077
at-0.076
asis-0.071
Token
Token ID241
Feature activation+2.09e-02

Pos logit contributions
 infant+0.097
 antics+0.093
 personality+0.092
 Can+0.091
 lear+0.085
Neg logit contributions
uch-0.129
aches-0.125
ats-0.117
 done-0.115
at-0.114
TokenWhere
Token ID8496
Feature activation+9.01e-02

Pos logit contributions
uch+0.033
aches+0.030
 am+0.028
ade+0.028
ats+0.028
Neg logit contributions
 antics-0.035
 infant-0.034
 personality-0.033
rily-0.031
 Can-0.031
Tokenâ
Token ID22940
Feature activation-4.67e-02

Pos logit contributions
uch+0.125
asis+0.123
aches+0.121
fire+0.118
at+0.115
Neg logit contributions
 personality-0.143
 Can-0.142
 It-0.141
 hesitant-0.140
 Even-0.139
Token
Token ID26391
Feature activation-4.29e-02

Pos logit contributions
 antics+0.076
 infant+0.074
 personality+0.074
 Can+0.072
amb+0.071
Neg logit contributions
aches-0.067
uch-0.066
ons-0.061
ade-0.060
ats-0.060
Token
Token ID8151
Feature activation+3.61e-02

Pos logit contributions
 antics+0.069
 personality+0.066
 Can+0.065
 infant+0.065
 Even+0.061
Neg logit contributions
aches-0.067
uch-0.066
at-0.060
ade-0.060
ats-0.058
Tokens
Token ID82
Feature activation+4.52e-03

Pos logit contributions
aches+0.053
uch+0.051
asis+0.048
fire+0.047
aster+0.044
Neg logit contributions
 Can-0.055
 infant-0.055
 She-0.055
istent-0.055
 Even-0.054
Token my
Token ID616
Feature activation-1.43e-02

Pos logit contributions
uch+0.006
aches+0.006
at+0.005
ats+0.005
ade+0.005
Neg logit contributions
 infant-0.009
 antics-0.008
 personality-0.008
 Can-0.008
 daughter-0.007
Token very
Token ID845
Feature activation-1.93e-02

Pos logit contributions
 antics+0.026
 infant+0.026
 personality+0.025
 Can+0.023
amb+0.023
Neg logit contributions
uch-0.037
aches-0.035
at-0.033
ade-0.033
ats-0.033
Token confused
Token ID10416
Feature activation-1.30e-02

Pos logit contributions
 infant+0.019
 antics+0.019
 personality+0.018
 Can+0.016
 loudly+0.016
Neg logit contributions
uch-0.044
aches-0.041
at-0.040
ats-0.040
ade-0.040
Token and
Token ID290
Feature activation-2.12e-02

Pos logit contributions
 antics+0.019
 infant+0.018
 personality+0.018
 Can+0.017
amb+0.017
Neg logit contributions
uch-0.023
aches-0.022
ade-0.020
ats-0.020
at-0.020
Token scared
Token ID12008
Feature activation-1.09e-02

Pos logit contributions
 antics+0.029
 infant+0.027
amb+0.027
 personality+0.026
 Can+0.024
Neg logit contributions
uch-0.039
aches-0.038
 done-0.037
 am-0.035
ats-0.035
Token.
Token ID13
Feature activation-4.28e-02

Pos logit contributions
 antics+0.026
 personality+0.025
 infant+0.025
amb+0.024
istent+0.023
Neg logit contributions
uch-0.008
aches-0.008
ade-0.007
ons-0.007
ats-0.007
Token Where
Token ID6350
Feature activation+9.00e-02

Pos logit contributions
 antics+0.056
 lear+0.051
heart+0.050
urbed+0.050
amb+0.050
Neg logit contributions
 am-0.071
-0.070
uch-0.070
fire-0.067
 soda-0.064
Token did
Token ID750
Feature activation-7.38e-03

Pos logit contributions
asis+0.167
at+0.150
uch+0.148
aches+0.148
fire+0.146
Neg logit contributions
 It-0.129
 She-0.116
 loudly-0.115
 Can-0.111
 Even-0.111
Token the
Token ID262
Feature activation+7.35e-03

Pos logit contributions
 infant+0.013
 antics+0.013
 personality+0.012
 Can+0.012
 loudly+0.011
Neg logit contributions
uch-0.011
aches-0.010
at-0.010
ats-0.009
ade-0.009
Token volcano
Token ID32410
Feature activation+7.01e-02

Pos logit contributions
uch+0.012
aches+0.011
ats+0.011
at+0.010
ade+0.010
Neg logit contributions
 infant-0.013
 antics-0.013
 Can-0.012
 personality-0.012
 loudly-0.011
Token go
Token ID467
Feature activation+3.86e-03

Pos logit contributions
ats+0.116
uch+0.114
aches+0.111
at+0.108
live+0.104
Neg logit contributions
 infant-0.096
 Can-0.095
 loudly-0.091
 hesitant-0.087
 He-0.087
Token?
Token ID30
Feature activation-3.04e-02

Pos logit contributions
uch+0.006
aches+0.005
ats+0.005
at+0.005
ade+0.005
Neg logit contributions
 infant-0.007
 Can-0.007
 She-0.006
 loudly-0.006
 antics-0.006
Token He
Token ID679
Feature activation-6.83e-03

Pos logit contributions
 antics+0.035
 lear+0.034
arted+0.032
amb+0.031
iously+0.031
Neg logit contributions
uch-0.044
ons-0.043
 am-0.043
he-0.042
-0.041
Token was
Token ID373
Feature activation-4.15e-02

Pos logit contributions
 antics+0.012
 infant+0.012
 personality+0.012
 Can+0.011
amb+0.011
Neg logit contributions
uch-0.010
aches-0.009
ats-0.008
ade-0.008
at-0.008
Token a
Token ID257
Feature activation-3.75e-02

Pos logit contributions
 antics+0.085
istent+0.077
 personality+0.076
urbed+0.076
 infant+0.075
Neg logit contributions
uch-0.052
aches-0.050
 done-0.048
at-0.043
 am-0.042
Token very
Token ID845
Feature activation-5.88e-02

Pos logit contributions
 antics+0.072
arted+0.069
urbed+0.067
iously+0.067
istent+0.067
Neg logit contributions
uch-0.052
str-0.047
 am-0.046
asis-0.045
aches-0.044
Token adventurous
Token ID39770
Feature activation-8.50e-02

Pos logit contributions
 antics+0.095
 infant+0.090
 personality+0.087
urbed+0.085
amb+0.083
Neg logit contributions
uch-0.098
aches-0.093
ade-0.086
at-0.085
ats-0.084
Token sw
Token ID1509
Feature activation-1.20e-01

Pos logit contributions
istent+0.154
 antics+0.151
urbed+0.148
rily+0.147
iously+0.145
Neg logit contributions
asis-0.114
str-0.110
 am-0.108
ade-0.106
lly-0.103
Tokenan
Token ID272
Feature activation-3.33e-02

Pos logit contributions
 infant+0.217
 Can+0.215
 antics+0.210
 personality+0.205
 Even+0.193
Neg logit contributions
uch-0.179
aches-0.166
ats-0.158
at-0.156
ade-0.155
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
 antics+0.072
 infant+0.069
 personality+0.068
 Can+0.065
rily+0.065
Neg logit contributions
uch-0.036
aches-0.033
at-0.030
ats-0.030
ade-0.029
Token
Token ID220
Feature activation+9.07e-03

Pos logit contributions
 antics+0.033
 infant+0.031
 personality+0.030
 lear+0.030
arted+0.030
Neg logit contributions
uch-0.031
aches-0.029
ons-0.028
 am-0.028
at-0.027
Token
Token ID198
Feature activation+5.97e-02

Pos logit contributions
uch+0.018
aches+0.017
asis+0.016
at+0.016
w+0.015
Neg logit contributions
 It-0.013
 She-0.013
 He-0.012
 Even-0.012
 As-0.011
Token
Token ID198
Feature activation+3.20e-02

Pos logit contributions
uch+0.098
aches+0.091
at+0.083
ats+0.081
ade+0.081
Neg logit contributions
 antics-0.095
 Can-0.094
 infant-0.094
 Even-0.094
 personality-0.093
Token time
Token ID640
Feature activation-2.72e-05

Pos logit contributions
uch+0.003
aches+0.002
ons+0.002
asis+0.002
at+0.002
Neg logit contributions
 antics-0.003
 personality-0.003
 infant-0.003
 loudly-0.003
 lear-0.003
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
 antics+0.000
 infant+0.000
 personality+0.000
 Can+0.000
amb+0.000
Neg logit contributions
uch-0.000
aches-0.000
at-0.000
ats-0.000
ade-0.000
Token there
Token ID612
Feature activation+4.67e-02

Pos logit contributions
uch+0.017
aches+0.015
ats+0.014
at+0.014
ade+0.014
Neg logit contributions
 infant-0.030
 antics-0.030
 personality-0.029
 Can-0.028
amb-0.027
Token was
Token ID373
Feature activation-3.62e-02

Pos logit contributions
uch+0.074
aches+0.073
at+0.066
ats+0.066
asis+0.063
Neg logit contributions
 infant-0.075
 antics-0.074
 personality-0.073
 It-0.073
 Even-0.073
Token a
Token ID257
Feature activation-3.67e-02

Pos logit contributions
 antics+0.079
 personality+0.076
istent+0.073
 infant+0.073
amb+0.073
Neg logit contributions
uch-0.039
aches-0.033
ade-0.031
ons-0.031
at-0.030
Token sw
Token ID1509
Feature activation-1.17e-01

Pos logit contributions
arted+0.059
 antics+0.058
iously+0.058
urbed+0.058
amb+0.056
Neg logit contributions
uch-0.058
str-0.055
lly-0.053
aches-0.052
asis-0.052
Tokenan
Token ID272
Feature activation-3.56e-02

Pos logit contributions
 Can+0.216
 infant+0.215
 antics+0.210
 personality+0.207
istent+0.195
Neg logit contributions
uch-0.170
aches-0.158
ade-0.154
at-0.153
ats-0.151
Token which
Token ID543
Feature activation+1.26e-02

Pos logit contributions
 antics+0.057
 infant+0.057
 personality+0.055
amb+0.053
 Can+0.052
Neg logit contributions
uch-0.058
aches-0.054
ats-0.050
ade-0.050
at-0.049
Token had
Token ID550
Feature activation+1.10e-02

Pos logit contributions
uch+0.020
aches+0.019
ats+0.018
at+0.018
ade+0.017
Neg logit contributions
 infant-0.020
 personality-0.020
 antics-0.020
 Can-0.019
amb-0.018
Token a
Token ID257
Feature activation-1.43e-02

Pos logit contributions
uch+0.015
aches+0.014
ade+0.012
ats+0.012
at+0.012
Neg logit contributions
 antics-0.021
 infant-0.020
 personality-0.020
 Can-0.019
amb-0.019
Token very
Token ID845
Feature activation-3.70e-02

Pos logit contributions
 antics+0.028
arted+0.027
 infant+0.027
istent+0.027
urbed+0.027
Neg logit contributions
uch-0.018
aches-0.016
 am-0.015
ade-0.015
str-0.015
Token,
Token ID11
Feature activation+1.52e-02

Pos logit contributions
 infant+0.004
 antics+0.004
 Can+0.004
 personality+0.004
 Even+0.004
Neg logit contributions
uch-0.007
aches-0.006
at-0.006
ats-0.006
asis-0.006
Token there
Token ID612
Feature activation+4.56e-02

Pos logit contributions
uch+0.020
aches+0.020
ats+0.018
at+0.018
ade+0.018
Neg logit contributions
 infant-0.029
 Can-0.028
 personality-0.027
 antics-0.027
 loudly-0.026
Token was
Token ID373
Feature activation-3.63e-02

Pos logit contributions
aches+0.076
uch+0.076
at+0.069
ats+0.068
asis+0.065
Neg logit contributions
 It-0.069
 infant-0.068
 Even-0.067
 Can-0.066
 antics-0.065
Token a
Token ID257
Feature activation-3.70e-02

Pos logit contributions
 antics+0.081
 personality+0.077
istent+0.074
urbed+0.074
 infant+0.073
Neg logit contributions
uch-0.039
aches-0.033
ons-0.031
ade-0.030
at-0.030
Token blue
Token ID4171
Feature activation-5.80e-02

Pos logit contributions
arted+0.062
iously+0.061
 antics+0.061
urbed+0.059
amb+0.058
Neg logit contributions
uch-0.058
str-0.055
lly-0.052
aches-0.051
 else-0.051
Token sw
Token ID1509
Feature activation-1.17e-01

Pos logit contributions
 antics+0.097
istent+0.096
arted+0.094
urbed+0.093
iously+0.090
Neg logit contributions
uch-0.087
ade-0.079
asis-0.077
aches-0.076
at-0.076
Tokenan
Token ID272
Feature activation-3.31e-02

Pos logit contributions
 Can+0.221
 infant+0.219
 antics+0.214
 personality+0.210
istent+0.202
Neg logit contributions
uch-0.167
aches-0.155
at-0.151
ade-0.150
asis-0.149
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
 antics+0.064
 infant+0.062
 personality+0.061
 Can+0.058
amb+0.058
Neg logit contributions
uch-0.044
aches-0.041
at-0.037
ats-0.036
ade-0.036
Token He
Token ID679
Feature activation-1.59e-02

Pos logit contributions
 antics+0.057
urbed+0.053
 lear+0.052
arted+0.051
rier+0.051
Neg logit contributions
 am-0.048
uch-0.047
aster-0.046
aches-0.044
ons-0.043
Token was
Token ID373
Feature activation-4.42e-02

Pos logit contributions
 antics+0.027
 personality+0.027
 infant+0.026
 Can+0.025
amb+0.024
Neg logit contributions
uch-0.024
aches-0.023
ats-0.021
at-0.021
ade-0.021
Token swimming
Token ID14899
Feature activation-4.80e-02

Pos logit contributions
 antics+0.091
istent+0.081
urbed+0.081
 personality+0.081
 lear+0.081
Neg logit contributions
uch-0.056
aches-0.053
 done-0.051
at-0.045
 am-0.045
Token time
Token ID640
Feature activation+2.51e-03

Pos logit contributions
uch+0.000
aches+0.000
ons+0.000
at+0.000
ade+0.000
Neg logit contributions
 antics-0.000
 infant-0.000
 personality-0.000
 Can-0.000
amb-0.000
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
uch+0.005
aches+0.005
at+0.004
ade+0.004
ats+0.004
Neg logit contributions
 antics-0.003
 infant-0.003
 personality-0.003
 Can-0.003
amb-0.003
Token there
Token ID612
Feature activation+5.09e-02

Pos logit contributions
uch+0.017
aches+0.016
ats+0.014
at+0.014
ade+0.014
Neg logit contributions
 infant-0.028
 antics-0.027
 personality-0.027
 Can-0.027
amb-0.025
Token was
Token ID373
Feature activation-3.46e-02

Pos logit contributions
uch+0.084
aches+0.083
at+0.076
ats+0.075
asis+0.072
Neg logit contributions
 It-0.077
 infant-0.077
 Even-0.076
 antics-0.076
 personality-0.075
Token a
Token ID257
Feature activation-3.70e-02

Pos logit contributions
 antics+0.077
 personality+0.073
istent+0.071
urbed+0.071
amb+0.070
Neg logit contributions
uch-0.036
aches-0.030
ons-0.028
ade-0.028
 done-0.028
Token sw
Token ID1509
Feature activation-1.17e-01

Pos logit contributions
arted+0.062
iously+0.061
 antics+0.061
urbed+0.060
istent+0.059
Neg logit contributions
uch-0.056
str-0.054
lly-0.052
asis-0.051
aches-0.051
Tokenan
Token ID272
Feature activation-3.45e-02

Pos logit contributions
 Can+0.222
 infant+0.218
 antics+0.215
 personality+0.211
istent+0.201
Neg logit contributions
uch-0.165
aches-0.152
ade-0.150
at-0.150
asis-0.148
Token swimming
Token ID14899
Feature activation-4.89e-02

Pos logit contributions
 antics+0.059
 infant+0.058
 personality+0.057
 Can+0.054
amb+0.054
Neg logit contributions
uch-0.053
aches-0.049
ats-0.045
ade-0.045
at-0.045
Token in
Token ID287
Feature activation+2.46e-02

Pos logit contributions
 infant+0.087
 antics+0.086
 personality+0.085
amb+0.084
 Can+0.081
Neg logit contributions
uch-0.072
aches-0.068
ons-0.060
ats-0.059
at-0.058
Token a
Token ID257
Feature activation-1.04e-02

Pos logit contributions
uch+0.027
aches+0.025
ade+0.021
at+0.021
ats+0.021
Neg logit contributions
 antics-0.053
 infant-0.052
 personality-0.051
amb-0.049
 Can-0.049
Token big
Token ID1263
Feature activation-4.86e-02

Pos logit contributions
 antics+0.019
 infant+0.019
 personality+0.018
 Can+0.018
amb+0.017
Neg logit contributions
uch-0.015
aches-0.014
at-0.012
ats-0.012
ade-0.012
Token Alex
Token ID4422
Feature activation-2.68e-02

Pos logit contributions
 antics+0.028
 infant+0.027
iously+0.026
rier+0.025
 lear+0.024
Neg logit contributions
-0.029
 am-0.027
<|endoftext|>-0.025
he-0.025
 else-0.025
Token was
Token ID373
Feature activation-4.17e-02

Pos logit contributions
 antics+0.052
 infant+0.052
 personality+0.049
 Can+0.048
rily+0.047
Neg logit contributions
uch-0.034
aches-0.031
ats-0.029
at-0.028
ade-0.028
Token a
Token ID257
Feature activation-3.87e-02

Pos logit contributions
 antics+0.071
 infant+0.066
 personality+0.065
 Can+0.063
amb+0.062
Neg logit contributions
uch-0.064
aches-0.061
 done-0.058
at-0.055
ats-0.055
Token very
Token ID845
Feature activation-5.73e-02

Pos logit contributions
 antics+0.072
arted+0.069
iously+0.066
rily+0.066
istent+0.066
Neg logit contributions
uch-0.053
str-0.050
 am-0.048
lly-0.047
asis-0.046
Token obedient
Token ID41652
Feature activation-8.24e-02

Pos logit contributions
 antics+0.095
 infant+0.088
 personality+0.086
urbed+0.084
amb+0.083
Neg logit contributions
uch-0.092
aches-0.089
ade-0.081
 done-0.080
ats-0.079
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
istent+0.117
 antics+0.115
rily+0.112
urbed+0.109
atched+0.108
Neg logit contributions
 last-0.121
ade-0.120
 am-0.119
asis-0.119
aches-0.116
Tokenan
Token ID272
Feature activation-2.99e-02

Pos logit contributions
 Can+0.233
 infant+0.226
 personality+0.213
 antics+0.211
 Even+0.203
Neg logit contributions
uch-0.158
aches-0.152
ats-0.149
at-0.142
ade-0.140
Token and
Token ID290
Feature activation-1.43e-02

Pos logit contributions
 antics+0.059
 infant+0.057
 personality+0.057
 Can+0.054
amb+0.053
Neg logit contributions
uch-0.037
aches-0.034
ats-0.031
at-0.031
ade-0.031
Token he
Token ID339
Feature activation-6.14e-03

Pos logit contributions
 antics+0.025
 infant+0.024
 personality+0.023
amb+0.022
rily+0.022
Neg logit contributions
uch-0.021
aches-0.020
 done-0.018
 am-0.018
ats-0.018
Token always
Token ID1464
Feature activation-3.04e-03

Pos logit contributions
 antics+0.009
 infant+0.009
 personality+0.009
 Can+0.009
amb+0.008
Neg logit contributions
uch-0.010
aches-0.010
at-0.009
ade-0.009
ats-0.009
Token followed
Token ID3940
Feature activation-2.53e-02

Pos logit contributions
 antics+0.004
 infant+0.004
 personality+0.004
 Can+0.004
amb+0.004
Neg logit contributions
uch-0.006
aches-0.005
ade-0.005
at-0.005
ats-0.005
Token pond
Token ID16723
Feature activation+1.18e-02

Pos logit contributions
 antics+0.014
 personality+0.013
 infant+0.013
 Can+0.013
 adventurers+0.012
Neg logit contributions
uch-0.018
aches-0.017
ons-0.017
ats-0.016
at-0.016
Token,
Token ID11
Feature activation+5.55e-03

Pos logit contributions
uch+0.024
rob+0.023
at+0.023
aches+0.022
ats+0.022
Neg logit contributions
 She-0.013
 personality-0.011
 It-0.011
 Can-0.011
 hesitant-0.010
Token there
Token ID612
Feature activation+4.77e-02

Pos logit contributions
uch+0.006
aches+0.005
ats+0.005
at+0.005
ade+0.005
Neg logit contributions
 antics-0.012
 infant-0.012
 personality-0.012
 Can-0.011
amb-0.011
Token was
Token ID373
Feature activation-3.03e-02

Pos logit contributions
uch+0.063
aches+0.058
ats+0.053
at+0.053
asis+0.050
Neg logit contributions
 infant-0.090
 antics-0.090
 personality-0.088
 Can-0.085
 Even-0.082
Token a
Token ID257
Feature activation-3.03e-02

Pos logit contributions
 infant+0.044
 Can+0.042
 personality+0.042
 antics+0.042
 loudly+0.039
Neg logit contributions
uch-0.054
aches-0.051
ats-0.049
at-0.048
ons-0.047
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
 antics+0.056
amb+0.053
urbed+0.053
istent+0.052
 infant+0.052
Neg logit contributions
uch-0.042
aches-0.039
ade-0.037
asis-0.037
 am-0.036
Tokenan
Token ID272
Feature activation-3.40e-02

Pos logit contributions
 infant+0.218
 Can+0.210
 antics+0.208
 personality+0.203
istent+0.196
Neg logit contributions
uch-0.170
aches-0.159
ade-0.154
asis-0.150
ats-0.150
Token.
Token ID13
Feature activation-3.13e-02

Pos logit contributions
 antics+0.063
 infant+0.063
 personality+0.061
 Can+0.059
amb+0.059
Neg logit contributions
uch-0.047
aches-0.043
ats-0.039
at-0.039
ade-0.038
Token The
Token ID383
Feature activation-1.81e-02

Pos logit contributions
 antics+0.051
amb+0.046
 infant+0.046
urbed+0.046
 lear+0.045
Neg logit contributions
uch-0.048
ons-0.045
 am-0.044
aches-0.044
aster-0.043
Token sw
Token ID1509
Feature activation-1.02e-01

Pos logit contributions
 antics+0.023
 infant+0.022
 personality+0.022
 Can+0.020
amb+0.020
Neg logit contributions
uch-0.036
aches-0.034
ons-0.032
at-0.032
ats-0.032
Tokenan
Token ID272
Feature activation-1.72e-02

Pos logit contributions
 infant+0.189
 Can+0.179
 antics+0.179
 personality+0.174
 lear+0.170
Neg logit contributions
uch-0.150
aches-0.140
ats-0.132
ade-0.131
asis-0.129
Token saw
Token ID2497
Feature activation+1.62e-02

Pos logit contributions
 antics+0.054
 infant+0.053
 personality+0.052
 Can+0.050
amb+0.049
Neg logit contributions
uch-0.044
aches-0.041
at-0.037
ats-0.037
ade-0.037
Token a
Token ID257
Feature activation-2.35e-02

Pos logit contributions
uch+0.030
aches+0.027
ats+0.027
asis+0.026
at+0.026
Neg logit contributions
 infant-0.023
 antics-0.021
 personality-0.021
 Can-0.021
 loudly-0.020
Token big
Token ID1263
Feature activation-5.04e-02

Pos logit contributions
 antics+0.047
 infant+0.046
 personality+0.045
amb+0.044
urbed+0.043
Neg logit contributions
uch-0.029
aches-0.027
ade-0.024
asis-0.023
ats-0.023
Token,
Token ID11
Feature activation-4.42e-02

Pos logit contributions
 antics+0.098
urbed+0.091
amb+0.091
 personality+0.091
 infant+0.090
Neg logit contributions
uch-0.068
aches-0.062
asis-0.060
ade-0.059
 am-0.056
Token graceful
Token ID44363
Feature activation-6.78e-02

Pos logit contributions
 antics+0.083
amb+0.079
 personality+0.075
urbed+0.075
iously+0.073
Neg logit contributions
uch-0.061
aches-0.058
ade-0.055
 done-0.052
str-0.052
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
istent+0.130
arted+0.123
urbed+0.123
iously+0.122
 Whoever+0.121
Neg logit contributions
ade-0.082
asis-0.081
uch-0.079
aches-0.077
str-0.072
Tokenan
Token ID272
Feature activation-3.58e-02

Pos logit contributions
 infant+0.212
 Can+0.205
 antics+0.202
 personality+0.198
 lear+0.187
Neg logit contributions
uch-0.177
aches-0.162
ade-0.155
ats-0.154
at-0.152
Token swimming
Token ID14899
Feature activation-4.84e-02

Pos logit contributions
 antics+0.071
 infant+0.070
 personality+0.069
 Can+0.066
istent+0.064
Neg logit contributions
uch-0.046
aches-0.042
ats-0.038
ade-0.037
at-0.037
Token in
Token ID287
Feature activation+2.20e-02

Pos logit contributions
 antics+0.091
 infant+0.090
 personality+0.087
 Can+0.085
amb+0.084
Neg logit contributions
uch-0.066
aches-0.062
ats-0.055
at-0.055
ons-0.054
Token the
Token ID262
Feature activation+7.85e-03

Pos logit contributions
uch+0.027
aches+0.024
at+0.022
ats+0.022
ade+0.022
Neg logit contributions
 antics-0.045
 infant-0.045
 personality-0.043
 Can-0.042
amb-0.041
Token pond
Token ID16723
Feature activation-5.82e-04

Pos logit contributions
aches+0.014
uch+0.014
ons+0.014
ats+0.013
kins+0.013
Neg logit contributions
 infant-0.010
 personality-0.010
 antics-0.010
 Can-0.010
 Even-0.009
Token It
Token ID632
Feature activation+1.25e-02

Pos logit contributions
 antics+0.039
 infant+0.036
urbed+0.035
 lear+0.034
rier+0.034
Neg logit contributions
uch-0.051
 am-0.049
aches-0.048
asis-0.046
at-0.046
Token was
Token ID373
Feature activation-4.36e-02

Pos logit contributions
uch+0.018
aches+0.016
ats+0.015
ons+0.015
at+0.015
Neg logit contributions
 personality-0.022
 infant-0.022
 antics-0.022
 Can-0.022
amb-0.021
Token a
Token ID257
Feature activation-3.92e-02

Pos logit contributions
 antics+0.078
 infant+0.075
 personality+0.074
 Can+0.071
amb+0.070
Neg logit contributions
uch-0.065
aches-0.060
at-0.054
ade-0.054
ats-0.054
Token very
Token ID845
Feature activation-6.54e-02

Pos logit contributions
 antics+0.076
urbed+0.071
arted+0.071
 infant+0.069
iously+0.069
Neg logit contributions
uch-0.054
aches-0.047
str-0.046
asis-0.045
 am-0.044
Token beautiful
Token ID4950
Feature activation-5.96e-02

Pos logit contributions
 antics+0.112
 infant+0.109
 personality+0.105
 Can+0.101
amb+0.101
Neg logit contributions
uch-0.102
aches-0.095
ade-0.087
at-0.086
ats-0.085
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
 antics+0.104
istent+0.099
 infant+0.099
urbed+0.097
arted+0.097
Neg logit contributions
uch-0.086
aches-0.081
asis-0.080
ade-0.078
at-0.074
Tokenan
Token ID272
Feature activation-3.32e-02

Pos logit contributions
 infant+0.204
 antics+0.199
 Can+0.198
 personality+0.194
 lear+0.183
Neg logit contributions
uch-0.180
aches-0.166
at-0.157
ats-0.156
ade-0.156
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
 antics+0.070
 infant+0.068
 personality+0.067
 Can+0.064
amb+0.063
Neg logit contributions
uch-0.039
aches-0.035
at-0.031
ats-0.031
ade-0.031
Token its
Token ID663
Feature activation-2.44e-02

Pos logit contributions
 antics+0.036
 infant+0.035
 personality+0.034
amb+0.033
 Can+0.032
Neg logit contributions
uch-0.032
aches-0.030
ons-0.027
ade-0.027
ats-0.026
Token feathers
Token ID27400
Feature activation+5.67e-03

Pos logit contributions
 antics+0.036
 infant+0.035
 personality+0.035
amb+0.032
 Can+0.032
Neg logit contributions
uch-0.044
aches-0.041
ons-0.038
ats-0.038
at-0.038
Token were
Token ID547
Feature activation-2.08e-02

Pos logit contributions
uch+0.009
aster+0.009
aches+0.008
ats+0.008
at+0.008
Neg logit contributions
 personality-0.009
 antics-0.009
 infant-0.008
amb-0.008
 loudly-0.008
Token saw
Token ID2497
Feature activation+1.75e-02

Pos logit contributions
 infant+0.024
 antics+0.024
 personality+0.023
 Can+0.022
amb+0.022
Neg logit contributions
uch-0.018
aches-0.017
at-0.015
asis-0.014
ats-0.014
Token a
Token ID257
Feature activation-2.16e-02

Pos logit contributions
uch+0.033
aches+0.030
asis+0.030
ats+0.029
at+0.029
Neg logit contributions
 infant-0.026
 personality-0.022
 antics-0.022
 She-0.022
 loudly-0.022
Token big
Token ID1263
Feature activation-5.25e-02

Pos logit contributions
 antics+0.041
 infant+0.039
 personality+0.039
amb+0.038
urbed+0.037
Neg logit contributions
uch-0.030
aches-0.027
ade-0.025
asis-0.024
ats-0.024
Token,
Token ID11
Feature activation-4.39e-02

Pos logit contributions
 antics+0.103
urbed+0.096
 personality+0.095
amb+0.095
istent+0.093
Neg logit contributions
uch-0.069
aches-0.064
asis-0.062
ade-0.061
 am-0.057
Token white
Token ID2330
Feature activation-6.36e-02

Pos logit contributions
 antics+0.084
amb+0.080
urbed+0.075
 personality+0.075
iously+0.074
Neg logit contributions
uch-0.060
aches-0.057
ade-0.053
 done-0.051
str-0.051
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
 antics+0.110
istent+0.107
urbed+0.104
 personality+0.102
arted+0.102
Neg logit contributions
uch-0.095
ade-0.088
aches-0.087
asis-0.085
at-0.081
Tokenan
Token ID272
Feature activation-3.36e-02

Pos logit contributions
 infant+0.220
 Can+0.214
 antics+0.209
 personality+0.206
 lear+0.197
Neg logit contributions
uch-0.170
aches-0.155
ade-0.151
ats-0.149
at-0.148
Token swimming
Token ID14899
Feature activation-4.78e-02

Pos logit contributions
 antics+0.065
 infant+0.063
 personality+0.062
 Can+0.059
amb+0.058
Neg logit contributions
uch-0.045
aches-0.041
at-0.037
ats-0.037
ade-0.037
Token in
Token ID287
Feature activation+2.35e-02

Pos logit contributions
 antics+0.093
 infant+0.092
 personality+0.089
 Can+0.086
amb+0.085
Neg logit contributions
uch-0.063
aches-0.058
ats-0.051
at-0.051
ade-0.051
Token the
Token ID262
Feature activation+8.35e-03

Pos logit contributions
uch+0.028
aches+0.025
at+0.022
ats+0.022
ade+0.022
Neg logit contributions
 antics-0.049
 infant-0.048
 personality-0.047
 Can-0.045
amb-0.045
Token pond
Token ID16723
Feature activation+1.11e-03

Pos logit contributions
aches+0.015
uch+0.015
ons+0.015
ats+0.014
kins+0.014
Neg logit contributions
 infant-0.011
 personality-0.010
 Can-0.010
 antics-0.010
 It-0.010
Token time
Token ID640
Feature activation-1.47e-03

Pos logit contributions
aches+0.004
uch+0.004
ons+0.003
asis+0.003
at+0.003
Neg logit contributions
 antics-0.005
 personality-0.004
 infant-0.004
 loudly-0.004
 Even-0.004
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
 infant+0.002
 antics+0.002
 Can+0.002
 personality+0.002
 Even+0.002
Neg logit contributions
uch-0.003
aches-0.003
at-0.003
ats-0.003
ade-0.003
Token there
Token ID612
Feature activation+4.92e-02

Pos logit contributions
uch+0.026
aches+0.025
ats+0.024
at+0.023
asis+0.023
Neg logit contributions
 infant-0.031
 Can-0.030
 It-0.029
 personality-0.029
 hesitant-0.028
Token was
Token ID373
Feature activation-3.47e-02

Pos logit contributions
aches+0.088
uch+0.087
at+0.081
ats+0.079
asis+0.076
Neg logit contributions
 It-0.071
 Even-0.068
 She-0.067
 infant-0.066
 personality-0.064
Token a
Token ID257
Feature activation-3.77e-02

Pos logit contributions
 antics+0.077
 personality+0.073
istent+0.071
urbed+0.071
 infant+0.070
Neg logit contributions
uch-0.037
aches-0.032
ons-0.030
 done-0.029
ade-0.029
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
arted+0.063
 antics+0.062
iously+0.061
urbed+0.061
istent+0.060
Neg logit contributions
uch-0.058
str-0.054
aches-0.054
asis-0.052
lly-0.052
Tokenan
Token ID272
Feature activation-3.71e-02

Pos logit contributions
 Can+0.210
 infant+0.210
 antics+0.208
 personality+0.203
istent+0.193
Neg logit contributions
uch-0.169
aches-0.159
ade-0.152
at-0.151
asis-0.149
Token named
Token ID3706
Feature activation-4.87e-02

Pos logit contributions
 antics+0.059
 infant+0.058
 personality+0.056
amb+0.054
 Can+0.054
Neg logit contributions
uch-0.061
aches-0.057
ats-0.053
ade-0.052
at-0.052
Token Sam
Token ID3409
Feature activation-1.62e-02

Pos logit contributions
 antics+0.057
 infant+0.055
 personality+0.053
 Can+0.049
amb+0.048
Neg logit contributions
uch-0.102
aches-0.097
ats-0.090
at-0.090
ade-0.090
Token.
Token ID13
Feature activation-3.83e-02

Pos logit contributions
 infant+0.023
 personality+0.022
amb+0.022
 antics+0.021
 loudly+0.021
Neg logit contributions
uch-0.031
aches-0.026
asis-0.025
ade-0.025
 done-0.025
Token Sam
Token ID3409
Feature activation-2.29e-02

Pos logit contributions
 infant+0.066
 antics+0.065
rily+0.063
urbed+0.063
rier+0.061
Neg logit contributions
uch-0.058
ons-0.051
ade-0.051
aches-0.050
 am-0.050
Token,
Token ID11
Feature activation+1.71e-02

Pos logit contributions
 infant+0.003
 antics+0.003
 personality+0.003
 Can+0.003
amb+0.002
Neg logit contributions
uch-0.004
aches-0.003
at-0.003
ats-0.003
ade-0.003
Token there
Token ID612
Feature activation+4.74e-02

Pos logit contributions
uch+0.021
aches+0.019
ats+0.018
at+0.018
ade+0.017
Neg logit contributions
 infant-0.034
 antics-0.033
 personality-0.033
 Can-0.033
 loudly-0.031
Token was
Token ID373
Feature activation-3.26e-02

Pos logit contributions
uch+0.080
aches+0.078
at+0.072
ats+0.072
asis+0.068
Neg logit contributions
 It-0.071
 infant-0.071
 Even-0.070
 antics-0.069
 personality-0.069
Token a
Token ID257
Feature activation-3.37e-02

Pos logit contributions
 antics+0.072
 personality+0.069
istent+0.067
amb+0.067
 infant+0.066
Neg logit contributions
uch-0.033
aches-0.029
ons-0.027
ade-0.027
 done-0.026
Token friendly
Token ID8030
Feature activation-7.94e-02

Pos logit contributions
arted+0.054
 antics+0.053
iously+0.053
urbed+0.052
istent+0.051
Neg logit contributions
uch-0.053
str-0.051
lly-0.049
aches-0.049
 else-0.048
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
urbed+0.135
istent+0.134
iously+0.132
arted+0.132
 antics+0.129
Neg logit contributions
str-0.103
ade-0.102
asis-0.101
 rest-0.100
uch-0.099
Tokenan
Token ID272
Feature activation-3.47e-02

Pos logit contributions
 Can+0.210
 infant+0.208
 antics+0.203
 personality+0.201
urbed+0.189
Neg logit contributions
uch-0.168
aches-0.155
at-0.152
ats-0.151
ade-0.150
Token who
Token ID508
Feature activation-3.41e-03

Pos logit contributions
 antics+0.058
 infant+0.058
 personality+0.056
 Can+0.053
amb+0.053
Neg logit contributions
uch-0.054
aches-0.051
at-0.046
ats-0.046
ade-0.046
Token lived
Token ID5615
Feature activation-7.61e-03

Pos logit contributions
 antics+0.007
 infant+0.007
 personality+0.007
 Can+0.006
amb+0.006
Neg logit contributions
uch-0.004
aches-0.004
at-0.003
ats-0.003
ade-0.003
Token by
Token ID416
Feature activation+2.17e-02

Pos logit contributions
 infant+0.014
 antics+0.013
 personality+0.013
amb+0.013
 Can+0.013
Neg logit contributions
uch-0.011
aches-0.010
ats-0.009
at-0.009
ade-0.009
Token a
Token ID257
Feature activation-4.61e-03

Pos logit contributions
uch+0.027
aches+0.025
ats+0.022
at+0.022
ade+0.021
Neg logit contributions
 antics-0.044
 infant-0.043
 personality-0.042
 Can-0.041
amb-0.040
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
 antics+0.002
 infant+0.002
 personality+0.002
 Can+0.002
amb+0.002
Neg logit contributions
uch-0.003
aches-0.003
at-0.003
ade-0.003
ats-0.003
Token there
Token ID612
Feature activation+4.76e-02

Pos logit contributions
uch+0.016
aches+0.015
ats+0.013
at+0.013
ade+0.013
Neg logit contributions
 infant-0.029
 antics-0.028
 personality-0.028
 Can-0.027
amb-0.026
Token was
Token ID373
Feature activation-3.35e-02

Pos logit contributions
uch+0.078
aches+0.077
at+0.070
ats+0.069
asis+0.067
Neg logit contributions
 infant-0.074
 It-0.073
 Even-0.072
 antics-0.071
 personality-0.071
Token a
Token ID257
Feature activation-3.55e-02

Pos logit contributions
 antics+0.076
 personality+0.072
istent+0.071
urbed+0.070
amb+0.069
Neg logit contributions
uch-0.034
aches-0.028
ons-0.027
ade-0.026
 done-0.026
Token wee
Token ID43671
Feature activation-8.94e-02

Pos logit contributions
arted+0.058
 antics+0.058
iously+0.057
urbed+0.056
istent+0.055
Neg logit contributions
uch-0.055
str-0.053
lly-0.051
aches-0.050
asis-0.050
Token sw
Token ID1509
Feature activation-1.16e-01

Pos logit contributions
 antics+0.132
arted+0.130
istent+0.123
rily+0.123
iously+0.122
Neg logit contributions
uch-0.144
asis-0.134
aches-0.132
ats-0.131
 am-0.131
Tokenan
Token ID272
Feature activation-3.42e-02

Pos logit contributions
 Can+0.216
 infant+0.211
 personality+0.202
 antics+0.202
 Even+0.190
Neg logit contributions
uch-0.173
aches-0.159
ats-0.155
at-0.155
ade-0.152
Token who
Token ID508
Feature activation-3.41e-03

Pos logit contributions
 antics+0.055
 infant+0.055
 personality+0.053
amb+0.051
 Can+0.050
Neg logit contributions
uch-0.056
aches-0.052
ade-0.048
ats-0.047
at-0.047
Token was
Token ID373
Feature activation-3.21e-02

Pos logit contributions
 antics+0.006
 infant+0.006
 personality+0.006
 Can+0.005
amb+0.005
Neg logit contributions
uch-0.005
aches-0.005
ade-0.004
at-0.004
ats-0.004
Token very
Token ID845
Feature activation-3.68e-02

Pos logit contributions
 antics+0.058
 infant+0.054
 personality+0.054
 Can+0.052
rily+0.052
Neg logit contributions
uch-0.048
aches-0.044
 done-0.041
at-0.040
ats-0.040
Token eager
Token ID11069
Feature activation-3.01e-02

Pos logit contributions
 antics+0.056
 infant+0.053
 personality+0.051
 lear+0.049
 Can+0.049
Neg logit contributions
uch-0.066
aches-0.063
 done-0.057
ats-0.056
at-0.056
Token.
Token ID13
Feature activation-4.38e-02

Pos logit contributions
 antics+0.083
 infant+0.081
 personality+0.079
 Can+0.076
urbed+0.074
Neg logit contributions
uch-0.055
aches-0.052
ade-0.048
ats-0.048
at-0.047
Token The
Token ID383
Feature activation-1.59e-02

Pos logit contributions
heart+0.066
 antics+0.065
rier+0.065
urbed+0.062
itive+0.062
Neg logit contributions
fire-0.057
 soda-0.056
 last-0.056
aster-0.052
-0.051
Token girls
Token ID4813
Feature activation-1.35e-02

Pos logit contributions
 antics+0.026
 infant+0.026
 personality+0.025
 Can+0.024
amb+0.024
Neg logit contributions
uch-0.025
aches-0.024
ade-0.022
at-0.021
ats-0.021
Token found
Token ID1043
Feature activation+8.96e-03

Pos logit contributions
 antics+0.024
 infant+0.023
 personality+0.023
 Can+0.022
rily+0.021
Neg logit contributions
uch-0.019
aches-0.019
ats-0.017
ade-0.017
at-0.017
Token a
Token ID257
Feature activation-7.34e-03

Pos logit contributions
uch+0.017
aches+0.015
asis+0.014
at+0.014
ade+0.013
Neg logit contributions
 infant-0.014
 antics-0.013
 daughter-0.013
 personality-0.012
 loudly-0.012
Token sw
Token ID1509
Feature activation-1.15e-01

Pos logit contributions
 antics+0.013
 infant+0.013
 personality+0.013
 Can+0.012
amb+0.012
Neg logit contributions
uch-0.011
aches-0.010
at-0.009
ats-0.009
ons-0.009
Tokenan
Token ID272
Feature activation-2.51e-02

Pos logit contributions
 infant+0.211
 Can+0.207
 antics+0.199
 personality+0.195
 lear+0.186
Neg logit contributions
uch-0.169
aches-0.159
ats-0.155
ade-0.152
at-0.150
Token by
Token ID416
Feature activation+1.62e-02

Pos logit contributions
 antics+0.042
 infant+0.041
 personality+0.041
 Can+0.038
amb+0.038
Neg logit contributions
uch-0.039
aches-0.037
ats-0.034
ade-0.034
at-0.034
Token the
Token ID262
Feature activation+2.06e-03

Pos logit contributions
uch+0.022
aches+0.019
at+0.018
ade+0.018
ats+0.017
Neg logit contributions
 antics-0.032
 infant-0.031
 personality-0.031
 Can-0.029
amb-0.029
Token river
Token ID7850
Feature activation+1.04e-03

Pos logit contributions
uch+0.003
aches+0.003
ade+0.003
at+0.003
ats+0.003
Neg logit contributions
 antics-0.003
 infant-0.003
 personality-0.003
 Can-0.003
amb-0.003
Token swimming
Token ID14899
Feature activation-3.44e-02

Pos logit contributions
uch+0.002
aches+0.002
at+0.002
w+0.002
ats+0.002
Neg logit contributions
 antics-0.002
 infant-0.002
 personality-0.001
 Can-0.001
 loudly-0.001
Token Suddenly
Token ID24975
Feature activation+8.06e-03

Pos logit contributions
rily+0.046
rier+0.045
 antics+0.043
amb+0.043
 infant+0.040
Neg logit contributions
ons-0.070
uch-0.068
 am-0.068
 done-0.068
 last-0.067
Token,
Token ID11
Feature activation-2.80e-04

Pos logit contributions
uch+0.012
aches+0.011
ats+0.010
ons+0.010
ade+0.010
Neg logit contributions
 antics-0.014
 infant-0.014
 personality-0.013
amb-0.013
 Can-0.013
Token a
Token ID257
Feature activation-2.24e-02

Pos logit contributions
 antics+0.000
 infant+0.000
 personality+0.000
 Can+0.000
 lear+0.000
Neg logit contributions
uch-0.000
aches-0.000
at-0.000
ade-0.000
ats-0.000
Token big
Token ID1263
Feature activation-5.17e-02

Pos logit contributions
 antics+0.034
 infant+0.034
 personality+0.033
 Can+0.031
amb+0.030
Neg logit contributions
uch-0.038
aches-0.036
at-0.033
ats-0.033
ade-0.033
Token fish
Token ID5916
Feature activation-9.19e-03

Pos logit contributions
 antics+0.092
amb+0.089
 personality+0.085
urbed+0.084
rily+0.084
Neg logit contributions
uch-0.072
aches-0.067
asis-0.065
ade-0.064
 am-0.063
Token sw
Token ID1509
Feature activation-1.15e-01

Pos logit contributions
 antics+0.013
 infant+0.012
 Can+0.012
amb+0.012
 personality+0.012
Neg logit contributions
aches-0.016
uch-0.016
ats-0.014
at-0.014
ade-0.014
Tokenam
Token ID321
Feature activation-5.47e-02

Pos logit contributions
 infant+0.216
 Can+0.207
 antics+0.195
 personality+0.195
 lear+0.190
Neg logit contributions
uch-0.175
ats-0.164
aches-0.162
at-0.157
ade-0.153
Token over
Token ID625
Feature activation+2.28e-02

Pos logit contributions
 antics+0.095
 infant+0.093
 personality+0.092
 Can+0.086
amb+0.084
Neg logit contributions
uch-0.083
aches-0.076
ade-0.072
at-0.071
ats-0.071
Token to
Token ID284
Feature activation+1.55e-02

Pos logit contributions
uch+0.028
aches+0.028
ons+0.026
ade+0.026
ats+0.025
Neg logit contributions
 antics-0.043
 personality-0.042
 infant-0.042
amb-0.040
istent-0.039
Token her
Token ID607
Feature activation-1.75e-02

Pos logit contributions
uch+0.030
aches+0.027
ade+0.026
ats+0.026
at+0.026
Neg logit contributions
 antics-0.022
 infant-0.022
 personality-0.020
 Can-0.020
 lear-0.019
Token.
Token ID13
Feature activation-3.35e-02

Pos logit contributions
 antics+0.028
 infant+0.028
 personality+0.027
amb+0.025
 Can+0.025
Neg logit contributions
uch-0.030
aches-0.028
at-0.025
ats-0.025
ons-0.025
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
 infant+0.004
 antics+0.004
 personality+0.004
 Can+0.004
 Even+0.003
Neg logit contributions
uch-0.006
aches-0.005
at-0.005
ats-0.005
ade-0.005
Token there
Token ID612
Feature activation+4.70e-02

Pos logit contributions
uch+0.021
aches+0.020
ats+0.018
at+0.018
ade+0.018
Neg logit contributions
 infant-0.032
 Can-0.030
 antics-0.030
 personality-0.030
 loudly-0.028
Token was
Token ID373
Feature activation-3.72e-02

Pos logit contributions
aches+0.082
uch+0.081
at+0.075
ats+0.074
asis+0.071
Neg logit contributions
 It-0.068
 infant-0.066
 Even-0.066
 personality-0.065
 antics-0.064
Token a
Token ID257
Feature activation-3.74e-02

Pos logit contributions
 antics+0.083
 personality+0.079
istent+0.077
urbed+0.076
 infant+0.076
Neg logit contributions
uch-0.038
aches-0.032
ade-0.030
ons-0.030
 done-0.030
Token little
Token ID1310
Feature activation-7.32e-02

Pos logit contributions
arted+0.063
iously+0.062
 antics+0.062
urbed+0.061
istent+0.059
Neg logit contributions
uch-0.057
str-0.054
lly-0.052
aches-0.051
asis-0.051
Token sw
Token ID1509
Feature activation-1.14e-01

Pos logit contributions
 antics+0.121
iously+0.117
urbed+0.115
arted+0.114
istent+0.113
Neg logit contributions
uch-0.116
 cream-0.108
ade-0.107
 last-0.105
asis-0.105
Tokenan
Token ID272
Feature activation-3.62e-02

Pos logit contributions
 Can+0.216
 infant+0.212
 antics+0.208
 personality+0.205
istent+0.193
Neg logit contributions
uch-0.165
aches-0.154
at-0.149
ade-0.148
ats-0.147
Token who
Token ID508
Feature activation-5.95e-03

Pos logit contributions
 infant+0.057
 antics+0.057
 personality+0.055
amb+0.053
 Can+0.053
Neg logit contributions
uch-0.060
aches-0.056
ade-0.051
ats-0.051
at-0.051
Token was
Token ID373
Feature activation-3.52e-02

Pos logit contributions
 personality+0.010
 antics+0.010
 infant+0.010
 Can+0.009
amb+0.009
Neg logit contributions
uch-0.009
aches-0.009
ade-0.008
ats-0.008
at-0.008
Token different
Token ID1180
Feature activation-1.17e-02

Pos logit contributions
 antics+0.065
 infant+0.060
 personality+0.060
 Can+0.059
amb+0.058
Neg logit contributions
uch-0.051
aches-0.048
 done-0.043
 am-0.042
at-0.042
Token from
Token ID422
Feature activation-4.42e-03

Pos logit contributions
 antics+0.026
istent+0.026
 personality+0.025
 Whoever+0.025
urbed+0.024
Neg logit contributions
aches-0.009
uch-0.009
 am-0.008
ade-0.008
asis-0.008
Token p
Token ID279
Feature activation-5.78e-02

Pos logit contributions
 antics+0.009
 infant+0.009
 personality+0.009
 loudly+0.008
 lear+0.008
Neg logit contributions
uch-0.010
ade-0.009
ats-0.009
at-0.009
ons-0.008
Tokenounce
Token ID8652
Feature activation-1.09e-02

Pos logit contributions
 antics+0.088
 personality+0.083
 infant+0.083
amb+0.080
arted+0.079
Neg logit contributions
uch-0.098
aches-0.091
ons-0.084
at-0.083
ade-0.083
Token,
Token ID11
Feature activation-1.91e-02

Pos logit contributions
 antics+0.018
 infant+0.018
 personality+0.017
 Can+0.017
 loudly+0.017
Neg logit contributions
uch-0.017
aches-0.016
asis-0.014
at-0.014
ats-0.014
Token a
Token ID257
Feature activation-2.31e-02

Pos logit contributions
 antics+0.030
 infant+0.029
 personality+0.028
amb+0.027
 Can+0.026
Neg logit contributions
uch-0.031
aches-0.030
ats-0.027
ade-0.027
ons-0.027
Token graceful
Token ID44363
Feature activation-6.58e-02

Pos logit contributions
 antics+0.035
 infant+0.034
 personality+0.034
 Can+0.032
amb+0.031
Neg logit contributions
uch-0.040
aches-0.037
at-0.034
ade-0.034
ats-0.034
Token sw
Token ID1509
Feature activation-1.14e-01

Pos logit contributions
 antics+0.094
amb+0.093
istent+0.092
rily+0.091
 infant+0.089
Neg logit contributions
uch-0.104
aches-0.100
ade-0.097
asis-0.095
 am-0.094
Tokenan
Token ID272
Feature activation-2.95e-02

Pos logit contributions
 infant+0.221
 Can+0.219
 personality+0.201
 antics+0.201
 Even+0.194
Neg logit contributions
uch-0.160
aches-0.151
ats-0.148
ade-0.145
at-0.144
Token named
Token ID3706
Feature activation-3.10e-02

Pos logit contributions
 antics+0.049
 infant+0.048
 personality+0.047
 Can+0.045
amb+0.045
Neg logit contributions
uch-0.047
aches-0.043
ade-0.040
at-0.039
ats-0.039
Token Lily
Token ID20037
Feature activation+5.39e-04

Pos logit contributions
 antics+0.039
 infant+0.038
 personality+0.037
 Can+0.033
amb+0.033
Neg logit contributions
uch-0.061
aches-0.058
ats-0.055
at-0.054
ade-0.054
Token flew
Token ID13112
Feature activation-4.46e-02

Pos logit contributions
uch+0.001
aches+0.001
ade+0.001
at+0.001
ats+0.001
Neg logit contributions
 antics-0.001
 infant-0.001
 personality-0.001
 Can-0.001
amb-0.001
Token down
Token ID866
Feature activation+5.42e-03

Pos logit contributions
 antics+0.084
 infant+0.082
 personality+0.080
 Can+0.076
amb+0.075
Neg logit contributions
uch-0.061
aches-0.057
ade-0.051
ats-0.051
at-0.051
Token time
Token ID640
Feature activation+8.46e-04

Pos logit contributions
 antics+0.006
 personality+0.006
 infant+0.006
 loudly+0.005
amb+0.005
Neg logit contributions
uch-0.004
aches-0.004
ons-0.004
asis-0.004
ade-0.004
Token there
Token ID612
Feature activation+4.96e-02

Pos logit contributions
uch+0.002
aches+0.002
at+0.001
ade+0.001
ats+0.001
Neg logit contributions
 infant-0.001
 antics-0.001
 personality-0.001
 Can-0.001
amb-0.001
Token was
Token ID373
Feature activation-3.49e-02

Pos logit contributions
aches+0.089
uch+0.089
at+0.082
ats+0.082
asis+0.076
Neg logit contributions
 It-0.070
 infant-0.067
 Even-0.067
 She-0.066
 Can-0.064
Token a
Token ID257
Feature activation-3.34e-02

Pos logit contributions
 antics+0.078
 personality+0.074
istent+0.072
 Can+0.071
urbed+0.071
Neg logit contributions
uch-0.037
aches-0.031
ons-0.029
at-0.029
ade-0.029
Token fit
Token ID4197
Feature activation-7.15e-02

Pos logit contributions
arted+0.054
 antics+0.053
iously+0.053
urbed+0.052
istent+0.050
Neg logit contributions
uch-0.053
str-0.051
lly-0.049
asis-0.047
aches-0.047
Token sw
Token ID1509
Feature activation-1.14e-01

Pos logit contributions
 antics+0.112
 personality+0.102
istent+0.100
 Can+0.100
 infant+0.099
Neg logit contributions
uch-0.120
aches-0.113
ade-0.111
at-0.107
ats-0.106
Tokenan
Token ID272
Feature activation-3.29e-02

Pos logit contributions
 Can+0.209
 infant+0.208
 antics+0.199
 personality+0.198
 Even+0.187
Neg logit contributions
uch-0.166
aches-0.154
ade-0.151
ats-0.150
at-0.149
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
 antics+0.066
 infant+0.064
 personality+0.063
 Can+0.061
istent+0.059
Neg logit contributions
uch-0.040
aches-0.037
at-0.033
ats-0.033
ade-0.033
Token She
Token ID1375
Feature activation-1.69e-02

Pos logit contributions
 antics+0.052
urbed+0.049
rier+0.048
 lear+0.048
iously+0.047
Neg logit contributions
 am-0.043
uch-0.042
aster-0.039
-0.038
he-0.038
Token lived
Token ID5615
Feature activation-1.94e-02

Pos logit contributions
 personality+0.028
 antics+0.028
 infant+0.027
 Can+0.026
amb+0.026
Neg logit contributions
uch-0.026
aches-0.024
at-0.022
ats-0.022
ade-0.022
Token near
Token ID1474
Feature activation+1.44e-02

Pos logit contributions
 infant+0.036
 antics+0.036
 personality+0.035
amb+0.034
 Can+0.033
Neg logit contributions
uch-0.027
aches-0.024
ats-0.022
at-0.022
ade-0.022
TokenSuddenly
Token ID38582
Feature activation+1.54e-02

Pos logit contributions
uch+0.055
aches+0.051
ade+0.046
ats+0.045
at+0.045
Neg logit contributions
 antics-0.073
 infant-0.071
 personality-0.069
 Can-0.066
amb-0.065
Token,
Token ID11
Feature activation+1.05e-03

Pos logit contributions
uch+0.021
aches+0.020
ons+0.018
ade+0.018
ats+0.017
Neg logit contributions
 antics-0.030
 infant-0.028
 personality-0.028
amb-0.027
 Can-0.026
Token a
Token ID257
Feature activation-2.10e-02

Pos logit contributions
uch+0.002
aches+0.001
at+0.001
ats+0.001
ade+0.001
Neg logit contributions
 antics-0.002
 infant-0.002
 personality-0.002
 Can-0.002
amb-0.002
Token friendly
Token ID8030
Feature activation-6.49e-02

Pos logit contributions
 antics+0.033
 infant+0.032
 personality+0.031
 Can+0.030
amb+0.029
Neg logit contributions
uch-0.036
aches-0.033
ade-0.031
at-0.031
ats-0.031
Token dolphin
Token ID44136
Feature activation-2.27e-02

Pos logit contributions
 antics+0.100
urbed+0.097
istent+0.095
amb+0.094
rily+0.092
Neg logit contributions
ade-0.092
uch-0.091
aches-0.091
 am-0.089
asis-0.088
Token sw
Token ID1509
Feature activation-1.14e-01

Pos logit contributions
 antics+0.041
 infant+0.040
 personality+0.039
 Can+0.037
amb+0.037
Neg logit contributions
uch-0.033
aches-0.031
ats-0.028
at-0.028
ade-0.028
Tokenam
Token ID321
Feature activation-5.08e-02

Pos logit contributions
 infant+0.214
 Can+0.208
 personality+0.193
 antics+0.192
 lear+0.190
Neg logit contributions
uch-0.172
ats-0.158
aches-0.154
at-0.150
ade-0.146
Token close
Token ID1969
Feature activation-6.97e-03

Pos logit contributions
 antics+0.094
 personality+0.091
 infant+0.091
 Can+0.084
istent+0.083
Neg logit contributions
uch-0.069
aches-0.064
ade-0.061
ats-0.061
at-0.060
Token to
Token ID284
Feature activation+2.19e-02

Pos logit contributions
 antics+0.014
 infant+0.013
 personality+0.013
 Can+0.012
istent+0.012
Neg logit contributions
uch-0.008
aches-0.008
 done-0.007
ade-0.007
ats-0.007
Token her
Token ID607
Feature activation-7.09e-03

Pos logit contributions
uch+0.045
aches+0.040
ade+0.039
ats+0.039
at+0.038
Neg logit contributions
 infant-0.029
 antics-0.029
 personality-0.027
 Can-0.026
 daughter-0.025
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
 antics+0.012
 infant+0.012
 personality+0.012
amb+0.011
 Can+0.011
Neg logit contributions
uch-0.011
aches-0.011
at-0.010
ons-0.010
ade-0.010
Token a
Token ID257
Feature activation+6.89e-04

Pos logit contributions
uch+0.069
aches+0.064
ats+0.059
at+0.058
ade+0.058
Neg logit contributions
 antics-0.084
 infant-0.083
 personality-0.080
 Can-0.077
amb-0.075
Token time
Token ID640
Feature activation-1.08e-03

Pos logit contributions
aches+0.001
uch+0.001
ons+0.001
asis+0.001
at+0.001
Neg logit contributions
 antics-0.001
 personality-0.001
 infant-0.001
 loudly-0.001
 Even-0.001
Token there
Token ID612
Feature activation+5.00e-02

Pos logit contributions
 infant+0.001
 antics+0.001
 personality+0.001
 Can+0.001
amb+0.001
Neg logit contributions
uch-0.002
aches-0.002
at-0.002
ats-0.002
ade-0.002
Token was
Token ID373
Feature activation-3.44e-02

Pos logit contributions
aches+0.088
uch+0.088
ats+0.082
at+0.080
asis+0.076
Neg logit contributions
 It-0.071
 infant-0.071
 She-0.069
 Even-0.068
 Can-0.067
Token a
Token ID257
Feature activation-3.24e-02

Pos logit contributions
 antics+0.075
 personality+0.071
istent+0.069
 infant+0.069
urbed+0.069
Neg logit contributions
uch-0.038
aches-0.032
ons-0.031
ade-0.030
at-0.030
Token sw
Token ID1509
Feature activation-1.13e-01

Pos logit contributions
arted+0.053
iously+0.052
 antics+0.052
urbed+0.052
istent+0.050
Neg logit contributions
uch-0.050
str-0.049
lly-0.047
aches-0.046
 else-0.046
Tokenan
Token ID272
Feature activation-3.10e-02

Pos logit contributions
 Can+0.207
 infant+0.204
 antics+0.201
 personality+0.198
urbed+0.185
Neg logit contributions
uch-0.167
aches-0.156
at-0.151
ade-0.150
ats-0.149
Token.
Token ID13
Feature activation-3.27e-02

Pos logit contributions
 antics+0.052
 infant+0.052
 personality+0.050
amb+0.048
 Can+0.048
Neg logit contributions
uch-0.049
aches-0.045
ats-0.041
ade-0.041
at-0.041
Token His
Token ID2399
Feature activation-3.78e-02

Pos logit contributions
 antics+0.056
urbed+0.052
rier+0.052
 lear+0.051
iously+0.050
Neg logit contributions
 am-0.045
uch-0.043
aster-0.041
-0.040
he-0.040
Token name
Token ID1438
Feature activation+2.08e-02

Pos logit contributions
 antics+0.050
 infant+0.050
 personality+0.047
 Can+0.046
urbed+0.045
Neg logit contributions
uch-0.072
aches-0.068
ade-0.063
ats-0.063
at-0.063
Token was
Token ID373
Feature activation-2.54e-02

Pos logit contributions
uch+0.032
aches+0.031
ats+0.029
at+0.027
asis+0.026
Neg logit contributions
 infant-0.034
 Can-0.032
 It-0.032
 personality-0.032
 antics-0.032
Token,
Token ID11
Feature activation+1.63e-02

Pos logit contributions
 antics+0.002
 infant+0.002
 personality+0.002
 Can+0.002
amb+0.002
Neg logit contributions
uch-0.002
aches-0.002
at-0.002
ats-0.002
ade-0.002
Token there
Token ID612
Feature activation+4.84e-02

Pos logit contributions
uch+0.020
aches+0.018
ats+0.016
at+0.016
ade+0.016
Neg logit contributions
 infant-0.033
 antics-0.032
 personality-0.032
 Can-0.032
amb-0.030
Token was
Token ID373
Feature activation-3.14e-02

Pos logit contributions
uch+0.080
aches+0.079
at+0.073
ats+0.072
asis+0.069
Neg logit contributions
 It-0.074
 infant-0.074
 Even-0.073
 antics-0.071
 personality-0.071
Token a
Token ID257
Feature activation-3.46e-02

Pos logit contributions
 antics+0.070
 personality+0.066
istent+0.064
amb+0.064
 infant+0.064
Neg logit contributions
uch-0.033
aches-0.028
ons-0.027
ade-0.026
 done-0.026
Token graceful
Token ID44363
Feature activation-8.56e-02

Pos logit contributions
 antics+0.055
arted+0.055
iously+0.054
urbed+0.053
amb+0.052
Neg logit contributions
uch-0.055
str-0.052
aches-0.051
lly-0.050
asis-0.050
Token sw
Token ID1509
Feature activation-1.13e-01

Pos logit contributions
istent+0.142
arted+0.141
iously+0.138
 antics+0.136
urbed+0.135
Neg logit contributions
ade-0.116
asis-0.116
uch-0.113
str-0.113
 rest-0.109
Tokenan
Token ID272
Feature activation-3.38e-02

Pos logit contributions
 infant+0.205
 Can+0.204
 antics+0.199
 personality+0.197
istent+0.184
Neg logit contributions
uch-0.166
aches-0.154
ade-0.149
at-0.149
ats-0.147
Token who
Token ID508
Feature activation-2.75e-03

Pos logit contributions
 antics+0.058
 infant+0.057
 personality+0.056
amb+0.053
 Can+0.053
Neg logit contributions
uch-0.052
aches-0.048
ade-0.044
ats-0.044
at-0.044
Token liked
Token ID8288
Feature activation-2.28e-03

Pos logit contributions
 antics+0.005
 infant+0.005
 personality+0.005
 Can+0.005
amb+0.005
Neg logit contributions
uch-0.004
aches-0.004
at-0.003
ats-0.003
ade-0.003
Token to
Token ID284
Feature activation-7.60e-03

Pos logit contributions
 antics+0.005
 infant+0.004
 personality+0.004
istent+0.004
rily+0.004
Neg logit contributions
uch-0.003
aches-0.003
ade-0.002
ons-0.002
 done-0.002
Token spend
Token ID4341
Feature activation-8.30e-03

Pos logit contributions
 antics+0.012
 personality+0.011
 infant+0.011
amb+0.010
 Can+0.010
Neg logit contributions
uch-0.013
aches-0.012
ats-0.012
at-0.011
ons-0.011
Token was
Token ID373
Feature activation-1.59e-02

Pos logit contributions
uch+0.015
aches+0.014
at+0.013
ats+0.013
ade+0.013
Neg logit contributions
 antics-0.021
 infant-0.021
 personality-0.020
 Can-0.019
 loudly-0.019
Token still
Token ID991
Feature activation-7.59e-03

Pos logit contributions
 antics+0.023
 infant+0.023
 personality+0.022
 Can+0.021
amb+0.020
Neg logit contributions
uch-0.028
aches-0.026
ats-0.025
ade-0.025
at-0.025
Token there
Token ID612
Feature activation+4.75e-02

Pos logit contributions
 antics+0.011
 infant+0.011
 personality+0.010
amb+0.010
 Can+0.010
Neg logit contributions
uch-0.014
aches-0.013
 done-0.012
at-0.012
ade-0.012
Token,
Token ID11
Feature activation-2.14e-02

Pos logit contributions
uch+0.086
aches+0.080
at+0.075
ats+0.074
asis+0.073
Neg logit contributions
 antics-0.072
 infant-0.072
 personality-0.070
 Can-0.067
 lear-0.063
Token waiting
Token ID4953
Feature activation-2.71e-02

Pos logit contributions
 antics+0.028
 infant+0.026
amb+0.025
 personality+0.025
rily+0.024
Neg logit contributions
uch-0.040
aches-0.038
 done-0.037
ons-0.035
ade-0.035
Token for
Token ID329
Feature activation+4.84e-02

Pos logit contributions
 infant+0.034
 personality+0.033
 antics+0.033
 Can+0.031
 daughter+0.031
Neg logit contributions
uch-0.056
aches-0.049
at-0.048
ats-0.048
asis-0.047
Token her
Token ID607
Feature activation+7.27e-03

Pos logit contributions
uch+0.096
aches+0.086
at+0.083
asis+0.082
ats+0.081
Neg logit contributions
 antics-0.066
 infant-0.064
 personality-0.062
 Can-0.057
 daughter-0.056
Token.
Token ID13
Feature activation-2.35e-02

Pos logit contributions
uch+0.013
aches+0.012
at+0.012
ats+0.011
asis+0.011
Neg logit contributions
 antics-0.012
 infant-0.012
 personality-0.011
 daughter-0.010
 Can-0.010
Token She
Token ID1375
Feature activation+4.65e-03

Pos logit contributions
 antics+0.025
rily+0.023
 adventurers+0.023
 alien+0.023
urbed+0.023
Neg logit contributions
uch-0.046
 done-0.045
 am-0.045
ade-0.045
he-0.044
Token gave
Token ID2921
Feature activation-1.11e-02

Pos logit contributions
uch+0.006
aches+0.006
ade+0.005
ats+0.005
at+0.005
Neg logit contributions
 antics-0.009
 infant-0.009
 personality-0.008
 Can-0.008
amb-0.008
Token him
Token ID683
Feature activation+3.36e-02

Pos logit contributions
 infant+0.015
 antics+0.015
 personality+0.014
 Can+0.013
amb+0.013
Neg logit contributions
uch-0.021
aches-0.020
asis-0.019
at-0.018
ade-0.018
Token I
Token ID314
Feature activation+2.21e-02

Pos logit contributions
uch+0.011
aches+0.011
ade+0.011
ats+0.011
at+0.011
Neg logit contributions
 infant-0.005
 Can-0.005
 antics-0.005
 personality-0.004
 Please-0.004
Token watch
Token ID2342
Feature activation-1.57e-02

Pos logit contributions
uch+0.034
aches+0.032
ons+0.030
asis+0.029
ade+0.029
Neg logit contributions
 antics-0.036
 infant-0.036
 personality-0.036
 Can-0.034
amb-0.034
Token a
Token ID257
Feature activation+5.55e-03

Pos logit contributions
 antics+0.028
 infant+0.028
 personality+0.027
 Can+0.026
amb+0.025
Neg logit contributions
uch-0.023
aches-0.021
ade-0.020
ats-0.020
at-0.019
Token video
Token ID2008
Feature activation+2.08e-02

Pos logit contributions
uch+0.008
aches+0.007
ade+0.007
ats+0.007
at+0.007
Neg logit contributions
 antics-0.010
 infant-0.010
 personality-0.009
amb-0.009
 Can-0.009
Token?"
Token ID1701
Feature activation-1.36e-02

Pos logit contributions
uch+0.034
aches+0.033
ats+0.030
asis+0.029
str+0.029
Neg logit contributions
 infant-0.032
 antics-0.032
 Can-0.032
 personality-0.031
 loudly-0.030
Token
Token ID198
Feature activation+5.24e-02

Pos logit contributions
 antics+0.024
 infant+0.023
 personality+0.022
amb+0.021
 lear+0.021
Neg logit contributions
uch-0.020
aches-0.019
ade-0.017
ons-0.017
at-0.017
Token
Token ID198
Feature activation+3.71e-02

Pos logit contributions
uch+0.088
aches+0.085
at+0.077
asis+0.077
ats+0.075
Neg logit contributions
 Even-0.081
 Can-0.080
 infant-0.078
 antics-0.078
amb-0.077
TokenThe
Token ID464
Feature activation-7.54e-03

Pos logit contributions
uch+0.055
aches+0.050
ade+0.046
at+0.045
ats+0.045
Neg logit contributions
 antics-0.067
 infant-0.066
 personality-0.064
 Can-0.060
amb-0.060
Token computer
Token ID3644
Feature activation+1.67e-02

Pos logit contributions
 antics+0.010
 infant+0.010
 personality+0.010
amb+0.009
 Can+0.009
Neg logit contributions
uch-0.014
aches-0.013
ade-0.013
ats-0.013
at-0.013
Token screen
Token ID3159
Feature activation+1.62e-02

Pos logit contributions
uch+0.027
aches+0.025
ats+0.023
at+0.023
ade+0.023
Neg logit contributions
 antics-0.027
 infant-0.027
 personality-0.026
 Can-0.025
 lear-0.024
Token suddenly
Token ID6451
Feature activation-8.81e-03

Pos logit contributions
uch+0.027
aches+0.025
ats+0.023
at+0.023
asis+0.023
Neg logit contributions
 antics-0.026
 infant-0.025
 personality-0.025
 loudly-0.024
 Can-0.024
Token into
Token ID656
Feature activation+2.13e-02

Pos logit contributions
 antics+0.056
 infant+0.055
 personality+0.054
 Can+0.051
amb+0.051
Neg logit contributions
uch-0.040
aches-0.037
at-0.033
ats-0.033
ade-0.033
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
uch+0.031
aches+0.028
at+0.026
ats+0.026
ade+0.025
Neg logit contributions
 infant-0.040
 antics-0.040
 personality-0.038
 Can-0.037
urbed-0.036
Token night
Token ID1755
Feature activation-2.14e-03

Pos logit contributions
uch+0.019
aches+0.019
ats+0.018
ade+0.017
at+0.017
Neg logit contributions
 infant-0.017
 antics-0.016
 personality-0.016
 Can-0.015
urbed-0.014
Token sky
Token ID6766
Feature activation+3.05e-02

Pos logit contributions
 infant+0.003
 antics+0.003
 personality+0.003
 Can+0.003
 Even+0.003
Neg logit contributions
uch-0.004
aches-0.004
ats-0.003
at-0.003
w-0.003
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
uch+0.044
aches+0.040
ats+0.036
at+0.035
ade+0.035
Neg logit contributions
 antics-0.057
 infant-0.055
 personality-0.053
 Can-0.051
rily-0.051
Token
Token ID198
Feature activation+5.82e-02

Pos logit contributions
 antics+0.035
 infant+0.033
 personality+0.032
amb+0.031
 lear+0.030
Neg logit contributions
uch-0.029
aches-0.027
 done-0.026
ade-0.025
 am-0.025
Token
Token ID198
Feature activation+4.33e-02

Pos logit contributions
uch+0.107
aches+0.096
ats+0.088
w+0.087
at+0.087
Neg logit contributions
 Even-0.092
 It-0.089
 She-0.087
 Can-0.086
 Its-0.084
TokenM
Token ID44
Feature activation-1.52e-02

Pos logit contributions
uch+0.070
aches+0.065
ats+0.060
at+0.059
asis+0.058
Neg logit contributions
 antics-0.070
 infant-0.069
 personality-0.068
 Can-0.067
 Even-0.064
Tokenolly
Token ID5098
Feature activation-2.55e-02

Pos logit contributions
 antics+0.025
 infant+0.025
 personality+0.024
 Can+0.023
arted+0.022
Neg logit contributions
uch-0.025
aches-0.023
at-0.022
ade-0.021
ats-0.021
Token waved
Token ID26834
Feature activation-6.11e-02

Pos logit contributions
 antics+0.049
 infant+0.048
 personality+0.046
 Can+0.045
amb+0.044
Neg logit contributions
uch-0.034
aches-0.031
at-0.028
ats-0.028
 done-0.028
Token goodbye
Token ID24829
Feature activation-3.91e-02

Pos logit contributions
 antics+0.086
 infant+0.083
istent+0.082
 personality+0.079
 Can+0.074
Neg logit contributions
aches-0.093
uch-0.090
 last-0.089
 done-0.089
ade-0.086
Token they
Token ID484
Feature activation+1.27e-02

Pos logit contributions
 antics+0.049
 infant+0.049
amb+0.045
 personality+0.045
urbed+0.045
Neg logit contributions
uch-0.049
aches-0.048
 done-0.044
 am-0.044
ade-0.043
Token make
Token ID787
Feature activation-5.11e-04

Pos logit contributions
uch+0.019
aches+0.016
ade+0.016
ats+0.015
ons+0.015
Neg logit contributions
 antics-0.023
 infant-0.022
 personality-0.022
 lear-0.021
amb-0.020
Token her
Token ID607
Feature activation-7.19e-03

Pos logit contributions
 antics+0.001
 infant+0.001
 personality+0.001
 Can+0.001
amb+0.001
Neg logit contributions
aches-0.001
uch-0.001
ade-0.001
ons-0.001
ats-0.001
Token happy
Token ID3772
Feature activation-1.64e-02

Pos logit contributions
 antics+0.010
 infant+0.010
 personality+0.010
 daughter+0.009
 Can+0.008
Neg logit contributions
uch-0.014
aches-0.013
at-0.013
w-0.012
ade-0.012
Token.
Token ID13
Feature activation-2.20e-02

Pos logit contributions
 antics+0.032
 infant+0.031
 personality+0.031
 Can+0.029
amb+0.029
Neg logit contributions
uch-0.022
aches-0.020
ats-0.018
at-0.018
ade-0.018
Token
Token ID198
Feature activation+6.22e-02

Pos logit contributions
 antics+0.045
 infant+0.043
 personality+0.041
rily+0.040
urbed+0.040
Neg logit contributions
uch-0.026
aches-0.023
ade-0.022
 am-0.021
ons-0.021
Token
Token ID198
Feature activation+3.90e-02

Pos logit contributions
uch+0.125
aches+0.117
asis+0.110
at+0.109
kins+0.102
Neg logit contributions
 Even-0.087
 She-0.085
 It-0.082
 Can-0.080
 As-0.079
TokenOne
Token ID3198
Feature activation+4.90e-02

Pos logit contributions
uch+0.043
aches+0.039
ade+0.034
ats+0.033
at+0.033
Neg logit contributions
 antics-0.085
 infant-0.082
 personality-0.081
rily-0.078
amb-0.078
Token day
Token ID1110
Feature activation+1.38e-02

Pos logit contributions
uch+0.073
aches+0.069
ons+0.063
ats+0.062
at+0.059
Neg logit contributions
 infant-0.091
 personality-0.085
 antics-0.082
 Can-0.080
 daughter-0.080
Token,
Token ID11
Feature activation-4.69e-03

Pos logit contributions
uch+0.020
aches+0.018
ons+0.017
ade+0.017
ats+0.016
Neg logit contributions
 antics-0.025
 infant-0.024
 personality-0.024
amb-0.023
rily-0.023
Token Lily
Token ID20037
Feature activation-1.26e-02

Pos logit contributions
 antics+0.006
amb+0.006
 personality+0.006
 infant+0.006
istent+0.006
Neg logit contributions
uch-0.009
aches-0.008
ade-0.008
 done-0.008
ons-0.008
TokenOnce
Token ID7454
Feature activation+7.71e-02

Pos logit contributions
aches+0.173
uch+0.165
ats+0.146
at+0.145
asis+0.143
Neg logit contributions
 She-0.088
 It-0.088
 He-0.083
 Even-0.077
 They-0.076
Token upon
Token ID2402
Feature activation+4.75e-02

Pos logit contributions
uch+0.168
aches+0.158
fire+0.157
asis+0.156
at+0.150
Neg logit contributions
 She-0.085
 infant-0.084
 Even-0.079
 daughter-0.076
 curiously-0.076
Token a
Token ID257
Feature activation+4.30e-04

Pos logit contributions
uch+0.077
aches+0.073
ats+0.068
at+0.067
asis+0.066
Neg logit contributions
 antics-0.077
 infant-0.076
 personality-0.073
 Can-0.071
 loudly-0.068
Token time
Token ID640
Feature activation-2.25e-03

Pos logit contributions
aches+0.001
uch+0.001
ons+0.001
asis+0.001
at+0.001
Neg logit contributions
 antics-0.001
 personality-0.001
 infant-0.001
 loudly-0.001
 Even-0.001
Token,
Token ID11
Feature activation+1.60e-02

Pos logit contributions
 infant+0.003
 antics+0.003
 Can+0.003
 personality+0.003
 Even+0.002
Neg logit contributions
uch-0.004
aches-0.004
at-0.004
asis-0.004
ade-0.004
Token there
Token ID612
Feature activation+4.72e-02

Pos logit contributions
uch+0.023
aches+0.022
ats+0.021
at+0.021
asis+0.020
Neg logit contributions
 infant-0.029
 Can-0.027
 personality-0.026
 loudly-0.026
 She-0.026
Token was
Token ID373
Feature activation-3.68e-02

Pos logit contributions
aches+0.086
uch+0.083
at+0.078
ats+0.077
asis+0.074
Neg logit contributions
 It-0.067
 Even-0.065
 infant-0.064
 She-0.063
 Can-0.061
Token a
Token ID257
Feature activation-3.86e-02

Pos logit contributions
 antics+0.081
 personality+0.077
istent+0.074
 infant+0.074
urbed+0.074
Neg logit contributions
uch-0.040
aches-0.034
ons-0.032
ade-0.032
 done-0.031
Token happy
Token ID3772
Feature activation-8.34e-02

Pos logit contributions
arted+0.063
 antics+0.062
iously+0.062
urbed+0.061
istent+0.059
Neg logit contributions
uch-0.061
str-0.057
aches-0.056
asis-0.054
lly-0.054
Token train
Token ID4512
Feature activation-3.30e-02

Pos logit contributions
istent+0.144
arted+0.141
urbed+0.141
iously+0.140
rily+0.136
Neg logit contributions
 rest-0.112
ade-0.110
uch-0.109
asis-0.108
aster-0.107
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
 infant+0.060
 antics+0.059
 personality+0.057
 Can+0.055
amb+0.055
Neg logit contributions
uch-0.048
aches-0.045
ade-0.040
ats-0.040
at-0.040
Token with
Token ID351
Feature activation-1.60e-03

Pos logit contributions
 antics+0.057
 infant+0.057
 personality+0.055
 Can+0.053
amb+0.052
Neg logit contributions
uch-0.047
aches-0.043
ats-0.039
at-0.039
ade-0.038
Token her
Token ID607
Feature activation+1.85e-03

Pos logit contributions
 antics+0.002
 infant+0.002
 personality+0.002
 Can+0.002
amb+0.002
Neg logit contributions
uch-0.003
aches-0.003
ade-0.003
at-0.003
ats-0.003
Token toys
Token ID14958
Feature activation+9.28e-03

Pos logit contributions
uch+0.004
aches+0.004
at+0.003
asis+0.003
ats+0.003
Neg logit contributions
 infant-0.003
 antics-0.002
 personality-0.002
 daughter-0.002
amb-0.002
Token all
Token ID477
Feature activation+3.61e-02

Pos logit contributions
uch+0.021
aches+0.020
asis+0.019
at+0.019
str+0.019
Neg logit contributions
 infant-0.009
 personality-0.008
 antics-0.008
 She-0.008
 loudly-0.008
Token day
Token ID1110
Feature activation+1.06e-02

Pos logit contributions
uch+0.066
aches+0.056
fire+0.053
asis+0.053
ves+0.051
Neg logit contributions
 infant-0.058
 personality-0.053
ening-0.051
 She-0.051
amb-0.050
Token long
Token ID890
Feature activation+1.19e-02

Pos logit contributions
uch+0.021
aches+0.020
at+0.019
asis+0.018
ats+0.018
Neg logit contributions
 infant-0.013
 antics-0.013
 personality-0.013
amb-0.012
 Can-0.012
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
uch+0.020
aches+0.018
asis+0.017
at+0.017
str+0.016
Neg logit contributions
 infant-0.020
 antics-0.020
 personality-0.019
 Can-0.018
 lear-0.018
Token One
Token ID1881
Feature activation+4.52e-02

Pos logit contributions
 antics+0.026
 infant+0.025
rily+0.023
 personality+0.023
urbed+0.023
Neg logit contributions
uch-0.045
aches-0.043
ade-0.042
 am-0.041
ons-0.041
Token day
Token ID1110
Feature activation+8.32e-03

Pos logit contributions
aches+0.062
uch+0.061
ats+0.054
ons+0.054
asis+0.051
Neg logit contributions
 infant-0.087
 personality-0.082
 antics-0.081
 Can-0.077
 daughter-0.075
Token,
Token ID11
Feature activation-4.09e-03

Pos logit contributions
uch+0.014
aches+0.013
at+0.012
ats+0.012
ade+0.012
Neg logit contributions
 antics-0.013
 infant-0.013
 personality-0.013
 Can-0.012
amb-0.012
Token her
Token ID607
Feature activation-2.58e-02

Pos logit contributions
 antics+0.006
 infant+0.006
 personality+0.006
amb+0.005
 Can+0.005
Neg logit contributions
uch-0.007
aches-0.007
ade-0.006
ons-0.006
ats-0.006
Token white
Token ID2330
Feature activation-3.39e-02

Pos logit contributions
 personality+0.008
 infant+0.007
 antics+0.007
 Can+0.007
 daughter+0.007
Neg logit contributions
uch-0.014
aches-0.013
at-0.013
asis-0.012
ats-0.012
Token dress
Token ID6576
Feature activation-1.92e-02

Pos logit contributions
 antics+0.053
 infant+0.052
 personality+0.052
 Can+0.048
amb+0.046
Neg logit contributions
uch-0.058
aches-0.054
at-0.050
ats-0.050
ade-0.049
Token and
Token ID290
Feature activation-1.64e-02

Pos logit contributions
 antics+0.015
 personality+0.014
 infant+0.014
amb+0.013
 loudly+0.013
Neg logit contributions
uch-0.048
aches-0.045
asis-0.043
at-0.042
 am-0.042
Token it
Token ID340
Feature activation+3.53e-02

Pos logit contributions
 antics+0.026
 infant+0.026
 personality+0.026
 Can+0.024
amb+0.024
Neg logit contributions
uch-0.027
aches-0.025
at-0.023
ade-0.023
ats-0.023
Token fit
Token ID4197
Feature activation-1.92e-02

Pos logit contributions
uch+0.060
aches+0.055
asis+0.053
at+0.053
aster+0.052
Neg logit contributions
 personality-0.051
 antics-0.051
ened-0.050
 infant-0.050
 loudly-0.050
Token perfectly
Token ID7138
Feature activation-5.74e-03

Pos logit contributions
 antics+0.027
 infant+0.027
 personality+0.026
 Can+0.024
rily+0.024
Neg logit contributions
uch-0.036
aches-0.033
asis-0.031
at-0.031
ats-0.030
Token!
Token ID0
Feature activation-1.46e-02

Pos logit contributions
 antics+0.010
 infant+0.010
 personality+0.010
rily+0.009
 widely+0.009
Neg logit contributions
uch-0.009
aches-0.008
ats-0.007
at-0.007
ons-0.007
Token
Token ID198
Feature activation+6.61e-02

Pos logit contributions
 antics+0.027
 infant+0.024
 personality+0.024
 adventurers+0.024
urbed+0.024
Neg logit contributions
uch-0.020
ons-0.018
ade-0.018
aches-0.018
 am-0.017
Token
Token ID198
Feature activation+4.29e-02

Pos logit contributions
uch+0.131
aches+0.119
asis+0.115
at+0.111
ats+0.104
Neg logit contributions
 Even-0.095
 She-0.092
 It-0.090
 As-0.088
 From-0.084
TokenL
Token ID43
Feature activation-1.29e-02

Pos logit contributions
uch+0.062
aches+0.058
ade+0.053
ats+0.051
ons+0.051
Neg logit contributions
 antics-0.078
 infant-0.077
 personality-0.075
 Can-0.071
amb-0.070
Tokenily
Token ID813
Feature activation-1.11e-02

Pos logit contributions
 antics+0.024
 infant+0.023
 personality+0.023
 Can+0.022
amb+0.021
Neg logit contributions
uch-0.019
aches-0.018
ade-0.017
at-0.016
ons-0.016
Token out
Token ID503
Feature activation+2.38e-02

Pos logit contributions
uch+0.067
asis+0.061
aches+0.061
str+0.058
 am+0.057
Neg logit contributions
 infant-0.036
 daughter-0.032
 loudly-0.031
amb-0.030
 antics-0.029
Token of
Token ID286
Feature activation+3.32e-02

Pos logit contributions
uch+0.048
aches+0.042
asis+0.041
at+0.040
ats+0.039
Neg logit contributions
 infant-0.032
 antics-0.031
 personality-0.030
 loudly-0.029
rily-0.029
Token the
Token ID262
Feature activation+1.36e-02

Pos logit contributions
uch+0.068
at+0.057
asis+0.056
aches+0.056
ade+0.054
Neg logit contributions
 infant-0.054
 baby-0.046
 daughter-0.044
 personality-0.044
 Can-0.043
Token oven
Token ID14361
Feature activation+2.38e-02

Pos logit contributions
uch+0.027
aches+0.024
at+0.023
asis+0.023
w+0.023
Neg logit contributions
 infant-0.020
 antics-0.017
 personality-0.016
 Can-0.016
 daughter-0.016
Token and
Token ID290
Feature activation-3.39e-03

Pos logit contributions
uch+0.056
aches+0.052
at+0.048
ats+0.048
w+0.048
Neg logit contributions
 infant-0.023
 antics-0.022
 personality-0.020
 Can-0.020
rily-0.020
Token put
Token ID1234
Feature activation+1.87e-03

Pos logit contributions
 infant+0.005
 daughter+0.005
 Can+0.004
 baby+0.004
 lear+0.004
Neg logit contributions
uch-0.007
asis-0.006
at-0.006
aches-0.006
ade-0.006
Token them
Token ID606
Feature activation+3.22e-02

Pos logit contributions
uch+0.004
asis+0.003
aches+0.003
str+0.003
ade+0.003
Neg logit contributions
 infant-0.002
rily-0.002
 It-0.002
 baby-0.002
 Can-0.002
Token on
Token ID319
Feature activation+2.20e-02

Pos logit contributions
uch+0.057
str+0.051
asis+0.049
 am+0.048
ons+0.047
Neg logit contributions
 infant-0.047
 loudly-0.043
rily-0.043
amb-0.041
 daughter-0.040
Token a
Token ID257
Feature activation+1.29e-02

Pos logit contributions
uch+0.029
aches+0.026
at+0.024
ats+0.024
ade+0.024
Neg logit contributions
 antics-0.043
 infant-0.042
 personality-0.041
 Can-0.040
amb-0.039
Token plate
Token ID7480
Feature activation+1.30e-02

Pos logit contributions
uch+0.024
w+0.020
aches+0.020
at+0.020
ats+0.020
Neg logit contributions
 infant-0.020
 antics-0.019
 personality-0.018
rily-0.018
 Can-0.018
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
uch+0.023
aches+0.020
str+0.019
ats+0.019
asis+0.018
Neg logit contributions
 infant-0.020
 antics-0.019
 loudly-0.019
rily-0.019
 lear-0.019
Tokenink
Token ID676
Feature activation-5.51e-02

Pos logit contributions
 antics+0.160
 infant+0.158
 personality+0.154
 Can+0.150
amb+0.145
Neg logit contributions
uch-0.108
aches-0.099
at-0.089
ats-0.089
ade-0.089
Tokenling
Token ID1359
Feature activation-1.99e-02

Pos logit contributions
 antics+0.082
 infant+0.080
 personality+0.077
amb+0.072
urbed+0.072
Neg logit contributions
uch-0.096
aches-0.091
ats-0.083
ade-0.083
at-0.083
Token,
Token ID11
Feature activation-1.51e-02

Pos logit contributions
 antics+0.040
 infant+0.039
 personality+0.039
amb+0.037
 Can+0.037
Neg logit contributions
uch-0.024
aches-0.022
ade-0.020
at-0.020
ats-0.020
Token a
Token ID257
Feature activation-1.87e-02

Pos logit contributions
 antics+0.023
 infant+0.023
 personality+0.022
amb+0.022
 Can+0.021
Neg logit contributions
uch-0.025
aches-0.024
ade-0.022
 done-0.022
ons-0.022
Token new
Token ID649
Feature activation-2.43e-02

Pos logit contributions
 antics+0.028
 infant+0.027
 personality+0.027
amb+0.026
 Can+0.025
Neg logit contributions
uch-0.032
aches-0.030
ade-0.028
at-0.028
ats-0.028
Token star
Token ID3491
Feature activation+2.20e-03

Pos logit contributions
 antics+0.036
 infant+0.034
 personality+0.033
arted+0.033
amb+0.033
Neg logit contributions
uch-0.040
aches-0.038
ade-0.036
 am-0.035
asis-0.035
Token arrived
Token ID5284
Feature activation-3.75e-02

Pos logit contributions
uch+0.003
aches+0.003
ats+0.003
at+0.003
ade+0.003
Neg logit contributions
 infant-0.004
 antics-0.004
 personality-0.003
 Can-0.003
urbed-0.003
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
 antics+0.077
 infant+0.076
 personality+0.074
 Can+0.071
amb+0.071
Neg logit contributions
uch-0.045
aches-0.041
ade-0.037
ats-0.036
at-0.036
Token It
Token ID632
Feature activation+2.60e-02

Pos logit contributions
 antics+0.040
amb+0.036
 infant+0.035
 personality+0.034
 lear+0.034
Neg logit contributions
-0.040
aster-0.040
uch-0.039
ade-0.039
he-0.038
Token spark
Token ID9009
Feature activation-3.59e-02

Pos logit contributions
uch+0.042
aches+0.039
ats+0.037
at+0.036
ons+0.035
Neg logit contributions
 infant-0.041
 antics-0.041
 loudly-0.040
 personality-0.040
 hesitant-0.039
Tokenled
Token ID992
Feature activation-4.03e-02

Pos logit contributions
 antics+0.063
 infant+0.062
 personality+0.060
amb+0.057
 Can+0.057
Neg logit contributions
uch-0.054
aches-0.049
at-0.045
ats-0.044
asis-0.044
Tokenmy
Token ID1820
Feature activation-1.91e-02

Pos logit contributions
 antics+0.012
 infant+0.011
 personality+0.011
 Can+0.011
rily+0.011
Neg logit contributions
uch-0.007
aches-0.006
 done-0.006
ats-0.005
at-0.005
Token was
Token ID373
Feature activation-3.26e-02

Pos logit contributions
 antics+0.041
 infant+0.040
 personality+0.039
 Can+0.039
rily+0.038
Neg logit contributions
uch-0.021
aches-0.018
 done-0.017
ade-0.017
ons-0.017
Token happy
Token ID3772
Feature activation-1.92e-02

Pos logit contributions
 antics+0.048
 personality+0.045
 infant+0.045
amb+0.044
 Can+0.042
Neg logit contributions
uch-0.058
aches-0.056
 done-0.054
at-0.050
ade-0.050
Token again
Token ID757
Feature activation+2.44e-02

Pos logit contributions
 antics+0.039
 infant+0.038
 personality+0.037
amb+0.036
rily+0.036
Neg logit contributions
uch-0.021
aches-0.020
 done-0.019
ons-0.018
ade-0.018
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
uch+0.050
aches+0.047
at+0.044
ats+0.044
ade+0.044
Neg logit contributions
 antics-0.030
 infant-0.029
 personality-0.028
 Can-0.026
amb-0.025
Token he
Token ID339
Feature activation-3.42e-03

Pos logit contributions
 antics+0.028
 infant+0.027
 personality+0.027
 Can+0.026
amb+0.025
Neg logit contributions
uch-0.020
aches-0.018
ade-0.016
ons-0.016
 done-0.016
Token realized
Token ID6939
Feature activation+5.21e-03

Pos logit contributions
 antics+0.006
 infant+0.006
 personality+0.006
 Can+0.006
amb+0.006
Neg logit contributions
uch-0.005
aches-0.004
ade-0.004
ats-0.004
at-0.004
Token that
Token ID326
Feature activation+2.92e-02

Pos logit contributions
uch+0.007
at+0.006
aches+0.006
ats+0.006
asis+0.005
Neg logit contributions
 infant-0.010
 antics-0.010
 personality-0.010
 Can-0.010
 loudly-0.010
Token sometimes
Token ID3360
Feature activation+3.10e-02

Pos logit contributions
uch+0.055
aches+0.052
at+0.049
ats+0.049
ade+0.048
Neg logit contributions
 antics-0.040
 infant-0.040
 personality-0.038
 Can-0.036
amb-0.035
Token conflicts
Token ID12333
Feature activation+4.34e-02

Pos logit contributions
uch+0.039
aches+0.037
 done+0.036
ons+0.036
 am+0.035
Neg logit contributions
amb-0.061
 antics-0.058
rily-0.056
arted-0.055
 personality-0.055
Token can
Token ID460
Feature activation+1.94e-02

Pos logit contributions
uch+0.064
at+0.060
aches+0.059
aster+0.058
asis+0.058
Neg logit contributions
 antics-0.075
 personality-0.074
 infant-0.073
 Can-0.068
 Even-0.068
Token When
Token ID1649
Feature activation+4.46e-02

Pos logit contributions
 antics+0.029
 infant+0.026
urbed+0.026
ening+0.026
 alien+0.026
Neg logit contributions
-0.043
 am-0.042
ade-0.041
uch-0.041
 done-0.040
Token they
Token ID484
Feature activation+1.30e-02

Pos logit contributions
uch+0.060
aches+0.055
ade+0.049
ons+0.049
ats+0.049
Neg logit contributions
 antics-0.086
 infant-0.084
 personality-0.082
amb-0.079
 Can-0.078
Token got
Token ID1392
Feature activation-1.29e-02

Pos logit contributions
uch+0.024
aches+0.022
fire+0.021
asis+0.021
ats+0.021
Neg logit contributions
 Will-0.016
 daughter-0.016
 She-0.016
 Even-0.016
 While-0.015
Token to
Token ID284
Feature activation+2.25e-02

Pos logit contributions
 infant+0.021
 antics+0.020
 personality+0.019
 Can+0.019
amb+0.018
Neg logit contributions
uch-0.021
aches-0.020
at-0.018
ats-0.018
asis-0.018
Token Lily
Token ID20037
Feature activation-1.02e-02

Pos logit contributions
uch+0.026
aches+0.024
ats+0.021
ade+0.021
at+0.021
Neg logit contributions
 antics-0.047
 infant-0.046
 personality-0.045
 Can-0.043
amb-0.043
Token's
Token ID338
Feature activation-2.53e-02

Pos logit contributions
 antics+0.015
 infant+0.015
 personality+0.015
 Can+0.014
amb+0.014
Neg logit contributions
uch-0.018
aches-0.017
ade-0.015
at-0.015
ats-0.015
Token house
Token ID2156
Feature activation+2.58e-02

Pos logit contributions
 antics+0.039
 infant+0.038
 personality+0.037
 Can+0.035
amb+0.035
Neg logit contributions
uch-0.043
aches-0.040
ade-0.037
ats-0.037
at-0.037
Token,
Token ID11
Feature activation+7.45e-04

Pos logit contributions
uch+0.053
aches+0.049
at+0.048
w+0.046
asis+0.046
Neg logit contributions
 antics-0.031
 infant-0.031
 personality-0.030
 loudly-0.028
 widely-0.028
Token they
Token ID484
Feature activation+5.37e-03

Pos logit contributions
uch+0.001
aches+0.001
ade+0.001
ons+0.001
ats+0.001
Neg logit contributions
 antics-0.001
 infant-0.001
 personality-0.001
amb-0.001
 Can-0.001
Token washed
Token ID18989
Feature activation-2.92e-02

Pos logit contributions
uch+0.009
aches+0.008
ade+0.007
asis+0.007
at+0.007
Neg logit contributions
 infant-0.009
 antics-0.008
 personality-0.008
amb-0.008
 lear-0.008
Token their
Token ID511
Feature activation-1.02e-02

Pos logit contributions
 infant+0.048
 antics+0.047
 personality+0.046
 Can+0.043
amb+0.043
Neg logit contributions
uch-0.049
aches-0.045
at-0.042
asis-0.041
ade-0.041
Token
Token ID198
Feature activation+5.78e-02

Pos logit contributions
 antics+0.027
 infant+0.027
 personality+0.026
 Can+0.024
amb+0.024
Neg logit contributions
uch-0.034
aches-0.032
ade-0.030
ats-0.030
at-0.030
Token
Token ID198
Feature activation+3.48e-02

Pos logit contributions
uch+0.108
aches+0.102
asis+0.095
at+0.093
kins+0.087
Neg logit contributions
 Even-0.083
 As-0.079
 Whoever-0.078
 It-0.077
 She-0.075
TokenAs
Token ID1722
Feature activation+2.19e-02

Pos logit contributions
uch+0.062
aches+0.058
at+0.053
ats+0.053
asis+0.053
Neg logit contributions
 antics-0.051
 infant-0.048
 personality-0.048
 Can-0.047
amb-0.046
Token Sam
Token ID3409
Feature activation-1.16e-02

Pos logit contributions
uch+0.042
aches+0.040
at+0.038
ats+0.036
asis+0.036
Neg logit contributions
 antics-0.028
 infant-0.027
 loudly-0.027
 Can-0.026
 personality-0.026
Token gall
Token ID7976
Feature activation-6.04e-02

Pos logit contributions
 antics+0.020
 infant+0.020
 personality+0.019
amb+0.018
 lear+0.018
Neg logit contributions
uch-0.017
aches-0.016
at-0.014
ade-0.014
ats-0.014
Tokenoped
Token ID19458
Feature activation-3.87e-02

Pos logit contributions
 antics+0.092
amb+0.090
 personality+0.088
 infant+0.088
 lear+0.084
Neg logit contributions
uch-0.099
aches-0.097
asis-0.086
ade-0.085
 done-0.085
Token through
Token ID832
Feature activation+1.89e-02

Pos logit contributions
 antics+0.070
 infant+0.069
 personality+0.067
 Can+0.064
amb+0.063
Neg logit contributions
uch-0.056
aches-0.052
at-0.047
ade-0.047
ats-0.047
Token the
Token ID262
Feature activation+5.34e-03

Pos logit contributions
uch+0.026
aches+0.022
at+0.021
ats+0.020
ade+0.020
Neg logit contributions
 antics-0.037
 infant-0.036
 personality-0.036
 Can-0.034
urbed-0.033
Token garden
Token ID11376
Feature activation+2.14e-02

Pos logit contributions
uch+0.009
aches+0.008
at+0.007
ats+0.007
ade+0.007
Neg logit contributions
 antics-0.009
 infant-0.009
 personality-0.009
 Can-0.008
amb-0.008
Token,
Token ID11
Feature activation-6.00e-03

Pos logit contributions
uch+0.041
aches+0.037
at+0.035
ats+0.035
w+0.034
Neg logit contributions
 antics-0.029
 infant-0.028
 personality-0.028
 loudly-0.025
 wondering-0.025
Token he
Token ID339
Feature activation-3.09e-03

Pos logit contributions
 antics+0.010
 infant+0.010
 personality+0.010
 Can+0.010
amb+0.009
Neg logit contributions
uch-0.009
aches-0.008
at-0.008
ats-0.008
ade-0.008
Token'm
Token ID1101
Feature activation-4.65e-02

Pos logit contributions
aches+0.011
uch+0.010
ats+0.010
ons+0.010
ade+0.010
Neg logit contributions
 antics-0.013
 personality-0.013
 Can-0.013
 infant-0.013
amb-0.012
Token chewing
Token ID36615
Feature activation-1.95e-02

Pos logit contributions
 antics+0.087
rily+0.080
 infant+0.080
 personality+0.080
urbed+0.079
Neg logit contributions
uch-0.071
 done-0.064
aches-0.062
ons-0.057
at-0.055
Token gum
Token ID27142
Feature activation+1.19e-02

Pos logit contributions
 infant+0.029
 Can+0.029
 antics+0.028
 personality+0.028
 loudly+0.027
Neg logit contributions
uch-0.031
aches-0.030
asis-0.030
at-0.028
ats-0.028
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
uch+0.020
aches+0.019
asis+0.018
at+0.017
ats+0.017
Neg logit contributions
 infant-0.017
 antics-0.017
 personality-0.017
amb-0.016
 Can-0.016
Token Tim
Token ID5045
Feature activation-1.31e-02

Pos logit contributions
 antics+0.041
urbed+0.040
 lear+0.039
 infant+0.039
rily+0.039
Neg logit contributions
uch-0.037
aches-0.035
 done-0.033
ade-0.033
ons-0.032
Tokenmy
Token ID1820
Feature activation-2.02e-02

Pos logit contributions
 antics+0.024
 infant+0.023
 personality+0.023
 Can+0.022
amb+0.022
Neg logit contributions
uch-0.019
aches-0.017
at-0.016
ade-0.016
ats-0.016
Token!
Token ID0
Feature activation-3.76e-02

Pos logit contributions
 antics+0.042
 personality+0.040
 infant+0.040
amb+0.038
 Can+0.038
Neg logit contributions
uch-0.024
aches-0.022
ade-0.020
ons-0.020
ats-0.020
Token Want
Token ID16168
Feature activation+2.74e-02

Pos logit contributions
 antics+0.048
rily+0.047
iously+0.044
urbed+0.043
ening+0.042
Neg logit contributions
uch-0.073
 am-0.066
ons-0.060
tr-0.060
at-0.059
Token some
Token ID617
Feature activation+5.51e-02

Pos logit contributions
uch+0.047
aches+0.046
asis+0.046
ats+0.045
at+0.045
Neg logit contributions
 Can-0.039
 infant-0.038
 antics-0.036
 personality-0.034
 Even-0.033
Token?"
Token ID1701
Feature activation+1.08e-03

Pos logit contributions
fire+0.084
asis+0.082
uch+0.079
aches+0.078
kins+0.076
Neg logit contributions
 Can-0.093
 infant-0.085
 antics-0.084
 personality-0.082
 Please-0.082
Token replied
Token ID8712
Feature activation-3.06e-02

Pos logit contributions
uch+0.002
aches+0.002
at+0.002
ats+0.002
asis+0.002
Neg logit contributions
 infant-0.001
 antics-0.001
 personality-0.001
 Can-0.001
amb-0.001
Token little
Token ID1310
Feature activation-7.20e-02

Pos logit contributions
 antics+0.055
arted+0.055
iously+0.054
urbed+0.053
istent+0.052
Neg logit contributions
uch-0.056
str-0.052
aches-0.051
lly-0.050
asis-0.050
Token girl
Token ID2576
Feature activation-4.20e-02

Pos logit contributions
 antics+0.117
urbed+0.113
iously+0.113
istent+0.112
arted+0.109
Neg logit contributions
uch-0.112
 cream-0.106
ade-0.105
 last-0.104
asis-0.103
Token named
Token ID3706
Feature activation-4.31e-02

Pos logit contributions
 infant+0.063
 antics+0.062
 personality+0.061
 Can+0.058
amb+0.057
Neg logit contributions
uch-0.074
aches-0.070
ade-0.064
at-0.064
ats-0.064
Token Lucy
Token ID22162
Feature activation-1.82e-02

Pos logit contributions
 antics+0.048
 infant+0.046
 personality+0.045
 Can+0.041
amb+0.041
Neg logit contributions
uch-0.093
aches-0.088
ade-0.082
at-0.082
ats-0.082
Token.
Token ID13
Feature activation-2.79e-02

Pos logit contributions
 antics+0.036
 infant+0.036
 personality+0.035
 Can+0.034
amb+0.033
Neg logit contributions
uch-0.023
aches-0.021
at-0.019
ats-0.019
ade-0.019
Token She
Token ID1375
Feature activation-1.44e-02

Pos logit contributions
rier+0.032
 antics+0.031
 lear+0.030
 warrior+0.029
urbed+0.029
Neg logit contributions
he-0.053
 am-0.052
uch-0.050
ade-0.049
 rest-0.049
Token was
Token ID373
Feature activation-4.43e-02

Pos logit contributions
 antics+0.024
 infant+0.023
 personality+0.022
 Can+0.021
amb+0.021
Neg logit contributions
uch-0.023
aches-0.022
at-0.020
ade-0.020
ats-0.020
Token three
Token ID1115
Feature activation-4.54e-02

Pos logit contributions
 antics+0.094
rily+0.089
urbed+0.089
der+0.086
istent+0.085
Neg logit contributions
uch-0.051
 done-0.048
aches-0.044
 am-0.044
 last-0.043
Token years
Token ID812
Feature activation-3.49e-02

Pos logit contributions
 antics+0.076
 infant+0.073
 personality+0.072
 Can+0.070
amb+0.068
Neg logit contributions
uch-0.070
aches-0.068
ats-0.063
ade-0.062
ons-0.062
Token old
Token ID1468
Feature activation-2.14e-02

Pos logit contributions
 antics+0.059
 personality+0.054
istent+0.051
amb+0.051
urbed+0.050
Neg logit contributions
aches-0.053
uch-0.052
ade-0.051
ons-0.049
at-0.048
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 antics+0.043
istent+0.043
urbed+0.042
rily+0.042
arted+0.042
Neg logit contributions
 last-0.021
 am-0.021
 cream-0.020
ma-0.020
ons-0.019
Token to
Token ID284
Feature activation+6.96e-03

Pos logit contributions
 infant+0.062
 antics+0.061
rily+0.060
amb+0.060
 personality+0.060
Neg logit contributions
aches-0.028
ons-0.027
 done-0.027
ats-0.027
uch-0.027
Token play
Token ID711
Feature activation-1.35e-02

Pos logit contributions
uch+0.007
aches+0.007
 am+0.006
 done+0.006
ats+0.006
Neg logit contributions
 antics-0.015
 infant-0.015
rily-0.015
amb-0.014
 Can-0.014
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
 antics+0.025
 infant+0.025
 personality+0.024
 Can+0.023
amb+0.023
Neg logit contributions
uch-0.019
aches-0.018
at-0.016
ats-0.016
ade-0.016
Token Tim
Token ID5045
Feature activation-1.84e-02

Pos logit contributions
heart+0.072
rier+0.070
 antics+0.068
 guitarist+0.067
 warrior+0.067
Neg logit contributions
 soda-0.045
ade-0.043
 done-0.038
 rest-0.037
tr-0.037
Tokenmy
Token ID1820
Feature activation-2.82e-02

Pos logit contributions
 antics+0.039
 infant+0.037
 personality+0.037
 Can+0.036
rily+0.036
Neg logit contributions
uch-0.022
aches-0.020
ats-0.019
at-0.019
 done-0.019
Token's
Token ID338
Feature activation-4.44e-02

Pos logit contributions
 antics+0.056
 infant+0.054
 personality+0.053
rily+0.051
 Can+0.051
Neg logit contributions
uch-0.037
aches-0.033
ats-0.033
ade-0.032
ons-0.032
Token turn
Token ID1210
Feature activation-2.88e-02

Pos logit contributions
 antics+0.066
 infant+0.066
 Can+0.064
 personality+0.064
amb+0.063
Neg logit contributions
uch-0.076
aches-0.071
ade-0.068
ats-0.067
 done-0.065
Token came
Token ID1625
Feature activation-3.55e-02

Pos logit contributions
 antics+0.056
 infant+0.054
 personality+0.054
amb+0.051
 Can+0.051
Neg logit contributions
uch-0.037
aches-0.035
ade-0.034
 done-0.032
ats-0.032
Token and
Token ID290
Feature activation-8.75e-03

Pos logit contributions
 personality+0.082
 antics+0.080
 infant+0.078
amb+0.077
istent+0.076
Neg logit contributions
 done-0.029
 last-0.029
uch-0.026
bel-0.024
 am-0.024
Token he
Token ID339
Feature activation-4.09e-03

Pos logit contributions
 antics+0.014
 infant+0.014
 personality+0.014
 Can+0.013
amb+0.013
Neg logit contributions
uch-0.014
aches-0.013
at-0.012
ats-0.012
ade-0.012
Token rolled
Token ID11686
Feature activation-5.20e-02

Pos logit contributions
 antics+0.007
 infant+0.007
 personality+0.007
 Can+0.007
amb+0.007
Neg logit contributions
uch-0.006
aches-0.005
at-0.005
ats-0.005
ade-0.005
Token voice
Token ID3809
Feature activation+5.21e-03

Pos logit contributions
 antics+0.062
 infant+0.061
 personality+0.057
 Can+0.055
amb+0.055
Neg logit contributions
uch-0.109
aches-0.105
ade-0.099
ats-0.098
at-0.098
Token,
Token ID11
Feature activation-1.06e-02

Pos logit contributions
uch+0.009
aches+0.008
at+0.008
ats+0.008
asis+0.008
Neg logit contributions
 antics-0.008
 infant-0.008
 personality-0.008
 Can-0.007
 loudly-0.007
Token
Token ID6184
Feature activation-7.13e-02

Pos logit contributions
 antics+0.012
 infant+0.011
 personality+0.011
amb+0.010
arted+0.009
Neg logit contributions
uch-0.021
aches-0.021
 done-0.020
str-0.020
 am-0.019
Token
Token ID95
Feature activation-6.54e-02

Pos logit contributions
 antics+0.118
 infant+0.117
 personality+0.113
 Can+0.105
amb+0.104
Neg logit contributions
uch-0.117
aches-0.107
at-0.099
ats-0.099
asis-0.098
Token
Token ID26391
Feature activation-4.51e-02

Pos logit contributions
 antics+0.089
 infant+0.088
 personality+0.084
 Can+0.078
amb+0.078
Neg logit contributions
uch-0.125
aches-0.116
at-0.109
ats-0.108
ade-0.107
Token
Token ID129
Feature activation-7.71e-02

Pos logit contributions
 antics+0.068
 personality+0.065
 Can+0.064
 Even+0.062
 infant+0.061
Neg logit contributions
aches-0.074
uch-0.071
ade-0.068
at-0.068
asis-0.064
Token
Token ID241
Feature activation+1.79e-02

Pos logit contributions
 infant+0.112
 antics+0.105
 personality+0.104
 Can+0.103
 lear+0.099
Neg logit contributions
uch-0.140
aches-0.135
ats-0.127
 done-0.126
at-0.125
TokenDon
Token ID3987
Feature activation+2.29e-02

Pos logit contributions
uch+0.027
aches+0.025
ade+0.022
at+0.022
ats+0.022
Neg logit contributions
 antics-0.031
 infant-0.031
 personality-0.030
amb-0.028
 Can-0.028
Tokenâ
Token ID22940
Feature activation-5.03e-02

Pos logit contributions
uch+0.035
aches+0.034
at+0.031
ats+0.031
ade+0.030
Neg logit contributions
 infant-0.037
 antics-0.037
 Can-0.036
 personality-0.035
amb-0.035
Token
Token ID26391
Feature activation-4.11e-02

Pos logit contributions
 antics+0.073
 infant+0.072
amb+0.072
 personality+0.071
 Can+0.070
Neg logit contributions
aches-0.081
uch-0.081
ons-0.075
ade-0.074
ats-0.073
Token
Token ID8151
Feature activation+2.27e-02

Pos logit contributions
 antics+0.063
 personality+0.061
 Can+0.060
 infant+0.060
amb+0.056
Neg logit contributions
uch-0.068
aches-0.068
ade-0.062
at-0.061
ats-0.059
Token said
Token ID531
Feature activation-1.70e-02

Pos logit contributions
 antics+0.039
 infant+0.038
 personality+0.037
 Can+0.036
amb+0.035
Neg logit contributions
uch-0.025
aches-0.023
ade-0.020
ats-0.020
at-0.020
Token,
Token ID11
Feature activation-2.96e-03

Pos logit contributions
 antics+0.029
 infant+0.028
 personality+0.027
 Can+0.026
amb+0.026
Neg logit contributions
uch-0.026
aches-0.025
ade-0.023
at-0.022
ats-0.022
Token
Token ID6184
Feature activation-6.80e-02

Pos logit contributions
 antics+0.004
 infant+0.004
 personality+0.003
heart+0.003
 adventurers+0.003
Neg logit contributions
uch-0.005
aches-0.005
str-0.005
 done-0.005
 am-0.005
Token
Token ID95
Feature activation-6.60e-02

Pos logit contributions
 infant+0.112
 antics+0.110
 personality+0.103
urbed+0.098
rily+0.097
Neg logit contributions
uch-0.116
aches-0.106
at-0.099
asis-0.098
ats-0.097
Token
Token ID26391
Feature activation-4.93e-02

Pos logit contributions
 antics+0.099
 infant+0.098
 personality+0.093
 Can+0.087
amb+0.087
Neg logit contributions
uch-0.117
aches-0.108
at-0.102
ats-0.101
ade-0.100
Token
Token ID129
Feature activation-7.51e-02

Pos logit contributions
 antics+0.076
 personality+0.073
 infant+0.071
 Can+0.071
amb+0.068
Neg logit contributions
uch-0.080
aches-0.078
ade-0.072
at-0.071
ats-0.068
Token
Token ID241
Feature activation+1.12e-02

Pos logit contributions
 infant+0.109
 antics+0.101
 personality+0.098
 Can+0.096
 alien+0.093
Neg logit contributions
uch-0.138
aches-0.134
ats-0.127
at-0.126
 done-0.124
TokenYes
Token ID5297
Feature activation+2.25e-03

Pos logit contributions
uch+0.017
aches+0.016
 am+0.015
w+0.015
ade+0.015
Neg logit contributions
 infant-0.019
 antics-0.018
 personality-0.017
rily-0.017
urbed-0.017
Token!
Token ID0
Feature activation-1.44e-02

Pos logit contributions
uch+0.003
aches+0.003
ade+0.003
ats+0.003
at+0.003
Neg logit contributions
 antics-0.004
 infant-0.004
 personality-0.004
 Can-0.004
amb-0.004
Tokenâ
Token ID22940
Feature activation-4.90e-02

Pos logit contributions
 antics+0.019
 infant+0.017
ening+0.016
urbed+0.016
rily+0.016
Neg logit contributions
uch-0.026
 am-0.025
-0.024
aches-0.023
ons-0.023
Token
Token ID26391
Feature activation-3.83e-02

Pos logit contributions
 antics+0.070
 personality+0.070
amb+0.069
 Can+0.068
 infant+0.068
Neg logit contributions
aches-0.079
uch-0.078
ons-0.072
ade-0.071
ats-0.070
Token<|endoftext|>
Token ID50256
Feature activation+6.32e-02

Pos logit contributions
 antics+0.025
 infant+0.023
 personality+0.022
 lear+0.022
amb+0.021
Neg logit contributions
uch-0.020
aches-0.018
 am-0.018
ade-0.018
ons-0.017
TokenOne
Token ID3198
Feature activation+6.78e-02

Pos logit contributions
aches+0.095
uch+0.093
ats+0.078
at+0.075
ade+0.072
Neg logit contributions
 Can-0.113
 She-0.112
 It-0.112
 infant-0.111
 Even-0.107
Token day
Token ID1110
Feature activation+9.61e-03

Pos logit contributions
uch+0.098
aches+0.094
ats+0.085
ons+0.083
asis+0.082
Neg logit contributions
 infant-0.121
 personality-0.117
 antics-0.115
 daughter-0.110
heart-0.107
Token,
Token ID11
Feature activation+7.79e-04

Pos logit contributions
uch+0.017
aches+0.016
at+0.015
asis+0.015
ats+0.015
Neg logit contributions
 infant-0.014
 antics-0.014
 personality-0.013
 Can-0.013
amb-0.012
Token a
Token ID257
Feature activation-3.10e-02

Pos logit contributions
uch+0.001
aches+0.001
ade+0.001
ats+0.001
at+0.001
Neg logit contributions
 antics-0.001
 infant-0.001
 personality-0.001
 Can-0.001
amb-0.001
Token little
Token ID1310
Feature activation-6.72e-02

Pos logit contributions
 antics+0.038
 infant+0.035
 personality+0.035
amb+0.034
urbed+0.034
Neg logit contributions
uch-0.062
aches-0.057
ade-0.055
 am-0.054
str-0.054
Token rat
Token ID4227
Feature activation-1.65e-02

Pos logit contributions
 antics+0.104
iously+0.099
rily+0.098
urbed+0.095
istent+0.093
Neg logit contributions
uch-0.113
str-0.106
 am-0.104
ade-0.103
 last-0.103
Token needed
Token ID2622
Feature activation+1.54e-02

Pos logit contributions
 infant+0.024
 antics+0.022
 personality+0.022
 Can+0.021
 loudly+0.020
Neg logit contributions
uch-0.030
aches-0.029
at-0.026
ats-0.026
ade-0.026
Token to
Token ID284
Feature activation+1.50e-02

Pos logit contributions
uch+0.024
aches+0.022
at+0.020
ats+0.020
asis+0.020
Neg logit contributions
 infant-0.027
 antics-0.026
 personality-0.026
 Can-0.025
 lear-0.023
Token be
Token ID307
Feature activation-1.96e-02

Pos logit contributions
uch+0.023
aches+0.020
ats+0.020
at+0.020
w+0.019
Neg logit contributions
 infant-0.026
 antics-0.026
 personality-0.024
 Can-0.022
 daughter-0.022
Token healed
Token ID28557
Feature activation-2.43e-02

Pos logit contributions
 infant+0.029
 antics+0.028
 personality+0.028
 hesitant+0.025
 loudly+0.025
Neg logit contributions
uch-0.036
aches-0.031
at-0.030
asis-0.030
ade-0.030
Token time
Token ID640
Feature activation-4.06e-03

Pos logit contributions
aches+0.004
uch+0.004
ons+0.003
asis+0.003
at+0.003
Neg logit contributions
 antics-0.004
 personality-0.004
 infant-0.004
 loudly-0.004
amb-0.004
Token,
Token ID11
Feature activation+1.51e-02

Pos logit contributions
 infant+0.005
 antics+0.005
 personality+0.005
 Can+0.005
amb+0.005
Neg logit contributions
uch-0.008
aches-0.007
at-0.007
ade-0.007
ats-0.007
Token there
Token ID612
Feature activation+4.78e-02

Pos logit contributions
uch+0.021
aches+0.020
ats+0.019
at+0.018
ade+0.018
Neg logit contributions
 infant-0.028
 Can-0.027
 personality-0.026
 antics-0.026
 loudly-0.025
Token was
Token ID373
Feature activation-3.46e-02

Pos logit contributions
aches+0.082
uch+0.082
at+0.075
ats+0.074
asis+0.070
Neg logit contributions
 It-0.071
 Even-0.069
 infant-0.068
 She-0.067
 personality-0.067
Token a
Token ID257
Feature activation-3.75e-02

Pos logit contributions
 antics+0.078
 personality+0.074
istent+0.071
urbed+0.071
 infant+0.071
Neg logit contributions
uch-0.036
aches-0.030
ons-0.029
at-0.028
ade-0.028
Token beautiful
Token ID4950
Feature activation-7.24e-02

Pos logit contributions
arted+0.062
 antics+0.061
iously+0.061
urbed+0.060
istent+0.058
Neg logit contributions
uch-0.059
str-0.055
aches-0.053
lly-0.052
asis-0.052
Token princess
Token ID21752
Feature activation-3.12e-02

Pos logit contributions
arted+0.123
istent+0.121
 antics+0.118
urbed+0.117
iously+0.114
Neg logit contributions
uch-0.105
asis-0.104
ade-0.101
aches-0.098
str-0.095
Token.
Token ID13
Feature activation-3.60e-02

Pos logit contributions
 infant+0.047
 antics+0.046
amb+0.045
 personality+0.045
 lear+0.043
Neg logit contributions
uch-0.055
aches-0.051
ade-0.048
 done-0.046
ats-0.046
Token She
Token ID1375
Feature activation-1.84e-02

Pos logit contributions
 antics+0.034
urbed+0.029
arted+0.028
istent+0.028
rily+0.028
Neg logit contributions
aches-0.076
 am-0.075
uch-0.075
ons-0.074
he-0.074
Token was
Token ID373
Feature activation-4.42e-02

Pos logit contributions
 personality+0.026
 infant+0.026
 Can+0.025
 antics+0.025
amb+0.023
Neg logit contributions
uch-0.033
aches-0.031
ats-0.029
at-0.028
ade-0.028
Token so
Token ID523
Feature activation-3.07e-02

Pos logit contributions
 antics+0.084
 personality+0.076
 infant+0.075
istent+0.075
urbed+0.074
Neg logit contributions
uch-0.062
aches-0.060
 done-0.055
at-0.053
ons-0.051
Token time
Token ID640
Feature activation-4.61e-03

Pos logit contributions
aches+0.001
uch+0.001
ons+0.001
asis+0.001
at+0.001
Neg logit contributions
 antics-0.002
 personality-0.002
 infant-0.002
amb-0.002
 loudly-0.002
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
 antics+0.007
 infant+0.007
 personality+0.006
 Can+0.006
amb+0.006
Neg logit contributions
uch-0.008
aches-0.008
at-0.007
ade-0.007
ats-0.007
Token there
Token ID612
Feature activation+4.43e-02

Pos logit contributions
uch+0.023
aches+0.022
ats+0.020
at+0.020
ade+0.020
Neg logit contributions
 infant-0.037
 antics-0.035
 Can-0.035
 personality-0.035
 loudly-0.033
Token was
Token ID373
Feature activation-3.65e-02

Pos logit contributions
aches+0.072
uch+0.070
at+0.065
ats+0.064
asis+0.061
Neg logit contributions
 infant-0.070
 It-0.068
 antics-0.068
 Even-0.068
 Can-0.067
Token a
Token ID257
Feature activation-3.48e-02

Pos logit contributions
 antics+0.079
 personality+0.076
 infant+0.073
istent+0.072
urbed+0.072
Neg logit contributions
uch-0.040
aches-0.034
ons-0.032
ade-0.032
at-0.032
Token little
Token ID1310
Feature activation-6.77e-02

Pos logit contributions
arted+0.056
 antics+0.056
iously+0.055
urbed+0.054
amb+0.053
Neg logit contributions
uch-0.056
str-0.051
aches-0.050
ade-0.050
asis-0.049
Token girl
Token ID2576
Feature activation-4.09e-02

Pos logit contributions
 antics+0.111
iously+0.106
urbed+0.105
arted+0.105
istent+0.102
Neg logit contributions
uch-0.110
ade-0.103
 cream-0.102
asis-0.099
 last-0.099
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
 infant+0.058
 antics+0.055
 personality+0.055
 Can+0.052
amb+0.051
Neg logit contributions
uch-0.077
aches-0.072
ade-0.067
at-0.066
ats-0.065
Token She
Token ID1375
Feature activation-1.88e-02

Pos logit contributions
 antics+0.022
amb+0.019
urbed+0.019
rily+0.019
 infant+0.018
Neg logit contributions
uch-0.070
aches-0.067
 am-0.067
ade-0.065
ons-0.065
Token had
Token ID550
Feature activation-3.93e-03

Pos logit contributions
 infant+0.027
 personality+0.027
 antics+0.026
 Can+0.026
 loudly+0.024
Neg logit contributions
uch-0.032
aches-0.032
ats-0.029
at-0.029
ade-0.029
Token a
Token ID257
Feature activation-2.39e-02

Pos logit contributions
 antics+0.007
 infant+0.007
 personality+0.007
 Can+0.007
amb+0.006
Neg logit contributions
uch-0.006
aches-0.005
at-0.005
ats-0.005
ade-0.005