TOP ACTIVATIONS
MAX = 8.622

Suddenly , he felt a sensation . He began to slide
it was a big round thing with lots of stars .
It was a round yellow thing - it was cheese !
She held up a shiny thing in her hand . It
It was a big shiny thing - a microphone . He
It was a big green thing . She asked her mother
. A zoom was a thing that could fly in the
a bucket , a round thing for holding sand .
, he felt a strange sensation , like something powerful was
held up a small metal thing . " Maybe it can
, Elsa felt a strange sensation in the water . What
It was a tiny green thing . He picked it up
A gear is a round thing with teeth that can move
. It was a navy thing ! She looked closer and
. It was a shiny thing that looked interesting and it
Fin saw a big green thing floating in the water .
of sticks . Sue felt something was going to happen .
The doctor put a special thing on the little girl 's
tight and put a special thing in her mouth . It
and her mom got a sund ae with lots of topp

MIN ACTIVATIONS
MIN = -8.523

She asked , " Dog gy , can you help me
should not tease the ost ric hes . They are not
, " Hello , pig gy ! You are so big
but be careful . Ost ric hes can be gr umpy
you count how many ost ric hes there are ?"
shouldn 't run after ambul ances . It 's very dangerous
started to chase after Kay lee . She screamed and ran
save it in your pig gy bank ."
, " Yes , Dog gy ! Let 's slide together
, wanted to borrow Zig gy 's favorite toy . Zig
They do not pack ost ric hes anymore . They pack
some money for her pig gy bank . One
save it in your pig gy bank ." While
did not know what ost ric hes were . They looked
wanted to touch the ost ric hes . They reached out
was always there for Bug gy , no matter what .
mom and to the ost ric hes . They learned their
. " Dog gy , you are a bad
ly said , " Bug gy , today was so much
Snow y said to Tig gy , " I found something

INTERVAL 4.335 - 8.622
CONTAINS 0.151%

have found such an amazing thing . Jack knew
my realized that Lily was right . From that day on
? â He looked up to see a friendly face
dug a tunnel and found something sparkling . He was so
loud bark . They look up and see a big dog

INTERVAL 0.049 - 4.335
CONTAINS 46.704%

On the day of the task she smiled cheer fully ,
! Max was so scared he started to cry .
was red and smelled nice . E lla 's
lit the fire and started to cook . After
new flower . As they got closer , they

INTERVAL -4.237 - 0.049
CONTAINS 53.026%

, there was a shape . It was very dirty .
liked to look at the flowers and the insects . She
brave . He stayed . " Hello , dog
know what to do . Suddenly , Lily remembered her mom
veil , and I will never let anything in ."

INTERVAL -8.523 - -4.237
CONTAINS 0.119%

there was a little girl named Lily . She loved to
there lived a generous dinosaur named Todd . Every day ,
Jill , let â s clean it up .
One day , a girl named Lucy went to her friend
other animals for many happy years . <|endoftext|> Once upon a
TokenSuddenly
Token ID38582
Feature activation-1.38e+00

Pos logit contributions
 know+0.015
 named+0.014
atters+0.014
 have+0.014
 done+0.014
Neg logit contributions
 therapist-0.021
cer-0.017
 toddler-0.016
 Riley-0.016
unia-0.016
Token,
Token ID11
Feature activation+5.62e-02

Pos logit contributions
 named+0.007
 have+0.005
 know+0.004
 done+0.003
 called+0.003
Neg logit contributions
 therapist-0.090
ymes-0.079
cer-0.079
rily-0.079
unia-0.077
Token he
Token ID339
Feature activation+1.20e+00

Pos logit contributions
 therapist+0.003
cer+0.002
ymes+0.002
unia+0.002
rily+0.002
Neg logit contributions
 have-0.001
 named-0.001
 know-0.001
 done-0.001
 Tim-0.001
Token felt
Token ID2936
Feature activation-6.10e-01

Pos logit contributions
 therapist+0.051
rier+0.041
cer+0.041
 toddler+0.040
 Clare+0.040
Neg logit contributions
 named-0.030
 know-0.030
 done-0.028
 have-0.028
 called-0.027
Token a
Token ID257
Feature activation+3.56e+00

Pos logit contributions
 know+0.014
 named+0.013
 have+0.013
 much+0.012
 called+0.012
Neg logit contributions
 therapist-0.028
cer-0.024
 toddler-0.023
unia-0.023
rily-0.022
Token sensation
Token ID18098
Feature activation+6.98e+00

Pos logit contributions
 therapist+0.149
cer+0.123
 Clare+0.118
unia+0.118
gers+0.117
Neg logit contributions
 know-0.092
 have-0.084
 named-0.082
atters-0.076
 done-0.075
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
 therapist+0.313
 toddler+0.264
 Clare+0.261
cer+0.258
 Riley+0.258
Neg logit contributions
 named-0.153
 done-0.142
 have-0.142
 know-0.139
 called-0.139
Token He
Token ID679
Feature activation+1.41e-01

Pos logit contributions
 therapist+0.014
cer+0.012
unia+0.012
ymes+0.012
gers+0.011
Neg logit contributions
 know-0.003
 One-0.003
 named-0.002
 have-0.002
 In-0.002
Token began
Token ID2540
Feature activation+3.95e-01

Pos logit contributions
 therapist+0.007
rier+0.006
cer+0.006
 Clare+0.006
gers+0.006
Neg logit contributions
 named-0.003
 called-0.003
 know-0.003
 have-0.002
 both-0.002
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
 therapist+0.020
cer+0.018
rier+0.018
unia+0.018
gers+0.018
Neg logit contributions
 know-0.007
 named-0.006
 have-0.006
 called-0.006
 done-0.005
Token slide
Token ID10649
Feature activation-1.41e+00

Pos logit contributions
 therapist+0.066
rier+0.057
cer+0.057
gers+0.057
rily+0.055
Neg logit contributions
 know-0.027
 have-0.024
 named-0.020
 build-0.019
my-0.019
Token it
Token ID340
Feature activation+2.11e+00

Pos logit contributions
 therapist+0.024
cer+0.020
unia+0.020
ymes+0.020
 avail+0.020
Neg logit contributions
 named-0.007
 have-0.007
 called-0.007
 know-0.007
 because-0.006
Token was
Token ID373
Feature activation+4.14e-01

Pos logit contributions
 therapist+0.090
cer+0.075
 toddler+0.074
 Clare+0.074
rier+0.073
Neg logit contributions
 named-0.050
 know-0.049
 have-0.047
 done-0.047
 called-0.045
Token a
Token ID257
Feature activation+2.89e+00

Pos logit contributions
 therapist+0.019
 toddler+0.015
cer+0.015
 Riley+0.015
 Clare+0.015
Neg logit contributions
 know-0.009
atters-0.008
 named-0.008
 have-0.008
 done-0.007
Token big
Token ID1263
Feature activation+9.29e-01

Pos logit contributions
 therapist+0.129
cer+0.107
 Riley+0.105
 toddler+0.104
 Clare+0.104
Neg logit contributions
 know-0.061
 named-0.061
 have-0.057
atters-0.054
 done-0.054
Token round
Token ID2835
Feature activation+2.52e+00

Pos logit contributions
 therapist+0.043
cer+0.036
 toddler+0.035
 Riley+0.035
 Clare+0.035
Neg logit contributions
 named-0.018
 know-0.018
 have-0.017
atters-0.016
 done-0.016
Token thing
Token ID1517
Feature activation+6.86e+00

Pos logit contributions
 therapist+0.106
cer+0.085
 Riley+0.085
 toddler+0.084
 Clare+0.083
Neg logit contributions
 named-0.061
 know-0.060
 have-0.057
atters-0.056
 done-0.053
Token with
Token ID351
Feature activation+2.78e+00

Pos logit contributions
 therapist+0.321
cer+0.267
 Clare+0.267
 toddler+0.266
unia+0.263
Neg logit contributions
 named-0.136
 know-0.125
 have-0.124
 called-0.118
 done-0.117
Token lots
Token ID6041
Feature activation+5.24e-01

Pos logit contributions
 therapist+0.126
cer+0.104
 toddler+0.102
unia+0.100
 Riley+0.100
Neg logit contributions
 know-0.061
 named-0.058
 have-0.055
atters-0.054
 done-0.053
Token of
Token ID286
Feature activation+6.97e-01

Pos logit contributions
 therapist+0.025
 toddler+0.021
cer+0.021
rier+0.020
 Riley+0.020
Neg logit contributions
 know-0.009
 named-0.009
atters-0.008
 done-0.008
 have-0.008
Token stars
Token ID5788
Feature activation-3.28e-01

Pos logit contributions
 therapist+0.030
cer+0.024
 toddler+0.024
 Riley+0.024
unia+0.024
Neg logit contributions
 know-0.016
 named-0.016
atters-0.015
 have-0.015
 done-0.015
Token.
Token ID13
Feature activation+4.74e-01

Pos logit contributions
 know+0.006
 named+0.006
atters+0.006
 have+0.006
 done+0.006
Neg logit contributions
 therapist-0.015
cer-0.013
 toddler-0.013
 Riley-0.012
 Clare-0.012
Token It
Token ID632
Feature activation-1.03e-01

Pos logit contributions
 therapist+0.009
cer+0.008
unia+0.007
rier+0.007
 toddler+0.007
Neg logit contributions
 know-0.003
 named-0.003
 have-0.003
atters-0.003
 done-0.003
Token was
Token ID373
Feature activation+9.53e-01

Pos logit contributions
 have+0.002
 named+0.002
 know+0.002
 called+0.002
my+0.002
Neg logit contributions
 therapist-0.005
rier-0.004
cer-0.004
 Clare-0.004
rily-0.004
Token a
Token ID257
Feature activation+2.95e+00

Pos logit contributions
 therapist+0.050
cer+0.044
unia+0.043
rier+0.041
rily+0.040
Neg logit contributions
 named-0.015
 called-0.015
 have-0.014
 done-0.013
 know-0.012
Token round
Token ID2835
Feature activation+2.44e+00

Pos logit contributions
 therapist+0.133
edly+0.115
cer+0.114
overed+0.114
rily+0.114
Neg logit contributions
 named-0.061
 have-0.056
 know-0.053
 called-0.051
 done-0.049
Token yellow
Token ID7872
Feature activation+1.24e+00

Pos logit contributions
 therapist+0.107
 Riley+0.088
 Clare+0.088
unia+0.088
 Debbie+0.085
Neg logit contributions
 named-0.057
 have-0.052
 know-0.049
 called-0.047
my-0.046
Token thing
Token ID1517
Feature activation+6.85e+00

Pos logit contributions
 therapist+0.052
 Clare+0.043
 Riley+0.043
unia+0.043
rier+0.041
Neg logit contributions
 named-0.030
 have-0.028
 know-0.026
 called-0.025
my-0.025
Token -
Token ID532
Feature activation+1.01e+00

Pos logit contributions
 therapist+0.350
 Clare+0.296
unia+0.292
 toddler+0.292
cer+0.292
Neg logit contributions
 named-0.107
 have-0.099
 called-0.096
 know-0.095
 done-0.088
Token it
Token ID340
Feature activation+2.61e+00

Pos logit contributions
 therapist+0.052
cer+0.045
unia+0.044
edly+0.044
rier+0.044
Neg logit contributions
 have-0.018
 named-0.017
 know-0.015
 much-0.015
 called-0.014
Token was
Token ID373
Feature activation+9.01e-01

Pos logit contributions
 therapist+0.118
rier+0.100
cer+0.097
 Clare+0.096
 toddler+0.096
Neg logit contributions
 have-0.057
 named-0.057
 know-0.055
 called-0.054
 done-0.052
Token cheese
Token ID9891
Feature activation+1.46e+00

Pos logit contributions
 therapist+0.045
cer+0.038
unia+0.037
 toddler+0.037
rier+0.036
Neg logit contributions
 named-0.016
 called-0.015
 have-0.015
 know-0.015
 done-0.015
Token!
Token ID0
Feature activation+6.37e-01

Pos logit contributions
 therapist+0.075
 toddler+0.064
 Clare+0.064
 Riley+0.063
rier+0.063
Neg logit contributions
 named-0.021
 have-0.019
 know-0.018
 called-0.018
 done-0.018
Token She
Token ID1375
Feature activation-8.33e-01

Pos logit contributions
 therapist+0.003
cer+0.003
rier+0.003
unia+0.003
ymes+0.003
Neg logit contributions
 named-0.001
 know-0.001
 have-0.001
 One-0.001
atters-0.001
Token held
Token ID2714
Feature activation+3.00e+00

Pos logit contributions
 named+0.017
 called+0.015
 have+0.014
 know+0.014
my+0.012
Neg logit contributions
 therapist-0.042
rier-0.038
 avail-0.036
cer-0.035
gers-0.034
Token up
Token ID510
Feature activation+3.42e+00

Pos logit contributions
 therapist+0.149
edly+0.125
lor+0.122
guard+0.121
rier+0.121
Neg logit contributions
 named-0.063
 both-0.061
 have-0.059
 know-0.053
 called-0.051
Token a
Token ID257
Feature activation+3.18e+00

Pos logit contributions
 therapist+0.183
cer+0.160
unia+0.153
rier+0.153
lor+0.151
Neg logit contributions
 named-0.058
 called-0.049
 both-0.046
 One-0.043
 much-0.040
Token shiny
Token ID22441
Feature activation+2.71e+00

Pos logit contributions
 therapist+0.137
overed+0.113
 Riley+0.112
edly+0.110
 Clare+0.108
Neg logit contributions
 named-0.073
 know-0.073
 have-0.072
 called-0.063
 done-0.062
Token thing
Token ID1517
Feature activation+6.77e+00

Pos logit contributions
 therapist+0.112
 Riley+0.091
 Debbie+0.089
 Amelia+0.087
 toddler+0.087
Neg logit contributions
 named-0.071
 have-0.065
 know-0.060
 called-0.060
?"-0.060
Token in
Token ID287
Feature activation+1.01e+00

Pos logit contributions
 therapist+0.343
 Clare+0.293
cer+0.288
unia+0.288
 toddler+0.288
Neg logit contributions
 named-0.107
 have-0.093
 called-0.092
 know-0.091
atters-0.081
Token her
Token ID607
Feature activation+1.08e+00

Pos logit contributions
 therapist+0.051
rier+0.045
overed+0.042
edly+0.041
cer+0.041
Neg logit contributions
 have-0.021
 know-0.021
 named-0.019
 One-0.018
 called-0.017
Token hand
Token ID1021
Feature activation+2.52e+00

Pos logit contributions
 therapist+0.043
cer+0.034
 toddler+0.034
rier+0.033
unia+0.033
Neg logit contributions
 named-0.029
 know-0.029
 have-0.028
 done-0.027
atters-0.026
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 therapist+0.144
 toddler+0.125
cer+0.124
 Riley+0.123
 Clare+0.123
Neg logit contributions
 named-0.022
 know-0.021
 have-0.018
atters-0.017
 done-0.016
Token It
Token ID632
Feature activation-2.40e-01

Pos logit contributions
 therapist+0.007
cer+0.006
 toddler+0.006
unia+0.005
rier+0.005
Neg logit contributions
 know-0.003
 named-0.003
atters-0.003
 have-0.003
 done-0.003
Token It
Token ID632
Feature activation-1.12e-01

Pos logit contributions
 therapist+0.002
cer+0.002
rier+0.002
unia+0.002
 toddler+0.002
Neg logit contributions
 know-0.001
 have-0.001
 named-0.001
atters-0.001
 One-0.001
Token was
Token ID373
Feature activation+7.93e-01

Pos logit contributions
 named+0.003
 have+0.002
 know+0.002
 called+0.002
 done+0.002
Neg logit contributions
 therapist-0.005
rier-0.004
 Clare-0.004
cer-0.004
 toddler-0.004
Token a
Token ID257
Feature activation+2.88e+00

Pos logit contributions
 therapist+0.041
cer+0.035
rier+0.034
unia+0.034
rily+0.034
Neg logit contributions
 named-0.013
 called-0.012
 have-0.011
 know-0.011
 done-0.011
Token big
Token ID1263
Feature activation+8.81e-01

Pos logit contributions
 therapist+0.131
rily+0.114
edly+0.112
overed+0.110
 Clare+0.109
Neg logit contributions
 named-0.060
 have-0.055
 know-0.052
 called-0.050
 done-0.046
Token shiny
Token ID22441
Feature activation+2.24e+00

Pos logit contributions
 therapist+0.043
edly+0.037
rily+0.037
 Clare+0.036
rier+0.036
Neg logit contributions
 named-0.018
 have-0.017
 know-0.014
 called-0.014
my-0.013
Token thing
Token ID1517
Feature activation+6.70e+00

Pos logit contributions
 therapist+0.093
 Riley+0.076
 Clare+0.074
 Debbie+0.074
edly+0.074
Neg logit contributions
 named-0.060
 have-0.054
 called-0.050
 know-0.050
my-0.048
Token -
Token ID532
Feature activation+1.03e+00

Pos logit contributions
 therapist+0.338
 Clare+0.287
unia+0.281
 toddler+0.281
rier+0.280
Neg logit contributions
 named-0.116
 called-0.103
 have-0.101
 know-0.094
 done-0.089
Token a
Token ID257
Feature activation+2.32e+00

Pos logit contributions
 therapist+0.058
rily+0.049
cer+0.049
unia+0.049
rier+0.049
Neg logit contributions
 have-0.017
 named-0.014
 much-0.014
 know-0.013
 Tim-0.012
Token microphone
Token ID21822
Feature activation+7.49e-01

Pos logit contributions
 therapist+0.101
rily+0.086
edly+0.085
overed+0.084
cer+0.083
Neg logit contributions
 named-0.052
 have-0.050
 know-0.047
 called-0.043
 done-0.042
Token.
Token ID13
Feature activation+3.43e-01

Pos logit contributions
 therapist+0.036
 toddler+0.030
cer+0.030
 Riley+0.029
 Clare+0.029
Neg logit contributions
 named-0.013
 know-0.013
 have-0.012
atters-0.012
 done-0.012
Token He
Token ID679
Feature activation+4.59e-02

Pos logit contributions
 therapist+0.016
cer+0.013
rier+0.013
unia+0.013
 toddler+0.013
Neg logit contributions
 know-0.007
 named-0.007
 have-0.006
atters-0.006
 One-0.006
Token It
Token ID632
Feature activation-4.77e-01

Pos logit contributions
 therapist+0.019
cer+0.016
 toddler+0.016
rier+0.016
unia+0.016
Neg logit contributions
 know-0.006
 named-0.006
 have-0.006
atters-0.006
 done-0.005
Token was
Token ID373
Feature activation+5.60e-01

Pos logit contributions
 named+0.011
 have+0.010
 know+0.010
 called+0.010
 done+0.009
Neg logit contributions
 therapist-0.022
rier-0.019
cer-0.019
rily-0.018
 Clare-0.018
Token a
Token ID257
Feature activation+2.89e+00

Pos logit contributions
 therapist+0.028
cer+0.025
rier+0.024
rily+0.024
 toddler+0.023
Neg logit contributions
 named-0.010
 called-0.009
 know-0.009
 have-0.009
 done-0.009
Token big
Token ID1263
Feature activation+9.69e-01

Pos logit contributions
 therapist+0.122
cer+0.102
rily+0.101
edly+0.100
unia+0.099
Neg logit contributions
 named-0.069
 know-0.065
 have-0.064
 done-0.058
 called-0.058
Token green
Token ID4077
Feature activation+8.80e-01

Pos logit contributions
 therapist+0.047
rily+0.040
edly+0.040
unia+0.039
 Riley+0.039
Neg logit contributions
 named-0.019
 have-0.017
 know-0.017
 called-0.015
 done-0.015
Token thing
Token ID1517
Feature activation+6.64e+00

Pos logit contributions
 therapist+0.036
 Riley+0.029
rily+0.029
 Clare+0.029
rier+0.029
Neg logit contributions
 named-0.023
 have-0.021
 know-0.020
 called-0.019
atters-0.019
Token.
Token ID13
Feature activation+5.50e-01

Pos logit contributions
 therapist+0.325
cer+0.276
 toddler+0.273
 Clare+0.272
rier+0.271
Neg logit contributions
 named-0.121
 have-0.106
 called-0.105
 know-0.103
 done-0.096
Token She
Token ID1375
Feature activation-4.71e-01

Pos logit contributions
 therapist+0.027
cer+0.023
 toddler+0.023
 Riley+0.022
 Clare+0.022
Neg logit contributions
 know-0.009
 named-0.009
atters-0.008
 have-0.008
 done-0.008
Token asked
Token ID1965
Feature activation-7.49e-01

Pos logit contributions
 named+0.010
 know+0.010
 have+0.009
 called+0.009
 done+0.008
Neg logit contributions
 therapist-0.022
rier-0.019
cer-0.019
rily-0.018
unia-0.018
Token her
Token ID607
Feature activation-4.22e-01

Pos logit contributions
 know+0.019
 named+0.019
 have+0.017
 called+0.016
 One+0.016
Neg logit contributions
 therapist-0.032
cer-0.029
rier-0.027
unia-0.026
gers-0.026
Token mother
Token ID2802
Feature activation-9.49e-01

Pos logit contributions
atters+0.011
 know+0.010
 done+0.010
ric+0.010
weet+0.010
Neg logit contributions
 therapist-0.017
 toddler-0.014
 Riley-0.013
 Clare-0.013
 Amelia-0.013
Token.
Token ID13
Feature activation+3.37e-01

Pos logit contributions
 therapist+0.030
 toddler+0.025
 Riley+0.025
 Clare+0.024
cer+0.024
Neg logit contributions
 named-0.014
 know-0.013
 have-0.013
atters-0.013
 done-0.012
Token A
Token ID317
Feature activation+8.98e-01

Pos logit contributions
 therapist+0.015
 toddler+0.013
 Clare+0.013
 Riley+0.013
rier+0.012
Neg logit contributions
 know-0.007
 named-0.006
atters-0.006
 done-0.006
ric-0.006
Token zoom
Token ID19792
Feature activation+7.10e-01

Pos logit contributions
 therapist+0.035
cer+0.028
 toddler+0.028
 Riley+0.028
unia+0.027
Neg logit contributions
 know-0.024
 named-0.023
 have-0.022
atters-0.022
 done-0.022
Token was
Token ID373
Feature activation+8.01e-01

Pos logit contributions
 therapist+0.032
 toddler+0.027
 Riley+0.027
 Clare+0.027
rily+0.026
Neg logit contributions
 named-0.016
 have-0.015
 know-0.014
 done-0.014
atters-0.014
Token a
Token ID257
Feature activation+3.04e+00

Pos logit contributions
 therapist+0.036
cer+0.030
 toddler+0.030
 Riley+0.029
unia+0.029
Neg logit contributions
 know-0.017
 named-0.016
 have-0.015
atters-0.015
 done-0.015
Token thing
Token ID1517
Feature activation+6.61e+00

Pos logit contributions
 therapist+0.122
cer+0.098
 toddler+0.097
 Riley+0.096
 Clare+0.095
Neg logit contributions
 know-0.078
 named-0.076
 have-0.073
atters-0.072
 done-0.070
Token that
Token ID326
Feature activation+1.03e+00

Pos logit contributions
 therapist+0.315
 toddler+0.265
cer+0.263
 Clare+0.261
unia+0.259
Neg logit contributions
 named-0.129
 have-0.116
 know-0.114
 called-0.111
atters-0.108
Token could
Token ID714
Feature activation-1.38e+00

Pos logit contributions
 therapist+0.049
rier+0.044
rily+0.043
 toddler+0.043
cer+0.043
Neg logit contributions
 have-0.022
 named-0.020
 called-0.020
 know-0.020
 done-0.019
Token fly
Token ID6129
Feature activation-1.48e+00

Pos logit contributions
 know+0.025
 have+0.024
 named+0.023
atters+0.022
 done+0.020
Neg logit contributions
 therapist-0.068
cer-0.058
 toddler-0.058
 Riley-0.056
 Clare-0.056
Token in
Token ID287
Feature activation+5.80e-01

Pos logit contributions
 know+0.027
 named+0.026
 have+0.025
atters+0.024
 done+0.023
Neg logit contributions
 therapist-0.072
 toddler-0.060
cer-0.059
 Riley-0.059
 Clare-0.058
Token the
Token ID262
Feature activation-2.00e-01

Pos logit contributions
 therapist+0.027
 toddler+0.021
cer+0.021
 Riley+0.021
rier+0.021
Neg logit contributions
 know-0.013
 have-0.012
 named-0.012
atters-0.011
 done-0.011
Token a
Token ID257
Feature activation+2.37e+00

Pos logit contributions
 know+0.020
atters+0.020
 named+0.020
weet+0.018
 have+0.018
Neg logit contributions
 therapist-0.037
 toddler-0.030
 Riley-0.029
cer-0.029
 Clare-0.029
Token bucket
Token ID19236
Feature activation+2.12e+00

Pos logit contributions
 therapist+0.097
cer+0.078
 toddler+0.078
 Riley+0.076
 Clare+0.075
Neg logit contributions
 know-0.059
 named-0.057
atters-0.056
 have-0.054
 done-0.054
Token,
Token ID11
Feature activation+1.82e+00

Pos logit contributions
 therapist+0.099
cer+0.082
 toddler+0.082
 Riley+0.080
 Clare+0.080
Neg logit contributions
 know-0.041
 named-0.039
atters-0.038
 have-0.037
 done-0.036
Token a
Token ID257
Feature activation+3.11e+00

Pos logit contributions
 therapist+0.086
cer+0.072
rier+0.071
unia+0.070
 toddler+0.069
Neg logit contributions
 named-0.036
 know-0.036
 have-0.034
 called-0.031
 done-0.031
Token round
Token ID2835
Feature activation+2.18e+00

Pos logit contributions
 therapist+0.128
cer+0.103
 toddler+0.102
 Riley+0.100
unia+0.100
Neg logit contributions
 know-0.077
 named-0.076
 have-0.072
atters-0.072
 done-0.070
Token thing
Token ID1517
Feature activation+6.61e+00

Pos logit contributions
 therapist+0.089
cer+0.071
 toddler+0.071
 Riley+0.070
 Clare+0.069
Neg logit contributions
 know-0.055
 named-0.053
atters-0.051
 have-0.051
 done-0.050
Token for
Token ID329
Feature activation+2.16e+00

Pos logit contributions
 therapist+0.317
 toddler+0.264
cer+0.263
 Clare+0.262
unia+0.259
Neg logit contributions
 named-0.121
 know-0.116
 have-0.110
atters-0.108
 done-0.104
Token holding
Token ID4769
Feature activation+1.04e+00

Pos logit contributions
 therapist+0.086
cer+0.069
 toddler+0.069
 Riley+0.067
unia+0.066
Neg logit contributions
 know-0.057
 named-0.055
atters-0.053
 have-0.053
 done-0.052
Token sand
Token ID6450
Feature activation-5.55e-01

Pos logit contributions
 therapist+0.051
 toddler+0.041
cer+0.040
 Clare+0.040
 Debbie+0.039
Neg logit contributions
 named-0.020
 know-0.018
 have-0.018
 both-0.018
 done-0.018
Token.
Token ID13
Feature activation+4.17e-01

Pos logit contributions
 know+0.011
 named+0.010
atters+0.010
 have+0.010
 done+0.009
Neg logit contributions
 therapist-0.026
cer-0.022
 toddler-0.021
 Riley-0.021
unia-0.021
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
 therapist+0.019
 Riley+0.016
 toddler+0.016
 Clare+0.016
cer+0.016
Neg logit contributions
 know-0.008
atters-0.008
 named-0.008
 done-0.007
 have-0.007
Token,
Token ID11
Feature activation+4.63e-01

Pos logit contributions
 therapist+0.068
cer+0.059
 toddler+0.058
 Clare+0.058
unia+0.058
Neg logit contributions
 know-0.015
 have-0.014
 named-0.013
 done-0.013
atters-0.012
Token he
Token ID339
Feature activation+1.49e+00

Pos logit contributions
 therapist+0.024
unia+0.020
cer+0.020
rier+0.020
rily+0.020
Neg logit contributions
 have-0.008
 know-0.008
 named-0.008
 called-0.007
 much-0.007
Token felt
Token ID2936
Feature activation-8.43e-02

Pos logit contributions
 therapist+0.068
rier+0.057
cer+0.056
 toddler+0.056
unia+0.055
Neg logit contributions
 named-0.033
 know-0.032
 have-0.029
 done-0.029
 called-0.029
Token a
Token ID257
Feature activation+3.62e+00

Pos logit contributions
 know+0.002
 named+0.002
 have+0.001
atters+0.001
 done+0.001
Neg logit contributions
 therapist-0.004
cer-0.003
 toddler-0.003
unia-0.003
 Riley-0.003
Token strange
Token ID6283
Feature activation+2.97e+00

Pos logit contributions
 therapist+0.155
cer+0.127
 toddler+0.124
 Riley+0.123
unia+0.123
Neg logit contributions
 know-0.085
 named-0.080
 have-0.078
atters-0.077
 done-0.075
Token sensation
Token ID18098
Feature activation+6.60e+00

Pos logit contributions
 therapist+0.122
unia+0.101
 Clare+0.100
 Riley+0.099
cer+0.098
Neg logit contributions
 know-0.077
 named-0.075
 have-0.073
 done-0.072
 called-0.067
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 therapist+0.292
 toddler+0.244
 Clare+0.240
cer+0.239
unia+0.239
Neg logit contributions
 named-0.145
 done-0.138
 know-0.138
 have-0.136
atters-0.129
Token like
Token ID588
Feature activation+7.39e-01

Pos logit contributions
 therapist+0.052
unia+0.045
cer+0.044
 toddler+0.043
gers+0.042
Neg logit contributions
 named-0.020
 have-0.019
 know-0.019
 done-0.019
 called-0.018
Token something
Token ID1223
Feature activation+3.91e+00

Pos logit contributions
 therapist+0.034
cer+0.030
unia+0.030
rily+0.028
 Clare+0.028
Neg logit contributions
 know-0.015
 named-0.014
 done-0.014
 have-0.014
 much-0.012
Token powerful
Token ID3665
Feature activation+6.39e-01

Pos logit contributions
 therapist+0.178
cer+0.148
 toddler+0.146
unia+0.144
 Clare+0.144
Neg logit contributions
 know-0.085
 named-0.081
 have-0.079
 done-0.077
atters-0.074
Token was
Token ID373
Feature activation-1.91e-01

Pos logit contributions
 therapist+0.034
 toddler+0.029
cer+0.029
 Clare+0.029
unia+0.029
Neg logit contributions
 know-0.009
 named-0.008
 have-0.008
 done-0.008
atters-0.007
Token held
Token ID2714
Feature activation+3.08e+00

Pos logit contributions
 therapist+0.009
rier+0.008
 avail+0.008
 Clare+0.007
cer+0.007
Neg logit contributions
 named-0.003
 called-0.003
 know-0.003
 have-0.003
 built-0.002
Token up
Token ID510
Feature activation+3.54e+00

Pos logit contributions
 therapist+0.166
edly+0.141
ymes+0.133
lor+0.133
cer+0.133
Neg logit contributions
 have-0.055
 named-0.053
 both-0.048
 know-0.044
 One-0.041
Token a
Token ID257
Feature activation+3.32e+00

Pos logit contributions
 therapist+0.191
cer+0.162
unia+0.159
rier+0.155
 parcel+0.154
Neg logit contributions
 named-0.061
 called-0.048
 have-0.046
 both-0.045
 One-0.043
Token small
Token ID1402
Feature activation+2.29e+00

Pos logit contributions
 therapist+0.142
 Riley+0.114
 Clare+0.112
cer+0.112
unia+0.111
Neg logit contributions
 know-0.082
 named-0.077
 have-0.077
atters-0.069
 done-0.069
Token metal
Token ID6147
Feature activation+3.60e+00

Pos logit contributions
 therapist+0.104
 Riley+0.087
 Clare+0.085
 Debbie+0.084
overed+0.082
Neg logit contributions
 know-0.054
 named-0.052
 have-0.051
 called-0.043
 mud-0.042
Token thing
Token ID1517
Feature activation+6.55e+00

Pos logit contributions
 therapist+0.150
 Riley+0.128
 Clare+0.127
 Debbie+0.126
 Amelia+0.122
Neg logit contributions
 named-0.091
 have-0.081
 know-0.079
 called-0.078
atters-0.076
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
 therapist+0.333
 Clare+0.286
unia+0.279
cer+0.279
 Riley+0.278
Neg logit contributions
 named-0.102
 know-0.092
 have-0.090
 called-0.086
atters-0.082
Token "
Token ID366
Feature activation+2.30e+00

Pos logit contributions
 therapist+0.020
 Riley+0.016
 toddler+0.016
 Clare+0.015
rily+0.015
Neg logit contributions
 know-0.010
 done-0.010
atters-0.010
 named-0.010
weet-0.010
TokenMaybe
Token ID13300
Feature activation+2.77e-01

Pos logit contributions
 therapist+0.074
cer+0.056
 toddler+0.055
 Riley+0.053
 Clare+0.053
Neg logit contributions
 know-0.078
 named-0.076
atters-0.075
 have-0.074
 done-0.073
Token it
Token ID340
Feature activation+5.39e-01

Pos logit contributions
 therapist+0.011
cer+0.009
 toddler+0.009
 Riley+0.009
unia+0.009
Neg logit contributions
 know-0.008
 named-0.008
 have-0.007
 called-0.007
 done-0.007
Token can
Token ID460
Feature activation+8.95e-02

Pos logit contributions
 therapist+0.024
 toddler+0.020
cer+0.020
 Riley+0.019
 Clare+0.019
Neg logit contributions
 know-0.012
 named-0.011
 have-0.011
atters-0.011
 done-0.011
Token,
Token ID11
Feature activation+9.45e-02

Pos logit contributions
 have+0.021
 named+0.019
 know+0.018
 done+0.016
 One+0.015
Neg logit contributions
 therapist-0.105
cer-0.097
rily-0.096
rier-0.096
 toddler-0.094
Token Elsa
Token ID19226
Feature activation+3.93e-01

Pos logit contributions
cer+0.004
xc+0.003
 therapist+0.003
overed+0.003
mer+0.003
Neg logit contributions
 have-0.002
 named-0.002
 both-0.002
 much-0.002
 called-0.002
Token felt
Token ID2936
Feature activation-4.93e-01

Pos logit contributions
 therapist+0.018
rier+0.015
 Clare+0.015
rily+0.015
cer+0.015
Neg logit contributions
 named-0.008
 know-0.008
 have-0.007
 done-0.007
 called-0.007
Token a
Token ID257
Feature activation+3.62e+00

Pos logit contributions
 know+0.010
 named+0.010
 have+0.009
 done+0.009
atters+0.009
Neg logit contributions
 therapist-0.023
cer-0.019
 toddler-0.019
unia-0.019
 Clare-0.018
Token strange
Token ID6283
Feature activation+3.02e+00

Pos logit contributions
 therapist+0.153
cer+0.125
 Clare+0.121
 toddler+0.121
unia+0.121
Neg logit contributions
 know-0.088
 named-0.084
 have-0.082
atters-0.079
 done-0.078
Token sensation
Token ID18098
Feature activation+6.55e+00

Pos logit contributions
 therapist+0.124
unia+0.105
 Clare+0.102
 Amelia+0.100
 Riley+0.100
Neg logit contributions
 know-0.077
 named-0.076
 have-0.074
 done-0.073
 called-0.069
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
 therapist+0.298
 toddler+0.257
 Clare+0.251
unia+0.250
cer+0.250
Neg logit contributions
 named-0.136
 done-0.130
 have-0.128
 know-0.122
 called-0.120
Token the
Token ID262
Feature activation-3.98e-02

Pos logit contributions
 therapist+0.053
cer+0.045
rier+0.044
gers+0.044
overed+0.042
Neg logit contributions
 know-0.031
 have-0.030
 named-0.029
 done-0.028
 both-0.027
Token water
Token ID1660
Feature activation-5.91e-01

Pos logit contributions
 know+0.001
 named+0.001
atters+0.001
 have+0.001
 done+0.001
Neg logit contributions
 therapist-0.002
cer-0.002
 toddler-0.002
 Riley-0.001
unia-0.001
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
 named+0.007
 know+0.007
 have+0.007
 done+0.007
atters+0.006
Neg logit contributions
 therapist-0.031
 toddler-0.027
cer-0.027
 Clare-0.026
unia-0.026
Token What
Token ID1867
Feature activation+1.65e+00

Pos logit contributions
 therapist+0.018
cer+0.017
guard+0.016
unia+0.016
gers+0.016
Neg logit contributions
 One-0.005
 have-0.005
 know-0.005
 In-0.004
 Yesterday-0.004
Token It
Token ID632
Feature activation-9.47e-02

Pos logit contributions
 therapist+0.015
cer+0.013
unia+0.012
rier+0.012
ymes+0.012
Neg logit contributions
 know-0.005
 named-0.005
 have-0.004
 One-0.004
atters-0.004
Token was
Token ID373
Feature activation+8.14e-01

Pos logit contributions
 named+0.002
 have+0.002
 know+0.002
 called+0.002
 done+0.002
Neg logit contributions
 therapist-0.004
rier-0.004
 Clare-0.004
 toddler-0.003
cer-0.003
Token a
Token ID257
Feature activation+2.85e+00

Pos logit contributions
 therapist+0.043
cer+0.037
unia+0.036
 toddler+0.035
rier+0.035
Neg logit contributions
 named-0.013
 called-0.012
 have-0.012
 know-0.012
 done-0.011
Token tiny
Token ID7009
Feature activation+5.46e-01

Pos logit contributions
 therapist+0.134
rily+0.113
edly+0.112
overed+0.111
 Clare+0.111
Neg logit contributions
 named-0.056
 have-0.053
 know-0.052
 called-0.046
 done-0.045
Token green
Token ID4077
Feature activation+6.82e-01

Pos logit contributions
 therapist+0.025
 Riley+0.020
edly+0.020
 Clare+0.020
 Debbie+0.020
Neg logit contributions
 named-0.013
 have-0.011
 know-0.011
 called-0.011
 mud-0.010
Token thing
Token ID1517
Feature activation+6.55e+00

Pos logit contributions
 therapist+0.029
 Riley+0.025
 Clare+0.024
rier+0.024
rily+0.024
Neg logit contributions
 named-0.017
 have-0.015
 called-0.014
 know-0.013
 mud-0.013
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
 therapist+0.335
 Clare+0.282
 toddler+0.277
unia+0.276
cer+0.276
Neg logit contributions
 named-0.108
 called-0.097
 have-0.096
 know-0.091
 done-0.083
Token He
Token ID679
Feature activation+1.97e-01

Pos logit contributions
 therapist+0.020
cer+0.017
unia+0.016
rier+0.016
 toddler+0.016
Neg logit contributions
 know-0.008
 named-0.008
 have-0.007
atters-0.007
 One-0.007
Token picked
Token ID6497
Feature activation+1.40e+00

Pos logit contributions
 therapist+0.009
rier+0.007
cer+0.007
 Clare+0.007
 toddler+0.007
Neg logit contributions
 know-0.004
 named-0.004
 have-0.004
 done-0.004
atters-0.004
Token it
Token ID340
Feature activation+1.71e+00

Pos logit contributions
 therapist+0.063
edly+0.051
ymes+0.050
cer+0.049
rier+0.048
Neg logit contributions
 know-0.037
 have-0.036
 both-0.031
 named-0.030
my-0.030
Token up
Token ID510
Feature activation+3.12e+00

Pos logit contributions
 therapist+0.062
 Clare+0.046
cer+0.046
 toddler+0.045
rier+0.044
Neg logit contributions
 know-0.056
 have-0.054
 named-0.054
my-0.052
atters-0.051
Token A
Token ID317
Feature activation+6.19e-01

Pos logit contributions
 know+0.008
 done+0.008
 named+0.008
atters+0.008
 have+0.008
Neg logit contributions
 therapist-0.017
 toddler-0.014
 Riley-0.014
cer-0.014
 Clare-0.014
Token gear
Token ID7733
Feature activation+2.27e+00

Pos logit contributions
 therapist+0.022
 toddler+0.017
cer+0.017
 Riley+0.017
 Clare+0.017
Neg logit contributions
 know-0.019
atters-0.018
 named-0.018
 have-0.017
 done-0.017
Token is
Token ID318
Feature activation+1.52e+00

Pos logit contributions
 therapist+0.104
 toddler+0.088
 Clare+0.087
 Riley+0.087
cer+0.086
Neg logit contributions
 named-0.046
 have-0.045
 know-0.043
 done-0.041
atters-0.040
Token a
Token ID257
Feature activation+2.79e+00

Pos logit contributions
 therapist+0.065
cer+0.052
 toddler+0.052
 Riley+0.052
 Clare+0.051
Neg logit contributions
 know-0.036
atters-0.034
 named-0.033
 have-0.032
 done-0.031
Token round
Token ID2835
Feature activation+2.22e+00

Pos logit contributions
 therapist+0.113
cer+0.092
 toddler+0.090
 Riley+0.089
unia+0.089
Neg logit contributions
 know-0.071
 named-0.069
 have-0.066
atters-0.065
 done-0.064
Token thing
Token ID1517
Feature activation+6.52e+00

Pos logit contributions
 therapist+0.096
 Clare+0.079
unia+0.078
 Riley+0.078
 toddler+0.078
Neg logit contributions
 named-0.051
 know-0.049
 have-0.047
atters-0.044
 done-0.043
Token with
Token ID351
Feature activation+2.59e+00

Pos logit contributions
 therapist+0.310
cer+0.259
 toddler+0.259
 Clare+0.254
unia+0.253
Neg logit contributions
 named-0.119
 know-0.118
 have-0.110
atters-0.108
 done-0.105
Token teeth
Token ID9941
Feature activation-1.75e-01

Pos logit contributions
 therapist+0.111
cer+0.091
 toddler+0.090
 Riley+0.088
unia+0.088
Neg logit contributions
 know-0.060
 named-0.058
atters-0.056
 have-0.055
 done-0.054
Token that
Token ID326
Feature activation+2.22e-01

Pos logit contributions
 know+0.003
 named+0.003
atters+0.003
 have+0.003
 done+0.003
Neg logit contributions
 therapist-0.008
cer-0.007
 toddler-0.007
 Riley-0.007
unia-0.007
Token can
Token ID460
Feature activation-1.29e-01

Pos logit contributions
 therapist+0.010
cer+0.008
 toddler+0.008
rier+0.008
unia+0.008
Neg logit contributions
 know-0.005
 have-0.005
 named-0.005
atters-0.004
 done-0.004
Token move
Token ID1445
Feature activation-7.13e-02

Pos logit contributions
 know+0.003
 named+0.003
atters+0.002
 have+0.002
 done+0.002
Neg logit contributions
 therapist-0.006
cer-0.005
 toddler-0.005
 Riley-0.005
 Clare-0.005
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
 named+0.000
 have+0.000
 called+0.000
 know+0.000
 done+0.000
Neg logit contributions
 therapist-0.001
 toddler-0.001
 Clare-0.001
rier-0.001
cer-0.001
Token It
Token ID632
Feature activation+5.37e-02

Pos logit contributions
 therapist+0.024
cer+0.021
rier+0.020
unia+0.020
 toddler+0.020
Neg logit contributions
 know-0.009
 named-0.009
 have-0.008
atters-0.008
 One-0.007
Token was
Token ID373
Feature activation+9.71e-01

Pos logit contributions
 therapist+0.002
rier+0.002
 toddler+0.002
 Clare+0.002
cer+0.002
Neg logit contributions
 named-0.001
 have-0.001
 know-0.001
 called-0.001
 done-0.001
Token a
Token ID257
Feature activation+3.13e+00

Pos logit contributions
 therapist+0.049
cer+0.043
 toddler+0.041
rier+0.041
unia+0.040
Neg logit contributions
 named-0.017
 called-0.016
 know-0.015
 have-0.015
 done-0.015
Token navy
Token ID23956
Feature activation-1.76e-01

Pos logit contributions
 therapist+0.140
rily+0.117
edly+0.116
cer+0.116
overed+0.115
Neg logit contributions
 named-0.067
 know-0.063
 have-0.062
 called-0.056
 done-0.055
Token thing
Token ID1517
Feature activation+6.51e+00

Pos logit contributions
 named+0.005
 have+0.004
 called+0.004
 know+0.004
atters+0.004
Neg logit contributions
 therapist-0.007
 Riley-0.006
 Clare-0.006
 Debbie-0.006
edly-0.006
Token!
Token ID0
Feature activation+5.29e-01

Pos logit contributions
 therapist+0.330
cer+0.277
 toddler+0.276
 Clare+0.276
rier+0.274
Neg logit contributions
 named-0.111
 called-0.103
 have-0.099
 know-0.090
 done-0.086
Token She
Token ID1375
Feature activation-4.55e-01

Pos logit contributions
 therapist+0.026
cer+0.023
rier+0.022
unia+0.021
rily+0.021
Neg logit contributions
 know-0.009
 named-0.009
 have-0.009
atters-0.008
 One-0.008
Token looked
Token ID3114
Feature activation-9.12e-02

Pos logit contributions
 named+0.010
 know+0.009
 have+0.009
 called+0.009
 done+0.008
Neg logit contributions
 therapist-0.021
rier-0.018
cer-0.017
 Clare-0.017
 Riley-0.017
Token closer
Token ID5699
Feature activation+3.72e+00

Pos logit contributions
 named+0.002
 know+0.002
 have+0.002
 done+0.002
atters+0.002
Neg logit contributions
 therapist-0.004
cer-0.003
rier-0.003
 toddler-0.003
 Riley-0.003
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
 therapist+0.207
cer+0.175
 toddler+0.171
unia+0.171
rier+0.168
Neg logit contributions
 know-0.046
 named-0.042
 done-0.040
 have-0.038
 called-0.035
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
 know+0.019
atters+0.017
 named+0.016
 have+0.016
 done+0.015
Neg logit contributions
 therapist-0.056
cer-0.047
 toddler-0.046
unia-0.045
 Riley-0.045
Token It
Token ID632
Feature activation-2.38e-01

Pos logit contributions
 therapist+0.006
 toddler+0.005
cer+0.005
 Riley+0.005
 Clare+0.005
Neg logit contributions
 know-0.003
 named-0.003
atters-0.003
 done-0.003
 have-0.003
Token was
Token ID373
Feature activation+7.77e-01

Pos logit contributions
 know+0.006
 named+0.006
 have+0.006
 called+0.005
 done+0.005
Neg logit contributions
 therapist-0.010
rier-0.008
 Clare-0.008
cer-0.008
 toddler-0.008
Token a
Token ID257
Feature activation+2.88e+00

Pos logit contributions
 therapist+0.038
cer+0.032
unia+0.031
 toddler+0.031
 Clare+0.031
Neg logit contributions
 named-0.015
 know-0.014
 have-0.013
 done-0.013
 called-0.013
Token shiny
Token ID22441
Feature activation+2.15e+00

Pos logit contributions
 therapist+0.130
cer+0.108
rily+0.107
 Riley+0.106
 Clare+0.106
Neg logit contributions
 named-0.060
 know-0.058
 have-0.056
 done-0.051
 called-0.050
Token thing
Token ID1517
Feature activation+6.51e+00

Pos logit contributions
 therapist+0.090
 Riley+0.073
 Clare+0.071
 Debbie+0.071
edly+0.070
Neg logit contributions
 named-0.056
 have-0.051
 know-0.049
 called-0.047
 done-0.046
Token that
Token ID326
Feature activation+8.22e-01

Pos logit contributions
 therapist+0.319
 Clare+0.267
 toddler+0.264
cer+0.264
unia+0.264
Neg logit contributions
 named-0.118
 have-0.106
 know-0.106
 called-0.101
 done-0.097
Token looked
Token ID3114
Feature activation-6.88e-01

Pos logit contributions
 therapist+0.040
rier+0.036
rily+0.035
 Clare+0.034
cer+0.034
Neg logit contributions
 have-0.016
 know-0.015
 named-0.015
 called-0.015
 done-0.015
Token interesting
Token ID3499
Feature activation+2.68e+00

Pos logit contributions
 know+0.009
 named+0.009
 have+0.008
atters+0.007
 done+0.007
Neg logit contributions
 therapist-0.037
cer-0.031
 toddler-0.031
 Riley-0.031
unia-0.031
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 therapist+0.156
 toddler+0.132
 Clare+0.131
 Riley+0.131
cer+0.131
Neg logit contributions
 named-0.022
 know-0.022
 have-0.020
 done-0.019
 called-0.016
Token it
Token ID340
Feature activation+3.18e+00

Pos logit contributions
 therapist+0.062
cer+0.051
unia+0.051
rier+0.051
 toddler+0.051
Neg logit contributions
 know-0.029
 have-0.029
 named-0.028
 done-0.027
 called-0.027
Token Fin
Token ID4463
Feature activation-2.17e-02

Pos logit contributions
 know+0.004
 named+0.004
 have+0.004
atters+0.004
 done+0.004
Neg logit contributions
 therapist-0.009
cer-0.007
 toddler-0.007
 Riley-0.007
 Clare-0.007
Token saw
Token ID2497
Feature activation-1.15e+00

Pos logit contributions
atters+0.001
 know+0.001
opl+0.001
ric+0.001
 named+0.001
Neg logit contributions
 therapist-0.001
cer-0.001
unia-0.001
 toddler-0.001
 Riley-0.001
Token a
Token ID257
Feature activation+2.06e+00

Pos logit contributions
 know+0.031
atters+0.031
 done+0.029
 have+0.029
 named+0.029
Neg logit contributions
 therapist-0.043
cer-0.034
 toddler-0.034
 Riley-0.034
unia-0.033
Token big
Token ID1263
Feature activation+7.59e-01

Pos logit contributions
 therapist+0.099
cer+0.082
 Riley+0.081
 Clare+0.080
 toddler+0.080
Neg logit contributions
 know-0.037
 named-0.037
 have-0.034
 done-0.032
atters-0.031
Token green
Token ID4077
Feature activation+7.21e-01

Pos logit contributions
 therapist+0.032
cer+0.026
 toddler+0.025
 Clare+0.025
 Riley+0.025
Neg logit contributions
 named-0.018
 know-0.018
 have-0.017
 done-0.016
atters-0.016
Token thing
Token ID1517
Feature activation+6.50e+00

Pos logit contributions
 therapist+0.028
 Riley+0.022
 Clare+0.022
 toddler+0.022
cer+0.022
Neg logit contributions
 named-0.019
 know-0.018
 have-0.018
atters-0.017
 done-0.017
Token floating
Token ID12462
Feature activation+2.11e+00

Pos logit contributions
 therapist+0.257
cer+0.206
 toddler+0.204
 Riley+0.200
unia+0.198
Neg logit contributions
 know-0.172
 named-0.166
atters-0.161
 have-0.159
 done-0.156
Token in
Token ID287
Feature activation+1.34e+00

Pos logit contributions
 therapist+0.091
 Clare+0.075
 Riley+0.073
 Amelia+0.073
 toddler+0.072
Neg logit contributions
 named-0.052
 called-0.046
 done-0.046
 have-0.045
 know-0.044
Token the
Token ID262
Feature activation+3.45e-01

Pos logit contributions
 therapist+0.045
 toddler+0.037
cer+0.037
 Riley+0.036
unia+0.035
Neg logit contributions
atters-0.041
 know-0.039
 named-0.038
 done-0.038
ric-0.036
Token water
Token ID1660
Feature activation-8.15e-01

Pos logit contributions
 therapist+0.016
cer+0.013
 toddler+0.013
 Riley+0.013
unia+0.013
Neg logit contributions
 know-0.007
 named-0.007
atters-0.007
 have-0.006
 done-0.006
Token.
Token ID13
Feature activation+6.04e-01

Pos logit contributions
 know+0.019
atters+0.017
 named+0.015
 have+0.015
iz+0.015
Neg logit contributions
 therapist-0.035
cer-0.030
 kernels-0.028
 toddler-0.028
rier-0.028
Token of
Token ID286
Feature activation+3.26e-01

Pos logit contributions
 therapist+0.070
cer+0.059
 toddler+0.059
 Riley+0.058
unia+0.058
Neg logit contributions
 know-0.019
 named-0.018
atters-0.017
 have-0.016
 done-0.016
Token sticks
Token ID16461
Feature activation+2.69e-01

Pos logit contributions
 therapist+0.015
cer+0.012
 toddler+0.012
 Riley+0.012
unia+0.012
Neg logit contributions
 know-0.007
 named-0.007
 have-0.007
atters-0.006
 done-0.006
Token.
Token ID13
Feature activation+2.13e-01

Pos logit contributions
 therapist+0.012
cer+0.010
 toddler+0.010
 Riley+0.010
unia+0.010
Neg logit contributions
 know-0.005
 named-0.005
atters-0.005
 have-0.004
 done-0.004
Token Sue
Token ID26089
Feature activation+8.41e-01

Pos logit contributions
 therapist+0.009
 toddler+0.008
overed+0.008
 Clare+0.008
 Riley+0.008
Neg logit contributions
 done-0.004
ric-0.004
 know-0.004
atters-0.004
inations-0.004
Token felt
Token ID2936
Feature activation+3.45e-01

Pos logit contributions
 therapist+0.032
 toddler+0.026
cer+0.026
unia+0.025
 Riley+0.025
Neg logit contributions
atters-0.022
 know-0.022
weet-0.020
ric-0.020
 named-0.020
Token something
Token ID1223
Feature activation+6.50e+00

Pos logit contributions
 therapist+0.017
cer+0.014
 toddler+0.014
 Riley+0.014
 Clare+0.014
Neg logit contributions
 know-0.006
 named-0.006
atters-0.006
 have-0.005
 done-0.005
Token was
Token ID373
Feature activation+1.43e+00

Pos logit contributions
 therapist+0.320
cer+0.272
 toddler+0.266
unia+0.264
 Clare+0.264
Neg logit contributions
 know-0.118
 named-0.111
 have-0.108
 done-0.107
atters-0.101
Token going
Token ID1016
Feature activation+1.07e+00

Pos logit contributions
 therapist+0.048
overed+0.039
rier+0.038
cer+0.037
unia+0.037
Neg logit contributions
 know-0.041
atters-0.039
ric-0.038
weet-0.037
 Yesterday-0.037
Token to
Token ID284
Feature activation+5.77e-01

Pos logit contributions
 therapist+0.041
cer+0.033
 toddler+0.032
unia+0.032
 Riley+0.031
Neg logit contributions
 know-0.029
 named-0.028
atters-0.028
 have-0.027
 done-0.026
Token happen
Token ID1645
Feature activation+1.64e+00

Pos logit contributions
 therapist+0.025
cer+0.021
 toddler+0.021
 Clare+0.021
 Riley+0.020
Neg logit contributions
 named-0.011
atters-0.011
 know-0.011
o-0.010
 done-0.010
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
 therapist+0.094
 toddler+0.080
cer+0.080
unia+0.079
 Riley+0.078
Neg logit contributions
 know-0.016
 named-0.016
 have-0.014
 done-0.013
atters-0.011
Token The
Token ID383
Feature activation-1.37e+00

Pos logit contributions
 know+0.001
 named+0.001
atters+0.001
 done+0.001
 have+0.001
Neg logit contributions
 therapist-0.004
 toddler-0.003
 Riley-0.003
 Clare-0.003
cer-0.003
Token doctor
Token ID6253
Feature activation+1.22e+00

Pos logit contributions
 named+0.035
 know+0.034
 have+0.034
 done+0.032
 called+0.031
Neg logit contributions
 therapist-0.052
cer-0.046
rily-0.044
 Riley-0.044
unia-0.044
Token put
Token ID1234
Feature activation+1.64e+00

Pos logit contributions
 therapist+0.055
cer+0.046
rier+0.046
 toddler+0.045
 Riley+0.045
Neg logit contributions
 named-0.026
 know-0.026
 have-0.024
 done-0.023
 called-0.022
Token a
Token ID257
Feature activation+2.60e+00

Pos logit contributions
 therapist+0.079
cer+0.067
 toddler+0.066
edly+0.066
overed+0.065
Neg logit contributions
 have-0.031
 done-0.029
 both-0.028
my-0.028
 named-0.027
Token special
Token ID2041
Feature activation+4.39e+00

Pos logit contributions
 therapist+0.108
cer+0.091
unia+0.088
 Clare+0.088
 Riley+0.088
Neg logit contributions
 know-0.064
 named-0.061
 have-0.059
 done-0.056
atters-0.054
Token thing
Token ID1517
Feature activation+6.49e+00

Pos logit contributions
 therapist+0.175
rily+0.157
cer+0.157
unia+0.156
 Clare+0.153
Neg logit contributions
 named-0.108
 know-0.106
 have-0.104
 called-0.096
 done-0.095
Token on
Token ID319
Feature activation-4.29e-02

Pos logit contributions
 therapist+0.291
 Clare+0.242
 toddler+0.242
cer+0.240
unia+0.236
Neg logit contributions
 named-0.139
 have-0.134
 know-0.130
 done-0.128
 called-0.128
Token the
Token ID262
Feature activation-6.32e-01

Pos logit contributions
 named+0.001
 have+0.001
 know+0.001
 One+0.001
 called+0.001
Neg logit contributions
 therapist-0.002
cer-0.002
 toddler-0.002
overed-0.002
edly-0.002
Token little
Token ID1310
Feature activation-2.48e+00

Pos logit contributions
 named+0.016
 know+0.015
 have+0.015
 done+0.015
atters+0.014
Neg logit contributions
 therapist-0.025
cer-0.021
 Riley-0.020
 Clare-0.020
unia-0.020
Token girl
Token ID2576
Feature activation-2.46e-01

Pos logit contributions
 named+0.075
 have+0.071
my+0.070
 called+0.069
."+0.067
Neg logit contributions
 therapist-0.080
overed-0.072
 Riley-0.067
 Clare-0.066
rily-0.066
Token's
Token ID338
Feature activation+7.39e-02

Pos logit contributions
 know+0.006
 named+0.005
atters+0.005
 have+0.005
 done+0.005
Neg logit contributions
 therapist-0.011
cer-0.009
 toddler-0.009
 Riley-0.008
 Clare-0.008
Token tight
Token ID5381
Feature activation+1.90e+00

Pos logit contributions
 therapist+0.093
cer+0.083
overed+0.080
 avail+0.080
gers+0.078
Neg logit contributions
 named-0.043
 know-0.043
 have-0.041
 done-0.039
 called-0.037
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 therapist+0.099
cer+0.087
 Riley+0.085
unia+0.084
 Clare+0.084
Neg logit contributions
 named-0.024
 know-0.024
 have-0.021
 done-0.020
 called-0.020
Token put
Token ID1234
Feature activation+1.67e+00

Pos logit contributions
 therapist+0.058
cer+0.054
rier+0.053
unia+0.052
gers+0.052
Neg logit contributions
 named-0.020
 know-0.019
 called-0.018
 have-0.018
 done-0.017
Token a
Token ID257
Feature activation+2.57e+00

Pos logit contributions
 therapist+0.075
cer+0.068
 toddler+0.063
overed+0.062
rier+0.062
Neg logit contributions
 have-0.037
 done-0.035
 know-0.035
 named-0.034
my-0.033
Token special
Token ID2041
Feature activation+4.26e+00

Pos logit contributions
 therapist+0.100
cer+0.083
 Riley+0.079
 toddler+0.079
unia+0.079
Neg logit contributions
 know-0.068
 named-0.067
 have-0.064
 done-0.062
atters-0.061
Token thing
Token ID1517
Feature activation+6.48e+00

Pos logit contributions
 therapist+0.169
cer+0.152
rily+0.148
 Riley+0.146
unia+0.146
Neg logit contributions
 named-0.106
 know-0.103
 have-0.099
 done-0.095
 called-0.090
Token in
Token ID287
Feature activation+4.67e-01

Pos logit contributions
 therapist+0.285
cer+0.239
 toddler+0.237
 Clare+0.233
 Riley+0.231
Neg logit contributions
 named-0.139
 know-0.138
 have-0.132
atters-0.127
 done-0.127
Token her
Token ID607
Feature activation+2.79e-01

Pos logit contributions
 therapist+0.019
cer+0.016
rier+0.016
overed+0.016
gers+0.015
Neg logit contributions
 know-0.012
 have-0.012
 named-0.012
 done-0.011
 called-0.010
Token mouth
Token ID5422
Feature activation-5.83e-02

Pos logit contributions
 therapist+0.011
 toddler+0.009
cer+0.009
 Riley+0.009
 Clare+0.009
Neg logit contributions
 know-0.007
 named-0.007
atters-0.007
 have-0.007
 done-0.006
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
 know+0.001
 named+0.001
atters+0.001
 have+0.001
 done+0.001
Neg logit contributions
 therapist-0.003
cer-0.003
 toddler-0.003
 Riley-0.002
 Clare-0.002
Token It
Token ID632
Feature activation+1.29e-01

Pos logit contributions
 therapist+0.028
 toddler+0.023
cer+0.023
 Riley+0.023
 Clare+0.023
Neg logit contributions
 know-0.013
 named-0.013
atters-0.013
 done-0.012
 have-0.012
Token and
Token ID290
Feature activation+9.32e-01

Pos logit contributions
 therapist+0.056
rified+0.047
unia+0.046
overed+0.046
rier+0.046
Neg logit contributions
 have-0.031
 named-0.031
 know-0.030
 done-0.027
my-0.026
Token her
Token ID607
Feature activation+1.24e+00

Pos logit contributions
 therapist+0.047
rier+0.046
ymes+0.042
rified+0.042
unia+0.042
Neg logit contributions
 named-0.017
 called-0.015
 have-0.015
 know-0.014
 done-0.012
Token mom
Token ID1995
Feature activation+8.07e-01

Pos logit contributions
 therapist+0.057
cer+0.048
 toddler+0.046
unia+0.046
 Riley+0.046
Neg logit contributions
 know-0.025
 named-0.024
 have-0.023
 done-0.022
atters-0.022
Token got
Token ID1392
Feature activation+3.84e-01

Pos logit contributions
 therapist+0.036
cer+0.029
 toddler+0.029
 Riley+0.029
unia+0.028
Neg logit contributions
 know-0.018
 named-0.017
atters-0.016
 have-0.016
 done-0.016
Token a
Token ID257
Feature activation+3.02e+00

Pos logit contributions
 therapist+0.019
cer+0.017
unia+0.016
edly+0.015
 avail+0.015
Neg logit contributions
 know-0.008
 named-0.007
 done-0.007
 have-0.007
 called-0.007
Token sund
Token ID37437
Feature activation+6.47e+00

Pos logit contributions
 therapist+0.136
cer+0.113
edly+0.113
unia+0.111
 Riley+0.110
Neg logit contributions
 know-0.066
 have-0.063
 named-0.062
 done-0.059
atters-0.054
Tokenae
Token ID3609
Feature activation+4.82e+00

Pos logit contributions
 therapist+0.172
cer+0.128
 toddler+0.119
rier+0.115
unia+0.110
Neg logit contributions
 know-0.261
 named-0.251
atters-0.248
 have-0.246
weet-0.237
Token with
Token ID351
Feature activation+3.13e+00

Pos logit contributions
 therapist+0.235
cer+0.199
 Riley+0.195
unia+0.195
 toddler+0.194
Neg logit contributions
 know-0.083
 named-0.083
 have-0.082
 done-0.075
atters-0.067
Token lots
Token ID6041
Feature activation+4.41e-01

Pos logit contributions
 therapist+0.147
cer+0.130
rier+0.123
unia+0.120
gers+0.119
Neg logit contributions
 know-0.066
 named-0.062
 have-0.060
 done-0.056
 both-0.056
Token of
Token ID286
Feature activation+8.16e-01

Pos logit contributions
 therapist+0.023
cer+0.019
 toddler+0.019
 Riley+0.019
 Clare+0.019
Neg logit contributions
 know-0.006
 named-0.006
atters-0.006
 have-0.005
 done-0.005
Token topp
Token ID23126
Feature activation+3.12e+00

Pos logit contributions
 therapist+0.035
cer+0.030
unia+0.029
rier+0.028
 toddler+0.028
Neg logit contributions
 know-0.020
 named-0.018
 have-0.018
 done-0.018
atters-0.016
Token She
Token ID1375
Feature activation-7.22e-01

Pos logit contributions
 therapist+0.016
 toddler+0.013
 Riley+0.013
 Amelia+0.013
unia+0.012
Neg logit contributions
 done-0.009
atters-0.009
 know-0.009
 named-0.009
ric-0.009
Token asked
Token ID1965
Feature activation-7.71e-01

Pos logit contributions
atters+0.020
 know+0.019
 done+0.018
weet+0.017
ric+0.017
Neg logit contributions
 therapist-0.028
 toddler-0.022
unia-0.021
 Riley-0.021
cer-0.021
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
 know+0.018
atters+0.017
 named+0.017
 done+0.016
 have+0.016
Neg logit contributions
 therapist-0.032
cer-0.026
 toddler-0.026
 Riley-0.026
unia-0.026
Token "
Token ID366
Feature activation+1.92e+00

Pos logit contributions
 therapist+0.018
 Riley+0.014
 toddler+0.014
 Clare+0.014
 Amelia+0.013
Neg logit contributions
atters-0.016
 know-0.015
 done-0.014
inations-0.014
opl-0.014
TokenDog
Token ID32942
Feature activation-2.67e+00

Pos logit contributions
 therapist+0.073
rier+0.058
cer+0.058
rily+0.057
 Riley+0.057
Neg logit contributions
 named-0.055
 know-0.054
 have-0.051
my-0.050
atters-0.048
Tokengy
Token ID1360
Feature activation-8.52e+00

Pos logit contributions
atters+0.070
 done+0.069
 know+0.067
inations+0.066
 supposed+0.062
Neg logit contributions
 therapist-0.095
cer-0.081
unia-0.079
 Riley-0.075
 Amelia-0.075
Token,
Token ID11
Feature activation-3.00e-01

Pos logit contributions
atters+0.180
 know+0.179
weet+0.170
inations+0.167
 done+0.161
Neg logit contributions
 therapist-0.342
cer-0.288
unia-0.282
 toddler-0.281
 Riley-0.276
Token can
Token ID460
Feature activation-1.91e-01

Pos logit contributions
atters+0.006
 know+0.005
 named+0.005
 done+0.005
 Yesterday+0.005
Neg logit contributions
 therapist-0.013
 toddler-0.011
 Riley-0.011
 Clare-0.011
cer-0.011
Token you
Token ID345
Feature activation+4.82e-01

Pos logit contributions
 know+0.005
 named+0.004
 have+0.004
atters+0.004
 done+0.004
Neg logit contributions
 therapist-0.008
cer-0.007
 toddler-0.007
rier-0.007
rily-0.007
Token help
Token ID1037
Feature activation+8.29e-01

Pos logit contributions
 therapist+0.016
unia+0.013
 parcel+0.013
 toddler+0.013
 jew+0.013
Neg logit contributions
ric-0.013
atters-0.012
opl-0.012
weet-0.011
 brothers-0.011
Token me
Token ID502
Feature activation-1.56e+00

Pos logit contributions
 therapist+0.026
unia+0.021
 Riley+0.020
cer+0.020
 toddler+0.020
Neg logit contributions
atters-0.026
 done-0.024
ric-0.024
 Yesterday-0.024
 know-0.024
Token should
Token ID815
Feature activation-2.70e+00

Pos logit contributions
 therapist+0.041
 toddler+0.036
cer+0.036
unia+0.035
ed+0.034
Neg logit contributions
opl-0.031
weet-0.028
atters-0.027
arter-0.027
ric-0.027
Token not
Token ID407
Feature activation-2.80e+00

Pos logit contributions
atters+0.045
weet+0.042
 named+0.040
 know+0.039
o+0.039
Neg logit contributions
 therapist-0.122
cer-0.102
 Clare-0.102
 Riley-0.101
unia-0.100
Token tease
Token ID34691
Feature activation-8.70e-01

Pos logit contributions
inations+0.065
aver+0.063
atters+0.063
 Yesterday+0.062
 glad+0.062
Neg logit contributions
 therapist-0.091
cer-0.086
No-0.080
 toddler-0.077
guard-0.076
Token the
Token ID262
Feature activation-2.07e+00

Pos logit contributions
 know+0.020
 named+0.020
 have+0.019
atters+0.018
 done+0.018
Neg logit contributions
 therapist-0.038
 toddler-0.031
cer-0.030
rily-0.030
 Clare-0.030
Token ost
Token ID23619
Feature activation-3.49e+00

Pos logit contributions
 know+0.060
atters+0.059
 named+0.059
 have+0.055
 Yesterday+0.055
Neg logit contributions
 therapist-0.074
rier-0.058
 toddler-0.057
unia-0.056
 Clare-0.056
Tokenric
Token ID1173
Feature activation-8.50e+00

Pos logit contributions
 know+0.075
 Yesterday+0.071
 have+0.065
 much+0.061
 named+0.059
Neg logit contributions
 therapist-0.133
unia-0.127
cer-0.127
rier-0.125
overed-0.121
Tokenhes
Token ID956
Feature activation-2.18e+00

Pos logit contributions
 know+0.178
 named+0.173
 Yesterday+0.160
atters+0.156
 have+0.153
Neg logit contributions
 therapist-0.361
rier-0.300
cer-0.296
unia-0.294
 kernels-0.288
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
 know+0.033
 named+0.032
atters+0.030
 have+0.029
 done+0.028
Neg logit contributions
 therapist-0.110
cer-0.093
 toddler-0.093
 Riley-0.091
 Clare-0.090
Token They
Token ID1119
Feature activation-1.76e+00

Pos logit contributions
 done+0.044
inations+0.038
 know+0.037
 built+0.036
 named+0.035
Neg logit contributions
 therapist-0.060
rier-0.051
 Clare-0.051
 Riley-0.050
 toddler-0.048
Token are
Token ID389
Feature activation+3.62e-01

Pos logit contributions
atters+0.031
 named+0.031
 know+0.030
weet+0.028
 done+0.027
Neg logit contributions
 therapist-0.080
cer-0.067
 toddler-0.066
unia-0.065
 Riley-0.064
Token not
Token ID407
Feature activation-6.49e-01

Pos logit contributions
 therapist+0.016
cer+0.013
rier+0.013
 Riley+0.013
 toddler+0.013
Neg logit contributions
atters-0.007
 know-0.007
 Yesterday-0.006
 named-0.006
 One-0.006
Token,
Token ID11
Feature activation+1.20e+00

Pos logit contributions
 named+0.006
 know+0.006
 have+0.005
 done+0.004
 much+0.004
Neg logit contributions
 therapist-0.040
cer-0.036
 toddler-0.035
unia-0.034
 Riley-0.033
Token "
Token ID366
Feature activation+2.44e+00

Pos logit contributions
 therapist+0.047
 toddler+0.038
 Riley+0.038
 Clare+0.037
rier+0.036
Neg logit contributions
 know-0.032
atters-0.031
 named-0.029
 done-0.029
 have-0.028
TokenHello
Token ID15496
Feature activation-5.98e-01

Pos logit contributions
 therapist+0.081
rier+0.061
 Riley+0.061
cer+0.060
 toddler+0.060
Neg logit contributions
 know-0.079
 named-0.079
 have-0.076
atters-0.074
my-0.074
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
 know+0.012
atters+0.011
 named+0.011
 have+0.011
 done+0.011
Neg logit contributions
 therapist-0.028
cer-0.023
 toddler-0.023
 Riley-0.022
unia-0.022
Token pig
Token ID12967
Feature activation-3.05e+00

Pos logit contributions
atters+0.040
 know+0.038
 done+0.038
 named+0.036
inations+0.035
Neg logit contributions
 therapist-0.042
 toddler-0.034
 Clare-0.034
 Riley-0.033
rier-0.033
Tokengy
Token ID1360
Feature activation-8.11e+00

Pos logit contributions
 know+0.070
atters+0.068
 done+0.066
 have+0.062
 Yesterday+0.062
Neg logit contributions
 therapist-0.127
cer-0.113
unia-0.105
 toddler-0.103
 Riley-0.102
Token!
Token ID0
Feature activation-1.75e+00

Pos logit contributions
 know+0.162
atters+0.146
 have+0.141
 done+0.140
 named+0.137
Neg logit contributions
 therapist-0.372
cer-0.317
 toddler-0.310
unia-0.304
 Riley-0.300
Token You
Token ID921
Feature activation+1.15e+00

Pos logit contributions
 done+0.053
 know+0.050
atters+0.048
 named+0.044
inations+0.044
Neg logit contributions
 therapist-0.064
 toddler-0.052
 Riley-0.051
 Clare-0.050
 Amelia-0.049
Token are
Token ID389
Feature activation+4.10e-01

Pos logit contributions
 therapist+0.046
unia+0.040
 toddler+0.040
cer+0.040
 parcel+0.039
Neg logit contributions
atters-0.023
 named-0.021
ric-0.021
weet-0.021
omo-0.021
Token so
Token ID523
Feature activation-1.71e+00

Pos logit contributions
 therapist+0.019
 toddler+0.016
cer+0.016
 Riley+0.016
 Clare+0.016
Neg logit contributions
 know-0.008
atters-0.008
 named-0.007
 have-0.007
 done-0.007
Token big
Token ID1263
Feature activation-2.18e+00

Pos logit contributions
atters+0.042
 know+0.042
 done+0.040
ric+0.040
 named+0.039
Neg logit contributions
 therapist-0.065
 Clare-0.054
 toddler-0.053
unia-0.052
rier-0.052
Token but
Token ID475
Feature activation-7.91e-01

Pos logit contributions
 know+0.032
 have+0.032
 named+0.031
 done+0.029
weet+0.027
Neg logit contributions
 therapist-0.074
cer-0.063
unia-0.061
edly-0.060
 toddler-0.060
Token be
Token ID307
Feature activation-4.63e-01

Pos logit contributions
 know+0.014
 named+0.013
 have+0.013
 done+0.012
 called+0.011
Neg logit contributions
 therapist-0.040
cer-0.033
 toddler-0.033
 Riley-0.032
unia-0.032
Token careful
Token ID8161
Feature activation+3.17e+00

Pos logit contributions
atters+0.011
 know+0.011
 Yesterday+0.011
 have+0.010
 named+0.010
Neg logit contributions
 therapist-0.018
rier-0.015
cer-0.015
unia-0.015
 Riley-0.014
Token.
Token ID13
Feature activation-2.01e+00

Pos logit contributions
 therapist+0.159
cer+0.134
 toddler+0.133
 Riley+0.131
unia+0.130
Neg logit contributions
 know-0.051
 named-0.048
atters-0.046
 have-0.044
 done-0.043
Token Ost
Token ID38919
Feature activation-4.15e-01

Pos logit contributions
 done+0.059
 supposed+0.051
inations+0.050
 know+0.050
atters+0.048
Neg logit contributions
 therapist-0.074
 Riley-0.063
rier-0.060
 Clare-0.060
 toddler-0.058
Tokenric
Token ID1173
Feature activation-8.07e+00

Pos logit contributions
 know+0.006
atters+0.006
inations+0.006
 named+0.006
ooters+0.005
Neg logit contributions
 therapist-0.018
unia-0.018
cer-0.017
 toddler-0.016
 Riley-0.016
Tokenhes
Token ID956
Feature activation-2.81e+00

Pos logit contributions
 named+0.175
 know+0.167
 Yesterday+0.159
atters+0.150
 One+0.144
Neg logit contributions
 therapist-0.326
rier-0.271
unia-0.268
 kernels-0.267
cer-0.266
Token can
Token ID460
Feature activation-9.17e-01

Pos logit contributions
 know+0.062
 named+0.060
atters+0.059
 have+0.055
 done+0.055
Neg logit contributions
 therapist-0.121
cer-0.099
 toddler-0.097
 Riley-0.096
unia-0.096
Token be
Token ID307
Feature activation-2.11e+00

Pos logit contributions
atters+0.020
 done+0.019
 named+0.018
iz+0.017
 know+0.017
Neg logit contributions
 therapist-0.038
unia-0.030
 Riley-0.030
 Clare-0.030
 kernels-0.030
Token gr
Token ID1036
Feature activation-2.11e+00

Pos logit contributions
 Yesterday+0.050
atters+0.050
 know+0.048
 named+0.045
opl+0.045
Neg logit contributions
 therapist-0.078
cer-0.065
unia-0.065
rier-0.064
rily-0.063
Tokenumpy
Token ID32152
Feature activation-4.12e+00

Pos logit contributions
 named+0.063
 have+0.059
 Yesterday+0.058
 done+0.054
 build+0.054
Neg logit contributions
ingly-0.059
 therapist-0.059
ymes-0.056
cer-0.054
rier-0.052
Token you
Token ID345
Feature activation-5.93e-01

Pos logit contributions
 know+0.001
 have+0.001
 named+0.001
atters+0.001
 done+0.001
Neg logit contributions
 therapist-0.001
cer-0.001
 toddler-0.001
rily-0.001
rier-0.001
Token count
Token ID954
Feature activation-2.34e-02

Pos logit contributions
atters+0.015
 supposed+0.014
opl+0.013
 done+0.013
ric+0.013
Neg logit contributions
 therapist-0.020
 Clare-0.017
unia-0.017
 Amelia-0.016
 brid-0.016
Token how
Token ID703
Feature activation+1.95e+00

Pos logit contributions
 know+0.001
atters+0.001
 named+0.001
 done+0.001
 have+0.001
Neg logit contributions
 therapist-0.001
 Riley-0.001
cer-0.001
 Clare-0.001
rier-0.001
Token many
Token ID867
Feature activation-2.43e+00

Pos logit contributions
 therapist+0.079
 toddler+0.064
 Clare+0.063
 Riley+0.063
cer+0.062
Neg logit contributions
 know-0.046
atters-0.046
 named-0.045
 done-0.042
 have-0.042
Token ost
Token ID23619
Feature activation-3.02e+00

Pos logit contributions
atters+0.072
 know+0.068
 named+0.066
 Yesterday+0.064
 done+0.064
Neg logit contributions
 therapist-0.083
 Clare-0.066
unia-0.065
 toddler-0.064
cer-0.063
Tokenric
Token ID1173
Feature activation-8.06e+00

Pos logit contributions
 know+0.058
 named+0.055
 Yesterday+0.054
 much+0.053
 have+0.051
Neg logit contributions
unia-0.110
 therapist-0.108
cer-0.107
overed-0.104
ed-0.101
Tokenhes
Token ID956
Feature activation-1.93e+00

Pos logit contributions
 named+0.178
omo+0.168
 Yesterday+0.168
 know+0.167
atters+0.161
Neg logit contributions
 therapist-0.306
unia-0.251
ymes-0.251
overed-0.251
 kernels-0.251
Token there
Token ID612
Feature activation+6.34e-01

Pos logit contributions
 know+0.039
 named+0.037
atters+0.036
 have+0.035
 done+0.034
Neg logit contributions
 therapist-0.088
cer-0.073
 toddler-0.073
 Riley-0.071
 Clare-0.071
Token are
Token ID389
Feature activation-1.76e+00

Pos logit contributions
 therapist+0.031
cer+0.026
 Clare+0.026
unia+0.026
 toddler+0.026
Neg logit contributions
atters-0.009
 named-0.009
 know-0.009
 done-0.007
iz-0.007
Token?"
Token ID1701
Feature activation+7.71e-01

Pos logit contributions
atters+0.042
 Yesterday+0.036
 named+0.035
 know+0.035
iz+0.034
Neg logit contributions
 therapist-0.064
 Clare-0.053
unia-0.053
 kernels-0.053
rier-0.052
Token
Token ID198
Feature activation-2.86e-02

Pos logit contributions
 therapist+0.031
 Riley+0.026
 Rosa+0.025
 toddler+0.025
 Clare+0.025
Neg logit contributions
 supposed-0.019
ric-0.019
atters-0.018
opl-0.018
 done-0.018
Token shouldn
Token ID6584
Feature activation-3.25e+00

Pos logit contributions
 therapist+0.079
unia+0.071
 toddler+0.071
 parcel+0.070
 Clare+0.068
Neg logit contributions
opl-0.035
weet-0.032
atters-0.030
ric-0.029
aver-0.028
Token't
Token ID470
Feature activation-4.45e+00

Pos logit contributions
 know+0.058
 named+0.057
atters+0.056
 done+0.052
 have+0.049
Neg logit contributions
 therapist-0.154
 toddler-0.128
cer-0.127
 Riley-0.126
unia-0.126
Token run
Token ID1057
Feature activation-4.22e+00

Pos logit contributions
atters+0.092
 done+0.088
weet+0.088
aver+0.088
ric+0.087
Neg logit contributions
 therapist-0.178
cer-0.151
unia-0.151
 toddler-0.150
 Clare-0.146
Token after
Token ID706
Feature activation+1.01e+00

Pos logit contributions
 know+0.104
 named+0.103
 have+0.099
atters+0.095
 done+0.094
Neg logit contributions
 therapist-0.180
 toddler-0.146
cer-0.143
 Riley-0.141
unia-0.139
Token ambul
Token ID18907
Feature activation-3.58e+00

Pos logit contributions
 therapist+0.038
cer+0.031
 toddler+0.030
unia+0.029
rier+0.029
Neg logit contributions
 know-0.028
 named-0.028
 have-0.027
atters-0.027
 done-0.026
Tokenances
Token ID1817
Feature activation-8.03e+00

Pos logit contributions
 named+0.108
 have+0.097
 know+0.094
atters+0.094
 Yesterday+0.091
Neg logit contributions
 therapist-0.118
cer-0.094
ingly-0.094
rier-0.093
edly-0.093
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
 know+0.112
 named+0.105
 have+0.098
atters+0.097
 done+0.094
Neg logit contributions
 therapist-0.419
cer-0.357
 toddler-0.355
 Riley-0.349
unia-0.347
Token It
Token ID632
Feature activation-1.63e+00

Pos logit contributions
 done+0.039
 know+0.034
atters+0.033
 built+0.032
inations+0.032
Neg logit contributions
 therapist-0.060
 Clare-0.050
 toddler-0.050
 Riley-0.049
 Amelia-0.048
Token's
Token ID338
Feature activation-9.41e-01

Pos logit contributions
 know+0.040
 named+0.038
atters+0.037
 have+0.037
 done+0.036
Neg logit contributions
 therapist-0.068
cer-0.055
 toddler-0.055
 Riley-0.054
 Clare-0.053
Token very
Token ID845
Feature activation-1.74e+00

Pos logit contributions
atters+0.018
 know+0.018
 named+0.017
weet+0.015
 have+0.015
Neg logit contributions
 therapist-0.041
 Clare-0.034
 Riley-0.034
cer-0.034
unia-0.034
Token dangerous
Token ID4923
Feature activation-6.07e-01

Pos logit contributions
 know+0.038
atters+0.038
 named+0.037
ric+0.035
weet+0.034
Neg logit contributions
 therapist-0.073
rier-0.058
 toddler-0.058
gers-0.058
 Clare-0.058
Token started
Token ID2067
Feature activation-1.37e+00

Pos logit contributions
 therapist+0.029
rier+0.025
cer+0.025
ymes+0.024
 toddler+0.024
Neg logit contributions
 named-0.013
 called-0.013
 know-0.012
 have-0.012
 done-0.011
Token to
Token ID284
Feature activation+6.20e-01

Pos logit contributions
 know+0.027
 named+0.026
 have+0.025
 done+0.023
 called+0.022
Neg logit contributions
 therapist-0.066
cer-0.056
unia-0.055
 toddler-0.055
rier-0.054
Token chase
Token ID15505
Feature activation-2.49e-01

Pos logit contributions
 therapist+0.029
cer+0.024
 toddler+0.023
rier+0.023
 Riley+0.023
Neg logit contributions
 know-0.014
 have-0.013
 named-0.013
atters-0.012
 done-0.011
Token after
Token ID706
Feature activation+1.31e+00

Pos logit contributions
 know+0.005
 named+0.005
 have+0.005
 both+0.004
 called+0.004
Neg logit contributions
 therapist-0.012
cer-0.009
 toddler-0.009
rier-0.009
 Riley-0.009
Token Kay
Token ID17356
Feature activation-4.21e+00

Pos logit contributions
 therapist+0.054
cer+0.044
 toddler+0.043
unia+0.041
rier+0.041
Neg logit contributions
 know-0.034
 named-0.034
 have-0.032
atters-0.030
 done-0.030
Tokenlee
Token ID7197
Feature activation-7.94e+00

Pos logit contributions
 know+0.096
 named+0.095
atters+0.089
 have+0.089
 done+0.087
Neg logit contributions
 therapist-0.184
 toddler-0.149
cer-0.148
 Riley-0.145
rier-0.145
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
 know+0.097
 named+0.095
 have+0.083
 done+0.081
atters+0.079
Neg logit contributions
 therapist-0.428
 toddler-0.366
cer-0.365
rier-0.360
unia-0.359
Token She
Token ID1375
Feature activation-1.08e+00

Pos logit contributions
 know+0.003
 named+0.003
atters+0.003
 have+0.003
 done+0.002
Neg logit contributions
 therapist-0.005
cer-0.004
 toddler-0.004
unia-0.004
 Riley-0.004
Token screamed
Token ID25421
Feature activation-3.03e-01

Pos logit contributions
 named+0.025
 know+0.023
 called+0.022
 have+0.021
 done+0.019
Neg logit contributions
 therapist-0.050
rier-0.046
rily-0.042
 avail-0.042
cer-0.042
Token and
Token ID290
Feature activation+7.28e-01

Pos logit contributions
 named+0.004
 both+0.003
 have+0.003
 know+0.003
 because+0.003
Neg logit contributions
 therapist-0.017
cer-0.014
unia-0.014
ymes-0.014
 avail-0.014
Token ran
Token ID4966
Feature activation-1.07e+00

Pos logit contributions
 therapist+0.035
rier+0.030
cer+0.030
ymes+0.030
rily+0.029
Neg logit contributions
 named-0.016
 know-0.015
 called-0.015
 have-0.014
 done-0.013
Token save
Token ID3613
Feature activation+8.87e-01

Pos logit contributions
 therapist+0.026
cer+0.022
 toddler+0.022
 Clare+0.022
 Riley+0.022
Neg logit contributions
atters-0.012
weet-0.012
 named-0.011
 know-0.011
 done-0.011
Token it
Token ID340
Feature activation-8.60e-01

Pos logit contributions
 therapist+0.041
 toddler+0.033
cer+0.033
atively+0.032
edly+0.032
Neg logit contributions
 know-0.022
 have-0.020
 named-0.019
 done-0.019
atters-0.018
Token in
Token ID287
Feature activation-1.45e+00

Pos logit contributions
ric+0.020
 named+0.019
 know+0.018
atters+0.018
iz+0.017
Neg logit contributions
 therapist-0.034
rier-0.029
cer-0.029
rily-0.028
gers-0.028
Token your
Token ID534
Feature activation+8.95e-01

Pos logit contributions
atters+0.039
 know+0.038
 named+0.037
 done+0.034
o+0.034
Neg logit contributions
 therapist-0.053
cer-0.044
 toddler-0.044
 Clare-0.043
unia-0.042
Token pig
Token ID12967
Feature activation-2.98e+00

Pos logit contributions
 therapist+0.038
rier+0.032
rily+0.032
cer+0.031
overed+0.031
Neg logit contributions
 named-0.022
 have-0.021
 know-0.020
 done-0.019
atters-0.018
Tokengy
Token ID1360
Feature activation-7.85e+00

Pos logit contributions
atters+0.070
weet+0.066
 know+0.066
iz+0.062
 done+0.062
Neg logit contributions
 therapist-0.120
cer-0.102
unia-0.096
 toddler-0.096
gers-0.093
Token bank
Token ID3331
Feature activation-2.82e+00

Pos logit contributions
 know+0.265
atters+0.257
 named+0.254
 have+0.248
 done+0.247
Neg logit contributions
 therapist-0.249
cer-0.188
 toddler-0.186
unia-0.178
 Riley-0.176
Token."
Token ID526
Feature activation+8.58e-01

Pos logit contributions
 know+0.042
 named+0.041
atters+0.041
 have+0.038
ric+0.035
Neg logit contributions
 therapist-0.138
cer-0.118
rier-0.117
 toddler-0.115
 Clare-0.113
Token
Token ID220
Feature activation-9.86e-01

Pos logit contributions
 therapist+0.033
 toddler+0.026
 Clare+0.026
 Riley+0.026
ed+0.025
Neg logit contributions
py-0.022
opl-0.022
aver-0.022
 supposed-0.022
ric-0.021
Token
Token ID198
Feature activation+2.05e-01

Pos logit contributions
 named+0.022
weet+0.021
 done+0.021
ric+0.021
 built+0.020
Neg logit contributions
 therapist-0.037
rily-0.031
 Riley-0.031
 Clare-0.031
overed-0.031
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 therapist+0.008
 Clare+0.006
 Riley+0.006
 toddler+0.006
cer+0.006
Neg logit contributions
atters-0.005
ric-0.005
arter-0.005
opl-0.005
 earthquake-0.005
Token,
Token ID11
Feature activation+1.52e+00

Pos logit contributions
 therapist+0.001
cer+0.001
 toddler+0.001
unia+0.001
 Clare+0.001
Neg logit contributions
 named-0.000
 know-0.000
 have-0.000
 done-0.000
atters-0.000
Token "
Token ID366
Feature activation+2.44e+00

Pos logit contributions
 therapist+0.060
 toddler+0.048
 Riley+0.048
 Clare+0.047
rier+0.046
Neg logit contributions
 know-0.041
atters-0.040
 done-0.037
 named-0.037
 have-0.036
TokenYes
Token ID5297
Feature activation+2.03e+00

Pos logit contributions
 therapist+0.077
rily+0.071
rier+0.068
cer+0.065
 Clare+0.062
Neg logit contributions
 know-0.081
 named-0.079
 have-0.076
o-0.074
my-0.072
Token,
Token ID11
Feature activation-9.52e-01

Pos logit contributions
 therapist+0.076
 Riley+0.065
rier+0.065
 toddler+0.063
 Clare+0.062
Neg logit contributions
atters-0.051
 know-0.051
ric-0.050
 named-0.045
 done-0.045
Token Dog
Token ID8532
Feature activation-9.88e-01

Pos logit contributions
 know+0.020
atters+0.019
 named+0.019
 done+0.018
 have+0.018
Neg logit contributions
 therapist-0.042
 toddler-0.035
 Riley-0.034
cer-0.034
 Clare-0.034
Tokengy
Token ID1360
Feature activation-7.80e+00

Pos logit contributions
 know+0.023
 done+0.022
atters+0.022
 have+0.021
 named+0.020
Neg logit contributions
 therapist-0.040
cer-0.035
unia-0.033
 toddler-0.032
gers-0.032
Token!
Token ID0
Feature activation-1.46e+00

Pos logit contributions
 know+0.151
 named+0.140
atters+0.138
ric+0.137
weet+0.136
Neg logit contributions
 therapist-0.341
cer-0.292
 toddler-0.278
gers-0.276
 Riley-0.273
Token Let
Token ID3914
Feature activation+1.96e+00

Pos logit contributions
 know+0.039
 done+0.037
 named+0.037
atters+0.037
 have+0.036
Neg logit contributions
 therapist-0.057
 toddler-0.045
cer-0.045
 Riley-0.045
 Clare-0.044
Token's
Token ID338
Feature activation-3.00e+00

Pos logit contributions
 therapist+0.092
cer+0.076
 toddler+0.075
rier+0.075
edly+0.074
Neg logit contributions
 know-0.040
 named-0.038
 have-0.037
 done-0.035
atters-0.034
Token slide
Token ID10649
Feature activation-2.41e+00

Pos logit contributions
weet+0.075
atters+0.072
py+0.070
aver+0.069
inations+0.068
Neg logit contributions
 therapist-0.091
cer-0.085
 Riley-0.081
 toddler-0.081
 Clare-0.077
Token together
Token ID1978
Feature activation-8.18e-01

Pos logit contributions
 know+0.078
 named+0.075
atters+0.072
 done+0.069
 have+0.069
Neg logit contributions
 therapist-0.079
cer-0.063
 toddler-0.060
 Riley-0.059
gers-0.058
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
cer+0.005
 therapist+0.005
ed+0.004
unia+0.004
 toddler+0.003
Neg logit contributions
atters-0.008
 know-0.008
inations-0.008
 done-0.007
opl-0.007
Token wanted
Token ID2227
Feature activation-1.38e+00

Pos logit contributions
 therapist+0.057
cer+0.047
 toddler+0.047
 Riley+0.046
 Clare+0.045
Neg logit contributions
 know-0.034
atters-0.034
 named-0.031
 done-0.031
 have-0.030
Token to
Token ID284
Feature activation-1.49e-01

Pos logit contributions
 know+0.033
 named+0.032
 have+0.031
 done+0.030
atters+0.030
Neg logit contributions
 therapist-0.059
cer-0.049
 toddler-0.048
rier-0.047
rily-0.047
Token borrow
Token ID8804
Feature activation-6.19e-01

Pos logit contributions
atters+0.003
ric+0.003
 named+0.003
 done+0.003
o+0.003
Neg logit contributions
 therapist-0.006
 Riley-0.005
 toddler-0.005
 Clare-0.005
cer-0.005
Token Zig
Token ID24992
Feature activation-1.22e+00

Pos logit contributions
 know+0.012
 have+0.011
 named+0.011
 One+0.010
 much+0.010
Neg logit contributions
 therapist-0.032
 toddler-0.026
cer-0.026
rily-0.026
edly-0.025
Tokengy
Token ID1360
Feature activation-7.69e+00

Pos logit contributions
 know+0.034
 done+0.032
 named+0.031
atters+0.031
 have+0.031
Neg logit contributions
 therapist-0.043
cer-0.038
unia-0.037
overed-0.034
rily-0.034
Token's
Token ID338
Feature activation-1.20e+00

Pos logit contributions
 know+0.182
atters+0.178
 named+0.171
 done+0.166
 have+0.165
Neg logit contributions
 therapist-0.316
cer-0.257
 toddler-0.256
unia-0.248
 Riley-0.246
Token favorite
Token ID4004
Feature activation+4.82e-01

Pos logit contributions
 know+0.034
 named+0.033
atters+0.032
 have+0.031
 done+0.031
Neg logit contributions
 therapist-0.045
 toddler-0.036
cer-0.036
 Riley-0.035
unia-0.035
Token toy
Token ID13373
Feature activation-4.38e-01

Pos logit contributions
 therapist+0.018
cer+0.014
 toddler+0.014
 Riley+0.014
 Clare+0.014
Neg logit contributions
 know-0.014
 named-0.013
 have-0.013
atters-0.013
 done-0.012
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 know+0.008
atters+0.008
 done+0.007
ric+0.007
 have+0.007
Neg logit contributions
 therapist-0.021
cer-0.017
 toddler-0.017
unia-0.017
gers-0.017
Token Zig
Token ID24992
Feature activation+1.00e+00

Pos logit contributions
 therapist+0.031
cer+0.026
 toddler+0.026
 Riley+0.025
 Clare+0.025
Neg logit contributions
 know-0.013
 named-0.013
 have-0.012
atters-0.012
 done-0.011
Token They
Token ID1119
Feature activation-1.01e+00

Pos logit contributions
 done+0.016
inations+0.015
aver+0.015
 named+0.014
weet+0.014
Neg logit contributions
 therapist-0.022
rier-0.019
cer-0.019
 Riley-0.019
 toddler-0.019
Token do
Token ID466
Feature activation+1.20e+00

Pos logit contributions
weet+0.019
atters+0.018
 named+0.018
ric+0.018
o+0.018
Neg logit contributions
 therapist-0.043
unia-0.036
 toddler-0.035
cer-0.035
 Clare-0.035
Token not
Token ID407
Feature activation-1.69e+00

Pos logit contributions
 therapist+0.045
cer+0.037
 Riley+0.034
overed+0.034
unia+0.034
Neg logit contributions
weet-0.028
atters-0.028
 named-0.027
aver-0.027
 know-0.026
Token pack
Token ID2353
Feature activation+1.24e-01

Pos logit contributions
ric+0.035
opl+0.035
iz+0.034
atters+0.034
 Yesterday+0.033
Neg logit contributions
 therapist-0.060
cer-0.055
guard-0.051
gers-0.050
unia-0.049
Token ost
Token ID23619
Feature activation-2.51e+00

Pos logit contributions
 therapist+0.006
 toddler+0.005
unia+0.005
 Riley+0.005
 parcel+0.005
Neg logit contributions
 done-0.002
 have-0.002
 know-0.002
 both-0.002
 named-0.002
Tokenric
Token ID1173
Feature activation-7.68e+00

Pos logit contributions
 know+0.042
 named+0.038
 have+0.037
 Yesterday+0.034
atters+0.033
Neg logit contributions
 therapist-0.118
cer-0.107
unia-0.103
ymes-0.101
rier-0.099
Tokenhes
Token ID956
Feature activation-1.45e+00

Pos logit contributions
 know+0.148
 named+0.144
atters+0.129
 have+0.126
 Yesterday+0.122
Neg logit contributions
 therapist-0.348
rier-0.288
cer-0.287
unia-0.284
 Riley-0.280
Token anymore
Token ID7471
Feature activation-1.15e+00

Pos logit contributions
 know+0.030
 named+0.029
 have+0.028
 done+0.027
atters+0.027
Neg logit contributions
 therapist-0.067
 toddler-0.056
cer-0.055
 Riley-0.054
unia-0.054
Token.
Token ID13
Feature activation-8.49e-01

Pos logit contributions
 know+0.017
 named+0.016
atters+0.015
 have+0.014
 done+0.014
Neg logit contributions
 therapist-0.059
cer-0.050
 toddler-0.050
 Riley-0.049
unia-0.048
Token They
Token ID1119
Feature activation-1.10e+00

Pos logit contributions
 done+0.022
inations+0.022
aver+0.021
py+0.020
 named+0.020
Neg logit contributions
 therapist-0.029
 Riley-0.026
 Clare-0.026
 toddler-0.025
 Amelia-0.025
Token pack
Token ID2353
Feature activation+1.96e+00

Pos logit contributions
weet+0.020
 named+0.020
atters+0.020
o+0.019
aver+0.018
Neg logit contributions
 therapist-0.048
 toddler-0.039
cer-0.039
unia-0.039
 Riley-0.039
Token some
Token ID617
Feature activation+1.67e+00

Pos logit contributions
 know+0.015
 have+0.014
 named+0.014
 done+0.013
 both+0.012
Neg logit contributions
 therapist-0.029
cer-0.025
 toddler-0.024
unia-0.024
 Riley-0.024
Token money
Token ID1637
Feature activation+4.90e-01

Pos logit contributions
 therapist+0.064
cer+0.051
 toddler+0.050
 Riley+0.049
 Clare+0.049
Neg logit contributions
 know-0.047
 named-0.045
atters-0.044
 have-0.043
 done-0.043
Token for
Token ID329
Feature activation+8.93e-01

Pos logit contributions
 therapist+0.028
 Riley+0.024
 toddler+0.024
cer+0.023
unia+0.023
Neg logit contributions
 know-0.005
 named-0.005
 have-0.004
 done-0.004
 called-0.004
Token her
Token ID607
Feature activation-2.06e+00

Pos logit contributions
 therapist+0.032
 toddler+0.025
cer+0.025
 Clare+0.025
 Riley+0.024
Neg logit contributions
 know-0.026
atters-0.026
 named-0.026
 have-0.025
 done-0.024
Token pig
Token ID12967
Feature activation-3.05e+00

Pos logit contributions
 know+0.044
 named+0.043
 have+0.041
 done+0.039
atters+0.039
Neg logit contributions
 therapist-0.092
cer-0.077
 toddler-0.075
 Riley-0.074
unia-0.074
Tokengy
Token ID1360
Feature activation-7.58e+00

Pos logit contributions
atters+0.062
 know+0.058
weet+0.057
 done+0.052
 Yesterday+0.051
Neg logit contributions
 therapist-0.133
cer-0.114
 toddler-0.109
unia-0.108
gers-0.107
Token bank
Token ID3331
Feature activation-2.61e+00

Pos logit contributions
 know+0.183
atters+0.182
 named+0.167
weet+0.165
 done+0.165
Neg logit contributions
 therapist-0.309
 toddler-0.250
cer-0.249
unia-0.239
gers-0.236
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
 know+0.037
 named+0.035
atters+0.033
 have+0.032
 done+0.030
Neg logit contributions
 therapist-0.135
cer-0.114
 toddler-0.114
 Riley-0.111
 Clare-0.111
Token
Token ID198
Feature activation-1.77e-01

Pos logit contributions
 therapist+0.014
 Clare+0.012
 toddler+0.012
 Riley+0.012
 Amelia+0.011
Neg logit contributions
 done-0.010
inations-0.009
 supposed-0.009
atters-0.009
 know-0.009
Token
Token ID198
Feature activation-5.20e-01

Pos logit contributions
atters+0.004
 done+0.004
 know+0.003
ric+0.003
weet+0.003
Neg logit contributions
 therapist-0.008
 toddler-0.006
 Riley-0.006
 Clare-0.006
cer-0.006
TokenOne
Token ID3198
Feature activation-2.15e+00

Pos logit contributions
 know+0.010
atters+0.009
 done+0.009
 named+0.009
ric+0.008
Neg logit contributions
 therapist-0.024
cer-0.020
 toddler-0.020
 Clare-0.020
unia-0.020
Token save
Token ID3613
Feature activation+7.23e-01

Pos logit contributions
 therapist+0.003
cer+0.003
 Clare+0.003
 Riley+0.003
unia+0.003
Neg logit contributions
weet-0.002
atters-0.002
 named-0.002
 done-0.002
ric-0.002
Token it
Token ID340
Feature activation-1.08e+00

Pos logit contributions
 therapist+0.032
cer+0.026
 toddler+0.026
edly+0.025
atively+0.025
Neg logit contributions
 know-0.018
 have-0.017
 done-0.016
 named-0.016
atters-0.015
Token in
Token ID287
Feature activation-1.68e+00

Pos logit contributions
 know+0.019
 named+0.019
atters+0.018
 have+0.017
ric+0.016
Neg logit contributions
 therapist-0.051
cer-0.043
rier-0.042
 toddler-0.042
rily-0.042
Token your
Token ID534
Feature activation+7.77e-01

Pos logit contributions
 know+0.040
 named+0.039
atters+0.037
 have+0.037
 done+0.036
Neg logit contributions
 therapist-0.071
cer-0.058
 toddler-0.057
 Riley-0.056
unia-0.056
Token pig
Token ID12967
Feature activation-3.13e+00

Pos logit contributions
 therapist+0.032
overed+0.026
cer+0.026
rier+0.025
rily+0.025
Neg logit contributions
 named-0.020
 have-0.019
 know-0.019
 done-0.018
atters-0.017
Tokengy
Token ID1360
Feature activation-7.58e+00

Pos logit contributions
 know+0.063
atters+0.061
 named+0.058
 have+0.056
 done+0.055
Neg logit contributions
 therapist-0.141
cer-0.120
 toddler-0.115
unia-0.114
 Riley-0.112
Token bank
Token ID3331
Feature activation-2.76e+00

Pos logit contributions
 know+0.236
 named+0.230
atters+0.224
 have+0.221
 done+0.217
Neg logit contributions
 therapist-0.263
cer-0.204
 toddler-0.202
 Riley-0.198
 Clare-0.195
Token."
Token ID526
Feature activation+4.47e-01

Pos logit contributions
 know+0.035
 named+0.033
atters+0.031
 have+0.030
 done+0.027
Neg logit contributions
 therapist-0.146
cer-0.125
 toddler-0.124
 Riley-0.122
rier-0.121
Token
Token ID198
Feature activation-5.86e-02

Pos logit contributions
 therapist+0.021
 toddler+0.017
 Riley+0.017
 Clare+0.017
 Amelia+0.017
Neg logit contributions
 done-0.009
 know-0.008
atters-0.008
weet-0.008
ric-0.008
Token
Token ID198
Feature activation-3.83e-01

Pos logit contributions
atters+0.001
 done+0.001
ric+0.001
 know+0.001
weet+0.001
Neg logit contributions
 therapist-0.003
 Riley-0.002
 toddler-0.002
 Clare-0.002
 Amelia-0.002
TokenWhile
Token ID3633
Feature activation-3.19e-01

Pos logit contributions
 know+0.012
 named+0.012
atters+0.011
 have+0.011
 done+0.011
Neg logit contributions
 therapist-0.014
cer-0.011
 toddler-0.010
 Riley-0.010
rier-0.010
Token did
Token ID750
Feature activation+3.32e-01

Pos logit contributions
 therapist+0.046
 toddler+0.040
cer+0.038
unia+0.037
 Riley+0.037
Neg logit contributions
atters-0.035
weet-0.033
opl-0.032
 named-0.032
 know-0.032
Token not
Token ID407
Feature activation-6.44e-01

Pos logit contributions
 therapist+0.016
cer+0.013
 Riley+0.013
 Clare+0.013
 toddler+0.013
Neg logit contributions
 know-0.005
 named-0.005
atters-0.005
weet-0.005
my-0.004
Token know
Token ID760
Feature activation-2.69e+00

Pos logit contributions
ric+0.013
atters+0.012
opl+0.011
 Later+0.011
omo+0.011
Neg logit contributions
 therapist-0.026
 Clare-0.023
cer-0.023
unia-0.022
 stranger-0.022
Token what
Token ID644
Feature activation+2.06e+00

Pos logit contributions
 know+0.051
atters+0.048
 named+0.047
 have+0.045
 done+0.044
Neg logit contributions
 therapist-0.125
cer-0.105
 toddler-0.104
 Riley-0.103
 Clare-0.102
Token ost
Token ID23619
Feature activation-2.99e+00

Pos logit contributions
 therapist+0.078
cer+0.062
 toddler+0.061
 Riley+0.059
rier+0.059
Neg logit contributions
 know-0.059
 named-0.058
 have-0.056
atters-0.055
 done-0.054
Tokenric
Token ID1173
Feature activation-7.58e+00

Pos logit contributions
 know+0.059
 named+0.057
 much+0.057
 mud+0.054
 Yesterday+0.053
Neg logit contributions
unia-0.107
cer-0.104
 therapist-0.103
overed-0.103
ed-0.097
Tokenhes
Token ID956
Feature activation-1.96e+00

Pos logit contributions
 know+0.156
 named+0.156
atters+0.146
 Yesterday+0.138
 have+0.134
Neg logit contributions
 therapist-0.329
rier-0.269
unia-0.268
cer-0.264
 kernels-0.261
Token were
Token ID547
Feature activation-2.42e+00

Pos logit contributions
 know+0.043
 named+0.042
 have+0.040
atters+0.039
 done+0.039
Neg logit contributions
 therapist-0.086
cer-0.071
 toddler-0.071
 Riley-0.069
unia-0.069
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
atters+0.053
 Yesterday+0.047
 named+0.046
 know+0.044
iz+0.044
Neg logit contributions
 therapist-0.098
cer-0.081
 Clare-0.079
unia-0.079
 toddler-0.078
Token They
Token ID1119
Feature activation-2.31e-01

Pos logit contributions
 therapist+0.015
cer+0.013
unia+0.013
 toddler+0.013
rier+0.013
Neg logit contributions
 know-0.003
 named-0.003
 have-0.002
atters-0.002
 One-0.002
Token looked
Token ID3114
Feature activation-9.28e-01

Pos logit contributions
atters+0.005
 know+0.005
 named+0.005
 have+0.005
 done+0.005
Neg logit contributions
 therapist-0.009
 toddler-0.008
cer-0.008
 Riley-0.007
unia-0.007
Token wanted
Token ID2227
Feature activation-1.71e+00

Pos logit contributions
 know+0.011
atters+0.010
 named+0.010
 have+0.010
 done+0.010
Neg logit contributions
 therapist-0.019
 toddler-0.016
cer-0.016
 Riley-0.015
unia-0.015
Token to
Token ID284
Feature activation-2.28e-01

Pos logit contributions
 know+0.035
 named+0.034
 have+0.033
 done+0.031
 called+0.029
Neg logit contributions
 therapist-0.080
cer-0.070
rily-0.067
unia-0.067
rier-0.067
Token touch
Token ID3638
Feature activation-1.50e-01

Pos logit contributions
atters+0.005
weet+0.005
 done+0.004
 Yesterday+0.004
 named+0.004
Neg logit contributions
 therapist-0.009
cer-0.008
unia-0.008
 Riley-0.008
 Clare-0.008
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
 named+0.003
 know+0.003
 have+0.003
 called+0.003
my+0.002
Neg logit contributions
 therapist-0.007
overed-0.006
cer-0.006
 toddler-0.006
edly-0.006
Token ost
Token ID23619
Feature activation-2.79e+00

Pos logit contributions
 know+0.033
atters+0.033
 named+0.032
 have+0.030
 done+0.030
Neg logit contributions
 therapist-0.043
unia-0.033
 toddler-0.033
cer-0.033
rier-0.033
Tokenric
Token ID1173
Feature activation-7.57e+00

Pos logit contributions
 know+0.065
 have+0.056
 much+0.055
 Yesterday+0.052
 named+0.050
Neg logit contributions
unia-0.100
cer-0.098
 therapist-0.097
overed-0.092
mer-0.092
Tokenhes
Token ID956
Feature activation-1.23e+00

Pos logit contributions
 know+0.173
 named+0.166
 Yesterday+0.155
 have+0.151
atters+0.149
Neg logit contributions
 therapist-0.305
overed-0.246
rier-0.245
rily-0.243
unia-0.242
Token.
Token ID13
Feature activation-3.58e-01

Pos logit contributions
 named+0.015
 know+0.015
 have+0.014
 done+0.013
atters+0.012
Neg logit contributions
 therapist-0.065
 toddler-0.057
cer-0.056
unia-0.055
 Clare-0.055
Token They
Token ID1119
Feature activation-8.11e-01

Pos logit contributions
 know+0.005
 named+0.004
atters+0.004
 have+0.004
 done+0.004
Neg logit contributions
 therapist-0.019
cer-0.016
 toddler-0.016
 Riley-0.016
unia-0.016
Token reached
Token ID4251
Feature activation-1.48e+00

Pos logit contributions
 know+0.018
 named+0.018
 have+0.017
atters+0.016
 done+0.016
Neg logit contributions
 therapist-0.036
cer-0.030
 toddler-0.029
 Riley-0.029
rier-0.029
Token out
Token ID503
Feature activation+2.92e-01

Pos logit contributions
 know+0.040
 named+0.039
atters+0.038
 have+0.038
 done+0.037
Neg logit contributions
 therapist-0.059
cer-0.047
 toddler-0.046
rier-0.045
unia-0.045
Token was
Token ID373
Feature activation+1.30e-01

Pos logit contributions
 therapist+0.020
cer+0.016
 toddler+0.016
 Riley+0.015
unia+0.015
Neg logit contributions
 know-0.016
 named-0.016
atters-0.015
 have-0.015
 done-0.015
Token always
Token ID1464
Feature activation+1.02e+00

Pos logit contributions
 therapist+0.006
 Riley+0.005
 Clare+0.005
unia+0.004
cer+0.004
Neg logit contributions
atters-0.003
 know-0.003
 have-0.002
ric-0.002
 done-0.002
Token there
Token ID612
Feature activation+2.20e+00

Pos logit contributions
 therapist+0.050
 toddler+0.041
cer+0.041
unia+0.040
 Riley+0.040
Neg logit contributions
 know-0.019
 named-0.019
 have-0.017
 done-0.017
atters-0.016
Token for
Token ID329
Feature activation+1.39e+00

Pos logit contributions
 therapist+0.100
cer+0.082
 toddler+0.082
unia+0.080
 Riley+0.080
Neg logit contributions
 know-0.046
 named-0.045
 have-0.043
 done-0.041
atters-0.040
Token Bug
Token ID15217
Feature activation-8.78e-01

Pos logit contributions
 therapist+0.053
 Riley+0.043
 Clare+0.043
 toddler+0.043
cer+0.042
Neg logit contributions
atters-0.037
 know-0.034
 named-0.033
 done-0.032
 have-0.032
Tokengy
Token ID1360
Feature activation-7.50e+00

Pos logit contributions
 know+0.017
 named+0.017
atters+0.016
 have+0.016
 done+0.015
Neg logit contributions
 therapist-0.040
cer-0.034
 toddler-0.033
 Riley-0.033
unia-0.033
Token,
Token ID11
Feature activation+6.89e-01

Pos logit contributions
 know+0.140
atters+0.133
 named+0.129
 have+0.124
 done+0.121
Neg logit contributions
 therapist-0.349
cer-0.290
 toddler-0.288
 Riley-0.282
unia-0.281
Token no
Token ID645
Feature activation+1.03e-02

Pos logit contributions
 therapist+0.036
rier+0.031
 parcel+0.031
unia+0.030
 toddler+0.030
Neg logit contributions
 named-0.012
 have-0.011
 called-0.010
my-0.010
 know-0.010
Token matter
Token ID2300
Feature activation+1.30e+00

Pos logit contributions
 therapist+0.000
cer+0.000
 toddler+0.000
 Riley+0.000
 Clare+0.000
Neg logit contributions
atters-0.000
 know-0.000
 named-0.000
 done-0.000
 have-0.000
Token what
Token ID644
Feature activation+1.42e+00

Pos logit contributions
 therapist+0.044
cer+0.041
 Riley+0.038
 toddler+0.037
 Amelia+0.037
Neg logit contributions
atters-0.034
 know-0.033
 done-0.032
aver-0.028
 Yesterday-0.028
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
 therapist+0.066
cer+0.056
rier+0.056
 toddler+0.055
rily+0.055
Neg logit contributions
 named-0.029
 know-0.029
 have-0.028
 done-0.025
 called-0.024
Token mom
Token ID1995
Feature activation-1.56e+00

Pos logit contributions
 therapist+0.012
 Clare+0.009
 Riley+0.009
 toddler+0.009
unia+0.009
Neg logit contributions
inations-0.017
atters-0.017
 done-0.015
 know-0.015
weet-0.015
Token and
Token ID290
Feature activation-7.69e-01

Pos logit contributions
atters+0.035
 know+0.035
 named+0.032
weet+0.031
 done+0.031
Neg logit contributions
 therapist-0.066
 toddler-0.054
rier-0.053
cer-0.052
 Riley-0.051
Token to
Token ID284
Feature activation+5.40e-01

Pos logit contributions
 know+0.019
 named+0.018
 have+0.017
atters+0.017
 done+0.017
Neg logit contributions
 therapist-0.032
cer-0.027
 toddler-0.026
 Riley-0.026
unia-0.026
Token the
Token ID262
Feature activation-9.77e-01

Pos logit contributions
 therapist+0.019
 toddler+0.015
 Riley+0.015
cer+0.015
 Clare+0.014
Neg logit contributions
atters-0.016
 know-0.015
 done-0.014
 named-0.014
 Yesterday-0.014
Token ost
Token ID23619
Feature activation-2.85e+00

Pos logit contributions
atters+0.029
 know+0.029
 named+0.027
 have+0.026
weet+0.026
Neg logit contributions
 therapist-0.035
 toddler-0.027
unia-0.027
rier-0.027
 Clare-0.027
Tokenric
Token ID1173
Feature activation-7.49e+00

Pos logit contributions
 know+0.074
 have+0.067
 Yesterday+0.066
 much+0.066
 One+0.058
Neg logit contributions
 therapist-0.094
unia-0.092
cer-0.092
we-0.087
overed-0.087
Tokenhes
Token ID956
Feature activation-1.09e+00

Pos logit contributions
 know+0.149
 named+0.140
 have+0.129
atters+0.128
 Yesterday+0.128
Neg logit contributions
 therapist-0.334
rier-0.276
cer-0.274
unia-0.273
rily-0.269
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
 know+0.018
 named+0.017
 have+0.016
atters+0.016
 done+0.015
Neg logit contributions
 therapist-0.054
cer-0.046
 toddler-0.046
 Riley-0.045
unia-0.045
Token They
Token ID1119
Feature activation-7.88e-01

Pos logit contributions
 know+0.006
 done+0.006
atters+0.006
weet+0.005
 named+0.005
Neg logit contributions
 therapist-0.016
 toddler-0.013
 Riley-0.013
 Clare-0.013
edly-0.013
Token learned
Token ID4499
Feature activation-1.05e+00

Pos logit contributions
atters+0.018
 know+0.018
 named+0.017
ric+0.017
weet+0.017
Neg logit contributions
 therapist-0.033
 toddler-0.027
cer-0.026
unia-0.026
 Riley-0.026
Token their
Token ID511
Feature activation+3.01e-01

Pos logit contributions
 know+0.026
 named+0.026
atters+0.025
 have+0.024
 done+0.024
Neg logit contributions
 therapist-0.043
cer-0.034
 toddler-0.034
 Riley-0.034
 Clare-0.033
Token.
Token ID13
Feature activation-8.43e-02

Pos logit contributions
 know+0.025
atters+0.024
ric+0.023
 named+0.023
 done+0.022
Neg logit contributions
 therapist-0.061
cer-0.052
rier-0.050
 toddler-0.050
unia-0.050
Token
Token ID198
Feature activation-3.71e-01

Pos logit contributions
 know+0.002
 named+0.002
atters+0.002
 have+0.002
 done+0.002
Neg logit contributions
 therapist-0.004
cer-0.003
 toddler-0.003
 Riley-0.003
 Clare-0.003
Token
Token ID198
Feature activation-6.45e-01

Pos logit contributions
 done+0.009
weet+0.009
ric+0.008
 built+0.008
atters+0.008
Neg logit contributions
 therapist-0.014
 Riley-0.011
 Debbie-0.011
 Amelia-0.011
 Clare-0.011
Token"
Token ID1
Feature activation+2.19e+00

Pos logit contributions
 know+0.017
atters+0.016
 done+0.016
 named+0.016
ric+0.016
Neg logit contributions
 therapist-0.026
cer-0.020
 toddler-0.020
unia-0.020
 Riley-0.019
TokenDog
Token ID32942
Feature activation-2.76e+00

Pos logit contributions
 therapist+0.084
rily+0.084
rier+0.082
cer+0.080
 Amelia+0.079
Neg logit contributions
 named-0.046
 have-0.046
Tim-0.040
 called-0.039
 mud-0.038
Tokengy
Token ID1360
Feature activation-7.47e+00

Pos logit contributions
 know+0.048
 named+0.047
atters+0.044
 have+0.043
 done+0.041
Neg logit contributions
 therapist-0.135
 toddler-0.113
cer-0.112
 Riley-0.110
 Clare-0.110
Token,
Token ID11
Feature activation-2.12e-01

Pos logit contributions
 know+0.128
 named+0.123
atters+0.116
 have+0.115
 done+0.111
Neg logit contributions
 therapist-0.368
 toddler-0.307
cer-0.307
 Riley-0.302
 Clare-0.301
Token you
Token ID345
Feature activation+1.83e+00

Pos logit contributions
 know+0.004
 named+0.004
 have+0.004
atters+0.004
 done+0.003
Neg logit contributions
 therapist-0.010
cer-0.009
 toddler-0.008
unia-0.008
rier-0.008
Token are
Token ID389
Feature activation-3.22e-02

Pos logit contributions
 therapist+0.059
 toddler+0.056
rified+0.055
cer+0.054
 parcel+0.052
Neg logit contributions
opl-0.045
weet-0.043
ric-0.043
 Yesterday-0.040
atters-0.039
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
 know+0.001
atters+0.001
ric+0.001
 Yesterday+0.001
opl+0.001
Neg logit contributions
 therapist-0.001
unia-0.001
rier-0.001
 toddler-0.001
 Riley-0.001
Token bad
Token ID2089
Feature activation+1.77e+00

Pos logit contributions
 therapist+0.067
cer+0.056
 toddler+0.055
 Riley+0.055
 Clare+0.054
Neg logit contributions
 know-0.028
 named-0.027
 have-0.026
atters-0.025
 done-0.024
Tokenly
Token ID306
Feature activation+3.39e-01

Pos logit contributions
 done+0.045
 know+0.045
 named+0.043
atters+0.043
 have+0.042
Neg logit contributions
 therapist-0.043
 toddler-0.034
 Amelia-0.032
 parcel-0.032
 Clare-0.032
Token said
Token ID531
Feature activation-8.67e-01

Pos logit contributions
 therapist+0.015
rier+0.014
rily+0.013
cer+0.013
 avail+0.013
Neg logit contributions
 named-0.007
 called-0.007
 done-0.006
 know-0.006
 have-0.006
Token,
Token ID11
Feature activation+6.63e-01

Pos logit contributions
 named+0.014
 know+0.012
 much+0.011
 have+0.011
 done+0.010
Neg logit contributions
 therapist-0.044
cer-0.042
unia-0.039
ymes-0.039
 toddler-0.038
Token "
Token ID366
Feature activation+2.33e+00

Pos logit contributions
 therapist+0.031
cer+0.028
unia+0.026
 toddler+0.026
gers+0.025
Neg logit contributions
 named-0.012
 know-0.012
 have-0.011
 called-0.010
atters-0.010
TokenBug
Token ID25624
Feature activation-2.51e+00

Pos logit contributions
 therapist+0.081
rier+0.071
rily+0.067
 parcel+0.066
overed+0.065
Neg logit contributions
my-0.066
 have-0.063
 named-0.063
weet-0.063
o-0.062
Tokengy
Token ID1360
Feature activation-7.46e+00

Pos logit contributions
 know+0.042
 named+0.040
 have+0.038
atters+0.036
 done+0.035
Neg logit contributions
 therapist-0.125
 toddler-0.105
cer-0.104
 Clare-0.102
 Riley-0.102
Token,
Token ID11
Feature activation-5.07e-02

Pos logit contributions
 know+0.123
 named+0.119
atters+0.110
 have+0.110
 done+0.106
Neg logit contributions
 therapist-0.369
cer-0.310
 toddler-0.310
 Riley-0.304
 Clare-0.302
Token today
Token ID1909
Feature activation+8.83e-01

Pos logit contributions
 know+0.001
 have+0.001
 named+0.001
 done+0.001
atters+0.001
Neg logit contributions
 therapist-0.002
cer-0.002
edly-0.002
unia-0.002
 toddler-0.002
Token was
Token ID373
Feature activation-1.31e+00

Pos logit contributions
 therapist+0.044
edly+0.037
cer+0.037
rily+0.037
 toddler+0.037
Neg logit contributions
 have-0.016
 know-0.015
 done-0.014
 named-0.014
."-0.013
Token so
Token ID523
Feature activation-1.68e+00

Pos logit contributions
 know+0.028
atters+0.027
 named+0.027
 have+0.025
 done+0.024
Neg logit contributions
 therapist-0.057
cer-0.047
 toddler-0.047
 Riley-0.047
 Clare-0.046
Token much
Token ID881
Feature activation-1.81e+00

Pos logit contributions
 know+0.020
 named+0.020
atters+0.019
 have+0.017
 done+0.016
Neg logit contributions
 therapist-0.089
 toddler-0.075
 Riley-0.075
cer-0.075
 Clare-0.074
Token Snow
Token ID7967
Feature activation+2.50e-01

Pos logit contributions
 named+0.001
 know+0.001
 have+0.001
atters+0.001
 done+0.001
Neg logit contributions
 therapist-0.001
cer-0.001
 toddler-0.001
rier-0.001
 Clare-0.001
Tokeny
Token ID88
Feature activation-2.31e+00

Pos logit contributions
 therapist+0.007
cer+0.006
 toddler+0.005
unia+0.005
 Riley+0.005
Neg logit contributions
 know-0.009
atters-0.009
 named-0.009
 have-0.009
 done-0.009
Token said
Token ID531
Feature activation-7.09e-01

Pos logit contributions
 named+0.045
 know+0.043
 have+0.040
 called+0.038
 done+0.037
Neg logit contributions
 therapist-0.109
rier-0.094
cer-0.091
rily-0.090
 Riley-0.090
Token to
Token ID284
Feature activation+9.44e-01

Pos logit contributions
 know+0.007
 named+0.007
 have+0.007
 done+0.005
 much+0.005
Neg logit contributions
 therapist-0.041
cer-0.038
unia-0.036
 toddler-0.035
 parcel-0.035
Token Tig
Token ID14189
Feature activation-3.05e-01

Pos logit contributions
 therapist+0.037
cer+0.030
 Riley+0.030
 toddler+0.030
 Clare+0.029
Neg logit contributions
 know-0.025
atters-0.024
 named-0.024
 done-0.023
 have-0.023
Tokengy
Token ID1360
Feature activation-7.43e+00

Pos logit contributions
 named+0.006
 know+0.006
atters+0.005
 have+0.005
my+0.005
Neg logit contributions
 therapist-0.014
 toddler-0.012
 Clare-0.012
 Riley-0.012
cer-0.012
Token,
Token ID11
Feature activation+5.27e-01

Pos logit contributions
 know+0.128
atters+0.123
 named+0.118
 have+0.114
 done+0.112
Neg logit contributions
 therapist-0.356
 toddler-0.298
cer-0.297
 Riley-0.291
 Clare-0.290
Token "
Token ID366
Feature activation+1.91e+00

Pos logit contributions
 therapist+0.026
cer+0.024
rier+0.022
unia+0.022
rily+0.022
Neg logit contributions
 named-0.009
 know-0.008
 have-0.008
 called-0.008
 One-0.007
TokenI
Token ID40
Feature activation-4.49e-01

Pos logit contributions
 therapist+0.065
rier+0.059
rily+0.056
 parcel+0.055
overed+0.054
Neg logit contributions
my-0.059
o-0.055
 named-0.055
 have-0.055
 know-0.052
Token found
Token ID1043
Feature activation-1.28e+00

Pos logit contributions
 know+0.007
 named+0.006
 have+0.006
atters+0.006
 done+0.005
Neg logit contributions
 therapist-0.024
cer-0.020
 toddler-0.020
 Riley-0.020
rier-0.020
Token something
Token ID1223
Feature activation+4.43e+00

Pos logit contributions
 named+0.025
 know+0.024
 have+0.022
atters+0.021
 done+0.021
Neg logit contributions
 therapist-0.061
cer-0.050
 toddler-0.049
overed-0.048
rily-0.048
Token have
Token ID423
Feature activation-1.24e+00

Pos logit contributions
 therapist+0.069
 Clare+0.059
 Riley+0.059
cer+0.058
 toddler+0.058
Neg logit contributions
atters-0.019
 named-0.017
o-0.017
weet-0.016
 know-0.016
Token found
Token ID1043
Feature activation-1.55e+00

Pos logit contributions
 done+0.025
 named+0.024
 know+0.023
 have+0.022
 called+0.020
Neg logit contributions
 therapist-0.061
rier-0.055
rily-0.051
 toddler-0.051
cer-0.049
Token such
Token ID884
Feature activation+9.30e-01

Pos logit contributions
 know+0.028
 named+0.026
 have+0.024
 much+0.024
 done+0.024
Neg logit contributions
 therapist-0.079
edly-0.066
overed-0.064
 toddler-0.063
unia-0.063
Token an
Token ID281
Feature activation+1.45e+00

Pos logit contributions
 therapist+0.049
cer+0.041
overed+0.040
rier+0.039
rily+0.039
Neg logit contributions
 know-0.015
 named-0.015
 have-0.014
 much-0.013
 done-0.012
Token amazing
Token ID4998
Feature activation+2.37e-01

Pos logit contributions
 therapist+0.052
 toddler+0.045
cer+0.042
 parcel+0.041
 Clare+0.041
Neg logit contributions
atters-0.038
 know-0.038
ric-0.037
 named-0.037
 have-0.036
Token thing
Token ID1517
Feature activation+4.58e+00

Pos logit contributions
 therapist+0.010
rier+0.009
unia+0.008
rily+0.008
 Riley+0.008
Neg logit contributions
 know-0.006
 named-0.005
 have-0.005
 done-0.005
my-0.005
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
 therapist+0.239
rier+0.203
 Clare+0.202
unia+0.202
 toddler+0.202
Neg logit contributions
 named-0.068
 have-0.057
 called-0.057
o-0.056
 know-0.054
Token
Token ID198
Feature activation-3.83e-01

Pos logit contributions
 therapist+0.008
 Riley+0.007
 toddler+0.007
 Clare+0.007
cer+0.006
Neg logit contributions
 done-0.003
 know-0.003
atters-0.003
 named-0.003
weet-0.002
Token
Token ID198
Feature activation-6.19e-01

Pos logit contributions
atters+0.007
 know+0.007
 named+0.006
 done+0.006
 have+0.006
Neg logit contributions
 therapist-0.018
 Riley-0.015
 toddler-0.015
 Clare-0.015
cer-0.014
TokenJack
Token ID14295
Feature activation+1.32e+00

Pos logit contributions
 know+0.015
 named+0.015
atters+0.014
 have+0.014
 done+0.014
Neg logit contributions
 therapist-0.026
cer-0.021
 toddler-0.021
rier-0.020
 Riley-0.020
Token knew
Token ID2993
Feature activation-7.74e-01

Pos logit contributions
 therapist+0.056
rier+0.048
cer+0.046
 toddler+0.046
 Clare+0.045
Neg logit contributions
 named-0.031
 know-0.030
 have-0.028
 done-0.028
my-0.027
Tokenmy
Token ID1820
Feature activation-5.43e-01

Pos logit contributions
 therapist+0.031
cer+0.027
 toddler+0.027
 Riley+0.026
unia+0.026
Neg logit contributions
 know-0.004
 named-0.003
atters-0.003
 have-0.003
 done-0.002
Token realized
Token ID6939
Feature activation-1.03e+00

Pos logit contributions
 know+0.014
 named+0.014
 done+0.013
 have+0.013
 called+0.012
Neg logit contributions
 therapist-0.021
rier-0.019
 Clare-0.018
cer-0.018
 avail-0.018
Token that
Token ID326
Feature activation-4.42e-01

Pos logit contributions
 named+0.019
 know+0.016
 called+0.016
 have+0.016
 done+0.015
Neg logit contributions
 therapist-0.050
cer-0.044
unia-0.043
 toddler-0.042
gers-0.041
Token Lily
Token ID20037
Feature activation+7.55e-02

Pos logit contributions
 have+0.010
 know+0.010
 named+0.010
my+0.009
 done+0.009
Neg logit contributions
 therapist-0.020
unia-0.018
rier-0.018
rily-0.017
cer-0.017
Token was
Token ID373
Feature activation+5.52e-02

Pos logit contributions
 therapist+0.003
rier+0.003
 Clare+0.003
cer+0.003
rily+0.003
Neg logit contributions
 know-0.002
 have-0.002
 named-0.001
 done-0.001
my-0.001
Token right
Token ID826
Feature activation+4.58e+00

Pos logit contributions
 therapist+0.003
cer+0.002
 toddler+0.002
 Riley+0.002
rily+0.002
Neg logit contributions
 know-0.001
 named-0.001
 have-0.001
 done-0.001
atters-0.001
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
 therapist+0.255
 Clare+0.216
 Riley+0.213
unia+0.212
 toddler+0.211
Neg logit contributions
 named-0.051
 done-0.048
my-0.045
 called-0.042
 know-0.039
Token From
Token ID3574
Feature activation-6.86e-01

Pos logit contributions
 therapist+0.024
 toddler+0.020
 Riley+0.020
cer+0.020
 Clare+0.019
Neg logit contributions
 know-0.012
 named-0.012
 done-0.011
 have-0.011
atters-0.011
Token that
Token ID326
Feature activation-1.75e+00

Pos logit contributions
 named+0.009
 called+0.008
 done+0.008
 know+0.007
my+0.006
Neg logit contributions
 therapist-0.036
rier-0.032
cer-0.032
 toddler-0.031
rily-0.031
Token day
Token ID1110
Feature activation+4.92e-01

Pos logit contributions
 know+0.028
 named+0.027
atters+0.025
 have+0.025
 done+0.024
Neg logit contributions
 therapist-0.088
cer-0.074
 toddler-0.073
 Riley-0.072
 Clare-0.072
Token on
Token ID319
Feature activation-2.11e+00

Pos logit contributions
 therapist+0.017
cer+0.013
 toddler+0.013
 Riley+0.013
unia+0.012
Neg logit contributions
 know-0.015
atters-0.015
 named-0.014
ric-0.014
weet-0.014
Token?
Token ID30
Feature activation+2.17e-01

Pos logit contributions
 therapist+0.190
cer+0.162
unia+0.159
 toddler+0.159
rier+0.157
Neg logit contributions
 know-0.046
 named-0.045
 have-0.041
 done-0.036
atters-0.034
Tokenâ
Token ID22940
Feature activation+1.04e+00

Pos logit contributions
 therapist+0.010
cer+0.008
 toddler+0.008
 Riley+0.008
unia+0.008
Neg logit contributions
 know-0.005
 named-0.004
atters-0.004
 have-0.004
 done-0.004
Token
Token ID26391
Feature activation-8.28e-01

Pos logit contributions
 therapist+0.038
cer+0.030
 toddler+0.030
 Riley+0.029
unia+0.029
Neg logit contributions
 know-0.031
 named-0.030
atters-0.029
 have-0.028
 done-0.028
Token He
Token ID679
Feature activation-6.59e-01

Pos logit contributions
 know+0.015
 named+0.014
atters+0.014
 have+0.013
 done+0.013
Neg logit contributions
 therapist-0.039
cer-0.033
 toddler-0.033
 Riley-0.032
 Clare-0.032
Token looked
Token ID3114
Feature activation-1.03e+00

Pos logit contributions
 know+0.014
 named+0.014
 have+0.013
 called+0.012
 done+0.011
Neg logit contributions
 therapist-0.030
rier-0.027
cer-0.027
rily-0.026
 toddler-0.026
Token up
Token ID510
Feature activation+4.79e+00

Pos logit contributions
 named+0.021
 know+0.020
 called+0.019
 have+0.019
 done+0.018
Neg logit contributions
 therapist-0.049
cer-0.041
unia-0.040
edly-0.039
 toddler-0.039
Token to
Token ID284
Feature activation+2.93e+00

Pos logit contributions
 therapist+0.266
unia+0.227
cer+0.224
ymes+0.222
 toddler+0.219
Neg logit contributions
 named-0.059
 have-0.052
 called-0.051
 done-0.050
 know-0.048
Token see
Token ID766
Feature activation-1.57e+00

Pos logit contributions
 therapist+0.138
cer+0.116
rier+0.115
overed+0.113
rily+0.113
Neg logit contributions
 know-0.068
 have-0.064
 named-0.060
 done-0.051
 called-0.049
Token a
Token ID257
Feature activation+1.10e+00

Pos logit contributions
 know+0.026
 named+0.026
 have+0.024
."+0.022
 called+0.022
Neg logit contributions
 therapist-0.078
edly-0.066
overed-0.066
cer-0.066
 toddler-0.065
Token friendly
Token ID8030
Feature activation+1.71e+00

Pos logit contributions
 therapist+0.046
edly+0.039
rily+0.039
cer+0.039
overed+0.038
Neg logit contributions
 named-0.026
 know-0.025
 have-0.025
 called-0.022
 done-0.022
Token face
Token ID1986
Feature activation+4.38e-01

Pos logit contributions
 therapist+0.067
edly+0.059
rily+0.057
cer+0.057
overed+0.056
Neg logit contributions
 named-0.048
 know-0.043
 have-0.042
 called-0.040
weet-0.038
Token dug
Token ID18735
Feature activation+1.30e+00

Pos logit contributions
 know+0.012
 named+0.012
atters+0.011
 have+0.011
 done+0.011
Neg logit contributions
 therapist-0.022
cer-0.018
 toddler-0.018
 Riley-0.017
 Clare-0.017
Token a
Token ID257
Feature activation+2.63e+00

Pos logit contributions
 therapist+0.069
 toddler+0.056
 Riley+0.055
 Clare+0.055
 Debbie+0.054
Neg logit contributions
 named-0.022
 know-0.021
 much-0.020
 called-0.020
 have-0.020
Token tunnel
Token ID13275
Feature activation+5.19e-01

Pos logit contributions
 therapist+0.106
cer+0.086
 toddler+0.086
 Riley+0.083
unia+0.083
Neg logit contributions
 know-0.067
 named-0.065
atters-0.063
 have-0.062
 done-0.060
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
 therapist+0.023
 toddler+0.019
 Riley+0.019
 Clare+0.019
cer+0.019
Neg logit contributions
 named-0.012
 know-0.011
atters-0.011
 have-0.011
 done-0.011
Token found
Token ID1043
Feature activation-2.52e-01

Pos logit contributions
 therapist+0.059
rier+0.051
 toddler+0.049
cer+0.049
 Clare+0.049
Neg logit contributions
 named-0.024
 know-0.024
 have-0.022
 called-0.021
 done-0.021
Token something
Token ID1223
Feature activation+4.38e+00

Pos logit contributions
 know+0.006
 named+0.005
atters+0.005
 have+0.005
 done+0.005
Neg logit contributions
 therapist-0.011
cer-0.009
 toddler-0.009
 Clare-0.009
 Riley-0.009
Token sparkling
Token ID39072
Feature activation+5.30e-01

Pos logit contributions
 therapist+0.200
cer+0.166
 toddler+0.163
 Clare+0.161
 Riley+0.161
Neg logit contributions
 know-0.095
 named-0.094
 have-0.090
 done-0.086
 called-0.084
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 therapist+0.027
 Riley+0.022
 Debbie+0.022
 Clare+0.022
 Amelia+0.021
Neg logit contributions
 named-0.010
 called-0.009
 done-0.009
 have-0.009
."-0.008
Token He
Token ID679
Feature activation-6.14e-02

Pos logit contributions
 therapist+0.007
cer+0.006
unia+0.006
rier+0.006
 toddler+0.006
Neg logit contributions
 know-0.002
 named-0.002
 have-0.002
atters-0.002
 One-0.002
Token was
Token ID373
Feature activation+9.15e-01

Pos logit contributions
 know+0.001
 named+0.001
atters+0.001
 have+0.001
 done+0.001
Neg logit contributions
 therapist-0.003
cer-0.002
 toddler-0.002
 Riley-0.002
 Clare-0.002
Token so
Token ID523
Feature activation-6.81e-01

Pos logit contributions
 therapist+0.042
 toddler+0.035
cer+0.035
 Riley+0.035
 Clare+0.034
Neg logit contributions
 know-0.017
atters-0.017
 named-0.016
 have-0.015
 done-0.015
Token loud
Token ID7812
Feature activation+1.44e-01

Pos logit contributions
 therapist+0.028
cer+0.022
 toddler+0.022
 Riley+0.022
 Clare+0.022
Neg logit contributions
 know-0.018
 named-0.018
atters-0.017
 have-0.017
 done-0.017
Token bark
Token ID21405
Feature activation+4.44e-01

Pos logit contributions
 therapist+0.006
cer+0.005
 toddler+0.005
 Riley+0.005
 Clare+0.005
Neg logit contributions
 know-0.004
 named-0.004
atters-0.003
 have-0.003
 done-0.003
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
 therapist+0.023
cer+0.020
 toddler+0.019
 Riley+0.019
unia+0.019
Neg logit contributions
 know-0.006
 named-0.006
atters-0.006
 have-0.005
 done-0.005
Token They
Token ID1119
Feature activation-5.19e-01

Pos logit contributions
 named+0.005
 know+0.005
 have+0.005
atters+0.004
 One+0.004
Neg logit contributions
 therapist-0.017
cer-0.015
unia-0.014
rier-0.014
 toddler-0.014
Token look
Token ID804
Feature activation+7.11e-01

Pos logit contributions
 know+0.009
 have+0.008
 named+0.008
atters+0.008
 done+0.007
Neg logit contributions
 therapist-0.026
cer-0.022
 toddler-0.022
rier-0.021
rily-0.021
Token up
Token ID510
Feature activation+4.50e+00

Pos logit contributions
 therapist+0.029
cer+0.023
 toddler+0.022
unia+0.022
 Clare+0.022
Neg logit contributions
 know-0.020
 named-0.020
 have-0.018
atters-0.018
 done-0.018
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 therapist+0.259
unia+0.215
cer+0.215
 toddler+0.215
 Riley+0.211
Neg logit contributions
 know-0.048
 named-0.047
 have-0.045
 done-0.038
 called-0.037
Token see
Token ID766
Feature activation-1.26e+00

Pos logit contributions
 therapist+0.068
rily+0.058
rier+0.058
edly+0.058
cer+0.057
Neg logit contributions
 know-0.009
 have-0.008
 named-0.006
 see-0.003
 called-0.003
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
 know+0.025
 named+0.025
 have+0.023
atters+0.022
 called+0.020
Neg logit contributions
 therapist-0.061
cer-0.050
 toddler-0.049
rily-0.049
unia-0.049
Token big
Token ID1263
Feature activation+1.88e-01

Pos logit contributions
 therapist+0.065
cer+0.052
rily+0.051
 Riley+0.051
 toddler+0.051
Neg logit contributions
 know-0.041
 named-0.040
 have-0.039
atters-0.037
 done-0.036
Token dog
Token ID3290
Feature activation+1.37e+00

Pos logit contributions
 therapist+0.008
rily+0.007
edly+0.007
 Riley+0.006
 Clare+0.006
Neg logit contributions
 named-0.005
 have-0.004
 know-0.004
atters-0.004
 called-0.004
Token On
Token ID1550
Feature activation+1.55e-01

Pos logit contributions
 therapist+0.016
 toddler+0.014
 Riley+0.013
 Clare+0.013
 Amelia+0.013
Neg logit contributions
 named-0.007
 know-0.007
atters-0.007
 done-0.007
aver-0.006
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
 therapist+0.007
cer+0.006
rier+0.006
 toddler+0.005
rily+0.005
Neg logit contributions
 named-0.004
 done-0.004
 have-0.004
 know-0.003
 One-0.003
Token day
Token ID1110
Feature activation+4.82e-01

Pos logit contributions
 know+0.042
 named+0.040
 have+0.039
 done+0.038
atters+0.037
Neg logit contributions
 therapist-0.087
cer-0.073
 toddler-0.070
 Riley-0.070
 Clare-0.070
Token of
Token ID286
Feature activation-1.66e+00

Pos logit contributions
 therapist+0.025
cer+0.021
unia+0.021
 toddler+0.021
rier+0.021
Neg logit contributions
 know-0.007
 named-0.007
 have-0.007
 done-0.006
atters-0.006
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
 named+0.037
 have+0.037
 know+0.036
 done+0.035
 called+0.031
Neg logit contributions
 therapist-0.076
rier-0.065
cer-0.064
rily-0.063
gers-0.061
Token task
Token ID4876
Feature activation+1.28e+00

Pos logit contributions
 know+0.050
 have+0.048
 named+0.047
 done+0.047
 One+0.043
Neg logit contributions
 therapist-0.084
cer-0.070
rily-0.068
 Riley-0.068
overed-0.068
Token she
Token ID673
Feature activation-9.79e-01

Pos logit contributions
 therapist+0.064
rily+0.058
rier+0.057
cer+0.057
unia+0.056
Neg logit contributions
 have-0.020
 named-0.020
 done-0.020
 know-0.019
 called-0.017
Token smiled
Token ID13541
Feature activation+1.16e+00

Pos logit contributions
 know+0.021
 named+0.020
 have+0.019
atters+0.019
 done+0.018
Neg logit contributions
 therapist-0.044
cer-0.037
rier-0.037
 toddler-0.036
 Riley-0.036
Token cheer
Token ID14042
Feature activation+2.03e+00

Pos logit contributions
 therapist+0.066
unia+0.057
 parcel+0.056
 Riley+0.055
cer+0.055
Neg logit contributions
 because-0.011
 named-0.010
 both-0.010
."-0.009
 called-0.009
Tokenfully
Token ID2759
Feature activation+1.53e+00

Pos logit contributions
 therapist+0.105
unia+0.092
 toddler+0.092
 parcel+0.090
cer+0.090
Neg logit contributions
 done-0.026
 because-0.025
."-0.025
 both-0.025
my-0.024
Token,
Token ID11
Feature activation+4.36e-01

Pos logit contributions
 therapist+0.076
cer+0.065
 toddler+0.063
unia+0.063
 Riley+0.063
Neg logit contributions
 know-0.025
 named-0.025
 have-0.023
 done-0.022
atters-0.022
Token!
Token ID0
Feature activation+4.51e-01

Pos logit contributions
 therapist+0.079
cer+0.066
 toddler+0.065
 Clare+0.064
 Riley+0.064
Neg logit contributions
 know-0.031
 named-0.030
 have-0.028
atters-0.027
 done-0.027
Token Max
Token ID5436
Feature activation+1.88e+00

Pos logit contributions
 therapist+0.021
 toddler+0.017
 Riley+0.017
cer+0.017
 Clare+0.017
Neg logit contributions
 know-0.009
atters-0.008
 named-0.008
 done-0.008
 have-0.008
Token was
Token ID373
Feature activation+1.36e+00

Pos logit contributions
 therapist+0.082
rier+0.068
cer+0.068
 Clare+0.066
 toddler+0.066
Neg logit contributions
 named-0.043
 know-0.043
 have-0.040
 done-0.038
 called-0.038
Token so
Token ID523
Feature activation-4.40e-01

Pos logit contributions
 therapist+0.066
cer+0.056
 toddler+0.054
unia+0.054
 Clare+0.053
Neg logit contributions
 know-0.025
 named-0.024
 have-0.022
 done-0.022
atters-0.022
Token scared
Token ID12008
Feature activation-9.39e-01

Pos logit contributions
 know+0.009
 named+0.009
atters+0.009
 have+0.009
 done+0.008
Neg logit contributions
 therapist-0.020
cer-0.016
 toddler-0.016
 Riley-0.016
 Clare-0.016
Token he
Token ID339
Feature activation+1.40e+00

Pos logit contributions
 named+0.017
 know+0.016
 have+0.015
 called+0.014
 done+0.014
Neg logit contributions
 therapist-0.046
cer-0.039
unia-0.039
 toddler-0.038
 Clare-0.038
Token started
Token ID2067
Feature activation-6.32e-01

Pos logit contributions
 therapist+0.062
cer+0.052
 toddler+0.051
 Clare+0.050
rier+0.050
Neg logit contributions
 named-0.031
 know-0.031
 have-0.029
 done-0.028
atters-0.027
Token to
Token ID284
Feature activation+8.63e-01

Pos logit contributions
 know+0.014
 named+0.014
 have+0.013
 done+0.012
 called+0.011
Neg logit contributions
 therapist-0.029
cer-0.024
unia-0.024
rier-0.024
 toddler-0.023
Token cry
Token ID3960
Feature activation-8.49e-01

Pos logit contributions
 therapist+0.039
cer+0.032
 toddler+0.032
rier+0.031
unia+0.031
Neg logit contributions
 know-0.019
 named-0.018
 have-0.017
atters-0.017
 done-0.016
Token.
Token ID13
Feature activation+8.43e-02

Pos logit contributions
 know+0.011
 named+0.010
atters+0.010
 have+0.010
 done+0.009
Neg logit contributions
 therapist-0.045
cer-0.038
 toddler-0.038
 Riley-0.037
unia-0.037
Token
Token ID198
Feature activation-3.28e-01

Pos logit contributions
 therapist+0.004
 toddler+0.003
 Riley+0.003
cer+0.003
 Clare+0.003
Neg logit contributions
 know-0.001
 named-0.001
atters-0.001
 have-0.001
 done-0.001
Token was
Token ID373
Feature activation+1.13e+00

Pos logit contributions
 therapist+0.005
cer+0.004
rier+0.004
 toddler+0.004
 Riley+0.004
Neg logit contributions
 know-0.003
 named-0.003
 have-0.003
atters-0.003
 done-0.003
Token red
Token ID2266
Feature activation-4.02e-01

Pos logit contributions
 therapist+0.050
cer+0.041
 toddler+0.041
 Riley+0.041
 Clare+0.040
Neg logit contributions
 know-0.024
 named-0.023
atters-0.022
 have-0.022
 done-0.021
Token and
Token ID290
Feature activation+9.55e-01

Pos logit contributions
 named+0.007
 know+0.007
atters+0.006
 have+0.006
 done+0.006
Neg logit contributions
 therapist-0.020
cer-0.016
 toddler-0.016
 Riley-0.016
 Clare-0.016
Token smelled
Token ID37609
Feature activation-2.28e-01

Pos logit contributions
 therapist+0.043
cer+0.037
rier+0.036
 toddler+0.035
unia+0.035
Neg logit contributions
 know-0.020
 named-0.020
 have-0.020
 done-0.019
 called-0.018
Token nice
Token ID3621
Feature activation+1.65e+00

Pos logit contributions
 know+0.004
 named+0.004
 have+0.004
 done+0.003
atters+0.003
Neg logit contributions
 therapist-0.011
cer-0.009
 toddler-0.009
 Riley-0.009
unia-0.009
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
 therapist+0.087
cer+0.074
 toddler+0.074
 Riley+0.073
unia+0.072
Neg logit contributions
 know-0.022
 named-0.021
atters-0.019
 have-0.019
 done-0.018
Token
Token ID198
Feature activation-2.45e-01

Pos logit contributions
 therapist+0.024
cer+0.020
 toddler+0.020
 Riley+0.020
unia+0.019
Neg logit contributions
 know-0.008
 named-0.008
atters-0.007
 have-0.007
 done-0.007
Token
Token ID198
Feature activation-5.49e-01

Pos logit contributions
atters+0.004
 know+0.004
 done+0.004
 named+0.004
ric+0.003
Neg logit contributions
 therapist-0.012
 Clare-0.010
 Riley-0.010
 toddler-0.010
cer-0.010
TokenE
Token ID36
Feature activation-4.16e-01

Pos logit contributions
 know+0.013
 named+0.012
atters+0.012
 done+0.011
 have+0.011
Neg logit contributions
 therapist-0.024
cer-0.019
 toddler-0.019
 Riley-0.019
 Clare-0.019
Tokenlla
Token ID8466
Feature activation-8.71e-01

Pos logit contributions
 know+0.016
 named+0.016
 done+0.015
atters+0.015
 have+0.015
Neg logit contributions
 therapist-0.011
 toddler-0.008
rier-0.008
 Riley-0.008
cer-0.008
Token's
Token ID338
Feature activation+8.79e-01

Pos logit contributions
 know+0.020
atters+0.020
 named+0.019
 have+0.018
 done+0.018
Neg logit contributions
 therapist-0.037
 toddler-0.030
cer-0.030
 Riley-0.029
unia-0.029
Token lit
Token ID6578
Feature activation-1.47e-01

Pos logit contributions
 therapist+0.044
cer+0.036
 toddler+0.036
 Riley+0.035
 Clare+0.035
Neg logit contributions
 know-0.023
 named-0.022
atters-0.021
 have-0.021
 done-0.020
Token the
Token ID262
Feature activation+6.44e-01

Pos logit contributions
 know+0.004
 named+0.004
 have+0.004
 done+0.003
atters+0.003
Neg logit contributions
 therapist-0.006
cer-0.005
 toddler-0.005
unia-0.005
gers-0.005
Token fire
Token ID2046
Feature activation+6.35e-01

Pos logit contributions
 therapist+0.030
 Riley+0.026
 Debbie+0.025
 Clare+0.025
cer+0.025
Neg logit contributions
 named-0.012
 have-0.012
 done-0.011
 know-0.011
 build-0.011
Token and
Token ID290
Feature activation+7.71e-01

Pos logit contributions
 therapist+0.029
cer+0.024
 toddler+0.024
 Clare+0.024
 Riley+0.024
Neg logit contributions
 know-0.012
 named-0.012
 have-0.012
atters-0.012
 done-0.011
Token started
Token ID2067
Feature activation-1.21e+00

Pos logit contributions
 therapist+0.035
cer+0.028
unia+0.028
 toddler+0.028
rier+0.028
Neg logit contributions
 know-0.017
 named-0.017
 have-0.016
 done-0.016
atters-0.015
Token to
Token ID284
Feature activation+9.66e-01

Pos logit contributions
 know+0.031
 named+0.030
atters+0.029
 have+0.028
 done+0.028
Neg logit contributions
 therapist-0.049
cer-0.040
 toddler-0.040
 Riley-0.039
unia-0.039
Token cook
Token ID4255
Feature activation+4.87e-01

Pos logit contributions
 therapist+0.041
 toddler+0.034
cer+0.034
 Riley+0.033
 Clare+0.033
Neg logit contributions
atters-0.020
 named-0.020
 know-0.019
 done-0.019
o-0.018
Token.
Token ID13
Feature activation-9.82e-02

Pos logit contributions
 therapist+0.026
 toddler+0.022
unia+0.021
cer+0.021
 Clare+0.021
Neg logit contributions
 know-0.007
 have-0.007
 done-0.007
 named-0.007
atters-0.006
Token
Token ID198
Feature activation-4.98e-01

Pos logit contributions
inations+0.003
ric+0.003
 supposed+0.003
py+0.003
 done+0.003
Neg logit contributions
 therapist-0.003
 Riley-0.002
ed-0.002
overed-0.002
 toddler-0.002
Token
Token ID198
Feature activation-7.43e-01

Pos logit contributions
ric+0.014
 earthquake+0.014
arter+0.014
atters+0.013
py+0.013
Neg logit contributions
 therapist-0.015
 Riley-0.013
Alex-0.012
 toddler-0.012
Eric-0.012
TokenAfter
Token ID3260
Feature activation+4.46e-01

Pos logit contributions
ric+0.025
atters+0.023
arter+0.023
 named+0.023
py+0.023
Neg logit contributions
 therapist-0.019
No-0.017
Re-0.016
gers-0.015
cer-0.015
Token new
Token ID649
Feature activation+4.97e-01

Pos logit contributions
 therapist+0.008
cer+0.006
 toddler+0.006
 Riley+0.006
 Clare+0.006
Neg logit contributions
 know-0.006
 named-0.006
atters-0.005
 have-0.005
 done-0.005
Token flower
Token ID15061
Feature activation-3.37e-01

Pos logit contributions
 therapist+0.022
 Riley+0.018
cer+0.018
 toddler+0.018
rier+0.018
Neg logit contributions
 named-0.011
 know-0.011
 have-0.010
."-0.009
 called-0.009
Token.
Token ID13
Feature activation+4.00e-01

Pos logit contributions
 know+0.007
 named+0.006
atters+0.006
 have+0.006
 done+0.006
Neg logit contributions
 therapist-0.016
cer-0.013
 toddler-0.013
 Riley-0.013
unia-0.012
Token
Token ID198
Feature activation-6.24e-02

Pos logit contributions
 therapist+0.017
 Riley+0.015
 toddler+0.014
 Clare+0.014
overed+0.014
Neg logit contributions
 done-0.009
 know-0.008
atters-0.008
aver-0.008
ric-0.008
Token
Token ID198
Feature activation-3.81e-01

Pos logit contributions
atters+0.002
ric+0.002
 done+0.002
 earthquake+0.002
arter+0.001
Neg logit contributions
 therapist-0.002
 Riley-0.002
 Debbie-0.002
 Amelia-0.002
 Clare-0.002
TokenAs
Token ID1722
Feature activation+1.20e-01

Pos logit contributions
 know+0.009
 named+0.008
atters+0.008
 done+0.008
 have+0.008
Neg logit contributions
 therapist-0.016
cer-0.013
 toddler-0.013
 Riley-0.013
unia-0.013
Token they
Token ID484
Feature activation+9.30e-01

Pos logit contributions
 therapist+0.005
cer+0.004
 toddler+0.004
 Riley+0.004
 Clare+0.003
Neg logit contributions
 know-0.003
 named-0.003
atters-0.003
 have-0.003
 done-0.003
Token got
Token ID1392
Feature activation-1.21e+00

Pos logit contributions
 therapist+0.036
 toddler+0.029
cer+0.029
unia+0.028
 Riley+0.028
Neg logit contributions
 know-0.025
atters-0.024
 named-0.023
 done-0.022
 have-0.022
Token closer
Token ID5699
Feature activation+1.84e+00

Pos logit contributions
 named+0.030
 done+0.028
 know+0.028
 have+0.027
 called+0.027
Neg logit contributions
 therapist-0.054
cer-0.045
rier-0.045
rily-0.043
 toddler-0.042
Token,
Token ID11
Feature activation-1.17e-01

Pos logit contributions
 therapist+0.106
cer+0.091
unia+0.090
rier+0.089
 toddler+0.088
Neg logit contributions
 know-0.019
 named-0.017
 done-0.016
 have-0.016
 called-0.015
Token they
Token ID484
Feature activation+1.32e+00

Pos logit contributions
 named+0.002
 have+0.002
 called+0.002
 both+0.002
 know+0.002
Neg logit contributions
 therapist-0.006
rier-0.005
rily-0.005
 parcel-0.005
cer-0.005
Token,
Token ID11
Feature activation-5.23e-01

Pos logit contributions
 therapist+0.086
rier+0.073
cer+0.073
unia+0.072
 toddler+0.072
Neg logit contributions
 named-0.017
 know-0.016
 have-0.015
 done-0.013
atters-0.012
Token there
Token ID612
Feature activation+1.17e+00

Pos logit contributions
 named+0.014
 know+0.014
 have+0.014
atters+0.013
 done+0.013
Neg logit contributions
 therapist-0.021
cer-0.016
rier-0.016
 toddler-0.016
 Riley-0.016
Token was
Token ID373
Feature activation-1.48e+00

Pos logit contributions
 therapist+0.056
rier+0.049
edly+0.046
rily+0.045
 Clare+0.045
Neg logit contributions
 have-0.025
 named-0.025
 know-0.024
 done-0.021
my-0.021
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
 know+0.033
 named+0.032
atters+0.031
 have+0.030
 done+0.030
Neg logit contributions
 therapist-0.064
cer-0.053
 toddler-0.052
 Riley-0.051
unia-0.051
Token shape
Token ID5485
Feature activation+2.61e+00

Pos logit contributions
 therapist+0.063
 toddler+0.053
cer+0.050
 Clare+0.049
 Riley+0.049
Neg logit contributions
atters-0.041
 know-0.041
 done-0.039
ric-0.037
 have-0.035
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
 therapist+0.122
cer+0.102
 toddler+0.102
unia+0.096
rier+0.096
Neg logit contributions
 know-0.055
atters-0.051
 done-0.047
 have-0.045
weet-0.043
Token It
Token ID632
Feature activation-6.85e-02

Pos logit contributions
 know+0.003
 named+0.003
atters+0.003
 done+0.003
ric+0.003
Neg logit contributions
 therapist-0.005
 toddler-0.004
 Riley-0.004
 Clare-0.004
cer-0.004
Token was
Token ID373
Feature activation+7.59e-01

Pos logit contributions
atters+0.003
 know+0.002
weet+0.002
ooters+0.002
arl+0.002
Neg logit contributions
 therapist-0.002
cer-0.001
 toddler-0.001
unia-0.001
 Riley-0.001
Token very
Token ID845
Feature activation-7.43e-03

Pos logit contributions
 therapist+0.028
 toddler+0.023
 Riley+0.022
cer+0.022
 Clare+0.022
Neg logit contributions
atters-0.021
 know-0.020
opl-0.017
ooters-0.017
 have-0.017
Token dirty
Token ID11841
Feature activation-1.70e-01

Pos logit contributions
atters+0.000
ric+0.000
my+0.000
 named+0.000
 know+0.000
Neg logit contributions
 therapist-0.000
 toddler-0.000
rier-0.000
 Riley-0.000
 Clare-0.000
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 named+0.002
 know+0.002
 done+0.002
 have+0.002
atters+0.001
Neg logit contributions
 therapist-0.009
unia-0.008
 Clare-0.008
 toddler-0.008
 Riley-0.008
Token liked
Token ID8288
Feature activation+6.31e-02

Pos logit contributions
 named+0.023
 have+0.020
 know+0.020
 called+0.018
 done+0.018
Neg logit contributions
 therapist-0.067
rier-0.061
rily-0.058
 toddler-0.058
cer-0.057
Token to
Token ID284
Feature activation-1.04e-01

Pos logit contributions
 therapist+0.003
cer+0.002
rier+0.002
unia+0.002
 toddler+0.002
Neg logit contributions
 know-0.001
 named-0.001
 have-0.001
 done-0.001
atters-0.001
Token look
Token ID804
Feature activation-1.08e+00

Pos logit contributions
atters+0.002
 know+0.002
 named+0.002
 done+0.002
o+0.002
Neg logit contributions
 therapist-0.004
 toddler-0.004
 Clare-0.004
cer-0.004
 Riley-0.003
Token at
Token ID379
Feature activation+2.28e+00

Pos logit contributions
 know+0.027
 named+0.027
atters+0.026
 have+0.025
 done+0.025
Neg logit contributions
 therapist-0.044
cer-0.035
 toddler-0.035
 Riley-0.034
 Clare-0.034
Token the
Token ID262
Feature activation-6.98e-01

Pos logit contributions
 therapist+0.104
cer+0.085
 toddler+0.083
rier+0.083
 Riley+0.082
Neg logit contributions
 know-0.050
 named-0.048
 have-0.046
atters-0.044
 done-0.044
Token flowers
Token ID12734
Feature activation-2.02e+00

Pos logit contributions
 named+0.014
 know+0.014
 have+0.013
 called+0.012
 done+0.012
Neg logit contributions
 therapist-0.032
cer-0.026
 Riley-0.026
 toddler-0.025
rily-0.025
Token and
Token ID290
Feature activation-4.38e-01

Pos logit contributions
 know+0.049
atters+0.048
 named+0.045
 done+0.043
ric+0.042
Neg logit contributions
 therapist-0.082
cer-0.067
unia-0.066
 toddler-0.066
rier-0.065
Token the
Token ID262
Feature activation-1.80e-01

Pos logit contributions
 know+0.009
 have+0.008
 named+0.008
 called+0.006
 done+0.006
Neg logit contributions
 therapist-0.022
cer-0.019
rier-0.019
rily-0.018
 toddler-0.018
Token insects
Token ID19435
Feature activation+7.97e-01

Pos logit contributions
 named+0.004
 know+0.004
 have+0.004
 done+0.003
 called+0.003
Neg logit contributions
 therapist-0.008
cer-0.007
 Riley-0.007
rily-0.007
 toddler-0.006
Token.
Token ID13
Feature activation-7.34e-02

Pos logit contributions
 therapist+0.040
cer+0.033
 toddler+0.033
 Riley+0.033
unia+0.032
Neg logit contributions
 know-0.013
 named-0.012
atters-0.012
 have-0.011
 done-0.011
Token She
Token ID1375
Feature activation-9.95e-01

Pos logit contributions
 done+0.002
 know+0.002
aver+0.001
inations+0.001
atters+0.001
Neg logit contributions
 therapist-0.003
 Clare-0.003
 Riley-0.003
 toddler-0.003
 Amelia-0.003
Token brave
Token ID14802
Feature activation+2.45e+00

Pos logit contributions
 therapist+0.039
 toddler+0.032
cer+0.032
 Riley+0.032
unia+0.031
Neg logit contributions
 know-0.018
atters-0.017
 named-0.016
 have-0.016
weet-0.016
Token.
Token ID13
Feature activation+1.59e-01

Pos logit contributions
 therapist+0.130
cer+0.109
unia+0.108
 toddler+0.108
 Clare+0.107
Neg logit contributions
 know-0.034
 named-0.033
 have-0.030
atters-0.028
 done-0.028
Token He
Token ID679
Feature activation-1.83e-01

Pos logit contributions
 therapist+0.006
 Riley+0.006
 toddler+0.005
 Reggie+0.005
 Debbie+0.005
Neg logit contributions
inations-0.004
py-0.003
weet-0.003
 named-0.003
 know-0.003
Token stayed
Token ID9658
Feature activation+2.78e+00

Pos logit contributions
 named+0.004
 know+0.004
 have+0.003
 called+0.003
 done+0.003
Neg logit contributions
 therapist-0.009
rier-0.007
cer-0.007
rily-0.007
 toddler-0.007
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 therapist+0.124
 toddler+0.100
 Clare+0.097
 Riley+0.096
 Amelia+0.095
Neg logit contributions
 named-0.070
 know-0.063
 done-0.062
 have-0.062
 called-0.061
Token
Token ID198
Feature activation-2.64e-01

Pos logit contributions
inations+0.000
py+0.000
weet+0.000
 named+0.000
 done+0.000
Neg logit contributions
 therapist-0.000
 Riley-0.000
 Reggie-0.000
 toddler-0.000
 Debbie-0.000
Token
Token ID198
Feature activation-5.80e-01

Pos logit contributions
ric+0.006
atters+0.006
weet+0.006
 done+0.006
arter+0.006
Neg logit contributions
 therapist-0.010
 Riley-0.009
 Debbie-0.008
 Clare-0.008
 toddler-0.008
Token"
Token ID1
Feature activation+2.05e+00

Pos logit contributions
 know+0.017
atters+0.016
 named+0.016
ric+0.015
 have+0.015
Neg logit contributions
 therapist-0.022
cer-0.017
 toddler-0.017
gers-0.016
 Riley-0.016
TokenHello
Token ID15496
Feature activation-6.95e-01

Pos logit contributions
 therapist+0.072
rily+0.065
rier+0.064
 parcel+0.059
cer+0.059
Neg logit contributions
 named-0.058
 know-0.057
 have-0.057
my-0.055
o-0.054
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
 know+0.012
 named+0.011
atters+0.010
 have+0.010
 done+0.010
Neg logit contributions
 therapist-0.034
cer-0.029
 toddler-0.029
 Riley-0.028
unia-0.028
Token dog
Token ID3290
Feature activation-1.94e+00

Pos logit contributions
 know+0.039
 named+0.038
 have+0.037
atters+0.035
 done+0.035
Neg logit contributions
 therapist-0.056
cer-0.045
rily-0.043
edly-0.043
rier-0.043
Token know
Token ID760
Feature activation-3.52e+00

Pos logit contributions
ric+0.047
opl+0.041
omo+0.038
 named+0.037
atters+0.035
Neg logit contributions
 therapist-0.089
 stranger-0.083
cer-0.083
 toddler-0.079
 brid-0.079
Token what
Token ID644
Feature activation+1.45e+00

Pos logit contributions
 know+0.057
atters+0.054
 named+0.052
 have+0.050
 done+0.049
Neg logit contributions
 therapist-0.172
cer-0.146
 toddler-0.145
 Riley-0.144
 Clare-0.142
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
 therapist+0.062
cer+0.051
rier+0.050
 toddler+0.050
rily+0.050
Neg logit contributions
 know-0.035
 named-0.035
 have-0.033
 done-0.032
atters-0.031
Token do
Token ID466
Feature activation-1.53e+00

Pos logit contributions
 named+0.031
atters+0.031
 know+0.029
ric+0.029
weet+0.028
Neg logit contributions
 therapist-0.056
cer-0.048
 toddler-0.046
 Riley-0.046
unia-0.046
Token.
Token ID13
Feature activation-2.56e-01

Pos logit contributions
 know+0.012
 have+0.012
 done+0.011
 named+0.011
."+0.008
Neg logit contributions
 therapist-0.092
 toddler-0.079
unia-0.078
 Riley-0.078
 Clare-0.078
Token Suddenly
Token ID24975
Feature activation-1.84e+00

Pos logit contributions
 know+0.005
weet+0.005
atters+0.005
 done+0.005
 named+0.005
Neg logit contributions
 therapist-0.011
 toddler-0.010
 Riley-0.009
edly-0.009
 Clare-0.009
Token,
Token ID11
Feature activation-4.87e-01

Pos logit contributions
 named+0.019
 have+0.015
 know+0.015
 done+0.013
 called+0.012
Neg logit contributions
 therapist-0.110
cer-0.095
rily-0.094
 toddler-0.093
rier-0.093
Token Lily
Token ID20037
Feature activation+1.96e-01

Pos logit contributions
 named+0.011
 know+0.010
 have+0.010
 done+0.009
 called+0.009
Neg logit contributions
 therapist-0.022
cer-0.019
rier-0.018
unia-0.018
 toddler-0.018
Token remembered
Token ID12086
Feature activation-5.68e-01

Pos logit contributions
 therapist+0.008
cer+0.006
 toddler+0.006
 Riley+0.006
unia+0.006
Neg logit contributions
 know-0.005
atters-0.005
 named-0.005
 have-0.005
 done-0.005
Token her
Token ID607
Feature activation+2.15e-01

Pos logit contributions
 named+0.010
 know+0.009
 have+0.008
 done+0.008
 called+0.007
Neg logit contributions
 therapist-0.030
cer-0.025
rier-0.025
 toddler-0.025
unia-0.024
Token mom
Token ID1995
Feature activation+8.47e-01

Pos logit contributions
 therapist+0.009
 toddler+0.007
cer+0.007
 Riley+0.007
unia+0.007
Neg logit contributions
atters-0.005
 know-0.005
 named-0.005
 have-0.005
weet-0.005
Token veil
Token ID30615
Feature activation-6.84e-01

Pos logit contributions
 know+0.015
."+0.015
 have+0.014
?"+0.013
 named+0.013
Neg logit contributions
 therapist-0.015
overed-0.015
edly-0.014
vered-0.014
cer-0.013
Token,
Token ID11
Feature activation+5.67e-02

Pos logit contributions
 know+0.013
 named+0.012
atters+0.012
 have+0.011
 done+0.011
Neg logit contributions
 therapist-0.032
cer-0.027
 toddler-0.027
 Riley-0.026
unia-0.026
Token and
Token ID290
Feature activation+2.35e-01

Pos logit contributions
 therapist+0.003
cer+0.002
 toddler+0.002
 Riley+0.002
unia+0.002
Neg logit contributions
 know-0.001
 named-0.001
 have-0.001
atters-0.001
 done-0.001
Token I
Token ID314
Feature activation+9.54e-01

Pos logit contributions
 therapist+0.011
cer+0.009
rily+0.009
rier+0.009
edly+0.009
Neg logit contributions
 know-0.006
 have-0.005
 named-0.005
 called-0.005
 done-0.005
Token will
Token ID481
Feature activation+1.68e+00

Pos logit contributions
 therapist+0.044
cer+0.037
 toddler+0.037
 Riley+0.036
unia+0.036
Neg logit contributions
 know-0.018
 named-0.018
atters-0.017
 have-0.016
 done-0.016
Token never
Token ID1239
Feature activation-2.53e+00

Pos logit contributions
 therapist+0.077
cer+0.064
 toddler+0.063
 Clare+0.063
 Riley+0.063
Neg logit contributions
atters-0.031
 named-0.030
 know-0.030
 done-0.028
o-0.027
Token let
Token ID1309
Feature activation-5.92e-01

Pos logit contributions
atters+0.050
 named+0.047
ric+0.045
 Yesterday+0.043
 know+0.042
Neg logit contributions
 therapist-0.108
cer-0.092
 toddler-0.091
unia-0.089
 Clare-0.088
Token anything
Token ID1997
Feature activation+3.75e-01

Pos logit contributions
 know+0.018
 named+0.017
 have+0.017
atters+0.016
 done+0.016
Neg logit contributions
 therapist-0.022
cer-0.017
 toddler-0.017
 Riley-0.016
unia-0.016
Token in
Token ID287
Feature activation-1.86e+00

Pos logit contributions
 therapist+0.017
cer+0.014
 toddler+0.014
 Riley+0.014
unia+0.014
Neg logit contributions
 know-0.008
 named-0.007
 have-0.007
atters-0.007
 done-0.007
Token."
Token ID526
Feature activation+3.48e-01

Pos logit contributions
 know+0.044
 have+0.042
 named+0.040
 done+0.038
atters+0.036
Neg logit contributions
 therapist-0.084
 toddler-0.067
cer-0.067
edly-0.066
rier-0.066
Token
Token ID198
Feature activation-8.60e-02

Pos logit contributions
 therapist+0.015
 toddler+0.012
 Clare+0.012
 Riley+0.012
 Amelia+0.012
Neg logit contributions
 named-0.007
atters-0.007
 know-0.007
ric-0.007
 done-0.007
Token there
Token ID612
Feature activation+9.43e-01

Pos logit contributions
 know+0.038
atters+0.036
 named+0.035
 done+0.034
weet+0.034
Neg logit contributions
 therapist-0.037
cer-0.030
 toddler-0.029
unia-0.029
 Riley-0.028
Token was
Token ID373
Feature activation-1.75e+00

Pos logit contributions
 therapist+0.038
rier+0.031
 toddler+0.030
cer+0.030
 Clare+0.030
Neg logit contributions
 know-0.025
 named-0.024
 have-0.024
 done-0.023
atters-0.023
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
 know+0.050
atters+0.049
ric+0.046
 done+0.044
 named+0.044
Neg logit contributions
 therapist-0.063
 toddler-0.052
 Clare-0.050
cer-0.050
 Riley-0.050
Token little
Token ID1310
Feature activation-6.86e-01

Pos logit contributions
 therapist+0.050
 toddler+0.042
cer+0.040
 Clare+0.039
 Riley+0.039
Neg logit contributions
atters-0.033
 know-0.033
 done-0.031
ric-0.030
 have-0.029
Token girl
Token ID2576
Feature activation+7.25e-01

Pos logit contributions
 named+0.024
 know+0.022
 have+0.022
my+0.021
 called+0.021
Neg logit contributions
 therapist-0.022
overed-0.017
rily-0.016
edly-0.016
 Riley-0.016
Token named
Token ID3706
Feature activation-4.41e+00

Pos logit contributions
 therapist+0.038
cer+0.033
 toddler+0.032
unia+0.032
 Clare+0.031
Neg logit contributions
atters-0.009
 know-0.009
 done-0.008
ric-0.007
upid-0.007
Token Lily
Token ID20037
Feature activation+5.76e-01

Pos logit contributions
opl+0.136
ric+0.133
aver+0.122
 know+0.120
inations+0.118
Neg logit contributions
 therapist-0.142
 Clare-0.134
 Amelia-0.129
 Ivy-0.125
 Clara-0.125
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
 therapist+0.020
cer+0.019
 toddler+0.017
unia+0.017
ingly+0.016
Neg logit contributions
atters-0.015
inations-0.014
 done-0.014
ooters-0.014
 dam-0.013
Token She
Token ID1375
Feature activation-9.12e-01

Pos logit contributions
inations+0.016
 done+0.016
ric+0.016
aver+0.016
 know+0.016
Neg logit contributions
 therapist-0.012
 Clare-0.010
 Riley-0.010
 Amelia-0.010
 Ivy-0.010
Token loved
Token ID6151
Feature activation+4.16e-01

Pos logit contributions
atters+0.024
 know+0.023
ric+0.022
weet+0.022
upid+0.021
Neg logit contributions
 therapist-0.036
cer-0.028
 toddler-0.028
unia-0.027
 Riley-0.027
Token to
Token ID284
Feature activation+1.37e-01

Pos logit contributions
 therapist+0.016
 toddler+0.013
cer+0.013
 Riley+0.013
 Clare+0.012
Neg logit contributions
 know-0.011
atters-0.011
 named-0.011
 have-0.010
 done-0.010
Token there
Token ID612
Feature activation+7.57e-01

Pos logit contributions
 therapist+0.085
cer+0.072
 toddler+0.071
rier+0.071
unia+0.070
Neg logit contributions
 know-0.019
 named-0.019
 have-0.018
 done-0.016
atters-0.015
Token lived
Token ID5615
Feature activation-1.90e+00

Pos logit contributions
 therapist+0.032
rier+0.026
cer+0.025
 toddler+0.025
 Clare+0.025
Neg logit contributions
 know-0.019
 named-0.019
 have-0.019
 done-0.017
atters-0.017
Token a
Token ID257
Feature activation+4.80e-01

Pos logit contributions
 know+0.047
atters+0.045
 named+0.044
 have+0.042
 done+0.042
Neg logit contributions
 therapist-0.077
cer-0.063
 toddler-0.063
 Riley-0.062
unia-0.062
Token generous
Token ID14431
Feature activation+2.49e+00

Pos logit contributions
 therapist+0.019
 toddler+0.016
cer+0.015
 Clare+0.015
 Riley+0.015
Neg logit contributions
atters-0.013
 know-0.013
 done-0.012
ric-0.012
opl-0.011
Token dinosaur
Token ID28257
Feature activation-8.32e-01

Pos logit contributions
 therapist+0.092
cer+0.073
 toddler+0.072
 Riley+0.071
unia+0.070
Neg logit contributions
 know-0.071
 named-0.071
 have-0.067
atters-0.066
 done-0.064
Token named
Token ID3706
Feature activation-4.31e+00

Pos logit contributions
 know+0.015
atters+0.015
 done+0.014
ric+0.012
upid+0.011
Neg logit contributions
 therapist-0.041
cer-0.035
 toddler-0.034
unia-0.033
gers-0.033
Token Todd
Token ID14377
Feature activation+1.05e-01

Pos logit contributions
 know+0.117
atters+0.114
 named+0.110
 done+0.109
 have+0.106
Neg logit contributions
 therapist-0.164
 toddler-0.133
 Riley-0.132
cer-0.132
 Clare-0.130
Token.
Token ID13
Feature activation+8.45e-03

Pos logit contributions
 therapist+0.004
cer+0.004
 toddler+0.003
unia+0.003
rily+0.003
Neg logit contributions
atters-0.003
 know-0.002
 Yesterday-0.002
 done-0.002
opl-0.002
Token Every
Token ID3887
Feature activation-7.29e-01

Pos logit contributions
 therapist+0.000
cer+0.000
 toddler+0.000
 Riley+0.000
 Clare+0.000
Neg logit contributions
 know-0.000
 named-0.000
atters-0.000
 have-0.000
 done-0.000
Token day
Token ID1110
Feature activation+1.71e+00

Pos logit contributions
 know+0.012
 named+0.011
atters+0.011
 have+0.011
 done+0.010
Neg logit contributions
 therapist-0.036
cer-0.030
 toddler-0.030
 Riley-0.030
 Clare-0.030
Token,
Token ID11
Feature activation+1.94e-01

Pos logit contributions
 therapist+0.089
cer+0.074
 toddler+0.074
unia+0.073
rier+0.073
Neg logit contributions
 know-0.026
 named-0.024
 have-0.023
 done-0.022
atters-0.020
Token Jill
Token ID22772
Feature activation-1.52e+00

Pos logit contributions
 therapist+0.200
cer+0.168
 toddler+0.167
 Riley+0.164
unia+0.163
Neg logit contributions
 know-0.070
 named-0.066
atters-0.063
 have-0.062
 done-0.060
Token,
Token ID11
Feature activation+3.72e-01

Pos logit contributions
 know+0.024
 named+0.023
 have+0.022
 done+0.021
atters+0.020
Neg logit contributions
 therapist-0.077
cer-0.065
 toddler-0.065
 Riley-0.064
rier-0.064
Token let
Token ID1309
Feature activation+8.44e-01

Pos logit contributions
 therapist+0.017
cer+0.014
unia+0.014
edly+0.013
rily+0.013
Neg logit contributions
 have-0.009
 named-0.009
 know-0.009
 done-0.008
 called-0.007
Tokenâ
Token ID22940
Feature activation+3.56e-01

Pos logit contributions
 therapist+0.036
cer+0.030
 toddler+0.029
 Riley+0.029
unia+0.029
Neg logit contributions
 know-0.020
 named-0.019
 have-0.018
atters-0.018
 done-0.018
Token
Token ID26391
Feature activation-2.07e+00

Pos logit contributions
 therapist+0.013
cer+0.010
 toddler+0.010
unia+0.010
 Clare+0.010
Neg logit contributions
 know-0.010
 named-0.010
atters-0.010
 have-0.009
ric-0.009
Token
Token ID8151
Feature activation-4.30e+00

Pos logit contributions
 know+0.053
 named+0.051
atters+0.050
 done+0.048
weet+0.047
Neg logit contributions
 therapist-0.082
cer-0.066
 toddler-0.065
 Riley-0.065
 Clare-0.063
Tokens
Token ID82
Feature activation-3.42e+00

Pos logit contributions
 know+0.121
atters+0.115
 named+0.106
weet+0.105
 done+0.105
Neg logit contributions
 therapist-0.157
cer-0.128
 toddler-0.126
 Riley-0.124
 Clare-0.117
Token clean
Token ID3424
Feature activation+2.11e-01

Pos logit contributions
 know+0.054
 named+0.053
atters+0.052
 done+0.047
 have+0.047
Neg logit contributions
 therapist-0.166
cer-0.141
 toddler-0.139
 Riley-0.137
 Clare-0.136
Token it
Token ID340
Feature activation+3.03e-01

Pos logit contributions
 therapist+0.009
edly+0.007
unia+0.007
overed+0.007
 toddler+0.007
Neg logit contributions
."-0.005
 have-0.005
 done-0.005
 named-0.004
 know-0.004
Token up
Token ID510
Feature activation+2.28e+00

Pos logit contributions
 therapist+0.010
 toddler+0.007
 Clare+0.007
cer+0.007
 Riley+0.007
Neg logit contributions
 know-0.011
atters-0.010
 have-0.010
 named-0.010
 done-0.010
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
 therapist+0.123
 toddler+0.104
cer+0.102
unia+0.102
 Riley+0.102
Neg logit contributions
 know-0.030
 named-0.028
 have-0.027
 done-0.027
atters-0.024
TokenOne
Token ID3198
Feature activation-2.93e+00

Pos logit contributions
 therapist+0.128
cer+0.109
 toddler+0.108
 Riley+0.107
unia+0.106
Neg logit contributions
 know-0.036
atters-0.033
 named-0.032
 done-0.030
weet-0.029
Token day
Token ID1110
Feature activation+1.14e+00

Pos logit contributions
atters+0.072
 know+0.062
 named+0.062
ric+0.061
py+0.056
Neg logit contributions
 therapist-0.119
 toddler-0.101
cer-0.096
 Riley-0.095
gers-0.094
Token,
Token ID11
Feature activation-3.79e-01

Pos logit contributions
 therapist+0.066
rier+0.057
 toddler+0.056
rily+0.056
cer+0.056
Neg logit contributions
 named-0.012
 have-0.011
 done-0.010
 know-0.010
 called-0.008
Token a
Token ID257
Feature activation+8.87e-01

Pos logit contributions
 named+0.007
 have+0.007
 know+0.006
o+0.005
 done+0.005
Neg logit contributions
 therapist-0.020
rier-0.017
 toddler-0.016
 parcel-0.016
 Riley-0.016
Token girl
Token ID2576
Feature activation+7.33e-01

Pos logit contributions
 therapist+0.038
 toddler+0.032
cer+0.031
 Riley+0.030
unia+0.030
Neg logit contributions
 know-0.020
atters-0.020
 named-0.019
 done-0.018
 have-0.018
Token named
Token ID3706
Feature activation-4.24e+00

Pos logit contributions
 therapist+0.045
cer+0.040
 toddler+0.040
unia+0.039
 Riley+0.039
Neg logit contributions
 know-0.003
atters-0.002
 done-0.001
 named-0.001
 have-0.000
Token Lucy
Token ID22162
Feature activation+5.51e-01

Pos logit contributions
opl+0.124
ric+0.122
inations+0.109
 brothers+0.106
aver+0.105
Neg logit contributions
 therapist-0.138
 Clare-0.126
 Amelia-0.124
 toddler-0.120
unia-0.119
Token went
Token ID1816
Feature activation-1.31e+00

Pos logit contributions
 therapist+0.024
cer+0.019
 toddler+0.019
 Riley+0.019
unia+0.019
Neg logit contributions
 know-0.013
 named-0.012
atters-0.012
 have-0.012
 done-0.011
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
 know+0.042
atters+0.039
ric+0.039
 named+0.038
weet+0.037
Neg logit contributions
 therapist-0.043
 toddler-0.032
 Riley-0.032
cer-0.032
gers-0.031
Token her
Token ID607
Feature activation-2.43e-01

Pos logit contributions
 therapist+0.040
 toddler+0.033
 Clare+0.033
 Riley+0.032
cer+0.031
Neg logit contributions
atters-0.026
 know-0.025
 named-0.025
 done-0.024
ric-0.023
Token friend
Token ID1545
Feature activation-2.03e+00

Pos logit contributions
 know+0.006
atters+0.006
 named+0.006
 done+0.006
 have+0.005
Neg logit contributions
 therapist-0.010
 toddler-0.008
cer-0.008
 Riley-0.008
 Clare-0.008
Token other
Token ID584
Feature activation-3.07e+00

Pos logit contributions
 know+0.030
atters+0.030
 named+0.030
 have+0.027
 done+0.027
Neg logit contributions
 therapist-0.049
 toddler-0.040
cer-0.039
 Clare-0.038
unia-0.038
Token animals
Token ID4695
Feature activation-2.10e+00

Pos logit contributions
 know+0.075
atters+0.072
 named+0.072
 done+0.068
 have+0.068
Neg logit contributions
 therapist-0.126
cer-0.103
 toddler-0.103
unia-0.099
 Riley-0.099
Token for
Token ID329
Feature activation+4.22e-01

Pos logit contributions
 know+0.033
 named+0.032
 have+0.030
atters+0.028
 done+0.028
Neg logit contributions
 therapist-0.106
cer-0.089
 toddler-0.089
 Riley-0.088
 Clare-0.087
Token many
Token ID867
Feature activation-2.68e+00

Pos logit contributions
 therapist+0.017
cer+0.014
 toddler+0.014
 Riley+0.013
 Clare+0.013
Neg logit contributions
 know-0.011
atters-0.010
 named-0.010
 have-0.010
 done-0.009
Token happy
Token ID3772
Feature activation-2.62e+00

Pos logit contributions
 know+0.054
 named+0.053
 have+0.050
 done+0.048
atters+0.043
Neg logit contributions
 therapist-0.127
 Riley-0.105
cer-0.104
 toddler-0.103
 Clare-0.103
Token years
Token ID812
Feature activation-4.63e+00

Pos logit contributions
 know+0.065
 named+0.062
 have+0.061
 done+0.058
atters+0.053
Neg logit contributions
 therapist-0.113
unia-0.091
 Riley-0.091
rily-0.089
cer-0.088
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
 named+0.055
 know+0.054
 have+0.047
 done+0.047
atters+0.045
Neg logit contributions
 therapist-0.249
cer-0.214
 toddler-0.212
unia-0.210
 Riley-0.209
Token<|endoftext|>
Token ID50256
Feature activation+2.66e+00

Pos logit contributions
 done+0.004
 know+0.004
atters+0.004
 named+0.004
inations+0.003
Neg logit contributions
 therapist-0.010
 Riley-0.009
 toddler-0.009
 Clare-0.008
 Amelia-0.008
TokenOnce
Token ID7454
Feature activation-7.32e-01

Pos logit contributions
 therapist+0.122
cer+0.101
 toddler+0.101
 Riley+0.099
unia+0.098
Neg logit contributions
 know-0.054
 named-0.051
atters-0.050
 done-0.047
 have-0.047
Token upon
Token ID2402
Feature activation+1.08e+00

Pos logit contributions
atters+0.022
 know+0.022
 named+0.021
weet+0.020
 have+0.020
Neg logit contributions
 therapist-0.024
 toddler-0.019
cer-0.019
 Riley-0.019
 Clare-0.019
Token a
Token ID257
Feature activation-2.54e+00

Pos logit contributions
 therapist+0.045
cer+0.036
 toddler+0.036
 Riley+0.035
 Clare+0.035
Neg logit contributions
 know-0.025
 named-0.025
atters-0.025
 done-0.023
 have-0.023