TOP ACTIVATIONS
MAX = 20.862

, I will marry you !" They had a big party
properly so he could fly . The ant was
her mom went back home . L ily
Can you spin for me ?" Spot spun around and around
bled on the cherry together . It was a very tasty
together and share their sand . Tom agreed and they started
and played together every day . <|endoftext|> Once upon a time
to enter a cooking competition . Tim my was so excited
to play the trumpet too . One day , Ellie forgot
the sun started to set . As they were
swim with you , duck !" And they sw am together
and they took a nap . They were happy and tired
you . Come with me ." The daughter was
I had so much fun ." Max replied , " Me
" Yes , you can ." Lily put the wrap on
a flower with her pen . Her friend liked it very
folder away and played together . <|endoftext|> Once upon a time
me find my way home ?" B enny thought
playing together and having fun . He was never lonely again
's fix it for you ." She took some pins and

MIN ACTIVATIONS
MIN = -73.295

't know the rh in oc eros was wild . Can
he would take the av oc ados and scatter them around
am taking the rh in oc eros to a new home
give to the rh in oc eros . Lily walked to
as he opened the ch ocol ates and started eating them
reward . It was ch ocol ates and sweets !
giving him too many ch ocol ates ." The king learned
it to the rh in oc eros . When he tasted
that the bad rh in oc eros was now nice .
there was a rh in oc eros who was always naughty
many more adventures with the mer maid . The
quiet compared to the noisy se ag ull s outside .
children put on their bath ro bes and ran around in
. The more , the mer rier . Come to my
" The rh in oc eros is in there ,
his head out of his bur row and saw something new
't touch the rh in oc eros ! It might bite
They find the rh in oc eros . They are gentle
Everyday he would take the av oc ados and scatter them
was a big rh in oc eros . He was troubled

INTERVAL -2.677 - 20.862
CONTAINS 64.359%

any dog ." Mom says . She looks up
bear was so happy . He said , " I did
high in the sky . When they landed , Lily said
were playing in the garden . They liked to swing on
They cry . They wish they had shared the car and

INTERVAL -26.217 - -2.677
CONTAINS 35.006%

story ?" L ily nodded and said , "
a lot and Ben cannot go outside . He is sad
beetles , so they can eat !" His friends
play around , maybe the hunter will think we 're not
store and bought the blue color . They came back home

INTERVAL -49.756 - -26.217
CONTAINS 0.633%

Please help me shake this rad ish , it 's too
She had peanut butter and jelly and some apples . After
a great job on that castle Sam , I 'm so
saw a picture of a key . It was in the
bit of cocoa on his fur ! After

INTERVAL -73.295 - -49.756
CONTAINS 0.002%

give to the rh in oc eros . Lily walked to
he would take the av oc ados and scatter them around
They find the rh in oc eros . They are gentle
them and never threw a tant rum again . <|endoftext|> Once
't know the rh in oc eros was wild . Can
Token,
Token ID11
Feature activation+1.17e+01

Pos logit contributions
 Curious+0.099
 Everything+0.098
Um+0.098
 �+0.097
Well+0.097
Neg logit contributions
ins-0.065
cer-0.061
ies-0.056
ates-0.056
ishing-0.054
Token I
Token ID314
Feature activation+1.50e+01

Pos logit contributions
Well+0.085
 Curious+0.084
Listen+0.083
Um+0.083
 Everything+0.079
Neg logit contributions
ies-0.092
--0.091
ins-0.090
cer-0.087
ie-0.081
Token will
Token ID481
Feature activation+1.27e+01

Pos logit contributions
Well+0.102
 Curious+0.101
Listen+0.100
Um+0.100
 Everything+0.096
Neg logit contributions
ies-0.126
ins-0.123
--0.123
cer-0.120
ie-0.110
Token marry
Token ID12479
Feature activation+8.22e+00

Pos logit contributions
Well+0.094
 Curious+0.093
Listen+0.093
Um+0.093
 Everything+0.088
Neg logit contributions
ins-0.097
ies-0.096
cer-0.093
--0.092
ishing-0.084
Token you
Token ID345
Feature activation+1.01e+01

Pos logit contributions
Um+0.052
Listen+0.052
That+0.052
Well+0.051
Hi+0.051
Neg logit contributions
ins-0.068
ies-0.067
ishing-0.067
cer-0.063
kin-0.061
Token!"
Token ID2474
Feature activation+2.08e+01

Pos logit contributions
 Curious+0.093
 One+0.091
 �+0.090
Um+0.089
Well+0.089
Neg logit contributions
ins-0.048
cer-0.047
ishing-0.044
ies-0.043
web-0.038
Token They
Token ID1119
Feature activation+1.49e+01

Pos logit contributions
Well+0.139
 Curious+0.137
Listen+0.136
Um+0.136
 Everything+0.129
Neg logit contributions
ies-0.176
ins-0.173
--0.171
cer-0.168
ie-0.154
Token had
Token ID550
Feature activation+1.49e+01

Pos logit contributions
Well+0.103
 Curious+0.103
Listen+0.102
Um+0.102
 Everything+0.097
Neg logit contributions
ies-0.125
--0.120
ins-0.119
cer-0.117
ie-0.110
Token a
Token ID257
Feature activation+1.07e+01

Pos logit contributions
Well+0.116
 Curious+0.114
Listen+0.114
Um+0.113
 Everything+0.109
Neg logit contributions
ies-0.110
ins-0.108
--0.106
cer-0.105
ie-0.095
Token big
Token ID1263
Feature activation+7.43e+00

Pos logit contributions
Well+0.075
 Curious+0.073
Um+0.073
Listen+0.072
 Everything+0.070
Neg logit contributions
ies-0.090
--0.089
ins-0.086
cer-0.085
ie-0.078
Token party
Token ID2151
Feature activation+7.29e+00

Pos logit contributions
 Curious+0.052
Well+0.052
Listen+0.051
Um+0.050
 Everything+0.049
Neg logit contributions
ins-0.061
ies-0.058
cer-0.057
--0.055
ates-0.051
Token properly
Token ID6105
Feature activation+5.31e+00

Pos logit contributions
 Curious+0.016
Well+0.016
 Everywhere+0.016
 �+0.016
Listen+0.016
Neg logit contributions
ins-0.011
ies-0.010
cer-0.010
--0.009
ates-0.009
Token so
Token ID523
Feature activation+1.31e+01

Pos logit contributions
Well+0.053
 Curious+0.052
Listen+0.052
 Everywhere+0.051
 �+0.051
Neg logit contributions
ins-0.027
ies-0.026
cer-0.024
--0.021
ie-0.020
Token he
Token ID339
Feature activation+1.07e+01

Pos logit contributions
Well+0.081
Listen+0.078
 Curious+0.078
Um+0.077
Over+0.074
Neg logit contributions
ies-0.118
ins-0.118
cer-0.114
--0.113
ie-0.104
Token could
Token ID714
Feature activation+1.20e+01

Pos logit contributions
Well+0.050
 Curious+0.048
Um+0.048
Listen+0.048
Over+0.044
Neg logit contributions
ies-0.111
ins-0.111
cer-0.108
--0.108
ie-0.100
Token fly
Token ID6129
Feature activation+6.00e+00

Pos logit contributions
Well+0.075
 Curious+0.074
Um+0.073
Listen+0.073
Over+0.070
Neg logit contributions
ins-0.106
ies-0.105
cer-0.102
--0.101
ishing-0.092
Token.
Token ID13
Feature activation+2.02e+01

Pos logit contributions
 �+0.042
 Everywhere+0.042
 Curious+0.041
 daunting+0.041
Listen+0.040
Neg logit contributions
ins-0.049
ies-0.044
cer-0.042
ates-0.041
akers-0.040
Token
Token ID198
Feature activation+1.76e+01

Pos logit contributions
Um+0.100
Listen+0.098
Well+0.091
Over+0.089
 Curious+0.088
Neg logit contributions
--0.202
ins-0.200
ies-0.199
cer-0.193
er-0.188
Token
Token ID198
Feature activation+1.77e+01

Pos logit contributions
Um+0.125
 Curious+0.118
Listen+0.116
 daunting+0.115
Well+0.115
Neg logit contributions
--0.141
ins-0.136
ies-0.131
ie-0.126
cer-0.123
TokenThe
Token ID464
Feature activation+1.18e+01

Pos logit contributions
 Curious+0.109
 There+0.109
 Everything+0.105
Um+0.104
 daunting+0.101
Neg logit contributions
--0.144
ins-0.142
e-0.137
ep-0.136
er-0.133
Token ant
Token ID1885
Feature activation+4.23e+00

Pos logit contributions
Well+0.111
 Curious+0.111
Um+0.111
Listen+0.110
 Unfortunately+0.109
Neg logit contributions
ies-0.068
--0.067
ins-0.064
cer-0.060
ie-0.057
Token was
Token ID373
Feature activation+1.45e+01

Pos logit contributions
Well+0.028
 Curious+0.027
Listen+0.027
Um+0.027
 Everything+0.026
Neg logit contributions
ies-0.036
ins-0.036
--0.035
cer-0.035
ie-0.032
Token her
Token ID607
Feature activation+1.05e+01

Pos logit contributions
Well+0.133
 Curious+0.132
Listen+0.131
Um+0.130
 Everything+0.126
Neg logit contributions
ins-0.101
ies-0.100
cer-0.096
--0.096
ates-0.085
Token mom
Token ID1995
Feature activation+8.87e+00

Pos logit contributions
Well+0.070
 Curious+0.070
Um+0.069
Listen+0.068
 Everything+0.065
Neg logit contributions
ins-0.089
ies-0.087
cer-0.087
--0.084
ishing-0.078
Token went
Token ID1816
Feature activation+1.30e+01

Pos logit contributions
 Curious+0.062
Um+0.062
Well+0.061
Listen+0.061
 Everywhere+0.059
Neg logit contributions
ies-0.076
--0.074
ins-0.072
cer-0.070
ie-0.068
Token back
Token ID736
Feature activation+1.15e+01

Pos logit contributions
Well+0.113
 Curious+0.113
Listen+0.112
Um+0.111
 Everything+0.108
Neg logit contributions
ies-0.082
ins-0.081
cer-0.078
--0.077
ie-0.069
Token home
Token ID1363
Feature activation+9.76e+00

Pos logit contributions
 Curious+0.088
Well+0.086
Listen+0.086
Um+0.083
 Everything+0.083
Neg logit contributions
ins-0.085
ies-0.083
cer-0.082
--0.076
kin-0.076
Token.
Token ID13
Feature activation+2.01e+01

Pos logit contributions
 Curious+0.088
 Unfortunately+0.087
Well+0.087
 Sadly+0.086
Listen+0.083
Neg logit contributions
ins-0.056
cer-0.055
ies-0.053
ie-0.048
ates-0.048
Token
Token ID220
Feature activation+1.09e+01

Pos logit contributions
Um+0.152
Listen+0.149
Over+0.145
Well+0.144
Hmm+0.142
Neg logit contributions
ies-0.136
--0.133
."-0.130
.-0.127
ins-0.126
Token
Token ID198
Feature activation+1.77e+01

Pos logit contributions
Well+0.055
 Curious+0.054
Listen+0.053
Um+0.053
 Everything+0.050
Neg logit contributions
ies-0.110
ins-0.109
--0.107
cer-0.106
ie-0.099
Token
Token ID198
Feature activation+1.79e+01

Pos logit contributions
Um+0.110
Well+0.107
 Curious+0.105
Listen+0.105
Hmm+0.100
Neg logit contributions
ies-0.154
--0.153
ins-0.153
cer-0.144
ie-0.138
TokenL
Token ID43
Feature activation+6.71e+00

Pos logit contributions
 Curious+0.111
Um+0.108
 Everything+0.108
Well+0.102
 daunting+0.101
Neg logit contributions
--0.156
ins-0.148
ies-0.146
cer-0.141
e-0.136
Tokenily
Token ID813
Feature activation+1.10e+01

Pos logit contributions
Well+0.044
Listen+0.042
Over+0.040
 Curious+0.039
 Everything+0.039
Neg logit contributions
cer-0.056
ies-0.054
--0.051
ates-0.050
ishing-0.050
Token Can
Token ID1680
Feature activation+1.49e+01

Pos logit contributions
Well+0.081
Um+0.079
Listen+0.079
 Curious+0.074
Over+0.074
Neg logit contributions
cer-0.127
--0.126
ies-0.124
ins-0.123
ab-0.115
Token you
Token ID345
Feature activation+1.37e+01

Pos logit contributions
 Curious+0.131
Well+0.131
Listen+0.129
Um+0.128
 daunting+0.126
Neg logit contributions
--0.088
ies-0.086
ins-0.080
cer-0.078
ie-0.078
Token spin
Token ID7906
Feature activation+7.05e+00

Pos logit contributions
Well+0.095
 Curious+0.093
Listen+0.093
Um+0.092
 Everything+0.088
Neg logit contributions
ies-0.113
--0.110
ins-0.109
cer-0.107
ie-0.100
Token for
Token ID329
Feature activation+1.20e+01

Pos logit contributions
 Curious+0.056
Well+0.055
Listen+0.055
 Everywhere+0.054
Um+0.053
Neg logit contributions
ins-0.052
ies-0.050
cer-0.047
ates-0.043
kins-0.043
Token me
Token ID502
Feature activation+7.52e+00

Pos logit contributions
Listen+0.094
Um+0.094
 Curious+0.093
Well+0.092
 Everything+0.092
Neg logit contributions
ies-0.091
ins-0.089
cer-0.089
--0.080
ishing-0.075
Token?"
Token ID1701
Feature activation+2.00e+01

Pos logit contributions
 Curious+0.068
Listen+0.067
Um+0.067
Well+0.066
 On+0.065
Neg logit contributions
ins-0.045
cer-0.042
ies-0.042
kins-0.036
ates-0.035
Token Spot
Token ID15899
Feature activation+9.48e+00

Pos logit contributions
Um+0.151
Listen+0.150
Well+0.149
 Curious+0.145
Over+0.142
Neg logit contributions
--0.147
ies-0.145
ins-0.141
cer-0.137
ie-0.128
Token spun
Token ID26843
Feature activation+1.18e+01

Pos logit contributions
Listen+0.072
Well+0.071
Um+0.069
Over+0.068
Hmm+0.068
Neg logit contributions
--0.074
ies-0.074
ins-0.068
cer-0.066
ie-0.065
Token around
Token ID1088
Feature activation+1.27e+01

Pos logit contributions
 Curious+0.112
Well+0.112
Listen+0.110
Um+0.110
 Everything+0.107
Neg logit contributions
ies-0.068
ins-0.068
cer-0.064
--0.062
ie-0.055
Token and
Token ID290
Feature activation+1.71e+01

Pos logit contributions
 Curious+0.120
Well+0.118
Listen+0.114
 �+0.112
Um+0.112
Neg logit contributions
ins-0.081
ies-0.078
cer-0.074
ates-0.066
ishing-0.066
Token around
Token ID1088
Feature activation+1.24e+01

Pos logit contributions
Well+0.125
Listen+0.124
Um+0.123
 Curious+0.122
Over+0.117
Neg logit contributions
ies-0.132
--0.129
ins-0.128
cer-0.126
ie-0.116
Tokenbled
Token ID9342
Feature activation+2.94e+00

Pos logit contributions
Over+0.041
Well+0.040
Listen+0.040
 Curious+0.038
Finally+0.036
Neg logit contributions
cer-0.047
--0.047
ies-0.047
ins-0.046
er-0.044
Token on
Token ID319
Feature activation+8.55e+00

Pos logit contributions
Well+0.024
 Curious+0.023
Listen+0.023
 Everything+0.023
Um+0.022
Neg logit contributions
ins-0.020
cer-0.019
ies-0.019
--0.017
kin-0.017
Token the
Token ID262
Feature activation+1.01e+01

Pos logit contributions
 Curious+0.065
Well+0.064
Listen+0.063
Um+0.062
 Everything+0.062
Neg logit contributions
ins-0.066
ies-0.065
cer-0.061
--0.061
ie-0.056
Token cherry
Token ID23612
Feature activation-6.20e-01

Pos logit contributions
Well+0.074
 Curious+0.073
Listen+0.073
Um+0.073
 Everything+0.070
Neg logit contributions
ies-0.079
--0.078
ins-0.077
cer-0.074
ie-0.069
Token together
Token ID1978
Feature activation+8.50e+00

Pos logit contributions
ins+0.004
cer+0.004
ates+0.004
ies+0.004
ishing+0.004
Neg logit contributions
Well-0.005
Over-0.005
 Unfortunately-0.004
 �-0.004
On-0.004
Token.
Token ID13
Feature activation+2.00e+01

Pos logit contributions
 �+0.083
Well+0.079
 Everywhere+0.078
 Curious+0.078
 On+0.078
Neg logit contributions
ins-0.045
ies-0.041
cer-0.039
--0.034
ishing-0.034
Token It
Token ID632
Feature activation+1.27e+01

Pos logit contributions
Um+0.166
Listen+0.157
Hmm+0.149
Well+0.147
Over+0.146
Neg logit contributions
--0.133
."-0.125
er-0.124
ies-0.122
cer-0.120
Token was
Token ID373
Feature activation+1.44e+01

Pos logit contributions
 Curious+0.097
Listen+0.097
Well+0.097
Um+0.095
 Everything+0.093
Neg logit contributions
ies-0.096
--0.094
ins-0.093
cer-0.091
ie-0.085
Token a
Token ID257
Feature activation+1.12e+01

Pos logit contributions
Well+0.101
 Curious+0.101
Listen+0.098
Um+0.098
 Everywhere+0.096
Neg logit contributions
ins-0.116
ies-0.115
cer-0.112
--0.110
ishing-0.100
Token very
Token ID845
Feature activation+8.23e+00

Pos logit contributions
Well+0.085
Listen+0.084
 Curious+0.083
 Everything+0.082
Um+0.082
Neg logit contributions
--0.086
ies-0.085
cer-0.081
ins-0.077
kin-0.072
Token tasty
Token ID25103
Feature activation+5.64e+00

Pos logit contributions
Well+0.058
 Curious+0.057
Listen+0.057
Um+0.057
 Everything+0.055
Neg logit contributions
ies-0.068
--0.066
ins-0.066
cer-0.064
ie-0.059
Token together
Token ID1978
Feature activation+8.48e+00

Pos logit contributions
Well+0.066
Listen+0.066
 �+0.064
 Everywhere+0.064
 Curious+0.063
Neg logit contributions
ins-0.055
ies-0.055
cer-0.054
ie-0.047
--0.046
Token and
Token ID290
Feature activation+1.44e+01

Pos logit contributions
Well+0.080
 �+0.078
 On+0.078
On+0.078
 Curious+0.078
Neg logit contributions
ins-0.045
ies-0.044
cer-0.043
--0.037
ishing-0.035
Token share
Token ID2648
Feature activation+8.73e+00

Pos logit contributions
Um+0.080
 Curious+0.080
Well+0.079
 Everything+0.078
 Something+0.077
Neg logit contributions
ins-0.142
ies-0.132
--0.131
cer-0.129
ishing-0.125
Token their
Token ID511
Feature activation+7.25e+00

Pos logit contributions
 Something+0.050
On+0.050
That+0.048
 Curious+0.047
 �+0.047
Neg logit contributions
cer-0.075
ins-0.072
ishing-0.071
ies-0.071
ab-0.070
Token sand
Token ID6450
Feature activation-3.80e+00

Pos logit contributions
 Curious+0.068
Well+0.068
Um+0.067
Listen+0.067
 Everything+0.066
Neg logit contributions
ins-0.039
ies-0.039
cer-0.039
--0.037
ie-0.033
Token.
Token ID13
Feature activation+2.00e+01

Pos logit contributions
cer+0.027
ins+0.024
ishing+0.022
ile+0.021
ab+0.021
Neg logit contributions
On-0.026
 Curious-0.025
Inside-0.025
 On-0.024
Over-0.024
Token Tom
Token ID4186
Feature activation+1.01e+01

Pos logit contributions
Um+0.134
Listen+0.129
Well+0.128
 Curious+0.127
Hmm+0.125
Neg logit contributions
ies-0.160
ins-0.160
--0.160
cer-0.154
er-0.141
Token agreed
Token ID4987
Feature activation+1.27e+01

Pos logit contributions
Listen+0.076
Well+0.074
Um+0.074
 Curious+0.074
 Everything+0.071
Neg logit contributions
ies-0.080
--0.079
ins-0.078
cer-0.075
ie-0.070
Token and
Token ID290
Feature activation+1.61e+01

Pos logit contributions
Well+0.138
 Curious+0.136
Listen+0.135
Um+0.134
 Everything+0.131
Neg logit contributions
ies-0.055
ins-0.054
cer-0.052
--0.050
ie-0.042
Token they
Token ID484
Feature activation+1.41e+01

Pos logit contributions
Well+0.091
 Curious+0.089
Um+0.085
Listen+0.085
 Everything+0.083
Neg logit contributions
ies-0.152
ins-0.152
cer-0.149
--0.146
ie-0.138
Token started
Token ID2067
Feature activation+1.26e+01

Pos logit contributions
Well+0.095
Listen+0.095
 Curious+0.095
Um+0.094
 Everything+0.090
Neg logit contributions
ies-0.119
ins-0.116
--0.115
cer-0.114
ie-0.106
Token and
Token ID290
Feature activation+1.45e+01

Pos logit contributions
Well+0.066
That+0.063
 Curious+0.062
 Unfortunately+0.061
 �+0.061
Neg logit contributions
ins-0.059
ies-0.056
ishing-0.051
cer-0.048
ie-0.048
Token played
Token ID2826
Feature activation+1.12e+01

Pos logit contributions
Well+0.076
Listen+0.073
Um+0.070
 Curious+0.070
 Everything+0.068
Neg logit contributions
ins-0.147
ies-0.140
cer-0.139
--0.137
ishing-0.134
Token together
Token ID1978
Feature activation+9.09e+00

Pos logit contributions
Over+0.086
 Curious+0.086
 Everything+0.083
That+0.083
 Unfortunately+0.083
Neg logit contributions
ies-0.081
ins-0.076
ie-0.072
cer-0.069
ater-0.064
Token every
Token ID790
Feature activation+8.80e+00

Pos logit contributions
 One+0.061
 �+0.060
On+0.060
Well+0.058
Over+0.058
Neg logit contributions
ins-0.078
ies-0.072
akers-0.066
cer-0.065
ishing-0.064
Token day
Token ID1110
Feature activation+8.88e+00

Pos logit contributions
Listen+0.068
Well+0.067
 Curious+0.067
Over+0.064
Um+0.064
Neg logit contributions
ins-0.070
ies-0.067
cer-0.062
ates-0.059
--0.059
Token.
Token ID13
Feature activation+2.00e+01

Pos logit contributions
Well+0.087
 Unfortunately+0.085
 Curious+0.084
 On+0.084
 �+0.084
Neg logit contributions
ins-0.046
ies-0.042
cer-0.039
ishing-0.036
ates-0.034
Token<|endoftext|>
Token ID50256
Feature activation+1.34e+01

Pos logit contributions
Um+0.133
 Curious+0.128
Listen+0.126
Hmm+0.123
 daunting+0.122
Neg logit contributions
--0.166
ies-0.161
ins-0.159
cer-0.151
ie-0.146
TokenOnce
Token ID7454
Feature activation+1.24e+01

Pos logit contributions
Well+0.045
 Curious+0.045
Listen+0.043
 Everywhere+0.040
 Everything+0.040
Neg logit contributions
ies-0.165
ins-0.160
cer-0.155
--0.152
ishing-0.152
Token upon
Token ID2402
Feature activation+1.45e+01

Pos logit contributions
Well+0.077
 Curious+0.075
Listen+0.075
Over+0.073
Um+0.072
Neg logit contributions
ins-0.108
cer-0.108
ies-0.102
--0.102
ates-0.099
Token a
Token ID257
Feature activation+1.21e+01

Pos logit contributions
 Curious+0.091
Over+0.088
Listen+0.086
 Something+0.082
Well+0.081
Neg logit contributions
ies-0.139
ins-0.138
ishing-0.133
cer-0.131
--0.124
Token time
Token ID640
Feature activation+1.33e+01

Pos logit contributions
 Curious+0.081
Well+0.079
Listen+0.079
In+0.078
 On+0.078
Neg logit contributions
ins-0.103
ies-0.100
cer-0.098
er-0.092
ates-0.090
Token to
Token ID284
Feature activation+1.20e+01

Pos logit contributions
Well+0.092
 Curious+0.090
Um+0.089
Listen+0.088
Over+0.085
Neg logit contributions
ies-0.068
--0.068
ins-0.067
cer-0.063
ie-0.058
Token enter
Token ID3802
Feature activation+8.21e+00

Pos logit contributions
Well+0.092
 Curious+0.089
Listen+0.089
Um+0.089
Over+0.084
Neg logit contributions
ies-0.091
--0.090
ins-0.089
cer-0.085
ie-0.080
Token a
Token ID257
Feature activation+8.80e+00

Pos logit contributions
Listen+0.053
 Curious+0.053
 Everywhere+0.053
Well+0.052
 On+0.052
Neg logit contributions
ins-0.070
ies-0.069
cer-0.066
--0.060
ab-0.058
Token cooking
Token ID10801
Feature activation+3.40e+00

Pos logit contributions
Well+0.073
 Curious+0.070
Listen+0.070
Um+0.070
 Everything+0.067
Neg logit contributions
--0.063
ies-0.062
ins-0.060
cer-0.058
ie-0.054
Token competition
Token ID5449
Feature activation+5.22e+00

Pos logit contributions
 On+0.026
 Everything+0.026
Listen+0.026
 Curious+0.025
Well+0.025
Neg logit contributions
ies-0.024
cer-0.023
ins-0.022
ishing-0.019
ie-0.019
Token.
Token ID13
Feature activation+1.99e+01

Pos logit contributions
 On+0.047
On+0.047
 One+0.046
 Who+0.045
Um+0.045
Neg logit contributions
ins-0.030
ies-0.027
cer-0.026
ishing-0.024
kin-0.024
Token Tim
Token ID5045
Feature activation+1.01e+01

Pos logit contributions
Um+0.162
Listen+0.157
Well+0.154
Over+0.150
Hmm+0.149
Neg logit contributions
--0.139
ies-0.134
ins-0.132
cer-0.123
ie-0.120
Tokenmy
Token ID1820
Feature activation+9.75e+00

Pos logit contributions
Well+0.087
 Curious+0.086
Listen+0.084
Um+0.084
 Everything+0.081
Neg logit contributions
ins-0.064
ies-0.062
cer-0.062
--0.059
ates-0.054
Token was
Token ID373
Feature activation+1.40e+01

Pos logit contributions
Listen+0.079
Um+0.077
 Curious+0.077
Well+0.077
 Everywhere+0.077
Neg logit contributions
--0.074
ies-0.071
ins-0.067
cer-0.062
ie-0.062
Token so
Token ID523
Feature activation+1.48e+01

Pos logit contributions
Well+0.098
Listen+0.096
Um+0.096
 Curious+0.095
 Everything+0.090
Neg logit contributions
ies-0.114
ins-0.113
--0.112
cer-0.109
ie-0.099
Token excited
Token ID6568
Feature activation+1.00e+01

Pos logit contributions
Um+0.107
Well+0.107
Listen+0.107
 Curious+0.103
 Everything+0.102
Neg logit contributions
ies-0.119
--0.119
ins-0.119
cer-0.109
ie-0.100
Token to
Token ID284
Feature activation+1.26e+01

Pos logit contributions
Well+0.100
 Curious+0.099
Um+0.098
Listen+0.098
 Everything+0.094
Neg logit contributions
ies-0.089
ins-0.087
--0.086
cer-0.084
ie-0.076
Token play
Token ID711
Feature activation+8.82e+00

Pos logit contributions
Well+0.087
 Curious+0.085
Listen+0.084
Um+0.084
 Everything+0.081
Neg logit contributions
ies-0.105
ins-0.103
--0.102
cer-0.101
ie-0.092
Token the
Token ID262
Feature activation+9.93e+00

Pos logit contributions
 Curious+0.073
Listen+0.071
Well+0.071
 Everything+0.071
That+0.070
Neg logit contributions
ies-0.060
ins-0.058
cer-0.053
ishing-0.049
akers-0.049
Token trumpet
Token ID40334
Feature activation+1.92e+00

Pos logit contributions
Well+0.079
 Curious+0.078
Listen+0.078
Um+0.074
Over+0.074
Neg logit contributions
ies-0.073
ins-0.071
cer-0.067
--0.066
ie-0.061
Token too
Token ID1165
Feature activation+7.75e+00

Pos logit contributions
Well+0.018
 Curious+0.018
Listen+0.018
 Everything+0.018
Um+0.018
Neg logit contributions
ins-0.010
ies-0.010
cer-0.010
--0.009
ishing-0.008
Token.
Token ID13
Feature activation+1.99e+01

Pos logit contributions
Well+0.095
 Curious+0.094
Listen+0.094
Um+0.094
 Everything+0.091
Neg logit contributions
ies-0.023
ins-0.022
--0.021
cer-0.020
ie-0.015
Token One
Token ID1881
Feature activation+1.36e+01

Pos logit contributions
Um+0.108
Listen+0.102
Hmm+0.095
Well+0.094
Over+0.094
Neg logit contributions
--0.186
ies-0.181
ins-0.180
er-0.176
cer-0.168
Token day
Token ID1110
Feature activation+1.44e+01

Pos logit contributions
Well+0.122
 Curious+0.120
Listen+0.120
Um+0.119
 Everything+0.115
Neg logit contributions
ies-0.086
ins-0.085
cer-0.081
--0.080
ie-0.070
Token,
Token ID11
Feature activation+1.67e+01

Pos logit contributions
Well+0.170
 Curious+0.168
Listen+0.167
Um+0.167
 Everything+0.162
Neg logit contributions
ies-0.049
ins-0.046
--0.045
cer-0.043
ie-0.034
Token Ellie
Token ID44801
Feature activation+8.19e+00

Pos logit contributions
Well+0.123
Um+0.123
Listen+0.121
 Curious+0.119
 Everything+0.115
Neg logit contributions
ies-0.127
--0.126
cer-0.118
ins-0.117
ie-0.112
Token forgot
Token ID16453
Feature activation+1.40e+01

Pos logit contributions
Well+0.057
 Curious+0.057
Listen+0.056
Um+0.055
Over+0.053
Neg logit contributions
ins-0.065
ies-0.065
cer-0.062
--0.062
ates-0.058
Token the
Token ID262
Feature activation+1.13e+01

Pos logit contributions
Well+0.128
 Curious+0.126
Listen+0.125
Um+0.125
 Everything+0.120
Neg logit contributions
ies-0.111
ins-0.109
--0.106
cer-0.105
ie-0.094
Token sun
Token ID4252
Feature activation+5.09e+00

Pos logit contributions
Well+0.093
Um+0.093
 Curious+0.092
Listen+0.090
Hmm+0.087
Neg logit contributions
ies-0.079
--0.077
ins-0.075
cer-0.073
ie-0.066
Token started
Token ID2067
Feature activation+9.37e+00

Pos logit contributions
Well+0.027
 Curious+0.026
Listen+0.026
Over+0.025
 Everything+0.024
Neg logit contributions
ins-0.050
ies-0.049
cer-0.048
--0.046
ates-0.045
Token to
Token ID284
Feature activation+1.27e+01

Pos logit contributions
Well+0.072
 Curious+0.071
Listen+0.071
Um+0.070
 Everything+0.068
Neg logit contributions
ins-0.069
ies-0.069
cer-0.067
--0.066
ie-0.059
Token set
Token ID900
Feature activation+8.37e+00

Pos logit contributions
Well+0.099
 Curious+0.098
Listen+0.097
Um+0.097
 Everything+0.093
Neg logit contributions
ies-0.094
ins-0.092
--0.090
cer-0.089
ie-0.081
Token.
Token ID13
Feature activation+1.99e+01

Pos logit contributions
 �+0.075
 On+0.072
 Curious+0.072
On+0.072
+0.071
Neg logit contributions
ins-0.053
ies-0.042
ates-0.040
cer-0.039
ebra-0.038
Token
Token ID198
Feature activation+1.71e+01

Pos logit contributions
Um+0.140
Listen+0.130
Hmm+0.129
 daunting+0.126
Well+0.122
Neg logit contributions
--0.150
."-0.143
ies-0.142
ins-0.142
cer-0.140
Token
Token ID198
Feature activation+1.74e+01

Pos logit contributions
Um+0.109
Well+0.106
 Curious+0.106
Listen+0.104
Hmm+0.101
Neg logit contributions
--0.148
ies-0.144
ins-0.143
cer-0.136
ie-0.133
TokenAs
Token ID1722
Feature activation+1.41e+01

Pos logit contributions
 Curious+0.100
Um+0.099
 Everything+0.096
Well+0.091
 There+0.090
Neg logit contributions
--0.150
ep-0.146
ins-0.143
cer-0.143
e-0.141
Token they
Token ID484
Feature activation+1.23e+01

Pos logit contributions
Well+0.092
Listen+0.090
 Curious+0.090
Um+0.089
 Everything+0.085
Neg logit contributions
ins-0.123
ies-0.122
cer-0.118
--0.118
ie-0.108
Token were
Token ID547
Feature activation+1.21e+01

Pos logit contributions
 Curious+0.079
Um+0.078
Listen+0.077
Well+0.077
 Everything+0.073
Neg logit contributions
ies-0.108
--0.106
ins-0.105
cer-0.104
ie-0.099
Token swim
Token ID9422
Feature activation+6.15e+00

Pos logit contributions
 Curious+0.069
Listen+0.069
Well+0.068
Um+0.068
 Everything+0.067
Neg logit contributions
ins-0.110
ies-0.108
cer-0.104
--0.104
ishing-0.096
Token with
Token ID351
Feature activation+1.20e+01

Pos logit contributions
 Curious+0.060
Well+0.059
Listen+0.059
Um+0.058
 Unfortunately+0.058
Neg logit contributions
ins-0.031
ies-0.027
ates-0.025
cer-0.024
--0.023
Token you
Token ID345
Feature activation+1.05e+01

Pos logit contributions
Listen+0.089
Um+0.088
 Curious+0.088
 Everything+0.088
Well+0.086
Neg logit contributions
ies-0.088
cer-0.088
ins-0.087
--0.081
ishing-0.076
Token,
Token ID11
Feature activation+1.38e+01

Pos logit contributions
 Curious+0.104
Um+0.101
 Everywhere+0.099
Well+0.099
 Unfortunately+0.098
Neg logit contributions
ins-0.050
cer-0.047
ies-0.043
--0.042
ishing-0.041
Token duck
Token ID22045
Feature activation+7.20e+00

Pos logit contributions
Well+0.105
 Curious+0.104
Listen+0.103
Um+0.102
 Everything+0.099
Neg logit contributions
ies-0.104
ins-0.103
--0.101
cer-0.100
ie-0.090
Token!"
Token ID2474
Feature activation+1.98e+01

Pos logit contributions
 Curious+0.069
Well+0.066
That+0.065
Um+0.065
 Sadly+0.065
Neg logit contributions
ates-0.028
ins-0.027
els-0.023
cer-0.023
rob-0.023
Token And
Token ID843
Feature activation+1.48e+01

Pos logit contributions
Um+0.109
Listen+0.108
Well+0.108
 Curious+0.106
Over+0.101
Neg logit contributions
ies-0.190
--0.188
ins-0.188
cer-0.180
ie-0.170
Token they
Token ID484
Feature activation+1.39e+01

Pos logit contributions
Well+0.116
 Curious+0.115
Listen+0.113
Um+0.113
 Everything+0.109
Neg logit contributions
ies-0.108
ins-0.107
--0.104
cer-0.104
ie-0.093
Token sw
Token ID1509
Feature activation+3.89e+00

Pos logit contributions
Well+0.102
 Curious+0.102
Um+0.101
Listen+0.101
 Everything+0.096
Neg logit contributions
ies-0.109
ins-0.105
--0.105
cer-0.104
ie-0.095
Tokenam
Token ID321
Feature activation+1.13e+01

Pos logit contributions
 Curious+0.031
Well+0.031
Listen+0.031
Um+0.030
Over+0.030
Neg logit contributions
ins-0.025
ies-0.025
cer-0.024
--0.023
ie-0.021
Token together
Token ID1978
Feature activation+1.15e+01

Pos logit contributions
 Curious+0.099
Well+0.097
Listen+0.096
 Unfortunately+0.093
 Everything+0.093
Neg logit contributions
ins-0.073
ies-0.068
cer-0.066
--0.061
ates-0.061
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
Well+0.114
Um+0.114
Listen+0.112
 Curious+0.112
Over+0.110
Neg logit contributions
ies-0.055
--0.055
ins-0.053
cer-0.048
er-0.043
Token they
Token ID484
Feature activation+1.34e+01

Pos logit contributions
Well+0.101
 Curious+0.100
Listen+0.099
Um+0.098
 Everything+0.094
Neg logit contributions
ies-0.129
ins-0.127
--0.125
cer-0.123
ie-0.113
Token took
Token ID1718
Feature activation+1.17e+01

Pos logit contributions
 Curious+0.093
Well+0.091
Um+0.091
Listen+0.091
 Everything+0.087
Neg logit contributions
ies-0.113
--0.108
ins-0.107
cer-0.105
ie-0.102
Token a
Token ID257
Feature activation+1.07e+01

Pos logit contributions
Listen+0.083
 Everything+0.083
 Curious+0.082
Well+0.082
Um+0.079
Neg logit contributions
ins-0.096
ies-0.095
cer-0.090
--0.085
ates-0.081
Token nap
Token ID25422
Feature activation+7.32e+00

Pos logit contributions
Well+0.093
Um+0.091
Listen+0.089
 Everything+0.088
 Curious+0.088
Neg logit contributions
--0.074
ies-0.073
ins-0.065
cer-0.064
ie-0.062
Token.
Token ID13
Feature activation+1.98e+01

Pos logit contributions
Well+0.049
Unfortunately+0.048
 Unfortunately+0.048
On+0.048
Luckily+0.048
Neg logit contributions
ins-0.045
cks-0.044
els-0.043
orns-0.042
cer-0.041
Token They
Token ID1119
Feature activation+1.51e+01

Pos logit contributions
Um+0.130
Well+0.122
Listen+0.121
Hmm+0.121
Over+0.117
Neg logit contributions
--0.160
ies-0.158
ins-0.156
."-0.147
!"-0.146
Token were
Token ID547
Feature activation+1.35e+01

Pos logit contributions
 Curious+0.101
Well+0.100
Um+0.100
Listen+0.099
 Everything+0.095
Neg logit contributions
ies-0.130
--0.124
ins-0.123
cer-0.122
ie-0.118
Token happy
Token ID3772
Feature activation+1.33e+01

Pos logit contributions
 Curious+0.071
Well+0.070
 Everything+0.067
Listen+0.067
Over+0.066
Neg logit contributions
ins-0.128
cer-0.127
ies-0.124
--0.122
ater-0.118
Token and
Token ID290
Feature activation+1.63e+01

Pos logit contributions
Well+0.109
Listen+0.104
 Curious+0.103
Um+0.101
 Everything+0.101
Neg logit contributions
ins-0.094
cer-0.092
ies-0.091
--0.085
ates-0.084
Token tired
Token ID10032
Feature activation+1.15e+01

Pos logit contributions
Well+0.102
 Curious+0.100
Listen+0.099
Um+0.099
 Everything+0.094
Neg logit contributions
ies-0.145
ins-0.144
--0.140
cer-0.139
ie-0.128
Token you
Token ID345
Feature activation+8.27e+00

Pos logit contributions
Well+0.093
Um+0.093
Listen+0.092
 Everything+0.090
 Curious+0.090
Neg logit contributions
ies-0.077
cer-0.077
ins-0.075
--0.071
ishing-0.065
Token.
Token ID13
Feature activation+1.56e+01

Pos logit contributions
 Curious+0.087
Um+0.084
Well+0.084
Listen+0.084
 Everything+0.084
Neg logit contributions
cer-0.035
ins-0.035
ies-0.032
ishing-0.029
--0.029
Token Come
Token ID7911
Feature activation+1.21e+01

Pos logit contributions
Well+0.096
Listen+0.096
Um+0.095
 Curious+0.094
Over+0.092
Neg logit contributions
--0.130
ins-0.129
ies-0.128
cer-0.127
er-0.118
Token with
Token ID351
Feature activation+1.34e+01

Pos logit contributions
Well+0.126
 Curious+0.125
Listen+0.124
Um+0.123
 Everything+0.120
Neg logit contributions
ies-0.057
ins-0.057
cer-0.053
--0.052
ie-0.045
Token me
Token ID502
Feature activation+8.91e+00

Pos logit contributions
Listen+0.099
 Curious+0.095
Um+0.094
Well+0.094
 Everything+0.093
Neg logit contributions
ins-0.106
ies-0.104
cer-0.104
--0.093
ates-0.090
Token."
Token ID526
Feature activation+1.98e+01

Pos logit contributions
 Curious+0.085
Well+0.085
 Unfortunately+0.084
 Everything+0.083
Listen+0.083
Neg logit contributions
ins-0.047
ies-0.045
cer-0.044
--0.037
ishing-0.037
Token
Token ID198
Feature activation+1.77e+01

Pos logit contributions
Um+0.144
Listen+0.140
 daunting+0.131
Over+0.129
Hmm+0.127
Neg logit contributions
--0.154
ins-0.149
ies-0.148
cer-0.140
.-0.134
Token
Token ID198
Feature activation+1.78e+01

Pos logit contributions
Um+0.136
 daunting+0.131
Listen+0.127
 Curious+0.126
Hmm+0.126
Neg logit contributions
--0.132
ins-0.127
ies-0.122
!"-0.119
cer-0.114
TokenThe
Token ID464
Feature activation+1.10e+01

Pos logit contributions
 Everything+0.125
 There+0.125
 daunting+0.122
 Curious+0.119
Um+0.115
Neg logit contributions
ins-0.134
--0.134
e-0.126
ep-0.125
cer-0.122
Token daughter
Token ID4957
Feature activation+7.84e+00

Pos logit contributions
Well+0.087
Listen+0.086
 Curious+0.085
 Everything+0.084
Um+0.084
Neg logit contributions
ies-0.080
--0.078
ins-0.074
cer-0.072
ie-0.068
Token was
Token ID373
Feature activation+1.41e+01

Pos logit contributions
Listen+0.063
Um+0.062
Well+0.062
 Curious+0.060
Hmm+0.059
Neg logit contributions
--0.059
ies-0.057
ins-0.053
cer-0.052
er-0.049
Token I
Token ID314
Feature activation+1.39e+01

Pos logit contributions
Well+0.094
Listen+0.094
Um+0.093
 Curious+0.093
Over+0.088
Neg logit contributions
ies-0.124
ins-0.124
--0.124
cer-0.122
ie-0.111
Token had
Token ID550
Feature activation+1.22e+01

Pos logit contributions
Well+0.086
 Curious+0.083
Um+0.083
Listen+0.083
 Everything+0.078
Neg logit contributions
ins-0.125
ies-0.124
cer-0.120
--0.120
ishing-0.111
Token so
Token ID523
Feature activation+1.14e+01

Pos logit contributions
Well+0.087
 Curious+0.086
Um+0.085
Listen+0.085
 Everything+0.081
Neg logit contributions
ies-0.098
ins-0.097
cer-0.094
--0.093
ie-0.085
Token much
Token ID881
Feature activation+4.25e+00

Pos logit contributions
Well+0.057
 Curious+0.055
Listen+0.054
Um+0.054
Over+0.050
Neg logit contributions
ies-0.116
ins-0.115
cer-0.114
--0.112
ie-0.105
Token fun
Token ID1257
Feature activation+7.36e+00

Pos logit contributions
Well+0.024
 Curious+0.024
Listen+0.024
Um+0.023
 Everything+0.022
Neg logit contributions
cer-0.041
ies-0.041
ins-0.040
--0.038
ie-0.037
Token."
Token ID526
Feature activation+1.98e+01

Pos logit contributions
Well+0.064
 Unfortunately+0.060
 Curious+0.059
 Who+0.059
 All+0.059
Neg logit contributions
cer-0.040
ins-0.038
ies-0.037
ile-0.035
kin-0.035
Token Max
Token ID5436
Feature activation+9.20e+00

Pos logit contributions
Um+0.152
Well+0.150
Listen+0.149
 Curious+0.147
Hmm+0.141
Neg logit contributions
ies-0.146
--0.144
ins-0.140
cer-0.136
ie-0.126
Token replied
Token ID8712
Feature activation+1.42e+01

Pos logit contributions
Listen+0.078
Um+0.077
Well+0.075
Hmm+0.074
 Curious+0.074
Neg logit contributions
--0.066
ies-0.065
ins-0.057
cer-0.057
ie-0.056
Token,
Token ID11
Feature activation+1.27e+01

Pos logit contributions
 Curious+0.150
Well+0.148
Listen+0.147
 Everything+0.145
Um+0.144
Neg logit contributions
ins-0.065
cer-0.063
ies-0.063
ates-0.052
--0.052
Token "
Token ID366
Feature activation+1.81e+01

Pos logit contributions
Um+0.091
Hmm+0.088
Listen+0.087
 daunting+0.087
 Everywhere+0.085
Neg logit contributions
--0.102
ies-0.098
ins-0.094
er-0.089
cer-0.086
TokenMe
Token ID5308
Feature activation+9.44e+00

Pos logit contributions
 Curious+0.074
Well+0.074
Um+0.072
Listen+0.072
 Everything+0.067
Neg logit contributions
ies-0.197
ins-0.196
--0.194
cer-0.193
ie-0.179
Token "
Token ID366
Feature activation+1.74e+01

Pos logit contributions
Um+0.072
Well+0.071
Listen+0.071
 Curious+0.069
Hmm+0.068
Neg logit contributions
--0.095
ies-0.093
ins-0.093
er-0.084
cer-0.084
TokenYes
Token ID5297
Feature activation+1.03e+01

Pos logit contributions
 Everything+0.095
 Curious+0.091
rolling+0.089
 devastated+0.086
 There+0.085
Neg logit contributions
--0.161
cer-0.148
ins-0.147
er-0.143
 than-0.141
Token,
Token ID11
Feature activation+1.06e+01

Pos logit contributions
 Curious+0.097
Well+0.097
 Everything+0.096
Um+0.095
Listen+0.095
Neg logit contributions
ins-0.058
cer-0.056
ates-0.049
ies-0.049
ishing-0.044
Token you
Token ID345
Feature activation+1.42e+01

Pos logit contributions
Well+0.079
Listen+0.078
 Curious+0.077
Um+0.077
 Everything+0.074
Neg logit contributions
ins-0.081
cer-0.080
ies-0.079
--0.076
ates-0.072
Token can
Token ID460
Feature activation+1.16e+01

Pos logit contributions
Well+0.122
 Curious+0.119
Listen+0.119
Um+0.118
 Unfortunately+0.110
Neg logit contributions
ies-0.098
--0.098
cer-0.089
ins-0.087
!"-0.083
Token."
Token ID526
Feature activation+1.98e+01

Pos logit contributions
Well+0.097
 Curious+0.096
Listen+0.096
Um+0.096
 Everything+0.091
Neg logit contributions
--0.080
ies-0.080
ins-0.076
cer-0.075
ie-0.068
Token Lily
Token ID20037
Feature activation+1.06e+01

Pos logit contributions
Well+0.118
 Curious+0.116
Listen+0.115
Um+0.115
 Everything+0.108
Neg logit contributions
ies-0.183
ins-0.181
--0.178
cer-0.176
ie-0.163
Token put
Token ID1234
Feature activation+1.16e+01

Pos logit contributions
Well+0.079
Listen+0.078
 Curious+0.078
Um+0.078
 Everything+0.074
Neg logit contributions
ies-0.082
ins-0.081
--0.080
cer-0.078
ie-0.071
Token the
Token ID262
Feature activation+1.23e+01

Pos logit contributions
Well+0.097
Listen+0.094
 Curious+0.094
 Everything+0.092
Um+0.090
Neg logit contributions
ins-0.080
ies-0.079
cer-0.077
--0.070
ater-0.066
Token wrap
Token ID14441
Feature activation+3.81e+00

Pos logit contributions
Well+0.103
 Curious+0.102
Listen+0.101
Um+0.101
 Everything+0.097
Neg logit contributions
ies-0.084
ins-0.083
--0.080
cer-0.080
ie-0.071
Token on
Token ID319
Feature activation+1.07e+01

Pos logit contributions
Well+0.035
 Curious+0.035
 Everywhere+0.035
Over+0.034
Listen+0.034
Neg logit contributions
ins-0.024
ies-0.020
cer-0.020
akers-0.017
ishing-0.017
Token a
Token ID257
Feature activation+8.37e+00

Pos logit contributions
Well+0.082
 Everything+0.081
 Curious+0.080
Um+0.080
Listen+0.080
Neg logit contributions
ies-0.085
ins-0.085
cer-0.085
--0.082
ishing-0.077
Token flower
Token ID15061
Feature activation+1.34e+00

Pos logit contributions
Well+0.061
 Curious+0.060
Um+0.060
Listen+0.059
 Everything+0.056
Neg logit contributions
ins-0.065
ies-0.065
cer-0.062
--0.061
ishing-0.057
Token with
Token ID351
Feature activation+1.08e+01

Pos logit contributions
Well+0.012
 �+0.012
 On+0.012
Um+0.012
 One+0.012
Neg logit contributions
ins-0.007
cer-0.006
ies-0.006
ishing-0.006
ates-0.005
Token her
Token ID607
Feature activation+7.24e+00

Pos logit contributions
Well+0.080
 Curious+0.079
Um+0.079
Listen+0.078
 Everything+0.075
Neg logit contributions
ies-0.083
ins-0.082
cer-0.080
--0.079
ie-0.072
Token pen
Token ID3112
Feature activation+8.37e-01

Pos logit contributions
Well+0.059
Um+0.059
 Curious+0.059
 Something+0.056
 Who+0.055
Neg logit contributions
ins-0.049
cer-0.046
ies-0.044
--0.041
ie-0.040
Token.
Token ID13
Feature activation+1.98e+01

Pos logit contributions
 Curious+0.008
Well+0.008
 �+0.008
 On+0.008
 One+0.008
Neg logit contributions
ins-0.004
cer-0.004
ies-0.004
ishing-0.003
--0.003
Token Her
Token ID2332
Feature activation+1.18e+01

Pos logit contributions
Um+0.114
Listen+0.111
Well+0.107
Hmm+0.105
Over+0.101
Neg logit contributions
ies-0.182
ins-0.179
--0.178
cer-0.169
ie-0.164
Token friend
Token ID1545
Feature activation+9.80e+00

Pos logit contributions
 Curious+0.109
Well+0.108
Um+0.108
Listen+0.106
 Everything+0.103
Neg logit contributions
ins-0.069
cer-0.067
ies-0.066
--0.062
ie-0.056
Token liked
Token ID8288
Feature activation+1.40e+01

Pos logit contributions
Listen+0.076
Well+0.076
Um+0.074
 Curious+0.074
Hmm+0.072
Neg logit contributions
ies-0.076
--0.074
ins-0.071
ie-0.069
cer-0.069
Token it
Token ID340
Feature activation+8.76e+00

Pos logit contributions
Well+0.107
Listen+0.105
 Curious+0.105
Um+0.104
 Everything+0.101
Neg logit contributions
cer-0.104
ies-0.103
ins-0.103
--0.098
ishing-0.090
Token very
Token ID845
Feature activation+1.05e+01

Pos logit contributions
Well+0.075
 Curious+0.075
Listen+0.073
Um+0.073
 Everything+0.071
Neg logit contributions
ins-0.057
ies-0.057
cer-0.054
--0.053
ishing-0.048
Token folder
Token ID9483
Feature activation+7.33e-01

Pos logit contributions
Well+0.088
 Curious+0.086
Um+0.086
Listen+0.086
 Everything+0.082
Neg logit contributions
ies-0.069
ins-0.067
--0.066
cer-0.064
ie-0.057
Token away
Token ID1497
Feature activation+9.19e+00

Pos logit contributions
 Curious+0.006
Well+0.006
Listen+0.006
Over+0.006
 Everywhere+0.006
Neg logit contributions
ins-0.005
ies-0.005
cer-0.005
--0.004
ie-0.004
Token and
Token ID290
Feature activation+1.43e+01

Pos logit contributions
Well+0.088
 Curious+0.087
Listen+0.086
Um+0.085
 Everywhere+0.084
Neg logit contributions
ins-0.050
ies-0.046
cer-0.045
--0.042
ishing-0.039
Token played
Token ID2826
Feature activation+1.15e+01

Pos logit contributions
Well+0.092
 Curious+0.090
Listen+0.090
Um+0.089
 Everything+0.085
Neg logit contributions
ins-0.125
ies-0.124
--0.121
cer-0.120
ie-0.110
Token together
Token ID1978
Feature activation+9.19e+00

Pos logit contributions
 Curious+0.099
 Everything+0.095
Listen+0.094
Well+0.093
Over+0.093
Neg logit contributions
ies-0.075
ins-0.072
ie-0.067
cer-0.066
ishing-0.060
Token.
Token ID13
Feature activation+1.98e+01

Pos logit contributions
 One+0.075
On+0.073
 On+0.073
 Unfortunately+0.073
 �+0.073
Neg logit contributions
ins-0.066
ies-0.061
cer-0.056
ishing-0.053
akers-0.051
Token<|endoftext|>
Token ID50256
Feature activation+1.36e+01

Pos logit contributions
Um+0.131
Hmm+0.120
 daunting+0.120
Listen+0.119
 Curious+0.116
Neg logit contributions
--0.172
ies-0.158
ins-0.158
."-0.157
e-0.155
TokenOnce
Token ID7454
Feature activation+1.26e+01

Pos logit contributions
 Curious+0.048
Well+0.048
Listen+0.047
 Everywhere+0.043
Over+0.043
Neg logit contributions
ies-0.165
ins-0.160
cer-0.155
--0.152
ishing-0.151
Token upon
Token ID2402
Feature activation+1.45e+01

Pos logit contributions
Well+0.083
 Curious+0.082
Listen+0.081
Um+0.079
Over+0.079
Neg logit contributions
ins-0.106
cer-0.104
ies-0.101
--0.100
ates-0.094
Token a
Token ID257
Feature activation+1.25e+01

Pos logit contributions
 Curious+0.091
Over+0.089
Listen+0.087
Well+0.082
 Something+0.082
Neg logit contributions
ies-0.138
ins-0.137
ishing-0.131
cer-0.131
--0.124
Token time
Token ID640
Feature activation+1.32e+01

Pos logit contributions
 Curious+0.086
Well+0.085
Listen+0.085
In+0.084
 On+0.084
Neg logit contributions
ins-0.103
ies-0.101
cer-0.098
er-0.092
ishing-0.089
Token me
Token ID502
Feature activation+9.23e+00

Pos logit contributions
Listen+0.081
Um+0.080
 Curious+0.079
Well+0.078
 Everything+0.077
Neg logit contributions
ins-0.090
cer-0.088
ies-0.086
kin-0.081
--0.081
Token find
Token ID1064
Feature activation+1.09e+01

Pos logit contributions
 Curious+0.057
 On+0.053
 Unfortunately+0.053
 Something+0.053
Um+0.052
Neg logit contributions
ins-0.081
cer-0.077
ies-0.075
--0.069
ates-0.069
Token my
Token ID616
Feature activation+7.13e+00

Pos logit contributions
 Everything+0.077
 Curious+0.077
 Something+0.074
Um+0.074
 An+0.074
Neg logit contributions
ins-0.085
ies-0.083
cer-0.083
kin-0.080
ates-0.071
Token way
Token ID835
Feature activation+9.12e+00

Pos logit contributions
 Curious+0.053
Um+0.050
 Everywhere+0.050
 Sadly+0.049
 Something+0.049
Neg logit contributions
ins-0.059
ies-0.051
cer-0.051
--0.049
ie-0.046
Token home
Token ID1363
Feature activation+6.48e+00

Pos logit contributions
 Curious+0.081
Listen+0.080
 Everywhere+0.080
Well+0.079
 Everything+0.078
Neg logit contributions
ins-0.058
ies-0.051
cer-0.051
--0.050
ates-0.046
Token?"
Token ID1701
Feature activation+1.98e+01

Pos logit contributions
Listen+0.058
 Curious+0.057
 On+0.057
 Everywhere+0.057
 Everything+0.057
Neg logit contributions
ins-0.037
cer-0.034
ies-0.033
--0.030
ates-0.028
Token
Token ID198
Feature activation+1.76e+01

Pos logit contributions
Um+0.129
Listen+0.128
Well+0.125
Over+0.123
 daunting+0.121
Neg logit contributions
--0.166
ins-0.158
ies-0.158
ie-0.149
cer-0.149
Token
Token ID198
Feature activation+1.73e+01

Pos logit contributions
Well+0.090
 Curious+0.088
Listen+0.087
Um+0.087
 Everything+0.081
Neg logit contributions
ies-0.176
ins-0.174
--0.172
cer-0.170
ie-0.158
TokenB
Token ID33
Feature activation+7.35e+00

Pos logit contributions
Um+0.139
Well+0.138
 Everything+0.138
 Curious+0.136
Listen+0.136
Neg logit contributions
--0.125
ins-0.117
ies-0.115
cer-0.114
er-0.107
Tokenenny
Token ID11870
Feature activation+7.27e+00

Pos logit contributions
Well+0.048
Listen+0.047
Um+0.047
 Curious+0.047
 Everything+0.045
Neg logit contributions
ies-0.063
ins-0.063
--0.063
cer-0.062
ie-0.056
Token thought
Token ID1807
Feature activation+1.38e+01

Pos logit contributions
Listen+0.051
Well+0.050
Um+0.049
Over+0.048
 Curious+0.048
Neg logit contributions
--0.060
ies-0.060
ins-0.056
cer-0.056
ie-0.054
Token playing
Token ID2712
Feature activation+9.01e+00

Pos logit contributions
Well+0.085
 Curious+0.080
 Unfortunately+0.079
Over+0.079
Listen+0.079
Neg logit contributions
cer-0.070
ins-0.067
ies-0.067
ie-0.062
kin-0.061
Token together
Token ID1978
Feature activation+8.51e+00

Pos logit contributions
 Curious+0.065
 On+0.063
Well+0.063
On+0.063
Over+0.063
Neg logit contributions
ies-0.064
ins-0.063
cer-0.057
ie-0.055
ater-0.052
Token and
Token ID290
Feature activation+1.44e+01

Pos logit contributions
 �+0.060
On+0.059
Well+0.056
 One+0.055
 Everywhere+0.055
Neg logit contributions
ins-0.066
ies-0.061
cer-0.055
ishing-0.054
rim-0.054
Token having
Token ID1719
Feature activation+1.03e+01

Pos logit contributions
Well+0.112
Listen+0.110
 Everything+0.106
 Curious+0.106
Um+0.104
Neg logit contributions
ins-0.108
ies-0.105
--0.102
cer-0.101
ater-0.096
Token fun
Token ID1257
Feature activation+6.34e+00

Pos logit contributions
Well+0.067
 Curious+0.067
Listen+0.065
Um+0.064
 Everything+0.063
Neg logit contributions
ies-0.089
ins-0.088
cer-0.086
--0.085
ishing-0.078
Token.
Token ID13
Feature activation+1.98e+01

Pos logit contributions
Well+0.048
 All+0.044
On+0.043
That+0.043
All+0.042
Neg logit contributions
ins-0.043
ies-0.042
an-0.042
isher-0.041
kin-0.040
Token He
Token ID679
Feature activation+1.36e+01

Pos logit contributions
Um+0.123
Listen+0.113
 Curious+0.110
 daunting+0.108
Hmm+0.108
Neg logit contributions
--0.172
ies-0.166
cer-0.166
ins-0.158
ie-0.158
Token was
Token ID373
Feature activation+1.44e+01

Pos logit contributions
 Curious+0.092
Um+0.091
Well+0.091
Listen+0.090
 Everything+0.086
Neg logit contributions
ies-0.115
--0.113
ins-0.111
cer-0.110
ie-0.103
Token never
Token ID1239
Feature activation+1.19e+01

Pos logit contributions
Well+0.094
 Curious+0.093
Listen+0.091
Um+0.091
 Everything+0.087
Neg logit contributions
ies-0.125
ins-0.125
cer-0.120
--0.119
ie-0.110
Token lonely
Token ID21757
Feature activation+7.97e+00

Pos logit contributions
 Curious+0.060
Um+0.055
Well+0.054
Listen+0.053
 Everything+0.051
Neg logit contributions
ins-0.122
ies-0.119
cer-0.118
--0.116
ates-0.115
Token again
Token ID757
Feature activation+6.86e+00

Pos logit contributions
 Curious+0.049
Um+0.046
Well+0.046
 daunting+0.045
 Who+0.045
Neg logit contributions
ins-0.075
ies-0.070
cks-0.067
ates-0.066
ep-0.064
Token's
Token ID338
Feature activation+1.27e+01

Pos logit contributions
Well+0.112
 Curious+0.112
Listen+0.111
Um+0.109
 Everything+0.107
Neg logit contributions
ies-0.111
ins-0.111
cer-0.108
--0.104
ates-0.098
Token fix
Token ID4259
Feature activation+7.58e+00

Pos logit contributions
 Curious+0.056
 Everything+0.056
 On+0.053
Um+0.053
 Something+0.052
Neg logit contributions
ins-0.142
ies-0.130
cer-0.129
--0.125
ishing-0.120
Token it
Token ID340
Feature activation+4.43e+00

Pos logit contributions
 Everything+0.034
That+0.031
Listen+0.031
Well+0.031
 An+0.031
Neg logit contributions
ins-0.075
cer-0.075
ies-0.074
kin-0.073
ater-0.067
Token for
Token ID329
Feature activation+1.24e+01

Pos logit contributions
 On+0.039
On+0.039
 Unfortunately+0.038
Inside+0.038
Unfortunately+0.037
Neg logit contributions
cer-0.024
ins-0.024
ies-0.022
ebra-0.019
ates-0.019
Token you
Token ID345
Feature activation+9.11e+00

Pos logit contributions
Well+0.105
Listen+0.105
Um+0.105
 Curious+0.102
 Everything+0.100
Neg logit contributions
ies-0.083
ins-0.082
cer-0.079
--0.074
ie-0.068
Token."
Token ID526
Feature activation+1.98e+01

Pos logit contributions
 Curious+0.084
 Unfortunately+0.083
 One+0.082
Um+0.081
On+0.081
Neg logit contributions
ins-0.048
cer-0.047
rob-0.040
ies-0.040
ishing-0.038
Token She
Token ID1375
Feature activation+1.32e+01

Pos logit contributions
Um+0.130
Listen+0.125
Well+0.121
 Curious+0.118
Over+0.116
Neg logit contributions
ies-0.174
--0.173
ins-0.169
cer-0.157
er-0.154
Token took
Token ID1718
Feature activation+1.28e+01

Pos logit contributions
Well+0.087
 Curious+0.086
Listen+0.085
Um+0.085
 Everything+0.081
Neg logit contributions
ies-0.114
ins-0.112
--0.111
cer-0.109
ie-0.100
Token some
Token ID617
Feature activation+7.39e+00

Pos logit contributions
 Curious+0.095
Well+0.095
Listen+0.094
 Everything+0.093
Um+0.092
Neg logit contributions
ies-0.099
ins-0.099
cer-0.096
--0.090
ates-0.084
Token pins
Token ID20567
Feature activation+5.46e+00

Pos logit contributions
 Curious+0.060
Well+0.059
Listen+0.059
Um+0.058
 Everything+0.057
Neg logit contributions
cer-0.051
ins-0.049
ies-0.048
ates-0.041
--0.041
Token and
Token ID290
Feature activation+1.59e+01

Pos logit contributions
Well+0.039
 �+0.039
+0.038
On+0.038
Over+0.038
Neg logit contributions
ins-0.037
cer-0.036
ies-0.033
ates-0.031
cks-0.030
Token't
Token ID470
Feature activation-2.02e+01

Pos logit contributions
Well+0.147
 Curious+0.146
Listen+0.145
Um+0.145
 Everything+0.140
Neg logit contributions
ies-0.063
ins-0.062
--0.061
cer-0.057
ie-0.049
Token know
Token ID760
Feature activation-1.86e+01

Pos logit contributions
ins+0.194
cer+0.184
ies+0.182
-+0.179
ates+0.177
Neg logit contributions
 Curious-0.111
Um-0.110
Listen-0.110
Well-0.109
 Everything-0.104
Token the
Token ID262
Feature activation-1.95e+01

Pos logit contributions
cer+0.138
ins+0.137
ies+0.132
ishing+0.126
-+0.125
Neg logit contributions
 Everything-0.136
Well-0.133
 Curious-0.133
Um-0.133
Listen-0.132
Token rh
Token ID9529
Feature activation-4.80e+01

Pos logit contributions
ins+0.152
ies+0.150
cer+0.145
-+0.143
ishing+0.131
Neg logit contributions
Well-0.143
 Curious-0.143
Um-0.141
Listen-0.141
 Everything-0.135
Tokenin
Token ID259
Feature activation-3.24e+01

Pos logit contributions
ies+0.295
et+0.254
ins+0.246
ishing+0.244
er+0.244
Neg logit contributions
Well-0.384
Um-0.363
Over-0.363
Two-0.359
Listen-0.350
Tokenoc
Token ID420
Feature activation-6.12e+01

Pos logit contributions
ies+0.156
ins+0.155
cer+0.142
-+0.136
ishing+0.122
Neg logit contributions
Well-0.336
 Curious-0.331
Um-0.329
Listen-0.324
 Everything-0.316
Tokeneros
Token ID27498
Feature activation-3.02e+01

Pos logit contributions
ies+0.393
kin+0.366
ins+0.344
cer+0.313
kins+0.307
Neg logit contributions
Well-0.517
Um-0.504
 Somebody-0.472
 Unfortunately-0.468
 Everything-0.466
Token was
Token ID373
Feature activation-1.63e+01

Pos logit contributions
ies+0.240
ins+0.236
-+0.232
cer+0.230
ie+0.209
Neg logit contributions
Well-0.216
 Curious-0.213
Listen-0.211
Um-0.211
 Everything-0.201
Token wild
Token ID4295
Feature activation-2.32e+01

Pos logit contributions
ies+0.135
ins+0.134
cer+0.129
-+0.124
ie+0.116
Neg logit contributions
 Curious-0.111
Um-0.110
Well-0.109
Listen-0.109
 Everything-0.105
Token.
Token ID13
Feature activation-8.68e+00

Pos logit contributions
ins+0.149
kin+0.147
ates+0.135
kins+0.132
cer+0.128
Neg logit contributions
 Curious-0.193
 Everything-0.187
 �-0.183
Well-0.181
 Everywhere-0.180
Token Can
Token ID1680
Feature activation-1.12e+01

Pos logit contributions
-+0.073
ins+0.072
cer+0.071
ies+0.070
er+0.070
Neg logit contributions
Listen-0.051
Over-0.048
Well-0.046
Um-0.045
Hmm-0.042
Token he
Token ID339
Feature activation-1.77e+01

Pos logit contributions
ins+0.204
ies+0.202
cer+0.196
-+0.194
ishing+0.179
Neg logit contributions
Well-0.161
 Curious-0.159
Um-0.157
Listen-0.156
 Everything-0.151
Token would
Token ID561
Feature activation-1.39e+01

Pos logit contributions
ies+0.162
ins+0.159
-+0.158
cer+0.156
ie+0.145
Neg logit contributions
Well-0.105
 Curious-0.103
Listen-0.103
Um-0.102
 Everything-0.097
Token take
Token ID1011
Feature activation-1.77e+01

Pos logit contributions
ins+0.125
ies+0.124
cer+0.120
-+0.119
ishing+0.110
Neg logit contributions
 Curious-0.084
Well-0.084
Listen-0.082
Um-0.082
 Everything-0.079
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
ins+0.147
ies+0.145
cer+0.138
ishing+0.123
-+0.123
Neg logit contributions
 Curious-0.122
 Everything-0.122
Well-0.121
Listen-0.120
 Everywhere-0.119
Token av
Token ID1196
Feature activation-5.10e+01

Pos logit contributions
ins+0.142
ies+0.140
cer+0.136
-+0.133
ie+0.119
Neg logit contributions
 Curious-0.151
Well-0.150
Um-0.147
Listen-0.147
 Everything-0.142
Tokenoc
Token ID420
Feature activation-5.99e+01

Pos logit contributions
ies+0.308
kin+0.298
ins+0.290
cer+0.271
web+0.258
Neg logit contributions
Well-0.433
Finally-0.423
Hmm-0.419
Um-0.415
 Unfortunately-0.414
Tokenados
Token ID22484
Feature activation-3.06e+01

Pos logit contributions
kin+0.378
ies+0.378
cer+0.365
ins+0.342
web+0.307
Neg logit contributions
Well-0.491
Um-0.478
 Unfortunately-0.458
 Curious-0.456
 Somebody-0.453
Token and
Token ID290
Feature activation-1.15e+01

Pos logit contributions
ins+0.223
ies+0.218
cer+0.210
-+0.202
ishing+0.189
Neg logit contributions
Well-0.231
 Everywhere-0.220
 Curious-0.218
Listen-0.218
Um-0.216
Token scatter
Token ID41058
Feature activation-2.11e+01

Pos logit contributions
ins+0.093
ies+0.092
cer+0.090
-+0.089
ishing+0.081
Neg logit contributions
Well-0.080
 Curious-0.079
Um-0.078
Listen-0.077
 Everything-0.074
Token them
Token ID606
Feature activation-2.04e+01

Pos logit contributions
ies+0.155
ins+0.154
cer+0.150
-+0.142
kin+0.133
Neg logit contributions
Well-0.159
 Everywhere-0.156
 Curious-0.155
 Everything-0.153
Listen-0.153
Token around
Token ID1088
Feature activation-1.69e+01

Pos logit contributions
ins+0.108
cer+0.101
ies+0.100
-+0.096
ates+0.083
Neg logit contributions
 Curious-0.191
Well-0.188
 Everywhere-0.184
Over-0.184
Listen-0.182
Token am
Token ID716
Feature activation-1.53e+01

Pos logit contributions
ies+0.121
-+0.119
ins+0.117
cer+0.116
ie+0.106
Neg logit contributions
Well-0.109
 Curious-0.108
Listen-0.108
Um-0.106
 Everything-0.102
Token taking
Token ID2263
Feature activation-2.09e+01

Pos logit contributions
ies+0.125
cer+0.123
ins+0.122
-+0.117
ishing+0.110
Neg logit contributions
Well-0.104
Um-0.104
 Curious-0.103
Listen-0.101
 Everything-0.100
Token the
Token ID262
Feature activation-2.14e+01

Pos logit contributions
ies+0.162
ins+0.160
cer+0.155
ie+0.140
kin+0.139
Neg logit contributions
Listen-0.152
 Curious-0.151
Well-0.151
 Everything-0.150
Um-0.149
Token rh
Token ID9529
Feature activation-4.44e+01

Pos logit contributions
ins+0.163
ies+0.158
cer+0.153
-+0.148
ishing+0.137
Neg logit contributions
 Curious-0.158
Well-0.157
Listen-0.156
Um-0.155
Over-0.148
Tokenin
Token ID259
Feature activation-3.31e+01

Pos logit contributions
ies+0.277
kin+0.251
et+0.251
er+0.240
ps+0.237
Neg logit contributions
Well-0.324
Two-0.323
Over-0.319
Um-0.307
oral-0.301
Tokenoc
Token ID420
Feature activation-5.97e+01

Pos logit contributions
ins+0.216
ies+0.206
cer+0.181
kin+0.177
ishing+0.175
Neg logit contributions
Well-0.283
Um-0.278
 Curious-0.277
 Who-0.269
 Unfortunately-0.268
Tokeneros
Token ID27498
Feature activation-3.07e+01

Pos logit contributions
ies+0.384
kin+0.365
ins+0.347
cer+0.322
kins+0.305
Neg logit contributions
Well-0.482
Um-0.470
 Unfortunately-0.444
 Somebody-0.443
 Everything-0.437
Token to
Token ID284
Feature activation-1.25e+01

Pos logit contributions
ins+0.190
ies+0.186
cer+0.175
kin+0.165
ishing+0.159
Neg logit contributions
 Curious-0.255
Well-0.253
Um-0.251
Listen-0.249
 Unfortunately-0.246
Token a
Token ID257
Feature activation-1.67e+01

Pos logit contributions
ies+0.094
ins+0.093
-+0.090
cer+0.089
ie+0.080
Neg logit contributions
Well-0.095
 Curious-0.094
Um-0.094
Listen-0.094
 Everything-0.090
Token new
Token ID649
Feature activation-2.34e+01

Pos logit contributions
ies+0.133
-+0.131
ins+0.131
cer+0.128
ie+0.118
Neg logit contributions
Well-0.119
 Curious-0.116
Listen-0.116
Um-0.115
 Everything-0.110
Token home
Token ID1363
Feature activation-2.14e+01

Pos logit contributions
ins+0.190
ies+0.176
kin+0.155
cer+0.154
kins+0.153
Neg logit contributions
 Curious-0.168
Um-0.165
 Everywhere-0.163
Well-0.162
 Unfortunately-0.162
Token give
Token ID1577
Feature activation-1.92e+01

Pos logit contributions
ins+0.157
ies+0.152
cer+0.148
-+0.147
ishing+0.137
Neg logit contributions
 Curious-0.104
Um-0.104
Well-0.103
Listen-0.102
 Everything-0.101
Token to
Token ID284
Feature activation-1.26e+01

Pos logit contributions
cer+0.148
ins+0.145
ies+0.142
kin+0.134
-+0.132
Neg logit contributions
Listen-0.132
Well-0.131
 Everything-0.129
Um-0.127
 Curious-0.126
Token the
Token ID262
Feature activation-1.85e+01

Pos logit contributions
ins+0.099
cer+0.099
ies+0.096
-+0.094
ishing+0.088
Neg logit contributions
 Everything-0.088
 Curious-0.088
Well-0.086
 Everywhere-0.084
Listen-0.084
Token rh
Token ID9529
Feature activation-4.27e+01

Pos logit contributions
ins+0.130
ies+0.129
cer+0.126
-+0.125
ishing+0.112
Neg logit contributions
Well-0.148
 Curious-0.147
Um-0.145
Listen-0.144
 Everything-0.139
Tokenin
Token ID259
Feature activation-3.02e+01

Pos logit contributions
ies+0.261
et+0.236
cer+0.231
kin+0.231
ishing+0.226
Neg logit contributions
Well-0.330
Two-0.321
Um-0.320
Over-0.319
oral-0.312
Tokenoc
Token ID420
Feature activation-5.76e+01

Pos logit contributions
ins+0.149
ies+0.145
cer+0.140
ishing+0.122
-+0.122
Neg logit contributions
 Curious-0.305
Well-0.305
Um-0.299
Listen-0.294
 Everything-0.292
Tokeneros
Token ID27498
Feature activation-2.80e+01

Pos logit contributions
ies+0.363
kin+0.335
ins+0.324
cer+0.304
kins+0.282
Neg logit contributions
Well-0.487
Um-0.486
Listen-0.455
 Unfortunately-0.450
 Somebody-0.449
Token.
Token ID13
Feature activation-7.72e+00

Pos logit contributions
ins+0.151
ies+0.151
cer+0.148
-+0.138
ishing+0.127
Neg logit contributions
 Curious-0.264
Well-0.261
Um-0.257
Listen-0.256
 Everything-0.255
Token Lily
Token ID20037
Feature activation-1.47e+01

Pos logit contributions
ins+0.069
-+0.067
cer+0.067
ies+0.067
er+0.064
Neg logit contributions
Listen-0.043
Over-0.040
Um-0.040
Well-0.040
Hmm-0.037
Token walked
Token ID6807
Feature activation-1.58e+01

Pos logit contributions
ies+0.114
ins+0.113
cer+0.110
-+0.109
ates+0.099
Neg logit contributions
Well-0.107
 Curious-0.107
Listen-0.103
Um-0.103
 Everything-0.100
Token to
Token ID284
Feature activation-7.57e+00

Pos logit contributions
ins+0.106
ies+0.105
cer+0.100
-+0.097
ie+0.088
Neg logit contributions
 Curious-0.136
Well-0.135
Listen-0.134
Um-0.131
 Everything-0.128
Token as
Token ID355
Feature activation-9.84e+00

Pos logit contributions
ins+0.108
ies+0.095
cer+0.093
-+0.081
ishing+0.080
Neg logit contributions
 Curious-0.246
Well-0.243
 Everything-0.238
 �-0.238
Listen-0.237
Token he
Token ID339
Feature activation-1.22e+01

Pos logit contributions
ins+0.096
ies+0.094
cer+0.092
-+0.090
ishing+0.087
Neg logit contributions
Well-0.053
 Curious-0.051
Listen-0.051
Um-0.050
Over-0.048
Token opened
Token ID4721
Feature activation-1.89e+01

Pos logit contributions
ies+0.110
-+0.109
ins+0.108
cer+0.106
ie+0.098
Neg logit contributions
Well-0.074
 Curious-0.074
Listen-0.074
Um-0.072
 Everything-0.070
Token the
Token ID262
Feature activation-1.54e+01

Pos logit contributions
ins+0.146
ies+0.140
cer+0.132
-+0.126
kin+0.120
Neg logit contributions
 Curious-0.139
 Everywhere-0.135
Well-0.135
 Everything-0.134
 �-0.132
Token ch
Token ID442
Feature activation-3.06e+01

Pos logit contributions
ies+0.109
-+0.106
ins+0.106
cer+0.103
ie+0.093
Neg logit contributions
Well-0.126
Listen-0.123
 Curious-0.123
Um-0.122
Over-0.116
Tokenocol
Token ID4668
Feature activation-5.68e+01

Pos logit contributions
cer+0.160
ies+0.158
ins+0.153
-+0.132
ie+0.126
Neg logit contributions
 Everything-0.272
 Curious-0.271
Well-0.270
Um-0.265
 Everywhere-0.261
Tokenates
Token ID689
Feature activation-2.68e+01

Pos logit contributions
ies+0.268
cer+0.250
 doll+0.244
ie+0.225
!"+0.213
Neg logit contributions
Well-0.485
Um-0.484
Fr-0.481
In-0.471
Finally-0.467
Token and
Token ID290
Feature activation-8.14e+00

Pos logit contributions
ins+0.138
ies+0.133
cer+0.119
-+0.117
ishing+0.108
Neg logit contributions
Well-0.252
 Curious-0.251
 �-0.246
 Unfortunately-0.242
Over-0.241
Token started
Token ID2067
Feature activation-1.30e+01

Pos logit contributions
ins+0.072
ies+0.071
-+0.068
cer+0.067
ie+0.063
Neg logit contributions
Well-0.052
 Curious-0.051
Um-0.051
Listen-0.050
 Everything-0.048
Token eating
Token ID6600
Feature activation-1.77e+01

Pos logit contributions
ies+0.093
ins+0.092
cer+0.090
-+0.088
ie+0.081
Neg logit contributions
Well-0.102
 Curious-0.102
Listen-0.101
Um-0.100
 Everything-0.097
Token them
Token ID606
Feature activation-1.72e+01

Pos logit contributions
ies+0.131
ins+0.129
cer+0.127
-+0.118
ishing+0.114
Neg logit contributions
 �-0.129
Well-0.128
 Curious-0.127
Listen-0.123
 Everything-0.123
Token reward
Token ID6721
Feature activation-2.99e+01

Pos logit contributions
ins+0.288
ies+0.254
ates+0.246
-+0.244
cer+0.240
Neg logit contributions
 Curious-0.291
Well-0.277
 Everything-0.275
Um-0.273
 �-0.272
Token.
Token ID13
Feature activation-1.11e+01

Pos logit contributions
ins+0.154
cer+0.150
ies+0.149
ishing+0.135
-+0.135
Neg logit contributions
Well-0.280
Um-0.268
Listen-0.267
 �-0.265
 Curious-0.263
Token It
Token ID632
Feature activation-1.53e+01

Pos logit contributions
ins+0.105
ies+0.104
cer+0.104
-+0.104
er+0.097
Neg logit contributions
Listen-0.056
Um-0.053
Well-0.053
Over-0.051
 Curious-0.050
Token was
Token ID373
Feature activation-1.34e+01

Pos logit contributions
ies+0.119
ins+0.117
-+0.116
cer+0.114
ie+0.105
Neg logit contributions
Well-0.112
Listen-0.111
 Curious-0.111
Um-0.109
 Everything-0.105
Token ch
Token ID442
Feature activation-2.57e+01

Pos logit contributions
ies+0.103
ins+0.102
-+0.099
cer+0.098
ie+0.089
Neg logit contributions
Well-0.100
 Curious-0.099
Um-0.098
Listen-0.098
 Everything-0.094
Tokenocol
Token ID4668
Feature activation-5.61e+01

Pos logit contributions
ies+0.134
ins+0.129
cer+0.125
-+0.123
ie+0.105
Neg logit contributions
Well-0.256
 Curious-0.254
Um-0.250
Listen-0.249
 Everything-0.245
Tokenates
Token ID689
Feature activation-2.87e+01

Pos logit contributions
ies+0.263
cer+0.231
 doll+0.221
-+0.209
ie+0.208
Neg logit contributions
Um-0.501
Well-0.496
Fr-0.492
Listen-0.475
In-0.469
Token and
Token ID290
Feature activation-1.31e+01

Pos logit contributions
ins+0.153
ies+0.144
cer+0.134
ishing+0.125
-+0.125
Neg logit contributions
Well-0.256
 Curious-0.255
 �-0.253
-0.252
 Everywhere-0.248
Token sweets
Token ID42402
Feature activation-2.83e+01

Pos logit contributions
ins+0.107
ies+0.103
cer+0.101
-+0.100
ishing+0.094
Neg logit contributions
Well-0.091
Um-0.088
 Curious-0.088
Listen-0.087
 Everything-0.084
Token!
Token ID0
Feature activation-6.42e+00

Pos logit contributions
ins+0.166
ies+0.150
ishing+0.137
cer+0.136
ates+0.126
Neg logit contributions
Well-0.256
 Curious-0.244
That-0.244
 �-0.243
Over-0.242
Token
Token ID198
Feature activation-4.65e+00

Pos logit contributions
ins+0.058
-+0.057
cer+0.057
ies+0.057
er+0.053
Neg logit contributions
Listen-0.036
Um-0.034
Well-0.034
Over-0.033
 Curious-0.033
Token giving
Token ID3501
Feature activation-1.85e+01

Pos logit contributions
ins+0.118
ies+0.117
cer+0.116
-+0.108
ishing+0.102
Neg logit contributions
Well-0.114
Um-0.114
Listen-0.113
 Curious-0.111
That-0.109
Token him
Token ID683
Feature activation-1.81e+01

Pos logit contributions
ins+0.144
cer+0.142
ies+0.140
kin+0.139
web+0.129
Neg logit contributions
Listen-0.124
 Everything-0.123
 Curious-0.123
Um-0.121
Well-0.121
Token too
Token ID1165
Feature activation-1.74e+01

Pos logit contributions
ins+0.133
cer+0.132
ies+0.129
-+0.120
ates+0.114
Neg logit contributions
 Curious-0.130
Listen-0.129
Um-0.128
Well-0.127
 Everything-0.125
Token many
Token ID867
Feature activation-1.96e+01

Pos logit contributions
ies+0.158
ins+0.157
cer+0.154
-+0.152
ie+0.140
Neg logit contributions
Well-0.104
 Curious-0.103
Listen-0.102
Um-0.101
 Everything-0.096
Token ch
Token ID442
Feature activation-2.91e+01

Pos logit contributions
cer+0.124
ies+0.107
ins+0.106
ishing+0.100
kin+0.099
Neg logit contributions
Well-0.165
Um-0.164
 Curious-0.164
 Everywhere-0.162
 Everything-0.159
Tokenocol
Token ID4668
Feature activation-5.57e+01

Pos logit contributions
cer+0.181
ies+0.154
web+0.152
-+0.144
ins+0.142
Neg logit contributions
Well-0.223
Um-0.220
 Everywhere-0.218
 Everything-0.217
Listen-0.214
Tokenates
Token ID689
Feature activation-2.76e+01

Pos logit contributions
ies+0.264
 doll+0.255
cer+0.247
ie+0.224
 rug+0.224
Neg logit contributions
Um-0.438
Fr-0.432
Well-0.430
Less-0.424
Listen-0.423
Token."
Token ID526
Feature activation-3.95e+00

Pos logit contributions
ins+0.118
cer+0.114
ies+0.114
ishing+0.101
-+0.095
Neg logit contributions
Well-0.267
-0.259
Um-0.255
 �-0.255
 Curious-0.255
Token The
Token ID383
Feature activation-1.04e+01

Pos logit contributions
ies+0.036
ins+0.036
-+0.036
cer+0.035
ie+0.033
Neg logit contributions
Listen-0.022
Well-0.022
Um-0.022
 Curious-0.022
Over-0.021
Token king
Token ID5822
Feature activation-2.07e+01

Pos logit contributions
ins+0.077
ies+0.076
cer+0.076
-+0.074
ishing+0.068
Neg logit contributions
Well-0.079
 Curious-0.078
Um-0.078
Listen-0.077
Over-0.073
Token learned
Token ID4499
Feature activation-8.69e+00

Pos logit contributions
ins+0.179
ies+0.178
cer+0.177
-+0.171
ishing+0.162
Neg logit contributions
Well-0.130
 Curious-0.126
Um-0.124
Listen-0.120
Over-0.115
Token it
Token ID340
Feature activation-2.07e+01

Pos logit contributions
cer+0.150
ins+0.147
ies+0.145
ater+0.138
-+0.136
Neg logit contributions
Well-0.121
 Everything-0.120
Listen-0.115
 Curious-0.114
Um-0.114
Token to
Token ID284
Feature activation-1.03e+01

Pos logit contributions
ins+0.155
ies+0.151
cer+0.150
-+0.143
ates+0.131
Neg logit contributions
 Curious-0.154
Listen-0.152
Well-0.152
Um-0.150
 Everything-0.147
Token the
Token ID262
Feature activation-1.76e+01

Pos logit contributions
ins+0.085
ies+0.083
cer+0.081
-+0.080
ishing+0.075
Neg logit contributions
Well-0.071
Um-0.070
 Everything-0.069
Listen-0.069
 Curious-0.068
Token rh
Token ID9529
Feature activation-4.04e+01

Pos logit contributions
ins+0.134
ies+0.132
-+0.127
cer+0.126
ishing+0.116
Neg logit contributions
Well-0.132
Um-0.131
 Curious-0.131
Listen-0.129
 Everything-0.124
Tokenin
Token ID259
Feature activation-2.83e+01

Pos logit contributions
ies+0.247
et+0.221
kin+0.213
cer+0.212
ishing+0.211
Neg logit contributions
Well-0.314
Over-0.302
Two-0.302
Um-0.298
oral-0.294
Tokenoc
Token ID420
Feature activation-5.54e+01

Pos logit contributions
ins+0.141
ies+0.138
cer+0.129
-+0.117
ishing+0.112
Neg logit contributions
 Curious-0.283
Well-0.282
Um-0.278
Listen-0.271
 Unfortunately-0.270
Tokeneros
Token ID27498
Feature activation-2.71e+01

Pos logit contributions
ies+0.349
kin+0.322
ins+0.319
cer+0.290
kins+0.275
Neg logit contributions
Well-0.468
Um-0.459
Listen-0.435
 Unfortunately-0.431
 Everything-0.430
Token.
Token ID13
Feature activation-6.76e+00

Pos logit contributions
ins+0.106
ies+0.105
cer+0.101
-+0.094
ishing+0.081
Neg logit contributions
 Curious-0.295
Well-0.293
Um-0.289
Listen-0.287
 Everything-0.286
Token When
Token ID1649
Feature activation-1.12e+01

Pos logit contributions
ins+0.060
-+0.059
cer+0.058
ies+0.057
er+0.055
Neg logit contributions
Listen-0.038
Um-0.036
Over-0.036
Well-0.035
Hmm-0.034
Token he
Token ID339
Feature activation-1.46e+01

Pos logit contributions
ies+0.083
-+0.081
ins+0.079
cer+0.079
ie+0.072
Neg logit contributions
Well-0.086
 Curious-0.086
Um-0.085
Listen-0.084
Over-0.080
Token tasted
Token ID29187
Feature activation-1.55e+01

Pos logit contributions
ies+0.123
-+0.122
ins+0.120
cer+0.118
ie+0.110
Neg logit contributions
Well-0.099
 Curious-0.098
Listen-0.098
Um-0.097
 Everything-0.093
Token that
Token ID326
Feature activation-1.45e+01

Pos logit contributions
ins+0.131
cer+0.122
ies+0.119
ates+0.117
kin+0.113
Neg logit contributions
Well-0.173
 Curious-0.166
Listen-0.164
 Everything-0.162
Um-0.159
Token the
Token ID262
Feature activation-1.69e+01

Pos logit contributions
ies+0.122
ins+0.120
-+0.119
cer+0.118
ie+0.108
Neg logit contributions
Well-0.097
 Curious-0.096
Um-0.095
Listen-0.095
 Everything-0.090
Token bad
Token ID2089
Feature activation-2.19e+01

Pos logit contributions
ies+0.128
ins+0.127
-+0.123
cer+0.122
ie+0.110
Neg logit contributions
Well-0.127
 Curious-0.125
Listen-0.124
Um-0.124
 Everything-0.119
Token rh
Token ID9529
Feature activation-4.22e+01

Pos logit contributions
ins+0.160
ies+0.158
cer+0.153
-+0.152
ates+0.138
Neg logit contributions
Well-0.170
 Curious-0.167
Listen-0.166
Um-0.165
 Everything-0.160
Tokenin
Token ID259
Feature activation-2.89e+01

Pos logit contributions
ies+0.230
cer+0.203
ishing+0.202
ins+0.198
-+0.197
Neg logit contributions
Well-0.385
Over-0.364
Um-0.358
Listen-0.357
Two-0.348
Tokenoc
Token ID420
Feature activation-5.50e+01

Pos logit contributions
ies+0.140
ins+0.137
cer+0.129
-+0.128
ie+0.109
Neg logit contributions
Well-0.299
 Curious-0.295
Um-0.293
Listen-0.292
 Everything-0.283
Tokeneros
Token ID27498
Feature activation-2.72e+01

Pos logit contributions
ies+0.328
ins+0.290
kin+0.290
cer+0.275
packs+0.258
Neg logit contributions
Well-0.492
Um-0.472
Listen-0.447
 Everything-0.442
 Somebody-0.441
Token was
Token ID373
Feature activation-1.41e+01

Pos logit contributions
ies+0.231
-+0.227
ins+0.226
cer+0.222
ie+0.205
Neg logit contributions
Well-0.180
Listen-0.179
 Curious-0.178
Um-0.177
 Everything-0.168
Token now
Token ID783
Feature activation-1.70e+01

Pos logit contributions
ies+0.113
ins+0.111
cer+0.108
-+0.108
ie+0.098
Neg logit contributions
Well-0.101
 Curious-0.100
Listen-0.099
Um-0.098
 Everything-0.095
Token nice
Token ID3621
Feature activation-1.74e+01

Pos logit contributions
ies+0.118
ins+0.116
-+0.114
cer+0.112
ie+0.101
Neg logit contributions
Well-0.139
 Curious-0.138
Listen-0.137
Um-0.136
 Everything-0.131
Token.
Token ID13
Feature activation-5.80e+00

Pos logit contributions
ins+0.099
ies+0.095
cer+0.089
-+0.084
ates+0.081
Neg logit contributions
Well-0.163
 Curious-0.159
 Everything-0.157
Listen-0.156
Um-0.152
Token there
Token ID612
Feature activation-1.47e+01

Pos logit contributions
ins+0.080
ies+0.079
cer+0.076
-+0.068
ates+0.068
Neg logit contributions
Well-0.139
 Curious-0.137
Listen-0.133
 Everything-0.132
Um-0.131
Token was
Token ID373
Feature activation-1.19e+01

Pos logit contributions
ies+0.111
ins+0.109
-+0.107
cer+0.106
ie+0.096
Neg logit contributions
Well-0.112
 Curious-0.110
Listen-0.110
Um-0.110
 Everything-0.104
Token a
Token ID257
Feature activation-1.45e+01

Pos logit contributions
ies+0.078
ins+0.077
-+0.077
cer+0.075
ie+0.067
Neg logit contributions
Well-0.100
 Curious-0.098
Listen-0.098
Um-0.098
 Everything-0.094
Token rh
Token ID9529
Feature activation-4.03e+01

Pos logit contributions
ies+0.107
-+0.106
cer+0.101
ins+0.101
ie+0.091
Neg logit contributions
Well-0.114
Listen-0.113
 Curious-0.110
Um-0.110
 Everything-0.107
Tokenin
Token ID259
Feature activation-2.85e+01

Pos logit contributions
ies+0.271
keeping+0.243
et+0.240
packs+0.239
ishing+0.236
Neg logit contributions
Over-0.291
Two-0.286
Well-0.279
Listen-0.264
Um-0.263
Tokenoc
Token ID420
Feature activation-5.36e+01

Pos logit contributions
ins+0.177
ies+0.166
cer+0.152
ishing+0.151
ates+0.144
Neg logit contributions
Well-0.254
Um-0.248
 Curious-0.248
 Who-0.235
Two-0.234
Tokeneros
Token ID27498
Feature activation-2.86e+01

Pos logit contributions
ies+0.363
kin+0.356
ins+0.338
cer+0.311
kins+0.308
Neg logit contributions
Um-0.406
Well-0.403
 Curious-0.382
 Somebody-0.378
 Unfortunately-0.376
Token who
Token ID508
Feature activation-1.52e+01

Pos logit contributions
ins+0.150
ies+0.145
cer+0.134
ishing+0.129
kin+0.124
Neg logit contributions
 Curious-0.272
Well-0.269
Um-0.261
Over-0.256
Listen-0.253
Token was
Token ID373
Feature activation-1.43e+01

Pos logit contributions
-+0.121
ies+0.119
cer+0.117
ins+0.116
ie+0.107
Neg logit contributions
Listen-0.110
Um-0.108
Well-0.107
 Curious-0.106
 Everything-0.101
Token always
Token ID1464
Feature activation-1.71e+01

Pos logit contributions
ies+0.119
ins+0.118
cer+0.113
-+0.111
ishing+0.104
Neg logit contributions
 Curious-0.099
Well-0.097
Um-0.097
Listen-0.094
 Everything-0.093
Token naughty
Token ID45128
Feature activation-1.74e+01

Pos logit contributions
ins+0.136
ies+0.136
cer+0.122
-+0.120
ie+0.118
Neg logit contributions
 Curious-0.125
Well-0.114
Listen-0.113
Um-0.112
 Everything-0.111
Token many
Token ID867
Feature activation-2.16e+01

Pos logit contributions
-+0.083
ins+0.080
ies+0.079
cer+0.074
er+0.069
Neg logit contributions
 Curious-0.112
Well-0.112
Um-0.110
Over-0.106
 Everywhere-0.106
Token more
Token ID517
Feature activation-2.10e+01

Pos logit contributions
cer+0.166
ies+0.164
ins+0.163
-+0.151
kin+0.142
Neg logit contributions
Well-0.160
 Curious-0.156
Listen-0.154
Um-0.153
 Everywhere-0.149
Token adventures
Token ID17545
Feature activation-2.23e+01

Pos logit contributions
cer+0.175
ins+0.173
ies+0.170
-+0.157
ates+0.152
Neg logit contributions
Well-0.140
 Curious-0.140
Um-0.134
Listen-0.132
 Everywhere-0.132
Token with
Token ID351
Feature activation-1.22e+01

Pos logit contributions
ins+0.124
ies+0.122
cer+0.114
ishing+0.099
ater+0.099
Neg logit contributions
Well-0.208
 Everywhere-0.196
 Curious-0.194
 �-0.193
Listen-0.193
Token the
Token ID262
Feature activation-1.58e+01

Pos logit contributions
ies+0.086
ins+0.086
cer+0.084
-+0.081
ishing+0.074
Neg logit contributions
Well-0.098
 Curious-0.095
Listen-0.095
Um-0.094
 Everything-0.093
Token mer
Token ID4017
Feature activation-5.31e+01

Pos logit contributions
ies+0.120
ins+0.118
-+0.116
cer+0.115
ie+0.104
Neg logit contributions
Well-0.119
 Curious-0.117
Listen-0.116
Um-0.116
 Everything-0.111
Tokenmaid
Token ID23151
Feature activation-2.42e+01

Pos logit contributions
e+0.266
cer+0.260
orns+0.256
ins+0.245
ates+0.236
Neg logit contributions
 Curious-0.449
Um-0.436
Well-0.433
 Who-0.429
 Everything-0.418
Token.
Token ID13
Feature activation-4.21e+00

Pos logit contributions
ins+0.155
ies+0.151
cer+0.149
-+0.144
ates+0.132
Neg logit contributions
Well-0.204
 Curious-0.201
Um-0.196
Listen-0.195
 Everything-0.192
Token
Token ID198
Feature activation-3.64e+00

Pos logit contributions
ins+0.037
ies+0.037
-+0.037
cer+0.036
ie+0.034
Neg logit contributions
Listen-0.024
Um-0.024
Over-0.023
Well-0.022
 Curious-0.022
Token
Token ID198
Feature activation-2.98e+00

Pos logit contributions
ins+0.030
-+0.030
ies+0.029
cer+0.028
ie+0.027
Neg logit contributions
Um-0.024
 Curious-0.024
Listen-0.023
Well-0.023
Hmm-0.022
TokenThe
Token ID464
Feature activation-1.00e+01

Pos logit contributions
ins+0.029
-+0.028
cer+0.027
ies+0.027
e+0.027
Neg logit contributions
 Curious-0.015
 Everything-0.014
 daunting-0.014
 There-0.014
Um-0.014
Token quiet
Token ID5897
Feature activation-3.76e+01

Pos logit contributions
 Curious+0.052
Well+0.051
Um+0.050
 Everything+0.050
Listen+0.050
Neg logit contributions
ins-0.043
ies-0.039
cer-0.038
ates-0.036
--0.036
Token compared
Token ID3688
Feature activation-2.77e+01

Pos logit contributions
ins+0.212
cer+0.196
ates+0.183
ishing+0.179
ies+0.170
Neg logit contributions
 Curious-0.327
 Everything-0.322
 �-0.319
Well-0.318
Listen-0.315
Token to
Token ID284
Feature activation-1.27e+01

Pos logit contributions
ins+0.199
cer+0.198
ies+0.190
-+0.183
ates+0.183
Neg logit contributions
Well-0.202
Listen-0.199
 Everywhere-0.195
 Curious-0.193
 Everything-0.192
Token the
Token ID262
Feature activation-2.04e+01

Pos logit contributions
ins+0.097
ies+0.097
cer+0.094
-+0.093
ishing+0.085
Neg logit contributions
Well-0.093
 Curious-0.092
Listen-0.092
Um-0.091
 Everything-0.090
Token noisy
Token ID31210
Feature activation-2.90e+01

Pos logit contributions
ins+0.157
ies+0.154
cer+0.149
-+0.147
ishing+0.135
Neg logit contributions
 Curious-0.150
Well-0.150
Listen-0.148
Um-0.147
 Everything-0.142
Token se
Token ID384
Feature activation-5.31e+01

Pos logit contributions
ins+0.206
ies+0.190
cer+0.190
ates+0.183
ishing+0.177
Neg logit contributions
 Curious-0.222
 Everything-0.221
Listen-0.220
Well-0.220
Um-0.208
Tokenag
Token ID363
Feature activation-3.32e+01

Pos logit contributions
cer+0.329
ies+0.308
 chefs+0.283
packs+0.264
kin+0.262
Neg logit contributions
Listen-0.424
Well-0.418
Finally-0.412
 Everything-0.403
Sadly-0.400
Tokenull
Token ID724
Feature activation-2.70e+01

Pos logit contributions
cer+0.201
ins+0.195
ies+0.193
ishing+0.191
 chefs+0.183
Neg logit contributions
Well-0.261
 Everything-0.254
 Curious-0.250
Listen-0.249
 Everywhere-0.247
Tokens
Token ID82
Feature activation-1.84e+01

Pos logit contributions
ins+0.167
cer+0.163
ies+0.155
ates+0.151
-+0.143
Neg logit contributions
Well-0.224
 Curious-0.223
Listen-0.213
 Everything-0.211
Um-0.211
Token outside
Token ID2354
Feature activation-2.16e+01

Pos logit contributions
ins+0.093
ies+0.088
cer+0.085
-+0.081
ie+0.073
Neg logit contributions
 Curious-0.180
Well-0.180
Listen-0.175
Um-0.174
 Everywhere-0.174
Token.
Token ID13
Feature activation-7.50e+00

Pos logit contributions
ins+0.090
ies+0.089
cer+0.082
-+0.079
ishing+0.070
Neg logit contributions
Well-0.230
 Curious-0.229
Listen-0.227
 Everything-0.225
Um-0.222
Token children
Token ID1751
Feature activation-2.96e+01

Pos logit contributions
Listen+0.106
Um+0.104
Hmm+0.104
 Curious+0.100
Well+0.100
Neg logit contributions
--0.111
ies-0.106
cer-0.098
ins-0.097
ie-0.093
Token put
Token ID1234
Feature activation-2.22e+01

Pos logit contributions
ins+0.192
ies+0.190
cer+0.188
-+0.186
ishing+0.169
Neg logit contributions
Well-0.247
 Curious-0.243
Um-0.239
Listen-0.238
 Everything-0.234
Token on
Token ID319
Feature activation-1.66e+01

Pos logit contributions
ins+0.148
cer+0.147
ies+0.147
-+0.132
ater+0.132
Neg logit contributions
Well-0.166
 Everything-0.166
Listen-0.162
 Curious-0.161
 Everywhere-0.159
Token their
Token ID511
Feature activation-2.44e+01

Pos logit contributions
ies+0.120
ins+0.119
cer+0.117
-+0.110
ishing+0.104
Neg logit contributions
 Curious-0.128
Well-0.126
Listen-0.124
 Everything-0.124
Um-0.123
Token bath
Token ID7837
Feature activation-3.03e+01

Pos logit contributions
ins+0.152
ies+0.150
cer+0.145
-+0.141
ie+0.127
Neg logit contributions
 Curious-0.214
Well-0.212
Um-0.210
Listen-0.210
 Everything-0.203
Tokenro
Token ID305
Feature activation-5.29e+01

Pos logit contributions
cer+0.176
ins+0.173
cks+0.170
ile+0.162
ies+0.157
Neg logit contributions
 Everywhere-0.244
Well-0.236
 Everything-0.235
 Curious-0.230
 Unfortunately-0.225
Tokenbes
Token ID12636
Feature activation-3.17e+01

Pos logit contributions
ins+0.366
ies+0.343
ie+0.334
cer+0.326
 chefs+0.318
Neg logit contributions
 Curious-0.371
 �-0.364
 hardly-0.360
Luckily-0.352
 Unfortunately-0.351
Token and
Token ID290
Feature activation-1.24e+01

Pos logit contributions
ins+0.162
ies+0.157
cer+0.150
-+0.134
ishing+0.127
Neg logit contributions
Well-0.308
 Curious-0.307
Listen-0.300
 Everywhere-0.299
 On-0.297
Token ran
Token ID4966
Feature activation-1.75e+01

Pos logit contributions
ins+0.105
ies+0.104
-+0.101
cer+0.101
ates+0.092
Neg logit contributions
Well-0.081
 Curious-0.080
Listen-0.080
Um-0.079
 Everything-0.076
Token around
Token ID1088
Feature activation-1.55e+01

Pos logit contributions
ins+0.130
ies+0.126
cer+0.122
-+0.110
kin+0.110
Neg logit contributions
 Curious-0.136
Listen-0.135
Well-0.133
 Everything-0.132
 Unfortunately-0.129
Token in
Token ID287
Feature activation-1.27e+01

Pos logit contributions
ins+0.110
ies+0.106
cer+0.101
-+0.092
ishing+0.091
Neg logit contributions
Well-0.125
 Curious-0.125
Listen-0.123
 Everything-0.121
 �-0.120
Token.
Token ID13
Feature activation-1.12e+01

Pos logit contributions
ins+0.136
cer+0.126
ates+0.117
ies+0.116
ishing+0.116
Neg logit contributions
Well-0.203
Um-0.201
 �-0.200
 Curious-0.199
-0.195
Token The
Token ID383
Feature activation-1.52e+01

Pos logit contributions
ins+0.095
cer+0.094
ies+0.094
-+0.094
e+0.087
Neg logit contributions
Listen-0.067
Um-0.065
Well-0.065
 Curious-0.064
Over-0.062
Token more
Token ID517
Feature activation-1.97e+01

Pos logit contributions
ies+0.107
ins+0.106
-+0.103
cer+0.103
ie+0.092
Neg logit contributions
Well-0.123
 Curious-0.121
Listen-0.120
Um-0.120
 Everything-0.115
Token,
Token ID11
Feature activation-9.35e+00

Pos logit contributions
ins+0.129
cer+0.129
ies+0.123
-+0.118
ishing+0.115
Neg logit contributions
Well-0.164
 Curious-0.160
Um-0.157
 Everything-0.156
Listen-0.154
Token the
Token ID262
Feature activation-1.41e+01

Pos logit contributions
ins+0.070
ies+0.069
cer+0.067
-+0.066
ishing+0.060
Neg logit contributions
Well-0.071
 Curious-0.070
Listen-0.069
Um-0.068
 Everything-0.068
Token mer
Token ID4017
Feature activation-5.25e+01

Pos logit contributions
ies+0.105
ins+0.105
cer+0.101
-+0.101
ie+0.091
Neg logit contributions
Well-0.108
 Curious-0.106
Um-0.106
Listen-0.104
 Everything-0.100
Tokenrier
Token ID5277
Feature activation-3.75e+01

Pos logit contributions
orns+0.282
e+0.256
cer+0.254
ins+0.243
ates+0.238
Neg logit contributions
 Who-0.431
Um-0.428
 Curious-0.426
Well-0.418
 Everything-0.401
Token.
Token ID13
Feature activation-6.83e+00

Pos logit contributions
ins+0.207
ishing+0.200
cer+0.180
ies+0.180
ates+0.175
Neg logit contributions
Well-0.333
Um-0.319
 Who-0.317
 Curious-0.305
Listen-0.302
Token Come
Token ID7911
Feature activation-1.24e+01

Pos logit contributions
ins+0.062
cer+0.061
ies+0.059
-+0.058
ates+0.058
Neg logit contributions
Listen-0.036
Um-0.035
Well-0.034
Over-0.033
 Curious-0.032
Token to
Token ID284
Feature activation-7.90e+00

Pos logit contributions
ies+0.055
ins+0.054
cer+0.051
-+0.050
ie+0.042
Neg logit contributions
Well-0.133
 Curious-0.132
Listen-0.131
Um-0.130
 Everything-0.127
Token my
Token ID616
Feature activation-1.55e+01

Pos logit contributions
ins+0.057
ies+0.057
-+0.055
cer+0.055
ishing+0.048
Neg logit contributions
Well-0.062
Um-0.061
 Curious-0.061
Listen-0.061
 Everything-0.059
Token
Token ID198
Feature activation-7.43e+00

Pos logit contributions
ins+0.073
-+0.072
ies+0.071
cer+0.068
ie+0.064
Neg logit contributions
Well-0.051
 Curious-0.051
Um-0.051
Listen-0.050
Over-0.048
Token"
Token ID1
Feature activation-7.50e+00

Pos logit contributions
ins+0.059
-+0.057
cer+0.057
ies+0.055
e+0.052
Neg logit contributions
 Curious-0.049
Well-0.048
Um-0.048
Listen-0.047
 Everything-0.047
TokenThe
Token ID464
Feature activation-1.46e+01

Pos logit contributions
cer+0.072
ins+0.071
ab+0.067
-+0.067
ies+0.065
Neg logit contributions
 daunting-0.035
 Sadly-0.035
 Unfortunately-0.035
 Everything-0.035
 Everywhere-0.034
Token rh
Token ID9529
Feature activation-3.83e+01

Pos logit contributions
ins+0.111
ies+0.111
cer+0.106
-+0.106
ie+0.096
Neg logit contributions
Well-0.110
 Curious-0.110
Um-0.108
Listen-0.108
 Everything-0.104
Tokenin
Token ID259
Feature activation-2.40e+01

Pos logit contributions
ies+0.220
ishing+0.193
ins+0.188
et+0.188
cer+0.185
Neg logit contributions
Well-0.327
Um-0.311
Over-0.311
Listen-0.309
Two-0.306
Tokenoc
Token ID420
Feature activation-5.21e+01

Pos logit contributions
ins+0.118
ies+0.114
cer+0.106
-+0.096
ishing+0.093
Neg logit contributions
Well-0.244
 Curious-0.243
Um-0.240
Listen-0.236
 Everything-0.231
Tokeneros
Token ID27498
Feature activation-2.35e+01

Pos logit contributions
ies+0.330
kin+0.306
ins+0.302
cer+0.280
packs+0.275
Neg logit contributions
Um-0.428
Well-0.425
Listen-0.400
 Who-0.394
 Somebody-0.393
Token is
Token ID318
Feature activation-1.10e+01

Pos logit contributions
ins+0.156
ies+0.154
cer+0.147
-+0.142
ishing+0.131
Neg logit contributions
Well-0.195
 Curious-0.195
Um-0.192
Listen-0.188
 Everything-0.185
Token in
Token ID287
Feature activation-1.38e+01

Pos logit contributions
ins+0.084
ies+0.084
cer+0.081
-+0.079
ie+0.073
Neg logit contributions
 Curious-0.081
Well-0.081
Um-0.080
Listen-0.080
 Everything-0.077
Token there
Token ID612
Feature activation-1.51e+01

Pos logit contributions
ins+0.107
ies+0.106
cer+0.097
-+0.096
ishing+0.090
Neg logit contributions
 Curious-0.104
Well-0.103
Um-0.102
Listen-0.102
 Everything-0.099
Token,
Token ID11
Feature activation-8.96e+00

Pos logit contributions
ins+0.064
ies+0.061
cer+0.057
-+0.056
ishing+0.048
Neg logit contributions
 Curious-0.162
Well-0.161
Um-0.159
Listen-0.158
 Everything-0.157
Token his
Token ID465
Feature activation-2.95e+01

Pos logit contributions
 Curious+0.065
Listen+0.065
Um+0.064
Well+0.064
 Everything+0.063
Neg logit contributions
ins-0.037
ies-0.035
cer-0.035
--0.032
kin-0.032
Token head
Token ID1182
Feature activation-2.91e+01

Pos logit contributions
ins+0.234
cer+0.211
ies+0.208
-+0.201
ishing+0.192
Neg logit contributions
 Curious-0.214
Um-0.210
 Everywhere-0.208
Well-0.205
 �-0.198
Token out
Token ID503
Feature activation-2.45e+01

Pos logit contributions
ins+0.135
ies+0.131
cer+0.123
-+0.116
ishing+0.106
Neg logit contributions
 Curious-0.295
Well-0.294
Listen-0.289
Um-0.286
 Everything-0.282
Token of
Token ID286
Feature activation-2.17e+01

Pos logit contributions
ins+0.128
ies+0.126
cer+0.125
-+0.117
ates+0.103
Neg logit contributions
 Curious-0.239
Well-0.236
Um-0.233
Listen-0.232
 Everything-0.228
Token his
Token ID465
Feature activation-2.57e+01

Pos logit contributions
ins+0.187
ies+0.183
cer+0.175
-+0.170
ater+0.158
Neg logit contributions
 Curious-0.140
Listen-0.138
 Everywhere-0.137
Well-0.136
 Everything-0.135
Token bur
Token ID4356
Feature activation-5.20e+01

Pos logit contributions
ins+0.185
ies+0.171
cer+0.170
-+0.160
er+0.150
Neg logit contributions
 Curious-0.204
Um-0.199
Well-0.194
 Everywhere-0.192
 Everything-0.190
Tokenrow
Token ID808
Feature activation-3.36e+01

Pos logit contributions
cer+0.331
kin+0.316
e+0.303
ins+0.294
 chefs+0.286
Neg logit contributions
 Curious-0.334
 daunting-0.329
 Unfortunately-0.324
rolling-0.323
 Everywhere-0.322
Token and
Token ID290
Feature activation-1.21e+01

Pos logit contributions
ins+0.166
cer+0.149
ies+0.147
e+0.131
ishing+0.129
Neg logit contributions
 Curious-0.320
Well-0.307
 Everywhere-0.307
 �-0.306
 Everything-0.305
Token saw
Token ID2497
Feature activation-1.37e+01

Pos logit contributions
ins+0.108
ies+0.105
cer+0.102
-+0.102
ates+0.095
Neg logit contributions
 Curious-0.076
Well-0.075
Um-0.074
Listen-0.073
 Everything-0.070
Token something
Token ID1223
Feature activation-1.70e+01

Pos logit contributions
ies+0.108
ins+0.107
cer+0.104
-+0.100
ishing+0.095
Neg logit contributions
 Curious-0.099
Well-0.098
 Everything-0.096
Um-0.095
Listen-0.094
Token new
Token ID649
Feature activation-1.95e+01

Pos logit contributions
-+0.137
ies+0.136
ins+0.131
cer+0.127
ie+0.119
Neg logit contributions
Well-0.123
 Curious-0.122
Listen-0.121
Um-0.121
 Everything-0.115
Token't
Token ID470
Feature activation-8.61e+00

Pos logit contributions
-+0.110
ies+0.106
ins+0.104
cer+0.100
ie+0.099
Neg logit contributions
 Curious-0.065
Well-0.065
Um-0.065
Listen-0.063
Hmm-0.060
Token touch
Token ID3638
Feature activation-1.83e+01

Pos logit contributions
ies+0.069
ins+0.067
-+0.067
cer+0.065
ie+0.060
Neg logit contributions
Well-0.063
 Curious-0.062
Listen-0.062
Um-0.061
 Everything-0.059
Token the
Token ID262
Feature activation-1.59e+01

Pos logit contributions
ies+0.150
cer+0.148
ins+0.147
kin+0.146
ates+0.140
Neg logit contributions
 Everything-0.112
That-0.110
Listen-0.107
Well-0.106
 Curious-0.106
Token rh
Token ID9529
Feature activation-3.85e+01

Pos logit contributions
ins+0.123
ies+0.114
cer+0.112
-+0.108
ishing+0.107
Neg logit contributions
 Curious-0.110
Um-0.109
Well-0.108
Listen-0.107
 Everything-0.103
Tokenin
Token ID259
Feature activation-2.86e+01

Pos logit contributions
ies+0.251
ishing+0.231
packs+0.229
et+0.229
keeping+0.225
Neg logit contributions
Two-0.266
Well-0.264
Over-0.255
That-0.247
Um-0.247
Tokenoc
Token ID420
Feature activation-5.14e+01

Pos logit contributions
ins+0.198
ies+0.192
ishing+0.176
ates+0.165
kin+0.161
Neg logit contributions
Well-0.218
That-0.217
 Who-0.216
-0.215
 Curious-0.211
Tokeneros
Token ID27498
Feature activation-2.61e+01

Pos logit contributions
ies+0.333
kin+0.313
ins+0.297
packs+0.278
cer+0.277
Neg logit contributions
Well-0.416
Um-0.409
Listen-0.380
 Somebody-0.378
 Curious-0.376
Token!
Token ID0
Feature activation-7.11e+00

Pos logit contributions
ins+0.122
ies+0.116
ates+0.108
cer+0.104
ishing+0.103
Neg logit contributions
-0.249
 Curious-0.236
 Who-0.235
That-0.232
Well-0.229
Token It
Token ID632
Feature activation-1.31e+01

Pos logit contributions
-+0.063
ins+0.062
cer+0.062
!"+0.060
e+0.059
Neg logit contributions
Listen-0.039
Well-0.037
Over-0.036
 daunting-0.036
Um-0.035
Token might
Token ID1244
Feature activation-1.55e+01

Pos logit contributions
ins+0.097
ies+0.096
cer+0.094
-+0.090
ates+0.087
Neg logit contributions
 Curious-0.100
Well-0.100
Um-0.099
Listen-0.096
 Everything-0.094
Token bite
Token ID13197
Feature activation-1.90e+01

Pos logit contributions
ins+0.134
ies+0.134
cer+0.129
-+0.129
ishing+0.118
Neg logit contributions
Well-0.100
 Curious-0.098
Listen-0.098
Um-0.098
 Everything-0.092
Token They
Token ID1119
Feature activation-1.01e+01

Pos logit contributions
ins+0.072
-+0.070
cer+0.069
ies+0.069
!"+0.066
Neg logit contributions
Listen-0.042
Well-0.040
Um-0.040
Over-0.039
 Curious-0.038
Token find
Token ID1064
Feature activation-1.11e+01

Pos logit contributions
ies+0.081
ins+0.079
-+0.078
cer+0.077
ie+0.070
Neg logit contributions
Well-0.071
 Curious-0.070
Listen-0.070
Um-0.070
 Everything-0.066
Token the
Token ID262
Feature activation-1.29e+01

Pos logit contributions
ies+0.087
ins+0.087
cer+0.084
-+0.082
ie+0.076
Neg logit contributions
 Curious-0.080
Listen-0.079
Well-0.079
 Everything-0.079
Um-0.078
Token rh
Token ID9529
Feature activation-3.67e+01

Pos logit contributions
ins+0.098
ies+0.095
-+0.092
cer+0.091
ie+0.086
Neg logit contributions
 Curious-0.096
Listen-0.096
Well-0.096
Um-0.094
Over-0.092
Tokenin
Token ID259
Feature activation-2.48e+01

Pos logit contributions
ies+0.215
et+0.195
ishing+0.189
er+0.186
cer+0.186
Neg logit contributions
Well-0.288
Over-0.286
Two-0.279
Listen-0.273
Um-0.267
Tokenoc
Token ID420
Feature activation-5.13e+01

Pos logit contributions
ins+0.146
ies+0.139
cer+0.127
ishing+0.124
ates+0.115
Neg logit contributions
Well-0.222
 Curious-0.216
Over-0.210
Um-0.209
Listen-0.209
Tokeneros
Token ID27498
Feature activation-2.44e+01

Pos logit contributions
ies+0.332
ins+0.303
kin+0.293
cer+0.274
ie+0.263
Neg logit contributions
Well-0.413
Um-0.406
 Somebody-0.391
Listen-0.387
 Everything-0.385
Token.
Token ID13
Feature activation-4.15e+00

Pos logit contributions
ins+0.137
ies+0.132
cer+0.130
-+0.120
ishing+0.114
Neg logit contributions
 Curious-0.218
Well-0.215
Listen-0.212
 Everything-0.209
Over-0.208
Token They
Token ID1119
Feature activation-8.20e+00

Pos logit contributions
."+0.029
cer+0.028
-+0.028
ins+0.028
.+0.028
Neg logit contributions
Listen-0.029
Um-0.027
Over-0.027
Days-0.027
Well-0.027
Token are
Token ID389
Feature activation-1.09e+01

Pos logit contributions
ies+0.065
-+0.062
cer+0.062
ins+0.062
ie+0.056
Neg logit contributions
Well-0.060
Listen-0.059
 Curious-0.058
Um-0.058
 Everything-0.055
Token gentle
Token ID10296
Feature activation-1.77e+01

Pos logit contributions
ins+0.093
cer+0.093
ies+0.093
-+0.091
ie+0.085
Neg logit contributions
 Curious-0.069
Well-0.068
Um-0.068
Listen-0.067
 Everything-0.065
Token Everyday
Token ID48154
Feature activation-2.42e+01

Pos logit contributions
 Curious+0.088
Um+0.088
Listen+0.087
Well+0.087
Hmm+0.082
Neg logit contributions
ies-0.101
ins-0.097
--0.096
cer-0.095
ie-0.090
Token he
Token ID339
Feature activation-1.77e+01

Pos logit contributions
ins+0.204
ies+0.202
cer+0.196
-+0.194
ishing+0.179
Neg logit contributions
Well-0.161
 Curious-0.159
Um-0.157
Listen-0.156
 Everything-0.151
Token would
Token ID561
Feature activation-1.39e+01

Pos logit contributions
ies+0.162
ins+0.159
-+0.158
cer+0.156
ie+0.145
Neg logit contributions
Well-0.105
 Curious-0.103
Listen-0.103
Um-0.102
 Everything-0.097
Token take
Token ID1011
Feature activation-1.77e+01

Pos logit contributions
ins+0.125
ies+0.124
cer+0.120
-+0.119
ishing+0.110
Neg logit contributions
 Curious-0.084
Well-0.084
Listen-0.082
Um-0.082
 Everything-0.079
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
ins+0.147
ies+0.145
cer+0.138
ishing+0.123
-+0.123
Neg logit contributions
 Curious-0.122
 Everything-0.122
Well-0.121
Listen-0.120
 Everywhere-0.119
Token av
Token ID1196
Feature activation-5.10e+01

Pos logit contributions
ins+0.142
ies+0.140
cer+0.136
-+0.133
ie+0.119
Neg logit contributions
 Curious-0.151
Well-0.150
Um-0.147
Listen-0.147
 Everything-0.142
Tokenoc
Token ID420
Feature activation-5.99e+01

Pos logit contributions
ies+0.308
kin+0.298
ins+0.290
cer+0.271
web+0.258
Neg logit contributions
Well-0.433
Finally-0.423
Hmm-0.419
Um-0.415
 Unfortunately-0.414
Tokenados
Token ID22484
Feature activation-3.06e+01

Pos logit contributions
kin+0.378
ies+0.378
cer+0.365
ins+0.342
web+0.307
Neg logit contributions
Well-0.491
Um-0.478
 Unfortunately-0.458
 Curious-0.456
 Somebody-0.453
Token and
Token ID290
Feature activation-1.15e+01

Pos logit contributions
ins+0.223
ies+0.218
cer+0.210
-+0.202
ishing+0.189
Neg logit contributions
Well-0.231
 Everywhere-0.220
 Curious-0.218
Listen-0.218
Um-0.216
Token scatter
Token ID41058
Feature activation-2.11e+01

Pos logit contributions
ins+0.093
ies+0.092
cer+0.090
-+0.089
ishing+0.081
Neg logit contributions
Well-0.080
 Curious-0.079
Um-0.078
Listen-0.077
 Everything-0.074
Token them
Token ID606
Feature activation-2.04e+01

Pos logit contributions
ies+0.155
ins+0.154
cer+0.150
-+0.142
kin+0.133
Neg logit contributions
Well-0.159
 Everywhere-0.156
 Curious-0.155
 Everything-0.153
Listen-0.153
Token was
Token ID373
Feature activation-1.09e+01

Pos logit contributions
ies+0.104
ins+0.102
-+0.100
cer+0.099
ishing+0.090
Neg logit contributions
Well-0.089
 Curious-0.087
Listen-0.086
Um-0.086
 Everything-0.083
Token a
Token ID257
Feature activation-1.39e+01

Pos logit contributions
ies+0.070
-+0.069
ins+0.069
cer+0.068
ie+0.060
Neg logit contributions
Well-0.093
 Curious-0.092
Listen-0.091
Um-0.091
 Everything-0.087
Token big
Token ID1263
Feature activation-2.21e+01

Pos logit contributions
ies+0.104
-+0.103
ins+0.099
cer+0.099
ie+0.089
Neg logit contributions
Well-0.108
Listen-0.107
 Curious-0.105
Um-0.104
 Everything-0.102
Token rh
Token ID9529
Feature activation-3.89e+01

Pos logit contributions
ins+0.160
ies+0.146
cer+0.142
ates+0.134
-+0.133
Neg logit contributions
 Curious-0.178
Well-0.174
 Everything-0.170
Um-0.170
Listen-0.169
Tokenin
Token ID259
Feature activation-2.75e+01

Pos logit contributions
ies+0.259
et+0.235
ishing+0.233
keeping+0.228
packs+0.226
Neg logit contributions
Over-0.285
Two-0.277
Well-0.272
Listen-0.257
Um-0.255
Tokenoc
Token ID420
Feature activation-5.09e+01

Pos logit contributions
ins+0.177
ies+0.168
ishing+0.153
cer+0.150
ates+0.143
Neg logit contributions
Well-0.239
 Curious-0.232
Um-0.231
 Who-0.225
Two-0.222
Tokeneros
Token ID27498
Feature activation-2.75e+01

Pos logit contributions
ies+0.347
kin+0.338
ins+0.323
cer+0.298
kins+0.291
Neg logit contributions
Well-0.381
Um-0.380
 Curious-0.365
 Somebody-0.360
 Everything-0.355
Token.
Token ID13
Feature activation-5.79e+00

Pos logit contributions
ins+0.136
ies+0.132
cer+0.120
ishing+0.115
kin+0.111
Neg logit contributions
 Curious-0.269
Well-0.267
Um-0.258
Over-0.254
Listen-0.251
Token He
Token ID679
Feature activation-1.11e+01

Pos logit contributions
-+0.051
."+0.050
!"+0.050
cer+0.049
er+0.049
Neg logit contributions
Listen-0.033
Well-0.030
Um-0.029
Over-0.029
Days-0.028
Token was
Token ID373
Feature activation-1.07e+01

Pos logit contributions
-+0.082
ies+0.075
cer+0.072
ins+0.070
ie+0.068
Neg logit contributions
Listen-0.090
Well-0.087
 Everything-0.085
Hmm-0.085
Um-0.085
Token troubled
Token ID17840
Feature activation-1.42e+01

Pos logit contributions
ies+0.086
ins+0.085
-+0.083
cer+0.082
ie+0.075
Neg logit contributions
Well-0.077
 Curious-0.077
Um-0.076
Listen-0.075
 Everything-0.073
Token any
Token ID597
Feature activation-1.00e+01

Pos logit contributions
cer+0.015
ins+0.015
ies+0.015
kin+0.015
akers+0.013
Neg logit contributions
Um-0.010
Listen-0.010
 Who-0.010
 An-0.009
Well-0.009
Token dog
Token ID3290
Feature activation-1.08e+01

Pos logit contributions
ins+0.078
ishing+0.076
ies+0.075
ates+0.075
cer+0.074
Neg logit contributions
 Curious-0.069
Um-0.069
Listen-0.068
Over-0.067
Well-0.067
Token."
Token ID526
Feature activation+6.37e+00

Pos logit contributions
ins+0.064
ates+0.061
ies+0.058
cer+0.054
kin+0.052
Neg logit contributions
-0.094
 Curious-0.093
Listen-0.092
Over-0.091
 Who-0.090
Token Mom
Token ID11254
Feature activation-5.67e+00

Pos logit contributions
Well+0.057
Listen+0.057
Over+0.055
 Curious+0.055
Hmm+0.055
Neg logit contributions
--0.038
ies-0.036
.-0.033
."-0.028
cer-0.028
Token says
Token ID1139
Feature activation-1.77e+00

Pos logit contributions
ies+0.040
ins+0.039
-+0.038
cer+0.038
ie+0.034
Neg logit contributions
Well-0.046
 Curious-0.045
Listen-0.045
Um-0.045
 Everything-0.043
Token.
Token ID13
Feature activation+4.14e+00

Pos logit contributions
cer+0.010
ins+0.009
ies+0.009
orns+0.008
ates+0.008
Neg logit contributions
 �-0.017
 Curious-0.017
 There-0.016
Listen-0.016
 Everything-0.016
Token
Token ID198
Feature activation+6.07e+00

Pos logit contributions
Listen+0.026
Um+0.026
Over+0.025
Hmm+0.025
Well+0.025
Neg logit contributions
--0.035
.-0.032
ies-0.031
ins-0.031
er-0.030
Token
Token ID198
Feature activation+5.87e+00

Pos logit contributions
Well+0.032
 Curious+0.032
Um+0.032
Listen+0.032
 Everything+0.030
Neg logit contributions
ies-0.059
ins-0.058
--0.058
cer-0.057
ie-0.053
TokenShe
Token ID3347
Feature activation+1.55e-01

Pos logit contributions
 Curious+0.045
Well+0.045
Listen+0.044
Um+0.044
 Everything+0.043
Neg logit contributions
ies-0.044
ins-0.043
--0.043
cer-0.042
ie-0.037
Token looks
Token ID3073
Feature activation-1.89e+00

Pos logit contributions
 Curious+0.001
Well+0.001
Um+0.001
Listen+0.001
 Everything+0.001
Neg logit contributions
ies-0.001
ins-0.001
cer-0.001
--0.001
ie-0.001
Token up
Token ID510
Feature activation-6.00e+00

Pos logit contributions
ins+0.009
ies+0.009
cer+0.009
-+0.008
ates+0.007
Neg logit contributions
 Curious-0.019
Well-0.019
Listen-0.019
Um-0.019
 Everything-0.019
Token bear
Token ID6842
Feature activation-2.75e+00

Pos logit contributions
Well+0.049
Listen+0.049
Um+0.048
 Curious+0.048
 Unfortunately+0.048
Neg logit contributions
ies-0.024
--0.024
ins-0.021
ie-0.020
cer-0.020
Token was
Token ID373
Feature activation+6.66e+00

Pos logit contributions
ies+0.022
-+0.022
ins+0.022
cer+0.021
ie+0.020
Neg logit contributions
Well-0.019
Listen-0.019
Um-0.019
 Curious-0.019
 Everything-0.018
Token so
Token ID523
Feature activation+7.27e+00

Pos logit contributions
 Curious+0.044
Well+0.044
Um+0.043
Listen+0.043
 Everything+0.041
Neg logit contributions
ies-0.057
ins-0.056
cer-0.055
--0.054
ie-0.051
Token happy
Token ID3772
Feature activation+4.07e+00

Pos logit contributions
Um+0.055
Listen+0.055
Well+0.055
 Curious+0.054
 Everything+0.053
Neg logit contributions
--0.056
ies-0.055
ins-0.055
cer-0.051
kin-0.047
Token.
Token ID13
Feature activation+1.07e+01

Pos logit contributions
Well+0.029
 Curious+0.027
 Unfortunately+0.027
 Who+0.026
That+0.026
Neg logit contributions
ins-0.032
kin-0.031
ies-0.030
ates-0.029
cer-0.029
Token He
Token ID679
Feature activation+6.07e+00

Pos logit contributions
Listen+0.065
Um+0.063
Over+0.060
Hmm+0.059
Uh+0.058
Neg logit contributions
--0.093
er-0.092
."-0.087
ins-0.086
cer-0.084
Token said
Token ID531
Feature activation+8.46e+00

Pos logit contributions
Listen+0.039
Well+0.039
Um+0.039
 Curious+0.038
 Everything+0.037
Neg logit contributions
--0.053
ies-0.052
ins-0.051
cer-0.050
ie-0.047
Token,
Token ID11
Feature activation+6.92e+00

Pos logit contributions
Well+0.082
 Curious+0.081
Listen+0.081
Um+0.080
 Everything+0.078
Neg logit contributions
ies-0.047
ins-0.046
--0.044
cer-0.044
ie-0.038
Token "
Token ID366
Feature activation+1.15e+01

Pos logit contributions
Um+0.055
Hmm+0.053
Listen+0.053
Over+0.052
Well+0.052
Neg logit contributions
--0.050
ies-0.043
er-0.042
ins-0.042
.-0.041
TokenI
Token ID40
Feature activation+8.46e+00

Pos logit contributions
 Curious+0.049
Well+0.048
Um+0.047
Listen+0.047
 Everything+0.044
Neg logit contributions
ies-0.124
ins-0.124
cer-0.123
--0.122
ie-0.113
Token did
Token ID750
Feature activation+5.34e+00

Pos logit contributions
Well+0.050
Um+0.049
 Curious+0.048
Listen+0.048
Over+0.045
Neg logit contributions
ins-0.078
ies-0.078
cer-0.074
--0.073
ie-0.069
Token high
Token ID1029
Feature activation+2.70e+00

Pos logit contributions
 Curious+0.070
Well+0.068
Listen+0.067
 Everywhere+0.066
 Everything+0.065
Neg logit contributions
ins-0.066
ies-0.060
cer-0.060
--0.055
ishing-0.055
Token in
Token ID287
Feature activation+8.52e+00

Pos logit contributions
 On+0.021
On+0.020
Well+0.020
 Curious+0.019
 Unfortunately+0.019
Neg logit contributions
ins-0.020
kin-0.018
cer-0.016
ies-0.016
e-0.016
Token the
Token ID262
Feature activation+7.15e+00

Pos logit contributions
Well+0.061
 Curious+0.060
Listen+0.060
Um+0.058
 Everything+0.057
Neg logit contributions
ins-0.070
ies-0.068
cer-0.065
--0.063
ishing-0.060
Token sky
Token ID6766
Feature activation+3.72e+00

Pos logit contributions
Well+0.049
Um+0.048
Listen+0.047
 Curious+0.047
 Everything+0.044
Neg logit contributions
--0.060
ies-0.060
ins-0.058
cer-0.057
ie-0.053
Token.
Token ID13
Feature activation+1.78e+01

Pos logit contributions
 �+0.028
 One+0.028
On+0.028
 On+0.028
 Curious+0.027
Neg logit contributions
ins-0.027
isher-0.023
cer-0.022
ishing-0.022
ates-0.022
Token When
Token ID1649
Feature activation+1.25e+01

Pos logit contributions
Listen+0.117
Um+0.117
Well+0.113
Hmm+0.111
Over+0.108
Neg logit contributions
--0.142
ies-0.142
ins-0.137
er-0.133
."-0.129
Token they
Token ID484
Feature activation+1.07e+01

Pos logit contributions
Well+0.095
 Curious+0.095
Um+0.094
Listen+0.092
 Everywhere+0.088
Neg logit contributions
ies-0.096
--0.093
ins-0.090
cer-0.089
ie-0.082
Token landed
Token ID11406
Feature activation+9.30e+00

Pos logit contributions
 Curious+0.071
Listen+0.070
Um+0.070
Well+0.070
 Everything+0.067
Neg logit contributions
ies-0.091
--0.091
ins-0.086
cer-0.085
ie-0.083
Token,
Token ID11
Feature activation+1.47e+01

Pos logit contributions
 Curious+0.096
Well+0.096
Listen+0.095
 Everything+0.094
Over+0.093
Neg logit contributions
ies-0.043
ins-0.043
cer-0.041
--0.036
ie-0.035
Token Lily
Token ID20037
Feature activation+9.44e+00

Pos logit contributions
Well+0.105
 Curious+0.104
Listen+0.103
Um+0.102
 Everything+0.098
Neg logit contributions
ies-0.118
ins-0.117
--0.114
cer-0.113
ie-0.103
Token said
Token ID531
Feature activation+1.37e+01

Pos logit contributions
Well+0.062
 Curious+0.061
Um+0.059
Listen+0.059
Over+0.057
Neg logit contributions
ins-0.080
ies-0.080
cer-0.078
--0.077
ie-0.071
Token were
Token ID547
Feature activation+2.44e+00

Pos logit contributions
ies+0.046
ins+0.045
-+0.045
cer+0.044
ie+0.040
Neg logit contributions
Well-0.049
 Curious-0.048
Um-0.048
Listen-0.048
 Everything-0.046
Token playing
Token ID2712
Feature activation-8.27e-01

Pos logit contributions
 Curious+0.018
 Everything+0.018
Well+0.018
 Sadly+0.018
Listen+0.018
Neg logit contributions
cer-0.018
ies-0.017
ins-0.017
er-0.016
--0.016
Token in
Token ID287
Feature activation+5.23e-01

Pos logit contributions
ies+0.006
ins+0.006
keeping+0.005
akers+0.005
aron+0.005
Neg logit contributions
Over-0.007
On-0.006
In-0.006
Sure-0.006
 Curious-0.006
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
Well+0.004
 Curious+0.004
Listen+0.004
Um+0.004
 Everything+0.004
Neg logit contributions
ins-0.004
ies-0.004
--0.004
cer-0.004
ishing-0.003
Token garden
Token ID11376
Feature activation-6.68e+00

Pos logit contributions
ies+0.018
ins+0.018
-+0.017
cer+0.017
ates+0.015
Neg logit contributions
Well-0.019
 Curious-0.019
Listen-0.019
Um-0.018
 Everything-0.018
Token.
Token ID13
Feature activation+8.66e+00

Pos logit contributions
ins+0.047
rob+0.044
ab+0.043
asc+0.042
cer+0.041
Neg logit contributions
 One-0.052
On-0.051
 On-0.048
Over-0.047
 All-0.046
Token They
Token ID1119
Feature activation+4.84e+00

Pos logit contributions
Well+0.056
Listen+0.055
Um+0.055
Hmm+0.050
 Curious+0.050
Neg logit contributions
ies-0.074
cer-0.071
ins-0.069
--0.069
kin-0.064
Token liked
Token ID8288
Feature activation+2.72e+00

Pos logit contributions
Well+0.033
Um+0.033
Listen+0.033
 Curious+0.033
 Everything+0.031
Neg logit contributions
ies-0.040
--0.039
cer-0.038
ins-0.038
ie-0.035
Token to
Token ID284
Feature activation+4.46e+00

Pos logit contributions
Well+0.021
 Curious+0.020
Listen+0.020
Um+0.020
 Everything+0.019
Neg logit contributions
ies-0.020
ins-0.020
cer-0.020
--0.020
ie-0.018
Token swing
Token ID9628
Feature activation-4.81e+00

Pos logit contributions
Well+0.033
Listen+0.032
Um+0.032
 Curious+0.032
 Everything+0.030
Neg logit contributions
ies-0.035
--0.034
ins-0.034
cer-0.033
ie-0.031
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
ins+0.036
cer+0.032
ater+0.031
ishing+0.031
ies+0.031
Neg logit contributions
 �-0.038
 Everywhere-0.037
Well-0.036
 One-0.036
Over-0.036
Token They
Token ID1119
Feature activation+8.41e+00

Pos logit contributions
Listen+0.073
Over+0.072
Um+0.072
Well+0.071
Hmm+0.069
Neg logit contributions
ies-0.089
cer-0.087
--0.086
ins-0.086
."-0.081
Token cry
Token ID3960
Feature activation+3.78e+00

Pos logit contributions
Well+0.074
Listen+0.071
Um+0.071
 Curious+0.069
Over+0.067
Neg logit contributions
ies-0.052
--0.048
!"-0.045
cer-0.044
.-0.043
Token.
Token ID13
Feature activation+1.17e+01

Pos logit contributions
Well+0.041
 Curious+0.040
Listen+0.040
Um+0.040
 Everything+0.039
Neg logit contributions
ins-0.016
ies-0.016
cer-0.015
--0.015
ates-0.013
Token They
Token ID1119
Feature activation+9.07e+00

Pos logit contributions
Over+0.084
Listen+0.082
Well+0.082
Um+0.081
Hmm+0.081
Neg logit contributions
."-0.081
--0.080
.-0.079
cer-0.079
ies-0.078
Token wish
Token ID4601
Feature activation+3.69e+00

Pos logit contributions
Well+0.075
Listen+0.073
Um+0.072
 Curious+0.071
Over+0.069
Neg logit contributions
ies-0.061
--0.058
cer-0.054
!"-0.053
ie-0.052
Token they
Token ID484
Feature activation+6.14e+00

Pos logit contributions
Well+0.020
Listen+0.019
Um+0.019
 Curious+0.019
 Everything+0.018
Neg logit contributions
ins-0.036
cer-0.036
ies-0.036
--0.035
ishing-0.033
Token had
Token ID550
Feature activation+5.45e+00

Pos logit contributions
 Curious+0.042
Well+0.041
Um+0.040
 Everywhere+0.039
 Everything+0.039
Neg logit contributions
ies-0.051
--0.050
cer-0.049
ins-0.048
ie-0.045
Token shared
Token ID4888
Feature activation+1.13e+00

Pos logit contributions
Well+0.036
Listen+0.036
Um+0.036
 Curious+0.035
Over+0.034
Neg logit contributions
ies-0.046
ins-0.046
cer-0.044
--0.044
ie-0.041
Token the
Token ID262
Feature activation+3.10e+00

Pos logit contributions
 Something+0.007
Listen+0.007
 �+0.007
That+0.007
Well+0.007
Neg logit contributions
ins-0.009
ies-0.009
ab-0.009
ishing-0.008
ie-0.008
Token car
Token ID1097
Feature activation-5.18e+00

Pos logit contributions
Well+0.028
 Curious+0.028
Listen+0.028
Um+0.028
 Everything+0.027
Neg logit contributions
ies-0.019
ins-0.018
--0.018
cer-0.018
ie-0.015
Token and
Token ID290
Feature activation+6.26e+00

Pos logit contributions
ins+0.034
ies+0.029
cer+0.029
ishing+0.029
ates+0.029
Neg logit contributions
-0.042
 Unfortunately-0.042
 �-0.041
 On-0.041
Well-0.041
Token story
Token ID1621
Feature activation-1.59e+01

Pos logit contributions
ies+0.089
ins+0.089
cer+0.085
-+0.085
ie+0.076
Neg logit contributions
Well-0.096
 Curious-0.095
Listen-0.095
Um-0.094
 Everything-0.090
Token?"
Token ID1701
Feature activation+5.00e-01

Pos logit contributions
cer+0.090
ins+0.087
ies+0.085
ishing+0.079
ates+0.072
Neg logit contributions
Listen-0.145
Well-0.144
Um-0.140
 Everything-0.139
 Curious-0.139
Token
Token ID198
Feature activation+2.40e-01

Pos logit contributions
Listen+0.003
Um+0.003
Well+0.003
Over+0.003
 Curious+0.003
Neg logit contributions
ies-0.004
ins-0.004
--0.004
cer-0.004
ie-0.004
Token
Token ID198
Feature activation+3.39e-01

Pos logit contributions
Well+0.001
 Curious+0.001
Listen+0.001
Um+0.001
 Everything+0.001
Neg logit contributions
ies-0.002
ins-0.002
--0.002
cer-0.002
ie-0.002
TokenL
Token ID43
Feature activation-1.46e+01

Pos logit contributions
 Curious+0.003
Well+0.003
Um+0.003
 Everything+0.003
Listen+0.003
Neg logit contributions
ins-0.002
--0.002
ies-0.002
cer-0.002
ie-0.002
Tokenily
Token ID813
Feature activation-9.19e+00

Pos logit contributions
ies+0.126
cer+0.126
-+0.114
ishing+0.111
ins+0.109
Neg logit contributions
Well-0.095
Listen-0.091
Um-0.087
 Everything-0.087
 Curious-0.083
Token nodded
Token ID14464
Feature activation-1.04e+01

Pos logit contributions
ies+0.072
-+0.071
ins+0.071
cer+0.068
ie+0.063
Neg logit contributions
Well-0.067
Listen-0.066
Um-0.066
 Curious-0.066
 Everything-0.063
Token and
Token ID290
Feature activation-2.46e+00

Pos logit contributions
ins+0.061
cer+0.058
ies+0.055
ates+0.052
-+0.049
Neg logit contributions
 Curious-0.100
Listen-0.097
Well-0.095
Um-0.094
 �-0.093
Token said
Token ID531
Feature activation-4.24e+00

Pos logit contributions
ins+0.025
ies+0.024
cer+0.024
-+0.023
ates+0.023
Neg logit contributions
 Curious-0.013
Well-0.013
Um-0.012
Listen-0.012
Over-0.011
Token,
Token ID11
Feature activation-4.00e+00

Pos logit contributions
ins+0.023
cer+0.023
ies+0.022
ishing+0.020
ater+0.019
Neg logit contributions
 Curious-0.041
 �-0.041
Well-0.040
 Everything-0.040
Over-0.040
Token "
Token ID366
Feature activation+3.54e-01

Pos logit contributions
-+0.035
ies+0.035
ins+0.034
cer+0.033
er+0.031
Neg logit contributions
Um-0.025
Listen-0.024
Well-0.024
 Curious-0.024
Hmm-0.023
Token a
Token ID257
Feature activation-6.37e+00

Pos logit contributions
ies+0.087
ins+0.086
cer+0.077
ishing+0.075
ishes+0.073
Neg logit contributions
 Curious-0.057
Over-0.056
On-0.056
 Everywhere-0.055
In-0.054
Token lot
Token ID1256
Feature activation-1.27e+01

Pos logit contributions
ies+0.051
-+0.050
ins+0.050
cer+0.049
ie+0.045
Neg logit contributions
Well-0.046
 Curious-0.045
Listen-0.045
Um-0.044
 Everything-0.043
Token and
Token ID290
Feature activation-3.55e+00

Pos logit contributions
ies+0.084
ins+0.081
cer+0.079
ishing+0.072
ates+0.065
Neg logit contributions
 Everywhere-0.110
 On-0.109
Well-0.107
On-0.107
In-0.106
Token Ben
Token ID3932
Feature activation-8.00e+00

Pos logit contributions
ins+0.029
ies+0.029
-+0.028
cer+0.027
ishing+0.026
Neg logit contributions
Well-0.025
Listen-0.025
 Curious-0.024
Um-0.024
 Everything-0.024
Token cannot
Token ID2314
Feature activation-4.03e+00

Pos logit contributions
ies+0.058
ins+0.057
-+0.055
cer+0.055
ie+0.050
Neg logit contributions
Well-0.063
 Curious-0.062
Listen-0.061
Um-0.061
 Everything-0.059
Token go
Token ID467
Feature activation-7.87e+00

Pos logit contributions
ies+0.032
-+0.031
ins+0.031
cer+0.030
ie+0.028
Neg logit contributions
Well-0.030
 Curious-0.029
Listen-0.029
Um-0.029
 Everything-0.027
Token outside
Token ID2354
Feature activation-1.19e+01

Pos logit contributions
ins+0.052
cer+0.048
ies+0.046
ates+0.044
kin+0.044
Neg logit contributions
 Curious-0.068
Listen-0.067
 On-0.065
Well-0.065
 Everywhere-0.065
Token.
Token ID13
Feature activation+3.06e+00

Pos logit contributions
ins+0.076
ies+0.067
cer+0.066
ates+0.062
ishing+0.062
Neg logit contributions
 On-0.105
 Curious-0.104
 One-0.104
 Unfortunately-0.104
 Everywhere-0.102
Token He
Token ID679
Feature activation-2.63e+00

Pos logit contributions
Well+0.018
Listen+0.017
Um+0.016
Over+0.015
Finally+0.015
Neg logit contributions
--0.027
cer-0.027
ies-0.026
ins-0.026
kin-0.025
Token is
Token ID318
Feature activation-1.53e+00

Pos logit contributions
ies+0.018
ins+0.018
-+0.018
cer+0.017
ie+0.016
Neg logit contributions
Well-0.022
Listen-0.021
 Curious-0.021
Um-0.021
Over-0.020
Token sad
Token ID6507
Feature activation-4.37e+00

Pos logit contributions
ins+0.013
ies+0.013
cer+0.013
-+0.013
ie+0.012
Neg logit contributions
 Curious-0.010
Well-0.010
Um-0.010
Listen-0.010
 Everything-0.009
Token beetles
Token ID46716
Feature activation-9.16e+00

Pos logit contributions
ins+0.049
cer+0.045
ies+0.044
ishing+0.041
ates+0.039
Neg logit contributions
 Everywhere-0.080
-0.080
 �-0.079
 Curious-0.079
 Everything-0.079
Token,
Token ID11
Feature activation+2.58e+00

Pos logit contributions
ins+0.050
ies+0.046
cer+0.046
ishing+0.042
-+0.041
Neg logit contributions
-0.082
Well-0.080
 Curious-0.080
 Everywhere-0.080
 On-0.079
Token so
Token ID523
Feature activation+5.22e+00

Pos logit contributions
Well+0.022
Listen+0.022
 Curious+0.022
Um+0.022
Over+0.021
Neg logit contributions
ies-0.017
ins-0.017
--0.017
cer-0.016
ie-0.015
Token they
Token ID484
Feature activation+1.98e+00

Pos logit contributions
Well+0.035
Um+0.034
 Curious+0.034
Listen+0.033
 Everything+0.032
Neg logit contributions
ins-0.044
ies-0.043
cer-0.042
--0.042
ishing-0.040
Token can
Token ID460
Feature activation+2.89e+00

Pos logit contributions
Well+0.014
 Curious+0.013
Listen+0.013
Um+0.013
 Everything+0.013
Neg logit contributions
ies-0.016
ins-0.016
--0.016
cer-0.016
ie-0.014
Token eat
Token ID4483
Feature activation-3.73e+00

Pos logit contributions
Well+0.022
 Curious+0.022
Listen+0.022
Um+0.021
 Everything+0.020
Neg logit contributions
ies-0.022
--0.021
ins-0.021
cer-0.021
ie-0.020
Token!"
Token ID2474
Feature activation+9.26e+00

Pos logit contributions
cer+0.026
ies+0.025
ins+0.024
-+0.023
ishing+0.022
Neg logit contributions
Well-0.027
Listen-0.027
 Everywhere-0.027
 Curious-0.026
 �-0.026
Token
Token ID198
Feature activation+8.43e+00

Pos logit contributions
Listen+0.048
Um+0.047
Over+0.044
Well+0.044
Hmm+0.043
Neg logit contributions
--0.091
ins-0.085
ies-0.084
cer-0.084
er-0.082
Token
Token ID198
Feature activation+8.68e+00

Pos logit contributions
 daunting+0.063
 Curious+0.062
Um+0.062
Hmm+0.061
Listen+0.060
Neg logit contributions
--0.063
ins-0.058
!"-0.052
cer-0.052
.-0.051
TokenHis
Token ID6653
Feature activation-2.33e-02

Pos logit contributions
 Curious+0.057
 daunting+0.056
 Everything+0.055
 There+0.055
 Everywhere+0.053
Neg logit contributions
--0.072
ins-0.066
cer-0.063
ep-0.063
e-0.060
Token friends
Token ID2460
Feature activation-6.55e-01

Pos logit contributions
ins+0.000
ies+0.000
cer+0.000
-+0.000
ie+0.000
Neg logit contributions
 Curious-0.000
Well-0.000
Um-0.000
Listen-0.000
 Everything-0.000
Token play
Token ID711
Feature activation-2.70e+01

Pos logit contributions
Um+0.090
Listen+0.088
Hmm+0.087
Well+0.086
Over+0.085
Neg logit contributions
--0.094
ies-0.089
ins-0.087
er-0.081
!"-0.076
Token around
Token ID1088
Feature activation-2.53e+01

Pos logit contributions
ins+0.191
ies+0.187
akers+0.183
ishing+0.179
isher+0.173
Neg logit contributions
 One-0.192
 All-0.189
Over-0.184
On-0.180
 Curious-0.180
Token,
Token ID11
Feature activation-1.11e+01

Pos logit contributions
ins+0.254
akers+0.239
ies+0.233
ishing+0.221
ater+0.217
Neg logit contributions
 One-0.124
 �-0.122
 On-0.122
On-0.118
 Everywhere-0.118
Token maybe
Token ID3863
Feature activation-1.15e+01

Pos logit contributions
ins+0.084
ies+0.084
-+0.081
cer+0.081
ates+0.072
Neg logit contributions
Well-0.084
 Curious-0.083
Listen-0.082
Um-0.081
 Everything-0.079
Token the
Token ID262
Feature activation-1.54e+01

Pos logit contributions
ins+0.107
ies+0.096
cer+0.095
ates+0.095
ater+0.094
Neg logit contributions
Um-0.075
Listen-0.073
Well-0.072
 Curious-0.068
 �-0.066
Token hunter
Token ID19177
Feature activation-2.53e+01

Pos logit contributions
ins+0.115
ies+0.112
cer+0.107
-+0.107
ishing+0.098
Neg logit contributions
 Curious-0.120
Well-0.120
Um-0.117
Listen-0.116
 Everything-0.113
Token will
Token ID481
Feature activation-1.15e+01

Pos logit contributions
ins+0.230
ies+0.228
cer+0.221
-+0.220
ates+0.205
Neg logit contributions
Well-0.149
 Curious-0.147
Listen-0.145
Um-0.144
 Everything-0.137
Token think
Token ID892
Feature activation-1.31e+01

Pos logit contributions
ies+0.096
ins+0.094
-+0.093
cer+0.092
ie+0.084
Neg logit contributions
Well-0.079
 Curious-0.077
Listen-0.077
Um-0.076
 Everything-0.073
Token we
Token ID356
Feature activation-1.13e+01

Pos logit contributions
ins+0.090
ies+0.090
cer+0.088
-+0.082
ishing+0.078
Neg logit contributions
Well-0.106
 Curious-0.104
Listen-0.104
 Everything-0.103
Um-0.102
Token're
Token ID821
Feature activation-1.41e+01

Pos logit contributions
ies+0.093
-+0.090
ins+0.089
cer+0.089
ie+0.082
Neg logit contributions
Well-0.079
 Curious-0.079
Listen-0.078
Um-0.078
 Everything-0.073
Token not
Token ID407
Feature activation-1.39e+01

Pos logit contributions
ins+0.111
cer+0.110
ies+0.109
-+0.107
ie+0.099
Neg logit contributions
 Curious-0.092
 Everything-0.090
Listen-0.089
Um-0.088
Well-0.087
Token store
Token ID3650
Feature activation+6.86e-01

Pos logit contributions
Well+0.036
Listen+0.034
Um+0.034
 Curious+0.034
 Everything+0.032
Neg logit contributions
--0.026
ies-0.025
cer-0.024
ins-0.024
er-0.022
Token and
Token ID290
Feature activation+9.09e+00

Pos logit contributions
Well+0.007
 Curious+0.007
Listen+0.007
Um+0.007
 Unfortunately+0.007
Neg logit contributions
ins-0.003
ies-0.003
cer-0.003
--0.003
ates-0.003
Token bought
Token ID5839
Feature activation+6.05e+00

Pos logit contributions
Well+0.067
Listen+0.066
Um+0.066
 Curious+0.064
 Everywhere+0.061
Neg logit contributions
--0.072
ies-0.071
cer-0.065
ins-0.064
ie-0.062
Token the
Token ID262
Feature activation+4.55e+00

Pos logit contributions
Well+0.045
 Curious+0.045
Listen+0.044
Um+0.044
 Everything+0.043
Neg logit contributions
ies-0.046
ins-0.045
cer-0.045
--0.043
ishing-0.039
Token blue
Token ID4171
Feature activation-6.12e+00

Pos logit contributions
Well+0.039
Listen+0.038
Um+0.038
 Curious+0.038
Over+0.036
Neg logit contributions
ies-0.031
--0.030
ins-0.029
cer-0.028
ie-0.026
Token color
Token ID3124
Feature activation-2.70e+00

Pos logit contributions
ins+0.041
ies+0.039
cer+0.039
-+0.037
ie+0.034
Neg logit contributions
Well-0.052
 Curious-0.051
Listen-0.050
Um-0.050
 Everything-0.050
Token.
Token ID13
Feature activation+1.34e+01

Pos logit contributions
ins+0.013
cer+0.012
ies+0.012
-+0.011
ishing+0.010
Neg logit contributions
Well-0.028
 Curious-0.027
Listen-0.027
Um-0.027
 Everything-0.027
Token They
Token ID1119
Feature activation+8.08e+00

Pos logit contributions
Um+0.099
Listen+0.099
Well+0.096
Over+0.093
Hmm+0.092
Neg logit contributions
ies-0.099
--0.098
ins-0.095
er-0.091
cer-0.090
Token came
Token ID1625
Feature activation+7.23e+00

Pos logit contributions
Listen+0.062
Um+0.061
 Curious+0.061
Well+0.059
 Everything+0.058
Neg logit contributions
ies-0.065
--0.063
ins-0.059
ie-0.058
cer-0.058
Token back
Token ID736
Feature activation+3.69e+00

Pos logit contributions
Well+0.069
 Curious+0.068
Listen+0.068
Um+0.068
 Everything+0.066
Neg logit contributions
ies-0.041
ins-0.040
--0.039
cer-0.038
ie-0.033
Token home
Token ID1363
Feature activation+1.84e+00

Pos logit contributions
Well+0.038
 Curious+0.037
Listen+0.037
Um+0.037
 Everything+0.036
Neg logit contributions
ies-0.018
ins-0.018
--0.017
cer-0.017
ie-0.014
TokenPlease
Token ID5492
Feature activation-1.91e+01

Pos logit contributions
ins+0.150
cer+0.150
-+0.146
ies+0.140
e+0.138
Neg logit contributions
 Curious-0.063
 Everything-0.062
 Everywhere-0.061
 daunting-0.059
Um-0.058
Token help
Token ID1037
Feature activation-1.72e+01

Pos logit contributions
ins+0.136
ies+0.135
cer+0.130
-+0.129
ishing+0.116
Neg logit contributions
Well-0.152
 Curious-0.151
Um-0.150
Listen-0.149
 Everything-0.145
Token me
Token ID502
Feature activation-1.96e+01

Pos logit contributions
ins+0.122
cer+0.122
ies+0.121
-+0.108
ab+0.107
Neg logit contributions
 Everything-0.130
Listen-0.128
Um-0.127
Well-0.126
 Curious-0.125
Token shake
Token ID13279
Feature activation-2.29e+01

Pos logit contributions
ins+0.117
cer+0.112
ies+0.111
-+0.095
rob+0.095
Neg logit contributions
 Curious-0.170
 Unfortunately-0.165
 Everything-0.165
 Something-0.162
 On-0.161
Token this
Token ID428
Feature activation-2.13e+01

Pos logit contributions
ins+0.152
ies+0.147
cer+0.143
-+0.131
ates+0.129
Neg logit contributions
Well-0.183
 �-0.181
Listen-0.180
 Curious-0.180
 Everything-0.179
Token rad
Token ID2511
Feature activation-4.30e+01

Pos logit contributions
ins+0.144
ies+0.139
cer+0.136
-+0.127
ates+0.120
Neg logit contributions
 Curious-0.176
Well-0.170
Um-0.169
Listen-0.167
 Everywhere-0.166
Tokenish
Token ID680
Feature activation-2.80e+01

Pos logit contributions
kin+0.226
e+0.190
ins+0.187
ies+0.185
ates+0.183
Neg logit contributions
Well-0.385
Um-0.382
Hmm-0.373
 Curious-0.367
 Somebody-0.355
Token,
Token ID11
Feature activation-9.37e+00

Pos logit contributions
ins+0.120
ies+0.119
cer+0.119
-+0.102
ishing+0.097
Neg logit contributions
Well-0.279
 Curious-0.278
 Everything-0.271
Listen-0.269
 �-0.268
Token it
Token ID340
Feature activation-1.44e+01

Pos logit contributions
ies+0.075
ins+0.074
-+0.073
cer+0.072
ie+0.066
Neg logit contributions
Well-0.067
 Curious-0.066
Listen-0.065
Um-0.065
 Everything-0.062
Token's
Token ID338
Feature activation-1.28e+01

Pos logit contributions
ies+0.114
ins+0.114
cer+0.112
-+0.110
ates+0.101
Neg logit contributions
 Curious-0.102
Well-0.102
Um-0.099
Listen-0.098
 Everything-0.096
Token too
Token ID1165
Feature activation-1.64e+01

Pos logit contributions
ies+0.107
ins+0.107
cer+0.105
-+0.101
ie+0.092
Neg logit contributions
Well-0.086
Um-0.086
 Curious-0.086
Listen-0.085
 Everything-0.083
Token She
Token ID1375
Feature activation-1.90e+01

Pos logit contributions
Um+0.115
Listen+0.109
Well+0.107
 daunting+0.104
 Curious+0.103
Neg logit contributions
--0.133
ins-0.129
ies-0.125
cer-0.120
!"-0.118
Token had
Token ID550
Feature activation-1.41e+01

Pos logit contributions
ies+0.166
ins+0.164
-+0.161
cer+0.159
ie+0.147
Neg logit contributions
Well-0.122
 Curious-0.120
Listen-0.119
Um-0.118
 Everything-0.112
Token peanut
Token ID26636
Feature activation-3.06e+01

Pos logit contributions
ies+0.102
ins+0.101
-+0.101
cer+0.096
ie+0.086
Neg logit contributions
Well-0.110
 Curious-0.109
Listen-0.107
Um-0.106
 Everything-0.103
Token butter
Token ID9215
Feature activation-2.95e+01

Pos logit contributions
cer+0.214
packs+0.195
keeping+0.193
ins+0.191
web+0.189
Neg logit contributions
Well-0.201
Um-0.196
Listen-0.185
Hmm-0.183
Over-0.181
Token and
Token ID290
Feature activation-1.19e+01

Pos logit contributions
ins+0.181
ishing+0.159
cks+0.157
cer+0.154
ymes+0.154
Neg logit contributions
Well-0.226
That-0.219
 �-0.216
-0.208
In-0.206
Token jelly
Token ID27644
Feature activation-2.81e+01

Pos logit contributions
ins+0.094
cer+0.092
ies+0.090
-+0.088
ishing+0.085
Neg logit contributions
Well-0.085
Um-0.084
Listen-0.081
 Curious-0.081
Hmm-0.077
Token and
Token ID290
Feature activation-1.08e+01

Pos logit contributions
cer+0.151
ins+0.142
ishing+0.141
ies+0.134
ep+0.133
Neg logit contributions
Well-0.240
Um-0.219
That-0.218
Over-0.210
Listen-0.209
Token some
Token ID617
Feature activation-1.99e+01

Pos logit contributions
ins+0.087
cer+0.085
ies+0.083
-+0.082
ishing+0.080
Neg logit contributions
Well-0.075
Um-0.074
Listen-0.071
 Curious-0.070
Hmm-0.067
Token apples
Token ID22514
Feature activation-2.74e+01

Pos logit contributions
cer+0.140
ins+0.136
ies+0.135
-+0.129
ishing+0.124
Neg logit contributions
Well-0.155
Um-0.151
Listen-0.149
 Curious-0.147
Over-0.142
Token.
Token ID13
Feature activation-4.88e+00

Pos logit contributions
ins+0.158
cer+0.145
ies+0.139
ishing+0.137
ates+0.132
Neg logit contributions
Well-0.238
 �-0.227
Over-0.222
That-0.221
On-0.218
Token After
Token ID2293
Feature activation-1.11e+01

Pos logit contributions
ins+0.050
ies+0.049
-+0.048
cer+0.048
ie+0.045
Neg logit contributions
Listen-0.022
Well-0.021
Um-0.021
 Curious-0.021
Over-0.020
Token a
Token ID257
Feature activation-1.44e+01

Pos logit contributions
ins+0.112
ies+0.110
cer+0.105
-+0.101
ishing+0.096
Neg logit contributions
 Curious-0.098
Well-0.097
Um-0.096
Listen-0.096
 Everything-0.094
Token great
Token ID1049
Feature activation-1.70e+01

Pos logit contributions
ies+0.140
ins+0.138
-+0.135
cer+0.135
ie+0.125
Neg logit contributions
Well-0.078
 Curious-0.077
Listen-0.076
Um-0.076
 Everything-0.071
Token job
Token ID1693
Feature activation-1.85e+01

Pos logit contributions
ins+0.131
ies+0.121
cer+0.115
ates+0.110
ishing+0.108
Neg logit contributions
 Curious-0.132
Well-0.132
Listen-0.129
Um-0.127
 Everything-0.125
Token on
Token ID319
Feature activation-1.51e+01

Pos logit contributions
ins+0.096
cer+0.094
ies+0.092
ishing+0.083
ates+0.079
Neg logit contributions
Well-0.174
 Everything-0.168
 �-0.167
 Curious-0.166
 Who-0.165
Token that
Token ID326
Feature activation-1.40e+01

Pos logit contributions
ies+0.108
ins+0.106
cer+0.106
-+0.096
ishing+0.093
Neg logit contributions
 Everything-0.114
Well-0.114
Listen-0.112
 Curious-0.111
Um-0.109
Token castle
Token ID16669
Feature activation-2.63e+01

Pos logit contributions
ins+0.101
ies+0.101
cer+0.096
-+0.093
ates+0.087
Neg logit contributions
Well-0.109
 Curious-0.109
Listen-0.108
 Everything-0.106
Um-0.105
Token Sam
Token ID3409
Feature activation-1.94e+01

Pos logit contributions
ins+0.162
cer+0.157
ies+0.154
ishing+0.145
ates+0.142
Neg logit contributions
Well-0.212
 Everything-0.207
 Curious-0.201
 Who-0.199
Listen-0.197
Token,
Token ID11
Feature activation-7.62e+00

Pos logit contributions
ins+0.135
ies+0.133
cer+0.129
-+0.125
ates+0.116
Neg logit contributions
Well-0.153
 Curious-0.150
Um-0.148
Listen-0.147
 Everything-0.144
Token I
Token ID314
Feature activation-7.82e+00

Pos logit contributions
ies+0.055
ins+0.055
cer+0.053
-+0.053
ishing+0.047
Neg logit contributions
Well-0.060
 Curious-0.059
Listen-0.058
Um-0.058
 Everything-0.057
Token'm
Token ID1101
Feature activation-9.91e+00

Pos logit contributions
ies+0.075
ins+0.074
-+0.072
cer+0.072
ie+0.067
Neg logit contributions
Well-0.044
 Curious-0.043
Um-0.042
Listen-0.042
 Everything-0.040
Token so
Token ID523
Feature activation-9.88e+00

Pos logit contributions
ies+0.093
cer+0.090
ins+0.089
-+0.086
ie+0.081
Neg logit contributions
Well-0.060
 Curious-0.060
Um-0.058
Listen-0.058
 Everything-0.058
Token saw
Token ID2497
Feature activation-1.06e+01

Pos logit contributions
-+0.183
ies+0.182
ins+0.178
cer+0.168
ie+0.163
Neg logit contributions
Listen-0.178
Well-0.176
 Curious-0.174
Um-0.173
Over-0.168
Token a
Token ID257
Feature activation-1.31e+01

Pos logit contributions
ies+0.077
cer+0.076
ins+0.075
-+0.072
ishing+0.067
Neg logit contributions
Well-0.083
 Curious-0.082
Listen-0.082
Um-0.081
 Everything-0.080
Token picture
Token ID4286
Feature activation-2.38e+01

Pos logit contributions
ies+0.104
-+0.102
ins+0.100
cer+0.099
ie+0.088
Neg logit contributions
Well-0.096
Listen-0.094
Um-0.093
 Curious-0.093
 Everything-0.089
Token of
Token ID286
Feature activation-1.19e+01

Pos logit contributions
ins+0.164
ies+0.163
cer+0.160
-+0.142
ishing+0.142
Neg logit contributions
Well-0.193
 Curious-0.188
Listen-0.187
 Everything-0.185
Over-0.183
Token a
Token ID257
Feature activation-1.39e+01

Pos logit contributions
ies+0.092
ins+0.090
-+0.088
cer+0.087
ishing+0.080
Neg logit contributions
Well-0.089
 Curious-0.089
Listen-0.088
Um-0.087
 Everything-0.086
Token key
Token ID1994
Feature activation-2.85e+01

Pos logit contributions
ins+0.104
ies+0.104
cer+0.100
-+0.099
ie+0.091
Neg logit contributions
Well-0.104
 Curious-0.104
Um-0.103
Listen-0.102
 Everything-0.098
Token.
Token ID13
Feature activation-4.43e+00

Pos logit contributions
ins+0.166
ies+0.142
cer+0.133
ishing+0.120
ates+0.119
Neg logit contributions
Well-0.267
 �-0.261
 Curious-0.259
 On-0.257
On-0.254
Token It
Token ID632
Feature activation-1.16e+01

Pos logit contributions
ins+0.041
-+0.041
er+0.039
cer+0.039
ies+0.038
Neg logit contributions
Listen-0.024
Um-0.022
Over-0.022
Well-0.021
Hmm-0.020
Token was
Token ID373
Feature activation-1.08e+01

Pos logit contributions
-+0.074
ies+0.065
er+0.062
ie+0.060
cer+0.059
Neg logit contributions
Listen-0.109
 Everything-0.103
 Everywhere-0.102
 Curious-0.101
Well-0.101
Token in
Token ID287
Feature activation-1.13e+01

Pos logit contributions
ies+0.075
-+0.074
ins+0.073
cer+0.071
ie+0.064
Neg logit contributions
Well-0.089
 Curious-0.087
Listen-0.087
Um-0.086
 Everything-0.083
Token the
Token ID262
Feature activation-1.36e+01

Pos logit contributions
ies+0.080
ins+0.079
-+0.077
cer+0.076
ie+0.068
Neg logit contributions
Well-0.092
 Curious-0.091
Listen-0.090
Um-0.090
 Everything-0.087
Token bit
Token ID1643
Feature activation-2.43e+01

Pos logit contributions
ins+0.196
ies+0.184
cer+0.176
ates+0.175
-+0.174
Neg logit contributions
 Curious-0.184
Well-0.179
Um-0.178
Listen-0.173
 Everything-0.168
Token of
Token ID286
Feature activation-1.93e+01

Pos logit contributions
ies+0.152
ins+0.151
cer+0.149
-+0.145
ates+0.133
Neg logit contributions
Well-0.212
 Curious-0.208
Um-0.207
Listen-0.206
 Everything-0.199
Token cocoa
Token ID35845
Feature activation-2.90e+01

Pos logit contributions
ies+0.147
ins+0.147
cer+0.144
-+0.139
ishing+0.127
Neg logit contributions
Well-0.142
 Curious-0.142
Listen-0.139
Um-0.139
 Everything-0.137
Token on
Token ID319
Feature activation-1.84e+01

Pos logit contributions
ins+0.143
ies+0.136
cer+0.134
-+0.119
ishing+0.115
Neg logit contributions
Well-0.289
 Curious-0.278
 Everything-0.278
Listen-0.278
Um-0.274
Token his
Token ID465
Feature activation-2.46e+01

Pos logit contributions
ins+0.143
ies+0.142
cer+0.134
-+0.128
ishing+0.126
Neg logit contributions
Well-0.134
 Curious-0.130
 Everything-0.130
Listen-0.129
 Everywhere-0.128
Token fur
Token ID9230
Feature activation-2.76e+01

Pos logit contributions
ins+0.182
cer+0.170
ies+0.169
-+0.153
ishing+0.151
Neg logit contributions
 Curious-0.187
Um-0.184
 Everywhere-0.180
Well-0.179
 Unfortunately-0.177
Token!
Token ID0
Feature activation-7.43e+00

Pos logit contributions
ins+0.125
cer+0.115
ies+0.111
ishing+0.100
-+0.097
Neg logit contributions
Well-0.275
 Curious-0.273
 Everything-0.265
Um-0.263
Listen-0.262
Token
Token ID220
Feature activation-1.48e+01

Pos logit contributions
ins+0.071
-+0.070
cer+0.070
ies+0.069
ates+0.065
Neg logit contributions
Listen-0.037
Um-0.035
Well-0.035
Over-0.034
 Curious-0.033
Token
Token ID198
Feature activation-5.22e+00

Pos logit contributions
ies+0.145
ins+0.145
-+0.142
cer+0.140
ie+0.130
Neg logit contributions
Well-0.078
Listen-0.076
Um-0.076
 Curious-0.076
Over-0.070
Token
Token ID198
Feature activation-4.43e+00

Pos logit contributions
ins+0.043
-+0.041
ies+0.039
cer+0.039
ates+0.037
Neg logit contributions
Um-0.033
Listen-0.033
 Curious-0.032
 daunting-0.032
Well-0.032
TokenAfter
Token ID3260
Feature activation-9.75e+00

Pos logit contributions
ins+0.041
-+0.040
cer+0.039
e+0.038
ies+0.036
Neg logit contributions
 Curious-0.022
 daunting-0.022
Um-0.021
 Everything-0.021
Listen-0.021
Token give
Token ID1577
Feature activation-1.92e+01

Pos logit contributions
ins+0.157
ies+0.152
cer+0.148
-+0.147
ishing+0.137
Neg logit contributions
 Curious-0.104
Um-0.104
Well-0.103
Listen-0.102
 Everything-0.101
Token to
Token ID284
Feature activation-1.26e+01

Pos logit contributions
cer+0.148
ins+0.145
ies+0.142
kin+0.134
-+0.132
Neg logit contributions
Listen-0.132
Well-0.131
 Everything-0.129
Um-0.127
 Curious-0.126
Token the
Token ID262
Feature activation-1.85e+01

Pos logit contributions
ins+0.099
cer+0.099
ies+0.096
-+0.094
ishing+0.088
Neg logit contributions
 Everything-0.088
 Curious-0.088
Well-0.086
 Everywhere-0.084
Listen-0.084
Token rh
Token ID9529
Feature activation-4.27e+01

Pos logit contributions
ins+0.130
ies+0.129
cer+0.126
-+0.125
ishing+0.112
Neg logit contributions
Well-0.148
 Curious-0.147
Um-0.145
Listen-0.144
 Everything-0.139
Tokenin
Token ID259
Feature activation-3.02e+01

Pos logit contributions
ies+0.261
et+0.236
cer+0.231
kin+0.231
ishing+0.226
Neg logit contributions
Well-0.330
Two-0.321
Um-0.320
Over-0.319
oral-0.312
Tokenoc
Token ID420
Feature activation-5.76e+01

Pos logit contributions
ins+0.149
ies+0.145
cer+0.140
ishing+0.122
-+0.122
Neg logit contributions
 Curious-0.305
Well-0.305
Um-0.299
Listen-0.294
 Everything-0.292
Tokeneros
Token ID27498
Feature activation-2.80e+01

Pos logit contributions
ies+0.363
kin+0.335
ins+0.324
cer+0.304
kins+0.282
Neg logit contributions
Well-0.487
Um-0.486
Listen-0.455
 Unfortunately-0.450
 Somebody-0.449
Token.
Token ID13
Feature activation-7.72e+00

Pos logit contributions
ins+0.151
ies+0.151
cer+0.148
-+0.138
ishing+0.127
Neg logit contributions
 Curious-0.264
Well-0.261
Um-0.257
Listen-0.256
 Everything-0.255
Token Lily
Token ID20037
Feature activation-1.47e+01

Pos logit contributions
ins+0.069
-+0.067
cer+0.067
ies+0.067
er+0.064
Neg logit contributions
Listen-0.043
Over-0.040
Um-0.040
Well-0.040
Hmm-0.037
Token walked
Token ID6807
Feature activation-1.58e+01

Pos logit contributions
ies+0.114
ins+0.113
cer+0.110
-+0.109
ates+0.099
Neg logit contributions
Well-0.107
 Curious-0.107
Listen-0.103
Um-0.103
 Everything-0.100
Token to
Token ID284
Feature activation-7.57e+00

Pos logit contributions
ins+0.106
ies+0.105
cer+0.100
-+0.097
ie+0.088
Neg logit contributions
 Curious-0.136
Well-0.135
Listen-0.134
Um-0.131
 Everything-0.128
Token he
Token ID339
Feature activation-1.77e+01

Pos logit contributions
ins+0.204
ies+0.202
cer+0.196
-+0.194
ishing+0.179
Neg logit contributions
Well-0.161
 Curious-0.159
Um-0.157
Listen-0.156
 Everything-0.151
Token would
Token ID561
Feature activation-1.39e+01

Pos logit contributions
ies+0.162
ins+0.159
-+0.158
cer+0.156
ie+0.145
Neg logit contributions
Well-0.105
 Curious-0.103
Listen-0.103
Um-0.102
 Everything-0.097
Token take
Token ID1011
Feature activation-1.77e+01

Pos logit contributions
ins+0.125
ies+0.124
cer+0.120
-+0.119
ishing+0.110
Neg logit contributions
 Curious-0.084
Well-0.084
Listen-0.082
Um-0.082
 Everything-0.079
Token the
Token ID262
Feature activation-1.94e+01

Pos logit contributions
ins+0.147
ies+0.145
cer+0.138
ishing+0.123
-+0.123
Neg logit contributions
 Curious-0.122
 Everything-0.122
Well-0.121
Listen-0.120
 Everywhere-0.119
Token av
Token ID1196
Feature activation-5.10e+01

Pos logit contributions
ins+0.142
ies+0.140
cer+0.136
-+0.133
ie+0.119
Neg logit contributions
 Curious-0.151
Well-0.150
Um-0.147
Listen-0.147
 Everything-0.142
Tokenoc
Token ID420
Feature activation-5.99e+01

Pos logit contributions
ies+0.308
kin+0.298
ins+0.290
cer+0.271
web+0.258
Neg logit contributions
Well-0.433
Finally-0.423
Hmm-0.419
Um-0.415
 Unfortunately-0.414
Tokenados
Token ID22484
Feature activation-3.06e+01

Pos logit contributions
kin+0.378
ies+0.378
cer+0.365
ins+0.342
web+0.307
Neg logit contributions
Well-0.491
Um-0.478
 Unfortunately-0.458
 Curious-0.456
 Somebody-0.453
Token and
Token ID290
Feature activation-1.15e+01

Pos logit contributions
ins+0.223
ies+0.218
cer+0.210
-+0.202
ishing+0.189
Neg logit contributions
Well-0.231
 Everywhere-0.220
 Curious-0.218
Listen-0.218
Um-0.216
Token scatter
Token ID41058
Feature activation-2.11e+01

Pos logit contributions
ins+0.093
ies+0.092
cer+0.090
-+0.089
ishing+0.081
Neg logit contributions
Well-0.080
 Curious-0.079
Um-0.078
Listen-0.077
 Everything-0.074
Token them
Token ID606
Feature activation-2.04e+01

Pos logit contributions
ies+0.155
ins+0.154
cer+0.150
-+0.142
kin+0.133
Neg logit contributions
Well-0.159
 Everywhere-0.156
 Curious-0.155
 Everything-0.153
Listen-0.153
Token around
Token ID1088
Feature activation-1.69e+01

Pos logit contributions
ins+0.108
cer+0.101
ies+0.100
-+0.096
ates+0.083
Neg logit contributions
 Curious-0.191
Well-0.188
 Everywhere-0.184
Over-0.184
Listen-0.182
Token They
Token ID1119
Feature activation-1.01e+01

Pos logit contributions
ins+0.072
-+0.070
cer+0.069
ies+0.069
!"+0.066
Neg logit contributions
Listen-0.042
Well-0.040
Um-0.040
Over-0.039
 Curious-0.038
Token find
Token ID1064
Feature activation-1.11e+01

Pos logit contributions
ies+0.081
ins+0.079
-+0.078
cer+0.077
ie+0.070
Neg logit contributions
Well-0.071
 Curious-0.070
Listen-0.070
Um-0.070
 Everything-0.066
Token the
Token ID262
Feature activation-1.29e+01

Pos logit contributions
ies+0.087
ins+0.087
cer+0.084
-+0.082
ie+0.076
Neg logit contributions
 Curious-0.080
Listen-0.079
Well-0.079
 Everything-0.079
Um-0.078
Token rh
Token ID9529
Feature activation-3.67e+01

Pos logit contributions
ins+0.098
ies+0.095
-+0.092
cer+0.091
ie+0.086
Neg logit contributions
 Curious-0.096
Listen-0.096
Well-0.096
Um-0.094
Over-0.092
Tokenin
Token ID259
Feature activation-2.48e+01

Pos logit contributions
ies+0.215
et+0.195
ishing+0.189
er+0.186
cer+0.186
Neg logit contributions
Well-0.288
Over-0.286
Two-0.279
Listen-0.273
Um-0.267
Tokenoc
Token ID420
Feature activation-5.13e+01

Pos logit contributions
ins+0.146
ies+0.139
cer+0.127
ishing+0.124
ates+0.115
Neg logit contributions
Well-0.222
 Curious-0.216
Over-0.210
Um-0.209
Listen-0.209
Tokeneros
Token ID27498
Feature activation-2.44e+01

Pos logit contributions
ies+0.332
ins+0.303
kin+0.293
cer+0.274
ie+0.263
Neg logit contributions
Well-0.413
Um-0.406
 Somebody-0.391
Listen-0.387
 Everything-0.385
Token.
Token ID13
Feature activation-4.15e+00

Pos logit contributions
ins+0.137
ies+0.132
cer+0.130
-+0.120
ishing+0.114
Neg logit contributions
 Curious-0.218
Well-0.215
Listen-0.212
 Everything-0.209
Over-0.208
Token They
Token ID1119
Feature activation-8.20e+00

Pos logit contributions
."+0.029
cer+0.028
-+0.028
ins+0.028
.+0.028
Neg logit contributions
Listen-0.029
Um-0.027
Over-0.027
Days-0.027
Well-0.027
Token are
Token ID389
Feature activation-1.09e+01

Pos logit contributions
ies+0.065
-+0.062
cer+0.062
ins+0.062
ie+0.056
Neg logit contributions
Well-0.060
Listen-0.059
 Curious-0.058
Um-0.058
 Everything-0.055
Token gentle
Token ID10296
Feature activation-1.77e+01

Pos logit contributions
ins+0.093
cer+0.093
ies+0.093
-+0.091
ie+0.085
Neg logit contributions
 Curious-0.069
Well-0.068
Um-0.068
Listen-0.067
 Everything-0.065
Token them
Token ID606
Feature activation-1.92e+01

Pos logit contributions
cer+0.080
ies+0.078
ins+0.077
-+0.072
ishing+0.069
Neg logit contributions
Well-0.080
Listen-0.080
Um-0.079
 Curious-0.077
 Everything-0.076
Token and
Token ID290
Feature activation-9.78e+00

Pos logit contributions
ins+0.105
cer+0.102
ies+0.094
-+0.090
ishing+0.083
Neg logit contributions
Well-0.168
 Curious-0.167
 Everywhere-0.165
On-0.165
Listen-0.164
Token never
Token ID1239
Feature activation-1.18e+01

Pos logit contributions
ins+0.084
cer+0.083
ies+0.081
-+0.081
ates+0.076
Neg logit contributions
Well-0.063
Um-0.062
Listen-0.062
 Curious-0.060
 Everything-0.059
Token threw
Token ID9617
Feature activation-1.99e+01

Pos logit contributions
ins+0.109
ies+0.108
-+0.106
cer+0.105
ates+0.097
Neg logit contributions
Well-0.069
 Curious-0.068
Um-0.068
Listen-0.068
 Everything-0.064
Token a
Token ID257
Feature activation-1.67e+01

Pos logit contributions
ins+0.152
cer+0.150
ies+0.148
-+0.135
ates+0.133
Neg logit contributions
Listen-0.138
Well-0.137
 Everything-0.135
Um-0.135
 Curious-0.134
Token tant
Token ID24246
Feature activation-5.09e+01

Pos logit contributions
ins+0.124
ies+0.120
cer+0.120
ates+0.119
ishing+0.110
Neg logit contributions
 Curious-0.119
Listen-0.115
Well-0.113
Um-0.113
Over-0.113
Tokenrum
Token ID6582
Feature activation-2.29e+01

Pos logit contributions
cer+0.353
 fort+0.334
 tent+0.332
 room+0.329
kin+0.320
Neg logit contributions
Um-0.318
Fr-0.311
Listen-0.308
Well-0.300
oral-0.272
Token again
Token ID757
Feature activation-1.67e+01

Pos logit contributions
ins+0.106
ies+0.099
cer+0.094
ater+0.092
ishing+0.091
Neg logit contributions
Well-0.232
 Everything-0.221
Listen-0.220
That-0.216
 �-0.216
Token.
Token ID13
Feature activation-3.20e+00

Pos logit contributions
ins+0.077
cer+0.068
ies+0.066
ates+0.062
-+0.057
Neg logit contributions
Well-0.170
 Curious-0.167
Um-0.166
Listen-0.164
 Unfortunately-0.162
Token<|endoftext|>
Token ID50256
Feature activation-1.07e+01

Pos logit contributions
ins+0.027
ies+0.027
-+0.027
cer+0.026
er+0.025
Neg logit contributions
Listen-0.019
Over-0.018
Um-0.018
Well-0.018
 Curious-0.017
TokenOnce
Token ID7454
Feature activation-7.70e+00

Pos logit contributions
ies+0.134
ins+0.128
cer+0.124
-+0.123
ishing+0.122
Neg logit contributions
Well-0.037
 Curious-0.036
Listen-0.034
 Everything-0.034
 Everywhere-0.033
Token't
Token ID470
Feature activation-2.02e+01

Pos logit contributions
Well+0.147
 Curious+0.146
Listen+0.145
Um+0.145
 Everything+0.140
Neg logit contributions
ies-0.063
ins-0.062
--0.061
cer-0.057
ie-0.049
Token know
Token ID760
Feature activation-1.86e+01

Pos logit contributions
ins+0.194
cer+0.184
ies+0.182
-+0.179
ates+0.177
Neg logit contributions
 Curious-0.111
Um-0.110
Listen-0.110
Well-0.109
 Everything-0.104
Token the
Token ID262
Feature activation-1.95e+01

Pos logit contributions
cer+0.138
ins+0.137
ies+0.132
ishing+0.126
-+0.125
Neg logit contributions
 Everything-0.136
Well-0.133
 Curious-0.133
Um-0.133
Listen-0.132
Token rh
Token ID9529
Feature activation-4.80e+01

Pos logit contributions
ins+0.152
ies+0.150
cer+0.145
-+0.143
ishing+0.131
Neg logit contributions
Well-0.143
 Curious-0.143
Um-0.141
Listen-0.141
 Everything-0.135
Tokenin
Token ID259
Feature activation-3.24e+01

Pos logit contributions
ies+0.295
et+0.254
ins+0.246
ishing+0.244
er+0.244
Neg logit contributions
Well-0.384
Um-0.363
Over-0.363
Two-0.359
Listen-0.350
Tokenoc
Token ID420
Feature activation-6.12e+01

Pos logit contributions
ies+0.156
ins+0.155
cer+0.142
-+0.136
ishing+0.122
Neg logit contributions
Well-0.336
 Curious-0.331
Um-0.329
Listen-0.324
 Everything-0.316
Tokeneros
Token ID27498
Feature activation-3.02e+01

Pos logit contributions
ies+0.393
kin+0.366
ins+0.344
cer+0.313
kins+0.307
Neg logit contributions
Well-0.517
Um-0.504
 Somebody-0.472
 Unfortunately-0.468
 Everything-0.466
Token was
Token ID373
Feature activation-1.63e+01

Pos logit contributions
ies+0.240
ins+0.236
-+0.232
cer+0.230
ie+0.209
Neg logit contributions
Well-0.216
 Curious-0.213
Listen-0.211
Um-0.211
 Everything-0.201
Token wild
Token ID4295
Feature activation-2.32e+01

Pos logit contributions
ies+0.135
ins+0.134
cer+0.129
-+0.124
ie+0.116
Neg logit contributions
 Curious-0.111
Um-0.110
Well-0.109
Listen-0.109
 Everything-0.105
Token.
Token ID13
Feature activation-8.68e+00

Pos logit contributions
ins+0.149
kin+0.147
ates+0.135
kins+0.132
cer+0.128
Neg logit contributions
 Curious-0.193
 Everything-0.187
 �-0.183
Well-0.181
 Everywhere-0.180
Token Can
Token ID1680
Feature activation-1.12e+01

Pos logit contributions
-+0.073
ins+0.072
cer+0.071
ies+0.070
er+0.070
Neg logit contributions
Listen-0.051
Over-0.048
Well-0.046
Um-0.045
Hmm-0.042