TOP ACTIVATIONS
MAX = 0.271

go home and they said goodbye by giving each other a
hug the people and say goodbye . They thank the pilot
on and they both said goodbye to the puppy .
hugged each other and said goodbye . L ily
that day , Lily said goodbye to the pond and its
thanked the man and said goodbye , and then began his
a pat . They said goodbye to each other and promised
her green purse and said goodbye to Tim my . She
The bird said goodbye and flew away . Ellie
ched , the mouse bid goodbye to the huge ant and
garden . Lily is the seeker and Ben is the h
go home . She said goodbye to the old man and
go home . She said goodbye to the shadow and thanked
Tom said . They waved goodbye to the friendly man .
inside . We can say goodbye and thank him for being
is okay . They say goodbye to the dog and the
friends were sad to say goodbye to the plane . But
their fun , Sarah said goodbye to the L azy Cart
structure was . They said goodbye and went home , happy
all done , Anne said goodbye to her new friend and

MIN ACTIVATIONS
MIN = -0.145

Un lock the door and let
than Uncle ! Un cle was so happy he
down . Un cle looked at her and
my dog ?" Un cle Ben looked at Lily
the coat . Un cle Mike said to himself
not nice ." Un cle Tom laughed . He
â Friend 2 :
the backyard . Un cle sprayed them with the
the rocks , adding and subtract ing . They spent so
The mouse was happy to oblig e and quickly ran off
was counting , adding and subtract ing like a pro .
little boy said , " Un cle , are you stealing
even did some adding and subtract ing with the flowers .
didn 't know what " gas " was , but it
before . " Un ic orn is a rare
. Tim said , " Un cle Jack , let 's
the screen . 1 , 2 , 3 , 4 !
. He counted 1 , 2 , 3 , 4 !
and Dad was happy to oblig e . He went out
' Aunt ies and un cles can give me fake pills

INTERVAL 0.167 - 0.271
CONTAINS 0.023%

The girl was too pet rified to move and just stared
crazy creatures . Tim said goodbye to Mel on and thanked
a pat . They said goodbye to each other and promised
garden . Lily is the seeker and Ben is the h
's floor , she said goodbye to the man and went

INTERVAL 0.063 - 0.167
CONTAINS 3.236%

laughed so hard that they fell on the grass .
scared , but then she realized it was just another boat
, there was a bear named Ted . He loved to
they heard a loud noise from the kitchen . It was
there was a little boy named Tim my . Tim my

INTERVAL -0.041 - 0.063
CONTAINS 88.775%

He did not know he was in trouble . He only
their walk was over , the little girl and the old
to everyone . He saw Sara playing with her dolls in
not enter ". She opened it and turned on the light
you want to see it ?" Sam did not

INTERVAL -0.145 - -0.041
CONTAINS 7.966%

big bird flying really high in the sky . She pointed
One day , however , the metal bird got
backpack that she carried to school every day . One day
made a mistake . He said ,
the piano . We can both play it and be happy
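
NOTE ON THE INTERVALS
The four INTERVAL ranges above are consistent with splitting the span from MIN = -0.145 to MAX = 0.271 into four equal-width bins of width 0.104. The sketch below reproduces the bin edges under that assumption. The function name interval_edges and the default of four bins are illustrative only; they are not taken from whatever tool produced this dump.

```python
def interval_edges(lo: float, hi: float, n_bins: int = 4) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX as reported at the top of the dump.
edges = interval_edges(-0.145, 0.271)

# Pair consecutive edges into (low, high) ranges, highest range first as in the dump.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]

for low, high in intervals:
    print(f"INTERVAL {low:.3f} - {high:.3f}")
```

If the assumption holds, the printed ranges should match the four INTERVAL headers. The CONTAINS percentages (0.023, 3.236, 88.775, 7.966) sum to 100, so the bins appear to cover every sampled activation.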
Token go
Token ID467
Feature activation+1.27e-01

Pos logit contributions
pan+0.129
 coward+0.125
 Woo+0.122
otion+0.120
rier+0.117
Neg logit contributions
pa-0.108
 own-0.092
 work-0.091
 search-0.091
 adventures-0.087
Token home
Token ID1363
Feature activation+4.11e-02

Pos logit contributions
pan+0.253
 Woo+0.246
rol+0.233
rier+0.225
otion+0.224
Neg logit contributions
pa-0.169
 adventures-0.137
 work-0.133
 family-0.129
 each-0.125
Token and
Token ID290
Feature activation+6.76e-02

Pos logit contributions
pan+0.089
 Woo+0.085
rier+0.083
rol+0.081
otion+0.078
Neg logit contributions
pa-0.046
 adventures-0.035
 work-0.035
 search-0.033
 ideas-0.031
Token they
Token ID484
Feature activation+8.44e-02

Pos logit contributions
pan+0.134
 Woo+0.129
rol+0.123
 coward+0.120
rified+0.120
Neg logit contributions
pa-0.085
 adventures-0.069
 work-0.068
 each-0.063
 search-0.061
Token said
Token ID531
Feature activation+2.95e-02

Pos logit contributions
pan+0.170
 Woo+0.157
rol+0.157
rified+0.155
 coward+0.155
Neg logit contributions
pa-0.106
 adventures-0.090
 work-0.084
 search-0.077
cles-0.076
Token goodbye
Token ID24829
Feature activation+1.86e-01

Pos logit contributions
pan+0.042
rol+0.040
 Woo+0.039
rier+0.039
rified+0.038
Neg logit contributions
pa-0.055
 work-0.051
 adventures-0.049
 each-0.048
 family-0.048
Token by
Token ID416
Feature activation+2.06e-02

Pos logit contributions
pan+0.370
 Woo+0.359
rier+0.353
rol+0.327
rified+0.324
Neg logit contributions
pa-0.227
 work-0.182
 adventures-0.180
 search-0.174
 un-0.160
Token giving
Token ID3501
Feature activation+3.43e-02

Pos logit contributions
pan+0.038
 Woo+0.036
otion+0.035
rier+0.034
rol+0.034
Neg logit contributions
pa-0.029
 work-0.025
 adventures-0.025
 search-0.023
 own-0.023
Token each
Token ID1123
Feature activation-5.10e-02

Pos logit contributions
 Woo+0.060
rified+0.056
rier+0.054
pan+0.053
arling+0.050
Neg logit contributions
pa-0.043
 work-0.042
 search-0.041
 adventures-0.040
 un-0.038
Token other
Token ID584
Feature activation+4.11e-02

Pos logit contributions
pa+0.062
 work+0.053
 adventures+0.051
 search+0.050
 each+0.047
Neg logit contributions
pan-0.104
 Woo-0.098
rier-0.098
rified-0.096
rol-0.096
Token a
Token ID257
Feature activation+2.65e-02

Pos logit contributions
pan+0.075
 Woo+0.074
rified+0.071
rol+0.070
rier+0.069
Neg logit contributions
pa-0.051
 work-0.044
 search-0.043
 adventures-0.040
 asked-0.039
Token hug
Token ID16225
Feature activation+1.66e-01

Pos logit contributions
pan+0.243
 Woo+0.225
rol+0.222
rified+0.221
 coward+0.221
Neg logit contributions
pa-0.153
 adventures-0.123
 work-0.118
 family-0.108
 search-0.108
Token the
Token ID262
Feature activation+8.35e-02

Pos logit contributions
pan+0.345
 Woo+0.344
rier+0.317
rified+0.317
rol+0.310
Neg logit contributions
pa-0.188
 work-0.167
 adventures-0.163
 search-0.162
 own-0.148
Token people
Token ID661
Feature activation+3.34e-02

Pos logit contributions
pan+0.144
rier+0.135
 Woo+0.131
 coward+0.127
rified+0.127
Neg logit contributions
pa-0.132
 work-0.113
 adventures-0.112
 search-0.109
 each-0.107
Token and
Token ID290
Feature activation+5.32e-02

Pos logit contributions
pan+0.072
rier+0.069
rol+0.068
 Woo+0.067
otion+0.067
Neg logit contributions
pa-0.037
 adventures-0.031
 work-0.031
 search-0.029
 family-0.026
Token say
Token ID910
Feature activation+2.79e-02

Pos logit contributions
pan+0.101
 Woo+0.098
 coward+0.090
 Fif+0.087
rified+0.087
Neg logit contributions
pa-0.071
 work-0.058
 adventures-0.058
 search-0.055
 own-0.053
Token goodbye
Token ID24829
Feature activation+1.82e-01

Pos logit contributions
pan+0.049
rol+0.046
rier+0.045
 Woo+0.045
rified+0.044
Neg logit contributions
pa-0.044
 work-0.040
 adventures-0.038
 each-0.037
 family-0.037
Token.
Token ID13
Feature activation+4.00e-02

Pos logit contributions
pan+0.382
rier+0.376
 Woo+0.373
rol+0.345
rified+0.340
Neg logit contributions
pa-0.210
 work-0.166
 adventures-0.163
 search-0.158
 un-0.138
Token They
Token ID1119
Feature activation+3.82e-02

Pos logit contributions
pan+0.083
rier+0.078
rol+0.077
otion+0.076
rified+0.076
Neg logit contributions
pa-0.050
 work-0.044
 adventures-0.042
 search-0.040
 family-0.040
Token thank
Token ID5875
Feature activation+6.75e-02

Pos logit contributions
pan+0.074
 Woo+0.071
rol+0.068
rified+0.068
 coward+0.068
Neg logit contributions
pa-0.050
 adventures-0.041
 work-0.040
 search-0.038
 family-0.037
Token the
Token ID262
Feature activation+2.50e-02

Pos logit contributions
 Woo+0.113
otion+0.100
 Fif+0.099
 Rib+0.097
 Lia+0.097
Neg logit contributions
pa-0.099
 search-0.092
 adventures-0.087
 work-0.083
 un-0.080
Token pilot
Token ID8022
Feature activation+5.26e-02

Pos logit contributions
pan+0.050
rier+0.048
otion+0.047
 Woo+0.047
 coward+0.046
Neg logit contributions
pa-0.032
 adventures-0.026
 search-0.026
 work-0.026
 each-0.024
Token on
Token ID319
Feature activation+7.23e-02

Pos logit contributions
pan+0.275
 Woo+0.262
rol+0.259
rier+0.254
otion+0.246
Neg logit contributions
pa-0.162
 work-0.130
 adventures-0.127
 search-0.120
 each-0.113
Token and
Token ID290
Feature activation+6.82e-02

Pos logit contributions
pan+0.153
 Woo+0.139
rier+0.136
otion+0.132
rol+0.131
Neg logit contributions
pa-0.087
 work-0.070
 adventures-0.069
 search-0.064
 family-0.063
Token they
Token ID484
Feature activation+8.39e-02

Pos logit contributions
pan+0.128
 Woo+0.119
rol+0.115
rier+0.114
rified+0.113
Neg logit contributions
pa-0.095
 work-0.080
 adventures-0.079
 search-0.075
 family-0.073
Token both
Token ID1111
Feature activation-1.29e-02

Pos logit contributions
pan+0.174
rol+0.161
rified+0.157
 Woo+0.156
 coward+0.155
Neg logit contributions
pa-0.104
 adventures-0.085
 work-0.083
 search-0.076
 packages-0.073
Token said
Token ID531
Feature activation+2.00e-02

Pos logit contributions
pa+0.016
 adventures+0.013
 work+0.013
 search+0.012
 packages+0.012
Neg logit contributions
pan-0.026
rol-0.024
 coward-0.023
rified-0.023
otion-0.023
Token goodbye
Token ID24829
Feature activation+1.81e-01

Pos logit contributions
pan+0.035
rol+0.034
 Woo+0.033
rified+0.033
rier+0.032
Neg logit contributions
pa-0.031
 work-0.028
 adventures-0.027
 each-0.027
 family-0.026
Token to
Token ID284
Feature activation+3.29e-02

Pos logit contributions
pan+0.378
rier+0.364
 Woo+0.360
rol+0.340
rified+0.336
Neg logit contributions
pa-0.211
 work-0.167
 adventures-0.164
 search-0.159
 un-0.144
Token the
Token ID262
Feature activation+3.25e-02

Pos logit contributions
 Woo+0.057
pan+0.053
 Fif+0.050
 coward+0.049
otion+0.049
Neg logit contributions
pa-0.045
 adventures-0.041
 work-0.039
 search-0.039
urious-0.038
Token puppy
Token ID26188
Feature activation+4.82e-02

Pos logit contributions
pan+0.057
rier+0.054
 Woo+0.053
 coward+0.051
rol+0.051
Neg logit contributions
pa-0.050
 work-0.042
 adventures-0.041
 search-0.041
 each-0.040
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
pan+0.111
 Woo+0.105
rier+0.103
rol+0.103
rified+0.102
Neg logit contributions
pa-0.048
 work-0.037
 adventures-0.036
 search-0.033
 family-0.031
Token \n
Token ID198
Feature activation+6.48e-03

Pos logit contributions
pan+0.053
rier+0.049
rol+0.049
 Woo+0.049
otion+0.048
Neg logit contributions
pa-0.034
 work-0.030
 adventures-0.029
 search-0.028
 family-0.027
Token hugged
Token ID41130
Feature activation+1.71e-01

Pos logit contributions
pan+0.084
 Woo+0.078
rier+0.076
otion+0.076
 coward+0.076
Neg logit contributions
pa-0.071
 adventures-0.060
 family-0.054
 work-0.054
 search-0.054
Token each
Token ID1123
Feature activation+1.12e-02

Pos logit contributions
 Woo+0.322
pan+0.310
rier+0.278
rified+0.277
otion+0.274
Neg logit contributions
pa-0.238
 adventures-0.222
 work-0.217
 search-0.216
 own-0.195
Token other
Token ID584
Feature activation+7.58e-02

Pos logit contributions
pan+0.023
 Woo+0.021
rier+0.021
rified+0.021
rol+0.021
Neg logit contributions
pa-0.014
 adventures-0.012
 work-0.012
 search-0.012
 un-0.011
Token and
Token ID290
Feature activation+5.91e-02

Pos logit contributions
pan+0.158
 Woo+0.151
rier+0.151
rified+0.144
rol+0.144
Neg logit contributions
pa-0.086
 adventures-0.066
 work-0.063
 search-0.062
Daddy-0.058
Token said
Token ID531
Feature activation+2.64e-02

Pos logit contributions
 Woo+0.121
pan+0.121
rol+0.110
rified+0.109
rier+0.109
Neg logit contributions
pa-0.070
 work-0.059
 adventures-0.057
 search-0.055
 own-0.051
Token goodbye
Token ID24829
Feature activation+1.80e-01

Pos logit contributions
pan+0.050
rol+0.048
rified+0.047
rier+0.047
 Woo+0.047
Neg logit contributions
pa-0.036
 work-0.033
 adventures-0.031
 each-0.030
 family-0.030
Token.
Token ID13
Feature activation+4.23e-02

Pos logit contributions
pan+0.385
 Woo+0.375
rier+0.374
rol+0.347
rified+0.341
Neg logit contributions
pa-0.203
 work-0.163
 adventures-0.163
 search-0.153
 un-0.137
Token \n
Token ID198
Feature activation+1.37e-02

Pos logit contributions
pan+0.091
rier+0.085
rol+0.085
 Woo+0.084
otion+0.084
Neg logit contributions
pa-0.049
 work-0.042
 adventures-0.040
 search-0.039
 family-0.038
Token \n
Token ID198
Feature activation+6.69e-03

Pos logit contributions
 Woo+0.029
pan+0.027
rol+0.027
otion+0.027
Flo+0.026
Neg logit contributions
pa-0.016
 work-0.015
 adventures-0.013
 search-0.012
 ideas-0.012
TokenL
Token ID43
Feature activation+7.08e-02

Pos logit contributions
pan+0.011
 Woo+0.011
rol+0.010
otion+0.010
rified+0.010
Neg logit contributions
pa-0.011
 work-0.010
 adventures-0.010
 search-0.009
 each-0.009
Tokenily
Token ID813
Feature activation+2.68e-02

Pos logit contributions
pan+0.074
 Woo+0.065
rier+0.064
rol+0.063
rified+0.062
Neg logit contributions
pa-0.165
 work-0.152
 adventures-0.146
 each-0.142
 family-0.142
Token that
Token ID326
Feature activation+6.24e-02

Pos logit contributions
pan+0.034
otion+0.032
 Woo+0.032
rol+0.030
rified+0.029
Neg logit contributions
pa-0.029
 work-0.025
 adventures-0.025
 search-0.024
 own-0.024
Token day
Token ID1110
Feature activation+1.23e-02

Pos logit contributions
pan+0.139
 Woo+0.128
otion+0.127
rol+0.124
rier+0.122
Neg logit contributions
pa-0.064
 work-0.052
 adventures-0.050
 each-0.049
 search-0.047
Token,
Token ID11
Feature activation+6.61e-02

Pos logit contributions
pan+0.026
 Woo+0.025
rol+0.025
rier+0.024
otion+0.024
Neg logit contributions
pa-0.015
 work-0.013
 adventures-0.012
 search-0.011
 each-0.011
Token Lily
Token ID20037
Feature activation+8.57e-02

Pos logit contributions
pan+0.101
 Woo+0.097
rol+0.092
rier+0.091
otion+0.090
Neg logit contributions
pa-0.116
 work-0.104
 adventures-0.100
 search-0.097
 each-0.094
Token said
Token ID531
Feature activation+1.78e-02

Pos logit contributions
pan+0.205
 Woo+0.197
rol+0.192
rified+0.191
otion+0.191
Neg logit contributions
pa-0.074
 work-0.061
 adventures-0.058
 search-0.055
 each-0.051
Token goodbye
Token ID24829
Feature activation+1.78e-01

Pos logit contributions
rol+0.044
pan+0.043
rified+0.042
ainted+0.041
rier+0.041
Neg logit contributions
 work-0.014
pa-0.014
 family-0.012
 each-0.012
 dad-0.012
Token to
Token ID284
Feature activation+3.01e-02

Pos logit contributions
pan+0.339
 Woo+0.330
rier+0.320
rol+0.300
rified+0.294
Neg logit contributions
pa-0.243
 adventures-0.201
 work-0.197
 search-0.190
 un-0.175
Token the
Token ID262
Feature activation+2.65e-02

Pos logit contributions
 Woo+0.051
pan+0.042
otion+0.040
 Fif+0.039
 downward+0.039
Neg logit contributions
pa-0.046
 adventures-0.043
 search-0.043
cles-0.041
 work-0.040
Token pond
Token ID16723
Feature activation+4.16e-02

Pos logit contributions
pan+0.048
rier+0.045
 Woo+0.045
 coward+0.043
otion+0.043
Neg logit contributions
pa-0.039
 work-0.033
 adventures-0.033
 each-0.032
 search-0.032
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
pan+0.101
 Woo+0.096
rier+0.096
rol+0.096
otion+0.094
Neg logit contributions
pa-0.036
 work-0.030
 adventures-0.028
 search-0.025
 family-0.024
Token its
Token ID663
Feature activation+7.65e-02

Pos logit contributions
pan+0.043
 Woo+0.041
rol+0.041
rier+0.040
otion+0.040
Neg logit contributions
pa-0.017
 work-0.014
 adventures-0.014
 search-0.013
 family-0.012
Token thanked
Token ID26280
Feature activation+1.06e-01

Pos logit contributions
 Woo+0.049
pan+0.049
 coward+0.046
rier+0.046
rol+0.044
Neg logit contributions
pa-0.034
 work-0.025
 adventures-0.025
 each-0.024
 ideas-0.023
Token the
Token ID262
Feature activation+8.32e-02

Pos logit contributions
 Woo+0.197
pan+0.196
rier+0.183
otion+0.176
rol+0.176
Neg logit contributions
pa-0.130
 adventures-0.123
 work-0.122
 search-0.119
 own-0.116
Token man
Token ID582
Feature activation+3.46e-02

Pos logit contributions
pan+0.142
rier+0.137
 Woo+0.131
rified+0.127
 coward+0.126
Neg logit contributions
pa-0.130
 adventures-0.109
 work-0.108
 search-0.107
 each-0.106
Token and
Token ID290
Feature activation+5.59e-02

Pos logit contributions
pan+0.069
rier+0.066
rol+0.065
 Woo+0.064
rified+0.064
Neg logit contributions
pa-0.040
 adventures-0.036
 work-0.034
 search-0.031
 family-0.030
Token said
Token ID531
Feature activation+2.21e-02

Pos logit contributions
pan+0.110
 Woo+0.109
rol+0.101
rified+0.099
 coward+0.097
Neg logit contributions
pa-0.071
 adventures-0.061
 work-0.060
 own-0.056
 search-0.056
Token goodbye
Token ID24829
Feature activation+1.76e-01

Pos logit contributions
pan+0.040
 Woo+0.037
rier+0.037
rol+0.037
otion+0.036
Neg logit contributions
pa-0.034
 work-0.031
 adventures-0.029
 search-0.028
 family-0.028
Token,
Token ID11
Feature activation+5.54e-02

Pos logit contributions
pan+0.363
rier+0.361
 Woo+0.354
rol+0.330
rified+0.329
Neg logit contributions
pa-0.207
 work-0.168
 adventures-0.164
 search-0.157
 own-0.139
Token and
Token ID290
Feature activation+2.88e-02

Pos logit contributions
pan+0.104
 Woo+0.101
rol+0.096
rier+0.095
otion+0.092
Neg logit contributions
pa-0.079
 work-0.068
 adventures-0.067
 search-0.061
 own-0.059
Token then
Token ID788
Feature activation-3.24e-02

Pos logit contributions
pan+0.053
 Woo+0.052
rol+0.049
rified+0.049
rier+0.048
Neg logit contributions
pa-0.040
 work-0.034
 adventures-0.033
 search-0.031
 own-0.031
Token began
Token ID2540
Feature activation+2.86e-02

Pos logit contributions
pa+0.048
 work+0.041
 adventures+0.041
 search+0.039
 own+0.038
Neg logit contributions
pan-0.059
 Woo-0.057
rol-0.054
rier-0.053
rified-0.052
Token his
Token ID465
Feature activation+6.76e-03

Pos logit contributions
pan+0.053
 Woo+0.050
rol+0.048
rier+0.047
 downward+0.045
Neg logit contributions
pa-0.040
 work-0.032
 adventures-0.032
 own-0.030
 search-0.030
Token a
Token ID257
Feature activation+1.20e-01

Pos logit contributions
pan+0.047
rified+0.046
 Woo+0.044
 coward+0.044
ainted+0.043
Neg logit contributions
pa-0.034
 work-0.028
 adventures-0.027
 search-0.025
 day-0.025
Token pat
Token ID1458
Feature activation+1.11e-01

Pos logit contributions
pan+0.233
 Woo+0.216
rier+0.215
 coward+0.212
rol+0.208
Neg logit contributions
pa-0.168
 work-0.138
 each-0.133
 adventures-0.132
 search-0.123
Token.
Token ID13
Feature activation+5.25e-02

Pos logit contributions
pan+0.224
 Woo+0.210
rified+0.206
rol+0.206
rier+0.206
Neg logit contributions
pa-0.145
 adventures-0.132
 work-0.128
 search-0.122
 family-0.118
Token They
Token ID1119
Feature activation+6.19e-02

Pos logit contributions
pan+0.095
 Woo+0.090
rol+0.089
otion+0.087
rier+0.087
Neg logit contributions
pa-0.079
 work-0.072
 adventures-0.068
 search-0.064
 each-0.063
Token said
Token ID531
Feature activation+2.16e-02

Pos logit contributions
pan+0.124
 Woo+0.115
rol+0.113
rified+0.112
otion+0.108
Neg logit contributions
pa-0.080
 adventures-0.068
 work-0.063
 family-0.061
 search-0.060
Token goodbye
Token ID24829
Feature activation+1.76e-01

Pos logit contributions
pan+0.042
rol+0.041
rified+0.039
otion+0.039
rier+0.039
Neg logit contributions
pa-0.029
 work-0.027
 each-0.026
 adventures-0.025
 family-0.025
Token to
Token ID284
Feature activation+3.03e-02

Pos logit contributions
pan+0.337
 Woo+0.326
rier+0.325
rol+0.306
rified+0.299
Neg logit contributions
pa-0.233
 work-0.193
 adventures-0.192
 search-0.181
 family-0.166
Token each
Token ID1123
Feature activation-5.34e-02

Pos logit contributions
 Woo+0.054
pan+0.045
 Fif+0.044
 Rib+0.042
otion+0.042
Neg logit contributions
pa-0.045
 adventures-0.040
 search-0.038
 work-0.038
cles-0.038
Token other
Token ID584
Feature activation+4.07e-02

Pos logit contributions
pa+0.060
 work+0.051
 adventures+0.048
 search+0.046
 each+0.044
Neg logit contributions
pan-0.118
 Woo-0.111
rol-0.111
rier-0.111
otion-0.110
Token and
Token ID290
Feature activation+2.25e-02

Pos logit contributions
pan+0.085
 Woo+0.078
rier+0.077
rified+0.075
otion+0.073
Neg logit contributions
pa-0.045
 search-0.037
 adventures-0.037
 work-0.036
 own-0.033
Token promised
Token ID8072
Feature activation+5.15e-02

Pos logit contributions
pan+0.046
 Woo+0.045
rol+0.042
rified+0.042
 coward+0.042
Neg logit contributions
pa-0.027
 adventures-0.022
 work-0.022
 search-0.020
 own-0.019
Token her
Token ID607
Feature activation+7.88e-02

Pos logit contributions
pan+0.136
otion+0.119
rol+0.115
 Woo+0.114
 downward+0.113
Neg logit contributions
pa-0.110
 work-0.087
cles-0.083
 adventures-0.083
 family-0.082
Token green
Token ID4077
Feature activation+2.88e-02

Pos logit contributions
pan+0.134
rier+0.119
 Woo+0.118
rified+0.115
rol+0.114
Neg logit contributions
pa-0.114
 work-0.100
cles-0.091
 each-0.090
 adventures-0.088
Token purse
Token ID27517
Feature activation+7.31e-02

Pos logit contributions
pan+0.049
 Woo+0.043
rier+0.042
rol+0.041
rified+0.040
Neg logit contributions
pa-0.047
 work-0.042
 adventures-0.039
 each-0.037
 search-0.037
Token and
Token ID290
Feature activation+4.47e-02

Pos logit contributions
pan+0.171
rol+0.159
rier+0.159
rified+0.158
 Woo+0.157
Neg logit contributions
pa-0.072
 work-0.063
 adventures-0.058
 search-0.055
 family-0.050
Token said
Token ID531
Feature activation+1.73e-02

Pos logit contributions
pan+0.082
 Woo+0.078
rol+0.073
rier+0.073
rified+0.071
Neg logit contributions
pa-0.065
 work-0.052
 adventures-0.051
 search-0.048
 family-0.047
Token goodbye
Token ID24829
Feature activation+1.75e-01

Pos logit contributions
pan+0.040
rol+0.039
rified+0.038
 Woo+0.038
rier+0.038
Neg logit contributions
pa-0.016
 work-0.014
 each-0.013
 adventures-0.013
 family-0.012
Token to
Token ID284
Feature activation+2.90e-02

Pos logit contributions
pan+0.358
rier+0.350
 Woo+0.346
rol+0.329
rified+0.321
Neg logit contributions
pa-0.210
 work-0.172
 adventures-0.172
 search-0.159
 family-0.145
Token Tim
Token ID5045
Feature activation+2.11e-02

Pos logit contributions
 Woo+0.049
pan+0.046
otion+0.044
 Fif+0.043
 Lia+0.042
Neg logit contributions
pa-0.042
 adventures-0.039
 work-0.037
 search-0.036
cles-0.035
Tokenmy
Token ID1820
Feature activation-5.10e-02

Pos logit contributions
pan+0.045
 Woo+0.045
rol+0.044
rified+0.043
rier+0.043
Neg logit contributions
 adventures-0.018
pa-0.018
 work-0.015
 search-0.015
 family-0.014
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
pa+0.059
 adventures+0.053
Daddy+0.047
Dad+0.045
 family+0.045
Neg logit contributions
pan-0.098
rier-0.094
 Woo-0.093
otion-0.089
rol-0.087
Token She
Token ID1375
Feature activation+3.25e-02

Pos logit contributions
pan+0.037
 Woo+0.035
rol+0.035
rier+0.035
otion+0.034
Neg logit contributions
pa-0.026
 work-0.023
 adventures-0.022
 search-0.021
 each-0.020
Token \n
Token ID198
Feature activation+6.10e-02

Pos logit contributions
 coward+0.079
 Fif+0.079
pan+0.078
rier+0.077
 Woo+0.076
Neg logit contributions
pa-0.077
Daddy-0.064
 work-0.059
Dad-0.055
 journals-0.054
Token \n
Token ID198
Feature activation+4.16e-02

Pos logit contributions
 Woo+0.118
pan+0.118
rol+0.115
otion+0.112
 coward+0.109
Neg logit contributions
pa-0.079
 work-0.072
 adventures-0.065
 each-0.060
 search-0.059
TokenThe
Token ID464
Feature activation+4.31e-02

Pos logit contributions
pan+0.071
 Woo+0.069
rol+0.067
otion+0.065
rified+0.064
Neg logit contributions
pa-0.065
 work-0.060
 adventures-0.056
 search-0.053
 each-0.053
Token bird
Token ID6512
Feature activation+9.28e-02

Pos logit contributions
pan+0.075
rier+0.073
 Woo+0.070
 coward+0.069
rified+0.068
Neg logit contributions
pa-0.065
 adventures-0.055
 work-0.054
 search-0.053
 each-0.052
Token said
Token ID531
Feature activation+1.31e-02

Pos logit contributions
pan+0.176
rol+0.166
rified+0.164
otion+0.160
rier+0.160
Neg logit contributions
pa-0.127
 work-0.110
 adventures-0.107
 family-0.101
 search-0.101
Token goodbye
Token ID24829
Feature activation+1.74e-01

Pos logit contributions
rol+0.032
pan+0.031
otion+0.030
rified+0.030
 Woo+0.030
Neg logit contributions
pa-0.010
 each-0.009
 work-0.009
 family-0.008
 dad-0.008
Token and
Token ID290
Feature activation+2.50e-02

Pos logit contributions
pan+0.362
rier+0.350
 Woo+0.344
rol+0.325
rified+0.324
Neg logit contributions
pa-0.215
 work-0.178
 adventures-0.175
 search-0.163
 family-0.152
Token flew
Token ID13112
Feature activation+8.62e-02

Pos logit contributions
pan+0.046
 Woo+0.044
rol+0.042
rier+0.042
rified+0.042
Neg logit contributions
pa-0.036
 work-0.032
 adventures-0.031
 search-0.029
 family-0.029
Token away
Token ID1497
Feature activation+7.87e-02

Pos logit contributions
pan+0.155
rol+0.144
 Woo+0.143
otion+0.143
rier+0.143
Neg logit contributions
pa-0.131
 work-0.119
 adventures-0.113
 search-0.110
 each-0.109
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
pan+0.185
rier+0.178
 Woo+0.173
rol+0.171
rified+0.168
Neg logit contributions
pa-0.082
 work-0.059
 adventures-0.058
 search-0.052
 family-0.051
Token Ellie
Token ID44801
Feature activation+3.82e-02

Pos logit contributions
pan+0.039
rier+0.036
rol+0.036
otion+0.035
 Woo+0.035
Neg logit contributions
pa-0.023
 work-0.021
 adventures-0.020
 search-0.019
 family-0.019
Tokenched
Token ID1740
Feature activation+1.46e-01

Pos logit contributions
pan+0.178
rol+0.165
rier+0.165
rified+0.161
 Woo+0.160
Neg logit contributions
pa-0.115
 work-0.100
 adventures-0.095
 search-0.094
 family-0.090
Token,
Token ID11
Feature activation+8.78e-02

Pos logit contributions
pan+0.273
 Woo+0.261
rier+0.257
rified+0.253
rol+0.252
Neg logit contributions
pa-0.212
 adventures-0.185
 work-0.185
 search-0.175
 family-0.171
Token the
Token ID262
Feature activation+5.42e-02

Pos logit contributions
pan+0.176
 Woo+0.164
rier+0.162
rol+0.161
otion+0.159
Neg logit contributions
pa-0.118
 work-0.102
 adventures-0.099
 search-0.093
 family-0.091
Token mouse
Token ID10211
Feature activation+9.64e-02

Pos logit contributions
pan+0.092
rier+0.085
 Woo+0.083
rified+0.081
rol+0.080
Neg logit contributions
pa-0.090
 work-0.078
 adventures-0.077
 search-0.075
 each-0.073
Token bid
Token ID8406
Feature activation+6.58e-02

Pos logit contributions
pan+0.213
rol+0.199
 Woo+0.199
rier+0.199
rified+0.196
Neg logit contributions
pa-0.107
 work-0.091
 adventures-0.086
 search-0.081
 family-0.079
Token goodbye
Token ID24829
Feature activation+1.74e-01

Pos logit contributions
pan+0.136
rol+0.128
rier+0.127
 Woo+0.127
otion+0.126
Neg logit contributions
pa-0.083
 work-0.073
 adventures-0.068
 search-0.065
 each-0.065
Token to
Token ID284
Feature activation+2.92e-02

Pos logit contributions
pan+0.369
rier+0.358
 Woo+0.356
rified+0.339
rol+0.336
Neg logit contributions
pa-0.210
 work-0.171
 adventures-0.168
 search-0.161
 family-0.149
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
pan+0.053
 Woo+0.052
rier+0.050
rified+0.049
otion+0.049
Neg logit contributions
pa-0.041
 adventures-0.036
 work-0.036
 search-0.034
 family-0.032
Token huge
Token ID3236
Feature activation-6.82e-04

Pos logit contributions
pan+0.042
rier+0.040
rified+0.038
 Woo+0.038
 coward+0.038
Neg logit contributions
pa-0.045
 adventures-0.038
 work-0.037
 each-0.037
 search-0.037
Token ant
Token ID1885
Feature activation+1.44e-02

Pos logit contributions
pa+0.001
 adventures+0.001
 work+0.001
 search+0.001
 each+0.001
Neg logit contributions
pan-0.001
rier-0.001
 Woo-0.001
rified-0.001
otion-0.001
Token and
Token ID290
Feature activation+1.66e-02

Pos logit contributions
pan+0.031
rol+0.028
 Woo+0.028
rified+0.028
rier+0.028
Neg logit contributions
pa-0.016
 work-0.013
 adventures-0.013
 search-0.011
 each-0.011
Token garden
Token ID11376
Feature activation+7.71e-02

Pos logit contributions
rol+0.078
pan+0.078
rified+0.071
ainted+0.071
rier+0.070
Neg logit contributions
 work-0.073
pa-0.072
 family-0.070
 each-0.067
 dad-0.067
Token.
Token ID13
Feature activation+5.31e-02

Pos logit contributions
pan+0.171
rol+0.160
 Woo+0.155
rier+0.155
otion+0.154
Neg logit contributions
pa-0.084
 work-0.069
 adventures-0.062
 search-0.060
 un-0.056
Token Lily
Token ID20037
Feature activation+8.35e-02

Pos logit contributions
pan+0.091
 Woo+0.088
rol+0.087
otion+0.085
rified+0.084
Neg logit contributions
pa-0.083
 work-0.076
 adventures-0.072
 search-0.068
 each-0.066
Token is
Token ID318
Feature activation+6.39e-02

Pos logit contributions
pan+0.186
 Woo+0.179
rol+0.176
rified+0.174
otion+0.171
Neg logit contributions
pa-0.082
 work-0.067
 adventures-0.065
 search-0.060
 family-0.057
Token the
Token ID262
Feature activation+3.81e-02

Pos logit contributions
pan+0.132
rol+0.124
 Woo+0.124
rier+0.124
rified+0.122
Neg logit contributions
pa-0.079
 work-0.071
 adventures-0.066
 search-0.064
 family-0.063
Token seeker
Token ID45993
Feature activation+1.74e-01

Pos logit contributions
pan+0.069
rier+0.065
 Woo+0.065
rol+0.064
otion+0.063
Neg logit contributions
pa-0.057
 work-0.051
 adventures-0.049
 search-0.047
 each-0.046
Token and
Token ID290
Feature activation+4.50e-02

Pos logit contributions
pan+0.442
rol+0.438
rier+0.433
 Woo+0.430
otion+0.421
Neg logit contributions
pa-0.127
 work-0.095
 each-0.085
 adventures-0.082
 family-0.080
Token Ben
Token ID3932
Feature activation+2.74e-02

Pos logit contributions
pan+0.080
rier+0.077
rol+0.075
otion+0.074
rified+0.073
Neg logit contributions
pa-0.067
 work-0.065
 each-0.062
 family-0.062
 adventures-0.060
Token is
Token ID318
Feature activation+4.63e-02

Pos logit contributions
pan+0.062
rier+0.059
rol+0.058
 Woo+0.058
otion+0.057
Neg logit contributions
pa-0.029
 work-0.025
 adventures-0.024
 search-0.022
 each-0.022
Token the
Token ID262
Feature activation+2.56e-02

Pos logit contributions
pan+0.095
rier+0.093
rol+0.092
rified+0.091
 Woo+0.091
Neg logit contributions
pa-0.057
 work-0.053
 adventures-0.049
 search-0.047
 family-0.046
Token h
Token ID289
Feature activation+6.90e-02

Pos logit contributions
pan+0.041
rol+0.039
rier+0.038
 Woo+0.038
otion+0.038
Neg logit contributions
pa-0.043
 work-0.040
 adventures-0.039
 family-0.037
 search-0.037
Token go
Token ID467
Feature activation+1.27e-01

Pos logit contributions
pa+0.005
 work+0.004
 adventures+0.004
 search+0.004
 family+0.004
Neg logit contributions
pan-0.009
 Woo-0.008
rol-0.008
rier-0.008
otion-0.008
Token home
Token ID1363
Feature activation+4.11e-02

Pos logit contributions
pan+0.253
 Woo+0.246
rol+0.233
rier+0.225
otion+0.224
Neg logit contributions
pa-0.169
 adventures-0.137
 work-0.133
 family-0.129
 each-0.125
Token.
Token ID13
Feature activation+5.03e-02

Pos logit contributions
pan+0.089
 Woo+0.085
rier+0.083
rol+0.081
otion+0.078
Neg logit contributions
pa-0.046
 adventures-0.035
 work-0.035
 search-0.033
 ideas-0.031
Token She
Token ID1375
Feature activation+5.03e-02

Pos logit contributions
pan+0.094
 Woo+0.091
rol+0.090
otion+0.088
rified+0.087
Neg logit contributions
pa-0.071
 work-0.064
 adventures-0.060
 search-0.057
 each-0.056
Token said
Token ID531
Feature activation+1.15e-02

Pos logit contributions
pan+0.104
 Woo+0.099
rol+0.098
rified+0.095
 coward+0.094
Neg logit contributions
pa-0.058
 adventures-0.048
 work-0.048
 search-0.043
 family-0.043
Token goodbye
Token ID24829
Feature activation+1.73e-01

Pos logit contributions
pan+0.024
rol+0.024
rified+0.023
rier+0.023
otion+0.023
Neg logit contributions
pa-0.013
 work-0.012
 each-0.012
 family-0.011
 adventures-0.011
Token to
Token ID284
Feature activation+2.77e-02

Pos logit contributions
pan+0.350
 Woo+0.336
rier+0.332
rol+0.317
rified+0.312
Neg logit contributions
pa-0.217
 work-0.181
 adventures-0.178
 search-0.168
 family-0.157
Token the
Token ID262
Feature activation+2.98e-02

Pos logit contributions
 Woo+0.049
pan+0.047
otion+0.044
 Fif+0.043
rier+0.042
Neg logit contributions
pa-0.038
 adventures-0.034
 work-0.033
 search-0.033
cles-0.032
Token old
Token ID1468
Feature activation-2.51e-02

Pos logit contributions
pan+0.052
rier+0.049
 Woo+0.048
 coward+0.047
rified+0.046
Neg logit contributions
pa-0.046
 work-0.039
 adventures-0.038
 each-0.038
 search-0.038
Token man
Token ID582
Feature activation-5.34e-03

Pos logit contributions
pa+0.036
 adventures+0.031
 work+0.031
 search+0.031
 each+0.030
Neg logit contributions
pan-0.045
 Woo-0.042
rier-0.041
otion-0.039
rified-0.038
Token and
Token ID290
Feature activation+1.63e-02

Pos logit contributions
pa+0.006
 adventures+0.005
 work+0.005
 family+0.004
 search+0.004
Neg logit contributions
pan-0.011
rol-0.010
rier-0.010
otion-0.010
 Woo-0.010
Token go
Token ID467
Feature activation+1.27e-01

Pos logit contributions
pan+0.039
rified+0.036
 Woo+0.036
rol+0.035
 coward+0.035
Neg logit contributions
pa-0.020
 work-0.017
 adventures-0.017
 search-0.016
 own-0.014
Token home
Token ID1363
Feature activation+4.11e-02

Pos logit contributions
pan+0.253
 Woo+0.246
rol+0.233
rier+0.225
otion+0.224
Neg logit contributions
pa-0.169
 adventures-0.137
 work-0.133
 family-0.129
 each-0.125
Token.
Token ID13
Feature activation+5.03e-02

Pos logit contributions
pan+0.089
 Woo+0.085
rier+0.083
rol+0.081
otion+0.078
Neg logit contributions
pa-0.046
 adventures-0.035
 work-0.035
 search-0.033
 ideas-0.031
Token She
Token ID1375
Feature activation+5.03e-02

Pos logit contributions
pan+0.094
 Woo+0.091
rol+0.090
otion+0.088
rified+0.087
Neg logit contributions
pa-0.071
 work-0.064
 adventures-0.060
 search-0.057
 each-0.056
Token said
Token ID531
Feature activation+1.15e-02

Pos logit contributions
pan+0.104
 Woo+0.099
rol+0.098
rified+0.095
 coward+0.094
Neg logit contributions
pa-0.058
 adventures-0.048
 work-0.048
 search-0.043
 family-0.043
Token goodbye
Token ID24829
Feature activation+1.73e-01

Pos logit contributions
pan+0.024
rol+0.024
rified+0.023
rier+0.023
otion+0.023
Neg logit contributions
pa-0.013
 work-0.012
 each-0.012
 family-0.011
 adventures-0.011
Token to
Token ID284
Feature activation+2.77e-02

Pos logit contributions
pan+0.350
 Woo+0.336
rier+0.332
rol+0.317
rified+0.312
Neg logit contributions
pa-0.217
 work-0.181
 adventures-0.178
 search-0.168
 family-0.157
Token the
Token ID262
Feature activation+2.98e-02

Pos logit contributions
 Woo+0.049
pan+0.047
otion+0.044
 Fif+0.043
rier+0.042
Neg logit contributions
pa-0.038
 adventures-0.034
 work-0.033
 search-0.033
cles-0.032
Token shadow
Token ID9082
Feature activation+5.01e-02

Pos logit contributions
pan+0.052
rier+0.049
 Woo+0.048
 coward+0.047
rified+0.046
Neg logit contributions
pa-0.046
 work-0.039
 adventures-0.038
 each-0.038
 search-0.038
Token and
Token ID290
Feature activation+1.93e-02

Pos logit contributions
pan+0.114
rol+0.106
rier+0.105
 Woo+0.105
otion+0.104
Neg logit contributions
pa-0.053
 work-0.045
 adventures-0.042
 search-0.039
 family-0.038
Token thanked
Token ID26280
Feature activation+1.01e-02

Pos logit contributions
pan+0.041
 Woo+0.039
rol+0.038
rified+0.038
rier+0.037
Neg logit contributions
pa-0.022
 work-0.019
 adventures-0.018
 search-0.017
 family-0.016
Token Tom
Token ID4186
Feature activation+9.91e-02

Pos logit contributions
pan+0.085
otion+0.082
rol+0.081
 Lia+0.077
rified+0.077
Neg logit contributions
 adventures-0.053
 work-0.051
pa-0.049
 search-0.045
 own-0.044
Token said
Token ID531
Feature activation+5.98e-02

Pos logit contributions
pan+0.211
rol+0.202
 Woo+0.202
rified+0.201
 coward+0.194
Neg logit contributions
pa-0.105
 adventures-0.093
 work-0.090
 search-0.083
 family-0.079
Token.
Token ID13
Feature activation+7.19e-02

Pos logit contributions
pan+0.133
 Woo+0.125
rier+0.125
rol+0.124
otion+0.123
Neg logit contributions
pa-0.066
 work-0.056
 adventures-0.054
 search-0.050
 family-0.047
Token They
Token ID1119
Feature activation+6.57e-02

Pos logit contributions
pan+0.151
 Woo+0.142
rol+0.141
rier+0.141
otion+0.139
Neg logit contributions
pa-0.088
 work-0.077
 adventures-0.073
 search-0.069
 family-0.067
Token waved
Token ID26834
Feature activation+7.44e-02

Pos logit contributions
pan+0.135
 Woo+0.129
rified+0.125
rol+0.125
 coward+0.123
Neg logit contributions
pa-0.082
 work-0.067
 adventures-0.067
 search-0.062
 family-0.059
Token goodbye
Token ID24829
Feature activation+1.73e-01

Pos logit contributions
pan+0.141
 Woo+0.133
rier+0.133
rol+0.131
rified+0.130
Neg logit contributions
pa-0.106
 work-0.093
 adventures-0.088
 search-0.086
 family-0.082
Token to
Token ID284
Feature activation+2.94e-02

Pos logit contributions
pan+0.320
rier+0.316
 Woo+0.314
rified+0.289
rol+0.287
Neg logit contributions
pa-0.236
 work-0.188
 search-0.188
 adventures-0.188
 un-0.170
Token the
Token ID262
Feature activation+3.09e-02

Pos logit contributions
 Woo+0.047
pan+0.041
 Lia+0.040
 Fif+0.040
 coward+0.040
Neg logit contributions
pa-0.044
 adventures-0.039
 search-0.038
 work-0.037
urious-0.037
Token friendly
Token ID8030
Feature activation+8.34e-03

Pos logit contributions
pan+0.053
rier+0.050
 Woo+0.049
 coward+0.048
rified+0.047
Neg logit contributions
pa-0.049
 work-0.041
 adventures-0.040
 search-0.040
 each-0.039
Token man
Token ID582
Feature activation-3.68e-03

Pos logit contributions
pan+0.014
rier+0.013
 Woo+0.013
rified+0.013
rol+0.013
Neg logit contributions
pa-0.013
 work-0.011
 adventures-0.011
 search-0.011
 own-0.010
Token.
Token ID13
Feature activation+2.25e-02

Pos logit contributions
pa+0.004
 adventures+0.003
 work+0.003
Daddy+0.003
 search+0.003
Neg logit contributions
pan-0.008
rol-0.007
rier-0.007
otion-0.007
 Woo-0.007
Token inside
Token ID2641
Feature activation+5.83e-02

Pos logit contributions
pan+0.150
rol+0.140
 Woo+0.140
rier+0.138
 coward+0.136
Neg logit contributions
pa-0.083
 work-0.066
 adventures-0.064
 search-0.058
Daddy-0.056
Token.
Token ID13
Feature activation+5.56e-02

Pos logit contributions
pan+0.120
otion+0.103
rol+0.102
 Woo+0.100
rier+0.100
Neg logit contributions
pa-0.068
 adventures-0.056
 work-0.053
cles-0.051
Daddy-0.049
Token We
Token ID775
Feature activation+8.41e-02

Pos logit contributions
pan+0.097
 Woo+0.094
rol+0.093
otion+0.090
rified+0.089
Neg logit contributions
pa-0.086
 work-0.078
 adventures-0.074
 search-0.069
 each-0.068
Token can
Token ID460
Feature activation+5.22e-02

Pos logit contributions
pan+0.153
rol+0.142
rified+0.139
 Woo+0.139
otion+0.138
Neg logit contributions
pa-0.121
 adventures-0.106
 work-0.097
 family-0.093
 packages-0.093
Token say
Token ID910
Feature activation+1.90e-02

Pos logit contributions
pan+0.103
otion+0.092
 Woo+0.092
rier+0.090
rol+0.088
Neg logit contributions
pa-0.069
 adventures-0.059
 day-0.051
 family-0.050
 work-0.050
Token goodbye
Token ID24829
Feature activation+1.72e-01

Pos logit contributions
pan+0.030
 Woo+0.028
rier+0.028
rol+0.028
otion+0.027
Neg logit contributions
pa-0.033
 work-0.030
 adventures-0.029
 search-0.028
 family-0.027
Token and
Token ID290
Feature activation+3.16e-02

Pos logit contributions
pan+0.321
 Woo+0.311
rier+0.308
rol+0.294
rified+0.287
Neg logit contributions
pa-0.231
 work-0.200
 adventures-0.197
 search-0.194
 family-0.177
Token thank
Token ID5875
Feature activation+6.35e-02

Pos logit contributions
pan+0.058
 Woo+0.054
 coward+0.053
rier+0.051
 disappears+0.050
Neg logit contributions
pa-0.045
 adventures-0.035
cles-0.032
 work-0.032
 search-0.030
Token him
Token ID683
Feature activation+9.91e-04

Pos logit contributions
 Woo+0.070
 Fif+0.057
 Rib+0.057
 Lia+0.053
rier+0.052
Neg logit contributions
 search-0.119
 adventures-0.119
 needed-0.118
pa-0.116
 work-0.112
Token for
Token ID329
Feature activation-2.32e-02

Pos logit contributions
pan+0.002
 Woo+0.002
rier+0.002
rol+0.002
otion+0.002
Neg logit contributions
pa-0.001
 work-0.001
 adventures-0.001
 search-0.001
 family-0.001
Token being
Token ID852
Feature activation-3.15e-02

Pos logit contributions
pa+0.034
 adventures+0.030
 search+0.030
 own+0.029
 work+0.029
Neg logit contributions
pan-0.040
 Woo-0.039
 coward-0.036
otion-0.035
rier-0.035
Token is
Token ID318
Feature activation+1.13e-01

Pos logit contributions
pan+0.270
rol+0.250
rier+0.247
 Woo+0.246
otion+0.242
Neg logit contributions
pa-0.160
 adventures-0.124
 work-0.124
 family-0.115
 search-0.115
Token okay
Token ID8788
Feature activation+9.06e-02

Pos logit contributions
pan+0.195
 Woo+0.182
rier+0.181
 coward+0.174
rol+0.172
Neg logit contributions
pa-0.180
 adventures-0.140
 work-0.140
 search-0.140
 family-0.130
Token.
Token ID13
Feature activation+5.14e-02

Pos logit contributions
pan+0.181
rier+0.175
otion+0.170
 Woo+0.167
 Fif+0.160
Neg logit contributions
pa-0.106
 work-0.089
 adventures-0.088
 search-0.080
cles-0.079
Token They
Token ID1119
Feature activation+5.15e-02

Pos logit contributions
pan+0.086
rier+0.079
rol+0.079
 Woo+0.078
otion+0.077
Neg logit contributions
pa-0.084
 work-0.076
 adventures-0.073
 search-0.072
 family-0.071
Token say
Token ID910
Feature activation+2.06e-02

Pos logit contributions
pan+0.106
 Woo+0.099
rol+0.098
rier+0.098
rified+0.097
Neg logit contributions
pa-0.064
 work-0.054
 adventures-0.053
 search-0.049
 family-0.048
Token goodbye
Token ID24829
Feature activation+1.72e-01

Pos logit contributions
rol+0.043
pan+0.041
ainted+0.040
rified+0.039
rier+0.038
Neg logit contributions
pa-0.026
 work-0.025
 each-0.025
 family-0.024
 dad-0.024
Token to
Token ID284
Feature activation+2.79e-02

Pos logit contributions
pan+0.349
 Woo+0.333
rier+0.332
rol+0.321
rified+0.316
Neg logit contributions
pa-0.216
 work-0.181
 adventures-0.179
 search-0.167
 family-0.158
Token the
Token ID262
Feature activation+3.12e-02

Pos logit contributions
 Woo+0.051
pan+0.046
otion+0.043
 Fif+0.043
 Rib+0.042
Neg logit contributions
pa-0.038
 adventures-0.035
 search-0.033
 work-0.033
cles-0.032
Token dog
Token ID3290
Feature activation+7.94e-02

Pos logit contributions
pan+0.054
rier+0.051
 Woo+0.050
otion+0.048
rol+0.048
Neg logit contributions
pa-0.049
 work-0.043
 adventures-0.042
 search-0.041
 each-0.040
Token and
Token ID290
Feature activation+2.37e-02

Pos logit contributions
pan+0.197
 Woo+0.187
rol+0.186
rier+0.186
otion+0.184
Neg logit contributions
pa-0.067
 work-0.054
 adventures-0.050
 search-0.046
 family-0.044
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
pan+0.048
 Woo+0.046
rol+0.043
rier+0.043
 coward+0.043
Neg logit contributions
pa-0.030
 work-0.025
 adventures-0.025
 search-0.023
 family-0.022
Token friends
Token ID2460
Feature activation+2.34e-02

Pos logit contributions
pan+0.135
 Woo+0.128
rol+0.128
rier+0.127
rified+0.126
Neg logit contributions
pa-0.075
 work-0.062
 adventures-0.060
 search-0.057
 each-0.053
Token were
Token ID547
Feature activation+5.84e-02

Pos logit contributions
pan+0.052
 Woo+0.050
rol+0.050
rified+0.048
otion+0.048
Neg logit contributions
pa-0.024
 work-0.020
 adventures-0.020
 search-0.018
 own-0.017
Token sad
Token ID6507
Feature activation+1.07e-01

Pos logit contributions
pan+0.116
rol+0.109
 Woo+0.109
rier+0.108
otion+0.107
Neg logit contributions
pa-0.077
 work-0.069
 adventures-0.065
 search-0.063
 each-0.060
Token to
Token ID284
Feature activation+3.54e-02

Pos logit contributions
pan+0.222
 Woo+0.213
rier+0.213
otion+0.206
rified+0.201
Neg logit contributions
pa-0.129
 search-0.097
 work-0.096
 adventures-0.096
 un-0.087
Token say
Token ID910
Feature activation+1.42e-02

Pos logit contributions
 Woo+0.064
pan+0.063
otion+0.059
 coward+0.055
 drown+0.055
Neg logit contributions
pa-0.048
 adventures-0.043
cles-0.042
 packages-0.041
urious-0.038
Token goodbye
Token ID24829
Feature activation+1.71e-01

Pos logit contributions
pan+0.025
rol+0.023
rier+0.023
 Woo+0.023
rified+0.022
Neg logit contributions
pa-0.022
 work-0.020
 adventures-0.019
 each-0.019
 family-0.019
Token to
Token ID284
Feature activation+2.49e-02

Pos logit contributions
pan+0.337
 Woo+0.332
rier+0.327
rol+0.305
rified+0.302
Neg logit contributions
pa-0.214
 work-0.177
 adventures-0.177
 search-0.172
 un-0.150
Token the
Token ID262
Feature activation+2.33e-02

Pos logit contributions
 Woo+0.042
pan+0.038
otion+0.036
rified+0.036
 coward+0.035
Neg logit contributions
pa-0.038
 adventures-0.034
 work-0.034
 search-0.033
 un-0.031
Token plane
Token ID6614
Feature activation+6.27e-02

Pos logit contributions
pan+0.042
rier+0.039
 Woo+0.039
rol+0.038
rified+0.038
Neg logit contributions
pa-0.035
 work-0.030
 adventures-0.029
 search-0.029
 each-0.028
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
pan+0.146
rier+0.134
rol+0.133
 Woo+0.132
otion+0.130
Neg logit contributions
pa-0.063
 work-0.050
 adventures-0.049
 search-0.044
 family-0.041
Token But
Token ID887
Feature activation+1.92e-02

Pos logit contributions
pan+0.027
rier+0.026
rol+0.025
 Woo+0.025
otion+0.025
Neg logit contributions
pa-0.017
 work-0.015
 adventures-0.014
 search-0.013
 family-0.013
Token their
Token ID511
Feature activation+1.07e-01

Pos logit contributions
pan+0.202
 Woo+0.197
otion+0.192
rier+0.182
 downward+0.181
Neg logit contributions
pa-0.136
 search-0.109
 work-0.106
 each-0.103
 adventures-0.102
Token fun
Token ID1257
Feature activation+2.75e-02

Pos logit contributions
pan+0.205
 coward+0.199
rier+0.197
 Woo+0.195
 downward+0.191
Neg logit contributions
pa-0.122
 each-0.088
 search-0.086
 adventures-0.084
 work-0.083
Token,
Token ID11
Feature activation+7.62e-02

Pos logit contributions
pan+0.058
 Woo+0.053
rol+0.052
otion+0.052
rier+0.051
Neg logit contributions
pa-0.030
 adventures-0.022
 work-0.022
 search-0.022
Daddy-0.021
Token Sarah
Token ID10490
Feature activation+4.33e-02

Pos logit contributions
pan+0.158
 Woo+0.151
rol+0.148
rier+0.147
otion+0.144
Neg logit contributions
pa-0.094
 work-0.081
 adventures-0.076
 search-0.073
 each-0.069
Token said
Token ID531
Feature activation+1.50e-02

Pos logit contributions
pan+0.102
 Woo+0.096
rol+0.096
rier+0.095
otion+0.094
Neg logit contributions
pa-0.041
 work-0.034
 adventures-0.032
 search-0.030
 family-0.029
Token goodbye
Token ID24829
Feature activation+1.71e-01

Pos logit contributions
rol+0.029
pan+0.029
rified+0.028
 Woo+0.028
otion+0.027
Neg logit contributions
pa-0.019
 work-0.019
 each-0.017
 family-0.017
 adventures-0.016
Token to
Token ID284
Feature activation+2.69e-02

Pos logit contributions
pan+0.349
 Woo+0.330
rier+0.330
rol+0.312
rified+0.309
Neg logit contributions
pa-0.220
 adventures-0.178
 work-0.177
 search-0.166
 family-0.155
Token the
Token ID262
Feature activation+2.51e-02

Pos logit contributions
 Woo+0.043
pan+0.040
otion+0.038
 downward+0.035
 coward+0.035
Neg logit contributions
pa-0.039
 adventures-0.036
 search-0.034
 work-0.034
urious-0.033
Token L
Token ID406
Feature activation+5.17e-02

Pos logit contributions
pan+0.045
rier+0.042
 Woo+0.042
rol+0.041
rified+0.040
Neg logit contributions
pa-0.038
 work-0.032
 adventures-0.031
 search-0.031
 each-0.030
Tokenazy
Token ID12582
Feature activation-1.28e-02

Pos logit contributions
pan+0.068
rier+0.064
 Woo+0.063
rol+0.061
rified+0.060
Neg logit contributions
pa-0.104
 work-0.095
 adventures-0.091
 search-0.088
 each-0.087
Token Cart
Token ID13690
Feature activation-6.41e-02

Pos logit contributions
pa+0.018
 work+0.016
 adventures+0.015
 search+0.015
 own+0.014
Neg logit contributions
pan-0.024
rified-0.022
 Woo-0.021
otion-0.021
rier-0.021
Token structure
Token ID4645
Feature activation+9.90e-02

Pos logit contributions
pan+0.196
rier+0.186
 Woo+0.178
 coward+0.173
rol+0.172
Neg logit contributions
pa-0.174
 work-0.146
 adventures-0.145
 search-0.142
 each-0.141
Token was
Token ID373
Feature activation+6.48e-02

Pos logit contributions
pan+0.223
rol+0.210
rier+0.205
otion+0.204
rified+0.202
Neg logit contributions
pa-0.107
 work-0.088
 adventures-0.082
 search-0.077
 family-0.076
Token.
Token ID13
Feature activation+4.69e-02

Pos logit contributions
pan+0.123
rol+0.114
rier+0.112
 Woo+0.111
otion+0.110
Neg logit contributions
pa-0.094
 work-0.080
 adventures-0.077
 search-0.074
 family-0.072
Token They
Token ID1119
Feature activation+5.63e-02

Pos logit contributions
pan+0.082
 Woo+0.078
rol+0.078
rier+0.075
otion+0.075
Neg logit contributions
pa-0.073
 work-0.066
 adventures-0.063
 search-0.060
 each-0.058
Token said
Token ID531
Feature activation+1.38e-02

Pos logit contributions
pan+0.115
rol+0.108
 Woo+0.105
rified+0.105
otion+0.103
Neg logit contributions
pa-0.070
 adventures-0.056
 work-0.056
 search-0.051
 family-0.049
Token goodbye
Token ID24829
Feature activation+1.71e-01

Pos logit contributions
pan+0.028
rol+0.027
rified+0.026
 Woo+0.026
rier+0.026
Neg logit contributions
pa-0.017
 work-0.016
 each-0.015
 adventures-0.015
 family-0.015
Token and
Token ID290
Feature activation+2.97e-02

Pos logit contributions
pan+0.334
rier+0.319
 Woo+0.317
rol+0.299
rified+0.296
Neg logit contributions
pa-0.225
 work-0.182
 adventures-0.181
 search-0.172
 family-0.159
Token went
Token ID1816
Feature activation+2.43e-02

Pos logit contributions
pan+0.061
 Woo+0.059
rified+0.057
rol+0.056
 coward+0.056
Neg logit contributions
pa-0.035
 adventures-0.029
 work-0.029
 search-0.026
 own-0.026
Token home
Token ID1363
Feature activation-2.41e-02

Pos logit contributions
pan+0.053
 Woo+0.050
rol+0.050
rier+0.050
otion+0.049
Neg logit contributions
pa-0.028
 work-0.023
 adventures-0.022
 search-0.021
 family-0.021
Token,
Token ID11
Feature activation+4.26e-02

Pos logit contributions
pa+0.028
 adventures+0.020
 work+0.019
 search+0.018
 each+0.017
Neg logit contributions
pan-0.054
rier-0.050
 Woo-0.049
rol-0.047
otion-0.047
Token happy
Token ID3772
Feature activation+4.48e-03

Pos logit contributions
pan+0.076
 Woo+0.073
rol+0.070
rier+0.067
otion+0.066
Neg logit contributions
pa-0.065
 work-0.056
 adventures-0.054
 search-0.049
 each-0.049
Token all
Token ID477
Feature activation+4.19e-02

Pos logit contributions
pan+0.239
rol+0.226
rier+0.225
 Woo+0.224
otion+0.220
Neg logit contributions
pa-0.159
 work-0.138
 adventures-0.131
 search-0.127
 each-0.120
Token done
Token ID1760
Feature activation+5.48e-03

Pos logit contributions
pan+0.083
rol+0.079
 Woo+0.076
rier+0.076
otion+0.075
Neg logit contributions
pa-0.056
 work-0.046
 adventures-0.045
 search-0.044
 family-0.042
Token,
Token ID11
Feature activation+7.35e-02

Pos logit contributions
pan+0.012
rier+0.011
 Woo+0.011
otion+0.011
rol+0.011
Neg logit contributions
pa-0.006
 adventures-0.005
 search-0.005
 work-0.005
 family-0.004
Token Anne
Token ID15397
Feature activation+3.95e-02

Pos logit contributions
pan+0.134
 Woo+0.126
rol+0.123
rier+0.122
otion+0.119
Neg logit contributions
pa-0.108
 work-0.093
 adventures-0.090
 search-0.086
 each-0.081
Token said
Token ID531
Feature activation+1.39e-02

Pos logit contributions
pan+0.087
 Woo+0.080
rol+0.080
rified+0.079
otion+0.078
Neg logit contributions
pa-0.040
 adventures-0.034
 work-0.034
 search-0.031
 own-0.028
Token goodbye
Token ID24829
Feature activation+1.71e-01

Pos logit contributions
pan+0.029
rol+0.028
 Woo+0.027
rier+0.027
rified+0.027
Neg logit contributions
pa-0.017
 work-0.015
 each-0.014
 adventures-0.014
 family-0.014
Token to
Token ID284
Feature activation+2.52e-02

Pos logit contributions
pan+0.352
rier+0.334
 Woo+0.330
rol+0.311
rified+0.308
Neg logit contributions
pa-0.217
 adventures-0.176
 work-0.172
 search-0.165
 family-0.152
Token her
Token ID607
Feature activation+2.87e-02

Pos logit contributions
pan+0.038
 Woo+0.037
otion+0.035
 downward+0.035
 coward+0.033
Neg logit contributions
pa-0.038
 adventures-0.034
 work-0.033
cles-0.032
urious-0.032
Token new
Token ID649
Feature activation+4.06e-02

Pos logit contributions
pan+0.052
rier+0.049
 Woo+0.048
 downward+0.047
 coward+0.046
Neg logit contributions
pa-0.036
Daddy-0.027
Dad-0.026
 each-0.026
cles-0.026
Token friend
Token ID1545
Feature activation+3.34e-02

Pos logit contributions
pan+0.077
rier+0.071
 Woo+0.070
 coward+0.068
rol+0.067
Neg logit contributions
pa-0.050
 work-0.040
 each-0.039
 search-0.037
 asked-0.036
Token and
Token ID290
Feature activation+7.37e-03

Pos logit contributions
pan+0.070
 Woo+0.066
rified+0.064
rol+0.064
otion+0.063
Neg logit contributions
pa-0.035
 work-0.031
 adventures-0.029
 search-0.027
 own-0.026
Token
Token ID6184
Feature activation-2.44e-02

Pos logit contributions
pa+0.019
 work+0.017
 adventures+0.016
 search+0.015
 family+0.015
Neg logit contributions
pan-0.018
 Woo-0.017
rol-0.016
rier-0.016
rified-0.016
Token
Token ID95
Feature activation-1.69e-02

Pos logit contributions
pa+0.042
 work+0.039
 search+0.037
 adventures+0.036
 each+0.035
Neg logit contributions
pan-0.040
rol-0.037
rier-0.036
 Woo-0.036
 coward-0.035
Token
Token ID26391
Feature activation-3.69e-02

Pos logit contributions
pa+0.029
 work+0.026
 adventures+0.025
 search+0.024
 family+0.024
Neg logit contributions
pan-0.027
 Woo-0.025
rol-0.025
rier-0.025
otion-0.024
Token
Token ID129
Feature activation+4.01e-02

Pos logit contributions
pa+0.043
 work+0.037
 adventures+0.035
 search+0.032
 each+0.032
Neg logit contributions
pan-0.078
 Woo-0.074
rol-0.074
otion-0.073
rified-0.073
Token
Token ID241
Feature activation-3.91e-02

Pos logit contributions
pan+0.066
 Woo+0.063
rol+0.061
rier+0.061
otion+0.061
Neg logit contributions
pa-0.065
 work-0.058
 adventures-0.057
 family-0.055
 search-0.055
TokenUn
Token ID3118
Feature activation-1.45e-01

Pos logit contributions
pa+0.058
 work+0.052
 adventures+0.049
 search+0.045
 family+0.045
Neg logit contributions
 Woo-0.070
pan-0.069
rol-0.068
otion-0.066
rified-0.066
Tokenlock
Token ID5354
Feature activation-5.13e-02

Pos logit contributions
 adventures+0.242
 work+0.238
pa+0.233
 family+0.226
 search+0.226
Neg logit contributions
pan-0.205
rier-0.198
rified-0.196
rol-0.191
otion-0.191
Token the
Token ID262
Feature activation-1.81e-02

Pos logit contributions
pa+0.056
 work+0.048
 adventures+0.045
 search+0.042
 each+0.041
Neg logit contributions
pan-0.114
rier-0.108
rol-0.107
 Woo-0.107
otion-0.106
Token door
Token ID3420
Feature activation+2.34e-03

Pos logit contributions
pa+0.029
 each+0.026
 adventures+0.025
 work+0.024
 search+0.024
Neg logit contributions
rified-0.027
pan-0.027
rier-0.026
 coward-0.026
 liar-0.025
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
pan+0.005
otion+0.005
rol+0.005
rier+0.005
rified+0.005
Neg logit contributions
pa-0.003
 adventures-0.002
 work-0.002
 family-0.002
 each-0.002
Token let
Token ID1309
Feature activation+2.55e-02

Pos logit contributions
pa+0.022
 adventures+0.020
 work+0.019
 search+0.018
 own+0.018
Neg logit contributions
pan-0.025
 Woo-0.025
rified-0.024
rol-0.023
otion-0.023
Token than
Token ID621
Feature activation-7.81e-02

Pos logit contributions
pa+0.012
 work+0.009
 search+0.009
 un+0.009
Daddy+0.009
Neg logit contributions
pan-0.020
rier-0.020
rified-0.018
rol-0.018
 Woo-0.018
Token Uncle
Token ID23169
Feature activation-4.98e-02

Pos logit contributions
pa+0.119
 work+0.107
 adventures+0.103
 search+0.102
 family+0.095
Neg logit contributions
pan-0.134
rier-0.127
 Woo-0.127
rified-0.125
rol-0.123
Token!
Token ID0
Feature activation-2.68e-02

Pos logit contributions
pa+0.075
 work+0.073
 search+0.068
 adventures+0.065
 own+0.065
Neg logit contributions
rified-0.078
 Woo-0.076
pan-0.075
rol-0.072
rier-0.071
Token\n
Token ID198
Feature activation-3.71e-02

Pos logit contributions
pa+0.040
 work+0.038
 adventures+0.036
 search+0.034
 own+0.032
Neg logit contributions
 Woo-0.047
pan-0.046
rol-0.044
rified-0.044
rier-0.042
Token\n
Token ID198
Feature activation-3.78e-02

Pos logit contributions
pa+0.046
 work+0.042
 adventures+0.037
 ideas+0.036
 search+0.035
Neg logit contributions
 Woo-0.074
pan-0.073
rol-0.070
otion-0.069
Flo-0.068
TokenUn
Token ID3118
Feature activation-1.44e-01

Pos logit contributions
pa+0.063
 work+0.061
 search+0.056
 adventures+0.055
 own+0.054
Neg logit contributions
 Woo-0.059
pan-0.058
rol-0.054
 coward-0.052
rified-0.051
Tokencle
Token ID2375
Feature activation-8.84e-02

Pos logit contributions
 search+0.192
 adventures+0.191
 own+0.189
 work+0.182
 un+0.182
Neg logit contributions
pan-0.239
plets-0.229
rier-0.227
rified-0.219
otion-0.217
Token was
Token ID373
Feature activation-2.64e-02

Pos logit contributions
pa+0.118
 work+0.111
 adventures+0.105
 search+0.103
 own+0.099
Neg logit contributions
 Woo-0.159
pan-0.158
rified-0.158
rol-0.157
rier-0.149
Token so
Token ID523
Feature activation-3.96e-02

Pos logit contributions
pa+0.039
 work+0.035
 adventures+0.033
 search+0.032
 each+0.031
Neg logit contributions
pan-0.049
rol-0.046
 Woo-0.045
rier-0.045
otion-0.045
Token happy
Token ID3772
Feature activation-3.36e-02

Pos logit contributions
pa+0.060
 work+0.051
 adventures+0.051
 search+0.048
 each+0.046
Neg logit contributions
pan-0.071
rier-0.068
rol-0.068
 Woo-0.067
otion-0.067
Token he
Token ID339
Feature activation-2.87e-02

Pos logit contributions
pa+0.044
 work+0.037
 adventures+0.035
 search+0.035
 own+0.034
Neg logit contributions
pan-0.065
 Woo-0.063
rier-0.062
rol-0.061
otion-0.060
Token down
Token ID866
Feature activation-4.06e-02

Pos logit contributions
pa+0.031
 adventures+0.027
 work+0.026
 search+0.024
 own+0.024
Neg logit contributions
pan-0.033
 Woo-0.032
rol-0.031
rified-0.030
rier-0.030
Token.
Token ID13
Feature activation-2.89e-02

Pos logit contributions
pa+0.045
 work+0.037
 adventures+0.035
 search+0.033
 family+0.032
Neg logit contributions
pan-0.091
rier-0.086
rol-0.085
 Woo-0.085
rified-0.083
Token
Token ID220
Feature activation-6.91e-03

Pos logit contributions
pa+0.054
 work+0.050
 adventures+0.049
 adventure+0.045
 search+0.045
Neg logit contributions
 Woo-0.040
pan-0.037
rified-0.036
rol-0.035
otion-0.035
Token\n
Token ID198
Feature activation-3.58e-02

Pos logit contributions
pa+0.008
 work+0.006
 adventures+0.006
 search+0.006
 family+0.005
Neg logit contributions
pan-0.015
rol-0.015
 Woo-0.015
otion-0.015
rier-0.014
Token\n
Token ID198
Feature activation-3.68e-02

Pos logit contributions
pa+0.045
 work+0.041
 adventures+0.037
 search+0.035
 ideas+0.033
Neg logit contributions
 Woo-0.073
pan-0.070
rol-0.069
otion-0.068
ula-0.068
TokenUn
Token ID3118
Feature activation-1.44e-01

Pos logit contributions
pa+0.068
 work+0.064
 adventures+0.060
 search+0.060
 own+0.057
Neg logit contributions
 Woo-0.052
pan-0.050
rol-0.047
ula-0.046
otion-0.046
Tokencle
Token ID2375
Feature activation-8.59e-02

Pos logit contributions
 own+0.200
 adventures+0.197
 search+0.197
 family+0.187
 work+0.184
Neg logit contributions
pan-0.237
plets-0.222
ner-0.219
rier-0.215
rified-0.214
Token looked
Token ID3114
Feature activation-1.79e-02

Pos logit contributions
pa+0.117
 work+0.111
 adventures+0.108
 search+0.105
 own+0.103
Neg logit contributions
 Woo-0.149
rified-0.149
pan-0.149
rol-0.148
rier-0.138
Token at
Token ID379
Feature activation-4.74e-02

Pos logit contributions
pa+0.022
 work+0.019
 adventures+0.018
 search+0.017
 family+0.016
Neg logit contributions
pan-0.038
rol-0.036
 Woo-0.036
rier-0.036
otion-0.035
Token her
Token ID607
Feature activation-1.85e-02

Pos logit contributions
pa+0.064
 work+0.057
 adventures+0.056
 search+0.054
 family+0.052
Neg logit contributions
pan-0.087
 Woo-0.086
rified-0.084
rol-0.081
rier-0.080
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
pa+0.025
 adventures+0.020
 work+0.018
 search+0.017
Daddy+0.017
Neg logit contributions
pan-0.035
rier-0.034
rified-0.032
otion-0.031
 downward-0.031
Token my
Token ID616
Feature activation+2.33e-02

Pos logit contributions
pa+0.117
 work+0.107
 search+0.101
 adventures+0.100
 family+0.091
Neg logit contributions
 Woo-0.094
 coward-0.080
pan-0.079
 Fif-0.078
rol-0.077
Token dog
Token ID3290
Feature activation+3.84e-02

Pos logit contributions
pan+0.055
 Woo+0.052
rier+0.052
rol+0.052
otion+0.051
Neg logit contributions
pa-0.023
 work-0.019
 adventures-0.018
 search-0.017
 each-0.016
Token?"
Token ID1701
Feature activation-9.62e-03

Pos logit contributions
pan+0.072
 Woo+0.070
rier+0.067
 coward+0.066
rified+0.066
Neg logit contributions
pa-0.053
 work-0.046
 adventures-0.045
 search-0.043
 each-0.040
Token\n
Token ID198
Feature activation-2.21e-02

Pos logit contributions
pa+0.015
 work+0.014
 adventures+0.014
 search+0.013
 ideas+0.012
Neg logit contributions
 Woo-0.016
pan-0.015
rified-0.014
otion-0.014
 coward-0.014
Token\n
Token ID198
Feature activation-2.37e-02

Pos logit contributions
pa+0.024
 work+0.023
 adventures+0.021
 search+0.019
 ideas+0.019
Neg logit contributions
 Woo-0.048
pan-0.045
otion-0.043
 coward-0.042
rified-0.042
TokenUn
Token ID3118
Feature activation-1.43e-01

Pos logit contributions
pa+0.045
 work+0.043
 adventures+0.041
 search+0.039
 own+0.038
Neg logit contributions
 Woo-0.032
pan-0.030
 coward-0.027
rified-0.026
 downward-0.026
Tokencle
Token ID2375
Feature activation-8.53e-02

Pos logit contributions
 adventures+0.177
 work+0.173
 search+0.166
 own+0.165
 family+0.161
Neg logit contributions
pan-0.272
rier-0.253
 liar-0.250
 coward-0.246
 Woo-0.245
Token Ben
Token ID3932
Feature activation-2.19e-02

Pos logit contributions
pa+0.156
 work+0.151
 adventures+0.147
 own+0.142
 search+0.141
Neg logit contributions
rified-0.109
 Woo-0.108
pan-0.107
otion-0.099
rol-0.098
Token looked
Token ID3114
Feature activation-1.42e-02

Pos logit contributions
pa+0.030
 work+0.027
 adventures+0.027
 own+0.025
 day+0.024
Neg logit contributions
pan-0.039
rified-0.038
 Woo-0.038
otion-0.036
 coward-0.035
Token at
Token ID379
Feature activation-4.58e-02

Pos logit contributions
pa+0.017
 work+0.015
 adventures+0.014
 search+0.014
 family+0.013
Neg logit contributions
pan-0.029
rier-0.028
 Woo-0.028
rified-0.028
rol-0.028
Token Lily
Token ID20037
Feature activation+1.64e-02

Pos logit contributions
pa+0.065
 work+0.058
 adventures+0.056
 search+0.053
 family+0.053
Neg logit contributions
pan-0.086
 Woo-0.082
rified-0.079
rier-0.078
rol-0.077
Token the
Token ID262
Feature activation-2.45e-02

Pos logit contributions
pa+0.052
 work+0.042
 adventures+0.040
ossible+0.039
 search+0.039
Neg logit contributions
pan-0.071
 Woo-0.067
rier-0.067
rified-0.063
 coward-0.062
Token coat
Token ID13209
Feature activation-5.01e-03

Pos logit contributions
pa+0.034
 each+0.029
 work+0.029
 adventures+0.028
 search+0.027
Neg logit contributions
pan-0.044
 Woo-0.043
rier-0.042
rified-0.041
rol-0.040
Token.
Token ID13
Feature activation-2.66e-02

Pos logit contributions
pa+0.005
 work+0.004
 adventures+0.004
 search+0.004
 each+0.003
Neg logit contributions
pan-0.012
 Woo-0.011
rol-0.011
rier-0.011
otion-0.011
Token\n
Token ID198
Feature activation-3.47e-02

Pos logit contributions
pa+0.039
 work+0.038
 adventures+0.036
 search+0.032
 each+0.031
Neg logit contributions
 Woo-0.046
rified-0.044
rol-0.043
pan-0.042
 Fif-0.041
Token\n
Token ID198
Feature activation-3.56e-02

Pos logit contributions
pa+0.044
 work+0.041
 adventures+0.038
 ideas+0.036
 each+0.034
Neg logit contributions
 Woo-0.067
pan-0.064
otion-0.064
rol-0.063
 coward-0.062
TokenUn
Token ID3118
Feature activation-1.43e-01

Pos logit contributions
pa+0.063
 work+0.060
 adventures+0.056
 search+0.055
 ideas+0.053
Neg logit contributions
 Woo-0.051
pan-0.049
 coward-0.048
rol-0.048
rified-0.046
Tokencle
Token ID2375
Feature activation-8.88e-02

Pos logit contributions
 search+0.200
 adventures+0.198
 own+0.194
 work+0.184
 family+0.184
Neg logit contributions
pan-0.227
rier-0.224
rified-0.214
plets-0.211
rol-0.210
Token Mike
Token ID4995
Feature activation-8.98e-02

Pos logit contributions
pa+0.114
 work+0.108
 adventures+0.105
 search+0.101
 own+0.097
Neg logit contributions
rified-0.164
pan-0.164
 Woo-0.163
rol-0.161
rier-0.155
Token said
Token ID531
Feature activation-4.59e-02

Pos logit contributions
pa+0.100
 adventures+0.094
 work+0.091
 search+0.084
 own+0.080
Neg logit contributions
pan-0.185
rified-0.179
 Woo-0.173
rol-0.170
ainted-0.169
Token to
Token ID284
Feature activation-2.16e-02

Pos logit contributions
pa+0.055
 work+0.049
 adventures+0.045
 each+0.044
 family+0.043
Neg logit contributions
pan-0.096
rol-0.092
 Woo-0.090
rier-0.090
otion-0.089
Token himself
Token ID2241
Feature activation-1.16e-01

Pos logit contributions
 adventures+0.033
pa+0.032
 work+0.030
 search+0.029
 interests+0.029
Neg logit contributions
 Woo-0.034
pan-0.032
rified-0.031
otion-0.030
rier-0.029
Token not
Token ID407
Feature activation+6.44e-03

Pos logit contributions
pa+0.053
 adventures+0.048
 search+0.047
 work+0.046
 family+0.043
Neg logit contributions
 Woo-0.047
pan-0.047
otion-0.044
rier-0.044
 coward-0.043
Token nice
Token ID3621
Feature activation-1.48e-02

Pos logit contributions
pan+0.011
 Woo+0.010
rier+0.010
 coward+0.010
rol+0.010
Neg logit contributions
pa-0.010
 search-0.009
 adventures-0.009
 work-0.009
 own-0.008
Token."
Token ID526
Feature activation-2.30e-04

Pos logit contributions
pa+0.015
 search+0.013
 adventures+0.013
 work+0.012
 own+0.011
Neg logit contributions
pan-0.033
rier-0.031
otion-0.030
 Woo-0.030
rol-0.030
Token
Token ID198
Feature activation-2.00e-02

Pos logit contributions
pa+0.000
 work+0.000
 adventures+0.000
 search+0.000
 family+0.000
Neg logit contributions
pan-0.000
 Woo-0.000
rier-0.000
rol-0.000
otion-0.000
Token
Token ID198
Feature activation-2.16e-02

Pos logit contributions
pa+0.023
 work+0.020
 adventures+0.019
 search+0.019
 ideas+0.017
Neg logit contributions
 Woo-0.042
pan-0.040
otion-0.039
ula-0.037
 downward-0.037
TokenUn
Token ID3118
Feature activation-1.41e-01

Pos logit contributions
pa+0.035
 work+0.033
 adventures+0.032
 search+0.031
 own+0.029
Neg logit contributions
 Woo-0.035
pan-0.035
otion-0.032
rier-0.031
 coward-0.031
Tokencle
Token ID2375
Feature activation-8.68e-02

Pos logit contributions
 adventures+0.183
 search+0.169
 work+0.160
 own+0.156
 family+0.156
Neg logit contributions
pan-0.273
rier-0.250
atching-0.247
otion-0.242
arling-0.240
Token Tom
Token ID4186
Feature activation-3.23e-02

Pos logit contributions
pa+0.150
 work+0.140
 adventures+0.137
 search+0.132
 own+0.129
Neg logit contributions
pan-0.129
 Woo-0.123
rified-0.121
otion-0.118
rier-0.117
Token laughed
Token ID13818
Feature activation+1.49e-02

Pos logit contributions
pa+0.034
 adventures+0.030
 work+0.029
 search+0.027
 own+0.027
Neg logit contributions
pan-0.070
 Woo-0.067
rified-0.067
otion-0.066
 coward-0.064
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
pan+0.035
rier+0.035
 Woo+0.034
rified+0.034
otion+0.033
Neg logit contributions
pa-0.014
 work-0.011
 adventures-0.010
 search-0.010
 family-0.009
Token He
Token ID679
Feature activation-4.19e-02

Pos logit contributions
pa+0.020
 work+0.018
 adventures+0.017
 search+0.016
 each+0.016
Neg logit contributions
pan-0.027
 Woo-0.025
rier-0.025
rol-0.025
otion-0.025
Tokenâ
Token ID22940
Feature activation-9.53e-03

Pos logit contributions
pa+0.042
 work+0.038
 adventures+0.037
 search+0.035
 family+0.035
Neg logit contributions
pan-0.035
 Woo-0.033
rol-0.033
rier-0.032
otion-0.032
Token
Token ID26391
Feature activation-5.83e-02

Pos logit contributions
pa+0.017
 adventures+0.016
 work+0.015
 family+0.015
 search+0.015
Neg logit contributions
pan-0.014
rol-0.013
 Woo-0.013
rified-0.013
otion-0.013
Token
Token ID198
Feature activation-3.77e-02

Pos logit contributions
pa+0.100
 work+0.091
 adventures+0.086
 search+0.084
 each+0.082
Neg logit contributions
pan-0.090
 Woo-0.087
rol-0.085
otion-0.083
rified-0.082
Token
Token ID198
Feature activation-3.84e-02

Pos logit contributions
pa+0.045
 work+0.039
 adventures+0.035
 search+0.033
 each+0.030
Neg logit contributions
 Woo-0.081
pan-0.079
rol-0.078
otion-0.077
ula-0.075
TokenFriend
Token ID23331
Feature activation-8.04e-02

Pos logit contributions
pa+0.064
 work+0.058
 adventures+0.054
 search+0.053
 family+0.051
Neg logit contributions
pan-0.064
 Woo-0.063
rol-0.061
otion-0.059
rier-0.058
Token 2
Token ID362
Feature activation-1.41e-01

Pos logit contributions
pa+0.144
 adventures+0.132
 work+0.131
 search+0.130
 family+0.124
Neg logit contributions
pan-0.116
rol-0.111
rified-0.110
 Woo-0.109
otion-0.109
Token:
Token ID25
Feature activation-7.83e-02

Pos logit contributions
pa+0.186
 search+0.168
 adventures+0.165
 work+0.165
 visit+0.149
Neg logit contributions
pan-0.252
rol-0.248
rified-0.247
 Woo-0.241
rier-0.240
Token
Token ID6184
Feature activation-4.40e-02

Pos logit contributions
pa+0.134
 work+0.119
 adventures+0.117
 search+0.111
 family+0.105
Neg logit contributions
pan-0.125
 Woo-0.121
rol-0.117
rier-0.116
otion-0.115
Token
Token ID95
Feature activation-3.62e-02

Pos logit contributions
pa+0.076
 work+0.071
 adventures+0.066
 each+0.065
 search+0.065
Neg logit contributions
pan-0.070
rier-0.064
rol-0.064
 Woo-0.062
rified-0.062
Token
Token ID26391
Feature activation-6.09e-02

Pos logit contributions
pa+0.065
 work+0.059
 adventures+0.058
 search+0.056
 family+0.056
Neg logit contributions
pan-0.054
 Woo-0.050
rol-0.050
rier-0.050
otion-0.049
Token
Token ID129
Feature activation+2.15e-02

Pos logit contributions
pa+0.104
 work+0.095
 adventures+0.090
 search+0.088
 each+0.085
Neg logit contributions
pan-0.094
 Woo-0.091
rol-0.089
otion-0.086
rified-0.085
Token the
Token ID262
Feature activation-1.85e-02

Pos logit contributions
pa+0.021
cles+0.017
 adventures+0.016
 ideas+0.015
 needed+0.015
Neg logit contributions
 Woo-0.025
pan-0.024
otion-0.022
 downward-0.021
 Fif-0.021
Token backyard
Token ID24296
Feature activation-1.52e-02

Pos logit contributions
pa+0.025
 work+0.022
 adventures+0.021
 search+0.020
 family+0.020
Neg logit contributions
pan-0.036
 Woo-0.034
rol-0.034
rier-0.034
otion-0.033
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
pa+0.014
 work+0.011
 adventures+0.010
 search+0.009
 family+0.009
Neg logit contributions
pan-0.036
rol-0.035
 Woo-0.034
rier-0.034
otion-0.034
Token
Token ID198
Feature activation-2.66e-02

Pos logit contributions
pa+0.025
 work+0.024
 each+0.023
 search+0.023
 family+0.023
Neg logit contributions
pan-0.034
rier-0.032
otion-0.030
 coward-0.030
rol-0.030
Token
Token ID198
Feature activation-2.83e-02

Pos logit contributions
pa+0.024
 work+0.019
 adventures+0.018
 search+0.016
 family+0.015
Neg logit contributions
pan-0.064
 Woo-0.063
rol-0.062
otion-0.061
rier-0.060
TokenUn
Token ID3118
Feature activation-1.41e-01

Pos logit contributions
pa+0.033
 work+0.030
 adventures+0.029
 each+0.029
 search+0.028
Neg logit contributions
pan-0.059
rier-0.058
rified-0.055
rol-0.054
otion-0.054
Tokencle
Token ID2375
Feature activation-8.01e-02

Pos logit contributions
 adventures+0.178
 own+0.173
pa+0.173
 work+0.173
 search+0.169
Neg logit contributions
pan-0.261
rier-0.238
plets-0.237
atching-0.230
 Woo-0.229
Token sprayed
Token ID35729
Feature activation+3.16e-02

Pos logit contributions
pa+0.115
 work+0.105
 adventures+0.100
 search+0.097
 own+0.093
Neg logit contributions
pan-0.145
 Woo-0.139
rol-0.137
rified-0.136
rier-0.135
Token them
Token ID606
Feature activation-3.50e-03

Pos logit contributions
pan+0.062
 Woo+0.059
rol+0.059
rier+0.058
rified+0.058
Neg logit contributions
pa-0.043
 work-0.037
 adventures-0.035
 search-0.034
 family-0.033
Token with
Token ID351
Feature activation-1.59e-02

Pos logit contributions
pa+0.005
 work+0.004
 adventures+0.004
 family+0.003
 search+0.003
Neg logit contributions
pan-0.007
rier-0.007
 Woo-0.007
rol-0.007
otion-0.007
Token the
Token ID262
Feature activation-2.44e-02

Pos logit contributions
pa+0.024
 work+0.020
 adventures+0.019
 search+0.019
 own+0.018
Neg logit contributions
pan-0.030
 Woo-0.028
rol-0.027
rier-0.026
rified-0.026
Token the
Token ID262
Feature activation-2.84e-02

Pos logit contributions
pa+0.024
 work+0.020
 adventures+0.019
 search+0.019
 family+0.018
Neg logit contributions
pan-0.040
 Woo-0.038
rier-0.037
otion-0.037
rol-0.037
Token rocks
Token ID12586
Feature activation-6.78e-02

Pos logit contributions
pa+0.039
 adventures+0.031
 work+0.031
 search+0.031
 each+0.030
Neg logit contributions
pan-0.054
rier-0.052
rol-0.050
 Woo-0.050
rified-0.050
Token,
Token ID11
Feature activation-7.80e-03

Pos logit contributions
pa+0.084
 work+0.068
 adventures+0.065
 search+0.063
 family+0.062
Neg logit contributions
pan-0.145
rier-0.134
rol-0.133
otion-0.130
 Woo-0.130
Token adding
Token ID4375
Feature activation-6.00e-02

Pos logit contributions
pa+0.010
 work+0.009
 adventures+0.008
 search+0.008
 family+0.008
Neg logit contributions
pan-0.016
 Woo-0.015
rol-0.015
rier-0.014
otion-0.014
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
pa+0.107
Daddy+0.083
 family+0.082
cle+0.081
 grandparents+0.079
Neg logit contributions
pan-0.094
rol-0.085
rier-0.084
 Woo-0.081
 downward-0.080
Token subtract
Token ID34128
Feature activation-1.40e-01

Pos logit contributions
pa+0.040
 own+0.031
ossible+0.031
Daddy+0.031
 family+0.031
Neg logit contributions
pan-0.029
rol-0.025
 Woo-0.024
 downward-0.024
 Rib-0.024
Tokening
Token ID278
Feature activation-4.25e-03

Pos logit contributions
pa+0.248
 search+0.219
 family+0.211
 adventures+0.207
 work+0.205
Neg logit contributions
rier-0.223
pan-0.220
rol-0.216
 downward-0.196
otion-0.194
Token.
Token ID13
Feature activation-3.02e-02

Pos logit contributions
pa+0.006
 family+0.005
 adventures+0.004
 search+0.004
 work+0.004
Neg logit contributions
pan-0.008
rier-0.008
rol-0.007
rified-0.007
 Woo-0.007
Token They
Token ID1119
Feature activation-7.75e-03

Pos logit contributions
pa+0.048
 work+0.043
 adventures+0.041
 search+0.040
 each+0.039
Neg logit contributions
pan-0.053
 Woo-0.049
rier-0.049
rol-0.049
otion-0.048
Token spent
Token ID3377
Feature activation-2.64e-02

Pos logit contributions
pa+0.010
 adventures+0.009
 packages+0.009
 surroundings+0.008
 work+0.008
Neg logit contributions
pan-0.014
rol-0.013
otion-0.012
 Woo-0.012
 coward-0.012
Token so
Token ID523
Feature activation-3.71e-02

Pos logit contributions
pa+0.040
 work+0.033
 search+0.029
 packages+0.029
 family+0.029
Neg logit contributions
 Woo-0.044
pan-0.044
rol-0.041
 downward-0.040
rified-0.040
Token The
Token ID383
Feature activation-2.21e-02

Pos logit contributions
pa+0.023
 work+0.021
 adventures+0.020
 search+0.019
 each+0.019
Neg logit contributions
pan-0.035
rol-0.033
rier-0.033
 Woo-0.033
otion-0.032
Token mouse
Token ID10211
Feature activation+3.23e-02

Pos logit contributions
pa+0.045
 adventures+0.039
 work+0.038
 each+0.038
 search+0.037
Neg logit contributions
pan-0.028
rier-0.027
 coward-0.026
rified-0.026
 Woo-0.026
Token was
Token ID373
Feature activation-2.29e-02

Pos logit contributions
pan+0.066
rified+0.062
rier+0.060
rol+0.060
 coward+0.060
Neg logit contributions
pa-0.040
 work-0.034
 adventures-0.033
 search-0.029
 own-0.028
Token happy
Token ID3772
Feature activation-3.08e-02

Pos logit contributions
pa+0.035
 work+0.030
 adventures+0.029
 search+0.027
 family+0.027
Neg logit contributions
pan-0.042
rier-0.039
 Woo-0.039
rol-0.039
otion-0.038
Token to
Token ID284
Feature activation-2.21e-02

Pos logit contributions
pa+0.038
 work+0.032
 adventures+0.030
 search+0.029
 own+0.028
Neg logit contributions
pan-0.064
 Woo-0.060
rier-0.059
rified-0.059
otion-0.058
Token oblig
Token ID6263
Feature activation-1.39e-01

Pos logit contributions
pa+0.031
 adventures+0.027
 packages+0.023
 grand+0.023
 work+0.022
Neg logit contributions
pan-0.042
 Woo-0.040
 coward-0.038
otion-0.037
rier-0.037
Tokene
Token ID68
Feature activation-2.05e-02

Pos logit contributions
 work+0.243
 adventures+0.237
pa+0.233
 search+0.223
 family+0.212
Neg logit contributions
rified-0.201
pan-0.190
rol-0.186
otion-0.179
rier-0.178
Token and
Token ID290
Feature activation-1.86e-02

Pos logit contributions
pa+0.016
 work+0.013
 each+0.012
 adventures+0.011
 family+0.011
Neg logit contributions
pan-0.051
rol-0.049
otion-0.049
 Woo-0.048
rier-0.048
Token quickly
Token ID2952
Feature activation-8.89e-03

Pos logit contributions
pa+0.026
 adventures+0.021
 work+0.021
 search+0.020
 family+0.019
Neg logit contributions
pan-0.036
 Woo-0.034
rier-0.034
rol-0.033
rified-0.033
Token ran
Token ID4966
Feature activation+2.41e-02

Pos logit contributions
pa+0.012
 work+0.010
 adventures+0.010
 search+0.010
 family+0.009
Neg logit contributions
pan-0.018
 Woo-0.016
rier-0.016
rol-0.016
rified-0.016
Token off
Token ID572
Feature activation+1.06e-02

Pos logit contributions
pan+0.046
rier+0.043
otion+0.043
rol+0.043
 Woo+0.042
Neg logit contributions
pa-0.034
 work-0.032
 adventures-0.030
 search-0.029
 each-0.029
Token was
Token ID373
Feature activation-2.05e-02

Pos logit contributions
pa+0.005
 adventures+0.004
 work+0.004
 search+0.004
 packages+0.004
Neg logit contributions
pan-0.007
rol-0.006
rified-0.006
 Woo-0.006
ainted-0.006
Token counting
Token ID14143
Feature activation-7.70e-02

Pos logit contributions
pa+0.029
 work+0.025
 adventures+0.024
 search+0.023
 family+0.023
Neg logit contributions
pan-0.040
 Woo-0.037
rol-0.037
rier-0.037
otion-0.036
Token,
Token ID11
Feature activation+6.60e-04

Pos logit contributions
pa+0.104
 work+0.082
 search+0.082
 family+0.080
 adventures+0.077
Neg logit contributions
pan-0.150
rier-0.144
rified-0.140
rol-0.139
 Woo-0.137
Token adding
Token ID4375
Feature activation-5.94e-02

Pos logit contributions
pan+0.001
 Woo+0.001
rol+0.001
rier+0.001
rified+0.001
Neg logit contributions
pa-0.001
 work-0.001
 adventures-0.001
 search-0.001
 family-0.001
Token and
Token ID290
Feature activation-1.65e-02

Pos logit contributions
pa+0.098
 family+0.075
Daddy+0.075
cle+0.073
 search+0.073
Neg logit contributions
pan-0.095
 downward-0.091
rier-0.089
rol-0.086
rified-0.085
Token subtract
Token ID34128
Feature activation-1.38e-01

Pos logit contributions
pa+0.028
ossible+0.023
 own+0.022
 family+0.022
by+0.021
Neg logit contributions
pan-0.024
 downward-0.021
rol-0.021
ner-0.021
otion-0.020
Tokening
Token ID278
Feature activation-7.45e-03

Pos logit contributions
pa+0.252
 search+0.223
 adventures+0.217
 family+0.215
 work+0.214
Neg logit contributions
pan-0.212
rier-0.204
rol-0.204
otion-0.190
rified-0.187
Token like
Token ID588
Feature activation-5.01e-02

Pos logit contributions
pa+0.010
 family+0.008
 search+0.007
 adventures+0.007
Daddy+0.007
Neg logit contributions
pan-0.015
rier-0.014
rified-0.013
rol-0.013
 downward-0.013
Token a
Token ID257
Feature activation-1.65e-02

Pos logit contributions
pa+0.070
 work+0.058
 search+0.055
 adventures+0.055
 family+0.053
Neg logit contributions
pan-0.096
 Woo-0.090
rier-0.089
rol-0.088
otion-0.087
Token pro
Token ID386
Feature activation+5.38e-03

Pos logit contributions
pa+0.028
 work+0.022
 adventures+0.022
 each+0.022
 search+0.021
Neg logit contributions
pan-0.028
rier-0.027
 coward-0.026
 Woo-0.025
rol-0.025
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
pan+0.012
rier+0.011
otion+0.011
rol+0.011
 Woo+0.011
Neg logit contributions
pa-0.006
 search-0.005
 work-0.005
 adventures-0.005
 family-0.005
Token little
Token ID1310
Feature activation+1.96e-02

Pos logit contributions
pa+0.033
 work+0.029
 each+0.029
 adventures+0.029
 search+0.028
Neg logit contributions
 Woo-0.033
rified-0.031
pan-0.030
rier-0.030
 coward-0.029
Token boy
Token ID2933
Feature activation-1.72e-02

Pos logit contributions
pan+0.043
 Woo+0.041
rier+0.040
rified+0.040
rol+0.039
Neg logit contributions
pa-0.022
 work-0.018
 adventures-0.018
 search-0.016
 each-0.016
Token said
Token ID531
Feature activation-3.90e-02

Pos logit contributions
pa+0.021
 adventures+0.017
 work+0.017
 search+0.016
 own+0.015
Neg logit contributions
pan-0.033
 Woo-0.033
rol-0.032
rified-0.031
ainted-0.030
Token,
Token ID11
Feature activation+7.10e-04

Pos logit contributions
pa+0.038
 work+0.034
 each+0.033
 family+0.032
 adventures+0.030
Neg logit contributions
pan-0.089
rol-0.086
rified-0.084
otion-0.084
 Woo-0.083
Token "
Token ID366
Feature activation-8.33e-03

Pos logit contributions
pan+0.001
 Woo+0.001
rier+0.001
otion+0.001
rol+0.001
Neg logit contributions
pa-0.001
 work-0.001
 adventures-0.001
 search-0.001
 own-0.001
TokenUn
Token ID3118
Feature activation-1.38e-01

Pos logit contributions
 work+0.016
pa+0.016
 adventures+0.015
 search+0.014
 own+0.014
Neg logit contributions
rol-0.012
 Woo-0.011
rified-0.011
pan-0.010
rier-0.010
Tokencle
Token ID2375
Feature activation-7.81e-02

Pos logit contributions
 adventures+0.177
 search+0.173
 work+0.171
 own+0.169
 family+0.165
Neg logit contributions
pan-0.221
rier-0.219
 Woo-0.217
adies-0.215
otion-0.214
Token,
Token ID11
Feature activation-8.24e-04

Pos logit contributions
pa+0.109
 work+0.103
 adventures+0.101
 search+0.099
 own+0.091
Neg logit contributions
 Woo-0.131
pan-0.130
rol-0.126
rified-0.126
rier-0.124
Token are
Token ID389
Feature activation-1.20e-03

Pos logit contributions
pa+0.001
 work+0.001
 adventures+0.001
 search+0.001
 family+0.001
Neg logit contributions
pan-0.001
 Woo-0.001
otion-0.001
rier-0.001
rol-0.001
Token you
Token ID345
Feature activation-1.48e-03

Pos logit contributions
pa+0.002
 work+0.002
 adventures+0.002
 search+0.002
 family+0.002
Neg logit contributions
 Woo-0.002
rol-0.002
pan-0.002
rier-0.002
otion-0.002
Token stealing
Token ID16568
Feature activation+5.32e-02

Pos logit contributions
pa+0.002
 work+0.002
 adventures+0.002
 search+0.002
 each+0.002
Neg logit contributions
pan-0.003
 Woo-0.003
rier-0.003
rol-0.003
otion-0.003
Token even
Token ID772
Feature activation-5.53e-02

Pos logit contributions
pa+0.006
 adventures+0.005
 packages+0.005
 surroundings+0.005
 work+0.005
Neg logit contributions
pan-0.007
rol-0.007
otion-0.007
rified-0.006
ainted-0.006
Token did
Token ID750
Feature activation-1.89e-02

Pos logit contributions
pa+0.081
 adventures+0.064
 work+0.063
 family+0.063
 search+0.062
Neg logit contributions
pan-0.091
 coward-0.085
 Lia-0.084
rified-0.084
 Woo-0.084
Token some
Token ID617
Feature activation-7.25e-03

Pos logit contributions
pa+0.026
 work+0.024
 adventures+0.022
 search+0.021
 each+0.021
Neg logit contributions
pan-0.037
rol-0.034
 Woo-0.034
rier-0.034
rified-0.034
Token adding
Token ID4375
Feature activation-5.67e-02

Pos logit contributions
pa+0.012
 search+0.009
 family+0.009
 work+0.009
 adventures+0.009
Neg logit contributions
pan-0.012
rier-0.012
 downward-0.011
otion-0.011
 coward-0.011
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
pa+0.094
Daddy+0.072
 family+0.071
 search+0.069
 work+0.069
Neg logit contributions
pan-0.093
 downward-0.087
rier-0.087
rol-0.085
 Woo-0.084
Token subtract
Token ID34128
Feature activation-1.38e-01

Pos logit contributions
pa+0.022
ossible+0.018
 family+0.017
 grandparents+0.017
 own+0.016
Neg logit contributions
pan-0.019
 downward-0.016
 swarm-0.016
otion-0.016
 disappears-0.016
Tokening
Token ID278
Feature activation-1.67e-03

Pos logit contributions
pa+0.226
 search+0.200
 family+0.193
 work+0.188
 adventures+0.188
Neg logit contributions
pan-0.230
rol-0.227
rier-0.225
otion-0.211
 downward-0.206
Token with
Token ID351
Feature activation-1.11e-02

Pos logit contributions
pa+0.002
 family+0.002
 search+0.002
 adventures+0.002
 work+0.002
Neg logit contributions
pan-0.003
rier-0.003
rol-0.003
otion-0.003
rified-0.003
Token the
Token ID262
Feature activation-1.99e-02

Pos logit contributions
pa+0.015
 work+0.012
 search+0.012
 adventures+0.011
 family+0.011
Neg logit contributions
pan-0.021
 Woo-0.019
rier-0.019
rified-0.019
otion-0.019
Token flowers
Token ID12734
Feature activation-2.65e-02

Pos logit contributions
pa+0.032
 search+0.023
 adventures+0.023
 each+0.023
 un+0.023
Neg logit contributions
pan-0.033
rier-0.033
 downward-0.032
rified-0.031
 coward-0.030
Token.
Token ID13
Feature activation-1.76e-02

Pos logit contributions
pa+0.032
 adventures+0.024
 work+0.024
 search+0.022
 family+0.022
Neg logit contributions
pan-0.058
rier-0.053
otion-0.053
rol-0.053
 Woo-0.051
Token didn
Token ID1422
Feature activation+1.36e-02

Pos logit contributions
pa+0.062
 adventures+0.055
 work+0.052
 own+0.048
 search+0.048
Neg logit contributions
pan-0.087
rified-0.083
 Woo-0.081
rol-0.081
 coward-0.080
Token't
Token ID470
Feature activation+6.53e-03

Pos logit contributions
pan+0.015
rol+0.014
rier+0.014
 Woo+0.013
rified+0.013
Neg logit contributions
pa-0.030
 work-0.028
 adventures-0.027
 search-0.026
 each-0.026
Token know
Token ID760
Feature activation-9.68e-02

Pos logit contributions
pan+0.013
 Woo+0.012
 coward+0.012
otion+0.012
ainted+0.011
Neg logit contributions
pa-0.009
 adventures-0.007
 family-0.006
 grand-0.006
 work-0.006
Token what
Token ID644
Feature activation-3.53e-02

Pos logit contributions
pa+0.128
 work+0.117
 adventures+0.109
 search+0.108
 own+0.102
Neg logit contributions
pan-0.188
otion-0.176
rier-0.175
 Woo-0.175
rol-0.173
Token "
Token ID366
Feature activation-1.28e-02

Pos logit contributions
pa+0.057
 work+0.050
 forth+0.047
 search+0.047
urious+0.046
Neg logit contributions
pan-0.057
 Woo-0.052
 coward-0.050
otion-0.049
 liar-0.048
Tokengas
Token ID22649
Feature activation-1.37e-01

Pos logit contributions
pa+0.024
 adventures+0.022
 work+0.022
 family+0.021
 adventure+0.021
Neg logit contributions
ainted-0.018
rol-0.018
rified-0.017
pan-0.016
plets-0.016
Token"
Token ID1
Feature activation-4.24e-02

Pos logit contributions
pa+0.262
 adventures+0.256
 work+0.244
 search+0.240
 each+0.229
Neg logit contributions
pan-0.189
otion-0.181
rier-0.173
rol-0.165
rified-0.165
Token was
Token ID373
Feature activation-1.93e-02

Pos logit contributions
 search+0.061
pa+0.060
 adventures+0.058
 work+0.058
ossible+0.056
Neg logit contributions
rified-0.066
rier-0.065
ainted-0.065
pan-0.062
rol-0.062
Token,
Token ID11
Feature activation+8.24e-04

Pos logit contributions
pa+0.031
 work+0.028
 adventures+0.027
 search+0.027
 family+0.026
Neg logit contributions
pan-0.033
 Woo-0.031
rier-0.031
otion-0.030
rol-0.030
Token but
Token ID475
Feature activation-1.71e-02

Pos logit contributions
pan+0.001
rier+0.001
rified+0.001
 Woo+0.001
otion+0.001
Neg logit contributions
pa-0.001
 work-0.001
 adventures-0.001
 search-0.001
 own-0.001
Token it
Token ID340
Feature activation+6.42e-03

Pos logit contributions
pa+0.027
 adventures+0.026
 work+0.026
 search+0.024
 own+0.023
Neg logit contributions
pan-0.027
rified-0.025
rier-0.025
otion-0.025
 Woo-0.024
Token before
Token ID878
Feature activation-1.16e-02

Pos logit contributions
pan+0.001
rol+0.001
 Woo+0.001
rier+0.001
rified+0.001
Neg logit contributions
pa-0.000
 work-0.000
 adventures-0.000
 search-0.000
 family-0.000
Token.
Token ID13
Feature activation-9.47e-03

Pos logit contributions
pa+0.011
 work+0.008
 adventures+0.007
 search+0.007
 own+0.007
Neg logit contributions
pan-0.027
otion-0.026
 Woo-0.026
rier-0.025
rified-0.025
Token
Token ID198
Feature activation-2.10e-02

Pos logit contributions
pa+0.011
 work+0.010
 adventures+0.010
 each+0.009
 search+0.009
Neg logit contributions
pan-0.020
rol-0.019
rier-0.019
 Woo-0.018
otion-0.018
Token
Token ID198
Feature activation-2.26e-02

Pos logit contributions
pa+0.024
 work+0.021
 adventures+0.019
 search+0.018
 ideas+0.017
Neg logit contributions
 Woo-0.043
pan-0.043
otion-0.042
rol-0.041
 Lia-0.040
Token"
Token ID1
Feature activation-3.28e-02

Pos logit contributions
pa+0.037
 work+0.033
 adventures+0.032
 search+0.031
 family+0.030
Neg logit contributions
pan-0.039
 Woo-0.036
rol-0.036
rier-0.036
otion-0.035
TokenUn
Token ID3118
Feature activation-1.37e-01

Pos logit contributions
pa+0.059
 work+0.054
 adventures+0.050
 search+0.050
 family+0.049
Neg logit contributions
pan-0.051
 Woo-0.048
rier-0.047
rified-0.046
otion-0.046
Tokenic
Token ID291
Feature activation-8.14e-02

Pos logit contributions
 adventures+0.175
 family+0.175
 own+0.167
 work+0.166
pa+0.166
Neg logit contributions
pan-0.239
rier-0.221
otion-0.214
 coward-0.210
adies-0.209
Tokenorn
Token ID1211
Feature activation-1.96e-02

Pos logit contributions
pa+0.113
 adventures+0.113
 search+0.108
 work+0.102
 family+0.102
Neg logit contributions
pan-0.145
otion-0.134
rier-0.133
rified-0.129
 coward-0.127
Token is
Token ID318
Feature activation-5.08e-03

Pos logit contributions
pa+0.026
 adventures+0.022
 work+0.022
 search+0.021
 family+0.021
Neg logit contributions
pan-0.038
rier-0.036
otion-0.035
 Woo-0.035
rol-0.035
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
pa+0.008
 work+0.006
 adventures+0.006
 search+0.006
 each+0.006
Neg logit contributions
pan-0.009
 Woo-0.009
rier-0.009
rol-0.008
otion-0.008
Token rare
Token ID4071
Feature activation-2.62e-02

Pos logit contributions
pa+0.019
 work+0.017
 adventures+0.016
 search+0.016
 each+0.016
Neg logit contributions
pan-0.021
 Woo-0.020
rier-0.020
rol-0.019
otion-0.019
Token.
Token ID13
Feature activation-3.29e-03

Pos logit contributions
pa+0.002
 work+0.002
 adventures+0.002
 search+0.002
 each+0.002
Neg logit contributions
pan-0.009
 Woo-0.008
rol-0.008
rier-0.008
otion-0.008
Token Tim
Token ID5045
Feature activation-1.09e-02

Pos logit contributions
pa+0.005
 work+0.004
 adventures+0.004
 search+0.004
 family+0.004
Neg logit contributions
pan-0.006
 Woo-0.006
rol-0.006
otion-0.005
rified-0.005
Token said
Token ID531
Feature activation-2.87e-02

Pos logit contributions
pa+0.010
 work+0.009
 adventures+0.009
 search+0.009
 family+0.008
Neg logit contributions
pan-0.022
ainted-0.022
rol-0.022
 Woo-0.021
rified-0.021
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
pa+0.019
 work+0.017
 each+0.017
 family+0.016
 dad+0.015
Neg logit contributions
rol-0.073
pan-0.072
rified-0.071
ainted-0.070
 Woo-0.069
Token "
Token ID366
Feature activation-2.41e-03

Pos logit contributions
pan+0.030
 Woo+0.028
rier+0.027
otion+0.027
rol+0.027
Neg logit contributions
pa-0.032
 work-0.029
 adventures-0.028
 search-0.027
 family-0.026
TokenUn
Token ID3118
Feature activation-1.37e-01

Pos logit contributions
pa+0.005
 work+0.005
 adventures+0.004
 family+0.004
 search+0.004
Neg logit contributions
rol-0.003
 Woo-0.003
pan-0.003
rified-0.003
plets-0.003
Tokencle
Token ID2375
Feature activation-7.70e-02

Pos logit contributions
pa+0.209
 adventures+0.204
 family+0.202
 work+0.202
 search+0.198
Neg logit contributions
pan-0.190
plets-0.185
rier-0.181
arling-0.177
 liar-0.176
Token Jack
Token ID3619
Feature activation-4.26e-02

Pos logit contributions
pa+0.104
 work+0.096
 adventures+0.092
 search+0.089
 own+0.084
Neg logit contributions
pan-0.141
 Woo-0.136
rier-0.132
rified-0.132
rol-0.132
Token,
Token ID11
Feature activation+7.07e-03

Pos logit contributions
pa+0.052
 adventures+0.046
 work+0.045
 search+0.043
 family+0.041
Neg logit contributions
pan-0.085
 Woo-0.080
otion-0.079
rified-0.079
rol-0.079
Token let
Token ID1309
Feature activation+2.65e-02

Pos logit contributions
pan+0.012
 Woo+0.011
otion+0.011
rier+0.011
rol+0.011
Neg logit contributions
pa-0.011
 work-0.010
 adventures-0.010
 search-0.009
 family-0.009
Token's
Token ID338
Feature activation-3.39e-02

Pos logit contributions
pan+0.058
 Woo+0.058
rified+0.055
rol+0.055
otion+0.055
Neg logit contributions
pa-0.026
 adventures-0.022
 work-0.021
 search-0.020
 family-0.018
Token the
Token ID262
Feature activation-2.32e-02

Pos logit contributions
pa+0.036
 work+0.031
 adventures+0.028
 search+0.028
 family+0.028
Neg logit contributions
pan-0.057
 Woo-0.054
rier-0.053
rified-0.052
rol-0.051
Token screen
Token ID3159
Feature activation-4.78e-02

Pos logit contributions
pa+0.040
 work+0.035
 adventures+0.033
 search+0.033
 asked+0.033
Neg logit contributions
pan-0.037
rier-0.034
 Woo-0.034
rified-0.033
rol-0.032
Token.
Token ID13
Feature activation-2.72e-02

Pos logit contributions
pa+0.042
 work+0.032
 adventures+0.029
 search+0.027
 family+0.027
Neg logit contributions
pan-0.118
rol-0.111
rier-0.111
otion-0.108
rified-0.108
Token 1
Token ID352
Feature activation-1.12e-01

Pos logit contributions
pa+0.043
 work+0.041
 adventures+0.039
 search+0.038
 each+0.038
Neg logit contributions
pan-0.046
rier-0.043
 Woo-0.042
rol-0.042
otion-0.041
Token,
Token ID11
Feature activation-3.25e-03

Pos logit contributions
pa+0.139
 work+0.123
 search+0.119
 adventures+0.118
 own+0.110
Neg logit contributions
pan-0.221
rol-0.215
rified-0.213
 Woo-0.210
rier-0.208
Token 2
Token ID362
Feature activation-1.35e-01

Pos logit contributions
pa+0.005
 adventures+0.004
 work+0.004
 search+0.004
 own+0.004
Neg logit contributions
pan-0.006
rier-0.006
 Woo-0.006
rified-0.005
 coward-0.005
Token,
Token ID11
Feature activation-3.33e-03

Pos logit contributions
pa+0.180
 work+0.161
 search+0.157
 adventures+0.149
 own+0.146
Neg logit contributions
pan-0.248
rol-0.238
rified-0.237
otion-0.236
rier-0.235
Token 3
Token ID513
Feature activation-8.07e-02

Pos logit contributions
pa+0.005
 adventures+0.004
 work+0.004
 search+0.004
 own+0.004
Neg logit contributions
pan-0.006
rier-0.006
 Woo-0.006
rified-0.005
rol-0.005
Token,
Token ID11
Feature activation-3.80e-03

Pos logit contributions
pa+0.087
 work+0.079
 search+0.073
 adventures+0.067
 own+0.067
Neg logit contributions
pan-0.165
rol-0.162
rified-0.161
rier-0.159
otion-0.158
Token 4
Token ID604
Feature activation-1.06e-01

Pos logit contributions
pa+0.005
 work+0.005
 adventures+0.005
 search+0.004
 own+0.004
Neg logit contributions
pan-0.007
rol-0.007
 Woo-0.007
rier-0.007
otion-0.007
Token!
Token ID0
Feature activation-2.59e-02

Pos logit contributions
pa+0.138
 search+0.118
 work+0.118
 adventures+0.110
 own+0.107
Neg logit contributions
pan-0.199
rier-0.194
rified-0.190
rol-0.188
otion-0.187
Token.
Token ID13
Feature activation-2.35e-02

Pos logit contributions
pa+0.051
 work+0.038
 adventures+0.037
 search+0.033
 family+0.033
Neg logit contributions
pan-0.141
rol-0.133
rier-0.133
 Woo-0.132
otion-0.131
Token He
Token ID679
Feature activation-5.04e-02

Pos logit contributions
pa+0.024
 work+0.021
 adventures+0.019
 search+0.018
 each+0.018
Neg logit contributions
pan-0.054
rier-0.051
rol-0.051
 Woo-0.050
otion-0.050
Token counted
Token ID14789
Feature activation-8.35e-02

Pos logit contributions
pa+0.074
 work+0.060
 adventures+0.055
 search+0.054
 day+0.051
Neg logit contributions
 Woo-0.085
rol-0.083
pan-0.082
 Fif-0.082
 coward-0.080
Token 1
Token ID352
Feature activation-1.10e-01

Pos logit contributions
pa+0.112
 work+0.095
 adventures+0.089
 family+0.086
 search+0.086
Neg logit contributions
pan-0.158
rol-0.156
rier-0.152
 Woo-0.151
 downward-0.148
Token,
Token ID11
Feature activation-9.22e-04

Pos logit contributions
pa+0.152
 work+0.133
 search+0.124
 adventures+0.122
 family+0.122
Neg logit contributions
rier-0.196
pan-0.196
rol-0.195
rified-0.193
 downward-0.186
Token 2
Token ID362
Feature activation-1.35e-01

Pos logit contributions
pa+0.002
 adventures+0.001
 work+0.001
 search+0.001
 own+0.001
Neg logit contributions
pan-0.002
 Woo-0.002
rier-0.001
rified-0.001
 downward-0.001
Token,
Token ID11
Feature activation-1.54e-03

Pos logit contributions
pa+0.200
 work+0.173
 search+0.171
 own+0.158
 visit+0.157
Neg logit contributions
rier-0.225
pan-0.218
rified-0.218
rol-0.217
 Woo-0.214
Token 3
Token ID513
Feature activation-8.13e-02

Pos logit contributions
pa+0.003
 adventures+0.002
 work+0.002
 search+0.002
 own+0.002
Neg logit contributions
pan-0.003
 Woo-0.003
rier-0.003
rol-0.002
rified-0.002
Token,
Token ID11
Feature activation-2.39e-03

Pos logit contributions
pa+0.089
 work+0.083
 search+0.076
 visit+0.071
 own+0.069
Neg logit contributions
rol-0.158
rified-0.155
pan-0.154
rier-0.153
 Woo-0.152
Token 4
Token ID604
Feature activation-1.07e-01

Pos logit contributions
pa+0.003
 work+0.003
 adventures+0.003
 own+0.003
 search+0.003
Neg logit contributions
pan-0.004
 Woo-0.004
rol-0.004
rier-0.004
otion-0.004
Token!
Token ID0
Feature activation-2.45e-02

Pos logit contributions
pa+0.149
 search+0.127
 work+0.126
 visit+0.114
 own+0.113
Neg logit contributions
rier-0.187
rified-0.182
pan-0.181
 Woo-0.177
rol-0.177
Token and
Token ID290
Feature activation-1.72e-02

Pos logit contributions
pa+0.061
 work+0.049
 adventures+0.048
 search+0.047
Daddy+0.045
Neg logit contributions
pan-0.096
rier-0.089
otion-0.085
ion-0.084
 Woo-0.084
Token Dad
Token ID17415
Feature activation-1.07e-01

Pos logit contributions
pa+0.026
 adventures+0.020
 search+0.019
father+0.018
cles+0.018
Neg logit contributions
pan-0.029
rier-0.027
 coward-0.026
 Woo-0.026
 Fif-0.025
Token was
Token ID373
Feature activation-2.51e-02

Pos logit contributions
 adventures+0.128
pa+0.125
 search+0.112
 work+0.107
 surroundings+0.102
Neg logit contributions
pan-0.195
otion-0.179
 Woo-0.175
rier-0.173
rol-0.173
Token happy
Token ID3772
Feature activation-3.11e-02

Pos logit contributions
pa+0.036
 adventures+0.031
 work+0.031
 search+0.029
by+0.026
Neg logit contributions
pan-0.047
rier-0.045
rol-0.043
otion-0.043
 Woo-0.042
Token to
Token ID284
Feature activation-2.26e-02

Pos logit contributions
pa+0.037
 work+0.029
 search+0.028
 adventures+0.027
 own+0.027
Neg logit contributions
pan-0.062
rier-0.060
 Woo-0.057
otion-0.057
 coward-0.055
Token oblig
Token ID6263
Feature activation-1.35e-01

Pos logit contributions
pa+0.034
 adventures+0.034
 belongings+0.030
cles+0.030
 surroundings+0.029
Neg logit contributions
pan-0.036
otion-0.033
 Woo-0.030
 coward-0.030
rier-0.029
Tokene
Token ID68
Feature activation-1.91e-02

Pos logit contributions
 work+0.257
 adventures+0.254
 search+0.239
pa+0.223
 adventure+0.221
Neg logit contributions
rified-0.162
otion-0.157
rol-0.155
pan-0.150
rier-0.143
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
pa+0.013
 work+0.010
 adventures+0.009
 each+0.008
 family+0.008
Neg logit contributions
pan-0.050
rol-0.048
 Woo-0.047
otion-0.047
rier-0.047
Token He
Token ID679
Feature activation-4.49e-02

Pos logit contributions
pa+0.018
 work+0.016
 adventures+0.015
 search+0.014
 family+0.013
Neg logit contributions
pan-0.036
 Woo-0.034
rier-0.034
rol-0.034
rified-0.034
Token went
Token ID1816
Feature activation-1.92e-02

Pos logit contributions
pa+0.059
 adventures+0.053
 search+0.048
 work+0.047
 own+0.045
Neg logit contributions
pan-0.083
 Woo-0.076
 coward-0.074
rol-0.071
otion-0.071
Token out
Token ID503
Feature activation+2.27e-02

Pos logit contributions
pa+0.024
 adventures+0.020
 work+0.020
 family+0.018
 each+0.018
Neg logit contributions
pan-0.040
 Woo-0.038
rol-0.037
rier-0.037
otion-0.036
Token '
Token ID705
Feature activation-7.89e-02

Pos logit contributions
pa+0.047
 work+0.043
 each+0.040
 adventures+0.040
 family+0.038
Neg logit contributions
pan-0.072
rol-0.071
rified-0.068
 Woo-0.068
rier-0.067
Token Aunt
Token ID38074
Feature activation-2.82e-02

Pos logit contributions
pa+0.157
 work+0.145
 family+0.144
 adventures+0.143
 dad+0.142
Neg logit contributions
rol-0.100
 Woo-0.098
rified-0.095
plets-0.091
pan-0.088
Tokenies
Token ID444
Feature activation-3.01e-02

Pos logit contributions
pa+0.039
 work+0.034
 adventures+0.033
 search+0.031
 family+0.031
Neg logit contributions
pan-0.055
 Woo-0.051
rol-0.051
rier-0.051
otion-0.050
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
pa+0.045
 adventures+0.040
 work+0.039
 search+0.038
 family+0.037
Neg logit contributions
pan-0.054
otion-0.050
rier-0.049
rol-0.049
rified-0.049
Token un
Token ID555
Feature activation-1.18e-01

Pos logit contributions
pa+0.021
 adventures+0.018
 work+0.018
 search+0.017
 own+0.016
Neg logit contributions
pan-0.025
 Woo-0.024
 coward-0.022
rier-0.022
rified-0.021
Tokencles
Token ID5427
Feature activation-1.35e-01

Pos logit contributions
 adventures+0.189
pa+0.182
 family+0.180
 work+0.176
 day+0.174
Neg logit contributions
rol-0.188
pan-0.187
rier-0.176
 Woo-0.172
ainted-0.166
Token can
Token ID460
Feature activation-7.09e-03

Pos logit contributions
pa+0.187
 adventures+0.174
 search+0.159
 work+0.155
 family+0.150
Neg logit contributions
pan-0.262
rier-0.239
rol-0.238
otion-0.233
 Woo-0.233
Token give
Token ID1577
Feature activation-2.74e-02

Pos logit contributions
pa+0.010
 adventures+0.009
cles+0.007
 day+0.007
 family+0.007
Neg logit contributions
pan-0.013
 Woo-0.013
rier-0.012
rol-0.012
otion-0.012
Token me
Token ID502
Feature activation-7.35e-03

Pos logit contributions
pa+0.036
 adventures+0.032
 work+0.031
 search+0.031
 family+0.029
Neg logit contributions
 Woo-0.050
pan-0.048
rier-0.048
rol-0.047
rified-0.046
Token fake
Token ID8390
Feature activation+1.00e-02

Pos logit contributions
pa+0.010
 adventures+0.009
 work+0.009
 search+0.008
 family+0.008
Neg logit contributions
pan-0.014
 Woo-0.013
rier-0.013
rol-0.013
rified-0.012
Token pills
Token ID19521
Feature activation+9.04e-03

Pos logit contributions
pan+0.013
 Woo+0.013
rier+0.012
rol+0.012
rified+0.011
Neg logit contributions
pa-0.020
 work-0.017
 adventures-0.017
 search-0.017
 family-0.016
Token The
Token ID383
Feature activation-1.80e-03

Pos logit contributions
pa+0.001
 work+0.001
 search+0.001
 each+0.001
 asked+0.001
Neg logit contributions
pan-0.002
otion-0.001
rol-0.001
rier-0.001
 coward-0.001
Token girl
Token ID2576
Feature activation+3.22e-02

Pos logit contributions
pa+0.003
 work+0.003
 adventures+0.003
 each+0.003
 search+0.003
Neg logit contributions
pan-0.002
rified-0.002
rier-0.002
 Woo-0.002
 coward-0.002
Token was
Token ID373
Feature activation+4.69e-04

Pos logit contributions
pan+0.067
rified+0.064
 Woo+0.063
rol+0.062
 coward+0.061
Neg logit contributions
pa-0.038
 work-0.031
 adventures-0.031
 search-0.027
 own-0.026
Token too
Token ID1165
Feature activation-3.64e-02

Pos logit contributions
pan+0.001
rier+0.001
rol+0.001
 Woo+0.001
otion+0.001
Neg logit contributions
pa-0.001
 work-0.001
 adventures-0.001
 search-0.001
 family-0.001
Token pet
Token ID4273
Feature activation+2.28e-02

Pos logit contributions
pa+0.068
 adventures+0.059
 work+0.059
 search+0.057
 each+0.055
Neg logit contributions
pan-0.052
 Woo-0.051
rier-0.050
 coward-0.048
otion-0.047
Tokenrified
Token ID41301
Feature activation+1.70e-01

Pos logit contributions
pan+0.033
otion+0.029
rol+0.029
rier+0.028
 coward+0.028
Neg logit contributions
pa-0.042
 each-0.041
 work-0.038
 family-0.037
 adventures-0.036
Token to
Token ID284
Feature activation+3.99e-03

Pos logit contributions
pan+0.418
rol+0.396
 Woo+0.396
rier+0.396
otion+0.391
Neg logit contributions
pa-0.144
 work-0.118
 adventures-0.108
 search-0.099
 each-0.096
Token move
Token ID1445
Feature activation-1.22e-02

Pos logit contributions
pan+0.007
 Woo+0.007
otion+0.007
 coward+0.007
 Fif+0.006
Neg logit contributions
pa-0.006
 adventures-0.005
 family-0.005
 work-0.005
ossible-0.004
Token and
Token ID290
Feature activation+7.34e-03

Pos logit contributions
pa+0.013
 work+0.011
 adventures+0.010
 search+0.010
 each+0.010
Neg logit contributions
pan-0.027
rol-0.026
 Woo-0.026
rier-0.026
otion-0.025
Token just
Token ID655
Feature activation+3.47e-02

Pos logit contributions
pan+0.014
rier+0.014
rol+0.014
 Woo+0.013
otion+0.013
Neg logit contributions
pa-0.010
 work-0.009
 adventures-0.008
 search-0.008
 each-0.008
Token stared
Token ID18484
Feature activation+1.81e-02

Pos logit contributions
pan+0.064
rified+0.061
rol+0.060
 Woo+0.060
otion+0.060
Neg logit contributions
pa-0.050
 work-0.042
 adventures-0.041
 family-0.039
 search-0.038
Token crazy
Token ID7165
Feature activation+5.97e-02

Pos logit contributions
pan+0.144
rol+0.139
 coward+0.135
rier+0.134
rified+0.134
Neg logit contributions
pa-0.080
 work-0.065
 search-0.061
 adventures-0.060
 un-0.059
Token creatures
Token ID8109
Feature activation+5.14e-02

Pos logit contributions
pan+0.123
rier+0.114
rol+0.111
 Woo+0.110
otion+0.109
Neg logit contributions
pa-0.069
 search-0.055
 work-0.055
 un-0.052
 adventures-0.051
Token.
Token ID13
Feature activation+3.74e-02

Pos logit contributions
pan+0.110
rol+0.101
 Woo+0.100
otion+0.100
rier+0.099
Neg logit contributions
pa-0.062
 work-0.051
 adventures-0.048
 search-0.046
 family-0.045
Token Tim
Token ID5045
Feature activation+2.05e-02

Pos logit contributions
pan+0.066
 Woo+0.064
rol+0.063
otion+0.061
rified+0.061
Neg logit contributions
pa-0.057
 work-0.052
 adventures-0.049
 search-0.046
 each-0.045
Token said
Token ID531
Feature activation+5.93e-03

Pos logit contributions
pan+0.046
 Woo+0.046
rol+0.046
rified+0.046
ainted+0.044
Neg logit contributions
pa-0.017
 work-0.014
 adventures-0.014
 search-0.013
 family-0.012
Token goodbye
Token ID24829
Feature activation+1.69e-01

Pos logit contributions
pan+0.014
rol+0.013
 Woo+0.013
rified+0.013
otion+0.013
Neg logit contributions
pa-0.006
 work-0.005
 each-0.005
 adventures-0.004
 family-0.004
Token to
Token ID284
Feature activation+2.49e-02

Pos logit contributions
pan+0.332
rier+0.322
 Woo+0.322
rol+0.300
rified+0.298
Neg logit contributions
pa-0.219
 adventures-0.182
 work-0.180
 search-0.169
 family-0.157
Token Mel
Token ID5616
Feature activation-1.56e-02

Pos logit contributions
 Woo+0.044
pan+0.040
 Fif+0.038
otion+0.037
 coward+0.037
Neg logit contributions
pa-0.034
 adventures-0.031
 work-0.029
 search-0.029
urious-0.028
Tokenon
Token ID261
Feature activation-3.35e-02

Pos logit contributions
pa+0.017
 work+0.015
 adventures+0.014
 search+0.013
 family+0.013
Neg logit contributions
pan-0.035
 Woo-0.033
rier-0.033
rol-0.032
otion-0.032
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
pa+0.036
 work+0.031
 adventures+0.030
 search+0.028
 each+0.027
Neg logit contributions
pan-0.074
rier-0.070
rol-0.069
 Woo-0.069
otion-0.068
Token thanked
Token ID26280
Feature activation+7.67e-03

Pos logit contributions
pan+0.024
 Woo+0.024
rol+0.023
rified+0.022
rier+0.022
Neg logit contributions
pa-0.014
 work-0.012
 adventures-0.012
 search-0.011
 own-0.011
Token a
Token ID257
Feature activation+1.20e-01

Pos logit contributions
pan+0.047
rified+0.046
 Woo+0.044
 coward+0.044
ainted+0.043
Neg logit contributions
pa-0.034
 work-0.028
 adventures-0.027
 search-0.025
 day-0.025
Token pat
Token ID1458
Feature activation+1.11e-01

Pos logit contributions
pan+0.233
 Woo+0.216
rier+0.215
 coward+0.212
rol+0.208
Neg logit contributions
pa-0.168
 work-0.138
 each-0.133
 adventures-0.132
 search-0.123
Token.
Token ID13
Feature activation+5.25e-02

Pos logit contributions
pan+0.224
 Woo+0.210
rified+0.206
rol+0.206
rier+0.206
Neg logit contributions
pa-0.145
 adventures-0.132
 work-0.128
 search-0.122
 family-0.118
Token They
Token ID1119
Feature activation+6.19e-02

Pos logit contributions
pan+0.095
 Woo+0.090
rol+0.089
otion+0.087
rier+0.087
Neg logit contributions
pa-0.079
 work-0.072
 adventures-0.068
 search-0.064
 each-0.063
Token said
Token ID531
Feature activation+2.16e-02

Pos logit contributions
pan+0.124
 Woo+0.115
rol+0.113
rified+0.112
otion+0.108
Neg logit contributions
pa-0.080
 adventures-0.068
 work-0.063
 family-0.061
 search-0.060
Token goodbye
Token ID24829
Feature activation+1.76e-01

Pos logit contributions
pan+0.042
rol+0.041
rified+0.039
otion+0.039
rier+0.039
Neg logit contributions
pa-0.029
 work-0.027
 each-0.026
 adventures-0.025
 family-0.025
Token to
Token ID284
Feature activation+3.03e-02

Pos logit contributions
pan+0.337
 Woo+0.326
rier+0.325
rol+0.306
rified+0.299
Neg logit contributions
pa-0.233
 work-0.193
 adventures-0.192
 search-0.181
 family-0.166
Token each
Token ID1123
Feature activation-5.34e-02

Pos logit contributions
 Woo+0.054
pan+0.045
 Fif+0.044
 Rib+0.042
otion+0.042
Neg logit contributions
pa-0.045
 adventures-0.040
 search-0.038
 work-0.038
cles-0.038
Token other
Token ID584
Feature activation+4.07e-02

Pos logit contributions
pa+0.060
 work+0.051
 adventures+0.048
 search+0.046
 each+0.044
Neg logit contributions
pan-0.118
 Woo-0.111
rol-0.111
rier-0.111
otion-0.110
Token and
Token ID290
Feature activation+2.25e-02

Pos logit contributions
pan+0.085
 Woo+0.078
rier+0.077
rified+0.075
otion+0.073
Neg logit contributions
pa-0.045
 search-0.037
 adventures-0.037
 work-0.036
 own-0.033
Token promised
Token ID8072
Feature activation+5.15e-02

Pos logit contributions
pan+0.046
 Woo+0.045
rol+0.042
rified+0.042
 coward+0.042
Neg logit contributions
pa-0.027
 adventures-0.022
 work-0.022
 search-0.020
 own-0.019
Token garden
Token ID11376
Feature activation+7.71e-02

Pos logit contributions
rol+0.078
pan+0.078
rified+0.071
ainted+0.071
rier+0.070
Neg logit contributions
 work-0.073
pa-0.072
 family-0.070
 each-0.067
 dad-0.067
Token.
Token ID13
Feature activation+5.31e-02

Pos logit contributions
pan+0.171
rol+0.160
 Woo+0.155
rier+0.155
otion+0.154
Neg logit contributions
pa-0.084
 work-0.069
 adventures-0.062
 search-0.060
 un-0.056
Token Lily
Token ID20037
Feature activation+8.35e-02

Pos logit contributions
pan+0.091
 Woo+0.088
rol+0.087
otion+0.085
rified+0.084
Neg logit contributions
pa-0.083
 work-0.076
 adventures-0.072
 search-0.068
 each-0.066
Token is
Token ID318
Feature activation+6.39e-02

Pos logit contributions
pan+0.186
 Woo+0.179
rol+0.176
rified+0.174
otion+0.171
Neg logit contributions
pa-0.082
 work-0.067
 adventures-0.065
 search-0.060
 family-0.057
Token the
Token ID262
Feature activation+3.81e-02

Pos logit contributions
pan+0.132
rol+0.124
 Woo+0.124
rier+0.124
rified+0.122
Neg logit contributions
pa-0.079
 work-0.071
 adventures-0.066
 search-0.064
 family-0.063
Token seeker
Token ID45993
Feature activation+1.74e-01

Pos logit contributions
pan+0.069
rier+0.065
 Woo+0.065
rol+0.064
otion+0.063
Neg logit contributions
pa-0.057
 work-0.051
 adventures-0.049
 search-0.047
 each-0.046
Token and
Token ID290
Feature activation+4.50e-02

Pos logit contributions
pan+0.442
rol+0.438
rier+0.433
 Woo+0.430
otion+0.421
Neg logit contributions
pa-0.127
 work-0.095
 each-0.085
 adventures-0.082
 family-0.080
Token Ben
Token ID3932
Feature activation+2.74e-02

Pos logit contributions
pan+0.080
rier+0.077
rol+0.075
otion+0.074
rified+0.073
Neg logit contributions
pa-0.067
 work-0.065
 each-0.062
 family-0.062
 adventures-0.060
Token is
Token ID318
Feature activation+4.63e-02

Pos logit contributions
pan+0.062
rier+0.059
rol+0.058
 Woo+0.058
otion+0.057
Neg logit contributions
pa-0.029
 work-0.025
 adventures-0.024
 search-0.022
 each-0.022
Token the
Token ID262
Feature activation+2.56e-02

Pos logit contributions
pan+0.095
rier+0.093
rol+0.092
rified+0.091
 Woo+0.091
Neg logit contributions
pa-0.057
 work-0.053
 adventures-0.049
 search-0.047
 family-0.046
Token h
Token ID289
Feature activation+6.90e-02

Pos logit contributions
pan+0.041
rol+0.039
rier+0.038
 Woo+0.038
otion+0.038
Neg logit contributions
pa-0.043
 work-0.040
 adventures-0.039
 family-0.037
 search-0.037
Token's
Token ID338
Feature activation+3.50e-02

Pos logit contributions
pan+0.053
 Woo+0.050
rol+0.048
otion+0.048
rier+0.046
Neg logit contributions
pa-0.029
 adventures-0.028
 work-0.026
 search-0.024
 ideas-0.024
Token floor
Token ID4314
Feature activation+2.40e-02

Pos logit contributions
pan+0.063
rier+0.061
 Woo+0.056
rol+0.054
 liar+0.053
Neg logit contributions
pa-0.047
 adventures-0.037
 work-0.037
 each-0.037
 search-0.036
Token,
Token ID11
Feature activation+7.02e-02

Pos logit contributions
pan+0.058
rier+0.053
rol+0.053
 Woo+0.052
otion+0.051
Neg logit contributions
pa-0.024
 work-0.018
 adventures-0.018
 search-0.017
 family-0.017
Token she
Token ID673
Feature activation+6.90e-02

Pos logit contributions
pan+0.133
 Woo+0.124
rol+0.118
rier+0.118
otion+0.118
Neg logit contributions
pa-0.102
 work-0.088
 adventures-0.086
 search-0.081
 own-0.078
Token said
Token ID531
Feature activation+1.02e-02

Pos logit contributions
pan+0.133
rol+0.124
otion+0.123
 Woo+0.120
 coward+0.119
Neg logit contributions
pa-0.084
 adventures-0.074
 work-0.070
 search-0.068
 each-0.064
Token goodbye
Token ID24829
Feature activation+1.68e-01

Pos logit contributions
pan+0.020
rol+0.020
 Woo+0.019
rier+0.019
rified+0.019
Neg logit contributions
pa-0.013
 work-0.012
 adventures-0.011
 each-0.011
 family-0.011
Token to
Token ID284
Feature activation+2.47e-02

Pos logit contributions
pan+0.325
 Woo+0.309
rier+0.309
rol+0.292
rified+0.285
Neg logit contributions
pa-0.224
 adventures-0.189
 work-0.188
 search-0.177
 family-0.165
Token the
Token ID262
Feature activation+2.22e-02

Pos logit contributions
 Woo+0.041
pan+0.035
otion+0.034
 Fif+0.032
 Lia+0.031
Neg logit contributions
pa-0.037
 adventures-0.036
urious-0.034
 search-0.033
cles-0.033
Token man
Token ID582
Feature activation-4.91e-03

Pos logit contributions
pan+0.039
rier+0.037
 Woo+0.036
rified+0.035
rol+0.034
Neg logit contributions
pa-0.034
 work-0.029
 adventures-0.029
 each-0.028
 search-0.028
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
pa+0.006
 adventures+0.005
 work+0.005
 family+0.004
 search+0.004
Neg logit contributions
pan-0.010
rol-0.009
otion-0.009
rier-0.009
 Woo-0.009
Token went
Token ID1816
Feature activation+1.31e-02

Pos logit contributions
pan+0.035
 Woo+0.034
rol+0.031
 coward+0.031
rified+0.031
Neg logit contributions
pa-0.021
 adventures-0.018
 work-0.017
 search-0.016
 own-0.016
Token laughed
Token ID13818
Feature activation+3.66e-02

Pos logit contributions
pan+0.037
 Woo+0.035
rol+0.035
rified+0.034
rier+0.034
Neg logit contributions
pa-0.024
 work-0.021
 adventures-0.020
 search-0.019
 family-0.018
Token so
Token ID523
Feature activation-1.48e-02

Pos logit contributions
pan+0.098
rol+0.095
otion+0.093
 Woo+0.093
rier+0.093
Neg logit contributions
pa-0.022
 work-0.018
 adventures-0.016
 each-0.016
 family-0.014
Token hard
Token ID1327
Feature activation-6.77e-03

Pos logit contributions
pa+0.021
 work+0.019
 adventures+0.018
 search+0.018
 family+0.017
Neg logit contributions
pan-0.028
 Woo-0.026
rier-0.026
rol-0.026
otion-0.026
Token that
Token ID326
Feature activation+1.78e-03

Pos logit contributions
pa+0.007
 work+0.005
 search+0.005
 adventures+0.005
 family+0.005
Neg logit contributions
pan-0.016
rier-0.015
 Woo-0.015
otion-0.014
rified-0.014
Token they
Token ID484
Feature activation+3.65e-02

Pos logit contributions
pan+0.003
 Woo+0.003
otion+0.003
rier+0.003
 coward+0.002
Neg logit contributions
pa-0.003
 work-0.003
 adventures-0.003
 search-0.002
 own-0.002
Token fell
Token ID3214
Feature activation+7.86e-02

Pos logit contributions
pan+0.073
 Woo+0.068
rol+0.068
rier+0.067
rified+0.067
Neg logit contributions
pa-0.049
 work-0.042
 adventures-0.041
 search-0.039
 family-0.038
Token on
Token ID319
Feature activation+1.60e-03

Pos logit contributions
pan+0.163
otion+0.162
rier+0.159
 Woo+0.158
rified+0.157
Neg logit contributions
pa-0.083
 work-0.080
 each-0.076
 adventures-0.071
 search-0.069
Token the
Token ID262
Feature activation+4.75e-03

Pos logit contributions
pan+0.004
 Woo+0.003
rier+0.003
rol+0.003
otion+0.003
Neg logit contributions
pa-0.002
 work-0.002
 adventures-0.001
 search-0.001
 family-0.001
Token grass
Token ID8701
Feature activation+2.43e-02

Pos logit contributions
pan+0.008
rol+0.008
 Woo+0.008
rier+0.008
otion+0.007
Neg logit contributions
pa-0.008
 work-0.007
 adventures-0.007
 search-0.006
 family-0.006
Token.
Token ID13
Feature activation+5.06e-03

Pos logit contributions
pan+0.065
 Woo+0.064
otion+0.063
rier+0.063
rol+0.063
Neg logit contributions
pa-0.013
 work-0.010
 each-0.009
 adventures-0.008
 search-0.008
Token\n
Token ID198
Feature activation-2.80e-03

Pos logit contributions
pan+0.013
 Woo+0.012
rol+0.012
rier+0.012
otion+0.012
Neg logit contributions
pa-0.004
 work-0.003
 adventures-0.003
 search-0.003
 family-0.003
Token scared
Token ID12008
Feature activation+5.20e-02

Pos logit contributions
pa+0.005
 work+0.005
 adventures+0.005
 search+0.005
 family+0.004
Neg logit contributions
pan-0.006
rier-0.005
rol-0.005
 Woo-0.005
otion-0.005
Token,
Token ID11
Feature activation+1.93e-02

Pos logit contributions
pan+0.136
rol+0.129
rier+0.128
 Woo+0.127
otion+0.127
Neg logit contributions
pa-0.038
 work-0.031
 adventures-0.028
 each-0.028
 family-0.025
Token but
Token ID475
Feature activation+4.39e-03

Pos logit contributions
pan+0.036
rol+0.034
rier+0.034
 Woo+0.034
otion+0.033
Neg logit contributions
pa-0.027
 work-0.025
 adventures-0.024
 search-0.023
 each-0.023
Token then
Token ID788
Feature activation-4.42e-02

Pos logit contributions
pan+0.008
rified+0.008
 Woo+0.008
otion+0.008
rier+0.007
Neg logit contributions
pa-0.006
 work-0.006
 adventures-0.005
 search-0.005
 family-0.005
Token she
Token ID673
Feature activation+3.48e-02

Pos logit contributions
pa+0.074
 work+0.067
 adventures+0.065
 search+0.062
 family+0.062
Neg logit contributions
pan-0.073
rol-0.068
rier-0.068
 Woo-0.067
otion-0.066
Token realized
Token ID6939
Feature activation+6.36e-02

Pos logit contributions
pan+0.058
 coward+0.058
rified+0.057
 Woo+0.057
plets+0.054
Neg logit contributions
pa-0.052
 work-0.046
 adventures-0.042
 search-0.039
 own-0.038
Token it
Token ID340
Feature activation+2.78e-02

Pos logit contributions
pan+0.112
 Woo+0.104
rol+0.104
rier+0.104
otion+0.102
Neg logit contributions
pa-0.099
 work-0.089
 adventures-0.085
 search-0.082
 family-0.081
Token was
Token ID373
Feature activation+1.24e-03

Pos logit contributions
pan+0.063
rified+0.062
otion+0.061
 Woo+0.060
rol+0.060
Neg logit contributions
pa-0.027
 work-0.021
 adventures-0.020
 family-0.019
 search-0.018
Token just
Token ID655
Feature activation+3.48e-02

Pos logit contributions
pan+0.002
 Woo+0.002
rier+0.002
otion+0.002
rol+0.002
Neg logit contributions
pa-0.002
 work-0.002
 adventures-0.001
 search-0.001
 family-0.001
Token another
Token ID1194
Feature activation+8.00e-03

Pos logit contributions
pan+0.068
 Woo+0.064
rier+0.063
rified+0.063
rol+0.063
Neg logit contributions
pa-0.047
 work-0.039
 adventures-0.038
 search-0.037
 family-0.037
Token boat
Token ID8848
Feature activation+4.09e-02

Pos logit contributions
pan+0.014
rier+0.012
 coward+0.012
 Woo+0.012
otion+0.012
Neg logit contributions
pa-0.013
 work-0.010
 adventures-0.010
 each-0.010
 un-0.010
Token,
Token ID11
Feature activation+3.50e-02

Pos logit contributions
pa+0.007
 work+0.006
 adventures+0.005
 search+0.005
 family+0.005
Neg logit contributions
pan-0.018
rier-0.017
rol-0.017
 Woo-0.016
rified-0.016
Token there
Token ID612
Feature activation+1.80e-02

Pos logit contributions
pan+0.068
 Woo+0.063
rol+0.063
rier+0.063
otion+0.062
Neg logit contributions
pa-0.049
 work-0.043
 adventures-0.041
 search-0.039
 family-0.039
Token was
Token ID373
Feature activation+5.75e-03

Pos logit contributions
pan+0.043
otion+0.042
 Woo+0.042
rol+0.041
rified+0.041
Neg logit contributions
pa-0.016
 work-0.013
 adventures-0.011
 search-0.010
 each-0.009
Token a
Token ID257
Feature activation+1.24e-02

Pos logit contributions
pan+0.014
rier+0.013
rol+0.013
rified+0.013
otion+0.013
Neg logit contributions
pa-0.005
 work-0.005
 adventures-0.004
 each-0.004
 family-0.004
Token bear
Token ID6842
Feature activation+2.37e-02

Pos logit contributions
pan+0.027
rol+0.025
rier+0.025
 Woo+0.025
otion+0.025
Neg logit contributions
pa-0.014
 work-0.013
 adventures-0.012
 search-0.011
 family-0.011
Token named
Token ID3706
Feature activation+1.11e-01

Pos logit contributions
pan+0.038
 Woo+0.035
rol+0.035
rier+0.035
otion+0.034
Neg logit contributions
pa-0.040
 work-0.037
 adventures-0.035
 search-0.034
 family-0.033
Token Ted
Token ID11396
Feature activation-2.07e-03

Pos logit contributions
pan+0.189
rier+0.174
rol+0.171
otion+0.169
rified+0.167
Neg logit contributions
pa-0.186
 work-0.169
 adventures-0.162
 search-0.159
 each-0.157
Token.
Token ID13
Feature activation+6.12e-03

Pos logit contributions
 work+0.002
pa+0.002
 packages+0.002
 adventures+0.002
cles+0.002
Neg logit contributions
pan-0.004
 Woo-0.004
otion-0.004
rified-0.004
 coward-0.003
Token He
Token ID679
Feature activation-3.64e-02

Pos logit contributions
pan+0.012
rier+0.011
rol+0.011
 Woo+0.011
otion+0.011
Neg logit contributions
pa-0.009
 work-0.008
 adventures-0.008
 search-0.008
 each-0.007
Token loved
Token ID6151
Feature activation-9.22e-03

Pos logit contributions
pa+0.045
 work+0.037
 ideas+0.033
 adventures+0.032
 search+0.032
Neg logit contributions
 Woo-0.073
 Fif-0.067
 coward-0.067
Flo-0.067
pan-0.067
Token to
Token ID284
Feature activation-6.29e-03

Pos logit contributions
pa+0.014
 search+0.011
 work+0.011
 own+0.010
Daddy+0.010
Neg logit contributions
 Woo-0.016
pan-0.015
 Fif-0.014
 downward-0.014
rol-0.013
Token they
Token ID484
Feature activation+1.51e-01

Pos logit contributions
pan+0.010
 Woo+0.010
rier+0.010
 seeker+0.009
otion+0.009
Neg logit contributions
pa-0.013
 adventures-0.011
 work-0.011
cles-0.011
father-0.011
Token heard
Token ID2982
Feature activation+1.68e-01

Pos logit contributions
pan+0.311
 Woo+0.289
rol+0.289
 coward+0.286
rified+0.284
Neg logit contributions
pa-0.182
 adventures-0.148
 work-0.145
 search-0.134
 packages-0.125
Token a
Token ID257
Feature activation+8.17e-02

Pos logit contributions
pan+0.328
 Woo+0.311
rier+0.306
otion+0.305
rol+0.305
Neg logit contributions
pa-0.217
 work-0.187
 search-0.177
 adventures-0.176
 own-0.168
Token loud
Token ID7812
Feature activation+9.06e-02

Pos logit contributions
pan+0.146
rier+0.138
 Woo+0.135
rol+0.132
otion+0.132
Neg logit contributions
pa-0.126
 work-0.106
 adventures-0.105
 each-0.100
 search-0.098
Token noise
Token ID7838
Feature activation+7.57e-02

Pos logit contributions
pan+0.163
rier+0.150
 Woo+0.149
otion+0.148
rol+0.147
Neg logit contributions
pa-0.138
 work-0.121
 adventures-0.119
 search-0.113
 family-0.112
Token from
Token ID422
Feature activation+6.40e-02

Pos logit contributions
pan+0.176
otion+0.165
 Woo+0.160
rier+0.160
rified+0.159
Neg logit contributions
pa-0.075
 adventures-0.061
 work-0.060
 own-0.058
 search-0.056
Token the
Token ID262
Feature activation+4.02e-02

Pos logit contributions
pan+0.126
 Woo+0.119
otion+0.114
rier+0.114
rol+0.113
Neg logit contributions
pa-0.083
 work-0.070
 adventures-0.069
 family-0.066
 own-0.066
Token kitchen
Token ID9592
Feature activation+3.99e-02

Pos logit contributions
pan+0.069
rier+0.063
 Woo+0.063
rol+0.062
rified+0.062
Neg logit contributions
pa-0.065
 work-0.058
 adventures-0.056
 search-0.053
 each-0.053
Token.
Token ID13
Feature activation+3.15e-02

Pos logit contributions
pan+0.094
rier+0.089
 Woo+0.086
rol+0.086
rified+0.086
Neg logit contributions
pa-0.039
 work-0.032
 adventures-0.030
 search-0.028
 each-0.026
Token It
Token ID632
Feature activation+2.38e-02

Pos logit contributions
pan+0.061
rier+0.057
rol+0.056
 Woo+0.055
otion+0.055
Neg logit contributions
pa-0.043
 work-0.038
 adventures-0.036
 search-0.036
 family-0.036
Token was
Token ID373
Feature activation+1.62e-02

Pos logit contributions
pan+0.049
rified+0.046
 swarm+0.046
rol+0.045
 Woo+0.045
Neg logit contributions
pa-0.029
 work-0.023
 adventures-0.023
 own-0.021
 ideas-0.020
Token there
Token ID612
Feature activation-1.01e-02

Pos logit contributions
pan+0.015
 Woo+0.014
rol+0.014
rier+0.014
otion+0.014
Neg logit contributions
pa-0.012
 work-0.011
 adventures-0.010
 search-0.010
 family-0.010
Token was
Token ID373
Feature activation-1.89e-02

Pos logit contributions
pa+0.010
 work+0.008
 adventures+0.007
 search+0.006
 packages+0.006
Neg logit contributions
pan-0.023
otion-0.022
 Woo-0.022
rified-0.022
rol-0.022
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
pa+0.016
 work+0.014
 adventures+0.014
 each+0.013
 family+0.013
Neg logit contributions
pan-0.048
rier-0.046
rol-0.045
rified-0.044
otion-0.044
Token little
Token ID1310
Feature activation+2.37e-02

Pos logit contributions
pa+0.012
 work+0.011
 family+0.010
 adventures+0.010
 search+0.009
Neg logit contributions
pan-0.025
rol-0.024
otion-0.023
rier-0.023
rified-0.023
Token boy
Token ID2933
Feature activation-1.23e-02

Pos logit contributions
pan+0.051
rol+0.049
otion+0.048
rier+0.047
rified+0.047
Neg logit contributions
pa-0.028
 work-0.025
 family-0.023
 adventures-0.023
 each-0.022
Token named
Token ID3706
Feature activation+9.05e-02

Pos logit contributions
pa+0.023
 work+0.022
 adventures+0.021
 search+0.020
 family+0.020
Neg logit contributions
pan-0.016
 Woo-0.015
otion-0.015
rol-0.015
rified-0.014
Token Tim
Token ID5045
Feature activation-1.19e-02

Pos logit contributions
pan+0.137
 Woo+0.133
rol+0.127
otion+0.126
rier+0.124
Neg logit contributions
pa-0.161
 work-0.144
 adventures-0.141
 search-0.133
 family-0.132
Tokenmy
Token ID1820
Feature activation-8.29e-02

Pos logit contributions
 adventures+0.010
 work+0.009
pa+0.009
 packages+0.008
 search+0.008
Neg logit contributions
pan-0.024
 Woo-0.024
ainted-0.023
rified-0.023
otion-0.023
Token.
Token ID13
Feature activation-9.91e-03

Pos logit contributions
pa+0.075
 adventures+0.065
 work+0.064
 search+0.057
 ideas+0.055
Neg logit contributions
pan-0.179
 Woo-0.178
 coward-0.165
rified-0.165
otion-0.163
Token Tim
Token ID5045
Feature activation-8.18e-03

Pos logit contributions
pa+0.016
 work+0.015
 adventures+0.014
 search+0.013
 family+0.013
Neg logit contributions
pan-0.016
 Woo-0.016
rol-0.015
otion-0.015
rier-0.015
Tokenmy
Token ID1820
Feature activation-8.34e-02

Pos logit contributions
 work+0.007
pa+0.006
 adventures+0.006
 packages+0.006
 search+0.006
Neg logit contributions
 Woo-0.016
pan-0.016
rified-0.016
ainted-0.016
 seeker-0.015
Token He
Token ID679
Feature activation-2.47e-02

Pos logit contributions
pan+0.001
rol+0.001
rier+0.001
 Woo+0.001
otion+0.001
Neg logit contributions
pa-0.000
 work-0.000
 adventures-0.000
 search-0.000
 family-0.000
Token did
Token ID750
Feature activation-4.08e-03

Pos logit contributions
pa+0.030
 work+0.025
 adventures+0.024
 search+0.022
 family+0.022
Neg logit contributions
pan-0.051
 Woo-0.049
rified-0.049
rol-0.048
 coward-0.048
Token not
Token ID407
Feature activation+2.34e-02

Pos logit contributions
pa+0.006
 work+0.006
 adventures+0.005
 search+0.005
 each+0.005
Neg logit contributions
pan-0.008
rol-0.007
rier-0.007
rified-0.007
 Woo-0.007
Token know
Token ID760
Feature activation-8.02e-02

Pos logit contributions
pan+0.041
 coward+0.040
 Woo+0.038
 liar+0.038
otion+0.037
Neg logit contributions
pa-0.034
 adventures-0.028
 family-0.026
 packages-0.026
 grandparents-0.026
Token he
Token ID339
Feature activation-3.90e-03

Pos logit contributions
pa+0.113
 work+0.100
 adventures+0.096
 search+0.091
 each+0.091
Neg logit contributions
pan-0.153
rol-0.144
 Woo-0.143
rier-0.143
rified-0.140
Token was
Token ID373
Feature activation-1.32e-03

Pos logit contributions
pa+0.005
 work+0.004
 adventures+0.004
 family+0.004
 search+0.004
Neg logit contributions
rified-0.008
 coward-0.007
ainted-0.007
 Woo-0.007
rol-0.007
Token in
Token ID287
Feature activation-2.29e-02

Pos logit contributions
pa+0.002
 work+0.002
 adventures+0.002
 search+0.002
 family+0.002
Neg logit contributions
pan-0.002
rier-0.002
 Woo-0.002
otion-0.002
rol-0.002
Token trouble
Token ID5876
Feature activation+4.86e-02

Pos logit contributions
pa+0.033
 work+0.028
 adventures+0.026
 own+0.025
 each+0.025
Neg logit contributions
pan-0.041
 Woo-0.040
rier-0.037
 coward-0.037
rified-0.037
Token.
Token ID13
Feature activation+5.07e-03

Pos logit contributions
pan+0.111
rier+0.107
otion+0.107
 Woo+0.106
rified+0.103
Neg logit contributions
pa-0.048
 work-0.036
 adventures-0.035
 search-0.034
 family-0.032
Token He
Token ID679
Feature activation-2.04e-02

Pos logit contributions
pan+0.011
 Woo+0.011
rier+0.011
rol+0.011
otion+0.011
Neg logit contributions
pa-0.005
 work-0.005
 adventures-0.004
 search-0.004
 each-0.004
Token only
Token ID691
Feature activation-1.42e-02

Pos logit contributions
pa+0.025
 work+0.021
 adventures+0.020
 search+0.018
 family+0.018
Neg logit contributions
pan-0.043
 Woo-0.041
rified-0.040
rol-0.040
rier-0.040
Token their
Token ID511
Feature activation+1.07e-02

Pos logit contributions
pan+0.044
rol+0.042
rier+0.041
otion+0.041
 Woo+0.040
Neg logit contributions
pa-0.029
 work-0.027
 adventures-0.025
 family-0.024
 search-0.024
Token walk
Token ID2513
Feature activation-8.27e-03

Pos logit contributions
 Woo+0.020
 goodbye+0.019
 downward+0.019
pan+0.018
rified+0.018
Neg logit contributions
pa-0.013
my-0.009
Daddy-0.009
 asked-0.009
cles-0.009
Token was
Token ID373
Feature activation-1.25e-02

Pos logit contributions
pa+0.009
 work+0.008
 adventures+0.008
 search+0.007
 family+0.007
Neg logit contributions
pan-0.018
 Woo-0.017
rol-0.017
rier-0.017
otion-0.017
Token over
Token ID625
Feature activation-3.20e-02

Pos logit contributions
pa+0.016
 work+0.015
 adventures+0.014
 search+0.013
 each+0.013
Neg logit contributions
pan-0.025
rier-0.024
rified-0.023
otion-0.023
 Woo-0.023
Token,
Token ID11
Feature activation+3.49e-03

Pos logit contributions
pa+0.043
 work+0.036
 adventures+0.034
 own+0.034
 search+0.033
Neg logit contributions
pan-0.058
 Woo-0.058
rified-0.055
rol-0.054
otion-0.053
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
pan+0.008
rol+0.007
rier+0.007
otion+0.007
 Woo+0.007
Neg logit contributions
pa-0.004
 work-0.004
 adventures-0.003
 family-0.003
 each-0.003
Token little
Token ID1310
Feature activation+2.90e-02

Pos logit contributions
pa+0.019
 adventures+0.017
 search+0.016
 each+0.016
ossible+0.015
Neg logit contributions
rier-0.021
pan-0.020
rified-0.019
 Woo-0.018
 coward-0.018
Token girl
Token ID2576
Feature activation+1.82e-02

Pos logit contributions
pan+0.059
rier+0.056
 Woo+0.056
rified+0.055
rol+0.055
Neg logit contributions
pa-0.037
 work-0.032
 adventures-0.031
 search-0.030
 each-0.029
Token and
Token ID290
Feature activation-8.59e-03

Pos logit contributions
pan+0.037
 Woo+0.036
rified+0.036
rol+0.035
 coward+0.034
Neg logit contributions
pa-0.022
 work-0.019
 adventures-0.018
 search-0.016
 own-0.016
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
pa+0.011
 adventures+0.010
 work+0.010
 own+0.010
 search+0.010
Neg logit contributions
pan-0.016
rified-0.015
rier-0.015
 Woo-0.015
otion-0.014
Token old
Token ID1468
Feature activation-4.88e-02

Pos logit contributions
pa+0.018
 adventures+0.015
 search+0.015
 each+0.015
 work+0.014
Neg logit contributions
rier-0.020
pan-0.019
rified-0.018
 Woo-0.018
 coward-0.017
Token to
Token ID284
Feature activation-1.78e-02

Pos logit contributions
pa+0.013
 work+0.011
 adventures+0.010
 search+0.009
 family+0.009
Neg logit contributions
pan-0.031
rier-0.029
 Woo-0.029
rol-0.029
otion-0.029
Token everyone
Token ID2506
Feature activation-5.31e-02

Pos logit contributions
pa+0.023
 work+0.020
 search+0.019
 adventures+0.019
ossible+0.017
Neg logit contributions
 Woo-0.032
rier-0.032
pan-0.032
 coward-0.031
 Lia-0.031
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
pa+0.057
 work+0.047
 search+0.045
 adventures+0.045
ossible+0.039
Neg logit contributions
pan-0.114
rier-0.109
otion-0.109
rol-0.105
 Woo-0.103
Token He
Token ID679
Feature activation-4.18e-02

Pos logit contributions
pa+0.019
 work+0.016
 adventures+0.015
 search+0.014
 family+0.014
Neg logit contributions
pan-0.029
 Woo-0.029
rol-0.028
rier-0.028
otion-0.028
Token saw
Token ID2497
Feature activation+2.51e-02

Pos logit contributions
pa+0.050
 work+0.042
 adventures+0.038
 search+0.038
 family+0.034
Neg logit contributions
pan-0.083
 Woo-0.083
 coward-0.081
rified-0.078
rol-0.078
Token Sara
Token ID24799
Feature activation+7.74e-03

Pos logit contributions
pan+0.055
rol+0.051
rier+0.051
 Woo+0.050
otion+0.050
Neg logit contributions
pa-0.030
 work-0.025
 adventures-0.024
 each-0.024
 family-0.023
Token playing
Token ID2712
Feature activation-5.37e-02

Pos logit contributions
pan+0.016
otion+0.016
 Woo+0.015
rol+0.015
rier+0.015
Neg logit contributions
pa-0.009
 adventures-0.007
 work-0.007
 search-0.007
 family-0.006
Token with
Token ID351
Feature activation-1.19e-02

Pos logit contributions
pa+0.062
 work+0.053
 adventures+0.050
 search+0.047
 family+0.046
Neg logit contributions
pan-0.117
 Woo-0.110
rier-0.109
rol-0.109
otion-0.108
Token her
Token ID607
Feature activation-8.32e-03

Pos logit contributions
pa+0.014
 work+0.012
 adventures+0.011
 search+0.011
 family+0.010
Neg logit contributions
pan-0.025
 Woo-0.024
rier-0.024
rol-0.024
otion-0.023
Token dolls
Token ID36062
Feature activation-1.34e-02

Pos logit contributions
pa+0.012
 search+0.009
Daddy+0.009
 adventures+0.009
 asked+0.009
Neg logit contributions
pan-0.015
 Woo-0.013
rier-0.013
 coward-0.013
 plun-0.013
Token in
Token ID287
Feature activation-3.97e-02

Pos logit contributions
pa+0.017
 work+0.013
 adventures+0.013
 search+0.012
Daddy+0.011
Neg logit contributions
pan-0.028
 Woo-0.026
otion-0.025
rier-0.024
rified-0.024
Token not
Token ID407
Feature activation+9.99e-03

Pos logit contributions
pan+0.062
 Woo+0.058
rol+0.058
rier+0.058
otion+0.057
Neg logit contributions
pa-0.059
 work-0.053
 adventures-0.051
 search-0.049
 family-0.048
Token enter
Token ID3802
Feature activation-2.55e-02

Pos logit contributions
 Woo+0.016
 swarm+0.016
pan+0.016
otion+0.016
 Fif+0.015
Neg logit contributions
pa-0.014
cles-0.012
 adventures-0.012
 favorites-0.010
by-0.010
Token".
Token ID1911
Feature activation-1.01e-02

Pos logit contributions
pa+0.034
 work+0.030
 adventures+0.029
 each+0.028
 search+0.027
Neg logit contributions
pan-0.050
rol-0.047
rier-0.047
 Woo-0.047
otion-0.046
Token She
Token ID1375
Feature activation+1.18e-02

Pos logit contributions
pa+0.013
 work+0.012
 search+0.011
 adventures+0.011
 each+0.011
Neg logit contributions
pan-0.020
rier-0.019
rol-0.018
otion-0.018
 Woo-0.018
Token opened
Token ID4721
Feature activation+1.51e-02

Pos logit contributions
pan+0.022
rified+0.021
 Woo+0.021
 coward+0.020
ainted+0.020
Neg logit contributions
pa-0.016
 work-0.014
 adventures-0.014
 own-0.012
 day-0.012
Token it
Token ID340
Feature activation+1.17e-02

Pos logit contributions
pan+0.035
 coward+0.035
rol+0.035
otion+0.034
rier+0.034
Neg logit contributions
pa-0.014
 work-0.013
 each-0.012
 search-0.011
 adventures-0.011
Token and
Token ID290
Feature activation-3.22e-03

Pos logit contributions
pan+0.029
 Woo+0.028
rol+0.028
rier+0.028
otion+0.027
Neg logit contributions
pa-0.010
 work-0.008
 adventures-0.007
 search-0.007
 each-0.006
Token turned
Token ID2900
Feature activation-5.93e-03

Pos logit contributions
pa+0.003
 work+0.003
 adventures+0.003
 search+0.002
 family+0.002
Neg logit contributions
pan-0.007
 Woo-0.007
rol-0.007
rier-0.007
otion-0.007
Token on
Token ID319
Feature activation-1.62e-02

Pos logit contributions
pa+0.007
 work+0.006
 adventures+0.005
 each+0.005
 search+0.005
Neg logit contributions
pan-0.013
rol-0.012
rier-0.012
 Woo-0.012
otion-0.012
Token the
Token ID262
Feature activation-9.48e-03

Pos logit contributions
pa+0.018
 work+0.016
 adventures+0.015
 search+0.014
 each+0.014
Neg logit contributions
pan-0.036
rol-0.034
rier-0.033
 Woo-0.033
otion-0.033
Token light
Token ID1657
Feature activation-7.82e-04

Pos logit contributions
pa+0.016
 work+0.015
 adventures+0.014
 search+0.014
 each+0.014
Neg logit contributions
pan-0.015
rier-0.014
 Woo-0.014
rol-0.014
otion-0.014
Token you
Token ID345
Feature activation-1.95e-03

Pos logit contributions
 Woo+0.081
pan+0.079
otion+0.079
 coward+0.075
rified+0.075
Neg logit contributions
pa-0.086
 work-0.075
 search-0.071
 adventures-0.071
 family-0.068
Token want
Token ID765
Feature activation-4.27e-02

Pos logit contributions
pa+0.003
 adventures+0.003
 family+0.002
 packages+0.002
 grandparents+0.002
Neg logit contributions
pan-0.003
otion-0.003
 coward-0.003
 Woo-0.003
 seeker-0.003
Token to
Token ID284
Feature activation-1.50e-02

Pos logit contributions
pa+0.053
 work+0.044
 adventures+0.042
 search+0.040
 family+0.040
Neg logit contributions
pan-0.088
 Woo-0.085
rol-0.082
otion-0.081
rier-0.081
Token see
Token ID766
Feature activation-2.21e-02

Pos logit contributions
pa+0.018
 work+0.014
 adventures+0.014
 search+0.013
 family+0.013
Neg logit contributions
pan-0.032
 Woo-0.031
rier-0.030
rol-0.030
otion-0.029
Token it
Token ID340
Feature activation+6.39e-03

Pos logit contributions
pa+0.036
 work+0.033
 adventures+0.031
 search+0.031
 family+0.030
Neg logit contributions
pan-0.034
 Woo-0.034
rier-0.032
otion-0.032
rified-0.032
Token?"
Token ID1701
Feature activation-5.70e-03

Pos logit contributions
pan+0.012
 Woo+0.011
rier+0.011
otion+0.011
rol+0.011
Neg logit contributions
pa-0.009
 work-0.008
 adventures-0.007
 family-0.007
 search-0.007
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
pa+0.006
 work+0.005
 adventures+0.005
 search+0.004
 each+0.004
Neg logit contributions
pan-0.013
 Woo-0.013
rier-0.012
otion-0.012
 coward-0.012
Token
Token ID198
Feature activation-2.20e-02

Pos logit contributions
pa+0.019
 work+0.016
 adventures+0.014
 search+0.013
 family+0.012
Neg logit contributions
pan-0.049
 Woo-0.048
otion-0.046
rol-0.046
rier-0.046
TokenSam
Token ID16305
Feature activation-2.29e-02

Pos logit contributions
pa+0.027
 work+0.024
 adventures+0.023
 search+0.022
 each+0.022
Neg logit contributions
pan-0.046
rol-0.044
rier-0.044
 Woo-0.043
otion-0.043
Token did
Token ID750
Feature activation-1.65e-02

Pos logit contributions
pa+0.033
 work+0.030
 adventures+0.028
 family+0.026
 own+0.026
Neg logit contributions
pan-0.041
rified-0.040
ainted-0.037
 Woo-0.037
 downward-0.036
Token not
Token ID407
Feature activation+8.66e-03

Pos logit contributions
 work+0.022
pa+0.022
 adventures+0.019
 each+0.019
 search+0.019
Neg logit contributions
pan-0.033
rol-0.032
rified-0.030
 Woo-0.030
rier-0.030
Token big
Token ID1263
Feature activation-2.34e-02

Pos logit contributions
pa+0.019
 work+0.017
 adventures+0.016
 search+0.015
 family+0.015
Neg logit contributions
pan-0.033
rol-0.031
rier-0.031
 Woo-0.031
otion-0.030
Token bird
Token ID6512
Feature activation+3.72e-02

Pos logit contributions
pa+0.033
 work+0.030
 adventures+0.028
 search+0.027
 each+0.027
Neg logit contributions
pan-0.044
 Woo-0.041
rier-0.041
rol-0.041
otion-0.040
Token flying
Token ID7348
Feature activation+6.07e-03

Pos logit contributions
pan+0.078
rier+0.075
 Woo+0.074
rol+0.074
rified+0.073
Neg logit contributions
pa-0.045
 work-0.040
 adventures-0.038
 search-0.037
 each-0.036
Token really
Token ID1107
Feature activation-7.38e-02

Pos logit contributions
pan+0.016
otion+0.016
rier+0.016
 coward+0.016
 Woo+0.016
Neg logit contributions
 each-0.003
 work-0.003
pa-0.003
 day-0.003
 search-0.003
Token high
Token ID1029
Feature activation-1.91e-03

Pos logit contributions
 work+0.091
 each+0.090
 own+0.081
pa+0.079
 adventures+0.077
Neg logit contributions
pan-0.157
rol-0.148
otion-0.144
rified-0.141
 Woo-0.138
Token in
Token ID287
Feature activation-4.18e-02

Pos logit contributions
pa+0.002
 work+0.002
 adventures+0.001
 search+0.001
 each+0.001
Neg logit contributions
pan-0.005
rol-0.004
 Woo-0.004
rier-0.004
otion-0.004
Token the
Token ID262
Feature activation-2.28e-02

Pos logit contributions
pa+0.044
 work+0.038
 adventures+0.036
 search+0.034
 each+0.033
Neg logit contributions
pan-0.096
rier-0.092
rol-0.092
 Woo-0.090
otion-0.090
Token sky
Token ID6766
Feature activation-2.51e-02

Pos logit contributions
pa+0.034
 work+0.031
 adventures+0.030
 search+0.029
 family+0.029
Neg logit contributions
pan-0.042
rier-0.039
rol-0.039
otion-0.039
 Woo-0.039
Token.
Token ID13
Feature activation-1.79e-02

Pos logit contributions
pa+0.023
 work+0.017
 adventures+0.016
 search+0.014
 family+0.014
Neg logit contributions
pan-0.061
rol-0.058
rier-0.057
 Woo-0.057
otion-0.057
Token She
Token ID1375
Feature activation-7.15e-03

Pos logit contributions
pa+0.021
 work+0.020
 asked+0.020
 search+0.020
 family+0.019
Neg logit contributions
pan-0.037
rier-0.036
rol-0.036
otion-0.035
 coward-0.035
Token pointed
Token ID6235
Feature activation-8.93e-03

Pos logit contributions
pa+0.009
 work+0.007
 adventures+0.007
 search+0.006
 own+0.006
Neg logit contributions
 Woo-0.014
pan-0.014
rified-0.014
 coward-0.013
rol-0.013
Token
Token ID198
Feature activation-2.42e-02

Pos logit contributions
pa+0.021
 work+0.019
 adventures+0.016
 each+0.015
 search+0.015
Neg logit contributions
 Woo-0.031
rol-0.029
pan-0.029
 Fif-0.028
otion-0.028
Token
Token ID198
Feature activation-2.60e-02

Pos logit contributions
pa+0.030
 work+0.028
 packages+0.023
 ideas+0.023
 adventures+0.023
Neg logit contributions
 Woo-0.046
otion-0.045
pan-0.045
rol-0.044
 coward-0.042
TokenOne
Token ID3198
Feature activation-7.19e-02

Pos logit contributions
pa+0.029
 work+0.025
 adventures+0.021
 search+0.020
 family+0.020
Neg logit contributions
pan-0.055
 Woo-0.055
rol-0.054
otion-0.053
 coward-0.052
Token day
Token ID1110
Feature activation-6.19e-02

Pos logit contributions
pa+0.061
 work+0.046
 adventures+0.041
 search+0.039
 packages+0.036
Neg logit contributions
 Woo-0.165
pan-0.162
otion-0.159
rified-0.158
 coward-0.157
Token,
Token ID11
Feature activation+1.86e-04

Pos logit contributions
pa+0.052
 work+0.045
 adventures+0.040
 family+0.039
 each+0.038
Neg logit contributions
pan-0.156
rol-0.149
rier-0.148
otion-0.144
 Woo-0.143
Token however
Token ID2158
Feature activation-4.60e-02

Pos logit contributions
pan+0.000
rol+0.000
rier+0.000
otion+0.000
 Woo+0.000
Neg logit contributions
pa-0.000
 work-0.000
 adventures-0.000
 family-0.000
 each-0.000
Token,
Token ID11
Feature activation-4.12e-04

Pos logit contributions
pa+0.037
 work+0.033
 adventures+0.029
 family+0.028
 each+0.028
Neg logit contributions
pan-0.116
rol-0.111
rier-0.109
 Woo-0.108
otion-0.107
Token the
Token ID262
Feature activation-2.53e-02

Pos logit contributions
pa+0.000
 work+0.000
 adventures+0.000
 family+0.000
 search+0.000
Neg logit contributions
pan-0.001
rol-0.001
rier-0.001
otion-0.001
 Woo-0.001
Token metal
Token ID6147
Feature activation-4.09e-02

Pos logit contributions
pa+0.045
 work+0.038
 adventures+0.038
 search+0.037
 each+0.036
Neg logit contributions
pan-0.039
rier-0.037
 Woo-0.036
rified-0.035
otion-0.034
Token bird
Token ID6512
Feature activation+3.40e-02

Pos logit contributions
pa+0.064
 work+0.056
 adventures+0.054
 search+0.053
 each+0.051
Neg logit contributions
pan-0.073
rier-0.066
rified-0.063
 coward-0.063
 Woo-0.062
Token got
Token ID1392
Feature activation-1.21e-02

Pos logit contributions
pan+0.069
rified+0.067
 Woo+0.065
rol+0.065
otion+0.064
Neg logit contributions
pa-0.040
 work-0.035
 adventures-0.033
 search-0.031
 each-0.030
Token backpack
Token ID22526
Feature activation-2.07e-03

Pos logit contributions
pa+0.020
 work+0.018
 adventures+0.017
 search+0.016
 each+0.016
Neg logit contributions
pan-0.027
 Woo-0.025
rier-0.025
rol-0.025
otion-0.025
Token that
Token ID326
Feature activation-8.61e-03

Pos logit contributions
pa+0.003
 work+0.002
 adventures+0.002
 search+0.002
 family+0.002
Neg logit contributions
pan-0.004
 Woo-0.004
rol-0.004
rier-0.004
otion-0.004
Token she
Token ID673
Feature activation+2.52e-02

Pos logit contributions
pa+0.016
 work+0.014
 adventures+0.014
 search+0.013
 each+0.013
Neg logit contributions
pan-0.012
 Woo-0.012
rol-0.011
otion-0.011
rier-0.011
Token carried
Token ID5281
Feature activation-2.04e-02

Pos logit contributions
 Woo+0.045
 coward+0.044
rol+0.043
pan+0.043
ainted+0.043
Neg logit contributions
pa-0.033
 work-0.026
 adventures-0.025
 search-0.024
 dad-0.023
Token to
Token ID284
Feature activation-6.91e-03

Pos logit contributions
pa+0.029
 work+0.024
 adventures+0.024
 family+0.023
 search+0.022
Neg logit contributions
 Woo-0.038
pan-0.037
rol-0.036
rier-0.035
rified-0.034
Token school
Token ID1524
Feature activation-6.11e-02

Pos logit contributions
pa+0.011
by+0.008
cles+0.008
 adventures+0.008
 work+0.007
Neg logit contributions
 Woo-0.012
pan-0.011
 downward-0.010
otion-0.010
rier-0.010
Token every
Token ID790
Feature activation-6.54e-02

Pos logit contributions
pa+0.077
 work+0.057
 family+0.052
 search+0.051
Daddy+0.051
Neg logit contributions
pan-0.118
 Woo-0.112
rol-0.110
anger-0.108
rier-0.106
Token day
Token ID1110
Feature activation-4.23e-02

Pos logit contributions
pa+0.048
 adventures+0.031
 work+0.030
 each+0.029
 search+0.023
Neg logit contributions
pan-0.165
 Woo-0.158
otion-0.155
rified-0.153
rier-0.152
Token.
Token ID13
Feature activation-1.09e-03

Pos logit contributions
pa+0.054
 work+0.041
 adventures+0.037
 ideas+0.035
 search+0.035
Neg logit contributions
pan-0.083
 Woo-0.082
rier-0.078
rol-0.075
ula-0.075
Token One
Token ID1881
Feature activation-4.77e-02

Pos logit contributions
pa+0.001
 work+0.001
 adventures+0.001
 search+0.001
 each+0.001
Neg logit contributions
pan-0.002
 Woo-0.002
rol-0.002
otion-0.002
rier-0.002
Token day
Token ID1110
Feature activation-4.20e-02

Pos logit contributions
pa+0.040
 adventures+0.029
 work+0.029
 search+0.025
 each+0.024
Neg logit contributions
 Woo-0.108
otion-0.106
 seeker-0.105
rified-0.104
pan-0.104
Token made
Token ID925
Feature activation-1.96e-02

Pos logit contributions
pa+0.076
 work+0.059
 search+0.059
 adventures+0.058
 family+0.057
Neg logit contributions
pan-0.080
 Woo-0.078
 coward-0.076
 liar-0.073
ainted-0.070
Token a
Token ID257
Feature activation-1.70e-02

Pos logit contributions
pa+0.027
 search+0.022
 adventures+0.022
 work+0.022
 family+0.021
Neg logit contributions
 Woo-0.036
pan-0.035
rier-0.034
rol-0.034
rified-0.033
Token mistake
Token ID7457
Feature activation+2.89e-02

Pos logit contributions
pa+0.027
 work+0.022
 adventures+0.022
 each+0.021
 search+0.021
Neg logit contributions
pan-0.030
rier-0.028
 Woo-0.028
 coward-0.027
rol-0.027
Token.
Token ID13
Feature activation-2.48e-02

Pos logit contributions
pan+0.068
rier+0.065
 Woo+0.064
otion+0.064
rified+0.063
Neg logit contributions
pa-0.028
 work-0.023
 adventures-0.022
 search-0.021
 family-0.019
Token He
Token ID679
Feature activation-4.37e-02

Pos logit contributions
pa+0.037
 work+0.033
 adventures+0.032
 search+0.030
 each+0.030
Neg logit contributions
pan-0.044
 Woo-0.043
rol-0.042
rier-0.042
rified-0.042
Token said
Token ID531
Feature activation-4.29e-02

Pos logit contributions
pa+0.058
 adventures+0.049
 work+0.047
 search+0.043
 projects+0.041
Neg logit contributions
pan-0.080
rified-0.078
 Woo-0.077
 coward-0.076
ainted-0.074
Token,
Token ID11
Feature activation+1.04e-03

Pos logit contributions
pa+0.041
 work+0.036
 each+0.032
 adventures+0.032
 family+0.031
Neg logit contributions
pan-0.099
rol-0.096
otion-0.093
 Woo-0.093
rified-0.093
Token
Token ID6184
Feature activation-3.38e-02

Pos logit contributions
pan+0.002
 Woo+0.001
rier+0.001
otion+0.001
rified+0.001
Neg logit contributions
pa-0.002
 adventures-0.002
 work-0.002
 search-0.002
 own-0.002
Token
Token ID95
Feature activation-2.57e-02

Pos logit contributions
pa+0.060
 work+0.057
 adventures+0.053
 search+0.052
 each+0.052
Neg logit contributions
pan-0.052
rol-0.049
rier-0.046
otion-0.045
 Woo-0.045
Token
Token ID26391
Feature activation-5.18e-02

Pos logit contributions
pa+0.046
 work+0.042
 adventures+0.041
 family+0.040
 search+0.039
Neg logit contributions
pan-0.039
 Woo-0.036
rier-0.035
rol-0.035
rified-0.035
Token
Token ID129
Feature activation+3.30e-02

Pos logit contributions
pa+0.087
 work+0.082
 adventures+0.076
 each+0.075
 search+0.074
Neg logit contributions
pan-0.078
 Woo-0.076
rified-0.073
rol-0.071
otion-0.071
Token the
Token ID262
Feature activation-1.85e-02

Pos logit contributions
pa+0.052
 work+0.044
 search+0.044
 adventures+0.041
 family+0.039
Neg logit contributions
pan-0.050
 Woo-0.050
rier-0.048
otion-0.045
rified-0.044
Token piano
Token ID19132
Feature activation-1.35e-02

Pos logit contributions
pa+0.032
 work+0.027
 each+0.026
 search+0.026
 adventures+0.025
Neg logit contributions
pan-0.030
 Woo-0.027
otion-0.026
rier-0.026
rified-0.025
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
pa+0.018
 work+0.014
 adventures+0.014
 search+0.014
 family+0.013
Neg logit contributions
pan-0.027
rier-0.025
rol-0.024
 Woo-0.024
otion-0.024
Token We
Token ID775
Feature activation+1.70e-02

Pos logit contributions
pa+0.035
 work+0.032
 each+0.031
 search+0.030
 adventures+0.030
Neg logit contributions
pan-0.034
rol-0.031
rier-0.030
otion-0.030
 Woo-0.029
Token can
Token ID460
Feature activation-9.55e-03

Pos logit contributions
pan+0.027
 Woo+0.024
 coward+0.024
otion+0.024
rol+0.023
Neg logit contributions
pa-0.030
 adventures-0.026
 family-0.023
 work-0.023
 search-0.023
Token both
Token ID1111
Feature activation-8.14e-02

Pos logit contributions
pa+0.013
 adventures+0.011
 day+0.009
 family+0.009
 work+0.009
Neg logit contributions
pan-0.018
 Woo-0.017
rier-0.016
otion-0.016
rol-0.016
Token play
Token ID711
Feature activation-3.53e-02

Pos logit contributions
pa+0.122
 adventures+0.093
 packages+0.086
Daddy+0.084
 search+0.084
Neg logit contributions
pan-0.126
ner-0.116
otion-0.114
rol-0.112
 Woo-0.112
Token it
Token ID340
Feature activation+3.49e-03

Pos logit contributions
pa+0.052
 work+0.042
 search+0.040
 adventures+0.039
 family+0.038
Neg logit contributions
pan-0.066
 Woo-0.063
rier-0.063
otion-0.059
rified-0.058
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
pan+0.006
rier+0.006
 Woo+0.006
rol+0.006
otion+0.006
Neg logit contributions
pa-0.006
 work-0.004
 search-0.004
 family-0.004
 adventures-0.004
Token be
Token ID307
Feature activation-6.54e-02

Pos logit contributions
pa+0.025
 adventures+0.017
cles+0.017
dad+0.017
urious+0.017
Neg logit contributions
pan-0.027
 Woo-0.026
rier-0.024
 coward-0.024
 disappears-0.023
Token happy
Token ID3772
Feature activation-2.94e-02

Pos logit contributions
pa+0.087
 work+0.073
 search+0.069
 adventures+0.068
 each+0.060
Neg logit contributions
pan-0.125
 Woo-0.122
rier-0.120
rol-0.116
 coward-0.115
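
The per-token records above all follow the same layout: a `Token` line, a `Token ID` line, a `Feature activation` line, then five positive and five negative logit contributions, with each label fused directly onto its value. The sketch below shows one way to turn that layout back into structured data. It is only an illustration: `parse_records` and `SAMPLE` are hypothetical names introduced here, not part of the tool that produced this dump, and the parser assumes no vocabulary token in a contribution line itself begins with the text "Token".

```python
import re

# A contribution line is "<token text><signed decimal>", e.g. " Woo+0.011".
# The greedy first group leaves only the final signed number for the second group.
CONTRIB = re.compile(r"^(.*)([+-]\d+\.\d+)$")


def parse_records(text):
    """Parse fused 'Token / Token ID / Feature activation' records into dicts."""
    records = []
    current = None   # record currently being filled in
    section = None   # "pos", "neg", or None when outside a contribution list
    for line in text.splitlines():
        if (line.startswith("Token ID") and current is not None
                and "token_id" not in current):
            current["token_id"] = int(line[len("Token ID"):])
        elif line.startswith("Feature activation") and current is not None:
            current["activation"] = float(line[len("Feature activation"):])
        elif line == "Pos logit contributions":
            section = "pos"
        elif line == "Neg logit contributions":
            section = "neg"
        elif line.startswith("Token"):
            # Everything after the label is the token text, including any
            # leading space (so "Token He" yields " He", "TokenSam" yields "Sam").
            current = {"token": line[len("Token"):], "pos": [], "neg": []}
            records.append(current)
            section = None
        elif section is not None and current is not None:
            match = CONTRIB.match(line)
            if match:
                current[section].append((match.group(1), float(match.group(2))))
    return records


# One record copied from the dump above (the " He" token, ID 679).
SAMPLE = "\n".join([
    "Token He",
    "Token ID679",
    "Feature activation-2.04e-02",
    "",
    "Pos logit contributions",
    "pan+0.011",
    " Woo+0.011",
    "rier+0.011",
    "rol+0.011",
    "otion+0.011",
    "Neg logit contributions",
    "pa-0.005",
    " work-0.005",
    " adventures-0.004",
    " search-0.004",
    " each-0.004",
])

records = parse_records(SAMPLE)
for rec in records:
    print(repr(rec["token"]), rec["token_id"], rec["activation"])
```

Keeping the leading space in the token text matters here, because it is the only thing that distinguishes a word-initial token such as " He" from a continuation piece such as "rier".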