TOP ACTIVATIONS
MAX = 4.083

had two long arms . Fasc inated , she picked it
think of the storm ?". Jill thought for a moment and
What made the rainbow ?". Jill answered , " A storm
" What is that ?" Jessica said , scared .
jealous of your friends and don 't be mean to them
listen to your parents and don 't touch things that can
loved to play with his orange ball . He would throw
I take a nap now ?" asked Walter .
buy too much food and don 't eat it all .
What is a language ?" Lily asks . "
can happen even when you don 't expect it . <|endoftext|>
Now she could look at the sun sets pieces every day
They both marvel ed at the comet . Its tail spark
." They stopped to watch the sunset . Suddenly , Nina
, we can go see the sunset ." As
of her with her favorite toy . Lily smiled and said
parents , even if we don 't want to . <|endoftext|>
, while Tim was watching the sunset , he saw something
important ?" The voice replied , " This bridge
own way and that we don 't always need material possessions

MIN ACTIVATIONS
MIN = -3.805

. " In a box under the tree ."
. Ben takes a bite of the banana .
them . He lives in a house near the woods .
She turned on the f auc et and filled the sink
a phone . He calls a doctor . He also calls
A man in a blue uniform runs to the
loved to play with Wh isk ers , but one day
He drank from the f auc et . But he still
A man in a boat saw them and came
shells . They hear the se ag ull s and the
He said he was in a hurry . He had to
fun making mud pies , cars , and castles . But
opard . Tim runs to a man who works in the
talks to the dog in a soft voice . He gives
. He takes them to a small room . He locks
their hats and coats and sne aked out of the house
. In the middle of the night , Tim
fell asleep . In the middle of the night , she
. She puts them in a bucket . She fills the
a big chair , a loud drill and a bitter paste

INTERVAL 2.111 - 4.083
CONTAINS 0.403%

drawing on anything again . From that day on , she
it , grand pa ?", Mary asked . Grand
was so excited to see the dolphin swimming around .
of presents ! His eyes lit up as he saw all
language they were speaking and they told him it was Spanish

INTERVAL 0.139 - 2.111
CONTAINS 42.095%

was about a little girl who never gave up , even
eat . Max was so happy and gob bled it up
They hear their mom getting closer . They are scared .
says , " I love you too . Thank you for
elly . It was an onion ! Grace was so surprised

INTERVAL -1.833 - 0.139
CONTAINS 56.861%

organize my toys now ." She went back to her room
for many days and nights and eventually he found the silver
€ Daddy handed me a box with a string . I
pretty things . One day , she found a shiny pin
find a better friend and a better bug ."

INTERVAL -3.805 - -1.833
CONTAINS 0.641%

wrong . They saw Buddy and Wh isk ers fighting over
There might be monsters or traps or th orns !"
gave Lily the tw ig . Lily looked at it and
. He gets a band - aid and puts it on
Help , help !" Tom and Jen cry . They run
Token had
Token ID550
Feature activation+1.72e-01

Pos logit contributions
ement+0.055
ised+0.053
inated+0.053
work+0.051
used+0.050
Neg logit contributions
isk-0.043
eh-0.035
aks-0.035
 germ-0.033
coat-0.032
Token two
Token ID734
Feature activation+4.21e-01

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.006
el+0.006
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
eh-0.006
Token long
Token ID890
Feature activation+3.93e-01

Pos logit contributions
ised+0.017
inated+0.017
ement+0.016
work+0.016
el+0.015
Neg logit contributions
isk-0.019
aks-0.017
ives-0.016
arl-0.016
eh-0.016
Token arms
Token ID5101
Feature activation+1.25e+00

Pos logit contributions
ised+0.016
ement+0.016
inated+0.016
work+0.015
el+0.015
Neg logit contributions
isk-0.018
aks-0.016
eh-0.014
ives-0.014
arl-0.014
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
ised+0.050
ement+0.050
inated+0.047
ths+0.047
 comet+0.046
Neg logit contributions
isk-0.055
aks-0.049
ives-0.048
arl-0.047
coat-0.046
Token Fasc
Token ID41467
Feature activation+4.08e+00

Pos logit contributions
ised+0.010
ement+0.010
inated+0.010
work+0.010
el+0.009
Neg logit contributions
isk-0.013
aks-0.011
arl-0.010
ives-0.010
eh-0.010
Tokeninated
Token ID3898
Feature activation-4.02e-01

Pos logit contributions
ement+0.103
 comet+0.081
 diamonds+0.078
 avail+0.068
work+0.068
Neg logit contributions
isk-0.249
 fetch-0.214
atters-0.206
ro-0.204
eh-0.201
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
isk+0.019
aks+0.016
arl+0.016
aid+0.016
ives+0.015
Neg logit contributions
ised-0.016
ement-0.016
inated-0.016
el-0.016
work-0.015
Token she
Token ID673
Feature activation+1.30e+00

Pos logit contributions
ement+0.044
ised+0.043
inated+0.043
work+0.042
 horizon+0.039
Neg logit contributions
isk-0.043
aks-0.035
 germ-0.034
aid-0.034
ens-0.034
Token picked
Token ID6497
Feature activation+1.30e+00

Pos logit contributions
ement+0.070
work+0.067
ised+0.065
inated+0.064
 horizon+0.063
Neg logit contributions
isk-0.042
eh-0.034
aks-0.034
coat-0.031
 comes-0.030
Token it
Token ID340
Feature activation+1.48e+00

Pos logit contributions
ised+0.047
ement+0.045
inated+0.044
izing+0.042
 horizon+0.042
Neg logit contributions
isk-0.062
aks-0.058
ens-0.055
ives-0.054
 comes-0.053
Token think
Token ID892
Feature activation+2.18e-01

Pos logit contributions
ement+0.034
ised+0.034
inated+0.033
ths+0.032
el+0.032
Neg logit contributions
isk-0.018
aks-0.014
ives-0.013
arl-0.013
eh-0.013
Token of
Token ID286
Feature activation+8.70e-01

Pos logit contributions
ised+0.008
ement+0.008
inated+0.008
work+0.008
el+0.008
Neg logit contributions
isk-0.010
aks-0.009
arl-0.009
ives-0.009
eh-0.008
Token the
Token ID262
Feature activation+7.13e-01

Pos logit contributions
ement+0.037
ised+0.037
inated+0.036
el+0.034
itz+0.034
Neg logit contributions
isk-0.037
aks-0.032
arl-0.030
 germ-0.030
eh-0.029
Token storm
Token ID6388
Feature activation-3.34e-01

Pos logit contributions
ised+0.032
inated+0.032
ement+0.032
work+0.029
el+0.029
Neg logit contributions
isk-0.029
aks-0.025
 germ-0.024
arl-0.024
ives-0.024
Token?".
Token ID43634
Feature activation+1.25e-01

Pos logit contributions
isk+0.017
aks+0.015
arl+0.015
aid+0.015
eh+0.015
Neg logit contributions
ised-0.011
ement-0.011
inated-0.011
work-0.011
el-0.010
Token Jill
Token ID22772
Feature activation+3.99e+00

Pos logit contributions
ised+0.006
ement+0.006
inated+0.006
work+0.006
el+0.006
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token thought
Token ID1807
Feature activation+2.18e-01

Pos logit contributions
 avail+0.229
 horizon+0.222
 diamonds+0.220
ement+0.214
izing+0.214
Neg logit contributions
isk-0.098
 saw-0.079
 came-0.077
ens-0.074
 helped-0.072
Token for
Token ID329
Feature activation+6.37e-01

Pos logit contributions
ised+0.008
inated+0.008
ement+0.008
work+0.008
el+0.007
Neg logit contributions
isk-0.011
aks-0.009
arl-0.009
ives-0.009
eh-0.009
Token a
Token ID257
Feature activation-2.14e-01

Pos logit contributions
inated+0.028
ement+0.028
ised+0.027
el+0.026
work+0.025
Neg logit contributions
isk-0.027
aks-0.023
aid-0.021
 backup-0.021
 germ-0.021
Token moment
Token ID2589
Feature activation+6.25e-01

Pos logit contributions
isk+0.009
aks+0.008
ives+0.007
arl+0.007
aid+0.007
Neg logit contributions
inated-0.009
ised-0.009
ement-0.009
work-0.009
 sunset-0.009
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
inated+0.029
ised+0.028
ement+0.028
el+0.027
work+0.026
Neg logit contributions
isk-0.025
aks-0.021
ives-0.020
arl-0.020
aid-0.019
TokenWhat
Token ID2061
Feature activation+5.77e-01

Pos logit contributions
inated+0.075
ement+0.075
ised+0.072
el+0.067
work+0.066
Neg logit contributions
isk-0.074
eh-0.061
aks-0.059
aid-0.058
arl-0.056
Token made
Token ID925
Feature activation+7.61e-01

Pos logit contributions
ised+0.030
inated+0.029
ement+0.029
 horizon+0.028
el+0.027
Neg logit contributions
isk-0.020
aks-0.017
ives-0.016
 germ-0.015
eh-0.015
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
inated+0.030
ement+0.030
ised+0.030
work+0.028
 horizon+0.027
Neg logit contributions
isk-0.034
aks-0.031
ives-0.028
arl-0.028
eh-0.028
Token rainbow
Token ID27223
Feature activation+1.05e+00

Pos logit contributions
ement+0.095
ised+0.094
inated+0.092
used+0.089
 curiously+0.088
Neg logit contributions
isk-0.071
 germ-0.064
aks-0.061
 comes-0.061
 welcome-0.061
Token?".
Token ID43634
Feature activation-9.50e-02

Pos logit contributions
ised+0.043
inated+0.043
ement+0.043
el+0.040
itz+0.040
Neg logit contributions
isk-0.044
aks-0.040
eh-0.037
ives-0.037
arl-0.036
Token Jill
Token ID22772
Feature activation+3.94e+00

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
ives+0.003
aid+0.003
Neg logit contributions
ised-0.004
ement-0.004
inated-0.004
work-0.004
el-0.004
Token answered
Token ID9373
Feature activation+1.41e+00

Pos logit contributions
 avail+0.227
 horizon+0.209
ement+0.205
 diamonds+0.202
 fullest+0.201
Neg logit contributions
isk-0.090
 came-0.087
 saw-0.082
 woke-0.080
 helped-0.080
Token,
Token ID11
Feature activation+8.78e-01

Pos logit contributions
inated+0.070
ised+0.069
 horizon+0.066
ement+0.065
work+0.065
Neg logit contributions
isk-0.046
smart-0.035
aks-0.034
aid-0.034
 backup-0.033
Token "
Token ID366
Feature activation+2.42e+00

Pos logit contributions
ised+0.049
ement+0.048
el+0.047
inated+0.047
work+0.047
Neg logit contributions
isk-0.026
aks-0.019
arl-0.019
eh-0.019
ives-0.018
TokenA
Token ID32
Feature activation+1.69e+00

Pos logit contributions
ement+0.116
 avail+0.111
ised+0.110
inated+0.103
ths+0.101
Neg logit contributions
isk-0.079
and-0.079
ives-0.074
aks-0.074
aid-0.073
Token storm
Token ID6388
Feature activation-1.15e-01

Pos logit contributions
ement+0.062
ised+0.062
inated+0.059
el+0.058
 curiously+0.056
Neg logit contributions
isk-0.071
aks-0.070
eh-0.068
arl-0.068
smart-0.066
Token"
Token ID1
Feature activation+1.42e+00

Pos logit contributions
ement+0.028
inated+0.028
ised+0.027
work+0.027
el+0.027
Neg logit contributions
isk-0.023
aks-0.020
ives-0.019
arl-0.018
eh-0.018
TokenWhat
Token ID2061
Feature activation-1.02e-01

Pos logit contributions
ement+0.068
ised+0.063
inated+0.063
el+0.059
 int+0.059
Neg logit contributions
isk-0.057
aks-0.048
aid-0.046
ives-0.045
eh-0.045
Token is
Token ID318
Feature activation+2.27e+00

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
eh+0.003
aid+0.003
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
work-0.005
el-0.005
Token that
Token ID326
Feature activation+7.15e-01

Pos logit contributions
ement+0.121
ised+0.115
 horizon+0.106
efully+0.106
itz+0.104
Neg logit contributions
 germ-0.062
 comes-0.060
 backup-0.060
eh-0.060
isk-0.060
Token?"
Token ID1701
Feature activation+2.02e+00

Pos logit contributions
ised+0.029
ement+0.029
inated+0.029
el+0.027
work+0.027
Neg logit contributions
isk-0.032
aks-0.028
arl-0.026
eh-0.026
 germ-0.026
Token Jessica
Token ID17352
Feature activation+3.83e+00

Pos logit contributions
inated+0.088
ement+0.087
itz+0.085
work+0.085
ble+0.084
Neg logit contributions
aks-0.069
isk-0.068
smart-0.064
 germ-0.063
 comes-0.062
Token said
Token ID531
Feature activation+1.55e+00

Pos logit contributions
ement+0.213
 sunset+0.196
 horizon+0.194
work+0.193
 fullest+0.187
Neg logit contributions
isk-0.093
 came-0.079
aks-0.078
 bark-0.078
 woke-0.078
Token,
Token ID11
Feature activation+2.00e+00

Pos logit contributions
ement+0.073
ised+0.072
inated+0.070
ths+0.069
 horizon+0.068
Neg logit contributions
isk-0.053
aks-0.046
smart-0.046
and-0.043
 backup-0.043
Token scared
Token ID12008
Feature activation+5.51e-01

Pos logit contributions
ement+0.087
ised+0.084
work+0.081
el+0.080
ths+0.078
Neg logit contributions
isk-0.081
eh-0.069
coat-0.065
aks-0.064
 fetch-0.064
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
inated+0.023
ised+0.023
ement+0.022
work+0.022
el+0.021
Neg logit contributions
isk-0.024
aks-0.021
arl-0.020
ives-0.020
aid-0.019
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
ised+0.055
ement+0.055
inated+0.053
work+0.052
el+0.052
Neg logit contributions
isk-0.038
aks-0.032
ives-0.031
arl-0.031
smart-0.029
Token jealous
Token ID19354
Feature activation-2.21e-01

Pos logit contributions
isk+0.024
aks+0.020
arl+0.020
ives+0.020
coat+0.020
Neg logit contributions
ised-0.021
inated-0.020
ement-0.019
work-0.019
el-0.019
Token of
Token ID286
Feature activation+2.55e-01

Pos logit contributions
isk+0.009
aks+0.008
eh+0.007
arl+0.007
aid+0.007
Neg logit contributions
ement-0.010
ised-0.010
inated-0.010
work-0.009
el-0.009
Token your
Token ID534
Feature activation-3.75e-01

Pos logit contributions
ised+0.010
inated+0.010
ement+0.010
work+0.010
el+0.010
Neg logit contributions
isk-0.012
aks-0.010
arl-0.009
ives-0.009
eh-0.009
Token friends
Token ID2460
Feature activation+9.35e-01

Pos logit contributions
isk+0.018
aks+0.016
arl+0.015
ives+0.015
aid+0.015
Neg logit contributions
ised-0.014
ement-0.014
inated-0.014
work-0.013
el-0.013
Token and
Token ID290
Feature activation-9.55e-02

Pos logit contributions
inated+0.040
ised+0.039
ement+0.038
el+0.037
 sunset+0.037
Neg logit contributions
isk-0.041
aks-0.035
ives-0.034
smart-0.032
arl-0.032
Token don
Token ID836
Feature activation+3.79e+00

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
eh+0.004
ives+0.004
Neg logit contributions
ised-0.004
inated-0.004
ement-0.004
work-0.003
el-0.003
Token't
Token ID470
Feature activation+1.18e+00

Pos logit contributions
ishment+0.213
ement+0.208
 reflection+0.201
 pearl+0.201
 horizon+0.200
Neg logit contributions
isk-0.099
aster-0.079
ives-0.079
iren-0.078
 fetch-0.077
Token be
Token ID307
Feature activation-6.31e-01

Pos logit contributions
ement+0.070
ised+0.067
inated+0.065
 horizon+0.062
el+0.061
Neg logit contributions
isk-0.035
aks-0.026
arl-0.025
aid-0.025
 germ-0.024
Token mean
Token ID1612
Feature activation-3.29e-02

Pos logit contributions
isk+0.030
arl+0.025
ives+0.025
aks+0.025
coat+0.024
Neg logit contributions
ised-0.025
inated-0.024
ement-0.023
work-0.023
el-0.022
Token to
Token ID284
Feature activation+3.50e-01

Pos logit contributions
isk+0.002
aks+0.001
arl+0.001
eh+0.001
aid+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
work-0.001
el-0.001
Token them
Token ID606
Feature activation+1.07e+00

Pos logit contributions
ement+0.013
ised+0.013
inated+0.013
work+0.012
el+0.012
Neg logit contributions
isk-0.017
aks-0.015
arl-0.014
ives-0.014
aid-0.014
Token listen
Token ID6004
Feature activation-7.56e-01

Pos logit contributions
ement+0.031
ised+0.031
inated+0.030
work+0.028
el+0.028
Neg logit contributions
isk-0.024
aks-0.020
arl-0.020
ives-0.019
aid-0.019
Token to
Token ID284
Feature activation-4.30e-01

Pos logit contributions
isk+0.034
aks+0.027
eh+0.027
aid+0.027
arl+0.027
Neg logit contributions
ised-0.032
inated-0.032
ement-0.031
work-0.031
 int-0.030
Token your
Token ID534
Feature activation+7.87e-01

Pos logit contributions
isk+0.019
ives+0.016
aks+0.016
coat+0.016
aid+0.016
Neg logit contributions
ised-0.018
inated-0.017
el-0.017
 sunset-0.017
work-0.017
Token parents
Token ID3397
Feature activation+1.73e-01

Pos logit contributions
ised+0.033
ement+0.033
inated+0.032
ized+0.029
el+0.029
Neg logit contributions
isk-0.036
arl-0.032
 germ-0.031
aks-0.030
 backup-0.030
Token and
Token ID290
Feature activation-4.83e-01

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.006
el+0.006
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
eh-0.006
Token don
Token ID836
Feature activation+3.77e+00

Pos logit contributions
isk+0.022
aks+0.018
eh+0.018
arl+0.018
ives+0.018
Neg logit contributions
ised-0.020
inated-0.020
work-0.019
ement-0.019
el-0.019
Token't
Token ID470
Feature activation+1.39e+00

Pos logit contributions
ishment+0.204
ement+0.196
 reflection+0.193
 horizon+0.192
 diamonds+0.192
Neg logit contributions
isk-0.100
iren-0.090
aster-0.088
arl-0.086
aks-0.086
Token touch
Token ID3638
Feature activation+4.85e-01

Pos logit contributions
ement+0.089
ised+0.083
inated+0.079
ths+0.077
 horizon+0.077
Neg logit contributions
isk-0.038
aks-0.029
arl-0.027
ens-0.027
aid-0.026
Token things
Token ID1243
Feature activation+2.74e-01

Pos logit contributions
inated+0.020
ised+0.020
ement+0.020
work+0.019
el+0.018
Neg logit contributions
isk-0.021
aks-0.019
arl-0.018
ives-0.017
aid-0.017
Token that
Token ID326
Feature activation-7.52e-01

Pos logit contributions
ised+0.010
inated+0.009
ement+0.009
el+0.009
work+0.009
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.012
aid-0.012
Token can
Token ID460
Feature activation+8.25e-01

Pos logit contributions
isk+0.028
eh+0.023
aks+0.022
arl+0.022
coat+0.021
Neg logit contributions
el-0.036
inated-0.036
work-0.036
ised-0.036
ement-0.036
Token loved
Token ID6151
Feature activation-2.19e-01

Pos logit contributions
isk+0.011
arl+0.009
aks+0.009
ives+0.009
aid+0.009
Neg logit contributions
ised-0.010
inated-0.010
ement-0.010
el-0.010
work-0.009
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
isk+0.009
aks+0.007
arl+0.007
ives+0.007
eh+0.007
Neg logit contributions
inated-0.010
ised-0.010
ement-0.010
 sunset-0.010
work-0.010
Token play
Token ID711
Feature activation-5.63e-01

Pos logit contributions
isk+0.042
aks+0.036
arl+0.034
ives+0.034
eh+0.032
Neg logit contributions
 sunset-0.047
inated-0.046
 comet-0.045
el-0.045
work-0.044
Token with
Token ID351
Feature activation+1.14e-02

Pos logit contributions
isk+0.029
arl+0.025
aks+0.025
ives+0.024
aid+0.024
Neg logit contributions
ised-0.020
inated-0.020
ement-0.019
 sunset-0.018
el-0.018
Token his
Token ID465
Feature activation+1.33e+00

Pos logit contributions
ised+0.001
inated+0.001
ement+0.001
el+0.001
work+0.001
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token orange
Token ID10912
Feature activation+3.76e+00

Pos logit contributions
ement+0.079
inated+0.077
ised+0.077
 horizon+0.076
 int+0.075
Neg logit contributions
isk-0.032
aks-0.027
coat-0.026
eh-0.026
 germ-0.025
Token ball
Token ID2613
Feature activation+5.18e-01

Pos logit contributions
ement+0.244
uct+0.243
used+0.235
ised+0.231
itating+0.230
Neg logit contributions
 banana-0.064
 fetch-0.056
 cookie-0.052
 fort-0.052
 books-0.051
Token.
Token ID13
Feature activation-5.00e-01

Pos logit contributions
ised+0.019
ement+0.019
inated+0.019
work+0.018
el+0.018
Neg logit contributions
isk-0.025
aks-0.022
eh-0.021
ives-0.021
arl-0.021
Token He
Token ID679
Feature activation-4.80e-01

Pos logit contributions
isk+0.022
aks+0.019
eh+0.018
aid+0.018
ives+0.018
Neg logit contributions
inated-0.021
ement-0.021
ised-0.021
 sunset-0.021
el-0.020
Token would
Token ID561
Feature activation-5.62e-01

Pos logit contributions
isk+0.021
arl+0.019
aks+0.018
ives+0.018
aid+0.017
Neg logit contributions
inated-0.019
ised-0.019
el-0.019
ement-0.018
work-0.018
Token throw
Token ID3714
Feature activation+6.11e-01

Pos logit contributions
isk+0.020
aks+0.017
ives+0.016
arl+0.016
aid+0.015
Neg logit contributions
inated-0.029
ised-0.027
el-0.027
 sunset-0.027
ement-0.027
Token I
Token ID314
Feature activation-7.67e-01

Pos logit contributions
ement+0.041
ised+0.041
inated+0.041
work+0.038
 sunset+0.038
Neg logit contributions
isk-0.038
arl-0.030
aid-0.030
aks-0.030
ens-0.030
Token take
Token ID1011
Feature activation+6.82e-02

Pos logit contributions
isk+0.029
aks+0.025
eh+0.023
arl+0.023
ives+0.022
Neg logit contributions
inated-0.035
el-0.035
ised-0.035
ement-0.035
 sunset-0.035
Token a
Token ID257
Feature activation-5.44e-01

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
el+0.002
work+0.002
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token nap
Token ID25422
Feature activation+2.46e-01

Pos logit contributions
isk+0.017
aks+0.014
ives+0.012
arl+0.012
eh+0.012
Neg logit contributions
ement-0.029
ised-0.029
inated-0.029
 sunset-0.029
el-0.028
Token now
Token ID783
Feature activation+5.83e-01

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.012
aks-0.011
arl-0.010
ives-0.010
eh-0.010
Token?"
Token ID1701
Feature activation+3.75e+00

Pos logit contributions
inated+0.021
ised+0.021
ement+0.021
work+0.020
el+0.020
Neg logit contributions
isk-0.028
aks-0.025
arl-0.024
ives-0.024
eh-0.023
Token asked
Token ID1965
Feature activation+3.18e+00

Pos logit contributions
work+0.196
ement+0.192
ths+0.186
itz+0.182
izing+0.180
Neg logit contributions
ens-0.087
 came-0.086
 Sue-0.086
 comes-0.086
 see-0.085
Token Walter
Token ID15819
Feature activation+9.39e-01

Pos logit contributions
inated+0.134
ths+0.133
 horizon+0.131
ement+0.129
itz+0.129
Neg logit contributions
isk-0.113
aks-0.107
ens-0.100
 welcome-0.098
aid-0.096
Token.
Token ID13
Feature activation+1.98e+00

Pos logit contributions
ement+0.037
ised+0.036
inated+0.036
work+0.034
 sunset+0.034
Neg logit contributions
isk-0.044
aks-0.038
eh-0.036
ives-0.035
aid-0.035
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
ised+0.111
izing+0.111
ths+0.109
ement+0.109
work+0.107
Neg logit contributions
isk-0.052
aks-0.043
 welcome-0.039
arl-0.038
and-0.038
Token
Token ID198
Feature activation+1.58e+00

Pos logit contributions
ement+0.060
ised+0.059
inated+0.059
work+0.059
izing+0.057
Neg logit contributions
isk-0.040
aks-0.037
arl-0.032
aid-0.032
and-0.031
Token buy
Token ID2822
Feature activation-3.80e-01

Pos logit contributions
isk+0.009
aks+0.008
arl+0.007
eh+0.007
ives+0.007
Neg logit contributions
ised-0.011
inated-0.011
ement-0.011
work-0.010
el-0.010
Token too
Token ID1165
Feature activation-3.26e-02

Pos logit contributions
isk+0.017
aks+0.014
arl+0.014
eh+0.014
ives+0.014
Neg logit contributions
ised-0.016
ement-0.015
inated-0.015
work-0.015
el-0.015
Token much
Token ID881
Feature activation-4.59e-01

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.002
ement-0.002
inated-0.002
el-0.001
work-0.001
Token food
Token ID2057
Feature activation-8.48e-01

Pos logit contributions
isk+0.020
aks+0.016
eh+0.015
arl+0.015
coat+0.015
Neg logit contributions
ised-0.020
ement-0.020
inated-0.020
 sunset-0.019
work-0.019
Token and
Token ID290
Feature activation+1.09e-01

Pos logit contributions
isk+0.040
aks+0.034
eh+0.034
arl+0.033
aid+0.033
Neg logit contributions
ised-0.032
ement-0.032
inated-0.031
work-0.031
el-0.030
Token don
Token ID836
Feature activation+3.74e+00

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token't
Token ID470
Feature activation+1.37e+00

Pos logit contributions
ishment+0.203
ement+0.199
 reflection+0.195
inated+0.193
work+0.191
Neg logit contributions
isk-0.099
aster-0.085
iren-0.081
ives-0.080
arl-0.078
Token eat
Token ID4483
Feature activation+6.76e-02

Pos logit contributions
ement+0.080
ised+0.076
inated+0.074
ths+0.070
el+0.070
Neg logit contributions
isk-0.042
arl-0.030
ens-0.030
aks-0.030
 germ-0.030
Token it
Token ID340
Feature activation-1.38e-01

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.002
eh-0.002
Token all
Token ID477
Feature activation+5.12e-01

Pos logit contributions
isk+0.007
aks+0.006
arl+0.006
ives+0.006
eh+0.006
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
work-0.005
el-0.005
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ement+0.021
ised+0.021
inated+0.020
el+0.019
work+0.019
Neg logit contributions
isk-0.023
aks-0.020
ives-0.019
arl-0.019
eh-0.019
TokenWhat
Token ID2061
Feature activation+2.92e-01

Pos logit contributions
ement+0.046
el+0.045
ised+0.045
inated+0.045
work+0.043
Neg logit contributions
isk-0.045
aid-0.040
aks-0.039
ives-0.038
arl-0.038
Token is
Token ID318
Feature activation+2.09e+00

Pos logit contributions
ised+0.014
inated+0.014
ement+0.014
work+0.013
el+0.013
Neg logit contributions
isk-0.011
aks-0.009
ives-0.009
arl-0.009
aid-0.009
Token a
Token ID257
Feature activation+8.75e-01

Pos logit contributions
ised+0.111
ement+0.110
 horizon+0.102
inated+0.098
ths+0.098
Neg logit contributions
 germ-0.062
aks-0.062
isk-0.060
 robber-0.060
 Luna-0.059
Token language
Token ID3303
Feature activation+1.18e+00

Pos logit contributions
ised+0.041
ement+0.041
inated+0.040
el+0.039
work+0.037
Neg logit contributions
isk-0.034
aks-0.029
 germ-0.028
arl-0.028
eh-0.027
Token?"
Token ID1701
Feature activation-5.20e-01

Pos logit contributions
inated+0.047
ised+0.047
ement+0.046
el+0.043
itz+0.042
Neg logit contributions
isk-0.052
aks-0.047
arl-0.044
ives-0.044
aid-0.043
Token Lily
Token ID20037
Feature activation+3.74e+00

Pos logit contributions
isk+0.017
aks+0.013
arl+0.013
ives+0.013
eh+0.012
Neg logit contributions
ised-0.029
ement-0.028
 sunset-0.027
inated-0.027
el-0.027
Token asks
Token ID7893
Feature activation+1.19e-01

Pos logit contributions
work+0.222
ised+0.216
izing+0.212
ized+0.210
ement+0.208
Neg logit contributions
 comes-0.090
 wakes-0.072
 hears-0.072
aks-0.070
 sees-0.070
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
isk+0.020
aks+0.017
eh+0.017
 germ+0.016
aid+0.016
Neg logit contributions
inated-0.015
ement-0.015
ised-0.015
 sunset-0.014
el-0.014
Token
Token ID198
Feature activation-6.50e-01

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
ives+0.004
eh+0.004
Neg logit contributions
ised-0.004
ement-0.004
inated-0.004
el-0.004
work-0.004
Token"
Token ID1
Feature activation+1.77e+00

Pos logit contributions
isk+0.026
 germ+0.022
aks+0.021
arl+0.021
ives+0.020
Neg logit contributions
ised-0.031
inated-0.029
ement-0.028
el-0.028
izing-0.027
Token can
Token ID460
Feature activation+1.13e+00

Pos logit contributions
ement+0.053
ised+0.052
inated+0.052
el+0.050
work+0.049
Neg logit contributions
isk-0.030
aks-0.025
ives-0.024
arl-0.024
aid-0.023
Token happen
Token ID1645
Feature activation+2.07e-01

Pos logit contributions
ement+0.062
inated+0.060
ised+0.060
work+0.057
ths+0.056
Neg logit contributions
isk-0.037
aks-0.027
arl-0.027
ives-0.027
aid-0.025
Token even
Token ID772
Feature activation-5.53e-02

Pos logit contributions
ised+0.008
ement+0.008
inated+0.008
work+0.008
el+0.008
Neg logit contributions
isk-0.009
aks-0.008
arl-0.008
ives-0.008
eh-0.007
Token when
Token ID618
Feature activation+1.45e+00

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.002
ement-0.002
inated-0.002
work-0.002
el-0.002
Token you
Token ID345
Feature activation+6.75e-01

Pos logit contributions
ement+0.065
inated+0.063
ised+0.062
work+0.060
el+0.057
Neg logit contributions
isk-0.058
aks-0.051
 germ-0.050
 backup-0.047
arl-0.047
Token don
Token ID836
Feature activation+3.70e+00

Pos logit contributions
ement+0.036
ised+0.036
inated+0.035
el+0.033
work+0.033
Neg logit contributions
isk-0.023
aks-0.018
ives-0.018
arl-0.017
aid-0.016
Token't
Token ID470
Feature activation+1.62e+00

Pos logit contributions
ement+0.184
ishment+0.179
inated+0.174
 reflection+0.172
work+0.172
Neg logit contributions
isk-0.118
ives-0.099
aks-0.099
arl-0.097
iren-0.095
Token expect
Token ID1607
Feature activation+9.44e-01

Pos logit contributions
ement+0.100
ised+0.092
ths+0.090
inated+0.089
work+0.086
Neg logit contributions
isk-0.046
arl-0.034
aks-0.033
ens-0.031
ives-0.031
Token it
Token ID340
Feature activation+1.71e-01

Pos logit contributions
inated+0.038
ised+0.038
ement+0.038
work+0.036
el+0.035
Neg logit contributions
isk-0.042
aks-0.037
ives-0.035
arl-0.034
 germ-0.034
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.007
el+0.007
Neg logit contributions
isk-0.007
aks-0.006
arl-0.006
ives-0.006
eh-0.006
Token<|endoftext|>
Token ID50256
Feature activation+2.19e-02

Pos logit contributions
ised+0.011
ement+0.011
inated+0.011
work+0.010
el+0.010
Neg logit contributions
isk-0.011
aks-0.009
arl-0.009
ives-0.009
aid-0.009
Token Now
Token ID2735
Feature activation-4.20e-01

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
eh+0.001
ives+0.001
Neg logit contributions
inated-0.001
ised-0.001
ement-0.001
work-0.001
el-0.001
Token she
Token ID673
Feature activation+9.70e-01

Pos logit contributions
isk+0.021
aks+0.019
arl+0.018
eh+0.018
aid+0.018
Neg logit contributions
ised-0.015
inated-0.015
ement-0.015
el-0.014
 sunset-0.014
Token could
Token ID714
Feature activation+7.48e-01

Pos logit contributions
ised+0.046
inated+0.046
ement+0.046
izing+0.043
 horizon+0.042
Neg logit contributions
isk-0.037
aks-0.031
ives-0.031
aid-0.029
 comes-0.029
Token look
Token ID804
Feature activation+3.31e-01

Pos logit contributions
ement+0.039
ised+0.039
inated+0.038
work+0.037
el+0.036
Neg logit contributions
isk-0.025
aks-0.020
aid-0.020
coat-0.019
arl-0.018
Token at
Token ID379
Feature activation+1.78e+00

Pos logit contributions
ised+0.012
inated+0.012
ement+0.012
el+0.012
work+0.012
Neg logit contributions
isk-0.016
aks-0.014
arl-0.013
ives-0.013
eh-0.013
Token the
Token ID262
Feature activation+3.69e+00

Pos logit contributions
inated+0.085
ement+0.081
ised+0.079
itz+0.077
 int+0.076
Neg logit contributions
isk-0.064
aks-0.055
aid-0.050
 backup-0.049
ens-0.048
Token sun
Token ID4252
Feature activation+1.39e+00

Pos logit contributions
inated+0.133
used+0.133
itz+0.126
amb+0.125
ised+0.122
Neg logit contributions
 shady-0.136
 welcome-0.133
 cookies-0.130
 sees-0.129
 cookie-0.127
Tokensets
Token ID28709
Feature activation+1.58e+00

Pos logit contributions
ised+0.055
inated+0.054
ement+0.052
itz+0.048
el+0.047
Neg logit contributions
isk-0.063
coat-0.052
aks-0.052
ives-0.050
ens-0.050
Token pieces
Token ID5207
Feature activation+4.49e-01

Pos logit contributions
ised+0.064
inated+0.063
ement+0.061
el+0.059
itz+0.057
Neg logit contributions
isk-0.069
aks-0.059
ives-0.058
coat-0.057
eh-0.054
Token every
Token ID790
Feature activation+1.14e+00

Pos logit contributions
ised+0.018
inated+0.018
ement+0.018
work+0.017
el+0.017
Neg logit contributions
isk-0.020
aks-0.018
ives-0.017
arl-0.017
eh-0.016
Token day
Token ID1110
Feature activation+5.00e-01

Pos logit contributions
inated+0.054
ised+0.054
ement+0.053
work+0.050
itz+0.050
Neg logit contributions
isk-0.039
aks-0.037
eh-0.036
aster-0.032
 germ-0.032
Token They
Token ID1119
Feature activation+2.47e+00

Pos logit contributions
ised+0.075
ement+0.073
ized+0.071
izing+0.070
ths+0.068
Neg logit contributions
isk-0.049
aks-0.041
ives-0.040
 Luna-0.039
and-0.039
Token both
Token ID1111
Feature activation+2.10e+00

Pos logit contributions
ement+0.122
izing+0.116
 avail+0.109
inated+0.109
ised+0.106
Neg logit contributions
isk-0.080
aks-0.080
 came-0.075
 climbed-0.075
aid-0.072
Token marvel
Token ID22672
Feature activation+1.55e+00

Pos logit contributions
ement+0.115
izing+0.113
 avail+0.110
ised+0.107
inated+0.107
Neg logit contributions
aks-0.055
isk-0.051
 comes-0.048
ives-0.047
aid-0.046
Tokened
Token ID276
Feature activation+2.08e+00

Pos logit contributions
inated+0.071
 horizon+0.070
 sunset+0.069
ement+0.069
 avail+0.069
Neg logit contributions
isk-0.050
aks-0.049
ives-0.047
arl-0.043
smart-0.043
Token at
Token ID379
Feature activation+2.38e+00

Pos logit contributions
inated+0.088
itz+0.085
 avail+0.085
ised+0.084
ement+0.084
Neg logit contributions
aks-0.071
isk-0.068
smart-0.063
 comes-0.062
ives-0.062
Token the
Token ID262
Feature activation+3.69e+00

Pos logit contributions
ble+0.124
 horizon+0.119
inated+0.118
ised+0.117
inct+0.117
Neg logit contributions
isk-0.055
 Sue-0.051
aks-0.051
 Luna-0.048
 comes-0.048
Token comet
Token ID31733
Feature activation+2.46e+00

Pos logit contributions
inated+0.127
ised+0.114
used+0.112
bed+0.107
ths+0.105
Neg logit contributions
 wake-0.149
 welcome-0.141
 firefighter-0.139
 comes-0.139
 nice-0.138
Token.
Token ID13
Feature activation+2.25e+00

Pos logit contributions
ised+0.111
inated+0.106
ement+0.103
itz+0.102
 avail+0.098
Neg logit contributions
isk-0.088
aks-0.081
ives-0.073
smart-0.072
 comes-0.072
Token Its
Token ID6363
Feature activation+3.12e+00

Pos logit contributions
ement+0.113
ised+0.111
ized+0.109
izing+0.107
ble+0.106
Neg logit contributions
isk-0.068
aks-0.061
 comes-0.060
 Luna-0.060
 Kim-0.056
Token tail
Token ID7894
Feature activation+1.57e+00

Pos logit contributions
ement+0.126
ivating+0.120
yles+0.119
yle+0.118
ised+0.117
Neg logit contributions
 welcome-0.094
 shady-0.092
aks-0.091
 hug-0.090
 firefighter-0.085
Token spark
Token ID9009
Feature activation+1.42e+00

Pos logit contributions
ised+0.069
ement+0.068
inated+0.066
work+0.064
itz+0.063
Neg logit contributions
isk-0.060
aks-0.053
ives-0.051
 comes-0.049
coat-0.048
Token."
Token ID526
Feature activation+9.80e-01

Pos logit contributions
ised+0.044
inated+0.043
ement+0.043
el+0.040
itz+0.039
Neg logit contributions
isk-0.041
aks-0.038
ives-0.036
arl-0.035
aid-0.034
Token They
Token ID1119
Feature activation+1.64e+00

Pos logit contributions
ement+0.044
ised+0.044
inated+0.044
work+0.044
ths+0.042
Neg logit contributions
isk-0.038
arl-0.033
aks-0.032
ives-0.031
aid-0.030
Token stopped
Token ID5025
Feature activation+1.66e+00

Pos logit contributions
ement+0.080
inated+0.076
izing+0.076
ised+0.075
 horizon+0.071
Neg logit contributions
isk-0.062
aks-0.053
aid-0.050
eh-0.049
ens-0.048
Token to
Token ID284
Feature activation+9.53e-01

Pos logit contributions
ement+0.079
ised+0.077
inated+0.075
izing+0.073
ified+0.071
Neg logit contributions
isk-0.057
 comes-0.048
aks-0.048
smart-0.047
arl-0.047
Token watch
Token ID2342
Feature activation+1.41e+00

Pos logit contributions
ement+0.051
ised+0.051
inated+0.049
el+0.047
izing+0.047
Neg logit contributions
isk-0.032
aks-0.024
aid-0.024
arl-0.023
ives-0.022
Token the
Token ID262
Feature activation+3.68e+00

Pos logit contributions
ised+0.066
inated+0.065
ement+0.064
ths+0.062
itz+0.060
Neg logit contributions
isk-0.050
aks-0.043
ives-0.040
arl-0.040
 germ-0.040
Token sunset
Token ID26428
Feature activation+1.56e+00

Pos logit contributions
inated+0.103
used+0.100
ised+0.099
 avail+0.093
itz+0.093
Neg logit contributions
 welcome-0.162
 shady-0.162
 cookies-0.160
 banana-0.157
 cookie-0.154
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
ised+0.068
inated+0.067
ement+0.066
el+0.062
itz+0.062
Neg logit contributions
isk-0.061
aks-0.054
ives-0.052
coat-0.049
aid-0.048
Token Suddenly
Token ID24975
Feature activation+7.91e-01

Pos logit contributions
ement+0.071
ised+0.070
ths+0.069
work+0.069
izing+0.067
Neg logit contributions
isk-0.050
arl-0.045
aks-0.042
ives-0.041
 germ-0.037
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
inated+0.039
ement+0.039
ised+0.038
work+0.037
el+0.036
Neg logit contributions
isk-0.027
aks-0.024
 germ-0.022
arl-0.022
ives-0.022
Token Nina
Token ID36300
Feature activation+1.05e+00

Pos logit contributions
ement+0.083
inated+0.080
ised+0.076
work+0.075
ths+0.073
Neg logit contributions
isk-0.057
 germ-0.051
ens-0.049
 backup-0.047
aks-0.047
Token,
Token ID11
Feature activation-7.71e-01

Pos logit contributions
ised+0.010
inated+0.010
ement+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.009
aks-0.007
arl-0.007
ives-0.007
eh-0.007
Token we
Token ID356
Feature activation+2.12e-01

Pos logit contributions
isk+0.042
aks+0.038
eh+0.036
ives+0.035
arl+0.035
Neg logit contributions
ised-0.025
inated-0.024
ement-0.024
el-0.023
 comet-0.023
Token can
Token ID460
Feature activation+1.30e+00

Pos logit contributions
ised+0.011
ement+0.011
inated+0.011
work+0.011
el+0.011
Neg logit contributions
isk-0.007
aks-0.006
arl-0.005
ives-0.005
aid-0.005
Token go
Token ID467
Feature activation+4.97e-02

Pos logit contributions
ised+0.079
ement+0.079
inated+0.075
izing+0.074
work+0.074
Neg logit contributions
isk-0.032
aks-0.024
arl-0.022
ives-0.022
coat-0.022
Token see
Token ID766
Feature activation+1.61e+00

Pos logit contributions
ised+0.002
inated+0.002
ement+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token the
Token ID262
Feature activation+3.68e+00

Pos logit contributions
inated+0.072
ised+0.072
ement+0.069
work+0.067
itz+0.067
Neg logit contributions
isk-0.057
aks-0.056
 comes-0.050
smart-0.049
and-0.049
Token sunset
Token ID26428
Feature activation+9.73e-01

Pos logit contributions
used+0.150
bed+0.150
ised+0.148
inated+0.144
ica+0.139
Neg logit contributions
 welcome-0.134
 nice-0.124
 shady-0.123
 wake-0.119
 comes-0.118
Token."
Token ID526
Feature activation-2.18e-02

Pos logit contributions
ised+0.040
ement+0.039
inated+0.039
el+0.036
itz+0.035
Neg logit contributions
isk-0.043
aks-0.039
ives-0.036
arl-0.035
coat-0.035
Token
Token ID198
Feature activation-6.75e-02

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
eh+0.001
ives+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
el-0.001
work-0.001
Token
Token ID198
Feature activation+2.96e-01

Pos logit contributions
isk+0.003
aks+0.003
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
ised-0.003
ement-0.003
inated-0.003
el-0.003
work-0.002
TokenAs
Token ID1722
Feature activation+6.98e-01

Pos logit contributions
ement+0.019
inated+0.019
ised+0.019
work+0.019
el+0.018
Neg logit contributions
isk-0.007
aks-0.005
eh-0.004
arl-0.004
ives-0.004
Token of
Token ID286
Feature activation+7.02e-01

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.007
el+0.007
Neg logit contributions
isk-0.005
aks-0.004
arl-0.003
ives-0.003
eh-0.003
Token her
Token ID607
Feature activation+1.69e+00

Pos logit contributions
ised+0.035
ement+0.035
inated+0.035
ths+0.033
work+0.033
Neg logit contributions
isk-0.026
aks-0.020
arl-0.019
 germ-0.019
aid-0.019
Token with
Token ID351
Feature activation+8.44e-01

Pos logit contributions
inated+0.082
ement+0.079
ised+0.079
 int+0.077
ths+0.076
Neg logit contributions
isk-0.062
coat-0.049
ives-0.047
aks-0.047
arl-0.047
Token her
Token ID607
Feature activation+1.42e+00

Pos logit contributions
ement+0.039
inated+0.038
ised+0.038
work+0.036
 horizon+0.036
Neg logit contributions
isk-0.033
aks-0.029
aid-0.028
ives-0.027
arl-0.027
Token favorite
Token ID4004
Feature activation+7.65e-01

Pos logit contributions
inated+0.067
ised+0.066
ths+0.064
 horizon+0.062
ement+0.062
Neg logit contributions
isk-0.052
coat-0.045
ives-0.044
aks-0.042
 robber-0.042
Token toy
Token ID13373
Feature activation+3.68e+00

Pos logit contributions
ised+0.034
inated+0.034
ement+0.032
ths+0.032
work+0.031
Neg logit contributions
isk-0.031
aks-0.026
 germ-0.026
coat-0.026
aid-0.026
Token.
Token ID13
Feature activation-5.09e-01

Pos logit contributions
ised+0.211
ement+0.195
 int+0.194
uct+0.194
 avail+0.193
Neg logit contributions
isk-0.085
 germ-0.084
 robber-0.079
coat-0.078
 v-0.076
Token Lily
Token ID20037
Feature activation+1.48e+00

Pos logit contributions
isk+0.021
aks+0.018
eh+0.017
aid+0.017
arl+0.017
Neg logit contributions
inated-0.023
ised-0.023
ement-0.022
 sunset-0.022
el-0.021
Token smiled
Token ID13541
Feature activation+1.97e+00

Pos logit contributions
ement+0.075
ised+0.070
 horizon+0.069
work+0.069
inated+0.067
Neg logit contributions
isk-0.054
coat-0.043
aks-0.041
ives-0.040
ens-0.039
Token and
Token ID290
Feature activation+6.55e-01

Pos logit contributions
ement+0.112
used+0.107
inated+0.106
ised+0.105
el+0.104
Neg logit contributions
isk-0.050
coat-0.037
 germ-0.033
 robber-0.032
ens-0.031
Token said
Token ID531
Feature activation-8.52e-01

Pos logit contributions
ement+0.030
ised+0.029
inated+0.028
work+0.028
el+0.027
Neg logit contributions
isk-0.027
aks-0.023
coat-0.022
aid-0.021
 germ-0.021
Token parents
Token ID3397
Feature activation+5.81e-01

Pos logit contributions
ement+0.032
ised+0.032
inated+0.032
work+0.030
el+0.029
Neg logit contributions
isk-0.039
arl-0.034
ives-0.033
 germ-0.032
aks-0.032
Token,
Token ID11
Feature activation+9.27e-03

Pos logit contributions
ised+0.023
inated+0.023
ement+0.022
el+0.021
 sunset+0.021
Neg logit contributions
isk-0.027
aks-0.023
ives-0.022
arl-0.022
aid-0.022
Token even
Token ID772
Feature activation-2.71e-02

Pos logit contributions
ised+0.000
ement+0.000
inated+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token if
Token ID611
Feature activation+1.80e+00

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
Token we
Token ID356
Feature activation+1.29e+00

Pos logit contributions
ement+0.082
inated+0.079
ised+0.075
 horizon+0.073
work+0.072
Neg logit contributions
isk-0.065
 germ-0.062
 backup-0.060
aks-0.060
 comes-0.059
Token don
Token ID836
Feature activation+3.68e+00

Pos logit contributions
ement+0.072
ised+0.070
inated+0.069
el+0.065
work+0.065
Neg logit contributions
isk-0.040
ives-0.033
aks-0.031
aid-0.031
arl-0.031
Token't
Token ID470
Feature activation+1.70e+00

Pos logit contributions
ishment+0.190
ement+0.187
work+0.186
inated+0.184
 horizon+0.180
Neg logit contributions
isk-0.106
ives-0.094
arl-0.089
iren-0.089
aks-0.086
Token want
Token ID765
Feature activation+1.77e+00

Pos logit contributions
ement+0.100
ised+0.093
inated+0.091
work+0.091
 horizon+0.088
Neg logit contributions
isk-0.050
arl-0.040
coat-0.040
ives-0.039
aid-0.039
Token to
Token ID284
Feature activation+1.88e+00

Pos logit contributions
ised+0.097
ement+0.096
inated+0.096
work+0.091
inct+0.088
Neg logit contributions
isk-0.043
aks-0.043
ives-0.043
 backup-0.042
smart-0.042
Token.
Token ID13
Feature activation+8.78e-01

Pos logit contributions
ement+0.101
ised+0.096
inated+0.093
work+0.092
 horizon+0.089
Neg logit contributions
isk-0.059
coat-0.051
arl-0.049
 germ-0.048
aid-0.048
Token<|endoftext|>
Token ID50256
Feature activation+4.78e-01

Pos logit contributions
ised+0.040
ement+0.040
inated+0.040
work+0.038
 horizon+0.037
Neg logit contributions
isk-0.036
aks-0.029
arl-0.028
ives-0.028
aid-0.028
Token,
Token ID11
Feature activation+7.70e-01

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token while
Token ID981
Feature activation+1.12e+00

Pos logit contributions
ement+0.042
inated+0.041
ised+0.040
 horizon+0.040
work+0.039
Neg logit contributions
isk-0.024
aks-0.019
eh-0.019
aid-0.018
ens-0.018
Token Tim
Token ID5045
Feature activation+2.10e-02

Pos logit contributions
ement+0.060
inated+0.060
ised+0.059
work+0.057
 horizon+0.056
Neg logit contributions
isk-0.034
aks-0.030
eh-0.028
smart-0.026
ens-0.025
Token was
Token ID373
Feature activation-1.69e-01

Pos logit contributions
ised+0.001
inated+0.001
ement+0.001
el+0.001
work+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
aid-0.001
Token watching
Token ID4964
Feature activation+7.74e-01

Pos logit contributions
isk+0.008
aks+0.007
arl+0.006
ives+0.006
aid+0.006
Neg logit contributions
inated-0.007
ised-0.007
ement-0.007
work-0.006
el-0.006
Token the
Token ID262
Feature activation+3.67e+00

Pos logit contributions
ised+0.036
ement+0.036
inated+0.035
work+0.033
el+0.033
Neg logit contributions
isk-0.030
aks-0.025
 germ-0.024
eh-0.023
arl-0.023
Token sunset
Token ID26428
Feature activation+9.38e-01

Pos logit contributions
used+0.118
ised+0.116
yle+0.113
ica+0.108
 avail+0.108
Neg logit contributions
 welcome-0.155
 wake-0.149
 banana-0.146
 shady-0.146
 comes-0.144
Token,
Token ID11
Feature activation+4.86e-01

Pos logit contributions
ised+0.043
inated+0.043
ement+0.042
itz+0.039
work+0.039
Neg logit contributions
isk-0.035
aks-0.031
ives-0.029
eh-0.028
 germ-0.028
Token he
Token ID339
Feature activation+5.76e-01

Pos logit contributions
ement+0.024
inated+0.024
ised+0.024
work+0.023
el+0.022
Neg logit contributions
isk-0.018
aks-0.015
eh-0.014
aid-0.014
 germ-0.014
Token saw
Token ID2497
Feature activation+2.82e-01

Pos logit contributions
ement+0.035
inated+0.034
ised+0.034
work+0.033
el+0.032
Neg logit contributions
isk-0.015
aks-0.012
eh-0.011
 germ-0.010
ives-0.010
Token something
Token ID1223
Feature activation+1.92e+00

Pos logit contributions
ised+0.011
ement+0.011
inated+0.011
work+0.011
el+0.011
Neg logit contributions
isk-0.013
aks-0.011
aid-0.010
eh-0.010
arl-0.010
Token important
Token ID1593
Feature activation+4.67e-01

Pos logit contributions
ement+0.013
ised+0.013
inated+0.013
el+0.012
work+0.012
Neg logit contributions
isk-0.015
aks-0.013
arl-0.012
eh-0.012
ives-0.012
Token?"
Token ID1701
Feature activation+9.72e-01

Pos logit contributions
ement+0.019
ised+0.019
inated+0.019
el+0.018
work+0.018
Neg logit contributions
isk-0.021
aks-0.018
arl-0.017
ives-0.017
eh-0.017
Token
Token ID198
Feature activation+5.15e-01

Pos logit contributions
ised+0.046
ement+0.046
inated+0.046
izing+0.044
work+0.044
Neg logit contributions
isk-0.035
aks-0.029
smart-0.028
aid-0.027
eh-0.027
Token
Token ID198
Feature activation+3.94e-01

Pos logit contributions
inated+0.023
ement+0.023
ised+0.023
work+0.023
el+0.022
Neg logit contributions
isk-0.021
aks-0.018
arl-0.017
aid-0.017
eh-0.016
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
ement+0.017
inated+0.017
ised+0.017
work+0.016
el+0.016
Neg logit contributions
isk-0.017
aks-0.015
aid-0.014
eh-0.014
arl-0.014
Token voice
Token ID3809
Feature activation+3.67e+00

Pos logit contributions
isk+0.013
aks+0.011
ives+0.011
coat+0.011
aid+0.011
Neg logit contributions
ised-0.010
inated-0.010
ement-0.010
 sunset-0.010
work-0.010
Token replied
Token ID8712
Feature activation+5.98e-01

Pos logit contributions
inated+0.165
 horizon+0.162
 avail+0.161
 sunset+0.156
yles+0.155
Neg logit contributions
isk-0.108
 comes-0.107
aks-0.103
smart-0.102
aid-0.101
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
ised+0.027
inated+0.026
ement+0.026
 horizon+0.025
el+0.025
Neg logit contributions
isk-0.024
aks-0.020
arl-0.019
eh-0.019
ives-0.019
Token "
Token ID366
Feature activation+1.73e+00

Pos logit contributions
ised+0.072
ement+0.071
el+0.071
inated+0.070
work+0.069
Neg logit contributions
isk-0.031
arl-0.022
aks-0.021
eh-0.020
smart-0.020
TokenThis
Token ID1212
Feature activation+1.78e-01

Pos logit contributions
ement+0.081
ised+0.075
 avail+0.073
inated+0.073
 curiously+0.070
Neg logit contributions
isk-0.066
and-0.059
aid-0.059
ives-0.057
aks-0.056
Token bridge
Token ID7696
Feature activation-1.42e-01

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.007
aks-0.006
arl-0.005
ives-0.005
eh-0.005
Token own
Token ID898
Feature activation+3.29e-01

Pos logit contributions
ised+0.004
ement+0.004
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token way
Token ID835
Feature activation+1.72e+00

Pos logit contributions
ised+0.013
inated+0.013
ement+0.013
work+0.012
el+0.012
Neg logit contributions
isk-0.015
aks-0.013
arl-0.013
ives-0.012
eh-0.012
Token and
Token ID290
Feature activation+3.73e-01

Pos logit contributions
ement+0.077
inated+0.075
ised+0.074
 sunset+0.070
ths+0.069
Neg logit contributions
isk-0.064
aks-0.059
ives-0.055
arl-0.055
 backup-0.054
Token that
Token ID326
Feature activation+1.45e+00

Pos logit contributions
ement+0.016
ised+0.016
inated+0.016
el+0.015
work+0.015
Neg logit contributions
isk-0.016
aks-0.013
arl-0.013
ives-0.012
aid-0.012
Token we
Token ID356
Feature activation+1.69e+00

Pos logit contributions
ement+0.061
inated+0.061
ised+0.059
el+0.055
itz+0.055
Neg logit contributions
isk-0.060
aks-0.055
 germ-0.050
smart-0.050
 comes-0.050
Token don
Token ID836
Feature activation+3.67e+00

Pos logit contributions
ement+0.101
inated+0.097
ised+0.096
 horizon+0.093
itz+0.093
Neg logit contributions
isk-0.045
aks-0.037
ives-0.037
aid-0.033
arl-0.032
Token't
Token ID470
Feature activation+1.05e+00

Pos logit contributions
ishment+0.183
ement+0.180
inated+0.179
 reflection+0.177
work+0.174
Neg logit contributions
isk-0.110
ives-0.100
aks-0.096
arl-0.092
smart-0.091
Token always
Token ID1464
Feature activation+2.67e-01

Pos logit contributions
ement+0.061
ised+0.059
inated+0.059
el+0.056
work+0.056
Neg logit contributions
isk-0.031
aks-0.024
arl-0.023
ives-0.022
aid-0.022
Token need
Token ID761
Feature activation+1.26e+00

Pos logit contributions
ised+0.014
ement+0.014
inated+0.013
work+0.013
el+0.013
Neg logit contributions
isk-0.009
aks-0.008
arl-0.007
ives-0.007
aid-0.007
Token material
Token ID2587
Feature activation+1.59e-01

Pos logit contributions
ement+0.061
inated+0.061
ised+0.061
work+0.058
el+0.057
Neg logit contributions
isk-0.043
aks-0.040
 backup-0.038
ives-0.037
smart-0.036
Token possessions
Token ID23309
Feature activation+4.97e-01

Pos logit contributions
ised+0.006
ement+0.006
inated+0.006
work+0.006
el+0.006
Neg logit contributions
isk-0.007
aks-0.006
arl-0.006
ives-0.006
eh-0.006
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
eh+0.001
ives+0.001
Neg logit contributions
ised-0.000
ement-0.000
inated-0.000
el-0.000
work-0.000
Token
Token ID198
Feature activation-4.03e-01

Pos logit contributions
isk+0.013
aks+0.011
eh+0.010
aid+0.010
ives+0.010
Neg logit contributions
inated-0.010
ised-0.010
ement-0.010
el-0.009
 sunset-0.009
Token
Token ID198
Feature activation-5.54e-01

Pos logit contributions
isk+0.020
aks+0.017
eh+0.017
ives+0.017
coat+0.017
Neg logit contributions
ised-0.015
inated-0.014
ement-0.014
el-0.014
 sunset-0.013
Token"
Token ID1
Feature activation+3.41e-01

Pos logit contributions
isk+0.025
ives+0.019
coat+0.019
 germ+0.019
aks+0.019
Neg logit contributions
ised-0.025
inated-0.024
el-0.023
ement-0.023
 sunset-0.022
TokenIn
Token ID818
Feature activation-1.42e-01

Pos logit contributions
ement+0.013
ised+0.013
inated+0.012
el+0.012
work+0.012
Neg logit contributions
isk-0.017
aks-0.015
arl-0.014
aid-0.014
ives-0.014
Token a
Token ID257
Feature activation-3.71e+00

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.006
aid+0.006
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
el-0.005
work-0.005
Token box
Token ID3091
Feature activation-9.59e-01

Pos logit contributions
isk+0.197
ives+0.185
aid+0.179
ows+0.177
arl+0.162
Neg logit contributions
 top-0.131
 sunset-0.120
 comet-0.102
 destination-0.102
 sight-0.099
Token under
Token ID739
Feature activation-4.24e-02

Pos logit contributions
isk+0.051
aks+0.044
aid+0.043
ives+0.043
arl+0.042
Neg logit contributions
ement-0.033
ised-0.032
el-0.032
inated-0.030
work-0.030
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
aid+0.002
Neg logit contributions
ised-0.002
ement-0.002
inated-0.002
el-0.002
work-0.001
Token tree
Token ID5509
Feature activation-1.10e+00

Pos logit contributions
isk+0.059
aks+0.049
ives+0.048
aid+0.046
smart+0.046
Neg logit contributions
 sunset-0.044
 horizon-0.044
 comet-0.042
 top-0.042
el-0.041
Token."
Token ID526
Feature activation+6.51e-01

Pos logit contributions
isk+0.059
aid+0.053
arl+0.052
aks+0.051
ives+0.050
Neg logit contributions
ised-0.034
el-0.034
inated-0.034
ement-0.033
work-0.032
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
isk+0.026
aks+0.023
eh+0.022
arl+0.022
ives+0.022
Neg logit contributions
ement-0.013
ised-0.013
el-0.012
inated-0.012
work-0.012
Token
Token ID198
Feature activation-1.63e-01

Pos logit contributions
isk+0.008
aks+0.007
eh+0.006
arl+0.006
ives+0.006
Neg logit contributions
inated-0.007
ement-0.007
ised-0.006
el-0.006
work-0.006
Token
Token ID198
Feature activation-5.00e-01

Pos logit contributions
isk+0.008
aks+0.007
ives+0.006
arl+0.006
eh+0.006
Neg logit contributions
ised-0.006
ement-0.006
inated-0.006
el-0.006
 sunset-0.006
TokenBen
Token ID11696
Feature activation-1.04e-01

Pos logit contributions
isk+0.021
aks+0.016
ives+0.016
 germ+0.016
arl+0.016
Neg logit contributions
ised-0.023
inated-0.023
el-0.022
ement-0.022
 sunset-0.021
Token takes
Token ID2753
Feature activation-1.30e+00

Pos logit contributions
isk+0.004
aks+0.004
arl+0.003
eh+0.003
aid+0.003
Neg logit contributions
ement-0.005
ised-0.005
inated-0.005
el-0.004
work-0.004
Token a
Token ID257
Feature activation-3.66e+00

Pos logit contributions
isk+0.075
arl+0.065
coat+0.064
ives+0.061
eh+0.060
Neg logit contributions
el-0.040
 top-0.038
 sunset-0.037
ement-0.036
ised-0.036
Token bite
Token ID13197
Feature activation-7.57e-01

Pos logit contributions
isk+0.178
 villains+0.141
aks+0.128
ives+0.126
and+0.123
Neg logit contributions
 top-0.145
 int-0.141
 sunset-0.137
el-0.135
 curiously-0.135
Token of
Token ID286
Feature activation-9.01e-01

Pos logit contributions
isk+0.036
eh+0.028
arl+0.028
aks+0.027
coat+0.027
Neg logit contributions
el-0.032
ement-0.032
ised-0.031
 sunset-0.030
inated-0.029
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
isk+0.043
ives+0.037
aks+0.036
arl+0.036
coat+0.035
Neg logit contributions
el-0.034
ement-0.034
 sunset-0.034
ised-0.034
inated-0.034
Token banana
Token ID25996
Feature activation+3.69e-02

Pos logit contributions
isk+0.077
aks+0.062
smart+0.060
arl+0.059
ives+0.058
Neg logit contributions
 sunset-0.108
 top-0.106
ement-0.103
el-0.101
 reflection-0.097
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token them
Token ID606
Feature activation-1.92e-01

Pos logit contributions
isk+0.031
eh+0.027
aks+0.027
coat+0.027
ives+0.026
Neg logit contributions
ised-0.020
ement-0.020
el-0.019
inated-0.019
 sunset-0.019
Token.
Token ID13
Feature activation-1.64e+00

Pos logit contributions
isk+0.010
aks+0.009
eh+0.008
arl+0.008
ives+0.008
Neg logit contributions
inated-0.007
ised-0.007
ement-0.007
el-0.006
work-0.006
Token He
Token ID679
Feature activation-1.89e+00

Pos logit contributions
isk+0.083
eh+0.080
coat+0.069
aks+0.068
arl+0.067
Neg logit contributions
el-0.057
ement-0.057
inated-0.056
ised-0.054
work-0.053
Token lives
Token ID3160
Feature activation-1.75e+00

Pos logit contributions
isk+0.060
aid+0.056
and+0.055
coat+0.055
eh+0.052
Neg logit contributions
el-0.094
ement-0.092
 curiously-0.089
inated-0.086
inct-0.086
Token in
Token ID287
Feature activation-7.84e-01

Pos logit contributions
isk+0.115
ives+0.103
 bisc+0.102
coat+0.102
 robber+0.100
Neg logit contributions
ised-0.036
 curiously-0.033
ized-0.032
ement-0.031
work-0.031
Token a
Token ID257
Feature activation-3.65e+00

Pos logit contributions
isk+0.043
ives+0.039
eh+0.038
coat+0.038
arl+0.037
Neg logit contributions
el-0.025
ised-0.024
 sunset-0.024
ement-0.023
work-0.022
Token house
Token ID2156
Feature activation-2.06e+00

Pos logit contributions
isk+0.220
ives+0.217
ows+0.196
coat+0.188
aid+0.188
Neg logit contributions
 top-0.113
 sunset-0.112
 comet-0.099
 horizon-0.092
 sight-0.090
Token near
Token ID1474
Feature activation-4.00e-01

Pos logit contributions
isk+0.135
aid+0.122
arl+0.120
ives+0.120
eh+0.119
Neg logit contributions
ement-0.042
ised-0.041
el-0.038
work-0.038
ized-0.036
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
isk+0.020
eh+0.018
ives+0.018
arl+0.017
aid+0.017
Neg logit contributions
ised-0.014
el-0.014
ement-0.014
inated-0.013
 sunset-0.013
Token woods
Token ID16479
Feature activation-1.09e+00

Pos logit contributions
isk+0.008
aks+0.007
ives+0.007
aid+0.006
arl+0.006
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
 sunset-0.006
work-0.006
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
isk+0.068
aid+0.061
arl+0.060
eh+0.059
coat+0.059
Neg logit contributions
ement-0.026
ised-0.026
work-0.024
inated-0.024
el-0.024
Token She
Token ID1375
Feature activation-1.58e+00

Pos logit contributions
isk+0.059
eh+0.054
aks+0.052
aid+0.051
 germ+0.048
Neg logit contributions
inated-0.038
ised-0.035
ement-0.035
 sunset-0.034
el-0.033
Token turned
Token ID2900
Feature activation-6.15e-01

Pos logit contributions
arl+0.076
isk+0.073
ives+0.068
aks+0.066
and+0.063
Neg logit contributions
ised-0.054
el-0.053
 curiously-0.051
 suddenly-0.050
inated-0.050
Token on
Token ID319
Feature activation-5.86e-01

Pos logit contributions
isk+0.035
eh+0.028
aks+0.028
arl+0.028
ives+0.027
Neg logit contributions
ised-0.020
el-0.020
ement-0.020
 sunset-0.019
inated-0.019
Token the
Token ID262
Feature activation-7.01e-01

Pos logit contributions
isk+0.029
aks+0.025
eh+0.025
ives+0.024
aid+0.024
Neg logit contributions
ement-0.022
ised-0.022
el-0.022
inated-0.021
 sunset-0.021
Token f
Token ID277
Feature activation-2.05e+00

Pos logit contributions
isk+0.032
aks+0.026
eh+0.024
aid+0.023
ives+0.023
Neg logit contributions
ement-0.030
inated-0.029
ised-0.029
 horizon-0.029
 sunset-0.029
Tokenauc
Token ID14272
Feature activation-3.62e+00

Pos logit contributions
isk+0.080
 fetch+0.066
 robber+0.061
 shady+0.061
 robbers+0.060
Neg logit contributions
ement-0.104
work-0.094
el-0.092
ised-0.091
inated-0.084
Tokenet
Token ID316
Feature activation-1.03e+00

Pos logit contributions
 backup+0.194
isk+0.181
 shady+0.180
arl+0.177
 liar+0.177
Neg logit contributions
el-0.083
ement-0.081
work-0.078
ised-0.075
inated-0.067
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
isk+0.054
aks+0.048
eh+0.047
arl+0.046
aid+0.045
Neg logit contributions
ement-0.034
work-0.034
el-0.034
ised-0.033
inated-0.032
Token filled
Token ID5901
Feature activation-5.31e-01

Pos logit contributions
isk+0.054
arl+0.047
ives+0.046
aks+0.045
eh+0.042
Neg logit contributions
ised-0.042
inated-0.040
work-0.040
el-0.039
 sunset-0.039
Token the
Token ID262
Feature activation-8.45e-01

Pos logit contributions
isk+0.028
ives+0.024
arl+0.024
aks+0.023
eh+0.023
Neg logit contributions
ised-0.018
ement-0.018
inated-0.018
el-0.018
 sunset-0.017
Token sink
Token ID14595
Feature activation-1.17e+00

Pos logit contributions
isk+0.037
aks+0.028
smart+0.028
aid+0.028
ives+0.028
Neg logit contributions
 sunset-0.037
ement-0.037
ised-0.036
el-0.036
inated-0.036
Token a
Token ID257
Feature activation-2.53e+00

Pos logit contributions
isk+0.119
arl+0.112
ives+0.110
ows+0.104
and+0.102
Neg logit contributions
el-0.043
 sunset-0.038
 once-0.036
ised-0.035
 top-0.034
Token phone
Token ID3072
Feature activation-9.34e-01

Pos logit contributions
isk+0.133
ives+0.120
and+0.118
aks+0.110
aked+0.104
Neg logit contributions
 comet-0.087
 sunset-0.085
 top-0.083
 reflection-0.080
 curiously-0.077
Token.
Token ID13
Feature activation-1.80e+00

Pos logit contributions
isk+0.048
aks+0.043
eh+0.040
arl+0.040
ives+0.040
Neg logit contributions
inated-0.032
work-0.031
ement-0.031
ised-0.031
el-0.030
Token He
Token ID679
Feature activation-8.39e-01

Pos logit contributions
isk+0.094
coat+0.081
eh+0.078
aks+0.077
akers+0.073
Neg logit contributions
el-0.062
ement-0.060
inated-0.059
ised-0.057
 sunset-0.056
Token calls
Token ID3848
Feature activation-3.10e+00

Pos logit contributions
isk+0.032
arl+0.027
aks+0.026
aid+0.026
coat+0.026
Neg logit contributions
ement-0.039
inated-0.038
el-0.038
ised-0.037
 curiously-0.036
Token a
Token ID257
Feature activation-3.60e+00

Pos logit contributions
isk+0.177
eh+0.176
aid+0.170
aks+0.168
coat+0.168
Neg logit contributions
 curiously-0.070
el-0.069
 top-0.069
 mummy-0.065
 int-0.064
Token doctor
Token ID6253
Feature activation-1.80e+00

Pos logit contributions
isk+0.187
aks+0.152
ives+0.152
aked+0.151
atters+0.148
Neg logit contributions
 comet-0.124
 top-0.118
 sunset-0.118
 reflection-0.110
 mother-0.106
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
isk+0.107
eh+0.097
aks+0.097
arl+0.097
aid+0.096
Neg logit contributions
work-0.046
ement-0.039
el-0.038
inated-0.037
 int-0.034
Token He
Token ID679
Feature activation-7.82e-01

Pos logit contributions
isk+0.096
coat+0.084
aks+0.083
eh+0.082
ives+0.077
Neg logit contributions
el-0.055
inated-0.054
ement-0.054
ised-0.053
 sunset-0.052
Token also
Token ID635
Feature activation-1.68e+00

Pos logit contributions
isk+0.030
aks+0.025
arl+0.025
coat+0.024
eh+0.023
Neg logit contributions
ement-0.036
inated-0.036
el-0.035
ised-0.035
work-0.034
Token calls
Token ID3848
Feature activation-3.15e+00

Pos logit contributions
isk+0.073
and+0.066
coat+0.066
arl+0.063
eh+0.061
Neg logit contributions
el-0.063
work-0.061
inated-0.060
 int-0.059
 curiously-0.059
Token
Token ID198
Feature activation-4.83e-01

Pos logit contributions
isk+0.012
aks+0.010
eh+0.010
aid+0.009
arl+0.009
Neg logit contributions
inated-0.011
ised-0.011
ement-0.011
el-0.010
 sunset-0.010
Token
Token ID198
Feature activation-5.84e-01

Pos logit contributions
isk+0.024
aks+0.021
eh+0.020
coat+0.020
ives+0.020
Neg logit contributions
ised-0.017
inated-0.017
ement-0.017
el-0.016
 sunset-0.016
TokenA
Token ID32
Feature activation-1.76e+00

Pos logit contributions
isk+0.026
aks+0.019
coat+0.019
 germ+0.019
smart+0.019
Neg logit contributions
ised-0.026
inated-0.026
el-0.024
ement-0.024
 sunset-0.024
Token man
Token ID582
Feature activation-2.03e+00

Pos logit contributions
isk+0.096
ives+0.074
aks+0.072
and+0.071
osc+0.070
Neg logit contributions
 sunset-0.068
 comet-0.067
 brightly-0.061
ised-0.061
 top-0.060
Token in
Token ID287
Feature activation-9.35e-01

Pos logit contributions
isk+0.071
coat+0.060
arl+0.059
aid+0.054
eh+0.053
Neg logit contributions
el-0.109
ement-0.103
used-0.103
 curiously-0.101
work-0.098
Token a
Token ID257
Feature activation-3.60e+00

Pos logit contributions
isk+0.052
ives+0.046
eh+0.045
coat+0.045
aid+0.044
Neg logit contributions
el-0.030
ised-0.029
ement-0.028
inated-0.027
 sunset-0.027
Token blue
Token ID4171
Feature activation-2.07e+00

Pos logit contributions
isk+0.218
ives+0.198
coat+0.182
ows+0.181
aid+0.180
Neg logit contributions
 top-0.119
 sunset-0.108
 comet-0.105
 sight-0.095
 horizon-0.093
Token uniform
Token ID8187
Feature activation-5.07e-01

Pos logit contributions
isk+0.105
ives+0.092
aid+0.091
eh+0.089
arl+0.088
Neg logit contributions
el-0.084
 sunset-0.070
 top-0.068
ement-0.067
 comet-0.066
Token runs
Token ID4539
Feature activation-1.41e+00

Pos logit contributions
isk+0.021
aks+0.018
arl+0.017
eh+0.017
aid+0.017
Neg logit contributions
ement-0.022
inated-0.022
ised-0.022
work-0.021
el-0.021
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
isk+0.074
aid+0.065
aks+0.064
eh+0.063
 germ+0.061
Neg logit contributions
work-0.044
inated-0.044
el-0.044
 int-0.043
 sunset-0.042
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
isk+0.062
aks+0.060
ives+0.059
eh+0.058
coat+0.057
Neg logit contributions
 curiously-0.039
el-0.038
 sunset-0.038
 top-0.037
inated-0.036
Token loved
Token ID6151
Feature activation-1.30e+00

Pos logit contributions
ised+0.001
inated+0.001
ement+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
isk+0.059
aks+0.058
ives+0.055
arl+0.054
tted+0.053
Neg logit contributions
 sunset-0.051
inated-0.047
el-0.046
work-0.044
 int-0.044
Token play
Token ID711
Feature activation-1.71e+00

Pos logit contributions
isk+0.073
aks+0.071
ives+0.070
arl+0.067
and+0.067
Neg logit contributions
inated-0.073
 sunset-0.073
el-0.068
 comet-0.068
 top-0.066
Token with
Token ID351
Feature activation-9.92e-01

Pos logit contributions
isk+0.097
arl+0.091
ives+0.087
aks+0.086
rost+0.086
Neg logit contributions
ised-0.046
 top-0.039
inated-0.039
izing-0.038
 curiously-0.038
Token Wh
Token ID854
Feature activation-2.01e+00

Pos logit contributions
isk+0.035
aks+0.034
ives+0.032
arl+0.032
aid+0.030
Neg logit contributions
el-0.047
ised-0.046
inated-0.045
 sunset-0.043
ement-0.043
Tokenisk
Token ID1984
Feature activation-3.59e+00

Pos logit contributions
aid+0.017
 firefighter+0.017
 backup+0.016
aks+0.015
 germ+0.013
Neg logit contributions
el-0.142
ement-0.134
ised-0.130
itz-0.130
work-0.128
Tokeners
Token ID364
Feature activation-6.97e-01

Pos logit contributions
arl+0.125
 bisc+0.113
 germ+0.107
 shady+0.103
isk+0.099
Neg logit contributions
work-0.154
ised-0.147
ized-0.146
izing-0.144
ble-0.141
Token,
Token ID11
Feature activation-2.04e+00

Pos logit contributions
isk+0.036
aks+0.033
arl+0.033
aid+0.031
eh+0.031
Neg logit contributions
ised-0.022
ement-0.022
inated-0.022
work-0.022
el-0.021
Token but
Token ID475
Feature activation-9.60e-01

Pos logit contributions
aks+0.119
arl+0.111
ives+0.110
aked+0.107
aid+0.104
Neg logit contributions
 mostly-0.051
 suddenly-0.050
 curiously-0.049
 sunset-0.049
 just-0.049
Token one
Token ID530
Feature activation-1.34e+00

Pos logit contributions
isk+0.040
ives+0.040
aks+0.039
arl+0.037
eh+0.035
Neg logit contributions
ised-0.039
 sunset-0.038
el-0.037
inated-0.037
work-0.036
Token day
Token ID1110
Feature activation-9.61e-01

Pos logit contributions
isk+0.081
arl+0.078
ives+0.074
 backup+0.074
aks+0.073
Neg logit contributions
el-0.031
 sunset-0.030
 once-0.029
work-0.029
 comet-0.028
Token He
Token ID679
Feature activation-9.26e-01

Pos logit contributions
isk+0.070
eh+0.061
aks+0.059
akers+0.054
coat+0.054
Neg logit contributions
inated-0.053
ement-0.050
 sunset-0.050
ised-0.049
 top-0.049
Token drank
Token ID24070
Feature activation-7.76e-01

Pos logit contributions
isk+0.041
arl+0.038
aks+0.034
ives+0.034
smart+0.032
Neg logit contributions
ised-0.038
inated-0.037
el-0.037
ement-0.036
 curiously-0.036
Token from
Token ID422
Feature activation+3.66e-01

Pos logit contributions
isk+0.042
aks+0.034
eh+0.034
ives+0.033
arl+0.033
Neg logit contributions
ised-0.027
 sunset-0.027
ement-0.026
el-0.026
inated-0.026
Token the
Token ID262
Feature activation-6.65e-01

Pos logit contributions
ement+0.018
ised+0.018
inated+0.018
work+0.017
el+0.017
Neg logit contributions
isk-0.014
aks-0.012
eh-0.011
arl-0.011
aid-0.011
Token f
Token ID277
Feature activation-1.68e+00

Pos logit contributions
isk+0.030
aks+0.024
ives+0.023
smart+0.022
aid+0.022
Neg logit contributions
ement-0.028
 sunset-0.028
ised-0.028
inated-0.028
 horizon-0.028
Tokenauc
Token ID14272
Feature activation-3.59e+00

Pos logit contributions
isk+0.061
 robber+0.052
 fetch+0.051
 robbers+0.051
 firefighter+0.045
Neg logit contributions
ement-0.089
el-0.085
ised-0.082
work-0.081
inated-0.076
Tokenet
Token ID316
Feature activation-1.83e-01

Pos logit contributions
 backup+0.201
 robbers+0.189
 robber+0.187
and+0.186
 charity+0.184
Neg logit contributions
el-0.087
ement-0.073
ised-0.069
work-0.068
inated-0.060
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
isk+0.010
aks+0.009
arl+0.008
eh+0.008
ives+0.008
Neg logit contributions
ised-0.006
ement-0.006
inated-0.006
el-0.006
work-0.006
Token But
Token ID887
Feature activation-4.04e-01

Pos logit contributions
isk+0.069
eh+0.060
aks+0.058
coat+0.054
akers+0.053
Neg logit contributions
inated-0.051
 sunset-0.048
ement-0.048
ised-0.047
 top-0.047
Token he
Token ID339
Feature activation-5.74e-01

Pos logit contributions
isk+0.021
aks+0.018
ives+0.018
arl+0.018
coat+0.017
Neg logit contributions
ised-0.014
ement-0.014
inated-0.014
el-0.013
 sunset-0.013
Token still
Token ID991
Feature activation-5.56e-01

Pos logit contributions
isk+0.027
arl+0.024
aks+0.023
ives+0.022
aid+0.022
Neg logit contributions
ised-0.022
ement-0.022
inated-0.021
el-0.021
work-0.021
Token
Token ID198
Feature activation-3.05e-02

Pos logit contributions
ement+0.032
ised+0.031
inated+0.031
work+0.030
el+0.030
Neg logit contributions
isk-0.024
aks-0.021
arl-0.020
ives-0.019
 backup-0.018
Token
Token ID198
Feature activation-1.98e-01

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
el-0.001
work-0.001
TokenA
Token ID32
Feature activation-1.77e+00

Pos logit contributions
isk+0.010
aks+0.008
ives+0.008
arl+0.008
coat+0.008
Neg logit contributions
ised-0.008
inated-0.008
ement-0.007
el-0.007
work-0.007
Token man
Token ID582
Feature activation-2.05e+00

Pos logit contributions
isk+0.094
ives+0.076
aks+0.070
atters+0.069
ens+0.068
Neg logit contributions
 sunset-0.068
 comet-0.067
ised-0.065
 brightly-0.063
work-0.062
Token in
Token ID287
Feature activation-8.47e-01

Pos logit contributions
isk+0.092
arl+0.090
coat+0.077
 germ+0.074
ives+0.073
Neg logit contributions
el-0.087
ement-0.085
 curiously-0.083
 suddenly-0.083
used-0.079
Token a
Token ID257
Feature activation-3.59e+00

Pos logit contributions
isk+0.046
ives+0.041
eh+0.040
coat+0.040
aid+0.039
Neg logit contributions
el-0.028
ised-0.027
ement-0.026
inated-0.025
 sunset-0.025
Token boat
Token ID8848
Feature activation-1.58e+00

Pos logit contributions
isk+0.221
ives+0.204
ows+0.190
ens+0.184
coat+0.183
Neg logit contributions
 top-0.117
 sunset-0.105
 comet-0.097
 horizon-0.091
 brightly-0.089
Token saw
Token ID2497
Feature activation-9.83e-01

Pos logit contributions
isk+0.069
arl+0.059
ives+0.056
 germ+0.056
aid+0.055
Neg logit contributions
work-0.067
ement-0.066
ised-0.065
el-0.065
inated-0.064
Token them
Token ID606
Feature activation+9.37e-01

Pos logit contributions
isk+0.056
coat+0.051
eh+0.049
aks+0.048
aid+0.048
Neg logit contributions
ement-0.029
el-0.029
ised-0.028
inated-0.026
work-0.026
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
ised+0.042
ement+0.041
inated+0.040
work+0.038
ths+0.037
Neg logit contributions
isk-0.039
aks-0.034
arl-0.031
ives-0.031
smart-0.030
Token came
Token ID1625
Feature activation-6.90e-01

Pos logit contributions
isk+0.073
arl+0.071
ives+0.067
aks+0.063
coat+0.062
Neg logit contributions
ised-0.049
 suddenly-0.048
 curiously-0.048
work-0.047
inated-0.047
Token shells
Token ID19679
Feature activation+1.39e-01

Pos logit contributions
isk+0.010
aks+0.008
ives+0.008
arl+0.008
eh+0.008
Neg logit contributions
ised-0.009
ement-0.009
inated-0.009
work-0.009
 sunset-0.009
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.005
el+0.005
Neg logit contributions
isk-0.007
aks-0.006
arl-0.006
ives-0.006
eh-0.006
Token They
Token ID1119
Feature activation-8.21e-02

Pos logit contributions
isk+0.001
aks+0.001
eh+0.001
arl+0.001
ives+0.001
Neg logit contributions
inated-0.001
ised-0.001
ement-0.001
el-0.001
work-0.001
Token hear
Token ID3285
Feature activation+1.67e-01

Pos logit contributions
isk+0.003
aks+0.002
arl+0.002
eh+0.002
ives+0.002
Neg logit contributions
ised-0.004
inated-0.004
ement-0.004
work-0.004
el-0.004
Token the
Token ID262
Feature activation+3.12e-01

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.006
el+0.006
Neg logit contributions
isk-0.008
aks-0.007
arl-0.006
ives-0.006
eh-0.006
Token se
Token ID384
Feature activation-3.57e+00

Pos logit contributions
ised+0.013
inated+0.013
ement+0.013
el+0.012
work+0.012
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.011
eh-0.011
Tokenag
Token ID363
Feature activation-2.97e-01

Pos logit contributions
 robbers+0.135
 shady+0.130
isk+0.126
 backup+0.121
 cafeteria+0.120
Neg logit contributions
ement-0.145
el-0.136
 int-0.132
work-0.130
ised-0.127
Tokenull
Token ID724
Feature activation-4.36e-01

Pos logit contributions
isk+0.015
aks+0.013
ives+0.012
arl+0.012
aid+0.012
Neg logit contributions
ised-0.011
ement-0.011
inated-0.011
el-0.010
work-0.010
Tokens
Token ID82
Feature activation+2.45e-01

Pos logit contributions
isk+0.023
aks+0.020
eh+0.019
arl+0.019
ives+0.019
Neg logit contributions
inated-0.015
ement-0.015
ised-0.014
el-0.014
work-0.014
Token and
Token ID290
Feature activation-6.10e-01

Pos logit contributions
ised+0.010
ement+0.010
inated+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.011
aks-0.010
arl-0.009
ives-0.009
aid-0.009
Token the
Token ID262
Feature activation-7.18e-01

Pos logit contributions
isk+0.027
aks+0.022
arl+0.022
ives+0.022
eh+0.022
Neg logit contributions
inated-0.026
ised-0.026
 sunset-0.026
work-0.025
el-0.025
Token He
Token ID679
Feature activation+9.52e-02

Pos logit contributions
isk+0.027
aks+0.023
eh+0.022
ives+0.022
aid+0.021
Neg logit contributions
inated-0.027
ement-0.027
ised-0.027
 sunset-0.025
el-0.025
Token said
Token ID531
Feature activation-1.46e+00

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.003
el+0.003
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token he
Token ID339
Feature activation-3.58e-01

Pos logit contributions
isk+0.084
aks+0.075
eh+0.070
coat+0.069
ives+0.069
Neg logit contributions
ement-0.045
 sunset-0.045
ised-0.042
work-0.042
 int-0.041
Token was
Token ID373
Feature activation-4.71e-01

Pos logit contributions
isk+0.016
arl+0.014
aks+0.014
eh+0.014
ives+0.013
Neg logit contributions
ement-0.014
ised-0.014
inated-0.014
work-0.013
el-0.013
Token in
Token ID287
Feature activation-1.22e+00

Pos logit contributions
isk+0.020
ives+0.016
aks+0.016
arl+0.016
coat+0.015
Neg logit contributions
ement-0.020
inated-0.020
ised-0.020
work-0.020
 sunset-0.019
Token a
Token ID257
Feature activation-3.57e+00

Pos logit contributions
isk+0.066
ives+0.057
eh+0.056
coat+0.056
aid+0.055
Neg logit contributions
el-0.038
 sunset-0.038
ised-0.037
ement-0.036
 top-0.035
Token hurry
Token ID23290
Feature activation-1.42e+00

Pos logit contributions
isk+0.184
ives+0.178
ows+0.163
aid+0.154
ens+0.147
Neg logit contributions
 top-0.125
 sunset-0.123
 comet-0.111
 sight-0.109
 destination-0.107
Token.
Token ID13
Feature activation-6.52e-01

Pos logit contributions
isk+0.079
eh+0.071
aks+0.071
aid+0.070
ives+0.069
Neg logit contributions
work-0.041
ised-0.041
ement-0.039
 horizon-0.038
inated-0.037
Token He
Token ID679
Feature activation-2.97e-02

Pos logit contributions
isk+0.029
aks+0.025
eh+0.024
ives+0.023
coat+0.023
Neg logit contributions
inated-0.027
ement-0.027
ised-0.027
 sunset-0.025
el-0.025
Token had
Token ID550
Feature activation-9.38e-01

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
work-0.001
el-0.001
Token to
Token ID284
Feature activation-1.35e+00

Pos logit contributions
isk+0.049
arl+0.044
ives+0.041
aks+0.041
aid+0.039
Neg logit contributions
ised-0.030
inated-0.030
 sunset-0.030
el-0.029
work-0.028
Token fun
Token ID1257
Feature activation+9.79e-01

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.002
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token making
Token ID1642
Feature activation+9.31e-02

Pos logit contributions
inated+0.042
ement+0.041
ised+0.040
 horizon+0.039
el+0.038
Neg logit contributions
isk-0.040
aks-0.038
eh-0.034
ives-0.033
arl-0.033
Token mud
Token ID17492
Feature activation-1.90e+00

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.003
el+0.003
Neg logit contributions
isk-0.004
aks-0.004
arl-0.004
ives-0.003
eh-0.003
Token pies
Token ID47199
Feature activation-1.41e+00

Pos logit contributions
isk+0.074
aks+0.065
arl+0.064
 backup+0.063
aid+0.061
Neg logit contributions
work-0.084
ised-0.084
el-0.081
inated-0.080
ement-0.080
Token,
Token ID11
Feature activation-1.67e+00

Pos logit contributions
isk+0.076
 germ+0.067
aid+0.066
arl+0.065
aks+0.065
Neg logit contributions
ised-0.043
work-0.043
ement-0.043
inated-0.042
 sunset-0.041
Token cars
Token ID5006
Feature activation-3.56e+00

Pos logit contributions
isk+0.084
arl+0.080
ives+0.077
aks+0.077
aid+0.074
Neg logit contributions
 sunset-0.055
ised-0.053
work-0.052
inated-0.051
 horizon-0.051
Token,
Token ID11
Feature activation-2.07e+00

Pos logit contributions
 germ+0.219
isk+0.214
mare+0.205
arl+0.201
aid+0.197
Neg logit contributions
el-0.057
ised-0.055
 top-0.055
work-0.052
 both-0.047
Token and
Token ID290
Feature activation-2.26e+00

Pos logit contributions
isk+0.108
arl+0.100
ives+0.100
aks+0.097
aid+0.093
Neg logit contributions
el-0.064
 sunset-0.064
 top-0.063
ised-0.061
work-0.061
Token castles
Token ID49203
Feature activation-1.96e+00

Pos logit contributions
isk+0.105
arl+0.098
ives+0.093
aks+0.089
eh+0.086
Neg logit contributions
 sunset-0.091
 diamonds-0.084
work-0.083
ised-0.081
el-0.081
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
isk+0.110
aid+0.098
arl+0.098
 germ+0.097
eh+0.096
Neg logit contributions
work-0.054
ement-0.054
ised-0.052
 sunset-0.050
 top-0.049
Token But
Token ID887
Feature activation+1.80e-01

Pos logit contributions
ised+0.018
ement+0.017
inated+0.017
work+0.016
el+0.016
Neg logit contributions
isk-0.015
aks-0.012
arl-0.012
ives-0.011
eh-0.011
Tokenopard
Token ID15478
Feature activation-9.00e-01

Pos logit contributions
isk+0.034
aks+0.030
smart+0.027
 germ+0.027
arl+0.027
Neg logit contributions
inated-0.035
el-0.035
ement-0.034
ised-0.034
work-0.033
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
isk+0.050
arl+0.045
aks+0.044
aid+0.044
eh+0.042
Neg logit contributions
inated-0.026
ement-0.025
ised-0.025
work-0.025
el-0.025
Token Tim
Token ID5045
Feature activation-1.37e+00

Pos logit contributions
isk+0.064
coat+0.055
aks+0.054
eh+0.054
aid+0.054
Neg logit contributions
inated-0.038
ement-0.035
ised-0.035
el-0.034
 curiously-0.034
Token runs
Token ID4539
Feature activation-2.04e+00

Pos logit contributions
isk+0.064
arl+0.057
eh+0.056
aks+0.055
aid+0.054
Neg logit contributions
el-0.051
inated-0.048
 curiously-0.048
ement-0.047
 top-0.046
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
isk+0.116
aks+0.104
eh+0.102
 germ+0.099
aid+0.097
Neg logit contributions
 top-0.056
 once-0.050
 curiously-0.049
 closely-0.049
 suddenly-0.048
Token a
Token ID257
Feature activation-3.53e+00

Pos logit contributions
isk+0.057
ives+0.055
aks+0.055
coat+0.053
arl+0.052
Neg logit contributions
 curiously-0.040
 top-0.039
el-0.039
 sunset-0.038
inated-0.037
Token man
Token ID582
Feature activation-1.96e+00

Pos logit contributions
isk+0.192
ives+0.185
aked+0.170
aid+0.162
aks+0.160
Neg logit contributions
 top-0.121
 sunset-0.117
 comet-0.110
 horizon-0.105
 reflection-0.105
Token who
Token ID508
Feature activation-2.28e+00

Pos logit contributions
isk+0.112
arl+0.107
 germ+0.100
 robber+0.098
coat+0.097
Neg logit contributions
el-0.054
 curiously-0.049
work-0.049
inated-0.048
 closely-0.047
Token works
Token ID2499
Feature activation-5.58e-01

Pos logit contributions
isk+0.091
and+0.087
arl+0.086
coat+0.082
aid+0.078
Neg logit contributions
el-0.097
ement-0.090
work-0.087
 comet-0.086
 curiously-0.085
Token in
Token ID287
Feature activation-1.27e+00

Pos logit contributions
isk+0.031
ives+0.026
 germ+0.026
arl+0.025
coat+0.025
Neg logit contributions
ised-0.018
ement-0.018
el-0.017
inated-0.016
izing-0.016
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
isk+0.070
ives+0.066
arl+0.063
coat+0.062
eh+0.061
Neg logit contributions
el-0.039
 sunset-0.035
ised-0.035
 top-0.034
ement-0.033
Token talks
Token ID6130
Feature activation-2.05e+00

Pos logit contributions
isk+0.040
aks+0.035
arl+0.034
aid+0.034
and+0.033
Neg logit contributions
inated-0.050
ement-0.049
el-0.048
ised-0.048
 curiously-0.047
Token to
Token ID284
Feature activation-6.41e-01

Pos logit contributions
isk+0.116
eh+0.110
aks+0.104
aster+0.099
cod+0.096
Neg logit contributions
 once-0.054
 int-0.054
 brightly-0.054
 curiously-0.053
work-0.053
Token the
Token ID262
Feature activation-9.21e-01

Pos logit contributions
isk+0.032
aks+0.030
ives+0.030
arl+0.029
aid+0.029
Neg logit contributions
ised-0.021
inated-0.021
el-0.021
ement-0.021
 sunset-0.021
Token dog
Token ID3290
Feature activation-1.05e+00

Pos logit contributions
isk+0.039
aks+0.032
eh+0.031
ives+0.030
smart+0.030
Neg logit contributions
 sunset-0.041
ised-0.040
inated-0.040
 comet-0.039
ement-0.039
Token in
Token ID287
Feature activation-1.40e+00

Pos logit contributions
isk+0.060
aks+0.056
arl+0.055
eh+0.054
aid+0.054
Neg logit contributions
inated-0.027
ement-0.026
el-0.025
 curiously-0.025
work-0.025
Token a
Token ID257
Feature activation-3.53e+00

Pos logit contributions
isk+0.076
ives+0.071
eh+0.069
coat+0.068
aid+0.066
Neg logit contributions
el-0.047
ised-0.041
 top-0.040
 sunset-0.039
ement-0.039
Token soft
Token ID2705
Feature activation-2.79e+00

Pos logit contributions
isk+0.205
ives+0.180
rost+0.160
aid+0.157
and+0.156
Neg logit contributions
 top-0.113
 sunset-0.102
el-0.098
 comet-0.093
 jet-0.092
Token voice
Token ID3809
Feature activation-6.34e-01

Pos logit contributions
ives+0.156
isk+0.152
aid+0.138
 backup+0.137
oke+0.136
Neg logit contributions
el-0.096
ised-0.081
inated-0.080
 sunset-0.080
 top-0.076
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
isk+0.038
aks+0.033
eh+0.033
arl+0.032
ives+0.032
Neg logit contributions
ement-0.018
inated-0.016
ised-0.016
work-0.016
el-0.016
Token He
Token ID679
Feature activation+3.55e-01

Pos logit contributions
isk+0.057
aks+0.049
eh+0.049
coat+0.048
aid+0.047
Neg logit contributions
inated-0.039
 sunset-0.038
ement-0.038
ised-0.037
 int-0.036
Token gives
Token ID3607
Feature activation-7.20e-01

Pos logit contributions
ised+0.018
ement+0.018
inated+0.018
work+0.017
el+0.017
Neg logit contributions
isk-0.013
aks-0.010
ives-0.010
arl-0.010
eh-0.009
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
isk+0.041
aks+0.035
arl+0.035
 germ+0.035
eh+0.035
Neg logit contributions
inated-0.024
el-0.023
ised-0.023
ement-0.023
 int-0.022
Token He
Token ID679
Feature activation-6.75e-01

Pos logit contributions
isk+0.063
eh+0.055
aks+0.053
coat+0.052
aid+0.050
Neg logit contributions
ised-0.048
ement-0.046
inated-0.046
 sunset-0.046
 top-0.045
Token takes
Token ID2753
Feature activation-1.85e+00

Pos logit contributions
isk+0.025
aks+0.021
arl+0.021
aid+0.020
eh+0.020
Neg logit contributions
inated-0.031
ement-0.031
ised-0.031
el-0.031
work-0.029
Token them
Token ID606
Feature activation-7.52e-01

Pos logit contributions
isk+0.112
arl+0.101
coat+0.099
eh+0.097
ives+0.096
Neg logit contributions
el-0.052
 top-0.044
ised-0.043
 sunset-0.042
 curiously-0.038
Token to
Token ID284
Feature activation-1.96e+00

Pos logit contributions
isk+0.038
aks+0.032
eh+0.032
arl+0.032
aid+0.031
Neg logit contributions
inated-0.026
ised-0.025
el-0.025
ement-0.025
work-0.025
Token a
Token ID257
Feature activation-3.50e+00

Pos logit contributions
ives+0.115
isk+0.108
coat+0.107
aks+0.107
eh+0.106
Neg logit contributions
 sunset-0.050
 top-0.046
 curiously-0.042
el-0.042
 avail-0.040
Token small
Token ID1402
Feature activation-2.16e+00

Pos logit contributions
ives+0.185
isk+0.182
aid+0.167
aked+0.165
ows+0.160
Neg logit contributions
 top-0.124
 sunset-0.117
 comet-0.104
 horizon-0.098
 destination-0.097
Token room
Token ID2119
Feature activation-1.88e+00

Pos logit contributions
isk+0.113
ives+0.103
aid+0.100
arl+0.098
aks+0.093
Neg logit contributions
 sunset-0.075
 top-0.074
el-0.074
ised-0.072
 horizon-0.070
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
isk+0.122
aid+0.108
eh+0.107
aks+0.106
 germ+0.105
Neg logit contributions
ement-0.041
ised-0.037
work-0.036
 int-0.035
 curiously-0.034
Token He
Token ID679
Feature activation-3.11e-01

Pos logit contributions
isk+0.077
eh+0.064
coat+0.064
aks+0.064
aid+0.060
Neg logit contributions
inated-0.051
 sunset-0.051
ised-0.051
ement-0.051
 top-0.048
Token locks
Token ID19253
Feature activation-7.30e-01

Pos logit contributions
isk+0.012
aks+0.010
arl+0.010
aid+0.009
ives+0.009
Neg logit contributions
inated-0.015
ement-0.015
ised-0.015
el-0.014
work-0.014
Token their
Token ID511
Feature activation-7.59e-01

Pos logit contributions
isk+0.002
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
el-0.002
work-0.002
Token hats
Token ID23910
Feature activation-6.80e-01

Pos logit contributions
isk+0.029
ives+0.024
aks+0.024
arl+0.023
aid+0.023
Neg logit contributions
ement-0.036
ised-0.035
inated-0.035
 sunset-0.035
el-0.035
Token and
Token ID290
Feature activation-1.77e+00

Pos logit contributions
isk+0.034
aks+0.029
eh+0.028
arl+0.028
aid+0.028
Neg logit contributions
ised-0.024
ement-0.024
inated-0.024
work-0.024
el-0.023
Token coats
Token ID30720
Feature activation-3.12e-01

Pos logit contributions
isk+0.079
and+0.066
ives+0.065
arl+0.065
smart+0.061
Neg logit contributions
 sunset-0.074
el-0.071
ised-0.070
work-0.069
inated-0.069
Token and
Token ID290
Feature activation-2.15e+00

Pos logit contributions
isk+0.015
aks+0.013
eh+0.013
arl+0.012
ives+0.012
Neg logit contributions
ised-0.012
ement-0.012
inated-0.012
work-0.011
el-0.011
Token sne
Token ID10505
Feature activation-3.50e+00

Pos logit contributions
isk+0.098
and+0.080
arl+0.079
ives+0.077
smart+0.074
Neg logit contributions
 sunset-0.089
ised-0.082
work-0.081
el-0.081
inated-0.081
Tokenaked
Token ID4335
Feature activation-1.44e+00

Pos logit contributions
 shady+0.096
isk+0.096
 cafeteria+0.086
 ranger+0.085
 backup+0.080
Neg logit contributions
el-0.179
ised-0.159
efully-0.146
work-0.144
ement-0.143
Token out
Token ID503
Feature activation-9.25e-01

Pos logit contributions
isk+0.076
arl+0.063
aid+0.063
aks+0.062
ives+0.061
Neg logit contributions
ised-0.049
inated-0.047
ement-0.047
work-0.047
 int-0.047
Token of
Token ID286
Feature activation-8.29e-01

Pos logit contributions
isk+0.047
ives+0.038
aks+0.038
aid+0.038
eh+0.038
Neg logit contributions
el-0.033
ement-0.033
work-0.033
ised-0.033
inated-0.032
Token the
Token ID262
Feature activation-1.46e+00

Pos logit contributions
isk+0.044
ives+0.039
eh+0.037
aks+0.037
arl+0.036
Neg logit contributions
ised-0.027
 sunset-0.027
el-0.027
ement-0.026
inated-0.025
Token house
Token ID2156
Feature activation-1.26e+00

Pos logit contributions
isk+0.069
ives+0.057
aid+0.056
smart+0.054
aks+0.053
Neg logit contributions
 sunset-0.060
 horizon-0.058
el-0.056
 comet-0.056
work-0.055
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
isk+0.026
aks+0.023
ives+0.023
arl+0.023
aid+0.023
Neg logit contributions
ised-0.015
ement-0.014
inated-0.014
work-0.013
 sunset-0.013
Token
Token ID198
Feature activation-5.27e-01

Pos logit contributions
ement+0.017
ised+0.017
inated+0.016
work+0.016
el+0.016
Neg logit contributions
isk-0.016
aks-0.013
arl-0.013
eh-0.013
ives-0.012
Token
Token ID198
Feature activation-6.58e-01

Pos logit contributions
isk+0.026
aks+0.022
arl+0.022
ives+0.022
aid+0.021
Neg logit contributions
ised-0.019
ement-0.019
inated-0.019
el-0.018
 sunset-0.018
TokenIn
Token ID818
Feature activation-3.63e-01

Pos logit contributions
isk+0.030
arl+0.026
ives+0.026
aks+0.026
 germ+0.026
Neg logit contributions
inated-0.027
ised-0.027
 sunset-0.024
izing-0.023
ement-0.023
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
isk+0.019
ives+0.017
aks+0.016
arl+0.016
aid+0.016
Neg logit contributions
ised-0.013
inated-0.012
ement-0.012
el-0.012
 sunset-0.012
Token middle
Token ID3504
Feature activation-3.48e+00

Pos logit contributions
isk+0.110
aid+0.109
ives+0.108
coat+0.101
and+0.100
Neg logit contributions
 sunset-0.064
 top-0.062
 horizon-0.058
 destination-0.048
 comet-0.048
Token of
Token ID286
Feature activation-6.70e-01

Pos logit contributions
isk+0.230
 robber+0.205
arl+0.205
coat+0.204
 liar+0.197
Neg logit contributions
el-0.089
ement-0.084
 once-0.074
ised-0.072
 top-0.062
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
isk+0.036
ives+0.035
aid+0.032
arl+0.032
coat+0.032
Neg logit contributions
ised-0.021
 sunset-0.021
el-0.020
inated-0.020
work-0.019
Token night
Token ID1755
Feature activation-1.29e+00

Pos logit contributions
isk+0.063
aid+0.059
ives+0.056
arl+0.053
coat+0.053
Neg logit contributions
 sunset-0.047
 horizon-0.043
 top-0.042
 comet-0.042
ised-0.040
Token,
Token ID11
Feature activation+9.33e-01

Pos logit contributions
isk+0.073
aid+0.062
arl+0.061
aks+0.060
coat+0.058
Neg logit contributions
 sunset-0.043
ised-0.042
 horizon-0.041
ement-0.041
work-0.040
Token Tim
Token ID5045
Feature activation-1.53e+00

Pos logit contributions
ement+0.052
inated+0.050
ised+0.049
el+0.048
work+0.048
Neg logit contributions
isk-0.028
aks-0.023
eh-0.022
ens-0.021
 robber-0.020
Token fell
Token ID3214
Feature activation-1.76e-01

Pos logit contributions
ised+0.000
inated+0.000
ement+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token asleep
Token ID16039
Feature activation-2.20e-01

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.006
eh+0.006
Neg logit contributions
ised-0.008
inated-0.008
ement-0.008
el-0.008
work-0.008
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
isk+0.011
aks+0.010
arl+0.009
ives+0.009
eh+0.009
Neg logit contributions
ised-0.008
ement-0.008
inated-0.008
work-0.007
el-0.007
Token In
Token ID554
Feature activation+2.13e-01

Pos logit contributions
isk+0.007
aks+0.007
eh+0.006
ives+0.006
arl+0.006
Neg logit contributions
inated-0.006
ised-0.006
ement-0.006
 sunset-0.006
el-0.006
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.010
aks-0.008
arl-0.008
eh-0.008
ives-0.008
Token middle
Token ID3504
Feature activation-3.48e+00

Pos logit contributions
isk+0.059
aks+0.054
ives+0.054
aid+0.051
arl+0.050
Neg logit contributions
 sunset-0.029
 horizon-0.026
ement-0.026
el-0.025
 top-0.025
Token of
Token ID286
Feature activation+2.67e-01

Pos logit contributions
isk+0.214
 robber+0.193
arl+0.187
inters+0.177
 stret+0.177
Neg logit contributions
ement-0.111
 once-0.104
el-0.103
 sunset-0.088
 suddenly-0.081
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
ement+0.011
inated+0.011
ised+0.011
work+0.010
el+0.010
Neg logit contributions
isk-0.012
aks-0.010
arl-0.010
eh-0.010
aid-0.010
Token night
Token ID1755
Feature activation-1.45e+00

Pos logit contributions
isk+0.018
aks+0.016
ives+0.016
arl+0.015
aid+0.015
Neg logit contributions
 sunset-0.011
ement-0.011
ised-0.011
inated-0.011
el-0.010
Token,
Token ID11
Feature activation+8.81e-03

Pos logit contributions
isk+0.082
arl+0.069
aks+0.069
aid+0.067
 robber+0.065
Neg logit contributions
 sunset-0.050
ement-0.049
ised-0.048
 horizon-0.047
el-0.046
Token she
Token ID673
Feature activation-7.48e-01

Pos logit contributions
ised+0.000
ement+0.000
inated+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token.
Token ID13
Feature activation-9.16e-01

Pos logit contributions
isk+0.063
eh+0.055
 germ+0.054
aks+0.053
arl+0.052
Neg logit contributions
work-0.038
ised-0.038
ement-0.037
el-0.037
inated-0.037
Token She
Token ID1375
Feature activation-2.26e-01

Pos logit contributions
isk+0.047
aks+0.040
eh+0.040
aid+0.038
 germ+0.037
Neg logit contributions
inated-0.034
ement-0.032
ised-0.032
 sunset-0.032
 int-0.030
Token puts
Token ID7584
Feature activation-6.86e-01

Pos logit contributions
isk+0.008
aks+0.007
arl+0.007
eh+0.006
ives+0.006
Neg logit contributions
ement-0.011
ised-0.011
inated-0.011
el-0.011
work-0.011
Token them
Token ID606
Feature activation-4.85e-02

Pos logit contributions
isk+0.036
eh+0.030
aks+0.030
arl+0.030
ives+0.030
Neg logit contributions
ised-0.023
ement-0.023
el-0.023
inated-0.022
work-0.022
Token in
Token ID287
Feature activation-7.07e-02

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
work-0.002
el-0.002
Token a
Token ID257
Feature activation-3.47e+00

Pos logit contributions
isk+0.003
aks+0.003
ives+0.003
arl+0.003
eh+0.003
Neg logit contributions
ised-0.003
ement-0.003
inated-0.003
el-0.003
work-0.002
Token bucket
Token ID19236
Feature activation-2.06e+00

Pos logit contributions
isk+0.200
ives+0.168
and+0.163
ens+0.157
aid+0.149
Neg logit contributions
 top-0.119
 sunset-0.105
 jet-0.099
ement-0.093
 comet-0.093
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
isk+0.126
 germ+0.108
 robber+0.106
 backup+0.105
arl+0.102
Neg logit contributions
el-0.057
ement-0.053
ised-0.053
work-0.050
 horizon-0.048
Token She
Token ID1375
Feature activation+1.03e-01

Pos logit contributions
isk+0.054
aks+0.046
eh+0.046
aid+0.044
 germ+0.043
Neg logit contributions
inated-0.038
ement-0.036
ised-0.036
 sunset-0.036
 int-0.034
Token fills
Token ID23816
Feature activation+2.19e-01

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.005
el+0.005
Neg logit contributions
isk-0.004
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token the
Token ID262
Feature activation-6.77e-01

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.010
aks-0.009
arl-0.008
eh-0.008
ives-0.008
Token a
Token ID257
Feature activation-1.68e+00

Pos logit contributions
isk+0.092
arl+0.080
aks+0.076
ives+0.076
eh+0.073
Neg logit contributions
el-0.057
 sunset-0.057
ised-0.056
inated-0.055
work-0.054
Token big
Token ID1263
Feature activation-2.20e+00

Pos logit contributions
isk+0.084
ives+0.071
aks+0.071
arl+0.066
and+0.065
Neg logit contributions
 sunset-0.063
 comet-0.062
ement-0.062
 top-0.060
 int-0.059
Token chair
Token ID5118
Feature activation-1.72e+00

Pos logit contributions
isk+0.109
ives+0.093
and+0.091
arl+0.087
aks+0.086
Neg logit contributions
ement-0.087
 sunset-0.087
el-0.085
 comet-0.084
 top-0.083
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
isk+0.093
aks+0.084
eh+0.083
arl+0.081
aid+0.080
Neg logit contributions
ement-0.054
work-0.053
ised-0.049
el-0.048
inated-0.048
Token a
Token ID257
Feature activation-2.44e+00

Pos logit contributions
isk+0.055
arl+0.047
ives+0.046
aks+0.046
eh+0.044
Neg logit contributions
ement-0.034
ised-0.034
inated-0.033
el-0.033
 sunset-0.032
Token loud
Token ID7812
Feature activation-3.45e+00

Pos logit contributions
isk+0.123
aks+0.103
ives+0.103
aked+0.095
and+0.093
Neg logit contributions
 comet-0.096
 sunset-0.095
 top-0.093
ement-0.089
 brightly-0.087
Token drill
Token ID16007
Feature activation-1.67e+00

Pos logit contributions
isk+0.176
ives+0.164
aks+0.156
and+0.152
arl+0.148
Neg logit contributions
 sunset-0.119
el-0.112
 comet-0.112
 top-0.111
ement-0.111
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
isk+0.088
aks+0.079
eh+0.077
arl+0.076
aid+0.076
Neg logit contributions
ement-0.054
work-0.053
inated-0.052
ised-0.052
el-0.050
Token a
Token ID257
Feature activation-1.89e+00

Pos logit contributions
isk+0.061
arl+0.053
aks+0.051
ives+0.050
eh+0.049
Neg logit contributions
ised-0.036
 sunset-0.034
inated-0.034
work-0.034
ement-0.033
Token bitter
Token ID12922
Feature activation-2.11e+00

Pos logit contributions
isk+0.092
aks+0.076
ives+0.073
arl+0.069
ens+0.068
Neg logit contributions
 sunset-0.075
 comet-0.074
 top-0.074
ement-0.073
 int-0.071
Token paste
Token ID17008
Feature activation-1.13e+00

Pos logit contributions
isk+0.104
aks+0.089
ives+0.083
arl+0.081
smart+0.080
Neg logit contributions
 sunset-0.082
ement-0.080
el-0.077
 comet-0.076
 top-0.075
Token drawing
Token ID8263
Feature activation-1.24e-01

Pos logit contributions
ement+0.081
inated+0.078
ised+0.078
work+0.075
itz+0.070
Neg logit contributions
isk-0.057
 germ-0.049
arl-0.048
aks-0.048
 backup-0.046
Token on
Token ID319
Feature activation+1.11e+00

Pos logit contributions
isk+0.006
aks+0.005
arl+0.005
eh+0.005
ives+0.005
Neg logit contributions
ement-0.005
ised-0.005
inated-0.004
work-0.004
el-0.004
Token anything
Token ID1997
Feature activation+6.53e-01

Pos logit contributions
inated+0.047
ised+0.047
ement+0.046
work+0.043
itz+0.042
Neg logit contributions
isk-0.046
aks-0.041
 germ-0.038
smart-0.038
 backup-0.037
Token again
Token ID757
Feature activation+4.54e-01

Pos logit contributions
inated+0.027
ised+0.027
ement+0.027
work+0.026
el+0.025
Neg logit contributions
isk-0.028
aks-0.025
ives-0.024
arl-0.023
aid-0.023
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
inated+0.017
ised+0.016
ement+0.016
work+0.015
el+0.015
Neg logit contributions
isk-0.022
aks-0.020
arl-0.019
ives-0.019
eh-0.018
Token From
Token ID3574
Feature activation+2.45e+00

Pos logit contributions
ised+0.012
ement+0.012
inated+0.012
work+0.012
el+0.011
Neg logit contributions
isk-0.013
aks-0.011
arl-0.011
ives-0.011
aid-0.011
Token that
Token ID326
Feature activation+6.69e-01

Pos logit contributions
ement+0.131
inated+0.130
work+0.125
inct+0.124
 summit+0.123
Neg logit contributions
aid-0.060
isk-0.059
ens-0.056
aks-0.052
arl-0.048
Token day
Token ID1110
Feature activation-3.51e-01

Pos logit contributions
ised+0.031
ement+0.031
inated+0.031
work+0.028
el+0.028
Neg logit contributions
isk-0.026
aks-0.023
eh-0.021
 germ-0.021
arl-0.021
Token on
Token ID319
Feature activation+4.30e-01

Pos logit contributions
isk+0.018
aks+0.015
ives+0.014
arl+0.014
eh+0.014
Neg logit contributions
ised-0.013
ement-0.013
inated-0.012
el-0.012
 sunset-0.012
Token,
Token ID11
Feature activation-3.88e-01

Pos logit contributions
inated+0.021
ised+0.020
ement+0.020
work+0.020
 horizon+0.019
Neg logit contributions
isk-0.016
aks-0.014
arl-0.013
eh-0.013
ives-0.012
Token she
Token ID673
Feature activation+8.26e-01

Pos logit contributions
isk+0.013
aks+0.012
arl+0.011
ives+0.010
eh+0.010
Neg logit contributions
ised-0.020
ement-0.020
inated-0.020
 sunset-0.020
el-0.019
Token it
Token ID340
Feature activation-1.02e-01

Pos logit contributions
ement+0.124
 horizon+0.115
ised+0.115
itz+0.109
efully+0.108
Neg logit contributions
eh-0.059
 backup-0.058
 germ-0.058
aks-0.057
 robber-0.056
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
isk+0.005
aks+0.005
arl+0.004
ives+0.004
aid+0.004
Neg logit contributions
ised-0.003
inated-0.003
ement-0.003
work-0.003
el-0.003
Token grand
Token ID4490
Feature activation+3.11e-01

Pos logit contributions
ised+0.054
inated+0.052
ement+0.052
work+0.049
ized+0.048
Neg logit contributions
isk-0.047
aks-0.041
arl-0.039
 germ-0.038
aid-0.038
Tokenpa
Token ID8957
Feature activation-1.88e-01

Pos logit contributions
ised+0.011
inated+0.011
ement+0.010
work+0.010
el+0.010
Neg logit contributions
isk-0.016
aks-0.014
arl-0.014
eh-0.013
ives-0.013
Token?",
Token ID35379
Feature activation+1.42e+00

Pos logit contributions
isk+0.010
aks+0.009
ives+0.009
arl+0.009
aid+0.009
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
work-0.005
el-0.005
Token Mary
Token ID5335
Feature activation+2.53e+00

Pos logit contributions
inated+0.068
ement+0.067
ised+0.067
work+0.067
 horizon+0.064
Neg logit contributions
isk-0.046
aks-0.042
aid-0.038
smart-0.038
aster-0.038
Token asked
Token ID1965
Feature activation+9.87e-01

Pos logit contributions
ement+0.126
ised+0.119
work+0.119
 horizon+0.117
izing+0.115
Neg logit contributions
isk-0.083
eh-0.072
aks-0.069
smart-0.067
and-0.066
Token.
Token ID13
Feature activation+8.20e-01

Pos logit contributions
inated+0.042
ised+0.042
ement+0.041
work+0.039
 horizon+0.039
Neg logit contributions
isk-0.042
aks-0.036
smart-0.034
eh-0.033
aid-0.033
Token
Token ID198
Feature activation+5.58e-01

Pos logit contributions
ised+0.039
ement+0.039
inated+0.038
work+0.037
el+0.037
Neg logit contributions
isk-0.031
aks-0.025
arl-0.024
ives-0.024
aid-0.023
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
ement+0.025
ised+0.025
inated+0.025
work+0.024
el+0.023
Neg logit contributions
isk-0.023
aks-0.020
arl-0.018
ives-0.018
eh-0.018
TokenGrand
Token ID23581
Feature activation+1.62e+00

Pos logit contributions
ement+0.010
ised+0.010
inated+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.007
aks-0.006
arl-0.005
ives-0.005
eh-0.005
Token was
Token ID373
Feature activation+8.75e-01

Pos logit contributions
ement+0.069
ised+0.067
 avail+0.066
 horizon+0.065
inated+0.065
Neg logit contributions
isk-0.045
aks-0.044
eh-0.039
 comes-0.039
and-0.039
Token so
Token ID523
Feature activation+9.34e-01

Pos logit contributions
ement+0.045
ised+0.044
inated+0.043
el+0.042
 horizon+0.042
Neg logit contributions
isk-0.029
aks-0.026
arl-0.023
eh-0.023
aid-0.022
Token excited
Token ID6568
Feature activation+1.27e+00

Pos logit contributions
ement+0.055
ised+0.054
inated+0.053
work+0.052
el+0.052
Neg logit contributions
isk-0.022
aks-0.020
arl-0.019
aid-0.017
eh-0.016
Token to
Token ID284
Feature activation+5.67e-01

Pos logit contributions
ement+0.065
ised+0.064
inated+0.063
el+0.060
itz+0.060
Neg logit contributions
isk-0.040
aks-0.034
ives-0.032
ens-0.032
eh-0.032
Token see
Token ID766
Feature activation+9.08e-01

Pos logit contributions
ement+0.037
ised+0.037
inated+0.036
work+0.035
el+0.035
Neg logit contributions
isk-0.012
aks-0.009
arl-0.008
aid-0.007
ives-0.007
Token the
Token ID262
Feature activation+2.15e+00

Pos logit contributions
ised+0.041
inated+0.040
ement+0.040
 horizon+0.037
work+0.036
Neg logit contributions
isk-0.036
aks-0.033
smart-0.030
ives-0.030
aid-0.030
Token dolphin
Token ID44136
Feature activation+1.67e+00

Pos logit contributions
ths+0.113
ement+0.109
used+0.108
inated+0.108
ised+0.107
Neg logit contributions
 germ-0.059
aks-0.058
 welcome-0.057
 shady-0.056
 wake-0.054
Token swimming
Token ID14899
Feature activation+8.05e-01

Pos logit contributions
ised+0.078
ement+0.073
inated+0.071
ths+0.071
itz+0.070
Neg logit contributions
isk-0.065
aks-0.056
ives-0.053
 comes-0.053
smart-0.053
Token around
Token ID1088
Feature activation+1.02e+00

Pos logit contributions
ement+0.031
inated+0.031
ised+0.031
el+0.029
ths+0.029
Neg logit contributions
isk-0.037
aks-0.034
eh-0.031
arl-0.031
ives-0.030
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
ised+0.044
inated+0.043
ement+0.042
itz+0.040
ths+0.040
Neg logit contributions
isk-0.043
aks-0.038
smart-0.034
eh-0.034
arl-0.034
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
ement+0.079
el+0.076
ised+0.076
ths+0.075
ble+0.074
Neg logit contributions
isk-0.040
aks-0.032
ives-0.031
 fetch-0.029
ens-0.028
Token of
Token ID286
Feature activation+7.13e-01

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token presents
Token ID10969
Feature activation+1.47e+00

Pos logit contributions
inated+0.032
ised+0.031
ement+0.030
work+0.030
el+0.029
Neg logit contributions
isk-0.028
aks-0.026
ives-0.024
aid-0.024
eh-0.023
Token!
Token ID0
Feature activation-3.52e-02

Pos logit contributions
ised+0.062
inated+0.061
ement+0.057
el+0.054
itz+0.054
Neg logit contributions
isk-0.063
aks-0.057
ives-0.054
coat-0.053
arl-0.051
Token His
Token ID2399
Feature activation+8.93e-01

Pos logit contributions
isk+0.002
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
work-0.001
el-0.001
Token eyes
Token ID2951
Feature activation+6.30e-01

Pos logit contributions
ised+0.036
inated+0.035
ement+0.034
izing+0.033
work+0.032
Neg logit contributions
isk-0.041
aks-0.035
 germ-0.033
arl-0.033
 backup-0.033
Token lit
Token ID6578
Feature activation+2.27e+00

Pos logit contributions
ised+0.023
inated+0.022
ement+0.022
work+0.021
el+0.021
Neg logit contributions
isk-0.031
aks-0.027
ives-0.026
arl-0.025
aid-0.025
Token up
Token ID510
Feature activation+1.61e+00

Pos logit contributions
ement+0.073
ised+0.073
used+0.071
inated+0.071
itz+0.068
Neg logit contributions
isk-0.112
aks-0.097
coat-0.090
ers-0.088
 shady-0.088
Token as
Token ID355
Feature activation+5.09e-01

Pos logit contributions
inated+0.084
ement+0.083
ised+0.080
itz+0.078
used+0.077
Neg logit contributions
isk-0.048
aks-0.042
eh-0.038
 shady-0.037
coat-0.036
Token he
Token ID339
Feature activation+5.55e-01

Pos logit contributions
inated+0.025
ised+0.025
ement+0.024
el+0.023
work+0.023
Neg logit contributions
isk-0.019
aks-0.016
arl-0.015
aid-0.015
eh-0.014
Token saw
Token ID2497
Feature activation+4.48e-01

Pos logit contributions
inated+0.027
ement+0.027
ised+0.027
work+0.026
 sunset+0.025
Neg logit contributions
isk-0.021
aks-0.017
eh-0.016
ives-0.016
aid-0.015
Token all
Token ID477
Feature activation+2.66e-01

Pos logit contributions
inated+0.017
ised+0.017
ement+0.016
work+0.015
el+0.015
Neg logit contributions
isk-0.022
aks-0.019
ives-0.018
arl-0.018
aid-0.018
Token language
Token ID3303
Feature activation+1.51e+00

Pos logit contributions
ised+0.013
inated+0.013
ement+0.013
el+0.012
work+0.012
Neg logit contributions
isk-0.014
aks-0.013
ives-0.012
arl-0.012
eh-0.012
Token they
Token ID484
Feature activation+1.43e+00

Pos logit contributions
inated+0.067
ised+0.067
ement+0.065
el+0.062
itz+0.061
Neg logit contributions
isk-0.059
aks-0.052
ives-0.049
smart-0.049
aid-0.048
Token were
Token ID547
Feature activation+1.19e+00

Pos logit contributions
inated+0.070
ement+0.070
ised+0.070
el+0.065
work+0.064
Neg logit contributions
isk-0.054
aks-0.043
ives-0.042
ens-0.040
smart-0.039
Token speaking
Token ID5486
Feature activation+1.25e+00

Pos logit contributions
ised+0.052
inated+0.050
ement+0.050
el+0.047
ths+0.046
Neg logit contributions
isk-0.051
aks-0.044
ives-0.042
arl-0.040
smart-0.040
Token and
Token ID290
Feature activation+7.07e-01

Pos logit contributions
ised+0.053
inated+0.053
ement+0.052
el+0.050
itz+0.048
Neg logit contributions
isk-0.053
aks-0.047
ives-0.044
smart-0.043
arl-0.042
Token they
Token ID484
Feature activation+2.23e+00

Pos logit contributions
ement+0.030
ised+0.029
inated+0.029
el+0.028
work+0.027
Neg logit contributions
isk-0.031
aks-0.026
arl-0.025
 germ-0.024
aid-0.024
Token told
Token ID1297
Feature activation+6.90e-01

Pos logit contributions
inated+0.103
ement+0.101
ised+0.098
izing+0.095
 horizon+0.094
Neg logit contributions
isk-0.088
 comes-0.069
aks-0.069
aid-0.067
smart-0.066
Token him
Token ID683
Feature activation-2.45e-01

Pos logit contributions
inated+0.030
ised+0.030
ement+0.030
work+0.028
el+0.028
Neg logit contributions
isk-0.029
aks-0.024
arl-0.023
ives-0.023
 germ-0.022
Token it
Token ID340
Feature activation+1.38e+00

Pos logit contributions
isk+0.011
aks+0.010
arl+0.010
eh+0.009
ives+0.009
Neg logit contributions
ised-0.010
ement-0.010
inated-0.010
work-0.009
el-0.009
Token was
Token ID373
Feature activation+1.09e+00

Pos logit contributions
ised+0.062
inated+0.061
ement+0.060
 horizon+0.057
el+0.057
Neg logit contributions
isk-0.058
aks-0.047
ives-0.045
smart-0.044
 germ-0.044
Token Spanish
Token ID7897
Feature activation+7.70e-01

Pos logit contributions
ised+0.047
ement+0.047
inated+0.046
el+0.043
work+0.042
Neg logit contributions
isk-0.048
aks-0.041
eh-0.038
 germ-0.038
arl-0.037
Token was
Token ID373
Feature activation+4.17e-01

Pos logit contributions
inated+0.039
ised+0.039
ement+0.038
el+0.036
work+0.036
Neg logit contributions
isk-0.028
aks-0.025
ives-0.023
aid-0.022
arl-0.022
Token about
Token ID546
Feature activation+9.25e-01

Pos logit contributions
ement+0.017
ised+0.017
inated+0.017
el+0.016
work+0.016
Neg logit contributions
isk-0.019
aks-0.016
eh-0.015
arl-0.015
aid-0.015
Token a
Token ID257
Feature activation+5.26e-01

Pos logit contributions
ement+0.044
ised+0.043
inated+0.042
el+0.040
 horizon+0.039
Neg logit contributions
isk-0.037
aks-0.030
 germ-0.030
eh-0.029
ives-0.028
Token little
Token ID1310
Feature activation+2.01e-01

Pos logit contributions
ised+0.023
ement+0.023
inated+0.023
el+0.022
work+0.022
Neg logit contributions
isk-0.022
aks-0.018
eh-0.017
 germ-0.017
arl-0.017
Token girl
Token ID2576
Feature activation-4.37e-01

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
el+0.008
work+0.008
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
eh-0.007
Token who
Token ID508
Feature activation+2.53e-01

Pos logit contributions
isk+0.020
arl+0.017
aks+0.017
aid+0.016
ives+0.016
Neg logit contributions
ement-0.017
ised-0.017
inated-0.017
work-0.017
el-0.017
Token never
Token ID1239
Feature activation-8.39e-01

Pos logit contributions
ised+0.011
ement+0.011
inated+0.011
work+0.010
el+0.010
Neg logit contributions
isk-0.011
aks-0.009
ives-0.009
arl-0.009
eh-0.009
Token gave
Token ID2921
Feature activation+2.48e-01

Pos logit contributions
isk+0.037
arl+0.032
aks+0.032
 backup+0.029
ives+0.029
Neg logit contributions
work-0.035
ised-0.034
inated-0.034
 sunset-0.034
ement-0.033
Token up
Token ID510
Feature activation+2.63e-01

Pos logit contributions
ised+0.008
ement+0.008
inated+0.008
work+0.008
el+0.008
Neg logit contributions
isk-0.013
aks-0.011
arl-0.011
ives-0.011
eh-0.011
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
ised+0.010
inated+0.009
ement+0.009
work+0.009
el+0.009
Neg logit contributions
isk-0.013
aks-0.011
arl-0.011
ives-0.011
eh-0.011
Token even
Token ID772
Feature activation-5.30e-01

Pos logit contributions
isk+0.016
aks+0.014
arl+0.014
ives+0.013
eh+0.013
Neg logit contributions
ement-0.012
ised-0.012
inated-0.012
el-0.011
 sunset-0.011
Token eat
Token ID4483
Feature activation+8.07e-02

Pos logit contributions
isk+0.038
aks+0.035
ives+0.034
arl+0.033
and+0.032
Neg logit contributions
 sunset-0.059
el-0.059
ised-0.059
inated-0.058
 curiously-0.056
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.004
aks-0.004
arl-0.003
ives-0.003
eh-0.003
Token Max
Token ID5436
Feature activation+1.25e-01

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
inated-0.005
ised-0.005
ement-0.005
el-0.005
work-0.005
Token was
Token ID373
Feature activation-4.69e-02

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.005
el+0.005
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token so
Token ID523
Feature activation-6.85e-01

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
aid+0.002
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
el-0.002
work-0.002
Token happy
Token ID3772
Feature activation+3.65e-01

Pos logit contributions
isk+0.032
 backup+0.026
ives+0.026
aks+0.025
arl+0.025
Neg logit contributions
ised-0.028
inated-0.028
ement-0.027
 sunset-0.027
el-0.026
Token and
Token ID290
Feature activation-3.05e-01

Pos logit contributions
ement+0.016
ised+0.016
inated+0.016
el+0.015
work+0.015
Neg logit contributions
isk-0.016
aks-0.013
ives-0.013
eh-0.013
arl-0.012
Token gob
Token ID48484
Feature activation+4.73e-01

Pos logit contributions
isk+0.012
aks+0.010
arl+0.010
ives+0.010
aid+0.010
Neg logit contributions
inated-0.014
ised-0.014
ement-0.014
 sunset-0.013
el-0.013
Tokenbled
Token ID9342
Feature activation-2.49e-01

Pos logit contributions
ised+0.010
inated+0.010
ement+0.010
work+0.010
el+0.009
Neg logit contributions
isk-0.030
aks-0.027
arl-0.027
eh-0.026
ives-0.026
Token it
Token ID340
Feature activation+3.40e-01

Pos logit contributions
isk+0.015
aks+0.013
arl+0.013
ives+0.013
eh+0.013
Neg logit contributions
ised-0.007
ement-0.007
inated-0.006
el-0.006
 sunset-0.006
Token up
Token ID510
Feature activation+1.56e+00

Pos logit contributions
ement+0.008
ised+0.008
inated+0.008
work+0.007
el+0.007
Neg logit contributions
isk-0.021
aks-0.019
ives-0.018
arl-0.018
eh-0.018
Token They
Token ID1119
Feature activation+4.84e-01

Pos logit contributions
ised+0.001
inated+0.001
ement+0.001
el+0.001
work+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
eh-0.001
ives-0.001
Token hear
Token ID3285
Feature activation+2.96e-01

Pos logit contributions
ised+0.026
ement+0.026
inated+0.026
work+0.025
 sunset+0.024
Neg logit contributions
isk-0.016
aks-0.013
ives-0.012
arl-0.012
aid-0.012
Token their
Token ID511
Feature activation+1.60e+00

Pos logit contributions
ised+0.012
inated+0.012
ement+0.012
work+0.011
el+0.011
Neg logit contributions
isk-0.014
aks-0.012
arl-0.011
ives-0.011
aid-0.011
Token mom
Token ID1995
Feature activation-1.49e+00

Pos logit contributions
ised+0.081
ement+0.080
inated+0.077
izing+0.074
ized+0.073
Neg logit contributions
isk-0.053
arl-0.051
 germ-0.050
 backup-0.048
coat-0.047
Token getting
Token ID1972
Feature activation+4.25e-01

Pos logit contributions
isk+0.073
aid+0.066
eh+0.065
arl+0.063
aks+0.063
Neg logit contributions
el-0.057
 curiously-0.051
ement-0.051
inated-0.049
 int-0.048
Token closer
Token ID5699
Feature activation+1.99e-01

Pos logit contributions
inated+0.018
ement+0.018
ised+0.018
work+0.017
 horizon+0.016
Neg logit contributions
isk-0.018
aks-0.016
aid-0.015
arl-0.015
eh-0.015
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.007
el+0.006
Neg logit contributions
isk-0.010
aks-0.009
arl-0.008
ives-0.008
eh-0.008
Token They
Token ID1119
Feature activation-5.09e-01

Pos logit contributions
isk+0.014
aks+0.012
eh+0.012
aid+0.011
arl+0.011
Neg logit contributions
inated-0.012
ised-0.012
ement-0.011
el-0.011
 sunset-0.011
Token are
Token ID389
Feature activation-3.32e-01

Pos logit contributions
isk+0.018
aks+0.015
eh+0.014
aid+0.014
arl+0.013
Neg logit contributions
inated-0.025
ement-0.025
ised-0.025
el-0.025
 curiously-0.024
Token scared
Token ID12008
Feature activation-1.71e-01

Pos logit contributions
isk+0.016
ives+0.013
aks+0.013
arl+0.013
aid+0.013
Neg logit contributions
ised-0.013
inated-0.013
ement-0.013
el-0.012
work-0.012
Token.
Token ID13
Feature activation-3.73e-02

Pos logit contributions
isk+0.010
aks+0.008
eh+0.008
ives+0.008
aid+0.008
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
el-0.005
work-0.005
Token says
Token ID1139
Feature activation-1.17e+00

Pos logit contributions
isk+0.014
aks+0.012
arl+0.012
eh+0.012
aid+0.011
Neg logit contributions
ement-0.013
ised-0.012
inated-0.012
el-0.012
work-0.012
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
isk+0.070
aks+0.059
eh+0.058
ives+0.055
 germ+0.055
Neg logit contributions
ement-0.038
ised-0.034
work-0.034
inated-0.034
el-0.033
Token "
Token ID366
Feature activation-7.33e-02

Pos logit contributions
isk+0.052
aks+0.048
ives+0.044
eh+0.044
 germ+0.042
Neg logit contributions
inated-0.040
ement-0.039
work-0.038
 sunset-0.038
 int-0.037
TokenI
Token ID40
Feature activation-5.95e-01

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
ised-0.003
ement-0.003
inated-0.003
work-0.003
el-0.003
Token love
Token ID1842
Feature activation+1.08e-01

Pos logit contributions
isk+0.027
aks+0.024
eh+0.024
arl+0.023
ives+0.022
Neg logit contributions
ised-0.023
work-0.022
ement-0.022
inated-0.022
 sunset-0.022
Token you
Token ID345
Feature activation+3.85e-01

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token too
Token ID1165
Feature activation+5.74e-01

Pos logit contributions
inated+0.014
ised+0.014
ement+0.014
work+0.013
el+0.013
Neg logit contributions
isk-0.019
aks-0.017
arl-0.016
ives-0.016
aid-0.016
Token.
Token ID13
Feature activation-9.94e-01

Pos logit contributions
ised+0.027
inated+0.027
ement+0.026
work+0.025
el+0.025
Neg logit contributions
isk-0.022
aks-0.019
arl-0.019
ives-0.018
aid-0.017
Token Thank
Token ID6952
Feature activation+2.46e-02

Pos logit contributions
isk+0.050
aks+0.044
eh+0.043
coat+0.040
aid+0.040
Neg logit contributions
inated-0.035
ement-0.034
work-0.033
el-0.033
ised-0.033
Token you
Token ID345
Feature activation-6.44e-01

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
el+0.001
work+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token for
Token ID329
Feature activation-8.02e-01

Pos logit contributions
isk+0.030
aks+0.026
eh+0.025
aid+0.025
ives+0.024
Neg logit contributions
ement-0.025
work-0.024
ised-0.024
inated-0.024
 sunset-0.024
Tokenelly
Token ID6148
Feature activation+7.33e-01

Pos logit contributions
isk+0.010
aks+0.009
eh+0.008
 germ+0.008
 backup+0.008
Neg logit contributions
ised-0.012
ement-0.012
inated-0.012
el-0.011
work-0.011
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
ised+0.028
inated+0.027
ement+0.027
work+0.025
el+0.025
Neg logit contributions
isk-0.035
aks-0.031
ives-0.029
arl-0.029
coat-0.028
Token It
Token ID632
Feature activation+7.29e-01

Pos logit contributions
ement+0.048
ised+0.047
inated+0.045
ths+0.045
work+0.045
Neg logit contributions
isk-0.043
aks-0.036
arl-0.036
ives-0.035
eh-0.034
Token was
Token ID373
Feature activation+5.63e-01

Pos logit contributions
ement+0.032
inated+0.031
ised+0.031
work+0.030
el+0.029
Neg logit contributions
isk-0.031
aks-0.026
eh-0.025
ives-0.024
coat-0.024
Token an
Token ID281
Feature activation+1.82e+00

Pos logit contributions
ised+0.023
ement+0.023
inated+0.023
work+0.021
el+0.021
Neg logit contributions
isk-0.026
aks-0.022
eh-0.021
arl-0.021
 germ-0.020
Token onion
Token ID21670
Feature activation+7.74e-01

Pos logit contributions
ised+0.080
inated+0.079
ement+0.078
work+0.077
ths+0.074
Neg logit contributions
isk-0.071
aks-0.062
eh-0.061
 germ-0.059
 robber-0.057
Token!
Token ID0
Feature activation+1.16e+00

Pos logit contributions
ised+0.025
ement+0.025
inated+0.025
work+0.022
 sunset+0.022
Neg logit contributions
isk-0.041
aks-0.036
eh-0.035
ives-0.034
coat-0.034
Token Grace
Token ID16156
Feature activation+1.19e+00

Pos logit contributions
ement+0.048
ised+0.046
ths+0.045
work+0.045
inated+0.045
Neg logit contributions
isk-0.049
aks-0.043
arl-0.042
ives-0.041
eh-0.040
Token was
Token ID373
Feature activation+5.28e-01

Pos logit contributions
ement+0.054
ised+0.052
inated+0.051
 sunset+0.051
work+0.051
Neg logit contributions
isk-0.047
aks-0.041
eh-0.039
ives-0.038
coat-0.037
Token so
Token ID523
Feature activation+3.71e-01

Pos logit contributions
ised+0.020
ement+0.020
inated+0.020
work+0.019
el+0.019
Neg logit contributions
isk-0.025
aks-0.022
eh-0.021
arl-0.020
ives-0.020
Token surprised
Token ID6655
Feature activation+1.37e+00

Pos logit contributions
ement+0.016
ised+0.016
inated+0.016
work+0.015
el+0.015
Neg logit contributions
isk-0.016
aks-0.014
arl-0.013
ives-0.013
eh-0.013
Token organize
Token ID16481
Feature activation+6.38e-03

Pos logit contributions
isk+0.010
aks+0.009
eh+0.008
ives+0.008
arl+0.008
Neg logit contributions
inated-0.010
ised-0.010
ement-0.010
work-0.010
el-0.010
Token my
Token ID616
Feature activation-3.28e-01

Pos logit contributions
ised+0.000
ement+0.000
inated+0.000
el+0.000
work+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token toys
Token ID14958
Feature activation-5.85e-01

Pos logit contributions
isk+0.013
aks+0.011
ives+0.010
arl+0.010
aid+0.010
Neg logit contributions
ised-0.015
ement-0.015
inated-0.015
work-0.014
 sunset-0.014
Token now
Token ID783
Feature activation+1.41e-01

Pos logit contributions
isk+0.030
aks+0.026
eh+0.025
arl+0.025
aid+0.025
Neg logit contributions
ised-0.020
ement-0.020
inated-0.019
work-0.019
el-0.018
Token."
Token ID526
Feature activation+5.92e-02

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.005
el+0.005
Neg logit contributions
isk-0.007
aks-0.006
arl-0.006
ives-0.006
eh-0.006
Token She
Token ID1375
Feature activation-6.19e-01

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
el+0.002
work+0.002
Neg logit contributions
isk-0.003
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token went
Token ID1816
Feature activation-1.99e+00

Pos logit contributions
isk+0.026
aks+0.023
arl+0.023
ives+0.021
eh+0.021
Neg logit contributions
ised-0.027
ement-0.026
inated-0.026
el-0.025
work-0.024
Token back
Token ID736
Feature activation-1.58e+00

Pos logit contributions
isk+0.113
cod+0.101
eh+0.096
aks+0.096
 germ+0.092
Neg logit contributions
 curiously-0.051
 top-0.050
 up-0.050
 closely-0.049
 sunset-0.048
Token to
Token ID284
Feature activation-4.99e-01

Pos logit contributions
isk+0.081
aid+0.072
eh+0.072
 liar+0.072
arl+0.071
Neg logit contributions
ised-0.048
 sunset-0.047
el-0.047
work-0.046
inated-0.046
Token her
Token ID607
Feature activation-3.00e-01

Pos logit contributions
isk+0.024
aks+0.023
ives+0.022
aid+0.022
eh+0.021
Neg logit contributions
inated-0.017
 sunset-0.017
el-0.017
ised-0.017
ement-0.017
Token room
Token ID2119
Feature activation-1.98e+00

Pos logit contributions
isk+0.014
aks+0.012
ives+0.011
arl+0.011
aid+0.011
Neg logit contributions
ised-0.012
inated-0.012
ement-0.012
work-0.011
 sunset-0.011
Token for
Token ID329
Feature activation-1.31e+00

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.004
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token many
Token ID867
Feature activation-4.36e-01

Pos logit contributions
isk+0.077
arl+0.074
ives+0.071
coat+0.070
eh+0.068
Neg logit contributions
ised-0.036
 top-0.035
 sunset-0.035
 curiously-0.033
el-0.032
Token days
Token ID1528
Feature activation-3.18e-01

Pos logit contributions
isk+0.023
arl+0.020
ives+0.019
aks+0.019
aid+0.019
Neg logit contributions
ised-0.016
ement-0.016
el-0.015
inated-0.015
 sunset-0.014
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
isk+0.015
arl+0.013
aks+0.013
aid+0.012
 germ+0.012
Neg logit contributions
ised-0.013
ement-0.012
inated-0.012
el-0.012
work-0.012
Token nights
Token ID12513
Feature activation-1.49e+00

Pos logit contributions
arl+0.088
isk+0.085
ives+0.080
aid+0.076
aks+0.076
Neg logit contributions
ised-0.043
 curiously-0.039
 suddenly-0.039
ths-0.038
 sunset-0.038
Token and
Token ID290
Feature activation-9.57e-01

Pos logit contributions
isk+0.084
arl+0.078
aks+0.073
 germ+0.072
aid+0.071
Neg logit contributions
ised-0.044
ement-0.043
work-0.042
inated-0.042
 curiously-0.041
Token eventually
Token ID4191
Feature activation-4.69e-01

Pos logit contributions
isk+0.055
arl+0.054
ives+0.050
aks+0.050
aid+0.049
Neg logit contributions
ised-0.027
 sunset-0.025
inated-0.024
 curiously-0.024
ement-0.024
Token he
Token ID339
Feature activation-4.44e-01

Pos logit contributions
isk+0.021
arl+0.019
aks+0.018
aid+0.017
ives+0.017
Neg logit contributions
ised-0.019
inated-0.019
ement-0.019
 sunset-0.019
el-0.018
Token found
Token ID1043
Feature activation-3.76e-01

Pos logit contributions
isk+0.018
arl+0.016
aks+0.015
aid+0.015
ives+0.014
Neg logit contributions
ised-0.020
inated-0.019
ement-0.019
work-0.019
el-0.019
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
isk+0.021
arl+0.019
aks+0.018
ives+0.018
coat+0.018
Neg logit contributions
ement-0.012
ised-0.012
inated-0.011
el-0.011
 sunset-0.011
Token silver
Token ID8465
Feature activation+1.10e+00

Pos logit contributions
ised+0.071
ement+0.067
ths+0.067
inated+0.066
ized+0.061
Neg logit contributions
isk-0.056
 germ-0.048
 robber-0.048
 backup-0.048
aks-0.048
Token€
Token ID26391
Feature activation-6.31e-01

Pos logit contributions
isk+0.018
aks+0.015
arl+0.015
eh+0.014
ives+0.014
Neg logit contributions
ised-0.013
ement-0.013
inated-0.013
work-0.012
el-0.012
Token Daddy
Token ID26926
Feature activation+7.47e-01

Pos logit contributions
isk+0.032
aks+0.027
arl+0.027
aid+0.027
eh+0.026
Neg logit contributions
ised-0.023
inated-0.022
ement-0.022
el-0.021
 sunset-0.021
Token handed
Token ID10158
Feature activation+4.11e-01

Pos logit contributions
ised+0.032
ement+0.032
inated+0.032
el+0.030
 sunset+0.030
Neg logit contributions
isk-0.032
aks-0.027
ives-0.026
aid-0.026
coat-0.025
Token me
Token ID502
Feature activation+2.09e-02

Pos logit contributions
ised+0.016
inated+0.016
ement+0.016
work+0.015
 sunset+0.014
Neg logit contributions
isk-0.019
aks-0.017
arl-0.016
ives-0.016
aid-0.016
Token a
Token ID257
Feature activation-9.25e-02

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token box
Token ID3091
Feature activation+3.36e-02

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
ised-0.004
ement-0.004
inated-0.004
work-0.004
el-0.004
Token with
Token ID351
Feature activation+2.66e-01

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token a
Token ID257
Feature activation+8.25e-01

Pos logit contributions
ised+0.010
ement+0.010
inated+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.013
aks-0.011
arl-0.011
ives-0.011
eh-0.010
Token string
Token ID4731
Feature activation+1.38e+00

Pos logit contributions
ised+0.035
ement+0.035
inated+0.034
el+0.033
work+0.032
Neg logit contributions
isk-0.035
aks-0.031
arl-0.029
eh-0.029
aid-0.029
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
ised+0.059
ement+0.058
inated+0.057
ths+0.055
 sunset+0.053
Neg logit contributions
isk-0.056
aks-0.052
arl-0.049
ives-0.049
coat-0.049
Token I
Token ID314
Feature activation+8.13e-02

Pos logit contributions
isk+0.010
aks+0.009
eh+0.008
arl+0.008
aid+0.008
Neg logit contributions
ised-0.009
ement-0.009
inated-0.009
el-0.008
work-0.008
Token pretty
Token ID2495
Feature activation+3.06e-01

Pos logit contributions
isk+0.026
arl+0.022
aks+0.021
ives+0.021
coat+0.021
Neg logit contributions
ised-0.021
ement-0.020
inated-0.020
el-0.020
 sunset-0.019
Token things
Token ID1243
Feature activation+1.87e+00

Pos logit contributions
ised+0.012
ement+0.012
inated+0.012
work+0.012
el+0.011
Neg logit contributions
isk-0.014
aks-0.012
arl-0.011
ives-0.011
eh-0.011
Token.
Token ID13
Feature activation+1.62e-01

Pos logit contributions
ised+0.095
inated+0.095
ement+0.093
ths+0.090
inct+0.090
Neg logit contributions
aks-0.057
isk-0.055
ives-0.055
aid-0.053
and-0.051
Token One
Token ID1881
Feature activation-5.50e-01

Pos logit contributions
ised+0.006
ement+0.006
inated+0.006
work+0.005
el+0.005
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
eh-0.007
Token day
Token ID1110
Feature activation-3.88e-01

Pos logit contributions
isk+0.030
ives+0.026
aks+0.026
arl+0.026
 backup+0.025
Neg logit contributions
ised-0.017
inated-0.017
 sunset-0.017
el-0.017
ement-0.017
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
isk+0.019
aks+0.016
arl+0.015
ives+0.015
aid+0.015
Neg logit contributions
ised-0.015
inated-0.015
ement-0.015
el-0.014
 sunset-0.014
Token she
Token ID673
Feature activation-6.34e-01

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.006
aid+0.005
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
el-0.005
 sunset-0.005
Token found
Token ID1043
Feature activation+2.10e-04

Pos logit contributions
isk+0.025
arl+0.022
aid+0.021
ives+0.021
aks+0.020
Neg logit contributions
ised-0.029
inated-0.029
el-0.029
 sunset-0.028
ement-0.027
Token a
Token ID257
Feature activation+1.88e-01

Pos logit contributions
ised+0.000
ement+0.000
inated+0.000
el+0.000
work+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token shiny
Token ID22441
Feature activation+1.59e+00

Pos logit contributions
ised+0.008
ement+0.008
inated+0.008
work+0.008
el+0.008
Neg logit contributions
isk-0.008
aks-0.007
arl-0.006
eh-0.006
ives-0.006
Token pin
Token ID6757
Feature activation+5.63e-01

Pos logit contributions
ised+0.068
inated+0.067
ement+0.067
ths+0.064
work+0.063
Neg logit contributions
isk-0.064
eh-0.056
coat-0.055
aks-0.055
 germ-0.055
Token find
Token ID1064
Feature activation-4.49e-01

Pos logit contributions
ement+0.028
ised+0.028
inated+0.027
work+0.026
el+0.026
Neg logit contributions
isk-0.020
aks-0.015
arl-0.015
aid-0.014
ives-0.014
Token a
Token ID257
Feature activation-1.40e+00

Pos logit contributions
isk+0.023
aks+0.020
arl+0.019
ives+0.019
coat+0.019
Neg logit contributions
ement-0.016
ised-0.016
el-0.015
inated-0.015
work-0.015
Token better
Token ID1365
Feature activation-1.32e+00

Pos logit contributions
isk+0.068
aks+0.060
ives+0.058
arl+0.055
aked+0.054
Neg logit contributions
 sunset-0.053
 comet-0.052
 top-0.050
 reflection-0.049
inated-0.048
Token friend
Token ID1545
Feature activation-9.16e-01

Pos logit contributions
isk+0.068
ives+0.057
aks+0.056
arl+0.056
enced+0.053
Neg logit contributions
 sunset-0.046
 comet-0.044
el-0.044
 reflection-0.043
inated-0.043
Token and
Token ID290
Feature activation-9.19e-01

Pos logit contributions
isk+0.050
arl+0.043
aks+0.043
eh+0.042
aid+0.042
Neg logit contributions
work-0.027
ement-0.027
inated-0.027
ised-0.026
el-0.025
Token a
Token ID257
Feature activation-1.69e+00

Pos logit contributions
isk+0.042
aks+0.037
arl+0.035
eh+0.034
ives+0.034
Neg logit contributions
inated-0.036
ised-0.036
work-0.035
 sunset-0.035
ement-0.034
Token better
Token ID1365
Feature activation-1.05e+00

Pos logit contributions
isk+0.079
aks+0.069
ives+0.066
arl+0.062
aked+0.061
Neg logit contributions
 sunset-0.068
 comet-0.066
work-0.063
 top-0.063
inated-0.063
Token bug
Token ID5434
Feature activation-1.94e+00

Pos logit contributions
isk+0.052
aks+0.044
ives+0.044
arl+0.042
aid+0.041
Neg logit contributions
 sunset-0.038
inated-0.037
ised-0.036
 comet-0.036
ement-0.036
Token."
Token ID526
Feature activation+3.68e-01

Pos logit contributions
isk+0.112
aks+0.100
arl+0.100
eh+0.094
aid+0.094
Neg logit contributions
work-0.050
inated-0.046
ement-0.044
ised-0.043
el-0.041
Token
Token ID198
Feature activation-4.61e-01

Pos logit contributions
ised+0.017
ement+0.016
inated+0.016
work+0.016
el+0.016
Neg logit contributions
isk-0.015
aks-0.013
arl-0.012
ives-0.012
aid-0.012
Token
Token ID198
Feature activation-3.91e-01

Pos logit contributions
isk+0.023
aks+0.020
ives+0.019
eh+0.019
 germ+0.019
Neg logit contributions
ised-0.016
inated-0.016
ement-0.016
el-0.015
 sunset-0.015
Token wrong
Token ID2642
Feature activation+6.52e-01

Pos logit contributions
ised+0.017
ement+0.017
inated+0.017
el+0.016
work+0.016
Neg logit contributions
isk-0.025
aks-0.022
eh-0.021
arl-0.021
coat-0.020
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
inated+0.025
ised+0.025
ement+0.025
work+0.023
el+0.023
Neg logit contributions
isk-0.031
aks-0.027
arl-0.025
ives-0.025
eh-0.025
Token They
Token ID1119
Feature activation-8.35e-02

Pos logit contributions
isk+0.015
aks+0.013
arl+0.012
aid+0.012
eh+0.012
Neg logit contributions
ised-0.015
inated-0.015
ement-0.015
el-0.014
work-0.014
Token saw
Token ID2497
Feature activation-5.04e-01

Pos logit contributions
isk+0.003
aks+0.003
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.004
inated-0.004
ement-0.004
work-0.004
el-0.004
Token Buddy
Token ID36896
Feature activation-2.60e-01

Pos logit contributions
isk+0.022
aks+0.020
arl+0.019
eh+0.019
coat+0.019
Neg logit contributions
ised-0.021
inated-0.021
ement-0.020
work-0.020
el-0.019
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
isk+0.012
aks+0.011
arl+0.010
aid+0.010
ives+0.010
Neg logit contributions
inated-0.010
ised-0.010
ement-0.010
work-0.009
el-0.009
Token Wh
Token ID854
Feature activation-1.44e+00

Pos logit contributions
arl+0.112
ives+0.105
aks+0.104
isk+0.101
wn+0.092
Neg logit contributions
ised-0.065
 suddenly-0.064
 couldn-0.059
 curiously-0.059
inated-0.058
Tokenisk
Token ID1984
Feature activation-2.67e+00

Pos logit contributions
aid+0.008
aster+0.007
 germ+0.007
aks+0.005
 backup+0.004
Neg logit contributions
el-0.106
ement-0.103
ised-0.102
inated-0.102
work-0.100
Tokeners
Token ID364
Feature activation-6.05e-01

Pos logit contributions
arl+0.087
 germ+0.081
 bisc+0.079
coat+0.078
isk+0.076
Neg logit contributions
work-0.125
ised-0.120
izing-0.115
ement-0.114
ized-0.114
Token fighting
Token ID4330
Feature activation-6.20e-01

Pos logit contributions
isk+0.030
aks+0.027
arl+0.026
eh+0.026
aid+0.026
Neg logit contributions
ement-0.021
ised-0.021
inated-0.021
work-0.020
el-0.020
Token over
Token ID625
Feature activation-8.63e-01

Pos logit contributions
isk+0.034
aks+0.030
arl+0.030
eh+0.030
ives+0.029
Neg logit contributions
ised-0.018
ement-0.018
inated-0.018
work-0.018
 sunset-0.017
Token There
Token ID1318
Feature activation-1.02e-01

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.005
el+0.005
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token might
Token ID1244
Feature activation-9.69e-01

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
eh+0.003
ives+0.003
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
el-0.005
work-0.005
Token be
Token ID307
Feature activation-1.66e+00

Pos logit contributions
isk+0.045
ives+0.041
aks+0.040
arl+0.040
aid+0.040
Neg logit contributions
el-0.037
inated-0.035
 sunset-0.035
work-0.035
ised-0.034
Token monsters
Token ID10963
Feature activation-1.58e+00

Pos logit contributions
isk+0.085
arl+0.079
ives+0.077
aid+0.073
coat+0.071
Neg logit contributions
ised-0.055
 sunset-0.052
el-0.051
inated-0.050
work-0.050
Token or
Token ID393
Feature activation-1.53e+00

Pos logit contributions
isk+0.079
aid+0.067
eh+0.067
aks+0.065
arl+0.064
Neg logit contributions
inated-0.055
ised-0.055
work-0.054
ement-0.053
el-0.051
Token traps
Token ID20348
Feature activation-1.91e+00

Pos logit contributions
isk+0.066
aks+0.057
ives+0.055
arl+0.055
coat+0.053
Neg logit contributions
work-0.063
 sunset-0.063
inated-0.063
 comet-0.063
ised-0.062
Token or
Token ID393
Feature activation-1.26e+00

Pos logit contributions
isk+0.102
aid+0.087
eh+0.087
coat+0.084
 germ+0.084
Neg logit contributions
ised-0.057
inated-0.057
ement-0.054
work-0.054
el-0.054
Token th
Token ID294
Feature activation+2.98e-01

Pos logit contributions
isk+0.054
aks+0.047
arl+0.045
ives+0.045
coat+0.044
Neg logit contributions
inated-0.052
work-0.052
ised-0.052
 sunset-0.051
 comet-0.051
Tokenorns
Token ID19942
Feature activation-1.11e+00

Pos logit contributions
ement+0.018
ised+0.017
inated+0.017
work+0.017
el+0.017
Neg logit contributions
isk-0.008
aks-0.007
arl-0.006
eh-0.006
ives-0.006
Token!"
Token ID2474
Feature activation-8.82e-01

Pos logit contributions
isk+0.059
aid+0.049
aks+0.049
eh+0.049
 germ+0.048
Neg logit contributions
ised-0.035
inated-0.035
el-0.033
work-0.033
ement-0.032
Token
Token ID198
Feature activation-7.75e-01

Pos logit contributions
isk+0.049
aks+0.042
eh+0.041
ives+0.041
arl+0.040
Neg logit contributions
ised-0.029
ement-0.028
 sunset-0.026
inated-0.025
 curiously-0.025
Token gave
Token ID2921
Feature activation-7.66e-01

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.003
el+0.003
Neg logit contributions
isk-0.004
aks-0.004
arl-0.003
ives-0.003
eh-0.003
Token Lily
Token ID20037
Feature activation-6.02e-01

Pos logit contributions
isk+0.039
ives+0.035
arl+0.033
aks+0.033
eh+0.033
Neg logit contributions
el-0.028
ised-0.027
ement-0.027
 sunset-0.025
inated-0.025
Token the
Token ID262
Feature activation+2.41e-01

Pos logit contributions
isk+0.030
arl+0.027
aks+0.026
ives+0.025
eh+0.025
Neg logit contributions
ised-0.021
inated-0.021
el-0.021
ement-0.021
work-0.020
Token tw
Token ID665
Feature activation+9.45e-02

Pos logit contributions
ised+0.010
inated+0.010
ement+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.011
aks-0.009
arl-0.009
ives-0.009
aid-0.009
Tokenig
Token ID328
Feature activation-1.79e-01

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.003
el+0.003
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token.
Token ID13
Feature activation-1.89e+00

Pos logit contributions
isk+0.008
aks+0.007
arl+0.007
ives+0.006
eh+0.006
Neg logit contributions
inated-0.007
ised-0.007
ement-0.007
work-0.007
el-0.007
Token Lily
Token ID20037
Feature activation+4.74e-02

Pos logit contributions
isk+0.100
eh+0.095
aks+0.091
cod+0.086
ats+0.086
Neg logit contributions
inated-0.063
ement-0.061
 curiously-0.059
 int-0.059
ised-0.058
Token looked
Token ID3114
Feature activation+6.82e-01

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token at
Token ID379
Feature activation+1.01e+00

Pos logit contributions
inated+0.025
work+0.025
ised+0.024
ement+0.024
itz+0.023
Neg logit contributions
isk-0.032
aks-0.030
eh-0.028
arl-0.028
ives-0.027
Token it
Token ID340
Feature activation+2.20e+00

Pos logit contributions
inated+0.044
ised+0.043
work+0.043
ement+0.042
inct+0.040
Neg logit contributions
isk-0.043
ens-0.036
aks-0.035
arl-0.034
aid-0.033
Token and
Token ID290
Feature activation+3.48e-01

Pos logit contributions
ised+0.115
inated+0.110
itz+0.108
work+0.107
ement+0.106
Neg logit contributions
isk-0.062
aks-0.057
ows-0.053
 fetch-0.050
and-0.048
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
isk+0.039
arl+0.034
aks+0.034
eh+0.034
aid+0.033
Neg logit contributions
ement-0.018
el-0.017
ised-0.017
inated-0.017
work-0.017
Token He
Token ID679
Feature activation+3.66e-02

Pos logit contributions
isk+0.064
eh+0.058
ives+0.057
aks+0.056
coat+0.053
Neg logit contributions
inated-0.045
 curiously-0.044
ement-0.043
ised-0.042
el-0.041
Token gets
Token ID3011
Feature activation-3.54e-02

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
el+0.002
work+0.002
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token a
Token ID257
Feature activation-2.34e+00

Pos logit contributions
isk+0.002
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
el-0.001
work-0.001
Token band
Token ID4097
Feature activation-2.33e+00

Pos logit contributions
isk+0.116
aks+0.098
ives+0.097
enced+0.088
aked+0.087
Neg logit contributions
 comet-0.088
 sunset-0.086
 reflection-0.085
ble-0.085
 curiously-0.084
Token-
Token ID12
Feature activation-1.91e+00

Pos logit contributions
isk+0.115
 cafeteria+0.103
aks+0.095
 mats+0.094
arl+0.094
Neg logit contributions
el-0.079
work-0.079
ble-0.073
ised-0.072
inated-0.070
Tokenaid
Token ID1698
Feature activation-3.10e-01

Pos logit contributions
isk+0.046
aks+0.027
arl+0.026
 robber+0.024
 germ+0.023
Neg logit contributions
el-0.122
ised-0.120
inated-0.115
work-0.115
ized-0.112
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
isk+0.016
aks+0.014
eh+0.013
arl+0.013
ives+0.013
Neg logit contributions
ement-0.011
ised-0.011
inated-0.010
el-0.010
work-0.010
Token puts
Token ID7584
Feature activation-1.79e-01

Pos logit contributions
isk+0.009
aks+0.008
arl+0.008
ives+0.007
eh+0.007
Neg logit contributions
ised-0.011
inated-0.011
ement-0.011
el-0.010
 sunset-0.010
Token it
Token ID340
Feature activation-5.82e-01

Pos logit contributions
isk+0.010
aks+0.009
ives+0.009
arl+0.009
eh+0.009
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
el-0.005
work-0.005
Token on
Token ID319
Feature activation+1.65e-01

Pos logit contributions
isk+0.030
aks+0.025
ives+0.025
 germ+0.025
arl+0.024
Neg logit contributions
ised-0.020
inated-0.020
ement-0.020
el-0.019
work-0.019
TokenHelp
Token ID22087
Feature activation-6.05e-01

Pos logit contributions
ement+0.022
inated+0.021
ised+0.021
work+0.020
el+0.020
Neg logit contributions
isk-0.020
aks-0.017
aid-0.017
arl-0.016
eh-0.016
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
isk+0.033
aks+0.028
ives+0.028
arl+0.028
aid+0.028
Neg logit contributions
ised-0.020
el-0.019
ement-0.019
inated-0.018
work-0.017
Token help
Token ID1037
Feature activation-7.32e-01

Pos logit contributions
isk+0.067
aks+0.059
ives+0.058
coat+0.056
and+0.056
Neg logit contributions
 comet-0.062
work-0.062
 top-0.058
 sunset-0.058
 mummy-0.058
Token!"
Token ID2474
Feature activation-6.39e-02

Pos logit contributions
isk+0.041
arl+0.037
aks+0.037
ives+0.037
aid+0.037
Neg logit contributions
ised-0.021
el-0.021
ement-0.020
inated-0.020
work-0.019
Token Tom
Token ID4186
Feature activation-7.37e-01

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.003
inated-0.003
ement-0.003
el-0.003
work-0.003
Token and
Token ID290
Feature activation-2.52e+00

Pos logit contributions
isk+0.038
arl+0.033
aid+0.032
aks+0.032
eh+0.032
Neg logit contributions
ised-0.025
el-0.024
ement-0.024
inated-0.024
 curiously-0.024
Token Jen
Token ID13364
Feature activation-5.49e-01

Pos logit contributions
ives+0.116
aks+0.110
isk+0.108
and+0.101
ows+0.100
Neg logit contributions
 mum-0.089
 mother-0.086
 mummy-0.085
 sunset-0.085
 top-0.084
Token cry
Token ID3960
Feature activation+2.68e-01

Pos logit contributions
isk+0.022
aks+0.018
arl+0.018
eh+0.018
 germ+0.017
Neg logit contributions
ised-0.025
inated-0.025
ement-0.024
el-0.024
 curiously-0.023
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
ement+0.009
ised+0.009
inated+0.009
work+0.009
el+0.009
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.011
aid-0.011
Token They
Token ID1119
Feature activation-6.29e-01

Pos logit contributions
ement+0.022
ised+0.022
inated+0.021
work+0.021
el+0.021
Neg logit contributions
isk-0.015
aks-0.012
arl-0.012
ives-0.011
aid-0.011
Token run
Token ID1057
Feature activation-1.58e+00

Pos logit contributions
isk+0.022
aks+0.019
eh+0.018
arl+0.017
aid+0.017
Neg logit contributions
inated-0.031
ised-0.031
ement-0.030
el-0.030
 curiously-0.030