TOP ACTIVATIONS
MAX = 0.101

had two long arms . Fasc inated , she picked it
think of the storm ?". Jill thought for a moment and
What made the rainbow ?". Jill answered , " A storm
" What is that ?" Jessica said , scared .
jealous of your friends and don 't be mean to them
loved to play with his orange ball . He would throw
listen to your parents and don 't touch things that can
I take a nap now ?" asked Walter .
What is a language ?" Lily asks . "
buy too much food and don 't eat it all .
can happen even when you don 't expect it . <|endoftext|>
of her with her favorite toy . Lily smiled and said
Now she could look at the sun sets pieces every day
parents , even if we don 't want to . <|endoftext|>
." They stopped to watch the sunset . Suddenly , Nina
They both marvel ed at the comet . Its tail spark
important ?" The voice replied , " This bridge
, we can go see the sunset ." As
, while Tim was watching the sunset , he saw something
own way and that we don 't always need material possessions

MIN ACTIVATIONS
MIN = -0.093

. " In a box under the tree ."
. Ben takes a bite of the banana .
them . He lives in a house near the woods .
She turned on the f auc et and filled the sink
a phone . He calls a doctor . He also calls
A man in a blue uniform runs to the
loved to play with Wh isk ers , but one day
He drank from the f auc et . But he still
A man in a boat saw them and came
shells . They hear the se ag ull s and the
He said he was in a hurry . He had to
fun making mud pies , cars , and castles . But
talks to the dog in a soft voice . He gives
opard . Tim runs to a man who works in the
. He takes them to a small room . He locks
their hats and coats and sne aked out of the house
fell asleep . In the middle of the night , she
. In the middle of the night , Tim
. She puts them in a bucket . She fills the
a big chair , a loud drill and a bitter paste

INTERVAL 0.052 - 0.101
CONTAINS 0.380%

and pretty . Daisy was capt ivated and she couldn 't
Are you lost ?" the voice said again . This time
disappointed and sad that Lily didn 't listen to her and
Lily were playing with their toy cars in the garden .
" Give me the orange , Ben ! It is

INTERVAL 0.004 - 0.052
CONTAINS 41.337%

until she knew the poem by heart . On
them glow in the dark . Billy was so happy to
your glue . It is pretty ." They hug
He was getting ready to hurry back to
carrying that candle ?" John said , " This

INTERVAL -0.045 - 0.004
CONTAINS 57.613%

. The bunny wanted to play and bounce around , but
boy and not play tricks on Sue . They played together
Mom watches them and smiles . She is proud
. She had a favorite chair that she loved to sit
, let 's go !" When they arrived at

INTERVAL -0.093 - -0.045
CONTAINS 0.670%

. You can use your long neck as an umbrella !
asked his mom for a cheerful sandwich . As he took
. They saw many animals , like lions , ze br
They had dolls , cars , blocks and books .
shield ?" Tom and Mia say , " We
Token had
Token ID550
Feature activation+3.93e-03

Pos logit contributions
ement+0.055
ised+0.053
inated+0.053
work+0.051
el+0.050
Neg logit contributions
isk-0.043
eh-0.035
aks-0.035
coat-0.033
 germ-0.033
Token two
Token ID734
Feature activation+1.02e-02

Pos logit contributions
ised+0.006
ement+0.006
inated+0.006
work+0.006
el+0.006
Neg logit contributions
isk-0.007
aks-0.006
arl-0.006
ives-0.006
eh-0.006
Token long
Token ID890
Feature activation+9.60e-03

Pos logit contributions
ised+0.017
inated+0.016
ement+0.016
work+0.016
el+0.015
Neg logit contributions
isk-0.019
aks-0.017
ives-0.016
arl-0.016
eh-0.015
Token arms
Token ID5101
Feature activation+3.07e-02

Pos logit contributions
ised+0.016
ement+0.016
inated+0.016
work+0.015
el+0.015
Neg logit contributions
isk-0.017
aks-0.015
eh-0.014
ives-0.014
arl-0.014
Token.
Token ID13
Feature activation+6.44e-03

Pos logit contributions
ised+0.051
ement+0.050
inated+0.047
ths+0.047
 comet+0.045
Neg logit contributions
isk-0.054
aks-0.049
ives-0.048
arl-0.047
coat-0.047
Token Fasc
Token ID41467
Feature activation+1.01e-01

Pos logit contributions
ised+0.010
ement+0.010
inated+0.010
work+0.010
el+0.009
Neg logit contributions
isk-0.012
aks-0.011
arl-0.010
ives-0.010
eh-0.010
Tokeninated
Token ID3898
Feature activation-9.51e-03

Pos logit contributions
ement+0.103
 comet+0.082
 diamonds+0.079
work+0.070
 avail+0.069
Neg logit contributions
isk-0.250
 fetch-0.215
atters-0.208
ro-0.205
eh-0.202
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
isk+0.018
aks+0.015
arl+0.015
aid+0.015
ives+0.015
Neg logit contributions
ised-0.016
ement-0.016
inated-0.015
el-0.015
work-0.015
Token she
Token ID673
Feature activation+3.17e-02

Pos logit contributions
ement+0.043
ised+0.043
inated+0.042
work+0.042
 horizon+0.039
Neg logit contributions
isk-0.042
aks-0.034
ens-0.034
aid-0.034
 germ-0.034
Token picked
Token ID6497
Feature activation+3.17e-02

Pos logit contributions
ement+0.070
work+0.067
ised+0.065
inated+0.064
el+0.063
Neg logit contributions
isk-0.041
eh-0.034
aks-0.034
coat-0.031
 comes-0.030
Token it
Token ID340
Feature activation+3.66e-02

Pos logit contributions
ised+0.046
ement+0.045
inated+0.044
work+0.042
izing+0.041
Neg logit contributions
isk-0.061
aks-0.058
ens-0.055
ives-0.054
 comes-0.053
Token think
Token ID892
Feature activation+5.42e-03

Pos logit contributions
ised+0.036
ement+0.036
inated+0.035
ths+0.033
el+0.033
Neg logit contributions
isk-0.018
aks-0.015
ives-0.013
arl-0.013
eh-0.013
Token of
Token ID286
Feature activation+2.20e-02

Pos logit contributions
ised+0.008
inated+0.008
ement+0.008
work+0.008
el+0.008
Neg logit contributions
isk-0.010
aks-0.009
arl-0.009
ives-0.009
eh-0.009
Token the
Token ID262
Feature activation+1.76e-02

Pos logit contributions
ised+0.038
ement+0.038
inated+0.037
el+0.035
work+0.034
Neg logit contributions
isk-0.037
aks-0.033
arl-0.031
 germ-0.030
eh-0.029
Token storm
Token ID6388
Feature activation-7.90e-03

Pos logit contributions
ised+0.033
inated+0.032
ement+0.032
work+0.030
el+0.029
Neg logit contributions
isk-0.029
aks-0.025
 germ-0.024
arl-0.024
ives-0.024
Token?".
Token ID43634
Feature activation+3.47e-03

Pos logit contributions
isk+0.017
aks+0.015
arl+0.014
aid+0.014
ives+0.014
Neg logit contributions
ised-0.011
ement-0.011
inated-0.011
work-0.010
el-0.010
Token Jill
Token ID22772
Feature activation+9.83e-02

Pos logit contributions
ised+0.007
inated+0.007
ement+0.007
work+0.006
el+0.006
Neg logit contributions
isk-0.005
aks-0.005
arl-0.004
ives-0.004
eh-0.004
Token thought
Token ID1807
Feature activation+5.09e-03

Pos logit contributions
 avail+0.231
 horizon+0.223
 diamonds+0.221
ement+0.215
izing+0.215
Neg logit contributions
isk-0.097
 saw-0.079
 came-0.077
ens-0.075
 helped-0.072
Token for
Token ID329
Feature activation+1.60e-02

Pos logit contributions
ised+0.008
inated+0.008
ement+0.008
work+0.007
el+0.007
Neg logit contributions
isk-0.010
aks-0.009
arl-0.008
ives-0.008
eh-0.008
Token a
Token ID257
Feature activation-5.60e-03

Pos logit contributions
inated+0.029
ement+0.028
ised+0.028
el+0.026
work+0.026
Neg logit contributions
isk-0.028
aks-0.024
aid-0.022
 backup-0.022
eh-0.021
Token moment
Token ID2589
Feature activation+1.54e-02

Pos logit contributions
isk+0.010
aks+0.008
ives+0.008
arl+0.008
aid+0.008
Neg logit contributions
inated-0.010
ised-0.010
ement-0.010
work-0.009
 sunset-0.009
Token and
Token ID290
Feature activation+4.80e-02

Pos logit contributions
inated+0.029
ised+0.029
ement+0.028
el+0.027
work+0.027
Neg logit contributions
isk-0.025
aks-0.021
ives-0.020
arl-0.020
aid-0.019
TokenWhat
Token ID2061
Feature activation+1.43e-02

Pos logit contributions
inated+0.075
ement+0.075
ised+0.072
work+0.067
el+0.066
Neg logit contributions
isk-0.074
eh-0.061
aks-0.059
aid-0.058
arl-0.056
Token made
Token ID925
Feature activation+1.88e-02

Pos logit contributions
ised+0.030
inated+0.029
ement+0.029
el+0.028
 horizon+0.028
Neg logit contributions
isk-0.020
aks-0.017
ives-0.016
 germ-0.015
eh-0.015
Token the
Token ID262
Feature activation+5.16e-02

Pos logit contributions
inated+0.031
ement+0.031
ised+0.031
work+0.028
 horizon+0.027
Neg logit contributions
isk-0.033
aks-0.031
ives-0.028
arl-0.028
eh-0.028
Token rainbow
Token ID27223
Feature activation+2.58e-02

Pos logit contributions
ement+0.095
ised+0.095
inated+0.092
used+0.089
 curiously+0.089
Neg logit contributions
isk-0.070
 germ-0.063
 comes-0.061
aks-0.061
 welcome-0.061
Token?".
Token ID43634
Feature activation-1.90e-03

Pos logit contributions
ised+0.044
inated+0.043
ement+0.043
el+0.040
itz+0.040
Neg logit contributions
isk-0.044
aks-0.040
ives-0.037
eh-0.037
arl-0.037
Token Jill
Token ID22772
Feature activation+9.72e-02

Pos logit contributions
isk+0.003
aks+0.003
arl+0.002
ives+0.002
aid+0.002
Neg logit contributions
ised-0.004
ement-0.004
inated-0.004
work-0.004
el-0.003
Token answered
Token ID9373
Feature activation+3.50e-02

Pos logit contributions
 avail+0.228
 horizon+0.210
ement+0.206
 diamonds+0.203
 fullest+0.202
Neg logit contributions
isk-0.089
 came-0.087
 saw-0.082
 helped-0.080
 woke-0.080
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
inated+0.071
ised+0.071
 horizon+0.067
work+0.067
ement+0.066
Neg logit contributions
isk-0.046
smart-0.036
aks-0.035
aid-0.034
 backup-0.034
Token "
Token ID366
Feature activation+5.96e-02

Pos logit contributions
ised+0.048
ement+0.048
el+0.047
inated+0.047
work+0.047
Neg logit contributions
isk-0.026
arl-0.019
aks-0.019
eh-0.019
ives-0.018
TokenA
Token ID32
Feature activation+4.12e-02

Pos logit contributions
ement+0.117
 avail+0.112
ised+0.111
inated+0.104
 curiously+0.102
Neg logit contributions
isk-0.079
and-0.079
ives-0.074
aid-0.074
aks-0.074
Token storm
Token ID6388
Feature activation-2.97e-03

Pos logit contributions
ement+0.061
ised+0.061
inated+0.059
el+0.058
 curiously+0.056
Neg logit contributions
isk-0.070
aks-0.069
arl-0.068
eh-0.068
smart-0.065
Token"
Token ID1
Feature activation+3.51e-02

Pos logit contributions
ement+0.029
inated+0.028
ised+0.028
work+0.028
el+0.028
Neg logit contributions
isk-0.023
aks-0.021
ives-0.019
arl-0.019
eh-0.018
TokenWhat
Token ID2061
Feature activation-2.50e-03

Pos logit contributions
ement+0.068
ised+0.064
inated+0.063
el+0.060
 int+0.060
Neg logit contributions
isk-0.057
aks-0.049
aid-0.047
ives-0.045
Rex-0.045
Token is
Token ID318
Feature activation+5.56e-02

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
eh+0.003
aid+0.003
Neg logit contributions
ised-0.005
inated-0.005
ement-0.005
work-0.005
el-0.005
Token that
Token ID326
Feature activation+1.81e-02

Pos logit contributions
ement+0.121
ised+0.115
 horizon+0.106
efully+0.106
itz+0.104
Neg logit contributions
 germ-0.062
 comes-0.061
eh-0.060
 backup-0.060
isk-0.059
Token?"
Token ID1701
Feature activation+5.01e-02

Pos logit contributions
ised+0.030
ement+0.030
inated+0.030
work+0.028
el+0.028
Neg logit contributions
isk-0.032
aks-0.028
arl-0.027
eh-0.027
 germ-0.027
Token Jessica
Token ID17352
Feature activation+9.45e-02

Pos logit contributions
inated+0.089
ement+0.088
work+0.086
itz+0.086
ble+0.086
Neg logit contributions
aks-0.069
isk-0.068
smart-0.064
 comes-0.063
 germ-0.063
Token said
Token ID531
Feature activation+3.82e-02

Pos logit contributions
ement+0.214
 sunset+0.197
 horizon+0.195
work+0.195
 fullest+0.189
Neg logit contributions
isk-0.093
 came-0.079
 bark-0.078
aks-0.078
 woke-0.077
Token,
Token ID11
Feature activation+4.90e-02

Pos logit contributions
ement+0.073
ised+0.073
inated+0.071
ths+0.070
 horizon+0.069
Neg logit contributions
isk-0.053
aks-0.046
smart-0.046
and-0.043
 backup-0.043
Token scared
Token ID12008
Feature activation+1.35e-02

Pos logit contributions
ement+0.087
ised+0.084
work+0.082
el+0.080
ths+0.077
Neg logit contributions
isk-0.081
eh-0.069
coat-0.066
aks-0.064
 fetch-0.064
Token.
Token ID13
Feature activation+2.76e-02

Pos logit contributions
inated+0.023
ised+0.023
ement+0.022
work+0.022
el+0.021
Neg logit contributions
isk-0.024
aks-0.020
arl-0.020
ives-0.020
aid-0.019
Token
Token ID198
Feature activation+2.51e-02

Pos logit contributions
ised+0.056
ement+0.056
inated+0.054
work+0.053
el+0.053
Neg logit contributions
isk-0.038
aks-0.032
ives-0.031
arl-0.031
smart-0.030
Token jealous
Token ID19354
Feature activation-5.58e-03

Pos logit contributions
isk+0.024
aks+0.020
arl+0.020
ives+0.020
coat+0.020
Neg logit contributions
ised-0.021
inated-0.020
ement-0.019
work-0.019
el-0.019
Token of
Token ID286
Feature activation+6.28e-03

Pos logit contributions
isk+0.009
aks+0.008
eh+0.007
arl+0.007
aid+0.007
Neg logit contributions
ised-0.010
ement-0.010
inated-0.010
work-0.010
el-0.010
Token your
Token ID534
Feature activation-9.08e-03

Pos logit contributions
ised+0.010
inated+0.010
ement+0.010
work+0.010
el+0.010
Neg logit contributions
isk-0.011
aks-0.010
arl-0.009
ives-0.009
eh-0.009
Token friends
Token ID2460
Feature activation+2.29e-02

Pos logit contributions
isk+0.018
aks+0.016
arl+0.015
ives+0.015
aid+0.015
Neg logit contributions
ised-0.014
ement-0.014
inated-0.014
work-0.013
el-0.013
Token and
Token ID290
Feature activation-1.86e-03

Pos logit contributions
inated+0.040
ised+0.039
ement+0.038
el+0.037
 sunset+0.037
Neg logit contributions
isk-0.040
aks-0.035
ives-0.034
arl-0.032
smart-0.032
Token don
Token ID836
Feature activation+9.32e-02

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
ised-0.003
inated-0.003
ement-0.003
work-0.003
el-0.003
Token't
Token ID470
Feature activation+2.89e-02

Pos logit contributions
ishment+0.214
ement+0.208
 reflection+0.202
 pearl+0.201
 horizon+0.200
Neg logit contributions
isk-0.099
aster-0.080
ives-0.080
iren-0.077
 fetch-0.077
Token be
Token ID307
Feature activation-1.55e-02

Pos logit contributions
ement+0.070
ised+0.067
inated+0.065
 horizon+0.062
work+0.062
Neg logit contributions
isk-0.035
aks-0.026
arl-0.026
aid-0.025
ens-0.024
Token mean
Token ID1612
Feature activation-7.41e-04

Pos logit contributions
isk+0.030
arl+0.026
ives+0.025
aks+0.025
coat+0.024
Neg logit contributions
ised-0.025
inated-0.024
ement-0.023
work-0.023
el-0.022
Token to
Token ID284
Feature activation+8.78e-03

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
eh+0.001
ives+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
work-0.001
el-0.001
Token them
Token ID606
Feature activation+2.66e-02

Pos logit contributions
ement+0.014
ised+0.013
inated+0.013
work+0.013
el+0.012
Neg logit contributions
isk-0.017
aks-0.015
arl-0.015
ives-0.014
aid-0.014
Token loved
Token ID6151
Feature activation-5.30e-03

Pos logit contributions
isk+0.010
arl+0.009
ives+0.009
aks+0.009
aid+0.008
Neg logit contributions
ised-0.009
inated-0.009
ement-0.009
el-0.009
work-0.009
Token to
Token ID284
Feature activation-2.48e-02

Pos logit contributions
isk+0.009
aks+0.007
arl+0.007
ives+0.007
eh+0.007
Neg logit contributions
inated-0.010
ised-0.010
ement-0.010
 sunset-0.010
work-0.010
Token play
Token ID711
Feature activation-1.33e-02

Pos logit contributions
isk+0.041
aks+0.035
arl+0.034
ives+0.034
eh+0.032
Neg logit contributions
 sunset-0.046
inated-0.046
el-0.044
 comet-0.044
work-0.044
Token with
Token ID351
Feature activation+5.43e-04

Pos logit contributions
isk+0.028
arl+0.024
aks+0.024
ives+0.023
aid+0.023
Neg logit contributions
ised-0.020
inated-0.019
ement-0.019
el-0.018
 sunset-0.018
Token his
Token ID465
Feature activation+3.31e-02

Pos logit contributions
ised+0.001
inated+0.001
ement+0.001
el+0.001
work+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
aid-0.001
Token orange
Token ID10912
Feature activation+9.29e-02

Pos logit contributions
ement+0.080
inated+0.078
ised+0.078
 horizon+0.077
 int+0.076
Neg logit contributions
isk-0.032
aks-0.028
coat-0.027
eh-0.027
 germ-0.025
Token ball
Token ID2613
Feature activation+1.28e-02

Pos logit contributions
ement+0.246
uct+0.243
used+0.237
ised+0.234
itating+0.233
Neg logit contributions
 banana-0.065
 fetch-0.056
 cookie-0.053
 fort-0.052
 books-0.052
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
ised+0.020
ement+0.019
inated+0.019
work+0.018
el+0.018
Neg logit contributions
isk-0.025
aks-0.022
eh-0.021
ives-0.021
arl-0.021
Token He
Token ID679
Feature activation-1.20e-02

Pos logit contributions
isk+0.022
aks+0.019
eh+0.018
aid+0.018
ives+0.018
Neg logit contributions
inated-0.021
ement-0.021
ised-0.021
 sunset-0.021
el-0.021
Token would
Token ID561
Feature activation-1.41e-02

Pos logit contributions
isk+0.022
arl+0.019
ives+0.018
aks+0.018
aid+0.018
Neg logit contributions
inated-0.020
ised-0.020
el-0.019
ement-0.019
work-0.019
Token throw
Token ID3714
Feature activation+1.48e-02

Pos logit contributions
isk+0.020
aks+0.017
ives+0.016
arl+0.016
aid+0.016
Neg logit contributions
inated-0.029
ised-0.028
el-0.028
 sunset-0.028
work-0.028
Token listen
Token ID6004
Feature activation-1.84e-02

Pos logit contributions
ement+0.031
ised+0.031
inated+0.030
work+0.029
el+0.028
Neg logit contributions
isk-0.024
aks-0.020
arl-0.020
ives-0.020
aid-0.019
Token to
Token ID284
Feature activation-1.05e-02

Pos logit contributions
isk+0.034
aks+0.027
eh+0.026
arl+0.026
aid+0.026
Neg logit contributions
ised-0.032
inated-0.032
work-0.031
ement-0.031
 int-0.030
Token your
Token ID534
Feature activation+1.95e-02

Pos logit contributions
isk+0.018
ives+0.016
coat+0.016
aks+0.016
aid+0.016
Neg logit contributions
ised-0.018
inated-0.017
el-0.017
 sunset-0.017
work-0.017
Token parents
Token ID3397
Feature activation+4.54e-03

Pos logit contributions
ised+0.033
ement+0.033
inated+0.032
ized+0.030
el+0.029
Neg logit contributions
isk-0.036
arl-0.032
 germ-0.032
 backup-0.030
aks-0.030
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
ised+0.007
inated+0.007
ement+0.007
work+0.007
el+0.007
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
aid-0.007
Token don
Token ID836
Feature activation+9.26e-02

Pos logit contributions
isk+0.021
aks+0.018
eh+0.017
arl+0.017
ives+0.017
Neg logit contributions
ised-0.019
inated-0.019
work-0.019
ement-0.018
el-0.018
Token't
Token ID470
Feature activation+3.40e-02

Pos logit contributions
ishment+0.204
ement+0.196
 reflection+0.193
 horizon+0.192
 diamonds+0.191
Neg logit contributions
isk-0.099
iren-0.090
aster-0.088
arl-0.087
aks-0.086
Token touch
Token ID3638
Feature activation+1.22e-02

Pos logit contributions
ement+0.089
ised+0.083
inated+0.078
ths+0.077
work+0.077
Neg logit contributions
isk-0.037
aks-0.029
arl-0.027
ens-0.027
aid-0.026
Token things
Token ID1243
Feature activation+6.48e-03

Pos logit contributions
inated+0.020
ised+0.020
ement+0.020
work+0.019
el+0.019
Neg logit contributions
isk-0.022
aks-0.019
arl-0.018
ives-0.018
aid-0.018
Token that
Token ID326
Feature activation-1.84e-02

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.009
el+0.009
Neg logit contributions
isk-0.013
aks-0.012
arl-0.011
ives-0.011
aid-0.011
Token can
Token ID460
Feature activation+2.04e-02

Pos logit contributions
isk+0.027
eh+0.023
aks+0.022
arl+0.022
coat+0.021
Neg logit contributions
work-0.036
el-0.036
inated-0.036
ised-0.036
ement-0.036
Token I
Token ID314
Feature activation-1.90e-02

Pos logit contributions
ement+0.041
ised+0.041
inated+0.041
work+0.039
 sunset+0.038
Neg logit contributions
isk-0.038
arl-0.030
aid-0.030
ens-0.030
aks-0.030
Token take
Token ID1011
Feature activation+1.50e-03

Pos logit contributions
isk+0.029
aks+0.025
arl+0.023
eh+0.023
ives+0.023
Neg logit contributions
inated-0.036
el-0.036
ised-0.035
ement-0.035
 sunset-0.035
Token a
Token ID257
Feature activation-1.40e-02

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.003
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token nap
Token ID25422
Feature activation+6.71e-03

Pos logit contributions
isk+0.018
aks+0.014
ives+0.013
arl+0.013
eh+0.012
Neg logit contributions
ised-0.031
ement-0.031
inated-0.031
 sunset-0.030
work-0.030
Token now
Token ID783
Feature activation+1.45e-02

Pos logit contributions
ised+0.010
inated+0.010
ement+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.011
eh-0.011
Token?"
Token ID1701
Feature activation+9.24e-02

Pos logit contributions
ised+0.022
inated+0.022
ement+0.021
work+0.020
el+0.020
Neg logit contributions
isk-0.028
aks-0.025
arl-0.025
ives-0.024
eh-0.024
Token asked
Token ID1965
Feature activation+7.84e-02

Pos logit contributions
work+0.198
ement+0.192
ths+0.186
itz+0.182
izing+0.180
Neg logit contributions
ens-0.089
 Sue-0.087
 came-0.086
 comes-0.086
 see-0.086
Token Walter
Token ID15819
Feature activation+2.32e-02

Pos logit contributions
inated+0.135
ths+0.133
 horizon+0.131
work+0.130
ement+0.129
Neg logit contributions
isk-0.113
aks-0.108
ens-0.102
 welcome-0.099
aid-0.097
Token.
Token ID13
Feature activation+4.93e-02

Pos logit contributions
ement+0.037
ised+0.037
inated+0.036
work+0.034
 sunset+0.034
Neg logit contributions
isk-0.044
aks-0.038
eh-0.036
ives-0.036
aid-0.036
Token
Token ID198
Feature activation+2.99e-02

Pos logit contributions
ised+0.113
izing+0.113
ths+0.111
ement+0.110
work+0.109
Neg logit contributions
isk-0.052
aks-0.043
 welcome-0.039
arl-0.039
and-0.038
Token
Token ID198
Feature activation+3.97e-02

Pos logit contributions
ement+0.062
work+0.061
ised+0.061
inated+0.060
izing+0.058
Neg logit contributions
isk-0.040
aks-0.037
arl-0.033
aid-0.032
eh-0.032
TokenWhat
Token ID2061
Feature activation+7.22e-03

Pos logit contributions
ement+0.046
el+0.045
ised+0.045
inated+0.045
work+0.043
Neg logit contributions
isk-0.045
aid-0.040
aks-0.039
ives-0.038
arl-0.038
Token is
Token ID318
Feature activation+5.13e-02

Pos logit contributions
ised+0.014
inated+0.014
ement+0.014
work+0.013
el+0.013
Neg logit contributions
isk-0.011
aks-0.009
ives-0.009
arl-0.009
aid-0.009
Token a
Token ID257
Feature activation+2.16e-02

Pos logit contributions
ised+0.111
ement+0.110
 horizon+0.102
inated+0.098
el+0.097
Neg logit contributions
 germ-0.062
aks-0.061
 robber-0.059
isk-0.059
 Luna-0.059
Token language
Token ID3303
Feature activation+2.95e-02

Pos logit contributions
ised+0.042
ement+0.041
inated+0.040
el+0.039
work+0.038
Neg logit contributions
isk-0.034
aks-0.029
arl-0.028
 germ-0.028
eh-0.027
Token?"
Token ID1701
Feature activation-1.32e-02

Pos logit contributions
inated+0.048
ised+0.048
ement+0.047
el+0.044
itz+0.043
Neg logit contributions
isk-0.052
aks-0.047
arl-0.045
ives-0.044
aid-0.044
Token Lily
Token ID20037
Feature activation+9.19e-02

Pos logit contributions
isk+0.018
aks+0.014
arl+0.013
ives+0.013
eh+0.012
Neg logit contributions
ised-0.030
ement-0.029
 sunset-0.028
inated-0.028
el-0.027
Token asks
Token ID7893
Feature activation+2.78e-03

Pos logit contributions
work+0.224
ised+0.217
izing+0.212
ized+0.210
ement+0.208
Neg logit contributions
 comes-0.090
 hears-0.072
 wakes-0.071
aks-0.070
 sees-0.070
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
ised+0.004
inated+0.004
ement+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token
Token ID198
Feature activation-2.69e-03

Pos logit contributions
isk+0.021
aks+0.018
eh+0.017
aid+0.017
 germ+0.017
Neg logit contributions
inated-0.016
ised-0.016
ement-0.016
 sunset-0.015
el-0.014
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
ives+0.004
eh+0.004
Neg logit contributions
ised-0.004
ement-0.004
inated-0.004
el-0.004
work-0.004
Token"
Token ID1
Feature activation+4.37e-02

Pos logit contributions
isk+0.026
 germ+0.021
aks+0.021
arl+0.021
ives+0.020
Neg logit contributions
ised-0.031
inated-0.029
ement-0.028
el-0.028
izing-0.027
Token buy
Token ID2822
Feature activation-9.03e-03

Pos logit contributions
isk+0.009
aks+0.008
arl+0.007
eh+0.007
ives+0.007
Neg logit contributions
ised-0.011
inated-0.011
ement-0.011
work-0.010
el-0.010
Token too
Token ID1165
Feature activation-8.09e-04

Pos logit contributions
isk+0.017
aks+0.014
arl+0.014
ives+0.013
eh+0.013
Neg logit contributions
ised-0.015
ement-0.015
inated-0.015
work-0.014
el-0.014
Token much
Token ID881
Feature activation-1.13e-02

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
work-0.002
el-0.002
Token food
Token ID2057
Feature activation-2.06e-02

Pos logit contributions
isk+0.020
aks+0.016
arl+0.015
eh+0.015
coat+0.015
Neg logit contributions
ised-0.021
ement-0.020
inated-0.020
work-0.020
 sunset-0.019
Token and
Token ID290
Feature activation+2.84e-03

Pos logit contributions
isk+0.040
aks+0.034
eh+0.033
arl+0.033
aid+0.032
Neg logit contributions
ised-0.032
ement-0.031
inated-0.031
work-0.031
el-0.030
Token don
Token ID836
Feature activation+9.18e-02

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.004
el+0.004
Neg logit contributions
isk-0.005
aks-0.005
arl-0.004
ives-0.004
eh-0.004
Token't
Token ID470
Feature activation+3.35e-02

Pos logit contributions
ishment+0.203
ement+0.199
 reflection+0.195
inated+0.193
work+0.192
Neg logit contributions
isk-0.098
aster-0.086
iren-0.081
ives-0.080
arl-0.079
Token eat
Token ID4483
Feature activation+1.78e-03

Pos logit contributions
ement+0.081
ised+0.077
inated+0.074
ths+0.071
el+0.070
Neg logit contributions
isk-0.041
arl-0.031
ens-0.030
aks-0.029
aid-0.029
Token it
Token ID340
Feature activation-3.27e-03

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token all
Token ID477
Feature activation+1.27e-02

Pos logit contributions
isk+0.007
aks+0.006
arl+0.006
ives+0.006
aid+0.006
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
work-0.004
el-0.004
Token.
Token ID13
Feature activation+3.13e-02

Pos logit contributions
ised+0.021
ement+0.021
inated+0.021
el+0.019
work+0.019
Neg logit contributions
isk-0.023
aks-0.021
ives-0.019
arl-0.019
eh-0.019
Token can
Token ID460
Feature activation+2.83e-02

Pos logit contributions
ement+0.052
ised+0.052
inated+0.052
el+0.049
work+0.049
Neg logit contributions
isk-0.030
aks-0.025
ives-0.024
arl-0.024
aid-0.023
Token happen
Token ID1645
Feature activation+5.41e-03

Pos logit contributions
ement+0.063
inated+0.062
ised+0.062
work+0.058
ths+0.057
Neg logit contributions
isk-0.037
arl-0.027
aks-0.027
ives-0.027
aid-0.026
Token even
Token ID772
Feature activation-1.49e-03

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.009
el+0.008
Neg logit contributions
isk-0.010
aks-0.009
arl-0.008
ives-0.008
eh-0.008
Token when
Token ID618
Feature activation+3.55e-02

Pos logit contributions
isk+0.003
aks+0.002
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.003
ement-0.003
inated-0.003
work-0.002
el-0.002
Token you
Token ID345
Feature activation+1.68e-02

Pos logit contributions
ement+0.064
inated+0.063
ised+0.062
work+0.060
el+0.058
Neg logit contributions
isk-0.057
aks-0.051
 germ-0.049
arl-0.047
 backup-0.047
Token don
Token ID836
Feature activation+9.13e-02

Pos logit contributions
ement+0.037
ised+0.036
inated+0.036
el+0.034
work+0.034
Neg logit contributions
isk-0.023
aks-0.018
ives-0.018
arl-0.017
aid-0.017
Token't
Token ID470
Feature activation+4.02e-02

Pos logit contributions
ement+0.185
ishment+0.181
inated+0.175
work+0.174
 reflection+0.174
Neg logit contributions
isk-0.117
ives-0.100
aks-0.099
arl-0.098
iren-0.095
Token expect
Token ID1607
Feature activation+2.32e-02

Pos logit contributions
ement+0.101
ised+0.094
ths+0.091
inated+0.090
work+0.088
Neg logit contributions
isk-0.045
arl-0.034
aks-0.033
ens-0.032
ives-0.031
Token it
Token ID340
Feature activation+4.26e-03

Pos logit contributions
ised+0.038
inated+0.038
ement+0.038
work+0.036
el+0.035
Neg logit contributions
isk-0.041
aks-0.037
ives-0.035
arl-0.034
 germ-0.034
Token.
Token ID13
Feature activation+6.24e-03

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.007
el+0.007
Neg logit contributions
isk-0.008
aks-0.006
arl-0.006
ives-0.006
eh-0.006
Token<|endoftext|>
Token ID50256
Feature activation+4.53e-04

Pos logit contributions
ised+0.011
ement+0.011
inated+0.011
work+0.010
el+0.010
Neg logit contributions
isk-0.011
aks-0.009
arl-0.009
ives-0.009
aid-0.009
Token of
Token ID286
Feature activation+1.74e-02

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.006
el+0.006
Neg logit contributions
isk-0.004
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token her
Token ID607
Feature activation+4.18e-02

Pos logit contributions
ised+0.036
ement+0.035
inated+0.035
work+0.033
ths+0.033
Neg logit contributions
isk-0.026
aks-0.021
arl-0.019
aid-0.019
 germ-0.019
Token with
Token ID351
Feature activation+2.09e-02

Pos logit contributions
inated+0.083
ised+0.080
ement+0.079
 int+0.077
ths+0.076
Neg logit contributions
isk-0.062
coat-0.050
ives-0.048
arl-0.047
aks-0.047
Token her
Token ID607
Feature activation+3.48e-02

Pos logit contributions
ement+0.040
ised+0.039
inated+0.039
work+0.036
 horizon+0.036
Neg logit contributions
isk-0.033
aks-0.030
aid-0.028
ives-0.027
arl-0.027
Token favorite
Token ID4004
Feature activation+1.89e-02

Pos logit contributions
inated+0.067
ised+0.067
ths+0.065
 horizon+0.063
ement+0.062
Neg logit contributions
isk-0.051
coat-0.045
ives-0.044
aks-0.042
 robber-0.042
Token toy
Token ID13373
Feature activation+9.07e-02

Pos logit contributions
ised+0.035
inated+0.034
ement+0.033
ths+0.032
work+0.032
Neg logit contributions
isk-0.031
aks-0.027
 germ-0.026
coat-0.026
aid-0.026
Token.
Token ID13
Feature activation-1.25e-02

Pos logit contributions
ised+0.212
ement+0.195
 int+0.195
 avail+0.194
uct+0.194
Neg logit contributions
isk-0.085
 germ-0.084
 robber-0.080
coat-0.079
 v-0.077
Token Lily
Token ID20037
Feature activation+3.64e-02

Pos logit contributions
isk+0.021
aks+0.018
aid+0.017
eh+0.017
arl+0.017
Neg logit contributions
inated-0.023
ised-0.023
ement-0.022
 sunset-0.022
el-0.021
Token smiled
Token ID13541
Feature activation+4.90e-02

Pos logit contributions
ement+0.075
ised+0.070
 horizon+0.069
work+0.069
inated+0.068
Neg logit contributions
isk-0.054
coat-0.044
aks-0.041
ives-0.040
ens-0.040
Token and
Token ID290
Feature activation+1.61e-02

Pos logit contributions
ement+0.113
used+0.109
inated+0.108
ised+0.107
el+0.106
Neg logit contributions
isk-0.050
coat-0.037
 germ-0.033
 robber-0.032
ens-0.032
Token said
Token ID531
Feature activation-2.07e-02

Pos logit contributions
ement+0.030
ised+0.029
inated+0.028
work+0.028
el+0.028
Neg logit contributions
isk-0.027
aks-0.023
coat-0.022
aid-0.021
eh-0.021
Token Now
Token ID2735
Feature activation-1.05e-02

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
eh+0.001
ives+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
Token she
Token ID673
Feature activation+2.37e-02

Pos logit contributions
isk+0.021
aks+0.019
arl+0.019
eh+0.019
aid+0.018
Neg logit contributions
ised-0.015
inated-0.015
ement-0.015
el-0.015
 sunset-0.014
Token could
Token ID714
Feature activation+1.84e-02

Pos logit contributions
ised+0.046
inated+0.046
ement+0.045
izing+0.043
work+0.042
Neg logit contributions
isk-0.037
aks-0.031
ives-0.031
aid-0.029
 comes-0.029
Token look
Token ID804
Feature activation+8.16e-03

Pos logit contributions
ement+0.040
ised+0.039
inated+0.039
work+0.037
el+0.037
Neg logit contributions
isk-0.025
aks-0.020
aid-0.020
coat-0.019
arl-0.019
Token at
Token ID379
Feature activation+4.38e-02

Pos logit contributions
ised+0.012
inated+0.012
ement+0.012
work+0.012
el+0.012
Neg logit contributions
isk-0.016
aks-0.014
arl-0.013
ives-0.013
eh-0.013
Token the
Token ID262
Feature activation+9.07e-02

Pos logit contributions
inated+0.085
ement+0.081
ised+0.079
inct+0.077
used+0.077
Neg logit contributions
isk-0.064
aks-0.055
aid-0.051
 backup-0.049
ens-0.049
Token sun
Token ID4252
Feature activation+3.42e-02

Pos logit contributions
inated+0.134
used+0.133
itz+0.125
amb+0.125
ised+0.122
Neg logit contributions
 shady-0.136
 welcome-0.133
 cookies-0.130
 sees-0.129
 cookie-0.127
Tokensets
Token ID28709
Feature activation+3.87e-02

Pos logit contributions
ised+0.055
inated+0.054
ement+0.052
itz+0.048
el+0.047
Neg logit contributions
isk-0.063
coat-0.052
aks-0.052
ives-0.051
ens-0.050
Token pieces
Token ID5207
Feature activation+1.07e-02

Pos logit contributions
ised+0.064
inated+0.063
ement+0.060
el+0.059
itz+0.056
Neg logit contributions
isk-0.068
ives-0.058
aks-0.058
coat-0.056
eh-0.054
Token every
Token ID790
Feature activation+2.74e-02

Pos logit contributions
ised+0.018
inated+0.017
ement+0.017
work+0.016
el+0.016
Neg logit contributions
isk-0.019
aks-0.017
ives-0.016
arl-0.016
eh-0.016
Token day
Token ID1110
Feature activation+1.19e-02

Pos logit contributions
inated+0.053
ised+0.053
ement+0.052
work+0.050
izing+0.049
Neg logit contributions
isk-0.038
aks-0.036
eh-0.035
aster-0.032
coat-0.032
Token parents
Token ID3397
Feature activation+1.46e-02

Pos logit contributions
ised+0.032
ement+0.032
inated+0.032
work+0.031
el+0.029
Neg logit contributions
isk-0.039
arl-0.034
ives-0.033
aks-0.033
 germ-0.032
Token,
Token ID11
Feature activation+2.53e-04

Pos logit contributions
ised+0.023
inated+0.023
ement+0.023
el+0.022
work+0.021
Neg logit contributions
isk-0.028
aks-0.024
ives-0.023
arl-0.023
aid-0.022
Token even
Token ID772
Feature activation-6.59e-04

Pos logit contributions
ised+0.000
inated+0.000
ement+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.001
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token if
Token ID611
Feature activation+4.41e-02

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
Token we
Token ID356
Feature activation+3.13e-02

Pos logit contributions
ement+0.082
inated+0.080
ised+0.075
work+0.073
 horizon+0.073
Neg logit contributions
isk-0.064
 germ-0.061
 backup-0.060
aks-0.059
 comes-0.058
Token don
Token ID836
Feature activation+9.05e-02

Pos logit contributions
ement+0.072
ised+0.069
inated+0.069
el+0.065
work+0.065
Neg logit contributions
isk-0.039
ives-0.033
aid-0.031
aks-0.031
arl-0.030
Token't
Token ID470
Feature activation+4.13e-02

Pos logit contributions
ishment+0.190
ement+0.188
work+0.188
inated+0.185
 pearl+0.180
Neg logit contributions
isk-0.105
ives-0.095
arl-0.090
iren-0.088
aks-0.086
Token want
Token ID765
Feature activation+4.34e-02

Pos logit contributions
ement+0.099
ised+0.093
work+0.090
inated+0.090
 horizon+0.087
Neg logit contributions
isk-0.049
arl-0.040
coat-0.040
ives-0.039
aid-0.039
Token to
Token ID284
Feature activation+4.60e-02

Pos logit contributions
ised+0.097
inated+0.096
ement+0.096
work+0.091
inct+0.088
Neg logit contributions
ives-0.043
aks-0.043
isk-0.043
 backup-0.042
smart-0.041
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
ement+0.100
ised+0.096
inated+0.092
work+0.092
 horizon+0.089
Neg logit contributions
isk-0.058
coat-0.051
arl-0.049
aid-0.048
 germ-0.048
Token<|endoftext|>
Token ID50256
Feature activation+1.16e-02

Pos logit contributions
ised+0.041
ement+0.040
inated+0.040
work+0.038
el+0.037
Neg logit contributions
isk-0.036
aks-0.030
arl-0.029
ives-0.029
aid-0.028
Token."
Token ID526
Feature activation+2.40e-02

Pos logit contributions
ised+0.043
inated+0.043
ement+0.042
el+0.039
work+0.039
Neg logit contributions
isk-0.040
aks-0.038
ives-0.035
arl-0.035
aid-0.034
Token They
Token ID1119
Feature activation+3.98e-02

Pos logit contributions
ement+0.044
ised+0.044
work+0.044
inated+0.044
ths+0.042
Neg logit contributions
isk-0.038
arl-0.033
aks-0.032
ives-0.032
aid-0.030
Token stopped
Token ID5025
Feature activation+4.10e-02

Pos logit contributions
ement+0.079
inated+0.076
izing+0.075
ised+0.075
 horizon+0.070
Neg logit contributions
isk-0.061
aks-0.053
aid-0.050
eh-0.049
ens-0.048
Token to
Token ID284
Feature activation+2.30e-02

Pos logit contributions
ement+0.080
ised+0.078
inated+0.075
izing+0.073
ified+0.071
Neg logit contributions
isk-0.057
 comes-0.049
aks-0.048
smart-0.047
arl-0.047
Token watch
Token ID2342
Feature activation+3.46e-02

Pos logit contributions
ised+0.050
ement+0.050
inated+0.048
el+0.047
izing+0.046
Neg logit contributions
isk-0.031
aks-0.024
aid-0.023
arl-0.023
ives-0.022
Token the
Token ID262
Feature activation+9.05e-02

Pos logit contributions
ised+0.067
inated+0.066
ement+0.064
ths+0.062
itz+0.060
Neg logit contributions
isk-0.050
aks-0.043
ives-0.041
arl-0.040
 germ-0.039
Token sunset
Token ID26428
Feature activation+3.81e-02

Pos logit contributions
inated+0.103
used+0.100
ised+0.099
 avail+0.093
itz+0.093
Neg logit contributions
 shady-0.162
 welcome-0.162
 cookies-0.161
 banana-0.157
 cookie-0.155
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
ised+0.068
inated+0.066
ement+0.066
el+0.062
itz+0.061
Neg logit contributions
isk-0.060
aks-0.053
ives-0.052
coat-0.049
aid-0.048
Token Suddenly
Token ID24975
Feature activation+1.95e-02

Pos logit contributions
ement+0.070
ised+0.070
work+0.069
ths+0.069
izing+0.067
Neg logit contributions
isk-0.050
arl-0.045
aks-0.042
ives-0.041
ens-0.037
Token,
Token ID11
Feature activation+4.19e-02

Pos logit contributions
inated+0.039
ement+0.039
ised+0.038
work+0.037
el+0.036
Neg logit contributions
isk-0.027
aks-0.024
arl-0.022
ives-0.022
 germ-0.022
Token Nina
Token ID36300
Feature activation+2.58e-02

Pos logit contributions
ement+0.084
inated+0.081
ised+0.076
work+0.076
ths+0.073
Neg logit contributions
isk-0.056
 germ-0.051
ens-0.049
 backup-0.047
arl-0.047
Token They
Token ID1119
Feature activation+6.05e-02

Pos logit contributions
ised+0.075
ement+0.073
ized+0.071
izing+0.070
work+0.068
Neg logit contributions
isk-0.049
aks-0.041
ives-0.040
 Luna-0.040
 comes-0.039
Token both
Token ID1111
Feature activation+5.13e-02

Pos logit contributions
ement+0.122
izing+0.116
 avail+0.109
inated+0.108
ised+0.106
Neg logit contributions
aks-0.080
isk-0.080
 climbed-0.075
 came-0.074
aid-0.073
Token marvel
Token ID22672
Feature activation+3.83e-02

Pos logit contributions
ement+0.114
izing+0.112
 avail+0.109
ised+0.107
inated+0.106
Neg logit contributions
aks-0.055
isk-0.051
 comes-0.048
ives-0.047
aid-0.046
Tokened
Token ID276
Feature activation+5.12e-02

Pos logit contributions
inated+0.071
 horizon+0.070
 sunset+0.069
ement+0.069
 avail+0.069
Neg logit contributions
isk-0.050
aks-0.049
ives-0.048
arl-0.044
smart-0.043
Token at
Token ID379
Feature activation+5.82e-02

Pos logit contributions
inated+0.088
itz+0.085
 avail+0.085
ised+0.085
ement+0.084
Neg logit contributions
aks-0.070
isk-0.068
 comes-0.063
smart-0.062
ives-0.062
Token the
Token ID262
Feature activation+9.05e-02

Pos logit contributions
ble+0.124
inated+0.118
 horizon+0.118
ised+0.117
inct+0.117
Neg logit contributions
isk-0.055
 Sue-0.052
aks-0.051
 comes-0.049
 Luna-0.049
Token comet
Token ID31733
Feature activation+6.04e-02

Pos logit contributions
inated+0.127
ised+0.114
used+0.111
bed+0.108
ths+0.105
Neg logit contributions
 wake-0.149
 welcome-0.141
 comes-0.139
 firefighter-0.139
 nice-0.138
Token.
Token ID13
Feature activation+5.54e-02

Pos logit contributions
ised+0.111
inated+0.106
ement+0.103
itz+0.101
 avail+0.098
Neg logit contributions
isk-0.087
aks-0.080
ives-0.073
 comes-0.072
smart-0.072
Token Its
Token ID6363
Feature activation+7.70e-02

Pos logit contributions
ement+0.114
ised+0.111
ized+0.110
izing+0.108
ble+0.107
Neg logit contributions
isk-0.068
aks-0.061
 comes-0.061
 Luna-0.061
 Kim-0.057
Token tail
Token ID7894
Feature activation+3.88e-02

Pos logit contributions
ement+0.126
ivating+0.120
yles+0.120
yle+0.119
ised+0.118
Neg logit contributions
 welcome-0.094
 shady-0.093
aks-0.091
 hug-0.090
 firefighter-0.085
Token spark
Token ID9009
Feature activation+3.49e-02

Pos logit contributions
ised+0.069
ement+0.068
inated+0.067
work+0.065
itz+0.063
Neg logit contributions
isk-0.059
aks-0.053
ives-0.052
 comes-0.049
coat-0.048
Token important
Token ID1593
Feature activation+1.20e-02

Pos logit contributions
ised+0.014
ement+0.014
inated+0.014
el+0.013
work+0.013
Neg logit contributions
isk-0.016
aks-0.013
arl-0.013
eh-0.013
ives-0.013
Token?"
Token ID1701
Feature activation+2.42e-02

Pos logit contributions
ement+0.020
ised+0.020
inated+0.020
el+0.019
work+0.019
Neg logit contributions
isk-0.022
aks-0.019
arl-0.018
ives-0.018
eh-0.017
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
ised+0.047
inated+0.047
ement+0.047
izing+0.045
work+0.045
Neg logit contributions
isk-0.035
aks-0.030
smart-0.028
aid-0.028
eh-0.027
Token
Token ID198
Feature activation+1.02e-02

Pos logit contributions
inated+0.024
ement+0.024
ised+0.023
work+0.023
el+0.022
Neg logit contributions
isk-0.021
aks-0.018
arl-0.017
aid-0.017
eh-0.017
TokenThe
Token ID464
Feature activation-6.49e-03

Pos logit contributions
ement+0.018
inated+0.017
ised+0.017
work+0.017
el+0.017
Neg logit contributions
isk-0.018
aks-0.016
aid-0.015
eh-0.015
arl-0.014
Token voice
Token ID3809
Feature activation+9.04e-02

Pos logit contributions
isk+0.013
aks+0.011
ives+0.011
coat+0.011
arl+0.011
Neg logit contributions
ised-0.010
inated-0.010
ement-0.010
work-0.009
 sunset-0.009
Token replied
Token ID8712
Feature activation+1.55e-02

Pos logit contributions
inated+0.166
 horizon+0.163
 avail+0.162
 sunset+0.156
yles+0.156
Neg logit contributions
 comes-0.108
isk-0.107
aks-0.103
smart-0.102
aid-0.101
Token,
Token ID11
Feature activation+3.02e-02

Pos logit contributions
ised+0.028
inated+0.028
ement+0.027
el+0.027
 horizon+0.027
Neg logit contributions
isk-0.025
aks-0.021
arl-0.020
ives-0.020
eh-0.020
Token "
Token ID366
Feature activation+4.28e-02

Pos logit contributions
ised+0.073
ement+0.072
el+0.071
inated+0.070
work+0.070
Neg logit contributions
isk-0.031
arl-0.022
aks-0.021
eh-0.020
smart-0.020
TokenThis
Token ID1212
Feature activation+4.35e-03

Pos logit contributions
ement+0.082
ised+0.076
 avail+0.074
inated+0.073
 curiously+0.071
Neg logit contributions
isk-0.066
aid-0.059
and-0.059
ives-0.058
aks-0.056
Token bridge
Token ID7696
Feature activation-3.51e-03

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.007
aks-0.006
arl-0.005
ives-0.005
eh-0.005
Token,
Token ID11
Feature activation-1.96e-02

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.006
aid-0.006
Token we
Token ID356
Feature activation+4.85e-03

Pos logit contributions
isk+0.043
aks+0.039
eh+0.037
ives+0.037
arl+0.036
Neg logit contributions
ised-0.026
inated-0.025
ement-0.025
el-0.024
 comet-0.024
Token can
Token ID460
Feature activation+3.19e-02

Pos logit contributions
ised+0.011
ement+0.011
inated+0.011
work+0.010
el+0.010
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
aid-0.005
Token go
Token ID467
Feature activation+1.05e-03

Pos logit contributions
ised+0.080
ement+0.079
inated+0.075
work+0.074
izing+0.074
Neg logit contributions
isk-0.032
aks-0.024
arl-0.023
ives-0.022
coat-0.022
Token see
Token ID766
Feature activation+3.92e-02

Pos logit contributions
ised+0.002
inated+0.002
ement+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.002
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token the
Token ID262
Feature activation+9.03e-02

Pos logit contributions
ised+0.071
inated+0.071
ement+0.068
work+0.067
itz+0.066
Neg logit contributions
isk-0.056
aks-0.056
 comes-0.050
smart-0.049
ives-0.048
Token sunset
Token ID26428
Feature activation+2.34e-02

Pos logit contributions
bed+0.151
used+0.150
ised+0.148
inated+0.143
ica+0.139
Neg logit contributions
 welcome-0.134
 nice-0.124
 shady-0.123
 wake-0.120
 comes-0.119
Token."
Token ID526
Feature activation-7.19e-04

Pos logit contributions
ised+0.039
ement+0.038
inated+0.038
el+0.036
itz+0.034
Neg logit contributions
isk-0.042
aks-0.038
ives-0.036
arl-0.035
coat-0.034
Token
Token ID198
Feature activation-1.70e-03

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
eh+0.001
ives+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
Token
Token ID198
Feature activation+7.21e-03

Pos logit contributions
isk+0.003
aks+0.003
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
ised-0.003
ement-0.003
inated-0.003
el-0.003
work-0.003
TokenAs
Token ID1722
Feature activation+1.67e-02

Pos logit contributions
ement+0.019
inated+0.019
ised+0.019
work+0.019
el+0.018
Neg logit contributions
isk-0.006
aks-0.005
arl-0.004
eh-0.004
ives-0.004
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token while
Token ID981
Feature activation+2.75e-02

Pos logit contributions
ement+0.041
inated+0.040
ised+0.040
 horizon+0.039
work+0.039
Neg logit contributions
isk-0.024
aks-0.019
eh-0.019
aid-0.019
ens-0.018
Token Tim
Token ID5045
Feature activation+4.55e-04

Pos logit contributions
ement+0.060
inated+0.060
ised+0.060
work+0.057
 horizon+0.057
Neg logit contributions
isk-0.034
aks-0.030
eh-0.028
smart-0.026
ens-0.026
Token was
Token ID373
Feature activation-4.43e-03

Pos logit contributions
ised+0.001
inated+0.001
ement+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
aid-0.001
Token watching
Token ID4964
Feature activation+1.94e-02

Pos logit contributions
isk+0.008
aks+0.007
arl+0.007
ives+0.007
aid+0.007
Neg logit contributions
inated-0.007
ised-0.007
ement-0.007
work-0.007
el-0.007
Token the
Token ID262
Feature activation+9.03e-02

Pos logit contributions
ised+0.037
ement+0.036
inated+0.036
work+0.034
el+0.034
Neg logit contributions
isk-0.030
aks-0.025
eh-0.024
 germ-0.024
arl-0.024
Token sunset
Token ID26428
Feature activation+2.30e-02

Pos logit contributions
used+0.118
ised+0.117
yle+0.113
ica+0.109
 avail+0.108
Neg logit contributions
 welcome-0.155
 wake-0.149
 banana-0.147
 shady-0.147
 comes-0.144
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
ised+0.043
inated+0.042
ement+0.042
work+0.039
itz+0.039
Neg logit contributions
isk-0.035
aks-0.031
ives-0.029
eh-0.028
arl-0.028
Token he
Token ID339
Feature activation+1.43e-02

Pos logit contributions
ement+0.024
inated+0.024
ised+0.024
work+0.023
el+0.023
Neg logit contributions
isk-0.018
aks-0.015
eh-0.014
aid-0.014
 germ-0.014
Token saw
Token ID2497
Feature activation+7.24e-03

Pos logit contributions
ement+0.035
ised+0.035
inated+0.035
work+0.034
el+0.033
Neg logit contributions
isk-0.015
aks-0.012
eh-0.011
ives-0.010
coat-0.010
Token something
Token ID1223
Feature activation+4.68e-02

Pos logit contributions
ised+0.012
ement+0.012
inated+0.012
work+0.011
el+0.011
Neg logit contributions
isk-0.013
aks-0.012
aid-0.011
eh-0.011
arl-0.011
Token own
Token ID898
Feature activation+8.49e-03

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token way
Token ID835
Feature activation+4.23e-02

Pos logit contributions
ised+0.014
inated+0.014
ement+0.013
work+0.013
el+0.013
Neg logit contributions
isk-0.016
aks-0.014
arl-0.013
ives-0.013
aid-0.013
Token and
Token ID290
Feature activation+9.47e-03

Pos logit contributions
ement+0.077
inated+0.075
ised+0.074
 sunset+0.070
ths+0.069
Neg logit contributions
isk-0.064
aks-0.059
ives-0.056
arl-0.056
 backup-0.054
Token that
Token ID326
Feature activation+3.58e-02

Pos logit contributions
ement+0.017
ised+0.017
inated+0.017
el+0.016
work+0.016
Neg logit contributions
isk-0.016
aks-0.014
arl-0.013
ives-0.013
aid-0.013
Token we
Token ID356
Feature activation+4.18e-02

Pos logit contributions
inated+0.061
ement+0.061
ised+0.060
el+0.056
itz+0.055
Neg logit contributions
isk-0.060
aks-0.055
 comes-0.050
 germ-0.050
smart-0.050
Token don
Token ID836
Feature activation+9.03e-02

Pos logit contributions
ement+0.102
inated+0.099
ised+0.098
 horizon+0.094
itz+0.094
Neg logit contributions
isk-0.045
ives-0.037
aks-0.037
aid-0.033
arl-0.032
Token't
Token ID470
Feature activation+2.56e-02

Pos logit contributions
ishment+0.183
ement+0.181
inated+0.180
 reflection+0.178
work+0.176
Neg logit contributions
isk-0.109
ives-0.100
aks-0.096
arl-0.092
smart-0.091
Token always
Token ID1464
Feature activation+6.73e-03

Pos logit contributions
ement+0.061
ised+0.059
inated+0.058
work+0.056
el+0.056
Neg logit contributions
isk-0.031
aks-0.024
arl-0.023
ives-0.022
aid-0.022
Token need
Token ID761
Feature activation+3.07e-02

Pos logit contributions
ised+0.014
ement+0.014
inated+0.014
work+0.013
el+0.013
Neg logit contributions
isk-0.010
aks-0.008
arl-0.007
ives-0.007
aid-0.007
Token material
Token ID2587
Feature activation+4.23e-03

Pos logit contributions
ement+0.061
inated+0.061
ised+0.060
work+0.058
el+0.056
Neg logit contributions
isk-0.043
aks-0.040
 backup-0.038
ives-0.037
arl-0.036
Token possessions
Token ID23309
Feature activation+1.22e-02

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.006
el+0.006
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
eh-0.006
Token.
Token ID13
Feature activation-6.47e-03

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
work-0.001
el-0.001
Token
Token ID198
Feature activation-9.91e-03

Pos logit contributions
isk+0.013
aks+0.011
eh+0.010
aid+0.010
ives+0.010
Neg logit contributions
ised-0.010
inated-0.010
ement-0.010
el-0.009
 sunset-0.009
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
isk+0.020
aks+0.017
eh+0.017
ives+0.017
coat+0.017
Neg logit contributions
ised-0.015
inated-0.014
ement-0.014
el-0.014
 sunset-0.013
Token"
Token ID1
Feature activation+8.48e-03

Pos logit contributions
isk+0.024
ives+0.019
coat+0.019
aks+0.019
 germ+0.019
Neg logit contributions
ised-0.024
inated-0.023
el-0.023
ement-0.022
 sunset-0.022
TokenIn
Token ID818
Feature activation-3.49e-03

Pos logit contributions
ement+0.013
ised+0.013
inated+0.013
el+0.012
work+0.012
Neg logit contributions
isk-0.017
aks-0.015
arl-0.014
aid-0.014
ives-0.014
Token a
Token ID257
Feature activation-9.12e-02

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.006
aid+0.006
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
el-0.005
work-0.005
Token box
Token ID3091
Feature activation-2.35e-02

Pos logit contributions
isk+0.197
ives+0.186
aid+0.180
ows+0.178
arl+0.163
Neg logit contributions
 top-0.131
 sunset-0.120
 destination-0.103
 comet-0.103
 sight-0.099
Token under
Token ID739
Feature activation-1.11e-03

Pos logit contributions
isk+0.051
aks+0.043
aid+0.043
ives+0.043
arl+0.042
Neg logit contributions
ement-0.033
ised-0.033
el-0.032
work-0.030
inated-0.030
Token the
Token ID262
Feature activation-2.79e-02

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
aid+0.002
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
el-0.002
work-0.002
Token tree
Token ID5509
Feature activation-2.71e-02

Pos logit contributions
isk+0.058
aks+0.048
ives+0.048
aid+0.046
smart+0.046
Neg logit contributions
 sunset-0.044
 horizon-0.044
 comet-0.042
 top-0.042
el-0.041
Token."
Token ID526
Feature activation+1.58e-02

Pos logit contributions
isk+0.059
aid+0.053
arl+0.052
aks+0.051
ives+0.051
Neg logit contributions
ised-0.035
el-0.034
inated-0.034
work-0.033
ement-0.033
Token.
Token ID13
Feature activation-4.25e-03

Pos logit contributions
isk+0.026
aks+0.023
eh+0.022
arl+0.022
ives+0.022
Neg logit contributions
ement-0.013
ised-0.013
el-0.013
inated-0.012
work-0.012
Token
Token ID198
Feature activation-4.13e-03

Pos logit contributions
isk+0.008
aks+0.007
eh+0.007
arl+0.007
ives+0.007
Neg logit contributions
inated-0.007
ised-0.007
ement-0.007
work-0.006
el-0.006
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
isk+0.008
aks+0.007
ives+0.007
arl+0.007
eh+0.006
Neg logit contributions
ised-0.007
inated-0.006
ement-0.006
el-0.006
work-0.006
TokenBen
Token ID11696
Feature activation-2.82e-03

Pos logit contributions
isk+0.020
aks+0.016
ives+0.016
 germ+0.016
arl+0.016
Neg logit contributions
ised-0.023
inated-0.022
el-0.022
ement-0.022
 sunset-0.021
Token takes
Token ID2753
Feature activation-3.20e-02

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
aid+0.004
eh+0.004
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
work-0.005
el-0.005
Token a
Token ID257
Feature activation-9.03e-02

Pos logit contributions
isk+0.075
arl+0.066
coat+0.064
ives+0.061
aid+0.060
Neg logit contributions
el-0.041
 top-0.038
 sunset-0.037
ised-0.036
ement-0.036
Token bite
Token ID13197
Feature activation-1.87e-02

Pos logit contributions
isk+0.178
 villains+0.141
aks+0.129
ives+0.127
and+0.123
Neg logit contributions
 top-0.146
 int-0.142
 sunset-0.139
el-0.136
 curiously-0.136
Token of
Token ID286
Feature activation-2.24e-02

Pos logit contributions
isk+0.036
eh+0.028
arl+0.028
aks+0.028
coat+0.027
Neg logit contributions
el-0.033
ement-0.032
ised-0.031
work-0.030
 sunset-0.030
Token the
Token ID262
Feature activation-5.27e-02

Pos logit contributions
isk+0.043
ives+0.037
aks+0.037
arl+0.037
coat+0.036
Neg logit contributions
el-0.035
ement-0.035
 sunset-0.035
ised-0.034
inated-0.034
Token banana
Token ID25996
Feature activation+7.94e-04

Pos logit contributions
isk+0.077
aks+0.062
smart+0.060
arl+0.059
ives+0.059
Neg logit contributions
 sunset-0.108
 top-0.106
ement-0.103
el-0.102
 reflection-0.098
Token.
Token ID13
Feature activation-3.65e-02

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.002
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token them
Token ID606
Feature activation-4.83e-03

Pos logit contributions
isk+0.032
eh+0.028
aks+0.028
coat+0.028
ives+0.027
Neg logit contributions
ised-0.020
ement-0.020
el-0.020
inated-0.019
 sunset-0.019
Token.
Token ID13
Feature activation-4.04e-02

Pos logit contributions
isk+0.010
aks+0.009
arl+0.008
eh+0.008
ives+0.008
Neg logit contributions
ised-0.007
inated-0.007
ement-0.007
el-0.007
work-0.006
Token He
Token ID679
Feature activation-4.68e-02

Pos logit contributions
isk+0.083
eh+0.080
coat+0.069
aks+0.069
arl+0.068
Neg logit contributions
el-0.057
ement-0.056
inated-0.056
ised-0.054
work-0.053
Token lives
Token ID3160
Feature activation-4.28e-02

Pos logit contributions
isk+0.060
aid+0.056
and+0.056
coat+0.055
eh+0.052
Neg logit contributions
el-0.095
ement-0.092
 curiously-0.090
inated-0.087
inct-0.087
Token in
Token ID287
Feature activation-1.90e-02

Pos logit contributions
isk+0.115
ives+0.103
 bisc+0.102
coat+0.102
 robber+0.100
Neg logit contributions
ised-0.036
 curiously-0.033
ized-0.032
work-0.031
ement-0.031
Token a
Token ID257
Feature activation-8.93e-02

Pos logit contributions
isk+0.042
ives+0.038
eh+0.037
coat+0.037
arl+0.037
Neg logit contributions
el-0.024
ised-0.024
 sunset-0.023
ement-0.022
work-0.022
Token house
Token ID2156
Feature activation-5.05e-02

Pos logit contributions
isk+0.218
ives+0.217
ows+0.196
coat+0.187
aid+0.187
Neg logit contributions
 sunset-0.112
 top-0.112
 comet-0.099
 horizon-0.092
 sight-0.089
Token near
Token ID1474
Feature activation-9.73e-03

Pos logit contributions
isk+0.135
aid+0.122
arl+0.120
ives+0.120
eh+0.119
Neg logit contributions
ement-0.042
ised-0.042
el-0.039
work-0.038
ized-0.036
Token the
Token ID262
Feature activation-3.87e-03

Pos logit contributions
isk+0.020
ives+0.017
eh+0.017
arl+0.017
aid+0.017
Neg logit contributions
ised-0.014
el-0.014
ement-0.014
inated-0.013
work-0.013
Token woods
Token ID16479
Feature activation-2.68e-02

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
aid+0.006
arl+0.006
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
 sunset-0.006
work-0.006
Token.
Token ID13
Feature activation-4.59e-02

Pos logit contributions
isk+0.067
aid+0.061
arl+0.060
eh+0.059
coat+0.059
Neg logit contributions
ement-0.027
ised-0.027
work-0.025
inated-0.025
el-0.024
Token She
Token ID1375
Feature activation-3.92e-02

Pos logit contributions
isk+0.059
eh+0.054
aks+0.052
aid+0.051
ives+0.048
Neg logit contributions
inated-0.038
ised-0.035
ement-0.035
 sunset-0.034
el-0.033
Token turned
Token ID2900
Feature activation-1.53e-02

Pos logit contributions
arl+0.076
isk+0.073
ives+0.068
aks+0.066
and+0.064
Neg logit contributions
ised-0.054
el-0.053
 curiously-0.052
 suddenly-0.051
inated-0.050
Token on
Token ID319
Feature activation-1.46e-02

Pos logit contributions
isk+0.036
eh+0.029
aks+0.028
arl+0.028
ives+0.028
Neg logit contributions
ised-0.021
el-0.020
ement-0.020
 sunset-0.019
inated-0.019
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
isk+0.029
aks+0.025
eh+0.025
ives+0.025
arl+0.024
Neg logit contributions
ement-0.022
ised-0.022
el-0.022
inated-0.022
 sunset-0.021
Token f
Token ID277
Feature activation-5.03e-02

Pos logit contributions
isk+0.032
aks+0.026
eh+0.024
ives+0.024
aid+0.024
Neg logit contributions
ement-0.030
ised-0.030
inated-0.029
 horizon-0.029
work-0.029
Tokenauc
Token ID14272
Feature activation-8.88e-02

Pos logit contributions
isk+0.079
 fetch+0.066
 robber+0.061
 shady+0.061
 robbers+0.060
Neg logit contributions
ement-0.103
work-0.094
el-0.092
ised-0.090
inated-0.083
Tokenet
Token ID316
Feature activation-2.53e-02

Pos logit contributions
 backup+0.193
isk+0.180
 shady+0.180
arl+0.177
 liar+0.176
Neg logit contributions
el-0.084
ement-0.081
work-0.080
ised-0.076
inated-0.068
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
isk+0.053
aks+0.048
eh+0.047
arl+0.046
aid+0.045
Neg logit contributions
work-0.035
ement-0.034
el-0.034
ised-0.033
inated-0.033
Token filled
Token ID5901
Feature activation-1.30e-02

Pos logit contributions
isk+0.054
arl+0.048
ives+0.046
aks+0.045
eh+0.043
Neg logit contributions
ised-0.043
work-0.041
inated-0.041
el-0.040
 sunset-0.039
Token the
Token ID262
Feature activation-2.04e-02

Pos logit contributions
isk+0.027
ives+0.024
arl+0.024
aks+0.023
eh+0.023
Neg logit contributions
ised-0.018
ement-0.018
inated-0.018
el-0.018
work-0.017
Token sink
Token ID14595
Feature activation-2.88e-02

Pos logit contributions
isk+0.037
aks+0.028
aid+0.028
smart+0.028
ives+0.028
Neg logit contributions
ised-0.036
 sunset-0.036
el-0.036
ement-0.036
inated-0.036
Token a
Token ID257
Feature activation-6.19e-02

Pos logit contributions
isk+0.118
arl+0.112
ives+0.110
ows+0.104
and+0.102
Neg logit contributions
el-0.044
 sunset-0.038
 once-0.036
ised-0.036
 top-0.034
Token phone
Token ID3072
Feature activation-2.27e-02

Pos logit contributions
isk+0.132
ives+0.119
and+0.117
aks+0.109
aked+0.103
Neg logit contributions
 comet-0.087
 sunset-0.085
 top-0.083
 reflection-0.080
 curiously-0.077
Token.
Token ID13
Feature activation-4.42e-02

Pos logit contributions
isk+0.047
aks+0.042
arl+0.040
eh+0.040
ives+0.039
Neg logit contributions
inated-0.032
work-0.032
ement-0.031
ised-0.031
el-0.030
Token He
Token ID679
Feature activation-2.08e-02

Pos logit contributions
isk+0.094
coat+0.081
eh+0.078
aks+0.077
akers+0.073
Neg logit contributions
el-0.062
ement-0.060
inated-0.059
ised-0.057
 curiously-0.056
Token calls
Token ID3848
Feature activation-7.62e-02

Pos logit contributions
isk+0.032
arl+0.027
aks+0.027
aid+0.026
coat+0.026
Neg logit contributions
ement-0.039
el-0.039
inated-0.039
ised-0.038
work-0.037
Token a
Token ID257
Feature activation-8.83e-02

Pos logit contributions
eh+0.176
isk+0.176
aid+0.170
coat+0.168
aks+0.167
Neg logit contributions
 curiously-0.071
el-0.070
 top-0.069
 mummy-0.066
 int-0.064
Token doctor
Token ID6253
Feature activation-4.42e-02

Pos logit contributions
isk+0.185
ives+0.152
aks+0.152
aked+0.150
atters+0.149
Neg logit contributions
 comet-0.124
 top-0.118
 sunset-0.118
 reflection-0.111
 mother-0.107
Token.
Token ID13
Feature activation-4.40e-02

Pos logit contributions
isk+0.106
eh+0.097
aks+0.097
arl+0.097
aid+0.096
Neg logit contributions
work-0.047
ement-0.039
el-0.038
inated-0.037
 int-0.034
Token He
Token ID679
Feature activation-1.95e-02

Pos logit contributions
isk+0.096
coat+0.084
aks+0.083
eh+0.082
ives+0.077
Neg logit contributions
el-0.055
inated-0.054
ement-0.054
ised-0.053
 sunset-0.052
Token also
Token ID635
Feature activation-4.13e-02

Pos logit contributions
isk+0.030
aks+0.025
arl+0.025
coat+0.024
eh+0.024
Neg logit contributions
ement-0.037
inated-0.036
el-0.036
ised-0.036
work-0.035
Token calls
Token ID3848
Feature activation-7.74e-02

Pos logit contributions
isk+0.073
coat+0.066
and+0.066
arl+0.063
eh+0.061
Neg logit contributions
el-0.064
work-0.062
inated-0.061
 curiously-0.060
ised-0.060
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
isk+0.012
aks+0.010
eh+0.010
aid+0.010
arl+0.010
Neg logit contributions
inated-0.011
ised-0.011
ement-0.011
el-0.010
 sunset-0.010
Token
Token ID198
Feature activation-1.43e-02

Pos logit contributions
isk+0.025
aks+0.021
eh+0.021
coat+0.021
ives+0.020
Neg logit contributions
ised-0.017
inated-0.017
ement-0.017
el-0.016
 sunset-0.016
TokenA
Token ID32
Feature activation-4.34e-02

Pos logit contributions
isk+0.025
aks+0.019
coat+0.019
 germ+0.019
ives+0.019
Neg logit contributions
ised-0.026
inated-0.025
el-0.024
ement-0.024
ble-0.024
Token man
Token ID582
Feature activation-5.01e-02

Pos logit contributions
isk+0.095
ives+0.075
aks+0.073
and+0.071
osc+0.070
Neg logit contributions
 sunset-0.068
 comet-0.067
 brightly-0.062
ised-0.061
 top-0.060
Token in
Token ID287
Feature activation-2.31e-02

Pos logit contributions
isk+0.071
coat+0.060
arl+0.060
aid+0.054
eh+0.053
Neg logit contributions
el-0.110
ement-0.104
used-0.103
 curiously-0.102
work-0.100
Token a
Token ID257
Feature activation-8.82e-02

Pos logit contributions
isk+0.052
ives+0.046
coat+0.046
eh+0.046
aid+0.045
Neg logit contributions
el-0.031
ised-0.029
ement-0.028
inated-0.027
 sunset-0.027
Token blue
Token ID4171
Feature activation-5.03e-02

Pos logit contributions
isk+0.216
ives+0.198
coat+0.182
ows+0.181
aid+0.180
Neg logit contributions
 top-0.118
 sunset-0.108
 comet-0.105
 sight-0.095
 horizon-0.093
Token uniform
Token ID8187
Feature activation-1.27e-02

Pos logit contributions
isk+0.103
ives+0.091
aid+0.090
eh+0.088
arl+0.087
Neg logit contributions
el-0.083
 sunset-0.070
 top-0.067
ement-0.067
 comet-0.065
Token runs
Token ID4539
Feature activation-3.47e-02

Pos logit contributions
isk+0.022
aks+0.018
arl+0.018
eh+0.017
aid+0.017
Neg logit contributions
ement-0.023
inated-0.023
ised-0.022
work-0.022
el-0.022
Token to
Token ID284
Feature activation-3.04e-02

Pos logit contributions
isk+0.074
aid+0.065
aks+0.064
eh+0.063
 germ+0.060
Neg logit contributions
work-0.045
el-0.044
inated-0.044
 int-0.042
ised-0.042
Token the
Token ID262
Feature activation-2.52e-02

Pos logit contributions
isk+0.062
aks+0.060
ives+0.060
eh+0.058
coat+0.058
Neg logit contributions
 curiously-0.039
el-0.039
 sunset-0.038
 top-0.038
inated-0.036
Token loved
Token ID6151
Feature activation-3.21e-02

Pos logit contributions
ised+0.002
inated+0.002
ement+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
aid-0.002
Token to
Token ID284
Feature activation-4.35e-02

Pos logit contributions
isk+0.059
aks+0.058
ives+0.055
arl+0.054
tted+0.052
Neg logit contributions
 sunset-0.051
inated-0.047
el-0.046
work-0.045
ised-0.044
Token play
Token ID711
Feature activation-4.15e-02

Pos logit contributions
isk+0.072
aks+0.070
ives+0.070
arl+0.067
and+0.066
Neg logit contributions
inated-0.073
 sunset-0.072
el-0.068
 comet-0.068
 top-0.066
Token with
Token ID351
Feature activation-2.42e-02

Pos logit contributions
isk+0.095
arl+0.091
ives+0.086
aks+0.085
rost+0.085
Neg logit contributions
ised-0.046
 top-0.039
inated-0.039
izing-0.038
 curiously-0.038
Token Wh
Token ID854
Feature activation-4.92e-02

Pos logit contributions
isk+0.035
aks+0.033
ives+0.032
arl+0.032
aid+0.030
Neg logit contributions
el-0.047
ised-0.046
inated-0.045
 sunset-0.043
ement-0.043
Tokenisk
Token ID1984
Feature activation-8.81e-02

Pos logit contributions
aid+0.018
 firefighter+0.017
 backup+0.016
aks+0.015
 germ+0.013
Neg logit contributions
el-0.141
ement-0.133
ised-0.129
itz-0.128
work-0.127
Tokeners
Token ID364
Feature activation-1.68e-02

Pos logit contributions
arl+0.126
 bisc+0.114
 germ+0.107
 shady+0.104
 lick+0.099
Neg logit contributions
work-0.155
ised-0.147
ized-0.146
izing-0.144
ble-0.141
Token,
Token ID11
Feature activation-5.05e-02

Pos logit contributions
isk+0.035
aks+0.032
arl+0.032
aid+0.031
eh+0.030
Neg logit contributions
ised-0.022
ement-0.022
work-0.022
inated-0.021
el-0.020
Token but
Token ID475
Feature activation-2.34e-02

Pos logit contributions
aks+0.119
arl+0.112
ives+0.111
aked+0.108
aid+0.105
Neg logit contributions
 mostly-0.051
 suddenly-0.051
 curiously-0.050
 sunset-0.050
 just-0.049
Token one
Token ID530
Feature activation-3.29e-02

Pos logit contributions
ives+0.040
isk+0.039
aks+0.039
arl+0.037
eh+0.034
Neg logit contributions
ised-0.039
 sunset-0.038
el-0.037
inated-0.037
work-0.036
Token day
Token ID1110
Feature activation-2.36e-02

Pos logit contributions
isk+0.081
arl+0.078
ives+0.074
 backup+0.074
aks+0.073
Neg logit contributions
el-0.031
 sunset-0.030
work-0.029
 once-0.029
 comet-0.028
Token He
Token ID679
Feature activation-2.30e-02

Pos logit contributions
isk+0.070
eh+0.061
aks+0.059
coat+0.054
akers+0.054
Neg logit contributions
inated-0.053
ement-0.050
 sunset-0.050
ised-0.049
 top-0.048
Token drank
Token ID24070
Feature activation-1.93e-02

Pos logit contributions
isk+0.041
arl+0.038
ives+0.035
aks+0.035
aid+0.032
Neg logit contributions
ised-0.039
inated-0.038
el-0.037
ement-0.037
 curiously-0.036
Token from
Token ID422
Feature activation+8.87e-03

Pos logit contributions
isk+0.042
aks+0.035
eh+0.034
ives+0.034
arl+0.034
Neg logit contributions
ised-0.027
 sunset-0.027
ement-0.027
el-0.027
inated-0.026
Token the
Token ID262
Feature activation-1.63e-02

Pos logit contributions
ised+0.017
ement+0.017
inated+0.017
work+0.017
el+0.016
Neg logit contributions
isk-0.014
aks-0.011
eh-0.011
arl-0.011
aid-0.010
Token f
Token ID277
Feature activation-4.12e-02

Pos logit contributions
isk+0.030
aks+0.024
ives+0.023
smart+0.022
aid+0.022
Neg logit contributions
ement-0.028
 sunset-0.028
ised-0.028
inated-0.028
work-0.028
Tokenauc
Token ID14272
Feature activation-8.81e-02

Pos logit contributions
isk+0.061
 robber+0.052
 fetch+0.051
 robbers+0.051
 firefighter+0.045
Neg logit contributions
ement-0.089
el-0.085
ised-0.082
work-0.081
inated-0.076
Tokenet
Token ID316
Feature activation-4.61e-03

Pos logit contributions
 backup+0.200
 robbers+0.188
 robber+0.186
and+0.185
 charity+0.183
Neg logit contributions
el-0.088
ement-0.074
work-0.070
ised-0.070
inated-0.060
Token.
Token ID13
Feature activation-3.33e-02

Pos logit contributions
isk+0.010
aks+0.009
arl+0.008
ives+0.008
eh+0.008
Neg logit contributions
ised-0.006
ement-0.006
inated-0.006
work-0.006
el-0.006
Token But
Token ID887
Feature activation-9.95e-03

Pos logit contributions
isk+0.069
eh+0.061
aks+0.058
coat+0.054
cod+0.053
Neg logit contributions
inated-0.051
 sunset-0.048
ement-0.048
ised-0.047
 top-0.047
Token he
Token ID339
Feature activation-1.43e-02

Pos logit contributions
isk+0.021
aks+0.018
ives+0.018
arl+0.018
coat+0.017
Neg logit contributions
ised-0.014
ement-0.014
inated-0.014
el-0.014
 sunset-0.013
Token still
Token ID991
Feature activation-1.38e-02

Pos logit contributions
isk+0.027
arl+0.024
aks+0.024
ives+0.023
aid+0.022
Neg logit contributions
ised-0.022
ement-0.022
inated-0.022
el-0.022
work-0.021
Token
Token ID198
Feature activation-6.46e-04

Pos logit contributions
ement+0.033
ised+0.032
inated+0.031
work+0.031
el+0.031
Neg logit contributions
isk-0.024
aks-0.021
arl-0.020
ives-0.019
aid-0.018
Token
Token ID198
Feature activation-4.66e-03

Pos logit contributions
isk+0.001
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
TokenA
Token ID32
Feature activation-4.35e-02

Pos logit contributions
isk+0.009
ives+0.008
aks+0.008
arl+0.007
coat+0.007
Neg logit contributions
ised-0.007
inated-0.007
ement-0.007
el-0.007
work-0.007
Token man
Token ID582
Feature activation-5.04e-02

Pos logit contributions
isk+0.093
ives+0.077
atters+0.070
aks+0.070
ens+0.069
Neg logit contributions
 sunset-0.068
 comet-0.067
ised-0.066
 brightly-0.063
work-0.062
Token in
Token ID287
Feature activation-2.09e-02

Pos logit contributions
isk+0.092
arl+0.091
coat+0.077
ives+0.074
inters+0.074
Neg logit contributions
el-0.087
ement-0.085
 suddenly-0.083
 curiously-0.083
work-0.080
Token a
Token ID257
Feature activation-8.80e-02

Pos logit contributions
isk+0.046
ives+0.041
eh+0.040
coat+0.040
aid+0.040
Neg logit contributions
el-0.028
ised-0.028
ement-0.026
inated-0.025
 sunset-0.025
Token boat
Token ID8848
Feature activation-3.89e-02

Pos logit contributions
isk+0.220
ives+0.204
ows+0.189
ens+0.184
coat+0.183
Neg logit contributions
 top-0.116
 sunset-0.105
 comet-0.097
 horizon-0.091
 brightly-0.090
Token saw
Token ID2497
Feature activation-2.43e-02

Pos logit contributions
isk+0.068
arl+0.059
ives+0.056
aid+0.055
 germ+0.055
Neg logit contributions
work-0.068
ement-0.066
ised-0.065
el-0.065
inated-0.064
Token them
Token ID606
Feature activation+2.29e-02

Pos logit contributions
isk+0.056
coat+0.052
eh+0.049
aks+0.049
aid+0.048
Neg logit contributions
el-0.029
ement-0.029
ised-0.029
work-0.026
inated-0.026
Token and
Token ID290
Feature activation-3.57e-02

Pos logit contributions
ised+0.042
ement+0.040
inated+0.040
work+0.038
ths+0.037
Neg logit contributions
isk-0.038
aks-0.034
arl-0.031
ives-0.031
smart-0.030
Token came
Token ID1625
Feature activation-1.71e-02

Pos logit contributions
isk+0.073
arl+0.072
ives+0.068
aks+0.064
coat+0.063
Neg logit contributions
ised-0.049
 suddenly-0.049
work-0.048
 curiously-0.048
inated-0.048
Token shells
Token ID19679
Feature activation+2.99e-03

Pos logit contributions
isk+0.010
aks+0.008
ives+0.008
arl+0.008
aid+0.008
Neg logit contributions
ised-0.009
ement-0.009
inated-0.009
work-0.009
el-0.009
Token.
Token ID13
Feature activation-6.82e-04

Pos logit contributions
ised+0.004
inated+0.004
ement+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token They
Token ID1119
Feature activation-2.39e-03

Pos logit contributions
isk+0.001
aks+0.001
eh+0.001
arl+0.001
ives+0.001
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
Token hear
Token ID3285
Feature activation+4.15e-03

Pos logit contributions
isk+0.003
aks+0.003
arl+0.002
eh+0.002
ives+0.002
Neg logit contributions
ised-0.005
inated-0.005
ement-0.005
work-0.005
el-0.005
Token the
Token ID262
Feature activation+7.65e-03

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.007
el+0.006
Neg logit contributions
isk-0.008
aks-0.007
arl-0.006
ives-0.006
aid-0.006
Token se
Token ID384
Feature activation-8.79e-02

Pos logit contributions
ised+0.013
inated+0.013
ement+0.013
work+0.012
el+0.012
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.011
eh-0.011
Tokenag
Token ID363
Feature activation-6.80e-03

Pos logit contributions
 robbers+0.136
 shady+0.131
isk+0.125
 backup+0.121
 mats+0.120
Neg logit contributions
ement-0.145
el-0.137
 int-0.132
work-0.132
ised-0.128
Tokenull
Token ID724
Feature activation-1.09e-02

Pos logit contributions
isk+0.014
aks+0.012
ives+0.012
arl+0.012
aid+0.012
Neg logit contributions
ised-0.010
inated-0.010
ement-0.010
work-0.010
el-0.010
Tokens
Token ID82
Feature activation+5.97e-03

Pos logit contributions
isk+0.023
aks+0.020
eh+0.019
arl+0.019
ives+0.019
Neg logit contributions
inated-0.015
ement-0.015
ised-0.015
el-0.015
work-0.015
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
ised+0.010
ement+0.010
inated+0.010
work+0.009
el+0.009
Neg logit contributions
isk-0.011
aks-0.009
arl-0.009
ives-0.009
aid-0.009
Token the
Token ID262
Feature activation-1.77e-02

Pos logit contributions
isk+0.027
aks+0.023
arl+0.022
ives+0.022
eh+0.022
Neg logit contributions
inated-0.027
ised-0.026
 sunset-0.026
work-0.026
el-0.025
Token He
Token ID679
Feature activation+1.96e-03

Pos logit contributions
isk+0.027
aks+0.022
eh+0.022
ives+0.022
coat+0.021
Neg logit contributions
inated-0.027
ement-0.027
ised-0.027
 sunset-0.025
work-0.025
Token said
Token ID531
Feature activation-3.59e-02

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.004
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token he
Token ID339
Feature activation-8.82e-03

Pos logit contributions
isk+0.083
aks+0.074
eh+0.069
ives+0.069
coat+0.069
Neg logit contributions
ement-0.045
 sunset-0.045
ised-0.043
work-0.043
inated-0.042
Token was
Token ID373
Feature activation-1.17e-02

Pos logit contributions
isk+0.016
arl+0.015
aks+0.015
eh+0.014
ives+0.014
Neg logit contributions
ement-0.014
ised-0.014
inated-0.014
work-0.013
el-0.013
Token in
Token ID287
Feature activation-2.99e-02

Pos logit contributions
isk+0.020
ives+0.017
aks+0.016
arl+0.016
coat+0.016
Neg logit contributions
ement-0.021
inated-0.021
ised-0.021
work-0.020
el-0.020
Token a
Token ID257
Feature activation-8.77e-02

Pos logit contributions
isk+0.065
ives+0.057
eh+0.056
coat+0.056
aid+0.055
Neg logit contributions
el-0.038
ised-0.038
 sunset-0.038
ement-0.036
 comet-0.035
Token hurry
Token ID23290
Feature activation-3.48e-02

Pos logit contributions
isk+0.183
ives+0.179
ows+0.164
aid+0.154
ens+0.148
Neg logit contributions
 top-0.125
 sunset-0.123
 comet-0.111
 sight-0.109
 destination-0.108
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
isk+0.079
eh+0.071
aks+0.071
aid+0.070
ives+0.069
Neg logit contributions
work-0.042
ised-0.041
ement-0.039
 horizon-0.038
inated-0.037
Token He
Token ID679
Feature activation-1.13e-03

Pos logit contributions
isk+0.029
aks+0.025
eh+0.024
ives+0.024
coat+0.023
Neg logit contributions
inated-0.027
ement-0.027
ised-0.027
 sunset-0.025
el-0.025
Token had
Token ID550
Feature activation-2.31e-02

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.002
ement-0.002
inated-0.002
work-0.002
el-0.002
Token to
Token ID284
Feature activation-3.34e-02

Pos logit contributions
isk+0.049
arl+0.044
ives+0.041
aks+0.041
aid+0.040
Neg logit contributions
ised-0.031
inated-0.030
 sunset-0.030
el-0.029
work-0.028
Token fun
Token ID1257
Feature activation+2.41e-02

Pos logit contributions
ised+0.002
ement+0.002
inated+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token making
Token ID1642
Feature activation+2.47e-03

Pos logit contributions
inated+0.042
ement+0.041
ised+0.040
 horizon+0.039
el+0.039
Neg logit contributions
isk-0.040
aks-0.038
eh-0.034
ives-0.033
arl-0.033
Token mud
Token ID17492
Feature activation-4.65e-02

Pos logit contributions
ised+0.004
ement+0.004
inated+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token pies
Token ID47199
Feature activation-3.50e-02

Pos logit contributions
isk+0.073
aks+0.064
arl+0.064
 backup+0.063
aid+0.061
Neg logit contributions
work-0.085
ised-0.084
el-0.082
inated-0.080
ble-0.080
Token,
Token ID11
Feature activation-4.10e-02

Pos logit contributions
isk+0.077
 germ+0.067
aid+0.066
arl+0.066
aks+0.066
Neg logit contributions
ised-0.044
work-0.044
ement-0.043
inated-0.042
 sunset-0.041
Token cars
Token ID5006
Feature activation-8.74e-02

Pos logit contributions
isk+0.084
arl+0.080
ives+0.078
aks+0.077
aid+0.074
Neg logit contributions
 sunset-0.055
ised-0.054
work-0.053
inated-0.052
el-0.051
Token,
Token ID11
Feature activation-5.10e-02

Pos logit contributions
 germ+0.218
isk+0.214
mare+0.204
arl+0.202
aid+0.198
Neg logit contributions
el-0.057
ised-0.056
 top-0.054
work-0.054
ement-0.046
Token and
Token ID290
Feature activation-5.56e-02

Pos logit contributions
isk+0.108
arl+0.100
ives+0.100
aks+0.097
aid+0.093
Neg logit contributions
el-0.065
 sunset-0.064
 top-0.063
work-0.062
ised-0.062
Token castles
Token ID49203
Feature activation-4.81e-02

Pos logit contributions
isk+0.104
arl+0.098
ives+0.094
aks+0.089
eh+0.086
Neg logit contributions
 sunset-0.091
 diamonds-0.085
work-0.085
ised-0.082
el-0.081
Token.
Token ID13
Feature activation+9.07e-03

Pos logit contributions
isk+0.109
aid+0.098
arl+0.098
 germ+0.096
eh+0.095
Neg logit contributions
work-0.055
ement-0.054
ised-0.052
 sunset-0.050
 top-0.049
Token But
Token ID887
Feature activation+4.55e-03

Pos logit contributions
ised+0.017
ement+0.017
inated+0.017
work+0.016
el+0.016
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.011
eh-0.011
Token talks
Token ID6130
Feature activation-5.03e-02

Pos logit contributions
isk+0.040
aks+0.035
arl+0.035
aid+0.035
and+0.034
Neg logit contributions
inated-0.051
ement-0.049
el-0.049
ised-0.049
 curiously-0.048
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
isk+0.114
eh+0.109
aks+0.103
aster+0.098
cod+0.095
Neg logit contributions
 once-0.054
 brightly-0.054
work-0.054
 int-0.054
 curiously-0.053
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
isk+0.032
aks+0.030
ives+0.030
arl+0.030
aid+0.029
Neg logit contributions
ised-0.022
el-0.021
inated-0.021
ement-0.021
 sunset-0.021
Token dog
Token ID3290
Feature activation-2.59e-02

Pos logit contributions
isk+0.039
aks+0.032
eh+0.031
ives+0.030
aid+0.030
Neg logit contributions
 sunset-0.041
ised-0.041
inated-0.040
 comet-0.039
ement-0.039
Token in
Token ID287
Feature activation-3.45e-02

Pos logit contributions
isk+0.060
aks+0.057
arl+0.056
eh+0.055
aid+0.055
Neg logit contributions
inated-0.028
ement-0.026
el-0.026
work-0.026
 curiously-0.025
Token a
Token ID257
Feature activation-8.68e-02

Pos logit contributions
isk+0.076
ives+0.071
eh+0.069
coat+0.068
aid+0.067
Neg logit contributions
el-0.048
ised-0.041
 top-0.040
 sunset-0.039
ement-0.039
Token soft
Token ID2705
Feature activation-6.80e-02

Pos logit contributions
isk+0.204
ives+0.180
rost+0.160
aid+0.157
and+0.155
Neg logit contributions
 top-0.113
 sunset-0.102
el-0.098
 comet-0.093
 jet-0.092
Token voice
Token ID3809
Feature activation-1.58e-02

Pos logit contributions
ives+0.155
isk+0.150
aid+0.137
 backup+0.135
oke+0.135
Neg logit contributions
el-0.095
ised-0.081
inated-0.080
 sunset-0.080
 top-0.075
Token.
Token ID13
Feature activation-2.71e-02

Pos logit contributions
isk+0.039
aks+0.033
eh+0.033
arl+0.033
ives+0.032
Neg logit contributions
ement-0.018
inated-0.017
work-0.017
ised-0.017
el-0.016
Token He
Token ID679
Feature activation+8.40e-03

Pos logit contributions
isk+0.057
aks+0.049
eh+0.049
coat+0.049
aid+0.047
Neg logit contributions
inated-0.039
 sunset-0.038
ement-0.038
ised-0.037
 int-0.036
Token gives
Token ID3607
Feature activation-1.78e-02

Pos logit contributions
ised+0.018
ement+0.017
inated+0.017
work+0.017
el+0.016
Neg logit contributions
isk-0.012
aks-0.010
ives-0.009
arl-0.009
eh-0.009
Tokenopard
Token ID15478
Feature activation-2.22e-02

Pos logit contributions
isk+0.034
aks+0.030
arl+0.027
smart+0.027
eh+0.027
Neg logit contributions
inated-0.035
el-0.035
ement-0.034
ised-0.034
work-0.034
Token.
Token ID13
Feature activation-2.84e-02

Pos logit contributions
isk+0.050
arl+0.045
aks+0.045
aid+0.044
ives+0.043
Neg logit contributions
inated-0.026
work-0.025
ised-0.025
ement-0.025
el-0.025
Token Tim
Token ID5045
Feature activation-3.40e-02

Pos logit contributions
isk+0.064
coat+0.055
aks+0.054
eh+0.054
aid+0.054
Neg logit contributions
inated-0.038
ised-0.035
ement-0.035
el-0.034
 curiously-0.034
Token runs
Token ID4539
Feature activation-5.02e-02

Pos logit contributions
isk+0.065
arl+0.058
eh+0.056
aks+0.055
aid+0.055
Neg logit contributions
el-0.052
inated-0.048
 curiously-0.048
ement-0.047
 top-0.046
Token to
Token ID284
Feature activation-2.88e-02

Pos logit contributions
isk+0.116
aks+0.105
eh+0.102
 germ+0.099
aid+0.098
Neg logit contributions
 top-0.056
 once-0.050
 curiously-0.050
 suddenly-0.048
 closely-0.048
Token a
Token ID257
Feature activation-8.66e-02

Pos logit contributions
isk+0.057
ives+0.056
aks+0.055
coat+0.053
arl+0.053
Neg logit contributions
 curiously-0.041
 top-0.039
el-0.039
 sunset-0.038
inated-0.037
Token man
Token ID582
Feature activation-4.85e-02

Pos logit contributions
isk+0.190
ives+0.184
aked+0.169
aid+0.162
aks+0.159
Neg logit contributions
 top-0.121
 sunset-0.117
 comet-0.110
 reflection-0.105
 horizon-0.105
Token who
Token ID508
Feature activation-5.64e-02

Pos logit contributions
isk+0.112
arl+0.108
 germ+0.100
 robber+0.099
coat+0.098
Neg logit contributions
el-0.054
work-0.050
 curiously-0.050
inated-0.048
 closely-0.047
Token works
Token ID2499
Feature activation-1.40e-02

Pos logit contributions
isk+0.091
arl+0.087
and+0.087
coat+0.083
aid+0.079
Neg logit contributions
el-0.098
ement-0.090
work-0.089
 comet-0.087
 curiously-0.086
Token in
Token ID287
Feature activation-3.10e-02

Pos logit contributions
isk+0.032
ives+0.026
 germ+0.026
arl+0.026
coat+0.026
Neg logit contributions
ised-0.019
ement-0.018
el-0.017
inated-0.016
work-0.016
Token the
Token ID262
Feature activation-4.34e-02

Pos logit contributions
isk+0.069
ives+0.066
arl+0.064
coat+0.062
eh+0.061
Neg logit contributions
el-0.039
 sunset-0.035
ised-0.035
 top-0.033
ement-0.033
Token.
Token ID13
Feature activation-3.14e-02

Pos logit contributions
isk+0.041
aks+0.036
arl+0.036
 germ+0.036
aid+0.035
Neg logit contributions
inated-0.024
el-0.024
ised-0.024
ement-0.023
work-0.022
Token He
Token ID679
Feature activation-1.71e-02

Pos logit contributions
isk+0.063
eh+0.055
aks+0.053
coat+0.052
aid+0.051
Neg logit contributions
ised-0.048
ement-0.046
inated-0.046
 sunset-0.046
 top-0.045
Token takes
Token ID2753
Feature activation-4.55e-02

Pos logit contributions
isk+0.026
aks+0.022
arl+0.022
aid+0.021
coat+0.020
Neg logit contributions
inated-0.033
ement-0.032
ised-0.032
el-0.032
work-0.031
Token them
Token ID606
Feature activation-1.89e-02

Pos logit contributions
isk+0.112
arl+0.102
coat+0.100
eh+0.098
ives+0.096
Neg logit contributions
el-0.052
 top-0.044
ised-0.043
 sunset-0.042
work-0.038
Token to
Token ID284
Feature activation-4.81e-02

Pos logit contributions
isk+0.038
aks+0.033
eh+0.033
arl+0.033
aid+0.032
Neg logit contributions
inated-0.026
ised-0.026
el-0.026
work-0.025
ement-0.025
Token a
Token ID257
Feature activation-8.61e-02

Pos logit contributions
ives+0.115
isk+0.108
coat+0.107
aks+0.107
eh+0.106
Neg logit contributions
 sunset-0.050
 top-0.046
 curiously-0.043
el-0.042
 avail-0.040
Token small
Token ID1402
Feature activation-5.28e-02

Pos logit contributions
ives+0.185
isk+0.182
aid+0.167
aked+0.164
ows+0.160
Neg logit contributions
 top-0.124
 sunset-0.117
 comet-0.105
 horizon-0.098
 destination-0.098
Token room
Token ID2119
Feature activation-4.65e-02

Pos logit contributions
isk+0.112
ives+0.103
aid+0.100
arl+0.098
aks+0.093
Neg logit contributions
 sunset-0.074
el-0.073
 top-0.073
ised-0.071
 horizon-0.070
Token.
Token ID13
Feature activation-3.66e-02

Pos logit contributions
isk+0.122
aid+0.109
eh+0.107
aks+0.106
arl+0.105
Neg logit contributions
ement-0.041
ised-0.038
work-0.037
 int-0.035
 curiously-0.035
Token He
Token ID679
Feature activation-8.05e-03

Pos logit contributions
isk+0.077
eh+0.065
coat+0.064
aks+0.064
aid+0.061
Neg logit contributions
inated-0.051
 sunset-0.051
ised-0.051
ement-0.050
 top-0.047
Token locks
Token ID19253
Feature activation-1.81e-02

Pos logit contributions
isk+0.012
aks+0.010
arl+0.010
aid+0.010
ives+0.010
Neg logit contributions
inated-0.015
ised-0.015
ement-0.015
el-0.015
work-0.015
Token their
Token ID511
Feature activation-1.86e-02

Pos logit contributions
isk+0.002
aks+0.001
arl+0.001
ives+0.001
eh+0.001
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
work-0.002
el-0.002
Token hats
Token ID23910
Feature activation-1.65e-02

Pos logit contributions
isk+0.029
ives+0.024
aks+0.024
arl+0.023
aid+0.023
Neg logit contributions
ised-0.035
ement-0.035
inated-0.035
 sunset-0.035
el-0.035
Token and
Token ID290
Feature activation-4.38e-02

Pos logit contributions
isk+0.033
aks+0.029
eh+0.028
arl+0.028
aid+0.028
Neg logit contributions
ised-0.024
ement-0.024
work-0.024
inated-0.024
el-0.023
Token coats
Token ID30720
Feature activation-7.52e-03

Pos logit contributions
isk+0.079
ives+0.066
and+0.066
arl+0.066
smart+0.061
Neg logit contributions
 sunset-0.075
el-0.071
ised-0.071
work-0.070
inated-0.070
Token and
Token ID290
Feature activation-5.32e-02

Pos logit contributions
isk+0.015
aks+0.013
eh+0.012
arl+0.012
ives+0.012
Neg logit contributions
ised-0.012
inated-0.011
ement-0.011
work-0.011
el-0.011
Token sne
Token ID10505
Feature activation-8.61e-02

Pos logit contributions
isk+0.098
arl+0.080
and+0.080
ives+0.078
smart+0.074
Neg logit contributions
 sunset-0.090
ised-0.083
work-0.083
el-0.082
inated-0.082
Tokenaked
Token ID4335
Feature activation-3.56e-02

Pos logit contributions
 shady+0.097
isk+0.095
 cafeteria+0.086
 ranger+0.084
 backup+0.080
Neg logit contributions
el-0.179
ised-0.160
efully-0.147
work-0.145
ement-0.143
Token out
Token ID503
Feature activation-2.29e-02

Pos logit contributions
isk+0.076
arl+0.063
aid+0.063
ives+0.062
aks+0.062
Neg logit contributions
ised-0.049
work-0.047
inated-0.047
ement-0.047
 int-0.047
Token of
Token ID286
Feature activation-2.06e-02

Pos logit contributions
isk+0.047
ives+0.039
aid+0.039
aks+0.038
eh+0.038
Neg logit contributions
el-0.034
work-0.034
ised-0.033
ement-0.033
inated-0.032
Token the
Token ID262
Feature activation-3.57e-02

Pos logit contributions
isk+0.044
ives+0.040
eh+0.037
aks+0.037
arl+0.037
Neg logit contributions
ised-0.028
el-0.028
 sunset-0.028
ement-0.026
work-0.026
Token house
Token ID2156
Feature activation-3.11e-02

Pos logit contributions
isk+0.069
ives+0.057
aid+0.056
smart+0.053
aks+0.053
Neg logit contributions
 sunset-0.060
 horizon-0.057
el-0.056
 comet-0.056
work-0.056
Token fell
Token ID3214
Feature activation-4.12e-03

Pos logit contributions
ised+0.000
inated+0.000
ement+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
eh-0.000
Token asleep
Token ID16039
Feature activation-5.10e-03

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.005
eh+0.005
Neg logit contributions
ised-0.008
inated-0.007
ement-0.007
work-0.007
el-0.007
Token.
Token ID13
Feature activation-3.72e-03

Pos logit contributions
isk+0.010
aks+0.009
arl+0.009
ives+0.009
eh+0.009
Neg logit contributions
ised-0.008
ement-0.007
inated-0.007
work-0.007
el-0.007
Token In
Token ID554
Feature activation+5.43e-03

Pos logit contributions
isk+0.007
aks+0.006
eh+0.006
ives+0.006
arl+0.006
Neg logit contributions
inated-0.006
ised-0.006
ement-0.006
work-0.005
el-0.005
Token the
Token ID262
Feature activation-2.46e-02

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.009
el+0.008
Neg logit contributions
isk-0.010
aks-0.009
arl-0.008
ives-0.008
eh-0.008
Token middle
Token ID3504
Feature activation-8.57e-02

Pos logit contributions
isk+0.059
aks+0.054
ives+0.054
aid+0.051
arl+0.051
Neg logit contributions
 sunset-0.029
 horizon-0.026
ement-0.026
el-0.026
ised-0.025
Token of
Token ID286
Feature activation+6.59e-03

Pos logit contributions
isk+0.213
 robber+0.193
arl+0.188
inters+0.178
 stret+0.177
Neg logit contributions
ement-0.111
 once-0.104
el-0.104
 sunset-0.088
 suddenly-0.082
Token the
Token ID262
Feature activation-8.26e-03

Pos logit contributions
ised+0.011
inated+0.011
ement+0.011
work+0.011
el+0.010
Neg logit contributions
isk-0.012
aks-0.010
arl-0.010
eh-0.010
aid-0.010
Token night
Token ID1755
Feature activation-3.55e-02

Pos logit contributions
isk+0.018
aks+0.016
ives+0.016
arl+0.015
aid+0.015
Neg logit contributions
 sunset-0.011
ised-0.011
ement-0.011
inated-0.011
el-0.010
Token,
Token ID11
Feature activation+5.62e-04

Pos logit contributions
isk+0.081
arl+0.069
aks+0.068
aid+0.067
 robber+0.065
Neg logit contributions
 sunset-0.049
ised-0.048
ement-0.048
 horizon-0.046
el-0.046
Token she
Token ID673
Feature activation-1.83e-02

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token.
Token ID13
Feature activation+9.29e-03

Pos logit contributions
isk+0.025
aks+0.022
ives+0.022
arl+0.022
aid+0.022
Neg logit contributions
ised-0.014
ement-0.014
inated-0.014
work-0.013
 sunset-0.013
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
ised+0.017
ement+0.017
inated+0.016
work+0.016
el+0.016
Neg logit contributions
isk-0.016
aks-0.013
arl-0.013
ives-0.013
eh-0.012
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
isk+0.026
arl+0.022
aks+0.022
ives+0.022
aid+0.022
Neg logit contributions
ised-0.019
inated-0.019
ement-0.019
el-0.018
 sunset-0.018
TokenIn
Token ID818
Feature activation-8.60e-03

Pos logit contributions
isk+0.030
arl+0.026
ives+0.026
aks+0.026
 germ+0.025
Neg logit contributions
ised-0.027
inated-0.027
 sunset-0.024
izing-0.023
ement-0.023
Token the
Token ID262
Feature activation-4.72e-02

Pos logit contributions
isk+0.018
ives+0.016
arl+0.015
aks+0.015
aid+0.015
Neg logit contributions
ised-0.013
inated-0.012
ement-0.012
el-0.012
work-0.012
Token middle
Token ID3504
Feature activation-8.55e-02

Pos logit contributions
isk+0.110
aid+0.109
ives+0.108
coat+0.101
run+0.100
Neg logit contributions
 sunset-0.063
 top-0.061
 horizon-0.058
 destination-0.048
 comet-0.048
Token of
Token ID286
Feature activation-1.64e-02

Pos logit contributions
isk+0.229
arl+0.204
coat+0.204
 robber+0.204
 liar+0.195
Neg logit contributions
el-0.090
ement-0.085
 once-0.075
ised-0.074
 top-0.063
Token the
Token ID262
Feature activation-2.96e-02

Pos logit contributions
isk+0.036
ives+0.035
aid+0.032
arl+0.032
coat+0.032
Neg logit contributions
ised-0.021
el-0.021
 sunset-0.021
inated-0.020
work-0.020
Token night
Token ID1755
Feature activation-3.14e-02

Pos logit contributions
isk+0.063
aid+0.060
ives+0.056
arl+0.053
coat+0.053
Neg logit contributions
 sunset-0.046
 horizon-0.042
 top-0.042
 comet-0.041
ised-0.040
Token,
Token ID11
Feature activation+2.30e-02

Pos logit contributions
isk+0.072
aid+0.062
arl+0.061
aks+0.059
coat+0.058
Neg logit contributions
 sunset-0.043
ised-0.042
 horizon-0.041
ement-0.040
work-0.040
Token Tim
Token ID5045
Feature activation-3.76e-02

Pos logit contributions
ement+0.052
inated+0.050
ised+0.049
el+0.049
work+0.049
Neg logit contributions
isk-0.027
aks-0.023
eh-0.022
ens-0.021
aid-0.020
Token.
Token ID13
Feature activation-2.27e-02

Pos logit contributions
isk+0.063
eh+0.056
 germ+0.054
aks+0.054
arl+0.052
Neg logit contributions
work-0.039
ised-0.038
el-0.037
inated-0.037
ement-0.037
Token She
Token ID1375
Feature activation-5.84e-03

Pos logit contributions
isk+0.047
eh+0.041
aks+0.041
aid+0.039
 germ+0.037
Neg logit contributions
inated-0.034
ised-0.033
ement-0.033
 sunset-0.032
 int-0.030
Token puts
Token ID7584
Feature activation-1.72e-02

Pos logit contributions
isk+0.009
aks+0.007
arl+0.007
eh+0.007
ives+0.007
Neg logit contributions
ement-0.012
ised-0.012
inated-0.012
work-0.011
el-0.011
Token them
Token ID606
Feature activation-1.25e-03

Pos logit contributions
isk+0.036
arl+0.031
eh+0.031
aks+0.031
ives+0.031
Neg logit contributions
ised-0.024
ement-0.024
el-0.023
work-0.023
inated-0.023
Token in
Token ID287
Feature activation-1.74e-03

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
eh+0.002
Neg logit contributions
ised-0.002
inated-0.002
ement-0.002
work-0.002
el-0.002
Token a
Token ID257
Feature activation-8.53e-02

Pos logit contributions
isk+0.003
aks+0.003
ives+0.003
arl+0.003
eh+0.003
Neg logit contributions
ised-0.003
ement-0.003
inated-0.003
el-0.003
work-0.003
Token bucket
Token ID19236
Feature activation-5.05e-02

Pos logit contributions
isk+0.199
ives+0.169
and+0.162
ens+0.158
aid+0.150
Neg logit contributions
 top-0.119
 sunset-0.105
 jet-0.099
ement-0.093
 comet-0.093
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
isk+0.125
 germ+0.107
 robber+0.105
 backup+0.105
arl+0.102
Neg logit contributions
el-0.058
ement-0.053
ised-0.053
work-0.051
inated-0.048
Token She
Token ID1375
Feature activation+2.22e-03

Pos logit contributions
isk+0.054
aks+0.046
eh+0.046
aid+0.044
 germ+0.043
Neg logit contributions
inated-0.038
ement-0.037
ised-0.036
 sunset-0.036
 horizon-0.034
Token fills
Token ID23816
Feature activation+5.45e-03

Pos logit contributions
ised+0.005
ement+0.005
inated+0.005
work+0.004
el+0.004
Neg logit contributions
isk-0.003
aks-0.003
arl-0.002
ives-0.002
eh-0.002
Token the
Token ID262
Feature activation-1.64e-02

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.010
aks-0.009
arl-0.009
ives-0.008
eh-0.008
Token a
Token ID257
Feature activation-4.13e-02

Pos logit contributions
isk+0.092
arl+0.081
ives+0.077
aks+0.077
eh+0.073
Neg logit contributions
el-0.058
 sunset-0.057
ised-0.056
inated-0.056
work-0.055
Token big
Token ID1263
Feature activation-5.41e-02

Pos logit contributions
isk+0.084
ives+0.071
aks+0.071
arl+0.066
and+0.065
Neg logit contributions
 sunset-0.063
 comet-0.063
ement-0.062
 top-0.060
 int-0.059
Token chair
Token ID5118
Feature activation-4.23e-02

Pos logit contributions
isk+0.109
ives+0.093
and+0.091
arl+0.087
aks+0.086
Neg logit contributions
ement-0.087
 sunset-0.087
el-0.086
 comet-0.084
 top-0.083
Token,
Token ID11
Feature activation-2.53e-02

Pos logit contributions
isk+0.093
aks+0.084
eh+0.083
arl+0.082
aid+0.080
Neg logit contributions
work-0.054
ement-0.054
ised-0.050
el-0.049
inated-0.048
Token a
Token ID257
Feature activation-6.02e-02

Pos logit contributions
isk+0.055
arl+0.048
ives+0.047
aks+0.046
eh+0.044
Neg logit contributions
ement-0.034
ised-0.034
inated-0.033
el-0.033
 sunset-0.032
Token loud
Token ID7812
Feature activation-8.49e-02

Pos logit contributions
isk+0.123
ives+0.104
aks+0.103
aked+0.095
arl+0.093
Neg logit contributions
 comet-0.096
 sunset-0.095
 top-0.093
ement-0.089
 brightly-0.087
Token drill
Token ID16007
Feature activation-4.11e-02

Pos logit contributions
isk+0.176
ives+0.165
aks+0.156
and+0.152
arl+0.149
Neg logit contributions
 sunset-0.120
el-0.113
 comet-0.112
ement-0.111
 top-0.111
Token and
Token ID290
Feature activation-2.79e-02

Pos logit contributions
isk+0.088
aks+0.079
eh+0.078
arl+0.077
aid+0.076
Neg logit contributions
ement-0.054
work-0.054
inated-0.053
ised-0.052
el-0.051
Token a
Token ID257
Feature activation-4.66e-02

Pos logit contributions
isk+0.062
arl+0.054
aks+0.052
ives+0.051
eh+0.049
Neg logit contributions
ised-0.036
inated-0.035
 sunset-0.035
work-0.034
ement-0.034
Token bitter
Token ID12922
Feature activation-5.18e-02

Pos logit contributions
isk+0.092
aks+0.076
ives+0.074
ens+0.069
arl+0.069
Neg logit contributions
 sunset-0.075
 comet-0.075
 top-0.074
ement-0.073
 int-0.071
Token paste
Token ID17008
Feature activation-2.80e-02

Pos logit contributions
isk+0.104
aks+0.089
ives+0.084
arl+0.082
smart+0.080
Neg logit contributions
 sunset-0.082
ement-0.080
el-0.077
 comet-0.076
work-0.076
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
inated+0.095
ised+0.092
used+0.090
ement+0.088
ths+0.087
Neg logit contributions
isk-0.067
aks-0.064
coat-0.060
ives-0.059
 shady-0.058
Token pretty
Token ID2495
Feature activation+4.22e-02

Pos logit contributions
ement+0.028
inated+0.028
ised+0.028
work+0.026
el+0.026
Neg logit contributions
isk-0.031
aks-0.028
aid-0.026
eh-0.026
arl-0.025
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
inated+0.072
ised+0.071
ement+0.068
ths+0.067
el+0.066
Neg logit contributions
isk-0.066
aks-0.060
ives-0.058
coat-0.058
 comes-0.054
Token Daisy
Token ID40355
Feature activation+3.09e-02

Pos logit contributions
ement+0.034
work+0.033
ised+0.033
el+0.033
inated+0.033
Neg logit contributions
isk-0.037
aks-0.032
ives-0.032
eh-0.030
arl-0.030
Token was
Token ID373
Feature activation+1.49e-02

Pos logit contributions
ement+0.056
ised+0.054
inated+0.053
work+0.053
 sunset+0.051
Neg logit contributions
isk-0.051
aks-0.043
ives-0.041
eh-0.041
coat-0.041
Token capt
Token ID3144
Feature activation+6.04e-02

Pos logit contributions
ised+0.022
ement+0.022
inated+0.022
work+0.021
el+0.021
Neg logit contributions
isk-0.029
aks-0.026
eh-0.025
ives-0.024
aid-0.024
Tokenivated
Token ID30829
Feature activation+5.46e-02

Pos logit contributions
 sunset+0.096
sets+0.094
 avail+0.093
 comet+0.092
 horizon+0.092
Neg logit contributions
isk-0.093
aid-0.091
arl-0.091
enced-0.086
eh-0.086
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
inated+0.108
ble+0.108
work+0.106
 comet+0.103
 summit+0.101
Neg logit contributions
isk-0.063
ens-0.060
and-0.054
smart-0.052
aks-0.052
Token she
Token ID673
Feature activation+3.37e-02

Pos logit contributions
ement+0.016
ised+0.016
inated+0.016
work+0.016
el+0.016
Neg logit contributions
isk-0.019
aks-0.017
ives-0.016
eh-0.016
aid-0.016
Token couldn
Token ID3521
Feature activation+4.27e-02

Pos logit contributions
ement+0.060
ised+0.058
inated+0.057
work+0.057
el+0.055
Neg logit contributions
isk-0.056
aks-0.047
eh-0.046
coat-0.045
ives-0.045
Token't
Token ID470
Feature activation+2.08e-02

Pos logit contributions
ised+0.075
inated+0.070
ement+0.069
ishment+0.068
work+0.068
Neg logit contributions
isk-0.074
ives-0.066
aks-0.064
arl-0.061
aster-0.059
TokenAre
Token ID8491
Feature activation+3.10e-02

Pos logit contributions
ement+0.046
inated+0.044
ised+0.044
work+0.043
el+0.042
Neg logit contributions
isk-0.045
aid-0.039
aks-0.039
ives-0.037
eh-0.037
Token you
Token ID345
Feature activation+1.05e-02

Pos logit contributions
ement+0.057
ised+0.057
inated+0.056
 horizon+0.053
work+0.052
Neg logit contributions
isk-0.051
 backup-0.041
ens-0.041
arl-0.040
aid-0.039
Token lost
Token ID2626
Feature activation+2.79e-02

Pos logit contributions
ised+0.018
ement+0.018
inated+0.018
work+0.017
el+0.017
Neg logit contributions
isk-0.018
aks-0.016
arl-0.015
eh-0.015
ives-0.015
Token?"
Token ID1701
Feature activation+2.48e-02

Pos logit contributions
ement+0.047
inated+0.047
ised+0.046
 int+0.043
work+0.043
Neg logit contributions
isk-0.048
aks-0.042
arl-0.041
eh-0.040
coat-0.040
Token the
Token ID262
Feature activation+3.68e-02

Pos logit contributions
ement+0.053
work+0.052
inated+0.052
ised+0.051
el+0.049
Neg logit contributions
isk-0.030
aks-0.026
ens-0.025
aid-0.024
coat-0.024
Token voice
Token ID3809
Feature activation+5.80e-02

Pos logit contributions
inated+0.081
ement+0.079
work+0.078
 int+0.077
ised+0.077
Neg logit contributions
isk-0.035
arl-0.035
 germ-0.034
 firefighter-0.033
aks-0.033
Token said
Token ID531
Feature activation+6.07e-03

Pos logit contributions
 horizon+0.108
inated+0.106
 sunset+0.100
ised+0.098
ement+0.097
Neg logit contributions
aid-0.078
smart-0.076
isk-0.076
aks-0.073
 comes-0.069
Token again
Token ID757
Feature activation+3.34e-02

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.012
aks-0.011
arl-0.010
ives-0.010
aid-0.010
Token.
Token ID13
Feature activation+4.69e-03

Pos logit contributions
inated+0.064
ised+0.063
ement+0.062
work+0.059
el+0.059
Neg logit contributions
isk-0.050
aks-0.042
arl-0.042
smart-0.040
ives-0.038
Token This
Token ID770
Feature activation+3.48e-02

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
work+0.009
el+0.009
Neg logit contributions
isk-0.007
aks-0.006
arl-0.006
ives-0.006
eh-0.005
Token time
Token ID640
Feature activation+1.09e-02

Pos logit contributions
inated+0.072
ised+0.070
ement+0.069
izing+0.067
used+0.066
Neg logit contributions
aks-0.042
isk-0.041
 comes-0.040
 germ-0.038
eh-0.036
Token disappointed
Token ID11679
Feature activation+1.22e-02

Pos logit contributions
ised+0.011
inated+0.011
ement+0.011
work+0.011
el+0.011
Neg logit contributions
isk-0.011
aks-0.010
arl-0.009
eh-0.009
aid-0.009
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
inated+0.023
ised+0.022
ement+0.022
work+0.022
el+0.021
Neg logit contributions
isk-0.020
aks-0.017
arl-0.017
ives-0.016
aid-0.016
Token sad
Token ID6507
Feature activation+9.17e-03

Pos logit contributions
isk+0.022
aks+0.020
ives+0.019
arl+0.019
eh+0.018
Neg logit contributions
ised-0.014
inated-0.014
ement-0.013
work-0.013
el-0.013
Token that
Token ID326
Feature activation+6.96e-03

Pos logit contributions
ised+0.014
inated+0.014
ement+0.014
work+0.013
el+0.013
Neg logit contributions
isk-0.018
aks-0.015
arl-0.015
ives-0.015
aid-0.015
Token Lily
Token ID20037
Feature activation+1.46e-02

Pos logit contributions
ised+0.013
inated+0.012
ement+0.012
work+0.012
el+0.012
Neg logit contributions
isk-0.012
aks-0.010
arl-0.010
ives-0.009
aid-0.009
Token didn
Token ID1422
Feature activation+5.40e-02

Pos logit contributions
ised+0.024
inated+0.024
ement+0.024
work+0.023
el+0.022
Neg logit contributions
isk-0.027
aks-0.023
ives-0.022
aid-0.022
eh-0.022
Token't
Token ID470
Feature activation-5.51e-03

Pos logit contributions
ised+0.108
ement+0.107
ishment+0.105
inated+0.103
 diamonds+0.100
Neg logit contributions
isk-0.081
arl-0.069
ives-0.068
 backup-0.065
aster-0.063
Token listen
Token ID6004
Feature activation+2.63e-03

Pos logit contributions
isk+0.008
aks+0.007
eh+0.007
arl+0.006
ives+0.006
Neg logit contributions
ised-0.011
inated-0.011
ement-0.011
work-0.010
 sunset-0.010
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
ised+0.005
ement+0.004
inated+0.004
work+0.004
el+0.004
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token her
Token ID607
Feature activation+1.87e-02

Pos logit contributions
ised+0.019
ement+0.019
inated+0.019
work+0.018
el+0.017
Neg logit contributions
isk-0.018
aks-0.015
arl-0.015
aid-0.014
 germ-0.014
Token and
Token ID290
Feature activation-9.10e-03

Pos logit contributions
inated+0.031
ised+0.031
ement+0.030
el+0.029
work+0.028
Neg logit contributions
isk-0.035
arl-0.028
aks-0.028
 germ-0.028
aid-0.028
Token Lily
Token ID20037
Feature activation-3.33e-03

Pos logit contributions
isk+0.020
arl+0.017
aks+0.016
ives+0.016
eh+0.016
Neg logit contributions
inated-0.017
ised-0.017
 sunset-0.017
work-0.017
el-0.017
Token were
Token ID547
Feature activation-3.69e-03

Pos logit contributions
isk+0.006
aks+0.005
arl+0.005
eh+0.005
ives+0.005
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
el-0.006
work-0.006
Token playing
Token ID2712
Feature activation-2.21e-03

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.006
aid+0.006
Neg logit contributions
ised-0.006
ement-0.006
inated-0.006
el-0.005
work-0.005
Token with
Token ID351
Feature activation+5.04e-05

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
ives+0.004
aid+0.004
Neg logit contributions
ised-0.003
inated-0.003
ement-0.003
work-0.003
el-0.003
Token their
Token ID511
Feature activation+3.31e-02

Pos logit contributions
ised+0.000
inated+0.000
ement+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.000
aks-0.000
arl-0.000
ives-0.000
aid-0.000
Token toy
Token ID13373
Feature activation+6.02e-02

Pos logit contributions
ement+0.079
inated+0.078
ised+0.077
 horizon+0.074
work+0.073
Neg logit contributions
isk-0.033
aks-0.029
eh-0.026
coat-0.026
 germ-0.025
Token cars
Token ID5006
Feature activation+2.22e-03

Pos logit contributions
ement+0.150
inated+0.149
 mesmer+0.145
ised+0.145
 horizon+0.143
Neg logit contributions
isk-0.052
 fetch-0.041
 fort-0.041
 robber-0.041
eh-0.039
Token in
Token ID287
Feature activation+1.35e-02

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token the
Token ID262
Feature activation-1.54e-02

Pos logit contributions
ement+0.025
inated+0.025
ised+0.025
work+0.024
 int+0.023
Neg logit contributions
isk-0.022
aks-0.020
arl-0.018
eh-0.017
 backup-0.017
Token garden
Token ID11376
Feature activation+6.41e-03

Pos logit contributions
isk+0.028
ives+0.024
aid+0.024
arl+0.024
aks+0.023
Neg logit contributions
 sunset-0.026
el-0.026
ised-0.025
inated-0.025
work-0.024
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.013
aks-0.012
arl-0.011
ives-0.011
eh-0.011
Token
Token ID198
Feature activation-7.60e-03

Pos logit contributions
isk+0.005
aks+0.005
arl+0.004
ives+0.004
eh+0.004
Neg logit contributions
ised-0.004
inated-0.004
ement-0.004
el-0.004
work-0.004
Token"
Token ID1
Feature activation+1.91e-02

Pos logit contributions
isk+0.013
aks+0.010
arl+0.010
ives+0.010
coat+0.010
Neg logit contributions
ised-0.014
inated-0.014
ement-0.013
el-0.013
izing-0.013
TokenGive
Token ID23318
Feature activation+2.39e-02

Pos logit contributions
ement+0.037
work+0.036
inated+0.036
ised+0.036
el+0.035
Neg logit contributions
isk-0.030
aks-0.026
aid-0.026
eh-0.025
ives-0.024
Token me
Token ID502
Feature activation-1.90e-02

Pos logit contributions
ement+0.043
inated+0.043
ised+0.043
work+0.042
 horizon+0.040
Neg logit contributions
isk-0.040
aks-0.035
ens-0.034
ives-0.032
 germ-0.032
Token the
Token ID262
Feature activation+6.27e-02

Pos logit contributions
isk+0.040
aks+0.036
ives+0.035
eh+0.034
arl+0.034
Neg logit contributions
ised-0.026
el-0.025
ement-0.025
inated-0.025
work-0.024
Token orange
Token ID10912
Feature activation+7.50e-02

Pos logit contributions
ised+0.134
ths+0.131
izing+0.122
used+0.121
inct+0.121
Neg logit contributions
isk-0.073
 banana-0.073
 shady-0.072
 bisc-0.071
 cookie-0.066
Token,
Token ID11
Feature activation+7.61e-03

Pos logit contributions
ths+0.166
ised+0.165
ement+0.159
 mesmer+0.150
inct+0.149
Neg logit contributions
 banana-0.084
 cookie-0.080
isk-0.076
coat-0.075
 fetch-0.075
Token Ben
Token ID3932
Feature activation-1.25e-02

Pos logit contributions
ised+0.012
inated+0.012
ement+0.012
work+0.011
el+0.011
Neg logit contributions
isk-0.015
aks-0.013
arl-0.012
ives-0.012
aid-0.012
Token!
Token ID0
Feature activation+1.61e-02

Pos logit contributions
isk+0.028
arl+0.024
aks+0.024
ives+0.024
aid+0.024
Neg logit contributions
ised-0.016
inated-0.016
ement-0.015
el-0.015
work-0.015
Token It
Token ID632
Feature activation+1.65e-02

Pos logit contributions
ised+0.028
ement+0.027
inated+0.026
work+0.025
 sunset+0.025
Neg logit contributions
isk-0.029
arl-0.025
aks-0.025
ives-0.025
aid-0.023
Token is
Token ID318
Feature activation+1.09e-02

Pos logit contributions
ised+0.032
ement+0.032
inated+0.031
 horizon+0.030
 sunset+0.029
Neg logit contributions
isk-0.027
aks-0.022
ives-0.021
eh-0.020
aid-0.020
Token until
Token ID1566
Feature activation+2.37e-02

Pos logit contributions
isk+0.019
aks+0.017
arl+0.016
ives+0.016
eh+0.016
Neg logit contributions
ement-0.014
ised-0.014
inated-0.014
el-0.014
work-0.013
Token she
Token ID673
Feature activation+1.34e-02

Pos logit contributions
inated+0.044
ised+0.044
ement+0.044
work+0.042
izing+0.040
Neg logit contributions
isk-0.036
aks-0.031
 backup-0.030
 germ-0.030
aster-0.030
Token knew
Token ID2993
Feature activation+2.47e-02

Pos logit contributions
ised+0.029
inated+0.029
ement+0.029
work+0.028
el+0.027
Neg logit contributions
isk-0.018
aks-0.015
ives-0.014
eh-0.014
aid-0.013
Token the
Token ID262
Feature activation+3.44e-02

Pos logit contributions
ised+0.041
inated+0.040
ement+0.038
work+0.037
 horizon+0.036
Neg logit contributions
isk-0.046
aks-0.038
aid-0.038
smart-0.038
 backup-0.038
Token poem
Token ID21247
Feature activation-4.77e-03

Pos logit contributions
ised+0.064
inated+0.063
ement+0.062
work+0.058
el+0.058
Neg logit contributions
isk-0.054
 germ-0.051
ives-0.048
aks-0.048
eh-0.046
Token by
Token ID416
Feature activation+1.78e-02

Pos logit contributions
isk+0.009
aks+0.008
arl+0.008
eh+0.008
ives+0.008
Neg logit contributions
ement-0.007
ised-0.007
inated-0.007
work-0.007
el-0.007
Token heart
Token ID2612
Feature activation+6.45e-04

Pos logit contributions
ised+0.028
ement+0.028
inated+0.028
work+0.026
el+0.025
Neg logit contributions
isk-0.035
aks-0.029
arl-0.028
 germ-0.027
eh-0.027
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.001
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token
Token ID198
Feature activation+4.76e-03

Pos logit contributions
ised+0.021
ement+0.021
inated+0.021
work+0.020
el+0.020
Neg logit contributions
isk-0.018
aks-0.015
ives-0.014
arl-0.014
aid-0.014
Token
Token ID198
Feature activation+8.34e-03

Pos logit contributions
ised+0.008
inated+0.008
ement+0.008
work+0.008
el+0.008
Neg logit contributions
isk-0.008
aks-0.007
arl-0.007
ives-0.007
eh-0.007
TokenOn
Token ID2202
Feature activation+2.14e-02

Pos logit contributions
ement+0.016
ised+0.016
inated+0.015
work+0.015
el+0.015
Neg logit contributions
isk-0.014
aks-0.012
ives-0.011
arl-0.011
eh-0.011
Token them
Token ID606
Feature activation+8.49e-03

Pos logit contributions
isk+0.024
aks+0.021
arl+0.021
ives+0.021
eh+0.020
Neg logit contributions
ised-0.016
ement-0.016
inated-0.016
el-0.015
work-0.015
Token glow
Token ID19634
Feature activation+6.51e-03

Pos logit contributions
ised+0.016
ement+0.016
inated+0.016
work+0.015
el+0.015
Neg logit contributions
isk-0.013
aks-0.011
arl-0.011
ives-0.011
aid-0.011
Token in
Token ID287
Feature activation+1.94e-02

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.008
el+0.008
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
ives-0.011
eh-0.011
Token the
Token ID262
Feature activation-7.43e-03

Pos logit contributions
inated+0.035
ement+0.035
ised+0.035
work+0.033
ths+0.032
Neg logit contributions
isk-0.032
aks-0.028
arl-0.025
smart-0.025
 backup-0.025
Token dark
Token ID3223
Feature activation-1.12e-03

Pos logit contributions
isk+0.015
aks+0.013
ives+0.012
arl+0.012
aid+0.012
Neg logit contributions
ised-0.011
ement-0.011
inated-0.011
el-0.011
 sunset-0.011
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
ives+0.002
aid+0.002
Neg logit contributions
ised-0.002
ement-0.002
inated-0.002
el-0.001
work-0.001
Token Billy
Token ID15890
Feature activation+1.57e-03

Pos logit contributions
ised+0.023
ement+0.023
inated+0.022
work+0.022
el+0.022
Neg logit contributions
isk-0.023
aks-0.019
ives-0.018
arl-0.018
aid-0.018
Token was
Token ID373
Feature activation+8.46e-04

Pos logit contributions
ised+0.003
inated+0.002
ement+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.003
aks-0.003
arl-0.002
ives-0.002
eh-0.002
Token so
Token ID523
Feature activation-4.14e-03

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
aid-0.001
Token happy
Token ID3772
Feature activation+1.11e-02

Pos logit contributions
isk+0.007
aks+0.006
arl+0.006
ives+0.006
eh+0.006
Neg logit contributions
ised-0.008
inated-0.007
ement-0.007
work-0.007
el-0.007
Token to
Token ID284
Feature activation-4.04e-03

Pos logit contributions
ised+0.018
inated+0.017
ement+0.017
el+0.016
work+0.016
Neg logit contributions
isk-0.021
aks-0.018
ives-0.018
arl-0.017
eh-0.017
Token your
Token ID534
Feature activation+5.04e-03

Pos logit contributions
ement+0.036
ised+0.036
inated+0.035
 horizon+0.033
work+0.033
Neg logit contributions
isk-0.023
aks-0.020
arl-0.019
aid-0.019
 germ-0.018
Token glue
Token ID22749
Feature activation-3.13e-03

Pos logit contributions
ised+0.009
inated+0.009
ement+0.009
work+0.009
el+0.008
Neg logit contributions
isk-0.009
aks-0.007
arl-0.007
ives-0.007
eh-0.007
Token.
Token ID13
Feature activation-2.43e-04

Pos logit contributions
isk+0.007
aks+0.006
arl+0.006
ives+0.005
eh+0.005
Neg logit contributions
ised-0.004
ement-0.004
inated-0.004
work-0.004
el-0.004
Token It
Token ID632
Feature activation+6.02e-03

Pos logit contributions
isk+0.000
aks+0.000
arl+0.000
ives+0.000
eh+0.000
Neg logit contributions
ised-0.000
inated-0.000
ement-0.000
work-0.000
el-0.000
Token is
Token ID318
Feature activation+2.07e-02

Pos logit contributions
ised+0.012
ement+0.012
inated+0.012
work+0.011
el+0.011
Neg logit contributions
isk-0.009
aks-0.008
ives-0.007
arl-0.007
aid-0.007
Token pretty
Token ID2495
Feature activation+1.33e-02

Pos logit contributions
ised+0.040
ement+0.040
inated+0.040
work+0.037
el+0.037
Neg logit contributions
isk-0.032
aks-0.029
eh-0.026
arl-0.026
aid-0.025
Token."
Token ID526
Feature activation-3.07e-02

Pos logit contributions
ised+0.023
inated+0.023
ement+0.023
el+0.021
work+0.021
Neg logit contributions
isk-0.023
aks-0.020
ives-0.019
arl-0.019
aid-0.019
Token
Token ID198
Feature activation-2.95e-02

Pos logit contributions
isk+0.076
eh+0.065
coat+0.064
aks+0.063
 germ+0.063
Neg logit contributions
 sunset-0.033
ised-0.030
ement-0.030
inated-0.028
 once-0.027
Token
Token ID198
Feature activation-2.60e-02

Pos logit contributions
isk+0.068
coat+0.062
 robber+0.059
 germ+0.059
ives+0.059
Neg logit contributions
ised-0.035
ement-0.032
el-0.032
 sunset-0.032
 curiously-0.030
TokenThey
Token ID2990
Feature activation-3.70e-03

Pos logit contributions
isk+0.041
coat+0.035
 germ+0.034
ives+0.034
 robber+0.032
Neg logit contributions
ised-0.051
el-0.046
ement-0.046
izing-0.045
 sunset-0.044
Token hug
Token ID16225
Feature activation+1.99e-02

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
eh+0.004
ives+0.004
Neg logit contributions
ised-0.008
inated-0.008
ement-0.007
work-0.007
el-0.007
Token
Token ID220
Feature activation-1.87e-02

Pos logit contributions
isk+0.001
aks+0.000
arl+0.000
ives+0.000
eh+0.000
Neg logit contributions
ised-0.001
inated-0.001
ement-0.001
work-0.001
el-0.001
Token
Token ID198
Feature activation-2.35e-02

Pos logit contributions
isk+0.037
 germ+0.032
aks+0.032
eh+0.032
aid+0.032
Neg logit contributions
ised-0.027
ement-0.027
inated-0.026
el-0.025
work-0.025
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
isk+0.051
 germ+0.045
arl+0.043
coat+0.043
aks+0.043
Neg logit contributions
ised-0.031
inated-0.029
ement-0.029
 sunset-0.028
 curiously-0.027
TokenHe
Token ID1544
Feature activation+5.04e-03

Pos logit contributions
isk+0.029
arl+0.024
aks+0.024
 germ+0.024
ives+0.024
Neg logit contributions
ised-0.023
inated-0.022
ement-0.021
 sunset-0.021
el-0.021
Token was
Token ID373
Feature activation-7.47e-03

Pos logit contributions
ised+0.009
ement+0.009
inated+0.009
work+0.009
el+0.009
Neg logit contributions
isk-0.009
aks-0.007
ives-0.007
arl-0.007
eh-0.007
Token getting
Token ID1972
Feature activation+1.06e-02

Pos logit contributions
isk+0.016
arl+0.015
aks+0.015
ives+0.014
aid+0.014
Neg logit contributions
ised-0.010
inated-0.009
ement-0.009
work-0.009
el-0.009
Token ready
Token ID3492
Feature activation-2.18e-02

Pos logit contributions
inated+0.018
ement+0.018
ised+0.018
work+0.017
el+0.017
Neg logit contributions
isk-0.019
aks-0.017
aid-0.016
arl-0.016
ives-0.015
Token to
Token ID284
Feature activation+2.32e-02

Pos logit contributions
isk+0.040
aks+0.034
eh+0.033
aid+0.033
arl+0.033
Neg logit contributions
ement-0.037
ised-0.036
work-0.036
inated-0.035
 int-0.035
Token hurry
Token ID23290
Feature activation+1.92e-02

Pos logit contributions
ement+0.054
ised+0.053
inated+0.051
work+0.049
izing+0.049
Neg logit contributions
isk-0.029
aks-0.023
arl-0.021
 backup-0.021
aster-0.020
Token back
Token ID736
Feature activation-2.55e-03

Pos logit contributions
inated+0.033
ised+0.033
ement+0.033
work+0.031
el+0.031
Neg logit contributions
isk-0.034
aks-0.029
coat-0.027
arl-0.027
ives-0.027
Token to
Token ID284
Feature activation-6.56e-03

Pos logit contributions
isk+0.005
aks+0.004
arl+0.004
ives+0.004
aid+0.004
Neg logit contributions
ised-0.004
inated-0.004
ement-0.004
work-0.004
el-0.004
Token carrying
Token ID6872
Feature activation+2.14e-02

Pos logit contributions
ement+0.051
ised+0.051
inated+0.049
 horizon+0.046
ths+0.045
Neg logit contributions
isk-0.048
arl-0.041
aks-0.040
eh-0.038
 germ-0.037
Token that
Token ID326
Feature activation+4.44e-02

Pos logit contributions
inated+0.037
ement+0.037
ised+0.036
work+0.035
itz+0.034
Neg logit contributions
isk-0.036
aks-0.033
eh-0.030
aid-0.030
 backup-0.029
Token candle
Token ID26839
Feature activation-5.18e-03

Pos logit contributions
inated+0.077
ised+0.077
ths+0.074
ement+0.074
itz+0.073
Neg logit contributions
aks-0.065
 germ-0.061
isk-0.060
 welcome-0.060
 comes-0.058
Token?"
Token ID1701
Feature activation+3.81e-02

Pos logit contributions
isk+0.011
aks+0.010
arl+0.009
ives+0.009
aid+0.009
Neg logit contributions
ised-0.007
inated-0.007
ement-0.007
work-0.007
el-0.006
Token
Token ID198
Feature activation+1.50e-02

Pos logit contributions
work+0.059
ised+0.059
inated+0.059
ble+0.058
ement+0.057
Neg logit contributions
isk-0.066
aks-0.057
aster-0.055
and-0.054
ens-0.053
Token
Token ID198
Feature activation+1.40e-02

Pos logit contributions
ised+0.028
inated+0.028
ement+0.027
work+0.027
el+0.026
Neg logit contributions
isk-0.024
aks-0.022
arl-0.020
ives-0.020
eh-0.019
TokenJohn
Token ID7554
Feature activation+3.56e-02

Pos logit contributions
ement+0.025
inated+0.025
ised+0.025
work+0.025
el+0.024
Neg logit contributions
isk-0.023
aks-0.021
eh-0.019
ives-0.019
arl-0.019
Token said
Token ID531
Feature activation-1.14e-02

Pos logit contributions
ement+0.062
inated+0.061
ised+0.061
work+0.060
 sunset+0.059
Neg logit contributions
isk-0.061
aks-0.050
eh-0.047
ens-0.047
coat-0.047
Token,
Token ID11
Feature activation+9.92e-03

Pos logit contributions
isk+0.023
aks+0.019
arl+0.019
ives+0.018
eh+0.018
Neg logit contributions
ement-0.018
ised-0.018
inated-0.018
work-0.017
el-0.017
Token "
Token ID366
Feature activation+3.29e-02

Pos logit contributions
ised+0.021
ement+0.021
inated+0.021
el+0.020
work+0.020
Neg logit contributions
isk-0.013
aks-0.011
arl-0.010
ives-0.010
eh-0.010
TokenThis
Token ID1212
Feature activation+4.62e-02

Pos logit contributions
ement+0.063
ised+0.059
inated+0.058
work+0.056
 avail+0.054
Neg logit contributions
isk-0.051
aid-0.045
aks-0.044
and-0.043
coat-0.043
Token.
Token ID13
Feature activation+7.45e-03

Pos logit contributions
isk+0.005
aks+0.005
arl+0.005
ives+0.005
aid+0.005
Neg logit contributions
ised-0.003
inated-0.003
ement-0.003
work-0.003
el-0.003
Token The
Token ID383
Feature activation+6.65e-03

Pos logit contributions
ised+0.013
ement+0.013
inated+0.013
work+0.013
el+0.013
Neg logit contributions
isk-0.013
aks-0.011
ives-0.010
arl-0.010
eh-0.010
Token bunny
Token ID44915
Feature activation+1.71e-02

Pos logit contributions
ised+0.013
ement+0.013
inated+0.012
work+0.012
el+0.012
Neg logit contributions
isk-0.011
aks-0.009
arl-0.009
eh-0.009
aid-0.009
Token wanted
Token ID2227
Feature activation-2.19e-02

Pos logit contributions
ement+0.032
ised+0.031
inated+0.031
work+0.030
el+0.030
Neg logit contributions
isk-0.028
aks-0.024
eh-0.023
ives-0.022
aid-0.022
Token to
Token ID284
Feature activation-2.99e-03

Pos logit contributions
isk+0.045
arl+0.041
aks+0.037
ives+0.037
coat+0.036
Neg logit contributions
 sunset-0.033
inated-0.032
el-0.031
work-0.031
ised-0.030
Token play
Token ID711
Feature activation-2.41e-02

Pos logit contributions
isk+0.004
aks+0.004
arl+0.003
ives+0.003
eh+0.003
Neg logit contributions
inated-0.006
ised-0.006
ement-0.006
el-0.006
work-0.006
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
isk+0.055
arl+0.050
aks+0.048
ives+0.048
ens+0.045
Neg logit contributions
ised-0.030
ement-0.029
inated-0.028
 sunset-0.027
work-0.026
Token bounce
Token ID20110
Feature activation-3.33e-05

Pos logit contributions
isk+0.031
arl+0.027
aks+0.027
ives+0.025
eh+0.024
Neg logit contributions
inated-0.037
 sunset-0.036
ised-0.036
 curiously-0.034
work-0.034
Token around
Token ID1088
Feature activation+1.71e-02

Pos logit contributions
isk+0.000
aks+0.000
arl+0.000
ives+0.000
eh+0.000
Neg logit contributions
ised-0.000
ement-0.000
inated-0.000
work-0.000
el-0.000
Token,
Token ID11
Feature activation-3.34e-03

Pos logit contributions
ised+0.030
ement+0.030
inated+0.029
work+0.029
el+0.027
Neg logit contributions
isk-0.029
aks-0.025
eh-0.024
smart-0.023
ives-0.023
Token but
Token ID475
Feature activation+2.23e-02

Pos logit contributions
isk+0.006
aks+0.005
arl+0.005
ives+0.005
aid+0.005
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
work-0.005
el-0.005
Token boy
Token ID2933
Feature activation-2.93e-02

Pos logit contributions
isk+0.046
aks+0.040
arl+0.037
ives+0.037
eh+0.037
Neg logit contributions
work-0.033
ised-0.032
 sunset-0.032
 comet-0.032
el-0.032
Token and
Token ID290
Feature activation-3.50e-02

Pos logit contributions
isk+0.063
arl+0.059
aks+0.057
eh+0.055
coat+0.054
Neg logit contributions
work-0.037
ement-0.037
ised-0.034
inated-0.033
el-0.033
Token not
Token ID407
Feature activation+1.58e-03

Pos logit contributions
isk+0.064
aks+0.058
arl+0.058
ives+0.054
eh+0.054
Neg logit contributions
work-0.056
inated-0.054
ised-0.054
 sunset-0.054
 comet-0.050
Token play
Token ID711
Feature activation-3.42e-02

Pos logit contributions
ised+0.003
inated+0.003
ement+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token tricks
Token ID15910
Feature activation-1.74e-02

Pos logit contributions
isk+0.079
arl+0.074
aks+0.069
aid+0.069
ives+0.068
Neg logit contributions
ised-0.041
ement-0.037
inated-0.036
izing-0.035
ized-0.035
Token on
Token ID319
Feature activation-1.23e-02

Pos logit contributions
isk+0.039
aks+0.033
arl+0.033
eh+0.033
aid+0.032
Neg logit contributions
ement-0.023
ised-0.022
work-0.021
inated-0.021
el-0.020
Token Sue
Token ID26089
Feature activation-3.81e-03

Pos logit contributions
isk+0.023
aks+0.020
arl+0.020
ives+0.019
aid+0.019
Neg logit contributions
ised-0.020
ement-0.020
el-0.019
inated-0.019
work-0.019
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
isk+0.008
aks+0.007
arl+0.007
aid+0.007
ives+0.007
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
work-0.005
el-0.005
Token They
Token ID1119
Feature activation+2.22e-02

Pos logit contributions
isk+0.034
aks+0.033
eh+0.031
aid+0.030
arl+0.030
Neg logit contributions
inated-0.032
 sunset-0.031
ement-0.031
ised-0.030
work-0.030
Token played
Token ID2826
Feature activation-4.59e-02

Pos logit contributions
ement+0.045
ised+0.044
inated+0.044
izing+0.042
work+0.041
Neg logit contributions
isk-0.036
aks-0.029
aid-0.027
ens-0.027
ives-0.027
Token together
Token ID1978
Feature activation-1.29e-02

Pos logit contributions
isk+0.118
arl+0.114
aid+0.106
 germ+0.104
coat+0.104
Neg logit contributions
ised-0.042
ement-0.037
 curiously-0.035
 top-0.034
 sunset-0.034
Token
Token ID198
Feature activation-4.85e-03

Pos logit contributions
ised+0.042
ement+0.042
inated+0.040
el+0.040
work+0.039
Neg logit contributions
isk-0.025
aks-0.023
arl-0.022
ives-0.021
aid-0.019
Token
Token ID198
Feature activation-3.67e-03

Pos logit contributions
isk+0.009
aks+0.008
arl+0.008
ives+0.008
eh+0.008
Neg logit contributions
ised-0.008
inated-0.008
ement-0.008
work-0.007
el-0.007
TokenMom
Token ID29252
Feature activation-2.94e-02

Pos logit contributions
isk+0.007
aks+0.006
ives+0.006
arl+0.006
coat+0.006
Neg logit contributions
ised-0.006
inated-0.006
ement-0.006
work-0.006
el-0.006
Token watches
Token ID16860
Feature activation+8.76e-04

Pos logit contributions
isk+0.055
eh+0.051
aks+0.046
 germ+0.046
coat+0.045
Neg logit contributions
ement-0.048
el-0.046
work-0.046
ised-0.042
 sunset-0.042
Token them
Token ID606
Feature activation+1.09e-03

Pos logit contributions
ised+0.001
ement+0.001
inated+0.001
work+0.001
el+0.001
Neg logit contributions
isk-0.002
aks-0.001
arl-0.001
ives-0.001
eh-0.001
Token and
Token ID290
Feature activation-9.59e-03

Pos logit contributions
ised+0.002
inated+0.002
ement+0.002
work+0.002
el+0.002
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token smiles
Token ID21845
Feature activation+2.93e-02

Pos logit contributions
isk+0.018
aks+0.016
eh+0.015
arl+0.015
ives+0.015
Neg logit contributions
ement-0.016
ised-0.016
inated-0.015
work-0.015
 sunset-0.015
Token.
Token ID13
Feature activation-5.05e-03

Pos logit contributions
ised+0.053
inated+0.053
ement+0.052
ths+0.049
el+0.049
Neg logit contributions
isk-0.048
aks-0.043
arl-0.042
ives-0.041
smart-0.039
Token She
Token ID1375
Feature activation+1.49e-03

Pos logit contributions
isk+0.009
aks+0.008
eh+0.007
aid+0.007
arl+0.007
Neg logit contributions
ised-0.009
inated-0.009
ement-0.009
 sunset-0.008
work-0.008
Token is
Token ID318
Feature activation-6.44e-04

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.002
aks-0.002
arl-0.002
ives-0.002
eh-0.002
Token proud
Token ID6613
Feature activation+8.07e-03

Pos logit contributions
isk+0.001
aks+0.001
ives+0.001
arl+0.001
eh+0.001
Neg logit contributions
ised-0.001
ement-0.001
inated-0.001
work-0.001
el-0.001
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
ement+0.015
ised+0.015
inated+0.014
work+0.014
el+0.013
Neg logit contributions
isk-0.020
aks-0.017
eh-0.016
arl-0.016
ives-0.016
Token She
Token ID1375
Feature activation-1.10e-02

Pos logit contributions
isk+0.054
aks+0.050
aid+0.048
eh+0.048
 backup+0.048
Neg logit contributions
inated-0.037
 sunset-0.036
el-0.034
ised-0.034
ement-0.034
Token had
Token ID550
Feature activation-1.94e-03

Pos logit contributions
isk+0.022
arl+0.019
aks+0.018
ives+0.018
 backup+0.018
Neg logit contributions
inated-0.017
ised-0.016
el-0.016
 sunset-0.016
work-0.016
Token a
Token ID257
Feature activation+8.85e-03

Pos logit contributions
isk+0.004
aks+0.003
arl+0.003
ives+0.003
aid+0.003
Neg logit contributions
ised-0.003
inated-0.003
ement-0.003
work-0.003
el-0.003
Token favorite
Token ID4004
Feature activation+2.05e-02

Pos logit contributions
ised+0.017
ement+0.017
inated+0.016
work+0.016
el+0.016
Neg logit contributions
isk-0.014
aks-0.012
arl-0.012
eh-0.012
coat-0.011
Token chair
Token ID5118
Feature activation-1.97e-02

Pos logit contributions
ised+0.042
ement+0.041
inated+0.041
work+0.039
el+0.039
Neg logit contributions
isk-0.029
eh-0.024
aks-0.024
coat-0.024
 germ-0.023
Token that
Token ID326
Feature activation-3.35e-02

Pos logit contributions
isk+0.043
aks+0.039
arl+0.039
ives+0.038
aid+0.037
Neg logit contributions
work-0.025
ement-0.025
inated-0.024
el-0.024
ised-0.024
Token she
Token ID673
Feature activation-2.48e-02

Pos logit contributions
isk+0.078
arl+0.073
ives+0.069
aks+0.069
 backup+0.068
Neg logit contributions
el-0.038
 sunset-0.038
inated-0.037
ised-0.036
 comet-0.036
Token loved
Token ID6151
Feature activation-3.61e-03

Pos logit contributions
isk+0.051
arl+0.047
aks+0.043
ives+0.043
 backup+0.043
Neg logit contributions
ised-0.035
inated-0.033
el-0.033
ement-0.032
work-0.031
Token to
Token ID284
Feature activation-3.14e-02

Pos logit contributions
isk+0.006
aks+0.005
arl+0.005
ives+0.005
eh+0.005
Neg logit contributions
ised-0.007
inated-0.007
ement-0.007
el-0.006
work-0.006
Token sit
Token ID1650
Feature activation+1.79e-02

Pos logit contributions
isk+0.055
aks+0.046
ives+0.044
arl+0.043
 backup+0.042
Neg logit contributions
el-0.055
inated-0.055
ised-0.055
 sunset-0.054
 comet-0.054
Token,
Token ID11
Feature activation-3.13e-02

Pos logit contributions
isk+0.011
aks+0.010
arl+0.009
ives+0.009
eh+0.009
Neg logit contributions
ised-0.009
ement-0.009
inated-0.009
work-0.008
el-0.008
Token let
Token ID1309
Feature activation+1.74e-03

Pos logit contributions
isk+0.055
aks+0.053
eh+0.050
aked+0.046
ives+0.046
Neg logit contributions
ised-0.056
izing-0.053
ement-0.052
inated-0.052
 sunset-0.050
Token's
Token ID338
Feature activation+9.50e-03

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.002
el+0.002
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token go
Token ID467
Feature activation-2.46e-02

Pos logit contributions
ement+0.021
ised+0.021
inated+0.020
work+0.020
el+0.019
Neg logit contributions
isk-0.013
aks-0.010
arl-0.010
ives-0.009
aid-0.009
Token!"
Token ID2474
Feature activation-8.80e-03

Pos logit contributions
isk+0.050
eh+0.042
aks+0.042
 germ+0.042
ives+0.041
Neg logit contributions
el-0.035
ised-0.034
ement-0.034
inated-0.034
work-0.033
Token
Token ID198
Feature activation-1.29e-02

Pos logit contributions
isk+0.015
aks+0.013
eh+0.012
arl+0.012
aid+0.012
Neg logit contributions
ised-0.016
inated-0.015
ement-0.015
 sunset-0.014
el-0.014
Token
Token ID198
Feature activation+3.89e-03

Pos logit contributions
isk+0.026
aks+0.022
arl+0.022
ives+0.022
eh+0.022
Neg logit contributions
ised-0.019
inated-0.018
ement-0.018
 sunset-0.017
el-0.017
TokenWhen
Token ID2215
Feature activation+1.82e-02

Pos logit contributions
ised+0.007
ement+0.007
inated+0.007
work+0.007
el+0.007
Neg logit contributions
isk-0.006
aks-0.005
arl-0.005
ives-0.005
eh-0.005
Token they
Token ID484
Feature activation-1.77e-02

Pos logit contributions
ement+0.035
ised+0.035
inated+0.034
work+0.033
el+0.032
Neg logit contributions
isk-0.029
aks-0.024
ens-0.023
arl-0.023
 germ-0.023
Token arrived
Token ID5284
Feature activation+1.80e-03

Pos logit contributions
isk+0.027
aks+0.024
arl+0.022
eh+0.022
 germ+0.022
Neg logit contributions
ised-0.034
ement-0.033
inated-0.033
el-0.033
 sunset-0.032
Token at
Token ID379
Feature activation-1.19e-03

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.002
el+0.002
Neg logit contributions
isk-0.004
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
ised+0.000
ement+0.000
inated+0.000
work+0.000
el+0.000
Neg logit contributions
isk-0.001
aks-0.000
arl-0.000
ives-0.000
aid-0.000
Token You
Token ID921
Feature activation+1.07e-02

Pos logit contributions
isk+0.022
aks+0.019
eh+0.018
aid+0.017
arl+0.017
Neg logit contributions
ised-0.022
inated-0.022
ement-0.022
work-0.021
el-0.021
Token can
Token ID460
Feature activation+1.08e-02

Pos logit contributions
ement+0.021
ised+0.021
inated+0.020
work+0.020
el+0.019
Neg logit contributions
isk-0.017
aks-0.014
ives-0.014
arl-0.013
aid-0.013
Token use
Token ID779
Feature activation+5.86e-03

Pos logit contributions
ised+0.022
ement+0.022
inated+0.022
work+0.021
el+0.020
Neg logit contributions
isk-0.016
aks-0.013
ives-0.012
arl-0.012
aid-0.012
Token your
Token ID534
Feature activation-2.81e-02

Pos logit contributions
ised+0.010
ement+0.010
inated+0.010
work+0.010
el+0.010
Neg logit contributions
isk-0.010
aks-0.009
arl-0.008
ives-0.008
eh-0.008
Token long
Token ID890
Feature activation-5.39e-02

Pos logit contributions
isk+0.054
aks+0.045
ives+0.045
arl+0.044
aid+0.044
Neg logit contributions
el-0.044
inated-0.043
ised-0.043
 sunset-0.043
ement-0.042
Token neck
Token ID7393
Feature activation-2.26e-02

Pos logit contributions
isk+0.115
aid+0.103
ives+0.103
arl+0.101
 backup+0.096
Neg logit contributions
el-0.074
inated-0.072
ised-0.070
 sunset-0.070
 top-0.066
Token as
Token ID355
Feature activation+1.73e-03

Pos logit contributions
isk+0.041
aks+0.034
arl+0.033
 germ+0.032
ives+0.032
Neg logit contributions
ised-0.039
inated-0.038
ement-0.037
el-0.037
work-0.037
Token an
Token ID281
Feature activation+2.01e-02

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.003
aks-0.003
arl-0.003
ives-0.003
eh-0.003
Token umbrella
Token ID25510
Feature activation-2.98e-02

Pos logit contributions
ised+0.039
inated+0.039
ement+0.038
work+0.038
izing+0.036
Neg logit contributions
isk-0.032
aks-0.026
eh-0.026
arl-0.025
aid-0.025
Token!
Token ID0
Feature activation-5.01e-03

Pos logit contributions
isk+0.062
aks+0.056
aid+0.053
arl+0.053
 germ+0.053
Neg logit contributions
work-0.040
ised-0.040
inated-0.039
ement-0.038
el-0.038
Token asked
Token ID1965
Feature activation-3.36e-02

Pos logit contributions
arl+0.064
isk+0.063
aks+0.058
ives+0.056
smart+0.053
Neg logit contributions
inated-0.054
 curiously-0.052
ised-0.051
work-0.050
 sunset-0.050
Token his
Token ID465
Feature activation-1.29e-02

Pos logit contributions
isk+0.071
ives+0.067
coat+0.065
arl+0.065
eh+0.064
Neg logit contributions
 curiously-0.045
 mummy-0.042
 sunset-0.042
el-0.041
ised-0.041
Token mom
Token ID1995
Feature activation-3.42e-02

Pos logit contributions
isk+0.032
aks+0.029
ives+0.028
arl+0.028
aid+0.028
Neg logit contributions
 sunset-0.012
ement-0.012
inated-0.012
work-0.012
ised-0.012
Token for
Token ID329
Feature activation-2.35e-02

Pos logit contributions
isk+0.065
aks+0.063
eh+0.060
aid+0.059
coat+0.057
Neg logit contributions
work-0.050
 int-0.050
 curiously-0.049
el-0.048
ement-0.048
Token a
Token ID257
Feature activation-5.99e-02

Pos logit contributions
isk+0.049
ives+0.042
arl+0.041
aks+0.041
coat+0.040
Neg logit contributions
ised-0.034
el-0.034
ement-0.034
 sunset-0.032
ths-0.031
Token cheerful
Token ID37999
Feature activation-5.87e-02

Pos logit contributions
isk+0.111
aks+0.094
ives+0.087
aked+0.085
arl+0.082
Neg logit contributions
 sunset-0.106
 comet-0.103
 top-0.102
 mummy-0.099
 curiously-0.096
Token sandwich
Token ID20433
Feature activation-3.33e-02

Pos logit contributions
isk+0.119
ives+0.102
arl+0.100
aks+0.100
atters+0.097
Neg logit contributions
 sunset-0.095
 comet-0.085
 sunrise-0.081
 mummy-0.080
ement-0.079
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
isk+0.077
aks+0.070
arl+0.068
eh+0.065
aid+0.065
Neg logit contributions
work-0.035
ement-0.035
el-0.031
ised-0.031
inated-0.031
Token As
Token ID1081
Feature activation+3.68e-03

Pos logit contributions
isk+0.023
aks+0.021
arl+0.020
eh+0.020
ives+0.020
Neg logit contributions
inated-0.018
ised-0.017
ement-0.017
el-0.017
 sunset-0.016
Token he
Token ID339
Feature activation-1.80e-02

Pos logit contributions
ised+0.006
ement+0.006
inated+0.006
work+0.006
el+0.006
Neg logit contributions
isk-0.007
aks-0.006
arl-0.005
ives-0.005
eh-0.005
Token took
Token ID1718
Feature activation-1.71e-02

Pos logit contributions
isk+0.031
arl+0.029
aks+0.027
ives+0.027
 germ+0.026
Neg logit contributions
ised-0.031
el-0.030
ement-0.030
inated-0.030
ths-0.029
Token.
Token ID13
Feature activation-3.13e-03

Pos logit contributions
ised+0.024
inated+0.024
ement+0.024
el+0.022
 sunset+0.022
Neg logit contributions
isk-0.029
aks-0.026
ives-0.024
arl-0.024
aid-0.023
Token They
Token ID1119
Feature activation-8.74e-03

Pos logit contributions
isk+0.005
aks+0.005
arl+0.004
aid+0.004
eh+0.004
Neg logit contributions
inated-0.006
ised-0.006
ement-0.006
work-0.005
el-0.005
Token saw
Token ID2497
Feature activation-2.53e-02

Pos logit contributions
isk+0.016
aks+0.014
arl+0.014
eh+0.013
ives+0.013
Neg logit contributions
ised-0.014
inated-0.014
ement-0.014
work-0.014
el-0.014
Token many
Token ID867
Feature activation-1.66e-02

Pos logit contributions
isk+0.053
arl+0.048
ives+0.046
coat+0.046
aks+0.045
Neg logit contributions
 sunset-0.036
ised-0.034
work-0.034
el-0.034
inated-0.033
Token animals
Token ID4695
Feature activation-2.21e-02

Pos logit contributions
isk+0.034
arl+0.030
ives+0.029
aid+0.029
coat+0.029
Neg logit contributions
ised-0.024
ement-0.024
el-0.023
 sunset-0.023
inated-0.022
Token,
Token ID11
Feature activation-5.97e-02

Pos logit contributions
isk+0.047
arl+0.042
aid+0.040
eh+0.040
aks+0.039
Neg logit contributions
ement-0.030
work-0.029
inated-0.029
ised-0.029
el-0.028
Token like
Token ID588
Feature activation-4.75e-02

Pos logit contributions
arl+0.121
isk+0.121
aks+0.108
aid+0.106
ives+0.104
Neg logit contributions
 sunset-0.075
 curiously-0.074
 brightly-0.074
work-0.073
 closely-0.070
Token lions
Token ID31420
Feature activation-3.75e-02

Pos logit contributions
isk+0.089
arl+0.084
aid+0.079
aks+0.078
eh+0.077
Neg logit contributions
 sunset-0.074
work-0.070
inated-0.069
 diamonds-0.068
el-0.067
Token,
Token ID11
Feature activation-4.32e-02

Pos logit contributions
isk+0.078
arl+0.073
aid+0.073
eh+0.070
 germ+0.070
Neg logit contributions
ement-0.050
ised-0.048
el-0.046
inated-0.045
work-0.045
Token ze
Token ID41271
Feature activation-6.61e-02

Pos logit contributions
isk+0.076
arl+0.075
aks+0.071
aid+0.068
ives+0.065
Neg logit contributions
 sunset-0.067
work-0.065
inated-0.064
 top-0.062
 curiously-0.061
Tokenbr
Token ID1671
Feature activation-3.67e-02

Pos logit contributions
isk+0.102
 germ+0.080
aid+0.076
cloth+0.075
rost+0.074
Neg logit contributions
ement-0.131
ised-0.117
 sunset-0.116
el-0.114
izing-0.114
Token They
Token ID1119
Feature activation-8.61e-03

Pos logit contributions
isk+0.002
aks+0.002
arl+0.002
eh+0.002
ives+0.001
Neg logit contributions
ised-0.003
inated-0.003
ement-0.002
el-0.002
work-0.002
Token had
Token ID550
Feature activation-2.00e-02

Pos logit contributions
isk+0.016
arl+0.013
aks+0.013
ives+0.013
eh+0.013
Neg logit contributions
ised-0.014
inated-0.014
ement-0.014
el-0.014
work-0.014
Token dolls
Token ID36062
Feature activation-5.38e-02

Pos logit contributions
isk+0.043
arl+0.038
ives+0.037
aks+0.036
aid+0.035
Neg logit contributions
ised-0.027
inated-0.026
 sunset-0.026
work-0.025
ement-0.025
Token,
Token ID11
Feature activation-3.81e-02

Pos logit contributions
isk+0.124
eh+0.115
 germ+0.111
arl+0.108
aster+0.108
Neg logit contributions
work-0.059
ised-0.055
ement-0.052
el-0.052
inated-0.049
Token cars
Token ID5006
Feature activation-7.39e-02

Pos logit contributions
isk+0.063
arl+0.056
aks+0.055
eh+0.051
ives+0.051
Neg logit contributions
 sunset-0.066
el-0.066
work-0.063
 comet-0.063
ised-0.063
Token,
Token ID11
Feature activation-5.43e-02

Pos logit contributions
eh+0.185
isk+0.185
 germ+0.182
vous+0.170
aster+0.170
Neg logit contributions
work-0.051
el-0.048
ised-0.047
 once-0.045
ement-0.045
Token blocks
Token ID7021
Feature activation-4.75e-02

Pos logit contributions
isk+0.105
arl+0.093
eh+0.088
aks+0.086
ives+0.085
Neg logit contributions
el-0.081
 sunset-0.079
 diamonds-0.078
 comet-0.075
work-0.074
Token and
Token ID290
Feature activation-5.01e-02

Pos logit contributions
isk+0.110
eh+0.102
 germ+0.099
aid+0.098
aks+0.094
Neg logit contributions
work-0.051
el-0.050
ement-0.049
ised-0.049
inated-0.046
Token books
Token ID3835
Feature activation-4.24e-03

Pos logit contributions
isk+0.094
arl+0.082
aks+0.076
eh+0.073
ives+0.073
Neg logit contributions
 sunset-0.086
ised-0.079
 diamonds-0.079
 comet-0.079
work-0.079
Token.
Token ID13
Feature activation+7.81e-03

Pos logit contributions
isk+0.009
aks+0.008
arl+0.008
eh+0.008
ives+0.008
Neg logit contributions
ised-0.005
ement-0.005
inated-0.005
work-0.005
el-0.005
Token
Token ID198
Feature activation-9.56e-03

Pos logit contributions
ised+0.017
ement+0.017
inated+0.017
work+0.017
el+0.017
Neg logit contributions
isk-0.010
aks-0.008
arl-0.008
ives-0.008
aid-0.007
Token shield
Token ID7614
Feature activation+2.18e-03

Pos logit contributions
ised+0.022
ement+0.022
inated+0.022
work+0.021
el+0.021
Neg logit contributions
isk-0.018
aks-0.016
arl-0.015
eh-0.015
ives-0.015
Token?"
Token ID1701
Feature activation+6.76e-03

Pos logit contributions
ised+0.003
ement+0.003
inated+0.003
work+0.003
el+0.003
Neg logit contributions
isk-0.005
aks-0.004
arl-0.004
ives-0.004
eh-0.004
Token
Token ID198
Feature activation-8.48e-03

Pos logit contributions
ised+0.012
inated+0.012
ement+0.012
work+0.011
el+0.011
Neg logit contributions
isk-0.012
aks-0.010
arl-0.010
ives-0.010
aid-0.009
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
isk+0.017
aks+0.015
eh+0.014
ives+0.014
arl+0.014
Neg logit contributions
ised-0.013
ement-0.012
inated-0.012
el-0.012
 sunset-0.012
TokenTom
Token ID13787
Feature activation-9.25e-03

Pos logit contributions
isk+0.035
coat+0.027
ives+0.027
eh+0.026
 germ+0.026
Neg logit contributions
ised-0.056
inated-0.052
el-0.051
itz-0.050
 curiously-0.049
Token and
Token ID290
Feature activation-4.70e-02

Pos logit contributions
isk+0.018
aks+0.015
aid+0.015
eh+0.015
arl+0.015
Neg logit contributions
ement-0.014
ised-0.014
inated-0.014
el-0.013
work-0.013
Token Mia
Token ID32189
Feature activation+2.00e-02

Pos logit contributions
ives+0.085
aks+0.080
isk+0.078
coat+0.075
akers+0.074
Neg logit contributions
 mother-0.071
el-0.070
 curiously-0.069
 mum-0.068
 mummy-0.065
Token say
Token ID910
Feature activation-1.09e-02

Pos logit contributions
ised+0.046
ement+0.046
inated+0.046
work+0.045
el+0.044
Neg logit contributions
isk-0.023
aks-0.018
ives-0.018
arl-0.018
aid-0.016
Token,
Token ID11
Feature activation-1.69e-02

Pos logit contributions
isk+0.022
aks+0.019
eh+0.018
ives+0.018
coat+0.017
Neg logit contributions
ement-0.017
ised-0.017
inated-0.017
work-0.016
el-0.016
Token "
Token ID366
Feature activation+3.94e-02

Pos logit contributions
isk+0.030
aks+0.028
eh+0.026
ives+0.025
aid+0.025
Neg logit contributions
inated-0.029
ement-0.028
ised-0.028
work-0.027
 sunset-0.027
TokenWe
Token ID1135
Feature activation+7.88e-03

Pos logit contributions
ement+0.076
 curiously+0.069
 int+0.068
 horizon+0.068
 avail+0.068
Neg logit contributions
isk-0.059
aid-0.058
aks-0.057
Rex-0.057
L-0.056