TOP ACTIVATIONS
MAX = 9.163

her toys and showed her friends the beautiful basket she painted
gentle and he showed his daughter how to give the horse
mom and not showed her friend the books he lf .
a big house with his wife . Jim worked very hard
." She smiled and hugged mother . They sat down at
happy to share and be friends . <|endoftext|> Once upon a
back home and showed his friends . Everyone was so impressed
the trees and showed his friends the beautiful flowers and the
to town to show her friends the water . Her friends
music , and Max showed everyone how clever he was with
something special to show her friends . <|endoftext|> Once upon a
again and he showed his friends his cool band - aid
have and give love with friends and family . <|endoftext|> Once
happy that he gave his wife a big hug , and
wear it to show her friends . They all thought it
more fun to share with friends . <|endoftext|> Once upon a
the classroom and showed his friends his impressive hopping skills .
back down and showed his friends . " See
helpful to discuss solutions with friends . <|endoftext|> Once upon a
<|endoftext|> Anna and Ben are friends . They like to play

MIN ACTIVATIONS
MIN = -13.258

them feel secure . The pers istant love of their parents
sur fer . Whenever the sur fer balanced on the board
to him . What a pers istant boy he was !
the oven and taken the las agna . But
from the rocks and the seaw eed , but the net
They climbed off the tr amp oline and hugged each
river . Suddenly , the sw an said , " Step
Suddenly , the tr amp oline collapsed beneath her
best to stay strong and foc ussed , no matter how
" Maybe the sw an was hungry and in
started to understand that the sw an â € ™ s
and terrifying happened . The tr icycle suddenly stopped working !
play with them . The sw an was happy to join
see each other . The sw an on the left flew
the corn field . The st alks looked happy to see
heavy ! Finally , the sw an got help from the
hurt . The sw an is angry . It
her special song . The sw an was so happy to
's wound healed . The sw an thanked him , and
pond ! The sw an felt so happy to

INTERVAL 3.558 - 9.163
CONTAINS 3.131%

the park happily everyday with her aunt .
- cream for Tom and Mia . Tom and
<|endoftext|> Anna and Ben are friends . They like to play
with her mom my and daddy . One day , they
to the forest with her family . They stayed in a

INTERVAL -2.047 - 3.558
CONTAINS 84.282%

see her . They hugged and laughed and Jack told her
was a girl named Grace . She was very graceful and
always ran home on time after that . <|endoftext|> Once upon
every day . <|endoftext|> Once upon a time , there was
day , Jack went fishing with his dad . Jack loved

INTERVAL -7.653 - -2.047
CONTAINS 12.384%

plants . Sara saw a big purple pr une on the
, she saw a big box on the floor . She
both wanted to learn how to spell . So
smiled . Then it was time to go home
man . He swung the axe and chopped off the snow

INTERVAL -13.258 - -7.653
CONTAINS 0.203%

. â € � � Good morning ! â
driver smiled back . � � € � � Don â
in the sun . The cher ries on the tree were
! Sarah tw irled and spun and felt
." The charming le opard did not give up
Token her
Token ID607
Feature activation+2.94e+00

Pos logit contributions
 twins+0.045
aky+0.045
sm+0.044
ady+0.041
 living+0.041
Neg logit contributions
 destination-0.066
OM-0.065
tons-0.062
overed-0.061
 coaster-0.061
Token toys
Token ID14958
Feature activation+2.85e+00

Pos logit contributions
 twins+0.042
sm+0.042
aky+0.038
old+0.038
 who+0.037
Neg logit contributions
OM-0.096
 destination-0.096
tons-0.090
riage-0.089
overed-0.088
Token and
Token ID290
Feature activation+1.82e+00

Pos logit contributions
 twins+0.038
 who+0.032
 of+0.032
 living+0.032
sm+0.032
Neg logit contributions
OM-0.106
 destination-0.098
riage-0.097
tons-0.095
 coaster-0.094
Token showed
Token ID3751
Feature activation+2.59e+00

Pos logit contributions
 twins+0.029
sm+0.028
 who+0.025
aky+0.025
 of+0.025
Neg logit contributions
OM-0.059
 destination-0.056
tons-0.054
riage-0.054
 coaster-0.053
Token her
Token ID607
Feature activation+5.08e+00

Pos logit contributions
 twins+0.040
aky+0.038
sm+0.038
 living+0.035
able+0.034
Neg logit contributions
 destination-0.081
OM-0.080
overed-0.076
olog-0.075
riage-0.075
Token friends
Token ID2460
Feature activation+9.16e+00

Pos logit contributions
sm+0.077
 twins+0.077
aky+0.076
 cared+0.070
able+0.070
Neg logit contributions
 destination-0.160
OM-0.156
riage-0.148
 coaster-0.145
tons-0.145
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
 twins+0.135
sm+0.120
aky+0.111
 living+0.109
 cared+0.109
Neg logit contributions
OM-0.310
 destination-0.295
riage-0.286
tons-0.283
 coaster-0.282
Token beautiful
Token ID4950
Feature activation-1.20e+00

Pos logit contributions
sm+0.033
old+0.031
aky+0.030
 twins+0.030
ons+0.030
Neg logit contributions
 destination-0.039
OM-0.038
tons-0.035
 slope-0.034
 coaster-0.034
Token basket
Token ID7988
Feature activation+2.14e-01

Pos logit contributions
OM+0.035
 destination+0.033
tons+0.031
riage+0.031
 coaster+0.031
Neg logit contributions
 twins-0.024
sm-0.022
 who-0.021
 living-0.021
aky-0.021
Token she
Token ID673
Feature activation+3.23e-01

Pos logit contributions
 twins+0.003
sm+0.003
aky+0.003
 living+0.003
ons+0.003
Neg logit contributions
OM-0.007
 destination-0.007
overed-0.006
riage-0.006
tons-0.006
Token painted
Token ID13055
Feature activation+6.02e-01

Pos logit contributions
 twins+0.004
sm+0.004
 of+0.004
 who+0.004
 living+0.004
Neg logit contributions
OM-0.011
 destination-0.011
riage-0.010
tons-0.010
 coaster-0.010
Token gentle
Token ID10296
Feature activation+9.50e-01

Pos logit contributions
OM+0.001
 destination+0.001
tons+0.001
riage+0.001
 coaster+0.001
Neg logit contributions
 twins-0.001
sm-0.001
ons-0.001
 living-0.001
 sisters-0.001
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
 twins+0.013
sm+0.011
ons+0.010
 living+0.010
 sisters+0.010
Neg logit contributions
OM-0.033
 destination-0.032
tons-0.031
 coaster-0.031
overed-0.030
Token he
Token ID339
Feature activation-1.37e+00

Pos logit contributions
 twins+0.027
 who+0.025
sm+0.025
 of+0.024
 living+0.024
Neg logit contributions
OM-0.064
 destination-0.059
ACK-0.057
riage-0.057
 coaster-0.056
Token showed
Token ID3751
Feature activation+2.93e+00

Pos logit contributions
OM+0.044
 destination+0.040
riage+0.039
tons+0.039
 coaster+0.038
Neg logit contributions
 twins-0.025
sm-0.023
 who-0.022
 living-0.022
 of-0.021
Token his
Token ID465
Feature activation+2.93e+00

Pos logit contributions
 twins+0.047
aky+0.039
 living+0.038
 sisters+0.038
sm+0.036
Neg logit contributions
OM-0.092
 destination-0.091
tons-0.088
overed-0.087
olog-0.086
Token daughter
Token ID4957
Feature activation+9.15e+00

Pos logit contributions
 twins+0.051
sm+0.049
ons+0.047
aky+0.046
 who+0.044
Neg logit contributions
 destination-0.086
OM-0.085
tons-0.080
 coaster-0.079
riage-0.078
Token how
Token ID703
Feature activation+1.40e+00

Pos logit contributions
 twins+0.174
sm+0.160
aky+0.156
able+0.150
 cared+0.147
Neg logit contributions
 destination-0.254
OM-0.250
tons-0.238
riage-0.237
 coaster-0.231
Token to
Token ID284
Feature activation-2.48e+00

Pos logit contributions
 twins+0.025
 sisters+0.021
ons+0.021
aky+0.021
sm+0.020
Neg logit contributions
OM-0.040
 destination-0.038
tons-0.038
overed-0.037
riage-0.037
Token give
Token ID1577
Feature activation+1.40e+00

Pos logit contributions
OM+0.071
 destination+0.064
riage+0.063
overed+0.062
ACK+0.062
Neg logit contributions
 twins-0.052
 who-0.050
sm-0.049
 of-0.048
 living-0.047
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
 twins+0.028
 living+0.025
sm+0.025
 sisters+0.025
aky+0.024
Neg logit contributions
 destination-0.037
OM-0.037
tons-0.036
riage-0.034
olog-0.034
Token horse
Token ID8223
Feature activation+2.98e+00

Pos logit contributions
ons+0.035
sm+0.032
 cared+0.031
 twins+0.031
unts+0.031
Neg logit contributions
 destination-0.047
tons-0.045
OM-0.044
 coaster-0.042
ER-0.041
Token mom
Token ID1995
Feature activation+4.84e+00

Pos logit contributions
 twins+0.048
sm+0.041
 who+0.040
 living+0.040
 of+0.039
Neg logit contributions
OM-0.112
 destination-0.102
riage-0.100
overed-0.099
 coaster-0.099
Token and
Token ID290
Feature activation+2.30e+00

Pos logit contributions
 twins+0.074
sm+0.073
aks+0.065
aky+0.065
ons+0.064
Neg logit contributions
OM-0.148
 destination-0.146
riage-0.141
tons-0.138
 surely-0.136
Token not
Token ID407
Feature activation-4.14e-01

Pos logit contributions
sm+0.044
 twins+0.044
aky+0.040
able+0.040
ons+0.039
Neg logit contributions
OM-0.066
 destination-0.062
riage-0.059
tons-0.058
olog-0.058
Token showed
Token ID3751
Feature activation+2.33e+00

Pos logit contributions
OM+0.013
 destination+0.012
riage+0.012
tons+0.012
overed+0.012
Neg logit contributions
 twins-0.008
 living-0.007
 who-0.007
sm-0.007
 of-0.007
Token her
Token ID607
Feature activation+5.46e+00

Pos logit contributions
 twins+0.029
aky+0.025
sm+0.025
 living+0.023
 cared+0.022
Neg logit contributions
OM-0.082
 destination-0.082
tons-0.078
riage-0.077
ACK-0.076
Token friend
Token ID1545
Feature activation+9.15e+00

Pos logit contributions
 twins+0.095
sm+0.090
aky+0.088
 cared+0.086
ons+0.086
Neg logit contributions
 destination-0.161
OM-0.159
riage-0.150
tons-0.148
ths-0.144
Token the
Token ID262
Feature activation+1.51e+00

Pos logit contributions
 twins+0.151
sm+0.133
aky+0.128
 living+0.125
 sisters+0.125
Neg logit contributions
OM-0.283
 destination-0.276
riage-0.269
tons-0.265
 coaster-0.256
Token books
Token ID3835
Feature activation-2.05e-01

Pos logit contributions
 twins+0.026
sm+0.025
aky+0.024
 who+0.023
old+0.023
Neg logit contributions
OM-0.048
 destination-0.046
 coaster-0.043
riage-0.042
ACK-0.041
Tokenhe
Token ID258
Feature activation-4.79e-01

Pos logit contributions
OM+0.007
 destination+0.007
riage+0.007
ACK+0.006
tons+0.006
Neg logit contributions
 twins-0.003
 of-0.003
 who-0.003
sm-0.003
 everywhere-0.003
Tokenlf
Token ID1652
Feature activation+9.96e-01

Pos logit contributions
OM+0.008
 destination+0.006
 coaster+0.006
riage+0.006
tons+0.006
Neg logit contributions
 twins-0.016
sm-0.015
 who-0.015
 living-0.015
aky-0.015
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
 twins+0.015
sm+0.014
 living+0.013
 who+0.013
 sisters+0.012
Neg logit contributions
OM-0.033
 destination-0.032
riage-0.030
tons-0.030
 coaster-0.030
Token a
Token ID257
Feature activation+1.57e+00

Pos logit contributions
OM+0.008
tons+0.007
 coaster+0.007
 destination+0.007
 surely+0.007
Neg logit contributions
as-0.005
aky-0.005
ons-0.005
old-0.005
sm-0.005
Token big
Token ID1263
Feature activation-2.92e-01

Pos logit contributions
 twins+0.021
sm+0.019
ons+0.018
 of+0.018
 cared+0.017
Neg logit contributions
OM-0.055
 destination-0.053
tons-0.049
 coaster-0.049
riage-0.049
Token house
Token ID2156
Feature activation+1.18e+00

Pos logit contributions
OM+0.009
 destination+0.008
riage+0.008
tons+0.008
 coaster+0.008
Neg logit contributions
 twins-0.006
sm-0.005
 who-0.005
 of-0.005
 cared-0.005
Token with
Token ID351
Feature activation+4.01e+00

Pos logit contributions
 twins+0.021
sm+0.019
 sisters+0.018
ons+0.017
aky+0.017
Neg logit contributions
OM-0.037
 destination-0.034
riage-0.033
tons-0.033
overed-0.033
Token his
Token ID465
Feature activation+2.84e+00

Pos logit contributions
sm+0.066
able+0.065
 who+0.063
 twins+0.063
aky+0.061
Neg logit contributions
OM-0.108
 destination-0.101
ACK-0.099
 coaster-0.097
raper-0.097
Token wife
Token ID3656
Feature activation+9.08e+00

Pos logit contributions
sm+0.051
ons+0.048
 cared+0.046
aky+0.046
 of+0.046
Neg logit contributions
OM-0.081
 destination-0.080
tons-0.074
castle-0.072
 coaster-0.071
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
sm+0.210
and+0.206
old+0.205
lessness+0.199
 cared+0.196
Neg logit contributions
 destination-0.180
riage-0.169
tons-0.164
OM-0.153
castle-0.148
Token Jim
Token ID5395
Feature activation-3.51e-01

Pos logit contributions
 twins+0.003
sm+0.003
aky+0.003
ons+0.003
 sisters+0.003
Neg logit contributions
OM-0.006
 destination-0.005
riage-0.005
overed-0.005
tons-0.005
Token worked
Token ID3111
Feature activation-3.93e+00

Pos logit contributions
OM+0.010
tons+0.009
 destination+0.009
 coaster+0.009
riage+0.009
Neg logit contributions
 twins-0.006
sm-0.006
 sisters-0.006
 everywhere-0.006
 mice-0.006
Token very
Token ID845
Feature activation+1.15e+00

Pos logit contributions
OM+0.133
 destination+0.123
riage+0.120
tons+0.118
 coaster+0.117
Neg logit contributions
 twins-0.064
sm-0.058
 living-0.055
 who-0.055
 of-0.054
Token hard
Token ID1327
Feature activation-1.83e+00

Pos logit contributions
 twins+0.015
 many+0.015
sm+0.014
 of+0.014
 everywhere+0.014
Neg logit contributions
OM-0.042
 destination-0.040
riage-0.039
ACK-0.039
 coaster-0.039
Token."
Token ID526
Feature activation+1.53e-01

Pos logit contributions
 twins+0.006
sm+0.005
 living+0.005
 who+0.005
 of+0.005
Neg logit contributions
OM-0.013
 destination-0.012
riage-0.012
tons-0.012
 coaster-0.012
Token She
Token ID1375
Feature activation+1.69e-01

Pos logit contributions
sm+0.002
aky+0.002
able+0.002
 cared+0.002
ons+0.002
Neg logit contributions
OM-0.005
 destination-0.005
tons-0.005
riage-0.005
ACK-0.005
Token smiled
Token ID13541
Feature activation+2.94e+00

Pos logit contributions
 twins+0.003
 who+0.002
sm+0.002
 living+0.002
 of+0.002
Neg logit contributions
OM-0.006
 destination-0.006
riage-0.006
tons-0.005
 slope-0.005
Token and
Token ID290
Feature activation+1.98e+00

Pos logit contributions
ons+0.045
sm+0.042
 twins+0.041
 cared+0.037
old+0.037
Neg logit contributions
OM-0.094
 destination-0.092
overed-0.086
tons-0.086
riage-0.086
Token hugged
Token ID41130
Feature activation+3.27e+00

Pos logit contributions
 twins+0.026
sm+0.022
 who+0.022
 living+0.021
 of+0.020
Neg logit contributions
OM-0.074
 destination-0.070
riage-0.067
tons-0.067
 coaster-0.066
Token mother
Token ID2802
Feature activation+8.96e+00

Pos logit contributions
 twins+0.048
sm+0.042
 who+0.040
 living+0.040
 of+0.039
Neg logit contributions
OM-0.116
 destination-0.108
riage-0.104
tons-0.103
 coaster-0.103
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
ons+0.147
 twins+0.146
sm+0.142
 living+0.136
aky+0.133
Neg logit contributions
OM-0.256
 destination-0.256
riage-0.250
tons-0.247
 coaster-0.236
Token They
Token ID1119
Feature activation+1.63e-01

Pos logit contributions
sm+0.017
ons+0.017
 twins+0.016
able+0.016
aky+0.015
Neg logit contributions
OM-0.047
 destination-0.043
riage-0.043
tons-0.042
overed-0.042
Token sat
Token ID3332
Feature activation-1.80e+00

Pos logit contributions
 twins+0.003
sm+0.002
 living+0.002
 of+0.002
ons+0.002
Neg logit contributions
OM-0.005
 destination-0.005
tons-0.005
riage-0.005
 surely-0.005
Token down
Token ID866
Feature activation-2.83e+00

Pos logit contributions
OM+0.058
 destination+0.053
riage+0.051
 coaster+0.051
olog+0.051
Neg logit contributions
 twins-0.035
sm-0.030
 who-0.030
 living-0.030
 of-0.029
Token at
Token ID379
Feature activation-1.32e+00

Pos logit contributions
OM+0.106
 destination+0.098
 coaster+0.096
riage+0.095
tons+0.095
Neg logit contributions
 twins-0.039
sm-0.034
 of-0.034
 living-0.033
 who-0.032
Token happy
Token ID3772
Feature activation+1.83e+00

Pos logit contributions
 twins+0.011
sm+0.011
ons+0.011
aky+0.010
old+0.010
Neg logit contributions
OM-0.016
 destination-0.015
tons-0.015
 coaster-0.014
 surely-0.014
Token to
Token ID284
Feature activation-1.53e+00

Pos logit contributions
 twins+0.023
aky+0.023
sm+0.023
ons+0.022
 who+0.021
Neg logit contributions
OM-0.063
 destination-0.060
 coaster-0.058
overed-0.057
tons-0.057
Token share
Token ID2648
Feature activation+2.86e+00

Pos logit contributions
OM+0.056
overed+0.051
 destination+0.051
 coaster+0.050
tons+0.050
Neg logit contributions
 twins-0.022
 of-0.019
 who-0.019
 love-0.019
sm-0.019
Token and
Token ID290
Feature activation+3.31e+00

Pos logit contributions
 twins+0.053
aky+0.053
sm+0.052
 sisters+0.049
 cared+0.048
Neg logit contributions
 destination-0.077
OM-0.076
tons-0.073
 coaster-0.073
riage-0.069
Token be
Token ID307
Feature activation+1.42e-01

Pos logit contributions
 twins+0.076
sm+0.073
aky+0.069
ons+0.069
 who+0.069
Neg logit contributions
OM-0.085
 destination-0.077
riage-0.074
 coaster-0.073
tons-0.073
Token friends
Token ID2460
Feature activation+8.94e+00

Pos logit contributions
 twins+0.002
 everywhere+0.002
ons+0.002
sm+0.002
 living+0.002
Neg logit contributions
OM-0.004
 destination-0.004
riage-0.004
ACK-0.004
olog-0.004
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 twins+0.139
sm+0.123
 living+0.118
ons+0.118
 sisters+0.117
Neg logit contributions
OM-0.291
 destination-0.274
tons-0.270
riage-0.266
 coaster-0.264
Token<|endoftext|>
Token ID50256
Feature activation+4.61e+00

Pos logit contributions
aky+0.018
ons+0.017
 seen+0.017
able+0.017
 sisters+0.016
Neg logit contributions
OM-0.030
tons-0.030
riage-0.028
olog-0.028
 destination-0.027
TokenOnce
Token ID7454
Feature activation+1.27e+00

Pos logit contributions
 sisters+0.066
 cared+0.064
 twins+0.061
aky+0.061
sm+0.060
Neg logit contributions
OM-0.151
 destination-0.137
tons-0.135
 rink-0.133
overed-0.133
Token upon
Token ID2402
Feature activation+1.39e+00

Pos logit contributions
 cared+0.026
sm+0.025
 mice+0.024
 twins+0.024
 chickens+0.024
Neg logit contributions
OM-0.033
 destination-0.032
tons-0.030
 surely-0.030
overed-0.029
Token a
Token ID257
Feature activation+7.61e-01

Pos logit contributions
 kittens+0.033
 sisters+0.033
ann+0.033
ons+0.032
ison+0.032
Neg logit contributions
OM-0.031
 coaster-0.024
 rink-0.023
 top-0.022
ACK-0.022
Token back
Token ID736
Feature activation-6.36e-01

Pos logit contributions
 twins+0.006
 sisters+0.005
sm+0.005
 cared+0.005
aky+0.005
Neg logit contributions
OM-0.007
 destination-0.007
 surely-0.007
 coaster-0.007
olog-0.007
Token home
Token ID1363
Feature activation+1.22e+00

Pos logit contributions
OM+0.023
 destination+0.021
riage+0.021
tons+0.020
 coaster+0.020
Neg logit contributions
 twins-0.009
sm-0.008
 living-0.007
 who-0.007
 of-0.007
Token and
Token ID290
Feature activation+9.25e-01

Pos logit contributions
 twins+0.013
sm+0.011
 who+0.011
 of+0.011
 living+0.011
Neg logit contributions
OM-0.048
 destination-0.045
riage-0.045
tons-0.044
ACK-0.043
Token showed
Token ID3751
Feature activation+2.66e+00

Pos logit contributions
 twins+0.015
 who+0.014
 living+0.013
 cared+0.013
 of+0.013
Neg logit contributions
OM-0.033
 destination-0.030
riage-0.029
tons-0.028
overed-0.028
Token his
Token ID465
Feature activation+2.91e+00

Pos logit contributions
 twins+0.057
sm+0.052
 living+0.051
aky+0.050
 sisters+0.049
Neg logit contributions
OM-0.072
 destination-0.069
tons-0.065
riage-0.065
 coaster-0.065
Token friends
Token ID2460
Feature activation+8.93e+00

Pos logit contributions
 twins+0.046
sm+0.044
 who+0.041
aky+0.041
 living+0.040
Neg logit contributions
OM-0.093
 destination-0.091
riage-0.085
tons-0.085
 coaster-0.084
Token.
Token ID13
Feature activation-7.83e-02

Pos logit contributions
 twins+0.150
sm+0.135
aky+0.129
 cared+0.126
 living+0.125
Neg logit contributions
OM-0.280
 destination-0.270
tons-0.258
riage-0.258
 coaster-0.254
Token Everyone
Token ID11075
Feature activation+4.52e+00

Pos logit contributions
OM+0.003
 destination+0.003
riage+0.002
tons+0.002
overed+0.002
Neg logit contributions
 twins-0.001
sm-0.001
 who-0.001
 living-0.001
aky-0.001
Token was
Token ID373
Feature activation+5.35e-01

Pos logit contributions
 twins+0.087
sm+0.079
aky+0.076
 sisters+0.074
ons+0.074
Neg logit contributions
OM-0.124
 destination-0.116
overed-0.116
tons-0.114
 coaster-0.114
Token so
Token ID523
Feature activation+2.06e+00

Pos logit contributions
 twins+0.007
sm+0.007
ons+0.007
able+0.006
 living+0.006
Neg logit contributions
OM-0.018
 destination-0.017
tons-0.017
overed-0.016
 surely-0.016
Token impressed
Token ID12617
Feature activation+2.01e+00

Pos logit contributions
 twins+0.031
sm+0.028
ons+0.028
 living+0.028
 who+0.027
Neg logit contributions
OM-0.067
 destination-0.063
overed-0.062
 surely-0.060
tons-0.060
Token the
Token ID262
Feature activation+5.67e-01

Pos logit contributions
 twins+0.021
aky+0.020
sm+0.020
able+0.019
ady+0.018
Neg logit contributions
 destination-0.026
OM-0.024
 slope-0.023
 peak-0.023
tons-0.023
Token trees
Token ID7150
Feature activation+7.08e-01

Pos logit contributions
 twins+0.011
sm+0.010
 who+0.009
 of+0.009
ons+0.009
Neg logit contributions
OM-0.017
 destination-0.017
riage-0.015
 coaster-0.015
tons-0.015
Token and
Token ID290
Feature activation+5.87e-01

Pos logit contributions
 twins+0.013
sm+0.013
aky+0.011
 sisters+0.011
ons+0.011
Neg logit contributions
OM-0.021
 destination-0.020
overed-0.019
tons-0.019
 surely-0.019
Token showed
Token ID3751
Feature activation+2.40e+00

Pos logit contributions
 twins+0.007
sm+0.006
 who+0.006
 living+0.006
 of+0.005
Neg logit contributions
OM-0.022
 destination-0.021
riage-0.020
tons-0.020
 coaster-0.020
Token his
Token ID465
Feature activation+2.80e+00

Pos logit contributions
 twins+0.046
sm+0.044
aky+0.044
 living+0.042
able+0.040
Neg logit contributions
 destination-0.066
OM-0.063
overed-0.061
 coaster-0.059
riage-0.059
Token friends
Token ID2460
Feature activation+8.89e+00

Pos logit contributions
sm+0.024
 twins+0.023
 who+0.020
ons+0.020
 of+0.019
Neg logit contributions
 destination-0.109
OM-0.109
 coaster-0.101
riage-0.100
tons-0.100
Token the
Token ID262
Feature activation+1.52e+00

Pos logit contributions
 twins+0.172
sm+0.160
aky+0.156
 cared+0.152
able+0.148
Neg logit contributions
OM-0.250
 destination-0.240
tons-0.229
riage-0.229
overed-0.226
Token beautiful
Token ID4950
Feature activation-1.34e+00

Pos logit contributions
sm+0.031
ons+0.030
 twins+0.030
old+0.029
 who+0.028
Neg logit contributions
OM-0.042
 destination-0.042
 coaster-0.038
tons-0.037
 slope-0.037
Token flowers
Token ID12734
Feature activation+2.97e+00

Pos logit contributions
OM+0.042
riage+0.038
 destination+0.038
olog+0.037
tons+0.037
Neg logit contributions
 twins-0.025
 living-0.023
sm-0.023
 of-0.023
 who-0.022
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
 twins+0.053
sm+0.052
aky+0.049
able+0.047
 cared+0.047
Neg logit contributions
 destination-0.083
OM-0.081
overed-0.080
castle-0.078
tons-0.076
Token the
Token ID262
Feature activation+5.94e-03

Pos logit contributions
sm+0.036
 twins+0.034
aky+0.033
ons+0.031
able+0.031
Neg logit contributions
 destination-0.056
OM-0.056
riage-0.054
castle-0.052
 coaster-0.052
Token to
Token ID284
Feature activation-4.58e-01

Pos logit contributions
OM+0.004
 destination+0.004
riage+0.004
tons+0.004
 coaster+0.004
Neg logit contributions
 twins-0.002
sm-0.002
 living-0.002
 who-0.002
 of-0.002
Token town
Token ID3240
Feature activation+4.45e-01

Pos logit contributions
OM+0.016
 destination+0.015
riage+0.015
tons+0.015
 coaster+0.014
Neg logit contributions
 twins-0.006
sm-0.006
aky-0.005
 who-0.005
able-0.005
Token to
Token ID284
Feature activation-9.86e-01

Pos logit contributions
 twins+0.006
 who+0.005
sm+0.005
 living+0.005
 of+0.005
Neg logit contributions
OM-0.017
 destination-0.015
riage-0.015
tons-0.015
ACK-0.015
Token show
Token ID905
Feature activation+3.88e+00

Pos logit contributions
OM+0.031
 destination+0.028
riage+0.027
 coaster+0.027
overed+0.027
Neg logit contributions
 twins-0.019
sm-0.017
 living-0.017
 who-0.017
 of-0.016
Token her
Token ID607
Feature activation+5.14e+00

Pos logit contributions
 twins+0.044
aky+0.042
sm+0.040
able+0.035
ak+0.035
Neg logit contributions
 destination-0.138
OM-0.135
tons-0.132
riage-0.129
overed-0.127
Token friends
Token ID2460
Feature activation+8.89e+00

Pos logit contributions
 twins+0.087
aky+0.085
sm+0.083
ons+0.079
able+0.078
Neg logit contributions
 destination-0.152
OM-0.150
riage-0.141
tons-0.139
 surely-0.136
Token the
Token ID262
Feature activation+1.50e+00

Pos logit contributions
 twins+0.152
aky+0.136
sm+0.134
 cared+0.129
ons+0.126
Neg logit contributions
OM-0.264
 destination-0.261
riage-0.251
tons-0.250
 coaster-0.245
Token water
Token ID1660
Feature activation-1.40e+00

Pos logit contributions
 twins+0.029
sm+0.027
 who+0.026
ons+0.025
 living+0.025
Neg logit contributions
OM-0.045
 destination-0.042
riage-0.040
tons-0.040
 coaster-0.040
Token.
Token ID13
Feature activation+6.78e-01

Pos logit contributions
OM+0.050
 destination+0.047
riage+0.045
tons+0.045
overed+0.044
Neg logit contributions
 twins-0.020
sm-0.017
 who-0.016
 living-0.016
 of-0.015
Token Her
Token ID2332
Feature activation-3.96e-01

Pos logit contributions
aky+0.014
able+0.012
ak+0.011
 twins+0.011
 sisters+0.011
Neg logit contributions
OM-0.019
overed-0.018
tons-0.018
 destination-0.018
olog-0.017
Token friends
Token ID2460
Feature activation+4.42e+00

Pos logit contributions
OM+0.016
 destination+0.015
riage+0.015
tons+0.015
 coaster+0.015
Neg logit contributions
 twins-0.004
sm-0.003
 who-0.003
 living-0.003
 of-0.003
Token music
Token ID2647
Feature activation+5.81e-01

Pos logit contributions
OM+0.023
 destination+0.022
tons+0.021
riage+0.021
 coaster+0.021
Neg logit contributions
 twins-0.011
sm-0.010
 living-0.009
 of-0.009
ons-0.009
Token,
Token ID11
Feature activation-7.23e-02

Pos logit contributions
 twins+0.008
 who+0.007
 living+0.006
sm+0.006
 of+0.006
Neg logit contributions
OM-0.022
 destination-0.020
riage-0.020
ACK-0.019
tons-0.019
Token and
Token ID290
Feature activation+9.36e-01

Pos logit contributions
OM+0.002
 destination+0.002
tons+0.002
 coaster+0.002
riage+0.002
Neg logit contributions
 twins-0.001
sm-0.001
ons-0.001
aky-0.001
 living-0.001
Token Max
Token ID5436
Feature activation+2.74e-01

Pos logit contributions
 twins+0.015
sm+0.014
 living+0.013
 who+0.013
 of+0.013
Neg logit contributions
OM-0.031
 destination-0.029
riage-0.028
tons-0.028
 coaster-0.027
Token showed
Token ID3751
Feature activation+2.68e+00

Pos logit contributions
 twins+0.005
sm+0.004
 living+0.004
 who+0.004
 of+0.004
Neg logit contributions
OM-0.009
 destination-0.008
riage-0.008
tons-0.008
 coaster-0.008
Token everyone
Token ID2506
Feature activation+8.86e+00

Pos logit contributions
 twins+0.047
sm+0.045
aky+0.044
 living+0.041
ons+0.040
Neg logit contributions
 destination-0.079
OM-0.077
tons-0.073
 coaster-0.072
olog-0.071
Token how
Token ID703
Feature activation+9.61e-01

Pos logit contributions
aky+0.176
 twins+0.173
unts+0.158
sm+0.158
aks+0.158
Neg logit contributions
 destination-0.214
tons-0.212
 coaster-0.210
OM-0.209
riage-0.205
Token clever
Token ID14169
Feature activation-8.88e-01

Pos logit contributions
 twins+0.016
sm+0.014
aky+0.014
 who+0.013
 sisters+0.013
Neg logit contributions
OM-0.030
 destination-0.028
riage-0.027
overed-0.027
 coaster-0.027
Token he
Token ID339
Feature activation-3.40e-01

Pos logit contributions
OM+0.022
 coaster+0.021
tons+0.021
 destination+0.021
 surely+0.020
Neg logit contributions
sm-0.018
aky-0.017
ady-0.017
 twins-0.017
 living-0.016
Token was
Token ID373
Feature activation-1.83e-01

Pos logit contributions
OM+0.013
 destination+0.012
riage+0.011
tons+0.011
ACK+0.011
Neg logit contributions
 twins-0.004
 who-0.004
 cared-0.004
sm-0.004
 living-0.004
Token with
Token ID351
Feature activation+3.16e+00

Pos logit contributions
OM+0.006
 destination+0.006
riage+0.006
 coaster+0.006
tons+0.006
Neg logit contributions
 twins-0.003
sm-0.002
 living-0.002
ons-0.002
 sisters-0.002
Token something
Token ID1223
Feature activation+3.23e+00

Pos logit contributions
 twins+0.017
sm+0.015
aky+0.014
 living+0.014
 sisters+0.014
Neg logit contributions
OM-0.033
 destination-0.032
riage-0.030
 coaster-0.030
overed-0.030
Token special
Token ID2041
Feature activation+1.21e+00

Pos logit contributions
 twins+0.059
ons+0.052
sm+0.051
aky+0.050
aks+0.050
Neg logit contributions
OM-0.095
 destination-0.092
tons-0.089
olog-0.086
 surely-0.085
Token to
Token ID284
Feature activation-7.32e-01

Pos logit contributions
 twins+0.018
sm+0.016
 who+0.015
 living+0.015
 of+0.015
Neg logit contributions
OM-0.043
 destination-0.040
riage-0.038
tons-0.038
 coaster-0.038
Token show
Token ID905
Feature activation+3.81e+00

Pos logit contributions
OM+0.024
 destination+0.022
overed+0.021
olog+0.021
ACK+0.021
Neg logit contributions
 twins-0.013
 who-0.012
 of-0.012
sm-0.012
 living-0.012
Token her
Token ID607
Feature activation+4.95e+00

Pos logit contributions
 twins+0.042
sm+0.038
 living+0.034
 who+0.032
aky+0.032
Neg logit contributions
OM-0.145
 destination-0.138
riage-0.133
tons-0.131
olog-0.130
Token friends
Token ID2460
Feature activation+8.86e+00

Pos logit contributions
sm+0.076
aky+0.076
 twins+0.074
aks+0.073
able+0.069
Neg logit contributions
 destination-0.157
OM-0.153
riage-0.142
tons-0.140
 coaster-0.140
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 twins+0.148
sm+0.133
aky+0.126
 cared+0.120
ons+0.120
Neg logit contributions
OM-0.279
 destination-0.270
tons-0.259
riage-0.258
 coaster-0.253
Token<|endoftext|>
Token ID50256
Feature activation+4.52e+00

Pos logit contributions
aky+0.002
 twins+0.002
sm+0.002
aks+0.002
ons+0.002
Neg logit contributions
OM-0.004
olog-0.004
 destination-0.004
riage-0.004
ACK-0.004
TokenOnce
Token ID7454
Feature activation+1.04e+00

Pos logit contributions
 sisters+0.075
aky+0.072
 cared+0.068
 twins+0.067
 kittens+0.066
Neg logit contributions
OM-0.139
 destination-0.124
 coaster-0.123
 rink-0.121
tons-0.121
Token upon
Token ID2402
Feature activation+1.19e+00

Pos logit contributions
 chickens+0.022
 cared+0.022
sm+0.021
 mice+0.021
 twins+0.021
Neg logit contributions
OM-0.026
 destination-0.026
riage-0.023
overed-0.023
 surely-0.023
Token a
Token ID257
Feature activation+5.47e-01

Pos logit contributions
 kittens+0.027
ison+0.027
 sisters+0.027
able+0.027
as+0.027
Neg logit contributions
OM-0.027
 coaster-0.022
 rink-0.020
 top-0.020
tons-0.020
Token again
Token ID757
Feature activation-1.75e-01

Pos logit contributions
 twins+0.051
sm+0.049
aky+0.048
aks+0.047
ons+0.047
Neg logit contributions
OM-0.073
 destination-0.068
tons-0.066
 coaster-0.065
overed-0.065
Token and
Token ID290
Feature activation-2.53e-02

Pos logit contributions
OM+0.007
 destination+0.006
riage+0.006
overed+0.006
olog+0.006
Neg logit contributions
 twins-0.002
 who-0.002
sm-0.002
 living-0.002
 of-0.002
Token he
Token ID339
Feature activation-1.88e+00

Pos logit contributions
OM+0.001
 destination+0.001
riage+0.001
ACK+0.001
overed+0.001
Neg logit contributions
 twins-0.000
 who-0.000
 living-0.000
 of-0.000
sm-0.000
Token showed
Token ID3751
Feature activation+2.84e+00

Pos logit contributions
OM+0.067
 destination+0.063
riage+0.061
tons+0.060
overed+0.060
Neg logit contributions
 twins-0.027
sm-0.024
 who-0.024
 living-0.023
 of-0.022
Token his
Token ID465
Feature activation+2.96e+00

Pos logit contributions
 twins+0.043
sm+0.038
 living+0.036
 who+0.036
 of+0.034
Neg logit contributions
OM-0.098
 destination-0.092
riage-0.089
tons-0.088
 coaster-0.087
Token friends
Token ID2460
Feature activation+8.85e+00

Pos logit contributions
sm+0.060
 twins+0.059
 who+0.056
aky+0.054
 living+0.054
Neg logit contributions
OM-0.082
 destination-0.078
tons-0.073
 coaster-0.072
riage-0.071
Token his
Token ID465
Feature activation+2.46e+00

Pos logit contributions
 twins+0.167
aky+0.157
sm+0.149
aks+0.146
 sisters+0.145
Neg logit contributions
OM-0.248
 destination-0.239
 coaster-0.230
tons-0.229
riage-0.228
Token cool
Token ID3608
Feature activation-1.34e+00

Pos logit contributions
sm+0.044
 twins+0.043
 who+0.040
aky+0.040
aks+0.039
Neg logit contributions
OM-0.074
 destination-0.070
 coaster-0.067
tons-0.066
riage-0.063
Token band
Token ID4097
Feature activation-2.15e+00

Pos logit contributions
OM+0.039
 destination+0.036
riage+0.035
tons+0.034
 coaster+0.034
Neg logit contributions
 twins-0.028
sm-0.026
 living-0.025
 who-0.025
 of-0.024
Token-
Token ID12
Feature activation-2.64e+00

Pos logit contributions
OM+0.049
 coaster+0.045
overed+0.044
ER+0.043
castle+0.041
Neg logit contributions
 twins-0.048
 mice-0.047
 chickens-0.046
 pigs-0.045
 kittens-0.045
Tokenaid
Token ID1698
Feature activation+9.54e-02

Pos logit contributions
OM+0.081
 coaster+0.074
olog+0.071
late+0.068
riage+0.068
Neg logit contributions
 twins-0.050
 sisters-0.045
 living-0.043
 everywhere-0.043
packs-0.039
Token have
Token ID423
Feature activation+6.20e-01

Pos logit contributions
OM+0.098
 destination+0.088
 coaster+0.085
riage+0.085
overed+0.085
Neg logit contributions
 twins-0.075
 who-0.069
sm-0.068
 living-0.068
 of-0.067
Token and
Token ID290
Feature activation+2.18e+00

Pos logit contributions
 twins+0.011
sm+0.011
aky+0.010
ons+0.010
 everywhere+0.010
Neg logit contributions
 destination-0.017
OM-0.017
 surely-0.016
ths-0.016
tons-0.016
Token give
Token ID1577
Feature activation+1.08e+00

Pos logit contributions
 twins+0.046
sm+0.044
aky+0.040
ons+0.040
 living+0.040
Neg logit contributions
OM-0.060
 destination-0.057
riage-0.053
tons-0.053
 surely-0.053
Token love
Token ID1842
Feature activation+3.89e+00

Pos logit contributions
 twins+0.024
sm+0.022
aky+0.021
 living+0.021
 who+0.020
Neg logit contributions
 destination-0.027
OM-0.027
riage-0.025
tons-0.024
 surely-0.024
Token with
Token ID351
Feature activation+3.70e+00

Pos logit contributions
 twins+0.060
sm+0.052
 living+0.048
ons+0.048
 sisters+0.048
Neg logit contributions
OM-0.131
 destination-0.124
riage-0.119
tons-0.119
overed-0.116
Token friends
Token ID2460
Feature activation+8.85e+00

Pos logit contributions
 twins+0.059
aky+0.055
sm+0.053
aks+0.050
able+0.049
Neg logit contributions
 destination-0.106
OM-0.105
tons-0.103
riage-0.101
overed-0.096
Token and
Token ID290
Feature activation+3.46e+00

Pos logit contributions
 twins+0.144
sm+0.124
 living+0.117
 sisters+0.116
ons+0.114
Neg logit contributions
OM-0.283
 destination-0.270
tons-0.266
riage-0.263
 coaster-0.256
Token family
Token ID1641
Feature activation+7.51e+00

Pos logit contributions
sm+0.057
 twins+0.056
aky+0.054
able+0.051
aks+0.050
Neg logit contributions
OM-0.105
 destination-0.102
riage-0.094
tons-0.094
 surely-0.094
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
 twins+0.127
aky+0.116
sm+0.112
ons+0.111
aks+0.106
Neg logit contributions
 destination-0.209
tons-0.204
OM-0.203
riage-0.201
 surely-0.192
Token<|endoftext|>
Token ID50256
Feature activation+3.87e+00

Pos logit contributions
OM+0.011
riage+0.010
 destination+0.010
tons+0.009
ACK+0.009
Neg logit contributions
aky-0.005
ons-0.005
 twins-0.005
aks-0.005
sm-0.005
TokenOnce
Token ID7454
Feature activation+9.71e-01

Pos logit contributions
 twins+0.061
 sisters+0.059
aky+0.058
 cared+0.057
sm+0.057
Neg logit contributions
OM-0.123
 destination-0.111
tons-0.109
overed-0.108
 coaster-0.107
Token happy
Token ID3772
Feature activation+1.72e+00

Pos logit contributions
 twins+0.026
sm+0.022
 living+0.021
 who+0.021
ons+0.020
Neg logit contributions
OM-0.083
 destination-0.077
riage-0.075
tons-0.075
olog-0.074
Token that
Token ID326
Feature activation-9.87e-01

Pos logit contributions
 twins+0.029
sm+0.026
 living+0.025
 who+0.024
aky+0.024
Neg logit contributions
OM-0.056
 destination-0.052
tons-0.050
riage-0.050
overed-0.050
Token he
Token ID339
Feature activation-2.09e+00

Pos logit contributions
OM+0.032
 destination+0.029
riage+0.028
tons+0.028
 coaster+0.028
Neg logit contributions
 twins-0.017
sm-0.016
 who-0.015
 living-0.015
 of-0.014
Token gave
Token ID2921
Feature activation+1.88e+00

Pos logit contributions
OM+0.082
 destination+0.076
riage+0.076
ACK+0.074
raper+0.073
Neg logit contributions
 who-0.025
 cared-0.024
 twins-0.024
 of-0.023
 were-0.022
Token his
Token ID465
Feature activation+2.50e+00

Pos logit contributions
 twins+0.028
sm+0.025
 living+0.024
aky+0.023
 who+0.023
Neg logit contributions
 destination-0.059
OM-0.059
tons-0.056
 surely-0.056
riage-0.055
Token wife
Token ID3656
Feature activation+8.84e+00

Pos logit contributions
 twins+0.044
sm+0.040
 living+0.039
 who+0.038
 of+0.038
Neg logit contributions
OM-0.079
 destination-0.074
riage-0.071
tons-0.071
 coaster-0.070
Token a
Token ID257
Feature activation+2.45e+00

Pos logit contributions
 twins+0.147
unts+0.131
 living+0.122
aky+0.121
 sisters+0.119
Neg logit contributions
 destination-0.254
OM-0.253
 coaster-0.244
tons-0.234
riage-0.233
Token big
Token ID1263
Feature activation-1.31e-01

Pos logit contributions
 twins+0.043
ons+0.038
aky+0.037
 who+0.036
 mice+0.035
Neg logit contributions
OM-0.075
 destination-0.071
 surely-0.069
 coaster-0.067
ER-0.067
Token hug
Token ID16225
Feature activation+3.03e+00

Pos logit contributions
OM+0.005
 destination+0.005
riage+0.004
tons+0.004
 coaster+0.004
Neg logit contributions
 twins-0.002
sm-0.001
 who-0.001
 living-0.001
ons-0.001
Token,
Token ID11
Feature activation+1.88e-01

Pos logit contributions
 twins+0.035
sm+0.030
 who+0.029
 of+0.028
 living+0.028
Neg logit contributions
OM-0.117
 destination-0.109
riage-0.107
tons-0.105
overed-0.105
Token and
Token ID290
Feature activation+2.10e+00

Pos logit contributions
 twins+0.003
sm+0.002
ons+0.002
 living+0.002
aky+0.002
Neg logit contributions
OM-0.006
 destination-0.006
tons-0.006
 coaster-0.006
riage-0.006
Token wear
Token ID5806
Feature activation+1.02e+00

Pos logit contributions
OM+0.065
riage+0.060
overed+0.060
 destination+0.059
ths+0.057
Neg logit contributions
 twins-0.043
 of-0.043
 who-0.042
 love-0.040
 living-0.039
Token it
Token ID340
Feature activation+1.48e+00

Pos logit contributions
 twins+0.021
sm+0.019
 who+0.019
 living+0.018
 of+0.018
Neg logit contributions
OM-0.030
 destination-0.028
riage-0.027
tons-0.026
 coaster-0.026
Token to
Token ID284
Feature activation-8.13e-01

Pos logit contributions
 twins+0.023
 who+0.021
sm+0.020
 of+0.020
 everywhere+0.020
Neg logit contributions
OM-0.052
 destination-0.047
riage-0.047
tons-0.046
 coaster-0.046
Token show
Token ID905
Feature activation+3.25e+00

Pos logit contributions
OM+0.028
overed+0.025
 coaster+0.025
 destination+0.025
ths+0.025
Neg logit contributions
 twins-0.013
 who-0.012
 love-0.012
 of-0.012
 many-0.012
Token her
Token ID607
Feature activation+5.22e+00

Pos logit contributions
 twins+0.032
sm+0.027
 living+0.025
 who+0.025
 of+0.024
Neg logit contributions
OM-0.130
 destination-0.123
riage-0.119
tons-0.118
 coaster-0.118
Token friends
Token ID2460
Feature activation+8.83e+00

Pos logit contributions
sm+0.061
 twins+0.061
aky+0.059
able+0.055
 cared+0.054
Neg logit contributions
 destination-0.186
OM-0.182
riage-0.171
 coaster-0.170
castle-0.168
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
 twins+0.148
sm+0.134
aky+0.129
 cared+0.127
 sisters+0.121
Neg logit contributions
OM-0.270
 destination-0.263
riage-0.256
tons-0.250
 coaster-0.249
Token They
Token ID1119
Feature activation-1.87e-01

Pos logit contributions
aky+0.015
sm+0.015
 twins+0.014
able+0.013
ons+0.012
Neg logit contributions
OM-0.034
 destination-0.032
overed-0.031
olog-0.031
riage-0.030
Token all
Token ID477
Feature activation+1.06e+00

Pos logit contributions
OM+0.005
 destination+0.005
riage+0.005
tons+0.005
olog+0.005
Neg logit contributions
 twins-0.004
sm-0.003
 living-0.003
able-0.003
aks-0.003
Token thought
Token ID1807
Feature activation+2.09e+00

Pos logit contributions
 twins+0.021
sm+0.019
 living+0.018
 mice+0.018
ons+0.018
Neg logit contributions
OM-0.029
riage-0.027
 destination-0.027
overed-0.026
 coaster-0.026
Token it
Token ID340
Feature activation-4.51e-01

Pos logit contributions
 twins+0.042
sm+0.038
 living+0.037
 who+0.036
aky+0.035
Neg logit contributions
OM-0.062
 destination-0.057
tons-0.055
riage-0.055
overed-0.054
Token more
Token ID517
Feature activation+1.32e+00

Pos logit contributions
 twins+0.014
sm+0.012
 living+0.012
 who+0.012
 of+0.012
Neg logit contributions
OM-0.028
 destination-0.025
riage-0.025
 coaster-0.025
tons-0.024
Token fun
Token ID1257
Feature activation+1.97e-01

Pos logit contributions
 twins+0.022
sm+0.020
 who+0.019
 living+0.019
 of+0.018
Neg logit contributions
OM-0.044
 destination-0.041
riage-0.039
tons-0.039
 coaster-0.039
Token to
Token ID284
Feature activation-2.95e+00

Pos logit contributions
 twins+0.003
sm+0.003
 living+0.003
 who+0.003
 of+0.003
Neg logit contributions
OM-0.007
 destination-0.006
riage-0.006
tons-0.006
 coaster-0.006
Token share
Token ID2648
Feature activation+2.84e+00

Pos logit contributions
OM+0.100
overed+0.093
ACK+0.091
riage+0.091
olog+0.090
Neg logit contributions
 love-0.046
 twins-0.045
 who-0.045
 of-0.043
 living-0.042
Token with
Token ID351
Feature activation+3.93e+00

Pos logit contributions
 twins+0.044
sm+0.039
 living+0.037
 who+0.037
 of+0.036
Neg logit contributions
OM-0.097
 destination-0.091
riage-0.087
tons-0.087
 coaster-0.086
Token friends
Token ID2460
Feature activation+8.82e+00

Pos logit contributions
 twins+0.061
sm+0.057
aky+0.053
 living+0.053
 everywhere+0.051
Neg logit contributions
OM-0.122
 destination-0.118
tons-0.114
riage-0.112
ths-0.109
Token.
Token ID13
Feature activation+6.42e-01

Pos logit contributions
 twins+0.133
sm+0.120
 living+0.110
 sisters+0.109
ons+0.107
Neg logit contributions
OM-0.287
 destination-0.275
tons-0.268
riage-0.266
 coaster-0.261
Token<|endoftext|>
Token ID50256
Feature activation+3.59e+00

Pos logit contributions
 twins+0.008
sm+0.008
 everywhere+0.007
aky+0.007
ons+0.007
Neg logit contributions
OM-0.023
riage-0.021
tons-0.021
 destination-0.021
ACK-0.020
TokenOnce
Token ID7454
Feature activation+1.02e+00

Pos logit contributions
 sisters+0.047
 twins+0.046
 cared+0.042
aky+0.042
sm+0.041
Neg logit contributions
OM-0.125
 destination-0.114
 coaster-0.112
riage-0.110
overed-0.109
Token upon
Token ID2402
Feature activation+6.78e-01

Pos logit contributions
sm+0.020
 cared+0.019
 twins+0.019
 kisses+0.019
 chickens+0.019
Neg logit contributions
OM-0.028
 destination-0.027
overed-0.025
riage-0.025
 surely-0.025
Token a
Token ID257
Feature activation+8.57e-01

Pos logit contributions
ons+0.014
 sisters+0.014
 kittens+0.014
old+0.014
ison+0.014
Neg logit contributions
OM-0.016
 coaster-0.014
 rink-0.013
tons-0.013
ACK-0.013
Token the
Token ID262
Feature activation-2.00e-01

Pos logit contributions
OM+0.043
olog+0.039
overed+0.039
riage+0.039
 destination+0.038
Neg logit contributions
 twins-0.013
 who-0.012
 living-0.011
 of-0.011
sm-0.011
Token classroom
Token ID15806
Feature activation+1.48e-01

Pos logit contributions
OM+0.006
riage+0.006
overed+0.006
olog+0.006
ths+0.005
Neg logit contributions
 twins-0.004
 living-0.004
 who-0.004
 sisters-0.004
 of-0.004
Token and
Token ID290
Feature activation+8.62e-01

Pos logit contributions
 twins+0.002
 who+0.001
 of+0.001
 living+0.001
sm+0.001
Neg logit contributions
OM-0.006
riage-0.005
 destination-0.005
tons-0.005
ACK-0.005
Token showed
Token ID3751
Feature activation+2.44e+00

Pos logit contributions
 who+0.012
 living+0.011
 twins+0.011
 cared+0.011
 everywhere+0.011
Neg logit contributions
OM-0.032
 destination-0.029
riage-0.029
umm-0.029
tons-0.029
Token his
Token ID465
Feature activation+2.94e+00

Pos logit contributions
 twins+0.043
sm+0.039
 living+0.037
 who+0.037
 of+0.035
Neg logit contributions
OM-0.078
 destination-0.074
riage-0.070
tons-0.070
 coaster-0.069
Token friends
Token ID2460
Feature activation+8.82e+00

Pos logit contributions
 twins+0.022
sm+0.019
 who+0.016
 living+0.016
 of+0.015
Neg logit contributions
OM-0.122
 destination-0.117
riage-0.112
tons-0.111
 coaster-0.111
Token his
Token ID465
Feature activation+2.48e+00

Pos logit contributions
 twins+0.163
sm+0.147
aky+0.139
 living+0.137
ons+0.136
Neg logit contributions
OM-0.267
 destination-0.254
riage-0.242
tons-0.241
 coaster-0.238
Token impressive
Token ID8036
Feature activation-6.43e-01

Pos logit contributions
 twins+0.038
sm+0.036
 who+0.034
aky+0.033
 of+0.033
Neg logit contributions
OM-0.082
 destination-0.079
riage-0.074
 coaster-0.074
tons-0.073
Token hopping
Token ID47153
Feature activation-3.84e+00

Pos logit contributions
OM+0.020
tons+0.018
riage+0.017
 destination+0.017
olog+0.017
Neg logit contributions
 twins-0.013
 living-0.012
sm-0.011
 of-0.011
 who-0.011
Token skills
Token ID4678
Feature activation+6.09e-01

Pos logit contributions
OM+0.119
riage+0.107
 destination+0.105
tons+0.104
ths+0.103
Neg logit contributions
 twins-0.074
 of-0.072
 living-0.071
sm-0.069
 everywhere-0.069
Token.
Token ID13
Feature activation-3.52e-02

Pos logit contributions
 twins+0.007
sm+0.006
 who+0.006
 of+0.006
 living+0.006
Neg logit contributions
OM-0.023
 destination-0.021
riage-0.021
tons-0.021
 coaster-0.021
Token back
Token ID736
Feature activation-1.14e+00

Pos logit contributions
OM+0.078
riage+0.069
 destination+0.068
tons+0.067
olog+0.066
Neg logit contributions
 twins-0.054
 everywhere-0.052
 of-0.052
sm-0.051
 who-0.050
Token down
Token ID866
Feature activation-1.51e+00

Pos logit contributions
OM+0.040
 destination+0.037
riage+0.036
tons+0.036
olog+0.036
Neg logit contributions
 twins-0.017
 living-0.015
sm-0.015
 who-0.015
 of-0.015
Token and
Token ID290
Feature activation+1.26e-01

Pos logit contributions
OM+0.059
 destination+0.055
riage+0.055
tons+0.054
 coaster+0.053
Neg logit contributions
 twins-0.017
 of-0.016
sm-0.015
 living-0.014
 who-0.014
Token showed
Token ID3751
Feature activation+2.50e+00

Pos logit contributions
 who+0.002
 everywhere+0.002
 living+0.002
 twins+0.002
 cared+0.002
Neg logit contributions
OM-0.005
riage-0.004
raper-0.004
 destination-0.004
tons-0.004
Token his
Token ID465
Feature activation+2.88e+00

Pos logit contributions
 twins+0.049
sm+0.045
 living+0.043
 who+0.042
 of+0.041
Neg logit contributions
OM-0.075
 destination-0.070
tons-0.067
riage-0.067
 coaster-0.066
Token friends
Token ID2460
Feature activation+8.79e+00

Pos logit contributions
sm+0.028
 twins+0.026
 who+0.024
able+0.023
aky+0.023
Neg logit contributions
 destination-0.109
OM-0.108
 coaster-0.100
tons-0.100
castle-0.099
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
 twins+0.149
sm+0.137
aky+0.133
 cared+0.128
able+0.125
Neg logit contributions
OM-0.266
 destination-0.261
riage-0.248
tons-0.247
 coaster-0.246
Token
Token ID198
Feature activation+8.94e-01

Pos logit contributions
OM+0.004
 destination+0.004
tons+0.003
overed+0.003
riage+0.003
Neg logit contributions
sm-0.002
 twins-0.002
aky-0.001
able-0.001
ons-0.001
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
 twins+0.010
sm+0.009
 cared+0.009
able+0.009
 sisters+0.009
Neg logit contributions
OM-0.033
 destination-0.031
overed-0.030
 coaster-0.030
olog-0.030
Token"
Token ID1
Feature activation-1.26e+00

Pos logit contributions
OM+0.022
 destination+0.021
riage+0.021
ACK+0.020
tons+0.020
Neg logit contributions
 twins-0.011
 who-0.011
 living-0.011
 of-0.011
sm-0.010
TokenSee
Token ID6214
Feature activation-3.07e+00

Pos logit contributions
OM+0.035
 destination+0.032
riage+0.030
tons+0.030
 coaster+0.030
Neg logit contributions
 twins-0.028
sm-0.026
 living-0.025
 who-0.025
 of-0.025
Token helpful
Token ID7613
Feature activation-4.63e-01

Pos logit contributions
OM+0.014
 destination+0.013
riage+0.012
 coaster+0.012
overed+0.012
Neg logit contributions
 twins-0.008
sm-0.007
 living-0.007
 who-0.007
 of-0.007
Token to
Token ID284
Feature activation-5.46e-01

Pos logit contributions
OM+0.013
 destination+0.013
riage+0.012
tons+0.011
olog+0.011
Neg logit contributions
sm-0.009
 twins-0.008
aky-0.008
ons-0.008
 who-0.008
Token discuss
Token ID2112
Feature activation+3.10e+00

Pos logit contributions
OM+0.017
 destination+0.015
riage+0.015
 coaster+0.015
overed+0.014
Neg logit contributions
 twins-0.011
sm-0.010
 who-0.010
 living-0.010
 of-0.009
Token solutions
Token ID8136
Feature activation+1.23e+00

Pos logit contributions
 twins+0.050
sm+0.045
 who+0.045
 living+0.044
 of+0.043
Neg logit contributions
OM-0.106
 destination-0.097
riage-0.094
overed-0.093
tons-0.093
Token with
Token ID351
Feature activation+3.44e+00

Pos logit contributions
 twins+0.020
sm+0.018
 sisters+0.017
ons+0.017
 cared+0.016
Neg logit contributions
OM-0.039
 destination-0.038
riage-0.036
tons-0.036
overed-0.035
Token friends
Token ID2460
Feature activation+8.78e+00

Pos logit contributions
 twins+0.057
sm+0.051
aky+0.049
ons+0.047
 sisters+0.047
Neg logit contributions
OM-0.101
 destination-0.100
tons-0.096
riage-0.094
 surely-0.091
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
 twins+0.127
sm+0.112
ons+0.101
 sisters+0.100
 living+0.098
Neg logit contributions
OM-0.296
 destination-0.285
tons-0.277
riage-0.276
 coaster-0.267
Token<|endoftext|>
Token ID50256
Feature activation+3.78e+00

Pos logit contributions
aky+0.002
 twins+0.002
 sisters+0.002
sm+0.002
able+0.002
Neg logit contributions
OM-0.004
riage-0.004
 destination-0.004
tons-0.004
olog-0.004
TokenOnce
Token ID7454
Feature activation+8.06e-01

Pos logit contributions
 sisters+0.052
 twins+0.050
aky+0.049
 cared+0.047
sm+0.046
Neg logit contributions
OM-0.129
 destination-0.118
 coaster-0.115
tons-0.114
overed-0.114
Token upon
Token ID2402
Feature activation+8.13e-01

Pos logit contributions
sm+0.017
 cared+0.017
 chickens+0.017
 mice+0.017
 kittens+0.016
Neg logit contributions
 destination-0.020
OM-0.020
riage-0.018
tons-0.018
overed-0.018
Token a
Token ID257
Feature activation+6.16e-01

Pos logit contributions
 kittens+0.018
 sisters+0.018
able+0.018
ison+0.018
ons+0.018
Neg logit contributions
OM-0.019
 coaster-0.016
 rink-0.015
 skate-0.014
tons-0.014
Token<|endoftext|>
Token ID50256
Feature activation+4.82e+00

Pos logit contributions
sm+0.023
ons+0.023
 twins+0.023
aky+0.022
 everywhere+0.022
Neg logit contributions
OM-0.086
riage-0.079
 destination-0.077
tons-0.077
ACK-0.076
TokenAnna
Token ID31160
Feature activation+1.89e+00

Pos logit contributions
 twins+0.088
sm+0.085
 cared+0.085
 sisters+0.084
 who+0.083
Neg logit contributions
OM-0.138
 destination-0.123
tons-0.121
riage-0.121
overed-0.120
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 twins+0.031
sm+0.029
 of+0.026
 sisters+0.026
 who+0.026
Neg logit contributions
OM-0.060
 destination-0.056
tons-0.054
riage-0.054
 coaster-0.053
Token Ben
Token ID3932
Feature activation+1.64e+00

Pos logit contributions
sm+0.024
aky+0.023
 twins+0.022
ons+0.022
 living+0.021
Neg logit contributions
OM-0.028
tons-0.026
riage-0.025
 surely-0.025
 destination-0.024
Token are
Token ID389
Feature activation+1.50e+00

Pos logit contributions
 twins+0.026
sm+0.024
ons+0.022
 of+0.022
 sisters+0.022
Neg logit contributions
OM-0.052
 destination-0.049
riage-0.048
tons-0.048
olog-0.047
Token friends
Token ID2460
Feature activation+8.78e+00

Pos logit contributions
sm+0.030
able+0.028
 cared+0.027
ons+0.027
jo+0.026
Neg logit contributions
 destination-0.038
tons-0.036
OM-0.036
ths-0.034
riage-0.033
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ons+0.182
sm+0.177
 mice+0.173
 know+0.173
 kittens+0.173
Neg logit contributions
tons-0.187
 destination-0.177
overed-0.174
OM-0.173
riage-0.166
Token They
Token ID1119
Feature activation+4.85e-01

Pos logit contributions
ons+0.023
sm+0.021
 twins+0.019
 everywhere+0.019
aky+0.019
Neg logit contributions
tons-0.038
OM-0.036
riage-0.036
 destination-0.034
overed-0.033
Token like
Token ID588
Feature activation+2.40e+00

Pos logit contributions
sm+0.010
 twins+0.010
ons+0.010
 mice+0.009
 kittens+0.009
Neg logit contributions
OM-0.012
 surely-0.011
tons-0.011
riage-0.011
 destination-0.011
Token to
Token ID284
Feature activation-3.45e+00

Pos logit contributions
aky+0.057
ady+0.053
sm+0.051
 twins+0.050
able+0.050
Neg logit contributions
OM-0.054
 coaster-0.050
ER-0.049
 roller-0.048
 slope-0.048
Token play
Token ID711
Feature activation-3.38e-01

Pos logit contributions
OM+0.102
 destination+0.094
riage+0.090
 coaster+0.089
tons+0.089
Neg logit contributions
 twins-0.069
sm-0.065
 living-0.061
 who-0.061
 everywhere-0.059
Token them
Token ID606
Feature activation+4.29e+00

Pos logit contributions
 twins+0.007
sm+0.007
 who+0.006
 living+0.006
 of+0.006
Neg logit contributions
OM-0.017
 destination-0.015
riage-0.015
tons-0.015
 coaster-0.015
Token feel
Token ID1254
Feature activation+3.37e+00

Pos logit contributions
 twins+0.073
sm+0.068
 living+0.064
 who+0.063
 of+0.061
Neg logit contributions
OM-0.139
 destination-0.130
riage-0.124
tons-0.123
 coaster-0.122
Token secure
Token ID5713
Feature activation-1.02e+00

Pos logit contributions
 twins+0.066
sm+0.061
ons+0.059
 who+0.058
 living+0.058
Neg logit contributions
OM-0.095
 destination-0.091
tons-0.086
 surely-0.086
riage-0.086
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
OM+0.041
 destination+0.037
riage+0.037
olog+0.036
 coaster+0.036
Neg logit contributions
 twins-0.012
 who-0.010
 living-0.010
 of-0.009
sm-0.009
Token The
Token ID383
Feature activation-1.07e+00

Pos logit contributions
 twins+0.003
sm+0.003
 living+0.003
aky+0.003
 everywhere+0.002
Neg logit contributions
OM-0.009
 destination-0.008
tons-0.008
riage-0.008
 coaster-0.008
Token pers
Token ID2774
Feature activation-1.33e+01

Pos logit contributions
OM+0.035
 destination+0.032
riage+0.031
overed+0.031
tons+0.031
Neg logit contributions
 twins-0.019
 living-0.017
sm-0.017
 who-0.017
 of-0.016
Tokenistant
Token ID10167
Feature activation-4.99e+00

Pos logit contributions
OM+0.276
tons+0.273
ACK+0.272
 destination+0.265
 slope+0.251
Neg logit contributions
sm-0.327
able-0.323
ons-0.322
aks-0.320
 twins-0.315
Token love
Token ID1842
Feature activation+1.41e+00

Pos logit contributions
OM+0.156
riage+0.139
 destination+0.138
overed+0.137
olog+0.136
Neg logit contributions
 twins-0.099
 of-0.088
 living-0.086
 who-0.085
sm-0.084
Token of
Token ID286
Feature activation+8.63e-01

Pos logit contributions
 twins+0.021
sm+0.019
 who+0.018
 living+0.018
 of+0.017
Neg logit contributions
OM-0.049
 destination-0.046
riage-0.045
tons-0.044
 coaster-0.044
Token their
Token ID511
Feature activation+6.96e-01

Pos logit contributions
 twins+0.014
 who+0.013
 living+0.013
sm+0.013
 of+0.012
Neg logit contributions
OM-0.030
 destination-0.027
riage-0.026
olog-0.026
ACK-0.026
Token parents
Token ID3397
Feature activation+5.22e+00

Pos logit contributions
 twins+0.014
sm+0.013
 living+0.013
 who+0.012
 of+0.012
Neg logit contributions
OM-0.021
 destination-0.019
riage-0.018
tons-0.018
 coaster-0.018
Token sur
Token ID969
Feature activation-1.10e+01

Pos logit contributions
OM+0.009
olog+0.009
ACK+0.008
ider+0.008
riage+0.008
Neg logit contributions
 twins-0.008
 living-0.007
 of-0.007
 many-0.007
 love-0.006
Tokenfer
Token ID2232
Feature activation+1.67e-01

Pos logit contributions
OM+0.171
 destination+0.166
tons+0.159
late+0.153
olog+0.151
Neg logit contributions
 twins-0.352
 who-0.349
sm-0.338
ons-0.338
 of-0.334
Token.
Token ID13
Feature activation-4.93e-01

Pos logit contributions
 twins+0.003
sm+0.003
 who+0.003
 living+0.003
 of+0.003
Neg logit contributions
OM-0.005
 destination-0.005
riage-0.005
tons-0.005
 coaster-0.005
Token Whenever
Token ID21326
Feature activation+5.17e-01

Pos logit contributions
OM+0.017
 destination+0.016
riage+0.015
overed+0.015
olog+0.015
Neg logit contributions
 twins-0.007
sm-0.007
ons-0.006
aky-0.006
able-0.006
Token the
Token ID262
Feature activation-2.75e+00

Pos logit contributions
 twins+0.008
sm+0.007
aky+0.007
 living+0.006
 who+0.006
Neg logit contributions
OM-0.017
 destination-0.017
tons-0.016
riage-0.016
overed-0.016
Token sur
Token ID969
Feature activation-1.31e+01

Pos logit contributions
OM+0.068
olog+0.066
late+0.064
ACK+0.063
riage+0.061
Neg logit contributions
 twins-0.075
 living-0.067
 friends-0.065
 family-0.065
 of-0.065
Tokenfer
Token ID2232
Feature activation-2.68e+00

Pos logit contributions
 destination+0.214
OM+0.205
late+0.201
tons+0.196
olog+0.193
Neg logit contributions
 who-0.409
 twins-0.408
ons-0.392
sm-0.387
 cared-0.384
Token balanced
Token ID12974
Feature activation-5.93e+00

Pos logit contributions
OM+0.089
riage+0.080
 destination+0.080
overed+0.078
tons+0.078
Neg logit contributions
 twins-0.045
 who-0.042
 of-0.041
 cared-0.040
 living-0.040
Token on
Token ID319
Feature activation-1.09e+00

Pos logit contributions
OM+0.217
olog+0.206
riage+0.200
ACK+0.199
raper+0.198
Neg logit contributions
 of-0.085
 twins-0.077
 who-0.071
 were-0.069
ons-0.065
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
OM+0.039
 destination+0.036
riage+0.035
olog+0.035
 coaster+0.035
Neg logit contributions
 twins-0.015
sm-0.013
 who-0.013
 living-0.013
 of-0.013
Token board
Token ID3096
Feature activation-3.97e+00

Pos logit contributions
OM+0.036
olog+0.034
ths+0.033
overed+0.033
riage+0.032
Neg logit contributions
 twins-0.028
 living-0.027
 everywhere-0.025
 who-0.024
 many-0.024
Token to
Token ID284
Feature activation+2.66e-01

Pos logit contributions
 twins+0.003
 of+0.003
 who+0.003
sm+0.003
 living+0.003
Neg logit contributions
OM-0.011
riage-0.010
 coaster-0.010
olog-0.010
 destination-0.010
Token him
Token ID683
Feature activation+2.28e+00

Pos logit contributions
 twins+0.004
sm+0.003
 who+0.003
 living+0.003
 of+0.003
Neg logit contributions
OM-0.010
 destination-0.009
riage-0.009
tons-0.009
 coaster-0.009
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
 twins+0.028
 who+0.026
 living+0.024
 of+0.024
sm+0.024
Neg logit contributions
OM-0.089
 destination-0.079
riage-0.079
ACK-0.078
 coaster-0.077
Token What
Token ID1867
Feature activation+5.97e-02

Pos logit contributions
 twins+0.005
sm+0.005
aky+0.004
 who+0.004
 living+0.004
Neg logit contributions
OM-0.013
 destination-0.012
tons-0.012
riage-0.011
overed-0.011
Token a
Token ID257
Feature activation-7.18e-01

Pos logit contributions
 twins+0.001
sm+0.001
 living+0.001
 everywhere+0.001
aky+0.001
Neg logit contributions
OM-0.002
tons-0.002
 surely-0.002
 destination-0.002
riage-0.002
Token pers
Token ID2774
Feature activation-1.27e+01

Pos logit contributions
OM+0.025
 destination+0.023
riage+0.023
overed+0.023
tons+0.022
Neg logit contributions
 twins-0.011
sm-0.010
 living-0.010
 who-0.010
 of-0.009
Tokenistant
Token ID10167
Feature activation-4.30e+00

Pos logit contributions
ACK+0.268
 destination+0.260
tons+0.258
OM+0.254
castle+0.237
Neg logit contributions
sm-0.312
ons-0.312
aks-0.307
ak-0.304
able-0.296
Token boy
Token ID2933
Feature activation+1.14e+00

Pos logit contributions
OM+0.134
riage+0.122
overed+0.121
 destination+0.120
olog+0.119
Neg logit contributions
 twins-0.084
 of-0.080
 who-0.074
sm-0.073
able-0.073
Token he
Token ID339
Feature activation-1.44e+00

Pos logit contributions
 twins+0.019
sm+0.017
 living+0.016
 who+0.016
 of+0.016
Neg logit contributions
OM-0.038
 destination-0.036
riage-0.034
tons-0.034
 coaster-0.034
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
OM+0.052
 destination+0.048
tons+0.047
overed+0.047
riage+0.047
Neg logit contributions
 twins-0.020
 who-0.018
sm-0.018
 living-0.017
 of-0.017
Token!
Token ID0
Feature activation+2.71e-01

Pos logit contributions
OM+0.032
 destination+0.029
riage+0.028
tons+0.028
olog+0.028
Neg logit contributions
 twins-0.019
sm-0.017
 who-0.017
 living-0.017
 of-0.016
Token the
Token ID262
Feature activation+3.32e-02

Pos logit contributions
OM+0.012
 destination+0.011
riage+0.011
tons+0.011
 coaster+0.011
Neg logit contributions
 twins-0.005
sm-0.005
 living-0.005
 who-0.005
 of-0.004
Token oven
Token ID14361
Feature activation-8.34e-01

Pos logit contributions
 twins+0.001
sm+0.001
 living+0.001
 who+0.001
 of+0.001
Neg logit contributions
OM-0.001
 destination-0.001
riage-0.001
tons-0.001
overed-0.001
Token and
Token ID290
Feature activation+7.08e-01

Pos logit contributions
OM+0.032
tons+0.030
riage+0.030
 destination+0.029
overed+0.029
Neg logit contributions
 twins-0.009
 of-0.008
 living-0.008
 who-0.008
 everywhere-0.008
Token taken
Token ID2077
Feature activation-1.13e+00

Pos logit contributions
 twins+0.012
 who+0.011
 living+0.011
sm+0.011
 of+0.011
Neg logit contributions
OM-0.024
 destination-0.022
riage-0.021
tons-0.021
 coaster-0.021
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
OM+0.035
 destination+0.034
 surely+0.032
olog+0.032
ACK+0.032
Neg logit contributions
 twins-0.020
sm-0.017
 who-0.017
 sisters-0.016
 living-0.016
Token las
Token ID39990
Feature activation-1.25e+01

Pos logit contributions
 twins+0.004
sm+0.004
 who+0.004
 living+0.004
 sisters+0.004
Neg logit contributions
OM-0.007
 destination-0.006
riage-0.006
 coaster-0.006
tons-0.006
Tokenagna
Token ID48669
Feature activation-3.86e+00

Pos logit contributions
OM+0.243
tons+0.230
overed+0.228
riage+0.222
 destination+0.209
Neg logit contributions
 twins-0.367
 living-0.335
sm-0.333
 who-0.328
 cared-0.324
Token.
Token ID13
Feature activation-7.34e-03

Pos logit contributions
OM+0.131
 destination+0.122
riage+0.117
tons+0.116
 coaster+0.116
Neg logit contributions
 twins-0.061
sm-0.055
 who-0.052
 living-0.052
 of-0.050
Token
Token ID198
Feature activation+7.99e-01

Pos logit contributions
OM+0.000
 destination+0.000
tons+0.000
riage+0.000
ACK+0.000
Neg logit contributions
 twins-0.000
sm-0.000
aky-0.000
 everywhere-0.000
 living-0.000
Token
Token ID198
Feature activation-7.72e-01

Pos logit contributions
 twins+0.012
 sisters+0.011
 cared+0.011
 kittens+0.011
able+0.010
Neg logit contributions
OM-0.026
 rink-0.023
overed-0.023
 destination-0.023
 coaster-0.023
TokenBut
Token ID1537
Feature activation-3.77e+00

Pos logit contributions
OM+0.024
 destination+0.022
tons+0.021
 coaster+0.021
riage+0.021
Neg logit contributions
 twins-0.014
sm-0.013
 living-0.012
 who-0.012
 everywhere-0.012
Token from
Token ID422
Feature activation+3.12e-01

Pos logit contributions
 of+0.003
 everywhere+0.003
 who+0.002
 many+0.002
 were+0.002
Neg logit contributions
OM-0.008
riage-0.008
 coaster-0.007
raper-0.007
 destination-0.007
Token the
Token ID262
Feature activation-5.30e-01

Pos logit contributions
 twins+0.003
 who+0.003
 everywhere+0.003
 her+0.003
 living+0.003
Neg logit contributions
OM-0.013
olog-0.012
riage-0.012
ider-0.012
overed-0.011
Token rocks
Token ID12586
Feature activation-2.28e+00

Pos logit contributions
OM+0.017
riage+0.015
overed+0.015
olog+0.015
 destination+0.015
Neg logit contributions
 twins-0.011
 living-0.010
 everywhere-0.009
 who-0.009
 of-0.009
Token and
Token ID290
Feature activation+7.14e-01

Pos logit contributions
OM+0.078
 destination+0.072
riage+0.071
 coaster+0.069
tons+0.069
Neg logit contributions
 twins-0.035
sm-0.032
 of-0.032
 who-0.032
 living-0.031
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
 twins+0.014
 who+0.013
 of+0.013
 living+0.013
 everywhere+0.012
Neg logit contributions
OM-0.022
riage-0.020
overed-0.020
 destination-0.020
 coaster-0.019
Token seaw
Token ID32085
Feature activation-1.25e+01

Pos logit contributions
OM+0.036
 destination+0.032
riage+0.032
olog+0.031
overed+0.031
Neg logit contributions
 twins-0.021
 living-0.018
sm-0.018
 who-0.018
 of-0.018
Tokeneed
Token ID2308
Feature activation-3.77e+00

Pos logit contributions
riage+0.348
OM+0.317
ths+0.309
tons+0.301
 surely+0.299
Neg logit contributions
ak-0.294
aky-0.279
aks-0.269
 twins-0.263
ady-0.252
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
OM+0.128
 destination+0.117
riage+0.115
tons+0.113
 coaster+0.112
Neg logit contributions
 twins-0.061
sm-0.055
 of-0.053
 who-0.053
 living-0.053
Token but
Token ID475
Feature activation-2.37e+00

Pos logit contributions
OM+0.055
 destination+0.050
riage+0.050
 coaster+0.048
overed+0.048
Neg logit contributions
 twins-0.028
 who-0.026
 living-0.025
 of-0.025
sm-0.024
Token the
Token ID262
Feature activation-1.46e+00

Pos logit contributions
OM+0.087
 coaster+0.082
riage+0.080
ACK+0.079
 destination+0.079
Neg logit contributions
 of-0.032
 everywhere-0.031
 were-0.031
 twins-0.030
 who-0.030
Token net
Token ID2010
Feature activation-3.28e+00

Pos logit contributions
OM+0.048
riage+0.043
olog+0.042
 destination+0.042
overed+0.042
Neg logit contributions
 twins-0.028
 living-0.025
 who-0.023
 of-0.023
 everywhere-0.023
Token
Token ID198
Feature activation-6.53e-01

Pos logit contributions
 twins+0.009
 sisters+0.008
sm+0.008
 cared+0.007
able+0.007
Neg logit contributions
OM-0.036
 destination-0.033
olog-0.033
 coaster-0.033
tons-0.033
TokenThey
Token ID2990
Feature activation-6.68e-01

Pos logit contributions
OM+0.021
 destination+0.019
riage+0.019
tons+0.019
 coaster+0.018
Neg logit contributions
 twins-0.012
sm-0.011
 who-0.010
 living-0.010
 of-0.010
Token climbed
Token ID19952
Feature activation-5.29e+00

Pos logit contributions
OM+0.020
 destination+0.019
riage+0.018
tons+0.018
 coaster+0.018
Neg logit contributions
 twins-0.013
sm-0.012
 who-0.012
 living-0.012
 of-0.011
Token off
Token ID572
Feature activation-9.52e-01

Pos logit contributions
OM+0.167
riage+0.151
olog+0.143
 destination+0.139
raper+0.138
Neg logit contributions
 everywhere-0.106
 twins-0.103
 of-0.103
 many-0.100
 living-0.098
Token the
Token ID262
Feature activation-2.28e+00

Pos logit contributions
OM+0.039
riage+0.037
 coaster+0.035
olog+0.035
ution+0.034
Neg logit contributions
 of-0.011
 everywhere-0.010
 twins-0.010
 who-0.009
 many-0.009
Token tr
Token ID491
Feature activation-1.25e+01

Pos logit contributions
OM+0.059
overed+0.055
riage+0.054
olog+0.052
 destination+0.050
Neg logit contributions
 living-0.057
 everywhere-0.055
 twins-0.054
 many-0.052
 of-0.052
Tokenamp
Token ID696
Feature activation-6.93e+00

Pos logit contributions
riage+0.228
tons+0.224
OM+0.221
olog+0.211
 surely+0.204
Neg logit contributions
ons-0.358
sm-0.356
 twins-0.346
 of-0.346
 living-0.336
Tokenoline
Token ID14453
Feature activation-3.22e+00

Pos logit contributions
olog+0.192
 destination+0.186
OM+0.183
riage+0.179
tons+0.173
Neg logit contributions
 of-0.156
sm-0.145
 living-0.142
ons-0.139
 twins-0.134
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
OM+0.122
riage+0.114
 destination+0.112
tons+0.109
ACK+0.109
Neg logit contributions
 twins-0.039
 of-0.035
 who-0.034
 living-0.034
sm-0.033
Token hugged
Token ID41130
Feature activation+3.03e+00

Pos logit contributions
 twins+0.007
 who+0.007
 of+0.006
 were+0.006
 everywhere+0.006
Neg logit contributions
OM-0.012
 destination-0.011
riage-0.011
raper-0.011
ider-0.011
Token each
Token ID1123
Feature activation+3.13e+00

Pos logit contributions
 who+0.040
 twins+0.038
 love+0.032
sm+0.032
 of+0.032
Neg logit contributions
OM-0.119
ider-0.108
 destination-0.107
castle-0.107
raper-0.105
Token river
Token ID7850
Feature activation-3.74e+00

Pos logit contributions
OM+0.015
overed+0.014
riage+0.013
olog+0.013
tons+0.012
Neg logit contributions
 twins-0.015
 living-0.014
 who-0.013
aky-0.013
 many-0.013
Token.
Token ID13
Feature activation-7.57e-01

Pos logit contributions
OM+0.139
 destination+0.129
riage+0.127
tons+0.125
 coaster+0.124
Neg logit contributions
 twins-0.048
sm-0.042
 who-0.041
 living-0.041
 of-0.040
Token Suddenly
Token ID24975
Feature activation-9.65e-01

Pos logit contributions
OM+0.024
 destination+0.023
riage+0.022
tons+0.022
 coaster+0.021
Neg logit contributions
 twins-0.013
sm-0.012
 living-0.012
 who-0.012
 of-0.011
Token,
Token ID11
Feature activation-2.66e+00

Pos logit contributions
OM+0.026
 destination+0.025
tons+0.024
overed+0.023
olog+0.023
Neg logit contributions
 twins-0.021
sm-0.019
aky-0.019
 living-0.018
 who-0.018
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
OM+0.085
 destination+0.080
tons+0.078
riage+0.076
 surely+0.076
Neg logit contributions
 twins-0.045
sm-0.042
 living-0.040
aky-0.039
 who-0.039
Token sw
Token ID1509
Feature activation-1.25e+01

Pos logit contributions
OM+0.061
olog+0.055
riage+0.055
 destination+0.054
overed+0.054
Neg logit contributions
 twins-0.039
 living-0.033
 who-0.031
 friends-0.031
 of-0.031
Tokenan
Token ID272
Feature activation-1.42e+00

Pos logit contributions
OM+0.366
riage+0.353
tons+0.346
olog+0.333
 coaster+0.327
Neg logit contributions
 twins-0.253
 friends-0.226
sm-0.224
aky-0.221
 living-0.219
Token said
Token ID531
Feature activation+2.61e+00

Pos logit contributions
OM+0.043
 destination+0.040
riage+0.039
tons+0.038
olog+0.038
Neg logit contributions
 twins-0.028
sm-0.025
 who-0.025
 living-0.024
 of-0.024
Token,
Token ID11
Feature activation-8.47e-01

Pos logit contributions
 twins+0.039
 of+0.037
 who+0.035
 living+0.035
sm+0.035
Neg logit contributions
OM-0.094
 destination-0.086
riage-0.084
 coaster-0.083
ACK-0.082
Token "
Token ID366
Feature activation-2.93e+00

Pos logit contributions
OM+0.029
 destination+0.027
riage+0.026
 coaster+0.025
ACK+0.025
Neg logit contributions
 twins-0.015
 who-0.014
 living-0.013
 everywhere-0.013
sm-0.012
TokenStep
Token ID8600
Feature activation-5.45e+00

Pos logit contributions
OM+0.081
 destination+0.074
riage+0.071
tons+0.070
 coaster+0.069
Neg logit contributions
 twins-0.066
sm-0.061
 living-0.059
 who-0.059
 of-0.058
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 twins+0.002
 cared+0.002
sm+0.002
 sisters+0.002
able+0.002
Neg logit contributions
OM-0.008
overed-0.008
 destination-0.007
olog-0.007
tons-0.007
Token
Token ID198
Feature activation-2.70e-01

Pos logit contributions
 twins+0.017
 sisters+0.013
sm+0.013
 living+0.013
 cared+0.012
Neg logit contributions
OM-0.061
 destination-0.056
overed-0.055
tons-0.054
olog-0.054
TokenSuddenly
Token ID38582
Feature activation-1.33e+00

Pos logit contributions
OM+0.008
 destination+0.008
riage+0.007
tons+0.007
 coaster+0.007
Neg logit contributions
 twins-0.005
sm-0.005
 who-0.005
 living-0.005
 of-0.005
Token,
Token ID11
Feature activation-2.86e+00

Pos logit contributions
OM+0.044
 destination+0.041
riage+0.039
 coaster+0.039
tons+0.039
Neg logit contributions
 twins-0.023
sm-0.021
 of-0.020
 who-0.020
 living-0.020
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
OM+0.095
 destination+0.089
riage+0.086
 coaster+0.085
tons+0.084
Neg logit contributions
 twins-0.048
sm-0.044
 who-0.042
 of-0.042
 living-0.041
Token tr
Token ID491
Feature activation-1.24e+01

Pos logit contributions
OM+0.060
riage+0.056
 destination+0.056
olog+0.056
overed+0.054
Neg logit contributions
 twins-0.038
 living-0.033
 of-0.031
 sisters-0.031
 who-0.031
Tokenamp
Token ID696
Feature activation-6.38e+00

Pos logit contributions
riage+0.242
OM+0.234
 destination+0.229
tons+0.229
olog+0.228
Neg logit contributions
ons-0.351
sm-0.348
 twins-0.341
 of-0.327
aky-0.319
Tokenoline
Token ID14453
Feature activation-3.61e+00

Pos logit contributions
 destination+0.170
olog+0.166
OM+0.160
riage+0.157
tons+0.155
Neg logit contributions
 of-0.147
sm-0.146
 twins-0.140
ons-0.139
 who-0.132
Token collapsed
Token ID14707
Feature activation-2.16e+00

Pos logit contributions
OM+0.118
riage+0.118
 destination+0.111
tons+0.110
raper+0.110
Neg logit contributions
 of-0.058
 twins-0.056
 were-0.056
 everywhere-0.052
 who-0.051
Token beneath
Token ID11061
Feature activation+5.45e-01

Pos logit contributions
OM+0.086
riage+0.083
 destination+0.080
olog+0.079
tons+0.078
Neg logit contributions
 twins-0.023
 everywhere-0.021
 living-0.020
 of-0.020
sm-0.020
Token her
Token ID607
Feature activation+1.81e+00

Pos logit contributions
 twins+0.007
sm+0.006
 living+0.006
 who+0.006
aky+0.005
Neg logit contributions
OM-0.020
 destination-0.019
riage-0.018
tons-0.018
overed-0.018
Token best
Token ID1266
Feature activation+7.40e-01

Pos logit contributions
OM+0.026
 destination+0.024
riage+0.023
 coaster+0.023
overed+0.023
Neg logit contributions
 twins-0.017
 who-0.015
sm-0.015
 living-0.015
 of-0.015
Token to
Token ID284
Feature activation-2.46e+00

Pos logit contributions
 twins+0.010
sm+0.009
 who+0.008
 living+0.008
 of+0.008
Neg logit contributions
OM-0.027
 destination-0.025
riage-0.024
tons-0.024
 coaster-0.024
Token stay
Token ID2652
Feature activation-2.44e+00

Pos logit contributions
OM+0.079
 destination+0.071
overed+0.071
 coaster+0.070
riage+0.070
Neg logit contributions
 twins-0.044
 who-0.041
 of-0.040
 living-0.040
 love-0.040
Token strong
Token ID1913
Feature activation-1.62e-01

Pos logit contributions
OM+0.085
riage+0.076
 destination+0.075
 coaster+0.075
olog+0.074
Neg logit contributions
 twins-0.040
 who-0.037
 living-0.036
sm-0.035
 of-0.034
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
OM+0.006
riage+0.006
 destination+0.006
overed+0.006
olog+0.006
Neg logit contributions
 twins-0.002
 who-0.002
 of-0.002
 living-0.002
sm-0.002
Token foc
Token ID2133
Feature activation-1.24e+01

Pos logit contributions
 love+0.023
 who+0.022
 living+0.021
 many+0.021
 twins+0.021
Neg logit contributions
OM-0.051
overed-0.047
ACK-0.047
 coaster-0.046
 destination-0.046
Tokenussed
Token ID29569
Feature activation-3.31e+00

Pos logit contributions
riage+0.230
OM+0.225
 slope+0.220
 coaster+0.215
tons+0.212
Neg logit contributions
old-0.328
 of-0.297
 twins-0.294
unts-0.294
 who-0.293
Token,
Token ID11
Feature activation-3.22e-01

Pos logit contributions
OM+0.116
riage+0.106
 destination+0.105
olog+0.102
ACK+0.102
Neg logit contributions
 twins-0.052
 everywhere-0.047
 who-0.047
 living-0.047
 of-0.046
Token no
Token ID645
Feature activation+1.20e-01

Pos logit contributions
OM+0.010
 destination+0.009
riage+0.009
 coaster+0.009
overed+0.009
Neg logit contributions
 twins-0.006
 who-0.005
 living-0.005
 of-0.005
sm-0.005
Token matter
Token ID2300
Feature activation-4.65e-01

Pos logit contributions
 twins+0.003
 of+0.003
 who+0.003
 living+0.003
 love+0.003
Neg logit contributions
OM-0.003
overed-0.003
olog-0.003
riage-0.003
 coaster-0.003
Token how
Token ID703
Feature activation+1.41e+00

Pos logit contributions
OM+0.015
 destination+0.014
 coaster+0.014
overed+0.014
olog+0.013
Neg logit contributions
 twins-0.008
 who-0.008
 of-0.008
 living-0.008
sm-0.007
Token
Token ID198
Feature activation+9.09e-01

Pos logit contributions
ons+0.013
sm+0.013
aky+0.012
 twins+0.012
 sisters+0.012
Neg logit contributions
OM-0.033
ACK-0.030
tons-0.030
overed-0.030
riage-0.030
Token
Token ID198
Feature activation-8.05e-01

Pos logit contributions
 sisters+0.013
 cared+0.012
sm+0.012
 twins+0.012
able+0.012
Neg logit contributions
OM-0.029
 destination-0.026
overed-0.026
riage-0.025
olog-0.025
Token"
Token ID1
Feature activation-1.42e+00

Pos logit contributions
OM+0.023
 destination+0.021
riage+0.021
tons+0.020
 coaster+0.020
Neg logit contributions
 twins-0.017
sm-0.016
 living-0.015
 who-0.015
 of-0.015
TokenMaybe
Token ID13300
Feature activation-4.40e-01

Pos logit contributions
OM+0.040
 destination+0.038
riage+0.036
 coaster+0.035
olog+0.035
Neg logit contributions
 twins-0.031
sm-0.029
 who-0.029
 cared-0.027
 living-0.027
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
OM+0.016
 coaster+0.015
 destination+0.015
ER+0.015
 slope+0.015
Neg logit contributions
 who-0.007
 of-0.007
 twins-0.007
 friends-0.006
-0.006
Token sw
Token ID1509
Feature activation-1.24e+01

Pos logit contributions
OM+0.046
 destination+0.042
riage+0.041
olog+0.041
overed+0.041
Neg logit contributions
 twins-0.036
 living-0.031
sm-0.031
 who-0.031
 of-0.030
Tokenan
Token ID272
Feature activation-1.18e+00

Pos logit contributions
tons+0.386
riage+0.384
OM+0.375
ths+0.373
 surely+0.360
Neg logit contributions
 twins-0.218
 friends-0.186
ak-0.176
 living-0.169
sm-0.168
Token was
Token ID373
Feature activation-1.03e+00

Pos logit contributions
OM+0.033
 destination+0.031
riage+0.030
tons+0.030
 coaster+0.029
Neg logit contributions
 twins-0.024
sm-0.022
 sisters-0.021
 living-0.021
ons-0.021
Token hungry
Token ID14720
Feature activation+8.90e-01

Pos logit contributions
OM+0.031
 destination+0.029
tons+0.028
riage+0.028
 coaster+0.027
Neg logit contributions
 twins-0.019
sm-0.018
 living-0.017
 who-0.017
 of-0.017
Token and
Token ID290
Feature activation+3.78e-01

Pos logit contributions
 twins+0.015
sm+0.014
 living+0.013
 who+0.013
 sisters+0.012
Neg logit contributions
OM-0.029
 destination-0.027
tons-0.026
riage-0.026
 coaster-0.025
Token in
Token ID287
Feature activation-1.36e+00

Pos logit contributions
 twins+0.006
sm+0.006
ons+0.005
 living+0.005
 of+0.005
Neg logit contributions
OM-0.012
 destination-0.011
riage-0.011
tons-0.011
overed-0.011
Token started
Token ID2067
Feature activation-1.47e+00

Pos logit contributions
OM+0.125
riage+0.111
 destination+0.111
ACK+0.108
tons+0.108
Neg logit contributions
 twins-0.063
 of-0.060
 who-0.060
sm-0.059
 living-0.058
Token to
Token ID284
Feature activation-3.31e+00

Pos logit contributions
OM+0.054
 destination+0.050
riage+0.049
tons+0.048
 coaster+0.048
Neg logit contributions
 twins-0.020
sm-0.018
 living-0.017
 who-0.017
 of-0.017
Token understand
Token ID1833
Feature activation+2.81e+00

Pos logit contributions
OM+0.094
overed+0.090
ider+0.086
olog+0.085
ths+0.083
Neg logit contributions
 twins-0.069
 who-0.068
 love-0.067
 living-0.064
 were-0.062
Token that
Token ID326
Feature activation-4.46e-01

Pos logit contributions
 who+0.044
 twins+0.043
 of+0.041
 living+0.040
sm+0.039
Neg logit contributions
OM-0.101
 coaster-0.089
 destination-0.089
ACK-0.088
overed-0.087
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
OM+0.016
 coaster+0.014
 destination+0.014
ACK+0.014
ider+0.014
Neg logit contributions
 twins-0.007
 who-0.006
 of-0.006
 living-0.006
 many-0.006
Token sw
Token ID1509
Feature activation-1.24e+01

Pos logit contributions
OM+0.037
riage+0.033
olog+0.033
overed+0.033
ACK+0.032
Neg logit contributions
 twins-0.019
 living-0.016
 of-0.016
 who-0.015
 friends-0.015
Tokenan
Token ID272
Feature activation-6.81e-01

Pos logit contributions
OM+0.368
riage+0.353
tons+0.338
ths+0.326
olog+0.326
Neg logit contributions
 twins-0.241
 friends-0.223
 living-0.209
ak-0.205
aky-0.204
Tokenâ
Token ID22940
Feature activation-7.00e+00

Pos logit contributions
OM+0.022
 destination+0.020
riage+0.020
tons+0.020
 coaster+0.019
Neg logit contributions
 twins-0.012
sm-0.011
 living-0.010
 who-0.010
 of-0.010
Token€
Token ID26391
Feature activation-3.94e+00

Pos logit contributions
 destination+0.144
castle+0.140
ths+0.135
raper+0.133
OM+0.130
Neg logit contributions
able-0.179
 cared-0.179
 company-0.174
aks-0.172
ak-0.166
Tokenâ„¢
Token ID8151
Feature activation-2.18e+00

Pos logit contributions
OM+0.106
 destination+0.096
riage+0.092
tons+0.091
 coaster+0.090
Neg logit contributions
 twins-0.091
sm-0.084
 living-0.082
 who-0.082
 of-0.080
Tokens
Token ID82
Feature activation+5.71e-01

Pos logit contributions
OM+0.073
 destination+0.068
tons+0.065
riage+0.065
overed+0.064
Neg logit contributions
 twins-0.035
sm-0.031
 living-0.030
 sisters-0.029
 who-0.029
Token and
Token ID290
Feature activation+1.13e+00

Pos logit contributions
OM+0.034
 destination+0.032
riage+0.031
tons+0.031
olog+0.031
Neg logit contributions
 twins-0.011
sm-0.010
 of-0.010
 who-0.010
 living-0.010
Token terrifying
Token ID17623
Feature activation-2.47e+00

Pos logit contributions
 everywhere+0.014
 many+0.013
 twins+0.013
 who+0.013
 of+0.013
Neg logit contributions
OM-0.045
 coaster-0.039
ACK-0.039
 destination-0.039
overed-0.039
Token happened
Token ID3022
Feature activation-1.68e+00

Pos logit contributions
OM+0.093
 destination+0.088
olog+0.086
riage+0.085
tons+0.085
Neg logit contributions
 twins-0.030
sm-0.029
 of-0.027
 living-0.027
 everywhere-0.027
Token.
Token ID13
Feature activation-5.63e-01

Pos logit contributions
OM+0.070
 destination+0.067
riage+0.065
tons+0.064
olog+0.063
Neg logit contributions
 twins-0.014
sm-0.013
 living-0.013
 who-0.013
 everywhere-0.011
Token The
Token ID383
Feature activation-8.53e-01

Pos logit contributions
OM+0.022
 destination+0.021
riage+0.020
 coaster+0.020
castle+0.020
Neg logit contributions
 twins-0.007
 of-0.006
 living-0.006
sm-0.005
 who-0.005
Token tr
Token ID491
Feature activation-1.24e+01

Pos logit contributions
OM+0.027
 destination+0.026
riage+0.026
overed+0.025
tons+0.025
Neg logit contributions
 twins-0.016
 living-0.015
 of-0.014
 everywhere-0.013
sm-0.013
Tokenicycle
Token ID35298
Feature activation-3.94e+00

Pos logit contributions
OM+0.235
riage+0.228
 destination+0.225
tons+0.219
olog+0.216
Neg logit contributions
ons-0.359
sm-0.351
 twins-0.346
 of-0.327
 living-0.325
Token suddenly
Token ID6451
Feature activation-2.06e+00

Pos logit contributions
riage+0.136
OM+0.133
tons+0.125
raper+0.124
 destination+0.123
Neg logit contributions
 of-0.063
 were-0.060
 everywhere-0.056
 who-0.054
 twins-0.053
Token stopped
Token ID5025
Feature activation-2.60e+00

Pos logit contributions
OM+0.064
 destination+0.059
riage+0.059
tons+0.058
 coaster+0.056
Neg logit contributions
 twins-0.038
 of-0.035
sm-0.035
 who-0.034
 living-0.034
Token working
Token ID1762
Feature activation-4.13e+00

Pos logit contributions
OM+0.101
 destination+0.095
riage+0.094
olog+0.092
 coaster+0.092
Neg logit contributions
 twins-0.031
 who-0.027
sm-0.026
 living-0.026
 of-0.025
Token!
Token ID0
Feature activation-2.88e-01

Pos logit contributions
OM+0.169
riage+0.161
 destination+0.153
ACK+0.152
tons+0.151
Neg logit contributions
 everywhere-0.039
 twins-0.037
 of-0.035
 living-0.034
 who-0.033
Token play
Token ID711
Feature activation-1.41e+00

Pos logit contributions
OM+0.019
 destination+0.018
riage+0.017
tons+0.017
 coaster+0.016
Neg logit contributions
 twins-0.013
sm-0.013
 living-0.012
 who-0.012
ons-0.011
Token with
Token ID351
Feature activation+2.03e+00

Pos logit contributions
OM+0.047
 destination+0.044
riage+0.042
tons+0.042
 coaster+0.041
Neg logit contributions
 twins-0.024
sm-0.021
 who-0.021
 living-0.020
 of-0.020
Token them
Token ID606
Feature activation+4.09e+00

Pos logit contributions
 twins+0.034
aky+0.032
 living+0.029
sm+0.029
unts+0.028
Neg logit contributions
 destination-0.058
OM-0.057
tons-0.057
riage-0.054
 coaster-0.054
Token.
Token ID13
Feature activation+6.75e-01

Pos logit contributions
 twins+0.059
sm+0.054
 living+0.050
 who+0.048
ons+0.048
Neg logit contributions
OM-0.140
 destination-0.134
riage-0.128
tons-0.128
 coaster-0.126
Token The
Token ID383
Feature activation-1.10e+00

Pos logit contributions
 twins+0.010
sm+0.009
aky+0.009
 sisters+0.008
 living+0.008
Neg logit contributions
OM-0.023
 destination-0.022
riage-0.021
olog-0.021
tons-0.021
Token sw
Token ID1509
Feature activation-1.24e+01

Pos logit contributions
OM+0.035
 destination+0.033
riage+0.031
tons+0.031
 coaster+0.031
Neg logit contributions
 twins-0.020
sm-0.018
 living-0.017
 who-0.017
 of-0.017
Tokenan
Token ID272
Feature activation-7.23e-01

Pos logit contributions
OM+0.367
riage+0.350
tons+0.348
 surely+0.332
olog+0.331
Neg logit contributions
 twins-0.252
 friends-0.219
sm-0.212
aky-0.208
 living-0.206
Token was
Token ID373
Feature activation+6.99e-02

Pos logit contributions
OM+0.022
 destination+0.021
tons+0.020
 coaster+0.020
overed+0.020
Neg logit contributions
 twins-0.013
sm-0.012
 living-0.012
 of-0.011
ons-0.011
Token happy
Token ID3772
Feature activation+2.12e+00

Pos logit contributions
ons+0.001
sm+0.001
able+0.001
old+0.001
 twins+0.001
Neg logit contributions
tons-0.002
OM-0.002
 destination-0.002
overed-0.002
 surely-0.002
Token to
Token ID284
Feature activation-1.80e+00

Pos logit contributions
 twins+0.036
sm+0.033
aky+0.032
ons+0.031
aks+0.031
Neg logit contributions
OM-0.066
 destination-0.062
overed-0.061
tons-0.060
 coaster-0.060
Token join
Token ID4654
Feature activation+6.54e-01

Pos logit contributions
OM+0.057
 destination+0.053
riage+0.051
tons+0.051
 coaster+0.050
Neg logit contributions
 twins-0.033
sm-0.030
 who-0.028
 living-0.028
 of-0.028
Token see
Token ID766
Feature activation-5.52e-01

Pos logit contributions
OM+0.070
 destination+0.063
riage+0.062
tons+0.061
 coaster+0.061
Neg logit contributions
 twins-0.049
 who-0.046
sm-0.046
 of-0.045
 living-0.044
Token each
Token ID1123
Feature activation+8.30e-02

Pos logit contributions
OM+0.017
 destination+0.016
riage+0.015
tons+0.015
overed+0.015
Neg logit contributions
 twins-0.010
sm-0.009
 living-0.009
ons-0.009
aky-0.009
Token other
Token ID584
Feature activation+1.96e+00

Pos logit contributions
 twins+0.001
sm+0.001
 living+0.001
ons+0.001
 sisters+0.001
Neg logit contributions
OM-0.003
 destination-0.003
tons-0.003
 coaster-0.003
ACK-0.003
Token.
Token ID13
Feature activation-5.95e-01

Pos logit contributions
 twins+0.032
sm+0.030
ons+0.028
 living+0.028
 cared+0.027
Neg logit contributions
OM-0.063
 destination-0.061
tons-0.057
riage-0.057
 coaster-0.056
Token The
Token ID383
Feature activation-9.10e-01

Pos logit contributions
OM+0.022
 destination+0.020
riage+0.020
 coaster+0.020
tons+0.020
Neg logit contributions
 twins-0.008
 who-0.007
 living-0.006
 of-0.006
 cared-0.006
Token sw
Token ID1509
Feature activation-1.23e+01

Pos logit contributions
OM+0.028
 destination+0.025
riage+0.025
tons+0.024
 coaster+0.024
Neg logit contributions
 twins-0.018
sm-0.016
 living-0.016
 who-0.016
 of-0.016
Tokenan
Token ID272
Feature activation-9.76e-01

Pos logit contributions
OM+0.345
riage+0.316
tons+0.313
olog+0.302
 coaster+0.299
Neg logit contributions
 twins-0.275
sm-0.249
 living-0.241
 who-0.238
 friends-0.235
Token on
Token ID319
Feature activation-2.53e+00

Pos logit contributions
OM+0.030
 destination+0.028
riage+0.027
 coaster+0.027
tons+0.027
Neg logit contributions
 twins-0.018
sm-0.016
 living-0.016
 who-0.015
 of-0.015
Token the
Token ID262
Feature activation-2.01e+00

Pos logit contributions
OM+0.097
riage+0.089
olog+0.088
 destination+0.088
 coaster+0.086
Neg logit contributions
 twins-0.032
 who-0.028
 of-0.027
 living-0.027
sm-0.026
Token left
Token ID1364
Feature activation-2.51e+00

Pos logit contributions
OM+0.065
riage+0.058
overed+0.057
olog+0.057
 destination+0.057
Neg logit contributions
 twins-0.038
 living-0.035
 who-0.033
 everywhere-0.032
sm-0.032
Token flew
Token ID13112
Feature activation-2.75e+00

Pos logit contributions
OM+0.080
 destination+0.078
tons+0.073
overed+0.073
 surely+0.071
Neg logit contributions
 twins-0.041
sm-0.039
ons-0.036
 sisters-0.035
aky-0.035
Token the
Token ID262
Feature activation-1.81e-01

Pos logit contributions
OM+0.019
 destination+0.018
riage+0.017
tons+0.017
 coaster+0.017
Neg logit contributions
 twins-0.009
sm-0.008
aky-0.007
 who-0.007
able-0.007
Token corn
Token ID11676
Feature activation-3.00e+00

Pos logit contributions
OM+0.006
 destination+0.005
riage+0.005
overed+0.005
tons+0.005
Neg logit contributions
 twins-0.003
 living-0.003
sm-0.003
 who-0.003
 of-0.003
Token field
Token ID2214
Feature activation-8.24e-01

Pos logit contributions
OM+0.073
 destination+0.073
tons+0.064
riage+0.064
 surely+0.064
Neg logit contributions
 twins-0.071
sm-0.068
 cared-0.063
able-0.062
ons-0.062
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
OM+0.026
 destination+0.025
overed+0.024
 surely+0.023
riage+0.023
Neg logit contributions
 twins-0.014
sm-0.013
 sisters-0.012
aky-0.012
ons-0.012
Token The
Token ID383
Feature activation-1.10e+00

Pos logit contributions
 twins+0.003
sm+0.002
aky+0.002
 sisters+0.002
 living+0.002
Neg logit contributions
OM-0.005
 destination-0.005
tons-0.005
riage-0.005
overed-0.005
Token st
Token ID336
Feature activation-1.23e+01

Pos logit contributions
OM+0.032
 destination+0.031
riage+0.029
tons+0.029
 surely+0.029
Neg logit contributions
 twins-0.020
sm-0.020
ons-0.019
old-0.018
 of-0.018
Tokenalks
Token ID23833
Feature activation-1.97e-01

Pos logit contributions
OM+0.307
 destination+0.281
overed+0.264
olog+0.253
riage+0.252
Neg logit contributions
 twins-0.313
sm-0.290
 who-0.288
 living-0.286
 cared-0.280
Token looked
Token ID3114
Feature activation-2.17e+00

Pos logit contributions
OM+0.006
 destination+0.006
 surely+0.006
overed+0.005
ths+0.005
Neg logit contributions
 twins-0.004
sm-0.003
 living-0.003
 who-0.003
 kittens-0.003
Token happy
Token ID3772
Feature activation+3.12e+00

Pos logit contributions
OM+0.063
 destination+0.062
tons+0.057
 surely+0.057
overed+0.057
Neg logit contributions
 twins-0.042
sm-0.037
ons-0.037
 sisters-0.036
 living-0.036
Token to
Token ID284
Feature activation-1.17e+00

Pos logit contributions
 twins+0.046
sm+0.041
 living+0.038
 who+0.037
ons+0.037
Neg logit contributions
OM-0.109
 destination-0.102
tons-0.098
riage-0.098
overed-0.097
Token see
Token ID766
Feature activation+1.90e+00

Pos logit contributions
OM+0.041
 destination+0.038
 coaster+0.037
overed+0.037
olog+0.036
Neg logit contributions
 twins-0.018
sm-0.016
 of-0.016
 who-0.016
 living-0.015
Token heavy
Token ID4334
Feature activation-1.37e+00

Pos logit contributions
 twins+0.020
sm+0.019
 of+0.018
 who+0.017
 many+0.017
Neg logit contributions
OM-0.045
 destination-0.043
riage-0.042
tons-0.041
 coaster-0.041
Token!
Token ID0
Feature activation-1.46e-01

Pos logit contributions
OM+0.048
 destination+0.045
riage+0.044
tons+0.043
 coaster+0.043
Neg logit contributions
 twins-0.020
sm-0.018
 living-0.018
 of-0.018
 who-0.018
Token Finally
Token ID9461
Feature activation-9.60e-01

Pos logit contributions
OM+0.005
 destination+0.005
riage+0.005
 coaster+0.005
tons+0.005
Neg logit contributions
 twins-0.002
sm-0.002
 who-0.002
 living-0.002
 of-0.002
Token,
Token ID11
Feature activation-2.66e+00

Pos logit contributions
OM+0.034
 destination+0.031
 coaster+0.031
riage+0.031
ACK+0.030
Neg logit contributions
 twins-0.015
 of-0.013
sm-0.013
 who-0.013
 everywhere-0.012
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
OM+0.104
 destination+0.096
 coaster+0.096
riage+0.095
olog+0.095
Neg logit contributions
 twins-0.029
 who-0.026
 of-0.026
 many-0.025
sm-0.025
Token sw
Token ID1509
Feature activation-1.23e+01

Pos logit contributions
OM+0.071
riage+0.064
 destination+0.062
 coaster+0.062
olog+0.062
Neg logit contributions
 twins-0.046
 living-0.040
 sisters-0.039
 friends-0.039
 who-0.038
Tokenan
Token ID272
Feature activation-1.52e+00

Pos logit contributions
OM+0.376
riage+0.367
tons+0.357
 coaster+0.338
olog+0.329
Neg logit contributions
 twins-0.235
 friends-0.217
sm-0.206
aky-0.201
 living-0.200
Token got
Token ID1392
Feature activation-1.80e+00

Pos logit contributions
OM+0.048
riage+0.045
 destination+0.044
tons+0.043
overed+0.042
Neg logit contributions
 twins-0.027
 who-0.026
sm-0.026
 of-0.026
 living-0.025
Token help
Token ID1037
Feature activation-1.06e+00

Pos logit contributions
OM+0.054
 destination+0.049
riage+0.047
tons+0.046
 coaster+0.046
Neg logit contributions
 twins-0.037
sm-0.034
 who-0.033
 living-0.033
 of-0.033
Token from
Token ID422
Feature activation+8.14e-01

Pos logit contributions
OM+0.037
 destination+0.035
riage+0.034
tons+0.034
 coaster+0.033
Neg logit contributions
 twins-0.015
sm-0.014
 living-0.013
 who-0.013
 of-0.012
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
 twins+0.010
sm+0.009
 who+0.009
 living+0.008
 of+0.008
Neg logit contributions
OM-0.031
 destination-0.029
olog-0.028
riage-0.028
 coaster-0.028
Token hurt
Token ID5938
Feature activation+1.02e+00

Pos logit contributions
ons+0.002
 twins+0.002
sm+0.002
aky+0.002
able+0.002
Neg logit contributions
OM-0.004
 destination-0.004
tons-0.004
overed-0.004
riage-0.004
Token.
Token ID13
Feature activation+1.73e+00

Pos logit contributions
ons+0.016
 twins+0.016
sm+0.015
 living+0.015
old+0.014
Neg logit contributions
OM-0.032
tons-0.030
 destination-0.029
riage-0.029
ACK-0.028
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
ons+0.027
aky+0.025
sm+0.025
old+0.023
 twins+0.023
Neg logit contributions
OM-0.058
ACK-0.052
riage-0.052
tons-0.052
overed-0.051
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
 sisters+0.020
 twins+0.020
able+0.019
 cared+0.019
aky+0.019
Neg logit contributions
OM-0.049
 destination-0.044
overed-0.043
riage-0.043
tons-0.042
TokenThe
Token ID464
Feature activation-1.94e+00

Pos logit contributions
OM+0.012
 destination+0.011
riage+0.010
tons+0.010
olog+0.010
Neg logit contributions
 twins-0.009
sm-0.008
 living-0.008
 everywhere-0.007
 of-0.007
Token sw
Token ID1509
Feature activation-1.23e+01

Pos logit contributions
OM+0.054
 destination+0.049
olog+0.047
riage+0.047
tons+0.046
Neg logit contributions
 twins-0.046
 living-0.041
sm-0.040
 who-0.040
 of-0.039
Tokenan
Token ID272
Feature activation-5.06e-01

Pos logit contributions
OM+0.353
riage+0.332
tons+0.332
olog+0.320
 coaster+0.316
Neg logit contributions
 twins-0.260
sm-0.227
 friends-0.226
aky-0.220
 who-0.220
Token is
Token ID318
Feature activation-3.82e-01

Pos logit contributions
OM+0.015
 destination+0.013
riage+0.013
tons+0.013
 coaster+0.013
Neg logit contributions
 twins-0.010
sm-0.010
 living-0.009
 who-0.009
 of-0.009
Token angry
Token ID7954
Feature activation+7.27e-01

Pos logit contributions
OM+0.010
 destination+0.010
tons+0.010
overed+0.009
 surely+0.009
Neg logit contributions
ons-0.008
sm-0.007
old-0.007
able-0.007
 twins-0.007
Token.
Token ID13
Feature activation-1.63e-01

Pos logit contributions
ons+0.012
Hope+0.012
aky+0.012
Beck+0.012
 sisters+0.012
Neg logit contributions
OM-0.020
overed-0.019
tons-0.018
 coaster-0.018
ider-0.018
Token It
Token ID632
Feature activation-1.52e+00

Pos logit contributions
OM+0.005
 destination+0.005
riage+0.005
tons+0.005
overed+0.005
Neg logit contributions
 twins-0.003
sm-0.003
 living-0.003
 who-0.003
ons-0.002
Token her
Token ID607
Feature activation+2.60e+00

Pos logit contributions
 twins+0.040
sm+0.035
 living+0.034
 who+0.033
ons+0.032
Neg logit contributions
OM-0.088
 destination-0.083
riage-0.080
 coaster-0.079
tons-0.079
Token special
Token ID2041
Feature activation-9.22e-01

Pos logit contributions
 twins+0.047
sm+0.046
ons+0.042
 of+0.042
 who+0.042
Neg logit contributions
OM-0.078
 destination-0.075
 coaster-0.070
tons-0.070
riage-0.070
Token song
Token ID3496
Feature activation+1.73e+00

Pos logit contributions
OM+0.027
 destination+0.025
riage+0.024
 coaster+0.024
tons+0.023
Neg logit contributions
 twins-0.018
sm-0.017
 who-0.016
 living-0.016
 of-0.016
Token.
Token ID13
Feature activation+5.05e-01

Pos logit contributions
 twins+0.026
sm+0.024
 who+0.021
 living+0.021
ons+0.021
Neg logit contributions
OM-0.058
 destination-0.056
riage-0.053
tons-0.052
 coaster-0.052
Token The
Token ID383
Feature activation-1.14e+00

Pos logit contributions
 twins+0.007
sm+0.007
aky+0.006
able+0.006
 everywhere+0.006
Neg logit contributions
OM-0.018
 destination-0.016
tons-0.016
riage-0.016
ACK-0.016
Token sw
Token ID1509
Feature activation-1.23e+01

Pos logit contributions
OM+0.038
 destination+0.035
riage+0.034
tons+0.034
 coaster+0.033
Neg logit contributions
 twins-0.019
sm-0.017
 who-0.016
 living-0.016
 of-0.016
Tokenan
Token ID272
Feature activation-1.11e+00

Pos logit contributions
OM+0.355
riage+0.340
tons+0.337
olog+0.316
 slope+0.315
Neg logit contributions
 twins-0.249
sm-0.224
 friends-0.221
aky-0.218
 living-0.217
Token was
Token ID373
Feature activation-2.86e-01

Pos logit contributions
OM+0.034
 destination+0.032
riage+0.030
tons+0.030
 coaster+0.030
Neg logit contributions
 twins-0.021
sm-0.019
 living-0.018
 who-0.017
ons-0.017
Token so
Token ID523
Feature activation+2.18e+00

Pos logit contributions
OM+0.008
 destination+0.008
tons+0.008
 surely+0.008
overed+0.008
Neg logit contributions
 twins-0.005
sm-0.005
ons-0.005
able-0.005
 living-0.004
Token happy
Token ID3772
Feature activation+1.92e+00

Pos logit contributions
 twins+0.029
ons+0.027
sm+0.026
 living+0.025
 who+0.024
Neg logit contributions
OM-0.071
 destination-0.069
 surely-0.068
overed-0.068
tons-0.067
Token to
Token ID284
Feature activation-2.27e+00

Pos logit contributions
 twins+0.032
sm+0.029
 living+0.027
 who+0.027
aky+0.027
Neg logit contributions
OM-0.063
 destination-0.059
tons-0.056
riage-0.056
overed-0.056
Token's
Token ID338
Feature activation-7.59e-01

Pos logit contributions
OM+0.031
 destination+0.030
tons+0.028
riage+0.028
overed+0.028
Neg logit contributions
 twins-0.022
sm-0.020
 living-0.019
ons-0.019
 who-0.019
Token wound
Token ID11699
Feature activation-3.73e+00

Pos logit contributions
OM+0.022
 destination+0.022
tons+0.020
 surely+0.020
riage+0.020
Neg logit contributions
 twins-0.014
sm-0.014
ons-0.013
 who-0.012
old-0.012
Token healed
Token ID28557
Feature activation-2.06e+00

Pos logit contributions
OM+0.112
 destination+0.104
tons+0.099
riage+0.099
overed+0.098
Neg logit contributions
 twins-0.072
sm-0.065
 living-0.063
 who-0.062
 sisters-0.060
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
OM+0.072
 destination+0.065
riage+0.065
overed+0.063
 coaster+0.063
Neg logit contributions
 twins-0.032
sm-0.030
 who-0.030
 of-0.029
 everywhere-0.028
Token The
Token ID383
Feature activation-1.37e+00

Pos logit contributions
OM+0.009
 destination+0.008
riage+0.008
tons+0.008
overed+0.008
Neg logit contributions
 twins-0.003
sm-0.003
 living-0.003
 who-0.003
 of-0.003
Token sw
Token ID1509
Feature activation-1.23e+01

Pos logit contributions
OM+0.043
 destination+0.040
riage+0.038
olog+0.038
tons+0.038
Neg logit contributions
 twins-0.027
sm-0.023
 who-0.023
 living-0.023
 of-0.023
Tokenan
Token ID272
Feature activation-1.04e+00

Pos logit contributions
OM+0.360
riage+0.339
tons+0.337
olog+0.321
 coaster+0.319
Neg logit contributions
 twins-0.250
sm-0.224
 friends-0.224
aky-0.218
 who-0.213
Token thanked
Token ID26280
Feature activation+1.73e+00

Pos logit contributions
OM+0.032
 destination+0.030
tons+0.028
riage+0.028
 coaster+0.028
Neg logit contributions
 twins-0.020
sm-0.018
 living-0.017
 who-0.017
 of-0.016
Token him
Token ID683
Feature activation+1.53e+00

Pos logit contributions
 twins+0.025
sm+0.022
 who+0.022
 of+0.021
 living+0.021
Neg logit contributions
OM-0.062
 destination-0.057
riage-0.056
olog-0.055
 coaster-0.055
Token,
Token ID11
Feature activation-7.80e-01

Pos logit contributions
 twins+0.021
 who+0.019
sm+0.018
 of+0.018
 living+0.018
Neg logit contributions
OM-0.056
 destination-0.051
riage-0.050
 coaster-0.049
ACK-0.049
Token and
Token ID290
Feature activation+3.49e-01

Pos logit contributions
OM+0.027
 destination+0.026
riage+0.025
tons+0.025
overed+0.025
Neg logit contributions
 twins-0.011
sm-0.010
 living-0.009
 who-0.009
 of-0.009
Token pond
Token ID16723
Feature activation-1.97e+00

Pos logit contributions
OM+0.015
overed+0.014
olog+0.014
riage+0.014
 coaster+0.013
Neg logit contributions
 twins-0.009
 living-0.008
 who-0.008
sm-0.008
 of-0.008
Token!
Token ID0
Feature activation-1.69e-01

Pos logit contributions
OM+0.068
 destination+0.063
riage+0.061
tons+0.061
 coaster+0.060
Neg logit contributions
 twins-0.030
sm-0.027
 living-0.026
 who-0.026
 of-0.025
Token
Token ID198
Feature activation+6.84e-01

Pos logit contributions
OM+0.006
 destination+0.006
overed+0.006
tons+0.005
 surely+0.005
Neg logit contributions
 twins-0.002
aky-0.002
sm-0.002
 living-0.002
ons-0.002
Token
Token ID198
Feature activation-8.80e-01

Pos logit contributions
 twins+0.011
 sisters+0.010
 cousins+0.010
 kittens+0.010
 cared+0.009
Neg logit contributions
OM-0.021
overed-0.020
 surely-0.019
tons-0.019
 destination-0.019
TokenThe
Token ID464
Feature activation-1.83e+00

Pos logit contributions
OM+0.026
 destination+0.024
tons+0.023
olog+0.023
 surely+0.023
Neg logit contributions
 twins-0.017
 living-0.015
sm-0.015
 everywhere-0.015
ons-0.015
Token sw
Token ID1509
Feature activation-1.23e+01

Pos logit contributions
OM+0.068
 destination+0.063
 coaster+0.061
riage+0.061
olog+0.061
Neg logit contributions
 twins-0.026
sm-0.022
 who-0.021
 living-0.021
 of-0.020
Tokenan
Token ID272
Feature activation-1.02e+00

Pos logit contributions
OM+0.354
riage+0.327
tons+0.324
 coaster+0.315
olog+0.310
Neg logit contributions
 twins-0.263
sm-0.241
 living-0.230
 friends-0.229
aky-0.226
Token felt
Token ID2936
Feature activation+1.44e+00

Pos logit contributions
OM+0.033
 destination+0.030
riage+0.029
tons+0.029
 coaster+0.028
Neg logit contributions
 twins-0.018
sm-0.017
 who-0.017
 living-0.016
 of-0.016
Token so
Token ID523
Feature activation+2.19e+00

Pos logit contributions
 twins+0.025
sm+0.022
ons+0.022
 living+0.021
 who+0.020
Neg logit contributions
OM-0.045
 destination-0.043
tons-0.042
 surely-0.042
overed-0.041
Token happy
Token ID3772
Feature activation+2.25e+00

Pos logit contributions
 twins+0.031
sm+0.027
 who+0.026
 living+0.026
ons+0.026
Neg logit contributions
OM-0.076
 destination-0.072
tons-0.069
overed-0.069
riage-0.069
Token to
Token ID284
Feature activation-2.04e+00

Pos logit contributions
 twins+0.036
sm+0.033
 living+0.031
 who+0.030
ons+0.030
Neg logit contributions
OM-0.075
 destination-0.070
overed-0.067
tons-0.067
riage-0.067
Token the
Token ID262
Feature activation+7.46e-02

Pos logit contributions
OM+0.042
 destination+0.038
riage+0.037
tons+0.037
overed+0.037
Neg logit contributions
 twins-0.019
 living-0.017
sm-0.017
 who-0.016
 of-0.016
Token park
Token ID3952
Feature activation-1.62e+00

Pos logit contributions
 twins+0.001
 living+0.001
 who+0.001
 many+0.001
 love+0.001
Neg logit contributions
OM-0.003
overed-0.002
riage-0.002
iously-0.002
olog-0.002
Token happily
Token ID18177
Feature activation+4.78e-01

Pos logit contributions
OM+0.058
 destination+0.052
riage+0.052
tons+0.052
ACK+0.051
Neg logit contributions
 twins-0.024
 who-0.022
 of-0.022
 living-0.022
sm-0.021
Token everyday
Token ID10908
Feature activation+1.53e+00

Pos logit contributions
 twins+0.006
 who+0.005
 of+0.005
 living+0.005
sm+0.005
Neg logit contributions
OM-0.019
 destination-0.017
riage-0.017
ACK-0.017
tons-0.017
Token with
Token ID351
Feature activation+3.15e+00

Pos logit contributions
 twins+0.021
sm+0.018
 living+0.017
 who+0.017
 of+0.016
Neg logit contributions
OM-0.055
 destination-0.052
riage-0.050
tons-0.050
 coaster-0.049
Token her
Token ID607
Feature activation+3.61e+00

Pos logit contributions
aky+0.047
 twins+0.045
ak+0.042
able+0.042
sm+0.041
Neg logit contributions
 destination-0.101
tons-0.098
OM-0.093
 surely-0.093
ths-0.091
Token aunt
Token ID25949
Feature activation+7.26e+00

Pos logit contributions
aky+0.063
sm+0.062
ons+0.061
aks+0.060
 twins+0.060
Neg logit contributions
 destination-0.111
OM-0.099
riage-0.093
 coaster-0.093
tons-0.092
Token.
Token ID13
Feature activation-4.37e-01

Pos logit contributions
 twins+0.121
sm+0.114
aky+0.114
aks+0.114
unts+0.112
Neg logit contributions
 destination-0.207
tons-0.198
OM-0.196
riage-0.195
 coaster-0.190
Token
Token ID220
Feature activation-1.52e-01

Pos logit contributions
OM+0.014
 destination+0.014
tons+0.013
riage+0.013
olog+0.013
Neg logit contributions
aky-0.007
 twins-0.006
able-0.006
ons-0.006
ak-0.006
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
OM+0.006
 destination+0.006
overed+0.005
tons+0.005
riage+0.005
Neg logit contributions
 twins-0.001
sm-0.001
 cared-0.001
able-0.001
 who-0.001
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
 twins+0.017
able+0.017
 cared+0.016
 sisters+0.016
 mice+0.015
Neg logit contributions
OM-0.032
overed-0.030
tons-0.029
 coaster-0.029
olog-0.028
Token-
Token ID12
Feature activation-2.53e+00

Pos logit contributions
OM+0.082
 destination+0.074
riage+0.073
overed+0.071
ACK+0.071
Neg logit contributions
 twins-0.061
sm-0.057
 who-0.056
 of-0.055
 living-0.055
Tokencream
Token ID36277
Feature activation-2.17e+00

Pos logit contributions
OM+0.059
 destination+0.053
riage+0.050
tons+0.049
 coaster+0.049
Neg logit contributions
 twins-0.067
sm-0.063
 living-0.061
 who-0.061
 of-0.060
Token for
Token ID329
Feature activation+1.88e+00

Pos logit contributions
OM+0.081
 destination+0.074
ACK+0.073
riage+0.073
overed+0.073
Neg logit contributions
 twins-0.030
 of-0.028
 who-0.027
 living-0.026
 everywhere-0.025
Token Tom
Token ID4186
Feature activation+2.32e+00

Pos logit contributions
 twins+0.031
sm+0.027
 who+0.027
 living+0.026
 of+0.026
Neg logit contributions
OM-0.065
 destination-0.060
riage-0.058
olog-0.058
overed-0.057
Token and
Token ID290
Feature activation+3.50e+00

Pos logit contributions
 twins+0.037
sm+0.036
 sisters+0.032
 cared+0.031
ons+0.031
Neg logit contributions
OM-0.069
 destination-0.068
tons-0.066
 coaster-0.065
 rink-0.063
Token Mia
Token ID32189
Feature activation+3.61e+00

Pos logit contributions
 twins+0.074
 who+0.066
sm+0.066
 living+0.065
 of+0.064
Neg logit contributions
OM-0.105
 destination-0.095
riage-0.091
overed-0.090
 coaster-0.090
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 twins+0.060
sm+0.059
ons+0.054
 cared+0.053
 sisters+0.051
Neg logit contributions
 destination-0.106
OM-0.103
tons-0.099
riage-0.097
olog-0.096
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
 twins+0.019
sm+0.017
 living+0.016
 who+0.016
ons+0.015
Neg logit contributions
OM-0.062
 destination-0.058
riage-0.056
tons-0.056
 coaster-0.055
Token
Token ID198
Feature activation-3.93e-01

Pos logit contributions
 sisters+0.019
 twins+0.019
 cared+0.018
sm+0.016
 kittens+0.016
Neg logit contributions
OM-0.055
 destination-0.051
 rink-0.049
 coaster-0.049
overed-0.049
TokenTom
Token ID13787
Feature activation-1.06e+00

Pos logit contributions
OM+0.012
 destination+0.012
riage+0.011
overed+0.011
tons+0.011
Neg logit contributions
 twins-0.007
sm-0.007
 who-0.007
 of-0.007
 living-0.006
Token and
Token ID290
Feature activation+2.60e-01

Pos logit contributions
OM+0.040
 destination+0.038
tons+0.036
 coaster+0.036
riage+0.036
Neg logit contributions
 twins-0.012
sm-0.011
 who-0.010
 living-0.010
 of-0.010
Token<|endoftext|>
Token ID50256
Feature activation+4.38e+00

Pos logit contributions
 twins+0.025
sm+0.025
ons+0.022
 everywhere+0.022
 living+0.022
Neg logit contributions
OM-0.071
 destination-0.064
riage-0.064
tons-0.064
ACK-0.063
TokenAnna
Token ID31160
Feature activation+1.89e+00

Pos logit contributions
 twins+0.081
sm+0.079
 cared+0.078
 who+0.077
 sisters+0.077
Neg logit contributions
OM-0.124
 destination-0.110
tons-0.109
riage-0.108
 coaster-0.107
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
 twins+0.028
sm+0.025
 of+0.023
 sisters+0.023
 living+0.022
Neg logit contributions
OM-0.064
 destination-0.060
tons-0.058
 coaster-0.058
riage-0.057
Token Ben
Token ID3932
Feature activation+1.77e+00

Pos logit contributions
sm+0.030
aky+0.027
 twins+0.027
able+0.026
ons+0.026
Neg logit contributions
OM-0.025
tons-0.024
 surely-0.022
 destination-0.022
riage-0.022
Token are
Token ID389
Feature activation+1.27e+00

Pos logit contributions
 twins+0.026
sm+0.024
ons+0.023
 of+0.023
 sisters+0.022
Neg logit contributions
OM-0.059
 destination-0.054
tons-0.053
riage-0.053
olog-0.052
Token friends
Token ID2460
Feature activation+8.53e+00

Pos logit contributions
sm+0.024
 cared+0.022
able+0.021
ons+0.021
seed+0.020
Neg logit contributions
 destination-0.035
OM-0.033
tons-0.033
ths-0.032
 surely-0.031
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
ons+0.165
sm+0.163
 twins+0.157
aks+0.156
old+0.153
Neg logit contributions
tons-0.209
 destination-0.202
OM-0.197
riage-0.188
overed-0.188
Token They
Token ID1119
Feature activation+4.04e-01

Pos logit contributions
ons+0.023
sm+0.021
 everywhere+0.019
 twins+0.019
old+0.018
Neg logit contributions
tons-0.038
OM-0.037
riage-0.035
 destination-0.034
 surely-0.033
Token like
Token ID588
Feature activation+2.30e+00

Pos logit contributions
ons+0.009
sm+0.009
 twins+0.009
 mice+0.008
 kittens+0.008
Neg logit contributions
OM-0.009
 surely-0.009
tons-0.009
riage-0.008
 destination-0.008
Token to
Token ID284
Feature activation-3.41e+00

Pos logit contributions
aky+0.045
 twins+0.043
sm+0.043
ady+0.042
able+0.041
Neg logit contributions
OM-0.062
 coaster-0.056
tons-0.055
 destination-0.055
 slope-0.055
Token play
Token ID711
Feature activation-6.08e-01

Pos logit contributions
OM+0.106
 destination+0.098
riage+0.094
tons+0.093
 coaster+0.092
Neg logit contributions
 twins-0.064
sm-0.059
 living-0.056
 who-0.056
 of-0.055
Token with
Token ID351
Feature activation+2.22e+00

Pos logit contributions
OM+0.055
 destination+0.053
overed+0.050
 coaster+0.050
tons+0.049
Neg logit contributions
 twins-0.032
sm-0.028
 sisters-0.028
 cared-0.027
 living-0.026
Token her
Token ID607
Feature activation+3.51e+00

Pos logit contributions
 twins+0.038
 cared+0.037
aky+0.036
lessness+0.032
 company+0.031
Neg logit contributions
ths-0.060
 destination-0.059
 peak-0.059
 top-0.059
 slope-0.057
Token mom
Token ID1995
Feature activation+6.27e+00

Pos logit contributions
sm+0.061
old+0.061
 cared+0.058
aky+0.057
aks+0.055
Neg logit contributions
 destination-0.102
castle-0.097
OM-0.096
ths-0.093
 slope-0.091
Tokenmy
Token ID1820
Feature activation+4.58e+00

Pos logit contributions
sm+0.097
old+0.090
 twins+0.088
unts+0.084
 mice+0.084
Neg logit contributions
 destination-0.186
OM-0.178
castle-0.171
riage-0.168
tons-0.168
Token and
Token ID290
Feature activation+2.83e+00

Pos logit contributions
sm+0.076
 twins+0.076
 cared+0.067
old+0.067
 kisses+0.064
Neg logit contributions
OM-0.135
 destination-0.134
tons-0.128
 coaster-0.127
ER-0.124
Token daddy
Token ID35822
Feature activation+5.08e+00

Pos logit contributions
sm+0.054
 twins+0.050
able+0.049
aky+0.048
aks+0.047
Neg logit contributions
OM-0.085
 destination-0.079
tons-0.076
riage-0.076
ths-0.076
Token.
Token ID13
Feature activation+2.67e-01

Pos logit contributions
 twins+0.092
sm+0.089
 kisses+0.082
 mice+0.082
old+0.081
Neg logit contributions
OM-0.141
 destination-0.139
tons-0.132
 coaster-0.129
castle-0.129
Token One
Token ID1881
Feature activation-2.14e-01

Pos logit contributions
 twins+0.006
sm+0.006
ons+0.005
aky+0.005
able+0.005
Neg logit contributions
OM-0.008
 destination-0.007
riage-0.007
tons-0.007
ACK-0.006
Token day
Token ID1110
Feature activation-2.73e-01

Pos logit contributions
OM+0.007
 destination+0.006
riage+0.006
tons+0.006
 coaster+0.006
Neg logit contributions
 twins-0.004
sm-0.003
 sisters-0.003
ons-0.003
 living-0.003
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
OM+0.007
 destination+0.006
tons+0.006
olog+0.006
overed+0.006
Neg logit contributions
 twins-0.006
sm-0.006
aky-0.006
 who-0.005
 cared-0.005
Token they
Token ID484
Feature activation-6.74e-01

Pos logit contributions
OM+0.060
tons+0.054
 destination+0.054
riage+0.053
overed+0.052
Neg logit contributions
 twins-0.037
sm-0.037
aky-0.036
old-0.035
aks-0.035
Token to
Token ID284
Feature activation-1.77e+00

Pos logit contributions
OM+0.122
 destination+0.112
riage+0.111
tons+0.109
olog+0.108
Neg logit contributions
 twins-0.039
 living-0.035
sm-0.033
 who-0.033
 of-0.033
Token the
Token ID262
Feature activation+1.40e-01

Pos logit contributions
OM+0.063
 destination+0.058
riage+0.056
tons+0.056
 coaster+0.056
Neg logit contributions
 twins-0.026
sm-0.023
 who-0.022
 living-0.022
 of-0.021
Token forest
Token ID8222
Feature activation-2.22e+00

Pos logit contributions
 living+0.003
 twins+0.003
 who+0.002
 many+0.002
 everywhere+0.002
Neg logit contributions
OM-0.005
overed-0.004
olog-0.004
riage-0.004
tons-0.004
Token with
Token ID351
Feature activation+1.93e+00

Pos logit contributions
OM+0.072
 destination+0.069
overed+0.065
tons+0.064
 coaster+0.064
Neg logit contributions
 twins-0.037
sm-0.033
ons-0.031
 sisters-0.030
aky-0.030
Token her
Token ID607
Feature activation+3.12e+00

Pos logit contributions
 twins+0.027
sm+0.026
aky+0.025
able+0.024
 who+0.023
Neg logit contributions
 destination-0.062
OM-0.061
 surely-0.058
tons-0.058
riage-0.056
Token family
Token ID1641
Feature activation+6.60e+00

Pos logit contributions
 twins+0.060
sm+0.059
aky+0.054
 who+0.054
ons+0.053
Neg logit contributions
OM-0.089
 destination-0.087
tons-0.081
riage-0.079
 coaster-0.079
Token.
Token ID13
Feature activation+4.68e-01

Pos logit contributions
 twins+0.111
sm+0.110
aky+0.101
ons+0.098
aks+0.097
Neg logit contributions
 destination-0.193
OM-0.189
tons-0.184
riage-0.178
 coaster-0.172
Token They
Token ID1119
Feature activation-3.27e-01

Pos logit contributions
sm+0.008
 twins+0.007
ons+0.007
aky+0.007
able+0.007
Neg logit contributions
OM-0.015
 destination-0.014
tons-0.014
overed-0.014
ACK-0.014
Token stayed
Token ID9658
Feature activation-2.10e+00

Pos logit contributions
OM+0.011
 destination+0.010
tons+0.010
riage+0.010
 coaster+0.010
Neg logit contributions
 twins-0.005
sm-0.004
 living-0.004
 of-0.004
able-0.004
Token in
Token ID287
Feature activation-1.25e+00

Pos logit contributions
OM+0.066
 destination+0.063
tons+0.060
riage+0.059
overed+0.059
Neg logit contributions
 twins-0.037
sm-0.034
 who-0.032
 living-0.032
 of-0.031
Token a
Token ID257
Feature activation+5.58e-01

Pos logit contributions
 destination+0.036
OM+0.034
tons+0.033
riage+0.032
 surely+0.032
Neg logit contributions
 twins-0.024
aky-0.023
sm-0.023
ons-0.022
 cared-0.022
Token see
Token ID766
Feature activation+1.66e+00

Pos logit contributions
OM+0.076
overed+0.070
 coaster+0.070
 destination+0.069
olog+0.069
Neg logit contributions
 twins-0.021
 of-0.020
 who-0.018
 love-0.018
 living-0.017
Token her
Token ID607
Feature activation+1.76e+00

Pos logit contributions
 twins+0.032
aky+0.031
sm+0.028
aks+0.027
unts+0.027
Neg logit contributions
 destination-0.045
OM-0.043
tons-0.042
riage-0.041
 coaster-0.041
Token.
Token ID13
Feature activation+6.43e-03

Pos logit contributions
 twins+0.030
sm+0.027
 who+0.027
 living+0.026
 of+0.026
Neg logit contributions
OM-0.060
 destination-0.053
riage-0.053
ACK-0.053
olog-0.052
Token They
Token ID1119
Feature activation-8.00e-01

Pos logit contributions
 twins+0.000
sm+0.000
aky+0.000
ons+0.000
 who+0.000
Neg logit contributions
OM-0.000
 destination-0.000
riage-0.000
tons-0.000
overed-0.000
Token hugged
Token ID41130
Feature activation+2.17e+00

Pos logit contributions
OM+0.025
 destination+0.024
tons+0.023
riage+0.023
 surely+0.023
Neg logit contributions
 twins-0.014
sm-0.013
 living-0.012
 who-0.011
ons-0.011
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
 twins+0.037
sm+0.036
aky+0.036
 living+0.033
ons+0.033
Neg logit contributions
OM-0.064
 destination-0.063
tons-0.062
 surely-0.059
riage-0.058
Token laughed
Token ID13818
Feature activation+2.63e+00

Pos logit contributions
 twins+0.039
sm+0.036
 living+0.034
 who+0.034
 of+0.032
Neg logit contributions
OM-0.072
 destination-0.067
riage-0.064
tons-0.064
 coaster-0.063
Token and
Token ID290
Feature activation+1.92e+00

Pos logit contributions
 twins+0.044
sm+0.039
 who+0.038
 living+0.038
 of+0.037
Neg logit contributions
OM-0.088
 destination-0.081
riage-0.079
tons-0.078
 coaster-0.077
Token Jack
Token ID3619
Feature activation+1.45e-01

Pos logit contributions
 twins+0.031
sm+0.028
 who+0.027
 living+0.027
 of+0.026
Neg logit contributions
OM-0.066
 destination-0.060
riage-0.058
ACK-0.058
 coaster-0.057
Token told
Token ID1297
Feature activation+2.78e+00

Pos logit contributions
 twins+0.003
sm+0.003
 living+0.002
 who+0.002
 of+0.002
Neg logit contributions
OM-0.004
 destination-0.004
tons-0.004
 coaster-0.004
riage-0.004
Token her
Token ID607
Feature activation+3.30e+00

Pos logit contributions
 twins+0.035
sm+0.032
 living+0.029
 who+0.029
aky+0.029
Neg logit contributions
OM-0.100
 destination-0.096
tons-0.093
riage-0.092
 coaster-0.091
Token was
Token ID373
Feature activation-6.15e-01

Pos logit contributions
OM+0.038
 destination+0.035
tons+0.033
riage+0.033
 coaster+0.033
Neg logit contributions
 twins-0.024
sm-0.022
 living-0.021
 who-0.021
 sisters-0.020
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
OM+0.016
tons+0.014
ACK+0.013
 coaster+0.013
overed+0.013
Neg logit contributions
 cared-0.012
lessness-0.012
old-0.012
as-0.012
ons-0.012
Token girl
Token ID2576
Feature activation+3.40e+00

Pos logit contributions
 twins+0.014
 who+0.012
 living+0.011
sm+0.011
 of+0.010
Neg logit contributions
OM-0.045
 destination-0.042
riage-0.041
overed-0.041
ACK-0.040
Token named
Token ID3706
Feature activation+2.42e+00

Pos logit contributions
 twins+0.053
ons+0.046
 sisters+0.045
 everywhere+0.045
umbs+0.043
Neg logit contributions
OM-0.099
tons-0.092
riage-0.092
ACK-0.088
 destination-0.088
Token Grace
Token ID16156
Feature activation+1.92e+00

Pos logit contributions
 twins+0.047
sm+0.043
 who+0.041
 living+0.041
ons+0.041
Neg logit contributions
OM-0.074
 destination-0.067
riage-0.065
tons-0.064
olog-0.063
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
ons+0.043
 twins+0.039
aks+0.038
 Some+0.038
 kisses+0.037
Neg logit contributions
riage-0.040
OM-0.040
 destination-0.039
tons-0.039
olog-0.037
Token She
Token ID1375
Feature activation-4.49e-01

Pos logit contributions
ons+0.002
sm+0.002
 twins+0.002
old+0.002
aky+0.002
Neg logit contributions
OM-0.003
 destination-0.003
tons-0.003
olog-0.003
riage-0.003
Token was
Token ID373
Feature activation+2.36e-01

Pos logit contributions
OM+0.016
 coaster+0.015
tons+0.015
 destination+0.014
olog+0.014
Neg logit contributions
 twins-0.005
sm-0.005
ons-0.004
 sisters-0.004
 of-0.004
Token very
Token ID845
Feature activation+2.52e+00

Pos logit contributions
ons+0.004
sm+0.004
able+0.004
aks+0.004
 twins+0.004
Neg logit contributions
OM-0.007
tons-0.007
 surely-0.006
 destination-0.006
ACK-0.006
Token graceful
Token ID44363
Feature activation+1.58e+00

Pos logit contributions
 twins+0.036
sm+0.032
 who+0.030
 living+0.030
 of+0.029
Neg logit contributions
OM-0.089
 destination-0.083
riage-0.080
tons-0.080
overed-0.079
Token and
Token ID290
Feature activation+2.18e+00

Pos logit contributions
 twins+0.021
sm+0.019
 living+0.017
 who+0.017
 of+0.017
Neg logit contributions
OM-0.058
 destination-0.054
tons-0.052
riage-0.052
 coaster-0.052
Token always
Token ID1464
Feature activation-2.56e+00

Pos logit contributions
 twins+0.010
sm+0.009
 who+0.009
 living+0.009
 of+0.009
Neg logit contributions
OM-0.017
 destination-0.016
riage-0.015
tons-0.015
 coaster-0.015
Token ran
Token ID4966
Feature activation-2.60e+00

Pos logit contributions
OM+0.080
 destination+0.072
overed+0.072
 coaster+0.071
ACK+0.070
Neg logit contributions
 twins-0.049
 who-0.048
 living-0.047
 cared-0.046
 love-0.045
Token home
Token ID1363
Feature activation+1.63e+00

Pos logit contributions
OM+0.076
 destination+0.067
riage+0.067
tons+0.064
 coaster+0.064
Neg logit contributions
 twins-0.056
sm-0.052
 of-0.051
 who-0.051
 living-0.051
Token on
Token ID319
Feature activation-5.59e-02

Pos logit contributions
 twins+0.029
sm+0.026
 living+0.025
 who+0.025
 of+0.025
Neg logit contributions
OM-0.053
 destination-0.049
riage-0.047
tons-0.047
 coaster-0.046
Token time
Token ID640
Feature activation+1.56e+00

Pos logit contributions
OM+0.002
 destination+0.002
riage+0.002
tons+0.002
 coaster+0.002
Neg logit contributions
 twins-0.001
sm-0.001
 living-0.001
 who-0.001
 of-0.001
Token after
Token ID706
Feature activation+7.89e-01

Pos logit contributions
 twins+0.021
sm+0.018
 living+0.018
 who+0.018
 of+0.017
Neg logit contributions
OM-0.058
 destination-0.053
riage-0.053
tons-0.052
 coaster-0.051
Token that
Token ID326
Feature activation-1.46e+00

Pos logit contributions
 twins+0.013
sm+0.012
 living+0.012
 of+0.011
 who+0.011
Neg logit contributions
OM-0.026
 destination-0.025
riage-0.024
olog-0.023
tons-0.023
Token.
Token ID13
Feature activation-4.60e-01

Pos logit contributions
OM+0.049
 destination+0.045
riage+0.043
overed+0.043
 coaster+0.043
Neg logit contributions
 twins-0.025
sm-0.022
 living-0.022
 who-0.022
 of-0.022
Token<|endoftext|>
Token ID50256
Feature activation+3.23e+00

Pos logit contributions
OM+0.016
tons+0.014
riage+0.014
 destination+0.014
 surely+0.014
Neg logit contributions
 everywhere-0.006
 twins-0.006
aky-0.006
ons-0.006
sm-0.006
TokenOnce
Token ID7454
Feature activation+4.43e-01

Pos logit contributions
 twins+0.048
 sisters+0.045
sm+0.044
aky+0.043
 cared+0.043
Neg logit contributions
OM-0.108
 destination-0.098
tons-0.095
riage-0.095
 coaster-0.095
Token upon
Token ID2402
Feature activation+6.16e-01

Pos logit contributions
 twins+0.009
sm+0.009
 cared+0.009
 mice+0.009
 chickens+0.009
Neg logit contributions
 destination-0.012
OM-0.012
riage-0.011
overed-0.011
tons-0.011
Token every
Token ID790
Feature activation+9.13e-01

Pos logit contributions
OM+0.026
 destination+0.024
riage+0.023
tons+0.023
 coaster+0.023
Neg logit contributions
 twins-0.016
sm-0.015
 living-0.014
 who-0.014
 of-0.014
Token day
Token ID1110
Feature activation-3.10e-02

Pos logit contributions
 twins+0.015
sm+0.013
 who+0.013
 living+0.013
ons+0.012
Neg logit contributions
OM-0.030
 destination-0.029
riage-0.027
tons-0.027
 coaster-0.027
Token.
Token ID13
Feature activation+4.19e-01

Pos logit contributions
OM+0.001
 destination+0.001
tons+0.001
overed+0.001
 coaster+0.001
Neg logit contributions
 twins-0.001
sm-0.000
aky-0.000
 cared-0.000
 kisses-0.000
Token<|endoftext|>
Token ID50256
Feature activation+4.76e+00

Pos logit contributions
aky+0.008
 seen+0.008
able+0.007
ady+0.007
old+0.007
Neg logit contributions
OM-0.011
 surely-0.010
tons-0.010
riage-0.009
Wo-0.009
TokenOnce
Token ID7454
Feature activation+1.27e+00

Pos logit contributions
 sisters+0.071
aky+0.066
 twins+0.065
 cared+0.064
sm+0.061
Neg logit contributions
OM-0.156
 destination-0.141
 coaster-0.138
overed-0.137
 rink-0.136
Token upon
Token ID2402
Feature activation+1.11e+00

Pos logit contributions
 cared+0.028
 mice+0.026
sm+0.026
 chickens+0.026
 twins+0.026
Neg logit contributions
 destination-0.030
OM-0.030
tons-0.027
 surely-0.027
overed-0.027
Token a
Token ID257
Feature activation+6.16e-01

Pos logit contributions
 kittens+0.026
 sisters+0.026
ison+0.025
ann+0.025
ons+0.025
Neg logit contributions
OM-0.025
 coaster-0.021
 rink-0.019
tons-0.019
achable-0.018
Token time
Token ID640
Feature activation-4.00e-01

Pos logit contributions
 twins+0.014
 sisters+0.013
ons+0.013
 kittens+0.013
 mice+0.012
Neg logit contributions
OM-0.017
 destination-0.014
tons-0.014
 coaster-0.014
AP-0.013
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
OM+0.011
 destination+0.009
tons+0.009
overed+0.009
riage+0.009
Neg logit contributions
 twins-0.009
sm-0.008
 cared-0.008
aky-0.008
 sisters-0.008
Token there
Token ID612
Feature activation-1.02e+00

Pos logit contributions
OM+0.027
tons+0.025
 surely+0.024
overed+0.024
riage+0.024
Neg logit contributions
 kisses-0.020
 twins-0.020
ons-0.019
old-0.019
 sisters-0.019
Token was
Token ID373
Feature activation-3.71e-01

Pos logit contributions
OM+0.030
 destination+0.027
tons+0.026
riage+0.025
 coaster+0.025
Neg logit contributions
 twins-0.021
sm-0.020
 sisters-0.019
aky-0.019
 who-0.018
Token day
Token ID1110
Feature activation+1.00e+00

Pos logit contributions
 twins+0.016
sm+0.015
ons+0.014
 who+0.014
 sisters+0.014
Neg logit contributions
OM-0.027
 destination-0.025
tons-0.024
riage-0.024
 surely-0.024
Token,
Token ID11
Feature activation-9.51e-01

Pos logit contributions
 twins+0.019
sm+0.017
 living+0.016
 who+0.016
 of+0.016
Neg logit contributions
OM-0.031
 destination-0.029
riage-0.028
tons-0.028
 coaster-0.028
Token Jack
Token ID3619
Feature activation-1.09e+00

Pos logit contributions
OM+0.033
tons+0.031
overed+0.030
 destination+0.030
 coaster+0.029
Neg logit contributions
 twins-0.014
sm-0.012
 sisters-0.012
ons-0.012
 living-0.011
Token went
Token ID1816
Feature activation-2.29e+00

Pos logit contributions
OM+0.027
 coaster+0.024
 destination+0.024
ACK+0.024
tons+0.024
Neg logit contributions
 twins-0.023
 sisters-0.021
sm-0.021
 company-0.020
 mice-0.020
Token fishing
Token ID12478
Feature activation-3.25e+00

Pos logit contributions
OM+0.072
 destination+0.068
 coaster+0.065
tons+0.064
ER+0.064
Neg logit contributions
 twins-0.039
sm-0.036
 cared-0.034
aky-0.034
 who-0.033
Token with
Token ID351
Feature activation+2.93e+00

Pos logit contributions
OM+0.112
 destination+0.103
riage+0.101
tons+0.101
olog+0.099
Neg logit contributions
 twins-0.052
 who-0.048
 of-0.047
sm-0.046
 living-0.046
Token his
Token ID465
Feature activation+2.43e+00

Pos logit contributions
sm+0.065
able+0.058
 twins+0.058
 who+0.057
aky+0.056
Neg logit contributions
 destination-0.072
OM-0.070
 coaster-0.065
tons-0.064
castle-0.064
Token dad
Token ID9955
Feature activation+5.52e+00

Pos logit contributions
sm+0.037
 twins+0.034
ons+0.032
 who+0.031
able+0.031
Neg logit contributions
OM-0.081
 destination-0.080
tons-0.074
 coaster-0.073
riage-0.072
Token.
Token ID13
Feature activation-3.18e-01

Pos logit contributions
 twins+0.087
sm+0.082
 living+0.072
ons+0.072
 cared+0.072
Neg logit contributions
OM-0.178
 destination-0.171
tons-0.165
riage-0.164
 coaster-0.160
Token Jack
Token ID3619
Feature activation-9.72e-01

Pos logit contributions
OM+0.012
 destination+0.012
riage+0.011
tons+0.011
 coaster+0.011
Neg logit contributions
 twins-0.004
 living-0.003
 of-0.003
 who-0.003
sm-0.003
Token loved
Token ID6151
Feature activation+2.23e+00

Pos logit contributions
OM+0.033
 destination+0.031
tons+0.029
riage+0.029
 coaster+0.029
Neg logit contributions
 twins-0.015
sm-0.014
 living-0.013
 who-0.013
 of-0.013
Token plants
Token ID6134
Feature activation+1.69e+00

Pos logit contributions
old+0.022
sm+0.022
 twins+0.022
 who+0.021
ons+0.021
Neg logit contributions
OM-0.027
 destination-0.025
 slope-0.025
castle-0.023
ACK-0.023
Token.
Token ID13
Feature activation+8.95e-01

Pos logit contributions
 twins+0.028
sm+0.027
old+0.024
 cared+0.024
ons+0.024
Neg logit contributions
OM-0.051
 destination-0.050
overed-0.048
tons-0.047
 slope-0.047
Token Sara
Token ID24799
Feature activation+9.84e-01

Pos logit contributions
 twins+0.012
 who+0.010
 living+0.010
sm+0.010
 of+0.010
Neg logit contributions
OM-0.033
 destination-0.031
 coaster-0.030
riage-0.030
olog-0.030
Token saw
Token ID2497
Feature activation+5.91e-01

Pos logit contributions
 twins+0.015
sm+0.013
 of+0.012
 living+0.012
 who+0.012
Neg logit contributions
OM-0.033
 destination-0.031
tons-0.030
 coaster-0.030
olog-0.030
Token a
Token ID257
Feature activation+2.57e-01

Pos logit contributions
sm+0.011
aky+0.010
able+0.010
 twins+0.010
aks+0.010
Neg logit contributions
OM-0.016
overed-0.015
tons-0.014
ACK-0.014
 destination-0.014
Token big
Token ID1263
Feature activation-2.08e+00

Pos logit contributions
 twins+0.003
 living+0.003
 who+0.003
sm+0.003
 of+0.003
Neg logit contributions
OM-0.010
 destination-0.009
riage-0.009
overed-0.009
tons-0.009
Token purple
Token ID14032
Feature activation-1.93e+00

Pos logit contributions
OM+0.073
riage+0.065
overed+0.064
 destination+0.064
olog+0.064
Neg logit contributions
 twins-0.036
 living-0.033
 who-0.031
 of-0.031
 everywhere-0.030
Token pr
Token ID778
Feature activation-1.02e+01

Pos logit contributions
OM+0.066
riage+0.062
olog+0.062
 destination+0.061
ths+0.060
Neg logit contributions
 twins-0.032
 living-0.029
 of-0.029
 who-0.029
 everywhere-0.027
Tokenune
Token ID1726
Feature activation-1.04e+00

Pos logit contributions
OM+0.255
riage+0.240
 destination+0.237
tons+0.232
 coaster+0.230
Neg logit contributions
 twins-0.240
sm-0.226
 living-0.220
 of-0.218
ons-0.217
Token on
Token ID319
Feature activation-1.16e+00

Pos logit contributions
OM+0.041
 destination+0.039
riage+0.038
olog+0.037
ACK+0.036
Neg logit contributions
 twins-0.012
 of-0.012
 living-0.011
 who-0.011
sm-0.010
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
OM+0.044
 destination+0.041
riage+0.040
olog+0.040
 coaster+0.039
Neg logit contributions
 twins-0.014
 who-0.012
sm-0.012
 living-0.012
 of-0.012
Token,
Token ID11
Feature activation-2.29e+00

Pos logit contributions
OM+0.014
 destination+0.012
riage+0.012
tons+0.012
overed+0.012
Neg logit contributions
 twins-0.009
sm-0.008
 living-0.008
 who-0.008
 of-0.008
Token she
Token ID673
Feature activation-1.46e+00

Pos logit contributions
OM+0.077
 destination+0.070
riage+0.068
overed+0.068
tons+0.068
Neg logit contributions
 twins-0.036
aky-0.034
sm-0.034
aks-0.032
 living-0.031
Token saw
Token ID2497
Feature activation+2.59e-01

Pos logit contributions
OM+0.050
 destination+0.047
riage+0.045
tons+0.045
 coaster+0.044
Neg logit contributions
 twins-0.022
sm-0.020
 living-0.019
 who-0.019
 of-0.018
Token a
Token ID257
Feature activation+1.49e-01

Pos logit contributions
aky+0.004
 twins+0.004
able+0.004
aks+0.004
unts+0.004
Neg logit contributions
OM-0.008
overed-0.007
tons-0.007
ACK-0.007
 destination-0.007
Token big
Token ID1263
Feature activation-2.12e+00

Pos logit contributions
 twins+0.002
 living+0.002
 who+0.002
sm+0.001
 everywhere+0.001
Neg logit contributions
OM-0.006
riage-0.005
overed-0.005
 destination-0.005
ACK-0.005
Token box
Token ID3091
Feature activation-2.50e+00

Pos logit contributions
OM+0.075
overed+0.068
riage+0.067
ider+0.066
olog+0.066
Neg logit contributions
 twins-0.037
 living-0.034
 of-0.032
 who-0.032
 everywhere-0.031
Token on
Token ID319
Feature activation-1.61e+00

Pos logit contributions
OM+0.085
 destination+0.079
riage+0.076
tons+0.076
overed+0.076
Neg logit contributions
 twins-0.039
sm-0.035
 living-0.032
 who-0.032
 sisters-0.031
Token the
Token ID262
Feature activation-1.89e+00

Pos logit contributions
OM+0.060
 destination+0.055
riage+0.054
olog+0.054
overed+0.053
Neg logit contributions
 twins-0.022
 who-0.018
sm-0.018
 living-0.018
 of-0.018
Token floor
Token ID4314
Feature activation-1.54e+00

Pos logit contributions
OM+0.044
 destination+0.039
riage+0.038
overed+0.038
tons+0.037
Neg logit contributions
 twins-0.051
 living-0.048
 who-0.047
sm-0.046
 everywhere-0.045
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
OM+0.057
 destination+0.054
riage+0.052
tons+0.052
 coaster+0.051
Neg logit contributions
 twins-0.019
sm-0.017
 who-0.016
 living-0.016
 of-0.016
Token She
Token ID1375
Feature activation-6.62e-01

Pos logit contributions
 twins+0.003
sm+0.003
aky+0.003
able+0.003
 sisters+0.002
Neg logit contributions
OM-0.007
 destination-0.007
ACK-0.006
riage-0.006
olog-0.006
Token both
Token ID1111
Feature activation+1.46e+00

Pos logit contributions
OM+0.007
riage+0.006
 destination+0.006
tons+0.006
 surely+0.006
Neg logit contributions
sm-0.004
 twins-0.004
ons-0.003
old-0.003
 kisses-0.003
Token wanted
Token ID2227
Feature activation+9.85e-01

Pos logit contributions
 twins+0.022
ons+0.020
sm+0.019
 everywhere+0.019
 mice+0.018
Neg logit contributions
OM-0.045
riage-0.042
tons-0.041
 destination-0.040
olog-0.040
Token to
Token ID284
Feature activation-2.11e+00

Pos logit contributions
 twins+0.017
sm+0.017
aky+0.017
able+0.016
ons+0.015
Neg logit contributions
OM-0.030
 destination-0.028
tons-0.027
 coaster-0.026
ACK-0.026
Token learn
Token ID2193
Feature activation+3.49e+00

Pos logit contributions
OM+0.066
 destination+0.060
overed+0.060
riage+0.060
tons+0.059
Neg logit contributions
 twins-0.040
 of-0.035
 who-0.035
 living-0.034
sm-0.034
Token how
Token ID703
Feature activation+1.92e+00

Pos logit contributions
 twins+0.059
sm+0.055
 living+0.051
aky+0.050
 who+0.050
Neg logit contributions
OM-0.112
 destination-0.105
riage-0.101
tons-0.099
 coaster-0.099
Token to
Token ID284
Feature activation-2.95e+00

Pos logit contributions
 twins+0.030
sm+0.026
aky+0.026
 who+0.025
 living+0.025
Neg logit contributions
OM-0.063
 destination-0.060
riage-0.058
olog-0.057
tons-0.057
Token spell
Token ID4822
Feature activation-9.52e-01

Pos logit contributions
OM+0.096
overed+0.091
ER+0.089
ACK+0.088
riage+0.088
Neg logit contributions
 twins-0.049
 who-0.047
 love-0.046
 of-0.045
 understand-0.044
Token.
Token ID13
Feature activation+6.72e-01

Pos logit contributions
OM+0.032
 destination+0.030
riage+0.029
tons+0.028
 coaster+0.028
Neg logit contributions
 twins-0.015
sm-0.014
 living-0.013
 who-0.013
 of-0.013
Token
Token ID198
Feature activation+1.40e+00

Pos logit contributions
sm+0.009
 twins+0.008
aky+0.007
 living+0.007
 of+0.007
Neg logit contributions
OM-0.024
 destination-0.024
riage-0.023
ACK-0.022
olog-0.022
Token
Token ID198
Feature activation-6.20e-01

Pos logit contributions
 sisters+0.021
 twins+0.021
sm+0.021
 chickens+0.020
 cared+0.020
Neg logit contributions
OM-0.045
 destination-0.043
overed-0.039
AP-0.038
riage-0.038
TokenSo
Token ID2396
Feature activation-3.54e+00

Pos logit contributions
OM+0.016
 destination+0.015
olog+0.014
 coaster+0.014
riage+0.014
Neg logit contributions
 twins-0.014
sm-0.014
 living-0.013
 sisters-0.012
 who-0.012
Token smiled
Token ID13541
Feature activation+2.84e+00

Pos logit contributions
 twins+0.024
 who+0.022
 cared+0.021
 living+0.021
 were+0.021
Neg logit contributions
OM-0.056
 destination-0.051
riage-0.049
raper-0.049
ACK-0.048
Token.
Token ID13
Feature activation+9.19e-01

Pos logit contributions
 twins+0.037
 who+0.034
 of+0.033
 living+0.030
 everywhere+0.030
Neg logit contributions
OM-0.112
riage-0.099
 coaster-0.099
ACK-0.098
 destination-0.097
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
 twins+0.011
sm+0.009
 who+0.009
 living+0.008
 of+0.008
Neg logit contributions
OM-0.035
 destination-0.033
riage-0.032
tons-0.032
 coaster-0.032
Token
Token ID198
Feature activation-3.16e-01

Pos logit contributions
 twins+0.014
 sisters+0.011
sm+0.011
 cared+0.011
 living+0.010
Neg logit contributions
OM-0.053
 destination-0.048
overed-0.048
tons-0.048
 coaster-0.047
TokenThen
Token ID6423
Feature activation-1.99e+00

Pos logit contributions
OM+0.010
 destination+0.010
riage+0.009
ACK+0.009
tons+0.009
Neg logit contributions
 twins-0.005
sm-0.005
 of-0.005
 who-0.005
 living-0.005
Token it
Token ID340
Feature activation-2.71e+00

Pos logit contributions
OM+0.074
overed+0.066
olog+0.066
riage+0.065
 destination+0.064
Neg logit contributions
 twins-0.028
 who-0.025
 of-0.025
 friends-0.025
 many-0.024
Token was
Token ID373
Feature activation-1.15e+00

Pos logit contributions
OM+0.099
riage+0.095
ident+0.093
ACK+0.091
tons+0.090
Neg logit contributions
 of-0.038
 were-0.036
 cared-0.034
sm-0.033
 twins-0.033
Token time
Token ID640
Feature activation+2.27e-01

Pos logit contributions
OM+0.043
 destination+0.039
olog+0.039
riage+0.039
 coaster+0.038
Neg logit contributions
 twins-0.015
sm-0.013
 of-0.013
 living-0.013
 who-0.012
Token to
Token ID284
Feature activation-4.14e+00

Pos logit contributions
 twins+0.004
sm+0.003
 living+0.003
 who+0.003
 of+0.003
Neg logit contributions
OM-0.008
 destination-0.007
riage-0.007
tons-0.007
 coaster-0.007
Token go
Token ID467
Feature activation-4.09e+00

Pos logit contributions
OM+0.136
ider+0.136
overed+0.133
ths+0.126
olog+0.124
Neg logit contributions
 twins-0.067
 love-0.063
 of-0.062
 family-0.060
 her-0.059
Token home
Token ID1363
Feature activation+1.34e+00

Pos logit contributions
OM+0.133
riage+0.117
 destination+0.117
tons+0.117
olog+0.116
Neg logit contributions
 twins-0.074
 of-0.068
 who-0.066
 living-0.066
sm-0.066
Tokenman
Token ID805
Feature activation+2.78e+00

Pos logit contributions
OM+0.192
 destination+0.177
riage+0.171
tons+0.170
 coaster+0.168
Neg logit contributions
 twins-0.114
sm-0.103
 who-0.100
 living-0.100
 of-0.097
Token.
Token ID13
Feature activation+6.57e-01

Pos logit contributions
 twins+0.046
sm+0.043
able+0.038
aky+0.038
 sisters+0.038
Neg logit contributions
OM-0.088
 destination-0.086
olog-0.081
 coaster-0.080
ACK-0.080
Token He
Token ID679
Feature activation-1.08e+00

Pos logit contributions
 twins+0.008
+0.007
 of+0.007
 many+0.007
 who+0.007
Neg logit contributions
OM-0.024
 destination-0.023
tons-0.023
riage-0.023
ths-0.023
Token swung
Token ID28507
Feature activation-2.16e+00

Pos logit contributions
OM+0.036
riage+0.033
 destination+0.033
overed+0.032
tons+0.031
Neg logit contributions
 twins-0.019
 who-0.018
 cared-0.017
sm-0.017
 of-0.016
Token the
Token ID262
Feature activation-3.10e-01

Pos logit contributions
OM+0.079
riage+0.073
 destination+0.073
olog+0.070
tons+0.070
Neg logit contributions
 twins-0.030
 of-0.028
 everywhere-0.028
sm-0.027
 who-0.027
Token axe
Token ID23272
Feature activation-2.51e+00

Pos logit contributions
OM+0.008
 destination+0.007
 coaster+0.007
ACK+0.007
 slope+0.007
Neg logit contributions
 twins-0.007
sm-0.007
 who-0.007
 cared-0.006
 sisters-0.006
Token and
Token ID290
Feature activation+3.01e-01

Pos logit contributions
OM+0.098
riage+0.094
 destination+0.088
tons+0.086
olog+0.086
Neg logit contributions
 everywhere-0.031
 of-0.030
 who-0.028
 twins-0.027
 many-0.026
Token chopped
Token ID20720
Feature activation-2.85e+00

Pos logit contributions
 twins+0.005
 who+0.005
 everywhere+0.005
 living+0.005
 of+0.005
Neg logit contributions
OM-0.010
 destination-0.009
riage-0.009
olog-0.009
raper-0.009
Token off
Token ID572
Feature activation+4.61e-01

Pos logit contributions
OM+0.096
riage+0.089
 destination+0.089
tons+0.086
olog+0.085
Neg logit contributions
 twins-0.048
 of-0.044
 everywhere-0.044
sm-0.043
 many-0.042
Token the
Token ID262
Feature activation-8.05e-01

Pos logit contributions
 twins+0.007
sm+0.006
 who+0.006
 living+0.006
 of+0.006
Neg logit contributions
OM-0.016
 destination-0.015
riage-0.015
tons-0.014
 coaster-0.014
Token snow
Token ID6729
Feature activation-6.58e+00

Pos logit contributions
OM+0.026
 destination+0.024
riage+0.023
overed+0.023
tons+0.023
Neg logit contributions
 twins-0.015
 living-0.013
sm-0.013
 who-0.013
 of-0.012
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
 twins+0.017
 who+0.016
 everywhere+0.016
 of+0.016
 living+0.015
Neg logit contributions
OM-0.063
riage-0.059
 destination-0.058
tons-0.057
ACK-0.057
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
 twins+0.010
sm+0.009
 who+0.009
 living+0.009
 of+0.009
Neg logit contributions
OM-0.022
 destination-0.020
riage-0.020
tons-0.019
overed-0.019
Token
Token ID198
Feature activation-5.35e-01

Pos logit contributions
 twins+0.025
 sisters+0.024
 kittens+0.021
 mice+0.021
 cared+0.021
Neg logit contributions
OM-0.048
 destination-0.043
 coaster-0.041
tons-0.040
overed-0.040
Tokenâ
Token ID22940
Feature activation-8.26e+00

Pos logit contributions
OM+0.017
 destination+0.015
riage+0.015
tons+0.015
 coaster+0.015
Neg logit contributions
 twins-0.010
sm-0.009
 living-0.009
 who-0.009
 of-0.009
Token€
Token ID26391
Feature activation-4.94e+00

Pos logit contributions
 destination+0.179
OM+0.174
overed+0.164
ths+0.159
castle+0.157
Neg logit contributions
 twins-0.208
 who-0.205
sm-0.203
 cared-0.202
 living-0.201
Token�
Token ID129
Feature activation-8.22e+00

Pos logit contributions
OM+0.141
 destination+0.125
 coaster+0.123
olog+0.118
ER+0.115
Neg logit contributions
 twins-0.111
 sisters-0.101
sm-0.101
 of-0.099
 living-0.097
Token�
Token ID241
Feature activation-2.32e+00

Pos logit contributions
OM+0.184
riage+0.168
umm+0.155
 destination+0.153
 surely+0.152
Neg logit contributions
 twins-0.216
 who-0.213
 living-0.209
 everywhere-0.201
-0.200
TokenGood
Token ID10248
Feature activation-3.97e+00

Pos logit contributions
OM+0.069
 destination+0.060
tons+0.060
 coaster+0.059
riage+0.059
Neg logit contributions
 twins-0.047
sm-0.043
 sisters-0.043
ons-0.042
 living-0.042
Token morning
Token ID3329
Feature activation+8.11e-02

Pos logit contributions
OM+0.114
overed+0.109
 destination+0.107
raper+0.104
riage+0.102
Neg logit contributions
 twins-0.090
 of-0.076
 living-0.075
 friends-0.075
 who-0.074
Token!
Token ID0
Feature activation-1.99e+00

Pos logit contributions
 twins+0.001
 of+0.001
 who+0.001
 living+0.001
 everywhere+0.001
Neg logit contributions
OM-0.003
overed-0.003
riage-0.003
ACK-0.003
 destination-0.003
Tokenâ
Token ID22940
Feature activation-7.44e+00

Pos logit contributions
OM+0.060
 destination+0.057
riage+0.053
overed+0.053
 coaster+0.053
Neg logit contributions
 twins-0.039
sm-0.036
 who-0.035
 of-0.035
 living-0.035
Token driver
Token ID4639
Feature activation-9.42e-02

Pos logit contributions
OM+0.054
 destination+0.048
overed+0.047
riage+0.047
olog+0.047
Neg logit contributions
 twins-0.041
 living-0.036
sm-0.035
 who-0.035
 of-0.035
Token smiled
Token ID13541
Feature activation+2.19e+00

Pos logit contributions
OM+0.003
 destination+0.003
riage+0.003
tons+0.003
ACK+0.003
Neg logit contributions
 twins-0.002
 who-0.001
sm-0.001
 of-0.001
 living-0.001
Token back
Token ID736
Feature activation-4.14e-01

Pos logit contributions
 twins+0.025
 who+0.022
 of+0.022
sm+0.020
 living+0.020
Neg logit contributions
OM-0.088
 destination-0.079
riage-0.079
 coaster-0.079
ACK-0.077
Token.
Token ID13
Feature activation-5.58e-01

Pos logit contributions
OM+0.018
riage+0.016
 destination+0.016
olog+0.016
ACK+0.016
Neg logit contributions
 twins-0.004
 who-0.003
 of-0.003
 living-0.003
sm-0.003
Token �
Token ID6184
Feature activation-7.74e+00

Pos logit contributions
OM+0.021
 destination+0.020
 coaster+0.019
ths+0.019
riage+0.019
Neg logit contributions
 twins-0.007
 of-0.006
 living-0.006
 who-0.006
sm-0.006
Token�
Token ID95
Feature activation-8.65e+00

Pos logit contributions
 destination+0.159
ACK+0.156
OM+0.152
ths+0.135
 avail+0.134
Neg logit contributions
sm-0.206
 who-0.201
-0.190
old-0.187
 cared-0.183
Token€
Token ID26391
Feature activation-4.02e+00

Pos logit contributions
ths+0.205
 destination+0.202
OM+0.197
overed+0.197
 avail+0.192
Neg logit contributions
 who-0.185
 living-0.184
sm-0.181
 cared-0.181
aks-0.181
Token�
Token ID129
Feature activation-7.98e+00

Pos logit contributions
OM+0.115
 destination+0.105
riage+0.101
tons+0.100
 coaster+0.099
Neg logit contributions
 twins-0.085
sm-0.079
 who-0.076
 living-0.076
 of-0.075
Token�
Token ID241
Feature activation-1.40e+00

Pos logit contributions
OM+0.192
riage+0.163
ACK+0.162
ths+0.162
 destination+0.160
Neg logit contributions
 love-0.190
 living-0.187
 twins-0.186
 loved-0.185
aks-0.185
TokenDon
Token ID3987
Feature activation-3.37e+00

Pos logit contributions
OM+0.041
 destination+0.038
riage+0.036
tons+0.035
 coaster+0.035
Neg logit contributions
 twins-0.029
sm-0.027
 who-0.026
 living-0.026
 of-0.025
Tokenâ
Token ID22940
Feature activation-8.05e+00

Pos logit contributions
OM+0.105
 destination+0.104
ACK+0.103
castle+0.098
ident+0.097
Neg logit contributions
 of-0.067
-0.058
 twins-0.056
sm-0.055
 who-0.053
Token in
Token ID287
Feature activation+1.17e-01

Pos logit contributions
OM+0.067
 destination+0.062
riage+0.060
 coaster+0.059
olog+0.059
Neg logit contributions
 twins-0.035
sm-0.031
 who-0.031
 living-0.030
 of-0.030
Token the
Token ID262
Feature activation+1.82e+00

Pos logit contributions
 twins+0.002
sm+0.002
 who+0.002
 cared+0.002
 living+0.002
Neg logit contributions
OM-0.004
 destination-0.004
tons-0.003
riage-0.003
ACK-0.003
Token sun
Token ID4252
Feature activation+5.34e-01

Pos logit contributions
 twins+0.035
sm+0.032
 who+0.031
 living+0.031
 of+0.030
Neg logit contributions
OM-0.055
 destination-0.051
riage-0.048
tons-0.048
 coaster-0.048
Token.
Token ID13
Feature activation+6.86e-01

Pos logit contributions
 twins+0.008
sm+0.007
 who+0.006
 living+0.006
ons+0.006
Neg logit contributions
OM-0.019
 destination-0.017
riage-0.017
tons-0.017
 coaster-0.016
Token The
Token ID383
Feature activation-2.64e-01

Pos logit contributions
 twins+0.013
sm+0.013
aky+0.012
 sisters+0.012
able+0.012
Neg logit contributions
OM-0.021
 destination-0.019
tons-0.018
riage-0.018
ACK-0.018
Token cher
Token ID21382
Feature activation-8.61e+00

Pos logit contributions
OM+0.008
 destination+0.007
riage+0.007
tons+0.007
 coaster+0.007
Neg logit contributions
 twins-0.005
sm-0.005
 who-0.005
 living-0.005
 of-0.005
Tokenries
Token ID1678
Feature activation-1.03e+00

Pos logit contributions
OM+0.221
 destination+0.201
riage+0.191
tons+0.188
overed+0.187
Neg logit contributions
 twins-0.207
sm-0.192
 living-0.187
 who-0.187
 of-0.182
Token on
Token ID319
Feature activation-2.08e+00

Pos logit contributions
OM+0.033
 destination+0.031
 surely+0.031
overed+0.029
tons+0.029
Neg logit contributions
 twins-0.018
sm-0.016
old-0.015
 sisters-0.014
aky-0.014
Token the
Token ID262
Feature activation-1.71e+00

Pos logit contributions
OM+0.070
 destination+0.064
riage+0.063
olog+0.062
overed+0.062
Neg logit contributions
 twins-0.034
 who-0.030
sm-0.030
 living-0.030
 of-0.029
Token tree
Token ID5509
Feature activation-2.92e+00

Pos logit contributions
OM+0.044
 destination+0.040
riage+0.038
tons+0.037
 coaster+0.037
Neg logit contributions
 twins-0.042
sm-0.039
 living-0.038
 who-0.038
 of-0.037
Token were
Token ID547
Feature activation+1.12e+00

Pos logit contributions
OM+0.089
 destination+0.082
overed+0.077
 surely+0.077
riage+0.077
Neg logit contributions
 twins-0.055
sm-0.049
 sisters-0.045
old-0.043
able-0.043
Token!
Token ID0
Feature activation-4.42e-01

Pos logit contributions
OM+0.006
 destination+0.006
tons+0.006
riage+0.006
overed+0.006
Neg logit contributions
 twins-0.003
sm-0.002
 living-0.002
 who-0.002
ons-0.002
Token
Token ID220
Feature activation-3.30e-01

Pos logit contributions
OM+0.016
 destination+0.015
tons+0.014
riage+0.014
overed+0.014
Neg logit contributions
 twins-0.006
sm-0.006
 living-0.005
 who-0.005
ons-0.005
Token
Token ID198
Feature activation+5.54e-01

Pos logit contributions
OM+0.012
overed+0.011
 destination+0.011
olog+0.010
AP+0.010
Neg logit contributions
 twins-0.004
 cared-0.004
sm-0.004
able-0.004
 sisters-0.004
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 twins+0.007
 sisters+0.006
sm+0.006
 cared+0.006
 everywhere+0.006
Neg logit contributions
OM-0.019
 destination-0.018
tons-0.017
overed-0.017
olog-0.017
TokenSarah
Token ID29284
Feature activation-1.28e+00

Pos logit contributions
OM+0.031
 destination+0.028
tons+0.027
olog+0.027
riage+0.027
Neg logit contributions
 twins-0.022
sm-0.020
 everywhere-0.019
 living-0.019
ons-0.019
Token tw
Token ID665
Feature activation-8.20e+00

Pos logit contributions
OM+0.041
 destination+0.038
riage+0.037
tons+0.036
 coaster+0.036
Neg logit contributions
 twins-0.022
 who-0.021
sm-0.020
 living-0.020
 of-0.020
Tokenirled
Token ID49376
Feature activation-4.12e-01

Pos logit contributions
OM+0.157
riage+0.133
 destination+0.130
 coaster+0.127
ACK+0.125
Neg logit contributions
 twins-0.249
sm-0.242
ons-0.238
 of-0.236
 who-0.234
Token and
Token ID290
Feature activation+9.10e-01

Pos logit contributions
OM+0.015
riage+0.014
 destination+0.014
ACK+0.013
tons+0.013
Neg logit contributions
 twins-0.006
 of-0.005
sm-0.005
 everywhere-0.005
 who-0.005
Token spun
Token ID26843
Feature activation-2.34e+00

Pos logit contributions
 twins+0.016
 everywhere+0.016
 living+0.016
 who+0.016
 of+0.016
Neg logit contributions
OM-0.029
 destination-0.027
riage-0.026
raper-0.025
ACK-0.025
Token and
Token ID290
Feature activation+8.66e-01

Pos logit contributions
OM+0.087
riage+0.080
 destination+0.077
ACK+0.075
olog+0.074
Neg logit contributions
 everywhere-0.033
 twins-0.033
 of-0.032
 who-0.031
sm-0.030
Token felt
Token ID2936
Feature activation+1.76e+00

Pos logit contributions
 everywhere+0.015
 living+0.015
 of+0.014
 twins+0.014
 who+0.014
Neg logit contributions
OM-0.029
 destination-0.026
riage-0.026
raper-0.025
tons-0.025
Token."
Token ID526
Feature activation+9.99e-01

Pos logit contributions
OM+0.022
 destination+0.020
ACK+0.019
riage+0.019
overed+0.019
Neg logit contributions
 twins-0.010
 who-0.009
 of-0.009
 living-0.009
sm-0.008
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
 twins+0.013
sm+0.012
 living+0.011
ons+0.011
 everywhere+0.011
Neg logit contributions
OM-0.035
 destination-0.033
tons-0.033
riage-0.032
overed-0.032
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
 twins+0.004
 everywhere+0.004
 sisters+0.004
 cared+0.004
sm+0.004
Neg logit contributions
OM-0.013
 destination-0.012
tons-0.012
 coaster-0.012
overed-0.012
TokenThe
Token ID464
Feature activation-2.22e+00

Pos logit contributions
OM+0.034
 destination+0.030
tons+0.030
riage+0.029
 coaster+0.029
Neg logit contributions
 twins-0.024
sm-0.021
 living-0.021
 everywhere-0.021
 of-0.020
Token charming
Token ID23332
Feature activation-3.97e+00

Pos logit contributions
OM+0.072
 destination+0.066
riage+0.064
tons+0.063
overed+0.063
Neg logit contributions
 twins-0.040
sm-0.036
 living-0.035
 who-0.035
 of-0.034
Token le
Token ID443
Feature activation-8.12e+00

Pos logit contributions
OM+0.120
 destination+0.114
riage+0.108
tons+0.105
 coaster+0.105
Neg logit contributions
 twins-0.072
sm-0.069
 who-0.064
ons-0.064
 living-0.063
Tokenopard
Token ID15478
Feature activation-1.06e+00

Pos logit contributions
OM+0.192
 destination+0.179
riage+0.167
tons+0.164
 coaster+0.160
Neg logit contributions
 twins-0.216
sm-0.203
 who-0.199
 living-0.196
 of-0.193
Token did
Token ID750
Feature activation-2.90e+00

Pos logit contributions
OM+0.035
 destination+0.032
riage+0.031
tons+0.031
ACK+0.030
Neg logit contributions
 twins-0.018
 who-0.017
sm-0.016
 living-0.016
 of-0.016
Token not
Token ID407
Feature activation-1.62e+00

Pos logit contributions
OM+0.100
riage+0.090
ACK+0.090
umm+0.090
 slope+0.088
Neg logit contributions
 twins-0.046
 who-0.045
 of-0.044
 many-0.043
 love-0.042
Token give
Token ID1577
Feature activation+4.06e-01

Pos logit contributions
OM+0.046
 destination+0.042
riage+0.040
tons+0.040
 surely+0.040
Neg logit contributions
 twins-0.034
sm-0.032
 living-0.031
ons-0.031
 who-0.030
Token up
Token ID510
Feature activation+1.23e+00

Pos logit contributions
 twins+0.010
sm+0.009
 living+0.009
 who+0.009
 of+0.009
Neg logit contributions
OM-0.010
 destination-0.009
riage-0.009
tons-0.009
 coaster-0.009