TOP ACTIVATIONS
MAX = 8.749

His sister cried , " Tim my , you
birds again ," mom said , smiling and kissing Anna and
L ily sob bed , " Ben , why .
that hammer ?" Mom asked , picking up Ben and checking
drop it !" Tom yelled , following Sara .
happened here ?" Mom asks , as she looks at Ben
the sink ?" Mom asked , lifting him out . She
, okay ?" mom says , hugging Anna .
a mess !" Mom said , seeing the blocks and the
's ruined !" Mom said , showing them the cake .
ie ?" her mom asked , coming into the room .
Mom saw Sara , Ben , and the ostr
, run !" Sam shouted , pulling L ila along .
digging holes !" Mom said , stern ly .
His sister yelled , " Tim my , stop
it together ." Dad said , smiling . "
hurt !" Mom my said , rubbing her eye . "
't know ," mom said , hugging him and Lily .
you off ," mom says , taking Lily in her arms
friends ." their parents say , taking their hands and leading

MIN ACTIVATIONS
MIN = -8.149

flashlight , and a magn ifying glass . As
When Daisy put the magn ifying glass down , the cater
, handing him the magn ifying glass . She looked around
his mum opened the l <|endoftext|> Dave and Amy were out
examine it with the magn ifying glass ?" "
her own salt perfectly ! <|endoftext|> Once upon a time ,
a flashlight and a magn ifying glass to examine their finds
borrowed her mom 's magn ifying glass to take a closer
made of dark yarn . <|endoftext|> Once there was a boy
full . The end ! <|endoftext|> Once there was a happy
She takes out a magn ifying glass . She looks at
love ." <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> Anna and Ben are playing
felt very happy too ! <|endoftext|> Once upon a time ,
could help make dinner . <|endoftext|> Once upon a time there
was in good hands . <|endoftext|> Once upon a time there
the necklace and the magn ifying glass and started to examine
was not lonely anymore . <|endoftext|> Once upon a time ,
her feel extra special . <|endoftext|> Once upon a time ,
The End . <|endoftext|> Todd was so excited !
The End . <|endoftext|> Once upon a time there

INTERVAL 4.524 - 8.749
CONTAINS 2.791%

Her mom smiled and said , " See , Lily ?
looked at her with big , brown eyes and said ,
to playing in the sand , pushing her toy car around
day , her mom said , " S ara , it
from side to side . Then , he said to kick

INTERVAL 0.300 - 4.524
CONTAINS 40.496%

sand . The camel was frustrated ! Then ,
back together . " You are very welcome
a voice spoke . It was not the flower , but
together . The fish always looked so pretty and spicy !
getting very dark . " Oh no ," murm

INTERVAL -3.925 - 0.300
CONTAINS 53.958%

was an old man . He had a special mask that
day , it figured out how to zip !
But they are lost . They do not see the trail
of the big kids says . He tries to
very pretty and y ummy . They wanted to try them

INTERVAL -8.149 - -3.925
CONTAINS 2.754%

L ily wanted to help the pot feel better
. <|endoftext|> W illy loved to whistle . He went outside
in a shy voice , " I 'm just so excited
so excited and tried to pick it , but the butterfly
fun . The end . <|endoftext|> Once upon a time ,
Token
Token ID198
Feature activation+3.32e+00

Pos logit contributions
|+0.030
 meets+0.027
 play+0.027
 meet+0.026
 plays+0.026
Neg logit contributions
 impressed-0.049
aging-0.048
broken-0.048
ower-0.047
outing-0.047
Token
Token ID198
Feature activation+3.18e+00

Pos logit contributions
 impressed+0.089
aging+0.088
broken+0.087
ower+0.087
outing+0.087
Neg logit contributions
|-0.053
 play-0.047
 meets-0.047
 meet-0.046
 plays-0.044
TokenHis
Token ID6653
Feature activation+1.03e+00

Pos logit contributions
 impressed+0.083
aging+0.082
 mesmer+0.081
outing+0.081
ower+0.081
Neg logit contributions
|-0.052
 play-0.048
 meet-0.045
 borrow-0.045
 meets-0.044
Token sister
Token ID6621
Feature activation+1.17e+00

Pos logit contributions
 impressed+0.023
aging+0.022
broken+0.022
ower+0.021
 pleased+0.021
Neg logit contributions
|-0.022
 meets-0.020
 meet-0.020
 play-0.019
 plays-0.019
Token cried
Token ID16896
Feature activation+2.92e+00

Pos logit contributions
 impressed+0.024
aging+0.024
ower+0.024
outing+0.023
broken+0.023
Neg logit contributions
|-0.025
 play-0.023
 meets-0.023
 plays-0.023
 meet-0.023
Token,
Token ID11
Feature activation+8.75e+00

Pos logit contributions
 impressed+0.072
 mesmer+0.067
 pleased+0.067
ower+0.067
outing+0.065
Neg logit contributions
 play-0.059
 plays-0.053
|-0.053
 borrow-0.051
 him-0.050
Token "
Token ID366
Feature activation-2.82e+00

Pos logit contributions
 impressed+0.189
broken+0.187
outing+0.187
ower+0.183
uttering+0.181
Neg logit contributions
 play-0.173
 meet-0.161
|-0.160
 plays-0.158
 have-0.154
TokenTim
Token ID14967
Feature activation+2.12e+00

Pos logit contributions
|+0.058
 meets+0.052
 play+0.052
 meet+0.051
 plays+0.050
Neg logit contributions
 impressed-0.064
aging-0.061
 pleased-0.061
ower-0.060
outing-0.060
Tokenmy
Token ID1820
Feature activation+2.55e+00

Pos logit contributions
 impressed+0.067
aging+0.066
ower+0.065
broken+0.065
outing+0.065
Neg logit contributions
|-0.023
 meets-0.020
 play-0.020
 meet-0.019
 plays-0.019
Token,
Token ID11
Feature activation+6.05e+00

Pos logit contributions
 impressed+0.040
aging+0.039
broken+0.038
ower+0.038
outing+0.037
Neg logit contributions
|-0.070
 meets-0.065
 meet-0.063
 play-0.063
 plays-0.062
Token you
Token ID345
Feature activation+2.36e-01

Pos logit contributions
 impressed+0.130
aging+0.127
ower+0.124
 pleased+0.124
outing+0.123
Neg logit contributions
|-0.129
 meets-0.121
 plays-0.111
 meet-0.108
 play-0.106
Token birds
Token ID10087
Feature activation+1.33e+00

Pos logit contributions
|+0.014
 meets+0.013
 borrow+0.013
 plays+0.012
 fits+0.012
Neg logit contributions
 impressed-0.012
 sudden-0.011
ower-0.011
 pleased-0.011
broken-0.011
Token again
Token ID757
Feature activation+1.60e+00

Pos logit contributions
 impressed+0.024
broken+0.023
 pleased+0.022
aging+0.022
ower+0.022
Neg logit contributions
|-0.033
 meets-0.030
Ready-0.029
 borrow-0.029
 plays-0.029
Token,"
Token ID553
Feature activation+3.03e+00

Pos logit contributions
 impressed+0.041
aging+0.040
ower+0.039
broken+0.039
outing+0.039
Neg logit contributions
|-0.028
 meets-0.025
 play-0.025
 meet-0.024
 plays-0.024
Token mom
Token ID1995
Feature activation+2.87e+00

Pos logit contributions
 impressed+0.063
aging+0.060
 pleased+0.060
outing+0.059
broken+0.059
Neg logit contributions
|-0.068
 meets-0.062
 play-0.060
 meet-0.060
 plays-0.060
Token said
Token ID531
Feature activation+5.77e+00

Pos logit contributions
 impressed+0.050
ower+0.050
aging+0.050
broken+0.049
 pleased+0.048
Neg logit contributions
|-0.071
 meets-0.067
 play-0.067
 plays-0.066
 meet-0.066
Token,
Token ID11
Feature activation+8.63e+00

Pos logit contributions
 impressed+0.120
ower+0.117
aging+0.114
 mesmer+0.113
outing+0.113
Neg logit contributions
 play-0.117
|-0.115
 borrow-0.112
 plays-0.112
 meet-0.109
Token smiling
Token ID16755
Feature activation+3.37e+00

Pos logit contributions
ower+0.178
outing+0.172
aging+0.168
oles+0.167
iring+0.166
Neg logit contributions
 play-0.191
 meet-0.178
|-0.173
 plays-0.172
 borrow-0.170
Token and
Token ID290
Feature activation+3.97e+00

Pos logit contributions
 impressed+0.075
ower+0.074
aging+0.072
 sudden+0.072
outing+0.072
Neg logit contributions
 play-0.065
 plays-0.062
|-0.059
 meets-0.058
 borrow-0.057
Token kissing
Token ID25847
Feature activation+1.86e+00

Pos logit contributions
 impressed+0.073
ower+0.072
aging+0.072
outing+0.071
broken+0.071
Neg logit contributions
|-0.094
 play-0.090
 meets-0.089
 plays-0.088
 meet-0.087
Token Anna
Token ID11735
Feature activation+1.17e+00

Pos logit contributions
 impressed+0.041
aging+0.041
ower+0.040
broken+0.040
outing+0.040
Neg logit contributions
|-0.038
 meets-0.035
 play-0.034
 meet-0.034
 plays-0.033
Token and
Token ID290
Feature activation+3.63e+00

Pos logit contributions
 impressed+0.026
broken+0.025
aging+0.025
 pleased+0.025
outing+0.024
Neg logit contributions
|-0.023
 meets-0.022
 meet-0.021
Ready-0.020
 play-0.020
Token
Token ID198
Feature activation+3.77e+00

Pos logit contributions
aging+0.107
 impressed+0.107
ower+0.107
outing+0.106
broken+0.106
Neg logit contributions
|-0.064
 play-0.058
 meets-0.058
 meet-0.055
 plays-0.055
TokenL
Token ID43
Feature activation+2.87e+00

Pos logit contributions
ower+0.085
aging+0.085
 impressed+0.083
outing+0.081
 mesmer+0.080
Neg logit contributions
|-0.077
 play-0.072
 meets-0.071
 meet-0.068
 plays-0.068
Tokenily
Token ID813
Feature activation+9.22e-01

Pos logit contributions
 impressed+0.060
aging+0.059
broken+0.058
ower+0.058
outing+0.058
Neg logit contributions
|-0.062
 meets-0.058
 play-0.057
 meet-0.056
 plays-0.056
Token sob
Token ID27355
Feature activation+3.18e+00

Pos logit contributions
ower+0.020
 impressed+0.020
aging+0.020
uttering+0.019
 pleased+0.019
Neg logit contributions
|-0.018
 play-0.018
 meets-0.018
 plays-0.018
 meet-0.017
Tokenbed
Token ID3077
Feature activation+4.42e+00

Pos logit contributions
 impressed+0.082
 pleased+0.081
 intense+0.073
 mesmer+0.073
aver+0.071
Neg logit contributions
 play-0.052
 plays-0.052
|-0.051
 meets-0.049
 him-0.043
Token,
Token ID11
Feature activation+8.62e+00

Pos logit contributions
 impressed+0.114
 pleased+0.109
ower+0.108
 mesmer+0.107
aging+0.106
Neg logit contributions
 play-0.075
|-0.071
 plays-0.070
 meets-0.067
 meet-0.067
Token "
Token ID366
Feature activation-2.57e+00

Pos logit contributions
ower+0.185
broken+0.177
iring+0.176
outing+0.175
aging+0.173
Neg logit contributions
 play-0.178
 plays-0.167
 meet-0.165
 meets-0.165
 have-0.161
TokenBen
Token ID11696
Feature activation+2.11e+00

Pos logit contributions
|+0.044
 play+0.041
 meets+0.039
 plays+0.039
 meet+0.038
Neg logit contributions
 impressed-0.065
aging-0.063
ower-0.062
 pleased-0.062
outing-0.061
Token,
Token ID11
Feature activation+6.12e+00

Pos logit contributions
 impressed+0.027
aging+0.026
ower+0.026
broken+0.026
outing+0.025
Neg logit contributions
|-0.063
 meets-0.058
 meet-0.057
 play-0.057
 plays-0.056
Token why
Token ID1521
Feature activation-1.30e+00

Pos logit contributions
 impressed+0.158
 pleased+0.151
aging+0.151
outing+0.149
ower+0.148
Neg logit contributions
|-0.106
 meets-0.094
 plays-0.087
 borrow-0.086
 meet-0.086
Token.
Token ID13
Feature activation-2.16e+00

Pos logit contributions
|+0.025
 meets+0.023
 meet+0.022
 play+0.022
 plays+0.021
Neg logit contributions
 impressed-0.031
aging-0.030
ower-0.030
outing-0.030
 pleased-0.030
Token that
Token ID326
Feature activation-3.34e+00

Pos logit contributions
|+0.033
 meets+0.030
 play+0.029
 meet+0.029
 plays+0.029
Neg logit contributions
 impressed-0.043
aging-0.043
ower-0.042
outing-0.042
broken-0.042
Token hammer
Token ID15554
Feature activation-3.04e+00

Pos logit contributions
|+0.044
 play+0.043
 meets+0.041
 meet+0.040
 plays+0.040
Neg logit contributions
 impressed-0.095
aging-0.094
outing-0.093
broken-0.093
 pleased-0.092
Token?"
Token ID1701
Feature activation+3.51e+00

Pos logit contributions
|+0.055
 meets+0.049
 meet+0.048
 play+0.047
 plays+0.046
Neg logit contributions
 impressed-0.075
ower-0.074
aging-0.074
broken-0.073
outing-0.073
Token Mom
Token ID11254
Feature activation+2.54e+00

Pos logit contributions
broken+0.086
ower+0.084
king+0.082
aging+0.082
outing+0.081
Neg logit contributions
 play-0.066
 plays-0.060
 meets-0.060
 meet-0.060
|-0.058
Token asked
Token ID1965
Feature activation+2.77e+00

Pos logit contributions
 impressed+0.053
 pleased+0.052
ower+0.052
broken+0.051
aging+0.051
Neg logit contributions
|-0.052
 play-0.051
 plays-0.050
 meets-0.049
 meet-0.048
Token,
Token ID11
Feature activation+8.55e+00

Pos logit contributions
 impressed+0.066
ower+0.063
 pleased+0.063
broken+0.062
 mesmer+0.062
Neg logit contributions
 play-0.049
|-0.048
 borrow-0.046
 meets-0.045
 plays-0.045
Token picking
Token ID10868
Feature activation-3.38e+00

Pos logit contributions
ower+0.204
andel+0.200
agg+0.199
oles+0.198
ooked+0.192
Neg logit contributions
 play-0.164
 meet-0.138
 build-0.134
 borrow-0.132
 join-0.132
Token up
Token ID510
Feature activation+2.14e+00

Pos logit contributions
|+0.067
 meets+0.061
 meet+0.059
 play+0.058
 plays+0.057
Neg logit contributions
 impressed-0.079
aging-0.079
outing-0.077
broken-0.076
ower-0.076
Token Ben
Token ID3932
Feature activation+6.07e-01

Pos logit contributions
 impressed+0.050
aging+0.049
ower+0.049
broken+0.049
outing+0.048
Neg logit contributions
|-0.041
 meets-0.038
 play-0.037
 meet-0.036
 plays-0.036
Token and
Token ID290
Feature activation+3.79e+00

Pos logit contributions
 impressed+0.011
aging+0.011
broken+0.011
ower+0.011
outing+0.011
Neg logit contributions
|-0.015
 meets-0.014
 meet-0.013
 play-0.013
 plays-0.013
Token checking
Token ID10627
Feature activation-1.45e+00

Pos logit contributions
 impressed+0.093
aging+0.091
ower+0.090
broken+0.090
outing+0.089
Neg logit contributions
|-0.069
 meets-0.063
 play-0.062
 meet-0.061
 plays-0.061
Token drop
Token ID4268
Feature activation+6.63e-01

Pos logit contributions
 impressed+0.142
aging+0.139
ower+0.136
outing+0.136
broken+0.136
Neg logit contributions
|-0.099
 meets-0.090
 plays-0.085
 play-0.085
 meet-0.085
Token it
Token ID340
Feature activation-2.04e+00

Pos logit contributions
aging+0.015
 impressed+0.015
broken+0.015
outing+0.015
ower+0.015
Neg logit contributions
|-0.013
 meets-0.012
 meet-0.011
 play-0.011
 plays-0.011
Token!"
Token ID2474
Feature activation+2.64e+00

Pos logit contributions
|+0.042
 meets+0.039
 play+0.038
 meet+0.038
 plays+0.037
Neg logit contributions
 impressed-0.044
aging-0.044
broken-0.043
ower-0.043
outing-0.043
Token Tom
Token ID4186
Feature activation+1.95e+00

Pos logit contributions
 impressed+0.074
broken+0.073
ower+0.072
outing+0.071
 pleased+0.071
Neg logit contributions
|-0.039
 play-0.038
 meets-0.036
 plays-0.035
 meet-0.034
Token yelled
Token ID22760
Feature activation+3.34e+00

Pos logit contributions
uttering+0.049
ower+0.047
 impressed+0.046
 pleased+0.046
andel+0.044
Neg logit contributions
 play-0.036
|-0.034
 plays-0.033
 hide-0.032
 meets-0.031
Token,
Token ID11
Feature activation+8.52e+00

Pos logit contributions
 impressed+0.072
outing+0.070
ower+0.070
 pleased+0.068
broken+0.067
Neg logit contributions
 play-0.068
|-0.065
 plays-0.065
 meets-0.063
 borrow-0.062
Token following
Token ID1708
Feature activation+3.04e-02

Pos logit contributions
agg+0.200
ment+0.199
ower+0.196
iring+0.191
ised+0.189
Neg logit contributions
 play-0.170
 meet-0.146
 plays-0.143
 hide-0.139
 join-0.139
Token Sara
Token ID24799
Feature activation+1.50e+00

Pos logit contributions
 impressed+0.001
aging+0.001
ower+0.001
broken+0.001
outing+0.001
Neg logit contributions
|-0.000
 meets-0.000
 play-0.000
 meet-0.000
 plays-0.000
Token.
Token ID13
Feature activation-7.97e-01

Pos logit contributions
 impressed+0.030
aging+0.030
broken+0.029
ower+0.029
outing+0.029
Neg logit contributions
|-0.034
 meets-0.032
 play-0.030
 meet-0.030
 plays-0.030
Token
Token ID198
Feature activation+3.12e+00

Pos logit contributions
|+0.011
 play+0.010
 meets+0.010
 plays+0.010
 meet+0.010
Neg logit contributions
 impressed-0.023
broken-0.022
ower-0.022
aging-0.022
 pleased-0.022
Token
Token ID198
Feature activation+2.81e+00

Pos logit contributions
 impressed+0.087
aging+0.086
broken+0.086
ower+0.085
outing+0.085
Neg logit contributions
|-0.047
 play-0.042
 meets-0.042
 meet-0.040
 plays-0.040
Token happened
Token ID3022
Feature activation+4.00e-01

Pos logit contributions
|+0.068
 meets+0.061
 meet+0.059
 play+0.058
 plays+0.057
Neg logit contributions
 impressed-0.075
 pleased-0.069
ower-0.067
aging-0.067
outing-0.066
Token here
Token ID994
Feature activation-8.07e-01

Pos logit contributions
aging+0.008
outing+0.008
broken+0.008
 impressed+0.008
ower+0.008
Neg logit contributions
 play-0.009
|-0.009
 meets-0.008
 borrow-0.008
 plays-0.008
Token?"
Token ID1701
Feature activation+3.78e+00

Pos logit contributions
|+0.015
 meets+0.014
 play+0.014
 meet+0.014
 plays+0.013
Neg logit contributions
 impressed-0.019
aging-0.019
ower-0.018
broken-0.018
outing-0.018
Token Mom
Token ID11254
Feature activation+2.74e+00

Pos logit contributions
broken+0.093
ower+0.093
aging+0.091
king+0.091
outing+0.090
Neg logit contributions
 play-0.068
 plays-0.063
 meets-0.063
 meet-0.062
|-0.062
Token asks
Token ID7893
Feature activation+2.87e+00

Pos logit contributions
 impressed+0.070
outing+0.069
aging+0.068
ower+0.066
broken+0.065
Neg logit contributions
|-0.049
 meets-0.041
 meet-0.041
 play-0.040
 borrow-0.039
Token,
Token ID11
Feature activation+8.51e+00

Pos logit contributions
 impressed+0.061
ower+0.059
 pleased+0.058
 mesmer+0.058
broken+0.058
Neg logit contributions
 play-0.058
|-0.056
 borrow-0.055
 plays-0.055
 meets-0.054
Token as
Token ID355
Feature activation+4.88e+00

Pos logit contributions
ower+0.205
andel+0.201
oles+0.200
agg+0.199
ooked+0.194
Neg logit contributions
 play-0.158
 have-0.130
 build-0.130
 borrow-0.128
 hide-0.128
Token she
Token ID673
Feature activation-1.12e-01

Pos logit contributions
ower+0.115
aging+0.115
 impressed+0.115
broken+0.114
outing+0.113
Neg logit contributions
 play-0.086
|-0.086
 plays-0.082
 meets-0.080
 borrow-0.079
Token looks
Token ID3073
Feature activation+1.76e+00

Pos logit contributions
|+0.002
 meets+0.002
 play+0.001
 meet+0.001
 borrow+0.001
Neg logit contributions
 impressed-0.003
aging-0.003
outing-0.003
broken-0.003
ower-0.003
Token at
Token ID379
Feature activation-9.13e-01

Pos logit contributions
ower+0.035
broken+0.034
 impressed+0.034
outing+0.033
aging+0.033
Neg logit contributions
 play-0.039
 meets-0.038
|-0.038
 plays-0.038
 meet-0.036
Token Ben
Token ID3932
Feature activation+3.87e-01

Pos logit contributions
|+0.017
 meets+0.016
 meet+0.015
 play+0.015
 plays+0.015
Neg logit contributions
 impressed-0.022
aging-0.021
outing-0.021
broken-0.021
ower-0.021
Token the
Token ID262
Feature activation-2.58e+00

Pos logit contributions
|+0.058
 meets+0.054
 play+0.051
 meet+0.051
 plays+0.051
Neg logit contributions
 impressed-0.068
aging-0.067
ower-0.065
broken-0.065
outing-0.064
Token sink
Token ID14595
Feature activation-1.54e+00

Pos logit contributions
|+0.047
 meets+0.039
 borrow+0.038
 meet+0.038
 plays+0.037
Neg logit contributions
aging-0.064
ower-0.064
 impressed-0.062
outing-0.062
 sudden-0.062
Token?"
Token ID1701
Feature activation+3.39e+00

Pos logit contributions
|+0.025
 meets+0.023
 play+0.023
 meet+0.022
 plays+0.022
Neg logit contributions
 impressed-0.040
aging-0.039
ower-0.039
broken-0.039
outing-0.039
Token Mom
Token ID11254
Feature activation+2.66e+00

Pos logit contributions
ower+0.087
broken+0.086
 impressed+0.085
aging+0.085
outing+0.085
Neg logit contributions
 play-0.055
|-0.055
 meets-0.053
 plays-0.052
 meet-0.052
Token asked
Token ID1965
Feature activation+2.67e+00

Pos logit contributions
ower+0.061
 impressed+0.061
broken+0.059
aging+0.059
 pleased+0.059
Neg logit contributions
|-0.049
 play-0.048
 meets-0.046
 plays-0.046
 meet-0.045
Token,
Token ID11
Feature activation+8.50e+00

Pos logit contributions
 impressed+0.063
ower+0.061
aging+0.061
broken+0.060
outing+0.060
Neg logit contributions
|-0.050
 play-0.047
 meets-0.046
 plays-0.045
 meet-0.045
Token lifting
Token ID16842
Feature activation-4.26e-01

Pos logit contributions
ower+0.203
oles+0.192
andel+0.187
agg+0.187
ooked+0.183
Neg logit contributions
 play-0.165
 meet-0.143
 borrow-0.141
 have-0.138
 join-0.138
Token him
Token ID683
Feature activation+2.30e-01

Pos logit contributions
|+0.008
 meets+0.008
 meet+0.007
 play+0.007
 plays+0.007
Neg logit contributions
aging-0.010
 impressed-0.010
outing-0.010
broken-0.010
ower-0.009
Token out
Token ID503
Feature activation+3.71e+00

Pos logit contributions
 impressed+0.006
aging+0.006
ower+0.006
broken+0.006
outing+0.006
Neg logit contributions
|-0.004
 play-0.004
 meets-0.004
 meet-0.003
 plays-0.003
Token.
Token ID13
Feature activation-5.82e-01

Pos logit contributions
ower+0.101
 impressed+0.099
aging+0.095
outing+0.095
 pleased+0.095
Neg logit contributions
|-0.056
 play-0.056
 plays-0.052
 borrow-0.051
 meet-0.048
Token She
Token ID1375
Feature activation+3.04e-01

Pos logit contributions
 play+0.007
 meets+0.007
|+0.007
 meet+0.006
 plays+0.006
Neg logit contributions
broken-0.017
aging-0.017
 impressed-0.017
ower-0.017
outing-0.017
Token,
Token ID11
Feature activation+5.95e+00

Pos logit contributions
 impressed+0.016
aging+0.015
ower+0.015
broken+0.015
outing+0.015
Neg logit contributions
|-0.017
 meets-0.016
 play-0.015
 meet-0.015
 plays-0.015
Token okay
Token ID8788
Feature activation+2.77e+00

Pos logit contributions
 impressed+0.107
aging+0.103
ower+0.102
broken+0.101
outing+0.101
Neg logit contributions
|-0.147
 meets-0.137
 play-0.135
 meet-0.133
 plays-0.133
Token?"
Token ID1701
Feature activation+3.25e+00

Pos logit contributions
 impressed+0.068
aging+0.067
ower+0.066
broken+0.066
outing+0.066
Neg logit contributions
|-0.050
 meets-0.046
 play-0.044
 meet-0.044
 plays-0.043
Token mom
Token ID1995
Feature activation+2.69e+00

Pos logit contributions
 impressed+0.064
aging+0.062
ower+0.061
broken+0.061
outing+0.060
Neg logit contributions
|-0.075
 meets-0.070
 play-0.069
 meet-0.068
 plays-0.068
Token says
Token ID1139
Feature activation+5.62e+00

Pos logit contributions
 impressed+0.064
outing+0.062
aging+0.061
broken+0.060
 pleased+0.060
Neg logit contributions
|-0.052
 meets-0.046
 play-0.046
 meet-0.045
 plays-0.044
Token,
Token ID11
Feature activation+8.49e+00

Pos logit contributions
 impressed+0.118
ower+0.115
 mesmer+0.112
broken+0.111
aging+0.111
Neg logit contributions
 play-0.114
|-0.109
 borrow-0.109
 plays-0.108
 meet-0.104
Token hugging
Token ID46292
Feature activation+4.37e-01

Pos logit contributions
ower+0.186
oles+0.179
agg+0.172
iring+0.171
ooked+0.170
Neg logit contributions
 play-0.176
 meet-0.158
 borrow-0.155
 plays-0.152
 join-0.149
Token Anna
Token ID11735
Feature activation+2.04e+00

Pos logit contributions
outing+0.010
aging+0.010
 impressed+0.010
broken+0.010
 knowing+0.009
Neg logit contributions
|-0.008
 meets-0.007
 meet-0.007
 borrow-0.007
 fits-0.007
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
 impressed+0.041
aging+0.040
broken+0.040
outing+0.039
ower+0.039
Neg logit contributions
|-0.045
 meets-0.042
 play-0.041
 meet-0.041
 plays-0.041
Token
Token ID198
Feature activation+3.48e+00

Pos logit contributions
|+0.008
 meets+0.007
 play+0.007
 meet+0.007
 plays+0.007
Neg logit contributions
 impressed-0.010
aging-0.010
ower-0.010
broken-0.010
outing-0.010
Token
Token ID198
Feature activation+3.26e+00

Pos logit contributions
 impressed+0.085
aging+0.082
ower+0.081
broken+0.081
outing+0.081
Neg logit contributions
|-0.063
 meets-0.058
 play-0.057
 meet-0.056
 plays-0.056
Token a
Token ID257
Feature activation+4.17e-02

Pos logit contributions
|+0.035
 meets+0.030
Ready+0.030
uggle+0.030
 meet+0.030
Neg logit contributions
 impressed-0.031
aging-0.029
 pleased-0.028
outing-0.028
ower-0.028
Token mess
Token ID2085
Feature activation-5.06e-01

Pos logit contributions
 impressed+0.001
 pleased+0.001
 sudden+0.001
ower+0.001
broken+0.001
Neg logit contributions
|-0.001
 meets-0.001
 meet-0.001
uggle-0.001
 plays-0.001
Token!"
Token ID2474
Feature activation+3.10e+00

Pos logit contributions
|+0.008
 meets+0.007
 meet+0.007
 play+0.006
 borrow+0.006
Neg logit contributions
ower-0.014
 impressed-0.014
outing-0.013
aging-0.013
broken-0.013
Token Mom
Token ID11254
Feature activation+2.78e+00

Pos logit contributions
broken+0.085
ower+0.085
aging+0.084
 impressed+0.084
outing+0.083
Neg logit contributions
 play-0.045
|-0.045
 meets-0.042
 meet-0.041
 plays-0.041
Token said
Token ID531
Feature activation+4.68e+00

Pos logit contributions
 impressed+0.057
ower+0.057
 pleased+0.055
aging+0.055
 mesmer+0.054
Neg logit contributions
|-0.058
 play-0.058
 plays-0.055
 borrow-0.054
 meets-0.054
Token,
Token ID11
Feature activation+8.46e+00

Pos logit contributions
 impressed+0.105
ower+0.102
 mesmer+0.102
outing+0.101
 pleased+0.099
Neg logit contributions
 play-0.085
 borrow-0.080
|-0.078
 plays-0.076
 join-0.074
Token seeing
Token ID4379
Feature activation-3.62e-01

Pos logit contributions
ower+0.194
agg+0.186
oles+0.182
iring+0.180
ment+0.180
Neg logit contributions
 play-0.164
 meet-0.144
 borrow-0.140
 hide-0.138
 have-0.138
Token the
Token ID262
Feature activation+7.49e-01

Pos logit contributions
|+0.007
 meets+0.006
 play+0.006
 meet+0.006
 plays+0.006
Neg logit contributions
 impressed-0.009
aging-0.008
outing-0.008
broken-0.008
ower-0.008
Token blocks
Token ID7021
Feature activation+9.11e-01

Pos logit contributions
 impressed+0.016
broken+0.015
 pleased+0.014
ower+0.014
outing+0.014
Neg logit contributions
|-0.017
 meets-0.016
 meet-0.015
 plays-0.015
 borrow-0.015
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 impressed+0.018
aging+0.018
broken+0.018
outing+0.017
 pleased+0.017
Neg logit contributions
|-0.021
 meets-0.019
 meet-0.019
 plays-0.018
 play-0.018
Token the
Token ID262
Feature activation+1.56e-01

Pos logit contributions
 impressed+0.054
ower+0.054
aging+0.054
broken+0.053
outing+0.053
Neg logit contributions
|-0.044
 meets-0.041
 play-0.041
 plays-0.040
 meet-0.039
Token's
Token ID338
Feature activation-2.59e-01

Pos logit contributions
|+0.053
 recess+0.052
uggle+0.051
 borrow+0.050
 meet+0.049
Neg logit contributions
broken-0.056
 impressed-0.056
outing-0.055
aging-0.055
 pleased-0.054
Token ruined
Token ID20484
Feature activation-5.53e-02

Pos logit contributions
|+0.006
 play+0.005
 meets+0.005
 meet+0.005
 plays+0.005
Neg logit contributions
 impressed-0.005
aging-0.005
ower-0.005
broken-0.005
outing-0.005
Token!"
Token ID2474
Feature activation+2.39e+00

Pos logit contributions
|+0.001
 meets+0.001
 meet+0.001
 play+0.001
 plays+0.001
Neg logit contributions
 impressed-0.001
ower-0.001
aging-0.001
broken-0.001
outing-0.001
Token Mom
Token ID11254
Feature activation+2.73e+00

Pos logit contributions
 impressed+0.048
broken+0.047
aging+0.047
ower+0.047
outing+0.046
Neg logit contributions
|-0.053
 play-0.051
 meets-0.050
 plays-0.049
 meet-0.049
Token said
Token ID531
Feature activation+5.06e+00

Pos logit contributions
ower+0.057
aging+0.054
 impressed+0.054
uttering+0.054
broken+0.053
Neg logit contributions
|-0.058
 play-0.057
 plays-0.055
 meets-0.054
 meet-0.054
Token,
Token ID11
Feature activation+8.39e+00

Pos logit contributions
 impressed+0.110
ower+0.106
outing+0.105
 mesmer+0.103
 sudden+0.102
Neg logit contributions
 play-0.098
|-0.093
 borrow-0.093
 plays-0.090
 meet-0.087
Token showing
Token ID4478
Feature activation-4.52e-01

Pos logit contributions
ower+0.184
iring+0.176
outing+0.175
oles+0.171
ised+0.169
Neg logit contributions
 play-0.170
 meet-0.154
 borrow-0.149
|-0.149
 plays-0.145
Token them
Token ID606
Feature activation+8.62e-01

Pos logit contributions
|+0.010
 meets+0.009
 meet+0.009
 fits+0.008
 hunt+0.008
Neg logit contributions
aging-0.009
broken-0.009
 impressed-0.009
outing-0.009
 pleased-0.009
Token the
Token ID262
Feature activation-4.39e-01

Pos logit contributions
aging+0.017
 impressed+0.017
outing+0.016
broken+0.016
 pleased+0.016
Neg logit contributions
|-0.020
 meets-0.019
 meet-0.017
 fits-0.017
Ready-0.017
Token cake
Token ID12187
Feature activation+2.29e-01

Pos logit contributions
|+0.009
 meets+0.008
 plays+0.008
 play+0.008
 meet+0.008
Neg logit contributions
 impressed-0.010
aging-0.010
ower-0.010
 pleased-0.009
broken-0.009
Token.
Token ID13
Feature activation-8.56e-01

Pos logit contributions
 impressed+0.005
aging+0.005
ower+0.004
broken+0.004
outing+0.004
Neg logit contributions
|-0.005
 meets-0.005
 play-0.005
 meet-0.005
 plays-0.005
Tokenie
Token ID494
Feature activation+2.29e+00

Pos logit contributions
 impressed+0.060
aging+0.059
broken+0.057
ower+0.057
outing+0.057
Neg logit contributions
|-0.059
 play-0.055
 meets-0.055
 plays-0.054
 meet-0.053
Token?"
Token ID1701
Feature activation+3.56e+00

Pos logit contributions
 impressed+0.054
ower+0.052
broken+0.052
aging+0.051
 pleased+0.051
Neg logit contributions
|-0.044
 meets-0.039
 meet-0.037
 plays-0.037
 play-0.036
Token her
Token ID607
Feature activation+2.65e+00

Pos logit contributions
 impressed+0.092
aging+0.089
 pleased+0.088
ower+0.087
broken+0.087
Neg logit contributions
|-0.061
 meets-0.055
 meet-0.052
 play-0.052
 plays-0.052
Token mom
Token ID1995
Feature activation+2.80e+00

Pos logit contributions
 impressed+0.059
 pleased+0.058
outing+0.058
ower+0.056
izing+0.056
Neg logit contributions
 play-0.049
 meets-0.043
 plays-0.043
 doll-0.043
 borrow-0.041
Token asked
Token ID1965
Feature activation+2.53e+00

Pos logit contributions
 impressed+0.068
ower+0.068
 pleased+0.067
broken+0.066
aging+0.064
Neg logit contributions
 play-0.048
|-0.046
 plays-0.044
 borrow-0.044
 meets-0.044
Token,
Token ID11
Feature activation+8.38e+00

Pos logit contributions
 impressed+0.061
ower+0.059
aging+0.059
 pleased+0.058
outing+0.058
Neg logit contributions
|-0.046
 play-0.044
 meets-0.042
 plays-0.042
 meet-0.041
Token coming
Token ID2406
Feature activation+1.11e+00

Pos logit contributions
ower+0.195
oles+0.188
agg+0.185
andel+0.180
ment+0.179
Neg logit contributions
 play-0.169
 meet-0.145
 borrow-0.142
 plays-0.139
|-0.138
Token into
Token ID656
Feature activation+9.05e-01

Pos logit contributions
 impressed+0.024
aging+0.024
broken+0.023
ower+0.023
outing+0.023
Neg logit contributions
|-0.023
 meets-0.022
 play-0.021
 plays-0.021
 meet-0.021
Token the
Token ID262
Feature activation-8.95e-01

Pos logit contributions
 impressed+0.020
aging+0.020
broken+0.020
 pleased+0.019
outing+0.019
Neg logit contributions
|-0.018
 meets-0.017
 plays-0.016
 meet-0.016
 play-0.016
Token room
Token ID2119
Feature activation+6.91e-02

Pos logit contributions
|+0.019
 meets+0.016
 meet+0.016
uggle+0.016
 borrow+0.015
Neg logit contributions
 sudden-0.019
 impressed-0.019
broken-0.018
 pleased-0.018
aging-0.018
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
 impressed+0.002
ower+0.002
aging+0.002
 pleased+0.002
broken+0.002
Neg logit contributions
 play-0.001
|-0.001
 meets-0.001
 plays-0.001
 meet-0.001
Token
Token ID198
Feature activation+3.70e+00

Pos logit contributions
 impressed+0.015
aging+0.015
ower+0.014
outing+0.014
broken+0.014
Neg logit contributions
|-0.011
 meets-0.010
 play-0.010
 meet-0.010
 plays-0.010
Token
Token ID198
Feature activation+3.32e+00

Pos logit contributions
aging+0.098
 impressed+0.098
broken+0.098
outing+0.098
ower+0.097
Neg logit contributions
|-0.057
 play-0.054
 meets-0.053
 meet-0.051
 plays-0.051
TokenMom
Token ID29252
Feature activation+1.80e+00

Pos logit contributions
aging+0.063
 impressed+0.062
ower+0.062
broken+0.061
outing+0.061
Neg logit contributions
|-0.076
 play-0.074
 meets-0.073
 meet-0.072
 plays-0.071
Token saw
Token ID2497
Feature activation+8.90e-01

Pos logit contributions
 impressed+0.033
aging+0.031
broken+0.031
outing+0.031
ower+0.031
Neg logit contributions
|-0.044
 meets-0.041
 play-0.041
 meet-0.040
 plays-0.040
Token Sara
Token ID24799
Feature activation+9.91e-01

Pos logit contributions
ower+0.023
broken+0.023
 impressed+0.023
aging+0.023
 pleased+0.023
Neg logit contributions
|-0.014
 meets-0.014
 plays-0.013
 play-0.013
 meet-0.013
Token,
Token ID11
Feature activation+8.37e+00

Pos logit contributions
 impressed+0.017
aging+0.017
broken+0.016
ower+0.016
outing+0.016
Neg logit contributions
|-0.025
 meets-0.023
 play-0.023
 meet-0.023
 plays-0.023
Token Ben
Token ID3932
Feature activation+1.44e+00

Pos logit contributions
ower+0.207
aging+0.194
ooked+0.182
outing+0.181
broken+0.180
Neg logit contributions
 play-0.159
|-0.147
 meet-0.146
 meets-0.144
 plays-0.143
Token,
Token ID11
Feature activation+7.69e+00

Pos logit contributions
 impressed+0.024
aging+0.023
broken+0.022
ower+0.022
outing+0.022
Neg logit contributions
|-0.038
 meets-0.036
 play-0.035
 meet-0.035
 plays-0.034
Token and
Token ID290
Feature activation+4.40e+00

Pos logit contributions
ower+0.168
aging+0.156
outing+0.152
broken+0.148
ised+0.146
Neg logit contributions
 play-0.165
|-0.157
 meet-0.154
 plays-0.152
 meets-0.152
Token the
Token ID262
Feature activation+2.94e-01

Pos logit contributions
ower+0.099
 impressed+0.098
aging+0.096
broken+0.096
outing+0.095
Neg logit contributions
|-0.088
 play-0.084
 meets-0.084
 plays-0.083
 meet-0.080
Token ostr
Token ID42387
Feature activation+3.84e+00

Pos logit contributions
 impressed+0.006
broken+0.006
outing+0.006
ower+0.006
aging+0.006
Neg logit contributions
 play-0.006
 meet-0.006
|-0.006
 meets-0.006
 plays-0.006
Token,
Token ID11
Feature activation+5.41e+00

Pos logit contributions
outing+0.062
 impressed+0.062
 pleased+0.059
ower+0.058
broken+0.058
Neg logit contributions
 play-0.060
|-0.059
 meets-0.058
 meet-0.058
 borrow-0.057
Token run
Token ID1057
Feature activation+9.90e-01

Pos logit contributions
 impressed+0.142
aging+0.140
ower+0.137
 pleased+0.136
broken+0.135
Neg logit contributions
|-0.088
 meets-0.080
 plays-0.074
 play-0.073
 meet-0.072
Token!"
Token ID2474
Feature activation+2.58e+00

Pos logit contributions
 impressed+0.023
aging+0.022
ower+0.022
broken+0.022
outing+0.022
Neg logit contributions
|-0.019
 meets-0.018
 play-0.018
 meet-0.017
 plays-0.017
Token Sam
Token ID3409
Feature activation+2.67e+00

Pos logit contributions
 impressed+0.063
aging+0.061
ower+0.061
broken+0.060
outing+0.060
Neg logit contributions
|-0.047
 meets-0.044
 play-0.042
 meet-0.042
 plays-0.042
Token shouted
Token ID17293
Feature activation+3.52e+00

Pos logit contributions
ower+0.059
uttering+0.059
outing+0.057
 impressed+0.056
 pleased+0.056
Neg logit contributions
 play-0.054
|-0.053
 plays-0.051
 meets-0.050
 meet-0.050
Token,
Token ID11
Feature activation+8.37e+00

Pos logit contributions
 impressed+0.066
ower+0.063
outing+0.063
aging+0.062
broken+0.062
Neg logit contributions
|-0.083
 play-0.080
 meets-0.078
 plays-0.077
 meet-0.077
Token pulling
Token ID10427
Feature activation+1.87e-01

Pos logit contributions
ower+0.196
ment+0.193
outing+0.191
agg+0.189
iring+0.189
Neg logit contributions
 play-0.161
 meet-0.147
 plays-0.138
 join-0.137
 hide-0.136
Token L
Token ID406
Feature activation+3.70e+00

Pos logit contributions
 impressed+0.004
broken+0.003
aging+0.003
ower+0.003
outing+0.003
Neg logit contributions
|-0.004
 play-0.004
 meets-0.004
 plays-0.004
 meet-0.004
Tokenila
Token ID10102
Feature activation+6.96e-01

Pos logit contributions
 impressed+0.047
ower+0.047
outing+0.046
aging+0.046
broken+0.044
Neg logit contributions
|-0.109
 meets-0.104
 play-0.104
 meet-0.103
 plays-0.102
Token along
Token ID1863
Feature activation+2.29e+00

Pos logit contributions
 impressed+0.014
aging+0.014
broken+0.014
ower+0.014
outing+0.013
Neg logit contributions
|-0.015
 meets-0.014
 play-0.014
 plays-0.014
 meet-0.014
Token.
Token ID13
Feature activation-9.46e-01

Pos logit contributions
 impressed+0.052
ower+0.051
aging+0.051
outing+0.051
broken+0.050
Neg logit contributions
|-0.045
 play-0.043
 meets-0.041
 meet-0.041
 plays-0.041
Token digging
Token ID18894
Feature activation-1.87e+00

Pos logit contributions
 impressed+0.057
aging+0.055
ower+0.055
broken+0.054
outing+0.054
Neg logit contributions
|-0.067
 meets-0.062
 play-0.061
 meet-0.060
 plays-0.060
Token holes
Token ID10421
Feature activation+1.78e+00

Pos logit contributions
|+0.044
 meets+0.040
 meet+0.040
 play+0.039
 plays+0.039
Neg logit contributions
 impressed-0.036
aging-0.035
ower-0.035
broken-0.035
 pleased-0.034
Token!"
Token ID2474
Feature activation+1.82e+00

Pos logit contributions
 impressed+0.030
aging+0.028
ower+0.028
outing+0.028
broken+0.028
Neg logit contributions
|-0.046
 meets-0.043
 meet-0.042
 play-0.042
 plays-0.041
Token Mom
Token ID11254
Feature activation+2.50e+00

Pos logit contributions
 impressed+0.045
aging+0.044
ower+0.043
broken+0.043
outing+0.043
Neg logit contributions
|-0.033
 meets-0.030
 play-0.029
 meet-0.029
 plays-0.029
Token said
Token ID531
Feature activation+4.82e+00

Pos logit contributions
 impressed+0.052
aging+0.050
ower+0.050
broken+0.050
outing+0.049
Neg logit contributions
|-0.055
 meets-0.051
 play-0.051
 plays-0.050
 meet-0.050
Token,
Token ID11
Feature activation+8.36e+00

Pos logit contributions
 impressed+0.115
ower+0.113
aging+0.113
outing+0.111
broken+0.111
Neg logit contributions
 play-0.080
|-0.080
 borrow-0.077
 plays-0.076
 meets-0.074
Token stern
Token ID26370
Feature activation+4.18e+00

Pos logit contributions
ower+0.179
outing+0.167
aging+0.166
oles+0.163
iring+0.163
Neg logit contributions
 play-0.179
 meet-0.166
|-0.162
 plays-0.161
 borrow-0.160
Tokenly
Token ID306
Feature activation+4.82e+00

Pos logit contributions
 impressed+0.078
 pleased+0.071
 mesmer+0.070
 sudden+0.069
aging+0.068
Neg logit contributions
 play-0.100
|-0.100
 plays-0.098
 meets-0.095
 meet-0.095
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 impressed+0.115
ower+0.115
 mesmer+0.112
iring+0.112
 pleased+0.111
Neg logit contributions
 play-0.081
 plays-0.077
 borrow-0.075
 meet-0.072
 meets-0.072
Token
Token ID198
Feature activation+2.99e+00

Pos logit contributions
|+0.011
 meets+0.010
 play+0.010
 meet+0.009
 plays+0.009
Neg logit contributions
 impressed-0.015
aging-0.015
ower-0.015
broken-0.015
outing-0.015
Token
Token ID198
Feature activation+2.76e+00

Pos logit contributions
 impressed+0.074
aging+0.072
ower+0.071
broken+0.071
outing+0.071
Neg logit contributions
|-0.053
 meets-0.049
 play-0.048
 meet-0.047
 plays-0.047
Token
Token ID198
Feature activation+2.94e+00

Pos logit contributions
|+0.039
 meets+0.037
 meet+0.035
 plays+0.035
 play+0.034
Neg logit contributions
 impressed-0.044
ower-0.043
aging-0.043
outing-0.042
 pleased-0.042
Token
Token ID198
Feature activation+2.91e+00

Pos logit contributions
 impressed+0.078
aging+0.078
broken+0.077
ower+0.077
outing+0.076
Neg logit contributions
|-0.047
 meets-0.042
 play-0.042
 meet-0.041
 plays-0.039
TokenHis
Token ID6653
Feature activation+1.21e+00

Pos logit contributions
 impressed+0.058
aging+0.056
 mesmer+0.056
 pleased+0.056
ower+0.056
Neg logit contributions
|-0.065
 play-0.062
 borrow-0.059
 meet-0.059
 meets-0.058
Token sister
Token ID6621
Feature activation+1.27e+00

Pos logit contributions
 impressed+0.027
aging+0.026
broken+0.026
ower+0.026
 pleased+0.025
Neg logit contributions
|-0.026
 meets-0.023
 meet-0.023
 play-0.022
 plays-0.022
Token yelled
Token ID22760
Feature activation+1.94e+00

Pos logit contributions
 impressed+0.027
aging+0.026
ower+0.026
broken+0.026
outing+0.025
Neg logit contributions
|-0.027
 meets-0.025
 play-0.025
 meet-0.025
 plays-0.025
Token,
Token ID11
Feature activation+8.34e+00

Pos logit contributions
 impressed+0.037
ower+0.036
broken+0.036
outing+0.035
 pleased+0.035
Neg logit contributions
 play-0.043
 plays-0.042
|-0.042
 meets-0.041
 meet-0.040
Token "
Token ID366
Feature activation-3.16e+00

Pos logit contributions
broken+0.186
 impressed+0.184
outing+0.177
ower+0.177
 pleased+0.176
Neg logit contributions
 play-0.161
 meet-0.150
 plays-0.149
|-0.143
 meets-0.142
TokenTim
Token ID14967
Feature activation+1.70e+00

Pos logit contributions
|+0.055
 meets+0.051
 plays+0.049
 meet+0.049
 play+0.048
Neg logit contributions
 impressed-0.079
broken-0.078
aging-0.078
outing-0.078
ower-0.078
Tokenmy
Token ID1820
Feature activation+2.08e+00

Pos logit contributions
 impressed+0.050
broken+0.049
outing+0.049
aging+0.049
ower+0.049
Neg logit contributions
|-0.023
 meets-0.020
 meet-0.019
 play-0.018
 plays-0.018
Token,
Token ID11
Feature activation+5.60e+00

Pos logit contributions
 impressed+0.027
aging+0.026
ower+0.026
broken+0.026
outing+0.026
Neg logit contributions
|-0.064
 meets-0.058
 meet-0.056
 play-0.055
 plays-0.055
Token stop
Token ID2245
Feature activation+2.76e+00

Pos logit contributions
 impressed+0.127
aging+0.123
ower+0.122
outing+0.121
 pleased+0.119
Neg logit contributions
|-0.116
 meets-0.107
 plays-0.095
 meet-0.093
 fits-0.090
Token it
Token ID340
Feature activation-2.47e+00

Pos logit contributions
|+0.082
 meets+0.072
 meet+0.070
 play+0.069
 borrow+0.068
Neg logit contributions
 impressed-0.077
aging-0.075
ower-0.075
outing-0.073
 pleased-0.072
Token together
Token ID1978
Feature activation+4.95e-01

Pos logit contributions
|+0.064
 meets+0.058
 meet+0.057
 plays+0.056
 play+0.055
Neg logit contributions
 impressed-0.042
broken-0.040
aging-0.040
ower-0.040
 pleased-0.039
Token."
Token ID526
Feature activation+1.94e+00

Pos logit contributions
 impressed+0.010
ower+0.009
broken+0.009
 pleased+0.009
outing+0.009
Neg logit contributions
|-0.012
 meets-0.011
 meet-0.010
 plays-0.010
 play-0.010
Token Dad
Token ID17415
Feature activation+2.37e+00

Pos logit contributions
 impressed+0.038
aging+0.036
ower+0.036
broken+0.036
outing+0.036
Neg logit contributions
|-0.045
 meets-0.043
 play-0.042
 meet-0.041
 plays-0.041
Token said
Token ID531
Feature activation+5.21e+00

Pos logit contributions
ower+0.052
aging+0.050
broken+0.049
uttering+0.049
 impressed+0.048
Neg logit contributions
|-0.046
 play-0.046
 plays-0.045
 meets-0.044
 borrow-0.044
Token,
Token ID11
Feature activation+8.32e+00

Pos logit contributions
 impressed+0.102
ower+0.100
broken+0.098
aging+0.097
outing+0.097
Neg logit contributions
 play-0.112
|-0.112
 borrow-0.108
 plays-0.107
 meets-0.105
Token smiling
Token ID16755
Feature activation+3.06e+00

Pos logit contributions
ower+0.185
oles+0.177
andel+0.173
iring+0.172
outing+0.172
Neg logit contributions
 play-0.171
 meet-0.154
 borrow-0.151
 join-0.149
 build-0.144
Token.
Token ID13
Feature activation-4.73e-01

Pos logit contributions
 impressed+0.078
ower+0.077
aging+0.076
 sudden+0.075
outing+0.075
Neg logit contributions
 play-0.047
 plays-0.044
|-0.041
 meets-0.041
 borrow-0.041
Token
Token ID198
Feature activation+3.70e+00

Pos logit contributions
|+0.008
 meets+0.007
 play+0.007
 plays+0.007
 meet+0.007
Neg logit contributions
 impressed-0.012
aging-0.012
ower-0.012
broken-0.012
outing-0.012
Token
Token ID198
Feature activation+3.47e+00

Pos logit contributions
 impressed+0.098
aging+0.098
ower+0.097
broken+0.096
outing+0.096
Neg logit contributions
|-0.059
 play-0.054
 meets-0.054
 meet-0.052
 plays-0.052
Token"
Token ID1
Feature activation-1.44e+00

Pos logit contributions
aging+0.073
ower+0.071
 impressed+0.070
 mesmer+0.068
outing+0.068
Neg logit contributions
|-0.075
 play-0.073
 meets-0.070
 plays-0.069
 meet-0.068
Token hurt
Token ID5938
Feature activation+5.04e-01

Pos logit contributions
|+0.050
 meets+0.045
 meet+0.044
 play+0.043
 borrow+0.042
Neg logit contributions
 impressed-0.076
aging-0.074
broken-0.073
ower-0.073
outing-0.072
Token!"
Token ID2474
Feature activation+2.66e+00

Pos logit contributions
 impressed+0.013
aging+0.012
 pleased+0.012
ower+0.012
broken+0.012
Neg logit contributions
|-0.009
 play-0.008
 meets-0.008
 plays-0.008
 meet-0.008
Token Mom
Token ID11254
Feature activation+2.64e+00

Pos logit contributions
king+0.069
 impressed+0.069
broken+0.069
ower+0.066
outing+0.065
Neg logit contributions
 play-0.046
 plays-0.042
|-0.041
 borrow-0.040
 meets-0.040
Tokenmy
Token ID1820
Feature activation+3.10e+00

Pos logit contributions
 impressed+0.095
ower+0.093
aging+0.092
 pleased+0.092
 mesmer+0.091
Neg logit contributions
|-0.016
 play-0.015
 plays-0.013
 meets-0.012
 meet-0.012
Token said
Token ID531
Feature activation+4.55e+00

Pos logit contributions
uttering+0.088
ower+0.083
andel+0.078
 impressed+0.078
 pleased+0.077
Neg logit contributions
 play-0.046
's-0.046
 plays-0.043
 borrow-0.041
 meets-0.040
Token,
Token ID11
Feature activation+8.32e+00

Pos logit contributions
 impressed+0.120
 mesmer+0.114
 sudden+0.113
outing+0.113
ower+0.112
Neg logit contributions
 play-0.070
 borrow-0.065
 plays-0.061
|-0.059
 join-0.058
Token rubbing
Token ID27714
Feature activation+6.09e-01

Pos logit contributions
agg+0.183
ower+0.180
iring+0.179
ment+0.177
andel+0.175
Neg logit contributions
 play-0.167
 meet-0.142
 plays-0.141
 build-0.138
 have-0.136
Token her
Token ID607
Feature activation+5.47e-01

Pos logit contributions
 impressed+0.019
 pleased+0.018
ower+0.018
broken+0.017
aging+0.017
Neg logit contributions
|-0.008
 play-0.007
 plays-0.007
 meets-0.006
 borrow-0.006
Token eye
Token ID4151
Feature activation+5.67e-01

Pos logit contributions
 impressed+0.011
ower+0.011
aging+0.011
outing+0.011
 pleased+0.011
Neg logit contributions
|-0.012
 play-0.012
 meets-0.011
 plays-0.011
 meet-0.011
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
 impressed+0.015
 pleased+0.014
 mesmer+0.014
aging+0.013
ower+0.013
Neg logit contributions
 play-0.009
 borrow-0.009
 plays-0.008
|-0.008
 fits-0.008
Token "
Token ID366
Feature activation-2.55e+00

Pos logit contributions
 play+0.005
 meets+0.005
 meet+0.005
 plays+0.005
|+0.005
Neg logit contributions
broken-0.011
 impressed-0.011
itive-0.011
ised-0.011
aver-0.011
Token't
Token ID470
Feature activation-6.23e-01

Pos logit contributions
 meets+0.021
|+0.021
 plays+0.019
 fits+0.018
 meet+0.018
Neg logit contributions
 impressed-0.036
aging-0.035
ower-0.035
broken-0.034
outing-0.033
Token know
Token ID760
Feature activation+1.59e-01

Pos logit contributions
|+0.015
 meets+0.014
 fits+0.013
 plays+0.013
uggle+0.013
Neg logit contributions
 impressed-0.012
broken-0.011
ower-0.011
aging-0.011
outing-0.011
Token,"
Token ID553
Feature activation+3.26e+00

Pos logit contributions
 impressed+0.003
outing+0.003
aging+0.003
ower+0.003
 pleased+0.003
Neg logit contributions
|-0.004
 meets-0.004
 meet-0.003
 play-0.003
 plays-0.003
Token mom
Token ID1995
Feature activation+1.86e+00

Pos logit contributions
 impressed+0.084
 pleased+0.081
aging+0.080
outing+0.079
broken+0.079
Neg logit contributions
|-0.057
 meets-0.050
 meet-0.049
 play-0.048
 plays-0.047
Token said
Token ID531
Feature activation+4.97e+00

Pos logit contributions
ower+0.042
aging+0.042
 impressed+0.041
 mesmer+0.040
broken+0.040
Neg logit contributions
|-0.036
 play-0.035
 plays-0.033
 meets-0.033
 borrow-0.033
Token,
Token ID11
Feature activation+8.31e+00

Pos logit contributions
 impressed+0.104
ower+0.102
aging+0.099
outing+0.099
broken+0.098
Neg logit contributions
 play-0.100
|-0.099
 borrow-0.095
 plays-0.095
 meets-0.093
Token hugging
Token ID46292
Feature activation+3.64e-01

Pos logit contributions
ower+0.182
iring+0.173
andel+0.170
oles+0.169
agg+0.168
Neg logit contributions
 play-0.172
 meet-0.155
 borrow-0.150
 plays-0.149
 join-0.146
Token him
Token ID683
Feature activation+1.91e+00

Pos logit contributions
outing+0.009
aging+0.008
 impressed+0.008
broken+0.008
 sudden+0.008
Neg logit contributions
|-0.007
 meets-0.006
 meet-0.006
 fits-0.006
 borrow-0.006
Token and
Token ID290
Feature activation+3.63e+00

Pos logit contributions
 impressed+0.044
ower+0.043
aging+0.042
outing+0.042
broken+0.042
Neg logit contributions
|-0.037
 play-0.034
 meets-0.034
 plays-0.033
 meet-0.033
Token Lily
Token ID20037
Feature activation+4.27e-01

Pos logit contributions
ower+0.071
 impressed+0.070
aging+0.070
outing+0.069
broken+0.069
Neg logit contributions
|-0.081
 play-0.078
 meets-0.078
 plays-0.076
 meet-0.075
Token.
Token ID13
Feature activation-7.10e-01

Pos logit contributions
 impressed+0.010
aging+0.010
broken+0.009
 pleased+0.009
outing+0.009
Neg logit contributions
|-0.009
 meets-0.008
 meet-0.008
 play-0.007
 plays-0.007
Token you
Token ID345
Feature activation-1.15e+00

Pos logit contributions
 impressed+0.003
aging+0.003
ower+0.003
broken+0.003
outing+0.003
Neg logit contributions
|-0.002
 meets-0.002
 play-0.002
 meet-0.002
 plays-0.002
Token off
Token ID572
Feature activation+2.03e+00

Pos logit contributions
|+0.022
 meets+0.020
 play+0.020
 meet+0.020
 plays+0.020
Neg logit contributions
 impressed-0.027
aging-0.026
ower-0.026
outing-0.026
broken-0.026
Token,"
Token ID553
Feature activation+3.07e+00

Pos logit contributions
 impressed+0.043
aging+0.042
broken+0.042
 pleased+0.041
ower+0.041
Neg logit contributions
|-0.043
 meets-0.040
 meet-0.038
 plays-0.038
 play-0.038
Token mom
Token ID1995
Feature activation+2.21e+00

Pos logit contributions
 impressed+0.068
aging+0.067
ower+0.067
broken+0.066
outing+0.066
Neg logit contributions
|-0.062
 play-0.058
 meets-0.058
 plays-0.057
 meet-0.056
Token says
Token ID1139
Feature activation+5.13e+00

Pos logit contributions
 impressed+0.044
outing+0.042
aging+0.040
 pleased+0.040
 amused+0.039
Neg logit contributions
|-0.051
 meets-0.045
 play-0.044
 meet-0.044
uggle-0.043
Token,
Token ID11
Feature activation+8.31e+00

Pos logit contributions
 impressed+0.115
ower+0.111
broken+0.110
aging+0.109
outing+0.109
Neg logit contributions
|-0.100
 play-0.097
 plays-0.093
 meets-0.093
 borrow-0.093
Token taking
Token ID2263
Feature activation-1.95e+00

Pos logit contributions
ower+0.192
oles+0.183
iring+0.182
outing+0.182
andel+0.176
Neg logit contributions
 play-0.166
 borrow-0.147
 meet-0.146
 plays-0.142
 join-0.138
Token Lily
Token ID20037
Feature activation+1.08e-01

Pos logit contributions
|+0.046
 meets+0.043
 fits+0.041
 meet+0.041
 Welcome+0.039
Neg logit contributions
aging-0.035
 sudden-0.031
broken-0.030
ment-0.029
aged-0.029
Token in
Token ID287
Feature activation-8.84e-01

Pos logit contributions
aging+0.002
 impressed+0.002
broken+0.002
ower+0.002
outing+0.002
Neg logit contributions
|-0.002
 meets-0.002
 play-0.002
 meet-0.002
 plays-0.002
Token her
Token ID607
Feature activation-6.81e-01

Pos logit contributions
|+0.017
 meets+0.016
 play+0.015
 meet+0.015
 plays+0.015
Neg logit contributions
 impressed-0.021
aging-0.020
broken-0.020
ower-0.020
 pleased-0.020
Token arms
Token ID5101
Feature activation+2.28e+00

Pos logit contributions
|+0.016
 meets+0.015
 meet+0.014
Ready+0.014
 borrow+0.014
Neg logit contributions
aging-0.013
 sudden-0.012
broken-0.012
ower-0.012
 impressed-0.012
Token friends
Token ID2460
Feature activation-1.24e+00

Pos logit contributions
|+0.039
 meets+0.034
 meet+0.033
 borrow+0.033
 plays+0.032
Neg logit contributions
ower-0.036
 impressed-0.036
aging-0.036
 pleased-0.035
broken-0.035
Token."
Token ID526
Feature activation+1.85e+00

Pos logit contributions
|+0.022
 meets+0.019
 plays+0.018
 meet+0.018
Ready+0.018
Neg logit contributions
 impressed-0.031
broken-0.030
ower-0.030
 pleased-0.030
aging-0.030
Token their
Token ID511
Feature activation+2.80e+00

Pos logit contributions
 impressed+0.044
aging+0.042
broken+0.042
 pleased+0.042
ower+0.041
Neg logit contributions
|-0.035
 meets-0.032
 meet-0.031
 play-0.031
 plays-0.030
Token parents
Token ID3397
Feature activation+2.24e+00

Pos logit contributions
 impressed+0.057
 pleased+0.055
outing+0.054
ower+0.054
 mesmer+0.053
Neg logit contributions
 play-0.057
 meets-0.056
 plays-0.055
|-0.054
 meet-0.053
Token say
Token ID910
Feature activation+4.79e+00

Pos logit contributions
ower+0.048
 impressed+0.048
aging+0.047
broken+0.047
 pleased+0.046
Neg logit contributions
|-0.046
 play-0.044
 meets-0.043
 meet-0.043
 plays-0.042
Token,
Token ID11
Feature activation+8.30e+00

Pos logit contributions
 impressed+0.102
 mesmer+0.101
ower+0.101
 intense+0.098
outing+0.098
Neg logit contributions
 play-0.092
 borrow-0.091
 plays-0.086
 meet-0.084
 join-0.084
Token taking
Token ID2263
Feature activation-1.54e+00

Pos logit contributions
ower+0.186
iring+0.183
oles+0.182
agg+0.181
andel+0.181
Neg logit contributions
 play-0.167
 meet-0.148
 borrow-0.141
 plays-0.141
 join-0.139
Token their
Token ID511
Feature activation+9.63e-01

Pos logit contributions
|+0.032
 meets+0.029
 fits+0.028
 meet+0.028
 play+0.027
Neg logit contributions
aging-0.033
broken-0.032
 impressed-0.032
 sudden-0.031
 pleased-0.030
Token hands
Token ID2832
Feature activation+2.82e+00

Pos logit contributions
aging+0.019
 impressed+0.019
 sudden+0.019
broken+0.018
outing+0.018
Neg logit contributions
|-0.022
 meets-0.019
 meet-0.019
 borrow-0.019
 fits-0.018
Token and
Token ID290
Feature activation+3.14e+00

Pos logit contributions
 impressed+0.048
ower+0.045
aging+0.045
 pleased+0.044
outing+0.044
Neg logit contributions
|-0.071
 play-0.070
 borrow-0.067
 plays-0.066
 meet-0.066
Token leading
Token ID3756
Feature activation-4.13e-01

Pos logit contributions
 impressed+0.060
aging+0.058
broken+0.057
ower+0.056
 pleased+0.056
Neg logit contributions
|-0.075
 meets-0.070
 play-0.068
 meet-0.068
 plays-0.067
Token flashlight
Token ID38371
Feature activation+1.11e+00

Pos logit contributions
 impressed+0.023
 pleased+0.022
ower+0.022
broken+0.022
aging+0.021
Neg logit contributions
|-0.023
 meets-0.021
 meet-0.020
 plays-0.020
 play-0.020
Token,
Token ID11
Feature activation+3.87e+00

Pos logit contributions
 impressed+0.018
aging+0.017
ower+0.017
broken+0.017
outing+0.017
Neg logit contributions
|-0.030
 meets-0.027
 meet-0.026
 plays-0.026
 play-0.026
Token and
Token ID290
Feature activation+9.83e-01

Pos logit contributions
 impressed+0.063
ower+0.061
outing+0.061
aging+0.061
broken+0.061
Neg logit contributions
|-0.100
 play-0.096
 meets-0.096
 plays-0.094
 meet-0.094
Token a
Token ID257
Feature activation+9.10e-01

Pos logit contributions
 impressed+0.019
aging+0.018
 pleased+0.017
broken+0.017
 knowing+0.017
Neg logit contributions
|-0.024
 meets-0.022
 meet-0.021
 play-0.021
 plays-0.021
Token magn
Token ID7842
Feature activation-6.79e-01

Pos logit contributions
 impressed+0.020
 pleased+0.019
ower+0.019
broken+0.019
aging+0.019
Neg logit contributions
|-0.019
 meets-0.017
 meet-0.016
 plays-0.016
 play-0.016
Tokenifying
Token ID4035
Feature activation-8.15e+00

Pos logit contributions
|+0.016
 meets+0.015
 play+0.015
 plays+0.015
 meet+0.015
Neg logit contributions
aging-0.012
outing-0.012
 impressed-0.012
ower-0.012
broken-0.012
Token glass
Token ID5405
Feature activation-3.10e-01

Pos logit contributions
|+0.167
 meets+0.164
 meet+0.156
 plays+0.153
 play+0.148
Neg logit contributions
 impressed-0.165
broken-0.162
ower-0.161
 pleased-0.160
aging-0.159
Token.
Token ID13
Feature activation-2.51e+00

Pos logit contributions
|+0.008
 meets+0.007
 meet+0.007
 plays+0.007
 play+0.007
Neg logit contributions
ower-0.006
 impressed-0.006
aging-0.006
broken-0.005
outing-0.005
Token
Token ID198
Feature activation+1.82e+00

Pos logit contributions
|+0.049
 meets+0.046
 play+0.044
 plays+0.043
 meet+0.043
Neg logit contributions
 impressed-0.056
 pleased-0.053
 mesmer-0.052
ower-0.052
outing-0.052
Token
Token ID198
Feature activation+1.83e+00

Pos logit contributions
 impressed+0.040
 pleased+0.038
ower+0.037
aging+0.037
outing+0.037
Neg logit contributions
|-0.036
 meets-0.034
 plays-0.034
 play-0.034
 fits-0.033
TokenAs
Token ID1722
Feature activation+4.43e+00

Pos logit contributions
 impressed+0.030
broken+0.029
aging+0.029
ower+0.029
 pleased+0.028
Neg logit contributions
|-0.048
 meets-0.045
 play-0.044
 plays-0.044
 meet-0.044
Token When
Token ID1649
Feature activation+2.08e+00

Pos logit contributions
|+0.041
 meets+0.038
 fits+0.036
 plays+0.035
 play+0.035
Neg logit contributions
 impressed-0.044
 pleased-0.042
ower-0.041
 mesmer-0.041
outing-0.040
Token Daisy
Token ID40355
Feature activation-4.43e-01

Pos logit contributions
aging+0.032
 impressed+0.032
 pleased+0.031
 knowing+0.030
 sudden+0.029
Neg logit contributions
|-0.055
 meets-0.050
uggle-0.046
 meet-0.046
 fits-0.045
Token put
Token ID1234
Feature activation-2.83e+00

Pos logit contributions
|+0.010
uggle+0.009
 meets+0.009
Ready+0.009
 Friendship+0.009
Neg logit contributions
 impressed-0.007
aging-0.007
 pleased-0.007
 mesmer-0.007
broken-0.007
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
|+0.067
 meets+0.062
 meet+0.060
 play+0.057
 plays+0.057
Neg logit contributions
aging-0.053
 impressed-0.053
 pleased-0.052
ower-0.051
outing-0.049
Token magn
Token ID7842
Feature activation-3.03e-01

Pos logit contributions
|+0.040
 meets+0.036
 meet+0.034
 plays+0.034
 play+0.032
Neg logit contributions
 impressed-0.040
ower-0.040
aging-0.039
broken-0.039
 pleased-0.039
Tokenifying
Token ID4035
Feature activation-8.06e+00

Pos logit contributions
|+0.007
 meets+0.007
 play+0.007
 meet+0.007
 plays+0.007
Neg logit contributions
aging-0.005
outing-0.005
ower-0.005
broken-0.005
 impressed-0.005
Token glass
Token ID5405
Feature activation-5.32e-01

Pos logit contributions
|+0.158
 meets+0.152
 meet+0.146
 plays+0.145
 play+0.143
Neg logit contributions
 impressed-0.174
ower-0.169
broken-0.167
 pleased-0.167
aging-0.167
Token down
Token ID866
Feature activation-1.47e-01

Pos logit contributions
|+0.015
 meets+0.014
 meet+0.014
 plays+0.013
 play+0.013
Neg logit contributions
ower-0.007
 impressed-0.007
aging-0.007
broken-0.007
outing-0.007
Token,
Token ID11
Feature activation+4.47e+00

Pos logit contributions
|+0.004
 meets+0.004
 meet+0.004
 plays+0.004
 play+0.004
Neg logit contributions
 impressed-0.002
aging-0.002
 pleased-0.002
ower-0.002
outing-0.002
Token the
Token ID262
Feature activation-5.87e-01

Pos logit contributions
 impressed+0.075
 pleased+0.071
aging+0.070
 sudden+0.068
 mesmer+0.067
Neg logit contributions
|-0.116
 meets-0.109
 plays-0.103
 meet-0.103
 borrow-0.100
Token cater
Token ID20825
Feature activation+3.71e-01

Pos logit contributions
|+0.012
 meets+0.011
 meet+0.011
 plays+0.011
 borrow+0.010
Neg logit contributions
 impressed-0.013
ower-0.013
aging-0.012
broken-0.012
 sudden-0.012
Token,
Token ID11
Feature activation+6.62e+00

Pos logit contributions
 impressed+0.069
broken+0.067
ower+0.067
aging+0.067
outing+0.066
Neg logit contributions
 play-0.067
 borrow-0.064
|-0.064
 plays-0.063
 meet-0.063
Token handing
Token ID22786
Feature activation-3.06e+00

Pos logit contributions
aging+0.132
ower+0.130
broken+0.129
outing+0.128
 impressed+0.126
Neg logit contributions
 play-0.142
|-0.138
 meet-0.137
 meets-0.133
 plays-0.132
Token him
Token ID683
Feature activation-4.41e-01

Pos logit contributions
|+0.067
 meets+0.062
 fits+0.057
 meet+0.055
 plays+0.053
Neg logit contributions
aging-0.056
 knowing-0.054
 as-0.054
ised-0.052
king-0.052
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
|+0.012
 meets+0.011
Ready+0.011
 meet+0.010
 Welcome+0.010
Neg logit contributions
aging-0.006
 impressed-0.006
broken-0.006
 pleased-0.006
outing-0.005
Token magn
Token ID7842
Feature activation-7.57e-02

Pos logit contributions
|+0.040
 meets+0.034
 plays+0.032
 meet+0.032
 play+0.031
Neg logit contributions
 impressed-0.039
ower-0.039
 pleased-0.038
aging-0.038
outing-0.037
Tokenifying
Token ID4035
Feature activation-7.77e+00

Pos logit contributions
|+0.002
 meets+0.002
 meet+0.002
 play+0.002
 plays+0.002
Neg logit contributions
outing-0.001
ower-0.001
broken-0.001
aging-0.001
king-0.001
Token glass
Token ID5405
Feature activation-1.05e-01

Pos logit contributions
|+0.162
 meets+0.156
 meet+0.150
 plays+0.146
 play+0.143
Neg logit contributions
 impressed-0.156
 pleased-0.150
ower-0.148
broken-0.148
outing-0.147
Token.
Token ID13
Feature activation-2.17e+00

Pos logit contributions
|+0.003
 meets+0.003
 meet+0.002
 plays+0.002
 play+0.002
Neg logit contributions
 impressed-0.002
ower-0.002
outing-0.002
broken-0.002
aging-0.002
Token She
Token ID1375
Feature activation-8.55e-02

Pos logit contributions
|+0.049
 meets+0.045
 meet+0.042
 plays+0.041
 fits+0.041
Neg logit contributions
 impressed-0.044
ower-0.043
outing-0.041
 pleased-0.041
aging-0.040
Token looked
Token ID3114
Feature activation+1.47e+00

Pos logit contributions
|+0.002
 play+0.002
 meets+0.002
 plays+0.002
 meet+0.002
Neg logit contributions
ower-0.002
aging-0.002
outing-0.002
 impressed-0.002
broken-0.002
Token around
Token ID1088
Feature activation+1.25e+00

Pos logit contributions
ower+0.029
aging+0.028
broken+0.027
outing+0.027
oles+0.027
Neg logit contributions
 play-0.033
 meets-0.032
|-0.032
 plays-0.032
 meet-0.030
Token his
Token ID465
Feature activation-9.28e-01

Pos logit contributions
 impressed+0.118
ower+0.118
broken+0.116
outing+0.116
aging+0.115
Neg logit contributions
|-0.071
 play-0.068
 plays-0.065
 meets-0.063
 borrow-0.062
Token mum
Token ID25682
Feature activation-1.37e-01

Pos logit contributions
|+0.024
 meets+0.023
 play+0.023
 meet+0.022
 plays+0.022
Neg logit contributions
 impressed-0.015
aging-0.015
ower-0.015
broken-0.015
outing-0.015
Token opened
Token ID4721
Feature activation-1.10e+00

Pos logit contributions
|+0.003
 play+0.003
 meets+0.003
 plays+0.003
 meet+0.003
Neg logit contributions
 impressed-0.003
ower-0.003
aging-0.003
outing-0.003
broken-0.003
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
|+0.022
 meets+0.020
 meet+0.019
 play+0.019
 plays+0.019
Neg logit contributions
 impressed-0.025
aging-0.025
broken-0.024
 pleased-0.024
outing-0.024
Token l
Token ID300
Feature activation-1.04e+00

Pos logit contributions
|+0.064
 meets+0.057
 meet+0.056
 plays+0.054
 play+0.053
Neg logit contributions
aging-0.041
broken-0.041
 impressed-0.041
outing-0.040
ower-0.040
Token<|endoftext|>
Token ID50256
Feature activation-7.73e+00

Pos logit contributions
|+0.025
 play+0.023
 meets+0.023
 meet+0.023
 plays+0.023
Neg logit contributions
 impressed-0.019
aging-0.019
outing-0.019
ower-0.019
broken-0.018
TokenDave
Token ID27984
Feature activation-2.48e-01

Pos logit contributions
 shares+0.172
 fits+0.170
 belong+0.169
 borrow+0.168
 meets+0.167
Neg logit contributions
 sudden-0.099
ower-0.099
aging-0.097
ment-0.095
 impressed-0.093
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
|+0.007
 meets+0.007
 play+0.007
 meet+0.007
 plays+0.007
Neg logit contributions
 impressed-0.003
aging-0.003
ower-0.003
outing-0.003
broken-0.003
Token Amy
Token ID14235
Feature activation-1.44e+00

Pos logit contributions
|+0.032
 meets+0.029
 fits+0.028
uggle+0.027
Ready+0.027
Neg logit contributions
 impressed-0.029
 pleased-0.028
 knowing-0.027
 mesmer-0.027
 understanding-0.026
Token were
Token ID547
Feature activation-2.71e-02

Pos logit contributions
|+0.033
 meets+0.029
 fits+0.028
Ready+0.028
uggle+0.027
Neg logit contributions
aging-0.026
 impressed-0.025
 mesmer-0.025
 sudden-0.024
 knowing-0.024
Token out
Token ID503
Feature activation+2.55e+00

Pos logit contributions
|+0.001
 meets+0.001
 play+0.001
 meet+0.000
 plays+0.000
Neg logit contributions
 impressed-0.001
aging-0.001
ower-0.001
broken-0.001
outing-0.001
Token examine
Token ID10716
Feature activation-3.93e+00

Pos logit contributions
|+0.072
 meets+0.065
 plays+0.059
Ready+0.059
Check+0.059
Neg logit contributions
 impressed-0.078
 pleased-0.077
ower-0.076
aging-0.073
broken-0.073
Token it
Token ID340
Feature activation-3.31e+00

Pos logit contributions
|+0.095
 meets+0.081
 meet+0.080
 play+0.078
 plays+0.077
Neg logit contributions
 impressed-0.077
aging-0.073
 pleased-0.071
ised-0.070
outing-0.070
Token with
Token ID351
Feature activation-2.16e+00

Pos logit contributions
|+0.067
 meets+0.060
 meet+0.057
 plays+0.057
 play+0.057
Neg logit contributions
 impressed-0.076
aging-0.072
ower-0.072
broken-0.072
 pleased-0.072
Token the
Token ID262
Feature activation-1.89e+00

Pos logit contributions
|+0.040
 meets+0.034
 plays+0.032
 meet+0.031
 play+0.031
Neg logit contributions
 impressed-0.054
aging-0.053
 pleased-0.051
ower-0.051
broken-0.050
Token magn
Token ID7842
Feature activation-6.12e-01

Pos logit contributions
|+0.045
 meets+0.038
 plays+0.036
 meet+0.035
uggle+0.035
Neg logit contributions
 impressed-0.038
ower-0.037
aging-0.037
 pleased-0.037
outing-0.036
Tokenifying
Token ID4035
Feature activation-7.63e+00

Pos logit contributions
|+0.016
 meets+0.015
 play+0.015
 meet+0.015
 plays+0.015
Neg logit contributions
ower-0.010
outing-0.010
aging-0.009
king-0.009
broken-0.009
Token glass
Token ID5405
Feature activation-7.82e-01

Pos logit contributions
|+0.168
 meets+0.162
 meet+0.154
 plays+0.150
 play+0.145
Neg logit contributions
 impressed-0.133
 pleased-0.131
ower-0.129
broken-0.128
ment-0.128
Token?"
Token ID1701
Feature activation+7.20e-01

Pos logit contributions
|+0.016
 meets+0.015
 meet+0.014
 plays+0.014
 play+0.014
Neg logit contributions
 impressed-0.017
ower-0.017
outing-0.016
broken-0.016
 pleased-0.016
Token
Token ID198
Feature activation+2.58e+00

Pos logit contributions
 impressed+0.016
 pleased+0.015
 sudden+0.015
aging+0.015
ower+0.014
Neg logit contributions
|-0.015
 meets-0.014
 meet-0.013
 plays-0.013
 fits-0.013
Token
Token ID198
Feature activation+2.20e+00

Pos logit contributions
 impressed+0.061
aging+0.059
ower+0.058
broken+0.058
 pleased+0.058
Neg logit contributions
|-0.049
 meets-0.045
 play-0.044
 meet-0.043
 plays-0.043
Token"
Token ID1
Feature activation-2.12e+00

Pos logit contributions
 impressed+0.043
aging+0.043
ower+0.041
outing+0.041
broken+0.041
Neg logit contributions
|-0.050
 play-0.048
 meets-0.047
 meet-0.046
 plays-0.046
Token her
Token ID607
Feature activation-1.53e+00

Pos logit contributions
|+0.026
 play+0.025
 meets+0.024
 plays+0.024
 meet+0.023
Neg logit contributions
 impressed-0.042
ower-0.042
broken-0.041
outing-0.041
aging-0.040
Token own
Token ID898
Feature activation-1.97e+00

Pos logit contributions
 play+0.030
 meets+0.029
 plays+0.028
 fits+0.028
 meet+0.028
Neg logit contributions
 impressed-0.034
broken-0.034
outing-0.034
ower-0.032
aging-0.032
Token salt
Token ID8268
Feature activation+1.69e-01

Pos logit contributions
 play+0.039
 meets+0.037
 borrow+0.036
 plays+0.036
 meet+0.036
Neg logit contributions
 impressed-0.044
outing-0.043
broken-0.042
ower-0.041
aging-0.041
Token perfectly
Token ID7138
Feature activation+3.60e-01

Pos logit contributions
 impressed+0.003
aging+0.003
ower+0.003
broken+0.003
outing+0.003
Neg logit contributions
|-0.004
 meets-0.004
 meet-0.004
 play-0.004
 plays-0.004
Token!
Token ID0
Feature activation-2.45e+00

Pos logit contributions
 impressed+0.008
aging+0.008
outing+0.008
ower+0.008
 sudden+0.008
Neg logit contributions
 play-0.006
|-0.006
 borrow-0.006
 meets-0.006
 fits-0.006
Token<|endoftext|>
Token ID50256
Feature activation-7.59e+00

Pos logit contributions
|+0.051
 meets+0.046
 play+0.044
 hunt+0.043
 meet+0.042
Neg logit contributions
 impressed-0.052
 knowing-0.050
 pleased-0.050
ower-0.048
 mesmer-0.048
TokenOnce
Token ID7454
Feature activation-6.59e-01

Pos logit contributions
 fits+0.182
 shares+0.182
 borrow+0.181
 meets+0.179
 plays+0.178
Neg logit contributions
 impressed-0.094
 mesmer-0.087
 pleased-0.085
 sudden-0.084
 knowing-0.083
Token upon
Token ID2402
Feature activation-2.99e+00

Pos logit contributions
|+0.017
 meets+0.016
 play+0.016
 meet+0.016
 plays+0.015
Neg logit contributions
 impressed-0.011
aging-0.011
ower-0.011
broken-0.010
outing-0.010
Token a
Token ID257
Feature activation+8.05e-01

Pos logit contributions
|+0.070
 meets+0.066
 play+0.064
 plays+0.064
 meet+0.064
Neg logit contributions
 impressed-0.056
aging-0.054
broken-0.053
 pleased-0.053
ower-0.053
Token time
Token ID640
Feature activation+1.04e+00

Pos logit contributions
aging+0.020
 impressed+0.020
broken+0.020
outing+0.020
ower+0.019
Neg logit contributions
|-0.016
 play-0.014
 meet-0.013
 meets-0.013
 plays-0.013
Token,
Token ID11
Feature activation+6.64e-01

Pos logit contributions
broken+0.020
 impressed+0.020
aging+0.020
outing+0.019
ower+0.019
Neg logit contributions
|-0.025
 play-0.023
 meets-0.022
 meet-0.022
 plays-0.022
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
|+0.069
 meets+0.063
 play+0.062
 plays+0.062
 borrow+0.061
Neg logit contributions
 impressed-0.049
aging-0.047
 pleased-0.045
broken-0.044
 mesmer-0.044
Token flashlight
Token ID38371
Feature activation+1.00e+00

Pos logit contributions
|+0.032
 meets+0.029
 plays+0.028
 meet+0.028
 play+0.028
Neg logit contributions
 impressed-0.024
ower-0.022
aging-0.022
 pleased-0.022
broken-0.022
Token and
Token ID290
Feature activation+3.39e-01

Pos logit contributions
 impressed+0.017
aging+0.016
 pleased+0.016
broken+0.016
ower+0.016
Neg logit contributions
|-0.026
 play-0.025
 meets-0.025
 plays-0.024
 meet-0.024
Token a
Token ID257
Feature activation+4.68e-02

Pos logit contributions
 impressed+0.006
aging+0.006
ower+0.006
broken+0.006
 pleased+0.006
Neg logit contributions
|-0.008
 meets-0.007
 meet-0.007
 plays-0.007
 play-0.007
Token magn
Token ID7842
Feature activation-1.05e+00

Pos logit contributions
 impressed+0.001
aging+0.001
ower+0.001
 pleased+0.001
broken+0.001
Neg logit contributions
|-0.001
 meets-0.001
 meet-0.001
 plays-0.001
 play-0.001
Tokenifying
Token ID4035
Feature activation-7.59e+00

Pos logit contributions
|+0.025
 meets+0.024
 play+0.023
 plays+0.023
 meet+0.023
Neg logit contributions
aging-0.019
outing-0.019
ower-0.019
 impressed-0.019
broken-0.019
Token glass
Token ID5405
Feature activation-2.69e-01

Pos logit contributions
|+0.152
 meets+0.148
 meet+0.140
 plays+0.139
 play+0.135
Neg logit contributions
 impressed-0.157
ower-0.153
broken-0.152
aging-0.152
 pleased-0.151
Token to
Token ID284
Feature activation-3.51e+00

Pos logit contributions
|+0.006
 meets+0.006
 meet+0.005
 play+0.005
 plays+0.005
Neg logit contributions
 impressed-0.005
aging-0.005
ower-0.005
outing-0.005
broken-0.005
Token examine
Token ID10716
Feature activation-3.87e+00

Pos logit contributions
|+0.055
 meets+0.049
 plays+0.043
Ready+0.042
 fits+0.042
Neg logit contributions
aging-0.092
 impressed-0.091
ower-0.090
 pleased-0.088
broken-0.086
Token their
Token ID511
Feature activation-1.16e+00

Pos logit contributions
|+0.092
 meets+0.083
 meet+0.081
 play+0.079
 borrow+0.079
Neg logit contributions
aging-0.073
 impressed-0.073
ower-0.070
outing-0.069
iring-0.067
Token finds
Token ID7228
Feature activation-1.85e+00

Pos logit contributions
|+0.027
 meets+0.023
 borrow+0.022
 meet+0.022
 fits+0.021
Neg logit contributions
aging-0.023
ower-0.023
 sudden-0.022
 impressed-0.022
broken-0.022
Token borrowed
Token ID22546
Feature activation-2.46e+00

Pos logit contributions
 play+0.015
|+0.014
 plays+0.014
 meets+0.014
 meet+0.013
Neg logit contributions
ower-0.023
outing-0.022
aging-0.021
uttering-0.021
broken-0.021
Token her
Token ID607
Feature activation-2.37e+00

Pos logit contributions
|+0.052
 meets+0.047
 play+0.046
 plays+0.045
 meet+0.045
Neg logit contributions
 impressed-0.054
aging-0.052
 pleased-0.051
broken-0.051
ower-0.051
Token mom
Token ID1995
Feature activation+1.03e+00

Pos logit contributions
|+0.050
 play+0.048
 meets+0.047
 meet+0.046
 plays+0.045
Neg logit contributions
 impressed-0.051
outing-0.049
aging-0.049
ower-0.049
broken-0.049
Token's
Token ID338
Feature activation-6.40e-01

Pos logit contributions
 impressed+0.026
aging+0.025
broken+0.025
ower+0.025
outing+0.025
Neg logit contributions
|-0.018
 meets-0.017
 play-0.016
 meet-0.016
 plays-0.016
Token magn
Token ID7842
Feature activation-9.00e-01

Pos logit contributions
|+0.014
 meets+0.013
 play+0.013
 plays+0.012
 meet+0.012
Neg logit contributions
 impressed-0.013
aging-0.013
ower-0.013
broken-0.013
 pleased-0.013
Tokenifying
Token ID4035
Feature activation-7.52e+00

Pos logit contributions
|+0.020
 meets+0.018
 play+0.018
 meet+0.018
 plays+0.018
Neg logit contributions
 impressed-0.019
aging-0.018
ower-0.018
broken-0.018
 pleased-0.018
Token glass
Token ID5405
Feature activation-8.60e-01

Pos logit contributions
|+0.161
 meets+0.151
 meet+0.145
 plays+0.142
 play+0.140
Neg logit contributions
 impressed-0.150
aging-0.146
 pleased-0.145
ower-0.144
broken-0.144
Token to
Token ID284
Feature activation-3.06e+00

Pos logit contributions
|+0.022
 meets+0.020
 meet+0.020
 play+0.020
 plays+0.020
Neg logit contributions
 impressed-0.015
aging-0.015
ower-0.014
broken-0.014
outing-0.014
Token take
Token ID1011
Feature activation-3.99e+00

Pos logit contributions
|+0.049
 meets+0.043
 plays+0.041
 fits+0.039
 play+0.038
Neg logit contributions
 impressed-0.082
aging-0.080
 pleased-0.079
ower-0.078
broken-0.078
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
|+0.087
 meets+0.084
 fits+0.076
 plays+0.076
 meet+0.073
Neg logit contributions
aging-0.080
 knowing-0.074
 impressed-0.072
 pleased-0.072
 sudden-0.070
Token closer
Token ID5699
Feature activation-4.95e-01

Pos logit contributions
|+0.020
 meets+0.019
 plays+0.018
 play+0.018
 meet+0.018
Neg logit contributions
 impressed-0.025
aging-0.024
broken-0.024
ower-0.024
 pleased-0.024
Token made
Token ID925
Feature activation-1.37e+00

Pos logit contributions
|+0.010
 meets+0.009
 play+0.009
 meet+0.009
 plays+0.009
Neg logit contributions
 impressed-0.008
aging-0.008
broken-0.007
ower-0.007
 pleased-0.007
Token of
Token ID286
Feature activation-1.26e+00

Pos logit contributions
|+0.031
 meets+0.028
 meet+0.026
 plays+0.026
 play+0.026
Neg logit contributions
 impressed-0.030
 pleased-0.029
aging-0.028
 knowing-0.027
broken-0.027
Token dark
Token ID3223
Feature activation+1.74e+00

Pos logit contributions
|+0.029
 meets+0.026
 meet+0.025
 plays+0.025
 play+0.025
Neg logit contributions
 impressed-0.027
aging-0.025
 pleased-0.025
broken-0.025
 knowing-0.024
Token yarn
Token ID21181
Feature activation-3.65e-01

Pos logit contributions
 impressed+0.034
broken+0.034
aging+0.034
ower+0.033
 sudden+0.032
Neg logit contributions
|-0.039
 meets-0.037
 meet-0.036
 borrow-0.035
 play-0.035
Token.
Token ID13
Feature activation-1.99e+00

Pos logit contributions
|+0.010
 meets+0.009
 meet+0.008
 plays+0.008
Ready+0.008
Neg logit contributions
 impressed-0.006
aging-0.006
broken-0.006
 knowing-0.006
ower-0.006
Token<|endoftext|>
Token ID50256
Feature activation-7.51e+00

Pos logit contributions
|+0.037
 meets+0.034
 play+0.032
 plays+0.032
 meet+0.031
Neg logit contributions
 impressed-0.048
 pleased-0.046
ower-0.046
aging-0.046
 mesmer-0.045
TokenOnce
Token ID7454
Feature activation-2.66e-01

Pos logit contributions
 borrow+0.181
 shares+0.179
 meets+0.178
 fits+0.177
 plays+0.175
Neg logit contributions
 impressed-0.097
 mesmer-0.088
 pleased-0.087
ower-0.085
 sudden-0.084
Token there
Token ID612
Feature activation-4.69e+00

Pos logit contributions
|+0.007
 meets+0.006
 play+0.006
 meet+0.006
 plays+0.006
Neg logit contributions
 impressed-0.005
aging-0.005
broken-0.005
ower-0.005
outing-0.004
Token was
Token ID373
Feature activation+8.42e-01

Pos logit contributions
|+0.095
 meets+0.091
 meet+0.087
 plays+0.086
 borrow+0.085
Neg logit contributions
 impressed-0.100
 pleased-0.096
aging-0.093
ower-0.092
broken-0.092
Token a
Token ID257
Feature activation+2.12e-01

Pos logit contributions
outing+0.020
 impressed+0.020
broken+0.020
ower+0.020
aging+0.019
Neg logit contributions
|-0.016
 play-0.016
 plays-0.015
 meet-0.015
 meets-0.014
Token boy
Token ID2933
Feature activation-1.10e-01

Pos logit contributions
aging+0.004
outing+0.003
iously+0.003
broken+0.003
ower+0.003
Neg logit contributions
 play-0.006
|-0.005
 meet-0.005
 plays-0.005
 meets-0.005
Token full
Token ID1336
Feature activation-8.71e-01

Pos logit contributions
outing+0.033
ower+0.032
broken+0.031
aging+0.031
 impressed+0.031
Neg logit contributions
 play-0.043
|-0.043
 plays-0.042
 meets-0.041
 meet-0.041
Token.
Token ID13
Feature activation-2.26e+00

Pos logit contributions
|+0.016
 play+0.016
 plays+0.015
 meets+0.015
 meet+0.015
Neg logit contributions
 impressed-0.020
aging-0.020
ower-0.020
broken-0.019
outing-0.019
Token The
Token ID383
Feature activation+2.54e-02

Pos logit contributions
|+0.043
 meets+0.039
 play+0.037
 meet+0.037
 plays+0.036
Neg logit contributions
 impressed-0.054
ower-0.052
aging-0.052
 pleased-0.051
outing-0.050
Token end
Token ID886
Feature activation+1.24e+00

Pos logit contributions
 impressed+0.000
aging+0.000
ower+0.000
outing+0.000
broken+0.000
Neg logit contributions
|-0.001
 play-0.001
 meets-0.001
 meet-0.001
 plays-0.001
Token!
Token ID0
Feature activation-3.06e+00

Pos logit contributions
 impressed+0.023
ower+0.022
aging+0.022
broken+0.022
outing+0.022
Neg logit contributions
|-0.030
 meets-0.028
 meet-0.027
 borrow-0.026
 plays-0.026
Token<|endoftext|>
Token ID50256
Feature activation-7.46e+00

Pos logit contributions
|+0.058
 meets+0.055
 play+0.053
 meet+0.053
 plays+0.052
Neg logit contributions
 impressed-0.070
 pleased-0.067
aging-0.067
ower-0.067
broken-0.066
TokenOnce
Token ID7454
Feature activation-3.27e-01

Pos logit contributions
 meets+0.174
|+0.171
 shares+0.171
 borrow+0.171
 plays+0.171
Neg logit contributions
 impressed-0.104
ower-0.096
 pleased-0.095
aging-0.095
 sudden-0.094
Token there
Token ID612
Feature activation-4.86e+00

Pos logit contributions
|+0.008
 meets+0.007
 play+0.007
 meet+0.007
 plays+0.007
Neg logit contributions
 impressed-0.006
aging-0.006
broken-0.006
ower-0.006
outing-0.006
Token was
Token ID373
Feature activation+7.31e-01

Pos logit contributions
|+0.099
 meets+0.094
 meet+0.090
 plays+0.089
 borrow+0.088
Neg logit contributions
 impressed-0.103
 pleased-0.098
aging-0.096
ower-0.095
broken-0.095
Token a
Token ID257
Feature activation+1.84e-01

Pos logit contributions
outing+0.018
 impressed+0.018
broken+0.018
ower+0.018
aging+0.017
Neg logit contributions
|-0.013
 play-0.013
 plays-0.012
 meet-0.012
 meets-0.012
Token happy
Token ID3772
Feature activation+2.50e-01

Pos logit contributions
aging+0.003
outing+0.003
broken+0.003
iously+0.003
 impressed+0.003
Neg logit contributions
 play-0.005
|-0.005
 plays-0.005
 meet-0.005
 meets-0.004
Token She
Token ID1375
Feature activation-4.38e-01

Pos logit contributions
|+0.023
 play+0.022
 meets+0.022
 meet+0.022
 plays+0.021
Neg logit contributions
 impressed-0.033
broken-0.033
aging-0.033
outing-0.032
ower-0.032
Token takes
Token ID2753
Feature activation-1.93e+00

Pos logit contributions
|+0.009
 meet+0.007
uggle+0.007
 play+0.007
 meets+0.006
Neg logit contributions
 impressed-0.011
aging-0.010
 pleased-0.010
 mesmer-0.010
outing-0.010
Token out
Token ID503
Feature activation+3.97e+00

Pos logit contributions
|+0.047
 meets+0.042
 meet+0.040
 plays+0.038
 play+0.038
Neg logit contributions
aging-0.038
 impressed-0.035
broken-0.033
 pleased-0.033
ower-0.033
Token a
Token ID257
Feature activation-4.65e-01

Pos logit contributions
ower+0.099
 impressed+0.097
broken+0.095
outing+0.095
 pleased+0.094
Neg logit contributions
|-0.068
 play-0.067
 plays-0.064
 borrow-0.062
 meet-0.062
Token magn
Token ID7842
Feature activation+1.87e-01

Pos logit contributions
|+0.011
 meets+0.010
 plays+0.010
 meet+0.010
 play+0.010
Neg logit contributions
 impressed-0.009
 pleased-0.009
aging-0.008
ower-0.008
broken-0.008
Tokenifying
Token ID4035
Feature activation-7.44e+00

Pos logit contributions
 impressed+0.004
aging+0.004
ower+0.004
broken+0.004
 pleased+0.004
Neg logit contributions
|-0.004
 meets-0.004
 play-0.004
 meet-0.004
 plays-0.004
Token glass
Token ID5405
Feature activation+3.06e-01

Pos logit contributions
|+0.148
 meets+0.140
 meet+0.135
 plays+0.134
 play+0.132
Neg logit contributions
 impressed-0.163
broken-0.157
 pleased-0.156
aging-0.156
ower-0.155
Token.
Token ID13
Feature activation-2.58e+00

Pos logit contributions
 impressed+0.006
aging+0.006
ower+0.006
broken+0.006
outing+0.005
Neg logit contributions
|-0.007
 meets-0.007
 play-0.007
 meet-0.007
 plays-0.007
Token She
Token ID1375
Feature activation-9.01e-01

Pos logit contributions
|+0.047
 meets+0.043
 play+0.041
 meet+0.041
 plays+0.040
Neg logit contributions
 impressed-0.064
aging-0.062
ower-0.062
outing-0.061
broken-0.061
Token looks
Token ID3073
Feature activation+1.82e+00

Pos logit contributions
|+0.017
 meet+0.013
 play+0.013
uggle+0.013
 meets+0.013
Neg logit contributions
 impressed-0.023
aging-0.022
 mesmer-0.021
 pleased-0.021
outing-0.021
Token at
Token ID379
Feature activation-1.68e+00

Pos logit contributions
ower+0.039
aging+0.037
broken+0.037
outing+0.036
oles+0.035
Neg logit contributions
 play-0.039
 meets-0.038
 plays-0.038
|-0.038
 meet-0.036
Token love
Token ID1842
Feature activation+2.19e-02

Pos logit contributions
|+0.028
 meets+0.025
 meet+0.024
 play+0.023
 plays+0.023
Neg logit contributions
 impressed-0.024
aging-0.023
 pleased-0.023
ower-0.022
outing-0.022
Token."
Token ID526
Feature activation+5.52e-01

Pos logit contributions
 impressed+0.000
aging+0.000
ower+0.000
outing+0.000
 pleased+0.000
Neg logit contributions
|-0.001
 meets-0.000
 meet-0.000
 play-0.000
 plays-0.000
Token<|endoftext|>
Token ID50256
Feature activation-6.33e+00

Pos logit contributions
 impressed+0.011
 pleased+0.011
aging+0.010
ower+0.010
 knowing+0.010
Neg logit contributions
|-0.012
 meets-0.011
 fits-0.011
 meet-0.011
 plays-0.011
Token<|endoftext|>
Token ID50256
Feature activation-6.88e+00

Pos logit contributions
 borrow+0.147
 fits+0.145
 build+0.143
 shares+0.141
 meet+0.141
Neg logit contributions
 impressed-0.079
oles-0.076
 sudden-0.074
ower-0.073
ment-0.073
Token<|endoftext|>
Token ID50256
Feature activation-7.21e+00

Pos logit contributions
 fits+0.155
 borrow+0.150
 shares+0.149
 meets+0.148
 occasions+0.145
Neg logit contributions
 impressed-0.098
 sudden-0.097
 mesmer-0.090
oles-0.090
ower-0.089
Token<|endoftext|>
Token ID50256
Feature activation-7.44e+00

Pos logit contributions
 fits+0.155
 shares+0.148
 borrow+0.148
 meets+0.146
 plays+0.143
Neg logit contributions
 sudden-0.117
 impressed-0.115
 mesmer-0.107
ised-0.105
Suddenly-0.105
TokenAnna
Token ID31160
Feature activation-1.38e+00

Pos logit contributions
 fits+0.154
 shares+0.147
 borrow+0.146
 meets+0.144
 plays+0.142
Neg logit contributions
 sudden-0.130
 impressed-0.126
 mesmer-0.118
ised-0.116
Suddenly-0.116
Token and
Token ID290
Feature activation-2.61e+00

Pos logit contributions
 play+0.034
|+0.034
 meet+0.033
 plays+0.032
 meets+0.032
Neg logit contributions
 impressed-0.026
outing-0.025
aging-0.024
ower-0.024
broken-0.024
Token Ben
Token ID3932
Feature activation-1.89e+00

Pos logit contributions
|+0.048
 meets+0.045
 plays+0.042
 play+0.042
 meet+0.042
Neg logit contributions
 impressed-0.062
aging-0.061
ower-0.060
 pleased-0.060
broken-0.059
Token are
Token ID389
Feature activation-6.58e-01

Pos logit contributions
|+0.051
 meets+0.048
 play+0.048
 meet+0.047
 plays+0.047
Neg logit contributions
 impressed-0.029
aging-0.028
ower-0.027
broken-0.027
outing-0.027
Token playing
Token ID2712
Feature activation-2.70e+00

Pos logit contributions
 play+0.014
|+0.013
 meet+0.013
 meets+0.013
 plays+0.013
Neg logit contributions
broken-0.014
 impressed-0.014
outing-0.014
ower-0.013
edly-0.013
Token felt
Token ID2936
Feature activation+3.16e+00

Pos logit contributions
|+0.010
 meets+0.009
 meet+0.009
 play+0.008
 borrow+0.008
Neg logit contributions
 impressed-0.009
broken-0.008
 pleased-0.008
aging-0.008
outing-0.008
Token very
Token ID845
Feature activation+4.30e+00

Pos logit contributions
ower+0.050
aging+0.048
outing+0.048
broken+0.045
king+0.043
Neg logit contributions
|-0.085
 play-0.084
 meet-0.081
 meets-0.081
 plays-0.081
Token happy
Token ID3772
Feature activation+2.34e+00

Pos logit contributions
aging+0.064
ower+0.062
broken+0.062
outing+0.061
king+0.061
Neg logit contributions
 play-0.116
 meet-0.111
 meets-0.109
|-0.109
 plays-0.108
Token too
Token ID1165
Feature activation+1.68e+00

Pos logit contributions
 impressed+0.041
aging+0.040
ower+0.040
outing+0.040
broken+0.040
Neg logit contributions
|-0.057
 play-0.055
 meets-0.054
 plays-0.054
 meet-0.053
Token!
Token ID0
Feature activation-2.39e+00

Pos logit contributions
outing+0.049
aging+0.047
broken+0.047
aver+0.046
 impressed+0.046
Neg logit contributions
 play-0.023
 meet-0.020
|-0.019
 have-0.019
 plays-0.019
Token<|endoftext|>
Token ID50256
Feature activation-7.42e+00

Pos logit contributions
|+0.050
 meets+0.048
 borrow+0.046
 play+0.046
 plays+0.045
Neg logit contributions
 impressed-0.047
ower-0.046
 mesmer-0.045
 pleased-0.044
outing-0.044
TokenOnce
Token ID7454
Feature activation-3.07e-01

Pos logit contributions
 meets+0.173
 borrow+0.172
 shares+0.172
 plays+0.170
 fits+0.170
Neg logit contributions
 impressed-0.104
 pleased-0.093
 mesmer-0.092
ower-0.092
 knowing-0.090
Token upon
Token ID2402
Feature activation-2.56e+00

Pos logit contributions
|+0.008
 meets+0.007
 play+0.007
 meet+0.007
 plays+0.007
Neg logit contributions
 impressed-0.005
aging-0.005
broken-0.005
ower-0.005
outing-0.005
Token a
Token ID257
Feature activation+8.37e-01

Pos logit contributions
|+0.059
 meets+0.055
 play+0.054
 meet+0.054
 plays+0.053
Neg logit contributions
 impressed-0.050
aging-0.048
broken-0.047
ower-0.047
 pleased-0.047
Token time
Token ID640
Feature activation+1.53e+00

Pos logit contributions
aging+0.022
 impressed+0.022
broken+0.022
outing+0.021
iring+0.021
Neg logit contributions
|-0.016
 play-0.014
 meet-0.013
 plays-0.013
 meets-0.013
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
 impressed+0.028
broken+0.028
aging+0.028
outing+0.027
ower+0.027
Neg logit contributions
|-0.037
 play-0.035
 meets-0.034
 plays-0.034
 meet-0.034
Token could
Token ID714
Feature activation-2.31e+00

Pos logit contributions
|+0.049
 meet+0.044
 meets+0.044
 play+0.043
 borrow+0.043
Neg logit contributions
 impressed-0.045
broken-0.043
aging-0.043
 pleased-0.043
ower-0.042
Token help
Token ID1037
Feature activation-2.75e+00

Pos logit contributions
|+0.033
 meets+0.028
 plays+0.024
Ready+0.024
 meet+0.023
Neg logit contributions
 impressed-0.067
aging-0.067
ower-0.066
 pleased-0.065
broken-0.065
Token make
Token ID787
Feature activation-4.35e+00

Pos logit contributions
|+0.050
 meets+0.046
 play+0.046
 meet+0.045
 plays+0.044
Neg logit contributions
 impressed-0.067
aging-0.066
ower-0.065
broken-0.065
outing-0.065
Token dinner
Token ID8073
Feature activation+6.96e-01

Pos logit contributions
|+0.095
 meets+0.086
 meet+0.084
 play+0.084
 plays+0.082
Neg logit contributions
 impressed-0.094
aging-0.091
 pleased-0.089
ower-0.088
broken-0.088
Token.
Token ID13
Feature activation-2.09e+00

Pos logit contributions
 impressed+0.014
 pleased+0.013
aging+0.013
broken+0.013
ower+0.013
Neg logit contributions
|-0.017
 meets-0.015
 meet-0.015
 plays-0.015
 play-0.015
Token<|endoftext|>
Token ID50256
Feature activation-7.42e+00

Pos logit contributions
|+0.044
 meets+0.039
 meet+0.038
 play+0.038
 borrow+0.037
Neg logit contributions
 impressed-0.047
 pleased-0.045
ower-0.043
aging-0.043
 knowing-0.042
TokenOnce
Token ID7454
Feature activation-2.61e-01

Pos logit contributions
 shares+0.183
 fits+0.183
 borrow+0.183
 meets+0.180
 plays+0.179
Neg logit contributions
 impressed-0.088
 mesmer-0.079
 pleased-0.078
 sudden-0.077
aging-0.075
Token upon
Token ID2402
Feature activation-2.87e+00

Pos logit contributions
|+0.007
 meets+0.006
 play+0.006
 meet+0.006
 plays+0.006
Neg logit contributions
 impressed-0.004
aging-0.004
ower-0.004
broken-0.004
outing-0.004
Token a
Token ID257
Feature activation+8.83e-01

Pos logit contributions
|+0.067
 meets+0.064
 play+0.061
 meet+0.061
 plays+0.061
Neg logit contributions
 impressed-0.054
aging-0.052
broken-0.051
ower-0.051
 pleased-0.051
Token time
Token ID640
Feature activation+1.20e+00

Pos logit contributions
aging+0.023
 impressed+0.023
broken+0.022
outing+0.022
ower+0.021
Neg logit contributions
|-0.017
 play-0.015
 meet-0.014
 plays-0.014
 meets-0.014
Token there
Token ID612
Feature activation-4.67e+00

Pos logit contributions
broken+0.023
 impressed+0.023
aging+0.023
outing+0.022
 pleased+0.022
Neg logit contributions
|-0.028
 play-0.026
 meets-0.025
 meet-0.025
 plays-0.025
Token was
Token ID373
Feature activation+1.61e+00

Pos logit contributions
|+0.014
 meet+0.013
 meets+0.012
 play+0.012
Ready+0.012
Neg logit contributions
 impressed-0.015
broken-0.015
aging-0.015
 pleased-0.015
outing-0.014
Token in
Token ID287
Feature activation-2.59e-01

Pos logit contributions
 impressed+0.028
 pleased+0.027
aging+0.026
broken+0.025
 knowing+0.025
Neg logit contributions
|-0.044
 meets-0.039
 plays-0.038
 play-0.038
 meet-0.038
Token good
Token ID922
Feature activation+6.89e-01

Pos logit contributions
|+0.005
 meets+0.005
 play+0.005
 meet+0.005
 plays+0.005
Neg logit contributions
 impressed-0.006
aging-0.006
ower-0.005
broken-0.005
outing-0.005
Token hands
Token ID2832
Feature activation+7.46e-01

Pos logit contributions
aging+0.016
broken+0.016
 impressed+0.016
ower+0.015
 pleased+0.015
Neg logit contributions
|-0.013
 meets-0.012
 meet-0.012
 play-0.011
Ready-0.011
Token.
Token ID13
Feature activation-2.27e+00

Pos logit contributions
 impressed+0.011
broken+0.010
aging+0.010
ower+0.010
 pleased+0.010
Neg logit contributions
|-0.021
 meets-0.020
 meet-0.019
 play-0.019
 plays-0.019
Token<|endoftext|>
Token ID50256
Feature activation-7.39e+00

Pos logit contributions
|+0.049
 meets+0.045
 play+0.045
 plays+0.044
 borrow+0.044
Neg logit contributions
 impressed-0.047
 pleased-0.044
ower-0.043
aging-0.043
 knowing-0.042
TokenOnce
Token ID7454
Feature activation-3.75e-01

Pos logit contributions
 fits+0.162
|+0.162
 meets+0.162
 plays+0.162
 borrow+0.160
Neg logit contributions
 impressed-0.124
 pleased-0.112
 mesmer-0.110
 knowing-0.109
ower-0.108
Token upon
Token ID2402
Feature activation-2.80e+00

Pos logit contributions
|+0.009
 meets+0.008
 play+0.008
 meet+0.008
 plays+0.008
Neg logit contributions
 impressed-0.007
aging-0.007
broken-0.007
ower-0.007
outing-0.007
Token a
Token ID257
Feature activation+7.81e-01

Pos logit contributions
|+0.062
 meets+0.058
 play+0.058
 meet+0.057
 plays+0.056
Neg logit contributions
 impressed-0.057
aging-0.055
ower-0.055
broken-0.055
outing-0.054
Token time
Token ID640
Feature activation+8.38e-01

Pos logit contributions
aging+0.021
 impressed+0.020
broken+0.020
outing+0.020
ower+0.020
Neg logit contributions
|-0.015
 play-0.013
 meet-0.012
 plays-0.012
 meets-0.012
Token there
Token ID612
Feature activation-4.79e+00

Pos logit contributions
 impressed+0.015
broken+0.015
aging+0.014
outing+0.014
ower+0.014
Neg logit contributions
|-0.021
 play-0.020
 meets-0.019
 meet-0.019
 plays-0.019
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 impressed+0.099
aging+0.097
broken+0.095
ower+0.095
outing+0.095
Neg logit contributions
|-0.077
 meets-0.070
 play-0.069
 meet-0.068
 plays-0.067
Token necklace
Token ID38987
Feature activation-4.44e-01

Pos logit contributions
|+0.036
 meets+0.032
 plays+0.030
 meet+0.030
 borrow+0.029
Neg logit contributions
aging-0.034
 impressed-0.033
ower-0.033
broken-0.033
 pleased-0.033
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
|+0.012
 play+0.012
 meets+0.011
 meet+0.011
 borrow+0.011
Neg logit contributions
 impressed-0.007
aging-0.006
outing-0.006
 pleased-0.006
broken-0.006
Token the
Token ID262
Feature activation-8.25e-01

Pos logit contributions
 impressed+0.026
aging+0.025
broken+0.025
 pleased+0.024
ower+0.024
Neg logit contributions
|-0.031
 meets-0.028
 meet-0.027
 play-0.027
 plays-0.026
Token magn
Token ID7842
Feature activation-7.56e-02

Pos logit contributions
|+0.018
 meets+0.016
 plays+0.015
 meet+0.015
 borrow+0.015
Neg logit contributions
 impressed-0.017
aging-0.017
 pleased-0.017
ower-0.017
broken-0.017
Tokenifying
Token ID4035
Feature activation-7.39e+00

Pos logit contributions
|+0.002
 meets+0.002
 play+0.002
 meet+0.002
 plays+0.002
Neg logit contributions
 impressed-0.001
aging-0.001
broken-0.001
ower-0.001
outing-0.001
Token glass
Token ID5405
Feature activation+1.73e-01

Pos logit contributions
|+0.144
 meets+0.137
 meet+0.131
 plays+0.130
 play+0.129
Neg logit contributions
 impressed-0.162
broken-0.157
aging-0.155
 pleased-0.155
ower-0.155
Token and
Token ID290
Feature activation+8.80e-01

Pos logit contributions
 impressed+0.003
aging+0.003
ower+0.003
outing+0.003
broken+0.003
Neg logit contributions
|-0.004
 meets-0.004
 meet-0.004
 play-0.004
 plays-0.004
Token started
Token ID2067
Feature activation+7.63e-01

Pos logit contributions
 impressed+0.018
aging+0.018
broken+0.018
ower+0.017
 pleased+0.017
Neg logit contributions
|-0.019
 meets-0.018
 play-0.017
 meet-0.017
 plays-0.017
Token to
Token ID284
Feature activation-2.00e+00

Pos logit contributions
 impressed+0.018
aging+0.018
broken+0.017
ower+0.017
 pleased+0.017
Neg logit contributions
|-0.015
 meets-0.013
 meet-0.013
 play-0.013
 plays-0.012
Token examine
Token ID10716
Feature activation-2.74e+00

Pos logit contributions
|+0.033
 meets+0.033
 Welcome+0.027
Ready+0.027
 fits+0.025
Neg logit contributions
 impressed-0.049
aging-0.048
broken-0.047
 mesmer-0.047
 pleased-0.046
Token was
Token ID373
Feature activation+1.62e+00

Pos logit contributions
 impressed+0.016
aging+0.014
broken+0.014
 pleased+0.014
 mesmer+0.013
Neg logit contributions
|-0.019
Ready-0.016
 meet-0.016
 borrow-0.015
uggle-0.015
Token not
Token ID407
Feature activation+1.53e+00

Pos logit contributions
ower+0.028
outing+0.028
aging+0.028
 impressed+0.027
broken+0.027
Neg logit contributions
|-0.040
 meets-0.039
 play-0.039
 meet-0.038
 plays-0.038
Token lonely
Token ID21757
Feature activation+4.59e-01

Pos logit contributions
ower+0.024
outing+0.024
aging+0.023
broken+0.022
 impressed+0.022
Neg logit contributions
|-0.040
 play-0.039
 meets-0.038
 meet-0.038
 plays-0.037
Token anymore
Token ID7471
Feature activation+9.01e-01

Pos logit contributions
 impressed+0.011
aging+0.011
broken+0.011
ower+0.011
 pleased+0.011
Neg logit contributions
|-0.008
 meets-0.008
 meet-0.007
 play-0.007
 plays-0.007
Token.
Token ID13
Feature activation-3.01e+00

Pos logit contributions
 impressed+0.019
aging+0.019
ower+0.019
broken+0.019
outing+0.019
Neg logit contributions
|-0.019
 meets-0.018
 play-0.017
 meet-0.017
 plays-0.017
Token<|endoftext|>
Token ID50256
Feature activation-7.39e+00

Pos logit contributions
 meets+0.065
|+0.064
 borrow+0.060
 fits+0.060
 plays+0.059
Neg logit contributions
 impressed-0.057
ower-0.054
 pleased-0.053
oles-0.052
iring-0.051
TokenOnce
Token ID7454
Feature activation-1.27e-01

Pos logit contributions
|+0.164
 meets+0.164
 fits+0.164
 plays+0.163
 shares+0.161
Neg logit contributions
 impressed-0.118
aging-0.107
ower-0.106
 pleased-0.106
 mesmer-0.104
Token upon
Token ID2402
Feature activation-2.86e+00

Pos logit contributions
|+0.003
 meets+0.003
 play+0.003
 meet+0.003
 plays+0.003
Neg logit contributions
 impressed-0.002
aging-0.002
broken-0.002
ower-0.002
outing-0.002
Token a
Token ID257
Feature activation+6.99e-01

Pos logit contributions
|+0.068
 meets+0.064
 plays+0.062
 meet+0.062
 play+0.062
Neg logit contributions
 impressed-0.053
aging-0.050
broken-0.050
 pleased-0.049
ower-0.049
Token time
Token ID640
Feature activation+1.24e+00

Pos logit contributions
aging+0.018
 impressed+0.018
broken+0.018
outing+0.017
 pleased+0.017
Neg logit contributions
|-0.013
 play-0.012
 meet-0.011
 plays-0.011
 meets-0.011
Token,
Token ID11
Feature activation+8.30e-01

Pos logit contributions
broken+0.024
 impressed+0.024
aging+0.023
outing+0.023
 pleased+0.022
Neg logit contributions
|-0.030
 play-0.028
 meets-0.027
 meet-0.027
 borrow-0.026
Token her
Token ID607
Feature activation+1.70e-02

Pos logit contributions
|+0.040
 meets+0.037
 play+0.035
 plays+0.035
 meet+0.035
Neg logit contributions
 impressed-0.042
aging-0.041
 pleased-0.040
ower-0.040
outing-0.040
Token feel
Token ID1254
Feature activation+2.21e+00

Pos logit contributions
 impressed+0.000
aging+0.000
broken+0.000
 pleased+0.000
ower+0.000
Neg logit contributions
|-0.000
 meets-0.000
 play-0.000
 meet-0.000
 plays-0.000
Token extra
Token ID3131
Feature activation+3.07e+00

Pos logit contributions
ower+0.043
outing+0.043
aging+0.043
broken+0.040
uff+0.039
Neg logit contributions
|-0.050
 play-0.050
 plays-0.048
 meets-0.048
 meet-0.048
Token special
Token ID2041
Feature activation-4.44e-03

Pos logit contributions
ower+0.048
outing+0.048
aging+0.048
 impressed+0.047
broken+0.046
Neg logit contributions
|-0.079
 play-0.078
 meets-0.076
 plays-0.076
 meet-0.075
Token.
Token ID13
Feature activation-2.71e+00

Pos logit contributions
|+0.000
 meets+0.000
 meet+0.000
 play+0.000
 plays+0.000
Neg logit contributions
 impressed-0.000
aging-0.000
 pleased-0.000
broken-0.000
ower-0.000
Token<|endoftext|>
Token ID50256
Feature activation-7.39e+00

Pos logit contributions
|+0.052
 meets+0.048
 play+0.047
 plays+0.046
 borrow+0.046
Neg logit contributions
 impressed-0.063
ower-0.060
aging-0.060
 pleased-0.060
outing-0.059
TokenOnce
Token ID7454
Feature activation-3.37e-01

Pos logit contributions
 meets+0.171
 borrow+0.171
 shares+0.170
 fits+0.169
 plays+0.169
Neg logit contributions
 impressed-0.107
 pleased-0.096
 mesmer-0.095
ower-0.094
 knowing-0.093
Token upon
Token ID2402
Feature activation-2.65e+00

Pos logit contributions
|+0.008
 meets+0.008
 play+0.008
 meet+0.008
 plays+0.008
Neg logit contributions
 impressed-0.006
aging-0.006
broken-0.006
ower-0.006
outing-0.006
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
|+0.060
 meets+0.057
 play+0.056
 meet+0.055
 plays+0.055
Neg logit contributions
 impressed-0.052
aging-0.050
ower-0.050
broken-0.049
outing-0.049
Token time
Token ID640
Feature activation+1.04e+00

Pos logit contributions
aging+0.028
 impressed+0.028
broken+0.027
outing+0.027
iring+0.027
Neg logit contributions
|-0.020
 play-0.018
 meet-0.016
 plays-0.016
 meets-0.016
Token,
Token ID11
Feature activation+6.03e-01

Pos logit contributions
 impressed+0.018
aging+0.018
broken+0.017
outing+0.017
ower+0.017
Neg logit contributions
|-0.027
 play-0.025
 meets-0.025
 meet-0.024
 plays-0.024
Token
Token ID198
Feature activation+2.51e+00

Pos logit contributions
|+0.051
 meets+0.049
 borrow+0.048
 play+0.048
 plays+0.047
Neg logit contributions
 impressed-0.049
 pleased-0.047
ower-0.046
aging-0.045
 mesmer-0.045
Token
Token ID198
Feature activation+2.19e+00

Pos logit contributions
 impressed+0.053
aging+0.052
broken+0.052
ower+0.052
outing+0.052
Neg logit contributions
|-0.054
 meets-0.050
 play-0.049
 meet-0.048
 plays-0.048
TokenThe
Token ID464
Feature activation+9.15e-02

Pos logit contributions
aging+0.037
 impressed+0.037
ower+0.036
outing+0.036
broken+0.036
Neg logit contributions
|-0.057
 play-0.052
 meets-0.052
 meet-0.052
 plays-0.050
Token End
Token ID5268
Feature activation+1.13e-02

Pos logit contributions
 impressed+0.001
 pleased+0.001
ower+0.001
aging+0.001
broken+0.001
Neg logit contributions
|-0.002
 meets-0.002
 plays-0.002
 borrow-0.002
 meet-0.002
Token.
Token ID13
Feature activation-3.04e+00

Pos logit contributions
 impressed+0.000
aging+0.000
broken+0.000
outing+0.000
ower+0.000
Neg logit contributions
|-0.000
 play-0.000
 meets-0.000
 plays-0.000
 meet-0.000
Token<|endoftext|>
Token ID50256
Feature activation-7.38e+00

Pos logit contributions
|+0.058
 meets+0.057
 play+0.056
 plays+0.055
 fits+0.053
Neg logit contributions
 impressed-0.067
 pleased-0.065
ower-0.064
aging-0.063
broken-0.062
TokenTodd
Token ID42817
Feature activation+1.39e+00

Pos logit contributions
 fits+0.175
 shares+0.174
 borrow+0.173
 meets+0.172
 plays+0.170
Neg logit contributions
 impressed-0.096
 mesmer-0.087
 sudden-0.087
ower-0.087
 pleased-0.086
Token was
Token ID373
Feature activation+1.36e+00

Pos logit contributions
 impressed+0.023
aging+0.022
broken+0.022
ower+0.022
 pleased+0.022
Neg logit contributions
|-0.036
 meets-0.033
 play-0.033
 meet-0.032
 plays-0.032
Token so
Token ID523
Feature activation+3.26e+00

Pos logit contributions
outing+0.032
ower+0.031
oles+0.031
broken+0.030
edly+0.030
Neg logit contributions
 play-0.026
|-0.025
 plays-0.024
 meets-0.024
 meet-0.024
Token excited
Token ID6568
Feature activation+2.83e+00

Pos logit contributions
king+0.050
broken+0.048
outing+0.044
ower+0.044
edly+0.044
Neg logit contributions
 play-0.092
 plays-0.085
|-0.084
 meet-0.083
 meets-0.083
Token!
Token ID0
Feature activation-3.69e+00

Pos logit contributions
 impressed+0.062
aging+0.060
broken+0.060
ower+0.060
outing+0.059
Neg logit contributions
|-0.058
 meets-0.054
 play-0.054
 plays-0.053
 meet-0.053
Token
Token ID198
Feature activation+2.39e+00

Pos logit contributions
|+0.041
 meets+0.037
 meet+0.035
 play+0.035
 plays+0.034
Neg logit contributions
 impressed-0.044
 pleased-0.042
ower-0.040
aging-0.040
 knowing-0.040
Token
Token ID198
Feature activation+1.92e+00

Pos logit contributions
 impressed+0.058
aging+0.056
ower+0.055
broken+0.055
 pleased+0.055
Neg logit contributions
|-0.044
 meets-0.041
 play-0.040
 plays-0.039
 meet-0.039
TokenThe
Token ID464
Feature activation-1.51e-01

Pos logit contributions
aging+0.033
 impressed+0.033
ower+0.033
outing+0.032
broken+0.032
Neg logit contributions
|-0.048
 play-0.045
 meets-0.045
 meet-0.044
 plays-0.044
Token End
Token ID5268
Feature activation-4.39e-02

Pos logit contributions
|+0.004
 meets+0.003
 play+0.003
 meet+0.003
 plays+0.003
Neg logit contributions
 impressed-0.003
aging-0.003
ower-0.003
broken-0.003
outing-0.003
Token.
Token ID13
Feature activation-3.07e+00

Pos logit contributions
|+0.001
 play+0.001
 meets+0.001
 plays+0.001
 meet+0.001
Neg logit contributions
 impressed-0.001
aging-0.001
broken-0.001
outing-0.001
ower-0.001
Token<|endoftext|>
Token ID50256
Feature activation-7.38e+00

Pos logit contributions
|+0.062
 meets+0.060
 play+0.058
 plays+0.058
 meet+0.056
Neg logit contributions
 impressed-0.066
 pleased-0.063
 knowing-0.061
ower-0.060
aging-0.060
TokenOnce
Token ID7454
Feature activation-2.57e-01

Pos logit contributions
 borrow+0.176
 fits+0.176
 shares+0.174
 meets+0.173
 belong+0.171
Neg logit contributions
 impressed-0.088
 sudden-0.085
 mesmer-0.081
 pleased-0.080
ower-0.079
Token upon
Token ID2402
Feature activation-2.86e+00

Pos logit contributions
|+0.007
 meets+0.006
 play+0.006
 meet+0.006
 plays+0.006
Neg logit contributions
 impressed-0.004
aging-0.004
ower-0.004
outing-0.004
broken-0.004
Token a
Token ID257
Feature activation+6.59e-01

Pos logit contributions
|+0.069
 meets+0.066
 plays+0.063
 meet+0.063
 play+0.062
Neg logit contributions
 impressed-0.052
aging-0.050
broken-0.049
 pleased-0.049
ower-0.048
Token time
Token ID640
Feature activation+1.50e+00

Pos logit contributions
aging+0.017
 impressed+0.017
outing+0.017
broken+0.017
ower+0.016
Neg logit contributions
|-0.013
 play-0.011
 meet-0.010
 plays-0.010
 visit-0.010
Token there
Token ID612
Feature activation-4.57e+00

Pos logit contributions
 impressed+0.026
broken+0.026
aging+0.026
outing+0.025
ower+0.025
Neg logit contributions
|-0.038
 play-0.036
 meets-0.035
 meet-0.035
 plays-0.035
TokenHer
Token ID9360
Feature activation+8.99e-01

Pos logit contributions
 impressed+0.032
aging+0.032
ower+0.031
outing+0.030
broken+0.030
Neg logit contributions
|-0.069
 play-0.065
 meets-0.065
 meet-0.065
 plays-0.064
Token mom
Token ID1995
Feature activation+1.92e+00

Pos logit contributions
 impressed+0.015
aging+0.015
ower+0.015
broken+0.015
outing+0.015
Neg logit contributions
|-0.023
 meets-0.022
 play-0.021
 meet-0.021
 plays-0.021
Token smiled
Token ID13541
Feature activation+3.97e+00

Pos logit contributions
ower+0.042
aging+0.041
 impressed+0.041
outing+0.040
broken+0.040
Neg logit contributions
|-0.039
 play-0.037
 meets-0.037
 meet-0.036
 plays-0.036
Token and
Token ID290
Feature activation+3.59e+00

Pos logit contributions
ower+0.092
oles+0.090
ittens+0.089
outing+0.088
aging+0.087
Neg logit contributions
 play-0.080
 plays-0.071
 join-0.066
 will-0.066
 meet-0.065
Token said
Token ID531
Feature activation+3.50e+00

Pos logit contributions
ower+0.079
outing+0.077
aging+0.077
broken+0.073
iring+0.069
Neg logit contributions
 play-0.075
 plays-0.072
|-0.071
 meet-0.070
 meets-0.070
Token,
Token ID11
Feature activation+6.68e+00

Pos logit contributions
aging+0.061
 impressed+0.061
broken+0.060
ower+0.060
outing+0.059
Neg logit contributions
|-0.084
 play-0.081
 meets-0.080
 plays-0.080
 meet-0.079
Token "
Token ID366
Feature activation-2.95e+00

Pos logit contributions
aging+0.144
broken+0.142
outing+0.138
ower+0.138
 impressed+0.135
Neg logit contributions
 play-0.131
 meet-0.127
|-0.126
 plays-0.123
 meets-0.120
TokenSee
Token ID6214
Feature activation+1.60e+00

Pos logit contributions
|+0.059
 meets+0.055
 play+0.054
 plays+0.053
 meet+0.052
Neg logit contributions
 impressed-0.067
broken-0.064
 pleased-0.064
outing-0.063
ower-0.063
Token,
Token ID11
Feature activation+5.28e+00

Pos logit contributions
 impressed+0.029
aging+0.028
ower+0.028
broken+0.027
outing+0.027
Neg logit contributions
|-0.040
 meets-0.037
 play-0.036
 meet-0.036
 plays-0.036
Token Lily
Token ID20037
Feature activation+2.07e+00

Pos logit contributions
 impressed+0.125
aging+0.119
 pleased+0.119
broken+0.118
ower+0.118
Neg logit contributions
|-0.103
 meets-0.094
 play-0.089
 meet-0.089
 plays-0.088
Token?
Token ID30
Feature activation-2.73e+00

Pos logit contributions
 impressed+0.035
broken+0.034
ower+0.034
aging+0.033
 pleased+0.033
Neg logit contributions
|-0.055
 meets-0.050
 meet-0.048
 play-0.047
Ready-0.046
Token looked
Token ID3114
Feature activation+1.58e+00

Pos logit contributions
ower+0.009
aging+0.009
 impressed+0.009
broken+0.009
outing+0.009
Neg logit contributions
|-0.012
 play-0.012
 meets-0.012
 plays-0.011
 meet-0.011
Token at
Token ID379
Feature activation-1.08e+00

Pos logit contributions
ower+0.035
aging+0.035
outing+0.034
broken+0.033
 impressed+0.033
Neg logit contributions
|-0.031
 play-0.031
 meets-0.030
 plays-0.030
 meet-0.028
Token her
Token ID607
Feature activation-9.07e-01

Pos logit contributions
|+0.020
 meets+0.020
 meet+0.018
 fits+0.018
 plays+0.018
Neg logit contributions
 impressed-0.024
aging-0.024
broken-0.023
 knowing-0.023
 pleased-0.023
Token with
Token ID351
Feature activation-1.57e-01

Pos logit contributions
 play+0.022
 meet+0.020
 plays+0.020
|+0.020
 borrow+0.020
Neg logit contributions
outing-0.016
 impressed-0.016
ower-0.015
aging-0.015
 pleased-0.015
Token big
Token ID1263
Feature activation-4.63e-01

Pos logit contributions
 play+0.003
|+0.003
 meet+0.003
 plays+0.003
 meets+0.003
Neg logit contributions
ower-0.003
outing-0.003
broken-0.003
aging-0.003
 impressed-0.003
Token,
Token ID11
Feature activation+5.46e+00

Pos logit contributions
 play+0.012
 meets+0.011
 plays+0.011
|+0.011
 meet+0.011
Neg logit contributions
outing-0.008
broken-0.008
 impressed-0.008
ower-0.007
aging-0.007
Token brown
Token ID7586
Feature activation-1.41e+00

Pos logit contributions
aging+0.113
outing+0.109
ower+0.108
oles+0.105
uff+0.104
Neg logit contributions
 play-0.121
|-0.112
 plays-0.111
 meet-0.108
 meets-0.106
Token eyes
Token ID2951
Feature activation+2.13e+00

Pos logit contributions
 play+0.035
 plays+0.034
|+0.033
 meet+0.033
 meets+0.033
Neg logit contributions
 impressed-0.026
outing-0.025
 pleased-0.024
broken-0.024
aging-0.022
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
 impressed+0.050
ower+0.049
outing+0.049
 pleased+0.048
iring+0.047
Neg logit contributions
 play-0.038
 plays-0.033
 borrow-0.032
 meet-0.031
 have-0.030
Token said
Token ID531
Feature activation+2.49e+00

Pos logit contributions
outing+0.047
ower+0.046
broken+0.045
iring+0.043
aging+0.043
Neg logit contributions
 play-0.045
 plays-0.043
 meets-0.040
 meet-0.040
 fits-0.039
Token,
Token ID11
Feature activation+6.10e+00

Pos logit contributions
broken+0.046
aging+0.046
ower+0.045
 impressed+0.045
outing+0.045
Neg logit contributions
 play-0.055
|-0.055
 plays-0.053
 meets-0.053
 borrow-0.053
Token to
Token ID284
Feature activation-1.50e+00

Pos logit contributions
 impressed+0.016
aging+0.016
broken+0.016
ower+0.016
 pleased+0.016
Neg logit contributions
|-0.014
 meets-0.013
 play-0.013
 meet-0.013
 plays-0.013
Token playing
Token ID2712
Feature activation-1.81e+00

Pos logit contributions
|+0.030
 meets+0.027
 fits+0.026
 plays+0.025
 borrow+0.025
Neg logit contributions
 impressed-0.035
aging-0.033
 knowing-0.033
 pleased-0.033
 mesmer-0.032
Token in
Token ID287
Feature activation-2.47e+00

Pos logit contributions
|+0.048
 meets+0.044
 meet+0.043
 borrow+0.042
 plays+0.042
Neg logit contributions
 impressed-0.031
 pleased-0.029
aging-0.029
broken-0.028
ower-0.027
Token the
Token ID262
Feature activation-2.98e+00

Pos logit contributions
|+0.054
 meets+0.050
 plays+0.047
 fits+0.047
 meet+0.047
Neg logit contributions
 impressed-0.052
 pleased-0.049
aging-0.048
 mesmer-0.048
 knowing-0.047
Token sand
Token ID6450
Feature activation+4.42e-01

Pos logit contributions
|+0.064
 meets+0.058
 play+0.057
 meet+0.056
 plays+0.056
Neg logit contributions
 impressed-0.064
aging-0.061
broken-0.061
ower-0.061
 pleased-0.060
Token,
Token ID11
Feature activation+5.15e+00

Pos logit contributions
 impressed+0.007
aging+0.007
ower+0.007
broken+0.007
outing+0.006
Neg logit contributions
|-0.012
 meets-0.011
 meet-0.011
 play-0.011
 plays-0.011
Token pushing
Token ID7796
Feature activation-9.44e-01

Pos logit contributions
ower+0.089
outing+0.087
aging+0.086
broken+0.085
 impressed+0.083
Neg logit contributions
|-0.126
 play-0.125
 meets-0.123
 meet-0.120
 plays-0.120
Token her
Token ID607
Feature activation-9.95e-01

Pos logit contributions
|+0.021
 meets+0.019
 play+0.019
 meet+0.019
 plays+0.019
Neg logit contributions
 impressed-0.020
aging-0.019
ower-0.019
broken-0.019
outing-0.019
Token toy
Token ID13373
Feature activation-1.73e+00

Pos logit contributions
|+0.026
 meets+0.024
 plays+0.023
 meet+0.023
 play+0.023
Neg logit contributions
 impressed-0.017
aging-0.017
ower-0.016
 pleased-0.016
broken-0.016
Token car
Token ID1097
Feature activation-8.09e-01

Pos logit contributions
|+0.043
 meets+0.038
 meet+0.037
 borrow+0.035
Ready+0.035
Neg logit contributions
aging-0.034
ower-0.033
broken-0.032
 impressed-0.031
king-0.030
Token around
Token ID1088
Feature activation-3.80e-01

Pos logit contributions
|+0.018
 meets+0.017
 meet+0.016
 plays+0.016
 play+0.016
Neg logit contributions
 impressed-0.016
aging-0.016
broken-0.016
ower-0.016
outing-0.015
Token day
Token ID1110
Feature activation+3.57e+00

Pos logit contributions
 play+0.013
|+0.013
 plays+0.012
 meet+0.012
 meets+0.011
Neg logit contributions
aging-0.025
 impressed-0.025
outing-0.025
broken-0.025
ower-0.025
Token,
Token ID11
Feature activation+4.14e+00

Pos logit contributions
 impressed+0.082
broken+0.081
outing+0.077
ower+0.076
 pleased+0.076
Neg logit contributions
 play-0.071
|-0.068
 plays-0.068
 meets-0.067
 meet-0.066
Token her
Token ID607
Feature activation-8.62e-01

Pos logit contributions
 impressed+0.121
broken+0.120
ower+0.116
king+0.114
outing+0.113
Neg logit contributions
 play-0.057
|-0.050
 plays-0.047
 have-0.047
 meet-0.047
Token mom
Token ID1995
Feature activation+5.57e-01

Pos logit contributions
|+0.021
 meets+0.019
 play+0.019
 meet+0.019
 plays+0.019
Neg logit contributions
 impressed-0.016
aging-0.016
ower-0.015
broken-0.015
outing-0.015
Token said
Token ID531
Feature activation+1.51e+00

Pos logit contributions
 impressed+0.012
aging+0.012
broken+0.011
ower+0.011
outing+0.011
Neg logit contributions
|-0.012
 meets-0.011
 play-0.011
 meet-0.011
 plays-0.010
Token,
Token ID11
Feature activation+5.58e+00

Pos logit contributions
 impressed+0.027
aging+0.026
ower+0.026
broken+0.026
outing+0.026
Neg logit contributions
|-0.037
 meets-0.035
 play-0.035
 meet-0.034
 plays-0.034
Token "
Token ID366
Feature activation-4.02e+00

Pos logit contributions
 impressed+0.098
aging+0.096
broken+0.095
ower+0.094
outing+0.094
Neg logit contributions
|-0.138
 play-0.131
 meets-0.131
 meet-0.128
 plays-0.128
TokenS
Token ID50
Feature activation+1.42e+00

Pos logit contributions
|+0.098
 meets+0.092
 play+0.091
 meet+0.089
 plays+0.089
Neg logit contributions
 impressed-0.074
aging-0.071
ower-0.070
broken-0.070
outing-0.070
Tokenara
Token ID3301
Feature activation+1.17e+00

Pos logit contributions
 impressed+0.031
aging+0.030
broken+0.029
ower+0.029
 pleased+0.029
Neg logit contributions
|-0.030
 meets-0.028
 play-0.028
 plays-0.027
 meet-0.027
Token,
Token ID11
Feature activation+4.25e+00

Pos logit contributions
 impressed+0.013
aging+0.012
ower+0.012
broken+0.012
 knowing+0.012
Neg logit contributions
|-0.037
 meets-0.034
Ready-0.033
 meet-0.033
 play-0.032
Token it
Token ID340
Feature activation-3.16e+00

Pos logit contributions
 impressed+0.102
 pleased+0.096
 knowing+0.095
ower+0.093
aging+0.093
Neg logit contributions
|-0.083
 meets-0.073
 plays-0.065
 borrow-0.063
 meet-0.063
Token from
Token ID422
Feature activation+3.86e-01

Pos logit contributions
|+0.003
 meets+0.003
 play+0.003
 meet+0.003
 plays+0.003
Neg logit contributions
 impressed-0.002
aging-0.002
ower-0.002
broken-0.002
outing-0.002
Token side
Token ID1735
Feature activation+1.01e+00

Pos logit contributions
 impressed+0.009
aging+0.009
ower+0.009
broken+0.009
outing+0.009
Neg logit contributions
|-0.007
 meets-0.007
 play-0.007
 meet-0.007
 plays-0.007
Token to
Token ID284
Feature activation-2.87e+00

Pos logit contributions
 impressed+0.020
aging+0.019
ower+0.019
broken+0.019
outing+0.019
Neg logit contributions
|-0.023
 meets-0.021
 play-0.021
 meet-0.021
 plays-0.021
Token side
Token ID1735
Feature activation+4.88e-01

Pos logit contributions
|+0.065
 fits+0.059
 meets+0.058
 plays+0.056
 shares+0.055
Neg logit contributions
 impressed-0.053
 knowing-0.050
 pleased-0.049
aging-0.049
 mesmer-0.048
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
 impressed+0.009
aging+0.009
 pleased+0.009
broken+0.009
ower+0.009
Neg logit contributions
|-0.012
 meets-0.011
 plays-0.010
 meet-0.010
 play-0.010
Token Then
Token ID3244
Feature activation+4.61e+00

Pos logit contributions
|+0.033
 meets+0.030
 play+0.030
 meet+0.029
 plays+0.029
Neg logit contributions
 impressed-0.045
aging-0.044
ower-0.043
broken-0.043
outing-0.043
Token,
Token ID11
Feature activation+5.58e+00

Pos logit contributions
outing+0.101
ower+0.101
 impressed+0.100
broken+0.100
aging+0.096
Neg logit contributions
 play-0.093
|-0.092
 meets-0.090
 meet-0.089
 plays-0.088
Token he
Token ID339
Feature activation+5.60e-02

Pos logit contributions
outing+0.144
ower+0.143
broken+0.142
 impressed+0.140
aging+0.137
Neg logit contributions
 play-0.095
|-0.090
 meet-0.088
 meets-0.086
 borrow-0.084
Token said
Token ID531
Feature activation+2.05e+00

Pos logit contributions
ower+0.001
aging+0.001
 impressed+0.001
outing+0.001
broken+0.001
Neg logit contributions
|-0.001
 meets-0.001
 play-0.001
 plays-0.001
 meet-0.001
Token to
Token ID284
Feature activation-5.54e-01

Pos logit contributions
ower+0.042
outing+0.042
broken+0.041
aging+0.041
 impressed+0.041
Neg logit contributions
 play-0.042
|-0.041
 plays-0.041
 meets-0.041
 meet-0.040
Token kick
Token ID4829
Feature activation-1.95e+00

Pos logit contributions
|+0.011
 meets+0.010
 plays+0.010
 play+0.010
 fits+0.010
Neg logit contributions
 impressed-0.012
aging-0.012
 pleased-0.012
broken-0.012
ower-0.012
Token sand
Token ID6450
Feature activation+2.30e+00

Pos logit contributions
ower+0.038
 impressed+0.038
outing+0.036
broken+0.036
aging+0.036
Neg logit contributions
|-0.035
 play-0.034
 meets-0.033
 meet-0.032
 borrow-0.032
Token.
Token ID13
Feature activation-2.36e+00

Pos logit contributions
 impressed+0.045
 pleased+0.042
outing+0.042
broken+0.041
ower+0.041
Neg logit contributions
|-0.052
 play-0.050
 meets-0.049
 plays-0.048
 meet-0.047
Token The
Token ID383
Feature activation+2.14e-01

Pos logit contributions
|+0.043
 meets+0.039
 play+0.038
 plays+0.037
 meet+0.037
Neg logit contributions
 impressed-0.058
aging-0.056
 pleased-0.056
ower-0.056
outing-0.055
Token camel
Token ID41021
Feature activation+4.98e-01

Pos logit contributions
broken+0.005
ower+0.005
 impressed+0.005
outing+0.004
aging+0.004
Neg logit contributions
 play-0.004
 meet-0.004
 meets-0.004
|-0.004
 will-0.004
Token was
Token ID373
Feature activation+2.65e+00

Pos logit contributions
ower+0.011
 impressed+0.011
aging+0.011
outing+0.011
broken+0.011
Neg logit contributions
|-0.010
 play-0.009
 meets-0.009
 meet-0.009
 plays-0.009
Token frustrated
Token ID14718
Feature activation+2.98e+00

Pos logit contributions
ower+0.056
itive+0.052
outing+0.049
ised+0.048
ment+0.047
Neg logit contributions
 play-0.057
|-0.055
 meets-0.054
 meet-0.054
 have-0.052
Token!
Token ID0
Feature activation-1.93e+00

Pos logit contributions
 impressed+0.052
ower+0.052
aging+0.051
outing+0.051
broken+0.051
Neg logit contributions
 play-0.070
|-0.068
 plays-0.067
 meets-0.065
 borrow-0.064
Token
Token ID198
Feature activation+2.44e+00

Pos logit contributions
|+0.035
 meets+0.032
 play+0.032
 plays+0.031
 meet+0.031
Neg logit contributions
 impressed-0.047
 pleased-0.045
aging-0.045
ower-0.045
outing-0.045
Token
Token ID198
Feature activation+2.19e+00

Pos logit contributions
 impressed+0.062
aging+0.061
ower+0.060
broken+0.060
outing+0.060
Neg logit contributions
|-0.043
 meets-0.039
 play-0.038
 meet-0.037
 plays-0.037
TokenThen
Token ID6423
Feature activation+3.85e+00

Pos logit contributions
 impressed+0.038
aging+0.037
ower+0.037
outing+0.036
broken+0.036
Neg logit contributions
|-0.055
 meets-0.052
 play-0.051
 meet-0.051
 plays-0.050
Token,
Token ID11
Feature activation+5.26e+00

Pos logit contributions
 impressed+0.077
aging+0.074
ower+0.074
broken+0.073
outing+0.073
Neg logit contributions
|-0.087
 meets-0.082
 play-0.081
 meet-0.079
 plays-0.079
Token back
Token ID736
Feature activation-6.14e-01

Pos logit contributions
|+0.032
 meets+0.030
 meet+0.029
 plays+0.029
 play+0.028
Neg logit contributions
 impressed-0.024
broken-0.024
aging-0.024
ower-0.023
 pleased-0.023
Token together
Token ID1978
Feature activation+6.72e-01

Pos logit contributions
|+0.014
 meets+0.014
 meet+0.013
 plays+0.013
 play+0.012
Neg logit contributions
aging-0.012
broken-0.012
ower-0.011
 impressed-0.011
outing-0.011
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
 impressed+0.012
aging+0.012
broken+0.012
 pleased+0.012
 knowing+0.012
Neg logit contributions
|-0.016
 meets-0.015
 meet-0.015
 plays-0.015
 play-0.014
Token
Token ID220
Feature activation+4.94e-01

Pos logit contributions
|+0.036
 meets+0.035
 meet+0.033
 play+0.033
 plays+0.032
Neg logit contributions
 impressed-0.036
 pleased-0.035
ower-0.034
aging-0.034
broken-0.033
Token
Token ID198
Feature activation+2.87e+00

Pos logit contributions
 impressed+0.011
ower+0.010
aging+0.010
 pleased+0.010
broken+0.010
Neg logit contributions
|-0.010
 meets-0.010
 play-0.009
 plays-0.009
 meet-0.009
Token
Token ID198
Feature activation+2.49e+00

Pos logit contributions
 impressed+0.069
aging+0.067
ower+0.066
broken+0.066
 pleased+0.066
Neg logit contributions
|-0.052
 meets-0.049
 play-0.047
 plays-0.047
 meet-0.047
Token"
Token ID1
Feature activation-2.14e+00

Pos logit contributions
 impressed+0.039
aging+0.037
ower+0.037
broken+0.037
outing+0.036
Neg logit contributions
|-0.067
 meets-0.064
 play-0.063
 meet-0.062
 plays-0.062
TokenYou
Token ID1639
Feature activation-1.64e+00

Pos logit contributions
|+0.042
 meets+0.040
 play+0.038
 plays+0.038
 meet+0.038
Neg logit contributions
 impressed-0.049
aging-0.047
 pleased-0.047
broken-0.047
ower-0.047
Token are
Token ID389
Feature activation+1.96e+00

Pos logit contributions
|+0.026
 meets+0.024
 plays+0.022
 play+0.022
 meet+0.021
Neg logit contributions
 impressed-0.044
 pleased-0.042
broken-0.041
aging-0.041
 mesmer-0.040
Token very
Token ID845
Feature activation+3.45e+00

Pos logit contributions
 impressed+0.040
 pleased+0.038
aging+0.035
broken+0.035
 mesmer+0.034
Neg logit contributions
|-0.046
 meets-0.044
 plays-0.042
 play-0.041
 meet-0.040
Token welcome
Token ID7062
Feature activation+2.12e+00

Pos logit contributions
 impressed+0.057
aging+0.056
ower+0.056
broken+0.055
outing+0.055
Neg logit contributions
|-0.090
 meets-0.084
 play-0.084
 meet-0.083
 plays-0.082
Token a
Token ID257
Feature activation+2.95e+00

Pos logit contributions
 impressed+0.120
aging+0.117
broken+0.115
 pleased+0.113
ower+0.112
Neg logit contributions
|-0.107
 meets-0.099
 meet-0.096
 play-0.093
 plays-0.093
Token voice
Token ID3809
Feature activation+2.13e+00

Pos logit contributions
aging+0.055
outing+0.053
ower+0.052
broken+0.051
 impressed+0.050
Neg logit contributions
 play-0.069
|-0.067
 meet-0.065
 borrow-0.064
 meets-0.063
Token spoke
Token ID5158
Feature activation+1.41e+00

Pos logit contributions
ower+0.047
 impressed+0.046
broken+0.046
outing+0.045
aging+0.045
Neg logit contributions
|-0.043
 play-0.042
 meet-0.041
 meets-0.040
 plays-0.040
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 impressed+0.037
ower+0.035
 pleased+0.035
 sudden+0.035
aging+0.034
Neg logit contributions
 play-0.022
|-0.022
 plays-0.021
 borrow-0.021
 meet-0.020
Token It
Token ID632
Feature activation-3.02e+00

Pos logit contributions
|+0.028
 meets+0.026
 play+0.026
 meet+0.025
 plays+0.025
Neg logit contributions
 impressed-0.030
aging-0.029
broken-0.029
ower-0.029
outing-0.029
Token was
Token ID373
Feature activation+1.69e+00

Pos logit contributions
|+0.059
 play+0.058
 meets+0.057
 meet+0.056
 plays+0.055
Neg logit contributions
ower-0.069
outing-0.067
aging-0.067
 impressed-0.065
broken-0.065
Token not
Token ID407
Feature activation+1.57e+00

Pos logit contributions
ower+0.041
outing+0.040
aging+0.039
broken+0.039
 impressed+0.039
Neg logit contributions
|-0.032
 play-0.030
 meet-0.029
 meets-0.029
 borrow-0.028
Token the
Token ID262
Feature activation-4.51e-01

Pos logit contributions
ower+0.030
outing+0.030
aging+0.029
broken+0.029
 impressed+0.029
Neg logit contributions
|-0.036
 play-0.034
 meets-0.034
 meet-0.034
 borrow-0.033
Token flower
Token ID15061
Feature activation-1.20e+00

Pos logit contributions
|+0.009
 meets+0.008
 play+0.008
 plays+0.008
 meet+0.007
Neg logit contributions
 impressed-0.011
aging-0.010
ower-0.010
broken-0.010
outing-0.010
Token,
Token ID11
Feature activation+5.83e+00

Pos logit contributions
|+0.031
 meets+0.028
 meet+0.027
 plays+0.027
 play+0.027
Neg logit contributions
 impressed-0.021
aging-0.020
ower-0.020
 pleased-0.020
outing-0.020
Token but
Token ID475
Feature activation+3.04e+00

Pos logit contributions
 impressed+0.087
aging+0.087
broken+0.085
ower+0.085
outing+0.084
Neg logit contributions
|-0.155
 play-0.151
 meets-0.149
 plays-0.148
 meet-0.147
Token together
Token ID1978
Feature activation+1.18e+00

Pos logit contributions
aging+0.013
 impressed+0.013
ower+0.013
outing+0.013
broken+0.013
Neg logit contributions
|-0.017
 play-0.016
 meets-0.016
 meet-0.016
 plays-0.016
Token.
Token ID13
Feature activation-2.42e+00

Pos logit contributions
 impressed+0.021
aging+0.021
ower+0.021
outing+0.021
broken+0.021
Neg logit contributions
|-0.027
 play-0.027
 meets-0.026
 meet-0.026
 plays-0.026
Token The
Token ID383
Feature activation+6.42e-02

Pos logit contributions
|+0.055
 meets+0.053
 play+0.051
 plays+0.050
 fits+0.050
Neg logit contributions
 knowing-0.042
 impressed-0.041
 pleased-0.039
ower-0.039
 mesmer-0.039
Token fish
Token ID5916
Feature activation+4.80e-01

Pos logit contributions
broken+0.001
aging+0.001
ower+0.001
outing+0.001
 impressed+0.001
Neg logit contributions
 play-0.001
|-0.001
 meets-0.001
 meet-0.001
 plays-0.001
Token always
Token ID1464
Feature activation+3.08e-02

Pos logit contributions
 impressed+0.010
 pleased+0.010
broken+0.010
aging+0.010
ower+0.009
Neg logit contributions
|-0.011
 meets-0.010
 meet-0.009
 play-0.009
 plays-0.009
Token looked
Token ID3114
Feature activation+1.12e+00

Pos logit contributions
 impressed+0.001
aging+0.001
ower+0.001
broken+0.001
outing+0.001
Neg logit contributions
|-0.001
 meets-0.001
 play-0.001
 meet-0.001
 plays-0.001
Token so
Token ID523
Feature activation+2.82e+00

Pos logit contributions
ower+0.021
 impressed+0.021
aging+0.020
outing+0.020
broken+0.020
Neg logit contributions
|-0.026
 play-0.025
 meets-0.025
 meet-0.024
 plays-0.024
Token pretty
Token ID2495
Feature activation+1.34e+00

Pos logit contributions
 impressed+0.042
aging+0.040
ower+0.040
broken+0.039
outing+0.039
Neg logit contributions
|-0.079
 meets-0.074
 play-0.074
 meet-0.073
 plays-0.072
Token and
Token ID290
Feature activation+1.95e+00

Pos logit contributions
 impressed+0.023
aging+0.022
ower+0.022
broken+0.022
outing+0.022
Neg logit contributions
|-0.034
 meets-0.032
 play-0.032
 meet-0.031
 plays-0.031
Token spicy
Token ID26880
Feature activation+1.13e+00

Pos logit contributions
 impressed+0.035
 pleased+0.033
aging+0.032
ower+0.031
broken+0.031
Neg logit contributions
|-0.050
 meets-0.046
 meet-0.045
 play-0.044
 plays-0.044
Token!
Token ID0
Feature activation-2.10e+00

Pos logit contributions
ower+0.020
 impressed+0.020
aging+0.020
 pleased+0.019
 knowing+0.019
Neg logit contributions
|-0.029
 meets-0.026
 meet-0.025
 borrow-0.024
 plays-0.024
Token getting
Token ID1972
Feature activation+1.38e+00

Pos logit contributions
ower+0.048
ment+0.047
itive+0.046
ittens+0.046
edly+0.045
Neg logit contributions
 play-0.034
 have-0.028
 meet-0.028
 visit-0.028
|-0.027
Token very
Token ID845
Feature activation+3.42e+00

Pos logit contributions
ower+0.034
itive+0.031
aging+0.030
oles+0.030
uff+0.029
Neg logit contributions
 play-0.026
|-0.025
 plays-0.023
 will-0.023
 borrow-0.022
Token dark
Token ID3223
Feature activation+3.14e+00

Pos logit contributions
itive+0.082
ittens+0.078
ower+0.077
king+0.076
aged+0.074
Neg logit contributions
 play-0.062
|-0.051
 will-0.051
 visit-0.049
 plays-0.048
Token.
Token ID13
Feature activation-2.09e+00

Pos logit contributions
 impressed+0.060
 pleased+0.058
outing+0.058
ower+0.055
broken+0.055
Neg logit contributions
|-0.074
 play-0.072
 plays-0.069
 meets-0.068
 meet-0.067
Token
Token ID198
Feature activation+2.29e+00

Pos logit contributions
|+0.032
 meets+0.029
 play+0.029
 meet+0.028
 plays+0.028
Neg logit contributions
 impressed-0.057
aging-0.056
broken-0.055
ower-0.055
outing-0.055
Token
Token ID198
Feature activation+1.95e+00

Pos logit contributions
 impressed+0.057
aging+0.056
ower+0.055
broken+0.055
outing+0.055
Neg logit contributions
|-0.040
 meets-0.037
 play-0.036
 meet-0.035
 plays-0.035
Token"
Token ID1
Feature activation-2.76e+00

Pos logit contributions
aging+0.039
 impressed+0.038
ower+0.038
outing+0.037
broken+0.037
Neg logit contributions
|-0.044
 play-0.041
 meets-0.041
 meet-0.040
 plays-0.039
TokenOh
Token ID5812
Feature activation+3.04e+00

Pos logit contributions
|+0.055
 meets+0.049
 play+0.049
 meet+0.049
 plays+0.047
Neg logit contributions
 impressed-0.064
aging-0.063
ower-0.062
broken-0.061
 pleased-0.061
Token no
Token ID645
Feature activation+3.17e+00

Pos logit contributions
 impressed+0.063
aging+0.062
outing+0.062
broken+0.062
ower+0.062
Neg logit contributions
|-0.064
 play-0.063
 meets-0.061
 meet-0.060
 plays-0.060
Token,"
Token ID553
Feature activation+2.68e+00

Pos logit contributions
outing+0.087
 impressed+0.087
aging+0.086
broken+0.084
ower+0.084
Neg logit contributions
 play-0.047
|-0.047
 meet-0.043
 plays-0.042
 meets-0.042
Token murm
Token ID38023
Feature activation+5.26e+00

Pos logit contributions
 impressed+0.046
aging+0.045
ower+0.045
broken+0.045
outing+0.044
Neg logit contributions
|-0.067
 play-0.063
 meets-0.063
 plays-0.062
 meet-0.062
Token was
Token ID373
Feature activation+9.60e-01

Pos logit contributions
|+0.089
 meets+0.086
 meet+0.082
 borrow+0.080
 plays+0.080
Neg logit contributions
 impressed-0.086
aging-0.082
 pleased-0.081
ower-0.080
 sudden-0.080
Token an
Token ID281
Feature activation+5.72e-01

Pos logit contributions
outing+0.023
ower+0.022
 impressed+0.022
broken+0.022
aging+0.022
Neg logit contributions
|-0.019
 play-0.018
 meet-0.017
 plays-0.017
 meets-0.017
Token old
Token ID1468
Feature activation-1.25e+00

Pos logit contributions
outing+0.011
aging+0.011
ower+0.011
broken+0.010
ment+0.010
Neg logit contributions
 play-0.013
|-0.012
 plays-0.012
 meet-0.012
 borrow-0.011
Token man
Token ID582
Feature activation+4.58e-01

Pos logit contributions
 play+0.025
|+0.025
 plays+0.024
 meets+0.023
 meet+0.023
Neg logit contributions
 impressed-0.027
outing-0.027
aging-0.027
broken-0.026
 pleased-0.026
Token.
Token ID13
Feature activation-2.90e+00

Pos logit contributions
 impressed+0.009
ower+0.009
aging+0.009
outing+0.009
broken+0.009
Neg logit contributions
|-0.010
 play-0.010
 meets-0.010
 meet-0.010
 plays-0.010
Token He
Token ID679
Feature activation-1.37e+00

Pos logit contributions
|+0.056
 meets+0.056
 fits+0.050
 meet+0.050
 plays+0.049
Neg logit contributions
 impressed-0.062
aging-0.060
ower-0.059
 pleased-0.059
 mesmer-0.058
Token had
Token ID550
Feature activation-2.60e+00

Pos logit contributions
|+0.027
 play+0.026
 meets+0.025
 plays+0.025
 meet+0.024
Neg logit contributions
outing-0.032
 impressed-0.032
ower-0.031
aging-0.031
broken-0.031
Token a
Token ID257
Feature activation-1.50e+00

Pos logit contributions
|+0.061
 meets+0.057
 play+0.056
 meet+0.055
 plays+0.055
Neg logit contributions
 impressed-0.050
aging-0.049
broken-0.048
ower-0.048
 pleased-0.047
Token special
Token ID2041
Feature activation-2.07e+00

Pos logit contributions
|+0.035
 play+0.034
 meets+0.033
 meet+0.033
 plays+0.032
Neg logit contributions
 impressed-0.028
aging-0.027
outing-0.027
broken-0.027
ower-0.027
Token mask
Token ID9335
Feature activation-1.12e+00

Pos logit contributions
|+0.037
 play+0.034
 meets+0.034
 meet+0.033
 plays+0.033
Neg logit contributions
 impressed-0.051
aging-0.050
outing-0.049
broken-0.049
ower-0.049
Token that
Token ID326
Feature activation-2.56e+00

Pos logit contributions
|+0.030
 meets+0.028
 meet+0.027
 play+0.027
 plays+0.027
Neg logit contributions
 impressed-0.018
aging-0.017
ower-0.017
broken-0.017
outing-0.016
Token day
Token ID1110
Feature activation+2.80e+00

Pos logit contributions
 impressed+0.023
aging+0.023
ower+0.023
broken+0.022
outing+0.022
Neg logit contributions
|-0.016
 play-0.015
 meets-0.015
 meet-0.014
 plays-0.014
Token,
Token ID11
Feature activation+3.69e+00

Pos logit contributions
 impressed+0.039
aging+0.038
 pleased+0.037
ower+0.036
broken+0.036
Neg logit contributions
|-0.081
 meets-0.075
 meet-0.073
 play-0.072
 plays-0.072
Token it
Token ID340
Feature activation-2.83e+00

Pos logit contributions
 impressed+0.068
aging+0.066
 pleased+0.065
broken+0.064
 sudden+0.063
Neg logit contributions
|-0.092
 meets-0.084
 meet-0.080
 plays-0.079
 play-0.077
Token figured
Token ID11638
Feature activation-2.56e+00

Pos logit contributions
|+0.062
 meets+0.057
 meet+0.056
 play+0.056
 plays+0.055
Neg logit contributions
 impressed-0.059
aging-0.057
broken-0.057
 pleased-0.055
ower-0.055
Token out
Token ID503
Feature activation-7.58e-01

Pos logit contributions
|+0.063
 meets+0.055
 plays+0.052
 meet+0.051
 play+0.050
Neg logit contributions
aging-0.048
 impressed-0.047
 knowing-0.046
 pleased-0.045
broken-0.044
Token how
Token ID703
Feature activation-2.41e+00

Pos logit contributions
|+0.017
 play+0.017
 plays+0.016
 meets+0.016
 borrow+0.016
Neg logit contributions
ower-0.015
 impressed-0.015
aging-0.014
outing-0.014
broken-0.014
Token to
Token ID284
Feature activation-5.21e+00

Pos logit contributions
|+0.058
 meets+0.052
 meet+0.050
 plays+0.047
Ready+0.046
Neg logit contributions
 impressed-0.049
 pleased-0.047
broken-0.046
aging-0.045
 knowing-0.044
Token zip
Token ID19974
Feature activation-3.68e+00

Pos logit contributions
|+0.095
 meets+0.091
 Welcome+0.084
 plays+0.079
 shares+0.077
Neg logit contributions
aging-0.118
broken-0.118
 impressed-0.117
ower-0.112
ble-0.112
Token!
Token ID0
Feature activation-3.08e+00

Pos logit contributions
|+0.087
 meets+0.080
 meet+0.079
 borrow+0.072
 play+0.071
Neg logit contributions
 impressed-0.069
broken-0.067
aging-0.067
ower-0.066
ised-0.065
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
 meets+0.067
|+0.064
 meet+0.060
 plays+0.060
 fits+0.060
Neg logit contributions
 sudden-0.055
ised-0.055
Suddenly-0.054
 impressed-0.054
ower-0.054
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
 impressed+0.031
 pleased+0.029
aging+0.029
ower+0.028
 sudden+0.028
Neg logit contributions
|-0.029
 meets-0.028
 plays-0.028
 play-0.027
 fits-0.027
Token But
Token ID887
Feature activation+2.59e+00

Pos logit contributions
|+0.039
 meets+0.036
 play+0.035
 meet+0.034
 plays+0.034
Neg logit contributions
 impressed-0.063
broken-0.062
aging-0.062
ower-0.061
outing-0.060
Token they
Token ID484
Feature activation-4.46e-01

Pos logit contributions
ower+0.072
broken+0.072
outing+0.070
 impressed+0.069
aging+0.068
Neg logit contributions
 play-0.040
|-0.038
 meets-0.038
 meet-0.037
 plays-0.036
Token are
Token ID389
Feature activation+1.76e+00

Pos logit contributions
|+0.008
 meets+0.007
 plays+0.007
Ready+0.006
 fits+0.006
Neg logit contributions
 impressed-0.011
aging-0.011
 pleased-0.011
outing-0.011
ower-0.010
Token lost
Token ID2626
Feature activation+2.84e+00

Pos logit contributions
ower+0.037
outing+0.033
ment+0.033
ised+0.032
aging+0.032
Neg logit contributions
 play-0.039
|-0.038
 meets-0.038
 meet-0.036
 plays-0.036
Token.
Token ID13
Feature activation-2.60e+00

Pos logit contributions
 impressed+0.066
ower+0.064
aging+0.063
 pleased+0.063
outing+0.063
Neg logit contributions
|-0.054
 play-0.053
 plays-0.049
 meets-0.049
 borrow-0.048
Token They
Token ID1119
Feature activation-5.84e-01

Pos logit contributions
|+0.044
 meets+0.040
 play+0.039
 meet+0.038
 plays+0.038
Neg logit contributions
 impressed-0.067
aging-0.066
broken-0.065
ower-0.065
outing-0.065
Token do
Token ID466
Feature activation-2.13e-01

Pos logit contributions
|+0.010
 meets+0.009
 plays+0.008
 fits+0.008
uggle+0.008
Neg logit contributions
aging-0.014
 impressed-0.014
 pleased-0.013
 mesmer-0.013
 sudden-0.013
Token not
Token ID407
Feature activation+4.71e-01

Pos logit contributions
|+0.006
 meets+0.005
 play+0.005
 meet+0.005
 plays+0.005
Neg logit contributions
ower-0.004
outing-0.003
 impressed-0.003
aging-0.003
 pleased-0.003
Token see
Token ID766
Feature activation-1.91e+00

Pos logit contributions
 impressed+0.010
aging+0.009
 knowing+0.009
 pleased+0.009
broken+0.009
Neg logit contributions
|-0.010
 meets-0.009
 plays-0.009
 fits-0.009
uggle-0.008
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
|+0.035
 meets+0.033
 play+0.033
 meet+0.032
 plays+0.032
Neg logit contributions
 impressed-0.046
ower-0.045
aging-0.045
broken-0.045
outing-0.045
Token trail
Token ID8025
Feature activation+6.87e-01

Pos logit contributions
 impressed+0.004
aging+0.004
ower+0.004
broken+0.004
outing+0.004
Neg logit contributions
|-0.004
 meets-0.004
 play-0.004
 meet-0.004
 plays-0.003
Token of
Token ID286
Feature activation-8.77e-01

Pos logit contributions
 impressed+0.036
aging+0.035
ower+0.034
outing+0.034
broken+0.034
Neg logit contributions
|-0.048
 meets-0.045
 play-0.044
 meet-0.044
 plays-0.044
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
|+0.017
 meets+0.015
 play+0.015
 plays+0.015
 meet+0.015
Neg logit contributions
 impressed-0.021
aging-0.020
broken-0.020
outing-0.020
ower-0.020
Token big
Token ID1263
Feature activation+6.77e-01

Pos logit contributions
 impressed+0.020
aging+0.020
ower+0.020
broken+0.020
outing+0.019
Neg logit contributions
|-0.023
 meets-0.021
 play-0.021
 meet-0.021
 plays-0.021
Token kids
Token ID3988
Feature activation+9.41e-01

Pos logit contributions
 impressed+0.015
aging+0.015
ower+0.014
broken+0.014
outing+0.014
Neg logit contributions
|-0.014
 meets-0.013
 play-0.013
 meet-0.013
 plays-0.012
Token says
Token ID1139
Feature activation+3.03e+00

Pos logit contributions
 impressed+0.021
outing+0.020
aging+0.020
 pleased+0.020
ower+0.019
Neg logit contributions
|-0.020
 meets-0.017
 plays-0.017
 fits-0.016
Ready-0.016
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 impressed+0.073
ower+0.071
aging+0.071
broken+0.070
 pleased+0.069
Neg logit contributions
|-0.053
 play-0.051
 meets-0.050
 plays-0.049
 borrow-0.048
Token
Token ID198
Feature activation+2.52e+00

Pos logit contributions
|+0.017
 meets+0.016
 play+0.016
 meet+0.015
 plays+0.015
Neg logit contributions
aging-0.033
broken-0.032
 impressed-0.032
ower-0.032
outing-0.031
Token
Token ID198
Feature activation+2.33e+00

Pos logit contributions
aging+0.071
 impressed+0.070
ower+0.070
broken+0.069
outing+0.069
Neg logit contributions
|-0.036
 play-0.033
 meets-0.033
 meet-0.032
 plays-0.031
TokenHe
Token ID1544
Feature activation-5.65e-02

Pos logit contributions
aging+0.051
ower+0.049
 impressed+0.048
 pleased+0.047
itive+0.047
Neg logit contributions
|-0.048
 play-0.047
 meets-0.047
 meet-0.045
 plays-0.044
Token tries
Token ID8404
Feature activation-1.34e+00

Pos logit contributions
|+0.001
 meet+0.001
uggle+0.001
 play+0.001
 meets+0.001
Neg logit contributions
 impressed-0.001
outing-0.001
aging-0.001
 pleased-0.001
broken-0.001
Token to
Token ID284
Feature activation-3.42e+00

Pos logit contributions
|+0.021
 play+0.020
 meets+0.019
 meet+0.019
 plays+0.019
Neg logit contributions
 impressed-0.036
ower-0.035
aging-0.035
broken-0.035
outing-0.035
Token very
Token ID845
Feature activation+2.69e+00

Pos logit contributions
 impressed+0.016
 pleased+0.015
aging+0.014
 mesmer+0.013
 knowing+0.013
Neg logit contributions
|-0.031
 meets-0.028
 plays-0.028
 borrow-0.028
 play-0.028
Token pretty
Token ID2495
Feature activation+6.65e-01

Pos logit contributions
 impressed+0.041
aging+0.041
broken+0.041
ower+0.041
outing+0.040
Neg logit contributions
|-0.071
 meets-0.068
 play-0.068
 meet-0.067
 plays-0.066
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
 impressed+0.012
aging+0.012
broken+0.012
ower+0.012
 pleased+0.012
Neg logit contributions
|-0.016
 meets-0.015
 play-0.015
 meet-0.015
 plays-0.015
Token y
Token ID331
Feature activation-1.54e-01

Pos logit contributions
 impressed+0.021
 pleased+0.020
aging+0.019
ower+0.019
broken+0.019
Neg logit contributions
|-0.038
 meets-0.036
 play-0.034
 meet-0.034
 plays-0.034
Tokenummy
Token ID13513
Feature activation+8.90e-01

Pos logit contributions
|+0.004
 play+0.004
 plays+0.004
 meets+0.004
 borrow+0.004
Neg logit contributions
 impressed-0.002
 pleased-0.002
broken-0.002
aging-0.002
 mesmer-0.002
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 impressed+0.017
aging+0.017
 pleased+0.016
ower+0.016
broken+0.016
Neg logit contributions
|-0.021
 meets-0.019
 meet-0.019
 play-0.018
 plays-0.018
Token They
Token ID1119
Feature activation-5.62e-02

Pos logit contributions
|+0.025
 meets+0.023
 play+0.022
 meet+0.022
 plays+0.022
Neg logit contributions
 impressed-0.036
aging-0.035
broken-0.035
ower-0.035
outing-0.035
Token wanted
Token ID2227
Feature activation-2.62e+00

Pos logit contributions
|+0.001
 meets+0.001
 play+0.001
 meet+0.001
 plays+0.001
Neg logit contributions
 impressed-0.001
aging-0.001
broken-0.001
ower-0.001
 pleased-0.001
Token to
Token ID284
Feature activation-4.11e+00

Pos logit contributions
|+0.059
 meets+0.051
 fits+0.049
 plays+0.049
 play+0.048
Neg logit contributions
 impressed-0.055
aging-0.053
 knowing-0.052
 pleased-0.052
outing-0.050
Token try
Token ID1949
Feature activation-3.61e+00

Pos logit contributions
|+0.066
 meets+0.059
 fits+0.057
Ready+0.054
 plays+0.052
Neg logit contributions
 impressed-0.108
 pleased-0.103
 mesmer-0.100
 understanding-0.100
 knowing-0.098
Token them
Token ID606
Feature activation-2.84e+00

Pos logit contributions
|+0.070
 meets+0.062
 play+0.061
 plays+0.060
 meet+0.060
Neg logit contributions
 impressed-0.086
aging-0.082
 pleased-0.081
ower-0.081
broken-0.080
Token
Token ID198
Feature activation+2.49e+00

Pos logit contributions
|+0.038
 meets+0.036
 meet+0.034
 plays+0.034
 play+0.034
Neg logit contributions
 impressed-0.036
 pleased-0.034
broken-0.032
ower-0.032
outing-0.032
Token
Token ID198
Feature activation+2.27e+00

Pos logit contributions
 impressed+0.059
aging+0.057
ower+0.057
broken+0.057
 pleased+0.056
Neg logit contributions
|-0.046
 meets-0.043
 play-0.042
 plays-0.041
 meet-0.041
TokenL
Token ID43
Feature activation+2.24e+00

Pos logit contributions
aging+0.050
 impressed+0.049
ower+0.048
outing+0.047
 pleased+0.046
Neg logit contributions
|-0.048
 play-0.045
 meets-0.043
 plays-0.043
 meet-0.043
Tokenily
Token ID813
Feature activation+1.02e-01

Pos logit contributions
 impressed+0.046
aging+0.045
broken+0.044
ower+0.044
outing+0.044
Neg logit contributions
|-0.049
 meets-0.047
 play-0.046
 meet-0.046
 plays-0.045
Token wanted
Token ID2227
Feature activation-2.95e+00

Pos logit contributions
 impressed+0.002
ower+0.002
aging+0.002
outing+0.002
 pleased+0.002
Neg logit contributions
|-0.002
 play-0.002
 meets-0.002
 plays-0.002
 meet-0.002
Token to
Token ID284
Feature activation-4.29e+00

Pos logit contributions
|+0.080
 fits+0.069
Ready+0.067
 meets+0.067
uggle+0.066
Neg logit contributions
 knowing-0.049
 impressed-0.048
 realizing-0.044
 pleased-0.043
 understanding-0.043
Token help
Token ID1037
Feature activation-2.48e+00

Pos logit contributions
|+0.077
 meets+0.071
Ready+0.071
rob+0.071
 fits+0.067
Neg logit contributions
 understanding-0.098
 impressed-0.097
 pleased-0.093
broken-0.090
 knowing-0.089
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
|+0.062
 meets+0.057
 fits+0.054
 plays+0.053
Ready+0.053
Neg logit contributions
 impressed-0.046
aging-0.044
 knowing-0.043
 pleased-0.042
broken-0.041
Token pot
Token ID1787
Feature activation-1.60e+00

Pos logit contributions
 play+0.021
 meets+0.020
|+0.019
 meet+0.019
 plays+0.019
Neg logit contributions
 impressed-0.025
broken-0.024
ower-0.024
outing-0.024
king-0.024
Token feel
Token ID1254
Feature activation+1.40e+00

Pos logit contributions
|+0.043
 meets+0.040
 plays+0.038
 meet+0.038
 play+0.038
Neg logit contributions
 impressed-0.026
 pleased-0.024
aging-0.024
broken-0.024
outing-0.023
Token better
Token ID1365
Feature activation+3.52e+00

Pos logit contributions
 impressed+0.034
 pleased+0.033
aging+0.033
broken+0.033
outing+0.032
Neg logit contributions
|-0.026
 meets-0.024
 meet-0.022
 play-0.022
 plays-0.022
Token.
Token ID13
Feature activation-2.47e+00

Pos logit contributions
 impressed+0.022
aging+0.021
ower+0.021
broken+0.021
 pleased+0.020
Neg logit contributions
|-0.027
 meets-0.025
 play-0.025
 meet-0.025
 plays-0.025
Token<|endoftext|>
Token ID50256
Feature activation-6.80e+00

Pos logit contributions
|+0.053
 meets+0.052
 play+0.051
 shares+0.050
 hunt+0.050
Neg logit contributions
 impressed-0.045
 mesmer-0.043
 knowing-0.043
 pleased-0.043
ower-0.041
TokenW
Token ID54
Feature activation+1.14e+00

Pos logit contributions
 shares+0.143
 borrow+0.142
 fits+0.140
 meets+0.139
 meet+0.135
Neg logit contributions
 sudden-0.109
ower-0.108
 impressed-0.107
uff-0.105
 mesmer-0.105
Tokenilly
Token ID6548
Feature activation-8.04e-01

Pos logit contributions
 impressed+0.022
aging+0.021
ower+0.021
broken+0.021
outing+0.021
Neg logit contributions
|-0.027
 meets-0.025
 play-0.025
 meet-0.024
 plays-0.024
Token loved
Token ID6151
Feature activation-2.02e+00

Pos logit contributions
|+0.024
 meets+0.022
 meet+0.021
 borrow+0.021
Ready+0.021
Neg logit contributions
 impressed-0.010
aging-0.009
broken-0.009
ower-0.009
 knowing-0.009
Token to
Token ID284
Feature activation-4.93e+00

Pos logit contributions
|+0.046
 meets+0.042
 fits+0.041
Ready+0.037
 meet+0.037
Neg logit contributions
 impressed-0.042
aging-0.041
 knowing-0.040
 sudden-0.039
 pleased-0.038
Token whistle
Token ID16121
Feature activation-2.09e+00

Pos logit contributions
|+0.070
 meets+0.062
 fits+0.053
Ready+0.051
Check+0.047
Neg logit contributions
 impressed-0.141
 pleased-0.135
 mesmer-0.133
ower-0.132
aging-0.132
Token.
Token ID13
Feature activation-4.22e+00

Pos logit contributions
|+0.048
 meets+0.044
 play+0.044
 meet+0.043
 plays+0.043
Neg logit contributions
 impressed-0.041
aging-0.040
ower-0.040
broken-0.040
outing-0.039
Token He
Token ID679
Feature activation-2.27e+00

Pos logit contributions
 meets+0.093
|+0.088
 fits+0.081
 shares+0.078
 meet+0.076
Neg logit contributions
 knowing-0.076
uff-0.076
 mesmer-0.075
ower-0.075
 impressed-0.075
Token went
Token ID1816
Feature activation-1.18e+00

Pos logit contributions
|+0.050
 meets+0.044
 meet+0.044
 borrow+0.042
 play+0.042
Neg logit contributions
 impressed-0.047
aging-0.045
broken-0.044
 pleased-0.044
 sudden-0.043
Token outside
Token ID2354
Feature activation+8.34e-01

Pos logit contributions
|+0.024
 meets+0.022
 fits+0.021
 meet+0.021
 plays+0.020
Neg logit contributions
 impressed-0.026
aging-0.026
 pleased-0.025
ower-0.025
broken-0.024
Token in
Token ID287
Feature activation+2.79e-01

Pos logit contributions
 impressed+0.034
aging+0.033
ower+0.033
broken+0.033
outing+0.032
Neg logit contributions
|-0.041
 play-0.039
 meets-0.039
 plays-0.039
 meet-0.038
Token a
Token ID257
Feature activation+3.13e-01

Pos logit contributions
 impressed+0.005
aging+0.005
 pleased+0.005
broken+0.005
ower+0.005
Neg logit contributions
|-0.006
 meets-0.006
 play-0.006
 plays-0.006
 meet-0.006
Token shy
Token ID15800
Feature activation+2.28e+00

Pos logit contributions
 impressed+0.005
 pleased+0.004
broken+0.004
 sudden+0.004
aging+0.004
Neg logit contributions
|-0.009
 meets-0.008
 plays-0.008
 meet-0.008
 play-0.008
Token voice
Token ID3809
Feature activation+1.56e+00

Pos logit contributions
 impressed+0.037
aging+0.035
ower+0.035
broken+0.035
 pleased+0.035
Neg logit contributions
|-0.060
 meets-0.057
 play-0.056
 meet-0.055
 plays-0.055
Token,
Token ID11
Feature activation+5.69e+00

Pos logit contributions
 impressed+0.030
aging+0.029
broken+0.029
 sudden+0.029
ower+0.029
Neg logit contributions
|-0.034
 play-0.034
 plays-0.033
 meet-0.033
 meets-0.033
Token "
Token ID366
Feature activation-4.29e+00

Pos logit contributions
aging+0.109
ower+0.108
broken+0.108
outing+0.107
 impressed+0.107
Neg logit contributions
 play-0.124
|-0.124
 meet-0.122
 meets-0.120
 plays-0.119
TokenI
Token ID40
Feature activation-2.80e+00

Pos logit contributions
 meets+0.078
|+0.077
 plays+0.076
 play+0.074
 fits+0.074
Neg logit contributions
broken-0.096
outing-0.095
 impressed-0.093
ower-0.093
 mesmer-0.091
Token'm
Token ID1101
Feature activation+1.56e+00

Pos logit contributions
|+0.041
 meets+0.036
 plays+0.033
Ready+0.033
Check+0.032
Neg logit contributions
 impressed-0.074
 pleased-0.072
broken-0.070
aging-0.070
ower-0.069
Token just
Token ID655
Feature activation+1.21e+00

Pos logit contributions
 impressed+0.026
aging+0.025
ower+0.024
broken+0.024
 pleased+0.024
Neg logit contributions
|-0.041
 meets-0.038
 play-0.038
 plays-0.037
 meet-0.037
Token so
Token ID523
Feature activation+3.18e+00

Pos logit contributions
 impressed+0.021
aging+0.020
ower+0.020
broken+0.020
 pleased+0.020
Neg logit contributions
|-0.030
 meets-0.029
 play-0.028
 meet-0.028
 plays-0.028
Token excited
Token ID6568
Feature activation+2.51e+00

Pos logit contributions
 impressed+0.040
aging+0.040
broken+0.039
ower+0.039
outing+0.038
Neg logit contributions
|-0.094
 meets-0.089
 play-0.089
 meet-0.087
 plays-0.087
Token so
Token ID523
Feature activation+4.22e+00

Pos logit contributions
ower+0.042
outing+0.042
ment+0.041
ittens+0.040
itive+0.039
Neg logit contributions
 meet-0.044
 play-0.043
 meets-0.041
|-0.040
 have-0.040
Token excited
Token ID6568
Feature activation+3.17e+00

Pos logit contributions
der+0.078
iously+0.078
king+0.076
itive+0.076
ittens+0.076
Neg logit contributions
 play-0.105
 meet-0.102
 have-0.098
 join-0.092
 meets-0.091
Token and
Token ID290
Feature activation+2.26e+00

Pos logit contributions
 impressed+0.067
broken+0.065
outing+0.065
aging+0.064
 pleased+0.064
Neg logit contributions
|-0.065
 play-0.065
 meets-0.062
 plays-0.062
 meet-0.061
Token tried
Token ID3088
Feature activation-2.27e+00

Pos logit contributions
ower+0.046
outing+0.046
aging+0.045
broken+0.045
 impressed+0.043
Neg logit contributions
 play-0.049
 meet-0.047
|-0.047
 meets-0.046
 plays-0.046
Token to
Token ID284
Feature activation-3.57e+00

Pos logit contributions
|+0.034
 play+0.033
 meets+0.032
 meet+0.032
 plays+0.031
Neg logit contributions
ower-0.060
 impressed-0.060
broken-0.060
outing-0.060
aging-0.059
Token pick
Token ID2298
Feature activation-5.72e+00

Pos logit contributions
|+0.048
 play+0.048
 meet+0.046
 meets+0.043
 borrow+0.043
Neg logit contributions
 impressed-0.102
outing-0.101
ower-0.101
aging-0.101
broken-0.100
Token it
Token ID340
Feature activation-3.78e+00

Pos logit contributions
|+0.129
 meets+0.113
 meet+0.109
 play+0.108
 plays+0.108
Neg logit contributions
 impressed-0.118
aging-0.112
 pleased-0.112
 mesmer-0.109
 knowing-0.108
Token,
Token ID11
Feature activation+4.98e+00

Pos logit contributions
|+0.066
 meets+0.060
 play+0.059
 meet+0.058
 plays+0.057
Neg logit contributions
 impressed-0.096
aging-0.094
ower-0.093
broken-0.093
outing-0.092
Token but
Token ID475
Feature activation+2.07e+00

Pos logit contributions
broken+0.081
ower+0.081
aging+0.079
outing+0.079
 impressed+0.076
Neg logit contributions
 play-0.128
 meet-0.124
|-0.122
 meets-0.121
 plays-0.120
Token the
Token ID262
Feature activation-9.16e-01

Pos logit contributions
 impressed+0.049
aging+0.048
ower+0.047
broken+0.047
outing+0.047
Neg logit contributions
|-0.040
 meets-0.036
 play-0.036
 meet-0.035
 plays-0.035
Token butterfly
Token ID35113
Feature activation-1.38e+00

Pos logit contributions
|+0.017
 play+0.016
 meet+0.016
 meets+0.016
 plays+0.015
Neg logit contributions
aging-0.022
 impressed-0.022
outing-0.021
broken-0.021
ower-0.021
Token fun
Token ID1257
Feature activation+2.89e+00

Pos logit contributions
|+0.096
 meets+0.091
 fits+0.090
 borrow+0.088
 plays+0.087
Neg logit contributions
 impressed-0.064
 pleased-0.058
 sudden-0.057
 mesmer-0.055
 knowing-0.053
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
 impressed+0.056
aging+0.053
ower+0.053
broken+0.051
 pleased+0.051
Neg logit contributions
|-0.069
 meets-0.064
 meet-0.062
 borrow-0.062
 plays-0.061
Token The
Token ID383
Feature activation+3.05e-01

Pos logit contributions
|+0.034
 meets+0.033
 meet+0.031
 play+0.031
 borrow+0.031
Neg logit contributions
 impressed-0.038
 pleased-0.036
aging-0.036
ower-0.036
broken-0.035
Token end
Token ID886
Feature activation+2.36e+00

Pos logit contributions
 impressed+0.007
aging+0.007
ower+0.007
broken+0.007
outing+0.007
Neg logit contributions
|-0.006
 meets-0.006
 play-0.006
 meet-0.006
 plays-0.005
Token.
Token ID13
Feature activation-2.61e+00

Pos logit contributions
 impressed+0.043
ower+0.042
broken+0.041
 pleased+0.040
outing+0.040
Neg logit contributions
|-0.057
 meets-0.055
 meet-0.053
 borrow-0.051
 play-0.050
Token<|endoftext|>
Token ID50256
Feature activation-6.22e+00

Pos logit contributions
|+0.048
 meets+0.045
 play+0.044
 plays+0.043
 meet+0.043
Neg logit contributions
 impressed-0.062
aging-0.060
 pleased-0.059
ower-0.059
broken-0.059
TokenOnce
Token ID7454
Feature activation+3.31e-01

Pos logit contributions
 fits+0.144
 meets+0.143
 plays+0.142
 shares+0.141
|+0.140
Neg logit contributions
 impressed-0.093
 sudden-0.087
ower-0.087
aging-0.086
 pleased-0.085
Token upon
Token ID2402
Feature activation-2.45e+00

Pos logit contributions
aging+0.006
 impressed+0.006
broken+0.006
ower+0.006
outing+0.006
Neg logit contributions
|-0.008
 play-0.007
 meets-0.007
 meet-0.007
 plays-0.007
Token a
Token ID257
Feature activation+1.29e+00

Pos logit contributions
|+0.054
 meets+0.050
 play+0.050
 meet+0.049
 plays+0.049
Neg logit contributions
 impressed-0.050
aging-0.049
ower-0.049
outing-0.048
broken-0.048
Token time
Token ID640
Feature activation+1.42e+00

Pos logit contributions
aging+0.035
 impressed+0.035
broken+0.034
outing+0.034
ment+0.034
Neg logit contributions
|-0.023
 play-0.021
 meet-0.018
 plays-0.018
 meets-0.018
Token,
Token ID11
Feature activation+8.95e-01

Pos logit contributions
broken+0.027
 impressed+0.027
outing+0.027
aging+0.027
ower+0.026
Neg logit contributions
|-0.034
 play-0.032
 meets-0.031
 meet-0.031
 plays-0.030