TOP ACTIVATIONS
MAX = 0.150

son a valuable lesson . Working together and being polite will
plenty of water everyday . Working hard is important , but
disagree with a friend . Working together can make the most
. While resting , the bird saw a
shoes and lay down , resting beneath the tree . He
house . Everything she found at the top
her mom was right . Eating different foods was important for
blue bike with t as sels on the handle bars and
sip some water . After resting he was ready to practice
first ." After resting , Lily felt better .
delight . As the story progressed , she curled up close
the two said goodbye before parting ways . The happy little
they were fast asleep , resting in the sun . After
waved goodbye and kept on sailing out of the lake and
She held him closely , resting her hand on his shoulder
to worry about mis pl acing his phone ever again .
how it stayed , blo oming and making loops , for
From then on , whenever there was a disagreement between Marvin
and his life had begun anew . He was a cheerful
rest . While she was resting , she saw a beautiful

MIN ACTIVATIONS
MIN = -0.266

. <|endoftext|> Jack was three years old and he wanted to
. <|endoftext|> Jack was 3 years old . He loved to
. <|endoftext|> John was 3 years old . He loved to
M olly was only three years old . She liked to
. <|endoftext|> Alice was three years old and loved to explore
called Mia who was three years old . She was excited
Mia . She was three years old and the happiest little
Ann . Ann was three years old . One day ,
Sam . She was three years old . Sam was very
<|endoftext|> M olly was 3 years old and very excited .
. <|endoftext|> Max was 3 years old and loved to stir
named Emma who was three years old . She was very
Once there was a three - year - old who found
Lucy . She was three years old and loved to explore
Lily . She was three years old and liked to have
Sally . She was three years old and very happy .
Amy . She was three years old and loved to have
little girl who was three years old . On her third
Tim . Tim was three years old . He was always
Tyler . She was three years old and was having a

INTERVAL 0.046 - 0.150
CONTAINS 19.738%

was so surprised , but he knew he 'd made a
she wanted to write about it in her diary . She
name was R ia and she was three - years -
not have dared to climb the big tree . <|endoftext|> One
little girl went to the field . She saw the dull

INTERVAL -0.058 - 0.046
CONTAINS 68.998%

. L ily was sad because she really wanted
pizza place . He put on his favorite blue cap and
friends . One day , Tim my and his friends went
of buttons and levers . They want to press the buttons
leaves too . But he knew his friend was right .

INTERVAL -0.162 - -0.058
CONTAINS 11.174%

shampoo ?" Jack bark ed and nodded his head
there was a little girl named Lily . She loved to
was peaceful . <|endoftext|> Anna and Ben are friends . They
to herself , " This is too difficult !"
, there was a little girl named Lily . She loved

INTERVAL -0.266 - -0.162
CONTAINS 0.091%

. She was only three years old and she loved going
<|endoftext|> Tom was a big boy who liked to play with
with a picture of a cat on it . Lucy was
Lily . Lily was 3 years old . Lily wanted to
Lucy . She was three years old and loved to go
Token son
Token ID3367
Feature activation-3.86e-02

Pos logit contributions
Finally+0.121
boards+0.120
Soon+0.112
Exc+0.111
At+0.108
Neg logit contributions
e-0.115
 supposed-0.094
ow-0.088
oof-0.087
 yours-0.079
Token a
Token ID257
Feature activation-4.07e-03

Pos logit contributions
e+0.064
 supposed+0.059
oof+0.057
 image+0.052
 possession+0.049
Neg logit contributions
Finally-0.061
boards-0.061
Soon-0.057
At-0.056
 Finally-0.055
Token valuable
Token ID8119
Feature activation+1.58e-02

Pos logit contributions
e+0.008
 supposed+0.007
ow+0.007
oof+0.006
 seen+0.006
Neg logit contributions
Finally-0.006
Soon-0.006
boards-0.005
At-0.005
Eventually-0.005
Token lesson
Token ID11483
Feature activation-6.05e-02

Pos logit contributions
Finally+0.014
Soon+0.012
boards+0.012
Days+0.011
oral+0.010
Neg logit contributions
e-0.043
 supposed-0.035
oof-0.034
 seen-0.034
ow-0.034
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
e+0.109
 supposed+0.094
oof+0.088
 image+0.087
ow+0.083
Neg logit contributions
Finally-0.098
boards-0.096
Soon-0.091
orting-0.088
At-0.087
Token Working
Token ID14594
Feature activation+1.50e-01

Pos logit contributions
Finally+0.016
Days+0.016
boards+0.015
Soon+0.014
orting+0.014
Neg logit contributions
e-0.019
 supposed-0.017
 such-0.016
ow-0.016
 yours-0.015
Token together
Token ID1978
Feature activation+9.17e-02

Pos logit contributions
Finally+0.232
boards+0.219
 Pole+0.207
Soon+0.203
Days+0.203
Neg logit contributions
e-0.278
 supposed-0.253
 image-0.224
 such-0.217
 possession-0.213
Token and
Token ID290
Feature activation+1.01e-02

Pos logit contributions
Finally+0.174
boards+0.163
Soon+0.158
Days+0.157
Exc+0.156
Neg logit contributions
e-0.151
 supposed-0.128
 image-0.111
ow-0.108
 such-0.107
Token being
Token ID852
Feature activation+6.24e-02

Pos logit contributions
Finally+0.019
boards+0.018
Soon+0.017
Days+0.017
Eventually+0.016
Neg logit contributions
e-0.017
 supposed-0.014
 image-0.013
ow-0.012
oof-0.012
Token polite
Token ID23507
Feature activation+1.67e-02

Pos logit contributions
Finally+0.109
Soon+0.101
boards+0.098
Eventually+0.096
At+0.095
Neg logit contributions
e-0.104
 supposed-0.090
oof-0.087
 image-0.083
ow-0.083
Token will
Token ID481
Feature activation+2.69e-02

Pos logit contributions
Finally+0.032
boards+0.029
Soon+0.029
Eventually+0.028
Exc+0.028
Neg logit contributions
e-0.026
 supposed-0.023
 image-0.020
ow-0.019
oof-0.019
Token plenty
Token ID6088
Feature activation+5.63e-02

Pos logit contributions
Finally+0.112
boards+0.105
Soon+0.102
Days+0.099
Eventually+0.099
Neg logit contributions
e-0.090
 supposed-0.076
 image-0.067
oof-0.067
ow-0.066
Token of
Token ID286
Feature activation-1.05e-02

Pos logit contributions
Finally+0.126
boards+0.116
+0.113
Days+0.113
Soon+0.113
Neg logit contributions
e-0.068
 supposed-0.055
ow-0.045
 such-0.045
 yours-0.045
Token water
Token ID1660
Feature activation+5.48e-02

Pos logit contributions
e+0.021
 supposed+0.019
ow+0.017
oof+0.017
 image+0.017
Neg logit contributions
Finally-0.015
Soon-0.013
boards-0.013
At-0.013
Eventually-0.013
Token everyday
Token ID10908
Feature activation+8.10e-02

Pos logit contributions
Finally+0.111
Soon+0.100
boards+0.098
Eventually+0.098
Days+0.097
Neg logit contributions
e-0.083
 supposed-0.071
ow-0.062
 image-0.060
 yours-0.058
Token.
Token ID13
Feature activation+5.15e-03

Pos logit contributions
Finally+0.148
boards+0.139
Soon+0.135
Eventually+0.130
At+0.129
Neg logit contributions
e-0.133
 supposed-0.115
oof-0.102
 image-0.102
ow-0.099
Token Working
Token ID14594
Feature activation+1.46e-01

Pos logit contributions
Finally+0.007
boards+0.006
Soon+0.006
Days+0.006
orting+0.006
Neg logit contributions
e-0.011
 supposed-0.010
oof-0.009
ow-0.009
 image-0.009
Token hard
Token ID1327
Feature activation+1.54e-02

Pos logit contributions
Finally+0.228
boards+0.214
 Pole+0.202
Soon+0.198
Exc+0.195
Neg logit contributions
e-0.281
 supposed-0.243
 such-0.217
 image-0.215
 possession-0.210
Token is
Token ID318
Feature activation-7.27e-03

Pos logit contributions
Finally+0.032
Exc+0.029
Days+0.029
boards+0.028
Eventually+0.028
Neg logit contributions
e-0.020
 supposed-0.018
 such-0.016
ow-0.014
 yours-0.014
Token important
Token ID1593
Feature activation+2.05e-02

Pos logit contributions
e+0.011
 supposed+0.010
 image+0.008
oof+0.008
ow+0.008
Neg logit contributions
Finally-0.014
boards-0.014
Soon-0.013
Exc-0.013
orting-0.012
Token,
Token ID11
Feature activation+8.18e-02

Pos logit contributions
Finally+0.036
boards+0.034
Soon+0.033
Eventually+0.031
orting+0.031
Neg logit contributions
e-0.035
 supposed-0.030
oof-0.027
 image-0.027
ow-0.026
Token but
Token ID475
Feature activation-4.21e-03

Pos logit contributions
Finally+0.133
Soon+0.122
boards+0.122
At+0.117
Eventually+0.116
Neg logit contributions
e-0.148
 supposed-0.131
oof-0.120
 image-0.119
ow-0.117
Token disagree
Token ID12546
Feature activation+3.05e-02

Pos logit contributions
Finally+0.025
boards+0.024
Soon+0.023
Eventually+0.022
At+0.022
Neg logit contributions
e-0.020
 supposed-0.017
 image-0.015
oof-0.015
ow-0.014
Token with
Token ID351
Feature activation+1.24e-02

Pos logit contributions
Finally+0.054
boards+0.049
Soon+0.048
Eventually+0.047
Days+0.046
Neg logit contributions
e-0.053
 supposed-0.047
 image-0.041
ow-0.041
oof-0.040
Token a
Token ID257
Feature activation-1.44e-02

Pos logit contributions
Finally+0.021
Soon+0.020
boards+0.020
Days+0.019
Eventually+0.019
Neg logit contributions
e-0.021
 supposed-0.018
oof-0.017
ow-0.016
 image-0.016
Token friend
Token ID1545
Feature activation-5.68e-02

Pos logit contributions
e+0.025
 supposed+0.023
 image+0.020
oof+0.020
ow+0.020
Neg logit contributions
Finally-0.024
boards-0.023
Soon-0.022
Eventually-0.021
orting-0.021
Token.
Token ID13
Feature activation-1.28e-03

Pos logit contributions
e+0.102
 supposed+0.097
oof+0.094
 image+0.088
leaf+0.086
Neg logit contributions
boards-0.076
orting-0.073
Finally-0.072
At-0.069
Soon-0.067
Token Working
Token ID14594
Feature activation+1.45e-01

Pos logit contributions
e+0.003
 supposed+0.003
oof+0.002
 image+0.002
ow+0.002
Neg logit contributions
Finally-0.002
boards-0.001
Soon-0.001
Days-0.001
Eventually-0.001
Token together
Token ID1978
Feature activation+8.72e-02

Pos logit contributions
Finally+0.207
boards+0.194
Soon+0.179
Exc+0.179
Days+0.178
Neg logit contributions
e-0.288
 supposed-0.262
 image-0.236
oof-0.231
 yours-0.225
Token can
Token ID460
Feature activation+6.23e-03

Pos logit contributions
Finally+0.148
boards+0.141
Exc+0.133
Soon+0.133
Days+0.131
Neg logit contributions
e-0.160
 supposed-0.139
 image-0.124
ow-0.120
 yours-0.120
Token make
Token ID787
Feature activation+3.44e-02

Pos logit contributions
boards+0.013
Finally+0.013
Soon+0.012
side+0.012
 Finally+0.012
Neg logit contributions
e-0.008
 supposed-0.007
ow-0.006
 image-0.006
oof-0.006
Token the
Token ID262
Feature activation+5.48e-02

Pos logit contributions
Finally+0.049
Soon+0.045
boards+0.043
Eventually+0.043
orting+0.042
Neg logit contributions
e-0.068
 supposed-0.061
oof-0.057
ow-0.056
 image-0.055
Token most
Token ID749
Feature activation-1.06e-02

Pos logit contributions
Finally+0.089
boards+0.083
Soon+0.080
Eventually+0.077
At+0.076
Neg logit contributions
e-0.102
 supposed-0.090
oof-0.082
ow-0.080
 image-0.080
Token.
Token ID13
Feature activation-9.79e-03

Pos logit contributions
Finally+0.046
boards+0.043
Soon+0.041
At+0.040
side+0.040
Neg logit contributions
e-0.049
 supposed-0.045
oof-0.041
 image-0.040
leaf-0.038
Token
Token ID220
Feature activation+5.48e-02

Pos logit contributions
e+0.013
 supposed+0.010
ow+0.010
 a+0.010
oof+0.010
Neg logit contributions
Days-0.019
uing-0.018
boards-0.018
Finally-0.018
orting-0.017
Token
Token ID198
Feature activation+5.47e-02

Pos logit contributions
boards+0.075
Finally+0.074
Days+0.069
Exc+0.067
Soon+0.067
Neg logit contributions
e-0.111
 supposed-0.099
ow-0.091
oof-0.090
 image-0.084
Token
Token ID198
Feature activation+5.37e-02

Pos logit contributions
boards+0.086
orting+0.080
+0.080
Days+0.078
obb+0.076
Neg logit contributions
e-0.101
 supposed-0.082
oof-0.077
ow-0.072
 seen-0.071
TokenWhile
Token ID3633
Feature activation+3.58e-02

Pos logit contributions
boards+0.071
orting+0.068
add+0.064
lights+0.063
angers+0.063
Neg logit contributions
e-0.114
 supposed-0.093
oof-0.086
ggie-0.081
 image-0.080
Token resting
Token ID19186
Feature activation+1.42e-01

Pos logit contributions
Finally+0.081
boards+0.080
Days+0.075
orting+0.074
Exc+0.074
Neg logit contributions
e-0.045
 supposed-0.036
 image-0.030
ow-0.029
oof-0.029
Token,
Token ID11
Feature activation+8.86e-02

Pos logit contributions
Finally+0.333
Days+0.313
boards+0.312
Eventually+0.302
 Pole+0.295
Neg logit contributions
e-0.167
 such-0.109
ow-0.109
 a-0.100
 ear-0.100
Token the
Token ID262
Feature activation+5.93e-02

Pos logit contributions
Finally+0.204
boards+0.197
Soon+0.190
Days+0.186
orting+0.186
Neg logit contributions
e-0.103
 supposed-0.081
 image-0.069
oof-0.067
ow-0.066
Token bird
Token ID6512
Feature activation+1.33e-03

Pos logit contributions
Finally+0.117
boards+0.110
Soon+0.107
Eventually+0.104
orting+0.103
Neg logit contributions
e-0.089
 supposed-0.077
 image-0.067
oof-0.067
ow-0.065
Token saw
Token ID2497
Feature activation+5.41e-02

Pos logit contributions
Finally+0.002
boards+0.002
orting+0.002
Soon+0.002
Days+0.002
Neg logit contributions
e-0.002
 supposed-0.002
 seen-0.002
ow-0.002
 image-0.002
Token a
Token ID257
Feature activation-2.58e-02

Pos logit contributions
Finally+0.111
Soon+0.109
boards+0.107
Exc+0.103
Eventually+0.102
Neg logit contributions
e-0.074
 supposed-0.064
oof-0.052
ow-0.049
 possession-0.048
Token shoes
Token ID10012
Feature activation+7.43e-03

Pos logit contributions
Finally+0.058
boards+0.056
Soon+0.053
orting+0.053
At+0.052
Neg logit contributions
e-0.066
 supposed-0.059
ow-0.051
 image-0.050
oof-0.050
Token and
Token ID290
Feature activation-3.88e-03

Pos logit contributions
Finally+0.011
boards+0.010
Soon+0.010
orting+0.010
At+0.010
Neg logit contributions
e-0.015
 supposed-0.013
oof-0.012
 image-0.012
leaf-0.011
Token lay
Token ID3830
Feature activation+8.29e-02

Pos logit contributions
e+0.009
 supposed+0.008
 image+0.007
oof+0.007
ow+0.007
Neg logit contributions
Finally-0.005
boards-0.004
Soon-0.004
At-0.004
Exc-0.004
Token down
Token ID866
Feature activation+9.93e-02

Pos logit contributions
Finally+0.148
boards+0.139
Soon+0.135
Eventually+0.130
At+0.128
Neg logit contributions
e-0.141
 supposed-0.123
 image-0.109
oof-0.109
ow-0.106
Token,
Token ID11
Feature activation+9.25e-02

Pos logit contributions
Finally+0.208
boards+0.191
Days+0.189
Eventually+0.187
Soon+0.186
Neg logit contributions
e-0.140
 supposed-0.119
ow-0.103
 image-0.100
 such-0.094
Token resting
Token ID19186
Feature activation+1.42e-01

Pos logit contributions
Finally+0.163
boards+0.153
Soon+0.150
Eventually+0.143
At+0.143
Neg logit contributions
e-0.157
 supposed-0.137
oof-0.123
 image-0.122
ow-0.119
Token beneath
Token ID11061
Feature activation+1.18e-02

Pos logit contributions
Finally+0.302
Days+0.273
Soon+0.271
Eventually+0.270
boards+0.267
Neg logit contributions
e-0.204
 supposed-0.158
 image-0.145
 such-0.137
 ear-0.135
Token the
Token ID262
Feature activation+6.24e-02

Pos logit contributions
Finally+0.020
boards+0.020
Soon+0.019
Exc+0.019
 Finally+0.018
Neg logit contributions
e-0.019
 supposed-0.019
oof-0.016
ow-0.015
 image-0.015
Token tree
Token ID5509
Feature activation+4.59e-02

Pos logit contributions
Finally+0.094
boards+0.088
Soon+0.085
Eventually+0.081
At+0.080
Neg logit contributions
e-0.123
 supposed-0.110
 image-0.099
oof-0.099
ow-0.097
Token.
Token ID13
Feature activation+2.14e-03

Pos logit contributions
Finally+0.078
boards+0.074
Soon+0.071
At+0.068
Eventually+0.068
Neg logit contributions
e-0.079
 supposed-0.071
oof-0.065
 image-0.062
ow-0.060
Token He
Token ID679
Feature activation+9.47e-02

Pos logit contributions
Days+0.003
Finally+0.003
boards+0.003
uing+0.003
orting+0.003
Neg logit contributions
e-0.004
 supposed-0.003
ow-0.003
oof-0.003
 ear-0.003
Token house
Token ID2156
Feature activation+5.84e-02

Pos logit contributions
Finally+0.046
boards+0.042
Soon+0.041
At+0.040
 Finally+0.040
Neg logit contributions
e-0.065
 supposed-0.058
oof-0.053
leaf-0.052
ow-0.052
Token.
Token ID13
Feature activation+4.97e-03

Pos logit contributions
Finally+0.095
boards+0.093
Soon+0.088
At+0.086
orting+0.085
Neg logit contributions
e-0.101
 supposed-0.090
oof-0.084
 image-0.080
leaf-0.077
Token
Token ID220
Feature activation+5.94e-02

Pos logit contributions
Finally+0.007
boards+0.006
Days+0.006
Soon+0.006
orting+0.006
Neg logit contributions
e-0.010
 supposed-0.009
oof-0.008
ow-0.008
 image-0.008
Token
Token ID198
Feature activation+6.16e-02

Pos logit contributions
boards+0.076
Finally+0.075
Days+0.071
Soon+0.069
Exc+0.068
Neg logit contributions
e-0.124
 supposed-0.113
ow-0.105
oof-0.100
 such-0.094
Token
Token ID198
Feature activation+6.12e-02

Pos logit contributions
boards+0.091
orting+0.083
ways+0.082
+0.081
Finally+0.080
Neg logit contributions
e-0.120
 supposed-0.102
oof-0.094
ow-0.089
leaf-0.087
TokenEverything
Token ID19693
Feature activation+1.42e-01

Pos logit contributions
boards+0.074
 Pole+0.069
orting+0.066
ways+0.065
angers+0.060
Neg logit contributions
e-0.136
 supposed-0.116
oof-0.106
leaf-0.102
 such-0.101
Token she
Token ID673
Feature activation+1.01e-01

Pos logit contributions
Finally+0.286
Soon+0.257
Eventually+0.256
boards+0.256
Days+0.255
Neg logit contributions
e-0.202
 supposed-0.180
ow-0.153
 image-0.153
 such-0.151
Token found
Token ID1043
Feature activation+8.88e-02

Pos logit contributions
Finally+0.126
boards+0.120
Soon+0.113
At+0.112
Exc+0.106
Neg logit contributions
e-0.215
 supposed-0.195
oof-0.186
 image-0.184
ow-0.176
Token at
Token ID379
Feature activation+1.77e-02

Pos logit contributions
Finally+0.174
boards+0.165
Soon+0.160
Eventually+0.155
At+0.154
Neg logit contributions
e-0.133
 supposed-0.114
oof-0.100
 image-0.100
ow-0.097
Token the
Token ID262
Feature activation+6.00e-02

Pos logit contributions
Finally+0.030
Soon+0.029
stairs+0.028
side+0.028
 Finally+0.028
Neg logit contributions
e-0.029
 supposed-0.024
oof-0.023
 image-0.022
leaf-0.021
Token top
Token ID1353
Feature activation+5.31e-02

Pos logit contributions
Finally+0.075
boards+0.068
Soon+0.065
Eventually+0.061
Days+0.061
Neg logit contributions
e-0.135
 supposed-0.122
oof-0.112
 image-0.111
ow-0.110
Token her
Token ID607
Feature activation+5.04e-02

Pos logit contributions
Finally+0.131
boards+0.127
Soon+0.124
At+0.122
Exc+0.119
Neg logit contributions
e-0.164
 supposed-0.142
oof-0.138
ow-0.134
leaf-0.126
Token mom
Token ID1995
Feature activation-3.64e-02

Pos logit contributions
Finally+0.058
boards+0.058
 Finally+0.054
At+0.051
Soon+0.050
Neg logit contributions
e-0.111
 supposed-0.096
leaf-0.090
ow-0.088
oof-0.088
Token was
Token ID373
Feature activation-1.03e-02

Pos logit contributions
e+0.072
 supposed+0.065
oof+0.061
 image+0.060
 yours+0.058
Neg logit contributions
Finally-0.050
boards-0.047
Soon-0.044
At-0.043
orting-0.042
Token right
Token ID826
Feature activation+2.11e-02

Pos logit contributions
e+0.020
oof+0.017
 supposed+0.016
 image+0.016
ow+0.016
Neg logit contributions
Finally-0.015
Soon-0.014
boards-0.014
At-0.013
orting-0.013
Token.
Token ID13
Feature activation+7.83e-03

Pos logit contributions
Finally+0.038
boards+0.036
Soon+0.035
Eventually+0.033
At+0.033
Neg logit contributions
e-0.036
 supposed-0.031
 image-0.027
oof-0.027
ow-0.027
Token Eating
Token ID37537
Feature activation+1.41e-01

Pos logit contributions
Finally+0.014
Days+0.013
Soon+0.013
Eventually+0.012
boards+0.012
Neg logit contributions
 supposed-0.009
e-0.009
 ear-0.009
 yours-0.009
 image-0.009
Token different
Token ID1180
Feature activation+6.81e-02

Pos logit contributions
Finally+0.274
boards+0.255
Soon+0.248
Days+0.243
Eventually+0.243
Neg logit contributions
e-0.220
 supposed-0.190
 image-0.168
oof-0.163
ow-0.161
Token foods
Token ID9013
Feature activation+2.23e-02

Pos logit contributions
Finally+0.121
boards+0.121
Soon+0.113
orting+0.110
At+0.110
Neg logit contributions
e-0.108
 supposed-0.095
oof-0.083
ow-0.082
 image-0.079
Token was
Token ID373
Feature activation-1.93e-02

Pos logit contributions
Finally+0.042
Days+0.038
Exc+0.038
Soon+0.038
Eventually+0.037
Neg logit contributions
e-0.037
 supposed-0.032
ow-0.028
 image-0.028
 such-0.028
Token important
Token ID1593
Feature activation+1.62e-02

Pos logit contributions
e+0.030
 supposed+0.027
 image+0.023
oof+0.023
ow+0.022
Neg logit contributions
Finally-0.037
boards-0.035
Soon-0.034
Eventually-0.032
Exc-0.032
Token for
Token ID329
Feature activation-1.30e-02

Pos logit contributions
Finally+0.027
boards+0.026
Soon+0.025
orting+0.024
Eventually+0.023
Neg logit contributions
e-0.029
 supposed-0.025
oof-0.023
 image-0.023
ow-0.022
Token blue
Token ID4171
Feature activation+2.89e-02

Pos logit contributions
e+0.027
 supposed+0.024
oof+0.022
ow+0.021
 image+0.020
Neg logit contributions
Finally-0.018
Soon-0.017
boards-0.016
Exc-0.015
Days-0.015
Token bike
Token ID7161
Feature activation+1.82e-02

Pos logit contributions
Finally+0.044
boards+0.043
Soon+0.040
orting+0.040
Exc+0.040
Neg logit contributions
 supposed-0.052
e-0.050
oof-0.042
beh-0.041
ow-0.040
Token with
Token ID351
Feature activation-6.04e-04

Pos logit contributions
Finally+0.029
boards+0.028
Soon+0.026
At+0.025
orting+0.025
Neg logit contributions
e-0.033
 supposed-0.030
oof-0.028
 image-0.026
ow-0.025
Token t
Token ID256
Feature activation-7.14e-03

Pos logit contributions
e+0.001
 supposed+0.001
oof+0.001
ow+0.001
 image+0.001
Neg logit contributions
Finally-0.001
boards-0.001
Soon-0.001
At-0.001
Eventually-0.001
Tokenas
Token ID292
Feature activation-1.57e-02

Pos logit contributions
e+0.013
 supposed+0.012
oof+0.011
 image+0.011
 ear+0.010
Neg logit contributions
Finally-0.011
boards-0.011
Soon-0.010
side-0.009
Eventually-0.009
Tokensels
Token ID14002
Feature activation+1.41e-01

Pos logit contributions
e+0.027
 supposed+0.025
oof+0.023
 image+0.022
ow+0.021
Neg logit contributions
Finally-0.025
boards-0.024
orting-0.023
Soon-0.023
side-0.022
Token on
Token ID319
Feature activation-1.40e-03

Pos logit contributions
Finally+0.275
Soon+0.252
boards+0.244
Eventually+0.244
Days+0.243
Neg logit contributions
e-0.213
 supposed-0.188
 image-0.166
ow-0.160
 seen-0.151
Token the
Token ID262
Feature activation+6.26e-02

Pos logit contributions
e+0.002
 supposed+0.002
oof+0.002
ow+0.002
 image+0.001
Neg logit contributions
Finally-0.003
Soon-0.003
boards-0.003
At-0.003
Exc-0.002
Token handle
Token ID5412
Feature activation+2.29e-02

Pos logit contributions
Finally+0.099
boards+0.094
Soon+0.090
At+0.086
Exc+0.085
Neg logit contributions
e-0.116
 supposed-0.104
oof-0.095
ow-0.094
 image-0.090
Tokenbars
Token ID34046
Feature activation+4.24e-02

Pos logit contributions
Finally+0.041
boards+0.038
Soon+0.038
Eventually+0.036
Days+0.036
Neg logit contributions
e-0.038
 supposed-0.033
 image-0.029
oof-0.029
ow-0.029
Token and
Token ID290
Feature activation-2.47e-03

Pos logit contributions
Finally+0.069
boards+0.067
Soon+0.063
orting+0.061
At+0.061
Neg logit contributions
e-0.075
 supposed-0.067
oof-0.061
 image-0.060
ow-0.057
Token sip
Token ID31145
Feature activation+3.59e-02

Pos logit contributions
e+0.016
 supposed+0.015
oof+0.013
ow+0.013
 image+0.013
Neg logit contributions
Finally-0.015
Soon-0.014
boards-0.014
At-0.014
Eventually-0.013
Token some
Token ID617
Feature activation+2.67e-02

Pos logit contributions
Finally+0.082
Days+0.075
Eventually+0.074
+0.073
Soon+0.073
Neg logit contributions
e-0.044
 such-0.032
 supposed-0.032
 image-0.031
 yours-0.027
Token water
Token ID1660
Feature activation+5.60e-02

Pos logit contributions
Finally+0.048
boards+0.045
Soon+0.044
Days+0.042
Eventually+0.042
Neg logit contributions
e-0.044
 supposed-0.038
 image-0.034
oof-0.034
ow-0.033
Token.
Token ID13
Feature activation-5.76e-03

Pos logit contributions
Finally+0.126
Soon+0.114
Eventually+0.113
Days+0.113
Exc+0.110
Neg logit contributions
e-0.073
 supposed-0.061
 image-0.054
ow-0.053
 such-0.049
Token After
Token ID2293
Feature activation+4.12e-02

Pos logit contributions
e+0.011
 ear+0.009
 supposed+0.009
 image+0.009
oof+0.009
Neg logit contributions
Finally-0.008
Days-0.008
orting-0.008
uing-0.007
Soon-0.007
Token resting
Token ID19186
Feature activation+1.40e-01

Pos logit contributions
Finally+0.093
boards+0.089
Soon+0.085
orting+0.084
Exc+0.084
Neg logit contributions
e-0.051
 supposed-0.041
oof-0.033
 image-0.033
ow-0.032
Token he
Token ID339
Feature activation+1.12e-01

Pos logit contributions
Finally+0.304
Days+0.279
Exc+0.277
boards+0.276
Eventually+0.270
Neg logit contributions
e-0.189
 such-0.138
ow-0.124
 supposed-0.124
 image-0.119
Token was
Token ID373
Feature activation-1.65e-02

Pos logit contributions
Finally+0.177
boards+0.166
Soon+0.161
Eventually+0.153
At+0.152
Neg logit contributions
e-0.209
 supposed-0.188
oof-0.169
 image-0.169
ow-0.165
Token ready
Token ID3492
Feature activation+6.18e-02

Pos logit contributions
e+0.025
 supposed+0.022
ow+0.018
 image+0.018
oof+0.017
Neg logit contributions
Finally-0.033
boards-0.032
Exc-0.030
Soon-0.030
-0.030
Token to
Token ID284
Feature activation-1.56e-02

Pos logit contributions
Finally+0.130
Days+0.120
Exc+0.117
Soon+0.117
boards+0.117
Neg logit contributions
e-0.086
 supposed-0.075
ow-0.065
 such-0.062
 image-0.061
Token practice
Token ID3357
Feature activation+5.94e-02

Pos logit contributions
e+0.025
 supposed+0.023
oof+0.020
 image+0.020
ow+0.019
Neg logit contributions
Finally-0.028
boards-0.025
Soon-0.025
At-0.024
orting-0.024
Token first
Token ID717
Feature activation+1.18e-02

Pos logit contributions
Finally+0.136
boards+0.128
Soon+0.126
Eventually+0.120
orting+0.120
Neg logit contributions
e-0.122
 supposed-0.105
oof-0.095
 image-0.094
ow-0.093
Token."
Token ID526
Feature activation+1.81e-03

Pos logit contributions
Finally+0.026
boards+0.025
Days+0.025
Exc+0.025
Soon+0.024
Neg logit contributions
e-0.015
 supposed-0.012
ow-0.010
 yours-0.010
 image-0.010
Token
Token ID198
Feature activation+5.49e-02

Pos logit contributions
Finally+0.003
boards+0.003
Days+0.003
Soon+0.002
add+0.002
Neg logit contributions
e-0.003
 supposed-0.003
oof-0.003
ow-0.003
 image-0.003
Token
Token ID198
Feature activation+5.51e-02

Pos logit contributions
boards+0.068
Finally+0.068
Days+0.060
orting+0.060
Soon+0.060
Neg logit contributions
e-0.118
 supposed-0.104
oof-0.099
ow-0.095
 image-0.093
TokenAfter
Token ID3260
Feature activation+4.51e-02

Pos logit contributions
boards+0.064
 Pole+0.058
orting+0.056
add+0.054
ways+0.051
Neg logit contributions
e-0.124
 supposed-0.107
oof-0.103
 image-0.100
othy-0.093
Token resting
Token ID19186
Feature activation+1.40e-01

Pos logit contributions
Finally+0.098
boards+0.095
Soon+0.089
orting+0.089
Exc+0.089
Neg logit contributions
e-0.059
 supposed-0.049
 image-0.040
ow-0.040
oof-0.039
Token,
Token ID11
Feature activation+8.64e-02

Pos logit contributions
Finally+0.310
Days+0.283
boards+0.280
Exc+0.279
Eventually+0.276
Neg logit contributions
e-0.182
 such-0.133
ow-0.132
 image-0.130
 supposed-0.129
Token Lily
Token ID20037
Feature activation+6.93e-03

Pos logit contributions
Finally+0.141
boards+0.136
orting+0.128
Soon+0.126
Days+0.125
Neg logit contributions
e-0.155
 supposed-0.133
 image-0.123
ow-0.118
oof-0.117
Token felt
Token ID2936
Feature activation+1.43e-02

Pos logit contributions
Finally+0.012
boards+0.011
Soon+0.011
Eventually+0.010
orting+0.010
Neg logit contributions
e-0.012
 supposed-0.011
 image-0.010
oof-0.010
ow-0.009
Token better
Token ID1365
Feature activation+6.39e-02

Pos logit contributions
Finally+0.028
boards+0.027
Soon+0.026
Exc+0.025
orting+0.025
Neg logit contributions
e-0.021
 supposed-0.019
ow-0.015
 image-0.015
oof-0.015
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
Finally+0.114
boards+0.108
Soon+0.105
Eventually+0.100
At+0.100
Neg logit contributions
e-0.107
 supposed-0.094
oof-0.084
 image-0.083
ow-0.081
Token delight
Token ID10974
Feature activation+4.84e-02

Pos logit contributions
Finally+0.052
Soon+0.048
boards+0.048
 Finally+0.047
Eventually+0.047
Neg logit contributions
e-0.037
 supposed-0.035
oof-0.033
ow-0.031
beh-0.030
Token.
Token ID13
Feature activation+5.54e-03

Pos logit contributions
Finally+0.091
boards+0.085
Soon+0.083
Eventually+0.080
Exc+0.079
Neg logit contributions
e-0.077
 supposed-0.067
 image-0.059
oof-0.059
ow-0.058
Token As
Token ID1081
Feature activation+6.82e-02

Pos logit contributions
orting+0.008
Finally+0.008
Days+0.008
boards+0.007
add+0.007
Neg logit contributions
e-0.010
 supposed-0.009
 image-0.009
 ear-0.008
othy-0.008
Token the
Token ID262
Feature activation+6.80e-02

Pos logit contributions
Finally+0.080
boards+0.073
Soon+0.070
Eventually+0.065
Exc+0.064
Neg logit contributions
e-0.157
 supposed-0.142
 image-0.131
oof-0.131
ow-0.129
Token story
Token ID1621
Feature activation+2.54e-02

Pos logit contributions
Finally+0.110
boards+0.108
Soon+0.102
At+0.097
Exc+0.094
Neg logit contributions
e-0.121
 supposed-0.109
oof-0.097
 possession-0.096
 yours-0.095
Token progressed
Token ID28775
Feature activation+1.37e-01

Pos logit contributions
Finally+0.047
orting+0.043
Days+0.043
Eventually+0.042
Soon+0.041
Neg logit contributions
e-0.041
 image-0.031
 supposed-0.031
ow-0.031
 ever-0.028
Token,
Token ID11
Feature activation+1.05e-01

Pos logit contributions
Finally+0.266
Days+0.240
Soon+0.239
boards+0.237
Eventually+0.237
Neg logit contributions
e-0.220
 supposed-0.178
ow-0.164
 image-0.162
 such-0.159
Token she
Token ID673
Feature activation+1.14e-01

Pos logit contributions
Finally+0.226
boards+0.215
Soon+0.206
Days+0.206
orting+0.203
Neg logit contributions
e-0.136
 supposed-0.112
 image-0.100
oof-0.090
ow-0.089
Token curled
Token ID41108
Feature activation+5.29e-02

Pos logit contributions
Finally+0.180
boards+0.168
Soon+0.163
Eventually+0.155
At+0.154
Neg logit contributions
e-0.215
 supposed-0.191
 image-0.171
oof-0.171
ow-0.168
Token up
Token ID510
Feature activation+7.42e-02

Pos logit contributions
Finally+0.108
boards+0.102
Soon+0.100
Eventually+0.096
 Finally+0.096
Neg logit contributions
e-0.075
 supposed-0.066
oof-0.056
 image-0.055
ow-0.053
Token close
Token ID1969
Feature activation+3.23e-02

Pos logit contributions
Finally+0.126
boards+0.120
Soon+0.116
At+0.111
Exc+0.110
Neg logit contributions
e-0.129
 supposed-0.114
oof-0.103
 image-0.100
ow-0.098
Token the
Token ID262
Feature activation+6.72e-02

Pos logit contributions
e+0.026
 supposed+0.024
ow+0.021
 image+0.021
oof+0.021
Neg logit contributions
Finally-0.015
Soon-0.014
At-0.013
 Finally-0.013
Exc-0.013
Token two
Token ID734
Feature activation+3.41e-02

Pos logit contributions
Finally+0.110
Soon+0.100
boards+0.099
Eventually+0.095
Days+0.095
Neg logit contributions
e-0.124
 supposed-0.111
oof-0.103
ow-0.099
 image-0.095
Token said
Token ID531
Feature activation+2.50e-02

Pos logit contributions
Finally+0.065
boards+0.061
Soon+0.060
Eventually+0.058
orting+0.057
Neg logit contributions
e-0.054
 supposed-0.046
 image-0.040
oof-0.040
ow-0.039
Token goodbye
Token ID24829
Feature activation+3.49e-02

Pos logit contributions
Finally+0.047
boards+0.044
Soon+0.043
At+0.041
Eventually+0.041
Neg logit contributions
e-0.040
 supposed-0.034
 image-0.030
oof-0.030
ow-0.030
Token before
Token ID878
Feature activation+8.77e-03

Pos logit contributions
Finally+0.066
boards+0.062
Soon+0.060
Eventually+0.058
Exc+0.058
Neg logit contributions
e-0.055
 supposed-0.048
 image-0.042
oof-0.042
ow-0.041
Token parting
Token ID44823
Feature activation+1.37e-01

Pos logit contributions
Finally+0.017
boards+0.016
Soon+0.016
Eventually+0.015
Days+0.015
Neg logit contributions
e-0.013
 supposed-0.012
 image-0.010
oof-0.010
ow-0.010
Token ways
Token ID2842
Feature activation+2.73e-02

Pos logit contributions
Finally+0.234
boards+0.221
Soon+0.219
orting+0.218
 Finally+0.209
Neg logit contributions
e-0.221
 supposed-0.211
oof-0.181
 image-0.172
ow-0.170
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
Finally+0.046
boards+0.043
Soon+0.042
orting+0.040
At+0.040
Neg logit contributions
e-0.048
 supposed-0.042
oof-0.039
 image-0.038
ow-0.037
Token The
Token ID383
Feature activation+7.49e-02

Pos logit contributions
Finally+0.018
boards+0.017
Soon+0.016
Days+0.015
orting+0.015
Neg logit contributions
e-0.024
 supposed-0.021
oof-0.020
ow-0.020
 image-0.019
Token happy
Token ID3772
Feature activation+3.80e-02

Pos logit contributions
Finally+0.125
Soon+0.113
boards+0.113
Days+0.108
Eventually+0.108
Neg logit contributions
e-0.138
 supposed-0.124
oof-0.111
ow-0.109
 image-0.107
Token little
Token ID1310
Feature activation+4.02e-03

Pos logit contributions
Finally+0.060
Soon+0.056
Exc+0.056
At+0.055
boards+0.054
Neg logit contributions
e-0.069
 supposed-0.063
oof-0.056
ow-0.054
 yours-0.053
Token they
Token ID484
Feature activation+8.95e-02

Pos logit contributions
Finally+0.114
boards+0.101
Soon+0.099
Days+0.095
Eventually+0.094
Neg logit contributions
e-0.161
 supposed-0.144
 seen-0.128
 image-0.127
ow-0.127
Token were
Token ID547
Feature activation+5.92e-03

Pos logit contributions
Finally+0.138
boards+0.131
Soon+0.125
orting+0.121
Days+0.119
Neg logit contributions
e-0.181
 supposed-0.152
ow-0.140
 image-0.137
oof-0.136
Token fast
Token ID3049
Feature activation-1.02e-02

Pos logit contributions
Finally+0.010
Soon+0.009
boards+0.009
Eventually+0.009
At+0.009
Neg logit contributions
e-0.010
 supposed-0.009
oof-0.008
 image-0.008
 possession-0.008
Token asleep
Token ID16039
Feature activation+7.53e-02

Pos logit contributions
e+0.021
 supposed+0.019
oof+0.018
 image+0.017
ow+0.017
Neg logit contributions
Finally-0.013
boards-0.013
Soon-0.012
orting-0.011
At-0.011
Token,
Token ID11
Feature activation+8.48e-02

Pos logit contributions
Finally+0.163
boards+0.156
Exc+0.150
Days+0.150
Soon+0.148
Neg logit contributions
e-0.108
 supposed-0.084
ow-0.070
 such-0.070
 image-0.068
Token resting
Token ID19186
Feature activation+1.37e-01

Pos logit contributions
Finally+0.148
boards+0.139
Soon+0.138
At+0.134
Eventually+0.131
Neg logit contributions
e-0.138
 supposed-0.121
oof-0.111
 image-0.110
ow-0.106
Token in
Token ID287
Feature activation+1.83e-02

Pos logit contributions
Finally+0.272
boards+0.245
Soon+0.244
Days+0.242
Eventually+0.240
Neg logit contributions
e-0.215
 supposed-0.174
 image-0.158
oof-0.148
ow-0.148
Token the
Token ID262
Feature activation+5.82e-02

Pos logit contributions
Finally+0.027
Soon+0.026
At+0.026
side+0.026
 Finally+0.024
Neg logit contributions
 supposed-0.031
e-0.030
beh-0.028
ow-0.028
leaf-0.027
Token sun
Token ID4252
Feature activation+6.23e-02

Pos logit contributions
Finally+0.081
boards+0.075
Soon+0.072
Eventually+0.068
Days+0.068
Neg logit contributions
e-0.121
 supposed-0.107
 image-0.099
oof-0.099
ow-0.096
Token.
Token ID13
Feature activation+4.67e-03

Pos logit contributions
Finally+0.125
boards+0.115
Soon+0.114
Days+0.112
Exc+0.112
Neg logit contributions
e-0.096
 supposed-0.080
 image-0.072
ow-0.071
oof-0.068
Token After
Token ID2293
Feature activation+4.45e-02

Pos logit contributions
Finally+0.006
boards+0.006
Days+0.006
Soon+0.006
orting+0.006
Neg logit contributions
e-0.009
 supposed-0.009
oof-0.008
 image-0.008
ow-0.007
Token waved
Token ID26834
Feature activation+3.10e-02

Pos logit contributions
Finally+0.022
boards+0.020
Soon+0.020
At+0.019
Eventually+0.019
Neg logit contributions
e-0.024
 supposed-0.022
oof-0.019
 image-0.019
ow-0.019
Token goodbye
Token ID24829
Feature activation+4.42e-02

Pos logit contributions
Finally+0.056
boards+0.053
Soon+0.051
Exc+0.050
orting+0.049
Neg logit contributions
e-0.051
 supposed-0.046
oof-0.040
 image-0.039
ow-0.038
Token and
Token ID290
Feature activation+2.05e-02

Pos logit contributions
Finally+0.087
boards+0.083
Soon+0.080
Eventually+0.077
Days+0.077
Neg logit contributions
e-0.066
 supposed-0.055
ow-0.050
 image-0.049
oof-0.048
Token kept
Token ID4030
Feature activation+7.22e-02

Pos logit contributions
Finally+0.035
boards+0.033
Soon+0.031
Eventually+0.031
Days+0.031
Neg logit contributions
e-0.036
 supposed-0.031
 image-0.028
oof-0.028
ow-0.028
Token on
Token ID319
Feature activation+1.99e-02

Pos logit contributions
Finally+0.137
Soon+0.130
At+0.127
boards+0.124
orting+0.123
Neg logit contributions
e-0.104
 supposed-0.091
 image-0.086
oof-0.083
ow-0.077
Token sailing
Token ID28499
Feature activation+1.36e-01

Pos logit contributions
Finally+0.041
boards+0.038
Soon+0.038
At+0.036
Eventually+0.036
Neg logit contributions
e-0.028
 supposed-0.024
oof-0.021
 image-0.021
ow-0.020
Token out
Token ID503
Feature activation+6.07e-02

Pos logit contributions
Finally+0.250
boards+0.234
Soon+0.228
Eventually+0.221
Days+0.220
Neg logit contributions
e-0.226
 supposed-0.189
 image-0.171
oof-0.170
ow-0.169
Token of
Token ID286
Feature activation+2.22e-02

Pos logit contributions
Finally+0.117
boards+0.110
Soon+0.107
Eventually+0.103
Days+0.103
Neg logit contributions
e-0.095
 supposed-0.080
 image-0.070
ow-0.070
oof-0.068
Token the
Token ID262
Feature activation+6.54e-02

Pos logit contributions
Finally+0.034
Soon+0.034
At+0.033
stairs+0.032
Exc+0.031
Neg logit contributions
e-0.038
 supposed-0.035
ow-0.031
oof-0.031
 image-0.030
Token lake
Token ID13546
Feature activation+7.36e-02

Pos logit contributions
Finally+0.119
boards+0.114
Eventually+0.105
 Finally+0.105
ways+0.104
Neg logit contributions
e-0.102
 supposed-0.087
 image-0.086
oof-0.079
 possession-0.077
Token and
Token ID290
Feature activation+4.25e-03

Pos logit contributions
Finally+0.140
boards+0.130
Soon+0.128
Exc+0.124
Eventually+0.123
Neg logit contributions
e-0.117
 supposed-0.099
 image-0.088
ow-0.087
oof-0.085
TokenShe
Token ID3347
Feature activation+6.93e-02

Pos logit contributions
boards+0.056
orting+0.053
 Pole+0.049
angers+0.046
ways+0.045
Neg logit contributions
e-0.119
 supposed-0.098
oof-0.090
leaf-0.089
-0.086
Token held
Token ID2714
Feature activation+6.98e-02

Pos logit contributions
Finally+0.114
boards+0.108
Soon+0.103
orting+0.101
Eventually+0.099
Neg logit contributions
e-0.128
 supposed-0.109
ow-0.099
 image-0.097
oof-0.097
Token him
Token ID683
Feature activation+4.36e-02

Pos logit contributions
Finally+0.120
Soon+0.111
boards+0.111
 Finally+0.110
At+0.108
Neg logit contributions
 supposed-0.111
e-0.110
 image-0.094
oof-0.091
 seen-0.086
Token closely
Token ID7173
Feature activation+7.18e-02

Pos logit contributions
Finally+0.081
boards+0.075
Soon+0.074
At+0.071
Eventually+0.071
Neg logit contributions
e-0.070
 supposed-0.062
oof-0.055
 image-0.054
ow-0.051
Token,
Token ID11
Feature activation+8.46e-02

Pos logit contributions
Finally+0.134
boards+0.124
Soon+0.123
Days+0.118
Eventually+0.118
Neg logit contributions
e-0.115
 supposed-0.098
 image-0.088
ow-0.088
oof-0.086
Token resting
Token ID19186
Feature activation+1.36e-01

Pos logit contributions
Finally+0.159
boards+0.153
Soon+0.142
orting+0.141
Days+0.140
Neg logit contributions
e-0.135
 supposed-0.113
ow-0.101
 image-0.100
 ear-0.095
Token her
Token ID607
Feature activation+3.91e-02

Pos logit contributions
Finally+0.268
Soon+0.240
Days+0.237
Eventually+0.236
boards+0.236
Neg logit contributions
e-0.213
 supposed-0.166
 image-0.155
ow-0.149
 such-0.147
Token hand
Token ID1021
Feature activation-2.62e-02

Pos logit contributions
Finally+0.069
boards+0.066
Soon+0.063
At+0.062
Eventually+0.061
Neg logit contributions
e-0.064
 supposed-0.058
oof-0.051
 image-0.049
ow-0.048
Token on
Token ID319
Feature activation+5.06e-03

Pos logit contributions
e+0.044
 supposed+0.037
ow+0.034
 image+0.033
oof+0.033
Neg logit contributions
Finally-0.047
boards-0.044
Soon-0.043
Eventually-0.042
Days-0.041
Token his
Token ID465
Feature activation+2.40e-02

Pos logit contributions
Finally+0.010
boards+0.009
Soon+0.009
At+0.009
Exc+0.009
Neg logit contributions
e-0.007
 supposed-0.007
oof-0.006
 image-0.006
ow-0.006
Token shoulder
Token ID8163
Feature activation-3.38e-02

Pos logit contributions
Finally+0.039
boards+0.037
Days+0.036
Soon+0.035
At+0.035
Neg logit contributions
e-0.041
 supposed-0.038
oof-0.034
leaf-0.032
ow-0.032
Token to
Token ID284
Feature activation-3.33e-02

Pos logit contributions
Finally+0.062
boards+0.059
Soon+0.059
At+0.058
Eventually+0.056
Neg logit contributions
e-0.043
 supposed-0.035
oof-0.033
 image-0.033
ow-0.031
Token worry
Token ID5490
Feature activation+1.29e-02

Pos logit contributions
 supposed+0.056
e+0.055
 image+0.049
oof+0.049
 possession+0.048
Neg logit contributions
Finally-0.052
Soon-0.046
At-0.046
Exc-0.045
Days-0.044
Token about
Token ID546
Feature activation+4.23e-02

Pos logit contributions
Finally+0.027
boards+0.025
Soon+0.025
Eventually+0.024
Exc+0.024
Neg logit contributions
e-0.018
 supposed-0.015
 image-0.013
ow-0.013
oof-0.013
Token mis
Token ID2984
Feature activation-9.97e-03

Pos logit contributions
Finally+0.076
boards+0.074
Soon+0.072
At+0.069
orting+0.069
Neg logit contributions
e-0.065
 supposed-0.054
oof-0.052
ow-0.049
 image-0.048
Tokenpl
Token ID489
Feature activation+2.38e-03

Pos logit contributions
e+0.015
 supposed+0.013
oof+0.012
 image+0.012
ow+0.011
Neg logit contributions
Finally-0.019
boards-0.018
Soon-0.018
Days-0.017
Eventually-0.017
Tokenacing
Token ID4092
Feature activation+1.36e-01

Pos logit contributions
Finally+0.003
boards+0.003
Soon+0.003
Eventually+0.003
orting+0.003
Neg logit contributions
e-0.005
 supposed-0.004
oof-0.004
 image-0.004
ow-0.004
Token his
Token ID465
Feature activation+3.92e-02

Pos logit contributions
boards+0.237
Finally+0.233
Soon+0.219
orting+0.215
At+0.213
Neg logit contributions
e-0.213
 supposed-0.189
oof-0.179
ow-0.163
 image-0.162
Token phone
Token ID3072
Feature activation+6.62e-03

Pos logit contributions
Finally+0.066
boards+0.063
Soon+0.060
At+0.059
Days+0.059
Neg logit contributions
e-0.064
 supposed-0.057
oof-0.052
ow-0.050
 image-0.046
Token ever
Token ID1683
Feature activation-3.48e-02

Pos logit contributions
Finally+0.011
boards+0.011
Soon+0.010
Eventually+0.010
At+0.010
Neg logit contributions
e-0.012
 supposed-0.010
 image-0.009
oof-0.009
ow-0.009
Token again
Token ID757
Feature activation+6.20e-02

Pos logit contributions
e+0.076
 supposed+0.068
oof+0.063
 image+0.063
ow+0.062
Neg logit contributions
Finally-0.044
boards-0.041
Soon-0.039
Eventually-0.037
At-0.037
Token.
Token ID13
Feature activation+6.30e-03

Pos logit contributions
Finally+0.106
boards+0.100
Soon+0.098
At+0.094
Exc+0.092
Neg logit contributions
e-0.105
 supposed-0.093
oof-0.085
 image-0.083
ow-0.082
Token how
Token ID703
Feature activation+5.71e-02

Pos logit contributions
Finally+0.035
Soon+0.034
At+0.032
Days+0.031
Eventually+0.030
Neg logit contributions
e-0.045
leaf-0.041
 supposed-0.039
 arrest-0.037
child-0.037
Token it
Token ID340
Feature activation+9.42e-02

Pos logit contributions
Finally+0.093
Soon+0.089
boards+0.084
At+0.084
 Finally+0.083
Neg logit contributions
e-0.101
 supposed-0.082
oof-0.082
 image-0.077
ow-0.075
Token stayed
Token ID9658
Feature activation+8.71e-02

Pos logit contributions
Finally+0.147
boards+0.136
Soon+0.132
Eventually+0.127
Days+0.124
Neg logit contributions
e-0.182
 supposed-0.162
 image-0.146
oof-0.143
ow-0.142
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
Finally+0.154
boards+0.142
Soon+0.142
At+0.135
Eventually+0.135
Neg logit contributions
e-0.147
 supposed-0.129
oof-0.115
 image-0.115
ow-0.113
Token blo
Token ID24924
Feature activation+9.08e-02

Pos logit contributions
Finally+0.161
Soon+0.152
boards+0.145
At+0.145
orting+0.142
Neg logit contributions
e-0.184
 supposed-0.163
oof-0.150
ow-0.145
 image-0.144
Tokenoming
Token ID3383
Feature activation+1.35e-01

Pos logit contributions
Finally+0.110
boards+0.104
Exc+0.100
Days+0.097
Soon+0.093
Neg logit contributions
e-0.200
 supposed-0.177
ow-0.171
 image-0.164
 such-0.160
Token and
Token ID290
Feature activation+1.08e-02

Pos logit contributions
Finally+0.237
boards+0.221
Soon+0.215
Eventually+0.207
Days+0.205
Neg logit contributions
e-0.237
 supposed-0.204
 image-0.186
oof-0.180
ow-0.179
Token making
Token ID1642
Feature activation+5.03e-02

Pos logit contributions
Finally+0.019
boards+0.018
Soon+0.018
Eventually+0.017
At+0.017
Neg logit contributions
e-0.018
 supposed-0.016
oof-0.014
 image-0.014
ow-0.014
Token loops
Token ID23607
Feature activation+4.84e-02

Pos logit contributions
Finally+0.069
Soon+0.066
Eventually+0.062
Days+0.058
 until+0.058
Neg logit contributions
e-0.097
 supposed-0.089
oof-0.085
ow-0.083
leaf-0.079
Token,
Token ID11
Feature activation+9.61e-02

Pos logit contributions
Finally+0.079
boards+0.076
Soon+0.073
orting+0.070
At+0.069
Neg logit contributions
e-0.085
 supposed-0.076
oof-0.070
 image-0.068
ow-0.066
Token for
Token ID329
Feature activation+7.68e-03

Pos logit contributions
Finally+0.155
Soon+0.145
boards+0.144
At+0.138
Eventually+0.136
Neg logit contributions
e-0.172
 supposed-0.153
oof-0.140
ow-0.137
 image-0.133
TokenFrom
Token ID4863
Feature activation+2.27e-02

Pos logit contributions
boards+0.081
Finally+0.070
ways+0.069
orting+0.069
 Finally+0.065
Neg logit contributions
e-0.128
 supposed-0.107
oof-0.099
leaf-0.096
 image-0.095
Token then
Token ID788
Feature activation+7.28e-02

Pos logit contributions
Finally+0.029
Soon+0.028
orting+0.026
Then+0.025
At+0.025
Neg logit contributions
e-0.049
 supposed-0.046
 image-0.040
ow-0.039
oof-0.039
Token on
Token ID319
Feature activation+1.11e-02

Pos logit contributions
boards+0.160
Finally+0.154
Exc+0.148
Days+0.145
 Pole+0.142
Neg logit contributions
e-0.107
 supposed-0.079
ow-0.068
oof-0.064
 ear-0.063
Token,
Token ID11
Feature activation+9.08e-02

Pos logit contributions
Finally+0.019
boards+0.018
Soon+0.017
Days+0.017
Exc+0.016
Neg logit contributions
e-0.020
 supposed-0.017
ow-0.015
oof-0.015
 image-0.015
Token whenever
Token ID8797
Feature activation+8.77e-02

Pos logit contributions
Finally+0.157
boards+0.147
Soon+0.143
Eventually+0.137
At+0.136
Neg logit contributions
e-0.158
 supposed-0.139
 image-0.124
oof-0.124
ow-0.121
Token there
Token ID612
Feature activation+1.35e-01

Pos logit contributions
Finally+0.159
boards+0.148
Soon+0.147
At+0.141
Eventually+0.139
Neg logit contributions
e-0.142
 supposed-0.126
oof-0.111
 image-0.111
ow-0.108
Token was
Token ID373
Feature activation+5.84e-03

Pos logit contributions
Finally+0.236
Soon+0.213
boards+0.212
Days+0.208
Eventually+0.207
Neg logit contributions
e-0.245
 supposed-0.205
 image-0.189
ow-0.188
oof-0.180
Token a
Token ID257
Feature activation-5.67e-03

Pos logit contributions
Finally+0.012
boards+0.011
Soon+0.011
Eventually+0.011
At+0.011
Neg logit contributions
e-0.008
 supposed-0.007
oof-0.006
 image-0.006
ow-0.006
Token disagreement
Token ID25800
Feature activation+2.75e-02

Pos logit contributions
e+0.010
 supposed+0.008
oof+0.008
 image+0.007
ow+0.007
Neg logit contributions
Finally-0.010
boards-0.009
Soon-0.009
Eventually-0.009
At-0.009
Token between
Token ID1022
Feature activation-1.64e-02

Pos logit contributions
Finally+0.048
boards+0.044
Soon+0.044
Eventually+0.042
Days+0.042
Neg logit contributions
e-0.048
 supposed-0.042
 image-0.037
ow-0.037
oof-0.037
Token Marvin
Token ID35105
Feature activation+7.93e-03

Pos logit contributions
e+0.029
 supposed+0.026
oof+0.023
 image+0.023
ow+0.023
Neg logit contributions
Finally-0.027
boards-0.025
Soon-0.025
Eventually-0.023
At-0.023
Token and
Token ID290
Feature activation-3.64e-03

Pos logit contributions
Finally+0.142
boards+0.132
Soon+0.132
At+0.126
Eventually+0.125
Neg logit contributions
e-0.151
 supposed-0.134
oof-0.122
 image-0.119
ow-0.118
Token his
Token ID465
Feature activation+4.22e-02

Pos logit contributions
e+0.007
 supposed+0.006
oof+0.006
 image+0.006
ow+0.006
Neg logit contributions
Finally-0.005
boards-0.005
Soon-0.005
Eventually-0.005
At-0.005
Token life
Token ID1204
Feature activation+6.39e-02

Pos logit contributions
Finally+0.077
Soon+0.070
Days+0.069
boards+0.068
Eventually+0.068
Neg logit contributions
e-0.068
 supposed-0.060
oof-0.054
ow-0.051
 image-0.050
Token had
Token ID550
Feature activation+2.42e-02

Pos logit contributions
Finally+0.101
boards+0.094
Soon+0.091
Eventually+0.087
At+0.086
Neg logit contributions
e-0.120
 supposed-0.106
 image-0.096
oof-0.096
ow-0.094
Token begun
Token ID9258
Feature activation+5.09e-02

Pos logit contributions
Finally+0.044
boards+0.042
Soon+0.041
At+0.039
Eventually+0.039
Neg logit contributions
e-0.039
 supposed-0.033
oof-0.030
 image-0.030
ow-0.029
Token anew
Token ID44951
Feature activation+1.35e-01

Pos logit contributions
Finally+0.098
boards+0.093
Soon+0.091
At+0.087
Eventually+0.087
Neg logit contributions
e-0.078
 supposed-0.066
oof-0.059
 image-0.059
ow-0.057
Token.
Token ID13
Feature activation+8.65e-03

Pos logit contributions
Finally+0.277
boards+0.255
Soon+0.254
Eventually+0.247
Days+0.244
Neg logit contributions
e-0.199
 supposed-0.166
 image-0.144
ow-0.141
oof-0.136
Token He
Token ID679
Feature activation+9.41e-02

Pos logit contributions
Finally+0.010
boards+0.010
Soon+0.009
Days+0.009
orting+0.009
Neg logit contributions
e-0.019
 supposed-0.017
oof-0.016
ow-0.016
 image-0.016
Token was
Token ID373
Feature activation-1.65e-02

Pos logit contributions
Finally+0.147
boards+0.134
Soon+0.133
At+0.127
Eventually+0.127
Neg logit contributions
e-0.176
 supposed-0.160
oof-0.145
 image-0.143
ow-0.140
Token a
Token ID257
Feature activation-1.89e-02

Pos logit contributions
oof+0.026
e+0.026
 image+0.023
 supposed+0.023
leaf+0.022
Neg logit contributions
Finally-0.027
Soon-0.026
Eventually-0.025
At-0.024
side-0.023
Token cheerful
Token ID37999
Feature activation+3.68e-03

Pos logit contributions
e+0.033
 supposed+0.030
oof+0.027
ow+0.027
 image+0.025
Neg logit contributions
Finally-0.034
Soon-0.031
Days-0.029
Eventually-0.029
At-0.029
Token rest
Token ID1334
Feature activation+7.94e-02

Pos logit contributions
e+0.034
 supposed+0.034
 image+0.029
oof+0.029
 possession+0.028
Neg logit contributions
Finally-0.030
Soon-0.027
boards-0.026
At-0.026
Exc-0.026
Token.
Token ID13
Feature activation-3.95e-03

Pos logit contributions
Finally+0.137
boards+0.130
Soon+0.127
At+0.121
orting+0.120
Neg logit contributions
e-0.136
 supposed-0.118
oof-0.107
 image-0.106
ow-0.104
Token While
Token ID2893
Feature activation+2.64e-02

Pos logit contributions
e+0.007
 supposed+0.006
 ear+0.006
 image+0.006
ow+0.006
Neg logit contributions
Finally-0.006
Days-0.006
Soon-0.006
orting-0.006
boards-0.005
Token she
Token ID673
Feature activation+1.06e-01

Pos logit contributions
Finally+0.046
boards+0.043
Soon+0.042
Eventually+0.040
At+0.040
Neg logit contributions
e-0.046
 supposed-0.040
 image-0.036
oof-0.036
ow-0.035
Token was
Token ID373
Feature activation-1.31e-02

Pos logit contributions
Finally+0.184
boards+0.175
Soon+0.167
orting+0.164
Eventually+0.162
Neg logit contributions
e-0.188
 supposed-0.158
oof-0.141
 image-0.141
ow-0.140
Token resting
Token ID19186
Feature activation+1.35e-01

Pos logit contributions
e+0.016
 supposed+0.014
ow+0.012
oof+0.010
 image+0.010
Neg logit contributions
Finally-0.030
boards-0.029
Soon-0.028
Exc-0.027
Eventually-0.027
Token,
Token ID11
Feature activation+8.73e-02

Pos logit contributions
Finally+0.317
Days+0.290
Eventually+0.286
boards+0.286
About+0.279
Neg logit contributions
e-0.159
 such-0.116
 ear-0.107
ow-0.104
 a-0.098
Token she
Token ID673
Feature activation+1.08e-01

Pos logit contributions
Finally+0.125
boards+0.117
Soon+0.114
At+0.106
Eventually+0.106
Neg logit contributions
e-0.177
 supposed-0.161
oof-0.147
 image-0.145
ow-0.142
Token saw
Token ID2497
Feature activation+5.77e-02

Pos logit contributions
Finally+0.170
boards+0.157
Soon+0.153
orting+0.147
Eventually+0.146
Neg logit contributions
e-0.205
 supposed-0.179
 image-0.162
oof-0.160
ow-0.158
Token a
Token ID257
Feature activation-2.24e-02

Pos logit contributions
boards+0.109
Finally+0.107
Soon+0.107
Exc+0.101
Eventually+0.099
Neg logit contributions
e-0.089
 supposed-0.077
oof-0.069
ow-0.062
beh-0.061
Token beautiful
Token ID4950
Feature activation+1.61e-02

Pos logit contributions
e+0.041
 supposed+0.036
oof+0.033
ow+0.032
 image+0.031
Neg logit contributions
Finally-0.037
boards-0.034
Soon-0.034
Exc-0.032
At-0.032
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
Finally+0.027
Soon+0.026
Days+0.025
Eventually+0.025
orting+0.024
Neg logit contributions
e-0.029
 supposed-0.025
oof-0.025
ow-0.023
tails-0.022
Token<|endoftext|>
Token ID50256
Feature activation-1.04e-01

Pos logit contributions
e+0.243
 supposed+0.218
oof+0.203
 image+0.201
ow+0.199
Neg logit contributions
Finally-0.128
boards-0.118
Soon-0.112
Days-0.106
Eventually-0.105
TokenJack
Token ID14295
Feature activation-1.03e-01

Pos logit contributions
e+0.257
 supposed+0.233
oof+0.216
 image+0.215
ow+0.213
Neg logit contributions
Finally-0.105
boards-0.094
Soon-0.089
Days-0.083
Eventually-0.083
Token was
Token ID373
Feature activation-1.36e-01

Pos logit contributions
e+0.194
 supposed+0.189
oof+0.171
 image+0.166
leaf+0.165
Neg logit contributions
Finally-0.138
boards-0.132
At-0.128
Soon-0.125
Exc-0.123
Token three
Token ID1115
Feature activation-1.25e-01

Pos logit contributions
e+0.197
 image+0.175
 possession+0.173
 supposed+0.169
oof+0.161
Neg logit contributions
Finally-0.251
Soon-0.241
Eventually-0.233
At-0.228
 Finally-0.226
Token years
Token ID812
Feature activation-2.22e-01

Pos logit contributions
e+0.174
 supposed+0.167
oof+0.146
 yours+0.141
ow+0.134
Neg logit contributions
Finally-0.236
boards-0.234
Exc-0.224
Soon-0.220
Days-0.220
Token old
Token ID1468
Feature activation-9.78e-02

Pos logit contributions
e+0.389
 supposed+0.333
oof+0.304
 image+0.294
ow+0.291
Neg logit contributions
Finally-0.380
boards-0.362
Soon-0.351
Exc-0.333
At-0.333
Token and
Token ID290
Feature activation-1.26e-01

Pos logit contributions
e+0.203
 supposed+0.193
oof+0.176
 yours+0.165
ow+0.163
Neg logit contributions
boards-0.125
Exc-0.119
Finally-0.116
-0.113
orting-0.110
Token he
Token ID339
Feature activation+1.09e-02

Pos logit contributions
e+0.297
 supposed+0.275
oof+0.272
 image+0.263
ow+0.257
Neg logit contributions
Finally-0.119
Soon-0.108
 Finally-0.106
boards-0.095
Then-0.095
Token wanted
Token ID2227
Feature activation-7.53e-02

Pos logit contributions
Finally+0.015
At+0.014
Soon+0.014
boards+0.013
 Finally+0.013
Neg logit contributions
e-0.021
 supposed-0.019
oof-0.019
 image-0.018
ow-0.018
Token to
Token ID284
Feature activation-1.03e-01

Pos logit contributions
e+0.149
oof+0.121
 supposed+0.120
 image+0.111
ow+0.111
Neg logit contributions
boards-0.117
Finally-0.111
Soon-0.108
At-0.102
Exc-0.098
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
Finally+0.241
Soon+0.223
Eventually+0.222
Days+0.220
orting+0.218
Neg logit contributions
e-0.126
 supposed-0.100
ow-0.100
 image-0.093
 ear-0.084
Token<|endoftext|>
Token ID50256
Feature activation-1.04e-01

Pos logit contributions
e+0.243
 supposed+0.218
oof+0.203
 image+0.201
ow+0.199
Neg logit contributions
Finally-0.128
boards-0.118
Soon-0.112
Days-0.106
Eventually-0.105
TokenJack
Token ID14295
Feature activation-1.03e-01

Pos logit contributions
e+0.257
 supposed+0.233
oof+0.216
 image+0.215
ow+0.213
Neg logit contributions
Finally-0.105
boards-0.094
Soon-0.089
Days-0.083
Eventually-0.083
Token was
Token ID373
Feature activation-1.36e-01

Pos logit contributions
e+0.194
 supposed+0.189
oof+0.171
 image+0.166
leaf+0.165
Neg logit contributions
Finally-0.138
boards-0.132
At-0.128
Soon-0.125
Exc-0.123
Token 3
Token ID513
Feature activation-1.54e-01

Pos logit contributions
e+0.197
 image+0.175
 possession+0.173
 supposed+0.169
oof+0.161
Neg logit contributions
Finally-0.251
Soon-0.241
Eventually-0.233
At-0.228
 Finally-0.226
Token years
Token ID812
Feature activation-2.22e-01

Pos logit contributions
 supposed+0.215
 yours+0.204
e+0.197
oof+0.192
leaf+0.175
Neg logit contributions
boards-0.274
Finally-0.266
Exc-0.260
Days-0.255
Soon-0.251
Token old
Token ID1468
Feature activation-9.53e-02

Pos logit contributions
e+0.391
 supposed+0.335
oof+0.307
 image+0.295
ow+0.293
Neg logit contributions
Finally-0.377
boards-0.360
Soon-0.349
Exc-0.331
At-0.330
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
e+0.198
 supposed+0.189
oof+0.175
 yours+0.164
ow+0.161
Neg logit contributions
boards-0.120
Exc-0.114
Finally-0.112
-0.111
orting-0.106
Token He
Token ID679
Feature activation+4.83e-03

Pos logit contributions
e+0.209
 supposed+0.194
 image+0.179
ow+0.170
oof+0.166
Neg logit contributions
Finally-0.138
Days-0.123
Soon-0.122
orting-0.122
boards-0.122
Token loved
Token ID6151
Feature activation-6.86e-02

Pos logit contributions
Finally+0.007
boards+0.007
Soon+0.006
At+0.006
Exc+0.006
Neg logit contributions
e-0.009
 supposed-0.008
oof-0.008
 image-0.008
ow-0.008
Token to
Token ID284
Feature activation-1.06e-01

Pos logit contributions
e+0.142
oof+0.118
ow+0.111
 supposed+0.107
 tal+0.100
Neg logit contributions
boards-0.097
Soon-0.089
At-0.088
Finally-0.085
Exc-0.084
Token.
Token ID13
Feature activation-9.26e-02

Pos logit contributions
e+0.137
 supposed+0.121
oof+0.109
 image+0.107
ow+0.105
Neg logit contributions
Finally-0.126
boards-0.120
Soon-0.116
orting-0.111
At-0.110
Token<|endoftext|>
Token ID50256
Feature activation-1.08e-01

Pos logit contributions
e+0.210
 supposed+0.189
oof+0.175
 image+0.175
ow+0.172
Neg logit contributions
Finally-0.110
boards-0.100
Soon-0.095
Eventually-0.089
Days-0.089
TokenJohn
Token ID7554
Feature activation-1.15e-01

Pos logit contributions
e+0.262
 supposed+0.233
oof+0.217
 image+0.215
ow+0.215
Neg logit contributions
Finally-0.114
boards-0.102
Soon-0.097
Days-0.095
Eventually-0.091
Token was
Token ID373
Feature activation-1.38e-01

Pos logit contributions
e+0.204
 supposed+0.196
 image+0.179
oof+0.178
 possession+0.174
Neg logit contributions
Finally-0.169
boards-0.157
At-0.157
Soon-0.155
Exc-0.149
Token 3
Token ID513
Feature activation-1.57e-01

Pos logit contributions
e+0.215
 image+0.195
 possession+0.189
 supposed+0.183
oof+0.180
Neg logit contributions
Finally-0.244
Soon-0.234
Eventually-0.224
At-0.221
 Finally-0.219
Token years
Token ID812
Feature activation-2.20e-01

Pos logit contributions
 supposed+0.214
 yours+0.203
e+0.194
oof+0.191
leaf+0.173
Neg logit contributions
boards-0.280
Finally-0.277
Exc-0.270
Days-0.264
Then-0.262
Token old
Token ID1468
Feature activation-9.59e-02

Pos logit contributions
e+0.393
 supposed+0.336
oof+0.309
 image+0.296
ow+0.294
Neg logit contributions
Finally-0.371
boards-0.355
Soon-0.343
Exc-0.326
At-0.325
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
e+0.203
 supposed+0.194
oof+0.178
 yours+0.169
ow+0.164
Neg logit contributions
boards-0.117
Exc-0.110
-0.108
Finally-0.108
orting-0.103
Token He
Token ID679
Feature activation+5.38e-03

Pos logit contributions
e+0.217
 supposed+0.203
 image+0.187
ow+0.183
oof+0.179
Neg logit contributions
Finally-0.125
boards-0.111
Days-0.110
Soon-0.109
orting-0.108
Token loved
Token ID6151
Feature activation-6.84e-02

Pos logit contributions
Finally+0.008
boards+0.007
Soon+0.007
At+0.007
Exc+0.007
Neg logit contributions
e-0.010
 supposed-0.010
oof-0.009
 image-0.009
ow-0.009
Token to
Token ID284
Feature activation-1.05e-01

Pos logit contributions
e+0.145
oof+0.120
ow+0.113
 supposed+0.109
illa+0.103
Neg logit contributions
boards-0.093
Soon-0.085
At-0.084
Exc-0.081
Finally-0.079
TokenM
Token ID44
Feature activation-1.15e-01

Pos logit contributions
e+0.264
 supposed+0.243
 image+0.225
oof+0.225
ow+0.222
Neg logit contributions
Finally-0.098
boards-0.087
Soon-0.082
Eventually-0.075
At-0.074
Tokenolly
Token ID5098
Feature activation-1.09e-01

Pos logit contributions
e+0.257
 supposed+0.243
 image+0.215
 possession+0.213
oof+0.208
Neg logit contributions
Finally-0.146
Soon-0.127
boards-0.122
Eventually-0.119
At-0.117
Token was
Token ID373
Feature activation-1.40e-01

Pos logit contributions
 supposed+0.217
e+0.216
oof+0.192
 image+0.186
 possession+0.186
Neg logit contributions
Finally-0.138
boards-0.135
At-0.128
Exc-0.128
Soon-0.125
Token only
Token ID691
Feature activation-1.31e-01

Pos logit contributions
e+0.208
 possession+0.200
 image+0.193
oof+0.180
 hairc+0.180
Neg logit contributions
Finally-0.246
Soon-0.235
Eventually-0.230
 Finally-0.230
At-0.223
Token three
Token ID1115
Feature activation-1.19e-01

Pos logit contributions
oof+0.245
e+0.237
 supposed+0.230
 possession+0.225
 image+0.221
Neg logit contributions
Soon-0.151
Finally-0.145
 until-0.145
Then-0.135
 Finally-0.135
Token years
Token ID812
Feature activation-2.11e-01

Pos logit contributions
e+0.131
 supposed+0.125
oof+0.104
 yours+0.099
 image+0.093
Neg logit contributions
Finally-0.259
boards-0.253
Exc-0.242
Soon-0.241
Days-0.240
Token old
Token ID1468
Feature activation-9.25e-02

Pos logit contributions
e+0.360
 supposed+0.311
oof+0.280
 image+0.275
ow+0.270
Neg logit contributions
Finally-0.371
boards-0.352
Soon-0.342
Eventually-0.326
Exc-0.324
Token.
Token ID13
Feature activation-9.80e-02

Pos logit contributions
e+0.190
 supposed+0.183
oof+0.168
 yours+0.159
ow+0.154
Neg logit contributions
boards-0.115
Finally-0.109
-0.106
Exc-0.105
orting-0.101
Token She
Token ID1375
Feature activation+4.03e-04

Pos logit contributions
e+0.197
 supposed+0.173
ow+0.159
 image+0.157
oof+0.156
Neg logit contributions
Finally-0.139
boards-0.128
orting-0.125
Soon-0.125
Days-0.124
Token liked
Token ID8288
Feature activation-6.68e-02

Pos logit contributions
Finally+0.001
boards+0.001
Soon+0.001
At+0.001
Exc+0.001
Neg logit contributions
e-0.001
 supposed-0.001
oof-0.001
 image-0.001
ow-0.001
Token to
Token ID284
Feature activation-1.05e-01

Pos logit contributions
e+0.140
oof+0.116
 supposed+0.110
ow+0.108
 tal+0.101
Neg logit contributions
boards-0.093
Soon-0.085
At-0.085
Finally-0.082
Exc-0.082
Token.
Token ID13
Feature activation-6.82e-02

Pos logit contributions
e+0.223
 supposed+0.204
oof+0.189
 image+0.187
ow+0.176
Neg logit contributions
Finally-0.204
boards-0.201
Soon-0.189
At-0.187
orting-0.186
Token<|endoftext|>
Token ID50256
Feature activation-9.07e-02

Pos logit contributions
e+0.156
 supposed+0.140
oof+0.131
 image+0.129
ow+0.128
Neg logit contributions
Finally-0.079
boards-0.073
Soon-0.068
Days-0.065
Eventually-0.064
TokenAlice
Token ID44484
Feature activation-1.05e-01

Pos logit contributions
e+0.208
 supposed+0.182
oof+0.169
ow+0.166
 image+0.166
Neg logit contributions
Finally-0.110
boards-0.099
Days-0.095
Soon-0.095
Eventually-0.091
Token was
Token ID373
Feature activation-1.25e-01

Pos logit contributions
e+0.198
 supposed+0.187
 image+0.172
oof+0.170
ow+0.164
Neg logit contributions
Finally-0.146
At-0.133
boards-0.133
Soon-0.132
 Finally-0.127
Token three
Token ID1115
Feature activation-1.15e-01

Pos logit contributions
e+0.199
 possession+0.182
 image+0.180
oof+0.171
leaf+0.168
Neg logit contributions
Finally-0.211
Soon-0.205
 Finally-0.197
Eventually-0.196
At-0.191
Token years
Token ID812
Feature activation-2.09e-01

Pos logit contributions
e+0.143
 supposed+0.137
oof+0.117
 yours+0.110
ow+0.107
Neg logit contributions
Finally-0.232
boards-0.229
Exc-0.223
Days-0.217
Soon-0.216
Token old
Token ID1468
Feature activation-8.42e-02

Pos logit contributions
e+0.366
 supposed+0.314
oof+0.286
 image+0.277
ow+0.275
Neg logit contributions
Finally-0.359
boards-0.342
Soon-0.331
Exc-0.315
At-0.314
Token and
Token ID290
Feature activation-1.16e-01

Pos logit contributions
e+0.182
 supposed+0.174
oof+0.159
 yours+0.148
ow+0.148
Neg logit contributions
boards-0.100
Exc-0.094
-0.092
Finally-0.091
orting-0.087
Token loved
Token ID6151
Feature activation-7.52e-02

Pos logit contributions
e+0.271
oof+0.246
 supposed+0.244
 image+0.242
ow+0.234
Neg logit contributions
Finally-0.113
 Finally-0.103
Soon-0.103
Then-0.092
At-0.090
Token to
Token ID284
Feature activation-1.06e-01

Pos logit contributions
e+0.161
oof+0.131
ow+0.122
 supposed+0.120
 tal+0.113
Neg logit contributions
boards-0.100
Soon-0.093
At-0.092
Finally-0.091
Exc-0.090
Token explore
Token ID7301
Feature activation+5.34e-03

Pos logit contributions
 supposed+0.193
e+0.192
oof+0.171
 possession+0.167
ow+0.161
Neg logit contributions
Finally-0.144
At-0.135
boards-0.134
Soon-0.132
Exc-0.130
Token called
Token ID1444
Feature activation-1.48e-01

Pos logit contributions
 supposed+0.284
e+0.280
oof+0.264
 image+0.252
 possession+0.248
Neg logit contributions
Finally-0.253
boards-0.230
-0.229
 Finally-0.228
Exc-0.228
Token Mia
Token ID32189
Feature activation-1.24e-01

Pos logit contributions
e+0.238
 supposed+0.219
oof+0.198
 image+0.193
 possession+0.181
Neg logit contributions
Finally-0.239
boards-0.234
 Finally-0.222
Soon-0.220
orting-0.219
Token who
Token ID508
Feature activation-6.03e-02

Pos logit contributions
e+0.220
 supposed+0.201
oof+0.183
 image+0.181
 possession+0.171
Neg logit contributions
Finally-0.196
boards-0.185
Soon-0.177
Exc-0.174
At-0.174
Token was
Token ID373
Feature activation-1.29e-01

Pos logit contributions
e+0.119
 supposed+0.109
oof+0.102
 image+0.099
ow+0.095
Neg logit contributions
Finally-0.083
boards-0.079
Soon-0.074
Exc-0.073
At-0.073
Token three
Token ID1115
Feature activation-1.16e-01

Pos logit contributions
e+0.185
oof+0.163
 supposed+0.152
 image+0.151
 possession+0.148
Neg logit contributions
Finally-0.240
Soon-0.226
At-0.217
Eventually-0.214
 Finally-0.212
Token years
Token ID812
Feature activation-2.08e-01

Pos logit contributions
e+0.132
 supposed+0.124
oof+0.115
 yours+0.104
ow+0.097
Neg logit contributions
boards-0.242
Finally-0.240
Exc-0.231
Days-0.229
Soon-0.225
Token old
Token ID1468
Feature activation-8.10e-02

Pos logit contributions
e+0.316
 supposed+0.280
 image+0.248
ow+0.238
oof+0.235
Neg logit contributions
Finally-0.402
boards-0.374
Soon-0.367
Eventually-0.356
Days-0.350
Token.
Token ID13
Feature activation-7.48e-02

Pos logit contributions
e+0.156
 supposed+0.143
oof+0.131
ow+0.123
 image+0.120
Neg logit contributions
boards-0.118
Finally-0.116
Exc-0.107
Soon-0.106
orting-0.105
Token She
Token ID1375
Feature activation+9.59e-03

Pos logit contributions
e+0.159
 supposed+0.139
ow+0.132
oof+0.132
 image+0.130
Neg logit contributions
Finally-0.096
boards-0.090
Soon-0.085
Days-0.084
orting-0.084
Token was
Token ID373
Feature activation-1.02e-01

Pos logit contributions
Finally+0.016
boards+0.015
Soon+0.015
Eventually+0.014
orting+0.014
Neg logit contributions
e-0.017
 supposed-0.015
 image-0.013
oof-0.013
ow-0.013
Token excited
Token ID6568
Feature activation-6.49e-02

Pos logit contributions
e+0.171
oof+0.165
 possession+0.145
 image+0.141
 supposed+0.138
Neg logit contributions
Finally-0.152
Soon-0.146
 Finally-0.146
At-0.140
Eventually-0.139
Token Mia
Token ID32189
Feature activation-1.25e-01

Pos logit contributions
e+0.292
 supposed+0.259
oof+0.234
 image+0.233
ow+0.222
Neg logit contributions
Finally-0.253
boards-0.246
Soon-0.233
Exc-0.228
At-0.223
Token.
Token ID13
Feature activation-9.17e-02

Pos logit contributions
e+0.220
 supposed+0.204
 image+0.184
oof+0.183
 possession+0.173
Neg logit contributions
Finally-0.199
boards-0.187
Soon-0.181
At-0.177
Exc-0.176
Token She
Token ID1375
Feature activation-3.89e-03

Pos logit contributions
e+0.178
ow+0.147
 supposed+0.146
oof+0.143
 image+0.140
Neg logit contributions
Finally-0.141
boards-0.134
Days-0.131
Soon-0.125
orting-0.125
Token was
Token ID373
Feature activation-1.18e-01

Pos logit contributions
e+0.008
 supposed+0.007
 image+0.006
oof+0.006
ow+0.006
Neg logit contributions
Finally-0.006
Soon-0.005
boards-0.005
At-0.005
Exc-0.005
Token three
Token ID1115
Feature activation-1.14e-01

Pos logit contributions
e+0.186
oof+0.171
 possession+0.168
 image+0.156
 tal+0.155
Neg logit contributions
Finally-0.190
Soon-0.187
 Finally-0.175
At-0.173
Eventually-0.172
Token years
Token ID812
Feature activation-2.07e-01

Pos logit contributions
e+0.126
 supposed+0.118
oof+0.107
 yours+0.100
ow+0.097
Neg logit contributions
boards-0.239
Finally-0.238
Exc-0.229
Days-0.226
At-0.225
Token old
Token ID1468
Feature activation-7.99e-02

Pos logit contributions
e+0.343
 supposed+0.298
oof+0.264
 image+0.263
ow+0.258
Neg logit contributions
Finally-0.373
boards-0.352
Soon-0.342
Eventually-0.328
At-0.326
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
e+0.165
 supposed+0.152
oof+0.140
ow+0.132
 yours+0.131
Neg logit contributions
boards-0.106
Finally-0.100
Exc-0.094
orting-0.093
 Finally-0.091
Token the
Token ID262
Feature activation-1.45e-02

Pos logit contributions
e+0.199
 supposed+0.177
oof+0.170
ow+0.164
 image+0.160
Neg logit contributions
Finally-0.150
Soon-0.138
boards-0.135
 Finally-0.133
At-0.132
Token happiest
Token ID49414
Feature activation-6.64e-03

Pos logit contributions
e+0.029
 supposed+0.025
oof+0.023
ow+0.023
 image+0.021
Neg logit contributions
Finally-0.022
boards-0.020
Soon-0.019
Exc-0.019
Days-0.018
Token little
Token ID1310
Feature activation-7.83e-02

Pos logit contributions
e+0.014
 supposed+0.012
oof+0.011
ow+0.011
 image+0.010
Neg logit contributions
Finally-0.009
boards-0.009
Soon-0.008
Exc-0.008
At-0.008
Token Ann
Token ID5506
Feature activation-1.29e-01

Pos logit contributions
e+0.235
 supposed+0.216
oof+0.191
 image+0.188
 possession+0.176
Neg logit contributions
Finally-0.239
boards-0.236
Soon-0.222
 Finally-0.221
Exc-0.217
Token.
Token ID13
Feature activation-8.16e-02

Pos logit contributions
e+0.228
 supposed+0.210
 image+0.190
oof+0.189
 possession+0.178
Neg logit contributions
Finally-0.202
boards-0.191
Soon-0.182
At-0.179
Exc-0.178
Token Ann
Token ID5506
Feature activation-1.02e-01

Pos logit contributions
e+0.168
 supposed+0.145
ow+0.141
oof+0.138
 image+0.132
Neg logit contributions
Finally-0.111
boards-0.107
Days-0.100
orting-0.099
Soon-0.099
Token was
Token ID373
Feature activation-1.12e-01

Pos logit contributions
e+0.185
 supposed+0.166
oof+0.149
 image+0.149
ow+0.144
Neg logit contributions
Finally-0.164
boards-0.154
Soon-0.149
Eventually-0.142
At-0.142
Token three
Token ID1115
Feature activation-1.11e-01

Pos logit contributions
e+0.171
oof+0.162
 possession+0.149
 image+0.144
 tal+0.139
Neg logit contributions
Finally-0.188
Soon-0.182
 Finally-0.173
Eventually-0.171
At-0.171
Token years
Token ID812
Feature activation-2.06e-01

Pos logit contributions
e+0.119
 supposed+0.111
oof+0.100
 yours+0.092
ow+0.087
Neg logit contributions
Finally-0.238
boards-0.235
Exc-0.226
Soon-0.225
Days-0.224
Token old
Token ID1468
Feature activation-7.97e-02

Pos logit contributions
e+0.350
 supposed+0.303
oof+0.273
 image+0.268
ow+0.263
Neg logit contributions
Finally-0.363
boards-0.344
Soon-0.334
Eventually-0.318
At-0.318
Token.
Token ID13
Feature activation-7.19e-02

Pos logit contributions
e+0.161
 supposed+0.147
oof+0.135
ow+0.127
 image+0.126
Neg logit contributions
boards-0.109
Finally-0.108
Exc-0.098
Soon-0.097
orting-0.097
Token One
Token ID1881
Feature activation+4.48e-02

Pos logit contributions
e+0.150
 supposed+0.131
ow+0.127
oof+0.124
 image+0.120
Neg logit contributions
Finally-0.094
boards-0.092
orting-0.087
Soon-0.085
Days-0.085
Token day
Token ID1110
Feature activation-1.28e-02

Pos logit contributions
Finally+0.059
Soon+0.056
boards+0.056
At+0.055
 Finally+0.051
Neg logit contributions
e-0.091
 supposed-0.083
oof-0.078
 yours-0.073
leaf-0.071
Token,
Token ID11
Feature activation+2.71e-02

Pos logit contributions
e+0.024
 supposed+0.021
ow+0.019
 image+0.019
oof+0.019
Neg logit contributions
Finally-0.021
boards-0.019
Soon-0.019
Eventually-0.018
orting-0.018
Token Sam
Token ID3409
Feature activation-1.18e-01

Pos logit contributions
e+0.338
 supposed+0.304
oof+0.284
 image+0.283
 possession+0.268
Neg logit contributions
Finally-0.216
boards-0.207
Soon-0.198
Exc-0.197
At-0.191
Token.
Token ID13
Feature activation-9.01e-02

Pos logit contributions
e+0.204
 supposed+0.182
 image+0.164
oof+0.163
ow+0.154
Neg logit contributions
Finally-0.199
boards-0.189
Soon-0.182
Exc-0.178
At-0.176
Token She
Token ID1375
Feature activation-6.29e-05

Pos logit contributions
e+0.163
 supposed+0.135
oof+0.132
ow+0.131
 image+0.130
Neg logit contributions
Finally-0.150
boards-0.140
Days-0.136
Soon-0.135
Eventually-0.132
Token was
Token ID373
Feature activation-1.17e-01

Pos logit contributions
e+0.000
 supposed+0.000
 image+0.000
oof+0.000
ow+0.000
Neg logit contributions
Finally-0.000
Soon-0.000
boards-0.000
At-0.000
Exc-0.000
Token three
Token ID1115
Feature activation-1.13e-01

Pos logit contributions
e+0.174
 possession+0.165
oof+0.163
 tal+0.154
 image+0.154
Neg logit contributions
Finally-0.188
Soon-0.188
 Finally-0.176
At-0.175
Eventually-0.172
Token years
Token ID812
Feature activation-2.05e-01

Pos logit contributions
e+0.130
 supposed+0.128
oof+0.116
 yours+0.113
leaf+0.104
Neg logit contributions
boards-0.229
Finally-0.223
Exc-0.220
Days-0.215
At-0.214
Token old
Token ID1468
Feature activation-7.86e-02

Pos logit contributions
e+0.357
 supposed+0.307
oof+0.279
 image+0.271
ow+0.269
Neg logit contributions
Finally-0.355
boards-0.337
Soon-0.327
Exc-0.312
At-0.311
Token.
Token ID13
Feature activation-7.78e-02

Pos logit contributions
e+0.164
 supposed+0.152
oof+0.139
 yours+0.132
ow+0.131
Neg logit contributions
boards-0.104
Finally-0.097
Exc-0.092
orting-0.090
 Finally-0.088
Token Sam
Token ID3409
Feature activation-8.77e-02

Pos logit contributions
e+0.166
 supposed+0.146
oof+0.139
ow+0.137
 image+0.136
Neg logit contributions
Finally-0.101
boards-0.095
Soon-0.090
Days-0.088
orting-0.085
Token was
Token ID373
Feature activation-9.54e-02

Pos logit contributions
e+0.154
 supposed+0.135
 image+0.121
oof+0.120
ow+0.118
Neg logit contributions
Finally-0.150
boards-0.140
Soon-0.136
Eventually-0.130
orting-0.129
Token very
Token ID845
Feature activation-1.04e-01

Pos logit contributions
e+0.152
oof+0.147
 possession+0.134
leaf+0.127
 image+0.127
Neg logit contributions
Finally-0.142
 Finally-0.141
Soon-0.140
At-0.132
Eventually-0.130
Token<|endoftext|>
Token ID50256
Feature activation-8.65e-02

Pos logit contributions
e+0.148
 supposed+0.133
oof+0.123
 image+0.122
ow+0.121
Neg logit contributions
Finally-0.077
boards-0.071
Soon-0.067
Days-0.063
orting-0.063
TokenM
Token ID44
Feature activation-9.29e-02

Pos logit contributions
e+0.201
 supposed+0.174
oof+0.162
ow+0.160
 image+0.158
Neg logit contributions
Finally-0.103
Days-0.093
boards-0.092
Soon-0.090
Eventually-0.086
Tokenolly
Token ID5098
Feature activation-8.90e-02

Pos logit contributions
e+0.207
 supposed+0.192
 image+0.174
 possession+0.167
oof+0.167
Neg logit contributions
Finally-0.120
Soon-0.103
boards-0.101
Eventually-0.098
At-0.096
Token was
Token ID373
Feature activation-1.24e-01

Pos logit contributions
e+0.169
 supposed+0.158
oof+0.141
 image+0.139
ow+0.135
Neg logit contributions
Finally-0.130
boards-0.126
Soon-0.118
Exc-0.117
At-0.117
Token 3
Token ID513
Feature activation-1.40e-01

Pos logit contributions
e+0.190
 possession+0.175
oof+0.169
 image+0.169
leaf+0.161
Neg logit contributions
Finally-0.213
Soon-0.206
 Finally-0.203
Eventually-0.198
At-0.193
Token years
Token ID812
Feature activation-2.05e-01

Pos logit contributions
 supposed+0.196
 yours+0.190
oof+0.174
e+0.173
leaf+0.162
Neg logit contributions
boards-0.237
Finally-0.235
Exc-0.231
Days-0.229
 until-0.225
Token old
Token ID1468
Feature activation-8.23e-02

Pos logit contributions
e+0.347
 supposed+0.300
oof+0.269
 image+0.265
ow+0.260
Neg logit contributions
Finally-0.365
boards-0.345
Soon-0.335
Eventually-0.320
Exc-0.319
Token and
Token ID290
Feature activation-1.14e-01

Pos logit contributions
e+0.172
 supposed+0.163
oof+0.152
 yours+0.142
ow+0.140
Neg logit contributions
boards-0.102
Exc-0.098
-0.098
Finally-0.097
orting-0.091
Token very
Token ID845
Feature activation-1.08e-01

Pos logit contributions
e+0.250
 supposed+0.227
oof+0.225
 image+0.221
ow+0.214
Neg logit contributions
Finally-0.130
Soon-0.117
 Finally-0.117
boards-0.108
Eventually-0.106
Token excited
Token ID6568
Feature activation-7.09e-02

Pos logit contributions
e+0.182
 supposed+0.156
oof+0.152
 image+0.140
ow+0.136
Neg logit contributions
Finally-0.183
Soon-0.168
boards-0.165
 Finally-0.164
Eventually-0.160
Token.
Token ID13
Feature activation-8.59e-02

Pos logit contributions
e+0.158
oof+0.125
 supposed+0.125
ez+0.114
 tal+0.114
Neg logit contributions
boards-0.096
ways-0.079
Finally-0.078
side-0.077
orting-0.076
Token.
Token ID13
Feature activation-6.61e-02

Pos logit contributions
e+0.058
 supposed+0.050
oof+0.048
ow+0.045
 image+0.045
Neg logit contributions
Finally-0.055
boards-0.055
Soon-0.051
At-0.049
orting-0.049
Token<|endoftext|>
Token ID50256
Feature activation-8.37e-02

Pos logit contributions
e+0.151
 supposed+0.136
oof+0.127
 image+0.125
ow+0.124
Neg logit contributions
Finally-0.077
boards-0.071
Soon-0.067
Days-0.063
orting-0.063
TokenMax
Token ID11518
Feature activation-1.04e-01

Pos logit contributions
e+0.192
 supposed+0.167
oof+0.154
ow+0.153
 image+0.151
Neg logit contributions
Finally-0.102
boards-0.092
Days-0.091
Soon-0.089
Eventually-0.085
Token was
Token ID373
Feature activation-1.21e-01

Pos logit contributions
e+0.191
 supposed+0.175
oof+0.157
 image+0.157
ow+0.153
Neg logit contributions
Finally-0.158
boards-0.150
Soon-0.144
At-0.141
Exc-0.140
Token 3
Token ID513
Feature activation-1.38e-01

Pos logit contributions
e+0.181
 image+0.160
 possession+0.154
oof+0.151
 supposed+0.150
Neg logit contributions
Finally-0.223
Soon-0.214
Eventually-0.204
 Finally-0.203
At-0.201
Token years
Token ID812
Feature activation-2.05e-01

Pos logit contributions
 supposed+0.187
 yours+0.173
e+0.168
oof+0.163
leaf+0.151
Neg logit contributions
boards-0.249
Finally-0.244
Exc-0.239
Days-0.238
Soon-0.228
Token old
Token ID1468
Feature activation-8.02e-02

Pos logit contributions
e+0.347
 supposed+0.301
oof+0.269
 image+0.265
ow+0.261
Neg logit contributions
Finally-0.364
boards-0.344
Soon-0.334
Eventually-0.319
Exc-0.318
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
e+0.171
 supposed+0.164
oof+0.151
 yours+0.142
ow+0.141
Neg logit contributions
boards-0.097
-0.090
Exc-0.090
Finally-0.090
orting-0.084
Token loved
Token ID6151
Feature activation-7.27e-02

Pos logit contributions
e+0.253
 supposed+0.228
oof+0.227
 image+0.222
ow+0.221
Neg logit contributions
Finally-0.126
Soon-0.115
 Finally-0.112
Then-0.103
boards-0.102
Token to
Token ID284
Feature activation-1.03e-01

Pos logit contributions
e+0.149
oof+0.122
ow+0.114
 supposed+0.113
 image+0.102
Neg logit contributions
boards-0.104
Soon-0.096
Finally-0.095
At-0.093
Exc-0.091
Token stir
Token ID11240
Feature activation+6.12e-03

Pos logit contributions
e+0.185
 supposed+0.184
oof+0.165
 possession+0.159
ow+0.156
Neg logit contributions
Finally-0.145
boards-0.134
At-0.134
Soon-0.133
Exc-0.126
Token named
Token ID3706
Feature activation-1.56e-01

Pos logit contributions
 supposed+0.211
leaf+0.195
e+0.184
 yours+0.183
oof+0.182
Neg logit contributions
Finally-0.202
At-0.200
Then-0.194
Soon-0.190
Exc-0.189
Token Emma
Token ID18966
Feature activation-9.60e-02

Pos logit contributions
e+0.329
 supposed+0.296
 image+0.269
oof+0.269
ow+0.265
Neg logit contributions
Finally-0.218
boards-0.202
Soon-0.194
Eventually-0.183
Exc-0.181
Token who
Token ID508
Feature activation-6.41e-02

Pos logit contributions
e+0.185
 supposed+0.170
oof+0.161
ow+0.161
leaf+0.161
Neg logit contributions
Finally-0.127
At-0.118
Exc-0.115
Soon-0.114
Then-0.113
Token was
Token ID373
Feature activation-1.28e-01

Pos logit contributions
e+0.116
 supposed+0.105
oof+0.100
 image+0.099
ow+0.096
Neg logit contributions
Finally-0.099
Soon-0.088
At-0.087
boards-0.085
Days-0.084
Token three
Token ID1115
Feature activation-1.19e-01

Pos logit contributions
oof+0.206
e+0.204
 possession+0.197
leaf+0.196
 tal+0.191
Neg logit contributions
Soon-0.182
Finally-0.180
 Finally-0.171
At-0.168
add-0.166
Token years
Token ID812
Feature activation-2.05e-01

Pos logit contributions
e+0.169
 supposed+0.153
oof+0.134
 yours+0.129
ow+0.127
Neg logit contributions
Finally-0.226
boards-0.221
Exc-0.214
Days-0.210
Soon-0.209
Token old
Token ID1468
Feature activation-8.27e-02

Pos logit contributions
e+0.320
 supposed+0.283
 image+0.251
oof+0.243
ow+0.241
Neg logit contributions
Finally-0.391
boards-0.365
Soon-0.356
Eventually-0.345
Days-0.339
Token.
Token ID13
Feature activation-8.91e-02

Pos logit contributions
e+0.160
 supposed+0.145
oof+0.129
ow+0.126
 yours+0.124
Neg logit contributions
Finally-0.121
boards-0.119
Exc-0.111
Soon-0.110
orting-0.108
Token She
Token ID1375
Feature activation+1.56e-02

Pos logit contributions
e+0.149
 supposed+0.142
 image+0.131
 such+0.124
 ear+0.120
Neg logit contributions
Finally-0.146
boards-0.135
Soon-0.133
Eventually-0.131
orting-0.131
Token was
Token ID373
Feature activation-9.55e-02

Pos logit contributions
Finally+0.028
boards+0.026
Soon+0.025
Eventually+0.024
At+0.024
Neg logit contributions
e-0.027
 supposed-0.023
 image-0.021
oof-0.021
ow-0.020
Token very
Token ID845
Feature activation-1.02e-01

Pos logit contributions
e+0.159
oof+0.152
 tal+0.145
 possession+0.142
keeping+0.136
Neg logit contributions
Soon-0.137
 Finally-0.135
Finally-0.134
At-0.129
 Days-0.122
TokenOnce
Token ID7454
Feature activation-3.11e-02

Pos logit contributions
e+0.344
 supposed+0.334
 image+0.310
oof+0.305
ow+0.300
Neg logit contributions
Finally-0.094
boards-0.080
Soon-0.075
At-0.071
Exc-0.067
Token there
Token ID612
Feature activation+1.04e-02

Pos logit contributions
e+0.056
 supposed+0.051
ow+0.045
 image+0.045
oof+0.044
Neg logit contributions
Finally-0.050
boards-0.048
Soon-0.045
Days-0.045
Exc-0.044
Token was
Token ID373
Feature activation-1.35e-01

Pos logit contributions
Finally+0.017
boards+0.015
Soon+0.015
Eventually+0.014
Days+0.014
Neg logit contributions
e-0.019
 supposed-0.017
 image-0.015
oof-0.015
ow-0.015
Token a
Token ID257
Feature activation-1.32e-01

Pos logit contributions
e+0.189
 supposed+0.179
 possession+0.178
 image+0.172
beh+0.169
Neg logit contributions
Soon-0.246
Finally-0.238
There-0.230
At-0.227
Then-0.225
Token three
Token ID1115
Feature activation-1.34e-01

Pos logit contributions
e+0.231
 supposed+0.201
 image+0.183
oof+0.181
ow+0.174
Neg logit contributions
Finally-0.223
boards-0.215
Soon-0.202
Eventually-0.196
orting-0.196
Token-
Token ID12
Feature activation-2.04e-01

Pos logit contributions
e+0.212
 supposed+0.198
oof+0.169
 yours+0.165
ow+0.163
Neg logit contributions
Finally-0.240
boards-0.231
Soon-0.223
Exc-0.219
Days-0.216
Tokenyear
Token ID1941
Feature activation-1.69e-01

Pos logit contributions
e+0.361
 supposed+0.349
oof+0.322
 image+0.313
 fright+0.302
Neg logit contributions
Finally-0.313
Soon-0.291
Eventually-0.278
boards-0.272
Days-0.269
Token-
Token ID12
Feature activation-1.91e-01

Pos logit contributions
e+0.279
 supposed+0.246
 such+0.220
oof+0.218
 yours+0.200
Neg logit contributions
boards-0.284
Finally-0.277
add-0.272
Soon-0.269
Days-0.268
Tokenold
Token ID727
Feature activation-1.34e-01

Pos logit contributions
e+0.217
 supposed+0.194
 image+0.168
oof+0.166
 fright+0.158
Neg logit contributions
Finally-0.426
Soon-0.405
boards-0.401
Eventually-0.393
Days-0.384
Token who
Token ID508
Feature activation-6.95e-02

Pos logit contributions
 supposed+0.274
e+0.243
oof+0.236
 yours+0.232
 ours+0.225
Neg logit contributions
boards-0.154
-0.147
 until-0.144
Exc-0.140
 Finally-0.131
Token found
Token ID1043
Feature activation-4.99e-03

Pos logit contributions
e+0.129
 supposed+0.121
oof+0.114
 image+0.112
 yours+0.108
Neg logit contributions
Finally-0.104
boards-0.095
Soon-0.092
At-0.090
 Finally-0.088
Token Lucy
Token ID22162
Feature activation-1.08e-01

Pos logit contributions
e+0.335
 supposed+0.304
oof+0.283
 image+0.283
 possession+0.267
Neg logit contributions
Finally-0.208
boards-0.198
Exc-0.190
Soon-0.188
 Finally-0.183
Token.
Token ID13
Feature activation-9.07e-02

Pos logit contributions
e+0.203
 supposed+0.185
 image+0.179
oof+0.179
leaf+0.167
Neg logit contributions
Finally-0.149
At-0.135
Exc-0.134
-0.133
boards-0.132
Token She
Token ID1375
Feature activation+1.07e-03

Pos logit contributions
e+0.176
 supposed+0.154
oof+0.145
 image+0.144
ow+0.143
Neg logit contributions
Finally-0.133
boards-0.126
Soon-0.120
Days-0.118
orting-0.115
Token was
Token ID373
Feature activation-1.14e-01

Pos logit contributions
Finally+0.002
Soon+0.001
boards+0.001
At+0.001
Exc+0.001
Neg logit contributions
e-0.002
 supposed-0.002
 image-0.002
oof-0.002
ow-0.002
Token three
Token ID1115
Feature activation-1.12e-01

Pos logit contributions
e+0.174
oof+0.167
 possession+0.162
 image+0.156
 tal+0.153
Neg logit contributions
Finally-0.180
Soon-0.177
 Finally-0.169
At-0.168
Eventually-0.165
Token years
Token ID812
Feature activation-2.04e-01

Pos logit contributions
e+0.128
 supposed+0.121
oof+0.110
 yours+0.105
ow+0.098
Neg logit contributions
boards-0.229
Finally-0.228
Exc-0.223
Days-0.219
At-0.218
Token old
Token ID1468
Feature activation-7.87e-02

Pos logit contributions
e+0.355
 supposed+0.306
oof+0.278
 image+0.270
ow+0.267
Neg logit contributions
Finally-0.351
boards-0.333
Soon-0.324
Exc-0.309
At-0.308
Token and
Token ID290
Feature activation-1.03e-01

Pos logit contributions
e+0.163
 supposed+0.149
oof+0.137
ow+0.129
 yours+0.129
Neg logit contributions
boards-0.105
Finally-0.101
Exc-0.094
orting-0.093
Soon-0.091
Token loved
Token ID6151
Feature activation-7.02e-02

Pos logit contributions
e+0.202
 supposed+0.179
oof+0.172
ow+0.167
 image+0.165
Neg logit contributions
Finally-0.147
Soon-0.135
boards-0.130
At-0.130
 Finally-0.127
Token to
Token ID284
Feature activation-1.06e-01

Pos logit contributions
e+0.142
oof+0.117
ow+0.109
 supposed+0.106
 image+0.099
Neg logit contributions
boards-0.097
Finally-0.094
Soon-0.092
At-0.092
Exc-0.089
Token explore
Token ID7301
Feature activation+4.75e-03

Pos logit contributions
e+0.185
 supposed+0.178
oof+0.157
 possession+0.152
ow+0.148
Neg logit contributions
Finally-0.161
At-0.147
Soon-0.145
boards-0.144
Exc-0.142
Token Lily
Token ID20037
Feature activation-1.17e-01

Pos logit contributions
e+0.226
 supposed+0.200
 image+0.175
oof+0.175
ow+0.162
Neg logit contributions
Finally-0.273
boards-0.264
Soon-0.254
orting-0.244
Exc-0.243
Token.
Token ID13
Feature activation-9.19e-02

Pos logit contributions
e+0.204
 supposed+0.179
 image+0.163
oof+0.163
ow+0.153
Neg logit contributions
Finally-0.194
boards-0.184
Soon-0.176
At-0.170
Eventually-0.169
Token She
Token ID1375
Feature activation-5.07e-03

Pos logit contributions
e+0.185
 supposed+0.155
oof+0.153
ow+0.148
 image+0.147
Neg logit contributions
Finally-0.131
boards-0.125
Days-0.119
Soon-0.117
Eventually-0.112
Token was
Token ID373
Feature activation-1.14e-01

Pos logit contributions
e+0.009
 supposed+0.008
 image+0.007
oof+0.007
ow+0.007
Neg logit contributions
Finally-0.008
boards-0.008
Soon-0.008
Eventually-0.007
At-0.007
Token three
Token ID1115
Feature activation-1.14e-01

Pos logit contributions
e+0.168
oof+0.158
 possession+0.149
 image+0.142
 tal+0.133
Neg logit contributions
Finally-0.194
Soon-0.188
Eventually-0.179
 Finally-0.178
At-0.175
Token years
Token ID812
Feature activation-2.04e-01

Pos logit contributions
e+0.124
 supposed+0.117
oof+0.106
 yours+0.098
ow+0.096
Neg logit contributions
Finally-0.242
boards-0.240
Exc-0.231
Days-0.228
Soon-0.227
Token old
Token ID1468
Feature activation-8.10e-02

Pos logit contributions
e+0.332
 supposed+0.290
 image+0.256
oof+0.254
ow+0.249
Neg logit contributions
Finally-0.374
boards-0.352
Soon-0.343
Eventually-0.330
At-0.326
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
e+0.168
 supposed+0.156
oof+0.144
ow+0.137
 yours+0.133
Neg logit contributions
boards-0.107
Finally-0.101
Exc-0.095
orting-0.093
 Finally-0.091
Token liked
Token ID8288
Feature activation-7.53e-02

Pos logit contributions
e+0.195
 supposed+0.172
oof+0.162
ow+0.157
 image+0.156
Neg logit contributions
Finally-0.168
Soon-0.154
boards-0.153
At-0.147
 Finally-0.145
Token to
Token ID284
Feature activation-1.10e-01

Pos logit contributions
e+0.147
oof+0.121
 supposed+0.117
ow+0.114
 image+0.105
Neg logit contributions
boards-0.110
Finally-0.106
Soon-0.103
At-0.102
Exc-0.100
Token have
Token ID423
Feature activation-5.23e-02

Pos logit contributions
e+0.183
 supposed+0.180
oof+0.154
 possession+0.149
ow+0.146
Neg logit contributions
Finally-0.174
At-0.158
Soon-0.158
boards-0.156
Exc-0.154
Token Sally
Token ID25737
Feature activation-1.13e-01

Pos logit contributions
e+0.294
 supposed+0.259
 image+0.232
oof+0.232
ow+0.228
Neg logit contributions
Finally-0.265
boards-0.248
Soon-0.241
Eventually-0.230
At-0.228
Token.
Token ID13
Feature activation-8.94e-02

Pos logit contributions
e+0.203
 supposed+0.181
 image+0.164
oof+0.163
ow+0.158
Neg logit contributions
Finally-0.182
boards-0.171
Soon-0.165
At-0.159
Exc-0.158
Token She
Token ID1375
Feature activation-2.78e-03

Pos logit contributions
e+0.168
 supposed+0.149
oof+0.142
 image+0.137
ow+0.135
Neg logit contributions
Finally-0.130
boards-0.126
Days-0.121
Soon-0.117
Eventually-0.112
Token was
Token ID373
Feature activation-1.17e-01

Pos logit contributions
e+0.005
 supposed+0.005
 image+0.004
oof+0.004
ow+0.004
Neg logit contributions
Finally-0.004
Soon-0.004
boards-0.004
At-0.004
Exc-0.004
Token three
Token ID1115
Feature activation-1.16e-01

Pos logit contributions
e+0.183
oof+0.174
 possession+0.170
 image+0.161
 tal+0.156
Neg logit contributions
Finally-0.186
Soon-0.181
 Finally-0.173
Eventually-0.171
At-0.169
Token years
Token ID812
Feature activation-2.03e-01

Pos logit contributions
e+0.133
 supposed+0.124
oof+0.115
 yours+0.109
ow+0.105
Neg logit contributions
Finally-0.238
boards-0.237
Exc-0.232
Days-0.227
At-0.226
Token old
Token ID1468
Feature activation-8.19e-02

Pos logit contributions
e+0.341
 supposed+0.295
oof+0.263
 image+0.261
ow+0.256
Neg logit contributions
Finally-0.364
boards-0.343
Soon-0.334
Eventually-0.320
At-0.318
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
e+0.176
 supposed+0.163
oof+0.151
ow+0.143
 yours+0.142
Neg logit contributions
boards-0.103
Finally-0.094
Exc-0.091
orting-0.087
 Finally-0.087
Token very
Token ID845
Feature activation-1.03e-01

Pos logit contributions
e+0.203
 supposed+0.179
oof+0.170
ow+0.167
 image+0.163
Neg logit contributions
Finally-0.157
Soon-0.143
boards-0.139
At-0.137
 Finally-0.135
Token happy
Token ID3772
Feature activation-4.14e-02

Pos logit contributions
e+0.151
 supposed+0.128
oof+0.113
 image+0.111
ow+0.108
Neg logit contributions
Finally-0.204
boards-0.192
Soon-0.189
Eventually-0.181
At-0.180
Token.
Token ID13
Feature activation-7.07e-02

Pos logit contributions
e+0.080
 supposed+0.068
oof+0.063
ow+0.060
 image+0.058
Neg logit contributions
Finally-0.062
boards-0.062
Soon-0.058
 Finally-0.055
Exc-0.055
Token Amy
Token ID14235
Feature activation-1.17e-01

Pos logit contributions
e+0.294
 supposed+0.259
 image+0.232
oof+0.232
ow+0.228
Neg logit contributions
Finally-0.265
boards-0.248
Soon-0.241
Eventually-0.230
At-0.228
Token.
Token ID13
Feature activation-8.74e-02

Pos logit contributions
e+0.213
 supposed+0.197
 image+0.180
oof+0.178
 possession+0.172
Neg logit contributions
Finally-0.173
boards-0.165
Soon-0.155
At-0.154
Exc-0.154
Token She
Token ID1375
Feature activation-2.03e-04

Pos logit contributions
e+0.174
 supposed+0.148
 image+0.145
oof+0.144
ow+0.139
Neg logit contributions
Finally-0.126
boards-0.119
Days-0.116
Soon-0.114
orting-0.111
Token was
Token ID373
Feature activation-1.16e-01

Pos logit contributions
e+0.000
 supposed+0.000
 image+0.000
oof+0.000
ow+0.000
Neg logit contributions
Finally-0.000
boards-0.000
Soon-0.000
At-0.000
Exc-0.000
Token three
Token ID1115
Feature activation-1.15e-01

Pos logit contributions
e+0.177
 possession+0.168
oof+0.167
 image+0.157
 tal+0.155
Neg logit contributions
Finally-0.184
Soon-0.181
 Finally-0.172
At-0.168
Eventually-0.167
Token years
Token ID812
Feature activation-2.03e-01

Pos logit contributions
e+0.135
 supposed+0.131
oof+0.121
 yours+0.115
leaf+0.111
Neg logit contributions
boards-0.228
Finally-0.225
Exc-0.221
Days-0.216
At-0.215
Token old
Token ID1468
Feature activation-8.17e-02

Pos logit contributions
e+0.350
 supposed+0.301
oof+0.272
 image+0.266
ow+0.263
Neg logit contributions
Finally-0.356
boards-0.337
Soon-0.327
Exc-0.312
At-0.311
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
e+0.178
 supposed+0.167
oof+0.155
 yours+0.146
ow+0.146
Neg logit contributions
boards-0.100
Exc-0.087
Finally-0.087
 Finally-0.082
 until-0.082
Token loved
Token ID6151
Feature activation-7.12e-02

Pos logit contributions
e+0.203
 supposed+0.180
oof+0.174
ow+0.168
 image+0.161
Neg logit contributions
Finally-0.149
Soon-0.137
boards-0.134
 Finally-0.132
At-0.131
Token to
Token ID284
Feature activation-1.07e-01

Pos logit contributions
e+0.145
oof+0.117
ow+0.112
 supposed+0.109
othy+0.100
Neg logit contributions
boards-0.098
Finally-0.094
Soon-0.091
At-0.091
Exc-0.088
Token have
Token ID423
Feature activation-5.12e-02

Pos logit contributions
e+0.184
 supposed+0.175
oof+0.152
 possession+0.149
ow+0.146
Neg logit contributions
Finally-0.167
At-0.152
Soon-0.151
boards-0.149
Exc-0.146
Token little
Token ID1310
Feature activation-1.29e-01

Pos logit contributions
e+0.243
 supposed+0.215
ow+0.197
oof+0.195
 image+0.188
Neg logit contributions
Finally-0.210
boards-0.190
Soon-0.189
At-0.180
Eventually-0.180
Token girl
Token ID2576
Feature activation-1.38e-01

Pos logit contributions
e+0.247
 supposed+0.231
oof+0.210
ow+0.198
leaf+0.188
Neg logit contributions
Finally-0.181
Soon-0.168
boards-0.166
Exc-0.163
 Finally-0.158
Token who
Token ID508
Feature activation-5.97e-02

Pos logit contributions
e+0.231
 supposed+0.220
oof+0.204
 image+0.196
 possession+0.188
Neg logit contributions
Finally-0.218
boards-0.198
Soon-0.196
At-0.195
 Finally-0.190
Token was
Token ID373
Feature activation-1.30e-01

Pos logit contributions
e+0.109
 supposed+0.103
oof+0.097
 image+0.094
 yours+0.089
Neg logit contributions
Finally-0.084
boards-0.079
At-0.077
Soon-0.077
 Finally-0.073
Token three
Token ID1115
Feature activation-1.19e-01

Pos logit contributions
e+0.192
oof+0.184
 image+0.165
 possession+0.163
 supposed+0.162
Neg logit contributions
Finally-0.218
Soon-0.209
At-0.198
Eventually-0.196
 Finally-0.194
Token years
Token ID812
Feature activation-2.03e-01

Pos logit contributions
e+0.134
 supposed+0.129
oof+0.121
 yours+0.109
ow+0.103
Neg logit contributions
Finally-0.248
boards-0.245
Exc-0.236
Days-0.232
Soon-0.232
Token old
Token ID1468
Feature activation-7.71e-02

Pos logit contributions
e+0.325
 supposed+0.284
 image+0.252
oof+0.246
ow+0.244
Neg logit contributions
Finally-0.379
boards-0.355
Soon-0.346
Eventually-0.334
Days-0.330
Token.
Token ID13
Feature activation-7.04e-02

Pos logit contributions
e+0.149
 supposed+0.138
oof+0.127
ow+0.119
 image+0.116
Neg logit contributions
boards-0.111
Finally-0.109
Exc-0.101
Soon-0.099
orting-0.099
Token On
Token ID1550
Feature activation-1.49e-02

Pos logit contributions
e+0.150
 supposed+0.133
oof+0.125
ow+0.123
 image+0.123
Neg logit contributions
Finally-0.089
boards-0.086
Soon-0.079
Days-0.078
orting-0.077
Token her
Token ID607
Feature activation-4.62e-02

Pos logit contributions
e+0.023
 supposed+0.022
oof+0.019
 image+0.018
ow+0.017
Neg logit contributions
Finally-0.027
Soon-0.026
boards-0.025
At-0.025
side-0.025
Token third
Token ID2368
Feature activation-4.50e-02

Pos logit contributions
e+0.086
 supposed+0.076
oof+0.075
leaf+0.071
 yours+0.069
Neg logit contributions
Finally-0.072
Soon-0.066
At-0.065
Days-0.064
Eventually-0.063
Token Tim
Token ID5045
Feature activation-1.14e-01

Pos logit contributions
e+0.289
 supposed+0.255
 image+0.232
oof+0.228
ow+0.218
Neg logit contributions
Finally-0.274
boards-0.265
Soon-0.251
Exc-0.246
 Finally-0.242
Token.
Token ID13
Feature activation-9.62e-02

Pos logit contributions
e+0.183
 supposed+0.162
 image+0.152
oof+0.148
leaf+0.143
Neg logit contributions
Finally-0.203
boards-0.186
Soon-0.185
At-0.182
Exc-0.180
Token Tim
Token ID5045
Feature activation-9.10e-02

Pos logit contributions
e+0.146
 supposed+0.124
oof+0.119
 image+0.112
ow+0.111
Neg logit contributions
Finally-0.183
boards-0.174
Days-0.172
Soon-0.168
Eventually-0.163
Token was
Token ID373
Feature activation-1.18e-01

Pos logit contributions
e+0.163
 supposed+0.145
 image+0.134
oof+0.133
ow+0.130
Neg logit contributions
Finally-0.146
boards-0.135
Soon-0.133
At-0.129
Eventually-0.127
Token three
Token ID1115
Feature activation-1.12e-01

Pos logit contributions
e+0.180
oof+0.160
 possession+0.157
 image+0.157
leaf+0.157
Neg logit contributions
Finally-0.200
Soon-0.197
 Finally-0.184
Eventually-0.182
At-0.181
Token years
Token ID812
Feature activation-2.03e-01

Pos logit contributions
e+0.126
 supposed+0.115
oof+0.101
 yours+0.094
ow+0.093
Neg logit contributions
boards-0.242
Finally-0.240
Exc-0.232
Days-0.227
Soon-0.226
Token old
Token ID1468
Feature activation-7.74e-02

Pos logit contributions
e+0.325
 supposed+0.285
 image+0.252
oof+0.248
ow+0.244
Neg logit contributions
Finally-0.378
boards-0.354
Soon-0.345
Eventually-0.333
Days-0.329
Token.
Token ID13
Feature activation-7.63e-02

Pos logit contributions
e+0.155
 supposed+0.140
oof+0.128
ow+0.122
 image+0.121
Neg logit contributions
boards-0.110
Finally-0.108
orting-0.098
Exc-0.098
Soon-0.098
Token He
Token ID679
Feature activation+1.74e-02

Pos logit contributions
e+0.164
 supposed+0.149
oof+0.142
 image+0.139
ow+0.137
Neg logit contributions
Finally-0.095
boards-0.087
Soon-0.084
Days-0.082
Eventually-0.079
Token was
Token ID373
Feature activation-9.51e-02

Pos logit contributions
Finally+0.029
boards+0.028
Soon+0.027
Eventually+0.026
orting+0.025
Neg logit contributions
e-0.031
 supposed-0.027
 image-0.025
oof-0.024
ow-0.024
Token always
Token ID1464
Feature activation-2.40e-02

Pos logit contributions
e+0.164
oof+0.149
 possession+0.138
leaf+0.135
 image+0.135
Neg logit contributions
Finally-0.137
 Finally-0.136
Soon-0.134
At-0.126
 Days-0.124
Token Tyler
Token ID14886
Feature activation-1.38e-01

Pos logit contributions
e+0.292
 supposed+0.259
oof+0.234
 image+0.233
ow+0.222
Neg logit contributions
Finally-0.253
boards-0.246
Soon-0.233
Exc-0.228
At-0.223
Token.
Token ID13
Feature activation-8.59e-02

Pos logit contributions
e+0.227
 supposed+0.199
 image+0.176
oof+0.176
ow+0.171
Neg logit contributions
Finally-0.248
boards-0.233
Soon-0.227
Eventually-0.218
At-0.216
Token She
Token ID1375
Feature activation+9.15e-04

Pos logit contributions
e+0.163
 supposed+0.142
oof+0.133
ow+0.131
 image+0.123
Neg logit contributions
Finally-0.132
boards-0.125
Days-0.119
Soon-0.118
orting-0.115
Token was
Token ID373
Feature activation-1.14e-01

Pos logit contributions
Finally+0.001
Soon+0.001
boards+0.001
At+0.001
Eventually+0.001
Neg logit contributions
e-0.002
 supposed-0.002
oof-0.001
 image-0.001
ow-0.001
Token three
Token ID1115
Feature activation-1.12e-01

Pos logit contributions
e+0.181
oof+0.177
 possession+0.163
 image+0.156
 tal+0.155
Neg logit contributions
Finally-0.181
Soon-0.177
Eventually-0.167
 Finally-0.167
At-0.165
Token years
Token ID812
Feature activation-2.03e-01

Pos logit contributions
e+0.125
 supposed+0.119
oof+0.111
 yours+0.106
ow+0.100
Neg logit contributions
boards-0.231
Finally-0.231
Exc-0.223
Days-0.221
At-0.219
Token old
Token ID1468
Feature activation-7.78e-02

Pos logit contributions
e+0.325
 supposed+0.285
 image+0.252
oof+0.247
ow+0.244
Neg logit contributions
Finally-0.377
boards-0.354
Soon-0.344
Eventually-0.332
Days-0.328
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
e+0.159
 supposed+0.146
oof+0.135
ow+0.128
 image+0.126
Neg logit contributions
boards-0.105
Finally-0.102
Exc-0.094
Soon-0.093
orting-0.092
Token was
Token ID373
Feature activation-1.01e-01

Pos logit contributions
e+0.198
 supposed+0.175
oof+0.167
 image+0.164
ow+0.161
Neg logit contributions
Finally-0.156
Soon-0.144
boards-0.142
At-0.138
 Finally-0.135
Token having
Token ID1719
Feature activation-5.15e-02

Pos logit contributions
e+0.166
oof+0.155
 image+0.135
 possession+0.134
leaf+0.134
Neg logit contributions
Finally-0.160
 Finally-0.154
Soon-0.151
Eventually-0.146
At-0.145
Token a
Token ID257
Feature activation-9.75e-02

Pos logit contributions
e+0.084
oof+0.072
 image+0.069
 supposed+0.069
ow+0.069
Neg logit contributions
Finally-0.088
boards-0.085
Soon-0.082
At-0.080
Eventually-0.080
Token was
Token ID373
Feature activation-3.31e-02

Pos logit contributions
e+0.030
 supposed+0.029
oof+0.026
 image+0.025
 yours+0.024
Neg logit contributions
Finally-0.028
boards-0.025
Soon-0.025
At-0.024
Exc-0.024
Token so
Token ID523
Feature activation-3.64e-02

Pos logit contributions
oof+0.054
e+0.052
 possession+0.049
 image+0.048
 supposed+0.047
Neg logit contributions
Finally-0.052
Soon-0.049
Eventually-0.048
At-0.047
Days-0.045
Token surprised
Token ID6655
Feature activation+2.21e-03

Pos logit contributions
e+0.063
 supposed+0.057
oof+0.054
 image+0.052
 possession+0.050
Neg logit contributions
Finally-0.061
Soon-0.056
Eventually-0.053
At-0.053
boards-0.052
Token,
Token ID11
Feature activation+6.46e-02

Pos logit contributions
Finally+0.004
boards+0.004
Soon+0.004
orting+0.003
Eventually+0.003
Neg logit contributions
e-0.004
 supposed-0.003
oof-0.003
 image-0.003
ow-0.003
Token but
Token ID475
Feature activation-1.35e-02

Pos logit contributions
Finally+0.076
boards+0.068
Soon+0.068
Exc+0.063
At+0.063
Neg logit contributions
e-0.148
 supposed-0.134
oof-0.124
 image-0.123
ow-0.121
Token he
Token ID339
Feature activation+9.80e-02

Pos logit contributions
e+0.027
 supposed+0.025
oof+0.022
 image+0.022
ow+0.022
Neg logit contributions
Finally-0.020
boards-0.018
Soon-0.018
Eventually-0.017
At-0.017
Token knew
Token ID2993
Feature activation+1.45e-02

Pos logit contributions
Finally+0.147
boards+0.131
Soon+0.131
At+0.127
Eventually+0.125
Neg logit contributions
e-0.189
 supposed-0.175
oof-0.161
 image-0.158
ow-0.153
Token he
Token ID339
Feature activation+1.12e-01

Pos logit contributions
Finally+0.019
Soon+0.017
boards+0.017
orting+0.016
Eventually+0.016
Neg logit contributions
e-0.031
 supposed-0.028
oof-0.026
 image-0.026
ow-0.024
Token'd
Token ID1549
Feature activation+1.62e-02

Pos logit contributions
Finally+0.122
boards+0.116
At+0.106
Exc+0.103
Soon+0.102
Neg logit contributions
e-0.249
 supposed-0.223
oof-0.223
 image-0.216
ow-0.208
Token made
Token ID925
Feature activation+3.50e-02

Pos logit contributions
Finally+0.029
Soon+0.026
At+0.026
boards+0.026
Days+0.025
Neg logit contributions
e-0.026
 supposed-0.022
oof-0.021
 image-0.021
ow-0.021
Token a
Token ID257
Feature activation-2.23e-02

Pos logit contributions
Soon+0.044
Finally+0.044
 until+0.044
add+0.042
Eventually+0.041
Neg logit contributions
e-0.060
 supposed-0.058
 image-0.055
 Amen-0.053
oof-0.053
Token she
Token ID673
Feature activation+6.54e-02

Pos logit contributions
Finally+0.041
Soon+0.037
boards+0.036
At+0.036
 Finally+0.033
Neg logit contributions
e-0.102
 supposed-0.092
oof-0.089
ow-0.085
leaf-0.084
Token wanted
Token ID2227
Feature activation-2.23e-02

Pos logit contributions
Finally+0.089
boards+0.083
Soon+0.079
Exc+0.078
At+0.078
Neg logit contributions
e-0.133
 supposed-0.119
oof-0.114
ow-0.113
 image-0.110
Token to
Token ID284
Feature activation-5.53e-02

Pos logit contributions
e+0.038
 supposed+0.033
oof+0.031
 image+0.029
ow+0.029
Neg logit contributions
Finally-0.038
boards-0.037
Soon-0.035
At-0.034
Exc-0.033
Token write
Token ID3551
Feature activation+4.10e-02

Pos logit contributions
e+0.096
 supposed+0.093
oof+0.083
ow+0.080
 possession+0.079
Neg logit contributions
Finally-0.085
At-0.078
Soon-0.075
Exc-0.071
Inside-0.070
Token about
Token ID546
Feature activation+1.93e-02

Pos logit contributions
Finally+0.082
boards+0.078
Soon+0.076
Eventually+0.073
At+0.072
Neg logit contributions
e-0.060
 supposed-0.052
oof-0.045
 image-0.044
ow-0.044
Token it
Token ID340
Feature activation+5.28e-02

Pos logit contributions
Finally+0.024
Exc+0.023
boards+0.023
Soon+0.023
At+0.022
Neg logit contributions
e-0.038
 supposed-0.035
ow-0.033
oof-0.032
vous-0.031
Token in
Token ID287
Feature activation-1.56e-02

Pos logit contributions
Finally+0.104
boards+0.097
Soon+0.095
Eventually+0.093
Exc+0.091
Neg logit contributions
e-0.081
 supposed-0.070
 image-0.061
ow-0.059
oof-0.059
Token her
Token ID607
Feature activation-4.06e-03

Pos logit contributions
e+0.025
 supposed+0.023
ow+0.022
oof+0.022
leaf+0.021
Neg logit contributions
Finally-0.025
At-0.024
Soon-0.024
side-0.022
stairs-0.022
Token diary
Token ID26339
Feature activation-5.76e-03

Pos logit contributions
e+0.007
 supposed+0.007
oof+0.006
ow+0.006
leaf+0.006
Neg logit contributions
Finally-0.006
At-0.006
boards-0.006
Soon-0.006
-0.006
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
e+0.010
 supposed+0.008
 image+0.007
oof+0.007
ow+0.007
Neg logit contributions
Finally-0.010
boards-0.010
Soon-0.010
Eventually-0.009
At-0.009
Token She
Token ID1375
Feature activation+5.92e-02

Pos logit contributions
e+0.041
 supposed+0.039
 image+0.036
oof+0.035
ow+0.035
Neg logit contributions
Finally-0.032
Days-0.030
boards-0.030
Soon-0.029
orting-0.028
Token name
Token ID1438
Feature activation-3.37e-02

Pos logit contributions
Finally+0.030
boards+0.029
Days+0.028
 Finally+0.027
Soon+0.027
Neg logit contributions
e-0.040
 supposed-0.033
 yours-0.031
oof-0.031
 seen-0.030
Token was
Token ID373
Feature activation-2.93e-02

Pos logit contributions
e+0.066
 supposed+0.062
oof+0.056
ow+0.055
 image+0.054
Neg logit contributions
Finally-0.047
boards-0.046
Soon-0.042
At-0.041
Exc-0.041
Token R
Token ID371
Feature activation-3.86e-02

Pos logit contributions
e+0.063
 supposed+0.057
 possession+0.053
 image+0.053
oof+0.052
Neg logit contributions
Finally-0.033
Soon-0.031
boards-0.030
 Finally-0.029
At-0.028
Tokenia
Token ID544
Feature activation-4.79e-02

Pos logit contributions
e+0.066
 supposed+0.057
oof+0.051
 image+0.051
ow+0.050
Neg logit contributions
Finally-0.068
boards-0.064
Soon-0.063
Eventually-0.060
At-0.059
Token and
Token ID290
Feature activation-4.48e-02

Pos logit contributions
e+0.097
 supposed+0.095
leaf+0.088
oof+0.084
 image+0.081
Neg logit contributions
Finally-0.053
Exc-0.052
boards-0.052
At-0.051
orting-0.048
Token she
Token ID673
Feature activation+8.08e-02

Pos logit contributions
e+0.118
 supposed+0.106
oof+0.100
ow+0.100
 image+0.099
Neg logit contributions
Soon-0.028
Finally-0.027
 they-0.026
Exc-0.026
At-0.025
Token was
Token ID373
Feature activation-3.98e-02

Pos logit contributions
Finally+0.094
At+0.086
Soon+0.083
boards+0.082
Then+0.081
Neg logit contributions
e-0.160
 supposed-0.152
ow-0.147
 yours-0.146
oof-0.145
Token three
Token ID1115
Feature activation-4.39e-02

Pos logit contributions
e+0.068
 tal+0.068
oof+0.067
 possession+0.064
keeping+0.064
Neg logit contributions
 Finally-0.052
Finally-0.050
Soon-0.049
 Days-0.047
About-0.047
Token-
Token ID12
Feature activation-1.06e-01

Pos logit contributions
e+0.058
 supposed+0.054
oof+0.050
 yours+0.049
ow+0.046
Neg logit contributions
boards-0.083
Exc-0.079
Finally-0.079
Days-0.077
At-0.075
Tokenyears
Token ID19002
Feature activation-9.01e-02

Pos logit contributions
e+0.155
 supposed+0.142
 image+0.137
oof+0.135
 tal+0.129
Neg logit contributions
Finally-0.192
Soon-0.179
boards-0.176
Days-0.174
Eventually-0.172
Token-
Token ID12
Feature activation-1.07e-01

Pos logit contributions
e+0.159
 supposed+0.131
 Amen+0.125
 seen+0.124
 fright+0.121
Neg logit contributions
boards-0.144
orting-0.125
Days-0.125
add-0.124
Exc-0.118
Token not
Token ID407
Feature activation-2.59e-02

Pos logit contributions
e+0.071
 supposed+0.064
oof+0.057
 image+0.057
leaf+0.056
Neg logit contributions
Finally-0.073
Soon-0.065
At-0.064
Days-0.063
boards-0.063
Token have
Token ID423
Feature activation+2.73e-02

Pos logit contributions
e+0.041
 supposed+0.036
oof+0.033
 image+0.032
 possession+0.031
Neg logit contributions
Finally-0.047
Soon-0.043
boards-0.043
At-0.042
Exc-0.041
Token dared
Token ID28765
Feature activation+8.07e-03

Pos logit contributions
Finally+0.051
boards+0.048
Soon+0.048
At+0.046
Days+0.046
Neg logit contributions
e-0.042
 supposed-0.034
 image-0.033
oof-0.032
ow-0.030
Token to
Token ID284
Feature activation-2.46e-02

Pos logit contributions
Finally+0.012
boards+0.012
Soon+0.011
orting+0.011
Days+0.011
Neg logit contributions
e-0.015
 supposed-0.013
 image-0.012
oof-0.012
leaf-0.012
Token climb
Token ID12080
Feature activation+5.46e-02

Pos logit contributions
e+0.044
 supposed+0.039
 possession+0.037
oof+0.037
leaf+0.036
Neg logit contributions
Finally-0.035
Soon-0.033
At-0.032
Exc-0.032
orting-0.031
Token the
Token ID262
Feature activation+6.31e-02

Pos logit contributions
Finally+0.090
boards+0.089
Soon+0.085
orting+0.083
Exc+0.081
Neg logit contributions
e-0.094
 supposed-0.083
 image-0.075
oof-0.075
leaf-0.071
Token big
Token ID1263
Feature activation+2.79e-02

Pos logit contributions
Finally+0.102
boards+0.094
Soon+0.093
Days+0.089
At+0.089
Neg logit contributions
e-0.121
 supposed-0.105
oof-0.097
ow-0.093
 image-0.092
Token tree
Token ID5509
Feature activation+3.20e-02

Pos logit contributions
Finally+0.040
boards+0.040
Soon+0.038
Exc+0.037
At+0.037
Neg logit contributions
e-0.054
 supposed-0.050
oof-0.044
ow-0.043
leaf-0.042
Token.
Token ID13
Feature activation-4.53e-04

Pos logit contributions
Finally+0.046
boards+0.044
Soon+0.042
At+0.041
orting+0.041
Neg logit contributions
e-0.062
 supposed-0.056
oof-0.053
leaf-0.051
 image-0.050
Token<|endoftext|>
Token ID50256
Feature activation-2.25e-02

Pos logit contributions
e+0.001
 supposed+0.001
oof+0.001
ow+0.001
 image+0.001
Neg logit contributions
Finally-0.001
boards-0.001
Days-0.001
Soon-0.001
-0.001
TokenOne
Token ID3198
Feature activation+6.51e-02

Pos logit contributions
e+0.054
 supposed+0.050
 image+0.046
oof+0.046
ow+0.045
Neg logit contributions
Finally-0.023
boards-0.021
Soon-0.020
Eventually-0.019
At-0.018
Token little
Token ID1310
Feature activation-4.13e-02

Pos logit contributions
e+0.054
 supposed+0.047
 image+0.044
oof+0.042
ow+0.040
Neg logit contributions
Finally-0.064
boards-0.062
Soon-0.058
Eventually-0.057
orting-0.057
Token girl
Token ID2576
Feature activation-5.34e-02

Pos logit contributions
e+0.086
 supposed+0.079
oof+0.070
ow+0.067
 image+0.067
Neg logit contributions
Finally-0.054
boards-0.052
Soon-0.052
Exc-0.050
orting-0.049
Token went
Token ID1816
Feature activation+2.22e-02

Pos logit contributions
 supposed+0.091
e+0.088
oof+0.083
 image+0.080
 yours+0.079
Neg logit contributions
Exc-0.077
At-0.077
boards-0.077
Finally-0.074
Soon-0.071
Token to
Token ID284
Feature activation-3.61e-02

Pos logit contributions
Finally+0.036
boards+0.036
Soon+0.035
orting+0.034
At+0.033
Neg logit contributions
e-0.039
 supposed-0.033
 image-0.031
oof-0.031
 possession-0.029
Token the
Token ID262
Feature activation+4.44e-02

Pos logit contributions
e+0.074
 supposed+0.073
ez+0.068
oof+0.068
 possession+0.067
Neg logit contributions
stairs-0.040
At-0.039
Soon-0.035
Outside-0.035
Finally-0.032
Token field
Token ID2214
Feature activation+4.92e-02

Pos logit contributions
Finally+0.079
boards+0.079
orting+0.072
ways+0.071
Eventually+0.071
Neg logit contributions
e-0.069
 supposed-0.061
 image-0.058
oof-0.053
ow-0.051
Token.
Token ID13
Feature activation-2.14e-02

Pos logit contributions
Finally+0.077
boards+0.074
Soon+0.071
At+0.071
side+0.069
Neg logit contributions
e-0.087
 supposed-0.080
oof-0.074
 image-0.071
leaf-0.068
Token She
Token ID1375
Feature activation+7.08e-02

Pos logit contributions
e+0.040
 supposed+0.035
ow+0.035
 image+0.034
 ear+0.032
Neg logit contributions
Finally-0.034
Soon-0.030
Eventually-0.029
Days-0.029
orting-0.028
Token saw
Token ID2497
Feature activation+3.98e-02

Pos logit contributions
Finally+0.119
boards+0.111
Soon+0.107
orting+0.105
Eventually+0.104
Neg logit contributions
e-0.129
 supposed-0.111
ow-0.099
 image-0.098
oof-0.098
Token the
Token ID262
Feature activation+4.92e-02

Pos logit contributions
boards+0.075
Finally+0.073
Soon+0.073
Exc+0.070
At+0.069
Neg logit contributions
e-0.061
 supposed-0.052
oof-0.047
 possession-0.042
 image-0.041
Token dull
Token ID19222
Feature activation+3.09e-02

Pos logit contributions
Finally+0.078
boards+0.076
Soon+0.069
orting+0.069
Exc+0.068
Neg logit contributions
e-0.089
 supposed-0.081
 image-0.077
oof-0.072
ow-0.070
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
boards+0.002
Finally+0.002
orting+0.002
Soon+0.002
At+0.002
Neg logit contributions
e-0.002
oof-0.002
 supposed-0.002
 image-0.002
leaf-0.002
Token
Token ID198
Feature activation+4.95e-02

Pos logit contributions
e+0.016
oof+0.014
 such+0.013
ow+0.012
 my+0.012
Neg logit contributions
Days-0.023
boards-0.021
Finally-0.021
fires-0.020
Soon-0.020
Token
Token ID198
Feature activation+4.91e-02

Pos logit contributions
boards+0.068
Finally+0.066
Days+0.061
+0.060
orting+0.059
Neg logit contributions
e-0.101
 supposed-0.087
oof-0.083
ow-0.080
 seen-0.076
TokenL
Token ID43
Feature activation-3.36e-02

Pos logit contributions
boards+0.068
 Pole+0.066
angers+0.057
orting+0.055
line+0.055
Neg logit contributions
e-0.101
oof-0.084
 supposed-0.083
 image-0.075
 my-0.071
Tokenily
Token ID813
Feature activation+4.97e-03

Pos logit contributions
e+0.079
 supposed+0.071
 image+0.066
oof+0.066
ow+0.065
Neg logit contributions
Finally-0.036
boards-0.035
Soon-0.032
orting-0.031
Exc-0.031
Token was
Token ID373
Feature activation-3.47e-02

Pos logit contributions
Finally+0.009
boards+0.008
Soon+0.008
orting+0.008
Eventually+0.008
Neg logit contributions
e-0.008
 supposed-0.007
 image-0.006
oof-0.006
ow-0.006
Token sad
Token ID6507
Feature activation+1.40e-02

Pos logit contributions
e+0.070
oof+0.063
 image+0.059
leaf+0.058
 possession+0.058
Neg logit contributions
Finally-0.046
Soon-0.043
Days-0.043
Eventually-0.041
At-0.040
Token because
Token ID780
Feature activation-1.20e-02

Pos logit contributions
Finally+0.035
boards+0.034
Days+0.032
+0.032
Eventually+0.032
Neg logit contributions
 supposed-0.012
e-0.011
ow-0.010
 possession-0.008
 image-0.008
Token she
Token ID673
Feature activation+8.14e-02

Pos logit contributions
e+0.030
 supposed+0.027
oof+0.026
ow+0.025
leaf+0.025
Neg logit contributions
Finally-0.011
Soon-0.011
At-0.010
boards-0.010
Exc-0.009
Token really
Token ID1107
Feature activation-2.30e-02

Pos logit contributions
Finally+0.126
boards+0.113
Soon+0.112
Days+0.109
At+0.108
Neg logit contributions
e-0.154
 supposed-0.132
oof-0.127
 image-0.125
 yours-0.123
Token wanted
Token ID2227
Feature activation-8.90e-04

Pos logit contributions
e+0.043
 supposed+0.038
 image+0.035
oof+0.035
ow+0.034
Neg logit contributions
Finally-0.034
Soon-0.031
boards-0.031
At-0.030
 Finally-0.030
Token pizza
Token ID14256
Feature activation+4.32e-02

Pos logit contributions
e+0.076
 supposed+0.071
ow+0.065
oof+0.062
ez+0.060
Neg logit contributions
boards-0.045
Exc-0.043
Finally-0.042
At-0.042
Soon-0.040
Token place
Token ID1295
Feature activation+1.51e-02

Pos logit contributions
Finally+0.078
boards+0.073
Soon+0.071
Eventually+0.069
Days+0.069
Neg logit contributions
e-0.071
 supposed-0.062
 image-0.055
oof-0.054
ow-0.053
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
boards+0.023
Finally+0.023
Soon+0.021
side+0.021
At+0.021
Neg logit contributions
e-0.028
 supposed-0.024
oof-0.023
 image-0.022
ow-0.021
Token He
Token ID679
Feature activation+8.05e-02

Pos logit contributions
e+0.027
 supposed+0.026
 such+0.022
oof+0.021
 image+0.021
Neg logit contributions
Finally-0.041
Days-0.041
orting-0.038
boards-0.038
Soon-0.038
Token put
Token ID1234
Feature activation+7.69e-02

Pos logit contributions
Finally+0.117
boards+0.110
Soon+0.105
At+0.104
Exc+0.103
Neg logit contributions
e-0.159
 supposed-0.144
oof-0.132
 image-0.132
ow-0.129
Token on
Token ID319
Feature activation+3.35e-03

Pos logit contributions
Finally+0.140
boards+0.135
Soon+0.131
At+0.125
 Finally+0.124
Neg logit contributions
e-0.120
 supposed-0.110
 image-0.095
oof-0.094
ow-0.093
Token his
Token ID465
Feature activation+3.26e-02

Pos logit contributions
Finally+0.005
boards+0.005
Soon+0.005
At+0.005
side+0.005
Neg logit contributions
 supposed-0.006
e-0.005
ow-0.005
beh-0.005
 image-0.005
Token favorite
Token ID4004
Feature activation-3.12e-02

Pos logit contributions
boards+0.046
Finally+0.045
At+0.041
 Finally+0.041
Soon+0.041
Neg logit contributions
e-0.061
 supposed-0.058
beh-0.052
ow-0.051
oof-0.049
Token blue
Token ID4171
Feature activation+1.27e-02

Pos logit contributions
 supposed+0.058
e+0.054
beh+0.051
ow+0.050
oof+0.045
Neg logit contributions
boards-0.047
Finally-0.042
At-0.042
Exc-0.040
-0.039
Token cap
Token ID1451
Feature activation-4.48e-04

Pos logit contributions
boards+0.020
Finally+0.019
At+0.018
+0.018
Soon+0.017
Neg logit contributions
 supposed-0.023
e-0.022
beh-0.019
ow-0.019
oof-0.018
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
e+0.001
 supposed+0.001
ow+0.001
 image+0.001
oof+0.001
Neg logit contributions
Finally-0.001
Soon-0.001
Eventually-0.001
Days-0.001
Exc-0.001
Token friends
Token ID2460
Feature activation-7.18e-02

Pos logit contributions
e+0.028
 supposed+0.024
ow+0.021
oof+0.020
 image+0.020
Neg logit contributions
Finally-0.036
boards-0.034
Soon-0.033
At-0.032
Eventually-0.032
Token.
Token ID13
Feature activation-5.63e-02

Pos logit contributions
e+0.107
 supposed+0.087
 image+0.075
ow+0.074
oof+0.071
Neg logit contributions
Finally-0.153
boards-0.140
Soon-0.140
Eventually-0.137
Days-0.134
Token One
Token ID1881
Feature activation+7.19e-02

Pos logit contributions
e+0.095
oof+0.085
 supposed+0.080
 yours+0.079
 image+0.078
Neg logit contributions
Finally-0.091
Days-0.085
Soon-0.083
Eventually-0.083
boards-0.082
Token day
Token ID1110
Feature activation+1.12e-02

Pos logit contributions
Finally+0.100
Soon+0.096
At+0.094
boards+0.089
Days+0.086
Neg logit contributions
e-0.143
 supposed-0.129
oof-0.115
ow-0.113
 image-0.113
Token,
Token ID11
Feature activation+4.49e-02

Pos logit contributions
Finally+0.020
boards+0.018
Days+0.018
Soon+0.018
Eventually+0.017
Neg logit contributions
e-0.019
 supposed-0.017
oof-0.015
ow-0.015
 image-0.014
Token Tim
Token ID5045
Feature activation-3.49e-02

Pos logit contributions
Finally+0.057
boards+0.052
Soon+0.050
Eventually+0.047
At+0.046
Neg logit contributions
e-0.100
 supposed-0.090
 image-0.083
oof-0.083
ow-0.081
Tokenmy
Token ID1820
Feature activation-3.71e-02

Pos logit contributions
e+0.055
 supposed+0.046
leaf+0.044
oof+0.044
 image+0.043
Neg logit contributions
Finally-0.061
boards-0.060
At-0.058
Soon-0.058
Exc-0.056
Token and
Token ID290
Feature activation-5.86e-02

Pos logit contributions
e+0.069
 supposed+0.062
oof+0.059
 image+0.058
ow+0.057
Neg logit contributions
Finally-0.055
boards-0.054
At-0.053
Soon-0.052
Exc-0.050
Token his
Token ID465
Feature activation-5.20e-03

Pos logit contributions
e+0.102
 supposed+0.091
ow+0.078
leaf+0.077
oof+0.077
Neg logit contributions
Finally-0.095
boards-0.093
Exc-0.092
At-0.092
Soon-0.092
Token friends
Token ID2460
Feature activation-5.46e-02

Pos logit contributions
e+0.006
 supposed+0.005
 image+0.004
oof+0.004
ow+0.004
Neg logit contributions
Finally-0.012
boards-0.011
Soon-0.011
Eventually-0.011
Exc-0.011
Token went
Token ID1816
Feature activation+7.54e-03

Pos logit contributions
e+0.104
 supposed+0.092
ow+0.081
 image+0.080
 seen+0.079
Neg logit contributions
Finally-0.089
boards-0.082
Soon-0.082
Eventually-0.077
orting-0.076
Token of
Token ID286
Feature activation-8.35e-02

Pos logit contributions
e+0.183
 supposed+0.161
oof+0.144
 image+0.142
ow+0.140
Neg logit contributions
Finally-0.189
boards-0.182
Soon-0.174
Exc-0.167
orting-0.166
Token buttons
Token ID12163
Feature activation-8.78e-02

Pos logit contributions
e+0.156
 supposed+0.137
ow+0.129
oof+0.124
 image+0.117
Neg logit contributions
Finally-0.128
boards-0.125
Exc-0.122
Soon-0.119
At-0.115
Token and
Token ID290
Feature activation-7.59e-02

Pos logit contributions
e+0.127
 supposed+0.111
ow+0.096
 image+0.094
oof+0.086
Neg logit contributions
Finally-0.185
Soon-0.167
Eventually-0.165
Days-0.165
boards-0.163
Token levers
Token ID48872
Feature activation-8.10e-02

Pos logit contributions
e+0.145
 supposed+0.136
oof+0.124
ow+0.120
 possession+0.116
Neg logit contributions
boards-0.105
Finally-0.104
Soon-0.100
Exc-0.097
At-0.096
Token.
Token ID13
Feature activation-5.91e-02

Pos logit contributions
e+0.122
 supposed+0.102
 image+0.089
ow+0.088
oof+0.083
Neg logit contributions
Finally-0.166
Soon-0.151
boards-0.150
Eventually-0.147
Days-0.147
Token They
Token ID1119
Feature activation+1.18e-02

Pos logit contributions
e+0.093
 a+0.078
 an+0.078
oof+0.076
 Mark+0.075
Neg logit contributions
Days-0.100
Finally-0.096
uing-0.095
boards-0.089
orting-0.088
Token want
Token ID765
Feature activation-6.88e-02

Pos logit contributions
Finally+0.021
boards+0.019
Soon+0.019
At+0.018
Exc+0.018
Neg logit contributions
e-0.020
 supposed-0.018
oof-0.016
 image-0.015
 possession-0.015
Token to
Token ID284
Feature activation-8.12e-02

Pos logit contributions
e+0.102
 supposed+0.087
 image+0.076
oof+0.076
ow+0.074
Neg logit contributions
Finally-0.137
boards-0.129
Soon-0.126
Eventually-0.122
At-0.121
Token press
Token ID1803
Feature activation-4.75e-02

Pos logit contributions
e+0.145
 supposed+0.139
 possession+0.121
 image+0.121
oof+0.120
Neg logit contributions
Finally-0.127
Soon-0.117
At-0.114
Exc-0.106
boards-0.105
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
e+0.075
 supposed+0.065
 image+0.057
oof+0.057
ow+0.056
Neg logit contributions
Finally-0.089
boards-0.084
Soon-0.082
Eventually-0.079
At-0.078
Token buttons
Token ID12163
Feature activation-6.59e-02

Pos logit contributions
Finally+0.018
boards+0.017
Soon+0.016
Exc+0.016
At+0.015
Neg logit contributions
e-0.021
 supposed-0.018
oof-0.016
ow-0.016
 image-0.015
Token leaves
Token ID5667
Feature activation+3.29e-02

Pos logit contributions
Finally+0.043
boards+0.040
Soon+0.039
Eventually+0.038
Days+0.037
Neg logit contributions
e-0.045
 supposed-0.039
 image-0.036
oof-0.035
ow-0.034
Token too
Token ID1165
Feature activation-3.93e-02

Pos logit contributions
Finally+0.057
boards+0.054
Soon+0.052
orting+0.050
At+0.049
Neg logit contributions
e-0.056
 supposed-0.049
oof-0.044
 image-0.044
ow-0.042
Token.
Token ID13
Feature activation+4.80e-03

Pos logit contributions
e+0.056
 supposed+0.049
ow+0.042
 image+0.042
oof+0.041
Neg logit contributions
Finally-0.080
boards-0.076
Soon-0.074
Eventually-0.072
At-0.071
Token But
Token ID887
Feature activation+2.39e-02

Pos logit contributions
Finally+0.007
side+0.007
Days+0.007
boards+0.007
Soon+0.007
Neg logit contributions
e-0.007
ow-0.007
 ear-0.007
oof-0.006
 supposed-0.006
Token he
Token ID339
Feature activation+1.18e-01

Pos logit contributions
Finally+0.038
boards+0.037
Soon+0.034
Days+0.033
Exc+0.033
Neg logit contributions
e-0.046
 supposed-0.039
ow-0.037
oof-0.036
 image-0.035
Token knew
Token ID2993
Feature activation+3.07e-02

Pos logit contributions
Finally+0.180
boards+0.164
Soon+0.161
At+0.154
Eventually+0.153
Neg logit contributions
e-0.226
 supposed-0.205
 image-0.185
oof-0.184
ow-0.178
Token his
Token ID465
Feature activation+5.29e-02

Pos logit contributions
Finally+0.038
boards+0.035
Soon+0.034
orting+0.032
At+0.032
Neg logit contributions
e-0.068
 supposed-0.061
 image-0.058
oof-0.056
ow-0.054
Token friend
Token ID1545
Feature activation-3.05e-02

Pos logit contributions
Finally+0.094
Days+0.084
Soon+0.084
boards+0.081
Eventually+0.081
Neg logit contributions
e-0.093
 supposed-0.075
leaf-0.070
oof-0.064
ow-0.064
Token was
Token ID373
Feature activation-7.67e-03

Pos logit contributions
e+0.062
 supposed+0.056
oof+0.052
 image+0.051
leaf+0.049
Neg logit contributions
Finally-0.039
boards-0.039
At-0.035
Soon-0.034
orting-0.034
Token right
Token ID826
Feature activation+2.10e-02

Pos logit contributions
e+0.014
oof+0.012
 supposed+0.011
 image+0.011
 possession+0.011
Neg logit contributions
Finally-0.012
Soon-0.011
At-0.010
orting-0.010
Days-0.010
Token.
Token ID13
Feature activation+9.86e-03

Pos logit contributions
Finally+0.034
boards+0.032
Soon+0.031
orting+0.030
Exc+0.029
Neg logit contributions
e-0.038
 supposed-0.034
oof-0.031
 image-0.030
ow-0.029
Token shampoo
Token ID47259
Feature activation-7.17e-02

Pos logit contributions
e+0.143
 supposed+0.141
ow+0.118
 image+0.117
oof+0.112
Neg logit contributions
Finally-0.158
Soon-0.152
boards-0.149
At-0.143
Exc-0.143
Token?"
Token ID1701
Feature activation-8.73e-02

Pos logit contributions
e+0.113
 supposed+0.097
 image+0.086
oof+0.086
ow+0.084
Neg logit contributions
Finally-0.136
boards-0.127
Soon-0.124
Eventually-0.120
Days-0.119
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
e+0.190
 supposed+0.174
 image+0.158
oof+0.157
ow+0.154
Neg logit contributions
Finally-0.111
boards-0.101
Soon-0.099
Exc-0.092
Eventually-0.092
Token
Token ID198
Feature activation-6.54e-03

Pos logit contributions
e+0.027
 supposed+0.025
 image+0.023
ow+0.023
oof+0.023
Neg logit contributions
Finally-0.010
Soon-0.009
boards-0.008
Exc-0.008
At-0.008
TokenJack
Token ID14295
Feature activation-5.53e-02

Pos logit contributions
e+0.016
 supposed+0.015
 image+0.014
oof+0.014
ow+0.014
Neg logit contributions
Finally-0.006
boards-0.006
Soon-0.005
Eventually-0.005
At-0.005
Token bark
Token ID21405
Feature activation-8.95e-02

Pos logit contributions
e+0.108
 supposed+0.103
oof+0.090
 image+0.089
 yours+0.087
Neg logit contributions
Finally-0.079
boards-0.074
Soon-0.070
Exc-0.070
At-0.069
Tokened
Token ID276
Feature activation-1.61e-02

Pos logit contributions
e+0.171
ow+0.140
 supposed+0.136
 image+0.130
oof+0.125
Neg logit contributions
Finally-0.153
Soon-0.136
Eventually-0.131
Days-0.127
boards-0.127
Token and
Token ID290
Feature activation-6.05e-02

Pos logit contributions
e+0.032
 supposed+0.031
oof+0.027
 image+0.026
ow+0.025
Neg logit contributions
Finally-0.022
Exc-0.021
boards-0.021
Soon-0.020
 Finally-0.019
Token nodded
Token ID14464
Feature activation-3.58e-03

Pos logit contributions
e+0.114
 supposed+0.103
 image+0.091
oof+0.091
ow+0.089
Neg logit contributions
Finally-0.095
boards-0.088
Soon-0.087
Exc-0.083
At-0.082
Token his
Token ID465
Feature activation-1.40e-02

Pos logit contributions
e+0.006
 supposed+0.005
oof+0.005
 image+0.005
ow+0.004
Neg logit contributions
Finally-0.006
boards-0.006
Soon-0.006
Exc-0.006
Eventually-0.006
Token head
Token ID1182
Feature activation-7.88e-02

Pos logit contributions
e+0.026
 supposed+0.023
 such+0.020
 seen+0.020
leaf+0.020
Neg logit contributions
Finally-0.023
Soon-0.021
Days-0.021
 Finally-0.020
Eventually-0.020
Token there
Token ID612
Feature activation+9.17e-02

Pos logit contributions
Finally+0.071
boards+0.066
Soon+0.063
Eventually+0.060
orting+0.060
Neg logit contributions
e-0.096
 supposed-0.085
 image-0.077
oof-0.077
ow-0.076
Token was
Token ID373
Feature activation-5.63e-02

Pos logit contributions
Finally+0.145
boards+0.135
Soon+0.131
At+0.125
Eventually+0.124
Neg logit contributions
e-0.171
 supposed-0.153
oof-0.138
 image-0.137
ow-0.134
Token a
Token ID257
Feature activation-6.44e-02

Pos logit contributions
beh+0.092
ow+0.090
e+0.089
 supposed+0.088
 possession+0.087
Neg logit contributions
Soon-0.090
There-0.088
Finally-0.088
At-0.085
Next-0.081
Token little
Token ID1310
Feature activation-5.79e-02

Pos logit contributions
e+0.116
 supposed+0.102
 image+0.092
oof+0.091
ow+0.089
Neg logit contributions
Finally-0.107
boards-0.100
Soon-0.097
Eventually-0.093
orting-0.092
Token girl
Token ID2576
Feature activation-7.52e-02

Pos logit contributions
e+0.126
 supposed+0.119
ow+0.104
oof+0.103
 seen+0.100
Neg logit contributions
Finally-0.068
boards-0.065
Soon-0.064
Exc-0.064
orting-0.060
Token named
Token ID3706
Feature activation-8.82e-02

Pos logit contributions
 supposed+0.127
leaf+0.117
e+0.111
 yours+0.110
oof+0.110
Neg logit contributions
At-0.105
Exc-0.104
Finally-0.103
Soon-0.099
-0.099
Token Lily
Token ID20037
Feature activation-3.96e-02

Pos logit contributions
e+0.168
 supposed+0.154
 image+0.137
oof+0.135
ow+0.134
Neg logit contributions
Finally-0.136
boards-0.131
Soon-0.122
orting-0.120
Eventually-0.119
Token.
Token ID13
Feature activation-4.52e-02

Pos logit contributions
e+0.084
leaf+0.080
 supposed+0.079
oof+0.079
ow+0.076
Neg logit contributions
At-0.037
Exc-0.035
Finally-0.035
Then-0.033
They-0.032
Token She
Token ID1375
Feature activation+6.17e-02

Pos logit contributions
e+0.071
 image+0.059
 supposed+0.059
oof+0.055
 ear+0.055
Neg logit contributions
Finally-0.081
Soon-0.074
Eventually-0.074
Days-0.073
boards-0.072
Token loved
Token ID6151
Feature activation-1.44e-02

Pos logit contributions
Finally+0.086
At+0.078
Soon+0.078
Then+0.074
boards+0.074
Neg logit contributions
e-0.121
 supposed-0.109
ow-0.107
oof-0.104
 yours-0.103
Token to
Token ID284
Feature activation-5.54e-02

Pos logit contributions
e+0.028
oof+0.023
 supposed+0.022
ow+0.022
 image+0.021
Neg logit contributions
boards-0.021
Soon-0.020
At-0.020
Finally-0.019
Exc-0.019
Token was
Token ID373
Feature activation-8.43e-02

Pos logit contributions
e+0.016
 supposed+0.013
 image+0.012
ow+0.012
oof+0.012
Neg logit contributions
Finally-0.016
Soon-0.014
boards-0.014
Eventually-0.014
orting-0.014
Token peaceful
Token ID12309
Feature activation-1.38e-02

Pos logit contributions
e+0.139
 supposed+0.117
oof+0.108
 image+0.106
ow+0.105
Neg logit contributions
Finally-0.154
Soon-0.143
boards-0.142
At-0.136
Eventually-0.135
Token.
Token ID13
Feature activation-5.99e-02

Pos logit contributions
e+0.025
 supposed+0.022
oof+0.020
 image+0.019
ow+0.019
Neg logit contributions
Finally-0.023
boards-0.022
Soon-0.021
orting-0.020
Eventually-0.020
Token<|endoftext|>
Token ID50256
Feature activation-8.09e-02

Pos logit contributions
e+0.126
 supposed+0.109
oof+0.106
ow+0.098
 image+0.097
Neg logit contributions
Finally-0.080
boards-0.079
Days-0.073
Soon-0.070
orting-0.068
TokenAnna
Token ID31160
Feature activation-9.50e-02

Pos logit contributions
e+0.204
 supposed+0.184
oof+0.170
 image+0.170
ow+0.168
Neg logit contributions
Finally-0.079
boards-0.069
Soon-0.066
Days-0.062
Eventually-0.062
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
e+0.175
 supposed+0.162
 image+0.148
oof+0.147
ow+0.143
Neg logit contributions
Finally-0.139
boards-0.134
Soon-0.128
At-0.127
Exc-0.124
Token Ben
Token ID3932
Feature activation-7.41e-02

Pos logit contributions
 supposed+0.207
e+0.194
 possession+0.186
 image+0.186
leaf+0.178
Neg logit contributions
 they-0.117
 They-0.112
Exc-0.109
-0.108
At-0.108
Token are
Token ID389
Feature activation-7.15e-02

Pos logit contributions
e+0.140
 supposed+0.123
oof+0.111
 image+0.111
ow+0.109
Neg logit contributions
Finally-0.118
boards-0.110
Soon-0.106
Eventually-0.102
Days-0.101
Token friends
Token ID2460
Feature activation-1.11e-01

Pos logit contributions
e+0.117
 image+0.100
 possession+0.097
 supposed+0.094
leaf+0.093
Neg logit contributions
Finally-0.126
Soon-0.120
Eventually-0.115
At-0.114
 Finally-0.113
Token.
Token ID13
Feature activation-8.96e-02

Pos logit contributions
 supposed+0.217
e+0.215
 image+0.195
oof+0.193
leaf+0.187
Neg logit contributions
Finally-0.141
orting-0.126
Soon-0.126
Exc-0.125
At-0.124
Token They
Token ID1119
Feature activation+8.36e-04

Pos logit contributions
e+0.215
oof+0.182
 supposed+0.175
ow+0.172
 image+0.167
Neg logit contributions
Finally-0.105
boards-0.103
Days-0.094
orting-0.090
Soon-0.088
Token to
Token ID284
Feature activation-4.95e-02

Pos logit contributions
e+0.017
 supposed+0.014
ow+0.013
 image+0.013
oof+0.012
Neg logit contributions
Finally-0.019
boards-0.018
Soon-0.017
At-0.017
Days-0.017
Token herself
Token ID5223
Feature activation+3.91e-02

Pos logit contributions
e+0.088
 supposed+0.087
 possession+0.076
oof+0.076
 image+0.075
Neg logit contributions
Finally-0.072
Soon-0.068
Days-0.062
Exc-0.062
Eventually-0.062
Token,
Token ID11
Feature activation+5.73e-02

Pos logit contributions
Finally+0.060
boards+0.056
Soon+0.054
Eventually+0.052
At+0.051
Neg logit contributions
e-0.075
 supposed-0.067
 image-0.060
oof-0.060
ow-0.059
Token "
Token ID366
Feature activation+3.99e-02

Pos logit contributions
boards+0.084
Finally+0.083
Days+0.073
Soon+0.073
add+0.073
Neg logit contributions
e-0.112
 supposed-0.096
ow-0.091
 ear-0.090
 image-0.088
TokenThis
Token ID1212
Feature activation+2.13e-02

Pos logit contributions
Finally+0.049
boards+0.047
Soon+0.043
orting+0.041
Eventually+0.040
Neg logit contributions
e-0.090
 supposed-0.079
oof-0.073
 image-0.072
ow-0.072
Token is
Token ID318
Feature activation-5.98e-02

Pos logit contributions
Finally+0.033
boards+0.032
Soon+0.030
At+0.029
+0.028
Neg logit contributions
e-0.039
 supposed-0.035
oof-0.033
ow-0.031
 image-0.031
Token too
Token ID1165
Feature activation-8.39e-02

Pos logit contributions
e+0.104
 supposed+0.077
 image+0.076
oof+0.075
leaf+0.073
Neg logit contributions
Finally-0.104
Soon-0.099
Days-0.093
At-0.093
Eventually-0.090
Token difficult
Token ID2408
Feature activation-1.29e-02

Pos logit contributions
e+0.152
 supposed+0.130
oof+0.119
 image+0.117
ow+0.117
Neg logit contributions
Finally-0.141
boards-0.130
Soon-0.130
At-0.122
Exc-0.122
Token!"
Token ID2474
Feature activation-2.97e-02

Pos logit contributions
e+0.023
 supposed+0.019
oof+0.018
 image+0.018
ow+0.017
Neg logit contributions
Finally-0.021
boards-0.020
Soon-0.019
orting-0.019
At-0.018
Token
Token ID198
Feature activation+3.21e-02

Pos logit contributions
e+0.062
ow+0.051
 supposed+0.050
oof+0.048
 such+0.047
Neg logit contributions
Finally-0.040
boards-0.039
Days-0.038
Soon-0.036
add-0.034
Token
Token ID198
Feature activation+3.38e-02

Pos logit contributions
boards+0.042
Finally+0.038
orting+0.035
ways+0.034
Soon+0.034
Neg logit contributions
e-0.070
 supposed-0.060
oof-0.057
ow-0.055
 image-0.053
Token,
Token ID11
Feature activation+2.10e-02

Pos logit contributions
e+0.065
 supposed+0.057
 image+0.052
oof+0.052
ow+0.051
Neg logit contributions
Finally-0.051
boards-0.047
Soon-0.046
Eventually-0.044
Days-0.043
Token there
Token ID612
Feature activation+6.78e-02

Pos logit contributions
Finally+0.035
boards+0.034
orting+0.031
Soon+0.031
Days+0.031
Neg logit contributions
e-0.039
 supposed-0.033
 image-0.030
ow-0.030
oof-0.030
Token was
Token ID373
Feature activation-8.10e-02

Pos logit contributions
Finally+0.111
boards+0.104
Soon+0.101
Eventually+0.096
At+0.096
Neg logit contributions
e-0.123
 supposed-0.109
oof-0.098
 image-0.098
ow-0.096
Token a
Token ID257
Feature activation-8.38e-02

Pos logit contributions
e+0.116
beh+0.112
 supposed+0.111
ow+0.110
 possession+0.107
Neg logit contributions
Soon-0.149
Finally-0.147
At-0.140
There-0.140
Then-0.136
Token little
Token ID1310
Feature activation-8.12e-02

Pos logit contributions
e+0.147
 supposed+0.129
 image+0.117
oof+0.115
ow+0.111
Neg logit contributions
Finally-0.142
boards-0.135
Soon-0.129
Eventually-0.125
orting-0.124
Token girl
Token ID2576
Feature activation-1.00e-01

Pos logit contributions
e+0.166
 supposed+0.154
oof+0.135
ow+0.134
 image+0.130
Neg logit contributions
Finally-0.109
boards-0.103
Soon-0.101
Exc-0.098
orting-0.095
Token named
Token ID3706
Feature activation-1.15e-01

Pos logit contributions
 supposed+0.167
leaf+0.151
e+0.142
oof+0.142
 yours+0.141
Neg logit contributions
At-0.145
Exc-0.140
Finally-0.140
Soon-0.137
Then-0.135
Token Lily
Token ID20037
Feature activation-6.43e-02

Pos logit contributions
e+0.216
 supposed+0.195
 image+0.175
ow+0.173
oof+0.173
Neg logit contributions
Finally-0.180
boards-0.176
Soon-0.162
orting-0.159
Eventually-0.158
Token.
Token ID13
Feature activation-6.85e-02

Pos logit contributions
e+0.131
 supposed+0.126
leaf+0.126
oof+0.122
ow+0.119
Neg logit contributions
At-0.064
Exc-0.061
Finally-0.060
They-0.057
Then-0.057
Token She
Token ID1375
Feature activation+3.92e-02

Pos logit contributions
e+0.111
 supposed+0.087
 image+0.087
oof+0.085
 ear+0.081
Neg logit contributions
Finally-0.125
Days-0.116
boards-0.116
Eventually-0.115
uing-0.114
Token loved
Token ID6151
Feature activation-3.47e-02

Pos logit contributions
Finally+0.056
Soon+0.051
At+0.051
boards+0.049
Then+0.048
Neg logit contributions
e-0.075
 supposed-0.069
ow-0.066
oof-0.064
 image-0.064
Token.
Token ID13
Feature activation-7.52e-02

Pos logit contributions
e+0.203
 supposed+0.197
oof+0.188
leaf+0.187
ow+0.179
Neg logit contributions
Finally-0.095
At-0.095
Exc-0.093
-0.085
Then-0.085
Token She
Token ID1375
Feature activation+2.81e-02

Pos logit contributions
e+0.148
 supposed+0.134
 image+0.128
oof+0.124
ow+0.120
Neg logit contributions
Finally-0.109
boards-0.100
Soon-0.098
Eventually-0.095
Days-0.095
Token was
Token ID373
Feature activation-8.77e-02

Pos logit contributions
Finally+0.041
Soon+0.037
At+0.037
boards+0.036
Then+0.035
Neg logit contributions
e-0.053
 supposed-0.049
ow-0.046
 image-0.046
oof-0.046
Token only
Token ID691
Feature activation-8.20e-02

Pos logit contributions
oof+0.144
e+0.143
 possession+0.141
 tal+0.140
rum+0.139
Neg logit contributions
Soon-0.119
Finally-0.119
 Finally-0.115
At-0.110
add-0.107
Token three
Token ID1115
Feature activation-7.54e-02

Pos logit contributions
e+0.158
oof+0.142
 supposed+0.139
 image+0.131
 tal+0.125
Neg logit contributions
Finally-0.102
Soon-0.099
 Finally-0.093
boards-0.091
About-0.091
Token years
Token ID812
Feature activation-1.67e-01

Pos logit contributions
e+0.078
 supposed+0.067
oof+0.058
ow+0.052
 image+0.052
Neg logit contributions
Finally-0.172
boards-0.171
Exc-0.163
Soon-0.162
Days-0.161
Token old
Token ID1468
Feature activation-4.87e-02

Pos logit contributions
e+0.231
 supposed+0.207
 image+0.185
ow+0.173
oof+0.169
Neg logit contributions
Finally-0.344
boards-0.316
Soon-0.311
Eventually-0.307
Days-0.299
Token and
Token ID290
Feature activation-7.85e-02

Pos logit contributions
e+0.093
 supposed+0.084
oof+0.076
ow+0.073
 image+0.072
Neg logit contributions
Finally-0.073
boards-0.071
Soon-0.066
Exc-0.065
orting-0.064
Token she
Token ID673
Feature activation+4.35e-02

Pos logit contributions
e+0.148
 supposed+0.129
oof+0.125
 image+0.120
ow+0.119
Neg logit contributions
Finally-0.118
Soon-0.108
boards-0.106
 Finally-0.103
At-0.101
Token loved
Token ID6151
Feature activation-4.36e-02

Pos logit contributions
Finally+0.064
boards+0.057
Soon+0.057
At+0.057
Days+0.055
Neg logit contributions
e-0.079
 supposed-0.073
oof-0.072
 yours-0.071
 image-0.070
Token going
Token ID1016
Feature activation+1.02e-02

Pos logit contributions
e+0.087
oof+0.071
ow+0.067
 supposed+0.065
 tal+0.061
Neg logit contributions
boards-0.061
Finally-0.058
Soon-0.058
At-0.058
Exc-0.056
Token<|endoftext|>
Token ID50256
Feature activation-1.31e-01

Pos logit contributions
Finally+0.141
boards+0.131
Soon+0.127
Eventually+0.122
orting+0.121
Neg logit contributions
e-0.157
 supposed-0.138
 image-0.124
oof-0.124
ow-0.121
TokenTom
Token ID13787
Feature activation-1.45e-01

Pos logit contributions
e+0.344
 supposed+0.334
 image+0.310
oof+0.305
ow+0.300
Neg logit contributions
Finally-0.094
boards-0.080
Soon-0.075
At-0.071
Exc-0.067
Token was
Token ID373
Feature activation-1.53e-01

Pos logit contributions
e+0.247
 supposed+0.236
 image+0.209
 possession+0.206
oof+0.203
Neg logit contributions
Finally-0.227
boards-0.220
Soon-0.206
At-0.206
Exc-0.204
Token a
Token ID257
Feature activation-1.41e-01

Pos logit contributions
e+0.224
 image+0.198
 possession+0.196
 supposed+0.184
oof+0.184
Neg logit contributions
Finally-0.277
Soon-0.268
Eventually-0.260
 Finally-0.258
At-0.250
Token big
Token ID1263
Feature activation-1.09e-01

Pos logit contributions
e+0.257
 supposed+0.229
oof+0.205
 image+0.203
ow+0.201
Neg logit contributions
Finally-0.232
boards-0.213
Soon-0.211
Exc-0.200
Eventually-0.200
Token boy
Token ID2933
Feature activation-1.69e-01

Pos logit contributions
e+0.219
 supposed+0.212
oof+0.182
ow+0.176
leaf+0.175
Neg logit contributions
Finally-0.141
boards-0.134
Soon-0.132
Exc-0.132
orting-0.129
Token who
Token ID508
Feature activation-8.20e-02

Pos logit contributions
 supposed+0.286
e+0.267
oof+0.254
 yours+0.247
 image+0.245
Neg logit contributions
Finally-0.250
boards-0.236
At-0.233
Soon-0.231
orting-0.229
Token liked
Token ID8288
Feature activation-9.36e-02

Pos logit contributions
e+0.154
 supposed+0.136
 image+0.122
oof+0.122
ow+0.121
Neg logit contributions
Finally-0.130
boards-0.121
Soon-0.117
Eventually-0.112
orting-0.111
Token to
Token ID284
Feature activation-1.18e-01

Pos logit contributions
e+0.199
oof+0.166
 supposed+0.164
ow+0.152
illa+0.143
Neg logit contributions
boards-0.131
Soon-0.119
Finally-0.117
At-0.117
Exc-0.116
Token play
Token ID711
Feature activation-4.31e-02

Pos logit contributions
e+0.199
 supposed+0.193
oof+0.165
 possession+0.158
ow+0.157
Neg logit contributions
Finally-0.188
boards-0.178
Soon-0.174
Exc-0.170
At-0.169
Token with
Token ID351
Feature activation-8.35e-02

Pos logit contributions
e+0.063
 supposed+0.046
 image+0.045
oof+0.042
ow+0.041
Neg logit contributions
Finally-0.091
boards-0.083
Soon-0.082
Eventually-0.082
Days-0.080
Token with
Token ID351
Feature activation-1.14e-01

Pos logit contributions
e+0.260
 supposed+0.227
oof+0.203
 image+0.202
ow+0.197
Neg logit contributions
Finally-0.262
boards-0.249
Soon-0.240
Eventually-0.229
At-0.228
Token a
Token ID257
Feature activation-1.32e-01

Pos logit contributions
e+0.197
 supposed+0.176
oof+0.158
ow+0.156
 image+0.155
Neg logit contributions
Finally-0.187
boards-0.185
Soon-0.174
Exc-0.169
At-0.169
Token picture
Token ID4286
Feature activation-1.14e-01

Pos logit contributions
e+0.253
 supposed+0.221
ow+0.211
oof+0.202
 image+0.188
Neg logit contributions
Finally-0.212
boards-0.194
Soon-0.190
At-0.184
Exc-0.182
Token of
Token ID286
Feature activation-9.73e-02

Pos logit contributions
e+0.192
 supposed+0.168
oof+0.150
 image+0.147
ow+0.145
Neg logit contributions
Finally-0.200
boards-0.193
Soon-0.184
At-0.176
orting-0.175
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
e+0.143
 supposed+0.126
ow+0.114
oof+0.107
 image+0.105
Neg logit contributions
Finally-0.178
boards-0.177
Soon-0.171
At-0.165
Exc-0.161
Token cat
Token ID3797
Feature activation-1.63e-01

Pos logit contributions
e+0.200
 supposed+0.175
ow+0.165
oof+0.158
 image+0.146
Neg logit contributions
Finally-0.192
boards-0.178
Soon-0.174
At-0.167
Exc-0.166
Token on
Token ID319
Feature activation-9.41e-02

Pos logit contributions
e+0.280
 supposed+0.257
oof+0.238
 image+0.229
 possession+0.224
Neg logit contributions
boards-0.249
Finally-0.248
Soon-0.231
orting-0.228
At-0.227
Token it
Token ID340
Feature activation-1.86e-02

Pos logit contributions
e+0.140
 supposed+0.129
oof+0.111
ow+0.109
 image+0.107
Neg logit contributions
Finally-0.178
boards-0.169
Soon-0.167
At-0.163
Exc-0.159
Token.
Token ID13
Feature activation-7.46e-02

Pos logit contributions
e+0.029
 supposed+0.025
 image+0.022
ow+0.022
oof+0.022
Neg logit contributions
Finally-0.035
boards-0.033
Soon-0.032
Eventually-0.031
Days-0.031
Token Lucy
Token ID22162
Feature activation-7.70e-02

Pos logit contributions
e+0.160
 supposed+0.141
oof+0.132
 image+0.130
ow+0.129
Neg logit contributions
Finally-0.097
boards-0.090
Soon-0.086
Days-0.083
orting-0.082
Token was
Token ID373
Feature activation-9.91e-02

Pos logit contributions
e+0.140
 supposed+0.130
oof+0.119
 image+0.116
 possession+0.113
Neg logit contributions
Finally-0.118
boards-0.111
Soon-0.106
At-0.105
Exc-0.104
Token Lily
Token ID20037
Feature activation-8.04e-02

Pos logit contributions
e+0.273
 supposed+0.246
 image+0.224
oof+0.223
ow+0.220
Neg logit contributions
Finally-0.181
boards-0.167
Soon-0.161
Eventually-0.152
orting-0.150
Token.
Token ID13
Feature activation-8.42e-02

Pos logit contributions
e+0.162
 supposed+0.155
leaf+0.155
oof+0.150
ow+0.145
Neg logit contributions
At-0.084
Finally-0.084
Then-0.078
Exc-0.077
Soon-0.076
Token Lily
Token ID20037
Feature activation-6.64e-02

Pos logit contributions
e+0.145
 supposed+0.117
oof+0.117
 image+0.114
ow+0.111
Neg logit contributions
Finally-0.147
boards-0.141
Days-0.135
Soon-0.134
Eventually-0.133
Token was
Token ID373
Feature activation-8.81e-02

Pos logit contributions
e+0.125
 supposed+0.116
ow+0.109
oof+0.108
 image+0.106
Neg logit contributions
Finally-0.093
At-0.086
Soon-0.085
boards-0.082
Then-0.082
Token 3
Token ID513
Feature activation-1.23e-01

Pos logit contributions
e+0.140
oof+0.136
 possession+0.131
leaf+0.129
keeping+0.128
Neg logit contributions
Soon-0.127
Finally-0.124
 Finally-0.122
At-0.116
Then-0.113
Token years
Token ID812
Feature activation-1.84e-01

Pos logit contributions
 supposed+0.158
e+0.155
 yours+0.151
leaf+0.141
oof+0.135
Neg logit contributions
boards-0.219
Exc-0.219
Finally-0.215
Days-0.213
Soon-0.205
Token old
Token ID1468
Feature activation-6.30e-02

Pos logit contributions
e+0.269
 supposed+0.240
 image+0.215
ow+0.203
oof+0.201
Neg logit contributions
Finally-0.365
boards-0.338
Soon-0.331
Eventually-0.324
Days-0.316
Token.
Token ID13
Feature activation-7.09e-02

Pos logit contributions
e+0.125
 supposed+0.113
oof+0.102
ow+0.099
 yours+0.097
Neg logit contributions
Finally-0.090
boards-0.087
Soon-0.082
Exc-0.081
orting-0.079
Token Lily
Token ID20037
Feature activation-5.43e-02

Pos logit contributions
e+0.124
 supposed+0.114
 image+0.105
oof+0.102
ow+0.100
Neg logit contributions
Finally-0.114
Days-0.106
Soon-0.105
boards-0.104
Eventually-0.102
Token wanted
Token ID2227
Feature activation-3.77e-02

Pos logit contributions
e+0.095
 supposed+0.084
oof+0.076
 image+0.075
ow+0.074
Neg logit contributions
Finally-0.091
boards-0.085
Soon-0.083
At-0.081
Eventually-0.079
Token to
Token ID284
Feature activation-7.52e-02

Pos logit contributions
e+0.072
 supposed+0.059
oof+0.059
 image+0.055
ow+0.053
Neg logit contributions
boards-0.055
Finally-0.054
Soon-0.053
At-0.052
Exc-0.049
Token Lucy
Token ID22162
Feature activation-9.00e-02

Pos logit contributions
e+0.275
 supposed+0.247
 image+0.225
oof+0.224
ow+0.222
Neg logit contributions
Finally-0.190
boards-0.179
Soon-0.170
Eventually-0.161
Exc-0.160
Token.
Token ID13
Feature activation-8.20e-02

Pos logit contributions
oof+0.175
 supposed+0.174
e+0.174
leaf+0.172
 image+0.167
Neg logit contributions
Finally-0.091
At-0.091
Then-0.086
-0.082
Soon-0.080
Token She
Token ID1375
Feature activation+1.93e-02

Pos logit contributions
e+0.146
 supposed+0.126
ow+0.120
oof+0.120
 image+0.117
Neg logit contributions
Finally-0.135
boards-0.132
Soon-0.121
Days-0.121
orting-0.120
Token was
Token ID373
Feature activation-9.73e-02

Pos logit contributions
Finally+0.028
Soon+0.026
At+0.026
boards+0.025
Then+0.025
Neg logit contributions
e-0.036
 supposed-0.034
ow-0.032
oof-0.032
 image-0.032
Token three
Token ID1115
Feature activation-9.57e-02

Pos logit contributions
e+0.158
oof+0.155
 possession+0.153
 tal+0.151
rum+0.150
Neg logit contributions
Soon-0.140
Finally-0.134
At-0.128
 Finally-0.128
Eventually-0.123
Token years
Token ID812
Feature activation-1.84e-01

Pos logit contributions
e+0.113
 supposed+0.106
oof+0.091
 yours+0.088
ow+0.085
Neg logit contributions
boards-0.196
Finally-0.195
Exc-0.191
Days-0.186
At-0.186
Token old
Token ID1468
Feature activation-6.30e-02

Pos logit contributions
e+0.257
 supposed+0.232
 image+0.207
ow+0.193
oof+0.190
Neg logit contributions
Finally-0.377
boards-0.349
Soon-0.339
Eventually-0.336
Days-0.327
Token and
Token ID290
Feature activation-9.18e-02

Pos logit contributions
e+0.124
 supposed+0.112
oof+0.101
ow+0.098
 image+0.096
Neg logit contributions
Finally-0.091
boards-0.089
Soon-0.083
Exc-0.083
orting-0.081
Token loved
Token ID6151
Feature activation-5.45e-02

Pos logit contributions
e+0.171
 supposed+0.151
oof+0.143
 image+0.139
ow+0.139
Neg logit contributions
Finally-0.140
Soon-0.129
boards-0.125
At-0.123
 Finally-0.121
Token to
Token ID284
Feature activation-9.10e-02

Pos logit contributions
e+0.110
oof+0.089
ow+0.084
 supposed+0.080
 tal+0.076
Neg logit contributions
boards-0.079
Soon-0.074
At-0.074
Finally-0.073
Exc-0.071
Token go
Token ID467
Feature activation-6.63e-03

Pos logit contributions
e+0.168
 supposed+0.166
 possession+0.146
oof+0.145
ow+0.140
Neg logit contributions
Finally-0.120
At-0.113
boards-0.111
Soon-0.111
Exc-0.107