TOP ACTIVATIONS
MAX = 0.187

thanked the star fish for letting them touch it and left
Thank you Mom my for letting me help ," said Lily
it can forgive you for letting it go . That 's
thank you to them for letting him see them . He
you to the park for letting us play here ."
" Thank you for letting us pet him ," Ben
as a thank you for letting it cook for them .
tree and thanked it for letting them enjoy its beauty .
he could serve others by letting them read his story .
, " Thank you for letting me borrow your microscope ,
, " Thank you for letting us know . Maybe one
had learned that love means letting go sometimes . She had
. And thank you for letting us play with Max .
different . Thank you for letting me play with it ."
thank you to Lily for letting it go . From that
" Thank you for letting us pet your dog ,"
, " Thank you for letting me fly your k ite
, " Thank you for letting me borrow your toy car
Okay , thank you for letting me play with it !"
it . Thank you for letting me paint your car .

MIN ACTIVATIONS
MIN = -0.223

Julie . She was a very polite and kind girl who
Gina . Gina was a very compassionate all igator , and
Julie . She was a very brave and adventurous girl who
Alex . Alex was a very obedient sw an and he
sight and both felt a very peaceful feeling inside .
. Tw inkle was a very obedient star . Every night
Molly . She was a very intelligent girl . Every day
it ! It was a very beautiful butterfly fl uttering around
Tim . Tim was a very good baby . He loved
Blue . Blue was a very interesting toy . He could
Tim . Tim was a very tidy kid . He always
Jack . He was a very compassionate boy , who always
Spot . Spot was a very good dog . He liked
Polly . Polly was a very talk ative par rot ,
Sprite . Sprite was a very fit and happy pup .
. Fl uffy was a very nos y cat . She
Bob was a very happy man , because he
Lucy . She was a very curious three year old and
bracelet . It was a very special bracelet with spark les
ie . He was a very calm bear and he liked

INTERVAL 0.084 - 0.187
CONTAINS 4.284%

Tim my didn 't want to clean his room because he
was so tired and wanted to go back to sleep .
But they forgot to set a timer or check
We should listen to our friends and respect their style .
. They did not want to wear hats and coats and

INTERVAL -0.018 - 0.084
CONTAINS 57.524%

taste , but also about kindness and friendship . <|endoftext|> Anna
here !" L ily comes to see . She
came in and said , " Time to turn off the
Lily told her about the bird 's broken wing and how
another way in . She saw a hole in the wall

INTERVAL -0.121 - -0.018
CONTAINS 37.044%

. One day , Tim was walking through
. <|endoftext|> One day , a girl named Sue had a
her . Then , a bright light shine through the trees
He would roll and roll and roll . Everywhere he went
a good friend . You did the right thing ."

INTERVAL -0.223 - -0.121
CONTAINS 1.148%

<|endoftext|> John and Jay were very good friends . They were
The little girl was very happy that she met the
warm l ily became a happy reminder of the special family
reached it , she was so excited that she started to
up , he felt a strange , t ingly feeling and
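Note on the INTERVAL blocks: their boundaries look like four equal-width bins spanning MIN to MAX. The dump does not state this, so treat it as an assumption. The sketch below recomputes the edges from the printed MIN and MAX and checks that the CONTAINS shares cover all samples. The helper name `equal_width_edges` is hypothetical, not part of any tool that produced this dump.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the dump (already rounded to 3 decimals there).
MIN_ACT, MAX_ACT = -0.223, 0.187
reported_edges = [-0.223, -0.121, -0.018, 0.084, 0.187]
reported_share = [1.148, 37.044, 57.524, 4.284]  # percent, lowest bin first

# Assumption: four equal-width bins between MIN and MAX.
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest disagreement between recomputed and printed edges.
max_gap = max(abs(e - r) for e, r in zip(edges, reported_edges))

# The four shares should account for every sampled token position.
total_share = sum(reported_share)
```

Under this assumption the recomputed edges agree with the printed ones to within the 3-decimal rounding of MIN and MAX, and the four shares add up to 100%. About 95% of positions fall in the two middle bins, so both the strongly positive contexts ("for letting ...") and the strongly negative ones ("was a very ...") are rare tails.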
Token thanked
Token ID26280
Feature activation+7.75e-02

Pos logit contributions
 rebuild+0.049
keeping+0.048
 find+0.047
 meet+0.046
 build+0.046
Neg logit contributions
ww-0.043
 brightly-0.042
 sudden-0.041
 problem-0.041
opus-0.039
Token the
Token ID262
Feature activation+5.51e-02

Pos logit contributions
 rebuild+0.136
ws+0.131
talk+0.130
keeping+0.129
iece+0.127
Neg logit contributions
 brightly-0.158
 sudden-0.148
ww-0.146
 problem-0.144
ls-0.140
Token star
Token ID3491
Feature activation-2.49e-02

Pos logit contributions
 rebuild+0.120
keeping+0.118
 find+0.115
 meet+0.114
iece+0.113
Neg logit contributions
ww-0.085
 brightly-0.084
 sudden-0.081
 problem-0.078
ls-0.076
Tokenfish
Token ID11084
Feature activation+1.75e-02

Pos logit contributions
ww+0.042
 brightly+0.042
 sudden+0.040
 problem+0.040
ls+0.037
Neg logit contributions
 rebuild-0.052
keeping-0.051
iece-0.049
 find-0.049
talk-0.049
Token for
Token ID329
Feature activation+8.62e-02

Pos logit contributions
 rebuild+0.035
keeping+0.034
talk+0.033
iece+0.033
 find+0.033
Neg logit contributions
 brightly-0.031
ww-0.030
 sudden-0.030
 problem-0.029
ls-0.027
Token letting
Token ID9616
Feature activation+1.72e-01

Pos logit contributions
 rebuild+0.159
talk+0.158
keeping+0.154
iece+0.152
race+0.150
Neg logit contributions
 brightly-0.164
ww-0.158
 sudden-0.156
 problem-0.150
 importantly-0.141
Token them
Token ID606
Feature activation+1.01e-01

Pos logit contributions
 rebuild+0.334
keeping+0.329
talk+0.321
iece+0.315
ws+0.313
Neg logit contributions
 brightly-0.311
ww-0.297
 problem-0.289
 sudden-0.285
ls-0.280
Token touch
Token ID3638
Feature activation+7.44e-02

Pos logit contributions
keeping+0.130
iece+0.124
talk+0.123
ws+0.119
 rebuild+0.119
Neg logit contributions
 brightly-0.244
ww-0.229
 problem-0.228
 sudden-0.227
ls-0.222
Token it
Token ID340
Feature activation+5.55e-03

Pos logit contributions
 rebuild+0.127
keeping+0.125
 find+0.118
iece+0.118
talk+0.117
Neg logit contributions
ww-0.152
 brightly-0.150
 sudden-0.148
 problem-0.146
ls-0.137
Token and
Token ID290
Feature activation-3.95e-02

Pos logit contributions
 rebuild+0.010
keeping+0.010
talk+0.009
iece+0.009
 find+0.009
Neg logit contributions
 brightly-0.011
ww-0.011
 sudden-0.011
 problem-0.011
ls-0.010
Token left
Token ID1364
Feature activation+6.18e-02

Pos logit contributions
 brightly+0.077
 sudden+0.073
ww+0.072
 problem+0.072
ls+0.068
Neg logit contributions
keeping-0.070
talk-0.069
 rebuild-0.069
iece-0.068
race-0.068
TokenThank
Token ID10449
Feature activation+3.84e-02

Pos logit contributions
 rebuild+0.037
keeping+0.037
iece+0.035
ws+0.034
 find+0.034
Neg logit contributions
 brightly-0.045
ww-0.045
 sudden-0.044
 problem-0.041
opus-0.039
Token you
Token ID345
Feature activation+6.66e-02

Pos logit contributions
 rebuild+0.089
keeping+0.088
talk+0.085
iece+0.084
 find+0.084
Neg logit contributions
 brightly-0.055
ww-0.054
 sudden-0.053
 problem-0.052
ls-0.048
Token Mom
Token ID11254
Feature activation+7.36e-02

Pos logit contributions
 find+0.133
 rebuild+0.133
keeping+0.133
 meet+0.130
 build+0.126
Neg logit contributions
 sudden-0.114
ww-0.111
 brightly-0.109
ls-0.103
 problem-0.103
Tokenmy
Token ID1820
Feature activation+5.73e-02

Pos logit contributions
 rebuild+0.142
keeping+0.137
iece+0.133
race+0.132
talk+0.132
Neg logit contributions
 brightly-0.134
ww-0.134
 problem-0.132
 sudden-0.123
ummy-0.122
Token for
Token ID329
Feature activation+9.28e-02

Pos logit contributions
 rebuild+0.121
keeping+0.117
iece+0.114
race+0.114
talk+0.114
Neg logit contributions
 brightly-0.094
ww-0.094
 problem-0.092
 sudden-0.087
ummy-0.082
Token letting
Token ID9616
Feature activation+1.71e-01

Pos logit contributions
 rebuild+0.188
talk+0.188
keeping+0.185
iece+0.184
race+0.180
Neg logit contributions
ww-0.159
 brightly-0.157
 sudden-0.153
 problem-0.150
opus-0.134
Token me
Token ID502
Feature activation+8.30e-02

Pos logit contributions
 rebuild+0.446
keeping+0.444
iece+0.442
talk+0.436
race+0.426
Neg logit contributions
ww-0.189
 brightly-0.187
 problem-0.177
 sudden-0.177
ummy-0.154
Token help
Token ID1037
Feature activation+9.99e-02

Pos logit contributions
keeping+0.094
iece+0.089
 rebuild+0.087
talk+0.084
ws+0.081
Neg logit contributions
 brightly-0.216
ww-0.214
 sudden-0.209
 problem-0.209
ls-0.198
Token,"
Token ID553
Feature activation+4.07e-02

Pos logit contributions
keeping+0.253
 rebuild+0.252
iece+0.251
talk+0.250
ws+0.244
Neg logit contributions
 brightly-0.122
ww-0.116
 sudden-0.112
 problem-0.111
ummy-0.102
Token said
Token ID531
Feature activation+1.69e-02

Pos logit contributions
 rebuild+0.086
keeping+0.084
talk+0.081
 find+0.081
iece+0.080
Neg logit contributions
 brightly-0.067
ww-0.066
 sudden-0.065
 problem-0.064
ls-0.060
Token Lily
Token ID20037
Feature activation+5.28e-02

Pos logit contributions
 rebuild+0.034
keeping+0.034
talk+0.032
iece+0.032
ws+0.032
Neg logit contributions
 brightly-0.029
ww-0.029
 sudden-0.028
 problem-0.028
ls-0.026
Token it
Token ID340
Feature activation-2.61e-02

Pos logit contributions
 brightly+0.102
ww+0.101
 sudden+0.100
 problem+0.095
ls+0.090
Neg logit contributions
 rebuild-0.140
keeping-0.137
 find-0.134
 meet-0.132
 build-0.131
Token can
Token ID460
Feature activation+4.84e-03

Pos logit contributions
 brightly+0.056
ww+0.055
 sudden+0.055
 problem+0.051
 importantly+0.050
Neg logit contributions
 rebuild-0.043
 find-0.041
 build-0.040
 meet-0.040
keeping-0.039
Token forgive
Token ID20927
Feature activation+6.95e-02

Pos logit contributions
keeping+0.009
iece+0.008
talk+0.007
ws+0.007
five+0.007
Neg logit contributions
 brightly-0.011
 problem-0.010
ww-0.010
 sudden-0.010
 lot-0.010
Token you
Token ID345
Feature activation+6.45e-02

Pos logit contributions
 rebuild+0.152
keeping+0.150
talk+0.144
iece+0.143
 find+0.143
Neg logit contributions
 brightly-0.109
ww-0.108
 sudden-0.105
 problem-0.103
ls-0.096
Token for
Token ID329
Feature activation+9.07e-02

Pos logit contributions
keeping+0.113
 rebuild+0.113
talk+0.109
iece+0.106
race+0.106
Neg logit contributions
 brightly-0.130
ww-0.126
 problem-0.124
 sudden-0.123
ls-0.116
Token letting
Token ID9616
Feature activation+1.70e-01

Pos logit contributions
 rebuild+0.178
keeping+0.175
talk+0.168
iece+0.167
 find+0.166
Neg logit contributions
 brightly-0.163
ww-0.162
 sudden-0.158
 problem-0.156
ls-0.146
Token it
Token ID340
Feature activation+8.05e-04

Pos logit contributions
keeping+0.465
 rebuild+0.463
talk+0.457
iece+0.457
ws+0.443
Neg logit contributions
ww-0.166
 brightly-0.165
 problem-0.162
 sudden-0.148
ls-0.137
Token go
Token ID467
Feature activation+7.44e-02

Pos logit contributions
keeping+0.001
 rebuild+0.001
talk+0.001
iece+0.001
race+0.001
Neg logit contributions
 brightly-0.002
ww-0.002
 problem-0.002
ls-0.002
 sudden-0.002
Token.
Token ID13
Feature activation+4.28e-02

Pos logit contributions
 rebuild+0.132
keeping+0.130
iece+0.126
talk+0.124
ws+0.122
Neg logit contributions
 brightly-0.147
ww-0.144
 problem-0.142
 sudden-0.141
ls-0.136
Token That
Token ID1320
Feature activation-5.59e-02

Pos logit contributions
keeping+0.077
 rebuild+0.077
talk+0.074
iece+0.074
ws+0.072
Neg logit contributions
 brightly-0.083
ww-0.083
 sudden-0.080
 problem-0.077
 importantly-0.073
Token's
Token ID338
Feature activation-1.95e-03

Pos logit contributions
 brightly+0.111
 sudden+0.109
ww+0.107
ls+0.104
opus+0.100
Neg logit contributions
 rebuild-0.097
 find-0.096
 meet-0.093
 build-0.092
keeping-0.091
Token thank
Token ID5875
Feature activation+4.65e-02

Pos logit contributions
 meet+0.054
 rebuild+0.054
 build+0.053
 find+0.053
 join+0.053
Neg logit contributions
 sudden-0.042
ww-0.040
 brightly-0.039
 problem-0.038
uddle-0.036
Token you
Token ID345
Feature activation+5.97e-02

Pos logit contributions
 rebuild+0.115
keeping+0.115
talk+0.110
ws+0.110
iece+0.109
Neg logit contributions
 brightly-0.061
ww-0.059
 sudden-0.056
 problem-0.054
ls-0.051
Token to
Token ID284
Feature activation+1.09e-01

Pos logit contributions
 rebuild+0.124
keeping+0.121
talk+0.116
iece+0.115
race+0.115
Neg logit contributions
 brightly-0.101
ww-0.099
 sudden-0.096
 problem-0.096
ls-0.090
Token them
Token ID606
Feature activation+7.38e-02

Pos logit contributions
keeping+0.229
iece+0.220
talk+0.217
 rebuild+0.213
race+0.208
Neg logit contributions
 brightly-0.180
 problem-0.171
 lot-0.169
 sudden-0.163
ww-0.163
Token for
Token ID329
Feature activation+8.98e-02

Pos logit contributions
keeping+0.168
 rebuild+0.166
talk+0.160
iece+0.159
ws+0.157
Neg logit contributions
 brightly-0.109
ww-0.104
 problem-0.102
 sudden-0.099
ls-0.096
Token letting
Token ID9616
Feature activation+1.70e-01

Pos logit contributions
talk+0.182
 rebuild+0.175
iece+0.174
keeping+0.172
race+0.168
Neg logit contributions
 brightly-0.153
 sudden-0.148
ww-0.147
 problem-0.145
 lot-0.135
Token him
Token ID683
Feature activation+7.02e-02

Pos logit contributions
 rebuild+0.312
keeping+0.308
iece+0.299
talk+0.293
ws+0.289
Neg logit contributions
 brightly-0.313
ww-0.305
 problem-0.302
ls-0.294
 sudden-0.291
Token see
Token ID766
Feature activation+8.05e-02

Pos logit contributions
keeping+0.088
iece+0.084
 rebuild+0.083
talk+0.077
race+0.076
Neg logit contributions
 brightly-0.171
ww-0.168
 problem-0.165
 sudden-0.162
ls-0.161
Token them
Token ID606
Feature activation+7.40e-02

Pos logit contributions
 rebuild+0.143
keeping+0.142
talk+0.136
iece+0.134
race+0.134
Neg logit contributions
 brightly-0.161
ww-0.155
 sudden-0.152
 problem-0.151
ls-0.146
Token.
Token ID13
Feature activation+3.85e-02

Pos logit contributions
 rebuild+0.137
keeping+0.136
talk+0.130
iece+0.129
race+0.128
Neg logit contributions
 brightly-0.140
ww-0.136
 sudden-0.133
 problem-0.133
ls-0.126
Token He
Token ID679
Feature activation-2.10e-03

Pos logit contributions
 rebuild+0.065
keeping+0.064
talk+0.061
 find+0.061
 meet+0.061
Neg logit contributions
ww-0.079
 brightly-0.078
 problem-0.077
 sudden-0.077
ls-0.072
Token you
Token ID345
Feature activation+6.21e-02

Pos logit contributions
keeping+0.134
 rebuild+0.133
iece+0.127
ws+0.126
talk+0.126
Neg logit contributions
 brightly-0.061
ww-0.060
 sudden-0.056
 problem-0.054
ls-0.053
Token to
Token ID284
Feature activation+1.07e-01

Pos logit contributions
 rebuild+0.120
keeping+0.117
race+0.112
talk+0.112
iece+0.111
Neg logit contributions
 brightly-0.115
ww-0.110
 problem-0.110
 sudden-0.105
ls-0.104
Token the
Token ID262
Feature activation+6.48e-02

Pos logit contributions
keeping+0.237
talk+0.222
iece+0.219
race+0.215
 rebuild+0.209
Neg logit contributions
 brightly-0.185
 problem-0.170
 lot-0.164
 sudden-0.162
ls-0.160
Token park
Token ID3952
Feature activation-6.12e-03

Pos logit contributions
 rebuild+0.136
keeping+0.134
talk+0.130
iece+0.128
race+0.128
Neg logit contributions
 brightly-0.109
ww-0.106
 sudden-0.105
 problem-0.105
ls-0.097
Token for
Token ID329
Feature activation+9.08e-02

Pos logit contributions
 brightly+0.011
 problem+0.011
ww+0.011
ls+0.010
 sudden+0.010
Neg logit contributions
 rebuild-0.012
keeping-0.011
iece-0.011
ws-0.011
race-0.011
Token letting
Token ID9616
Feature activation+1.69e-01

Pos logit contributions
talk+0.197
iece+0.188
keeping+0.185
race+0.179
 rebuild+0.176
Neg logit contributions
 sudden-0.145
 brightly-0.143
 problem-0.140
ww-0.138
 beautiful-0.132
Token us
Token ID514
Feature activation+8.99e-02

Pos logit contributions
keeping+0.434
iece+0.422
 rebuild+0.420
talk+0.412
race+0.406
Neg logit contributions
 problem-0.191
ww-0.190
 brightly-0.187
ls-0.184
 lot-0.173
Token play
Token ID711
Feature activation+8.25e-02

Pos logit contributions
keeping+0.121
iece+0.116
talk+0.104
ws+0.102
race+0.100
Neg logit contributions
 brightly-0.206
 problem-0.203
ww-0.198
 beautiful-0.198
 sudden-0.194
Token here
Token ID994
Feature activation+2.92e-02

Pos logit contributions
 rebuild+0.179
keeping+0.177
talk+0.176
race+0.172
iece+0.170
Neg logit contributions
 brightly-0.132
ww-0.120
 problem-0.118
 sudden-0.115
ls-0.114
Token."
Token ID526
Feature activation+4.36e-02

Pos logit contributions
 rebuild+0.052
keeping+0.051
talk+0.050
iece+0.050
race+0.049
Neg logit contributions
 brightly-0.058
ww-0.055
 problem-0.055
 sudden-0.054
ls-0.052
Token\n
Token ID198
Feature activation-2.64e-02

Pos logit contributions
keeping+0.106
 rebuild+0.105
ws+0.104
iece+0.101
ried+0.099
Neg logit contributions
 brightly-0.061
 sudden-0.060
ls-0.055
ww-0.054
 problem-0.051
Token\n
Token ID198
Feature activation-3.64e-02

Pos logit contributions
 brightly+0.052
ww+0.052
 sudden+0.051
 problem+0.050
ls+0.047
Neg logit contributions
 rebuild-0.058
keeping-0.057
talk-0.054
iece-0.054
 find-0.054
Token"
Token ID1
Feature activation+6.88e-02

Pos logit contributions
 brightly+0.065
ww+0.064
 sudden+0.063
 problem+0.062
ls+0.058
Neg logit contributions
 rebuild-0.072
keeping-0.071
talk-0.068
iece-0.068
 find-0.067
TokenThank
Token ID10449
Feature activation+3.27e-02

Pos logit contributions
 rebuild+0.111
keeping+0.109
talk+0.105
iece+0.103
 meet+0.103
Neg logit contributions
 brightly-0.146
ww-0.144
 problem-0.143
 sudden-0.142
ls-0.136
Token you
Token ID345
Feature activation+6.34e-02

Pos logit contributions
 rebuild+0.075
keeping+0.074
talk+0.071
 find+0.071
iece+0.071
Neg logit contributions
 brightly-0.048
ww-0.047
 sudden-0.046
 problem-0.045
ls-0.042
Token for
Token ID329
Feature activation+9.60e-02

Pos logit contributions
 rebuild+0.145
keeping+0.144
 find+0.141
 meet+0.137
talk+0.137
Neg logit contributions
 brightly-0.090
 sudden-0.090
ww-0.090
 problem-0.083
opus-0.080
Token letting
Token ID9616
Feature activation+1.68e-01

Pos logit contributions
 rebuild+0.186
keeping+0.184
talk+0.183
iece+0.179
race+0.175
Neg logit contributions
 brightly-0.173
ww-0.171
 sudden-0.167
 problem-0.165
ls-0.151
Token us
Token ID514
Feature activation+9.90e-02

Pos logit contributions
 rebuild+0.452
keeping+0.443
iece+0.440
talk+0.435
ws+0.428
Neg logit contributions
ww-0.177
 brightly-0.171
 problem-0.165
 sudden-0.157
ls-0.149
Token pet
Token ID4273
Feature activation+8.35e-02

Pos logit contributions
keeping+0.122
iece+0.115
 rebuild+0.113
talk+0.109
ws+0.108
Neg logit contributions
 brightly-0.246
ww-0.244
 problem-0.239
 sudden-0.236
ls-0.226
Token him
Token ID683
Feature activation+6.77e-02

Pos logit contributions
 rebuild+0.148
keeping+0.145
iece+0.138
talk+0.138
 find+0.136
Neg logit contributions
 brightly-0.167
ww-0.165
 sudden-0.160
 problem-0.159
ls-0.152
Token,"
Token ID553
Feature activation+3.83e-02

Pos logit contributions
 rebuild+0.122
keeping+0.121
 find+0.117
talk+0.115
 meet+0.114
Neg logit contributions
 brightly-0.130
 sudden-0.130
ww-0.129
 problem-0.123
opus-0.117
Token Ben
Token ID3932
Feature activation+5.97e-02

Pos logit contributions
 rebuild+0.073
keeping+0.072
talk+0.069
 find+0.069
iece+0.069
Neg logit contributions
 brightly-0.070
ww-0.070
 sudden-0.068
 problem-0.067
ls-0.063
Token as
Token ID355
Feature activation-9.33e-03

Pos logit contributions
keeping+0.032
race+0.032
talk+0.032
ws+0.032
 rebuild+0.032
Neg logit contributions
 brightly-0.025
 problem-0.024
 sudden-0.022
ement-0.021
ww-0.021
Token a
Token ID257
Feature activation-2.79e-02

Pos logit contributions
ww+0.016
 brightly+0.016
 sudden+0.016
 problem+0.016
opus+0.015
Neg logit contributions
 rebuild-0.019
keeping-0.018
 find-0.018
 meet-0.017
 build-0.017
Token thank
Token ID5875
Feature activation+3.63e-02

Pos logit contributions
ww+0.047
 brightly+0.046
ls+0.045
 sudden+0.044
 problem+0.043
Neg logit contributions
 rebuild-0.056
 find-0.055
 meet-0.054
keeping-0.054
 build-0.053
Token you
Token ID345
Feature activation+5.24e-02

Pos logit contributions
 rebuild+0.091
keeping+0.090
talk+0.087
ws+0.086
race+0.085
Neg logit contributions
 brightly-0.049
ww-0.046
 sudden-0.044
 problem-0.043
ls-0.042
Token for
Token ID329
Feature activation+8.54e-02

Pos logit contributions
 rebuild+0.107
keeping+0.103
talk+0.103
race+0.098
ws+0.098
Neg logit contributions
 brightly-0.089
 problem-0.088
ww-0.084
ls-0.081
 sudden-0.079
Token letting
Token ID9616
Feature activation+1.67e-01

Pos logit contributions
talk+0.175
keeping+0.167
iece+0.163
 rebuild+0.162
five+0.161
Neg logit contributions
 brightly-0.147
 problem-0.143
 sudden-0.141
ww-0.139
 lot-0.130
Token it
Token ID340
Feature activation+2.90e-04

Pos logit contributions
keeping+0.394
talk+0.391
iece+0.379
 rebuild+0.378
race+0.374
Neg logit contributions
 brightly-0.214
 problem-0.211
 lot-0.199
ls-0.192
ww-0.191
Token cook
Token ID4255
Feature activation+5.06e-02

Pos logit contributions
keeping+0.000
iece+0.000
talk+0.000
race+0.000
 rebuild+0.000
Neg logit contributions
 brightly-0.001
ww-0.001
 problem-0.001
ls-0.001
 beautiful-0.001
Token for
Token ID329
Feature activation+9.00e-02

Pos logit contributions
 rebuild+0.104
keeping+0.103
talk+0.101
race+0.099
iece+0.099
Neg logit contributions
 brightly-0.087
ww-0.083
 problem-0.081
 sudden-0.080
ls-0.076
Token them
Token ID606
Feature activation+6.72e-02

Pos logit contributions
 rebuild+0.182
talk+0.181
keeping+0.181
race+0.177
iece+0.177
Neg logit contributions
 brightly-0.157
 sudden-0.149
ww-0.147
 problem-0.144
ls-0.134
Token.
Token ID13
Feature activation+3.67e-02

Pos logit contributions
 rebuild+0.122
keeping+0.121
talk+0.119
iece+0.119
race+0.117
Neg logit contributions
 brightly-0.128
ww-0.123
 problem-0.121
 sudden-0.119
ls-0.115
Token tree
Token ID5509
Feature activation-3.31e-02

Pos logit contributions
ww+0.082
 brightly+0.081
 sudden+0.080
ls+0.073
 problem+0.071
Neg logit contributions
 rebuild-0.108
keeping-0.106
 find-0.105
 build-0.103
 meet-0.102
Token and
Token ID290
Feature activation-5.10e-02

Pos logit contributions
 brightly+0.059
ww+0.058
 sudden+0.057
 problem+0.056
ls+0.053
Neg logit contributions
 rebuild-0.065
keeping-0.064
talk-0.061
iece-0.061
 find-0.061
Token thanked
Token ID26280
Feature activation+7.09e-02

Pos logit contributions
 brightly+0.092
ww+0.091
 sudden+0.089
 problem+0.088
ls+0.082
Neg logit contributions
 rebuild-0.099
keeping-0.098
talk-0.094
iece-0.094
race-0.092
Token it
Token ID340
Feature activation-1.19e-02

Pos logit contributions
 rebuild+0.126
keeping+0.120
iece+0.120
talk+0.119
ws+0.119
Neg logit contributions
 brightly-0.144
ww-0.136
 sudden-0.134
 problem-0.132
ls-0.126
Token for
Token ID329
Feature activation+8.35e-02

Pos logit contributions
 brightly+0.021
 problem+0.020
ww+0.019
 sudden+0.019
ls+0.018
Neg logit contributions
keeping-0.024
 rebuild-0.023
talk-0.023
ws-0.022
iece-0.022
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.170
 rebuild+0.161
iece+0.158
ws+0.157
five+0.155
Neg logit contributions
 brightly-0.148
ww-0.135
 problem-0.135
 sudden-0.135
 importantly-0.129
Token them
Token ID606
Feature activation+9.21e-02

Pos logit contributions
 rebuild+0.365
talk+0.354
keeping+0.352
ws+0.347
iece+0.347
Neg logit contributions
 brightly-0.250
ww-0.239
 problem-0.238
ls-0.221
 lot-0.218
Token enjoy
Token ID2883
Feature activation+3.72e-02

Pos logit contributions
keeping+0.118
iece+0.115
talk+0.111
ws+0.109
 rebuild+0.105
Neg logit contributions
 brightly-0.221
 problem-0.207
ww-0.206
 beautiful-0.203
ls-0.203
Token its
Token ID663
Feature activation+1.17e-02

Pos logit contributions
keeping+0.062
 rebuild+0.061
 find+0.058
 build+0.056
 meet+0.056
Neg logit contributions
ww-0.079
 sudden-0.076
 problem-0.075
 brightly-0.075
opus-0.073
Token beauty
Token ID8737
Feature activation-1.28e-02

Pos logit contributions
 rebuild+0.025
 find+0.025
keeping+0.024
 meet+0.024
 build+0.024
Neg logit contributions
ww-0.020
 sudden-0.018
 brightly-0.018
opus-0.018
ls-0.018
Token.
Token ID13
Feature activation+3.14e-02

Pos logit contributions
ww+0.027
 sudden+0.027
 brightly+0.027
 problem+0.026
opus+0.025
Neg logit contributions
keeping-0.021
 rebuild-0.021
 find-0.019
iece-0.019
talk-0.019
Token he
Token ID339
Feature activation-2.08e-02

Pos logit contributions
 brightly+0.103
ww+0.102
 sudden+0.099
 problem+0.098
ls+0.091
Neg logit contributions
 rebuild-0.144
keeping-0.142
talk-0.137
iece-0.136
 find-0.135
Token could
Token ID714
Feature activation+3.11e-02

Pos logit contributions
 brightly+0.036
ww+0.035
 problem+0.034
 sudden+0.034
ls+0.031
Neg logit contributions
 rebuild-0.043
keeping-0.042
talk-0.041
iece-0.040
race-0.040
Token serve
Token ID4691
Feature activation+6.85e-02

Pos logit contributions
keeping+0.040
talk+0.036
iece+0.035
ws+0.034
 rebuild+0.034
Neg logit contributions
 brightly-0.079
 problem-0.076
 sudden-0.073
ww-0.073
 brand-0.070
Token others
Token ID1854
Feature activation+8.32e-02

Pos logit contributions
 rebuild+0.120
keeping+0.119
 find+0.116
 build+0.112
 meet+0.112
Neg logit contributions
ww-0.138
 sudden-0.134
 brightly-0.134
 problem-0.130
ls-0.124
Token by
Token ID416
Feature activation+7.82e-02

Pos logit contributions
 rebuild+0.149
keeping+0.146
 find+0.144
 meet+0.139
 build+0.138
Neg logit contributions
ww-0.163
 sudden-0.162
 brightly-0.158
 problem-0.155
ls-0.148
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
 rebuild+0.145
keeping+0.142
 find+0.136
 build+0.135
iece+0.134
Neg logit contributions
ww-0.152
 brightly-0.149
 sudden-0.145
 problem-0.143
ls-0.137
Token them
Token ID606
Feature activation+8.11e-02

Pos logit contributions
keeping+0.270
 rebuild+0.269
talk+0.263
ws+0.260
iece+0.254
Neg logit contributions
 brightly-0.351
 problem-0.334
ww-0.329
 sudden-0.325
 lot-0.311
Token read
Token ID1100
Feature activation+8.72e-02

Pos logit contributions
keeping+0.110
 rebuild+0.108
talk+0.103
iece+0.102
race+0.100
Neg logit contributions
 brightly-0.194
ww-0.188
 problem-0.185
 sudden-0.183
ls-0.176
Token his
Token ID465
Feature activation+2.60e-02

Pos logit contributions
 rebuild+0.152
keeping+0.149
 find+0.143
iece+0.140
 build+0.140
Neg logit contributions
ww-0.176
 sudden-0.173
 brightly-0.169
 problem-0.167
opus-0.160
Token story
Token ID1621
Feature activation-9.22e-04

Pos logit contributions
 find+0.051
 meet+0.048
 rebuild+0.048
 build+0.048
keeping+0.048
Neg logit contributions
ww-0.047
 sudden-0.046
 brightly-0.045
ls-0.043
 problem-0.041
Token.
Token ID13
Feature activation+3.33e-02

Pos logit contributions
 brightly+0.002
ww+0.002
 sudden+0.002
 problem+0.002
ls+0.002
Neg logit contributions
 rebuild-0.002
keeping-0.002
talk-0.001
 find-0.001
iece-0.001
Token,
Token ID11
Feature activation-7.14e-02

Pos logit contributions
 rebuild+0.048
keeping+0.047
iece+0.046
 find+0.046
 build+0.046
Neg logit contributions
 sudden-0.024
 brightly-0.024
ww-0.024
 problem-0.022
opus-0.021
Token "
Token ID366
Feature activation+9.70e-03

Pos logit contributions
 brightly+0.117
ww+0.117
 sudden+0.113
 problem+0.112
ls+0.105
Neg logit contributions
 rebuild-0.150
keeping-0.147
iece-0.142
 find-0.142
 meet-0.141
TokenThank
Token ID10449
Feature activation+2.01e-02

Pos logit contributions
 rebuild+0.018
keeping+0.017
 find+0.017
ws+0.016
iece+0.016
Neg logit contributions
 brightly-0.020
ww-0.019
 sudden-0.019
 problem-0.017
opus-0.017
Token you
Token ID345
Feature activation+5.77e-02

Pos logit contributions
 rebuild+0.049
keeping+0.049
talk+0.047
iece+0.047
 find+0.046
Neg logit contributions
 brightly-0.027
ww-0.027
 sudden-0.025
 problem-0.025
ls-0.023
Token for
Token ID329
Feature activation+8.76e-02

Pos logit contributions
 rebuild+0.133
 find+0.130
keeping+0.128
 meet+0.127
 build+0.127
Neg logit contributions
 sudden-0.081
 brightly-0.081
ww-0.080
 problem-0.073
ls-0.070
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.190
keeping+0.179
iece+0.176
 rebuild+0.172
ws+0.171
Neg logit contributions
ww-0.143
 problem-0.140
 brightly-0.139
 sudden-0.139
 lot-0.126
Token me
Token ID502
Feature activation+7.01e-02

Pos logit contributions
keeping+0.483
 rebuild+0.478
iece+0.470
talk+0.469
race+0.455
Neg logit contributions
ww-0.142
 problem-0.137
 brightly-0.130
 sudden-0.119
ls-0.116
Token borrow
Token ID8804
Feature activation+1.05e-01

Pos logit contributions
keeping+0.072
 rebuild+0.067
iece+0.066
talk+0.062
ws+0.060
Neg logit contributions
 brightly-0.191
ww-0.189
 problem-0.187
 sudden-0.185
ls-0.178
Token your
Token ID534
Feature activation+4.70e-02

Pos logit contributions
 rebuild+0.197
keeping+0.193
 find+0.189
 meet+0.185
talk+0.184
Neg logit contributions
 brightly-0.193
ww-0.193
 sudden-0.192
 problem-0.184
opus-0.176
Token microscope
Token ID36396
Feature activation-1.68e-02

Pos logit contributions
 find+0.096
 rebuild+0.092
keeping+0.091
 meet+0.091
 save+0.089
Neg logit contributions
ww-0.083
 brightly-0.081
 sudden-0.077
very-0.073
 importantly-0.073
Token,
Token ID11
Feature activation-6.60e-02

Pos logit contributions
 sudden+0.037
ww+0.036
 brightly+0.035
 problem+0.034
opus+0.033
Neg logit contributions
keeping-0.027
 rebuild-0.027
 find-0.026
 build-0.025
talk-0.025
Token,
Token ID11
Feature activation-7.86e-02

Pos logit contributions
 rebuild+0.040
keeping+0.038
iece+0.038
 find+0.038
 build+0.037
Neg logit contributions
ww-0.020
 brightly-0.020
 sudden-0.020
 problem-0.019
opus-0.017
Token "
Token ID366
Feature activation+5.89e-03

Pos logit contributions
ww+0.138
 brightly+0.136
 problem+0.132
 sudden+0.131
ls+0.125
Neg logit contributions
 rebuild-0.157
keeping-0.151
 find-0.149
 meet-0.148
iece-0.148
TokenThank
Token ID10449
Feature activation+1.89e-02

Pos logit contributions
 rebuild+0.011
keeping+0.011
 find+0.010
iece+0.010
talk+0.010
Neg logit contributions
 brightly-0.012
ww-0.011
 sudden-0.011
 problem-0.011
ls-0.010
Token you
Token ID345
Feature activation+5.74e-02

Pos logit contributions
 rebuild+0.046
keeping+0.046
iece+0.044
talk+0.044
 find+0.044
Neg logit contributions
 brightly-0.025
ww-0.025
 sudden-0.024
 problem-0.023
ls-0.022
Token for
Token ID329
Feature activation+8.75e-02

Pos logit contributions
 rebuild+0.133
keeping+0.130
 find+0.128
 meet+0.126
 build+0.126
Neg logit contributions
 brightly-0.081
 sudden-0.080
ww-0.080
 problem-0.075
ls-0.071
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.188
keeping+0.183
iece+0.181
 rebuild+0.177
ws+0.174
Neg logit contributions
ww-0.144
 brightly-0.140
 sudden-0.139
 problem-0.137
 lot-0.124
Token us
Token ID514
Feature activation+8.86e-02

Pos logit contributions
keeping+0.449
iece+0.445
 rebuild+0.438
talk+0.437
ws+0.430
Neg logit contributions
ww-0.167
 problem-0.162
 brightly-0.161
 sudden-0.151
 lot-0.144
Token know
Token ID760
Feature activation+2.63e-02

Pos logit contributions
keeping+0.099
iece+0.091
 rebuild+0.088
talk+0.087
ws+0.085
Neg logit contributions
 brightly-0.236
ww-0.231
 problem-0.229
 sudden-0.226
ls-0.216
Token.
Token ID13
Feature activation+5.23e-02

Pos logit contributions
 rebuild+0.047
keeping+0.045
 find+0.044
talk+0.043
iece+0.043
Neg logit contributions
 brightly-0.052
ww-0.052
 sudden-0.051
 problem-0.050
ls-0.047
Token Maybe
Token ID6674
Feature activation-1.66e-02

Pos logit contributions
 rebuild+0.097
keeping+0.096
iece+0.092
talk+0.092
 find+0.090
Neg logit contributions
 brightly-0.099
ww-0.099
 sudden-0.094
 problem-0.092
opus-0.088
Token one
Token ID530
Feature activation-2.50e-02

Pos logit contributions
 brightly+0.026
ww+0.026
 sudden+0.025
 problem+0.025
ls+0.023
Neg logit contributions
 rebuild-0.036
keeping-0.036
talk-0.034
iece-0.034
 find-0.034
Token had
Token ID550
Feature activation+1.28e-02

Pos logit contributions
 rebuild+0.001
keeping+0.001
 find+0.001
talk+0.001
 meet+0.001
Neg logit contributions
 brightly-0.002
ww-0.002
 sudden-0.002
 problem-0.002
ls-0.002
Token learned
Token ID4499
Feature activation+4.24e-02

Pos logit contributions
 rebuild+0.021
keeping+0.020
iece+0.020
talk+0.019
race+0.019
Neg logit contributions
 brightly-0.027
ww-0.027
 sudden-0.027
 problem-0.027
ls-0.025
Token that
Token ID326
Feature activation+1.76e-02

Pos logit contributions
 rebuild+0.075
keeping+0.073
 find+0.073
talk+0.070
 fix+0.070
Neg logit contributions
 sudden-0.085
ww-0.083
 brightly-0.079
opus-0.078
 problem-0.078
Token love
Token ID1842
Feature activation+3.07e-03

Pos logit contributions
 rebuild+0.032
 find+0.031
keeping+0.031
 meet+0.030
 build+0.030
Neg logit contributions
ww-0.034
 sudden-0.034
 brightly-0.032
opus-0.031
uddle-0.031
Token means
Token ID1724
Feature activation-2.32e-02

Pos logit contributions
 rebuild+0.006
keeping+0.006
 find+0.006
 meet+0.005
 build+0.005
Neg logit contributions
 sudden-0.006
ww-0.006
 brightly-0.006
 problem-0.005
opus-0.005
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
ww+0.046
opus+0.044
uddle+0.044
 sudden+0.043
 brightly+0.042
Neg logit contributions
 rebuild-0.041
 find-0.041
 make-0.040
 join-0.040
 permission-0.039
Token go
Token ID467
Feature activation+7.54e-02

Pos logit contributions
keeping+0.315
talk+0.301
 rebuild+0.301
iece+0.297
five+0.296
Neg logit contributions
 brightly-0.306
 problem-0.301
ww-0.291
 lot-0.280
ls-0.277
Token sometimes
Token ID3360
Feature activation-2.38e-03

Pos logit contributions
 rebuild+0.155
keeping+0.152
ws+0.150
iece+0.149
talk+0.147
Neg logit contributions
 brightly-0.125
 problem-0.122
ww-0.119
 sudden-0.116
ls-0.116
Token.
Token ID13
Feature activation+5.24e-02

Pos logit contributions
 brightly+0.004
 problem+0.004
ww+0.004
 sudden+0.004
ls+0.003
Neg logit contributions
keeping-0.005
 rebuild-0.005
talk-0.005
ws-0.005
iece-0.005
Token She
Token ID1375
Feature activation+6.92e-03

Pos logit contributions
 rebuild+0.098
keeping+0.096
talk+0.092
iece+0.092
 find+0.090
Neg logit contributions
 brightly-0.100
ww-0.098
 sudden-0.096
 problem-0.094
ls-0.089
Token had
Token ID550
Feature activation+1.72e-02

Pos logit contributions
 rebuild+0.011
keeping+0.011
 find+0.010
talk+0.010
 meet+0.010
Neg logit contributions
ww-0.015
 brightly-0.015
 sudden-0.015
 problem-0.014
ls-0.014
Token.
Token ID13
Feature activation+5.22e-02

Pos logit contributions
 sudden+0.083
 brightly+0.078
ww+0.076
 importantly+0.074
 problem+0.073
Neg logit contributions
keeping-0.071
 rebuild-0.069
 find-0.067
talk-0.067
 build-0.067
Token And
Token ID843
Feature activation-5.16e-02

Pos logit contributions
 rebuild+0.105
keeping+0.103
iece+0.100
talk+0.100
 find+0.097
Neg logit contributions
 brightly-0.093
ww-0.092
 sudden-0.085
 problem-0.083
ummy-0.081
Token thank
Token ID5875
Feature activation+4.70e-02

Pos logit contributions
 brightly+0.090
 sudden+0.090
ww+0.090
 problem+0.087
ls+0.083
Neg logit contributions
 rebuild-0.102
keeping-0.099
 find-0.098
 meet-0.097
 build-0.096
Token you
Token ID345
Feature activation+6.34e-02

Pos logit contributions
 rebuild+0.113
keeping+0.113
iece+0.108
talk+0.108
ws+0.106
Neg logit contributions
 brightly-0.064
ww-0.064
 sudden-0.061
 problem-0.060
ls-0.056
Token for
Token ID329
Feature activation+9.05e-02

Pos logit contributions
 rebuild+0.128
keeping+0.125
iece+0.120
talk+0.120
race+0.119
Neg logit contributions
 brightly-0.112
ww-0.111
 problem-0.108
 sudden-0.105
ls-0.100
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.193
iece+0.186
keeping+0.179
 rebuild+0.178
ws+0.174
Neg logit contributions
ww-0.150
 brightly-0.145
 problem-0.142
 sudden-0.141
 beautiful-0.130
Token us
Token ID514
Feature activation+9.20e-02

Pos logit contributions
keeping+0.464
 rebuild+0.462
iece+0.460
talk+0.449
ws+0.437
Neg logit contributions
ww-0.149
 problem-0.147
 brightly-0.138
 lot-0.125
 sudden-0.125
Token play
Token ID711
Feature activation+8.17e-02

Pos logit contributions
keeping+0.115
iece+0.113
 rebuild+0.098
talk+0.096
race+0.096
Neg logit contributions
 brightly-0.228
ww-0.224
 problem-0.220
 sudden-0.213
 lot-0.212
Token with
Token ID351
Feature activation+6.10e-02

Pos logit contributions
 rebuild+0.161
keeping+0.159
talk+0.153
iece+0.152
race+0.151
Neg logit contributions
 brightly-0.146
ww-0.144
 sudden-0.140
 problem-0.138
ls-0.130
Token Max
Token ID5436
Feature activation+7.57e-02

Pos logit contributions
 rebuild+0.124
keeping+0.120
 find+0.118
 meet+0.116
 build+0.116
Neg logit contributions
ww-0.105
 brightly-0.104
 sudden-0.102
 problem-0.100
ls-0.092
Token.
Token ID13
Feature activation+4.28e-02

Pos logit contributions
 rebuild+0.133
keeping+0.130
 find+0.124
talk+0.123
iece+0.123
Neg logit contributions
 brightly-0.151
ww-0.151
 sudden-0.148
 problem-0.143
ls-0.136
Token different
Token ID1180
Feature activation-6.63e-02

Pos logit contributions
ww+0.113
 brightly+0.112
 sudden+0.112
 problem+0.108
ls+0.103
Neg logit contributions
 rebuild-0.108
keeping-0.107
 find-0.103
 meet-0.101
iece-0.101
Token.
Token ID13
Feature activation+4.79e-02

Pos logit contributions
 brightly+0.135
ww+0.134
 sudden+0.131
 problem+0.129
ls+0.122
Neg logit contributions
 rebuild-0.114
keeping-0.113
talk-0.107
iece-0.106
 find-0.106
Token Thank
Token ID6952
Feature activation+3.90e-02

Pos logit contributions
keeping+0.087
 rebuild+0.086
iece+0.081
race+0.079
talk+0.079
Neg logit contributions
 brightly-0.096
ww-0.092
 sudden-0.086
 problem-0.085
 importantly-0.083
Token you
Token ID345
Feature activation+5.50e-02

Pos logit contributions
 rebuild+0.096
keeping+0.094
iece+0.091
talk+0.091
ws+0.090
Neg logit contributions
 brightly-0.052
ww-0.052
 sudden-0.049
 problem-0.048
ls-0.044
Token for
Token ID329
Feature activation+8.63e-02

Pos logit contributions
 rebuild+0.112
keeping+0.110
talk+0.105
iece+0.105
 find+0.105
Neg logit contributions
 brightly-0.095
ww-0.094
 sudden-0.092
 problem-0.090
ls-0.085
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.177
keeping+0.170
iece+0.170
 rebuild+0.168
race+0.163
Neg logit contributions
 sudden-0.145
 brightly-0.145
ww-0.143
 problem-0.143
 lot-0.129
Token me
Token ID502
Feature activation+7.21e-02

Pos logit contributions
keeping+0.460
 rebuild+0.458
iece+0.447
talk+0.444
race+0.435
Neg logit contributions
ww-0.154
 problem-0.152
 brightly-0.148
ls-0.134
 lot-0.127
Token play
Token ID711
Feature activation+7.94e-02

Pos logit contributions
keeping+0.082
iece+0.076
 rebuild+0.073
talk+0.069
race+0.067
Neg logit contributions
 brightly-0.190
ww-0.185
 problem-0.184
 sudden-0.182
ls-0.174
Token with
Token ID351
Feature activation+6.12e-02

Pos logit contributions
 rebuild+0.162
keeping+0.161
talk+0.155
iece+0.153
race+0.153
Neg logit contributions
 brightly-0.138
ww-0.132
 problem-0.128
 sudden-0.127
ls-0.120
Token it
Token ID340
Feature activation-6.03e-03

Pos logit contributions
 rebuild+0.135
keeping+0.133
 find+0.129
 meet+0.127
 build+0.126
Neg logit contributions
ww-0.095
 brightly-0.094
 sudden-0.091
 problem-0.090
ls-0.083
Token."
Token ID526
Feature activation+4.75e-02

Pos logit contributions
 brightly+0.013
ww+0.013
 sudden+0.013
 problem+0.013
ls+0.012
Neg logit contributions
 rebuild-0.009
keeping-0.009
 find-0.009
talk-0.009
iece-0.009
Token thank
Token ID5875
Feature activation+5.53e-02

Pos logit contributions
 rebuild+0.110
 meet+0.106
 find+0.106
keeping+0.106
iece+0.106
Neg logit contributions
 sudden-0.106
ww-0.105
 brightly-0.104
 problem-0.100
ls-0.096
Token you
Token ID345
Feature activation+6.13e-02

Pos logit contributions
 rebuild+0.133
keeping+0.131
talk+0.127
iece+0.126
ws+0.125
Neg logit contributions
 brightly-0.076
ww-0.075
 sudden-0.072
 problem-0.070
ls-0.066
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
 rebuild+0.124
keeping+0.121
talk+0.117
ws+0.115
iece+0.115
Neg logit contributions
 brightly-0.107
ww-0.105
 problem-0.102
 sudden-0.101
ls-0.095
Token Lily
Token ID20037
Feature activation+6.78e-02

Pos logit contributions
talk+0.249
keeping+0.247
iece+0.243
 rebuild+0.232
ws+0.232
Neg logit contributions
 brightly-0.159
 lot-0.151
 problem-0.148
 beautiful-0.142
 sudden-0.142
Token for
Token ID329
Feature activation+8.92e-02

Pos logit contributions
 rebuild+0.157
keeping+0.155
talk+0.153
ws+0.149
iece+0.147
Neg logit contributions
 problem-0.092
 brightly-0.091
ww-0.089
ls-0.082
ummy-0.082
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.185
keeping+0.175
 rebuild+0.174
iece+0.174
ws+0.171
Neg logit contributions
 brightly-0.152
 sudden-0.147
 problem-0.145
ww-0.145
 importantly-0.136
Token it
Token ID340
Feature activation+6.92e-04

Pos logit contributions
 rebuild+0.355
talk+0.349
ws+0.348
keeping+0.348
iece+0.342
Neg logit contributions
 brightly-0.256
 problem-0.246
ww-0.239
 lot-0.238
ls-0.230
Token go
Token ID467
Feature activation+7.38e-02

Pos logit contributions
keeping+0.001
iece+0.001
talk+0.001
 rebuild+0.001
ws+0.001
Neg logit contributions
 brightly-0.002
ww-0.002
 problem-0.002
ls-0.002
 sudden-0.002
Token.
Token ID13
Feature activation+4.26e-02

Pos logit contributions
 rebuild+0.142
keeping+0.138
ws+0.137
iece+0.136
talk+0.134
Neg logit contributions
 brightly-0.134
ww-0.131
 problem-0.127
 sudden-0.126
ls-0.125
Token From
Token ID3574
Feature activation+2.62e-03

Pos logit contributions
 rebuild+0.073
keeping+0.073
 find+0.070
 meet+0.069
iece+0.068
Neg logit contributions
 brightly-0.086
ww-0.086
 problem-0.083
 sudden-0.083
opus-0.078
Token that
Token ID326
Feature activation+2.33e-02

Pos logit contributions
 rebuild+0.005
 find+0.005
keeping+0.005
 meet+0.004
iece+0.004
Neg logit contributions
 sudden-0.005
 problem-0.005
ww-0.005
 brightly-0.005
opus-0.004
Token
Token ID198
Feature activation-2.96e-02

Pos logit contributions
 brightly+0.043
 sudden+0.043
ww+0.043
 problem+0.041
ls+0.040
Neg logit contributions
keeping-0.061
 rebuild-0.061
iece-0.059
talk-0.058
race-0.057
Token"
Token ID1
Feature activation+5.74e-02

Pos logit contributions
 brightly+0.050
ww+0.050
 sudden+0.049
 problem+0.048
ls+0.045
Neg logit contributions
 rebuild-0.061
keeping-0.061
talk-0.058
iece-0.058
 find-0.057
TokenThank
Token ID10449
Feature activation+2.13e-02

Pos logit contributions
 rebuild+0.104
keeping+0.103
 find+0.097
iece+0.097
talk+0.096
Neg logit contributions
 brightly-0.115
ww-0.114
 sudden-0.111
 problem-0.106
ls-0.101
Token you
Token ID345
Feature activation+6.00e-02

Pos logit contributions
 rebuild+0.053
keeping+0.052
iece+0.050
talk+0.050
 find+0.049
Neg logit contributions
ww-0.028
 brightly-0.028
 sudden-0.026
 problem-0.026
ls-0.024
Token for
Token ID329
Feature activation+9.00e-02

Pos logit contributions
 rebuild+0.144
keeping+0.142
 find+0.140
 meet+0.138
talk+0.137
Neg logit contributions
 brightly-0.079
 sudden-0.078
ww-0.077
 problem-0.072
ls-0.068
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.192
iece+0.183
keeping+0.181
 rebuild+0.179
five+0.175
Neg logit contributions
ww-0.154
 sudden-0.144
 problem-0.144
 brightly-0.144
 cute-0.132
Token us
Token ID514
Feature activation+8.94e-02

Pos logit contributions
 rebuild+0.496
keeping+0.495
iece+0.492
talk+0.478
ws+0.468
Neg logit contributions
ww-0.126
 problem-0.119
 brightly-0.107
 sudden-0.098
 lot-0.097
Token pet
Token ID4273
Feature activation+7.71e-02

Pos logit contributions
keeping+0.113
iece+0.111
 rebuild+0.106
talk+0.100
ws+0.096
Neg logit contributions
ww-0.222
 brightly-0.222
 problem-0.217
 sudden-0.213
ls-0.208
Token your
Token ID534
Feature activation+4.09e-02

Pos logit contributions
 rebuild+0.124
keeping+0.121
iece+0.115
talk+0.115
 find+0.113
Neg logit contributions
 brightly-0.166
ww-0.165
 sudden-0.160
 problem-0.159
ls-0.154
Token dog
Token ID3290
Feature activation+6.16e-02

Pos logit contributions
keeping+0.073
 rebuild+0.073
 find+0.073
 meet+0.073
 greet+0.069
Neg logit contributions
ww-0.078
 brightly-0.078
 sudden-0.076
 problem-0.073
opus-0.071
Token,"
Token ID553
Feature activation+2.99e-02

Pos logit contributions
 rebuild+0.140
keeping+0.138
 find+0.134
talk+0.133
 meet+0.133
Neg logit contributions
 brightly-0.091
ww-0.089
 sudden-0.088
 problem-0.085
ls-0.078
Token,
Token ID11
Feature activation-7.43e-02

Pos logit contributions
 rebuild+0.041
keeping+0.040
 find+0.038
iece+0.038
 meet+0.038
Neg logit contributions
ww-0.046
 brightly-0.046
 sudden-0.046
 problem-0.045
ls-0.042
Token "
Token ID366
Feature activation+9.84e-03

Pos logit contributions
ww+0.134
 problem+0.130
 brightly+0.130
 sudden+0.129
ls+0.124
Neg logit contributions
 rebuild-0.143
keeping-0.139
 find-0.139
 meet-0.138
iece-0.134
TokenThank
Token ID10449
Feature activation+2.46e-02

Pos logit contributions
 rebuild+0.019
keeping+0.018
ws+0.018
 build+0.018
 find+0.017
Neg logit contributions
ww-0.020
 brightly-0.019
 sudden-0.019
opus-0.017
very-0.016
Token you
Token ID345
Feature activation+6.04e-02

Pos logit contributions
 rebuild+0.060
keeping+0.059
talk+0.057
iece+0.057
 find+0.057
Neg logit contributions
 brightly-0.033
ww-0.033
 sudden-0.031
 problem-0.030
ls-0.028
Token for
Token ID329
Feature activation+9.24e-02

Pos logit contributions
 rebuild+0.117
keeping+0.115
 find+0.114
 meet+0.112
talk+0.110
Neg logit contributions
 sudden-0.107
 brightly-0.105
ww-0.103
 problem-0.100
ls-0.098
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.183
 rebuild+0.178
iece+0.178
keeping+0.178
ws+0.169
Neg logit contributions
ww-0.166
 brightly-0.160
 sudden-0.158
 problem-0.156
 brand-0.142
Token me
Token ID502
Feature activation+7.61e-02

Pos logit contributions
 rebuild+0.468
keeping+0.465
iece+0.456
talk+0.456
ws+0.446
Neg logit contributions
ww-0.155
 brightly-0.146
 problem-0.142
 sudden-0.136
ls-0.120
Token fly
Token ID6129
Feature activation+4.06e-02

Pos logit contributions
keeping+0.084
iece+0.082
 rebuild+0.081
talk+0.076
ws+0.076
Neg logit contributions
 brightly-0.199
ww-0.197
 problem-0.192
 sudden-0.192
 brand-0.182
Token your
Token ID534
Feature activation+4.15e-02

Pos logit contributions
 rebuild+0.075
keeping+0.074
 find+0.071
talk+0.070
 meet+0.070
Neg logit contributions
 brightly-0.076
ww-0.076
 sudden-0.075
 problem-0.074
ls-0.069
Token k
Token ID479
Feature activation-1.20e-03

Pos logit contributions
 find+0.088
keeping+0.087
 rebuild+0.087
 meet+0.085
 build+0.083
Neg logit contributions
ww-0.067
 brightly-0.065
 sudden-0.062
opus-0.059
ls-0.059
Tokenite
Token ID578
Feature activation+2.26e-02

Pos logit contributions
 brightly+0.002
ww+0.002
 sudden+0.002
 problem+0.002
ls+0.002
Neg logit contributions
 rebuild-0.003
keeping-0.003
talk-0.003
race-0.003
 find-0.002
Token,
Token ID11
Feature activation-7.60e-02

Pos logit contributions
 rebuild+0.048
iece+0.047
 find+0.046
five+0.046
keeping+0.046
Neg logit contributions
 sudden-0.030
 brightly-0.028
ww-0.028
 problem-0.027
opus-0.025
Token "
Token ID366
Feature activation+7.09e-03

Pos logit contributions
ww+0.138
 brightly+0.135
 problem+0.134
 sudden+0.134
ls+0.126
Neg logit contributions
 rebuild-0.146
keeping-0.142
 find-0.141
 meet-0.140
 build-0.138
TokenThank
Token ID10449
Feature activation+1.94e-02

Pos logit contributions
 rebuild+0.014
keeping+0.013
 find+0.013
ws+0.013
 build+0.013
Neg logit contributions
ww-0.014
 brightly-0.014
 sudden-0.013
very-0.011
opus-0.011
Token you
Token ID345
Feature activation+5.79e-02

Pos logit contributions
 rebuild+0.050
keeping+0.049
ws+0.047
talk+0.047
iece+0.047
Neg logit contributions
ww-0.025
 brightly-0.024
 sudden-0.022
 problem-0.022
ls-0.021
Token for
Token ID329
Feature activation+8.88e-02

Pos logit contributions
 rebuild+0.109
keeping+0.106
 find+0.105
iece+0.103
 build+0.103
Neg logit contributions
 brightly-0.107
 sudden-0.106
ww-0.105
 problem-0.100
ls-0.097
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.184
iece+0.176
keeping+0.174
 rebuild+0.171
ws+0.166
Neg logit contributions
ww-0.156
 sudden-0.147
 brightly-0.146
 problem-0.146
 brand-0.135
Token me
Token ID502
Feature activation+6.94e-02

Pos logit contributions
keeping+0.491
 rebuild+0.481
talk+0.480
iece+0.475
five+0.460
Neg logit contributions
ww-0.126
 problem-0.124
 brightly-0.110
 lot-0.100
 sudden-0.097
Token borrow
Token ID8804
Feature activation+1.05e-01

Pos logit contributions
keeping+0.083
iece+0.081
 rebuild+0.075
ws+0.073
talk+0.070
Neg logit contributions
ww-0.176
 brightly-0.176
 problem-0.175
 sudden-0.169
 brand-0.166
Token your
Token ID534
Feature activation+4.48e-02

Pos logit contributions
 rebuild+0.222
keeping+0.218
 find+0.210
talk+0.209
iece+0.208
Neg logit contributions
 brightly-0.173
ww-0.172
 sudden-0.168
 problem-0.163
ls-0.154
Token toy
Token ID13373
Feature activation-1.46e-02

Pos logit contributions
 rebuild+0.094
 find+0.093
 build+0.090
keeping+0.090
 meet+0.089
Neg logit contributions
ww-0.077
 brightly-0.074
 sudden-0.071
ls-0.067
opus-0.066
Token car
Token ID1097
Feature activation+2.29e-02

Pos logit contributions
 brightly+0.026
 sudden+0.026
ww+0.026
 problem+0.024
ls+0.023
Neg logit contributions
 rebuild-0.029
keeping-0.029
 find-0.028
 build-0.028
talk-0.027
TokenOkay
Token ID16454
Feature activation+2.77e-03

Pos logit contributions
 rebuild+0.009
keeping+0.009
talk+0.009
iece+0.009
 find+0.009
Neg logit contributions
 brightly-0.012
ww-0.012
 sudden-0.012
 problem-0.012
ls-0.011
Token,
Token ID11
Feature activation-4.74e-02

Pos logit contributions
 rebuild+0.007
 find+0.007
keeping+0.007
 build+0.006
 meet+0.006
Neg logit contributions
 brightly-0.004
 sudden-0.004
ww-0.004
 problem-0.003
ls-0.003
Token thank
Token ID5875
Feature activation+4.89e-02

Pos logit contributions
 brightly+0.083
ww+0.083
 sudden+0.081
 problem+0.079
ls+0.075
Neg logit contributions
 rebuild-0.095
keeping-0.093
talk-0.089
 find-0.089
iece-0.089
Token you
Token ID345
Feature activation+5.92e-02

Pos logit contributions
 rebuild+0.128
keeping+0.127
ws+0.123
iece+0.123
talk+0.122
Neg logit contributions
ww-0.059
 brightly-0.056
 sudden-0.053
 problem-0.050
opus-0.048
Token for
Token ID329
Feature activation+8.85e-02

Pos logit contributions
 rebuild+0.134
keeping+0.132
talk+0.128
iece+0.127
ws+0.126
Neg logit contributions
 brightly-0.089
ww-0.088
 problem-0.085
 sudden-0.084
ls-0.078
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.186
iece+0.180
keeping+0.177
ws+0.172
five+0.171
Neg logit contributions
ww-0.149
 sudden-0.147
 problem-0.144
 brightly-0.141
 brand-0.132
Token me
Token ID502
Feature activation+7.15e-02

Pos logit contributions
keeping+0.511
iece+0.507
talk+0.504
 rebuild+0.499
ws+0.488
Neg logit contributions
ww-0.108
 problem-0.101
 brightly-0.092
 lot-0.082
 sudden-0.079
Token play
Token ID711
Feature activation+7.84e-02

Pos logit contributions
keeping+0.087
iece+0.082
 rebuild+0.075
ws+0.073
talk+0.073
Neg logit contributions
ww-0.179
 brightly-0.179
 problem-0.179
 sudden-0.175
 brand-0.168
Token with
Token ID351
Feature activation+5.86e-02

Pos logit contributions
keeping+0.161
 rebuild+0.160
talk+0.156
iece+0.154
race+0.153
Neg logit contributions
 brightly-0.134
ww-0.128
 problem-0.125
 sudden-0.125
ls-0.116
Token it
Token ID340
Feature activation-1.21e-02

Pos logit contributions
 rebuild+0.129
keeping+0.125
 find+0.124
 build+0.121
 meet+0.120
Neg logit contributions
ww-0.092
 brightly-0.092
 sudden-0.089
 problem-0.088
ls-0.081
Token!"
Token ID2474
Feature activation+3.89e-02

Pos logit contributions
 sudden+0.027
 brightly+0.027
ww+0.026
 problem+0.025
opus+0.024
Neg logit contributions
 rebuild-0.019
keeping-0.018
 find-0.018
 build-0.017
iece-0.017
Token it
Token ID340
Feature activation-1.73e-02

Pos logit contributions
ww+0.009
 brightly+0.009
 sudden+0.009
 problem+0.008
ls+0.008
Neg logit contributions
 rebuild-0.009
keeping-0.009
 meet-0.009
 find-0.008
 build-0.008
Token.
Token ID13
Feature activation+4.67e-02

Pos logit contributions
 sudden+0.039
ww+0.039
 brightly+0.038
 problem+0.037
ls+0.035
Neg logit contributions
 rebuild-0.026
 find-0.024
keeping-0.024
 build-0.024
 meet-0.024
Token Thank
Token ID6952
Feature activation+4.47e-02

Pos logit contributions
 rebuild+0.102
keeping+0.100
talk+0.096
iece+0.095
 find+0.093
Neg logit contributions
ww-0.077
 brightly-0.077
 importantly-0.067
 sudden-0.065
ummy-0.065
Token you
Token ID345
Feature activation+5.38e-02

Pos logit contributions
 rebuild+0.112
keeping+0.110
iece+0.106
talk+0.106
ws+0.105
Neg logit contributions
ww-0.058
 brightly-0.058
 sudden-0.054
 problem-0.053
ls-0.049
Token for
Token ID329
Feature activation+8.90e-02

Pos logit contributions
 rebuild+0.100
keeping+0.099
 find+0.097
 meet+0.096
iece+0.095
Neg logit contributions
 brightly-0.099
 sudden-0.099
ww-0.098
 problem-0.094
ls-0.089
Token letting
Token ID9616
Feature activation+1.66e-01

Pos logit contributions
talk+0.197
iece+0.186
keeping+0.183
 rebuild+0.180
ws+0.177
Neg logit contributions
ww-0.140
 brightly-0.137
 sudden-0.136
 problem-0.135
 beautiful-0.124
Token me
Token ID502
Feature activation+7.32e-02

Pos logit contributions
 rebuild+0.499
keeping+0.491
iece+0.481
talk+0.477
race+0.467
Neg logit contributions
ww-0.131
 problem-0.121
 brightly-0.115
 sudden-0.100
ls-0.099
Token paint
Token ID7521
Feature activation+4.74e-02

Pos logit contributions
keeping+0.098
iece+0.088
 rebuild+0.088
talk+0.085
ws+0.084
Neg logit contributions
 brightly-0.179
 problem-0.176
ww-0.175
 sudden-0.173
 beautiful-0.164
Token your
Token ID534
Feature activation+4.51e-02

Pos logit contributions
 rebuild+0.091
keeping+0.089
iece+0.086
 find+0.085
talk+0.085
Neg logit contributions
ww-0.086
 brightly-0.086
 sudden-0.086
 problem-0.083
ls-0.078
Token car
Token ID1097
Feature activation+2.35e-02

Pos logit contributions
 rebuild+0.096
keeping+0.093
 find+0.090
talk+0.090
iece+0.090
Neg logit contributions
 brightly-0.074
ww-0.074
 sudden-0.072
 problem-0.069
ls-0.066
Token.
Token ID13
Feature activation+4.89e-02

Pos logit contributions
keeping+0.043
 rebuild+0.043
race+0.041
iece+0.041
talk+0.041
Neg logit contributions
 brightly-0.044
ww-0.043
 problem-0.043
 sudden-0.040
ls-0.040
Token Julie
Token ID21946
Feature activation+1.10e-02

Pos logit contributions
 rebuild+0.053
ws+0.051
iece+0.050
keeping+0.050
talk+0.050
Neg logit contributions
 brightly-0.045
ww-0.043
 sudden-0.041
 problem-0.041
ls-0.040
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
 rebuild+0.021
keeping+0.021
race+0.020
talk+0.020
ws+0.020
Neg logit contributions
 brightly-0.020
 sudden-0.019
 problem-0.019
ww-0.019
ls-0.018
Token She
Token ID1375
Feature activation-2.28e-02

Pos logit contributions
 rebuild+0.039
keeping+0.038
talk+0.036
ws+0.036
iece+0.036
Neg logit contributions
 brightly-0.038
ww-0.037
 sudden-0.036
 problem-0.035
ls-0.033
Token was
Token ID373
Feature activation-7.89e-02

Pos logit contributions
ww+0.045
 problem+0.042
 brightly+0.042
 sudden+0.042
ls+0.041
Neg logit contributions
 find-0.040
 rebuild-0.040
 meet-0.039
keeping-0.038
 build-0.038
Token a
Token ID257
Feature activation-7.04e-02

Pos logit contributions
ww+0.132
ls+0.122
 brightly+0.119
opus+0.118
 problem+0.118
Neg logit contributions
 rebuild-0.169
 find-0.168
 meet-0.166
keeping-0.162
 build-0.162
Token very
Token ID845
Feature activation-2.21e-01

Pos logit contributions
ls+0.126
ement+0.112
ittens+0.109
uddle+0.104
ww+0.103
Neg logit contributions
 find-0.155
 meet-0.145
 greet-0.140
 fix-0.140
 join-0.139
Token polite
Token ID23507
Feature activation-6.15e-02

Pos logit contributions
ww+0.346
 sudden+0.333
ement+0.320
opus+0.318
ls+0.314
Neg logit contributions
 find-0.501
 meet-0.465
 greet-0.454
 build-0.446
 buy-0.445
Token and
Token ID290
Feature activation-7.02e-02

Pos logit contributions
ww+0.105
 sudden+0.102
 brightly+0.102
 problem+0.097
ls+0.096
Neg logit contributions
keeping-0.123
 rebuild-0.123
 find-0.122
 meet-0.118
talk-0.116
Token kind
Token ID1611
Feature activation-2.47e-02

Pos logit contributions
ww+0.113
 sudden+0.109
 problem+0.108
ls+0.106
opus+0.105
Neg logit contributions
 rebuild-0.147
 find-0.144
 meet-0.143
 build-0.141
keeping-0.141
Token girl
Token ID2576
Feature activation-1.06e-03

Pos logit contributions
 brightly+0.040
ww+0.039
 sudden+0.038
 problem+0.038
ls+0.035
Neg logit contributions
 rebuild-0.053
keeping-0.052
talk-0.050
iece-0.050
 find-0.050
Token who
Token ID508
Feature activation+8.99e-03

Pos logit contributions
 brightly+0.002
ww+0.002
 sudden+0.002
 problem+0.002
ls+0.002
Neg logit contributions
 rebuild-0.002
keeping-0.002
iece-0.002
talk-0.002
ws-0.002
Token Gina
Token ID43002
Feature activation+5.64e-02

Pos logit contributions
 rebuild+0.134
keeping+0.130
iece+0.127
talk+0.126
ws+0.125
Neg logit contributions
 brightly-0.110
ww-0.106
 sudden-0.104
 problem-0.100
ls-0.097
Token.
Token ID13
Feature activation+2.55e-02

Pos logit contributions
 rebuild+0.118
ws+0.112
keeping+0.111
talk+0.111
race+0.111
Neg logit contributions
 brightly-0.095
 sudden-0.089
ww-0.088
ls-0.085
 problem-0.083
Token Gina
Token ID43002
Feature activation+3.45e-02

Pos logit contributions
 rebuild+0.045
keeping+0.045
 meet+0.043
 find+0.043
iece+0.042
Neg logit contributions
ww-0.049
 brightly-0.049
 problem-0.048
 sudden-0.048
ls-0.046
Token was
Token ID373
Feature activation-7.62e-02

Pos logit contributions
 rebuild+0.072
keeping+0.071
 find+0.068
talk+0.068
iece+0.068
Neg logit contributions
 brightly-0.057
ww-0.057
 sudden-0.055
 problem-0.054
ls-0.051
Token a
Token ID257
Feature activation-6.96e-02

Pos logit contributions
ww+0.128
ls+0.121
opus+0.118
 brightly+0.117
 problem+0.117
Neg logit contributions
 find-0.161
 meet-0.160
 rebuild-0.159
 build-0.155
 join-0.152
Token very
Token ID845
Feature activation-2.21e-01

Pos logit contributions
ls+0.124
ement+0.117
ittens+0.108
uddle+0.104
mmm+0.104
Neg logit contributions
 find-0.152
 meet-0.141
 join-0.135
 greet-0.133
 make-0.132
Token compassionate
Token ID32533
Feature activation-5.39e-02

Pos logit contributions
 sudden+0.344
ww+0.337
ement+0.327
 brightly+0.317
opus+0.314
Neg logit contributions
 find-0.492
 meet-0.465
 greet-0.448
 share-0.445
 build-0.444
Token all
Token ID477
Feature activation-1.65e-02

Pos logit contributions
 sudden+0.092
ww+0.091
 brightly+0.088
ls+0.085
 problem+0.084
Neg logit contributions
 find-0.107
keeping-0.107
 rebuild-0.106
 meet-0.102
 build-0.101
Tokenigator
Token ID23823
Feature activation-4.18e-02

Pos logit contributions
 brightly+0.029
ww+0.029
 sudden+0.029
 problem+0.028
ls+0.027
Neg logit contributions
 rebuild-0.033
keeping-0.032
 find-0.031
talk-0.030
iece-0.030
Token,
Token ID11
Feature activation-9.38e-02

Pos logit contributions
 brightly+0.081
ww+0.081
 sudden+0.079
 problem+0.078
ls+0.073
Neg logit contributions
 rebuild-0.075
keeping-0.074
 find-0.071
talk-0.070
iece-0.070
Token and
Token ID290
Feature activation-5.65e-02

Pos logit contributions
ww+0.163
 problem+0.157
 brightly+0.154
 sudden+0.152
ls+0.149
Neg logit contributions
 rebuild-0.190
keeping-0.185
 find-0.184
 meet-0.181
 build-0.180
Token Julie
Token ID21946
Feature activation-1.10e-02

Pos logit contributions
 brightly+0.063
ww+0.062
 sudden+0.061
 problem+0.060
ls+0.056
Neg logit contributions
 rebuild-0.069
keeping-0.068
talk-0.064
iece-0.064
 find-0.064
Token.
Token ID13
Feature activation+2.08e-02

Pos logit contributions
 brightly+0.019
ww+0.018
 sudden+0.018
 problem+0.018
ls+0.017
Neg logit contributions
 rebuild-0.022
keeping-0.022
race-0.021
talk-0.021
iece-0.020
Token She
Token ID1375
Feature activation-1.75e-02

Pos logit contributions
 rebuild+0.038
keeping+0.037
talk+0.035
iece+0.035
ws+0.035
Neg logit contributions
 brightly-0.040
ww-0.040
 sudden-0.039
 problem-0.038
ls-0.036
Token was
Token ID373
Feature activation-7.98e-02

Pos logit contributions
ww+0.032
 brightly+0.032
 sudden+0.031
 problem+0.031
ls+0.029
Neg logit contributions
 rebuild-0.033
keeping-0.032
 find-0.031
 meet-0.031
talk-0.031
Token a
Token ID257
Feature activation-6.99e-02

Pos logit contributions
ww+0.118
 brightly+0.110
ls+0.107
 problem+0.105
 sudden+0.104
Neg logit contributions
 rebuild-0.186
 find-0.181
 meet-0.178
keeping-0.178
 build-0.177
Token very
Token ID845
Feature activation-2.20e-01

Pos logit contributions
ls+0.123
ement+0.108
ww+0.104
ittens+0.104
ummy+0.103
Neg logit contributions
 find-0.151
 meet-0.141
 fix-0.140
 save-0.138
 join-0.138
Token brave
Token ID14802
Feature activation-9.40e-02

Pos logit contributions
ww+0.330
opus+0.306
 sudden+0.304
 brightly+0.302
ement+0.300
Neg logit contributions
 find-0.498
 meet-0.467
 rebuild-0.461
 build-0.459
 greet-0.456
Token and
Token ID290
Feature activation-7.22e-02

Pos logit contributions
ww+0.160
 brightly+0.156
 sudden+0.154
ls+0.147
 problem+0.146
Neg logit contributions
 rebuild-0.188
 find-0.185
keeping-0.184
 meet-0.178
talk-0.177
Token adventurous
Token ID39770
Feature activation-8.80e-02

Pos logit contributions
ww+0.101
 brightly+0.097
 sudden+0.097
 problem+0.096
ls+0.092
Neg logit contributions
 rebuild-0.166
 find-0.162
keeping-0.161
 meet-0.160
 build-0.159
Token girl
Token ID2576
Feature activation-3.11e-03

Pos logit contributions
ww+0.150
 brightly+0.147
 sudden+0.146
 problem+0.138
ls+0.138
Neg logit contributions
 rebuild-0.175
keeping-0.174
 find-0.171
 meet-0.167
talk-0.166
Token who
Token ID508
Feature activation+7.34e-03

Pos logit contributions
 brightly+0.006
ww+0.005
 sudden+0.005
 problem+0.005
ls+0.005
Neg logit contributions
 rebuild-0.006
keeping-0.006
talk-0.006
ws-0.006
iece-0.006
Token Alex
Token ID4422
Feature activation+1.44e-02

Pos logit contributions
 rebuild+0.098
iece+0.094
keeping+0.094
talk+0.092
ws+0.092
Neg logit contributions
 brightly-0.072
ww-0.071
 sudden-0.067
ls-0.064
 problem-0.064
Token.
Token ID13
Feature activation+2.98e-02

Pos logit contributions
 rebuild+0.026
keeping+0.026
talk+0.024
iece+0.024
race+0.024
Neg logit contributions
 brightly-0.028
ww-0.027
 sudden-0.027
 problem-0.027
ls-0.025
Token Alex
Token ID4422
Feature activation+5.87e-04

Pos logit contributions
 rebuild+0.053
keeping+0.051
 meet+0.050
 find+0.050
iece+0.049
Neg logit contributions
ww-0.059
 brightly-0.058
 problem-0.057
 sudden-0.057
ls-0.054
Token was
Token ID373
Feature activation-7.36e-02

Pos logit contributions
 rebuild+0.001
 find+0.001
 meet+0.001
keeping+0.001
 build+0.001
Neg logit contributions
ww-0.001
 problem-0.001
 brightly-0.001
 sudden-0.001
ls-0.001
Token a
Token ID257
Feature activation-6.83e-02

Pos logit contributions
ww+0.121
 brightly+0.115
 problem+0.112
ls+0.112
 sudden+0.111
Neg logit contributions
 rebuild-0.157
 find-0.155
 meet-0.153
 build-0.151
keeping-0.148
Token very
Token ID845
Feature activation-2.19e-01

Pos logit contributions
ls+0.117
ement+0.112
ummy+0.103
uddle+0.098
ittens+0.097
Neg logit contributions
 find-0.151
 meet-0.143
 join-0.136
 make-0.135
 save-0.134
Token obedient
Token ID41652
Feature activation-5.86e-02

Pos logit contributions
 sudden+0.319
ww+0.312
ement+0.303
 brightly+0.303
ummy+0.287
Neg logit contributions
 find-0.507
 meet-0.484
 build-0.467
 rebuild-0.463
 greet-0.462
Token sw
Token ID1509
Feature activation-7.59e-02

Pos logit contributions
 brightly+0.097
ww+0.096
 sudden+0.094
 problem+0.091
ls+0.087
Neg logit contributions
 rebuild-0.122
keeping-0.121
 find-0.116
talk-0.115
 meet-0.114
Tokenan
Token ID272
Feature activation+5.21e-03

Pos logit contributions
 brightly+0.151
ww+0.150
 sudden+0.146
 problem+0.145
ls+0.138
Neg logit contributions
 rebuild-0.134
keeping-0.132
iece-0.125
talk-0.125
 find-0.124
Token and
Token ID290
Feature activation-5.64e-02

Pos logit contributions
 rebuild+0.009
keeping+0.009
 find+0.009
talk+0.009
iece+0.009
Neg logit contributions
 brightly-0.010
ww-0.010
 sudden-0.010
 problem-0.010
ls-0.009
Token he
Token ID339
Feature activation-3.06e-02

Pos logit contributions
ww+0.104
 brightly+0.103
 sudden+0.101
 problem+0.101
ls+0.096
Neg logit contributions
 rebuild-0.105
 find-0.102
 meet-0.101
keeping-0.101
 build-0.100
Token sight
Token ID6504
Feature activation-9.09e-02

Pos logit contributions
 rebuild+0.181
keeping+0.178
 find+0.172
 build+0.169
ws+0.168
Neg logit contributions
ww-0.090
 brightly-0.081
opus-0.080
 sudden-0.078
 problem-0.078
Token and
Token ID290
Feature activation-6.10e-02

Pos logit contributions
ww+0.185
 sudden+0.177
 brightly+0.176
 problem+0.173
opus+0.165
Neg logit contributions
 rebuild-0.158
keeping-0.151
 find-0.147
iece-0.145
 build-0.143
Token both
Token ID1111
Feature activation-3.57e-02

Pos logit contributions
ww+0.107
 brightly+0.106
 problem+0.102
 sudden+0.102
ls+0.096
Neg logit contributions
 rebuild-0.122
keeping-0.119
 find-0.116
iece-0.114
 meet-0.114
Token felt
Token ID2936
Feature activation-5.56e-02

Pos logit contributions
 brightly+0.064
ww+0.063
 sudden+0.062
 problem+0.061
ls+0.057
Neg logit contributions
 rebuild-0.070
keeping-0.070
talk-0.067
iece-0.066
 find-0.066
Token a
Token ID257
Feature activation-7.09e-02

Pos logit contributions
ww+0.079
 brightly+0.077
 problem+0.075
 sudden+0.075
ls+0.070
Neg logit contributions
 rebuild-0.131
keeping-0.126
 find-0.124
iece-0.123
 meet-0.123
Token very
Token ID845
Feature activation-2.19e-01

Pos logit contributions
ls+0.101
ww+0.098
 problem+0.091
opus+0.089
ummy+0.086
Neg logit contributions
 rebuild-0.162
 find-0.160
 save-0.154
 meet-0.153
 make-0.152
Token peaceful
Token ID12309
Feature activation-1.06e-01

Pos logit contributions
ww+0.327
 brightly+0.305
 sudden+0.299
 problem+0.292
opus+0.289
Neg logit contributions
 find-0.499
 rebuild-0.494
 meet-0.472
 build-0.470
 make-0.461
Token feeling
Token ID4203
Feature activation-5.92e-02

Pos logit contributions
ww+0.185
 brightly+0.183
 sudden+0.177
 problem+0.174
ls+0.170
Neg logit contributions
 rebuild-0.211
keeping-0.208
 find-0.201
iece-0.198
 meet-0.198
Token inside
Token ID2641
Feature activation+3.34e-02

Pos logit contributions
 brightly+0.107
ww+0.104
 sudden+0.103
 problem+0.100
ls+0.095
Neg logit contributions
 rebuild-0.116
keeping-0.116
talk-0.112
ws-0.110
race-0.109
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
 rebuild+0.060
keeping+0.059
talk+0.057
iece+0.056
race+0.055
Neg logit contributions
 brightly-0.066
ww-0.064
 sudden-0.064
 problem-0.063
ls-0.059
Token\n
Token ID198
Feature activation-3.47e-02

Pos logit contributions
 rebuild+0.041
 find+0.040
 meet+0.040
keeping+0.039
race+0.039
Neg logit contributions
ww-0.055
 problem-0.055
 brightly-0.055
 sudden-0.052
opus-0.050
Token.
Token ID13
Feature activation+2.59e-02

Pos logit contributions
 problem+0.071
ww+0.070
 sudden+0.065
 brightly+0.064
uddle+0.063
Neg logit contributions
 find-0.052
 rebuild-0.051
 meet-0.051
iece-0.050
keeping-0.050
Token Tw
Token ID1815
Feature activation-4.07e-02

Pos logit contributions
 rebuild+0.048
 meet+0.048
 find+0.047
keeping+0.047
iece+0.047
Neg logit contributions
ww-0.047
 sudden-0.047
 problem-0.046
 brightly-0.046
ls-0.043
Tokeninkle
Token ID19894
Feature activation-3.66e-02

Pos logit contributions
 brightly+0.046
ww+0.046
 sudden+0.045
 problem+0.041
 brand+0.038
Neg logit contributions
 rebuild-0.105
 find-0.104
 meet-0.102
ws-0.102
iece-0.102
Token was
Token ID373
Feature activation-7.35e-02

Pos logit contributions
 problem+0.073
ww+0.072
 brightly+0.067
 brand+0.066
 sudden+0.066
Neg logit contributions
 find-0.058
 rebuild-0.058
 meet-0.057
iece-0.057
 make-0.056
Token a
Token ID257
Feature activation-6.27e-02

Pos logit contributions
ww+0.124
 problem+0.122
 sudden+0.120
ls+0.120
 brightly+0.119
Neg logit contributions
 rebuild-0.145
 find-0.144
 meet-0.144
keeping-0.141
 build-0.141
Token very
Token ID845
Feature activation-2.19e-01

Pos logit contributions
ls+0.113
ittens+0.095
ement+0.093
uddle+0.087
ummy+0.085
Neg logit contributions
 find-0.140
 make-0.131
 meet-0.128
 fix-0.126
 build-0.124
Token obedient
Token ID41652
Feature activation-5.57e-02

Pos logit contributions
 sudden+0.306
ww+0.288
ement+0.277
ls+0.274
opus+0.267
Neg logit contributions
 find-0.529
 meet-0.506
 build-0.483
 greet-0.481
 make-0.480
Token star
Token ID3491
Feature activation-6.90e-02

Pos logit contributions
ww+0.091
 brightly+0.091
 sudden+0.090
 problem+0.086
ls+0.083
Neg logit contributions
 rebuild-0.116
keeping-0.115
 find-0.112
 meet-0.109
talk-0.109
Token.
Token ID13
Feature activation+2.71e-02

Pos logit contributions
ww+0.145
 brightly+0.144
 sudden+0.144
 problem+0.139
ls+0.133
Neg logit contributions
 rebuild-0.111
keeping-0.109
 find-0.105
 build-0.103
 meet-0.103
Token Every
Token ID3887
Feature activation-5.79e-03

Pos logit contributions
 rebuild+0.046
 meet+0.045
 find+0.044
keeping+0.044
talk+0.044
Neg logit contributions
ww-0.055
 sudden-0.054
 brightly-0.053
 problem-0.053
ls-0.051
Token night
Token ID1755
Feature activation-7.08e-02

Pos logit contributions
ls+0.011
 sudden+0.011
ww+0.011
 brightly+0.011
uddle+0.011
Neg logit contributions
 find-0.010
 meet-0.010
 build-0.009
 rebuild-0.009
 make-0.009
Token Molly
Token ID30236
Feature activation+1.37e-02

Pos logit contributions
 rebuild+0.085
ws+0.083
talk+0.082
iece+0.081
five+0.079
Neg logit contributions
ww-0.055
 brightly-0.055
 sudden-0.052
 problem-0.052
ummy-0.049
Token.
Token ID13
Feature activation+2.60e-02

Pos logit contributions
 rebuild+0.029
keeping+0.028
ws+0.028
iece+0.027
talk+0.027
Neg logit contributions
 brightly-0.023
 sudden-0.022
 problem-0.021
ww-0.021
ls-0.020
Token She
Token ID1375
Feature activation-2.03e-02

Pos logit contributions
 rebuild+0.043
keeping+0.042
 find+0.042
 meet+0.041
iece+0.040
Neg logit contributions
ww-0.053
 brightly-0.053
 problem-0.052
 sudden-0.052
ls-0.049
Token was
Token ID373
Feature activation-7.30e-02

Pos logit contributions
ww+0.043
 problem+0.042
ls+0.042
 brightly+0.041
 sudden+0.040
Neg logit contributions
 find-0.033
 meet-0.032
 rebuild-0.032
 build-0.031
 buy-0.030
Token a
Token ID257
Feature activation-6.70e-02

Pos logit contributions
ww+0.137
ls+0.133
opus+0.128
 brightly+0.126
 problem+0.125
Neg logit contributions
 find-0.141
 rebuild-0.139
 meet-0.138
 build-0.136
race-0.132
Token very
Token ID845
Feature activation-2.19e-01

Pos logit contributions
ls+0.123
ement+0.114
izing+0.101
ww+0.100
uddle+0.100
Neg logit contributions
 find-0.141
 fix-0.129
 meet-0.128
 make-0.125
 save-0.125
Token intelligent
Token ID12661
Feature activation-7.86e-02

Pos logit contributions
 sudden+0.339
ww+0.326
ement+0.322
 brightly+0.319
opus+0.299
Neg logit contributions
 find-0.497
 meet-0.465
 buy-0.450
 build-0.448
 share-0.445
Token girl
Token ID2576
Feature activation-7.54e-03

Pos logit contributions
 sudden+0.142
ww+0.142
 brightly+0.139
ls+0.132
 problem+0.129
Neg logit contributions
 find-0.147
 rebuild-0.147
keeping-0.144
 meet-0.139
 build-0.139
Token.
Token ID13
Feature activation+2.69e-02

Pos logit contributions
 brightly+0.014
ww+0.014
 sudden+0.014
 problem+0.014
ls+0.013
Neg logit contributions
 rebuild-0.014
keeping-0.014
talk-0.013
iece-0.013
ws-0.013
Token Every
Token ID3887
Feature activation-5.74e-03

Pos logit contributions
 rebuild+0.044
 find+0.043
 meet+0.043
keeping+0.042
 build+0.041
Neg logit contributions
ww-0.056
 brightly-0.055
 problem-0.054
 sudden-0.054
ls-0.051
Token day
Token ID1110
Feature activation-7.07e-02

Pos logit contributions
 sudden+0.011
 brightly+0.010
ls+0.010
ww+0.010
 brand+0.010
Neg logit contributions
 meet-0.010
 find-0.010
race-0.010
keeping-0.010
 rebuild-0.010
Token it
Token ID340
Feature activation-3.08e-02

Pos logit contributions
 rebuild+0.024
keeping+0.024
iece+0.023
 find+0.023
talk+0.023
Neg logit contributions
ww-0.022
 brightly-0.021
 sudden-0.021
 problem-0.021
ls-0.019
Token!
Token ID0
Feature activation+1.64e-02

Pos logit contributions
ww+0.054
 brightly+0.051
 sudden+0.051
 problem+0.050
ls+0.047
Neg logit contributions
 rebuild-0.063
 find-0.060
iece-0.060
keeping-0.060
 build-0.059
Token It
Token ID632
Feature activation-4.62e-02

Pos logit contributions
 rebuild+0.031
keeping+0.030
talk+0.029
iece+0.029
 find+0.029
Neg logit contributions
 brightly-0.031
ww-0.030
 sudden-0.030
 problem-0.029
ls-0.028
Token was
Token ID373
Feature activation-7.38e-02

Pos logit contributions
ww+0.088
 problem+0.081
 sudden+0.081
 brightly+0.080
opus+0.078
Neg logit contributions
 rebuild-0.086
 find-0.085
 build-0.082
 meet-0.082
keeping-0.081
Token a
Token ID257
Feature activation-6.52e-02

Pos logit contributions
ww+0.133
opus+0.126
ls+0.125
uddle+0.120
 problem+0.118
Neg logit contributions
 find-0.147
 rebuild-0.146
 build-0.143
 meet-0.143
keeping-0.142
Token very
Token ID845
Feature activation-2.19e-01

Pos logit contributions
ls+0.102
ement+0.090
ww+0.086
ittens+0.085
 importantly+0.080
Neg logit contributions
 find-0.151
 meet-0.143
 build-0.139
 fix-0.137
 save-0.137
Token beautiful
Token ID4950
Feature activation-1.32e-01

Pos logit contributions
ww+0.321
opus+0.297
 sudden+0.290
ls+0.280
ement+0.280
Neg logit contributions
 find-0.533
 meet-0.491
 build-0.484
 greet-0.474
 buy-0.474
Token butterfly
Token ID35113
Feature activation-4.25e-02

Pos logit contributions
ww+0.231
ls+0.227
 sudden+0.223
 importantly+0.207
opus+0.205
Neg logit contributions
 find-0.263
keeping-0.250
 meet-0.246
 build-0.246
 rebuild-0.243
Token fl
Token ID781
Feature activation-1.31e-01

Pos logit contributions
ww+0.080
 sudden+0.080
 brightly+0.078
 problem+0.077
ls+0.071
Neg logit contributions
keeping-0.076
 find-0.075
 rebuild-0.075
 meet-0.073
iece-0.072
Tokenuttering
Token ID33598
Feature activation-1.51e-02

Pos logit contributions
ww+0.195
 brightly+0.192
 sudden+0.190
 problem+0.189
ls+0.169
Neg logit contributions
 rebuild-0.294
keeping-0.291
iece-0.283
talk-0.281
 find-0.280
Token around
Token ID1088
Feature activation+2.94e-02

Pos logit contributions
ww+0.031
 brightly+0.030
 sudden+0.030
 problem+0.030
ls+0.028
Neg logit contributions
 rebuild-0.025
keeping-0.025
iece-0.024
talk-0.024
 find-0.024
Token Tim
Token ID5045
Feature activation+5.52e-02

Pos logit contributions
 rebuild+0.090
iece+0.089
keeping+0.086
ws+0.086
talk+0.086
Neg logit contributions
 brightly-0.069
ww-0.069
 sudden-0.065
 problem-0.062
ls-0.061
Token.
Token ID13
Feature activation+2.95e-02

Pos logit contributions
keeping+0.105
 rebuild+0.098
five+0.096
iece+0.096
talk+0.096
Neg logit contributions
 problem-0.100
 sudden-0.099
 brightly-0.097
ww-0.093
 cute-0.090
Token Tim
Token ID5045
Feature activation+4.31e-02

Pos logit contributions
 rebuild+0.051
keeping+0.050
 find+0.047
 meet+0.047
 build+0.047
Neg logit contributions
 brightly-0.059
ww-0.059
 sudden-0.058
 problem-0.057
ls-0.055
Token was
Token ID373
Feature activation-7.03e-02

Pos logit contributions
 rebuild+0.087
keeping+0.085
talk+0.082
iece+0.082
 find+0.081
Neg logit contributions
 brightly-0.075
ww-0.074
 sudden-0.073
 problem-0.071
ls-0.067
Token a
Token ID257
Feature activation-6.67e-02

Pos logit contributions
ww+0.122
 brightly+0.117
ls+0.117
opus+0.113
 problem+0.113
Neg logit contributions
 rebuild-0.143
 find-0.142
 build-0.140
 meet-0.139
race-0.134
Token very
Token ID845
Feature activation-2.18e-01

Pos logit contributions
ls+0.121
ement+0.115
ummy+0.098
uddle+0.097
izing+0.097
Neg logit contributions
 find-0.145
 fix-0.136
 meet-0.134
 save-0.133
 build-0.132
Token good
Token ID922
Feature activation-1.10e-01

Pos logit contributions
 sudden+0.325
ement+0.320
 brightly+0.320
ww+0.306
ls+0.285
Neg logit contributions
 find-0.491
 meet-0.467
 build-0.460
 rebuild-0.451
 share-0.450
Token baby
Token ID5156
Feature activation-4.10e-02

Pos logit contributions
 brightly+0.196
ww+0.196
 sudden+0.194
ls+0.181
 problem+0.179
Neg logit contributions
 rebuild-0.216
 find-0.212
keeping-0.206
 build-0.203
talk-0.203
Token.
Token ID13
Feature activation+2.62e-02

Pos logit contributions
ww+0.087
 brightly+0.087
 sudden+0.085
 problem+0.083
ls+0.079
Neg logit contributions
 rebuild-0.066
keeping-0.066
 find-0.063
talk-0.061
 build-0.061
Token He
Token ID679
Feature activation-1.74e-02

Pos logit contributions
 rebuild+0.049
keeping+0.048
talk+0.046
iece+0.046
 find+0.046
Neg logit contributions
 brightly-0.049
ww-0.049
 sudden-0.048
 problem-0.047
ls-0.044
Token loved
Token ID6151
Feature activation-2.13e-02

Pos logit contributions
ww+0.034
 brightly+0.034
 sudden+0.033
 problem+0.033
ls+0.031
Neg logit contributions
 rebuild-0.031
 find-0.030
 meet-0.029
 build-0.029
keeping-0.029
Token Blue
Token ID4518
Feature activation+3.18e-02

Pos logit contributions
 rebuild+0.070
keeping+0.068
talk+0.066
iece+0.065
 find+0.065
Neg logit contributions
 brightly-0.064
ww-0.064
 sudden-0.062
 problem-0.061
ls-0.057
Token.
Token ID13
Feature activation+3.03e-02

Pos logit contributions
 rebuild+0.047
keeping+0.046
 find+0.046
 meet+0.044
iece+0.043
Neg logit contributions
ww-0.072
 brightly-0.070
 sudden-0.069
 problem-0.068
ls-0.065
Token Blue
Token ID4518
Feature activation+2.56e-02

Pos logit contributions
 rebuild+0.051
keeping+0.050
 meet+0.050
 find+0.049
 build+0.048
Neg logit contributions
ww-0.061
 problem-0.060
 sudden-0.060
 brightly-0.060
ls-0.056
Token was
Token ID373
Feature activation-7.07e-02

Pos logit contributions
 find+0.041
 rebuild+0.040
 meet+0.039
 build+0.038
 make+0.037
Neg logit contributions
ww-0.056
 brightly-0.053
 problem-0.053
 sudden-0.053
ummy-0.050
Token a
Token ID257
Feature activation-6.58e-02

Pos logit contributions
ww+0.126
 sudden+0.124
opus+0.122
uddle+0.121
 brightly+0.121
Neg logit contributions
 find-0.136
 meet-0.134
 rebuild-0.131
 build-0.130
 greet-0.125
Token very
Token ID845
Feature activation-2.18e-01

Pos logit contributions
ement+0.115
ls+0.114
izing+0.100
ised+0.098
ths+0.097
Neg logit contributions
 find-0.147
 meet-0.142
 make-0.132
 fix-0.131
 build-0.129
Token interesting
Token ID3499
Feature activation-1.10e-01

Pos logit contributions
 sudden+0.334
ement+0.309
 brightly+0.305
ww+0.290
ths+0.287
Neg logit contributions
 find-0.502
 meet-0.486
 share-0.468
 build-0.454
 buy-0.452
Token toy
Token ID13373
Feature activation-5.12e-02

Pos logit contributions
 sudden+0.197
 brightly+0.188
ww+0.187
ls+0.180
 importantly+0.173
Neg logit contributions
keeping-0.218
 find-0.218
 meet-0.208
 rebuild-0.206
 build-0.205
Token.
Token ID13
Feature activation+2.77e-02

Pos logit contributions
 sudden+0.105
ww+0.105
 brightly+0.103
 problem+0.098
ls+0.094
Neg logit contributions
 rebuild-0.084
keeping-0.083
 find-0.083
 build-0.081
 meet-0.080
Token He
Token ID679
Feature activation-1.76e-02

Pos logit contributions
 rebuild+0.047
keeping+0.046
 meet+0.045
 find+0.045
 build+0.044
Neg logit contributions
ww-0.056
 brightly-0.055
 problem-0.055
 sudden-0.055
ls-0.052
Token could
Token ID714
Feature activation+1.88e-02

Pos logit contributions
ww+0.037
 brightly+0.035
 problem+0.035
 sudden+0.035
ummy+0.033
Neg logit contributions
 rebuild-0.030
 find-0.030
 meet-0.029
 build-0.028
keeping-0.028
Token Tim
Token ID5045
Feature activation+5.50e-02

Pos logit contributions
 rebuild+0.089
keeping+0.086
iece+0.084
ws+0.082
talk+0.082
Neg logit contributions
 brightly-0.100
ww-0.098
 sudden-0.095
 problem-0.094
ls-0.091
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
keeping+0.109
 rebuild+0.105
talk+0.102
race+0.100
five+0.100
Neg logit contributions
 brightly-0.095
 sudden-0.094
 problem-0.093
ww-0.089
 lot-0.086
Token Tim
Token ID5045
Feature activation+4.21e-02

Pos logit contributions
 rebuild+0.055
keeping+0.054
 meet+0.051
 find+0.051
 build+0.051
Neg logit contributions
 brightly-0.066
ww-0.066
 sudden-0.065
 problem-0.064
ls-0.061
Token was
Token ID373
Feature activation-6.99e-02

Pos logit contributions
 rebuild+0.087
keeping+0.086
talk+0.082
iece+0.082
 find+0.081
Neg logit contributions
 brightly-0.071
ww-0.070
 sudden-0.069
 problem-0.067
ls-0.063
Token a
Token ID257
Feature activation-6.74e-02

Pos logit contributions
ww+0.118
 brightly+0.113
ls+0.111
 sudden+0.107
opus+0.107
Neg logit contributions
 rebuild-0.148
 find-0.147
 build-0.145
 meet-0.144
race-0.138
Token very
Token ID845
Feature activation-2.18e-01

Pos logit contributions
ls+0.113
ement+0.111
ummy+0.099
 importantly+0.097
ww+0.096
Neg logit contributions
 find-0.148
 meet-0.140
 build-0.139
 fix-0.137
 save-0.136
Token tidy
Token ID43044
Feature activation-2.01e-02

Pos logit contributions
 sudden+0.311
 brightly+0.308
ement+0.299
ww+0.299
opus+0.270
Neg logit contributions
 find-0.505
 meet-0.481
 build-0.480
 rebuild-0.472
 share-0.462
Token kid
Token ID5141
Feature activation+1.10e-02

Pos logit contributions
ww+0.037
 brightly+0.037
 sudden+0.036
 problem+0.034
ls+0.034
Neg logit contributions
 rebuild-0.039
keeping-0.038
 find-0.037
 build-0.037
talk-0.036
Token.
Token ID13
Feature activation+2.80e-02

Pos logit contributions
 rebuild+0.019
keeping+0.019
talk+0.018
iece+0.018
 find+0.018
Neg logit contributions
 brightly-0.022
ww-0.022
 sudden-0.021
 problem-0.021
ls-0.020
Token He
Token ID679
Feature activation-1.75e-02

Pos logit contributions
 rebuild+0.051
keeping+0.051
talk+0.048
iece+0.048
 find+0.047
Neg logit contributions
 brightly-0.054
ww-0.053
 sudden-0.052
 problem-0.051
ls-0.048
Token always
Token ID1464
Feature activation-2.72e-02

Pos logit contributions
ww+0.033
 brightly+0.033
 sudden+0.032
 problem+0.031
ls+0.030
Neg logit contributions
 rebuild-0.033
keeping-0.032
 find-0.032
 build-0.031
talk-0.031
Token Jack
Token ID3619
Feature activation+2.84e-02

Pos logit contributions
keeping+0.089
 rebuild+0.087
iece+0.086
talk+0.084
race+0.083
Neg logit contributions
 brightly-0.069
ww-0.068
 sudden-0.066
 problem-0.064
ls-0.063
Token.
Token ID13
Feature activation+2.96e-02

Pos logit contributions
keeping+0.063
 rebuild+0.061
race+0.061
five+0.059
iece+0.059
Neg logit contributions
 brightly-0.045
 sudden-0.042
 problem-0.041
ww-0.041
 cute-0.038
Token He
Token ID679
Feature activation-2.12e-02

Pos logit contributions
 rebuild+0.059
keeping+0.059
talk+0.056
iece+0.055
 find+0.055
Neg logit contributions
 brightly-0.053
ww-0.051
 sudden-0.050
 problem-0.049
ls-0.046
Token was
Token ID373
Feature activation-7.32e-02

Pos logit contributions
ww+0.041
 problem+0.041
 brightly+0.041
ls+0.039
 sudden+0.039
Neg logit contributions
 rebuild-0.037
 find-0.037
 meet-0.036
 build-0.036
keeping-0.036
Token a
Token ID257
Feature activation-6.46e-02

Pos logit contributions
ww+0.128
 brightly+0.122
ls+0.121
 problem+0.120
 sudden+0.117
Neg logit contributions
 rebuild-0.147
 find-0.146
 build-0.144
 meet-0.142
keeping-0.138
Token very
Token ID845
Feature activation-2.17e-01

Pos logit contributions
ls+0.108
ement+0.103
ww+0.092
izing+0.088
 problem+0.088
Neg logit contributions
 find-0.144
 fix-0.136
 build-0.133
 meet-0.132
 save-0.131
Token compassionate
Token ID32533
Feature activation-5.78e-02

Pos logit contributions
 sudden+0.330
ement+0.320
ww+0.313
 brightly+0.313
izing+0.293
Neg logit contributions
 find-0.494
 build-0.470
 meet-0.469
 buy-0.455
 share-0.452
Token boy
Token ID2933
Feature activation+1.04e-02

Pos logit contributions
ww+0.100
 sudden+0.099
 brightly+0.099
 problem+0.093
ls+0.093
Neg logit contributions
 rebuild-0.115
keeping-0.113
 find-0.112
talk-0.108
 build-0.108
Token,
Token ID11
Feature activation-9.19e-02

Pos logit contributions
 rebuild+0.017
keeping+0.017
talk+0.016
iece+0.016
 find+0.016
Neg logit contributions
 brightly-0.022
ww-0.021
 sudden-0.021
 problem-0.021
ls-0.020
Token who
Token ID508
Feature activation+4.21e-03

Pos logit contributions
ww+0.211
 problem+0.206
 brightly+0.206
 sudden+0.202
ls+0.199
Neg logit contributions
 rebuild-0.137
 find-0.131
 build-0.128
keeping-0.127
 meet-0.126
Token always
Token ID1464
Feature activation-3.24e-02

Pos logit contributions
 rebuild+0.008
keeping+0.008
 find+0.008
talk+0.008
 meet+0.008
Neg logit contributions
ww-0.008
 brightly-0.008
 sudden-0.007
 problem-0.007
ls-0.007
Token Spot
Token ID15899
Feature activation+6.81e-02

Pos logit contributions
 rebuild+0.065
keeping+0.062
iece+0.061
talk+0.061
ws+0.060
Neg logit contributions
 brightly-0.061
ww-0.061
 sudden-0.059
 problem-0.057
ls-0.055
Token.
Token ID13
Feature activation+2.52e-02

Pos logit contributions
 rebuild+0.133
keeping+0.131
iece+0.127
talk+0.125
race+0.125
Neg logit contributions
 brightly-0.123
ww-0.119
 sudden-0.117
 problem-0.117
ls-0.112
Token Spot
Token ID15899
Feature activation+5.53e-02

Pos logit contributions
 rebuild+0.038
keeping+0.037
 meet+0.037
 find+0.036
race+0.036
Neg logit contributions
 brightly-0.055
ww-0.054
 sudden-0.054
 problem-0.054
ls-0.051
Token was
Token ID373
Feature activation-7.27e-02

Pos logit contributions
 find+0.092
 meet+0.089
 rebuild+0.089
 build+0.085
keeping+0.082
Neg logit contributions
 brightly-0.117
ww-0.117
 problem-0.114
 sudden-0.113
ement-0.108
Token a
Token ID257
Feature activation-6.71e-02

Pos logit contributions
ww+0.120
 brightly+0.117
ls+0.117
opus+0.115
 sudden+0.113
Neg logit contributions
 find-0.151
 meet-0.149
 rebuild-0.148
 build-0.146
race-0.142
Token very
Token ID845
Feature activation-2.17e-01

Pos logit contributions
ement+0.119
ls+0.119
ummy+0.103
izing+0.101
uddle+0.101
Neg logit contributions
 find-0.143
 meet-0.137
 fix-0.131
 save-0.130
 make-0.130
Token good
Token ID922
Feature activation-1.10e-01

Pos logit contributions
 sudden+0.338
 brightly+0.327
ement+0.326
ww+0.304
izing+0.292
Neg logit contributions
 find-0.479
 meet-0.465
 build-0.441
 share-0.437
 greet-0.433
Token dog
Token ID3290
Feature activation+2.81e-02

Pos logit contributions
 brightly+0.201
 sudden+0.198
ww+0.198
 problem+0.186
ls+0.182
Neg logit contributions
 rebuild-0.210
 find-0.206
keeping-0.204
talk-0.199
 meet-0.198
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
 rebuild+0.051
keeping+0.050
 find+0.049
talk+0.048
 meet+0.048
Neg logit contributions
 brightly-0.054
ww-0.054
 sudden-0.053
 problem-0.052
ls-0.049
Token He
Token ID679
Feature activation-1.86e-02

Pos logit contributions
 rebuild+0.042
keeping+0.041
 meet+0.040
 find+0.040
talk+0.039
Neg logit contributions
 brightly-0.049
ww-0.048
 sudden-0.047
 problem-0.047
ls-0.045
Token liked
Token ID8288
Feature activation-3.15e-02

Pos logit contributions
ww+0.039
 brightly+0.038
 problem+0.038
 sudden+0.037
ummy+0.037
Neg logit contributions
 find-0.030
 rebuild-0.030
 meet-0.030
 build-0.029
talk-0.028
Token Polly
Token ID36898
Feature activation+4.27e-02

Pos logit contributions
 rebuild+0.068
keeping+0.066
talk+0.063
iece+0.063
 find+0.062
Neg logit contributions
 brightly-0.060
ww-0.060
 sudden-0.058
 problem-0.057
ls-0.054
Token.
Token ID13
Feature activation+2.78e-02

Pos logit contributions
 rebuild+0.073
keeping+0.072
talk+0.068
iece+0.068
 find+0.068
Neg logit contributions
 brightly-0.087
ww-0.086
 sudden-0.085
 problem-0.083
ls-0.079
Token Polly
Token ID36898
Feature activation+3.51e-02

Pos logit contributions
 rebuild+0.056
keeping+0.055
talk+0.053
iece+0.052
 find+0.052
Neg logit contributions
 brightly-0.049
ww-0.048
 sudden-0.047
 problem-0.046
ls-0.043
Token was
Token ID373
Feature activation-6.86e-02

Pos logit contributions
 find+0.056
 meet+0.054
 rebuild+0.052
 build+0.050
 make+0.049
Neg logit contributions
ww-0.077
 brightly-0.075
 problem-0.075
 sudden-0.074
ummy-0.072
Token a
Token ID257
Feature activation-6.43e-02

Pos logit contributions
ww+0.118
 brightly+0.114
ls+0.114
uddle+0.112
opus+0.112
Neg logit contributions
 find-0.139
 meet-0.138
 build-0.132
 rebuild-0.131
race-0.127
Token very
Token ID845
Feature activation-2.17e-01

Pos logit contributions
ement+0.111
ls+0.110
uddle+0.096
ummy+0.094
ised+0.092
Neg logit contributions
 find-0.143
 meet-0.136
 make-0.126
 fix-0.125
 save-0.124
Token talk
Token ID1561
Feature activation+5.85e-03

Pos logit contributions
 sudden+0.327
 brightly+0.305
ement+0.305
ww+0.289
 problem+0.283
Neg logit contributions
 find-0.503
 meet-0.485
 share-0.458
 build-0.447
 buy-0.447
Tokenative
Token ID876
Feature activation-6.75e-02

Pos logit contributions
 rebuild+0.012
keeping+0.012
iece+0.011
 find+0.011
talk+0.011
Neg logit contributions
ww-0.010
 sudden-0.010
 brightly-0.010
 problem-0.009
ls-0.009
Token par
Token ID1582
Feature activation-3.48e-02

Pos logit contributions
 brightly+0.112
ww+0.111
 sudden+0.109
 problem+0.104
ls+0.101
Neg logit contributions
keeping-0.139
 rebuild-0.137
 find-0.134
 meet-0.133
talk-0.132
Tokenrot
Token ID10599
Feature activation+7.99e-03

Pos logit contributions
 brightly+0.050
ww+0.049
 sudden+0.048
 problem+0.047
ls+0.043
Neg logit contributions
 rebuild-0.081
keeping-0.079
talk-0.077
iece-0.077
 find-0.076
Token,
Token ID11
Feature activation-9.37e-02

Pos logit contributions
 rebuild+0.013
keeping+0.013
 find+0.012
 meet+0.012
iece+0.012
Neg logit contributions
 brightly-0.017
 sudden-0.017
ww-0.017
 problem-0.016
ls-0.015
Token Sprite
Token ID33132
Feature activation+1.50e-02

Pos logit contributions
 rebuild+0.052
keeping+0.051
 find+0.050
 meet+0.049
race+0.049
Neg logit contributions
 brightly-0.072
ww-0.070
 sudden-0.070
 problem-0.069
ls-0.065
Token.
Token ID13
Feature activation+2.89e-02

Pos logit contributions
 rebuild+0.031
keeping+0.030
talk+0.029
race+0.029
ws+0.029
Neg logit contributions
 brightly-0.026
 sudden-0.025
ww-0.024
 problem-0.023
ls-0.023
Token Sprite
Token ID33132
Feature activation+1.53e-02

Pos logit contributions
 rebuild+0.054
keeping+0.053
talk+0.051
 find+0.051
iece+0.051
Neg logit contributions
 brightly-0.054
ww-0.053
 sudden-0.052
 problem-0.051
ls-0.048
Token was
Token ID373
Feature activation-7.01e-02

Pos logit contributions
 rebuild+0.031
keeping+0.030
 find+0.029
talk+0.029
 meet+0.029
Neg logit contributions
 brightly-0.027
ww-0.026
 sudden-0.026
 problem-0.025
ls-0.024
Token a
Token ID257
Feature activation-6.29e-02

Pos logit contributions
ww+0.108
 brightly+0.106
ls+0.104
opus+0.103
ement+0.102
Neg logit contributions
 find-0.153
 meet-0.150
 build-0.148
 rebuild-0.147
race-0.144
Token very
Token ID845
Feature activation-2.16e-01

Pos logit contributions
ls+0.098
ement+0.098
 sudden+0.084
 problem+0.084
 brightly+0.083
Neg logit contributions
 find-0.143
 meet-0.137
 fix-0.134
 save-0.133
 make-0.132
Token fit
Token ID4197
Feature activation-5.26e-02

Pos logit contributions
 sudden+0.328
ement+0.325
 brightly+0.319
ww+0.288
 problem+0.285
Neg logit contributions
 find-0.482
 meet-0.469
 build-0.449
 share-0.448
 buy-0.442
Token and
Token ID290
Feature activation-6.02e-02

Pos logit contributions
 brightly+0.093
ww+0.092
 sudden+0.092
 problem+0.087
ls+0.084
Neg logit contributions
 rebuild-0.103
keeping-0.103
 find-0.099
talk-0.097
 build-0.097
Token happy
Token ID3772
Feature activation-1.10e-01

Pos logit contributions
 brightly+0.103
 sudden+0.103
ww+0.102
 problem+0.101
ls+0.096
Neg logit contributions
 rebuild-0.119
 find-0.115
 build-0.115
keeping-0.115
 meet-0.114
Token pup
Token ID15552
Feature activation+8.01e-03

Pos logit contributions
ww+0.227
 sudden+0.226
 brightly+0.224
ls+0.220
 problem+0.215
Neg logit contributions
keeping-0.186
 find-0.177
 rebuild-0.176
 meet-0.169
 build-0.167
Token.
Token ID13
Feature activation+2.95e-02

Pos logit contributions
 rebuild+0.014
keeping+0.014
talk+0.013
iece+0.013
 find+0.013
Neg logit contributions
 brightly-0.016
ww-0.016
 sudden-0.015
 problem-0.015
ls-0.014
Token.
Token ID13
Feature activation+3.10e-02

Pos logit contributions
 rebuild+0.059
keeping+0.058
talk+0.055
iece+0.055
race+0.055
Neg logit contributions
 brightly-0.069
ww-0.067
 sudden-0.066
 problem-0.066
ls-0.062
Token Fl
Token ID1610
Feature activation-1.91e-02

Pos logit contributions
 rebuild+0.050
keeping+0.049
 meet+0.048
 find+0.048
 build+0.047
Neg logit contributions
ww-0.065
 brightly-0.065
 sudden-0.064
 problem-0.064
ls-0.060
Tokenuffy
Token ID15352
Feature activation+2.73e-02

Pos logit contributions
 brightly+0.026
 sudden+0.026
ww+0.026
 brand+0.024
 problem+0.024
Neg logit contributions
 rebuild-0.044
 find-0.042
isk-0.041
keeping-0.041
ws-0.041
Token was
Token ID373
Feature activation-6.67e-02

Pos logit contributions
 find+0.042
 meet+0.041
 rebuild+0.040
 build+0.040
 make+0.038
Neg logit contributions
ww-0.061
 brightly-0.060
 sudden-0.060
 problem-0.058
ement-0.057
Token a
Token ID257
Feature activation-6.46e-02

Pos logit contributions
ww+0.107
opus+0.104
 brightly+0.103
 sudden+0.102
ls+0.102
Neg logit contributions
 find-0.142
 meet-0.140
 rebuild-0.137
 build-0.137
 make-0.133
Token very
Token ID845
Feature activation-2.16e-01

Pos logit contributions
ement+0.117
ls+0.113
izing+0.095
ths+0.095
ised+0.095
Neg logit contributions
 find-0.140
 meet-0.133
 fix-0.133
 make-0.132
 save-0.129
Token nos
Token ID43630
Feature activation-3.70e-02

Pos logit contributions
 sudden+0.325
ement+0.324
 brightly+0.310
ths+0.288
ww+0.284
Neg logit contributions
 find-0.484
 meet-0.476
 share-0.460
 buy-0.445
 build-0.445
Tokeny
Token ID88
Feature activation-7.94e-02

Pos logit contributions
 brightly+0.061
 sudden+0.060
ww+0.060
 problem+0.058
ls+0.054
Neg logit contributions
keeping-0.077
 rebuild-0.076
talk-0.073
iece-0.072
race-0.072
Token cat
Token ID3797
Feature activation+3.21e-02

Pos logit contributions
 sudden+0.168
 brightly+0.164
ww+0.161
ls+0.159
 importantly+0.154
Neg logit contributions
 find-0.125
 meet-0.119
 save-0.116
 rebuild-0.115
keeping-0.114
Token.
Token ID13
Feature activation+3.16e-02

Pos logit contributions
 rebuild+0.055
keeping+0.054
talk+0.051
 find+0.051
iece+0.051
Neg logit contributions
 brightly-0.066
ww-0.065
 sudden-0.064
 problem-0.063
ls-0.060
Token She
Token ID1375
Feature activation-1.42e-02

Pos logit contributions
 rebuild+0.051
keeping+0.049
 meet+0.049
 find+0.049
 build+0.048
Neg logit contributions
ww-0.067
 brightly-0.066
 sudden-0.066
 problem-0.065
ls-0.061
Token\n
Token ID198
Feature activation-3.36e-02

Pos logit contributions
 rebuild+0.022
keeping+0.022
 meet+0.021
talk+0.021
 find+0.021
Neg logit contributions
 brightly-0.026
ww-0.026
 problem-0.025
 sudden-0.025
ls-0.024
Token\n
Token ID198
Feature activation-3.50e-02

Pos logit contributions
 brightly+0.059
ww+0.059
 sudden+0.058
 problem+0.056
ls+0.053
Neg logit contributions
 rebuild-0.067
keeping-0.066
iece-0.063
talk-0.063
 find-0.062
TokenBob
Token ID18861
Feature activation-1.45e-03

Pos logit contributions
 brightly+0.061
ww+0.061
 sudden+0.059
 problem+0.058
ls+0.054
Neg logit contributions
 rebuild-0.071
keeping-0.070
talk-0.067
iece-0.066
 find-0.066
Token was
Token ID373
Feature activation-7.54e-02

Pos logit contributions
 brightly+0.003
ww+0.003
 sudden+0.003
 problem+0.003
ls+0.002
Neg logit contributions
 rebuild-0.003
keeping-0.003
 find-0.003
talk-0.003
 meet-0.003
Token a
Token ID257
Feature activation-6.76e-02

Pos logit contributions
ww+0.092
 brightly+0.092
 sudden+0.086
 problem+0.085
ls+0.081
Neg logit contributions
 rebuild-0.192
keeping-0.187
 find-0.184
 meet-0.183
iece-0.182
Token very
Token ID845
Feature activation-2.16e-01

Pos logit contributions
ls+0.120
ummy+0.107
ement+0.107
ww+0.102
 brightly+0.099
Neg logit contributions
 find-0.143
 save-0.137
 rebuild-0.135
 meet-0.135
 join-0.134
Token happy
Token ID3772
Feature activation-1.22e-01

Pos logit contributions
ww+0.317
 brightly+0.316
 sudden+0.300
ummy+0.298
ement+0.296
Neg logit contributions
 find-0.484
 rebuild-0.467
 meet-0.466
 build-0.457
five-0.455
Token man
Token ID582
Feature activation+3.48e-02

Pos logit contributions
ww+0.205
 brightly+0.199
ls+0.199
 sudden+0.193
 problem+0.187
Neg logit contributions
 find-0.244
keeping-0.242
 rebuild-0.240
 meet-0.234
 build-0.229
Token,
Token ID11
Feature activation-9.25e-02

Pos logit contributions
 rebuild+0.077
ws+0.076
keeping+0.075
talk+0.073
iece+0.072
Neg logit contributions
 brightly-0.053
ww-0.050
 problem-0.050
 sudden-0.050
opus-0.045
Token because
Token ID780
Feature activation-7.28e-02

Pos logit contributions
ww+0.160
 brightly+0.159
 problem+0.155
 sudden+0.153
ls+0.147
Neg logit contributions
 rebuild-0.186
keeping-0.183
 find-0.178
 meet-0.176
 build-0.174
Token he
Token ID339
Feature activation-3.00e-02

Pos logit contributions
 sudden+0.129
ww+0.128
uddle+0.124
 problem+0.122
opus+0.118
Neg logit contributions
 rebuild-0.136
 find-0.133
 build-0.133
keeping-0.133
 join-0.131
Token Lucy
Token ID22162
Feature activation+2.65e-02

Pos logit contributions
 rebuild+0.081
keeping+0.078
iece+0.076
talk+0.076
ws+0.076
Neg logit contributions
ww-0.064
 brightly-0.064
 sudden-0.061
 problem-0.060
ls-0.058
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
keeping+0.062
 rebuild+0.061
iece+0.061
race+0.061
talk+0.059
Neg logit contributions
 brightly-0.038
 sudden-0.035
 cute-0.035
ww-0.033
 beautiful-0.033
Token She
Token ID1375
Feature activation-1.91e-02

Pos logit contributions
 rebuild+0.052
keeping+0.051
talk+0.049
iece+0.049
 find+0.049
Neg logit contributions
 brightly-0.056
ww-0.056
 sudden-0.055
 problem-0.054
ls-0.051
Token was
Token ID373
Feature activation-7.00e-02

Pos logit contributions
 brightly+0.036
ww+0.035
 problem+0.034
 sudden+0.034
ls+0.032
Neg logit contributions
 rebuild-0.036
keeping-0.035
 find-0.035
 meet-0.034
 build-0.034
Token a
Token ID257
Feature activation-6.29e-02

Pos logit contributions
ww+0.108
 brightly+0.104
ls+0.102
opus+0.101
 problem+0.101
Neg logit contributions
 find-0.154
 rebuild-0.152
 meet-0.151
 build-0.150
race-0.146
Token very
Token ID845
Feature activation-2.16e-01

Pos logit contributions
ls+0.105
ement+0.103
ww+0.088
izing+0.088
uddle+0.087
Neg logit contributions
 find-0.139
 fix-0.130
 meet-0.129
 make-0.126
 save-0.125
Token curious
Token ID11040
Feature activation-8.03e-02

Pos logit contributions
 sudden+0.337
ement+0.324
 brightly+0.313
ww+0.308
izing+0.296
Neg logit contributions
 find-0.492
 meet-0.466
 build-0.454
 buy-0.449
 share-0.449
Token three
Token ID1115
Feature activation-7.96e-02

Pos logit contributions
 sudden+0.146
ww+0.142
 brightly+0.138
ls+0.137
 problem+0.128
Neg logit contributions
 find-0.151
keeping-0.148
 rebuild-0.148
 meet-0.146
 build-0.143
Token year
Token ID614
Feature activation-1.27e-01

Pos logit contributions
ww+0.109
 brightly+0.104
 sudden+0.104
 brand+0.099
opus+0.097
Neg logit contributions
keeping-0.182
 find-0.179
talk-0.174
 rebuild-0.173
 meet-0.171
Token old
Token ID1468
Feature activation-4.36e-02

Pos logit contributions
 sudden+0.245
ww+0.224
 brightly+0.208
opus+0.198
 problem+0.198
Neg logit contributions
keeping-0.253
 find-0.237
 rebuild-0.231
 meet-0.225
talk-0.221
Token and
Token ID290
Feature activation-6.11e-02

Pos logit contributions
ww+0.094
 brightly+0.091
 sudden+0.091
 problem+0.088
ls+0.086
Neg logit contributions
keeping-0.071
 rebuild-0.069
 find-0.066
talk-0.066
 meet-0.064
Token bracelet
Token ID42893
Feature activation-4.70e-02

Pos logit contributions
 sudden+0.093
ww+0.093
ls+0.090
 importantly+0.087
 brightly+0.087
Neg logit contributions
 find-0.100
keeping-0.095
 meet-0.090
 save-0.090
 build-0.090
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
 brightly+0.087
ww+0.086
 sudden+0.084
 problem+0.083
ls+0.078
Neg logit contributions
 rebuild-0.089
keeping-0.088
iece-0.084
talk-0.084
 find-0.084
Token It
Token ID632
Feature activation-4.30e-02

Pos logit contributions
 rebuild+0.053
keeping+0.052
talk+0.050
iece+0.049
ws+0.049
Neg logit contributions
 brightly-0.054
ww-0.052
 sudden-0.051
 problem-0.050
ls-0.047
Token was
Token ID373
Feature activation-6.81e-02

Pos logit contributions
ww+0.090
 problem+0.085
 sudden+0.084
ummy+0.083
opus+0.082
Neg logit contributions
 find-0.074
 make-0.070
 rebuild-0.069
 build-0.069
 meet-0.068
Token a
Token ID257
Feature activation-6.33e-02

Pos logit contributions
ww+0.118
opus+0.113
ls+0.106
 sudden+0.106
uddle+0.105
Neg logit contributions
 find-0.142
 rebuild-0.136
 build-0.136
 make-0.134
 meet-0.133
Token very
Token ID845
Feature activation-2.16e-01

Pos logit contributions
ls+0.099
ement+0.089
ittens+0.082
ummy+0.081
ww+0.080
Neg logit contributions
 find-0.146
 fix-0.137
 make-0.135
 meet-0.133
 buy-0.133
Token special
Token ID2041
Feature activation-9.94e-02

Pos logit contributions
ww+0.289
 sudden+0.284
opus+0.282
ement+0.270
ittens+0.262
Neg logit contributions
 find-0.537
 buy-0.471
 share-0.471
 meet-0.464
 make-0.464
Token bracelet
Token ID42893
Feature activation-4.83e-02

Pos logit contributions
 sudden+0.179
ls+0.173
ww+0.173
 brightly+0.164
opus+0.158
Neg logit contributions
 find-0.197
keeping-0.185
 buy-0.177
 fix-0.177
 meet-0.176
Token with
Token ID351
Feature activation+1.65e-02

Pos logit contributions
 sudden+0.091
ww+0.089
 brightly+0.089
 problem+0.085
ls+0.080
Neg logit contributions
keeping-0.090
 rebuild-0.089
 find-0.087
iece-0.084
 build-0.084
Token spark
Token ID9009
Feature activation-1.20e-01

Pos logit contributions
keeping+0.034
 find+0.034
 rebuild+0.034
 meet+0.034
iece+0.033
Neg logit contributions
ww-0.027
 problem-0.025
 sudden-0.025
opus-0.025
 brightly-0.025
Tokenles
Token ID829
Feature activation+1.37e-02

Pos logit contributions
ww+0.255
 sudden+0.254
 brightly+0.237
 problem+0.235
opus+0.231
Neg logit contributions
iece-0.191
keeping-0.187
 rebuild-0.179
talk-0.178
ws-0.175
Tokenie
Token ID494
Feature activation+1.27e-02

Pos logit contributions
 rebuild+0.079
keeping+0.077
 find+0.074
talk+0.074
iece+0.073
Neg logit contributions
ww-0.067
 brightly-0.066
 sudden-0.065
 problem-0.063
ls-0.059
Token.
Token ID13
Feature activation+2.71e-02

Pos logit contributions
 rebuild+0.021
keeping+0.021
 find+0.019
talk+0.019
iece+0.019
Neg logit contributions
 brightly-0.027
ww-0.026
 sudden-0.026
 problem-0.025
ls-0.024
Token He
Token ID679
Feature activation-1.51e-02

Pos logit contributions
 rebuild+0.046
keeping+0.045
 meet+0.044
 find+0.043
 build+0.043
Neg logit contributions
ww-0.055
 brightly-0.054
 problem-0.054
 sudden-0.053
ls-0.052
Token was
Token ID373
Feature activation-7.07e-02

Pos logit contributions
ww+0.032
 problem+0.030
 brightly+0.030
 sudden+0.030
ummy+0.030
Neg logit contributions
 find-0.025
 rebuild-0.024
 meet-0.024
 build-0.023
talk-0.023
Token a
Token ID257
Feature activation-6.41e-02

Pos logit contributions
ww+0.117
ls+0.112
opus+0.108
 brightly+0.108
 problem+0.107
Neg logit contributions
 find-0.151
 rebuild-0.150
 meet-0.148
 build-0.147
 greet-0.143
Token very
Token ID845
Feature activation-2.16e-01

Pos logit contributions
ls+0.120
ement+0.106
uddle+0.098
ww+0.097
izing+0.095
Neg logit contributions
 find-0.139
 fix-0.129
 meet-0.129
 save-0.128
 make-0.124
Token calm
Token ID9480
Feature activation-5.08e-02

Pos logit contributions
 sudden+0.319
ww+0.315
 brightly+0.311
ement+0.307
ls+0.281
Neg logit contributions
 find-0.492
 meet-0.470
 build-0.458
 rebuild-0.450
 greet-0.449
Token bear
Token ID6842
Feature activation-4.16e-03

Pos logit contributions
ww+0.088
 sudden+0.087
 brightly+0.086
 problem+0.082
ls+0.081
Neg logit contributions
 rebuild-0.100
keeping-0.100
 find-0.098
talk-0.095
 meet-0.095
Token and
Token ID290
Feature activation-5.94e-02

Pos logit contributions
 brightly+0.007
ww+0.007
 sudden+0.007
 problem+0.007
ls+0.006
Neg logit contributions
 rebuild-0.009
keeping-0.008
talk-0.008
iece-0.008
ws-0.008
Token he
Token ID339
Feature activation-3.17e-02

Pos logit contributions
ww+0.108
 brightly+0.106
 problem+0.104
 sudden+0.103
ls+0.099
Neg logit contributions
 rebuild-0.114
keeping-0.110
 find-0.109
 build-0.108
 meet-0.108
Token liked
Token ID8288
Feature activation-2.72e-02

Pos logit contributions
ww+0.065
 brightly+0.062
 problem+0.062
 sudden+0.060
ummy+0.060
Neg logit contributions
 rebuild-0.054
 find-0.053
 meet-0.052
 build-0.051
keeping-0.049
Token Tim
Token ID5045
Feature activation+5.70e-02

Pos logit contributions
 rebuild+0.053
keeping+0.051
 build+0.049
 find+0.049
 meet+0.049
Neg logit contributions
ww-0.064
 brightly-0.064
 sudden-0.062
 problem-0.061
ls-0.059
Tokenmy
Token ID1820
Feature activation+2.52e-02

Pos logit contributions
 rebuild+0.068
keeping+0.067
iece+0.062
talk+0.062
 find+0.061
Neg logit contributions
 brightly-0.145
ww-0.145
 sudden-0.142
 problem-0.142
ls-0.135
Token didn
Token ID1422
Feature activation-2.66e-02

Pos logit contributions
 rebuild+0.045
 find+0.045
 build+0.043
 fix+0.042
 make+0.041
Neg logit contributions
 brightly-0.049
ww-0.049
 sudden-0.046
ls-0.045
 importantly-0.045
Token't
Token ID470
Feature activation-1.88e-02

Pos logit contributions
 brightly+0.050
ww+0.050
 sudden+0.049
 problem+0.048
ls+0.045
Neg logit contributions
 rebuild-0.050
keeping-0.048
 find-0.048
 build-0.047
race-0.047
Token want
Token ID765
Feature activation+4.42e-02

Pos logit contributions
 problem+0.042
ww+0.042
 brightly+0.042
 lot+0.041
 important+0.039
Neg logit contributions
keeping-0.030
 rebuild-0.027
iece-0.027
ws-0.024
race-0.023
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
 rebuild+0.068
keeping+0.066
 find+0.064
 build+0.063
talk+0.063
Neg logit contributions
ww-0.099
 brightly-0.098
 sudden-0.097
 problem-0.093
opus-0.092
Token clean
Token ID3424
Feature activation+3.09e-02

Pos logit contributions
keeping+0.186
iece+0.176
talk+0.170
ws+0.169
 rebuild+0.163
Neg logit contributions
 brightly-0.208
 problem-0.204
ww-0.203
 sudden-0.196
ummy-0.192
Token his
Token ID465
Feature activation+2.96e-02

Pos logit contributions
 rebuild+0.050
keeping+0.049
 find+0.047
iece+0.047
 build+0.047
Neg logit contributions
ww-0.069
 sudden-0.066
 brightly-0.066
 problem-0.063
opus-0.062
Token room
Token ID2119
Feature activation-1.28e-02

Pos logit contributions
 rebuild+0.056
 find+0.055
 build+0.053
keeping+0.052
iece+0.050
Neg logit contributions
ww-0.061
 brightly-0.058
 sudden-0.056
opus-0.053
mber-0.052
Token because
Token ID780
Feature activation-5.67e-02

Pos logit contributions
ww+0.018
 brightly+0.018
 sudden+0.017
 problem+0.016
ls+0.015
Neg logit contributions
 rebuild-0.030
keeping-0.030
 find-0.029
iece-0.029
talk-0.029
Token he
Token ID339
Feature activation-2.05e-02

Pos logit contributions
ww+0.106
 sudden+0.104
opus+0.096
 brightly+0.096
 problem+0.096
Neg logit contributions
 build-0.103
 rebuild-0.103
 find-0.102
keeping-0.100
 buy-0.097
Token was
Token ID373
Feature activation-6.87e-02

Pos logit contributions
ww+0.024
 brightly+0.023
ummy+0.022
 problem+0.022
 sudden+0.022
Neg logit contributions
 rebuild-0.020
 find-0.020
 meet-0.019
keeping-0.019
talk-0.019
Token so
Token ID523
Feature activation-1.68e-01

Pos logit contributions
ww+0.099
 brightly+0.096
 sudden+0.091
 problem+0.089
opus+0.087
Neg logit contributions
 rebuild-0.162
 find-0.157
keeping-0.157
 meet-0.154
 build-0.152
Token tired
Token ID10032
Feature activation-9.50e-02

Pos logit contributions
ww+0.250
 brightly+0.232
 problem+0.226
 sudden+0.225
opus+0.222
Neg logit contributions
 rebuild-0.383
 find-0.377
 meet-0.369
keeping-0.365
 build-0.355
Token and
Token ID290
Feature activation-6.02e-02

Pos logit contributions
 brightly+0.135
ww+0.134
 sudden+0.130
 problem+0.127
ls+0.118
Neg logit contributions
 rebuild-0.221
keeping-0.218
talk-0.210
iece-0.210
 find-0.210
Token wanted
Token ID2227
Feature activation+3.24e-02

Pos logit contributions
 brightly+0.091
ww+0.090
 sudden+0.087
 problem+0.086
ls+0.080
Neg logit contributions
 rebuild-0.135
keeping-0.134
talk-0.128
iece-0.128
 find-0.128
Token to
Token ID284
Feature activation+9.53e-02

Pos logit contributions
 rebuild+0.060
keeping+0.059
iece+0.057
talk+0.057
race+0.056
Neg logit contributions
 brightly-0.062
ww-0.060
 sudden-0.060
 problem-0.059
ls-0.055
Token go
Token ID467
Feature activation+5.79e-02

Pos logit contributions
iece+0.138
ws+0.126
talk+0.123
keeping+0.122
five+0.116
Neg logit contributions
 brightly-0.216
 lot-0.208
 problem-0.207
 sudden-0.204
 beautiful-0.204
Token back
Token ID736
Feature activation+4.34e-02

Pos logit contributions
 rebuild+0.112
iece+0.111
keeping+0.111
ws+0.109
talk+0.106
Neg logit contributions
 brightly-0.101
 problem-0.099
ls-0.095
 sudden-0.095
ww-0.095
Token to
Token ID284
Feature activation+1.05e-01

Pos logit contributions
 rebuild+0.085
iece+0.082
keeping+0.082
talk+0.080
ws+0.080
Neg logit contributions
 brightly-0.078
ww-0.075
 problem-0.074
 sudden-0.074
ls-0.071
Token sleep
Token ID3993
Feature activation+1.77e-03

Pos logit contributions
talk+0.209
iece+0.200
keeping+0.196
 rebuild+0.194
ws+0.191
Neg logit contributions
 brightly-0.182
 problem-0.181
 sudden-0.174
ww-0.166
 lot-0.164
Token.
Token ID13
Feature activation+3.06e-02

Pos logit contributions
 rebuild+0.003
keeping+0.003
talk+0.003
iece+0.003
 find+0.003
Neg logit contributions
 brightly-0.003
ww-0.003
 sudden-0.003
 problem-0.003
ls-0.003
Token
Token ID198
Feature activation-2.80e-02

Pos logit contributions
 rebuild+0.054
keeping+0.053
 find+0.051
talk+0.050
 meet+0.050
Neg logit contributions
ww-0.072
 brightly-0.070
 problem-0.069
 sudden-0.068
opus-0.065
Token
Token ID198
Feature activation-2.96e-02

Pos logit contributions
 brightly+0.043
 sudden+0.042
ww+0.041
ls+0.039
 problem+0.039
Neg logit contributions
keeping-0.062
 rebuild-0.062
talk-0.060
iece-0.060
race-0.059
TokenBut
Token ID1537
Feature activation-3.80e-02

Pos logit contributions
 brightly+0.053
ww+0.053
 sudden+0.052
 problem+0.049
ls+0.047
Neg logit contributions
keeping-0.058
 rebuild-0.058
talk-0.055
iece-0.055
ws-0.055
Token they
Token ID484
Feature activation+1.81e-02

Pos logit contributions
ww+0.079
 brightly+0.076
opus+0.075
 sudden+0.074
 problem+0.074
Neg logit contributions
 rebuild-0.062
 find-0.061
iece-0.060
keeping-0.059
 save-0.058
Token forgot
Token ID16453
Feature activation+3.76e-02

Pos logit contributions
 rebuild+0.036
keeping+0.035
talk+0.034
iece+0.034
 find+0.034
Neg logit contributions
 brightly-0.032
ww-0.032
 sudden-0.031
 problem-0.031
ls-0.029
Token to
Token ID284
Feature activation+8.49e-02

Pos logit contributions
 rebuild+0.066
keeping+0.064
 find+0.062
 build+0.061
 meet+0.060
Neg logit contributions
ww-0.076
 brightly-0.075
 sudden-0.074
 problem-0.072
opus-0.069
Token set
Token ID900
Feature activation+3.51e-02

Pos logit contributions
iece+0.129
keeping+0.124
race+0.114
talk+0.111
five+0.106
Neg logit contributions
 brightly-0.187
 problem-0.185
ww-0.184
 sudden-0.182
 lot-0.177
Token a
Token ID257
Feature activation-3.18e-02

Pos logit contributions
keeping+0.064
 rebuild+0.063
talk+0.062
race+0.061
iece+0.060
Neg logit contributions
 brightly-0.068
 problem-0.065
 sudden-0.064
ww-0.063
ls-0.063
Token timer
Token ID19781
Feature activation-1.68e-02

Pos logit contributions
ww+0.047
 brightly+0.047
 importantly+0.045
ement+0.045
opus+0.043
Neg logit contributions
 find-0.070
 rebuild-0.069
 save-0.067
 build-0.065
 buy-0.065
Token or
Token ID393
Feature activation-4.73e-02

Pos logit contributions
 problem+0.024
 brightly+0.023
 sudden+0.022
ww+0.021
ls+0.019
Neg logit contributions
 rebuild-0.038
ws-0.036
keeping-0.036
race-0.035
 greet-0.034
Token check
Token ID2198
Feature activation+5.80e-02

Pos logit contributions
 brightly+0.098
 sudden+0.095
 problem+0.093
ww+0.091
 brand+0.085
Neg logit contributions
keeping-0.076
race-0.075
iece-0.075
 rebuild-0.074
talk-0.073
Token We
Token ID775
Feature activation+8.01e-02

Pos logit contributions
 rebuild+0.078
keeping+0.076
talk+0.072
iece+0.072
 find+0.072
Neg logit contributions
 brightly-0.094
ww-0.093
 sudden-0.092
 problem-0.090
ls-0.086
Token should
Token ID815
Feature activation+3.48e-02

Pos logit contributions
keeping+0.153
 rebuild+0.151
iece+0.149
race+0.143
talk+0.141
Neg logit contributions
 brightly-0.146
ww-0.144
 problem-0.143
 sudden-0.140
ls-0.133
Token listen
Token ID6004
Feature activation+2.48e-02

Pos logit contributions
keeping+0.050
iece+0.049
 rebuild+0.045
talk+0.045
race+0.044
Neg logit contributions
ww-0.080
 problem-0.080
 brightly-0.079
 sudden-0.076
 lot-0.074
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
 rebuild+0.042
keeping+0.041
 find+0.039
talk+0.039
iece+0.038
Neg logit contributions
ww-0.051
 brightly-0.051
 sudden-0.050
 problem-0.049
ls-0.047
Token our
Token ID674
Feature activation+3.55e-02

Pos logit contributions
keeping+0.202
 rebuild+0.199
talk+0.199
iece+0.198
race+0.195
Neg logit contributions
 brightly-0.204
ww-0.197
 problem-0.194
 sudden-0.193
 lot-0.179
Token friends
Token ID2460
Feature activation+1.06e-01

Pos logit contributions
 rebuild+0.072
keeping+0.070
 find+0.068
talk+0.067
 meet+0.067
Neg logit contributions
ww-0.063
 brightly-0.062
 sudden-0.060
 problem-0.057
ls-0.055
Token and
Token ID290
Feature activation-3.01e-02

Pos logit contributions
 rebuild+0.238
keeping+0.236
talk+0.229
iece+0.226
race+0.226
Neg logit contributions
 brightly-0.158
ww-0.153
 problem-0.151
 sudden-0.148
ls-0.138
Token respect
Token ID2461
Feature activation+7.11e-02

Pos logit contributions
 brightly+0.073
 problem+0.070
ww+0.070
 sudden+0.070
 lot+0.066
Neg logit contributions
iece-0.039
race-0.036
talk-0.036
keeping-0.036
 rebuild-0.035
Token their
Token ID511
Feature activation+3.92e-02

Pos logit contributions
 rebuild+0.126
keeping+0.122
iece+0.121
race+0.119
talk+0.118
Neg logit contributions
 brightly-0.142
ww-0.138
 problem-0.135
 sudden-0.133
ls-0.128
Token style
Token ID3918
Feature activation-1.65e-02

Pos logit contributions
 rebuild+0.090
keeping+0.089
talk+0.085
iece+0.085
 find+0.085
Neg logit contributions
 brightly-0.058
ww-0.057
 sudden-0.056
 problem-0.054
ls-0.051
Token.
Token ID13
Feature activation+4.26e-02

Pos logit contributions
 brightly+0.031
ww+0.030
 problem+0.029
 sudden+0.028
ls+0.028
Neg logit contributions
 rebuild-0.031
keeping-0.030
talk-0.030
race-0.029
 meet-0.029
Token.
Token ID13
Feature activation+4.08e-02

Pos logit contributions
 rebuild+0.038
keeping+0.037
race+0.036
talk+0.035
iece+0.035
Neg logit contributions
 brightly-0.034
 problem-0.032
 sudden-0.032
ww-0.032
ls-0.030
Token They
Token ID1119
Feature activation+2.65e-02

Pos logit contributions
 rebuild+0.066
keeping+0.064
talk+0.061
 find+0.061
iece+0.061
Neg logit contributions
 brightly-0.088
ww-0.087
 sudden-0.085
 problem-0.084
ls-0.080
Token did
Token ID750
Feature activation-3.54e-02

Pos logit contributions
keeping+0.051
 rebuild+0.050
iece+0.048
race+0.047
talk+0.046
Neg logit contributions
 brightly-0.050
 sudden-0.048
ww-0.048
 problem-0.047
ls-0.045
Token not
Token ID407
Feature activation-8.70e-03

Pos logit contributions
ww+0.064
 brightly+0.063
 sudden+0.061
 problem+0.059
opus+0.058
Neg logit contributions
 rebuild-0.071
 find-0.068
 build-0.066
keeping-0.066
 meet-0.066
Token want
Token ID765
Feature activation+4.34e-02

Pos logit contributions
ww+0.019
 problem+0.019
 brightly+0.019
 lot+0.018
 cute+0.018
Neg logit contributions
keeping-0.013
iece-0.013
 rebuild-0.012
race-0.011
ried-0.011
Token to
Token ID284
Feature activation+1.06e-01

Pos logit contributions
 rebuild+0.070
keeping+0.069
 find+0.066
talk+0.066
 meet+0.066
Neg logit contributions
ww-0.092
 brightly-0.091
 sudden-0.090
 problem-0.088
opus-0.085
Token wear
Token ID5806
Feature activation+5.72e-02

Pos logit contributions
keeping+0.162
iece+0.158
 rebuild+0.146
ws+0.146
talk+0.143
Neg logit contributions
 brightly-0.239
 problem-0.232
ww-0.231
 sudden-0.229
 lot-0.225
Token hats
Token ID23910
Feature activation+4.17e-02

Pos logit contributions
 rebuild+0.090
keeping+0.087
 find+0.086
 meet+0.084
 build+0.083
Neg logit contributions
ww-0.126
 sudden-0.122
 brightly-0.120
 problem-0.118
opus-0.118
Token and
Token ID290
Feature activation-3.97e-02

Pos logit contributions
 rebuild+0.085
keeping+0.084
talk+0.080
iece+0.080
race+0.080
Neg logit contributions
 brightly-0.072
ww-0.069
 problem-0.068
 sudden-0.067
ls-0.064
Token coats
Token ID30720
Feature activation+1.83e-02

Pos logit contributions
 brightly+0.068
ww+0.068
 sudden+0.066
 problem+0.065
ls+0.061
Neg logit contributions
 rebuild-0.080
keeping-0.079
 find-0.076
talk-0.076
 meet-0.075
Token and
Token ID290
Feature activation-5.47e-02

Pos logit contributions
 rebuild+0.038
keeping+0.038
race+0.037
ws+0.037
talk+0.036
Neg logit contributions
 brightly-0.031
 problem-0.028
ww-0.027
 sudden-0.027
 brand-0.026
Token taste
Token ID6938
Feature activation-3.07e-02

Pos logit contributions
 rebuild+0.061
keeping+0.061
 find+0.059
 meet+0.058
 build+0.057
Neg logit contributions
ww-0.062
 brightly-0.059
 sudden-0.059
opus-0.057
 problem-0.056
Token,
Token ID11
Feature activation-8.14e-02

Pos logit contributions
ww+0.055
 sudden+0.055
opus+0.052
 brightly+0.051
 problem+0.050
Neg logit contributions
 rebuild-0.059
 find-0.056
keeping-0.055
 make-0.054
 build-0.054
Token but
Token ID475
Feature activation-3.52e-02

Pos logit contributions
ww+0.152
 sudden+0.144
 problem+0.142
 brightly+0.141
opus+0.138
Neg logit contributions
 rebuild-0.157
keeping-0.152
 find-0.149
 build-0.146
iece-0.145
Token also
Token ID635
Feature activation-5.97e-02

Pos logit contributions
ww+0.064
 sudden+0.061
opus+0.059
uddle+0.058
 brightly+0.057
Neg logit contributions
 rebuild-0.070
 find-0.068
keeping-0.066
 build-0.066
 meet-0.065
Token about
Token ID546
Feature activation+1.37e-02

Pos logit contributions
ww+0.097
 brightly+0.096
 sudden+0.094
 problem+0.091
opus+0.087
Neg logit contributions
 rebuild-0.127
keeping-0.123
 find-0.121
 meet-0.118
 build-0.118
Token kindness
Token ID23887
Feature activation+1.25e-02

Pos logit contributions
 rebuild+0.025
keeping+0.024
 find+0.023
 meet+0.023
 build+0.023
Neg logit contributions
ww-0.027
 brightly-0.026
 sudden-0.026
 problem-0.025
opus-0.024
Token and
Token ID290
Feature activation-4.19e-02

Pos logit contributions
 rebuild+0.026
keeping+0.025
talk+0.024
iece+0.024
 find+0.024
Neg logit contributions
 brightly-0.021
ww-0.021
 sudden-0.020
 problem-0.020
ls-0.019
Token friendship
Token ID14738
Feature activation-1.17e-02

Pos logit contributions
 brightly+0.082
ww+0.081
 sudden+0.080
 problem+0.078
ls+0.074
Neg logit contributions
 rebuild-0.075
keeping-0.074
talk-0.070
 find-0.070
iece-0.070
Token.
Token ID13
Feature activation+3.93e-02

Pos logit contributions
 brightly+0.023
ww+0.022
 problem+0.021
 sudden+0.021
ls+0.020
Neg logit contributions
 rebuild-0.021
keeping-0.021
talk-0.020
iece-0.020
race-0.020
Token<|endoftext|>
Token ID50256
Feature activation+4.70e-02

Pos logit contributions
keeping+0.064
 rebuild+0.064
 meet+0.061
 find+0.060
talk+0.060
Neg logit contributions
 brightly-0.082
ww-0.082
 problem-0.081
 sudden-0.081
ls-0.077
TokenAnna
Token ID31160
Feature activation+4.32e-02

Pos logit contributions
 rebuild+0.113
keeping+0.108
iece+0.106
talk+0.106
ws+0.105
Neg logit contributions
 brightly-0.065
ww-0.064
 sudden-0.061
 problem-0.061
ls-0.058
Token here
Token ID994
Feature activation-2.95e-04

Pos logit contributions
 rebuild+0.115
keeping+0.114
 find+0.109
talk+0.108
iece+0.107
Neg logit contributions
 brightly-0.111
ww-0.110
 sudden-0.108
 problem-0.106
ls-0.099
Token!"
Token ID2474
Feature activation+4.22e-02

Pos logit contributions
ww+0.001
 brightly+0.001
 sudden+0.001
 problem+0.001
ls+0.000
Neg logit contributions
 rebuild-0.001
keeping-0.001
 find-0.001
talk-0.001
iece-0.001
Token
Token ID198
Feature activation-2.89e-02

Pos logit contributions
 rebuild+0.091
keeping+0.089
iece+0.085
talk+0.085
ws+0.085
Neg logit contributions
 brightly-0.070
 sudden-0.068
ww-0.067
 problem-0.065
ls-0.062
Token
Token ID198
Feature activation-3.02e-02

Pos logit contributions
 brightly+0.038
 sudden+0.037
ww+0.036
ls+0.035
 lot+0.033
Neg logit contributions
keeping-0.071
 rebuild-0.071
iece-0.070
ws-0.067
race-0.067
TokenL
Token ID43
Feature activation+1.59e-02

Pos logit contributions
ww+0.051
 brightly+0.050
 sudden+0.049
 problem+0.044
 lot+0.043
Neg logit contributions
keeping-0.065
 rebuild-0.064
ws-0.062
 meet-0.060
 greet-0.060
Tokenily
Token ID813
Feature activation+5.10e-02

Pos logit contributions
 rebuild+0.025
keeping+0.025
 greet+0.024
 meet+0.023
talk+0.023
Neg logit contributions
 problem-0.033
ww-0.033
ls-0.032
ement-0.031
uddle-0.031
Token comes
Token ID2058
Feature activation+5.34e-02

Pos logit contributions
 rebuild+0.092
 find+0.088
 build+0.086
keeping+0.086
 meet+0.085
Neg logit contributions
ww-0.103
 brightly-0.101
 sudden-0.099
 problem-0.094
 importantly-0.090
Token to
Token ID284
Feature activation+8.87e-02

Pos logit contributions
ws+0.109
keeping+0.108
 rebuild+0.107
talk+0.102
 supervision+0.099
Neg logit contributions
 brightly-0.092
 sudden-0.088
 problem-0.083
ls-0.077
 beautiful-0.076
Token see
Token ID766
Feature activation+4.70e-02

Pos logit contributions
keeping+0.219
iece+0.217
ws+0.211
oscope+0.209
talk+0.209
Neg logit contributions
 brightly-0.112
 lot-0.106
 beautiful-0.102
 new-0.102
 long-0.097
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
keeping+0.104
 rebuild+0.101
talk+0.098
race+0.097
ried+0.096
Neg logit contributions
 brightly-0.076
 problem-0.070
ls-0.069
 sudden-0.069
 brand-0.066
Token She
Token ID1375
Feature activation-9.65e-03

Pos logit contributions
 rebuild+0.059
keeping+0.057
iece+0.055
talk+0.055
ws+0.055
Neg logit contributions
 brightly-0.048
 problem-0.045
ww-0.045
 sudden-0.045
ls-0.042
Token came
Token ID1625
Feature activation+4.83e-02

Pos logit contributions
 rebuild+0.133
keeping+0.129
talk+0.124
 find+0.124
iece+0.123
Neg logit contributions
 brightly-0.153
ww-0.152
 sudden-0.147
 problem-0.145
ls-0.138
Token in
Token ID287
Feature activation+7.27e-02

Pos logit contributions
 rebuild+0.097
keeping+0.095
talk+0.093
iece+0.093
ws+0.093
Neg logit contributions
 brightly-0.084
 sudden-0.083
 problem-0.080
ww-0.078
ls-0.074
Token and
Token ID290
Feature activation-3.86e-02

Pos logit contributions
 rebuild+0.174
iece+0.174
keeping+0.172
talk+0.171
ws+0.170
Neg logit contributions
 brightly-0.099
 problem-0.098
 sudden-0.096
ww-0.088
 lot-0.085
Token said
Token ID531
Feature activation+1.93e-02

Pos logit contributions
 brightly+0.075
ww+0.075
 sudden+0.072
 problem+0.072
ls+0.068
Neg logit contributions
 rebuild-0.070
keeping-0.068
 find-0.066
talk-0.065
iece-0.065
Token,
Token ID11
Feature activation-7.10e-02

Pos logit contributions
 rebuild+0.042
keeping+0.040
 find+0.040
 save+0.039
 meet+0.039
Neg logit contributions
ww-0.030
 sudden-0.029
 brightly-0.028
 problem-0.026
opus-0.026
Token "
Token ID366
Feature activation+9.35e-03

Pos logit contributions
ww+0.146
 brightly+0.135
 sudden+0.135
 problem+0.133
ls+0.132
Neg logit contributions
 rebuild-0.121
 find-0.117
 buy-0.115
 build-0.115
 meet-0.114
TokenTime
Token ID7575
Feature activation-1.57e-02

Pos logit contributions
 rebuild+0.017
keeping+0.017
ws+0.016
 build+0.016
 find+0.016
Neg logit contributions
ww-0.019
 sudden-0.018
 brightly-0.018
 problem-0.017
ls-0.016
Token to
Token ID284
Feature activation+1.18e-01

Pos logit contributions
 brightly+0.031
ww+0.031
 sudden+0.030
 problem+0.030
ls+0.028
Neg logit contributions
 rebuild-0.028
keeping-0.027
talk-0.026
iece-0.026
 find-0.026
Token turn
Token ID1210
Feature activation+3.31e-03

Pos logit contributions
iece+0.160
keeping+0.149
five+0.141
talk+0.140
ws+0.136
Neg logit contributions
 brightly-0.277
 problem-0.275
ww-0.274
 lot-0.266
 sudden-0.266
Token off
Token ID572
Feature activation+4.26e-02

Pos logit contributions
 rebuild+0.006
keeping+0.006
 find+0.006
 build+0.005
 meet+0.005
Neg logit contributions
 sudden-0.006
ww-0.006
 brightly-0.006
opus-0.006
 problem-0.006
Token the
Token ID262
Feature activation+5.84e-02

Pos logit contributions
 rebuild+0.082
keeping+0.081
 find+0.078
 meet+0.077
 build+0.077
Neg logit contributions
 brightly-0.076
ww-0.076
 sudden-0.076
 problem-0.073
opus-0.069
Token Lily
Token ID20037
Feature activation+5.09e-02

Pos logit contributions
keeping+0.051
 rebuild+0.051
 buy+0.050
 find+0.049
 meet+0.049
Neg logit contributions
ww-0.068
 brightly-0.067
 problem-0.067
 sudden-0.066
ls-0.063
Token told
Token ID1297
Feature activation+6.37e-02

Pos logit contributions
 find+0.090
 rebuild+0.089
 meet+0.084
 build+0.084
 buy+0.083
Neg logit contributions
ww-0.106
 brightly-0.106
 sudden-0.098
 importantly-0.096
 problem-0.096
Token her
Token ID607
Feature activation+3.22e-02

Pos logit contributions
 rebuild+0.113
keeping+0.112
iece+0.108
talk+0.107
ws+0.106
Neg logit contributions
 brightly-0.126
ww-0.124
 problem-0.121
 sudden-0.120
ls-0.113
Token about
Token ID546
Feature activation+2.71e-02

Pos logit contributions
 rebuild+0.065
 find+0.062
keeping+0.062
 meet+0.060
 build+0.060
Neg logit contributions
ww-0.058
 brightly-0.058
 sudden-0.057
 problem-0.053
opus-0.051
Token the
Token ID262
Feature activation+4.77e-02

Pos logit contributions
 rebuild+0.038
keeping+0.037
 find+0.036
 meet+0.036
 build+0.035
Neg logit contributions
ww-0.065
 brightly-0.063
 sudden-0.063
 problem-0.061
uddle-0.060
Token bird
Token ID6512
Feature activation+3.97e-02

Pos logit contributions
 find+0.098
 rebuild+0.096
keeping+0.095
 meet+0.095
 build+0.093
Neg logit contributions
ww-0.080
 brightly-0.076
 sudden-0.075
ls-0.073
opus-0.071
Token's
Token ID338
Feature activation-9.77e-03

Pos logit contributions
 rebuild+0.084
keeping+0.082
 find+0.081
 build+0.079
 meet+0.079
Neg logit contributions
ww-0.066
 brightly-0.064
 sudden-0.063
 problem-0.060
opus-0.057
Token broken
Token ID5445
Feature activation-1.12e-04

Pos logit contributions
ww+0.016
 brightly+0.016
 sudden+0.015
 problem+0.014
ls+0.014
Neg logit contributions
 rebuild-0.021
keeping-0.020
 find-0.020
 meet-0.020
 build-0.019
Token wing
Token ID8539
Feature activation+2.06e-02

Pos logit contributions
ww+0.000
 brightly+0.000
 sudden+0.000
 problem+0.000
ls+0.000
Neg logit contributions
 rebuild-0.000
keeping-0.000
 find-0.000
iece-0.000
 build-0.000
Token and
Token ID290
Feature activation-5.03e-02

Pos logit contributions
 rebuild+0.045
keeping+0.044
talk+0.043
iece+0.043
race+0.043
Neg logit contributions
 brightly-0.032
ww-0.031
 problem-0.030
 sudden-0.030
ls-0.028
Token how
Token ID703
Feature activation-5.86e-03

Pos logit contributions
 brightly+0.085
ww+0.084
 sudden+0.081
 problem+0.080
ls+0.075
Neg logit contributions
 rebuild-0.105
keeping-0.103
 find-0.098
iece-0.098
talk-0.098
Token another
Token ID1194
Feature activation-1.72e-02

Pos logit contributions
keeping+0.123
 rebuild+0.121
talk+0.118
iece+0.116
race+0.115
Neg logit contributions
 brightly-0.113
 sudden-0.108
 problem-0.106
ww-0.101
ls-0.099
Token way
Token ID835
Feature activation-2.53e-02

Pos logit contributions
 brightly+0.032
 problem+0.031
ww+0.031
 sudden+0.031
ls+0.028
Neg logit contributions
 rebuild-0.033
keeping-0.033
iece-0.032
talk-0.031
ws-0.031
Token in
Token ID287
Feature activation+7.49e-02

Pos logit contributions
 brightly+0.042
 problem+0.041
ls+0.039
ww+0.038
 sudden+0.038
Neg logit contributions
 rebuild-0.053
ws-0.052
keeping-0.051
five-0.050
talk-0.050
Token.
Token ID13
Feature activation+3.63e-02

Pos logit contributions
 rebuild+0.144
talk+0.138
five+0.138
iece+0.137
ws+0.137
Neg logit contributions
 brightly-0.139
 problem-0.138
 sudden-0.134
 lot-0.127
ww-0.125
Token She
Token ID1375
Feature activation-4.13e-03

Pos logit contributions
 rebuild+0.077
keeping+0.077
iece+0.073
talk+0.073
ws+0.072
Neg logit contributions
 brightly-0.060
 sudden-0.058
ww-0.056
 problem-0.054
ls-0.051
Token saw
Token ID2497
Feature activation+3.23e-02

Pos logit contributions
 brightly+0.008
 sudden+0.008
ww+0.007
 problem+0.007
ls+0.007
Neg logit contributions
 rebuild-0.008
keeping-0.008
iece-0.007
race-0.007
talk-0.007
Token a
Token ID257
Feature activation-4.25e-02

Pos logit contributions
keeping+0.066
 rebuild+0.065
talk+0.062
race+0.061
 meet+0.061
Neg logit contributions
 brightly-0.058
 sudden-0.055
ww-0.054
 problem-0.054
ls-0.052
Token hole
Token ID7604
Feature activation-8.51e-02

Pos logit contributions
ww+0.067
 brightly+0.064
ls+0.062
ement+0.060
 sudden+0.060
Neg logit contributions
 rebuild-0.090
 find-0.089
 build-0.086
keeping-0.085
 save-0.085
Token in
Token ID287
Feature activation+6.61e-02

Pos logit contributions
 brightly+0.146
 problem+0.135
ls+0.132
 sudden+0.130
ww+0.129
Neg logit contributions
keeping-0.181
 rebuild-0.179
talk-0.174
 meet-0.171
race-0.169
Token the
Token ID262
Feature activation+3.26e-02

Pos logit contributions
 rebuild+0.122
talk+0.117
iece+0.117
ws+0.114
keeping+0.114
Neg logit contributions
 brightly-0.131
 sudden-0.124
 problem-0.122
ww-0.118
 lot-0.115
Token wall
Token ID3355
Feature activation-2.42e-02

Pos logit contributions
ws+0.073
race+0.072
 rebuild+0.072
keeping+0.069
talk+0.068
Neg logit contributions
 problem-0.051
 lot-0.050
 brightly-0.050
 sudden-0.049
ww-0.042
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
 brightly+0.041
ww+0.041
 sudden+0.039
 problem+0.039
ls+0.037
Neg logit contributions
 rebuild-0.034
keeping-0.033
iece-0.032
 find-0.032
talk-0.032
Token
Token ID220
Feature activation-1.10e-01

Pos logit contributions
keeping+0.058
 rebuild+0.058
iece+0.055
talk+0.055
ws+0.054
Neg logit contributions
 brightly-0.048
ww-0.046
 sudden-0.046
 problem-0.045
ummy-0.042
Token
Token ID198
Feature activation-3.08e-02

Pos logit contributions
 brightly+0.146
 sudden+0.136
ls+0.128
ww+0.126
 problem+0.126
Neg logit contributions
keeping-0.264
 rebuild-0.263
talk-0.254
iece-0.252
race-0.252
Token
Token ID198
Feature activation-3.13e-02

Pos logit contributions
 brightly+0.052
 sudden+0.050
ww+0.050
 problem+0.048
ls+0.045
Neg logit contributions
 rebuild-0.064
keeping-0.064
iece-0.061
ws-0.060
talk-0.060
TokenOne
Token ID3198
Feature activation-5.08e-02

Pos logit contributions
 brightly+0.054
ww+0.053
 problem+0.052
 sudden+0.052
ls+0.049
Neg logit contributions
 rebuild-0.063
keeping-0.060
 find-0.060
 build-0.060
talk-0.060
Token day
Token ID1110
Feature activation-8.76e-02

Pos logit contributions
 brightly+0.083
 sudden+0.083
ww+0.081
 problem+0.079
ls+0.078
Neg logit contributions
 find-0.103
 rebuild-0.102
 meet-0.101
 build-0.099
keeping-0.096
Token,
Token ID11
Feature activation-7.93e-02

Pos logit contributions
ww+0.146
 brightly+0.145
opus+0.141
 problem+0.138
 sudden+0.137
Neg logit contributions
 build-0.170
 rebuild-0.167
 find-0.164
 buy-0.163
 save-0.163
Token Tim
Token ID5045
Feature activation+4.70e-02

Pos logit contributions
ww+0.197
 problem+0.190
opus+0.187
ls+0.182
 brightly+0.182
Neg logit contributions
 build-0.102
 rebuild-0.101
 buy-0.096
 find-0.095
 save-0.092
Token was
Token ID373
Feature activation-6.21e-02

Pos logit contributions
 rebuild+0.085
keeping+0.084
five+0.081
iece+0.080
talk+0.079
Neg logit contributions
 sudden-0.090
ww-0.089
 problem-0.088
 brightly-0.087
ls-0.082
Token walking
Token ID6155
Feature activation+4.99e-02

Pos logit contributions
ww+0.116
 brightly+0.113
ls+0.103
opus+0.103
ement+0.102
Neg logit contributions
 find-0.123
 rebuild-0.123
 build-0.118
 meet-0.117
race-0.115
Token through
Token ID832
Feature activation+4.61e-02

Pos logit contributions
keeping+0.096
 rebuild+0.096
ws+0.090
iece+0.090
talk+0.090
Neg logit contributions
 brightly-0.094
ww-0.088
 sudden-0.088
 problem-0.087
ls-0.083
Token.
Token ID13
Feature activation+3.37e-02

Pos logit contributions
 rebuild+0.061
 find+0.060
keeping+0.059
iece+0.057
 build+0.057
Neg logit contributions
ww-0.061
 sudden-0.057
 brightly-0.056
opus-0.053
ls-0.052
Token<|endoftext|>
Token ID50256
Feature activation+4.59e-02

Pos logit contributions
 rebuild+0.069
keeping+0.067
iece+0.065
talk+0.065
race+0.064
Neg logit contributions
ww-0.058
 brightly-0.058
 sudden-0.055
 problem-0.055
ls-0.052
TokenOne
Token ID3198
Feature activation-4.12e-02

Pos logit contributions
 rebuild+0.114
iece+0.108
keeping+0.108
ws+0.107
talk+0.106
Neg logit contributions
 brightly-0.059
ww-0.056
 sudden-0.055
 problem-0.054
ls-0.054
Token day
Token ID1110
Feature activation-8.10e-02

Pos logit contributions
 brightly+0.059
ww+0.058
 sudden+0.057
 problem+0.055
ls+0.052
Neg logit contributions
 rebuild-0.095
keeping-0.094
 find-0.091
 meet-0.091
iece-0.090
Token,
Token ID11
Feature activation-6.89e-02

Pos logit contributions
 brightly+0.143
 sudden+0.139
 brand+0.136
ement+0.127
 importantly+0.125
Neg logit contributions
 find-0.154
 meet-0.152
 join-0.148
 build-0.145
 buy-0.143
Token a
Token ID257
Feature activation-3.38e-02

Pos logit contributions
ww+0.136
 problem+0.134
 brightly+0.134
 sudden+0.133
opus+0.127
Neg logit contributions
 rebuild-0.117
 find-0.115
 meet-0.111
keeping-0.111
 build-0.110
Token girl
Token ID2576
Feature activation-2.30e-03

Pos logit contributions
ement+0.042
 brightly+0.041
ww+0.041
ls+0.041
 problem+0.041
Neg logit contributions
 find-0.079
 meet-0.077
 rebuild-0.077
 fix-0.075
 build-0.075
Token named
Token ID3706
Feature activation+3.99e-02

Pos logit contributions
ww+0.004
 sudden+0.004
 brightly+0.003
 problem+0.003
ls+0.003
Neg logit contributions
iece-0.005
 rebuild-0.005
keeping-0.005
ws-0.005
talk-0.005
Token Sue
Token ID26089
Feature activation+5.74e-02

Pos logit contributions
 rebuild+0.085
keeping+0.083
iece+0.082
talk+0.080
ws+0.079
Neg logit contributions
ww-0.067
 brightly-0.066
 sudden-0.062
 problem-0.061
 lot-0.059
Token had
Token ID550
Feature activation-4.49e-04

Pos logit contributions
 rebuild+0.126
iece+0.126
keeping+0.123
race+0.118
 supervision+0.118
Neg logit contributions
ww-0.087
 brightly-0.082
 problem-0.080
 sudden-0.079
ls-0.079
Token a
Token ID257
Feature activation-4.38e-02

Pos logit contributions
 brightly+0.001
ww+0.001
 sudden+0.001
 problem+0.001
ls+0.001
Neg logit contributions
 rebuild-0.001
keeping-0.001
talk-0.001
 find-0.001
iece-0.001
Token her
Token ID607
Feature activation+3.00e-02

Pos logit contributions
ws+0.187
talk+0.179
iece+0.177
keeping+0.175
wed+0.168
Neg logit contributions
 brightly-0.166
 sudden-0.152
 lot-0.146
 cute-0.142
 problem-0.137
Token.
Token ID13
Feature activation+3.17e-02

Pos logit contributions
 rebuild+0.054
keeping+0.053
talk+0.051
ws+0.051
iece+0.050
Neg logit contributions
 brightly-0.060
 sudden-0.058
 problem-0.057
ww-0.056
ls-0.053
Token Then
Token ID3244
Feature activation-4.23e-02

Pos logit contributions
keeping+0.048
 rebuild+0.048
 meet+0.047
 find+0.046
 buy+0.046
Neg logit contributions
ww-0.070
 problem-0.068
 brightly-0.067
 sudden-0.066
ls-0.066
Token,
Token ID11
Feature activation-7.70e-02

Pos logit contributions
ww+0.062
 brightly+0.061
opus+0.060
 sudden+0.060
uddle+0.058
Neg logit contributions
 rebuild-0.091
 find-0.090
 save-0.089
keeping-0.089
 meet-0.089
Token a
Token ID257
Feature activation-4.25e-02

Pos logit contributions
ww+0.143
 problem+0.138
 brightly+0.135
 sudden+0.135
opus+0.134
Neg logit contributions
 rebuild-0.142
keeping-0.138
 find-0.137
 meet-0.135
race-0.135
Token bright
Token ID6016
Feature activation-1.04e-01

Pos logit contributions
ls+0.067
ww+0.066
 brightly+0.063
ummy+0.062
ement+0.060
Neg logit contributions
 rebuild-0.086
 find-0.085
 meet-0.084
 greet-0.083
 save-0.083
Token light
Token ID1657
Feature activation-6.93e-02

Pos logit contributions
ls+0.182
ww+0.179
 sudden+0.171
opus+0.164
uddle+0.160
Neg logit contributions
 find-0.197
 rebuild-0.196
 build-0.192
 greet-0.191
 cheer-0.191
Token shine
Token ID18340
Feature activation-4.96e-02

Pos logit contributions
ww+0.128
 brightly+0.124
 sudden+0.121
 problem+0.120
ls+0.114
Neg logit contributions
 rebuild-0.131
keeping-0.126
iece-0.123
talk-0.122
 find-0.122
Token through
Token ID832
Feature activation+3.73e-02

Pos logit contributions
ww+0.086
 sudden+0.084
 brightly+0.083
 problem+0.081
ls+0.076
Neg logit contributions
 rebuild-0.100
keeping-0.097
iece-0.095
talk-0.093
 find-0.093
Token the
Token ID262
Feature activation+2.32e-02

Pos logit contributions
 rebuild+0.054
keeping+0.053
iece+0.049
 find+0.049
talk+0.049
Neg logit contributions
ww-0.086
 brightly-0.085
 problem-0.083
 sudden-0.083
ls-0.079
Token trees
Token ID7150
Feature activation+2.73e-02

Pos logit contributions
 rebuild+0.046
keeping+0.045
 find+0.044
 meet+0.044
iece+0.044
Neg logit contributions
ww-0.041
 brightly-0.040
 sudden-0.039
 problem-0.038
ls-0.037
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
 rebuild+0.060
keeping+0.060
talk+0.057
iece+0.056
ws+0.056
Neg logit contributions
 brightly-0.059
ww-0.057
 sudden-0.056
 problem-0.055
ls-0.051
Token would
Token ID561
Feature activation+1.89e-02

Pos logit contributions
ww+0.021
 brightly+0.021
 sudden+0.020
 problem+0.020
ummy+0.019
Neg logit contributions
 rebuild-0.018
 find-0.017
 build-0.017
 meet-0.017
keeping-0.016
Token roll
Token ID4836
Feature activation+3.92e-02

Pos logit contributions
keeping+0.026
 rebuild+0.025
iece+0.024
talk+0.024
race+0.023
Neg logit contributions
 brightly-0.046
ww-0.044
 sudden-0.044
 problem-0.044
 lot-0.041
Token and
Token ID290
Feature activation-3.12e-02

Pos logit contributions
 rebuild+0.063
keeping+0.063
 find+0.061
talk+0.060
iece+0.059
Neg logit contributions
ww-0.082
 sudden-0.082
 problem-0.080
 brightly-0.080
opus-0.077
Token roll
Token ID4836
Feature activation+3.60e-02

Pos logit contributions
 brightly+0.064
 sudden+0.062
ww+0.062
 problem+0.060
ls+0.059
Neg logit contributions
keeping-0.054
 rebuild-0.050
talk-0.050
iece-0.049
ws-0.047
Token and
Token ID290
Feature activation-3.58e-02

Pos logit contributions
 rebuild+0.063
keeping+0.062
 find+0.060
talk+0.059
iece+0.059
Neg logit contributions
ww-0.071
 sudden-0.070
 brightly-0.070
 problem-0.069
opus-0.066
Token roll
Token ID4836
Feature activation+3.45e-02

Pos logit contributions
 brightly+0.072
 sudden+0.070
ww+0.069
 problem+0.068
ls+0.067
Neg logit contributions
keeping-0.063
talk-0.059
 rebuild-0.058
iece-0.057
ws-0.055
Token.
Token ID13
Feature activation+3.73e-02

Pos logit contributions
 rebuild+0.060
keeping+0.059
talk+0.056
 find+0.056
iece+0.056
Neg logit contributions
ww-0.069
 brightly-0.068
 sudden-0.068
 problem-0.067
opus-0.063
Token Everywhere
Token ID48201
Feature activation-4.99e-02

Pos logit contributions
keeping+0.076
 rebuild+0.076
talk+0.071
iece+0.071
ws+0.070
Neg logit contributions
 brightly-0.065
 sudden-0.062
ww-0.061
 problem-0.058
ummy-0.055
Token he
Token ID339
Feature activation-1.83e-03

Pos logit contributions
ww+0.085
 sudden+0.083
 brightly+0.082
 problem+0.079
uddle+0.077
Neg logit contributions
 rebuild-0.099
 find-0.096
 build-0.096
keeping-0.094
 meet-0.093
Token went
Token ID1816
Feature activation+6.20e-02

Pos logit contributions
 brightly+0.004
ww+0.004
 sudden+0.004
 problem+0.004
ls+0.004
Neg logit contributions
 rebuild-0.003
keeping-0.003
talk-0.003
 find-0.003
iece-0.003
Token a
Token ID257
Feature activation-2.00e-02

Pos logit contributions
 rebuild+0.096
keeping+0.094
 find+0.091
talk+0.091
 meet+0.090
Neg logit contributions
 brightly-0.059
ww-0.058
 sudden-0.057
 problem-0.056
ls-0.053
Token good
Token ID922
Feature activation-8.18e-02

Pos logit contributions
 brightly+0.026
ls+0.025
 sudden+0.024
ww+0.024
ement+0.024
Neg logit contributions
 rebuild-0.049
 find-0.049
 meet-0.048
 save-0.047
keeping-0.046
Token friend
Token ID1545
Feature activation+6.50e-02

Pos logit contributions
ww+0.120
 problem+0.120
 brightly+0.118
 sudden+0.111
ls+0.101
Neg logit contributions
 rebuild-0.191
keeping-0.185
iece-0.183
talk-0.178
ws-0.177
Token.
Token ID13
Feature activation+4.48e-02

Pos logit contributions
 rebuild+0.125
keeping+0.120
iece+0.117
talk+0.116
race+0.115
Neg logit contributions
 brightly-0.120
ww-0.119
 problem-0.116
 sudden-0.112
ls-0.108
Token You
Token ID921
Feature activation+2.80e-02

Pos logit contributions
 rebuild+0.093
keeping+0.089
iece+0.088
talk+0.087
ws+0.085
Neg logit contributions
ww-0.079
 brightly-0.077
 sudden-0.070
 importantly-0.069
 problem-0.069
Token did
Token ID750
Feature activation-2.57e-02

Pos logit contributions
keeping+0.040
iece+0.038
 rebuild+0.035
race+0.035
talk+0.035
Neg logit contributions
 problem-0.066
ww-0.065
 brightly-0.064
 sudden-0.061
 lot-0.061
Token the
Token ID262
Feature activation+5.85e-02

Pos logit contributions
 brightly+0.051
ww+0.051
 sudden+0.051
opus+0.048
 problem+0.048
Neg logit contributions
 rebuild-0.047
 find-0.046
 meet-0.044
keeping-0.043
 build-0.043
Token right
Token ID826
Feature activation-4.09e-03

Pos logit contributions
keeping+0.110
 rebuild+0.108
talk+0.105
ws+0.100
race+0.100
Neg logit contributions
 problem-0.114
 brightly-0.111
ww-0.109
 sudden-0.102
 lot-0.099
Token thing
Token ID1517
Feature activation-2.75e-02

Pos logit contributions
 problem+0.005
ww+0.005
 brightly+0.005
 sudden+0.004
ls+0.004
Neg logit contributions
 rebuild-0.011
race-0.010
iece-0.010
keeping-0.010
talk-0.010
Token."
Token ID526
Feature activation+5.29e-02

Pos logit contributions
 brightly+0.056
ww+0.055
 sudden+0.054
 problem+0.053
ls+0.050
Neg logit contributions
 rebuild-0.047
keeping-0.047
talk-0.044
iece-0.044
 find-0.044
Token↵
Token ID198
Feature activation-2.12e-02

Pos logit contributions
 rebuild+0.101
keeping+0.099
talk+0.095
iece+0.095
 find+0.094
Neg logit contributions
 brightly-0.098
ww-0.097
 sudden-0.095
 problem-0.094
ls-0.088
Token<|endoftext|>
Token ID50256
Feature activation+4.96e-02

Pos logit contributions
 rebuild+0.070
keeping+0.069
iece+0.066
talk+0.066
 find+0.065
Neg logit contributions
 brightly-0.067
ww-0.067
 sudden-0.065
 problem-0.064
ls-0.060
TokenJohn
Token ID7554
Feature activation+3.14e-02

Pos logit contributions
 rebuild+0.123
ws+0.115
iece+0.115
keeping+0.115
talk+0.115
Neg logit contributions
 brightly-0.065
ww-0.061
 sudden-0.060
 problem-0.059
ls-0.058
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
 rebuild+0.066
keeping+0.065
iece+0.063
talk+0.062
race+0.062
Neg logit contributions
 brightly-0.051
ww-0.050
 problem-0.049
 sudden-0.049
ls-0.045
Token Jay
Token ID9180
Feature activation+3.94e-03

Pos logit contributions
ww+0.047
 brightly+0.046
 sudden+0.043
 problem+0.042
 cute+0.039
Neg logit contributions
 rebuild-0.057
ws-0.056
iece-0.055
keeping-0.054
talk-0.053
Token were
Token ID547
Feature activation-5.47e-02

Pos logit contributions
keeping+0.008
 rebuild+0.008
iece+0.008
race+0.007
talk+0.007
Neg logit contributions
 brightly-0.007
ww-0.007
 problem-0.007
 sudden-0.006
ls-0.006
Token very
Token ID845
Feature activation-1.89e-01

Pos logit contributions
ww+0.098
 brightly+0.095
 sudden+0.089
opus+0.088
 problem+0.087
Neg logit contributions
 find-0.109
 rebuild-0.108
 meet-0.106
 build-0.105
race-0.104
Token good
Token ID922
Feature activation-9.46e-02

Pos logit contributions
ww+0.343
 problem+0.301
opus+0.300
ls+0.299
 brightly+0.298
Neg logit contributions
 find-0.388
 meet-0.376
 rebuild-0.373
 join-0.369
 build-0.367
Token friends
Token ID2460
Feature activation+7.06e-02

Pos logit contributions
ww+0.139
 sudden+0.129
 brightly+0.128
 problem+0.123
 brand+0.117
Neg logit contributions
keeping-0.212
 rebuild-0.212
talk-0.209
race-0.207
 find-0.207
Token.
Token ID13
Feature activation+3.10e-02

Pos logit contributions
 rebuild+0.133
keeping+0.129
ws+0.126
talk+0.124
 build+0.123
Neg logit contributions
 brightly-0.135
 sudden-0.128
ww-0.127
 problem-0.126
ls-0.118
Token They
Token ID1119
Feature activation+1.53e-02

Pos logit contributions
 rebuild+0.055
keeping+0.054
iece+0.051
talk+0.050
race+0.050
Neg logit contributions
 brightly-0.063
ww-0.061
 sudden-0.060
 problem-0.060
ls-0.056
Token were
Token ID547
Feature activation-6.15e-02

Pos logit contributions
keeping+0.030
 rebuild+0.030
iece+0.029
talk+0.028
race+0.028
Neg logit contributions
 brightly-0.028
ww-0.027
 sudden-0.026
 problem-0.026
ls-0.024
Token↵
Token ID198
Feature activation-3.20e-02

Pos logit contributions
 brightly+0.049
 sudden+0.049
ww+0.048
 problem+0.044
ls+0.043
Neg logit contributions
 rebuild-0.063
keeping-0.062
iece-0.061
talk-0.060
ws-0.060
TokenThe
Token ID464
Feature activation-3.04e-03

Pos logit contributions
 brightly+0.065
ww+0.064
 sudden+0.063
 problem+0.061
ls+0.058
Neg logit contributions
 rebuild-0.056
keeping-0.055
talk-0.052
iece-0.052
 find-0.052
Token little
Token ID1310
Feature activation-6.06e-02

Pos logit contributions
ww+0.004
 brightly+0.004
 sudden+0.004
ls+0.004
 problem+0.004
Neg logit contributions
 rebuild-0.007
 find-0.007
 meet-0.007
keeping-0.007
 build-0.007
Token girl
Token ID2576
Feature activation+1.24e-02

Pos logit contributions
ww+0.110
 brightly+0.107
 sudden+0.105
 problem+0.102
ls+0.098
Neg logit contributions
 rebuild-0.115
keeping-0.113
 find-0.113
iece-0.109
 meet-0.108
Token was
Token ID373
Feature activation-6.80e-02

Pos logit contributions
 rebuild+0.023
 find+0.022
keeping+0.022
 meet+0.022
 build+0.021
Neg logit contributions
ww-0.024
 brightly-0.023
 problem-0.023
 sudden-0.022
ls-0.021
Token very
Token ID845
Feature activation-1.98e-01

Pos logit contributions
ww+0.088
 brightly+0.086
 problem+0.082
 sudden+0.081
ls+0.079
Neg logit contributions
 rebuild-0.168
keeping-0.164
 find-0.163
 meet-0.162
iece-0.159
Token happy
Token ID3772
Feature activation-1.15e-01

Pos logit contributions
ww+0.313
 problem+0.295
 brightly+0.294
 sudden+0.286
opus+0.281
Neg logit contributions
 rebuild-0.434
 find-0.431
 meet-0.417
 join-0.409
 build-0.405
Token that
Token ID326
Feature activation-4.21e-03

Pos logit contributions
ww+0.238
 sudden+0.222
 problem+0.217
 brightly+0.216
opus+0.214
Neg logit contributions
keeping-0.198
 find-0.196
 rebuild-0.196
 meet-0.184
 join-0.184
Token she
Token ID673
Feature activation-2.02e-02

Pos logit contributions
 brightly+0.006
ww+0.006
 sudden+0.006
 problem+0.005
ls+0.005
Neg logit contributions
 rebuild-0.010
keeping-0.010
 find-0.010
iece-0.010
talk-0.009
Token met
Token ID1138
Feature activation+7.25e-02

Pos logit contributions
 brightly+0.038
ww+0.038
 sudden+0.037
 problem+0.037
ls+0.035
Neg logit contributions
 rebuild-0.038
keeping-0.037
 find-0.035
talk-0.035
iece-0.035
Token the
Token ID262
Feature activation+4.28e-02

Pos logit contributions
 rebuild+0.139
keeping+0.138
talk+0.135
ws+0.134
race+0.130
Neg logit contributions
 brightly-0.138
 sudden-0.127
ww-0.125
 problem-0.125
ls-0.116
Token warm
Token ID5814
Feature activation-6.00e-02

Pos logit contributions
 find+0.105
 meet+0.101
 rebuild+0.099
keeping+0.098
 buy+0.098
Neg logit contributions
ww-0.054
 brightly-0.049
ls-0.049
 sudden-0.044
 problem-0.043
Token l
Token ID300
Feature activation-2.82e-02

Pos logit contributions
 brightly+0.085
ww+0.085
 sudden+0.084
 problem+0.082
ls+0.078
Neg logit contributions
keeping-0.135
 rebuild-0.133
 find-0.132
 meet-0.132
iece-0.128
Tokenily
Token ID813
Feature activation+5.99e-02

Pos logit contributions
 brightly+0.064
ww+0.064
 sudden+0.062
 problem+0.062
ls+0.060
Neg logit contributions
 rebuild-0.043
keeping-0.041
talk-0.039
 build-0.038
 find-0.038
Token became
Token ID2627
Feature activation-4.18e-02

Pos logit contributions
 rebuild+0.117
keeping+0.115
talk+0.110
iece+0.109
 find+0.109
Neg logit contributions
 brightly-0.108
ww-0.107
 sudden-0.104
 problem-0.103
ls-0.097
Token a
Token ID257
Feature activation-6.25e-02

Pos logit contributions
ww+0.082
 brightly+0.081
 sudden+0.079
 problem+0.079
ls+0.076
Neg logit contributions
 rebuild-0.074
keeping-0.072
 find-0.070
iece-0.069
 build-0.068
Token happy
Token ID3772
Feature activation-1.23e-01

Pos logit contributions
ls+0.096
ement+0.082
ww+0.080
ittens+0.078
opus+0.076
Neg logit contributions
 find-0.145
 join-0.140
 rebuild-0.139
 meet-0.139
 save-0.138
Token reminder
Token ID15438
Feature activation-3.89e-02

Pos logit contributions
ww+0.208
ls+0.205
 brightly+0.203
 sudden+0.200
 problem+0.192
Neg logit contributions
 find-0.242
keeping-0.240
 rebuild-0.239
 meet-0.236
 join-0.228
Token of
Token ID286
Feature activation+8.77e-02

Pos logit contributions
 brightly+0.064
ww+0.064
 sudden+0.063
 problem+0.061
ls+0.058
Neg logit contributions
 rebuild-0.081
keeping-0.080
 find-0.077
talk-0.077
iece-0.076
Token the
Token ID262
Feature activation+4.39e-02

Pos logit contributions
 rebuild+0.142
keeping+0.137
 find+0.132
iece+0.129
 meet+0.129
Neg logit contributions
ww-0.189
 brightly-0.186
 sudden-0.183
 problem-0.181
ls-0.174
Token special
Token ID2041
Feature activation-5.89e-02

Pos logit contributions
 find+0.102
 meet+0.101
 greet+0.096
 join+0.096
 save+0.095
Neg logit contributions
ww-0.059
ls-0.057
 brightly-0.056
 sudden-0.052
opus-0.049
Token family
Token ID1641
Feature activation+5.35e-02

Pos logit contributions
 brightly+0.101
 sudden+0.101
ww+0.098
ls+0.097
 problem+0.092
Neg logit contributions
 find-0.114
 meet-0.113
 join-0.109
 greet-0.109
keeping-0.108
Token reached
Token ID4251
Feature activation+4.62e-02

Pos logit contributions
 brightly+0.024
ww+0.023
 sudden+0.022
 problem+0.022
ls+0.021
Neg logit contributions
 rebuild-0.016
 find-0.016
 meet-0.016
 build-0.015
keeping-0.015
Token it
Token ID340
Feature activation-1.95e-02

Pos logit contributions
keeping+0.081
 rebuild+0.080
talk+0.077
iece+0.074
ws+0.074
Neg logit contributions
 brightly-0.093
 problem-0.091
ww-0.089
 sudden-0.088
ls-0.086
Token,
Token ID11
Feature activation-7.79e-02

Pos logit contributions
ww+0.029
 sudden+0.028
 brightly+0.028
 problem+0.026
opus+0.025
Neg logit contributions
 rebuild-0.045
 find-0.044
iece-0.043
 build-0.043
keeping-0.043
Token she
Token ID673
Feature activation-2.10e-02

Pos logit contributions
 brightly+0.106
ww+0.103
 sudden+0.101
 problem+0.098
ls+0.091
Neg logit contributions
 rebuild-0.188
keeping-0.185
talk-0.180
iece-0.178
ws-0.177
Token was
Token ID373
Feature activation-6.41e-02

Pos logit contributions
ww+0.044
 brightly+0.043
 problem+0.041
 sudden+0.041
ls+0.039
Neg logit contributions
 rebuild-0.036
 find-0.035
keeping-0.034
 meet-0.034
 build-0.033
Token so
Token ID523
Feature activation-1.64e-01

Pos logit contributions
 brightly+0.081
ww+0.080
 sudden+0.077
 problem+0.075
ls+0.069
Neg logit contributions
 rebuild-0.160
keeping-0.158
iece-0.153
 find-0.153
talk-0.152
Token excited
Token ID6568
Feature activation-1.10e-01

Pos logit contributions
ww+0.243
 problem+0.236
 brightly+0.233
 sudden+0.231
opus+0.219
Neg logit contributions
 rebuild-0.371
 find-0.363
 meet-0.358
keeping-0.354
 build-0.349
Token that
Token ID326
Feature activation-3.88e-03

Pos logit contributions
ww+0.222
 sudden+0.220
 brightly+0.216
 problem+0.209
opus+0.202
Neg logit contributions
keeping-0.190
 rebuild-0.183
 find-0.182
 meet-0.176
iece-0.175
Token she
Token ID673
Feature activation-1.70e-02

Pos logit contributions
 brightly+0.005
ww+0.005
 sudden+0.005
 problem+0.004
ls+0.004
Neg logit contributions
 rebuild-0.010
keeping-0.010
talk-0.009
iece-0.009
 find-0.009
Token started
Token ID2067
Feature activation+3.26e-02

Pos logit contributions
ww+0.034
 brightly+0.034
 problem+0.033
 sudden+0.033
ummy+0.032
Neg logit contributions
 rebuild-0.029
 find-0.029
keeping-0.028
 meet-0.028
 build-0.027
Token to
Token ID284
Feature activation+8.66e-02

Pos logit contributions
 rebuild+0.061
keeping+0.060
talk+0.058
iece+0.057
 find+0.057
Neg logit contributions
 brightly-0.061
ww-0.061
 sudden-0.059
 problem-0.059
ls-0.055
Token up
Token ID510
Feature activation+8.32e-03

Pos logit contributions
ww+0.021
 sudden+0.021
 brightly+0.021
 problem+0.020
opus+0.019
Neg logit contributions
 rebuild-0.019
 find-0.019
 build-0.018
iece-0.018
keeping-0.018
Token,
Token ID11
Feature activation-7.68e-02

Pos logit contributions
 rebuild+0.018
 find+0.018
keeping+0.018
 build+0.018
iece+0.018
Neg logit contributions
ww-0.013
 sudden-0.012
 brightly-0.012
 problem-0.012
opus-0.011
Token he
Token ID339
Feature activation-2.25e-02

Pos logit contributions
 brightly+0.108
ww+0.107
 sudden+0.103
 problem+0.101
ls+0.094
Neg logit contributions
 rebuild-0.180
keeping-0.178
iece-0.171
talk-0.171
 find-0.171
Token felt
Token ID2936
Feature activation-5.00e-02

Pos logit contributions
ww+0.057
 brightly+0.053
ummy+0.052
 problem+0.052
 importantly+0.051
Neg logit contributions
 find-0.031
 rebuild-0.030
 build-0.028
 meet-0.027
talk-0.027
Token a
Token ID257
Feature activation-6.36e-02

Pos logit contributions
 brightly+0.083
 sudden+0.079
ww+0.079
 problem+0.077
ls+0.072
Neg logit contributions
keeping-0.107
 rebuild-0.106
talk-0.103
iece-0.100
race-0.100
Token strange
Token ID6283
Feature activation-1.22e-01

Pos logit contributions
ww+0.099
ls+0.095
 brightly+0.093
 problem+0.092
 sudden+0.087
Neg logit contributions
 find-0.135
 rebuild-0.135
 save-0.129
 build-0.127
 meet-0.127
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
ww+0.242
 sudden+0.235
 brightly+0.234
ls+0.228
opus+0.219
Neg logit contributions
 rebuild-0.209
 find-0.209
keeping-0.206
iece-0.201
 build-0.200
Token t
Token ID256
Feature activation-1.01e-01

Pos logit contributions
ww+0.155
 brightly+0.148
 sudden+0.145
 problem+0.144
ls+0.136
Neg logit contributions
 rebuild-0.245
keeping-0.241
 find-0.238
iece-0.234
 build-0.233
Tokeningly
Token ID4420
Feature activation-5.47e-02

Pos logit contributions
 brightly+0.165
ww+0.163
 sudden+0.159
 problem+0.156
ls+0.146
Neg logit contributions
 rebuild-0.215
keeping-0.211
talk-0.202
iece-0.202
 find-0.202
Token feeling
Token ID4203
Feature activation-6.02e-02

Pos logit contributions
 brightly+0.100
ww+0.098
 sudden+0.096
 problem+0.095
ls+0.089
Neg logit contributions
 rebuild-0.106
keeping-0.104
talk-0.100
iece-0.099
race-0.098
Token and
Token ID290
Feature activation-7.04e-02

Pos logit contributions
 brightly+0.104
 sudden+0.099
ww+0.099
 problem+0.097
ls+0.091
Neg logit contributions
 rebuild-0.124
keeping-0.124
talk-0.120
race-0.117
 meet-0.116