TOP ACTIVATIONS
MAX = 0.160

When Daisy put the magn ifying glass down , the cater
her mom my watched in amaz ement as the machine flew
But I 'm a bit disturbed by all the noise in
then it was time to eat . They both ate the
She couldn 't wait to eat the cookies she had made
. Maybe he wants to eat us ." Ben said .
out a big blanket and ate cup cakes and sandwiches .
of juicy red strawberries to eat . Lily was so happy
was too fast and it ate the mouse in one bite
It looks like someone 's blocking the way . Come outside
said it was time to eat . They went to a
. I 'm sorry we disturbed the wild animal ."
shrimp that were ready to eat . What a wonderful day
is full of things to explore and learn . You can
girl was very happy and ate her hot dog . From
snow flew everywhere and it covered him from head to toe
until it was time to eat again . The gray whale
the batter and pour it onto the pan . She watched
years old and loved to explore new things . One day
years old and loved to explore and make new friends .

MIN ACTIVATIONS
MIN = -0.134

from the tank . Hel io was thankful and thanked T
for hours . Whenever L ola was feeling sad , the
fashion . Every day when Hannah got dressed , her mum
't split again ! Whenever Sally was writing in the notebook
As P ingu was resting , a group
very enthusiastic . Every time Amelia got an answer right her
it was Spanish . J ax asked them to teach him
Whenever M andy went out , she would
special fairy . Whenever the girl wore the necklace , she
ummy . Whenever Lynn ate her printed food ,
as Hel io . Hel io explained that their oxygen was
special thing . Even though Sam still didn 't like dancing
jungle . As Hannah walked further and further away
. As Holly 's mother walked away ,
noticed something strange . Whenever Molly said the word " y
shore . Tr acey was relieved . Next time
recording very tidy . Whenever Joe felt sad he would watch
After that day , whenever Jenny went to the park ,
ble and shake . Tr acey was scared and started to
From then on , when Gus was in the forest ,

INTERVAL 0.087 - 0.160
CONTAINS 0.156%

fell off the stove and spilled soup on the floor .
Do you want to jump in it with me ?" Anna
time at the golden horn on the top of the carriage
. When he went to eat the food in the pan
picture and they hung it on their tree house wall .

INTERVAL 0.013 - 0.087
CONTAINS 37.762%

. <|endoftext|> Once upon a time there was a Mom ,
succeeded , that was so brave of you !"
. Her dad smiled and started pushing her around the park
words and pictures . Suddenly , Lily heard a
, there was a hunter named Tim . Tim loved to

INTERVAL -0.060 - 0.013
CONTAINS 54.225%

. But today the park is closed . There is a
. " Mom my , look !" Lily shouted . "
could make happy music . But when he blew into it
he saw a rabbit in the garden . The
They asked the boy , " Please , can we play

INTERVAL -0.134 - -0.060
CONTAINS 7.857%

have some milk ?" His mom said , " Sure ,
road trip ." They were having fun , but
But it did not . It ate Lily 's hand .
to nap . Every morning he woke up and he thanked
into it , and slowly he made it to the top
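
The four INTERVAL blocks above look like equal-width bins spanning the reported MIN and MAX, and the CONTAINS shares look like they partition all sampled tokens. This is an inference from the printed numbers, not something the dashboard states. The sketch below checks both readings; `bin_edges`, `N_BINS`, and `shares` are illustrative names introduced here, not part of the original tool.

```python
# Reported extremes, copied from the MAX and MIN header lines.
ACT_MIN = -0.134
ACT_MAX = 0.160
N_BINS = 4  # assumption: the four INTERVAL blocks are equal-width bins


def bin_edges(lo, hi, n):
    """Return n + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n
    return [lo + i * width for i in range(n + 1)]


edges = bin_edges(ACT_MIN, ACT_MAX, N_BINS)

# Shares listed under each CONTAINS line, highest interval first.
shares = [0.156, 37.762, 54.225, 7.857]

# Print each computed bin so it can be compared with the INTERVAL lines.
for lo, hi in zip(edges[:-1], edges[1:]):
    print(f"{lo:+.4f} to {hi:+.4f}")
print(f"total share: {sum(shares):.3f}%")
```

Under this reading each bin is 0.0735 wide, and the computed edges agree with the printed interval boundaries to within the three-decimal rounding used in the dump. The shares sum to 100%, which is consistent with the bins covering every sampled token exactly once.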
Token When
Token ID1649
Feature activation+5.85e-02

Pos logit contributions
ummy+0.064
 ourselves+0.062
ach+0.059
 lunch+0.058
ings+0.058
Neg logit contributions
 forgot-0.124
'd-0.113
 remembered-0.105
 regretted-0.103
 missed-0.102
Token Daisy
Token ID40355
Feature activation-6.16e-02

Pos logit contributions
ummy+0.062
 ourselves+0.060
 lunch+0.057
ach+0.057
ings+0.055
Neg logit contributions
 forgot-0.150
'd-0.137
 remembered-0.128
 regretted-0.126
 missed-0.125
Token put
Token ID1234
Feature activation+5.52e-02

Pos logit contributions
 forgot+0.093
'd+0.081
 cared+0.075
 regretted+0.075
 sped+0.072
Neg logit contributions
,-0.116
 the-0.113
?-0.112
 up-0.110
 that-0.110
Token the
Token ID262
Feature activation+2.81e-02

Pos logit contributions
ummy+0.057
ach+0.055
 ourselves+0.052
ings+0.044
time+0.043
Neg logit contributions
 forgot-0.139
'd-0.128
 remembered-0.126
 knew-0.121
 struggled-0.120
Token magn
Token ID7842
Feature activation+3.25e-02

Pos logit contributions
ummy+0.039
 ourselves+0.038
ach+0.037
 lunch+0.037
ings+0.036
Neg logit contributions
 forgot-0.063
'd-0.056
 remembered-0.052
 missed-0.051
 regretted-0.051
Tokenifying
Token ID4035
Feature activation+1.13e-01

Pos logit contributions
 ourselves+0.051
ummy+0.051
ach+0.049
 lunch+0.049
ings+0.048
Neg logit contributions
 forgot-0.064
'd-0.059
 remembered-0.053
 regretted-0.051
 missed-0.051
Token glass
Token ID5405
Feature activation+4.68e-02

Pos logit contributions
ach+0.178
 ourselves+0.171
ummy+0.162
ires+0.162
iggle+0.161
Neg logit contributions
 forgot-0.199
'd-0.173
 bit-0.170
 remembered-0.166
 missed-0.163
Token down
Token ID866
Feature activation+5.74e-02

Pos logit contributions
 ourselves+0.036
ummy+0.036
ach+0.035
 lunch+0.030
ings+0.030
Neg logit contributions
 forgot-0.131
'd-0.120
 remembered-0.113
 cared-0.111
 regretted-0.111
Token,
Token ID11
Feature activation+4.08e-02

Pos logit contributions
ach+0.043
ummy+0.042
 ourselves+0.041
 lunch+0.034
ings+0.034
Neg logit contributions
 forgot-0.161
'd-0.148
 remembered-0.141
 knew-0.138
 hadn-0.137
Token the
Token ID262
Feature activation+2.82e-02

Pos logit contributions
ummy+0.058
ach+0.057
ings+0.053
 ourselves+0.053
ags+0.049
Neg logit contributions
 forgot-0.095
'd-0.083
 remembered-0.080
 knew-0.077
 missed-0.076
Token cater
Token ID20825
Feature activation+2.34e-03

Pos logit contributions
ummy+0.039
 lunch+0.038
 ourselves+0.037
 something+0.037
ach+0.036
Neg logit contributions
 forgot-0.063
'd-0.057
 remembered-0.052
 missed-0.052
 regretted-0.051
Token her
Token ID607
Feature activation+5.46e-02

Pos logit contributions
 forgot+0.076
'd+0.075
 sped+0.068
 hadn+0.068
 regretted+0.065
Neg logit contributions
 something-0.052
 the-0.052
 with-0.049
 that-0.049
 her-0.049
Token mom
Token ID1995
Feature activation-1.17e-04

Pos logit contributions
ummy+0.072
 ourselves+0.070
 lunch+0.067
ach+0.067
ings+0.066
Neg logit contributions
 forgot-0.126
'd-0.114
 remembered-0.105
 regretted-0.103
 missed-0.103
Tokenmy
Token ID1820
Feature activation-4.51e-03

Pos logit contributions
 forgot+0.000
'd+0.000
 sped+0.000
 struggled+0.000
 regretted+0.000
Neg logit contributions
ummy-0.000
 lunch-0.000
?-0.000
 ourselves-0.000
 outside-0.000
Token watched
Token ID7342
Feature activation+9.68e-02

Pos logit contributions
 forgot+0.010
'd+0.009
 regretted+0.008
 sped+0.008
 struggled+0.008
Neg logit contributions
ummy-0.006
 ourselves-0.006
 up-0.006
 lunch-0.006
 outside-0.006
Token in
Token ID287
Feature activation+5.81e-02

Pos logit contributions
ings+0.080
 ourselves+0.079
uggle+0.079
ummy+0.078
ach+0.074
Neg logit contributions
 forgot-0.235
'd-0.215
 knew-0.208
 hadn-0.205
 switched-0.204
Token amaz
Token ID40642
Feature activation+1.12e-01

Pos logit contributions
ummy+0.054
ach+0.049
 ourselves+0.048
ings+0.045
uggle+0.044
Neg logit contributions
 forgot-0.151
'd-0.139
 remembered-0.135
 cared-0.133
 knew-0.131
Tokenement
Token ID972
Feature activation+6.57e-02

Pos logit contributions
 ourselves+0.187
uggle+0.183
 lunch+0.176
ings+0.175
ble+0.165
Neg logit contributions
 forgot-0.168
'd-0.165
 cared-0.155
 regretted-0.151
 remembered-0.149
Token as
Token ID355
Feature activation+6.35e-02

Pos logit contributions
ummy+0.046
 ourselves+0.046
ach+0.045
uggle+0.044
ings+0.044
Neg logit contributions
 forgot-0.180
'd-0.163
 remembered-0.154
 knew-0.152
 regretted-0.149
Token the
Token ID262
Feature activation+3.13e-02

Pos logit contributions
ummy+0.056
 ourselves+0.053
ach+0.051
ings+0.049
 lunch+0.048
Neg logit contributions
 forgot-0.174
'd-0.160
 remembered-0.151
 regretted-0.148
 missed-0.147
Token machine
Token ID4572
Feature activation-1.24e-02

Pos logit contributions
ummy+0.041
 lunch+0.040
 something+0.038
 ourselves+0.037
time+0.037
Neg logit contributions
 forgot-0.074
'd-0.066
 remembered-0.062
 missed-0.061
 struggled-0.060
Token flew
Token ID13112
Feature activation+5.78e-02

Pos logit contributions
 forgot+0.023
'd+0.019
 regretted+0.019
 remembered+0.019
 missed+0.018
Neg logit contributions
 that-0.021
 outside-0.020
 up-0.020
 something-0.020
time-0.020
TokenBut
Token ID1537
Feature activation+6.23e-02

Pos logit contributions
 forgot+0.101
'd+0.095
 remembered+0.091
 cared+0.089
 regretted+0.089
Neg logit contributions
ummy-0.009
 ourselves-0.007
 lunch-0.007
ings-0.006
ach-0.006
Token I
Token ID314
Feature activation+1.95e-03

Pos logit contributions
ummy+0.063
 ourselves+0.060
ach+0.057
ings+0.056
 lunch+0.055
Neg logit contributions
 forgot-0.164
'd-0.149
 remembered-0.141
 regretted-0.138
 missed-0.137
Token'm
Token ID1101
Feature activation+3.44e-02

Pos logit contributions
?+0.003
?"+0.003
 something+0.003
?'+0.003
,"+0.003
Neg logit contributions
 forgot-0.004
'd-0.003
 sped-0.003
 regretted-0.003
 switched-0.003
Token a
Token ID257
Feature activation+2.14e-02

Pos logit contributions
ummy+0.050
 ourselves+0.050
 lunch+0.049
 something+0.049
 up+0.047
Neg logit contributions
 forgot-0.072
'd-0.066
 regretted-0.060
 cared-0.060
 remembered-0.059
Token bit
Token ID1643
Feature activation+5.14e-02

Pos logit contributions
 lunch+0.028
 something+0.027
ummy+0.027
 dinner+0.026
time+0.026
Neg logit contributions
 forgot-0.048
ave-0.043
'd-0.042
 counted-0.041
 struggled-0.041
Token disturbed
Token ID24069
Feature activation+1.05e-01

Pos logit contributions
ummy+0.087
 ourselves+0.083
 something+0.082
ach+0.082
 lunch+0.080
Neg logit contributions
 forgot-0.100
'd-0.091
 remembered-0.081
 missed-0.079
 sped-0.079
Token by
Token ID416
Feature activation+6.67e-02

Pos logit contributions
uttering+0.105
uggle+0.098
ings+0.092
ummy+0.090
 lunch+0.089
Neg logit contributions
 forgot-0.226
 cared-0.206
 needed-0.205
'd-0.205
 knew-0.204
Token all
Token ID477
Feature activation+3.89e-02

Pos logit contributions
ummy+0.064
ach+0.059
uggle+0.058
time+0.056
ings+0.055
Neg logit contributions
 forgot-0.169
'd-0.150
 remembered-0.147
 knew-0.144
 missed-0.144
Token the
Token ID262
Feature activation+4.36e-02

Pos logit contributions
ummy+0.030
ach+0.028
 ourselves+0.027
ings+0.026
 lunch+0.026
Neg logit contributions
 forgot-0.111
'd-0.102
 remembered-0.096
 regretted-0.095
 knew-0.095
Token noise
Token ID7838
Feature activation+3.88e-02

Pos logit contributions
ummy+0.058
 lunch+0.055
 ourselves+0.055
ach+0.053
 something+0.052
Neg logit contributions
 forgot-0.102
'd-0.091
 remembered-0.086
 missed-0.084
 cared-0.083
Token in
Token ID287
Feature activation+6.29e-02

Pos logit contributions
ummy+0.019
 ourselves+0.018
ach+0.016
 lunch+0.015
ings+0.015
Neg logit contributions
 forgot-0.121
'd-0.113
 remembered-0.107
 regretted-0.105
 missed-0.105
Token then
Token ID788
Feature activation+2.89e-02

Pos logit contributions
 ourselves+0.030
 lunch+0.029
 something+0.029
ummy+0.028
 up+0.028
Neg logit contributions
 forgot-0.039
'd-0.036
 remembered-0.032
 regretted-0.032
 missed-0.031
Token it
Token ID340
Feature activation-9.01e-03

Pos logit contributions
 something+0.043
 lunch+0.043
 ourselves+0.042
ummy+0.042
 up+0.041
Neg logit contributions
 forgot-0.058
'd-0.053
 regretted-0.047
 cared-0.047
 missed-0.047
Token was
Token ID373
Feature activation+4.88e-02

Pos logit contributions
 forgot+0.012
'd+0.010
 cared+0.010
 regretted+0.010
 sped+0.009
Neg logit contributions
 up-0.018
 something-0.017
.-0.017
 that-0.017
?-0.017
Token time
Token ID640
Feature activation+7.54e-02

Pos logit contributions
ummy+0.056
 ourselves+0.054
ach+0.052
 lunch+0.051
ings+0.051
Neg logit contributions
 forgot-0.121
'd-0.110
 remembered-0.103
 regretted-0.101
 missed-0.100
Token to
Token ID284
Feature activation+5.14e-02

Pos logit contributions
ummy+0.072
 ourselves+0.070
ach+0.069
ings+0.063
 yourselves+0.062
Neg logit contributions
 forgot-0.200
'd-0.187
 remembered-0.173
 knew-0.169
 hadn-0.168
Token eat
Token ID4483
Feature activation+1.04e-01

Pos logit contributions
ummy+0.089
ings+0.081
ach+0.080
 ourselves+0.080
ires+0.078
Neg logit contributions
 forgot-0.103
'd-0.090
 remembered-0.083
 missed-0.081
 switched-0.079
Token.
Token ID13
Feature activation+3.79e-02

Pos logit contributions
uttering+0.100
ach+0.098
uggle+0.098
ummy+0.096
ires+0.090
Neg logit contributions
 forgot-0.245
'd-0.215
 remembered-0.212
 knew-0.212
 needed-0.210
Token They
Token ID1119
Feature activation-4.65e-02

Pos logit contributions
ummy+0.041
 ourselves+0.039
ach+0.037
 lunch+0.036
ings+0.036
Neg logit contributions
 forgot-0.098
'd-0.089
 remembered-0.084
 regretted-0.082
 missed-0.081
Token both
Token ID1111
Feature activation-3.35e-02

Pos logit contributions
 forgot+0.060
'd+0.053
ave+0.053
 sped+0.050
rie+0.047
Neg logit contributions
 the-0.093
 that-0.090
 her-0.089
 lunch-0.088
 him-0.088
Token ate
Token ID15063
Feature activation+8.17e-02

Pos logit contributions
 forgot+0.036
'd+0.032
 sped+0.028
ave+0.026
 remembered+0.026
Neg logit contributions
 the-0.072
.-0.070
 that-0.070
,"-0.068
 something-0.068
Token the
Token ID262
Feature activation+3.37e-02

Pos logit contributions
ummy+0.097
ach+0.095
uggle+0.087
ires+0.086
 yourselves+0.086
Neg logit contributions
 forgot-0.188
 knew-0.162
 remembered-0.162
 missed-0.160
'd-0.159
Token She
Token ID1375
Feature activation-3.23e-02

Pos logit contributions
 forgot+0.071
'd+0.065
 remembered+0.060
 regretted+0.059
 cared+0.058
Neg logit contributions
ummy-0.041
 lunch-0.038
 ourselves-0.038
ach-0.037
ings-0.036
Token couldn
Token ID3521
Feature activation+5.34e-02

Pos logit contributions
 forgot+0.044
'd+0.038
 sped+0.036
ave+0.034
 struggled+0.033
Neg logit contributions
 up-0.063
?"-0.063
 that-0.062
 something-0.062
,"-0.062
Token't
Token ID470
Feature activation+1.77e-02

Pos logit contributions
ummy+0.072
 ourselves+0.070
ach+0.068
 lunch+0.066
ings+0.066
Neg logit contributions
 forgot-0.122
'd-0.110
 remembered-0.102
 regretted-0.099
 missed-0.099
Token wait
Token ID4043
Feature activation+7.38e-02

Pos logit contributions
 ourselves+0.025
ummy+0.025
ach+0.024
 lunch+0.024
 outside+0.023
Neg logit contributions
 forgot-0.038
'd-0.034
 remembered-0.032
 regretted-0.032
 cared-0.031
Token to
Token ID284
Feature activation+4.28e-02

Pos logit contributions
ummy+0.086
 ourselves+0.081
ach+0.081
ings+0.078
uggle+0.073
Neg logit contributions
 forgot-0.189
'd-0.173
 remembered-0.162
 knew-0.156
 missed-0.153
Token eat
Token ID4483
Feature activation+1.04e-01

Pos logit contributions
ummy+0.073
ings+0.067
ach+0.067
 ourselves+0.067
time+0.065
Neg logit contributions
 forgot-0.088
'd-0.076
 remembered-0.071
 missed-0.068
 knew-0.067
Token the
Token ID262
Feature activation+4.54e-02

Pos logit contributions
ummy+0.110
uttering+0.107
ach+0.107
uggle+0.105
ings+0.097
Neg logit contributions
 forgot-0.230
'd-0.201
 remembered-0.198
 had-0.195
 knew-0.194
Token cookies
Token ID14746
Feature activation+2.53e-02

Pos logit contributions
ummy+0.055
 ourselves+0.053
ach+0.051
 lunch+0.050
ings+0.050
Neg logit contributions
 forgot-0.110
'd-0.100
 remembered-0.093
 regretted-0.091
 missed-0.091
Token she
Token ID673
Feature activation-3.81e-02

Pos logit contributions
ummy+0.022
ach+0.022
 ourselves+0.021
ings+0.020
 yourselves+0.019
Neg logit contributions
 forgot-0.070
'd-0.065
 remembered-0.059
 knew-0.057
 managed-0.057
Token had
Token ID550
Feature activation+3.81e-02

Pos logit contributions
 forgot+0.038
 sped+0.035
 regretted+0.031
 switched+0.030
 officer+0.030
Neg logit contributions
?"-0.081
 outside-0.081
?-0.080
.-0.080
,"-0.078
Token made
Token ID925
Feature activation+6.63e-02

Pos logit contributions
 lunch+0.062
ummy+0.061
 ourselves+0.061
 something+0.061
ach+0.059
Neg logit contributions
 forgot-0.071
'd-0.063
 regretted-0.058
 remembered-0.057
 missed-0.057
Token.
Token ID13
Feature activation+5.64e-02

Pos logit contributions
ummy+0.004
 ourselves+0.004
ach+0.004
 yourselves+0.004
 lunch+0.003
Neg logit contributions
 forgot-0.017
'd-0.015
 remembered-0.015
 knew-0.014
 missed-0.014
Token Maybe
Token ID6674
Feature activation+5.13e-02

Pos logit contributions
ummy+0.067
 ourselves+0.064
ach+0.061
 lunch+0.060
ings+0.059
Neg logit contributions
 forgot-0.139
'd-0.126
 remembered-0.118
 regretted-0.116
 missed-0.114
Token he
Token ID339
Feature activation-3.53e-02

Pos logit contributions
ummy+0.061
 ourselves+0.057
ach+0.057
ings+0.057
 lunch+0.052
Neg logit contributions
 forgot-0.127
'd-0.115
 remembered-0.109
 regretted-0.105
 missed-0.105
Token wants
Token ID3382
Feature activation+7.97e-02

Pos logit contributions
 forgot+0.060
 switched+0.051
 sped+0.050
Uh+0.050
 struggled+0.050
Neg logit contributions
?"-0.059
?-0.059
 up-0.058
.-0.057
,"-0.057
Token to
Token ID284
Feature activation+4.07e-02

Pos logit contributions
ummy+0.053
 ourselves+0.051
ach+0.046
 lunch+0.045
ings+0.044
Neg logit contributions
 forgot-0.237
'd-0.219
 remembered-0.207
 regretted-0.204
 missed-0.203
Token eat
Token ID4483
Feature activation+1.03e-01

Pos logit contributions
ummy+0.076
ings+0.068
ach+0.068
ires+0.066
 ourselves+0.065
Neg logit contributions
 forgot-0.076
'd-0.066
 remembered-0.060
 missed-0.059
 knew-0.059
Token us
Token ID514
Feature activation+2.90e-02

Pos logit contributions
uttering+0.142
ummy+0.123
utter+0.122
ings+0.117
uggle+0.117
Neg logit contributions
 forgot-0.210
 knew-0.186
 had-0.184
 needed-0.183
 remembered-0.183
Token."
Token ID526
Feature activation+5.09e-02

Pos logit contributions
ummy+0.029
ach+0.025
ires+0.024
ings+0.023
uttering+0.023
Neg logit contributions
 forgot-0.073
'd-0.067
 remembered-0.064
 knew-0.062
 cared-0.061
Token Ben
Token ID3932
Feature activation-5.30e-02

Pos logit contributions
ummy+0.053
 ourselves+0.051
ach+0.049
ings+0.047
 lunch+0.046
Neg logit contributions
 forgot-0.132
'd-0.119
 remembered-0.113
 missed-0.111
 knew-0.111
Token said
Token ID531
Feature activation+3.60e-02

Pos logit contributions
 forgot+0.100
'd+0.095
 hadn+0.084
 sped+0.081
 regretted+0.079
Neg logit contributions
 something-0.083
 up-0.083
?-0.082
 the-0.080
 lunch-0.080
Token.
Token ID13
Feature activation+3.55e-02

Pos logit contributions
ummy+0.027
ach+0.026
 ourselves+0.026
 yourselves+0.022
 lunch+0.021
Neg logit contributions
 forgot-0.105
'd-0.098
 remembered-0.094
 knew-0.092
 missed-0.091
Token out
Token ID503
Feature activation+9.75e-02

Pos logit contributions
 forgot+0.014
'd+0.013
 remembered+0.012
 regretted+0.012
 missed+0.012
Neg logit contributions
ummy-0.006
 ourselves-0.006
ach-0.006
 lunch-0.006
 something-0.005
Token a
Token ID257
Feature activation+4.56e-02

Pos logit contributions
ummy+0.086
 ourselves+0.083
ach+0.079
uggle+0.073
ings+0.070
Neg logit contributions
 forgot-0.264
'd-0.242
 remembered-0.233
 knew-0.226
 regretted-0.225
Token big
Token ID1263
Feature activation+5.30e-02

Pos logit contributions
ummy+0.059
 lunch+0.056
 ourselves+0.055
ach+0.054
 something+0.054
Neg logit contributions
 forgot-0.106
'd-0.095
 remembered-0.089
 missed-0.087
 regretted-0.087
Token blanket
Token ID18447
Feature activation+4.87e-02

Pos logit contributions
 ourselves+0.074
ummy+0.073
ach+0.070
ings+0.068
 lunch+0.066
Neg logit contributions
 forgot-0.117
'd-0.106
 remembered-0.099
 regretted-0.096
 missed-0.095
Token and
Token ID290
Feature activation-8.54e-04

Pos logit contributions
 ourselves+0.041
ummy+0.041
ach+0.039
ings+0.036
 lunch+0.035
Neg logit contributions
 forgot-0.134
'd-0.126
 remembered-0.117
 missed-0.114
 regretted-0.114
Token ate
Token ID15063
Feature activation+1.03e-01

Pos logit contributions
 forgot+0.002
'd+0.002
 regretted+0.002
 remembered+0.002
 missed+0.002
Neg logit contributions
 something-0.001
 lunch-0.001
 ourselves-0.001
ummy-0.001
 up-0.001
Token cup
Token ID6508
Feature activation+4.86e-02

Pos logit contributions
ach+0.107
uttering+0.101
ummy+0.099
uggle+0.098
ush+0.096
Neg logit contributions
 forgot-0.233
'd-0.212
 knew-0.209
 needed-0.204
 had-0.203
Tokencakes
Token ID37263
Feature activation+3.71e-02

Pos logit contributions
 ourselves+0.082
 yourselves+0.079
ach+0.077
?'+0.075
 your+0.075
Neg logit contributions
 forgot-0.084
'd-0.075
ave-0.067
 needed-0.065
 switched-0.065
Token and
Token ID290
Feature activation+6.19e-03

Pos logit contributions
 yourselves+0.038
 ourselves+0.036
ummy+0.036
ach+0.036
ires+0.035
Neg logit contributions
 forgot-0.091
'd-0.083
 counted-0.077
 had-0.077
 talked-0.075
Token sandwiches
Token ID29777
Feature activation+5.45e-02

Pos logit contributions
 lunch+0.008
 ourselves+0.008
 something+0.008
ummy+0.008
 dinner+0.007
Neg logit contributions
 forgot-0.014
'd-0.013
 regretted-0.012
 remembered-0.012
 missed-0.011
Token.
Token ID13
Feature activation+3.75e-02

Pos logit contributions
 ourselves+0.042
ummy+0.042
ach+0.042
 yourselves+0.039
ings+0.035
Neg logit contributions
 forgot-0.152
'd-0.140
 remembered-0.130
 knew-0.129
 cared-0.128
Token of
Token ID286
Feature activation+9.92e-02

Pos logit contributions
 forgot+0.098
'd+0.092
 hadn+0.083
 regretted+0.081
 missed+0.081
Neg logit contributions
 that-0.041
 something-0.040
 the-0.040
 outside-0.040
 lunch-0.040
Token juicy
Token ID35075
Feature activation+9.16e-02

Pos logit contributions
ummy+0.093
ach+0.087
 ourselves+0.085
ings+0.082
time+0.073
Neg logit contributions
 forgot-0.264
'd-0.241
 remembered-0.230
 missed-0.224
 regretted-0.224
Token red
Token ID2266
Feature activation+6.80e-02

Pos logit contributions
 ourselves+0.117
uggle+0.109
ach+0.107
uttering+0.106
time+0.105
Neg logit contributions
 forgot-0.195
'd-0.170
 regretted-0.169
 remembered-0.167
 knew-0.165
Token strawberries
Token ID36973
Feature activation+4.58e-02

Pos logit contributions
 ourselves+0.080
ummy+0.078
ach+0.076
ings+0.072
time+0.070
Neg logit contributions
 forgot-0.159
'd-0.146
 remembered-0.136
 regretted-0.134
 cared-0.133
Token to
Token ID284
Feature activation+5.33e-02

Pos logit contributions
ummy+0.035
ach+0.034
 ourselves+0.033
ings+0.030
 yourselves+0.029
Neg logit contributions
 forgot-0.128
'd-0.120
 remembered-0.110
 knew-0.108
 needed-0.108
Token eat
Token ID4483
Feature activation+1.02e-01

Pos logit contributions
ummy+0.073
 ourselves+0.068
ach+0.068
ings+0.067
time+0.063
Neg logit contributions
 forgot-0.121
'd-0.109
 remembered-0.101
 missed-0.098
 knew-0.098
Token.
Token ID13
Feature activation+3.97e-02

Pos logit contributions
ummy+0.087
ach+0.085
uttering+0.080
uggle+0.080
 ourselves+0.079
Neg logit contributions
 forgot-0.256
'd-0.240
 knew-0.225
 remembered-0.222
 cared-0.221
Token Lily
Token ID20037
Feature activation-5.43e-02

Pos logit contributions
ummy+0.047
 ourselves+0.044
ach+0.042
 lunch+0.042
ings+0.041
Neg logit contributions
 forgot-0.098
'd-0.089
 remembered-0.083
 regretted-0.082
 missed-0.081
Token was
Token ID373
Feature activation+1.69e-02

Pos logit contributions
 forgot+0.083
'd+0.076
 sped+0.075
 switched+0.068
 officer+0.068
Neg logit contributions
 up-0.095
,-0.094
 the-0.094
 with-0.092
ummy-0.091
Token so
Token ID523
Feature activation+3.26e-02

Pos logit contributions
 up+0.022
 something+0.022
ummy+0.021
 lunch+0.021
 outside+0.021
Neg logit contributions
 forgot-0.038
'd-0.035
 regretted-0.032
 cared-0.031
ave-0.031
Token happy
Token ID3772
Feature activation+6.57e-02

Pos logit contributions
ummy+0.058
 ourselves+0.057
ach+0.055
 lunch+0.054
ings+0.054
Neg logit contributions
 forgot-0.061
'd-0.054
 remembered-0.049
 regretted-0.048
 missed-0.047
Token was
Token ID373
Feature activation+5.34e-02

Pos logit contributions
 forgot+0.057
'd+0.052
 remembered+0.050
 knew+0.047
 regretted+0.046
Neg logit contributions
 ourselves-0.049
ach-0.048
ummy-0.046
 yourselves-0.045
time-0.044
Token too
Token ID1165
Feature activation+5.28e-02

Pos logit contributions
ummy+0.065
 ourselves+0.064
 lunch+0.063
 something+0.062
ach+0.061
Neg logit contributions
 forgot-0.126
'd-0.114
 remembered-0.105
 regretted-0.104
 cared-0.103
Token fast
Token ID3049
Feature activation+5.44e-02

Pos logit contributions
ummy+0.079
 ourselves+0.077
ach+0.074
 lunch+0.073
ings+0.073
Neg logit contributions
 forgot-0.114
'd-0.102
 remembered-0.094
 regretted-0.092
 missed-0.091
Token and
Token ID290
Feature activation-1.49e-02

Pos logit contributions
ummy+0.032
 ourselves+0.030
ach+0.028
 lunch+0.026
ings+0.026
Neg logit contributions
 forgot-0.165
'd-0.153
 remembered-0.145
 regretted-0.143
 missed-0.142
Token it
Token ID340
Feature activation-3.07e-02

Pos logit contributions
 forgot+0.032
'd+0.030
ave+0.029
 regretted+0.029
 cared+0.028
Neg logit contributions
 the-0.019
 up-0.019
 something-0.019
 that-0.018
 your-0.018
Token ate
Token ID15063
Feature activation+1.02e-01

Pos logit contributions
 forgot+0.040
'd+0.037
 cared+0.036
 nephew+0.035
 remembered+0.035
Neg logit contributions
 up-0.057
 with-0.055
 the-0.055
 that-0.054
 it-0.054
Token the
Token ID262
Feature activation+4.94e-02

Pos logit contributions
ummy+0.095
ach+0.094
uggle+0.088
uttering+0.085
ings+0.080
Neg logit contributions
 forgot-0.255
'd-0.232
 knew-0.224
 remembered-0.220
 missed-0.219
Token mouse
Token ID10211
Feature activation+1.20e-02

Pos logit contributions
ummy+0.065
 ourselves+0.063
ach+0.060
ings+0.059
 lunch+0.059
Neg logit contributions
 forgot-0.114
'd-0.103
 remembered-0.096
 regretted-0.094
 missed-0.093
Token in
Token ID287
Feature activation+5.88e-02

Pos logit contributions
ummy+0.011
 ourselves+0.010
ach+0.010
ings+0.009
 lunch+0.009
Neg logit contributions
 forgot-0.033
'd-0.030
 remembered-0.028
 regretted-0.028
 missed-0.028
Token one
Token ID530
Feature activation+1.78e-02

Pos logit contributions
ummy+0.045
ach+0.039
 ourselves+0.034
uggle+0.034
ires+0.031
Neg logit contributions
 forgot-0.167
'd-0.153
 remembered-0.149
 knew-0.146
 struggled-0.146
Token bite
Token ID13197
Feature activation+6.06e-02

Pos logit contributions
 ourselves+0.024
ach+0.024
ummy+0.024
ings+0.023
 yourselves+0.022
Neg logit contributions
 forgot-0.038
'd-0.035
 remembered-0.032
 cared-0.031
 regretted-0.031
TokenIt
Token ID1026
Feature activation+1.54e-02

Pos logit contributions
 ourselves+0.051
ach+0.050
ummy+0.049
time+0.049
 something+0.048
Neg logit contributions
 forgot-0.084
'd-0.075
 remembered-0.069
 regretted-0.069
 missed-0.068
Token looks
Token ID3073
Feature activation+5.86e-02

Pos logit contributions
 up+0.025
 something+0.025
 ourselves+0.024
ummy+0.024
?+0.024
Neg logit contributions
 forgot-0.030
'd-0.025
 regretted-0.025
 struggled-0.024
 remembered-0.024
Token like
Token ID588
Feature activation+8.27e-02

Pos logit contributions
ummy+0.083
 ourselves+0.080
ings+0.077
ach+0.076
 lunch+0.072
Neg logit contributions
 forgot-0.130
'd-0.117
 remembered-0.109
 missed-0.106
 regretted-0.106
Token someone
Token ID2130
Feature activation-2.23e-02

Pos logit contributions
ummy+0.062
ings+0.058
ach+0.057
 ourselves+0.053
uggle+0.047
Neg logit contributions
 forgot-0.234
'd-0.218
 remembered-0.205
 cared-0.203
 knew-0.200
Token's
Token ID338
Feature activation+2.58e-02

Pos logit contributions
 forgot+0.049
'd+0.045
 regretted+0.042
 sped+0.042
 struggled+0.042
Neg logit contributions
 ourselves-0.028
 something-0.028
?-0.028
 your-0.027
 outside-0.027
Token blocking
Token ID12013
Feature activation+1.02e-01

Pos logit contributions
ummy+0.031
 ourselves+0.031
ach+0.029
 lunch+0.029
ings+0.029
Neg logit contributions
 forgot-0.062
'd-0.056
 remembered-0.053
 regretted-0.052
 missed-0.051
Token the
Token ID262
Feature activation+5.61e-02

Pos logit contributions
ummy+0.095
uggle+0.083
time+0.080
ach+0.079
uttering+0.078
Neg logit contributions
 forgot-0.261
'd-0.237
 remembered-0.233
 knew-0.228
 cared-0.220
Token way
Token ID835
Feature activation+6.24e-02

Pos logit contributions
ummy+0.091
 ourselves+0.089
ach+0.086
ings+0.085
 lunch+0.082
Neg logit contributions
 forgot-0.111
'd-0.100
 remembered-0.090
 regretted-0.089
 missed-0.088
Token.
Token ID13
Feature activation+2.51e-02

Pos logit contributions
ummy+0.051
ach+0.050
 lunch+0.049
 ourselves+0.044
ings+0.043
Neg logit contributions
 forgot-0.167
'd-0.158
 remembered-0.142
 knew-0.141
 hadn-0.140
Token Come
Token ID7911
Feature activation+1.38e-02

Pos logit contributions
ummy+0.031
 ourselves+0.030
ach+0.028
 something+0.027
 lunch+0.027
Neg logit contributions
 forgot-0.062
'd-0.055
 regretted-0.052
 remembered-0.052
 struggled-0.051
Token outside
Token ID2354
Feature activation+4.88e-02

Pos logit contributions
ummy+0.014
 ourselves+0.013
ach+0.011
 lunch+0.011
ings+0.011
Neg logit contributions
 forgot-0.036
'd-0.034
 remembered-0.032
 missed-0.030
 sped-0.030
Token said
Token ID531
Feature activation+3.30e-02

Pos logit contributions
 forgot+0.035
'd+0.030
 sped+0.029
 regretted+0.028
 struggled+0.028
Neg logit contributions
?-0.048
?"-0.048
 that-0.047
ummy-0.046
 up-0.046
Token it
Token ID340
Feature activation-4.77e-03

Pos logit contributions
ummy+0.031
 ourselves+0.029
ach+0.029
ings+0.028
 lunch+0.026
Neg logit contributions
 forgot-0.090
'd-0.083
 remembered-0.078
 missed-0.076
 regretted-0.076
Token was
Token ID373
Feature activation+3.92e-02

Pos logit contributions
 forgot+0.010
 regretted+0.008
 remembered+0.008
 sped+0.008
 competed+0.008
Neg logit contributions
time-0.007
 up-0.007
 something-0.006
.-0.006
 with-0.006
Token time
Token ID640
Feature activation+6.28e-02

Pos logit contributions
ummy+0.052
 ourselves+0.050
ach+0.050
ings+0.048
 lunch+0.044
Neg logit contributions
 forgot-0.092
'd-0.083
 remembered-0.078
 missed-0.076
 regretted-0.075
Token to
Token ID284
Feature activation+4.77e-02

Pos logit contributions
ummy+0.065
ach+0.064
 ourselves+0.063
ings+0.058
 yourselves+0.057
Neg logit contributions
 forgot-0.162
'd-0.150
 remembered-0.139
 knew-0.136
 missed-0.134
Token eat
Token ID4483
Feature activation+1.02e-01

Pos logit contributions
ummy+0.090
ires+0.086
ings+0.084
uttering+0.083
ach+0.082
Neg logit contributions
 forgot-0.086
'd-0.074
 remembered-0.067
 switched-0.067
 drove-0.066
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
uttering+0.136
ach+0.116
utter+0.112
ires+0.111
overed+0.111
Neg logit contributions
 forgot-0.208
 had-0.184
 needed-0.183
 knew-0.181
 bit-0.181
Token They
Token ID1119
Feature activation-4.38e-02

Pos logit contributions
ummy+0.023
 ourselves+0.022
 lunch+0.021
ach+0.020
ings+0.020
Neg logit contributions
 forgot-0.056
'd-0.052
 remembered-0.048
 regretted-0.047
 missed-0.047
Token went
Token ID1816
Feature activation+3.88e-02

Pos logit contributions
 forgot+0.061
'd+0.055
 fastest+0.053
ologist+0.052
ave+0.052
Neg logit contributions
 the-0.084
 lunch-0.082
 dinner-0.079
 her-0.079
 that-0.079
Token to
Token ID284
Feature activation+5.38e-02

Pos logit contributions
ummy+0.035
 ourselves+0.034
ach+0.033
ings+0.031
 yourselves+0.029
Neg logit contributions
 forgot-0.106
'd-0.096
 remembered-0.091
 missed-0.089
 regretted-0.088
Token a
Token ID257
Feature activation+2.85e-02

Pos logit contributions
ummy+0.066
ach+0.062
ings+0.060
ires+0.058
time+0.056
Neg logit contributions
 forgot-0.135
 missed-0.116
 remembered-0.114
 knew-0.114
'd-0.112
Token.
Token ID13
Feature activation+2.47e-02

Pos logit contributions
ummy+0.048
ings+0.041
ach+0.041
 ourselves+0.041
ires+0.041
Neg logit contributions
 forgot-0.108
'd-0.102
 remembered-0.094
 drove-0.090
 sped-0.090
Token I
Token ID314
Feature activation-1.25e-02

Pos logit contributions
ummy+0.029
 ourselves+0.029
ach+0.027
 lunch+0.026
 something+0.026
Neg logit contributions
 forgot-0.061
'd-0.055
 regretted-0.052
 remembered-0.052
 struggled-0.051
Token'm
Token ID1101
Feature activation+2.66e-02

Pos logit contributions
 forgot+0.021
 regretted+0.018
 cared+0.017
'd+0.017
 hadn+0.016
Neg logit contributions
?-0.021
?"-0.021
!'-0.021
 here-0.020
.'-0.020
Token sorry
Token ID7926
Feature activation+6.30e-02

Pos logit contributions
 ourselves+0.032
ummy+0.032
 lunch+0.032
 something+0.031
 outside+0.030
Neg logit contributions
 forgot-0.063
'd-0.058
 regretted-0.053
 remembered-0.053
 cared-0.053
Token we
Token ID356
Feature activation-1.59e-02

Pos logit contributions
ach+0.073
ags+0.069
ings+0.067
he+0.066
uggle+0.063
Neg logit contributions
 forgot-0.155
'd-0.139
 remembered-0.129
 knew-0.127
 missed-0.123
Token disturbed
Token ID24069
Feature activation+1.01e-01

Pos logit contributions
 forgot+0.021
'd+0.018
 cared+0.017
 remembered+0.017
 regretted+0.016
Neg logit contributions
?"-0.031
 outside-0.031
 up-0.031
?-0.030
,"-0.030
Token the
Token ID262
Feature activation+4.55e-02

Pos logit contributions
uttering+0.120
ummy+0.113
ings+0.109
uggle+0.108
asty+0.103
Neg logit contributions
 forgot-0.211
 cared-0.193
 wished-0.191
 missed-0.184
 remembered-0.183
Token wild
Token ID4295
Feature activation+4.38e-02

Pos logit contributions
ummy+0.065
 ourselves+0.063
ach+0.061
 lunch+0.060
ings+0.060
Neg logit contributions
 forgot-0.100
'd-0.090
 remembered-0.083
 regretted-0.081
 missed-0.081
Token animal
Token ID5044
Feature activation+2.27e-02

Pos logit contributions
ummy+0.060
 ourselves+0.059
ach+0.058
ings+0.056
 lunch+0.055
Neg logit contributions
 forgot-0.095
'd-0.086
 remembered-0.080
 regretted-0.079
 knew-0.078
Token."
Token ID526
Feature activation+3.73e-02

Pos logit contributions
ummy+0.014
 ourselves+0.013
ach+0.012
ings+0.011
 lunch+0.011
Neg logit contributions
 forgot-0.069
'd-0.064
 remembered-0.060
 regretted-0.059
 missed-0.059
Token
Token ID220
Feature activation+3.40e-02

Pos logit contributions
ummy+0.039
 ourselves+0.037
ach+0.036
 lunch+0.035
ings+0.035
Neg logit contributions
 forgot-0.097
'd-0.088
 remembered-0.083
 regretted-0.081
 missed-0.081
Token shrimp
Token ID30060
Feature activation+3.81e-02

Pos logit contributions
 ourselves+0.079
ummy+0.075
ach+0.071
ings+0.071
 itself+0.069
Neg logit contributions
 forgot-0.159
'd-0.142
 remembered-0.135
 cared-0.134
 regretted-0.133
Token that
Token ID326
Feature activation+2.48e-02

Pos logit contributions
ummy+0.026
 ourselves+0.025
ach+0.023
ings+0.022
 lunch+0.021
Neg logit contributions
 forgot-0.111
'd-0.103
 remembered-0.097
 regretted-0.096
 cared-0.095
Token were
Token ID547
Feature activation+3.51e-02

Pos logit contributions
ummy+0.037
 ourselves+0.036
 lunch+0.036
 something+0.036
time+0.035
Neg logit contributions
 forgot-0.050
'd-0.045
 regretted-0.043
 remembered-0.042
ave-0.042
Token ready
Token ID3492
Feature activation+5.72e-02

Pos logit contributions
ummy+0.041
 ourselves+0.039
ach+0.038
 lunch+0.037
ings+0.036
Neg logit contributions
 forgot-0.087
'd-0.079
 remembered-0.074
 regretted-0.073
 missed-0.072
Token to
Token ID284
Feature activation+4.01e-02

Pos logit contributions
ummy+0.050
 ourselves+0.049
ach+0.048
ings+0.045
 yourselves+0.044
Neg logit contributions
 forgot-0.153
'd-0.146
 remembered-0.132
 knew-0.130
 hadn-0.127
Token eat
Token ID4483
Feature activation+1.01e-01

Pos logit contributions
ummy+0.062
 ourselves+0.058
ach+0.058
ings+0.057
 lunch+0.054
Neg logit contributions
 forgot-0.085
'd-0.076
 remembered-0.070
 missed-0.068
 knew-0.067
Token.
Token ID13
Feature activation+3.60e-02

Pos logit contributions
ummy+0.090
uggle+0.081
ings+0.081
ach+0.081
uttering+0.079
Neg logit contributions
 forgot-0.248
'd-0.232
 knew-0.218
 remembered-0.216
 cared-0.216
Token What
Token ID1867
Feature activation+3.91e-02

Pos logit contributions
ummy+0.041
 ourselves+0.039
ach+0.037
 lunch+0.037
ings+0.036
Neg logit contributions
 forgot-0.091
'd-0.083
 remembered-0.078
 regretted-0.076
 missed-0.075
Token a
Token ID257
Feature activation+9.79e-03

Pos logit contributions
ummy+0.050
 ourselves+0.049
 lunch+0.046
ach+0.046
 something+0.045
Neg logit contributions
 forgot-0.092
'd-0.082
 remembered-0.077
 regretted-0.076
 missed-0.075
Token wonderful
Token ID7932
Feature activation+2.22e-02

Pos logit contributions
ummy+0.013
 lunch+0.013
ach+0.012
 something+0.012
 ourselves+0.012
Neg logit contributions
 forgot-0.023
'd-0.020
 remembered-0.019
 regretted-0.019
 struggled-0.019
Token day
Token ID1110
Feature activation+3.78e-02

Pos logit contributions
 ourselves+0.032
ummy+0.031
ach+0.029
ings+0.029
 yourselves+0.029
Neg logit contributions
 forgot-0.049
'd-0.044
 remembered-0.041
 cared-0.040
 knew-0.040
Token is
Token ID318
Feature activation+5.75e-02

Pos logit contributions
 forgot+0.124
'd+0.114
 remembered+0.105
 struggled+0.105
 regretted+0.103
Neg logit contributions
 that-0.089
.-0.084
,"-0.083
 with-0.081
 up-0.081
Token full
Token ID1336
Feature activation+8.33e-02

Pos logit contributions
 ourselves+0.063
 something+0.060
ummy+0.060
 lunch+0.060
 your+0.057
Neg logit contributions
 forgot-0.144
'd-0.132
 remembered-0.122
 regretted-0.122
 cared-0.120
Token of
Token ID286
Feature activation+6.45e-02

Pos logit contributions
 ourselves+0.079
ummy+0.076
ach+0.074
ings+0.072
 lunch+0.065
Neg logit contributions
 forgot-0.216
'd-0.201
 remembered-0.186
 knew-0.184
 regretted-0.184
Token things
Token ID1243
Feature activation+2.81e-02

Pos logit contributions
ummy+0.079
 ourselves+0.078
 lunch+0.075
ach+0.074
 something+0.073
Neg logit contributions
 forgot-0.155
'd-0.142
 remembered-0.131
 missed-0.128
 regretted-0.128
Token to
Token ID284
Feature activation+4.51e-02

Pos logit contributions
ummy+0.020
ach+0.019
ings+0.018
 ourselves+0.018
 lunch+0.016
Neg logit contributions
 forgot-0.080
'd-0.075
 remembered-0.069
 knew-0.068
 cared-0.068
Token explore
Token ID7301
Feature activation+1.01e-01

Pos logit contributions
ummy+0.067
ach+0.062
 ourselves+0.061
ings+0.061
time+0.058
Neg logit contributions
 forgot-0.095
'd-0.086
 remembered-0.079
 knew-0.078
 cared-0.077
Token and
Token ID290
Feature activation+7.20e-03

Pos logit contributions
ummy+0.106
ach+0.100
ings+0.098
uck+0.096
uttering+0.095
Neg logit contributions
 forgot-0.234
'd-0.229
 knew-0.208
 had-0.207
 remembered-0.207
Token learn
Token ID2193
Feature activation+7.97e-02

Pos logit contributions
 ourselves+0.009
ummy+0.009
 lunch+0.009
 something+0.009
ach+0.009
Neg logit contributions
 forgot-0.016
'd-0.015
 remembered-0.014
 regretted-0.014
 missed-0.013
Token.
Token ID13
Feature activation+2.60e-02

Pos logit contributions
ummy+0.093
ach+0.090
 ourselves+0.077
ings+0.077
 lunch+0.077
Neg logit contributions
 forgot-0.187
'd-0.175
 knew-0.161
 cared-0.160
 regretted-0.160
Token You
Token ID921
Feature activation-1.93e-02

Pos logit contributions
ummy+0.030
 ourselves+0.029
ach+0.027
 lunch+0.027
ings+0.026
Neg logit contributions
 forgot-0.065
'd-0.059
 remembered-0.055
 regretted-0.055
 missed-0.054
Token can
Token ID460
Feature activation+2.35e-02

Pos logit contributions
 forgot+0.034
 regretted+0.031
 sped+0.030
ave+0.029
 sank+0.028
Neg logit contributions
?"-0.029
 the-0.029
 that-0.029
 something-0.029
 ourselves-0.028
Token girl
Token ID2576
Feature activation+5.41e-03

Pos logit contributions
 ourselves+0.038
ach+0.036
 itself+0.035
 yourselves+0.035
ate+0.032
Neg logit contributions
 forgot-0.073
 remembered-0.064
 knew-0.062
 had-0.061
'd-0.061
Token was
Token ID373
Feature activation+4.48e-02

Pos logit contributions
ummy+0.007
!'+0.007
 something+0.007
 outside+0.007
 that+0.007
Neg logit contributions
 forgot-0.011
'd-0.010
 sped-0.009
 regretted-0.009
 struggled-0.009
Token very
Token ID845
Feature activation+2.75e-02

Pos logit contributions
ummy+0.058
 ourselves+0.056
ach+0.054
 lunch+0.054
ings+0.053
Neg logit contributions
 forgot-0.104
'd-0.095
 remembered-0.087
 regretted-0.086
 cared-0.085
Token happy
Token ID3772
Feature activation+7.06e-02

Pos logit contributions
ummy+0.042
 up+0.039
 your+0.038
ach+0.038
 something+0.038
Neg logit contributions
 forgot-0.056
'd-0.054
 sped-0.047
 remembered-0.047
 regretted-0.047
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
 ourselves+0.055
ummy+0.054
ach+0.053
ings+0.050
uggle+0.046
Neg logit contributions
 forgot-0.197
'd-0.179
 remembered-0.172
 cared-0.169
 knew-0.169
Token ate
Token ID15063
Feature activation+1.01e-01

Pos logit contributions
 forgot+0.021
'd+0.021
 struggled+0.019
ave+0.018
mo+0.018
Neg logit contributions
 that-0.021
 the-0.021
 your-0.021
 up-0.021
 her-0.021
Token her
Token ID607
Feature activation+2.81e-02

Pos logit contributions
ach+0.112
uttering+0.098
ummy+0.098
ings+0.096
uggle+0.096
Neg logit contributions
 forgot-0.226
 knew-0.205
 cared-0.201
 remembered-0.197
 talked-0.195
Token hot
Token ID3024
Feature activation+5.49e-02

Pos logit contributions
ummy+0.029
ach+0.028
 ourselves+0.028
ings+0.026
 yourselves+0.025
Neg logit contributions
 forgot-0.071
'd-0.064
 remembered-0.061
 cared-0.060
 regretted-0.060
Token dog
Token ID3290
Feature activation+1.96e-02

Pos logit contributions
 ourselves+0.070
ach+0.062
 yourselves+0.061
 itself+0.060
ires+0.059
Neg logit contributions
 forgot-0.122
'd-0.110
 cared-0.106
 remembered-0.105
 knew-0.103
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
ummy+0.010
 ourselves+0.010
ach+0.009
 lunch+0.008
ings+0.008
Neg logit contributions
 forgot-0.061
'd-0.057
 remembered-0.054
 regretted-0.053
 missed-0.053
Token From
Token ID3574
Feature activation+4.66e-02

Pos logit contributions
ummy+0.024
 ourselves+0.023
ach+0.022
 lunch+0.022
ings+0.022
Neg logit contributions
 forgot-0.053
'd-0.048
 remembered-0.045
 regretted-0.044
 missed-0.044
Token snow
Token ID6729
Feature activation+6.57e-02

Pos logit contributions
 forgot+0.063
'd+0.058
 remembered+0.053
 missed+0.053
 knew+0.053
Neg logit contributions
ummy-0.019
ach-0.017
 ourselves-0.016
 lunch-0.016
ings-0.015
Token flew
Token ID13112
Feature activation+7.53e-02

Pos logit contributions
ummy+0.080
 ourselves+0.078
ach+0.075
 lunch+0.069
ings+0.069
Neg logit contributions
 forgot-0.159
'd-0.144
 remembered-0.133
 missed-0.131
 regretted-0.130
Token everywhere
Token ID8347
Feature activation+7.47e-02

Pos logit contributions
ummy+0.110
 ourselves+0.104
ach+0.099
 yourselves+0.098
 lunch+0.096
Neg logit contributions
 forgot-0.158
'd-0.151
 sped-0.136
 remembered-0.134
 knew-0.132
Token and
Token ID290
Feature activation+1.36e-02

Pos logit contributions
ummy+0.054
 ourselves+0.051
ach+0.050
ings+0.045
 lunch+0.043
Neg logit contributions
 forgot-0.217
'd-0.201
 remembered-0.189
 missed-0.185
 knew-0.185
Token it
Token ID340
Feature activation-1.77e-02

Pos logit contributions
 ourselves+0.020
 something+0.020
 lunch+0.019
 outside+0.019
 up+0.019
Neg logit contributions
 forgot-0.028
'd-0.026
 remembered-0.023
 regretted-0.023
 cared-0.023
Token covered
Token ID5017
Feature activation+1.01e-01

Pos logit contributions
 forgot+0.025
'd+0.025
 officer+0.022
 cared+0.022
 nephew+0.021
Neg logit contributions
 up-0.031
 that-0.031
 outside-0.031
 the-0.031
 with-0.030
Token him
Token ID683
Feature activation+1.71e-02

Pos logit contributions
uggle+0.104
ummy+0.101
ach+0.093
uttering+0.092
ush+0.090
Neg logit contributions
 forgot-0.218
 had-0.207
'd-0.203
 hadn-0.203
 knew-0.201
Token from
Token ID422
Feature activation+6.16e-02

Pos logit contributions
ummy+0.013
ach+0.011
 ourselves+0.011
 lunch+0.010
time+0.010
Neg logit contributions
 forgot-0.049
'd-0.044
 remembered-0.042
 regretted-0.042
 missed-0.042
Token head
Token ID1182
Feature activation+4.59e-02

Pos logit contributions
ummy+0.071
ach+0.065
ings+0.065
time+0.062
 ourselves+0.061
Neg logit contributions
 forgot-0.151
'd-0.133
 remembered-0.133
 regretted-0.132
 wished-0.131
Token to
Token ID284
Feature activation+3.74e-02

Pos logit contributions
ummy+0.030
 ourselves+0.027
ach+0.026
ings+0.022
 lunch+0.022
Neg logit contributions
 forgot-0.136
'd-0.127
 remembered-0.120
 regretted-0.118
 missed-0.118
Token toe
Token ID21189
Feature activation+4.33e-02

Pos logit contributions
ummy+0.052
ach+0.047
 ourselves+0.047
ings+0.047
time+0.044
Neg logit contributions
 forgot-0.087
'd-0.078
 remembered-0.073
 missed-0.072
 knew-0.071
Token until
Token ID1566
Feature activation+4.98e-02

Pos logit contributions
ummy+0.028
 ourselves+0.026
ach+0.024
ings+0.021
 lunch+0.021
Neg logit contributions
 forgot-0.142
'd-0.131
 remembered-0.124
 regretted-0.122
 missed-0.122
Token it
Token ID340
Feature activation-4.17e-03

Pos logit contributions
ummy+0.042
ach+0.039
 ourselves+0.039
ings+0.037
.'+0.032
Neg logit contributions
 forgot-0.143
'd-0.128
 remembered-0.124
 missed-0.122
 regretted-0.120
Token was
Token ID373
Feature activation+4.95e-02

Pos logit contributions
 forgot+0.006
'd+0.005
 cared+0.005
 regretted+0.005
 remembered+0.005
Neg logit contributions
time-0.008
 up-0.008
 something-0.008
 lunch-0.008
 outside-0.008
Token time
Token ID640
Feature activation+7.55e-02

Pos logit contributions
ach+0.075
ummy+0.075
 ourselves+0.073
ings+0.069
uggle+0.068
Neg logit contributions
 forgot-0.108
'd-0.094
 remembered-0.090
 missed-0.089
 hadn-0.086
Token to
Token ID284
Feature activation+4.93e-02

Pos logit contributions
ach+0.083
 ourselves+0.083
ummy+0.081
 yourselves+0.078
ings+0.074
Neg logit contributions
 forgot-0.189
'd-0.175
 remembered-0.161
 knew-0.157
 missed-0.156
Token eat
Token ID4483
Feature activation+1.01e-01

Pos logit contributions
ummy+0.092
ings+0.084
ach+0.084
ires+0.084
uttering+0.080
Neg logit contributions
 forgot-0.093
'd-0.077
 missed-0.074
 remembered-0.073
 switched-0.072
Token again
Token ID757
Feature activation+6.01e-02

Pos logit contributions
uttering+0.110
ach+0.108
ires+0.104
ummy+0.104
uggle+0.101
Neg logit contributions
 forgot-0.223
 knew-0.194
 remembered-0.194
 had-0.193
'd-0.192
Token.
Token ID13
Feature activation+3.39e-02

Pos logit contributions
ummy+0.049
ings+0.044
ach+0.044
 ourselves+0.043
uggle+0.043
Neg logit contributions
 forgot-0.170
'd-0.150
 remembered-0.149
 knew-0.142
 needed-0.140
Token The
Token ID383
Feature activation+1.75e-02

Pos logit contributions
ummy+0.036
 ourselves+0.034
 lunch+0.032
ach+0.032
ings+0.031
Neg logit contributions
 forgot-0.088
'd-0.080
 remembered-0.075
 regretted-0.074
 missed-0.073
Token gray
Token ID12768
Feature activation+1.60e-02

Pos logit contributions
 lunch+0.023
ummy+0.023
 something+0.022
 ourselves+0.021
 dinner+0.021
Neg logit contributions
 forgot-0.040
'd-0.037
 remembered-0.034
 missed-0.033
ave-0.033
Token whale
Token ID22206
Feature activation-5.81e-02

Pos logit contributions
ummy+0.024
 ourselves+0.023
ach+0.023
 lunch+0.023
ings+0.022
Neg logit contributions
 forgot-0.034
'd-0.031
 remembered-0.028
 regretted-0.027
 missed-0.027
Token the
Token ID262
Feature activation+7.38e-02

Pos logit contributions
ummy+0.013
ings+0.012
ach+0.011
 ourselves+0.011
time+0.011
Neg logit contributions
 forgot-0.028
'd-0.025
 remembered-0.024
 missed-0.023
 regretted-0.023
Token batter
Token ID4984
Feature activation+3.75e-02

Pos logit contributions
ummy+0.104
 ourselves+0.101
 lunch+0.099
ach+0.097
ings+0.095
Neg logit contributions
 forgot-0.164
'd-0.147
 remembered-0.136
 missed-0.133
 regretted-0.133
Token and
Token ID290
Feature activation+7.20e-03

Pos logit contributions
ummy+0.036
 ourselves+0.035
ach+0.033
 lunch+0.032
ings+0.031
Neg logit contributions
 forgot-0.100
'd-0.091
 remembered-0.085
 regretted-0.084
 missed-0.083
Token pour
Token ID12797
Feature activation+8.38e-02

Pos logit contributions
 lunch+0.012
 ourselves+0.012
ummy+0.012
 something+0.012
 up+0.011
Neg logit contributions
 forgot-0.014
'd-0.012
 regretted-0.011
 remembered-0.011
 missed-0.011
Token it
Token ID340
Feature activation+2.97e-02

Pos logit contributions
ummy+0.087
ach+0.086
 ourselves+0.079
uggle+0.074
uttering+0.073
Neg logit contributions
 forgot-0.202
'd-0.181
 knew-0.172
 switched-0.171
 needed-0.169
Token onto
Token ID4291
Feature activation+1.01e-01

Pos logit contributions
ummy+0.027
 ourselves+0.026
ach+0.025
ings+0.023
 yourselves+0.022
Neg logit contributions
 forgot-0.080
'd-0.074
 remembered-0.069
 missed-0.068
 regretted-0.067
Token the
Token ID262
Feature activation+5.46e-02

Pos logit contributions
ummy+0.095
uttering+0.090
ach+0.089
uggle+0.086
ires+0.083
Neg logit contributions
 forgot-0.241
'd-0.224
 had-0.213
 needed-0.212
 knew-0.211
Token pan
Token ID3425
Feature activation+3.49e-02

Pos logit contributions
ummy+0.076
 ourselves+0.074
ach+0.072
ings+0.070
 lunch+0.068
Neg logit contributions
 forgot-0.121
'd-0.109
 remembered-0.100
 regretted-0.098
 missed-0.097
Token.
Token ID13
Feature activation+3.85e-02

Pos logit contributions
ummy+0.019
 ourselves+0.018
ach+0.018
 lunch+0.015
ings+0.015
Neg logit contributions
 forgot-0.106
'd-0.098
 remembered-0.092
 regretted-0.090
 knew-0.090
Token She
Token ID1375
Feature activation-5.52e-02

Pos logit contributions
ummy+0.045
 ourselves+0.043
ach+0.041
 lunch+0.040
ings+0.040
Neg logit contributions
 forgot-0.096
'd-0.087
 remembered-0.082
 regretted-0.080
 missed-0.079
Token watched
Token ID7342
Feature activation+7.24e-02

Pos logit contributions
 forgot+0.084
'd+0.079
ave+0.071
 sped+0.070
 regretted+0.070
Neg logit contributions
 the-0.097
 that-0.097
 up-0.096
?"-0.094
 something-0.094
Token years
Token ID812
Feature activation+7.06e-02

Pos logit contributions
 forgot+0.017
'd+0.015
 remembered+0.014
 missed+0.014
 cared+0.013
Neg logit contributions
ummy-0.009
 ourselves-0.008
 lunch-0.008
 outside-0.008
 something-0.008
Token old
Token ID1468
Feature activation+5.55e-02

Pos logit contributions
 ourselves+0.082
ummy+0.079
ings+0.079
ach+0.079
 lunch+0.073
Neg logit contributions
 forgot-0.170
'd-0.154
 remembered-0.144
 knew-0.143
 regretted-0.143
Token and
Token ID290
Feature activation+4.05e-03

Pos logit contributions
ummy+0.059
 ourselves+0.056
ach+0.054
 lunch+0.054
ings+0.053
Neg logit contributions
 forgot-0.143
'd-0.130
 remembered-0.121
 missed-0.119
 regretted-0.118
Token loved
Token ID6151
Feature activation+6.44e-02

Pos logit contributions
 something+0.006
 the+0.006
 that+0.006
 her+0.006
 outside+0.005
Neg logit contributions
 forgot-0.008
'd-0.007
ave-0.007
 regretted-0.007
 remembered-0.007
Token to
Token ID284
Feature activation+3.81e-02

Pos logit contributions
ummy+0.054
 ourselves+0.052
ach+0.049
 lunch+0.047
ings+0.047
Neg logit contributions
 forgot-0.180
'd-0.165
 remembered-0.156
 regretted-0.153
 missed-0.152
Token explore
Token ID7301
Feature activation+1.00e-01

Pos logit contributions
ummy+0.058
ach+0.054
 ourselves+0.054
ings+0.053
 yourselves+0.049
Neg logit contributions
 forgot-0.083
'd-0.076
 remembered-0.070
 knew-0.068
 missed-0.067
Token new
Token ID649
Feature activation+5.99e-02

Pos logit contributions
ocolate+0.150
uck+0.148
ach+0.147
ummy+0.143
ings+0.141
Neg logit contributions
 had-0.182
'd-0.181
 would-0.178
 knew-0.176
 forgot-0.175
Token things
Token ID1243
Feature activation+4.32e-02

Pos logit contributions
ummy+0.072
 ourselves+0.065
ach+0.062
 lunch+0.062
ings+0.061
Neg logit contributions
 forgot-0.145
'd-0.132
 remembered-0.125
 cared-0.124
 regretted-0.123
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
ummy+0.040
ach+0.038
ings+0.037
 ourselves+0.034
 lunch+0.034
Neg logit contributions
 forgot-0.115
'd-0.110
 needed-0.100
 had-0.099
 cared-0.099
Token One
Token ID1881
Feature activation+1.32e-02

Pos logit contributions
ummy+0.031
 ourselves+0.030
ach+0.029
ings+0.028
 lunch+0.028
Neg logit contributions
 forgot-0.058
'd-0.052
 remembered-0.049
 regretted-0.048
 missed-0.048
Token day
Token ID1110
Feature activation+5.63e-02

Pos logit contributions
ummy+0.016
 ourselves+0.015
 lunch+0.015
ach+0.015
 something+0.015
Neg logit contributions
 forgot-0.032
'd-0.029
 remembered-0.027
 regretted-0.027
 missed-0.026
Token years
Token ID812
Feature activation+7.06e-02

Pos logit contributions
 forgot+0.015
'd+0.014
 remembered+0.014
 regretted+0.014
 struggled+0.013
Neg logit contributions
ummy-0.002
 ourselves-0.002
time-0.001
ach-0.001
 lunch-0.001
Token old
Token ID1468
Feature activation+5.55e-02

Pos logit contributions
 ourselves+0.082
ummy+0.079
ings+0.079
ach+0.079
 lunch+0.073
Neg logit contributions
 forgot-0.170
'd-0.154
 remembered-0.144
 knew-0.143
 regretted-0.143
Token and
Token ID290
Feature activation+4.05e-03

Pos logit contributions
ummy+0.059
 ourselves+0.056
ach+0.054
 lunch+0.054
ings+0.053
Neg logit contributions
 forgot-0.143
'd-0.130
 remembered-0.121
 missed-0.119
 regretted-0.118
Token loved
Token ID6151
Feature activation+6.44e-02

Pos logit contributions
 something+0.006
 the+0.006
 that+0.006
 her+0.006
 outside+0.005
Neg logit contributions
 forgot-0.008
'd-0.007
ave-0.007
 regretted-0.007
 remembered-0.007
Token to
Token ID284
Feature activation+3.81e-02

Pos logit contributions
ummy+0.054
 ourselves+0.052
ach+0.049
 lunch+0.047
ings+0.047
Neg logit contributions
 forgot-0.180
'd-0.165
 remembered-0.156
 regretted-0.153
 missed-0.152
Token explore
Token ID7301
Feature activation+1.00e-01

Pos logit contributions
ummy+0.058
ach+0.054
 ourselves+0.054
ings+0.053
 yourselves+0.049
Neg logit contributions
 forgot-0.083
'd-0.076
 remembered-0.070
 knew-0.068
 missed-0.067
Token and
Token ID290
Feature activation+1.22e-02

Pos logit contributions
ocolate+0.150
uck+0.148
ach+0.147
ummy+0.143
ings+0.141
Neg logit contributions
 had-0.182
'd-0.181
 would-0.178
 knew-0.176
 forgot-0.175
Token make
Token ID787
Feature activation+6.98e-02

Pos logit contributions
ummy+0.014
 ourselves+0.014
ach+0.013
 lunch+0.013
ings+0.013
Neg logit contributions
 forgot-0.030
'd-0.027
 remembered-0.025
 regretted-0.025
 missed-0.024
Token new
Token ID649
Feature activation+4.49e-02

Pos logit contributions
ummy+0.090
ach+0.078
 ourselves+0.077
ings+0.074
?'+0.073
Neg logit contributions
 forgot-0.162
'd-0.143
 cared-0.136
 knew-0.136
 remembered-0.135
Token friends
Token ID2460
Feature activation+2.75e-02

Pos logit contributions
ummy+0.062
 ourselves+0.059
ach+0.059
ings+0.056
 lunch+0.056
Neg logit contributions
 forgot-0.099
'd-0.088
 remembered-0.082
 cared-0.081
 regretted-0.081
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
ummy+0.022
ach+0.018
 ourselves+0.017
ings+0.017
 lunch+0.017
Neg logit contributions
 forgot-0.079
'd-0.075
 remembered-0.069
 cared-0.069
 knew-0.068
Token from
Token ID422
Feature activation+1.93e-02

Pos logit contributions
 forgot+0.090
'd+0.082
 remembered+0.078
 regretted+0.077
 missed+0.077
Neg logit contributions
ummy-0.024
 ourselves-0.024
ings-0.022
 lunch-0.022
 something-0.021
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
ummy+0.011
 ourselves+0.010
ach+0.010
ings+0.009
 lunch+0.009
Neg logit contributions
 forgot-0.059
'd-0.055
 remembered-0.052
 regretted-0.051
 missed-0.051
Token tank
Token ID6873
Feature activation-1.43e-02

Pos logit contributions
 forgot+0.040
'd+0.035
 remembered+0.033
ave+0.032
 missed+0.032
Neg logit contributions
 ourselves-0.021
ummy-0.021
 lunch-0.021
 outside-0.020
 something-0.020
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
 forgot+0.046
'd+0.042
 remembered+0.040
 missed+0.040
 regretted+0.040
Neg logit contributions
ummy-0.006
 ourselves-0.006
ach-0.005
 lunch-0.005
ings-0.005
Token Hel
Token ID5053
Feature activation-2.92e-02

Pos logit contributions
 forgot+0.034
'd+0.030
 remembered+0.029
 regretted+0.028
 missed+0.028
Neg logit contributions
ummy-0.015
 ourselves-0.014
ings-0.013
ach-0.013
 something-0.012
Tokenio
Token ID952
Feature activation-1.34e-01

Pos logit contributions
 forgot+0.059
'd+0.052
 remembered+0.048
 missed+0.047
 cared+0.047
Neg logit contributions
ummy-0.048
ach-0.047
 ourselves-0.046
ings-0.045
 lunch-0.044
Token was
Token ID373
Feature activation-2.47e-02

Pos logit contributions
 forgot+0.238
Uh+0.198
 regretted+0.193
'd+0.191
 missed+0.189
Neg logit contributions
 that-0.219
 with-0.218
,-0.218
.-0.210
ummy-0.207
Token thankful
Token ID25535
Feature activation+1.86e-02

Pos logit contributions
 forgot+0.055
'd+0.049
 regretted+0.046
 struggled+0.044
ave+0.044
Neg logit contributions
 something-0.033
 up-0.032
 that-0.032
 the-0.031
 ourselves-0.031
Token and
Token ID290
Feature activation-5.26e-02

Pos logit contributions
ummy+0.013
 ourselves+0.013
ach+0.012
ings+0.012
 lunch+0.011
Neg logit contributions
 forgot-0.054
'd-0.050
 remembered-0.047
 regretted-0.046
 knew-0.046
Token thanked
Token ID26280
Feature activation+3.51e-02

Pos logit contributions
 forgot+0.082
'd+0.077
 struggled+0.065
ave+0.065
 missed+0.064
Neg logit contributions
 that-0.102
 the-0.101
 something-0.096
 with-0.096
 up-0.095
Token T
Token ID309
Feature activation-2.03e-02

Pos logit contributions
ummy+0.039
 ourselves+0.037
ach+0.035
ings+0.035
 lunch+0.035
Neg logit contributions
 forgot-0.088
'd-0.080
 remembered-0.075
 regretted-0.074
 missed-0.073
Token for
Token ID329
Feature activation+1.12e-02

Pos logit contributions
ummy+0.009
 ourselves+0.009
ach+0.008
ings+0.008
 lunch+0.008
Neg logit contributions
 forgot-0.030
'd-0.027
 remembered-0.026
 missed-0.025
 regretted-0.025
Token hours
Token ID2250
Feature activation+6.44e-03

Pos logit contributions
ummy+0.012
 ourselves+0.012
ach+0.011
ings+0.011
 lunch+0.011
Neg logit contributions
 forgot-0.028
'd-0.026
 remembered-0.024
 regretted-0.024
 missed-0.024
Token.
Token ID13
Feature activation-6.53e-03

Pos logit contributions
ummy+0.004
 ourselves+0.004
ach+0.004
ings+0.003
 lunch+0.003
Neg logit contributions
 forgot-0.019
'd-0.018
 remembered-0.017
 regretted-0.017
 missed-0.017
Token Whenever
Token ID21326
Feature activation+1.76e-02

Pos logit contributions
 forgot+0.017
'd+0.015
 remembered+0.014
 regretted+0.014
 struggled+0.014
Neg logit contributions
ummy-0.008
 ourselves-0.007
 lunch-0.007
time-0.007
ings-0.007
Token L
Token ID406
Feature activation-4.16e-02

Pos logit contributions
ummy+0.018
 ourselves+0.017
ach+0.016
 lunch+0.016
ings+0.016
Neg logit contributions
 forgot-0.046
'd-0.043
 remembered-0.040
 regretted-0.039
 missed-0.039
Tokenola
Token ID5708
Feature activation-1.33e-01

Pos logit contributions
 forgot+0.085
'd+0.074
 remembered+0.068
 knew+0.067
 regretted+0.067
Neg logit contributions
 ourselves-0.066
ummy-0.064
 lunch-0.062
ach-0.062
ings-0.061
Token was
Token ID373
Feature activation+5.71e-03

Pos logit contributions
Uh+0.183
 ounces+0.175
 forgot+0.173
'd+0.170
 sizes+0.164
Neg logit contributions
,-0.235
?-0.225
 outside-0.218
?"-0.216
 with-0.215
Token feeling
Token ID4203
Feature activation+3.41e-04

Pos logit contributions
ummy+0.010
 lunch+0.010
 something+0.010
 outside+0.009
 up+0.009
Neg logit contributions
 forgot-0.011
'd-0.010
 remembered-0.008
 regretted-0.008
 cared-0.008
Token sad
Token ID6507
Feature activation+1.43e-02

Pos logit contributions
 something+0.001
ummy+0.001
 lunch+0.001
 outside+0.001
time+0.001
Neg logit contributions
 forgot-0.001
'd-0.001
 remembered-0.000
 regretted-0.000
 hadn-0.000
Token,
Token ID11
Feature activation-7.49e-03

Pos logit contributions
ummy+0.006
 ourselves+0.006
ach+0.005
 lunch+0.005
ings+0.005
Neg logit contributions
 forgot-0.046
'd-0.043
 remembered-0.040
 regretted-0.040
 missed-0.040
Token the
Token ID262
Feature activation-3.38e-02

Pos logit contributions
 forgot+0.021
'd+0.019
 remembered+0.018
 missed+0.018
 regretted+0.018
Neg logit contributions
ummy-0.006
 ourselves-0.006
 something-0.006
 lunch-0.006
ach-0.006
Token fashion
Token ID6977
Feature activation-6.44e-03

Pos logit contributions
ummy+0.010
 ourselves+0.009
ach+0.008
ings+0.008
 lunch+0.007
Neg logit contributions
 forgot-0.061
'd-0.057
 remembered-0.054
 regretted-0.053
 missed-0.053
Token.
Token ID13
Feature activation-4.47e-03

Pos logit contributions
 forgot+0.019
'd+0.018
 remembered+0.017
 regretted+0.016
 cared+0.016
Neg logit contributions
ummy-0.004
 ourselves-0.004
ach-0.004
 lunch-0.003
ings-0.003
Token Every
Token ID3887
Feature activation-1.15e-04

Pos logit contributions
 forgot+0.009
'd+0.008
 missed+0.007
 regretted+0.007
 remembered+0.007
Neg logit contributions
 ourselves-0.007
ach-0.007
ummy-0.007
ings-0.007
 lunch-0.006
Token day
Token ID1110
Feature activation+2.03e-02

Pos logit contributions
 forgot+0.000
'd+0.000
 remembered+0.000
 regretted+0.000
 missed+0.000
Neg logit contributions
ummy-0.000
time-0.000
 lunch-0.000
 something-0.000
 ourselves-0.000
Token when
Token ID618
Feature activation+1.29e-02

Pos logit contributions
 ourselves+0.018
ummy+0.018
ach+0.017
ings+0.016
 lunch+0.015
Neg logit contributions
 forgot-0.056
'd-0.051
 remembered-0.049
 regretted-0.048
 missed-0.047
Token Hannah
Token ID26013
Feature activation-1.31e-01

Pos logit contributions
ach+0.016
ummy+0.016
 ourselves+0.015
ings+0.015
 itself+0.014
Neg logit contributions
 forgot-0.031
'd-0.027
 remembered-0.026
 missed-0.026
 regretted-0.025
Token got
Token ID1392
Feature activation+1.70e-02

Pos logit contributions
 forgot+0.206
'd+0.176
 sped+0.169
 regretted+0.167
 charged+0.163
Neg logit contributions
,-0.227
?-0.223
 outside-0.223
 with-0.215
.-0.213
Token dressed
Token ID12049
Feature activation+8.31e-03

Pos logit contributions
ummy+0.014
 ourselves+0.013
ach+0.013
ings+0.012
 lunch+0.011
Neg logit contributions
 forgot-0.048
'd-0.044
 remembered-0.042
 regretted-0.041
 missed-0.041
Token,
Token ID11
Feature activation-4.09e-03

Pos logit contributions
 ourselves+0.007
ummy+0.006
ach+0.006
 lunch+0.006
 yourselves+0.005
Neg logit contributions
 forgot-0.023
'd-0.021
 remembered-0.020
 knew-0.020
 cared-0.019
Token her
Token ID607
Feature activation-2.40e-02

Pos logit contributions
 forgot+0.011
'd+0.010
 remembered+0.009
 regretted+0.009
 missed+0.009
Neg logit contributions
ummy-0.004
 ourselves-0.004
ach-0.004
ings-0.004
 lunch-0.004
Token mum
Token ID25682
Feature activation-8.74e-02

Pos logit contributions
 forgot+0.059
'd+0.053
 remembered+0.049
 regretted+0.049
 missed+0.048
Neg logit contributions
ummy-0.029
 lunch-0.027
 something-0.027
 ourselves-0.026
ach-0.026
Token't
Token ID470
Feature activation-1.32e-02

Pos logit contributions
ummy+0.048
ach+0.047
 ourselves+0.047
 lunch+0.046
ings+0.045
Neg logit contributions
 forgot-0.058
'd-0.054
 missed-0.049
 hadn-0.047
 switched-0.047
Token split
Token ID6626
Feature activation+2.82e-02

Pos logit contributions
 forgot+0.026
'd+0.024
 remembered+0.023
 regretted+0.022
ave+0.022
Neg logit contributions
 something-0.019
 ourselves-0.019
?-0.018
 your-0.018
ummy-0.018
Token again
Token ID757
Feature activation+7.32e-03

Pos logit contributions
ummy+0.027
ach+0.025
 ourselves+0.025
 lunch+0.024
time+0.023
Neg logit contributions
 forgot-0.075
'd-0.067
 remembered-0.065
 regretted-0.063
 missed-0.062
Token!
Token ID0
Feature activation-5.63e-03

Pos logit contributions
ummy+0.004
 ourselves+0.004
ach+0.004
 lunch+0.003
ings+0.003
Neg logit contributions
 forgot-0.022
'd-0.021
 remembered-0.020
 regretted-0.019
 missed-0.019
Token Whenever
Token ID21326
Feature activation+1.32e-02

Pos logit contributions
 forgot+0.013
'd+0.012
 knew+0.011
 remembered+0.011
 cared+0.011
Neg logit contributions
ach-0.008
 ourselves-0.008
ings-0.007
 lunch-0.007
ummy-0.007
Token Sally
Token ID25737
Feature activation-1.30e-01

Pos logit contributions
ach+0.018
ings+0.018
ummy+0.018
 ourselves+0.017
ags+0.017
Neg logit contributions
 forgot-0.032
 remembered-0.027
'd-0.026
 knew-0.026
 missed-0.026
Token was
Token ID373
Feature activation-3.17e-03

Pos logit contributions
 forgot+0.183
 sped+0.180
 arrested+0.178
'd+0.175
Uh+0.167
Neg logit contributions
?-0.233
 outside-0.222
,-0.220
 up-0.219
?"-0.214
Token writing
Token ID3597
Feature activation+4.00e-03

Pos logit contributions
 forgot+0.006
'd+0.005
 cared+0.005
 sped+0.005
 regretted+0.005
Neg logit contributions
 something-0.005
 outside-0.005
ummy-0.005
 up-0.005
 lunch-0.005
Token in
Token ID287
Feature activation+3.14e-03

Pos logit contributions
ummy+0.002
 ourselves+0.002
ach+0.002
ings+0.002
 lunch+0.002
Neg logit contributions
 forgot-0.013
'd-0.012
 remembered-0.011
 regretted-0.011
 missed-0.011
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
ummy+0.002
 ourselves+0.002
ach+0.002
 lunch+0.002
ings+0.002
Neg logit contributions
 forgot-0.009
'd-0.008
 remembered-0.008
 cared-0.008
 missed-0.008
Token notebook
Token ID20922
Feature activation-2.75e-02

Pos logit contributions
 forgot+0.050
'd+0.045
 remembered+0.041
 missed+0.041
 regretted+0.041
Neg logit contributions
ummy-0.031
 ourselves-0.029
 lunch-0.029
ach-0.028
 something-0.028
Token
Token ID220
Feature activation+8.94e-03

Pos logit contributions
 forgot+0.009
'd+0.008
 remembered+0.008
 regretted+0.007
 missed+0.007
Neg logit contributions
ummy-0.005
 ourselves-0.005
 lunch-0.005
ach-0.005
ings-0.005
Token
Token ID198
Feature activation+2.57e-02

Pos logit contributions
 ourselves+0.013
ach+0.013
ummy+0.013
 lunch+0.012
 yourselves+0.012
Neg logit contributions
 forgot-0.018
'd-0.017
 knew-0.016
 missed-0.015
 hadn-0.015
Token
Token ID198
Feature activation+7.16e-03

Pos logit contributions
 ourselves+0.038
ummy+0.037
 yourselves+0.035
ach+0.035
 lunch+0.034
Neg logit contributions
 forgot-0.053
'd-0.049
 knew-0.046
 missed-0.043
 cared-0.043
TokenAs
Token ID1722
Feature activation+2.66e-02

Pos logit contributions
ummy+0.010
ach+0.009
 ourselves+0.009
ings+0.009
time+0.009
Neg logit contributions
 forgot-0.016
'd-0.015
 remembered-0.014
 regretted-0.014
 missed-0.014
Token P
Token ID350
Feature activation-5.16e-02

Pos logit contributions
ummy+0.035
 ourselves+0.034
ach+0.032
 lunch+0.032
ings+0.032
Neg logit contributions
 forgot-0.062
'd-0.056
 remembered-0.052
 regretted-0.051
 missed-0.051
Tokeningu
Token ID6680
Feature activation-1.29e-01

Pos logit contributions
 forgot+0.083
'd+0.071
 remembered+0.063
 regretted+0.061
 missed+0.061
Neg logit contributions
ummy-0.107
 ourselves-0.102
ach-0.102
 lunch-0.101
ings-0.100
Token was
Token ID373
Feature activation+8.84e-04

Pos logit contributions
 forgot+0.192
'd+0.178
 hadn+0.164
 cared+0.158
ave+0.156
Neg logit contributions
 lunch-0.243
 dinner-0.232
 the-0.226
,-0.225
 something-0.225
Token resting
Token ID19186
Feature activation+2.65e-02

Pos logit contributions
ummy+0.001
 ourselves+0.001
ach+0.001
ings+0.001
.'+0.001
Neg logit contributions
 forgot-0.002
'd-0.002
 remembered-0.002
 missed-0.002
 regretted-0.002
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
 ourselves+0.028
ach+0.026
ummy+0.025
.'+0.023
ings+0.023
Neg logit contributions
 forgot-0.070
 remembered-0.063
'd-0.062
 knew-0.062
 missed-0.060
Token a
Token ID257
Feature activation-2.47e-02

Pos logit contributions
ummy+0.023
 ourselves+0.023
ach+0.022
ings+0.022
 lunch+0.021
Neg logit contributions
 forgot-0.031
'd-0.028
 remembered-0.026
 regretted-0.025
 missed-0.025
Token group
Token ID1448
Feature activation-2.16e-02

Pos logit contributions
 forgot+0.059
'd+0.054
ave+0.051
 struggled+0.050
 remembered+0.049
Neg logit contributions
ummy-0.029
 something-0.028
 lunch-0.027
time-0.027
 outside-0.026
Token very
Token ID845
Feature activation-3.17e-02

Pos logit contributions
 forgot+0.058
'd+0.054
 regretted+0.051
ave+0.050
 cared+0.050
Neg logit contributions
 the-0.034
 something-0.034
 up-0.033
 outside-0.032
 with-0.032
Token enthusiastic
Token ID21848
Feature activation+1.20e-02

Pos logit contributions
'd+0.066
 forgot+0.065
ave+0.060
 regretted+0.060
 cared+0.059
Neg logit contributions
 with-0.041
 the-0.041
 up-0.041
 something-0.040
 early-0.040
Token.
Token ID13
Feature activation-7.56e-03

Pos logit contributions
 ourselves+0.008
ummy+0.008
ach+0.007
ings+0.007
 lunch+0.007
Neg logit contributions
 forgot-0.035
'd-0.033
 remembered-0.031
 cared-0.030
 regretted-0.030
Token Every
Token ID3887
Feature activation-9.64e-03

Pos logit contributions
 forgot+0.018
'd+0.016
 remembered+0.015
 regretted+0.015
 missed+0.015
Neg logit contributions
ummy-0.010
 ourselves-0.009
ach-0.009
ings-0.009
 lunch-0.009
Token time
Token ID640
Feature activation-2.56e-03

Pos logit contributions
 forgot+0.024
'd+0.022
 remembered+0.020
 regretted+0.020
 missed+0.020
Neg logit contributions
ummy-0.011
 lunch-0.010
time-0.010
 something-0.010
 ourselves-0.010
Token Amelia
Token ID49769
Feature activation-1.28e-01

Pos logit contributions
 forgot+0.007
'd+0.006
 remembered+0.006
 regretted+0.006
 missed+0.006
Neg logit contributions
ummy-0.002
 ourselves-0.002
 something-0.002
 lunch-0.002
ings-0.002
Token got
Token ID1392
Feature activation-1.03e-02

Pos logit contributions
 forgot+0.174
Uh+0.171
 officer+0.159
 hadn+0.154
'd+0.152
Neg logit contributions
,-0.243
 the-0.226
 with-0.224
 outside-0.221
 that-0.219
Token an
Token ID281
Feature activation-2.33e-02

Pos logit contributions
 forgot+0.031
'd+0.029
 hadn+0.027
 regretted+0.027
 missed+0.027
Neg logit contributions
 up-0.006
 something-0.006
 outside-0.006
 lunch-0.005
ummy-0.005
Token answer
Token ID3280
Feature activation-1.58e-02

Pos logit contributions
 forgot+0.050
'd+0.045
 remembered+0.041
 regretted+0.040
 missed+0.040
Neg logit contributions
ummy-0.035
 ourselves-0.034
ach-0.033
 lunch-0.032
ings-0.032
Token right
Token ID826
Feature activation-8.54e-03

Pos logit contributions
 forgot+0.050
'd+0.046
 remembered+0.044
 regretted+0.044
 missed+0.044
Neg logit contributions
ummy-0.007
 ourselves-0.006
ach-0.006
ings-0.006
 lunch-0.006
Token her
Token ID607
Feature activation-4.43e-02

Pos logit contributions
 forgot+0.024
'd+0.022
 remembered+0.021
 regretted+0.020
 knew+0.020
Neg logit contributions
ummy-0.007
 ourselves-0.007
 lunch-0.006
ach-0.006
ings-0.006
Token it
Token ID340
Feature activation-4.92e-02

Pos logit contributions
 forgot+0.108
'd+0.099
 remembered+0.095
 regretted+0.094
 struggled+0.093
Neg logit contributions
 ourselves-0.026
ummy-0.024
 something-0.024
 lunch-0.022
ings-0.022
Token was
Token ID373
Feature activation-7.59e-03

Pos logit contributions
 forgot+0.097
 regretted+0.081
ave+0.081
 struggled+0.078
Uh+0.078
Neg logit contributions
 something-0.076
.-0.076
 up-0.076
 that-0.075
 with-0.073
Token Spanish
Token ID7897
Feature activation-1.58e-02

Pos logit contributions
 forgot+0.020
'd+0.018
 regretted+0.017
 remembered+0.017
 struggled+0.017
Neg logit contributions
 ourselves-0.007
 something-0.007
ummy-0.007
 lunch-0.007
 up-0.007
Token.
Token ID13
Feature activation+8.73e-03

Pos logit contributions
 forgot+0.052
'd+0.048
 remembered+0.046
 struggled+0.046
 regretted+0.046
Neg logit contributions
ummy-0.005
 ourselves-0.004
 something-0.004
 that-0.003
ings-0.003
Token J
Token ID449
Feature activation-4.52e-02

Pos logit contributions
ummy+0.009
 ourselves+0.009
ach+0.009
 lunch+0.008
ings+0.008
Neg logit contributions
 forgot-0.023
'd-0.021
 remembered-0.019
 regretted-0.019
 missed-0.019
Tokenax
Token ID897
Feature activation-1.26e-01

Pos logit contributions
 forgot+0.086
'd+0.077
 knew+0.071
 regretted+0.069
 remembered+0.069
Neg logit contributions
 ourselves-0.080
 lunch-0.074
ummy-0.073
time-0.072
ings-0.072
Token asked
Token ID1965
Feature activation-1.27e-07

Pos logit contributions
 forgot+0.227
Uh+0.210
 ounces+0.203
'd+0.197
 strongest+0.197
Neg logit contributions
,-0.203
 with-0.194
 that-0.190
 the-0.184
 up-0.176
Token them
Token ID606
Feature activation-5.29e-02

Feature not active on prev token; no predictions were affected.
Token to
Token ID284
Feature activation-1.74e-02

Pos logit contributions
 forgot+0.152
'd+0.140
 remembered+0.132
 regretted+0.130
 missed+0.130
Neg logit contributions
ummy-0.040
 ourselves-0.038
ach-0.035
 lunch-0.034
ings-0.034
Token teach
Token ID4545
Feature activation+3.78e-02

Pos logit contributions
 forgot+0.035
'd+0.031
 remembered+0.028
 missed+0.027
 regretted+0.027
Neg logit contributions
ummy-0.029
 ourselves-0.028
ach-0.027
ings-0.027
 lunch-0.026
Token him
Token ID683
Feature activation-3.81e-02

Pos logit contributions
ummy+0.022
 ourselves+0.019
ach+0.018
ings+0.017
 lunch+0.016
Neg logit contributions
 forgot-0.114
'd-0.106
 remembered-0.101
 cared-0.100
 regretted-0.099
Token
Token ID220
Feature activation+5.33e-03

Pos logit contributions
 forgot+0.017
'd+0.015
 remembered+0.014
 regretted+0.014
 missed+0.014
Neg logit contributions
ummy-0.008
 ourselves-0.008
ach-0.007
 lunch-0.007
ings-0.007
Token
Token ID198
Feature activation+2.84e-02

Pos logit contributions
ach+0.008
ummy+0.007
 ourselves+0.007
 lunch+0.007
 dinner+0.007
Neg logit contributions
 forgot-0.011
'd-0.010
 knew-0.009
 hadn-0.009
 missed-0.009
Token
Token ID198
Feature activation+4.74e-03

Pos logit contributions
 ourselves+0.035
ummy+0.035
ach+0.034
 yourselves+0.033
 lunch+0.033
Neg logit contributions
 forgot-0.065
'd-0.061
 knew-0.056
 remembered-0.055
 missed-0.054
TokenWhenever
Token ID28877
Feature activation+1.62e-02

Pos logit contributions
ummy+0.006
 ourselves+0.006
ach+0.006
time+0.006
 lunch+0.006
Neg logit contributions
 forgot-0.011
'd-0.010
 remembered-0.009
 regretted-0.009
 missed-0.009
Token M
Token ID337
Feature activation-3.79e-02

Pos logit contributions
ummy+0.020
 ourselves+0.020
ach+0.020
ings+0.019
 lunch+0.018
Neg logit contributions
 forgot-0.039
'd-0.035
 remembered-0.033
 knew-0.032
 regretted-0.032
Tokenandy
Token ID10757
Feature activation-1.26e-01

Pos logit contributions
 forgot+0.086
'd+0.078
 remembered+0.072
 regretted+0.070
 knew+0.070
Neg logit contributions
 ourselves-0.050
ummy-0.049
ach-0.047
 lunch-0.046
ings-0.046
Token went
Token ID1816
Feature activation+2.55e-04

Pos logit contributions
 forgot+0.179
Uh+0.161
'd+0.153
olded+0.149
 switched+0.148
Neg logit contributions
,-0.228
 outside-0.221
?-0.219
 the-0.219
?"-0.219
Token out
Token ID503
Feature activation+8.46e-03

Pos logit contributions
ummy+0.000
 ourselves+0.000
ach+0.000
ings+0.000
.'+0.000
Neg logit contributions
 forgot-0.001
'd-0.001
 remembered-0.001
 missed-0.001
 regretted-0.001
Token,
Token ID11
Feature activation-7.33e-03

Pos logit contributions
ummy+0.007
 ourselves+0.007
ach+0.006
ings+0.006
 lunch+0.006
Neg logit contributions
 forgot-0.024
'd-0.022
 remembered-0.021
 cared-0.020
 missed-0.020
Token she
Token ID673
Feature activation-9.21e-02

Pos logit contributions
 forgot+0.021
'd+0.019
 remembered+0.018
 regretted+0.018
 missed+0.018
Neg logit contributions
ummy-0.006
 ourselves-0.006
 something-0.005
 lunch-0.005
ings-0.005
Token would
Token ID561
Feature activation-1.82e-02

Pos logit contributions
Uh+0.121
 forgot+0.106
 ounces+0.104
 sped+0.095
ave+0.095
Neg logit contributions
?-0.179
,-0.179
?"-0.177
 outside-0.175
.-0.174
Token special
Token ID2041
Feature activation-1.88e-02

Pos logit contributions
 forgot+0.048
ave+0.045
 struggled+0.043
 sped+0.041
'd+0.041
Neg logit contributions
 something-0.026
 lunch-0.025
ummy-0.024
 outside-0.023
 dinner-0.023
Token fairy
Token ID25607
Feature activation-3.47e-02

Pos logit contributions
 forgot+0.043
'd+0.039
 remembered+0.036
 regretted+0.035
 missed+0.035
Neg logit contributions
ummy-0.025
 ourselves-0.024
ach-0.023
 lunch-0.023
ings-0.023
Token.
Token ID13
Feature activation-2.00e-03

Pos logit contributions
 forgot+0.100
'd+0.092
 remembered+0.087
 regretted+0.086
 missed+0.086
Neg logit contributions
ummy-0.024
 ourselves-0.023
ach-0.021
 lunch-0.021
ings-0.021
Token Whenever
Token ID21326
Feature activation+6.96e-03

Pos logit contributions
 forgot+0.005
'd+0.005
 remembered+0.004
 regretted+0.004
 missed+0.004
Neg logit contributions
ummy-0.002
 ourselves-0.002
ach-0.002
 lunch-0.002
ings-0.002
Token the
Token ID262
Feature activation-3.52e-02

Pos logit contributions
ummy+0.005
 ourselves+0.005
ach+0.005
ings+0.004
 lunch+0.004
Neg logit contributions
 forgot-0.020
'd-0.019
 remembered-0.018
 missed-0.017
 regretted-0.017
Token girl
Token ID2576
Feature activation-1.26e-01

Pos logit contributions
 forgot+0.073
'd+0.067
 sped+0.061
 managed+0.060
 struggled+0.060
Neg logit contributions
ummy-0.052
 lunch-0.052
 something-0.052
 ourselves-0.049
 outside-0.048
Token wore
Token ID12408
Feature activation+8.43e-03

Pos logit contributions
Uh+0.184
 forgot+0.175
 sped+0.165
 sizes+0.165
chie+0.163
Neg logit contributions
,-0.228
 with-0.214
?-0.205
 that-0.201
 up-0.199
Token the
Token ID262
Feature activation-2.60e-02

Pos logit contributions
ummy+0.006
ach+0.005
 ourselves+0.005
ings+0.004
 lunch+0.004
Neg logit contributions
 forgot-0.025
'd-0.023
 remembered-0.023
 missed-0.022
 cared-0.022
Token necklace
Token ID38987
Feature activation-4.40e-02

Pos logit contributions
 forgot+0.056
'd+0.051
 remembered+0.046
 regretted+0.045
 missed+0.045
Neg logit contributions
ummy-0.037
 ourselves-0.037
 lunch-0.036
 something-0.036
ings-0.035
Token,
Token ID11
Feature activation-1.52e-02

Pos logit contributions
 forgot+0.145
'd+0.135
 regretted+0.128
 remembered+0.128
 missed+0.127
Neg logit contributions
ummy-0.013
 ourselves-0.010
 something-0.010
 lunch-0.009
ings-0.009
Token she
Token ID673
Feature activation-9.73e-02

Pos logit contributions
 forgot+0.045
'd+0.042
 remembered+0.039
 regretted+0.039
 missed+0.039
Neg logit contributions
ummy-0.009
 ourselves-0.009
 something-0.009
 lunch-0.008
ings-0.008
Tokenummy
Token ID13513
Feature activation-4.37e-03

Pos logit contributions
 something+0.007
 your+0.006
 ourselves+0.006
 yourselves+0.006
?"+0.006
Neg logit contributions
 forgot-0.028
'd-0.025
 cared-0.024
ave-0.024
 regretted-0.024
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
 forgot+0.013
'd+0.012
 remembered+0.011
 regretted+0.011
 cared+0.011
Neg logit contributions
 ourselves-0.003
ummy-0.003
ach-0.003
 yourselves-0.002
ings-0.002
Token
Token ID198
Feature activation+2.16e-02

Pos logit contributions
 forgot+0.033
'd+0.030
 regretted+0.028
 remembered+0.028
 missed+0.027
Neg logit contributions
ummy-0.012
ings-0.011
 ourselves-0.011
 something-0.010
 up-0.010
Token
Token ID198
Feature activation-5.67e-03

Pos logit contributions
ummy+0.026
 ourselves+0.025
ach+0.024
 lunch+0.023
 yourselves+0.023
Neg logit contributions
 forgot-0.052
'd-0.048
 knew-0.045
 remembered-0.045
 missed-0.044
TokenWhenever
Token ID28877
Feature activation+6.20e-04

Pos logit contributions
 forgot+0.014
'd+0.012
 regretted+0.012
 missed+0.011
 remembered+0.011
Neg logit contributions
 ourselves-0.006
time-0.006
ummy-0.006
 something-0.006
 that-0.006
Token Lynn
Token ID24868
Feature activation-1.26e-01

Pos logit contributions
ummy+0.001
 ourselves+0.001
ach+0.001
 lunch+0.001
ings+0.001
Neg logit contributions
 forgot-0.001
'd-0.001
 remembered-0.001
 regretted-0.001
 missed-0.001
Token ate
Token ID15063
Feature activation+3.83e-02

Pos logit contributions
 forgot+0.175
'd+0.159
 sped+0.146
 arrested+0.135
 switched+0.135
Neg logit contributions
?-0.229
,-0.228
 that-0.224
 the-0.222
 with-0.222
Token her
Token ID607
Feature activation-3.52e-02

Pos logit contributions
 ourselves+0.040
ach+0.040
ummy+0.036
.'+0.034
 yourselves+0.032
Neg logit contributions
 forgot-0.091
 remembered-0.081
 knew-0.080
'd-0.079
 missed-0.079
Token printed
Token ID10398
Feature activation+4.88e-03

Pos logit contributions
 forgot+0.090
'd+0.081
 remembered+0.077
 cared+0.075
 missed+0.075
Neg logit contributions
ummy-0.038
 ourselves-0.037
ach-0.035
ings-0.034
 yourselves-0.032
Token food
Token ID2057
Feature activation-1.33e-02

Pos logit contributions
 ourselves+0.004
ummy+0.003
ach+0.003
 yourselves+0.003
ings+0.003
Neg logit contributions
 forgot-0.014
'd-0.013
 remembered-0.012
 cared-0.012
 knew-0.012
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
 forgot+0.039
'd+0.036
 remembered+0.034
 knew+0.033
 cared+0.033
Neg logit contributions
 ourselves-0.009
ach-0.009
ummy-0.008
 yourselves-0.007
ings-0.007
Token as
Token ID355
Feature activation+1.40e-02

Pos logit contributions
 forgot+0.011
'd+0.010
 remembered+0.009
 regretted+0.009
 missed+0.009
Neg logit contributions
ummy-0.002
 ourselves-0.002
ach-0.002
 lunch-0.002
ings-0.002
Token Hel
Token ID5053
Feature activation-1.81e-02

Pos logit contributions
ummy+0.012
 ourselves+0.012
ach+0.011
 lunch+0.011
 something+0.010
Neg logit contributions
 forgot-0.039
'd-0.035
 remembered-0.033
 regretted-0.033
 missed-0.033
Tokenio
Token ID952
Feature activation-8.99e-02

Pos logit contributions
 forgot+0.033
'd+0.029
 remembered+0.026
 regretted+0.026
 missed+0.026
Neg logit contributions
ummy-0.033
 ourselves-0.032
ach-0.031
 lunch-0.031
ings-0.031
Token.
Token ID13
Feature activation-4.31e-03

Pos logit contributions
 forgot+0.176
 missed+0.144
Uh+0.144
 regretted+0.143
'd+0.141
Neg logit contributions
.-0.143
 that-0.139
,-0.134
 ourselves-0.134
 something-0.133
Token Hel
Token ID5053
Feature activation-2.97e-02

Pos logit contributions
 forgot+0.011
'd+0.010
 remembered+0.010
 regretted+0.009
 missed+0.009
Neg logit contributions
ummy-0.005
 ourselves-0.004
ach-0.004
ings-0.004
 something-0.004
Tokenio
Token ID952
Feature activation-1.25e-01

Pos logit contributions
 forgot+0.061
'd+0.054
 remembered+0.049
 missed+0.048
 cared+0.048
Neg logit contributions
ummy-0.048
ach-0.048
 ourselves-0.046
ings-0.045
 lunch-0.043
Token explained
Token ID4893
Feature activation+1.01e-02

Pos logit contributions
 forgot+0.210
Uh+0.173
 regretted+0.165
 missed+0.165
'd+0.159
Neg logit contributions
 with-0.214
 that-0.213
,-0.212
.-0.208
 up-0.201
Token that
Token ID326
Feature activation+1.20e-04

Pos logit contributions
ummy+0.007
 ourselves+0.006
ach+0.006
 lunch+0.006
ings+0.006
Neg logit contributions
 forgot-0.029
'd-0.027
 remembered-0.026
 regretted-0.026
 knew-0.026
Token their
Token ID511
Feature activation-4.06e-02

Pos logit contributions
 something+0.000
 ourselves+0.000
 that+0.000
ummy+0.000
 lunch+0.000
Neg logit contributions
 forgot-0.000
'd-0.000
ave-0.000
 remembered-0.000
 regretted-0.000
Token oxygen
Token ID11863
Feature activation-5.23e-02

Pos logit contributions
 forgot+0.094
'd+0.080
ave+0.076
 remembered+0.075
 cared+0.075
Neg logit contributions
 ourselves-0.056
 lunch-0.055
ummy-0.055
 up-0.054
ach-0.053
Token was
Token ID373
Feature activation-1.61e-02

Pos logit contributions
 forgot+0.098
'd+0.081
 remembered+0.078
 regretted+0.077
ave+0.076
Neg logit contributions
ummy-0.087
 ourselves-0.086
 that-0.086
 up-0.086
ings-0.085
Token special
Token ID2041
Feature activation-2.44e-02

Pos logit contributions
 forgot+0.050
'd+0.046
 remembered+0.042
 regretted+0.042
 missed+0.041
Neg logit contributions
ummy-0.029
 ourselves-0.029
 lunch-0.027
ings-0.027
ach-0.027
Token thing
Token ID1517
Feature activation-2.05e-02

Pos logit contributions
 forgot+0.056
'd+0.051
 remembered+0.047
 regretted+0.047
 missed+0.046
Neg logit contributions
ummy-0.032
 ourselves-0.032
 lunch-0.031
ings-0.030
ach-0.030
Token.
Token ID13
Feature activation-7.33e-03

Pos logit contributions
 forgot+0.064
'd+0.059
 remembered+0.057
 regretted+0.056
 missed+0.056
Neg logit contributions
ummy-0.011
 ourselves-0.009
 lunch-0.008
ings-0.008
ach-0.008
Token Even
Token ID3412
Feature activation-8.56e-03

Pos logit contributions
 forgot+0.018
'd+0.017
 remembered+0.016
 regretted+0.015
 cared+0.015
Neg logit contributions
ummy-0.009
 ourselves-0.008
ings-0.008
ach-0.008
 lunch-0.008
Token though
Token ID996
Feature activation+1.56e-02

Pos logit contributions
 forgot+0.023
'd+0.022
 regretted+0.020
 remembered+0.020
 missed+0.020
Neg logit contributions
 something-0.007
ummy-0.007
 up-0.007
 ourselves-0.007
 the-0.007
Token Sam
Token ID3409
Feature activation-1.25e-01

Pos logit contributions
ummy+0.016
ach+0.015
 ourselves+0.015
ings+0.014
 lunch+0.013
Neg logit contributions
 forgot-0.041
'd-0.037
 remembered-0.035
 missed-0.034
 regretted-0.034
Token still
Token ID991
Feature activation-7.11e-02

Pos logit contributions
Uh+0.179
 arrested+0.178
 officer+0.174
 forgot+0.171
'd+0.169
Neg logit contributions
,-0.209
 with-0.205
 that-0.199
 the-0.199
.-0.198
Token didn
Token ID1422
Feature activation+1.94e-02

Pos logit contributions
 forgot+0.078
'd+0.078
Uh+0.076
 arrested+0.075
 sped+0.070
Neg logit contributions
.-0.147
,-0.145
 the-0.144
 that-0.142
 something-0.142
Token't
Token ID470
Feature activation-1.40e-02

Pos logit contributions
ach+0.032
 ourselves+0.032
ummy+0.032
 lunch+0.031
 yourselves+0.030
Neg logit contributions
 forgot-0.034
'd-0.032
 missed-0.028
 hadn-0.028
 knew-0.027
Token like
Token ID588
Feature activation+9.37e-03

Pos logit contributions
 forgot+0.031
'd+0.029
 regretted+0.028
 remembered+0.028
 hadn+0.027
Neg logit contributions
 that-0.017
 ourselves-0.017
 something-0.017
 with-0.017
.-0.016
Token dancing
Token ID15360
Feature activation-2.59e-03

Pos logit contributions
ummy+0.008
 ourselves+0.007
ach+0.007
ings+0.006
 lunch+0.006
Neg logit contributions
 forgot-0.026
'd-0.024
 remembered-0.023
 regretted-0.023
 missed-0.023
Token jungle
Token ID20712
Feature activation+2.25e-02

Pos logit contributions
 ourselves+0.024
ach+0.023
ummy+0.023
ings+0.023
 lunch+0.021
Neg logit contributions
 forgot-0.031
'd-0.027
 remembered-0.025
 regretted-0.025
 missed-0.025
Token.
Token ID13
Feature activation-1.33e-04

Pos logit contributions
ummy+0.016
 ourselves+0.015
ach+0.014
ings+0.013
 lunch+0.012
Neg logit contributions
 forgot-0.065
'd-0.060
 remembered-0.057
 cared-0.056
 knew-0.056
Token
Token ID198
Feature activation+2.56e-02

Pos logit contributions
 forgot+0.000
'd+0.000
 missed+0.000
 remembered+0.000
 regretted+0.000
Neg logit contributions
 ourselves-0.000
ach-0.000
ummy-0.000
ings-0.000
 yourselves-0.000
Token
Token ID198
Feature activation+9.14e-03

Pos logit contributions
 ourselves+0.040
ummy+0.038
ach+0.037
 yourselves+0.036
 lunch+0.035
Neg logit contributions
 forgot-0.053
'd-0.050
 knew-0.045
 missed-0.044
 remembered-0.043
TokenAs
Token ID1722
Feature activation+2.18e-02

Pos logit contributions
ummy+0.016
 ourselves+0.015
ach+0.015
ings+0.015
 lunch+0.015
Neg logit contributions
 forgot-0.018
'd-0.016
 remembered-0.014
 regretted-0.014
 missed-0.014
Token Hannah
Token ID26013
Feature activation-1.25e-01

Pos logit contributions
ummy+0.027
 ourselves+0.027
ach+0.027
ings+0.026
 lunch+0.024
Neg logit contributions
 forgot-0.053
'd-0.047
 remembered-0.044
 missed-0.043
 regretted-0.043
Token walked
Token ID6807
Feature activation+5.56e-03

Pos logit contributions
 forgot+0.192
'd+0.174
 hadn+0.153
ave+0.151
 cared+0.150
Neg logit contributions
 outside-0.225
,-0.224
 the-0.217
 with-0.215
 lunch-0.213
Token further
Token ID2252
Feature activation+1.33e-02

Pos logit contributions
ummy+0.006
 ourselves+0.005
ach+0.005
ings+0.005
 yourselves+0.005
Neg logit contributions
 forgot-0.015
'd-0.013
 remembered-0.013
 knew-0.012
 sped-0.012
Token and
Token ID290
Feature activation-3.56e-02

Pos logit contributions
ummy+0.014
 ourselves+0.012
ach+0.012
 lunch+0.012
time+0.012
Neg logit contributions
 forgot-0.035
'd-0.031
 remembered-0.031
 knew-0.029
 regretted-0.029
Token further
Token ID2252
Feature activation+4.31e-03

Pos logit contributions
 forgot+0.090
'd+0.085
 cared+0.078
 hadn+0.078
 regretted+0.078
Neg logit contributions
 something-0.035
 up-0.034
 outside-0.033
 ourselves-0.033
 lunch-0.032
Token away
Token ID1497
Feature activation+1.52e-02

Pos logit contributions
ummy+0.004
 ourselves+0.003
ach+0.003
 lunch+0.003
ings+0.003
Neg logit contributions
 forgot-0.012
'd-0.011
 remembered-0.011
 regretted-0.010
 missed-0.010
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
 forgot+0.034
'd+0.031
 remembered+0.029
 regretted+0.029
 missed+0.029
Neg logit contributions
ummy-0.008
 ourselves-0.007
ach-0.007
 lunch-0.006
ings-0.006
Token
Token ID220
Feature activation+8.26e-03

Pos logit contributions
 ourselves+0.001
ach+0.001
ummy+0.001
ings+0.001
 lunch+0.001
Neg logit contributions
 forgot-0.002
'd-0.002
 remembered-0.002
 missed-0.002
 regretted-0.002
Token
Token ID198
Feature activation+3.40e-02

Pos logit contributions
ach+0.011
ummy+0.011
 lunch+0.011
 ourselves+0.011
 dinner+0.010
Neg logit contributions
 forgot-0.018
'd-0.016
 missed-0.015
 knew-0.015
 hadn-0.014
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 ourselves+0.050
ummy+0.049
ach+0.049
 yourselves+0.048
ings+0.046
Neg logit contributions
 forgot-0.071
'd-0.064
 knew-0.061
 missed-0.059
 remembered-0.057
TokenAs
Token ID1722
Feature activation+2.58e-02

Pos logit contributions
ummy+0.017
 ourselves+0.016
ach+0.016
ings+0.015
 lunch+0.015
Neg logit contributions
 forgot-0.022
'd-0.019
 remembered-0.018
 regretted-0.017
 missed-0.017
Token Holly
Token ID7764
Feature activation-1.25e-01

Pos logit contributions
ummy+0.033
 ourselves+0.033
ach+0.033
ings+0.031
 yourselves+0.029
Neg logit contributions
 forgot-0.062
'd-0.054
 remembered-0.052
 missed-0.051
 regretted-0.050
Token's
Token ID338
Feature activation-2.73e-02

Pos logit contributions
 forgot+0.183
'd+0.167
mo+0.155
 coward+0.150
 regretted+0.148
Neg logit contributions
,-0.234
 outside-0.222
 the-0.221
 with-0.220
 that-0.219
Token mother
Token ID2802
Feature activation-1.12e-01

Pos logit contributions
 forgot+0.062
'd+0.055
 regretted+0.050
 cared+0.049
 remembered+0.049
Neg logit contributions
ummy-0.043
 lunch-0.039
 outside-0.036
time-0.036
 dinner-0.036
Token walked
Token ID6807
Feature activation+8.52e-03

Pos logit contributions
 forgot+0.183
'd+0.160
 sizes+0.147
 regretted+0.143
 coward+0.140
Neg logit contributions
,-0.202
 outside-0.199
 the-0.194
 with-0.193
 that-0.192
Token away
Token ID1497
Feature activation+2.68e-03

Pos logit contributions
ummy+0.009
 ourselves+0.008
ach+0.008
ings+0.008
 lunch+0.008
Neg logit contributions
 forgot-0.022
'd-0.020
 remembered-0.019
 regretted-0.019
 missed-0.019
Token,
Token ID11
Feature activation+4.83e-03

Pos logit contributions
ummy+0.001
 ourselves+0.001
ach+0.001
 lunch+0.001
ings+0.001
Neg logit contributions
 forgot-0.008
'd-0.008
 remembered-0.007
 regretted-0.007
 missed-0.007
Token noticed
Token ID6810
Feature activation+2.67e-02

Pos logit contributions
 forgot+0.160
'd+0.146
Uh+0.146
ave+0.142
 ounces+0.134
Neg logit contributions
,-0.188
 the-0.180
 that-0.180
 with-0.177
.-0.174
Token something
Token ID1223
Feature activation-2.46e-02

Pos logit contributions
ach+0.024
ummy+0.024
ings+0.022
 ourselves+0.021
uggle+0.020
Neg logit contributions
 forgot-0.071
 knew-0.066
 remembered-0.065
'd-0.065
 wished-0.065
Token strange
Token ID6283
Feature activation+6.19e-03

Pos logit contributions
 forgot+0.054
'd+0.049
 remembered+0.045
 regretted+0.044
 missed+0.044
Neg logit contributions
ummy-0.035
 ourselves-0.035
ach-0.033
ings-0.032
 lunch-0.032
Token.
Token ID13
Feature activation+3.30e-03

Pos logit contributions
ummy+0.006
 ourselves+0.006
 yourselves+0.005
ach+0.005
uggle+0.005
Neg logit contributions
 forgot-0.017
'd-0.016
 remembered-0.015
 cared-0.015
 knew-0.015
Token Whenever
Token ID21326
Feature activation+1.37e-02

Pos logit contributions
 ourselves+0.004
ach+0.004
ummy+0.004
ings+0.004
 lunch+0.004
Neg logit contributions
 forgot-0.007
'd-0.007
 remembered-0.006
 missed-0.006
 regretted-0.006
Token Molly
Token ID30236
Feature activation-1.25e-01

Pos logit contributions
ach+0.016
ummy+0.016
ings+0.015
 ourselves+0.015
ags+0.014
Neg logit contributions
 forgot-0.035
'd-0.031
 remembered-0.031
 missed-0.029
 knew-0.029
Token said
Token ID531
Feature activation-1.57e-02

Pos logit contributions
 forgot+0.168
'd+0.158
ave+0.157
 sizes+0.150
via+0.150
Neg logit contributions
,-0.223
 that-0.213
 outside-0.212
 the-0.211
 with-0.209
Token the
Token ID262
Feature activation-3.63e-02

Pos logit contributions
 forgot+0.044
'd+0.041
 remembered+0.039
 knew+0.038
 regretted+0.038
Neg logit contributions
ummy-0.013
ach-0.012
 ourselves-0.012
ings-0.011
 lunch-0.011
Token word
Token ID1573
Feature activation-3.92e-02

Pos logit contributions
 forgot+0.084
'd+0.075
 remembered+0.069
 managed+0.067
 cared+0.067
Neg logit contributions
 ourselves-0.048
 something-0.048
ummy-0.047
 outside-0.046
 up-0.046
Token "
Token ID366
Feature activation-3.24e-02

Pos logit contributions
 forgot+0.104
'd+0.091
 regretted+0.086
 remembered+0.086
 struggled+0.085
Neg logit contributions
 something-0.039
 that-0.038
ummy-0.038
ings-0.036
time-0.036
Tokeny
Token ID88
Feature activation-5.21e-02

Pos logit contributions
 forgot+0.071
'd+0.059
 regretted+0.058
 remembered+0.057
 missed+0.056
Neg logit contributions
 ourselves-0.047
ach-0.046
ings-0.046
time-0.046
 something-0.045
Token shore
Token ID15191
Feature activation-1.21e-02

Pos logit contributions
 forgot+0.049
'd+0.044
 remembered+0.041
 cared+0.041
 missed+0.041
Neg logit contributions
 ourselves-0.021
ummy-0.021
 lunch-0.020
ach-0.019
 something-0.019
Token.
Token ID13
Feature activation-7.37e-03

Pos logit contributions
 forgot+0.037
'd+0.034
 remembered+0.032
 missed+0.032
 regretted+0.032
Neg logit contributions
ummy-0.007
 ourselves-0.006
ach-0.006
ings-0.006
 lunch-0.005
Token
Token ID198
Feature activation+1.98e-02

Pos logit contributions
 forgot+0.019
'd+0.017
 remembered+0.016
 regretted+0.016
 struggled+0.016
Neg logit contributions
ummy-0.008
 ourselves-0.008
ings-0.008
 something-0.007
ach-0.007
Token
Token ID198
Feature activation+9.38e-04

Pos logit contributions
ummy+0.017
 ourselves+0.016
ach+0.015
 lunch+0.015
ings+0.015
Neg logit contributions
 forgot-0.055
'd-0.050
 remembered-0.047
 missed-0.047
 regretted-0.046
TokenTr
Token ID2898
Feature activation-8.03e-02

Pos logit contributions
 ourselves+0.001
ach+0.001
time+0.001
 something+0.001
ings+0.001
Neg logit contributions
 forgot-0.002
'd-0.002
 remembered-0.002
 regretted-0.002
 missed-0.002
Tokenacey
Token ID25415
Feature activation-1.25e-01

Pos logit contributions
 forgot+0.143
'd+0.126
 roared+0.119
 sped+0.116
 regretted+0.112
Neg logit contributions
ummy-0.152
ings-0.147
ach-0.141
ate-0.140
 ourselves-0.139
Token was
Token ID373
Feature activation-1.90e-02

Pos logit contributions
 forgot+0.213
Uh+0.189
'd+0.185
Um+0.176
 cared+0.174
Neg logit contributions
,-0.198
 that-0.198
 with-0.191
 the-0.187
.-0.181
Token relieved
Token ID22598
Feature activation+2.22e-02

Pos logit contributions
 forgot+0.045
'd+0.041
 regretted+0.038
 cared+0.038
ave+0.037
Neg logit contributions
 something-0.022
 up-0.022
 outside-0.022
 ourselves-0.021
 the-0.021
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 ourselves+0.020
ummy+0.020
ach+0.019
ings+0.018
 lunch+0.017
Neg logit contributions
 forgot-0.060
'd-0.055
 remembered-0.053
 knew-0.052
 regretted-0.051
Token Next
Token ID7406
Feature activation-9.86e-03

Pos logit contributions
 forgot+0.033
'd+0.030
 remembered+0.028
 regretted+0.027
 missed+0.027
Neg logit contributions
ummy-0.013
 ourselves-0.012
ings-0.012
ach-0.011
time-0.011
Token time
Token ID640
Feature activation+9.38e-04

Pos logit contributions
 forgot+0.028
'd+0.026
ave+0.025
 remembered+0.025
 regretted+0.024
Neg logit contributions
 something-0.007
ummy-0.007
 that-0.006
 lunch-0.006
time-0.006
Token recording
Token ID8296
Feature activation-8.87e-03

Pos logit contributions
 forgot+0.073
'd+0.065
 regretted+0.062
 remembered+0.061
 missed+0.061
Neg logit contributions
ummy-0.041
 ourselves-0.040
 lunch-0.040
 outside-0.039
time-0.039
Token very
Token ID845
Feature activation-1.14e-02

Pos logit contributions
 forgot+0.025
'd+0.023
 remembered+0.022
 regretted+0.022
 missed+0.022
Neg logit contributions
ummy-0.007
 ourselves-0.007
ings-0.007
 something-0.007
 lunch-0.006
Token tidy
Token ID43044
Feature activation-2.47e-03

Pos logit contributions
 forgot+0.027
'd+0.025
 regretted+0.023
 remembered+0.023
 cared+0.023
Neg logit contributions
ummy-0.015
 ourselves-0.014
ach-0.014
time-0.014
 up-0.013
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
 forgot+0.007
'd+0.006
 remembered+0.006
 regretted+0.006
 missed+0.006
Neg logit contributions
ummy-0.002
 ourselves-0.002
ach-0.002
ings-0.002
 lunch-0.002
Token Whenever
Token ID21326
Feature activation+1.24e-02

Pos logit contributions
 forgot+0.005
'd+0.004
 remembered+0.004
 regretted+0.004
 missed+0.004
Neg logit contributions
ummy-0.003
 ourselves-0.002
ach-0.002
 lunch-0.002
ings-0.002
Token Joe
Token ID5689
Feature activation-1.24e-01

Pos logit contributions
ummy+0.017
 ourselves+0.016
ach+0.016
ings+0.016
 lunch+0.015
Neg logit contributions
 forgot-0.029
'd-0.026
 remembered-0.024
 knew-0.024
 regretted-0.023
Token felt
Token ID2936
Feature activation-3.30e-03

Pos logit contributions
 forgot+0.181
Uh+0.170
Aw+0.163
'd+0.159
Looks+0.158
Neg logit contributions
?-0.213
,-0.212
 outside-0.208
 the-0.205
 with-0.203
Token sad
Token ID6507
Feature activation+1.11e-02

Pos logit contributions
 forgot+0.007
'd+0.007
 remembered+0.006
 hadn+0.006
 regretted+0.006
Neg logit contributions
 something-0.005
 up-0.005
 outside-0.005
ummy-0.005
 that-0.005
Token he
Token ID339
Feature activation-7.77e-02

Pos logit contributions
ummy+0.004
 ourselves+0.003
ach+0.003
 lunch+0.002
ings+0.002
Neg logit contributions
 forgot-0.037
'd-0.035
 remembered-0.033
 regretted-0.032
 missed-0.032
Token would
Token ID561
Feature activation-2.22e-02

Pos logit contributions
 forgot+0.096
Uh+0.092
 sped+0.082
 arrested+0.080
mo+0.076
Neg logit contributions
?-0.161
.-0.157
 outside-0.156
 up-0.153
,-0.152
Token watch
Token ID2342
Feature activation+2.65e-02

Pos logit contributions
 forgot+0.048
'd+0.042
 regretted+0.042
ave+0.041
 cared+0.040
Neg logit contributions
 that-0.029
 ourselves-0.029
 something-0.028
?-0.027
 with-0.027
TokenAfter
Token ID3260
Feature activation+1.45e-02

Pos logit contributions
ummy+0.005
 ourselves+0.005
ach+0.005
time+0.005
 something+0.004
Neg logit contributions
 forgot-0.009
'd-0.008
 remembered-0.008
 regretted-0.008
 missed-0.008
Token that
Token ID326
Feature activation-1.94e-02

Pos logit contributions
ummy+0.010
 ourselves+0.010
ach+0.009
ings+0.009
 lunch+0.008
Neg logit contributions
 forgot-0.042
'd-0.039
 remembered-0.037
 missed-0.036
 regretted-0.036
Token day
Token ID1110
Feature activation+8.37e-03

Pos logit contributions
 forgot+0.054
'd+0.049
 remembered+0.046
 missed+0.045
 regretted+0.045
Neg logit contributions
 ourselves-0.017
ummy-0.017
ach-0.016
ings-0.015
 lunch-0.015
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
ummy+0.006
 ourselves+0.005
ach+0.005
 lunch+0.005
ings+0.005
Neg logit contributions
 forgot-0.025
'd-0.023
 remembered-0.022
 regretted-0.021
 cared-0.021
Token whenever
Token ID8797
Feature activation+2.01e-02

Pos logit contributions
ummy+0.022
 ourselves+0.022
ach+0.021
ings+0.020
 lunch+0.020
Neg logit contributions
 forgot-0.047
'd-0.042
 remembered-0.039
 missed-0.038
 cared-0.038
Token Jenny
Token ID28437
Feature activation-1.24e-01

Pos logit contributions
ummy+0.026
ach+0.025
ings+0.025
 ourselves+0.025
 yourselves+0.022
Neg logit contributions
 forgot-0.048
'd-0.042
 remembered-0.040
 knew-0.039
 cared-0.038
Token went
Token ID1816
Feature activation+1.22e-02

Pos logit contributions
 forgot+0.158
'd+0.151
 arrested+0.134
 switched+0.133
 sped+0.132
Neg logit contributions
?-0.246
?"-0.234
,-0.233
.-0.225
 the-0.224
Token to
Token ID284
Feature activation+1.67e-02

Pos logit contributions
 ourselves+0.012
ummy+0.012
ach+0.011
ings+0.011
 itself+0.010
Neg logit contributions
 forgot-0.031
'd-0.028
 remembered-0.028
 regretted-0.027
 missed-0.027
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
ummy+0.023
ach+0.022
ires+0.021
ings+0.020
iring+0.019
Neg logit contributions
 forgot-0.038
 remembered-0.032
 missed-0.032
 knew-0.032
'd-0.031
Token park
Token ID3952
Feature activation-7.68e-03

Pos logit contributions
 forgot+0.028
'd+0.025
 remembered+0.024
 regretted+0.024
 missed+0.023
Neg logit contributions
ummy-0.014
 ourselves-0.014
ach-0.013
ings-0.013
 lunch-0.012
Token,
Token ID11
Feature activation-3.16e-03

Pos logit contributions
 forgot+0.023
'd+0.022
 remembered+0.020
 regretted+0.020
 missed+0.020
Neg logit contributions
ummy-0.005
 ourselves-0.004
ach-0.004
ings-0.004
 lunch-0.004
Tokenble
Token ID903
Feature activation+6.41e-03

Pos logit contributions
 lunch+0.027
ummy+0.025
uggle+0.024
 dinner+0.022
 ourselves+0.022
Neg logit contributions
 forgot-0.063
'd-0.058
 remembered-0.055
 sped-0.053
 switched-0.052
Token and
Token ID290
Feature activation-2.88e-02

Pos logit contributions
ummy+0.005
 ourselves+0.005
ach+0.004
ings+0.004
 lunch+0.004
Neg logit contributions
 forgot-0.018
'd-0.017
 remembered-0.016
 regretted-0.016
 missed-0.016
Token shake
Token ID13279
Feature activation+3.18e-02

Pos logit contributions
 forgot+0.070
'd+0.066
ave+0.064
 regretted+0.062
 cared+0.060
Neg logit contributions
 the-0.035
 something-0.033
 that-0.032
 up-0.032
 it-0.031
Token.
Token ID13
Feature activation+3.80e-03

Pos logit contributions
ummy+0.021
 ourselves+0.019
ach+0.018
 lunch+0.018
ings+0.016
Neg logit contributions
 forgot-0.094
'd-0.087
 remembered-0.083
 regretted-0.081
 missed-0.081
Token Tr
Token ID833
Feature activation-7.87e-02

Pos logit contributions
ummy+0.004
ings+0.004
 ourselves+0.004
 something+0.004
 itself+0.004
Neg logit contributions
 forgot-0.010
'd-0.009
 remembered-0.008
 regretted-0.008
 cared-0.008
Tokenacey
Token ID25415
Feature activation-1.24e-01

Pos logit contributions
 forgot+0.139
'd+0.124
 sped+0.118
 roared+0.117
 missed+0.111
Neg logit contributions
ummy-0.147
ings-0.141
ach-0.137
ate-0.133
.'-0.133
Token was
Token ID373
Feature activation-2.06e-02

Pos logit contributions
 forgot+0.176
Uh+0.162
Um+0.154
'd+0.153
 cared+0.150
Neg logit contributions
,-0.220
 that-0.216
 with-0.213
.-0.208
 the-0.203
Token scared
Token ID12008
Feature activation+3.19e-02

Pos logit contributions
 forgot+0.047
'd+0.044
 cared+0.041
ave+0.040
 regretted+0.040
Neg logit contributions
 something-0.026
 up-0.026
 outside-0.025
 the-0.025
 that-0.024
Token and
Token ID290
Feature activation-5.94e-02

Pos logit contributions
 ourselves+0.030
ach+0.028
ummy+0.028
ings+0.027
 lunch+0.027
Neg logit contributions
 forgot-0.086
'd-0.077
 remembered-0.074
 regretted-0.073
 knew-0.073
Token started
Token ID2067
Feature activation+3.77e-03

Pos logit contributions
 forgot+0.083
'd+0.081
ave+0.071
 cared+0.067
 regretted+0.063
Neg logit contributions
 the-0.120
 that-0.118
 up-0.118
 something-0.117
 her-0.116
Token to
Token ID284
Feature activation-9.33e-03

Pos logit contributions
 something+0.002
ings+0.002
 ourselves+0.002
ummy+0.002
 up+0.002
Neg logit contributions
 forgot-0.012
'd-0.011
 remembered-0.010
 cared-0.010
 regretted-0.010
Token From
Token ID3574
Feature activation+2.97e-03

Pos logit contributions
 forgot+0.032
'd+0.030
 remembered+0.027
 cared+0.026
 regretted+0.026
Neg logit contributions
ummy-0.021
 ourselves-0.019
ings-0.018
ach-0.018
 lunch-0.018
Token then
Token ID788
Feature activation-2.30e-02

Pos logit contributions
ings+0.004
ummy+0.004
ach+0.004
.'+0.004
 ourselves+0.004
Neg logit contributions
 forgot-0.007
 remembered-0.006
'd-0.006
 knew-0.006
 wished-0.006
Token on
Token ID319
Feature activation+6.68e-03

Pos logit contributions
 forgot+0.070
'd+0.064
 cared+0.060
 regretted+0.060
 remembered+0.060
Neg logit contributions
ummy-0.014
 ourselves-0.013
 lunch-0.012
ings-0.012
time-0.012
Token,
Token ID11
Feature activation+6.83e-03

Pos logit contributions
ummy+0.004
 ourselves+0.003
ach+0.003
ings+0.003
 lunch+0.003
Neg logit contributions
 forgot-0.021
'd-0.019
 remembered-0.018
 regretted-0.018
 missed-0.018
Token when
Token ID618
Feature activation-2.44e-03

Pos logit contributions
ummy+0.008
 ourselves+0.007
ach+0.007
 lunch+0.007
ings+0.007
Neg logit contributions
 forgot-0.017
'd-0.016
 remembered-0.015
 regretted-0.014
 missed-0.014
Token Gus
Token ID37307
Feature activation-1.24e-01

Pos logit contributions
 forgot+0.006
'd+0.006
 remembered+0.005
 regretted+0.005
 missed+0.005
Neg logit contributions
ummy-0.003
 ourselves-0.003
ach-0.003
ings-0.002
 lunch-0.002
Token was
Token ID373
Feature activation-4.45e-03

Pos logit contributions
 forgot+0.190
'd+0.178
 cared+0.165
Uh+0.165
ave+0.163
Neg logit contributions
,-0.211
 the-0.206
 up-0.203
?-0.196
 outside-0.195
Token in
Token ID287
Feature activation+7.29e-03

Pos logit contributions
 forgot+0.010
'd+0.009
 cared+0.008
 regretted+0.008
 hadn+0.008
Neg logit contributions
 something-0.006
 lunch-0.006
 outside-0.006
ummy-0.006
 up-0.006
Token the
Token ID262
Feature activation-2.61e-02

Pos logit contributions
ummy+0.006
 ourselves+0.006
ach+0.005
ings+0.005
 lunch+0.005
Neg logit contributions
 forgot-0.021
'd-0.019
 remembered-0.018
 regretted-0.018
 missed-0.018
Token forest
Token ID8222
Feature activation-2.88e-02

Pos logit contributions
 forgot+0.065
'd+0.060
 hadn+0.055
ave+0.054
 sped+0.053
Neg logit contributions
 something-0.032
 lunch-0.030
 outside-0.029
 up-0.028
ummy-0.028
Token,
Token ID11
Feature activation-1.25e-02

Pos logit contributions
 forgot+0.084
'd+0.079
 remembered+0.074
 regretted+0.073
 switched+0.073
Neg logit contributions
 something-0.016
time-0.015
 that-0.015
ummy-0.014
ings-0.014
Token fell
Token ID3214
Feature activation+7.92e-02

Pos logit contributions
 ounces+0.147
 forgot+0.144
'd+0.144
mo+0.140
 struggled+0.138
Neg logit contributions
 that-0.139
 with-0.120
 the-0.120
 a-0.119
?"-0.118
Token off
Token ID572
Feature activation+8.87e-02

Pos logit contributions
ummy+0.090
ach+0.090
 ourselves+0.083
ings+0.077
 lunch+0.076
Neg logit contributions
 forgot-0.198
'd-0.179
 remembered-0.170
 missed-0.168
 regretted-0.168
Token the
Token ID262
Feature activation+6.91e-02

Pos logit contributions
ummy+0.097
ach+0.097
 ourselves+0.091
ires+0.084
ings+0.082
Neg logit contributions
 forgot-0.225
'd-0.196
 missed-0.195
 remembered-0.192
 regretted-0.190
Token stove
Token ID27635
Feature activation+7.19e-02

Pos logit contributions
ummy+0.102
 ourselves+0.101
ach+0.100
ings+0.096
 lunch+0.091
Neg logit contributions
 forgot-0.147
'd-0.131
 regretted-0.120
 remembered-0.120
 missed-0.118
Token and
Token ID290
Feature activation-2.10e-03

Pos logit contributions
ummy+0.059
 ourselves+0.058
ach+0.058
ings+0.049
 lunch+0.049
Neg logit contributions
 forgot-0.198
'd-0.182
 remembered-0.170
 regretted-0.169
 missed-0.168
Token spilled
Token ID31896
Feature activation+8.87e-02

Pos logit contributions
 forgot+0.004
'd+0.004
 cared+0.004
 remembered+0.004
ave+0.004
Neg logit contributions
 up-0.003
 something-0.003
 ourselves-0.003
 the-0.003
 that-0.003
Token soup
Token ID17141
Feature activation+3.25e-02

Pos logit contributions
ach+0.098
ummy+0.097
 ourselves+0.090
ires+0.082
ings+0.081
Neg logit contributions
 forgot-0.218
'd-0.193
 missed-0.189
 regretted-0.186
 remembered-0.186
Token on
Token ID319
Feature activation+9.15e-02

Pos logit contributions
ummy+0.032
 ourselves+0.030
ach+0.030
ings+0.026
 yourselves+0.025
Neg logit contributions
 forgot-0.088
'd-0.078
 remembered-0.074
 missed-0.073
 regretted-0.073
Token the
Token ID262
Feature activation+5.85e-02

Pos logit contributions
uttering+0.100
ach+0.097
ummy+0.088
uggle+0.087
ires+0.079
Neg logit contributions
 forgot-0.217
 knew-0.194
'd-0.188
 remembered-0.188
 had-0.187
Token floor
Token ID4314
Feature activation+6.05e-02

Pos logit contributions
ummy+0.095
ach+0.094
 ourselves+0.094
ings+0.091
time+0.085
Neg logit contributions
 forgot-0.114
'd-0.101
 regretted-0.091
 remembered-0.091
 knew-0.090
Token.
Token ID13
Feature activation+2.98e-02

Pos logit contributions
ummy+0.036
 ourselves+0.034
ach+0.033
 lunch+0.027
ings+0.025
Neg logit contributions
 forgot-0.183
'd-0.168
 remembered-0.159
 missed-0.156
 regretted-0.156
Token Do
Token ID2141
Feature activation+7.69e-02

Pos logit contributions
 ourselves+0.028
ummy+0.028
ach+0.028
 lunch+0.026
ings+0.025
Neg logit contributions
 forgot-0.048
'd-0.042
 knew-0.041
 missed-0.041
 remembered-0.040
Token you
Token ID345
Feature activation+2.57e-02

Pos logit contributions
ach+0.068
ummy+0.068
ings+0.066
 outside+0.060
 ourselves+0.060
Neg logit contributions
 forgot-0.211
'd-0.190
 remembered-0.182
 knew-0.178
 cared-0.177
Token want
Token ID765
Feature activation+8.16e-02

Pos logit contributions
 ourselves+0.036
ummy+0.036
 something+0.035
 lunch+0.035
ach+0.034
Neg logit contributions
 forgot-0.054
'd-0.048
 regretted-0.045
 remembered-0.044
 sped-0.044
Token to
Token ID284
Feature activation+5.01e-02

Pos logit contributions
ummy+0.075
ach+0.071
 ourselves+0.067
ings+0.067
time+0.060
Neg logit contributions
 forgot-0.219
'd-0.203
 remembered-0.193
 cared-0.188
 missed-0.188
Token jump
Token ID4391
Feature activation+8.84e-02

Pos logit contributions
ummy+0.084
ings+0.079
ach+0.077
 ourselves+0.076
ires+0.074
Neg logit contributions
 forgot-0.101
'd-0.087
 remembered-0.081
 missed-0.079
 cared-0.077
Token in
Token ID287
Feature activation+9.20e-02

Pos logit contributions
ummy+0.119
ment+0.108
ach+0.104
ings+0.104
ires+0.103
Neg logit contributions
 forgot-0.173
'd-0.167
 missed-0.155
 had-0.155
 remembered-0.154
Token it
Token ID340
Feature activation+5.91e-02

Pos logit contributions
uttering+0.112
ummy+0.107
ach+0.105
uggle+0.103
ires+0.103
Neg logit contributions
 forgot-0.185
 remembered-0.168
 had-0.165
 wished-0.163
 pretended-0.162
Token with
Token ID351
Feature activation+6.88e-02

Pos logit contributions
ach+0.050
 ourselves+0.044
ummy+0.044
ings+0.041
 yourselves+0.038
Neg logit contributions
 forgot-0.155
'd-0.145
 remembered-0.136
 knew-0.133
 needed-0.132
Token me
Token ID502
Feature activation+2.32e-02

Pos logit contributions
ummy+0.107
ach+0.102
ings+0.097
uttering+0.094
uggle+0.094
Neg logit contributions
 forgot-0.135
'd-0.118
 remembered-0.117
 knew-0.113
 missed-0.110
Token?"
Token ID1701
Feature activation+4.22e-02

Pos logit contributions
ummy+0.012
ach+0.011
 ourselves+0.010
ings+0.009
 lunch+0.008
Neg logit contributions
 forgot-0.072
'd-0.067
 remembered-0.064
 missed-0.062
 knew-0.062
Token Anna
Token ID11735
Feature activation-3.78e-02

Pos logit contributions
ummy+0.046
 ourselves+0.045
ach+0.043
ings+0.043
 lunch+0.041
Neg logit contributions
 forgot-0.106
'd-0.095
 remembered-0.091
 knew-0.089
 cared-0.089
Token time
Token ID640
Feature activation+9.25e-02

Pos logit contributions
ummy+0.010
ach+0.008
ings+0.008
 lunch+0.007
 ourselves+0.007
Neg logit contributions
 forgot-0.033
'd-0.031
 remembered-0.030
 knew-0.029
 missed-0.028
Token at
Token ID379
Feature activation+8.40e-02

Pos logit contributions
ummy+0.069
 ourselves+0.065
ach+0.060
ings+0.057
 lunch+0.052
Neg logit contributions
 forgot-0.268
'd-0.246
 remembered-0.233
 missed-0.228
 regretted-0.226
Token the
Token ID262
Feature activation+6.15e-02

Pos logit contributions
ummy+0.067
ach+0.063
ings+0.057
 ourselves+0.051
time+0.050
Neg logit contributions
 forgot-0.236
'd-0.210
 remembered-0.208
 missed-0.206
 knew-0.201
Token golden
Token ID10861
Feature activation+5.00e-02

Pos logit contributions
ummy+0.076
 ourselves+0.073
ach+0.070
ings+0.069
 lunch+0.069
Neg logit contributions
 forgot-0.147
'd-0.134
 remembered-0.124
 regretted-0.122
 missed-0.121
Token horn
Token ID12718
Feature activation+3.02e-02

Pos logit contributions
 ourselves+0.071
ummy+0.071
ach+0.067
ings+0.066
 lunch+0.064
Neg logit contributions
 forgot-0.107
'd-0.098
 remembered-0.090
 regretted-0.088
 cared-0.087
Token on
Token ID319
Feature activation+8.70e-02

Pos logit contributions
ummy+0.020
 ourselves+0.018
ings+0.017
ach+0.017
 lunch+0.017
Neg logit contributions
 forgot-0.090
'd-0.082
 remembered-0.078
 regretted-0.077
 missed-0.077
Token the
Token ID262
Feature activation+4.94e-02

Pos logit contributions
ummy+0.065
 ourselves+0.061
ach+0.057
uggle+0.055
iggle+0.049
Neg logit contributions
 forgot-0.234
'd-0.219
 remembered-0.209
 missed-0.203
 knew-0.203
Token top
Token ID1353
Feature activation+3.44e-02

Pos logit contributions
ummy+0.071
 ourselves+0.070
ach+0.067
 lunch+0.067
ings+0.064
Neg logit contributions
 forgot-0.109
'd-0.097
 remembered-0.090
 missed-0.088
 regretted-0.088
Token of
Token ID286
Feature activation+6.82e-02

Pos logit contributions
ummy+0.026
 ourselves+0.023
ach+0.022
ings+0.022
 lunch+0.021
Neg logit contributions
 forgot-0.100
'd-0.092
 remembered-0.088
 missed-0.086
 regretted-0.086
Token the
Token ID262
Feature activation+3.60e-02

Pos logit contributions
ummy+0.045
 ourselves+0.039
ach+0.039
ings+0.037
time+0.032
Neg logit contributions
 forgot-0.198
'd-0.185
 remembered-0.177
 missed-0.173
 cared-0.173
Token carriage
Token ID25739
Feature activation+1.65e-02

Pos logit contributions
ummy+0.046
 lunch+0.045
 ourselves+0.045
ach+0.044
 outside+0.043
Neg logit contributions
 forgot-0.086
'd-0.077
 remembered-0.072
 missed-0.071
 struggled-0.070
Token.
Token ID13
Feature activation+4.95e-02

Pos logit contributions
ummy+0.066
 ourselves+0.065
 lunch+0.061
ach+0.061
ings+0.061
Neg logit contributions
 forgot-0.113
'd-0.102
 remembered-0.095
 regretted-0.093
 missed-0.092
Token When
Token ID1649
Feature activation+5.70e-02

Pos logit contributions
ummy+0.060
 ourselves+0.058
ach+0.055
 lunch+0.054
ings+0.054
Neg logit contributions
 forgot-0.120
'd-0.109
 remembered-0.102
 regretted-0.100
 missed-0.099
Token he
Token ID339
Feature activation-4.05e-02

Pos logit contributions
ummy+0.051
 ourselves+0.049
 lunch+0.046
ach+0.046
ings+0.044
Neg logit contributions
 forgot-0.155
'd-0.143
 remembered-0.134
 regretted-0.132
 missed-0.131
Token went
Token ID1816
Feature activation+5.04e-02

Pos logit contributions
 forgot+0.057
'd+0.050
ave+0.047
 regretted+0.047
 cared+0.046
Neg logit contributions
?-0.079
 the-0.078
?"-0.077
,-0.077
 something-0.075
Token to
Token ID284
Feature activation+5.23e-02

Pos logit contributions
 ourselves+0.058
ummy+0.055
ach+0.054
ings+0.051
 yourselves+0.049
Neg logit contributions
 forgot-0.120
'd-0.112
 remembered-0.106
 knew-0.102
 missed-0.101
Token eat
Token ID4483
Feature activation+8.69e-02

Pos logit contributions
ummy+0.072
ach+0.066
 ourselves+0.063
ings+0.063
time+0.061
Neg logit contributions
 forgot-0.122
'd-0.107
 remembered-0.103
 knew-0.102
 missed-0.101
Token the
Token ID262
Feature activation+3.53e-02

Pos logit contributions
uttering+0.116
ach+0.108
 ourselves+0.106
ummy+0.102
ires+0.101
Neg logit contributions
 forgot-0.166
 knew-0.156
 had-0.153
'd-0.153
 remembered-0.150
Token food
Token ID2057
Feature activation+2.07e-02

Pos logit contributions
ummy+0.044
 ourselves+0.043
ach+0.041
ings+0.041
 lunch+0.039
Neg logit contributions
 forgot-0.083
'd-0.075
 remembered-0.069
 regretted-0.069
 cared-0.068
Token in
Token ID287
Feature activation+6.46e-02

Pos logit contributions
 ourselves+0.013
ummy+0.013
ach+0.013
ings+0.011
 yourselves+0.011
Neg logit contributions
 forgot-0.061
'd-0.056
 remembered-0.053
 knew-0.052
 cared-0.051
Token the
Token ID262
Feature activation+3.59e-02

Pos logit contributions
ummy+0.058
ach+0.056
uggle+0.053
 ourselves+0.053
uttering+0.050
Neg logit contributions
 forgot-0.168
'd-0.153
 remembered-0.152
 knew-0.146
 missed-0.145
Token pan
Token ID3425
Feature activation+2.27e-02

Pos logit contributions
ummy+0.044
 lunch+0.041
 ourselves+0.041
ach+0.039
 something+0.039
Neg logit contributions
 forgot-0.089
'd-0.080
 remembered-0.075
 missed-0.073
 regretted-0.072
Token picture
Token ID4286
Feature activation+7.94e-02

Pos logit contributions
 forgot+0.028
'd+0.025
 remembered+0.023
 regretted+0.023
 cared+0.022
Neg logit contributions
ummy-0.024
 ourselves-0.024
ach-0.023
 lunch-0.023
ings-0.022
Token and
Token ID290
Feature activation+2.07e-02

Pos logit contributions
ummy+0.066
 ourselves+0.063
ach+0.058
 lunch+0.058
ings+0.058
Neg logit contributions
 forgot-0.221
'd-0.204
 remembered-0.192
 regretted-0.189
 missed-0.188
Token they
Token ID484
Feature activation-1.89e-02

Pos logit contributions
 something+0.034
 lunch+0.033
 ourselves+0.033
 that+0.032
 outside+0.032
Neg logit contributions
 forgot-0.038
'd-0.035
 regretted-0.031
ave-0.031
 remembered-0.031
Token hung
Token ID9174
Feature activation+6.68e-02

Pos logit contributions
 forgot+0.028
'd+0.025
 sped+0.025
 regretted+0.024
Uh+0.024
Neg logit contributions
 that-0.033
 the-0.033
 outside-0.032
 her-0.032
 him-0.031
Token it
Token ID340
Feature activation+1.53e-02

Pos logit contributions
ummy+0.053
ach+0.044
 ourselves+0.042
uggle+0.041
time+0.041
Neg logit contributions
 forgot-0.184
'd-0.168
 remembered-0.162
 knew-0.160
 missed-0.159
Token on
Token ID319
Feature activation+8.95e-02

Pos logit contributions
ummy+0.013
 ourselves+0.012
ach+0.011
ings+0.011
 lunch+0.011
Neg logit contributions
 forgot-0.043
'd-0.040
 remembered-0.037
 missed-0.037
 regretted-0.036
Token their
Token ID511
Feature activation+2.11e-02

Pos logit contributions
ummy+0.078
uggle+0.072
ach+0.065
uttering+0.062
 ourselves+0.062
Neg logit contributions
 forgot-0.231
'd-0.210
 remembered-0.205
 knew-0.204
 cared-0.200
Token tree
Token ID5509
Feature activation+4.26e-02

Pos logit contributions
ummy+0.028
 ourselves+0.028
 lunch+0.027
ach+0.026
ings+0.026
Neg logit contributions
 forgot-0.048
'd-0.044
 remembered-0.041
 regretted-0.040
 missed-0.039
Token house
Token ID2156
Feature activation+5.10e-02

Pos logit contributions
ummy+0.042
 ourselves+0.039
ach+0.038
 yourselves+0.035
ings+0.034
Neg logit contributions
 forgot-0.111
'd-0.102
 remembered-0.094
 regretted-0.093
 missed-0.092
Token wall
Token ID3355
Feature activation+4.92e-02

Pos logit contributions
ummy+0.031
 ourselves+0.027
ach+0.026
 lunch+0.022
ings+0.022
Neg logit contributions
 forgot-0.154
'd-0.143
 remembered-0.134
 regretted-0.132
 cared-0.132
Token.
Token ID13
Feature activation+3.20e-02

Pos logit contributions
ummy+0.024
 ourselves+0.022
ach+0.019
 lunch+0.018
ings+0.018
Neg logit contributions
 forgot-0.155
'd-0.144
 remembered-0.136
 regretted-0.134
 missed-0.134
Token.
Token ID13
Feature activation-6.29e-03

Pos logit contributions
 forgot+0.016
'd+0.015
 remembered+0.014
 regretted+0.014
 missed+0.014
Neg logit contributions
ummy-0.002
 ourselves-0.002
ach-0.002
 lunch-0.002
ings-0.002
Token<|endoftext|>
Token ID50256
Feature activation+2.54e-02

Pos logit contributions
 forgot+0.016
'd+0.014
 remembered+0.014
 regretted+0.013
 missed+0.013
Neg logit contributions
ummy-0.007
 ourselves-0.007
ach-0.006
 lunch-0.006
ings-0.006
TokenOnce
Token ID7454
Feature activation+1.48e-02

Pos logit contributions
 ourselves+0.037
ummy+0.036
ach+0.034
 yourselves+0.032
ings+0.032
Neg logit contributions
 forgot-0.052
'd-0.051
 remembered-0.045
 hadn-0.045
 missed-0.045
Token upon
Token ID2402
Feature activation+4.20e-02

Pos logit contributions
 ourselves+0.025
ach+0.024
 yourselves+0.024
 lunch+0.022
 itself+0.021
Neg logit contributions
 forgot-0.031
'd-0.028
 knew-0.026
 remembered-0.025
 needed-0.024
Token a
Token ID257
Feature activation+2.64e-02

Pos logit contributions
ummy+0.066
ings+0.059
ach+0.059
 ourselves+0.051
ires+0.050
Neg logit contributions
 forgot-0.096
'd-0.092
 had-0.085
 cared-0.084
 remembered-0.084
Token time
Token ID640
Feature activation+1.36e-02

Pos logit contributions
 ourselves+0.043
ummy+0.039
 yourselves+0.039
ach+0.038
ings+0.038
Neg logit contributions
 forgot-0.057
'd-0.052
 remembered-0.047
 missed-0.047
 cared-0.046
Token there
Token ID612
Feature activation-2.13e-02

Pos logit contributions
ags+0.020
 ourselves+0.020
 yourselves+0.019
ate+0.018
 toast+0.017
Neg logit contributions
 forgot-0.029
'd-0.026
 had-0.026
 hadn-0.026
 knew-0.025
Token was
Token ID373
Feature activation+3.57e-02

Pos logit contributions
 forgot+0.035
'd+0.030
 remembered+0.027
 regretted+0.026
 missed+0.026
Neg logit contributions
ummy-0.042
 ourselves-0.041
ach-0.040
 lunch-0.040
ings-0.040
Token a
Token ID257
Feature activation+9.40e-03

Pos logit contributions
ings+0.056
ach+0.050
ummy+0.049
 ourselves+0.046
 yourselves+0.045
Neg logit contributions
 forgot-0.072
 pretended-0.067
 had-0.067
 missed-0.067
 remembered-0.066
Token Mom
Token ID11254
Feature activation-3.71e-02

Pos logit contributions
ummy+0.012
 lunch+0.011
 outside+0.011
time+0.011
 something+0.011
Neg logit contributions
 forgot-0.022
'd-0.020
ave-0.019
 struggled-0.018
 regretted-0.018
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
 forgot+0.056
'd+0.048
 remembered+0.042
 regretted+0.041
 missed+0.040
Neg logit contributions
ummy-0.079
 ourselves-0.078
ach-0.076
ings-0.075
 lunch-0.075
Token succeeded
Token ID14131
Feature activation+1.95e-02

Pos logit contributions
 forgot+0.071
 regretted+0.068
mo+0.065
Uh+0.062
 sank+0.062
Neg logit contributions
?"-0.081
?-0.080
 a-0.079
 the-0.078
 something-0.078
Token,
Token ID11
Feature activation+5.94e-03

Pos logit contributions
uggle+0.018
ummy+0.018
time+0.016
ings+0.015
 ourselves+0.015
Neg logit contributions
 forgot-0.051
'd-0.047
 remembered-0.046
 hadn-0.043
 knew-0.043
Token that
Token ID326
Feature activation+1.15e-02

Pos logit contributions
ummy+0.005
ings+0.005
 ourselves+0.005
ach+0.005
 lunch+0.005
Neg logit contributions
 forgot-0.016
'd-0.015
 remembered-0.014
 knew-0.013
 hadn-0.013
Token was
Token ID373
Feature activation-7.60e-03

Pos logit contributions
ummy+0.018
 ourselves+0.018
 lunch+0.017
ach+0.017
ings+0.017
Neg logit contributions
 forgot-0.024
'd-0.021
 remembered-0.019
 regretted-0.019
 missed-0.019
Token so
Token ID523
Feature activation-3.54e-03

Pos logit contributions
 forgot+0.020
'd+0.018
 remembered+0.017
 regretted+0.017
 missed+0.017
Neg logit contributions
ummy-0.008
 ourselves-0.007
ach-0.007
 lunch-0.007
ings-0.007
Token brave
Token ID14802
Feature activation+1.89e-02

Pos logit contributions
 forgot+0.008
'd+0.007
 remembered+0.007
 regretted+0.006
 missed+0.006
Neg logit contributions
ummy-0.005
 ourselves-0.005
ach-0.005
 something-0.005
 lunch-0.005
Token of
Token ID286
Feature activation+1.55e-02

Pos logit contributions
 ourselves+0.015
ummy+0.015
ings+0.014
ach+0.014
 lunch+0.013
Neg logit contributions
 forgot-0.053
'd-0.049
 remembered-0.046
 cared-0.045
 regretted-0.045
Token you
Token ID345
Feature activation-3.23e-02

Pos logit contributions
ummy+0.013
ach+0.013
ings+0.012
uggle+0.010
 ourselves+0.010
Neg logit contributions
 forgot-0.042
'd-0.039
 remembered-0.037
 knew-0.036
 hadn-0.036
Token!"
Token ID2474
Feature activation+3.72e-03

Pos logit contributions
 forgot+0.101
'd+0.093
 regretted+0.090
 cared+0.089
 remembered+0.089
Neg logit contributions
 ourselves-0.014
ummy-0.014
 lunch-0.012
ach-0.011
 something-0.011
Token\n
Token ID198
Feature activation+2.87e-02

Pos logit contributions
ach+0.005
 ourselves+0.005
ummy+0.004
ings+0.004
 lunch+0.004
Neg logit contributions
 forgot-0.009
'd-0.008
 knew-0.007
 cared-0.007
 hadn-0.007
Token\n
Token ID198
Feature activation+1.06e-02

Pos logit contributions
ach+0.041
 avail+0.040
 ourselves+0.040
ings+0.040
 yourselves+0.038
Neg logit contributions
 forgot-0.055
 knew-0.052
'd-0.050
 learned-0.050
 had-0.049
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
 ourselves+0.049
ummy+0.048
ach+0.047
 yourselves+0.041
ings+0.041
Neg logit contributions
 forgot-0.151
'd-0.141
 remembered-0.133
 knew-0.132
 hadn-0.130
Token Her
Token ID2332
Feature activation-1.53e-02

Pos logit contributions
ummy+0.021
 ourselves+0.019
ings+0.018
ach+0.018
 lunch+0.018
Neg logit contributions
 forgot-0.049
'd-0.045
 remembered-0.042
 regretted-0.041
 cared-0.041
Token dad
Token ID9955
Feature activation-6.30e-02

Pos logit contributions
 forgot+0.036
'd+0.033
 remembered+0.030
 cared+0.030
 missed+0.030
Neg logit contributions
ummy-0.019
 ourselves-0.018
 lunch-0.018
ach-0.018
ings-0.017
Token smiled
Token ID13541
Feature activation+1.76e-02

Pos logit contributions
 forgot+0.099
 regretted+0.082
 missed+0.079
 villains+0.077
'd+0.076
Neg logit contributions
 that-0.119
,-0.118
 with-0.118
 the-0.113
 outside-0.110
Token and
Token ID290
Feature activation-2.96e-02

Pos logit contributions
 ourselves+0.018
ummy+0.017
ach+0.017
ings+0.016
 lunch+0.016
Neg logit contributions
 forgot-0.046
'd-0.042
 remembered-0.040
 regretted-0.039
 knew-0.039
Token started
Token ID2067
Feature activation+2.94e-02

Pos logit contributions
'd+0.048
 forgot+0.048
ave+0.040
 missed+0.039
 regretted+0.039
Neg logit contributions
 the-0.055
 that-0.055
 with-0.052
 up-0.052
 her-0.052
Token pushing
Token ID7796
Feature activation+5.03e-02

Pos logit contributions
ummy+0.029
 ourselves+0.028
ach+0.027
 lunch+0.027
ings+0.026
Neg logit contributions
 forgot-0.078
'd-0.071
 remembered-0.067
 regretted-0.066
 missed-0.065
Token her
Token ID607
Feature activation-2.02e-02

Pos logit contributions
ummy+0.032
 ourselves+0.029
ach+0.025
 lunch+0.024
ings+0.024
Neg logit contributions
 forgot-0.150
'd-0.139
 remembered-0.133
 cared-0.130
 regretted-0.130
Token around
Token ID1088
Feature activation+3.47e-02

Pos logit contributions
 forgot+0.052
'd+0.047
 remembered+0.044
 regretted+0.044
 missed+0.043
Neg logit contributions
ummy-0.022
 ourselves-0.021
ach-0.020
 lunch-0.019
ings-0.019
Token the
Token ID262
Feature activation-4.18e-04

Pos logit contributions
ummy+0.019
 ourselves+0.018
ach+0.016
ings+0.015
 lunch+0.015
Neg logit contributions
 forgot-0.107
'd-0.099
 remembered-0.094
 regretted-0.093
 missed-0.093
Token park
Token ID3952
Feature activation+1.11e-02

Pos logit contributions
 forgot+0.001
'd+0.001
 remembered+0.001
 regretted+0.001
 missed+0.001
Neg logit contributions
ummy-0.000
 ourselves-0.000
ach-0.000
 lunch-0.000
ings-0.000
Token words
Token ID2456
Feature activation-4.39e-03

Pos logit contributions
ummy+0.005
 ourselves+0.005
ach+0.005
 lunch+0.005
ings+0.005
Neg logit contributions
 forgot-0.009
'd-0.008
 remembered-0.008
 regretted-0.007
 cared-0.007
Token and
Token ID290
Feature activation-2.64e-02

Pos logit contributions
 forgot+0.012
'd+0.011
 remembered+0.011
 regretted+0.011
 missed+0.011
Neg logit contributions
ummy-0.003
 ourselves-0.003
ach-0.003
 lunch-0.003
ings-0.003
Token pictures
Token ID5986
Feature activation+1.07e-03

Pos logit contributions
 forgot+0.055
'd+0.053
ave+0.047
 sped+0.046
 missed+0.046
Neg logit contributions
 something-0.037
 lunch-0.036
 ourselves-0.036
 the-0.035
 that-0.034
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
ummy+0.001
 ourselves+0.001
ach+0.001
 lunch+0.001
ings+0.001
Neg logit contributions
 forgot-0.003
'd-0.003
 remembered-0.003
 regretted-0.003
 missed-0.003
Token\n
Token ID198
Feature activation+4.01e-02

Pos logit contributions
 ourselves+0.022
ummy+0.022
ach+0.021
ings+0.021
 lunch+0.020
Neg logit contributions
 forgot-0.037
'd-0.033
 remembered-0.031
 missed-0.031
 regretted-0.031
Token\n
Token ID198
Feature activation+2.31e-02

Pos logit contributions
 ourselves+0.051
ummy+0.050
ach+0.047
 lunch+0.047
 yourselves+0.045
Neg logit contributions
 forgot-0.093
'd-0.086
 knew-0.079
 remembered-0.079
 missed-0.077
TokenSuddenly
Token ID38582
Feature activation+3.42e-02

Pos logit contributions
ummy+0.033
 ourselves+0.032
ach+0.031
 lunch+0.030
ings+0.030
Neg logit contributions
 forgot-0.051
'd-0.046
 remembered-0.043
 regretted-0.042
 missed-0.042
Token,
Token ID11
Feature activation+2.80e-02

Pos logit contributions
 ourselves+0.030
ach+0.029
ummy+0.027
 yourselves+0.025
ires+0.024
Neg logit contributions
 forgot-0.096
 remembered-0.086
'd-0.085
 knew-0.082
 missed-0.082
Token Lily
Token ID20037
Feature activation-8.12e-02

Pos logit contributions
ach+0.030
ummy+0.030
 ourselves+0.029
ings+0.028
.'+0.025
Neg logit contributions
 forgot-0.074
 remembered-0.064
'd-0.063
 missed-0.061
 knew-0.061
Token heard
Token ID2982
Feature activation+5.37e-02

Pos logit contributions
 forgot+0.123
'd+0.118
 arrested+0.110
 cared+0.109
 sped+0.108
Neg logit contributions
 outside-0.140
,-0.136
.-0.133
 with-0.129
 something-0.129
Token a
Token ID257
Feature activation-1.76e-03

Pos logit contributions
 ourselves+0.047
ummy+0.045
ach+0.045
ings+0.041
.'+0.036
Neg logit contributions
 forgot-0.145
'd-0.130
 knew-0.129
 remembered-0.129
 missed-0.128
Token,
Token ID11
Feature activation+2.28e-03

Pos logit contributions
 yourselves+0.020
 ourselves+0.019
asty+0.018
ach+0.018
ags+0.018
Neg logit contributions
 forgot-0.028
 had-0.024
 missed-0.024
'd-0.024
 remembered-0.024
Token there
Token ID612
Feature activation-1.16e-02

Pos logit contributions
ach+0.003
ummy+0.003
 ourselves+0.003
ings+0.003
 yourselves+0.003
Neg logit contributions
 forgot-0.005
 had-0.005
'd-0.005
 remembered-0.005
 missed-0.005
Token was
Token ID373
Feature activation+3.46e-02

Pos logit contributions
 forgot+0.014
'd+0.012
 missed+0.010
 knew+0.010
 remembered+0.010
Neg logit contributions
 ourselves-0.029
ummy-0.028
ach-0.028
 yourselves-0.028
.'-0.027
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
ings+0.063
ach+0.056
ummy+0.054
 ourselves+0.052
 yourselves+0.051
Neg logit contributions
 forgot-0.060
 missed-0.059
 had-0.058
 pretended-0.058
 learned-0.057
Token hunter
Token ID19177
Feature activation-5.95e-03

Pos logit contributions
ummy+0.015
 ourselves+0.014
 lunch+0.014
ach+0.013
 something+0.013
Neg logit contributions
 forgot-0.023
'd-0.021
 remembered-0.019
 regretted-0.019
 missed-0.019
Token named
Token ID3706
Feature activation+1.36e-02

Pos logit contributions
 forgot+0.010
'd+0.009
 remembered+0.008
 cared+0.008
 missed+0.008
Neg logit contributions
ummy-0.011
 ourselves-0.011
ach-0.011
 lunch-0.011
ings-0.010
Token Tim
Token ID5045
Feature activation-3.67e-02

Pos logit contributions
ings+0.020
ummy+0.020
 ourselves+0.019
ach+0.018
.'+0.018
Neg logit contributions
 forgot-0.029
'd-0.026
 missed-0.024
 remembered-0.024
 drove-0.023
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
 forgot+0.105
'd+0.098
 remembered+0.093
 knew+0.092
 cared+0.092
Neg logit contributions
ummy-0.032
 ourselves-0.029
ach-0.028
 lunch-0.026
ings-0.026
Token Tim
Token ID5045
Feature activation-7.43e-02

Pos logit contributions
 forgot+0.045
'd+0.041
 missed+0.038
 remembered+0.038
 cared+0.038
Neg logit contributions
 ourselves-0.020
ummy-0.019
ings-0.019
ach-0.019
 lunch-0.017
Token loved
Token ID6151
Feature activation+2.06e-02

Pos logit contributions
 forgot+0.119
'd+0.104
 regretted+0.092
 remembered+0.090
 switched+0.089
Neg logit contributions
 outside-0.139
 up-0.137
 ourselves-0.136
 with-0.135
ummy-0.135
Token to
Token ID284
Feature activation+4.48e-03

Pos logit contributions
ummy+0.018
 ourselves+0.017
ach+0.016
ings+0.016
time+0.014
Neg logit contributions
 forgot-0.057
'd-0.052
 remembered-0.050
 missed-0.049
 regretted-0.049
Token.
Token ID13
Feature activation+9.19e-03

Pos logit contributions
ummy+0.031
ach+0.027
 ourselves+0.027
 yourselves+0.025
ings+0.024
Neg logit contributions
 forgot-0.100
'd-0.094
 cared-0.089
 remembered-0.087
 needed-0.086
Token But
Token ID887
Feature activation-2.46e-03

Pos logit contributions
 ourselves+0.016
ummy+0.016
ach+0.015
ings+0.015
 yourselves+0.014
Neg logit contributions
 forgot-0.017
'd-0.016
 missed-0.014
 remembered-0.014
 cared-0.014
Token today
Token ID1909
Feature activation+9.89e-03

Pos logit contributions
 forgot+0.006
'd+0.006
 remembered+0.005
 missed+0.005
 knew+0.005
Neg logit contributions
ummy-0.003
 ourselves-0.003
ings-0.003
ach-0.003
 lunch-0.002
Token the
Token ID262
Feature activation-3.19e-03

Pos logit contributions
 ourselves+0.012
ummy+0.011
ach+0.011
ags+0.011
ings+0.010
Neg logit contributions
 forgot-0.025
'd-0.022
 missed-0.020
 remembered-0.020
 knew-0.020
Token park
Token ID3952
Feature activation-1.77e-02

Pos logit contributions
 forgot+0.008
'd+0.007
 remembered+0.007
 regretted+0.007
 missed+0.007
Neg logit contributions
 lunch-0.004
ummy-0.004
 ourselves-0.003
time-0.003
 something-0.003
Token is
Token ID318
Feature activation-3.19e-05

Pos logit contributions
 forgot+0.035
'd+0.032
 regretted+0.030
 remembered+0.030
ave+0.030
Neg logit contributions
time-0.026
ummy-0.025
 up-0.025
 something-0.025
 that-0.025
Token closed
Token ID4838
Feature activation+5.62e-02

Pos logit contributions
 forgot+0.000
'd+0.000
 cared+0.000
 regretted+0.000
 remembered+0.000
Neg logit contributions
 something-0.000
 lunch-0.000
 up-0.000
ummy-0.000
 outside-0.000
Token.
Token ID13
Feature activation+9.35e-03

Pos logit contributions
ummy+0.055
ach+0.053
 ourselves+0.053
uggle+0.052
 lunch+0.049
Neg logit contributions
 forgot-0.140
'd-0.130
 remembered-0.122
 regretted-0.120
 needed-0.118
Token There
Token ID1318
Feature activation-7.91e-04

Pos logit contributions
 ourselves+0.012
ummy+0.012
ach+0.012
ings+0.012
ble+0.011
Neg logit contributions
 forgot-0.021
'd-0.019
 missed-0.018
 remembered-0.017
 hadn-0.017
Token is
Token ID318
Feature activation+1.49e-02

Pos logit contributions
 forgot+0.002
'd+0.001
 remembered+0.001
 missed+0.001
 knew+0.001
Neg logit contributions
ummy-0.001
 ourselves-0.001
ach-0.001
 lunch-0.001
ings-0.001
Token a
Token ID257
Feature activation-1.28e-02

Pos logit contributions
ummy+0.010
ach+0.009
 ourselves+0.009
ings+0.008
 lunch+0.007
Neg logit contributions
 forgot-0.045
'd-0.042
 remembered-0.040
 missed-0.039
 regretted-0.039
Token.
Token ID13
Feature activation-5.21e-03

Pos logit contributions
ummy+0.040
 ourselves+0.040
ach+0.039
 lunch+0.036
ings+0.036
Neg logit contributions
 forgot-0.046
'd-0.041
 remembered-0.037
 regretted-0.037
 struggled-0.036
Token "
Token ID366
Feature activation-1.94e-02

Pos logit contributions
 forgot+0.014
'd+0.013
 remembered+0.012
 regretted+0.012
 cared+0.012
Neg logit contributions
ummy-0.005
 ourselves-0.004
 lunch-0.004
 something-0.004
time-0.004
TokenMom
Token ID29252
Feature activation-3.96e-02

Pos logit contributions
 forgot+0.046
'd+0.041
 regretted+0.038
 remembered+0.038
 missed+0.038
Neg logit contributions
 ourselves-0.023
ach-0.023
ummy-0.023
 something-0.022
 outside-0.021
Tokenmy
Token ID1820
Feature activation-4.78e-02

Pos logit contributions
 forgot+0.056
'd+0.048
 remembered+0.042
 regretted+0.040
 missed+0.039
Neg logit contributions
ummy-0.087
 ourselves-0.086
ach-0.084
 lunch-0.083
ings-0.083
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 forgot+0.153
'd+0.143
 remembered+0.136
 regretted+0.134
 missed+0.134
Neg logit contributions
ummy-0.020
 ourselves-0.017
ach-0.015
 lunch-0.014
ings-0.014
Token look
Token ID804
Feature activation-3.48e-04

Pos logit contributions
 forgot+0.022
'd+0.020
 remembered+0.019
 missed+0.018
 regretted+0.018
Neg logit contributions
ummy-0.015
 ourselves-0.015
ach-0.014
ings-0.014
 lunch-0.014
Token!"
Token ID2474
Feature activation+4.90e-03

Pos logit contributions
 forgot+0.001
 remembered+0.001
'd+0.001
 knew+0.001
 regretted+0.001
Neg logit contributions
ummy-0.000
ings-0.000
 ourselves-0.000
time-0.000
uggle-0.000
Token Lily
Token ID20037
Feature activation-9.31e-02

Pos logit contributions
ummy+0.005
 lunch+0.004
 ourselves+0.004
 something+0.004
 that+0.004
Neg logit contributions
 forgot-0.013
'd-0.012
 remembered-0.011
 regretted-0.011
 missed-0.011
Token shouted
Token ID17293
Feature activation+1.33e-02

Pos logit contributions
'd+0.156
 forgot+0.145
 sped+0.122
 cared+0.121
mo+0.119
Neg logit contributions
 with-0.166
 something-0.166
,-0.164
 that-0.163
 the-0.162
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
 ourselves+0.013
 yourselves+0.013
ummy+0.013
ach+0.012
ings+0.012
Neg logit contributions
 forgot-0.034
 missed-0.030
 knew-0.030
 remembered-0.029
'd-0.028
Token "
Token ID366
Feature activation-1.45e-02

Pos logit contributions
ummy+0.015
 ourselves+0.015
ach+0.014
 lunch+0.014
ings+0.014
Neg logit contributions
 forgot-0.029
'd-0.026
 remembered-0.024
 regretted-0.024
 missed-0.024
Token could
Token ID714
Feature activation+2.50e-03

Pos logit contributions
 forgot+0.085
Uh+0.074
 regretted+0.068
'd+0.067
 sped+0.067
Neg logit contributions
.-0.138
?-0.138
?"-0.133
 up-0.132
 outside-0.131
Token make
Token ID787
Feature activation+3.69e-02

Pos logit contributions
ummy+0.004
ach+0.004
 ourselves+0.004
 yourselves+0.004
ings+0.004
Neg logit contributions
 forgot-0.005
'd-0.005
 knew-0.004
 remembered-0.004
 missed-0.004
Token happy
Token ID3772
Feature activation+1.62e-02

Pos logit contributions
ummy+0.032
ach+0.027
 ourselves+0.025
uggle+0.024
ings+0.024
Neg logit contributions
 forgot-0.100
'd-0.088
 knew-0.088
 remembered-0.087
 regretted-0.085
Token music
Token ID2647
Feature activation-6.96e-03

Pos logit contributions
ummy+0.017
ach+0.016
 ourselves+0.016
ings+0.015
uggle+0.015
Neg logit contributions
 forgot-0.041
'd-0.036
 remembered-0.035
 regretted-0.035
 cared-0.034
Token.
Token ID13
Feature activation+3.44e-03

Pos logit contributions
 forgot+0.021
'd+0.019
 remembered+0.018
 missed+0.018
 regretted+0.018
Neg logit contributions
ummy-0.005
 ourselves-0.004
ach-0.004
ings-0.004
 lunch-0.004
Token But
Token ID887
Feature activation-1.45e-02

Pos logit contributions
ummy+0.005
 ourselves+0.005
 lunch+0.005
ach+0.005
ings+0.005
Neg logit contributions
 forgot-0.008
'd-0.007
 remembered-0.006
 regretted-0.006
 missed-0.006
Token when
Token ID618
Feature activation+1.73e-03

Pos logit contributions
 forgot+0.039
'd+0.036
 cared+0.033
 regretted+0.033
 remembered+0.033
Neg logit contributions
ummy-0.013
 ourselves-0.013
 something-0.013
 lunch-0.013
ach-0.012
Token he
Token ID339
Feature activation-8.67e-02

Pos logit contributions
ummy+0.002
 ourselves+0.002
 lunch+0.002
 something+0.002
time+0.002
Neg logit contributions
 forgot-0.004
'd-0.004
 remembered-0.003
 cared-0.003
 regretted-0.003
Token blew
Token ID17948
Feature activation+2.99e-02

Pos logit contributions
 forgot+0.115
'd+0.100
ave+0.081
 sizes+0.080
 sped+0.080
Neg logit contributions
?-0.177
 the-0.175
 something-0.175
 with-0.174
 outside-0.173
Token into
Token ID656
Feature activation+1.75e-02

Pos logit contributions
ummy+0.023
 ourselves+0.021
ach+0.020
ings+0.018
 yourselves+0.017
Neg logit contributions
 forgot-0.084
'd-0.077
 remembered-0.075
 knew-0.073
 struggled-0.073
Token it
Token ID340
Feature activation-2.21e-02

Pos logit contributions
ummy+0.011
 ourselves+0.009
ach+0.009
ings+0.008
 lunch+0.008
Neg logit contributions
 forgot-0.053
'd-0.049
 remembered-0.047
 missed-0.046
 struggled-0.046
Token he
Token ID339
Feature activation-7.22e-02

Pos logit contributions
 forgot+0.015
'd+0.014
 cared+0.013
ave+0.013
 regretted+0.012
Neg logit contributions
 something-0.007
 the-0.007
 lunch-0.007
 that-0.007
 up-0.006
Token saw
Token ID2497
Feature activation+3.31e-02

Pos logit contributions
 forgot+0.098
 sizes+0.095
 cared+0.091
 officer+0.091
'd+0.087
Neg logit contributions
 something-0.129
 the-0.125
 with-0.123
,-0.121
 outside-0.119
Token a
Token ID257
Feature activation-9.43e-03

Pos logit contributions
ummy+0.027
ach+0.025
 ourselves+0.023
ings+0.022
.'+0.020
Neg logit contributions
 forgot-0.095
'd-0.086
 remembered-0.084
 knew-0.083
 regretted-0.083
Token rabbit
Token ID22746
Feature activation-1.45e-02

Pos logit contributions
 forgot+0.023
ave+0.021
'd+0.021
 cared+0.020
 missed+0.020
Neg logit contributions
 something-0.010
ummy-0.010
 lunch-0.010
ings-0.009
time-0.009
Token in
Token ID287
Feature activation+3.52e-02

Pos logit contributions
 forgot+0.034
'd+0.032
 remembered+0.030
 regretted+0.029
 hadn+0.029
Neg logit contributions
ummy-0.018
 ourselves-0.017
ach-0.016
ings-0.016
 yourselves-0.016
Token the
Token ID262
Feature activation+4.92e-03

Pos logit contributions
ummy+0.032
 ourselves+0.030
ach+0.028
uggle+0.026
ings+0.025
Neg logit contributions
 forgot-0.093
'd-0.087
 remembered-0.086
 struggled-0.084
 cared-0.083
Token garden
Token ID11376
Feature activation+2.83e-02

Pos logit contributions
ummy+0.006
 ourselves+0.005
 lunch+0.005
 something+0.005
ach+0.005
Neg logit contributions
 forgot-0.012
'd-0.011
 remembered-0.010
 missed-0.010
 regretted-0.010
Token.
Token ID13
Feature activation+9.51e-03

Pos logit contributions
ummy+0.014
 ourselves+0.013
ach+0.012
ings+0.010
 lunch+0.010
Neg logit contributions
 forgot-0.089
'd-0.083
 remembered-0.078
 regretted-0.077
 cared-0.077
Token
Token ID198
Feature activation+4.66e-02

Pos logit contributions
 ourselves+0.012
ummy+0.012
ach+0.012
ings+0.011
 lunch+0.011
Neg logit contributions
 forgot-0.022
'd-0.020
 remembered-0.019
 missed-0.018
 cared-0.018
Token
Token ID198
Feature activation+2.60e-02

Pos logit contributions
 ourselves+0.069
ummy+0.068
ach+0.064
 lunch+0.063
 yourselves+0.062
Neg logit contributions
 forgot-0.097
'd-0.091
 knew-0.083
 hadn-0.081
 remembered-0.080
TokenThe
Token ID464
Feature activation-1.66e-02

Pos logit contributions
ummy+0.036
 ourselves+0.035
ings+0.033
ach+0.033
 lunch+0.033
Neg logit contributions
 forgot-0.059
'd-0.054
 remembered-0.050
 regretted-0.048
 cared-0.048
Token They
Token ID1119
Feature activation-7.98e-02

Pos logit contributions
 forgot+0.009
'd+0.008
 remembered+0.007
 regretted+0.007
 missed+0.007
Neg logit contributions
ummy-0.006
 ourselves-0.006
ach-0.005
 lunch-0.005
ings-0.005
Token asked
Token ID1965
Feature activation+1.37e-02

Pos logit contributions
 forgot+0.097
 proven+0.097
 ounces+0.096
ologist+0.096
'd+0.095
Neg logit contributions
 him-0.150
 her-0.145
.-0.145
 that-0.144
 the-0.142
Token the
Token ID262
Feature activation-1.75e-02

Pos logit contributions
ach+0.016
ummy+0.015
 ourselves+0.014
 yourselves+0.014
ings+0.014
Neg logit contributions
 forgot-0.032
 remembered-0.029
 knew-0.029
'd-0.029
 missed-0.028
Token boy
Token ID2933
Feature activation-3.55e-02

Pos logit contributions
 forgot+0.030
'd+0.026
 remembered+0.024
 regretted+0.023
 missed+0.023
Neg logit contributions
ummy-0.034
 ourselves-0.032
 lunch-0.032
 something-0.032
ach-0.031
Token,
Token ID11
Feature activation-7.07e-03

Pos logit contributions
 forgot+0.107
'd+0.099
 remembered+0.094
 missed+0.092
 regretted+0.092
Neg logit contributions
ummy-0.022
 ourselves-0.021
ach-0.020
ings-0.018
 lunch-0.017
Token "
Token ID366
Feature activation-7.80e-03

Pos logit contributions
 forgot+0.016
'd+0.014
 remembered+0.013
 missed+0.013
 regretted+0.013
Neg logit contributions
ummy-0.010
 ourselves-0.010
ach-0.009
ings-0.009
 lunch-0.009
TokenPlease
Token ID5492
Feature activation+2.59e-03

Pos logit contributions
 forgot+0.017
'd+0.015
 remembered+0.014
 regretted+0.014
 missed+0.014
Neg logit contributions
ummy-0.012
 ourselves-0.011
ach-0.011
ings-0.011
 lunch-0.011
Token,
Token ID11
Feature activation+7.80e-03

Pos logit contributions
ummy+0.002
 ourselves+0.002
ach+0.002
ings+0.002
 lunch+0.002
Neg logit contributions
 forgot-0.007
'd-0.007
 remembered-0.007
 missed-0.006
 knew-0.006
Token can
Token ID460
Feature activation+2.81e-02

Pos logit contributions
ummy+0.015
 ourselves+0.014
ach+0.014
ings+0.014
 lunch+0.013
Neg logit contributions
 forgot-0.014
'd-0.012
 remembered-0.011
 missed-0.011
 regretted-0.011
Token we
Token ID356
Feature activation-2.41e-02

Pos logit contributions
ummy+0.038
ach+0.036
ings+0.036
 ourselves+0.035
 lunch+0.034
Neg logit contributions
 forgot-0.065
'd-0.058
 remembered-0.054
 missed-0.053
 hadn-0.052
Token play
Token ID711
Feature activation+2.94e-02

Pos logit contributions
 forgot+0.055
'd+0.049
 remembered+0.046
 missed+0.044
 regretted+0.044
Neg logit contributions
ummy-0.033
 ourselves-0.032
ach-0.031
ings-0.031
 lunch-0.030
Token have
Token ID423
Feature activation+2.80e-02

Pos logit contributions
 forgot+0.053
'd+0.047
 remembered+0.044
 regretted+0.044
 missed+0.043
Neg logit contributions
ummy-0.035
 ourselves-0.033
 lunch-0.032
ach-0.032
ings-0.032
Token some
Token ID617
Feature activation+2.33e-02

Pos logit contributions
ummy+0.024
ach+0.023
ings+0.023
 ourselves+0.021
iggle+0.019
Neg logit contributions
 forgot-0.077
'd-0.070
 remembered-0.068
 missed-0.067
 cared-0.066
Token milk
Token ID7545
Feature activation+1.16e-02

Pos logit contributions
 ourselves+0.032
ach+0.031
ings+0.030
ummy+0.030
 itself+0.029
Neg logit contributions
 forgot-0.049
'd-0.044
 remembered-0.042
 regretted-0.041
 needed-0.041
Token?"
Token ID1701
Feature activation+1.48e-02

Pos logit contributions
 ourselves+0.010
 itself+0.010
ach+0.009
ires+0.009
ummy+0.008
Neg logit contributions
 forgot-0.031
'd-0.028
 remembered-0.027
 needed-0.026
 missed-0.026
Token His
Token ID2399
Feature activation-1.09e-02

Pos logit contributions
 ourselves+0.021
ummy+0.020
ach+0.020
ings+0.019
 yourselves+0.018
Neg logit contributions
 forgot-0.033
'd-0.029
 remembered-0.028
 knew-0.027
 missed-0.027
Token mom
Token ID1995
Feature activation-7.16e-02

Pos logit contributions
 forgot+0.028
'd+0.025
 remembered+0.023
 regretted+0.023
 cared+0.023
Neg logit contributions
ummy-0.012
 ourselves-0.011
 lunch-0.011
ach-0.011
ings-0.011
Token said
Token ID531
Feature activation+7.35e-04

Pos logit contributions
 villains+0.127
 forgot+0.127
 sped+0.122
'd+0.115
 regretted+0.114
Neg logit contributions
,-0.115
?"-0.114
 with-0.113
 that-0.112
 up-0.110
Token,
Token ID11
Feature activation-2.52e-03

Pos logit contributions
 ourselves+0.001
ach+0.001
ummy+0.001
ings+0.001
 yourselves+0.001
Neg logit contributions
 forgot-0.002
'd-0.002
 remembered-0.002
 knew-0.002
 missed-0.002
Token "
Token ID366
Feature activation-8.56e-03

Pos logit contributions
 forgot+0.006
'd+0.006
 regretted+0.005
 remembered+0.005
 missed+0.005
Neg logit contributions
ummy-0.003
 ourselves-0.003
 lunch-0.003
 something-0.003
ach-0.003
TokenSure
Token ID19457
Feature activation+1.22e-02

Pos logit contributions
 forgot+0.021
 remembered+0.018
 missed+0.018
'd+0.018
 regretted+0.018
Neg logit contributions
 ourselves-0.010
ummy-0.010
time-0.010
ach-0.009
 something-0.009
Token,
Token ID11
Feature activation+6.75e-03

Pos logit contributions
 ourselves+0.008
ummy+0.008
ach+0.008
ings+0.007
ires+0.007
Neg logit contributions
 forgot-0.036
'd-0.033
 remembered-0.032
 knew-0.031
 regretted-0.030
Token road
Token ID2975
Feature activation+2.29e-02

Pos logit contributions
 forgot+0.003
'd+0.002
 remembered+0.002
 regretted+0.002
 missed+0.002
Neg logit contributions
ummy-0.002
 ourselves-0.002
 lunch-0.001
ach-0.001
ings-0.001
Token trip
Token ID5296
Feature activation+2.89e-02

Pos logit contributions
ummy+0.021
 ourselves+0.018
ach+0.018
ings+0.017
 lunch+0.017
Neg logit contributions
 forgot-0.062
'd-0.057
 remembered-0.052
 missed-0.052
 cared-0.051
Token."
Token ID526
Feature activation+1.02e-02

Pos logit contributions
ummy+0.029
ulp+0.027
 lunch+0.025
ings+0.025
ach+0.025
Neg logit contributions
 forgot-0.073
'd-0.068
 remembered-0.060
 missed-0.060
 needed-0.060
Token
Token ID198
Feature activation+3.18e-02

Pos logit contributions
ummy+0.010
 ourselves+0.010
ach+0.010
ings+0.009
 lunch+0.009
Neg logit contributions
 forgot-0.027
'd-0.024
 remembered-0.023
 missed-0.022
 cared-0.022
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
ummy+0.045
ach+0.045
 ourselves+0.045
 lunch+0.043
 yourselves+0.042
Neg logit contributions
 forgot-0.065
'd-0.058
 knew-0.057
 remembered-0.054
 missed-0.052
TokenThey
Token ID2990
Feature activation-6.90e-02

Pos logit contributions
ummy+0.018
 ourselves+0.017
ach+0.017
 lunch+0.016
ings+0.016
Neg logit contributions
 forgot-0.037
'd-0.033
 remembered-0.031
 regretted-0.031
 missed-0.031
Token were
Token ID547
Feature activation-1.45e-02

Pos logit contributions
 forgot+0.091
 regretted+0.086
ience+0.085
 ounces+0.081
Uh+0.079
Neg logit contributions
 the-0.137
 a-0.128
,-0.124
 outside-0.121
 lunch-0.119
Token having
Token ID1719
Feature activation+1.95e-02

Pos logit contributions
 forgot+0.031
'd+0.030
 cared+0.027
 regretted+0.027
ave+0.026
Neg logit contributions
 up-0.020
 ourselves-0.020
 outside-0.020
 lunch-0.020
 something-0.019
Token fun
Token ID1257
Feature activation+2.33e-02

Pos logit contributions
ummy+0.027
 ourselves+0.026
ach+0.025
ings+0.025
 lunch+0.024
Neg logit contributions
 forgot-0.044
'd-0.040
 remembered-0.037
 missed-0.036
 regretted-0.036
Token,
Token ID11
Feature activation-6.23e-03

Pos logit contributions
ummy+0.024
 ourselves+0.019
ings+0.019
ach+0.019
?'+0.018
Neg logit contributions
 forgot-0.063
'd-0.056
 remembered-0.054
 pretended-0.054
 sped-0.053
Token but
Token ID475
Feature activation-1.26e-02

Pos logit contributions
 forgot+0.017
'd+0.016
ave+0.016
 regretted+0.016
 remembered+0.015
Neg logit contributions
 lunch-0.005
 something-0.005
 ourselves-0.005
 with-0.005
 that-0.005
Token But
Token ID887
Feature activation-1.56e-02

Pos logit contributions
ach+0.005
ummy+0.005
 ourselves+0.005
 lunch+0.005
ings+0.005
Neg logit contributions
 forgot-0.006
'd-0.005
 missed-0.005
 knew-0.005
 remembered-0.005
Token it
Token ID340
Feature activation-7.77e-02

Pos logit contributions
 forgot+0.047
'd+0.043
 remembered+0.042
 missed+0.040
 regretted+0.040
Neg logit contributions
ummy-0.010
ach-0.010
 ourselves-0.009
ings-0.009
 lunch-0.008
Token did
Token ID750
Feature activation+2.47e-02

Pos logit contributions
 forgot+0.135
rie+0.131
ave+0.130
 competed+0.129
 officer+0.128
Neg logit contributions
 with-0.113
 the-0.111
 that-0.110
 something-0.108
 up-0.105
Token not
Token ID407
Feature activation-1.10e-02

Pos logit contributions
ummy+0.048
ach+0.045
ings+0.041
uggle+0.041
 ourselves+0.040
Neg logit contributions
 forgot-0.048
'd-0.040
 remembered-0.038
 missed-0.037
 hadn-0.036
Token.
Token ID13
Feature activation-2.18e-03

Pos logit contributions
 forgot+0.025
'd+0.023
 regretted+0.022
 remembered+0.021
 hadn+0.021
Neg logit contributions
 lunch-0.013
 ourselves-0.013
 something-0.013
?-0.013
 outside-0.012
Token It
Token ID632
Feature activation-7.71e-02

Pos logit contributions
 forgot+0.004
'd+0.004
 missed+0.004
 remembered+0.004
 hadn+0.004
Neg logit contributions
ach-0.003
 ourselves-0.003
ummy-0.003
ings-0.003
 lunch-0.003
Token ate
Token ID15063
Feature activation+4.17e-02

Pos logit contributions
 forgot+0.133
aries+0.125
 lifetime+0.125
rie+0.124
 competed+0.122
Neg logit contributions
 with-0.116
 the-0.113
.-0.111
 a-0.110
 that-0.109
Token Lily
Token ID20037
Feature activation-6.80e-02

Pos logit contributions
ach+0.048
ummy+0.043
uggle+0.038
uttering+0.038
?'+0.037
Neg logit contributions
 forgot-0.103
'd-0.091
 knew-0.091
 remembered-0.091
 missed-0.090
Token's
Token ID338
Feature activation-1.97e-02

Pos logit contributions
 forgot+0.123
'd+0.109
 remembered+0.098
 regretted+0.096
 missed+0.094
Neg logit contributions
ummy-0.122
 ourselves-0.121
 lunch-0.118
ings-0.117
ach-0.117
Token hand
Token ID1021
Feature activation-1.36e-02

Pos logit contributions
 forgot+0.041
'd+0.037
 remembered+0.034
 regretted+0.033
 missed+0.033
Neg logit contributions
ummy-0.030
 ourselves-0.030
ach-0.029
ings-0.028
 lunch-0.028
Token.
Token ID13
Feature activation-3.41e-03

Pos logit contributions
 forgot+0.041
'd+0.038
 remembered+0.036
 regretted+0.035
 missed+0.035
Neg logit contributions
ummy-0.009
 ourselves-0.008
ach-0.008
ings-0.007
 lunch-0.007
Token to
Token ID284
Feature activation+2.88e-02

Pos logit contributions
ach+0.008
ummy+0.008
 ourselves+0.008
ings+0.007
 yourselves+0.006
Neg logit contributions
 forgot-0.019
'd-0.017
 remembered-0.017
 missed-0.016
 regretted-0.016
Token nap
Token ID25422
Feature activation+4.11e-02

Pos logit contributions
ach+0.042
ummy+0.042
ings+0.041
uttering+0.035
ires+0.035
Neg logit contributions
 forgot-0.060
'd-0.054
 remembered-0.054
 knew-0.054
 missed-0.052
Token.
Token ID13
Feature activation+1.13e-04

Pos logit contributions
ummy+0.044
ach+0.042
 ourselves+0.041
uggle+0.040
ings+0.036
Neg logit contributions
 forgot-0.100
 remembered-0.091
'd-0.090
 knew-0.085
 needed-0.084
Token Every
Token ID3887
Feature activation+4.32e-03

Pos logit contributions
 ourselves+0.000
ummy+0.000
ach+0.000
ings+0.000
 lunch+0.000
Neg logit contributions
 forgot-0.000
'd-0.000
 remembered-0.000
 cared-0.000
 missed-0.000
Token morning
Token ID3329
Feature activation+2.09e-02

Pos logit contributions
ummy+0.006
 ourselves+0.006
ach+0.006
ings+0.006
 lunch+0.005
Neg logit contributions
 forgot-0.010
'd-0.009
 remembered-0.008
 regretted-0.008
 missed-0.008
Token he
Token ID339
Feature activation-1.03e-01

Pos logit contributions
 ourselves+0.019
ummy+0.019
ach+0.018
ings+0.017
 lunch+0.016
Neg logit contributions
 forgot-0.057
'd-0.052
 remembered-0.050
 regretted-0.048
 missed-0.048
Token woke
Token ID19092
Feature activation+3.12e-02

Pos logit contributions
 forgot+0.139
Uh+0.113
 proven+0.111
 sped+0.107
 hadn+0.106
Neg logit contributions
?"-0.209
 outside-0.206
?-0.204
.-0.199
 the-0.194
Token up
Token ID510
Feature activation+2.47e-02

Pos logit contributions
ions+0.035
ings+0.032
ach+0.032
room+0.032
ocolate+0.032
Neg logit contributions
 forgot-0.072
 regretted-0.065
 missed-0.065
 remembered-0.065
 knew-0.063
Token and
Token ID290
Feature activation-5.08e-02

Pos logit contributions
 ourselves+0.027
ach+0.024
ummy+0.023
 yourselves+0.022
ings+0.022
Neg logit contributions
 forgot-0.064
 remembered-0.057
'd-0.057
 missed-0.056
 knew-0.055
Token he
Token ID339
Feature activation-9.93e-02

Pos logit contributions
 forgot+0.073
'd+0.064
ave+0.052
 sped+0.052
 regretted+0.052
Neg logit contributions
 something-0.105
 up-0.104
 outside-0.103
 ourselves-0.102
 lunch-0.101
Token thanked
Token ID26280
Feature activation+2.87e-02

Pos logit contributions
 forgot+0.132
Uh+0.122
 competed+0.114
 winner+0.113
 sped+0.112
Neg logit contributions
?"-0.192
 the-0.190
.-0.190
 something-0.187
 outside-0.185
Token into
Token ID656
Feature activation+2.00e-02

Pos logit contributions
 forgot+0.075
'd+0.069
 remembered+0.065
 regretted+0.064
 missed+0.064
Neg logit contributions
ummy-0.021
 ourselves-0.019
ach-0.018
ings-0.017
 lunch-0.017
Token it
Token ID340
Feature activation-1.54e-02

Pos logit contributions
ummy+0.019
ach+0.017
uggle+0.015
time+0.015
 ourselves+0.015
Neg logit contributions
 forgot-0.054
'd-0.048
 struggled-0.048
 remembered-0.048
 knew-0.047
Token,
Token ID11
Feature activation-2.56e-02

Pos logit contributions
 forgot+0.043
'd+0.039
 remembered+0.037
 regretted+0.036
 missed+0.036
Neg logit contributions
ummy-0.013
 ourselves-0.013
ach-0.012
ings-0.012
 lunch-0.011
Token and
Token ID290
Feature activation-5.08e-02

Pos logit contributions
 forgot+0.066
'd+0.062
 regretted+0.057
ave+0.057
 remembered+0.057
Neg logit contributions
 something-0.024
 outside-0.023
 up-0.023
 lunch-0.022
 that-0.022
Token slowly
Token ID6364
Feature activation-3.32e-02

Pos logit contributions
 forgot+0.099
'd+0.095
 cared+0.084
ave+0.083
 regretted+0.082
Neg logit contributions
 something-0.078
 up-0.075
 the-0.075
 that-0.074
 lunch-0.074
Token he
Token ID339
Feature activation-8.81e-02

Pos logit contributions
 forgot+0.047
'd+0.040
 remembered+0.033
 cared+0.033
 missed+0.033
Neg logit contributions
 lunch-0.070
 something-0.069
ummy-0.069
 ourselves-0.069
 outside-0.069
Token made
Token ID925
Feature activation+2.23e-02

Pos logit contributions
 forgot+0.120
Uh+0.107
 cared+0.104
 officer+0.102
ience+0.100
Neg logit contributions
 up-0.168
 with-0.165
 something-0.162
 outside-0.161
 the-0.158
Token it
Token ID340
Feature activation-3.98e-02

Pos logit contributions
ummy+0.022
ach+0.020
 ourselves+0.019
ings+0.018
.'+0.017
Neg logit contributions
 forgot-0.059
'd-0.053
 knew-0.051
 remembered-0.051
 sped-0.050
Token to
Token ID284
Feature activation-3.75e-03

Pos logit contributions
 forgot+0.104
'd+0.095
 remembered+0.089
 regretted+0.088
 missed+0.087
Neg logit contributions
ummy-0.040
 ourselves-0.039
ach-0.037
 lunch-0.036
ings-0.036
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
 forgot+0.010
'd+0.009
 remembered+0.008
 knew+0.008
 missed+0.008
Neg logit contributions
ummy-0.005
ach-0.004
ings-0.004
 ourselves-0.003
time-0.003
Token top
Token ID1353
Feature activation-1.29e-02

Pos logit contributions
 forgot+0.073
'd+0.067
 remembered+0.064
 missed+0.063
 regretted+0.063
Neg logit contributions
ummy-0.020
 ourselves-0.020
 lunch-0.018
ach-0.017
ings-0.017