TOP ACTIVATIONS
MAX = 10.261

Lily and her family went back to the park . As
. So she happily went back to her chamber , understanding
, brother ." They went back to playing , safe and
Lily 's mom my went back to the store and asked
. They planned to go back soon ! <|endoftext|> Once there
dad , can I go back to bed ? I am
very brave and he went back to his play room .
L ily went back to her house , feeling
The little family went back to the park together and
next day , they went back to the park and Lily
could not wait to go back and discover more surprises .
the two best friends went back to playing and having fun
. The swing would go back and forth in a fun
!" They ran back to the hole . But
their raft around and went back to the pool . They
Lucy and her mom went back to the park . They
L ily 's dad went back to the bedroom and closed
the bow and ran off back home . The
He decided he should go back inside and get something else
mom and daughter happily went back inside .

MIN ACTIVATIONS
MIN = -9.212

are so silly ! Why does your dad have a mustache
not need water ? Why does it have sharp points ?"
in the desert ? Why does it not need water ?
is that man ? Why does he have a whistle ?"
kids so mean ? Why did they try to be something
a big mess . Why did you do that ?" her
that , mom ? Why did you stop me from eating
, " What ? Why did she quit ? What happened
are you fighting ? Why did you waste the food ?"
coats , shoes , and scar ves . Tom
animals , hats , and scar ves as props . They
swe aters and hats and scar ves from the wool .
this is gross . Why did he do this ?" Tom
Ben , why . Why did you do this . Why
were you thinking ? Why did you go inside the pengu
are you fighting ? Why did you take my paper ?
did you do ? Why did you not behave ? You
over the button ? Why did you break the thread ?
are you doing ? Why did you throw the potato ?
you like your hats and scar ves . You are our

INTERVAL 5.392 - 10.261
CONTAINS 0.620%

apologized and gave the toys back . L ily
waited for Jack to come back , but he never did
ir py loved to sing all day long . One day
more steps forward . <|endoftext|> One day , Mum and Dad
her happy . She went back to the barrel every day

INTERVAL 0.524 - 5.392
CONTAINS 37.668%

was so excited that she wanted to take it home with
it rolled right into the middle of the sandy pile .
boy decided to try some quiet time . He
sure . They think maybe they should go . But they
were best friends . They liked to play with toy soldiers

INTERVAL -4.344 - 0.524
CONTAINS 59.040%

Are you sure you want to make spaghetti for dinner ?
touch the c actus again . From that day on ,
nods . He likes the smell too . He smiles and
baby asked the m ama , " What has changed ?"
in her backyard . One day while playing , Sarah found

INTERVAL -9.212 - -4.344
CONTAINS 2.672%

forgive me ." The people looked at Ben .
with him . His friends said yes and started getting ready
keeper said it was a kit to make the cake .
long , tall grass and thought she might find something special
about who was better . But then , they saw a
Token Lily
Token ID20037
Feature activation+3.35e+00

Pos logit contributions
ashing+0.035
ashes+0.034
elly+0.033
ind+0.032
umpy+0.030
Neg logit contributions
 replied-0.019
lf-0.015
 responded-0.014
chief-0.014
 wouldn-0.013
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
ashing+0.189
ashes+0.188
elly+0.180
ind+0.166
umpy+0.162
Neg logit contributions
 replied-0.134
lf-0.122
chief-0.110
 responded-0.107
man-0.105
Token her
Token ID607
Feature activation+1.61e+00

Pos logit contributions
 replied+0.052
lf+0.047
chief+0.044
 responded+0.043
man+0.042
Neg logit contributions
ashing-0.055
ashes-0.055
elly-0.052
ind-0.048
umpy-0.045
Token family
Token ID1641
Feature activation+2.68e+00

Pos logit contributions
ashes+0.091
ashing+0.090
elly+0.086
ind+0.083
umpy+0.077
Neg logit contributions
 replied-0.065
lf-0.055
 responded-0.052
chief-0.051
 wouldn-0.048
Token went
Token ID1816
Feature activation+6.95e+00

Pos logit contributions
ashing+0.111
ashes+0.109
elly+0.105
ind+0.097
umpy+0.089
Neg logit contributions
 replied-0.153
lf-0.135
 responded-0.129
chief-0.128
 wouldn-0.124
Token back
Token ID736
Feature activation+9.62e+00

Pos logit contributions
elly+0.347
ashes+0.328
ashing+0.313
umpy+0.301
ind+0.294
Neg logit contributions
 replied-0.370
 said-0.298
man-0.287
lf-0.285
 responded-0.282
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
elly+0.489
ashing+0.483
ashes+0.483
umpy+0.462
ind+0.434
Neg logit contributions
 replied-0.459
 said-0.397
 rang-0.371
 answered-0.360
 told-0.345
Token the
Token ID262
Feature activation+9.30e-01

Pos logit contributions
ashing+0.058
elly+0.058
ashes+0.058
ind+0.052
umpy+0.052
Neg logit contributions
 replied-0.050
lf-0.042
 responded-0.039
chief-0.038
 rang-0.037
Token park
Token ID3952
Feature activation+2.35e+00

Pos logit contributions
ashing+0.048
ashes+0.047
elly+0.046
umpy+0.041
ind+0.041
Neg logit contributions
 replied-0.044
lf-0.038
man-0.035
 responded-0.035
chief-0.034
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
ashing+0.100
ashes+0.099
elly+0.096
ind+0.089
umpy+0.082
Neg logit contributions
 replied-0.132
lf-0.115
 responded-0.110
chief-0.109
 wouldn-0.106
Token As
Token ID1081
Feature activation+1.79e+00

Pos logit contributions
ashing+0.067
ashes+0.067
elly+0.065
ind+0.061
umpy+0.056
Neg logit contributions
 replied-0.081
lf-0.069
 responded-0.067
chief-0.065
 wouldn-0.064
Token.
Token ID13
Feature activation+9.88e-01

Pos logit contributions
ashing+0.091
ashes+0.090
elly+0.087
ind+0.081
umpy+0.075
Neg logit contributions
 replied-0.112
lf-0.096
 responded-0.093
chief-0.092
 wouldn-0.089
Token So
Token ID1406
Feature activation-9.48e-01

Pos logit contributions
ashing+0.046
ashes+0.046
elly+0.045
ind+0.041
umpy+0.039
Neg logit contributions
 replied-0.051
lf-0.044
 responded-0.042
chief-0.041
 wouldn-0.040
Token she
Token ID673
Feature activation+3.33e+00

Pos logit contributions
 replied+0.042
lf+0.037
chief+0.036
 responded+0.036
man+0.035
Neg logit contributions
ashing-0.043
ashes-0.042
elly-0.041
ind-0.039
away-0.037
Token happily
Token ID18177
Feature activation+5.66e+00

Pos logit contributions
ashing+0.179
ashes+0.178
elly+0.167
ind+0.163
umpy+0.151
Neg logit contributions
 replied-0.127
lf-0.115
chief-0.113
 responded-0.103
man-0.102
Token went
Token ID1816
Feature activation+5.90e+00

Pos logit contributions
ashing+0.314
ashes+0.309
elly+0.294
ind+0.280
umpy+0.263
Neg logit contributions
 replied-0.234
lf-0.206
chief-0.190
 responded-0.185
man-0.178
Token back
Token ID736
Feature activation+9.11e+00

Pos logit contributions
elly+0.251
ashes+0.249
ashing+0.248
ind+0.226
umpy+0.216
Neg logit contributions
 replied-0.336
lf-0.286
chief-0.275
 responded-0.272
man-0.269
Token to
Token ID284
Feature activation+8.24e-01

Pos logit contributions
ashing+0.373
ashes+0.368
elly+0.362
umpy+0.325
ind+0.323
Neg logit contributions
 replied-0.524
lf-0.446
 said-0.433
chief-0.430
 responded-0.427
Token her
Token ID607
Feature activation+3.99e+00

Pos logit contributions
ashing+0.039
ashes+0.038
elly+0.037
ind+0.035
umpy+0.032
Neg logit contributions
 replied-0.042
lf-0.036
 responded-0.035
chief-0.034
 wouldn-0.034
Token chamber
Token ID11847
Feature activation+3.12e+00

Pos logit contributions
ashing+0.214
ashes+0.212
elly+0.207
ind+0.190
umpy+0.184
Neg logit contributions
 replied-0.183
lf-0.156
 responded-0.144
chief-0.143
 wouldn-0.139
Token,
Token ID11
Feature activation+2.08e+00

Pos logit contributions
ashing+0.137
ashes+0.135
elly+0.131
ind+0.121
umpy+0.112
Neg logit contributions
 replied-0.171
lf-0.148
 responded-0.142
chief-0.140
 wouldn-0.137
Token understanding
Token ID4547
Feature activation-1.39e+00

Pos logit contributions
ashing+0.099
ashes+0.090
elly+0.084
ind+0.082
and+0.079
Neg logit contributions
 replied-0.105
lf-0.094
 responded-0.088
man-0.088
master-0.087
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
ashing+0.182
ashes+0.179
elly+0.170
ind+0.158
umpy+0.148
Neg logit contributions
 replied-0.212
lf-0.194
 responded-0.177
chief-0.176
man-0.171
Token brother
Token ID3956
Feature activation-3.80e-01

Pos logit contributions
ashes+0.051
ashing+0.051
elly+0.048
ind+0.044
umpy+0.043
Neg logit contributions
 replied-0.047
lf-0.043
chief-0.041
 responded-0.039
 rang-0.037
Token."
Token ID526
Feature activation+7.38e-01

Pos logit contributions
 replied+0.019
lf+0.017
chief+0.017
 responded+0.016
 wouldn+0.015
Neg logit contributions
ashing-0.017
ashes-0.017
elly-0.016
ind-0.015
umpy-0.014
Token They
Token ID1119
Feature activation+8.35e-01

Pos logit contributions
ashing+0.036
ashes+0.036
elly+0.035
ind+0.033
umpy+0.031
Neg logit contributions
 replied-0.036
lf-0.031
 responded-0.029
chief-0.029
 wouldn-0.028
Token went
Token ID1816
Feature activation+5.33e+00

Pos logit contributions
ashes+0.044
ashing+0.043
elly+0.039
ind+0.038
umpy+0.036
Neg logit contributions
 replied-0.035
lf-0.033
chief-0.030
man-0.029
 responded-0.028
Token back
Token ID736
Feature activation+9.11e+00

Pos logit contributions
elly+0.270
ind+0.248
ashing+0.245
umpy+0.242
ashes+0.237
Neg logit contributions
 replied-0.265
 said-0.217
man-0.216
lf-0.205
 responded-0.204
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
elly+0.513
ashing+0.510
umpy+0.505
ashes+0.496
ind+0.486
Neg logit contributions
 replied-0.364
 said-0.320
 answered-0.284
 rang-0.269
 told-0.267
Token playing
Token ID2712
Feature activation+4.15e+00

Pos logit contributions
elly+0.055
ashing+0.054
ashes+0.054
ind+0.050
umpy+0.050
Neg logit contributions
 replied-0.049
lf-0.041
 responded-0.038
chief-0.037
 said-0.037
Token,
Token ID11
Feature activation+2.89e+00

Pos logit contributions
ashing+0.186
elly+0.185
ashes+0.180
ind+0.175
umpy+0.162
Neg logit contributions
 replied-0.226
lf-0.196
 responded-0.182
chief-0.177
 wouldn-0.177
Token safe
Token ID3338
Feature activation+3.72e+00

Pos logit contributions
ashing+0.143
ashes+0.138
elly+0.127
ind+0.119
and+0.113
Neg logit contributions
 replied-0.145
lf-0.124
 responded-0.120
chief-0.119
man-0.115
Token and
Token ID290
Feature activation+5.51e-01

Pos logit contributions
ashing+0.190
elly+0.185
ashes+0.185
ind+0.174
umpy+0.166
Neg logit contributions
 replied-0.174
lf-0.152
 responded-0.138
man-0.137
chief-0.135
Token Lily
Token ID20037
Feature activation+1.54e+00

Pos logit contributions
ashing+0.075
ashes+0.074
elly+0.071
ind+0.066
and+0.064
Neg logit contributions
 replied-0.065
lf-0.057
chief-0.056
 responded-0.053
man-0.051
Token's
Token ID338
Feature activation-2.04e-01

Pos logit contributions
ashing+0.083
ashes+0.081
elly+0.080
ind+0.074
umpy+0.072
Neg logit contributions
 replied-0.064
lf-0.057
chief-0.054
 responded-0.052
man-0.050
Token mom
Token ID1995
Feature activation+2.16e+00

Pos logit contributions
 replied+0.008
lf+0.006
 responded+0.006
chief+0.006
 wouldn+0.006
Neg logit contributions
ashing-0.012
ashes-0.012
elly-0.012
ind-0.011
umpy-0.010
Tokenmy
Token ID1820
Feature activation-7.36e-01

Pos logit contributions
ashing+0.115
elly+0.106
ashes+0.103
ind+0.101
grown+0.096
Neg logit contributions
lf-0.081
 replied-0.080
chief-0.077
 button-0.070
 lid-0.067
Token went
Token ID1816
Feature activation+6.06e+00

Pos logit contributions
 replied+0.025
lf+0.024
chief+0.022
 responded+0.020
master+0.020
Neg logit contributions
ashing-0.042
ashes-0.040
elly-0.039
ind-0.037
umpy-0.036
Token back
Token ID736
Feature activation+9.09e+00

Pos logit contributions
elly+0.285
ashes+0.280
ashing+0.277
ind+0.255
umpy+0.254
Neg logit contributions
 replied-0.319
lf-0.257
man-0.255
 said-0.254
 responded-0.249
Token to
Token ID284
Feature activation+6.51e-01

Pos logit contributions
ashing+0.420
ashes+0.413
elly+0.404
umpy+0.380
ind+0.370
Neg logit contributions
 replied-0.480
 said-0.397
lf-0.386
 rang-0.381
 responded-0.377
Token the
Token ID262
Feature activation+2.83e-01

Pos logit contributions
ashing+0.033
ashes+0.033
elly+0.032
ind+0.031
umpy+0.028
Neg logit contributions
 replied-0.030
lf-0.026
 responded-0.025
chief-0.024
 wouldn-0.024
Token store
Token ID3650
Feature activation+2.56e+00

Pos logit contributions
ashing+0.015
ashes+0.014
elly+0.014
ind+0.013
umpy+0.012
Neg logit contributions
 replied-0.014
lf-0.012
 responded-0.011
chief-0.011
man-0.010
Token and
Token ID290
Feature activation-8.12e-01

Pos logit contributions
ashing+0.110
ashes+0.107
elly+0.104
ind+0.096
umpy+0.089
Neg logit contributions
 replied-0.142
lf-0.125
 responded-0.120
chief-0.119
 wouldn-0.115
Token asked
Token ID1965
Feature activation-2.55e-01

Pos logit contributions
 replied+0.034
chief+0.030
lf+0.030
master+0.029
 responded+0.029
Neg logit contributions
ashing-0.042
ashes-0.039
elly-0.037
ind-0.035
grown-0.034
Token.
Token ID13
Feature activation+9.00e-01

Pos logit contributions
ashing+0.071
ashes+0.069
elly+0.063
ind+0.058
away+0.054
Neg logit contributions
 replied-0.137
lf-0.125
 responded-0.119
chief-0.118
man-0.115
Token They
Token ID1119
Feature activation+1.55e+00

Pos logit contributions
ashing+0.041
ashes+0.040
elly+0.039
ind+0.036
umpy+0.034
Neg logit contributions
 replied-0.047
lf-0.041
chief-0.039
 responded-0.039
 wouldn-0.038
Token planned
Token ID6027
Feature activation+2.53e+00

Pos logit contributions
ashes+0.083
ashing+0.082
elly+0.076
ind+0.074
umpy+0.070
Neg logit contributions
 replied-0.064
lf-0.058
chief-0.055
man-0.052
 responded-0.051
Token to
Token ID284
Feature activation-3.87e-01

Pos logit contributions
ashing+0.102
ashes+0.101
elly+0.096
ind+0.089
umpy+0.081
Neg logit contributions
 replied-0.148
lf-0.129
 responded-0.125
chief-0.123
 wouldn-0.120
Token go
Token ID467
Feature activation+6.23e+00

Pos logit contributions
 replied+0.021
chief+0.018
 responded+0.018
lf+0.018
 wouldn+0.018
Neg logit contributions
ashes-0.015
ashing-0.014
ind-0.014
elly-0.013
away-0.013
Token back
Token ID736
Feature activation+9.08e+00

Pos logit contributions
elly+0.297
ashing+0.282
ashes+0.281
ind+0.270
umpy+0.260
Neg logit contributions
 replied-0.326
 said-0.264
lf-0.260
man-0.254
 responded-0.250
Token soon
Token ID2582
Feature activation+3.35e+00

Pos logit contributions
ashing+0.458
ashes+0.450
elly+0.439
umpy+0.413
ind+0.395
Neg logit contributions
 replied-0.430
 said-0.363
lf-0.350
 rang-0.342
 answered-0.335
Token!
Token ID0
Feature activation+9.75e-01

Pos logit contributions
ashing+0.145
ashes+0.144
elly+0.140
ind+0.129
umpy+0.120
Neg logit contributions
 replied-0.185
lf-0.161
 responded-0.154
chief-0.152
 wouldn-0.149
Token<|endoftext|>
Token ID50256
Feature activation+2.68e+00

Pos logit contributions
ashing+0.047
ashes+0.047
elly+0.045
ind+0.042
umpy+0.040
Neg logit contributions
 replied-0.048
lf-0.042
 responded-0.040
chief-0.039
 wouldn-0.038
TokenOnce
Token ID7454
Feature activation-1.29e-01

Pos logit contributions
ashing+0.131
ashes+0.129
elly+0.123
ind+0.118
umpy+0.108
Neg logit contributions
 replied-0.136
lf-0.115
 responded-0.110
chief-0.107
man-0.105
Token there
Token ID612
Feature activation-4.13e-01

Pos logit contributions
 replied+0.007
lf+0.006
chief+0.006
 responded+0.006
 rang+0.006
Neg logit contributions
ashing-0.005
elly-0.005
ashes-0.005
ind-0.005
umpy-0.004
Token dad
Token ID9955
Feature activation+9.38e-01

Pos logit contributions
ashes+0.048
ashing+0.048
elly+0.047
ind+0.041
grown+0.041
Neg logit contributions
 replied-0.048
lf-0.043
chief-0.041
 responded-0.041
master-0.039
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
ashing+0.038
ashes+0.036
elly+0.036
ind+0.032
oky+0.030
Neg logit contributions
 replied-0.049
lf-0.047
chief-0.043
 responded-0.042
master-0.041
Token can
Token ID460
Feature activation-2.61e+00

Pos logit contributions
ashing+0.051
ashes+0.050
elly+0.049
ind+0.044
grown+0.043
Neg logit contributions
 replied-0.050
lf-0.043
chief-0.042
 responded-0.041
master-0.039
Token I
Token ID314
Feature activation+5.29e+00

Pos logit contributions
 replied+0.138
lf+0.125
 responded+0.125
flake+0.123
man+0.120
Neg logit contributions
ashes-0.088
ashing-0.086
ind-0.080
elly-0.080
 hate-0.071
Token go
Token ID467
Feature activation+5.95e+00

Pos logit contributions
ashing+0.266
ashes+0.265
elly+0.255
ind+0.239
umpy+0.227
Neg logit contributions
 replied-0.251
lf-0.218
 responded-0.204
chief-0.204
man-0.193
Token back
Token ID736
Feature activation+9.08e+00

Pos logit contributions
ashing+0.272
ashes+0.268
elly+0.268
ind+0.242
umpy+0.232
Neg logit contributions
 replied-0.318
lf-0.266
 responded-0.254
chief-0.254
man-0.252
Token to
Token ID284
Feature activation+7.42e-01

Pos logit contributions
ashing+0.478
ashes+0.471
elly+0.448
ind+0.422
umpy+0.405
Neg logit contributions
 replied-0.432
lf-0.356
 responded-0.337
 said-0.333
man-0.330
Token bed
Token ID3996
Feature activation-6.78e-01

Pos logit contributions
ashing+0.045
ashes+0.044
elly+0.043
ind+0.041
umpy+0.039
Neg logit contributions
 replied-0.029
lf-0.024
 responded-0.022
chief-0.022
 wouldn-0.020
Token?
Token ID30
Feature activation-1.29e+00

Pos logit contributions
 replied+0.033
lf+0.029
 responded+0.026
man+0.026
chief+0.026
Neg logit contributions
ashes-0.034
ashing-0.034
elly-0.032
ind-0.030
umpy-0.029
Token I
Token ID314
Feature activation+4.08e+00

Pos logit contributions
 replied+0.064
lf+0.054
 responded+0.052
chief+0.051
 wouldn+0.050
Neg logit contributions
ashing-0.064
ashes-0.063
elly-0.061
ind-0.057
umpy-0.053
Token am
Token ID716
Feature activation-3.51e-01

Pos logit contributions
ashing+0.178
ashes+0.175
elly+0.169
ind+0.155
umpy+0.148
Neg logit contributions
 replied-0.223
lf-0.200
chief-0.186
 responded-0.186
man-0.179
Token very
Token ID845
Feature activation+4.34e+00

Pos logit contributions
ashing+0.052
ashes+0.050
elly+0.048
ind+0.047
umpy+0.044
Neg logit contributions
 replied-0.063
lf-0.055
chief-0.053
 responded-0.052
 wouldn-0.051
Token brave
Token ID14802
Feature activation+2.90e+00

Pos logit contributions
ashes+0.209
ashing+0.209
elly+0.205
ind+0.189
and+0.173
Neg logit contributions
 replied-0.222
lf-0.182
 responded-0.181
 said-0.173
 wouldn-0.169
Token and
Token ID290
Feature activation+3.90e-02

Pos logit contributions
ashing+0.148
ashes+0.147
elly+0.141
ind+0.135
umpy+0.125
Neg logit contributions
 replied-0.140
lf-0.116
 responded-0.112
chief-0.110
 wouldn-0.108
Token he
Token ID339
Feature activation+2.61e+00

Pos logit contributions
ashing+0.002
ashes+0.002
elly+0.002
umpy+0.002
away+0.002
Neg logit contributions
 replied-0.002
lf-0.002
chief-0.001
flake-0.001
on-0.001
Token went
Token ID1816
Feature activation+5.88e+00

Pos logit contributions
ashes+0.130
ashing+0.130
elly+0.119
umpy+0.116
ind+0.116
Neg logit contributions
 replied-0.106
lf-0.100
chief-0.098
 Steven-0.091
flake-0.085
Token back
Token ID736
Feature activation+9.07e+00

Pos logit contributions
elly+0.262
ashing+0.259
ashes+0.256
ind+0.235
umpy+0.222
Neg logit contributions
 replied-0.321
lf-0.270
chief-0.262
 responded-0.261
man-0.257
Token to
Token ID284
Feature activation+5.93e-01

Pos logit contributions
ashing+0.400
elly+0.395
ashes+0.390
ind+0.355
umpy+0.351
Neg logit contributions
 replied-0.493
 said-0.407
 responded-0.399
lf-0.397
chief-0.396
Token his
Token ID465
Feature activation+2.85e+00

Pos logit contributions
ashes+0.026
ashing+0.026
ind+0.024
elly+0.023
away+0.021
Neg logit contributions
 replied-0.031
lf-0.026
 responded-0.026
 wouldn-0.026
chief-0.025
Token play
Token ID711
Feature activation+3.02e+00

Pos logit contributions
ashing+0.154
ashes+0.152
elly+0.149
ind+0.138
umpy+0.132
Neg logit contributions
 replied-0.129
lf-0.109
 responded-0.102
chief-0.100
 wouldn-0.097
Tokenroom
Token ID3823
Feature activation-5.01e-01

Pos logit contributions
ashing+0.128
ashes+0.126
elly+0.124
ind+0.115
umpy+0.105
Neg logit contributions
 replied-0.171
lf-0.152
 responded-0.142
man-0.138
chief-0.138
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 replied+0.027
lf+0.023
 responded+0.022
chief+0.022
 wouldn+0.022
Neg logit contributions
ashing-0.022
ashes-0.022
elly-0.021
ind-0.020
umpy-0.019
Token
Token ID198
Feature activation-4.52e-01

Pos logit contributions
 replied+0.157
lf+0.147
chief+0.138
 wouldn+0.136
 responded+0.136
Neg logit contributions
ashing-0.128
ashes-0.124
elly-0.123
umpy-0.108
ind-0.108
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
 replied+0.022
lf+0.019
 responded+0.018
chief+0.017
 wouldn+0.017
Neg logit contributions
ashing-0.023
ashes-0.022
elly-0.022
ind-0.021
umpy-0.019
TokenL
Token ID43
Feature activation+1.80e+00

Pos logit contributions
 replied+0.023
lf+0.020
 responded+0.019
chief+0.018
man+0.018
Neg logit contributions
ashes-0.019
ashing-0.019
elly-0.018
ind-0.017
oky-0.015
Tokenily
Token ID813
Feature activation+1.38e+00

Pos logit contributions
ashes+0.068
ashing+0.068
elly+0.061
ind+0.057
oky+0.051
Neg logit contributions
 replied-0.113
lf-0.100
 responded-0.094
 wouldn-0.091
man-0.090
Token went
Token ID1816
Feature activation+5.14e+00

Pos logit contributions
ashing+0.080
ashes+0.078
elly+0.075
ind+0.071
umpy+0.068
Neg logit contributions
lf-0.045
 replied-0.043
chief-0.040
man-0.038
master-0.038
Token back
Token ID736
Feature activation+9.05e+00

Pos logit contributions
elly+0.228
ashing+0.222
ashes+0.221
ind+0.207
umpy+0.196
Neg logit contributions
 replied-0.283
lf-0.240
man-0.229
 responded-0.227
 said-0.226
Token to
Token ID284
Feature activation+8.07e-01

Pos logit contributions
ashing+0.411
ashes+0.407
elly+0.394
ind+0.365
umpy+0.358
Neg logit contributions
 replied-0.486
lf-0.408
 said-0.396
 responded-0.388
man-0.385
Token her
Token ID607
Feature activation+3.78e+00

Pos logit contributions
ashing+0.040
ashes+0.039
elly+0.038
ind+0.036
umpy+0.033
Neg logit contributions
 replied-0.040
lf-0.034
 responded-0.033
chief-0.032
 wouldn-0.031
Token house
Token ID2156
Feature activation+1.76e+00

Pos logit contributions
ashing+0.211
ashes+0.208
elly+0.203
ind+0.193
umpy+0.181
Neg logit contributions
 replied-0.162
lf-0.133
 responded-0.128
chief-0.125
 wouldn-0.121
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
ashing+0.078
ashes+0.076
elly+0.074
ind+0.068
umpy+0.063
Neg logit contributions
 replied-0.095
lf-0.082
 responded-0.079
chief-0.078
 wouldn-0.076
Token feeling
Token ID4203
Feature activation+1.56e+00

Pos logit contributions
ashing+0.077
ashes+0.070
elly+0.067
ind+0.066
and+0.060
Neg logit contributions
 replied-0.082
lf-0.073
 responded-0.069
chief-0.067
man-0.067
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
 replied+0.005
lf+0.004
 responded+0.004
chief+0.004
man+0.004
Neg logit contributions
ashing-0.006
ashes-0.006
elly-0.005
ind-0.005
umpy-0.005
TokenThe
Token ID464
Feature activation-3.65e+00

Pos logit contributions
 replied+0.007
lf+0.006
 responded+0.005
man+0.005
 Steven+0.005
Neg logit contributions
ashes-0.007
ashing-0.007
elly-0.006
ind-0.006
oky-0.005
Token little
Token ID1310
Feature activation-7.74e-02

Pos logit contributions
 replied+0.165
chief+0.136
lf+0.136
 responded+0.135
 wouldn+0.129
Neg logit contributions
ashing-0.188
ashes-0.188
elly-0.182
ind-0.177
umpy-0.162
Token family
Token ID1641
Feature activation+2.53e+00

Pos logit contributions
 replied+0.004
lf+0.003
 responded+0.003
chief+0.003
 wouldn+0.003
Neg logit contributions
ashing-0.004
ashes-0.004
elly-0.004
ind-0.004
umpy-0.003
Token went
Token ID1816
Feature activation+5.53e+00

Pos logit contributions
ashing+0.151
ashes+0.149
elly+0.145
ind+0.138
umpy+0.132
Neg logit contributions
 replied-0.086
chief-0.076
lf-0.073
 responded-0.066
 Steven-0.064
Token back
Token ID736
Feature activation+9.04e+00

Pos logit contributions
elly+0.253
ashes+0.248
ashing+0.247
ind+0.228
umpy+0.221
Neg logit contributions
 replied-0.299
lf-0.247
man-0.242
 responded-0.241
 said-0.235
Token to
Token ID284
Feature activation+4.73e-01

Pos logit contributions
ashing+0.430
ashes+0.424
elly+0.418
umpy+0.391
ind+0.384
Neg logit contributions
 replied-0.457
 said-0.380
lf-0.366
 rang-0.364
 answered-0.362
Token the
Token ID262
Feature activation+2.84e-01

Pos logit contributions
ashing+0.024
ashes+0.024
elly+0.023
ind+0.022
umpy+0.020
Neg logit contributions
 replied-0.022
lf-0.019
 responded-0.018
chief-0.018
 wouldn-0.018
Token park
Token ID3952
Feature activation+1.66e+00

Pos logit contributions
ashing+0.015
ashes+0.015
elly+0.014
ind+0.013
umpy+0.013
Neg logit contributions
 replied-0.013
lf-0.011
 responded-0.011
man-0.010
chief-0.010
Token together
Token ID1978
Feature activation+2.00e+00

Pos logit contributions
ashing+0.070
ashes+0.067
elly+0.064
ind+0.058
umpy+0.054
Neg logit contributions
 replied-0.094
lf-0.084
chief-0.080
 responded-0.080
 wouldn-0.077
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
ashing+0.079
ashes+0.077
elly+0.074
ind+0.067
umpy+0.061
Neg logit contributions
 replied-0.117
lf-0.105
chief-0.100
 responded-0.100
man-0.096
Token next
Token ID1306
Feature activation+3.38e+00

Pos logit contributions
 replied+0.167
lf+0.142
chief+0.139
 responded+0.138
 wouldn+0.134
Neg logit contributions
ashes-0.190
ashing-0.187
elly-0.183
ind-0.176
umpy-0.163
Token day
Token ID1110
Feature activation+1.05e-01

Pos logit contributions
ashes+0.142
elly+0.138
ashing+0.137
away+0.115
oky+0.114
Neg logit contributions
 replied-0.176
chief-0.164
lf-0.160
 wouldn-0.159
 Steven-0.152
Token,
Token ID11
Feature activation+1.45e+00

Pos logit contributions
ashing+0.005
ashes+0.005
elly+0.005
ind+0.004
umpy+0.004
Neg logit contributions
 replied-0.005
lf-0.005
chief-0.004
 responded-0.004
 wouldn-0.004
Token they
Token ID484
Feature activation+2.62e+00

Pos logit contributions
ashing+0.066
ashes+0.064
elly+0.062
ind+0.056
and+0.053
Neg logit contributions
 replied-0.073
lf-0.065
chief-0.063
 responded-0.062
 wouldn-0.059
Token went
Token ID1816
Feature activation+5.55e+00

Pos logit contributions
ashing+0.140
ashes+0.138
elly+0.131
ind+0.126
umpy+0.118
Neg logit contributions
 replied-0.113
lf-0.100
chief-0.095
 responded-0.091
man-0.088
Token back
Token ID736
Feature activation+9.03e+00

Pos logit contributions
elly+0.279
ashing+0.260
ashes+0.258
ind+0.255
umpy+0.253
Neg logit contributions
 replied-0.278
man-0.225
 said-0.221
lf-0.217
 nose-0.212
Token to
Token ID284
Feature activation+8.66e-01

Pos logit contributions
ashing+0.445
ashes+0.438
elly+0.435
umpy+0.417
ind+0.410
Neg logit contributions
 replied-0.440
 said-0.370
 answered-0.345
 rang-0.345
lf-0.340
Token the
Token ID262
Feature activation+7.26e-01

Pos logit contributions
ashing+0.049
elly+0.048
ashes+0.048
ind+0.044
umpy+0.044
Neg logit contributions
 replied-0.038
lf-0.032
 responded-0.029
chief-0.029
 nose-0.028
Token park
Token ID3952
Feature activation+1.97e+00

Pos logit contributions
ashing+0.037
ashes+0.036
elly+0.035
ind+0.032
umpy+0.031
Neg logit contributions
 replied-0.035
lf-0.031
 responded-0.028
chief-0.028
man-0.028
Token and
Token ID290
Feature activation-5.55e-01

Pos logit contributions
ashing+0.086
ashes+0.085
elly+0.081
ind+0.075
umpy+0.069
Neg logit contributions
 replied-0.108
lf-0.095
 responded-0.091
chief-0.090
 wouldn-0.087
Token Lily
Token ID20037
Feature activation+1.62e+00

Pos logit contributions
 replied+0.026
lf+0.024
chief+0.023
master+0.022
man+0.022
Neg logit contributions
ashing-0.027
ashes-0.026
elly-0.023
ind-0.022
and-0.021
Token could
Token ID714
Feature activation-3.58e-01

Pos logit contributions
ashing+0.153
ashes+0.150
elly+0.142
ind+0.132
umpy+0.127
Neg logit contributions
 replied-0.133
lf-0.130
chief-0.120
 responded-0.110
flake-0.109
Token not
Token ID407
Feature activation-1.70e-01

Pos logit contributions
 replied+0.017
lf+0.015
chief+0.014
 responded+0.014
 Steven+0.013
Neg logit contributions
ashing-0.017
ashes-0.017
elly-0.015
ind-0.015
umpy-0.014
Token wait
Token ID4043
Feature activation+2.65e+00

Pos logit contributions
 replied+0.008
lf+0.007
 responded+0.006
chief+0.006
 wouldn+0.006
Neg logit contributions
ashing-0.009
ashes-0.009
elly-0.008
ind-0.008
umpy-0.007
Token to
Token ID284
Feature activation-1.52e-01

Pos logit contributions
ashing+0.119
ashes+0.118
elly+0.117
ind+0.109
umpy+0.102
Neg logit contributions
 replied-0.142
lf-0.121
 responded-0.117
chief-0.114
 wouldn-0.113
Token go
Token ID467
Feature activation+6.00e+00

Pos logit contributions
 replied+0.007
lf+0.006
chief+0.006
 responded+0.006
 wouldn+0.006
Neg logit contributions
ashes-0.008
ashing-0.008
elly-0.007
ind-0.007
away-0.006
Token back
Token ID736
Feature activation+9.02e+00

Pos logit contributions
elly+0.281
ashing+0.276
ashes+0.273
ind+0.250
umpy+0.241
Neg logit contributions
 replied-0.320
 said-0.259
lf-0.254
 responded-0.254
man-0.251
Token and
Token ID290
Feature activation+5.57e-02

Pos logit contributions
ashing+0.438
ashes+0.431
elly+0.421
ind+0.387
umpy+0.374
Neg logit contributions
 replied-0.461
 said-0.379
lf-0.373
 responded-0.366
man-0.356
Token discover
Token ID7073
Feature activation-1.73e+00

Pos logit contributions
ashing+0.002
ashes+0.002
elly+0.002
ind+0.002
away+0.002
Neg logit contributions
 replied-0.003
lf-0.003
chief-0.002
 responded-0.002
on-0.002
Token more
Token ID517
Feature activation+4.75e+00

Pos logit contributions
 replied+0.083
lf+0.070
 responded+0.067
chief+0.066
 wouldn+0.064
Neg logit contributions
ashing-0.088
ashes-0.088
elly-0.085
ind-0.080
umpy-0.075
Token surprises
Token ID24072
Feature activation+1.71e+00

Pos logit contributions
ashing+0.224
ashes+0.219
elly+0.211
ind+0.198
umpy+0.184
Neg logit contributions
 replied-0.248
lf-0.217
 responded-0.203
chief-0.195
 wouldn-0.195
Token.
Token ID13
Feature activation+5.65e-01

Pos logit contributions
ashing+0.074
ashes+0.073
elly+0.071
ind+0.066
umpy+0.061
Neg logit contributions
 replied-0.094
lf-0.082
 responded-0.078
chief-0.077
 wouldn-0.075
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
 replied+0.091
lf+0.083
chief+0.076
man+0.073
 responded+0.073
Neg logit contributions
ashing-0.115
ashes-0.114
elly-0.104
ind-0.100
 alone-0.093
Token two
Token ID734
Feature activation+7.41e-01

Pos logit contributions
 replied+0.079
lf+0.066
 responded+0.064
chief+0.064
 wouldn+0.061
Neg logit contributions
ashes-0.086
ashing-0.085
elly-0.082
ind-0.079
umpy-0.073
Token best
Token ID1266
Feature activation+3.99e+00

Pos logit contributions
ashing+0.038
ashes+0.038
elly+0.037
ind+0.035
umpy+0.032
Neg logit contributions
 replied-0.035
lf-0.030
 responded-0.028
chief-0.028
 wouldn-0.027
Token friends
Token ID2460
Feature activation+3.48e+00

Pos logit contributions
ashing+0.177
ashes+0.176
elly+0.169
ind+0.156
umpy+0.144
Neg logit contributions
 replied-0.214
lf-0.187
chief-0.180
 responded-0.179
 wouldn-0.173
Token went
Token ID1816
Feature activation+6.05e+00

Pos logit contributions
ashing+0.191
ashes+0.189
elly+0.183
ind+0.173
umpy+0.164
Neg logit contributions
 replied-0.148
lf-0.127
chief-0.118
 responded-0.117
man-0.111
Token back
Token ID736
Feature activation+9.01e+00

Pos logit contributions
elly+0.294
ashing+0.281
ashes+0.280
ind+0.261
umpy+0.256
Neg logit contributions
 replied-0.312
lf-0.250
 responded-0.247
man-0.246
 said-0.242
Token to
Token ID284
Feature activation+4.02e-01

Pos logit contributions
elly+0.397
ashing+0.394
ashes+0.389
ind+0.357
umpy+0.353
Neg logit contributions
 replied-0.488
 said-0.398
 responded-0.396
lf-0.395
chief-0.388
Token playing
Token ID2712
Feature activation+3.79e+00

Pos logit contributions
ashing+0.019
ashes+0.019
elly+0.018
ind+0.017
umpy+0.015
Neg logit contributions
 replied-0.021
lf-0.018
 responded-0.017
chief-0.017
 wouldn-0.017
Token and
Token ID290
Feature activation-5.04e-01

Pos logit contributions
ashing+0.165
elly+0.163
ashes+0.160
ind+0.150
umpy+0.141
Neg logit contributions
 replied-0.213
lf-0.188
 responded-0.173
man-0.169
 wouldn-0.168
Token having
Token ID1719
Feature activation+1.18e-02

Pos logit contributions
 replied+0.029
lf+0.025
chief+0.025
 responded+0.025
master+0.024
Neg logit contributions
ashing-0.021
ashes-0.020
elly-0.016
ind-0.016
and-0.015
Token fun
Token ID1257
Feature activation+4.87e+00

Pos logit contributions
ashing+0.001
ashes+0.001
elly+0.001
ind+0.000
umpy+0.000
Neg logit contributions
 replied-0.001
lf-0.001
 responded-0.001
chief-0.000
 wouldn-0.000
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
 replied+0.068
lf+0.060
 responded+0.058
chief+0.058
 Steven+0.056
Neg logit contributions
ashes-0.069
ashing-0.068
elly-0.062
ind-0.061
umpy-0.058
Token The
Token ID383
Feature activation-3.69e+00

Pos logit contributions
ashing+0.076
ashes+0.075
elly+0.073
ind+0.067
umpy+0.063
Neg logit contributions
 replied-0.096
lf-0.084
 responded-0.080
chief-0.079
 wouldn-0.078
Token swing
Token ID9628
Feature activation-1.80e+00

Pos logit contributions
 replied+0.167
lf+0.141
chief+0.139
 responded+0.137
 wouldn+0.132
Neg logit contributions
ashes-0.186
ashing-0.186
elly-0.181
ind-0.175
umpy-0.160
Token would
Token ID561
Feature activation-7.03e-01

Pos logit contributions
 replied+0.061
lf+0.054
chief+0.053
 Steven+0.053
 Andrew+0.049
Neg logit contributions
ashes-0.107
ashing-0.105
elly-0.100
umpy-0.095
ind-0.091
Token go
Token ID467
Feature activation+5.83e+00

Pos logit contributions
 replied+0.034
lf+0.030
chief+0.029
 responded+0.029
 Steven+0.028
Neg logit contributions
ashes-0.033
ashing-0.032
ind-0.028
elly-0.027
away-0.027
Token back
Token ID736
Feature activation+9.00e+00

Pos logit contributions
ashing+0.241
ashes+0.237
elly+0.228
ind+0.212
umpy+0.194
Neg logit contributions
 replied-0.335
lf-0.294
 responded-0.282
chief-0.278
 wouldn-0.270
Token and
Token ID290
Feature activation+2.07e-01

Pos logit contributions
ashing+0.402
ashes+0.397
elly+0.386
ind+0.356
umpy+0.336
Neg logit contributions
 replied-0.486
lf-0.418
 responded-0.400
chief-0.394
 wouldn-0.389
Token forth
Token ID6071
Feature activation-9.33e-01

Pos logit contributions
ashes+0.009
ashing+0.009
ind+0.008
elly+0.007
away+0.007
Neg logit contributions
 replied-0.011
lf-0.010
 responded-0.009
man-0.009
chief-0.009
Token in
Token ID287
Feature activation+1.55e+00

Pos logit contributions
 replied+0.049
lf+0.042
 responded+0.041
chief+0.040
 wouldn+0.039
Neg logit contributions
ashing-0.043
ashes-0.042
elly-0.041
ind-0.038
umpy-0.036
Token a
Token ID257
Feature activation-3.30e-01

Pos logit contributions
ashes+0.081
ashing+0.081
elly+0.080
ind+0.075
umpy+0.070
Neg logit contributions
 replied-0.070
lf-0.057
 responded-0.055
chief-0.055
 wouldn-0.053
Token fun
Token ID1257
Feature activation+5.76e+00

Pos logit contributions
 replied+0.017
lf+0.014
 responded+0.014
chief+0.013
 wouldn+0.013
Neg logit contributions
ashing-0.016
ashes-0.016
elly-0.015
ind-0.014
umpy-0.013
Token!"
Token ID2474
Feature activation+4.91e-01

Pos logit contributions
ashing+0.075
ashes+0.074
elly+0.071
ind+0.066
umpy+0.062
Neg logit contributions
 replied-0.089
lf-0.081
chief-0.077
 responded-0.075
 wouldn-0.072
Token
Token ID198
Feature activation+8.23e-02

Pos logit contributions
ashes+0.024
ashing+0.024
elly+0.023
ind+0.021
umpy+0.021
Neg logit contributions
 replied-0.022
lf-0.021
chief-0.019
 wouldn-0.019
 responded-0.018
Token
Token ID198
Feature activation+4.93e-03

Pos logit contributions
ashing+0.004
ashes+0.004
elly+0.004
ind+0.004
umpy+0.004
Neg logit contributions
 replied-0.004
lf-0.003
 responded-0.003
chief-0.003
man-0.003
TokenThey
Token ID2990
Feature activation+8.80e-01

Pos logit contributions
ashing+0.000
ashes+0.000
elly+0.000
ind+0.000
oky+0.000
Neg logit contributions
 replied-0.000
lf-0.000
 responded-0.000
 Steven-0.000
man-0.000
Token ran
Token ID4966
Feature activation+5.03e+00

Pos logit contributions
ashes+0.046
ashing+0.044
ind+0.040
elly+0.040
away+0.037
Neg logit contributions
 replied-0.036
lf-0.034
chief-0.032
man-0.031
 responded-0.029
Token back
Token ID736
Feature activation+8.99e+00

Pos logit contributions
elly+0.267
ashing+0.247
ind+0.244
ashes+0.237
umpy+0.235
Neg logit contributions
 replied-0.237
 said-0.191
man-0.186
 responded-0.180
 wouldn-0.179
Token to
Token ID284
Feature activation+8.48e-01

Pos logit contributions
ashing+0.445
elly+0.437
ashes+0.428
ind+0.402
umpy+0.401
Neg logit contributions
 replied-0.433
 said-0.358
lf-0.341
man-0.341
 responded-0.338
Token the
Token ID262
Feature activation+7.99e-01

Pos logit contributions
ashing+0.044
ashes+0.044
elly+0.042
ind+0.040
umpy+0.037
Neg logit contributions
 replied-0.039
lf-0.034
 responded-0.032
chief-0.031
 wouldn-0.031
Token hole
Token ID7604
Feature activation-6.30e-01

Pos logit contributions
ashing+0.043
ashes+0.042
elly+0.042
ind+0.038
umpy+0.037
Neg logit contributions
 replied-0.036
lf-0.031
man-0.028
 responded-0.028
 nose-0.027
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
 replied+0.034
lf+0.029
 responded+0.028
chief+0.028
 wouldn+0.027
Neg logit contributions
ashing-0.028
ashes-0.028
elly-0.027
ind-0.025
umpy-0.023
Token But
Token ID887
Feature activation-4.79e+00

Pos logit contributions
ashing+0.051
ashes+0.051
elly+0.049
ind+0.045
umpy+0.041
Neg logit contributions
 replied-0.083
lf-0.073
 responded-0.071
chief-0.070
 wouldn-0.068
Token their
Token ID511
Feature activation+2.84e+00

Pos logit contributions
elly+0.026
ashing+0.025
ashes+0.025
umpy+0.024
ind+0.023
Neg logit contributions
 replied-0.019
lf-0.015
 nose-0.014
 said-0.014
chief-0.014
Token raft
Token ID31812
Feature activation+1.38e-01

Pos logit contributions
ashing+0.143
ashes+0.142
elly+0.137
ind+0.130
umpy+0.121
Neg logit contributions
 replied-0.136
lf-0.116
 responded-0.110
chief-0.108
 wouldn-0.105
Token around
Token ID1088
Feature activation+2.87e+00

Pos logit contributions
ashing+0.007
ashes+0.006
elly+0.006
ind+0.005
umpy+0.005
Neg logit contributions
 replied-0.007
lf-0.006
chief-0.006
 responded-0.006
 Steven-0.006
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
ashing+0.145
ashes+0.143
elly+0.138
ind+0.132
umpy+0.121
Neg logit contributions
 replied-0.136
lf-0.115
 responded-0.111
chief-0.110
 wouldn-0.105
Token went
Token ID1816
Feature activation+5.66e+00

Pos logit contributions
 replied+0.054
lf+0.050
chief+0.044
 responded+0.044
man+0.044
Neg logit contributions
ashing-0.069
ashes-0.069
elly-0.060
ind-0.059
away-0.055
Token back
Token ID736
Feature activation+8.99e+00

Pos logit contributions
elly+0.241
ashes+0.236
ashing+0.235
ind+0.215
umpy+0.203
Neg logit contributions
 replied-0.321
lf-0.270
chief-0.264
 responded-0.263
 wouldn-0.257
Token to
Token ID284
Feature activation+4.12e-01

Pos logit contributions
ashing+0.382
elly+0.380
ashes+0.379
ind+0.345
umpy+0.330
Neg logit contributions
 replied-0.502
lf-0.410
 responded-0.410
chief-0.406
 said-0.404
Token the
Token ID262
Feature activation+4.11e-01

Pos logit contributions
ashing+0.018
ashes+0.017
elly+0.016
ind+0.016
away+0.014
Neg logit contributions
 replied-0.022
lf-0.020
 responded-0.019
chief-0.019
man-0.018
Token pool
Token ID5933
Feature activation+9.37e-01

Pos logit contributions
ashing+0.020
ashes+0.020
elly+0.019
ind+0.019
umpy+0.017
Neg logit contributions
 replied-0.020
lf-0.017
 responded-0.016
chief-0.016
 wouldn-0.015
Token.
Token ID13
Feature activation+7.44e-01

Pos logit contributions
ashing+0.040
ashes+0.039
elly+0.038
ind+0.035
umpy+0.033
Neg logit contributions
 replied-0.052
lf-0.046
 responded-0.044
chief-0.044
 wouldn-0.042
Token They
Token ID1119
Feature activation+5.93e-01

Pos logit contributions
ashing+0.034
ashes+0.034
elly+0.033
ind+0.030
umpy+0.028
Neg logit contributions
 replied-0.039
lf-0.034
 responded-0.032
chief-0.032
 wouldn-0.031
Token Lucy
Token ID22162
Feature activation+2.05e+00

Pos logit contributions
ashing+0.097
ashes+0.093
elly+0.091
ind+0.084
and+0.083
Neg logit contributions
 replied-0.092
chief-0.081
lf-0.080
 responded-0.079
master-0.073
Token and
Token ID290
Feature activation-1.89e+00

Pos logit contributions
ashing+0.113
ashes+0.111
elly+0.109
ind+0.103
umpy+0.098
Neg logit contributions
 replied-0.087
lf-0.075
chief-0.069
 responded-0.069
man-0.064
Token her
Token ID607
Feature activation+7.88e-01

Pos logit contributions
 replied+0.089
lf+0.079
chief+0.079
 responded+0.076
master+0.073
Neg logit contributions
ashing-0.086
elly-0.083
ashes-0.082
ind-0.079
and-0.072
Token mom
Token ID1995
Feature activation+1.59e+00

Pos logit contributions
ashing+0.047
ashes+0.046
elly+0.045
ind+0.044
umpy+0.040
Neg logit contributions
 replied-0.028
lf-0.023
chief-0.022
 responded-0.022
 wouldn-0.020
Token went
Token ID1816
Feature activation+5.73e+00

Pos logit contributions
ashing+0.084
ashes+0.079
elly+0.078
ind+0.074
umpy+0.068
Neg logit contributions
 replied-0.069
lf-0.064
chief-0.061
 responded-0.056
master-0.054
Token back
Token ID736
Feature activation+8.99e+00

Pos logit contributions
elly+0.292
ashes+0.282
ashing+0.273
umpy+0.271
ind+0.259
Neg logit contributions
 replied-0.283
 said-0.221
man-0.221
lf-0.218
 nose-0.212
Token to
Token ID284
Feature activation+4.80e-01

Pos logit contributions
ashing+0.418
ashes+0.418
elly+0.408
umpy+0.401
ind+0.374
Neg logit contributions
 replied-0.462
 said-0.390
 rang-0.373
 answered-0.365
 responded-0.354
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
ashing+0.025
ashes+0.025
elly+0.024
ind+0.023
umpy+0.021
Neg logit contributions
 replied-0.021
lf-0.018
 responded-0.017
chief-0.017
 wouldn-0.017
Token park
Token ID3952
Feature activation+1.73e+00

Pos logit contributions
ashing+0.014
ashes+0.014
elly+0.014
ind+0.012
umpy+0.012
Neg logit contributions
 replied-0.013
lf-0.011
 responded-0.010
man-0.010
chief-0.010
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
ashing+0.073
ashes+0.072
elly+0.070
ind+0.064
umpy+0.059
Neg logit contributions
 replied-0.098
lf-0.086
 responded-0.082
chief-0.081
 wouldn-0.079
Token They
Token ID1119
Feature activation+8.29e-01

Pos logit contributions
ashes+0.060
ashing+0.059
elly+0.058
ind+0.056
umpy+0.051
Neg logit contributions
 replied-0.060
lf-0.049
 responded-0.048
chief-0.047
 rang-0.045
TokenL
Token ID43
Feature activation+1.42e+00

Pos logit contributions
 replied+0.019
lf+0.017
 responded+0.016
man+0.015
 said+0.015
Neg logit contributions
ashing-0.017
ashes-0.016
elly-0.016
ind-0.014
oky-0.013
Tokenily
Token ID813
Feature activation+9.06e-01

Pos logit contributions
ashing+0.060
ashes+0.059
elly+0.054
ind+0.052
oky+0.047
Neg logit contributions
 replied-0.081
lf-0.072
 responded-0.066
man-0.065
on-0.065
Token's
Token ID338
Feature activation-9.04e-01

Pos logit contributions
ashing+0.049
ashes+0.048
elly+0.046
ind+0.042
umpy+0.041
Neg logit contributions
lf-0.034
 replied-0.032
chief-0.031
man-0.029
master-0.028
Token dad
Token ID9955
Feature activation+8.07e-01

Pos logit contributions
 replied+0.033
lf+0.027
 responded+0.026
chief+0.025
 wouldn+0.024
Neg logit contributions
ashes-0.056
ashing-0.055
elly-0.053
ind-0.051
umpy-0.048
Token went
Token ID1816
Feature activation+5.27e+00

Pos logit contributions
ashing+0.047
ashes+0.045
elly+0.044
ind+0.042
grown+0.040
Neg logit contributions
lf-0.023
chief-0.022
 replied-0.021
flake-0.020
man-0.019
Token back
Token ID736
Feature activation+8.97e+00

Pos logit contributions
elly+0.267
ashes+0.260
ashing+0.257
ind+0.238
umpy+0.236
Neg logit contributions
 replied-0.259
man-0.210
 said-0.210
lf-0.207
 responded-0.198
Token to
Token ID284
Feature activation+7.33e-01

Pos logit contributions
ashing+0.457
ashes+0.449
elly+0.435
umpy+0.408
ind+0.405
Neg logit contributions
 replied-0.438
lf-0.364
 said-0.361
man-0.344
 answered-0.342
Token the
Token ID262
Feature activation+4.73e-01

Pos logit contributions
ashing+0.042
ashes+0.042
elly+0.041
ind+0.039
umpy+0.037
Neg logit contributions
 replied-0.031
lf-0.025
 responded-0.024
chief-0.023
 wouldn-0.022
Token bedroom
Token ID14043
Feature activation+3.80e-01

Pos logit contributions
ashing+0.027
ashes+0.026
elly+0.026
ind+0.023
umpy+0.023
Neg logit contributions
 replied-0.021
lf-0.018
man-0.016
 nose-0.016
 responded-0.016
Token and
Token ID290
Feature activation-8.75e-01

Pos logit contributions
ashing+0.017
ashes+0.017
elly+0.016
ind+0.015
umpy+0.014
Neg logit contributions
 replied-0.020
lf-0.018
 responded-0.017
chief-0.017
 wouldn-0.016
Token closed
Token ID4838
Feature activation-1.32e-02

Pos logit contributions
 replied+0.034
lf+0.031
chief+0.030
man+0.028
 responded+0.028
Neg logit contributions
ashing-0.048
ashes-0.046
elly-0.043
ind-0.041
and-0.040
Token the
Token ID262
Feature activation+1.30e+00

Pos logit contributions
ashing+0.034
ashes+0.034
ind+0.033
elly+0.032
away+0.030
Neg logit contributions
 replied-0.037
lf-0.033
 responded-0.031
chief-0.031
man-0.030
Token bow
Token ID9563
Feature activation-3.74e-01

Pos logit contributions
ashing+0.067
ashes+0.067
elly+0.065
ind+0.064
umpy+0.058
Neg logit contributions
 replied-0.058
lf-0.048
 responded-0.047
chief-0.047
 wouldn-0.046
Token and
Token ID290
Feature activation-3.21e-02

Pos logit contributions
 replied+0.021
lf+0.019
 responded+0.018
chief+0.018
 Steven+0.017
Neg logit contributions
ashing-0.015
ashes-0.014
elly-0.013
ind-0.012
away-0.012
Token ran
Token ID4966
Feature activation+4.96e+00

Pos logit contributions
lf+0.001
 replied+0.001
pron+0.001
chief+0.001
on+0.001
Neg logit contributions
ashes-0.001
ashing-0.001
away-0.001
ind-0.001
elly-0.001
Token off
Token ID572
Feature activation+1.93e+00

Pos logit contributions
elly+0.257
ashes+0.246
ashing+0.245
ind+0.232
oky+0.225
Neg logit contributions
 replied-0.244
chief-0.193
 responded-0.186
 said-0.183
lf-0.183
Token back
Token ID736
Feature activation+8.96e+00

Pos logit contributions
ashing+0.087
elly+0.087
ashes+0.087
ind+0.079
umpy+0.074
Neg logit contributions
 replied-0.105
lf-0.089
 responded-0.086
chief-0.085
man-0.082
Token home
Token ID1363
Feature activation+3.35e+00

Pos logit contributions
ashing+0.354
ashes+0.349
elly+0.345
ind+0.307
umpy+0.300
Neg logit contributions
 replied-0.539
lf-0.453
 responded-0.443
chief-0.441
 said-0.435
Token.
Token ID13
Feature activation+7.01e-01

Pos logit contributions
ashing+0.137
ashes+0.134
elly+0.128
ind+0.119
umpy+0.110
Neg logit contributions
 replied-0.193
lf-0.168
 responded-0.163
chief-0.160
 wouldn-0.157
Token
Token ID198
Feature activation-5.56e-01

Pos logit contributions
ashing+0.033
ashes+0.032
elly+0.032
ind+0.029
umpy+0.028
Neg logit contributions
 replied-0.035
lf-0.031
 responded-0.029
chief-0.029
 wouldn-0.028
Token
Token ID198
Feature activation-5.41e-01

Pos logit contributions
 replied+0.027
lf+0.023
 responded+0.022
chief+0.022
 wouldn+0.022
Neg logit contributions
ashing-0.027
ashes-0.027
elly-0.026
ind-0.025
umpy-0.023
TokenThe
Token ID464
Feature activation-4.04e+00

Pos logit contributions
 replied+0.027
lf+0.024
 responded+0.022
chief+0.022
man+0.021
Neg logit contributions
ashing-0.026
ashes-0.026
elly-0.025
ind-0.023
umpy-0.022
TokenHe
Token ID1544
Feature activation+4.15e-01

Pos logit contributions
ashes+0.006
ashing+0.006
elly+0.006
ind+0.005
oky+0.005
Neg logit contributions
 replied-0.007
lf-0.006
 responded-0.005
man-0.005
 Steven-0.005
Token decided
Token ID3066
Feature activation+2.33e+00

Pos logit contributions
ashes+0.023
ashing+0.022
elly+0.021
ind+0.021
umpy+0.019
Neg logit contributions
 replied-0.015
lf-0.014
chief-0.014
man-0.013
flake-0.013
Token he
Token ID339
Feature activation+3.79e+00

Pos logit contributions
ashing+0.072
ashes+0.069
elly+0.063
ind+0.058
away+0.056
Neg logit contributions
 replied-0.149
lf-0.139
 responded-0.131
chief-0.129
man-0.127
Token should
Token ID815
Feature activation-4.08e+00

Pos logit contributions
ashing+0.208
ashes+0.203
elly+0.194
ind+0.184
umpy+0.180
Neg logit contributions
 replied-0.152
lf-0.142
chief-0.130
 responded-0.120
 Steven-0.117
Token go
Token ID467
Feature activation+5.51e+00

Pos logit contributions
 replied+0.183
lf+0.156
 responded+0.154
chief+0.150
 chuckled+0.148
Neg logit contributions
ashes-0.196
ashing-0.193
elly-0.177
ind-0.176
umpy-0.167
Token back
Token ID736
Feature activation+8.95e+00

Pos logit contributions
elly+0.276
ashing+0.268
ashes+0.266
ind+0.245
oky+0.243
Neg logit contributions
 replied-0.268
lf-0.214
 said-0.212
chief-0.212
man-0.208
Token inside
Token ID2641
Feature activation+2.29e+00

Pos logit contributions
ashing+0.423
ashes+0.418
elly+0.402
ind+0.358
umpy+0.356
Neg logit contributions
 replied-0.464
lf-0.395
 said-0.377
man-0.371
 responded-0.371
Token and
Token ID290
Feature activation-7.02e-01

Pos logit contributions
ashing+0.116
ashes+0.115
elly+0.111
ind+0.104
umpy+0.098
Neg logit contributions
 replied-0.110
lf-0.095
 responded-0.089
chief-0.087
 wouldn-0.086
Token get
Token ID651
Feature activation+2.61e+00

Pos logit contributions
 replied+0.036
lf+0.032
 responded+0.031
chief+0.030
master+0.030
Neg logit contributions
ashing-0.030
ashes-0.029
elly-0.025
ind-0.025
away-0.024
Token something
Token ID1223
Feature activation-4.19e-01

Pos logit contributions
elly+0.138
ashes+0.138
ashing+0.137
ind+0.125
umpy+0.122
Neg logit contributions
 replied-0.121
 wouldn-0.098
lf-0.097
 responded-0.095
chief-0.095
Token else
Token ID2073
Feature activation+2.79e+00

Pos logit contributions
 replied+0.021
lf+0.018
 responded+0.017
 wouldn+0.017
chief+0.017
Neg logit contributions
ashing-0.020
ashes-0.020
elly-0.020
ind-0.019
umpy-0.018
Token mom
Token ID1995
Feature activation+1.89e+00

Pos logit contributions
 replied+0.069
lf+0.058
 responded+0.056
chief+0.056
 wouldn+0.053
Neg logit contributions
ashes-0.074
ashing-0.074
elly-0.071
ind-0.068
umpy-0.063
Token and
Token ID290
Feature activation-2.22e+00

Pos logit contributions
ashing+0.119
ashes+0.112
elly+0.108
ind+0.103
able+0.101
Neg logit contributions
 replied-0.056
lf-0.055
chief-0.050
 Steven-0.046
 responded-0.043
Token daughter
Token ID4957
Feature activation-1.97e+00

Pos logit contributions
 replied+0.101
lf+0.090
chief+0.089
 responded+0.083
master+0.081
Neg logit contributions
ashing-0.107
ashes-0.106
elly-0.095
ind-0.090
 youths-0.087
Token happily
Token ID18177
Feature activation+5.14e+00

Pos logit contributions
 replied+0.086
lf+0.077
chief+0.071
 responded+0.069
man+0.067
Neg logit contributions
ashing-0.105
ashes-0.104
elly-0.097
ind-0.091
umpy-0.087
Token went
Token ID1816
Feature activation+5.75e+00

Pos logit contributions
ashing+0.295
ashes+0.291
elly+0.278
ind+0.263
umpy+0.250
Neg logit contributions
 replied-0.208
lf-0.181
chief-0.166
 responded-0.163
man-0.155
Token back
Token ID736
Feature activation+8.95e+00

Pos logit contributions
elly+0.276
ashing+0.258
ashes+0.257
ind+0.243
umpy+0.241
Neg logit contributions
 replied-0.304
 said-0.242
chief-0.238
 responded-0.237
lf-0.237
Token inside
Token ID2641
Feature activation+2.05e+00

Pos logit contributions
elly+0.434
ashing+0.432
ashes+0.425
umpy+0.399
ind+0.389
Neg logit contributions
 replied-0.442
 said-0.364
lf-0.346
chief-0.344
 responded-0.341
Token.
Token ID13
Feature activation+8.69e-01

Pos logit contributions
ashing+0.090
ashes+0.090
elly+0.087
ind+0.080
umpy+0.075
Neg logit contributions
 replied-0.112
lf-0.097
 responded-0.093
chief-0.091
 wouldn-0.089
Token
Token ID220
Feature activation-3.20e+00

Pos logit contributions
ashing+0.045
ashes+0.044
elly+0.043
ind+0.040
umpy+0.038
Neg logit contributions
 replied-0.041
lf-0.035
 responded-0.033
chief-0.032
 wouldn-0.032
Token
Token ID198
Feature activation-5.24e-01

Pos logit contributions
 replied+0.154
lf+0.132
 responded+0.125
chief+0.123
 wouldn+0.120
Neg logit contributions
ashing-0.161
ashes-0.159
elly-0.155
ind-0.144
umpy-0.136
Token
Token ID198
Feature activation-5.38e-01

Pos logit contributions
 replied+0.026
lf+0.022
 responded+0.021
chief+0.021
 wouldn+0.020
Neg logit contributions
ashing-0.026
ashes-0.026
elly-0.025
ind-0.023
umpy-0.022
Token are
Token ID389
Feature activation+1.77e-02

Pos logit contributions
ashes+0.002
ashing+0.002
elly+0.002
ind+0.001
umpy+0.001
Neg logit contributions
 replied-0.002
lf-0.002
chief-0.002
 responded-0.002
man-0.002
Token so
Token ID523
Feature activation-2.82e-01

Pos logit contributions
ashing+0.001
ashes+0.001
elly+0.001
ind+0.001
umpy+0.001
Neg logit contributions
 replied-0.001
lf-0.001
 responded-0.001
chief-0.001
man-0.001
Token silly
Token ID14397
Feature activation+1.11e-01

Pos logit contributions
 replied+0.014
lf+0.012
 said+0.011
 responded+0.011
 rang+0.010
Neg logit contributions
ashes-0.014
ashing-0.014
elly-0.013
ind-0.012
and-0.012
Token!
Token ID0
Feature activation+8.41e-03

Pos logit contributions
ashing+0.006
ashes+0.006
elly+0.006
ind+0.006
umpy+0.005
Neg logit contributions
 replied-0.005
lf-0.004
 responded-0.004
chief-0.004
man-0.004
Token Why
Token ID4162
Feature activation-5.94e+00

Pos logit contributions
ashing+0.000
ashes+0.000
ind+0.000
elly+0.000
away+0.000
Neg logit contributions
 replied-0.000
 Steven-0.000
 responded-0.000
lf-0.000
chief-0.000
Token does
Token ID857
Feature activation-9.21e+00

Pos logit contributions
 replied+0.324
chief+0.319
lf+0.291
 responded+0.290
man+0.289
Neg logit contributions
ashes-0.206
ashing-0.190
elly-0.183
 hate-0.182
able-0.173
Token your
Token ID534
Feature activation-6.92e-01

Pos logit contributions
 replied+0.415
 responded+0.349
lf+0.338
 wouldn+0.336
flake+0.331
Neg logit contributions
ashes-0.454
ashing-0.438
elly-0.434
ind-0.419
umpy-0.388
Token dad
Token ID9955
Feature activation-6.70e-01

Pos logit contributions
 replied+0.033
lf+0.027
 responded+0.026
chief+0.026
 wouldn+0.025
Neg logit contributions
ashing-0.036
ashes-0.036
elly-0.034
ind-0.032
umpy-0.030
Token have
Token ID423
Feature activation-1.32e+00

Pos logit contributions
 replied+0.028
lf+0.026
chief+0.024
 responded+0.023
man+0.023
Neg logit contributions
ashing-0.035
ashes-0.034
elly-0.033
ind-0.032
umpy-0.030
Token a
Token ID257
Feature activation-2.40e+00

Pos logit contributions
 replied+0.064
lf+0.055
 responded+0.053
chief+0.052
man+0.050
Neg logit contributions
ashing-0.064
ashes-0.064
elly-0.062
ind-0.058
umpy-0.054
Token mustache
Token ID49303
Feature activation-5.08e+00

Pos logit contributions
 replied+0.101
lf+0.083
 responded+0.078
chief+0.077
 wouldn+0.074
Neg logit contributions
ashing-0.137
ashes-0.136
elly-0.133
ind-0.125
umpy-0.119
Token not
Token ID407
Feature activation-2.35e+00

Pos logit contributions
 replied+0.143
lf+0.140
chief+0.131
 responded+0.122
 Steven+0.120
Neg logit contributions
ashes-0.158
ashing-0.152
elly-0.145
ind-0.139
umpy-0.137
Token need
Token ID761
Feature activation+8.39e-01

Pos logit contributions
 replied+0.109
lf+0.096
chief+0.092
 responded+0.089
 wouldn+0.085
Neg logit contributions
ashes-0.120
ashing-0.119
elly-0.115
ind-0.110
umpy-0.104
Token water
Token ID1660
Feature activation-1.48e+00

Pos logit contributions
ashes+0.040
ashing+0.040
elly+0.039
ind+0.037
umpy+0.034
Neg logit contributions
 replied-0.041
lf-0.036
 responded-0.034
chief-0.033
man-0.033
Token?
Token ID30
Feature activation-2.14e+00

Pos logit contributions
 replied+0.070
lf+0.062
 responded+0.058
chief+0.057
 chuckled+0.054
Neg logit contributions
ashing-0.075
ashes-0.075
elly-0.073
ind-0.067
umpy-0.064
Token Why
Token ID4162
Feature activation-6.04e+00

Pos logit contributions
 replied+0.110
lf+0.100
 responded+0.092
chief+0.090
 wouldn+0.088
Neg logit contributions
ashes-0.098
elly-0.097
ashing-0.097
ind-0.086
umpy-0.084
Token does
Token ID857
Feature activation-9.17e+00

Pos logit contributions
chief+0.302
 replied+0.292
 responded+0.276
 chuckled+0.272
lf+0.268
Neg logit contributions
elly-0.214
ashes-0.207
ashing-0.198
ind-0.187
 hate-0.186
Token it
Token ID340
Feature activation-3.35e+00

Pos logit contributions
 replied+0.402
 responded+0.348
flake+0.338
 chuckled+0.338
man+0.335
Neg logit contributions
ashes-0.411
ind-0.405
 unnecessary-0.389
elly-0.386
away-0.382
Token have
Token ID423
Feature activation-1.43e+00

Pos logit contributions
 replied+0.150
lf+0.148
chief+0.135
 responded+0.128
 Steven+0.125
Neg logit contributions
ashes-0.167
ashing-0.161
elly-0.151
ind-0.145
umpy-0.143
Token sharp
Token ID7786
Feature activation-1.81e+00

Pos logit contributions
 replied+0.073
lf+0.065
chief+0.060
 responded+0.060
man+0.058
Neg logit contributions
ashes-0.067
ashing-0.065
elly-0.064
ind-0.060
umpy-0.055
Token points
Token ID2173
Feature activation-3.90e-02

Pos logit contributions
 replied+0.083
lf+0.072
 responded+0.067
man+0.064
chief+0.064
Neg logit contributions
ashes-0.095
ashing-0.094
elly-0.092
ind-0.086
umpy-0.080
Token?"
Token ID1701
Feature activation-1.83e+00

Pos logit contributions
 replied+0.002
lf+0.002
chief+0.002
 responded+0.002
man+0.002
Neg logit contributions
ashing-0.002
ashes-0.002
elly-0.002
ind-0.002
umpy-0.001
Token in
Token ID287
Feature activation+4.29e-01

Pos logit contributions
ashing+0.084
ashes+0.083
elly+0.079
ind+0.074
umpy+0.070
Neg logit contributions
 replied-0.074
lf-0.061
 responded-0.058
 wouldn-0.057
man-0.056
Token the
Token ID262
Feature activation-5.94e-01

Pos logit contributions
elly+0.022
ashes+0.022
ashing+0.022
ind+0.021
umpy+0.020
Neg logit contributions
 replied-0.019
lf-0.016
 responded-0.016
chief-0.015
 wouldn-0.015
Token desert
Token ID10326
Feature activation-4.88e-01

Pos logit contributions
 replied+0.028
lf+0.022
 responded+0.022
 nose+0.022
chief+0.022
Neg logit contributions
ashing-0.031
ashes-0.031
elly-0.029
ind-0.027
and-0.026
Token?
Token ID30
Feature activation-1.60e+00

Pos logit contributions
 replied+0.023
lf+0.020
chief+0.019
 responded+0.019
 wouldn+0.018
Neg logit contributions
ashing-0.024
ashes-0.024
elly-0.024
ind-0.022
umpy-0.021
Token Why
Token ID4162
Feature activation-5.88e+00

Pos logit contributions
 replied+0.080
lf+0.071
 responded+0.066
chief+0.065
 wouldn+0.063
Neg logit contributions
ashes-0.076
ashing-0.076
elly-0.075
ind-0.068
umpy-0.066
Token does
Token ID857
Feature activation-9.01e+00

Pos logit contributions
chief+0.290
 replied+0.286
 responded+0.270
 chuckled+0.266
lf+0.261
Neg logit contributions
elly-0.219
ashes-0.213
ashing-0.206
ind-0.194
umpy-0.183
Token it
Token ID340
Feature activation-3.21e+00

Pos logit contributions
 replied+0.406
 responded+0.351
 chuckled+0.342
flake+0.341
lf+0.339
Neg logit contributions
ashes-0.394
ind-0.388
elly-0.374
away-0.372
 unnecessary-0.362
Token not
Token ID407
Feature activation-2.35e+00

Pos logit contributions
 replied+0.143
lf+0.140
chief+0.131
 responded+0.122
 Steven+0.120
Neg logit contributions
ashes-0.158
ashing-0.152
elly-0.145
ind-0.139
umpy-0.137
Token need
Token ID761
Feature activation+8.39e-01

Pos logit contributions
 replied+0.109
lf+0.096
chief+0.092
 responded+0.089
 wouldn+0.085
Neg logit contributions
ashes-0.120
ashing-0.119
elly-0.115
ind-0.110
umpy-0.104
Token water
Token ID1660
Feature activation-1.48e+00

Pos logit contributions
ashes+0.040
ashing+0.040
elly+0.039
ind+0.037
umpy+0.034
Neg logit contributions
 replied-0.041
lf-0.036
 responded-0.034
chief-0.033
man-0.033
Token?
Token ID30
Feature activation-2.14e+00

Pos logit contributions
 replied+0.070
lf+0.062
 responded+0.058
chief+0.057
 chuckled+0.054
Neg logit contributions
ashing-0.075
ashes-0.075
elly-0.073
ind-0.067
umpy-0.064
Token is
Token ID318
Feature activation-8.35e-01

Pos logit contributions
 replied+0.113
lf+0.101
chief+0.096
 responded+0.092
man+0.087
Neg logit contributions
ashes-0.128
elly-0.125
ashing-0.125
ind-0.117
umpy-0.110
Token that
Token ID326
Feature activation-2.62e+00

Pos logit contributions
 replied+0.036
lf+0.030
 responded+0.029
chief+0.029
 wouldn+0.027
Neg logit contributions
ashing-0.045
ashes-0.044
elly-0.044
ind-0.042
umpy-0.038
Token man
Token ID582
Feature activation-1.52e+00

Pos logit contributions
lf+0.132
chief+0.128
 replied+0.125
 wouldn+0.117
 chuckled+0.115
Neg logit contributions
elly-0.106
ashes-0.101
ashing-0.100
umpy-0.097
away-0.091
Token?
Token ID30
Feature activation-1.51e+00

Pos logit contributions
 replied+0.075
chief+0.066
lf+0.065
 responded+0.064
 wouldn+0.060
Neg logit contributions
ashing-0.070
elly-0.067
ashes-0.066
ind-0.064
umpy-0.060
Token Why
Token ID4162
Feature activation-5.85e+00

Pos logit contributions
 replied+0.080
lf+0.071
 responded+0.067
chief+0.066
 wouldn+0.064
Neg logit contributions
ashing-0.068
ashes-0.067
elly-0.066
ind-0.060
umpy-0.058
Token does
Token ID857
Feature activation-9.01e+00

Pos logit contributions
 replied+0.321
chief+0.320
lf+0.301
 responded+0.292
 chuckled+0.286
Neg logit contributions
ashes-0.181
elly-0.178
ashing-0.167
umpy-0.162
 hate-0.157
Token he
Token ID339
Feature activation+8.45e-02

Pos logit contributions
 replied+0.442
 responded+0.391
flake+0.391
 wouldn+0.383
chief+0.371
Neg logit contributions
ind-0.357
ashes-0.348
elly-0.329
away-0.329
umpy-0.326
Token have
Token ID423
Feature activation-1.41e+00

Pos logit contributions
ashes+0.004
ashing+0.004
elly+0.004
ind+0.004
umpy+0.004
Neg logit contributions
 replied-0.004
lf-0.003
chief-0.003
 responded-0.003
man-0.003
Token a
Token ID257
Feature activation-2.39e+00

Pos logit contributions
 replied+0.069
lf+0.062
chief+0.058
 responded+0.057
man+0.057
Neg logit contributions
ashes-0.065
elly-0.062
ashing-0.062
ind-0.060
umpy-0.055
Token whistle
Token ID16121
Feature activation-3.95e+00

Pos logit contributions
lf+0.095
 replied+0.088
chief+0.078
 wouldn+0.078
 responded+0.074
Neg logit contributions
ashing-0.124
ashes-0.122
elly-0.116
ind-0.115
away-0.110
Token?"
Token ID1701
Feature activation-1.14e+00

Pos logit contributions
 replied+0.199
chief+0.192
lf+0.186
 responded+0.177
 chuckled+0.171
Neg logit contributions
ashing-0.168
ashes-0.167
elly-0.163
umpy-0.150
away-0.145
Token kids
Token ID3988
Feature activation+1.30e-01

Pos logit contributions
 replied+0.003
lf+0.003
 responded+0.002
chief+0.002
 wouldn+0.002
Neg logit contributions
ashing-0.003
ashes-0.003
elly-0.003
ind-0.002
umpy-0.002
Token so
Token ID523
Feature activation-1.21e+00

Pos logit contributions
ashing+0.006
ashes+0.006
elly+0.006
ind+0.005
umpy+0.005
Neg logit contributions
 replied-0.007
lf-0.006
 responded-0.005
chief-0.005
 wouldn-0.005
Token mean
Token ID1612
Feature activation+8.75e-01

Pos logit contributions
 replied+0.065
lf+0.055
 responded+0.054
 wouldn+0.052
 said+0.051
Neg logit contributions
ashing-0.054
ashes-0.053
elly-0.052
ind-0.049
and-0.044
Token?
Token ID30
Feature activation-2.32e+00

Pos logit contributions
ashing+0.039
elly+0.039
ashes+0.039
ind+0.034
umpy+0.033
Neg logit contributions
 replied-0.046
lf-0.041
chief-0.040
 responded-0.039
 wouldn-0.037
Token Why
Token ID4162
Feature activation-6.23e+00

Pos logit contributions
 replied+0.115
lf+0.094
 responded+0.093
chief+0.090
man+0.088
Neg logit contributions
ashing-0.115
ashes-0.113
elly-0.108
ind-0.105
umpy-0.094
Token did
Token ID750
Feature activation-8.93e+00

Pos logit contributions
chief+0.261
 replied+0.257
 responded+0.223
lf+0.221
man+0.209
Neg logit contributions
ashing-0.288
ashes-0.286
elly-0.280
able-0.262
 hate-0.258
Token they
Token ID484
Feature activation-3.16e-01

Pos logit contributions
 replied+0.473
 responded+0.405
lf+0.404
chief+0.402
flake+0.388
Neg logit contributions
ashes-0.374
ashing-0.365
elly-0.358
ind-0.349
umpy-0.312
Token try
Token ID1949
Feature activation-3.98e+00

Pos logit contributions
 replied+0.015
lf+0.013
chief+0.013
 responded+0.012
man+0.012
Neg logit contributions
ashes-0.015
ashing-0.015
elly-0.014
ind-0.014
umpy-0.013
Token to
Token ID284
Feature activation-2.36e+00

Pos logit contributions
 replied+0.211
lf+0.182
 responded+0.174
chief+0.171
 wouldn+0.167
Neg logit contributions
ashing-0.182
ashes-0.180
elly-0.175
ind-0.163
umpy-0.151
Token be
Token ID307
Feature activation+1.10e+00

Pos logit contributions
 replied+0.109
lf+0.093
chief+0.092
 responded+0.091
 wouldn+0.089
Neg logit contributions
ashes-0.117
ashing-0.111
ind-0.111
elly-0.106
away-0.102
Token something
Token ID1223
Feature activation-1.84e+00

Pos logit contributions
ashing+0.047
ashes+0.047
elly+0.045
ind+0.043
umpy+0.040
Neg logit contributions
 replied-0.059
lf-0.052
chief-0.051
 responded-0.049
 wouldn-0.047
Token a
Token ID257
Feature activation-2.71e+00

Pos logit contributions
 replied+0.157
 responded+0.140
lf+0.137
chief+0.133
 wouldn+0.133
Neg logit contributions
ashes-0.134
ind-0.130
elly-0.128
ashing-0.128
and-0.115
Token big
Token ID1263
Feature activation+5.72e-01

Pos logit contributions
 replied+0.142
lf+0.124
 responded+0.120
chief+0.120
 wouldn+0.114
Neg logit contributions
ashing-0.123
ashes-0.123
elly-0.119
ind-0.113
umpy-0.103
Token mess
Token ID2085
Feature activation-2.74e-01

Pos logit contributions
ashes+0.027
ashing+0.027
elly+0.027
ind+0.025
umpy+0.023
Neg logit contributions
 replied-0.028
lf-0.024
 responded-0.024
chief-0.023
 wouldn-0.023
Token.
Token ID13
Feature activation+5.61e-01

Pos logit contributions
 replied+0.013
lf+0.012
man+0.011
 said+0.011
 wouldn+0.010
Neg logit contributions
ashing-0.014
ashes-0.014
elly-0.013
ind-0.012
oky-0.011
Token Why
Token ID4162
Feature activation-6.21e+00

Pos logit contributions
ashing+0.027
ashes+0.027
elly+0.026
ind+0.024
umpy+0.023
Neg logit contributions
 replied-0.028
lf-0.024
 responded-0.023
chief-0.023
 wouldn-0.022
Token did
Token ID750
Feature activation-8.89e+00

Pos logit contributions
chief+0.294
 replied+0.292
 responded+0.259
lf+0.256
 Steven+0.249
Neg logit contributions
ashes-0.273
ashing-0.264
elly-0.259
 hate-0.237
able-0.231
Token you
Token ID345
Feature activation+5.48e-01

Pos logit contributions
 replied+0.566
 responded+0.518
flake+0.507
man+0.504
 chuckled+0.492
Neg logit contributions
ashes-0.248
elly-0.237
ind-0.234
ashing-0.227
 unnecessary-0.186
Token do
Token ID466
Feature activation-5.24e+00

Pos logit contributions
ashes+0.025
ashing+0.024
elly+0.022
ind+0.021
umpy+0.020
Neg logit contributions
 replied-0.027
lf-0.025
chief-0.024
 responded-0.024
master-0.023
Token that
Token ID326
Feature activation-3.31e+00

Pos logit contributions
 replied+0.248
 responded+0.223
chief+0.206
 chuckled+0.202
man+0.201
Neg logit contributions
ashes-0.245
ashing-0.240
elly-0.234
ind-0.231
away-0.205
Token?"
Token ID1701
Feature activation-2.06e+00

Pos logit contributions
 replied+0.165
chief+0.159
 responded+0.152
 chuckled+0.149
 wouldn+0.146
Neg logit contributions
elly-0.138
ashing-0.134
ashes-0.130
ind-0.123
umpy-0.118
Token her
Token ID607
Feature activation-1.92e-01

Pos logit contributions
 replied+0.088
lf+0.074
 responded+0.070
chief+0.069
 wouldn+0.066
Neg logit contributions
ashing-0.114
ashes-0.113
elly-0.111
ind-0.104
umpy-0.099
Token that
Token ID326
Feature activation-2.93e+00

Pos logit contributions
 replied+0.221
 responded+0.193
lf+0.183
man+0.178
master+0.174
Neg logit contributions
ashes-0.209
ashing-0.204
ind-0.200
elly-0.198
away-0.183
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
 replied+0.155
 chuckled+0.139
 responded+0.138
chief+0.138
lf+0.135
Neg logit contributions
elly-0.126
ashes-0.116
ashing-0.115
umpy-0.109
ind-0.104
Token mom
Token ID1995
Feature activation+4.77e-01

Pos logit contributions
 replied+0.011
 responded+0.010
chief+0.010
 rang+0.010
lf+0.009
Neg logit contributions
elly-0.008
ind-0.007
ashing-0.007
ashes-0.007
umpy-0.007
Token?
Token ID30
Feature activation-2.14e+00

Pos logit contributions
elly+0.021
ashing+0.021
ashes+0.019
ind+0.019
umpy+0.018
Neg logit contributions
 replied-0.025
chief-0.022
lf-0.022
 responded-0.021
 chuckled-0.021
Token Why
Token ID4162
Feature activation-6.17e+00

Pos logit contributions
 replied+0.121
lf+0.106
 responded+0.102
chief+0.100
 wouldn+0.098
Neg logit contributions
ashing-0.090
ashes-0.089
elly-0.086
ind-0.079
umpy-0.073
Token did
Token ID750
Feature activation-8.85e+00

Pos logit contributions
 replied+0.323
chief+0.320
 responded+0.295
 chuckled+0.289
man+0.284
Neg logit contributions
elly-0.217
ashes-0.212
ashing-0.202
 hate-0.190
able-0.187
Token you
Token ID345
Feature activation+5.43e-01

Pos logit contributions
 replied+0.548
 responded+0.501
flake+0.499
man+0.492
 chuckled+0.475
Neg logit contributions
ind-0.235
ashes-0.235
elly-0.229
ashing-0.211
 unnecessary-0.209
Token stop
Token ID2245
Feature activation+2.61e-01

Pos logit contributions
ashes+0.024
ashing+0.023
ind+0.022
elly+0.021
umpy+0.021
Neg logit contributions
 replied-0.026
lf-0.024
chief-0.023
 responded-0.023
master-0.022
Token me
Token ID502
Feature activation+4.45e+00

Pos logit contributions
ashing+0.013
ashes+0.013
elly+0.013
ind+0.012
umpy+0.011
Neg logit contributions
 replied-0.012
lf-0.011
 responded-0.010
chief-0.010
 wouldn-0.010
Token from
Token ID422
Feature activation-9.10e-01

Pos logit contributions
ashing+0.216
ashes+0.212
elly+0.209
ind+0.190
umpy+0.185
Neg logit contributions
 replied-0.220
lf-0.195
 responded-0.186
chief-0.186
 wouldn-0.176
Token eating
Token ID6600
Feature activation-1.46e-01

Pos logit contributions
 replied+0.043
 responded+0.035
lf+0.034
 wouldn+0.033
chief+0.033
Neg logit contributions
ashing-0.046
ashes-0.045
elly-0.045
ind-0.044
away-0.039
Token,
Token ID11
Feature activation+3.43e-01

Pos logit contributions
 replied+0.277
lf+0.211
 responded+0.207
chief+0.201
 said+0.195
Neg logit contributions
ashing-0.410
ashes-0.403
elly-0.395
ind-0.385
umpy-0.358
Token "
Token ID366
Feature activation+2.74e+00

Pos logit contributions
ashing+0.019
ashes+0.018
elly+0.018
ind+0.016
umpy+0.016
Neg logit contributions
 replied-0.014
lf-0.012
 responded-0.011
chief-0.011
master-0.011
TokenWhat
Token ID2061
Feature activation-3.81e+00

Pos logit contributions
ashing+0.127
ashes+0.125
elly+0.121
ind+0.113
umpy+0.105
Neg logit contributions
 replied-0.146
lf-0.126
 responded-0.121
chief-0.118
 wouldn-0.116
Token?
Token ID30
Feature activation-1.72e+00

Pos logit contributions
 replied+0.160
chief+0.133
lf+0.132
 responded+0.127
man+0.119
Neg logit contributions
ashing-0.213
elly-0.212
ashes-0.211
ind-0.191
umpy-0.188
Token Why
Token ID4162
Feature activation-6.07e+00

Pos logit contributions
 replied+0.086
lf+0.071
chief+0.069
 responded+0.069
 wouldn+0.067
Neg logit contributions
ashing-0.084
ashes-0.083
elly-0.079
ind-0.077
umpy-0.070
Token did
Token ID750
Feature activation-8.83e+00

Pos logit contributions
 replied+0.326
chief+0.325
 responded+0.295
man+0.288
master+0.286
Neg logit contributions
ashing-0.226
ashes-0.220
elly-0.210
 hate-0.192
umpy-0.187
Token she
Token ID673
Feature activation+5.67e-01

Pos logit contributions
 replied+0.524
 responded+0.465
flake+0.460
man+0.455
chief+0.446
Neg logit contributions
ashes-0.285
elly-0.271
ashing-0.267
ind-0.266
 unnecessary-0.227
Token quit
Token ID11238
Feature activation+5.96e-02

Pos logit contributions
ashes+0.029
ashing+0.028
elly+0.027
ind+0.026
umpy+0.025
Neg logit contributions
 replied-0.026
lf-0.023
chief-0.022
 responded-0.021
man-0.020
Token?
Token ID30
Feature activation-1.95e+00

Pos logit contributions
ashing+0.003
ashes+0.003
elly+0.003
ind+0.003
umpy+0.003
Neg logit contributions
 replied-0.003
lf-0.002
 responded-0.002
chief-0.002
 wouldn-0.002
Token What
Token ID1867
Feature activation-4.56e+00

Pos logit contributions
 replied+0.103
lf+0.087
 responded+0.084
chief+0.083
 wouldn+0.082
Neg logit contributions
ashing-0.092
ashes-0.090
elly-0.085
ind-0.082
umpy-0.074
Token happened
Token ID3022
Feature activation-1.41e+00

Pos logit contributions
 replied+0.221
chief+0.212
lf+0.187
 responded+0.185
man+0.176
Neg logit contributions
elly-0.212
ashing-0.208
ashes-0.204
and-0.188
umpy-0.183
Token are
Token ID389
Feature activation-3.05e+00

Pos logit contributions
 replied+0.318
chief+0.300
 responded+0.278
lf+0.276
man+0.271
Neg logit contributions
ashes-0.253
ashing-0.249
elly-0.242
umpy-0.211
ind-0.202
Token you
Token ID345
Feature activation+8.98e-01

Pos logit contributions
 replied+0.158
lf+0.135
 responded+0.130
chief+0.128
 wouldn+0.124
Neg logit contributions
ashing-0.144
ashes-0.142
elly-0.138
ind-0.129
umpy-0.120
Token fighting
Token ID4330
Feature activation-1.83e+00

Pos logit contributions
ashing+0.039
ashes+0.038
elly+0.037
ind+0.034
umpy+0.032
Neg logit contributions
 replied-0.051
lf-0.044
 responded-0.043
chief-0.042
 wouldn-0.041
Token?
Token ID30
Feature activation-1.98e+00

Pos logit contributions
 replied+0.093
lf+0.080
chief+0.078
 responded+0.077
 wouldn+0.073
Neg logit contributions
ashing-0.088
ashes-0.087
elly-0.085
ind-0.079
umpy-0.074
Token Why
Token ID4162
Feature activation-6.12e+00

Pos logit contributions
 replied+0.092
lf+0.076
chief+0.073
 responded+0.072
on+0.071
Neg logit contributions
ashing-0.104
ashes-0.102
elly-0.095
ind-0.094
away-0.088
Token did
Token ID750
Feature activation-8.83e+00

Pos logit contributions
 replied+0.337
chief+0.334
 responded+0.302
man+0.292
lf+0.292
Neg logit contributions
ashes-0.232
ashing-0.227
elly-0.223
umpy-0.192
 hate-0.185
Token you
Token ID345
Feature activation+7.13e-01

Pos logit contributions
 replied+0.551
 responded+0.498
flake+0.490
man+0.479
chief+0.475
Neg logit contributions
ashes-0.269
ashing-0.252
ind-0.250
elly-0.250
 youths-0.201
Token waste
Token ID7030
Feature activation-1.33e-01

Pos logit contributions
ashing+0.033
ashes+0.033
elly+0.030
ind+0.028
umpy+0.028
Neg logit contributions
 replied-0.035
chief-0.032
lf-0.031
 responded-0.030
master-0.029
Token the
Token ID262
Feature activation-1.21e+00

Pos logit contributions
 replied+0.006
lf+0.005
 responded+0.005
chief+0.005
 wouldn+0.005
Neg logit contributions
ashing-0.007
ashes-0.007
elly-0.007
ind-0.006
umpy-0.006
Token food
Token ID2057
Feature activation-7.69e-01

Pos logit contributions
 replied+0.051
lf+0.043
 responded+0.040
chief+0.039
 wouldn+0.038
Neg logit contributions
ashing-0.068
ashes-0.066
elly-0.065
ind-0.062
umpy-0.058
Token?"
Token ID1701
Feature activation-1.38e+00

Pos logit contributions
 replied+0.039
lf+0.033
 responded+0.033
chief+0.032
 wouldn+0.031
Neg logit contributions
ashing-0.036
ashes-0.036
elly-0.035
ind-0.032
umpy-0.031
Token coats
Token ID30720
Feature activation-1.23e+00

Pos logit contributions
ashes+0.067
ashing+0.066
elly+0.063
ind+0.061
umpy+0.055
Neg logit contributions
 replied-0.072
lf-0.061
 responded-0.059
man-0.059
chief-0.058
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 replied+0.070
lf+0.064
 responded+0.062
master+0.059
chief+0.058
Neg logit contributions
ashing-0.046
ashes-0.045
elly-0.041
ind-0.038
 outside-0.034
Token shoes
Token ID10012
Feature activation-1.04e+00

Pos logit contributions
ashes+0.082
ashing+0.081
elly+0.078
ind+0.073
umpy+0.067
Neg logit contributions
 replied-0.085
lf-0.073
 responded-0.070
chief-0.068
man-0.068
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
 replied+0.056
lf+0.049
 responded+0.048
chief+0.046
 wouldn+0.045
Neg logit contributions
ashing-0.045
ashes-0.045
elly-0.043
ind-0.040
umpy-0.036
Token and
Token ID290
Feature activation-1.80e+00

Pos logit contributions
ashes+0.058
ashing+0.058
elly+0.055
ind+0.052
umpy+0.047
Neg logit contributions
 replied-0.066
lf-0.057
 responded-0.055
chief-0.053
man-0.053
Token scar
Token ID10153
Feature activation-8.82e+00

Pos logit contributions
 replied+0.088
lf+0.077
chief+0.072
 responded+0.072
man+0.071
Neg logit contributions
ashes-0.087
ashing-0.086
elly-0.083
ind-0.077
umpy-0.072
Tokenves
Token ID1158
Feature activation-2.25e+00

Pos logit contributions
 replied+0.292
 responded+0.231
 Steven+0.201
lf+0.197
 chuckled+0.196
Neg logit contributions
ashes-0.572
ashing-0.565
ind-0.523
elly-0.521
and-0.489
Token.
Token ID13
Feature activation+4.65e-01

Pos logit contributions
 replied+0.139
lf+0.128
 responded+0.121
chief+0.120
 chuckled+0.115
Neg logit contributions
ashes-0.080
ashing-0.079
elly-0.074
ind-0.065
umpy-0.059
Token
Token ID198
Feature activation-3.30e-01

Pos logit contributions
ashing+0.024
ind+0.023
elly+0.023
ashes+0.022
umpy+0.020
Neg logit contributions
 replied-0.023
 responded-0.018
lf-0.018
 said-0.018
chief-0.017
Token
Token ID198
Feature activation-1.91e-01

Pos logit contributions
 replied+0.013
lf+0.011
 said+0.010
 responded+0.009
man+0.009
Neg logit contributions
ashing-0.020
ashes-0.020
elly-0.019
ind-0.018
umpy-0.017
TokenTom
Token ID13787
Feature activation-1.10e+00

Pos logit contributions
 replied+0.008
lf+0.007
 said+0.006
 responded+0.006
 rang+0.006
Neg logit contributions
ashing-0.011
ashes-0.011
elly-0.011
ind-0.010
 unnecessary-0.010
Token animals
Token ID4695
Feature activation+3.69e-01

Pos logit contributions
ashes+0.016
ashing+0.015
elly+0.015
ind+0.015
away+0.013
Neg logit contributions
 replied-0.017
lf-0.015
 responded-0.015
chief-0.014
 wouldn-0.014
Token,
Token ID11
Feature activation+1.65e+00

Pos logit contributions
ashing+0.015
ashes+0.015
elly+0.014
ind+0.013
umpy+0.012
Neg logit contributions
 replied-0.021
lf-0.019
 responded-0.018
chief-0.018
 wouldn-0.017
Token hats
Token ID23910
Feature activation-2.89e+00

Pos logit contributions
ashes+0.075
ashing+0.074
elly+0.070
ind+0.064
grown+0.059
Neg logit contributions
 replied-0.086
lf-0.077
chief-0.073
 responded-0.071
man-0.069
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
 replied+0.162
lf+0.149
 responded+0.139
chief+0.136
 chuckled+0.130
Neg logit contributions
ashing-0.117
ashes-0.115
elly-0.107
ind-0.096
umpy-0.090
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
ashes+0.046
ashing+0.045
elly+0.043
ind+0.041
umpy+0.036
Neg logit contributions
 replied-0.057
lf-0.049
 responded-0.048
chief-0.046
 wouldn-0.045
Token scar
Token ID10153
Feature activation-8.78e+00

Pos logit contributions
 replied+0.083
lf+0.073
chief+0.069
 responded+0.068
man+0.066
Neg logit contributions
ashes-0.076
ashing-0.072
elly-0.070
ind-0.066
grown-0.060
Tokenves
Token ID1158
Feature activation-2.60e+00

Pos logit contributions
 replied+0.290
 responded+0.227
lf+0.198
chief+0.197
 Steven+0.195
Neg logit contributions
ashes-0.564
ashing-0.556
ind-0.526
elly-0.523
and-0.493
Token as
Token ID355
Feature activation+1.72e+00

Pos logit contributions
 replied+0.147
lf+0.134
 responded+0.125
chief+0.124
 Steven+0.117
Neg logit contributions
ashes-0.106
ashing-0.105
elly-0.098
ind-0.088
umpy-0.082
Token props
Token ID25744
Feature activation-1.37e+00

Pos logit contributions
ashing+0.089
ashes+0.089
elly+0.086
ind+0.081
umpy+0.075
Neg logit contributions
 replied-0.081
lf-0.067
 responded-0.065
chief-0.064
 wouldn-0.060
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
 replied+0.074
lf+0.064
 responded+0.061
chief+0.061
 wouldn+0.058
Neg logit contributions
ashing-0.061
ashes-0.061
elly-0.058
ind-0.054
umpy-0.050
Token They
Token ID1119
Feature activation-1.38e-01

Pos logit contributions
ashing+0.010
ashes+0.010
elly+0.010
ind+0.009
umpy+0.008
Neg logit contributions
 replied-0.015
lf-0.013
 responded-0.013
chief-0.013
 Steven-0.012
Token swe
Token ID3490
Feature activation+1.63e+00

Pos logit contributions
 replied+0.167
lf+0.150
 responded+0.143
 rang+0.136
 wouldn+0.134
Neg logit contributions
ashes-0.160
ashing-0.153
elly-0.148
ind-0.146
and-0.131
Tokenaters
Token ID8605
Feature activation-1.23e-01

Pos logit contributions
ashing+0.075
ashes+0.072
elly+0.068
umpy+0.066
away+0.064
Neg logit contributions
 replied-0.082
chief-0.069
 nose-0.065
 responded-0.064
lf-0.062
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 replied+0.007
lf+0.006
 responded+0.006
chief+0.006
 rang+0.005
Neg logit contributions
ashes-0.005
ashing-0.005
elly-0.005
ind-0.005
umpy-0.004
Token hats
Token ID23910
Feature activation-3.44e+00

Pos logit contributions
 replied+0.053
lf+0.049
flake+0.046
man+0.046
 responded+0.046
Neg logit contributions
ashes-0.052
ashing-0.044
elly-0.042
ind-0.039
 yards-0.039
Token and
Token ID290
Feature activation-1.84e+00

Pos logit contributions
 replied+0.199
lf+0.181
 responded+0.171
chief+0.168
 Steven+0.160
Neg logit contributions
ashing-0.137
ashes-0.135
elly-0.127
ind-0.114
umpy-0.108
Token scar
Token ID10153
Feature activation-8.78e+00

Pos logit contributions
 replied+0.089
lf+0.083
man+0.077
flake+0.075
 responded+0.075
Neg logit contributions
ashes-0.085
ashing-0.075
elly-0.072
ind-0.068
 yards-0.064
Tokenves
Token ID1158
Feature activation-2.28e+00

Pos logit contributions
 replied+0.302
 responded+0.239
lf+0.224
 Steven+0.213
chief+0.209
Neg logit contributions
ashes-0.556
ashing-0.550
elly-0.519
ind-0.504
and-0.485
Token from
Token ID422
Feature activation-1.30e+00

Pos logit contributions
 replied+0.139
lf+0.128
 responded+0.119
chief+0.118
 Steven+0.114
Neg logit contributions
ashes-0.084
ashing-0.083
elly-0.077
ind-0.066
and-0.063
Token the
Token ID262
Feature activation-1.16e+00

Pos logit contributions
 replied+0.054
lf+0.045
 responded+0.042
chief+0.041
 wouldn+0.040
Neg logit contributions
ashing-0.074
ashes-0.073
elly-0.072
ind-0.068
umpy-0.064
Token wool
Token ID25749
Feature activation-5.07e+00

Pos logit contributions
 replied+0.058
lf+0.050
 responded+0.047
chief+0.045
 rang+0.045
Neg logit contributions
ashes-0.057
ashing-0.057
elly-0.054
ind-0.053
umpy-0.048
Token.
Token ID13
Feature activation+1.47e-03

Pos logit contributions
 replied+0.280
lf+0.246
 responded+0.244
 chuckled+0.231
 rang+0.231
Neg logit contributions
ashing-0.204
ashes-0.202
elly-0.192
ind-0.181
away-0.158
Token this
Token ID428
Feature activation-2.81e+00

Pos logit contributions
elly+0.055
ashes+0.055
ashing+0.048
grown+0.047
oky+0.046
Neg logit contributions
 replied-0.083
 responded-0.073
man-0.072
flake-0.070
chief-0.070
Token is
Token ID318
Feature activation-9.49e-01

Pos logit contributions
 replied+0.120
 responded+0.099
 chuckled+0.098
lf+0.097
chief+0.095
Neg logit contributions
ashes-0.153
elly-0.150
ashing-0.145
ind-0.134
umpy-0.130
Token gross
Token ID10319
Feature activation+2.65e-01

Pos logit contributions
 replied+0.049
lf+0.043
 responded+0.043
chief+0.041
 chuckled+0.039
Neg logit contributions
ashes-0.041
ashing-0.040
elly-0.039
ind-0.038
oky-0.035
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
ashing+0.014
ashes+0.014
elly+0.014
ind+0.013
umpy+0.012
Neg logit contributions
 replied-0.012
lf-0.010
 responded-0.009
chief-0.009
 wouldn-0.009
Token Why
Token ID4162
Feature activation-5.94e+00

Pos logit contributions
ashing+0.040
ashes+0.039
elly+0.038
ind+0.036
umpy+0.034
Neg logit contributions
 replied-0.036
lf-0.030
 responded-0.029
chief-0.028
 wouldn-0.027
Token did
Token ID750
Feature activation-8.74e+00

Pos logit contributions
 replied+0.304
chief+0.294
 responded+0.271
man+0.263
lf+0.263
Neg logit contributions
ashes-0.245
ashing-0.236
elly-0.233
 hate-0.209
umpy-0.202
Token he
Token ID339
Feature activation-1.45e-01

Pos logit contributions
 replied+0.503
 responded+0.450
flake+0.443
man+0.437
lf+0.427
Neg logit contributions
ashes-0.300
elly-0.274
ashing-0.271
ind-0.265
 unnecessary-0.241
Token do
Token ID466
Feature activation-5.22e+00

Pos logit contributions
 replied+0.007
lf+0.006
 responded+0.006
chief+0.006
man+0.005
Neg logit contributions
ashes-0.007
ashing-0.007
elly-0.007
ind-0.007
umpy-0.006
Token this
Token ID428
Feature activation-2.70e+00

Pos logit contributions
 replied+0.240
 responded+0.209
 chuckled+0.188
lf+0.187
chief+0.185
Neg logit contributions
ashes-0.257
ashing-0.245
elly-0.240
ind-0.240
away-0.222
Token?"
Token ID1701
Feature activation-1.86e+00

Pos logit contributions
 replied+0.129
lf+0.109
 responded+0.108
chief+0.107
 chuckled+0.102
Neg logit contributions
ashing-0.133
ashes-0.133
elly-0.130
ind-0.121
umpy-0.113
Token Tom
Token ID4186
Feature activation-1.92e+00

Pos logit contributions
 replied+0.088
lf+0.080
chief+0.077
 wouldn+0.075
 responded+0.074
Neg logit contributions
ashes-0.088
ashing-0.088
elly-0.087
ind-0.077
umpy-0.075
TokenBen
Token ID11696
Feature activation-1.27e+00

Pos logit contributions
ashing+0.169
ashes+0.165
elly+0.158
ind+0.149
umpy+0.138
Neg logit contributions
 replied-0.160
lf-0.134
 responded-0.129
chief-0.127
 wouldn-0.124
Token,
Token ID11
Feature activation+8.36e-01

Pos logit contributions
 replied+0.065
lf+0.058
 responded+0.054
chief+0.054
 chuckled+0.052
Neg logit contributions
ashing-0.057
ashes-0.057
elly-0.057
ind-0.051
umpy-0.050
Token why
Token ID1521
Feature activation-3.94e+00

Pos logit contributions
ashing+0.038
ashes+0.037
elly+0.035
ind+0.032
and+0.031
Neg logit contributions
 replied-0.043
 responded-0.037
lf-0.037
master-0.035
chief-0.035
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
 replied+0.191
chief+0.181
 responded+0.177
master+0.165
 chuckled+0.161
Neg logit contributions
ashing-0.168
elly-0.165
ashes-0.163
umpy-0.150
 alone-0.139
Token Why
Token ID4162
Feature activation-6.26e+00

Pos logit contributions
ashing+0.007
ashes+0.006
elly+0.006
ind+0.006
umpy+0.005
Neg logit contributions
 replied-0.006
lf-0.005
 responded-0.005
chief-0.005
 wouldn-0.005
Token did
Token ID750
Feature activation-8.72e+00

Pos logit contributions
 replied+0.319
chief+0.311
 responded+0.295
master+0.280
lf+0.279
Neg logit contributions
ashing-0.234
ashes-0.233
elly-0.217
 hate-0.207
umpy-0.199
Token you
Token ID345
Feature activation+8.34e-01

Pos logit contributions
 replied+0.494
 responded+0.439
flake+0.422
man+0.418
chief+0.413
Neg logit contributions
ashes-0.302
ashing-0.290
ind-0.288
elly-0.286
away-0.249
Token do
Token ID466
Feature activation-5.16e+00

Pos logit contributions
ashes+0.037
ashing+0.037
elly+0.034
ind+0.034
umpy+0.032
Neg logit contributions
 replied-0.042
lf-0.037
 responded-0.036
chief-0.035
master-0.034
Token this
Token ID428
Feature activation-2.36e+00

Pos logit contributions
 replied+0.225
 responded+0.187
lf+0.181
chief+0.174
 wouldn+0.170
Neg logit contributions
ashing-0.270
ashes-0.269
elly-0.263
ind-0.256
away-0.232
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 replied+0.121
lf+0.105
chief+0.103
 responded+0.103
 chuckled+0.097
Neg logit contributions
ashing-0.109
ashes-0.108
elly-0.106
ind-0.097
umpy-0.092
Token Why
Token ID4162
Feature activation-6.04e+00

Pos logit contributions
ashing+0.010
ashes+0.010
elly+0.010
ind+0.009
umpy+0.009
Neg logit contributions
 replied-0.010
lf-0.008
 responded-0.008
chief-0.008
 wouldn-0.007
Token were
Token ID547
Feature activation-9.20e-01

Pos logit contributions
 replied+0.147
lf+0.122
chief+0.122
 responded+0.116
man+0.108
Neg logit contributions
ashing-0.200
ashes-0.197
elly-0.197
ind-0.179
umpy-0.177
Token you
Token ID345
Feature activation+2.46e+00

Pos logit contributions
 replied+0.046
lf+0.039
 responded+0.037
chief+0.036
 wouldn+0.036
Neg logit contributions
ashing-0.046
ashes-0.045
elly-0.044
ind-0.041
umpy-0.039
Token thinking
Token ID3612
Feature activation-3.66e+00

Pos logit contributions
ashing+0.116
ind+0.112
elly+0.105
ashes+0.104
umpy+0.098
Neg logit contributions
 replied-0.117
chief-0.103
lf-0.101
 responded-0.101
master-0.100
Token?
Token ID30
Feature activation-1.59e+00

Pos logit contributions
 replied+0.193
lf+0.165
chief+0.164
 responded+0.163
 wouldn+0.153
Neg logit contributions
ashing-0.164
ashes-0.160
elly-0.159
ind-0.146
umpy-0.138
Token Why
Token ID4162
Feature activation-6.11e+00

Pos logit contributions
 replied+0.077
lf+0.062
chief+0.062
 responded+0.060
 wouldn+0.060
Neg logit contributions
ashing-0.080
ashes-0.079
elly-0.075
ind-0.073
away-0.069
Token did
Token ID750
Feature activation-8.71e+00

Pos logit contributions
 replied+0.304
chief+0.303
 responded+0.275
lf+0.265
master+0.259
Neg logit contributions
ashing-0.238
ashes-0.235
elly-0.230
umpy-0.209
 hate-0.204
Token you
Token ID345
Feature activation+7.35e-01

Pos logit contributions
 replied+0.502
 responded+0.443
lf+0.429
man+0.427
flake+0.424
Neg logit contributions
ashes-0.316
ashing-0.303
elly-0.300
ind-0.292
umpy-0.246
Token go
Token ID467
Feature activation+3.55e+00

Pos logit contributions
ashes+0.033
ashing+0.033
ind+0.030
elly+0.030
umpy+0.029
Neg logit contributions
 replied-0.035
lf-0.032
chief-0.031
 responded-0.030
master-0.029
Token inside
Token ID2641
Feature activation+2.88e-01

Pos logit contributions
ashing+0.171
elly+0.167
ashes+0.165
ind+0.150
umpy+0.145
Neg logit contributions
 replied-0.176
man-0.145
lf-0.145
 said-0.141
 rang-0.137
Token the
Token ID262
Feature activation-8.18e-01

Pos logit contributions
ashing+0.015
ashes+0.015
elly+0.015
ind+0.014
umpy+0.013
Neg logit contributions
 replied-0.013
lf-0.011
 responded-0.010
chief-0.010
 wouldn-0.010
Token pengu
Token ID38373
Feature activation-5.78e-01

Pos logit contributions
 replied+0.033
lf+0.030
 button+0.027
 nose+0.027
man+0.027
Neg logit contributions
ashing-0.048
ashes-0.045
elly-0.044
umpy-0.040
ind-0.038
Token are
Token ID389
Feature activation-2.95e+00

Pos logit contributions
 replied+0.305
chief+0.299
 responded+0.272
man+0.262
lf+0.261
Neg logit contributions
ashes-0.256
ashing-0.248
elly-0.247
umpy-0.221
 hate-0.209
Token you
Token ID345
Feature activation+1.06e+00

Pos logit contributions
 replied+0.162
lf+0.139
 responded+0.137
chief+0.134
man+0.132
Neg logit contributions
ashing-0.125
ashes-0.124
elly-0.121
ind-0.114
umpy-0.102
Token fighting
Token ID4330
Feature activation-1.63e+00

Pos logit contributions
ashing+0.045
ashes+0.043
elly+0.042
ind+0.041
umpy+0.037
Neg logit contributions
 replied-0.060
 responded-0.051
lf-0.051
chief-0.050
man-0.049
Token?
Token ID30
Feature activation-1.81e+00

Pos logit contributions
 replied+0.085
chief+0.075
lf+0.074
 responded+0.073
 wouldn+0.068
Neg logit contributions
ashing-0.073
ashes-0.073
elly-0.072
ind-0.066
umpy-0.063
Token Why
Token ID4162
Feature activation-6.09e+00

Pos logit contributions
 replied+0.087
lf+0.072
 responded+0.069
chief+0.069
 wouldn+0.067
Neg logit contributions
ashing-0.093
ashes-0.091
elly-0.087
ind-0.084
away-0.078
Token did
Token ID750
Feature activation-8.71e+00

Pos logit contributions
chief+0.324
 replied+0.314
 responded+0.287
lf+0.271
man+0.271
Neg logit contributions
ashes-0.238
elly-0.234
ashing-0.232
umpy-0.208
 hate-0.202
Token you
Token ID345
Feature activation+8.71e-01

Pos logit contributions
 replied+0.548
 responded+0.499
flake+0.492
man+0.481
chief+0.473
Neg logit contributions
ashes-0.254
elly-0.239
ind-0.238
ashing-0.237
umpy-0.188
Token take
Token ID1011
Feature activation+2.42e-01

Pos logit contributions
ashes+0.042
ashing+0.041
elly+0.038
ind+0.036
umpy+0.035
Neg logit contributions
 replied-0.041
chief-0.037
lf-0.037
 responded-0.035
man-0.035
Token my
Token ID616
Feature activation+2.78e+00

Pos logit contributions
ashes+0.012
ashing+0.011
elly+0.011
ind+0.011
away+0.010
Neg logit contributions
 replied-0.012
lf-0.011
 responded-0.010
chief-0.010
 wouldn-0.010
Token paper
Token ID3348
Feature activation-3.97e+00

Pos logit contributions
ashes+0.177
ashing+0.173
elly+0.172
ind+0.164
away+0.154
Neg logit contributions
 replied-0.096
 responded-0.073
lf-0.072
chief-0.067
 wouldn-0.067
Token?
Token ID30
Feature activation-1.63e+00

Pos logit contributions
 replied+0.216
 responded+0.188
lf+0.187
chief+0.182
 chuckled+0.177
Neg logit contributions
ashes-0.169
elly-0.166
ashing-0.164
ind-0.153
away-0.142
Token did
Token ID750
Feature activation-6.99e+00

Pos logit contributions
 replied+0.154
lf+0.125
chief+0.121
 responded+0.119
 wouldn+0.110
Neg logit contributions
ashing-0.228
ashes-0.227
elly-0.225
ind-0.208
umpy-0.202
Token you
Token ID345
Feature activation+2.20e+00

Pos logit contributions
 replied+0.382
lf+0.325
 responded+0.325
chief+0.315
 wouldn+0.308
Neg logit contributions
ashes-0.292
ashing-0.291
elly-0.282
ind-0.266
away-0.238
Token do
Token ID466
Feature activation-3.96e+00

Pos logit contributions
ashing+0.096
ashes+0.091
elly+0.087
umpy+0.085
ind+0.079
Neg logit contributions
chief-0.109
 replied-0.106
lf-0.104
master-0.101
man-0.096
Token?
Token ID30
Feature activation-1.54e+00

Pos logit contributions
 replied+0.196
lf+0.167
 responded+0.161
chief+0.159
 wouldn+0.153
Neg logit contributions
ashing-0.194
ashes-0.191
elly-0.187
ind-0.174
umpy-0.164
Token Why
Token ID4162
Feature activation-5.98e+00

Pos logit contributions
 replied+0.077
lf+0.066
 responded+0.063
chief+0.063
 wouldn+0.060
Neg logit contributions
ashing-0.076
ashes-0.075
elly-0.072
ind-0.068
umpy-0.063
Token did
Token ID750
Feature activation-8.71e+00

Pos logit contributions
 replied+0.302
chief+0.294
 responded+0.265
lf+0.257
man+0.251
Neg logit contributions
ashes-0.253
ashing-0.245
elly-0.240
 hate-0.217
umpy-0.217
Token you
Token ID345
Feature activation+7.50e-01

Pos logit contributions
 replied+0.531
 responded+0.477
flake+0.471
man+0.463
 wouldn+0.455
Neg logit contributions
ashes-0.270
elly-0.255
ashing-0.250
ind-0.250
 unnecessary-0.209
Token not
Token ID407
Feature activation-2.34e+00

Pos logit contributions
ashes+0.034
ashing+0.033
elly+0.031
ind+0.030
umpy+0.029
Neg logit contributions
 replied-0.036
lf-0.032
 responded-0.032
chief-0.031
master-0.030
Token behave
Token ID17438
Feature activation-7.41e-01

Pos logit contributions
 replied+0.097
lf+0.078
 responded+0.073
 said+0.072
 rang+0.071
Neg logit contributions
ashing-0.136
ashes-0.133
elly-0.131
ind-0.123
oky-0.115
Token?
Token ID30
Feature activation-1.97e+00

Pos logit contributions
 replied+0.037
lf+0.031
 responded+0.030
chief+0.029
 wouldn+0.028
Neg logit contributions
ashing-0.037
ashes-0.036
elly-0.035
ind-0.033
umpy-0.031
Token You
Token ID921
Feature activation-1.48e+00

Pos logit contributions
 replied+0.099
lf+0.084
 responded+0.080
chief+0.079
 wouldn+0.077
Neg logit contributions
ashing-0.096
ashes-0.095
elly-0.091
ind-0.086
umpy-0.080
Token over
Token ID625
Feature activation+1.57e+00

Pos logit contributions
 replied+0.150
lf+0.128
 responded+0.124
chief+0.124
 wouldn+0.118
Neg logit contributions
ashing-0.147
ashes-0.147
elly-0.143
ind-0.134
umpy-0.126
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
ashing+0.091
ashes+0.090
elly+0.087
ind+0.082
umpy+0.078
Neg logit contributions
 replied-0.065
lf-0.054
 responded-0.050
chief-0.049
 wouldn-0.048
Token button
Token ID4936
Feature activation-3.76e+00

Pos logit contributions
 replied+0.060
 responded+0.049
lf+0.047
 chuckled+0.046
chief+0.046
Neg logit contributions
ashes-0.069
elly-0.066
ashing-0.066
ind-0.064
umpy-0.061
Token?
Token ID30
Feature activation-1.73e+00

Pos logit contributions
 replied+0.197
lf+0.175
chief+0.171
 responded+0.169
 chuckled+0.160
Neg logit contributions
ashes-0.164
ashing-0.163
elly-0.162
ind-0.143
umpy-0.140
Token Why
Token ID4162
Feature activation-6.10e+00

Pos logit contributions
 replied+0.086
lf+0.073
 responded+0.070
chief+0.069
 wouldn+0.067
Neg logit contributions
ashing-0.085
ashes-0.084
elly-0.081
ind-0.077
umpy-0.071
Token did
Token ID750
Feature activation-8.71e+00

Pos logit contributions
chief+0.293
 replied+0.274
 responded+0.244
man+0.240
master+0.233
Neg logit contributions
ashes-0.260
elly-0.257
ashing-0.257
 hate-0.234
umpy-0.232
Token you
Token ID345
Feature activation+8.99e-01

Pos logit contributions
 replied+0.550
 responded+0.502
flake+0.497
man+0.486
chief+0.472
Neg logit contributions
ashes-0.248
ind-0.235
ashing-0.230
elly-0.229
 unnecessary-0.194
Token break
Token ID2270
Feature activation-7.94e-02

Pos logit contributions
ashes+0.041
ashing+0.040
elly+0.036
ind+0.036
umpy+0.035
Neg logit contributions
 replied-0.043
lf-0.040
chief-0.039
 responded-0.038
man-0.037
Token the
Token ID262
Feature activation-1.30e+00

Pos logit contributions
 replied+0.004
lf+0.003
 responded+0.003
chief+0.003
 wouldn+0.003
Neg logit contributions
ashes-0.004
ashing-0.004
elly-0.004
ind-0.004
umpy-0.004
Token thread
Token ID4704
Feature activation-3.23e+00

Pos logit contributions
 replied+0.060
 responded+0.051
 Andrew+0.050
 chuckled+0.049
 Scott+0.048
Neg logit contributions
ashes-0.062
elly-0.059
ind-0.058
 alone-0.055
ashing-0.055
Token?
Token ID30
Feature activation-1.75e+00

Pos logit contributions
 replied+0.172
lf+0.150
 responded+0.147
chief+0.144
 chuckled+0.139
Neg logit contributions
ashes-0.143
ashing-0.142
elly-0.140
ind-0.124
umpy-0.122
Token are
Token ID389
Feature activation-8.43e-01

Pos logit contributions
 replied+0.150
lf+0.124
 responded+0.118
chief+0.118
 wouldn+0.111
Neg logit contributions
ashing-0.193
ashes-0.191
elly-0.188
ind-0.175
umpy-0.167
Token you
Token ID345
Feature activation+2.46e+00

Pos logit contributions
 replied+0.045
lf+0.039
 responded+0.038
chief+0.037
man+0.036
Neg logit contributions
ashing-0.038
ashes-0.037
elly-0.036
ind-0.034
umpy-0.031
Token doing
Token ID1804
Feature activation-3.04e+00

Pos logit contributions
ashing+0.121
ashes+0.113
elly+0.113
ind+0.112
umpy+0.103
Neg logit contributions
 replied-0.115
chief-0.101
master-0.098
 responded-0.098
lf-0.096
Token?
Token ID30
Feature activation-1.36e+00

Pos logit contributions
 replied+0.143
lf+0.119
 responded+0.116
chief+0.115
 wouldn+0.111
Neg logit contributions
ashing-0.155
ashes-0.153
elly-0.150
ind-0.141
umpy-0.132
Token Why
Token ID4162
Feature activation-5.98e+00

Pos logit contributions
 replied+0.066
lf+0.056
chief+0.053
 responded+0.053
 wouldn+0.051
Neg logit contributions
ashing-0.068
ashes-0.067
elly-0.064
ind-0.062
away-0.057
Token did
Token ID750
Feature activation-8.70e+00

Pos logit contributions
 replied+0.313
chief+0.294
 responded+0.275
lf+0.271
master+0.264
Neg logit contributions
ashes-0.253
ashing-0.248
elly-0.244
umpy-0.215
ind-0.199
Token you
Token ID345
Feature activation+7.43e-01

Pos logit contributions
 replied+0.524
 responded+0.469
flake+0.458
man+0.456
 wouldn+0.449
Neg logit contributions
ashes-0.280
elly-0.269
ind-0.263
ashing-0.262
umpy-0.215
Token throw
Token ID3714
Feature activation-2.15e-01

Pos logit contributions
ashes+0.034
ashing+0.033
elly+0.030
ind+0.030
umpy+0.028
Neg logit contributions
 replied-0.036
lf-0.032
chief-0.032
 responded-0.032
man-0.031
Token the
Token ID262
Feature activation-7.68e-01

Pos logit contributions
 replied+0.009
lf+0.007
 responded+0.007
chief+0.007
 wouldn+0.007
Neg logit contributions
ashing-0.012
ashes-0.012
elly-0.012
ind-0.011
umpy-0.011
Token potato
Token ID21219
Feature activation-4.27e+00

Pos logit contributions
 replied+0.033
lf+0.027
 responded+0.026
 rang+0.025
 chuckled+0.024
Neg logit contributions
ashes-0.044
elly-0.044
ashing-0.043
ind-0.040
umpy-0.039
Token?
Token ID30
Feature activation-1.81e+00

Pos logit contributions
 replied+0.224
lf+0.194
 responded+0.186
chief+0.183
 wouldn+0.178
Neg logit contributions
ashing-0.196
ashes-0.194
elly-0.188
ind-0.174
umpy-0.163
Token you
Token ID345
Feature activation+1.58e+00

Pos logit contributions
 replied+0.171
lf+0.148
 responded+0.141
chief+0.139
 wouldn+0.134
Neg logit contributions
ashing-0.158
ashes-0.158
elly-0.153
ind-0.144
umpy-0.132
Token like
Token ID588
Feature activation-9.49e-01

Pos logit contributions
ashing+0.069
ashes+0.068
elly+0.065
ind+0.061
umpy+0.056
Neg logit contributions
 replied-0.087
lf-0.076
 responded-0.073
chief-0.072
 wouldn-0.070
Token your
Token ID534
Feature activation+3.72e-01

Pos logit contributions
 replied+0.044
lf+0.037
 responded+0.035
chief+0.034
 wouldn+0.033
Neg logit contributions
ashing-0.050
ashes-0.050
elly-0.049
ind-0.046
umpy-0.043
Token hats
Token ID23910
Feature activation-3.29e+00

Pos logit contributions
ashes+0.018
ashing+0.017
elly+0.017
ind+0.016
oky+0.014
Neg logit contributions
 replied-0.018
lf-0.016
 responded-0.015
 wouldn-0.015
chief-0.015
Token and
Token ID290
Feature activation-1.80e+00

Pos logit contributions
 replied+0.194
lf+0.175
 responded+0.167
chief+0.166
 chuckled+0.157
Neg logit contributions
ashing-0.128
ashes-0.127
elly-0.122
ind-0.111
umpy-0.104
Token scar
Token ID10153
Feature activation-8.70e+00

Pos logit contributions
 replied+0.102
 responded+0.089
lf+0.089
 wouldn+0.084
master+0.083
Neg logit contributions
ashes-0.073
ind-0.066
ashing-0.066
elly-0.062
 yards-0.052
Tokenves
Token ID1158
Feature activation-2.31e+00

Pos logit contributions
 replied+0.344
 responded+0.282
chief+0.261
lf+0.255
 Steven+0.255
Neg logit contributions
ashes-0.510
ashing-0.509
elly-0.482
ind-0.481
umpy-0.450
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
 replied+0.136
lf+0.122
 responded+0.116
chief+0.116
 wouldn+0.110
Neg logit contributions
ashes-0.091
ashing-0.091
elly-0.087
ind-0.079
umpy-0.073
Token You
Token ID921
Feature activation-9.67e-01

Pos logit contributions
ashing+0.016
ashes+0.016
elly+0.015
ind+0.014
umpy+0.013
Neg logit contributions
 replied-0.016
lf-0.014
 responded-0.013
chief-0.013
 wouldn-0.013
Token are
Token ID389
Feature activation-4.47e-01

Pos logit contributions
 replied+0.051
lf+0.048
chief+0.046
 responded+0.043
man+0.042
Neg logit contributions
ashes-0.042
ashing-0.040
elly-0.037
ind-0.035
umpy-0.035
Token our
Token ID674
Feature activation+1.06e+00

Pos logit contributions
 replied+0.025
lf+0.021
 responded+0.021
chief+0.021
 wouldn+0.020
Neg logit contributions
ashing-0.019
ashes-0.018
elly-0.018
ind-0.018
umpy-0.015
Token apologized
Token ID22893
Feature activation-9.39e-02

Pos logit contributions
lf+0.040
chief+0.038
flake+0.034
 replied+0.034
man+0.031
Neg logit contributions
ashing-0.102
elly-0.094
ashes-0.093
ind-0.091
umpy-0.088
Token and
Token ID290
Feature activation-8.19e-01

Pos logit contributions
 replied+0.005
lf+0.004
 responded+0.004
chief+0.004
 wouldn+0.004
Neg logit contributions
ashing-0.004
ashes-0.004
elly-0.004
ind-0.004
umpy-0.004
Token gave
Token ID2921
Feature activation-8.52e-01

Pos logit contributions
 replied+0.028
lf+0.028
chief+0.027
flake+0.025
man+0.024
Neg logit contributions
ashing-0.045
ashes-0.043
elly-0.040
ind-0.038
away-0.037
Token the
Token ID262
Feature activation-1.59e+00

Pos logit contributions
 replied+0.048
lf+0.043
 responded+0.041
chief+0.040
 rang+0.039
Neg logit contributions
ashing-0.036
ashes-0.035
elly-0.034
ind-0.032
umpy-0.028
Token toys
Token ID14958
Feature activation-2.11e-02

Pos logit contributions
 replied+0.082
lf+0.070
chief+0.069
 responded+0.067
 wouldn+0.064
Neg logit contributions
ashing-0.075
ashes-0.075
elly-0.071
ind-0.066
umpy-0.064
Token back
Token ID736
Feature activation+6.67e+00

Pos logit contributions
 replied+0.001
lf+0.001
 wouldn+0.001
 responded+0.001
chief+0.001
Neg logit contributions
ashing-0.001
elly-0.001
ashes-0.001
ind-0.001
umpy-0.001
Token.
Token ID13
Feature activation+4.63e-01

Pos logit contributions
ashing+0.271
ashes+0.268
elly+0.260
ind+0.239
umpy+0.223
Neg logit contributions
 replied-0.389
lf-0.337
 responded-0.325
chief-0.321
 wouldn-0.317
Token
Token ID198
Feature activation-7.80e-01

Pos logit contributions
ashing+0.022
ashes+0.022
elly+0.021
ind+0.019
umpy+0.018
Neg logit contributions
 replied-0.023
lf-0.021
 responded-0.020
chief-0.019
 wouldn-0.019
Token
Token ID198
Feature activation-7.56e-01

Pos logit contributions
 replied+0.037
lf+0.031
 responded+0.029
chief+0.029
 wouldn+0.028
Neg logit contributions
ashing-0.041
ashes-0.040
elly-0.039
ind-0.037
umpy-0.034
TokenL
Token ID43
Feature activation+1.16e+00

Pos logit contributions
 replied+0.035
lf+0.031
 responded+0.026
 wouldn+0.025
 said+0.025
Neg logit contributions
ashes-0.041
ashing-0.041
elly-0.040
ind-0.037
oky-0.036
Tokenily
Token ID813
Feature activation+7.00e-01

Pos logit contributions
ashing+0.048
ashes+0.048
ind+0.044
elly+0.042
oky+0.040
Neg logit contributions
 replied-0.065
lf-0.057
 responded-0.052
 wouldn-0.052
on-0.051
Token waited
Token ID13488
Feature activation-1.04e+00

Pos logit contributions
ashing+0.015
ashes+0.014
elly+0.013
ind+0.013
umpy+0.013
Neg logit contributions
lf-0.010
chief-0.009
 replied-0.009
flake-0.009
on-0.008
Token for
Token ID329
Feature activation+1.17e-01

Pos logit contributions
 replied+0.053
lf+0.046
 responded+0.044
chief+0.043
 wouldn+0.042
Neg logit contributions
ashing-0.049
ashes-0.049
elly-0.047
ind-0.044
umpy-0.041
Token Jack
Token ID3619
Feature activation+1.52e+00

Pos logit contributions
ashes+0.005
ashing+0.005
elly+0.005
ind+0.004
away+0.004
Neg logit contributions
 replied-0.006
 responded-0.005
lf-0.005
man-0.005
chief-0.005
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
ashing+0.063
ashes+0.063
elly+0.061
ind+0.056
umpy+0.052
Neg logit contributions
 replied-0.087
lf-0.076
 responded-0.073
chief-0.072
 wouldn-0.071
Token come
Token ID1282
Feature activation+2.06e+00

Pos logit contributions
 replied+0.066
lf+0.060
chief+0.057
man+0.055
 responded+0.055
Neg logit contributions
ashes-0.073
ashing-0.071
ind-0.066
elly-0.065
away-0.063
Token back
Token ID736
Feature activation+7.29e+00

Pos logit contributions
ashes+0.103
elly+0.102
ashing+0.102
ind+0.091
umpy+0.090
Neg logit contributions
 replied-0.100
man-0.086
 nose-0.080
 wouldn-0.080
lf-0.080
Token,
Token ID11
Feature activation+1.83e+00

Pos logit contributions
ashing+0.390
ashes+0.389
elly+0.375
ind+0.353
umpy+0.336
Neg logit contributions
 replied-0.336
lf-0.280
 responded-0.261
man-0.259
 said-0.255
Token but
Token ID475
Feature activation-5.13e+00

Pos logit contributions
ashing+0.086
ashes+0.078
elly+0.070
and+0.068
ind+0.066
Neg logit contributions
 replied-0.093
lf-0.086
 responded-0.079
master-0.079
man-0.078
Token he
Token ID339
Feature activation+4.73e-01

Pos logit contributions
 replied+0.202
chief+0.181
 responded+0.169
lf+0.167
man+0.164
Neg logit contributions
ashing-0.266
elly-0.243
ashes-0.242
 alone-0.231
ind-0.225
Token never
Token ID1239
Feature activation+2.93e-03

Pos logit contributions
ashing+0.027
ashes+0.025
elly+0.024
ind+0.024
umpy+0.023
Neg logit contributions
 replied-0.016
chief-0.015
lf-0.015
 Steven-0.014
sets-0.013
Token did
Token ID750
Feature activation-6.31e+00

Pos logit contributions
ashing+0.000
ashes+0.000
elly+0.000
ind+0.000
umpy+0.000
Neg logit contributions
 replied-0.000
lf-0.000
chief-0.000
 responded-0.000
man-0.000
Tokenir
Token ID343
Feature activation-2.32e+00

Pos logit contributions
 replied+0.130
lf+0.117
chief+0.103
 responded+0.102
on+0.102
Neg logit contributions
ashes-0.118
ashing-0.116
elly-0.111
oky-0.100
away-0.100
Tokenpy
Token ID9078
Feature activation-7.35e-01

Pos logit contributions
 replied+0.115
lf+0.098
 responded+0.095
chief+0.094
 wouldn+0.090
Neg logit contributions
ashing-0.114
ashes-0.113
elly-0.111
ind-0.105
umpy-0.100
Token loved
Token ID6151
Feature activation+2.42e+00

Pos logit contributions
 replied+0.033
lf+0.028
 responded+0.027
chief+0.026
 wouldn+0.025
Neg logit contributions
ashing-0.039
ashes-0.039
elly-0.038
ind-0.036
umpy-0.033
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
ashing+0.094
ashes+0.093
elly+0.090
ind+0.082
umpy+0.076
Neg logit contributions
 replied-0.145
lf-0.127
 responded-0.123
chief-0.121
 wouldn-0.118
Token sing
Token ID1702
Feature activation-8.39e-01

Pos logit contributions
 replied+0.050
lf+0.043
 responded+0.043
 wouldn+0.042
chief+0.042
Neg logit contributions
ashes-0.048
ashing-0.045
elly-0.041
ind-0.040
away-0.038
Token all
Token ID477
Feature activation+5.65e+00

Pos logit contributions
 replied+0.039
lf+0.034
 responded+0.031
man+0.030
 wouldn+0.030
Neg logit contributions
ashing-0.044
ashes-0.044
elly-0.043
ind-0.040
umpy-0.038
Token day
Token ID1110
Feature activation+1.50e+00

Pos logit contributions
ashing+0.296
ashes+0.293
elly+0.285
ind+0.268
umpy+0.252
Neg logit contributions
 replied-0.263
lf-0.222
 responded-0.212
chief-0.209
 wouldn-0.202
Token long
Token ID890
Feature activation+2.49e+00

Pos logit contributions
ashing+0.059
ashes+0.057
elly+0.055
ind+0.051
umpy+0.046
Neg logit contributions
 replied-0.089
lf-0.077
 responded-0.076
chief-0.075
 wouldn-0.074
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
ashing+0.127
ashes+0.125
elly+0.121
ind+0.112
umpy+0.106
Neg logit contributions
 replied-0.123
lf-0.108
 responded-0.097
chief-0.096
 said-0.096
Token One
Token ID1881
Feature activation+3.72e+00

Pos logit contributions
ashing+0.061
ashes+0.060
elly+0.058
ind+0.054
umpy+0.050
Neg logit contributions
 replied-0.077
lf-0.068
 responded-0.065
chief-0.064
 wouldn-0.063
Token day
Token ID1110
Feature activation-4.68e-01

Pos logit contributions
ashing+0.193
ashes+0.190
elly+0.186
ind+0.174
umpy+0.164
Neg logit contributions
 replied-0.175
lf-0.148
 responded-0.141
chief-0.139
 wouldn-0.135
Token more
Token ID517
Feature activation+4.89e+00

Pos logit contributions
ashes+0.034
ashing+0.033
and+0.030
away+0.029
elly+0.028
Neg logit contributions
 replied-0.043
lf-0.038
man-0.037
chief-0.037
flake-0.037
Token steps
Token ID4831
Feature activation+6.89e-01

Pos logit contributions
ashing+0.218
ashes+0.214
elly+0.209
ind+0.195
umpy+0.181
Neg logit contributions
 replied-0.268
lf-0.232
 responded-0.222
chief-0.217
 wouldn-0.213
Token forward
Token ID2651
Feature activation+4.39e+00

Pos logit contributions
ashes+0.031
elly+0.031
ashing+0.030
ind+0.027
umpy+0.027
Neg logit contributions
 replied-0.036
chief-0.031
lf-0.030
 wouldn-0.030
 responded-0.029
Token.
Token ID13
Feature activation+2.84e-01

Pos logit contributions
ashing+0.214
ashes+0.212
elly+0.208
ind+0.194
umpy+0.184
Neg logit contributions
 replied-0.219
lf-0.185
chief-0.176
 responded-0.175
 wouldn-0.175
Token<|endoftext|>
Token ID50256
Feature activation+1.73e+00

Pos logit contributions
ashing+0.015
ashes+0.015
elly+0.014
ind+0.014
umpy+0.013
Neg logit contributions
 replied-0.013
lf-0.011
 responded-0.011
chief-0.010
 wouldn-0.010
TokenOne
Token ID3198
Feature activation+5.59e+00

Pos logit contributions
ashing+0.078
ashes+0.077
elly+0.076
ind+0.069
umpy+0.065
Neg logit contributions
 replied-0.091
lf-0.079
 responded-0.076
chief-0.075
 wouldn-0.073
Token day
Token ID1110
Feature activation+3.90e-01

Pos logit contributions
ashing+0.273
ashes+0.266
elly+0.262
ind+0.241
umpy+0.230
Neg logit contributions
 replied-0.278
lf-0.241
chief-0.233
 responded-0.229
 wouldn-0.223
Token,
Token ID11
Feature activation+2.46e+00

Pos logit contributions
ashes+0.022
ashing+0.022
elly+0.021
ind+0.021
umpy+0.019
Neg logit contributions
 replied-0.017
lf-0.014
 responded-0.013
chief-0.013
 wouldn-0.012
Token Mum
Token ID38367
Feature activation+2.28e+00

Pos logit contributions
ashes+0.161
ashing+0.156
elly+0.152
ind+0.145
umpy+0.140
Neg logit contributions
 replied-0.090
lf-0.068
 responded-0.064
 said-0.061
man-0.061
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
ashing+0.122
elly+0.121
ashes+0.115
ind+0.112
umpy+0.105
Neg logit contributions
 replied-0.091
lf-0.082
chief-0.081
 responded-0.076
master-0.076
Token Dad
Token ID17415
Feature activation+1.67e+00

Pos logit contributions
 replied+0.048
lf+0.041
 responded+0.038
chief+0.038
 wouldn+0.036
Neg logit contributions
ashing-0.064
ashes-0.063
elly-0.061
ind-0.058
umpy-0.054
Token her
Token ID607
Feature activation+1.19e+00

Pos logit contributions
 replied+0.187
lf+0.173
chief+0.169
 rang+0.167
 responded+0.163
Neg logit contributions
ashes-0.114
ashing-0.113
elly-0.108
ind-0.104
and-0.099
Token happy
Token ID3772
Feature activation+4.06e-02

Pos logit contributions
ashes+0.053
ashing+0.052
elly+0.047
umpy+0.043
and+0.043
Neg logit contributions
 replied-0.063
lf-0.057
 responded-0.054
chief-0.052
man-0.051
Token.
Token ID13
Feature activation+3.24e-01

Pos logit contributions
ashes+0.001
ashing+0.001
elly+0.001
ind+0.001
umpy+0.001
Neg logit contributions
 replied-0.002
lf-0.002
chief-0.002
 responded-0.002
 Steven-0.002
Token She
Token ID1375
Feature activation-2.02e-01

Pos logit contributions
ashing+0.015
elly+0.014
ashes+0.014
ind+0.013
umpy+0.012
Neg logit contributions
 replied-0.017
lf-0.015
chief-0.014
 responded-0.014
 wouldn-0.014
Token went
Token ID1816
Feature activation+4.18e+00

Pos logit contributions
lf+0.008
chief+0.008
 replied+0.007
 Steven+0.007
man+0.007
Neg logit contributions
ashing-0.009
ashes-0.009
elly-0.009
umpy-0.008
ind-0.008
Token back
Token ID736
Feature activation+7.74e+00

Pos logit contributions
elly+0.185
ashing+0.183
ashes+0.179
ind+0.169
umpy+0.159
Neg logit contributions
 replied-0.227
lf-0.196
chief-0.189
 responded-0.184
man-0.182
Token to
Token ID284
Feature activation-2.02e-02

Pos logit contributions
ashing+0.337
ashes+0.331
elly+0.329
ind+0.303
umpy+0.292
Neg logit contributions
 replied-0.429
lf-0.368
chief-0.353
 responded-0.349
 said-0.348
Token the
Token ID262
Feature activation-3.40e-01

Pos logit contributions
 replied+0.001
lf+0.001
 responded+0.001
chief+0.001
man+0.001
Neg logit contributions
ashing-0.001
ashes-0.001
elly-0.001
ind-0.001
and-0.001
Token barrel
Token ID9036
Feature activation-1.39e+00

Pos logit contributions
 replied+0.014
lf+0.012
 responded+0.011
man+0.010
chief+0.010
Neg logit contributions
ashes-0.020
ashing-0.020
elly-0.019
ind-0.018
umpy-0.017
Token every
Token ID790
Feature activation+1.30e+00

Pos logit contributions
 replied+0.073
lf+0.064
chief+0.061
 responded+0.061
 rang+0.058
Neg logit contributions
ashing-0.062
ashes-0.060
elly-0.058
ind-0.053
umpy-0.050
Token day
Token ID1110
Feature activation+4.16e-01

Pos logit contributions
ashing+0.067
ashes+0.066
elly+0.065
ind+0.061
umpy+0.057
Neg logit contributions
 replied-0.061
lf-0.052
 responded-0.049
chief-0.049
 wouldn-0.047
Token was
Token ID373
Feature activation-8.97e-01

Pos logit contributions
 replied+0.034
chief+0.032
lf+0.030
 Steven+0.029
 responded+0.028
Neg logit contributions
ashing-0.056
elly-0.054
ashes-0.053
ind-0.052
umpy-0.048
Token so
Token ID523
Feature activation-5.48e-01

Pos logit contributions
 replied+0.049
lf+0.043
chief+0.041
 responded+0.041
 wouldn+0.039
Neg logit contributions
ashing-0.039
ashes-0.038
elly-0.037
ind-0.035
umpy-0.033
Token excited
Token ID6568
Feature activation+1.07e-01

Pos logit contributions
 replied+0.028
lf+0.023
 responded+0.023
 said+0.022
 rang+0.022
Neg logit contributions
ashes-0.027
ashing-0.027
elly-0.026
ind-0.024
oky-0.022
Token that
Token ID326
Feature activation-2.62e+00

Pos logit contributions
ashing+0.005
elly+0.005
ashes+0.005
ind+0.004
umpy+0.004
Neg logit contributions
 replied-0.006
lf-0.005
chief-0.005
 responded-0.005
man-0.005
Token she
Token ID673
Feature activation+2.54e+00

Pos logit contributions
 replied+0.142
chief+0.132
 responded+0.125
lf+0.122
man+0.121
Neg logit contributions
elly-0.091
ashing-0.089
ashes-0.083
ind-0.083
 alone-0.081
Token wanted
Token ID2227
Feature activation+8.02e-01

Pos logit contributions
ashing+0.141
ashes+0.134
elly+0.130
ind+0.129
umpy+0.120
Neg logit contributions
 replied-0.100
chief-0.086
lf-0.084
 responded-0.081
man-0.079
Token to
Token ID284
Feature activation-2.07e+00

Pos logit contributions
ashes+0.024
ashing+0.024
elly+0.023
ind+0.021
and+0.019
Neg logit contributions
 replied-0.053
lf-0.048
 responded-0.046
chief-0.045
man-0.045
Token take
Token ID1011
Feature activation+1.26e+00

Pos logit contributions
 replied+0.103
lf+0.089
 responded+0.086
chief+0.085
man+0.083
Neg logit contributions
ashes-0.099
ashing-0.097
elly-0.093
ind-0.089
umpy-0.083
Token it
Token ID340
Feature activation+8.98e-01

Pos logit contributions
ashes+0.075
ashing+0.073
elly+0.072
ind+0.069
away+0.065
Neg logit contributions
 replied-0.048
lf-0.041
 responded-0.037
man-0.036
chief-0.035
Token home
Token ID1363
Feature activation+2.55e+00

Pos logit contributions
ashing+0.035
ashes+0.034
elly+0.033
ind+0.031
umpy+0.028
Neg logit contributions
 replied-0.054
lf-0.047
 responded-0.045
chief-0.045
 wouldn-0.044
Token with
Token ID351
Feature activation+5.29e-01

Pos logit contributions
ashing+0.104
ashes+0.102
elly+0.099
ind+0.092
umpy+0.085
Neg logit contributions
 replied-0.147
lf-0.127
 responded-0.125
chief-0.123
 wouldn-0.119
Token it
Token ID340
Feature activation-8.64e-01

Pos logit contributions
ashing+0.001
ashes+0.001
elly+0.001
ind+0.001
away+0.001
Neg logit contributions
 replied-0.001
lf-0.000
chief-0.000
 responded-0.000
man-0.000
Token rolled
Token ID11686
Feature activation+3.59e+00

Pos logit contributions
lf+0.034
chief+0.032
 replied+0.032
on+0.031
 Steven+0.031
Neg logit contributions
ashing-0.043
ashes-0.041
ind-0.038
away-0.037
elly-0.037
Token right
Token ID826
Feature activation+2.37e+00

Pos logit contributions
ashing+0.134
ashes+0.133
elly+0.126
ind+0.117
umpy+0.106
Neg logit contributions
 replied-0.220
lf-0.195
 responded-0.187
chief-0.184
 rang-0.179
Token into
Token ID656
Feature activation+2.69e+00

Pos logit contributions
ashing+0.096
ashes+0.095
elly+0.091
ind+0.084
umpy+0.077
Neg logit contributions
 replied-0.139
lf-0.122
 responded-0.117
chief-0.115
 wouldn-0.112
Token the
Token ID262
Feature activation+6.18e-01

Pos logit contributions
ashes+0.139
ashing+0.136
elly+0.136
ind+0.130
grown+0.121
Neg logit contributions
 replied-0.123
lf-0.099
 responded-0.098
chief-0.097
 wouldn-0.096
Token middle
Token ID3504
Feature activation+3.52e+00

Pos logit contributions
ashing+0.031
ashes+0.031
elly+0.030
ind+0.029
umpy+0.027
Neg logit contributions
 replied-0.030
lf-0.025
 responded-0.024
chief-0.024
 wouldn-0.023
Token of
Token ID286
Feature activation+2.06e+00

Pos logit contributions
elly+0.108
ashes+0.105
ashing+0.104
and+0.089
umpy+0.088
Neg logit contributions
 replied-0.236
chief-0.211
lf-0.207
 responded-0.204
 wouldn-0.203
Token the
Token ID262
Feature activation+4.16e-01

Pos logit contributions
ashes+0.097
ashing+0.096
elly+0.095
ind+0.095
grown+0.089
Neg logit contributions
 replied-0.100
 wouldn-0.081
 responded-0.080
chief-0.079
lf-0.078
Token sandy
Token ID44039
Feature activation+8.74e-01

Pos logit contributions
ashing+0.021
ashes+0.021
elly+0.020
ind+0.019
umpy+0.018
Neg logit contributions
 replied-0.020
lf-0.017
 responded-0.016
chief-0.016
 wouldn-0.016
Token pile
Token ID14540
Feature activation+2.81e-01

Pos logit contributions
ashing+0.045
ashes+0.045
elly+0.043
ind+0.041
umpy+0.038
Neg logit contributions
 replied-0.041
lf-0.035
 responded-0.033
chief-0.032
man-0.031
Token.
Token ID13
Feature activation+3.57e-01

Pos logit contributions
ashing+0.010
ashes+0.010
elly+0.009
and+0.008
umpy+0.008
Neg logit contributions
 replied-0.017
lf-0.016
chief-0.015
 responded-0.015
 rang-0.015
Token boy
Token ID2933
Feature activation-4.20e-01

Pos logit contributions
 replied+0.089
chief+0.077
 responded+0.076
lf+0.074
 wouldn+0.071
Neg logit contributions
ashing-0.077
ashes-0.075
elly-0.074
ind-0.067
umpy-0.065
Token decided
Token ID3066
Feature activation+2.02e+00

Pos logit contributions
 replied+0.018
lf+0.017
chief+0.016
 responded+0.015
 Steven+0.015
Neg logit contributions
ashing-0.021
ashes-0.021
elly-0.020
ind-0.019
umpy-0.018
Token to
Token ID284
Feature activation-9.13e-01

Pos logit contributions
ashing+0.052
ashes+0.047
elly+0.046
ind+0.043
away+0.042
Neg logit contributions
 replied-0.137
lf-0.126
 responded-0.123
flake-0.120
chief-0.120
Token try
Token ID1949
Feature activation-1.60e+00

Pos logit contributions
 replied+0.044
lf+0.040
chief+0.039
 responded+0.038
master+0.037
Neg logit contributions
ashes-0.040
ashing-0.038
ind-0.037
elly-0.036
away-0.034
Token some
Token ID617
Feature activation+1.94e+00

Pos logit contributions
 replied+0.084
 responded+0.071
lf+0.071
master+0.066
 rang+0.066
Neg logit contributions
ashing-0.073
ashes-0.071
elly-0.069
ind-0.068
and-0.061
Token quiet
Token ID5897
Feature activation+2.96e+00

Pos logit contributions
ashing+0.101
ashes+0.099
elly+0.096
ind+0.091
umpy+0.086
Neg logit contributions
 replied-0.092
lf-0.077
 responded-0.074
chief-0.071
 wouldn-0.071
Token time
Token ID640
Feature activation+1.51e+00

Pos logit contributions
ashing+0.145
ashes+0.143
elly+0.139
ind+0.131
umpy+0.122
Neg logit contributions
 replied-0.147
lf-0.126
 responded-0.120
chief-0.117
 wouldn-0.115
Token.
Token ID13
Feature activation+9.76e-01

Pos logit contributions
ashing+0.065
ashes+0.064
elly+0.062
ind+0.057
umpy+0.052
Neg logit contributions
 replied-0.084
lf-0.072
 responded-0.071
chief-0.069
 wouldn-0.066
Token
Token ID198
Feature activation-4.72e-02

Pos logit contributions
ashing+0.042
ashes+0.041
elly+0.041
and+0.035
umpy+0.035
Neg logit contributions
 replied-0.053
lf-0.049
 responded-0.045
 wouldn-0.044
chief-0.044
Token
Token ID198
Feature activation-6.19e-04

Pos logit contributions
 replied+0.003
lf+0.002
chief+0.002
 responded+0.002
 wouldn+0.002
Neg logit contributions
ashing-0.002
ashes-0.002
elly-0.002
ind-0.002
and-0.002
TokenHe
Token ID1544
Feature activation-6.16e-02

Pos logit contributions
 replied+0.000
lf+0.000
 responded+0.000
chief+0.000
man+0.000
Neg logit contributions
ashing-0.000
ashes-0.000
elly-0.000
ind-0.000
umpy-0.000
Token sure
Token ID1654
Feature activation-3.52e+00

Pos logit contributions
 replied+0.053
lf+0.046
chief+0.044
 responded+0.044
 wouldn+0.042
Neg logit contributions
ashing-0.045
ashes-0.044
elly-0.043
ind-0.040
umpy-0.038
Token.
Token ID13
Feature activation+8.31e-01

Pos logit contributions
 replied+0.197
lf+0.171
 responded+0.164
chief+0.161
 wouldn+0.159
Neg logit contributions
ashing-0.153
ashes-0.151
elly-0.146
ind-0.136
umpy-0.125
Token They
Token ID1119
Feature activation+1.62e-01

Pos logit contributions
ashing+0.032
ashes+0.031
elly+0.030
ind+0.029
umpy+0.025
Neg logit contributions
 replied-0.051
lf-0.043
 responded-0.043
chief-0.042
man-0.041
Token think
Token ID892
Feature activation-4.80e+00

Pos logit contributions
ashes+0.008
ashing+0.008
elly+0.008
ind+0.007
umpy+0.007
Neg logit contributions
 replied-0.008
lf-0.007
chief-0.006
 responded-0.006
man-0.006
Token maybe
Token ID3863
Feature activation+3.69e-01

Pos logit contributions
 replied+0.204
lf+0.168
 responded+0.161
chief+0.158
man+0.153
Neg logit contributions
ashing-0.269
ashes-0.267
elly-0.261
ind-0.244
umpy-0.229
Token they
Token ID484
Feature activation+1.24e+00

Pos logit contributions
ashes+0.019
ashing+0.019
elly+0.018
ind+0.017
grown+0.017
Neg logit contributions
 replied-0.016
lf-0.014
chief-0.013
 responded-0.013
man-0.012
Token should
Token ID815
Feature activation-5.88e+00

Pos logit contributions
ashing+0.070
ashes+0.069
elly+0.068
ind+0.064
umpy+0.060
Neg logit contributions
 replied-0.052
lf-0.043
 responded-0.041
chief-0.040
 wouldn-0.039
Token go
Token ID467
Feature activation+4.43e+00

Pos logit contributions
 replied+0.241
lf+0.200
 responded+0.191
chief+0.188
 wouldn+0.180
Neg logit contributions
ashes-0.329
ashing-0.325
elly-0.310
ind-0.304
umpy-0.282
Token.
Token ID13
Feature activation+9.78e-01

Pos logit contributions
elly+0.228
ashing+0.221
ashes+0.214
ind+0.202
oky+0.197
Neg logit contributions
 replied-0.212
 said-0.178
man-0.170
chief-0.161
 wouldn-0.160
Token But
Token ID887
Feature activation-4.98e+00

Pos logit contributions
ashing+0.043
ashes+0.042
elly+0.040
ind+0.039
umpy+0.035
Neg logit contributions
 replied-0.055
 responded-0.045
lf-0.045
chief-0.044
 said-0.043
Token they
Token ID484
Feature activation+1.12e-01

Pos logit contributions
 replied+0.234
lf+0.208
chief+0.199
 responded+0.197
flake+0.189
Neg logit contributions
ashing-0.229
ashes-0.218
elly-0.210
ind-0.203
grown-0.194
Token were
Token ID547
Feature activation+1.43e+00

Pos logit contributions
 replied+0.039
 responded+0.028
 said+0.028
 wouldn+0.027
lf+0.027
Neg logit contributions
ashing-0.060
ashes-0.060
elly-0.059
ind-0.056
and-0.053
Token best
Token ID1266
Feature activation+5.06e+00

Pos logit contributions
ashes+0.077
elly+0.074
away+0.071
ashing+0.071
ind+0.066
Neg logit contributions
 replied-0.066
 said-0.051
lf-0.050
 nose-0.049
 responded-0.049
Token friends
Token ID2460
Feature activation+4.46e+00

Pos logit contributions
ashing+0.287
elly+0.282
ashes+0.279
ind+0.263
away+0.256
Neg logit contributions
 replied-0.214
 said-0.160
 responded-0.159
lf-0.155
 nose-0.154
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
ashes+0.222
ashing+0.215
elly+0.209
ind+0.195
umpy+0.183
Neg logit contributions
 replied-0.229
lf-0.187
 responded-0.181
 said-0.178
man-0.176
Token They
Token ID1119
Feature activation+1.14e+00

Pos logit contributions
ind+0.079
ashing+0.077
ashes+0.076
elly+0.072
away+0.071
Neg logit contributions
 replied-0.099
 responded-0.079
 nose-0.078
lf-0.077
 said-0.076
Token liked
Token ID8288
Feature activation+6.79e-01

Pos logit contributions
ashing+0.056
ashes+0.056
elly+0.055
ind+0.051
umpy+0.048
Neg logit contributions
 replied-0.057
lf-0.048
 responded-0.046
chief-0.045
 wouldn-0.045
Token to
Token ID284
Feature activation-1.39e+00

Pos logit contributions
ashing+0.030
ashes+0.029
elly+0.029
ind+0.026
umpy+0.026
Neg logit contributions
 replied-0.037
lf-0.032
 responded-0.031
chief-0.030
 wouldn-0.030
Token play
Token ID711
Feature activation+2.80e+00

Pos logit contributions
 replied+0.087
lf+0.077
 responded+0.075
chief+0.074
 wouldn+0.072
Neg logit contributions
ashing-0.049
ashes-0.049
elly-0.046
ind-0.042
umpy-0.038
Token with
Token ID351
Feature activation+1.18e+00

Pos logit contributions
elly+0.130
ashing+0.128
ashes+0.122
umpy+0.120
ind+0.118
Neg logit contributions
 replied-0.153
lf-0.138
man-0.121
 said-0.121
 nose-0.120
Token toy
Token ID13373
Feature activation+2.55e-01

Pos logit contributions
ashing+0.065
elly+0.061
ashes+0.061
umpy+0.059
ind+0.055
Neg logit contributions
 replied-0.053
lf-0.048
 nose-0.045
 said-0.043
 rang-0.042
Token soldiers
Token ID5795
Feature activation+2.30e+00

Pos logit contributions
ashing+0.015
elly+0.014
umpy+0.013
ashes+0.013
ind+0.012
Neg logit contributions
 replied-0.011
lf-0.009
 said-0.009
 wouldn-0.008
 nose-0.008
TokenAre
Token ID8491
Feature activation-2.25e+00

Pos logit contributions
ashing+0.158
ashes+0.156
elly+0.145
ind+0.138
umpy+0.129
Neg logit contributions
 replied-0.174
lf-0.148
 responded-0.144
chief-0.136
man-0.135
Token you
Token ID345
Feature activation+1.80e+00

Pos logit contributions
 replied+0.127
lf+0.111
 responded+0.109
man+0.108
master+0.104
Neg logit contributions
ashing-0.092
ashes-0.090
elly-0.087
ind-0.081
umpy-0.072
Token sure
Token ID1654
Feature activation-3.71e+00

Pos logit contributions
ashing+0.092
ashes+0.088
elly+0.083
ind+0.076
umpy+0.075
Neg logit contributions
 replied-0.086
lf-0.076
chief-0.071
 responded-0.070
man-0.069
Token you
Token ID345
Feature activation+1.71e+00

Pos logit contributions
 replied+0.183
lf+0.153
 responded+0.153
chief+0.151
man+0.146
Neg logit contributions
ashing-0.176
ashes-0.172
elly-0.172
ind-0.159
umpy-0.148
Token want
Token ID765
Feature activation+9.90e-01

Pos logit contributions
ashing+0.084
ashes+0.083
elly+0.079
ind+0.071
umpy+0.070
Neg logit contributions
 replied-0.084
lf-0.076
chief-0.072
 responded-0.070
man-0.069
Token to
Token ID284
Feature activation-2.30e+00

Pos logit contributions
ashes+0.034
ashing+0.034
elly+0.033
ind+0.029
and+0.026
Neg logit contributions
 replied-0.060
lf-0.054
chief-0.053
 responded-0.052
master-0.051
Token make
Token ID787
Feature activation-3.37e+00

Pos logit contributions
 replied+0.110
lf+0.098
chief+0.095
 responded+0.095
 wouldn+0.094
Neg logit contributions
ashes-0.105
ashing-0.102
elly-0.097
ind-0.092
umpy-0.087
Token spaghetti
Token ID48241
Feature activation-2.72e+00

Pos logit contributions
 replied+0.172
 responded+0.158
 rang+0.157
 wouldn+0.156
chief+0.150
Neg logit contributions
ashes-0.133
ind-0.131
elly-0.130
ashing-0.130
 outside-0.116
Token for
Token ID329
Feature activation+7.27e-01

Pos logit contributions
 replied+0.134
lf+0.115
chief+0.114
 responded+0.114
 wouldn+0.109
Neg logit contributions
ashing-0.128
ashes-0.128
elly-0.122
ind-0.112
umpy-0.105
Token dinner
Token ID8073
Feature activation-2.32e-01

Pos logit contributions
ashing+0.035
ashes+0.035
elly+0.034
ind+0.033
and+0.030
Neg logit contributions
 replied-0.034
 responded-0.029
chief-0.028
 wouldn-0.028
lf-0.027
Token?
Token ID30
Feature activation-1.06e+00

Pos logit contributions
 replied+0.010
lf+0.009
chief+0.009
 responded+0.008
 wouldn+0.008
Neg logit contributions
ashing-0.012
ashes-0.012
elly-0.012
ind-0.011
umpy-0.010
Token touch
Token ID3638
Feature activation-2.49e+00

Pos logit contributions
ashing+0.019
ashes+0.018
elly+0.018
ind+0.017
umpy+0.016
Neg logit contributions
 replied-0.015
lf-0.012
 responded-0.012
chief-0.012
 wouldn-0.011
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 replied+0.109
lf+0.094
 responded+0.087
chief+0.081
man+0.081
Neg logit contributions
ashes-0.135
ashing-0.134
elly-0.133
ind-0.128
umpy-0.117
Token c
Token ID269
Feature activation-1.19e+00

Pos logit contributions
 replied+0.081
lf+0.069
 responded+0.066
chief+0.064
 wouldn+0.063
Neg logit contributions
ashing-0.081
ashes-0.080
elly-0.079
ind-0.075
umpy-0.069
Tokenactus
Token ID34144
Feature activation-2.77e+00

Pos logit contributions
 replied+0.054
lf+0.045
chief+0.045
man+0.041
 wouldn+0.041
Neg logit contributions
ashes-0.061
ashing-0.061
elly-0.056
and-0.051
ind-0.051
Token again
Token ID757
Feature activation+3.69e+00

Pos logit contributions
 replied+0.145
lf+0.125
 responded+0.120
chief+0.118
 wouldn+0.114
Neg logit contributions
ashing-0.129
ashes-0.127
elly-0.124
ind-0.115
umpy-0.107
Token.
Token ID13
Feature activation+2.10e-01

Pos logit contributions
ashing+0.157
ashes+0.155
elly+0.150
ind+0.139
umpy+0.128
Neg logit contributions
 replied-0.208
lf-0.181
 responded-0.174
chief-0.171
 wouldn-0.168
Token From
Token ID3574
Feature activation-3.17e-01

Pos logit contributions
ashing+0.010
ashes+0.010
elly+0.010
ind+0.009
umpy+0.008
Neg logit contributions
 replied-0.011
lf-0.009
 responded-0.009
chief-0.009
 wouldn-0.009
Token that
Token ID326
Feature activation-2.52e+00

Pos logit contributions
 replied+0.014
lf+0.013
 responded+0.012
chief+0.012
 wouldn+0.012
Neg logit contributions
ind-0.015
ashing-0.015
elly-0.015
ashes-0.014
away-0.013
Token day
Token ID1110
Feature activation-1.36e+00

Pos logit contributions
 replied+0.145
chief+0.135
lf+0.133
 responded+0.132
master+0.131
Neg logit contributions
ashing-0.092
elly-0.085
ashes-0.078
ind-0.078
umpy-0.074
Token on
Token ID319
Feature activation-2.76e+00

Pos logit contributions
 replied+0.067
lf+0.057
 responded+0.054
chief+0.053
 wouldn+0.052
Neg logit contributions
ashes-0.068
ashing-0.068
elly-0.066
ind-0.062
umpy-0.057
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 replied+0.141
lf+0.121
 responded+0.119
chief+0.115
master+0.112
Neg logit contributions
ashing-0.125
elly-0.122
ashes-0.122
ind-0.113
umpy-0.105
Token nods
Token ID34833
Feature activation-1.58e+00

Pos logit contributions
 replied+0.049
lf+0.048
man+0.044
 responded+0.041
flake+0.041
Neg logit contributions
ashes-0.057
ashing-0.053
elly-0.051
ind-0.047
oky-0.044
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
 replied+0.089
lf+0.077
 responded+0.074
chief+0.073
 wouldn+0.072
Neg logit contributions
ashing-0.068
ashes-0.067
elly-0.065
ind-0.060
umpy-0.055
Token He
Token ID679
Feature activation-1.25e+00

Pos logit contributions
ashing+0.019
ashes+0.019
elly+0.019
ind+0.018
umpy+0.016
Neg logit contributions
 replied-0.018
lf-0.015
 responded-0.014
chief-0.014
 wouldn-0.013
Token likes
Token ID7832
Feature activation-8.50e-01

Pos logit contributions
 replied+0.055
lf+0.049
 responded+0.044
chief+0.044
man+0.044
Neg logit contributions
ashes-0.067
ashing-0.066
elly-0.064
ind-0.060
umpy-0.056
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
 replied+0.038
lf+0.032
 responded+0.031
chief+0.030
 wouldn+0.029
Neg logit contributions
ashes-0.046
ashing-0.045
elly-0.044
ind-0.041
and-0.040
Token smell
Token ID8508
Feature activation-2.70e+00

Pos logit contributions
 replied+0.093
lf+0.089
 Steven+0.085
chief+0.082
 wouldn+0.081
Neg logit contributions
ashes-0.099
ashing-0.090
elly-0.086
ind-0.077
 outdoors-0.075
Token too
Token ID1165
Feature activation+2.75e+00

Pos logit contributions
 replied+0.137
lf+0.115
 responded+0.110
 said+0.108
 wouldn+0.107
Neg logit contributions
ashing-0.130
ashes-0.126
elly-0.121
ind-0.120
umpy-0.107
Token.
Token ID13
Feature activation+6.46e-01

Pos logit contributions
ashing+0.145
ashes+0.142
elly+0.134
ind+0.132
umpy+0.118
Neg logit contributions
 replied-0.129
lf-0.106
 said-0.103
 responded-0.100
 nose-0.100
Token He
Token ID679
Feature activation-1.10e+00

Pos logit contributions
ashing+0.031
ashes+0.031
elly+0.030
ind+0.028
umpy+0.026
Neg logit contributions
 replied-0.033
lf-0.028
 responded-0.027
chief-0.026
 wouldn-0.026
Token smiles
Token ID21845
Feature activation-8.75e-01

Pos logit contributions
 replied+0.047
lf+0.041
 responded+0.038
chief+0.038
man+0.037
Neg logit contributions
ashes-0.060
ashing-0.060
elly-0.058
ind-0.054
umpy-0.051
Token and
Token ID290
Feature activation-8.55e-01

Pos logit contributions
 replied+0.041
 responded+0.032
lf+0.032
 said+0.032
man+0.031
Neg logit contributions
ashing-0.046
ashes-0.046
elly-0.044
ind-0.043
umpy-0.039
Token baby
Token ID5156
Feature activation-9.13e-01

Pos logit contributions
 replied+0.154
lf+0.127
chief+0.126
 responded+0.122
 wouldn+0.120
Neg logit contributions
ind-0.227
ashing-0.225
elly-0.225
ashes-0.219
umpy-0.202
Token asked
Token ID1965
Feature activation-7.95e-01

Pos logit contributions
 replied+0.036
lf+0.032
chief+0.031
 Steven+0.029
 responded+0.029
Neg logit contributions
ind-0.047
ashing-0.046
elly-0.046
ashes-0.042
umpy-0.041
Token the
Token ID262
Feature activation-2.15e+00

Pos logit contributions
 replied+0.038
lf+0.034
chief+0.032
 responded+0.032
 rang+0.031
Neg logit contributions
ashing-0.036
elly-0.036
ashes-0.035
ind-0.035
away-0.032
Token m
Token ID285
Feature activation-2.21e+00

Pos logit contributions
 replied+0.085
lf+0.074
chief+0.069
 wouldn+0.066
 responded+0.066
Neg logit contributions
elly-0.120
ashing-0.119
ind-0.118
ashes-0.117
umpy-0.107
Tokenama
Token ID1689
Feature activation-9.19e-01

Pos logit contributions
 replied+0.070
lf+0.054
 responded+0.050
chief+0.047
 wouldn+0.047
Neg logit contributions
ashing-0.147
elly-0.146
ashes-0.144
ind-0.140
umpy-0.132
Token,
Token ID11
Feature activation+3.83e-01

Pos logit contributions
 replied+0.048
lf+0.045
chief+0.042
 responded+0.041
 rang+0.040
Neg logit contributions
ashing-0.040
elly-0.039
ashes-0.039
ind-0.035
umpy-0.032
Token "
Token ID366
Feature activation+3.38e+00

Pos logit contributions
ashing+0.023
elly+0.021
ashes+0.021
ind+0.020
grown+0.019
Neg logit contributions
 replied-0.014
lf-0.013
chief-0.011
 responded-0.011
 rang-0.011
TokenWhat
Token ID2061
Feature activation-3.45e+00

Pos logit contributions
elly+0.086
ind+0.085
ashing+0.076
ashes+0.069
grown+0.063
Neg logit contributions
 replied-0.239
lf-0.218
chief-0.209
 responded-0.207
 wouldn-0.205
Token has
Token ID468
Feature activation-3.72e-01

Pos logit contributions
 replied+0.158
chief+0.147
lf+0.141
 responded+0.130
man+0.125
Neg logit contributions
elly-0.173
ashes-0.164
ashing-0.163
umpy-0.148
and-0.145
Token changed
Token ID3421
Feature activation+1.69e+00

Pos logit contributions
 replied+0.017
lf+0.015
chief+0.014
 responded+0.014
man+0.014
Neg logit contributions
ashing-0.019
ashes-0.019
elly-0.019
ind-0.017
umpy-0.016
Token?"
Token ID1701
Feature activation-1.04e+00

Pos logit contributions
elly+0.078
ashing+0.078
ashes+0.075
ind+0.072
umpy+0.067
Neg logit contributions
 replied-0.083
lf-0.075
chief-0.071
 responded-0.070
master-0.067
Token in
Token ID287
Feature activation+1.81e+00

Pos logit contributions
ashes+0.280
ashing+0.265
elly+0.253
 unnecessary+0.253
arl+0.243
Neg logit contributions
 replied-0.115
lf-0.106
 it-0.095
on-0.087
 -0.085
Token her
Token ID607
Feature activation+4.35e+00

Pos logit contributions
ashing+0.147
ashes+0.143
elly+0.137
umpy+0.131
ind+0.126
Neg logit contributions
 replied-0.039
lf-0.035
on-0.027
 it-0.027
 -0.025
Token backyard
Token ID24296
Feature activation+2.25e+00

Pos logit contributions
ashes+0.289
ashing+0.277
elly+0.265
umpy+0.256
and+0.252
Neg logit contributions
 replied-0.139
 nose-0.118
lf-0.111
 mother-0.101
 said-0.100
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
ashes+0.115
ashing+0.112
elly+0.110
ind+0.104
umpy+0.101
Neg logit contributions
 replied-0.109
lf-0.092
 responded-0.084
 nose-0.084
man-0.083
Token One
Token ID1881
Feature activation+3.80e+00

Pos logit contributions
ashing+0.075
ashes+0.073
elly+0.072
ind+0.066
umpy+0.062
Neg logit contributions
 replied-0.082
lf-0.072
 responded-0.069
chief-0.069
 wouldn-0.067
Token day
Token ID1110
Feature activation-3.66e-01

Pos logit contributions
ashes+0.215
ashing+0.211
elly+0.204
ind+0.197
umpy+0.183
Neg logit contributions
 replied-0.165
lf-0.133
 responded-0.126
man-0.119
 wouldn-0.119
Token while
Token ID981
Feature activation+4.52e-01

Pos logit contributions
 replied+0.017
lf+0.014
 responded+0.014
chief+0.013
 wouldn+0.013
Neg logit contributions
ashing-0.019
ashes-0.019
elly-0.019
ind-0.018
umpy-0.017
Token playing
Token ID2712
Feature activation+3.02e+00

Pos logit contributions
ashes+0.019
ashing+0.019
elly+0.018
ind+0.017
umpy+0.016
Neg logit contributions
 replied-0.026
lf-0.023
 responded-0.021
chief-0.021
 wouldn-0.021
Token,
Token ID11
Feature activation+1.99e+00

Pos logit contributions
ind+0.157
elly+0.155
ashing+0.154
ashes+0.150
umpy+0.142
Neg logit contributions
 replied-0.153
lf-0.134
 nose-0.115
on-0.114
 said-0.114
Token Sarah
Token ID10490
Feature activation+4.18e-02

Pos logit contributions
ashing+0.099
elly+0.094
ashes+0.094
grown+0.086
ind+0.085
Neg logit contributions
 replied-0.088
 responded-0.074
chief-0.074
man-0.073
master-0.070
Token found
Token ID1043
Feature activation+4.99e-01

Pos logit contributions
ashing+0.002
ashes+0.002
elly+0.002
ind+0.002
umpy+0.002
Neg logit contributions
 replied-0.002
lf-0.001
 responded-0.001
chief-0.001
man-0.001
Token forgive
Token ID20927
Feature activation-2.05e+00

Pos logit contributions
 replied+0.040
lf+0.037
 responded+0.035
chief+0.034
master+0.033
Neg logit contributions
ashes-0.039
ashing-0.038
elly-0.036
ind-0.034
umpy-0.033
Token me
Token ID502
Feature activation+4.13e+00

Pos logit contributions
 replied+0.108
lf+0.092
 responded+0.091
chief+0.089
 wouldn+0.089
Neg logit contributions
ashes-0.091
ashing-0.090
elly-0.089
ind-0.086
umpy-0.075
Token."
Token ID526
Feature activation-5.35e-01

Pos logit contributions
ashing+0.196
ashes+0.194
elly+0.190
ind+0.177
oky+0.161
Neg logit contributions
 replied-0.213
lf-0.178
 responded-0.171
chief-0.167
 wouldn-0.167
Token
Token ID198
Feature activation-8.50e-01

Pos logit contributions
 replied+0.025
lf+0.020
 responded+0.019
chief+0.019
 wouldn+0.018
Neg logit contributions
ashing-0.029
ashes-0.029
elly-0.028
ind-0.027
umpy-0.025
Token
Token ID198
Feature activation-8.32e-01

Pos logit contributions
 replied+0.038
lf+0.031
 responded+0.029
chief+0.028
man+0.028
Neg logit contributions
ashing-0.048
ashes-0.047
elly-0.045
ind-0.043
umpy-0.040
TokenThe
Token ID464
Feature activation-4.40e+00

Pos logit contributions
 replied+0.038
lf+0.033
man+0.030
 said+0.030
 responded+0.030
Neg logit contributions
ashing-0.048
ashes-0.046
elly-0.043
ind-0.040
oky-0.037
Token people
Token ID661
Feature activation+8.97e-01

Pos logit contributions
 replied+0.200
lf+0.168
 responded+0.159
chief+0.155
man+0.151
Neg logit contributions
ashing-0.238
ashes-0.235
elly-0.229
ind-0.214
umpy-0.202
Token looked
Token ID3114
Feature activation-1.93e+00

Pos logit contributions
ashing+0.053
ashes+0.051
elly+0.050
ind+0.046
umpy+0.045
Neg logit contributions
 replied-0.033
lf-0.029
chief-0.029
 Steven-0.026
man-0.025
Token at
Token ID379
Feature activation-6.36e-01

Pos logit contributions
 replied+0.124
 said+0.103
 responded+0.100
 wouldn+0.096
 explained+0.095
Neg logit contributions
elly-0.072
ashing-0.070
ind-0.069
ashes-0.069
umpy-0.061
Token Ben
Token ID3932
Feature activation-8.49e-02

Pos logit contributions
 replied+0.028
lf+0.023
 responded+0.021
chief+0.021
 wouldn+0.020
Neg logit contributions
ashing-0.036
ashes-0.035
elly-0.034
ind-0.032
umpy-0.031
Token.
Token ID13
Feature activation+5.60e-01

Pos logit contributions
 replied+0.005
lf+0.004
 responded+0.004
chief+0.004
 rang+0.004
Neg logit contributions
ashing-0.004
ashes-0.004
elly-0.003
ind-0.003
umpy-0.003
Token with
Token ID351
Feature activation+1.14e+00

Pos logit contributions
elly+0.184
ashing+0.181
ashes+0.179
ind+0.173
oky+0.163
Neg logit contributions
 replied-0.194
 said-0.151
 responded-0.150
lf-0.149
man-0.146
Token him
Token ID683
Feature activation+2.56e+00

Pos logit contributions
ashing+0.053
ashes+0.052
elly+0.050
ind+0.047
umpy+0.044
Neg logit contributions
 replied-0.060
lf-0.052
 responded-0.049
chief-0.049
 wouldn-0.047
Token.
Token ID13
Feature activation+7.56e-01

Pos logit contributions
ashing+0.105
ashes+0.103
elly+0.099
ind+0.091
umpy+0.084
Neg logit contributions
 replied-0.147
lf-0.131
 responded-0.124
chief-0.123
man-0.118
Token His
Token ID2399
Feature activation-1.35e+00

Pos logit contributions
ashes+0.034
ashing+0.034
elly+0.031
umpy+0.027
ind+0.027
Neg logit contributions
 replied-0.038
lf-0.037
 wouldn-0.033
chief-0.033
 responded-0.032
Token friends
Token ID2460
Feature activation+9.38e-01

Pos logit contributions
 replied+0.072
lf+0.066
chief+0.061
 responded+0.061
 wouldn+0.059
Neg logit contributions
ashes-0.059
ashing-0.059
elly-0.054
ind-0.053
umpy-0.047
Token said
Token ID531
Feature activation-5.08e+00

Pos logit contributions
ashing+0.054
ashes+0.049
elly+0.046
ind+0.044
umpy+0.043
Neg logit contributions
chief-0.033
lf-0.033
 replied-0.032
man-0.030
master-0.030
Token yes
Token ID3763
Feature activation-1.21e+00

Pos logit contributions
 replied+0.181
lf+0.162
chief+0.146
man+0.142
 responded+0.139
Neg logit contributions
ashing-0.297
ashes-0.286
elly-0.280
ind-0.258
grown-0.255
Token and
Token ID290
Feature activation-2.29e-01

Pos logit contributions
 replied+0.066
lf+0.062
chief+0.059
 responded+0.058
 rang+0.056
Neg logit contributions
ashing-0.049
ashes-0.046
elly-0.045
ind-0.042
umpy-0.040
Token started
Token ID2067
Feature activation+3.50e+00

Pos logit contributions
lf+0.010
 replied+0.010
chief+0.010
pron+0.009
master+0.009
Neg logit contributions
ashing-0.011
ashes-0.010
elly-0.009
grown-0.008
and-0.008
Token getting
Token ID1972
Feature activation+3.91e+00

Pos logit contributions
elly+0.162
ashing+0.161
ashes+0.160
ind+0.149
umpy+0.142
Neg logit contributions
 replied-0.182
lf-0.151
 responded-0.149
chief-0.146
 wouldn-0.144
Token ready
Token ID3492
Feature activation+7.40e-01

Pos logit contributions
elly+0.227
ashing+0.209
ashes+0.208
ind+0.207
umpy+0.203
Neg logit contributions
 replied-0.159
 said-0.136
 responded-0.124
 wouldn-0.124
 gl-0.119
Token keeper
Token ID28356
Feature activation-1.71e-01

Pos logit contributions
chief+0.034
flake+0.030
lf+0.029
 replied+0.028
 Steven+0.028
Neg logit contributions
ashing-0.033
elly-0.033
ashes-0.031
umpy-0.030
able-0.028
Token said
Token ID531
Feature activation-5.45e+00

Pos logit contributions
chief+0.004
lf+0.004
 Steven+0.003
 nose+0.003
flake+0.003
Neg logit contributions
ashing-0.011
elly-0.010
ashes-0.010
able-0.010
sm-0.010
Token it
Token ID340
Feature activation-2.02e+00

Pos logit contributions
 replied+0.210
lf+0.186
 responded+0.170
chief+0.170
man+0.165
Neg logit contributions
ashing-0.283
elly-0.275
ashes-0.270
grown-0.254
ind-0.249
Token was
Token ID373
Feature activation-4.07e-01

Pos logit contributions
 replied+0.069
lf+0.067
chief+0.065
 responded+0.058
andra+0.058
Neg logit contributions
elly-0.101
ashing-0.099
ashes-0.093
ind-0.093
grown-0.092
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
 replied+0.021
chief+0.019
lf+0.019
 responded+0.018
 chuckled+0.017
Neg logit contributions
ashing-0.016
elly-0.015
ind-0.015
ashes-0.014
grown-0.014
Token kit
Token ID6220
Feature activation-4.82e+00

Pos logit contributions
 replied+0.060
lf+0.051
chief+0.051
 responded+0.050
 wouldn+0.047
Neg logit contributions
ashing-0.060
ashes-0.059
elly-0.059
ind-0.055
away-0.051
Token to
Token ID284
Feature activation-1.44e+00

Pos logit contributions
 replied+0.288
 responded+0.249
chief+0.248
lf+0.248
 rang+0.233
Neg logit contributions
ashing-0.184
ashes-0.174
elly-0.173
ind-0.152
umpy-0.148
Token make
Token ID787
Feature activation-3.21e+00

Pos logit contributions
 replied+0.070
lf+0.060
chief+0.060
 responded+0.059
man+0.057
Neg logit contributions
ashes-0.067
ashing-0.065
elly-0.063
ind-0.061
away-0.057
Token the
Token ID262
Feature activation-1.52e+00

Pos logit contributions
 replied+0.154
lf+0.134
 responded+0.131
chief+0.128
 rang+0.124
Neg logit contributions
ashes-0.153
ashing-0.148
elly-0.145
ind-0.141
and-0.134
Token cake
Token ID12187
Feature activation-2.04e+00

Pos logit contributions
 replied+0.069
chief+0.058
 responded+0.057
lf+0.056
 chuckled+0.054
Neg logit contributions
ashes-0.080
ashing-0.077
elly-0.075
ind-0.072
oky-0.067
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
 replied+0.116
 responded+0.102
chief+0.101
lf+0.099
 chuckled+0.099
Neg logit contributions
ashes-0.080
ashing-0.076
elly-0.071
ind-0.066
away-0.064
Token long
Token ID890
Feature activation+1.99e+00

Pos logit contributions
 replied+0.014
chief+0.012
 responded+0.012
lf+0.011
 wouldn+0.011
Neg logit contributions
ashing-0.012
elly-0.011
ashes-0.011
ind-0.011
umpy-0.010
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
ashing+0.101
elly+0.100
ashes+0.099
ind+0.095
away+0.088
Neg logit contributions
 replied-0.087
 wouldn-0.072
 responded-0.071
chief-0.069
 chuckled-0.067
Token tall
Token ID7331
Feature activation+2.64e-01

Pos logit contributions
ashing+0.113
ashes+0.111
elly+0.110
ind+0.103
umpy+0.094
Neg logit contributions
 replied-0.146
lf-0.129
chief-0.125
 responded-0.123
 wouldn-0.120
Token grass
Token ID8701
Feature activation-4.59e-01

Pos logit contributions
ashes+0.015
ashing+0.015
elly+0.015
ind+0.014
umpy+0.013
Neg logit contributions
 replied-0.010
 responded-0.008
 wouldn-0.008
chief-0.008
lf-0.008
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
 replied+0.028
lf+0.024
 Steven+0.024
 responded+0.024
chief+0.023
Neg logit contributions
ashing-0.017
ashes-0.016
elly-0.016
ind-0.015
umpy-0.013
Token thought
Token ID1807
Feature activation-6.41e+00

Pos logit contributions
 replied+0.051
lf+0.045
chief+0.044
man+0.043
 responded+0.042
Neg logit contributions
ashes-0.049
ashing-0.048
elly-0.046
ind-0.043
grown-0.041
Token she
Token ID673
Feature activation+1.54e+00

Pos logit contributions
 replied+0.258
lf+0.211
 responded+0.204
chief+0.199
man+0.192
Neg logit contributions
ashing-0.363
ashes-0.356
elly-0.347
ind-0.323
and-0.317
Token might
Token ID1244
Feature activation-3.40e+00

Pos logit contributions
ashing+0.071
ind+0.068
elly+0.066
grown+0.065
ashes+0.064
Neg logit contributions
 replied-0.058
lf-0.057
chief-0.056
pron-0.050
man-0.049
Token find
Token ID1064
Feature activation+5.04e-01

Pos logit contributions
 replied+0.172
lf+0.151
 responded+0.141
chief+0.141
man+0.137
Neg logit contributions
ashes-0.161
ashing-0.159
elly-0.149
ind-0.139
umpy-0.136
Token something
Token ID1223
Feature activation-1.15e+00

Pos logit contributions
ashing+0.033
ashes+0.033
elly+0.032
ind+0.031
umpy+0.029
Neg logit contributions
 replied-0.017
lf-0.013
 responded-0.012
chief-0.012
 wouldn-0.011
Token special
Token ID2041
Feature activation+1.95e+00

Pos logit contributions
 replied+0.058
lf+0.048
 responded+0.047
 wouldn+0.046
 said+0.046
Neg logit contributions
ashing-0.056
ashes-0.056
elly-0.054
ind-0.051
umpy-0.048
Token about
Token ID546
Feature activation-3.07e-01

Pos logit contributions
 replied+0.029
lf+0.025
 responded+0.025
chief+0.025
 wouldn+0.023
Neg logit contributions
ashing-0.023
ashes-0.023
elly-0.022
ind-0.020
umpy-0.019
Token who
Token ID508
Feature activation-1.97e+00

Pos logit contributions
 replied+0.015
lf+0.013
 responded+0.012
chief+0.012
man+0.012
Neg logit contributions
ashing-0.015
ashes-0.015
elly-0.015
ind-0.014
and-0.013
Token was
Token ID373
Feature activation-3.40e-01

Pos logit contributions
 replied+0.075
chief+0.065
lf+0.063
 responded+0.059
man+0.057
Neg logit contributions
ashing-0.113
ashes-0.109
elly-0.108
ind-0.103
grown-0.096
Token better
Token ID1365
Feature activation+2.18e+00

Pos logit contributions
 replied+0.017
lf+0.015
 responded+0.014
chief+0.014
man+0.013
Neg logit contributions
ashing-0.017
ashes-0.016
elly-0.016
ind-0.015
umpy-0.014
Token.
Token ID13
Feature activation+5.71e-01

Pos logit contributions
ashes+0.088
ashing+0.088
elly+0.085
ind+0.079
oky+0.073
Neg logit contributions
 replied-0.122
lf-0.105
 wouldn-0.103
 said-0.103
 responded-0.100
Token But
Token ID887
Feature activation-4.66e+00

Pos logit contributions
ashing+0.025
ashes+0.024
elly+0.024
ind+0.023
umpy+0.021
Neg logit contributions
 replied-0.032
lf-0.027
 responded-0.026
chief-0.026
 wouldn-0.025
Token then
Token ID788
Feature activation-5.36e-01

Pos logit contributions
 replied+0.191
lf+0.167
chief+0.161
 responded+0.156
man+0.154
Neg logit contributions
ashing-0.253
ashes-0.240
elly-0.233
ind-0.225
grown-0.212
Token,
Token ID11
Feature activation+4.78e-02

Pos logit contributions
 replied+0.025
lf+0.021
 responded+0.020
chief+0.020
 wouldn+0.019
Neg logit contributions
ashing-0.027
ashes-0.027
elly-0.026
ind-0.025
umpy-0.023
Token they
Token ID484
Feature activation+1.45e+00

Pos logit contributions
ashing+0.002
ashes+0.002
elly+0.002
grown+0.002
ind+0.002
Neg logit contributions
 replied-0.002
lf-0.002
chief-0.002
 responded-0.002
man-0.002
Token saw
Token ID2497
Feature activation-1.46e+00

Pos logit contributions
ashing+0.069
ashes+0.064
elly+0.059
ind+0.059
grown+0.055
Neg logit contributions
 replied-0.067
chief-0.064
lf-0.059
man-0.056
 Fairy-0.056
Token a
Token ID257
Feature activation-2.08e+00

Pos logit contributions
 replied+0.073
lf+0.068
chief+0.065
 responded+0.063
man+0.060
Neg logit contributions
ashes-0.064
ashing-0.064
elly-0.063
ind-0.058
grown-0.056