TOP ACTIVATIONS
MAX = 135.682

window . It said " Army ". " Wow
and started to chew . Crunch , crunch , he went
But they kept trying . Tap , tap , tap ,
and said " Tap ! Tap ! I found you !"
the ground that said " short cut ". She thought it
door made a sound . Knock , knock , knock .
went to the park and performed for a long time .
as it slowly began to dawn on him that he was
Ben took the stick and tapped the ac orn with it
spread far and wide . Simple acts of kindness can often
a way back . He died by the wide lake .
to it and said ' sun '. Then Mike ran off
King was cruel and had enslaved many of the people of
, you can reach your goal . <|endoftext|> Once upon a
wear the hat while he performed his tricks . He put
to help . The spirit served the rabbit some y ummy
got there , he was welcomed with a smile from his
show that the careful bean performed . And they all lived
and cars . Sometimes they knocked them down and started again
the tied rope around it served as a reminder to her

MIN ACTIVATIONS
MIN = -268.152

you go , my d arl ings . Enjoy your lunch
's wonderful , my d arl ings !" Mom my says
reward . It was ch ocol ates and sweets !
Everyday he would take the av oc ados and scatter them
When Daisy put the magn ifying glass down , the cater
. She loved to eat av oc ados . One day
mum and her three - year - old daughter went to
. She loved to eat av oc ados because they were
't know the rh in oc eros was wild . Can
Ch ir py and Wh isk ers knew that poison was
and play with your or nam ents . Maybe you can
them and never threw a tant rum again . <|endoftext|> Once
. Did you know that av oc ados are very healthy
he would take the av oc ados and scatter them around
oll ip ops , ch ocol ates , g ummies ,
It turned out that the av oc ados were bad and
give to the rh in oc eros . Lily walked to
still couldn 't find any av oc ados . Finally ,
am taking the rh in oc eros to a new home
". The 3 - year - old thought hard and

INTERVAL 34.724 - 135.682
CONTAINS 27.782%

" Let 's play something else !" Tom said .
and make things . One day , Lily has an idea
swim in the pool . She thanked the life guard for
with him and his magic wand . He said , "
<|endoftext|> Once upon a time , there was a big blue

INTERVAL -66.235 - 34.724
CONTAINS 61.474%

! This apple is bad . It has a bad taste
From then on , he remembered an important lesson . No
his big bat and hits their ball . He hits it
corn field and the dog had a wonderful time running around
day adm iring the flower . To Buzz y

INTERVAL -167.193 - -66.235
CONTAINS 10.695%

if she promised to always be safe and listen to her
't stop spinning it in her room . From that day
yard . The dog was very sad and hungry . True
to leave the park with their ruined dolls . They did
look at the tree . After a few

INTERVAL -268.152 - -167.193
CONTAINS 0.048%

I should have the books he lf , because it is
! Alex had used past - date cheese ! Sam yelled
's mom was even ang rier and Lily felt even sad
the farmer scattered the av oc ados , he noticed one
driver would greet the po pp ies and thank them for
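
Note on the interval boundaries: the four INTERVAL ranges above are consistent with splitting the span from MIN to MAX into four equal-width bins. The source does not state how they were derived, so the Python sketch below is an assumption; `interval_edges` is a hypothetical helper, not part of any dashboard tooling. It rebuilds the edges from the two reported extremes and checks that the CONTAINS shares sum to roughly 100%.

```python
def interval_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    # Append hi explicitly so the last edge is exact rather than accumulated.
    return [lo + i * width for i in range(n_bins)] + [hi]


MIN_ACT = -268.152  # from "MIN = -268.152"
MAX_ACT = 135.682   # from "MAX = 135.682"

edges = interval_edges(MIN_ACT, MAX_ACT, 4)

# Pair consecutive edges and list the highest interval first, as the dump does.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]

# Shares copied from the CONTAINS lines, highest interval first.
shares = [27.782, 61.474, 10.695, 0.048]
total_share = sum(shares)

for (low, high), share in zip(intervals, shares):
    print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {share:.3f}%")
```

The recomputed edges agree with the listed ones to within rounding of the last digit, which suggests the displayed MIN and MAX are themselves rounded.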
Token window
Token ID4324
Feature activation+5.17e+01

Pos logit contributions
 and+0.167
,+0.153
owed+0.146
 said+0.141
 answered+0.138
Neg logit contributions
nam-0.144
 ey-0.122
Mic-0.119
igsaw-0.119
 microsc-0.115
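
Note on the record layout: each per-token record in this listing follows the same pattern as the one above. Labels are fused directly onto their values ("Token ID4324", "Feature activation+5.17e+01"), and each contribution line is a token string followed by a signed number. The Python sketch below shows one way to read such a record back into structured form. It assumes that whatever follows the literal label is the value, including any leading space in a token; `TokenRecord` and `parse_record` are hypothetical names introduced only for illustration.

```python
import re
from dataclasses import dataclass, field

# Token text (possibly empty, possibly containing '+' or '-') then a signed decimal.
CONTRIB = re.compile(r"(.*)([+-]\d+\.\d+)")


@dataclass
class TokenRecord:
    token: str
    token_id: int
    activation: float
    pos: list[tuple[str, float]] = field(default_factory=list)
    neg: list[tuple[str, float]] = field(default_factory=list)


def parse_record(lines: list[str]) -> TokenRecord:
    """Parse one record: three header lines, then Pos/Neg contribution lists."""
    token = lines[0][len("Token"):]            # keeps a leading space, if any
    token_id = int(lines[1][len("Token ID"):])
    activation = float(lines[2][len("Feature activation"):])
    record = TokenRecord(token, token_id, activation)
    bucket = None
    for line in lines[3:]:
        if line == "Pos logit contributions":
            bucket = record.pos
        elif line == "Neg logit contributions":
            bucket = record.neg
        elif line.strip() and bucket is not None:
            match = CONTRIB.fullmatch(line)
            if match:
                bucket.append((match.group(1), float(match.group(2))))
    return record


sample = [
    "Token window",
    "Token ID4324",
    "Feature activation+5.17e+01",
    "",
    "Pos logit contributions",
    " and+0.167",
    ",+0.153",
    "Neg logit contributions",
    "nam-0.144",
    " ey-0.122",
]
rec = parse_record(sample)
```

The greedy match keeps punctuation tokens intact, so a line such as " -+0.158" is read as the token " -" with value +0.158, and a line holding only a number is read as an empty (whitespace-only) token.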
Token.
Token ID13
Feature activation+7.36e+01

Pos logit contributions
 replied+0.055
 and+0.051
 squeezed+0.050
 explained+0.050
uddled+0.050
Neg logit contributions
 sill-0.192
 ey-0.175
reating-0.169
amental-0.168
igsaw-0.166
Token It
Token ID632
Feature activation+6.33e+01

Pos logit contributions
 and+0.216
,+0.193
.+0.168
!+0.159
 -+0.158
Neg logit contributions
nam-0.241
igsaw-0.240
 ey-0.225
Mic-0.224
 microsc-0.211
Token said
Token ID531
Feature activation+8.29e+01

Pos logit contributions
;+0.144
 and+0.140
,+0.136
 -+0.132
 –+0.128
Neg logit contributions
nam-0.191
 redd-0.169
Looks-0.167
 Amen-0.163
 microsc-0.163
Token "
Token ID366
Feature activation+7.15e+01

Pos logit contributions
 argued+0.162
hed+0.153
 and+0.142
 approached+0.140
 watched+0.140
Neg logit contributions
nam-0.259
igsaw-0.252
 Amen-0.251
 Rib-0.247
 microsc-0.241
TokenArmy
Token ID45272
Feature activation+1.36e+02

Pos logit contributions
,+0.330
 and+0.323
:+0.287
 -+0.278
!+0.277
Neg logit contributions
Mic-0.121
nam-0.116
Ta-0.108
igsaw-0.102
Looks-0.095
Token".
Token ID1911
Feature activation+9.06e+01

Pos logit contributions
 shook+0.268
 and+0.264
 touched+0.250
 felt+0.247
 explained+0.246
Neg logit contributions
igsaw-0.421
 Amen-0.413
arl-0.406
ifying-0.396
reating-0.382
Token
Token ID198
Feature activation+4.81e+01

Pos logit contributions
 and+0.257
,+0.233
.+0.200
!+0.196
 -+0.196
Neg logit contributions
nam-0.286
igsaw-0.284
 ey-0.266
Mic-0.265
 microsc-0.248
Token
Token ID198
Feature activation+4.88e+01

Pos logit contributions
 and+0.118
,+0.099
.+0.088
 said+0.074
!+0.071
Neg logit contributions
nam-0.203
igsaw-0.200
 ey-0.191
Mic-0.185
 redd-0.182
Token"
Token ID1
Feature activation+7.91e+01

Pos logit contributions
 and+0.149
,+0.131
.+0.117
 said+0.105
!+0.103
Neg logit contributions
nam-0.168
igsaw-0.167
 ey-0.160
Mic-0.153
 Amen-0.149
TokenWow
Token ID22017
Feature activation+5.69e+01

Pos logit contributions
 and+0.311
,+0.281
.+0.259
 said+0.243
!+0.240
Neg logit contributions
nam-0.191
igsaw-0.190
 ey-0.175
Mic-0.169
 microsc-0.159
Token and
Token ID290
Feature activation+7.94e+01

Pos logit contributions
 rang+0.048
 spoke+0.044
 raged+0.044
 replied+0.038
 died+0.038
Neg logit contributions
 ju-0.246
 redd-0.246
nam-0.238
igsaw-0.236
 sill-0.233
Token started
Token ID2067
Feature activation+7.07e+01

Pos logit contributions
 and+0.153
,+0.136
;+0.125
 -+0.124
.+0.117
Neg logit contributions
nam-0.316
 redd-0.282
Mic-0.271
 ey-0.270
 ju-0.269
Token to
Token ID284
Feature activation+6.42e+01

Pos logit contributions
 and+0.173
,+0.149
 -+0.131
 said+0.130
;+0.128
Neg logit contributions
nam-0.237
igsaw-0.222
 ey-0.218
Mic-0.213
 microsc-0.213
Token chew
Token ID34722
Feature activation+6.14e+01

Pos logit contributions
 and+0.209
,+0.185
 -+0.168
.+0.166
;+0.163
Neg logit contributions
nam-0.182
igsaw-0.171
 ey-0.170
Mic-0.169
 redd-0.158
Token.
Token ID13
Feature activation+7.45e+01

Pos logit contributions
 and+0.100
 rang+0.100
 replied+0.097
owed+0.097
 answered+0.092
Neg logit contributions
nam-0.228
 redd-0.214
Mic-0.212
igsaw-0.210
 ju-0.203
Token Crunch
Token ID38531
Feature activation+1.20e+02

Pos logit contributions
 and+0.270
,+0.244
.+0.223
 -+0.214
!+0.212
Neg logit contributions
igsaw-0.190
nam-0.184
 ey-0.172
Mic-0.170
 microsc-0.160
Token,
Token ID11
Feature activation+7.63e+01

Pos logit contributions
 and+0.198
 whispered+0.183
 approached+0.182
 spoke+0.177
 explained+0.174
Neg logit contributions
igsaw-0.440
 redd-0.376
 ju-0.371
lon-0.370
umm-0.367
Token crunch
Token ID23527
Feature activation+4.83e+01

Pos logit contributions
 and+0.233
,+0.222
.+0.212
 -+0.209
;+0.201
Neg logit contributions
nam-0.209
 redd-0.200
Mic-0.192
 ey-0.191
 ju-0.185
Token,
Token ID11
Feature activation+7.73e+01

Pos logit contributions
 whispered+0.140
 and+0.136
 explained+0.135
 approached+0.134
 agree+0.133
Neg logit contributions
igsaw-0.084
reating-0.079
 redd-0.077
 ey-0.074
rou-0.072
Token he
Token ID339
Feature activation+5.90e+01

Pos logit contributions
,+0.139
 and+0.139
 -+0.135
.+0.132
;+0.129
Neg logit contributions
 redd-0.281
nam-0.272
 ju-0.269
 ey-0.264
Mic-0.261
Token went
Token ID1816
Feature activation+8.51e+01

Pos logit contributions
;+0.159
 -+0.147
 –+0.147
'.+0.145
land+0.144
Neg logit contributions
 ju-0.145
 redd-0.138
nam-0.128
Mic-0.127
 triangular-0.112
Token But
Token ID887
Feature activation+7.36e+01

Pos logit contributions
 and+0.214
,+0.194
le+0.191
 -+0.183
.+0.183
Neg logit contributions
igsaw-0.145
nam-0.141
 redd-0.137
Mic-0.133
 microsc-0.124
Token they
Token ID484
Feature activation+4.61e+01

Pos logit contributions
 and+0.154
,+0.129
.+0.106
!+0.090
 -+0.089
Neg logit contributions
nam-0.315
igsaw-0.310
 ey-0.297
Mic-0.296
 Amen-0.284
Token kept
Token ID4030
Feature activation+8.24e+01

Pos logit contributions
 and+0.105
,+0.091
;+0.088
 -+0.086
!+0.084
Neg logit contributions
nam-0.157
Mic-0.138
 microsc-0.129
 gol-0.129
ptions-0.123
Token trying
Token ID2111
Feature activation+6.19e+01

Pos logit contributions
 and+0.245
,+0.214
.+0.188
!+0.169
 said+0.166
Neg logit contributions
igsaw-0.290
nam-0.288
 ey-0.273
Mic-0.273
 Amen-0.257
Token.
Token ID13
Feature activation+6.75e+01

Pos logit contributions
 replied+0.078
owed+0.074
 said+0.072
 rang+0.072
 sat+0.068
Neg logit contributions
nam-0.255
 microsc-0.246
igsaw-0.236
 ey-0.225
Ru-0.219
Token Tap
Token ID16880
Feature activation+1.19e+02

Pos logit contributions
 and+0.223
,+0.203
!+0.188
.+0.187
le+0.185
Neg logit contributions
igsaw-0.167
nam-0.158
Mic-0.150
 redd-0.148
 microsc-0.141
Token,
Token ID11
Feature activation+7.00e+01

Pos logit contributions
 and+0.104
 spoke+0.100
 said+0.099
 wanted+0.089
 answered+0.087
Neg logit contributions
igsaw-0.528
 avail-0.501
nam-0.489
ihu-0.488
amental-0.487
Token tap
Token ID9814
Feature activation+6.88e+01

Pos logit contributions
 and+0.227
,+0.207
.+0.191
 -+0.187
!+0.180
Neg logit contributions
nam-0.204
igsaw-0.189
 ey-0.186
Mic-0.176
 redd-0.171
Token,
Token ID11
Feature activation+7.10e+01

Pos logit contributions
int+0.134
moon+0.113
 approached+0.111
 replied+0.108
 answered+0.107
Neg logit contributions
 downward-0.201
 spec-0.196
 microsc-0.193
rou-0.188
amental-0.178
Token tap
Token ID9814
Feature activation+6.97e+01

Pos logit contributions
owed+0.191
 -+0.189
 and+0.184
;+0.184
,+0.179
Neg logit contributions
nam-0.195
 ey-0.185
 redd-0.172
ihu-0.165
igsaw-0.164
Token,
Token ID11
Feature activation+7.20e+01

Pos logit contributions
int+0.131
 overheard+0.126
 approached+0.120
 replied+0.118
 awoke+0.117
Neg logit contributions
 downward-0.211
 microsc-0.204
 spec-0.199
rou-0.196
arl-0.190
Token and
Token ID290
Feature activation+6.72e+01

Pos logit contributions
owed+0.064
ided+0.060
 agree+0.059
 repeated+0.059
 died+0.059
Neg logit contributions
igsaw-0.131
 microsc-0.124
 ey-0.123
reating-0.120
 Wed-0.117
Token said
Token ID531
Feature activation+6.92e+01

Pos logit contributions
 and+0.136
le+0.132
ops+0.120
int+0.117
;+0.117
Neg logit contributions
nam-0.251
years-0.201
 microsc-0.198
 redd-0.193
 ju-0.188
Token "
Token ID366
Feature activation+5.73e+01

Pos logit contributions
 and+0.095
 because+0.047
 said+0.047
hed+0.045
 whispered+0.044
Neg logit contributions
nam-0.308
igsaw-0.281
 microsc-0.275
 Amen-0.271
 redd-0.270
TokenTap
Token ID45081
Feature activation+9.70e+01

Pos logit contributions
,+0.235
 and+0.231
:+0.212
.+0.201
 -+0.192
Neg logit contributions
Mic-0.133
nam-0.130
 redd-0.115
Ta-0.111
Looks-0.107
Token!
Token ID0
Feature activation+7.34e+01

Pos logit contributions
 said+0.192
 spoke+0.181
 shook+0.180
 hated+0.172
 wanted+0.171
Neg logit contributions
igsaw-0.375
 Amen-0.333
 outer-0.324
amental-0.315
arl-0.301
Token Tap
Token ID16880
Feature activation+1.14e+02

Pos logit contributions
 and+0.272
,+0.245
.+0.229
 said+0.214
!+0.213
Neg logit contributions
nam-0.184
igsaw-0.184
 Amen-0.172
Mic-0.168
 ey-0.162
Token!
Token ID0
Feature activation+7.47e+01

Pos logit contributions
 said+0.197
 spoke+0.182
 hated+0.180
 needed+0.179
 wanted+0.177
Neg logit contributions
igsaw-0.448
amental-0.410
 Amen-0.393
 outer-0.388
ihu-0.387
Token I
Token ID314
Feature activation+3.93e+01

Pos logit contributions
 and+0.269
,+0.242
.+0.223
!+0.208
 -+0.208
Neg logit contributions
nam-0.200
igsaw-0.198
Mic-0.183
 Amen-0.181
 ey-0.177
Token found
Token ID1043
Feature activation+6.97e+01

Pos logit contributions
;+0.118
 and+0.118
:+0.117
,+0.110
 cheeks+0.110
Neg logit contributions
|-0.076
 Amen-0.072
 ey-0.065
Ta-0.064
igsaw-0.064
Token you
Token ID345
Feature activation+2.92e+01

Pos logit contributions
 and+0.254
,+0.222
 nodded+0.220
 shook+0.215
owed+0.213
Neg logit contributions
nam-0.167
igsaw-0.145
 microsc-0.145
 outer-0.129
 ey-0.127
Token!"
Token ID2474
Feature activation+7.10e+01

Pos logit contributions
 replied+0.082
 shook+0.076
 explained+0.075
anged+0.075
 nodded+0.072
Neg logit contributions
 queens-0.066
 yours-0.063
nam-0.062
rou-0.061
ifying-0.061
Token the
Token ID262
Feature activation+4.43e+01

Pos logit contributions
 and+0.158
,+0.140
.+0.122
 said+0.114
 -+0.114
Neg logit contributions
nam-0.183
igsaw-0.179
 ey-0.167
Mic-0.167
 microsc-0.157
Token ground
Token ID2323
Feature activation+5.12e+01

Pos logit contributions
 and+0.142
,+0.128
.+0.112
 -+0.108
 said+0.108
Neg logit contributions
nam-0.134
igsaw-0.126
Mic-0.121
 ey-0.118
 redd-0.112
Token that
Token ID326
Feature activation+5.72e+01

Pos logit contributions
int+0.080
 agree+0.075
hew+0.072
anna+0.072
 discussed+0.071
Neg logit contributions
saw-0.150
 ey-0.150
 abrupt-0.147
igsaw-0.142
nam-0.141
Token said
Token ID531
Feature activation+7.73e+01

Pos logit contributions
 and+0.128
,+0.102
;+0.092
 -+0.091
:+0.083
Neg logit contributions
nam-0.237
 redd-0.194
Mic-0.187
 microsc-0.186
 Gw-0.186
Token "
Token ID366
Feature activation+6.50e+01

Pos logit contributions
 and+0.190
int+0.163
le+0.162
 watched+0.157
hed+0.153
Neg logit contributions
nam-0.222
igsaw-0.219
 microsc-0.215
 redd-0.190
 ey-0.186
Tokenshort
Token ID19509
Feature activation+1.11e+02

Pos logit contributions
,+0.284
 and+0.282
:+0.253
.+0.238
 -+0.233
Neg logit contributions
Mic-0.132
nam-0.129
Ta-0.107
igsaw-0.107
Oh-0.103
Tokencut
Token ID8968
Feature activation-9.30e+00

Pos logit contributions
 watered+0.305
 felt+0.292
 and+0.292
 said+0.287
 admired+0.287
Neg logit contributions
igsaw-0.328
arl-0.293
 gol-0.274
ifying-0.263
reating-0.261
Token".
Token ID1911
Feature activation+8.35e+01

Pos logit contributions
 Amen+0.020
amental+0.019
Mic+0.019
igsaw+0.018
 gol+0.018
Neg logit contributions
 and-0.029
 said-0.027
 whispered-0.027
 shook-0.026
 touched-0.026
Token She
Token ID1375
Feature activation+6.03e+01

Pos logit contributions
 and+0.290
,+0.258
 -+0.253
!+0.249
;+0.240
Neg logit contributions
igsaw-0.206
nam-0.193
 ey-0.169
Mic-0.166
 microsc-0.157
Token thought
Token ID1807
Feature activation+8.22e+01

Pos logit contributions
.+0.114
,+0.111
,"+0.109
;+0.107
 and+0.105
Neg logit contributions
Mic-0.190
Ru-0.178
amental-0.178
nam-0.174
Spe-0.169
Token it
Token ID340
Feature activation+6.31e+01

Pos logit contributions
int+0.277
hi+0.273
thank+0.270
 approached+0.266
 overheard+0.264
Neg logit contributions
igsaw-0.139
nam-0.127
 microsc-0.123
 math-0.117
 Lorenzo-0.101
Token door
Token ID3420
Feature activation+4.62e+01

Pos logit contributions
 and+0.102
,+0.092
.+0.083
 -+0.081
!+0.081
Neg logit contributions
nam-0.091
igsaw-0.082
 ey-0.081
Mic-0.078
 Amen-0.076
Token made
Token ID925
Feature activation+6.04e+01

Pos logit contributions
anas+0.129
 agree+0.110
bee+0.109
eness+0.109
;+0.103
Neg logit contributions
saw-0.097
nam-0.094
hatt-0.088
 ey-0.084
aila-0.084
Token a
Token ID257
Feature activation+2.48e+01

Pos logit contributions
 and+0.197
 nodded+0.186
;+0.184
anas+0.181
owed+0.180
Neg logit contributions
nam-0.102
____-0.092
 outer-0.090
igsaw-0.086
ptions-0.086
Token sound
Token ID2128
Feature activation+3.94e+01

Pos logit contributions
 and+0.084
,+0.075
.+0.068
!+0.062
 said+0.061
Neg logit contributions
nam-0.074
igsaw-0.074
Mic-0.070
 ey-0.069
 Amen-0.065
Token.
Token ID13
Feature activation+5.62e+01

Pos logit contributions
anas+0.104
hew+0.103
orted+0.101
 necks+0.096
 declined+0.095
Neg logit contributions
 outer-0.099
reating-0.091
saw-0.085
 sill-0.080
 gloomy-0.078
Token Knock
Token ID29061
Feature activation+1.10e+02

Pos logit contributions
 and+0.198
,+0.178
 -+0.174
!+0.171
;+0.164
Neg logit contributions
nam-0.132
igsaw-0.119
Mic-0.109
 ey-0.106
 Amen-0.099
Token,
Token ID11
Feature activation+5.84e+01

Pos logit contributions
 argued+0.145
 kept+0.144
owed+0.144
 nodded+0.143
hed+0.143
Neg logit contributions
igsaw-0.401
nam-0.393
 Amen-0.387
 outer-0.352
 Gw-0.350
Token knock
Token ID10643
Feature activation+5.50e+01

Pos logit contributions
,+0.154
 and+0.152
ops+0.152
 -+0.152
owed+0.149
Neg logit contributions
nam-0.185
 ey-0.152
 Amen-0.140
 redd-0.140
 microsc-0.133
Token,
Token ID11
Feature activation+5.95e+01

Pos logit contributions
anas+0.158
int+0.157
age+0.156
 replied+0.138
 declined+0.138
Neg logit contributions
 recess-0.106
nam-0.102
 your-0.099
saw-0.097
 Lena-0.095
Token knock
Token ID10643
Feature activation+5.61e+01

Pos logit contributions
owed+0.127
ops+0.124
 -+0.121
;+0.117
anas+0.116
Neg logit contributions
nam-0.213
 ey-0.186
 redd-0.179
 Amen-0.170
arl-0.166
Token.
Token ID13
Feature activation+5.96e+01

Pos logit contributions
anas+0.184
int+0.179
age+0.178
 agree+0.171
 declined+0.170
Neg logit contributions
nam-0.104
 recess-0.089
reating-0.087
 your-0.087
saw-0.082
Token went
Token ID1816
Feature activation+8.16e+01

Pos logit contributions
,"+0.021
orting+0.021
;+0.021
 anymore+0.021
',"+0.021
Neg logit contributions
Jul-0.018
 Avery-0.017
iam-0.017
amental-0.017
 Aid-0.016
Token to
Token ID284
Feature activation+6.53e+01

Pos logit contributions
 and+0.175
,+0.152
 said+0.131
 -+0.130
.+0.129
Neg logit contributions
igsaw-0.288
nam-0.286
 ey-0.265
 Amen-0.256
reating-0.254
Token the
Token ID262
Feature activation+5.29e+01

Pos logit contributions
 and+0.191
,+0.168
.+0.148
!+0.134
 -+0.133
Neg logit contributions
nam-0.225
igsaw-0.222
 ey-0.210
Mic-0.210
 Amen-0.198
Token park
Token ID3952
Feature activation+6.25e+01

Pos logit contributions
 and+0.192
,+0.173
.+0.156
 -+0.151
 because+0.150
Neg logit contributions
nam-0.136
igsaw-0.129
 ey-0.123
 redd-0.117
 microsc-0.116
Token and
Token ID290
Feature activation+8.40e+01

Pos logit contributions
 died+0.093
 blew+0.078
 slapped+0.071
les+0.070
le+0.069
Neg logit contributions
nam-0.228
igsaw-0.223
saw-0.198
amental-0.197
 sill-0.197
Token performed
Token ID6157
Feature activation+1.09e+02

Pos logit contributions
 and+0.168
,+0.138
.+0.134
;+0.133
 -+0.132
Neg logit contributions
nam-0.331
 microsc-0.286
igsaw-0.281
 redd-0.264
saw-0.264
Token for
Token ID329
Feature activation+6.55e+01

Pos logit contributions
 and+0.311
 died+0.297
 wanted+0.286
 approached+0.278
 woke+0.275
Neg logit contributions
nam-0.304
igsaw-0.298
 ey-0.260
 microsc-0.257
 outer-0.247
Token a
Token ID257
Feature activation+4.69e+01

Pos logit contributions
 and+0.199
,+0.176
.+0.155
 -+0.143
 said+0.143
Neg logit contributions
nam-0.216
igsaw-0.210
 ey-0.197
Mic-0.194
 microsc-0.188
Token long
Token ID890
Feature activation+1.61e+01

Pos logit contributions
 and+0.181
,+0.154
owed+0.153
 -+0.148
:+0.146
Neg logit contributions
igsaw-0.111
nam-0.103
Mic-0.087
 microsc-0.086
 gol-0.079
Token time
Token ID640
Feature activation+6.11e+01

Pos logit contributions
 and+0.056
owed+0.050
 -+0.049
 said+0.049
,+0.049
Neg logit contributions
igsaw-0.039
Mic-0.032
 ey-0.031
nam-0.030
ihu-0.027
Token.
Token ID13
Feature activation+7.95e+01

Pos logit contributions
 replied+0.103
 woke+0.100
 died+0.095
 shook+0.091
 lied+0.087
Neg logit contributions
igsaw-0.216
nam-0.207
 outer-0.193
 recess-0.193
 microsc-0.192
Token as
Token ID355
Feature activation+6.21e+01

Pos logit contributions
owed+0.186
int+0.152
le+0.145
uddled+0.144
,"+0.143
Neg logit contributions
nam-0.220
 microsc-0.194
 ey-0.192
 ju-0.191
 redd-0.189
Token it
Token ID340
Feature activation+6.24e+01

Pos logit contributions
 and+0.160
,+0.142
.+0.122
!+0.104
 said+0.100
Neg logit contributions
igsaw-0.254
 ey-0.243
nam-0.242
Mic-0.237
 Amen-0.228
Token slowly
Token ID6364
Feature activation+6.39e+01

Pos logit contributions
 and+0.138
,"+0.123
",+0.118
!"+0.118
,+0.118
Neg logit contributions
Looks-0.172
 ju-0.169
 microsc-0.168
Ru-0.165
 redd-0.164
Token began
Token ID2540
Feature activation+6.94e+01

Pos logit contributions
!"+0.152
anas+0.150
?"+0.150
'."+0.143
!".+0.139
Neg logit contributions
 spec-0.150
 gol-0.144
 ey-0.143
 mer-0.139
ubb-0.138
Token to
Token ID284
Feature activation+6.37e+01

Pos logit contributions
+0.184
 replied+0.163
anas+0.162
int+0.161
 and+0.159
Neg logit contributions
 microsc-0.190
nam-0.168
 ey-0.163
igsaw-0.162
 outer-0.155
Token dawn
Token ID17577
Feature activation+1.09e+02

Pos logit contributions
uddled+0.216
 and+0.203
ed+0.201
owed+0.191
ages+0.187
Neg logit contributions
 outer-0.150
nam-0.147
 ey-0.140
 avail-0.137
igsaw-0.129
Token on
Token ID319
Feature activation+6.22e+01

Pos logit contributions
 told+0.146
 explained+0.146
 and+0.145
 agree+0.137
 said+0.135
Neg logit contributions
igsaw-0.422
 gol-0.365
 downward-0.356
 microsc-0.355
arl-0.349
Token him
Token ID683
Feature activation+4.70e+01

Pos logit contributions
owed+0.234
 said+0.207
uddled+0.207
 rang+0.204
 and+0.204
Neg logit contributions
nam-0.114
igsaw-0.114
 microsc-0.110
 outer-0.098
 avail-0.093
Token that
Token ID326
Feature activation+6.40e+01

Pos logit contributions
uddled+0.097
 blew+0.094
int+0.092
 nodded+0.091
 checked+0.091
Neg logit contributions
 microsc-0.141
igsaw-0.129
 possible-0.114
rou-0.113
 sill-0.111
Token he
Token ID339
Feature activation+5.67e+01

Pos logit contributions
 and+0.210
owed+0.209
int+0.199
uddled+0.195
anas+0.180
Neg logit contributions
nam-0.165
 microsc-0.148
igsaw-0.125
 ey-0.122
 outer-0.119
Token was
Token ID373
Feature activation+6.21e+01

Pos logit contributions
,+0.137
 and+0.135
;+0.127
 but+0.125
,"+0.124
Neg logit contributions
 microsc-0.121
years-0.120
pill-0.118
rou-0.117
 brav-0.108
TokenBen
Token ID11696
Feature activation+5.24e+01

Pos logit contributions
 and+0.150
,+0.131
.+0.117
!+0.102
 said+0.102
Neg logit contributions
nam-0.153
igsaw-0.149
 Amen-0.135
 ey-0.130
 redd-0.127
Token took
Token ID1718
Feature activation+8.10e+01

Pos logit contributions
;+0.121
'."+0.121
',"+0.120
".+0.119
,"+0.118
Neg logit contributions
 ey-0.134
Ru-0.127
amental-0.120
oren-0.111
 amaz-0.108
Token the
Token ID262
Feature activation+5.12e+01

Pos logit contributions
 and+0.190
,+0.171
.+0.142
!+0.109
 said+0.104
Neg logit contributions
nam-0.375
 ey-0.357
Mic-0.357
igsaw-0.355
 Amen-0.345
Token stick
Token ID4859
Feature activation+7.36e+01

Pos logit contributions
 and+0.147
,+0.130
.+0.113
!+0.101
 said+0.100
Neg logit contributions
nam-0.184
igsaw-0.183
 ey-0.171
Mic-0.170
 Amen-0.168
Token and
Token ID290
Feature activation+8.25e+01

Pos logit contributions
 and+0.036
 watched+0.035
 replied+0.034
 said+0.033
 rang+0.029
Neg logit contributions
igsaw-0.341
nam-0.336
 ey-0.324
 redd-0.312
 microsc-0.307
Token tapped
Token ID22499
Feature activation+1.09e+02

Pos logit contributions
 and+0.160
,+0.137
.+0.123
 -+0.123
;+0.121
Neg logit contributions
nam-0.336
igsaw-0.306
 ey-0.303
 microsc-0.292
Mic-0.291
Token the
Token ID262
Feature activation+5.32e+01

Pos logit contributions
 and+0.322
 rang+0.303
,+0.292
owed+0.285
;+0.284
Neg logit contributions
nam-0.258
igsaw-0.234
 ey-0.224
 redd-0.223
 microsc-0.211
Token ac
Token ID936
Feature activation+2.37e+01

Pos logit contributions
 and+0.160
,+0.143
.+0.126
 -+0.117
 said+0.116
Neg logit contributions
nam-0.173
igsaw-0.167
 ey-0.163
Mic-0.162
 redd-0.152
Tokenorn
Token ID1211
Feature activation+3.23e+01

Pos logit contributions
 and+0.082
,+0.072
.+0.065
!+0.061
 -+0.055
Neg logit contributions
nam-0.087
 Amen-0.078
igsaw-0.078
 microsc-0.073
ptions-0.072
Token with
Token ID351
Feature activation+7.50e+01

Pos logit contributions
 discussed+0.053
anas+0.051
 overheard+0.051
 watched+0.048
 explained+0.048
Neg logit contributions
nam-0.107
 redd-0.097
igsaw-0.093
 ey-0.090
 microsc-0.090
Token it
Token ID340
Feature activation+6.97e+01

Pos logit contributions
 and+0.209
,+0.182
.+0.160
!+0.143
 said+0.139
Neg logit contributions
nam-0.277
igsaw-0.274
Mic-0.263
 ey-0.258
 Amen-0.252
Token spread
Token ID4104
Feature activation+8.69e+01

Pos logit contributions
ops+0.217
 and+0.195
 replied+0.184
owed+0.183
uddled+0.181
Neg logit contributions
nam-0.144
 possible-0.116
 redd-0.113
 rewarding-0.112
 queens-0.110
Token far
Token ID1290
Feature activation+4.18e+01

Pos logit contributions
 rang+0.210
 and+0.193
 sat+0.192
 scanned+0.189
 approached+0.188
Neg logit contributions
igsaw-0.236
 ey-0.234
nam-0.232
 microsc-0.223
 redd-0.215
Token and
Token ID290
Feature activation+8.05e+01

Pos logit contributions
anas+0.101
owed+0.100
 replied+0.099
 rang+0.099
ops+0.098
Neg logit contributions
nam-0.115
 ey-0.109
igsaw-0.109
 outer-0.105
 redd-0.102
Token wide
Token ID3094
Feature activation+1.73e+01

Pos logit contributions
 and+0.257
,+0.230
.+0.205
!+0.187
 -+0.186
Neg logit contributions
nam-0.253
igsaw-0.248
 ey-0.234
Mic-0.233
 Amen-0.220
Token.
Token ID13
Feature activation+7.40e+01

Pos logit contributions
 replied+0.024
 said+0.024
 rang+0.023
 explained+0.021
 got+0.020
Neg logit contributions
igsaw-0.065
 ey-0.063
 outer-0.062
 microsc-0.062
nam-0.061
Token Simple
Token ID17427
Feature activation+1.09e+02

Pos logit contributions
 and+0.175
,+0.143
.+0.123
 because+0.107
 -+0.104
Neg logit contributions
nam-0.328
 ey-0.297
igsaw-0.291
 redd-0.289
Mic-0.287
Token acts
Token ID6529
Feature activation+5.66e+01

Pos logit contributions
 and+0.340
 rang+0.333
 said+0.326
 sat+0.320
 replied+0.316
Neg logit contributions
igsaw-0.250
 ey-0.238
 microsc-0.204
ubb-0.188
 redd-0.180
Token of
Token ID286
Feature activation+2.87e+01

Pos logit contributions
 rang+0.147
 scanned+0.141
 sat+0.139
 approached+0.136
 walked+0.130
Neg logit contributions
igsaw-0.133
 ey-0.125
arl-0.120
rou-0.120
 gol-0.117
Token kindness
Token ID23887
Feature activation+7.20e+01

Pos logit contributions
 and+0.097
,+0.088
 said+0.083
 rang+0.083
owed+0.082
Neg logit contributions
nam-0.071
igsaw-0.068
 ey-0.066
 microsc-0.062
 Amen-0.059
Token can
Token ID460
Feature activation+4.67e+01

Pos logit contributions
int+0.194
 overheard+0.183
 approached+0.178
 scanned+0.174
anas+0.173
Neg logit contributions
igsaw-0.141
 ey-0.132
 gol-0.126
arl-0.124
 possible-0.124
Token often
Token ID1690
Feature activation+5.20e+01

Pos logit contributions
 and+0.124
owed+0.119
ed+0.119
uddled+0.115
 sat+0.113
Neg logit contributions
 possible-0.105
 Amen-0.093
Ru-0.093
Wal-0.092
 avail-0.092
Token a
Token ID257
Feature activation+4.26e+01

Pos logit contributions
 and+0.207
,+0.182
.+0.161
 said+0.155
 -+0.153
Neg logit contributions
nam-0.202
igsaw-0.195
 ey-0.185
Mic-0.183
 microsc-0.177
Token way
Token ID835
Feature activation+8.70e+01

Pos logit contributions
 and+0.156
owed+0.139
,+0.138
:+0.128
.+0.128
Neg logit contributions
igsaw-0.092
nam-0.090
Mic-0.084
 redd-0.082
 microsc-0.079
Token back
Token ID736
Feature activation+4.87e+01

Pos logit contributions
 and+0.167
,+0.135
 said+0.118
 gave+0.106
 replied+0.106
Neg logit contributions
igsaw-0.336
nam-0.333
 ey-0.313
 microsc-0.306
Mic-0.303
Token.
Token ID13
Feature activation+7.50e+01

Pos logit contributions
 and+0.111
,+0.094
 said+0.083
 replied+0.079
 gave+0.078
Neg logit contributions
igsaw-0.174
nam-0.165
 ey-0.161
 microsc-0.153
Mic-0.150
Token He
Token ID679
Feature activation+5.20e+01

Pos logit contributions
 and+0.211
,+0.185
.+0.162
 said+0.144
!+0.144
Neg logit contributions
nam-0.271
igsaw-0.264
 ey-0.252
Mic-0.250
 Amen-0.239
Token died
Token ID3724
Feature activation+1.09e+02

Pos logit contributions
;+0.117
.+0.115
,+0.112
,"+0.109
'."+0.106
Neg logit contributions
iam-0.124
 gol-0.122
Mic-0.121
Looks-0.120
years-0.119
Token by
Token ID416
Feature activation+5.18e+01

Pos logit contributions
 replied+0.169
int+0.167
 explained+0.158
 spoke+0.158
'+0.158
Neg logit contributions
nam-0.392
keeping-0.325
rier-0.317
rou-0.312
igsaw-0.312
Token the
Token ID262
Feature activation+5.43e+01

Pos logit contributions
 and+0.156
,+0.143
owed+0.139
 said+0.138
 rang+0.135
Neg logit contributions
nam-0.134
igsaw-0.131
 microsc-0.121
 ey-0.116
 Lorenzo-0.106
Token wide
Token ID3094
Feature activation+2.06e+01

Pos logit contributions
owed+0.205
 and+0.203
,+0.194
 -+0.182
;+0.182
Neg logit contributions
 Amen-0.097
nam-0.091
 redd-0.088
 ey-0.086
igsaw-0.083
Token lake
Token ID13546
Feature activation+7.52e+01

Pos logit contributions
owed+0.073
 explained+0.064
 listened+0.063
 spoke+0.063
ave+0.063
Neg logit contributions
igsaw-0.039
 outer-0.038
 camps-0.037
 ey-0.036
 Amen-0.036
Token.
Token ID13
Feature activation+7.88e+01

Pos logit contributions
int+0.101
 rang+0.090
owed+0.086
 tapped+0.082
 whispered+0.078
Neg logit contributions
igsaw-0.274
rou-0.243
spe-0.234
 sill-0.231
 microsc-0.228
Token to
Token ID284
Feature activation+6.33e+01

Pos logit contributions
ops+0.174
owed+0.165
 rang+0.157
anas+0.154
 died+0.153
Neg logit contributions
nam-0.281
 ey-0.267
 redd-0.258
 microsc-0.247
igsaw-0.231
Token it
Token ID340
Feature activation+6.35e+01

Pos logit contributions
 and+0.181
,+0.159
.+0.140
 -+0.126
!+0.126
Neg logit contributions
nam-0.221
igsaw-0.215
 ey-0.205
Mic-0.203
 Amen-0.194
Token and
Token ID290
Feature activation+8.03e+01

Pos logit contributions
 died+0.125
 fought+0.119
 drowned+0.113
 argued+0.099
Ble+0.098
Neg logit contributions
nam-0.194
 downward-0.189
 outer-0.178
 redd-0.177
amental-0.172
Token said
Token ID531
Feature activation+8.23e+01

Pos logit contributions
 and+0.151
ides+0.137
 -+0.134
;+0.134
eness+0.130
Neg logit contributions
nam-0.325
 microsc-0.244
 ju-0.244
 redd-0.243
 Gw-0.241
Token '
Token ID705
Feature activation+7.23e+01

Pos logit contributions
 and+0.073
 said+0.016
.+0.012
 because+0.008
 whispered+0.006
Neg logit contributions
nam-0.421
igsaw-0.399
 microsc-0.387
 ey-0.377
 Amen-0.377
Tokensun
Token ID19155
Feature activation+1.08e+02

Pos logit contributions
 and+0.296
,+0.278
:+0.246
.+0.239
 -+0.233
Neg logit contributions
nam-0.166
Mic-0.147
igsaw-0.143
 redd-0.137
 Amen-0.134
Token'.
Token ID4458
Feature activation+7.78e+01

Pos logit contributions
 hid+0.269
 and+0.251
 checked+0.246
 scanned+0.236
 realized+0.236
Neg logit contributions
igsaw-0.341
 Amen-0.292
 redd-0.284
pill-0.283
ifying-0.281
Token Then
Token ID3244
Feature activation+6.85e+01

Pos logit contributions
 and+0.298
,+0.256
le+0.252
!+0.248
 watered+0.247
Neg logit contributions
 ey-0.134
nam-0.133
 microsc-0.129
igsaw-0.127
 Esk-0.104
Token Mike
Token ID4995
Feature activation+5.72e+01

Pos logit contributions
 and+0.106
.+0.057
,+0.054
owed+0.054
 died+0.052
Neg logit contributions
nam-0.292
 microsc-0.259
igsaw-0.257
 ey-0.255
 avail-0.254
Token ran
Token ID4966
Feature activation+8.59e+01

Pos logit contributions
 than+0.138
;+0.132
eness+0.132
oor+0.131
".+0.127
Neg logit contributions
iam-0.120
 begg-0.120
saw-0.118
Ru-0.114
lon-0.113
Token off
Token ID572
Feature activation+5.91e+01

Pos logit contributions
 and+0.172
,+0.146
.+0.122
 said+0.109
 -+0.106
Neg logit contributions
igsaw-0.352
nam-0.350
 ey-0.331
Mic-0.330
 microsc-0.316
Token King
Token ID2677
Feature activation+5.11e+01

Pos logit contributions
nam+0.006
igsaw+0.006
 ey+0.006
 microsc+0.005
 queens+0.005
Neg logit contributions
owed-0.010
 and-0.010
ided-0.009
 replied-0.009
int-0.009
Token was
Token ID373
Feature activation+5.66e+01

Pos logit contributions
zes+0.095
EEP+0.094
int+0.093
irus+0.090
ANG+0.088
Neg logit contributions
 microsc-0.124
pill-0.122
spe-0.118
igsaw-0.116
nam-0.115
Token cruel
Token ID12177
Feature activation+3.59e+01

Pos logit contributions
 and+0.175
owed+0.156
,+0.154
ops+0.151
 replied+0.146
Neg logit contributions
nam-0.153
igsaw-0.123
 ey-0.121
years-0.118
ptions-0.118
Token and
Token ID290
Feature activation+7.88e+01

Pos logit contributions
 rang+0.051
owed+0.045
int+0.045
 replied+0.044
 approached+0.037
Neg logit contributions
reating-0.118
 microsc-0.116
 ey-0.113
igsaw-0.113
rou-0.110
Token had
Token ID550
Feature activation+5.96e+01

Pos logit contributions
 and+0.237
,+0.208
.+0.183
 -+0.181
;+0.177
Neg logit contributions
nam-0.265
 microsc-0.208
 redd-0.207
 Amen-0.204
 ey-0.203
Token enslaved
Token ID39216
Feature activation+1.08e+02

Pos logit contributions
 and+0.183
,+0.171
owed+0.162
our+0.147
;+0.145
Neg logit contributions
nam-0.158
 microsc-0.144
 redd-0.135
 ju-0.130
 queens-0.126
Token many
Token ID867
Feature activation+4.72e+01

Pos logit contributions
 shook+0.270
 and+0.261
 rang+0.259
 nodded+0.255
 replied+0.252
Neg logit contributions
nam-0.270
igsaw-0.269
 microsc-0.262
 outer-0.251
iam-0.242
Token of
Token ID286
Feature activation+2.74e+01

Pos logit contributions
owed+0.150
 replied+0.148
 and+0.143
 rang+0.139
 said+0.132
Neg logit contributions
 microsc-0.109
igsaw-0.104
 ey-0.101
nam-0.101
 queens-0.092
Token the
Token ID262
Feature activation+5.18e+01

Pos logit contributions
 and+0.085
owed+0.081
 replied+0.078
,+0.078
 rang+0.075
Neg logit contributions
nam-0.075
 microsc-0.068
igsaw-0.065
 ey-0.063
 outer-0.056
Token people
Token ID661
Feature activation+7.74e+01

Pos logit contributions
 and+0.192
owed+0.186
,+0.171
 replied+0.168
:+0.161
Neg logit contributions
nam-0.119
 microsc-0.106
 ey-0.105
igsaw-0.105
 queens-0.096
Token of
Token ID286
Feature activation+2.88e+01

Pos logit contributions
uddled+0.125
 replied+0.121
 shook+0.121
owed+0.115
int+0.114
Neg logit contributions
nam-0.275
 microsc-0.242
igsaw-0.241
saw-0.234
reating-0.223
Token,
Token ID11
Feature activation+6.82e+01

Pos logit contributions
 rang+0.080
 sat+0.077
 approached+0.067
 scanned+0.065
 entered+0.065
Neg logit contributions
igsaw-0.243
nam-0.239
Ru-0.226
arl-0.222
 microsc-0.220
Token you
Token ID345
Feature activation+3.27e+01

Pos logit contributions
 and+0.206
,+0.183
.+0.172
 said+0.165
owed+0.163
Neg logit contributions
nam-0.196
igsaw-0.178
 ey-0.174
 microsc-0.168
Mic-0.165
Token can
Token ID460
Feature activation+3.88e+01

Pos logit contributions
 and+0.095
uddled+0.093
;+0.089
 scanned+0.088
 replied+0.088
Neg logit contributions
Ru-0.057
nam-0.056
 Amen-0.054
igsaw-0.053
 brav-0.050
Token reach
Token ID3151
Feature activation+6.46e+01

Pos logit contributions
 and+0.110
,+0.097
 shook+0.090
 said+0.089
owed+0.089
Neg logit contributions
 Amen-0.096
Mic-0.096
igsaw-0.091
nam-0.091
Ru-0.089
Token your
Token ID534
Feature activation+2.69e+01

Pos logit contributions
 rang+0.184
owed+0.179
 woke+0.176
 replied+0.176
 sat+0.175
Neg logit contributions
nam-0.172
 outer-0.157
igsaw-0.157
 microsc-0.150
 ey-0.150
Tokengoal
Token ID35231
Feature activation+1.08e+02

Pos logit contributions
 and+0.088
 came+0.081
,+0.081
owed+0.081
 said+0.080
Neg logit contributions
nam-0.068
igsaw-0.060
 redd-0.055
 ey-0.054
Mic-0.054
Token.
Token ID13
Feature activation+7.02e+01

Pos logit contributions
 and+0.190
 checked+0.188
 scanned+0.184
 rang+0.182
 replied+0.181
Neg logit contributions
igsaw-0.380
nam-0.358
 queens-0.352
ubb-0.350
 possible-0.349
Token<|endoftext|>
Token ID50256
Feature activation+7.08e+01

Pos logit contributions
 and+0.250
,+0.227
 said+0.212
 got+0.212
 started+0.208
Neg logit contributions
igsaw-0.166
 Deal-0.154
 Amen-0.153
nam-0.151
Mic-0.149
TokenOnce
Token ID7454
Feature activation+6.07e+01

Pos logit contributions
 and+0.221
,+0.197
 because+0.174
 said+0.170
 -+0.167
Neg logit contributions
igsaw-0.198
Mic-0.182
 ey-0.179
nam-0.179
 Amen-0.172
Token upon
Token ID2402
Feature activation+1.63e+01

Pos logit contributions
 and+0.221
,+0.209
.+0.186
!+0.165
 said+0.163
Neg logit contributions
nam-0.187
Mic-0.185
igsaw-0.180
 ey-0.170
 Amen-0.170
Token a
Token ID257
Feature activation+4.21e+01

Pos logit contributions
 and+0.041
,+0.034
.+0.032
!+0.025
 because+0.023
Neg logit contributions
igsaw-0.070
nam-0.069
 ey-0.067
Mic-0.066
 redd-0.063
Token wear
Token ID5806
Feature activation+5.61e+01

Pos logit contributions
 and+0.205
;+0.185
uddled+0.183
 -+0.179
ed+0.178
Neg logit contributions
 Amen-0.130
 outer-0.129
 avail-0.127
nam-0.125
igsaw-0.122
Token the
Token ID262
Feature activation+4.98e+01

Pos logit contributions
 and+0.162
owed+0.148
,+0.144
 stopped+0.137
 shook+0.137
Neg logit contributions
nam-0.152
 microsc-0.137
igsaw-0.135
 outer-0.128
 ey-0.124
Token hat
Token ID6877
Feature activation+5.92e+01

Pos logit contributions
 and+0.145
,+0.135
 -+0.119
.+0.117
:+0.117
Neg logit contributions
 ey-0.146
nam-0.145
 redd-0.139
igsaw-0.138
Mic-0.132
Token while
Token ID981
Feature activation+7.94e+01

Pos logit contributions
 watched+0.088
 approached+0.087
uddled+0.083
 died+0.083
 blew+0.083
Neg logit contributions
nam-0.219
igsaw-0.196
reating-0.187
 possible-0.176
 ju-0.176
Token he
Token ID339
Feature activation+5.52e+01

Pos logit contributions
 and+0.262
owed+0.223
,+0.222
 replied+0.214
hed+0.212
Neg logit contributions
nam-0.220
 avail-0.167
 microsc-0.162
chard-0.154
 ey-0.152
Token performed
Token ID6157
Feature activation+1.08e+02

Pos logit contributions
;+0.152
 –+0.131
 -+0.130
 —+0.128
,+0.127
Neg logit contributions
V-0.133
andon-0.132
iam-0.130
 Rib-0.129
uka-0.128
Token his
Token ID465
Feature activation+3.75e+01

Pos logit contributions
 and+0.258
 died+0.227
 approached+0.227
 shook+0.224
 stopped+0.223
Neg logit contributions
nam-0.346
igsaw-0.325
 microsc-0.309
 outer-0.307
 ey-0.290
Token tricks
Token ID15910
Feature activation+7.73e+01

Pos logit contributions
 and+0.129
,+0.119
;+0.115
 -+0.114
hed+0.114
Neg logit contributions
nam-0.079
igsaw-0.076
 ey-0.072
 avail-0.065
 microsc-0.065
Token.
Token ID13
Feature activation+7.53e+01

Pos logit contributions
 calmed+0.150
 checked+0.149
 died+0.147
uddled+0.146
 approached+0.142
Neg logit contributions
nam-0.232
igsaw-0.229
reating-0.226
 chess-0.213
 possible-0.213
Token He
Token ID679
Feature activation+5.34e+01

Pos logit contributions
 and+0.285
,+0.265
 -+0.258
!+0.254
owed+0.246
Neg logit contributions
igsaw-0.155
nam-0.142
 redd-0.129
Mic-0.129
 ey-0.118
Token put
Token ID1234
Feature activation+8.62e+01

Pos logit contributions
;+0.132
 -+0.119
 –+0.118
 —+0.115
',+0.111
Neg logit contributions
nam-0.148
iam-0.141
V-0.136
 microsc-0.133
 Deliver-0.132
Token to
Token ID284
Feature activation+6.44e+01

Pos logit contributions
 and+0.213
owed+0.192
 replied+0.176
,+0.175
;+0.171
Neg logit contributions
nam-0.244
 microsc-0.229
Mic-0.226
 Lorenzo-0.209
 Amen-0.208
Token help
Token ID1037
Feature activation+7.03e+01

Pos logit contributions
 and+0.202
,+0.171
;+0.160
 -+0.159
:+0.156
Neg logit contributions
 Amen-0.167
Mic-0.164
nam-0.160
igsaw-0.156
 ey-0.153
Token.
Token ID13
Feature activation+7.40e+01

Pos logit contributions
uddled+0.140
 and+0.137
owed+0.129
 rang+0.128
 nodded+0.124
Neg logit contributions
nam-0.235
Mic-0.215
 avail-0.208
 microsc-0.197
 Lorenzo-0.196
Token The
Token ID383
Feature activation+6.73e+01

Pos logit contributions
owed+0.257
 and+0.257
le+0.251
 -+0.249
 watered+0.238
Neg logit contributions
Mic-0.119
igsaw-0.115
nam-0.114
 ju-0.104
 redd-0.102
Token spirit
Token ID4437
Feature activation+5.80e+01

Pos logit contributions
 and+0.186
,+0.163
.+0.138
 because+0.115
 said+0.114
Neg logit contributions
igsaw-0.266
nam-0.254
 ey-0.250
ptions-0.232
 microsc-0.232
Token served
Token ID4983
Feature activation+1.07e+02

Pos logit contributions
oor+0.156
anas+0.154
EL+0.149
ides+0.147
ops+0.146
Neg logit contributions
igator-0.113
Mic-0.108
 Amen-0.106
Prin-0.101
nam-0.101
Token the
Token ID262
Feature activation+5.36e+01

Pos logit contributions
 and+0.342
 rang+0.327
;+0.317
owed+0.316
 -+0.315
Neg logit contributions
nam-0.229
Mic-0.206
 microsc-0.199
igsaw-0.186
 ey-0.178
Token rabbit
Token ID22746
Feature activation+6.42e+01

Pos logit contributions
 and+0.178
,+0.162
.+0.149
 -+0.146
owed+0.143
Neg logit contributions
nam-0.148
Mic-0.147
igsaw-0.139
 ey-0.132
 microsc-0.129
Token some
Token ID617
Feature activation+4.26e+01

Pos logit contributions
 rang+0.161
our+0.142
 repeated+0.141
ops+0.140
+0.138
Neg logit contributions
igsaw-0.165
nam-0.162
 microsc-0.144
 redd-0.141
 Lorenzo-0.141
Token y
Token ID331
Feature activation+2.90e+01

Pos logit contributions
 and+0.147
 rang+0.145
owed+0.144
+0.138
:+0.138
Neg logit contributions
 ey-0.071
 redd-0.067
nam-0.066
 microsc-0.066
Mic-0.062
Tokenummy
Token ID13513
Feature activation+2.39e+01

Pos logit contributions
 and+0.095
,+0.089
.+0.078
!+0.071
 -+0.070
Neg logit contributions
nam-0.109
 Amen-0.093
Mic-0.093
igsaw-0.089
 redd-0.086
Token got
Token ID1392
Feature activation+6.17e+01

Pos logit contributions
 and+0.099
owed+0.085
.+0.085
,"+0.084
anas+0.083
Neg logit contributions
igsaw-0.171
 gol-0.170
iam-0.168
 ey-0.167
 jan-0.163
Token there
Token ID612
Feature activation+5.27e+01

Pos logit contributions
 and+0.150
,+0.132
.+0.107
 said+0.093
!+0.091
Neg logit contributions
igsaw-0.255
nam-0.251
Mic-0.243
 ey-0.243
 Amen-0.231
Token,
Token ID11
Feature activation+7.60e+01

Pos logit contributions
anas+0.090
int+0.084
owed+0.079
 watered+0.078
 whispered+0.075
Neg logit contributions
nam-0.174
 microsc-0.155
igsaw-0.154
 Tues-0.154
 avail-0.150
Token he
Token ID339
Feature activation+5.75e+01

Pos logit contributions
 and+0.226
,+0.199
.+0.178
 -+0.163
!+0.162
Neg logit contributions
nam-0.254
igsaw-0.237
 ey-0.225
Mic-0.223
 Amen-0.215
Token was
Token ID373
Feature activation+6.30e+01

Pos logit contributions
eness+0.136
",+0.136
',"+0.136
;+0.134
red+0.132
Neg logit contributions
iam-0.136
 sch-0.132
 microsc-0.126
nam-0.124
rou-0.121
Token welcomed
Token ID15846
Feature activation+1.07e+02

Pos logit contributions
 and+0.207
,+0.183
ops+0.178
 replied+0.172
 -+0.171
Neg logit contributions
nam-0.163
years-0.127
ptions-0.123
rou-0.117
 Amen-0.116
Token with
Token ID351
Feature activation+7.45e+01

Pos logit contributions
 and+0.208
,+0.197
 said+0.196
 died+0.188
 got+0.187
Neg logit contributions
nam-0.361
igsaw-0.323
reating-0.315
 microsc-0.311
amental-0.310
Token a
Token ID257
Feature activation+4.75e+01

Pos logit contributions
 and+0.235
,+0.209
.+0.184
 -+0.173
!+0.170
Neg logit contributions
nam-0.237
igsaw-0.226
 ey-0.211
Mic-0.209
 microsc-0.202
Token smile
Token ID8212
Feature activation+7.42e+01

Pos logit contributions
 and+0.180
owed+0.161
,+0.156
.+0.152
 because+0.151
Neg logit contributions
igsaw-0.110
nam-0.107
 ey-0.096
Mic-0.092
 microsc-0.089
Token from
Token ID422
Feature activation+5.94e+01

Pos logit contributions
 shook+0.130
uddled+0.124
 died+0.123
 said+0.120
 and+0.120
Neg logit contributions
igsaw-0.218
reating-0.212
ifying-0.206
nam-0.203
 Amen-0.198
Token his
Token ID465
Feature activation+4.19e+01

Pos logit contributions
 and+0.191
,+0.169
.+0.153
!+0.136
 because+0.134
Neg logit contributions
igsaw-0.192
nam-0.191
Mic-0.182
 ey-0.179
 Amen-0.167
Token show
Token ID905
Feature activation+6.20e+01

Pos logit contributions
 and+0.164
,+0.148
.+0.133
 -+0.126
!+0.124
Neg logit contributions
nam-0.134
igsaw-0.134
Mic-0.124
 ey-0.123
 redd-0.116
Token that
Token ID326
Feature activation+6.07e+01

Pos logit contributions
int+0.145
+0.127
aned+0.126
wr+0.126
use+0.125
Neg logit contributions
igsaw-0.149
 chess-0.147
 student-0.146
reating-0.146
 gol-0.145
Token the
Token ID262
Feature activation+4.91e+01

Pos logit contributions
 and+0.195
owed+0.192
int+0.178
 -+0.169
;+0.169
Neg logit contributions
nam-0.162
 microsc-0.129
igsaw-0.127
 redd-0.120
 gol-0.118
Token careful
Token ID8161
Feature activation+6.19e+01

Pos logit contributions
 and+0.173
,+0.156
.+0.141
!+0.129
 said+0.129
Neg logit contributions
nam-0.141
igsaw-0.138
 ey-0.130
Mic-0.129
 Amen-0.121
Token bean
Token ID26394
Feature activation+4.72e+01

Pos logit contributions
 and+0.217
,+0.202
 said+0.189
:+0.179
 answered+0.179
Neg logit contributions
igsaw-0.150
nam-0.135
 ey-0.128
 redd-0.127
Mic-0.125
Token performed
Token ID6157
Feature activation+1.07e+02

Pos logit contributions
int+0.111
 :+0.110
ged+0.110
oned+0.110
;+0.108
Neg logit contributions
 microsc-0.089
pill-0.088
igators-0.086
igator-0.084
igsaw-0.083
Token.
Token ID13
Feature activation+7.34e+01

Pos logit contributions
 approached+0.215
 rang+0.210
 replied+0.208
 sat+0.203
 shook+0.193
Neg logit contributions
igsaw-0.361
nam-0.336
 outer-0.336
 microsc-0.333
 chess-0.301
Token And
Token ID843
Feature activation+7.93e+01

Pos logit contributions
 and+0.243
,+0.222
 -+0.204
!+0.202
 kept+0.200
Neg logit contributions
igsaw-0.185
Mic-0.169
 Amen-0.168
nam-0.160
 microsc-0.155
Token they
Token ID484
Feature activation+5.52e+01

Pos logit contributions
 and+0.188
,+0.161
.+0.144
!+0.130
 -+0.129
Neg logit contributions
nam-0.297
igsaw-0.284
Mic-0.268
 ey-0.266
 microsc-0.262
Token all
Token ID477
Feature activation+5.35e+01

Pos logit contributions
 and+0.148
;+0.133
,+0.129
 -+0.128
owed+0.122
Neg logit contributions
nam-0.157
 microsc-0.131
 narrative-0.129
 gol-0.129
igsaw-0.128
Token lived
Token ID5615
Feature activation+7.76e+01

Pos logit contributions
 and+0.127
;+0.114
 -+0.110
 but+0.107
,+0.105
Neg logit contributions
nam-0.149
 Amen-0.144
igsaw-0.137
 microsc-0.137
 gol-0.137
Token and
Token ID290
Feature activation+8.00e+01

Pos logit contributions
 and+0.048
,+0.020
.-0.004
!-0.022
 said-0.023
Neg logit contributions
nam-0.428
igsaw-0.427
Mic-0.411
 ey-0.410
 Amen-0.400
Token cars
Token ID5006
Feature activation+5.51e+01

Pos logit contributions
 and+0.236
,+0.199
.+0.176
 said+0.152
 because+0.149
Neg logit contributions
igsaw-0.303
Mic-0.280
nam-0.275
 ey-0.274
 Amen-0.267
Token.
Token ID13
Feature activation+7.46e+01

Pos logit contributions
 replied+0.092
owed+0.086
int+0.081
 answered+0.080
 declined+0.080
Neg logit contributions
reating-0.185
nam-0.183
 gol-0.175
 recess-0.174
saw-0.172
Token Sometimes
Token ID8975
Feature activation+4.93e+01

Pos logit contributions
 and+0.204
,+0.176
.+0.156
 said+0.132
 because+0.129
Neg logit contributions
nam-0.290
igsaw-0.281
Mic-0.271
 ey-0.268
 Amen-0.258
Token they
Token ID484
Feature activation+5.61e+01

Pos logit contributions
 and+0.141
owed+0.131
int+0.124
hed+0.117
 grabbed+0.113
Neg logit contributions
nam-0.136
 microsc-0.120
 avail-0.120
igsaw-0.109
 Tues-0.107
Token knocked
Token ID13642
Feature activation+1.07e+02

Pos logit contributions
 and+0.134
;+0.134
,+0.130
 -+0.130
 –+0.123
Neg logit contributions
nam-0.197
ptions-0.155
Mic-0.153
 redd-0.149
 microsc-0.147
Token them
Token ID606
Feature activation+3.92e+01

Pos logit contributions
owed+0.257
 replied+0.245
 and+0.241
,+0.235
 -+0.228
Neg logit contributions
nam-0.320
reating-0.291
igsaw-0.284
 ey-0.271
 microsc-0.266
Token down
Token ID866
Feature activation+5.10e+01

Pos logit contributions
anas+0.124
 replied+0.117
 declined+0.117
uddled+0.116
 nodded+0.112
Neg logit contributions
nam-0.097
igsaw-0.075
saw-0.074
 outer-0.069
 redd-0.068
Token and
Token ID290
Feature activation+8.43e+01

Pos logit contributions
int+0.127
 replied+0.126
anna+0.117
 spoke+0.116
 whispered+0.113
Neg logit contributions
igsaw-0.150
nam-0.142
 microsc-0.136
 outer-0.135
 math-0.130
Token started
Token ID2067
Feature activation+7.69e+01

Pos logit contributions
 and+0.194
;+0.188
 -+0.188
 –+0.178
owed+0.176
Neg logit contributions
nam-0.335
 microsc-0.246
 redd-0.239
 ey-0.231
ptions-0.231
Token again
Token ID757
Feature activation+7.04e+01

Pos logit contributions
 and+0.167
,+0.142
 said+0.128
 -+0.126
 replied+0.125
Neg logit contributions
nam-0.290
igsaw-0.275
 microsc-0.260
 ey-0.255
ptions-0.248
Token the
Token ID262
Feature activation+5.10e+01

Pos logit contributions
 and+0.220
,+0.198
.+0.177
!+0.167
 -+0.166
Neg logit contributions
nam-0.215
igsaw-0.202
 ey-0.193
Mic-0.191
 microsc-0.181
Token tied
Token ID8165
Feature activation+8.02e+01

Pos logit contributions
 and+0.182
,+0.169
:+0.160
owed+0.160
 -+0.159
Neg logit contributions
nam-0.122
 redd-0.113
igsaw-0.112
 ey-0.105
 ju-0.102
Token rope
Token ID17182
Feature activation+6.52e+01

Pos logit contributions
 and+0.185
anas+0.184
 woke+0.172
 said+0.168
 replied+0.168
Neg logit contributions
igsaw-0.234
nam-0.228
 ey-0.217
 redd-0.215
 Lorenzo-0.197
Token around
Token ID1088
Feature activation+5.12e+01

Pos logit contributions
anas+0.158
int+0.133
 woke+0.127
 agree+0.127
 overheard+0.126
Neg logit contributions
igsaw-0.186
nam-0.171
 redd-0.150
 ju-0.145
 sill-0.144
Token it
Token ID340
Feature activation+6.68e+01

Pos logit contributions
int+0.185
 rang+0.173
 replied+0.171
owed+0.170
uddled+0.170
Neg logit contributions
nam-0.098
igsaw-0.091
 microsc-0.085
 ey-0.075
 Lorenzo-0.068
Token served
Token ID4983
Feature activation+1.07e+02

Pos logit contributions
anas+0.148
int+0.138
wo+0.133
round+0.133
resp+0.128
Neg logit contributions
igsaw-0.173
 ju-0.162
 sill-0.157
nam-0.152
 redd-0.142
Token as
Token ID355
Feature activation+6.76e+01

Pos logit contributions
 rang+0.265
 replied+0.235
 overheard+0.233
 and+0.232
 approached+0.230
Neg logit contributions
nam-0.301
igsaw-0.288
 microsc-0.266
 redd-0.261
 ju-0.258
Token a
Token ID257
Feature activation+4.68e+01

Pos logit contributions
 and+0.214
,+0.191
.+0.170
!+0.155
 -+0.154
Neg logit contributions
nam-0.216
igsaw-0.213
 ey-0.202
Mic-0.201
 Amen-0.190
Token reminder
Token ID15438
Feature activation+9.72e+01

Pos logit contributions
 and+0.170
owed+0.155
,+0.147
 because+0.143
red+0.137
Neg logit contributions
igsaw-0.114
Mic-0.095
nam-0.093
 gol-0.092
 ju-0.090
Token to
Token ID284
Feature activation+6.95e+01

Pos logit contributions
 argued+0.358
 roared+0.342
 talked+0.340
 rang+0.340
anas+0.339
Neg logit contributions
igsaw-0.178
 ey-0.110
rou-0.108
 possible-0.104
reating-0.096
Token her
Token ID607
Feature activation+4.52e+01

Pos logit contributions
 and+0.197
,+0.173
.+0.156
 -+0.151
;+0.148
Neg logit contributions
nam-0.226
igsaw-0.217
 ey-0.197
 microsc-0.184
Mic-0.182
Token you
Token ID345
Feature activation-1.63e+02

Pos logit contributions
 and+0.255
,+0.222
.+0.196
!+0.177
 -+0.175
Neg logit contributions
nam-0.327
igsaw-0.316
 ey-0.301
Mic-0.300
 Amen-0.286
Token go
Token ID467
Feature activation-1.31e+02

Pos logit contributions
nam+0.389
 Amen+0.388
 possible+0.368
 narrative+0.334
 Deal+0.327
Neg logit contributions
uddled-0.407
 scanned-0.376
 and-0.365
anas-0.350
owed-0.328
Token,
Token ID11
Feature activation-9.47e+01

Pos logit contributions
 Amen+0.326
 outer+0.312
reating+0.302
nam+0.301
saw+0.298
Neg logit contributions
owed-0.343
red-0.309
 nodded-0.307
 shook-0.299
 answered-0.295
Token my
Token ID616
Feature activation-1.31e+02

Pos logit contributions
nam+0.258
 redd+0.224
igsaw+0.220
 ju+0.220
 ey+0.219
Neg logit contributions
owed-0.250
 and-0.243
hed-0.235
red-0.229
uddled-0.229
Token d
Token ID288
Feature activation-1.25e+02

Pos logit contributions
nam+0.281
igsaw+0.278
 ey+0.256
 gol+0.246
 Amen+0.245
Neg logit contributions
 and-0.420
hed-0.396
 spoke-0.389
owed-0.385
 approached-0.384
Tokenarl
Token ID7063
Feature activation-2.55e+02

Pos logit contributions
igsaw+0.124
arl+0.115
 ey+0.090
reating+0.060
ocol+0.054
Neg logit contributions
 and-0.656
,-0.630
 said-0.604
 watched-0.579
 told-0.579
Tokenings
Token ID654
Feature activation-8.31e+01

Pos logit contributions
igsaw+0.818
 ey+0.772
arl+0.759
Mic+0.748
nam+0.743
Neg logit contributions
 and-0.643
,-0.547
 because-0.513
.-0.498
 --0.489
Token.
Token ID13
Feature activation-7.46e+01

Pos logit contributions
reating+0.231
arl+0.217
 concession+0.202
 Amen+0.202
igsaw+0.195
Neg logit contributions
owed-0.169
 cheeks-0.167
 knees-0.167
anged-0.167
 argued-0.163
Token Enjoy
Token ID18179
Feature activation-9.90e+01

Pos logit contributions
nam+0.200
igsaw+0.196
 ey+0.184
Mic+0.183
 Amen+0.174
Neg logit contributions
 and-0.271
,-0.245
.-0.223
!-0.208
 --0.206
Token your
Token ID534
Feature activation-1.13e+02

Pos logit contributions
nam+0.199
igsaw+0.188
 ey+0.185
reating+0.180
 avail+0.161
Neg logit contributions
 and-0.314
 rang-0.302
 spoke-0.300
owed-0.299
 stood-0.296
Token lunch
Token ID9965
Feature activation-9.60e+01

Pos logit contributions
nam+0.216
 ey+0.215
igsaw+0.204
 microsc+0.179
ocol+0.163
Neg logit contributions
 and-0.418
owed-0.391
hed-0.383
;-0.374
 --0.369
Token's
Token ID338
Feature activation-1.19e+02

Pos logit contributions
 redd+0.264
 ju+0.250
igsaw+0.238
nam+0.231
 Amen+0.227
Neg logit contributions
owed-0.255
 approached-0.243
 entered-0.236
ged-0.233
 and-0.229
Token wonderful
Token ID7932
Feature activation-1.13e+02

Pos logit contributions
nam+0.322
 Amen+0.256
 Deal+0.243
igsaw+0.242
 redd+0.240
Neg logit contributions
 and-0.413
 approached-0.363
hed-0.357
,-0.357
 spoke-0.357
Token,
Token ID11
Feature activation-8.21e+01

Pos logit contributions
igsaw+0.496
nam+0.470
 ey+0.459
 Amen+0.444
 microsc+0.441
Neg logit contributions
 and-0.129
 entered-0.118
 rang-0.114
 got-0.106
 died-0.106
Token my
Token ID616
Feature activation-1.19e+02

Pos logit contributions
igsaw+0.171
nam+0.169
 redd+0.148
Mic+0.142
 microsc+0.141
Neg logit contributions
 and-0.291
owed-0.280
,-0.265
hed-0.258
 rang-0.256
Token d
Token ID288
Feature activation-1.14e+02

Pos logit contributions
nam+0.252
igsaw+0.247
 ey+0.212
rou+0.204
 gol+0.198
Neg logit contributions
 spoke-0.406
 approached-0.405
 and-0.403
 entered-0.398
hed-0.396
Tokenarl
Token ID7063
Feature activation-2.44e+02

Pos logit contributions
igsaw+0.160
 ey+0.124
arl+0.121
nam+0.096
rou+0.084
Neg logit contributions
 and-0.585
,-0.556
 said-0.521
.-0.504
 watched-0.495
Tokenings
Token ID654
Feature activation-7.36e+01

Pos logit contributions
igsaw+0.745
 ey+0.705
arl+0.663
nam+0.663
Mic+0.661
Neg logit contributions
 and-0.672
,-0.584
 because-0.556
.-0.531
 --0.518
Token!"
Token ID2474
Feature activation-6.27e+01

Pos logit contributions
arl+0.201
igsaw+0.191
reating+0.176
ifying+0.171
 narrative+0.169
Neg logit contributions
 knees-0.173
owed-0.157
ided-0.155
anged-0.154
iously-0.153
Token Mom
Token ID11254
Feature activation-8.15e+01

Pos logit contributions
nam+0.146
Mic+0.127
 Amen+0.123
igsaw+0.122
 Gw+0.116
Neg logit contributions
owed-0.190
 and-0.178
uddled-0.171
 --0.168
 stayed-0.165
Tokenmy
Token ID1820
Feature activation-1.29e+02

Pos logit contributions
 Amen+0.222
 ey+0.211
iam+0.193
rou+0.192
 Deal+0.191
Neg logit contributions
 and-0.178
;-0.173
 because-0.173
,-0.169
 --0.167
Token says
Token ID1139
Feature activation-6.24e+01

Pos logit contributions
 Amen+0.343
 Justin+0.342
 Alexa+0.338
 Avery+0.327
 Luke+0.324
Neg logit contributions
 than-0.265
-0.248
 because-0.236
Exit-0.231
age-0.226
Token reward
Token ID6721
Feature activation-1.07e+02

Pos logit contributions
 ey+0.475
nam+0.472
igsaw+0.466
 microsc+0.398
 redd+0.390
Neg logit contributions
owed-0.516
 and-0.472
 rang-0.470
 listened-0.469
uddled-0.457
Token.
Token ID13
Feature activation-9.04e+01

Pos logit contributions
nam+0.332
 possible+0.309
reating+0.306
igsaw+0.297
 yours+0.289
Neg logit contributions
 rang-0.203
 turned-0.190
 woke-0.184
int-0.184
 tapped-0.180
Token It
Token ID632
Feature activation-1.01e+02

Pos logit contributions
igsaw+0.217
nam+0.216
 Amen+0.210
Mic+0.206
 Deal+0.180
Neg logit contributions
 and-0.309
,-0.279
owed-0.257
 kept-0.254
!-0.253
Token was
Token ID373
Feature activation-9.63e+01

Pos logit contributions
 ju+0.258
 redd+0.257
Looks+0.237
 Amen+0.225
 brav+0.223
Neg logit contributions
 and-0.235
;-0.232
,-0.216
 --0.208
uddled-0.208
Token ch
Token ID442
Feature activation-1.16e+02

Pos logit contributions
nam+0.247
 redd+0.204
igsaw+0.203
 ju+0.186
 outer+0.184
Neg logit contributions
 and-0.306
ops-0.280
owed-0.279
uddled-0.275
,-0.272
Tokenocol
Token ID4668
Feature activation-2.39e+02

Pos logit contributions
igsaw+0.199
 ey+0.167
nam+0.162
Mic+0.146
years+0.130
Neg logit contributions
 and-0.545
,-0.508
 said-0.479
.-0.475
 --0.451
Tokenates
Token ID689
Feature activation-9.23e+01

Pos logit contributions
nam+0.635
igsaw+0.618
 ey+0.587
Mic+0.586
 Amen+0.530
Neg logit contributions
 and-0.869
,-0.787
.-0.712
 said-0.673
 because-0.658
Token and
Token ID290
Feature activation-6.54e+01

Pos logit contributions
ifying+0.304
nam+0.289
igsaw+0.287
rou+0.282
 possible+0.273
Neg logit contributions
 checked-0.154
 tapped-0.146
 rang-0.144
 approached-0.141
 calmed-0.139
Token sweets
Token ID42402
Feature activation-1.02e+02

Pos logit contributions
nam+0.225
 microsc+0.168
igsaw+0.166
 queens+0.166
 redd+0.163
Neg logit contributions
 and-0.173
int-0.154
owed-0.147
,-0.144
uddled-0.140
Token!
Token ID0
Feature activation-5.61e+01

Pos logit contributions
 ju+0.316
nam+0.313
 possible+0.285
hel+0.280
 redd+0.279
Neg logit contributions
int-0.185
anged-0.173
action-0.148
 tapped-0.148
okes-0.146
Token
Token ID198
Feature activation-9.36e+01

Pos logit contributions
igsaw+0.140
nam+0.138
Mic+0.135
 Amen+0.125
 Deal+0.117
Neg logit contributions
 and-0.193
,-0.179
.-0.158
 --0.157
owed-0.157
Token Everyday
Token ID48154
Feature activation-1.27e+02

Pos logit contributions
anas+0.150
iously+0.148
int+0.147
ages+0.145
ANG+0.142
Neg logit contributions
nam-0.156
rou-0.116
 gol-0.115
 microsc-0.112
 mer-0.110
Token he
Token ID339
Feature activation-1.19e+02

Pos logit contributions
 Wed+0.277
nam+0.259
igsaw+0.250
 Amen+0.246
 Tues+0.244
Neg logit contributions
owed-0.366
uddled-0.361
ave-0.351
hed-0.335
 and-0.332
Token would
Token ID561
Feature activation-1.09e+02

Pos logit contributions
 Amen+0.286
nam+0.254
 brav+0.239
die+0.234
 redd+0.230
Neg logit contributions
uddled-0.305
;-0.259
eness-0.251
 and-0.250
orting-0.245
Token take
Token ID1011
Feature activation-1.07e+02

Pos logit contributions
 Amen+0.270
nam+0.207
 Tues+0.181
 outer+0.181
igsaw+0.164
Neg logit contributions
uddled-0.384
ed-0.335
orted-0.323
awed-0.309
led-0.295
Token the
Token ID262
Feature activation-1.06e+02

Pos logit contributions
igsaw+0.295
nam+0.284
Mic+0.262
 microsc+0.245
 ey+0.239
Neg logit contributions
 and-0.336
,-0.290
 said-0.272
owed-0.271
:-0.265
Token av
Token ID1196
Feature activation-2.38e+02

Pos logit contributions
nam+0.283
igsaw+0.268
 ey+0.246
 redd+0.239
Mic+0.237
Neg logit contributions
 and-0.367
,-0.337
:-0.319
owed-0.313
 --0.307
Tokenoc
Token ID420
Feature activation-2.19e+02

Pos logit contributions
nam+0.950
 ey+0.907
igsaw+0.882
 microsc+0.877
 Amen+0.845
Neg logit contributions
 and-0.655
,-0.559
.-0.455
!-0.427
 --0.397
Tokenados
Token ID22484
Feature activation-1.11e+02

Pos logit contributions
nam+0.541
igsaw+0.536
Mic+0.499
 ey+0.494
 Amen+0.454
Neg logit contributions
 and-0.836
,-0.758
.-0.697
 said-0.661
!-0.645
Token and
Token ID290
Feature activation-6.52e+01

Pos logit contributions
nam+0.367
igsaw+0.334
Mic+0.304
 ju+0.300
 possible+0.296
Neg logit contributions
 woke-0.228
 rang-0.226
 died-0.225
 cheeks-0.221
 replied-0.219
Token scatter
Token ID41058
Feature activation-7.37e+01

Pos logit contributions
nam+0.223
Mic+0.190
igsaw+0.186
 redd+0.183
 ju+0.174
Neg logit contributions
 and-0.184
,-0.157
owed-0.147
.-0.145
 --0.143
Token them
Token ID606
Feature activation-1.07e+02

Pos logit contributions
nam+0.197
igsaw+0.195
 redd+0.179
Mic+0.176
 ju+0.168
Neg logit contributions
 and-0.215
owed-0.212
 rang-0.204
 replied-0.191
,-0.190
Token When
Token ID1649
Feature activation-8.90e+01

Pos logit contributions
nam+0.230
 Amen+0.223
igsaw+0.220
Mic+0.204
 ey+0.201
Neg logit contributions
 and-0.345
,-0.309
owed-0.291
le-0.289
!-0.289
Token Daisy
Token ID40355
Feature activation-1.17e+02

Pos logit contributions
nam+0.228
igsaw+0.204
 Amen+0.184
 microsc+0.184
 avail+0.183
Neg logit contributions
 and-0.304
owed-0.260
;-0.239
uddled-0.238
int-0.237
Token put
Token ID1234
Feature activation-7.02e+01

Pos logit contributions
uka+0.264
pill+0.244
 Avery+0.239
Ru+0.238
Isa+0.238
Neg logit contributions
eness-0.287
ikes-0.281
 than-0.277
anas-0.273
ower-0.268
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
nam+0.231
igsaw+0.218
 ey+0.209
 redd+0.192
Mic+0.188
Neg logit contributions
 and-0.190
,-0.161
 --0.148
 wanted-0.146
;-0.146
Token magn
Token ID7842
Feature activation-1.05e+02

Pos logit contributions
nam+0.264
 ey+0.241
igsaw+0.233
 redd+0.221
 microsc+0.200
Neg logit contributions
 and-0.353
,-0.326
 because-0.311
owed-0.308
:-0.305
Tokenifying
Token ID4035
Feature activation-2.34e+02

Pos logit contributions
igsaw+0.079
nam+0.061
ifying+0.056
 ey+0.042
Mic+0.027
Neg logit contributions
 and-0.554
,-0.528
 said-0.500
.-0.499
 because-0.496
Token glass
Token ID5405
Feature activation-9.07e+01

Pos logit contributions
igsaw+0.646
nam+0.640
 ey+0.602
Mic+0.579
 Amen+0.548
Neg logit contributions
 and-0.817
,-0.733
.-0.665
 --0.618
 said-0.617
Token down
Token ID866
Feature activation-9.92e+01

Pos logit contributions
igsaw+0.383
 ey+0.333
nam+0.312
 ju+0.304
 redd+0.302
Neg logit contributions
anas-0.120
 watched-0.113
 rang-0.110
 listened-0.109
 explained-0.104
Token,
Token ID11
Feature activation-6.87e+01

Pos logit contributions
igsaw+0.345
 ey+0.334
nam+0.331
 microsc+0.297
 redd+0.293
Neg logit contributions
anas-0.208
 rang-0.166
 explained-0.165
int-0.163
 agree-0.160
Token the
Token ID262
Feature activation-8.83e+01

Pos logit contributions
nam+0.230
igsaw+0.222
Mic+0.208
 ey+0.207
 microsc+0.197
Neg logit contributions
 and-0.203
,-0.176
.-0.158
 --0.145
!-0.143
Token cater
Token ID20825
Feature activation-1.07e+02

Pos logit contributions
nam+0.227
 ey+0.213
igsaw+0.205
 microsc+0.184
Mic+0.178
Neg logit contributions
 and-0.303
,-0.275
owed-0.267
 because-0.259
;-0.259
Token.
Token ID13
Feature activation-8.30e+01

Pos logit contributions
nam+0.442
 Amen+0.404
rou+0.399
Acc+0.381
Mic+0.371
Neg logit contributions
 cheeks-0.178
ar-0.163
 scanned-0.162
ops-0.152
 shook-0.152
Token She
Token ID1375
Feature activation-9.14e+01

Pos logit contributions
nam+0.271
igsaw+0.266
 ey+0.252
Mic+0.250
 Amen+0.237
Neg logit contributions
 and-0.256
,-0.228
.-0.202
!-0.183
 --0.182
Token loved
Token ID6151
Feature activation-6.63e+01

Pos logit contributions
Mic+0.173
 Amen+0.173
Acc+0.165
rou+0.163
nam+0.156
Neg logit contributions
eness-0.263
iously-0.237
ides-0.235
oves-0.234
ided-0.232
Token to
Token ID284
Feature activation-8.26e+01

Pos logit contributions
nam+0.281
igsaw+0.275
 ey+0.264
Mic+0.264
 microsc+0.253
Neg logit contributions
 and-0.137
,-0.116
.-0.096
 --0.082
!-0.082
Token eat
Token ID4483
Feature activation-8.74e+01

Pos logit contributions
nam+0.242
igsaw+0.230
 ey+0.230
 Amen+0.227
Mic+0.223
Neg logit contributions
 and-0.235
,-0.203
 --0.194
ed-0.191
;-0.187
Token av
Token ID1196
Feature activation-2.29e+02

Pos logit contributions
nam+0.214
 ey+0.188
Mic+0.183
igsaw+0.183
 microsc+0.181
Neg logit contributions
 and-0.296
,-0.270
owed-0.260
 --0.251
 said-0.250
Tokenoc
Token ID420
Feature activation-2.08e+02

Pos logit contributions
nam+0.997
igsaw+0.956
 ey+0.941
 microsc+0.924
 Deal+0.910
Neg logit contributions
 and-0.621
,-0.529
.-0.414
!-0.389
 --0.329
Tokenados
Token ID22484
Feature activation-1.03e+02

Pos logit contributions
nam+0.513
igsaw+0.506
Mic+0.471
 ey+0.468
 Amen+0.424
Neg logit contributions
 and-0.798
,-0.723
.-0.665
 said-0.620
!-0.616
Token.
Token ID13
Feature activation-6.53e+01

Pos logit contributions
nam+0.415
Mic+0.367
igsaw+0.363
 ju+0.350
 redd+0.340
Neg logit contributions
 woke-0.144
 shook-0.143
 died-0.143
 and-0.139
 replied-0.138
Token One
Token ID1881
Feature activation-7.40e+01

Pos logit contributions
nam+0.241
igsaw+0.237
 ey+0.226
Mic+0.224
 Amen+0.214
Neg logit contributions
 and-0.176
,-0.153
.-0.133
!-0.117
 --0.117
Token day
Token ID1110
Feature activation-7.29e+01

Pos logit contributions
nam+0.215
igsaw+0.212
 ey+0.197
Mic+0.197
 redd+0.184
Neg logit contributions
 and-0.249
,-0.223
.-0.201
!-0.188
 --0.187
Token mum
Token ID25682
Feature activation-1.35e+02

Pos logit contributions
hed+0.227
 replied+0.225
 handed+0.213
int+0.213
owed+0.212
Neg logit contributions
 possible-0.157
nam-0.128
 outer-0.119
reating-0.113
 Wed-0.113
Token and
Token ID290
Feature activation-8.96e+01

Pos logit contributions
 Amen+0.413
nam+0.401
rou+0.386
 av+0.384
 ey+0.372
Neg logit contributions
 than-0.152
ANG-0.150
Exit-0.150
 pierced-0.150
anas-0.146
Token her
Token ID607
Feature activation-1.27e+02

Pos logit contributions
nam+0.312
chard+0.219
igsaw+0.217
 redd+0.216
 Amen+0.216
Neg logit contributions
 and-0.211
,-0.182
;-0.176
 --0.175
owed-0.170
Token three
Token ID1115
Feature activation-1.40e+02

Pos logit contributions
nam+0.278
igsaw+0.232
 ey+0.216
 microsc+0.202
 Amen+0.197
Neg logit contributions
 and-0.428
,-0.380
 because-0.374
owed-0.373
hed-0.369
Token-
Token ID12
Feature activation-1.86e+02

Pos logit contributions
years+0.276
igsaw+0.271
nam+0.264
 Amen+0.256
ocol+0.250
Neg logit contributions
 and-0.399
 rang-0.397
ANG-0.380
 because-0.373
 broke-0.352
Tokenyear
Token ID1941
Feature activation-2.26e+02

Pos logit contributions
nam+0.274
igsaw+0.249
Mic+0.220
 ey+0.206
 Amen+0.198
Neg logit contributions
 and-0.906
,-0.846
.-0.778
!-0.746
 because-0.744
Token-
Token ID12
Feature activation-1.82e+02

Pos logit contributions
igsaw+0.876
nam+0.826
 ey+0.791
 microsc+0.756
 redd+0.753
Neg logit contributions
 and-0.464
,-0.389
 said-0.325
.-0.318
 because-0.316
Tokenold
Token ID727
Feature activation-1.27e+02

Pos logit contributions
nam+0.388
igsaw+0.375
Mic+0.338
years+0.317
 ey+0.315
Neg logit contributions
 and-0.753
,-0.699
.-0.628
 because-0.607
!-0.595
Token daughter
Token ID4957
Feature activation-9.94e+01

Pos logit contributions
nam+0.312
years+0.274
igsaw+0.268
iam+0.259
 spec+0.253
Neg logit contributions
 and-0.336
 shook-0.296
 because-0.294
,-0.289
 grew-0.288
Token went
Token ID1816
Feature activation-7.04e+01

Pos logit contributions
nam+0.289
years+0.268
 microsc+0.255
igsaw+0.252
 dining+0.248
Neg logit contributions
 and-0.159
uddled-0.157
int-0.153
 because-0.152
wr-0.151
Token to
Token ID284
Feature activation-8.44e+01

Pos logit contributions
igsaw+0.204
nam+0.185
 Amen+0.180
reating+0.177
 outer+0.169
Neg logit contributions
owed-0.163
 liked-0.159
 and-0.158
 hated-0.157
 wanted-0.156
Token.
Token ID13
Feature activation-7.72e+01

Pos logit contributions
nam+0.415
 Amen+0.387
rou+0.364
Acc+0.358
Mic+0.337
Neg logit contributions
 cheeks-0.170
ops-0.170
ar-0.162
 scanned-0.150
anas-0.148
Token She
Token ID1375
Feature activation-8.63e+01

Pos logit contributions
nam+0.266
igsaw+0.260
 ey+0.249
Mic+0.245
 Amen+0.233
Neg logit contributions
 and-0.229
,-0.203
.-0.179
!-0.158
 --0.157
Token loved
Token ID6151
Feature activation-6.19e+01

Pos logit contributions
Mic+0.138
 Amen+0.136
Acc+0.130
nam+0.126
rou+0.118
Neg logit contributions
eness-0.299
iously-0.266
ides-0.262
oves-0.261
ales-0.253
Token to
Token ID284
Feature activation-7.89e+01

Pos logit contributions
nam+0.266
igsaw+0.260
 ey+0.250
Mic+0.250
 microsc+0.240
Neg logit contributions
 and-0.125
,-0.105
.-0.086
!-0.073
 --0.073
Token eat
Token ID4483
Feature activation-8.33e+01

Pos logit contributions
nam+0.203
igsaw+0.202
 ey+0.199
 Amen+0.198
Mic+0.184
Neg logit contributions
 and-0.235
ed-0.218
,-0.201
 --0.200
owed-0.198
Token av
Token ID1196
Feature activation-2.24e+02

Pos logit contributions
nam+0.206
 ey+0.182
igsaw+0.181
Mic+0.178
 microsc+0.176
Neg logit contributions
 and-0.282
,-0.255
owed-0.242
 --0.236
 said-0.235
Tokenoc
Token ID420
Feature activation-2.03e+02

Pos logit contributions
nam+0.986
igsaw+0.940
 ey+0.927
 microsc+0.905
 Deal+0.899
Neg logit contributions
 and-0.611
,-0.525
.-0.406
!-0.383
 --0.322
Tokenados
Token ID22484
Feature activation-9.81e+01

Pos logit contributions
nam+0.507
igsaw+0.498
Mic+0.464
 ey+0.462
 Amen+0.420
Neg logit contributions
 and-0.779
,-0.705
.-0.647
 said-0.603
!-0.600
Token because
Token ID780
Feature activation-5.14e+01

Pos logit contributions
nam+0.396
igsaw+0.350
Mic+0.348
 ju+0.333
 redd+0.325
Neg logit contributions
 woke-0.136
 died-0.135
 shook-0.131
 and-0.129
 stopped-0.128
Token they
Token ID484
Feature activation-7.94e+01

Pos logit contributions
igsaw+0.217
 ey+0.216
nam+0.206
Mic+0.203
 Deal+0.197
Neg logit contributions
 and-0.120
,-0.103
.-0.085
 because-0.073
 --0.069
Token were
Token ID547
Feature activation-7.38e+01

Pos logit contributions
nam+0.311
Mic+0.269
 redd+0.258
 ju+0.247
 Amen+0.245
Neg logit contributions
 and-0.183
,-0.155
.-0.134
!-0.129
owed-0.129
Token't
Token ID470
Feature activation-1.58e+02

Pos logit contributions
 and+0.157
hed+0.132
owed+0.128
int+0.125
le+0.124
Neg logit contributions
nam-0.323
igsaw-0.278
 redd-0.275
 microsc-0.269
 av-0.268
Token know
Token ID760
Feature activation-1.15e+02

Pos logit contributions
nam+0.444
 possible+0.389
 Amen+0.372
 gol+0.346
reating+0.339
Neg logit contributions
uddled-0.441
 and-0.422
ops-0.380
hed-0.362
-0.353
Token the
Token ID262
Feature activation-1.16e+02

Pos logit contributions
nam+0.309
 microsc+0.269
reating+0.246
 outer+0.230
 queens+0.223
Neg logit contributions
hed-0.351
int-0.343
orted-0.330
ave-0.322
 replied-0.313
Token rh
Token ID9529
Feature activation-1.47e+02

Pos logit contributions
nam+0.305
 ey+0.281
igsaw+0.281
 redd+0.264
 microsc+0.252
Neg logit contributions
 and-0.399
,-0.369
owed-0.339
 --0.323
:-0.319
Tokenin
Token ID259
Feature activation-1.63e+02

Pos logit contributions
nam+0.364
igsaw+0.349
Mic+0.329
 ey+0.321
 Amen+0.303
Neg logit contributions
 and-0.564
,-0.516
.-0.467
 --0.439
 said-0.438
Tokenoc
Token ID420
Feature activation-2.24e+02

Pos logit contributions
 Amen+0.184
igsaw+0.155
 ju+0.142
Mic+0.137
 gol+0.131
Neg logit contributions
 and-0.641
uddled-0.603
,-0.592
 shook-0.577
ided-0.574
Tokeneros
Token ID27498
Feature activation-1.07e+02

Pos logit contributions
igsaw+0.474
nam+0.460
 ey+0.429
Mic+0.422
 Amen+0.407
Neg logit contributions
 and-0.894
,-0.809
 said-0.749
.-0.747
 because-0.715
Token was
Token ID373
Feature activation-9.45e+01

Pos logit contributions
 microsc+0.194
spe+0.191
 av+0.190
 possible+0.190
 brav+0.187
Neg logit contributions
int-0.244
 cheeks-0.240
oves-0.232
our-0.227
zes-0.219
Token wild
Token ID4295
Feature activation-1.14e+02

Pos logit contributions
nam+0.233
 possible+0.190
 Amen+0.188
igsaw+0.184
 redd+0.179
Neg logit contributions
 and-0.284
owed-0.278
ops-0.274
,-0.254
our-0.247
Token.
Token ID13
Feature activation-7.08e+01

Pos logit contributions
igsaw+0.418
 microsc+0.405
reating+0.400
 spec+0.396
keeping+0.389
Neg logit contributions
 held-0.132
 repeated-0.130
 squeezed-0.127
 replied-0.126
 tapped-0.122
Token Can
Token ID1680
Feature activation-9.11e+01

Pos logit contributions
nam+0.170
igsaw+0.168
Mic+0.160
 Amen+0.159
 ey+0.144
Neg logit contributions
 and-0.249
,-0.225
!-0.203
.-0.201
 got-0.201
Token Ch
Token ID609
Feature activation-1.16e+02

Pos logit contributions
nam+0.197
Mic+0.197
igsaw+0.187
 Amen+0.167
 Deal+0.162
Neg logit contributions
 and-0.285
,-0.260
owed-0.244
 kept-0.242
 watered-0.240
Tokenir
Token ID343
Feature activation-2.00e+02

Pos logit contributions
igsaw+0.248
Mic+0.221
 ey+0.210
nam+0.199
ihu+0.185
Neg logit contributions
 and-0.485
,-0.445
.-0.402
 said-0.390
 because-0.384
Tokenpy
Token ID9078
Feature activation-1.42e+02

Pos logit contributions
 ey+0.435
Mic+0.424
igsaw+0.406
aph+0.391
nam+0.374
Neg logit contributions
 and-0.619
,-0.592
 handed-0.505
 because-0.502
 scanned-0.497
Token and
Token ID290
Feature activation-6.86e+01

Pos logit contributions
 Amen+0.343
die+0.343
pill+0.338
rou+0.319
 spec+0.302
Neg logit contributions
uddled-0.241
 Prop-0.235
anas-0.232
 afford-0.225
ANG-0.224
Token Wh
Token ID854
Feature activation-1.20e+02

Pos logit contributions
nam+0.196
Mic+0.168
 ey+0.160
igsaw+0.159
 microsc+0.159
Neg logit contributions
 and-0.221
,-0.193
 --0.181
.-0.180
;-0.175
Tokenisk
Token ID1984
Feature activation-2.23e+02

Pos logit contributions
Mic+0.176
igsaw+0.158
 ey+0.142
 microsc+0.125
ptions+0.123
Neg logit contributions
 and-0.550
,-0.512
 said-0.452
 gave-0.449
.-0.448
Tokeners
Token ID364
Feature activation-1.02e+02

Pos logit contributions
nam+0.671
igsaw+0.669
Mic+0.669
 ey+0.610
ptions+0.599
Neg logit contributions
 and-0.672
,-0.613
.-0.521
 because-0.501
 said-0.496
Token knew
Token ID2993
Feature activation-5.40e+01

Pos logit contributions
nam+0.206
 Amen+0.187
 narrative+0.182
Mic+0.176
Looks+0.169
Neg logit contributions
ry-0.206
 than-0.201
ails-0.200
int-0.199
ishing-0.199
Token that
Token ID326
Feature activation-8.08e+01

Pos logit contributions
nam+0.140
 microsc+0.128
 Gw+0.123
 Lorenzo+0.120
 Beat+0.120
Neg logit contributions
 and-0.134
hed-0.123
 agree-0.116
,-0.111
int-0.108
Token poison
Token ID8764
Feature activation-7.97e+01

Pos logit contributions
nam+0.219
 microsc+0.194
 Amen+0.175
 Lorenzo+0.167
Mic+0.164
Neg logit contributions
 and-0.218
int-0.218
owed-0.197
hed-0.197
ored-0.195
Token was
Token ID373
Feature activation-7.77e+01

Pos logit contributions
 microsc+0.156
 brav+0.153
 spec+0.152
 narrative+0.152
igators+0.152
Neg logit contributions
 agree-0.200
anas-0.173
our-0.172
int-0.172
round-0.171
Token and
Token ID290
Feature activation-8.46e+01

Pos logit contributions
nam+0.332
 late+0.327
 possible+0.326
igsaw+0.326
keeping+0.325
Neg logit contributions
 shook-0.196
 spoke-0.179
 tapped-0.179
 argued-0.176
 approached-0.173
Token play
Token ID711
Feature activation-9.01e+01

Pos logit contributions
nam+0.304
 Amen+0.273
igsaw+0.259
 microsc+0.251
 ey+0.251
Neg logit contributions
 and-0.212
,-0.180
 --0.163
.-0.161
;-0.160
Token with
Token ID351
Feature activation-8.90e+01

Pos logit contributions
 Amen+0.252
igsaw+0.244
nam+0.236
 Tues+0.231
 outer+0.228
Neg logit contributions
 rang-0.205
 answered-0.205
 shook-0.196
owed-0.195
 spoke-0.194
Token your
Token ID534
Feature activation-1.22e+02

Pos logit contributions
nam+0.227
igsaw+0.207
 microsc+0.195
 ey+0.190
Mic+0.188
Neg logit contributions
 and-0.306
,-0.272
 said-0.252
 replied-0.252
owed-0.252
Token or
Token ID393
Feature activation-8.53e+01

Pos logit contributions
nam+0.333
igsaw+0.305
 ey+0.299
 microsc+0.283
Mic+0.275
Neg logit contributions
 and-0.419
,-0.387
 --0.354
owed-0.351
;-0.347
Tokennam
Token ID7402
Feature activation-2.22e+02

Pos logit contributions
nam+0.257
igsaw+0.256
 ey+0.240
Mic+0.238
 Amen+0.225
Neg logit contributions
 and-0.284
,-0.255
.-0.228
!-0.209
 said-0.208
Tokenents
Token ID658
Feature activation-1.26e+02

Pos logit contributions
 ey+0.936
 Amen+0.922
 narrative+0.920
igsaw+0.920
 Deal+0.907
Neg logit contributions
 and-0.521
,-0.384
.-0.364
!-0.326
 --0.281
Token.
Token ID13
Feature activation-6.96e+01

Pos logit contributions
nam+0.449
reating+0.425
igsaw+0.406
keeping+0.400
 possible+0.391
Neg logit contributions
 replied-0.194
 shook-0.163
 explained-0.159
int-0.157
 tapped-0.156
Token Maybe
Token ID6674
Feature activation-7.48e+01

Pos logit contributions
igsaw+0.267
nam+0.267
 ey+0.258
Mic+0.244
 avail+0.236
Neg logit contributions
 and-0.189
,-0.159
.-0.141
 because-0.125
 said-0.115
Token you
Token ID345
Feature activation-1.01e+02

Pos logit contributions
nam+0.256
igsaw+0.224
Mic+0.222
 ey+0.212
 Amen+0.212
Neg logit contributions
 and-0.209
,-0.180
.-0.159
!-0.155
 --0.152
Token can
Token ID460
Feature activation-9.37e+01

Pos logit contributions
nam+0.105
ifying+0.097
 Amen+0.092
 seeker+0.090
 narrative+0.082
Neg logit contributions
uddled-0.382
 replied-0.381
iously-0.374
 cheeks-0.370
;-0.357
Token them
Token ID606
Feature activation-1.07e+02

Pos logit contributions
nam+0.232
 microsc+0.185
 moderation+0.178
Mic+0.172
igsaw+0.167
Neg logit contributions
 and-0.267
 replied-0.241
,-0.239
 rang-0.238
 shook-0.237
Token and
Token ID290
Feature activation-5.68e+01

Pos logit contributions
nam+0.360
 microsc+0.272
 possible+0.267
 queens+0.263
 ju+0.259
Neg logit contributions
 replied-0.254
 rang-0.242
int-0.239
 shook-0.236
 tapped-0.231
Token never
Token ID1239
Feature activation-6.90e+01

Pos logit contributions
nam+0.199
 ey+0.168
 microsc+0.163
 Amen+0.162
igsaw+0.159
Neg logit contributions
 and-0.144
,-0.131
!-0.116
.-0.115
 --0.114
Token threw
Token ID9617
Feature activation-5.02e+01

Pos logit contributions
nam+0.240
igsaw+0.196
 ey+0.195
 redd+0.192
 microsc+0.191
Neg logit contributions
 and-0.176
,-0.136
 --0.134
uddled-0.129
;-0.129
Token a
Token ID257
Feature activation-9.15e+01

Pos logit contributions
nam+0.127
 redd+0.104
igsaw+0.101
 microsc+0.095
 ey+0.094
Neg logit contributions
 replied-0.158
owed-0.157
 rang-0.157
 and-0.154
uddled-0.147
Token tant
Token ID24246
Feature activation-2.21e+02

Pos logit contributions
igsaw+0.236
nam+0.233
 ey+0.217
Mic+0.216
 redd+0.206
Neg logit contributions
 and-0.318
,-0.292
 --0.267
.-0.263
 because-0.262
Tokenrum
Token ID6582
Feature activation-7.45e+01

Pos logit contributions
nam+0.828
igsaw+0.811
 ey+0.779
Mic+0.761
 redd+0.746
Neg logit contributions
 and-0.585
,-0.519
.-0.452
 --0.406
!-0.403
Token again
Token ID757
Feature activation-6.35e+01

Pos logit contributions
nam+0.277
igsaw+0.263
reating+0.258
 Amen+0.257
rou+0.256
Neg logit contributions
 and-0.089
 whispered-0.078
 answered-0.077
 touched-0.076
 spoke-0.076
Token.
Token ID13
Feature activation-5.04e+01

Pos logit contributions
nam+0.182
igsaw+0.178
 avail+0.168
 possible+0.167
 microsc+0.164
Neg logit contributions
uddled-0.127
int-0.121
owed-0.114
hed-0.113
ave-0.113
Token<|endoftext|>
Token ID50256
Feature activation-4.98e+01

Pos logit contributions
nam+0.200
igsaw+0.195
 ey+0.188
Mic+0.183
 redd+0.176
Neg logit contributions
 and-0.130
,-0.111
.-0.098
 said-0.085
 because-0.084
TokenOnce
Token ID7454
Feature activation-5.74e+01

Pos logit contributions
nam+0.212
igsaw+0.201
 ey+0.193
Mic+0.191
 redd+0.186
Neg logit contributions
 and-0.124
,-0.105
.-0.094
!-0.078
 said-0.074
Token.
Token ID13
Feature activation-7.31e+01

Pos logit contributions
 Amen+0.362
rou+0.343
 redd+0.296
ini+0.295
die+0.292
Neg logit contributions
 scanned-0.201
 approached-0.201
owed-0.183
uddled-0.179
 argued-0.176
Token Did
Token ID7731
Feature activation-8.45e+01

Pos logit contributions
nam+0.212
igsaw+0.207
 ey+0.195
Mic+0.194
 Amen+0.181
Neg logit contributions
 and-0.252
,-0.227
.-0.204
!-0.188
 --0.188
Token you
Token ID345
Feature activation-1.03e+02

Pos logit contributions
nam+0.237
igsaw+0.232
 ey+0.217
Mic+0.216
 Amen+0.202
Neg logit contributions
 and-0.299
,-0.270
.-0.244
!-0.225
 --0.224
Token know
Token ID760
Feature activation-7.73e+01

Pos logit contributions
nam+0.227
 Amen+0.197
igsaw+0.194
 gol+0.173
 ey+0.172
Neg logit contributions
uddled-0.305
 and-0.301
ed-0.277
:-0.271
-0.269
Token that
Token ID326
Feature activation-7.39e+01

Pos logit contributions
nam+0.224
reating+0.180
 microsc+0.169
 outer+0.158
 queens+0.152
Neg logit contributions
 and-0.234
hed-0.225
red-0.215
ANG-0.214
ed-0.212
Token av
Token ID1196
Feature activation-2.20e+02

Pos logit contributions
nam+0.188
 microsc+0.146
 redd+0.137
igsaw+0.136
 ey+0.131
Neg logit contributions
 and-0.252
owed-0.233
int-0.218
uddled-0.215
hed-0.214
Tokenoc
Token ID420
Feature activation-1.99e+02

Pos logit contributions
nam+0.852
igsaw+0.835
 ey+0.802
 microsc+0.773
 Amen+0.742
Neg logit contributions
 and-0.620
,-0.544
.-0.452
!-0.425
 said-0.402
Tokenados
Token ID22484
Feature activation-9.45e+01

Pos logit contributions
nam+0.436
igsaw+0.434
Mic+0.410
 ey+0.395
 Amen+0.362
Neg logit contributions
 and-0.806
,-0.726
.-0.677
 said-0.646
 because-0.630
Token are
Token ID389
Feature activation-9.34e+01

Pos logit contributions
nam+0.176
igsaw+0.156
 ju+0.151
 ey+0.149
igator+0.146
Neg logit contributions
uddled-0.254
ided-0.246
orted-0.237
int-0.235
anged-0.232
Token very
Token ID845
Feature activation-8.23e+01

Pos logit contributions
nam+0.220
 redd+0.177
Mic+0.173
 ju+0.172
 possible+0.164
Neg logit contributions
owed-0.319
 and-0.319
hed-0.315
uddled-0.297
ed-0.295
Token healthy
Token ID5448
Feature activation-9.90e+01

Pos logit contributions
nam+0.201
igsaw+0.187
 redd+0.170
Mic+0.168
 ey+0.164
Neg logit contributions
 and-0.318
,-0.284
.-0.257
 --0.248
!-0.245
Token he
Token ID339
Feature activation-1.19e+02

Pos logit contributions
 Wed+0.277
nam+0.259
igsaw+0.250
 Amen+0.246
 Tues+0.244
Neg logit contributions
owed-0.366
uddled-0.361
ave-0.351
hed-0.335
 and-0.332
Token would
Token ID561
Feature activation-1.09e+02

Pos logit contributions
 Amen+0.286
nam+0.254
 brav+0.239
die+0.234
 redd+0.230
Neg logit contributions
uddled-0.305
;-0.259
eness-0.251
 and-0.250
orting-0.245
Token take
Token ID1011
Feature activation-1.07e+02

Pos logit contributions
 Amen+0.270
nam+0.207
 Tues+0.181
 outer+0.181
igsaw+0.164
Neg logit contributions
uddled-0.384
ed-0.335
orted-0.323
awed-0.309
led-0.295
Token the
Token ID262
Feature activation-1.06e+02

Pos logit contributions
igsaw+0.295
nam+0.284
Mic+0.262
 microsc+0.245
 ey+0.239
Neg logit contributions
 and-0.336
,-0.290
 said-0.272
owed-0.271
:-0.265
Token av
Token ID1196
Feature activation-2.38e+02

Pos logit contributions
nam+0.283
igsaw+0.268
 ey+0.246
 redd+0.239
Mic+0.237
Neg logit contributions
 and-0.367
,-0.337
:-0.319
owed-0.313
 --0.307
Tokenoc
Token ID420
Feature activation-2.19e+02

Pos logit contributions
nam+0.950
 ey+0.907
igsaw+0.882
 microsc+0.877
 Amen+0.845
Neg logit contributions
 and-0.655
,-0.559
.-0.455
!-0.427
 --0.397
Tokenados
Token ID22484
Feature activation-1.11e+02

Pos logit contributions
nam+0.541
igsaw+0.536
Mic+0.499
 ey+0.494
 Amen+0.454
Neg logit contributions
 and-0.836
,-0.758
.-0.697
 said-0.661
!-0.645
Token and
Token ID290
Feature activation-6.52e+01

Pos logit contributions
nam+0.367
igsaw+0.334
Mic+0.304
 ju+0.300
 possible+0.296
Neg logit contributions
 woke-0.228
 rang-0.226
 died-0.225
 cheeks-0.221
 replied-0.219
Token scatter
Token ID41058
Feature activation-7.37e+01

Pos logit contributions
nam+0.223
Mic+0.190
igsaw+0.186
 redd+0.183
 ju+0.174
Neg logit contributions
 and-0.184
,-0.157
owed-0.147
.-0.145
 --0.143
Token them
Token ID606
Feature activation-1.07e+02

Pos logit contributions
nam+0.197
igsaw+0.195
 redd+0.179
Mic+0.176
 ju+0.168
Neg logit contributions
 and-0.215
owed-0.212
 rang-0.204
 replied-0.191
,-0.190
Token around
Token ID1088
Feature activation-9.35e+01

Pos logit contributions
nam+0.327
igsaw+0.248
 ju+0.224
 microsc+0.220
chard+0.215
Neg logit contributions
 replied-0.277
 rang-0.275
 woke-0.271
anas-0.269
uddled-0.268
Tokenoll
Token ID692
Feature activation-1.85e+02

Pos logit contributions
igsaw+0.123
nam+0.106
 ey+0.105
Mic+0.096
years+0.079
Neg logit contributions
 and-0.461
,-0.427
.-0.396
 said-0.389
 because-0.384
Tokenip
Token ID541
Feature activation-1.34e+02

Pos logit contributions
nam+0.394
igsaw+0.321
ocol+0.320
years+0.300
 microsc+0.296
Neg logit contributions
 and-0.649
 because-0.545
,-0.533
 entered-0.528
 --0.525
Tokenops
Token ID2840
Feature activation-9.15e+01

Pos logit contributions
pill+0.281
Mic+0.256
rob+0.240
 av+0.237
rou+0.225
Neg logit contributions
 entered-0.360
 answered-0.354
 scanned-0.352
 Spy-0.345
 tapped-0.340
Token,
Token ID11
Feature activation-6.23e+01

Pos logit contributions
igsaw+0.304
reating+0.282
nam+0.280
 queens+0.276
 av+0.274
Neg logit contributions
 replied-0.141
 rang-0.131
 spoke-0.125
 nodded-0.124
 answered-0.123
Token ch
Token ID442
Feature activation-9.83e+01

Pos logit contributions
nam+0.201
 redd+0.171
 ey+0.169
 ju+0.165
 microsc+0.165
Neg logit contributions
owed-0.155
,-0.139
 and-0.139
 --0.133
uddled-0.130
Tokenocol
Token ID4668
Feature activation-2.19e+02

Pos logit contributions
igsaw+0.108
 ey+0.102
nam+0.084
ocol+0.081
Mic+0.076
Neg logit contributions
 and-0.498
,-0.460
.-0.440
 said-0.439
 --0.426
Tokenates
Token ID689
Feature activation-7.41e+01

Pos logit contributions
nam+0.629
igsaw+0.616
 ey+0.578
Mic+0.576
 Amen+0.538
Neg logit contributions
 and-0.756
,-0.681
.-0.613
!-0.565
 said-0.563
Token,
Token ID11
Feature activation-5.58e+01

Pos logit contributions
 microsc+0.230
rou+0.226
nam+0.226
ifying+0.224
igsaw+0.221
Neg logit contributions
int-0.094
 tapped-0.086
 approached-0.085
ave-0.083
action-0.083
Token g
Token ID308
Feature activation-9.60e+01

Pos logit contributions
nam+0.221
 redd+0.212
 ju+0.211
 ey+0.210
 microsc+0.189
Neg logit contributions
owed-0.077
uddled-0.059
 agree-0.059
int-0.056
 --0.052
Tokenummies
Token ID39578
Feature activation-9.53e+01

Pos logit contributions
igsaw+0.234
 ey+0.207
nam+0.196
Mic+0.184
ocol+0.184
Neg logit contributions
 and-0.372
,-0.331
 said-0.312
.-0.300
 gave-0.298
Token,
Token ID11
Feature activation-5.18e+01

Pos logit contributions
igsaw+0.369
pill+0.339
 microsc+0.335
 queens+0.320
 ey+0.319
Neg logit contributions
owed-0.102
ered-0.098
eded-0.092
uddled-0.092
 approached-0.092
Token It
Token ID632
Feature activation-8.54e+01

Pos logit contributions
igsaw+0.184
nam+0.179
Mic+0.177
 Amen+0.164
 ey+0.152
Neg logit contributions
 and-0.255
,-0.237
 --0.209
!-0.207
.-0.207
Token turned
Token ID2900
Feature activation-5.23e+01

Pos logit contributions
 Amen+0.288
nam+0.265
Mic+0.265
Looks+0.254
 redd+0.249
Neg logit contributions
 and-0.167
,-0.160
;-0.142
 --0.132
anas-0.125
Token out
Token ID503
Feature activation-9.67e+01

Pos logit contributions
nam+0.107
 ju+0.105
 redd+0.101
Mic+0.098
 tant+0.097
Neg logit contributions
anas-0.180
 agree-0.179
owed-0.167
ops-0.157
 watched-0.144
Token that
Token ID326
Feature activation-7.55e+01

Pos logit contributions
igsaw+0.265
 outer+0.250
 ey+0.246
 microsc+0.243
 downward+0.230
Neg logit contributions
ops-0.255
anas-0.238
hed-0.202
owed-0.199
lder-0.197
Token the
Token ID262
Feature activation-8.59e+01

Pos logit contributions
nam+0.189
 redd+0.144
 microsc+0.143
 ju+0.141
 outer+0.128
Neg logit contributions
 and-0.243
int-0.233
anas-0.231
owed-0.230
:-0.220
Token av
Token ID1196
Feature activation-2.18e+02

Pos logit contributions
nam+0.235
Mic+0.214
igsaw+0.211
 ey+0.209
 redd+0.205
Neg logit contributions
 and-0.281
,-0.262
owed-0.236
 --0.235
:-0.230
Tokenoc
Token ID420
Feature activation-1.99e+02

Pos logit contributions
nam+0.997
igsaw+0.924
 microsc+0.884
 ey+0.883
iam+0.858
Neg logit contributions
 and-0.539
,-0.455
.-0.336
!-0.336
 got-0.292
Tokenados
Token ID22484
Feature activation-9.19e+01

Pos logit contributions
nam+0.496
igsaw+0.491
 ey+0.456
Mic+0.455
 Amen+0.419
Neg logit contributions
 and-0.756
,-0.686
.-0.629
 said-0.588
!-0.583
Token were
Token ID547
Feature activation-7.46e+01

Pos logit contributions
nam+0.158
 ju+0.135
he+0.132
 ey+0.127
hel+0.126
Neg logit contributions
anas-0.252
int-0.238
action-0.228
EL-0.225
four-0.225
Token bad
Token ID2089
Feature activation-8.48e+01

Pos logit contributions
nam+0.166
 swollen+0.123
 ju+0.122
 redd+0.118
Mic+0.111
Neg logit contributions
owed-0.251
ops-0.242
ANG-0.237
our-0.236
 replied-0.235
Token and
Token ID290
Feature activation-4.63e+01

Pos logit contributions
igsaw+0.240
 microsc+0.240
reating+0.230
keeping+0.227
 spec+0.224
Neg logit contributions
 rang-0.144
 approached-0.138
 touched-0.138
 replied-0.135
int-0.134
Token give
Token ID1577
Feature activation-1.01e+02

Pos logit contributions
nam+0.372
igsaw+0.329
 ey+0.319
Mic+0.317
 Amen+0.301
Neg logit contributions
 and-0.324
,-0.285
 --0.259
;-0.256
owed-0.247
Token to
Token ID284
Feature activation-1.03e+02

Pos logit contributions
nam+0.273
 redd+0.213
Mic+0.203
 microsc+0.201
igsaw+0.196
Neg logit contributions
 and-0.320
,-0.291
 rang-0.275
;-0.274
 --0.268
Token the
Token ID262
Feature activation-1.11e+02

Pos logit contributions
nam+0.288
igsaw+0.265
 microsc+0.242
 ey+0.241
Mic+0.237
Neg logit contributions
 and-0.342
,-0.307
 --0.275
;-0.269
:-0.268
Token rh
Token ID9529
Feature activation-1.43e+02

Pos logit contributions
nam+0.283
 redd+0.253
 ey+0.247
Mic+0.244
 ju+0.242
Neg logit contributions
 and-0.362
owed-0.343
,-0.341
:-0.319
 --0.317
Tokenin
Token ID259
Feature activation-1.56e+02

Pos logit contributions
nam+0.381
igsaw+0.375
 ey+0.351
Mic+0.345
 Amen+0.321
Neg logit contributions
 and-0.527
,-0.477
.-0.435
!-0.402
 said-0.399
Tokenoc
Token ID420
Feature activation-2.18e+02

Pos logit contributions
igsaw+0.234
 Amen+0.219
nam+0.171
Mic+0.169
 ju+0.140
Neg logit contributions
 shook-0.616
uddled-0.589
 kept-0.588
 argued-0.586
 ignored-0.583
Tokeneros
Token ID27498
Feature activation-1.03e+02

Pos logit contributions
igsaw+0.471
nam+0.454
 ey+0.430
Mic+0.428
 Amen+0.407
Neg logit contributions
 and-0.874
,-0.788
.-0.734
 said-0.730
 because-0.702
Token.
Token ID13
Feature activation-7.18e+01

Pos logit contributions
nam+0.277
igsaw+0.267
 microsc+0.259
 av+0.259
reating+0.235
Neg logit contributions
 rang-0.206
 shook-0.201
 scanned-0.196
our-0.188
 tapped-0.187
Token Lily
Token ID20037
Feature activation-1.07e+02

Pos logit contributions
nam+0.175
igsaw+0.174
Mic+0.174
 Amen+0.162
 microsc+0.147
Neg logit contributions
 and-0.247
,-0.226
 --0.204
owed-0.204
 kept-0.203
Token walked
Token ID6807
Feature activation-5.97e+01

Pos logit contributions
die+0.298
rou+0.272
Mic+0.265
Ru+0.263
 Lorenzo+0.255
Neg logit contributions
;-0.197
fulness-0.192
fall-0.191
ides-0.189
ring-0.185
Token to
Token ID284
Feature activation-7.59e+01

Pos logit contributions
nam+0.200
igsaw+0.194
Mic+0.188
 Amen+0.180
 redd+0.178
Neg logit contributions
 and-0.123
,-0.109
 died-0.098
owed-0.098
 --0.097
Token still
Token ID991
Feature activation-8.90e+01

Pos logit contributions
nam+0.183
igsaw+0.146
 ey+0.143
Mic+0.143
 microsc+0.142
Neg logit contributions
 and-0.175
,-0.147
.-0.145
!-0.135
 --0.132
Token couldn
Token ID3521
Feature activation-6.66e+01

Pos logit contributions
nam+0.288
 microsc+0.220
 ey+0.217
rou+0.210
years+0.208
Neg logit contributions
 and-0.211
anas-0.189
hed-0.186
 replied-0.184
uddled-0.179
Token't
Token ID470
Feature activation-1.02e+02

Pos logit contributions
nam+0.170
igsaw+0.168
Mic+0.155
 ey+0.155
 Amen+0.144
Neg logit contributions
 and-0.248
,-0.225
.-0.205
 said-0.190
!-0.190
Token find
Token ID1064
Feature activation-7.21e+01

Pos logit contributions
nam+0.301
igsaw+0.285
Mic+0.275
 ey+0.274
 avail+0.258
Neg logit contributions
 and-0.291
,-0.251
 said-0.227
 replied-0.227
 shook-0.215
Token any
Token ID597
Feature activation-9.45e+01

Pos logit contributions
nam+0.224
igsaw+0.212
 ey+0.200
Mic+0.198
 microsc+0.189
Neg logit contributions
 and-0.230
,-0.201
.-0.177
 said-0.171
 --0.167
Token av
Token ID1196
Feature activation-2.18e+02

Pos logit contributions
nam+0.247
 ey+0.236
 redd+0.225
igsaw+0.224
 microsc+0.223
Neg logit contributions
 and-0.279
 replied-0.254
,-0.244
owed-0.244
 said-0.238
Tokenoc
Token ID420
Feature activation-1.98e+02

Pos logit contributions
nam+0.952
igsaw+0.914
 ey+0.894
 Amen+0.875
 Deal+0.864
Neg logit contributions
 and-0.565
,-0.483
.-0.387
!-0.380
 --0.337
Tokenados
Token ID22484
Feature activation-9.14e+01

Pos logit contributions
igsaw+0.447
nam+0.446
Mic+0.420
 ey+0.403
 Amen+0.363
Neg logit contributions
 and-0.795
,-0.720
.-0.666
 said-0.639
 because-0.617
Token.
Token ID13
Feature activation-5.56e+01

Pos logit contributions
nam+0.340
 possible+0.285
chard+0.285
 microsc+0.281
keeping+0.280
Neg logit contributions
 replied-0.138
 nodded-0.132
 cheeks-0.129
int-0.128
anas-0.123
Token Finally
Token ID9461
Feature activation-4.81e+01

Pos logit contributions
igsaw+0.157
nam+0.156
Mic+0.151
 ey+0.138
 microsc+0.132
Neg logit contributions
 and-0.183
,-0.164
.-0.146
!-0.142
 --0.138
Token,
Token ID11
Feature activation-5.05e+01

Pos logit contributions
igsaw+0.171
 avail+0.171
nam+0.167
 recess+0.157
 Lorenzo+0.155
Neg logit contributions
 and-0.076
anas-0.070
owed-0.065
uddled-0.061
 liked-0.056
Token am
Token ID716
Feature activation-1.21e+02

Pos logit contributions
 Amen+0.203
 ju+0.176
 redd+0.164
|+0.151
 gol+0.148
Neg logit contributions
uddled-0.403
iously-0.402
anas-0.387
opes-0.379
ors-0.376
Token taking
Token ID2263
Feature activation-9.90e+01

Pos logit contributions
nam+0.295
 Amen+0.248
 possible+0.215
igsaw+0.206
 Deal+0.196
Neg logit contributions
 and-0.418
owed-0.376
ops-0.372
hed-0.371
uddled-0.361
Token the
Token ID262
Feature activation-1.09e+02

Pos logit contributions
nam+0.221
igsaw+0.219
 microsc+0.187
Mic+0.185
 Wed+0.165
Neg logit contributions
 and-0.328
owed-0.319
 rang-0.300
 scanned-0.293
uddled-0.293
Token rh
Token ID9529
Feature activation-1.41e+02

Pos logit contributions
igsaw+0.277
nam+0.274
 ey+0.241
 redd+0.238
Mic+0.237
Neg logit contributions
 and-0.380
,-0.348
owed-0.340
 --0.327
:-0.325
Tokenin
Token ID259
Feature activation-1.56e+02

Pos logit contributions
nam+0.388
igsaw+0.384
 ey+0.362
Mic+0.350
 microsc+0.332
Neg logit contributions
 and-0.514
,-0.463
.-0.422
!-0.391
 --0.384
Tokenoc
Token ID420
Feature activation-2.18e+02

Pos logit contributions
igsaw+0.233
 Amen+0.232
Mic+0.221
nam+0.185
 gol+0.171
Neg logit contributions
 shook-0.609
 and-0.602
 nodded-0.585
 shrugged-0.567
 whispered-0.560
Tokeneros
Token ID27498
Feature activation-1.03e+02

Pos logit contributions
igsaw+0.459
nam+0.437
 ey+0.419
Mic+0.413
 Amen+0.408
Neg logit contributions
 and-0.873
,-0.784
 said-0.735
.-0.730
 because-0.700
Token to
Token ID284
Feature activation-8.40e+01

Pos logit contributions
igsaw+0.270
reating+0.261
 possible+0.257
nam+0.249
 queens+0.246
Neg logit contributions
 nodded-0.246
 shook-0.243
 shrugged-0.216
 cheeks-0.212
 scanned-0.207
Token a
Token ID257
Feature activation-1.02e+02

Pos logit contributions
nam+0.231
igsaw+0.225
Mic+0.213
 Amen+0.201
 outer+0.198
Neg logit contributions
 and-0.275
,-0.244
owed-0.226
 shook-0.214
.-0.214
Token new
Token ID649
Feature activation-1.13e+02

Pos logit contributions
igsaw+0.265
nam+0.264
Mic+0.248
 Amen+0.228
 ey+0.227
Neg logit contributions
 and-0.370
,-0.330
.-0.300
 said-0.292
 --0.285
Token home
Token ID1363
Feature activation-7.83e+01

Pos logit contributions
 Amen+0.232
igsaw+0.227
 outer+0.215
nam+0.211
Mic+0.205
Neg logit contributions
owed-0.406
 shook-0.371
 and-0.365
ed-0.359
 said-0.355
Token".
Token ID1911
Feature activation-8.10e+01

Pos logit contributions
igsaw+0.741
 ey+0.729
nam+0.721
 Amen+0.713
Mic+0.711
Neg logit contributions
 and-0.141
,-0.095
 watched-0.048
 --0.048
 gave-0.046
Token\n
Token ID198
Feature activation-1.18e+02

Pos logit contributions
Mic+0.158
igsaw+0.157
nam+0.157
 Amen+0.136
 Deal+0.135
Neg logit contributions
 and-0.281
 watered-0.272
owed-0.268
,-0.254
le-0.249
TokenThe
Token ID464
Feature activation-9.69e+01

Pos logit contributions
igsaw+0.391
nam+0.388
Mic+0.379
 ey+0.368
 Amen+0.357
Neg logit contributions
 and-0.336
,-0.298
.-0.256
 --0.238
!-0.236
Token 3
Token ID513
Feature activation-1.35e+02

Pos logit contributions
nam+0.274
igsaw+0.269
Mic+0.255
 ey+0.253
 Amen+0.237
Neg logit contributions
 and-0.324
,-0.293
.-0.258
 --0.255
 because-0.250
Token-
Token ID12
Feature activation-1.79e+02

Pos logit contributions
igsaw+0.264
 Amen+0.251
 ey+0.248
nam+0.241
year+0.228
Neg logit contributions
 and-0.464
,-0.430
 because-0.404
 --0.394
.-0.390
Tokenyear
Token ID1941
Feature activation-2.18e+02

Pos logit contributions
nam+0.309
igsaw+0.307
 ey+0.278
Mic+0.263
 microsc+0.238
Neg logit contributions
 and-0.832
,-0.766
.-0.717
 said-0.686
 --0.674
Token-
Token ID12
Feature activation-1.77e+02

Pos logit contributions
igsaw+0.949
nam+0.940
 ey+0.904
Mic+0.898
 Amen+0.863
Neg logit contributions
 and-0.395
,-0.322
.-0.259
!-0.218
 --0.217
Tokenold
Token ID727
Feature activation-1.21e+02

Pos logit contributions
nam+0.457
igsaw+0.448
 ey+0.418
Mic+0.413
 Amen+0.384
Neg logit contributions
 and-0.668
,-0.606
.-0.553
 said-0.514
!-0.513
Token thought
Token ID1807
Feature activation-6.55e+01

Pos logit contributions
nam+0.376
years+0.352
iam+0.342
igsaw+0.322
saw+0.319
Neg logit contributions
 and-0.227
 because-0.213
;-0.199
,-0.195
 --0.195
Token hard
Token ID1327
Feature activation-9.45e+01

Pos logit contributions
igsaw+0.177
year+0.154
nam+0.153
iam+0.148
 possible+0.140
Neg logit contributions
anas-0.138
hi-0.138
action-0.133
 rang-0.131
 pet-0.130
Token and
Token ID290
Feature activation-6.25e+01

Pos logit contributions
igsaw+0.352
 microsc+0.302
 gol+0.281
 narrative+0.278
bench+0.276
Neg logit contributions
 drank-0.119
 stayed-0.117
 laid-0.115
 sprayed-0.114
 touched-0.113
Token"
Token ID1
Feature activation+5.74e+01

Pos logit contributions
 and+0.075
,+0.064
.+0.055
 said+0.049
 -+0.048
Neg logit contributions
nam-0.113
igsaw-0.112
 ey-0.106
Mic-0.103
 redd-0.101
TokenLet
Token ID5756
Feature activation+6.76e+01

Pos logit contributions
 and+0.209
,+0.189
.+0.171
!+0.158
 said+0.158
Neg logit contributions
nam-0.158
igsaw-0.154
 ey-0.144
Mic-0.143
 Amen-0.133
Token's
Token ID338
Feature activation+3.15e+01

Pos logit contributions
 and+0.114
,+0.093
.+0.069
!+0.054
 said+0.053
Neg logit contributions
nam-0.339
igsaw-0.331
 ey-0.318
Mic-0.312
 microsc-0.303
Token play
Token ID711
Feature activation+5.94e+01

Pos logit contributions
 and+0.079
,+0.067
.+0.058
 -+0.054
 said+0.053
Neg logit contributions
nam-0.116
igsaw-0.113
Mic-0.109
 ey-0.108
 Amen-0.105
Token something
Token ID1223
Feature activation+4.14e+01

Pos logit contributions
 and+0.141
owed+0.123
 shook+0.122
 rang+0.121
 wanted+0.121
Neg logit contributions
igsaw-0.178
nam-0.172
 Amen-0.162
 ey-0.157
amental-0.154
Token else
Token ID2073
Feature activation+3.82e+01

Pos logit contributions
 and+0.154
hed+0.141
red+0.139
 approached+0.139
 replied+0.136
Neg logit contributions
nam-0.072
igsaw-0.060
 possible-0.058
 queens-0.055
 redd-0.054
Token!"
Token ID2474
Feature activation+6.65e+01

Pos logit contributions
 replied+0.054
 woke+0.054
 and+0.053
 turned+0.053
 approached+0.053
Neg logit contributions
nam-0.145
igsaw-0.137
 Amen-0.126
 possible-0.126
 queens-0.125
Token Tom
Token ID4186
Feature activation+3.77e+01

Pos logit contributions
 and+0.231
,+0.205
.+0.187
 -+0.181
!+0.180
Neg logit contributions
nam-0.176
igsaw-0.169
 ey-0.161
Mic-0.150
 microsc-0.148
Token said
Token ID531
Feature activation+7.12e+01

Pos logit contributions
+0.052
 -+0.051
 died+0.050
;+0.050
 and+0.047
Neg logit contributions
nam-0.120
Mic-0.119
 Gw-0.108
ptions-0.106
Lisa-0.106
Token.
Token ID13
Feature activation+6.24e+01

Pos logit contributions
 and+0.057
,+0.029
 -+0.001
.+0.001
!+0.000
Neg logit contributions
nam-0.377
igsaw-0.370
Mic-0.351
 Amen-0.349
 ey-0.349
Token\n
Token ID198
Feature activation+3.45e+01

Pos logit contributions
 and+0.201
,+0.184
 -+0.165
!+0.164
.+0.160
Neg logit contributions
nam-0.157
igsaw-0.157
Mic-0.150
 ey-0.147
 Amen-0.144
Token and
Token ID290
Feature activation+5.55e+01

Pos logit contributions
 and+0.099
 answered+0.088
 rang+0.087
owed+0.087
,+0.086
Neg logit contributions
nam-0.154
Mic-0.148
igsaw-0.140
amental-0.140
Ru-0.137
Token make
Token ID787
Feature activation+4.55e+01

Pos logit contributions
 and+0.128
,+0.110
.+0.089
 said+0.076
 wanted+0.075
Neg logit contributions
igsaw-0.248
Mic-0.238
 ey-0.237
 Deal-0.229
arl-0.226
Token things
Token ID1243
Feature activation+4.36e+01

Pos logit contributions
 and+0.138
,+0.115
.+0.111
!+0.090
 said+0.090
Neg logit contributions
igsaw-0.183
 Amen-0.181
Mic-0.173
nam-0.172
 ey-0.168
Token.
Token ID13
Feature activation+5.13e+01

Pos logit contributions
 and+0.070
,+0.055
.+0.044
!+0.031
 said+0.029
Neg logit contributions
nam-0.213
igsaw-0.212
 ey-0.206
Mic-0.206
 Amen-0.197
Token One
Token ID1881
Feature activation+3.61e+01

Pos logit contributions
 and+0.134
,+0.117
.+0.102
 said+0.086
 because+0.085
Neg logit contributions
nam-0.199
igsaw-0.193
Mic-0.186
 ey-0.186
 Amen-0.177
Token day
Token ID1110
Feature activation+3.97e+01

Pos logit contributions
 and+0.142
 -+0.130
;+0.129
,+0.128
owed+0.127
Neg logit contributions
igsaw-0.050
nam-0.038
 ey-0.038
 redd-0.036
 sunny-0.033
Token,
Token ID11
Feature activation+5.28e+01

Pos logit contributions
 and+0.034
,+0.003
 said+0.003
.+0.002
 -+0.001
Neg logit contributions
nam-0.204
igsaw-0.196
 microsc-0.188
 ey-0.183
Mic-0.182
Token Lily
Token ID20037
Feature activation+1.73e+01

Pos logit contributions
 and+0.125
,+0.109
.+0.090
 said+0.077
!+0.075
Neg logit contributions
igsaw-0.219
nam-0.214
 ey-0.212
Mic-0.210
 Amen-0.203
Token has
Token ID468
Feature activation+4.24e+01

Pos logit contributions
owed+0.034
ided+0.026
;+0.026
eness+0.026
 -+0.025
Neg logit contributions
 ey-0.056
lon-0.052
 gol-0.051
 microsc-0.051
 abrupt-0.051
Token an
Token ID281
Feature activation+2.25e+01

Pos logit contributions
 and+0.133
,+0.121
.+0.104
 -+0.102
!+0.101
Neg logit contributions
nam-0.132
igsaw-0.121
Mic-0.112
 ey-0.111
 microsc-0.110
Token idea
Token ID2126
Feature activation+4.79e+01

Pos logit contributions
 and+0.077
,+0.070
.+0.064
!+0.059
 said+0.059
Neg logit contributions
igsaw-0.062
nam-0.062
 ey-0.058
Mic-0.056
 redd-0.053
Token swim
Token ID9422
Feature activation+6.22e+01

Pos logit contributions
 and+0.155
,+0.128
.+0.111
 -+0.110
 said+0.110
Neg logit contributions
igsaw-0.201
nam-0.201
 Amen-0.199
 ey-0.194
Mic-0.191
Token in
Token ID287
Feature activation+5.18e+01

Pos logit contributions
 replied+0.140
 rang+0.135
 explained+0.134
 answered+0.133
 checked+0.124
Neg logit contributions
nam-0.167
amental-0.155
aph-0.151
igsaw-0.149
 queens-0.145
Token the
Token ID262
Feature activation+4.93e+01

Pos logit contributions
 and+0.152
,+0.134
.+0.117
 said+0.110
 -+0.108
Neg logit contributions
nam-0.173
igsaw-0.171
 ey-0.161
Mic-0.159
 Amen-0.152
Token pool
Token ID5933
Feature activation+5.81e+01

Pos logit contributions
 and+0.178
,+0.161
.+0.146
!+0.134
 said+0.133
Neg logit contributions
nam-0.139
igsaw-0.135
Mic-0.128
 ey-0.126
 Amen-0.118
Token.
Token ID13
Feature activation+7.28e+01

Pos logit contributions
 rang+0.119
 replied+0.107
 explained+0.099
 slapped+0.097
 answered+0.097
Neg logit contributions
 sill-0.178
reating-0.166
nam-0.164
igsaw-0.163
 downward-0.161
Token She
Token ID1375
Feature activation+6.33e+01

Pos logit contributions
 and+0.171
,+0.144
.+0.121
 because+0.109
!+0.088
Neg logit contributions
nam-0.337
igsaw-0.312
 ey-0.308
ptions-0.308
Mic-0.305
Token thanked
Token ID26280
Feature activation+9.13e+01

Pos logit contributions
 and+0.119
,+0.116
.+0.113
;+0.109
 -+0.109
Neg logit contributions
Mic-0.194
Ru-0.194
 microsc-0.188
rou-0.186
 gol-0.186
Token the
Token ID262
Feature activation+5.22e+01

Pos logit contributions
 and+0.267
,+0.234
.+0.204
!+0.180
 said+0.179
Neg logit contributions
igsaw-0.325
nam-0.323
 ey-0.309
Mic-0.300
 redd-0.289
Token life
Token ID1204
Feature activation+4.98e+01

Pos logit contributions
 and+0.174
,+0.156
.+0.140
!+0.128
 said+0.127
Neg logit contributions
nam-0.159
igsaw-0.156
 ey-0.147
Mic-0.147
 Amen-0.137
Tokenguard
Token ID14864
Feature activation+5.38e+01

Pos logit contributions
 and+0.167
,+0.154
 -+0.142
 said+0.141
.+0.140
Neg logit contributions
igsaw-0.117
nam-0.105
Mic-0.097
 gol-0.094
 Amen-0.093
Token for
Token ID329
Feature activation+6.39e+01

Pos logit contributions
 rang+0.068
 scanned+0.059
 struck+0.058
 touched+0.056
 shook+0.056
Neg logit contributions
 Amen-0.216
nam-0.206
 Lorenzo-0.205
igsaw-0.202
reating-0.198
Token with
Token ID351
Feature activation+5.64e+01

Pos logit contributions
 and+0.108
 said+0.099
 replied+0.097
 watched+0.094
 answered+0.092
Neg logit contributions
nam-0.204
igsaw-0.187
 ey-0.186
 redd-0.180
amental-0.179
Token him
Token ID683
Feature activation+3.34e+01

Pos logit contributions
 and+0.176
,+0.152
 said+0.147
 replied+0.145
 -+0.140
Neg logit contributions
nam-0.159
 microsc-0.139
igsaw-0.138
 ey-0.137
Mic-0.131
Token and
Token ID290
Feature activation+6.68e+01

Pos logit contributions
 approached+0.065
 rang+0.065
 replied+0.063
uddled+0.063
 blew+0.062
Neg logit contributions
nam-0.118
 microsc-0.096
reating-0.085
 math-0.085
 redd-0.083
Token his
Token ID465
Feature activation+2.23e+01

Pos logit contributions
 and+0.174
,+0.151
.+0.131
 -+0.120
!+0.120
Neg logit contributions
nam-0.249
igsaw-0.236
 ey-0.230
Mic-0.225
 Amen-0.217
Token magic
Token ID5536
Feature activation+4.82e+01

Pos logit contributions
 and+0.075
 -+0.067
;+0.067
,+0.066
hed+0.066
Neg logit contributions
nam-0.060
 ey-0.052
 microsc-0.046
 redd-0.045
igsaw-0.044
Token wand
Token ID11569
Feature activation+6.34e+01

Pos logit contributions
 and+0.156
 explained+0.146
 said+0.139
 ordered+0.138
 replied+0.137
Neg logit contributions
 ey-0.108
igsaw-0.092
nam-0.087
 microsc-0.084
 Lorenzo-0.083
Token.
Token ID13
Feature activation+6.27e+01

Pos logit contributions
int+0.120
+0.117
+0.113
 approached+0.109
 overheard+0.107
Neg logit contributions
nam-0.167
 sill-0.163
 microsc-0.159
 narrative-0.145
 angles-0.145
Token He
Token ID679
Feature activation+4.02e+01

Pos logit contributions
 and+0.194
,+0.171
.+0.154
 because+0.135
 said+0.134
Neg logit contributions
nam-0.217
igsaw-0.212
 ey-0.200
Mic-0.200
 Amen-0.190
Token said
Token ID531
Feature activation+7.23e+01

Pos logit contributions
 and+0.070
,+0.069
;+0.063
 -+0.061
!+0.055
Neg logit contributions
nam-0.157
Mic-0.147
 redd-0.143
 ey-0.140
Looks-0.135
Token,
Token ID11
Feature activation+6.49e+01

Pos logit contributions
 and+0.064
!+0.014
 said+0.013
.+0.013
 because+0.008
Neg logit contributions
nam-0.361
igsaw-0.345
 microsc-0.336
 ey-0.333
 Amen-0.332
Token "
Token ID366
Feature activation+6.10e+01

Pos logit contributions
le+0.246
!+0.219
owed+0.214
les+0.213
 and+0.212
Neg logit contributions
nam-0.113
 Amen-0.097
 Do-0.094
Mic-0.087
 microsc-0.081
Token<|endoftext|>
Token ID50256
Feature activation+3.39e+01

Pos logit contributions
 and+0.090
,+0.079
.+0.068
!+0.062
 -+0.062
Neg logit contributions
nam-0.118
igsaw-0.117
Mic-0.111
 ey-0.111
 Amen-0.106
TokenOnce
Token ID7454
Feature activation+2.45e+01

Pos logit contributions
 and+0.079
,+0.067
.+0.062
!+0.047
 said+0.043
Neg logit contributions
nam-0.157
igsaw-0.150
 ey-0.144
Mic-0.141
 redd-0.139
Token upon
Token ID2402
Feature activation-1.89e+01

Pos logit contributions
,+0.076
 and+0.075
.+0.064
:+0.054
 said+0.050
Neg logit contributions
nam-0.101
Mic-0.099
igsaw-0.096
Ta-0.094
 ey-0.094
Token a
Token ID257
Feature activation+6.39e+00

Pos logit contributions
igsaw+0.115
umm+0.110
 ey+0.109
ocol+0.105
nam+0.104
Neg logit contributions
 and-0.037
.-0.028
,-0.022
 was-0.013
!-0.013
Token time
Token ID640
Feature activation+1.95e+01

Pos logit contributions
 and+0.018
,+0.015
.+0.013
!+0.011
 because+0.010
Neg logit contributions
nam-0.028
igsaw-0.025
 ey-0.025
 microsc-0.024
reating-0.023
Token,
Token ID11
Feature activation+3.72e+01

Pos logit contributions
,+0.000
 and-0.000
.-0.012
!-0.018
:-0.018
Neg logit contributions
nam-0.135
igsaw-0.130
Mic-0.130
 ey-0.129
 redd-0.128
Token there
Token ID612
Feature activation+1.60e+01

Pos logit contributions
 and+0.082
,+0.077
.+0.058
:+0.042
 at+0.042
Neg logit contributions
igsaw-0.186
Mic-0.177
nam-0.176
 ey-0.176
reating-0.175
Token was
Token ID373
Feature activation+2.60e+01

Pos logit contributions
 and+0.022
,+0.016
.+0.012
 -+0.009
!+0.009
Neg logit contributions
nam-0.075
igsaw-0.075
 ey-0.073
Mic-0.072
 Amen-0.069
Token a
Token ID257
Feature activation+9.43e+00

Pos logit contributions
 and+0.043
,+0.028
.+0.020
 because+0.008
 at+0.004
Neg logit contributions
igsaw-0.155
 ey-0.154
arl-0.151
nam-0.150
ubb-0.150
Token big
Token ID1263
Feature activation-2.07e+01

Pos logit contributions
 and+0.029
,+0.026
.+0.022
 said+0.019
!+0.019
Neg logit contributions
igsaw-0.034
nam-0.033
 ey-0.032
Mic-0.030
 microsc-0.030
Token blue
Token ID4171
Feature activation-1.32e+01

Pos logit contributions
igsaw+0.061
 ey+0.058
nam+0.057
 Amen+0.053
Mic+0.053
Neg logit contributions
owed-0.056
 and-0.055
 --0.049
.-0.048
ed-0.048
Token!
Token ID0
Feature activation-1.52e+00

Pos logit contributions
rou+0.120
 my+0.118
 your+0.115
arl+0.114
 ey+0.108
Neg logit contributions
 scanned-0.111
 rang-0.110
 approached-0.109
 spoke-0.108
 watered-0.106
Token This
Token ID770
Feature activation-2.22e+01

Pos logit contributions
igsaw+0.004
nam+0.004
 ey+0.004
Mic+0.004
 Amen+0.004
Neg logit contributions
 and-0.005
,-0.005
.-0.004
 --0.004
 said-0.004
Token apple
Token ID17180
Feature activation-3.15e+01

Pos logit contributions
 redd+0.069
igsaw+0.067
 ey+0.065
Mic+0.064
nam+0.064
Neg logit contributions
 and-0.056
,-0.046
owed-0.044
 --0.042
 said-0.042
Token is
Token ID318
Feature activation-2.13e+01

Pos logit contributions
 ey+0.062
igsaw+0.060
 redd+0.058
 ju+0.053
pill+0.050
Neg logit contributions
 approached-0.093
 discussed-0.091
owed-0.089
 whispered-0.087
 spoke-0.087
Token bad
Token ID2089
Feature activation-3.77e+01

Pos logit contributions
nam+0.051
 redd+0.042
igsaw+0.040
 ey+0.039
 ju+0.039
Neg logit contributions
 and-0.079
,-0.072
owed-0.069
.-0.065
 --0.065
Token.
Token ID13
Feature activation-8.78e+00

Pos logit contributions
reating+0.099
 queens+0.092
igsaw+0.089
 yours+0.088
 chefs+0.086
Neg logit contributions
owed-0.085
 rang-0.083
 scanned-0.080
 spoke-0.080
 approached-0.078
Token It
Token ID632
Feature activation-1.98e+01

Pos logit contributions
nam+0.028
igsaw+0.028
Mic+0.026
 ey+0.026
 microsc+0.024
Neg logit contributions
 and-0.028
,-0.025
.-0.022
!-0.020
 said-0.020
Token has
Token ID468
Feature activation-1.82e+01

Pos logit contributions
nam+0.065
 ey+0.064
igsaw+0.063
Mic+0.062
 redd+0.061
Neg logit contributions
 and-0.049
,-0.044
 --0.038
.-0.038
;-0.037
Token a
Token ID257
Feature activation-3.74e+01

Pos logit contributions
nam+0.040
 redd+0.037
 ey+0.034
 microsc+0.034
 ju+0.031
Neg logit contributions
 and-0.063
owed-0.062
,-0.061
 agree-0.057
 replied-0.056
Token bad
Token ID2089
Feature activation-3.31e+01

Pos logit contributions
nam+0.093
igsaw+0.091
 ey+0.089
Mic+0.088
 redd+0.085
Neg logit contributions
 and-0.130
,-0.116
owed-0.107
.-0.107
 --0.103
Token taste
Token ID6938
Feature activation-2.38e+01

Pos logit contributions
 ey+0.077
 redd+0.070
igsaw+0.065
 avail+0.063
nam+0.062
Neg logit contributions
 and-0.096
owed-0.091
 approached-0.091
 watched-0.090
 joined-0.090
Token From
Token ID3574
Feature activation+2.35e+01

Pos logit contributions
 and+0.095
,+0.087
.+0.079
 -+0.075
!+0.074
Neg logit contributions
igsaw-0.076
Mic-0.073
nam-0.071
 ey-0.068
 Amen-0.067
Token then
Token ID788
Feature activation+1.49e+01

Pos logit contributions
 and+0.048
,+0.040
.+0.035
!+0.030
 -+0.029
Neg logit contributions
nam-0.094
igsaw-0.091
Mic-0.089
 ey-0.088
 Amen-0.086
Token on
Token ID319
Feature activation+1.92e+01

Pos logit contributions
 and+0.036
.+0.030
 said+0.029
owed+0.029
 shook+0.029
Neg logit contributions
Mic-0.044
Ru-0.043
igsaw-0.043
 Amen-0.042
 narrative-0.042
Token,
Token ID11
Feature activation+3.22e+01

Pos logit contributions
owed+0.020
 and+0.019
 touched+0.016
 sat+0.016
 rang+0.015
Neg logit contributions
Mic-0.078
igsaw-0.077
Ru-0.076
nam-0.076
 Amen-0.074
Token he
Token ID339
Feature activation+1.35e+01

Pos logit contributions
 and+0.100
,+0.089
.+0.079
!+0.072
 -+0.072
Neg logit contributions
nam-0.104
igsaw-0.102
Mic-0.097
 ey-0.096
 Amen-0.091
Token remembered
Token ID12086
Feature activation+3.28e+01

Pos logit contributions
owed+0.030
 cheeks+0.030
,"+0.029
ly+0.028
;+0.027
Neg logit contributions
Ru-0.036
 known-0.033
Mic-0.032
Wal-0.032
 gol-0.031
Token an
Token ID281
Feature activation+1.58e+00

Pos logit contributions
 and+0.088
owed+0.087
ops+0.081
hed+0.078
le+0.076
Neg logit contributions
nam-0.088
 microsc-0.085
igsaw-0.075
 outer-0.074
 Wed-0.074
Token important
Token ID1593
Feature activation+1.32e+01

Pos logit contributions
 and+0.006
,+0.005
.+0.005
 said+0.005
!+0.005
Neg logit contributions
igsaw-0.004
nam-0.004
 ey-0.003
Mic-0.003
 Amen-0.003
Token lesson
Token ID11483
Feature activation+3.46e+01

Pos logit contributions
 and+0.047
,+0.042
.+0.039
 said+0.039
 -+0.037
Neg logit contributions
igsaw-0.034
 Amen-0.029
Mic-0.029
nam-0.028
 ey-0.028
Token.
Token ID13
Feature activation+3.52e+01

Pos logit contributions
 and+0.067
,+0.055
 said+0.048
 replied+0.046
 shook+0.045
Neg logit contributions
nam-0.127
igsaw-0.125
 microsc-0.121
 ey-0.118
reating-0.115
Token No
Token ID1400
Feature activation+4.50e+01

Pos logit contributions
 and+0.098
,+0.083
.+0.073
 because+0.065
!+0.064
Neg logit contributions
nam-0.140
igsaw-0.127
 ey-0.125
 redd-0.119
Mic-0.119
Token his
Token ID465
Feature activation-2.60e+01

Pos logit contributions
nam+0.004
arl+0.003
 ey+0.003
 narrative+0.003
 abrupt+0.003
Neg logit contributions
owed-0.004
 replied-0.004
anas-0.004
 --0.004
;-0.004
Token big
Token ID1263
Feature activation-4.75e+01

Pos logit contributions
nam+0.077
igsaw+0.074
 ey+0.071
Mic+0.068
 redd+0.062
Neg logit contributions
 and-0.082
,-0.075
 --0.068
.-0.067
!-0.065
Token bat
Token ID7365
Feature activation-1.26e+00

Pos logit contributions
igsaw+0.120
nam+0.116
 ey+0.112
 gol+0.102
Mic+0.099
Neg logit contributions
 and-0.152
owed-0.147
 --0.138
,-0.135
 watched-0.135
Token and
Token ID290
Feature activation+1.97e+01

Pos logit contributions
arl+0.004
 narrative+0.004
Mic+0.004
saw+0.004
 gol+0.004
Neg logit contributions
owed-0.002
;-0.002
 --0.002
 –-0.002
 replied-0.002
Token hits
Token ID7127
Feature activation+2.31e+01

Pos logit contributions
 and+0.053
,+0.046
.+0.040
 -+0.038
!+0.037
Neg logit contributions
nam-0.071
igsaw-0.068
 ey-0.066
Mic-0.064
 microsc-0.062
Token their
Token ID511
Feature activation-2.75e+01

Pos logit contributions
 and+0.077
,+0.070
.+0.062
 -+0.061
!+0.060
Neg logit contributions
nam-0.064
igsaw-0.061
 ey-0.058
Mic-0.056
 microsc-0.054
Token ball
Token ID2613
Feature activation+5.10e+00

Pos logit contributions
nam+0.085
igsaw+0.079
 ey+0.076
Mic+0.072
 microsc+0.071
Neg logit contributions
 and-0.086
,-0.078
 --0.069
.-0.068
 said-0.066
Token.
Token ID13
Feature activation+1.68e+01

Pos logit contributions
 replied+0.006
 and+0.006
 explained+0.005
 said+0.005
 answered+0.005
Neg logit contributions
nam-0.020
igsaw-0.018
arl-0.018
 ey-0.018
 redd-0.018
Token He
Token ID679
Feature activation-6.60e+00

Pos logit contributions
 and+0.062
,+0.057
 -+0.055
owed+0.055
!+0.053
Neg logit contributions
igsaw-0.036
nam-0.034
 ey-0.034
Mic-0.030
 redd-0.029
Token hits
Token ID7127
Feature activation+2.65e+01

Pos logit contributions
nam+0.021
 ey+0.021
 microsc+0.019
igsaw+0.019
Mic+0.019
Neg logit contributions
 and-0.018
,-0.016
 --0.014
.-0.014
;-0.014
Token it
Token ID340
Feature activation+1.02e+01

Pos logit contributions
 and+0.084
,+0.075
.+0.066
!+0.061
 -+0.061
Neg logit contributions
nam-0.084
igsaw-0.083
 ey-0.078
Mic-0.078
 Amen-0.073
Token corn
Token ID11676
Feature activation-6.57e+01

Pos logit contributions
igsaw+0.160
nam+0.152
 ey+0.151
Mic+0.148
 redd+0.137
Neg logit contributions
 and-0.246
,-0.228
.-0.207
owed-0.206
 --0.200
Tokenfield
Token ID3245
Feature activation-6.20e+01

Pos logit contributions
nam+0.131
 ey+0.122
igsaw+0.111
 Amen+0.110
amental+0.108
Neg logit contributions
 and-0.202
 stopped-0.192
owed-0.189
 rang-0.188
 wanted-0.188
Token and
Token ID290
Feature activation-3.35e+01

Pos logit contributions
nam+0.186
 ey+0.183
 moderation+0.182
 dining+0.179
 sill+0.175
Neg logit contributions
 rang-0.106
 raged-0.101
 fought-0.092
uddled-0.092
ops-0.091
Token the
Token ID262
Feature activation-6.20e+01

Pos logit contributions
nam+0.120
 microsc+0.097
 redd+0.097
 ju+0.097
chard+0.096
Neg logit contributions
 and-0.073
;-0.065
 --0.063
,-0.062
:-0.061
Token dog
Token ID3290
Feature activation-3.86e+01

Pos logit contributions
nam+0.154
 redd+0.139
Mic+0.132
 ey+0.125
 ju+0.123
Neg logit contributions
 and-0.217
,-0.211
owed-0.200
 --0.193
:-0.189
Token had
Token ID550
Feature activation-4.98e+01

Pos logit contributions
 Fro+0.084
 Rib+0.075
 ju+0.069
nam+0.067
 fro+0.066
Neg logit contributions
mmm-0.095
apers-0.093
mmmm-0.090
ues-0.088
',"-0.087
Token a
Token ID257
Feature activation-6.62e+01

Pos logit contributions
 ju+0.114
 redd+0.113
nam+0.103
 microsc+0.096
 pic+0.090
Neg logit contributions
owed-0.164
 and-0.162
,-0.156
our-0.142
le-0.142
Token wonderful
Token ID7932
Feature activation-5.94e+01

Pos logit contributions
igsaw+0.125
nam+0.119
Mic+0.119
 Amen+0.117
 redd+0.117
Neg logit contributions
 and-0.256
owed-0.236
,-0.225
.-0.224
ed-0.216
Token time
Token ID640
Feature activation-5.12e+01

Pos logit contributions
 Amen+0.096
igsaw+0.089
 ey+0.080
 redd+0.080
 avail+0.080
Neg logit contributions
owed-0.209
 and-0.203
olded-0.197
 said-0.196
 rang-0.196
Token running
Token ID2491
Feature activation-4.44e+01

Pos logit contributions
nam+0.183
 microsc+0.177
igsaw+0.177
rou+0.172
years+0.166
Neg logit contributions
 rang-0.074
 replied-0.072
 shook-0.071
 woke-0.070
 felt-0.066
Token around
Token ID1088
Feature activation-5.48e+01

Pos logit contributions
igsaw+0.139
nam+0.130
 Lorenzo+0.121
 gol+0.118
 microsc+0.118
Neg logit contributions
 explained-0.095
 rang-0.092
 checked-0.090
owed-0.090
 answered-0.090
Token day
Token ID1110
Feature activation+3.53e-01

Pos logit contributions
Mic+0.019
 redd+0.019
nam+0.019
igsaw+0.019
 ey+0.018
Neg logit contributions
 and-0.036
,-0.035
:-0.032
 --0.031
;-0.031
Token adm
Token ID6178
Feature activation-4.88e+00

Pos logit contributions
 and+0.001
 replied+0.001
:+0.001
 shook+0.001
,+0.001
Neg logit contributions
nam-0.001
igsaw-0.001
years-0.001
 queens-0.001
Mic-0.001
Tokeniring
Token ID3428
Feature activation-7.25e+01

Pos logit contributions
nam+0.009
igsaw+0.009
Mic+0.008
 Amen+0.008
 ey+0.008
Neg logit contributions
 and-0.023
,-0.021
.-0.020
!-0.018
 --0.018
Token the
Token ID262
Feature activation-7.24e+00

Pos logit contributions
nam+0.167
igsaw+0.148
 microsc+0.143
 ey+0.142
 outer+0.133
Neg logit contributions
 said-0.224
 replied-0.224
 and-0.224
 rang-0.222
 explained-0.220
Token flower
Token ID15061
Feature activation+8.16e+00

Pos logit contributions
 redd+0.014
 ey+0.014
 ju+0.012
igsaw+0.012
nam+0.012
Neg logit contributions
 and-0.027
owed-0.026
,-0.026
:-0.024
 rang-0.024
Token.
Token ID13
Feature activation+1.61e+01

Pos logit contributions
 rang+0.011
 lied+0.011
 shook+0.011
 slapped+0.011
 checked+0.011
Neg logit contributions
nam-0.028
igsaw-0.028
 microsc-0.027
 ju-0.027
 ey-0.026
Token\n
Token ID198
Feature activation-1.23e+01

Pos logit contributions
 and+0.049
,+0.044
.+0.039
!+0.036
 -+0.036
Neg logit contributions
igsaw-0.051
nam-0.051
Mic-0.048
 ey-0.048
 Amen-0.046
Token\n
Token ID198
Feature activation-1.13e+01

Pos logit contributions
nam+0.050
igsaw+0.048
 ey+0.046
Mic+0.045
 microsc+0.043
Neg logit contributions
 and-0.031
,-0.026
.-0.023
 said-0.020
!-0.020
TokenTo
Token ID2514
Feature activation+4.30e+01

Pos logit contributions
nam+0.034
igsaw+0.033
 ey+0.031
Mic+0.030
 microsc+0.029
Neg logit contributions
 and-0.039
,-0.035
.-0.031
!-0.029
 --0.029
Token Buzz
Token ID15753
Feature activation-2.41e+00

Pos logit contributions
 and+0.139
,+0.115
;+0.112
.+0.112
 -+0.112
Neg logit contributions
igsaw-0.104
 Amen-0.098
 Lorenzo-0.090
 outer-0.088
Mic-0.086
Tokeny
Token ID88
Feature activation-6.00e+01

Pos logit contributions
rou+0.005
igsaw+0.005
amental+0.005
die+0.005
Ru+0.005
Neg logit contributions
 and-0.006
 stamped-0.006
 shook-0.006
;-0.005
 rang-0.005
Token if
Token ID611
Feature activation-8.70e+01

Pos logit contributions
igsaw+0.345
reating+0.332
nam+0.299
 microsc+0.289
 Wed+0.285
Neg logit contributions
 rang-0.136
owed-0.134
int-0.132
 listened-0.128
 watched-0.126
Token she
Token ID673
Feature activation-8.03e+01

Pos logit contributions
nam+0.166
 avail+0.128
 possible+0.125
igsaw+0.117
 microsc+0.114
Neg logit contributions
 and-0.311
int-0.294
hed-0.285
anas-0.277
owed-0.273
Token promised
Token ID8072
Feature activation-4.62e+01

Pos logit contributions
 brav+0.130
pill+0.129
 ju+0.122
nam+0.120
 rightful+0.120
Neg logit contributions
;-0.177
uddled-0.176
wr-0.170
oves-0.170
irus-0.170
Token to
Token ID284
Feature activation-6.87e+01

Pos logit contributions
igsaw+0.169
nam+0.164
reating+0.162
 microsc+0.151
Mic+0.150
Neg logit contributions
 and-0.065
 watched-0.063
 shook-0.062
 sat-0.061
 blew-0.060
Token always
Token ID1464
Feature activation-5.94e+01

Pos logit contributions
nam+0.209
 Amen+0.202
Mic+0.192
igsaw+0.189
 avail+0.184
Neg logit contributions
 and-0.186
,-0.158
ed-0.150
uddled-0.149
owed-0.148
Token be
Token ID307
Feature activation-6.89e+01

Pos logit contributions
nam+0.177
 Amen+0.171
 avail+0.159
igsaw+0.153
 Deal+0.152
Neg logit contributions
 and-0.153
uddled-0.145
ed-0.134
,-0.125
owed-0.125
Token safe
Token ID3338
Feature activation-7.45e+01

Pos logit contributions
nam+0.181
 Amen+0.147
 redd+0.140
igsaw+0.137
Mic+0.134
Neg logit contributions
 and-0.235
ops-0.208
,-0.207
 rang-0.193
owed-0.191
Token and
Token ID290
Feature activation-4.26e+01

Pos logit contributions
nam+0.289
igsaw+0.261
 queens+0.259
keeping+0.259
 Amen+0.256
Neg logit contributions
 replied-0.111
 rang-0.105
 shook-0.097
 answered-0.093
owed-0.092
Token listen
Token ID6004
Feature activation-6.11e+01

Pos logit contributions
nam+0.149
igsaw+0.139
 ey+0.132
Mic+0.131
 Amen+0.130
Neg logit contributions
 and-0.120
,-0.104
.-0.091
 --0.084
!-0.084
Token to
Token ID284
Feature activation-5.95e+01

Pos logit contributions
nam+0.164
igsaw+0.158
 possible+0.157
 outer+0.146
rou+0.143
Neg logit contributions
 rang-0.143
 checked-0.130
int-0.128
irus-0.127
 scanned-0.127
Token her
Token ID607
Feature activation-8.21e+01

Pos logit contributions
nam+0.162
igsaw+0.145
Mic+0.133
 Amen+0.131
 microsc+0.126
Neg logit contributions
 and-0.204
,-0.182
 shook-0.163
 said-0.163
owed-0.161
Token't
Token ID470
Feature activation-7.32e+01

Pos logit contributions
nam+0.097
igsaw+0.096
 ey+0.089
Mic+0.088
 Amen+0.083
Neg logit contributions
 and-0.139
,-0.126
.-0.115
!-0.107
 --0.106
Token stop
Token ID2245
Feature activation-5.55e+01

Pos logit contributions
nam+0.269
igsaw+0.260
Mic+0.245
 ey+0.241
 Amen+0.238
Neg logit contributions
 and-0.191
,-0.165
.-0.139
 said-0.128
!-0.128
Token spinning
Token ID19493
Feature activation-5.46e+01

Pos logit contributions
nam+0.154
 microsc+0.132
 ey+0.131
igsaw+0.128
 avail+0.121
Neg logit contributions
 and-0.163
owed-0.156
,-0.143
 agreed-0.142
 said-0.139
Token it
Token ID340
Feature activation-4.52e+01

Pos logit contributions
nam+0.194
igsaw+0.192
 microsc+0.167
reating+0.165
 outer+0.164
Neg logit contributions
 and-0.106
 said-0.101
 replied-0.100
 answered-0.098
owed-0.098
Token in
Token ID287
Feature activation-5.50e+01

Pos logit contributions
igsaw+0.142
nam+0.126
 possible+0.122
 outer+0.112
 downward+0.109
Neg logit contributions
anas-0.113
 nodded-0.095
int-0.091
 watched-0.090
uddled-0.090
Token her
Token ID607
Feature activation-6.74e+01

Pos logit contributions
nam+0.184
igsaw+0.181
 ey+0.169
Mic+0.168
 microsc+0.160
Neg logit contributions
 and-0.163
,-0.144
.-0.126
 said-0.117
 --0.116
Token room
Token ID2119
Feature activation-5.52e+01

Pos logit contributions
nam+0.189
igsaw+0.186
 ey+0.168
Mic+0.157
 microsc+0.154
Neg logit contributions
 and-0.235
,-0.209
 said-0.187
.-0.186
 --0.185
Token.
Token ID13
Feature activation-3.17e+01

Pos logit contributions
nam+0.181
igsaw+0.174
 sill+0.159
reating+0.157
 microsc+0.150
Neg logit contributions
int-0.105
AAAA-0.090
owed-0.090
 replied-0.086
 tapped-0.084
Token From
Token ID3574
Feature activation-3.60e+01

Pos logit contributions
nam+0.110
igsaw+0.108
 ey+0.102
Mic+0.102
 Amen+0.097
Neg logit contributions
 and-0.092
,-0.081
.-0.071
!-0.064
 --0.064
Token that
Token ID326
Feature activation-4.02e+01

Pos logit contributions
nam+0.129
igsaw+0.120
 ey+0.116
Mic+0.115
 gol+0.109
Neg logit contributions
 and-0.081
,-0.069
.-0.062
owed-0.060
 came-0.055
Token day
Token ID1110
Feature activation-4.12e+01

Pos logit contributions
igsaw+0.101
nam+0.099
Mic+0.094
 Amen+0.089
 ey+0.086
Neg logit contributions
 and-0.143
,-0.127
.-0.120
 --0.115
 said-0.113
Token yard
Token ID12699
Feature activation-8.52e+01

Pos logit contributions
nam+0.300
igsaw+0.276
 ey+0.266
Mic+0.253
 microsc+0.244
Neg logit contributions
 and-0.360
,-0.313
owed-0.284
 --0.282
 said-0.280
Token.
Token ID13
Feature activation-6.91e+01

Pos logit contributions
nam+0.287
keeping+0.254
 sill+0.250
chard+0.241
 redd+0.233
Neg logit contributions
int-0.147
 rang-0.121
 tapped-0.119
le-0.115
 repeated-0.113
Token The
Token ID383
Feature activation-7.41e+01

Pos logit contributions
nam+0.178
igsaw+0.174
Mic+0.165
 ey+0.155
 Amen+0.151
Neg logit contributions
 and-0.240
,-0.217
.-0.197
 --0.193
!-0.192
Token dog
Token ID3290
Feature activation-6.58e+01

Pos logit contributions
nam+0.247
igsaw+0.242
 ey+0.229
Mic+0.229
 Amen+0.214
Neg logit contributions
 and-0.215
,-0.193
.-0.170
 --0.159
!-0.158
Token was
Token ID373
Feature activation-7.76e+01

Pos logit contributions
 Fro+0.107
 Ho+0.092
 microsc+0.090
 True+0.089
 Fl+0.089
Neg logit contributions
oor-0.209
ides-0.202
',"-0.200
eness-0.195
'."-0.192
Token very
Token ID845
Feature activation-8.60e+01

Pos logit contributions
nam+0.232
igsaw+0.211
Mic+0.210
 Amen+0.205
 ey+0.202
Neg logit contributions
 and-0.241
,-0.215
.-0.195
!-0.185
 --0.184
Token sad
Token ID6507
Feature activation-8.91e+01

Pos logit contributions
nam+0.236
igsaw+0.221
Mic+0.210
 ey+0.209
 Amen+0.204
Neg logit contributions
 and-0.298
,-0.264
.-0.240
 --0.230
!-0.229
Token and
Token ID290
Feature activation-5.11e+01

Pos logit contributions
igsaw+0.215
 microsc+0.203
 Lorenzo+0.202
arl+0.192
reating+0.191
Neg logit contributions
 rang-0.187
int-0.185
hed-0.185
eded-0.181
 spoke-0.181
Token hungry
Token ID14720
Feature activation-9.16e+01

Pos logit contributions
nam+0.188
 Amen+0.165
igsaw+0.161
 ey+0.160
 microsc+0.158
Neg logit contributions
 and-0.134
,-0.114
.-0.099
 --0.098
!-0.094
Token.
Token ID13
Feature activation-5.37e+01

Pos logit contributions
reating+0.253
 downward+0.237
 ju+0.234
nam+0.234
 microsc+0.233
Neg logit contributions
int-0.192
owed-0.169
 rang-0.163
-0.163
our-0.161
Token True
Token ID6407
Feature activation-8.26e+01

Pos logit contributions
nam+0.162
igsaw+0.158
Mic+0.150
 ey+0.148
 Amen+0.141
Neg logit contributions
 and-0.171
,-0.154
.-0.136
!-0.129
 --0.129
Token to
Token ID284
Feature activation-5.23e+01

Pos logit contributions
nam+0.169
 microsc+0.137
 redd+0.134
 ju+0.130
ptions+0.130
Neg logit contributions
 and-0.166
,-0.150
owed-0.146
le-0.140
 replied-0.136
Token leave
Token ID2666
Feature activation-4.39e+01

Pos logit contributions
nam+0.193
igsaw+0.187
Mic+0.179
 ey+0.177
 Amen+0.173
Neg logit contributions
 and-0.133
,-0.115
.-0.097
 --0.091
 said-0.089
Token the
Token ID262
Feature activation-6.37e+01

Pos logit contributions
nam+0.101
 microsc+0.075
 avail+0.070
 moderation+0.068
igsaw+0.068
Neg logit contributions
owed-0.139
int-0.136
 replied-0.134
 rang-0.132
 and-0.125
Token park
Token ID3952
Feature activation-5.27e+01

Pos logit contributions
nam+0.172
Mic+0.166
igsaw+0.162
 ey+0.153
 redd+0.148
Neg logit contributions
 and-0.204
,-0.196
owed-0.186
 --0.174
:-0.171
Token with
Token ID351
Feature activation-4.07e+01

Pos logit contributions
nam+0.193
igsaw+0.188
spe+0.163
 microsc+0.159
keeping+0.159
Neg logit contributions
int-0.088
 replied-0.071
 tapped-0.063
 nodded-0.062
 squeezed-0.059
Token their
Token ID511
Feature activation-8.03e+01

Pos logit contributions
nam+0.113
igsaw+0.096
 microsc+0.093
Mic+0.090
 ey+0.088
Neg logit contributions
 and-0.138
,-0.122
 --0.111
;-0.109
 replied-0.108
Token ruined
Token ID20484
Feature activation-4.16e+01

Pos logit contributions
nam+0.193
 microsc+0.167
igsaw+0.166
 ey+0.155
Mic+0.145
Neg logit contributions
 and-0.293
,-0.264
;-0.258
 --0.258
owed-0.256
Token dolls
Token ID36062
Feature activation-5.69e+01

Pos logit contributions
nam+0.100
igsaw+0.081
 microsc+0.080
 ey+0.077
 Tues+0.065
Neg logit contributions
int-0.135
owed-0.129
 spoke-0.128
ave-0.128
 and-0.128
Token.
Token ID13
Feature activation-3.23e+01

Pos logit contributions
nam+0.235
igsaw+0.225
 microsc+0.203
keeping+0.202
Mic+0.202
Neg logit contributions
 replied-0.068
int-0.058
 nodded-0.057
 shook-0.057
 answered-0.055
Token They
Token ID1119
Feature activation-3.75e+01

Pos logit contributions
nam+0.124
igsaw+0.122
 ey+0.117
Mic+0.116
 Amen+0.110
Neg logit contributions
 and-0.084
,-0.072
.-0.063
 said-0.055
!-0.053
Token did
Token ID750
Feature activation-4.89e+01

Pos logit contributions
nam+0.145
 Amen+0.111
Mic+0.110
 microsc+0.109
years+0.105
Neg logit contributions
 and-0.089
,-0.081
;-0.078
 --0.076
!-0.073
Token look
Token ID804
Feature activation-7.36e+01

Pos logit contributions
igsaw+0.198
 Amen+0.191
nam+0.175
 possible+0.167
 avail+0.166
Neg logit contributions
uddled-0.188
ops-0.159
 and-0.157
hed-0.152
opes-0.151
Token at
Token ID379
Feature activation-6.21e+01

Pos logit contributions
nam+0.247
ifying+0.204
 outer+0.204
year+0.201
years+0.197
Neg logit contributions
 rang-0.160
 roared-0.138
anas-0.132
 fought-0.132
 tapped-0.127
Token the
Token ID262
Feature activation-6.16e+01

Pos logit contributions
nam+0.182
igsaw+0.158
 avail+0.139
 microsc+0.139
 ey+0.133
Neg logit contributions
 and-0.198
,-0.177
owed-0.167
 got-0.165
 woke-0.163
Token tree
Token ID5509
Feature activation-5.09e+01

Pos logit contributions
nam+0.159
igsaw+0.146
 ey+0.142
 redd+0.131
 microsc+0.128
Neg logit contributions
 and-0.212
,-0.200
owed-0.192
 said-0.179
:-0.178
Token.
Token ID13
Feature activation-3.63e+01

Pos logit contributions
nam+0.208
igsaw+0.194
reating+0.171
 ju+0.171
 microsc+0.168
Neg logit contributions
 tapped-0.058
 squeezed-0.057
 shook-0.056
 woke-0.053
 watched-0.053
Token
Token ID220
Feature activation-8.26e+01

Pos logit contributions
igsaw+0.102
nam+0.095
Mic+0.092
 Amen+0.088
 Deal+0.082
Neg logit contributions
 and-0.120
,-0.110
 --0.096
.-0.096
!-0.095
Token
Token ID198
Feature activation-6.40e+01

Pos logit contributions
igsaw+0.264
nam+0.260
Mic+0.248
 ey+0.241
 Amen+0.230
Neg logit contributions
 and-0.242
,-0.214
.-0.183
 --0.176
!-0.174
Token
Token ID198
Feature activation-6.21e+01

Pos logit contributions
nam+0.227
igsaw+0.225
Mic+0.215
 ey+0.212
 Amen+0.203
Neg logit contributions
 and-0.176
,-0.155
.-0.134
!-0.121
 --0.121
TokenAfter
Token ID3260
Feature activation-3.30e+01

Pos logit contributions
igsaw+0.144
nam+0.141
Mic+0.140
 Amen+0.124
 ey+0.117
Neg logit contributions
 and-0.243
,-0.228
.-0.205
!-0.193
 because-0.192
Token a
Token ID257
Feature activation-6.07e+01

Pos logit contributions
nam+0.075
igsaw+0.074
 microsc+0.064
 recess+0.064
 Amen+0.059
Neg logit contributions
 and-0.111
owed-0.101
int-0.097
les-0.095
le-0.094
Token few
Token ID1178
Feature activation-5.03e+01

Pos logit contributions
igsaw+0.212
nam+0.206
Mic+0.205
 Amen+0.184
 redd+0.174
Neg logit contributions
 and-0.162
,-0.129
.-0.120
 --0.115
owed-0.115
TokenI
Token ID40
Feature activation-8.20e+01

Pos logit contributions
nam+0.167
Mic+0.164
igsaw+0.153
 Amen+0.143
 ey+0.142
Neg logit contributions
 and-0.317
,-0.306
.-0.271
 --0.259
!-0.256
Token should
Token ID815
Feature activation-7.04e+01

Pos logit contributions
 Amen+0.171
 seeker+0.119
 gol+0.119
 redd+0.117
iam+0.117
Neg logit contributions
 cheeks-0.264
iously-0.261
uddled-0.257
 and-0.251
opes-0.244
Token have
Token ID423
Feature activation-9.20e+01

Pos logit contributions
 Amen+0.186
nam+0.180
igsaw+0.175
Mic+0.160
 avail+0.157
Neg logit contributions
 and-0.220
owed-0.183
ed-0.177
,-0.173
uddled-0.167
Token the
Token ID262
Feature activation-8.69e+01

Pos logit contributions
nam+0.292
 queens+0.212
Mic+0.204
ptions+0.202
Looks+0.200
Neg logit contributions
 and-0.283
owed-0.259
ry-0.245
;-0.239
le-0.235
Token books
Token ID3835
Feature activation-9.56e+01

Pos logit contributions
nam+0.202
igsaw+0.196
 redd+0.179
Mic+0.178
 ey+0.176
Neg logit contributions
 and-0.324
,-0.297
owed-0.281
:-0.278
 --0.270
Tokenhe
Token ID258
Feature activation-1.72e+02

Pos logit contributions
igsaw+0.242
nam+0.224
reating+0.201
arl+0.172
 redd+0.172
Neg logit contributions
 watched-0.267
 approached-0.267
 stepped-0.262
 spoke-0.261
 tapped-0.260
Tokenlf
Token ID1652
Feature activation-1.25e+02

Pos logit contributions
igsaw+0.152
 Amen+0.109
nam+0.086
reating+0.062
 Wed+0.058
Neg logit contributions
 and-0.761
 shook-0.746
owed-0.727
 touched-0.727
 calmed-0.721
Token,
Token ID11
Feature activation-5.75e+01

Pos logit contributions
igsaw+0.449
nam+0.449
reating+0.396
 queens+0.391
 redd+0.362
Neg logit contributions
 and-0.213
 replied-0.212
 approached-0.211
 shook-0.207
 watched-0.207
Token because
Token ID780
Feature activation-4.72e+01

Pos logit contributions
nam+0.216
 ey+0.189
 microsc+0.186
 redd+0.183
igsaw+0.182
Neg logit contributions
ed-0.110
hed-0.108
owed-0.105
 approached-0.100
 and-0.098
Token it
Token ID340
Feature activation-6.40e+01

Pos logit contributions
nam+0.151
igsaw+0.144
 ey+0.137
Mic+0.135
 Amen+0.129
Neg logit contributions
 and-0.148
,-0.132
.-0.118
!-0.109
 --0.108
Token is
Token ID318
Feature activation-6.40e+01

Pos logit contributions
nam+0.195
igsaw+0.186
 ey+0.183
 microsc+0.182
 redd+0.181
Neg logit contributions
 and-0.162
,-0.148
;-0.137
 --0.135
.-0.130
Token!
Token ID0
Feature activation-8.33e+01

Pos logit contributions
 ey+0.551
nam+0.500
 redd+0.464
igsaw+0.456
amental+0.442
Neg logit contributions
owed-0.379
 explained-0.377
 listened-0.365
 told-0.365
 entered-0.358
Token Alex
Token ID4422
Feature activation-1.12e+02

Pos logit contributions
nam+0.179
igsaw+0.178
 Amen+0.176
Mic+0.169
 ey+0.148
Neg logit contributions
 and-0.285
,-0.259
owed-0.242
le-0.239
 listened-0.238
Token had
Token ID550
Feature activation-1.01e+02

Pos logit contributions
 Alex+0.226
 spec+0.220
 Maya+0.215
pill+0.215
Alex+0.212
Neg logit contributions
uddled-0.249
orting-0.235
uning-0.230
 depress-0.228
int-0.224
Token used
Token ID973
Feature activation-6.66e+01

Pos logit contributions
nam+0.272
 redd+0.233
 microsc+0.233
Mic+0.218
 ju+0.216
Neg logit contributions
 and-0.318
,-0.310
owed-0.293
;-0.257
.-0.254
Token past
Token ID1613
Feature activation-1.12e+02

Pos logit contributions
nam+0.179
igsaw+0.170
 microsc+0.154
 ey+0.153
 redd+0.146
Neg logit contributions
 and-0.210
,-0.192
owed-0.179
 woke-0.174
;-0.170
Token-
Token ID12
Feature activation-1.75e+02

Pos logit contributions
nam+0.327
igsaw+0.314
 ey+0.304
 microsc+0.298
Mic+0.287
Neg logit contributions
 and-0.336
,-0.301
 said-0.269
.-0.268
 gave-0.263
Tokendate
Token ID4475
Feature activation-1.22e+02

Pos logit contributions
igsaw+0.505
nam+0.502
Mic+0.468
 ey+0.464
 microsc+0.440
Neg logit contributions
 and-0.577
,-0.524
.-0.467
 said-0.437
!-0.431
Token cheese
Token ID9891
Feature activation-1.02e+02

Pos logit contributions
nam+0.399
 microsc+0.377
igsaw+0.364
 math+0.344
 ey+0.337
Neg logit contributions
 died-0.225
 blew-0.200
-0.199
 woke-0.191
use-0.182
Token!
Token ID0
Feature activation-6.06e+01

Pos logit contributions
igsaw+0.363
Mic+0.354
 ju+0.341
 microsc+0.340
 ey+0.332
Neg logit contributions
 rang-0.140
 woke-0.123
 died-0.122
 listened-0.119
 entered-0.117
Token Sam
Token ID3409
Feature activation-1.02e+02

Pos logit contributions
igsaw+0.167
nam+0.158
Mic+0.152
 ey+0.146
 Amen+0.143
Neg logit contributions
 and-0.211
,-0.191
.-0.169
!-0.162
 --0.162
Token yelled
Token ID22760
Feature activation-6.26e+01

Pos logit contributions
Mic+0.256
die+0.240
pill+0.234
rou+0.213
years+0.208
Neg logit contributions
our-0.222
wr-0.219
 Prop-0.214
ides-0.213
orting-0.211
Token's
Token ID338
Feature activation-1.44e+02

Pos logit contributions
,+0.156
;+0.152
 and+0.147
",+0.146
 agree+0.144
Neg logit contributions
Mic-0.179
nam-0.173
 microsc-0.172
 ey-0.170
pill-0.168
Token mom
Token ID1995
Feature activation-1.11e+02

Pos logit contributions
nam+0.409
 Amen+0.363
igsaw+0.353
 ey+0.352
 redd+0.327
Neg logit contributions
 and-0.435
,-0.403
owed-0.376
hed-0.352
 came-0.352
Token was
Token ID373
Feature activation-1.12e+02

Pos logit contributions
 Amen+0.293
rou+0.287
my+0.221
Acc+0.218
nam+0.210
Neg logit contributions
ided-0.218
edly-0.217
ves-0.211
Exit-0.205
 pierced-0.203
Token even
Token ID772
Feature activation-9.41e+01

Pos logit contributions
nam+0.277
 Amen+0.257
igsaw+0.200
rou+0.195
years+0.189
Neg logit contributions
 and-0.361
owed-0.340
ops-0.334
,-0.317
ly-0.313
Token ang
Token ID3550
Feature activation-1.24e+02

Pos logit contributions
nam+0.266
 redd+0.219
 Amen+0.212
igsaw+0.205
 ey+0.203
Neg logit contributions
 and-0.321
,-0.288
 --0.259
.-0.256
;-0.253
Tokenrier
Token ID5277
Feature activation-1.94e+02

Pos logit contributions
nam+0.173
igsaw+0.165
 ey+0.145
Mic+0.144
 Amen+0.122
Neg logit contributions
 and-0.618
,-0.577
.-0.539
!-0.510
 --0.509
Token and
Token ID290
Feature activation-6.92e+01

Pos logit contributions
rou+0.643
 gol+0.628
rier+0.624
 Amen+0.621
arl+0.608
Neg logit contributions
 searched-0.206
 approached-0.182
 entered-0.181
 squeezed-0.174
 rubbed-0.173
Token Lily
Token ID20037
Feature activation-1.12e+02

Pos logit contributions
nam+0.251
 Amen+0.224
igsaw+0.224
 redd+0.219
Mic+0.218
Neg logit contributions
 and-0.181
,-0.153
.-0.134
 --0.130
;-0.125
Token felt
Token ID2936
Feature activation-7.43e+01

Pos logit contributions
rou+0.297
iam+0.277
uka+0.261
Looks+0.243
ott+0.242
Neg logit contributions
 than-0.230
 because-0.189
action-0.183
;-0.182
 and-0.179
Token even
Token ID772
Feature activation-7.46e+01

Pos logit contributions
nam+0.181
 Amen+0.152
igsaw+0.151
 ey+0.150
 redd+0.143
Neg logit contributions
 and-0.260
,-0.232
owed-0.211
!-0.202
 rang-0.201
Token sad
Token ID6507
Feature activation-9.83e+01

Pos logit contributions
nam+0.218
igsaw+0.202
 ey+0.194
Mic+0.191
 Amen+0.187
Neg logit contributions
 and-0.251
,-0.225
.-0.200
 --0.191
!-0.188
Token the
Token ID262
Feature activation-6.51e+01

Pos logit contributions
nam+0.130
igsaw+0.127
 microsc+0.114
 avail+0.114
Mic+0.112
Neg logit contributions
 and-0.154
owed-0.132
,-0.127
.-0.121
:-0.117
Token farmer
Token ID18739
Feature activation-5.35e+01

Pos logit contributions
Mic+0.162
nam+0.161
igsaw+0.160
 Amen+0.142
 redd+0.138
Neg logit contributions
 and-0.236
,-0.207
owed-0.199
.-0.193
 --0.191
Token scattered
Token ID16830
Feature activation-3.17e+01

Pos logit contributions
Mic+0.133
saw+0.132
 gol+0.130
amental+0.128
 av+0.123
Neg logit contributions
 because-0.107
;-0.106
 than-0.105
."-0.099
.-0.097
Token the
Token ID262
Feature activation-6.15e+01

Pos logit contributions
igsaw+0.079
nam+0.074
 ey+0.069
Mic+0.064
 microsc+0.057
Neg logit contributions
owed-0.101
anas-0.097
uddled-0.092
our-0.088
 rang-0.087
Token av
Token ID1196
Feature activation-1.93e+02

Pos logit contributions
nam+0.113
Mic+0.109
igsaw+0.107
 redd+0.097
 ey+0.092
Neg logit contributions
 and-0.260
,-0.243
owed-0.238
 said-0.228
:-0.226
Tokenoc
Token ID420
Feature activation-1.74e+02

Pos logit contributions
nam+0.845
igsaw+0.755
 ey+0.752
 microsc+0.743
 Deal+0.738
Neg logit contributions
 and-0.505
,-0.431
.-0.326
!-0.323
 --0.295
Tokenados
Token ID22484
Feature activation-6.91e+01

Pos logit contributions
igsaw+0.388
nam+0.380
Mic+0.360
 ey+0.357
 Amen+0.316
Neg logit contributions
 and-0.702
,-0.638
.-0.593
 said-0.575
!-0.553
Token,
Token ID11
Feature activation-3.39e+01

Pos logit contributions
nam+0.176
 ju+0.163
igsaw+0.158
hel+0.155
 ey+0.152
Neg logit contributions
anas-0.179
anna-0.154
aron-0.139
 Birthday-0.137
ana-0.136
Token he
Token ID339
Feature activation-5.27e+01

Pos logit contributions
nam+0.096
igsaw+0.077
 microsc+0.075
Mic+0.073
 redd+0.072
Neg logit contributions
 and-0.111
,-0.100
owed-0.098
:-0.094
.-0.092
Token noticed
Token ID6810
Feature activation-2.56e+01

Pos logit contributions
 microsc+0.127
nam+0.123
years+0.121
rou+0.113
 ey+0.111
Neg logit contributions
 because-0.112
;-0.110
orting-0.109
 and-0.106
round-0.106
Token one
Token ID530
Feature activation-5.68e+01

Pos logit contributions
nam+0.063
 microsc+0.060
igsaw+0.056
Mic+0.050
 outer+0.050
Neg logit contributions
 and-0.072
owed-0.067
,-0.065
our-0.062
 agree-0.062
Token driver
Token ID4639
Feature activation-4.61e+01

Pos logit contributions
nam+0.122
Mic+0.100
 gol+0.098
 Amen+0.096
 redd+0.096
Neg logit contributions
 and-0.153
owed-0.152
,-0.142
 --0.142
;-0.140
Token would
Token ID561
Feature activation-5.20e+01

Pos logit contributions
spe+0.124
rou+0.100
saw+0.097
iam+0.093
 brav+0.090
Neg logit contributions
 iv-0.099
;-0.096
wr-0.093
 —-0.091
ulp-0.089
Token greet
Token ID12589
Feature activation-5.05e+01

Pos logit contributions
 Amen+0.127
nam+0.120
 outer+0.101
igsaw+0.099
 ey+0.094
Neg logit contributions
ed-0.167
uddled-0.163
owed-0.155
led-0.147
 and-0.135
Token the
Token ID262
Feature activation-5.78e+01

Pos logit contributions
nam+0.106
 microsc+0.083
reating+0.079
igsaw+0.078
years+0.076
Neg logit contributions
 fell-0.156
 and-0.154
 checked-0.151
 rang-0.151
owed-0.149
Token po
Token ID745
Feature activation-5.62e+01

Pos logit contributions
nam+0.145
 gol+0.117
 redd+0.115
Mic+0.109
 ju+0.107
Neg logit contributions
owed-0.216
 and-0.194
:-0.192
,-0.189
;-0.186
Tokenpp
Token ID381
Feature activation-1.73e+02

Pos logit contributions
nam+0.161
igsaw+0.156
 ey+0.151
Mic+0.143
 microsc+0.137
Neg logit contributions
 and-0.201
,-0.181
.-0.164
!-0.153
 --0.151
Tokenies
Token ID444
Feature activation-6.21e+01

Pos logit contributions
nam+0.525
igsaw+0.521
 redd+0.496
Mic+0.488
 ey+0.484
Neg logit contributions
 and-0.450
,-0.413
 said-0.375
 shook-0.364
.-0.361
Token and
Token ID290
Feature activation-2.34e+01

Pos logit contributions
nam+0.204
 outer+0.179
igsaw+0.178
reating+0.176
 Amen+0.172
Neg logit contributions
 died-0.121
 lied-0.112
 rang-0.110
ANG-0.109
 slapped-0.106
Token thank
Token ID5875
Feature activation-3.98e+01

Pos logit contributions
nam+0.083
 Amen+0.068
igsaw+0.068
 microsc+0.067
 ey+0.066
Neg logit contributions
 and-0.059
,-0.053
 --0.048
.-0.048
;-0.048
Token them
Token ID606
Feature activation-6.58e+01

Pos logit contributions
nam+0.108
Mic+0.086
igsaw+0.084
 avail+0.078
 microsc+0.074
Neg logit contributions
 and-0.142
,-0.126
 --0.116
;-0.113
 shook-0.112
Token for
Token ID329
Feature activation-4.03e+01

Pos logit contributions
reating+0.202
igsaw+0.197
nam+0.190
 microsc+0.183
 outer+0.172
Neg logit contributions
 shook-0.165
 died-0.144
 checked-0.143
 scanned-0.142
 tapped-0.139