TOP ACTIVATIONS
MAX = 0.292

a rabbit , a squirrel and a fox . They all
, Ben feels his nose and ears are cold . He
a hat and a fish and a dish . They laugh
a bird , a fish and a dog . They wrapped
a bird and a fish and a ..." They
's friends , a squirrel and a bird , saw the
a cow , a duck and a snake . Lily repeated
a man with a beard and a robe . He looks
a boat , a fish and a frog . They make
a boat , a fish and a frog . She liked
a cat and a hat and a fish and a dish
. Tim saw a squirrel and wanted to play with it
. He is a squirrel and he lives in the tree
a boat , a frog and a cup . They made
a prince with a crown and a sword . She used
a hat and a beard and a big smile . He
. Tom is a clown and Jen is a dancer .
furry animals - a squirrel and a mouse . The mouse
. Spot saw a squirrel and wanted to chase it .
eddy bear , a ball and a book . She thinks

MIN ACTIVATIONS
MIN = -0.296

sign ' happy ', ' birth day ', and ' you
should be compassionate . Comp assion ate means you care about
. " Mar riage is when two people love
by saying , " Mar riage is when two people who
cat said , " Mar riage is when two animals love
sky before they walked fore shadow ed the down pour that
, but it was fore shadow ed in the beginning when
heavy bag had been fore shadow ed , and that setup
" Cl osed - Fore shadow ing Storm ".
was the setup that fore shadow ed the bad ending .
go . " Which way , Tom ?" Lily
that it was a ' shadow ' of her own shape
herbs . " Which butter do we need ,
story . " Which story do you want ,
. He had been fore shadow ed about this bad ending
eat . " Which cereal should I load ?"
long . " Which one should we get ,
that they were going to outer space . They watched out
, and even went to outer space ! It was the
girl remembered the earlier fore shadow ing , and it saved

INTERVAL 0.145 - 0.292
CONTAINS 6.302%

three , four , five , six , seven , eight
. The shade is cool and lovely . They feel happy
was walking by the park and she spotted a quiet rag
the living room with Mom and Dad . They pick some
She saw all the animals and creatures below . She spread

INTERVAL -0.002 - 0.145
CONTAINS 42.393%

happy . He was so excited that he started to tw
dragon was very scary and Tim my didn 't know what
and found lots of other fun games to play . Eventually
best food she ever tasted ! She thought she would go
Tim and Anna look at the box . They

INTERVAL -0.149 - -0.002
CONTAINS 49.611%

saw the bird again . The bird had a little envelope
heavy for her . Just then , her big
she looked around and saw a young boy . He was
ball again ." L ila and Ben hugged Sam
no more . They realized that the lightning had come to

INTERVAL -0.296 - -0.149
CONTAINS 1.694%

playing together . <|endoftext|> Once upon a time , there was
rock and pushed it to the oy ster . Tim said
€ He grabbed the cr ay on and wrote
stroll in the park ! <|endoftext|> In the morning , Mama
her task ! <|endoftext|> Once upon a time , there was
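
The four INTERVAL ranges above are consistent with splitting the span between MIN (-0.296) and MAX (0.292) into four equal-width bins of width 0.147, and the CONTAINS percentages sum to 100%. The following is a minimal sketch of that binning, assuming equal-width bins and an inclusive top edge; the function names are hypothetical and this is not necessarily how the dashboard itself computes them.

```python
def equal_width_bins(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width (start, end) pairs, highest bin first."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    edges[-1] = hi  # pin the last edge so the maximum lands exactly on it
    return [(edges[i], edges[i + 1]) for i in reversed(range(n_bins))]


def percent_per_bin(activations, bins):
    """Percentage of activations per bin (assumption: only the top edge is inclusive)."""
    total = len(activations)
    top = max(end for _, end in bins)
    result = []
    for start, end in bins:
        count = sum(
            1 for a in activations
            if start <= a < end or (end == top and a == end)
        )
        result.append(100.0 * count / total)
    return result


# MIN and MAX values taken from the listing above.
bins = equal_width_bins(-0.296, 0.292)
for start, end in bins:
    print(f"INTERVAL {start:.3f} - {end:.3f}")
```

Running this prints the same four interval ranges as the listing. Passing a model's full list of per-token activations to `percent_per_bin` would give the corresponding CONTAINS figures under the same assumptions.
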
Token a
Token ID257
Feature activation-7.33e-02

Pos logit contributions
ceed+0.176
Whe+0.155
V+0.153
Three+0.152
 quickest+0.148
Neg logit contributions
ies-0.197
 coughing-0.181
atted-0.178
anged-0.175
 throats-0.174
Token rabbit
Token ID22746
Feature activation+5.89e-02

Pos logit contributions
ceed+0.114
Whe+0.101
V+0.099
Three+0.099
hest+0.096
Neg logit contributions
ies-0.122
 coughing-0.112
atted-0.110
anged-0.108
 Tim-0.107
Token,
Token ID11
Feature activation+1.92e-01

Pos logit contributions
ies+0.063
 throats+0.058
atted+0.057
anged+0.056
bie+0.054
Neg logit contributions
ceed-0.120
 Animal-0.112
 Believe-0.111
lor-0.109
hest-0.109
Token a
Token ID257
Feature activation-4.15e-02

Pos logit contributions
ies+0.349
atted+0.319
anged+0.318
 throats+0.315
 coughing+0.313
Neg logit contributions
ceed-0.265
Whe-0.231
V-0.230
Three-0.226
 Believe-0.222
Token squirrel
Token ID33039
Feature activation+1.24e-01

Pos logit contributions
ceed+0.067
Whe+0.060
V+0.060
Three+0.059
 quickest+0.058
Neg logit contributions
ies-0.065
 coughing-0.059
atted-0.059
anged-0.058
 throats-0.057
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
 throats+0.133
ies+0.120
atted+0.116
anged+0.113
 tongues+0.112
Neg logit contributions
 Animal-0.234
ceed-0.233
hest-0.226
lor-0.221
 Zoo-0.221
Token a
Token ID257
Feature activation-2.29e-02

Pos logit contributions
ies+0.385
 coughing+0.350
atted+0.347
anged+0.338
 throats+0.337
Neg logit contributions
ceed-0.546
V-0.493
Three-0.490
Whe-0.489
 quickest-0.472
Token fox
Token ID21831
Feature activation+8.33e-02

Pos logit contributions
ceed+0.039
Whe+0.035
V+0.035
Three+0.034
 quickest+0.033
Neg logit contributions
ies-0.034
 coughing-0.031
atted-0.031
anged-0.030
 throats-0.030
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
ies+0.103
 throats+0.093
 coughing+0.092
atted+0.092
anged+0.089
Neg logit contributions
ceed-0.154
 Animal-0.138
V-0.137
 Believe-0.137
lor-0.136
Token They
Token ID1119
Feature activation+5.15e-02

Pos logit contributions
 Tim+0.031
ies+0.030
 they+0.029
 coughing+0.028
atted+0.028
Neg logit contributions
ceed-0.045
Whe-0.040
lor-0.039
hest-0.039
redible-0.039
Token all
Token ID477
Feature activation+5.69e-02

Pos logit contributions
ies+0.061
 coughing+0.056
atted+0.054
 throats+0.053
anged+0.052
Neg logit contributions
ceed-0.101
V-0.093
Three-0.092
Whe-0.091
 quickest-0.089
Token,
Token ID11
Feature activation+1.60e-01

Pos logit contributions
bie+0.024
ies+0.024
atted+0.023
attered+0.022
ety+0.022
Neg logit contributions
 middle-0.033
Three-0.033
ceed-0.032
V-0.031
 date-0.031
Token Ben
Token ID3932
Feature activation+6.49e-02

Pos logit contributions
atted+0.162
ies+0.156
anged+0.143
lee+0.143
 coughing+0.139
Neg logit contributions
ceed-0.335
Whe-0.334
 Ready-0.326
Ready-0.324
Three-0.321
Token feels
Token ID5300
Feature activation-3.23e-02

Pos logit contributions
ies+0.108
 Joey+0.099
atted+0.097
bie+0.097
 coughing+0.096
Neg logit contributions
ceed-0.097
Whe-0.090
V-0.090
 Ready-0.087
Three-0.086
Token his
Token ID465
Feature activation+4.31e-02

Pos logit contributions
ceed+0.059
Whe+0.056
V+0.055
 biggest+0.053
 quickest+0.053
Neg logit contributions
ies-0.043
atted-0.037
anged-0.037
bie-0.036
 throats-0.036
Token nose
Token ID9686
Feature activation+5.59e-02

Pos logit contributions
ies+0.061
 coughing+0.056
atted+0.056
 Joey+0.055
 Tim+0.055
Neg logit contributions
ceed-0.072
Whe-0.068
 biggest-0.067
 quickest-0.066
 fastest-0.064
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
ies+0.074
atted+0.068
anged+0.067
bie+0.067
 coughing+0.066
Neg logit contributions
ceed-0.102
V-0.090
hest-0.090
Whe-0.090
 quickest-0.089
Token ears
Token ID11368
Feature activation+1.00e-01

Pos logit contributions
ies+0.369
dd+0.347
anged+0.337
atted+0.336
 badly+0.325
Neg logit contributions
Whe-0.500
ceed-0.499
 biggest-0.477
V-0.473
 middle-0.464
Token are
Token ID389
Feature activation-1.54e-02

Pos logit contributions
ies+0.126
atted+0.112
anged+0.109
bie+0.109
 coughing+0.109
Neg logit contributions
ceed-0.194
Whe-0.176
V-0.176
hest-0.171
Three-0.171
Token cold
Token ID4692
Feature activation+7.82e-02

Pos logit contributions
ceed+0.027
 quickest+0.025
 fastest+0.025
V+0.025
 biggest+0.024
Neg logit contributions
ies-0.019
anged-0.018
 Joey-0.018
atted-0.017
bie-0.017
Token.
Token ID13
Feature activation+2.83e-02

Pos logit contributions
atted+0.134
bie+0.130
 couldn+0.126
 Joey+0.125
anged+0.125
Neg logit contributions
ceed-0.103
Whe-0.098
hest-0.098
 middle-0.097
 quickest-0.092
Token He
Token ID679
Feature activation+6.65e-02

Pos logit contributions
ies+0.038
atted+0.032
 coughing+0.032
anged+0.032
i+0.032
Neg logit contributions
ceed-0.054
Whe-0.051
V-0.050
Three-0.049
 Ready-0.048
Token a
Token ID257
Feature activation-2.91e-02

Pos logit contributions
ies+0.495
atted+0.447
 coughing+0.446
anged+0.439
bie+0.432
Neg logit contributions
ceed-0.388
Whe-0.340
V-0.338
Three-0.331
 quickest-0.321
Token hat
Token ID6877
Feature activation+4.69e-02

Pos logit contributions
ceed+0.050
Whe+0.045
V+0.044
Three+0.044
hest+0.043
Neg logit contributions
ies-0.043
 coughing-0.040
atted-0.038
 Tim-0.038
 throats-0.038
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
ies+0.047
 coughing+0.040
atted+0.039
anged+0.038
 throats+0.038
Neg logit contributions
ceed-0.103
Whe-0.095
V-0.094
Three-0.094
hest-0.092
Token a
Token ID257
Feature activation-1.94e-02

Pos logit contributions
ies+0.459
 coughing+0.434
 they+0.426
 Tim+0.424
atted+0.416
Neg logit contributions
ceed-0.449
Whe-0.398
redible-0.392
Three-0.391
V-0.390
Token fish
Token ID5916
Feature activation+7.60e-02

Pos logit contributions
ceed+0.034
Whe+0.031
hest+0.030
Three+0.030
V+0.030
Neg logit contributions
ies-0.028
 coughing-0.026
 they-0.025
 throats-0.025
 Tim-0.025
Token and
Token ID290
Feature activation+2.91e-01

Pos logit contributions
ies+0.090
atted+0.081
 coughing+0.080
anged+0.078
 throats+0.076
Neg logit contributions
ceed-0.151
V-0.136
Whe-0.136
Three-0.135
 quickest-0.133
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
ies+0.485
 coughing+0.453
 Tim+0.440
 they+0.439
atted+0.438
Neg logit contributions
ceed-0.446
Whe-0.394
V-0.388
Three-0.387
redible-0.382
Token dish
Token ID9433
Feature activation+6.02e-02

Pos logit contributions
ceed+0.025
Whe+0.022
V+0.022
Three+0.022
hest+0.022
Neg logit contributions
ies-0.021
 coughing-0.019
 throats-0.018
 Tim-0.018
 they-0.018
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
ies+0.073
atted+0.068
 coughing+0.067
anged+0.065
 Tim+0.064
Neg logit contributions
ceed-0.115
Three-0.103
Whe-0.103
V-0.102
hest-0.102
Token They
Token ID1119
Feature activation+4.90e-02

Pos logit contributions
ies+0.030
 Tim+0.030
 they+0.029
 coughing+0.028
atted+0.028
Neg logit contributions
ceed-0.039
Whe-0.034
hest-0.034
 quickest-0.034
Three-0.034
Token laugh
Token ID6487
Feature activation+4.06e-02

Pos logit contributions
ies+0.082
 coughing+0.075
 they+0.074
 Tim+0.074
i+0.073
Neg logit contributions
ceed-0.075
Whe-0.066
lor-0.065
V-0.065
Three-0.064
Token a
Token ID257
Feature activation-4.32e-02

Pos logit contributions
ies+0.282
 coughing+0.258
atted+0.251
 Tim+0.249
 they+0.246
Neg logit contributions
ceed-0.314
Whe-0.281
V-0.277
Three-0.275
 quickest-0.270
Token bird
Token ID6512
Feature activation+5.98e-02

Pos logit contributions
ceed+0.075
Whe+0.068
Three+0.066
 quickest+0.065
V+0.065
Neg logit contributions
ies-0.064
 coughing-0.058
 they-0.057
 Tim-0.056
 and-0.055
Token,
Token ID11
Feature activation+1.97e-01

Pos logit contributions
ies+0.042
 coughing+0.036
atted+0.035
anged+0.033
 throats+0.032
Neg logit contributions
ceed-0.147
Whe-0.135
V-0.135
Three-0.135
hest-0.133
Token a
Token ID257
Feature activation-3.39e-02

Pos logit contributions
ies+0.266
 coughing+0.240
atted+0.233
 Tim+0.231
 they+0.228
Neg logit contributions
ceed-0.365
Whe-0.330
V-0.326
Three-0.324
 quickest-0.319
Token fish
Token ID5916
Feature activation+7.10e-02

Pos logit contributions
ceed+0.059
Whe+0.053
Three+0.052
V+0.051
hest+0.051
Neg logit contributions
ies-0.050
 coughing-0.045
 they-0.044
 Tim-0.044
 throats-0.043
Token and
Token ID290
Feature activation+2.90e-01

Pos logit contributions
ies+0.047
 coughing+0.038
atted+0.036
anged+0.034
 throats+0.033
Neg logit contributions
ceed-0.180
Whe-0.167
V-0.166
Three-0.166
hest-0.163
Token a
Token ID257
Feature activation-1.98e-02

Pos logit contributions
ies+0.497
 coughing+0.457
 Tim+0.447
atted+0.446
 they+0.446
Neg logit contributions
ceed-0.434
Whe-0.383
V-0.376
Three-0.374
hest-0.367
Token dog
Token ID3290
Feature activation+8.95e-02

Pos logit contributions
ceed+0.034
Whe+0.031
V+0.030
Three+0.030
hest+0.029
Neg logit contributions
ies-0.030
 coughing-0.027
 Tim-0.026
 they-0.026
atted-0.026
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
ies+0.092
 coughing+0.080
atted+0.078
anged+0.075
 throats+0.075
Neg logit contributions
ceed-0.194
Whe-0.178
V-0.177
Three-0.176
hest-0.173
Token They
Token ID1119
Feature activation+5.09e-02

Pos logit contributions
ies+0.024
 they+0.024
 Tim+0.024
 coughing+0.023
 throats+0.022
Neg logit contributions
ceed-0.049
 quickest-0.044
Whe-0.044
redible-0.044
hest-0.044
Token wrapped
Token ID12908
Feature activation-1.53e-02

Pos logit contributions
ies+0.059
 coughing+0.052
atted+0.050
 they+0.050
 Tim+0.050
Neg logit contributions
ceed-0.105
Whe-0.096
V-0.095
Three-0.094
 quickest-0.093
Token a
Token ID257
Feature activation-2.52e-02

Pos logit contributions
ies+0.394
 they+0.386
 Tim+0.380
 coughing+0.369
 and+0.357
Neg logit contributions
ceed-0.450
lor-0.418
Whe-0.413
V-0.406
redible-0.406
Token bird
Token ID6512
Feature activation+7.29e-02

Pos logit contributions
ceed+0.043
Whe+0.038
V+0.038
Three+0.038
 quickest+0.037
Neg logit contributions
ies-0.038
 coughing-0.034
atted-0.034
anged-0.033
 throats-0.033
Token and
Token ID290
Feature activation+2.79e-01

Pos logit contributions
ies+0.102
 coughing+0.094
atted+0.093
anged+0.091
 Tim+0.090
Neg logit contributions
ceed-0.129
Whe-0.116
V-0.114
Three-0.114
hest-0.112
Token a
Token ID257
Feature activation-1.78e-02

Pos logit contributions
ies+0.418
 they+0.404
 Tim+0.398
 coughing+0.387
 and+0.376
Neg logit contributions
ceed-0.462
lor-0.424
Whe-0.422
redible-0.419
V-0.416
Token fish
Token ID5916
Feature activation+7.75e-02

Pos logit contributions
ceed+0.030
Whe+0.027
V+0.027
Three+0.027
 quickest+0.026
Neg logit contributions
ies-0.027
 coughing-0.024
atted-0.024
anged-0.023
 Tim-0.023
Token and
Token ID290
Feature activation+2.90e-01

Pos logit contributions
ies+0.106
 coughing+0.096
atted+0.095
anged+0.092
 Tim+0.092
Neg logit contributions
ceed-0.141
Whe-0.127
V-0.126
Three-0.125
hest-0.123
Token a
Token ID257
Feature activation-1.48e-02

Pos logit contributions
ies+0.454
 coughing+0.417
 they+0.416
 Tim+0.416
 throats+0.403
Neg logit contributions
ceed-0.472
Whe-0.424
V-0.421
lor-0.420
Three-0.417
Token..."
Token ID9313
Feature activation-2.18e-02

Pos logit contributions
ceed+0.025
Whe+0.023
V+0.022
Three+0.022
hest+0.022
Neg logit contributions
ies-0.022
 coughing-0.020
atted-0.020
anged-0.019
 Tim-0.019
Token\n
Token ID198
Feature activation-1.22e-01

Pos logit contributions
ceed+0.027
Whe+0.025
V+0.024
Three+0.023
 Ready+0.022
Neg logit contributions
ies-0.043
 coughing-0.039
atted-0.039
anged-0.038
 throats-0.038
Token\n
Token ID198
Feature activation-1.28e-01

Pos logit contributions
ceed+0.191
 quickest+0.173
Whe+0.171
Three+0.165
 exact+0.163
Neg logit contributions
ies-0.205
 they-0.188
 coughing-0.187
 Tim-0.186
 and-0.182
TokenThey
Token ID2990
Feature activation+3.93e-02

Pos logit contributions
ceed+0.205
Whe+0.183
 quickest+0.182
Three+0.176
hest+0.175
Neg logit contributions
ies-0.208
 coughing-0.189
 they-0.189
 Tim-0.181
 throats-0.180
Token's
Token ID338
Feature activation-2.36e-02

Pos logit contributions
ies+0.136
 coughing+0.126
 throats+0.122
atted+0.120
anged+0.117
Neg logit contributions
ceed-0.207
V-0.188
Three-0.187
 Believe-0.185
 quickest-0.185
Token friends
Token ID2460
Feature activation+5.51e-02

Pos logit contributions
ceed+0.032
 biggest+0.030
 date+0.028
 quickest+0.027
 nic+0.027
Neg logit contributions
ies-0.040
anged-0.038
i-0.038
 coughing-0.038
atted-0.038
Token,
Token ID11
Feature activation+1.84e-01

Pos logit contributions
ies+0.062
 coughing+0.055
atted+0.054
 throats+0.053
anged+0.052
Neg logit contributions
ceed-0.114
Whe-0.104
V-0.103
Three-0.103
 quickest-0.101
Token a
Token ID257
Feature activation-3.86e-02

Pos logit contributions
ies+0.235
atted+0.212
anged+0.211
 throats+0.208
 coughing+0.205
Neg logit contributions
ceed-0.336
 Animal-0.316
Whe-0.311
 Believe-0.310
V-0.310
Token squirrel
Token ID33039
Feature activation+1.24e-01

Pos logit contributions
ceed+0.065
Whe+0.059
hest+0.057
V+0.056
Three+0.056
Neg logit contributions
ies-0.060
 coughing-0.055
 Tim-0.053
 they-0.053
atted-0.051
Token and
Token ID290
Feature activation+2.88e-01

Pos logit contributions
 throats+0.139
 coughing+0.125
 tongues+0.124
ies+0.123
 accidents+0.122
Neg logit contributions
 Animal-0.229
Three-0.219
ceed-0.219
 Zoo-0.217
hest-0.215
Token a
Token ID257
Feature activation-2.09e-02

Pos logit contributions
ies+0.350
 coughing+0.319
atted+0.317
 throats+0.304
anged+0.303
Neg logit contributions
ceed-0.567
V-0.515
Whe-0.513
Three-0.510
 quickest-0.499
Token bird
Token ID6512
Feature activation+7.54e-02

Pos logit contributions
ceed+0.037
Whe+0.033
V+0.033
Three+0.032
hest+0.032
Neg logit contributions
ies-0.030
 coughing-0.028
atted-0.027
 Tim-0.026
anged-0.026
Token,
Token ID11
Feature activation+2.08e-01

Pos logit contributions
ies+0.080
 coughing+0.073
 throats+0.071
atted+0.070
bie+0.068
Neg logit contributions
ceed-0.153
Three-0.140
V-0.139
 quickest-0.138
hest-0.138
Token saw
Token ID2497
Feature activation-9.59e-02

Pos logit contributions
ies+0.181
 coughing+0.151
 throats+0.150
atted+0.150
anged+0.148
Neg logit contributions
ceed-0.478
Whe-0.442
V-0.439
 Believe-0.436
Three-0.436
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
ceed+0.200
Whe+0.182
V+0.181
Three+0.180
hest+0.176
Neg logit contributions
ies-0.108
 coughing-0.096
atted-0.092
anged-0.090
 throats-0.090
Token a
Token ID257
Feature activation-2.72e-02

Pos logit contributions
ies+0.263
 coughing+0.233
atted+0.229
anged+0.223
 throats+0.221
Neg logit contributions
ceed-0.410
Whe-0.374
V-0.370
Three-0.368
 quickest-0.360
Token cow
Token ID9875
Feature activation+1.11e-01

Pos logit contributions
ceed+0.047
Whe+0.042
V+0.041
Three+0.041
hest+0.041
Neg logit contributions
ies-0.040
 coughing-0.037
 Tim-0.036
 they-0.035
atted-0.035
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
ies+0.091
atted+0.081
 coughing+0.077
anged+0.075
bie+0.073
Neg logit contributions
ceed-0.251
 Animal-0.235
V-0.231
Three-0.230
 Zoo-0.230
Token a
Token ID257
Feature activation-2.38e-02

Pos logit contributions
ies+0.206
 coughing+0.176
atted+0.173
anged+0.167
 throats+0.165
Neg logit contributions
ceed-0.477
Whe-0.440
V-0.435
Three-0.434
 quickest-0.426
Token duck
Token ID22045
Feature activation+9.41e-02

Pos logit contributions
ceed+0.041
Whe+0.037
V+0.036
Three+0.036
hest+0.036
Neg logit contributions
ies-0.035
 coughing-0.032
 Tim-0.031
 they-0.031
atted-0.031
Token and
Token ID290
Feature activation+2.88e-01

Pos logit contributions
ies+0.072
atted+0.066
 coughing+0.064
anged+0.063
 throats+0.061
Neg logit contributions
ceed-0.214
 Animal-0.199
V-0.195
hest-0.194
Three-0.194
Token a
Token ID257
Feature activation-1.41e-02

Pos logit contributions
ies+0.462
 they+0.428
 coughing+0.427
 Tim+0.425
atted+0.412
Neg logit contributions
ceed-0.460
Whe-0.414
redible-0.403
lor-0.402
V-0.400
Token snake
Token ID17522
Feature activation+1.15e-01

Pos logit contributions
ceed+0.024
Whe+0.022
V+0.021
Three+0.021
hest+0.021
Neg logit contributions
ies-0.021
 coughing-0.019
 Tim-0.018
atted-0.018
 they-0.018
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
ies+0.135
atted+0.120
 coughing+0.120
anged+0.115
bie+0.113
Neg logit contributions
ceed-0.227
Whe-0.205
Three-0.204
V-0.204
 quickest-0.202
Token Lily
Token ID20037
Feature activation+6.16e-02

Pos logit contributions
ies+0.015
 Tim+0.015
 they+0.014
 coughing+0.014
atted+0.013
Neg logit contributions
ceed-0.040
Whe-0.037
 quickest-0.037
redible-0.037
hest-0.036
Token repeated
Token ID5100
Feature activation+5.13e-03

Pos logit contributions
ies+0.071
 coughing+0.064
atted+0.062
anged+0.062
 throats+0.062
Neg logit contributions
ceed-0.124
V-0.114
Three-0.113
Whe-0.113
 Believe-0.109
Token a
Token ID257
Feature activation-3.93e-02

Pos logit contributions
ceed+0.011
V+0.010
Whe+0.010
redible+0.010
 Believe+0.010
Neg logit contributions
ies-0.008
atted-0.007
anged-0.007
 throats-0.007
 coughing-0.007
Token man
Token ID582
Feature activation+1.11e-01

Pos logit contributions
ceed+0.063
V+0.056
Whe+0.055
Three+0.055
redible+0.053
Neg logit contributions
ies-0.062
atted-0.058
 coughing-0.057
anged-0.056
 throats-0.055
Token with
Token ID351
Feature activation-6.18e-03

Pos logit contributions
ies+0.160
 throats+0.151
atted+0.147
 Joey+0.142
lee+0.140
Neg logit contributions
ceed-0.175
 Believe-0.161
 Animal-0.159
V-0.159
lor-0.159
Token a
Token ID257
Feature activation-4.50e-02

Pos logit contributions
ceed+0.010
Whe+0.009
Three+0.009
V+0.009
redible+0.009
Neg logit contributions
ies-0.010
 coughing-0.009
 Tim-0.009
atted-0.009
 they-0.009
Token beard
Token ID21213
Feature activation+8.67e-02

Pos logit contributions
ceed+0.074
Whe+0.065
V+0.065
Three+0.065
Insp+0.063
Neg logit contributions
ies-0.069
atted-0.063
 coughing-0.062
anged-0.062
i-0.061
Token and
Token ID290
Feature activation+2.88e-01

Pos logit contributions
ies+0.100
anged+0.091
bie+0.090
atted+0.090
 Joey+0.089
Neg logit contributions
ceed-0.168
lor-0.152
V-0.148
 exact-0.147
 Animal-0.146
Token a
Token ID257
Feature activation-2.28e-02

Pos logit contributions
ies+0.536
 coughing+0.490
atted+0.488
anged+0.478
 throats+0.475
Neg logit contributions
ceed-0.389
Whe-0.338
V-0.334
Three-0.331
 quickest-0.315
Token robe
Token ID33192
Feature activation+3.33e-02

Pos logit contributions
ceed+0.039
Whe+0.035
V+0.035
Three+0.035
Insp+0.034
Neg logit contributions
ies-0.033
atted-0.030
 coughing-0.030
anged-0.029
 Tim-0.029
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
ies+0.041
 Joey+0.037
bie+0.036
atted+0.036
 Tim+0.036
Neg logit contributions
ceed-0.063
Whe-0.056
lor-0.056
V-0.056
Three-0.055
Token He
Token ID679
Feature activation+6.72e-02

Pos logit contributions
ies+0.034
 coughing+0.031
 Tim+0.030
atted+0.030
 they+0.029
Neg logit contributions
ceed-0.055
Whe-0.050
V-0.049
Three-0.049
hest-0.049
Token looks
Token ID3073
Feature activation+4.08e-02

Pos logit contributions
ies+0.117
 coughing+0.108
atted+0.106
 throats+0.105
anged+0.105
Neg logit contributions
ceed-0.098
Whe-0.086
V-0.085
Three-0.085
 Believe-0.081
Token a
Token ID257
Feature activation-4.01e-02

Pos logit contributions
ies+0.338
atted+0.308
 coughing+0.307
anged+0.302
 throats+0.296
Neg logit contributions
ceed-0.283
Whe-0.259
V-0.252
Three-0.250
 quickest-0.242
Token boat
Token ID8848
Feature activation+4.99e-02

Pos logit contributions
ceed+0.064
V+0.058
Whe+0.057
Three+0.057
 quickest+0.055
Neg logit contributions
ies-0.062
atted-0.058
anged-0.057
 coughing-0.057
i-0.055
Token,
Token ID11
Feature activation+1.98e-01

Pos logit contributions
ies+0.037
 coughing+0.032
atted+0.030
anged+0.029
 throats+0.029
Neg logit contributions
ceed-0.120
Whe-0.111
V-0.110
Three-0.110
hest-0.109
Token a
Token ID257
Feature activation-3.29e-02

Pos logit contributions
ies+0.326
 coughing+0.298
atted+0.294
anged+0.289
 throats+0.286
Neg logit contributions
ceed-0.308
Whe-0.277
V-0.271
Three-0.270
 quickest-0.263
Token fish
Token ID5916
Feature activation+7.03e-02

Pos logit contributions
ceed+0.055
Whe+0.049
V+0.049
Three+0.048
 quickest+0.047
Neg logit contributions
ies-0.050
 coughing-0.046
atted-0.045
anged-0.044
 throats-0.044
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
ies+0.052
 coughing+0.046
atted+0.045
anged+0.043
 throats+0.042
Neg logit contributions
ceed-0.165
V-0.152
Whe-0.152
Three-0.152
lor-0.151
Token a
Token ID257
Feature activation-2.01e-02

Pos logit contributions
ies+0.501
 coughing+0.463
atted+0.454
 Tim+0.447
anged+0.445
Neg logit contributions
ceed-0.417
Whe-0.367
V-0.361
Three-0.359
hest-0.348
Token frog
Token ID21264
Feature activation+9.12e-02

Pos logit contributions
ceed+0.033
Whe+0.030
V+0.030
Three+0.030
 quickest+0.029
Neg logit contributions
ies-0.030
atted-0.028
 coughing-0.028
anged-0.027
 Tim-0.026
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
ies+0.107
 coughing+0.098
atted+0.097
anged+0.094
 throats+0.093
Neg logit contributions
ceed-0.175
Whe-0.158
V-0.158
Three-0.157
 Animal-0.157
Token They
Token ID1119
Feature activation+5.27e-02

Pos logit contributions
 Tim+0.018
ies+0.018
 they+0.017
 Tommy+0.017
 coughing+0.016
Neg logit contributions
ceed-0.036
lor-0.032
hest-0.031
 quickest-0.031
redible-0.031
Token make
Token ID787
Feature activation-1.14e-01

Pos logit contributions
ies+0.083
 coughing+0.075
 Tim+0.074
 they+0.073
atted+0.073
Neg logit contributions
ceed-0.088
Whe-0.077
lor-0.076
V-0.076
Three-0.076
Token a
Token ID257
Feature activation-3.75e-02

Pos logit contributions
ies+0.364
atted+0.334
anged+0.330
 coughing+0.328
 throats+0.320
Neg logit contributions
ceed-0.264
Whe-0.243
V-0.234
Three-0.232
 quickest-0.226
Token boat
Token ID8848
Feature activation+5.14e-02

Pos logit contributions
ceed+0.056
V+0.052
Whe+0.051
Three+0.051
 biggest+0.050
Neg logit contributions
ies-0.060
atted-0.057
anged-0.057
 coughing-0.055
i-0.054
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
ies+0.042
 coughing+0.038
atted+0.037
anged+0.036
 throats+0.035
Neg logit contributions
ceed-0.117
 Animal-0.107
hest-0.107
Three-0.106
lor-0.106
Token a
Token ID257
Feature activation-3.06e-02

Pos logit contributions
ies+0.303
atted+0.273
anged+0.273
 coughing+0.271
 throats+0.265
Neg logit contributions
ceed-0.333
Whe-0.306
V-0.298
Three-0.298
 quickest-0.292
Token fish
Token ID5916
Feature activation+7.25e-02

Pos logit contributions
ceed+0.048
V+0.044
Whe+0.043
Three+0.043
 quickest+0.042
Neg logit contributions
ies-0.048
atted-0.045
anged-0.045
 coughing-0.044
i-0.043
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
ies+0.057
atted+0.053
 coughing+0.052
anged+0.051
 throats+0.049
Neg logit contributions
ceed-0.163
 Animal-0.152
lor-0.151
V-0.150
Three-0.150
Token a
Token ID257
Feature activation-1.86e-02

Pos logit contributions
ies+0.517
 coughing+0.478
atted+0.472
anged+0.464
 throats+0.460
Neg logit contributions
ceed-0.402
Whe-0.352
V-0.347
Three-0.344
 quickest-0.333
Token frog
Token ID21264
Feature activation+9.30e-02

Pos logit contributions
ceed+0.030
V+0.027
Whe+0.027
Three+0.027
 quickest+0.026
Neg logit contributions
ies-0.028
atted-0.027
anged-0.026
 coughing-0.026
i-0.025
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
ies+0.114
atted+0.107
 coughing+0.106
anged+0.103
 throats+0.101
Neg logit contributions
ceed-0.171
 Animal-0.154
V-0.153
lor-0.153
hest-0.152
Token She
Token ID1375
Feature activation+6.06e-02

Pos logit contributions
 Tim+0.020
 they+0.017
ies+0.017
 Tommy+0.017
 Billy+0.016
Neg logit contributions
ceed-0.042
lor-0.038
 quickest-0.038
hest-0.038
redible-0.037
Token liked
Token ID8288
Feature activation-1.43e-02

Pos logit contributions
ies+0.081
 coughing+0.077
 accidents+0.076
anged+0.076
 throats+0.075
Neg logit contributions
ceed-0.102
V-0.098
Three-0.096
Whe-0.093
 nic-0.092
Token a
Token ID257
Feature activation-5.50e-02

Pos logit contributions
ies+0.063
atted+0.059
anged+0.058
 coughing+0.056
i+0.055
Neg logit contributions
ceed-0.070
V-0.063
Whe-0.062
 Ready-0.062
Three-0.061
Token cat
Token ID3797
Feature activation+6.31e-02

Pos logit contributions
ceed+0.088
Whe+0.078
V+0.077
Three+0.076
hest+0.075
Neg logit contributions
ies-0.089
 coughing-0.082
atted-0.079
 Tim-0.078
 throats-0.078
Token and
Token ID290
Feature activation+2.74e-01

Pos logit contributions
ies+0.064
atted+0.058
anged+0.055
 coughing+0.054
bie+0.054
Neg logit contributions
ceed-0.137
V-0.125
Whe-0.125
hest-0.123
 quickest-0.123
Token a
Token ID257
Feature activation-2.91e-02

Pos logit contributions
ies+0.495
atted+0.447
 coughing+0.446
anged+0.439
bie+0.432
Neg logit contributions
ceed-0.388
Whe-0.340
V-0.338
Three-0.331
 quickest-0.321
Token hat
Token ID6877
Feature activation+4.69e-02

Pos logit contributions
ceed+0.050
Whe+0.045
V+0.044
Three+0.044
hest+0.043
Neg logit contributions
ies-0.043
 coughing-0.040
atted-0.038
 Tim-0.038
 throats-0.038
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
ies+0.047
 coughing+0.040
atted+0.039
anged+0.038
 throats+0.038
Neg logit contributions
ceed-0.103
Whe-0.095
V-0.094
Three-0.094
hest-0.092
Token a
Token ID257
Feature activation-1.94e-02

Pos logit contributions
ies+0.459
 coughing+0.434
 they+0.426
 Tim+0.424
atted+0.416
Neg logit contributions
ceed-0.449
Whe-0.398
redible-0.392
Three-0.391
V-0.390
Token fish
Token ID5916
Feature activation+7.60e-02

Pos logit contributions
ceed+0.034
Whe+0.031
hest+0.030
Three+0.030
V+0.030
Neg logit contributions
ies-0.028
 coughing-0.026
 they-0.025
 throats-0.025
 Tim-0.025
Token and
Token ID290
Feature activation+2.91e-01

Pos logit contributions
ies+0.090
atted+0.081
 coughing+0.080
anged+0.078
 throats+0.076
Neg logit contributions
ceed-0.151
V-0.136
Whe-0.136
Three-0.135
 quickest-0.133
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
ies+0.485
 coughing+0.453
 Tim+0.440
 they+0.439
atted+0.438
Neg logit contributions
ceed-0.446
Whe-0.394
V-0.388
Three-0.387
redible-0.382
Token dish
Token ID9433
Feature activation+6.02e-02

Pos logit contributions
ceed+0.025
Whe+0.022
V+0.022
Three+0.022
hest+0.022
Neg logit contributions
ies-0.021
 coughing-0.019
 throats-0.018
 Tim-0.018
 they-0.018
Token.
Token ID13
Feature activation+4.50e-03

Pos logit contributions
dd+0.063
bie+0.063
atted+0.063
i+0.060
 coughing+0.060
Neg logit contributions
ceed-0.051
 biggest-0.050
 Museum-0.048
 Animal-0.048
 quickest-0.047
Token Tim
Token ID5045
Feature activation+9.08e-02

Pos logit contributions
ies+0.008
 coughing+0.007
atted+0.007
anged+0.007
 throats+0.007
Neg logit contributions
ceed-0.007
Whe-0.006
V-0.006
Three-0.006
hest-0.006
Token saw
Token ID2497
Feature activation-1.07e-01

Pos logit contributions
ies+0.123
 throats+0.122
 coughing+0.116
atted+0.116
 accidents+0.116
Neg logit contributions
ceed-0.150
V-0.145
Three-0.141
 quickest-0.137
 nic-0.135
Token a
Token ID257
Feature activation-6.41e-02

Pos logit contributions
ceed+0.158
Whe+0.139
V+0.138
Three+0.137
hest+0.131
Neg logit contributions
ies-0.187
 coughing-0.172
atted-0.170
anged-0.167
 throats-0.166
Token squirrel
Token ID33039
Feature activation+1.08e-01

Pos logit contributions
ceed+0.093
V+0.083
Three+0.082
Whe+0.081
redible+0.079
Neg logit contributions
ies-0.109
atted-0.105
anged-0.103
 coughing-0.101
 throats-0.100
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
 throats+0.145
atted+0.137
lee+0.135
 tongues+0.132
bie+0.132
Neg logit contributions
ceed-0.183
hest-0.176
 Animal-0.168
 middle-0.165
V-0.165
Token wanted
Token ID2227
Feature activation-5.87e-02

Pos logit contributions
ies+0.339
atted+0.324
anged+0.313
 coughing+0.313
i+0.305
Neg logit contributions
ceed-0.528
V-0.494
Three-0.486
 biggest-0.476
Whe-0.470
Token to
Token ID284
Feature activation+1.15e-02

Pos logit contributions
 biggest+0.106
ceed+0.104
 Animal+0.103
Three+0.103
 Ready+0.103
Neg logit contributions
ies-0.076
atted-0.065
dd-0.062
i-0.062
anged-0.061
Token play
Token ID711
Feature activation-9.20e-02

Pos logit contributions
ies+0.017
 Tim+0.017
 coughing+0.015
 Billy+0.015
 and+0.015
Neg logit contributions
ceed-0.020
lor-0.018
Whe-0.018
redible-0.018
 quickest-0.018
Token with
Token ID351
Feature activation-6.49e-02

Pos logit contributions
ceed+0.235
lor+0.225
Whe+0.220
Three+0.217
redible+0.215
Neg logit contributions
 Tim-0.068
 and-0.064
 because-0.061
 they-0.055
ies-0.053
Token it
Token ID340
Feature activation-2.20e-02

Pos logit contributions
ceed+0.103
V+0.100
 biggest+0.098
 middle+0.097
 Animal+0.096
Neg logit contributions
ies-0.093
bie-0.088
anged-0.088
 coughing-0.086
 throats-0.083
Token.
Token ID13
Feature activation+1.87e-03

Pos logit contributions
ies+0.025
atted+0.023
 coughing+0.023
anged+0.023
 throats+0.023
Neg logit contributions
ceed-0.038
Whe-0.035
V-0.034
hest-0.034
 quickest-0.034
Token He
Token ID679
Feature activation+4.29e-02

Pos logit contributions
 Tim+0.002
 they+0.002
 Billy+0.002
 Tommy+0.002
 because+0.002
Neg logit contributions
ceed-0.004
redible-0.003
 quickest-0.003
hest-0.003
lor-0.003
Token is
Token ID318
Feature activation-1.32e-02

Pos logit contributions
ies+0.057
 coughing+0.052
atted+0.050
anged+0.049
 throats+0.049
Neg logit contributions
ceed-0.080
Whe-0.072
V-0.072
Three-0.071
 quickest-0.070
Token a
Token ID257
Feature activation-4.36e-02

Pos logit contributions
ceed+0.023
Whe+0.020
V+0.020
 quickest+0.020
 biggest+0.020
Neg logit contributions
ies-0.020
anged-0.018
atted-0.018
 coughing-0.018
 throats-0.018
Token squirrel
Token ID33039
Feature activation+1.18e-01

Pos logit contributions
ceed+0.069
Whe+0.062
V+0.061
Three+0.061
 quickest+0.059
Neg logit contributions
ies-0.071
 coughing-0.065
atted-0.064
anged-0.063
 throats-0.062
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ies+0.144
atted+0.137
anged+0.135
 coughing+0.134
 throats+0.132
Neg logit contributions
ceed-0.221
hest-0.205
V-0.201
Whe-0.200
Three-0.199
Token he
Token ID339
Feature activation+9.65e-02

Pos logit contributions
ies+0.362
 coughing+0.322
atted+0.316
anged+0.308
 throats+0.304
Neg logit contributions
ceed-0.556
Whe-0.505
V-0.500
Three-0.497
 quickest-0.486
Token lives
Token ID3160
Feature activation-7.08e-02

Pos logit contributions
ies+0.139
 coughing+0.126
atted+0.124
anged+0.121
 throats+0.120
Neg logit contributions
ceed-0.170
Whe-0.153
V-0.151
Three-0.150
 quickest-0.147
Token in
Token ID287
Feature activation-3.53e-02

Pos logit contributions
ceed+0.154
Whe+0.141
V+0.140
Three+0.139
hest+0.136
Neg logit contributions
ies-0.074
 coughing-0.064
atted-0.062
 Tim-0.060
anged-0.060
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
ceed+0.073
Whe+0.066
Three+0.065
V+0.064
redible+0.064
Neg logit contributions
 Tim-0.040
ies-0.040
 coughing-0.038
 and-0.038
 they-0.038
Token tree
Token ID5509
Feature activation+1.04e-02

Pos logit contributions
ceed+0.211
hest+0.194
lor+0.184
ieve+0.183
 Believe+0.183
Neg logit contributions
 Tim-0.171
 and-0.164
 coughing-0.160
ies-0.158
 peanuts-0.155
Token a
Token ID257
Feature activation-3.59e-02

Pos logit contributions
ies+0.338
atted+0.310
 coughing+0.308
anged+0.304
 throats+0.298
Neg logit contributions
ceed-0.303
Whe-0.276
V-0.269
Three-0.268
hest-0.260
Token boat
Token ID8848
Feature activation+5.24e-02

Pos logit contributions
ceed+0.062
Whe+0.056
V+0.055
Three+0.055
 quickest+0.054
Neg logit contributions
ies-0.052
 coughing-0.048
atted-0.047
anged-0.046
 Tim-0.046
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
ies+0.039
 coughing+0.034
atted+0.033
anged+0.031
 throats+0.031
Neg logit contributions
ceed-0.125
hest-0.114
Whe-0.114
Three-0.114
V-0.114
Token a
Token ID257
Feature activation-3.04e-02

Pos logit contributions
ies+0.276
 coughing+0.248
atted+0.245
anged+0.240
 throats+0.236
Neg logit contributions
ceed-0.371
Whe-0.340
V-0.334
Three-0.333
 quickest-0.326
Token frog
Token ID21264
Feature activation+8.78e-02

Pos logit contributions
ceed+0.053
Whe+0.047
V+0.046
Three+0.046
hest+0.045
Neg logit contributions
ies-0.045
 coughing-0.041
atted-0.040
 Tim-0.039
anged-0.039
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ies+0.070
atted+0.062
 coughing+0.062
anged+0.060
 throats+0.058
Neg logit contributions
ceed-0.199
 Animal-0.187
lor-0.185
hest-0.184
V-0.183
Token a
Token ID257
Feature activation-1.66e-02

Pos logit contributions
ies+0.510
 coughing+0.472
atted+0.464
anged+0.455
 throats+0.453
Neg logit contributions
ceed-0.406
Whe-0.357
V-0.351
Three-0.349
 quickest-0.337
Token cup
Token ID6508
Feature activation+1.84e-02

Pos logit contributions
ceed+0.028
Whe+0.026
V+0.025
Three+0.025
hest+0.024
Neg logit contributions
ies-0.025
 coughing-0.023
atted-0.022
 Tim-0.022
anged-0.022
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
ies+0.021
 coughing+0.019
atted+0.019
anged+0.018
bie+0.018
Neg logit contributions
ceed-0.037
Whe-0.033
hest-0.033
Three-0.033
V-0.033
Token They
Token ID1119
Feature activation+4.95e-02

Pos logit contributions
ies+0.015
 Tim+0.014
 they+0.014
 coughing+0.014
atted+0.013
Neg logit contributions
ceed-0.030
lor-0.026
hest-0.026
Whe-0.026
redible-0.026
Token made
Token ID925
Feature activation-1.04e-01

Pos logit contributions
ies+0.064
 coughing+0.059
atted+0.057
anged+0.056
 throats+0.055
Neg logit contributions
ceed-0.093
Whe-0.085
V-0.085
Three-0.084
 quickest-0.082
Token a
Token ID257
Feature activation-6.24e-02

Pos logit contributions
ceed+0.166
Whe+0.148
V+0.145
Three+0.145
 quickest+0.141
Neg logit contributions
ies-0.176
 coughing-0.161
atted-0.158
 Tim-0.156
 throats-0.154
Token prince
Token ID19716
Feature activation+8.72e-02

Pos logit contributions
ceed+0.106
Whe+0.095
V+0.092
Three+0.092
hest+0.091
Neg logit contributions
ies-0.095
 coughing-0.087
 Tim-0.083
atted-0.083
 throats-0.082
Token with
Token ID351
Feature activation-7.03e-03

Pos logit contributions
ies+0.089
 coughing+0.080
atted+0.080
anged+0.078
 throats+0.077
Neg logit contributions
ceed-0.187
Whe-0.170
V-0.169
Three-0.167
hest-0.166
Token a
Token ID257
Feature activation-4.65e-02

Pos logit contributions
ceed+0.010
V+0.008
Three+0.008
Whe+0.008
 biggest+0.008
Neg logit contributions
ies-0.013
atted-0.012
 coughing-0.012
anged-0.012
bie-0.011
Token crown
Token ID12389
Feature activation+6.73e-02

Pos logit contributions
ceed+0.076
V+0.068
Whe+0.068
Three+0.067
 quickest+0.065
Neg logit contributions
ies-0.071
atted-0.066
anged-0.065
 coughing-0.065
 Tim-0.063
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ies+0.054
 coughing+0.049
atted+0.049
 Joey+0.047
anged+0.047
Neg logit contributions
ceed-0.155
lor-0.142
hest-0.141
Three-0.140
Whe-0.140
Token a
Token ID257
Feature activation-2.48e-02

Pos logit contributions
ies+0.507
 coughing+0.469
atted+0.462
anged+0.453
 throats+0.450
Neg logit contributions
ceed-0.407
Whe-0.356
V-0.351
Three-0.349
 quickest-0.337
Token sword
Token ID8429
Feature activation+7.16e-02

Pos logit contributions
ceed+0.041
Whe+0.037
V+0.036
Three+0.036
 quickest+0.035
Neg logit contributions
ies-0.038
atted-0.035
 coughing-0.035
anged-0.034
 Tim-0.033
Token.
Token ID13
Feature activation+2.27e-02

Pos logit contributions
ies+0.077
 coughing+0.068
atted+0.066
anged+0.064
 Tim+0.064
Neg logit contributions
ceed-0.152
Whe-0.138
V-0.137
Three-0.137
hest-0.134
Token She
Token ID1375
Feature activation+5.74e-02

Pos logit contributions
 Tim+0.028
 they+0.025
 Tommy+0.025
 Billy+0.024
 Johnny+0.024
Neg logit contributions
ceed-0.047
 quickest-0.043
lor-0.042
redible-0.042
 exact-0.042
Token used
Token ID973
Feature activation-3.04e-02

Pos logit contributions
ies+0.072
 coughing+0.065
atted+0.063
anged+0.061
 throats+0.061
Neg logit contributions
ceed-0.112
Whe-0.102
V-0.101
Three-0.100
hest-0.098
Token a
Token ID257
Feature activation-4.87e-02

Pos logit contributions
ceed+0.059
Whe+0.052
hest+0.051
Three+0.051
V+0.051
Neg logit contributions
ies-0.057
 coughing-0.055
 Tim-0.053
atted-0.052
 they-0.052
Token hat
Token ID6877
Feature activation+3.20e-02

Pos logit contributions
ceed+0.073
V+0.066
Three+0.066
 biggest+0.065
 date+0.064
Neg logit contributions
ies-0.079
atted-0.074
anged-0.074
i-0.072
 Tim-0.071
Token and
Token ID290
Feature activation+2.72e-01

Pos logit contributions
ies+0.028
bie+0.027
atted+0.026
anged+0.026
 Joey+0.025
Neg logit contributions
ceed-0.069
lor-0.065
 date-0.063
hest-0.063
 middle-0.062
Token a
Token ID257
Feature activation-2.78e-02

Pos logit contributions
ies+0.546
atted+0.504
anged+0.496
 coughing+0.495
i+0.484
Neg logit contributions
ceed-0.327
Whe-0.282
Three-0.280
V-0.279
 biggest-0.265
Token beard
Token ID21213
Feature activation+9.23e-02

Pos logit contributions
ceed+0.045
V+0.040
Three+0.040
Whe+0.040
 biggest+0.039
Neg logit contributions
ies-0.042
atted-0.039
anged-0.039
i-0.038
 coughing-0.038
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ies+0.102
bie+0.094
 Joey+0.092
atted+0.089
anged+0.088
Neg logit contributions
ceed-0.180
lor-0.165
V-0.161
 exact-0.160
Three-0.160
Token a
Token ID257
Feature activation-1.66e-02

Pos logit contributions
ies+0.551
atted+0.504
 coughing+0.500
anged+0.493
i+0.487
Neg logit contributions
ceed-0.367
Whe-0.316
V-0.312
Three-0.312
 quickest-0.294
Token big
Token ID1263
Feature activation+3.19e-02

Pos logit contributions
ceed+0.028
Whe+0.025
V+0.025
Three+0.025
 quickest+0.024
Neg logit contributions
ies-0.024
atted-0.022
anged-0.022
 coughing-0.022
i-0.021
Token smile
Token ID8212
Feature activation+9.01e-02

Pos logit contributions
ies+0.047
anged+0.045
atted+0.044
 coughing+0.043
i+0.043
Neg logit contributions
ceed-0.049
 date-0.045
Whe-0.045
 biggest-0.045
 exact-0.044
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
ies+0.112
 coughing+0.109
atted+0.108
 Tim+0.106
 Joey+0.106
Neg logit contributions
ceed-0.160
lor-0.140
Whe-0.138
V-0.138
 date-0.137
Token He
Token ID679
Feature activation+6.24e-02

Pos logit contributions
ies+0.026
 Tim+0.024
 coughing+0.024
 they+0.023
atted+0.023
Neg logit contributions
ceed-0.039
Whe-0.035
 quickest-0.034
Three-0.034
V-0.034
Token.
Token ID13
Feature activation+2.20e-03

Pos logit contributions
ies+0.063
atted+0.058
 coughing+0.058
anged+0.056
 throats+0.055
Neg logit contributions
ceed-0.138
Whe-0.126
hest-0.125
Three-0.124
V-0.124
Token Tom
Token ID4186
Feature activation+6.93e-02

Pos logit contributions
ies+0.003
 Tim+0.002
 coughing+0.002
atted+0.002
 they+0.002
Neg logit contributions
ceed-0.004
Whe-0.004
V-0.004
hest-0.004
Three-0.004
Token is
Token ID318
Feature activation+3.92e-03

Pos logit contributions
ies+0.126
 throats+0.125
 badly+0.125
 coughing+0.123
 accidents+0.121
Neg logit contributions
ceed-0.081
V-0.077
 quickest-0.075
 Believe-0.075
 Animal-0.075
Token a
Token ID257
Feature activation-3.57e-02

Pos logit contributions
ies+0.005
 coughing+0.005
atted+0.005
 Tim+0.005
 they+0.005
Neg logit contributions
ceed-0.007
Whe-0.006
Three-0.006
V-0.006
hest-0.006
Token clown
Token ID25573
Feature activation+1.25e-01

Pos logit contributions
ceed+0.057
Whe+0.052
V+0.051
Three+0.051
 quickest+0.050
Neg logit contributions
ies-0.057
atted-0.052
 coughing-0.052
anged-0.051
 throats-0.050
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ies+0.130
atted+0.118
 coughing+0.118
 throats+0.117
anged+0.112
Neg logit contributions
ceed-0.258
Whe-0.239
Three-0.237
V-0.236
hest-0.235
Token Jen
Token ID13364
Feature activation+1.05e-01

Pos logit contributions
ies+0.290
 coughing+0.262
atted+0.249
anged+0.245
 throats+0.241
Neg logit contributions
ceed-0.616
Whe-0.582
V-0.572
Three-0.566
Ready-0.549
Token is
Token ID318
Feature activation+2.08e-02

Pos logit contributions
ies+0.139
 coughing+0.125
atted+0.122
anged+0.119
 Tim+0.119
Neg logit contributions
ceed-0.198
Whe-0.180
V-0.178
Three-0.177
hest-0.173
Token a
Token ID257
Feature activation-2.64e-02

Pos logit contributions
ies+0.027
 coughing+0.025
atted+0.024
 Tim+0.024
 they+0.023
Neg logit contributions
ceed-0.040
Whe-0.036
Three-0.036
V-0.035
hest-0.035
Token dancer
Token ID38619
Feature activation+8.26e-02

Pos logit contributions
ceed+0.049
Whe+0.045
V+0.044
Three+0.044
 quickest+0.043
Neg logit contributions
ies-0.035
 coughing-0.032
atted-0.031
anged-0.031
 throats-0.030
Token.
Token ID13
Feature activation+2.33e-02

Pos logit contributions
ies+0.086
 coughing+0.074
atted+0.070
 they+0.070
 Tim+0.069
Neg logit contributions
ceed-0.181
Whe-0.166
V-0.165
Three-0.165
redible-0.161
Token furry
Token ID40857
Feature activation+4.61e-02

Pos logit contributions
anged+0.012
dd+0.012
 throats+0.011
 coughing+0.011
 Joey+0.011
Neg logit contributions
 biggest-0.009
 quickest-0.008
 fastest-0.008
ceed-0.008
 nic-0.008
Token animals
Token ID4695
Feature activation+6.65e-02

Pos logit contributions
anged+0.081
 coughing+0.080
 badly+0.079
atted+0.079
 throats+0.078
Neg logit contributions
 Animal-0.051
 biggest-0.049
 Zoo-0.048
ceed-0.047
Three-0.046
Token -
Token ID532
Feature activation+1.55e-01

Pos logit contributions
 throats+0.105
 accidents+0.103
 tongues+0.102
ies+0.101
 burnt+0.098
Neg logit contributions
 Animal-0.080
ceed-0.080
 date-0.073
 Believe-0.072
 Zoo-0.072
Token a
Token ID257
Feature activation-4.17e-02

Pos logit contributions
ies+0.230
 coughing+0.210
atted+0.206
anged+0.201
 throats+0.200
Neg logit contributions
ceed-0.267
Whe-0.241
V-0.238
Three-0.236
 quickest-0.230
Token squirrel
Token ID33039
Feature activation+1.26e-01

Pos logit contributions
ceed+0.070
Whe+0.063
V+0.062
Three+0.062
 quickest+0.060
Neg logit contributions
ies-0.063
 coughing-0.058
atted-0.057
anged-0.055
 throats-0.055
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
 throats+0.146
 accidents+0.136
anged+0.135
atted+0.130
 tongues+0.129
Neg logit contributions
ceed-0.221
hest-0.219
 Animal-0.216
lor-0.212
V-0.203
Token a
Token ID257
Feature activation-2.59e-02

Pos logit contributions
ies+0.512
atted+0.480
 coughing+0.477
anged+0.471
 throats+0.460
Neg logit contributions
ceed-0.392
Whe-0.347
V-0.346
Three-0.340
 quickest-0.326
Token mouse
Token ID10211
Feature activation+8.46e-02

Pos logit contributions
ceed+0.048
Whe+0.043
V+0.042
Three+0.042
hest+0.041
Neg logit contributions
ies-0.036
 coughing-0.032
atted-0.031
 Tim-0.031
 throats-0.031
Token.
Token ID13
Feature activation+6.01e-03

Pos logit contributions
ies+0.104
atted+0.097
anged+0.097
 coughing+0.096
 throats+0.096
Neg logit contributions
ceed-0.151
 Animal-0.138
lor-0.136
hest-0.135
Three-0.134
Token The
Token ID383
Feature activation-1.15e-01

Pos logit contributions
ies+0.008
 coughing+0.007
 Tim+0.007
atted+0.007
 they+0.007
Neg logit contributions
ceed-0.011
Whe-0.010
V-0.010
Three-0.010
hest-0.010
Token mouse
Token ID10211
Feature activation+5.00e-02

Pos logit contributions
ceed+0.229
Whe+0.218
lor+0.210
hest+0.208
Three+0.205
Neg logit contributions
ies-0.149
 Tim-0.138
 coughing-0.134
 Billy-0.129
 and-0.127
Token.
Token ID13
Feature activation+7.84e-03

Pos logit contributions
ies+0.012
 coughing+0.010
 they+0.010
 Tim+0.010
atted+0.010
Neg logit contributions
ceed-0.029
Whe-0.027
V-0.027
Three-0.026
redible-0.026
Token Spot
Token ID15899
Feature activation+8.56e-02

Pos logit contributions
ies+0.012
i+0.010
atted+0.010
 coughing+0.010
bie+0.010
Neg logit contributions
ceed-0.014
Whe-0.013
V-0.013
Three-0.013
 Ready-0.012
Token saw
Token ID2497
Feature activation-1.08e-01

Pos logit contributions
ies+0.099
 coughing+0.088
atted+0.086
anged+0.083
 throats+0.083
Neg logit contributions
ceed-0.175
Whe-0.160
V-0.159
Three-0.158
 quickest-0.154
Token a
Token ID257
Feature activation-6.29e-02

Pos logit contributions
ceed+0.163
Whe+0.144
V+0.141
Three+0.141
hest+0.137
Neg logit contributions
ies-0.183
 coughing-0.170
atted-0.166
 Tim-0.165
anged-0.162
Token squirrel
Token ID33039
Feature activation+1.07e-01

Pos logit contributions
ceed+0.100
Whe+0.089
V+0.088
Three+0.087
 quickest+0.085
Neg logit contributions
ies-0.102
 coughing-0.093
atted-0.092
anged-0.090
 throats-0.090
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
 throats+0.100
ies+0.095
atted+0.089
anged+0.086
 tongues+0.086
Neg logit contributions
ceed-0.226
hest-0.218
 Animal-0.213
V-0.213
 quickest-0.210
Token wanted
Token ID2227
Feature activation-5.95e-02

Pos logit contributions
ies+0.313
atted+0.294
anged+0.288
 coughing+0.288
 throats+0.283
Neg logit contributions
ceed-0.554
V-0.529
Three-0.519
Whe-0.517
Ready-0.499
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
ceed+0.127
 Animal+0.123
V+0.122
 biggest+0.122
 Believe+0.121
Neg logit contributions
ies-0.060
atted-0.050
anged-0.048
 throats-0.046
 coughing-0.044
Token chase
Token ID15505
Feature activation-5.17e-02

Pos logit contributions
 Tim+0.012
ies+0.011
 and+0.011
 because+0.010
 Billy+0.010
Neg logit contributions
ceed-0.023
lor-0.021
redible-0.020
 Museum-0.020
 quickest-0.020
Token it
Token ID340
Feature activation-2.41e-02

Pos logit contributions
ceed+0.101
V+0.099
Whe+0.097
 Animal+0.094
 Believe+0.093
Neg logit contributions
ies-0.061
bie-0.054
atted-0.053
 coughing-0.053
 throats-0.051
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
ceed+0.052
Whe+0.047
V+0.047
Three+0.047
 quickest+0.046
Neg logit contributions
ies-0.026
 coughing-0.023
atted-0.022
anged-0.021
 Tim-0.021
Tokeneddy
Token ID21874
Feature activation+1.57e-01

Pos logit contributions
 coughing+0.092
 Joey+0.089
 eating+0.084
 accidentally+0.081
 they+0.081
Neg logit contributions
ceed-0.160
Three-0.153
V-0.152
hest-0.151
lor-0.150
Token bear
Token ID6842
Feature activation+1.17e-01

Pos logit contributions
ies+0.250
atted+0.236
 coughing+0.236
anged+0.235
 throats+0.226
Neg logit contributions
ceed-0.237
V-0.213
Whe-0.211
Three-0.205
 quickest-0.201
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
ies+0.114
atted+0.106
 coughing+0.099
anged+0.099
 throats+0.097
Neg logit contributions
ceed-0.248
 Animal-0.229
lor-0.227
V-0.227
 Museum-0.224
Token a
Token ID257
Feature activation-2.60e-02

Pos logit contributions
ies+0.341
anged+0.313
atted+0.312
 coughing+0.302
 throats+0.298
Neg logit contributions
ceed-0.320
Whe-0.293
V-0.287
Three-0.283
 quickest-0.279
Token ball
Token ID2613
Feature activation+4.05e-02

Pos logit contributions
ceed+0.042
Whe+0.038
V+0.038
Three+0.037
 quickest+0.036
Neg logit contributions
ies-0.041
 coughing-0.037
atted-0.037
anged-0.036
 throats-0.036
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ies+0.035
 coughing+0.031
atted+0.031
bie+0.030
anged+0.030
Neg logit contributions
ceed-0.092
hest-0.085
V-0.084
Three-0.084
 quickest-0.083
Token a
Token ID257
Feature activation-1.62e-02

Pos logit contributions
ies+0.474
 coughing+0.441
 Tim+0.432
 they+0.431
atted+0.428
Neg logit contributions
ceed-0.435
Whe-0.384
V-0.377
Three-0.375
redible-0.372
Token book
Token ID1492
Feature activation+2.65e-02

Pos logit contributions
ceed+0.027
Whe+0.025
V+0.024
Three+0.024
hest+0.024
Neg logit contributions
ies-0.025
 coughing-0.022
atted-0.022
 Tim-0.021
anged-0.021
Token.
Token ID13
Feature activation+2.59e-02

Pos logit contributions
ies+0.027
 coughing+0.024
atted+0.023
 Tim+0.023
 they+0.022
Neg logit contributions
ceed-0.058
Whe-0.053
V-0.053
Three-0.052
redible-0.051
Token She
Token ID1375
Feature activation+6.28e-02

Pos logit contributions
ies+0.031
 Tim+0.029
 coughing+0.028
 they+0.028
atted+0.027
Neg logit contributions
ceed-0.052
Whe-0.047
hest-0.046
 quickest-0.046
Three-0.046
Token thinks
Token ID6834
Feature activation+1.84e-02

Pos logit contributions
ies+0.102
 coughing+0.093
anged+0.091
 throats+0.091
atted+0.090
Neg logit contributions
ceed-0.098
V-0.089
Three-0.087
Whe-0.087
 Believe-0.085
Token sign
Token ID1051
Feature activation+1.19e-02

Pos logit contributions
ies+0.029
 and+0.026
 coughing+0.026
 they+0.026
 Tim+0.025
Neg logit contributions
ceed-0.027
Whe-0.025
V-0.023
lor-0.023
 quickest-0.022
Token '
Token ID705
Feature activation-8.75e-02

Pos logit contributions
ies+0.014
 coughing+0.013
atted+0.013
anged+0.013
bie+0.013
Neg logit contributions
ceed-0.023
Whe-0.020
 quickest-0.020
hest-0.020
Three-0.020
Tokenhappy
Token ID34191
Feature activation-1.79e-02

Pos logit contributions
ceed+0.136
lor+0.119
 biggest+0.117
 quickest+0.116
 Believe+0.116
Neg logit contributions
ies-0.141
anged-0.128
 Tim-0.128
 they-0.128
 coughing-0.127
Token',
Token ID3256
Feature activation-1.66e-02

Pos logit contributions
ceed+0.026
Whe+0.024
V+0.023
Three+0.023
 quickest+0.022
Neg logit contributions
ies-0.032
 coughing-0.029
 they-0.028
atted-0.028
 throats-0.028
Token '
Token ID705
Feature activation-7.87e-02

Pos logit contributions
ceed+0.039
Whe+0.036
 biggest+0.036
Three+0.036
V+0.036
Neg logit contributions
ies-0.012
anged-0.011
 coughing-0.011
atted-0.010
bie-0.010
Tokenbirth
Token ID24280
Feature activation-2.96e-01

Pos logit contributions
ceed+0.120
hest+0.103
 quickest+0.103
lor+0.102
 biggest+0.100
Neg logit contributions
ies-0.128
 they-0.120
anged-0.119
bie-0.118
 Tim-0.118
Tokenday
Token ID820
Feature activation-5.13e-02

Pos logit contributions
 fastest+0.433
 quickest+0.432
Whe+0.430
ceed+0.421
hest+0.413
Neg logit contributions
 and-0.466
ies-0.460
--0.450
bie-0.429
 coughing-0.423
Token',
Token ID3256
Feature activation-1.70e-02

Pos logit contributions
ceed+0.093
Whe+0.088
V+0.086
Three+0.085
 quickest+0.083
Neg logit contributions
ies-0.072
 they-0.066
 coughing-0.064
 Tim-0.063
 and-0.062
Token and
Token ID290
Feature activation+2.14e-01

Pos logit contributions
ceed+0.040
Whe+0.037
V+0.036
Three+0.036
hest+0.036
Neg logit contributions
ies-0.014
 coughing-0.012
anged-0.012
atted-0.012
 throats-0.011
Token '
Token ID705
Feature activation-7.12e-02

Pos logit contributions
ies+0.254
 coughing+0.223
anged+0.215
atted+0.211
bie+0.207
Neg logit contributions
ceed-0.431
Whe-0.387
V-0.382
 exact-0.379
Three-0.379
Tokenyou
Token ID5832
Feature activation-1.44e-02

Pos logit contributions
ceed+0.108
hest+0.093
 quickest+0.093
lor+0.093
Whe+0.091
Neg logit contributions
ies-0.117
 they-0.108
anged-0.107
 coughing-0.106
bie-0.106
Token should
Token ID815
Feature activation+6.56e-03

Pos logit contributions
ceed+0.050
Whe+0.042
V+0.041
Three+0.041
hest+0.040
Neg logit contributions
ies-0.078
 coughing-0.073
atted-0.072
anged-0.071
 throats-0.071
Token be
Token ID307
Feature activation-6.55e-02

Pos logit contributions
ies+0.011
atted+0.010
anged+0.010
 coughing+0.010
bie+0.009
Neg logit contributions
ceed-0.010
V-0.009
Whe-0.009
Three-0.009
 Believe-0.009
Token compassionate
Token ID32533
Feature activation+2.90e-02

Pos logit contributions
ceed+0.126
lor+0.123
Whe+0.121
Three+0.119
hest+0.118
Neg logit contributions
ies-0.083
 mad-0.078
 coughing-0.078
 Tim-0.076
atted-0.075
Token.
Token ID13
Feature activation-5.15e-03

Pos logit contributions
ies+0.036
atted+0.033
 coughing+0.033
anged+0.032
bie+0.032
Neg logit contributions
ceed-0.055
V-0.049
 Believe-0.048
Whe-0.048
 quickest-0.048
Token Comp
Token ID3082
Feature activation+2.35e-02

Pos logit contributions
ceed+0.007
Whe+0.006
Three+0.006
V+0.006
hest+0.006
Neg logit contributions
ies-0.010
 coughing-0.009
atted-0.009
 Tim-0.009
anged-0.009
Tokenassion
Token ID11857
Feature activation-2.84e-01

Pos logit contributions
ies+0.048
atted+0.042
i+0.042
anged+0.042
 Tim+0.041
Neg logit contributions
Whe-0.029
ceed-0.029
Three-0.027
 quickest-0.025
 biggest-0.025
Tokenate
Token ID378
Feature activation-4.84e-02

Pos logit contributions
Whe+0.457
ceed+0.427
 exact+0.412
Insp+0.404
 quickest+0.404
Neg logit contributions
ies-0.515
 and-0.510
--0.452
s-0.411
 than-0.405
Token means
Token ID1724
Feature activation-8.71e-02

Pos logit contributions
ceed+0.061
hest+0.050
Whe+0.050
 quickest+0.050
V+0.049
Neg logit contributions
ies-0.089
 coughing-0.088
anged-0.086
atted-0.084
 throats-0.084
Token you
Token ID345
Feature activation-5.93e-04

Pos logit contributions
ceed+0.151
 Believe+0.131
 quickest+0.130
V+0.130
Whe+0.130
Neg logit contributions
ies-0.122
anged-0.112
 coughing-0.112
atted-0.112
bie-0.109
Token care
Token ID1337
Feature activation+5.92e-02

Pos logit contributions
ceed+0.001
Whe+0.001
V+0.001
Three+0.001
redible+0.001
Neg logit contributions
ies-0.001
 coughing-0.001
 they-0.001
 Tim-0.001
atted-0.001
Token about
Token ID546
Feature activation+2.83e-02

Pos logit contributions
ies+0.048
 and+0.044
 because+0.044
 Tim+0.038
 they+0.037
Neg logit contributions
Whe-0.146
Three-0.145
ceed-0.143
lor-0.141
V-0.138
Token.
Token ID13
Feature activation+1.27e-03

Pos logit contributions
ies+0.105
 throats+0.099
 coughing+0.094
atted+0.092
anged+0.092
Neg logit contributions
ceed-0.161
 Believe-0.146
V-0.144
 exact-0.142
Three-0.142
Token\n
Token ID198
Feature activation-1.30e-01

Pos logit contributions
ies+0.002
 coughing+0.002
atted+0.002
anged+0.002
 throats+0.002
Neg logit contributions
ceed-0.002
Whe-0.002
V-0.002
Three-0.002
 quickest-0.002
Token\n
Token ID198
Feature activation-1.32e-01

Pos logit contributions
ceed+0.223
 quickest+0.205
 exact+0.201
Whe+0.201
 nic+0.195
Neg logit contributions
 Tim-0.189
 they-0.181
ies-0.176
 and-0.174
 Joey-0.173
Token"
Token ID1
Feature activation-1.02e-01

Pos logit contributions
ceed+0.227
Whe+0.203
 quickest+0.199
V+0.198
hest+0.195
Neg logit contributions
ies-0.190
 Tim-0.177
 coughing-0.177
 they-0.174
atted-0.170
TokenMar
Token ID7676
Feature activation+4.66e-02

Pos logit contributions
ceed+0.140
 quickest+0.115
hest+0.114
Insp+0.112
ieve+0.112
Neg logit contributions
 they-0.180
ies-0.174
 Tim-0.173
 coughing-0.171
 he-0.164
Tokenriage
Token ID4087
Feature activation-2.70e-01

Pos logit contributions
ies+0.063
 coughing+0.057
atted+0.056
anged+0.054
 Tim+0.054
Neg logit contributions
ceed-0.087
Whe-0.079
V-0.078
Three-0.078
hest-0.076
Token is
Token ID318
Feature activation-6.04e-02

Pos logit contributions
ceed+0.488
V+0.447
Whe+0.441
 quickest+0.435
Insp+0.433
Neg logit contributions
 and-0.379
ies-0.377
 they-0.344
 Tim-0.343
 because-0.323
Token when
Token ID618
Feature activation-9.29e-03

Pos logit contributions
ceed+0.132
Whe+0.121
V+0.119
Three+0.119
 quickest+0.117
Neg logit contributions
ies-0.062
 coughing-0.054
atted-0.053
anged-0.051
 throats-0.051
Token two
Token ID734
Feature activation-2.94e-02

Pos logit contributions
ceed+0.014
Whe+0.013
 quickest+0.013
 biggest+0.013
Three+0.013
Neg logit contributions
ies-0.014
atted-0.014
anged-0.014
 coughing-0.013
 throats-0.013
Token people
Token ID661
Feature activation+2.14e-02

Pos logit contributions
ceed+0.046
 quickest+0.041
hest+0.040
Whe+0.040
 biggest+0.040
Neg logit contributions
ies-0.043
 coughing-0.042
anged-0.042
atted-0.041
bie-0.041
Token love
Token ID1842
Feature activation+4.49e-02

Pos logit contributions
ies+0.033
 coughing+0.032
atted+0.031
 throats+0.031
anged+0.031
Neg logit contributions
ceed-0.033
 quickest-0.029
Whe-0.029
hest-0.028
 Believe-0.028
Token by
Token ID416
Feature activation-3.12e-02

Pos logit contributions
ceed+0.110
V+0.099
 Believe+0.099
Three+0.098
Whe+0.098
Neg logit contributions
ies-0.035
 coughing-0.031
atted-0.030
anged-0.029
 throats-0.028
Token saying
Token ID2282
Feature activation-3.50e-02

Pos logit contributions
ceed+0.065
Whe+0.061
Three+0.060
V+0.059
hest+0.059
Neg logit contributions
ies-0.035
 coughing-0.032
 Tim-0.030
atted-0.030
 they-0.030
Token,
Token ID11
Feature activation+1.37e-01

Pos logit contributions
ceed+0.064
Whe+0.060
V+0.059
Three+0.059
 quickest+0.058
Neg logit contributions
ies-0.047
 they-0.045
 coughing-0.043
 Tim-0.042
atted-0.042
Token "
Token ID366
Feature activation-1.18e-01

Pos logit contributions
ies+0.223
 coughing+0.205
atted+0.201
 Tim+0.197
anged+0.197
Neg logit contributions
ceed-0.216
Whe-0.192
V-0.189
Three-0.188
hest-0.183
TokenMar
Token ID7676
Feature activation+4.98e-02

Pos logit contributions
ceed+0.165
redible+0.137
 quickest+0.136
 Museum+0.135
ieve+0.134
Neg logit contributions
ies-0.201
 they-0.191
bie-0.184
 and-0.184
i-0.184
Tokenriage
Token ID4087
Feature activation-2.67e-01

Pos logit contributions
ies+0.106
i+0.100
bie+0.096
atted+0.096
anged+0.096
Neg logit contributions
ceed-0.059
hest-0.051
Whe-0.049
Three-0.048
V-0.047
Token is
Token ID318
Feature activation-5.96e-02

Pos logit contributions
ceed+0.499
Whe+0.478
ieve+0.468
 quickest+0.463
 exact+0.457
Neg logit contributions
 and-0.385
ies-0.336
 Tim-0.308
 they-0.307
 because-0.305
Token when
Token ID618
Feature activation-8.84e-03

Pos logit contributions
ceed+0.145
Three+0.138
V+0.137
Whe+0.137
hest+0.134
Neg logit contributions
ies-0.046
 they-0.041
 because-0.038
 Tim-0.038
 and-0.035
Token two
Token ID734
Feature activation-3.02e-02

Pos logit contributions
ceed+0.016
Whe+0.014
Three+0.014
V+0.014
 quickest+0.014
Neg logit contributions
ies-0.012
atted-0.011
 coughing-0.011
anged-0.011
 throats-0.010
Token people
Token ID661
Feature activation+1.89e-02

Pos logit contributions
ceed+0.048
Whe+0.042
 quickest+0.042
Three+0.042
hest+0.042
Neg logit contributions
ies-0.045
 coughing-0.043
anged-0.043
atted-0.042
 Tim-0.042
Token who
Token ID508
Feature activation+2.55e-02

Pos logit contributions
ies+0.030
 coughing+0.028
atted+0.027
 throats+0.027
anged+0.027
Neg logit contributions
ceed-0.030
Whe-0.026
Three-0.026
V-0.026
 quickest-0.026
Token cat
Token ID3797
Feature activation+5.41e-02

Pos logit contributions
ceed+0.251
lor+0.238
hest+0.235
Whe+0.230
Three+0.226
Neg logit contributions
ies-0.092
 coughing-0.089
 Tim-0.082
 and-0.081
 mad-0.079
Token said
Token ID531
Feature activation-2.21e-02

Pos logit contributions
ies+0.070
anged+0.065
 coughing+0.065
 throats+0.065
atted+0.064
Neg logit contributions
ceed-0.098
V-0.087
 Animal-0.086
 Believe-0.085
Three-0.085
Token,
Token ID11
Feature activation+1.60e-01

Pos logit contributions
ceed+0.046
 Believe+0.040
V+0.040
 biggest+0.040
 Anything+0.040
Neg logit contributions
ies-0.023
bie-0.021
anged-0.021
 throats-0.020
 coughing-0.020
Token "
Token ID366
Feature activation-1.08e-01

Pos logit contributions
ies+0.276
 coughing+0.250
atted+0.248
anged+0.246
 throats+0.242
Neg logit contributions
ceed-0.239
Whe-0.210
V-0.209
Three-0.207
Insp-0.199
TokenMar
Token ID7676
Feature activation+6.46e-02

Pos logit contributions
ceed+0.157
redible+0.129
 quickest+0.129
 Museum+0.128
 Ready+0.128
Neg logit contributions
ies-0.184
 they-0.180
 coughing-0.172
ho-0.172
 because-0.165
Tokenriage
Token ID4087
Feature activation-2.64e-01

Pos logit contributions
ies+0.093
i+0.087
atted+0.085
anged+0.085
ho+0.081
Neg logit contributions
ceed-0.119
hest-0.110
Whe-0.105
Three-0.104
V-0.101
Token is
Token ID318
Feature activation-5.66e-02

Pos logit contributions
ceed+0.480
Whe+0.454
 exact+0.444
redible+0.444
Three+0.437
Neg logit contributions
 and-0.378
ies-0.343
 they-0.321
 Tim-0.321
 because-0.319
Token when
Token ID618
Feature activation-6.34e-03

Pos logit contributions
ceed+0.128
Whe+0.119
V+0.118
Three+0.118
hest+0.115
Neg logit contributions
ies-0.054
 coughing-0.046
 Tim-0.044
 they-0.044
atted-0.044
Token two
Token ID734
Feature activation-2.36e-02

Pos logit contributions
ceed+0.011
Whe+0.009
 quickest+0.009
Three+0.009
V+0.009
Neg logit contributions
ies-0.009
atted-0.009
anged-0.009
 coughing-0.008
 throats-0.008
Token animals
Token ID4695
Feature activation+2.91e-02

Pos logit contributions
ceed+0.036
Whe+0.032
 quickest+0.031
Three+0.031
V+0.031
Neg logit contributions
ies-0.038
 coughing-0.036
atted-0.035
anged-0.035
bie-0.034
Token love
Token ID1842
Feature activation+5.28e-02

Pos logit contributions
ies+0.046
 coughing+0.043
atted+0.042
anged+0.041
 Tim+0.041
Neg logit contributions
ceed-0.046
Whe-0.041
V-0.040
Three-0.040
 quickest-0.039
Token sky
Token ID6766
Feature activation+1.02e-02

Pos logit contributions
ceed+0.224
V+0.196
Whe+0.196
Three+0.192
hest+0.189
Neg logit contributions
ies-0.255
 coughing-0.237
 Tim-0.229
atted-0.229
 throats-0.227
Token before
Token ID878
Feature activation+2.11e-03

Pos logit contributions
ies+0.009
 coughing+0.007
atted+0.007
 Tim+0.007
anged+0.007
Neg logit contributions
ceed-0.024
Whe-0.022
V-0.022
Three-0.022
hest-0.022
Token they
Token ID484
Feature activation+1.64e-02

Pos logit contributions
ies+0.003
anged+0.003
atted+0.003
bie+0.002
 throats+0.002
Neg logit contributions
ceed-0.004
Whe-0.003
Three-0.003
Ready-0.003
 Zoo-0.003
Token walked
Token ID6807
Feature activation-7.79e-02

Pos logit contributions
ies+0.023
 coughing+0.020
atted+0.020
 Tim+0.019
anged+0.019
Neg logit contributions
ceed-0.030
Whe-0.028
V-0.027
Three-0.027
hest-0.026
Token fore
Token ID1674
Feature activation+3.45e-02

Pos logit contributions
ceed+0.159
Whe+0.146
V+0.144
Three+0.144
 quickest+0.142
Neg logit contributions
ies-0.088
 coughing-0.080
atted-0.077
 throats-0.075
anged-0.075
Tokenshadow
Token ID19106
Feature activation-2.43e-01

Pos logit contributions
ies+0.059
 coughing+0.054
atted+0.053
anged+0.052
 Tim+0.052
Neg logit contributions
ceed-0.052
Whe-0.046
V-0.045
Three-0.045
hest-0.044
Tokened
Token ID276
Feature activation-4.43e-02

Pos logit contributions
ceed+0.538
Whe+0.499
V+0.496
Three+0.491
hest+0.481
Neg logit contributions
ies-0.243
atted-0.209
 coughing-0.204
 Tim-0.200
anged-0.196
Token the
Token ID262
Feature activation-1.73e-01

Pos logit contributions
ceed+0.092
 Believe+0.084
Whe+0.084
 quickest+0.084
redible+0.083
Neg logit contributions
ies-0.045
 coughing-0.041
anged-0.040
atted-0.039
 throats-0.038
Token down
Token ID866
Feature activation-1.14e-01

Pos logit contributions
ceed+0.277
Whe+0.251
V+0.247
Three+0.241
hest+0.241
Neg logit contributions
ies-0.277
 coughing-0.255
 Tim-0.247
 they-0.246
 throats-0.243
Tokenpour
Token ID48681
Feature activation-7.72e-02

Pos logit contributions
ceed+0.259
Whe+0.239
V+0.237
Three+0.236
hest+0.231
Neg logit contributions
ies-0.105
 coughing-0.090
atted-0.087
anged-0.083
 Tim-0.083
Token that
Token ID326
Feature activation-3.82e-02

Pos logit contributions
ceed+0.167
Whe+0.155
V+0.153
Three+0.152
 quickest+0.148
Neg logit contributions
ies-0.080
 coughing-0.070
atted-0.068
 they-0.067
 Tim-0.067
Token,
Token ID11
Feature activation+1.35e-01

Pos logit contributions
ceed+0.003
Whe+0.003
V+0.003
Three+0.003
 quickest+0.003
Neg logit contributions
ies-0.004
 coughing-0.004
atted-0.004
 Tim-0.004
 they-0.004
Token but
Token ID475
Feature activation+7.46e-02

Pos logit contributions
ies+0.156
anged+0.149
atted+0.146
 throats+0.139
 coughing+0.139
Neg logit contributions
ceed-0.266
Whe-0.246
V-0.239
 Ready-0.237
Ready-0.236
Token it
Token ID340
Feature activation-3.52e-03

Pos logit contributions
ies+0.077
anged+0.073
atted+0.072
 coughing+0.069
 throats+0.066
Neg logit contributions
ceed-0.150
Whe-0.140
 quickest-0.139
 exact-0.137
 Anything-0.136
Token was
Token ID373
Feature activation-9.61e-03

Pos logit contributions
ceed+0.007
Whe+0.006
V+0.006
Three+0.006
 quickest+0.006
Neg logit contributions
ies-0.005
 coughing-0.004
atted-0.004
anged-0.004
 throats-0.004
Token fore
Token ID1674
Feature activation+7.36e-02

Pos logit contributions
ceed+0.017
 quickest+0.016
 exact+0.015
V+0.015
 biggest+0.015
Neg logit contributions
ies-0.013
anged-0.012
atted-0.012
 throats-0.012
 coughing-0.011
Tokenshadow
Token ID19106
Feature activation-2.33e-01

Pos logit contributions
ies+0.122
 coughing+0.111
atted+0.109
anged+0.106
i+0.106
Neg logit contributions
ceed-0.115
Whe-0.105
Three-0.101
V-0.101
 Believe-0.097
Tokened
Token ID276
Feature activation-4.01e-02

Pos logit contributions
ceed+0.497
Whe+0.460
V+0.455
Three+0.452
hest+0.442
Neg logit contributions
ies-0.250
atted-0.217
 coughing-0.214
 Tim-0.211
anged-0.205
Token in
Token ID287
Feature activation-8.00e-02

Pos logit contributions
ceed+0.080
 quickest+0.073
 Believe+0.072
 exact+0.072
Whe+0.071
Neg logit contributions
ies-0.045
 coughing-0.041
atted-0.040
anged-0.040
bie-0.038
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
ceed+0.144
Whe+0.130
V+0.128
Three+0.128
 quickest+0.125
Neg logit contributions
ies-0.113
 coughing-0.102
atted-0.100
anged-0.098
 throats-0.097
Token beginning
Token ID3726
Feature activation-1.03e-01

Pos logit contributions
ceed+0.240
Whe+0.215
V+0.212
hest+0.208
Three+0.207
Neg logit contributions
ies-0.238
 coughing-0.227
 Tim-0.217
 throats-0.217
 they-0.212
Token when
Token ID618
Feature activation-1.41e-02

Pos logit contributions
ceed+0.225
Whe+0.207
V+0.205
Three+0.204
hest+0.200
Neg logit contributions
ies-0.104
 coughing-0.091
atted-0.088
 Tim-0.084
anged-0.084
Token heavy
Token ID4334
Feature activation+9.83e-04

Pos logit contributions
ceed+0.232
Whe+0.208
V+0.205
Three+0.204
 quickest+0.198
Neg logit contributions
ies-0.216
 coughing-0.197
atted-0.193
anged-0.189
 Tim-0.189
Token bag
Token ID6131
Feature activation-6.93e-02

Pos logit contributions
atted+0.002
ies+0.002
anged+0.002
 coughing+0.002
 throats+0.002
Neg logit contributions
ceed-0.001
 exact-0.001
 date-0.001
 biggest-0.001
 quickest-0.001
Token had
Token ID550
Feature activation-4.82e-02

Pos logit contributions
ceed+0.148
hest+0.139
 quickest+0.136
 exact+0.134
 biggest+0.133
Neg logit contributions
ies-0.069
 coughing-0.062
 throats-0.061
anged-0.060
atted-0.060
Token been
Token ID587
Feature activation-4.67e-02

Pos logit contributions
ceed+0.092
Whe+0.083
V+0.082
Three+0.082
 quickest+0.081
Neg logit contributions
ies-0.062
 coughing-0.056
atted-0.055
anged-0.053
 throats-0.053
Token fore
Token ID1674
Feature activation+7.12e-02

Pos logit contributions
ceed+0.088
 quickest+0.080
 exact+0.079
hest+0.078
 fastest+0.077
Neg logit contributions
ies-0.057
 throats-0.051
anged-0.051
atted-0.050
 coughing-0.049
Tokenshadow
Token ID19106
Feature activation-2.30e-01

Pos logit contributions
ies+0.119
 coughing+0.109
atted+0.106
anged+0.104
i+0.103
Neg logit contributions
ceed-0.111
Whe-0.100
V-0.097
Three-0.097
 Believe-0.093
Tokened
Token ID276
Feature activation-2.73e-02

Pos logit contributions
ceed+0.491
Whe+0.461
V+0.456
Three+0.450
Insp+0.437
Neg logit contributions
ies-0.249
atted-0.214
 coughing-0.209
 Tim-0.208
anged-0.203
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
ceed+0.053
 quickest+0.049
 Believe+0.048
 exact+0.048
 biggest+0.047
Neg logit contributions
ies-0.031
 coughing-0.027
atted-0.027
anged-0.027
 throats-0.027
Token and
Token ID290
Feature activation+2.10e-01

Pos logit contributions
ies+0.109
anged+0.088
atted+0.083
bie+0.077
 throats+0.077
Neg logit contributions
ceed-0.297
 Believe-0.276
Insp-0.275
Whe-0.275
 exact-0.274
Token that
Token ID326
Feature activation-5.35e-03

Pos logit contributions
ies+0.270
atted+0.231
anged+0.230
 coughing+0.229
 throats+0.226
Neg logit contributions
ceed-0.403
 Believe-0.360
 quickest-0.353
Insp-0.351
Whe-0.351
Token setup
Token ID9058
Feature activation-5.98e-02

Pos logit contributions
ceed+0.011
 Believe+0.010
 quickest+0.010
Whe+0.010
V+0.010
Neg logit contributions
ies-0.006
atted-0.005
anged-0.005
 coughing-0.005
 throats-0.005
Token "
Token ID366
Feature activation-1.11e-01

Pos logit contributions
ceed+0.085
Whe+0.077
V+0.076
Three+0.076
hest+0.075
Neg logit contributions
ies-0.046
 coughing-0.041
atted-0.040
anged-0.038
 throats-0.038
TokenCl
Token ID2601
Feature activation-6.55e-02

Pos logit contributions
ceed+0.169
lor+0.149
 Believe+0.148
 biggest+0.146
Insp+0.145
Neg logit contributions
ies-0.179
i-0.160
 and-0.160
 Tim-0.159
bie-0.157
Tokenosed
Token ID1335
Feature activation-8.34e-03

Pos logit contributions
ceed+0.112
V+0.099
Whe+0.099
hest+0.098
Three+0.098
Neg logit contributions
ies-0.101
atted-0.091
anged-0.089
 coughing-0.088
i-0.088
Token -
Token ID532
Feature activation+1.02e-01

Pos logit contributions
ceed+0.016
Whe+0.015
V+0.015
Three+0.015
Insp+0.014
Neg logit contributions
ies-0.011
 they-0.010
 coughing-0.010
i-0.010
 Tim-0.009
Token Fore
Token ID4558
Feature activation+5.35e-02

Pos logit contributions
ies+0.139
 they+0.133
 Tim+0.131
 and+0.127
 coughing+0.125
Neg logit contributions
ceed-0.181
Whe-0.166
V-0.165
Three-0.164
hest-0.159
Tokenshadow
Token ID19106
Feature activation-2.30e-01

Pos logit contributions
ies+0.088
 coughing+0.079
atted+0.078
i+0.077
 they+0.077
Neg logit contributions
ceed-0.084
Whe-0.078
Three-0.076
V-0.076
 quickest-0.072
Tokening
Token ID278
Feature activation-3.31e-02

Pos logit contributions
ceed+0.499
Whe+0.482
V+0.477
Three+0.465
hest+0.463
Neg logit contributions
ies-0.235
atted-0.203
 Tim-0.194
i-0.193
 and-0.185
Token Storm
Token ID8865
Feature activation+6.09e-02

Pos logit contributions
ceed+0.066
Whe+0.061
V+0.060
Three+0.060
hest+0.058
Neg logit contributions
ies-0.040
 coughing-0.035
atted-0.034
 Tim-0.033
anged-0.033
Token".
Token ID1911
Feature activation-3.84e-02

Pos logit contributions
ies+0.091
 coughing+0.081
atted+0.080
 they+0.079
i+0.078
Neg logit contributions
ceed-0.105
Whe-0.096
V-0.095
Three-0.094
Insp-0.091
Token\n
Token ID198
Feature activation-1.31e-01

Pos logit contributions
ceed+0.069
V+0.061
Whe+0.060
 quickest+0.060
hest+0.060
Neg logit contributions
ies-0.052
 Tim-0.050
 coughing-0.049
 they-0.049
atted-0.048
Token\n
Token ID198
Feature activation-1.35e-01

Pos logit contributions
ceed+0.230
 quickest+0.223
 biggest+0.212
 fastest+0.211
Whe+0.209
Neg logit contributions
 Tim-0.175
 and-0.169
 they-0.167
 quickly-0.166
 Tommy-0.165
Token was
Token ID373
Feature activation-2.88e-02

Pos logit contributions
ceed+0.162
 quickest+0.146
 Believe+0.142
 exact+0.141
 Animal+0.141
Neg logit contributions
ies-0.170
atted-0.164
anged-0.160
 coughing-0.157
 throats-0.153
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
ceed+0.049
Whe+0.043
V+0.043
 quickest+0.042
Three+0.042
Neg logit contributions
ies-0.045
atted-0.040
anged-0.039
 coughing-0.039
 throats-0.038
Token setup
Token ID9058
Feature activation-6.43e-02

Pos logit contributions
ceed+0.172
Whe+0.155
V+0.152
Three+0.149
hest+0.147
Neg logit contributions
ies-0.168
 coughing-0.161
 Tim-0.152
 throats-0.151
 they-0.150
Token that
Token ID326
Feature activation-2.41e-02

Pos logit contributions
ceed+0.152
hest+0.140
lor+0.137
 Believe+0.136
 date+0.136
Neg logit contributions
ies-0.049
 throats-0.041
 coughing-0.040
atted-0.040
anged-0.040
Token fore
Token ID1674
Feature activation+4.05e-02

Pos logit contributions
ceed+0.053
 Believe+0.049
 quickest+0.048
Whe+0.048
 Museum+0.048
Neg logit contributions
ies-0.024
atted-0.021
 coughing-0.020
anged-0.019
 throats-0.019
Tokenshadow
Token ID19106
Feature activation-2.30e-01

Pos logit contributions
ies+0.062
atted+0.055
i+0.054
 coughing+0.053
anged+0.053
Neg logit contributions
ceed-0.069
Whe-0.066
Three-0.062
V-0.062
 Anything-0.060
Tokened
Token ID276
Feature activation-2.56e-02

Pos logit contributions
ceed+0.507
Whe+0.479
V+0.475
Three+0.466
hest+0.459
Neg logit contributions
ies-0.231
atted-0.202
 Tim-0.192
anged-0.186
 coughing-0.185
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
ceed+0.051
 quickest+0.046
 Believe+0.046
 exact+0.045
 Animal+0.045
Neg logit contributions
ies-0.029
 coughing-0.026
anged-0.025
 throats-0.025
atted-0.025
Token bad
Token ID2089
Feature activation-2.05e-03

Pos logit contributions
ceed+0.248
Whe+0.228
V+0.218
hest+0.216
lor+0.215
Neg logit contributions
ies-0.252
 coughing-0.237
 Tim-0.227
 throats-0.224
 they-0.221
Token ending
Token ID7464
Feature activation-4.39e-02

Pos logit contributions
ceed+0.003
 quickest+0.003
Ready+0.003
 Believe+0.003
Three+0.003
Neg logit contributions
ies-0.003
atted-0.003
 coughing-0.003
anged-0.003
ries-0.003
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
ceed+0.093
hest+0.085
 quickest+0.084
lor+0.082
 Believe+0.082
Neg logit contributions
ies-0.044
 coughing-0.041
 throats-0.040
atted-0.038
anged-0.038
Token go
Token ID467
Feature activation-3.63e-02

Pos logit contributions
 Tim+0.012
ies+0.011
 they+0.011
 and+0.011
 quickly+0.010
Neg logit contributions
ceed-0.016
lor-0.015
Whe-0.015
Three-0.014
V-0.014
Token.
Token ID13
Feature activation-4.34e-03

Pos logit contributions
ceed+0.060
 quickest+0.060
 fastest+0.059
hest+0.057
 exact+0.057
Neg logit contributions
 throats-0.049
atted-0.049
ies-0.048
 coughing-0.047
anged-0.046
Token\n
Token ID198
Feature activation-1.35e-01

Pos logit contributions
ceed+0.009
Whe+0.008
hest+0.008
lor+0.008
 quickest+0.008
Neg logit contributions
 Tim-0.005
ies-0.005
 they-0.005
 coughing-0.005
 Tommy-0.005
Token\n
Token ID198
Feature activation-1.36e-01

Pos logit contributions
ceed+0.223
Whe+0.204
 quickest+0.201
redible+0.191
 exact+0.190
Neg logit contributions
ies-0.208
 Tim-0.207
 they-0.203
 coughing-0.196
 and-0.192
Token"
Token ID1
Feature activation-1.07e-01

Pos logit contributions
ceed+0.214
Whe+0.194
 quickest+0.187
 biggest+0.183
 Believe+0.182
Neg logit contributions
ies-0.220
 they-0.214
 Tim-0.207
 coughing-0.201
 and-0.198
TokenWhich
Token ID13828
Feature activation-2.29e-01

Pos logit contributions
ceed+0.170
 biggest+0.158
hest+0.158
lor+0.157
 Anything+0.155
Neg logit contributions
 Tim-0.154
 they-0.153
 Tommy-0.144
ies-0.140
 coughing-0.138
Token way
Token ID835
Feature activation-2.80e-02

Pos logit contributions
ceed+0.418
Whe+0.378
redible+0.372
V+0.364
hest+0.361
Neg logit contributions
 they-0.346
ies-0.330
 Tim-0.325
 and-0.312
 because-0.299
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
ceed+0.064
Whe+0.059
V+0.059
Three+0.058
lor+0.056
Neg logit contributions
ies-0.028
 they-0.024
 Tim-0.024
 coughing-0.023
atted-0.022
Token Tom
Token ID4186
Feature activation+4.26e-02

Pos logit contributions
ies+0.112
 Tim+0.102
 they+0.100
 coughing+0.091
 Tommy+0.090
Neg logit contributions
ceed-0.255
Three-0.235
Whe-0.235
V-0.233
lor-0.232
Token?"
Token ID1701
Feature activation-4.62e-02

Pos logit contributions
ies+0.078
 throats+0.076
atted+0.075
 coughing+0.075
anged+0.075
Neg logit contributions
ceed-0.054
 quickest-0.045
V-0.044
Whe-0.044
lor-0.044
Token Lily
Token ID20037
Feature activation+6.77e-02

Pos logit contributions
ceed+0.100
Whe+0.093
V+0.093
Three+0.092
 quickest+0.089
Neg logit contributions
ies-0.048
 coughing-0.041
atted-0.041
anged-0.039
i-0.039
Token that
Token ID326
Feature activation-2.55e-02

Pos logit contributions
ceed+0.057
 Believe+0.051
 Anything+0.050
 biggest+0.048
 Animal+0.048
Neg logit contributions
bie-0.072
anged-0.070
 tongues-0.066
ies-0.066
 throats-0.066
Token it
Token ID340
Feature activation-2.77e-02

Pos logit contributions
ceed+0.052
 Believe+0.048
 Anything+0.047
 Ready+0.047
 biggest+0.046
Neg logit contributions
ies-0.027
anged-0.027
atted-0.027
 throats-0.024
bie-0.024
Token was
Token ID373
Feature activation-2.86e-02

Pos logit contributions
ceed+0.042
 Believe+0.037
 quickest+0.037
 Animal+0.036
 fastest+0.035
Neg logit contributions
ies-0.045
 throats-0.043
anged-0.042
atted-0.042
 coughing-0.041
Token a
Token ID257
Feature activation-5.29e-02

Pos logit contributions
ceed+0.045
 quickest+0.040
 exact+0.039
 biggest+0.039
 Animal+0.039
Neg logit contributions
ies-0.046
anged-0.043
atted-0.043
 throats-0.041
 coughing-0.040
Token '
Token ID705
Feature activation-5.66e-02

Pos logit contributions
ceed+0.076
Whe+0.066
V+0.065
Three+0.065
 quickest+0.063
Neg logit contributions
ies-0.094
atted-0.087
 coughing-0.087
anged-0.085
 throats-0.084
Tokenshadow
Token ID19106
Feature activation-2.28e-01

Pos logit contributions
ceed+0.091
lor+0.079
 biggest+0.077
 Believe+0.077
Insp+0.076
Neg logit contributions
ies-0.089
 Tim-0.082
 coughing-0.082
anged-0.082
i-0.080
Token'
Token ID6
Feature activation-7.11e-02

Pos logit contributions
ceed+0.495
Whe+0.478
V+0.467
Three+0.459
hest+0.455
Neg logit contributions
ies-0.240
atted-0.209
 Tim-0.193
 and-0.192
i-0.192
Token of
Token ID286
Feature activation-5.73e-02

Pos logit contributions
ceed+0.147
Whe+0.133
V+0.132
Three+0.131
 quickest+0.129
Neg logit contributions
ies-0.081
 coughing-0.072
atted-0.071
anged-0.068
 throats-0.068
Token her
Token ID607
Feature activation-5.34e-02

Pos logit contributions
ceed+0.102
Whe+0.090
V+0.090
Three+0.089
 Believe+0.088
Neg logit contributions
ies-0.082
 coughing-0.073
atted-0.073
anged-0.072
 throats-0.070
Token own
Token ID898
Feature activation-5.89e-02

Pos logit contributions
ceed+0.095
Whe+0.086
V+0.085
Three+0.084
 quickest+0.084
Neg logit contributions
ies-0.074
 coughing-0.068
atted-0.067
anged-0.065
 throats-0.064
Token shape
Token ID5485
Feature activation-5.21e-02

Pos logit contributions
ceed+0.103
Whe+0.093
V+0.092
Three+0.091
hest+0.089
Neg logit contributions
ies-0.088
 coughing-0.079
atted-0.077
 Tim-0.076
anged-0.075
Token herbs
Token ID25411
Feature activation+6.52e-02

Pos logit contributions
ceed+0.056
Three+0.050
Whe+0.050
V+0.050
hest+0.049
Neg logit contributions
ies-0.052
 coughing-0.048
atted-0.047
anged-0.047
 Tim-0.045
Token.
Token ID13
Feature activation+3.09e-03

Pos logit contributions
atted+0.067
ies+0.066
anged+0.065
bie+0.063
 coughing+0.062
Neg logit contributions
ceed-0.127
hest-0.123
 quickest-0.118
 exact-0.117
 biggest-0.116
Token\n
Token ID198
Feature activation-1.31e-01

Pos logit contributions
 Tim+0.004
 they+0.004
 Tommy+0.003
 coughing+0.003
 and+0.003
Neg logit contributions
ceed-0.006
 quickest-0.006
redible-0.006
 fastest-0.005
hest-0.005
Token\n
Token ID198
Feature activation-1.33e-01

Pos logit contributions
ceed+0.233
 quickest+0.215
Whe+0.210
 fastest+0.204
 exact+0.202
Neg logit contributions
 they-0.188
ies-0.188
 Tim-0.188
 and-0.179
 coughing-0.177
Token"
Token ID1
Feature activation-1.05e-01

Pos logit contributions
ceed+0.213
 biggest+0.195
 quickest+0.193
Whe+0.191
 exact+0.188
Neg logit contributions
 they-0.205
ies-0.205
 coughing-0.194
 and-0.188
 badly-0.182
TokenWhich
Token ID13828
Feature activation-2.28e-01

Pos logit contributions
ceed+0.185
 Believe+0.171
ieve+0.170
lor+0.170
 quickest+0.166
Neg logit contributions
 they-0.131
 Tim-0.120
 and-0.119
ies-0.118
 throats-0.118
Token butter
Token ID9215
Feature activation+6.42e-03

Pos logit contributions
ceed+0.417
Whe+0.380
V+0.368
Three+0.363
redible+0.362
Neg logit contributions
ies-0.328
 coughing-0.296
 they-0.293
 throats-0.284
 Tim-0.284
Token do
Token ID466
Feature activation-8.77e-02

Pos logit contributions
ies+0.011
 coughing+0.010
atted+0.010
anged+0.010
 throats+0.010
Neg logit contributions
ceed-0.010
Whe-0.008
V-0.008
Three-0.008
hest-0.008
Token we
Token ID356
Feature activation-3.38e-02

Pos logit contributions
ceed+0.184
V+0.173
Whe+0.173
Three+0.170
Ready+0.168
Neg logit contributions
ies-0.104
 they-0.097
 Tim-0.089
 coughing-0.085
i-0.083
Token need
Token ID761
Feature activation-1.16e-01

Pos logit contributions
ceed+0.056
Whe+0.051
V+0.050
Three+0.050
hest+0.048
Neg logit contributions
ies-0.052
 coughing-0.047
atted-0.046
 Tim-0.045
anged-0.045
Token,
Token ID11
Feature activation+1.16e-01

Pos logit contributions
ceed+0.209
Whe+0.187
Three+0.186
V+0.185
hest+0.184
Neg logit contributions
ies-0.161
 coughing-0.146
atted-0.144
anged-0.140
 throats-0.139
Token story
Token ID1621
Feature activation-9.18e-03

Pos logit contributions
ceed+0.131
Whe+0.121
 quickest+0.115
Three+0.113
hest+0.113
Neg logit contributions
ies-0.124
 and-0.112
 coughing-0.109
 they-0.108
bie-0.106
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
ceed+0.022
Whe+0.022
Three+0.021
V+0.021
Ready+0.020
Neg logit contributions
ies-0.007
 and-0.006
 they-0.006
 because-0.006
 coughing-0.005
Token\n
Token ID198
Feature activation-1.25e-01

Pos logit contributions
ies+0.016
 Tim+0.015
 coughing+0.015
 they+0.015
atted+0.015
Neg logit contributions
ceed-0.021
Whe-0.019
hest-0.018
 quickest-0.018
Three-0.018
Token\n
Token ID198
Feature activation-1.28e-01

Pos logit contributions
ceed+0.219
 quickest+0.211
Whe+0.203
 biggest+0.201
 fastest+0.200
Neg logit contributions
 Tim-0.174
 they-0.174
ies-0.173
 and-0.169
 Tommy-0.159
Token"
Token ID1
Feature activation-1.01e-01

Pos logit contributions
ceed+0.186
Whe+0.166
 biggest+0.162
 quickest+0.161
lor+0.156
Neg logit contributions
ies-0.219
 they-0.204
 coughing-0.202
 Tim-0.198
 and-0.193
TokenWhich
Token ID13828
Feature activation-2.27e-01

Pos logit contributions
ceed+0.171
hest+0.159
 quickest+0.157
ieve+0.153
 exact+0.150
Neg logit contributions
 they-0.144
 she-0.134
 and-0.129
ho-0.121
 he-0.120
Token story
Token ID1621
Feature activation-4.09e-02

Pos logit contributions
ceed+0.455
Whe+0.440
redible+0.437
hest+0.432
tted+0.415
Neg logit contributions
 and-0.276
 they-0.276
ies-0.268
 to-0.257
--0.245
Token do
Token ID466
Feature activation-8.46e-02

Pos logit contributions
Whe+0.076
lor+0.074
ceed+0.073
Three+0.072
 quickest+0.071
Neg logit contributions
 and-0.053
 they-0.052
ies-0.052
 because-0.047
 to-0.047
Token you
Token ID345
Feature activation-1.35e-02

Pos logit contributions
ceed+0.192
Whe+0.183
V+0.179
Three+0.178
redible+0.177
Neg logit contributions
 they-0.087
ies-0.083
 Tim-0.074
 he-0.073
 because-0.068
Token want
Token ID765
Feature activation-7.17e-02

Pos logit contributions
ceed+0.028
Whe+0.026
lor+0.026
V+0.025
Three+0.025
Neg logit contributions
ies-0.016
 and-0.015
 because-0.015
 Tim-0.014
i-0.014
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
ceed+0.184
Whe+0.179
Three+0.172
V+0.171
redible+0.170
Neg logit contributions
ies-0.046
 and-0.037
 they-0.037
 Tim-0.037
 coughing-0.037
Token.
Token ID13
Feature activation+7.23e-03

Pos logit contributions
ceed+0.041
 quickest+0.038
hest+0.038
Three+0.037
 exact+0.037
Neg logit contributions
ies-0.023
atted-0.020
 throats-0.019
 coughing-0.019
anged-0.019
Token He
Token ID679
Feature activation+5.24e-02

Pos logit contributions
ies+0.010
 coughing+0.008
atted+0.008
anged+0.008
i+0.008
Neg logit contributions
ceed-0.014
Whe-0.013
V-0.012
Three-0.012
 Believe-0.012
Token had
Token ID550
Feature activation-3.91e-02

Pos logit contributions
ies+0.074
ries+0.065
atted+0.063
 throats+0.062
 coughing+0.062
Neg logit contributions
ceed-0.089
V-0.086
Three-0.084
 quickest-0.083
 nic-0.082
Token been
Token ID587
Feature activation-4.18e-02

Pos logit contributions
ceed+0.075
Whe+0.068
V+0.068
Three+0.068
 quickest+0.067
Neg logit contributions
ies-0.051
 coughing-0.044
atted-0.044
anged-0.043
 throats-0.042
Token fore
Token ID1674
Feature activation+7.15e-02

Pos logit contributions
ceed+0.085
Whe+0.078
V+0.077
Three+0.077
 quickest+0.077
Neg logit contributions
ies-0.049
 coughing-0.041
 throats-0.041
atted-0.041
anged-0.040
Tokenshadow
Token ID19106
Feature activation-2.26e-01

Pos logit contributions
ies+0.125
 coughing+0.115
atted+0.113
 Tim+0.111
 throats+0.111
Neg logit contributions
ceed-0.104
Whe-0.091
V-0.090
Three-0.089
hest-0.087
Tokened
Token ID276
Feature activation-2.69e-02

Pos logit contributions
ceed+0.498
Whe+0.463
V+0.458
Three+0.454
 quickest+0.445
Neg logit contributions
ies-0.229
atted-0.197
 coughing-0.193
 Tim-0.189
anged-0.186
Token about
Token ID546
Feature activation-5.88e-03

Pos logit contributions
ceed+0.057
 quickest+0.053
 Believe+0.052
Whe+0.052
 Animal+0.052
Neg logit contributions
ies-0.028
 coughing-0.024
atted-0.023
anged-0.023
 throats-0.023
Token this
Token ID428
Feature activation-1.19e-01

Pos logit contributions
ceed+0.010
Whe+0.009
 Believe+0.009
V+0.009
 Animal+0.009
Neg logit contributions
ies-0.009
atted-0.008
anged-0.008
 coughing-0.007
 throats-0.007
Token bad
Token ID2089
Feature activation+8.38e-03

Pos logit contributions
ceed+0.192
Whe+0.170
 quickest+0.169
Three+0.168
 exact+0.167
Neg logit contributions
ies-0.186
 coughing-0.171
atted-0.171
anged-0.167
 throats-0.165
Token ending
Token ID7464
Feature activation-4.02e-02

Pos logit contributions
ies+0.014
atted+0.014
 coughing+0.013
anged+0.013
 throats+0.013
Neg logit contributions
ceed-0.012
 quickest-0.011
 exact-0.011
 Believe-0.010
 biggest-0.010
Token eat
Token ID4483
Feature activation-5.94e-02

Pos logit contributions
ies+0.034
 coughing+0.031
atted+0.031
anged+0.030
 throats+0.030
Neg logit contributions
ceed-0.032
Whe-0.028
V-0.028
Three-0.027
 quickest-0.027
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
ceed+0.104
 date+0.097
hest+0.096
 exact+0.096
redible+0.096
Neg logit contributions
anged-0.078
atted-0.075
ies-0.075
 coughing-0.070
bie-0.068
Token\n
Token ID198
Feature activation-1.28e-01

Pos logit contributions
ies+0.017
 coughing+0.015
atted+0.015
anged+0.015
 Tim+0.015
Neg logit contributions
ceed-0.020
Whe-0.018
V-0.018
Three-0.018
hest-0.017
Token\n
Token ID198
Feature activation-1.31e-01

Pos logit contributions
ceed+0.237
Whe+0.214
 quickest+0.212
 exact+0.203
 fastest+0.201
Neg logit contributions
 Tim-0.183
 they-0.177
ies-0.175
 and-0.170
 Tommy-0.165
Token"
Token ID1
Feature activation-9.94e-02

Pos logit contributions
ceed+0.190
Whe+0.167
lor+0.159
 quickest+0.158
 biggest+0.155
Neg logit contributions
ies-0.229
 Tim-0.214
 coughing-0.211
 they-0.211
atted-0.205
TokenWhich
Token ID13828
Feature activation-2.26e-01

Pos logit contributions
ceed+0.135
lor+0.114
Whe+0.112
hest+0.111
 Believe+0.110
Neg logit contributions
ies-0.175
 coughing-0.169
 Tim-0.168
 they-0.167
 throats-0.161
Token cereal
Token ID33158
Feature activation+8.16e-03

Pos logit contributions
ceed+0.410
Whe+0.368
redible+0.361
V+0.360
Three+0.353
Neg logit contributions
ies-0.320
 coughing-0.298
 they-0.295
 Tim-0.293
 and-0.281
Token should
Token ID815
Feature activation-9.16e-03

Pos logit contributions
ies+0.014
 coughing+0.013
atted+0.012
anged+0.012
 Tim+0.012
Neg logit contributions
ceed-0.012
Whe-0.011
V-0.011
Three-0.011
hest-0.010
Token I
Token ID314
Feature activation-6.12e-02

Pos logit contributions
ceed+0.012
Three+0.011
hest+0.011
Whe+0.010
V+0.010
Neg logit contributions
ies-0.017
atted-0.016
anged-0.015
 coughing-0.015
bie-0.015
Token load
Token ID3440
Feature activation-1.18e-01

Pos logit contributions
ceed+0.084
Whe+0.073
Three+0.072
V+0.072
hest+0.070
Neg logit contributions
ies-0.112
 coughing-0.104
atted-0.103
anged-0.100
 throats-0.100
Token?"
Token ID1701
Feature activation-6.17e-02

Pos logit contributions
ceed+0.223
Whe+0.202
V+0.200
Three+0.199
hest+0.195
Neg logit contributions
ies-0.154
 coughing-0.139
atted-0.136
anged-0.133
 throats-0.132
Token long
Token ID890
Feature activation+5.57e-02

Pos logit contributions
ceed+0.035
 biggest+0.033
 quickest+0.032
 exact+0.032
 fastest+0.031
Neg logit contributions
anged-0.040
ies-0.039
atted-0.038
 throats-0.037
 coughing-0.037
Token.
Token ID13
Feature activation+9.82e-03

Pos logit contributions
ies+0.047
bie+0.047
anged+0.044
 Johnny+0.043
atted+0.042
Neg logit contributions
ceed-0.116
 exact-0.113
 quickest-0.112
Three-0.110
 biggest-0.110
Token\n
Token ID198
Feature activation-1.26e-01

Pos logit contributions
 Tim+0.014
 they+0.013
ies+0.013
 Tommy+0.013
 coughing+0.012
Neg logit contributions
ceed-0.018
Whe-0.016
lor-0.015
 quickest-0.015
hest-0.015
Token\n
Token ID198
Feature activation-1.29e-01

Pos logit contributions
ceed+0.211
Whe+0.189
 quickest+0.186
redible+0.177
 exact+0.177
Neg logit contributions
ies-0.196
 Tim-0.192
 they-0.189
 coughing-0.181
 and-0.180
Token"
Token ID1
Feature activation-1.00e-01

Pos logit contributions
ceed+0.193
Whe+0.169
lor+0.163
 quickest+0.161
 Believe+0.160
Neg logit contributions
ies-0.225
 they-0.210
 coughing-0.206
 Tim-0.206
atted-0.197
TokenWhich
Token ID13828
Feature activation-2.25e-01

Pos logit contributions
ceed+0.168
lor+0.161
ieve+0.151
 Believe+0.146
 quickest+0.142
Neg logit contributions
 they-0.147
 Tim-0.135
 and-0.130
 Tommy-0.127
ies-0.123
Token one
Token ID530
Feature activation-9.70e-02

Pos logit contributions
ceed+0.426
Whe+0.392
V+0.381
Three+0.372
redible+0.368
Neg logit contributions
ies-0.311
 they-0.302
 Tim-0.287
 and-0.280
 coughing-0.271
Token should
Token ID815
Feature activation-7.16e-03

Pos logit contributions
ceed+0.138
 quickest+0.118
 Believe+0.118
Whe+0.118
hest+0.116
Neg logit contributions
ies-0.164
 coughing-0.158
atted-0.157
anged-0.154
 throats-0.154
Token we
Token ID356
Feature activation-4.45e-02

Pos logit contributions
ceed+0.010
Three+0.009
hest+0.009
 quickest+0.009
Whe+0.009
Neg logit contributions
ies-0.013
atted-0.012
 coughing-0.011
anged-0.011
bie-0.011
Token get
Token ID651
Feature activation-1.03e-01

Pos logit contributions
ceed+0.069
Whe+0.063
V+0.061
lor+0.060
Three+0.060
Neg logit contributions
ies-0.074
 Tim-0.068
 coughing-0.068
 they-0.067
 because-0.066
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
ceed+0.172
hest+0.151
 biggest+0.148
Three+0.147
V+0.147
Neg logit contributions
ies-0.152
 coughing-0.141
anged-0.139
atted-0.138
 throats-0.137
Token that
Token ID326
Feature activation+1.95e-02

Pos logit contributions
ies+0.314
atted+0.290
 coughing+0.288
anged+0.276
 throats+0.268
Neg logit contributions
ceed-0.505
Whe-0.459
Three-0.456
V-0.452
 Ready-0.451
Token they
Token ID484
Feature activation+4.12e-02

Pos logit contributions
ies+0.021
 coughing+0.019
atted+0.018
anged+0.018
 throats+0.017
Neg logit contributions
ceed-0.041
Whe-0.038
V-0.038
Three-0.037
 quickest-0.037
Token were
Token ID547
Feature activation-3.21e-02

Pos logit contributions
ies+0.050
 coughing+0.046
atted+0.045
 throats+0.044
anged+0.043
Neg logit contributions
ceed-0.081
V-0.072
Whe-0.072
Three-0.072
hest-0.072
Token going
Token ID1016
Feature activation-3.15e-02

Pos logit contributions
ceed+0.059
 quickest+0.053
Whe+0.052
 biggest+0.052
 fastest+0.052
Neg logit contributions
ies-0.044
atted-0.039
anged-0.039
 throats-0.039
 coughing-0.039
Token to
Token ID284
Feature activation-2.38e-02

Pos logit contributions
ceed+0.073
 quickest+0.069
 biggest+0.068
Three+0.068
 fastest+0.068
Neg logit contributions
ies-0.025
 coughing-0.022
atted-0.022
 throats-0.022
i-0.020
Token outer
Token ID12076
Feature activation-2.25e-01

Pos logit contributions
ceed+0.043
lor+0.043
Whe+0.039
redible+0.038
hest+0.038
Neg logit contributions
 Tim-0.035
 they-0.032
 and-0.031
 John-0.031
 Billy-0.030
Token space
Token ID2272
Feature activation-2.22e-02

Pos logit contributions
ceed+0.440
Whe+0.393
redible+0.385
lor+0.366
 quickest+0.360
Neg logit contributions
 and-0.309
 Tim-0.286
ies-0.284
 they-0.278
 because-0.277
Token.
Token ID13
Feature activation-7.68e-03

Pos logit contributions
ceed+0.048
Whe+0.044
V+0.043
Three+0.043
 quickest+0.042
Neg logit contributions
ies-0.023
 coughing-0.020
atted-0.020
anged-0.019
 throats-0.019
Token They
Token ID1119
Feature activation+3.26e-02

Pos logit contributions
ceed+0.015
Whe+0.013
V+0.013
redible+0.013
Three+0.013
Neg logit contributions
ies-0.009
 Tim-0.009
 they-0.009
 coughing-0.009
atted-0.009
Token watched
Token ID7342
Feature activation-8.72e-02

Pos logit contributions
ies+0.038
 coughing+0.035
atted+0.034
 throats+0.033
anged+0.032
Neg logit contributions
ceed-0.065
V-0.060
Three-0.059
Whe-0.059
hest-0.058
Token out
Token ID503
Feature activation-4.70e-02

Pos logit contributions
ceed+0.152
V+0.135
Whe+0.134
redible+0.133
 Believe+0.132
Neg logit contributions
ies-0.119
 coughing-0.115
bie-0.111
anged-0.111
atted-0.110
Token,
Token ID11
Feature activation+1.64e-01

Pos logit contributions
ies+0.026
 coughing+0.023
atted+0.022
anged+0.021
 throats+0.020
Neg logit contributions
ceed-0.115
Whe-0.107
hest-0.106
redible-0.106
 quickest-0.105
Token and
Token ID290
Feature activation+2.29e-01

Pos logit contributions
anged+0.161
ies+0.147
atted+0.143
 throats+0.136
 badly+0.135
Neg logit contributions
Whe-0.340
ceed-0.338
 nic-0.331
 biggest-0.329
 quickest-0.323
Token even
Token ID772
Feature activation+1.83e-01

Pos logit contributions
ies+0.300
atted+0.281
anged+0.280
 coughing+0.276
 throats+0.269
Neg logit contributions
ceed-0.409
Whe-0.377
V-0.372
 Anything-0.370
 biggest-0.369
Token went
Token ID1816
Feature activation-2.11e-02

Pos logit contributions
ies+0.249
 coughing+0.229
atted+0.227
anged+0.221
 throats+0.220
Neg logit contributions
ceed-0.332
Whe-0.300
V-0.295
Three-0.293
 quickest-0.291
Token to
Token ID284
Feature activation-2.67e-03

Pos logit contributions
ceed+0.046
Whe+0.042
V+0.042
Three+0.041
 quickest+0.041
Neg logit contributions
ies-0.022
atted-0.019
 coughing-0.018
anged-0.018
 throats-0.018
Token outer
Token ID12076
Feature activation-2.25e-01

Pos logit contributions
ceed+0.006
lor+0.005
redible+0.005
Three+0.005
hest+0.005
Neg logit contributions
 Tim-0.003
 they-0.003
 and-0.003
 eating-0.003
 because-0.003
Token space
Token ID2272
Feature activation-2.84e-02

Pos logit contributions
ceed+0.429
Whe+0.377
redible+0.373
 quickest+0.358
hest+0.354
Neg logit contributions
ies-0.320
 and-0.309
 Tim-0.291
 because-0.286
 they-0.277
Token!
Token ID0
Feature activation-5.18e-03

Pos logit contributions
ceed+0.063
Whe+0.058
V+0.057
Three+0.057
 quickest+0.056
Neg logit contributions
ies-0.028
 coughing-0.025
atted-0.024
anged-0.023
 Tim-0.023
Token It
Token ID632
Feature activation-3.49e-02

Pos logit contributions
ceed+0.009
 quickest+0.008
redible+0.008
Three+0.008
Whe+0.008
Neg logit contributions
 Tim-0.007
ies-0.007
 they-0.007
 because-0.007
 Tommy-0.007
Token was
Token ID373
Feature activation-1.72e-02

Pos logit contributions
ceed+0.077
Whe+0.071
V+0.070
Three+0.070
redible+0.069
Neg logit contributions
ies-0.037
 they-0.033
 Tim-0.032
 coughing-0.031
 and-0.030
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
ceed+0.029
Whe+0.027
Three+0.026
V+0.026
hest+0.025
Neg logit contributions
ies-0.026
 coughing-0.023
atted-0.023
 Tim-0.023
 they-0.022
Token girl
Token ID2576
Feature activation+4.17e-02

Pos logit contributions
dd+0.076
anged+0.076
 coughing+0.076
 throats+0.075
atted+0.075
Neg logit contributions
 biggest-0.040
 exact-0.039
ceed-0.039
 quickest-0.038
 nic-0.038
Token remembered
Token ID12086
Feature activation-1.68e-02

Pos logit contributions
ies+0.063
 throats+0.058
 coughing+0.055
 accidents+0.055
anged+0.055
Neg logit contributions
ceed-0.061
 nic-0.059
Three-0.058
 Believe-0.055
V-0.055
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
ceed+0.025
 exact+0.022
 Believe+0.022
redible+0.022
 Anything+0.022
Neg logit contributions
ies-0.025
anged-0.025
atted-0.023
 throats-0.023
attered-0.022
Token earlier
Token ID2961
Feature activation+3.98e-04

Pos logit contributions
ceed+0.205
Whe+0.181
V+0.179
Three+0.178
 quickest+0.176
Neg logit contributions
ies-0.209
atted-0.192
 coughing-0.191
anged-0.189
 throats-0.184
Token fore
Token ID1674
Feature activation+5.81e-02

Pos logit contributions
ies+0.000
atted+0.000
 coughing+0.000
anged+0.000
 throats+0.000
Neg logit contributions
ceed-0.001
Whe-0.001
V-0.001
Three-0.001
 Believe-0.001
Tokenshadow
Token ID19106
Feature activation-2.24e-01

Pos logit contributions
ies+0.097
 coughing+0.088
atted+0.087
anged+0.085
i+0.084
Neg logit contributions
ceed-0.091
Whe-0.083
V-0.080
Three-0.080
 Believe-0.076
Tokening
Token ID278
Feature activation-3.63e-02

Pos logit contributions
ceed+0.486
Whe+0.450
V+0.446
Three+0.443
hest+0.435
Neg logit contributions
ies-0.234
atted-0.201
 coughing-0.198
 Tim-0.194
anged-0.190
Token,
Token ID11
Feature activation+1.23e-01

Pos logit contributions
ceed+0.075
 quickest+0.067
redible+0.067
Whe+0.067
 Believe+0.067
Neg logit contributions
ies-0.039
anged-0.034
 coughing-0.034
atted-0.034
 throats-0.033
Token and
Token ID290
Feature activation+2.06e-01

Pos logit contributions
ies+0.138
anged+0.124
atted+0.120
 throats+0.116
 coughing+0.114
Neg logit contributions
ceed-0.251
Whe-0.231
 Believe-0.231
 exact-0.230
Insp-0.229
Token it
Token ID340
Feature activation-5.51e-03

Pos logit contributions
ies+0.265
 throats+0.240
 coughing+0.238
anged+0.235
atted+0.233
Neg logit contributions
ceed-0.378
 Believe-0.337
Insp-0.328
 Ready-0.327
 quickest-0.326
Token saved
Token ID7448
Feature activation-6.48e-02

Pos logit contributions
ceed+0.011
Whe+0.010
V+0.010
Three+0.010
 quickest+0.009
Neg logit contributions
ies-0.007
 coughing-0.006
atted-0.006
anged-0.006
 throats-0.006
Token three
Token ID1115
Feature activation+2.39e-02

Pos logit contributions
 they+0.254
ies+0.248
 Tim+0.247
 and+0.247
 because+0.233
Neg logit contributions
ceed-0.197
lor-0.172
redible-0.157
hest-0.157
 Believe-0.157
Token,
Token ID11
Feature activation+1.58e-01

Pos logit contributions
ies+0.020
 coughing+0.018
anged+0.017
atted+0.017
 throats+0.016
Neg logit contributions
ceed-0.055
Whe-0.050
Three-0.050
V-0.050
hest-0.050
Token four
Token ID1440
Feature activation+8.80e-03

Pos logit contributions
 they+0.294
ies+0.294
 Tim+0.293
 and+0.292
 because+0.276
Neg logit contributions
ceed-0.210
lor-0.181
Whe-0.167
hest-0.167
redible-0.166
Token,
Token ID11
Feature activation+1.67e-01

Pos logit contributions
ies+0.006
 coughing+0.004
atted+0.004
 they+0.004
 Tim+0.004
Neg logit contributions
ceed-0.023
Whe-0.022
V-0.022
Three-0.021
Ready-0.021
Token five
Token ID1936
Feature activation-1.18e-02

Pos logit contributions
 and+0.308
ies+0.301
 they+0.299
 Tim+0.292
 because+0.284
Neg logit contributions
ceed-0.231
lor-0.197
hest-0.185
redible-0.184
Whe-0.184
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
ceed+0.028
Whe+0.026
V+0.026
Three+0.025
redible+0.025
Neg logit contributions
ies-0.011
 coughing-0.009
atted-0.008
 they-0.008
 Tim-0.008
Token six
Token ID2237
Feature activation-4.99e-02

Pos logit contributions
 and+0.328
ies+0.319
 they+0.316
 Tim+0.307
 because+0.302
Neg logit contributions
ceed-0.237
lor-0.198
hest-0.191
redible-0.190
Whe-0.188
Token,
Token ID11
Feature activation+1.72e-01

Pos logit contributions
ceed+0.129
Whe+0.121
V+0.120
Three+0.118
Ready+0.117
Neg logit contributions
ies-0.035
 coughing-0.026
atted-0.025
 they-0.024
i-0.023
Token seven
Token ID3598
Feature activation-2.93e-02

Pos logit contributions
ies+0.317
 and+0.309
 they+0.300
 coughing+0.297
 Tim+0.294
Neg logit contributions
ceed-0.232
lor-0.194
 Believe-0.188
hest-0.187
redible-0.183
Token,
Token ID11
Feature activation+1.64e-01

Pos logit contributions
ceed+0.077
Whe+0.071
V+0.071
Three+0.070
redible+0.069
Neg logit contributions
ies-0.019
 coughing-0.014
atted-0.014
 they-0.013
 Tim-0.013
Token eight
Token ID3624
Feature activation+3.77e-03

Pos logit contributions
ies+0.255
 and+0.240
 coughing+0.238
 they+0.237
 Tim+0.234
Neg logit contributions
ceed-0.269
hest-0.234
Whe-0.231
lor-0.228
 Believe-0.226
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
ies+0.018
atted+0.017
 coughing+0.017
anged+0.016
 throats+0.016
Neg logit contributions
ceed-0.030
hest-0.027
 quickest-0.027
Three-0.027
V-0.026
Token The
Token ID383
Feature activation-1.14e-01

Pos logit contributions
ies+0.018
 coughing+0.016
atted+0.016
 Tim+0.016
 they+0.015
Neg logit contributions
ceed-0.034
Whe-0.031
V-0.031
Three-0.031
hest-0.030
Token shade
Token ID17979
Feature activation-1.99e-03

Pos logit contributions
ceed+0.209
lor+0.189
hest+0.186
Whe+0.183
V+0.183
Neg logit contributions
ies-0.164
 Tim-0.144
 coughing-0.143
 and-0.141
 they-0.137
Token is
Token ID318
Feature activation-5.23e-03

Pos logit contributions
ceed+0.002
Three+0.002
Whe+0.002
 middle+0.002
hest+0.002
Neg logit contributions
ies-0.004
 Billy-0.004
 Joey-0.004
 coughing-0.004
 Johnny-0.004
Token cool
Token ID3608
Feature activation+7.18e-02

Pos logit contributions
ceed+0.009
 biggest+0.008
redible+0.008
V+0.008
 exact+0.008
Neg logit contributions
anged-0.008
ies-0.007
atted-0.007
olph-0.007
i-0.007
Token and
Token ID290
Feature activation+2.44e-01

Pos logit contributions
anged+0.144
atted+0.140
bie+0.135
olph+0.131
 Johnny+0.129
Neg logit contributions
 biggest-0.074
hest-0.070
 exact-0.068
 quickest-0.067
 edge-0.066
Token lovely
Token ID14081
Feature activation+2.94e-02

Pos logit contributions
ies+0.355
anged+0.331
atted+0.327
 coughing+0.321
i+0.310
Neg logit contributions
ceed-0.411
Whe-0.379
V-0.368
 quickest-0.365
 biggest-0.363
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
anged+0.048
atted+0.047
ies+0.046
bie+0.046
 coughing+0.045
Neg logit contributions
ceed-0.040
 biggest-0.036
 quickest-0.036
 date-0.036
 Animal-0.035
Token They
Token ID1119
Feature activation+4.81e-02

Pos logit contributions
ies+0.017
 Tim+0.016
 they+0.016
 coughing+0.016
 throats+0.015
Neg logit contributions
ceed-0.033
hest-0.029
redible-0.029
 quickest-0.029
lor-0.029
Token feel
Token ID1254
Feature activation-9.81e-03

Pos logit contributions
ies+0.082
 coughing+0.076
atted+0.074
 Tim+0.073
anged+0.073
Neg logit contributions
ceed-0.073
Whe-0.064
V-0.063
Three-0.063
hest-0.061
Token happy
Token ID3772
Feature activation+7.55e-02

Pos logit contributions
ceed+0.017
Whe+0.016
V+0.016
 biggest+0.015
 Believe+0.015
Neg logit contributions
ies-0.014
anged-0.013
atted-0.012
 coughing-0.012
bie-0.012
Token was
Token ID373
Feature activation-1.57e-02

Pos logit contributions
 accidents+0.135
 throats+0.130
othe+0.128
ries+0.127
 badly+0.124
Neg logit contributions
 smallest-0.077
 lead-0.075
 nic-0.075
Three-0.074
 exact-0.074
Token walking
Token ID6155
Feature activation-1.19e-02

Pos logit contributions
ceed+0.032
V+0.029
 biggest+0.029
 quickest+0.029
Three+0.029
Neg logit contributions
ies-0.017
 throats-0.015
anged-0.015
atted-0.014
i-0.014
Token by
Token ID416
Feature activation-2.17e-02

Pos logit contributions
Whe+0.018
 middle+0.018
ceed+0.018
Three+0.018
 biggest+0.018
Neg logit contributions
 throats-0.017
atted-0.017
 tongues-0.016
 coughing-0.016
ies-0.016
Token the
Token ID262
Feature activation-1.39e-01

Pos logit contributions
ceed+0.044
Whe+0.040
V+0.039
Three+0.039
hest+0.039
Neg logit contributions
ies-0.026
 coughing-0.023
atted-0.023
 Tim-0.023
anged-0.022
Token park
Token ID3952
Feature activation+1.29e-02

Pos logit contributions
ceed+0.193
Whe+0.165
hest+0.160
lor+0.159
V+0.159
Neg logit contributions
ies-0.256
 coughing-0.240
 Tim-0.236
 they-0.229
 throats-0.226
Token and
Token ID290
Feature activation+2.41e-01

Pos logit contributions
ies+0.014
atted+0.013
 throats+0.013
dd+0.013
 coughing+0.012
Neg logit contributions
ceed-0.025
 quickest-0.024
Three-0.023
 Believe-0.023
 center-0.023
Token she
Token ID673
Feature activation+7.57e-02

Pos logit contributions
ies+0.298
atted+0.280
 throats+0.279
 coughing+0.261
 tongues+0.261
Neg logit contributions
ceed-0.430
Whe-0.423
Ready-0.419
V-0.412
 Ready-0.412
Token spotted
Token ID13489
Feature activation-1.04e-01

Pos logit contributions
ies+0.111
ries+0.105
 throats+0.103
atted+0.101
 accidents+0.098
Neg logit contributions
ceed-0.116
V-0.115
Three-0.112
Whe-0.110
Ready-0.107
Token a
Token ID257
Feature activation-7.45e-02

Pos logit contributions
ceed+0.161
Whe+0.143
Three+0.141
V+0.140
hest+0.138
Neg logit contributions
ies-0.170
 coughing-0.158
 Tim-0.157
atted-0.154
 they-0.153
Token quiet
Token ID5897
Feature activation+1.13e-02

Pos logit contributions
ceed+0.113
V+0.101
Whe+0.101
Three+0.100
redible+0.100
Neg logit contributions
ies-0.121
atted-0.117
anged-0.115
 coughing-0.111
 throats-0.111
Token rag
Token ID34232
Feature activation+4.32e-02

Pos logit contributions
atted+0.018
anged+0.018
ies+0.018
 coughing+0.017
 throats+0.017
Neg logit contributions
ceed-0.016
 date-0.015
Whe-0.014
 Animal-0.014
 exact-0.014
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
ceed+0.005
Whe+0.005
V+0.005
Three+0.005
hest+0.005
Neg logit contributions
ies-0.004
 coughing-0.003
atted-0.003
 Tim-0.003
anged-0.003
Token living
Token ID2877
Feature activation-1.57e-01

Pos logit contributions
ceed+0.289
Whe+0.264
hest+0.251
lor+0.247
V+0.247
Neg logit contributions
ies-0.227
 coughing-0.221
 they-0.218
 Tim-0.212
 throats-0.205
Token room
Token ID2119
Feature activation-1.51e-02

Pos logit contributions
ceed+0.330
Whe+0.301
hest+0.299
V+0.294
Three+0.287
Neg logit contributions
ies-0.196
 coughing-0.175
 and-0.172
 throats-0.163
 they-0.160
Token with
Token ID351
Feature activation-5.32e-02

Pos logit contributions
ceed+0.029
Three+0.027
V+0.026
Whe+0.026
hest+0.026
Neg logit contributions
ies-0.018
atted-0.016
 coughing-0.016
anged-0.015
i-0.015
Token Mom
Token ID11254
Feature activation+1.38e-01

Pos logit contributions
ceed+0.115
V+0.107
Whe+0.107
 biggest+0.105
Three+0.104
Neg logit contributions
ies-0.054
atted-0.046
 coughing-0.045
anged-0.045
ho-0.041
Token and
Token ID290
Feature activation+2.24e-01

Pos logit contributions
dd+0.115
anged+0.114
 throats+0.114
atted+0.112
ies+0.108
Neg logit contributions
ceed-0.280
 date-0.272
 Animal-0.272
 exact-0.270
V-0.270
Token Dad
Token ID17415
Feature activation+1.60e-01

Pos logit contributions
ies+0.277
 badly+0.253
atted+0.241
 coughing+0.224
anged+0.223
Neg logit contributions
Whe-0.414
ceed-0.412
 Ready-0.406
Three-0.398
 Three-0.397
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
ies+0.196
atted+0.184
 throats+0.180
 coughing+0.179
anged+0.174
Neg logit contributions
ceed-0.294
V-0.266
 Believe-0.262
Whe-0.262
Three-0.262
Token They
Token ID1119
Feature activation+4.46e-02

Pos logit contributions
ies+0.022
i+0.020
atted+0.019
ie+0.018
 coughing+0.018
Neg logit contributions
Whe-0.021
ceed-0.021
V-0.020
Three-0.020
 Ready-0.020
Token pick
Token ID2298
Feature activation-6.29e-02

Pos logit contributions
ies+0.074
 coughing+0.068
 they+0.067
 Tim+0.067
i+0.066
Neg logit contributions
ceed-0.069
Whe-0.061
lor-0.060
V-0.060
Three-0.059
Token some
Token ID617
Feature activation-1.07e-01

Pos logit contributions
ceed+0.110
V+0.099
Whe+0.099
Three+0.097
hest+0.095
Neg logit contributions
ies-0.091
 coughing-0.084
atted-0.081
anged-0.081
 throats-0.080
Token She
Token ID1375
Feature activation+4.71e-02

Pos logit contributions
ceed+0.005
Whe+0.005
redible+0.005
V+0.005
hest+0.005
Neg logit contributions
ies-0.003
 Tim-0.003
 coughing-0.003
 and-0.003
 they-0.003
Token saw
Token ID2497
Feature activation-1.15e-01

Pos logit contributions
ies+0.055
 coughing+0.049
atted+0.048
 throats+0.047
anged+0.047
Neg logit contributions
ceed-0.094
V-0.086
Three-0.086
Whe-0.085
 quickest-0.083
Token all
Token ID477
Feature activation+3.37e-02

Pos logit contributions
ceed+0.185
Whe+0.164
V+0.162
Three+0.160
 quickest+0.156
Neg logit contributions
ies-0.184
 coughing-0.169
atted-0.165
anged-0.164
 throats-0.163
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
ies+0.051
 throats+0.048
 coughing+0.048
anged+0.048
atted+0.046
Neg logit contributions
ceed-0.054
Whe-0.049
 exact-0.047
 Believe-0.047
hest-0.047
Token animals
Token ID4695
Feature activation+2.75e-02

Pos logit contributions
ceed+0.171
V+0.149
Whe+0.149
Three+0.147
lor+0.144
Neg logit contributions
ies-0.170
 coughing-0.151
i-0.146
atted-0.145
 they-0.143
Token and
Token ID290
Feature activation+2.52e-01

Pos logit contributions
 throats+0.029
ies+0.029
anged+0.028
bie+0.027
atted+0.027
Neg logit contributions
ceed-0.051
 Believe-0.048
 quickest-0.048
hest-0.048
 Animal-0.048
Token creatures
Token ID8109
Feature activation+2.66e-02

Pos logit contributions
ies+0.362
anged+0.350
atted+0.350
 coughing+0.347
 throats+0.339
Neg logit contributions
ceed-0.399
Whe-0.361
Three-0.358
 biggest-0.358
 quickest-0.355
Token below
Token ID2174
Feature activation-5.67e-03

Pos logit contributions
ies+0.031
 throats+0.029
anged+0.029
 coughing+0.028
bie+0.028
Neg logit contributions
ceed-0.050
hest-0.045
 Believe-0.045
 quickest-0.045
 Animal-0.044
Token.
Token ID13
Feature activation+5.41e-03

Pos logit contributions
ceed+0.011
Whe+0.010
 date+0.010
 Believe+0.010
V+0.010
Neg logit contributions
ies-0.007
 coughing-0.006
atted-0.006
anged-0.006
 throats-0.006
Token She
Token ID1375
Feature activation+5.00e-02

Pos logit contributions
ies+0.006
 they+0.006
 Tim+0.006
 and+0.006
 because+0.006
Neg logit contributions
ceed-0.011
Whe-0.010
redible-0.010
hest-0.010
V-0.010
Token spread
Token ID4104
Feature activation-7.23e-02

Pos logit contributions
ies+0.056
 coughing+0.051
 throats+0.049
atted+0.049
anged+0.049
Neg logit contributions
ceed-0.101
V-0.092
Three-0.092
Whe-0.091
 Believe-0.089
Token happy
Token ID3772
Feature activation+4.24e-02

Pos logit contributions
ies+0.136
i+0.120
 throats+0.119
ries+0.117
anged+0.114
Neg logit contributions
ceed-0.186
 biggest-0.173
Three-0.171
 fastest-0.169
 quickest-0.169
Token.
Token ID13
Feature activation-4.75e-03

Pos logit contributions
bie+0.047
anged+0.046
 throats+0.045
atted+0.045
 tongues+0.044
Neg logit contributions
ceed-0.070
 Believe-0.068
 center-0.066
 centre-0.066
 most-0.066
Token He
Token ID679
Feature activation+4.82e-02

Pos logit contributions
ceed+0.010
Whe+0.009
V+0.009
Three+0.009
hest+0.009
Neg logit contributions
ies-0.005
 Tim-0.005
 coughing-0.005
atted-0.005
 they-0.004
Token was
Token ID373
Feature activation-7.45e-03

Pos logit contributions
ies+0.059
 coughing+0.054
 throats+0.052
ries+0.052
anged+0.052
Neg logit contributions
ceed-0.090
V-0.082
Three-0.082
Insp-0.080
 Animal-0.079
Token so
Token ID523
Feature activation+1.09e-01

Pos logit contributions
ceed+0.014
 quickest+0.012
Whe+0.012
 biggest+0.012
 exact+0.012
Neg logit contributions
ies-0.010
 throats-0.009
anged-0.009
atted-0.008
 coughing-0.008
Token excited
Token ID6568
Feature activation+6.76e-02

Pos logit contributions
ies+0.143
 throats+0.128
anged+0.128
ries+0.121
 coughing+0.121
Neg logit contributions
ceed-0.207
 biggest-0.184
Whe-0.183
 quickest-0.183
Three-0.182
Token that
Token ID326
Feature activation-6.84e-03

Pos logit contributions
 throats+0.094
bie+0.089
 coughing+0.088
 tongues+0.088
anged+0.087
Neg logit contributions
ceed-0.103
 exact-0.101
 Believe-0.098
 biggest-0.098
 Animal-0.097
Token he
Token ID339
Feature activation+7.06e-02

Pos logit contributions
ceed+0.017
Whe+0.016
 Believe+0.016
 quickest+0.016
 biggest+0.016
Neg logit contributions
ies-0.005
atted-0.004
anged-0.004
 throats-0.004
 coughing-0.004
Token started
Token ID2067
Feature activation-4.77e-02

Pos logit contributions
ies+0.076
 coughing+0.071
anged+0.070
 throats+0.070
atted+0.068
Neg logit contributions
ceed-0.144
V-0.130
 quickest-0.129
Three-0.128
Whe-0.128
Token to
Token ID284
Feature activation+1.71e-02

Pos logit contributions
ceed+0.092
 Believe+0.085
 basics+0.085
Insp+0.083
 date+0.083
Neg logit contributions
ies-0.051
anged-0.044
i-0.043
dd-0.043
atted-0.041
Token tw
Token ID665
Feature activation-4.57e-02

Pos logit contributions
ies+0.028
atted+0.025
 coughing+0.025
anged+0.025
 throats+0.024
Neg logit contributions
ceed-0.027
Whe-0.024
V-0.024
Three-0.023
 quickest-0.023
Token dragon
Token ID10441
Feature activation+6.27e-02

Pos logit contributions
Whe+0.214
ceed+0.214
V+0.204
redible+0.202
Three+0.201
Neg logit contributions
ies-0.148
 coughing-0.132
 they-0.131
 he-0.129
 and-0.128
Token was
Token ID373
Feature activation+4.65e-03

Pos logit contributions
 throats+0.103
anged+0.099
ies+0.098
bie+0.096
 Tommy+0.096
Neg logit contributions
ceed-0.089
 Guardian-0.074
 Animal-0.074
 Museum-0.073
 grant-0.073
Token very
Token ID845
Feature activation+1.43e-01

Pos logit contributions
ies+0.007
anged+0.006
atted+0.006
 throats+0.006
i+0.006
Neg logit contributions
ceed-0.009
V-0.008
 biggest-0.008
 quickest-0.007
 Animal-0.007
Token scary
Token ID14343
Feature activation+5.53e-02

Pos logit contributions
anged+0.266
 throats+0.261
atted+0.256
ries+0.251
ies+0.251
Neg logit contributions
ceed-0.185
 biggest-0.162
V-0.162
 fastest-0.158
Insp-0.154
Token and
Token ID290
Feature activation+2.43e-01

Pos logit contributions
bie+0.087
 throats+0.081
atted+0.081
 tongues+0.078
ies+0.077
Neg logit contributions
ceed-0.073
 Believe-0.072
 Animal-0.071
 quickest-0.069
hest-0.069
Token Tim
Token ID5045
Feature activation+1.04e-01

Pos logit contributions
ies+0.297
anged+0.293
atted+0.282
 throats+0.264
bie+0.261
Neg logit contributions
ceed-0.466
V-0.423
 fastest-0.412
 quickest-0.411
 biggest-0.400
Tokenmy
Token ID1820
Feature activation+1.46e-01

Pos logit contributions
ies+0.124
 throats+0.122
 they+0.121
 coughing+0.120
atted+0.118
Neg logit contributions
ceed-0.195
V-0.178
 quickest-0.172
Whe-0.171
Three-0.169
Token didn
Token ID1422
Feature activation-1.81e-02

Pos logit contributions
ries+0.216
 throats+0.210
 Joey+0.208
ies+0.206
atted+0.205
Neg logit contributions
ceed-0.216
V-0.197
 quickest-0.189
 Animal-0.188
 fastest-0.187
Token't
Token ID470
Feature activation+4.78e-02

Pos logit contributions
ceed+0.028
Whe+0.026
Three+0.025
 quickest+0.025
lor+0.025
Neg logit contributions
ies-0.027
 Tim-0.027
 coughing-0.027
 they-0.026
 badly-0.025
Token know
Token ID760
Feature activation-3.00e-02

Pos logit contributions
ies+0.067
 coughing+0.059
 Tim+0.058
atted+0.057
 they+0.056
Neg logit contributions
ceed-0.088
Whe-0.079
V-0.078
Three-0.078
lor-0.077
Token what
Token ID644
Feature activation-6.06e-02

Pos logit contributions
ceed+0.051
redible+0.045
 Ready+0.045
Whe+0.045
 biggest+0.045
Neg logit contributions
ies-0.041
 throats-0.039
anged-0.039
 coughing-0.038
atted-0.038
Token and
Token ID290
Feature activation+2.45e-01

Pos logit contributions
ceed+0.048
 Believe+0.043
 quickest+0.043
hest+0.043
lor+0.042
Neg logit contributions
ies-0.027
 coughing-0.024
atted-0.024
 throats-0.024
anged-0.023
Token found
Token ID1043
Feature activation-1.00e-01

Pos logit contributions
ies+0.318
 coughing+0.291
atted+0.284
anged+0.276
 throats+0.274
Neg logit contributions
ceed-0.466
Whe-0.415
V-0.410
Three-0.408
 quickest-0.406
Token lots
Token ID6041
Feature activation-3.16e-02

Pos logit contributions
ceed+0.198
Whe+0.185
Three+0.183
V+0.181
redible+0.175
Neg logit contributions
ies-0.123
 Tim-0.115
 they-0.112
atted-0.108
 Joey-0.105
Token of
Token ID286
Feature activation-1.50e-02

Pos logit contributions
hest+0.046
ceed+0.044
 biggest+0.040
Three+0.040
 Believe+0.040
Neg logit contributions
bie-0.051
 throats-0.051
 coughing-0.051
ies-0.051
anged-0.048
Token other
Token ID584
Feature activation-3.09e-02

Pos logit contributions
ceed+0.024
Whe+0.022
Three+0.022
V+0.022
hest+0.022
Neg logit contributions
ies-0.023
 coughing-0.021
anged-0.021
atted-0.021
 throats-0.021
Token fun
Token ID1257
Feature activation+6.45e-02

Pos logit contributions
ceed+0.054
Whe+0.049
Three+0.049
 quickest+0.049
V+0.048
Neg logit contributions
ies-0.043
 coughing-0.040
atted-0.039
 throats-0.038
anged-0.038
Token games
Token ID1830
Feature activation-9.90e-03

Pos logit contributions
 coughing+0.100
 throats+0.099
bie+0.097
anged+0.095
atted+0.095
Neg logit contributions
ceed-0.095
hest-0.088
 exact-0.087
 biggest-0.085
 quickest-0.084
Token to
Token ID284
Feature activation+1.56e-02

Pos logit contributions
ceed+0.019
hest+0.017
 quickest+0.017
Whe+0.017
lor+0.017
Neg logit contributions
ies-0.012
 coughing-0.011
atted-0.011
 throats-0.011
anged-0.011
Token play
Token ID711
Feature activation-9.29e-02

Pos logit contributions
 Tim+0.020
 they+0.017
 and+0.017
ies+0.017
 Joey+0.016
Neg logit contributions
ceed-0.032
lor-0.030
Whe-0.029
V-0.028
 quickest-0.028
Token.
Token ID13
Feature activation+9.80e-04

Pos logit contributions
ceed+0.236
Whe+0.224
Three+0.223
V+0.219
lor+0.218
Neg logit contributions
 and-0.066
 Tim-0.061
ies-0.059
 they-0.058
 because-0.057
Token Eventually
Token ID16178
Feature activation-2.27e-02

Pos logit contributions
ies+0.001
 Tim+0.001
 they+0.001
 coughing+0.001
atted+0.001
Neg logit contributions
ceed-0.002
Whe-0.002
V-0.002
Three-0.002
redible-0.002
Token best
Token ID1266
Feature activation+6.45e-02

Pos logit contributions
ceed+0.104
V+0.085
Three+0.083
Whe+0.081
lor+0.080
Neg logit contributions
ies-0.218
 coughing-0.204
 Tim-0.200
 they-0.199
i-0.199
Token food
Token ID2057
Feature activation+3.13e-02

Pos logit contributions
anged+0.110
lee+0.109
atted+0.109
ies+0.107
 coughing+0.104
Neg logit contributions
 date-0.089
Whe-0.086
 biggest-0.086
ceed-0.086
 exact-0.082
Token she
Token ID673
Feature activation+8.30e-02

Pos logit contributions
atted+0.032
anged+0.032
ies+0.032
 coughing+0.031
 throats+0.029
Neg logit contributions
ceed-0.064
hest-0.061
 quickest-0.060
 date-0.059
 biggest-0.059
Token ever
Token ID1683
Feature activation+1.42e-01

Pos logit contributions
ies+0.082
 coughing+0.072
atted+0.071
anged+0.069
 throats+0.068
Neg logit contributions
ceed-0.184
Whe-0.169
Three-0.166
V-0.166
 quickest-0.166
Token tasted
Token ID29187
Feature activation+1.01e-02

Pos logit contributions
ies+0.164
 coughing+0.156
atted+0.153
 throats+0.152
anged+0.144
Neg logit contributions
ceed-0.271
Three-0.247
Whe-0.247
hest-0.246
 quickest-0.243
Token!
Token ID0
Feature activation+1.26e-02

Pos logit contributions
anged+0.014
ies+0.013
atted+0.013
 throats+0.013
 coughing+0.013
Neg logit contributions
ceed-0.018
hest-0.017
 quickest-0.017
 exact-0.016
 biggest-0.016
Token She
Token ID1375
Feature activation+4.86e-02

Pos logit contributions
 Tim+0.018
 Tommy+0.017
 Billy+0.016
 they+0.016
 Johnny+0.016
Neg logit contributions
ceed-0.023
 quickest-0.022
redible-0.022
 fastest-0.021
hest-0.021
Token thought
Token ID1807
Feature activation+4.16e-03

Pos logit contributions
ies+0.072
anged+0.068
 coughing+0.068
 throats+0.064
atted+0.063
Neg logit contributions
ceed-0.079
 nic-0.071
Whe-0.069
hest-0.069
Three-0.068
Token she
Token ID673
Feature activation+5.86e-02

Pos logit contributions
ies+0.005
anged+0.005
 coughing+0.005
 throats+0.005
bie+0.004
Neg logit contributions
ceed-0.008
 biggest-0.007
hest-0.007
 exact-0.007
 Anything-0.007
Token would
Token ID561
Feature activation+1.72e-02

Pos logit contributions
ies+0.077
 coughing+0.070
atted+0.069
anged+0.067
 throats+0.067
Neg logit contributions
ceed-0.111
Whe-0.100
V-0.099
Three-0.099
 quickest-0.098
Token go
Token ID467
Feature activation-1.55e-02

Pos logit contributions
ies+0.024
 coughing+0.022
 Tim+0.022
 they+0.022
 because+0.022
Neg logit contributions
ceed-0.031
Whe-0.028
V-0.027
Three-0.027
redible-0.026
Token\n
Token ID198
Feature activation-1.19e-01

Pos logit contributions
ceed+0.030
Whe+0.026
V+0.026
Three+0.026
 quickest+0.025
Neg logit contributions
ies-0.042
 coughing-0.039
atted-0.038
 Tim-0.038
anged-0.038
Token\n
Token ID198
Feature activation-1.23e-01

Pos logit contributions
ceed+0.230
 quickest+0.222
 nic+0.210
 exact+0.209
Whe+0.208
Neg logit contributions
 Tim-0.165
 they-0.155
 and-0.151
ies-0.146
 Tommy-0.144
TokenTim
Token ID14967
Feature activation+8.58e-02

Pos logit contributions
ceed+0.227
 quickest+0.201
Whe+0.200
lor+0.199
 biggest+0.199
Neg logit contributions
ies-0.169
 Tim-0.160
 they-0.157
 coughing-0.152
atted-0.145
Token and
Token ID290
Feature activation+2.51e-01

Pos logit contributions
ies+0.050
 coughing+0.041
 throats+0.039
anged+0.038
 badly+0.037
Neg logit contributions
ceed-0.220
V-0.208
Whe-0.206
 Believe-0.206
Three-0.204
Token Anna
Token ID11735
Feature activation+6.73e-02

Pos logit contributions
ies+0.304
 coughing+0.276
 Tim+0.273
 they+0.267
bie+0.254
Neg logit contributions
ceed-0.513
hest-0.472
lor-0.465
Whe-0.462
redible-0.459
Token look
Token ID804
Feature activation+3.99e-02

Pos logit contributions
ies+0.097
 and+0.095
 they+0.087
i+0.086
 coughing+0.086
Neg logit contributions
ceed-0.121
lor-0.113
Whe-0.107
V-0.105
hest-0.104
Token at
Token ID379
Feature activation-9.01e-03

Pos logit contributions
ies+0.031
 coughing+0.028
 throats+0.027
anged+0.026
ho+0.025
Neg logit contributions
ceed-0.092
V-0.086
Whe-0.085
 biggest-0.084
 Believe-0.084
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
ceed+0.018
Whe+0.017
V+0.017
Three+0.017
 Believe+0.016
Neg logit contributions
ies-0.010
 coughing-0.009
atted-0.009
anged-0.009
 throats-0.008
Token box
Token ID3091
Feature activation-6.16e-02

Pos logit contributions
Whe+0.262
lor+0.259
ceed+0.256
Three+0.239
Skip+0.232
Neg logit contributions
 and-0.170
ies-0.161
 Tim-0.157
 crying-0.154
 Billy-0.150
Token.
Token ID13
Feature activation+2.65e-02

Pos logit contributions
ceed+0.114
 exact+0.103
 quickest+0.103
hest+0.103
 biggest+0.101
Neg logit contributions
ies-0.075
 coughing-0.070
atted-0.068
bie-0.068
anged-0.068
Token They
Token ID1119
Feature activation+5.08e-02

Pos logit contributions
ies+0.026
 Tim+0.026
 they+0.025
 coughing+0.024
 Tommy+0.024
Neg logit contributions
ceed-0.057
hest-0.052
Whe-0.052
 quickest-0.051
lor-0.051
Token saw
Token ID2497
Feature activation-1.03e-01

Pos logit contributions
ies+0.374
atted+0.359
 coughing+0.352
 throats+0.346
anged+0.337
Neg logit contributions
ceed-0.440
Three-0.401
Whe-0.399
Ready-0.398
V-0.397
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
ceed+0.181
Whe+0.163
V+0.161
Three+0.160
hest+0.156
Neg logit contributions
ies-0.151
 coughing-0.137
atted-0.134
anged-0.131
 throats-0.130
Token bird
Token ID6512
Feature activation+1.59e-02

Pos logit contributions
ceed+0.224
Whe+0.201
V+0.199
Three+0.198
hest+0.193
Neg logit contributions
ies-0.190
 coughing-0.173
atted-0.169
anged-0.165
 Tim-0.165
Token again
Token ID757
Feature activation+4.44e-02

Pos logit contributions
 throats+0.021
ies+0.021
anged+0.021
atted+0.021
bie+0.020
Neg logit contributions
ceed-0.026
 exact-0.023
hest-0.023
 quickest-0.023
 biggest-0.023
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
ies+0.051
 coughing+0.046
atted+0.045
anged+0.044
 throats+0.044
Neg logit contributions
ceed-0.090
Whe-0.082
Three-0.081
V-0.081
 quickest-0.080
Token The
Token ID383
Feature activation-1.12e-01

Pos logit contributions
ies+0.028
i+0.025
 coughing+0.024
bie+0.024
atted+0.024
Neg logit contributions
ceed-0.025
Three-0.022
V-0.022
Whe-0.022
 Ready-0.021
Token bird
Token ID6512
Feature activation+3.64e-02

Pos logit contributions
ceed+0.205
Whe+0.189
V+0.184
Three+0.182
hest+0.182
Neg logit contributions
ies-0.160
 coughing-0.143
 Tim-0.138
atted-0.137
 they-0.136
Token had
Token ID550
Feature activation-3.60e-02

Pos logit contributions
 throats+0.053
anged+0.051
ies+0.050
 coughing+0.050
bie+0.049
Neg logit contributions
ceed-0.055
Insp-0.049
 Animal-0.049
V-0.049
Three-0.049
Token a
Token ID257
Feature activation-4.89e-02

Pos logit contributions
ceed+0.058
Whe+0.052
V+0.051
Three+0.051
hest+0.050
Neg logit contributions
ies-0.058
 coughing-0.053
atted-0.052
 Tim-0.051
 throats-0.051
Token little
Token ID1310
Feature activation+7.72e-02

Pos logit contributions
ceed+0.076
Whe+0.067
V+0.067
Three+0.067
 quickest+0.065
Neg logit contributions
ies-0.080
 coughing-0.074
atted-0.073
anged-0.072
 throats-0.071
Token envelope
Token ID22878
Feature activation+8.62e-03

Pos logit contributions
ies+0.129
i+0.125
atted+0.125
lee+0.123
dd+0.122
Neg logit contributions
ceed-0.100
 biggest-0.099
 exact-0.095
 date-0.093
 quickest-0.092
Token heavy
Token ID4334
Feature activation+5.79e-02

Pos logit contributions
ies+0.201
atted+0.197
 throats+0.196
anged+0.195
 tongues+0.182
Neg logit contributions
ceed-0.196
V-0.176
Three-0.171
Insp-0.170
 quickest-0.169
Token for
Token ID329
Feature activation-7.98e-02

Pos logit contributions
 throats+0.087
atted+0.083
bie+0.083
ies+0.082
 tongues+0.082
Neg logit contributions
ceed-0.086
 exact-0.082
hest-0.082
 quickest-0.080
 Believe-0.078
Token her
Token ID607
Feature activation-3.55e-02

Pos logit contributions
ceed+0.184
Whe+0.170
V+0.169
Three+0.168
redible+0.165
Neg logit contributions
ies-0.071
 coughing-0.060
atted-0.060
anged-0.057
 throats-0.055
Token.
Token ID13
Feature activation-3.24e-03

Pos logit contributions
ceed+0.067
 biggest+0.066
 quickest+0.065
 exact+0.063
 date+0.063
Neg logit contributions
ies-0.039
atted-0.036
dd-0.035
 coughing-0.034
ries-0.033
Token\n
Token ID198
Feature activation-1.38e-01

Pos logit contributions
ceed+0.006
redible+0.006
hest+0.006
 quickest+0.006
 fastest+0.006
Neg logit contributions
 Tim-0.004
 coughing-0.004
ies-0.004
 they-0.004
bie-0.003
Token\n
Token ID198
Feature activation-1.41e-01

Pos logit contributions
ceed+0.244
 quickest+0.226
Whe+0.226
 exact+0.218
 fastest+0.216
Neg logit contributions
 Tim-0.189
ies-0.187
 they-0.187
 and-0.180
 coughing-0.178
TokenJust
Token ID5703
Feature activation-1.45e-02

Pos logit contributions
ceed+0.221
Whe+0.205
 biggest+0.196
 Zoo+0.195
 quickest+0.195
Neg logit contributions
ies-0.209
 coughing-0.203
 they-0.199
 quickly-0.197
 and-0.192
Token then
Token ID788
Feature activation+9.26e-02

Pos logit contributions
ceed+0.027
Whe+0.026
Three+0.025
 quickest+0.025
 exact+0.025
Neg logit contributions
ies-0.017
 coughing-0.016
atted-0.016
 throats-0.015
anged-0.015
Token,
Token ID11
Feature activation+1.46e-01

Pos logit contributions
ies+0.092
atted+0.085
 throats+0.083
 coughing+0.083
anged+0.082
Neg logit contributions
ceed-0.189
Three-0.175
 Believe-0.175
V-0.174
Whe-0.174
Token her
Token ID607
Feature activation-7.09e-03

Pos logit contributions
ies+0.183
 coughing+0.163
 they+0.160
 Tim+0.157
atted+0.157
Neg logit contributions
ceed-0.286
Whe-0.261
V-0.259
Three-0.256
hest-0.254
Token big
Token ID1263
Feature activation+1.96e-03

Pos logit contributions
ceed+0.014
Whe+0.013
V+0.013
Three+0.012
 quickest+0.012
Neg logit contributions
ies-0.009
 coughing-0.008
atted-0.008
anged-0.008
 Tim-0.007
Token she
Token ID673
Feature activation+2.92e-02

Pos logit contributions
ceed+0.038
Whe+0.036
Three+0.035
V+0.035
 quickest+0.035
Neg logit contributions
ies-0.013
 coughing-0.011
atted-0.010
 throats-0.010
anged-0.010
Token looked
Token ID3114
Feature activation+6.89e-03

Pos logit contributions
ies+0.036
 coughing+0.032
atted+0.031
 throats+0.031
anged+0.030
Neg logit contributions
ceed-0.057
Whe-0.052
V-0.052
Three-0.052
 quickest-0.051
Token around
Token ID1088
Feature activation+3.17e-03

Pos logit contributions
ies+0.009
anged+0.009
 throats+0.009
 coughing+0.009
olph+0.008
Neg logit contributions
ceed-0.012
 biggest-0.011
 nic-0.011
 exact-0.011
Whe-0.011
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
ies+0.004
anged+0.003
atted+0.003
 coughing+0.003
 throats+0.003
Neg logit contributions
ceed-0.006
 date-0.006
 Believe-0.006
 exact-0.006
 center-0.006
Token saw
Token ID2497
Feature activation-1.21e-01

Pos logit contributions
ies+0.294
 coughing+0.261
atted+0.258
 throats+0.252
anged+0.249
Neg logit contributions
ceed-0.439
Whe-0.412
V-0.404
Three-0.402
Ready-0.393
Token a
Token ID257
Feature activation-7.73e-02

Pos logit contributions
ceed+0.185
Whe+0.164
V+0.161
Three+0.160
hest+0.155
Neg logit contributions
ies-0.205
 coughing-0.188
atted-0.185
anged-0.183
 throats-0.181
Token young
Token ID1862
Feature activation+5.11e-02

Pos logit contributions
ceed+0.121
Whe+0.108
V+0.107
Three+0.107
 quickest+0.103
Neg logit contributions
ies-0.124
atted-0.115
 coughing-0.114
anged-0.113
 throats-0.110
Token boy
Token ID2933
Feature activation+4.73e-02

Pos logit contributions
ies+0.084
anged+0.083
atted+0.083
 throats+0.080
 coughing+0.079
Neg logit contributions
ceed-0.070
Whe-0.064
Three-0.063
 quickest-0.063
 exact-0.063
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
ies+0.064
 throats+0.062
 tongues+0.057
atted+0.056
bie+0.054
Neg logit contributions
ceed-0.075
 Ready-0.072
 Believe-0.071
 exact-0.070
 date-0.070
Token He
Token ID679
Feature activation+5.40e-02

Pos logit contributions
ies+0.017
 coughing+0.016
atted+0.016
 Tim+0.016
anged+0.015
Neg logit contributions
ceed-0.030
Whe-0.027
V-0.026
Three-0.026
hest-0.026
Token was
Token ID373
Feature activation-6.89e-03

Pos logit contributions
ies+0.074
 throats+0.067
anged+0.066
 coughing+0.064
ries+0.064
Neg logit contributions
ceed-0.094
V-0.089
Three-0.086
 Believe-0.084
Insp-0.083
Token ball
Token ID2613
Feature activation-3.27e-02

Pos logit contributions
ceed+0.146
Three+0.130
Whe+0.129
lor+0.129
V+0.129
Neg logit contributions
ies-0.111
 coughing-0.098
 Tim-0.096
 they-0.094
atted-0.094
Token again
Token ID757
Feature activation+2.35e-02

Pos logit contributions
ceed+0.064
Whe+0.058
V+0.058
Three+0.058
 quickest+0.057
Neg logit contributions
ies-0.040
 coughing-0.036
atted-0.035
anged-0.034
 throats-0.034
Token."
Token ID526
Feature activation-2.49e-02

Pos logit contributions
ies+0.029
 and+0.026
 they+0.025
 coughing+0.025
 because+0.025
Neg logit contributions
ceed-0.049
Whe-0.045
V-0.044
Three-0.044
redible-0.043
Token\n
Token ID198
Feature activation-1.19e-01

Pos logit contributions
ceed+0.042
 quickest+0.037
redible+0.037
Whe+0.036
hest+0.036
Neg logit contributions
 Tim-0.039
ies-0.038
 they-0.038
 and-0.036
 he-0.036
Token\n
Token ID198
Feature activation-1.23e-01

Pos logit contributions
 quickest+0.242
ceed+0.242
 exact+0.226
Whe+0.224
 nic+0.223
Neg logit contributions
 Tim-0.152
 and-0.148
 they-0.146
ies-0.145
 he-0.134
TokenL
Token ID43
Feature activation-2.91e-02

Pos logit contributions
ceed+0.227
 quickest+0.203
 nic+0.203
Whe+0.201
 biggest+0.200
Neg logit contributions
ies-0.173
 they-0.168
 and-0.164
 Tim-0.161
 he-0.157
Tokenila
Token ID10102
Feature activation+4.27e-02

Pos logit contributions
ceed+0.060
 nic+0.056
 quickest+0.056
 Anything+0.056
 biggest+0.055
Neg logit contributions
ies-0.035
atted-0.030
 and-0.030
i-0.029
 Tim-0.028
Token and
Token ID290
Feature activation+2.59e-01

Pos logit contributions
ies+0.023
 coughing+0.018
 throats+0.016
anged+0.016
atted+0.015
Neg logit contributions
ceed-0.114
Whe-0.106
V-0.106
Three-0.105
 Believe-0.103
Token Ben
Token ID3932
Feature activation+8.60e-02

Pos logit contributions
ies+0.216
 Tim+0.201
i+0.191
 Tommy+0.186
 they+0.182
Neg logit contributions
ceed-0.633
hest-0.599
lor-0.580
Three-0.573
 quickest-0.571
Token hugged
Token ID41130
Feature activation+3.25e-02

Pos logit contributions
ies+0.075
atted+0.061
 coughing+0.061
 they+0.060
 Tim+0.060
Neg logit contributions
ceed-0.203
Whe-0.188
V-0.186
Three-0.185
lor-0.184
Token Sam
Token ID3409
Feature activation+6.04e-02

Pos logit contributions
ies+0.040
 coughing+0.040
atted+0.039
 burnt+0.038
bie+0.036
Neg logit contributions
V-0.057
Whe-0.056
ceed-0.055
 Believe-0.053
 Animal-0.052
Token no
Token ID645
Feature activation-4.12e-03

Pos logit contributions
anged+0.057
atted+0.055
ho+0.050
dd+0.049
 coughing+0.049
Neg logit contributions
 Animal-0.065
 Museum-0.064
ceed-0.063
 Zoo-0.062
 biggest-0.061
Token more
Token ID517
Feature activation+4.47e-02

Pos logit contributions
Three+0.004
 quickest+0.004
 biggest+0.004
ceed+0.004
Whe+0.004
Neg logit contributions
atted-0.008
ies-0.008
 coughing-0.008
lee-0.008
 throats-0.008
Token.
Token ID13
Feature activation+3.96e-03

Pos logit contributions
bie+0.064
ho+0.064
 throats+0.061
ety+0.061
atted+0.060
Neg logit contributions
 quickest-0.058
 biggest-0.058
 fastest-0.058
 winner-0.057
 exact-0.056
Token They
Token ID1119
Feature activation+4.04e-02

Pos logit contributions
ies+0.005
 coughing+0.005
 Tim+0.005
atted+0.005
 they+0.005
Neg logit contributions
ceed-0.007
Whe-0.007
V-0.007
Three-0.007
hest-0.006
Token realized
Token ID6939
Feature activation+1.44e-02

Pos logit contributions
ies+0.051
 coughing+0.048
 throats+0.047
ries+0.047
atted+0.047
Neg logit contributions
ceed-0.073
 Museum-0.066
 nic-0.066
V-0.066
Three-0.065
Token that
Token ID326
Feature activation-1.37e-02

Pos logit contributions
anged+0.021
bie+0.021
ies+0.021
 tongues+0.020
 throats+0.020
Neg logit contributions
ceed-0.021
 Animal-0.021
 Believe-0.020
 Museum-0.019
lor-0.019
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
ceed+0.022
 Believe+0.022
 quickest+0.022
 smallest+0.022
 biggest+0.022
Neg logit contributions
atted-0.019
ies-0.019
anged-0.018
lee-0.017
bie-0.017
Token lightning
Token ID14357
Feature activation+1.96e-02

Pos logit contributions
ceed+0.193
 biggest+0.181
 quickest+0.178
 exact+0.176
 nic+0.175
Neg logit contributions
ies-0.180
atted-0.178
anged-0.173
 coughing-0.161
i-0.160
Token had
Token ID550
Feature activation-5.34e-02

Pos logit contributions
ies+0.031
bie+0.030
 throats+0.030
 coughing+0.029
anged+0.029
Neg logit contributions
ceed-0.029
 quickest-0.026
 fastest-0.026
hest-0.025
 Believe-0.025
Token come
Token ID1282
Feature activation-2.09e-02

Pos logit contributions
ceed+0.097
Whe+0.085
Three+0.085
 quickest+0.084
V+0.084
Neg logit contributions
ies-0.075
 coughing-0.067
atted-0.066
anged-0.065
 throats-0.064
Token to
Token ID284
Feature activation+4.64e-03

Pos logit contributions
ceed+0.039
 quickest+0.036
hest+0.036
redible+0.036
 fastest+0.035
Neg logit contributions
 throats-0.025
ies-0.024
atted-0.023
 coughing-0.023
 tongues-0.022
Token playing
Token ID2712
Feature activation-5.24e-02

Pos logit contributions
ies+0.066
bie+0.058
 throats+0.058
 coughing+0.057
i+0.056
Neg logit contributions
ceed-0.113
Whe-0.105
 quickest-0.104
hest-0.104
Three-0.103
Token together
Token ID1978
Feature activation-1.90e-03

Pos logit contributions
ceed+0.131
lor+0.124
Three+0.122
V+0.120
Whe+0.119
Neg logit contributions
 and-0.037
 Tim-0.037
ies-0.037
 they-0.035
 because-0.032
Token.
Token ID13
Feature activation-5.26e-03

Pos logit contributions
ceed+0.004
Whe+0.004
V+0.004
Three+0.004
hest+0.004
Neg logit contributions
ies-0.002
 coughing-0.002
atted-0.002
 Tim-0.002
anged-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.69e-01

Pos logit contributions
ceed+0.009
redible+0.008
Three+0.008
hest+0.008
 quickest+0.008
Neg logit contributions
 Tim-0.008
 they-0.007
ies-0.007
 Billy-0.007
 coughing-0.007
TokenOnce
Token ID7454
Feature activation-2.26e-02

Pos logit contributions
ceed+0.327
 quickest+0.300
 exact+0.291
Insp+0.290
hest+0.289
Neg logit contributions
 Tim-0.230
 and-0.227
 Tommy-0.209
 they-0.207
 John-0.194
Token upon
Token ID2402
Feature activation-1.76e-01

Pos logit contributions
ceed+0.037
V+0.033
Whe+0.033
Three+0.033
Insp+0.032
Neg logit contributions
ies-0.033
 coughing-0.032
 throats-0.031
atted-0.030
anged-0.030
Token a
Token ID257
Feature activation-8.03e-02

Pos logit contributions
ceed+0.298
 quickest+0.266
lor+0.264
Whe+0.262
Three+0.262
Neg logit contributions
ies-0.255
 Tim-0.245
 coughing-0.240
 they-0.239
 and-0.237
Token time
Token ID640
Feature activation+5.31e-02

Pos logit contributions
ceed+0.124
Whe+0.110
V+0.109
Three+0.108
hest+0.105
Neg logit contributions
ies-0.133
 coughing-0.122
atted-0.120
anged-0.118
 throats-0.117
Token,
Token ID11
Feature activation+1.53e-01

Pos logit contributions
ies+0.047
 throats+0.043
 coughing+0.042
atted+0.039
anged+0.039
Neg logit contributions
ceed-0.118
V-0.111
Whe-0.111
Three-0.110
 Believe-0.109
Token there
Token ID612
Feature activation-1.12e-01

Pos logit contributions
ies+0.161
 Tim+0.138
 coughing+0.137
 they+0.134
atted+0.133
Neg logit contributions
ceed-0.335
Whe-0.307
Three-0.301
V-0.301
lor-0.299
Token was
Token ID373
Feature activation-2.10e-02

Pos logit contributions
ceed+0.201
Whe+0.183
V+0.182
Three+0.180
 Believe+0.175
Neg logit contributions
ies-0.157
 coughing-0.143
atted-0.140
 throats-0.137
anged-0.136
Token rock
Token ID3881
Feature activation+1.89e-02

Pos logit contributions
 date+0.023
ceed+0.023
 biggest+0.022
 exact+0.022
 center+0.022
Neg logit contributions
atted-0.030
anged-0.029
dd-0.029
ies-0.029
 coughing-0.028
Token and
Token ID290
Feature activation+2.53e-01

Pos logit contributions
bie+0.020
ies+0.019
 throats+0.018
 Baby+0.018
lee+0.017
Neg logit contributions
ceed-0.037
hest-0.035
 quickest-0.034
 exact-0.034
 winner-0.034
Token pushed
Token ID7121
Feature activation-7.52e-02

Pos logit contributions
ies+0.374
 coughing+0.343
 throats+0.332
atted+0.331
i+0.323
Neg logit contributions
ceed-0.441
V-0.397
Three-0.396
Whe-0.388
Insp-0.382
Token it
Token ID340
Feature activation-3.30e-02

Pos logit contributions
ceed+0.135
V+0.130
 Ready+0.125
Whe+0.123
Ready+0.122
Neg logit contributions
ies-0.102
 coughing-0.098
 throats-0.094
atted-0.093
bie-0.088
Token to
Token ID284
Feature activation+7.50e-03

Pos logit contributions
ceed+0.066
V+0.060
 quickest+0.059
hest+0.059
 Ready+0.058
Neg logit contributions
ies-0.039
 coughing-0.035
atted-0.035
 throats-0.034
bie-0.034
Token the
Token ID262
Feature activation-1.55e-01

Pos logit contributions
 Tim+0.008
 they+0.007
ies+0.007
 and+0.007
 Tommy+0.007
Neg logit contributions
ceed-0.017
lor-0.016
Whe-0.015
Three-0.015
 quickest-0.015
Token oy
Token ID35104
Feature activation+5.78e-02

Pos logit contributions
ceed+0.270
Whe+0.240
V+0.240
Three+0.237
lor+0.235
Neg logit contributions
ies-0.230
 coughing-0.211
 Tim-0.204
 throats-0.200
atted-0.198
Tokenster
Token ID1706
Feature activation-2.12e-02

Pos logit contributions
ies+0.096
 coughing+0.091
atted+0.089
 Tim+0.088
anged+0.087
Neg logit contributions
ceed-0.086
Whe-0.076
Three-0.075
lor-0.074
V-0.074
Token.
Token ID13
Feature activation-6.24e-04

Pos logit contributions
ceed+0.044
Whe+0.039
V+0.039
Three+0.039
 quickest+0.039
Neg logit contributions
ies-0.024
 coughing-0.021
atted-0.021
anged-0.020
 throats-0.020
Token Tim
Token ID5045
Feature activation+8.34e-02

Pos logit contributions
ceed+0.001
Whe+0.001
V+0.001
Three+0.001
 quickest+0.001
Neg logit contributions
ies-0.001
 Tim-0.001
 coughing-0.001
atted-0.001
 they-0.001
Token said
Token ID531
Feature activation-4.09e-02

Pos logit contributions
ies+0.107
 throats+0.104
 coughing+0.100
 they+0.095
 Joey+0.094
Neg logit contributions
ceed-0.153
V-0.143
Three-0.138
 quickest-0.132
Insp-0.132
Token€
Token ID26391
Feature activation-6.28e-02

Pos logit contributions
 Joey+0.007
i+0.007
 Tim+0.007
 because+0.006
 Billy+0.006
Neg logit contributions
 biggest-0.005
ceed-0.004
V-0.004
Three-0.004
 date-0.004
Token\n
Token ID198
Feature activation-1.37e-01

Pos logit contributions
ceed+0.096
V+0.083
Whe+0.083
Three+0.083
hest+0.082
Neg logit contributions
ies-0.104
 Tim-0.096
 they-0.095
 coughing-0.095
atted-0.094
Token\n
Token ID198
Feature activation-1.38e-01

Pos logit contributions
 quickest+0.252
ceed+0.251
 fastest+0.237
 biggest+0.236
 nic+0.233
Neg logit contributions
 Tim-0.178
 Joey-0.170
 they-0.163
 Tommy-0.163
 Johnny-0.163
TokenHe
Token ID1544
Feature activation+5.02e-02

Pos logit contributions
ceed+0.260
 biggest+0.243
 quickest+0.239
 nic+0.234
 Museum+0.230
Neg logit contributions
 Tim-0.160
 they-0.159
 Joey-0.158
dd-0.153
 he-0.152
Token grabbed
Token ID13176
Feature activation-9.79e-02

Pos logit contributions
ies+0.056
 coughing+0.049
atted+0.048
anged+0.046
 Tim+0.046
Neg logit contributions
ceed-0.105
Whe-0.097
V-0.096
Three-0.095
hest-0.093
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
ceed+0.182
Whe+0.165
V+0.164
Three+0.162
hest+0.158
Neg logit contributions
ies-0.133
 coughing-0.120
atted-0.116
 throats-0.114
anged-0.113
Token cr
Token ID1067
Feature activation-7.02e-02

Pos logit contributions
Whe+0.270
ceed+0.269
V+0.263
Three+0.256
lor+0.253
Neg logit contributions
ies-0.205
 and-0.197
 cigarette-0.176
--0.174
 coughing-0.172
Tokenay
Token ID323
Feature activation-2.38e-02

Pos logit contributions
Whe+0.111
Ready+0.106
 quickest+0.106
ceed+0.105
V+0.105
Neg logit contributions
ies-0.106
 and-0.098
atted-0.097
--0.094
lee-0.093
Tokenon
Token ID261
Feature activation+1.50e-02

Pos logit contributions
ceed+0.053
Whe+0.050
V+0.049
Three+0.049
 quickest+0.048
Neg logit contributions
ies-0.023
atted-0.019
 coughing-0.019
anged-0.019
 Tim-0.018
Token and
Token ID290
Feature activation+2.33e-01

Pos logit contributions
 coughing+0.011
ies+0.010
atted+0.010
anged+0.010
 throats+0.010
Neg logit contributions
ceed-0.035
hest-0.033
lor-0.033
 quickest-0.032
 biggest-0.032
Token wrote
Token ID2630
Feature activation-3.27e-02

Pos logit contributions
ies+0.310
 coughing+0.288
 throats+0.275
atted+0.273
bie+0.265
Neg logit contributions
ceed-0.432
V-0.382
Whe-0.382
Ready-0.374
Three-0.372
Token stroll
Token ID41360
Feature activation-2.10e-02

Pos logit contributions
ceed+0.173
Whe+0.156
Three+0.155
V+0.155
 quickest+0.151
Neg logit contributions
ies-0.152
atted-0.137
 coughing-0.137
anged-0.133
 throats-0.133
Token in
Token ID287
Feature activation-3.57e-02

Pos logit contributions
ceed+0.041
 quickest+0.038
 fastest+0.038
hest+0.037
Whe+0.037
Neg logit contributions
ies-0.023
 coughing-0.022
 throats-0.022
atted-0.021
bie-0.021
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
ceed+0.079
Whe+0.072
Three+0.070
V+0.070
lor+0.069
Neg logit contributions
ies-0.036
 Tim-0.035
 coughing-0.034
 they-0.032
 because-0.032
Token park
Token ID3952
Feature activation+1.04e-02

Pos logit contributions
ceed+0.202
hest+0.179
V+0.176
lor+0.174
Whe+0.172
Neg logit contributions
ies-0.183
 Tim-0.183
 coughing-0.183
 mad-0.175
 because-0.175
Token!
Token ID0
Feature activation-3.48e-03

Pos logit contributions
ies+0.010
 coughing+0.009
atted+0.009
 Tim+0.008
anged+0.008
Neg logit contributions
ceed-0.023
Whe-0.021
V-0.021
Three-0.021
hest-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.57e-01

Pos logit contributions
ceed+0.006
redible+0.006
 quickest+0.005
hest+0.005
V+0.005
Neg logit contributions
 Tim-0.005
 Billy-0.005
 Joey-0.005
 they-0.005
 Johnny-0.005
TokenIn
Token ID818
Feature activation-1.00e-01

Pos logit contributions
ceed+0.274
 quickest+0.255
 exact+0.248
ieve+0.247
 nic+0.246
Neg logit contributions
 Tim-0.237
 and-0.234
 Tommy-0.212
 they-0.211
 he-0.208
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
ceed+0.212
hest+0.189
Whe+0.188
Three+0.188
V+0.187
Neg logit contributions
ies-0.106
 Tim-0.106
 they-0.104
 coughing-0.099
 Billy-0.098
Token morning
Token ID3329
Feature activation-1.01e-02

Pos logit contributions
ceed+0.200
lor+0.169
Whe+0.168
hest+0.167
V+0.167
Neg logit contributions
ies-0.204
 coughing-0.199
 Tim-0.195
 they-0.189
 Billy-0.185
Token,
Token ID11
Feature activation+1.45e-01

Pos logit contributions
ceed+0.021
Whe+0.020
Three+0.020
 middle+0.020
V+0.019
Neg logit contributions
ies-0.011
anged-0.010
atted-0.010
 throats-0.009
 coughing-0.009
Token Mama
Token ID40084
Feature activation+1.38e-01

Pos logit contributions
ies+0.209
 throats+0.194
 coughing+0.191
atted+0.191
anged+0.190
Neg logit contributions
ceed-0.251
Whe-0.241
Three-0.235
V-0.234
 quickest-0.226
Token her
Token ID607
Feature activation-5.76e-02

Pos logit contributions
ceed+0.192
Whe+0.177
Three+0.177
V+0.176
hest+0.172
Neg logit contributions
ies-0.107
 coughing-0.095
atted-0.091
 Tim-0.091
 they-0.089
Token task
Token ID4876
Feature activation-6.18e-02

Pos logit contributions
ceed+0.082
V+0.074
Whe+0.072
Three+0.071
hest+0.071
Neg logit contributions
ies-0.100
 coughing-0.095
 mad-0.094
 and-0.093
 they-0.093
Token!
Token ID0
Feature activation+5.80e-03

Pos logit contributions
ceed+0.151
Whe+0.140
V+0.138
Three+0.138
hest+0.137
Neg logit contributions
ies-0.046
 coughing-0.038
atted-0.037
anged-0.035
 throats-0.035
Token<|endoftext|>
Token ID50256
Feature activation-1.62e-01

Pos logit contributions
 Tim+0.007
 they+0.007
 Billy+0.007
 Joey+0.007
 John+0.007
Neg logit contributions
ceed-0.011
redible-0.011
 quickest-0.010
hest-0.010
 fastest-0.010
TokenOnce
Token ID7454
Feature activation-1.87e-02

Pos logit contributions
ceed+0.242
 quickest+0.214
Insp+0.209
hest+0.209
 exact+0.206
Neg logit contributions
 Tim-0.281
 and-0.277
 they-0.263
 Tommy-0.260
ies-0.255
Token upon
Token ID2402
Feature activation-1.75e-01

Pos logit contributions
ceed+0.028
V+0.026
Whe+0.025
Three+0.025
Insp+0.025
Neg logit contributions
ies-0.028
 coughing-0.028
 throats-0.027
anged-0.027
atted-0.026
Token a
Token ID257
Feature activation-7.96e-02

Pos logit contributions
ceed+0.293
 quickest+0.261
Whe+0.259
Three+0.257
lor+0.255
Neg logit contributions
ies-0.258
 Tim-0.246
 coughing-0.243
 they-0.242
 and-0.238
Token time
Token ID640
Feature activation+5.18e-02

Pos logit contributions
ceed+0.114
Whe+0.103
V+0.102
Three+0.101
hest+0.098
Neg logit contributions
ies-0.139
 coughing-0.129
atted-0.128
anged-0.126
 throats-0.124
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
 throats+0.055
ies+0.054
 coughing+0.052
anged+0.049
atted+0.049
Neg logit contributions
ceed-0.102
 Believe-0.098
V-0.096
Three-0.096
Whe-0.096
Token there
Token ID612
Feature activation-1.11e-01

Pos logit contributions
ies+0.178
 coughing+0.157
atted+0.153
 Tim+0.149
anged+0.148
Neg logit contributions
ceed-0.317
Whe-0.290
V-0.286
Three-0.285
hest-0.279
Token was
Token ID373
Feature activation-2.02e-02

Pos logit contributions
ceed+0.202
Whe+0.184
V+0.182
Three+0.181
hest+0.175
Neg logit contributions
ies-0.154
 coughing-0.139
atted-0.136
 throats-0.133
anged-0.133