TOP ACTIVATIONS
MAX = 0.689

her mom my watched in amaz ement as the machine flew
. Tim my watched with amaz ement as he saw stars
air . He watched with amaz ement as the z ig
go ! Everyone watched in amaz ement as it took off
egg ! He watched in amaz ement as the egg began
room and laughed with g lee . The little girl had
friends stopped and watched in amaz ement . The dog too
. He laughed with g lee . Mom
they both looked around in amaz ement . The lake was
, â said T od . â
. To her amaz ement , the clothes came
little girl looked up in amaz ement and said , "
eddy bear and watched in amaz ement as it shrunk an
ered in , blinking in amaz ement . But
again and again . J od ie would always remember the
animals were watching him with amaz ement .
. The child was so mesmer ized . " Can I
rabbit be amed with g lee as they walked side by
them into beautiful or nam ents . One day
The little girl watched with amaz ement when her mom my

MIN ACTIVATIONS
MIN = -0.412

. It said : " P ond closed . Come back
to regret ." P ete was sad , but
own coin . P ipp a was very unhappy
says . " P ow ! Pow ! I
oven . P enny and her mom my
beautiful butterfly ! P ixie was so happy to
party perfect ." P inky smiled and said ,
board . P ete was curious , so
bird named Polly . " P olly , what is this
P ray to God and He
almost ready . P apa nodded sadly . He
cable ?" P apa said , " It
together and said , " P inky promise ."
go . P apa bent down and said
me ." P ete thought for a minute
're hungry ." P olly was happy with Sammy
never be peace . <|endoftext|> P olly was ready to plan
fear away . P apa held Lily 's hand
more fun !" P enny said , " Okay
her violin ! P olly was so excited ,

INTERVAL 0.414 - 0.689
CONTAINS 0.112%

especially liked the bright colors of the yarn she offered !
lots of people and lots of stores . He saw a
. He felt a lot of pain and started to cry
â s full of life . â
to take a bite out of it . It was crunch

INTERVAL 0.139 - 0.414
CONTAINS 10.155%

play . She thought it could be fun to climb up
believed it was from a magical place and so she tucked
saw the broken switch and the dark room . She was
. So , Maya and Sam tried their best
The three - year - old asked Daddy , " Where

INTERVAL -0.137 - 0.139
CONTAINS 80.763%

you eat ?" Tom asked . " Snow man
what she saw - a beautiful castle ! She ran towards
but no one heard her . She felt very sorry for
ily ran outside and joined the other kids . They were
One day , Lily 's mom told her that they were

INTERVAL -0.412 - -0.137
CONTAINS 8.970%

ila and said , " Hi , L ila . Do
proudly displayed the tape to his mom . But his mom
. One day , she went to her friend Tim my
the house . Later , Ann saw her friend
, and the pond was back to normal .
Token her
Token ID607
Feature activation+2.00e-01

Pos logit contributions
P+0.013
His+0.012
ercise+0.011
 helmet+0.010
 his+0.010
Neg logit contributions
 amaz-0.019
lee-0.018
 spec-0.017
od-0.017
 ago-0.017
Token mom
Token ID1995
Feature activation+2.64e-01

Pos logit contributions
 amaz+0.292
lee+0.273
 ago+0.264
 spec+0.257
 angles+0.251
Neg logit contributions
P-0.289
His-0.275
ercise-0.252
O-0.237
Move-0.236
Tokenmy
Token ID1820
Feature activation+1.06e-01

Pos logit contributions
lee+0.369
 amaz+0.368
 ago+0.347
od+0.327
 spec+0.325
Neg logit contributions
P-0.391
His-0.374
ercise-0.329
Spot-0.323
 helmet-0.321
Token watched
Token ID7342
Feature activation+2.38e-01

Pos logit contributions
lee+0.132
 amaz+0.124
 ago+0.124
od+0.112
 spec+0.110
Neg logit contributions
P-0.181
His-0.168
Br-0.152
O-0.152
Spot-0.151
Token in
Token ID287
Feature activation+1.72e-01

Pos logit contributions
 amaz+0.345
lee+0.342
od+0.309
 ago+0.307
 spec+0.306
Neg logit contributions
P-0.361
His-0.324
ercise-0.305
Spot-0.298
w-0.297
Token amaz
Token ID40642
Feature activation+6.13e-01

Pos logit contributions
 amaz+0.253
lee+0.234
 spec+0.220
 ago+0.220
 angles+0.217
Neg logit contributions
P-0.247
His-0.234
ercise-0.214
 helmet-0.207
Spot-0.203
Tokenement
Token ID972
Feature activation+3.81e-01

Pos logit contributions
 amaz+0.503
lee+0.458
 spec+0.404
od+0.396
 ago+0.395
Neg logit contributions
P-1.241
His-1.202
ercise-1.128
 helmet-1.115
Spot-1.090
Token as
Token ID355
Feature activation+1.96e-01

Pos logit contributions
 amaz+0.510
lee+0.507
od+0.478
 ago+0.478
 spec+0.457
Neg logit contributions
P-0.611
His-0.557
ercise-0.518
Spot-0.507
O-0.501
Token the
Token ID262
Feature activation+2.41e-01

Pos logit contributions
 amaz+0.300
lee+0.280
 spec+0.265
od+0.263
 ago+0.263
Neg logit contributions
P-0.268
His-0.252
ercise-0.233
 helmet-0.221
w-0.218
Token machine
Token ID4572
Feature activation+1.89e-01

Pos logit contributions
 amaz+0.346
lee+0.320
 spec+0.306
 ago+0.299
 angles+0.295
Neg logit contributions
P-0.349
His-0.340
ercise-0.306
 helmet-0.292
w-0.289
Token flew
Token ID13112
Feature activation+8.32e-02

Pos logit contributions
 amaz+0.240
lee+0.231
 ago+0.225
od+0.221
 spec+0.221
Neg logit contributions
P-0.320
His-0.292
Spot-0.260
Br-0.259
ercise-0.258
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
P+0.076
His+0.076
ercise+0.070
OK+0.067
 helmet+0.066
Neg logit contributions
lee-0.068
 amaz-0.068
 mesmer-0.063
od-0.063
 spec-0.062
Token Tim
Token ID5045
Feature activation+1.45e-01

Pos logit contributions
 ago+0.369
 amaz+0.361
lee+0.353
 mesmer+0.314
od+0.307
Neg logit contributions
P-0.429
w-0.396
ercise-0.384
 helmet-0.365
His-0.360
Tokenmy
Token ID1820
Feature activation+9.47e-02

Pos logit contributions
lee+0.213
 amaz+0.190
 angles+0.183
ents+0.176
od+0.174
Neg logit contributions
P-0.208
His-0.191
Br-0.174
ercise-0.170
 helmet-0.169
Token watched
Token ID7342
Feature activation+2.24e-01

Pos logit contributions
lee+0.121
 amaz+0.111
 mesmer+0.105
 spec+0.100
od+0.100
Neg logit contributions
P-0.157
His-0.138
ercise-0.130
Br-0.129
Wait-0.129
Token with
Token ID351
Feature activation+1.41e-01

Pos logit contributions
 amaz+0.371
lee+0.365
od+0.338
 spec+0.332
 ago+0.328
Neg logit contributions
P-0.289
His-0.244
ercise-0.235
w-0.228
OK-0.225
Token amaz
Token ID40642
Feature activation+6.02e-01

Pos logit contributions
od+0.277
lee+0.273
ous+0.254
 spec+0.252
 ago+0.252
Neg logit contributions
 his-0.125
 helmet-0.117
His-0.114
P-0.097
 down-0.096
Tokenement
Token ID972
Feature activation+3.72e-01

Pos logit contributions
 amaz+0.517
lee+0.470
 spec+0.420
 ago+0.417
od+0.406
Neg logit contributions
P-1.196
His-1.164
ercise-1.086
 helmet-1.077
w-1.046
Token as
Token ID355
Feature activation+1.87e-01

Pos logit contributions
 amaz+0.588
lee+0.563
 spec+0.528
od+0.527
 ago+0.526
Neg logit contributions
P-0.477
His-0.447
ercise-0.405
 helmet-0.393
Spot-0.384
Token he
Token ID339
Feature activation+2.43e-02

Pos logit contributions
 amaz+0.342
lee+0.328
 spec+0.312
od+0.309
 ago+0.309
Neg logit contributions
P-0.189
His-0.178
ercise-0.155
 helmet-0.152
Spot-0.144
Token saw
Token ID2497
Feature activation+9.27e-02

Pos logit contributions
 amaz+0.039
lee+0.037
od+0.034
 spec+0.034
 ago+0.034
Neg logit contributions
P-0.031
His-0.030
ercise-0.027
 helmet-0.026
Mid-0.026
Token stars
Token ID5788
Feature activation+2.86e-01

Pos logit contributions
 amaz+0.157
lee+0.149
 spec+0.142
od+0.140
 angles+0.139
Neg logit contributions
P-0.110
His-0.102
ercise-0.093
 helmet-0.088
Spot-0.087
Token air
Token ID1633
Feature activation+4.13e-01

Pos logit contributions
P+0.124
His+0.118
 helmet+0.099
ercise+0.098
Spot+0.095
Neg logit contributions
 amaz-0.246
lee-0.241
 ago-0.225
 angles-0.225
 spec-0.222
Token.
Token ID13
Feature activation+2.36e-01

Pos logit contributions
 amaz+0.560
lee+0.557
 ago+0.539
 spec+0.522
 mesmer+0.506
Neg logit contributions
P-0.609
His-0.590
ercise-0.534
Spot-0.513
O-0.507
Token He
Token ID679
Feature activation+5.27e-02

Pos logit contributions
 ago+0.310
 amaz+0.301
lee+0.292
 mesmer+0.257
 spec+0.251
Neg logit contributions
P-0.368
w-0.346
ercise-0.334
 helmet-0.317
uch-0.315
Token watched
Token ID7342
Feature activation+2.14e-01

Pos logit contributions
 amaz+0.077
lee+0.073
 spec+0.068
 ago+0.067
 mesmer+0.067
Neg logit contributions
P-0.077
His-0.072
ercise-0.064
Spot-0.064
Br-0.063
Token with
Token ID351
Feature activation+1.32e-01

Pos logit contributions
 amaz+0.319
lee+0.313
od+0.289
 spec+0.283
 ago+0.281
Neg logit contributions
P-0.325
His-0.271
ercise-0.268
w-0.262
OK-0.257
Token amaz
Token ID40642
Feature activation+5.91e-01

Pos logit contributions
lee+0.239
od+0.237
 amaz+0.224
 spec+0.221
ous+0.219
Neg logit contributions
His-0.120
 his-0.118
 helmet-0.114
P-0.111
 away-0.097
Tokenement
Token ID972
Feature activation+3.64e-01

Pos logit contributions
 amaz+0.445
lee+0.401
 spec+0.347
od+0.346
 angles+0.331
Neg logit contributions
P-1.251
His-1.203
ercise-1.139
Spot-1.120
 helmet-1.118
Token as
Token ID355
Feature activation+1.81e-01

Pos logit contributions
 amaz+0.528
lee+0.515
 ago+0.497
od+0.482
 spec+0.470
Neg logit contributions
P-0.547
His-0.487
ercise-0.454
Spot-0.446
O-0.442
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
 amaz+0.322
lee+0.307
 spec+0.292
 ago+0.290
od+0.290
Neg logit contributions
P-0.196
His-0.182
ercise-0.163
 helmet-0.156
Spot-0.151
Token z
Token ID1976
Feature activation+1.36e-01

Pos logit contributions
 amaz+0.309
lee+0.286
 spec+0.273
 ago+0.266
 angles+0.265
Neg logit contributions
P-0.331
His-0.321
ercise-0.290
 helmet-0.276
Spot-0.275
Tokenig
Token ID328
Feature activation+7.83e-02

Pos logit contributions
 amaz+0.198
lee+0.198
 ago+0.184
 spec+0.183
od+0.182
Neg logit contributions
P-0.181
His-0.181
Mid-0.163
Spot-0.162
ercise-0.160
Token go
Token ID467
Feature activation+1.83e-01

Pos logit contributions
 amaz+0.461
lee+0.428
 ago+0.404
 spec+0.397
 angles+0.394
Neg logit contributions
P-0.411
His-0.379
ercise-0.348
w-0.342
OK-0.330
Token!
Token ID0
Feature activation+2.54e-01

Pos logit contributions
 ago+0.202
 of+0.195
lee+0.180
 below+0.171
keeping+0.170
Neg logit contributions
P-0.310
O-0.286
His-0.280
ercise-0.280
Deal-0.279
Token Everyone
Token ID11075
Feature activation+3.11e-01

Pos logit contributions
 amaz+0.330
lee+0.317
 ago+0.315
 mesmer+0.284
 spec+0.283
Neg logit contributions
P-0.387
w-0.353
ercise-0.349
His-0.338
 helmet-0.334
Token watched
Token ID7342
Feature activation+2.10e-01

Pos logit contributions
 amaz+0.413
lee+0.402
 mesmer+0.378
 angles+0.375
 spec+0.373
Neg logit contributions
P-0.476
His-0.440
Br-0.411
ercise-0.405
Mid-0.402
Token in
Token ID287
Feature activation+1.49e-01

Pos logit contributions
lee+0.281
 amaz+0.278
od+0.260
 spec+0.253
 ago+0.252
Neg logit contributions
P-0.356
His-0.310
ercise-0.294
w-0.284
 helmet-0.280
Token amaz
Token ID40642
Feature activation+5.88e-01

Pos logit contributions
 amaz+0.214
lee+0.186
 ago+0.182
 angles+0.178
 spec+0.177
Neg logit contributions
P-0.238
His-0.220
ercise-0.204
w-0.195
Spot-0.194
Tokenement
Token ID972
Feature activation+3.60e-01

Pos logit contributions
 amaz+0.473
lee+0.429
 spec+0.375
od+0.370
 ago+0.366
Neg logit contributions
P-1.205
His-1.166
ercise-1.096
 helmet-1.082
Spot-1.066
Token as
Token ID355
Feature activation+1.82e-01

Pos logit contributions
 amaz+0.532
lee+0.518
 ago+0.493
od+0.490
 spec+0.475
Neg logit contributions
P-0.523
His-0.472
ercise-0.437
Spot-0.425
O-0.421
Token it
Token ID340
Feature activation+2.27e-01

Pos logit contributions
 amaz+0.266
lee+0.253
 spec+0.237
od+0.234
 ago+0.234
Neg logit contributions
P-0.246
His-0.237
ercise-0.213
 helmet-0.212
Spot-0.204
Token took
Token ID1718
Feature activation+5.25e-02

Pos logit contributions
 amaz+0.317
lee+0.285
 spec+0.282
 mesmer+0.275
od+0.273
Neg logit contributions
P-0.367
His-0.337
Spot-0.299
ercise-0.298
Mid-0.296
Token off
Token ID572
Feature activation+9.05e-02

Pos logit contributions
 amaz+0.079
lee+0.069
od+0.068
 spec+0.066
 awe+0.066
Neg logit contributions
P-0.076
His-0.075
Sorry-0.066
ercise-0.065
 helmet-0.064
Token egg
Token ID5935
Feature activation+3.13e-01

Pos logit contributions
 amaz+0.640
lee+0.585
 ago+0.580
 spec+0.567
od+0.560
Neg logit contributions
P-0.635
His-0.619
ercise-0.563
Mid-0.541
Spot-0.540
Token!
Token ID0
Feature activation+2.40e-01

Pos logit contributions
 ago+0.413
 amaz+0.406
lee+0.393
 spec+0.384
ous+0.381
Neg logit contributions
P-0.476
His-0.449
ercise-0.423
Move-0.404
Mid-0.403
Token He
Token ID679
Feature activation+3.86e-02

Pos logit contributions
 amaz+0.334
lee+0.314
 ago+0.310
od+0.288
 spec+0.287
Neg logit contributions
P-0.358
ercise-0.321
w-0.319
His-0.310
 helmet-0.304
Token watched
Token ID7342
Feature activation+2.04e-01

Pos logit contributions
 amaz+0.055
lee+0.054
 spec+0.050
 mesmer+0.049
 ago+0.049
Neg logit contributions
P-0.058
His-0.052
ercise-0.050
Spot-0.049
 helmet-0.047
Token in
Token ID287
Feature activation+1.43e-01

Pos logit contributions
 amaz+0.274
lee+0.258
od+0.241
 spec+0.239
 mesmer+0.234
Neg logit contributions
P-0.333
His-0.298
ercise-0.287
Spot-0.275
OK-0.274
Token amaz
Token ID40642
Feature activation+5.84e-01

Pos logit contributions
 amaz+0.146
lee+0.143
od+0.129
 spec+0.127
 ago+0.125
Neg logit contributions
P-0.251
His-0.246
 helmet-0.227
ercise-0.227
Spot-0.218
Tokenement
Token ID972
Feature activation+3.58e-01

Pos logit contributions
 amaz+0.487
lee+0.445
 spec+0.395
 ago+0.387
od+0.385
Neg logit contributions
P-1.172
His-1.137
ercise-1.064
 helmet-1.055
w-1.024
Token as
Token ID355
Feature activation+1.76e-01

Pos logit contributions
 amaz+0.539
lee+0.518
od+0.489
 ago+0.488
 spec+0.481
Neg logit contributions
P-0.507
His-0.457
ercise-0.428
Spot-0.413
 helmet-0.406
Token the
Token ID262
Feature activation+2.20e-01

Pos logit contributions
 amaz+0.302
lee+0.292
 spec+0.274
od+0.272
 ago+0.272
Neg logit contributions
P-0.191
His-0.186
 helmet-0.162
ercise-0.158
Spot-0.151
Token egg
Token ID5935
Feature activation+2.20e-01

Pos logit contributions
 amaz+0.309
lee+0.292
 spec+0.273
od+0.270
 ago+0.269
Neg logit contributions
P-0.320
His-0.306
ercise-0.280
 helmet-0.275
Spot-0.266
Token began
Token ID2540
Feature activation+1.38e-01

Pos logit contributions
 amaz+0.315
lee+0.302
 spec+0.290
 ago+0.286
od+0.284
Neg logit contributions
P-0.337
His-0.300
ercise-0.277
Spot-0.276
Mid-0.275
Token room
Token ID2119
Feature activation+3.27e-01

Pos logit contributions
 amaz+0.495
lee+0.467
 spec+0.432
od+0.428
 ago+0.426
Neg logit contributions
P-0.602
His-0.577
ercise-0.530
 helmet-0.522
Spot-0.507
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
 ago+0.420
 amaz+0.416
lee+0.397
od+0.363
 mesmer+0.361
Neg logit contributions
P-0.543
His-0.495
ercise-0.488
Spot-0.470
O-0.467
Token laughed
Token ID13818
Feature activation+3.22e-01

Pos logit contributions
 amaz+0.340
lee+0.310
 ago+0.292
 spec+0.292
 mesmer+0.288
Neg logit contributions
P-0.332
His-0.303
Spot-0.269
ercise-0.267
O-0.266
Token with
Token ID351
Feature activation+1.40e-01

Pos logit contributions
lee+0.379
 amaz+0.370
 mesmer+0.355
od+0.344
 spec+0.318
Neg logit contributions
P-0.588
use-0.528
w-0.521
ercise-0.498
Move-0.491
Token g
Token ID308
Feature activation+2.93e-01

Pos logit contributions
 ago+0.206
 spec+0.199
od+0.198
lee+0.197
ents+0.195
Neg logit contributions
His-0.173
 his-0.173
P-0.164
 helmet-0.152
ercise-0.147
Tokenlee
Token ID7197
Feature activation+5.84e-01

Pos logit contributions
 amaz+0.121
 spec+0.067
 angles+0.051
 ago+0.049
 awe+0.048
Neg logit contributions
P-0.704
His-0.695
ercise-0.651
 helmet-0.639
w-0.639
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 amaz+0.885
lee+0.851
 spec+0.795
od+0.790
 ago+0.789
Neg logit contributions
P-0.775
His-0.735
ercise-0.665
 helmet-0.651
Spot-0.629
Token The
Token ID383
Feature activation+2.26e-01

Pos logit contributions
lee+0.236
 amaz+0.232
 ago+0.218
od+0.203
 mesmer+0.201
Neg logit contributions
P-0.252
w-0.231
ercise-0.224
His-0.223
 helmet-0.220
Token little
Token ID1310
Feature activation+2.08e-01

Pos logit contributions
 amaz+0.326
lee+0.302
 spec+0.289
 ago+0.284
 angles+0.280
Neg logit contributions
P-0.324
His-0.311
ercise-0.284
 helmet-0.268
w-0.267
Token girl
Token ID2576
Feature activation+1.14e-01

Pos logit contributions
 amaz+0.316
lee+0.307
 spec+0.290
 ago+0.283
 angles+0.279
Neg logit contributions
P-0.280
His-0.276
ercise-0.247
w-0.231
 helmet-0.230
Token had
Token ID550
Feature activation+1.71e-01

Pos logit contributions
 amaz+0.159
lee+0.159
 mesmer+0.143
od+0.140
 spec+0.140
Neg logit contributions
P-0.172
His-0.159
ercise-0.150
 helmet-0.145
w-0.142
Token friends
Token ID2460
Feature activation+1.78e-01

Pos logit contributions
 amaz+0.331
lee+0.322
od+0.307
 spec+0.307
 angles+0.300
Neg logit contributions
P-0.254
His-0.244
 helmet-0.224
ercise-0.216
Spot-0.206
Token stopped
Token ID5025
Feature activation+1.51e-01

Pos logit contributions
 amaz+0.239
lee+0.239
 mesmer+0.212
od+0.209
 ago+0.207
Neg logit contributions
P-0.280
His-0.262
ercise-0.246
 helmet-0.242
Spot-0.241
Token and
Token ID290
Feature activation+1.89e-01

Pos logit contributions
 amaz+0.201
lee+0.189
 ago+0.175
 mesmer+0.174
od+0.172
Neg logit contributions
P-0.243
His-0.223
ercise-0.211
Spot-0.206
w-0.205
Token watched
Token ID7342
Feature activation+1.99e-01

Pos logit contributions
 amaz+0.268
lee+0.256
 mesmer+0.241
od+0.233
 spec+0.226
Neg logit contributions
P-0.284
His-0.257
w-0.241
Spot-0.241
ercise-0.235
Token in
Token ID287
Feature activation+1.33e-01

Pos logit contributions
 amaz+0.308
lee+0.299
od+0.276
 spec+0.275
 ago+0.274
Neg logit contributions
P-0.272
His-0.245
ercise-0.228
w-0.222
 helmet-0.219
Token amaz
Token ID40642
Feature activation+5.82e-01

Pos logit contributions
lee+0.152
od+0.142
 amaz+0.139
 spec+0.135
ents+0.128
Neg logit contributions
His-0.214
P-0.211
 helmet-0.198
 his-0.197
ercise-0.192
Tokenement
Token ID972
Feature activation+3.55e-01

Pos logit contributions
 amaz+0.535
lee+0.493
 spec+0.456
 ago+0.452
 mesmer+0.445
Neg logit contributions
P-1.100
His-1.078
 helmet-0.998
ercise-0.997
w-0.952
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 amaz+0.522
lee+0.503
 ago+0.473
od+0.471
 spec+0.464
Neg logit contributions
P-0.506
His-0.462
ercise-0.430
Spot-0.414
 helmet-0.413
Token The
Token ID383
Feature activation+2.10e-01

Pos logit contributions
 amaz+0.215
lee+0.209
 ago+0.196
 mesmer+0.186
 spec+0.181
Neg logit contributions
P-0.229
w-0.216
 helmet-0.199
ercise-0.194
His-0.194
Token dog
Token ID3290
Feature activation+7.62e-02

Pos logit contributions
 amaz+0.303
lee+0.282
 spec+0.267
 ago+0.266
 angles+0.260
Neg logit contributions
P-0.302
His-0.292
ercise-0.264
w-0.251
 helmet-0.251
Token too
Token ID1165
Feature activation+7.36e-02

Pos logit contributions
 amaz+0.095
lee+0.093
 mesmer+0.091
 spec+0.083
od+0.083
Neg logit contributions
P-0.135
w-0.115
His-0.111
ercise-0.109
Br-0.107
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
 ago+0.287
 of+0.275
od+0.262
lee+0.254
 amaz+0.253
Neg logit contributions
His-0.583
P-0.581
Br-0.571
Move-0.542
Wait-0.530
Token He
Token ID679
Feature activation+4.08e-02

Pos logit contributions
 amaz+0.311
lee+0.294
 ago+0.291
 spec+0.268
od+0.267
Neg logit contributions
P-0.314
His-0.283
ercise-0.279
w-0.277
 helmet-0.272
Token laughed
Token ID13818
Feature activation+3.12e-01

Pos logit contributions
 amaz+0.059
lee+0.057
 spec+0.053
 ago+0.052
od+0.051
Neg logit contributions
P-0.058
His-0.056
Br-0.049
ercise-0.049
Spot-0.048
Token with
Token ID351
Feature activation+1.32e-01

Pos logit contributions
lee+0.248
 amaz+0.245
 mesmer+0.237
 of+0.218
od+0.218
Neg logit contributions
P-0.675
use-0.639
w-0.611
uch-0.603
ercise-0.584
Token g
Token ID308
Feature activation+2.85e-01

Pos logit contributions
od+0.190
 ago+0.187
 spec+0.184
lee+0.184
 amaz+0.183
Neg logit contributions
His-0.175
 his-0.172
P-0.166
 helmet-0.159
ercise-0.146
Tokenlee
Token ID7197
Feature activation+5.82e-01

Pos logit contributions
 amaz+0.093
lee+0.076
 spec+0.046
od+0.045
 ago+0.043
Neg logit contributions
P-0.723
His-0.704
ercise-0.670
 helmet-0.664
Spot-0.654
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
lee+0.764
 amaz+0.715
 ago+0.699
od+0.691
 spec+0.667
Neg logit contributions
P-0.911
His-0.851
ercise-0.771
Move-0.770
O-0.763
Token
Token ID220
Feature activation+1.77e-01

Pos logit contributions
lee+0.234
 amaz+0.226
 ago+0.216
od+0.200
 mesmer+0.194
Neg logit contributions
P-0.237
w-0.219
ercise-0.211
His-0.204
 helmet-0.202
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
lee+0.227
 amaz+0.223
od+0.213
 ago+0.208
 angles+0.198
Neg logit contributions
P-0.266
His-0.244
ercise-0.243
 helmet-0.239
w-0.226
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
lee+0.194
 amaz+0.189
od+0.177
 ago+0.172
 angles+0.168
Neg logit contributions
P-0.226
ercise-0.222
 helmet-0.218
w-0.210
His-0.202
TokenMom
Token ID29252
Feature activation+1.95e-03

Pos logit contributions
 amaz+0.227
lee+0.220
 mesmer+0.214
od+0.212
 ago+0.206
Neg logit contributions
 down-0.164
 helmet-0.160
use-0.159
ercise-0.157
w-0.155
Token they
Token ID484
Feature activation+1.77e-01

Pos logit contributions
 amaz+0.129
lee+0.119
 mesmer+0.118
 ago+0.113
 spec+0.111
Neg logit contributions
P-0.154
His-0.137
ercise-0.131
w-0.127
O-0.123
Token both
Token ID1111
Feature activation+1.70e-01

Pos logit contributions
 amaz+0.255
lee+0.250
 mesmer+0.234
 ago+0.234
 spec+0.225
Neg logit contributions
P-0.253
His-0.238
ercise-0.220
 helmet-0.210
Mid-0.210
Token looked
Token ID3114
Feature activation+1.19e-01

Pos logit contributions
 amaz+0.248
lee+0.240
 ago+0.222
 spec+0.221
 mesmer+0.220
Neg logit contributions
P-0.242
His-0.226
ercise-0.211
 helmet-0.203
Spot-0.198
Token around
Token ID1088
Feature activation+2.38e-01

Pos logit contributions
 amaz+0.180
lee+0.168
 mesmer+0.164
 awe+0.158
 spec+0.157
Neg logit contributions
P-0.181
His-0.165
O-0.152
Spot-0.151
w-0.149
Token in
Token ID287
Feature activation+1.34e-01

Pos logit contributions
 amaz+0.354
lee+0.322
 ago+0.319
 mesmer+0.308
 spec+0.305
Neg logit contributions
P-0.350
His-0.322
ercise-0.310
Spot-0.295
w-0.295
Token amaz
Token ID40642
Feature activation+5.82e-01

Pos logit contributions
lee+0.192
od+0.190
ous+0.180
 amaz+0.176
 spec+0.176
Neg logit contributions
P-0.169
His-0.169
 his-0.159
 helmet-0.150
 away-0.146
Tokenement
Token ID972
Feature activation+3.55e-01

Pos logit contributions
 amaz+0.535
lee+0.490
 spec+0.448
 ago+0.439
 mesmer+0.437
Neg logit contributions
P-1.100
His-1.072
 helmet-0.992
ercise-0.984
w-0.950
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 amaz+0.537
lee+0.517
 ago+0.488
od+0.482
 spec+0.480
Neg logit contributions
P-0.484
His-0.452
ercise-0.416
 helmet-0.401
Spot-0.398
Token The
Token ID383
Feature activation+2.06e-01

Pos logit contributions
 amaz+0.223
lee+0.212
 ago+0.204
 spec+0.195
 mesmer+0.194
Neg logit contributions
P-0.227
His-0.205
w-0.199
ercise-0.198
 helmet-0.197
Token lake
Token ID13546
Feature activation+1.91e-01

Pos logit contributions
 amaz+0.281
lee+0.256
 spec+0.245
 angles+0.237
 ago+0.235
Neg logit contributions
P-0.316
His-0.306
ercise-0.277
w-0.265
Mid-0.264
Token was
Token ID373
Feature activation+1.03e-01

Pos logit contributions
 amaz+0.272
lee+0.251
 spec+0.241
 mesmer+0.238
 angles+0.231
Neg logit contributions
P-0.304
His-0.267
ercise-0.256
Mid-0.253
 helmet-0.243
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
P+0.131
His+0.113
ercise+0.100
 his+0.099
Mid+0.095
Neg logit contributions
 amaz-0.253
lee-0.232
 mesmer-0.229
 spec-0.228
 ago-0.227
Tokenâ
Token ID22940
Feature activation+2.28e-01

Pos logit contributions
 amaz+0.263
 ago+0.252
lee+0.246
 mesmer+0.234
 spec+0.230
Neg logit contributions
P-0.321
His-0.288
ercise-0.278
w-0.277
mar-0.260
Token
Token ID26391
Feature activation+1.56e-01

Pos logit contributions
 amaz+0.378
lee+0.374
 ago+0.348
od+0.345
 spec+0.344
Neg logit contributions
P-0.253
His-0.246
 helmet-0.221
ercise-0.214
Spot-0.204
Token said
Token ID531
Feature activation+1.99e-01

Pos logit contributions
 amaz+0.243
lee+0.233
 ago+0.218
od+0.218
 spec+0.217
Neg logit contributions
P-0.199
His-0.189
ercise-0.172
 helmet-0.168
Spot-0.159
Token T
Token ID309
Feature activation+3.04e-01

Pos logit contributions
lee+0.259
 amaz+0.244
od+0.232
 ago+0.231
ous+0.221
Neg logit contributions
P-0.313
ercise-0.279
Move-0.270
His-0.267
Spot-0.264
Tokenod
Token ID375
Feature activation+5.79e-01

Pos logit contributions
lee+0.386
od+0.342
 amaz+0.321
 ago+0.319
ous+0.319
Neg logit contributions
P-0.501
His-0.478
ercise-0.452
Br-0.445
Move-0.444
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
lee+0.787
od+0.787
 amaz+0.695
 angles+0.656
 ago+0.650
Neg logit contributions
His-0.845
P-0.840
Spot-0.821
Br-0.818
Move-0.806
Token
Token ID198
Feature activation+1.80e-01

Pos logit contributions
 amaz+0.242
lee+0.242
 ago+0.231
od+0.219
 spec+0.209
Neg logit contributions
P-0.281
w-0.266
ercise-0.258
 helmet-0.239
His-0.237
Token
Token ID198
Feature activation+1.74e-01

Pos logit contributions
lee+0.229
 amaz+0.225
od+0.211
 ago+0.207
 mesmer+0.199
Neg logit contributions
P-0.251
ercise-0.250
 helmet-0.242
His-0.232
w-0.231
Tokenâ
Token ID22940
Feature activation+1.32e-01

Pos logit contributions
 amaz+0.239
lee+0.238
od+0.236
 mesmer+0.229
 ago+0.215
Neg logit contributions
 melts-0.208
 down-0.208
ercise-0.205
 helmet-0.201
erves-0.191
Token
Token ID26391
Feature activation+7.91e-02

Pos logit contributions
lee+0.193
 amaz+0.187
 ago+0.181
od+0.176
 angles+0.173
Neg logit contributions
P-0.176
His-0.162
 helmet-0.158
 helmets-0.147
ercise-0.143
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
 ago+0.223
lee+0.210
 of+0.203
 amaz+0.197
od+0.183
Neg logit contributions
P-0.417
His-0.392
Br-0.372
ercise-0.365
O-0.364
Token
Token ID198
Feature activation+1.84e-01

Pos logit contributions
 amaz+0.242
 ago+0.237
lee+0.234
 mesmer+0.208
 spec+0.208
Neg logit contributions
P-0.287
w-0.274
ercise-0.261
 helmet-0.252
His-0.246
Token
Token ID198
Feature activation+1.77e-01

Pos logit contributions
 amaz+0.242
lee+0.234
 ago+0.221
od+0.214
 mesmer+0.211
Neg logit contributions
P-0.262
ercise-0.253
 helmet-0.246
His-0.240
w-0.237
TokenTo
Token ID2514
Feature activation+8.48e-02

Pos logit contributions
 amaz+0.270
 ago+0.251
 mesmer+0.248
lee+0.248
od+0.242
Neg logit contributions
 helmet-0.197
 down-0.195
 melts-0.193
ercise-0.191
erve-0.183
Token her
Token ID607
Feature activation+3.69e-02

Pos logit contributions
 amaz+0.112
lee+0.103
 ago+0.096
od+0.096
 awe+0.089
Neg logit contributions
P-0.137
His-0.122
ercise-0.121
 helmet-0.117
Spot-0.116
Token amaz
Token ID40642
Feature activation+5.75e-01

Pos logit contributions
lee+0.055
od+0.054
 spec+0.052
 angles+0.051
keeping+0.051
Neg logit contributions
P-0.046
His-0.044
 helmet-0.042
ercise-0.041
 his-0.039
Tokenement
Token ID972
Feature activation+3.48e-01

Pos logit contributions
 amaz+0.426
lee+0.378
 spec+0.324
od+0.321
 ago+0.313
Neg logit contributions
P-1.221
His-1.177
ercise-1.116
Spot-1.100
Mid-1.088
Token,
Token ID11
Feature activation+4.49e-02

Pos logit contributions
 amaz+0.539
lee+0.528
 ago+0.525
od+0.501
ous+0.494
Neg logit contributions
P-0.505
His-0.436
ercise-0.417
Spot-0.406
Move-0.404
Token the
Token ID262
Feature activation+2.02e-01

Pos logit contributions
 amaz+0.061
lee+0.053
 spec+0.052
 mesmer+0.051
 awe+0.050
Neg logit contributions
P-0.073
His-0.066
ercise-0.065
w-0.063
use-0.060
Token clothes
Token ID8242
Feature activation+9.24e-02

Pos logit contributions
 amaz+0.286
lee+0.269
 spec+0.255
 ago+0.250
 angles+0.245
Neg logit contributions
P-0.294
His-0.283
ercise-0.258
 helmet-0.246
w-0.244
Token came
Token ID1625
Feature activation+8.58e-02

Pos logit contributions
 amaz+0.122
lee+0.117
 ago+0.108
 spec+0.106
 angles+0.104
Neg logit contributions
P-0.149
His-0.139
ercise-0.129
Spot-0.127
Mid-0.123
Token little
Token ID1310
Feature activation+2.31e-01

Pos logit contributions
 amaz+0.254
lee+0.233
 spec+0.222
 ago+0.218
 angles+0.209
Neg logit contributions
P-0.258
His-0.258
ercise-0.233
erve-0.222
Move-0.220
Token girl
Token ID2576
Feature activation+1.36e-01

Pos logit contributions
 amaz+0.305
 spec+0.287
lee+0.286
 ago+0.269
 angles+0.261
Neg logit contributions
His-0.360
P-0.343
ercise-0.315
erve-0.302
mar-0.300
Token looked
Token ID3114
Feature activation+1.08e-01

Pos logit contributions
lee+0.191
 amaz+0.191
 mesmer+0.169
 spec+0.166
 ago+0.166
Neg logit contributions
P-0.202
His-0.191
ercise-0.178
 helmet-0.172
Move-0.169
Token up
Token ID510
Feature activation+5.95e-02

Pos logit contributions
 amaz+0.156
lee+0.151
 mesmer+0.140
 awe+0.133
 spec+0.132
Neg logit contributions
P-0.168
His-0.158
Spot-0.150
w-0.145
O-0.144
Token in
Token ID287
Feature activation+1.26e-01

Pos logit contributions
 amaz+0.092
lee+0.087
 mesmer+0.082
 ago+0.081
 spec+0.080
Neg logit contributions
P-0.081
His-0.076
ercise-0.074
Spot-0.071
w-0.069
Token amaz
Token ID40642
Feature activation+5.74e-01

Pos logit contributions
od+0.209
lee+0.205
ous+0.194
ents+0.193
 spec+0.192
Neg logit contributions
His-0.135
P-0.134
 his-0.133
 helmet-0.121
 away-0.120
Tokenement
Token ID972
Feature activation+3.48e-01

Pos logit contributions
 amaz+0.540
lee+0.500
 spec+0.463
 ago+0.459
 mesmer+0.452
Neg logit contributions
P-1.078
His-1.049
 helmet-0.967
ercise-0.955
w-0.928
Token and
Token ID290
Feature activation+1.58e-01

Pos logit contributions
 amaz+0.537
lee+0.516
 ago+0.485
od+0.480
 spec+0.480
Neg logit contributions
P-0.466
His-0.434
ercise-0.399
 helmet-0.384
Spot-0.381
Token said
Token ID531
Feature activation+1.28e-01

Pos logit contributions
 amaz+0.250
lee+0.236
 spec+0.220
 mesmer+0.218
 angles+0.217
Neg logit contributions
P-0.212
His-0.197
ercise-0.178
 helmet-0.175
Spot-0.173
Token,
Token ID11
Feature activation+3.68e-02

Pos logit contributions
 amaz+0.197
lee+0.191
od+0.185
 mesmer+0.177
 spec+0.174
Neg logit contributions
P-0.178
ercise-0.155
His-0.155
Spot-0.153
w-0.152
Token "
Token ID366
Feature activation+1.19e-01

Pos logit contributions
 amaz+0.048
lee+0.045
 mesmer+0.043
 ago+0.041
 spec+0.040
Neg logit contributions
P-0.064
w-0.056
His-0.054
ercise-0.054
Spot-0.052
Tokeneddy
Token ID21874
Feature activation+2.76e-01

Pos logit contributions
 amaz+0.212
 awe+0.188
 mesmer+0.183
lee+0.182
 spec+0.181
Neg logit contributions
P-0.191
His-0.179
w-0.162
 helmet-0.151
mar-0.147
Token bear
Token ID6842
Feature activation+6.85e-02

Pos logit contributions
lee+0.412
 amaz+0.401
 spec+0.385
keeping+0.380
 angles+0.366
Neg logit contributions
P-0.353
His-0.340
OK-0.305
Move-0.304
Wait-0.304
Token and
Token ID290
Feature activation+1.82e-01

Pos logit contributions
 ago+0.081
lee+0.080
 amaz+0.078
keeping+0.074
od+0.073
Neg logit contributions
P-0.113
His-0.111
Br-0.100
OK-0.100
ercise-0.099
Token watched
Token ID7342
Feature activation+1.87e-01

Pos logit contributions
 amaz+0.293
lee+0.274
 spec+0.259
 ago+0.255
 angles+0.250
Neg logit contributions
P-0.238
His-0.222
ercise-0.199
Spot-0.194
w-0.190
Token in
Token ID287
Feature activation+1.26e-01

Pos logit contributions
 amaz+0.269
lee+0.259
od+0.242
 spec+0.239
 ago+0.234
Neg logit contributions
P-0.277
His-0.253
ercise-0.238
 helmet-0.229
w-0.228
Token amaz
Token ID40642
Feature activation+5.73e-01

Pos logit contributions
lee+0.164
 amaz+0.156
od+0.153
 spec+0.147
 mesmer+0.142
Neg logit contributions
P-0.178
His-0.177
 helmet-0.163
 his-0.162
ercise-0.157
Tokenement
Token ID972
Feature activation+3.42e-01

Pos logit contributions
 amaz+0.455
lee+0.413
 spec+0.361
od+0.356
 ago+0.350
Neg logit contributions
P-1.180
His-1.141
ercise-1.075
 helmet-1.059
Spot-1.046
Token as
Token ID355
Feature activation+1.63e-01

Pos logit contributions
 amaz+0.496
lee+0.480
od+0.451
 ago+0.448
 spec+0.443
Neg logit contributions
P-0.494
His-0.456
ercise-0.423
 helmet-0.406
Spot-0.406
Token it
Token ID340
Feature activation+2.10e-01

Pos logit contributions
 amaz+0.260
lee+0.249
 spec+0.233
 ago+0.231
od+0.231
Neg logit contributions
P-0.202
His-0.193
ercise-0.171
 helmet-0.170
Spot-0.163
Token shrunk
Token ID38900
Feature activation+1.44e-01

Pos logit contributions
 amaz+0.269
lee+0.248
 spec+0.247
od+0.239
 mesmer+0.234
Neg logit contributions
P-0.345
His-0.321
ercise-0.303
 helmet-0.292
Spot-0.290
Token an
Token ID281
Feature activation+6.21e-02

Pos logit contributions
lee+0.181
 amaz+0.180
 angles+0.176
 spec+0.174
od+0.158
Neg logit contributions
P-0.245
His-0.231
ercise-0.219
Sorry-0.213
Spot-0.206
Tokenered
Token ID1068
Feature activation+1.57e-01

Pos logit contributions
 amaz+0.309
lee+0.301
od+0.279
 angles+0.279
 spec+0.276
Neg logit contributions
P-0.286
His-0.275
Spot-0.250
ercise-0.248
 helmet-0.246
Token in
Token ID287
Feature activation+1.37e-01

Pos logit contributions
 amaz+0.239
lee+0.231
 spec+0.214
 angles+0.213
 mesmer+0.211
Neg logit contributions
P-0.217
His-0.199
ercise-0.187
Spot-0.182
w-0.180
Token,
Token ID11
Feature activation+6.12e-02

Pos logit contributions
 amaz+0.225
lee+0.224
od+0.213
 spec+0.210
 ago+0.205
Neg logit contributions
P-0.153
His-0.149
 helmet-0.129
ercise-0.127
 his-0.123
Token blinking
Token ID43196
Feature activation+3.47e-01

Pos logit contributions
 amaz+0.086
lee+0.084
 mesmer+0.082
 spec+0.077
 ago+0.077
Neg logit contributions
P-0.093
ercise-0.081
His-0.080
w-0.077
ww-0.075
Token in
Token ID287
Feature activation+1.23e-01

Pos logit contributions
 amaz+0.526
lee+0.520
 mesmer+0.472
 spec+0.465
 angles+0.452
Neg logit contributions
P-0.524
Spot-0.460
ercise-0.456
Deal-0.448
uch-0.446
Token amaz
Token ID40642
Feature activation+5.72e-01

Pos logit contributions
 amaz+0.188
lee+0.184
od+0.173
 spec+0.172
 ago+0.170
Neg logit contributions
P-0.155
His-0.150
 helmet-0.132
ercise-0.132
Spot-0.126
Tokenement
Token ID972
Feature activation+3.47e-01

Pos logit contributions
 amaz+0.507
lee+0.467
 spec+0.421
 ago+0.415
od+0.411
Neg logit contributions
P-1.106
His-1.075
 helmet-0.997
ercise-0.992
w-0.958
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
 amaz+0.516
lee+0.496
 ago+0.464
od+0.462
 spec+0.461
Neg logit contributions
P-0.482
His-0.448
ercise-0.415
 helmet-0.399
Spot-0.398
Token
Token ID198
Feature activation+1.44e-01

Pos logit contributions
 amaz+0.211
lee+0.200
 ago+0.195
 mesmer+0.185
 spec+0.183
Neg logit contributions
P-0.207
w-0.187
ercise-0.185
His-0.178
 helmet-0.177
Token
Token ID198
Feature activation+1.41e-01

Pos logit contributions
 amaz+0.212
lee+0.204
 ago+0.190
od+0.189
 spec+0.186
Neg logit contributions
P-0.192
His-0.179
ercise-0.174
 helmet-0.169
w-0.163
TokenBut
Token ID1537
Feature activation-1.99e-02

Pos logit contributions
 amaz+0.223
lee+0.211
 mesmer+0.208
od+0.206
 ago+0.203
Neg logit contributions
 helmet-0.151
ercise-0.147
 down-0.144
w-0.144
 melts-0.144
Token again
Token ID757
Feature activation+2.55e-01

Pos logit contributions
P+0.017
His+0.016
 his+0.014
 helmet+0.013
Spot+0.012
Neg logit contributions
 amaz-0.031
lee-0.029
 spec-0.028
 mesmer-0.027
 angles-0.027
Token and
Token ID290
Feature activation+2.64e-01

Pos logit contributions
 amaz+0.359
lee+0.345
 ago+0.345
 mesmer+0.323
od+0.319
Neg logit contributions
P-0.363
His-0.349
ercise-0.325
Spot-0.312
 helmet-0.308
Token again
Token ID757
Feature activation+1.82e-01

Pos logit contributions
 amaz+0.422
lee+0.391
 spec+0.371
 ago+0.370
 angles+0.369
Neg logit contributions
P-0.349
His-0.323
ercise-0.286
Spot-0.279
 helmet-0.277
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
 amaz+0.249
 ago+0.237
lee+0.230
 mesmer+0.215
od+0.213
Neg logit contributions
P-0.287
His-0.263
ercise-0.255
Mid-0.251
Wh-0.251
Token J
Token ID449
Feature activation+4.17e-01

Pos logit contributions
 amaz+0.284
 ago+0.272
lee+0.270
od+0.245
 mesmer+0.241
Neg logit contributions
P-0.311
w-0.284
ercise-0.278
 helmet-0.274
His-0.269
Tokenod
Token ID375
Feature activation+5.67e-01

Pos logit contributions
 amaz+0.460
lee+0.401
 spec+0.391
 ago+0.374
 angles+0.371
Neg logit contributions
P-0.743
His-0.709
ercise-0.656
 helmet-0.646
w-0.638
Tokenie
Token ID494
Feature activation+1.79e-01

Pos logit contributions
lee+0.789
 amaz+0.786
od+0.761
 angles+0.716
 spec+0.689
Neg logit contributions
P-0.770
His-0.764
Spot-0.710
 helmet-0.708
ercise-0.698
Token would
Token ID561
Feature activation+6.33e-02

Pos logit contributions
 amaz+0.248
lee+0.245
 mesmer+0.219
 spec+0.218
 angles+0.217
Neg logit contributions
P-0.266
His-0.250
ercise-0.227
Spot-0.225
 helmet-0.223
Token always
Token ID1464
Feature activation+1.54e-01

Pos logit contributions
 amaz+0.091
lee+0.084
 ago+0.079
 spec+0.079
od+0.079
Neg logit contributions
P-0.093
His-0.085
ercise-0.078
Spot-0.076
w-0.075
Token remember
Token ID3505
Feature activation+1.61e-01

Pos logit contributions
 amaz+0.217
lee+0.186
 ago+0.181
 angles+0.181
 awe+0.179
Neg logit contributions
P-0.243
His-0.219
O-0.207
Spot-0.204
w-0.200
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
 amaz+0.224
lee+0.212
 ago+0.194
 angles+0.193
od+0.193
Neg logit contributions
P-0.249
His-0.227
ercise-0.215
Spot-0.212
Move-0.210
Token animals
Token ID4695
Feature activation+1.85e-01

Pos logit contributions
 amaz+0.234
lee+0.229
 spec+0.215
 angles+0.209
od+0.202
Neg logit contributions
P-0.304
His-0.298
ercise-0.275
 helmet-0.264
Mid-0.261
Token were
Token ID547
Feature activation+2.26e-01

Pos logit contributions
 amaz+0.234
lee+0.227
 ago+0.218
od+0.213
 angles+0.212
Neg logit contributions
P-0.297
His-0.280
ercise-0.254
O-0.252
Mid-0.252
Token watching
Token ID4964
Feature activation+2.51e-01

Pos logit contributions
 amaz+0.282
lee+0.276
 mesmer+0.265
 ago+0.261
od+0.254
Neg logit contributions
P-0.364
His-0.348
ercise-0.329
w-0.315
OK-0.314
Token him
Token ID683
Feature activation+1.14e-02

Pos logit contributions
 amaz+0.304
lee+0.294
od+0.280
 spec+0.277
 mesmer+0.259
Neg logit contributions
P-0.461
His-0.409
w-0.383
ercise-0.382
OK-0.380
Token with
Token ID351
Feature activation+9.88e-02

Pos logit contributions
 amaz+0.015
od+0.014
lee+0.013
 mesmer+0.013
 ago+0.012
Neg logit contributions
P-0.021
His-0.018
OK-0.017
Sorry-0.017
O-0.017
Token amaz
Token ID40642
Feature activation+5.64e-01

Pos logit contributions
 amaz+0.163
lee+0.147
 ago+0.141
 spec+0.140
 angles+0.139
Neg logit contributions
P-0.136
His-0.123
ercise-0.112
w-0.108
OK-0.107
Tokenement
Token ID972
Feature activation+3.38e-01

Pos logit contributions
 amaz+0.338
lee+0.310
od+0.263
 spec+0.255
 ago+0.233
Neg logit contributions
P-1.283
His-1.227
Spot-1.192
Mid-1.187
Br-1.179
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 amaz+0.459
lee+0.452
 ago+0.436
od+0.423
 spec+0.407
Neg logit contributions
P-0.532
His-0.482
ercise-0.448
O-0.441
Spot-0.441
Token
Token ID220
Feature activation+1.51e-01

Pos logit contributions
 amaz+0.196
lee+0.187
 ago+0.182
 spec+0.175
od+0.173
Neg logit contributions
P-0.200
w-0.178
His-0.176
ercise-0.172
 helmet-0.170
Token
Token ID198
Feature activation+1.31e-01

Pos logit contributions
 amaz+0.198
lee+0.195
od+0.186
 ago+0.184
 spec+0.173
Neg logit contributions
P-0.228
His-0.207
ercise-0.205
 helmet-0.200
w-0.192
Token
Token ID198
Feature activation+1.28e-01

Pos logit contributions
 amaz+0.169
lee+0.163
od+0.157
 ago+0.153
 mesmer+0.151
Neg logit contributions
P-0.188
ercise-0.180
 helmet-0.176
His-0.172
w-0.168
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
His+0.005
P+0.005
OK+0.004
 helmet+0.004
Spot+0.004
Neg logit contributions
lee-0.004
 amaz-0.004
 ago-0.003
od-0.003
ents-0.003
Token The
Token ID383
Feature activation+2.99e-01

Pos logit contributions
 ago+0.369
 amaz+0.361
lee+0.353
 mesmer+0.314
od+0.307
Neg logit contributions
P-0.429
w-0.396
ercise-0.384
 helmet-0.365
His-0.360
Token child
Token ID1200
Feature activation+4.18e-01

Pos logit contributions
 amaz+0.420
lee+0.385
 ago+0.375
 spec+0.373
 angles+0.359
Neg logit contributions
P-0.426
His-0.425
ercise-0.382
mar-0.364
erve-0.363
Token was
Token ID373
Feature activation+1.69e-01

Pos logit contributions
lee+0.545
 amaz+0.538
 mesmer+0.482
od+0.481
 spec+0.480
Neg logit contributions
P-0.676
His-0.608
ercise-0.572
Br-0.562
 helmet-0.555
Token so
Token ID523
Feature activation+3.10e-01

Pos logit contributions
 amaz+0.241
lee+0.229
 mesmer+0.216
 ago+0.213
 spec+0.212
Neg logit contributions
P-0.246
His-0.229
ercise-0.216
w-0.205
 helmet-0.205
Token mesmer
Token ID46814
Feature activation+5.57e-01

Pos logit contributions
 amaz+0.375
lee+0.345
 mesmer+0.343
od+0.318
 ago+0.317
Neg logit contributions
P-0.528
His-0.485
ercise-0.485
 helmet-0.464
Mid-0.450
Tokenized
Token ID1143
Feature activation+2.80e-01

Pos logit contributions
 amaz+0.633
lee+0.603
 spec+0.547
od+0.542
 angles+0.537
Neg logit contributions
P-0.948
His-0.913
ercise-0.849
 helmet-0.839
Spot-0.818
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
 amaz+0.516
lee+0.487
 ago+0.463
od+0.455
 mesmer+0.450
Neg logit contributions
P-0.284
His-0.270
 his-0.236
ercise-0.231
 helmet-0.226
Token "
Token ID366
Feature activation+1.46e-01

Pos logit contributions
 amaz+0.245
lee+0.238
 ago+0.230
 mesmer+0.214
od+0.213
Neg logit contributions
P-0.242
w-0.223
ercise-0.217
 helmet-0.210
His-0.208
TokenCan
Token ID6090
Feature activation+1.12e-02

Pos logit contributions
 amaz+0.243
 mesmer+0.223
od+0.217
 spec+0.216
lee+0.209
Neg logit contributions
 helmet-0.157
P-0.156
His-0.149
ercise-0.145
Mid-0.138
Token I
Token ID314
Feature activation+1.62e-01

Pos logit contributions
 amaz+0.013
lee+0.012
 spec+0.012
od+0.011
 ago+0.011
Neg logit contributions
P-0.018
His-0.017
ercise-0.016
 helmet-0.015
Mid-0.015
Token rabbit
Token ID22746
Feature activation+1.65e-01

Pos logit contributions
 amaz+0.307
lee+0.279
 spec+0.273
 ago+0.258
 angles+0.258
Neg logit contributions
His-0.363
P-0.359
ercise-0.336
erve-0.325
Move-0.318
Token be
Token ID307
Feature activation+7.80e-02

Pos logit contributions
 amaz+0.221
lee+0.216
 mesmer+0.194
 ago+0.188
od+0.186
Neg logit contributions
P-0.270
His-0.243
O-0.227
w-0.224
ww-0.223
Tokenamed
Token ID2434
Feature activation+1.85e-01

Pos logit contributions
lee+0.109
 amaz+0.106
 ago+0.097
 mesmer+0.096
od+0.096
Neg logit contributions
His-0.107
P-0.104
ercise-0.102
OK-0.100
 helmet-0.100
Token with
Token ID351
Feature activation+1.06e-01

Pos logit contributions
 amaz+0.287
lee+0.276
 mesmer+0.250
 spec+0.250
od+0.249
Neg logit contributions
P-0.250
His-0.232
ercise-0.216
w-0.209
 helmet-0.208
Token g
Token ID308
Feature activation+2.60e-01

Pos logit contributions
 ago+0.126
od+0.125
 outage+0.124
 spec+0.124
lee+0.122
Neg logit contributions
P-0.162
His-0.159
 his-0.156
ercise-0.146
 helmet-0.144
Tokenlee
Token ID7197
Feature activation+5.57e-01

Pos logit contributions
 amaz+0.080
 spec+0.034
lee+0.028
 angles+0.026
 ago+0.023
Neg logit contributions
P-0.658
His-0.643
ercise-0.608
 helmet-0.597
w-0.590
Token as
Token ID355
Feature activation+1.59e-01

Pos logit contributions
 amaz+0.809
lee+0.800
 ago+0.739
 spec+0.733
od+0.731
Neg logit contributions
P-0.768
His-0.729
ercise-0.657
 helmet-0.643
Move-0.629
Token they
Token ID484
Feature activation+1.32e-01

Pos logit contributions
 amaz+0.243
lee+0.230
 spec+0.217
od+0.215
 ago+0.214
Neg logit contributions
P-0.209
His-0.199
ercise-0.180
 helmet-0.177
Spot-0.170
Token walked
Token ID6807
Feature activation+2.73e-03

Pos logit contributions
lee+0.194
 amaz+0.193
 mesmer+0.170
 ago+0.169
od+0.168
Neg logit contributions
P-0.177
His-0.177
 helmet-0.158
ercise-0.158
Move-0.153
Token side
Token ID1735
Feature activation+2.03e-01

Pos logit contributions
lee+0.004
 amaz+0.004
 ago+0.004
keeping+0.004
 angles+0.004
Neg logit contributions
P-0.003
His-0.003
Move-0.003
O-0.003
w-0.003
Token by
Token ID416
Feature activation+5.75e-02

Pos logit contributions
 amaz+0.377
 spec+0.345
lee+0.344
od+0.337
 mesmer+0.328
Neg logit contributions
P-0.200
His-0.188
 his-0.166
 away-0.165
 down-0.165
Token them
Token ID606
Feature activation+2.59e-01

Pos logit contributions
P+0.250
His+0.222
ercise+0.214
Br+0.211
Mid+0.206
Neg logit contributions
 amaz-0.190
 ago-0.183
od-0.183
 spec-0.174
lee-0.173
Token into
Token ID656
Feature activation+2.91e-01

Pos logit contributions
 amaz+0.319
 ago+0.317
lee+0.313
 angles+0.295
 mesmer+0.293
Neg logit contributions
P-0.434
His-0.402
O-0.389
Wh-0.370
Bug-0.358
Token beautiful
Token ID4950
Feature activation+3.38e-01

Pos logit contributions
 amaz+0.464
lee+0.445
 spec+0.418
od+0.416
 ago+0.412
Neg logit contributions
P-0.362
His-0.344
ercise-0.306
 helmet-0.302
Spot-0.291
Token or
Token ID393
Feature activation+1.88e-01

Pos logit contributions
 amaz+0.473
lee+0.444
 spec+0.418
 ago+0.410
 angles+0.409
Neg logit contributions
P-0.505
His-0.488
ercise-0.441
 helmet-0.428
Spot-0.426
Tokennam
Token ID7402
Feature activation+3.18e-01

Pos logit contributions
 amaz+0.293
lee+0.260
 spec+0.259
 ago+0.258
 angles+0.254
Neg logit contributions
P-0.270
His-0.246
ercise-0.217
O-0.215
Spot-0.213
Tokenents
Token ID658
Feature activation+5.56e-01

Pos logit contributions
 amaz+0.319
 horror+0.284
 spec+0.255
lee+0.255
 awe+0.233
Neg logit contributions
P-0.619
His-0.603
w-0.565
One-0.513
z-0.508
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
 amaz+0.717
lee+0.708
 ago+0.704
od+0.678
 angles+0.656
Neg logit contributions
P-0.865
His-0.800
ercise-0.751
Mid-0.717
Br-0.715
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
 ago+0.257
 amaz+0.256
lee+0.255
 spec+0.223
od+0.223
Neg logit contributions
P-0.273
w-0.263
ercise-0.248
 helmet-0.239
uch-0.238
Token
Token ID198
Feature activation+1.80e-01

Pos logit contributions
 amaz+0.279
lee+0.267
 ago+0.249
od+0.248
 spec+0.246
Neg logit contributions
P-0.246
His-0.232
ercise-0.219
 helmet-0.213
w-0.204
TokenOne
Token ID3198
Feature activation-7.37e-02

Pos logit contributions
 amaz+0.293
 ago+0.278
lee+0.271
od+0.270
 mesmer+0.270
Neg logit contributions
 down-0.178
 melts-0.175
 helmet-0.172
ercise-0.168
 away-0.163
Token day
Token ID1110
Feature activation+1.18e-01

Pos logit contributions
P+0.105
His+0.100
ercise+0.094
Mid+0.089
Spot+0.089
Neg logit contributions
 amaz-0.106
lee-0.097
 ago-0.095
 spec-0.093
 mesmer-0.092
Token The
Token ID383
Feature activation+2.15e-01

Pos logit contributions
 amaz+0.240
lee+0.235
 ago+0.220
 spec+0.213
od+0.211
Neg logit contributions
P-0.208
His-0.194
ercise-0.186
w-0.184
 helmet-0.182
Token little
Token ID1310
Feature activation+2.02e-01

Pos logit contributions
 amaz+0.304
 spec+0.278
lee+0.275
 ago+0.273
 angles+0.257
Neg logit contributions
His-0.311
P-0.301
ercise-0.273
mar-0.260
erve-0.258
Token girl
Token ID2576
Feature activation+1.11e-01

Pos logit contributions
 amaz+0.259
 spec+0.252
lee+0.240
 ago+0.232
 angles+0.224
Neg logit contributions
His-0.332
P-0.313
ercise-0.289
mar-0.273
w-0.271
Token watched
Token ID7342
Feature activation+1.64e-01

Pos logit contributions
 amaz+0.158
lee+0.158
od+0.145
 mesmer+0.144
 spec+0.144
Neg logit contributions
P-0.158
His-0.149
ercise-0.139
 helmet-0.137
Spot-0.131
Token with
Token ID351
Feature activation+8.30e-02

Pos logit contributions
 amaz+0.245
lee+0.238
od+0.222
 spec+0.221
 ago+0.214
Neg logit contributions
P-0.238
His-0.216
ercise-0.202
w-0.196
 helmet-0.193
Token amaz
Token ID40642
Feature activation+5.49e-01

Pos logit contributions
lee+0.149
od+0.148
ous+0.140
oke+0.139
 outage+0.138
Neg logit contributions
 his-0.079
His-0.072
 helmet-0.071
P-0.067
 down-0.063
Tokenement
Token ID972
Feature activation+3.24e-01

Pos logit contributions
 amaz+0.388
lee+0.352
od+0.306
 spec+0.301
 angles+0.283
Neg logit contributions
P-1.177
His-1.137
ercise-1.079
Spot-1.071
 helmet-1.056
Token when
Token ID618
Feature activation+9.13e-02

Pos logit contributions
 amaz+0.513
lee+0.489
 spec+0.460
od+0.456
 ago+0.455
Neg logit contributions
P-0.411
His-0.390
ercise-0.351
 helmet-0.344
Spot-0.331
Token her
Token ID607
Feature activation+5.47e-03

Pos logit contributions
 amaz+0.130
lee+0.119
 spec+0.114
 ago+0.111
 mesmer+0.110
Neg logit contributions
P-0.142
His-0.131
ercise-0.129
w-0.121
 helmet-0.118
Token mom
Token ID1995
Feature activation+1.12e-01

Pos logit contributions
 amaz+0.008
lee+0.008
 spec+0.007
 ago+0.007
od+0.007
Neg logit contributions
P-0.008
His-0.008
ercise-0.007
 helmet-0.006
Spot-0.006
Tokenmy
Token ID1820
Feature activation-1.31e-02

Pos logit contributions
 amaz+0.177
lee+0.173
od+0.160
 ago+0.160
 spec+0.160
Neg logit contributions
P-0.144
His-0.136
ercise-0.123
 helmet-0.120
Spot-0.115
Token.
Token ID13
Feature activation-5.08e-02

Pos logit contributions
 amaz+0.066
lee+0.063
 spec+0.059
od+0.059
 ago+0.058
Neg logit contributions
P-0.056
His-0.053
ercise-0.048
 helmet-0.047
Spot-0.046
Token It
Token ID632
Feature activation+2.35e-02

Pos logit contributions
P+0.062
His+0.061
Spot+0.052
Move+0.050
ercise+0.050
Neg logit contributions
 amaz-0.081
lee-0.079
 angles-0.075
 spec-0.074
od-0.072
Token said
Token ID531
Feature activation-6.98e-02

Pos logit contributions
 amaz+0.037
lee+0.035
 spec+0.033
od+0.033
 ago+0.033
Neg logit contributions
P-0.030
His-0.028
ercise-0.026
 helmet-0.025
Spot-0.024
Token:
Token ID25
Feature activation+5.61e-02

Pos logit contributions
P+0.131
ercise+0.119
His+0.118
w+0.115
Mid+0.115
Neg logit contributions
 amaz-0.072
lee-0.070
 ago-0.067
od-0.065
 mesmer-0.059
Token "
Token ID366
Feature activation-7.27e-02

Pos logit contributions
 amaz+0.070
 ago+0.070
lee+0.065
od+0.062
 mesmer+0.060
Neg logit contributions
P-0.094
ercise-0.083
w-0.083
His-0.081
 helmet-0.080
TokenP
Token ID47
Feature activation-4.08e-01

Pos logit contributions
 helmet+0.097
ercise+0.086
His+0.084
 boots+0.082
P+0.080
Neg logit contributions
 amaz-0.100
 ago-0.097
od-0.092
keeping-0.092
 spec-0.090
Tokenond
Token ID623
Feature activation+1.61e-02

Pos logit contributions
P+0.651
His+0.649
Mid+0.586
 helmet+0.585
ercise+0.574
Neg logit contributions
lee-0.513
ents-0.505
 amaz-0.496
 ago-0.488
od-0.473
Token closed
Token ID4838
Feature activation-8.82e-02

Pos logit contributions
lee+0.025
 amaz+0.024
od+0.023
 angles+0.023
 ago+0.022
Neg logit contributions
P-0.021
His-0.020
ercise-0.018
Mid-0.018
Br-0.018
Token.
Token ID13
Feature activation-5.49e-02

Pos logit contributions
P+0.140
His+0.130
ercise+0.119
Mid+0.114
Br+0.114
Neg logit contributions
 amaz-0.120
lee-0.117
 ago-0.112
 angles-0.108
 spec-0.105
Token Come
Token ID7911
Feature activation-1.51e-01

Pos logit contributions
P+0.077
His+0.073
ercise+0.068
 helmet+0.067
w+0.064
Neg logit contributions
 amaz-0.079
lee-0.074
 ago-0.072
 spec-0.070
od-0.069
Token back
Token ID736
Feature activation-2.22e-01

Pos logit contributions
P+0.208
His+0.203
ercise+0.190
Br+0.176
Mid+0.175
Neg logit contributions
 amaz-0.209
 ago-0.203
lee-0.189
 of-0.188
 rooft-0.181
Token to
Token ID284
Feature activation-6.81e-02

Pos logit contributions
 ago+0.006
lee+0.005
 of+0.005
ocket+0.005
od+0.005
Neg logit contributions
His-0.009
P-0.009
Br-0.009
Move-0.009
Spot-0.008
Token regret
Token ID13721
Feature activation+1.12e-01

Pos logit contributions
P+0.102
His+0.094
ercise+0.090
Move+0.085
Spot+0.084
Neg logit contributions
 amaz-0.091
lee-0.084
 ago-0.081
od-0.078
 angles-0.077
Token."
Token ID526
Feature activation-6.72e-02

Pos logit contributions
lee+0.154
 amaz+0.148
od+0.145
 ago+0.145
 angles+0.138
Neg logit contributions
P-0.171
His-0.155
ercise-0.142
OK-0.140
Wait-0.140
Token
Token ID198
Feature activation-5.11e-02

Pos logit contributions
P+0.095
His+0.092
ercise+0.089
 helmet+0.088
w+0.085
Neg logit contributions
 amaz-0.092
lee-0.091
 spec-0.081
od-0.081
 ago-0.080
Token
Token ID198
Feature activation-5.14e-02

Pos logit contributions
P+0.060
His+0.058
Spot+0.047
ercise+0.046
O+0.046
Neg logit contributions
 amaz-0.090
lee-0.083
 spec-0.082
 ago-0.080
 angles-0.080
TokenP
Token ID47
Feature activation-4.07e-01

Pos logit contributions
 helmet+0.031
ercise+0.028
 down+0.027
His+0.024
P+0.022
Neg logit contributions
 amaz-0.105
lee-0.104
 mesmer-0.101
od-0.101
 angles-0.096
Tokenete
Token ID14471
Feature activation+1.71e-02

Pos logit contributions
P+0.716
His+0.715
 helmet+0.681
ercise+0.669
Spot+0.650
Neg logit contributions
 amaz-0.415
lee-0.411
od-0.360
 ago-0.341
 angles-0.335
Token was
Token ID373
Feature activation-9.05e-02

Pos logit contributions
lee+0.020
 amaz+0.019
 mesmer+0.017
od+0.017
 spec+0.016
Neg logit contributions
P-0.029
ercise-0.026
Move-0.025
His-0.025
 helmet-0.025
Token sad
Token ID6507
Feature activation-5.25e-02

Pos logit contributions
P+0.121
His+0.117
ercise+0.104
 helmet+0.103
Spot+0.100
Neg logit contributions
 amaz-0.136
lee-0.130
 spec-0.122
od-0.121
 ago-0.120
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
P+0.087
ercise+0.077
His+0.075
w+0.072
Spot+0.071
Neg logit contributions
 amaz-0.069
lee-0.066
 ago-0.065
 mesmer-0.060
od-0.059
Token but
Token ID475
Feature activation-1.02e-01

Pos logit contributions
P+0.282
ercise+0.256
w+0.250
His+0.233
use+0.232
Neg logit contributions
 amaz-0.203
lee-0.185
 mesmer-0.184
 awe-0.173
 spec-0.171
Token own
Token ID898
Feature activation-8.02e-02

Pos logit contributions
P+0.252
His+0.230
ercise+0.216
Mid+0.212
Move+0.205
Neg logit contributions
 amaz-0.266
 ago-0.238
 spec-0.232
lee-0.230
 angles-0.228
Token coin
Token ID10752
Feature activation-1.21e-01

Pos logit contributions
P+0.123
His+0.109
Mid+0.107
ercise+0.105
Sorry+0.102
Neg logit contributions
 amaz-0.116
 ago-0.101
 angles-0.099
 spec-0.098
 awe-0.096
Token.
Token ID13
Feature activation-5.53e-02

Pos logit contributions
P+0.221
Move+0.207
Br+0.206
His+0.201
ercise+0.198
Neg logit contributions
 ago-0.119
 amaz-0.115
keeping-0.112
od-0.111
lee-0.109
Token
Token ID198
Feature activation-5.29e-02

Pos logit contributions
P+0.068
His+0.064
w+0.060
 helmet+0.060
ercise+0.059
Neg logit contributions
 amaz-0.087
lee-0.086
od-0.079
 ago-0.078
 angles-0.077
Token
Token ID198
Feature activation-5.32e-02

Pos logit contributions
P+0.065
His+0.061
ercise+0.053
 helmet+0.052
Spot+0.052
Neg logit contributions
 amaz-0.088
lee-0.083
 spec-0.079
 ago-0.078
od-0.077
TokenP
Token ID47
Feature activation-4.07e-01

Pos logit contributions
 helmet+0.037
 down+0.035
ercise+0.032
 melts+0.032
 helmets+0.031
Neg logit contributions
 amaz-0.097
lee-0.096
od-0.096
 angles-0.091
ents-0.090
Tokenipp
Token ID3974
Feature activation-8.69e-02

Pos logit contributions
His+0.561
P+0.550
 helmet+0.522
ercise+0.500
Spot+0.498
Neg logit contributions
lee-0.583
 amaz-0.579
od-0.541
ents-0.513
 ago-0.508
Tokena
Token ID64
Feature activation-1.04e-01

Pos logit contributions
P+0.105
His+0.101
ercise+0.095
 helmet+0.094
Spot+0.089
Neg logit contributions
 amaz-0.134
lee-0.132
od-0.126
 spec-0.123
 angles-0.123
Token was
Token ID373
Feature activation-9.14e-02

Pos logit contributions
P+0.153
His+0.148
Br+0.142
 helmet+0.141
ercise+0.140
Neg logit contributions
lee-0.136
 amaz-0.135
od-0.124
 mesmer-0.118
ents-0.116
Token very
Token ID845
Feature activation-9.80e-02

Pos logit contributions
P+0.138
His+0.128
ercise+0.120
w+0.116
 helmet+0.116
Neg logit contributions
 amaz-0.128
lee-0.119
 mesmer-0.112
 ago-0.110
 spec-0.110
Token unhappy
Token ID19283
Feature activation-8.66e-02

Pos logit contributions
P+0.164
ercise+0.150
His+0.149
Spot+0.144
 helmet+0.142
Neg logit contributions
 amaz-0.120
lee-0.109
 mesmer-0.106
 ago-0.100
 impressed-0.099
Token says
Token ID1139
Feature activation-1.68e-02

Pos logit contributions
P+0.162
His+0.150
ercise+0.140
w+0.136
 helmet+0.135
Neg logit contributions
 amaz-0.112
lee-0.108
 spec-0.102
od-0.101
 angles-0.099
Token.
Token ID13
Feature activation-6.09e-02

Pos logit contributions
P+0.021
His+0.020
ercise+0.018
 helmet+0.018
Spot+0.017
Neg logit contributions
 amaz-0.027
lee-0.026
 spec-0.024
od-0.024
 ago-0.024
Token
Token ID198
Feature activation-5.19e-02

Pos logit contributions
P+0.089
w+0.086
ercise+0.085
use+0.077
uch+0.077
Neg logit contributions
lee-0.077
 amaz-0.077
 ago-0.072
 spec-0.071
 mesmer-0.069
Token
Token ID198
Feature activation-5.27e-02

Pos logit contributions
P+0.072
His+0.066
ercise+0.065
 helmet+0.063
w+0.061
Neg logit contributions
 amaz-0.073
lee-0.071
 spec-0.065
 ago-0.065
od-0.065
Token"
Token ID1
Feature activation-2.61e-01

Pos logit contributions
ercise+0.049
w+0.049
P+0.048
 helmet+0.047
use+0.046
Neg logit contributions
lee-0.084
 amaz-0.083
 ago-0.080
 of-0.078
 angles-0.078
TokenP
Token ID47
Feature activation-4.06e-01

Pos logit contributions
P+0.246
 helmet+0.236
His+0.234
ercise+0.227
erve+0.222
Neg logit contributions
 amaz-0.447
 spec-0.430
 mesmer-0.419
od-0.411
 angles-0.398
Tokenow
Token ID322
Feature activation-3.37e-02

Pos logit contributions
P+0.617
His+0.604
 helmet+0.561
Mid+0.553
Hi+0.545
Neg logit contributions
lee-0.529
ents-0.525
 amaz-0.520
od-0.511
 ago-0.467
Token!
Token ID0
Feature activation-3.74e-02

Pos logit contributions
His+0.054
P+0.053
Br+0.048
ercise+0.048
 helmet+0.047
Neg logit contributions
lee-0.043
od-0.040
ous-0.038
 amaz-0.037
oke-0.036
Token Pow
Token ID14120
Feature activation-1.98e-01

Pos logit contributions
P+0.067
ercise+0.057
His+0.057
w+0.056
 helmet+0.054
Neg logit contributions
 amaz-0.045
 ago-0.040
lee-0.040
od-0.039
 spec-0.039
Token!
Token ID0
Feature activation-3.84e-02

Pos logit contributions
His+0.344
P+0.326
ercise+0.318
 helmet+0.300
Br+0.300
Neg logit contributions
lee-0.220
od-0.219
 amaz-0.215
 spec-0.195
 ago-0.190
Token I
Token ID314
Feature activation-4.74e-02

Pos logit contributions
P+0.061
His+0.054
ercise+0.052
 helmet+0.051
w+0.050
Neg logit contributions
 amaz-0.052
lee-0.046
 ago-0.046
 spec-0.046
od-0.046
Token oven
Token ID14361
Feature activation+3.10e-02

Pos logit contributions
 amaz+0.003
lee+0.003
 spec+0.002
 ago+0.002
 angles+0.002
Neg logit contributions
His-0.006
P-0.006
mar-0.005
w-0.005
ercise-0.005
Token.
Token ID13
Feature activation-5.68e-02

Pos logit contributions
 amaz+0.042
 ago+0.038
lee+0.038
 spec+0.035
 angles+0.035
Neg logit contributions
P-0.049
His-0.047
ercise-0.043
Spot-0.042
O-0.041
Token
Token ID220
Feature activation-3.54e-02

Pos logit contributions
P+0.076
His+0.071
ercise+0.065
 helmet+0.062
Mid+0.061
Neg logit contributions
 amaz-0.087
lee-0.081
od-0.079
 spec-0.079
 ago-0.078
Token
Token ID198
Feature activation-5.38e-02

Pos logit contributions
P+0.045
His+0.043
ercise+0.039
 helmet+0.038
Spot+0.037
Neg logit contributions
 amaz-0.055
lee-0.053
 spec-0.049
od-0.049
 ago-0.049
Token
Token ID198
Feature activation-5.41e-02

Pos logit contributions
P+0.063
His+0.059
ercise+0.048
Spot+0.048
Mid+0.047
Neg logit contributions
 amaz-0.095
lee-0.089
 spec-0.086
 angles-0.084
 ago-0.084
TokenP
Token ID47
Feature activation-4.06e-01

Pos logit contributions
P+0.038
His+0.035
ercise+0.029
 helmet+0.029
Spot+0.026
Neg logit contributions
 amaz-0.114
lee-0.111
 spec-0.105
od-0.105
 ago-0.105
Tokenenny
Token ID11870
Feature activation+4.76e-02

Pos logit contributions
P+0.678
His+0.633
ercise+0.594
 helmet+0.574
w+0.565
Neg logit contributions
 amaz-0.496
lee-0.451
 spec-0.436
 angles-0.422
 ago-0.422
Token and
Token ID290
Feature activation-4.88e-02

Pos logit contributions
lee+0.069
 amaz+0.067
od+0.061
 mesmer+0.060
 spec+0.058
Neg logit contributions
P-0.066
His-0.062
ercise-0.058
 helmet-0.058
Move-0.056
Token her
Token ID607
Feature activation-1.82e-01

Pos logit contributions
P+0.034
His+0.033
 his+0.026
 helmet+0.025
ercise+0.023
Neg logit contributions
 amaz-0.105
 spec-0.102
 ago-0.101
od-0.100
lee-0.099
Token mom
Token ID1995
Feature activation-7.52e-02

Pos logit contributions
P+0.193
His+0.179
 helmet+0.168
Spot+0.157
ercise+0.155
Neg logit contributions
 amaz-0.321
lee-0.301
 spec-0.298
 angles-0.297
 ago-0.297
Tokenmy
Token ID1820
Feature activation-2.03e-01

Pos logit contributions
P+0.068
His+0.063
ercise+0.054
 helmet+0.052
Spot+0.050
Neg logit contributions
 amaz-0.147
lee-0.142
od-0.134
 ago-0.133
 spec-0.133
Token beautiful
Token ID4950
Feature activation+2.74e-02

Pos logit contributions
P+0.183
His+0.177
ercise+0.164
 helmet+0.155
w+0.154
Neg logit contributions
 amaz-0.165
lee-0.151
 spec-0.144
od-0.140
 ago-0.140
Token butterfly
Token ID35113
Feature activation+7.05e-02

Pos logit contributions
 amaz+0.035
lee+0.033
 spec+0.030
 ago+0.030
od+0.030
Neg logit contributions
P-0.044
His-0.042
ercise-0.039
 helmet-0.037
Spot-0.037
Token!
Token ID0
Feature activation-3.34e-02

Pos logit contributions
 amaz+0.116
lee+0.109
 angles+0.104
 ago+0.101
 spec+0.101
Neg logit contributions
P-0.084
His-0.077
 helmet-0.070
ercise-0.069
w-0.067
Token
Token ID198
Feature activation-5.09e-02

Pos logit contributions
P+0.033
His+0.029
Mid+0.025
Spot+0.024
ercise+0.023
Neg logit contributions
 amaz-0.067
 angles-0.062
 ago-0.060
 spec-0.060
od-0.059
Token
Token ID198
Feature activation-5.12e-02

Pos logit contributions
P+0.053
His+0.049
Spot+0.041
O+0.041
Mid+0.039
Neg logit contributions
 amaz-0.102
 spec-0.097
 angles-0.094
 ago-0.094
lee-0.092
TokenP
Token ID47
Feature activation-4.06e-01

Pos logit contributions
P+0.048
His+0.044
ercise+0.037
 helmet+0.035
Spot+0.035
Neg logit contributions
 amaz-0.100
lee-0.096
 spec-0.092
 ago-0.092
 angles-0.091
Tokenixie
Token ID39291
Feature activation+6.17e-02

Pos logit contributions
P+0.586
His+0.557
ercise+0.509
 helmet+0.499
Spot+0.484
Neg logit contributions
 amaz-0.576
lee-0.544
 spec-0.511
 ago-0.505
od-0.504
Token was
Token ID373
Feature activation-9.02e-02

Pos logit contributions
 amaz+0.100
lee+0.092
 spec+0.090
 ago+0.089
 angles+0.088
Neg logit contributions
P-0.075
His-0.072
ercise-0.063
 helmet-0.062
w-0.060
Token so
Token ID523
Feature activation+5.79e-02

Pos logit contributions
His+0.099
 his+0.095
P+0.092
 down+0.088
 away+0.084
Neg logit contributions
 spec-0.138
keeping-0.136
 amaz-0.136
 ago-0.134
lee-0.133
Token happy
Token ID3772
Feature activation-4.18e-02

Pos logit contributions
lee+0.090
 amaz+0.090
 spec+0.089
 ago+0.088
keeping+0.086
Neg logit contributions
His-0.069
P-0.068
 his-0.057
OK-0.054
 down-0.054
Token to
Token ID284
Feature activation-7.34e-02

Pos logit contributions
P+0.047
His+0.046
 his+0.042
 down+0.040
 helmet+0.039
Neg logit contributions
 amaz-0.071
lee-0.065
 angles-0.063
 spec-0.063
 ago-0.061
Token party
Token ID2151
Feature activation-1.29e-02

Pos logit contributions
His+0.130
P+0.127
ercise+0.116
Move+0.113
Wait+0.111
Neg logit contributions
 amaz-0.112
od-0.104
lee-0.100
 ago-0.094
 spec-0.093
Token perfect
Token ID2818
Feature activation-1.15e-01

Pos logit contributions
P+0.018
His+0.017
ercise+0.015
 helmet+0.015
Move+0.014
Neg logit contributions
 amaz-0.020
lee-0.018
od-0.018
 ago-0.017
 spec-0.017
Token."
Token ID526
Feature activation-6.81e-02

Pos logit contributions
His+0.194
P+0.194
Br+0.176
ercise+0.174
Move+0.170
Neg logit contributions
 ago-0.138
 amaz-0.128
lee-0.125
od-0.124
 angles-0.120
Token
Token ID198
Feature activation-5.11e-02

Pos logit contributions
P+0.090
His+0.086
ercise+0.080
 helmet+0.077
w+0.075
Neg logit contributions
 amaz-0.103
lee-0.099
 spec-0.092
 ago-0.092
od-0.091
Token
Token ID198
Feature activation-5.11e-02

Pos logit contributions
P+0.052
His+0.051
Spot+0.042
O+0.039
 his+0.039
Neg logit contributions
 amaz-0.099
 spec-0.093
lee-0.091
 angles-0.090
 ago-0.089
TokenP
Token ID47
Feature activation-4.06e-01

Pos logit contributions
ercise+0.045
 helmet+0.043
P+0.041
His+0.041
erve+0.040
Neg logit contributions
 amaz-0.088
lee-0.086
od-0.082
ents-0.079
 spec-0.079
Tokeninky
Token ID29246
Feature activation-1.40e-01

Pos logit contributions
His+0.634
 helmet+0.584
P+0.584
Move+0.556
ercise+0.553
Neg logit contributions
lee-0.523
 amaz-0.484
ents-0.482
od-0.463
 ago-0.443
Token smiled
Token ID13541
Feature activation-3.91e-04

Pos logit contributions
P+0.207
His+0.200
ercise+0.188
 helmet+0.182
w+0.178
Neg logit contributions
lee-0.197
 amaz-0.185
od-0.170
 spec-0.166
 mesmer-0.159
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
P+0.000
His+0.000
 helmet+0.000
ercise+0.000
Spot+0.000
Neg logit contributions
 amaz-0.001
lee-0.001
 spec-0.001
od-0.001
 ago-0.001
Token said
Token ID531
Feature activation-7.29e-02

Pos logit contributions
P+0.060
His+0.057
ercise+0.052
 helmet+0.051
Spot+0.049
Neg logit contributions
 amaz-0.071
lee-0.068
 spec-0.063
od-0.063
 mesmer-0.062
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
P+0.103
ercise+0.093
Move+0.090
His+0.090
Spot+0.090
Neg logit contributions
lee-0.105
 amaz-0.100
od-0.099
 ago-0.092
 spec-0.092
Tokenboard
Token ID3526
Feature activation-1.37e-02

Pos logit contributions
P+0.061
His+0.058
ercise+0.055
 helmet+0.052
Spot+0.051
Neg logit contributions
 amaz-0.052
lee-0.050
 ago-0.047
od-0.046
 spec-0.046
Token.
Token ID13
Feature activation-5.63e-02

Pos logit contributions
P+0.022
His+0.020
ercise+0.020
Mid+0.019
Br+0.019
Neg logit contributions
 amaz-0.017
 ago-0.017
lee-0.016
od-0.016
 spec-0.016
Token
Token ID220
Feature activation-3.56e-02

Pos logit contributions
P+0.058
His+0.055
ercise+0.047
 helmet+0.046
Spot+0.044
Neg logit contributions
 amaz-0.103
lee-0.097
 spec-0.094
od-0.094
 ago-0.093
Token
Token ID198
Feature activation-5.14e-02

Pos logit contributions
P+0.042
His+0.040
ercise+0.034
 helmet+0.034
Spot+0.034
Neg logit contributions
 amaz-0.060
lee-0.057
 spec-0.054
 ago-0.053
 angles-0.053
Token
Token ID198
Feature activation-5.23e-02

Pos logit contributions
P+0.052
His+0.050
Spot+0.039
 his+0.037
One+0.037
Neg logit contributions
 amaz-0.103
lee-0.095
 spec-0.094
 ago-0.091
 angles-0.090
TokenP
Token ID47
Feature activation-4.06e-01

Pos logit contributions
P+0.031
His+0.029
 helmet+0.026
ercise+0.026
w+0.023
Neg logit contributions
 amaz-0.111
lee-0.109
od-0.103
 mesmer-0.103
 spec-0.102
Tokenete
Token ID14471
Feature activation+1.74e-02

Pos logit contributions
P+0.656
His+0.630
ercise+0.581
 helmet+0.573
Spot+0.557
Neg logit contributions
 amaz-0.502
lee-0.473
 spec-0.435
od-0.431
 ago-0.430
Token was
Token ID373
Feature activation-9.13e-02

Pos logit contributions
lee+0.025
 amaz+0.025
od+0.023
 mesmer+0.023
ents+0.021
Neg logit contributions
P-0.024
His-0.022
ercise-0.022
 helmet-0.022
w-0.021
Token curious
Token ID11040
Feature activation+3.20e-02

Pos logit contributions
His+0.098
 his+0.089
 down+0.083
P+0.083
 okay+0.081
Neg logit contributions
lee-0.155
 amaz-0.149
 spec-0.146
 angles-0.146
keeping-0.146
Token,
Token ID11
Feature activation-1.65e-01

Pos logit contributions
 amaz+0.055
lee+0.052
 spec+0.051
od+0.050
 angles+0.050
Neg logit contributions
His-0.033
 his-0.031
 helmet-0.030
P-0.029
 down-0.027
Token so
Token ID523
Feature activation+5.60e-02

Pos logit contributions
P+0.310
His+0.282
ercise+0.278
w+0.276
 helmet+0.263
Neg logit contributions
 amaz-0.167
lee-0.159
 mesmer-0.149
 spec-0.141
 ago-0.140
Token bird
Token ID6512
Feature activation-9.95e-03

Pos logit contributions
P+0.151
His+0.143
ercise+0.129
 helmet+0.126
Spot+0.122
Neg logit contributions
 amaz-0.186
lee-0.177
 spec-0.167
 ago-0.165
od-0.165
Token named
Token ID3706
Feature activation+2.34e-02

Pos logit contributions
P+0.013
His+0.013
 helmet+0.011
Spot+0.011
 his+0.011
Neg logit contributions
 amaz-0.016
lee-0.015
 angles-0.014
 mesmer-0.014
 spec-0.014
Token Polly
Token ID36898
Feature activation-3.26e-02

Pos logit contributions
 amaz+0.047
lee+0.043
 angles+0.041
 mesmer+0.041
 ago+0.041
Neg logit contributions
P-0.020
His-0.019
Spot-0.017
 his-0.016
Mid-0.015
Token.
Token ID13
Feature activation-5.59e-02

Pos logit contributions
P+0.031
His+0.028
ercise+0.024
 helmet+0.024
Spot+0.023
Neg logit contributions
 amaz-0.063
lee-0.059
 spec-0.057
od-0.056
 angles-0.056
Token "
Token ID366
Feature activation-7.46e-02

Pos logit contributions
P+0.069
His+0.065
ercise+0.058
 helmet+0.058
w+0.056
Neg logit contributions
 amaz-0.090
lee-0.087
 spec-0.081
od-0.081
 ago-0.080
TokenP
Token ID47
Feature activation-4.05e-01

Pos logit contributions
P+0.042
His+0.041
 helmet+0.037
ercise+0.035
w+0.029
Neg logit contributions
 amaz-0.161
lee-0.154
 spec-0.148
od-0.147
 mesmer-0.145
Tokenolly
Token ID5098
Feature activation+3.91e-02

Pos logit contributions
P+0.588
His+0.567
ercise+0.516
 helmet+0.511
Spot+0.493
Neg logit contributions
 amaz-0.564
lee-0.539
od-0.497
 spec-0.494
 ago-0.494
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
 amaz+0.064
lee+0.063
 ago+0.061
 spec+0.059
od+0.059
Neg logit contributions
His-0.048
P-0.046
ercise-0.040
Br-0.038
 helmet-0.038
Token what
Token ID644
Feature activation-2.77e-02

Pos logit contributions
P+0.201
His+0.189
ercise+0.174
 helmet+0.172
Spot+0.165
Neg logit contributions
 amaz-0.261
lee-0.253
 spec-0.237
 angles-0.235
od-0.235
Token is
Token ID318
Feature activation-1.37e-01

Pos logit contributions
P+0.029
His+0.026
ercise+0.023
 helmet+0.023
w+0.023
Neg logit contributions
 amaz-0.052
lee-0.051
 angles-0.046
 spec-0.046
 mesmer-0.046
Token this
Token ID428
Feature activation+3.15e-02

Pos logit contributions
P+0.217
His+0.201
ercise+0.190
 helmet+0.186
w+0.182
Neg logit contributions
 amaz-0.189
lee-0.173
 spec-0.160
 angles-0.159
 ago-0.157
Token
Token ID6184
Feature activation-1.19e-01

Pos logit contributions
P+0.011
His+0.011
ercise+0.010
w+0.009
Spot+0.009
Neg logit contributions
 amaz-0.011
lee-0.010
 mesmer-0.009
 spec-0.009
 ago-0.009
Token
Token ID95
Feature activation-6.94e-02

Pos logit contributions
P+0.163
His+0.151
ercise+0.141
 helmet+0.133
O+0.132
Neg logit contributions
 amaz-0.180
lee-0.167
 angles-0.161
 spec-0.159
od-0.154
Token
Token ID26391
Feature activation-1.38e-01

Pos logit contributions
P+0.087
His+0.081
ercise+0.074
 helmet+0.070
Spot+0.069
Neg logit contributions
 amaz-0.113
lee-0.107
 spec-0.102
 angles-0.100
od-0.100
Token
Token ID129
Feature activation-5.88e-02

Pos logit contributions
P+0.144
His+0.125
ercise+0.113
w+0.111
 helmet+0.108
Neg logit contributions
 amaz-0.249
lee-0.234
 spec-0.233
 angles-0.231
 ago-0.220
Token
Token ID241
Feature activation-9.29e-02

Pos logit contributions
P+0.077
His+0.072
 helmet+0.065
ercise+0.064
Spot+0.062
Neg logit contributions
 amaz-0.090
lee-0.088
 ago-0.081
od-0.081
 spec-0.080
TokenP
Token ID47
Feature activation-4.05e-01

Pos logit contributions
P+0.093
His+0.087
ercise+0.073
 helmet+0.072
Spot+0.070
Neg logit contributions
 amaz-0.173
lee-0.167
 spec-0.157
od-0.156
 angles-0.155
Tokenray
Token ID2433
Feature activation-1.07e-01

Pos logit contributions
His+0.637
P+0.635
 helmet+0.584
ercise+0.571
Spot+0.559
Neg logit contributions
lee-0.524
 amaz-0.513
od-0.473
ents-0.456
 ago-0.454
Token to
Token ID284
Feature activation-7.07e-02

Pos logit contributions
P+0.157
His+0.155
Spot+0.139
ercise+0.138
Br+0.138
Neg logit contributions
 amaz-0.149
lee-0.146
 ago-0.144
 spec-0.140
od-0.138
Token God
Token ID1793
Feature activation+4.81e-02

Pos logit contributions
P+0.080
His+0.077
ercise+0.067
 helmet+0.067
Spot+0.064
Neg logit contributions
 amaz-0.120
lee-0.115
 spec-0.111
od-0.109
 ago-0.108
Token and
Token ID290
Feature activation-4.85e-02

Pos logit contributions
 ago+0.065
 amaz+0.061
od+0.059
lee+0.058
 mesmer+0.054
Neg logit contributions
P-0.078
His-0.073
O-0.067
Br-0.067
Move-0.065
Token He
Token ID679
Feature activation-2.25e-01

Pos logit contributions
P+0.065
His+0.060
ercise+0.055
Spot+0.053
w+0.052
Neg logit contributions
 amaz-0.075
lee-0.070
 ago-0.066
 spec-0.065
od-0.064
Token almost
Token ID2048
Feature activation-1.13e-01

Pos logit contributions
P+0.147
Move+0.125
His+0.124
Spot+0.121
w+0.121
Neg logit contributions
 amaz-0.103
lee-0.099
 ago-0.097
 of-0.091
 impressed-0.089
Token ready
Token ID3492
Feature activation-2.18e-01

Pos logit contributions
P+0.173
His+0.167
Spot+0.161
OK+0.151
ercise+0.149
Neg logit contributions
lee-0.148
 amaz-0.148
 ago-0.145
 awe-0.123
 of-0.117
Token.
Token ID13
Feature activation-5.72e-02

Pos logit contributions
P+0.444
ercise+0.380
His+0.380
Spot+0.375
Br+0.365
Neg logit contributions
 ago-0.228
 amaz-0.213
lee-0.208
 of-0.194
od-0.189
Token
Token ID198
Feature activation-5.22e-02

Pos logit contributions
P+0.077
His+0.073
ercise+0.068
 helmet+0.067
w+0.064
Neg logit contributions
 amaz-0.086
lee-0.082
 spec-0.076
od-0.075
 ago-0.075
Token
Token ID198
Feature activation-5.32e-02

Pos logit contributions
P+0.065
His+0.062
ercise+0.055
 helmet+0.054
Spot+0.053
Neg logit contributions
 amaz-0.084
lee-0.080
 spec-0.076
 ago-0.075
od-0.074
TokenP
Token ID47
Feature activation-4.05e-01

Pos logit contributions
 helmet+0.070
ercise+0.068
 down+0.067
erve+0.066
erves+0.065
Neg logit contributions
 amaz-0.068
 mesmer-0.065
lee-0.065
od-0.063
 impressed-0.057
Tokenapa
Token ID32678
Feature activation+5.91e-03

Pos logit contributions
P+0.690
His+0.685
 helmet+0.645
Move+0.634
ercise+0.634
Neg logit contributions
 amaz-0.450
lee-0.446
od-0.385
 ago-0.374
ents-0.363
Token nodded
Token ID14464
Feature activation-1.25e-01

Pos logit contributions
lee+0.008
 amaz+0.007
od+0.007
ents+0.007
 mesmer+0.007
Neg logit contributions
P-0.009
His-0.008
ercise-0.008
 helmet-0.008
Br-0.008
Token sadly
Token ID21098
Feature activation-7.58e-02

Pos logit contributions
P+0.244
ercise+0.233
w+0.228
ww+0.226
uch+0.223
Neg logit contributions
 amaz-0.132
lee-0.127
 mesmer-0.107
 of-0.106
 awe-0.100
Token.
Token ID13
Feature activation-6.05e-02

Pos logit contributions
P+0.125
His+0.109
ercise+0.107
w+0.105
Move+0.102
Neg logit contributions
 amaz-0.101
lee-0.097
 ago-0.090
 mesmer-0.084
od-0.083
Token He
Token ID679
Feature activation-2.23e-01

Pos logit contributions
P+0.064
w+0.056
His+0.053
ercise+0.053
 helmet+0.050
Neg logit contributions
 amaz-0.114
lee-0.112
 spec-0.101
 mesmer-0.100
 ago-0.100
Token cable
Token ID7862
Feature activation-1.68e-01

Pos logit contributions
 amaz+0.006
 spec+0.005
 angles+0.005
lee+0.005
 ago+0.005
Neg logit contributions
P-0.004
His-0.004
ercise-0.003
mar-0.003
w-0.003
Token?"
Token ID1701
Feature activation-2.05e-01

Pos logit contributions
P+0.236
His+0.227
Br+0.205
ercise+0.203
Spot+0.200
Neg logit contributions
 amaz-0.247
 ago-0.230
lee-0.221
 angles-0.217
od-0.214
Token
Token ID220
Feature activation-3.37e-02

Pos logit contributions
P+0.303
His+0.288
ercise+0.287
 helmet+0.282
w+0.274
Neg logit contributions
 amaz-0.277
lee-0.266
 mesmer-0.239
 spec-0.239
od-0.236
Token
Token ID198
Feature activation-4.93e-02

Pos logit contributions
P+0.049
His+0.046
ercise+0.045
 helmet+0.043
w+0.042
Neg logit contributions
 amaz-0.046
lee-0.045
od-0.042
 mesmer-0.040
 ago-0.040
Token
Token ID198
Feature activation-5.02e-02

Pos logit contributions
 helmet+0.079
ercise+0.079
erve+0.074
P+0.072
 helmets+0.072
Neg logit contributions
 amaz-0.053
lee-0.052
 mesmer-0.048
od-0.047
 spec-0.041
TokenP
Token ID47
Feature activation-4.05e-01

Pos logit contributions
 helmet+0.059
 down+0.054
ercise+0.054
erve+0.052
erves+0.052
Neg logit contributions
 amaz-0.079
 mesmer-0.074
lee-0.072
od-0.070
 tad-0.065
Tokenapa
Token ID32678
Feature activation+2.51e-03

Pos logit contributions
His+0.757
P+0.752
 helmet+0.737
Hi+0.714
Move+0.713
Neg logit contributions
lee-0.379
 amaz-0.364
ents-0.321
od-0.320
 ago-0.285
Token said
Token ID531
Feature activation-7.30e-02

Pos logit contributions
lee+0.003
 amaz+0.003
od+0.003
 mesmer+0.003
 ago+0.002
Neg logit contributions
P-0.004
ercise-0.004
His-0.004
 helmet-0.004
Br-0.004
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
P+0.127
ercise+0.116
Move+0.109
Wh+0.108
Spot+0.108
Neg logit contributions
lee-0.087
 amaz-0.080
od-0.080
 of-0.078
ous-0.077
Token "
Token ID366
Feature activation-7.55e-02

Pos logit contributions
P+0.312
w+0.295
ercise+0.284
erve+0.276
use+0.270
Neg logit contributions
lee-0.154
 amaz-0.144
 mesmer-0.142
ous-0.131
 ago-0.127
TokenIt
Token ID1026
Feature activation-1.09e-01

Pos logit contributions
P+0.083
His+0.079
 helmet+0.072
ercise+0.071
Mid+0.065
Neg logit contributions
 amaz-0.128
lee-0.122
 spec-0.118
od-0.118
 mesmer-0.116
Token together
Token ID1978
Feature activation-2.03e-01

Pos logit contributions
P+0.130
His+0.128
Move+0.121
ercise+0.118
ww+0.117
Neg logit contributions
 amaz-0.134
lee-0.126
 ago-0.123
 mesmer-0.119
ents-0.114
Token and
Token ID290
Feature activation-3.44e-02

Pos logit contributions
P+0.310
His+0.287
ercise+0.271
Move+0.259
O+0.258
Neg logit contributions
 amaz-0.277
lee-0.255
 ago-0.253
 angles-0.245
 spec-0.239
Token said
Token ID531
Feature activation-6.21e-02

Pos logit contributions
P+0.042
His+0.040
ercise+0.036
 helmet+0.034
Move+0.034
Neg logit contributions
 amaz-0.056
lee-0.053
 spec-0.049
 ago-0.048
 mesmer-0.047
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
P+0.102
His+0.089
ercise+0.088
Move+0.086
w+0.085
Neg logit contributions
lee-0.080
 amaz-0.079
 ago-0.073
od-0.072
 spec-0.068
Token "
Token ID366
Feature activation-6.47e-02

Pos logit contributions
P+0.222
His+0.194
w+0.188
ercise+0.187
 helmet+0.178
Neg logit contributions
 amaz-0.216
lee-0.206
 ago-0.195
 mesmer-0.194
od-0.188
TokenP
Token ID47
Feature activation-4.04e-01

Pos logit contributions
P+0.070
His+0.069
 helmet+0.066
ercise+0.063
erve+0.059
Neg logit contributions
 amaz-0.105
 spec-0.097
lee-0.096
od-0.095
 mesmer-0.094
Tokeninky
Token ID29246
Feature activation-1.39e-01

Pos logit contributions
His+0.632
P+0.608
 helmet+0.571
 his+0.553
Move+0.546
Neg logit contributions
lee-0.534
 amaz-0.496
ents-0.486
 ago-0.486
od-0.480
Token promise
Token ID6991
Feature activation-7.84e-02

Pos logit contributions
His+0.249
P+0.242
Br+0.214
Move+0.211
O+0.204
Neg logit contributions
lee-0.165
 ago-0.157
od-0.148
 spec-0.147
 amaz-0.142
Token."
Token ID526
Feature activation-6.43e-02

Pos logit contributions
P+0.105
His+0.099
ercise+0.090
 helmet+0.089
w+0.084
Neg logit contributions
 amaz-0.118
lee-0.114
 spec-0.107
 ago-0.106
 angles-0.105
Token
Token ID198
Feature activation-4.46e-02

Pos logit contributions
P+0.083
His+0.078
ercise+0.071
 helmet+0.070
Spot+0.067
Neg logit contributions
 amaz-0.101
lee-0.096
 spec-0.090
 ago-0.090
od-0.089
Token
Token ID198
Feature activation-4.50e-02

Pos logit contributions
His+0.046
P+0.046
Spot+0.037
 his+0.036
O+0.034
Neg logit contributions
 amaz-0.085
 spec-0.079
lee-0.077
 angles-0.076
 ago-0.075
Token go
Token ID467
Feature activation-1.42e-01

Pos logit contributions
P+0.046
His+0.041
Sorry+0.041
OK+0.040
Deal+0.038
Neg logit contributions
lee-0.035
 amaz-0.035
od-0.034
 spec-0.034
 of-0.032
Token.
Token ID13
Feature activation-5.04e-02

Pos logit contributions
P+0.210
His+0.201
ercise+0.185
 helmet+0.181
Spot+0.176
Neg logit contributions
 amaz-0.192
lee-0.183
 spec-0.170
 ago-0.169
od-0.169
Token
Token ID220
Feature activation-3.24e-02

Pos logit contributions
P+0.065
w+0.059
His+0.055
ercise+0.052
mar+0.052
Neg logit contributions
 amaz-0.083
lee-0.077
 spec-0.072
 awe-0.071
 mesmer-0.071
Token
Token ID198
Feature activation-4.84e-02

Pos logit contributions
P+0.055
ercise+0.049
His+0.049
 helmet+0.048
w+0.048
Neg logit contributions
lee-0.036
 amaz-0.035
od-0.034
 ago-0.033
ents-0.033
Token
Token ID198
Feature activation-4.85e-02

Pos logit contributions
ercise+0.080
 helmet+0.079
erve+0.075
P+0.075
w+0.075
Neg logit contributions
lee-0.043
 mesmer-0.042
 amaz-0.042
od-0.041
 ago-0.038
TokenP
Token ID47
Feature activation-4.04e-01

Pos logit contributions
 down+0.062
 coats+0.059
 melts+0.057
 boots+0.057
erves+0.057
Neg logit contributions
 mesmer-0.056
 amaz-0.056
 of-0.053
 impressed-0.053
 enchanted-0.051
Tokenapa
Token ID32678
Feature activation+6.60e-03

Pos logit contributions
His+0.713
P+0.705
Hi+0.669
Move+0.668
 helmet+0.660
Neg logit contributions
 amaz-0.383
lee-0.369
ents-0.342
od-0.317
ocket-0.296
Token bent
Token ID17157
Feature activation-6.23e-02

Pos logit contributions
lee+0.009
 amaz+0.008
od+0.007
 ago+0.007
 mesmer+0.007
Neg logit contributions
P-0.011
His-0.010
ercise-0.010
Br-0.009
 helmet-0.009
Token down
Token ID866
Feature activation-3.21e-01

Pos logit contributions
P+0.087
Fin+0.077
ww+0.075
uch+0.075
Deal+0.072
Neg logit contributions
lee-0.094
 amaz-0.085
 angles-0.085
keeping-0.082
 below-0.081
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
P+0.546
ercise+0.539
Br+0.504
 helmet+0.495
w+0.492
Neg logit contributions
 amaz-0.342
 ago-0.323
lee-0.321
 of-0.305
 mesmer-0.303
Token said
Token ID531
Feature activation-7.29e-02

Pos logit contributions
P+0.070
gh+0.067
His+0.065
w+0.065
erve+0.062
Neg logit contributions
lee-0.056
 amaz-0.053
 mesmer-0.051
ents-0.050
leaf-0.048
Token me
Token ID502
Feature activation-2.20e-01

Pos logit contributions
P+0.104
His+0.100
OK+0.089
Sorry+0.087
Move+0.084
Neg logit contributions
lee-0.171
 amaz-0.171
 angles-0.158
 ago-0.156
keeping-0.150
Token."
Token ID526
Feature activation-6.24e-02

Pos logit contributions
P+0.357
His+0.333
Move+0.315
ercise+0.311
Wait+0.306
Neg logit contributions
 ago-0.271
 amaz-0.255
lee-0.254
ents-0.243
 angles-0.240
Token
Token ID220
Feature activation-2.87e-02

Pos logit contributions
P+0.094
ercise+0.093
 helmet+0.093
His+0.090
w+0.089
Neg logit contributions
lee-0.078
 amaz-0.077
 mesmer-0.065
od-0.065
 spec-0.063
Token
Token ID198
Feature activation-4.42e-02

Pos logit contributions
P+0.045
ercise+0.043
 helmet+0.041
His+0.041
w+0.040
Neg logit contributions
lee-0.035
 amaz-0.034
od-0.032
 ago-0.030
 mesmer-0.030
Token
Token ID198
Feature activation-4.50e-02

Pos logit contributions
P+0.059
ercise+0.058
 helmet+0.057
His+0.056
w+0.053
Neg logit contributions
lee-0.059
 amaz-0.059
od-0.055
 mesmer-0.053
 ago-0.051
TokenP
Token ID47
Feature activation-4.03e-01

Pos logit contributions
 helmet+0.025
 down+0.022
ercise+0.020
 melts+0.018
 coats+0.018
Neg logit contributions
 amaz-0.093
lee-0.092
 mesmer-0.090
od-0.089
 impressed-0.085
Tokenete
Token ID14471
Feature activation+2.02e-02

Pos logit contributions
P+0.723
His+0.718
 helmet+0.697
ercise+0.680
Spot+0.674
Neg logit contributions
lee-0.388
 amaz-0.379
od-0.317
 ago-0.296
 angles-0.294
Token thought
Token ID1807
Feature activation-5.32e-02

Pos logit contributions
lee+0.025
 amaz+0.024
 mesmer+0.022
 spec+0.021
od+0.020
Neg logit contributions
P-0.033
ercise-0.029
use-0.029
His-0.028
 helmet-0.028
Token for
Token ID329
Feature activation-1.09e-01

Pos logit contributions
P+0.076
His+0.074
 helmet+0.067
ercise+0.067
Spot+0.064
Neg logit contributions
 amaz-0.074
lee-0.070
 spec-0.066
od-0.065
 ago-0.064
Token a
Token ID257
Feature activation-1.19e-01

Pos logit contributions
P+0.114
His+0.106
ercise+0.094
 helmet+0.091
Spot+0.087
Neg logit contributions
 amaz-0.197
lee-0.189
 spec-0.179
od-0.178
 ago-0.178
Token minute
Token ID5664
Feature activation-2.37e-01

Pos logit contributions
P+0.150
ercise+0.132
His+0.128
erve+0.122
w+0.121
Neg logit contributions
 amaz-0.191
 ago-0.174
 spec-0.173
 mesmer-0.170
 awe-0.170
Token're
Token ID821
Feature activation-2.38e-02

Pos logit contributions
P+0.184
His+0.181
ercise+0.159
 helmet+0.156
Spot+0.153
Neg logit contributions
 amaz-0.176
lee-0.166
 ago-0.157
od-0.155
 spec-0.153
Token hungry
Token ID14720
Feature activation-1.15e-01

Pos logit contributions
P+0.034
His+0.034
ercise+0.032
OK+0.030
Move+0.029
Neg logit contributions
lee-0.033
 ago-0.032
 amaz-0.032
 mesmer-0.030
od-0.030
Token."
Token ID526
Feature activation-6.61e-02

Pos logit contributions
P+0.197
His+0.193
ercise+0.180
Br+0.173
Mid+0.166
Neg logit contributions
 ago-0.136
 amaz-0.127
lee-0.123
keeping-0.116
 angles-0.115
Token
Token ID198
Feature activation-4.85e-02

Pos logit contributions
P+0.082
His+0.078
ercise+0.070
 helmet+0.068
Spot+0.066
Neg logit contributions
 amaz-0.107
lee-0.102
 spec-0.096
od-0.095
 ago-0.095
Token
Token ID198
Feature activation-4.93e-02

Pos logit contributions
P+0.052
His+0.050
Spot+0.041
O+0.038
ercise+0.037
Neg logit contributions
 amaz-0.093
 spec-0.086
 angles-0.084
lee-0.083
 ago-0.083
TokenP
Token ID47
Feature activation-4.03e-01

Pos logit contributions
 helmet+0.044
P+0.041
ercise+0.041
His+0.040
 down+0.039
Neg logit contributions
lee-0.088
 amaz-0.084
od-0.081
 mesmer-0.077
 angles-0.075
Tokenolly
Token ID5098
Feature activation+4.05e-02

Pos logit contributions
His+0.685
P+0.663
 helmet+0.648
Move+0.625
ercise+0.621
Neg logit contributions
lee-0.436
 amaz-0.418
od-0.382
ents-0.353
 ago-0.335
Token was
Token ID373
Feature activation-8.82e-02

Pos logit contributions
 amaz+0.056
lee+0.055
 mesmer+0.050
od+0.049
 spec+0.049
Neg logit contributions
P-0.061
His-0.055
ercise-0.053
w-0.052
 helmet-0.052
Token happy
Token ID3772
Feature activation-3.91e-02

Pos logit contributions
His+0.108
P+0.105
 his+0.092
 helmet+0.091
ercise+0.088
Neg logit contributions
 amaz-0.138
lee-0.132
 spec-0.130
 angles-0.127
od-0.127
Token with
Token ID351
Feature activation-1.06e-01

Pos logit contributions
His+0.043
P+0.041
 his+0.039
 helmet+0.037
 down+0.035
Neg logit contributions
 amaz-0.067
 spec-0.063
lee-0.063
 angles-0.062
 ago-0.061
Token Sammy
Token ID44067
Feature activation-7.42e-02

Pos logit contributions
P+0.093
 his+0.091
His+0.090
ercise+0.067
Spot+0.066
Neg logit contributions
 spec-0.211
 amaz-0.210
 outage-0.205
 ago-0.201
 mesmer-0.197
Token never
Token ID1239
Feature activation+1.35e-02

Pos logit contributions
P+0.208
His+0.195
ercise+0.177
 helmet+0.173
Spot+0.167
Neg logit contributions
 amaz-0.250
lee-0.236
 spec-0.223
 ago-0.221
od-0.220
Token be
Token ID307
Feature activation-1.37e-01

Pos logit contributions
 amaz+0.023
lee+0.022
 spec+0.021
od+0.021
 ago+0.021
Neg logit contributions
P-0.015
His-0.015
ercise-0.013
 helmet-0.013
Spot-0.012
Token peace
Token ID4167
Feature activation+2.32e-03

Pos logit contributions
P+0.197
His+0.186
ercise+0.174
 helmet+0.166
Mid+0.161
Neg logit contributions
 amaz-0.199
lee-0.194
 spec-0.183
 ago-0.181
od-0.177
Token.
Token ID13
Feature activation-5.27e-02

Pos logit contributions
 amaz+0.004
lee+0.003
 ago+0.003
 spec+0.003
od+0.003
Neg logit contributions
P-0.003
His-0.003
ercise-0.003
 helmet-0.003
w-0.003
Token<|endoftext|>
Token ID50256
Feature activation-1.84e-02

Pos logit contributions
His+0.042
P+0.033
 His+0.032
 his+0.030
OK+0.028
Neg logit contributions
 spec-0.108
 spe-0.105
 amaz-0.104
 angles-0.101
lee-0.100
TokenP
Token ID47
Feature activation-4.02e-01

Pos logit contributions
P+0.020
His+0.019
Spot+0.016
 helmet+0.016
ercise+0.016
Neg logit contributions
 amaz-0.033
lee-0.031
 spec-0.029
 angles-0.029
od-0.029
Tokenolly
Token ID5098
Feature activation+4.15e-02

Pos logit contributions
P+0.238
His+0.203
ercise+0.155
 helmet+0.133
w+0.132
Neg logit contributions
 amaz-0.895
 spec-0.887
 mesmer-0.862
lee-0.847
 angles-0.846
Token was
Token ID373
Feature activation-8.94e-02

Pos logit contributions
 amaz+0.059
lee+0.058
 ago+0.055
 spec+0.053
od+0.052
Neg logit contributions
P-0.058
His-0.057
ercise-0.051
 helmet-0.051
Br-0.049
Token ready
Token ID3492
Feature activation-2.21e-01

Pos logit contributions
His+0.115
P+0.114
ercise+0.110
OK+0.105
 helmet+0.104
Neg logit contributions
 amaz-0.136
lee-0.128
 mesmer-0.126
 ago-0.126
 awe-0.119
Token to
Token ID284
Feature activation-7.02e-02

Pos logit contributions
P+0.354
His+0.330
Spot+0.308
ercise+0.305
Move+0.299
Neg logit contributions
 amaz-0.287
lee-0.283
 ago-0.275
od-0.259
 mesmer-0.251
Token plan
Token ID1410
Feature activation-8.19e-02

Pos logit contributions
P+0.074
His+0.071
ercise+0.062
 helmet+0.061
Spot+0.057
Neg logit contributions
 amaz-0.124
lee-0.119
 spec-0.114
 angles-0.112
od-0.112
Token fear
Token ID3252
Feature activation+4.51e-02

Pos logit contributions
P+0.234
His+0.215
ercise+0.196
Move+0.195
Br+0.189
Neg logit contributions
 amaz-0.255
lee-0.249
 spec-0.222
 angles-0.220
 mesmer-0.217
Token away
Token ID1497
Feature activation-3.01e-01

Pos logit contributions
 amaz+0.056
lee+0.055
 ago+0.053
ous+0.051
 mesmer+0.050
Neg logit contributions
P-0.078
ercise-0.068
His-0.066
Move-0.063
Spot-0.062
Token.
Token ID13
Feature activation-4.73e-02

Pos logit contributions
P+0.515
ercise+0.458
His+0.455
Move+0.433
w+0.426
Neg logit contributions
 amaz-0.387
lee-0.356
 ago-0.334
 mesmer-0.333
 spec-0.331
Token
Token ID198
Feature activation-4.52e-02

Pos logit contributions
P+0.062
His+0.059
ercise+0.054
 helmet+0.053
w+0.051
Neg logit contributions
 amaz-0.074
lee-0.071
 spec-0.065
 ago-0.065
od-0.065
Token
Token ID198
Feature activation-4.53e-02

Pos logit contributions
P+0.056
His+0.054
ercise+0.047
 helmet+0.046
Spot+0.045
Neg logit contributions
 amaz-0.073
lee-0.069
 spec-0.067
 ago-0.065
od-0.065
TokenP
Token ID47
Feature activation-4.02e-01

Pos logit contributions
 helmet+0.040
P+0.038
His+0.038
ercise+0.038
Move+0.035
Neg logit contributions
 amaz-0.082
lee-0.080
 mesmer-0.074
od-0.074
 ago-0.072
Tokenapa
Token ID32678
Feature activation+8.89e-03

Pos logit contributions
P+0.626
His+0.608
ercise+0.558
 helmet+0.553
Spot+0.538
Neg logit contributions
 amaz-0.518
lee-0.494
od-0.450
 spec-0.449
 ago-0.445
Token held
Token ID2714
Feature activation-1.23e-01

Pos logit contributions
lee+0.012
 amaz+0.012
od+0.010
 mesmer+0.010
ents+0.010
Neg logit contributions
P-0.014
His-0.013
ercise-0.012
 helmet-0.012
Move-0.011
Token Lily
Token ID20037
Feature activation-1.47e-01

Pos logit contributions
P+0.141
His+0.135
ercise+0.129
 helmet+0.124
OK+0.117
Neg logit contributions
 amaz-0.203
lee-0.199
 spec-0.183
 awe-0.176
 angles-0.176
Token's
Token ID338
Feature activation-5.71e-02

Pos logit contributions
P+0.227
His+0.200
ercise+0.194
O+0.189
Move+0.189
Neg logit contributions
 amaz-0.199
lee-0.180
 mesmer-0.172
 angles-0.168
ents-0.165
Token hand
Token ID1021
Feature activation+3.63e-02

Pos logit contributions
P+0.109
His+0.099
ercise+0.099
w+0.096
mar+0.096
Neg logit contributions
 amaz-0.059
lee-0.051
 spec-0.045
 angles-0.045
 ago-0.045
Token more
Token ID517
Feature activation-8.75e-02

Pos logit contributions
 amaz+0.019
od+0.017
 mesmer+0.016
lee+0.016
 outage+0.016
Neg logit contributions
P-0.023
His-0.020
Spot-0.019
O-0.019
w-0.018
Token fun
Token ID1257
Feature activation-1.75e-01

Pos logit contributions
P+0.091
His+0.085
ercise+0.073
 helmet+0.071
Spot+0.069
Neg logit contributions
 amaz-0.161
lee-0.153
 spec-0.145
od-0.145
 ago-0.144
Token!"
Token ID2474
Feature activation-8.40e-02

Pos logit contributions
P+0.227
His+0.217
ercise+0.191
 helmet+0.186
Spot+0.183
Neg logit contributions
 amaz-0.275
lee-0.262
 ago-0.250
od-0.249
 spec-0.247
Token
Token ID198
Feature activation-4.67e-02

Pos logit contributions
P+0.114
His+0.106
ercise+0.101
 helmet+0.097
w+0.096
Neg logit contributions
 amaz-0.123
lee-0.119
od-0.109
 spec-0.109
 ago-0.108
Token
Token ID198
Feature activation-4.75e-02

Pos logit contributions
P+0.056
His+0.054
ercise+0.046
 helmet+0.045
Spot+0.045
Neg logit contributions
 amaz-0.079
lee-0.075
 spec-0.072
 ago-0.071
 angles-0.070
TokenP
Token ID47
Feature activation-4.02e-01

Pos logit contributions
 down+0.041
use+0.041
 helmet+0.041
w+0.041
ercise+0.040
Neg logit contributions
lee-0.075
 amaz-0.074
od-0.072
ents-0.068
 impressed-0.068
Tokenenny
Token ID11870
Feature activation+5.14e-02

Pos logit contributions
His+0.733
P+0.729
 helmet+0.684
ercise+0.670
Br+0.668
Neg logit contributions
lee-0.375
 amaz-0.367
od-0.318
ents-0.307
 ago-0.301
Token said
Token ID531
Feature activation-7.05e-02

Pos logit contributions
lee+0.069
 amaz+0.063
od+0.060
 mesmer+0.058
ents+0.053
Neg logit contributions
P-0.081
His-0.074
Br-0.072
 helmet-0.071
Spot-0.070
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
P+0.116
ercise+0.101
use+0.099
w+0.098
Move+0.098
Neg logit contributions
lee-0.085
 amaz-0.083
od-0.082
ous-0.081
 of-0.076
Token "
Token ID366
Feature activation-7.29e-02

Pos logit contributions
P+0.282
w+0.262
ercise+0.254
use+0.243
ww+0.243
Neg logit contributions
lee-0.168
 amaz-0.163
 mesmer-0.158
od-0.149
 ago-0.147
TokenOkay
Token ID16454
Feature activation-2.56e-01

Pos logit contributions
P+0.068
 helmet+0.063
His+0.063
ercise+0.059
 down+0.055
Neg logit contributions
 amaz-0.129
 spec-0.122
 mesmer-0.121
lee-0.119
od-0.117
Token her
Token ID607
Feature activation-1.71e-01

Pos logit contributions
P+0.324
His+0.315
ercise+0.293
 helmet+0.278
w+0.276
Neg logit contributions
 amaz-0.281
lee-0.261
 ago-0.251
od-0.250
 spec-0.246
Token violin
Token ID38283
Feature activation-7.47e-02

Pos logit contributions
P+0.200
His+0.189
 helmet+0.172
ercise+0.171
Spot+0.160
Neg logit contributions
 amaz-0.283
lee-0.270
 spec-0.258
 ago-0.256
od-0.253
Token!
Token ID0
Feature activation-2.55e-02

Pos logit contributions
P+0.126
His+0.123
Br+0.112
O+0.111
Move+0.109
Neg logit contributions
 amaz-0.083
od-0.081
 ago-0.080
lee-0.080
 angles-0.074
Token
Token ID198
Feature activation-4.52e-02

Pos logit contributions
P+0.029
His+0.027
ercise+0.023
Mid+0.022
Spot+0.022
Neg logit contributions
 amaz-0.045
 spec-0.042
lee-0.041
 angles-0.041
 ago-0.041
Token
Token ID198
Feature activation-4.56e-02

Pos logit contributions
P+0.039
His+0.036
Spot+0.028
O+0.027
 his+0.026
Neg logit contributions
 amaz-0.099
 spec-0.097
 angles-0.091
 spe-0.090
 ago-0.089
TokenP
Token ID47
Feature activation-4.01e-01

Pos logit contributions
P+0.039
His+0.035
ercise+0.029
 helmet+0.027
Spot+0.026
Neg logit contributions
 amaz-0.094
lee-0.090
 spec-0.087
 ago-0.086
 angles-0.085
Tokenolly
Token ID5098
Feature activation+4.26e-02

Pos logit contributions
P+0.647
His+0.594
ercise+0.556
w+0.534
 helmet+0.528
Neg logit contributions
 amaz-0.518
lee-0.471
 spec-0.468
 ago-0.452
 angles-0.449
Token was
Token ID373
Feature activation-8.45e-02

Pos logit contributions
 amaz+0.063
lee+0.062
od+0.057
 mesmer+0.056
 spec+0.056
Neg logit contributions
P-0.059
His-0.055
ercise-0.050
 helmet-0.050
Spot-0.048
Token so
Token ID523
Feature activation+6.37e-02

Pos logit contributions
His+0.108
P+0.102
 his+0.096
 helmet+0.093
 down+0.093
Neg logit contributions
 amaz-0.122
lee-0.119
 spec-0.117
 ago-0.115
 angles-0.114
Token excited
Token ID6568
Feature activation+3.76e-03

Pos logit contributions
lee+0.107
 amaz+0.106
 spec+0.106
 ago+0.104
 angles+0.103
Neg logit contributions
His-0.069
P-0.069
OK-0.056
 his-0.056
 helmet-0.054
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 amaz+0.006
lee+0.006
 spec+0.005
 angles+0.005
 ago+0.005
Neg logit contributions
P-0.005
His-0.004
ercise-0.004
 helmet-0.004
 his-0.004
Token especially
Token ID2592
Feature activation+3.26e-01

Pos logit contributions
P+0.002
His+0.002
ercise+0.002
 helmet+0.002
w+0.002
Neg logit contributions
 amaz-0.002
lee-0.002
 spec-0.002
 ago-0.002
 angles-0.002
Token liked
Token ID8288
Feature activation+1.81e-01

Pos logit contributions
 amaz+0.457
lee+0.449
 spec+0.425
 mesmer+0.421
 ago+0.421
Neg logit contributions
P-0.465
His-0.449
ercise-0.399
 helmet-0.387
Spot-0.384
Token the
Token ID262
Feature activation+2.82e-01

Pos logit contributions
 amaz+0.260
lee+0.252
od+0.235
 spec+0.234
 ago+0.232
Neg logit contributions
P-0.256
His-0.244
ercise-0.220
Spot-0.214
 helmet-0.213
Token bright
Token ID6016
Feature activation+2.09e-01

Pos logit contributions
 amaz+0.411
 spec+0.385
lee+0.381
 ago+0.368
od+0.366
Neg logit contributions
His-0.386
P-0.379
ercise-0.337
Mid-0.312
Spot-0.312
Token colors
Token ID7577
Feature activation+3.01e-01

Pos logit contributions
 amaz+0.313
lee+0.297
 spec+0.278
od+0.277
 ago+0.275
Neg logit contributions
P-0.280
His-0.266
ercise-0.242
 helmet-0.239
Spot-0.228
Token of
Token ID286
Feature activation+4.89e-01

Pos logit contributions
 amaz+0.430
lee+0.415
od+0.390
 spec+0.388
 ago+0.386
Neg logit contributions
P-0.428
His-0.404
ercise-0.373
 helmet-0.356
Spot-0.355
Token the
Token ID262
Feature activation+2.51e-01

Pos logit contributions
 amaz+0.753
lee+0.717
 spec+0.672
od+0.667
 ago+0.666
Neg logit contributions
P-0.645
His-0.614
ercise-0.554
 helmet-0.542
Spot-0.525
Token yarn
Token ID21181
Feature activation+1.25e-01

Pos logit contributions
 amaz+0.327
lee+0.303
 ago+0.293
 spec+0.292
 angles+0.282
Neg logit contributions
His-0.405
P-0.392
ercise-0.352
 helmet-0.335
Spot-0.333
Token she
Token ID673
Feature activation+1.40e-01

Pos logit contributions
 amaz+0.151
lee+0.147
 ago+0.146
od+0.144
 spec+0.141
Neg logit contributions
P-0.197
His-0.195
ercise-0.176
Br-0.172
Move-0.172
Token offered
Token ID4438
Feature activation+1.78e-01

Pos logit contributions
 amaz+0.195
lee+0.191
 ago+0.180
 spec+0.176
od+0.176
Neg logit contributions
His-0.195
P-0.195
ercise-0.176
 helmet-0.173
Spot-0.165
Token!
Token ID0
Feature activation+1.78e-01

Pos logit contributions
lee+0.201
od+0.194
 ago+0.189
 amaz+0.180
 of+0.175
Neg logit contributions
P-0.319
His-0.280
ercise-0.276
Move-0.276
Sorry-0.267
Token lots
Token ID6041
Feature activation+2.19e-01

Pos logit contributions
 amaz+0.384
lee+0.367
od+0.346
 spec+0.346
 ago+0.342
Neg logit contributions
P-0.269
His-0.250
ercise-0.228
 helmet-0.218
Spot-0.212
Token of
Token ID286
Feature activation+5.27e-01

Pos logit contributions
 amaz+0.195
lee+0.156
 ago+0.153
keeping+0.149
 angles+0.144
Neg logit contributions
P-0.419
His-0.418
 his-0.391
 helmet-0.382
Spot-0.377
Token people
Token ID661
Feature activation+2.51e-01

Pos logit contributions
 amaz+0.798
lee+0.759
 spec+0.713
od+0.705
 ago+0.703
Neg logit contributions
P-0.703
His-0.669
ercise-0.607
 helmet-0.593
Spot-0.574
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
 amaz+0.337
lee+0.319
 spec+0.296
od+0.295
 ago+0.294
Neg logit contributions
P-0.380
His-0.361
ercise-0.333
 helmet-0.325
Spot-0.317
Token lots
Token ID6041
Feature activation+1.52e-01

Pos logit contributions
 amaz+0.315
lee+0.299
 spec+0.279
 ago+0.277
od+0.275
Neg logit contributions
P-0.304
His-0.284
ercise-0.261
 helmet-0.254
w-0.249
Token of
Token ID286
Feature activation+4.81e-01

Pos logit contributions
 amaz+0.102
lee+0.084
 spec+0.074
 angles+0.072
od+0.071
Neg logit contributions
P-0.327
His-0.322
 helmet-0.300
ercise-0.299
Spot-0.294
Token stores
Token ID7000
Feature activation+1.64e-01

Pos logit contributions
 amaz+0.733
lee+0.699
 spec+0.656
 ago+0.649
od+0.648
Neg logit contributions
P-0.645
His-0.611
ercise-0.557
 helmet-0.541
w-0.525
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
 amaz+0.218
lee+0.211
 ago+0.203
 spec+0.195
od+0.195
Neg logit contributions
P-0.259
His-0.236
ercise-0.225
Mid-0.216
w-0.216
Token He
Token ID679
Feature activation-4.83e-04

Pos logit contributions
 amaz+0.231
 ago+0.229
lee+0.228
 spec+0.207
od+0.205
Neg logit contributions
P-0.247
w-0.231
ercise-0.223
His-0.218
 helmet-0.217
Token saw
Token ID2497
Feature activation+8.81e-02

Pos logit contributions
P+0.001
His+0.001
ercise+0.001
w+0.001
Spot+0.001
Neg logit contributions
lee-0.001
 amaz-0.001
 mesmer-0.001
 ago-0.001
 spec-0.001
Token a
Token ID257
Feature activation+8.90e-02

Pos logit contributions
 amaz+0.138
lee+0.131
 spec+0.124
od+0.123
 ago+0.121
Neg logit contributions
P-0.118
His-0.107
ercise-0.102
w-0.098
OK-0.095
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
 amaz+0.169
lee+0.162
 ago+0.157
od+0.157
 spec+0.152
Neg logit contributions
P-0.208
ercise-0.188
His-0.184
Mid-0.173
OK-0.171
Token He
Token ID679
Feature activation-2.51e-02

Pos logit contributions
lee+0.199
 amaz+0.185
 ago+0.179
 spec+0.165
ents+0.163
Neg logit contributions
P-0.222
w-0.206
ercise-0.199
use-0.189
 helmet-0.180
Token felt
Token ID2936
Feature activation+1.24e-01

Pos logit contributions
P+0.029
His+0.028
ercise+0.025
 helmet+0.024
Spot+0.023
Neg logit contributions
 amaz-0.042
lee-0.041
 spec-0.038
od-0.038
 ago-0.038
Token a
Token ID257
Feature activation+6.86e-02

Pos logit contributions
 amaz+0.213
lee+0.206
od+0.196
 ago+0.194
 angles+0.193
Neg logit contributions
P-0.138
His-0.135
 helmet-0.116
ercise-0.114
Spot-0.110
Token lot
Token ID1256
Feature activation-3.42e-04

Pos logit contributions
lee+0.130
 amaz+0.124
 ago+0.124
od+0.124
 angles+0.123
Neg logit contributions
His-0.068
P-0.067
 helmet-0.057
O-0.057
Spot-0.052
Token of
Token ID286
Feature activation+4.17e-01

Pos logit contributions
P+0.001
His+0.001
 helmet+0.001
ercise+0.001
Spot+0.001
Neg logit contributions
 amaz-0.000
lee-0.000
od-0.000
 spec-0.000
 ago-0.000
Token pain
Token ID2356
Feature activation+1.66e-01

Pos logit contributions
 amaz+0.559
lee+0.522
 spec+0.489
 ago+0.482
 angles+0.478
Neg logit contributions
P-0.645
His-0.611
ercise-0.567
 helmet-0.543
Spot-0.535
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
lee+0.190
 ago+0.187
ous+0.186
 amaz+0.186
 spec+0.182
Neg logit contributions
P-0.283
ercise-0.258
His-0.253
OK-0.245
Br-0.235
Token started
Token ID2067
Feature activation+7.31e-02

Pos logit contributions
 amaz+0.229
lee+0.219
 spec+0.207
 ago+0.204
 angles+0.202
Neg logit contributions
P-0.149
His-0.134
ercise-0.123
 helmet-0.114
Spot-0.113
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
 amaz+0.109
lee+0.104
 spec+0.097
 ago+0.096
od+0.095
Neg logit contributions
P-0.101
His-0.094
ercise-0.088
 helmet-0.085
Spot-0.082
Token cry
Token ID3960
Feature activation+1.62e-01

Pos logit contributions
 amaz+0.130
lee+0.119
 spec+0.112
od+0.107
 angles+0.107
Neg logit contributions
P-0.174
His-0.161
ercise-0.152
 helmet-0.144
OK-0.142
Tokenâ
Token ID22940
Feature activation+1.59e-01

Pos logit contributions
 amaz+0.354
 spec+0.346
lee+0.322
 mesmer+0.318
od+0.310
Neg logit contributions
P-0.432
His-0.420
 helmet-0.370
ercise-0.363
Br-0.360
Token
Token ID26391
Feature activation+1.05e-01

Pos logit contributions
lee+0.225
 amaz+0.222
 ago+0.214
keeping+0.208
 angles+0.208
Neg logit contributions
His-0.197
 helmet-0.188
P-0.184
 helmets-0.173
 his-0.164
Token
Token ID8151
Feature activation+2.69e-01

Pos logit contributions
 amaz+0.172
lee+0.164
 spec+0.155
od+0.153
 angles+0.153
Neg logit contributions
P-0.126
His-0.119
ercise-0.107
 helmet-0.104
Spot-0.101
Tokens
Token ID82
Feature activation+1.04e-01

Pos logit contributions
lee+0.358
 ago+0.348
od+0.347
 amaz+0.342
ents+0.327
Neg logit contributions
P-0.381
His-0.349
ercise-0.349
 helmet-0.344
Mid-0.318
Token full
Token ID1336
Feature activation+2.65e-01

Pos logit contributions
 amaz+0.141
 ago+0.132
lee+0.131
 spec+0.121
 mesmer+0.121
Neg logit contributions
P-0.171
His-0.154
Move-0.135
ercise-0.135
Spot-0.132
Token of
Token ID286
Feature activation+4.53e-01

Pos logit contributions
 amaz+0.061
lee+0.049
 ago+0.036
 of+0.034
 spec+0.030
Neg logit contributions
P-0.687
His-0.658
ercise-0.649
 helmet-0.622
Mid-0.622
Token life
Token ID1204
Feature activation+1.22e-01

Pos logit contributions
 amaz+0.665
lee+0.620
 angles+0.585
 spec+0.584
 ago+0.579
Neg logit contributions
P-0.654
His-0.619
ercise-0.557
Spot-0.538
Mid-0.533
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 amaz+0.170
lee+0.163
 spec+0.156
 ago+0.156
 angles+0.149
Neg logit contributions
P-0.182
His-0.178
ercise-0.154
Br-0.153
Move-0.149
Tokenâ
Token ID22940
Feature activation+1.16e-01

Pos logit contributions
 amaz+0.211
lee+0.201
 ago+0.195
 spec+0.185
od+0.183
Neg logit contributions
P-0.213
His-0.199
ercise-0.190
 helmet-0.187
w-0.185
Token
Token ID26391
Feature activation+6.40e-02

Pos logit contributions
lee+0.170
 amaz+0.165
 ago+0.158
 angles+0.157
od+0.156
Neg logit contributions
P-0.142
His-0.140
 helmet-0.131
 helmets-0.122
ercise-0.117
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
 amaz+0.105
lee+0.100
 spec+0.095
 angles+0.094
od+0.093
Neg logit contributions
P-0.077
His-0.073
ercise-0.065
 helmet-0.064
Spot-0.063
Token to
Token ID284
Feature activation+1.48e-01

Pos logit contributions
 amaz+0.143
lee+0.138
 mesmer+0.131
 ago+0.130
od+0.121
Neg logit contributions
P-0.230
Mid-0.197
w-0.193
Br-0.193
His-0.190
Token take
Token ID1011
Feature activation+1.33e-01

Pos logit contributions
 amaz+0.239
lee+0.227
 spec+0.213
od+0.212
 ago+0.211
Neg logit contributions
P-0.183
His-0.172
ercise-0.154
 helmet-0.150
Spot-0.146
Token a
Token ID257
Feature activation+8.83e-02

Pos logit contributions
 amaz+0.192
lee+0.183
od+0.170
 spec+0.166
 awe+0.162
Neg logit contributions
P-0.179
His-0.173
Sorry-0.163
OK-0.160
Deal-0.156
Token bite
Token ID13197
Feature activation+1.04e-01

Pos logit contributions
 amaz+0.148
lee+0.139
 spec+0.132
od+0.131
 ago+0.131
Neg logit contributions
P-0.103
His-0.098
ercise-0.088
 helmet-0.082
Spot-0.082
Token out
Token ID503
Feature activation+3.89e-02

Pos logit contributions
 of+0.042
 amaz+0.032
od+0.029
 spec+0.023
lee+0.022
Neg logit contributions
P-0.247
ercise-0.236
His-0.235
Wh-0.230
Mid-0.229
Token of
Token ID286
Feature activation+4.33e-01

Pos logit contributions
 amaz+0.040
lee+0.037
od+0.032
 ago+0.032
 spec+0.031
Neg logit contributions
P-0.075
His-0.068
ercise-0.067
OK-0.065
Spot-0.065
Token it
Token ID340
Feature activation+2.06e-01

Pos logit contributions
 amaz+0.687
lee+0.652
 spec+0.614
od+0.607
 ago+0.607
Neg logit contributions
P-0.552
His-0.522
ercise-0.471
 helmet-0.460
Spot-0.445
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 amaz+0.199
 of+0.175
 ago+0.172
od+0.169
lee+0.165
Neg logit contributions
P-0.404
Br-0.369
Move-0.360
Sorry-0.360
His-0.359
Token It
Token ID632
Feature activation+2.08e-01

Pos logit contributions
 amaz+0.224
lee+0.211
 ago+0.203
 mesmer+0.195
 spec+0.194
Neg logit contributions
P-0.176
w-0.156
ercise-0.146
His-0.139
mar-0.134
Token was
Token ID373
Feature activation+9.55e-02

Pos logit contributions
 amaz+0.247
 spec+0.225
 mesmer+0.220
lee+0.211
od+0.207
Neg logit contributions
P-0.358
His-0.320
ercise-0.316
 helmet-0.313
erve-0.305
Token crunch
Token ID23527
Feature activation+1.26e-01

Pos logit contributions
 amaz+0.136
lee+0.121
 mesmer+0.118
 spec+0.117
 ago+0.115
Neg logit contributions
P-0.151
His-0.136
ercise-0.130
OK-0.127
w-0.124
Token play
Token ID711
Feature activation-5.18e-02

Pos logit contributions
 amaz+0.091
lee+0.088
 spec+0.084
 ago+0.083
od+0.082
Neg logit contributions
P-0.051
His-0.050
 helmet-0.043
ercise-0.043
Spot-0.039
Token.
Token ID13
Feature activation+5.57e-02

Pos logit contributions
P+0.074
His+0.071
ercise+0.070
w+0.068
use+0.067
Neg logit contributions
 amaz-0.065
od-0.061
lee-0.058
 of-0.057
 spec-0.056
Token She
Token ID1375
Feature activation-1.56e-04

Pos logit contributions
 amaz+0.095
lee+0.091
 spec+0.086
od+0.085
 ago+0.085
Neg logit contributions
P-0.065
His-0.061
ercise-0.054
 helmet-0.053
w-0.051
Token thought
Token ID1807
Feature activation+5.53e-02

Pos logit contributions
P+0.000
His+0.000
ercise+0.000
w+0.000
 helmet+0.000
Neg logit contributions
lee-0.000
 amaz-0.000
ents-0.000
 mesmer-0.000
 spec-0.000
Token it
Token ID340
Feature activation+1.18e-01

Pos logit contributions
 amaz+0.083
lee+0.076
 spec+0.074
 angles+0.073
 ago+0.071
Neg logit contributions
P-0.072
His-0.071
 helmet-0.065
ercise-0.063
 his-0.062
Token could
Token ID714
Feature activation+1.48e-01

Pos logit contributions
 spec+0.123
 amaz+0.120
od+0.117
 mesmer+0.109
ous+0.109
Neg logit contributions
P-0.215
His-0.195
ercise-0.182
use-0.177
Br-0.175
Token be
Token ID307
Feature activation-3.27e-02

Pos logit contributions
 amaz+0.176
 ago+0.155
 spe+0.154
 contain+0.152
 unic+0.151
Neg logit contributions
P-0.251
ercise-0.220
His-0.207
w-0.202
ww-0.202
Token fun
Token ID1257
Feature activation-7.52e-02

Pos logit contributions
His+0.050
P+0.049
Sorry+0.045
OK+0.044
w+0.044
Neg logit contributions
lee-0.043
 amaz-0.041
 ago-0.040
od-0.039
 spec-0.038
Token to
Token ID284
Feature activation+3.67e-02

Pos logit contributions
P+0.091
His+0.086
ercise+0.077
 helmet+0.075
Spot+0.072
Neg logit contributions
 amaz-0.124
lee-0.119
 spec-0.112
od-0.111
 ago-0.111
Token climb
Token ID12080
Feature activation-5.60e-02

Pos logit contributions
 amaz+0.054
lee+0.053
od+0.051
 angles+0.048
 ago+0.048
Neg logit contributions
P-0.052
His-0.046
w-0.043
ercise-0.042
Mid-0.041
Token up
Token ID510
Feature activation-5.78e-02

Pos logit contributions
P+0.089
His+0.088
ercise+0.079
OK+0.076
Br+0.076
Neg logit contributions
 amaz-0.067
lee-0.066
 spec-0.064
 ago-0.062
 rooft-0.061
Token believed
Token ID4762
Feature activation+1.44e-01

Pos logit contributions
 amaz+0.015
lee+0.015
od+0.013
 mesmer+0.013
 spec+0.013
Neg logit contributions
P-0.015
His-0.014
ercise-0.013
 helmet-0.013
Spot-0.013
Token it
Token ID340
Feature activation+1.11e-01

Pos logit contributions
 amaz+0.204
lee+0.195
 spec+0.181
od+0.180
 ago+0.179
Neg logit contributions
P-0.209
His-0.198
ercise-0.182
 helmet-0.178
Spot-0.174
Token was
Token ID373
Feature activation+1.52e-02

Pos logit contributions
 amaz+0.145
lee+0.135
od+0.131
 angles+0.127
 ago+0.125
Neg logit contributions
P-0.183
His-0.163
Br-0.158
Spot-0.156
ercise-0.155
Token from
Token ID422
Feature activation+1.71e-01

Pos logit contributions
 amaz+0.022
lee+0.021
 ago+0.020
od+0.019
 angles+0.019
Neg logit contributions
P-0.023
His-0.021
ercise-0.019
Spot-0.019
 helmet-0.019
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
 amaz+0.257
lee+0.243
 ago+0.227
 spec+0.227
od+0.225
Neg logit contributions
P-0.235
His-0.223
ercise-0.203
 helmet-0.198
Spot-0.192
Token magical
Token ID10883
Feature activation+1.82e-01

Pos logit contributions
P+0.031
His+0.029
ercise+0.027
w+0.025
Spot+0.025
Neg logit contributions
 amaz-0.033
lee-0.030
 spec-0.029
 ago-0.029
 angles-0.028
Token place
Token ID1295
Feature activation+3.46e-02

Pos logit contributions
 amaz+0.251
lee+0.237
 spec+0.221
 ago+0.219
od+0.218
Neg logit contributions
P-0.268
His-0.257
ercise-0.235
 helmet-0.229
Spot-0.224
Token and
Token ID290
Feature activation+4.82e-02

Pos logit contributions
 amaz+0.055
lee+0.052
 spec+0.049
od+0.049
 angles+0.048
Neg logit contributions
P-0.042
His-0.040
 helmet-0.036
ercise-0.036
Spot-0.034
Token so
Token ID523
Feature activation+1.51e-01

Pos logit contributions
 amaz+0.073
lee+0.072
od+0.070
 spec+0.069
 ago+0.067
Neg logit contributions
P-0.058
His-0.057
ercise-0.051
 helmet-0.051
 his-0.049
Token she
Token ID673
Feature activation-1.86e-03

Pos logit contributions
 amaz+0.241
lee+0.232
 spec+0.222
od+0.219
 ago+0.217
Neg logit contributions
P-0.183
His-0.176
ercise-0.157
 helmet-0.155
w-0.148
Token tucked
Token ID29779
Feature activation+1.14e-01

Pos logit contributions
P+0.002
His+0.002
ercise+0.002
 helmet+0.002
Spot+0.002
Neg logit contributions
 amaz-0.003
lee-0.003
 spec-0.003
 mesmer-0.003
od-0.003
Token saw
Token ID2497
Feature activation+8.93e-02

Pos logit contributions
 amaz+0.172
lee+0.167
 spec+0.155
 ago+0.153
 mesmer+0.152
Neg logit contributions
P-0.120
His-0.112
ercise-0.103
 helmet-0.099
w-0.096
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
 amaz+0.152
lee+0.147
 ago+0.139
 angles+0.138
 spec+0.137
Neg logit contributions
P-0.094
His-0.092
 his-0.082
 helmet-0.080
Spot-0.078
Token broken
Token ID5445
Feature activation+5.68e-02

Pos logit contributions
 amaz+0.326
lee+0.308
 spec+0.292
 ago+0.287
od+0.286
Neg logit contributions
P-0.293
His-0.280
ercise-0.254
 helmet-0.245
Spot-0.238
Token switch
Token ID5078
Feature activation+1.58e-01

Pos logit contributions
 amaz+0.080
lee+0.075
 spec+0.073
 ago+0.072
 angles+0.069
Neg logit contributions
P-0.086
His-0.079
ercise-0.076
OK-0.070
Spot-0.069
Token and
Token ID290
Feature activation+1.51e-01

Pos logit contributions
 amaz+0.209
lee+0.196
 ago+0.195
 spec+0.186
od+0.186
Neg logit contributions
P-0.265
His-0.241
Br-0.225
ercise-0.223
Mid-0.213
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
 amaz+0.242
lee+0.230
 spec+0.217
 ago+0.214
od+0.213
Neg logit contributions
P-0.194
His-0.182
ercise-0.166
 helmet-0.161
Spot-0.156
Token dark
Token ID3223
Feature activation+2.45e-02

Pos logit contributions
 amaz+0.300
lee+0.275
 spec+0.271
 ago+0.264
 angles+0.262
Neg logit contributions
P-0.280
His-0.267
ercise-0.245
 helmet-0.224
Spot-0.224
Token room
Token ID2119
Feature activation+2.34e-01

Pos logit contributions
 amaz+0.034
lee+0.032
od+0.030
 spec+0.030
 ago+0.029
Neg logit contributions
P-0.036
His-0.034
ercise-0.031
 helmet-0.031
Spot-0.030
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 amaz+0.330
lee+0.313
 ago+0.301
 spec+0.295
od+0.291
Neg logit contributions
P-0.355
His-0.325
ercise-0.315
Spot-0.292
 helmet-0.291
Token She
Token ID1375
Feature activation+6.59e-02

Pos logit contributions
 amaz+0.217
lee+0.208
 spec+0.196
 ago+0.196
od+0.194
Neg logit contributions
P-0.146
His-0.137
ercise-0.124
 helmet-0.121
w-0.115
Token was
Token ID373
Feature activation+9.19e-02

Pos logit contributions
 amaz+0.107
lee+0.103
 spec+0.097
od+0.096
 ago+0.095
Neg logit contributions
P-0.081
His-0.076
ercise-0.070
 helmet-0.068
Spot-0.065
Token.
Token ID13
Feature activation+2.25e-02

Pos logit contributions
lee+0.016
 amaz+0.015
od+0.015
 angles+0.014
ents+0.013
Neg logit contributions
P-0.024
ercise-0.021
Move-0.020
w-0.020
O-0.020
Token
Token ID198
Feature activation+2.49e-02

Pos logit contributions
 amaz+0.028
lee+0.028
 ago+0.025
 spec+0.025
 mesmer+0.024
Neg logit contributions
P-0.037
w-0.034
ercise-0.034
 helmet-0.032
His-0.031
Token
Token ID198
Feature activation+2.42e-02

Pos logit contributions
 amaz+0.033
lee+0.033
od+0.031
 angles+0.029
 spec+0.029
Neg logit contributions
P-0.034
ercise-0.032
 helmet-0.032
His-0.031
w-0.030
TokenSo
Token ID2396
Feature activation-4.87e-02

Pos logit contributions
 amaz+0.036
od+0.034
lee+0.033
 mesmer+0.032
 angles+0.032
Neg logit contributions
 helmet-0.027
 down-0.025
w-0.025
use-0.025
ercise-0.025
Token,
Token ID11
Feature activation-8.90e-02

Pos logit contributions
P+0.066
His+0.061
ercise+0.057
 helmet+0.055
w+0.054
Neg logit contributions
 amaz-0.075
lee-0.070
 spec-0.066
od-0.065
 angles-0.065
Token Maya
Token ID26041
Feature activation+1.43e-01

Pos logit contributions
P+0.134
His+0.125
ercise+0.118
w+0.114
 helmet+0.113
Neg logit contributions
 amaz-0.123
lee-0.112
 spec-0.108
od-0.107
 mesmer-0.105
Token and
Token ID290
Feature activation+2.46e-02

Pos logit contributions
 amaz+0.198
lee+0.196
od+0.179
 ago+0.177
 mesmer+0.175
Neg logit contributions
P-0.205
His-0.193
ercise-0.182
 helmet-0.177
Move-0.177
Token Sam
Token ID3409
Feature activation+4.88e-03

Pos logit contributions
 amaz+0.039
lee+0.037
 spec+0.035
od+0.035
 ago+0.035
Neg logit contributions
P-0.031
His-0.030
ercise-0.027
 helmet-0.026
w-0.025
Token tried
Token ID3088
Feature activation-6.96e-02

Pos logit contributions
 amaz+0.007
lee+0.007
od+0.006
ents+0.006
 ago+0.006
Neg logit contributions
P-0.007
His-0.006
Move-0.006
Spot-0.006
ww-0.006
Token their
Token ID511
Feature activation-7.61e-02

Pos logit contributions
P+0.105
His+0.092
ercise+0.089
w+0.088
Spot+0.084
Neg logit contributions
 amaz-0.101
lee-0.095
od-0.088
 angles-0.087
 mesmer-0.086
Token best
Token ID1266
Feature activation-1.83e-01

Pos logit contributions
P+0.080
His+0.077
ercise+0.061
O+0.061
w+0.060
Neg logit contributions
 amaz-0.138
lee-0.130
 angles-0.128
 spec-0.125
od-0.125
TokenThe
Token ID464
Feature activation-8.48e-03

Pos logit contributions
 amaz+0.005
 mesmer+0.005
lee+0.005
od+0.005
 ago+0.005
Neg logit contributions
P-0.004
 helmet-0.004
ercise-0.004
w-0.004
use-0.004
Token three
Token ID1115
Feature activation+9.49e-02

Pos logit contributions
P+0.013
His+0.013
ercise+0.012
 helmet+0.012
w+0.011
Neg logit contributions
 amaz-0.011
lee-0.010
 spec-0.010
 ago-0.009
od-0.009
Token-
Token ID12
Feature activation-8.70e-02

Pos logit contributions
 amaz+0.132
lee+0.123
 spec+0.116
 ago+0.114
 angles+0.113
Neg logit contributions
P-0.139
His-0.132
ercise-0.119
 helmet-0.119
Spot-0.115
Tokenyear
Token ID1941
Feature activation+2.80e-01

Pos logit contributions
P+0.188
His+0.183
w+0.168
ercise+0.168
Spot+0.163
Neg logit contributions
 amaz-0.069
lee-0.054
 spec-0.049
 angles-0.045
 awe-0.044
Token-
Token ID12
Feature activation-8.86e-02

Pos logit contributions
 amaz+0.552
 spec+0.498
lee+0.497
 angles+0.485
od+0.483
Neg logit contributions
P-0.255
His-0.245
 helmet-0.204
ercise-0.201
w-0.186
Tokenold
Token ID727
Feature activation+2.45e-01

Pos logit contributions
P+0.181
His+0.176
ercise+0.161
w+0.161
 helmet+0.156
Neg logit contributions
 amaz-0.080
lee-0.064
 spec-0.062
 angles-0.056
 awe-0.055
Token asked
Token ID1965
Feature activation-2.03e-02

Pos logit contributions
lee+0.341
 amaz+0.322
od+0.303
 ago+0.301
 spec+0.298
Neg logit contributions
P-0.379
His-0.352
ercise-0.332
O-0.319
Br-0.319
Token Daddy
Token ID26926
Feature activation+7.87e-02

Pos logit contributions
P+0.037
Spot+0.034
w+0.034
ercise+0.034
His+0.032
Neg logit contributions
 amaz-0.024
lee-0.023
od-0.021
 ago-0.019
 awe-0.018
Token,
Token ID11
Feature activation-1.14e-01

Pos logit contributions
 amaz+0.099
lee+0.097
od+0.095
 ago+0.094
 angles+0.088
Neg logit contributions
P-0.134
His-0.114
w-0.112
O-0.112
Spot-0.110
Token "
Token ID366
Feature activation-2.74e-02

Pos logit contributions
P+0.206
w+0.200
ercise+0.187
uch+0.186
erve+0.184
Neg logit contributions
 amaz-0.131
 mesmer-0.114
lee-0.113
 ago-0.112
 awe-0.108
TokenWhere
Token ID8496
Feature activation+5.98e-03

Pos logit contributions
P+0.027
His+0.025
ercise+0.021
 helmet+0.020
Spot+0.019
Neg logit contributions
 amaz-0.052
lee-0.051
 spec-0.048
 ago-0.048
od-0.047
Token you
Token ID345
Feature activation+2.30e-02

Pos logit contributions
 amaz+0.214
lee+0.205
 spec+0.195
od+0.193
 ago+0.192
Neg logit contributions
P-0.145
His-0.136
ercise-0.122
 helmet-0.117
Spot-0.113
Token eat
Token ID4483
Feature activation+6.86e-02

Pos logit contributions
 amaz+0.023
od+0.021
 contain+0.021
 ago+0.021
 of+0.020
Neg logit contributions
P-0.042
His-0.038
ercise-0.036
Mid-0.036
Spot-0.034
Token?"
Token ID1701
Feature activation-5.63e-02

Pos logit contributions
 amaz+0.068
 of+0.064
ents+0.064
 ago+0.063
 rooft+0.060
Neg logit contributions
His-0.113
OK-0.112
P-0.110
Wait-0.108
Sorry-0.105
Token Tom
Token ID4186
Feature activation+5.58e-02

Pos logit contributions
P+0.086
ercise+0.083
w+0.078
uch+0.073
erve+0.072
Neg logit contributions
 amaz-0.073
lee-0.072
 ago-0.071
 spec-0.066
 mesmer-0.064
Token asked
Token ID1965
Feature activation+7.92e-02

Pos logit contributions
lee+0.073
 amaz+0.071
od+0.067
 ago+0.066
 mesmer+0.065
Neg logit contributions
P-0.084
His-0.077
ercise-0.077
w-0.072
Spot-0.072
Token.
Token ID13
Feature activation+9.13e-02

Pos logit contributions
lee+0.078
 spec+0.072
od+0.070
 amaz+0.068
 ago+0.067
Neg logit contributions
P-0.149
ercise-0.147
ww-0.140
w-0.134
OK-0.128
Token
Token ID198
Feature activation+9.33e-02

Pos logit contributions
 amaz+0.112
 ago+0.104
lee+0.104
 spec+0.097
 mesmer+0.097
Neg logit contributions
P-0.152
w-0.145
ercise-0.145
uch-0.128
ww-0.127
Token
Token ID198
Feature activation+9.15e-02

Pos logit contributions
lee+0.111
 amaz+0.110
 ago+0.103
od+0.099
 spec+0.096
Neg logit contributions
P-0.139
ercise-0.136
 helmet-0.128
His-0.124
w-0.123
Token"
Token ID1
Feature activation-1.23e-01

Pos logit contributions
 ago+0.114
ents+0.113
 amaz+0.108
lee+0.103
spe+0.103
Neg logit contributions
ercise-0.109
 down-0.102
 melts-0.096
use-0.093
w-0.092
TokenSnow
Token ID28974
Feature activation-9.45e-02

Pos logit contributions
 down+0.151
ercise+0.138
 away+0.138
erve+0.134
irrel+0.132
Neg logit contributions
 mesmer-0.160
 amaz-0.155
 spec-0.154
leaf-0.152
 outage-0.149
Tokenman
Token ID805
Feature activation-5.68e-02

Pos logit contributions
P+0.136
His+0.131
Br+0.120
ercise+0.113
One+0.109
Neg logit contributions
 ago-0.130
ous-0.125
lee-0.124
od-0.121
 spec-0.119
Token what
Token ID644
Feature activation-1.27e-03

Pos logit contributions
 amaz+0.356
lee+0.328
 spec+0.327
 ago+0.326
 mesmer+0.313
Neg logit contributions
His-0.182
P-0.177
 his-0.166
 helmet-0.158
 down-0.153
Token she
Token ID673
Feature activation-6.84e-02

Pos logit contributions
P+0.002
His+0.002
ercise+0.002
 helmet+0.002
w+0.002
Neg logit contributions
 amaz-0.002
lee-0.002
od-0.002
 spec-0.002
 ago-0.002
Token saw
Token ID2497
Feature activation-1.02e-01

Pos logit contributions
P+0.075
His+0.068
ercise+0.062
OK+0.060
 helmet+0.060
Neg logit contributions
 amaz-0.117
lee-0.112
 spec-0.107
 angles-0.106
 mesmer-0.105
Token -
Token ID532
Feature activation+1.25e-02

Pos logit contributions
P+0.147
His+0.143
ercise+0.130
 helmet+0.126
Mid+0.122
Neg logit contributions
 amaz-0.145
lee-0.137
 spec-0.130
 ago-0.130
od-0.128
Token a
Token ID257
Feature activation-9.95e-02

Pos logit contributions
 amaz+0.019
lee+0.017
 spec+0.017
 ago+0.016
 mesmer+0.016
Neg logit contributions
P-0.018
His-0.017
ercise-0.015
 helmet-0.014
w-0.014
Token beautiful
Token ID4950
Feature activation+4.94e-02

Pos logit contributions
P+0.150
His+0.146
ercise+0.136
w+0.128
Mid+0.127
Neg logit contributions
 amaz-0.139
lee-0.124
 spec-0.120
 mesmer-0.116
 ago-0.115
Token castle
Token ID16669
Feature activation+3.98e-02

Pos logit contributions
 amaz+0.066
lee+0.060
 spec+0.057
 ago+0.055
 mesmer+0.054
Neg logit contributions
P-0.080
His-0.076
ercise-0.070
w-0.068
Mid-0.067
Token!
Token ID0
Feature activation-1.20e-02

Pos logit contributions
 amaz+0.060
lee+0.057
 spec+0.053
od+0.053
 angles+0.053
Neg logit contributions
P-0.053
His-0.050
ercise-0.045
 helmet-0.045
Spot-0.043
Token She
Token ID1375
Feature activation-9.28e-02

Pos logit contributions
P+0.013
His+0.013
ercise+0.011
Spot+0.011
Move+0.010
Neg logit contributions
 amaz-0.021
lee-0.019
 spec-0.019
od-0.019
 angles-0.018
Token ran
Token ID4966
Feature activation-1.24e-01

Pos logit contributions
P+0.118
His+0.111
ercise+0.101
 helmet+0.098
Spot+0.095
Neg logit contributions
 amaz-0.148
lee-0.141
 spec-0.132
od-0.131
 ago-0.131
Token towards
Token ID3371
Feature activation-9.77e-03

Pos logit contributions
P+0.167
His+0.152
ercise+0.139
w+0.136
 helmet+0.132
Neg logit contributions
 amaz-0.192
lee-0.186
 spec-0.173
 ago-0.173
 mesmer-0.172
Token but
Token ID475
Feature activation-7.71e-02

Pos logit contributions
P+0.217
His+0.189
ercise+0.185
w+0.175
ww+0.173
Neg logit contributions
 amaz-0.200
lee-0.186
 mesmer-0.177
 ago-0.176
 spec-0.174
Token no
Token ID645
Feature activation-2.41e-02

Pos logit contributions
P+0.127
His+0.116
ercise+0.113
use+0.106
 helmet+0.104
Neg logit contributions
 amaz-0.097
lee-0.088
 spec-0.084
 ago-0.083
 mesmer-0.081
Token one
Token ID530
Feature activation-1.89e-01

Pos logit contributions
P+0.029
His+0.026
ercise+0.023
Spot+0.021
w+0.021
Neg logit contributions
 amaz-0.042
 spec-0.041
 ago-0.038
 angles-0.038
lee-0.038
Token heard
Token ID2982
Feature activation-3.15e-02

Pos logit contributions
P+0.215
His+0.205
ercise+0.179
 helmet+0.179
w+0.171
Neg logit contributions
 amaz-0.321
lee-0.309
 spec-0.291
od-0.287
 ago-0.286
Token her
Token ID607
Feature activation-1.59e-01

Pos logit contributions
P+0.041
His+0.035
ercise+0.034
OK+0.032
 helmet+0.031
Neg logit contributions
 amaz-0.052
lee-0.051
 spec-0.047
 ago-0.047
od-0.046
Token.
Token ID13
Feature activation-3.97e-02

Pos logit contributions
His+0.178
P+0.173
 helmet+0.161
ercise+0.148
Spot+0.148
Neg logit contributions
 amaz-0.261
lee-0.257
 spec-0.242
 angles-0.238
od-0.236
Token She
Token ID1375
Feature activation-9.50e-02

Pos logit contributions
P+0.068
w+0.065
ercise+0.063
 helmet+0.062
His+0.060
Neg logit contributions
lee-0.049
 amaz-0.045
 ago-0.043
 spec-0.039
ents-0.039
Token felt
Token ID2936
Feature activation-4.95e-02

Pos logit contributions
P+0.106
His+0.100
ercise+0.088
 helmet+0.086
Spot+0.083
Neg logit contributions
 amaz-0.165
lee-0.158
 spec-0.150
od-0.149
 ago-0.149
Token very
Token ID845
Feature activation-7.92e-02

Pos logit contributions
P+0.051
His+0.050
 helmet+0.042
ercise+0.041
 his+0.041
Neg logit contributions
 amaz-0.088
lee-0.086
od-0.084
 ago-0.082
 angles-0.081
Token sorry
Token ID7926
Feature activation-1.04e-01

Pos logit contributions
P+0.102
His+0.097
ercise+0.089
 helmet+0.086
Spot+0.083
Neg logit contributions
 amaz-0.124
lee-0.119
 spec-0.111
 ago-0.110
 angles-0.109
Token for
Token ID329
Feature activation-9.64e-02

Pos logit contributions
P+0.178
His+0.158
ercise+0.157
Move+0.147
Spot+0.147
Neg logit contributions
 amaz-0.124
lee-0.119
od-0.111
 ago-0.109
 spec-0.107
Tokenily
Token ID813
Feature activation+2.17e-02

Pos logit contributions
P+0.281
His+0.258
ercise+0.231
w+0.227
Spot+0.227
Neg logit contributions
 amaz-0.320
lee-0.291
 spec-0.281
 mesmer-0.278
 angles-0.273
Token ran
Token ID4966
Feature activation-1.22e-01

Pos logit contributions
lee+0.030
 amaz+0.029
 mesmer+0.025
od+0.025
 spec+0.024
Neg logit contributions
P-0.034
His-0.032
w-0.030
Br-0.030
Spot-0.029
Token outside
Token ID2354
Feature activation-8.59e-02

Pos logit contributions
P+0.176
His+0.163
ercise+0.152
w+0.150
 helmet+0.146
Neg logit contributions
 amaz-0.165
lee-0.165
 ago-0.154
 mesmer-0.149
od-0.148
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
 his+0.065
 down+0.057
His+0.056
 away+0.049
P+0.039
Neg logit contributions
lee-0.185
 amaz-0.183
ocket-0.177
 spec-0.174
 angles-0.171
Token joined
Token ID5399
Feature activation+3.11e-02

Pos logit contributions
P+0.026
His+0.025
ercise+0.022
 helmet+0.021
Spot+0.020
Neg logit contributions
 amaz-0.041
lee-0.039
 spec-0.037
od-0.037
 ago-0.037
Token the
Token ID262
Feature activation+2.34e-02

Pos logit contributions
 amaz+0.053
lee+0.051
 spec+0.048
od+0.048
 ago+0.047
Neg logit contributions
P-0.036
His-0.034
ercise-0.030
 helmet-0.029
Spot-0.028
Token other
Token ID584
Feature activation-2.08e-02

Pos logit contributions
 amaz+0.033
lee+0.031
 ago+0.029
od+0.029
 spec+0.029
Neg logit contributions
P-0.034
His-0.033
ercise-0.030
 helmet-0.029
w-0.028
Token kids
Token ID3988
Feature activation+6.79e-03

Pos logit contributions
His+0.037
P+0.037
ercise+0.034
w+0.033
 helmet+0.033
Neg logit contributions
 amaz-0.023
lee-0.021
 ago-0.021
 spec-0.020
od-0.020
Token.
Token ID13
Feature activation-3.73e-02

Pos logit contributions
 amaz+0.011
lee+0.010
 ago+0.010
od+0.010
 spec+0.010
Neg logit contributions
P-0.009
His-0.008
ercise-0.007
 helmet-0.007
w-0.007
Token They
Token ID1119
Feature activation-8.06e-02

Pos logit contributions
P+0.051
His+0.047
w+0.043
 helmet+0.043
ercise+0.043
Neg logit contributions
 amaz-0.057
lee-0.057
 ago-0.053
 spec-0.051
od-0.051
Token were
Token ID547
Feature activation+2.73e-02

Pos logit contributions
P+0.097
His+0.092
ercise+0.083
 helmet+0.082
Spot+0.079
Neg logit contributions
 amaz-0.132
lee-0.125
 spec-0.120
 ago-0.118
od-0.117
Token One
Token ID1881
Feature activation-1.15e-01

Pos logit contributions
 amaz+0.036
lee+0.035
 ago+0.033
 spec+0.033
od+0.032
Neg logit contributions
P-0.019
His-0.017
ercise-0.015
w-0.015
 helmet-0.015
Token day
Token ID1110
Feature activation-2.27e-02

Pos logit contributions
P+0.119
His+0.112
 helmet+0.094
ercise+0.093
w+0.090
Neg logit contributions
 amaz-0.213
lee-0.205
 spec-0.196
 angles-0.193
 ago-0.192
Token,
Token ID11
Feature activation-8.60e-02

Pos logit contributions
P+0.030
ercise+0.030
His+0.029
 helmet+0.028
OK+0.027
Neg logit contributions
 ago-0.030
lee-0.029
 spec-0.029
 amaz-0.029
od-0.028
Token Lily
Token ID20037
Feature activation-7.79e-02

Pos logit contributions
P+0.112
His+0.105
ercise+0.102
 helmet+0.095
erve+0.093
Neg logit contributions
 amaz-0.125
lee-0.118
 spec-0.118
 ago-0.113
od-0.111
Token's
Token ID338
Feature activation+9.66e-03

Pos logit contributions
P+0.080
His+0.069
ercise+0.061
 helmet+0.059
w+0.059
Neg logit contributions
 amaz-0.147
lee-0.138
 spec-0.133
 angles-0.133
od-0.131
Token mom
Token ID1995
Feature activation+6.88e-04

Pos logit contributions
 amaz+0.019
 angles+0.019
 spec+0.018
 mesmer+0.018
keeping+0.018
Neg logit contributions
P-0.008
 helmet-0.006
His-0.006
 boots-0.006
 nice-0.005
Token told
Token ID1297
Feature activation-1.27e-02

Pos logit contributions
 amaz+0.001
lee+0.001
od+0.001
 spec+0.001
 ago+0.001
Neg logit contributions
P-0.001
His-0.001
ercise-0.001
 helmet-0.001
Spot-0.001
Token her
Token ID607
Feature activation-1.07e-01

Pos logit contributions
P+0.011
His+0.010
ercise+0.008
 helmet+0.008
Spot+0.008
Neg logit contributions
 amaz-0.025
lee-0.025
 spec-0.023
od-0.023
 ago-0.023
Token that
Token ID326
Feature activation+1.92e-01

Pos logit contributions
His+0.107
 his+0.101
P+0.101
 helmet+0.094
 good+0.089
Neg logit contributions
 amaz-0.196
 spec-0.185
 angles-0.183
lee-0.169
 awe-0.167
Token they
Token ID484
Feature activation+3.33e-03

Pos logit contributions
 amaz+0.335
lee+0.323
 spec+0.303
 angles+0.303
 ago+0.297
Neg logit contributions
P-0.210
His-0.193
 his-0.173
 helmet-0.171
ercise-0.169
Token were
Token ID547
Feature activation+7.85e-02

Pos logit contributions
 amaz+0.006
lee+0.005
 spec+0.005
 angles+0.005
 mesmer+0.005
Neg logit contributions
P-0.004
His-0.003
w-0.003
ercise-0.003
OK-0.003
Tokenila
Token ID10102
Feature activation+6.70e-02

Pos logit contributions
 amaz+0.130
lee+0.127
od+0.118
 spec+0.115
 ago+0.115
Neg logit contributions
P-0.163
His-0.158
ercise-0.146
 helmet-0.143
Spot-0.139
Token and
Token ID290
Feature activation+2.71e-02

Pos logit contributions
 amaz+0.099
lee+0.095
 ago+0.090
od+0.089
 spec+0.088
Neg logit contributions
P-0.094
His-0.087
ercise-0.080
 helmet-0.076
OK-0.075
Token said
Token ID531
Feature activation-7.95e-04

Pos logit contributions
 amaz+0.043
lee+0.040
 spec+0.037
od+0.037
 ago+0.037
Neg logit contributions
P-0.036
His-0.033
ercise-0.031
 helmet-0.030
Spot-0.030
Token,
Token ID11
Feature activation-8.93e-02

Pos logit contributions
P+0.001
ercise+0.001
Wh+0.001
Spot+0.001
Mid+0.001
Neg logit contributions
od-0.001
lee-0.001
 amaz-0.001
 ago-0.001
 of-0.001
Token "
Token ID366
Feature activation-4.42e-03

Pos logit contributions
P+0.153
w+0.133
ercise+0.129
His+0.126
ww+0.122
Neg logit contributions
 amaz-0.115
lee-0.110
 ago-0.106
 mesmer-0.103
od-0.099
TokenHi
Token ID17250
Feature activation-2.18e-01

Pos logit contributions
 boots+0.005
 helmet+0.005
 okay+0.005
gh+0.005
 chores+0.005
Neg logit contributions
 amaz-0.006
 spec-0.006
od-0.006
leaf-0.006
 rooft-0.005
Token,
Token ID11
Feature activation-9.23e-02

Pos logit contributions
P+0.314
His+0.312
ercise+0.301
Br+0.278
Mid+0.278
Neg logit contributions
 amaz-0.303
od-0.297
lee-0.286
 spec-0.282
 ago-0.272
Token L
Token ID406
Feature activation+9.64e-02

Pos logit contributions
P+0.113
His+0.110
 helmet+0.100
ercise+0.098
Spot+0.094
Neg logit contributions
 amaz-0.147
lee-0.142
 spec-0.132
 angles-0.131
 mesmer-0.131
Tokenila
Token ID10102
Feature activation+5.92e-02

Pos logit contributions
 amaz+0.147
lee+0.132
 spec+0.128
 mesmer+0.126
 angles+0.126
Neg logit contributions
P-0.139
His-0.129
w-0.114
 helmet-0.114
ercise-0.113
Token.
Token ID13
Feature activation+9.58e-03

Pos logit contributions
 amaz+0.079
lee+0.078
od+0.073
 ago+0.073
 spec+0.072
Neg logit contributions
P-0.087
His-0.082
ercise-0.077
Move-0.073
 helmet-0.072
Token Do
Token ID2141
Feature activation+2.37e-02

Pos logit contributions
 amaz+0.012
 ago+0.012
 spec+0.011
lee+0.011
od+0.011
Neg logit contributions
P-0.015
His-0.014
ercise-0.013
w-0.013
 helmet-0.013
Token proudly
Token ID23017
Feature activation+2.85e-03

Pos logit contributions
P+0.235
His+0.217
ercise+0.211
w+0.201
Spot+0.201
Neg logit contributions
 amaz-0.248
lee-0.242
 mesmer-0.218
od-0.213
 spec-0.209
Token displayed
Token ID9066
Feature activation-3.08e-02

Pos logit contributions
lee+0.004
 amaz+0.004
od+0.003
 mesmer+0.003
 ago+0.003
Neg logit contributions
P-0.005
Spot-0.004
His-0.004
w-0.004
Br-0.004
Token the
Token ID262
Feature activation+3.12e-02

Pos logit contributions
P+0.044
His+0.038
ercise+0.037
w+0.036
Spot+0.036
Neg logit contributions
 amaz-0.048
lee-0.044
 angles-0.042
 awe-0.040
 mesmer-0.040
Token tape
Token ID9154
Feature activation-8.42e-02

Pos logit contributions
 amaz+0.049
 spec+0.043
lee+0.043
 ago+0.043
 angles+0.043
Neg logit contributions
P-0.042
His-0.039
ercise-0.034
Spot-0.033
Mid-0.033
Token to
Token ID284
Feature activation-4.04e-02

Pos logit contributions
P+0.101
His+0.093
ercise+0.084
Spot+0.079
 helmet+0.078
Neg logit contributions
 amaz-0.142
lee-0.134
od-0.128
 ago-0.128
 spec-0.127
Token his
Token ID465
Feature activation-2.89e-01

Pos logit contributions
His+0.033
P+0.032
 helmet+0.027
 his+0.027
ercise+0.025
Neg logit contributions
 amaz-0.080
lee-0.078
 spec-0.076
 ago-0.075
od-0.074
Token mom
Token ID1995
Feature activation-4.36e-02

Pos logit contributions
P+0.373
His+0.355
ercise+0.320
 helmet+0.316
Spot+0.303
Neg logit contributions
 amaz-0.450
lee-0.430
 spec-0.404
od-0.401
 ago-0.399
Token.
Token ID13
Feature activation-3.10e-02

Pos logit contributions
P+0.054
His+0.052
ercise+0.047
 helmet+0.046
Spot+0.044
Neg logit contributions
 amaz-0.069
lee-0.066
 spec-0.062
od-0.062
 ago-0.061
Token But
Token ID887
Feature activation-9.29e-02

Pos logit contributions
P+0.047
w+0.044
use+0.039
ercise+0.038
uch+0.038
Neg logit contributions
 amaz-0.046
lee-0.044
 ago-0.040
 mesmer-0.038
 spec-0.037
Token his
Token ID465
Feature activation-2.92e-01

Pos logit contributions
P+0.159
use+0.135
ww+0.129
ercise+0.129
OK+0.128
Neg logit contributions
 amaz-0.125
 ago-0.104
lee-0.101
 mesmer-0.099
 of-0.097
Token mom
Token ID1995
Feature activation-4.64e-02

Pos logit contributions
P+0.339
His+0.329
 helmet+0.305
ercise+0.293
Spot+0.278
Neg logit contributions
 amaz-0.468
lee-0.459
od-0.440
 spec-0.436
 ago-0.430
Token.
Token ID13
Feature activation-3.27e-02

Pos logit contributions
 amaz+0.080
lee+0.077
 spec+0.072
od+0.072
 ago+0.071
Neg logit contributions
P-0.071
His-0.067
ercise-0.061
 helmet-0.060
Spot-0.058
Token One
Token ID1881
Feature activation-1.67e-01

Pos logit contributions
P+0.037
His+0.033
w+0.031
ercise+0.030
 helmet+0.029
Neg logit contributions
 amaz-0.056
lee-0.055
 ago-0.051
 spec-0.051
od-0.050
Token day
Token ID1110
Feature activation-7.50e-02

Pos logit contributions
P+0.184
His+0.173
ercise+0.152
 helmet+0.149
Spot+0.144
Neg logit contributions
 amaz-0.294
lee-0.283
 spec-0.268
od-0.265
 ago-0.265
Token,
Token ID11
Feature activation-1.41e-01

Pos logit contributions
P+0.096
ercise+0.094
His+0.094
 helmet+0.089
OK+0.084
Neg logit contributions
lee-0.102
 amaz-0.100
 spec-0.100
 ago-0.099
od-0.096
Token she
Token ID673
Feature activation-7.13e-02

Pos logit contributions
P+0.194
ercise+0.178
His+0.178
erve+0.166
 helmet+0.163
Neg logit contributions
 amaz-0.190
 spec-0.181
lee-0.179
 ago-0.167
od-0.167
Token went
Token ID1816
Feature activation-1.72e-01

Pos logit contributions
P+0.077
His+0.077
ercise+0.069
 helmet+0.066
Spot+0.062
Neg logit contributions
 amaz-0.122
lee-0.119
 spec-0.111
 ago-0.110
od-0.110
Token to
Token ID284
Feature activation-4.96e-02

Pos logit contributions
P+0.250
His+0.247
OK+0.225
ercise+0.222
w+0.222
Neg logit contributions
lee-0.213
 amaz-0.205
 ago-0.194
 mesmer-0.188
od-0.186
Token her
Token ID607
Feature activation-1.58e-01

Pos logit contributions
P+0.055
His+0.052
 his+0.049
 helmet+0.047
ercise+0.046
Neg logit contributions
 amaz-0.084
lee-0.083
 angles-0.079
od-0.079
 spec-0.079
Token friend
Token ID1545
Feature activation-1.25e-01

Pos logit contributions
P+0.168
 helmet+0.150
His+0.149
ercise+0.135
Spot+0.133
Neg logit contributions
lee-0.272
 amaz-0.271
od-0.268
 ago-0.264
 spec-0.264
Token Tim
Token ID5045
Feature activation-1.42e-01

Pos logit contributions
P+0.172
His+0.166
OK+0.161
ercise+0.160
w+0.158
Neg logit contributions
lee-0.170
 amaz-0.164
od-0.163
 ago-0.154
 spec-0.148
Tokenmy
Token ID1820
Feature activation-1.84e-01

Pos logit contributions
P+0.066
His+0.049
ercise+0.032
 his+0.031
use+0.029
Neg logit contributions
 amaz-0.358
lee-0.334
 spec-0.333
 mesmer-0.331
 angles-0.323
Token the
Token ID262
Feature activation+1.73e-02

Pos logit contributions
P+0.040
His+0.038
ercise+0.035
 helmet+0.034
Spot+0.033
Neg logit contributions
 amaz-0.046
lee-0.044
 spec-0.041
 ago-0.041
od-0.041
Token house
Token ID2156
Feature activation+3.25e-03

Pos logit contributions
 amaz+0.024
lee+0.023
 ago+0.021
 spec+0.021
od+0.021
Neg logit contributions
P-0.025
His-0.024
ercise-0.022
 helmet-0.022
Spot-0.021
Token.
Token ID13
Feature activation-3.76e-02

Pos logit contributions
 amaz+0.005
lee+0.005
 spec+0.005
od+0.005
 angles+0.005
Neg logit contributions
P-0.004
His-0.004
ercise-0.003
 helmet-0.003
Spot-0.003
Token
Token ID198
Feature activation-3.39e-02

Pos logit contributions
P+0.046
His+0.043
ercise+0.039
 helmet+0.038
Spot+0.037
Neg logit contributions
 amaz-0.061
lee-0.059
 spec-0.055
od-0.055
 ago-0.055
Token
Token ID198
Feature activation-3.47e-02

Pos logit contributions
His+0.034
P+0.033
Spot+0.027
O+0.026
One+0.025
Neg logit contributions
 amaz-0.067
lee-0.063
 spec-0.062
 angles-0.060
 ago-0.059
TokenLater
Token ID18602
Feature activation-1.85e-01

Pos logit contributions
P+0.036
His+0.032
ercise+0.031
 helmet+0.031
use+0.028
Neg logit contributions
 amaz-0.060
lee-0.056
od-0.055
 mesmer-0.055
 angles-0.054
Token,
Token ID11
Feature activation-1.45e-01

Pos logit contributions
P+0.275
His+0.269
ercise+0.248
 helmet+0.241
Spot+0.234
Neg logit contributions
 amaz-0.248
lee-0.238
 ago-0.218
od-0.218
 spec-0.217
Token Ann
Token ID5506
Feature activation+6.84e-03

Pos logit contributions
P+0.228
His+0.211
ercise+0.200
 helmet+0.192
use+0.187
Neg logit contributions
 amaz-0.191
 spec-0.171
lee-0.171
od-0.166
 mesmer-0.165
Token saw
Token ID2497
Feature activation-1.13e-01

Pos logit contributions
lee+0.010
 amaz+0.010
 ago+0.009
od+0.009
 spec+0.009
Neg logit contributions
His-0.009
P-0.009
ercise-0.008
 helmet-0.008
Br-0.008
Token her
Token ID607
Feature activation-1.64e-01

Pos logit contributions
P+0.101
His+0.095
ercise+0.080
 helmet+0.079
Spot+0.075
Neg logit contributions
 amaz-0.220
lee-0.213
 spec-0.204
 ago-0.204
 angles-0.202
Token friend
Token ID1545
Feature activation-1.30e-01

Pos logit contributions
P+0.156
 helmet+0.142
His+0.141
ercise+0.129
Spot+0.126
Neg logit contributions
 amaz-0.295
lee-0.289
 spec-0.282
 angles-0.278
od-0.272
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
 amaz+0.012
lee+0.012
 spec+0.011
 angles+0.011
 ago+0.011
Neg logit contributions
P-0.010
His-0.010
 helmet-0.008
 his-0.008
ercise-0.008
Token and
Token ID290
Feature activation+2.19e-03

Pos logit contributions
P+0.139
His+0.133
ercise+0.119
 helmet+0.118
Spot+0.115
Neg logit contributions
 amaz-0.176
lee-0.169
 spec-0.159
od-0.158
 ago-0.158
Token the
Token ID262
Feature activation+4.91e-02

Pos logit contributions
 amaz+0.004
 ago+0.004
lee+0.004
od+0.004
 angles+0.004
Neg logit contributions
P-0.002
His-0.002
Spot-0.002
 his-0.002
OK-0.002
Token pond
Token ID16723
Feature activation+2.72e-02

Pos logit contributions
 amaz+0.078
lee+0.074
 spec+0.070
 ago+0.069
 angles+0.068
Neg logit contributions
P-0.063
His-0.061
ercise-0.055
 helmet-0.053
w-0.051
Token was
Token ID373
Feature activation-4.23e-02

Pos logit contributions
 amaz+0.041
lee+0.039
 spec+0.036
od+0.036
 mesmer+0.035
Neg logit contributions
P-0.039
His-0.036
ercise-0.033
 helmet-0.032
w-0.032
Token back
Token ID736
Feature activation-1.75e-01

Pos logit contributions
P+0.061
His+0.058
ercise+0.053
 helmet+0.052
Spot+0.050
Neg logit contributions
 amaz-0.061
lee-0.057
 spec-0.053
 ago-0.053
od-0.053
Token to
Token ID284
Feature activation-2.58e-02

Pos logit contributions
P+0.265
His+0.245
ercise+0.237
Move+0.233
O+0.225
Neg logit contributions
lee-0.249
 amaz-0.242
 mesmer-0.226
 ago-0.224
od-0.218
Token normal
Token ID3487
Feature activation-1.26e-01

Pos logit contributions
P+0.035
His+0.033
ercise+0.031
 helmet+0.030
Spot+0.029
Neg logit contributions
 amaz-0.039
lee-0.037
 spec-0.034
 ago-0.034
od-0.034
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
P+0.170
His+0.160
ercise+0.146
 helmet+0.140
Spot+0.137
Neg logit contributions
 amaz-0.199
lee-0.188
 spec-0.178
 ago-0.175
od-0.175
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
P+0.013
His+0.013
Spot+0.012
Mid+0.011
O+0.010
Neg logit contributions
 amaz-0.025
lee-0.023
 spec-0.023
 angles-0.023
 ago-0.023
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
P+0.013
His+0.013
Spot+0.011
O+0.011
ercise+0.010
Neg logit contributions
 amaz-0.019
 spec-0.018
lee-0.018
 ago-0.017
 angles-0.017