TOP ACTIVATIONS
MAX = 1.649

Thomas ites such as De vere Allen who followed the lead
into thin slices and the broccoli into small pe aces (
Meyer , and Kyle Wh itting ham ) figure in the
and fully integrates the Sor cer discovery interface offered separately with
, London . All Rights Reserved . Rep rint ed with
the 10 MM bolts , lift up the entire shif ter
AM / THE NEWS J OUR N AL ) Buy Photo
version | count | | -------------------- -+ -------- -+ ------- |
Associated Press . All Rights Reserved . This material may not
. <|endoftext|> WH EN Catherine Mull igan , a
alternating layers of zinc , br ine - so aked past
to the passenger side and lift up the e - bra
Act on ) In an unpublished article written around May of
the pile would squeeze the br ine out of the past
SM ISSION /// ---------------------------------------------------------------- ---------------------------------------------------------------- ------------------------ -- /// L ett re
s not impe ached within the next year -
wing , further reducing its lifting capability . The F 8
3 or 10 mg / kg ), but not flu ox
, please download the W off ord Award nomination packet ,
played and handled Sunday 's spat . " Team

MIN ACTIVATIONS
MIN = -1.914

The situation may rec ede or deterior ate and lead
When cal ving fronts rec ede to a point that we
recognition as his highly paid linem ates David De Cast ro
knew that if we fl unk ed out of school or
now spend several days rec uper ating . Friends are praying
memories ... My ster ies of a human turned
. youtube . com / channel / UC _ K k
was important for their rec uper ation . D
home . L av au 's family began searching
his judicial robe and pl unk ing himself down in Anton
ob sh av n rec eded more than 40 kilometers (
10 : 47 ] < abal icious s > lol m
. The new species is named the Wil kins
joke , and the steep incl ines can be treacherous with
is another leak from 4 chan , which has been on
. Not eworthy : Under pass Park ( pictured ); the
Territory are genetically distinct species . The scientists
be seen whether the C UP will continue to support Pu
Cont rad ict ing the sense of
: 47 ] < c erv ant as > lol m

INTERVAL 0.759 - 1.649
CONTAINS 0.527%

. Chase , proposing a bronze cent and two - cent
) in June 2012 E BS CO has partnered with many
the low orbit of the space station as safely as possible
of beginners . We have considered that students in this doctrine
would , and increasingly look forward to going home . Others

INTERVAL -0.132 - 0.759
CONTAINS 75.330%

in demand from investors . Of course
, the NFL PA didn 't have to build in a
children , often in the form of compounds consisting of two
, only the window - local options from the model ine
while the Apple is being " cool ". I believe that

INTERVAL -1.023 - -0.132
CONTAINS 24.079%

These include UNESCO , the United Nations Development Programme ( UN
rance , characterized clinically by multiple , tiny , punct ate
." Video : CH I @ D ET :
: Le a Suzuki , The Chronicle 2 of 15 Photo
. Weapon person loaded , initially failing , sec ures mark

INTERVAL -1.914 - -1.023
CONTAINS 0.064%

ée 33 01 , un grou pe de s é par
[ 22 ] Literally , R ashi says ,
. Local Basic Income Basic income in
| ISI P el leg r ini , A
memories ... My ster ies of a human turned
Token Thomas
Token ID5658
Feature activation+5.40e-03

Pos logit contributions
leys+0.807
mingham+0.792
 VIDE+0.782
\/\/+0.753
arate+0.752
Neg logit contributions
uv-0.747
ourning-0.716
ylan-0.686
 goose-0.617
iber-0.593
Tokenites
Token ID2737
Feature activation+1.13e-01

Pos logit contributions
leys+0.010
mingham+0.010
 VIDE+0.010
\/\/+0.010
arate+0.010
Neg logit contributions
uv-0.009
ourning-0.009
ylan-0.008
 goose-0.008
iber-0.007
Token such
Token ID884
Feature activation-1.44e-01

Pos logit contributions
leys+0.200
mingham+0.197
 VIDE+0.193
arate+0.187
\/\/+0.186
Neg logit contributions
uv-0.208
ourning-0.199
ylan-0.192
 goose-0.174
iber-0.167
Token as
Token ID355
Feature activation+1.85e-01

Pos logit contributions
uv+0.276
ourning+0.266
ylan+0.255
 goose+0.232
iber+0.225
Neg logit contributions
leys-0.243
mingham-0.239
 VIDE-0.235
\/\/-0.226
arate-0.225
Token De
Token ID1024
Feature activation+5.30e-01

Pos logit contributions
leys+0.362
mingham+0.355
 VIDE+0.351
\/\/+0.339
arate+0.338
Neg logit contributions
uv-0.308
ourning-0.294
ylan-0.281
 goose-0.251
iber-0.241
Tokenvere
Token ID4119
Feature activation+1.65e+00

Pos logit contributions
leys+0.957
mingham+0.939
 VIDE+0.922
arate+0.891
\/\/+0.888
Neg logit contributions
uv-0.952
ourning-0.916
ylan-0.871
 goose-0.800
 weeping-0.766
Token Allen
Token ID9659
Feature activation-4.75e-01

Pos logit contributions
leys+3.412
mingham+3.331
 VIDE+3.239
arate+3.188
\/\/+3.133
Neg logit contributions
uv-2.515
ourning-2.415
ylan-2.291
 goose-2.057
 weeping-1.971
Token who
Token ID508
Feature activation+3.48e-01

Pos logit contributions
uv+0.887
ourning+0.850
ylan+0.817
 goose+0.744
iber+0.718
Neg logit contributions
leys-0.830
mingham-0.817
 VIDE-0.806
\/\/-0.773
arate-0.772
Token followed
Token ID3940
Feature activation+1.55e-01

Pos logit contributions
leys+0.689
mingham+0.678
 VIDE+0.668
arate+0.646
\/\/+0.644
Neg logit contributions
uv-0.566
ourning-0.541
ylan-0.519
 goose-0.459
iber-0.441
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
leys+0.300
mingham+0.296
 VIDE+0.293
\/\/+0.282
arate+0.281
Neg logit contributions
uv-0.261
ourning-0.249
ylan-0.239
 goose-0.214
iber-0.206
Token lead
Token ID1085
Feature activation-1.86e-01

Pos logit contributions
leys+0.266
mingham+0.262
 VIDE+0.259
\/\/+0.248
arate+0.248
Neg logit contributions
uv-0.274
ourning-0.262
ylan-0.252
 goose-0.228
iber-0.220
Token into
Token ID656
Feature activation+7.50e-02

Pos logit contributions
leys+2.214
mingham+2.145
 VIDE+2.126
arate+2.071
\/\/+2.026
Neg logit contributions
uv-2.329
ourning-2.276
ylan-2.175
 goose-1.911
iber-1.887
Token thin
Token ID7888
Feature activation+2.35e-01

Pos logit contributions
leys+0.142
mingham+0.140
 VIDE+0.138
arate+0.133
\/\/+0.133
Neg logit contributions
uv-0.129
ourning-0.124
ylan-0.118
 goose-0.106
iber-0.103
Token slices
Token ID24314
Feature activation+2.70e-01

Pos logit contributions
leys+0.430
mingham+0.421
 VIDE+0.416
arate+0.401
\/\/+0.401
Neg logit contributions
uv-0.419
ourning-0.401
ylan-0.384
 goose-0.346
iber-0.335
Token and
Token ID290
Feature activation-6.03e-02

Pos logit contributions
leys+0.452
mingham+0.443
 VIDE+0.438
arate+0.420
\/\/+0.419
Neg logit contributions
uv-0.522
ourning-0.501
ylan-0.483
 goose-0.439
iber-0.426
Token the
Token ID262
Feature activation+4.07e-02

Pos logit contributions
uv+0.110
ourning+0.106
ylan+0.101
 goose+0.092
iber+0.089
Neg logit contributions
leys-0.107
mingham-0.106
 VIDE-0.104
\/\/-0.100
arate-0.100
Token broccoli
Token ID44653
Feature activation+1.59e+00

Pos logit contributions
leys+0.074
mingham+0.073
 VIDE+0.072
\/\/+0.069
arate+0.069
Neg logit contributions
uv-0.073
ourning-0.070
ylan-0.067
 goose-0.060
iber-0.058
Token into
Token ID656
Feature activation+8.18e-03

Pos logit contributions
leys+2.809
mingham+2.693
 VIDE+2.650
arate+2.615
\/\/+2.534
Neg logit contributions
uv-2.912
ourning-2.845
ylan-2.660
 goose-2.334
iber-2.327
Token small
Token ID1402
Feature activation+3.22e-01

Pos logit contributions
leys+0.015
mingham+0.015
 VIDE+0.015
arate+0.014
\/\/+0.014
Neg logit contributions
uv-0.014
ourning-0.014
ylan-0.013
 goose-0.012
iber-0.011
Token pe
Token ID613
Feature activation+6.87e-01

Pos logit contributions
leys+0.606
mingham+0.594
 VIDE+0.587
arate+0.566
\/\/+0.566
Neg logit contributions
uv-0.557
ourning-0.533
ylan-0.510
 goose-0.455
iber-0.442
Tokenaces
Token ID2114
Feature activation+4.18e-02

Pos logit contributions
leys+1.088
mingham+1.056
 VIDE+1.035
arate+1.002
\/\/+0.992
Neg logit contributions
uv-1.400
ourning-1.356
ylan-1.298
 goose-1.194
 weeping-1.157
Token (
Token ID357
Feature activation+1.42e-01

Pos logit contributions
leys+0.071
mingham+0.069
 VIDE+0.068
arate+0.066
\/\/+0.065
Neg logit contributions
uv-0.081
ourning-0.077
ylan-0.075
 goose-0.068
iber-0.066
Token Meyer
Token ID23975
Feature activation-1.36e-02

Pos logit contributions
uv+0.571
ourning+0.543
ylan+0.517
 goose+0.459
iber+0.438
Neg logit contributions
leys-0.775
mingham-0.765
 VIDE-0.757
arate-0.731
\/\/-0.728
Token,
Token ID11
Feature activation+3.52e-02

Pos logit contributions
uv+0.023
ourning+0.022
ylan+0.021
 goose+0.019
iber+0.018
Neg logit contributions
leys-0.026
mingham-0.026
 VIDE-0.025
\/\/-0.024
arate-0.024
Token and
Token ID290
Feature activation+1.91e-02

Pos logit contributions
leys+0.063
mingham+0.062
 VIDE+0.061
\/\/+0.058
arate+0.058
Neg logit contributions
uv-0.065
ourning-0.062
ylan-0.059
 goose-0.054
iber-0.052
Token Kyle
Token ID14316
Feature activation+7.11e-02

Pos logit contributions
leys+0.036
mingham+0.035
 VIDE+0.035
\/\/+0.034
arate+0.034
Neg logit contributions
uv-0.033
ourning-0.032
ylan-0.030
 goose-0.027
iber-0.026
Token Wh
Token ID854
Feature activation+3.10e-01

Pos logit contributions
leys+0.132
mingham+0.130
 VIDE+0.129
\/\/+0.124
arate+0.124
Neg logit contributions
uv-0.125
ourning-0.119
ylan-0.114
 goose-0.103
iber-0.099
Tokenitting
Token ID2535
Feature activation+1.54e+00

Pos logit contributions
leys+0.470
mingham+0.462
 VIDE+0.456
\/\/+0.433
arate+0.432
Neg logit contributions
uv-0.651
ourning-0.627
ylan-0.606
 goose-0.557
iber-0.541
Tokenham
Token ID2763
Feature activation-5.09e-03

Pos logit contributions
leys+2.013
mingham+1.936
 VIDE+1.843
arate+1.791
\/\/+1.752
Neg logit contributions
uv-3.554
ourning-3.444
ylan-3.352
 goose-3.081
 weeping-3.018
Token)
Token ID8
Feature activation-1.92e-01

Pos logit contributions
uv+0.009
ourning+0.009
ylan+0.008
 goose+0.007
iber+0.007
Neg logit contributions
leys-0.009
mingham-0.009
 VIDE-0.009
\/\/-0.009
arate-0.009
Token figure
Token ID3785
Feature activation-5.95e-01

Pos logit contributions
uv+0.345
ourning+0.328
ylan+0.316
 goose+0.288
iber+0.278
Neg logit contributions
leys-0.348
mingham-0.343
 VIDE-0.339
\/\/-0.326
arate-0.325
Token in
Token ID287
Feature activation+8.96e-02

Pos logit contributions
uv+1.041
ourning+0.995
ylan+0.954
 goose+0.861
iber+0.830
Neg logit contributions
leys-1.107
mingham-1.092
 VIDE-1.079
\/\/-1.036
arate-1.036
Token the
Token ID262
Feature activation+1.14e-01

Pos logit contributions
leys+0.178
mingham+0.176
 VIDE+0.174
\/\/+0.167
arate+0.167
Neg logit contributions
uv-0.145
ourning-0.139
ylan-0.132
 goose-0.118
iber-0.113
Token and
Token ID290
Feature activation+1.06e-01

Pos logit contributions
uv+0.055
ourning+0.053
ylan+0.051
 goose+0.047
iber+0.045
Neg logit contributions
leys-0.049
mingham-0.049
 VIDE-0.048
\/\/-0.046
arate-0.046
Token fully
Token ID3938
Feature activation-2.46e-01

Pos logit contributions
leys+0.200
mingham+0.198
 VIDE+0.195
\/\/+0.188
arate+0.187
Neg logit contributions
uv-0.182
ourning-0.174
ylan-0.166
 goose-0.149
iber-0.144
Token integrates
Token ID48105
Feature activation-2.36e-01

Pos logit contributions
uv+0.401
ourning+0.385
ylan+0.366
 goose+0.325
iber+0.314
Neg logit contributions
leys-0.487
mingham-0.481
 VIDE-0.476
\/\/-0.458
arate-0.457
Token the
Token ID262
Feature activation-3.46e-03

Pos logit contributions
uv+0.425
ourning+0.406
ylan+0.390
 goose+0.354
iber+0.342
Neg logit contributions
leys-0.427
mingham-0.421
 VIDE-0.416
\/\/-0.399
arate-0.399
Token Sor
Token ID15423
Feature activation+1.39e-01

Pos logit contributions
uv+0.006
ourning+0.006
ylan+0.006
 goose+0.005
iber+0.005
Neg logit contributions
leys-0.006
mingham-0.006
 VIDE-0.006
arate-0.006
\/\/-0.006
Tokencer
Token ID2189
Feature activation+1.51e+00

Pos logit contributions
leys+0.250
mingham+0.246
 VIDE+0.241
arate+0.232
\/\/+0.231
Neg logit contributions
uv-0.252
ourning-0.243
ylan-0.233
 goose-0.212
 weeping-0.204
Token discovery
Token ID9412
Feature activation-1.16e-01

Pos logit contributions
leys+2.556
mingham+2.508
 VIDE+2.461
arate+2.380
\/\/+2.366
Neg logit contributions
uv-2.806
ourning-2.751
ylan-2.618
 goose-2.406
 weeping-2.326
Token interface
Token ID7071
Feature activation-2.54e-01

Pos logit contributions
uv+0.226
ourning+0.217
ylan+0.209
 goose+0.191
iber+0.185
Neg logit contributions
leys-0.194
mingham-0.191
 VIDE-0.189
\/\/-0.180
arate-0.180
Token offered
Token ID4438
Feature activation+3.03e-01

Pos logit contributions
uv+0.468
ourning+0.447
ylan+0.430
 goose+0.391
iber+0.378
Neg logit contributions
leys-0.450
mingham-0.443
 VIDE-0.438
\/\/-0.420
arate-0.419
Token separately
Token ID13869
Feature activation+1.01e-01

Pos logit contributions
leys+0.568
mingham+0.561
 VIDE+0.554
\/\/+0.532
arate+0.531
Neg logit contributions
uv-0.525
ourning-0.503
ylan-0.482
 goose-0.434
iber-0.417
Token with
Token ID351
Feature activation+1.56e-01

Pos logit contributions
leys+0.192
mingham+0.189
 VIDE+0.187
arate+0.180
\/\/+0.180
Neg logit contributions
uv-0.172
ourning-0.164
ylan-0.157
 goose-0.142
iber-0.136
Token,
Token ID11
Feature activation+1.95e-01

Pos logit contributions
uv+0.544
ourning+0.522
ylan+0.502
 goose+0.459
iber+0.443
Neg logit contributions
leys-0.503
mingham-0.492
 VIDE-0.484
\/\/-0.464
arate-0.463
Token London
Token ID3576
Feature activation+2.01e-01

Pos logit contributions
leys+0.378
mingham+0.373
 VIDE+0.369
arate+0.355
\/\/+0.355
Neg logit contributions
uv-0.328
ourning-0.312
ylan-0.298
 goose-0.268
iber-0.258
Token.
Token ID13
Feature activation-7.41e-02

Pos logit contributions
leys+0.349
mingham+0.342
 VIDE+0.338
\/\/+0.325
arate+0.323
Neg logit contributions
uv-0.378
ourning-0.360
ylan-0.351
 goose-0.320
iber-0.310
Token All
Token ID1439
Feature activation-1.78e-01

Pos logit contributions
uv+0.122
ourning+0.116
ylan+0.111
 goose+0.099
iber+0.096
Neg logit contributions
leys-0.146
mingham-0.144
 VIDE-0.142
arate-0.137
\/\/-0.137
Token Rights
Token ID6923
Feature activation-3.79e-01

Pos logit contributions
uv+0.373
ourning+0.356
ylan+0.343
 goose+0.316
iber+0.308
Neg logit contributions
leys-0.274
mingham-0.269
 VIDE-0.266
\/\/-0.253
arate-0.252
Token Reserved
Token ID33876
Feature activation+1.45e+00

Pos logit contributions
uv+0.891
ourning+0.857
ylan+0.835
 goose+0.776
iber+0.758
Neg logit contributions
leys-0.475
mingham-0.468
 VIDE-0.459
\/\/-0.430
arate-0.426
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
leys+2.369
mingham+2.344
 VIDE+2.298
arate+2.205
\/\/+2.195
Neg logit contributions
uv-2.835
ourning-2.733
ylan-2.601
 goose-2.384
 weeping-2.309
Token Rep
Token ID1432
Feature activation-2.89e-01

Pos logit contributions
uv+0.003
ourning+0.003
ylan+0.003
 goose+0.002
iber+0.002
Neg logit contributions
leys-0.004
mingham-0.004
 VIDE-0.004
\/\/-0.004
arate-0.004
Tokenrint
Token ID22272
Feature activation+2.97e-01

Pos logit contributions
uv+0.599
ourning+0.573
ylan+0.553
 goose+0.506
iber+0.496
Neg logit contributions
leys-0.445
mingham-0.438
 VIDE-0.435
\/\/-0.411
arate-0.407
Tokened
Token ID276
Feature activation-3.39e-01

Pos logit contributions
leys+0.628
mingham+0.621
 VIDE+0.612
\/\/+0.594
arate+0.593
Neg logit contributions
uv-0.441
ourning-0.422
ylan-0.400
 goose-0.352
 weeping-0.335
Token with
Token ID351
Feature activation+1.35e-01

Pos logit contributions
uv+0.608
ourning+0.579
ylan+0.557
 goose+0.507
iber+0.493
Neg logit contributions
leys-0.614
mingham-0.604
 VIDE-0.601
\/\/-0.573
arate-0.573
Token the
Token ID262
Feature activation+2.07e-01

Pos logit contributions
leys+0.793
mingham+0.783
 VIDE+0.774
arate+0.747
\/\/+0.747
Neg logit contributions
uv-0.614
ourning-0.584
ylan-0.556
 goose-0.494
iber-0.476
Token 10
Token ID838
Feature activation+5.53e-01

Pos logit contributions
leys+0.371
mingham+0.365
 VIDE+0.361
\/\/+0.346
arate+0.345
Neg logit contributions
uv-0.377
ourning-0.361
ylan-0.346
 goose-0.313
iber-0.303
TokenMM
Token ID12038
Feature activation+4.07e-01

Pos logit contributions
leys+1.057
mingham+1.043
 VIDE+1.031
\/\/+0.992
arate+0.991
Neg logit contributions
uv-0.937
ourning-0.895
ylan-0.857
 goose-0.771
iber-0.741
Token bolts
Token ID24667
Feature activation+7.94e-01

Pos logit contributions
leys+0.562
mingham+0.549
 VIDE+0.541
arate+0.513
\/\/+0.512
Neg logit contributions
uv-0.909
ourning-0.879
ylan-0.849
 goose-0.784
iber-0.765
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
leys+1.454
mingham+1.423
 VIDE+1.407
\/\/+1.356
arate+1.351
Neg logit contributions
uv-1.413
ourning-1.350
ylan-1.300
 goose-1.168
iber-1.131
Token lift
Token ID10303
Feature activation+1.42e+00

Pos logit contributions
leys+0.257
mingham+0.253
 VIDE+0.250
\/\/+0.240
arate+0.239
Neg logit contributions
uv-0.260
ourning-0.248
ylan-0.239
 goose-0.217
iber-0.209
Token up
Token ID510
Feature activation+2.93e-01

Pos logit contributions
leys+2.739
mingham+2.673
 VIDE+2.649
arate+2.566
\/\/+2.557
Neg logit contributions
uv-2.384
ourning-2.308
ylan-2.168
 goose-1.935
iber-1.888
Token the
Token ID262
Feature activation+1.32e-01

Pos logit contributions
leys+0.608
mingham+0.601
 VIDE+0.595
\/\/+0.574
arate+0.573
Neg logit contributions
uv-0.450
ourning-0.427
ylan-0.407
 goose-0.362
iber-0.346
Token entire
Token ID2104
Feature activation-2.82e-01

Pos logit contributions
leys+0.231
mingham+0.227
 VIDE+0.224
\/\/+0.215
arate+0.215
Neg logit contributions
uv-0.245
ourning-0.235
ylan-0.226
 goose-0.205
iber-0.199
Token shif
Token ID7143
Feature activation+1.27e-01

Pos logit contributions
uv+0.550
ourning+0.525
ylan+0.509
 goose+0.467
 weeping+0.449
Neg logit contributions
leys-0.469
mingham-0.464
 VIDE-0.456
arate-0.436
\/\/-0.436
Tokenter
Token ID353
Feature activation+1.62e-01

Pos logit contributions
leys+0.313
mingham+0.309
 VIDE+0.305
arate+0.299
\/\/+0.296
Neg logit contributions
uv-0.146
ourning-0.138
ylan-0.128
 goose-0.109
 weeping-0.103
TokenAM
Token ID2390
Feature activation+4.86e-01

Pos logit contributions
leys+0.889
 VIDE+0.881
mingham+0.860
arate+0.832
\/\/+0.830
Neg logit contributions
uv-0.964
ourning-0.926
ylan-0.894
 goose-0.817
 weeping-0.788
Token/
Token ID14
Feature activation-2.93e-02

Pos logit contributions
leys+0.743
mingham+0.732
 VIDE+0.712
arate+0.685
\/\/+0.682
Neg logit contributions
uv-1.012
ourning-0.971
ylan-0.942
 goose-0.866
iber-0.842
TokenTHE
Token ID10970
Feature activation-1.83e-01

Pos logit contributions
uv+0.063
ourning+0.061
ylan+0.059
 goose+0.055
iber+0.053
Neg logit contributions
leys-0.042
mingham-0.042
 VIDE-0.041
\/\/-0.039
arate-0.039
Token NEWS
Token ID28840
Feature activation-5.27e-02

Pos logit contributions
uv+0.315
ourning+0.299
ylan+0.288
 goose+0.259
iber+0.250
Neg logit contributions
leys-0.344
mingham-0.341
 VIDE-0.333
\/\/-0.323
arate-0.323
Token J
Token ID449
Feature activation+5.58e-01

Pos logit contributions
uv+0.094
ourning+0.088
ylan+0.087
 goose+0.077
gal+0.077
Neg logit contributions
mingham-0.096
leys-0.095
\/\/-0.089
arate-0.088
 VIDE-0.088
TokenOUR
Token ID11698
Feature activation+1.41e+00

Pos logit contributions
leys+0.809
mingham+0.800
 VIDE+0.779
\/\/+0.743
arate+0.738
Neg logit contributions
uv-1.212
ourning-1.167
ylan-1.126
 goose-1.036
iber-1.012
TokenN
Token ID45
Feature activation+2.00e-01

Pos logit contributions
 VIDE+1.780
leys+1.780
mingham+1.744
arate+1.621
\/\/+1.620
Neg logit contributions
uv-3.255
ourning-3.202
ylan-3.054
 goose-2.853
 weeping-2.786
TokenAL
Token ID1847
Feature activation+4.97e-01

Pos logit contributions
leys+0.323
 VIDE+0.318
mingham+0.317
\/\/+0.301
arate+0.300
Neg logit contributions
uv-0.397
ourning-0.386
ylan-0.369
 goose-0.339
iber-0.328
Token)
Token ID8
Feature activation+1.52e-01

Pos logit contributions
leys+0.897
mingham+0.884
 VIDE+0.879
\/\/+0.838
arate+0.838
Neg logit contributions
uv-0.896
ourning-0.860
ylan-0.824
 goose-0.747
iber-0.717
TokenBuy
Token ID14518
Feature activation+9.28e-01

Pos logit contributions
leys+0.148
mingham+0.145
 VIDE+0.140
arate+0.130
\/\/+0.129
Neg logit contributions
uv-0.403
ourning-0.391
ylan-0.379
 goose-0.356
iber-0.349
Token Photo
Token ID5555
Feature activation+2.75e-01

Pos logit contributions
leys+1.410
mingham+1.385
 VIDE+1.372
arate+1.304
\/\/+1.303
Neg logit contributions
uv-1.941
ourning-1.869
ylan-1.805
 goose-1.659
iber-1.612
Token version
Token ID2196
Feature activation-2.70e-01

Pos logit contributions
leys+0.450
mingham+0.444
 VIDE+0.438
\/\/+0.417
arate+0.416
Neg logit contributions
uv-0.591
ourning-0.571
ylan-0.549
 goose-0.501
iber-0.489
Token |
Token ID930
Feature activation+2.94e-01

Pos logit contributions
uv+0.637
ourning+0.614
ylan+0.597
 goose+0.557
iber+0.541
Neg logit contributions
leys-0.340
mingham-0.332
 VIDE-0.326
\/\/-0.307
arate-0.306
Token count
Token ID954
Feature activation-4.01e-01

Pos logit contributions
leys+0.515
mingham+0.508
 VIDE+0.503
\/\/+0.481
arate+0.479
Neg logit contributions
uv-0.544
ourning-0.524
ylan-0.502
 goose-0.453
iber-0.441
Token |
Token ID930
Feature activation+2.88e-01

Pos logit contributions
uv+0.938
ourning+0.905
ylan+0.879
 goose+0.819
iber+0.795
Neg logit contributions
leys-0.511
mingham-0.501
 VIDE-0.491
\/\/-0.464
arate-0.462
Token |
Token ID930
Feature activation+2.96e-01

Pos logit contributions
leys+0.497
mingham+0.490
 VIDE+0.485
\/\/+0.463
arate+0.462
Neg logit contributions
uv-0.542
ourning-0.522
ylan-0.500
 goose-0.452
iber-0.441
Token --------------------
Token ID41436
Feature activation+1.40e+00

Pos logit contributions
leys+0.491
mingham+0.484
 VIDE+0.478
\/\/+0.456
arate+0.454
Neg logit contributions
uv-0.576
ourning-0.554
ylan-0.533
 goose-0.484
iber-0.471
Token-+
Token ID19529
Feature activation+6.87e-01

Pos logit contributions
leys+2.209
mingham+2.179
 VIDE+2.156
\/\/+2.059
arate+2.052
Neg logit contributions
uv-2.786
ourning-2.694
ylan-2.589
 goose-2.367
 weeping-2.306
Token--------
Token ID982
Feature activation+8.73e-01

Pos logit contributions
leys+1.008
mingham+0.993
 VIDE+0.978
\/\/+0.929
arate+0.926
Neg logit contributions
uv-1.469
ourning-1.416
ylan-1.369
 goose-1.261
iber-1.226
Token-+
Token ID19529
Feature activation+6.16e-01

Pos logit contributions
leys+1.140
mingham+1.128
 VIDE+1.107
\/\/+1.060
arate+1.045
Neg logit contributions
uv-1.988
ourning-1.930
ylan-1.867
 goose-1.733
iber-1.685
Token-------
Token ID26866
Feature activation+1.05e+00

Pos logit contributions
leys+0.937
mingham+0.922
 VIDE+0.908
\/\/+0.865
arate+0.863
Neg logit contributions
uv-1.287
ourning-1.239
ylan-1.197
 goose-1.100
iber-1.068
Token|
Token ID91
Feature activation+2.65e-01

Pos logit contributions
leys+1.629
mingham+1.614
 VIDE+1.587
\/\/+1.527
arate+1.515
Neg logit contributions
uv-2.121
ourning-2.052
ylan-1.978
 goose-1.814
iber-1.760
Token Associated
Token ID10575
Feature activation+1.10e-01

Pos logit contributions
uv+0.629
ourning+0.604
ylan+0.583
 goose+0.535
iber+0.519
Neg logit contributions
leys-0.491
mingham-0.483
 VIDE-0.476
\/\/-0.454
arate-0.453
Token Press
Token ID4332
Feature activation+4.59e-01

Pos logit contributions
leys+0.129
mingham+0.127
 VIDE+0.123
arate+0.116
\/\/+0.115
Neg logit contributions
uv-0.267
ourning-0.258
ylan-0.251
 goose-0.234
iber-0.228
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
leys+0.785
mingham+0.775
 VIDE+0.761
\/\/+0.727
arate+0.724
Neg logit contributions
uv-0.878
ourning-0.834
ylan-0.809
 goose-0.734
iber-0.715
Token All
Token ID1439
Feature activation-2.53e-01

Pos logit contributions
uv+0.282
ourning+0.271
ylan+0.262
 goose+0.240
iber+0.233
Neg logit contributions
leys-0.219
mingham-0.216
 VIDE-0.213
\/\/-0.203
arate-0.203
Token Rights
Token ID6923
Feature activation-5.03e-01

Pos logit contributions
uv+0.586
ourning+0.561
ylan+0.545
 goose+0.508
iber+0.496
Neg logit contributions
leys-0.330
mingham-0.323
 VIDE-0.316
\/\/-0.299
arate-0.299
Token Reserved
Token ID33876
Feature activation+1.38e+00

Pos logit contributions
uv+1.173
ourning+1.124
ylan+1.098
 goose+1.019
iber+0.995
Neg logit contributions
leys-0.640
mingham-0.632
 VIDE-0.618
\/\/-0.579
arate-0.575
Token.
Token ID13
Feature activation-6.46e-04

Pos logit contributions
leys+2.079
mingham+2.036
 VIDE+2.022
arate+1.948
\/\/+1.930
Neg logit contributions
uv-2.865
ourning-2.792
ylan-2.651
 goose-2.456
 weeping-2.407
Token This
Token ID770
Feature activation-5.31e-03

Pos logit contributions
uv+0.001
ourning+0.001
ylan+0.001
 goose+0.001
iber+0.001
Neg logit contributions
leys-0.001
mingham-0.001
 VIDE-0.001
\/\/-0.001
arate-0.001
Token material
Token ID2587
Feature activation+3.87e-01

Pos logit contributions
uv+0.009
ourning+0.009
ylan+0.008
 goose+0.007
iber+0.007
Neg logit contributions
leys-0.010
mingham-0.010
 VIDE-0.010
\/\/-0.010
arate-0.010
Token may
Token ID743
Feature activation+2.27e-01

Pos logit contributions
leys+0.724
mingham+0.714
 VIDE+0.701
\/\/+0.683
arate+0.674
Neg logit contributions
uv-0.673
ourning-0.647
ylan-0.604
 goose-0.553
 weeping-0.533
Token not
Token ID407
Feature activation+1.03e-01

Pos logit contributions
leys+0.451
mingham+0.447
 VIDE+0.440
\/\/+0.427
arate+0.426
Neg logit contributions
uv-0.368
ourning-0.352
ylan-0.335
 goose-0.300
iber-0.288
Token.
Token ID13
Feature activation-5.03e-02

Pos logit contributions
uv+1.158
ourning+1.110
ylan+1.070
 goose+0.976
iber+0.944
Neg logit contributions
leys-1.027
mingham-1.013
 VIDE-0.999
\/\/-0.955
arate-0.954
Token
Token ID447
Feature activation-8.24e-03

Pos logit contributions
uv+0.079
ourning+0.075
ylan+0.072
 goose+0.064
iber+0.061
Neg logit contributions
leys-0.103
mingham-0.101
 VIDE-0.100
\/\/-0.097
arate-0.097
Token
Token ID251
Feature activation+4.98e-04

Pos logit contributions
uv+0.012
ourning+0.011
ylan+0.010
 goose+0.009
iber+0.009
Neg logit contributions
leys-0.018
mingham-0.018
 VIDE-0.018
\/\/-0.017
arate-0.017
Token<|endoftext|>
Token ID50256
Feature activation+1.43e-01

Pos logit contributions
leys+0.001
mingham+0.001
 VIDE+0.001
\/\/+0.001
arate+0.001
Neg logit contributions
uv-0.001
ourning-0.001
ylan-0.001
 goose-0.001
iber-0.001
TokenWH
Token ID12418
Feature activation+6.05e-02

Pos logit contributions
leys+0.227
mingham+0.223
 VIDE+0.220
\/\/+0.210
arate+0.210
Neg logit contributions
uv-0.288
ourning-0.278
ylan-0.268
 goose-0.246
iber-0.237
TokenEN
Token ID1677
Feature activation+1.37e+00

Pos logit contributions
leys+0.083
mingham+0.082
 VIDE+0.081
\/\/+0.076
arate+0.076
Neg logit contributions
uv-0.135
ourning-0.130
ylan-0.126
 goose-0.117
iber-0.114
Token Catherine
Token ID22578
Feature activation+3.50e-01

Pos logit contributions
leys+2.521
 VIDE+2.481
mingham+2.472
\/\/+2.347
arate+2.347
Neg logit contributions
uv-2.404
ourning-2.312
ylan-2.189
 goose-1.965
iber-1.909
Token Mull
Token ID19574
Feature activation+6.26e-02

Pos logit contributions
leys+0.675
mingham+0.661
 VIDE+0.654
arate+0.630
\/\/+0.630
Neg logit contributions
uv-0.590
ourning-0.564
ylan-0.537
 goose-0.482
iber-0.464
Tokenigan
Token ID5516
Feature activation+2.98e-02

Pos logit contributions
leys+0.102
mingham+0.100
 VIDE+0.099
\/\/+0.094
arate+0.094
Neg logit contributions
uv-0.125
ourning-0.120
ylan-0.115
 goose-0.106
iber-0.102
Token,
Token ID11
Feature activation+1.05e-01

Pos logit contributions
leys+0.059
mingham+0.057
 VIDE+0.057
\/\/+0.055
arate+0.055
Neg logit contributions
uv-0.049
ourning-0.047
ylan-0.045
 goose-0.040
iber-0.038
Token a
Token ID257
Feature activation-5.63e-02

Pos logit contributions
leys+0.204
mingham+0.201
 VIDE+0.199
\/\/+0.192
arate+0.191
Neg logit contributions
uv-0.174
ourning-0.166
ylan-0.159
 goose-0.142
iber-0.137
Token alternating
Token ID39623
Feature activation-7.30e-02

Pos logit contributions
leys+0.138
mingham+0.135
 VIDE+0.134
\/\/+0.129
arate+0.129
Neg logit contributions
uv-0.124
ourning-0.119
ylan-0.113
 goose-0.101
iber-0.098
Token layers
Token ID11685
Feature activation-3.61e-01

Pos logit contributions
uv+0.094
ourning+0.088
ylan+0.083
 goose+0.072
iber+0.068
Neg logit contributions
leys-0.170
mingham-0.168
 VIDE-0.166
\/\/-0.161
arate-0.161
Token of
Token ID286
Feature activation-1.76e-01

Pos logit contributions
uv+0.593
ourning+0.567
ylan+0.542
 goose+0.483
iber+0.467
Neg logit contributions
leys-0.709
mingham-0.699
 VIDE-0.691
arate-0.665
\/\/-0.665
Token zinc
Token ID31861
Feature activation+4.56e-02

Pos logit contributions
uv+0.317
ourning+0.303
ylan+0.291
 goose+0.263
iber+0.255
Neg logit contributions
leys-0.317
mingham-0.313
 VIDE-0.309
\/\/-0.297
arate-0.296
Token,
Token ID11
Feature activation-8.07e-02

Pos logit contributions
leys+0.073
mingham+0.072
 VIDE+0.071
arate+0.067
\/\/+0.067
Neg logit contributions
uv-0.091
ourning-0.088
ylan-0.085
 goose-0.078
iber-0.075
Token br
Token ID865
Feature activation+1.34e+00

Pos logit contributions
uv+0.154
ourning+0.148
ylan+0.142
 goose+0.129
iber+0.125
Neg logit contributions
leys-0.138
mingham-0.135
 VIDE-0.134
arate-0.128
\/\/-0.128
Tokenine
Token ID500
Feature activation-8.46e-01

Pos logit contributions
leys+2.161
mingham+2.125
 VIDE+2.042
arate+2.033
\/\/+1.975
Neg logit contributions
uv-2.631
ourning-2.579
ylan-2.436
 goose-2.269
 weeping-2.203
Token-
Token ID12
Feature activation-1.09e-01

Pos logit contributions
uv+1.506
ourning+1.423
ylan+1.396
 goose+1.254
 weeping+1.205
Neg logit contributions
leys-1.542
mingham-1.518
 VIDE-1.514
\/\/-1.459
arate-1.427
Tokenso
Token ID568
Feature activation+2.94e-01

Pos logit contributions
uv+0.197
ourning+0.189
ylan+0.181
 goose+0.165
iber+0.159
Neg logit contributions
leys-0.195
mingham-0.192
 VIDE-0.189
arate-0.182
\/\/-0.182
Tokenaked
Token ID4335
Feature activation+8.08e-01

Pos logit contributions
leys+0.499
mingham+0.488
 VIDE+0.477
arate+0.466
\/\/+0.462
Neg logit contributions
uv-0.559
ourning-0.546
ylan-0.515
 goose-0.478
 weeping-0.459
Token past
Token ID1613
Feature activation-3.77e-02

Pos logit contributions
leys+1.405
mingham+1.374
 VIDE+1.352
arate+1.306
\/\/+1.298
Neg logit contributions
uv-1.505
ourning-1.462
ylan-1.381
 goose-1.247
iber-1.224
Token to
Token ID284
Feature activation+7.74e-02

Pos logit contributions
leys+1.126
mingham+1.110
 VIDE+1.098
\/\/+1.058
arate+1.056
Neg logit contributions
uv-0.943
ourning-0.897
ylan-0.857
 goose-0.767
iber-0.738
Token the
Token ID262
Feature activation+1.31e-01

Pos logit contributions
leys+0.156
mingham+0.154
 VIDE+0.152
\/\/+0.146
arate+0.146
Neg logit contributions
uv-0.124
ourning-0.118
ylan-0.112
 goose-0.101
iber-0.096
Token passenger
Token ID11849
Feature activation+3.94e-01

Pos logit contributions
leys+0.227
mingham+0.223
 VIDE+0.220
\/\/+0.211
arate+0.211
Neg logit contributions
uv-0.247
ourning-0.236
ylan-0.228
 goose-0.207
iber-0.200
Token side
Token ID1735
Feature activation+3.54e-01

Pos logit contributions
leys+0.610
mingham+0.599
 VIDE+0.592
\/\/+0.563
arate+0.562
Neg logit contributions
uv-0.813
ourning-0.784
ylan-0.755
 goose-0.693
iber-0.674
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
leys+0.651
mingham+0.641
 VIDE+0.635
\/\/+0.609
arate+0.608
Neg logit contributions
uv-0.625
ourning-0.598
ylan-0.573
 goose-0.517
iber-0.500
Token lift
Token ID10303
Feature activation+1.34e+00

Pos logit contributions
uv+0.169
ourning+0.160
ylan+0.154
 goose+0.139
iber+0.133
Neg logit contributions
leys-0.197
mingham-0.195
 VIDE-0.192
\/\/-0.186
arate-0.185
Token up
Token ID510
Feature activation+2.31e-01

Pos logit contributions
leys+2.646
mingham+2.598
 VIDE+2.574
\/\/+2.487
arate+2.486
Neg logit contributions
uv-2.177
ourning-2.094
ylan-1.981
 goose-1.753
iber-1.709
Token the
Token ID262
Feature activation+3.52e-02

Pos logit contributions
leys+0.472
mingham+0.466
 VIDE+0.461
\/\/+0.444
arate+0.444
Neg logit contributions
uv-0.364
ourning-0.345
ylan-0.330
 goose-0.294
iber-0.282
Token e
Token ID304
Feature activation-2.26e-01

Pos logit contributions
leys+0.061
mingham+0.060
 VIDE+0.059
\/\/+0.057
arate+0.057
Neg logit contributions
uv-0.066
ourning-0.063
ylan-0.061
 goose-0.055
iber-0.053
Token-
Token ID12
Feature activation+3.44e-01

Pos logit contributions
uv+0.416
ourning+0.396
ylan+0.382
 goose+0.346
iber+0.335
Neg logit contributions
leys-0.400
mingham-0.396
 VIDE-0.391
\/\/-0.374
arate-0.373
Tokenbra
Token ID16057
Feature activation+2.49e-01

Pos logit contributions
leys+0.592
mingham+0.582
 VIDE+0.574
arate+0.551
\/\/+0.549
Neg logit contributions
uv-0.648
ourning-0.624
ylan-0.601
 goose-0.548
iber-0.527
Token Act
Token ID2191
Feature activation+2.97e-01

Pos logit contributions
leys+0.178
mingham+0.175
 VIDE+0.173
arate+0.167
\/\/+0.166
Neg logit contributions
uv-0.156
ourning-0.150
ylan-0.143
 goose-0.128
iber-0.123
Tokenon
Token ID261
Feature activation+1.97e-01

Pos logit contributions
leys+0.474
mingham+0.466
 VIDE+0.452
arate+0.439
\/\/+0.436
Neg logit contributions
uv-0.595
ourning-0.578
ylan-0.555
 goose-0.511
 weeping-0.493
Token)
Token ID8
Feature activation+7.77e-02

Pos logit contributions
leys+0.357
mingham+0.352
 VIDE+0.347
arate+0.334
\/\/+0.334
Neg logit contributions
uv-0.352
ourning-0.339
ylan-0.325
 goose-0.294
iber-0.283
TokenIn
Token ID818
Feature activation+3.18e-01

Pos logit contributions
leys+0.138
mingham+0.136
 VIDE+0.134
\/\/+0.129
arate+0.129
Neg logit contributions
uv-0.143
ourning-0.137
ylan-0.132
 goose-0.119
iber-0.115
Token an
Token ID281
Feature activation+9.64e-02

Pos logit contributions
leys+0.595
mingham+0.587
 VIDE+0.578
arate+0.564
\/\/+0.558
Neg logit contributions
uv-0.548
ourning-0.525
ylan-0.503
 goose-0.453
iber-0.437
Token unpublished
Token ID42686
Feature activation+1.34e+00

Pos logit contributions
leys+0.160
mingham+0.157
 VIDE+0.156
\/\/+0.149
arate+0.148
Neg logit contributions
uv-0.189
ourning-0.181
ylan-0.175
 goose-0.160
iber-0.154
Token article
Token ID2708
Feature activation-4.03e-01

Pos logit contributions
leys+2.085
mingham+2.052
 VIDE+2.032
arate+1.936
\/\/+1.921
Neg logit contributions
uv-2.734
ourning-2.636
ylan-2.534
 goose-2.309
iber-2.247
Token written
Token ID3194
Feature activation-4.88e-01

Pos logit contributions
uv+0.692
ourning+0.660
ylan+0.633
 goose+0.570
iber+0.548
Neg logit contributions
leys-0.763
mingham-0.752
 VIDE-0.744
arate-0.715
\/\/-0.715
Token around
Token ID1088
Feature activation+2.90e-01

Pos logit contributions
uv+0.818
ourning+0.779
ylan+0.746
 goose+0.671
iber+0.645
Neg logit contributions
leys-0.944
mingham-0.932
 VIDE-0.922
\/\/-0.887
arate-0.886
Token May
Token ID1737
Feature activation+3.88e-01

Pos logit contributions
leys+0.505
mingham+0.497
 VIDE+0.491
\/\/+0.474
arate+0.473
Neg logit contributions
uv-0.538
ourning-0.517
ylan-0.496
 goose-0.449
iber-0.435
Token of
Token ID286
Feature activation+8.67e-02

Pos logit contributions
leys+0.643
mingham+0.629
 VIDE+0.619
\/\/+0.602
arate+0.601
Neg logit contributions
uv-0.753
ourning-0.726
ylan-0.698
 goose-0.635
iber-0.616
Token the
Token ID262
Feature activation-5.24e-02

Pos logit contributions
uv+0.026
ourning+0.025
ylan+0.024
 goose+0.021
iber+0.020
Neg logit contributions
leys-0.035
mingham-0.034
 VIDE-0.034
\/\/-0.033
arate-0.033
Token pile
Token ID14540
Feature activation-6.75e-02

Pos logit contributions
uv+0.096
ourning+0.091
ylan+0.088
 goose+0.080
iber+0.077
Neg logit contributions
leys-0.094
mingham-0.092
 VIDE-0.091
\/\/-0.087
arate-0.087
Token would
Token ID561
Feature activation-2.19e-02

Pos logit contributions
uv+0.110
ourning+0.106
ylan+0.101
 goose+0.089
iber+0.086
Neg logit contributions
leys-0.134
mingham-0.131
 VIDE-0.129
\/\/-0.126
arate-0.125
Token squeeze
Token ID21229
Feature activation+3.01e-01

Pos logit contributions
uv+0.038
ourning+0.036
ylan+0.035
 goose+0.031
iber+0.030
Neg logit contributions
leys-0.041
mingham-0.041
 VIDE-0.040
arate-0.039
\/\/-0.039
Token the
Token ID262
Feature activation+5.95e-02

Pos logit contributions
leys+0.583
mingham+0.574
 VIDE+0.567
\/\/+0.547
arate+0.546
Neg logit contributions
uv-0.504
ourning-0.483
ylan-0.460
 goose-0.412
iber-0.398
Token br
Token ID865
Feature activation+1.32e+00

Pos logit contributions
leys+0.105
mingham+0.104
 VIDE+0.102
arate+0.098
\/\/+0.098
Neg logit contributions
uv-0.110
ourning-0.105
ylan-0.101
 goose-0.092
iber-0.089
Tokenine
Token ID500
Feature activation-8.06e-01

Pos logit contributions
leys+2.138
mingham+2.097
 VIDE+2.027
arate+1.999
\/\/+1.945
Neg logit contributions
uv-2.601
ourning-2.524
ylan-2.400
 goose-2.229
 weeping-2.172
Token out
Token ID503
Feature activation-5.91e-02

Pos logit contributions
uv+1.431
ourning+1.339
ylan+1.306
 goose+1.195
iber+1.143
Neg logit contributions
leys-1.477
mingham-1.473
 VIDE-1.454
\/\/-1.386
arate-1.386
Token of
Token ID286
Feature activation-3.89e-02

Pos logit contributions
uv+0.101
ourning+0.097
ylan+0.092
 goose+0.083
iber+0.080
Neg logit contributions
leys-0.112
mingham-0.111
 VIDE-0.109
\/\/-0.105
arate-0.105
Token the
Token ID262
Feature activation+4.59e-02

Pos logit contributions
uv+0.059
ourning+0.056
ylan+0.054
 goose+0.048
iber+0.045
Neg logit contributions
leys-0.081
mingham-0.080
 VIDE-0.080
\/\/-0.077
arate-0.077
Token past
Token ID1613
Feature activation-2.47e-02

Pos logit contributions
leys+0.081
mingham+0.080
 VIDE+0.079
arate+0.075
\/\/+0.075
Neg logit contributions
uv-0.085
ourning-0.081
ylan-0.078
 goose-0.071
iber-0.069
TokenSM
Token ID12310
Feature activation+3.30e-01

Pos logit contributions
uv+0.495
ourning+0.470
ylan+0.450
 goose+0.403
iber+0.387
Neg logit contributions
leys-0.596
mingham-0.589
 VIDE-0.581
\/\/-0.560
arate-0.560
TokenISSION
Token ID40373
Feature activation+3.34e-01

Pos logit contributions
leys+0.458
mingham+0.449
 VIDE+0.443
\/\/+0.419
arate+0.418
Neg logit contributions
uv-0.731
ourning-0.705
ylan-0.684
 goose-0.632
iber-0.614
Token ///
Token ID34013
Feature activation+6.88e-01

Pos logit contributions
leys+0.579
 VIDE+0.570
mingham+0.569
\/\/+0.544
arate+0.540
Neg logit contributions
uv-0.622
ourning-0.600
ylan-0.575
 goose-0.525
iber-0.505
Token----------------------------------------------------------------
Token ID10097
Feature activation+8.19e-01

Pos logit contributions
leys+1.204
 VIDE+1.186
mingham+1.184
\/\/+1.140
arate+1.124
Neg logit contributions
uv-1.266
ourning-1.224
ylan-1.169
 goose-1.067
 weeping-1.030
Token----------------------------------------------------------------
Token ID10097
Feature activation+7.51e-01

Pos logit contributions
leys+1.286
mingham+1.271
 VIDE+1.256
\/\/+1.209
arate+1.188
Neg logit contributions
uv-1.657
ourning-1.599
ylan-1.544
 goose-1.418
 weeping-1.379
Token------------------------
Token ID22369
Feature activation+1.32e+00

Pos logit contributions
leys+1.217
mingham+1.203
 VIDE+1.194
\/\/+1.145
arate+1.130
Neg logit contributions
uv-1.485
ourning-1.431
ylan-1.378
 goose-1.263
 weeping-1.227
Token--
Token ID438
Feature activation+6.94e-01

Pos logit contributions
leys+2.139
 VIDE+2.108
mingham+2.102
\/\/+2.028
arate+1.988
Neg logit contributions
uv-2.576
ourning-2.491
ylan-2.385
 goose-2.192
 weeping-2.142
Token///
Token ID20379
Feature activation-1.61e-01

Pos logit contributions
leys+1.164
mingham+1.143
 VIDE+1.142
\/\/+1.097
arate+1.083
Neg logit contributions
uv-1.331
ourning-1.283
ylan-1.230
 goose-1.129
 weeping-1.092
Token L
Token ID406
Feature activation+1.78e-01

Pos logit contributions
uv+0.310
ourning+0.298
ylan+0.287
 goose+0.262
iber+0.253
Neg logit contributions
leys-0.271
mingham-0.267
 VIDE-0.263
\/\/-0.252
arate-0.251
Tokenett
Token ID3087
Feature activation-5.35e-01

Pos logit contributions
leys+0.289
mingham+0.284
 VIDE+0.280
\/\/+0.268
arate+0.268
Neg logit contributions
uv-0.351
ourning-0.338
ylan-0.326
 goose-0.300
 weeping-0.289
Tokenre
Token ID260
Feature activation+1.91e-01

Pos logit contributions
uv+1.012
ourning+0.968
ylan+0.932
 goose+0.849
iber+0.822
Neg logit contributions
leys-0.920
mingham-0.906
 VIDE-0.895
\/\/-0.857
arate-0.855
Token
Token ID447
Feature activation+6.47e-02

Pos logit contributions
uv+0.117
ourning+0.112
ylan+0.108
 goose+0.096
 weeping+0.092
Neg logit contributions
leys-0.131
mingham-0.128
 VIDE-0.127
\/\/-0.123
arate-0.122
Token
Token ID247
Feature activation+6.00e-02

Pos logit contributions
leys+0.168
mingham+0.166
 VIDE+0.164
arate+0.160
\/\/+0.160
Neg logit contributions
uv-0.066
ourning-0.061
ylan-0.056
 goose-0.046
 weeping-0.043
Tokens
Token ID82
Feature activation+9.32e-03

Pos logit contributions
leys+0.118
mingham+0.115
 VIDE+0.113
\/\/+0.110
arate+0.110
Neg logit contributions
uv-0.099
ourning-0.095
ylan-0.091
 goose-0.082
 weeping-0.079
Token not
Token ID407
Feature activation-5.75e-02

Pos logit contributions
leys+0.018
mingham+0.018
 VIDE+0.017
\/\/+0.017
arate+0.017
Neg logit contributions
uv-0.016
ourning-0.015
ylan-0.014
 goose-0.013
iber-0.012
Token impe
Token ID18516
Feature activation-2.08e-02

Pos logit contributions
uv+0.099
ourning+0.094
ylan+0.091
 goose+0.081
iber+0.078
Neg logit contributions
leys-0.109
mingham-0.107
 VIDE-0.106
\/\/-0.102
arate-0.102
Tokenached
Token ID2317
Feature activation+1.31e+00

Pos logit contributions
uv+0.047
ourning+0.044
ylan+0.044
 goose+0.041
 Goose+0.040
Neg logit contributions
leys-0.028
mingham-0.028
 VIDE-0.027
arate-0.026
\/\/-0.025
Token within
Token ID1626
Feature activation+5.45e-02

Pos logit contributions
leys+2.183
mingham+2.140
 VIDE+2.113
\/\/+2.043
arate+2.017
Neg logit contributions
uv-2.516
ourning-2.402
ylan-2.327
 goose-2.105
iber-2.029
Token the
Token ID262
Feature activation+8.83e-02

Pos logit contributions
leys+0.107
mingham+0.105
 VIDE+0.104
\/\/+0.100
arate+0.100
Neg logit contributions
uv-0.090
ourning-0.086
ylan-0.082
 goose-0.073
iber-0.071
Token next
Token ID1306
Feature activation-5.81e-02

Pos logit contributions
leys+0.152
mingham+0.149
 VIDE+0.148
\/\/+0.142
arate+0.142
Neg logit contributions
uv-0.167
ourning-0.159
ylan-0.154
 goose-0.139
iber-0.135
Token year
Token ID614
Feature activation-6.73e-02

Pos logit contributions
uv+0.100
ourning+0.096
ylan+0.092
 goose+0.082
iber+0.080
Neg logit contributions
leys-0.110
mingham-0.108
 VIDE-0.107
\/\/-0.103
arate-0.103
Token-
Token ID12
Feature activation+1.20e-01

Pos logit contributions
uv+0.126
ourning+0.120
ylan+0.116
 goose+0.105
iber+0.102
Neg logit contributions
leys-0.117
mingham-0.115
 VIDE-0.114
\/\/-0.109
arate-0.109
Token wing
Token ID8539
Feature activation-8.74e-02

Pos logit contributions
leys+0.849
mingham+0.835
 VIDE+0.825
\/\/+0.789
arate+0.789
Neg logit contributions
uv-0.878
ourning-0.844
ylan-0.806
 goose-0.730
iber-0.709
Token,
Token ID11
Feature activation-1.02e-01

Pos logit contributions
uv+0.164
ourning+0.157
ylan+0.151
 goose+0.137
iber+0.133
Neg logit contributions
leys-0.152
mingham-0.150
 VIDE-0.148
\/\/-0.142
arate-0.142
Token further
Token ID2252
Feature activation-1.23e-01

Pos logit contributions
uv+0.173
ourning+0.165
ylan+0.158
 goose+0.142
iber+0.137
Neg logit contributions
leys-0.195
mingham-0.192
 VIDE-0.190
\/\/-0.183
arate-0.182
Token reducing
Token ID8868
Feature activation-3.89e-02

Pos logit contributions
uv+0.188
ourning+0.180
ylan+0.172
 goose+0.149
iber+0.145
Neg logit contributions
leys-0.255
mingham-0.252
 VIDE-0.249
arate-0.241
\/\/-0.240
Token its
Token ID663
Feature activation-1.66e-03

Pos logit contributions
uv+0.063
ourning+0.060
ylan+0.057
 goose+0.051
iber+0.049
Neg logit contributions
leys-0.078
mingham-0.077
 VIDE-0.076
\/\/-0.073
arate-0.073
Token lifting
Token ID16842
Feature activation+1.30e+00

Pos logit contributions
uv+0.003
ourning+0.003
ylan+0.003
 goose+0.003
iber+0.002
Neg logit contributions
leys-0.003
mingham-0.003
 VIDE-0.003
arate-0.003
\/\/-0.003
Token capability
Token ID12971
Feature activation+3.58e-01

Pos logit contributions
leys+2.421
mingham+2.385
 VIDE+2.341
\/\/+2.264
arate+2.256
Neg logit contributions
uv-2.262
ourning-2.202
ylan-2.066
 goose-1.837
iber-1.805
Token.
Token ID13
Feature activation+2.68e-02

Pos logit contributions
leys+0.594
mingham+0.585
 VIDE+0.575
\/\/+0.552
arate+0.551
Neg logit contributions
uv-0.696
ourning-0.670
ylan-0.645
 goose-0.587
iber-0.569
Token The
Token ID383
Feature activation+3.07e-02

Pos logit contributions
leys+0.048
mingham+0.047
 VIDE+0.047
\/\/+0.045
arate+0.045
Neg logit contributions
uv-0.049
ourning-0.047
ylan-0.045
 goose-0.041
iber-0.039
Token F
Token ID376
Feature activation+3.91e-02

Pos logit contributions
leys+0.055
mingham+0.054
 VIDE+0.053
\/\/+0.051
arate+0.051
Neg logit contributions
uv-0.056
ourning-0.054
ylan-0.052
 goose-0.047
iber-0.045
Token8
Token ID23
Feature activation-2.85e-01

Pos logit contributions
leys+0.066
mingham+0.065
 VIDE+0.064
\/\/+0.062
arate+0.062
Neg logit contributions
uv-0.075
ourning-0.072
ylan-0.069
 goose-0.063
iber-0.061
Token3
Token ID18
Feature activation+1.81e-01

Pos logit contributions
leys+0.256
mingham+0.251
 VIDE+0.247
arate+0.239
\/\/+0.239
Neg logit contributions
uv-0.300
ourning-0.290
ylan-0.279
 goose-0.256
 weeping-0.247
Token or
Token ID393
Feature activation+1.18e-01

Pos logit contributions
leys+0.336
mingham+0.329
 VIDE+0.327
arate+0.317
\/\/+0.314
Neg logit contributions
uv-0.313
ourning-0.303
ylan-0.288
 goose-0.260
iber-0.251
Token 10
Token ID838
Feature activation+3.82e-01

Pos logit contributions
leys+0.210
mingham+0.208
 VIDE+0.206
arate+0.198
\/\/+0.197
Neg logit contributions
uv-0.214
ourning-0.207
ylan-0.198
 goose-0.179
iber-0.173
Tokenmg
Token ID11296
Feature activation+3.24e-01

Pos logit contributions
leys+0.750
mingham+0.732
 VIDE+0.731
arate+0.711
\/\/+0.710
Neg logit contributions
uv-0.612
ourning-0.596
ylan-0.561
 goose-0.504
iber-0.485
Token/
Token ID14
Feature activation+5.85e-02

Pos logit contributions
leys+0.536
mingham+0.521
 VIDE+0.518
arate+0.498
\/\/+0.497
Neg logit contributions
uv-0.629
ourning-0.610
ylan-0.582
 goose-0.533
iber-0.515
Tokenkg
Token ID10025
Feature activation+1.30e+00

Pos logit contributions
leys+0.060
mingham+0.058
 VIDE+0.056
arate+0.053
\/\/+0.053
Neg logit contributions
uv-0.151
ourning-0.147
ylan-0.143
 goose-0.134
 weeping-0.131
Token),
Token ID828
Feature activation-1.78e-01

Pos logit contributions
leys+2.332
mingham+2.272
 VIDE+2.265
arate+2.202
\/\/+2.186
Neg logit contributions
uv-2.301
ourning-2.237
ylan-2.107
 goose-1.895
iber-1.829
Token but
Token ID475
Feature activation-1.30e-01

Pos logit contributions
uv+0.309
ourning+0.295
ylan+0.283
 goose+0.255
iber+0.246
Neg logit contributions
leys-0.332
mingham-0.327
 VIDE-0.324
\/\/-0.311
arate-0.311
Token not
Token ID407
Feature activation+2.73e-02

Pos logit contributions
uv+0.214
ourning+0.206
ylan+0.195
 goose+0.177
 weeping+0.168
Neg logit contributions
leys-0.255
mingham-0.250
 VIDE-0.248
arate-0.240
\/\/-0.238
Token flu
Token ID6562
Feature activation+1.47e-01

Pos logit contributions
leys+0.052
mingham+0.052
 VIDE+0.051
arate+0.049
\/\/+0.049
Neg logit contributions
uv-0.046
ourning-0.044
ylan-0.042
 goose-0.037
iber-0.036
Tokenox
Token ID1140
Feature activation+3.50e-01

Pos logit contributions
leys+0.304
mingham+0.299
 VIDE+0.296
arate+0.287
\/\/+0.286
Neg logit contributions
uv-0.225
ourning-0.214
ylan-0.203
 goose-0.181
iber-0.173
Token,
Token ID11
Feature activation+8.27e-02

Pos logit contributions
uv+0.224
ourning+0.215
ylan+0.206
 goose+0.187
iber+0.181
Neg logit contributions
leys-0.220
mingham-0.217
 VIDE-0.214
arate-0.206
\/\/-0.205
Token please
Token ID3387
Feature activation-6.44e-02

Pos logit contributions
leys+0.151
mingham+0.149
 VIDE+0.147
\/\/+0.141
arate+0.141
Neg logit contributions
uv-0.148
ourning-0.141
ylan-0.135
 goose-0.122
iber-0.118
Token download
Token ID4321
Feature activation+2.72e-01

Pos logit contributions
uv+0.115
ourning+0.109
ylan+0.105
 goose+0.096
iber+0.092
Neg logit contributions
leys-0.118
mingham-0.116
 VIDE-0.115
\/\/-0.110
arate-0.110
Token the
Token ID262
Feature activation+1.67e-01

Pos logit contributions
leys+0.536
mingham+0.529
 VIDE+0.525
arate+0.506
\/\/+0.505
Neg logit contributions
uv-0.445
ourning-0.424
ylan-0.405
 goose-0.362
iber-0.347
Token W
Token ID370
Feature activation+3.56e-01

Pos logit contributions
leys+0.294
mingham+0.289
 VIDE+0.286
\/\/+0.274
arate+0.274
Neg logit contributions
uv-0.310
ourning-0.297
ylan-0.285
 goose-0.258
iber-0.250
Tokenoff
Token ID2364
Feature activation+1.30e+00

Pos logit contributions
leys+0.522
mingham+0.514
 VIDE+0.503
arate+0.483
\/\/+0.481
Neg logit contributions
uv-0.759
ourning-0.733
ylan-0.707
 goose-0.656
 weeping-0.635
Tokenord
Token ID585
Feature activation+3.67e-01

Pos logit contributions
leys+1.837
mingham+1.805
 VIDE+1.731
arate+1.690
\/\/+1.670
Neg logit contributions
uv-2.841
ourning-2.746
ylan-2.651
 goose-2.472
 weeping-2.397
Token Award
Token ID11289
Feature activation-3.92e-02

Pos logit contributions
leys+0.614
mingham+0.604
 VIDE+0.597
arate+0.571
\/\/+0.569
Neg logit contributions
uv-0.709
ourning-0.682
ylan-0.657
 goose-0.598
iber-0.578
Token nomination
Token ID11872
Feature activation-8.94e-02

Pos logit contributions
uv+0.075
ourning+0.072
ylan+0.069
 goose+0.063
iber+0.061
Neg logit contributions
leys-0.067
mingham-0.066
 VIDE-0.065
arate-0.062
\/\/-0.062
Token packet
Token ID19638
Feature activation+1.26e-01

Pos logit contributions
uv+0.183
ourning+0.176
ylan+0.170
 goose+0.156
iber+0.151
Neg logit contributions
leys-0.140
mingham-0.137
 VIDE-0.135
arate-0.129
\/\/-0.129
Token,
Token ID11
Feature activation+8.13e-02

Pos logit contributions
leys+0.218
mingham+0.214
 VIDE+0.212
\/\/+0.203
arate+0.203
Neg logit contributions
uv-0.238
ourning-0.228
ylan-0.219
 goose-0.199
iber-0.193
Token played
Token ID2826
Feature activation+1.23e-01

Pos logit contributions
leys+0.030
mingham+0.030
 VIDE+0.030
arate+0.028
\/\/+0.028
Neg logit contributions
uv-0.026
ourning-0.025
ylan-0.024
 goose-0.021
iber-0.021
Token and
Token ID290
Feature activation-4.42e-02

Pos logit contributions
leys+0.233
mingham+0.230
 VIDE+0.228
\/\/+0.219
arate+0.219
Neg logit contributions
uv-0.210
ourning-0.201
ylan-0.192
 goose-0.172
iber-0.166
Token handled
Token ID12118
Feature activation-4.33e-02

Pos logit contributions
uv+0.069
ourning+0.066
ylan+0.063
 goose+0.055
iber+0.053
Neg logit contributions
leys-0.091
mingham-0.089
 VIDE-0.088
\/\/-0.085
arate-0.085
Token Sunday
Token ID3502
Feature activation+1.88e-01

Pos logit contributions
uv+0.070
ourning+0.067
ylan+0.063
 goose+0.056
iber+0.054
Neg logit contributions
leys-0.087
mingham-0.085
 VIDE-0.085
\/\/-0.081
arate-0.081
Token's
Token ID338
Feature activation+9.66e-02

Pos logit contributions
leys+0.376
mingham+0.371
 VIDE+0.367
\/\/+0.354
arate+0.354
Neg logit contributions
uv-0.302
ourning-0.288
ylan-0.274
 goose-0.244
iber-0.235
Token spat
Token ID15246
Feature activation+1.29e+00

Pos logit contributions
leys+0.183
mingham+0.180
 VIDE+0.178
\/\/+0.171
arate+0.171
Neg logit contributions
uv-0.166
ourning-0.159
ylan-0.152
 goose-0.136
iber-0.132
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
leys+2.188
mingham+2.140
 VIDE+2.112
arate+2.019
\/\/+2.012
Neg logit contributions
uv-2.455
ourning-2.375
ylan-2.289
 goose-2.079
 weeping-1.994
Token
Token ID198
Feature activation-1.02e-01

Pos logit contributions
uv+0.023
ourning+0.022
ylan+0.021
 goose+0.018
iber+0.017
Neg logit contributions
leys-0.030
mingham-0.029
 VIDE-0.029
\/\/-0.028
arate-0.028
Token
Token ID198
Feature activation-6.45e-02

Pos logit contributions
uv+0.156
ourning+0.149
ylan+0.142
 goose+0.126
iber+0.120
Neg logit contributions
leys-0.211
mingham-0.208
 VIDE-0.206
\/\/-0.199
arate-0.199
Token"
Token ID1
Feature activation+4.37e-01

Pos logit contributions
uv+0.134
ourning+0.128
ylan+0.124
 goose+0.114
iber+0.111
Neg logit contributions
leys-0.099
mingham-0.098
 VIDE-0.096
\/\/-0.091
arate-0.091
TokenTeam
Token ID15592
Feature activation+4.89e-01

Pos logit contributions
leys+0.662
mingham+0.647
 VIDE+0.641
arate+0.611
\/\/+0.610
Neg logit contributions
uv-0.915
ourning-0.884
ylan-0.852
 goose-0.785
 weeping-0.763
Token
Token ID250
Feature activation-1.85e-01

Pos logit contributions
uv+0.050
ourning+0.047
ylan+0.044
 goose+0.040
 weeping+0.038
Neg logit contributions
leys-0.068
mingham-0.067
 VIDE-0.066
\/\/-0.064
arate-0.064
TokenThe
Token ID464
Feature activation-4.94e-02

Pos logit contributions
uv+0.384
ourning+0.369
ylan+0.356
 goose+0.327
iber+0.318
Neg logit contributions
leys-0.286
mingham-0.282
 VIDE-0.278
\/\/-0.264
arate-0.264
Token situation
Token ID3074
Feature activation+2.49e-01

Pos logit contributions
uv+0.089
ourning+0.086
ylan+0.082
 goose+0.074
iber+0.072
Neg logit contributions
leys-0.089
mingham-0.088
 VIDE-0.087
\/\/-0.083
arate-0.083
Token may
Token ID743
Feature activation+2.57e-01

Pos logit contributions
leys+0.465
mingham+0.458
 VIDE+0.453
\/\/+0.436
arate+0.434
Neg logit contributions
uv-0.434
ourning-0.415
ylan-0.397
 goose-0.359
iber-0.345
Token rec
Token ID664
Feature activation+2.56e-01

Pos logit contributions
leys+0.493
mingham+0.487
 VIDE+0.481
\/\/+0.462
arate+0.462
Neg logit contributions
uv-0.436
ourning-0.415
ylan-0.398
 goose-0.357
iber-0.344
Tokenede
Token ID18654
Feature activation-1.91e+00

Pos logit contributions
leys+0.577
mingham+0.571
 VIDE+0.564
arate+0.548
\/\/+0.546
Neg logit contributions
uv-0.343
ourning-0.325
ylan-0.308
 goose-0.268
 weeping-0.254
Token or
Token ID393
Feature activation+1.84e-01

Pos logit contributions
uv+3.316
ourning+3.129
ylan+3.018
 goose+2.766
iber+2.695
Neg logit contributions
leys-3.526
mingham-3.486
 VIDE-3.471
arate-3.300
\/\/-3.265
Token deterior
Token ID17975
Feature activation-8.01e-03

Pos logit contributions
leys+0.357
mingham+0.354
 VIDE+0.349
\/\/+0.337
arate+0.336
Neg logit contributions
uv-0.306
ourning-0.292
ylan-0.279
 goose-0.249
iber-0.240
Tokenate
Token ID378
Feature activation-3.27e-01

Pos logit contributions
uv+0.014
ourning+0.014
ylan+0.013
 goose+0.012
iber+0.011
Neg logit contributions
leys-0.015
mingham-0.014
 VIDE-0.014
\/\/-0.014
arate-0.014
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
uv+0.591
ourning+0.564
ylan+0.543
 goose+0.493
iber+0.477
Neg logit contributions
leys-0.590
mingham-0.583
 VIDE-0.576
arate-0.551
\/\/-0.551
Token lead
Token ID1085
Feature activation+2.42e-01

Pos logit contributions
uv+0.186
ourning+0.177
ylan+0.170
 goose+0.153
iber+0.147
Neg logit contributions
leys-0.205
mingham-0.202
 VIDE-0.199
\/\/-0.192
arate-0.192
TokenWhen
Token ID2215
Feature activation+1.78e-01

Pos logit contributions
uv+0.075
ourning+0.072
ylan+0.070
 goose+0.064
 weeping+0.062
Neg logit contributions
leys-0.068
mingham-0.067
 VIDE-0.066
\/\/-0.064
arate-0.063
Token cal
Token ID2386
Feature activation+2.06e-01

Pos logit contributions
leys+0.359
mingham+0.352
 VIDE+0.349
\/\/+0.337
arate+0.336
Neg logit contributions
uv-0.283
ourning-0.272
ylan-0.260
 goose-0.229
iber-0.222
Tokenving
Token ID1075
Feature activation-4.98e-01

Pos logit contributions
leys+0.545
mingham+0.535
 VIDE+0.529
\/\/+0.517
arate+0.516
Neg logit contributions
uv-0.196
ourning-0.181
ylan-0.171
 goose-0.140
 weeping-0.126
Token fronts
Token ID29324
Feature activation-5.46e-01

Pos logit contributions
uv+0.812
ourning+0.775
ylan+0.742
 goose+0.661
iber+0.638
Neg logit contributions
leys-0.985
mingham-0.972
 VIDE-0.960
\/\/-0.925
arate-0.924
Token rec
Token ID664
Feature activation+1.62e-01

Pos logit contributions
uv+0.952
ourning+0.910
ylan+0.873
 goose+0.786
iber+0.758
Neg logit contributions
leys-1.020
mingham-1.007
 VIDE-0.994
\/\/-0.956
arate-0.955
Tokenede
Token ID18654
Feature activation-1.88e+00

Pos logit contributions
leys+0.363
mingham+0.359
 VIDE+0.354
arate+0.346
\/\/+0.343
Neg logit contributions
uv-0.217
ourning-0.206
ylan-0.197
 goose-0.171
 weeping-0.163
Token to
Token ID284
Feature activation-1.20e-02

Pos logit contributions
uv+3.243
ourning+3.034
ylan+2.924
 goose+2.683
 pu+2.576
Neg logit contributions
leys-3.489
mingham-3.479
 VIDE-3.455
arate-3.279
\/\/-3.247
Token a
Token ID257
Feature activation-8.14e-02

Pos logit contributions
uv+0.020
ourning+0.020
ylan+0.019
 goose+0.017
iber+0.016
Neg logit contributions
leys-0.023
mingham-0.023
 VIDE-0.022
\/\/-0.022
arate-0.021
Token point
Token ID966
Feature activation-6.21e-02

Pos logit contributions
uv+0.141
ourning+0.135
ylan+0.130
 goose+0.116
iber+0.113
Neg logit contributions
leys-0.152
mingham-0.150
 VIDE-0.148
\/\/-0.143
arate-0.143
Token that
Token ID326
Feature activation-3.54e-03

Pos logit contributions
uv+0.112
ourning+0.107
ylan+0.103
 goose+0.093
iber+0.090
Neg logit contributions
leys-0.112
mingham-0.111
 VIDE-0.109
\/\/-0.105
arate-0.105
Token we
Token ID356
Feature activation-1.29e-01

Pos logit contributions
uv+0.006
ourning+0.006
ylan+0.005
 goose+0.005
iber+0.005
Neg logit contributions
leys-0.007
mingham-0.007
 VIDE-0.007
\/\/-0.006
arate-0.006
Token recognition
Token ID9465
Feature activation-1.78e-01

Pos logit contributions
leys+0.596
mingham+0.584
 VIDE+0.579
arate+0.556
\/\/+0.556
Neg logit contributions
uv-0.601
ourning-0.579
ylan-0.551
 goose-0.495
iber-0.484
Token as
Token ID355
Feature activation-2.56e-01

Pos logit contributions
uv+0.319
ourning+0.305
ylan+0.293
 goose+0.265
iber+0.256
Neg logit contributions
leys-0.322
mingham-0.318
 VIDE-0.314
arate-0.301
\/\/-0.301
Token his
Token ID465
Feature activation-1.24e-01

Pos logit contributions
uv+0.432
ourning+0.411
ylan+0.394
 goose+0.356
iber+0.341
Neg logit contributions
leys-0.493
mingham-0.487
 VIDE-0.481
\/\/-0.463
arate-0.462
Token highly
Token ID4047
Feature activation-8.62e-02

Pos logit contributions
uv+0.227
ourning+0.216
ylan+0.208
 goose+0.190
iber+0.183
Neg logit contributions
leys-0.220
mingham-0.217
 VIDE-0.215
\/\/-0.206
arate-0.206
Token paid
Token ID3432
Feature activation-7.79e-01

Pos logit contributions
uv+0.124
ourning+0.118
ylan+0.112
 goose+0.098
iber+0.093
Neg logit contributions
leys-0.187
mingham-0.184
 VIDE-0.183
arate-0.177
\/\/-0.177
Token linem
Token ID22774
Feature activation-1.78e+00

Pos logit contributions
uv+1.465
ourning+1.390
ylan+1.350
 goose+1.242
 weeping+1.192
Neg logit contributions
leys-1.347
mingham-1.336
 VIDE-1.316
\/\/-1.255
arate-1.251
Tokenates
Token ID689
Feature activation+5.79e-02

Pos logit contributions
uv+3.026
ourning+2.876
ylan+2.759
 goose+2.469
iber+2.393
Neg logit contributions
leys-3.373
mingham-3.366
 VIDE-3.337
\/\/-3.182
arate-3.153
Token David
Token ID3271
Feature activation-5.65e-01

Pos logit contributions
leys+0.103
mingham+0.102
 VIDE+0.100
\/\/+0.096
arate+0.096
Neg logit contributions
uv-0.105
ourning-0.102
ylan-0.097
 goose-0.088
iber-0.085
Token De
Token ID1024
Feature activation+2.94e-01

Pos logit contributions
uv+0.972
ourning+0.928
ylan+0.889
 goose+0.801
iber+0.772
Neg logit contributions
leys-1.068
mingham-1.054
 VIDE-1.041
\/\/-1.001
arate-1.000
TokenCast
Token ID19248
Feature activation+3.89e-01

Pos logit contributions
leys+0.480
mingham+0.469
 VIDE+0.462
\/\/+0.443
arate+0.441
Neg logit contributions
uv-0.582
ourning-0.561
ylan-0.542
 goose-0.494
 weeping-0.478
Tokenro
Token ID305
Feature activation+1.01e-01

Pos logit contributions
leys+0.670
mingham+0.654
 VIDE+0.650
arate+0.629
\/\/+0.618
Neg logit contributions
uv-0.733
ourning-0.705
ylan-0.676
 goose-0.613
iber-0.600
Token knew
Token ID2993
Feature activation+5.29e-02

Pos logit contributions
uv+0.021
ourning+0.020
ylan+0.019
 goose+0.017
iber+0.017
Neg logit contributions
leys-0.022
mingham-0.021
 VIDE-0.021
\/\/-0.020
arate-0.020
Token that
Token ID326
Feature activation+1.56e-01

Pos logit contributions
leys+0.103
mingham+0.102
 VIDE+0.101
\/\/+0.097
arate+0.097
Neg logit contributions
uv-0.088
ourning-0.084
ylan-0.080
 goose-0.072
iber-0.069
Token if
Token ID611
Feature activation-4.00e-01

Pos logit contributions
leys+0.286
mingham+0.282
 VIDE+0.278
\/\/+0.267
arate+0.267
Neg logit contributions
uv-0.278
ourning-0.266
ylan-0.255
 goose-0.231
iber-0.223
Token we
Token ID356
Feature activation-5.97e-04

Pos logit contributions
uv+0.742
ourning+0.707
ylan+0.680
 goose+0.624
iber+0.603
Neg logit contributions
leys-0.701
mingham-0.692
 VIDE-0.685
\/\/-0.655
arate-0.652
Token fl
Token ID781
Feature activation+3.18e-01

Pos logit contributions
uv+0.001
ourning+0.001
ylan+0.001
 goose+0.001
iber+0.001
Neg logit contributions
leys-0.001
mingham-0.001
 VIDE-0.001
\/\/-0.001
arate-0.001
Tokenunk
Token ID2954
Feature activation-1.67e+00

Pos logit contributions
leys+0.541
mingham+0.533
 VIDE+0.526
\/\/+0.503
arate+0.502
Neg logit contributions
uv-0.607
ourning-0.582
ylan-0.560
 goose-0.511
iber-0.494
Tokened
Token ID276
Feature activation-4.69e-01

Pos logit contributions
uv+2.292
ourning+2.126
ylan+2.043
 goose+1.814
iber+1.722
Neg logit contributions
mingham-3.667
leys-3.651
 VIDE-3.635
\/\/-3.493
arate-3.455
Token out
Token ID503
Feature activation-1.37e-01

Pos logit contributions
uv+0.789
ourning+0.744
ylan+0.718
 goose+0.652
iber+0.624
Neg logit contributions
leys-0.906
mingham-0.893
 VIDE-0.883
\/\/-0.849
arate-0.844
Token of
Token ID286
Feature activation+3.82e-02

Pos logit contributions
uv+0.235
ourning+0.224
ylan+0.215
 goose+0.194
iber+0.187
Neg logit contributions
leys-0.259
mingham-0.256
 VIDE-0.253
\/\/-0.243
arate-0.242
Token school
Token ID1524
Feature activation+4.44e-01

Pos logit contributions
leys+0.071
mingham+0.070
 VIDE+0.069
arate+0.066
\/\/+0.066
Neg logit contributions
uv-0.067
ourning-0.064
ylan-0.062
 goose-0.055
iber-0.054
Token or
Token ID393
Feature activation+1.93e-02

Pos logit contributions
leys+0.754
mingham+0.743
 VIDE+0.731
arate+0.704
\/\/+0.702
Neg logit contributions
uv-0.848
ourning-0.813
ylan-0.782
 goose-0.713
iber-0.688
Token now
Token ID783
Feature activation+1.50e-01

Pos logit contributions
leys+0.052
mingham+0.051
 VIDE+0.050
arate+0.048
\/\/+0.048
Neg logit contributions
uv-0.052
ourning-0.050
ylan-0.048
 goose-0.043
iber-0.042
Token spend
Token ID4341
Feature activation+2.84e-01

Pos logit contributions
leys+0.293
mingham+0.289
 VIDE+0.287
arate+0.277
\/\/+0.275
Neg logit contributions
uv-0.246
ourning-0.236
ylan-0.224
 goose-0.200
iber-0.192
Token several
Token ID1811
Feature activation-1.28e-01

Pos logit contributions
leys+0.554
mingham+0.547
 VIDE+0.539
arate+0.521
\/\/+0.520
Neg logit contributions
uv-0.468
ourning-0.448
ylan-0.429
 goose-0.379
iber-0.368
Token days
Token ID1528
Feature activation-3.83e-01

Pos logit contributions
uv+0.186
ourning+0.177
ylan+0.168
 goose+0.145
iber+0.140
Neg logit contributions
leys-0.277
mingham-0.275
 VIDE-0.271
arate-0.262
\/\/-0.262
Token rec
Token ID664
Feature activation+2.32e-01

Pos logit contributions
uv+0.653
ourning+0.626
ylan+0.597
 goose+0.534
iber+0.518
Neg logit contributions
leys-0.730
mingham-0.718
 VIDE-0.710
\/\/-0.686
arate-0.685
Tokenuper
Token ID48568
Feature activation-1.62e+00

Pos logit contributions
leys+0.419
mingham+0.415
 VIDE+0.407
arate+0.398
\/\/+0.394
Neg logit contributions
uv-0.408
ourning-0.391
ylan-0.378
 goose-0.347
 weeping-0.328
Tokenating
Token ID803
Feature activation+2.31e-01

Pos logit contributions
uv+2.203
ourning+2.092
ylan+1.952
 goose+1.714
iber+1.636
Neg logit contributions
leys-3.644
mingham-3.593
 VIDE-3.577
\/\/-3.447
arate-3.396
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
leys+0.414
mingham+0.407
 VIDE+0.400
\/\/+0.387
arate+0.386
Neg logit contributions
uv-0.421
ourning-0.403
ylan-0.387
 goose-0.350
iber-0.338
Token Friends
Token ID14213
Feature activation+5.59e-02

Pos logit contributions
leys+0.026
mingham+0.025
 VIDE+0.025
\/\/+0.024
arate+0.024
Neg logit contributions
uv-0.026
ourning-0.025
ylan-0.024
 goose-0.022
iber-0.021
Token are
Token ID389
Feature activation+6.86e-02

Pos logit contributions
leys+0.098
mingham+0.096
 VIDE+0.095
\/\/+0.091
arate+0.091
Neg logit contributions
uv-0.104
ourning-0.100
ylan-0.096
 goose-0.086
iber-0.084
Token praying
Token ID26002
Feature activation-4.36e-01

Pos logit contributions
leys+0.139
mingham+0.136
 VIDE+0.135
\/\/+0.129
arate+0.129
Neg logit contributions
uv-0.109
ourning-0.104
ylan-0.100
 goose-0.086
iber-0.085
Token memories
Token ID9846
Feature activation-1.65e-01

Pos logit contributions
leys+0.554
mingham+0.546
 VIDE+0.539
\/\/+0.518
arate+0.517
Neg logit contributions
uv-0.508
ourning-0.486
ylan-0.465
 goose-0.417
iber-0.404
Token...
Token ID986
Feature activation+1.16e-01

Pos logit contributions
uv+0.305
ourning+0.292
ylan+0.281
 goose+0.254
iber+0.246
Neg logit contributions
leys-0.293
mingham-0.288
 VIDE-0.285
\/\/-0.273
arate-0.273
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
leys+0.228
mingham+0.224
 VIDE+0.223
\/\/+0.215
arate+0.214
Neg logit contributions
uv-0.189
ourning-0.180
ylan-0.173
 goose-0.153
iber-0.148
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
leys+0.274
mingham+0.270
 VIDE+0.266
\/\/+0.259
arate+0.259
Neg logit contributions
uv-0.213
ourning-0.206
ylan-0.195
 goose-0.175
 weeping-0.168
TokenMy
Token ID3666
Feature activation-3.05e-01

Pos logit contributions
leys+0.184
mingham+0.180
 VIDE+0.178
\/\/+0.170
arate+0.170
Neg logit contributions
uv-0.239
ourning-0.231
ylan-0.223
 goose-0.205
iber-0.198
Tokenster
Token ID1706
Feature activation-1.57e+00

Pos logit contributions
uv+0.516
ourning+0.493
ylan+0.472
 goose+0.423
iber+0.408
Neg logit contributions
leys-0.587
mingham-0.578
 VIDE-0.572
\/\/-0.550
arate-0.550
Tokenies
Token ID444
Feature activation-3.60e-01

Pos logit contributions
uv+3.133
ourning+2.984
ylan+2.885
 goose+2.666
iber+2.600
Neg logit contributions
 VIDE-2.492
leys-2.489
mingham-2.479
arate-2.349
\/\/-2.336
Token of
Token ID286
Feature activation-1.84e-01

Pos logit contributions
uv+0.710
ourning+0.683
ylan+0.659
 goose+0.601
iber+0.583
Neg logit contributions
leys-0.588
mingham-0.578
 VIDE-0.571
\/\/-0.546
arate-0.545
Token a
Token ID257
Feature activation-1.94e-01

Pos logit contributions
uv+0.300
ourning+0.285
ylan+0.273
 goose+0.244
iber+0.235
Neg logit contributions
leys-0.366
mingham-0.361
 VIDE-0.357
\/\/-0.344
arate-0.343
Token human
Token ID1692
Feature activation+4.40e-02

Pos logit contributions
uv+0.309
ourning+0.294
ylan+0.281
 goose+0.249
iber+0.241
Neg logit contributions
leys-0.392
mingham-0.386
 VIDE-0.383
\/\/-0.368
arate-0.368
Token turned
Token ID2900
Feature activation-8.49e-02

Pos logit contributions
leys+0.079
mingham+0.078
 VIDE+0.077
\/\/+0.074
arate+0.074
Neg logit contributions
uv-0.079
ourning-0.076
ylan-0.073
 goose-0.066
iber-0.064
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
uv+0.141
ourning+0.135
ylan+0.130
 goose+0.119
iber+0.115
Neg logit contributions
leys-0.123
mingham-0.121
 VIDE-0.119
\/\/-0.114
arate-0.114
Tokenyoutube
Token ID11604
Feature activation-9.66e-02

Pos logit contributions
uv+0.849
ourning+0.818
ylan+0.790
 goose+0.728
iber+0.706
Neg logit contributions
leys-0.614
mingham-0.603
 VIDE-0.593
\/\/-0.565
arate-0.565
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
uv+0.187
ourning+0.179
ylan+0.173
 goose+0.157
iber+0.153
Neg logit contributions
leys-0.162
mingham-0.160
 VIDE-0.158
\/\/-0.151
arate-0.150
Tokencom
Token ID785
Feature activation-2.00e-01

Pos logit contributions
uv+0.392
ourning+0.374
ylan+0.359
 goose+0.323
iber+0.311
Neg logit contributions
leys-0.440
mingham-0.434
 VIDE-0.429
\/\/-0.413
arate-0.412
Token/
Token ID14
Feature activation-5.99e-02

Pos logit contributions
uv+0.424
ourning+0.405
ylan+0.393
 goose+0.362
iber+0.352
Neg logit contributions
leys-0.301
mingham-0.296
 VIDE-0.291
arate-0.277
\/\/-0.276
Tokenchannel
Token ID17620
Feature activation-1.56e+00

Pos logit contributions
uv+0.085
ourning+0.080
ylan+0.076
 goose+0.066
iber+0.063
Neg logit contributions
leys-0.132
mingham-0.130
 VIDE-0.129
\/\/-0.125
arate-0.125
Token/
Token ID14
Feature activation-6.63e-02

Pos logit contributions
uv+3.108
ourning+2.931
ylan+2.847
 goose+2.630
gal+2.541
Neg logit contributions
leys-2.521
mingham-2.498
 VIDE-2.446
arate-2.326
\/\/-2.260
TokenUC
Token ID9598
Feature activation-1.40e-01

Pos logit contributions
uv+0.106
ourning+0.100
ylan+0.096
 goose+0.085
iber+0.082
Neg logit contributions
leys-0.134
mingham-0.132
 VIDE-0.131
\/\/-0.126
arate-0.126
Token_
Token ID62
Feature activation+7.05e-01

Pos logit contributions
uv+0.267
ourning+0.256
ylan+0.246
 goose+0.225
iber+0.217
Neg logit contributions
leys-0.238
mingham-0.235
 VIDE-0.232
\/\/-0.222
arate-0.222
TokenK
Token ID42
Feature activation+3.35e-01

Pos logit contributions
leys+1.174
mingham+1.149
 VIDE+1.129
\/\/+1.089
arate+1.089
Neg logit contributions
uv-1.355
ourning-1.320
ylan-1.263
 goose-1.162
 weeping-1.122
Tokenk
Token ID74
Feature activation+3.53e-01

Pos logit contributions
leys+0.556
mingham+0.546
 VIDE+0.537
arate+0.516
\/\/+0.516
Neg logit contributions
uv-0.650
ourning-0.629
ylan-0.603
 goose-0.554
 weeping-0.534
Token was
Token ID373
Feature activation-2.06e-01

Pos logit contributions
uv+0.370
ourning+0.354
ylan+0.342
 goose+0.307
iber+0.297
Neg logit contributions
leys-0.359
mingham-0.355
 VIDE-0.350
\/\/-0.336
arate-0.336
Token important
Token ID1593
Feature activation+9.65e-02

Pos logit contributions
uv+0.351
ourning+0.336
ylan+0.324
 goose+0.289
iber+0.278
Neg logit contributions
leys-0.392
mingham-0.388
 VIDE-0.382
\/\/-0.368
arate-0.368
Token for
Token ID329
Feature activation-1.00e-01

Pos logit contributions
leys+0.174
mingham+0.172
 VIDE+0.170
arate+0.164
\/\/+0.163
Neg logit contributions
uv-0.173
ourning-0.166
ylan-0.160
 goose-0.144
iber-0.139
Token their
Token ID511
Feature activation-8.49e-03

Pos logit contributions
uv+0.177
ourning+0.170
ylan+0.163
 goose+0.145
iber+0.142
Neg logit contributions
leys-0.184
mingham-0.182
 VIDE-0.180
\/\/-0.173
arate-0.173
Token rec
Token ID664
Feature activation+5.30e-02

Pos logit contributions
uv+0.015
ourning+0.014
ylan+0.014
 goose+0.012
iber+0.012
Neg logit contributions
leys-0.016
mingham-0.015
 VIDE-0.015
\/\/-0.015
arate-0.015
Tokenuper
Token ID48568
Feature activation-1.52e+00

Pos logit contributions
leys+0.098
mingham+0.097
 VIDE+0.095
\/\/+0.093
arate+0.092
Neg logit contributions
uv-0.092
ourning-0.088
ylan-0.085
 goose-0.077
 weeping-0.074
Tokenation
Token ID341
Feature activation+3.29e-01

Pos logit contributions
uv+2.262
ourning+2.129
ylan+2.015
 goose+1.792
iber+1.730
Neg logit contributions
leys-3.214
 VIDE-3.166
mingham-3.153
\/\/-3.054
arate-2.995
Token.
Token ID13
Feature activation-8.44e-02

Pos logit contributions
leys+0.551
mingham+0.545
 VIDE+0.534
\/\/+0.515
arate+0.513
Neg logit contributions
uv-0.632
ourning-0.609
ylan-0.586
 goose-0.529
iber-0.511
Token
Token ID198
Feature activation-1.88e-01

Pos logit contributions
uv+0.138
ourning+0.131
ylan+0.126
 goose+0.112
iber+0.108
Neg logit contributions
leys-0.167
mingham-0.164
 VIDE-0.163
\/\/-0.157
arate-0.157
Token
Token ID198
Feature activation-1.21e-01

Pos logit contributions
uv+0.290
ourning+0.276
ylan+0.263
 goose+0.234
iber+0.224
Neg logit contributions
leys-0.389
mingham-0.384
 VIDE-0.379
\/\/-0.368
arate-0.367
TokenD
Token ID35
Feature activation-3.86e-01

Pos logit contributions
uv+0.246
ourning+0.237
ylan+0.229
 goose+0.210
iber+0.203
Neg logit contributions
leys-0.190
mingham-0.187
 VIDE-0.184
\/\/-0.176
arate-0.176
Token home
Token ID1363
Feature activation-1.60e-01

Pos logit contributions
leys+0.697
mingham+0.683
 VIDE+0.682
arate+0.655
\/\/+0.652
Neg logit contributions
uv-0.595
ourning-0.573
ylan-0.543
 goose-0.483
iber-0.468
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
uv+0.287
ourning+0.276
ylan+0.263
 goose+0.238
iber+0.230
Neg logit contributions
leys-0.291
mingham-0.286
 VIDE-0.283
arate-0.272
\/\/-0.271
Token
Token ID198
Feature activation+3.38e-02

Pos logit contributions
uv+0.244
ourning+0.234
ylan+0.225
 goose+0.204
iber+0.196
Neg logit contributions
leys-0.251
mingham-0.246
 VIDE-0.244
arate-0.234
\/\/-0.233
Token
Token ID198
Feature activation+9.95e-02

Pos logit contributions
leys+0.069
mingham+0.068
 VIDE+0.067
arate+0.065
\/\/+0.065
Neg logit contributions
uv-0.052
ourning-0.050
ylan-0.048
 goose-0.042
 weeping-0.041
TokenL
Token ID43
Feature activation-1.08e-01

Pos logit contributions
leys+0.147
mingham+0.144
 VIDE+0.142
\/\/+0.135
arate+0.135
Neg logit contributions
uv-0.212
ourning-0.205
ylan-0.198
 goose-0.183
iber-0.177
Tokenav
Token ID615
Feature activation-1.50e+00

Pos logit contributions
uv+0.167
ourning+0.159
ylan+0.151
 goose+0.135
iber+0.129
Neg logit contributions
leys-0.224
mingham-0.222
 VIDE-0.219
arate-0.211
\/\/-0.211
Tokenau
Token ID559
Feature activation-5.37e-01

Pos logit contributions
uv+2.204
ourning+2.043
ylan+1.914
 goose+1.696
iber+1.660
Neg logit contributions
 VIDE-3.225
leys-3.214
mingham-3.185
\/\/-3.055
arate-3.021
Token's
Token ID338
Feature activation-1.02e-01

Pos logit contributions
uv+0.878
ourning+0.837
ylan+0.799
 goose+0.714
iber+0.686
Neg logit contributions
leys-1.061
mingham-1.046
 VIDE-1.035
\/\/-0.996
arate-0.996
Token family
Token ID1641
Feature activation-3.89e-01

Pos logit contributions
uv+0.185
ourning+0.177
ylan+0.170
 goose+0.153
iber+0.148
Neg logit contributions
leys-0.185
mingham-0.183
 VIDE-0.181
\/\/-0.173
arate-0.173
Token began
Token ID2540
Feature activation+3.12e-02

Pos logit contributions
uv+0.654
ourning+0.626
ylan+0.599
 goose+0.533
iber+0.516
Neg logit contributions
leys-0.750
mingham-0.739
 VIDE-0.732
arate-0.704
\/\/-0.701
Token searching
Token ID10342
Feature activation-4.84e-01

Pos logit contributions
leys+0.065
mingham+0.064
 VIDE+0.064
arate+0.061
\/\/+0.061
Neg logit contributions
uv-0.047
ourning-0.044
ylan-0.043
 goose-0.037
iber-0.036
Token his
Token ID465
Feature activation-1.87e-02

Pos logit contributions
leys+0.006
mingham+0.005
 VIDE+0.005
\/\/+0.005
arate+0.005
Neg logit contributions
uv-0.004
ourning-0.004
ylan-0.004
 goose-0.003
iber-0.003
Token judicial
Token ID13657
Feature activation+5.40e-02

Pos logit contributions
uv+0.033
ourning+0.032
ylan+0.030
 goose+0.027
iber+0.026
Neg logit contributions
leys-0.034
mingham-0.034
 VIDE-0.034
\/\/-0.032
arate-0.032
Token robe
Token ID33192
Feature activation-3.90e-01

Pos logit contributions
leys+0.099
mingham+0.097
 VIDE+0.095
\/\/+0.092
arate+0.092
Neg logit contributions
uv-0.096
ourning-0.092
ylan-0.089
 goose-0.079
iber-0.077
Token and
Token ID290
Feature activation-5.87e-02

Pos logit contributions
uv+0.687
ourning+0.657
ylan+0.629
 goose+0.568
iber+0.548
Neg logit contributions
leys-0.721
mingham-0.711
 VIDE-0.702
\/\/-0.675
arate-0.674
Token pl
Token ID458
Feature activation+2.96e-03

Pos logit contributions
uv+0.091
ourning+0.086
ylan+0.082
 goose+0.072
iber+0.070
Neg logit contributions
leys-0.121
mingham-0.120
 VIDE-0.119
\/\/-0.114
arate-0.114
Tokenunk
Token ID2954
Feature activation-1.47e+00

Pos logit contributions
leys+0.005
mingham+0.005
 VIDE+0.005
arate+0.005
\/\/+0.005
Neg logit contributions
uv-0.005
ourning-0.005
ylan-0.005
 goose-0.005
 pu-0.005
Tokening
Token ID278
Feature activation-5.70e-01

Pos logit contributions
uv+2.317
ourning+2.203
ylan+2.100
 goose+1.882
iber+1.799
Neg logit contributions
leys-2.984
mingham-2.971
 VIDE-2.932
\/\/-2.831
arate-2.807
Token himself
Token ID2241
Feature activation-1.01e-01

Pos logit contributions
uv+0.833
ourning+0.787
ylan+0.749
 goose+0.663
iber+0.631
Neg logit contributions
leys-1.225
mingham-1.212
 VIDE-1.199
\/\/-1.159
arate-1.158
Token down
Token ID866
Feature activation-6.02e-01

Pos logit contributions
uv+0.168
ourning+0.161
ylan+0.154
 goose+0.135
iber+0.132
Neg logit contributions
leys-0.196
mingham-0.192
 VIDE-0.189
arate-0.183
\/\/-0.183
Token in
Token ID287
Feature activation-5.29e-02

Pos logit contributions
uv+1.021
ourning+0.974
ylan+0.933
 goose+0.839
iber+0.807
Neg logit contributions
leys-1.152
mingham-1.137
 VIDE-1.124
\/\/-1.081
arate-1.080
Token Anton
Token ID9261
Feature activation+1.41e-01

Pos logit contributions
uv+0.098
ourning+0.094
ylan+0.090
 goose+0.082
iber+0.079
Neg logit contributions
leys-0.093
mingham-0.092
 VIDE-0.090
arate-0.087
\/\/-0.086
Tokenob
Token ID672
Feature activation+2.35e-01

Pos logit contributions
uv+1.395
ourning+1.336
ylan+1.292
 goose+1.183
iber+1.154
Neg logit contributions
leys-1.062
mingham-1.045
 VIDE-1.037
\/\/-0.984
arate-0.980
Tokensh
Token ID1477
Feature activation-1.65e-01

Pos logit contributions
leys+0.335
mingham+0.328
 VIDE+0.318
\/\/+0.307
arate+0.305
Neg logit contributions
uv-0.510
ourning-0.496
ylan-0.479
 goose-0.444
iber-0.430
Tokenav
Token ID615
Feature activation-3.51e-01

Pos logit contributions
uv+0.255
ourning+0.244
ylan+0.233
 goose+0.208
 weeping+0.198
Neg logit contributions
leys-0.339
mingham-0.334
 VIDE-0.330
arate-0.320
\/\/-0.319
Tokenn
Token ID77
Feature activation-1.34e-01

Pos logit contributions
uv+0.565
ourning+0.539
ylan+0.515
 goose+0.460
iber+0.442
Neg logit contributions
leys-0.701
mingham-0.692
 VIDE-0.683
\/\/-0.660
arate-0.659
Token rec
Token ID664
Feature activation+9.96e-02

Pos logit contributions
uv+0.222
ourning+0.212
ylan+0.203
 goose+0.182
iber+0.175
Neg logit contributions
leys-0.260
mingham-0.257
 VIDE-0.254
\/\/-0.245
arate-0.244
Tokeneded
Token ID15395
Feature activation-1.47e+00

Pos logit contributions
leys+0.239
mingham+0.236
 VIDE+0.233
arate+0.227
\/\/+0.227
Neg logit contributions
uv-0.120
ourning-0.113
ylan-0.107
 goose-0.091
 weeping-0.086
Token more
Token ID517
Feature activation-4.59e-02

Pos logit contributions
uv+2.474
ourning+2.305
ylan+2.222
 goose+2.047
 weeping+1.955
Neg logit contributions
leys-2.814
mingham-2.801
 VIDE-2.775
arate-2.647
\/\/-2.632
Token than
Token ID621
Feature activation+4.12e-02

Pos logit contributions
uv+0.093
ourning+0.089
ylan+0.086
 goose+0.079
iber+0.077
Neg logit contributions
leys-0.073
mingham-0.072
 VIDE-0.071
\/\/-0.068
arate-0.067
Token 40
Token ID2319
Feature activation+1.32e-02

Pos logit contributions
leys+0.077
mingham+0.076
 VIDE+0.075
\/\/+0.072
arate+0.072
Neg logit contributions
uv-0.072
ourning-0.069
ylan-0.066
 goose-0.059
iber-0.057
Token kilometers
Token ID18212
Feature activation-1.14e-01

Pos logit contributions
leys+0.026
mingham+0.025
 VIDE+0.025
arate+0.024
\/\/+0.024
Neg logit contributions
uv-0.022
ourning-0.021
ylan-0.020
 goose-0.018
iber-0.017
Token (
Token ID357
Feature activation+1.07e-02

Pos logit contributions
uv+0.207
ourning+0.199
ylan+0.191
 goose+0.172
iber+0.167
Neg logit contributions
leys-0.203
mingham-0.199
 VIDE-0.197
arate-0.189
\/\/-0.189
Token10
Token ID940
Feature activation-1.92e-01

Pos logit contributions
uv+0.381
ourning+0.365
ylan+0.348
 goose+0.315
iber+0.303
Neg logit contributions
leys-0.426
mingham-0.419
 VIDE-0.415
\/\/-0.401
arate-0.400
Token:
Token ID25
Feature activation-1.13e-01

Pos logit contributions
uv+0.314
ourning+0.300
ylan+0.286
 goose+0.256
iber+0.246
Neg logit contributions
leys-0.379
mingham-0.374
 VIDE-0.370
\/\/-0.357
arate-0.357
Token47
Token ID2857
Feature activation-7.37e-01

Pos logit contributions
uv+0.172
ourning+0.163
ylan+0.155
 goose+0.138
iber+0.132
Neg logit contributions
leys-0.235
mingham-0.233
 VIDE-0.230
\/\/-0.222
arate-0.222
Token]
Token ID60
Feature activation-5.21e-01

Pos logit contributions
uv+1.102
ourning+1.037
ylan+0.989
 goose+0.876
iber+0.837
Neg logit contributions
leys-1.564
mingham-1.545
 VIDE-1.529
\/\/-1.473
arate-1.471
Token <
Token ID1279
Feature activation-7.24e-01

Pos logit contributions
uv+1.071
ourning+1.031
ylan+0.997
 goose+0.922
iber+0.894
Neg logit contributions
leys-0.805
mingham-0.787
 VIDE-0.763
arate-0.744
\/\/-0.738
Tokenabal
Token ID44349
Feature activation-1.46e+00

Pos logit contributions
uv+1.293
ourning+1.231
ylan+1.184
 goose+1.069
iber+1.035
Neg logit contributions
leys-1.323
mingham-1.305
 VIDE-1.294
\/\/-1.237
arate-1.235
Tokenicious
Token ID6243
Feature activation-6.96e-01

Pos logit contributions
uv+2.640
ourning+2.501
ylan+2.415
 goose+2.174
iber+2.123
Neg logit contributions
leys-2.608
 VIDE-2.576
mingham-2.575
\/\/-2.455
arate-2.448
Tokens
Token ID82
Feature activation-1.45e-01

Pos logit contributions
uv+1.272
ourning+1.211
ylan+1.169
 goose+1.056
iber+1.026
Neg logit contributions
leys-1.239
mingham-1.225
 VIDE-1.219
\/\/-1.163
arate-1.161
Token>
Token ID29
Feature activation-1.94e-01

Pos logit contributions
uv+0.267
ourning+0.255
ylan+0.245
 goose+0.223
iber+0.215
Neg logit contributions
leys-0.256
mingham-0.252
 VIDE-0.249
\/\/-0.239
arate-0.239
Token lol
Token ID19462
Feature activation-3.08e-01

Pos logit contributions
uv+0.351
ourning+0.335
ylan+0.322
 goose+0.292
iber+0.282
Neg logit contributions
leys-0.351
mingham-0.346
 VIDE-0.341
\/\/-0.328
arate-0.327
Token m
Token ID285
Feature activation-3.00e-01

Pos logit contributions
uv+0.583
ourning+0.558
ylan+0.537
 goose+0.490
iber+0.473
Neg logit contributions
leys-0.528
mingham-0.521
 VIDE-0.514
\/\/-0.492
arate-0.491
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
leys+0.519
mingham+0.508
 VIDE+0.503
arate+0.489
\/\/+0.486
Neg logit contributions
uv-0.500
ourning-0.481
ylan-0.460
 goose-0.411
iber-0.398
Token
Token ID198
Feature activation+3.17e-02

Pos logit contributions
uv+0.055
ourning+0.052
ylan+0.050
 goose+0.045
iber+0.043
Neg logit contributions
leys-0.064
mingham-0.063
 VIDE-0.062
\/\/-0.060
arate-0.060
Token
Token ID198
Feature activation+6.93e-02

Pos logit contributions
leys+0.065
mingham+0.064
 VIDE+0.063
\/\/+0.061
arate+0.061
Neg logit contributions
uv-0.049
ourning-0.047
ylan-0.045
 goose-0.040
 weeping-0.038
TokenThe
Token ID464
Feature activation-2.67e-01

Pos logit contributions
leys+0.107
mingham+0.105
 VIDE+0.103
\/\/+0.099
arate+0.099
Neg logit contributions
uv-0.142
ourning-0.138
ylan-0.133
 goose-0.122
 weeping-0.119
Token new
Token ID649
Feature activation-2.97e-01

Pos logit contributions
uv+0.485
ourning+0.465
ylan+0.446
 goose+0.402
iber+0.390
Neg logit contributions
leys-0.480
mingham-0.471
 VIDE-0.466
\/\/-0.447
arate-0.446
Token species
Token ID4693
Feature activation-1.44e+00

Pos logit contributions
uv+0.552
ourning+0.530
ylan+0.509
 goose+0.462
iber+0.447
Neg logit contributions
leys-0.520
mingham-0.512
 VIDE-0.506
\/\/-0.485
arate-0.484
Token is
Token ID318
Feature activation-3.58e-02

Pos logit contributions
uv+2.460
ourning+2.322
ylan+2.242
 goose+2.046
iber+1.940
Neg logit contributions
leys-2.724
mingham-2.706
 VIDE-2.673
arate-2.561
\/\/-2.560
Token named
Token ID3706
Feature activation-1.42e-01

Pos logit contributions
uv+0.059
ourning+0.056
ylan+0.054
 goose+0.048
iber+0.047
Neg logit contributions
leys-0.070
mingham-0.069
 VIDE-0.069
\/\/-0.066
arate-0.066
Token the
Token ID262
Feature activation-6.62e-02

Pos logit contributions
uv+0.245
ourning+0.233
ylan+0.224
 goose+0.202
iber+0.194
Neg logit contributions
leys-0.268
mingham-0.265
 VIDE-0.261
\/\/-0.251
arate-0.251
Token Wil
Token ID5187
Feature activation+4.87e-01

Pos logit contributions
uv+0.112
ourning+0.107
ylan+0.102
 goose+0.092
iber+0.088
Neg logit contributions
leys-0.127
mingham-0.125
 VIDE-0.124
\/\/-0.119
arate-0.119
Tokenkins
Token ID5331
Feature activation+7.85e-02

Pos logit contributions
leys+0.964
mingham+0.947
 VIDE+0.922
arate+0.898
\/\/+0.894
Neg logit contributions
uv-0.798
ourning-0.766
ylan-0.731
 goose-0.657
 weeping-0.631
Token joke
Token ID9707
Feature activation+1.51e-01

Pos logit contributions
leys+0.243
mingham+0.240
 VIDE+0.236
\/\/+0.227
arate+0.226
Neg logit contributions
uv-0.246
ourning-0.235
ylan-0.227
 goose-0.204
iber-0.198
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
leys+0.252
mingham+0.249
 VIDE+0.245
\/\/+0.235
arate+0.234
Neg logit contributions
uv-0.292
ourning-0.280
ylan-0.270
 goose-0.247
iber-0.239
Token and
Token ID290
Feature activation-3.33e-02

Pos logit contributions
leys+0.043
mingham+0.042
 VIDE+0.041
\/\/+0.040
arate+0.040
Neg logit contributions
uv-0.044
ourning-0.042
ylan-0.041
 goose-0.037
iber-0.036
Token the
Token ID262
Feature activation-7.70e-02

Pos logit contributions
uv+0.058
ourning+0.055
ylan+0.053
 goose+0.048
iber+0.046
Neg logit contributions
leys-0.063
mingham-0.062
 VIDE-0.061
\/\/-0.059
arate-0.059
Token steep
Token ID14559
Feature activation-3.05e-01

Pos logit contributions
uv+0.138
ourning+0.133
ylan+0.127
 goose+0.115
iber+0.111
Neg logit contributions
leys-0.140
mingham-0.137
 VIDE-0.136
\/\/-0.130
arate-0.130
Token incl
Token ID13358
Feature activation-1.44e+00

Pos logit contributions
uv+0.561
ourning+0.537
ylan+0.516
 goose+0.469
iber+0.453
Neg logit contributions
leys-0.539
mingham-0.531
 VIDE-0.525
\/\/-0.503
arate-0.502
Tokenines
Token ID1127
Feature activation-1.10e+00

Pos logit contributions
uv+2.746
ourning+2.583
ylan+2.503
 goose+2.268
iber+2.242
Neg logit contributions
mingham-2.452
leys-2.435
 VIDE-2.424
\/\/-2.332
arate-2.297
Token can
Token ID460
Feature activation-1.61e-01

Pos logit contributions
uv+1.899
ourning+1.772
ylan+1.723
 goose+1.603
iber+1.512
Neg logit contributions
mingham-2.047
leys-2.045
 VIDE-2.022
arate-1.929
\/\/-1.928
Token be
Token ID307
Feature activation-1.03e-01

Pos logit contributions
uv+0.269
ourning+0.254
ylan+0.245
 goose+0.223
iber+0.212
Neg logit contributions
leys-0.312
mingham-0.307
 VIDE-0.304
\/\/-0.294
arate-0.292
Token treacherous
Token ID45365
Feature activation-6.88e-01

Pos logit contributions
uv+0.186
ourning+0.177
ylan+0.170
 goose+0.155
iber+0.149
Neg logit contributions
leys-0.186
mingham-0.184
 VIDE-0.181
\/\/-0.174
arate-0.174
Token with
Token ID351
Feature activation-8.36e-02

Pos logit contributions
uv+1.262
ourning+1.202
ylan+1.160
 goose+1.058
iber+1.019
Neg logit contributions
leys-1.220
mingham-1.207
 VIDE-1.193
arate-1.141
\/\/-1.140
Token is
Token ID318
Feature activation-1.91e-01

Pos logit contributions
uv+0.181
ourning+0.173
ylan+0.166
 goose+0.151
iber+0.146
Neg logit contributions
leys-0.175
mingham-0.172
 VIDE-0.170
\/\/-0.163
arate-0.163
Token another
Token ID1194
Feature activation-4.63e-03

Pos logit contributions
uv+0.322
ourning+0.308
ylan+0.295
 goose+0.264
iber+0.253
Neg logit contributions
leys-0.370
mingham-0.365
 VIDE-0.361
\/\/-0.346
arate-0.346
Token leak
Token ID13044
Feature activation-8.82e-02

Pos logit contributions
uv+0.009
ourning+0.008
ylan+0.008
 goose+0.007
iber+0.007
Neg logit contributions
leys-0.008
mingham-0.008
 VIDE-0.008
\/\/-0.008
arate-0.008
Token from
Token ID422
Feature activation-8.40e-02

Pos logit contributions
uv+0.144
ourning+0.138
ylan+0.132
 goose+0.117
iber+0.112
Neg logit contributions
leys-0.174
mingham-0.172
 VIDE-0.171
arate-0.164
\/\/-0.164
Token 4
Token ID604
Feature activation-3.61e-01

Pos logit contributions
uv+0.153
ourning+0.146
ylan+0.140
 goose+0.127
iber+0.123
Neg logit contributions
leys-0.151
mingham-0.149
 VIDE-0.147
\/\/-0.141
arate-0.141
Tokenchan
Token ID3147
Feature activation-1.44e+00

Pos logit contributions
uv+0.498
ourning+0.471
ylan+0.446
 goose+0.391
iber+0.370
Neg logit contributions
leys-0.806
mingham-0.796
 VIDE-0.788
\/\/-0.763
arate-0.762
Token,
Token ID11
Feature activation-6.31e-02

Pos logit contributions
uv+2.403
ourning+2.290
ylan+2.179
 goose+1.961
iber+1.918
Neg logit contributions
leys-2.789
mingham-2.743
 VIDE-2.729
\/\/-2.615
arate-2.614
Token which
Token ID543
Feature activation+9.15e-02

Pos logit contributions
uv+0.103
ourning+0.098
ylan+0.094
 goose+0.084
iber+0.080
Neg logit contributions
leys-0.125
mingham-0.123
 VIDE-0.122
\/\/-0.117
arate-0.117
Token has
Token ID468
Feature activation+8.93e-02

Pos logit contributions
leys+0.179
mingham+0.176
 VIDE+0.174
\/\/+0.168
arate+0.167
Neg logit contributions
uv-0.152
ourning-0.145
ylan-0.138
 goose-0.124
iber-0.119
Token been
Token ID587
Feature activation-1.56e-01

Pos logit contributions
leys+0.181
mingham+0.179
 VIDE+0.177
\/\/+0.170
arate+0.170
Neg logit contributions
uv-0.142
ourning-0.135
ylan-0.129
 goose-0.113
iber-0.109
Token on
Token ID319
Feature activation-1.24e-01

Pos logit contributions
uv+0.228
ourning+0.218
ylan+0.206
 goose+0.179
iber+0.172
Neg logit contributions
leys-0.334
mingham-0.329
 VIDE-0.326
\/\/-0.315
arate-0.313
Token.
Token ID13
Feature activation+5.34e-02

Pos logit contributions
uv+0.238
ourning+0.229
ylan+0.221
 goose+0.200
iber+0.194
Neg logit contributions
leys-0.210
mingham-0.207
 VIDE-0.204
\/\/-0.196
arate-0.195
Token Not
Token ID1892
Feature activation+1.73e-01

Pos logit contributions
leys+0.103
mingham+0.101
 VIDE+0.100
arate+0.097
\/\/+0.096
Neg logit contributions
uv-0.090
ourning-0.085
ylan-0.082
 goose-0.074
iber-0.071
Tokeneworthy
Token ID25969
Feature activation+3.85e-01

Pos logit contributions
leys+0.308
mingham+0.303
 VIDE+0.298
arate+0.287
\/\/+0.287
Neg logit contributions
uv-0.315
ourning-0.302
ylan-0.290
 goose-0.262
iber-0.254
Token:
Token ID25
Feature activation-1.66e-02

Pos logit contributions
leys+0.719
mingham+0.708
 VIDE+0.696
\/\/+0.672
arate+0.671
Neg logit contributions
uv-0.671
ourning-0.644
ylan-0.617
 goose-0.552
iber-0.535
Token Under
Token ID4698
Feature activation+1.19e-02

Pos logit contributions
uv+0.029
ourning+0.027
ylan+0.026
 goose+0.023
iber+0.023
Neg logit contributions
leys-0.031
mingham-0.031
 VIDE-0.031
\/\/-0.029
arate-0.029
Tokenpass
Token ID6603
Feature activation-1.44e+00

Pos logit contributions
leys+0.021
mingham+0.021
 VIDE+0.021
arate+0.020
\/\/+0.020
Neg logit contributions
uv-0.021
ourning-0.021
ylan-0.020
 goose-0.018
iber-0.017
Token Park
Token ID3250
Feature activation-4.24e-01

Pos logit contributions
uv+2.614
ourning+2.482
ylan+2.402
 goose+2.186
 weeping+2.103
Neg logit contributions
leys-2.565
mingham-2.544
 VIDE-2.517
\/\/-2.408
arate-2.408
Token (
Token ID357
Feature activation+1.02e-01

Pos logit contributions
uv+0.756
ourning+0.724
ylan+0.694
 goose+0.627
iber+0.606
Neg logit contributions
leys-0.775
mingham-0.763
 VIDE-0.754
\/\/-0.724
arate-0.723
Tokenpictured
Token ID28852
Feature activation-3.14e-01

Pos logit contributions
leys+0.175
mingham+0.172
 VIDE+0.169
\/\/+0.164
arate+0.163
Neg logit contributions
uv-0.191
ourning-0.185
ylan-0.178
 goose-0.162
iber-0.156
Token);
Token ID1776
Feature activation+3.48e-01

Pos logit contributions
uv+0.543
ourning+0.518
ylan+0.497
 goose+0.448
iber+0.432
Neg logit contributions
leys-0.589
mingham-0.581
 VIDE-0.575
\/\/-0.552
arate-0.552
Token the
Token ID262
Feature activation-6.92e-02

Pos logit contributions
leys+0.671
mingham+0.660
 VIDE+0.653
\/\/+0.630
arate+0.628
Neg logit contributions
uv-0.581
ourning-0.560
ylan-0.535
 goose-0.475
iber-0.462
Token Territory
Token ID25219
Feature activation+4.66e-01

Pos logit contributions
uv+0.806
ourning+0.762
ylan+0.745
 goose+0.685
iber+0.657
Neg logit contributions
leys-0.710
mingham-0.700
 VIDE-0.692
\/\/-0.665
arate-0.659
Token
Token ID784
Feature activation-1.60e-01

Pos logit contributions
leys+0.838
mingham+0.824
 VIDE+0.813
arate+0.785
\/\/+0.779
Neg logit contributions
uv-0.839
ourning-0.808
ylan-0.775
 goose-0.699
iber-0.675
Token are
Token ID389
Feature activation-1.74e-01

Pos logit contributions
uv+0.276
ourning+0.265
ylan+0.254
 goose+0.227
iber+0.220
Neg logit contributions
leys-0.299
mingham-0.294
 VIDE-0.291
arate-0.281
\/\/-0.279
Token genetically
Token ID20807
Feature activation+1.31e-01

Pos logit contributions
uv+0.294
ourning+0.283
ylan+0.270
 goose+0.242
iber+0.234
Neg logit contributions
leys-0.333
mingham-0.329
 VIDE-0.325
\/\/-0.313
arate-0.312
Token distinct
Token ID7310
Feature activation-2.30e-01

Pos logit contributions
leys+0.247
mingham+0.244
 VIDE+0.241
\/\/+0.232
arate+0.232
Neg logit contributions
uv-0.223
ourning-0.214
ylan-0.205
 goose-0.183
iber-0.177
Token species
Token ID4693
Feature activation-1.43e+00

Pos logit contributions
uv+0.427
ourning+0.409
ylan+0.393
 goose+0.358
iber+0.346
Neg logit contributions
leys-0.404
mingham-0.398
 VIDE-0.393
\/\/-0.376
arate-0.376
Token.
Token ID13
Feature activation-4.99e-02

Pos logit contributions
uv+2.602
ourning+2.479
ylan+2.381
 goose+2.178
iber+2.102
Neg logit contributions
leys-2.548
mingham-2.523
 VIDE-2.492
\/\/-2.379
arate-2.374
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
uv+0.082
ourning+0.078
ylan+0.075
 goose+0.067
iber+0.065
Neg logit contributions
leys-0.098
mingham-0.097
 VIDE-0.096
\/\/-0.092
arate-0.092
Token
Token ID198
Feature activation-1.34e-01

Pos logit contributions
uv+0.365
ourning+0.349
ylan+0.332
 goose+0.295
iber+0.282
Neg logit contributions
leys-0.490
mingham-0.484
 VIDE-0.478
\/\/-0.463
arate-0.463
TokenThe
Token ID464
Feature activation-2.63e-01

Pos logit contributions
uv+0.281
ourning+0.270
ylan+0.261
 goose+0.240
iber+0.233
Neg logit contributions
leys-0.205
mingham-0.201
 VIDE-0.198
\/\/-0.189
arate-0.189
Token scientists
Token ID5519
Feature activation-2.65e-01

Pos logit contributions
uv+0.474
ourning+0.454
ylan+0.435
 goose+0.393
iber+0.381
Neg logit contributions
leys-0.477
mingham-0.469
 VIDE-0.463
\/\/-0.445
arate-0.444
Token be
Token ID307
Feature activation-1.90e-02

Pos logit contributions
leys+0.201
mingham+0.198
 VIDE+0.196
arate+0.190
\/\/+0.189
Neg logit contributions
uv-0.182
ourning-0.177
ylan-0.168
 goose-0.150
iber-0.145
Token seen
Token ID1775
Feature activation-1.60e-01

Pos logit contributions
uv+0.030
ourning+0.029
ylan+0.028
 goose+0.024
iber+0.024
Neg logit contributions
leys-0.038
mingham-0.037
 VIDE-0.037
arate-0.036
\/\/-0.035
Token whether
Token ID1771
Feature activation-4.34e-01

Pos logit contributions
uv+0.250
ourning+0.239
ylan+0.227
 goose+0.201
iber+0.194
Neg logit contributions
leys-0.328
mingham-0.323
 VIDE-0.320
arate-0.309
\/\/-0.308
Token the
Token ID262
Feature activation-9.41e-02

Pos logit contributions
uv+0.693
ourning+0.658
ylan+0.629
 goose+0.561
iber+0.539
Neg logit contributions
leys-0.876
mingham-0.866
 VIDE-0.856
\/\/-0.825
arate-0.824
Token C
Token ID327
Feature activation+5.65e-02

Pos logit contributions
uv+0.163
ourning+0.155
ylan+0.149
 goose+0.133
iber+0.129
Neg logit contributions
leys-0.177
mingham-0.175
 VIDE-0.173
\/\/-0.166
arate-0.166
TokenUP
Token ID8577
Feature activation-1.43e+00

Pos logit contributions
leys+0.108
mingham+0.106
 VIDE+0.104
arate+0.101
\/\/+0.101
Neg logit contributions
uv-0.095
ourning-0.092
ylan-0.087
 goose-0.079
 weeping-0.076
Token will
Token ID481
Feature activation-4.57e-01

Pos logit contributions
uv+2.456
ourning+2.326
ylan+2.238
 goose+2.037
iber+1.964
Neg logit contributions
leys-2.669
mingham-2.653
 VIDE-2.609
\/\/-2.502
arate-2.491
Token continue
Token ID2555
Feature activation-3.36e-01

Pos logit contributions
uv+0.793
ourning+0.756
ylan+0.725
 goose+0.656
iber+0.631
Neg logit contributions
leys-0.859
mingham-0.848
 VIDE-0.837
\/\/-0.805
arate-0.803
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
uv+0.545
ourning+0.521
ylan+0.498
 goose+0.442
iber+0.426
Neg logit contributions
leys-0.666
mingham-0.655
 VIDE-0.649
arate-0.626
\/\/-0.625
Token support
Token ID1104
Feature activation+6.57e-01

Pos logit contributions
uv+0.038
ourning+0.036
ylan+0.035
 goose+0.031
iber+0.030
Neg logit contributions
leys-0.039
mingham-0.039
 VIDE-0.038
arate-0.037
\/\/-0.037
Token Pu
Token ID11634
Feature activation-2.49e-01

Pos logit contributions
leys+1.299
mingham+1.276
 VIDE+1.261
arate+1.227
\/\/+1.216
Neg logit contributions
uv-1.067
ourning-1.020
ylan-0.974
 goose-0.866
iber-0.830
Token
Token ID447
Feature activation+1.28e-02

Pos logit contributions
leys+0.255
mingham+0.251
 VIDE+0.250
\/\/+0.239
arate+0.239
Neg logit contributions
uv-0.248
ourning-0.237
ylan-0.229
 goose-0.207
iber-0.199
Token
Token ID251
Feature activation+1.66e-02

Pos logit contributions
leys+0.028
mingham+0.028
 VIDE+0.027
\/\/+0.027
arate+0.027
Neg logit contributions
uv-0.018
ourning-0.017
ylan-0.016
 goose-0.014
iber-0.013
Token
Token ID198
Feature activation+1.26e-01

Pos logit contributions
leys+0.031
mingham+0.031
 VIDE+0.030
\/\/+0.029
arate+0.029
Neg logit contributions
uv-0.029
ourning-0.028
ylan-0.027
 goose-0.024
iber-0.023
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
leys+0.254
mingham+0.251
 VIDE+0.247
\/\/+0.240
arate+0.240
Neg logit contributions
uv-0.198
ourning-0.191
ylan-0.181
 goose-0.162
 weeping-0.156
TokenCont
Token ID4264
Feature activation+3.25e-01

Pos logit contributions
leys+0.288
mingham+0.282
 VIDE+0.279
\/\/+0.267
arate+0.266
Neg logit contributions
uv-0.393
ourning-0.380
ylan-0.367
 goose-0.339
iber-0.327
Tokenrad
Token ID6335
Feature activation-1.42e+00

Pos logit contributions
leys+0.498
mingham+0.490
 VIDE+0.479
arate+0.461
\/\/+0.457
Neg logit contributions
uv-0.672
ourning-0.646
ylan-0.626
 goose-0.578
 weeping-0.557
Tokenict
Token ID713
Feature activation-2.64e-03

Pos logit contributions
uv+3.158
ourning+3.062
ylan+2.946
 goose+2.703
iber+2.674
Neg logit contributions
mingham-1.942
leys-1.928
 VIDE-1.924
\/\/-1.806
arate-1.762
Tokening
Token ID278
Feature activation-7.83e-02

Pos logit contributions
uv+0.004
ourning+0.004
ylan+0.004
 goose+0.004
iber+0.004
Neg logit contributions
leys-0.005
mingham-0.005
 VIDE-0.005
arate-0.005
\/\/-0.005
Token the
Token ID262
Feature activation+1.40e-01

Pos logit contributions
uv+0.125
ourning+0.118
ylan+0.113
 goose+0.101
iber+0.097
Neg logit contributions
leys-0.158
mingham-0.156
 VIDE-0.155
\/\/-0.149
arate-0.149
Token sense
Token ID2565
Feature activation+2.89e-01

Pos logit contributions
leys+0.263
mingham+0.258
 VIDE+0.256
\/\/+0.246
arate+0.245
Neg logit contributions
uv-0.244
ourning-0.233
ylan-0.223
 goose-0.201
iber-0.194
Token of
Token ID286
Feature activation+2.58e-01

Pos logit contributions
leys+0.581
mingham+0.574
 VIDE+0.567
\/\/+0.546
arate+0.546
Neg logit contributions
uv-0.462
ourning-0.440
ylan-0.420
 goose-0.375
iber-0.360
Token:
Token ID25
Feature activation-1.65e-01

Pos logit contributions
uv+0.283
ourning+0.270
ylan+0.257
 goose+0.230
iber+0.221
Neg logit contributions
leys-0.343
mingham-0.339
 VIDE-0.335
\/\/-0.323
arate-0.323
Token47
Token ID2857
Feature activation-7.10e-01

Pos logit contributions
uv+0.253
ourning+0.240
ylan+0.229
 goose+0.203
iber+0.194
Neg logit contributions
leys-0.344
mingham-0.341
 VIDE-0.336
\/\/-0.325
arate-0.325
Token]
Token ID60
Feature activation-5.28e-01

Pos logit contributions
uv+1.061
ourning+0.998
ylan+0.952
 goose+0.843
iber+0.805
Neg logit contributions
leys-1.508
mingham-1.490
 VIDE-1.475
\/\/-1.420
arate-1.419
Token <
Token ID1279
Feature activation-6.92e-01

Pos logit contributions
uv+1.062
ourning+1.022
ylan+0.989
 goose+0.912
iber+0.884
Neg logit contributions
leys-0.837
mingham-0.816
 VIDE-0.791
arate-0.773
\/\/-0.769
Tokenc
Token ID66
Feature activation-2.17e-01

Pos logit contributions
uv+1.238
ourning+1.180
ylan+1.135
 goose+1.026
iber+0.992
Neg logit contributions
leys-1.261
mingham-1.244
 VIDE-1.233
\/\/-1.179
arate-1.178
Tokenerv
Token ID712
Feature activation-1.42e+00

Pos logit contributions
uv+0.395
ourning+0.378
ylan+0.363
 goose+0.330
iber+0.317
Neg logit contributions
leys-0.390
mingham-0.384
 VIDE-0.379
\/\/-0.364
arate-0.364
Tokenant
Token ID415
Feature activation-1.38e-01

Pos logit contributions
uv+2.682
ourning+2.526
ylan+2.469
 goose+2.207
iber+2.185
Neg logit contributions
leys-2.416
 VIDE-2.396
mingham-2.383
\/\/-2.283
arate-2.233
Tokenas
Token ID292
Feature activation-3.34e-01

Pos logit contributions
uv+0.253
ourning+0.243
ylan+0.233
 goose+0.212
iber+0.204
Neg logit contributions
leys-0.245
mingham-0.241
 VIDE-0.237
arate-0.228
\/\/-0.228
Token>
Token ID29
Feature activation-1.50e-01

Pos logit contributions
uv+0.597
ourning+0.572
ylan+0.548
 goose+0.497
iber+0.478
Neg logit contributions
leys-0.611
mingham-0.602
 VIDE-0.593
arate-0.570
\/\/-0.570
Token lol
Token ID19462
Feature activation-3.54e-01

Pos logit contributions
uv+0.271
ourning+0.259
ylan+0.249
 goose+0.226
iber+0.218
Neg logit contributions
leys-0.272
mingham-0.268
 VIDE-0.264
\/\/-0.254
arate-0.254
Token m
Token ID285
Feature activation-2.69e-01

Pos logit contributions
uv+0.623
ourning+0.593
ylan+0.571
 goose+0.517
iber+0.496
Neg logit contributions
leys-0.657
mingham-0.649
 VIDE-0.640
\/\/-0.615
arate-0.614
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
leys+0.790
mingham+0.773
 VIDE+0.753
arate+0.729
\/\/+0.722
Neg logit contributions
uv-0.954
ourning-0.916
ylan-0.879
 goose-0.811
 weeping-0.779
Token Chase
Token ID16346
Feature activation+8.20e-01

Pos logit contributions
leys+0.176
mingham+0.173
 VIDE+0.171
\/\/+0.164
arate+0.164
Neg logit contributions
uv-0.196
ourning-0.188
ylan-0.181
 goose-0.164
iber-0.159
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
leys+1.593
mingham+1.575
 VIDE+1.533
arate+1.495
\/\/+1.480
Neg logit contributions
uv-1.337
ourning-1.270
ylan-1.224
 goose-1.073
iber-1.040
Token proposing
Token ID24634
Feature activation+3.28e-01

Pos logit contributions
leys+0.634
mingham+0.627
 VIDE+0.620
arate+0.600
\/\/+0.600
Neg logit contributions
uv-0.383
ourning-0.359
ylan-0.341
 goose-0.292
iber-0.279
Token a
Token ID257
Feature activation+1.57e-01

Pos logit contributions
leys+0.664
mingham+0.652
 VIDE+0.645
arate+0.624
\/\/+0.621
Neg logit contributions
uv-0.521
ourning-0.494
ylan-0.473
 goose-0.418
iber-0.402
Token bronze
Token ID22101
Feature activation+9.09e-01

Pos logit contributions
leys+0.275
mingham+0.271
 VIDE+0.267
\/\/+0.256
arate+0.256
Neg logit contributions
uv-0.292
ourning-0.279
ylan-0.269
 goose-0.244
iber-0.236
Token cent
Token ID1247
Feature activation+5.37e-01

Pos logit contributions
leys+1.551
mingham+1.529
 VIDE+1.501
arate+1.445
\/\/+1.442
Neg logit contributions
uv-1.711
ourning-1.648
ylan-1.581
 goose-1.423
iber-1.392
Token and
Token ID290
Feature activation+4.40e-02

Pos logit contributions
leys+0.971
mingham+0.957
 VIDE+0.945
arate+0.921
\/\/+0.910
Neg logit contributions
uv-0.952
ourning-0.911
ylan-0.877
 goose-0.784
iber-0.762
Token two
Token ID734
Feature activation+3.85e-01

Pos logit contributions
leys+0.088
mingham+0.086
 VIDE+0.086
arate+0.082
\/\/+0.082
Neg logit contributions
uv-0.071
ourning-0.067
ylan-0.065
 goose-0.057
iber-0.055
Token-
Token ID12
Feature activation+2.91e-04

Pos logit contributions
leys+0.696
mingham+0.683
 VIDE+0.674
arate+0.651
\/\/+0.648
Neg logit contributions
uv-0.694
ourning-0.665
ylan-0.638
 goose-0.572
iber-0.558
Tokencent
Token ID1087
Feature activation+4.01e-01

Pos logit contributions
leys+0.001
mingham+0.001
 VIDE+0.001
arate+0.001
\/\/+0.001
Neg logit contributions
uv-0.001
ourning-0.000
ylan-0.000
 goose-0.000
iber-0.000
Token)
Token ID8
Feature activation+2.89e-01

Pos logit contributions
uv+0.111
ourning+0.105
ylan+0.101
 goose+0.091
iber+0.088
Neg logit contributions
leys-0.124
mingham-0.123
 VIDE-0.121
\/\/-0.116
arate-0.116
Token in
Token ID287
Feature activation+1.74e-01

Pos logit contributions
leys+0.537
mingham+0.529
 VIDE+0.522
\/\/+0.501
arate+0.501
Neg logit contributions
uv-0.506
ourning-0.486
ylan-0.466
 goose-0.419
iber-0.403
Token June
Token ID2795
Feature activation+6.89e-01

Pos logit contributions
leys+0.316
mingham+0.311
 VIDE+0.307
\/\/+0.295
arate+0.295
Neg logit contributions
uv-0.311
ourning-0.298
ylan-0.286
 goose-0.258
iber-0.249
Token 2012
Token ID2321
Feature activation+2.94e-01

Pos logit contributions
leys+1.180
mingham+1.152
 VIDE+1.142
\/\/+1.103
arate+1.097
Neg logit contributions
uv-1.301
ourning-1.259
ylan-1.205
 goose-1.090
iber-1.053
Token E
Token ID412
Feature activation+5.56e-01

Pos logit contributions
leys+0.515
mingham+0.508
 VIDE+0.502
arate+0.480
\/\/+0.480
Neg logit contributions
uv-0.547
ourning-0.524
ylan-0.504
 goose-0.458
iber-0.442
TokenBS
Token ID4462
Feature activation+7.77e-01

Pos logit contributions
leys+0.646
mingham+0.631
 VIDE+0.609
\/\/+0.579
arate+0.576
Neg logit contributions
uv-1.360
ourning-1.323
ylan-1.276
 goose-1.202
 weeping-1.166
TokenCO
Token ID8220
Feature activation-8.26e-02

Pos logit contributions
leys+1.673
mingham+1.662
 VIDE+1.648
arate+1.590
\/\/+1.585
Neg logit contributions
uv-1.117
ourning-1.060
ylan-1.004
 goose-0.882
 weeping-0.845
Token has
Token ID468
Feature activation+3.67e-01

Pos logit contributions
uv+0.146
ourning+0.139
ylan+0.134
 goose+0.121
iber+0.117
Neg logit contributions
leys-0.152
mingham-0.150
 VIDE-0.148
\/\/-0.142
arate-0.142
Token partnered
Token ID29806
Feature activation+1.96e-01

Pos logit contributions
leys+0.744
mingham+0.735
 VIDE+0.727
\/\/+0.701
arate+0.698
Neg logit contributions
uv-0.581
ourning-0.554
ylan-0.529
 goose-0.465
iber-0.450
Token with
Token ID351
Feature activation+3.17e-01

Pos logit contributions
leys+0.337
mingham+0.332
 VIDE+0.328
\/\/+0.314
arate+0.313
Neg logit contributions
uv-0.371
ourning-0.355
ylan-0.343
 goose-0.312
iber-0.303
Token many
Token ID867
Feature activation+1.92e-01

Pos logit contributions
leys+0.603
mingham+0.595
 VIDE+0.589
\/\/+0.565
arate+0.565
Neg logit contributions
uv-0.542
ourning-0.519
ylan-0.496
 goose-0.446
iber-0.429
Token the
Token ID262
Feature activation+5.51e-02

Pos logit contributions
uv+0.216
ourning+0.205
ylan+0.195
 goose+0.173
iber+0.166
Neg logit contributions
leys-0.288
mingham-0.285
 VIDE-0.282
\/\/-0.272
arate-0.271
Token low
Token ID1877
Feature activation-4.68e-01

Pos logit contributions
leys+0.110
mingham+0.109
 VIDE+0.108
\/\/+0.104
arate+0.103
Neg logit contributions
uv-0.088
ourning-0.085
ylan-0.081
 goose-0.071
iber-0.069
Token orbit
Token ID13066
Feature activation+6.74e-01

Pos logit contributions
uv+0.792
ourning+0.745
ylan+0.717
 goose+0.647
iber+0.620
Neg logit contributions
leys-0.905
mingham-0.893
 VIDE-0.884
arate-0.851
\/\/-0.850
Token of
Token ID286
Feature activation-2.42e-02

Pos logit contributions
leys+1.171
mingham+1.150
 VIDE+1.135
\/\/+1.086
arate+1.084
Neg logit contributions
uv-1.258
ourning-1.218
ylan-1.161
 goose-1.048
iber-1.021
Token the
Token ID262
Feature activation-4.11e-03

Pos logit contributions
uv+0.037
ourning+0.036
ylan+0.034
 goose+0.030
iber+0.029
Neg logit contributions
leys-0.050
mingham-0.049
 VIDE-0.049
\/\/-0.047
arate-0.047
Token space
Token ID2272
Feature activation+7.86e-01

Pos logit contributions
uv+0.006
ourning+0.006
ylan+0.006
 goose+0.005
iber+0.005
Neg logit contributions
leys-0.009
mingham-0.009
 VIDE-0.008
\/\/-0.008
arate-0.008
Token station
Token ID4429
Feature activation+1.62e-01

Pos logit contributions
leys+1.125
mingham+1.094
 VIDE+1.072
\/\/+1.017
arate+1.011
Neg logit contributions
uv-1.710
ourning-1.665
ylan-1.584
 goose-1.453
iber-1.432
Token as
Token ID355
Feature activation-1.40e-02

Pos logit contributions
leys+0.273
mingham+0.269
 VIDE+0.266
\/\/+0.255
arate+0.253
Neg logit contributions
uv-0.313
ourning-0.301
ylan-0.290
 goose-0.263
iber-0.255
Token safely
Token ID11512
Feature activation+1.28e-01

Pos logit contributions
uv+0.024
ourning+0.023
ylan+0.022
 goose+0.020
iber+0.019
Neg logit contributions
leys-0.026
mingham-0.026
 VIDE-0.025
\/\/-0.024
arate-0.024
Token as
Token ID355
Feature activation-4.36e-02

Pos logit contributions
leys+0.215
mingham+0.212
 VIDE+0.208
\/\/+0.200
arate+0.199
Neg logit contributions
uv-0.245
ourning-0.236
ylan-0.227
 goose-0.207
iber-0.200
Token possible
Token ID1744
Feature activation+1.08e-01

Pos logit contributions
uv+0.061
ourning+0.057
ylan+0.054
 goose+0.047
iber+0.045
Neg logit contributions
leys-0.097
mingham-0.096
 VIDE-0.095
\/\/-0.092
arate-0.091
Token of
Token ID286
Feature activation+6.11e-03

Pos logit contributions
uv+0.535
ourning+0.509
ylan+0.488
 goose+0.438
iber+0.422
Neg logit contributions
leys-0.623
mingham-0.615
 VIDE-0.609
\/\/-0.585
arate-0.584
Token beginners
Token ID31729
Feature activation-2.42e-03

Pos logit contributions
leys+0.012
mingham+0.012
 VIDE+0.012
arate+0.011
\/\/+0.011
Neg logit contributions
uv-0.010
ourning-0.009
ylan-0.009
 goose-0.008
iber-0.008
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
uv+0.005
ourning+0.004
ylan+0.004
 goose+0.004
iber+0.004
Neg logit contributions
leys-0.004
mingham-0.004
 VIDE-0.004
arate-0.004
\/\/-0.004
Token We
Token ID775
Feature activation-1.56e-01

Pos logit contributions
uv+0.196
ourning+0.187
ylan+0.180
 goose+0.162
iber+0.156
Neg logit contributions
leys-0.214
mingham-0.212
 VIDE-0.210
\/\/-0.201
arate-0.201
Token have
Token ID423
Feature activation+1.85e-03

Pos logit contributions
uv+0.283
ourning+0.270
ylan+0.260
 goose+0.236
iber+0.228
Neg logit contributions
leys-0.282
mingham-0.277
 VIDE-0.274
\/\/-0.263
arate-0.262
Token considered
Token ID3177
Feature activation+8.98e-01

Pos logit contributions
leys+0.003
mingham+0.003
 VIDE+0.003
arate+0.003
\/\/+0.003
Neg logit contributions
uv-0.003
ourning-0.003
ylan-0.003
 goose-0.003
iber-0.003
Token that
Token ID326
Feature activation+8.48e-02

Pos logit contributions
leys+1.731
mingham+1.696
 VIDE+1.678
arate+1.623
\/\/+1.605
Neg logit contributions
uv-1.500
ourning-1.440
ylan-1.377
 goose-1.220
iber-1.173
Token students
Token ID2444
Feature activation+2.37e-01

Pos logit contributions
leys+0.162
mingham+0.160
 VIDE+0.158
arate+0.152
\/\/+0.151
Neg logit contributions
uv-0.144
ourning-0.138
ylan-0.132
 goose-0.118
iber-0.114
Token in
Token ID287
Feature activation+3.66e-02

Pos logit contributions
leys+0.429
mingham+0.423
 VIDE+0.417
arate+0.403
\/\/+0.401
Neg logit contributions
uv-0.426
ourning-0.408
ylan-0.391
 goose-0.352
iber-0.340
Token this
Token ID428
Feature activation+1.14e-01

Pos logit contributions
leys+0.073
mingham+0.072
 VIDE+0.071
arate+0.069
\/\/+0.069
Neg logit contributions
uv-0.059
ourning-0.056
ylan-0.054
 goose-0.048
iber-0.046
Token doctrine
Token ID13115
Feature activation+1.59e-01

Pos logit contributions
leys+0.191
mingham+0.188
 VIDE+0.185
\/\/+0.177
arate+0.177
Neg logit contributions
uv-0.222
ourning-0.213
ylan-0.205
 goose-0.187
iber-0.181
Token would
Token ID561
Feature activation-2.92e-02

Pos logit contributions
leys+0.113
mingham+0.112
 VIDE+0.110
\/\/+0.105
arate+0.105
Neg logit contributions
uv-0.128
ourning-0.122
ylan-0.118
 goose-0.108
iber-0.104
Token,
Token ID11
Feature activation-2.38e-02

Pos logit contributions
uv+0.055
ourning+0.053
ylan+0.051
 goose+0.046
iber+0.045
Neg logit contributions
leys-0.050
mingham-0.050
 VIDE-0.049
arate-0.047
\/\/-0.047
Token and
Token ID290
Feature activation+5.18e-02

Pos logit contributions
uv+0.045
ourning+0.043
ylan+0.042
 goose+0.038
iber+0.037
Neg logit contributions
leys-0.041
mingham-0.040
 VIDE-0.040
\/\/-0.038
arate-0.038
Token increasingly
Token ID6481
Feature activation+2.60e-01

Pos logit contributions
leys+0.096
mingham+0.094
 VIDE+0.093
\/\/+0.090
arate+0.090
Neg logit contributions
uv-0.091
ourning-0.087
ylan-0.084
 goose-0.075
iber-0.073
Token look
Token ID804
Feature activation-1.01e-01

Pos logit contributions
leys+0.479
mingham+0.472
 VIDE+0.468
arate+0.450
\/\/+0.447
Neg logit contributions
uv-0.458
ourning-0.441
ylan-0.421
 goose-0.375
iber-0.365
Token forward
Token ID2651
Feature activation+7.66e-01

Pos logit contributions
uv+0.172
ourning+0.165
ylan+0.158
 goose+0.141
iber+0.137
Neg logit contributions
leys-0.191
mingham-0.188
 VIDE-0.185
arate-0.178
\/\/-0.178
Token to
Token ID284
Feature activation+1.38e-01

Pos logit contributions
leys+1.502
mingham+1.479
 VIDE+1.463
arate+1.410
\/\/+1.408
Neg logit contributions
uv-1.261
ourning-1.202
ylan-1.150
 goose-1.029
iber-0.988
Token going
Token ID1016
Feature activation+1.29e-01

Pos logit contributions
leys+0.278
mingham+0.274
 VIDE+0.271
\/\/+0.262
arate+0.261
Neg logit contributions
uv-0.220
ourning-0.210
ylan-0.201
 goose-0.177
iber-0.172
Token home
Token ID1363
Feature activation+2.52e-01

Pos logit contributions
leys+0.241
mingham+0.238
 VIDE+0.235
\/\/+0.226
arate+0.226
Neg logit contributions
uv-0.224
ourning-0.215
ylan-0.206
 goose-0.185
iber-0.179
Token.
Token ID13
Feature activation+3.93e-03

Pos logit contributions
leys+0.426
mingham+0.420
 VIDE+0.414
\/\/+0.398
arate+0.397
Neg logit contributions
uv-0.482
ourning-0.466
ylan-0.447
 goose-0.404
iber-0.393
Token Others
Token ID12691
Feature activation+2.46e-01

Pos logit contributions
leys+0.007
mingham+0.007
 VIDE+0.007
\/\/+0.007
arate+0.007
Neg logit contributions
uv-0.007
ourning-0.007
ylan-0.007
 goose-0.006
iber-0.006
Token in
Token ID287
Feature activation-5.62e-02

Pos logit contributions
leys+0.517
mingham+0.511
 VIDE+0.504
arate+0.486
\/\/+0.486
Neg logit contributions
uv-0.448
ourning-0.428
ylan-0.410
 goose-0.367
iber-0.354
Token demand
Token ID3512
Feature activation-3.60e-01

Pos logit contributions
uv+0.107
ourning+0.103
ylan+0.099
 goose+0.090
iber+0.087
Neg logit contributions
leys-0.096
mingham-0.094
 VIDE-0.093
\/\/-0.089
arate-0.089
Token from
Token ID422
Feature activation-1.37e-01

Pos logit contributions
uv+0.606
ourning+0.579
ylan+0.554
 goose+0.496
iber+0.478
Neg logit contributions
leys-0.695
mingham-0.686
 VIDE-0.676
\/\/-0.652
arate-0.652
Token investors
Token ID7713
Feature activation+1.42e-01

Pos logit contributions
uv+0.239
ourning+0.228
ylan+0.219
 goose+0.197
iber+0.190
Neg logit contributions
leys-0.255
mingham-0.251
 VIDE-0.248
\/\/-0.238
arate-0.238
Token
Token ID447
Feature activation-1.30e-01

Pos logit contributions
leys+0.255
mingham+0.251
 VIDE+0.247
arate+0.239
\/\/+0.239
Neg logit contributions
uv-0.255
ourning-0.245
ylan-0.233
 goose-0.209
iber-0.203
Token
Token ID251
Feature activation-6.44e-02

Pos logit contributions
uv+0.158
ourning+0.148
ylan+0.139
 goose+0.119
iber+0.112
Neg logit contributions
leys-0.312
mingham-0.308
 VIDE-0.304
arate-0.296
\/\/-0.296
Token.
Token ID13
Feature activation-4.19e-03

Pos logit contributions
uv+0.116
ourning+0.110
ylan+0.106
 goose+0.096
iber+0.092
Neg logit contributions
leys-0.117
mingham-0.115
 VIDE-0.113
\/\/-0.109
arate-0.109
Token
Token ID198
Feature activation-1.98e-01

Pos logit contributions
uv+0.007
ourning+0.007
ylan+0.006
 goose+0.006
 weeping+0.005
Neg logit contributions
leys-0.008
mingham-0.008
 VIDE-0.008
\/\/-0.008
arate-0.008
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
uv+0.305
ourning+0.291
ylan+0.277
 goose+0.246
iber+0.235
Neg logit contributions
leys-0.409
mingham-0.405
 VIDE-0.399
\/\/-0.387
arate-0.387
TokenOf
Token ID5189
Feature activation+1.82e-01

Pos logit contributions
uv+0.253
ourning+0.243
ylan+0.235
 goose+0.216
iber+0.209
Neg logit contributions
leys-0.190
mingham-0.187
 VIDE-0.184
\/\/-0.176
arate-0.175
Token course
Token ID1781
Feature activation+3.17e-01

Pos logit contributions
leys+0.294
mingham+0.289
 VIDE+0.284
arate+0.273
\/\/+0.272
Neg logit contributions
uv-0.364
ourning-0.349
ylan-0.337
 goose-0.309
iber-0.300
Token,
Token ID11
Feature activation+4.97e-02

Pos logit contributions
leys+0.073
mingham+0.072
 VIDE+0.071
\/\/+0.068
arate+0.068
Neg logit contributions
uv-0.068
ourning-0.065
ylan-0.062
 goose-0.056
iber-0.054
Token the
Token ID262
Feature activation+3.39e-02

Pos logit contributions
leys+0.096
mingham+0.095
 VIDE+0.094
\/\/+0.090
arate+0.090
Neg logit contributions
uv-0.083
ourning-0.079
ylan-0.076
 goose-0.068
iber-0.065
Token NFL
Token ID5134
Feature activation+3.08e-02

Pos logit contributions
leys+0.058
mingham+0.057
 VIDE+0.056
\/\/+0.054
arate+0.054
Neg logit contributions
uv-0.065
ourning-0.062
ylan-0.060
 goose-0.055
iber-0.053
TokenPA
Token ID4537
Feature activation-1.44e-01

Pos logit contributions
leys+0.055
mingham+0.054
 VIDE+0.053
\/\/+0.051
arate+0.051
Neg logit contributions
uv-0.056
ourning-0.054
ylan-0.052
 goose-0.047
iber-0.045
Token didn
Token ID1422
Feature activation+6.67e-02

Pos logit contributions
uv+0.236
ourning+0.225
ylan+0.215
 goose+0.192
iber+0.185
Neg logit contributions
leys-0.284
mingham-0.280
 VIDE-0.277
\/\/-0.267
arate-0.267
Token't
Token ID470
Feature activation+1.79e-01

Pos logit contributions
leys+0.117
mingham+0.116
 VIDE+0.114
arate+0.110
\/\/+0.110
Neg logit contributions
uv-0.123
ourning-0.118
ylan-0.114
 goose-0.103
 weeping-0.100
Token have
Token ID423
Feature activation+4.25e-02

Pos logit contributions
leys+0.319
mingham+0.314
 VIDE+0.310
\/\/+0.298
arate+0.297
Neg logit contributions
uv-0.328
ourning-0.313
ylan-0.301
 goose-0.274
iber-0.264
Token to
Token ID284
Feature activation+8.18e-02

Pos logit contributions
leys+0.085
mingham+0.083
 VIDE+0.083
arate+0.080
\/\/+0.079
Neg logit contributions
uv-0.069
ourning-0.066
ylan-0.063
 goose-0.056
iber-0.054
Token build
Token ID1382
Feature activation-1.04e-01

Pos logit contributions
leys+0.149
mingham+0.147
 VIDE+0.145
\/\/+0.139
arate+0.139
Neg logit contributions
uv-0.147
ourning-0.140
ylan-0.135
 goose-0.122
iber-0.118
Token in
Token ID287
Feature activation-3.26e-02

Pos logit contributions
uv+0.167
ourning+0.159
ylan+0.152
 goose+0.136
iber+0.130
Neg logit contributions
leys-0.210
mingham-0.208
 VIDE-0.205
\/\/-0.198
arate-0.197
Token a
Token ID257
Feature activation+3.03e-02

Pos logit contributions
uv+0.057
ourning+0.054
ylan+0.052
 goose+0.047
iber+0.045
Neg logit contributions
leys-0.061
mingham-0.060
 VIDE-0.060
\/\/-0.057
arate-0.057
Token children
Token ID1751
Feature activation-6.84e-01

Pos logit contributions
uv+0.022
ourning+0.020
ylan+0.019
 goose+0.017
iber+0.016
Neg logit contributions
leys-0.035
mingham-0.034
 VIDE-0.034
\/\/-0.033
arate-0.033
Token,
Token ID11
Feature activation-1.32e-01

Pos logit contributions
uv+1.273
ourning+1.219
ylan+1.172
 goose+1.066
iber+1.030
Neg logit contributions
leys-1.199
mingham-1.182
 VIDE-1.167
\/\/-1.118
arate-1.117
Token often
Token ID1690
Feature activation+1.61e-01

Pos logit contributions
uv+0.237
ourning+0.227
ylan+0.218
 goose+0.197
iber+0.190
Neg logit contributions
leys-0.240
mingham-0.237
 VIDE-0.234
arate-0.224
\/\/-0.224
Token in
Token ID287
Feature activation-1.06e-01

Pos logit contributions
leys+0.303
mingham+0.296
 VIDE+0.292
arate+0.283
\/\/+0.282
Neg logit contributions
uv-0.280
ourning-0.268
ylan-0.257
 goose-0.226
iber-0.221
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
uv+0.181
ourning+0.174
ylan+0.166
 goose+0.148
iber+0.145
Neg logit contributions
leys-0.199
mingham-0.196
 VIDE-0.193
arate-0.187
\/\/-0.186
Token form
Token ID1296
Feature activation+1.17e-01

Pos logit contributions
uv+0.230
ourning+0.221
ylan+0.213
 goose+0.195
iber+0.188
Neg logit contributions
leys-0.195
mingham-0.192
 VIDE-0.190
\/\/-0.181
arate-0.181
Token of
Token ID286
Feature activation-9.14e-02

Pos logit contributions
leys+0.234
mingham+0.232
 VIDE+0.229
\/\/+0.221
arate+0.220
Neg logit contributions
uv-0.188
ourning-0.178
ylan-0.170
 goose-0.152
iber-0.146
Token compounds
Token ID16439
Feature activation-8.35e-01

Pos logit contributions
uv+0.158
ourning+0.151
ylan+0.145
 goose+0.130
iber+0.126
Neg logit contributions
leys-0.172
mingham-0.169
 VIDE-0.167
arate-0.161
\/\/-0.161
Token consisting
Token ID17747
Feature activation-3.16e-01

Pos logit contributions
uv+1.472
ourning+1.402
ylan+1.347
 goose+1.224
iber+1.176
Neg logit contributions
leys-1.543
mingham-1.529
 VIDE-1.508
\/\/-1.448
arate-1.443
Token of
Token ID286
Feature activation-1.73e-01

Pos logit contributions
uv+0.503
ourning+0.477
ylan+0.457
 goose+0.410
iber+0.390
Neg logit contributions
leys-0.637
mingham-0.629
 VIDE-0.624
\/\/-0.602
arate-0.598
Token two
Token ID734
Feature activation-4.42e-03

Pos logit contributions
uv+0.303
ourning+0.289
ylan+0.277
 goose+0.251
iber+0.241
Neg logit contributions
leys-0.322
mingham-0.318
 VIDE-0.314
\/\/-0.301
arate-0.301
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
uv+0.350
ourning+0.335
ylan+0.321
 goose+0.291
iber+0.281
Neg logit contributions
leys-0.352
mingham-0.346
 VIDE-0.342
arate-0.328
\/\/-0.328
Token only
Token ID691
Feature activation+2.79e-01

Pos logit contributions
uv+0.180
ourning+0.171
ylan+0.164
 goose+0.147
iber+0.142
Neg logit contributions
leys-0.206
mingham-0.203
 VIDE-0.201
\/\/-0.193
arate-0.193
Token the
Token ID262
Feature activation-7.21e-02

Pos logit contributions
leys+0.554
mingham+0.546
 VIDE+0.541
\/\/+0.520
arate+0.520
Neg logit contributions
uv-0.453
ourning-0.432
ylan-0.412
 goose-0.368
iber-0.354
Token window
Token ID4324
Feature activation+1.19e-01

Pos logit contributions
uv+0.127
ourning+0.121
ylan+0.116
 goose+0.105
iber+0.101
Neg logit contributions
leys-0.134
mingham-0.132
 VIDE-0.130
\/\/-0.125
arate-0.125
Token-
Token ID12
Feature activation+9.36e-02

Pos logit contributions
leys+0.236
mingham+0.232
 VIDE+0.231
arate+0.222
\/\/+0.221
Neg logit contributions
uv-0.194
ourning-0.185
ylan-0.176
 goose-0.157
iber-0.151
Tokenlocal
Token ID12001
Feature activation+4.34e-01

Pos logit contributions
leys+0.169
mingham+0.167
 VIDE+0.164
arate+0.158
\/\/+0.158
Neg logit contributions
uv-0.168
ourning-0.162
ylan-0.155
 goose-0.141
iber-0.135
Token options
Token ID3689
Feature activation-9.47e-02

Pos logit contributions
leys+0.730
mingham+0.718
 VIDE+0.709
arate+0.678
\/\/+0.677
Neg logit contributions
uv-0.838
ourning-0.806
ylan-0.773
 goose-0.705
iber-0.684
Token from
Token ID422
Feature activation-4.73e-02

Pos logit contributions
uv+0.170
ourning+0.162
ylan+0.156
 goose+0.141
iber+0.136
Neg logit contributions
leys-0.172
mingham-0.170
 VIDE-0.168
arate-0.161
\/\/-0.161
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
uv+0.074
ourning+0.070
ylan+0.067
 goose+0.060
iber+0.057
Neg logit contributions
leys-0.097
mingham-0.096
 VIDE-0.095
\/\/-0.091
arate-0.091
Token model
Token ID2746
Feature activation+7.12e-01

Pos logit contributions
uv+0.201
ourning+0.192
ylan+0.185
 goose+0.168
iber+0.162
Neg logit contributions
leys-0.201
mingham-0.198
 VIDE-0.196
\/\/-0.188
arate-0.188
Tokenine
Token ID500
Feature activation-4.74e-01

Pos logit contributions
leys+1.195
mingham+1.172
 VIDE+1.158
arate+1.110
\/\/+1.107
Neg logit contributions
uv-1.378
ourning-1.323
ylan-1.271
 goose-1.162
iber-1.121
Token while
Token ID981
Feature activation-1.84e-01

Pos logit contributions
uv+0.007
ourning+0.007
ylan+0.007
 goose+0.006
iber+0.006
Neg logit contributions
leys-0.008
mingham-0.008
 VIDE-0.008
\/\/-0.007
arate-0.007
Token the
Token ID262
Feature activation-5.71e-02

Pos logit contributions
uv+0.314
ourning+0.299
ylan+0.286
 goose+0.258
iber+0.248
Neg logit contributions
leys-0.352
mingham-0.347
 VIDE-0.343
\/\/-0.330
arate-0.330
Token Apple
Token ID4196
Feature activation+7.08e-02

Pos logit contributions
uv+0.105
ourning+0.101
ylan+0.097
 goose+0.088
iber+0.085
Neg logit contributions
leys-0.101
mingham-0.100
 VIDE-0.098
\/\/-0.094
arate-0.094
Token is
Token ID318
Feature activation+3.31e-03

Pos logit contributions
leys+0.129
mingham+0.126
 VIDE+0.125
\/\/+0.120
arate+0.120
Neg logit contributions
uv-0.127
ourning-0.122
ylan-0.117
 goose-0.106
iber-0.102
Token being
Token ID852
Feature activation+1.31e-01

Pos logit contributions
leys+0.006
mingham+0.006
 VIDE+0.006
\/\/+0.006
arate+0.006
Neg logit contributions
uv-0.006
ourning-0.005
ylan-0.005
 goose-0.005
iber-0.004
Token "
Token ID366
Feature activation-1.22e-01

Pos logit contributions
leys+0.263
mingham+0.259
 VIDE+0.256
\/\/+0.248
arate+0.246
Neg logit contributions
uv-0.208
ourning-0.200
ylan-0.190
 goose-0.168
iber-0.162
Tokencool
Token ID24494
Feature activation+4.92e-02

Pos logit contributions
uv+0.217
ourning+0.209
ylan+0.200
 goose+0.181
 weeping+0.174
Neg logit contributions
leys-0.224
mingham-0.220
 VIDE-0.217
\/\/-0.209
arate-0.209
Token".
Token ID1911
Feature activation+3.31e-01

Pos logit contributions
leys+0.078
mingham+0.077
 VIDE+0.076
\/\/+0.072
arate+0.072
Neg logit contributions
uv-0.100
ourning-0.096
ylan-0.093
 goose-0.085
iber-0.083
Token I
Token ID314
Feature activation+1.42e-01

Pos logit contributions
leys+0.569
mingham+0.558
 VIDE+0.556
\/\/+0.530
arate+0.530
Neg logit contributions
uv-0.624
ourning-0.601
ylan-0.577
 goose-0.527
 weeping-0.510
Token believe
Token ID1975
Feature activation-1.79e-02

Pos logit contributions
leys+0.262
mingham+0.259
 VIDE+0.255
\/\/+0.245
arate+0.245
Neg logit contributions
uv-0.251
ourning-0.239
ylan-0.230
 goose-0.208
iber-0.201
Token that
Token ID326
Feature activation+1.77e-01

Pos logit contributions
uv+0.029
ourning+0.027
ylan+0.026
 goose+0.023
iber+0.022
Neg logit contributions
leys-0.036
mingham-0.036
 VIDE-0.035
\/\/-0.034
arate-0.034
Token These
Token ID2312
Feature activation+2.58e-01

Pos logit contributions
leys+0.291
mingham+0.285
 VIDE+0.284
\/\/+0.273
arate+0.273
Neg logit contributions
uv-0.267
ourning-0.258
ylan-0.246
 goose-0.223
 weeping-0.215
Token include
Token ID2291
Feature activation+2.32e-01

Pos logit contributions
leys+0.509
mingham+0.501
 VIDE+0.494
\/\/+0.479
arate+0.478
Neg logit contributions
uv-0.420
ourning-0.405
ylan-0.383
 goose-0.341
iber-0.329
Token UNESCO
Token ID49372
Feature activation-6.13e-01

Pos logit contributions
leys+0.456
mingham+0.450
 VIDE+0.445
arate+0.431
\/\/+0.428
Neg logit contributions
uv-0.377
ourning-0.362
ylan-0.343
 goose-0.306
iber-0.294
Token,
Token ID11
Feature activation+2.43e-02

Pos logit contributions
uv+1.135
ourning+1.088
ylan+1.045
 goose+0.950
iber+0.917
Neg logit contributions
leys-1.078
mingham-1.062
 VIDE-1.048
\/\/-1.005
arate-1.005
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
leys+0.048
mingham+0.047
 VIDE+0.047
arate+0.045
\/\/+0.045
Neg logit contributions
uv-0.040
ourning-0.038
ylan-0.036
 goose-0.032
iber-0.031
Token United
Token ID1578
Feature activation-1.35e-01

Pos logit contributions
uv+0.026
ourning+0.025
ylan+0.024
 goose+0.021
iber+0.020
Neg logit contributions
leys-0.028
mingham-0.027
 VIDE-0.027
arate-0.026
\/\/-0.026
Token Nations
Token ID7973
Feature activation+1.71e-01

Pos logit contributions
uv+0.253
ourning+0.243
ylan+0.233
 goose+0.212
iber+0.205
Neg logit contributions
leys-0.233
mingham-0.230
 VIDE-0.227
arate-0.218
\/\/-0.218
Token Development
Token ID7712
Feature activation+5.77e-01

Pos logit contributions
leys+0.281
mingham+0.276
 VIDE+0.272
arate+0.263
\/\/+0.261
Neg logit contributions
uv-0.334
ourning-0.324
ylan-0.308
 goose-0.282
iber-0.273
Token Programme
Token ID35232
Feature activation+9.98e-02

Pos logit contributions
leys+0.604
mingham+0.585
 VIDE+0.574
arate+0.543
\/\/+0.538
Neg logit contributions
uv-1.476
ourning-1.437
ylan-1.386
 goose-1.299
iber-1.267
Token (
Token ID357
Feature activation+1.36e-01

Pos logit contributions
leys+0.172
mingham+0.170
 VIDE+0.168
arate+0.161
\/\/+0.161
Neg logit contributions
uv-0.188
ourning-0.180
ylan-0.173
 goose-0.157
iber-0.152
TokenUN
Token ID4944
Feature activation-3.65e-01

Pos logit contributions
leys+0.252
mingham+0.247
 VIDE+0.244
arate+0.236
\/\/+0.236
Neg logit contributions
uv-0.238
ourning-0.229
ylan-0.218
 goose-0.198
iber-0.191
Tokenrance
Token ID8132
Feature activation-3.75e-01

Pos logit contributions
leys+0.054
mingham+0.053
 VIDE+0.051
arate+0.049
\/\/+0.049
Neg logit contributions
uv-0.091
ourning-0.088
ylan-0.085
 goose-0.080
 weeping-0.077
Token,
Token ID11
Feature activation-8.34e-02

Pos logit contributions
uv+0.690
ourning+0.663
ylan+0.637
 goose+0.576
iber+0.557
Neg logit contributions
leys-0.663
mingham-0.653
 VIDE-0.644
arate-0.619
\/\/-0.619
Token characterized
Token ID16264
Feature activation+1.55e-01

Pos logit contributions
uv+0.151
ourning+0.144
ylan+0.138
 goose+0.125
iber+0.121
Neg logit contributions
leys-0.150
mingham-0.148
 VIDE-0.146
\/\/-0.141
arate-0.140
Token clinically
Token ID36688
Feature activation-4.26e-02

Pos logit contributions
leys+0.293
mingham+0.289
 VIDE+0.285
\/\/+0.275
arate+0.274
Neg logit contributions
uv-0.267
ourning-0.255
ylan-0.244
 goose-0.220
iber-0.212
Token by
Token ID416
Feature activation+3.03e-02

Pos logit contributions
uv+0.075
ourning+0.072
ylan+0.069
 goose+0.063
iber+0.060
Neg logit contributions
leys-0.079
mingham-0.077
 VIDE-0.076
\/\/-0.073
arate-0.073
Token multiple
Token ID3294
Feature activation-2.07e-01

Pos logit contributions
leys+0.060
mingham+0.059
 VIDE+0.059
\/\/+0.057
arate+0.057
Neg logit contributions
uv-0.049
ourning-0.047
ylan-0.044
 goose-0.039
iber-0.038
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
uv+0.335
ourning+0.321
ylan+0.305
 goose+0.272
iber+0.263
Neg logit contributions
leys-0.411
mingham-0.405
 VIDE-0.400
\/\/-0.386
arate-0.386
Token tiny
Token ID7009
Feature activation-7.05e-01

Pos logit contributions
uv+0.182
ourning+0.174
ylan+0.165
 goose+0.147
iber+0.142
Neg logit contributions
leys-0.222
mingham-0.220
 VIDE-0.217
\/\/-0.210
arate-0.209
Token,
Token ID11
Feature activation-4.09e-03

Pos logit contributions
uv+1.183
ourning+1.127
ylan+1.079
 goose+0.970
iber+0.933
Neg logit contributions
leys-1.362
mingham-1.345
 VIDE-1.329
\/\/-1.279
arate-1.278
Token punct
Token ID21025
Feature activation+2.76e-01

Pos logit contributions
uv+0.007
ourning+0.007
ylan+0.006
 goose+0.006
iber+0.005
Neg logit contributions
leys-0.008
mingham-0.008
 VIDE-0.008
\/\/-0.007
arate-0.007
Tokenate
Token ID378
Feature activation-1.91e-01

Pos logit contributions
leys+0.566
mingham+0.553
 VIDE+0.544
arate+0.535
\/\/+0.524
Neg logit contributions
uv-0.423
ourning-0.410
ylan-0.390
 goose-0.350
 weeping-0.331
Token."
Token ID526
Feature activation-2.39e-01

Pos logit contributions
uv+0.025
ourning+0.024
ylan+0.023
 goose+0.021
iber+0.020
Neg logit contributions
leys-0.022
mingham-0.022
 VIDE-0.022
\/\/-0.021
arate-0.021
Token
Token ID198
Feature activation+9.40e-03

Pos logit contributions
uv+0.365
ourning+0.347
ylan+0.330
 goose+0.293
iber+0.280
Neg logit contributions
leys-0.499
mingham-0.492
 VIDE-0.488
\/\/-0.471
arate-0.471
Token
Token ID198
Feature activation+8.40e-02

Pos logit contributions
leys+0.019
mingham+0.019
 VIDE+0.019
\/\/+0.018
arate+0.018
Neg logit contributions
uv-0.015
ourning-0.014
ylan-0.013
 goose-0.012
 weeping-0.011
TokenVideo
Token ID10798
Feature activation+2.68e-01

Pos logit contributions
leys+0.126
mingham+0.124
 VIDE+0.122
arate+0.116
\/\/+0.116
Neg logit contributions
uv-0.177
ourning-0.171
ylan-0.165
 goose-0.152
 weeping-0.147
Token:
Token ID25
Feature activation-3.14e-01

Pos logit contributions
leys+0.521
mingham+0.512
 VIDE+0.510
\/\/+0.490
arate+0.488
Neg logit contributions
uv-0.444
ourning-0.426
ylan-0.407
 goose-0.362
iber-0.351
Token CH
Token ID5870
Feature activation-8.67e-01

Pos logit contributions
uv+0.557
ourning+0.533
ylan+0.511
 goose+0.462
iber+0.446
Neg logit contributions
leys-0.576
mingham-0.568
 VIDE-0.562
\/\/-0.539
arate-0.538
TokenI
Token ID40
Feature activation-9.94e-02

Pos logit contributions
uv+1.840
ourning+1.764
ylan+1.711
 goose+1.579
iber+1.530
Neg logit contributions
leys-1.298
mingham-1.276
 VIDE-1.248
\/\/-1.191
arate-1.185
Token@
Token ID31
Feature activation-4.99e-01

Pos logit contributions
uv+0.210
ourning+0.202
ylan+0.196
 goose+0.180
iber+0.175
Neg logit contributions
leys-0.149
mingham-0.146
 VIDE-0.144
\/\/-0.137
arate-0.137
TokenD
Token ID35
Feature activation-5.19e-01

Pos logit contributions
uv+0.773
ourning+0.734
ylan+0.700
 goose+0.622
iber+0.596
Neg logit contributions
leys-1.029
mingham-1.017
 VIDE-1.006
\/\/-0.970
arate-0.969
TokenET
Token ID2767
Feature activation+1.56e-01

Pos logit contributions
uv+0.992
ourning+0.955
ylan+0.914
 goose+0.844
 weeping+0.814
Neg logit contributions
leys-0.868
mingham-0.851
 VIDE-0.843
arate-0.810
\/\/-0.804
Token:
Token ID25
Feature activation-9.46e-02

Pos logit contributions
leys+0.306
mingham+0.302
 VIDE+0.298
\/\/+0.287
arate+0.287
Neg logit contributions
uv-0.258
ourning-0.247
ylan-0.236
 goose-0.212
iber-0.203
Token:
Token ID25
Feature activation+2.47e-02

Pos logit contributions
uv+0.754
ourning+0.715
ylan+0.683
 goose+0.610
iber+0.586
Neg logit contributions
leys-0.950
mingham-0.939
 VIDE-0.927
arate-0.893
\/\/-0.892
Token Le
Token ID1004
Feature activation+1.88e-01

Pos logit contributions
leys+0.048
mingham+0.047
 VIDE+0.047
\/\/+0.045
arate+0.045
Neg logit contributions
uv-0.041
ourning-0.039
ylan-0.038
 goose-0.034
iber-0.033
Tokena
Token ID64
Feature activation-3.02e-01

Pos logit contributions
leys+0.283
mingham+0.278
 VIDE+0.272
\/\/+0.260
arate+0.260
Neg logit contributions
uv-0.398
ourning-0.384
ylan-0.369
 goose-0.342
 weeping-0.331
Token Suzuki
Token ID35807
Feature activation-4.84e-01

Pos logit contributions
uv+0.444
ourning+0.419
ylan+0.402
 goose+0.353
iber+0.338
Neg logit contributions
leys-0.648
mingham-0.639
 VIDE-0.632
\/\/-0.611
arate-0.610
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
uv+0.856
ourning+0.817
ylan+0.792
 goose+0.712
iber+0.691
Neg logit contributions
leys-0.891
mingham-0.877
 VIDE-0.860
arate-0.826
\/\/-0.825
Token The
Token ID383
Feature activation-3.18e-01

Pos logit contributions
leys+0.203
mingham+0.200
 VIDE+0.198
\/\/+0.190
arate+0.190
Neg logit contributions
uv-0.187
ourning-0.179
ylan-0.171
 goose-0.154
iber-0.149
Token Chronicle
Token ID14160
Feature activation-7.55e-03

Pos logit contributions
uv+0.739
ourning+0.712
ylan+0.693
 goose+0.643
iber+0.628
Neg logit contributions
leys-0.408
mingham-0.401
 VIDE-0.394
arate-0.373
\/\/-0.370
Token 2
Token ID362
Feature activation+4.10e-01

Pos logit contributions
uv+0.015
ourning+0.014
ylan+0.014
 goose+0.012
iber+0.012
Neg logit contributions
leys-0.013
mingham-0.013
 VIDE-0.012
arate-0.012
\/\/-0.012
Token of
Token ID286
Feature activation-4.66e-02

Pos logit contributions
leys+0.643
mingham+0.632
 VIDE+0.624
\/\/+0.594
arate+0.594
Neg logit contributions
uv-0.840
ourning-0.808
ylan-0.779
 goose-0.716
iber-0.694
Token 15
Token ID1315
Feature activation-5.26e-02

Pos logit contributions
uv+0.086
ourning+0.082
ylan+0.080
 goose+0.073
iber+0.070
Neg logit contributions
leys-0.082
mingham-0.081
 VIDE-0.078
arate-0.076
\/\/-0.075
Token Photo
Token ID5555
Feature activation-4.65e-01

Pos logit contributions
uv+0.106
ourning+0.102
ylan+0.099
 goose+0.090
iber+0.088
Neg logit contributions
leys-0.083
mingham-0.083
 VIDE-0.081
arate-0.077
\/\/-0.077
Token.
Token ID13
Feature activation-9.47e-02

Pos logit contributions
uv+0.007
ourning+0.007
ylan+0.006
 goose+0.006
iber+0.006
Neg logit contributions
leys-0.006
mingham-0.006
 VIDE-0.006
\/\/-0.006
arate-0.006
Token Weapon
Token ID13072
Feature activation+5.51e-01

Pos logit contributions
uv+0.170
ourning+0.163
ylan+0.156
 goose+0.141
iber+0.136
Neg logit contributions
leys-0.172
mingham-0.169
 VIDE-0.167
\/\/-0.161
arate-0.161
Token person
Token ID1048
Feature activation-8.39e-02

Pos logit contributions
leys+0.977
mingham+0.966
 VIDE+0.951
arate+0.914
\/\/+0.912
Neg logit contributions
uv-1.007
ourning-0.968
ylan-0.929
 goose-0.839
iber-0.809
Token loaded
Token ID9639
Feature activation+3.52e-01

Pos logit contributions
uv+0.158
ourning+0.151
ylan+0.145
 goose+0.132
iber+0.128
Neg logit contributions
leys-0.145
mingham-0.143
 VIDE-0.141
\/\/-0.135
arate-0.135
Token,
Token ID11
Feature activation+2.50e-02

Pos logit contributions
leys+0.628
mingham+0.619
 VIDE+0.611
\/\/+0.586
arate+0.586
Neg logit contributions
uv-0.644
ourning-0.617
ylan-0.592
 goose-0.537
iber-0.518
Token initially
Token ID7317
Feature activation-1.98e-01

Pos logit contributions
leys+0.045
mingham+0.045
 VIDE+0.044
\/\/+0.042
arate+0.042
Neg logit contributions
uv-0.045
ourning-0.043
ylan-0.041
 goose-0.038
iber-0.036
Token failing
Token ID9894
Feature activation+3.78e-01

Pos logit contributions
uv+0.350
ourning+0.334
ylan+0.321
 goose+0.291
iber+0.280
Neg logit contributions
leys-0.364
mingham-0.359
 VIDE-0.355
\/\/-0.341
arate-0.340
Token,
Token ID11
Feature activation-1.93e-02

Pos logit contributions
leys+0.701
mingham+0.692
 VIDE+0.684
\/\/+0.657
arate+0.655
Neg logit contributions
uv-0.666
ourning-0.635
ylan-0.610
 goose-0.552
iber-0.532
Token sec
Token ID792
Feature activation+3.45e-01

Pos logit contributions
uv+0.034
ourning+0.032
ylan+0.031
 goose+0.028
iber+0.027
Neg logit contributions
leys-0.036
mingham-0.035
 VIDE-0.035
\/\/-0.033
arate-0.033
Tokenures
Token ID942
Feature activation-5.06e-01

Pos logit contributions
leys+0.607
mingham+0.597
 VIDE+0.585
\/\/+0.568
arate+0.568
Neg logit contributions
uv-0.636
ourning-0.607
ylan-0.590
 goose-0.534
iber-0.514
Token mark
Token ID1317
Feature activation+3.23e-01

Pos logit contributions
uv+0.919
ourning+0.872
ylan+0.844
 goose+0.769
iber+0.741
Neg logit contributions
leys-0.910
mingham-0.895
 VIDE-0.889
arate-0.848
\/\/-0.847
Tokenée
Token ID22161
Feature activation-6.83e-01

Pos logit contributions
uv+0.724
ourning+0.679
ylan+0.640
 goose+0.551
iber+0.516
Neg logit contributions
leys-1.391
mingham-1.376
 VIDE-1.361
arate-1.322
\/\/-1.320
Token 33
Token ID4747
Feature activation+3.13e-02

Pos logit contributions
uv+1.165
ourning+1.106
ylan+1.060
 goose+0.955
iber+0.920
Neg logit contributions
leys-1.306
mingham-1.287
 VIDE-1.271
\/\/-1.224
arate-1.222
Token01
Token ID486
Feature activation-1.10e-01

Pos logit contributions
leys+0.056
mingham+0.055
 VIDE+0.055
\/\/+0.053
arate+0.053
Neg logit contributions
uv-0.057
ourning-0.054
ylan-0.052
 goose-0.047
iber-0.046
Token,
Token ID11
Feature activation+8.14e-02

Pos logit contributions
uv+0.201
ourning+0.192
ylan+0.185
 goose+0.168
iber+0.162
Neg logit contributions
leys-0.196
mingham-0.193
 VIDE-0.191
\/\/-0.183
arate-0.183
Token un
Token ID555
Feature activation+1.54e-01

Pos logit contributions
leys+0.140
mingham+0.138
 VIDE+0.137
\/\/+0.131
arate+0.131
Neg logit contributions
uv-0.153
ourning-0.147
ylan-0.142
 goose-0.129
iber-0.124
Token grou
Token ID1132
Feature activation-1.21e+00

Pos logit contributions
leys+0.286
mingham+0.282
 VIDE+0.279
arate+0.268
\/\/+0.267
Neg logit contributions
uv-0.268
ourning-0.258
ylan-0.247
 goose-0.223
iber-0.214
Tokenpe
Token ID431
Feature activation+6.01e-01

Pos logit contributions
uv+2.520
ourning+2.411
ylan+2.323
 goose+2.143
iber+2.088
Neg logit contributions
leys-1.847
mingham-1.823
 VIDE-1.800
\/\/-1.716
arate-1.703
Token de
Token ID390
Feature activation+2.52e-01

Pos logit contributions
leys+1.221
mingham+1.209
 VIDE+1.199
arate+1.157
\/\/+1.152
Neg logit contributions
uv-0.928
ourning-0.895
ylan-0.851
 goose-0.753
 weeping-0.723
Token s
Token ID264
Feature activation-4.39e-01

Pos logit contributions
leys+0.471
mingham+0.465
 VIDE+0.460
\/\/+0.441
arate+0.441
Neg logit contributions
uv-0.437
ourning-0.419
ylan-0.401
 goose-0.362
iber-0.349
Tokené
Token ID2634
Feature activation+4.60e-02

Pos logit contributions
uv+0.758
ourning+0.724
ylan+0.694
 goose+0.626
iber+0.601
Neg logit contributions
leys-0.827
mingham-0.817
 VIDE-0.806
arate-0.776
\/\/-0.776
Tokenpar
Token ID1845
Feature activation-7.12e-02

Pos logit contributions
leys+0.103
mingham+0.102
 VIDE+0.101
arate+0.098
\/\/+0.098
Neg logit contributions
uv-0.062
ourning-0.059
ylan-0.056
 goose-0.049
iber-0.046
Token
Token ID198
Feature activation-3.06e-01

Pos logit contributions
uv+0.205
ourning+0.195
ylan+0.186
 goose+0.165
iber+0.159
Neg logit contributions
leys-0.267
mingham-0.264
 VIDE-0.261
\/\/-0.252
arate-0.252
Token
Token ID198
Feature activation-2.07e-01

Pos logit contributions
uv+0.468
ourning+0.447
ylan+0.425
 goose+0.377
iber+0.361
Neg logit contributions
leys-0.634
mingham-0.627
 VIDE-0.618
\/\/-0.600
arate-0.598
Token[
Token ID58
Feature activation-5.04e-01

Pos logit contributions
uv+0.421
ourning+0.407
ylan+0.392
 goose+0.361
 weeping+0.349
Neg logit contributions
leys-0.326
mingham-0.320
 VIDE-0.315
\/\/-0.303
arate-0.302
Token22
Token ID1828
Feature activation-2.89e-01

Pos logit contributions
uv+0.884
ourning+0.843
ylan+0.809
 goose+0.730
iber+0.706
Neg logit contributions
leys-0.935
mingham-0.924
 VIDE-0.913
\/\/-0.875
arate-0.874
Token]
Token ID60
Feature activation-1.78e-01

Pos logit contributions
uv+0.449
ourning+0.429
ylan+0.408
 goose+0.363
 weeping+0.348
Neg logit contributions
leys-0.593
mingham-0.585
 VIDE-0.579
\/\/-0.560
arate-0.559
Token Literally
Token ID48414
Feature activation-1.16e+00

Pos logit contributions
uv+0.314
ourning+0.300
ylan+0.288
 goose+0.260
iber+0.251
Neg logit contributions
leys-0.330
mingham-0.325
 VIDE-0.321
\/\/-0.309
arate-0.308
Token,
Token ID11
Feature activation-2.19e-02

Pos logit contributions
uv+1.917
ourning+1.820
ylan+1.743
 goose+1.567
iber+1.501
Neg logit contributions
leys-2.281
mingham-2.256
 VIDE-2.227
\/\/-2.145
arate-2.141
Token R
Token ID371
Feature activation-8.56e-02

Pos logit contributions
uv+0.038
ourning+0.037
ylan+0.035
 goose+0.032
iber+0.031
Neg logit contributions
leys-0.041
mingham-0.040
 VIDE-0.040
\/\/-0.038
arate-0.038
Tokenashi
Token ID12144
Feature activation-6.93e-03

Pos logit contributions
uv+0.165
ourning+0.158
ylan+0.152
 goose+0.140
iber+0.134
Neg logit contributions
leys-0.144
mingham-0.142
 VIDE-0.140
arate-0.134
\/\/-0.134
Token says
Token ID1139
Feature activation-3.70e-01

Pos logit contributions
uv+0.012
ourning+0.012
ylan+0.011
 goose+0.010
iber+0.010
Neg logit contributions
leys-0.013
mingham-0.012
 VIDE-0.012
\/\/-0.012
arate-0.012
Token,
Token ID11
Feature activation+1.39e-01

Pos logit contributions
uv+0.644
ourning+0.616
ylan+0.591
 goose+0.531
iber+0.513
Neg logit contributions
leys-0.692
mingham-0.682
 VIDE-0.675
\/\/-0.648
arate-0.648
Token.
Token ID13
Feature activation-5.69e-02

Pos logit contributions
uv+1.204
ourning+1.146
ylan+1.105
 goose+1.008
iber+0.978
Neg logit contributions
leys-1.125
mingham-1.106
 VIDE-1.096
arate-1.044
\/\/-1.044
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
uv+0.096
ourning+0.092
ylan+0.088
 goose+0.079
iber+0.076
Neg logit contributions
leys-0.109
mingham-0.108
 VIDE-0.107
\/\/-0.103
arate-0.102
Token
Token ID198
Feature activation-2.35e-01

Pos logit contributions
uv+0.451
ourning+0.429
ylan+0.408
 goose+0.363
 weeping+0.347
Neg logit contributions
leys-0.613
mingham-0.606
 VIDE-0.599
\/\/-0.580
arate-0.579
TokenLocal
Token ID14565
Feature activation+4.97e-01

Pos logit contributions
uv+0.476
ourning+0.457
ylan+0.442
 goose+0.405
iber+0.393
Neg logit contributions
leys-0.372
mingham-0.366
 VIDE-0.361
\/\/-0.344
arate-0.344
Token Basic
Token ID14392
Feature activation+1.45e-01

Pos logit contributions
leys+0.843
mingham+0.831
 VIDE+0.818
\/\/+0.784
arate+0.783
Neg logit contributions
uv-0.950
ourning-0.912
ylan-0.878
 goose-0.798
 weeping-0.769
Token Income
Token ID19003
Feature activation-1.16e+00

Pos logit contributions
leys+0.228
mingham+0.224
 VIDE+0.221
\/\/+0.211
arate+0.211
Neg logit contributions
uv-0.295
ourning-0.283
ylan-0.273
 goose-0.251
iber-0.244
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
uv+1.963
ourning+1.855
ylan+1.797
 goose+1.631
iber+1.565
Neg logit contributions
leys-2.199
mingham-2.169
 VIDE-2.153
\/\/-2.070
arate-2.064
Token
Token ID198
Feature activation-9.79e-02

Pos logit contributions
uv+0.264
ourning+0.252
ylan+0.240
 goose+0.213
iber+0.204
Neg logit contributions
leys-0.357
mingham-0.353
 VIDE-0.348
\/\/-0.337
arate-0.337
TokenBasic
Token ID26416
Feature activation-9.84e-02

Pos logit contributions
uv+0.202
ourning+0.194
ylan+0.188
 goose+0.172
iber+0.167
Neg logit contributions
leys-0.151
mingham-0.149
 VIDE-0.147
\/\/-0.140
arate-0.140
Token income
Token ID3739
Feature activation-1.03e+00

Pos logit contributions
uv+0.194
ourning+0.186
ylan+0.180
 goose+0.165
iber+0.160
Neg logit contributions
leys-0.160
mingham-0.158
 VIDE-0.157
\/\/-0.149
arate-0.149
Token in
Token ID287
Feature activation-2.91e-01

Pos logit contributions
uv+1.752
ourning+1.667
ylan+1.603
 goose+1.449
iber+1.388
Neg logit contributions
leys-1.979
mingham-1.950
 VIDE-1.932
\/\/-1.856
arate-1.854
Token |
Token ID930
Feature activation-1.48e-02

Pos logit contributions
uv+1.100
ourning+1.056
ylan+1.018
 goose+0.928
iber+0.899
Neg logit contributions
leys-0.985
mingham-0.967
 VIDE-0.956
\/\/-0.915
arate-0.909
Token ISI
Token ID22048
Feature activation+2.17e-02

Pos logit contributions
uv+0.026
ourning+0.025
ylan+0.024
 goose+0.021
iber+0.021
Neg logit contributions
leys-0.027
mingham-0.027
 VIDE-0.027
arate-0.026
\/\/-0.026
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
leys+0.045
mingham+0.044
 VIDE+0.044
\/\/+0.043
arate+0.043
Neg logit contributions
uv-0.033
ourning-0.032
ylan-0.030
 goose-0.027
iber-0.026
Token
Token ID198
Feature activation-6.62e-02

Pos logit contributions
uv+0.237
ourning+0.227
ylan+0.216
 goose+0.191
iber+0.184
Neg logit contributions
leys-0.314
mingham-0.310
 VIDE-0.306
\/\/-0.297
arate-0.296
TokenP
Token ID47
Feature activation+4.89e-02

Pos logit contributions
uv+0.136
ourning+0.131
ylan+0.126
 goose+0.116
 weeping+0.112
Neg logit contributions
leys-0.103
mingham-0.101
 VIDE-0.100
\/\/-0.096
arate-0.095
Tokenel
Token ID417
Feature activation-1.03e+00

Pos logit contributions
leys+0.091
mingham+0.089
 VIDE+0.087
arate+0.085
\/\/+0.084
Neg logit contributions
uv-0.086
ourning-0.082
ylan-0.079
 goose-0.072
 weeping-0.069
Tokenleg
Token ID1455
Feature activation-5.86e-02

Pos logit contributions
uv+1.519
ourning+1.428
ylan+1.368
 goose+1.187
iber+1.164
Neg logit contributions
leys-2.185
 VIDE-2.163
mingham-2.162
\/\/-2.081
arate-2.073
Tokenr
Token ID81
Feature activation-1.85e-01

Pos logit contributions
uv+0.127
ourning+0.122
ylan+0.118
 goose+0.109
iber+0.106
Neg logit contributions
leys-0.085
mingham-0.083
 VIDE-0.082
arate-0.078
\/\/-0.078
Tokenini
Token ID5362
Feature activation-4.00e-01

Pos logit contributions
uv+0.435
ourning+0.418
ylan+0.408
 goose+0.375
iber+0.369
Neg logit contributions
leys-0.235
mingham-0.230
 VIDE-0.228
\/\/-0.215
arate-0.212
Token,
Token ID11
Feature activation+7.65e-03

Pos logit contributions
uv+0.737
ourning+0.705
ylan+0.678
 goose+0.616
iber+0.595
Neg logit contributions
leys-0.710
mingham-0.699
 VIDE-0.691
\/\/-0.662
arate-0.661
Token A
Token ID317
Feature activation-8.48e-02

Pos logit contributions
leys+0.013
mingham+0.013
 VIDE+0.012
\/\/+0.012
arate+0.012
Neg logit contributions
uv-0.015
ourning-0.014
ylan-0.014
 goose-0.013
iber-0.012
Token memories
Token ID9846
Feature activation-1.65e-01

Pos logit contributions
leys+0.554
mingham+0.546
 VIDE+0.539
\/\/+0.518
arate+0.517
Neg logit contributions
uv-0.508
ourning-0.486
ylan-0.465
 goose-0.417
iber-0.404
Token...
Token ID986
Feature activation+1.16e-01

Pos logit contributions
uv+0.305
ourning+0.292
ylan+0.281
 goose+0.254
iber+0.246
Neg logit contributions
leys-0.293
mingham-0.288
 VIDE-0.285
\/\/-0.273
arate-0.273
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
leys+0.228
mingham+0.224
 VIDE+0.223
\/\/+0.215
arate+0.214
Neg logit contributions
uv-0.189
ourning-0.180
ylan-0.173
 goose-0.153
iber-0.148
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
leys+0.274
mingham+0.270
 VIDE+0.266
\/\/+0.259
arate+0.259
Neg logit contributions
uv-0.213
ourning-0.206
ylan-0.195
 goose-0.175
 weeping-0.168
TokenMy
Token ID3666
Feature activation-3.05e-01

Pos logit contributions
leys+0.184
mingham+0.180
 VIDE+0.178
\/\/+0.170
arate+0.170
Neg logit contributions
uv-0.239
ourning-0.231
ylan-0.223
 goose-0.205
iber-0.198
Tokenster
Token ID1706
Feature activation-1.57e+00

Pos logit contributions
uv+0.516
ourning+0.493
ylan+0.472
 goose+0.423
iber+0.408
Neg logit contributions
leys-0.587
mingham-0.578
 VIDE-0.572
\/\/-0.550
arate-0.550
Tokenies
Token ID444
Feature activation-3.60e-01

Pos logit contributions
uv+3.133
ourning+2.984
ylan+2.885
 goose+2.666
iber+2.600
Neg logit contributions
 VIDE-2.492
leys-2.489
mingham-2.479
arate-2.349
\/\/-2.336
Token of
Token ID286
Feature activation-1.84e-01

Pos logit contributions
uv+0.710
ourning+0.683
ylan+0.659
 goose+0.601
iber+0.583
Neg logit contributions
leys-0.588
mingham-0.578
 VIDE-0.571
\/\/-0.546
arate-0.545
Token a
Token ID257
Feature activation-1.94e-01

Pos logit contributions
uv+0.300
ourning+0.285
ylan+0.273
 goose+0.244
iber+0.235
Neg logit contributions
leys-0.366
mingham-0.361
 VIDE-0.357
\/\/-0.344
arate-0.343
Token human
Token ID1692
Feature activation+4.40e-02

Pos logit contributions
uv+0.309
ourning+0.294
ylan+0.281
 goose+0.249
iber+0.241
Neg logit contributions
leys-0.392
mingham-0.386
 VIDE-0.383
\/\/-0.368
arate-0.368
Token turned
Token ID2900
Feature activation-8.49e-02

Pos logit contributions
leys+0.079
mingham+0.078
 VIDE+0.077
\/\/+0.074
arate+0.074
Neg logit contributions
uv-0.079
ourning-0.076
ylan-0.073
 goose-0.066
iber-0.064