TOP ACTIVATIONS
MAX = 1.544

established dwellings from investors falls away sharply , as the tax
produced by Cub by Bro ccoli or the Bro ccoli family
and St icker brush Symphony ( a
Bro ccoli or the Bro ccoli family ( E on Productions
plan to re build the U .
and reliable bi - l ateral partner , but
, including headset , cr anks and brakes . The wheels
- era standard of " prep onder ance of the evidence
receiver with an internal fer rite rod antenna and start moving
it just hasn 't gone away ," he says . He
the pass , wide receiver Emmanuel Sanders said .
ext = ($ text - split "\ r ")
that these mushrooms cause some nausea . suggest s the use
() ap . add _ argument (" - i ", "
pretty good that you found yourself saying something like this :
the controversy the frontrunner found himself in after initially failing in
early in 1989 he found himself in the London office of
he met Cub by Bro ccoli : ----- 1954 : "
systems include Cloud based Sc rub bing centers and Content Distribution
get something . We came away thinking we could have got

MIN ACTIVATIONS
MIN = -1.543

s nomination to the Supreme Court . In brief , the
step . " Direct , structured communication is vital
Movement ( V iking , 2006 ). Ms .
show up at the Supreme Court building and start working .
your opponent is able to KO every one of his Master
/ violence _ in jury _ pre vention / key _
who . int / violence _ in jury _ pre vention
Will be uploading clips of heroes on my youtube at https
) <|endoftext|> United States Supreme Court case Santa Clara
: 47 ] < literally _ worst _ user > lol
isting violations of the Supreme Court Orders on Aadhaar
to get back on the court at the beginning of this
responded by signing the " Direct ive of Peoples ' Comm
Free better known as Mister Miracle has been a
t want to play mirror matches . So , I
ways to be able to direct more light reflected by the
3 240 Posts # 10 Happy it worked out . June
disruptive in terms of foreign direct investment . Will large companies
December 28 ; the Vin cci La Plant aci ón del
and will ship to you direct from the manufacturer . Thus

INTERVAL 0.772 - 1.544
CONTAINS 0.302%

if a driver wants to turn right at an intersection then
() ap . add _ argument (" - i ", "
such opinions were out of sync with the times .
full approval . In the meantime , make sure to check
Here are the six camera accessories Adam Pretty would recommend

INTERVAL 0.001 - 0.772
CONTAINS 49.741%

said in a statement . The United States intends
a definite push to keep this momentum going ," he signs
fuzzy to boot . Due to the third game
batch ing in the stub library code Serial ization
two goals were erected . Le igh Wi ener

INTERVAL -0.771 - 0.001
CONTAINS 49.588%

planned to happen in two phases , according to project manager
areas such as agriculture . Today , we live in a
an annotation automatically added to all of their existing YouTube videos
- ren ouncing . Values created out of the will to
in the Lif eline program . The company received

INTERVAL -1.543 - -0.771
CONTAINS 0.370%

. 0 f ); RAW _ ASS ERT ( it ->
on a Regular Bas is : There
. It was the exact opposite of The Pur ge .
Elaine Qu ij ano of CBS let Pence get away with
pretty good idea that the party wanted Rachel ,
Token established
Token ID4920
Feature activation-2.43e-02

Pos logit contributions
allery+0.244
tnc+0.237
 Klu+0.222
erd+0.201
oidal+0.199
Neg logit contributions
 Rabbit-0.189
Pan-0.178
 2048-0.175
-0.167
pain-0.166
Token dwellings
Token ID50078
Feature activation+1.61e-01

Pos logit contributions
allery+0.068
tnc+0.067
 Klu+0.063
erd+0.057
oidal+0.057
Neg logit contributions
 Rabbit-0.044
Pan-0.041
 2048-0.040
-0.038
pain-0.038
Token from
Token ID422
Feature activation+4.28e-02

Pos logit contributions
 Rabbit+0.310
Pan+0.291
 2048+0.286
+0.273
pain+0.270
Neg logit contributions
allery-0.434
tnc-0.423
 Klu-0.397
erd-0.362
oidal-0.358
Token investors
Token ID7713
Feature activation-1.66e-01

Pos logit contributions
 Rabbit+0.082
Pan+0.078
 2048+0.076
+0.073
pain+0.072
Neg logit contributions
allery-0.114
tnc-0.111
 Klu-0.104
erd-0.095
 concess-0.094
Token falls
Token ID8953
Feature activation-3.43e-01

Pos logit contributions
allery+0.445
tnc+0.431
 Klu+0.406
erd+0.372
oidal+0.368
Neg logit contributions
 Rabbit-0.319
Pan-0.301
 2048-0.295
-0.282
pain-0.277
Token away
Token ID1497
Feature activation+1.54e+00

Pos logit contributions
allery+0.915
tnc+0.889
 Klu+0.835
erd+0.760
oidal+0.753
Neg logit contributions
 Rabbit-0.665
Pan-0.626
 2048-0.612
-0.585
pain-0.580
Token sharply
Token ID18939
Feature activation-1.07e-01

Pos logit contributions
 Rabbit+2.753
 2048+2.562
Pan+2.548
+2.394
--------+2.378
Neg logit contributions
allery-4.338
tnc-4.270
 Klu-3.987
erd-3.661
oidal-3.595
Token,
Token ID11
Feature activation-2.54e-01

Pos logit contributions
allery+0.287
tnc+0.279
 Klu+0.262
erd+0.238
oidal+0.236
Neg logit contributions
 Rabbit-0.205
Pan-0.192
 2048-0.189
-0.180
pain-0.178
Token as
Token ID355
Feature activation-1.47e-01

Pos logit contributions
allery+0.706
tnc+0.688
 Klu+0.647
erd+0.591
oidal+0.584
Neg logit contributions
 Rabbit-0.465
Pan-0.436
 2048-0.428
-0.406
pain-0.402
Token the
Token ID262
Feature activation-3.42e-02

Pos logit contributions
allery+0.394
tnc+0.383
 Klu+0.359
erd+0.327
oidal+0.324
Neg logit contributions
 Rabbit-0.283
Pan-0.266
 2048-0.262
-0.249
pain-0.247
Token tax
Token ID1687
Feature activation+4.73e-01

Pos logit contributions
allery+0.092
tnc+0.089
 Klu+0.084
erd+0.076
 concess+0.076
Neg logit contributions
 Rabbit-0.065
Pan-0.062
 2048-0.060
-0.058
pain-0.057
Token produced
Token ID4635
Feature activation+5.58e-02

Pos logit contributions
 Rabbit+0.053
Pan+0.050
 2048+0.049
+0.046
pain+0.046
Neg logit contributions
allery-0.087
tnc-0.085
 Klu-0.080
erd-0.073
oidal-0.073
Token by
Token ID416
Feature activation+3.10e-01

Pos logit contributions
 Rabbit+0.101
Pan+0.094
 2048+0.093
+0.088
pain+0.087
Neg logit contributions
allery-0.156
tnc-0.153
 Klu-0.143
erd-0.131
oidal-0.130
Token Cub
Token ID7070
Feature activation-1.85e-01

Pos logit contributions
 Rabbit+0.551
Pan+0.514
 2048+0.506
+0.478
pain+0.474
Neg logit contributions
allery-0.882
tnc-0.860
 Klu-0.810
erd-0.741
oidal-0.733
Tokenby
Token ID1525
Feature activation+2.17e-01

Pos logit contributions
allery+0.451
tnc+0.438
 Klu+0.407
erd+0.367
oidal+0.363
Neg logit contributions
 Rabbit-0.401
Pan-0.379
 2048-0.375
-0.358
pain-0.355
Token Bro
Token ID2806
Feature activation+2.87e-01

Pos logit contributions
 Rabbit+0.395
Pan+0.369
 2048+0.364
+0.344
pain+0.341
Neg logit contributions
allery-0.606
tnc-0.592
 Klu-0.555
erd-0.508
oidal-0.503
Tokenccoli
Token ID34544
Feature activation+1.54e+00

Pos logit contributions
 Rabbit+0.622
Pan+0.588
 2048+0.580
+0.554
pain+0.550
Neg logit contributions
allery-0.704
tnc-0.685
 Klu-0.635
erd-0.572
oidal-0.567
Token or
Token ID393
Feature activation+1.59e-01

Pos logit contributions
 Rabbit+2.670
Pan+2.490
 2048+2.442
--------+2.299
 Panama+2.266
Neg logit contributions
allery-4.409
tnc-4.345
 Klu-4.008
erd-3.693
oidal-3.684
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
 Rabbit+0.285
Pan+0.267
 2048+0.262
+0.248
pain+0.246
Neg logit contributions
allery-0.448
tnc-0.437
 Klu-0.411
erd-0.376
oidal-0.372
Token Bro
Token ID2806
Feature activation+1.96e-01

Pos logit contributions
 Rabbit+0.339
Pan+0.317
 2048+0.312
+0.295
pain+0.293
Neg logit contributions
allery-0.523
tnc-0.510
 Klu-0.480
erd-0.438
oidal-0.434
Tokenccoli
Token ID34544
Feature activation+1.46e+00

Pos logit contributions
 Rabbit+0.418
Pan+0.395
 2048+0.389
+0.371
pain+0.369
Neg logit contributions
allery-0.488
tnc-0.474
 Klu-0.440
erd-0.397
oidal-0.393
Token family
Token ID1641
Feature activation+1.62e-02

Pos logit contributions
 Rabbit+2.518
Pan+2.322
 2048+2.269
 Panama+2.126
pain+2.114
Neg logit contributions
allery-4.217
tnc-4.162
 Klu-3.856
erd-3.548
oidal-3.524
Token and
Token ID290
Feature activation+9.40e-02

Pos logit contributions
 Rabbit+0.363
Pan+0.343
 2048+0.339
+0.324
pain+0.321
Neg logit contributions
allery-0.392
tnc-0.380
 Klu-0.353
erd-0.318
oidal-0.314
Token
Token ID564
Feature activation+1.13e-01

Pos logit contributions
 Rabbit+0.186
Pan+0.175
 2048+0.172
+0.164
pain+0.163
Neg logit contributions
allery-0.247
tnc-0.241
 Klu-0.225
erd-0.205
oidal-0.203
Token
Token ID246
Feature activation+4.20e-01

Pos logit contributions
 Rabbit+0.173
Pan+0.159
 2048+0.156
+0.146
pain+0.145
Neg logit contributions
allery-0.348
tnc-0.340
 Klu-0.321
erd-0.297
oidal-0.294
TokenSt
Token ID1273
Feature activation+5.44e-01

Pos logit contributions
 Rabbit+0.771
Pan+0.722
 2048+0.710
+0.672
pain+0.667
Neg logit contributions
allery-1.170
tnc-1.141
 Klu-1.071
erd-0.979
oidal-0.969
Tokenicker
Token ID15799
Feature activation+1.02e-01

Pos logit contributions
 Rabbit+1.200
Pan+1.135
 2048+1.121
+1.071
pain+1.064
Neg logit contributions
allery-1.313
tnc-1.276
 Klu-1.185
erd-1.065
oidal-1.053
Tokenbrush
Token ID32680
Feature activation+1.48e+00

Pos logit contributions
 Rabbit+0.176
Pan+0.164
 2048+0.162
+0.154
--------+0.152
Neg logit contributions
allery-0.291
tnc-0.283
 Klu-0.268
erd-0.245
oidal-0.242
Token Symphony
Token ID33195
Feature activation+1.09e+00

Pos logit contributions
 Rabbit+2.286
Pan+2.096
 2048+2.080
pain+1.920
+1.900
Neg logit contributions
allery-4.576
tnc-4.483
 Klu-4.206
erd-3.892
oidal-3.873
Token
Token ID447
Feature activation+2.11e-01

Pos logit contributions
 Rabbit+1.365
Pan+1.239
 2048+1.218
+1.101
--------+1.092
Neg logit contributions
allery-3.681
tnc-3.606
 Klu-3.422
erd-3.188
oidal-3.155
Token
Token ID247
Feature activation+2.30e-01

Pos logit contributions
 Rabbit+0.328
Pan+0.305
 2048+0.296
pain+0.279
 Panama+0.277
Neg logit contributions
allery-0.643
tnc-0.630
 Klu-0.596
erd-0.553
oidal-0.546
Token (
Token ID357
Feature activation+2.00e-01

Pos logit contributions
 Rabbit+0.415
Pan+0.387
 2048+0.381
+0.360
pain+0.357
Neg logit contributions
allery-0.648
tnc-0.632
 Klu-0.593
erd-0.543
oidal-0.538
Tokena
Token ID64
Feature activation-1.54e-01

Pos logit contributions
 Rabbit+0.388
Pan+0.363
 2048+0.359
+0.341
pain+0.337
Neg logit contributions
allery-0.535
tnc-0.522
 Klu-0.488
erd-0.445
oidal-0.440
Token Bro
Token ID2806
Feature activation+2.87e-01

Pos logit contributions
 Rabbit+0.395
Pan+0.369
 2048+0.364
+0.344
pain+0.341
Neg logit contributions
allery-0.606
tnc-0.592
 Klu-0.555
erd-0.508
oidal-0.503
Tokenccoli
Token ID34544
Feature activation+1.54e+00

Pos logit contributions
 Rabbit+0.622
Pan+0.588
 2048+0.580
+0.554
pain+0.550
Neg logit contributions
allery-0.704
tnc-0.685
 Klu-0.635
erd-0.572
oidal-0.567
Token or
Token ID393
Feature activation+1.59e-01

Pos logit contributions
 Rabbit+2.670
Pan+2.490
 2048+2.442
--------+2.299
 Panama+2.266
Neg logit contributions
allery-4.409
tnc-4.345
 Klu-4.008
erd-3.693
oidal-3.684
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
 Rabbit+0.285
Pan+0.267
 2048+0.262
+0.248
pain+0.246
Neg logit contributions
allery-0.448
tnc-0.437
 Klu-0.411
erd-0.376
oidal-0.372
Token Bro
Token ID2806
Feature activation+1.96e-01

Pos logit contributions
 Rabbit+0.339
Pan+0.317
 2048+0.312
+0.295
pain+0.293
Neg logit contributions
allery-0.523
tnc-0.510
 Klu-0.480
erd-0.438
oidal-0.434
Tokenccoli
Token ID34544
Feature activation+1.46e+00

Pos logit contributions
 Rabbit+0.418
Pan+0.395
 2048+0.389
+0.371
pain+0.369
Neg logit contributions
allery-0.488
tnc-0.474
 Klu-0.440
erd-0.397
oidal-0.393
Token family
Token ID1641
Feature activation+1.62e-02

Pos logit contributions
 Rabbit+2.518
Pan+2.322
 2048+2.269
 Panama+2.126
pain+2.114
Neg logit contributions
allery-4.217
tnc-4.162
 Klu-3.856
erd-3.548
oidal-3.524
Token (
Token ID357
Feature activation-2.06e-02

Pos logit contributions
 Rabbit+0.030
Pan+0.028
 2048+0.028
+0.026
pain+0.026
Neg logit contributions
allery-0.045
tnc-0.044
 Klu-0.041
erd-0.038
oidal-0.037
TokenE
Token ID36
Feature activation-5.07e-01

Pos logit contributions
allery+0.055
tnc+0.053
 Klu+0.050
erd+0.045
oidal+0.045
Neg logit contributions
 Rabbit-0.041
Pan-0.038
 2048-0.038
-0.036
pain-0.035
Tokenon
Token ID261
Feature activation+5.11e-01

Pos logit contributions
allery+1.194
tnc+1.158
 Klu+1.073
erd+0.969
oidal+0.950
Neg logit contributions
 Rabbit-1.143
Pan-1.083
 2048-1.073
-1.024
pain-1.018
Token Productions
Token ID25315
Feature activation+1.05e-01

Pos logit contributions
 Rabbit+0.947
Pan+0.886
 2048+0.872
+0.826
pain+0.819
Neg logit contributions
allery-1.415
tnc-1.381
 Klu-1.294
erd-1.182
oidal-1.171
Token plan
Token ID1410
Feature activation-1.56e-02

Pos logit contributions
 Rabbit+0.038
Pan+0.036
 2048+0.035
+0.034
pain+0.033
Neg logit contributions
allery-0.050
tnc-0.049
 Klu-0.046
erd-0.041
oidal-0.041
Token to
Token ID284
Feature activation+1.32e-01

Pos logit contributions
allery+0.042
tnc+0.041
 Klu+0.039
erd+0.035
oidal+0.035
Neg logit contributions
 Rabbit-0.030
Pan-0.028
 2048-0.028
-0.026
pain-0.026
Token
Token ID564
Feature activation+3.11e-01

Pos logit contributions
 Rabbit+0.278
Pan+0.261
 2048+0.258
+0.246
pain+0.245
Neg logit contributions
allery-0.331
tnc-0.323
 Klu-0.301
erd-0.272
oidal-0.270
Token
Token ID250
Feature activation+9.88e-03

Pos logit contributions
 Rabbit+0.401
Pan+0.365
 2048+0.354
pain+0.321
 Panama+0.319
Neg logit contributions
allery-1.031
tnc-1.015
 Klu-0.966
erd-0.899
oidal-0.887
Tokenre
Token ID260
Feature activation+2.93e-01

Pos logit contributions
 Rabbit+0.020
Pan+0.019
 2048+0.019
+0.018
pain+0.018
Neg logit contributions
allery-0.025
tnc-0.025
 Klu-0.023
erd-0.021
oidal-0.021
Tokenbuild
Token ID11249
Feature activation+1.43e+00

Pos logit contributions
 Rabbit+0.610
Pan+0.575
 2048+0.568
+0.541
pain+0.537
Neg logit contributions
allery-0.744
tnc-0.723
 Klu-0.674
erd-0.611
oidal-0.604
Token
Token ID447
Feature activation+1.48e-01

Pos logit contributions
 Rabbit+2.430
Pan+2.291
 2048+2.250
+2.138
 Panama+2.126
Neg logit contributions
allery-4.100
tnc-3.986
 Klu-3.750
erd-3.444
oidal-3.419
Token
Token ID251
Feature activation-2.37e-04

Pos logit contributions
 Rabbit+0.199
Pan+0.183
 2048+0.174
pain+0.163
sec+0.159
Neg logit contributions
allery-0.476
tnc-0.471
 Klu-0.450
erd-0.418
oidal-0.411
Token the
Token ID262
Feature activation+2.07e-01

Pos logit contributions
allery+0.001
tnc+0.001
 Klu+0.001
erd+0.001
oidal+0.001
Neg logit contributions
 Rabbit-0.000
Pan-0.000
 2048-0.000
-0.000
pain-0.000
Token U
Token ID471
Feature activation+2.19e-02

Pos logit contributions
 Rabbit+0.349
Pan+0.324
 2048+0.318
+0.300
pain+0.297
Neg logit contributions
allery-0.607
tnc-0.592
 Klu-0.559
erd-0.513
oidal-0.508
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
 Rabbit+0.045
Pan+0.041
 2048+0.041
+0.039
pain+0.039
Neg logit contributions
allery-0.056
tnc-0.055
 Klu-0.052
erd-0.047
oidal-0.046
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 Rabbit+0.863
Pan+0.812
 2048+0.799
+0.763
pain+0.756
Neg logit contributions
allery-1.115
tnc-1.084
 Klu-1.015
erd-0.918
oidal-0.909
Token reliable
Token ID9314
Feature activation-8.43e-02

Pos logit contributions
allery+0.038
tnc+0.037
 Klu+0.035
erd+0.032
oidal+0.031
Neg logit contributions
 Rabbit-0.028
Pan-0.026
 2048-0.025
-0.024
pain-0.024
Token bi
Token ID3182
Feature activation-4.49e-01

Pos logit contributions
allery+0.202
tnc+0.196
 Klu+0.184
erd+0.164
oidal+0.162
Neg logit contributions
 Rabbit-0.186
Pan-0.176
 2048-0.173
-0.167
pain-0.165
Token-
Token ID12
Feature activation-7.83e-02

Pos logit contributions
allery+1.055
tnc+1.018
 Klu+0.946
erd+0.849
oidal+0.840
Neg logit contributions
 Rabbit-1.026
Pan-0.954
 2048-0.944
pain-0.898
-0.898
Tokenl
Token ID75
Feature activation+6.69e-02

Pos logit contributions
allery+0.208
tnc+0.204
 Klu+0.188
erd+0.173
oidal+0.171
Neg logit contributions
 Rabbit-0.154
Pan-0.141
 2048-0.140
-0.133
---------0.131
Tokenateral
Token ID10534
Feature activation+1.42e+00

Pos logit contributions
 Rabbit+0.090
Pan+0.080
 2048+0.079
+0.073
pain+0.072
Neg logit contributions
allery-0.220
tnc-0.215
 Klu-0.204
erd-0.190
oidal-0.189
Token partner
Token ID5212
Feature activation+3.65e-01

Pos logit contributions
 Rabbit+3.095
Pan+2.923
 2048+2.904
 civilian+2.815
+2.778
Neg logit contributions
allery-3.406
tnc-3.320
 Klu-3.083
erd-2.802
oidal-2.716
Token,
Token ID11
Feature activation-2.89e-02

Pos logit contributions
 Rabbit+0.669
Pan+0.626
 2048+0.619
+0.583
pain+0.580
Neg logit contributions
allery-1.015
tnc-0.991
 Klu-0.928
erd-0.850
oidal-0.841
Token
Token ID447
Feature activation+1.57e-01

Pos logit contributions
allery+0.078
tnc+0.076
 Klu+0.071
erd+0.065
oidal+0.064
Neg logit contributions
 Rabbit-0.055
Pan-0.052
 2048-0.051
-0.049
pain-0.048
Token
Token ID251
Feature activation-1.95e-02

Pos logit contributions
 Rabbit+0.208
Pan+0.190
 2048+0.183
pain+0.169
 Panama+0.166
Neg logit contributions
allery-0.518
tnc-0.507
 Klu-0.483
erd-0.448
oidal-0.443
Token but
Token ID475
Feature activation+1.94e-01

Pos logit contributions
allery+0.053
tnc+0.051
 Klu+0.048
erd+0.044
oidal+0.043
Neg logit contributions
 Rabbit-0.037
Pan-0.035
 2048-0.034
-0.033
pain-0.032
Token,
Token ID11
Feature activation-9.17e-02

Pos logit contributions
 Rabbit+0.312
Pan+0.295
 2048+0.290
+0.276
--------+0.274
Neg logit contributions
allery-0.407
tnc-0.394
 Klu-0.370
erd-0.336
oidal-0.334
Token including
Token ID1390
Feature activation+3.26e-02

Pos logit contributions
allery+0.253
tnc+0.247
 Klu+0.232
erd+0.212
oidal+0.210
Neg logit contributions
 Rabbit-0.169
Pan-0.159
 2048-0.156
-0.149
pain-0.147
Token headset
Token ID23492
Feature activation+5.05e-01

Pos logit contributions
 Rabbit+0.061
Pan+0.057
 2048+0.056
+0.054
pain+0.053
Neg logit contributions
allery-0.089
tnc-0.087
 Klu-0.082
erd-0.075
oidal-0.074
Token,
Token ID11
Feature activation-2.11e-01

Pos logit contributions
 Rabbit+0.922
Pan+0.861
 2048+0.848
+0.801
pain+0.795
Neg logit contributions
allery-1.409
tnc-1.376
 Klu-1.289
erd-1.180
oidal-1.167
Token cr
Token ID1067
Feature activation-1.20e-01

Pos logit contributions
allery+0.561
tnc+0.546
 Klu+0.513
erd+0.465
oidal+0.461
Neg logit contributions
 Rabbit-0.411
Pan-0.389
 2048-0.382
-0.366
pain-0.361
Tokenanks
Token ID2283
Feature activation+1.40e+00

Pos logit contributions
allery+0.329
tnc+0.320
 Klu+0.299
erd+0.276
oidal+0.274
Neg logit contributions
 Rabbit-0.225
Pan-0.210
 2048-0.208
-0.197
 Panama-0.194
Token and
Token ID290
Feature activation-5.79e-02

Pos logit contributions
 Rabbit+2.642
Pan+2.483
 2048+2.432
+2.313
pain+2.310
Neg logit contributions
allery-3.812
tnc-3.735
 Klu-3.478
erd-3.155
oidal-3.128
Token brakes
Token ID23746
Feature activation+5.73e-01

Pos logit contributions
allery+0.153
tnc+0.149
 Klu+0.140
erd+0.127
oidal+0.126
Neg logit contributions
 Rabbit-0.113
Pan-0.107
 2048-0.105
-0.101
pain-0.099
Token.
Token ID13
Feature activation-1.00e-01

Pos logit contributions
 Rabbit+1.058
Pan+0.988
 2048+0.975
+0.921
pain+0.913
Neg logit contributions
allery-1.591
tnc-1.555
 Klu-1.455
erd-1.330
oidal-1.315
Token The
Token ID383
Feature activation-1.48e-01

Pos logit contributions
allery+0.298
tnc+0.292
 Klu+0.276
erd+0.253
oidal+0.252
Neg logit contributions
 Rabbit-0.162
Pan-0.151
 2048-0.148
-0.140
pain-0.138
Token wheels
Token ID13666
Feature activation+6.78e-01

Pos logit contributions
allery+0.380
tnc+0.370
 Klu+0.346
erd+0.313
oidal+0.310
Neg logit contributions
 Rabbit-0.300
Pan-0.284
 2048-0.279
-0.268
pain-0.265
Token-
Token ID12
Feature activation+1.50e-01

Pos logit contributions
allery+0.118
tnc+0.115
 Klu+0.109
erd+0.100
oidal+0.099
Neg logit contributions
 Rabbit-0.072
Pan-0.067
 2048-0.066
-0.062
pain-0.062
Tokenera
Token ID8607
Feature activation+5.50e-01

Pos logit contributions
 Rabbit+0.275
Pan+0.254
 2048+0.252
+0.238
pain+0.235
Neg logit contributions
allery-0.415
tnc-0.408
 Klu-0.381
erd-0.350
oidal-0.346
Token standard
Token ID3210
Feature activation+3.03e-01

Pos logit contributions
 Rabbit+1.035
Pan+0.963
 2048+0.956
+0.902
 civilian+0.896
Neg logit contributions
allery-1.515
tnc-1.482
 Klu-1.375
erd-1.259
oidal-1.247
Token of
Token ID286
Feature activation+1.31e-01

Pos logit contributions
 Rabbit+0.565
Pan+0.528
 2048+0.522
+0.494
pain+0.489
Neg logit contributions
allery-0.836
tnc-0.816
 Klu-0.763
erd-0.697
oidal-0.691
Token "
Token ID366
Feature activation+1.40e-01

Pos logit contributions
 Rabbit+0.252
Pan+0.237
 2048+0.233
+0.221
pain+0.220
Neg logit contributions
allery-0.351
tnc-0.341
 Klu-0.320
erd-0.291
oidal-0.288
Tokenprep
Token ID46012
Feature activation+1.40e+00

Pos logit contributions
 Rabbit+0.311
Pan+0.294
 2048+0.290
+0.277
pain+0.275
Neg logit contributions
allery-0.334
tnc-0.324
 Klu-0.301
erd-0.270
oidal-0.267
Tokenonder
Token ID8623
Feature activation+3.55e-01

Pos logit contributions
 Rabbit+2.899
Pan+2.747
 2048+2.717
+2.600
pain+2.581
Neg logit contributions
allery-3.513
tnc-3.441
 Klu-3.204
erd-2.887
oidal-2.850
Tokenance
Token ID590
Feature activation-3.05e-02

Pos logit contributions
 Rabbit+0.846
Pan+0.804
 2048+0.796
+0.764
pain+0.760
Neg logit contributions
allery-0.791
tnc-0.768
 Klu-0.707
erd-0.628
oidal-0.621
Token of
Token ID286
Feature activation-1.04e-01

Pos logit contributions
allery+0.085
tnc+0.083
 Klu+0.077
erd+0.070
oidal+0.070
Neg logit contributions
 Rabbit-0.057
Pan-0.053
 2048-0.052
-0.050
pain-0.049
Token the
Token ID262
Feature activation+2.89e-03

Pos logit contributions
allery+0.262
tnc+0.255
 Klu+0.237
erd+0.214
oidal+0.212
Neg logit contributions
 Rabbit-0.216
Pan-0.204
 2048-0.201
-0.192
pain-0.190
Token evidence
Token ID2370
Feature activation+6.28e-02

Pos logit contributions
 Rabbit+0.007
Pan+0.007
 2048+0.007
+0.007
 civilian+0.007
Neg logit contributions
allery-0.006
tnc-0.006
 Klu-0.005
erd-0.005
oidal-0.004
Token receiver
Token ID9733
Feature activation-2.17e-01

Pos logit contributions
allery+2.290
tnc+2.244
 Klu+2.088
erd+1.947
oidal+1.899
Neg logit contributions
 Rabbit-1.738
Pan-1.634
 2048-1.625
-1.547
pain-1.538
Token with
Token ID351
Feature activation+1.14e-02

Pos logit contributions
allery+0.592
tnc+0.578
 Klu+0.546
erd+0.500
oidal+0.496
Neg logit contributions
 Rabbit-0.399
Pan-0.375
 2048-0.368
-0.354
pain-0.349
Token an
Token ID281
Feature activation-3.41e-01

Pos logit contributions
 Rabbit+0.021
Pan+0.020
 2048+0.020
+0.019
pain+0.018
Neg logit contributions
allery-0.031
tnc-0.030
 Klu-0.029
erd-0.026
oidal-0.026
Token internal
Token ID5387
Feature activation-1.80e-01

Pos logit contributions
allery+0.888
tnc+0.864
 Klu+0.807
erd+0.733
oidal+0.725
Neg logit contributions
 Rabbit-0.687
Pan-0.647
 2048-0.638
-0.607
pain-0.603
Token fer
Token ID11354
Feature activation-3.25e-01

Pos logit contributions
allery+0.481
tnc+0.468
 Klu+0.439
erd+0.400
oidal+0.396
Neg logit contributions
 Rabbit-0.347
Pan-0.327
 2048-0.321
-0.307
pain-0.304
Tokenrite
Token ID6525
Feature activation+1.37e+00

Pos logit contributions
allery+1.046
tnc+1.018
 Klu+0.965
erd+0.902
oidal+0.896
Neg logit contributions
 Rabbit-0.452
Pan-0.413
 2048-0.406
-0.374
pain-0.371
Token rod
Token ID15299
Feature activation+1.90e-02

Pos logit contributions
 Rabbit+2.788
Pan+2.597
 2048+2.592
 civilian+2.439
+2.426
Neg logit contributions
allery-3.510
tnc-3.462
 Klu-3.171
erd-2.868
 concess-2.818
Token antenna
Token ID20509
Feature activation-8.23e-02

Pos logit contributions
 Rabbit+0.035
Pan+0.033
 2048+0.032
+0.031
pain+0.030
Neg logit contributions
allery-0.053
tnc-0.051
 Klu-0.048
erd-0.044
oidal-0.044
Token and
Token ID290
Feature activation-4.32e-02

Pos logit contributions
allery+0.223
tnc+0.217
 Klu+0.204
erd+0.188
oidal+0.186
Neg logit contributions
 Rabbit-0.155
Pan-0.145
 2048-0.143
-0.136
pain-0.135
Token start
Token ID923
Feature activation+4.06e-01

Pos logit contributions
allery+0.112
tnc+0.109
 Klu+0.101
erd+0.092
oidal+0.091
Neg logit contributions
 Rabbit-0.088
Pan-0.083
 2048-0.082
-0.077
pain-0.077
Token moving
Token ID3867
Feature activation+9.65e-03

Pos logit contributions
 Rabbit+0.722
Pan+0.672
 2048+0.663
+0.625
pain+0.620
Neg logit contributions
allery-1.157
tnc-1.128
 Klu-1.058
erd-0.970
oidal-0.960
Token it
Token ID340
Feature activation+2.56e-02

Pos logit contributions
 Rabbit+0.153
Pan+0.144
 2048+0.142
+0.135
pain+0.134
Neg logit contributions
allery-0.207
tnc-0.202
 Klu-0.188
erd-0.171
oidal-0.170
Token just
Token ID655
Feature activation+2.31e-01

Pos logit contributions
 Rabbit+0.055
Pan+0.052
 2048+0.051
+0.049
pain+0.048
Neg logit contributions
allery-0.064
tnc-0.062
 Klu-0.058
erd-0.052
oidal-0.051
Token hasn
Token ID5818
Feature activation+5.95e-01

Pos logit contributions
 Rabbit+0.423
Pan+0.395
 2048+0.389
+0.367
pain+0.366
Neg logit contributions
allery-0.645
tnc-0.630
 Klu-0.589
erd-0.538
oidal-0.534
Token't
Token ID470
Feature activation+2.73e-01

Pos logit contributions
 Rabbit+0.811
Pan+0.741
 2048+0.725
+0.669
pain+0.663
Neg logit contributions
allery-1.941
tnc-1.896
 Klu-1.797
erd-1.666
oidal-1.653
Token gone
Token ID3750
Feature activation-8.70e-02

Pos logit contributions
 Rabbit+0.512
Pan+0.480
 2048+0.472
+0.448
pain+0.443
Neg logit contributions
allery-0.746
tnc-0.727
 Klu-0.684
erd-0.624
oidal-0.618
Token away
Token ID1497
Feature activation+1.33e+00

Pos logit contributions
allery+0.204
tnc+0.198
 Klu+0.187
erd+0.167
oidal+0.163
Neg logit contributions
 Rabbit-0.193
Pan-0.184
 2048-0.181
-0.174
pain-0.173
Token,"
Token ID553
Feature activation-2.22e-01

Pos logit contributions
 Rabbit+2.461
Pan+2.295
 2048+2.282
pain+2.146
+2.139
Neg logit contributions
allery-3.699
tnc-3.608
 Klu-3.349
erd-3.073
oidal-3.031
Token he
Token ID339
Feature activation-1.58e-01

Pos logit contributions
allery+0.559
tnc+0.543
 Klu+0.508
erd+0.458
oidal+0.452
Neg logit contributions
 Rabbit-0.464
Pan-0.439
 2048-0.433
-0.413
pain-0.410
Token says
Token ID1139
Feature activation+8.39e-02

Pos logit contributions
allery+0.394
tnc+0.383
 Klu+0.356
erd+0.322
oidal+0.318
Neg logit contributions
 Rabbit-0.336
Pan-0.318
 2048-0.314
-0.299
pain-0.297
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
 Rabbit+0.154
Pan+0.145
 2048+0.142
+0.136
pain+0.135
Neg logit contributions
allery-0.234
tnc-0.229
 Klu-0.213
erd-0.195
oidal-0.192
Token He
Token ID679
Feature activation-1.47e-01

Pos logit contributions
allery+0.502
tnc+0.492
 Klu+0.466
erd+0.425
oidal+0.422
Neg logit contributions
 Rabbit-0.276
Pan-0.257
 2048-0.253
-0.239
pain-0.236
Token the
Token ID262
Feature activation+2.54e-02

Pos logit contributions
allery+0.815
tnc+0.793
 Klu+0.744
erd+0.675
oidal+0.669
Neg logit contributions
 Rabbit-0.578
Pan-0.545
 2048-0.535
-0.509
pain-0.504
Token pass
Token ID1208
Feature activation-9.43e-02

Pos logit contributions
 Rabbit+0.053
Pan+0.049
 2048+0.049
+0.046
pain+0.046
Neg logit contributions
allery-0.065
tnc-0.063
 Klu-0.059
erd-0.053
oidal-0.053
Token,
Token ID11
Feature activation-5.37e-02

Pos logit contributions
allery+0.218
tnc+0.213
 Klu+0.195
erd+0.175
oidal+0.173
Neg logit contributions
 Rabbit-0.215
Pan-0.206
 2048-0.202
-0.194
pain-0.192
Token
Token ID447
Feature activation-1.52e-01

Pos logit contributions
allery+0.151
tnc+0.148
 Klu+0.138
erd+0.127
oidal+0.126
Neg logit contributions
 Rabbit-0.097
Pan-0.090
 2048-0.089
-0.084
pain-0.083
Token
Token ID251
Feature activation+1.61e-01

Pos logit contributions
allery+0.502
tnc+0.492
 Klu+0.471
erd+0.436
oidal+0.430
Neg logit contributions
 Rabbit-0.196
Pan-0.182
 2048-0.173
pain-0.160
 Panama-0.158
Token wide
Token ID3094
Feature activation+1.33e+00

Pos logit contributions
 Rabbit+0.326
Pan+0.306
 2048+0.302
+0.287
pain+0.285
Neg logit contributions
allery-0.420
tnc-0.408
 Klu-0.380
erd-0.346
oidal-0.343
Token receiver
Token ID9733
Feature activation-3.45e-01

Pos logit contributions
 Rabbit+2.744
Pan+2.610
 2048+2.598
+2.481
--------+2.460
Neg logit contributions
allery-3.321
tnc-3.246
 Klu-3.007
erd-2.718
oidal-2.691
Token Emmanuel
Token ID32390
Feature activation-1.86e-01

Pos logit contributions
allery+0.925
tnc+0.900
 Klu+0.847
erd+0.764
oidal+0.760
Neg logit contributions
 Rabbit-0.666
Pan-0.627
 2048-0.617
-0.584
pain-0.580
Token Sanders
Token ID5831
Feature activation-8.48e-02

Pos logit contributions
allery+0.396
tnc+0.383
 Klu+0.360
erd+0.314
oidal+0.311
Neg logit contributions
 Rabbit-0.452
Pan-0.437
 2048-0.430
pain-0.414
-0.409
Token said
Token ID531
Feature activation-3.79e-01

Pos logit contributions
allery+0.185
tnc+0.179
 Klu+0.165
erd+0.147
oidal+0.145
Neg logit contributions
 Rabbit-0.206
Pan-0.196
 2048-0.194
-0.186
pain-0.185
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
allery+1.014
tnc+0.988
 Klu+0.928
erd+0.844
oidal+0.834
Neg logit contributions
 Rabbit-0.735
Pan-0.689
 2048-0.679
-0.645
pain-0.638
Tokenext
Token ID2302
Feature activation-5.28e-02

Pos logit contributions
 Rabbit+0.781
Pan+0.746
 2048+0.734
+0.703
pain+0.701
Neg logit contributions
allery-0.806
tnc-0.785
 Klu-0.729
erd-0.650
oidal-0.644
Token =
Token ID796
Feature activation-7.57e-01

Pos logit contributions
allery+0.123
tnc+0.120
 Klu+0.111
erd+0.099
oidal+0.098
Neg logit contributions
 Rabbit-0.120
Pan-0.115
 2048-0.113
-0.108
pain-0.108
Token ($
Token ID7198
Feature activation-6.13e-02

Pos logit contributions
allery+2.040
tnc+1.996
 Klu+1.868
erd+1.702
oidal+1.685
Neg logit contributions
 Rabbit-1.434
Pan-1.349
 2048-1.319
-1.268
pain-1.255
Tokentext
Token ID5239
Feature activation+3.67e-01

Pos logit contributions
allery+0.156
tnc+0.151
 Klu+0.142
erd+0.127
oidal+0.126
Neg logit contributions
 Rabbit-0.127
Pan-0.121
 2048-0.119
-0.113
pain-0.113
Token -
Token ID532
Feature activation+7.65e-02

Pos logit contributions
 Rabbit+0.706
Pan+0.665
 2048+0.654
+0.621
pain+0.616
Neg logit contributions
allery-0.986
tnc-0.960
 Klu-0.899
erd-0.819
oidal-0.810
Tokensplit
Token ID35312
Feature activation+1.32e+00

Pos logit contributions
 Rabbit+0.139
Pan+0.130
 2048+0.128
+0.121
pain+0.120
Neg logit contributions
allery-0.214
tnc-0.209
 Klu-0.195
erd-0.179
oidal-0.177
Token "\
Token ID37082
Feature activation+2.72e-01

Pos logit contributions
 Rabbit+2.215
Pan+2.071
 2048+2.055
+1.899
pain+1.882
Neg logit contributions
allery-3.933
tnc-3.788
 Klu-3.566
erd-3.273
oidal-3.232
Tokenr
Token ID81
Feature activation+1.64e-01

Pos logit contributions
 Rabbit+0.551
Pan+0.519
 2048+0.512
+0.487
pain+0.484
Neg logit contributions
allery-0.704
tnc-0.685
 Klu-0.640
erd-0.580
oidal-0.574
Token
Token ID198
Feature activation+1.61e-01

Pos logit contributions
 Rabbit+0.305
Pan+0.284
 2048+0.281
+0.269
pain+0.264
Neg logit contributions
allery-0.451
tnc-0.441
 Klu-0.413
erd-0.378
oidal-0.374
Token
Token ID198
Feature activation+8.49e-02

Pos logit contributions
 Rabbit+0.304
Pan+0.283
 2048+0.280
+0.268
 civilian+0.263
Neg logit contributions
allery-0.437
tnc-0.430
 Klu-0.402
erd-0.368
oidal-0.364
Token")
Token ID4943
Feature activation+5.25e-01

Pos logit contributions
 Rabbit+0.153
Pan+0.142
 2048+0.141
+0.133
pain+0.131
Neg logit contributions
allery-0.239
tnc-0.233
 Klu-0.219
erd-0.201
oidal-0.199
Token that
Token ID326
Feature activation+3.55e-01

Pos logit contributions
allery+0.448
tnc+0.437
 Klu+0.410
erd+0.374
oidal+0.371
Neg logit contributions
 Rabbit-0.307
Pan-0.288
 2048-0.283
-0.269
pain-0.266
Token these
Token ID777
Feature activation+1.59e-01

Pos logit contributions
 Rabbit+0.690
Pan+0.645
 2048+0.639
+0.605
pain+0.601
Neg logit contributions
allery-0.950
tnc-0.925
 Klu-0.863
erd-0.787
oidal-0.779
Token mushrooms
Token ID23452
Feature activation+3.93e-02

Pos logit contributions
 Rabbit+0.326
Pan+0.306
 2048+0.303
+0.288
pain+0.286
Neg logit contributions
allery-0.408
tnc-0.396
 Klu-0.369
erd-0.335
oidal-0.331
Token cause
Token ID2728
Feature activation+7.10e-02

Pos logit contributions
 Rabbit+0.082
Pan+0.077
 2048+0.076
+0.072
pain+0.072
Neg logit contributions
allery-0.100
tnc-0.097
 Klu-0.091
erd-0.082
oidal-0.081
Token some
Token ID617
Feature activation+1.92e-01

Pos logit contributions
 Rabbit+0.137
Pan+0.128
 2048+0.127
+0.120
pain+0.119
Neg logit contributions
allery-0.191
tnc-0.186
 Klu-0.174
erd-0.158
oidal-0.157
Token nausea
Token ID32122
Feature activation+1.32e+00

Pos logit contributions
 Rabbit+0.370
Pan+0.347
 2048+0.342
+0.324
pain+0.322
Neg logit contributions
allery-0.518
tnc-0.505
 Klu-0.472
erd-0.430
oidal-0.426
Token.
Token ID13
Feature activation-1.64e-01

Pos logit contributions
 Rabbit+2.448
Pan+2.286
 2048+2.257
pain+2.144
--------+2.130
Neg logit contributions
allery-3.659
tnc-3.569
 Klu-3.315
erd-3.039
 concess-2.985
Tokensuggest
Token ID47811
Feature activation-1.96e-01

Pos logit contributions
allery+0.467
tnc+0.457
 Klu+0.435
erd+0.395
oidal+0.394
Neg logit contributions
 Rabbit-0.279
Pan-0.261
 2048-0.257
-0.245
pain-0.241
Tokens
Token ID82
Feature activation+1.11e-01

Pos logit contributions
allery+0.402
tnc+0.389
 Klu+0.356
erd+0.314
oidal+0.310
Neg logit contributions
 Rabbit-0.502
Pan-0.478
 2048-0.473
-0.455
pain-0.452
Token the
Token ID262
Feature activation+5.15e-02

Pos logit contributions
 Rabbit+0.201
Pan+0.188
 2048+0.185
+0.175
pain+0.173
Neg logit contributions
allery-0.312
tnc-0.305
 Klu-0.286
erd-0.262
oidal-0.259
Token use
Token ID779
Feature activation-1.33e-01

Pos logit contributions
 Rabbit+0.105
Pan+0.099
 2048+0.097
+0.093
pain+0.092
Neg logit contributions
allery-0.133
tnc-0.129
 Klu-0.121
erd-0.109
oidal-0.108
Token()
Token ID3419
Feature activation+4.68e-01

Pos logit contributions
allery+0.102
tnc+0.099
 Klu+0.094
erd+0.086
oidal+0.085
Neg logit contributions
 Rabbit-0.056
Pan-0.051
 2048-0.051
-0.048
pain-0.047
Token ap
Token ID2471
Feature activation+1.51e-01

Pos logit contributions
 Rabbit+0.881
Pan+0.823
 2048+0.819
+0.765
 Panama+0.761
Neg logit contributions
allery-1.281
tnc-1.250
 Klu-1.167
erd-1.068
oidal-1.057
Token.
Token ID13
Feature activation+8.95e-02

Pos logit contributions
 Rabbit+0.296
Pan+0.278
 2048+0.274
+0.261
pain+0.258
Neg logit contributions
allery-0.400
tnc-0.389
 Klu-0.364
erd-0.331
oidal-0.328
Tokenadd
Token ID2860
Feature activation+3.58e-01

Pos logit contributions
 Rabbit+0.184
Pan+0.174
 2048+0.171
+0.163
pain+0.162
Neg logit contributions
allery-0.230
tnc-0.223
 Klu-0.209
erd-0.188
oidal-0.187
Token_
Token ID62
Feature activation-6.49e-01

Pos logit contributions
 Rabbit+0.824
Pan+0.790
 2048+0.776
pain+0.741
+0.741
Neg logit contributions
allery-0.832
tnc-0.797
 Klu-0.742
erd-0.657
oidal-0.654
Tokenargument
Token ID49140
Feature activation+1.32e+00

Pos logit contributions
allery+1.356
tnc+1.325
 Klu+1.209
erd+1.078
oidal+1.056
Neg logit contributions
 Rabbit-1.629
Pan-1.544
 2048-1.532
-1.481
---------1.458
Token("
Token ID7203
Feature activation+2.76e-01

Pos logit contributions
 Rabbit+1.922
Pan+1.789
 2048+1.752
+1.622
--------+1.607
Neg logit contributions
allery-4.147
tnc-4.017
 Klu-3.825
erd-3.518
oidal-3.503
Token-
Token ID12
Feature activation+2.24e-01

Pos logit contributions
 Rabbit+0.538
Pan+0.507
 2048+0.499
+0.473
pain+0.471
Neg logit contributions
allery-0.737
tnc-0.714
 Klu-0.670
erd-0.608
oidal-0.603
Tokeni
Token ID72
Feature activation+3.42e-01

Pos logit contributions
 Rabbit+0.463
Pan+0.436
 2048+0.431
+0.411
pain+0.407
Neg logit contributions
allery-0.571
tnc-0.556
 Klu-0.518
erd-0.469
oidal-0.464
Token",
Token ID1600
Feature activation+6.52e-01

Pos logit contributions
 Rabbit+0.489
Pan+0.447
 2048+0.439
+0.408
pain+0.403
Neg logit contributions
allery-1.090
tnc-1.067
 Klu-1.010
erd-0.935
oidal-0.927
Token "
Token ID366
Feature activation+7.56e-02

Pos logit contributions
 Rabbit+1.021
Pan+0.945
 2048+0.935
+0.867
pain+0.863
Neg logit contributions
allery-1.991
tnc-1.933
 Klu-1.832
erd-1.687
oidal-1.672
Token pretty
Token ID2495
Feature activation-1.83e-01

Pos logit contributions
allery+0.566
tnc+0.550
 Klu+0.515
erd+0.467
oidal+0.462
Neg logit contributions
 Rabbit-0.435
Pan-0.410
 2048-0.403
-0.384
pain-0.382
Token good
Token ID922
Feature activation+1.92e-01

Pos logit contributions
allery+0.476
tnc+0.463
 Klu+0.433
erd+0.394
oidal+0.389
Neg logit contributions
 Rabbit-0.366
Pan-0.345
 2048-0.340
-0.323
pain-0.321
Token that
Token ID326
Feature activation-8.07e-02

Pos logit contributions
 Rabbit+0.351
Pan+0.327
 2048+0.323
+0.304
pain+0.302
Neg logit contributions
allery-0.535
tnc-0.523
 Klu-0.489
erd-0.448
oidal-0.444
Token you
Token ID345
Feature activation-1.60e-01

Pos logit contributions
allery+0.207
tnc+0.201
 Klu+0.187
erd+0.170
oidal+0.168
Neg logit contributions
 Rabbit-0.166
Pan-0.156
 2048-0.154
-0.147
pain-0.146
Token found
Token ID1043
Feature activation-2.45e-01

Pos logit contributions
allery+0.417
tnc+0.405
 Klu+0.380
erd+0.346
oidal+0.341
Neg logit contributions
 Rabbit-0.319
Pan-0.300
 2048-0.296
-0.283
pain-0.279
Token yourself
Token ID3511
Feature activation+1.31e+00

Pos logit contributions
allery+0.705
tnc+0.687
 Klu+0.650
erd+0.593
oidal+0.588
Neg logit contributions
 Rabbit-0.426
Pan-0.398
 2048-0.390
-0.370
pain-0.365
Token saying
Token ID2282
Feature activation+3.07e-01

Pos logit contributions
 Rabbit+2.365
Pan+2.173
 2048+2.167
 civilian+2.050
pain+2.040
Neg logit contributions
allery-3.734
tnc-3.620
 Klu-3.303
erd-3.084
oidal-3.061
Token something
Token ID1223
Feature activation+2.46e-01

Pos logit contributions
 Rabbit+0.548
Pan+0.511
 2048+0.505
+0.475
pain+0.472
Neg logit contributions
allery-0.870
tnc-0.849
 Klu-0.796
erd-0.730
oidal-0.723
Token like
Token ID588
Feature activation+2.19e-01

Pos logit contributions
 Rabbit+0.455
Pan+0.426
 2048+0.420
+0.396
pain+0.395
Neg logit contributions
allery-0.686
tnc-0.668
 Klu-0.621
erd-0.570
oidal-0.563
Token this
Token ID428
Feature activation+2.59e-01

Pos logit contributions
 Rabbit+0.380
Pan+0.354
 2048+0.349
+0.328
pain+0.325
Neg logit contributions
allery-0.630
tnc-0.615
 Klu-0.578
erd-0.531
oidal-0.526
Token:
Token ID25
Feature activation-1.17e-02

Pos logit contributions
 Rabbit+0.475
Pan+0.443
 2048+0.438
+0.413
pain+0.411
Neg logit contributions
allery-0.722
tnc-0.704
 Klu-0.659
erd-0.603
oidal-0.597
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
 Rabbit+0.114
Pan+0.106
 2048+0.105
+0.099
pain+0.098
Neg logit contributions
allery-0.167
tnc-0.163
 Klu-0.152
erd-0.139
oidal-0.138
Token controversy
Token ID10386
Feature activation-1.84e-01

Pos logit contributions
 Rabbit+0.215
Pan+0.201
 2048+0.199
+0.188
pain+0.187
Neg logit contributions
allery-0.295
tnc-0.287
 Klu-0.267
erd-0.245
oidal-0.242
Token the
Token ID262
Feature activation+1.20e-01

Pos logit contributions
allery+0.464
tnc+0.452
 Klu+0.420
erd+0.380
oidal+0.376
Neg logit contributions
 Rabbit-0.388
Pan-0.365
 2048-0.361
-0.344
pain-0.342
Token frontrunner
Token ID49981
Feature activation+5.80e-02

Pos logit contributions
 Rabbit+0.241
Pan+0.227
 2048+0.224
+0.212
pain+0.211
Neg logit contributions
allery-0.313
tnc-0.305
 Klu-0.283
erd-0.258
oidal-0.255
Token found
Token ID1043
Feature activation+8.67e-02

Pos logit contributions
 Rabbit+0.116
Pan+0.109
 2048+0.107
+0.102
pain+0.101
Neg logit contributions
allery-0.153
tnc-0.149
 Klu-0.138
erd-0.126
oidal-0.125
Token himself
Token ID2241
Feature activation+1.31e+00

Pos logit contributions
 Rabbit+0.169
Pan+0.159
 2048+0.157
+0.149
pain+0.148
Neg logit contributions
allery-0.232
tnc-0.226
 Klu-0.210
erd-0.192
oidal-0.190
Token in
Token ID287
Feature activation+1.89e-01

Pos logit contributions
 Rabbit+2.218
 2048+2.025
Pan+2.011
 civilian+1.927
pain+1.924
Neg logit contributions
allery-3.840
tnc-3.768
 Klu-3.426
erd-3.173
ioxide-3.167
Token after
Token ID706
Feature activation+1.06e-01

Pos logit contributions
 Rabbit+0.359
Pan+0.335
 2048+0.333
pain+0.312
+0.312
Neg logit contributions
allery-0.516
tnc-0.503
 Klu-0.468
erd-0.428
oidal-0.425
Token initially
Token ID7317
Feature activation+6.28e-01

Pos logit contributions
 Rabbit+0.196
Pan+0.184
 2048+0.181
+0.171
pain+0.170
Neg logit contributions
allery-0.293
tnc-0.285
 Klu-0.269
erd-0.245
oidal-0.242
Token failing
Token ID9894
Feature activation-3.01e-01

Pos logit contributions
 Rabbit+1.043
Pan+0.962
 2048+0.951
+0.886
pain+0.885
Neg logit contributions
allery-1.870
tnc-1.827
 Klu-1.701
erd-1.575
oidal-1.560
Token in
Token ID287
Feature activation-1.85e-02

Pos logit contributions
allery+0.838
tnc+0.815
 Klu+0.757
erd+0.695
oidal+0.685
Neg logit contributions
 Rabbit-0.560
Pan-0.524
 2048-0.513
-0.493
sec-0.483
Token early
Token ID1903
Feature activation+2.19e-01

Pos logit contributions
 Rabbit+0.402
Pan+0.375
 2048+0.371
+0.351
pain+0.349
Neg logit contributions
allery-0.562
tnc-0.548
 Klu-0.510
erd-0.467
oidal-0.462
Token in
Token ID287
Feature activation+1.29e-01

Pos logit contributions
 Rabbit+0.402
Pan+0.376
 2048+0.370
+0.350
pain+0.347
Neg logit contributions
allery-0.608
tnc-0.593
 Klu-0.556
erd-0.508
oidal-0.503
Token 1989
Token ID11104
Feature activation+5.87e-02

Pos logit contributions
 Rabbit+0.242
Pan+0.227
 2048+0.224
+0.212
pain+0.211
Neg logit contributions
allery-0.352
tnc-0.343
 Klu-0.322
erd-0.293
oidal-0.290
Token he
Token ID339
Feature activation-6.35e-02

Pos logit contributions
 Rabbit+0.112
Pan+0.104
 2048+0.103
+0.097
pain+0.096
Neg logit contributions
allery-0.160
tnc-0.156
 Klu-0.146
erd-0.133
oidal-0.132
Token found
Token ID1043
Feature activation+1.26e-01

Pos logit contributions
allery+0.169
tnc+0.165
 Klu+0.156
erd+0.142
oidal+0.140
Neg logit contributions
 Rabbit-0.122
Pan-0.115
 2048-0.113
-0.108
pain-0.107
Token himself
Token ID2241
Feature activation+1.30e+00

Pos logit contributions
 Rabbit+0.247
Pan+0.233
 2048+0.229
+0.218
pain+0.216
Neg logit contributions
allery-0.334
tnc-0.326
 Klu-0.305
erd-0.277
oidal-0.274
Token in
Token ID287
Feature activation+3.56e-02

Pos logit contributions
 Rabbit+2.297
 2048+2.103
Pan+2.092
 civilian+1.999
 Panama+1.972
Neg logit contributions
allery-3.728
tnc-3.613
 Klu-3.329
erd-3.089
oidal-3.076
Token the
Token ID262
Feature activation-4.81e-02

Pos logit contributions
 Rabbit+0.067
Pan+0.062
 2048+0.061
+0.058
pain+0.058
Neg logit contributions
allery-0.098
tnc-0.095
 Klu-0.089
erd-0.082
oidal-0.081
Token London
Token ID3576
Feature activation-3.26e-01

Pos logit contributions
allery+0.129
tnc+0.126
 Klu+0.118
erd+0.107
oidal+0.106
Neg logit contributions
 Rabbit-0.093
Pan-0.087
 2048-0.086
-0.082
pain-0.081
Token office
Token ID2607
Feature activation+4.50e-01

Pos logit contributions
allery+0.902
tnc+0.879
 Klu+0.829
erd+0.754
oidal+0.746
Neg logit contributions
 Rabbit-0.595
Pan-0.560
 2048-0.550
-0.526
pain-0.518
Token of
Token ID286
Feature activation+4.46e-02

Pos logit contributions
 Rabbit+0.821
Pan+0.757
 2048+0.756
pain+0.709
 civilian+0.704
Neg logit contributions
allery-1.273
tnc-1.242
 Klu-1.153
erd-1.057
oidal-1.052
Token he
Token ID339
Feature activation-5.24e-02

Pos logit contributions
allery+0.405
tnc+0.394
 Klu+0.372
erd+0.337
oidal+0.333
Neg logit contributions
 Rabbit-0.284
Pan-0.267
 2048-0.263
-0.251
pain-0.248
Token met
Token ID1138
Feature activation+1.95e-01

Pos logit contributions
allery+0.140
tnc+0.136
 Klu+0.129
erd+0.117
oidal+0.116
Neg logit contributions
 Rabbit-0.101
Pan-0.095
 2048-0.093
-0.089
pain-0.088
Token Cub
Token ID7070
Feature activation-8.64e-02

Pos logit contributions
 Rabbit+0.356
Pan+0.334
 2048+0.328
+0.311
pain+0.309
Neg logit contributions
allery-0.545
tnc-0.531
 Klu-0.500
erd-0.456
oidal-0.451
Tokenby
Token ID1525
Feature activation+8.97e-02

Pos logit contributions
allery+0.214
tnc+0.210
 Klu+0.195
erd+0.177
oidal+0.176
Neg logit contributions
 Rabbit-0.183
Pan-0.173
 2048-0.170
-0.163
---------0.161
Token Bro
Token ID2806
Feature activation+4.50e-01

Pos logit contributions
 Rabbit+0.162
Pan+0.151
 2048+0.149
+0.140
pain+0.139
Neg logit contributions
allery-0.252
tnc-0.247
 Klu-0.231
erd-0.211
oidal-0.209
Tokenccoli
Token ID34544
Feature activation+1.30e+00

Pos logit contributions
 Rabbit+0.952
Pan+0.899
 2048+0.888
+0.846
pain+0.842
Neg logit contributions
allery-1.127
tnc-1.092
 Klu-1.018
erd-0.917
oidal-0.908
Token:
Token ID25
Feature activation-3.50e-01

Pos logit contributions
 Rabbit+2.324
Pan+2.172
 2048+2.135
--------+2.004
+1.996
Neg logit contributions
allery-3.661
tnc-3.589
 Klu-3.331
erd-3.049
oidal-3.037
Token-----
Token ID30934
Feature activation-1.34e-01

Pos logit contributions
allery+0.965
tnc+0.938
 Klu+0.885
erd+0.807
oidal+0.795
Neg logit contributions
 Rabbit-0.646
Pan-0.603
 2048-0.599
-0.566
pain-0.563
Token 1954
Token ID24718
Feature activation-1.30e-02

Pos logit contributions
allery+0.379
tnc+0.369
 Klu+0.348
erd+0.318
oidal+0.314
Neg logit contributions
 Rabbit-0.236
Pan-0.220
 2048-0.217
-0.205
pain-0.204
Token:
Token ID25
Feature activation-3.02e-01

Pos logit contributions
allery+0.037
tnc+0.036
 Klu+0.034
erd+0.031
oidal+0.031
Neg logit contributions
 Rabbit-0.023
Pan-0.022
 2048-0.021
-0.020
pain-0.020
Token "
Token ID366
Feature activation+1.04e-01

Pos logit contributions
allery+0.880
tnc+0.858
 Klu+0.810
erd+0.742
oidal+0.734
Neg logit contributions
 Rabbit-0.510
Pan-0.474
 2048-0.469
-0.441
pain-0.439
Token systems
Token ID3341
Feature activation+2.38e-01

Pos logit contributions
 Rabbit+0.053
Pan+0.050
 2048+0.049
+0.046
pain+0.046
Neg logit contributions
allery-0.076
tnc-0.074
 Klu-0.069
erd-0.063
oidal-0.062
Token include
Token ID2291
Feature activation+2.54e-01

Pos logit contributions
 Rabbit+0.503
Pan+0.474
 2048+0.468
+0.447
pain+0.443
Neg logit contributions
allery-0.597
tnc-0.581
 Klu-0.541
erd-0.489
oidal-0.484
Token Cloud
Token ID10130
Feature activation+1.14e-01

Pos logit contributions
 Rabbit+0.444
Pan+0.412
 2048+0.407
+0.383
pain+0.379
Neg logit contributions
allery-0.730
tnc-0.713
 Klu-0.669
erd-0.615
oidal-0.607
Token based
Token ID1912
Feature activation+3.39e-01

Pos logit contributions
 Rabbit+0.215
Pan+0.202
 2048+0.200
+0.190
pain+0.187
Neg logit contributions
allery-0.311
tnc-0.304
 Klu-0.285
erd-0.260
oidal-0.258
Token Sc
Token ID1446
Feature activation-3.36e-01

Pos logit contributions
 Rabbit+0.624
Pan+0.582
 2048+0.575
+0.542
pain+0.538
Neg logit contributions
allery-0.944
tnc-0.921
 Klu-0.862
erd-0.789
oidal-0.780
Tokenrub
Token ID25089
Feature activation+1.29e+00

Pos logit contributions
allery+0.913
tnc+0.882
 Klu+0.826
erd+0.761
oidal+0.751
Neg logit contributions
 Rabbit-0.642
Pan-0.598
 2048-0.594
-0.563
 civilian-0.556
Tokenbing
Token ID4623
Feature activation+3.43e-01

Pos logit contributions
 Rabbit+2.490
Pan+2.325
 2048+2.292
+2.167
sec+2.156
Neg logit contributions
allery-3.448
tnc-3.377
 Klu-3.152
erd-2.841
 concess-2.838
Token centers
Token ID10399
Feature activation+3.54e-01

Pos logit contributions
 Rabbit+0.605
Pan+0.565
 2048+0.556
+0.525
pain+0.521
Neg logit contributions
allery-0.978
tnc-0.954
 Klu-0.896
erd-0.822
oidal-0.814
Token and
Token ID290
Feature activation+1.23e-02

Pos logit contributions
 Rabbit+0.619
Pan+0.577
 2048+0.568
+0.536
pain+0.531
Neg logit contributions
allery-1.015
tnc-0.990
 Klu-0.931
erd-0.854
oidal-0.846
Token Content
Token ID14041
Feature activation+2.64e-01

Pos logit contributions
 Rabbit+0.021
Pan+0.020
 2048+0.020
+0.019
pain+0.018
Neg logit contributions
allery-0.035
tnc-0.035
 Klu-0.033
erd-0.030
oidal-0.030
Token Distribution
Token ID27484
Feature activation+2.85e-01

Pos logit contributions
 Rabbit+0.457
Pan+0.432
 2048+0.423
+0.403
pain+0.400
Neg logit contributions
allery-0.751
tnc-0.735
 Klu-0.692
erd-0.635
oidal-0.629
Token get
Token ID651
Feature activation-2.79e-01

Pos logit contributions
allery+0.349
tnc+0.339
 Klu+0.316
erd+0.287
oidal+0.283
Neg logit contributions
 Rabbit-0.275
Pan-0.259
 2048-0.256
-0.243
pain-0.241
Token something
Token ID1223
Feature activation+2.72e-01

Pos logit contributions
allery+0.756
tnc+0.737
 Klu+0.692
erd+0.629
oidal+0.623
Neg logit contributions
 Rabbit-0.529
Pan-0.497
 2048-0.489
-0.465
pain-0.461
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
 Rabbit+0.446
Pan+0.413
 2048+0.406
+0.381
pain+0.377
Neg logit contributions
allery-0.811
tnc-0.792
 Klu-0.745
erd-0.686
oidal-0.680
Token We
Token ID775
Feature activation-1.44e-02

Pos logit contributions
allery+0.085
tnc+0.083
 Klu+0.078
erd+0.071
oidal+0.071
Neg logit contributions
 Rabbit-0.051
Pan-0.048
 2048-0.047
-0.045
pain-0.044
Token came
Token ID1625
Feature activation-4.30e-01

Pos logit contributions
allery+0.037
tnc+0.036
 Klu+0.034
erd+0.030
oidal+0.030
Neg logit contributions
 Rabbit-0.029
Pan-0.028
 2048-0.027
-0.026
pain-0.026
Token away
Token ID1497
Feature activation+1.29e+00

Pos logit contributions
allery+1.154
tnc+1.127
 Klu+1.059
erd+0.962
oidal+0.951
Neg logit contributions
 Rabbit-0.826
Pan-0.775
 2048-0.762
-0.727
pain-0.720
Token thinking
Token ID3612
Feature activation+3.75e-02

Pos logit contributions
 Rabbit+2.287
Pan+2.126
 2048+2.109
+1.975
pain+1.973
Neg logit contributions
allery-3.663
tnc-3.564
 Klu-3.305
erd-3.065
oidal-3.024
Token we
Token ID356
Feature activation+1.18e-02

Pos logit contributions
 Rabbit+0.067
Pan+0.063
 2048+0.062
+0.058
pain+0.058
Neg logit contributions
allery-0.106
tnc-0.103
 Klu-0.097
erd-0.089
oidal-0.088
Token could
Token ID714
Feature activation+1.88e-01

Pos logit contributions
 Rabbit+0.023
Pan+0.022
 2048+0.022
+0.021
pain+0.020
Neg logit contributions
allery-0.031
tnc-0.030
 Klu-0.028
erd-0.026
oidal-0.025
Token have
Token ID423
Feature activation+1.54e-01

Pos logit contributions
 Rabbit+0.386
Pan+0.364
 2048+0.360
+0.341
pain+0.339
Neg logit contributions
allery-0.484
tnc-0.471
 Klu-0.438
erd-0.397
oidal-0.392
Token got
Token ID1392
Feature activation+7.22e-02

Pos logit contributions
 Rabbit+0.269
Pan+0.252
 2048+0.246
+0.233
pain+0.231
Neg logit contributions
allery-0.443
tnc-0.432
 Klu-0.408
erd-0.373
oidal-0.369
Tokens
Token ID82
Feature activation-9.83e-02

Pos logit contributions
allery+0.629
tnc+0.612
 Klu+0.576
erd+0.525
oidal+0.519
Neg logit contributions
 Rabbit-0.426
Pan-0.399
 2048-0.392
-0.371
pain-0.367
Token nomination
Token ID11872
Feature activation-5.59e-01

Pos logit contributions
allery+0.242
tnc+0.235
 Klu+0.220
erd+0.197
oidal+0.194
Neg logit contributions
 Rabbit-0.210
Pan-0.200
 2048-0.196
-0.189
pain-0.187
Token to
Token ID284
Feature activation-2.39e-02

Pos logit contributions
allery+1.516
tnc+1.474
 Klu+1.385
erd+1.259
oidal+1.250
Neg logit contributions
 Rabbit-1.053
Pan-0.991
 2048-0.971
-0.925
pain-0.913
Token the
Token ID262
Feature activation+5.70e-02

Pos logit contributions
allery+0.066
tnc+0.064
 Klu+0.060
erd+0.055
oidal+0.054
Neg logit contributions
 Rabbit-0.045
Pan-0.042
 2048-0.042
-0.039
pain-0.039
Token Supreme
Token ID5617
Feature activation-9.48e-01

Pos logit contributions
 Rabbit+0.122
Pan+0.116
 2048+0.114
+0.109
pain+0.108
Neg logit contributions
allery-0.141
tnc-0.137
 Klu-0.128
erd-0.115
oidal-0.114
Token Court
Token ID3078
Feature activation-1.54e+00

Pos logit contributions
allery+2.310
tnc+2.289
 Klu+2.170
erd+1.929
oidal+1.890
Neg logit contributions
 Rabbit-1.944
Pan-1.864
 2048-1.787
-1.758
pain-1.746
Token.
Token ID13
Feature activation+4.04e-02

Pos logit contributions
allery+4.038
tnc+3.891
 Klu+3.688
erd+3.319
oidal+3.301
Neg logit contributions
 Rabbit-2.945
Pan-2.806
 2048-2.740
-2.632
pain-2.594
Token In
Token ID554
Feature activation-4.48e-02

Pos logit contributions
 Rabbit+0.070
Pan+0.066
 2048+0.065
+0.061
pain+0.061
Neg logit contributions
allery-0.116
tnc-0.113
 Klu-0.107
erd-0.097
oidal-0.097
Token brief
Token ID4506
Feature activation-2.77e-01

Pos logit contributions
allery+0.121
tnc+0.119
 Klu+0.112
erd+0.102
oidal+0.101
Neg logit contributions
 Rabbit-0.084
Pan-0.079
 2048-0.077
-0.074
---------0.073
Token,
Token ID11
Feature activation+1.71e-01

Pos logit contributions
allery+0.781
tnc+0.760
 Klu+0.720
erd+0.657
oidal+0.649
Neg logit contributions
 Rabbit-0.497
Pan-0.463
 2048-0.455
-0.431
pain-0.426
Token the
Token ID262
Feature activation+6.06e-02

Pos logit contributions
 Rabbit+0.327
Pan+0.306
 2048+0.302
+0.286
pain+0.284
Neg logit contributions
allery-0.462
tnc-0.451
 Klu-0.422
erd-0.384
oidal-0.381
Token step
Token ID2239
Feature activation-1.70e-02

Pos logit contributions
allery+1.076
tnc+1.046
 Klu+0.979
erd+0.878
oidal+0.870
Neg logit contributions
 Rabbit-0.948
Pan-0.907
 2048-0.887
-0.857
pain-0.843
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
allery+0.047
tnc+0.046
 Klu+0.043
erd+0.039
oidal+0.039
Neg logit contributions
 Rabbit-0.032
Pan-0.030
 2048-0.029
-0.028
pain-0.028
Token
Token ID198
Feature activation+6.10e-02

Pos logit contributions
 Rabbit+0.018
Pan+0.017
 2048+0.017
+0.016
pain+0.016
Neg logit contributions
allery-0.029
tnc-0.028
 Klu-0.026
erd-0.024
oidal-0.024
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
 Rabbit+0.114
Pan+0.107
 2048+0.105
+0.101
pain+0.099
Neg logit contributions
allery-0.165
tnc-0.163
 Klu-0.153
erd-0.139
oidal-0.138
Token"
Token ID1
Feature activation-4.80e-01

Pos logit contributions
 Rabbit+0.018
Pan+0.017
 2048+0.016
+0.015
pain+0.015
Neg logit contributions
allery-0.037
tnc-0.036
 Klu-0.034
erd-0.031
oidal-0.031
TokenDirect
Token ID13470
Feature activation-1.41e+00

Pos logit contributions
allery+1.283
tnc+1.263
 Klu+1.177
erd+1.070
oidal+1.059
Neg logit contributions
 Rabbit-0.923
 2048-0.853
Pan-0.851
-0.809
 civilian-0.802
Token,
Token ID11
Feature activation+9.52e-03

Pos logit contributions
allery+3.316
tnc+3.254
 Klu+3.013
erd+2.717
oidal+2.705
Neg logit contributions
 Rabbit-3.052
Pan-2.923
 2048-2.874
pain-2.756
---------2.735
Token structured
Token ID20793
Feature activation-4.02e-01

Pos logit contributions
 Rabbit+0.019
Pan+0.018
 2048+0.017
+0.016
pain+0.016
Neg logit contributions
allery-0.025
tnc-0.025
 Klu-0.023
erd-0.021
oidal-0.021
Token communication
Token ID6946
Feature activation-3.16e-01

Pos logit contributions
allery+0.956
tnc+0.928
 Klu+0.864
erd+0.773
oidal+0.764
Neg logit contributions
 Rabbit-0.891
Pan-0.850
 2048-0.836
-0.802
pain-0.797
Token is
Token ID318
Feature activation-2.20e-01

Pos logit contributions
allery+0.817
tnc+0.795
 Klu+0.743
erd+0.674
oidal+0.667
Neg logit contributions
 Rabbit-0.640
Pan-0.604
 2048-0.594
-0.568
pain-0.562
Token vital
Token ID9204
Feature activation+5.54e-02

Pos logit contributions
allery+0.518
tnc+0.503
 Klu+0.466
erd+0.419
oidal+0.414
Neg logit contributions
 Rabbit-0.496
Pan-0.470
 2048-0.464
-0.444
pain-0.441
Token Movement
Token ID15477
Feature activation+4.98e-02

Pos logit contributions
allery+0.189
tnc+0.184
 Klu+0.174
erd+0.159
oidal+0.157
Neg logit contributions
 Rabbit-0.117
Pan-0.110
 2048-0.108
-0.102
pain-0.101
Token
Token ID447
Feature activation+3.78e-01

Pos logit contributions
 Rabbit+0.092
Pan+0.087
 2048+0.085
+0.081
pain+0.080
Neg logit contributions
allery-0.137
tnc-0.134
 Klu-0.126
erd-0.114
oidal-0.114
Token
Token ID251
Feature activation+1.36e-01

Pos logit contributions
 Rabbit+0.459
Pan+0.415
 2048+0.402
pain+0.366
 Panama+0.358
Neg logit contributions
allery-1.286
tnc-1.258
 Klu-1.199
erd-1.116
oidal-1.106
Token (
Token ID357
Feature activation-4.31e-02

Pos logit contributions
 Rabbit+0.246
Pan+0.229
 2048+0.226
+0.214
pain+0.212
Neg logit contributions
allery-0.380
tnc-0.371
 Klu-0.348
erd-0.319
oidal-0.315
TokenV
Token ID53
Feature activation-5.56e-02

Pos logit contributions
allery+0.117
tnc+0.115
 Klu+0.107
erd+0.098
oidal+0.097
Neg logit contributions
 Rabbit-0.081
Pan-0.076
 2048-0.075
-0.072
pain-0.070
Tokeniking
Token ID14132
Feature activation-1.38e+00

Pos logit contributions
allery+0.127
tnc+0.122
 Klu+0.113
erd+0.102
oidal+0.100
Neg logit contributions
 Rabbit-0.131
Pan-0.124
 2048-0.123
-0.118
pain-0.117
Token,
Token ID11
Feature activation-7.51e-02

Pos logit contributions
allery+3.593
tnc+3.439
 Klu+3.353
erd+3.023
oidal+2.935
Neg logit contributions
 Rabbit-2.650
Pan-2.531
 2048-2.470
-2.401
pain-2.363
Token 2006
Token ID4793
Feature activation+3.27e-02

Pos logit contributions
allery+0.238
tnc+0.233
 Klu+0.221
erd+0.204
oidal+0.202
Neg logit contributions
 Rabbit-0.108
Pan-0.099
 2048-0.097
-0.091
pain-0.089
Token).
Token ID737
Feature activation+1.50e-01

Pos logit contributions
 Rabbit+0.067
Pan+0.063
 2048+0.062
+0.059
pain+0.059
Neg logit contributions
allery-0.084
tnc-0.082
 Klu-0.076
erd-0.069
oidal-0.069
Token Ms
Token ID6997
Feature activation-8.61e-02

Pos logit contributions
 Rabbit+0.261
Pan+0.244
 2048+0.240
+0.226
pain+0.224
Neg logit contributions
allery-0.429
tnc-0.419
 Klu-0.394
erd-0.361
oidal-0.358
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
allery+0.216
tnc+0.211
 Klu+0.198
erd+0.180
oidal+0.177
Neg logit contributions
 Rabbit-0.178
Pan-0.167
 2048-0.166
-0.161
pain-0.157
Token show
Token ID905
Feature activation+4.77e-01

Pos logit contributions
allery+0.387
tnc+0.377
 Klu+0.351
erd+0.318
oidal+0.315
Neg logit contributions
 Rabbit-0.320
Pan-0.302
 2048-0.298
-0.284
pain-0.282
Token up
Token ID510
Feature activation-4.52e-01

Pos logit contributions
 Rabbit+0.875
Pan+0.815
 2048+0.807
+0.760
pain+0.753
Neg logit contributions
allery-1.327
tnc-1.296
 Klu-1.214
erd-1.113
oidal-1.100
Token at
Token ID379
Feature activation-7.93e-02

Pos logit contributions
allery+1.209
tnc+1.174
 Klu+1.109
erd+1.003
oidal+0.997
Neg logit contributions
 Rabbit-0.870
Pan-0.820
 2048-0.801
-0.767
pain-0.756
Token the
Token ID262
Feature activation-6.03e-02

Pos logit contributions
allery+0.219
tnc+0.213
 Klu+0.200
erd+0.183
oidal+0.181
Neg logit contributions
 Rabbit-0.147
Pan-0.138
 2048-0.135
-0.128
pain-0.127
Token Supreme
Token ID5617
Feature activation-9.23e-01

Pos logit contributions
allery+0.159
tnc+0.155
 Klu+0.144
erd+0.131
oidal+0.130
Neg logit contributions
 Rabbit-0.120
Pan-0.113
 2048-0.111
-0.105
pain-0.105
Token Court
Token ID3078
Feature activation-1.38e+00

Pos logit contributions
allery+2.316
tnc+2.286
 Klu+2.170
erd+1.931
oidal+1.902
Neg logit contributions
 Rabbit-1.856
Pan-1.777
 2048-1.712
-1.678
pain-1.662
Token building
Token ID2615
Feature activation+3.52e-01

Pos logit contributions
allery+3.681
tnc+3.564
 Klu+3.387
erd+3.043
oidal+3.033
Neg logit contributions
 Rabbit-2.578
Pan-2.448
 2048-2.391
-2.316
pain-2.281
Token and
Token ID290
Feature activation+8.99e-02

Pos logit contributions
 Rabbit+0.628
Pan+0.585
 2048+0.578
+0.542
pain+0.541
Neg logit contributions
allery-0.997
tnc-0.974
 Klu-0.912
erd-0.838
oidal-0.828
Token start
Token ID923
Feature activation+4.82e-01

Pos logit contributions
 Rabbit+0.193
Pan+0.182
 2048+0.180
+0.170
pain+0.170
Neg logit contributions
allery-0.223
tnc-0.217
 Klu-0.201
erd-0.182
oidal-0.179
Token working
Token ID1762
Feature activation-2.95e-01

Pos logit contributions
 Rabbit+0.862
Pan+0.800
 2048+0.794
+0.745
pain+0.742
Neg logit contributions
allery-1.371
tnc-1.335
 Klu-1.248
erd-1.148
oidal-1.135
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
allery+0.827
tnc+0.806
 Klu+0.762
erd+0.695
oidal+0.689
Neg logit contributions
 Rabbit-0.528
Pan-0.496
 2048-0.483
-0.464
pain-0.456
Token your
Token ID534
Feature activation-2.28e-01

Pos logit contributions
 Rabbit+0.278
Pan+0.261
 2048+0.257
+0.244
pain+0.243
Neg logit contributions
allery-0.359
tnc-0.350
 Klu-0.325
erd-0.296
oidal-0.294
Token opponent
Token ID6125
Feature activation-6.23e-02

Pos logit contributions
allery+0.598
tnc+0.583
 Klu+0.547
erd+0.496
oidal+0.490
Neg logit contributions
 Rabbit-0.451
Pan-0.425
 2048-0.419
-0.400
pain-0.395
Token is
Token ID318
Feature activation+5.81e-02

Pos logit contributions
allery+0.165
tnc+0.160
 Klu+0.150
erd+0.137
oidal+0.135
Neg logit contributions
 Rabbit-0.122
Pan-0.115
 2048-0.113
-0.108
pain-0.107
Token able
Token ID1498
Feature activation-3.77e-01

Pos logit contributions
 Rabbit+0.103
Pan+0.096
 2048+0.095
+0.090
pain+0.089
Neg logit contributions
allery-0.164
tnc-0.160
 Klu-0.151
erd-0.138
oidal-0.137
Token to
Token ID284
Feature activation-9.45e-02

Pos logit contributions
allery+0.977
tnc+0.957
 Klu+0.896
erd+0.813
oidal+0.803
Neg logit contributions
 Rabbit-0.755
Pan-0.708
 2048-0.700
-0.668
pain-0.662
Token KO
Token ID21054
Feature activation-1.36e+00

Pos logit contributions
allery+0.245
tnc+0.238
 Klu+0.222
erd+0.201
oidal+0.199
Neg logit contributions
 Rabbit-0.193
Pan-0.181
 2048-0.179
-0.170
pain-0.168
Token every
Token ID790
Feature activation+1.57e-01

Pos logit contributions
allery+3.465
tnc+3.363
 Klu+3.175
erd+2.867
oidal+2.813
Neg logit contributions
 Rabbit-2.734
Pan-2.581
 2048-2.565
-2.458
pain-2.437
Token one
Token ID530
Feature activation+3.80e-01

Pos logit contributions
 Rabbit+0.304
Pan+0.285
 2048+0.280
+0.265
pain+0.264
Neg logit contributions
allery-0.423
tnc-0.412
 Klu-0.385
erd-0.351
oidal-0.347
Token of
Token ID286
Feature activation+5.77e-02

Pos logit contributions
 Rabbit+0.714
Pan+0.667
 2048+0.654
+0.616
pain+0.616
Neg logit contributions
allery-1.051
tnc-1.026
 Klu-0.955
erd-0.873
oidal-0.869
Token his
Token ID465
Feature activation+8.32e-02

Pos logit contributions
 Rabbit+0.115
Pan+0.108
 2048+0.106
+0.100
pain+0.100
Neg logit contributions
allery-0.153
tnc-0.149
 Klu-0.138
erd-0.126
oidal-0.125
Token Master
Token ID5599
Feature activation-3.62e-01

Pos logit contributions
 Rabbit+0.164
Pan+0.154
 2048+0.152
+0.145
pain+0.143
Neg logit contributions
allery-0.220
tnc-0.214
 Klu-0.200
erd-0.182
oidal-0.180
Token/
Token ID14
Feature activation-1.56e-01

Pos logit contributions
allery+1.553
tnc+1.505
 Klu+1.404
erd+1.255
oidal+1.239
Neg logit contributions
 Rabbit-1.484
Pan-1.410
 2048-1.384
-1.331
---------1.316
Tokenviolence
Token ID37502
Feature activation-1.34e-01

Pos logit contributions
allery+0.389
tnc+0.378
 Klu+0.352
erd+0.318
oidal+0.314
Neg logit contributions
 Rabbit-0.331
Pan-0.312
 2048-0.308
-0.294
pain-0.292
Token_
Token ID62
Feature activation-1.36e+00

Pos logit contributions
allery+0.333
tnc+0.324
 Klu+0.302
erd+0.272
oidal+0.269
Neg logit contributions
 Rabbit-0.284
Pan-0.269
 2048-0.265
-0.253
pain-0.252
Tokenin
Token ID259
Feature activation-6.00e-01

Pos logit contributions
allery+3.728
tnc+3.657
 Klu+3.356
erd+3.137
oidal+3.095
Neg logit contributions
 Rabbit-2.531
 2048-2.306
Pan-2.298
-2.176
 Panama-2.173
Tokenjury
Token ID21871
Feature activation-3.14e-01

Pos logit contributions
allery+1.820
tnc+1.779
 Klu+1.673
erd+1.546
oidal+1.534
Neg logit contributions
 Rabbit-0.954
Pan-0.875
 2048-0.861
-0.808
 Panama-0.791
Token_
Token ID62
Feature activation-1.36e+00

Pos logit contributions
allery+0.777
tnc+0.755
 Klu+0.702
erd+0.634
oidal+0.627
Neg logit contributions
 Rabbit-0.673
Pan-0.636
 2048-0.628
-0.599
pain-0.595
Tokenpre
Token ID3866
Feature activation-3.66e-01

Pos logit contributions
allery+3.725
tnc+3.667
 Klu+3.354
erd+3.123
oidal+3.079
Neg logit contributions
 Rabbit-2.534
Pan-2.297
 2048-2.294
-2.173
 Panama-2.163
Tokenvention
Token ID4018
Feature activation-6.61e-01

Pos logit contributions
allery+1.039
tnc+1.015
 Klu+0.947
erd+0.872
oidal+0.864
Neg logit contributions
 Rabbit-0.657
Pan-0.609
 2048-0.597
-0.558
 Panama-0.555
Token/
Token ID14
Feature activation-3.28e-01

Pos logit contributions
allery+1.590
tnc+1.545
 Klu+1.433
erd+1.289
oidal+1.274
Neg logit contributions
 Rabbit-1.460
Pan-1.381
 2048-1.364
-1.305
pain-1.295
Tokenkey
Token ID2539
Feature activation-6.74e-01

Pos logit contributions
allery+0.828
tnc+0.807
 Klu+0.750
erd+0.680
oidal+0.672
Neg logit contributions
 Rabbit-0.685
Pan-0.645
 2048-0.638
-0.607
pain-0.601
Token_
Token ID62
Feature activation-1.25e+00

Pos logit contributions
allery+1.663
tnc+1.619
 Klu+1.502
erd+1.359
oidal+1.341
Neg logit contributions
 Rabbit-1.452
Pan-1.364
 2048-1.347
-1.287
pain-1.271
Tokenwho
Token ID8727
Feature activation+9.26e-02

Pos logit contributions
allery+0.890
tnc+0.871
 Klu+0.812
erd+0.740
oidal+0.731
Neg logit contributions
 Rabbit-0.673
Pan-0.630
 2048-0.622
-0.592
---------0.584
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
 Rabbit+0.174
Pan+0.165
 2048+0.163
pain+0.154
sec+0.153
Neg logit contributions
allery-0.252
tnc-0.246
 Klu-0.230
 concess-0.209
erd-0.209
Tokenint
Token ID600
Feature activation-6.59e-01

Pos logit contributions
allery+0.976
tnc+0.970
 Klu+0.866
erd+0.763
oidal+0.747
Neg logit contributions
 Rabbit-1.441
Pan-1.353
 2048-1.350
---------1.298
-1.297
Token/
Token ID14
Feature activation-1.56e-01

Pos logit contributions
allery+1.553
tnc+1.505
 Klu+1.404
erd+1.255
oidal+1.239
Neg logit contributions
 Rabbit-1.484
Pan-1.410
 2048-1.384
-1.331
---------1.316
Tokenviolence
Token ID37502
Feature activation-1.34e-01

Pos logit contributions
allery+0.389
tnc+0.378
 Klu+0.352
erd+0.318
oidal+0.314
Neg logit contributions
 Rabbit-0.331
Pan-0.312
 2048-0.308
-0.294
pain-0.292
Token_
Token ID62
Feature activation-1.36e+00

Pos logit contributions
allery+0.333
tnc+0.324
 Klu+0.302
erd+0.272
oidal+0.269
Neg logit contributions
 Rabbit-0.284
Pan-0.269
 2048-0.265
-0.253
pain-0.252
Tokenin
Token ID259
Feature activation-6.00e-01

Pos logit contributions
allery+3.728
tnc+3.657
 Klu+3.356
erd+3.137
oidal+3.095
Neg logit contributions
 Rabbit-2.531
 2048-2.306
Pan-2.298
-2.176
 Panama-2.173
Tokenjury
Token ID21871
Feature activation-3.14e-01

Pos logit contributions
allery+1.820
tnc+1.779
 Klu+1.673
erd+1.546
oidal+1.534
Neg logit contributions
 Rabbit-0.954
Pan-0.875
 2048-0.861
-0.808
 Panama-0.791
Token_
Token ID62
Feature activation-1.36e+00

Pos logit contributions
allery+0.777
tnc+0.755
 Klu+0.702
erd+0.634
oidal+0.627
Neg logit contributions
 Rabbit-0.673
Pan-0.636
 2048-0.628
-0.599
pain-0.595
Tokenpre
Token ID3866
Feature activation-3.66e-01

Pos logit contributions
allery+3.725
tnc+3.667
 Klu+3.354
erd+3.123
oidal+3.079
Neg logit contributions
 Rabbit-2.534
Pan-2.297
 2048-2.294
-2.173
 Panama-2.163
Tokenvention
Token ID4018
Feature activation-6.61e-01

Pos logit contributions
allery+1.039
tnc+1.015
 Klu+0.947
erd+0.872
oidal+0.864
Neg logit contributions
 Rabbit-0.657
Pan-0.609
 2048-0.597
-0.558
 Panama-0.555
Token Will
Token ID2561
Feature activation+2.83e-01

Pos logit contributions
allery+0.676
tnc+0.661
 Klu+0.622
erd+0.567
oidal+0.561
Neg logit contributions
 Rabbit-0.443
Pan-0.413
 2048-0.408
-0.387
pain-0.383
Token be
Token ID307
Feature activation-1.48e-02

Pos logit contributions
 Rabbit+0.558
Pan+0.522
 2048+0.514
+0.490
pain+0.486
Neg logit contributions
allery-0.753
tnc-0.733
 Klu-0.680
erd-0.618
oidal-0.614
Token uploading
Token ID33794
Feature activation-5.41e-01

Pos logit contributions
allery+0.041
tnc+0.040
 Klu+0.038
erd+0.034
oidal+0.034
Neg logit contributions
 Rabbit-0.027
Pan-0.025
 2048-0.025
-0.023
pain-0.023
Token clips
Token ID19166
Feature activation-8.21e-01

Pos logit contributions
allery+1.466
tnc+1.428
 Klu+1.338
erd+1.220
oidal+1.207
Neg logit contributions
 Rabbit-1.031
Pan-0.968
 2048-0.953
-0.905
pain-0.897
Token of
Token ID286
Feature activation+8.43e-02

Pos logit contributions
allery+2.204
tnc+2.147
 Klu+2.018
erd+1.839
oidal+1.815
Neg logit contributions
 Rabbit-1.560
Pan-1.470
 2048-1.432
-1.380
pain-1.352
Token heroes
Token ID10281
Feature activation-1.34e+00

Pos logit contributions
 Rabbit+0.162
Pan+0.151
 2048+0.149
+0.141
pain+0.140
Neg logit contributions
allery-0.229
tnc-0.222
 Klu-0.207
erd-0.190
oidal-0.188
Token on
Token ID319
Feature activation-2.62e-01

Pos logit contributions
allery+3.582
tnc+3.452
 Klu+3.267
erd+2.972
oidal+2.925
Neg logit contributions
 Rabbit-2.548
Pan-2.402
 2048-2.344
-2.286
pain-2.210
Token my
Token ID616
Feature activation-1.93e-01

Pos logit contributions
allery+0.714
tnc+0.696
 Klu+0.652
erd+0.595
oidal+0.589
Neg logit contributions
 Rabbit-0.497
Pan-0.466
 2048-0.459
-0.436
pain-0.432
Token youtube
Token ID35116
Feature activation-9.70e-01

Pos logit contributions
allery+0.514
tnc+0.501
 Klu+0.469
erd+0.427
oidal+0.422
Neg logit contributions
 Rabbit-0.376
Pan-0.354
 2048-0.348
-0.331
pain-0.328
Token at
Token ID379
Feature activation-2.49e-01

Pos logit contributions
allery+2.383
tnc+2.284
 Klu+2.149
erd+1.945
oidal+1.895
Neg logit contributions
 Rabbit-2.062
Pan-1.959
 2048-1.915
-1.859
pain-1.833
Token https
Token ID3740
Feature activation-5.03e-01

Pos logit contributions
allery+0.707
tnc+0.689
 Klu+0.647
erd+0.592
oidal+0.588
Neg logit contributions
 Rabbit-0.445
Pan-0.415
 2048-0.409
-0.386
pain-0.382
Token)
Token ID8
Feature activation+1.89e-01

Pos logit contributions
allery+0.762
tnc+0.755
 Klu+0.702
erd+0.635
oidal+0.630
Neg logit contributions
 Rabbit-0.631
Pan-0.590
 2048-0.584
-0.564
pain-0.555
Token<|endoftext|>
Token ID50256
Feature activation+2.11e-01

Pos logit contributions
 Rabbit+0.331
Pan+0.305
 2048+0.301
+0.283
 Panama+0.281
Neg logit contributions
allery-0.544
tnc-0.535
 Klu-0.498
erd-0.462
oidal-0.457
TokenUnited
Token ID17013
Feature activation-4.07e-02

Pos logit contributions
 Rabbit+0.406
Pan+0.385
 2048+0.376
+0.357
pain+0.356
Neg logit contributions
allery-0.568
tnc-0.550
 Klu-0.518
erd-0.470
oidal-0.465
Token States
Token ID1829
Feature activation-2.26e-01

Pos logit contributions
allery+0.131
tnc+0.129
 Klu+0.122
erd+0.113
oidal+0.112
Neg logit contributions
 Rabbit-0.056
Pan-0.051
 2048-0.050
-0.047
pain-0.046
Token Supreme
Token ID5617
Feature activation-3.72e-01

Pos logit contributions
allery+0.657
tnc+0.641
 Klu+0.607
erd+0.555
oidal+0.551
Neg logit contributions
 Rabbit-0.384
Pan-0.358
 2048-0.351
-0.332
pain-0.329
Token Court
Token ID3078
Feature activation-1.33e+00

Pos logit contributions
allery+0.944
tnc+0.930
 Klu+0.881
erd+0.786
oidal+0.774
Neg logit contributions
 Rabbit-0.742
Pan-0.708
 2048-0.688
-0.667
pain-0.660
Token case
Token ID1339
Feature activation-6.50e-01

Pos logit contributions
allery+3.646
tnc+3.554
 Klu+3.345
erd+3.054
oidal+2.995
Neg logit contributions
 Rabbit-2.415
Pan-2.296
 2048-2.238
-2.138
pain-2.136
Token
Token ID198
Feature activation-3.93e-01

Pos logit contributions
allery+1.816
tnc+1.778
 Klu+1.687
erd+1.525
oidal+1.515
Neg logit contributions
 Rabbit-1.155
Pan-1.094
 2048-1.066
-1.025
pain-1.009
Token
Token ID198
Feature activation-3.35e-01

Pos logit contributions
allery+1.027
tnc+1.026
 Klu+0.959
erd+0.869
oidal+0.861
Neg logit contributions
 Rabbit-0.755
Pan-0.698
 2048-0.697
-0.673
pain-0.660
TokenSanta
Token ID42694
Feature activation+2.06e-01

Pos logit contributions
allery+0.902
tnc+0.884
 Klu+0.823
erd+0.753
oidal+0.744
Neg logit contributions
 Rabbit-0.643
Pan-0.597
 2048-0.595
-0.563
pain-0.556
Token Clara
Token ID27443
Feature activation+2.25e-01

Pos logit contributions
 Rabbit+0.304
Pan+0.279
 2048+0.274
+0.256
pain+0.253
Neg logit contributions
allery-0.648
tnc-0.634
 Klu-0.600
erd-0.554
oidal-0.550
Token:
Token ID25
Feature activation+2.35e-01

Pos logit contributions
allery+0.220
tnc+0.218
 Klu+0.205
erd+0.189
oidal+0.188
Neg logit contributions
 Rabbit-0.117
Pan-0.107
 2048-0.105
-0.101
pain-0.098
Token47
Token ID2857
Feature activation+3.34e-01

Pos logit contributions
 Rabbit+0.390
 2048+0.361
Pan+0.359
+0.332
--------+0.332
Neg logit contributions
allery-0.694
tnc-0.678
 Klu-0.635
erd-0.586
oidal-0.581
Token]
Token ID60
Feature activation+4.63e-01

Pos logit contributions
 Rabbit+0.542
Pan+0.500
 2048+0.493
+0.466
pain+0.459
Neg logit contributions
allery-0.992
tnc-0.975
 Klu-0.919
erd-0.847
oidal-0.839
Token <
Token ID1279
Feature activation+4.18e-01

Pos logit contributions
 Rabbit+0.696
Pan+0.636
 2048+0.630
+0.581
pain+0.578
Neg logit contributions
allery-1.446
tnc-1.411
 Klu-1.324
erd-1.233
oidal-1.221
Tokenliterally
Token ID43819
Feature activation+8.95e-02

Pos logit contributions
 Rabbit+0.882
Pan+0.840
 2048+0.820
pain+0.787
+0.784
Neg logit contributions
allery-1.044
tnc-1.009
 Klu-0.946
erd-0.848
oidal-0.841
Token_
Token ID62
Feature activation-1.32e+00

Pos logit contributions
 Rabbit+0.192
Pan+0.182
 2048+0.179
pain+0.171
+0.171
Neg logit contributions
allery-0.221
tnc-0.215
 Klu-0.200
erd-0.179
oidal-0.178
Tokenworst
Token ID41430
Feature activation-2.20e-02

Pos logit contributions
allery+3.373
tnc+3.323
 Klu+3.046
erd+2.837
oidal+2.781
Neg logit contributions
 Rabbit-2.661
Pan-2.472
 2048-2.469
-2.345
 Panama-2.342
Token_
Token ID62
Feature activation-1.23e+00

Pos logit contributions
allery+0.058
tnc+0.057
 Klu+0.053
erd+0.048
oidal+0.048
Neg logit contributions
 Rabbit-0.043
Pan-0.041
 2048-0.040
-0.038
pain-0.038
Tokenuser
Token ID7220
Feature activation-1.24e-01

Pos logit contributions
allery+3.153
tnc+3.118
 Klu+2.886
erd+2.663
oidal+2.597
Neg logit contributions
 Rabbit-2.461
 2048-2.288
Pan-2.280
-2.170
 Panama-2.170
Token>
Token ID29
Feature activation-1.95e-01

Pos logit contributions
allery+0.393
tnc+0.385
 Klu+0.363
erd+0.336
oidal+0.333
Neg logit contributions
 Rabbit-0.179
Pan-0.165
 2048-0.162
-0.150
pain-0.150
Token lol
Token ID19462
Feature activation-5.15e-03

Pos logit contributions
allery+0.532
tnc+0.518
 Klu+0.486
erd+0.443
oidal+0.439
Neg logit contributions
 Rabbit-0.366
Pan-0.343
 2048-0.338
-0.321
pain-0.318
Tokenisting
Token ID9665
Feature activation+1.37e-01

Pos logit contributions
allery+1.317
tnc+1.284
 Klu+1.198
erd+1.094
oidal+1.085
Neg logit contributions
 Rabbit-1.012
Pan-0.951
 2048-0.937
-0.896
---------0.883
Token violations
Token ID11734
Feature activation+2.90e-01

Pos logit contributions
 Rabbit+0.251
Pan+0.234
 2048+0.231
+0.217
pain+0.216
Neg logit contributions
allery-0.384
tnc-0.374
 Klu-0.349
erd-0.320
oidal-0.317
Token of
Token ID286
Feature activation-1.51e-01

Pos logit contributions
 Rabbit+0.543
Pan+0.507
 2048+0.500
+0.473
pain+0.469
Neg logit contributions
allery-0.796
tnc-0.777
 Klu-0.726
erd-0.664
oidal-0.657
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
allery+0.425
tnc+0.414
 Klu+0.389
erd+0.356
oidal+0.353
Neg logit contributions
 Rabbit-0.273
Pan-0.255
 2048-0.251
-0.237
pain-0.235
Token Supreme
Token ID5617
Feature activation-7.79e-01

Pos logit contributions
allery+0.040
tnc+0.039
 Klu+0.037
erd+0.034
oidal+0.033
Neg logit contributions
 Rabbit-0.025
Pan-0.023
 2048-0.023
-0.021
pain-0.021
Token Court
Token ID3078
Feature activation-1.32e+00

Pos logit contributions
allery+1.941
tnc+1.931
 Klu+1.846
erd+1.623
oidal+1.605
Neg logit contributions
 Rabbit-1.556
Pan-1.493
 2048-1.443
-1.421
pain-1.399
Token Orders
Token ID30689
Feature activation-6.04e-01

Pos logit contributions
allery+3.063
tnc+2.978
 Klu+2.783
erd+2.480
oidal+2.450
Neg logit contributions
 Rabbit-2.952
Pan-2.833
 2048-2.788
-2.678
pain-2.668
Token on
Token ID319
Feature activation+6.43e-02

Pos logit contributions
allery+1.645
tnc+1.626
 Klu+1.541
erd+1.386
oidal+1.384
Neg logit contributions
 Rabbit-1.088
Pan-1.039
 2048-1.002
-0.976
pain-0.956
Token Aadhaar
Token ID47727
Feature activation-1.40e-01

Pos logit contributions
 Rabbit+0.109
Pan+0.102
 2048+0.100
+0.094
pain+0.093
Neg logit contributions
allery-0.187
tnc-0.183
 Klu-0.172
erd-0.158
oidal-0.157
Token
Token ID198
Feature activation-4.36e-01

Pos logit contributions
allery+0.386
tnc+0.377
 Klu+0.354
erd+0.322
oidal+0.319
Neg logit contributions
 Rabbit-0.261
Pan-0.244
 2048-0.240
-0.229
pain-0.227
Token
Token ID198
Feature activation-2.92e-01

Pos logit contributions
allery+1.185
tnc+1.158
 Klu+1.086
erd+0.987
oidal+0.981
Neg logit contributions
 Rabbit-0.817
Pan-0.758
 2048-0.753
-0.717
pain-0.709
Token to
Token ID284
Feature activation-3.06e-02

Pos logit contributions
allery+1.105
tnc+1.087
 Klu+1.016
erd+0.921
oidal+0.914
Neg logit contributions
 Rabbit-0.872
Pan-0.820
 2048-0.809
-0.775
pain-0.764
Token get
Token ID651
Feature activation-1.54e-01

Pos logit contributions
allery+0.074
tnc+0.072
 Klu+0.066
erd+0.060
oidal+0.059
Neg logit contributions
 Rabbit-0.068
Pan-0.064
 2048-0.063
-0.060
pain-0.060
Token back
Token ID736
Feature activation-4.26e-01

Pos logit contributions
allery+0.415
tnc+0.405
 Klu+0.380
erd+0.346
oidal+0.342
Neg logit contributions
 Rabbit-0.294
Pan-0.277
 2048-0.272
-0.259
pain-0.256
Token on
Token ID319
Feature activation+4.50e-02

Pos logit contributions
allery+1.147
tnc+1.120
 Klu+1.059
erd+0.962
oidal+0.949
Neg logit contributions
 Rabbit-0.806
Pan-0.760
 2048-0.744
-0.708
---------0.695
Token the
Token ID262
Feature activation-1.91e-01

Pos logit contributions
 Rabbit+0.082
Pan+0.077
 2048+0.075
+0.071
pain+0.071
Neg logit contributions
allery-0.126
tnc-0.123
 Klu-0.115
erd-0.105
oidal-0.104
Token court
Token ID2184
Feature activation-1.30e+00

Pos logit contributions
allery+0.443
tnc+0.431
 Klu+0.400
erd+0.357
oidal+0.353
Neg logit contributions
 Rabbit-0.437
Pan-0.416
 2048-0.409
-0.393
pain-0.389
Token at
Token ID379
Feature activation-4.49e-02

Pos logit contributions
allery+3.376
tnc+3.296
 Klu+3.146
erd+2.827
oidal+2.803
Neg logit contributions
 Rabbit-2.491
Pan-2.366
 2048-2.292
-2.222
pain-2.152
Token the
Token ID262
Feature activation-9.54e-02

Pos logit contributions
allery+0.125
tnc+0.122
 Klu+0.115
erd+0.105
oidal+0.104
Neg logit contributions
 Rabbit-0.082
Pan-0.077
 2048-0.076
-0.072
pain-0.071
Token beginning
Token ID3726
Feature activation+2.45e-03

Pos logit contributions
allery+0.238
tnc+0.232
 Klu+0.216
erd+0.195
oidal+0.193
Neg logit contributions
 Rabbit-0.202
Pan-0.191
 2048-0.188
-0.180
pain-0.178
Token of
Token ID286
Feature activation-1.64e-02

Pos logit contributions
 Rabbit+0.004
Pan+0.004
 2048+0.004
+0.004
pain+0.004
Neg logit contributions
allery-0.007
tnc-0.007
 Klu-0.006
erd-0.006
oidal-0.006
Token this
Token ID428
Feature activation+4.39e-02

Pos logit contributions
allery+0.047
tnc+0.046
 Klu+0.043
erd+0.039
oidal+0.039
Neg logit contributions
 Rabbit-0.029
Pan-0.027
 2048-0.026
-0.025
pain-0.025
Token responded
Token ID7082
Feature activation+3.81e-01

Pos logit contributions
allery+0.134
tnc+0.130
 Klu+0.122
erd+0.110
oidal+0.109
Neg logit contributions
 Rabbit-0.106
Pan-0.099
 2048-0.098
-0.093
pain-0.093
Token by
Token ID416
Feature activation-1.59e-01

Pos logit contributions
 Rabbit+0.701
Pan+0.657
 2048+0.648
+0.609
 Panama+0.607
Neg logit contributions
allery-1.062
tnc-1.031
 Klu-0.962
erd-0.884
oidal-0.871
Token signing
Token ID8415
Feature activation-2.23e-01

Pos logit contributions
allery+0.467
tnc+0.457
 Klu+0.433
erd+0.396
oidal+0.394
Neg logit contributions
 Rabbit-0.262
Pan-0.243
 2048-0.238
-0.226
pain-0.223
Token the
Token ID262
Feature activation-1.64e-01

Pos logit contributions
allery+0.604
tnc+0.590
 Klu+0.555
erd+0.504
oidal+0.499
Neg logit contributions
 Rabbit-0.421
Pan-0.393
 2048-0.388
-0.368
pain-0.365
Token "
Token ID366
Feature activation+1.79e-01

Pos logit contributions
allery+0.470
tnc+0.460
 Klu+0.435
erd+0.396
oidal+0.393
Neg logit contributions
 Rabbit-0.284
Pan-0.263
 2048-0.259
-0.245
pain-0.243
TokenDirect
Token ID13470
Feature activation-1.29e+00

Pos logit contributions
 Rabbit+0.316
Pan+0.293
 2048+0.290
+0.274
pain+0.271
Neg logit contributions
allery-0.509
tnc-0.498
 Klu-0.467
erd-0.428
oidal-0.424
Tokenive
Token ID425
Feature activation-4.95e-01

Pos logit contributions
tnc+3.456
allery+3.441
 Klu+3.207
erd+2.937
oidal+2.925
Neg logit contributions
 Rabbit-2.386
Pan-2.208
 2048-2.197
pain-2.070
-2.051
Token of
Token ID286
Feature activation+5.98e-02

Pos logit contributions
allery+1.354
tnc+1.344
 Klu+1.270
erd+1.144
oidal+1.140
Neg logit contributions
 Rabbit-0.897
Pan-0.842
 2048-0.821
-0.790
pain-0.774
Token Peoples
Token ID41770
Feature activation+2.03e-01

Pos logit contributions
 Rabbit+0.103
Pan+0.096
 2048+0.094
+0.089
pain+0.088
Neg logit contributions
allery-0.172
tnc-0.168
 Klu-0.159
erd-0.145
oidal-0.144
Token'
Token ID6
Feature activation+2.64e-01

Pos logit contributions
 Rabbit+0.336
Pan+0.311
 2048+0.305
+0.288
pain+0.285
Neg logit contributions
allery-0.600
tnc-0.589
 Klu-0.555
erd-0.509
oidal-0.504
Token Comm
Token ID1520
Feature activation-5.78e-02

Pos logit contributions
 Rabbit+0.362
Pan+0.332
 2048+0.324
+0.301
pain+0.299
Neg logit contributions
allery-0.853
tnc-0.839
 Klu-0.796
erd-0.733
oidal-0.729
Token Free
Token ID3232
Feature activation+6.42e-02

Pos logit contributions
allery+0.399
tnc+0.389
 Klu+0.363
erd+0.329
oidal+0.325
Neg logit contributions
 Rabbit-0.319
Pan-0.300
 2048-0.296
-0.282
pain-0.280
Token
Token ID851
Feature activation-6.91e-02

Pos logit contributions
 Rabbit+0.131
Pan+0.124
 2048+0.122
+0.116
pain+0.115
Neg logit contributions
allery-0.165
tnc-0.161
 Klu-0.150
erd-0.136
oidal-0.134
Token better
Token ID1365
Feature activation+2.27e-01

Pos logit contributions
allery+0.189
tnc+0.184
 Klu+0.173
erd+0.158
oidal+0.156
Neg logit contributions
 Rabbit-0.129
Pan-0.121
 2048-0.119
-0.114
pain-0.112
Token known
Token ID1900
Feature activation+3.33e-01

Pos logit contributions
 Rabbit+0.473
Pan+0.446
 2048+0.440
+0.419
pain+0.416
Neg logit contributions
allery-0.573
tnc-0.557
 Klu-0.520
erd-0.470
oidal-0.465
Token as
Token ID355
Feature activation+8.79e-02

Pos logit contributions
 Rabbit+0.652
Pan+0.612
 2048+0.606
+0.572
pain+0.569
Neg logit contributions
allery-0.888
tnc-0.866
 Klu-0.805
erd-0.733
oidal-0.725
Token Mister
Token ID43438
Feature activation-1.29e+00

Pos logit contributions
 Rabbit+0.149
Pan+0.138
 2048+0.136
+0.128
pain+0.127
Neg logit contributions
allery-0.257
tnc-0.251
 Klu-0.236
erd-0.217
oidal-0.215
Token Miracle
Token ID33292
Feature activation-9.12e-01

Pos logit contributions
allery+3.535
tnc+3.433
 Klu+3.312
erd+2.989
oidal+2.946
Neg logit contributions
 Rabbit-2.233
Pan-2.156
 2048-2.113
-2.045
---------2.027
Token
Token ID851
Feature activation-2.26e-01

Pos logit contributions
allery+2.338
tnc+2.262
 Klu+2.152
erd+1.949
oidal+1.909
Neg logit contributions
 Rabbit-1.807
Pan-1.717
 2048-1.696
-1.620
---------1.607
Token has
Token ID468
Feature activation-6.87e-02

Pos logit contributions
allery+0.582
tnc+0.565
 Klu+0.530
erd+0.482
oidal+0.475
Neg logit contributions
 Rabbit-0.458
Pan-0.432
 2048-0.426
-0.406
pain-0.402
Token been
Token ID587
Feature activation+1.79e-01

Pos logit contributions
allery+0.186
tnc+0.182
 Klu+0.171
erd+0.155
oidal+0.154
Neg logit contributions
 Rabbit-0.130
Pan-0.122
 2048-0.120
-0.114
pain-0.113
Token a
Token ID257
Feature activation+1.55e-01

Pos logit contributions
 Rabbit+0.325
Pan+0.301
 2048+0.298
+0.281
pain+0.279
Neg logit contributions
allery-0.503
tnc-0.491
 Klu-0.458
erd-0.420
oidal-0.417
Token
Token ID247
Feature activation-1.90e-01

Pos logit contributions
allery+0.032
tnc+0.031
 Klu+0.030
erd+0.028
oidal+0.027
Neg logit contributions
 Rabbit-0.016
Pan-0.014
 2048-0.014
pain-0.013
 Panama-0.013
Tokent
Token ID83
Feature activation-2.82e-01

Pos logit contributions
allery+0.364
tnc+0.350
 Klu+0.318
erd+0.282
oidal+0.275
Neg logit contributions
 Rabbit-0.515
Pan-0.489
 2048-0.484
-0.466
pain-0.463
Token want
Token ID765
Feature activation-3.01e-01

Pos logit contributions
allery+0.768
tnc+0.747
 Klu+0.703
erd+0.642
oidal+0.636
Neg logit contributions
 Rabbit-0.531
Pan-0.497
 2048-0.488
-0.466
pain-0.460
Token to
Token ID284
Feature activation-1.44e-01

Pos logit contributions
allery+0.816
tnc+0.795
 Klu+0.743
erd+0.678
oidal+0.672
Neg logit contributions
 Rabbit-0.578
Pan-0.541
 2048-0.533
-0.506
pain-0.502
Token play
Token ID711
Feature activation-1.22e+00

Pos logit contributions
allery+0.349
tnc+0.339
 Klu+0.315
erd+0.284
oidal+0.281
Neg logit contributions
 Rabbit-0.315
Pan-0.297
 2048-0.294
-0.281
pain-0.279
Token mirror
Token ID10162
Feature activation-1.29e+00

Pos logit contributions
allery+3.176
tnc+3.099
 Klu+2.934
erd+2.654
oidal+2.603
Neg logit contributions
 Rabbit-2.363
Pan-2.262
 2048-2.212
-2.134
pain-2.095
Token matches
Token ID7466
Feature activation-7.53e-01

Pos logit contributions
allery+3.296
tnc+3.251
 Klu+3.041
erd+2.752
oidal+2.697
Neg logit contributions
 Rabbit-2.447
Pan-2.325
 2048-2.305
-2.227
---------2.158
Token.
Token ID13
Feature activation-8.71e-03

Pos logit contributions
allery+1.913
tnc+1.866
 Klu+1.756
erd+1.586
oidal+1.553
Neg logit contributions
 Rabbit-1.536
Pan-1.449
 2048-1.417
-1.373
pain-1.343
Token So
Token ID1406
Feature activation-1.34e-01

Pos logit contributions
allery+0.024
tnc+0.024
 Klu+0.022
erd+0.020
oidal+0.020
Neg logit contributions
 Rabbit-0.016
Pan-0.015
 2048-0.015
-0.014
pain-0.014
Token,
Token ID11
Feature activation-7.71e-03

Pos logit contributions
allery+0.346
tnc+0.337
 Klu+0.314
erd+0.285
oidal+0.282
Neg logit contributions
 Rabbit-0.272
Pan-0.256
 2048-0.252
-0.240
pain-0.238
Token I
Token ID314
Feature activation+4.63e-02

Pos logit contributions
allery+0.019
tnc+0.019
 Klu+0.018
erd+0.016
oidal+0.016
Neg logit contributions
 Rabbit-0.016
Pan-0.015
 2048-0.015
-0.014
pain-0.014
Token ways
Token ID2842
Feature activation+1.59e-01

Pos logit contributions
allery+0.876
tnc+0.856
 Klu+0.806
erd+0.727
oidal+0.722
Neg logit contributions
 Rabbit-0.627
Pan-0.594
 2048-0.581
-0.555
pain-0.550
Token to
Token ID284
Feature activation-7.02e-02

Pos logit contributions
 Rabbit+0.296
Pan+0.274
 2048+0.272
+0.257
 civilian+0.255
Neg logit contributions
allery-0.442
tnc-0.429
 Klu-0.401
erd-0.368
oidal-0.363
Token be
Token ID307
Feature activation-1.28e-01

Pos logit contributions
allery+0.170
tnc+0.165
 Klu+0.153
erd+0.138
oidal+0.138
Neg logit contributions
 Rabbit-0.153
Pan-0.145
 2048-0.142
-0.137
pain-0.136
Token able
Token ID1498
Feature activation-5.06e-01

Pos logit contributions
allery+0.325
tnc+0.315
 Klu+0.292
erd+0.265
oidal+0.262
Neg logit contributions
 Rabbit-0.263
Pan-0.249
 2048-0.245
-0.231
---------0.231
Token to
Token ID284
Feature activation-1.37e-01

Pos logit contributions
allery+1.269
tnc+1.252
 Klu+1.163
erd+1.054
oidal+1.048
Neg logit contributions
 Rabbit-1.040
Pan-0.984
 2048-0.962
-0.928
pain-0.915
Token direct
Token ID1277
Feature activation-1.28e+00

Pos logit contributions
allery+0.338
tnc+0.330
 Klu+0.306
erd+0.278
oidal+0.277
Neg logit contributions
 Rabbit-0.289
Pan-0.273
 2048-0.268
-0.258
pain-0.256
Token more
Token ID517
Feature activation-4.87e-02

Pos logit contributions
allery+3.259
tnc+3.174
 Klu+2.972
erd+2.688
oidal+2.658
Neg logit contributions
 Rabbit-2.559
Pan-2.445
 2048-2.382
-2.298
---------2.266
Token light
Token ID1657
Feature activation+6.51e-01

Pos logit contributions
allery+0.125
tnc+0.121
 Klu+0.113
erd+0.102
oidal+0.101
Neg logit contributions
 Rabbit-0.100
Pan-0.094
 2048-0.093
-0.089
pain-0.088
Token reflected
Token ID12548
Feature activation+6.28e-01

Pos logit contributions
 Rabbit+1.280
Pan+1.190
 2048+1.187
pain+1.124
 civilian+1.120
Neg logit contributions
allery-1.750
tnc-1.687
 Klu-1.573
erd-1.438
oidal-1.418
Token by
Token ID416
Feature activation+1.73e-02

Pos logit contributions
 Rabbit+1.136
Pan+1.054
 2048+1.050
pain+0.980
 civilian+0.977
Neg logit contributions
allery-1.779
tnc-1.727
 Klu-1.615
erd-1.486
oidal-1.464
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
 Rabbit+0.033
Pan+0.031
 2048+0.030
+0.029
pain+0.028
Neg logit contributions
allery-0.047
tnc-0.046
 Klu-0.043
erd-0.039
oidal-0.039
Token 3
Token ID513
Feature activation-1.70e-01

Pos logit contributions
allery+1.175
tnc+1.149
 Klu+1.105
erd+0.995
oidal+0.993
Neg logit contributions
 Rabbit-0.610
Pan-0.576
-0.542
 2048-0.540
pain-0.528
Token240
Token ID16102
Feature activation-1.51e-01

Pos logit contributions
allery+0.481
tnc+0.468
 Klu+0.440
oidal+0.400
erd+0.400
Neg logit contributions
 Rabbit-0.300
Pan-0.283
 2048-0.275
-0.264
pain-0.259
Token Posts
Token ID12043
Feature activation-1.03e+00

Pos logit contributions
allery+0.426
tnc+0.417
 Klu+0.395
erd+0.359
oidal+0.358
Neg logit contributions
 Rabbit-0.263
Pan-0.249
 2048-0.241
-0.232
pain-0.229
Token #
Token ID1303
Feature activation-1.14e-01

Pos logit contributions
allery+3.200
tnc+3.117
 Klu+2.977
erd+2.732
oidal+2.707
Neg logit contributions
 Rabbit-1.499
Pan-1.393
 2048-1.341
-1.282
pain-1.255
Token10
Token ID940
Feature activation-4.07e-01

Pos logit contributions
allery+0.281
tnc+0.274
 Klu+0.255
erd+0.230
oidal+0.227
Neg logit contributions
 Rabbit-0.246
Pan-0.232
 2048-0.229
-0.220
pain-0.217
Token Happy
Token ID14628
Feature activation-1.27e+00

Pos logit contributions
allery+1.210
tnc+1.179
 Klu+1.117
erd+1.025
oidal+1.015
Neg logit contributions
 Rabbit-0.664
Pan-0.619
 2048-0.607
-0.573
pain-0.566
Token it
Token ID340
Feature activation-5.07e-01

Pos logit contributions
allery+3.416
tnc+3.340
 Klu+3.187
erd+2.894
oidal+2.839
Neg logit contributions
 Rabbit-2.326
Pan-2.226
 2048-2.165
-2.094
pain-2.069
Token worked
Token ID3111
Feature activation-6.26e-01

Pos logit contributions
allery+1.251
tnc+1.205
 Klu+1.133
erd+1.027
oidal+1.003
Neg logit contributions
 Rabbit-1.078
Pan-1.020
 2048-1.009
-0.964
pain-0.954
Token out
Token ID503
Feature activation+2.37e-01

Pos logit contributions
allery+1.724
tnc+1.666
 Klu+1.592
erd+1.446
oidal+1.424
Neg logit contributions
 Rabbit-1.136
Pan-1.073
 2048-1.048
-1.010
pain-0.994
Token.
Token ID13
Feature activation-7.40e-02

Pos logit contributions
 Rabbit+0.447
Pan+0.417
 2048+0.412
+0.389
pain+0.386
Neg logit contributions
allery-0.651
tnc-0.635
 Klu-0.593
erd-0.542
oidal-0.538
Token June
Token ID2795
Feature activation+1.39e-02

Pos logit contributions
allery+0.207
tnc+0.202
 Klu+0.190
erd+0.173
oidal+0.172
Neg logit contributions
 Rabbit-0.135
Pan-0.126
 2048-0.124
-0.117
pain-0.116
Token disruptive
Token ID28094
Feature activation+3.54e-01

Pos logit contributions
allery+0.394
tnc+0.385
 Klu+0.358
erd+0.324
oidal+0.320
Neg logit contributions
 Rabbit-0.328
Pan-0.309
 2048-0.305
-0.292
pain-0.288
Token in
Token ID287
Feature activation-9.89e-02

Pos logit contributions
 Rabbit+0.665
Pan+0.621
 2048+0.614
+0.580
pain+0.577
Neg logit contributions
allery-0.971
tnc-0.947
 Klu-0.886
erd-0.810
oidal-0.801
Token terms
Token ID2846
Feature activation-3.69e-01

Pos logit contributions
allery+0.274
tnc+0.267
 Klu+0.251
erd+0.229
oidal+0.227
Neg logit contributions
 Rabbit-0.182
Pan-0.171
 2048-0.168
-0.160
pain-0.158
Token of
Token ID286
Feature activation-1.74e-02

Pos logit contributions
allery+0.923
tnc+0.917
 Klu+0.854
erd+0.773
oidal+0.762
Neg logit contributions
 Rabbit-0.754
Pan-0.714
 2048-0.698
-0.676
pain-0.665
Token foreign
Token ID3215
Feature activation-6.07e-01

Pos logit contributions
allery+0.046
tnc+0.045
 Klu+0.042
erd+0.038
oidal+0.038
Neg logit contributions
 Rabbit-0.034
Pan-0.032
 2048-0.031
-0.030
pain-0.029
Token direct
Token ID1277
Feature activation-1.27e+00

Pos logit contributions
allery+1.508
tnc+1.473
 Klu+1.377
 concess+1.228
erd+1.227
Neg logit contributions
 Rabbit-1.277
Pan-1.207
 2048-1.185
-1.129
pain-1.128
Token investment
Token ID4896
Feature activation-4.68e-01

Pos logit contributions
allery+2.415
tnc+2.380
 Klu+2.179
 concess+1.885
oidal+1.856
Neg logit contributions
 Rabbit-3.360
Pan-3.188
 2048-3.164
-3.038
pain-3.033
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
allery+1.277
tnc+1.245
 Klu+1.171
erd+1.063
oidal+1.053
Neg logit contributions
 Rabbit-0.876
Pan-0.822
 2048-0.805
-0.769
pain-0.758
Token Will
Token ID2561
Feature activation+5.09e-01

Pos logit contributions
allery+0.050
tnc+0.049
 Klu+0.046
erd+0.042
oidal+0.042
Neg logit contributions
 Rabbit-0.028
Pan-0.026
 2048-0.025
-0.024
pain-0.024
Token large
Token ID1588
Feature activation+1.29e-01

Pos logit contributions
 Rabbit+0.949
Pan+0.886
 2048+0.875
+0.826
pain+0.819
Neg logit contributions
allery-1.407
tnc-1.374
 Klu-1.279
erd-1.173
oidal-1.162
Token companies
Token ID2706
Feature activation-1.94e-01

Pos logit contributions
 Rabbit+0.240
Pan+0.225
 2048+0.221
+0.210
pain+0.208
Neg logit contributions
allery-0.355
tnc-0.345
 Klu-0.325
erd-0.296
oidal-0.293
Token December
Token ID3426
Feature activation-6.26e-02

Pos logit contributions
 Rabbit+0.018
Pan+0.016
 2048+0.016
+0.015
pain+0.015
Neg logit contributions
allery-0.030
tnc-0.029
 Klu-0.028
erd-0.025
oidal-0.025
Token 28
Token ID2579
Feature activation-1.32e-01

Pos logit contributions
allery+0.198
tnc+0.194
 Klu+0.184
erd+0.170
oidal+0.169
Neg logit contributions
 Rabbit-0.090
Pan-0.083
 2048-0.080
-0.076
pain-0.075
Token;
Token ID26
Feature activation+5.20e-02

Pos logit contributions
allery+0.360
tnc+0.351
 Klu+0.329
erd+0.300
oidal+0.297
Neg logit contributions
 Rabbit-0.248
Pan-0.232
 2048-0.229
-0.217
pain-0.215
Token the
Token ID262
Feature activation+6.45e-03

Pos logit contributions
 Rabbit+0.096
Pan+0.090
 2048+0.089
+0.084
pain+0.083
Neg logit contributions
allery-0.144
tnc-0.141
 Klu-0.132
erd-0.120
oidal-0.119
Token Vin
Token ID11820
Feature activation-5.29e-01

Pos logit contributions
 Rabbit+0.011
Pan+0.010
 2048+0.010
+0.010
pain+0.009
Neg logit contributions
allery-0.019
tnc-0.018
 Klu-0.017
erd-0.016
oidal-0.016
Tokencci
Token ID35764
Feature activation-1.27e+00

Pos logit contributions
allery+1.322
tnc+1.279
 Klu+1.200
erd+1.093
oidal+1.071
Neg logit contributions
 Rabbit-1.101
Pan-1.042
 2048-1.038
-0.988
---------0.976
Token La
Token ID4689
Feature activation-1.63e-01

Pos logit contributions
allery+3.674
tnc+3.551
 Klu+3.358
erd+3.108
 concess+3.064
Neg logit contributions
 Rabbit-2.105
Pan-1.967
 2048-1.964
-1.840
---------1.831
Token Plant
Token ID16561
Feature activation+1.90e-01

Pos logit contributions
allery+0.467
tnc+0.456
 Klu+0.429
erd+0.393
oidal+0.389
Neg logit contributions
 Rabbit-0.286
Pan-0.266
 2048-0.263
-0.248
pain-0.245
Tokenaci
Token ID32009
Feature activation-1.58e-03

Pos logit contributions
 Rabbit+0.418
Pan+0.395
 2048+0.390
+0.373
pain+0.370
Neg logit contributions
allery-0.458
tnc-0.444
 Klu-0.412
erd-0.372
oidal-0.367
Tokenón
Token ID18840
Feature activation+4.86e-01

Pos logit contributions
allery+0.005
tnc+0.004
 Klu+0.004
erd+0.004
oidal+0.004
Neg logit contributions
 Rabbit-0.003
Pan-0.003
 2048-0.003
-0.002
pain-0.002
Token del
Token ID1619
Feature activation+2.59e-01

Pos logit contributions
 Rabbit+0.827
Pan+0.768
 2048+0.755
+0.711
pain+0.705
Neg logit contributions
allery-1.417
tnc-1.385
 Klu-1.302
erd-1.194
oidal-1.185
Token and
Token ID290
Feature activation+1.28e-01

Pos logit contributions
allery+0.782
tnc+0.750
 Klu+0.711
erd+0.652
oidal+0.645
Neg logit contributions
 Rabbit-0.537
Pan-0.504
 2048-0.490
-0.473
pain-0.465
Token will
Token ID481
Feature activation+9.99e-02

Pos logit contributions
 Rabbit+0.237
Pan+0.221
 2048+0.218
+0.205
 Panama+0.204
Neg logit contributions
allery-0.356
tnc-0.348
 Klu-0.324
erd-0.297
oidal-0.294
Token ship
Token ID4074
Feature activation-4.18e-01

Pos logit contributions
 Rabbit+0.192
Pan+0.180
 2048+0.178
+0.168
pain+0.167
Neg logit contributions
allery-0.270
tnc-0.263
 Klu-0.246
erd-0.224
oidal-0.222
Token to
Token ID284
Feature activation-2.52e-01

Pos logit contributions
allery+1.143
tnc+1.114
 Klu+1.045
erd+0.955
oidal+0.945
Neg logit contributions
 Rabbit-0.782
Pan-0.732
 2048-0.721
-0.686
pain-0.679
Token you
Token ID345
Feature activation-2.11e-01

Pos logit contributions
allery+0.658
tnc+0.641
 Klu+0.599
erd+0.544
oidal+0.538
Neg logit contributions
 Rabbit-0.506
Pan-0.476
 2048-0.470
-0.446
pain-0.443
Token direct
Token ID1277
Feature activation-1.26e+00

Pos logit contributions
allery+0.539
tnc+0.524
 Klu+0.489
erd+0.443
oidal+0.439
Neg logit contributions
 Rabbit-0.434
Pan-0.409
 2048-0.402
-0.385
pain-0.381
Token from
Token ID422
Feature activation+2.56e-02

Pos logit contributions
allery+3.271
tnc+3.174
 Klu+3.001
erd+2.732
oidal+2.719
Neg logit contributions
 Rabbit-2.450
Pan-2.324
 2048-2.245
-2.196
pain-2.145
Token the
Token ID262
Feature activation-6.43e-02

Pos logit contributions
 Rabbit+0.047
Pan+0.043
 2048+0.043
 Panama+0.040
pain+0.040
Neg logit contributions
allery-0.072
tnc-0.071
 Klu-0.066
erd-0.060
oidal-0.060
Token manufacturer
Token ID11554
Feature activation-5.57e-01

Pos logit contributions
allery+0.193
tnc+0.189
 Klu+0.177
erd+0.164
oidal+0.162
Neg logit contributions
 Rabbit-0.105
Pan-0.096
 2048-0.095
 Panama-0.088
pain-0.088
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
allery+1.482
tnc+1.445
 Klu+1.351
erd+1.229
oidal+1.217
Neg logit contributions
 Rabbit-1.088
Pan-1.021
 2048-1.007
-0.956
pain-0.949
Token Thus
Token ID6660
Feature activation+1.27e-01

Pos logit contributions
allery+0.046
tnc+0.045
 Klu+0.042
erd+0.039
oidal+0.038
Neg logit contributions
 Rabbit-0.028
Pan-0.026
 2048-0.026
-0.024
pain-0.024
Token if
Token ID611
Feature activation-2.03e-01

Pos logit contributions
allery+0.520
tnc+0.507
 Klu+0.475
erd+0.433
oidal+0.428
Neg logit contributions
 Rabbit-0.366
Pan-0.344
 2048-0.339
-0.321
pain-0.318
Token a
Token ID257
Feature activation+5.75e-03

Pos logit contributions
allery+0.545
tnc+0.531
 Klu+0.497
erd+0.452
oidal+0.448
Neg logit contributions
 Rabbit-0.393
Pan-0.369
 2048-0.364
-0.345
pain-0.342
Token driver
Token ID4639
Feature activation-2.95e-01

Pos logit contributions
 Rabbit+0.010
Pan+0.010
 2048+0.010
+0.009
pain+0.009
Neg logit contributions
allery-0.016
tnc-0.016
 Klu-0.015
erd-0.013
oidal-0.013
Token wants
Token ID3382
Feature activation-9.03e-02

Pos logit contributions
allery+0.772
tnc+0.755
 Klu+0.712
erd+0.645
oidal+0.636
Neg logit contributions
 Rabbit-0.574
Pan-0.546
 2048-0.533
-0.512
pain-0.502
Token to
Token ID284
Feature activation-1.10e-01

Pos logit contributions
allery+0.251
tnc+0.244
 Klu+0.230
erd+0.212
oidal+0.208
Neg logit contributions
 Rabbit-0.168
Pan-0.155
 2048-0.154
-0.146
 civilian-0.144
Token turn
Token ID1210
Feature activation+8.19e-01

Pos logit contributions
allery+0.263
tnc+0.255
 Klu+0.237
erd+0.213
oidal+0.211
Neg logit contributions
 Rabbit-0.246
Pan-0.233
 2048-0.230
-0.221
pain-0.219
Token right
Token ID826
Feature activation-2.16e-01

Pos logit contributions
 Rabbit+1.476
Pan+1.362
 2048+1.349
+1.272
pain+1.263
Neg logit contributions
allery-2.319
tnc-2.265
 Klu-2.121
erd-1.943
oidal-1.932
Token at
Token ID379
Feature activation+1.85e-01

Pos logit contributions
allery+0.591
tnc+0.574
 Klu+0.540
erd+0.494
oidal+0.488
Neg logit contributions
 Rabbit-0.402
Pan-0.379
 2048-0.371
-0.354
pain-0.350
Token an
Token ID281
Feature activation+9.77e-03

Pos logit contributions
 Rabbit+0.339
Pan+0.314
 2048+0.310
+0.292
pain+0.290
Neg logit contributions
allery-0.518
tnc-0.506
 Klu-0.475
erd-0.435
oidal-0.431
Token intersection
Token ID16246
Feature activation+5.22e-01

Pos logit contributions
 Rabbit+0.019
Pan+0.018
 2048+0.018
+0.017
pain+0.017
Neg logit contributions
allery-0.026
tnc-0.025
 Klu-0.024
erd-0.022
oidal-0.021
Token then
Token ID788
Feature activation+9.91e-02

Pos logit contributions
 Rabbit+0.920
Pan+0.845
 2048+0.842
+0.789
pain+0.785
Neg logit contributions
allery-1.495
tnc-1.465
 Klu-1.370
erd-1.260
oidal-1.245
Token()
Token ID3419
Feature activation+4.47e-01

Pos logit contributions
allery+0.197
tnc+0.194
 Klu+0.183
erd+0.166
oidal+0.165
Neg logit contributions
 Rabbit-0.123
Pan-0.113
 2048-0.111
-0.108
pain-0.106
Token ap
Token ID2471
Feature activation+1.07e-01

Pos logit contributions
 Rabbit+0.826
Pan+0.774
 2048+0.766
 civilian+0.716
 Panama+0.716
Neg logit contributions
allery-1.239
tnc-1.210
 Klu-1.127
erd-1.036
oidal-1.021
Token.
Token ID13
Feature activation+3.56e-02

Pos logit contributions
 Rabbit+0.214
Pan+0.200
 2048+0.198
+0.189
pain+0.187
Neg logit contributions
allery-0.278
tnc-0.273
 Klu-0.255
erd-0.231
oidal-0.229
Tokenadd
Token ID2860
Feature activation+4.12e-01

Pos logit contributions
 Rabbit+0.094
Pan+0.090
 2048+0.089
+0.085
pain+0.085
Neg logit contributions
allery-0.070
tnc-0.068
 Klu-0.062
erd-0.054
oidal-0.053
Token_
Token ID62
Feature activation-5.88e-01

Pos logit contributions
 Rabbit+1.047
Pan+1.008
 2048+0.992
pain+0.949
--------+0.948
Neg logit contributions
allery-0.865
tnc-0.821
 Klu-0.756
erd-0.662
 concess-0.658
Tokenargument
Token ID49140
Feature activation+1.26e+00

Pos logit contributions
allery+1.184
tnc+1.164
 Klu+1.050
erd+0.939
oidal+0.921
Neg logit contributions
 Rabbit-1.525
Pan-1.431
 2048-1.430
-1.387
 civilian-1.361
Token("
Token ID7203
Feature activation+1.17e-01

Pos logit contributions
 Rabbit+1.660
Pan+1.518
 2048+1.482
+1.363
pain+1.346
Neg logit contributions
allery-4.151
tnc-4.053
 Klu-3.846
erd-3.568
oidal-3.539
Token-
Token ID12
Feature activation+1.23e-01

Pos logit contributions
 Rabbit+0.225
Pan+0.209
 2048+0.207
+0.197
pain+0.194
Neg logit contributions
allery-0.315
tnc-0.309
 Klu-0.288
erd-0.264
oidal-0.261
Tokeni
Token ID72
Feature activation+2.54e-01

Pos logit contributions
 Rabbit+0.288
Pan+0.272
 2048+0.270
+0.259
pain+0.256
Neg logit contributions
allery-0.281
tnc-0.274
 Klu-0.252
erd-0.226
oidal-0.223
Token",
Token ID1600
Feature activation+5.04e-01

Pos logit contributions
 Rabbit+0.355
 2048+0.316
Pan+0.314
+0.299
 civilian+0.288
Neg logit contributions
allery-0.810
tnc-0.806
 Klu-0.758
erd-0.706
oidal-0.700
Token "
Token ID366
Feature activation-4.30e-02

Pos logit contributions
 Rabbit+0.643
Pan+0.582
 2048+0.568
+0.525
pain+0.517
Neg logit contributions
allery-1.682
tnc-1.650
 Klu-1.565
erd-1.455
oidal-1.444
Token such
Token ID884
Feature activation-8.85e-02

Pos logit contributions
 Rabbit+0.567
Pan+0.533
 2048+0.525
+0.499
pain+0.496
Neg logit contributions
allery-0.725
tnc-0.707
 Klu-0.658
erd-0.598
oidal-0.592
Token opinions
Token ID9317
Feature activation-1.67e-01

Pos logit contributions
allery+0.234
tnc+0.228
 Klu+0.214
erd+0.194
oidal+0.192
Neg logit contributions
 Rabbit-0.174
Pan-0.163
 2048-0.161
-0.153
pain-0.151
Token were
Token ID547
Feature activation-9.09e-02

Pos logit contributions
allery+0.457
tnc+0.445
 Klu+0.418
erd+0.382
oidal+0.379
Neg logit contributions
 Rabbit-0.311
Pan-0.291
 2048-0.287
-0.272
pain-0.269
Token out
Token ID503
Feature activation+3.82e-01

Pos logit contributions
allery+0.239
tnc+0.233
 Klu+0.218
erd+0.198
oidal+0.196
Neg logit contributions
 Rabbit-0.180
Pan-0.169
 2048-0.167
-0.159
pain-0.157
Token of
Token ID286
Feature activation-2.23e-01

Pos logit contributions
 Rabbit+0.720
Pan+0.675
 2048+0.665
+0.632
pain+0.628
Neg logit contributions
allery-1.047
tnc-1.016
 Klu-0.947
erd-0.865
oidal-0.858
Token sync
Token ID17510
Feature activation+8.61e-01

Pos logit contributions
allery+0.568
tnc+0.555
 Klu+0.520
erd+0.470
oidal+0.465
Neg logit contributions
 Rabbit-0.459
Pan-0.431
 2048-0.426
-0.406
pain-0.403
Token with
Token ID351
Feature activation-1.02e-01

Pos logit contributions
 Rabbit+1.622
Pan+1.523
 2048+1.508
+1.418
--------+1.413
Neg logit contributions
allery-2.354
tnc-2.290
 Klu-2.129
erd-1.956
oidal-1.926
Token the
Token ID262
Feature activation-5.33e-02

Pos logit contributions
allery+0.275
tnc+0.267
 Klu+0.252
erd+0.228
oidal+0.225
Neg logit contributions
 Rabbit-0.197
Pan-0.185
 2048-0.182
-0.173
pain-0.172
Token times
Token ID1661
Feature activation+6.86e-02

Pos logit contributions
allery+0.141
tnc+0.137
 Klu+0.129
erd+0.117
oidal+0.116
Neg logit contributions
 Rabbit-0.105
Pan-0.098
 2048-0.097
-0.092
pain-0.091
Token.
Token ID13
Feature activation+1.48e-03

Pos logit contributions
 Rabbit+0.130
Pan+0.122
 2048+0.120
+0.114
pain+0.113
Neg logit contributions
allery-0.187
tnc-0.182
 Klu-0.170
erd-0.155
oidal-0.154
Token
Token ID198
Feature activation+1.77e-01

Pos logit contributions
 Rabbit+0.003
Pan+0.002
 2048+0.002
+0.002
pain+0.002
Neg logit contributions
allery-0.004
tnc-0.004
 Klu-0.004
erd-0.004
oidal-0.004
Token full
Token ID1336
Feature activation-2.68e-01

Pos logit contributions
 Rabbit+0.163
Pan+0.154
 2048+0.151
+0.145
pain+0.144
Neg logit contributions
allery-0.205
tnc-0.199
 Klu-0.187
erd-0.169
oidal-0.166
Token approval
Token ID7546
Feature activation-3.96e-02

Pos logit contributions
allery+0.684
tnc+0.665
 Klu+0.629
oidal+0.561
erd+0.560
Neg logit contributions
 Rabbit-0.537
Pan-0.511
 2048-0.496
-0.481
---------0.480
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
allery+0.107
tnc+0.104
 Klu+0.098
erd+0.089
oidal+0.088
Neg logit contributions
 Rabbit-0.075
Pan-0.071
 2048-0.069
-0.066
pain-0.066
Token In
Token ID554
Feature activation-4.60e-02

Pos logit contributions
 Rabbit+0.022
Pan+0.021
 2048+0.020
+0.019
pain+0.019
Neg logit contributions
allery-0.038
tnc-0.037
 Klu-0.035
erd-0.032
oidal-0.032
Token the
Token ID262
Feature activation+5.07e-02

Pos logit contributions
allery+0.126
tnc+0.123
 Klu+0.116
erd+0.106
oidal+0.104
Neg logit contributions
 Rabbit-0.085
Pan-0.080
 2048-0.078
-0.074
pain-0.074
Token meantime
Token ID14324
Feature activation+8.45e-01

Pos logit contributions
 Rabbit+0.110
Pan+0.104
 2048+0.102
+0.098
pain+0.097
Neg logit contributions
allery-0.124
tnc-0.120
 Klu-0.112
erd-0.101
oidal-0.100
Token,
Token ID11
Feature activation-6.86e-02

Pos logit contributions
 Rabbit+1.367
Pan+1.245
 2048+1.237
+1.151
--------+1.149
Neg logit contributions
allery-2.526
tnc-2.482
 Klu-2.328
erd-2.146
oidal-2.130
Token make
Token ID787
Feature activation+1.13e-01

Pos logit contributions
allery+0.183
tnc+0.178
 Klu+0.166
erd+0.151
oidal+0.150
Neg logit contributions
 Rabbit-0.134
Pan-0.126
 2048-0.124
-0.118
pain-0.117
Token sure
Token ID1654
Feature activation+4.05e-01

Pos logit contributions
 Rabbit+0.204
Pan+0.191
 2048+0.188
+0.178
pain+0.176
Neg logit contributions
allery-0.316
tnc-0.308
 Klu-0.289
erd-0.265
oidal-0.262
Token to
Token ID284
Feature activation-7.03e-02

Pos logit contributions
 Rabbit+0.793
Pan+0.733
 2048+0.732
+0.689
 Panama+0.681
Neg logit contributions
allery-1.086
tnc-1.059
 Klu-0.990
erd-0.899
oidal-0.891
Token check
Token ID2198
Feature activation-5.28e-02

Pos logit contributions
allery+0.163
tnc+0.159
 Klu+0.147
erd+0.132
oidal+0.131
Neg logit contributions
 Rabbit-0.160
Pan-0.152
 2048-0.150
-0.145
pain-0.144
Token
Token ID198
Feature activation-4.39e-02

Pos logit contributions
allery+0.132
tnc+0.131
 Klu+0.123
erd+0.112
oidal+0.111
Neg logit contributions
 Rabbit-0.093
Pan-0.087
 2048-0.086
-0.082
pain-0.081
TokenHere
Token ID4342
Feature activation+1.16e-01

Pos logit contributions
allery+0.123
tnc+0.120
 Klu+0.113
erd+0.103
oidal+0.102
Neg logit contributions
 Rabbit-0.080
Pan-0.074
 2048-0.073
-0.069
pain-0.068
Token are
Token ID389
Feature activation+1.03e-02

Pos logit contributions
 Rabbit+0.253
Pan+0.238
 2048+0.234
+0.224
pain+0.223
Neg logit contributions
allery-0.282
tnc-0.275
 Klu-0.254
erd-0.229
oidal-0.227
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
 Rabbit+0.018
Pan+0.017
 2048+0.016
+0.016
pain+0.015
Neg logit contributions
allery-0.030
tnc-0.029
 Klu-0.027
erd-0.025
oidal-0.025
Token six
Token ID2237
Feature activation+2.86e-02

Pos logit contributions
allery+0.277
tnc+0.270
 Klu+0.254
erd+0.231
oidal+0.229
Neg logit contributions
 Rabbit-0.190
Pan-0.178
 2048-0.175
-0.167
pain-0.165
Token camera
Token ID4676
Feature activation+8.99e-01

Pos logit contributions
 Rabbit+0.056
Pan+0.053
 2048+0.052
+0.050
pain+0.049
Neg logit contributions
allery-0.075
tnc-0.074
 Klu-0.069
erd-0.062
oidal-0.062
Token accessories
Token ID18199
Feature activation-3.11e-01

Pos logit contributions
 Rabbit+1.802
Pan+1.682
 2048+1.648
+1.571
pain+1.565
Neg logit contributions
allery-2.365
tnc-2.302
 Klu-2.148
erd-1.950
oidal-1.928
Token Adam
Token ID7244
Feature activation-4.29e-02

Pos logit contributions
allery+0.804
tnc+0.783
 Klu+0.735
erd+0.664
oidal+0.657
Neg logit contributions
 Rabbit-0.624
Pan-0.590
 2048-0.581
-0.557
pain-0.550
Token Pretty
Token ID20090
Feature activation-4.14e-01

Pos logit contributions
allery+0.115
tnc+0.112
 Klu+0.106
erd+0.096
oidal+0.095
Neg logit contributions
 Rabbit-0.082
Pan-0.078
 2048-0.077
-0.073
pain-0.072
Token would
Token ID561
Feature activation-2.09e-01

Pos logit contributions
allery+1.112
tnc+1.078
 Klu+1.016
erd+0.931
oidal+0.912
Neg logit contributions
 Rabbit-0.790
Pan-0.744
 2048-0.734
-0.702
---------0.690
Token recommend
Token ID4313
Feature activation-6.28e-01

Pos logit contributions
allery+0.575
tnc+0.559
 Klu+0.524
erd+0.480
oidal+0.478
Neg logit contributions
 Rabbit-0.390
Pan-0.365
 2048-0.359
-0.343
pain-0.338
Token said
Token ID531
Feature activation+3.94e-02

Pos logit contributions
 Rabbit+0.351
Pan+0.333
 2048+0.329
+0.316
pain+0.314
Neg logit contributions
allery-0.342
tnc-0.332
 Klu-0.306
erd-0.273
oidal-0.271
Token in
Token ID287
Feature activation+2.05e-01

Pos logit contributions
 Rabbit+0.073
Pan+0.069
 2048+0.068
+0.064
 civilian+0.064
Neg logit contributions
allery-0.109
tnc-0.106
 Klu-0.099
erd-0.090
oidal-0.090
Token a
Token ID257
Feature activation+4.69e-02

Pos logit contributions
 Rabbit+0.377
Pan+0.352
 2048+0.346
+0.328
pain+0.325
Neg logit contributions
allery-0.573
tnc-0.558
 Klu-0.521
erd-0.478
oidal-0.475
Token statement
Token ID2643
Feature activation+5.86e-01

Pos logit contributions
 Rabbit+0.105
Pan+0.100
 2048+0.098
+0.094
pain+0.093
Neg logit contributions
allery-0.111
tnc-0.108
 Klu-0.100
erd-0.090
oidal-0.089
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
 Rabbit+1.100
Pan+1.029
 2048+1.016
+0.961
pain+0.956
Neg logit contributions
allery-1.610
tnc-1.572
 Klu-1.463
erd-1.336
oidal-1.328
Token
Token ID564
Feature activation+2.00e-02

Pos logit contributions
 Rabbit+0.040
Pan+0.037
 2048+0.036
+0.034
--------+0.034
Neg logit contributions
allery-0.065
tnc-0.063
 Klu-0.059
erd-0.055
oidal-0.054
Token
Token ID250
Feature activation+1.62e-01

Pos logit contributions
 Rabbit+0.024
Pan+0.022
 2048+0.021
+0.019
pain+0.019
Neg logit contributions
allery-0.069
tnc-0.067
 Klu-0.064
erd-0.060
oidal-0.059
TokenThe
Token ID464
Feature activation+1.21e-01

Pos logit contributions
 Rabbit+0.297
Pan+0.277
 2048+0.274
+0.259
pain+0.257
Neg logit contributions
allery-0.451
tnc-0.440
 Klu-0.413
erd-0.378
oidal-0.374
Token United
Token ID1578
Feature activation+2.05e-01

Pos logit contributions
 Rabbit+0.204
Pan+0.190
 2048+0.187
+0.176
pain+0.174
Neg logit contributions
allery-0.355
tnc-0.347
 Klu-0.325
erd-0.300
oidal-0.297
Token States
Token ID1829
Feature activation+2.32e-01

Pos logit contributions
 Rabbit+0.262
Pan+0.241
 2048+0.235
+0.216
pain+0.212
Neg logit contributions
allery-0.686
tnc-0.671
 Klu-0.634
erd-0.590
oidal-0.587
Token intends
Token ID19582
Feature activation+1.97e-01

Pos logit contributions
 Rabbit+0.448
Pan+0.421
 2048+0.415
+0.393
pain+0.391
Neg logit contributions
allery-0.621
tnc-0.606
 Klu-0.565
erd-0.515
oidal-0.511
Token a
Token ID257
Feature activation+6.05e-02

Pos logit contributions
 Rabbit+0.052
Pan+0.049
 2048+0.048
+0.046
pain+0.045
Neg logit contributions
allery-0.073
tnc-0.071
 Klu-0.066
erd-0.061
oidal-0.060
Token definite
Token ID21892
Feature activation+4.43e-02

Pos logit contributions
 Rabbit+0.128
Pan+0.121
 2048+0.119
+0.114
pain+0.113
Neg logit contributions
allery-0.151
tnc-0.147
 Klu-0.137
erd-0.124
oidal-0.122
Token push
Token ID4574
Feature activation+3.64e-01

Pos logit contributions
 Rabbit+0.091
Pan+0.086
 2048+0.084
+0.080
pain+0.080
Neg logit contributions
allery-0.114
tnc-0.111
 Klu-0.103
erd-0.094
oidal-0.093
Token to
Token ID284
Feature activation+2.64e-01

Pos logit contributions
 Rabbit+0.696
Pan+0.653
 2048+0.643
+0.610
pain+0.605
Neg logit contributions
allery-0.985
tnc-0.960
 Klu-0.899
erd-0.820
oidal-0.812
Token keep
Token ID1394
Feature activation+4.54e-01

Pos logit contributions
 Rabbit+0.574
Pan+0.543
 2048+0.536
+0.511
pain+0.508
Neg logit contributions
allery-0.646
tnc-0.627
 Klu-0.583
erd-0.525
oidal-0.519
Token this
Token ID428
Feature activation+3.00e-01

Pos logit contributions
 Rabbit+0.830
Pan+0.774
 2048+0.765
+0.719
pain+0.713
Neg logit contributions
allery-1.268
tnc-1.237
 Klu-1.158
erd-1.061
oidal-1.050
Token momentum
Token ID12858
Feature activation+1.82e-01

Pos logit contributions
 Rabbit+0.546
Pan+0.509
 2048+0.503
+0.473
pain+0.470
Neg logit contributions
allery-0.839
tnc-0.818
 Klu-0.767
erd-0.702
oidal-0.695
Token going
Token ID1016
Feature activation+2.54e-01

Pos logit contributions
 Rabbit+0.303
Pan+0.280
 2048+0.277
+0.259
pain+0.257
Neg logit contributions
allery-0.542
tnc-0.528
 Klu-0.497
erd-0.457
oidal-0.453
Token,"
Token ID553
Feature activation-3.22e-01

Pos logit contributions
 Rabbit+0.482
Pan+0.451
 2048+0.447
+0.420
pain+0.418
Neg logit contributions
allery-0.691
tnc-0.673
 Klu-0.629
erd-0.573
oidal-0.568
Token he
Token ID339
Feature activation-6.30e-02

Pos logit contributions
allery+0.816
tnc+0.795
 Klu+0.740
erd+0.670
oidal+0.663
Neg logit contributions
 Rabbit-0.671
Pan-0.632
 2048-0.624
-0.594
pain-0.590
Token signs
Token ID5895
Feature activation-9.04e-02

Pos logit contributions
allery+0.139
tnc+0.135
 Klu+0.124
erd+0.109
oidal+0.109
Neg logit contributions
 Rabbit-0.152
Pan-0.145
 2048-0.143
-0.137
pain-0.136
Token fuzzy
Token ID34669
Feature activation+2.10e-01

Pos logit contributions
 Rabbit+0.085
Pan+0.080
 2048+0.079
+0.075
pain+0.074
Neg logit contributions
allery-0.128
tnc-0.125
 Klu-0.118
erd-0.107
oidal-0.106
Token to
Token ID284
Feature activation+2.90e-01

Pos logit contributions
 Rabbit+0.401
Pan+0.376
 2048+0.371
+0.351
pain+0.349
Neg logit contributions
allery-0.570
tnc-0.556
 Klu-0.520
erd-0.474
oidal-0.469
Token boot
Token ID6297
Feature activation+1.11e-01

Pos logit contributions
 Rabbit+0.567
Pan+0.532
 2048+0.527
+0.498
pain+0.493
Neg logit contributions
allery-0.773
tnc-0.754
 Klu-0.703
erd-0.641
oidal-0.633
Token.
Token ID13
Feature activation+1.52e-01

Pos logit contributions
 Rabbit+0.207
Pan+0.194
 2048+0.192
+0.181
pain+0.180
Neg logit contributions
allery-0.304
tnc-0.297
 Klu-0.278
erd-0.253
oidal-0.251
Token Due
Token ID14444
Feature activation+2.86e-01

Pos logit contributions
 Rabbit+0.259
Pan+0.241
 2048+0.237
+0.223
pain+0.221
Neg logit contributions
allery-0.444
tnc-0.433
 Klu-0.408
erd-0.374
oidal-0.371
Token to
Token ID284
Feature activation+7.84e-02

Pos logit contributions
 Rabbit+0.524
Pan+0.492
 2048+0.488
+0.466
pain+0.457
Neg logit contributions
allery-0.790
tnc-0.776
 Klu-0.727
erd-0.666
oidal-0.655
Token the
Token ID262
Feature activation+1.17e-01

Pos logit contributions
 Rabbit+0.148
Pan+0.139
 2048+0.137
+0.129
pain+0.129
Neg logit contributions
allery-0.214
tnc-0.209
 Klu-0.195
erd-0.178
oidal-0.176
Token third
Token ID2368
Feature activation-2.33e-01

Pos logit contributions
 Rabbit+0.235
Pan+0.221
 2048+0.218
+0.207
pain+0.206
Neg logit contributions
allery-0.304
tnc-0.296
 Klu-0.276
erd-0.251
oidal-0.248
Token game
Token ID983
Feature activation+4.25e-01

Pos logit contributions
allery+0.652
tnc+0.634
 Klu+0.597
erd+0.544
oidal+0.542
Neg logit contributions
 Rabbit-0.419
Pan-0.393
 2048-0.385
-0.366
pain-0.362
Token
Token ID447
Feature activation+2.13e-01

Pos logit contributions
 Rabbit+0.840
Pan+0.791
 2048+0.780
+0.741
pain+0.737
Neg logit contributions
allery-1.121
tnc-1.097
 Klu-1.020
erd-0.929
oidal-0.918
Token
Token ID247
Feature activation+1.48e-01

Pos logit contributions
 Rabbit+0.328
Pan+0.304
 2048+0.295
pain+0.278
 Panama+0.276
Neg logit contributions
allery-0.648
tnc-0.635
 Klu-0.603
erd-0.558
oidal-0.553
Token batch
Token ID15458
Feature activation-7.87e-01

Pos logit contributions
allery+0.379
tnc+0.369
 Klu+0.347
erd+0.317
oidal+0.314
Neg logit contributions
 Rabbit-0.246
Pan-0.232
 2048-0.227
-0.217
pain-0.214
Tokening
Token ID278
Feature activation+1.21e-01

Pos logit contributions
allery+1.859
tnc+1.812
 Klu+1.678
erd+1.529
oidal+1.512
Neg logit contributions
 Rabbit-1.719
Pan-1.635
 2048-1.611
-1.549
---------1.531
Token in
Token ID287
Feature activation-5.59e-02

Pos logit contributions
 Rabbit+0.224
Pan+0.209
 2048+0.206
+0.195
pain+0.193
Neg logit contributions
allery-0.337
tnc-0.328
 Klu-0.308
erd-0.281
oidal-0.279
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
allery+0.157
tnc+0.153
 Klu+0.144
erd+0.132
oidal+0.131
Neg logit contributions
 Rabbit-0.101
Pan-0.094
 2048-0.093
-0.088
pain-0.087
Token stub
Token ID17071
Feature activation-5.75e-01

Pos logit contributions
allery+0.597
tnc+0.581
 Klu+0.544
erd+0.495
oidal+0.490
Neg logit contributions
 Rabbit-0.437
Pan-0.411
 2048-0.405
-0.385
pain-0.382
Token library
Token ID5888
Feature activation+3.58e-01

Pos logit contributions
allery+1.501
tnc+1.452
 Klu+1.358
erd+1.238
oidal+1.227
Neg logit contributions
 Rabbit-1.147
Pan-1.079
 2048-1.060
-1.014
pain-1.004
Token code
Token ID2438
Feature activation+1.44e-01

Pos logit contributions
 Rabbit+0.649
Pan+0.605
 2048+0.598
+0.563
pain+0.558
Neg logit contributions
allery-1.006
tnc-0.985
 Klu-0.922
erd-0.845
oidal-0.836
Token
Token ID198
Feature activation-6.24e-03

Pos logit contributions
 Rabbit+0.266
Pan+0.249
 2048+0.245
+0.232
pain+0.230
Neg logit contributions
allery-0.397
tnc-0.386
 Klu-0.362
erd-0.332
oidal-0.328
Token
Token ID198
Feature activation+2.62e-02

Pos logit contributions
allery+0.017
tnc+0.016
 Klu+0.015
erd+0.014
oidal+0.014
Neg logit contributions
 Rabbit-0.012
Pan-0.011
 2048-0.011
-0.010
pain-0.010
TokenSerial
Token ID32634
Feature activation-1.56e-01

Pos logit contributions
 Rabbit+0.050
Pan+0.046
 2048+0.046
+0.043
pain+0.043
Neg logit contributions
allery-0.071
tnc-0.069
 Klu-0.065
erd-0.059
oidal-0.059
Tokenization
Token ID1634
Feature activation-9.65e-02

Pos logit contributions
allery+0.344
tnc+0.334
 Klu+0.308
erd+0.274
oidal+0.272
Neg logit contributions
 Rabbit-0.375
Pan-0.357
 2048-0.353
-0.340
pain-0.337
Token two
Token ID734
Feature activation+1.73e-01

Pos logit contributions
allery+0.133
tnc+0.129
 Klu+0.121
erd+0.110
oidal+0.109
Neg logit contributions
 Rabbit-0.091
Pan-0.086
 2048-0.084
-0.080
pain-0.079
Token goals
Token ID4661
Feature activation-4.53e-01

Pos logit contributions
 Rabbit+0.337
Pan+0.317
 2048+0.312
+0.297
pain+0.294
Neg logit contributions
allery-0.461
tnc-0.450
 Klu-0.421
erd-0.383
oidal-0.379
Token were
Token ID547
Feature activation-2.02e-01

Pos logit contributions
allery+1.241
tnc+1.200
 Klu+1.138
erd+1.031
oidal+1.022
Neg logit contributions
 Rabbit-0.847
Pan-0.798
 2048-0.782
-0.739
pain-0.733
Token erected
Token ID28415
Feature activation+2.92e-01

Pos logit contributions
allery+0.508
tnc+0.495
 Klu+0.464
erd+0.417
oidal+0.413
Neg logit contributions
 Rabbit-0.420
Pan-0.398
 2048-0.390
-0.374
pain-0.371
Token.
Token ID13
Feature activation-6.41e-02

Pos logit contributions
 Rabbit+0.522
Pan+0.487
 2048+0.480
+0.453
pain+0.450
Neg logit contributions
allery-0.828
tnc-0.808
 Klu-0.758
erd-0.695
oidal-0.688
Token
Token ID198
Feature activation+6.75e-02

Pos logit contributions
allery+0.183
tnc+0.179
 Klu+0.169
erd+0.154
oidal+0.153
Neg logit contributions
 Rabbit-0.111
Pan-0.104
 2048-0.102
-0.097
pain-0.096
Token
Token ID198
Feature activation+4.74e-02

Pos logit contributions
 Rabbit+0.126
Pan+0.118
 2048+0.116
+0.111
pain+0.109
Neg logit contributions
allery-0.184
tnc-0.181
 Klu-0.170
erd-0.155
oidal-0.153
TokenLe
Token ID3123
Feature activation-2.79e-01

Pos logit contributions
 Rabbit+0.084
Pan+0.077
 2048+0.077
+0.072
pain+0.072
Neg logit contributions
allery-0.135
tnc-0.132
 Klu-0.124
erd-0.114
oidal-0.113
Tokenigh
Token ID394
Feature activation+5.52e-01

Pos logit contributions
allery+0.760
tnc+0.745
 Klu+0.698
erd+0.640
oidal+0.630
Neg logit contributions
 Rabbit-0.527
 2048-0.491
Pan-0.490
-0.456
pain-0.455
Token Wi
Token ID11759
Feature activation+9.87e-02

Pos logit contributions
 Rabbit+1.312
Pan+1.246
 2048+1.235
+1.172
 Panama+1.169
Neg logit contributions
allery-1.232
tnc-1.198
 Klu-1.078
erd-0.974
 concess-0.963
Tokenener
Token ID877
Feature activation-2.68e-01

Pos logit contributions
 Rabbit+0.254
Pan+0.242
 2048+0.240
+0.231
pain+0.230
Neg logit contributions
allery-0.201
tnc-0.195
 Klu-0.178
erd-0.157
oidal-0.154
Token planned
Token ID6027
Feature activation+6.56e-02

Pos logit contributions
allery+0.090
tnc+0.088
 Klu+0.083
erd+0.075
oidal+0.074
Neg logit contributions
 Rabbit-0.067
Pan-0.063
 2048-0.062
-0.059
pain-0.058
Token to
Token ID284
Feature activation+1.69e-01

Pos logit contributions
 Rabbit+0.124
Pan+0.116
 2048+0.115
+0.109
pain+0.108
Neg logit contributions
allery-0.179
tnc-0.174
 Klu-0.163
erd-0.149
oidal-0.148
Token happen
Token ID1645
Feature activation-2.09e-01

Pos logit contributions
 Rabbit+0.337
Pan+0.317
 2048+0.312
+0.298
pain+0.297
Neg logit contributions
allery-0.443
tnc-0.432
 Klu-0.404
erd-0.367
oidal-0.364
Token in
Token ID287
Feature activation-9.08e-02

Pos logit contributions
allery+0.567
tnc+0.552
 Klu+0.522
erd+0.473
oidal+0.469
Neg logit contributions
 Rabbit-0.391
Pan-0.368
 2048-0.356
-0.347
pain-0.339
Token two
Token ID734
Feature activation+4.20e-01

Pos logit contributions
allery+0.257
tnc+0.252
 Klu+0.238
erd+0.216
oidal+0.214
Neg logit contributions
 Rabbit-0.160
Pan-0.150
 2048-0.146
-0.141
pain-0.138
Token phases
Token ID21164
Feature activation-1.38e-01

Pos logit contributions
 Rabbit+0.782
Pan+0.732
 2048+0.721
+0.683
pain+0.677
Neg logit contributions
allery-1.160
tnc-1.131
 Klu-1.060
erd-0.968
oidal-0.959
Token,
Token ID11
Feature activation-4.58e-02

Pos logit contributions
allery+0.381
tnc+0.372
 Klu+0.351
erd+0.319
oidal+0.317
Neg logit contributions
 Rabbit-0.254
Pan-0.238
 2048-0.233
-0.223
pain-0.219
Token according
Token ID1864
Feature activation-7.10e-02

Pos logit contributions
allery+0.130
tnc+0.127
 Klu+0.120
erd+0.109
oidal+0.108
Neg logit contributions
 Rabbit-0.081
Pan-0.076
 2048-0.074
-0.071
pain-0.070
Token to
Token ID284
Feature activation-2.73e-01

Pos logit contributions
allery+0.180
tnc+0.178
 Klu+0.167
erd+0.151
oidal+0.149
Neg logit contributions
 Rabbit-0.144
Pan-0.136
 2048-0.133
-0.129
pain-0.127
Token project
Token ID1628
Feature activation-9.54e-02

Pos logit contributions
allery+0.758
tnc+0.742
 Klu+0.700
erd+0.637
oidal+0.628
Neg logit contributions
 Rabbit-0.493
Pan-0.465
 2048-0.455
-0.433
pain-0.428
Token manager
Token ID4706
Feature activation+2.77e-01

Pos logit contributions
allery+0.257
tnc+0.252
 Klu+0.236
erd+0.214
oidal+0.212
Neg logit contributions
 Rabbit-0.182
Pan-0.171
 2048-0.169
-0.159
pain-0.158
Token areas
Token ID3006
Feature activation-1.41e-01

Pos logit contributions
 Rabbit+0.529
Pan+0.495
 2048+0.488
+0.462
pain+0.459
Neg logit contributions
allery-0.767
tnc-0.748
 Klu-0.701
erd-0.640
oidal-0.633
Token such
Token ID884
Feature activation+5.28e-03

Pos logit contributions
allery+0.388
tnc+0.378
 Klu+0.355
erd+0.323
oidal+0.320
Neg logit contributions
 Rabbit-0.264
Pan-0.248
 2048-0.243
-0.231
pain-0.229
Token as
Token ID355
Feature activation+1.80e-02

Pos logit contributions
 Rabbit+0.011
Pan+0.011
 2048+0.010
+0.010
pain+0.010
Neg logit contributions
allery-0.013
tnc-0.013
 Klu-0.012
erd-0.011
oidal-0.011
Token agriculture
Token ID14510
Feature activation-4.55e-02

Pos logit contributions
 Rabbit+0.036
Pan+0.034
 2048+0.034
+0.032
pain+0.032
Neg logit contributions
allery-0.047
tnc-0.046
 Klu-0.043
erd-0.039
oidal-0.038
Token.
Token ID13
Feature activation-9.60e-02

Pos logit contributions
allery+0.130
tnc+0.127
 Klu+0.119
erd+0.109
oidal+0.108
Neg logit contributions
 Rabbit-0.080
Pan-0.074
 2048-0.073
-0.069
pain-0.068
Token Today
Token ID6288
Feature activation-2.35e-02

Pos logit contributions
allery+0.277
tnc+0.272
 Klu+0.258
erd+0.234
oidal+0.233
Neg logit contributions
 Rabbit-0.163
Pan-0.152
 2048-0.149
-0.142
pain-0.141
Token,
Token ID11
Feature activation-4.70e-02

Pos logit contributions
allery+0.065
tnc+0.064
 Klu+0.060
erd+0.055
oidal+0.054
Neg logit contributions
 Rabbit-0.043
Pan-0.040
 2048-0.040
-0.037
pain-0.037
Token we
Token ID356
Feature activation-1.09e-01

Pos logit contributions
allery+0.129
tnc+0.126
 Klu+0.118
erd+0.107
oidal+0.106
Neg logit contributions
 Rabbit-0.088
Pan-0.083
 2048-0.081
-0.077
pain-0.076
Token live
Token ID2107
Feature activation-3.87e-01

Pos logit contributions
allery+0.279
tnc+0.272
 Klu+0.253
erd+0.228
oidal+0.226
Neg logit contributions
 Rabbit-0.224
Pan-0.212
 2048-0.210
-0.199
 civilian-0.197
Token in
Token ID287
Feature activation+1.97e-01

Pos logit contributions
allery+1.043
tnc+1.019
 Klu+0.954
erd+0.870
oidal+0.860
Neg logit contributions
 Rabbit-0.737
Pan-0.692
 2048-0.680
-0.646
pain-0.643
Token a
Token ID257
Feature activation+1.17e-01

Pos logit contributions
 Rabbit+0.367
Pan+0.344
 2048+0.339
+0.321
pain+0.318
Neg logit contributions
allery-0.543
tnc-0.529
 Klu-0.496
erd-0.453
oidal-0.449
Token an
Token ID281
Feature activation-2.59e-02

Pos logit contributions
allery+0.194
tnc+0.189
 Klu+0.177
erd+0.162
oidal+0.160
Neg logit contributions
 Rabbit-0.138
Pan-0.129
 2048-0.127
-0.121
pain-0.120
Token annotation
Token ID23025
Feature activation+2.50e-02

Pos logit contributions
allery+0.070
tnc+0.069
 Klu+0.064
erd+0.058
oidal+0.058
Neg logit contributions
 Rabbit-0.049
Pan-0.046
 2048-0.046
-0.043
pain-0.043
Token automatically
Token ID6338
Feature activation-2.27e-01

Pos logit contributions
 Rabbit+0.052
Pan+0.049
 2048+0.049
+0.047
pain+0.046
Neg logit contributions
allery-0.063
tnc-0.061
 Klu-0.057
erd-0.051
oidal-0.051
Token added
Token ID2087
Feature activation+9.99e-02

Pos logit contributions
allery+0.640
tnc+0.619
 Klu+0.585
erd+0.538
oidal+0.531
Neg logit contributions
 Rabbit-0.406
Pan-0.381
 2048-0.371
-0.354
pain-0.351
Token to
Token ID284
Feature activation-5.45e-02

Pos logit contributions
 Rabbit+0.192
Pan+0.180
 2048+0.177
+0.168
pain+0.167
Neg logit contributions
allery-0.270
tnc-0.263
 Klu-0.246
erd-0.224
oidal-0.222
Token all
Token ID477
Feature activation-5.08e-02

Pos logit contributions
allery+0.145
tnc+0.141
 Klu+0.132
erd+0.120
oidal+0.119
Neg logit contributions
 Rabbit-0.106
Pan-0.100
 2048-0.098
-0.094
pain-0.093
Token of
Token ID286
Feature activation+1.29e-01

Pos logit contributions
allery+0.132
tnc+0.129
 Klu+0.120
erd+0.109
oidal+0.108
Neg logit contributions
 Rabbit-0.102
Pan-0.096
 2048-0.095
-0.090
pain-0.090
Token their
Token ID511
Feature activation+4.40e-02

Pos logit contributions
 Rabbit+0.259
Pan+0.242
 2048+0.240
+0.228
pain+0.225
Neg logit contributions
allery-0.336
tnc-0.328
 Klu-0.305
erd-0.278
oidal-0.276
Token existing
Token ID4683
Feature activation+2.09e-01

Pos logit contributions
 Rabbit+0.090
Pan+0.085
 2048+0.084
+0.080
pain+0.079
Neg logit contributions
allery-0.113
tnc-0.110
 Klu-0.103
erd-0.093
oidal-0.092
Token YouTube
Token ID7444
Feature activation-9.73e-02

Pos logit contributions
 Rabbit+0.420
Pan+0.395
 2048+0.390
+0.371
pain+0.368
Neg logit contributions
allery-0.543
tnc-0.529
 Klu-0.494
erd-0.448
oidal-0.443
Token videos
Token ID5861
Feature activation+5.61e-02

Pos logit contributions
allery+0.246
tnc+0.238
 Klu+0.224
erd+0.202
oidal+0.199
Neg logit contributions
 Rabbit-0.201
Pan-0.191
 2048-0.188
-0.180
pain-0.179
Token-
Token ID12
Feature activation-4.26e-02

Pos logit contributions
allery+0.200
tnc+0.195
 Klu+0.183
erd+0.167
oidal+0.164
Neg logit contributions
 Rabbit-0.145
Pan-0.137
 2048-0.135
-0.127
 Panama-0.126
Tokenren
Token ID918
Feature activation-3.06e-01

Pos logit contributions
allery+0.111
tnc+0.109
 Klu+0.101
erd+0.092
oidal+0.091
Neg logit contributions
 Rabbit-0.086
Pan-0.080
 2048-0.079
-0.075
 Panama-0.074
Tokenouncing
Token ID18155
Feature activation+1.98e-01

Pos logit contributions
allery+1.029
tnc+1.011
 Klu+0.952
oidal+0.893
erd+0.891
Neg logit contributions
 Rabbit-0.382
Pan-0.343
 2048-0.335
-0.310
 Panama-0.303
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
 Rabbit+0.373
Pan+0.350
 2048+0.345
+0.327
pain+0.324
Neg logit contributions
allery-0.538
tnc-0.525
 Klu-0.492
erd-0.449
oidal-0.444
Token Values
Token ID27068
Feature activation-5.75e-01

Pos logit contributions
allery+0.599
tnc+0.586
 Klu+0.555
erd+0.504
oidal+0.500
Neg logit contributions
 Rabbit-0.368
Pan-0.344
 2048-0.339
-0.321
pain-0.319
Token created
Token ID2727
Feature activation-2.02e-01

Pos logit contributions
allery+1.550
tnc+1.508
 Klu+1.417
erd+1.292
oidal+1.279
Neg logit contributions
 Rabbit-1.100
Pan-1.030
 2048-1.012
-0.962
pain-0.954
Token out
Token ID503
Feature activation+2.05e-01

Pos logit contributions
allery+0.563
tnc+0.548
 Klu+0.515
erd+0.471
oidal+0.466
Neg logit contributions
 Rabbit-0.369
Pan-0.345
 2048-0.340
-0.321
pain-0.319
Token of
Token ID286
Feature activation-5.68e-01

Pos logit contributions
 Rabbit+0.375
Pan+0.352
 2048+0.347
+0.330
pain+0.329
Neg logit contributions
allery-0.571
tnc-0.557
 Klu-0.520
erd-0.474
oidal-0.472
Token the
Token ID262
Feature activation-9.57e-02

Pos logit contributions
allery+1.516
tnc+1.475
 Klu+1.383
erd+1.255
oidal+1.239
Neg logit contributions
 Rabbit-1.108
Pan-1.042
 2048-1.020
-0.973
pain-0.956
Token will
Token ID481
Feature activation+1.23e-01

Pos logit contributions
allery+0.254
tnc+0.247
 Klu+0.231
erd+0.210
oidal+0.208
Neg logit contributions
 Rabbit-0.188
Pan-0.177
 2048-0.174
-0.165
pain-0.164
Token to
Token ID284
Feature activation-1.08e-02

Pos logit contributions
 Rabbit+0.231
Pan+0.217
 2048+0.214
+0.203
pain+0.201
Neg logit contributions
allery-0.339
tnc-0.330
 Klu-0.309
erd-0.281
oidal-0.279
Token in
Token ID287
Feature activation-1.14e-01

Pos logit contributions
allery+1.360
tnc+1.322
 Klu+1.255
erd+1.133
oidal+1.120
Neg logit contributions
 Rabbit-1.016
Pan-0.961
 2048-0.939
-0.908
pain-0.887
Token the
Token ID262
Feature activation-1.83e-01

Pos logit contributions
allery+0.311
tnc+0.303
 Klu+0.284
erd+0.259
oidal+0.256
Neg logit contributions
 Rabbit-0.216
Pan-0.202
 2048-0.199
-0.189
pain-0.188
Token Lif
Token ID21073
Feature activation-2.09e-03

Pos logit contributions
allery+0.538
tnc+0.525
 Klu+0.494
erd+0.454
oidal+0.450
Neg logit contributions
 Rabbit-0.308
Pan-0.286
 2048-0.281
-0.265
pain-0.262
Tokeneline
Token ID4470
Feature activation+1.14e-01

Pos logit contributions
allery+0.005
tnc+0.005
 Klu+0.005
erd+0.004
oidal+0.004
Neg logit contributions
 Rabbit-0.005
Pan-0.004
 2048-0.004
-0.004
pain-0.004
Token program
Token ID1430
Feature activation-2.01e-01

Pos logit contributions
 Rabbit+0.198
Pan+0.184
 2048+0.181
+0.170
pain+0.169
Neg logit contributions
allery-0.329
tnc-0.321
 Klu-0.301
erd-0.276
oidal-0.274
Token.
Token ID13
Feature activation-8.07e-02

Pos logit contributions
allery+0.549
tnc+0.534
 Klu+0.503
erd+0.459
oidal+0.453
Neg logit contributions
 Rabbit-0.377
Pan-0.355
 2048-0.348
-0.333
pain-0.328
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
allery+0.241
tnc+0.236
 Klu+0.222
erd+0.204
oidal+0.203
Neg logit contributions
 Rabbit-0.131
Pan-0.122
 2048-0.120
-0.112
pain-0.111
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
allery+0.538
tnc+0.536
 Klu+0.502
erd+0.455
oidal+0.452
Neg logit contributions
 Rabbit-0.393
Pan-0.365
 2048-0.361
-0.350
pain-0.341
TokenThe
Token ID464
Feature activation-2.20e-02

Pos logit contributions
allery+0.600
tnc+0.589
 Klu+0.551
erd+0.504
oidal+0.500
Neg logit contributions
 Rabbit-0.393
 2048-0.362
Pan-0.362
-0.341
 Panama-0.337
Token company
Token ID1664
Feature activation-5.79e-02

Pos logit contributions
allery+0.057
tnc+0.055
 Klu+0.051
erd+0.047
oidal+0.046
Neg logit contributions
 Rabbit-0.045
Pan-0.043
 2048-0.042
-0.040
pain-0.040
Token received
Token ID2722
Feature activation+9.42e-02

Pos logit contributions
allery+0.149
tnc+0.144
 Klu+0.136
erd+0.123
oidal+0.121
Neg logit contributions
 Rabbit-0.118
Pan-0.111
 2048-0.110
-0.105
pain-0.104
Token.
Token ID13
Feature activation-4.14e-02

Pos logit contributions
allery+0.254
tnc+0.248
 Klu+0.232
erd+0.211
oidal+0.208
Neg logit contributions
 Rabbit-0.198
Pan-0.186
 2048-0.183
-0.176
pain-0.173
Token0
Token ID15
Feature activation+1.90e-01

Pos logit contributions
allery+0.098
tnc+0.095
 Klu+0.089
erd+0.079
oidal+0.078
Neg logit contributions
 Rabbit-0.093
Pan-0.088
 2048-0.087
-0.083
pain-0.083
Tokenf
Token ID69
Feature activation+2.55e-01

Pos logit contributions
 Rabbit+0.333
Pan+0.314
 2048+0.310
+0.292
pain+0.291
Neg logit contributions
allery-0.544
tnc-0.530
 Klu-0.502
erd-0.454
oidal-0.452
Token);
Token ID1776
Feature activation+7.38e-01

Pos logit contributions
 Rabbit+0.561
 2048+0.531
Pan+0.530
+0.508
--------+0.501
Neg logit contributions
allery-0.614
tnc-0.600
 Klu-0.560
erd-0.498
oidal-0.491
Token RAW
Token ID33782
Feature activation-1.25e-01

Pos logit contributions
 Rabbit+1.427
Pan+1.325
 2048+1.324
+1.237
pain+1.236
Neg logit contributions
allery-1.987
tnc-1.953
 Klu-1.809
erd-1.661
oidal-1.643
Token_
Token ID62
Feature activation-8.05e-01

Pos logit contributions
allery+0.239
tnc+0.234
 Klu+0.212
erd+0.186
oidal+0.183
Neg logit contributions
 Rabbit-0.336
Pan-0.318
 2048-0.316
-0.307
 civilian-0.304
TokenASS
Token ID10705
Feature activation+9.81e-03

Pos logit contributions
allery+2.027
tnc+1.978
 Klu+1.831
erd+1.666
oidal+1.643
Neg logit contributions
 Rabbit-1.687
Pan-1.567
 2048-1.561
-1.487
 Panama-1.473
TokenERT
Token ID17395
Feature activation+3.91e-01

Pos logit contributions
 Rabbit+0.019
Pan+0.018
 2048+0.018
pain+0.017
 civilian+0.017
Neg logit contributions
allery-0.026
tnc-0.025
 Klu-0.024
erd-0.022
oidal-0.021
Token(
Token ID7
Feature activation+3.59e-01

Pos logit contributions
 Rabbit+0.530
Pan+0.491
 2048+0.478
+0.450
pain+0.443
Neg logit contributions
allery-1.270
tnc-1.242
 Klu-1.181
erd-1.093
oidal-1.078
Tokenit
Token ID270
Feature activation+9.74e-02

Pos logit contributions
 Rabbit+0.925
Pan+0.907
 2048+0.879
pain+0.867
sec+0.855
Neg logit contributions
allery-0.725
tnc-0.689
 Klu-0.647
erd-0.553
oidal-0.547
Token->
Token ID3784
Feature activation-2.91e-01

Pos logit contributions
 Rabbit+0.186
Pan+0.174
 2048+0.172
+0.163
pain+0.162
Neg logit contributions
allery-0.264
tnc-0.257
 Klu-0.241
erd-0.220
oidal-0.218
Token
Token ID447
Feature activation+3.40e-02

Pos logit contributions
allery+1.239
tnc+1.210
 Klu+1.143
erd+1.047
oidal+1.040
Neg logit contributions
 Rabbit-0.692
Pan-0.646
 2048-0.633
-0.602
pain-0.594
Token
Token ID251
Feature activation-5.67e-02

Pos logit contributions
 Rabbit+0.053
Pan+0.050
 2048+0.048
pain+0.046
 Panama+0.045
Neg logit contributions
allery-0.102
tnc-0.100
 Klu-0.096
erd-0.089
oidal-0.087
Token on
Token ID319
Feature activation+9.32e-02

Pos logit contributions
allery+0.159
tnc+0.155
 Klu+0.146
erd+0.134
oidal+0.132
Neg logit contributions
 Rabbit-0.102
Pan-0.095
 2048-0.094
-0.089
pain-0.088
Token a
Token ID257
Feature activation-1.50e-01

Pos logit contributions
 Rabbit+0.176
Pan+0.164
 2048+0.162
+0.154
pain+0.152
Neg logit contributions
allery-0.254
tnc-0.248
 Klu-0.233
erd-0.212
oidal-0.210
Token Regular
Token ID23603
Feature activation-3.67e-01

Pos logit contributions
allery+0.403
tnc+0.396
 Klu+0.371
erd+0.336
oidal+0.332
Neg logit contributions
 Rabbit-0.282
Pan-0.265
 2048-0.263
-0.250
pain-0.246
Token Bas
Token ID6455
Feature activation-8.92e-01

Pos logit contributions
allery+0.995
tnc+0.973
 Klu+0.915
erd+0.835
oidal+0.824
Neg logit contributions
 Rabbit-0.689
Pan-0.646
 2048-0.639
-0.607
pain-0.599
Tokenis
Token ID271
Feature activation-2.02e-01

Pos logit contributions
allery+1.875
tnc+1.820
 Klu+1.670
erd+1.478
oidal+1.457
Neg logit contributions
 Rabbit-2.237
Pan-2.126
 2048-2.111
-2.027
pain-2.006
Token:
Token ID25
Feature activation+7.27e-02

Pos logit contributions
allery+0.579
tnc+0.566
 Klu+0.532
erd+0.487
oidal+0.484
Neg logit contributions
 Rabbit-0.353
Pan-0.329
 2048-0.323
-0.307
pain-0.302
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
 Rabbit+0.132
Pan+0.124
 2048+0.122
+0.115
pain+0.114
Neg logit contributions
allery-0.203
tnc-0.198
 Klu-0.186
erd-0.170
oidal-0.169
Token
Token ID198
Feature activation+1.12e-01

Pos logit contributions
 Rabbit+0.198
Pan+0.185
 2048+0.182
+0.175
pain+0.171
Neg logit contributions
allery-0.285
tnc-0.281
 Klu-0.263
erd-0.240
oidal-0.238
TokenThere
Token ID1858
Feature activation+1.96e-01

Pos logit contributions
 Rabbit+0.210
Pan+0.197
 2048+0.194
+0.184
pain+0.183
Neg logit contributions
allery-0.304
tnc-0.297
 Klu-0.278
erd-0.254
oidal-0.251
Token.
Token ID13
Feature activation+7.13e-02

Pos logit contributions
allery+0.306
tnc+0.298
 Klu+0.280
erd+0.254
oidal+0.252
Neg logit contributions
 Rabbit-0.222
Pan-0.208
 2048-0.205
-0.195
pain-0.193
Token It
Token ID632
Feature activation+2.97e-01

Pos logit contributions
 Rabbit+0.121
Pan+0.113
 2048+0.111
+0.105
pain+0.104
Neg logit contributions
allery-0.207
tnc-0.202
 Klu-0.191
erd-0.175
oidal-0.173
Token was
Token ID373
Feature activation+1.91e-01

Pos logit contributions
 Rabbit+0.565
Pan+0.530
 2048+0.522
+0.495
pain+0.492
Neg logit contributions
allery-0.807
tnc-0.788
 Klu-0.736
erd-0.671
oidal-0.665
Token the
Token ID262
Feature activation+2.28e-01

Pos logit contributions
 Rabbit+0.359
Pan+0.336
 2048+0.332
+0.314
pain+0.311
Neg logit contributions
allery-0.525
tnc-0.512
 Klu-0.479
erd-0.437
oidal-0.433
Token exact
Token ID2748
Feature activation-9.78e-01

Pos logit contributions
 Rabbit+0.432
Pan+0.404
 2048+0.399
+0.377
pain+0.375
Neg logit contributions
allery-0.620
tnc-0.605
 Klu-0.564
erd-0.516
oidal-0.511
Token opposite
Token ID6697
Feature activation-8.58e-01

Pos logit contributions
allery+2.150
tnc+2.077
 Klu+1.957
erd+1.732
oidal+1.714
Neg logit contributions
 Rabbit-2.316
Pan-2.217
 2048-2.169
-2.107
---------2.074
Token of
Token ID286
Feature activation-3.07e-01

Pos logit contributions
allery+2.223
tnc+2.162
 Klu+2.046
erd+1.838
oidal+1.828
Neg logit contributions
 Rabbit-1.693
Pan-1.607
 2048-1.571
-1.505
pain-1.485
Token The
Token ID383
Feature activation+1.05e-01

Pos logit contributions
allery+0.785
tnc+0.762
 Klu+0.720
erd+0.646
oidal+0.640
Neg logit contributions
 Rabbit-0.627
Pan-0.594
 2048-0.584
-0.559
pain-0.553
Token Pur
Token ID9330
Feature activation+2.90e-02

Pos logit contributions
 Rabbit+0.160
Pan+0.147
 2048+0.144
+0.134
pain+0.133
Neg logit contributions
allery-0.325
tnc-0.318
 Klu-0.299
erd-0.277
oidal-0.274
Tokenge
Token ID469
Feature activation+3.66e-01

Pos logit contributions
 Rabbit+0.053
Pan+0.049
 2048+0.048
+0.046
 Panama+0.045
Neg logit contributions
allery-0.081
tnc-0.079
 Klu-0.074
erd-0.068
oidal-0.067
Token.
Token ID13
Feature activation+9.87e-02

Pos logit contributions
 Rabbit+0.688
Pan+0.644
 2048+0.635
+0.601
pain+0.596
Neg logit contributions
allery-1.003
tnc-0.978
 Klu-0.916
erd-0.836
oidal-0.828
Token Elaine
Token ID41463
Feature activation+1.48e-01

Pos logit contributions
allery+2.221
tnc+2.156
 Klu+2.053
erd+1.877
oidal+1.860
Neg logit contributions
 Rabbit-1.302
Pan-1.218
 2048-1.209
-1.138
pain-1.126
Token Qu
Token ID2264
Feature activation-4.10e-03

Pos logit contributions
 Rabbit+0.213
Pan+0.193
 2048+0.189
+0.176
pain+0.173
Neg logit contributions
allery-0.475
tnc-0.465
 Klu-0.436
erd-0.407
oidal-0.401
Tokenij
Token ID2926
Feature activation-1.34e-01

Pos logit contributions
allery+0.007
tnc+0.007
 Klu+0.006
erd+0.005
oidal+0.005
Neg logit contributions
 Rabbit-0.012
Pan-0.012
 2048-0.012
-0.011
pain-0.011
Tokenano
Token ID5733
Feature activation-6.35e-02

Pos logit contributions
allery+0.295
tnc+0.286
 Klu+0.263
erd+0.234
oidal+0.231
Neg logit contributions
 Rabbit-0.322
Pan-0.307
 2048-0.303
-0.291
pain-0.290
Token of
Token ID286
Feature activation-1.78e-02

Pos logit contributions
allery+0.170
tnc+0.165
 Klu+0.155
erd+0.141
oidal+0.140
Neg logit contributions
 Rabbit-0.123
Pan-0.115
 2048-0.114
-0.108
pain-0.107
Token CBS
Token ID11133
Feature activation-8.02e-01

Pos logit contributions
allery+0.053
tnc+0.052
 Klu+0.050
erd+0.046
oidal+0.045
Neg logit contributions
 Rabbit-0.028
Pan-0.026
 2048-0.026
-0.024
---------0.024
Token let
Token ID1309
Feature activation-3.53e-01

Pos logit contributions
allery+2.231
tnc+2.179
 Klu+2.076
erd+1.909
oidal+1.862
Neg logit contributions
 Rabbit-1.419
Pan-1.326
 2048-1.320
-1.250
pain-1.230
Token Pence
Token ID17339
Feature activation-1.37e-01

Pos logit contributions
allery+0.960
tnc+0.936
 Klu+0.878
erd+0.800
oidal+0.792
Neg logit contributions
 Rabbit-0.669
Pan-0.628
 2048-0.619
-0.587
pain-0.582
Token get
Token ID651
Feature activation+1.29e-02

Pos logit contributions
allery+0.336
tnc+0.327
 Klu+0.300
erd+0.271
oidal+0.268
Neg logit contributions
 Rabbit-0.297
Pan-0.280
 2048-0.277
-0.264
pain-0.262
Token away
Token ID1497
Feature activation+9.09e-01

Pos logit contributions
 Rabbit+0.025
Pan+0.024
 2048+0.024
+0.022
pain+0.022
Neg logit contributions
allery-0.034
tnc-0.033
 Klu-0.031
erd-0.028
oidal-0.028
Token with
Token ID351
Feature activation-8.16e-02

Pos logit contributions
 Rabbit+1.724
Pan+1.607
 2048+1.590
+1.506
pain+1.490
Neg logit contributions
allery-2.475
tnc-2.425
 Klu-2.257
erd-2.054
oidal-2.027
Token pretty
Token ID2495
Feature activation-8.94e-02

Pos logit contributions
 Rabbit+0.181
Pan+0.172
 2048+0.169
+0.162
pain+0.160
Neg logit contributions
allery-0.233
tnc-0.228
 Klu-0.214
erd-0.193
oidal-0.191
Token good
Token ID922
Feature activation+2.99e-01

Pos logit contributions
allery+0.226
tnc+0.220
 Klu+0.206
erd+0.186
oidal+0.184
Neg logit contributions
 Rabbit-0.186
Pan-0.176
 2048-0.173
-0.165
pain-0.164
Token idea
Token ID2126
Feature activation-3.33e-01

Pos logit contributions
 Rabbit+0.649
Pan+0.615
 2048+0.606
+0.580
pain+0.575
Neg logit contributions
allery-0.731
tnc-0.710
 Klu-0.662
erd-0.596
oidal-0.588
Token that
Token ID326
Feature activation-4.30e-02

Pos logit contributions
allery+0.869
tnc+0.845
 Klu+0.794
erd+0.719
oidal+0.711
Neg logit contributions
 Rabbit-0.666
Pan-0.629
 2048-0.618
-0.591
pain-0.585
Token the
Token ID262
Feature activation-7.40e-02

Pos logit contributions
allery+0.114
tnc+0.111
 Klu+0.103
erd+0.094
oidal+0.093
Neg logit contributions
 Rabbit-0.085
Pan-0.080
 2048-0.079
-0.075
pain-0.074
Token party
Token ID2151
Feature activation-8.35e-01

Pos logit contributions
allery+0.195
tnc+0.190
 Klu+0.179
erd+0.162
oidal+0.160
Neg logit contributions
 Rabbit-0.145
Pan-0.137
 2048-0.135
-0.129
pain-0.127
Token wanted
Token ID2227
Feature activation-9.35e-02

Pos logit contributions
allery+2.116
tnc+2.065
 Klu+1.944
erd+1.759
oidal+1.732
Neg logit contributions
 Rabbit-1.694
Pan-1.606
 2048-1.579
-1.520
pain-1.495
Token Rachel
Token ID15984
Feature activation-1.13e-01

Pos logit contributions
allery+0.252
tnc+0.245
 Klu+0.229
erd+0.209
oidal+0.207
Neg logit contributions
 Rabbit-0.180
Pan-0.169
 2048-0.167
-0.158
pain-0.157
Token,
Token ID11
Feature activation-8.23e-02

Pos logit contributions
allery+0.359
tnc+0.350
 Klu+0.333
erd+0.308
oidal+0.305
Neg logit contributions
 Rabbit-0.163
Pan-0.151
 2048-0.148
-0.138
pain-0.135
Token
Token ID447
Feature activation+9.81e-02

Pos logit contributions
allery+0.227
tnc+0.221
 Klu+0.208
erd+0.190
oidal+0.188
Neg logit contributions
 Rabbit-0.152
Pan-0.143
 2048-0.140
-0.134
pain-0.132
Token
Token ID251
Feature activation+1.02e-02

Pos logit contributions
 Rabbit+0.121
Pan+0.110
 2048+0.106
pain+0.097
 Panama+0.095
Neg logit contributions
allery-0.330
tnc-0.324
 Klu-0.309
erd-0.287
oidal-0.284