TOP ACTIVATIONS
MAX = 36.194

x . x . x . 138 11 30 277 x
x . x . x . 158 10 95 228 x
x . x . x . 139 10 64 659 x
x . x . x . 156 10 23 688 x
x . x . x . 148 10 42 170 x
historic " occasion , P uro hit requested the Prime Minister
7 { ( 8 s )} 8 . c 4 {
beat or other . 12 . Donkey Kong Country
City Council Point of Sale Plan Last
x . x . x . 140 10 59 272 x
this was NASA s seventh consecutive lunar mission failure
2006 18 9 2007 6 0 2008 15 4 2009 10
' " I 'll Feel a Whole Lot Better " (
proposed a point of sale plan where
2008 Steve Care ll 2008 Lee Pace
io Val er io Borg he se . Borg he se
in London and the U . K ., we will be
x . x . x . 157 108 67 90 x
rights and privacy of millions of Americans , Mer
Los Angeles . Exactly 50 years later December 17 ,

MIN ACTIVATIONS
MIN = -35.329

we ll send you three meaningful actions you can
. is the syntax just as rich , can you have
Angular JS . It 's a marvelous framework for modern web
troops fled to the town when M 23 rebels briefly captured
Jill ian , we couldn 't resist the temptation to recall
( gray , ( 3 , 3 ) , 0 )
to the Min ova rapes if it did not try the
flavours are the only projects that offer built - in support
/* vim < 570 : set sw = 4 : */
1991 Craig T . Nelson 1992 Craig T .
s volatile east in November after Cong ol ese army troops
types : with hardware virtual ization for CPU , memory ,
i ", " -- images ", required = True , help
environmental conditions from snow to mud , sand and more
7 8 void vert ( in out app data _ full
leaving some of the most amazing views for the end of
carefully about how they will extract all that valuable gold and
UN - backed government last year pushed them out of the
you re in the fast ed state your body
UK IP party in last month 's European elections .

INTERVAL 18.313 - 36.194
CONTAINS 0.214%

said Dr Dum us que , who is the
arrested both Tav are and Yad av . A case was
launch system . It has an expected service life of 90
awaits . R ANK 6 Jal en Ramsey ,
the failure of the Ranger 6 camera package was released on

INTERVAL 0.432 - 18.313
CONTAINS 46.104%

12 ) When Dancing the Quad r ille , Don
is tight , maybe 5 to 6 times . Look at
, but that relationship had only just recently reached the level
reporters outside court . He should not
when it comes to your Even though the president didn 't

INTERVAL -17.448 - 0.432
CONTAINS 53.444%

the wings ... it 's on the takeoff checklist ! Just
retaining workers that are lured out of province by even better
without jeopard izing the launch . But you have saved my
use the menus , however . Four years ago
the analysis page . It 's even possible to have nested

INTERVAL -35.329 - -17.448
CONTAINS 0.238%

image though if you stick your face close enough to your
passes with heavy loads , through severe weather , and ever
external process er that will convert the brain signals into synthes
hard to get to except by water , the Sund arb
aka snaps ). These two new methods of packaging software have
Token x
Token ID2124
Feature activation+3.42e-02

Pos logit contributions
Reward+3.154
doc+3.128
 rewarded+2.974
ellen+2.945
llan+2.877
Neg logit contributions
gencies-5.359
rang-5.069
ía-4.884
oche-4.876
acter-4.765
Token.
Token ID13
Feature activation-2.47e-03

Pos logit contributions
gencies+0.025
rang+0.024
ía+0.023
oche+0.023
mun+0.022
Neg logit contributions
Reward-0.028
doc-0.027
 rewarded-0.027
ellen-0.026
llan-0.025
Tokenx
Token ID87
Feature activation+9.42e-01

Pos logit contributions
Reward+0.002
doc+0.002
 rewarded+0.002
ellen+0.002
llan+0.002
Neg logit contributions
gencies-0.002
rang-0.002
ía-0.002
oche-0.002
mun-0.002
Token.
Token ID13
Feature activation+1.74e+01

Pos logit contributions
gencies+0.662
rang+0.643
ía+0.619
oche+0.607
mun+0.582
Neg logit contributions
Reward-0.778
 rewarded-0.750
doc-0.744
ellen-0.734
 recru-0.717
Tokenx
Token ID87
Feature activation+2.59e+01

Pos logit contributions
ía+9.213
rang+9.192
mun+8.545
its+8.314
itar+8.256
Neg logit contributions
 rewarded-14.945
VIDIA-14.381
 recru-14.280
Reward-14.188
 notor-13.855
Token.
Token ID13
Feature activation+3.62e+01

Pos logit contributions
rang+8.491
ía+8.422
mun+8.208
its+7.669
itar+7.565
Neg logit contributions
VIDIA-23.878
 notor-23.649
 confir-22.976
-22.950
 rewarded-22.858
Token138
Token ID20107
Feature activation+4.64e+00

Pos logit contributions
mun+10.019
ía+9.318
rang+9.069
its+8.953
mortem+8.900
Neg logit contributions
VIDIA-29.293
 rewarded-28.832
ellen-28.344
 uncond-28.265
 convol-27.913
Token 11
Token ID1367
Feature activation-2.84e+00

Pos logit contributions
gencies+3.113
rang+3.077
ía+2.947
oche+2.905
mun+2.790
Neg logit contributions
Reward-3.842
 rewarded-3.682
doc-3.649
ellen-3.606
 recru-3.559
Token30
Token ID1270
Feature activation-8.30e+00

Pos logit contributions
Reward+1.860
doc+1.798
 rewarded+1.751
ellen+1.715
llan+1.675
Neg logit contributions
gencies-2.515
rang-2.416
ía-2.341
oche-2.312
acter-2.246
Token277
Token ID27019
Feature activation-7.63e+00

Pos logit contributions
Reward+5.565
doc+5.432
 rewarded+5.219
ellen+5.210
llan+5.027
Neg logit contributions
gencies-6.790
rang-6.543
ía-6.333
oche-6.203
acter-6.009
Token x
Token ID2124
Feature activation+2.76e+00

Pos logit contributions
doc+4.041
Reward+3.985
 rewarded+3.792
ellen+3.731
llan+3.641
Neg logit contributions
gencies-7.571
rang-7.105
oche-6.859
ía-6.805
ateurs-6.723
Token x
Token ID2124
Feature activation+3.76e+00

Pos logit contributions
doc+4.529
Reward+4.431
 rewarded+4.236
ellen+4.118
llan+4.104
Neg logit contributions
gencies-9.173
rang-8.648
oche-8.342
ía-8.256
ateurs-8.192
Token.
Token ID13
Feature activation+3.86e+00

Pos logit contributions
gencies+2.380
rang+2.370
ía+2.272
oche+2.203
mun+2.127
Neg logit contributions
Reward-3.266
 rewarded-3.190
ellen-3.097
 recru-3.093
doc-3.080
Tokenx
Token ID87
Feature activation+3.84e+00

Pos logit contributions
gencies+2.861
rang+2.845
ía+2.751
oche+2.677
mun+2.601
Neg logit contributions
Reward-2.972
 rewarded-2.919
 recru-2.790
ellen-2.761
doc-2.730
Token.
Token ID13
Feature activation+5.36e+00

Pos logit contributions
gencies+2.429
rang+2.412
ía+2.324
oche+2.254
mun+2.168
Neg logit contributions
Reward-3.338
 rewarded-3.262
ellen-3.158
 recru-3.156
doc-3.149
Tokenx
Token ID87
Feature activation+1.83e+01

Pos logit contributions
rang+3.799
gencies+3.784
ía+3.691
oche+3.564
mun+3.482
Neg logit contributions
Reward-4.234
 rewarded-4.213
 recru-4.048
ellen-3.928
 trooper-3.909
Token.
Token ID13
Feature activation+3.38e+01

Pos logit contributions
rang+6.809
ía+6.362
mun+5.899
itar+5.789
its+5.703
Neg logit contributions
VIDIA-18.137
 notor-17.930
 recru-17.620
 rewarded-17.618
 mathemat-17.423
Token158
Token ID21273
Feature activation+6.09e+00

Pos logit contributions
rang+8.958
spect+8.874
mun+8.734
mortem+8.664
its+8.537
Neg logit contributions
 rewarded-27.986
VIDIA-27.421
 rall-26.737
 helicop-26.611
 recru-26.583
Token 10
Token ID838
Feature activation-9.95e-03

Pos logit contributions
gencies+3.928
rang+3.917
ía+3.773
oche+3.703
mun+3.568
Neg logit contributions
Reward-5.092
 rewarded-4.849
doc-4.789
ellen-4.771
 recru-4.722
Token95
Token ID3865
Feature activation-3.94e+00

Pos logit contributions
Reward+0.007
doc+0.006
 rewarded+0.006
ellen+0.006
llan+0.006
Neg logit contributions
gencies-0.009
rang-0.008
ía-0.008
oche-0.008
mun-0.008
Token228
Token ID23815
Feature activation-7.65e+00

Pos logit contributions
Reward+2.742
doc+2.657
 rewarded+2.590
ellen+2.563
llan+2.494
Neg logit contributions
gencies-3.271
rang-3.137
ía-3.034
oche-2.989
acter-2.887
Token x
Token ID2124
Feature activation+3.13e+00

Pos logit contributions
doc+4.035
Reward+3.951
 rewarded+3.776
ellen+3.704
llan+3.657
Neg logit contributions
gencies-7.599
rang-7.177
oche-6.914
ía-6.876
ateurs-6.769
Token x
Token ID2124
Feature activation+2.76e+00

Pos logit contributions
doc+4.041
Reward+3.985
 rewarded+3.792
ellen+3.731
llan+3.641
Neg logit contributions
gencies-7.571
rang-7.105
oche-6.859
ía-6.805
ateurs-6.723
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
gencies+1.816
rang+1.792
ía+1.720
oche+1.675
mun+1.613
Neg logit contributions
Reward-2.354
 rewarded-2.288
doc-2.229
ellen-2.225
 recru-2.206
Tokenx
Token ID87
Feature activation+2.04e+00

Pos logit contributions
gencies+1.249
rang+1.220
ía+1.181
oche+1.158
mun+1.120
Neg logit contributions
Reward-1.159
 rewarded-1.118
doc-1.087
ellen-1.078
 recru-1.060
Token.
Token ID13
Feature activation+3.80e+00

Pos logit contributions
gencies+1.382
rang+1.352
ía+1.303
oche+1.272
mun+1.222
Neg logit contributions
Reward-1.725
 rewarded-1.671
doc-1.639
ellen-1.627
 recru-1.605
Tokenx
Token ID87
Feature activation+1.33e+01

Pos logit contributions
gencies+2.822
rang+2.801
ía+2.723
oche+2.636
mun+2.574
Neg logit contributions
Reward-2.915
 rewarded-2.862
 recru-2.731
ellen-2.709
doc-2.695
Token.
Token ID13
Feature activation+3.15e+01

Pos logit contributions
rang+6.063
ía+5.836
mun+5.425
gencies+5.419
itar+5.362
Neg logit contributions
 rewarded-12.542
 recru-12.395
Reward-12.369
VIDIA-12.319
ellen-11.998
Token139
Token ID20219
Feature activation+5.48e+00

Pos logit contributions
rang+8.894
mun+8.513
its+8.342
ía+8.293
mortem+7.821
Neg logit contributions
 rewarded-26.168
ellen-26.070
 Courier-25.425
VIDIA-25.386
llan-25.081
Token 10
Token ID838
Feature activation-9.58e-01

Pos logit contributions
gencies+3.584
rang+3.541
ía+3.393
oche+3.346
mun+3.230
Neg logit contributions
Reward-4.582
 rewarded-4.390
doc-4.340
ellen-4.309
 recru-4.273
Token64
Token ID2414
Feature activation-4.64e+00

Pos logit contributions
Reward+0.634
doc+0.605
 rewarded+0.599
ellen+0.587
llan+0.571
Neg logit contributions
gencies-0.840
rang-0.812
ía-0.787
oche-0.776
acter-0.750
Token659
Token ID36445
Feature activation-6.70e+00

Pos logit contributions
Reward+3.233
doc+3.117
 rewarded+3.049
ellen+3.021
llan+2.928
Neg logit contributions
gencies-3.845
rang-3.693
ía-3.582
oche-3.511
acter-3.408
Token x
Token ID2124
Feature activation+1.19e+00

Pos logit contributions
Reward+3.677
doc+3.671
 rewarded+3.490
ellen+3.436
llan+3.346
Neg logit contributions
gencies-6.519
rang-6.141
ía-5.933
oche-5.928
acter-5.782
Token x
Token ID2124
Feature activation+2.04e+00

Pos logit contributions
doc+3.665
Reward+3.650
 rewarded+3.499
ellen+3.431
llan+3.366
Neg logit contributions
gencies-6.593
rang-6.229
ía-6.001
oche-5.983
ateurs-5.867
Token.
Token ID13
Feature activation+2.24e+00

Pos logit contributions
gencies+1.382
rang+1.354
ía+1.301
oche+1.271
mun+1.222
Neg logit contributions
Reward-1.724
 rewarded-1.669
doc-1.638
ellen-1.628
 recru-1.605
Tokenx
Token ID87
Feature activation+5.79e+00

Pos logit contributions
gencies+1.742
rang+1.711
ía+1.654
oche+1.620
mun+1.569
Neg logit contributions
Reward-1.667
 rewarded-1.619
doc-1.554
ellen-1.550
 recru-1.540
Token.
Token ID13
Feature activation+1.48e+01

Pos logit contributions
rang+3.431
gencies+3.379
ía+3.297
oche+3.164
mun+3.076
Neg logit contributions
Reward-5.177
 rewarded-5.112
 recru-4.984
ellen-4.891
doc-4.839
Tokenx
Token ID87
Feature activation+2.60e+01

Pos logit contributions
rang+8.392
ía+8.004
gencies+7.554
mun+7.491
oche+7.388
Neg logit contributions
 rewarded-12.899
Reward-12.532
 recru-12.457
 trooper-12.172
VIDIA-11.986
Token.
Token ID13
Feature activation+3.12e+01

Pos logit contributions
rang+9.996
its+8.723
mun+8.691
ía+8.397
sov+7.782
Neg logit contributions
VIDIA-23.056
 reluct-22.773
 opio-22.745
 notor-22.675
Reward-22.424
Token156
Token ID21599
Feature activation+4.97e+00

Pos logit contributions
rang+9.530
mun+8.684
its+8.052
mortem+7.738
ía+7.471
Neg logit contributions
Reward-26.782
 rewarded-26.504
VIDIA-26.434
 trooper-25.781
 recru-25.775
Token 10
Token ID838
Feature activation-9.28e-01

Pos logit contributions
gencies+3.344
rang+3.316
ía+3.194
oche+3.141
mun+3.029
Neg logit contributions
Reward-4.085
 rewarded-3.900
doc-3.841
ellen-3.812
 recru-3.766
Token23
Token ID1954
Feature activation-8.39e+00

Pos logit contributions
Reward+0.611
doc+0.583
 rewarded+0.577
ellen+0.566
llan+0.551
Neg logit contributions
gencies-0.816
rang-0.788
ía-0.765
oche-0.754
acter-0.729
Token688
Token ID34427
Feature activation-6.48e+00

Pos logit contributions
Reward+5.619
doc+5.504
ellen+5.283
 rewarded+5.275
llan+5.091
Neg logit contributions
gencies-6.842
rang-6.539
ía-6.348
oche-6.223
acter-6.052
Token x
Token ID2124
Feature activation+1.52e+00

Pos logit contributions
doc+3.512
Reward+3.502
 rewarded+3.326
ellen+3.289
llan+3.199
Neg logit contributions
gencies-6.371
rang-6.024
ía-5.795
oche-5.757
ateurs-5.658
Token x
Token ID2124
Feature activation+1.30e+00

Pos logit contributions
Reward+2.496
doc+2.447
 rewarded+2.380
ellen+2.334
llan+2.277
Neg logit contributions
gencies-3.950
rang-3.766
ía-3.622
oche-3.611
mun-3.498
Token.
Token ID13
Feature activation+2.09e+00

Pos logit contributions
gencies+0.899
rang+0.875
ía+0.842
oche+0.824
mun+0.791
Neg logit contributions
Reward-1.078
 rewarded-1.040
doc-1.029
ellen-1.017
 recru-0.996
Tokenx
Token ID87
Feature activation+2.99e+00

Pos logit contributions
gencies+1.637
rang+1.606
ía+1.553
oche+1.521
mun+1.472
Neg logit contributions
Reward-1.555
 rewarded-1.508
doc-1.454
ellen-1.448
 recru-1.434
Token.
Token ID13
Feature activation+4.05e+00

Pos logit contributions
gencies+1.959
rang+1.932
ía+1.862
oche+1.809
mun+1.743
Neg logit contributions
Reward-2.567
 rewarded-2.502
doc-2.431
ellen-2.425
 recru-2.413
Tokenx
Token ID87
Feature activation+2.24e+01

Pos logit contributions
gencies+3.007
rang+2.975
ía+2.893
oche+2.800
mun+2.741
Neg logit contributions
Reward-3.101
 rewarded-3.061
 recru-2.926
ellen-2.889
doc-2.883
Token.
Token ID13
Feature activation+3.09e+01

Pos logit contributions
mun+7.872
rang+7.754
acement+7.718
ía+7.488
its+7.482
Neg logit contributions
 rewarded-20.120
 notor-19.949
 trooper-19.880
 recru-19.874
 indo-19.755
Token148
Token ID18294
Feature activation+5.61e+00

Pos logit contributions
mun+7.213
acement+6.510
its+6.376
ther+6.089
718+5.913
Neg logit contributions
 rewarded-28.166
 trooper-28.080
llan-27.638
 indo-27.560
doc-27.303
Token 10
Token ID838
Feature activation-8.57e-01

Pos logit contributions
gencies+3.671
rang+3.653
ía+3.513
oche+3.421
mun+3.327
Neg logit contributions
Reward-4.672
 rewarded-4.477
doc-4.412
ellen-4.378
 recru-4.346
Token42
Token ID3682
Feature activation-6.49e+00

Pos logit contributions
Reward+0.571
doc+0.545
 rewarded+0.540
ellen+0.529
llan+0.515
Neg logit contributions
gencies-0.747
rang-0.722
ía-0.700
oche-0.690
acter-0.667
Token170
Token ID17279
Feature activation-9.49e+00

Pos logit contributions
Reward+4.368
doc+4.274
 rewarded+4.147
ellen+4.073
llan+3.971
Neg logit contributions
gencies-5.425
rang-5.211
ía-5.025
oche-4.999
acter-4.786
Token x
Token ID2124
Feature activation+1.30e+00

Pos logit contributions
doc+4.661
Reward+4.540
 rewarded+4.361
ellen+4.245
llan+4.169
Neg logit contributions
gencies-9.622
rang-9.036
oche-8.752
ía-8.677
ateurs-8.640
Tokenhistoric
Token ID31304
Feature activation+2.50e+00

Pos logit contributions
gencies+5.268
rang+5.151
ía+4.802
mun+4.764
oche+4.711
Neg logit contributions
Reward-5.172
 rewarded-5.142
ellen-4.957
doc-4.876
 recru-4.864
Token"
Token ID1
Feature activation+7.92e-01

Pos logit contributions
gencies+2.043
rang+1.980
ía+1.917
oche+1.890
mun+1.821
Neg logit contributions
Reward-1.781
doc-1.700
 rewarded-1.693
ellen-1.673
llan-1.623
Token occasion
Token ID6695
Feature activation-1.66e+01

Pos logit contributions
gencies+0.656
rang+0.635
ía+0.615
oche+0.606
mun+0.584
Neg logit contributions
Reward-0.558
doc-0.532
 rewarded-0.530
ellen-0.521
llan-0.507
Token,
Token ID11
Feature activation-4.62e+00

Pos logit contributions
 rewarded+9.956
Reward+9.634
doc+9.364
llan+8.989
ellen+8.902
Neg logit contributions
gencies-12.816
rang-12.093
oche-12.043
ía-11.935
acement-11.627
Token P
Token ID350
Feature activation-5.55e+00

Pos logit contributions
Reward+3.463
 rewarded+3.345
doc+3.326
ellen+3.209
llan+3.163
Neg logit contributions
gencies-3.536
rang-3.415
ía-3.355
oche-3.266
acement-3.142
Tokenuro
Token ID1434
Feature activation+3.08e+01

Pos logit contributions
Reward+4.661
doc+4.438
 rewarded+4.392
ellen+4.348
 recru+4.218
Neg logit contributions
gencies-3.826
rang-3.569
ía-3.488
acter-3.308
oche-3.289
Tokenhit
Token ID17945
Feature activation+3.24e+00

Pos logit contributions
rang+12.937
hit+11.785
its+11.479
harm+11.283
hot+11.048
Neg logit contributions
Reward-23.703
 notor-23.594
VIDIA-23.515
 trooper-23.164
 recru-22.732
Token requested
Token ID9167
Feature activation-9.56e+00

Pos logit contributions
gencies+2.779
rang+2.724
ía+2.644
oche+2.596
mun+2.523
Neg logit contributions
Reward-2.142
doc-2.044
ellen-1.986
 rewarded-1.965
llan-1.921
Token the
Token ID262
Feature activation-9.04e+00

Pos logit contributions
Reward+6.774
 rewarded+6.569
doc+6.458
ellen+6.199
 recru+6.090
Neg logit contributions
gencies-7.280
ía-6.999
rang-6.926
oche-6.800
acement-6.568
Token Prime
Token ID5537
Feature activation-1.40e+01

Pos logit contributions
Reward+6.086
 rewarded+5.878
doc+5.806
 recru+5.600
ellen+5.522
Neg logit contributions
gencies-7.191
rang-6.978
ía-6.928
oche-6.742
acement-6.565
Token Minister
Token ID4139
Feature activation-8.50e+00

Pos logit contributions
Reward+5.557
doc+5.454
llan+5.054
 rewarded+4.986
ellen+4.617
Neg logit contributions
gencies-14.625
rang-13.557
oche-13.550
ateurs-13.462
iferation-13.378
Token7
Token ID22
Feature activation+1.69e+01

Pos logit contributions
gencies+4.445
rang+4.362
ía+4.117
oche+4.111
mun+3.859
Neg logit contributions
Reward-6.931
doc-6.597
 recru-6.574
 rewarded-6.572
 trooper-6.506
Token {
Token ID1391
Feature activation+9.95e+00

Pos logit contributions
gencies+8.568
rang+7.978
ía+7.944
oche+7.534
 Symphony+7.202
Neg logit contributions
Reward-14.293
ellen-13.827
doc-13.792
 recru-13.559
 rewarded-13.516
Token(
Token ID7
Feature activation+4.11e+00

Pos logit contributions
rang+4.743
gencies+4.480
ía+4.364
mun+4.211
oche+4.196
Neg logit contributions
 recru-9.620
 rewarded-9.592
 trooper-9.224
Reward-9.212
ellen-9.197
Token8
Token ID23
Feature activation+1.66e+01

Pos logit contributions
gencies+3.183
rang+3.119
ía+2.995
oche+2.951
mun+2.867
Neg logit contributions
Reward-3.002
 rewarded-2.893
doc-2.847
ellen-2.831
 recru-2.788
Tokens
Token ID82
Feature activation-8.48e-01

Pos logit contributions
gencies+7.803
rang+7.270
ía+7.122
oche+7.090
mun+6.982
Neg logit contributions
Reward-14.584
ellen-14.008
 rewarded-13.697
doc-13.600
 recru-13.577
Token)}
Token ID38165
Feature activation+3.08e+01

Pos logit contributions
Reward+0.588
doc+0.561
 rewarded+0.557
ellen+0.546
llan+0.535
Neg logit contributions
gencies-0.715
rang-0.690
ía-0.668
oche-0.659
acter-0.635
Token 8
Token ID807
Feature activation-3.15e+00

Pos logit contributions
gencies+15.170
rang+13.572
ía+13.512
 Inspection+13.327
 Symphony+13.266
Neg logit contributions
ellen-18.938
Reward-18.661
 recru-18.236
doc-18.164
 rewarded-17.874
Token.
Token ID13
Feature activation+4.36e+00

Pos logit contributions
Reward+2.375
doc+2.306
 rewarded+2.245
ellen+2.220
llan+2.197
Neg logit contributions
gencies-2.488
rang-2.358
ía-2.267
oche-2.245
acter-2.178
Token c
Token ID269
Feature activation-7.96e-01

Pos logit contributions
gencies+2.843
rang+2.745
ía+2.621
oche+2.576
mun+2.468
Neg logit contributions
Reward-3.775
doc-3.594
ellen-3.586
 rewarded-3.567
llan-3.481
Token4
Token ID19
Feature activation+4.01e+00

Pos logit contributions
Reward+0.546
doc+0.523
 rewarded+0.518
ellen+0.509
llan+0.495
Neg logit contributions
gencies-0.675
rang-0.651
ía-0.632
oche-0.623
mun-0.601
Token {
Token ID1391
Feature activation+6.50e+00

Pos logit contributions
gencies+2.860
rang+2.762
ía+2.658
oche+2.614
mun+2.496
Neg logit contributions
Reward-3.250
doc-3.085
ellen-3.081
 rewarded-3.074
 recru-2.972
Token beat
Token ID4405
Feature activation+2.14e+00

Pos logit contributions
gencies+0.372
rang+0.359
ía+0.346
oche+0.341
mun+0.327
Neg logit contributions
Reward-0.390
doc-0.373
 rewarded-0.372
ellen-0.366
llan-0.357
Token or
Token ID393
Feature activation+5.41e+00

Pos logit contributions
gencies+1.523
rang+1.480
ía+1.423
oche+1.400
mun+1.347
Neg logit contributions
Reward-1.766
doc-1.676
 rewarded-1.669
ellen-1.650
llan-1.610
Token other
Token ID584
Feature activation+3.11e+00

Pos logit contributions
gencies+4.174
rang+4.085
ía+3.906
oche+3.868
acter+3.743
Neg logit contributions
Reward-3.991
doc-3.798
ellen-3.742
 rewarded-3.676
llan-3.610
Token.
Token ID13
Feature activation+4.59e+00

Pos logit contributions
gencies+2.373
rang+2.318
ía+2.222
oche+2.190
acter+2.106
Neg logit contributions
Reward-2.376
doc-2.263
 rewarded-2.228
ellen-2.217
llan-2.154
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
gencies+3.219
rang+3.132
ía+3.021
oche+2.969
mun+2.876
Neg logit contributions
Reward-3.662
 rewarded-3.516
ellen-3.506
doc-3.489
 recru-3.392
Token
Token ID198
Feature activation+3.01e+01

Pos logit contributions
gencies+0.080
rang+0.078
ía+0.075
oche+0.074
mun+0.071
Neg logit contributions
Reward-0.086
doc-0.082
 rewarded-0.082
ellen-0.081
llan-0.079
Token12
Token ID1065
Feature activation+3.69e+00

Pos logit contributions
rang+15.157
ía+14.724
gencies+14.609
mortem+14.283
harm+13.720
Neg logit contributions
 rewarded-18.176
 recru-17.965
ellen-17.770
xual-17.102
 prisoner-17.017
Token.
Token ID13
Feature activation+1.12e+01

Pos logit contributions
gencies+2.353
rang+2.337
ía+2.263
oche+2.177
mun+2.102
Neg logit contributions
Reward-3.182
 rewarded-3.107
doc-3.028
ellen-3.024
 recru-3.008
Token Donkey
Token ID43823
Feature activation-4.23e+00

Pos logit contributions
rang+7.833
gencies+7.808
ía+7.631
 Symphony+7.596
 Sanct+7.531
Neg logit contributions
Reward-8.031
ellen-7.790
 rewarded-7.666
doc-7.535
 recru-7.444
Token Kong
Token ID9071
Feature activation+4.90e-01

Pos logit contributions
Reward+3.644
doc+3.560
 rewarded+3.530
ellen+3.443
llan+3.420
Neg logit contributions
gencies-2.776
rang-2.685
ía-2.505
oche-2.480
acter-2.398
Token Country
Token ID12946
Feature activation+5.40e+00

Pos logit contributions
gencies+0.308
rang+0.295
ía+0.283
oche+0.278
mun+0.264
Neg logit contributions
Reward-0.443
doc-0.426
 rewarded-0.426
ellen-0.419
llan-0.410
Token
Token ID198
Feature activation+5.55e+00

Pos logit contributions
Reward+1.498
doc+1.446
 rewarded+1.411
ellen+1.393
llan+1.373
Neg logit contributions
gencies-1.576
rang-1.502
ía-1.447
oche-1.431
acter-1.383
TokenCity
Token ID14941
Feature activation+9.90e+00

Pos logit contributions
gencies+3.642
rang+3.557
ía+3.396
oche+3.335
mun+3.327
Neg logit contributions
 rewarded-4.607
Reward-4.553
 recru-4.459
ellen-4.416
doc-4.388
Token Council
Token ID4281
Feature activation+1.73e+00

Pos logit contributions
gencies+4.992
rang+4.631
ía+4.481
 Inspection+4.314
oche+4.299
Neg logit contributions
Reward-9.499
 rewarded-9.232
ellen-9.200
doc-9.065
 recru-8.916
Token
Token ID960
Feature activation+1.47e-01

Pos logit contributions
gencies+0.815
rang+0.771
ía+0.729
oche+0.704
mun+0.664
Neg logit contributions
Reward-1.828
 rewarded-1.772
doc-1.770
ellen-1.752
llan-1.718
TokenPoint
Token ID12727
Feature activation-4.98e+00

Pos logit contributions
gencies+0.128
rang+0.124
ía+0.120
oche+0.119
mun+0.115
Neg logit contributions
Reward-0.097
doc-0.092
 rewarded-0.092
ellen-0.090
llan-0.088
Token of
Token ID286
Feature activation+3.01e+01

Pos logit contributions
Reward+3.710
doc+3.587
 rewarded+3.548
ellen+3.481
llan+3.433
Neg logit contributions
gencies-3.803
rang-3.658
ía-3.564
oche-3.523
acter-3.376
Token Sale
Token ID16467
Feature activation+5.01e+00

Pos logit contributions
gencies+14.100
 Inspection+13.746
 Sale+13.159
ía+12.584
rang+12.510
Neg logit contributions
 rewarded-20.195
doc-19.629
 trooper-19.440
llan-19.348
ellen-19.167
Token Plan
Token ID5224
Feature activation-1.62e+00

Pos logit contributions
gencies+3.332
rang+3.237
ía+3.090
oche+2.975
 Inspection+2.951
Neg logit contributions
Reward-4.254
 rewarded-4.069
doc-4.051
ellen-4.044
llan-3.927
Token
Token ID198
Feature activation-1.37e+00

Pos logit contributions
Reward+1.251
doc+1.205
 rewarded+1.200
ellen+1.177
llan+1.150
Neg logit contributions
gencies-1.228
rang-1.175
ía-1.140
oche-1.123
acter-1.078
Token
Token ID198
Feature activation+6.81e+00

Pos logit contributions
Reward+1.043
doc+1.005
 rewarded+0.989
ellen+0.973
llan+0.957
Neg logit contributions
gencies-1.063
rang-1.015
ía-0.979
oche-0.967
acter-0.932
TokenLast
Token ID5956
Feature activation-6.56e+00

Pos logit contributions
rang+4.412
gencies+4.381
ía+4.145
mun+4.083
oche+4.081
Neg logit contributions
 rewarded-5.644
 recru-5.505
Reward-5.471
ellen-5.386
 trooper-5.330
Token x
Token ID2124
Feature activation+1.19e+00

Pos logit contributions
Reward+3.677
doc+3.671
 rewarded+3.490
ellen+3.436
llan+3.346
Neg logit contributions
gencies-6.519
rang-6.141
ía-5.933
oche-5.928
acter-5.782
Token.
Token ID13
Feature activation+6.67e-01

Pos logit contributions
gencies+0.829
rang+0.806
ía+0.776
oche+0.760
mun+0.730
Neg logit contributions
Reward-0.987
 rewarded-0.952
doc-0.942
ellen-0.931
 recru-0.911
Tokenx
Token ID87
Feature activation+2.03e+00

Pos logit contributions
gencies+0.541
rang+0.526
ía+0.509
oche+0.501
mun+0.483
Neg logit contributions
Reward-0.481
 rewarded-0.461
doc-0.456
ellen-0.448
 recru-0.436
Token.
Token ID13
Feature activation+6.82e+00

Pos logit contributions
gencies+1.374
rang+1.345
ía+1.297
oche+1.265
mun+1.217
Neg logit contributions
Reward-1.712
 rewarded-1.660
doc-1.626
ellen-1.615
 recru-1.593
Tokenx
Token ID87
Feature activation+1.70e+01

Pos logit contributions
rang+4.679
gencies+4.586
ía+4.579
oche+4.361
mun+4.295
Neg logit contributions
 rewarded-5.467
Reward-5.465
 recru-5.244
ellen-5.103
 trooper-5.038
Token.
Token ID13
Feature activation+2.98e+01

Pos logit contributions
rang+6.836
ía+6.780
mun+6.248
itar+6.137
its+5.951
Neg logit contributions
VIDIA-16.185
 rewarded-16.044
 recru-15.908
 notor-15.714
Reward-15.632
Token140
Token ID15187
Feature activation+4.13e+00

Pos logit contributions
mun+9.726
ía+9.536
mortem+9.479
rang+9.350
ther+9.159
Neg logit contributions
VIDIA-24.205
 rewarded-24.115
ellen-23.870
 recru-23.286
 Courier-23.213
Token 10
Token ID838
Feature activation-2.06e+00

Pos logit contributions
gencies+2.817
rang+2.775
ía+2.666
oche+2.624
mun+2.528
Neg logit contributions
Reward-3.404
 rewarded-3.253
doc-3.225
ellen-3.195
 recru-3.144
Token59
Token ID3270
Feature activation-9.07e+00

Pos logit contributions
Reward+1.335
doc+1.278
 rewarded+1.256
ellen+1.231
llan+1.203
Neg logit contributions
gencies-1.835
rang-1.767
ía-1.713
oche-1.688
acter-1.638
Token272
Token ID29807
Feature activation-6.42e+00

Pos logit contributions
Reward+6.092
doc+5.942
ellen+5.713
 rewarded+5.687
llan+5.474
Neg logit contributions
gencies-7.361
rang-7.016
ía-6.810
oche-6.635
acter-6.468
Token x
Token ID2124
Feature activation+1.38e+00

Pos logit contributions
doc+3.511
Reward+3.495
 rewarded+3.343
ellen+3.304
llan+3.196
Neg logit contributions
gencies-6.247
rang-5.893
oche-5.687
ía-5.660
ateurs-5.555
Token this
Token ID428
Feature activation+1.07e+01

Pos logit contributions
gencies+4.601
rang+4.400
ía+4.295
oche+4.198
 Inspection+3.964
Neg logit contributions
Reward-6.565
ellen-6.306
 rewarded-6.254
doc-6.209
 recru-6.044
Token was
Token ID373
Feature activation+4.38e+00

Pos logit contributions
rang+5.931
gencies+5.872
ía+5.678
oche+5.601
mortem+5.274
Neg logit contributions
Reward-9.561
ellen-9.188
doc-9.018
 rewarded-8.731
 recru-8.620
Token NASA
Token ID8884
Feature activation+4.25e+00

Pos logit contributions
gencies+2.673
rang+2.591
ía+2.508
oche+2.449
mun+2.333
Neg logit contributions
Reward-3.942
ellen-3.774
doc-3.771
 rewarded-3.735
llan-3.662
Token
Token ID447
Feature activation-8.87e-01

Pos logit contributions
gencies+3.254
rang+3.156
ía+3.056
oche+3.008
mun+2.892
Neg logit contributions
Reward-3.235
 rewarded-3.060
doc-3.052
ellen-3.016
llan-2.906
Token
Token ID247
Feature activation+1.18e+01

Pos logit contributions
Reward+0.637
doc+0.609
 rewarded+0.606
ellen+0.594
llan+0.580
Neg logit contributions
gencies-0.725
rang-0.700
ía-0.676
oche-0.670
mun-0.642
Tokens
Token ID82
Feature activation+2.97e+01

Pos logit contributions
rang+4.393
ía+4.056
gencies+4.006
mun+3.794
its+3.756
Neg logit contributions
Reward-12.642
 rewarded-12.295
 recru-12.028
ellen-12.003
 trooper-11.899
Token seventh
Token ID14024
Feature activation+2.08e+01

Pos logit contributions
ía+12.764
rang+12.358
oche+12.227
 Hiroshima+11.493
mun+11.432
Neg logit contributions
Reward-20.164
ellen-19.414
doc-18.951
oros-18.035
 rewarded-18.011
Token consecutive
Token ID12785
Feature activation+1.08e+01

Pos logit contributions
oche+8.974
ía+8.901
rang+8.822
gencies+8.211
 Symphony+7.514
Neg logit contributions
Reward-18.627
doc-17.137
ellen-17.084
 rewarded-16.388
 recru-16.284
Token lunar
Token ID25572
Feature activation+1.67e+01

Pos logit contributions
ía+7.544
gencies+7.507
rang+7.351
oche+7.235
mortem+7.074
Neg logit contributions
Reward-8.027
doc-7.439
ellen-7.382
 rewarded-7.300
 recru-6.902
Token mission
Token ID4365
Feature activation+1.29e+01

Pos logit contributions
ía+6.350
rang+5.646
oche+5.456
gencies+5.455
mortem+5.298
Neg logit contributions
Reward-16.263
ellen-16.027
 rewarded-15.630
doc-15.368
 recru-15.061
Token failure
Token ID5287
Feature activation+2.20e+00

Pos logit contributions
ía+4.620
rang+4.545
gencies+4.525
acement+3.973
oche+3.964
Neg logit contributions
Reward-13.666
ellen-13.397
doc-13.052
 rewarded-12.775
llan-12.638
Token 2006
Token ID4793
Feature activation+1.03e+01

Pos logit contributions
rang+8.571
ía+8.513
gencies+8.150
oche+7.786
 Sach+7.688
Neg logit contributions
Reward-13.758
 trooper-13.473
 recru-13.269
ellen-13.237
 rewarded-13.229
Token 18
Token ID1248
Feature activation+1.14e+01

Pos logit contributions
rang+6.055
ía+5.909
gencies+5.877
 Symphony+5.519
 Inspection+5.475
Neg logit contributions
Reward-8.675
ellen-8.390
 rewarded-8.260
doc-8.170
 recru-8.084
Token 9
Token ID860
Feature activation+2.28e+01

Pos logit contributions
rang+7.088
gencies+6.926
ía+6.853
 Symphony+6.521
 Inspection+6.491
Neg logit contributions
Reward-9.036
ellen-8.593
 rewarded-8.556
doc-8.460
 recru-8.363
Token 2007
Token ID4343
Feature activation+7.90e+00

Pos logit contributions
rang+10.218
gencies+9.775
ía+9.661
oche+9.215
 Inspection+9.124
Neg logit contributions
Reward-18.048
ellen-17.747
 trooper-16.983
 rewarded-16.780
 recru-16.556
Token 6
Token ID718
Feature activation+9.69e+00

Pos logit contributions
rang+5.119
gencies+5.059
ía+5.007
oche+4.697
 Symphony+4.642
Neg logit contributions
Reward-6.391
ellen-6.169
 rewarded-6.078
doc-5.999
 recru-5.921
Token 0
Token ID657
Feature activation+2.96e+01

Pos logit contributions
rang+6.123
gencies+6.025
ía+5.908
oche+5.686
 Inspection+5.587
Neg logit contributions
Reward-7.876
 rewarded-7.470
ellen-7.424
doc-7.312
 recru-7.247
Token 2008
Token ID3648
Feature activation+1.19e+01

Pos logit contributions
gencies+11.682
ía+11.560
rang+11.443
acement+10.923
 Inspection+10.858
Neg logit contributions
Reward-21.350
ellen-20.737
 trooper-20.064
 rewarded-20.021
doc-19.653
Token 15
Token ID1315
Feature activation+9.44e+00

Pos logit contributions
rang+6.791
ía+6.655
gencies+6.533
 Symphony+6.278
 Sanct+6.153
Neg logit contributions
Reward-9.904
ellen-9.546
 rewarded-9.374
doc-9.194
 recru-9.181
Token 4
Token ID604
Feature activation+1.60e+01

Pos logit contributions
rang+6.033
gencies+5.932
ía+5.870
oche+5.522
 Symphony+5.491
Neg logit contributions
Reward-7.621
 rewarded-7.223
ellen-7.196
doc-7.133
 recru-7.000
Token 2009
Token ID3717
Feature activation+9.88e+00

Pos logit contributions
gencies+7.977
rang+7.619
ía+7.246
 Inspection+6.870
oche+6.856
Neg logit contributions
Reward-13.584
ellen-13.572
 rewarded-13.300
doc-13.240
llan-13.117
Token 10
Token ID838
Feature activation+1.21e+01

Pos logit contributions
rang+5.818
ía+5.681
gencies+5.661
 Symphony+5.307
oche+5.276
Neg logit contributions
Reward-8.370
ellen-8.117
 rewarded-7.950
doc-7.816
 recru-7.771
Token'
Token ID6
Feature activation+1.24e+01

Pos logit contributions
rang+7.729
gencies+7.601
ía+7.455
oche+7.339
 Symphony+7.334
Neg logit contributions
Reward-9.399
ellen-9.037
doc-8.649
 rewarded-8.569
llan-8.454
Token "
Token ID366
Feature activation+6.26e+00

Pos logit contributions
rang+7.834
gencies+7.749
ía+7.571
 Symphony+7.294
 Sanct+7.130
Neg logit contributions
Reward-9.816
ellen-9.639
doc-9.098
 rewarded-9.038
llan-8.815
TokenI
Token ID40
Feature activation+8.94e+00

Pos logit contributions
gencies+4.363
rang+4.319
ía+4.123
oche+3.986
mun+3.919
Neg logit contributions
 rewarded-4.829
Reward-4.802
ellen-4.694
 recru-4.654
doc-4.643
Token'll
Token ID1183
Feature activation+1.20e+01

Pos logit contributions
rang+6.896
gencies+6.848
ía+6.537
oche+6.438
mun+6.368
Neg logit contributions
Reward-6.269
doc-5.874
 rewarded-5.841
ellen-5.780
 recru-5.536
Token Feel
Token ID18571
Feature activation+1.45e+01

Pos logit contributions
gencies+6.879
rang+6.826
ía+6.433
 Symphony+6.170
oche+6.151
Neg logit contributions
Reward-10.348
 rewarded-9.918
doc-9.776
 recru-9.567
ellen-9.519
Token a
Token ID257
Feature activation+2.93e+01

Pos logit contributions
gencies+6.966
rang+6.916
ía+6.565
oche+6.492
 Symphony+6.273
Neg logit contributions
Reward-12.876
doc-12.268
 rewarded-12.181
ellen-12.140
llan-12.131
Token Whole
Token ID23431
Feature activation+1.81e+01

Pos logit contributions
 Symphony+11.901
rang+10.935
ía+10.722
 Sanct+10.632
 Inspection+10.495
Neg logit contributions
Reward-22.365
 rewarded-21.760
 recru-21.657
ellen-20.911
VIDIA-20.826
Token Lot
Token ID15099
Feature activation+5.03e+00

Pos logit contributions
rang+5.430
ía+4.899
 Inspection+4.839
 Sanct+4.639
 Symphony+4.607
Neg logit contributions
Reward-18.730
 rewarded-18.573
 recru-18.278
VIDIA-18.084
doc-17.944
Token Better
Token ID11625
Feature activation+3.79e+00

Pos logit contributions
gencies+2.981
rang+2.911
ía+2.871
oche+2.774
acter+2.602
Neg logit contributions
Reward-4.570
doc-4.402
 rewarded-4.379
ellen-4.306
llan-4.269
Token"
Token ID1
Feature activation-3.83e-01

Pos logit contributions
gencies+2.474
rang+2.409
ía+2.313
oche+2.267
mun+2.156
Neg logit contributions
Reward-3.301
 rewarded-3.140
doc-3.132
ellen-3.105
 recru-3.028
Token (
Token ID357
Feature activation+2.17e+00

Pos logit contributions
Reward+0.312
doc+0.300
 rewarded+0.300
ellen+0.293
llan+0.287
Neg logit contributions
gencies-0.275
rang-0.264
ía-0.255
oche-0.250
mun-0.240
Token proposed
Token ID5150
Feature activation+9.51e+00

Pos logit contributions
rang+8.441
ía+8.401
gencies+8.318
oche+8.069
mun+8.005
Neg logit contributions
Reward-6.734
doc-6.228
ellen-6.146
 recru-5.871
llan-5.847
Token
Token ID564
Feature activation+1.00e+00

Pos logit contributions
rang+6.321
gencies+6.280
ía+6.015
oche+5.927
acement+5.775
Neg logit contributions
Reward-7.382
doc-7.010
ellen-6.996
 rewarded-6.953
 recru-6.778
Token
Token ID250
Feature activation+6.08e+00

Pos logit contributions
gencies+0.796
rang+0.769
ía+0.747
oche+0.732
acter+0.706
Neg logit contributions
Reward-0.743
doc-0.711
 rewarded-0.709
ellen-0.696
llan-0.678
Tokena
Token ID64
Feature activation+7.76e+00

Pos logit contributions
gencies+5.293
rang+5.249
ía+5.046
mun+4.965
oche+4.926
Neg logit contributions
Reward-3.657
 rewarded-3.612
ellen-3.495
doc-3.483
 recru-3.467
Token point
Token ID966
Feature activation+5.70e+00

Pos logit contributions
gencies+5.642
rang+5.559
ía+5.312
oche+5.218
mun+5.109
Neg logit contributions
Reward-5.768
ellen-5.519
 rewarded-5.499
doc-5.474
 recru-5.237
Token of
Token ID286
Feature activation+2.89e+01

Pos logit contributions
gencies+3.662
rang+3.523
ía+3.364
oche+3.303
acement+3.214
Neg logit contributions
Reward-4.944
doc-4.752
 rewarded-4.699
ellen-4.686
llan-4.563
Token sale
Token ID5466
Feature activation+1.18e+01

Pos logit contributions
 circulation+9.885
 sale+9.785
ía+9.201
acement+9.163
rang+9.159
Neg logit contributions
Reward-23.050
doc-22.804
ellen-22.638
VIDIA-21.621
llan-21.429
Token
Token ID447
Feature activation+3.32e+00

Pos logit contributions
gencies+7.592
rang+7.534
ía+7.218
acement+6.928
oche+6.908
Neg logit contributions
Reward-9.214
ellen-8.710
doc-8.575
 rewarded-8.435
llan-8.220
Token
Token ID251
Feature activation+4.02e+00

Pos logit contributions
gencies+2.049
rang+1.956
ía+1.895
oche+1.820
acter+1.762
Neg logit contributions
Reward-3.004
doc-2.896
 rewarded-2.884
ellen-2.864
llan-2.786
Token plan
Token ID1410
Feature activation+2.43e+00

Pos logit contributions
gencies+2.836
rang+2.793
ía+2.666
oche+2.606
mun+2.541
Neg logit contributions
Reward-3.233
 rewarded-3.057
doc-3.056
ellen-3.046
llan-2.957
Token where
Token ID810
Feature activation+1.33e+01

Pos logit contributions
gencies+1.731
rang+1.690
ía+1.623
oche+1.597
mun+1.532
Neg logit contributions
Reward-1.985
doc-1.888
 rewarded-1.874
ellen-1.856
 recru-1.804
Token2008
Token ID11528
Feature activation+8.84e+00

Pos logit contributions
rang+9.907
718+9.704
mun+9.096
ía+8.994
its+8.576
Neg logit contributions
 rewarded-23.963
 trooper-23.724
 recru-23.386
 mathemat-23.202
 indo-23.045
Token Steve
Token ID6542
Feature activation+4.05e+00

Pos logit contributions
rang+5.607
gencies+5.599
ía+5.272
oche+5.165
 Sach+5.123
Neg logit contributions
Reward-7.348
 rewarded-6.965
doc-6.811
ellen-6.792
 recru-6.727
Token Care
Token ID7276
Feature activation-1.06e+01

Pos logit contributions
gencies+2.986
rang+2.935
ía+2.852
oche+2.828
mun+2.681
Neg logit contributions
Reward-3.118
 rewarded-2.995
doc-2.961
ellen-2.920
 recru-2.845
Tokenll
Token ID297
Feature activation+4.84e+00

Pos logit contributions
llan+3.282
doc+3.133
ellen+2.816
Reward+2.697
 rewarded+2.355
Neg logit contributions
gencies-12.706
ía-11.918
rang-11.753
 Inspection-11.734
oche-11.652
Token
Token ID198
Feature activation-2.35e+00

Pos logit contributions
gencies+3.087
rang+3.080
ía+2.976
oche+2.886
mun+2.784
Neg logit contributions
Reward-4.131
 rewarded-3.978
doc-3.920
ellen-3.913
 recru-3.833
Token
Token ID198
Feature activation+2.89e+01

Pos logit contributions
Reward+1.752
doc+1.694
 rewarded+1.646
ellen+1.627
llan+1.610
Neg logit contributions
gencies-1.888
rang-1.792
ía-1.731
oche-1.712
acter-1.660
Token2008
Token ID11528
Feature activation+9.07e+00

Pos logit contributions
718+10.042
rang+9.908
mun+9.152
ía+8.836
its+8.759
Neg logit contributions
 mathemat-24.523
 rewarded-24.374
 eleph-24.314
 trooper-24.288
 notor-24.265
Token Lee
Token ID5741
Feature activation-4.64e+00

Pos logit contributions
rang+5.828
gencies+5.828
ía+5.466
oche+5.375
 Sach+5.271
Neg logit contributions
Reward-7.433
 rewarded-7.061
doc-6.897
ellen-6.878
 recru-6.838
Token Pace
Token ID44111
Feature activation-7.08e-01

Pos logit contributions
Reward+3.282
ellen+3.191
 rewarded+3.190
doc+3.179
llan+3.091
Neg logit contributions
gencies-3.755
rang-3.516
ía-3.448
oche-3.364
acter-3.334
Token
Token ID198
Feature activation-2.89e+00

Pos logit contributions
Reward+0.558
doc+0.537
 rewarded+0.535
ellen+0.526
llan+0.515
Neg logit contributions
gencies-0.527
rang-0.507
ía-0.489
oche-0.482
acter-0.461
Token
Token ID198
Feature activation+2.34e+01

Pos logit contributions
Reward+2.121
doc+2.056
 rewarded+1.985
ellen+1.965
llan+1.950
Neg logit contributions
gencies-2.355
rang-2.227
ía-2.151
oche-2.133
acter-2.072
Tokenio
Token ID952
Feature activation-7.65e+00

Pos logit contributions
gencies+2.436
rang+2.368
ía+2.310
oche+2.253
mun+2.188
Neg logit contributions
Reward-2.223
 rewarded-2.100
doc-2.082
ellen-2.048
 recru-1.990
Token Val
Token ID3254
Feature activation-1.19e+01

Pos logit contributions
Reward+5.073
 rewarded+4.986
doc+4.859
llan+4.790
ellen+4.743
Neg logit contributions
gencies-6.392
rang-6.070
ía-5.821
oche-5.803
acter-5.664
Tokener
Token ID263
Feature activation-5.13e+00

Pos logit contributions
llan+6.670
doc+6.503
Reward+6.382
ellen+6.324
 rewarded+6.036
Neg logit contributions
gencies-10.317
rang-9.863
ía-9.544
 Inspection-9.502
oche-9.491
Tokenio
Token ID952
Feature activation-1.22e+01

Pos logit contributions
Reward+2.922
doc+2.882
 rewarded+2.788
ellen+2.757
llan+2.750
Neg logit contributions
gencies-4.800
rang-4.585
oche-4.364
ía-4.345
acter-4.278
Token Borg
Token ID29004
Feature activation+1.05e+01

Pos logit contributions
 rewarded+5.976
Reward+5.848
doc+5.841
llan+5.747
ellen+5.733
Neg logit contributions
gencies-11.556
rang-10.853
ía-10.591
oche-10.424
acement-10.277
Tokenhe
Token ID258
Feature activation+2.85e+01

Pos logit contributions
rang+2.698
oche+2.207
gencies+2.132
ía+2.068
mun+1.774
Neg logit contributions
Reward-13.085
 rewarded-12.726
 recru-12.560
 trooper-12.355
doc-12.053
Tokense
Token ID325
Feature activation-8.20e+00

Pos logit contributions
rang+9.156
oche+7.789
ía+7.721
ches+7.242
sie+7.220
Neg logit contributions
 notor-25.868
VIDIA-25.603
Reward-25.241
 rewarded-24.954
 trooper-24.830
Token.
Token ID13
Feature activation-7.57e+00

Pos logit contributions
 rewarded+6.254
Reward+6.238
doc+6.062
ellen+5.906
llan+5.902
Neg logit contributions
gencies-6.037
rang-5.604
ía-5.308
oche-5.297
acter-5.202
Token Borg
Token ID29004
Feature activation+1.04e+01

Pos logit contributions
Reward+5.466
doc+5.274
 rewarded+5.095
llan+5.003
ellen+4.854
Neg logit contributions
gencies-6.106
rang-5.672
oche-5.536
ía-5.501
acter-5.377
Tokenhe
Token ID258
Feature activation+2.58e+01

Pos logit contributions
rang+2.788
oche+2.211
gencies+2.190
ía+2.138
acter+1.715
Neg logit contributions
Reward-12.906
 rewarded-12.470
 recru-12.267
 trooper-12.163
doc-12.061
Tokense
Token ID325
Feature activation-8.56e+00

Pos logit contributions
rang+7.492
ía+6.014
oche+5.975
sov+5.442
sie+5.364
Neg logit contributions
Reward-25.440
 notor-25.246
VIDIA-25.100
 rewarded-24.643
 trooper-24.569
Token in
Token ID287
Feature activation+1.61e+01

Pos logit contributions
gencies+14.665
rang+14.036
mun+13.173
 Symphony+12.988
oche+12.852
Neg logit contributions
Reward-15.691
ellen-14.777
llan-14.763
doc-14.646
 rewarded-13.984
Token London
Token ID3576
Feature activation+6.64e+00

Pos logit contributions
gencies+10.961
rang+9.966
oche+9.676
ía+9.495
 Sanct+9.351
Neg logit contributions
Reward-11.573
ellen-11.182
llan-11.158
 rewarded-10.941
doc-10.878
Token and
Token ID290
Feature activation+8.29e+00

Pos logit contributions
gencies+4.517
rang+4.227
ía+4.110
oche+3.981
mun+3.918
Neg logit contributions
Reward-5.609
doc-5.358
ellen-5.305
 rewarded-5.290
llan-5.229
Token the
Token ID262
Feature activation+1.24e+01

Pos logit contributions
gencies+5.796
rang+5.446
ía+5.329
oche+5.236
mun+5.141
Neg logit contributions
Reward-6.576
doc-6.343
ellen-6.230
llan-6.137
 rewarded-6.077
Token U
Token ID471
Feature activation+8.03e+00

Pos logit contributions
gencies+8.955
rang+8.222
oche+8.027
ía+8.022
 Sanct+7.887
Neg logit contributions
Reward-9.053
ellen-8.838
doc-8.674
 rewarded-8.399
llan-8.397
Token.
Token ID13
Feature activation+2.85e+01

Pos logit contributions
rang+4.604
gencies+4.589
ía+4.405
mun+4.267
oche+4.154
Neg logit contributions
Reward-7.167
 rewarded-7.012
 recru-6.839
ellen-6.825
doc-6.813
Token K
Token ID509
Feature activation+1.04e+01

Pos logit contributions
gencies+8.331
mun+7.559
its+7.151
ía+6.874
 Sanct+6.755
Neg logit contributions
Reward-23.865
doc-23.721
 rewarded-23.436
ellen-22.772
 trooper-22.598
Token.,
Token ID1539
Feature activation+4.47e+00

Pos logit contributions
gencies+6.054
rang+5.521
ía+5.215
oche+5.100
mun+5.067
Neg logit contributions
Reward-9.394
doc-9.080
 rewarded-9.006
ellen-8.902
 recru-8.740
Token we
Token ID356
Feature activation-4.18e+00

Pos logit contributions
gencies+3.181
rang+3.047
ía+2.898
oche+2.863
mun+2.758
Neg logit contributions
Reward-3.632
doc-3.472
ellen-3.429
 rewarded-3.419
llan-3.355
Token will
Token ID481
Feature activation-2.64e-01

Pos logit contributions
Reward+2.788
 rewarded+2.712
doc+2.634
ellen+2.560
 recru+2.516
Neg logit contributions
gencies-3.551
rang-3.459
ía-3.381
oche-3.335
acement-3.208
Token be
Token ID307
Feature activation-1.25e+00

Pos logit contributions
Reward+0.182
 rewarded+0.174
doc+0.174
ellen+0.170
llan+0.165
Neg logit contributions
gencies-0.222
rang-0.215
ía-0.209
oche-0.206
mun-0.198
Token x
Token ID2124
Feature activation+1.52e+00

Pos logit contributions
doc+3.512
Reward+3.502
 rewarded+3.326
ellen+3.289
llan+3.199
Neg logit contributions
gencies-6.371
rang-6.024
ía-5.795
oche-5.757
ateurs-5.658
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
gencies+1.048
rang+1.022
ía+0.982
oche+0.961
mun+0.923
Neg logit contributions
Reward-1.272
 rewarded-1.229
doc-1.212
ellen-1.201
 recru-1.178
Tokenx
Token ID87
Feature activation+3.88e+00

Pos logit contributions
gencies+1.230
rang+1.203
ía+1.163
oche+1.141
mun+1.103
Neg logit contributions
Reward-1.141
 rewarded-1.101
doc-1.071
ellen-1.062
 recru-1.045
Token.
Token ID13
Feature activation+6.07e+00

Pos logit contributions
gencies+2.450
rang+2.437
ía+2.346
oche+2.277
mun+2.193
Neg logit contributions
Reward-3.372
 rewarded-3.301
 recru-3.193
ellen-3.186
doc-3.178
Tokenx
Token ID87
Feature activation+2.14e+01

Pos logit contributions
rang+4.241
gencies+4.189
ía+4.104
oche+3.952
mun+3.876
Neg logit contributions
Reward-4.837
 rewarded-4.836
 recru-4.648
 trooper-4.495
ellen-4.477
Token.
Token ID13
Feature activation+2.84e+01

Pos logit contributions
rang+8.140
its+6.923
ía+6.846
sov+6.789
itar+6.621
Neg logit contributions
 notor-20.492
VIDIA-20.323
 rewarded-19.898
 recru-19.870
 opio-19.763
Token157
Token ID18458
Feature activation+5.07e+00

Pos logit contributions
rang+8.514
sov+7.449
its+7.377
spect+6.060
mun+5.844
Neg logit contributions
Reward-26.427
 rewarded-25.764
 trooper-25.320
ellen-25.103
 recru-24.947
Token 108
Token ID15495
Feature activation-9.96e-01

Pos logit contributions
gencies+3.400
rang+3.373
ía+3.252
oche+3.181
mun+3.076
Neg logit contributions
Reward-4.177
 rewarded-3.980
doc-3.942
ellen-3.906
 recru-3.850
Token67
Token ID3134
Feature activation-9.55e+00

Pos logit contributions
Reward+0.661
doc+0.631
 rewarded+0.623
ellen+0.610
llan+0.595
Neg logit contributions
gencies-0.873
rang-0.841
ía-0.817
oche-0.806
acter-0.779
Token90
Token ID3829
Feature activation-9.09e+00

Pos logit contributions
Reward+5.353
doc+5.283
llan+4.736
ellen+4.649
 rewarded+4.627
Neg logit contributions
gencies-9.172
rang-8.473
oche-8.289
ía-8.276
acter-8.194
Token x
Token ID2124
Feature activation+3.76e+00

Pos logit contributions
doc+4.529
Reward+4.431
 rewarded+4.236
ellen+4.118
llan+4.104
Neg logit contributions
gencies-9.173
rang-8.648
oche-8.342
ía-8.256
ateurs-8.192
Token rights
Token ID2489
Feature activation+6.24e+00

Pos logit contributions
gencies+5.982
ía+5.673
rang+5.645
oche+5.406
mun+5.345
Neg logit contributions
Reward-7.900
 rewarded-7.589
llan-7.522
ellen-7.488
doc-7.371
Token and
Token ID290
Feature activation+4.64e+00

Pos logit contributions
gencies+4.040
rang+3.900
ía+3.695
oche+3.647
mun+3.554
Neg logit contributions
Reward-5.408
ellen-5.065
doc-5.063
 rewarded-5.034
llan-4.957
Token privacy
Token ID6782
Feature activation+3.74e+00

Pos logit contributions
gencies+3.836
rang+3.679
ía+3.630
oche+3.560
mun+3.466
Neg logit contributions
Reward-3.189
doc-3.013
 rewarded-3.007
ellen-2.994
llan-2.960
Token of
Token ID286
Feature activation+2.12e+01

Pos logit contributions
gencies+2.518
rang+2.433
ía+2.322
oche+2.293
mun+2.208
Neg logit contributions
Reward-3.186
doc-3.017
 rewarded-3.010
ellen-2.997
llan-2.942
Token millions
Token ID5242
Feature activation+1.65e+01

Pos logit contributions
ía+11.220
gencies+10.969
oche+10.812
 702+10.793
rang+10.749
Neg logit contributions
Reward-15.318
ellen-14.707
doc-14.058
llan-13.866
 rewarded-13.047
Token of
Token ID286
Feature activation+2.83e+01

Pos logit contributions
oche+8.084
ía+7.911
gencies+7.889
rang+7.652
mun+7.187
Neg logit contributions
Reward-14.322
llan-13.349
ellen-13.334
doc-13.262
 rewarded-12.906
Token Americans
Token ID3399
Feature activation+9.45e+00

Pos logit contributions
ía+12.074
gencies+12.012
oche+12.007
 702+11.755
 Inspection+11.658
Neg logit contributions
Reward-19.825
ellen-19.111
llan-18.603
doc-17.902
asca-17.637
Token,
Token ID11
Feature activation+6.36e+00

Pos logit contributions
gencies+5.890
rang+5.679
ía+5.624
oche+5.601
mun+5.302
Neg logit contributions
Reward-7.983
doc-7.480
ellen-7.448
llan-7.312
 rewarded-7.269
Token
Token ID447
Feature activation+7.75e+00

Pos logit contributions
gencies+4.409
rang+4.216
ía+4.164
oche+4.122
mun+3.952
Neg logit contributions
Reward-5.033
doc-4.780
ellen-4.778
 rewarded-4.735
llan-4.699
Token
Token ID251
Feature activation+4.47e+00

Pos logit contributions
gencies+4.409
rang+4.106
ía+4.031
oche+3.883
acter+3.741
Neg logit contributions
Reward-7.109
ellen-6.880
doc-6.810
 rewarded-6.786
llan-6.722
Token Mer
Token ID4638
Feature activation+5.26e+00

Pos logit contributions
gencies+3.000
rang+2.901
ía+2.816
oche+2.773
mun+2.677
Neg logit contributions
Reward-3.724
doc-3.565
ellen-3.525
 rewarded-3.522
llan-3.451
Token Los
Token ID5401
Feature activation+4.68e+00

Pos logit contributions
Reward+3.084
 rewarded+2.985
doc+2.954
ellen+2.901
llan+2.850
Neg logit contributions
gencies-2.832
rang-2.721
ía-2.611
oche-2.575
acter-2.459
Token Angeles
Token ID5652
Feature activation-1.88e+00

Pos logit contributions
gencies+3.264
rang+3.136
ía+3.082
oche+2.978
mun+2.859
Neg logit contributions
Reward-3.901
 rewarded-3.720
doc-3.644
ellen-3.618
 recru-3.533
Token.
Token ID13
Feature activation-5.05e+00

Pos logit contributions
Reward+1.519
 rewarded+1.463
doc+1.462
ellen+1.423
llan+1.406
Neg logit contributions
gencies-1.368
rang-1.307
ía-1.258
oche-1.240
acter-1.193
Token Exactly
Token ID36819
Feature activation+1.58e+00

Pos logit contributions
Reward+3.669
doc+3.571
 rewarded+3.463
llan+3.383
ellen+3.345
Neg logit contributions
gencies-4.064
rang-3.834
ía-3.723
oche-3.670
acter-3.614
Token 50
Token ID2026
Feature activation+1.12e+01

Pos logit contributions
gencies+1.216
rang+1.184
ía+1.144
oche+1.125
mun+1.081
Neg logit contributions
Reward-1.190
doc-1.136
 rewarded-1.129
ellen-1.114
llan-1.080
Token years
Token ID812
Feature activation+2.83e+01

Pos logit contributions
gencies+7.988
rang+7.869
ía+7.680
oche+7.397
mun+7.170
Neg logit contributions
Reward-8.030
doc-7.898
ellen-7.716
 rewarded-7.541
llan-7.342
Token later
Token ID1568
Feature activation+6.63e+00

Pos logit contributions
ía+14.235
mun+13.996
rang+13.530
 Symphony+13.095
gencies+13.064
Neg logit contributions
Reward-17.685
doc-17.342
ellen-16.665
 rewarded-15.475
 recru-15.292
Token
Token ID960
Feature activation-2.21e-01

Pos logit contributions
gencies+4.034
rang+3.986
ía+3.851
oche+3.753
mun+3.598
Neg logit contributions
Reward-5.817
ellen-5.563
doc-5.544
 rewarded-5.498
 recru-5.360
TokenDecember
Token ID20588
Feature activation-3.88e-01

Pos logit contributions
Reward+0.184
doc+0.177
 rewarded+0.176
ellen+0.173
llan+0.170
Neg logit contributions
gencies-0.155
rang-0.149
ía-0.143
oche-0.141
acter-0.135
Token 17
Token ID1596
Feature activation-2.44e+00

Pos logit contributions
Reward+0.297
doc+0.284
 rewarded+0.284
ellen+0.278
llan+0.272
Neg logit contributions
gencies-0.299
rang-0.288
ía-0.278
oche-0.274
acter-0.263
Token,
Token ID11
Feature activation-7.85e+00

Pos logit contributions
Reward+1.968
 rewarded+1.901
doc+1.897
ellen+1.846
llan+1.822
Neg logit contributions
gencies-1.782
rang-1.693
ía-1.625
oche-1.600
acter-1.547
Token we
Token ID356
Feature activation-2.00e+00

Pos logit contributions
gencies+1.442
rang+1.411
ía+1.326
oche+1.279
mun+1.193
Neg logit contributions
Reward-3.110
doc-3.009
ellen-2.997
 rewarded-2.980
llan-2.901
Token
Token ID447
Feature activation-8.84e+00

Pos logit contributions
Reward+1.449
doc+1.388
 rewarded+1.386
ellen+1.354
llan+1.326
Neg logit contributions
gencies-1.620
rang-1.552
ía-1.504
oche-1.481
acter-1.429
Token
Token ID247
Feature activation-1.04e+01

Pos logit contributions
Reward+5.123
doc+4.859
 rewarded+4.735
llan+4.647
ellen+4.603
Neg logit contributions
gencies-8.126
rang-7.683
oche-7.574
ía-7.348
 Inspection-7.310
Tokenll
Token ID297
Feature activation+1.20e+01

Pos logit contributions
doc+3.196
llan+3.179
Reward+3.137
 rewarded+2.895
ellen+2.861
Neg logit contributions
gencies-12.068
ía-11.358
rang-11.266
 Symphony-11.209
oche-11.202
Token send
Token ID3758
Feature activation-5.34e+00

Pos logit contributions
rang+5.053
ía+4.946
oche+4.917
gencies+4.915
mun+4.631
Neg logit contributions
Reward-11.856
ellen-11.684
doc-11.255
 rewarded-11.091
 recru-10.949
Token you
Token ID345
Feature activation-3.53e+01

Pos logit contributions
Reward+3.957
 rewarded+3.832
doc+3.814
llan+3.655
ellen+3.643
Neg logit contributions
gencies-4.161
rang-3.939
ía-3.800
oche-3.740
acement-3.653
Token three
Token ID1115
Feature activation-5.98e+00

Pos logit contributions
 rewarded+7.005
 backed+6.819
 trooper+6.760
doc+6.492
 rewards+6.204
Neg logit contributions
gencies-30.676
ía-30.590
ateurs-29.943
acter-29.826
acement-29.599
Token meaningful
Token ID11570
Feature activation-1.80e+00

Pos logit contributions
 rewarded+3.885
Reward+3.859
doc+3.716
llan+3.527
ellen+3.513
Neg logit contributions
gencies-5.140
rang-4.912
ía-4.803
acement-4.567
mun-4.565
Token actions
Token ID4028
Feature activation-4.03e+00

Pos logit contributions
Reward+0.946
 rewarded+0.895
doc+0.895
ellen+0.849
llan+0.842
Neg logit contributions
gencies-1.804
rang-1.757
ía-1.712
oche-1.692
mun-1.646
Token you
Token ID345
Feature activation-8.55e+00

Pos logit contributions
Reward+3.076
doc+3.004
 rewarded+2.988
ellen+2.884
llan+2.879
Neg logit contributions
gencies-3.083
rang-2.911
ía-2.823
oche-2.781
mun-2.682
Token can
Token ID460
Feature activation-3.76e+00

Pos logit contributions
doc+3.489
Reward+3.469
 rewarded+3.432
llan+3.254
 recru+3.147
Neg logit contributions
gencies-9.330
rang-8.738
ía-8.617
oche-8.388
ukemia-8.338
Token.
Token ID13
Feature activation-1.36e+01

Pos logit contributions
Reward+3.824
doc+3.728
 rewarded+3.639
ellen+3.612
llan+3.578
Neg logit contributions
gencies-4.352
rang-4.108
ía-3.999
oche-3.903
acter-3.812
Token is
Token ID318
Feature activation-7.85e+00

Pos logit contributions
doc+8.637
 rewarded+8.329
Reward+8.255
llan+8.021
 prisoner+7.890
Neg logit contributions
gencies-10.398
ía-10.177
rang-10.097
oche-9.664
acter-9.347
Token the
Token ID262
Feature activation-1.14e+01

Pos logit contributions
Reward+5.440
doc+5.350
 rewarded+5.231
llan+5.054
ellen+5.051
Neg logit contributions
gencies-6.323
rang-5.925
ía-5.872
oche-5.702
acter-5.503
Token syntax
Token ID15582
Feature activation-1.69e+01

Pos logit contributions
doc+5.675
Reward+5.390
 rewarded+5.383
 recru+5.204
llan+5.086
Neg logit contributions
gencies-10.825
rang-10.260
oche-10.161
ateurs-9.888
ía-9.878
Token just
Token ID655
Feature activation-6.92e+00

Pos logit contributions
 rewarded+8.788
doc+8.465
 prisoner+8.134
llan+8.127
Reward+8.004
Neg logit contributions
gencies-14.148
rang-13.235
oche-13.022
ukemia-12.987
ía-12.976
Token as
Token ID355
Feature activation-3.12e+01

Pos logit contributions
Reward+3.943
 rewarded+3.917
doc+3.897
ellen+3.688
llan+3.682
Neg logit contributions
gencies-6.483
rang-6.025
ía-5.891
oche-5.752
ukemia-5.659
Token rich
Token ID5527
Feature activation-4.78e+00

Pos logit contributions
 rewarded+7.720
doc+7.228
ellen+6.251
 rewards+6.211
 endorsed+6.058
Neg logit contributions
gencies-29.099
acement-26.338
mination-26.291
rang-26.022
ukemia-25.930
Token,
Token ID11
Feature activation-1.26e+01

Pos logit contributions
Reward+3.464
doc+3.403
 rewarded+3.294
ellen+3.251
llan+3.204
Neg logit contributions
gencies-3.878
rang-3.636
ía-3.489
oche-3.468
acter-3.383
Token can
Token ID460
Feature activation-3.37e+00

Pos logit contributions
doc+4.441
 rewarded+4.056
llan+4.025
Reward+3.978
 recru+3.910
Neg logit contributions
gencies-14.075
rang-13.311
ía-13.140
ukemia-12.792
oche-12.704
Token you
Token ID345
Feature activation-2.80e+00

Pos logit contributions
Reward+2.597
doc+2.532
 rewarded+2.521
ellen+2.464
llan+2.431
Neg logit contributions
gencies-2.528
rang-2.433
ía-2.344
oche-2.317
acement-2.208
Token have
Token ID423
Feature activation-1.72e+01

Pos logit contributions
Reward+2.242
doc+2.169
 rewarded+2.160
ellen+2.113
llan+2.093
Neg logit contributions
gencies-2.056
rang-1.933
ía-1.897
oche-1.840
acter-1.760
Token Angular
Token ID28147
Feature activation+4.58e+00

Pos logit contributions
Reward+5.425
doc+5.394
 rewarded+5.258
ellen+5.053
llan+5.037
Neg logit contributions
gencies-6.655
rang-6.287
ía-6.276
oche-6.047
mun-5.956
TokenJS
Token ID20120
Feature activation+1.95e+00

Pos logit contributions
gencies+2.424
rang+2.393
ía+2.255
oche+2.224
acter+2.056
Neg logit contributions
Reward-4.525
 rewarded-4.349
ellen-4.327
doc-4.284
llan-4.164
Token.
Token ID13
Feature activation-4.89e-01

Pos logit contributions
gencies+1.367
rang+1.319
ía+1.264
oche+1.247
mun+1.188
Neg logit contributions
Reward-1.630
doc-1.557
 rewarded-1.556
ellen-1.537
llan-1.494
Token It
Token ID632
Feature activation-6.98e+00

Pos logit contributions
Reward+0.413
doc+0.398
 rewarded+0.397
ellen+0.390
llan+0.382
Neg logit contributions
gencies-0.336
rang-0.323
ía-0.311
oche-0.305
mun-0.292
Token's
Token ID338
Feature activation-1.60e+00

Pos logit contributions
Reward+3.357
doc+3.333
 rewarded+3.217
llan+3.186
ellen+3.062
Neg logit contributions
gencies-7.123
rang-6.751
acter-6.484
oche-6.474
ía-6.465
Token a
Token ID257
Feature activation-2.91e+01

Pos logit contributions
Reward+1.258
doc+1.213
 rewarded+1.210
ellen+1.185
llan+1.159
Neg logit contributions
gencies-1.205
rang-1.152
ía-1.104
oche-1.090
acter-1.046
Token marvelous
Token ID43937
Feature activation-7.07e+00

Pos logit contributions
doc+9.085
 rewarded+9.045
 limitless+8.769
ellen+8.448
 trooper+8.425
Neg logit contributions
gencies-25.664
rang-24.641
ateurs-24.024
acement-23.799
oche-23.235
Token framework
Token ID9355
Feature activation-6.84e+00

Pos logit contributions
doc+3.326
Reward+3.314
 rewarded+3.301
ellen+3.177
llan+3.088
Neg logit contributions
gencies-7.333
rang-6.854
ía-6.754
oche-6.555
acter-6.495
Token for
Token ID329
Feature activation-6.40e+00

Pos logit contributions
doc+4.446
 rewarded+4.419
Reward+4.353
ellen+4.229
llan+4.157
Neg logit contributions
gencies-5.890
rang-5.404
ía-5.340
oche-5.311
acter-5.210
Token modern
Token ID3660
Feature activation-1.05e+00

Pos logit contributions
doc+4.446
 rewarded+4.429
Reward+4.425
llan+4.261
ellen+4.243
Neg logit contributions
gencies-5.140
rang-4.900
ía-4.715
oche-4.649
acement-4.519
Token web
Token ID3992
Feature activation-9.22e-01

Pos logit contributions
Reward+0.712
doc+0.685
 rewarded+0.683
ellen+0.662
llan+0.649
Neg logit contributions
gencies-0.901
rang-0.867
ía-0.841
oche-0.828
acter-0.802
Token troops
Token ID6553
Feature activation-1.91e+01

Pos logit contributions
 recru+11.699
 rewarded+11.539
 trooper+11.525
Reward+11.296
 prisoner+11.201
Neg logit contributions
gencies-16.296
oche-14.914
rang-14.647
ía-14.596
ukemia-14.263
Token fled
Token ID11468
Feature activation-1.82e+01

Pos logit contributions
 rewarded+11.277
 abducted+10.930
 backed+10.826
 prisoner+10.474
 recru+10.297
Neg logit contributions
gencies-15.418
rang-13.980
oche-13.273
ukemia-13.159
ía-13.117
Token to
Token ID284
Feature activation-1.53e+01

Pos logit contributions
llan+11.190
doc+11.084
 rewarded+10.957
 prisoner+10.939
Reward+10.670
Neg logit contributions
gencies-13.603
ía-12.350
rang-12.139
oche-12.004
acter-11.958
Token the
Token ID262
Feature activation-2.04e+01

Pos logit contributions
Reward+9.744
 rewarded+9.389
doc+9.384
 prisoner+9.288
ellen+9.131
Neg logit contributions
gencies-11.684
ía-11.224
rang-10.919
oche-10.371
mortem-10.333
Token town
Token ID3240
Feature activation-1.41e+01

Pos logit contributions
Reward+11.470
doc+11.465
 rewarded+11.407
 recru+11.287
 prisoner+11.170
Neg logit contributions
gencies-14.884
ía-14.407
rang-13.909
oche-13.528
acter-13.510
Token when
Token ID618
Feature activation-2.86e+01

Pos logit contributions
 rewarded+8.582
doc+8.533
Reward+8.419
llan+8.348
 recru+8.252
Neg logit contributions
gencies-10.950
rang-10.327
oche-10.097
acter-9.996
ía-9.962
Token M
Token ID337
Feature activation-9.57e+00

Pos logit contributions
 abducted+15.319
 rewarded+15.302
 backed+15.168
 prisoner+14.994
doc+14.981
Neg logit contributions
gencies-18.439
ía-16.361
acter-16.355
rang-16.136
borgh-15.949
Token23
Token ID1954
Feature activation-2.07e+01

Pos logit contributions
doc+7.737
Reward+7.613
ellen+7.229
 rewarded+7.143
llan+7.055
Neg logit contributions
gencies-6.404
rang-5.914
ía-5.826
 Symphony-5.715
acter-5.550
Token rebels
Token ID12669
Feature activation-1.78e+01

Pos logit contributions
Reward+12.307
 rewarded+12.174
 recru+12.141
 trooper+11.912
doc+11.906
Neg logit contributions
gencies-15.145
ía-13.658
rang-13.546
oche-13.385
acter-13.223
Token briefly
Token ID11589
Feature activation-1.48e+01

Pos logit contributions
 rewarded+10.635
 abducted+10.209
 backed+10.127
Reward+9.993
 prisoner+9.967
Neg logit contributions
gencies-14.284
rang-12.810
oche-12.160
ía-12.082
%:-11.997
Token captured
Token ID7907
Feature activation-1.83e+01

Pos logit contributions
 rewarded+8.447
 backed+7.828
 prisoner+7.795
Reward+7.683
 abducted+7.600
Neg logit contributions
gencies-13.161
rang-12.031
ía-11.632
oche-11.611
 Inspection-11.377
Token Jill
Token ID22772
Feature activation-5.38e+00

Pos logit contributions
Reward+4.586
doc+4.401
 rewarded+4.317
ellen+4.185
llan+4.058
Neg logit contributions
gencies-8.150
ía-7.624
rang-7.526
oche-7.474
ateurs-7.320
Tokenian
Token ID666
Feature activation+1.45e-01

Pos logit contributions
Reward+3.190
doc+3.151
ellen+3.035
 rewarded+2.985
llan+2.917
Neg logit contributions
gencies-4.974
rang-4.692
ía-4.503
oche-4.494
acter-4.405
Token,
Token ID11
Feature activation+1.04e+01

Pos logit contributions
gencies+0.104
rang+0.100
ía+0.096
oche+0.095
mun+0.091
Neg logit contributions
Reward-0.119
doc-0.114
 rewarded-0.114
ellen-0.112
llan-0.110
Token we
Token ID356
Feature activation-7.99e+00

Pos logit contributions
rang+3.882
ía+3.600
oche+3.565
gencies+3.550
mun+3.143
Neg logit contributions
Reward-11.173
ellen-10.640
 rewarded-10.617
doc-10.591
 recru-10.469
Token couldn
Token ID3521
Feature activation-2.41e+00

Pos logit contributions
Reward+4.059
 rewarded+4.054
doc+3.835
ellen+3.804
 recru+3.651
Neg logit contributions
gencies-8.019
rang-7.601
ía-7.237
oche-7.211
 Symphony-7.038
Token't
Token ID470
Feature activation-2.86e+01

Pos logit contributions
Reward+1.433
doc+1.362
 rewarded+1.329
ellen+1.315
llan+1.279
Neg logit contributions
gencies-2.279
rang-2.193
ía-2.115
oche-2.106
acter-2.051
Token resist
Token ID4180
Feature activation-1.66e+00

Pos logit contributions
 rewarded+7.826
 relent+7.250
 prisoner+7.088
 trooper+6.939
 backed+6.826
Neg logit contributions
gencies-25.233
rang-23.951
ukemia-23.863
iferation-23.545
oche-23.377
Token the
Token ID262
Feature activation-5.43e+00

Pos logit contributions
Reward+1.339
doc+1.287
 rewarded+1.282
ellen+1.256
llan+1.230
Neg logit contributions
gencies-1.216
rang-1.157
ía-1.110
oche-1.094
acter-1.051
Token temptation
Token ID29062
Feature activation-5.80e+00

Pos logit contributions
Reward+2.706
 rewarded+2.515
doc+2.394
 recru+2.335
ellen+2.330
Neg logit contributions
gencies-5.679
rang-5.499
ía-5.334
oche-5.247
acter-5.130
Token to
Token ID284
Feature activation-5.63e+00

Pos logit contributions
Reward+4.127
doc+4.021
 rewarded+4.018
ellen+3.868
llan+3.846
Neg logit contributions
gencies-4.753
rang-4.369
ía-4.256
oche-4.245
acter-4.104
Token recall
Token ID10014
Feature activation-6.32e+00

Pos logit contributions
Reward+4.963
 rewarded+4.759
doc+4.733
ellen+4.655
llan+4.544
Neg logit contributions
gencies-3.622
rang-3.523
ía-3.268
oche-3.185
acement-3.117
Token (
Token ID357
Feature activation-2.02e+01

Pos logit contributions
gencies+2.501
rang+2.478
ía+2.358
oche+2.336
mun+2.208
Neg logit contributions
Reward-3.934
 rewarded-3.841
ellen-3.781
doc-3.762
 recru-3.732
Token gray
Token ID12768
Feature activation-9.51e+00

Pos logit contributions
 rewarded+5.871
 prisoner+5.381
doc+4.878
 backed+4.834
 freed+4.801
Neg logit contributions
gencies-20.812
oche-20.231
ía-20.126
rang-19.821
 Inspection-19.583
Token,
Token ID837
Feature activation-1.49e+01

Pos logit contributions
doc+5.366
Reward+5.354
 rewarded+4.988
ellen+4.971
llan+4.939
Neg logit contributions
gencies-9.044
rang-8.150
ía-7.994
oche-7.851
ateurs-7.810
Token (
Token ID357
Feature activation-2.31e+01

Pos logit contributions
 rewarded+8.196
doc+8.109
llan+7.754
Reward+7.743
 prisoner+7.718
Neg logit contributions
gencies-13.294
rang-12.148
ía-12.074
mun-11.389
iferation-11.284
Token 3
Token ID513
Feature activation-1.32e+01

Pos logit contributions
doc+5.372
llan+4.782
 recru+4.674
 rewarded+4.586
 trooper+4.501
Neg logit contributions
gencies-23.977
ía-23.617
ateurs-21.988
rang-21.925
iferation-21.803
Token,
Token ID837
Feature activation-2.82e+01

Pos logit contributions
doc+6.615
Reward+6.536
llan+6.171
ellen+5.890
 rewarded+5.840
Neg logit contributions
gencies-12.798
rang-11.622
ía-11.236
ateurs-11.181
iferation-11.136
Token 3
Token ID513
Feature activation-2.04e+01

Pos logit contributions
doc+5.671
llan+5.251
 trooper+4.680
 recru+4.676
 rewarded+4.531
Neg logit contributions
gencies-28.145
ía-26.536
rang-25.877
ateurs-25.807
iferation-25.690
Token )
Token ID1267
Feature activation-1.37e+01

Pos logit contributions
doc+5.231
Reward+5.127
ellen+4.596
llan+4.566
 rewarded+4.072
Neg logit contributions
gencies-21.830
rang-20.761
iferation-20.063
ía-19.643
ukemia-19.598
Token,
Token ID837
Feature activation-8.04e+00

Pos logit contributions
Reward+6.603
doc+6.417
ellen+5.917
llan+5.908
 rewarded+5.856
Neg logit contributions
gencies-13.411
rang-12.402
ía-12.055
oche-11.826
iferation-11.804
Token 0
Token ID657
Feature activation-1.12e+01

Pos logit contributions
Reward+5.169
doc+5.133
 rewarded+5.090
 recru+4.851
ellen+4.840
Neg logit contributions
gencies-6.847
rang-6.453
ía-6.323
oche-6.173
iferation-5.974
Token )
Token ID1267
Feature activation-1.17e+01

Pos logit contributions
Reward+3.502
doc+3.493
llan+3.279
 rewarded+2.893
ellen+2.887
Neg logit contributions
gencies-13.026
rang-12.283
ía-12.032
acter-11.774
oche-11.770
Token to
Token ID284
Feature activation-1.87e+01

Pos logit contributions
Reward+4.992
doc+4.953
 rewarded+4.888
 recru+4.737
ellen+4.730
Neg logit contributions
gencies-5.479
rang-5.218
ía-5.061
oche-4.888
acter-4.819
Token the
Token ID262
Feature activation-1.84e+01

Pos logit contributions
Reward+11.924
doc+11.673
 prisoner+11.645
 recru+11.473
 rewarded+11.395
Neg logit contributions
gencies-13.068
ía-12.207
rang-12.073
oche-11.545
acter-11.400
Token Min
Token ID1855
Feature activation+2.68e+00

Pos logit contributions
Reward+11.843
 prisoner+11.743
doc+11.695
 recru+11.688
 trooper+11.485
Neg logit contributions
gencies-12.482
rang-11.737
ía-11.723
acter-11.214
oche-11.164
Tokenova
Token ID10071
Feature activation-2.08e+01

Pos logit contributions
gencies+1.931
rang+1.893
ía+1.819
oche+1.796
mun+1.716
Neg logit contributions
Reward-2.178
 rewarded-2.092
doc-2.055
ellen-2.020
 recru-1.985
Token rapes
Token ID37459
Feature activation-4.69e+00

Pos logit contributions
 prisoner+13.349
Reward+13.042
 trooper+12.875
doc+12.823
 recru+12.759
Neg logit contributions
gencies-13.703
ía-12.849
oche-12.462
acter-12.385
rang-12.242
Token if
Token ID611
Feature activation-2.74e+01

Pos logit contributions
Reward+3.666
doc+3.594
 rewarded+3.583
ellen+3.457
llan+3.428
Neg logit contributions
gencies-3.470
rang-3.253
ía-3.141
oche-3.075
acter-3.006
Token it
Token ID340
Feature activation-2.33e+01

Pos logit contributions
 rewarded+15.626
 prisoner+15.337
doc+15.319
Reward+15.005
 abducted+14.713
Neg logit contributions
gencies-17.454
ía-15.995
borgh-15.377
rang-15.370
acter-15.113
Token did
Token ID750
Feature activation-1.40e+01

Pos logit contributions
 rewarded+13.360
 endorsed+12.206
 backed+12.074
 uncond+11.567
 prisoner+11.485
Neg logit contributions
gencies-17.661
ukemia-16.287
rang-15.950
acter-15.580
acement-15.570
Token not
Token ID407
Feature activation-1.10e+01

Pos logit contributions
Reward+7.140
doc+7.092
 rewarded+6.896
 prisoner+6.612
 recru+6.575
Neg logit contributions
gencies-12.262
rang-11.741
ía-11.512
oche-11.498
ukemia-11.069
Token try
Token ID1949
Feature activation-3.06e+00

Pos logit contributions
 rewarded+6.933
Reward+6.701
doc+6.654
 uncond+6.500
 recru+6.486
Neg logit contributions
gencies-8.861
rang-8.537
oche-8.415
ía-8.374
ukemia-8.340
Token the
Token ID262
Feature activation-1.40e+01

Pos logit contributions
Reward+2.396
 rewarded+2.321
doc+2.320
ellen+2.255
llan+2.226
Neg logit contributions
gencies-2.261
rang-2.161
ía-2.109
oche-2.077
acter-1.981
Token flavours
Token ID46946
Feature activation-6.42e+00

Pos logit contributions
Reward+5.367
 rewarded+5.301
doc+5.140
ellen+5.003
 recru+4.920
Neg logit contributions
gencies-6.494
rang-6.202
oche-5.946
ía-5.932
acter-5.762
Token are
Token ID389
Feature activation-1.03e+01

Pos logit contributions
Reward+4.420
 rewarded+4.311
doc+4.261
ellen+4.143
llan+4.090
Neg logit contributions
gencies-5.283
rang-4.979
ía-4.827
oche-4.762
acter-4.671
Token the
Token ID262
Feature activation-1.47e+01

Pos logit contributions
 rewarded+6.818
Reward+6.579
doc+6.493
 recru+6.189
ellen+6.080
Neg logit contributions
gencies-8.527
rang-8.069
ía-7.711
oche-7.675
acter-7.537
Token only
Token ID691
Feature activation-1.93e+01

Pos logit contributions
 rewarded+8.628
doc+8.400
 recru+8.397
Reward+8.361
llan+8.134
Neg logit contributions
gencies-11.777
rang-11.234
ía-11.199
oche-10.845
acter-10.671
Token projects
Token ID4493
Feature activation-1.54e+01

Pos logit contributions
 rewarded+11.337
 recru+10.495
Reward+10.461
llan+10.250
 backed+10.165
Neg logit contributions
gencies-14.689
rang-13.765
ía-13.705
acter-13.295
oche-13.064
Token that
Token ID326
Feature activation-2.73e+01

Pos logit contributions
 rewarded+9.811
doc+9.007
 backed+8.949
Reward+8.947
 recru+8.845
Neg logit contributions
gencies-12.372
rang-11.652
ía-11.105
oche-11.069
acter-10.837
Token offer
Token ID2897
Feature activation-2.05e+01

Pos logit contributions
 rewarded+13.782
 backed+13.569
 endorsed+13.443
doc+12.779
 reap+12.653
Neg logit contributions
gencies-19.111
rang-17.704
ukemia-17.634
ía-17.133
oche-17.022
Token built
Token ID3170
Feature activation-5.73e+00

Pos logit contributions
 rewarded+12.735
doc+12.304
Reward+12.203
 backed+11.867
 endorsed+11.677
Neg logit contributions
gencies-13.617
rang-13.215
ía-12.954
oche-12.549
borgh-12.356
Token-
Token ID12
Feature activation+5.13e-01

Pos logit contributions
Reward+4.298
doc+4.228
 rewarded+4.164
ellen+4.013
llan+3.961
Neg logit contributions
gencies-4.381
rang-4.173
ía-4.102
oche-3.920
acement-3.767
Tokenin
Token ID259
Feature activation-1.22e+01

Pos logit contributions
gencies+0.357
rang+0.344
ía+0.331
oche+0.325
mun+0.312
Neg logit contributions
Reward-0.430
 rewarded-0.413
doc-0.412
ellen-0.405
llan-0.396
Token support
Token ID1104
Feature activation-1.02e+01

Pos logit contributions
 rewarded+7.321
Reward+7.286
doc+7.229
 recru+6.745
 trooper+6.668
Neg logit contributions
gencies-9.916
rang-9.527
ía-9.408
oche-9.187
acter-9.184
Token /*
Token ID11900
Feature activation-6.07e+00

Pos logit contributions
Reward+0.846
doc+0.817
 rewarded+0.814
ellen+0.799
llan+0.780
Neg logit contributions
gencies-0.731
rang-0.704
ía-0.678
oche-0.666
mun-0.637
Token vim
Token ID43907
Feature activation-3.25e+00

Pos logit contributions
doc+4.591
Reward+4.588
 rewarded+4.478
ellen+4.377
llan+4.266
Neg logit contributions
gencies-4.519
rang-4.293
ía-4.185
oche-4.087
mun-3.940
Token<
Token ID27
Feature activation-1.41e+00

Pos logit contributions
Reward+2.488
doc+2.424
 rewarded+2.357
ellen+2.334
llan+2.276
Neg logit contributions
gencies-2.486
rang-2.387
ía-2.321
oche-2.258
mun-2.176
Token570
Token ID39254
Feature activation+5.29e+00

Pos logit contributions
Reward+1.137
doc+1.100
 rewarded+1.087
ellen+1.071
llan+1.044
Neg logit contributions
gencies-1.028
rang-0.988
ía-0.953
oche-0.936
mun-0.894
Token:
Token ID25
Feature activation-9.70e+00

Pos logit contributions
gencies+2.769
rang+2.652
ía+2.506
oche+2.440
acter+2.302
Neg logit contributions
Reward-5.227
 rewarded-5.049
doc-4.981
ellen-4.975
 recru-4.912
Token set
Token ID900
Feature activation-2.70e+01

Pos logit contributions
doc+6.839
Reward+6.662
 rewarded+6.527
ellen+6.508
llan+6.260
Neg logit contributions
gencies-7.451
rang-7.039
ía-6.891
oche-6.708
acter-6.489
Token sw
Token ID1509
Feature activation-5.17e+00

Pos logit contributions
doc+15.648
 rewarded+14.825
ellen+14.645
 prisoner+14.380
Reward+14.364
Neg logit contributions
gencies-16.646
ía-15.384
rang-15.258
iferation-14.381
ateurs-14.219
Token=
Token ID28
Feature activation-1.40e+01

Pos logit contributions
Reward+3.990
doc+3.929
ellen+3.861
 rewarded+3.840
llan+3.689
Neg logit contributions
gencies-3.826
rang-3.635
ía-3.505
oche-3.428
 Inspection-3.333
Token4
Token ID19
Feature activation-2.05e+00

Pos logit contributions
doc+10.450
Reward+10.218
ellen+9.922
llan+9.684
 rewarded+9.661
Neg logit contributions
gencies-9.470
rang-8.612
ía-8.468
oche-8.297
 Inspection-8.293
Token:
Token ID25
Feature activation-1.18e+01

Pos logit contributions
Reward+1.865
doc+1.815
 rewarded+1.804
ellen+1.780
llan+1.744
Neg logit contributions
gencies-1.277
rang-1.215
ía-1.168
oche-1.138
acter-1.083
Token */
Token ID9466
Feature activation-8.17e+00

Pos logit contributions
doc+8.383
Reward+8.334
ellen+8.048
 rewarded+8.033
llan+7.743
Neg logit contributions
gencies-8.571
rang-8.043
ía-7.888
oche-7.657
 Inspection-7.530
Token\n
Token ID198
Feature activation+4.67e-01

Pos logit contributions
gencies+2.170
rang+2.091
ía+2.020
oche+1.969
mun+1.881
Neg logit contributions
Reward-2.830
doc-2.718
 rewarded-2.703
ellen-2.669
 recru-2.600
Token\n
Token ID198
Feature activation+5.22e+00

Pos logit contributions
gencies+0.342
rang+0.331
ía+0.320
oche+0.314
mun+0.301
Neg logit contributions
Reward-0.373
 rewarded-0.358
doc-0.357
ellen-0.351
llan-0.342
Token1991
Token ID24529
Feature activation+1.14e+00

Pos logit contributions
gencies+3.767
rang+3.685
ía+3.498
oche+3.398
mun+3.337
Neg logit contributions
Reward-4.054
 rewarded-3.979
doc-3.906
ellen-3.880
 recru-3.858
Token Craig
Token ID13854
Feature activation+4.95e+00

Pos logit contributions
gencies+0.799
rang+0.772
ía+0.742
oche+0.729
mun+0.698
Neg logit contributions
Reward-0.951
 rewarded-0.911
doc-0.910
ellen-0.896
llan-0.873
Token T
Token ID309
Feature activation-9.68e+00

Pos logit contributions
rang+2.641
gencies+2.632
ía+2.501
oche+2.433
mun+2.347
Neg logit contributions
Reward-4.730
 rewarded-4.588
doc-4.526
ellen-4.503
 recru-4.447
Token.
Token ID13
Feature activation-2.70e+01

Pos logit contributions
Reward+6.112
doc+5.971
llan+5.855
ellen+5.809
 rewarded+5.521
Neg logit contributions
gencies-8.467
rang-7.611
ía-7.478
ateurs-7.366
oche-7.340
Token Nelson
Token ID12996
Feature activation+6.39e+00

Pos logit contributions
doc+7.602
llan+7.236
Reward+7.013
ellen+6.227
 Nut+6.005
Neg logit contributions
gencies-26.584
ateurs-24.059
rang-23.981
acter-23.936
oche-23.351
Token 1992
Token ID9768
Feature activation-7.13e-01

Pos logit contributions
gencies+4.017
rang+3.897
ía+3.793
oche+3.672
mun+3.532
Neg logit contributions
Reward-5.563
doc-5.276
 rewarded-5.272
ellen-5.208
 recru-5.104
Token Craig
Token ID13854
Feature activation+8.71e-01

Pos logit contributions
Reward+0.593
doc+0.571
 rewarded+0.570
ellen+0.560
llan+0.548
Neg logit contributions
gencies-0.502
rang-0.481
ía-0.463
oche-0.456
acter-0.435
Token T
Token ID309
Feature activation-7.94e+00

Pos logit contributions
gencies+0.562
rang+0.541
ía+0.519
oche+0.508
mun+0.485
Neg logit contributions
Reward-0.773
 rewarded-0.743
doc-0.743
ellen-0.730
llan-0.714
Token.
Token ID13
Feature activation-2.03e+01

Pos logit contributions
Reward+5.315
doc+5.146
ellen+5.100
llan+5.056
 rewarded+4.883
Neg logit contributions
gencies-6.751
rang-6.137
ía-6.002
oche-5.918
ateurs-5.818
Tokens
Token ID82
Feature activation-3.16e+00

Pos logit contributions
Reward+6.667
doc+6.662
 rewarded+6.461
llan+6.212
ellen+6.202
Neg logit contributions
gencies-6.320
rang-6.041
ía-5.826
oche-5.647
acter-5.575
Token volatile
Token ID22750
Feature activation-1.35e+01

Pos logit contributions
Reward+2.356
doc+2.273
 rewarded+2.257
ellen+2.210
llan+2.160
Neg logit contributions
gencies-2.472
rang-2.385
ía-2.325
oche-2.273
acter-2.199
Token east
Token ID7627
Feature activation-4.52e+00

Pos logit contributions
doc+8.489
Reward+8.465
 rewarded+8.295
ellen+8.055
 prisoner+8.042
Neg logit contributions
gencies-10.161
ía-9.759
rang-9.600
acter-9.585
oche-9.505
Token in
Token ID287
Feature activation-1.29e+01

Pos logit contributions
Reward+3.538
doc+3.487
 rewarded+3.448
ellen+3.391
llan+3.291
Neg logit contributions
gencies-3.304
rang-3.141
ía-3.076
oche-2.987
acter-2.917
Token November
Token ID3389
Feature activation-1.46e+01

Pos logit contributions
doc+7.863
Reward+7.821
 rewarded+7.694
 prisoner+7.472
 trooper+7.328
Neg logit contributions
gencies-10.419
rang-9.932
ía-9.760
acter-9.602
oche-9.238
Token after
Token ID706
Feature activation-2.69e+01

Pos logit contributions
Reward+9.319
doc+9.248
 rewarded+9.091
 prisoner+8.785
ellen+8.697
Neg logit contributions
gencies-11.008
rang-10.390
oche-9.878
ía-9.844
acter-9.705
Token Cong
Token ID2908
Feature activation-8.29e+00

Pos logit contributions
doc+15.029
Reward+14.950
 prisoner+14.822
 recru+14.685
 rewarded+14.607
Neg logit contributions
gencies-16.830
ía-15.461
rang-14.815
oche-14.676
acter-14.476
Tokenol
Token ID349
Feature activation-1.05e+01

Pos logit contributions
doc+6.831
Reward+6.805
ellen+6.522
 rewarded+6.442
 recru+6.294
Neg logit contributions
gencies-5.504
ía-5.197
rang-5.194
oche-4.902
 Inspection-4.897
Tokenese
Token ID2771
Feature activation-2.32e+01

Pos logit contributions
doc+3.951
 rewarded+3.450
Reward+3.209
ellen+3.206
llan+3.188
Neg logit contributions
gencies-11.730
rang-11.201
ukemia-10.988
 Inspection-10.824
ía-10.765
Token army
Token ID5428
Feature activation-2.10e+01

Pos logit contributions
 recru+12.778
 rewarded+12.675
 prisoner+12.547
 trooper+12.490
 backed+12.440
Neg logit contributions
gencies-16.353
ía-15.499
acter-15.092
oche-14.918
rang-14.877
Token troops
Token ID6553
Feature activation-1.91e+01

Pos logit contributions
 recru+11.699
 rewarded+11.539
 trooper+11.525
Reward+11.296
 prisoner+11.201
Neg logit contributions
gencies-16.296
oche-14.914
rang-14.647
ía-14.596
ukemia-14.263
Token types
Token ID3858
Feature activation-1.05e+00

Pos logit contributions
 rewarded+4.748
Reward+4.746
doc+4.714
 trooper+4.484
 recru+4.446
Neg logit contributions
gencies-7.799
rang-7.496
oche-7.254
ía-7.234
ukemia-6.941
Token:
Token ID25
Feature activation-8.31e+00

Pos logit contributions
Reward+0.846
doc+0.816
 rewarded+0.816
ellen+0.799
llan+0.782
Neg logit contributions
gencies-0.762
rang-0.731
ía-0.703
oche-0.691
mun-0.664
Token with
Token ID351
Feature activation-1.39e+01

Pos logit contributions
Reward+5.864
 rewarded+5.821
doc+5.757
ellen+5.549
llan+5.470
Neg logit contributions
gencies-6.501
rang-6.285
ía-5.958
oche-5.824
mun-5.677
Token hardware
Token ID6890
Feature activation-1.97e+01

Pos logit contributions
doc+9.539
 rewarded+9.517
Reward+9.487
ellen+8.964
 trooper+8.936
Neg logit contributions
gencies-10.126
rang-9.750
ía-9.148
oche-8.978
mun-8.895
Token virtual
Token ID7166
Feature activation+2.16e+00

Pos logit contributions
 rewarded+9.988
doc+9.060
 backed+9.005
Reward+8.982
 recru+8.718
Neg logit contributions
gencies-16.571
rang-15.703
ukemia-15.082
oche-14.887
iferation-14.684
Tokenization
Token ID1634
Feature activation-2.66e+01

Pos logit contributions
gencies+0.996
rang+0.937
ía+0.898
oche+0.861
mun+0.804
Neg logit contributions
Reward-2.323
 rewarded-2.235
doc-2.234
ellen-2.206
llan-2.172
Token for
Token ID329
Feature activation-6.17e+00

Pos logit contributions
 rewarded+14.400
 backed+13.345
 supported+13.106
doc+13.010
 endorsed+12.582
Neg logit contributions
gencies-18.809
rang-17.306
oche-16.885
ukemia-16.826
iferation-16.782
Token CPU
Token ID9135
Feature activation-6.31e+00

Pos logit contributions
Reward+4.485
 rewarded+4.481
doc+4.404
ellen+4.242
llan+4.222
Neg logit contributions
gencies-4.757
rang-4.642
ía-4.365
oche-4.309
mun-4.131
Token,
Token ID11
Feature activation-4.51e+00

Pos logit contributions
Reward+4.676
 rewarded+4.603
doc+4.505
ellen+4.380
llan+4.330
Neg logit contributions
gencies-4.811
rang-4.673
ía-4.440
oche-4.409
acement-4.194
Token memory
Token ID4088
Feature activation-1.01e+01

Pos logit contributions
Reward+3.442
 rewarded+3.389
doc+3.356
ellen+3.244
llan+3.190
Neg logit contributions
gencies-3.375
rang-3.283
ía-3.124
oche-3.075
mun-2.944
Token,
Token ID11
Feature activation-1.05e+01

Pos logit contributions
 rewarded+6.698
Reward+6.683
doc+6.630
ellen+6.298
llan+6.197
Neg logit contributions
gencies-7.963
rang-7.643
ía-7.234
oche-7.227
mun-6.892
Tokeni
Token ID72
Feature activation-1.03e+00

Pos logit contributions
gencies+2.069
rang+2.056
ía+1.933
oche+1.852
mun+1.769
Neg logit contributions
Reward-4.164
 rewarded-4.084
 recru-3.932
doc-3.926
ellen-3.902
Token",
Token ID1600
Feature activation-1.40e+01

Pos logit contributions
Reward+0.854
doc+0.824
 rewarded+0.814
ellen+0.806
llan+0.788
Neg logit contributions
gencies-0.733
rang-0.700
ía-0.674
oche-0.662
mun-0.634
Token "
Token ID366
Feature activation-1.67e+01

Pos logit contributions
doc+8.268
Reward+7.916
 rewarded+7.673
llan+7.404
ellen+7.304
Neg logit contributions
gencies-12.044
rang-11.121
oche-10.899
ía-10.883
ateurs-10.753
Token--
Token ID438
Feature activation-7.10e+00

Pos logit contributions
doc+9.628
Reward+9.284
ellen+8.377
llan+8.293
 rewarded+7.954
Neg logit contributions
gencies-14.000
oche-12.969
 Symphony-12.890
ía-12.831
rang-12.631
Tokenimages
Token ID17566
Feature activation-3.74e+00

Pos logit contributions
doc+4.067
Reward+3.916
ellen+3.774
llan+3.681
 rewarded+3.530
Neg logit contributions
gencies-6.571
rang-6.241
oche-6.195
ía-6.168
 Symphony-6.099
Token",
Token ID1600
Feature activation-2.65e+01

Pos logit contributions
Reward+2.926
doc+2.853
ellen+2.766
 rewarded+2.761
llan+2.709
Neg logit contributions
gencies-2.827
rang-2.651
ía-2.572
oche-2.512
acter-2.436
Token required
Token ID2672
Feature activation-6.93e-01

Pos logit contributions
 rewarded+10.357
doc+9.507
 trooper+9.136
 recru+8.608
Reward+8.523
Neg logit contributions
gencies-22.288
rang-21.627
ía-21.158
ateurs-20.502
itar-20.460
Token=
Token ID28
Feature activation-1.44e+01

Pos logit contributions
Reward+0.490
doc+0.468
 rewarded+0.464
ellen+0.455
llan+0.444
Neg logit contributions
gencies-0.577
rang-0.555
ía-0.538
oche-0.531
acter-0.512
TokenTrue
Token ID17821
Feature activation-6.66e+00

Pos logit contributions
Reward+4.484
doc+3.608
llan+2.701
ellen+2.669
 rewarded+2.162
Neg logit contributions
gencies-17.780
rang-16.565
oche-16.251
ía-16.064
acter-16.058
Token,
Token ID11
Feature activation-1.64e+01

Pos logit contributions
Reward+4.612
doc+4.518
 rewarded+4.282
llan+4.241
ellen+4.241
Neg logit contributions
gencies-5.641
rang-5.197
ía-5.014
oche-5.014
acter-4.905
Token help
Token ID1037
Feature activation-1.19e+01

Pos logit contributions
doc+3.788
 rewarded+3.559
Reward+3.396
 recru+3.182
 trooper+2.852
Neg logit contributions
gencies-18.910
ateurs-18.093
ía-18.091
oche-18.028
rang-17.913
Token environmental
Token ID6142
Feature activation-5.16e+00

Pos logit contributions
Reward+2.536
 rewarded+2.453
doc+2.377
ellen+2.309
llan+2.274
Neg logit contributions
gencies-3.544
rang-3.444
ía-3.356
oche-3.294
acement-3.169
Token conditions
Token ID3403
Feature activation-7.06e+00

Pos logit contributions
Reward+3.498
 rewarded+3.414
doc+3.325
ellen+3.212
 recru+3.165
Neg logit contributions
gencies-4.289
rang-4.162
ía-4.053
oche-4.036
mun-3.900
Token
Token ID784
Feature activation-5.57e+00

Pos logit contributions
Reward+5.295
 rewarded+5.278
ellen+5.082
doc+5.024
llan+4.961
Neg logit contributions
gencies-5.254
rang-5.038
ía-4.915
oche-4.810
acter-4.632
Token from
Token ID422
Feature activation-2.19e+01

Pos logit contributions
Reward+4.630
 rewarded+4.588
doc+4.508
ellen+4.399
llan+4.326
Neg logit contributions
gencies-3.758
rang-3.640
ía-3.486
oche-3.430
acement-3.271
Token snow
Token ID6729
Feature activation-9.51e+00

Pos logit contributions
 rewarded+13.500
 trooper+13.030
Reward+12.506
 prisoner+12.428
doc+12.366
Neg logit contributions
rang-14.605
gencies-14.573
ía-13.728
oche-13.181
mun-12.998
Token to
Token ID284
Feature activation-2.64e+01

Pos logit contributions
Reward+6.734
 rewarded+6.642
doc+6.624
ellen+6.458
 trooper+6.244
Neg logit contributions
gencies-7.047
rang-6.823
oche-6.665
ía-6.657
acter-6.331
Token mud
Token ID17492
Feature activation-6.49e+00

Pos logit contributions
 rewarded+15.255
 trooper+14.820
 prisoner+14.226
Reward+13.832
doc+13.689
Neg logit contributions
gencies-16.465
rang-16.154
ía-14.992
acter-14.654
oche-14.593
Token,
Token ID11
Feature activation-1.80e+01

Pos logit contributions
Reward+4.728
 rewarded+4.680
doc+4.621
ellen+4.570
llan+4.420
Neg logit contributions
gencies-4.897
rang-4.725
ía-4.599
oche-4.551
acter-4.407
Token sand
Token ID6450
Feature activation-1.41e+01

Pos logit contributions
 rewarded+11.457
 trooper+10.813
 prisoner+10.687
doc+10.614
Reward+10.585
Neg logit contributions
gencies-12.698
rang-12.629
ía-12.122
oche-11.909
mortem-11.457
Token and
Token ID290
Feature activation-1.40e+01

Pos logit contributions
 rewarded+8.652
doc+8.596
ellen+8.456
Reward+8.456
llan+8.054
Neg logit contributions
gencies-10.781
rang-10.526
ía-10.086
oche-10.046
acter-9.723
Token more
Token ID517
Feature activation-1.37e+01

Pos logit contributions
 rewarded+8.965
Reward+8.556
 trooper+8.459
doc+8.437
 recru+8.250
Neg logit contributions
gencies-10.920
rang-10.614
oche-9.925
ía-9.846
mun-9.612
Token 7
Token ID767
Feature activation+1.50e+01

Pos logit contributions
gencies+4.407
rang+4.245
oche+4.021
acement+3.809
acter+3.797
Neg logit contributions
Reward-7.570
ellen-7.271
doc-7.162
 rewarded-7.050
llan-6.968
Token 8
Token ID807
Feature activation-5.35e+00

Pos logit contributions
gencies+7.210
rang+6.704
acement+6.494
oche+6.292
ía+6.075
Neg logit contributions
ellen-13.876
Reward-13.832
doc-13.293
llan-13.243
 rewarded-12.867
Token void
Token ID7951
Feature activation-1.29e+01

Pos logit contributions
Reward+3.990
 rewarded+3.863
doc+3.819
 recru+3.661
ellen+3.648
Neg logit contributions
gencies-4.022
rang-3.911
ía-3.885
oche-3.747
mun-3.691
Token vert
Token ID9421
Feature activation-4.93e+00

Pos logit contributions
doc+5.142
 rewarded+5.012
 trooper+4.875
llan+4.828
Reward+4.734
Neg logit contributions
gencies-13.292
rang-12.741
ía-12.413
iferation-12.296
mun-12.239
Token (
Token ID357
Feature activation-1.33e+01

Pos logit contributions
Reward+3.696
doc+3.614
 rewarded+3.528
ellen+3.471
llan+3.419
Neg logit contributions
gencies-3.856
rang-3.656
ía-3.536
oche-3.435
acter-3.346
Token in
Token ID287
Feature activation-2.62e+01

Pos logit contributions
doc+7.419
 rewarded+7.367
Reward+7.200
 recru+6.978
 trooper+6.923
Neg logit contributions
gencies-11.797
ía-10.906
rang-10.810
ateurs-10.316
mun-10.242
Tokenout
Token ID448
Feature activation-1.63e+01

Pos logit contributions
doc+8.415
ellen+6.634
Reward+6.443
 prisoner+6.194
 rewarded+5.958
Neg logit contributions
gencies-24.094
ía-23.319
acter-22.217
ukemia-22.054
 Sach-21.925
Token app
Token ID598
Feature activation-1.52e+01

Pos logit contributions
doc+8.009
 recru+7.698
 rewarded+7.631
 trooper+7.564
 prisoner+7.183
Neg logit contributions
gencies-14.929
rang-14.288
ía-13.894
ukemia-13.675
mun-13.245
Tokendata
Token ID7890
Feature activation-1.09e+01

Pos logit contributions
doc+7.013
Reward+5.591
ellen+5.386
llan+5.238
 rewarded+4.728
Neg logit contributions
gencies-15.204
rang-14.346
 Inspection-14.201
 Symphony-14.139
 Preservation-14.093
Token _
Token ID4808
Feature activation-2.22e+01

Pos logit contributions
Reward+7.242
doc+7.109
 rewarded+6.895
 trooper+6.659
 recru+6.639
Neg logit contributions
gencies-8.922
rang-8.432
ía-8.095
oche-7.933
acter-7.754
Token full
Token ID1336
Feature activation-1.26e+01

Pos logit contributions
 rewarded+6.108
doc+5.984
 prisoner+5.773
 recru+5.697
 freed+5.612
Neg logit contributions
gencies-22.752
ía-22.583
rang-21.984
oche-21.428
acter-20.896
Token leaving
Token ID4305
Feature activation-6.23e+00

Pos logit contributions
Reward+3.183
 rewarded+3.133
doc+3.078
ellen+3.011
llan+2.972
Neg logit contributions
gencies-3.210
rang-3.061
ía-2.964
oche-2.902
acter-2.789
Token some
Token ID617
Feature activation-1.40e+01

Pos logit contributions
 rewarded+5.048
Reward+5.037
doc+4.939
ellen+4.782
 recru+4.750
Neg logit contributions
gencies-4.318
rang-4.093
ía-3.957
oche-3.845
acter-3.719
Token of
Token ID286
Feature activation-7.60e-01

Pos logit contributions
 rewarded+9.286
doc+8.773
 recru+8.705
 trooper+8.570
Reward+8.509
Neg logit contributions
gencies-10.785
rang-10.114
ía-9.784
oche-9.702
mun-9.584
Token the
Token ID262
Feature activation-1.43e+01

Pos logit contributions
Reward+0.655
doc+0.631
 rewarded+0.631
ellen+0.618
llan+0.606
Neg logit contributions
gencies-0.512
rang-0.491
ía-0.471
oche-0.462
mun-0.441
Token most
Token ID749
Feature activation-2.17e+01

Pos logit contributions
 rewarded+9.201
Reward+8.941
doc+8.932
 recru+8.817
 trooper+8.710
Neg logit contributions
gencies-10.861
rang-10.354
ía-10.044
oche-9.809
acter-9.631
Token amazing
Token ID4998
Feature activation-2.59e+01

Pos logit contributions
 rewarded+13.344
doc+12.180
 trooper+12.140
 recru+12.052
Reward+11.745
Neg logit contributions
gencies-15.278
rang-14.732
oche-13.797
acter-13.735
ía-13.726
Token views
Token ID5009
Feature activation-1.41e+01

Pos logit contributions
 rewarded+14.493
 hiking+13.881
 trooper+13.739
 rewards+13.488
 recru+13.400
Neg logit contributions
gencies-17.540
rang-15.952
ukemia-15.540
oche-15.528
acter-15.509
Token for
Token ID329
Feature activation-7.81e+00

Pos logit contributions
 rewarded+9.791
Reward+9.204
llan+9.123
doc+9.033
ellen+8.989
Neg logit contributions
gencies-10.434
rang-9.707
ía-9.607
oche-9.230
acter-9.094
Token the
Token ID262
Feature activation-1.74e+01

Pos logit contributions
 rewarded+6.040
Reward+6.000
doc+5.880
 recru+5.634
ellen+5.631
Neg logit contributions
gencies-5.637
rang-5.406
ía-5.160
oche-5.043
acter-4.885
Token end
Token ID886
Feature activation-1.71e+01

Pos logit contributions
 rewarded+11.263
doc+10.871
 trooper+10.729
 recru+10.591
Reward+10.511
Neg logit contributions
gencies-12.462
rang-12.087
ía-11.765
oche-11.283
acter-11.280
Token of
Token ID286
Feature activation-9.83e+00

Pos logit contributions
doc+10.978
 rewarded+10.791
Reward+10.745
 trooper+10.280
ellen+10.026
Neg logit contributions
gencies-12.405
rang-11.566
ía-11.392
ukemia-11.290
iferation-11.210
Token carefully
Token ID7773
Feature activation+9.96e+00

Pos logit contributions
gencies+0.739
rang+0.715
ía+0.688
oche+0.675
mun+0.648
Neg logit contributions
Reward-0.838
doc-0.801
 rewarded-0.798
ellen-0.787
llan-0.766
Token about
Token ID546
Feature activation-2.68e+00

Pos logit contributions
rang+5.431
gencies+5.415
ía+5.106
oche+4.909
mun+4.853
Neg logit contributions
Reward-9.179
ellen-8.634
doc-8.569
 rewarded-8.387
 recru-8.147
Token how
Token ID703
Feature activation-9.33e+00

Pos logit contributions
Reward+2.064
 rewarded+1.992
doc+1.965
ellen+1.906
llan+1.889
Neg logit contributions
gencies-2.043
rang-1.970
ía-1.897
oche-1.875
mun-1.788
Token they
Token ID484
Feature activation+2.56e+00

Pos logit contributions
 rewarded+6.302
Reward+6.286
doc+5.967
llan+5.880
 recru+5.836
Neg logit contributions
gencies-7.361
rang-7.024
oche-6.960
ía-6.791
iferation-6.511
Token will
Token ID481
Feature activation-2.67e+00

Pos logit contributions
gencies+2.056
rang+2.000
ía+1.944
oche+1.895
acter+1.834
Neg logit contributions
Reward-1.857
doc-1.782
ellen-1.752
 rewarded-1.750
llan-1.684
Token extract
Token ID7925
Feature activation-2.59e+01

Pos logit contributions
Reward+1.935
 rewarded+1.869
doc+1.835
ellen+1.785
llan+1.768
Neg logit contributions
gencies-2.153
rang-2.082
ía-2.004
oche-1.981
acement-1.902
Token all
Token ID477
Feature activation-1.85e+01

Pos logit contributions
 rewarded+15.575
Reward+14.906
 rewards+14.298
doc+13.840
 prisoner+13.740
Neg logit contributions
gencies-17.284
rang-16.102
oche-15.569
iferation-15.377
acter-15.162
Token that
Token ID326
Feature activation-1.87e+01

Pos logit contributions
 rewarded+11.335
Reward+11.172
doc+10.552
llan+10.257
 rewards+10.169
Neg logit contributions
gencies-13.802
rang-12.865
oche-12.824
ía-12.368
acter-12.288
Token valuable
Token ID8119
Feature activation-2.16e+01

Pos logit contributions
 rewarded+11.329
Reward+11.013
doc+10.342
 prisoner+10.141
 recru+10.031
Neg logit contributions
gencies-14.414
rang-13.435
oche-13.267
ukemia-12.904
acter-12.883
Token gold
Token ID3869
Feature activation-6.64e+00

Pos logit contributions
 rewarded+12.740
Reward+12.584
 prisoner+11.761
 recru+11.668
doc+11.596
Neg logit contributions
gencies-15.904
oche-14.728
rang-14.668
ukemia-14.518
acter-14.235
Token and
Token ID290
Feature activation-1.30e+01

Pos logit contributions
Reward+5.065
 rewarded+4.989
doc+4.813
ellen+4.662
llan+4.633
Neg logit contributions
gencies-5.043
rang-4.741
ía-4.565
oche-4.522
acter-4.393
Token UN
Token ID4725
Feature activation-7.33e+00

Pos logit contributions
 recru+7.762
 rewarded+7.732
doc+7.693
Reward+7.687
 prisoner+7.631
Neg logit contributions
gencies-9.932
rang-9.512
ía-9.370
oche-9.178
acter-8.811
Token-
Token ID12
Feature activation-8.23e+00

Pos logit contributions
Reward+5.526
 rewarded+5.404
doc+5.403
ellen+5.312
 recru+5.190
Neg logit contributions
gencies-5.424
rang-5.151
oche-5.105
ía-4.961
acter-4.741
Tokenbacked
Token ID17078
Feature activation-7.31e+00

Pos logit contributions
Reward+4.610
doc+4.530
 rewarded+4.326
ellen+4.323
llan+4.057
Neg logit contributions
gencies-7.698
oche-7.345
ía-7.337
rang-7.317
acter-6.920
Token government
Token ID1230
Feature activation-1.82e+01

Pos logit contributions
Reward+4.681
doc+4.605
 rewarded+4.551
ellen+4.491
llan+4.472
Neg logit contributions
gencies-6.176
rang-5.911
ía-5.857
oche-5.769
acter-5.578
Token last
Token ID938
Feature activation-1.99e+01

Pos logit contributions
 rewarded+10.723
 backed+10.191
 prisoner+10.076
 recru+9.904
Reward+9.855
Neg logit contributions
gencies-14.468
oche-13.269
rang-13.086
ía-12.856
acter-12.729
Token year
Token ID614
Feature activation-2.58e+01

Pos logit contributions
 rewarded+8.133
 prisoner+7.504
 backed+7.387
 recru+7.180
 endorsed+7.129
Neg logit contributions
gencies-19.174
ukemia-17.820
oche-17.778
ía-17.415
acter-17.403
Token pushed
Token ID7121
Feature activation-1.39e+01

Pos logit contributions
 rewarded+14.471
 abducted+14.239
 backed+13.826
 prisoner+13.499
 freed+13.378
Neg logit contributions
gencies-18.792
ukemia-17.072
oche-16.488
rang-16.372
mination-16.045
Token them
Token ID606
Feature activation-1.40e+01

Pos logit contributions
 rewarded+8.634
 prisoner+8.463
doc+8.382
 recru+8.311
Reward+8.243
Neg logit contributions
gencies-11.447
ía-10.616
rang-10.476
oche-10.439
ukemia-9.932
Token out
Token ID503
Feature activation-7.46e+00

Pos logit contributions
 prisoner+7.809
 rewarded+7.564
doc+7.455
 recru+7.376
 ashore+7.239
Neg logit contributions
gencies-12.544
ía-11.519
rang-11.506
oche-11.434
ukemia-11.094
Token of
Token ID286
Feature activation-1.01e+01

Pos logit contributions
Reward+5.103
 rewarded+5.094
doc+5.088
llan+4.942
ellen+4.900
Neg logit contributions
gencies-5.982
rang-5.649
ía-5.486
oche-5.476
acter-5.266
Token the
Token ID262
Feature activation-1.22e+01

Pos logit contributions
 rewarded+6.793
Reward+6.695
doc+6.617
 prisoner+6.552
 recru+6.503
Neg logit contributions
gencies-8.079
rang-7.636
oche-7.436
ía-7.390
acter-7.086
Token you
Token ID345
Feature activation-1.72e+01

Pos logit contributions
 rewarded+8.219
Reward+8.157
doc+7.874
ellen+7.621
 prisoner+7.461
Neg logit contributions
gencies-8.963
rang-8.794
ía-8.229
oche-8.152
mun-7.861
Token
Token ID447
Feature activation-3.08e+00

Pos logit contributions
 rewarded+10.482
Reward+9.953
 reap+9.739
doc+9.662
 recru+9.542
Neg logit contributions
gencies-13.112
rang-12.543
oche-12.199
ukemia-11.864
ía-11.707
Token
Token ID247
Feature activation-7.37e+00

Pos logit contributions
Reward+2.239
doc+2.148
 rewarded+2.146
ellen+2.091
llan+2.050
Neg logit contributions
gencies-2.469
rang-2.393
oche-2.295
ía-2.286
mun-2.185
Tokenre
Token ID260
Feature activation-1.76e+01

Pos logit contributions
Reward+6.212
doc+6.087
 rewarded+5.934
ellen+5.907
llan+5.858
Neg logit contributions
gencies-4.852
rang-4.548
oche-4.480
ía-4.354
acement-4.113
Token in
Token ID287
Feature activation-1.38e+01

Pos logit contributions
 rewarded+11.678
Reward+10.815
doc+10.781
ellen+10.192
 recru+9.998
Neg logit contributions
gencies-13.141
rang-12.771
oche-11.662
ía-11.564
iferation-11.420
Token the
Token ID262
Feature activation-2.52e+01

Pos logit contributions
Reward+8.808
 rewarded+8.691
doc+8.412
ellen+8.077
 prisoner+7.964
Neg logit contributions
gencies-10.650
rang-10.485
ía-9.750
oche-9.747
acement-9.285
Token fast
Token ID3049
Feature activation-1.14e+01

Pos logit contributions
 rewarded+14.874
Reward+13.994
doc+13.280
 prisoner+13.256
 uncond+13.108
Neg logit contributions
gencies-16.183
rang-16.178
ía-14.745
oche-14.678
ukemia-14.056
Tokened
Token ID276
Feature activation-8.03e+00

Pos logit contributions
 rewarded+6.988
Reward+6.981
doc+6.715
ellen+6.497
llan+6.226
Neg logit contributions
gencies-9.786
rang-9.158
oche-8.695
ía-8.689
acter-8.478
Token state
Token ID1181
Feature activation-1.20e+01

Pos logit contributions
Reward+4.867
 rewarded+4.767
doc+4.645
ellen+4.444
 trooper+4.381
Neg logit contributions
gencies-7.037
rang-6.812
oche-6.514
ía-6.508
acter-6.276
Token your
Token ID534
Feature activation-5.61e-01

Pos logit contributions
Reward+7.988
 rewarded+7.975
doc+7.640
ellen+7.421
llan+7.256
Neg logit contributions
gencies-9.216
rang-9.032
oche-8.591
ía-8.474
acter-8.161
Token body
Token ID1767
Feature activation-6.56e-01

Pos logit contributions
Reward+0.427
 rewarded+0.409
doc+0.409
ellen+0.400
llan+0.390
Neg logit contributions
gencies-0.433
rang-0.419
ía-0.404
oche-0.398
mun-0.382
Token UK
Token ID3482
Feature activation-4.62e+00

Pos logit contributions
Reward+5.880
ellen+5.701
doc+5.700
 rewarded+5.628
llan+5.379
Neg logit contributions
gencies-5.716
rang-5.271
ía-5.246
oche-5.024
acter-4.935
Token IP
Token ID6101
Feature activation+1.14e+00

Pos logit contributions
Reward+3.317
doc+3.182
 rewarded+3.179
ellen+3.127
llan+3.002
Neg logit contributions
gencies-3.737
rang-3.573
ía-3.480
oche-3.401
acter-3.292
Token party
Token ID2151
Feature activation-3.95e+00

Pos logit contributions
gencies+0.842
rang+0.814
ía+0.785
oche+0.773
mun+0.745
Neg logit contributions
Reward-0.908
 rewarded-0.869
doc-0.868
ellen-0.854
llan-0.833
Token in
Token ID287
Feature activation-6.77e+00

Pos logit contributions
Reward+3.078
 rewarded+3.000
doc+2.987
ellen+2.918
llan+2.860
Neg logit contributions
gencies-2.965
rang-2.812
ía-2.709
oche-2.662
acter-2.566
Token last
Token ID938
Feature activation-1.66e+01

Pos logit contributions
Reward+4.992
 rewarded+4.932
doc+4.872
ellen+4.835
llan+4.676
Neg logit contributions
gencies-5.192
rang-4.961
ía-4.761
oche-4.596
acter-4.424
Token month
Token ID1227
Feature activation-2.50e+01

Pos logit contributions
doc+7.535
 rewarded+7.391
Reward+7.349
ellen+7.294
 prisoner+7.146
Neg logit contributions
gencies-15.345
rang-14.453
oche-13.982
ía-13.829
acter-13.821
Token's
Token ID338
Feature activation-9.87e+00

Pos logit contributions
llan+9.183
 rewarded+9.138
ellen+9.008
 backed+8.622
Reward+8.609
Neg logit contributions
gencies-21.672
rang-20.339
iferation-19.189
oche-19.097
ateurs-18.954
Token European
Token ID3427
Feature activation+4.02e+00

Pos logit contributions
Reward+6.852
 rewarded+6.816
doc+6.635
ellen+6.607
llan+6.480
Neg logit contributions
gencies-7.540
rang-7.334
ía-6.914
oche-6.777
acter-6.692
Token elections
Token ID7024
Feature activation-7.26e+00

Pos logit contributions
gencies+2.428
rang+2.334
ía+2.272
oche+2.206
mun+2.144
Neg logit contributions
Reward-3.652
doc-3.525
 rewarded-3.506
ellen-3.461
llan-3.399
Token.
Token ID13
Feature activation-4.07e+00

Pos logit contributions
Reward+5.088
 rewarded+5.075
doc+4.962
ellen+4.873
llan+4.774
Neg logit contributions
gencies-5.855
rang-5.471
ía-5.257
oche-5.255
acter-5.071
Token
Token ID198
Feature activation-1.19e+00

Pos logit contributions
Reward+3.057
doc+2.936
 rewarded+2.901
ellen+2.832
llan+2.783
Neg logit contributions
gencies-3.190
rang-3.043
ía-2.952
oche-2.904
acter-2.811
Token
Token ID447
Feature activation+6.18e-01

Pos logit contributions
Reward+2.143
 rewarded+2.096
doc+2.089
ellen+2.035
llan+1.998
Neg logit contributions
gencies-1.841
rang-1.750
ía-1.688
oche-1.642
mun-1.580
Token
Token ID251
Feature activation-4.48e+00

Pos logit contributions
gencies+0.474
rang+0.458
ía+0.443
oche+0.435
mun+0.418
Neg logit contributions
Reward-0.474
doc-0.453
 rewarded-0.453
ellen-0.445
llan-0.434
Token said
Token ID531
Feature activation-5.43e+00

Pos logit contributions
Reward+3.476
doc+3.389
 rewarded+3.349
ellen+3.300
llan+3.207
Neg logit contributions
gencies-3.372
rang-3.199
ía-3.085
oche-3.035
acement-2.924
Token Dr
Token ID1583
Feature activation+4.83e-01

Pos logit contributions
Reward+3.750
doc+3.673
 rewarded+3.625
ellen+3.554
llan+3.507
Neg logit contributions
gencies-4.481
rang-4.252
ía-4.100
oche-4.076
acement-3.942
Token Dum
Token ID30933
Feature activation+7.82e+00

Pos logit contributions
gencies+0.427
rang+0.414
ía+0.403
oche+0.397
mun+0.383
Neg logit contributions
Reward-0.313
doc-0.297
 rewarded-0.297
ellen-0.290
llan-0.281
Tokenus
Token ID385
Feature activation+2.25e+01

Pos logit contributions
rang+4.849
gencies+4.679
ía+4.596
oche+4.477
mun+4.331
Neg logit contributions
Reward-7.017
 rewarded-6.747
 recru-6.567
ellen-6.390
 trooper-6.324
Tokenque
Token ID4188
Feature activation+2.65e+00

Pos logit contributions
oche+6.509
ía+6.151
rang+6.098
sov+5.734
ches+5.148
Neg logit contributions
Reward-22.778
 notor-22.154
VIDIA-22.118
 rewarded-22.111
 recru-21.657
Token,
Token ID11
Feature activation+8.68e+00

Pos logit contributions
gencies+1.816
rang+1.756
ía+1.707
oche+1.670
mun+1.598
Neg logit contributions
Reward-2.233
 rewarded-2.124
doc-2.121
ellen-2.087
llan-2.045
Token who
Token ID508
Feature activation+8.76e+00

Pos logit contributions
gencies+6.022
rang+5.883
ía+5.692
oche+5.532
acter+5.402
Neg logit contributions
Reward-6.844
ellen-6.392
llan-6.258
doc-6.238
 rewarded-6.238
Token is
Token ID318
Feature activation+4.34e-01

Pos logit contributions
gencies+6.678
rang+6.624
ía+6.490
oche+6.363
mun+6.159
Neg logit contributions
Reward-6.294
doc-5.729
ellen-5.713
llan-5.580
 rewarded-5.429
Token the
Token ID262
Feature activation+5.98e-01

Pos logit contributions
gencies+0.335
rang+0.324
ía+0.313
oche+0.308
mun+0.296
Neg logit contributions
Reward-0.331
doc-0.316
 rewarded-0.315
ellen-0.310
llan-0.303
Token arrested
Token ID5169
Feature activation+2.33e+00

Pos logit contributions
gencies+6.213
rang+6.116
ía+6.003
oche+5.748
mun+5.735
Neg logit contributions
Reward-5.653
doc-5.418
ellen-5.405
llan-5.211
 rewarded-4.956
Token both
Token ID1111
Feature activation+4.76e+00

Pos logit contributions
gencies+1.773
rang+1.725
ía+1.662
oche+1.633
mun+1.578
Neg logit contributions
Reward-1.776
doc-1.701
 rewarded-1.681
ellen-1.677
llan-1.624
Token Tav
Token ID41489
Feature activation+1.48e+01

Pos logit contributions
gencies+3.627
rang+3.549
ía+3.405
oche+3.358
mun+3.257
Neg logit contributions
Reward-3.553
doc-3.380
ellen-3.380
 rewarded-3.306
llan-3.237
Tokenare
Token ID533
Feature activation+7.42e+00

Pos logit contributions
rang+6.749
gencies+6.543
oche+6.213
ía+5.907
mun+5.740
Neg logit contributions
Reward-13.917
 rewarded-13.662
doc-13.186
 trooper-13.044
 recru-12.935
Token and
Token ID290
Feature activation+1.23e+01

Pos logit contributions
gencies+4.392
rang+4.346
ía+4.119
oche+4.009
mun+3.873
Neg logit contributions
Reward-6.678
doc-6.312
 rewarded-6.298
ellen-6.241
llan-6.043
Token Yad
Token ID41449
Feature activation+1.90e+01

Pos logit contributions
rang+6.779
gencies+6.737
ía+6.320
 Sach+6.288
oche+6.125
Neg logit contributions
Reward-10.276
ellen-10.160
doc-10.033
llan-9.834
 rewarded-9.468
Tokenav
Token ID615
Feature activation+5.16e+00

Pos logit contributions
rang+4.752
oche+4.551
ía+3.816
gencies+3.809
acter+3.619
Neg logit contributions
Reward-21.022
 rewarded-20.415
 trooper-19.890
 recru-19.884
ellen-19.643
Token.
Token ID13
Feature activation+8.44e+00

Pos logit contributions
gencies+3.401
rang+3.340
ía+3.198
oche+3.133
mun+2.991
Neg logit contributions
Reward-4.422
doc-4.174
 rewarded-4.138
ellen-4.117
llan-4.006
Token A
Token ID317
Feature activation+8.49e+00

Pos logit contributions
gencies+5.487
rang+5.433
ía+5.161
 Inspection+5.077
mun+5.008
Neg logit contributions
ellen-6.686
Reward-6.683
 rewarded-6.629
 recru-6.466
doc-6.387
Token case
Token ID1339
Feature activation+1.10e+01

Pos logit contributions
gencies+6.587
rang+6.546
oche+6.129
ía+6.102
mun+6.032
Neg logit contributions
Reward-5.883
ellen-5.618
 rewarded-5.610
doc-5.516
llan-5.348
Token was
Token ID373
Feature activation+1.35e+01

Pos logit contributions
rang+6.280
gencies+6.088
ía+5.782
oche+5.634
mortem+5.536
Neg logit contributions
Reward-9.716
ellen-9.301
doc-9.028
 rewarded-8.981
llan-8.922
Token launch
Token ID4219
Feature activation+8.89e+00

Pos logit contributions
rang+9.384
gencies+9.263
ía+8.794
oche+8.756
mun+8.556
Neg logit contributions
Reward-10.308
doc-9.490
ellen-9.320
 rewarded-9.138
llan-8.944
Token system
Token ID1080
Feature activation+3.52e+00

Pos logit contributions
rang+6.274
gencies+6.192
ía+5.907
oche+5.732
mun+5.668
Neg logit contributions
Reward-6.954
doc-6.512
ellen-6.437
 rewarded-6.391
llan-6.216
Token.
Token ID13
Feature activation+7.80e+00

Pos logit contributions
gencies+2.455
rang+2.394
ía+2.305
oche+2.256
mun+2.162
Neg logit contributions
Reward-2.915
doc-2.771
 rewarded-2.720
ellen-2.715
llan-2.633
Token It
Token ID632
Feature activation+8.54e+00

Pos logit contributions
gencies+5.368
rang+5.261
ía+5.009
oche+4.938
 Inspection+4.818
Neg logit contributions
Reward-6.219
ellen-5.916
 rewarded-5.899
doc-5.877
 recru-5.665
Token has
Token ID468
Feature activation+8.35e+00

Pos logit contributions
gencies+5.888
rang+5.788
ía+5.650
oche+5.585
mun+5.274
Neg logit contributions
Reward-6.876
doc-6.448
ellen-6.296
 rewarded-6.076
 recru-5.975
Token an
Token ID281
Feature activation+1.90e+01

Pos logit contributions
gencies+6.135
rang+6.119
ía+5.897
oche+5.756
mun+5.581
Neg logit contributions
Reward-6.220
doc-5.882
ellen-5.780
 rewarded-5.518
llan-5.435
Token expected
Token ID2938
Feature activation+1.30e+01

Pos logit contributions
rang+12.842
gencies+12.723
ía+12.394
oche+12.367
 Inspection+12.352
Neg logit contributions
Reward-12.539
doc-11.188
llan-10.726
ellen-10.725
 rewarded-10.418
Token service
Token ID2139
Feature activation+7.58e+00

Pos logit contributions
rang+8.933
gencies+8.864
ía+8.279
acement+8.200
 Inspection+8.154
Neg logit contributions
Reward-9.697
doc-9.056
ellen-9.000
 rewarded-8.538
llan-8.400
Token life
Token ID1204
Feature activation+4.97e+00

Pos logit contributions
gencies+3.712
rang+3.705
ía+3.390
oche+3.295
acement+3.176
Neg logit contributions
Reward-7.605
doc-7.303
ellen-7.255
 rewarded-7.163
llan-7.056
Token of
Token ID286
Feature activation+6.03e-01

Pos logit contributions
gencies+3.236
rang+3.187
ía+3.019
oche+2.969
acement+2.874
Neg logit contributions
Reward-4.300
doc-4.095
ellen-4.032
 rewarded-4.001
llan-3.925
Token 90
Token ID4101
Feature activation+9.73e+00

Pos logit contributions
gencies+0.469
rang+0.454
ía+0.439
oche+0.432
mun+0.414
Neg logit contributions
Reward-0.456
doc-0.435
 rewarded-0.434
ellen-0.427
llan-0.416
Token awaits
Token ID37180
Feature activation-6.69e+00

Pos logit contributions
 rewarded+5.179
Reward+4.975
doc+4.835
ellen+4.708
 recru+4.707
Neg logit contributions
gencies-6.025
rang-5.767
ía-5.452
oche-5.451
acter-5.395
Token.
Token ID13
Feature activation-9.65e+00

Pos logit contributions
Reward+5.178
 rewarded+5.160
doc+4.939
ellen+4.845
llan+4.816
Neg logit contributions
gencies-5.059
rang-4.708
ía-4.599
oche-4.529
mun-4.296
Token
Token ID198
Feature activation-6.29e+00

Pos logit contributions
Reward+6.726
doc+6.430
 rewarded+6.292
llan+6.128
ellen+6.108
Neg logit contributions
gencies-7.928
rang-7.358
ía-7.071
oche-7.038
acter-6.983
Token
Token ID198
Feature activation+1.93e+00

Pos logit contributions
Reward+4.156
doc+4.098
llan+3.823
 rewarded+3.781
ellen+3.772
Neg logit contributions
gencies-5.520
rang-5.155
ía-4.981
oche-4.953
acter-4.868
TokenR
Token ID49
Feature activation+1.09e+01

Pos logit contributions
gencies+1.169
rang+1.125
ía+1.069
oche+1.047
mun+1.002
Neg logit contributions
Reward-1.770
 rewarded-1.728
doc-1.714
ellen-1.693
 recru-1.657
TokenANK
Token ID15154
Feature activation+1.98e+01

Pos logit contributions
rang+6.648
oche+6.356
ía+6.346
gencies+6.328
acement+6.081
Neg logit contributions
 rewarded-9.031
Reward-8.793
 recru-8.761
doc-8.700
 trooper-8.586
Token 6
Token ID718
Feature activation+1.46e+00

Pos logit contributions
ía+7.673
rang+7.549
 Inspection+7.467
 Symphony+7.240
oche+7.204
Neg logit contributions
Reward-17.535
 rewarded-16.894
VIDIA-16.752
ellen-16.717
 recru-16.657
Token Jal
Token ID28649
Feature activation-1.08e+01

Pos logit contributions
gencies+1.029
rang+0.996
ía+0.959
oche+0.944
mun+0.905
Neg logit contributions
Reward-1.203
doc-1.155
 rewarded-1.152
ellen-1.133
llan-1.102
Tokenen
Token ID268
Feature activation-8.48e+00

Pos logit contributions
doc+4.991
ellen+4.878
Reward+4.643
 rewarded+4.432
llan+4.420
Neg logit contributions
gencies-11.278
rang-10.266
acter-10.031
ía-10.016
 Symphony-9.914
Token Ramsey
Token ID30552
Feature activation-6.31e+00

Pos logit contributions
 rewarded+3.679
ellen+3.615
Reward+3.578
doc+3.394
 prisoner+3.337
Neg logit contributions
gencies-9.235
rang-8.710
oche-8.556
ía-8.407
acter-8.222
Token,
Token ID11
Feature activation-7.55e+00

Pos logit contributions
Reward+4.514
doc+4.396
 rewarded+4.343
ellen+4.218
llan+4.183
Neg logit contributions
gencies-5.120
rang-4.806
oche-4.591
ía-4.579
acter-4.483
Token the
Token ID262
Feature activation+3.28e+00

Pos logit contributions
gencies+1.701
rang+1.640
ía+1.593
oche+1.563
mun+1.496
Neg logit contributions
Reward-1.712
 rewarded-1.633
doc-1.632
ellen-1.615
llan-1.554
Token failure
Token ID5287
Feature activation+3.12e+00

Pos logit contributions
gencies+2.345
rang+2.251
ía+2.186
oche+2.135
acement+2.036
Neg logit contributions
Reward-2.680
 rewarded-2.556
doc-2.555
ellen-2.530
llan-2.434
Token of
Token ID286
Feature activation+7.50e+00

Pos logit contributions
gencies+2.319
rang+2.273
ía+2.198
oche+2.148
mun+2.059
Neg logit contributions
Reward-2.458
doc-2.315
ellen-2.292
 rewarded-2.261
llan-2.210
Token the
Token ID262
Feature activation+1.06e+01

Pos logit contributions
gencies+5.688
rang+5.559
ía+5.451
oche+5.309
 Inspection+5.135
Neg logit contributions
Reward-5.402
ellen-5.186
 rewarded-5.133
doc-5.130
llan-4.808
Token Ranger
Token ID21888
Feature activation+1.60e+01

Pos logit contributions
gencies+7.555
rang+7.415
ía+7.288
oche+7.052
 Inspection+6.957
Neg logit contributions
Reward-7.762
ellen-7.408
doc-7.369
 rewarded-7.272
 recru-6.812
Token 6
Token ID718
Feature activation+2.26e+01

Pos logit contributions
gencies+10.020
rang+10.011
oche+9.817
ía+9.726
 Inspection+9.416
Neg logit contributions
Reward-11.709
ellen-11.327
doc-11.078
 rewarded-10.777
 recru-10.334
Token camera
Token ID4676
Feature activation+1.70e+01

Pos logit contributions
rang+14.928
gencies+14.386
oche+14.318
ía+14.215
 Inspection+13.994
Neg logit contributions
Reward-13.373
ellen-13.073
doc-12.120
 rewarded-11.439
 recru-11.267
Token package
Token ID5301
Feature activation+1.21e+01

Pos logit contributions
rang+11.443
ía+11.195
gencies+11.138
oche+10.846
 Inspection+10.675
Neg logit contributions
Reward-11.748
ellen-11.021
doc-10.548
 rewarded-9.740
 recru-9.559
Token was
Token ID373
Feature activation+2.40e+00

Pos logit contributions
rang+8.877
gencies+8.758
ía+8.483
oche+8.303
mun+8.054
Neg logit contributions
Reward-8.695
ellen-8.106
doc-7.753
 rewarded-7.379
llan-7.300
Token released
Token ID2716
Feature activation-8.88e-02

Pos logit contributions
gencies+2.055
rang+2.016
ía+1.954
oche+1.918
mun+1.858
Neg logit contributions
Reward-1.605
doc-1.517
ellen-1.494
 rewarded-1.477
llan-1.434
Token on
Token ID319
Feature activation-6.93e+00

Pos logit contributions
Reward+0.073
doc+0.070
 rewarded+0.070
ellen+0.069
llan+0.067
Neg logit contributions
gencies-0.063
rang-0.061
ía-0.058
oche-0.057
mun-0.055
Token
Token ID198
Feature activation+1.45e+01

Pos logit contributions
Reward+1.240
doc+1.196
 rewarded+1.172
ellen+1.155
llan+1.138
Neg logit contributions
gencies-1.283
rang-1.221
ía-1.181
oche-1.164
acter-1.125
Token12
Token ID1065
Feature activation-5.45e+00

Pos logit contributions
gencies+9.244
rang+9.014
ía+8.656
oche+8.318
mun+8.292
Neg logit contributions
 rewarded-10.833
 recru-10.766
ellen-10.310
Reward-10.066
 trooper-10.033
Token)
Token ID8
Feature activation+6.31e+00

Pos logit contributions
Reward+2.675
doc+2.566
llan+2.376
 rewarded+2.353
ellen+2.344
Neg logit contributions
gencies-5.728
rang-5.429
oche-5.259
ía-5.247
acter-5.178
Token When
Token ID1649
Feature activation+5.44e+00

Pos logit contributions
gencies+4.770
rang+4.638
ía+4.481
oche+4.412
 Symphony+4.290
Neg logit contributions
Reward-4.620
 rewarded-4.488
doc-4.466
ellen-4.456
 recru-4.403
Token Dancing
Token ID31799
Feature activation+9.84e+00

Pos logit contributions
gencies+4.240
rang+4.128
ía+4.002
oche+3.946
 Symphony+3.739
Neg logit contributions
Reward-3.979
doc-3.826
ellen-3.741
 rewarded-3.734
llan-3.639
Token the
Token ID262
Feature activation+4.02e+00

Pos logit contributions
gencies+6.562
rang+6.381
ía+6.115
oche+5.995
 Symphony+5.809
Neg logit contributions
Reward-7.849
doc-7.626
ellen-7.442
 rewarded-7.279
 recru-7.256
Token Quad
Token ID20648
Feature activation-3.46e+00

Pos logit contributions
gencies+3.012
rang+2.913
ía+2.810
oche+2.774
 Symphony+2.655
Neg logit contributions
Reward-3.115
doc-2.977
 rewarded-2.973
ellen-2.928
 recru-2.839
Tokenr
Token ID81
Feature activation+3.37e+00

Pos logit contributions
Reward+2.665
doc+2.598
 rewarded+2.583
ellen+2.509
llan+2.477
Neg logit contributions
gencies-2.564
rang-2.414
ía-2.392
oche-2.333
mun-2.252
Tokenille
Token ID8270
Feature activation+2.88e+00

Pos logit contributions
gencies+1.939
rang+1.867
ía+1.778
oche+1.749
acter+1.651
Neg logit contributions
Reward-3.229
 rewarded-3.126
doc-3.107
ellen-3.051
 recru-3.013
Token,
Token ID11
Feature activation+8.10e+00

Pos logit contributions
gencies+1.999
rang+1.939
ía+1.863
oche+1.833
mun+1.738
Neg logit contributions
Reward-2.415
 rewarded-2.305
doc-2.301
ellen-2.275
 recru-2.217
Token Don
Token ID2094
Feature activation+4.53e+00

Pos logit contributions
gencies+6.181
rang+5.977
oche+5.715
ía+5.715
 Inspection+5.543
Neg logit contributions
Reward-5.786
doc-5.592
ellen-5.516
 rewarded-5.508
 recru-5.351
Token is
Token ID318
Feature activation-1.97e+00

Pos logit contributions
gencies+3.577
rang+3.502
ía+3.407
oche+3.337
mun+3.184
Neg logit contributions
Reward-3.984
doc-3.722
 rewarded-3.704
ellen-3.697
llan-3.579
Token tight
Token ID5381
Feature activation+6.55e+00

Pos logit contributions
Reward+1.354
 rewarded+1.324
doc+1.301
ellen+1.270
llan+1.241
Neg logit contributions
gencies-1.664
rang-1.595
ía-1.543
oche-1.519
acter-1.472
Token,
Token ID11
Feature activation+4.50e+00

Pos logit contributions
gencies+4.221
rang+4.154
ía+4.039
oche+3.949
mun+3.788
Neg logit contributions
Reward-5.541
doc-5.266
 rewarded-5.208
ellen-5.167
 recru-5.075
Token maybe
Token ID3863
Feature activation+1.85e+00

Pos logit contributions
gencies+3.184
rang+3.086
ía+2.971
oche+2.957
mun+2.810
Neg logit contributions
Reward-3.666
doc-3.474
ellen-3.434
 rewarded-3.399
llan-3.342
Token 5
Token ID642
Feature activation-4.80e+00

Pos logit contributions
gencies+1.369
rang+1.329
ía+1.281
oche+1.265
mun+1.207
Neg logit contributions
Reward-1.470
doc-1.399
 rewarded-1.385
ellen-1.376
llan-1.339
Token to
Token ID284
Feature activation+1.10e+01

Pos logit contributions
Reward+3.315
 rewarded+3.272
doc+3.265
ellen+3.192
llan+3.074
Neg logit contributions
gencies-3.886
rang-3.757
ía-3.596
oche-3.518
mun-3.419
Token 6
Token ID718
Feature activation-5.74e+00

Pos logit contributions
rang+6.290
gencies+6.204
ía+6.113
oche+5.927
acter+5.787
Neg logit contributions
Reward-9.777
doc-8.945
ellen-8.877
 rewarded-8.839
 recru-8.602
Token times
Token ID1661
Feature activation+8.85e+00

Pos logit contributions
Reward+3.469
 rewarded+3.442
doc+3.427
ellen+3.309
 recru+3.181
Neg logit contributions
gencies-5.118
rang-4.937
ía-4.799
oche-4.622
mortem-4.525
Token.
Token ID13
Feature activation+9.85e-01

Pos logit contributions
gencies+5.437
rang+5.328
ía+5.254
oche+5.174
mun+4.948
Neg logit contributions
Reward-7.599
doc-7.154
ellen-7.067
 rewarded-6.898
llan-6.854
Token Look
Token ID6803
Feature activation+3.63e+00

Pos logit contributions
gencies+0.726
rang+0.701
ía+0.677
oche+0.665
mun+0.637
Neg logit contributions
Reward-0.782
 rewarded-0.749
doc-0.749
ellen-0.738
llan-0.719
Token at
Token ID379
Feature activation+6.92e-01

Pos logit contributions
gencies+2.649
rang+2.551
ía+2.476
oche+2.438
acement+2.328
Neg logit contributions
Reward-2.923
doc-2.742
 rewarded-2.732
ellen-2.723
llan-2.650
Token,
Token ID11
Feature activation+4.13e+00

Pos logit contributions
Reward+2.567
 rewarded+2.522
doc+2.476
ellen+2.432
llan+2.382
Neg logit contributions
gencies-2.616
rang-2.514
ía-2.416
oche-2.412
acter-2.299
Token but
Token ID475
Feature activation-2.06e+00

Pos logit contributions
gencies+2.524
rang+2.438
ía+2.322
oche+2.269
mun+2.204
Neg logit contributions
Reward-3.752
doc-3.615
 rewarded-3.531
ellen-3.514
llan-3.457
Token that
Token ID326
Feature activation-4.51e+00

Pos logit contributions
Reward+1.639
 rewarded+1.589
doc+1.574
ellen+1.552
llan+1.519
Neg logit contributions
gencies-1.517
rang-1.451
ía-1.399
oche-1.382
mun-1.321
Token relationship
Token ID2776
Feature activation-4.22e-01

Pos logit contributions
Reward+3.144
 rewarded+3.123
doc+3.050
ellen+2.981
 recru+2.943
Neg logit contributions
gencies-3.695
rang-3.502
ía-3.403
oche-3.381
acter-3.248
Token had
Token ID550
Feature activation+7.37e+00

Pos logit contributions
Reward+0.330
 rewarded+0.317
doc+0.316
ellen+0.310
llan+0.303
Neg logit contributions
gencies-0.317
rang-0.305
ía-0.295
oche-0.290
mun-0.278
Token only
Token ID691
Feature activation+7.11e+00

Pos logit contributions
rang+5.522
gencies+5.519
ía+5.266
oche+5.085
acter+4.940
Neg logit contributions
Reward-5.557
doc-5.204
ellen-5.124
 rewarded-4.815
 recru-4.809
Token just
Token ID655
Feature activation+5.61e+00

Pos logit contributions
gencies+5.524
rang+5.516
ía+5.268
oche+5.119
mun+5.018
Neg logit contributions
Reward-5.158
doc-4.871
ellen-4.756
 rewarded-4.465
llan-4.443
Token recently
Token ID2904
Feature activation-6.71e-01

Pos logit contributions
gencies+4.346
rang+4.316
ía+4.176
oche+4.070
mun+3.973
Neg logit contributions
Reward-4.111
doc-3.897
ellen-3.804
 rewarded-3.644
 recru-3.624
Token reached
Token ID4251
Feature activation+4.07e+00

Pos logit contributions
Reward+0.440
 rewarded+0.422
doc+0.419
ellen+0.409
llan+0.399
Neg logit contributions
gencies-0.589
rang-0.569
ía-0.553
oche-0.545
acter-0.526
Token the
Token ID262
Feature activation-2.84e+00

Pos logit contributions
gencies+2.379
rang+2.295
ía+2.208
oche+2.103
mun+2.064
Neg logit contributions
Reward-3.795
doc-3.626
ellen-3.600
 rewarded-3.589
llan-3.512
Token level
Token ID1241
Feature activation-2.05e+00

Pos logit contributions
Reward+2.091
 rewarded+2.027
doc+2.004
ellen+1.948
 recru+1.909
Neg logit contributions
gencies-2.255
rang-2.167
ía-2.095
oche-2.086
acter-1.986
Token reporters
Token ID7638
Feature activation-1.04e+01

Pos logit contributions
Reward+2.197
doc+2.112
 rewarded+2.104
ellen+2.043
 recru+1.996
Neg logit contributions
gencies-2.340
rang-2.258
ía-2.194
oche-2.135
acter-2.081
Token outside
Token ID2354
Feature activation-4.42e+00

Pos logit contributions
 rewarded+6.672
Reward+6.642
doc+6.488
 prisoner+6.295
ellen+6.288
Neg logit contributions
gencies-8.524
rang-8.197
ía-7.862
acter-7.705
oche-7.543
Token court
Token ID2184
Feature activation-1.06e+01

Pos logit contributions
Reward+3.373
doc+3.252
 rewarded+3.237
ellen+3.169
llan+3.102
Neg logit contributions
gencies-3.350
rang-3.221
ía-3.130
oche-3.057
acter-2.978
Token.
Token ID13
Feature activation+2.83e+00

Pos logit contributions
Reward+6.848
 rewarded+6.711
doc+6.703
ellen+6.413
 prisoner+6.282
Neg logit contributions
gencies-8.646
rang-8.253
ía-7.907
acter-7.765
oche-7.722
Token
Token ID198
Feature activation+3.48e+00

Pos logit contributions
gencies+1.993
rang+1.919
ía+1.843
oche+1.823
mun+1.756
Neg logit contributions
Reward-2.318
 rewarded-2.229
doc-2.202
ellen-2.200
llan-2.146
Token
Token ID198
Feature activation+1.39e+00

Pos logit contributions
gencies+2.286
rang+2.258
ía+2.170
oche+2.128
mun+2.057
Neg logit contributions
Reward-2.935
 rewarded-2.880
ellen-2.808
doc-2.787
 recru-2.776
Token
Token ID447
Feature activation+1.43e+00

Pos logit contributions
gencies+0.968
rang+0.932
ía+0.896
oche+0.882
mun+0.849
Neg logit contributions
Reward-1.155
 rewarded-1.120
doc-1.108
ellen-1.094
 recru-1.070
Token
Token ID250
Feature activation-2.30e+00

Pos logit contributions
gencies+1.035
rang+0.994
ía+0.961
oche+0.941
acter+0.903
Neg logit contributions
Reward-1.152
doc-1.105
 rewarded-1.104
ellen-1.089
llan-1.061
TokenHe
Token ID1544
Feature activation-2.94e+00

Pos logit contributions
Reward+1.996
doc+1.922
 rewarded+1.891
ellen+1.869
llan+1.828
Neg logit contributions
gencies-1.542
rang-1.475
ía-1.419
oche-1.393
acter-1.338
Token should
Token ID815
Feature activation-5.74e+00

Pos logit contributions
Reward+2.105
 rewarded+2.052
doc+2.026
ellen+1.978
llan+1.939
Neg logit contributions
gencies-2.376
rang-2.275
ía-2.192
oche-2.170
acter-2.110
Token not
Token ID407
Feature activation+1.02e+00

Pos logit contributions
Reward+3.676
 rewarded+3.585
doc+3.521
ellen+3.393
 recru+3.355
Neg logit contributions
gencies-4.968
rang-4.768
ía-4.591
oche-4.560
acter-4.424
Token when
Token ID618
Feature activation+2.82e+00

Pos logit contributions
gencies+3.273
rang+3.184
ía+3.066
oche+3.036
mun+2.924
Neg logit contributions
Reward-3.619
doc-3.395
 rewarded-3.372
ellen-3.367
llan-3.268
Token it
Token ID340
Feature activation-4.10e+00

Pos logit contributions
gencies+2.000
rang+1.924
ía+1.863
oche+1.830
mun+1.752
Neg logit contributions
Reward-2.316
doc-2.202
 rewarded-2.179
ellen-2.178
llan-2.115
Token comes
Token ID2058
Feature activation-3.70e+00

Pos logit contributions
Reward+2.266
 rewarded+2.245
doc+2.190
ellen+2.112
llan+2.091
Neg logit contributions
gencies-3.934
rang-3.794
ía-3.683
oche-3.635
acement-3.532
Token to
Token ID284
Feature activation+1.15e+01

Pos logit contributions
Reward+2.843
 rewarded+2.762
doc+2.745
ellen+2.665
llan+2.623
Neg logit contributions
gencies-2.770
rang-2.698
ía-2.573
oche-2.546
mun-2.434
Token your
Token ID534
Feature activation+1.47e+01

Pos logit contributions
gencies+8.110
rang+7.582
ía+7.578
oche+7.347
 Sanct+7.230
Neg logit contributions
Reward-8.480
ellen-8.193
doc-7.923
llan-7.717
 rewarded-7.632
Token Even
Token ID3412
Feature activation+8.06e+00

Pos logit contributions
gencies+10.139
rang+9.678
ía+9.577
 Inspection+9.144
 Sanct+9.045
Neg logit contributions
Reward-10.580
ellen-10.270
doc-9.652
llan-9.464
 rewarded-9.442
Token though
Token ID996
Feature activation+8.35e+00

Pos logit contributions
gencies+5.690
rang+5.569
ía+5.303
mun+5.211
oche+5.155
Neg logit contributions
Reward-6.386
 rewarded-5.971
doc-5.957
ellen-5.953
llan-5.801
Token the
Token ID262
Feature activation+9.64e+00

Pos logit contributions
gencies+5.630
rang+5.437
ía+5.264
oche+5.145
mun+5.037
Neg logit contributions
Reward-6.774
ellen-6.333
doc-6.307
 rewarded-6.283
llan-6.112
Token president
Token ID1893
Feature activation-2.02e+00

Pos logit contributions
gencies+7.027
rang+6.677
ía+6.542
oche+6.294
 Sanct+6.270
Neg logit contributions
Reward-7.188
ellen-6.783
doc-6.735
 rewarded-6.734
llan-6.649
Token didn
Token ID1422
Feature activation+6.95e-02

Pos logit contributions
Reward+1.488
 rewarded+1.443
doc+1.433
ellen+1.398
llan+1.365
Neg logit contributions
gencies-1.600
rang-1.542
ía-1.474
oche-1.464
acter-1.412
Token't
Token ID470
Feature activation+3.83e+00

Pos logit contributions
gencies+0.061
rang+0.059
ía+0.057
oche+0.056
mun+0.054
Neg logit contributions
Reward-0.046
doc-0.044
 rewarded-0.044
ellen-0.043
llan-0.041
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
gencies+3.045
rang+2.965
ía+2.840
oche+2.818
mun+2.694
Neg logit contributions
Reward-3.616
doc-3.421
 rewarded-3.382
ellen-3.359
llan-3.287
Token wings
Token ID12098
Feature activation-6.10e-01

Pos logit contributions
gencies+0.880
rang+0.850
ía+0.822
oche+0.810
mun+0.778
Neg logit contributions
Reward-0.855
doc-0.815
 rewarded-0.811
ellen-0.799
llan-0.779
Token...
Token ID986
Feature activation-6.60e+00

Pos logit contributions
Reward+0.506
 rewarded+0.488
doc+0.488
ellen+0.478
llan+0.468
Neg logit contributions
gencies-0.430
rang-0.412
ía-0.397
oche-0.390
acter-0.373
Tokenit
Token ID270
Feature activation-1.43e+00

Pos logit contributions
Reward+5.564
doc+5.444
 rewarded+5.352
ellen+5.297
llan+5.154
Neg logit contributions
gencies-4.349
rang-4.103
ía-3.999
oche-3.925
acter-3.755
Token's
Token ID338
Feature activation+9.42e-01

Pos logit contributions
Reward+1.080
 rewarded+1.051
doc+1.040
ellen+1.020
llan+0.994
Neg logit contributions
gencies-1.106
rang-1.062
ía-1.024
oche-1.008
acter-0.972
Token on
Token ID319
Feature activation-1.19e+00

Pos logit contributions
gencies+0.703
rang+0.677
ía+0.655
oche+0.645
mun+0.618
Neg logit contributions
Reward-0.742
doc-0.709
 rewarded-0.705
ellen-0.698
llan-0.679
Token the
Token ID262
Feature activation-2.75e+00

Pos logit contributions
Reward+0.918
 rewarded+0.882
doc+0.882
ellen+0.861
llan+0.840
Neg logit contributions
gencies-0.900
rang-0.867
ía-0.838
oche-0.824
mun-0.791
Token takeoff
Token ID45306
Feature activation-1.06e+01

Pos logit contributions
Reward+1.967
 rewarded+1.893
doc+1.885
ellen+1.841
 recru+1.800
Neg logit contributions
gencies-2.240
rang-2.168
ía-2.101
oche-2.072
mun-1.993
Token checklist
Token ID41859
Feature activation-1.33e+01

Pos logit contributions
Reward+6.318
 rewarded+6.176
doc+6.027
 trooper+5.828
 recru+5.828
Neg logit contributions
gencies-9.233
rang-8.595
ía-8.572
oche-8.460
mun-8.143
Token!
Token ID0
Feature activation-6.01e+00

Pos logit contributions
 rewarded+8.646
doc+8.418
Reward+8.330
ellen+8.158
 recru+8.011
Neg logit contributions
gencies-10.230
rang-9.662
ía-9.585
oche-9.295
ukemia-9.065
Token Just
Token ID2329
Feature activation-3.83e+00

Pos logit contributions
Reward+4.323
doc+4.226
 rewarded+4.113
ellen+4.000
llan+3.955
Neg logit contributions
gencies-4.830
rang-4.582
ía-4.441
oche-4.364
acter-4.294
Token retaining
Token ID26645
Feature activation-5.54e+00

Pos logit contributions
 rewarded+6.373
Reward+6.201
doc+5.999
 recru+5.996
ellen+5.817
Neg logit contributions
gencies-7.904
rang-7.613
ía-7.293
oche-7.134
ukemia-7.056
Token workers
Token ID3259
Feature activation-5.60e+00

Pos logit contributions
Reward+4.202
 rewarded+4.181
doc+4.075
 recru+3.984
ellen+3.936
Neg logit contributions
gencies-4.193
rang-4.018
oche-3.870
ía-3.855
acter-3.662
Token that
Token ID326
Feature activation-6.58e+00

Pos logit contributions
 rewarded+4.377
Reward+4.290
doc+4.185
ellen+4.117
 recru+4.103
Neg logit contributions
gencies-4.170
rang-3.958
ía-3.763
oche-3.745
acter-3.633
Token are
Token ID389
Feature activation-5.93e+00

Pos logit contributions
Reward+4.465
 rewarded+4.428
doc+4.205
ellen+4.158
 recru+4.122
Neg logit contributions
gencies-5.458
rang-5.185
ía-5.050
oche-5.025
mun-4.801
Token lured
Token ID41546
Feature activation-1.77e+01

Pos logit contributions
 rewarded+4.149
Reward+4.081
doc+3.906
 recru+3.842
ellen+3.778
Neg logit contributions
gencies-4.911
rang-4.678
ía-4.524
oche-4.493
acter-4.326
Token out
Token ID503
Feature activation-3.40e+00

Pos logit contributions
 rewarded+10.105
 recru+9.571
 prisoner+9.454
doc+9.121
 uncond+9.034
Neg logit contributions
gencies-14.724
rang-13.820
ía-13.625
oche-13.421
acter-13.049
Token of
Token ID286
Feature activation-8.84e+00

Pos logit contributions
Reward+2.578
 rewarded+2.500
doc+2.497
ellen+2.427
 recru+2.391
Neg logit contributions
gencies-2.595
rang-2.481
ía-2.421
oche-2.384
acter-2.305
Token province
Token ID8473
Feature activation-1.01e+01

Pos logit contributions
Reward+6.442
 rewarded+6.410
doc+6.343
 recru+6.171
 prisoner+6.054
Neg logit contributions
gencies-6.516
rang-6.231
oche-6.093
ía-6.029
acter-5.839
Token by
Token ID416
Feature activation-2.34e+01

Pos logit contributions
 rewarded+6.962
Reward+6.839
doc+6.698
 recru+6.584
llan+6.436
Neg logit contributions
gencies-7.829
rang-7.541
oche-7.289
ía-7.241
acter-7.150
Token even
Token ID772
Feature activation-8.08e+00

Pos logit contributions
 rewarded+14.508
 recru+14.507
 prisoner+13.722
Reward+13.608
 rewards+13.482
Neg logit contributions
gencies-14.713
rang-14.376
ía-14.360
oche-14.249
acter-13.821
Token better
Token ID1365
Feature activation-1.41e+01

Pos logit contributions
Reward+5.632
 rewarded+5.614
doc+5.440
 recru+5.348
ellen+5.216
Neg logit contributions
gencies-6.404
rang-6.067
ía-5.945
oche-5.897
acter-5.690
Token without
Token ID1231
Feature activation+2.28e-01

Pos logit contributions
gencies+0.541
rang+0.522
ía+0.504
oche+0.494
mun+0.473
Neg logit contributions
Reward-0.622
doc-0.595
 rewarded-0.593
ellen-0.584
llan-0.570
Token jeopard
Token ID22451
Feature activation+1.26e+01

Pos logit contributions
gencies+0.187
rang+0.180
ía+0.175
oche+0.172
mun+0.166
Neg logit contributions
Reward-0.163
doc-0.155
 rewarded-0.155
ellen-0.152
llan-0.148
Tokenizing
Token ID2890
Feature activation+1.61e+01

Pos logit contributions
gencies+4.001
ía+3.933
rang+3.574
oche+3.281
itar+3.210
Neg logit contributions
Reward-14.105
ellen-13.844
 rewarded-13.642
doc-13.551
llan-13.486
Token the
Token ID262
Feature activation+9.65e+00

Pos logit contributions
gencies+8.930
ía+8.603
rang+8.537
oche+8.150
 Sanct+8.074
Neg logit contributions
ellen-12.866
Reward-12.662
doc-12.242
 rewarded-12.014
 recru-11.661
Token launch
Token ID4219
Feature activation+1.37e+01

Pos logit contributions
gencies+6.843
rang+6.554
ía+6.424
oche+6.212
 Symphony+6.054
Neg logit contributions
Reward-7.284
ellen-7.094
doc-6.908
 rewarded-6.843
llan-6.631
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
gencies+8.870
rang+8.866
ía+8.665
mun+8.197
itar+8.100
Neg logit contributions
Reward-10.225
ellen-9.828
doc-9.667
 rewarded-9.599
 recru-9.214
Token But
Token ID887
Feature activation-3.17e+00

Pos logit contributions
Reward+0.948
doc+0.909
 rewarded+0.903
ellen+0.883
llan+0.865
Neg logit contributions
gencies-0.973
rang-0.937
ía-0.902
oche-0.890
acter-0.859
Token you
Token ID345
Feature activation-7.80e+00

Pos logit contributions
Reward+2.459
 rewarded+2.392
doc+2.360
ellen+2.303
llan+2.273
Neg logit contributions
gencies-2.381
rang-2.284
ía-2.192
oche-2.168
acter-2.093
Token have
Token ID423
Feature activation-2.08e+00

Pos logit contributions
 rewarded+5.172
Reward+4.989
doc+4.813
llan+4.708
ellen+4.661
Neg logit contributions
gencies-6.544
rang-6.185
oche-6.001
ía-5.971
acter-5.872
Token saved
Token ID7448
Feature activation+4.29e+00

Pos logit contributions
Reward+1.498
 rewarded+1.457
doc+1.428
ellen+1.395
llan+1.375
Neg logit contributions
gencies-1.690
rang-1.625
ía-1.568
oche-1.559
mun-1.499
Token my
Token ID616
Feature activation-8.63e-01

Pos logit contributions
gencies+3.051
rang+2.952
ía+2.890
oche+2.829
mun+2.687
Neg logit contributions
Reward-3.460
doc-3.302
ellen-3.263
 rewarded-3.251
llan-3.139
Token use
Token ID779
Feature activation-7.37e+00

Pos logit contributions
gencies+3.803
rang+3.617
ía+3.546
oche+3.489
acter+3.397
Neg logit contributions
Reward-4.005
ellen-3.728
doc-3.718
 rewarded-3.682
llan-3.588
Token the
Token ID262
Feature activation-1.03e+01

Pos logit contributions
Reward+5.490
doc+5.447
 rewarded+5.383
ellen+5.161
 recru+5.101
Neg logit contributions
gencies-5.470
rang-5.297
ía-5.130
oche-4.965
mun-4.838
Token menus
Token ID26798
Feature activation-3.81e+00

Pos logit contributions
doc+7.172
 rewarded+7.139
Reward+7.097
 recru+6.829
ellen+6.822
Neg logit contributions
gencies-7.684
rang-7.430
ía-7.145
oche-7.093
mun-6.810
Token,
Token ID11
Feature activation+7.21e-01

Pos logit contributions
Reward+2.923
 rewarded+2.872
doc+2.867
ellen+2.811
llan+2.727
Neg logit contributions
gencies-2.859
rang-2.715
ía-2.618
oche-2.596
mun-2.488
Token however
Token ID2158
Feature activation+4.21e+00

Pos logit contributions
gencies+0.515
rang+0.495
ía+0.477
oche+0.468
mun+0.449
Neg logit contributions
Reward-0.593
doc-0.567
 rewarded-0.566
ellen-0.557
llan-0.545
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
gencies+2.826
rang+2.708
ía+2.603
oche+2.548
mun+2.448
Neg logit contributions
Reward-3.602
 rewarded-3.415
doc-3.413
ellen-3.360
llan-3.300
Token\n
Token ID198
Feature activation+5.21e+00

Pos logit contributions
gencies+0.113
rang+0.109
ía+0.105
oche+0.104
mun+0.099
Neg logit contributions
Reward-0.117
doc-0.112
 rewarded-0.112
ellen-0.110
llan-0.107
Token\n
Token ID198
Feature activation+1.43e-01

Pos logit contributions
gencies+3.332
rang+3.287
ía+3.173
oche+3.087
mun+2.990
Neg logit contributions
 rewarded-4.368
Reward-4.320
ellen-4.198
 recru-4.180
doc-4.157
TokenFour
Token ID15137
Feature activation-7.77e+00

Pos logit contributions
gencies+0.104
rang+0.100
ía+0.097
oche+0.095
mun+0.091
Neg logit contributions
Reward-0.115
doc-0.110
 rewarded-0.110
ellen-0.108
llan-0.106
Token years
Token ID812
Feature activation+1.05e+01

Pos logit contributions
Reward+5.545
doc+5.480
 rewarded+5.374
ellen+5.308
llan+5.153
Neg logit contributions
gencies-5.903
rang-5.669
ía-5.483
oche-5.358
acement-5.226
Token ago
Token ID2084
Feature activation-2.07e+00

Pos logit contributions
gencies+6.652
rang+6.466
ía+6.374
oche+6.040
mun+6.033
Neg logit contributions
Reward-8.545
doc-8.123
ellen-8.036
 rewarded-7.835
llan-7.740
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
gencies+0.511
rang+0.492
ía+0.474
oche+0.467
mun+0.447
Neg logit contributions
Reward-0.579
doc-0.555
 rewarded-0.553
ellen-0.545
llan-0.533
Token analysis
Token ID3781
Feature activation-5.84e+00

Pos logit contributions
Reward+1.362
 rewarded+1.311
doc+1.302
ellen+1.275
llan+1.236
Neg logit contributions
gencies-1.598
rang-1.537
ía-1.491
oche-1.466
mun-1.419
Token page
Token ID2443
Feature activation-3.03e+00

Pos logit contributions
Reward+4.144
 rewarded+4.118
doc+4.020
ellen+3.885
 recru+3.802
Neg logit contributions
gencies-4.703
rang-4.468
ía-4.323
oche-4.278
acement-4.096
Token.
Token ID13
Feature activation+3.61e+00

Pos logit contributions
Reward+2.362
 rewarded+2.314
doc+2.293
ellen+2.248
llan+2.190
Neg logit contributions
gencies-2.269
rang-2.177
ía-2.088
oche-2.043
acter-1.984
Token It
Token ID632
Feature activation-1.05e+01

Pos logit contributions
gencies+2.592
rang+2.504
ía+2.430
oche+2.387
 Inspection+2.318
Neg logit contributions
Reward-2.859
 rewarded-2.732
ellen-2.714
doc-2.709
 recru-2.643
Token's
Token ID338
Feature activation-7.60e+00

Pos logit contributions
 rewarded+6.293
doc+5.952
Reward+5.738
llan+5.681
 recru+5.675
Neg logit contributions
gencies-9.161
rang-8.809
ía-8.466
ukemia-8.273
acement-8.170
Token even
Token ID772
Feature activation-9.62e+00

Pos logit contributions
 rewarded+5.463
Reward+5.326
doc+5.226
 recru+5.068
ellen+5.010
Neg logit contributions
gencies-6.019
rang-5.788
ía-5.588
oche-5.427
acement-5.294
Token possible
Token ID1744
Feature activation+4.39e-01

Pos logit contributions
 rewarded+5.966
Reward+5.537
doc+5.507
 recru+5.473
ellen+5.299
Neg logit contributions
gencies-8.472
rang-8.163
ía-7.889
oche-7.820
acement-7.662
Token to
Token ID284
Feature activation+1.30e+01

Pos logit contributions
gencies+0.307
rang+0.296
ía+0.285
oche+0.280
mun+0.268
Neg logit contributions
Reward-0.367
doc-0.352
 rewarded-0.351
ellen-0.345
llan-0.337
Token have
Token ID423
Feature activation+5.36e+00

Pos logit contributions
gencies+9.337
rang+8.752
oche+8.638
ía+8.482
mun+8.451
Neg logit contributions
Reward-9.047
ellen-8.528
doc-8.505
llan-8.203
 rewarded-7.827
Token nested
Token ID28376
Feature activation+8.63e+00

Pos logit contributions
gencies+3.956
rang+3.892
ía+3.712
oche+3.711
acter+3.589
Neg logit contributions
Reward-4.097
doc-3.875
ellen-3.848
 rewarded-3.769
llan-3.722
Token image
Token ID2939
Feature activation-1.30e+01

Pos logit contributions
 rewarded+7.135
Reward+6.946
doc+6.903
 recru+6.634
ellen+6.589
Neg logit contributions
gencies-10.271
rang-9.578
oche-9.462
ía-9.425
ukemia-9.131
Token though
Token ID996
Feature activation-9.43e+00

Pos logit contributions
 rewarded+8.801
Reward+8.652
doc+8.602
ellen+8.593
llan+8.413
Neg logit contributions
gencies-9.696
rang-8.958
ía-8.883
oche-8.687
ukemia-8.226
Token if
Token ID611
Feature activation-1.31e+01

Pos logit contributions
Reward+6.714
 rewarded+6.605
doc+6.567
ellen+6.482
llan+6.326
Neg logit contributions
gencies-7.150
rang-6.706
ía-6.626
oche-6.440
acter-6.193
Token you
Token ID345
Feature activation-1.07e+01

Pos logit contributions
 rewarded+8.601
doc+8.374
Reward+8.293
ellen+8.138
llan+7.878
Neg logit contributions
gencies-10.159
rang-9.452
ía-9.350
oche-9.169
mortem-8.874
Token stick
Token ID4859
Feature activation-1.38e+01

Pos logit contributions
 rewarded+6.904
doc+6.605
Reward+6.561
ellen+6.343
llan+6.323
Neg logit contributions
gencies-8.801
rang-8.248
oche-8.147
ía-8.130
ukemia-7.867
Token your
Token ID534
Feature activation-1.87e+01

Pos logit contributions
doc+8.954
 rewarded+8.900
Reward+8.538
 recru+8.274
ellen+8.270
Neg logit contributions
gencies-10.916
rang-9.883
ía-9.834
oche-9.751
iferation-9.441
Token face
Token ID1986
Feature activation-1.05e+01

Pos logit contributions
doc+10.774
 rewarded+10.476
Reward+10.108
 recru+10.030
 trooper+9.859
Neg logit contributions
gencies-14.822
oche-13.476
ía-13.418
iferation-13.192
rang-13.183
Token close
Token ID1969
Feature activation-5.32e+00

Pos logit contributions
 rewarded+6.911
doc+6.882
Reward+6.766
llan+6.624
ellen+6.560
Neg logit contributions
gencies-8.517
rang-8.055
ía-7.749
oche-7.621
iferation-7.289
Token enough
Token ID1576
Feature activation-5.15e+00

Pos logit contributions
Reward+3.903
doc+3.839
 rewarded+3.834
ellen+3.741
llan+3.675
Neg logit contributions
gencies-4.110
rang-3.910
ía-3.801
oche-3.763
mun-3.564
Token to
Token ID284
Feature activation-8.23e+00

Pos logit contributions
Reward+3.931
 rewarded+3.873
doc+3.833
ellen+3.739
llan+3.696
Neg logit contributions
gencies-3.834
rang-3.630
ía-3.543
oche-3.515
acter-3.309
Token your
Token ID534
Feature activation-1.51e+01

Pos logit contributions
Reward+5.906
 rewarded+5.834
doc+5.821
ellen+5.536
llan+5.509
Neg logit contributions
gencies-6.325
rang-6.031
ía-5.867
oche-5.732
mortem-5.451
Token passes
Token ID8318
Feature activation-6.56e+00

Pos logit contributions
Reward+5.568
 rewarded+5.479
doc+5.375
ellen+5.242
llan+5.237
Neg logit contributions
gencies-5.720
rang-5.537
ía-5.303
acter-5.141
oche-5.116
Token with
Token ID351
Feature activation-1.35e+01

Pos logit contributions
 rewarded+4.782
Reward+4.755
doc+4.643
ellen+4.544
llan+4.491
Neg logit contributions
gencies-5.054
rang-4.799
ía-4.629
oche-4.498
acter-4.491
Token heavy
Token ID4334
Feature activation-1.11e+01

Pos logit contributions
 rewarded+9.605
Reward+9.591
 trooper+9.225
doc+9.224
 recru+9.053
Neg logit contributions
gencies-9.438
rang-9.081
ía-8.793
oche-8.529
acter-8.426
Token loads
Token ID15989
Feature activation-5.83e+00

Pos logit contributions
Reward+7.388
 rewarded+7.373
doc+7.160
 trooper+7.147
 recru+6.971
Neg logit contributions
gencies-8.739
rang-8.372
ía-8.178
oche-7.822
mun-7.674
Token,
Token ID11
Feature activation-1.41e+01

Pos logit contributions
 rewarded+4.322
Reward+4.321
doc+4.165
ellen+4.112
llan+4.038
Neg logit contributions
gencies-4.424
rang-4.244
ía-4.073
oche-4.015
acter-3.913
Token through
Token ID832
Feature activation-1.91e+01

Pos logit contributions
 rewarded+9.826
Reward+9.546
doc+9.302
 trooper+9.136
 recru+9.060
Neg logit contributions
gencies-10.036
rang-9.651
ía-9.216
oche-8.881
acter-8.865
Token severe
Token ID6049
Feature activation-1.77e+01

Pos logit contributions
 rewarded+12.713
Reward+12.445
doc+12.253
llan+11.881
 recru+11.875
Neg logit contributions
gencies-12.470
rang-11.995
ía-11.500
oche-11.140
acter-10.982
Token weather
Token ID6193
Feature activation-3.73e+00

Pos logit contributions
 rewarded+11.190
Reward+10.838
 trooper+10.801
 recru+10.689
 prisoner+10.623
Neg logit contributions
gencies-12.490
rang-11.962
ía-11.266
oche-11.199
mun-11.112
Token,
Token ID11
Feature activation-1.78e+01

Pos logit contributions
Reward+2.863
 rewarded+2.785
doc+2.762
ellen+2.684
llan+2.637
Neg logit contributions
gencies-2.776
rang-2.685
ía-2.588
oche-2.573
acter-2.483
Token and
Token ID290
Feature activation-1.61e+01

Pos logit contributions
 rewarded+11.721
Reward+11.045
doc+10.971
 prisoner+10.970
 trooper+10.797
Neg logit contributions
gencies-12.148
rang-11.804
ía-11.126
oche-10.984
acter-10.962
Token ever
Token ID1683
Feature activation-6.54e+00

Pos logit contributions
 rewarded+10.888
Reward+10.271
doc+9.860
 prisoner+9.804
 recru+9.771
Neg logit contributions
gencies-11.799
rang-11.324
ía-10.483
oche-10.472
acter-10.394
Token external
Token ID7097
Feature activation-5.00e+00

Pos logit contributions
gencies+3.019
rang+2.939
ía+2.874
oche+2.822
acter+2.724
Neg logit contributions
Reward-2.791
doc-2.641
 rewarded-2.609
ellen-2.605
llan-2.588
Token process
Token ID1429
Feature activation-1.96e+00

Pos logit contributions
Reward+3.404
 rewarded+3.308
doc+3.246
 recru+3.172
ellen+3.127
Neg logit contributions
gencies-4.234
rang-4.007
ía-3.893
oche-3.863
mun-3.732
Tokener
Token ID263
Feature activation+1.41e+00

Pos logit contributions
Reward+1.530
 rewarded+1.478
doc+1.477
ellen+1.443
llan+1.403
Neg logit contributions
gencies-1.469
rang-1.396
ía-1.349
oche-1.330
mun-1.277
Token that
Token ID326
Feature activation+9.01e-01

Pos logit contributions
gencies+1.008
rang+0.977
ía+0.945
oche+0.925
mun+0.884
Neg logit contributions
Reward-1.148
doc-1.097
 rewarded-1.091
ellen-1.077
llan-1.050
Token will
Token ID481
Feature activation-4.88e-01

Pos logit contributions
gencies+0.735
rang+0.711
ía+0.689
oche+0.679
mun+0.654
Neg logit contributions
Reward-0.648
doc-0.615
 rewarded-0.612
ellen-0.604
llan-0.586
Token convert
Token ID10385
Feature activation-2.06e+01

Pos logit contributions
Reward+0.346
 rewarded+0.331
doc+0.331
ellen+0.323
llan+0.315
Neg logit contributions
gencies-0.402
rang-0.390
ía-0.377
oche-0.371
mun-0.357
Token the
Token ID262
Feature activation-1.42e+01

Pos logit contributions
 rewarded+11.793
doc+11.188
Reward+10.690
 prisoner+10.668
 trooper+10.324
Neg logit contributions
gencies-15.651
rang-14.401
iferation-13.929
acement-13.899
ía-13.820
Token brain
Token ID3632
Feature activation-1.24e+01

Pos logit contributions
 rewarded+9.019
 prisoner+8.347
doc+8.330
Reward+8.298
 recru+8.169
Neg logit contributions
gencies-11.298
rang-10.672
oche-10.579
ía-10.150
ukemia-10.128
Token signals
Token ID10425
Feature activation-2.49e+00

Pos logit contributions
 rewarded+8.519
doc+8.494
Reward+8.440
ellen+8.070
 prisoner+7.921
Neg logit contributions
gencies-9.202
rang-8.501
oche-8.117
ía-7.950
ateurs-7.841
Token into
Token ID656
Feature activation-1.91e+01

Pos logit contributions
Reward+1.872
 rewarded+1.831
doc+1.810
ellen+1.765
llan+1.729
Neg logit contributions
gencies-1.942
rang-1.855
ía-1.789
oche-1.768
acter-1.704
Token synthes
Token ID24983
Feature activation-3.20e+00

Pos logit contributions
 rewarded+11.793
doc+11.194
Reward+10.796
 prisoner+10.595
 trooper+10.579
Neg logit contributions
gencies-14.375
rang-13.371
ía-12.708
oche-12.667
iferation-12.394
Token hard
Token ID1327
Feature activation-1.29e+01

Pos logit contributions
Reward+7.921
 rewarded+7.863
doc+7.842
llan+7.388
ellen+7.362
Neg logit contributions
gencies-9.025
rang-8.588
ía-8.529
acter-8.215
acement-8.159
Token to
Token ID284
Feature activation-1.03e+01

Pos logit contributions
doc+9.181
Reward+9.034
 rewarded+8.875
ellen+8.657
llan+8.653
Neg logit contributions
gencies-9.368
rang-8.643
ía-8.364
oche-8.199
ateurs-8.099
Token get
Token ID651
Feature activation-9.13e+00

Pos logit contributions
Reward+7.748
doc+7.571
 rewarded+7.482
llan+7.211
 prisoner+7.208
Neg logit contributions
rang-7.253
gencies-7.181
ía-6.658
oche-6.614
acement-6.577
Token to
Token ID284
Feature activation-5.18e+00

Pos logit contributions
Reward+6.524
doc+6.403
 rewarded+6.367
llan+6.243
ellen+6.047
Neg logit contributions
gencies-6.952
rang-6.671
oche-6.451
ía-6.340
acter-6.233
Token except
Token ID2845
Feature activation-6.05e+00

Pos logit contributions
Reward+3.977
 rewarded+3.824
doc+3.817
llan+3.689
ellen+3.671
Neg logit contributions
gencies-3.891
rang-3.741
ía-3.637
oche-3.575
acter-3.475
Token by
Token ID416
Feature activation-2.43e+01

Pos logit contributions
Reward+4.399
 rewarded+4.334
doc+4.290
llan+4.153
ellen+4.123
Neg logit contributions
gencies-4.701
rang-4.499
ía-4.350
oche-4.274
acter-4.138
Token water
Token ID1660
Feature activation-2.42e+00

Pos logit contributions
 rewarded+13.162
 hiking+12.400
Reward+12.255
 prisoner+12.152
 recru+12.122
Neg logit contributions
gencies-17.174
rang-16.582
acter-16.263
oche-16.224
ukemia-16.218
Token,
Token ID11
Feature activation-7.13e+00

Pos logit contributions
Reward+1.865
 rewarded+1.791
doc+1.787
ellen+1.742
llan+1.720
Neg logit contributions
gencies-1.839
rang-1.774
ía-1.706
oche-1.690
acter-1.634
Token the
Token ID262
Feature activation-8.37e+00

Pos logit contributions
Reward+5.013
 rewarded+4.928
doc+4.908
llan+4.725
ellen+4.668
Neg logit contributions
gencies-5.587
rang-5.358
ía-5.275
oche-5.126
acter-5.013
Token Sund
Token ID3309
Feature activation+2.14e+00

Pos logit contributions
Reward+5.112
 rewarded+4.993
doc+4.934
llan+4.733
ellen+4.723
Neg logit contributions
gencies-7.214
rang-6.948
ía-6.834
oche-6.703
acter-6.544
Tokenarb
Token ID38039
Feature activation-2.41e+00

Pos logit contributions
gencies+1.467
rang+1.426
ía+1.365
oche+1.339
mun+1.281
Neg logit contributions
Reward-1.825
 rewarded-1.756
doc-1.739
ellen-1.700
 recru-1.682
Token aka
Token ID22430
Feature activation-7.97e+00

Pos logit contributions
 rewarded+5.656
Reward+5.645
doc+5.609
ellen+5.384
llan+5.284
Neg logit contributions
gencies-5.777
rang-5.470
ía-5.313
oche-5.191
acement-5.078
Token snaps
Token ID23429
Feature activation-1.34e+01

Pos logit contributions
Reward+5.859
 rewarded+5.780
doc+5.760
ellen+5.502
 recru+5.417
Neg logit contributions
gencies-6.047
rang-5.714
ía-5.497
oche-5.390
acter-5.277
Token).
Token ID737
Feature activation-7.48e+00

Pos logit contributions
doc+10.591
 rewarded+10.118
Reward+10.097
ellen+10.075
llan+9.832
Neg logit contributions
gencies-8.698
rang-7.836
ía-7.585
 Inspection-7.264
acter-7.246
Token These
Token ID2312
Feature activation-2.19e+01

Pos logit contributions
Reward+5.291
doc+5.282
 rewarded+5.060
ellen+4.932
llan+4.826
Neg logit contributions
gencies-6.013
rang-5.661
ía-5.479
oche-5.447
acter-5.320
Token two
Token ID734
Feature activation-1.90e+01

Pos logit contributions
 rewarded+11.784
doc+10.973
 recru+10.682
 backed+10.453
 endorsed+10.424
Neg logit contributions
gencies-16.294
ía-15.641
oche-15.264
rang-15.191
acter-14.816
Token new
Token ID649
Feature activation-1.88e+01

Pos logit contributions
 rewarded+11.088
doc+10.548
Reward+10.076
 recru+10.013
 backed+9.928
Neg logit contributions
gencies-14.494
rang-13.549
ía-13.501
oche-13.138
acter-12.972
Token methods
Token ID5050
Feature activation-9.98e+00

Pos logit contributions
 rewarded+10.917
doc+10.827
Reward+10.289
 recru+10.117
 prisoner+10.060
Neg logit contributions
gencies-14.047
rang-13.169
ía-13.098
oche-13.001
acter-12.729
Token of
Token ID286
Feature activation-1.14e+01

Pos logit contributions
 rewarded+6.441
doc+6.192
Reward+6.150
ellen+5.995
llan+5.953
Neg logit contributions
gencies-8.288
rang-7.739
ía-7.639
oche-7.534
ukemia-7.320
Token packaging
Token ID16846
Feature activation-1.29e+01

Pos logit contributions
 rewarded+7.720
Reward+7.408
doc+7.378
 recru+7.097
 prisoner+6.984
Neg logit contributions
gencies-8.806
rang-8.587
ía-8.222
oche-8.134
ukemia-7.988
Token software
Token ID3788
Feature activation-1.25e+00

Pos logit contributions
 rewarded+8.477
doc+8.120
Reward+7.961
 recru+7.633
 prisoner+7.526
Neg logit contributions
gencies-10.194
rang-9.543
oche-9.445
ía-9.398
iferation-9.130
Token have
Token ID423
Feature activation-6.70e+00

Pos logit contributions
Reward+0.892
 rewarded+0.860
doc+0.858
ellen+0.834
llan+0.814
Neg logit contributions
gencies-1.018
rang-0.979
ía-0.952
oche-0.938
acter-0.902