TOP ACTIVATIONS
MAX = 16.891

Figure 2 above ) that need to be finely configured and
. 1 . Sa 'd dismissed the pleas of
the vision of the founder remains in full view over one
poultry meat in chlorine . Less than a year later ,
notes , correctly , that Sa 'd judgment acc ords with
thinks that there is something worth saving in all of these
Harry Potter , need for a boyfriend
is however a matter that needs to be handled by parents
. The petition says that IE 6 has security flaws and
Something needs to happen to these people
last year . Gil bert is known as a
to phase out support for IE 6 . The firm said
's some common questions / mis con ceptions I 've been
Microsoft patched the hole in IE 6 in January , three
kiss , shockingly bad ; Marg ot had trouble believing that
petty - bour ge ois intellig ents ia forged in haste
/ AP ) Gil bert and the ACLU have
black lists . EFF director for international freedom of
[ edit ] Or cs sent by Sar uman
the drop of water ) remain the same throughout a block

MIN ACTIVATIONS
MIN = -19.060

got up to leave and tri pped , falling backwards on
brunch , when Tor i tri pped and fell onto a
Clinton also tried to tri ang ulate on the Planned
gal blo oms , the rapid increase of algae within a
a full Shim ano 600 tri - color group , including
t pay attention to the tri pling of premiums , doubling
is one element of a tri pty ch called Echo that
Type 1 probe test ( tri ple diss ociation | ,
smooth the image using a 3 x 3 Ga ussian ,
F OV even double or 4 x 1080 P resolution you
focusing problem . Tri angular segregation Act
but had not done so widely . O uts
coach , A du says . A du 's
a concern though since most of the time I don 't
. Despite her conc ili atory tone , Merkel
com / j Ax 6 c U xi UG tele
ized nipples to match the tri - color group and vintage
. Jim Bern au , CEO of
t . co / ni WB qv fi W u
Image credit : Cop Richard , royalty free <|endoftext|>

INTERVAL 7.903 - 16.891
CONTAINS 1.032%

Court System . The accusation claims that named people couldn
Church . I feel like I m
from his new brother - in - law , E ad
They don t need facts to unleash their volcanic
2 loss to the Philadelphia Flyers at United Center on Wednesday

INTERVAL -1.085 - 7.903
CONTAINS 62.978%

state . BE AG had knocked at the high court
not apologized to Crow der . The Celtics '
Rus hers So there s a group
many years . Parents who complain they were unaware that their
On 29 April , G raz iani surrendered and was present

INTERVAL -10.073 - -1.085
CONTAINS 35.846%

, the No . 41 pick in the 2015 draft (
seen pictures from them and obviously find them disturbing ,
cl ojure | 1 . 5 . 1 | 4 251
press are entirely absent . Human rights don 't apply .
recently said a fair price would be nearer $ 40 a

INTERVAL -19.060 - -10.073
CONTAINS 0.145%

see also Trump hints he taped Comey conversations in veiled threat
J . ( 2010 ). Cl ueless or powerful ? Ident
is an interview with one of the largest anarchist groups in
. The Dram atically Different Attention Economy Of The
almond flour due to its higher protein content . Also has
Token Figure
Token ID11291
Feature activation-1.60e+00

Pos logit contributions
 Crossing+1.311
zees+1.271
 Lama+1.179
eding+1.139
 Ski+1.115
Neg logit contributions
Dialogue-1.766
ARI-1.757
FTWARE-1.756
mented-1.737
 secretaries-1.705
Token 2
Token ID362
Feature activation-9.87e-01

Pos logit contributions
 Crossing+1.331
zees+1.306
 Lama+1.232
eding+1.202
 Ski+1.180
Neg logit contributions
Dialogue-1.030
ARI-1.023
FTWARE-1.019
mented-1.011
 secretaries-0.983
Token above
Token ID2029
Feature activation-1.15e+00

Pos logit contributions
 Crossing+0.780
zees+0.766
 Lama+0.720
eding+0.701
 Ski+0.687
Neg logit contributions
Dialogue-0.679
ARI-0.678
FTWARE-0.669
mented-0.669
 secretaries-0.646
Token)
Token ID8
Feature activation+1.66e+00

Pos logit contributions
 Crossing+0.933
zees+0.915
 Lama+0.863
eding+0.839
 Ski+0.825
Neg logit contributions
ARI-0.761
Dialogue-0.761
FTWARE-0.758
mented-0.750
 secretaries-0.728
Token that
Token ID326
Feature activation+6.01e+00

Pos logit contributions
ARI+1.099
mented+1.088
Dialogue+1.086
FTWARE+1.084
 secretaries+1.057
Neg logit contributions
 Crossing-1.342
zees-1.335
 Lama-1.245
eding-1.202
 Ski-1.178
Token need
Token ID761
Feature activation+1.69e+01

Pos logit contributions
ARI+3.830
 secretaries+3.762
mented+3.758
 elements+3.753
Dialogue+3.752
Neg logit contributions
zees-4.837
 Crossing-4.719
 Lama-4.372
eding-4.234
digy-4.232
Token to
Token ID284
Feature activation+9.41e+00

Pos logit contributions
mented+8.544
ARI+8.267
 elements+8.226
FTWARE+8.019
Dialogue+7.716
Neg logit contributions
zees-14.179
 Crossing-13.410
 Lama-12.972
owler-12.716
digy-12.701
Token be
Token ID307
Feature activation+1.00e+01

Pos logit contributions
mented+5.301
ARI+5.252
 elements+5.243
FTWARE+5.151
Dialogue+5.131
Neg logit contributions
zees-7.865
 Crossing-7.850
 Lama-7.433
owler-7.103
digy-7.047
Token finely
Token ID32566
Feature activation+1.81e-01

Pos logit contributions
mented+6.165
ARI+5.956
 elements+5.904
FTWARE+5.691
 secretaries+5.683
Neg logit contributions
zees-8.053
 Crossing-7.825
 Lama-7.380
owler-7.135
digy-6.969
Token configured
Token ID17839
Feature activation+3.75e+00

Pos logit contributions
ARI+0.118
FTWARE+0.118
Dialogue+0.118
mented+0.118
 secretaries+0.113
Neg logit contributions
 Crossing-0.150
zees-0.147
 Lama-0.138
eding-0.134
 Ski-0.131
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
ARI+2.404
mented+2.387
Dialogue+2.375
FTWARE+2.371
 elements+2.275
Neg logit contributions
zees-3.072
 Crossing-3.047
 Lama-2.835
eding-2.730
owler-2.702
Token.
Token ID13
Feature activation+2.37e+00

Pos logit contributions
 Crossing+2.183
zees+2.132
 Lama+2.002
eding+1.942
 Ski+1.913
Neg logit contributions
Dialogue-2.092
ARI-2.077
FTWARE-2.074
mented-2.054
 secretaries-2.002
Token1
Token ID16
Feature activation-1.67e+00

Pos logit contributions
Dialogue+1.482
ARI+1.482
mented+1.451
FTWARE+1.443
 secretaries+1.380
Neg logit contributions
zees-2.001
 Crossing-1.950
 Lama-1.838
eding-1.815
 Ski-1.753
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
 Crossing+1.310
zees+1.288
 Lama+1.208
eding+1.176
 Ski+1.156
Neg logit contributions
Dialogue-1.164
ARI-1.155
FTWARE-1.151
mented-1.138
 secretaries-1.108
Token
Token ID198
Feature activation-5.96e-01

Pos logit contributions
zees+0.525
 Crossing+0.525
 Lama+0.484
eding+0.481
 Ski+0.468
Neg logit contributions
Dialogue-0.436
ARI-0.434
FTWARE-0.426
mented-0.425
 secretaries-0.411
Token
Token ID198
Feature activation-9.40e-01

Pos logit contributions
 Crossing+0.497
zees+0.488
 Lama+0.460
eding+0.448
 Ski+0.441
Neg logit contributions
Dialogue-0.385
ARI-0.382
FTWARE-0.380
mented-0.376
 secretaries-0.362
TokenSa
Token ID33890
Feature activation+1.63e+01

Pos logit contributions
 Crossing+0.776
zees+0.760
 Lama+0.714
eding+0.698
 Ski+0.689
Neg logit contributions
Dialogue-0.623
ARI-0.618
FTWARE-0.608
mented-0.604
 secretaries-0.584
Token'd
Token ID1549
Feature activation-1.46e+00

Pos logit contributions
mented+9.610
ARI+9.386
Dialogue+9.167
 elements+8.890
 secretaries+8.756
Neg logit contributions
 Crossing-11.718
zees-11.574
digy-10.873
 Lama-10.762
 Trace-10.438
Token dismissed
Token ID11126
Feature activation+1.06e+00

Pos logit contributions
 Crossing+1.206
zees+1.187
 Lama+1.116
eding+1.087
 Ski+1.071
Neg logit contributions
Dialogue-0.957
ARI-0.951
FTWARE-0.943
mented-0.936
 secretaries-0.913
Token the
Token ID262
Feature activation-6.56e-01

Pos logit contributions
Dialogue+0.752
ARI+0.745
mented+0.731
FTWARE+0.726
 secretaries+0.724
Neg logit contributions
 Crossing-0.803
zees-0.802
 Lama-0.733
eding-0.725
 Ski-0.708
Token pleas
Token ID21289
Feature activation-4.74e+00

Pos logit contributions
 Crossing+0.518
zees+0.515
 Lama+0.476
eding+0.467
 Ski+0.458
Neg logit contributions
Dialogue-0.450
ARI-0.445
FTWARE-0.443
mented-0.438
 secretaries-0.436
Token of
Token ID286
Feature activation-2.64e+00

Pos logit contributions
 Crossing+3.448
zees+3.350
 Lama+3.161
eding+3.077
 Ski+3.003
Neg logit contributions
Dialogue-3.542
FTWARE-3.531
ARI-3.501
mented-3.447
 secretaries-3.390
Tokenthe
Token ID1169
Feature activation-3.07e-02

Pos logit contributions
 Crossing+1.697
zees+1.662
 Lama+1.578
eding+1.541
 Ski+1.520
Neg logit contributions
Dialogue-1.093
ARI-1.084
FTWARE-1.082
mented-1.066
 secretaries-1.036
Token vision
Token ID5761
Feature activation+7.34e-01

Pos logit contributions
 Crossing+0.024
zees+0.024
 Lama+0.022
eding+0.022
 Ski+0.021
Neg logit contributions
Dialogue-0.021
ARI-0.021
FTWARE-0.021
mented-0.021
 secretaries-0.020
Token of
Token ID286
Feature activation+6.28e-01

Pos logit contributions
ARI+0.517
Dialogue+0.516
FTWARE+0.512
mented+0.511
 secretaries+0.498
Neg logit contributions
 Crossing-0.566
zees-0.556
 Lama-0.520
eding-0.504
 Ski-0.498
Token the
Token ID262
Feature activation+2.08e+00

Pos logit contributions
Dialogue+0.430
ARI+0.422
FTWARE+0.422
mented+0.420
 secretaries+0.413
Neg logit contributions
 Crossing-0.496
zees-0.493
 Lama-0.459
eding-0.447
 Ski-0.440
Token founder
Token ID9119
Feature activation+4.95e-01

Pos logit contributions
Dialogue+1.418
ARI+1.379
mented+1.378
FTWARE+1.375
 secretaries+1.364
Neg logit contributions
zees-1.644
 Crossing-1.625
 Lama-1.524
eding-1.478
 Ski-1.442
Token remains
Token ID3793
Feature activation+1.62e+01

Pos logit contributions
Dialogue+0.340
ARI+0.339
FTWARE+0.337
mented+0.336
 secretaries+0.329
Neg logit contributions
 Crossing-0.391
zees-0.384
 Lama-0.359
eding-0.348
 Ski-0.345
Token in
Token ID287
Feature activation+8.35e+00

Pos logit contributions
mented+9.189
 elements+9.071
Dialogue+8.951
 colleg+8.758
 secretaries+8.608
Neg logit contributions
zees-12.620
 Crossing-11.959
MQ-11.195
owler-11.078
 Lama-11.031
Token full
Token ID1336
Feature activation+4.15e+00

Pos logit contributions
mented+5.385
Dialogue+5.275
 elements+5.255
ARI+5.218
 secretaries+5.201
Neg logit contributions
zees-6.473
 Crossing-6.435
 Lama-5.875
owler-5.716
eding-5.682
Token view
Token ID1570
Feature activation-1.90e-01

Pos logit contributions
mented+1.878
Dialogue+1.869
FTWARE+1.861
 secretaries+1.837
ARI+1.825
Neg logit contributions
zees-4.144
 Crossing-4.130
 Lama-3.864
owler-3.762
MQ-3.749
Token over
Token ID625
Feature activation+5.40e+00

Pos logit contributions
 Crossing+0.146
zees+0.143
 Lama+0.134
eding+0.130
 Ski+0.128
Neg logit contributions
Dialogue-0.135
ARI-0.134
FTWARE-0.134
mented-0.133
 secretaries-0.130
Token one
Token ID530
Feature activation-1.73e+00

Pos logit contributions
Dialogue+3.615
mented+3.557
FTWARE+3.509
ARI+3.488
 secretaries+3.456
Neg logit contributions
 Crossing-4.219
zees-4.215
 Lama-3.944
eding-3.797
MQ-3.720
Token poultry
Token ID34101
Feature activation-1.46e+00

Pos logit contributions
 Crossing+0.002
zees+0.002
 Lama+0.002
eding+0.002
 Ski+0.002
Neg logit contributions
mented-0.002
Dialogue-0.002
ARI-0.002
FTWARE-0.002
 secretaries-0.002
Token meat
Token ID6174
Feature activation-1.33e+00

Pos logit contributions
 Crossing+1.090
zees+1.062
 Lama+1.000
eding+0.968
 Ski+0.955
Neg logit contributions
Dialogue-1.062
ARI-1.056
FTWARE-1.054
mented-1.052
 secretaries-1.023
Token in
Token ID287
Feature activation-1.55e+00

Pos logit contributions
 Crossing+0.993
zees+0.965
 Lama+0.909
eding+0.879
 Ski+0.869
Neg logit contributions
Dialogue-0.969
ARI-0.964
FTWARE-0.961
mented-0.958
 secretaries-0.932
Token chlorine
Token ID43107
Feature activation+8.96e-01

Pos logit contributions
 Crossing+1.180
zees+1.153
 Lama+1.081
eding+1.047
 Ski+1.032
Neg logit contributions
Dialogue-1.106
ARI-1.097
FTWARE-1.095
mented-1.092
 secretaries-1.065
Token.
Token ID13
Feature activation+4.23e+00

Pos logit contributions
Dialogue+0.620
mented+0.615
FTWARE+0.614
ARI+0.614
 secretaries+0.593
Neg logit contributions
 Crossing-0.699
zees-0.684
 Lama-0.645
eding-0.624
 Ski-0.616
Token Less
Token ID12892
Feature activation+1.57e+01

Pos logit contributions
Dialogue+2.950
ARI+2.921
FTWARE+2.840
mented+2.839
 elements+2.682
Neg logit contributions
zees-3.238
 Crossing-3.112
 Lama-2.916
eding-2.892
digy-2.865
Token than
Token ID621
Feature activation+7.68e+00

Pos logit contributions
FTWARE+7.006
mented+6.904
Dialogue+6.892
 secretaries+6.852
ARI+6.850
Neg logit contributions
 Crossing-13.552
zees-13.390
 Lama-12.449
WAYS-12.250
 Ski-12.191
Token a
Token ID257
Feature activation+4.15e+00

Pos logit contributions
Dialogue+3.869
 secretaries+3.850
FTWARE+3.773
ARI+3.773
 elements+3.752
Neg logit contributions
zees-7.064
 Crossing-6.935
 Lama-6.405
eding-6.338
MQ-6.219
Token year
Token ID614
Feature activation+6.66e+00

Pos logit contributions
Dialogue+2.139
ARI+2.105
 secretaries+2.073
FTWARE+2.061
mented+2.035
Neg logit contributions
zees-3.897
 Crossing-3.843
 Lama-3.589
eding-3.544
 Ski-3.463
Token later
Token ID1568
Feature activation+2.32e+00

Pos logit contributions
mented+3.837
ARI+3.806
 secretaries+3.767
Dialogue+3.758
FTWARE+3.726
Neg logit contributions
 Crossing-5.607
zees-5.575
 Lama-5.137
 Ski-4.991
MQ-4.965
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
Dialogue+1.472
FTWARE+1.460
ARI+1.454
mented+1.438
 secretaries+1.412
Neg logit contributions
 Crossing-1.925
zees-1.909
 Lama-1.779
eding-1.738
 Ski-1.706
Token notes
Token ID4710
Feature activation+4.48e+00

Pos logit contributions
mented+1.212
ARI+1.205
Dialogue+1.197
FTWARE+1.164
 secretaries+1.160
Neg logit contributions
 Crossing-1.372
zees-1.357
 Lama-1.247
 Ski-1.215
eding-1.209
Token,
Token ID11
Feature activation+2.98e+00

Pos logit contributions
Dialogue+2.880
ARI+2.840
mented+2.755
 elements+2.738
FTWARE+2.737
Neg logit contributions
zees-3.666
 Crossing-3.614
eding-3.327
 Lama-3.292
digy-3.221
Token correctly
Token ID9380
Feature activation-1.63e+00

Pos logit contributions
Dialogue+1.982
ARI+1.974
mented+1.926
FTWARE+1.897
 secretaries+1.887
Neg logit contributions
zees-2.377
 Crossing-2.361
eding-2.163
 Lama-2.161
digy-2.101
Token,
Token ID11
Feature activation+1.88e+00

Pos logit contributions
 Crossing+1.316
zees+1.291
 Lama+1.212
eding+1.182
 Ski+1.164
Neg logit contributions
Dialogue-1.097
ARI-1.091
FTWARE-1.087
mented-1.077
 secretaries-1.050
Token that
Token ID326
Feature activation+1.85e+00

Pos logit contributions
Dialogue+1.254
ARI+1.238
mented+1.218
FTWARE+1.216
 secretaries+1.188
Neg logit contributions
 Crossing-1.505
zees-1.505
 Lama-1.381
eding-1.372
 Ski-1.333
Token Sa
Token ID10318
Feature activation+1.56e+01

Pos logit contributions
Dialogue+1.337
ARI+1.318
FTWARE+1.301
mented+1.295
 secretaries+1.289
Neg logit contributions
zees-1.380
 Crossing-1.376
eding-1.249
 Lama-1.247
 Ski-1.209
Token'd
Token ID1549
Feature activation-3.05e+00

Pos logit contributions
mented+6.635
ARI+6.578
Dialogue+6.431
 elements+5.937
 secretaries+5.909
Neg logit contributions
 Crossing-13.974
zees-13.867
 Lama-13.214
digy-13.079
 Ski-12.692
Token judgment
Token ID8492
Feature activation-3.11e+00

Pos logit contributions
 Crossing+2.357
zees+2.299
 Lama+2.165
eding+2.105
 Ski+2.071
Neg logit contributions
Dialogue-2.147
ARI-2.133
FTWARE-2.132
mented-2.104
 secretaries-2.057
Token acc
Token ID697
Feature activation-4.85e+00

Pos logit contributions
 Crossing+2.448
zees+2.388
 Lama+2.252
eding+2.191
 Ski+2.155
Neg logit contributions
Dialogue-2.150
ARI-2.135
FTWARE-2.135
mented-2.105
 secretaries-2.057
Tokenords
Token ID3669
Feature activation-6.50e+00

Pos logit contributions
 Crossing+2.373
zees+2.290
 Lama+2.082
eding+2.026
 Ski+1.926
Neg logit contributions
Dialogue-4.772
FTWARE-4.761
ARI-4.747
mented-4.684
 secretaries-4.631
Token with
Token ID351
Feature activation-5.80e+00

Pos logit contributions
 Crossing+4.397
zees+4.202
 Lama+4.039
eding+3.864
 Ski+3.765
Neg logit contributions
FTWARE-5.148
Dialogue-5.072
ARI-5.055
mented-4.973
 secretaries-4.860
Token thinks
Token ID6834
Feature activation+1.80e+00

Pos logit contributions
 Crossing+2.602
zees+2.528
 Lama+2.366
eding+2.299
 Ski+2.247
Neg logit contributions
Dialogue-2.986
FTWARE-2.973
ARI-2.968
mented-2.929
 secretaries-2.865
Token that
Token ID326
Feature activation+2.79e+00

Pos logit contributions
Dialogue+1.213
ARI+1.196
 secretaries+1.171
mented+1.166
FTWARE+1.165
Neg logit contributions
 Crossing-1.460
zees-1.422
 Lama-1.325
eding-1.313
 Ski-1.286
Token there
Token ID612
Feature activation+1.29e+00

Pos logit contributions
Dialogue+1.989
ARI+1.957
 secretaries+1.942
 elements+1.916
mented+1.912
Neg logit contributions
 Crossing-2.085
zees-2.062
 Lama-1.885
eding-1.868
 Ski-1.844
Token is
Token ID318
Feature activation-4.06e-01

Pos logit contributions
ARI+0.925
Dialogue+0.925
mented+0.907
FTWARE+0.906
 secretaries+0.891
Neg logit contributions
 Crossing-0.979
zees-0.954
 Lama-0.898
eding-0.865
 Ski-0.859
Token something
Token ID1223
Feature activation+8.05e+00

Pos logit contributions
 Crossing+0.337
zees+0.331
 Lama+0.310
eding+0.302
 Ski+0.300
Neg logit contributions
Dialogue-0.264
ARI-0.261
FTWARE-0.259
mented-0.256
 secretaries-0.252
Token worth
Token ID2861
Feature activation+1.53e+01

Pos logit contributions
ARI+4.685
 elements+4.609
Dialogue+4.552
 secretaries+4.497
mented+4.478
Neg logit contributions
 Crossing-6.724
zees-6.690
 Lama-6.102
 Ski-6.078
eding-6.042
Token saving
Token ID8914
Feature activation+6.08e+00

Pos logit contributions
ARI+9.214
mented+9.213
 elements+8.883
Dialogue+8.775
 secretaries+8.690
Neg logit contributions
zees-11.153
 Crossing-10.698
 Ski-10.008
 Lama-9.855
 Distribution-9.599
Token in
Token ID287
Feature activation+2.75e+00

Pos logit contributions
Dialogue+3.862
ARI+3.823
FTWARE+3.696
mented+3.696
 elements+3.631
Neg logit contributions
zees-4.922
 Crossing-4.915
 Lama-4.542
eding-4.445
 Ski-4.422
Token all
Token ID477
Feature activation-2.43e+00

Pos logit contributions
Dialogue+1.978
ARI+1.908
 secretaries+1.892
mented+1.889
FTWARE+1.878
Neg logit contributions
 Crossing-2.058
zees-2.038
 Lama-1.885
eding-1.846
 Ski-1.837
Token of
Token ID286
Feature activation-6.11e+00

Pos logit contributions
 Crossing+1.898
zees+1.853
 Lama+1.745
eding+1.696
 Ski+1.673
Neg logit contributions
Dialogue-1.695
ARI-1.683
FTWARE-1.678
mented-1.661
 secretaries-1.626
Token these
Token ID777
Feature activation-3.14e+00

Pos logit contributions
 Crossing+4.017
zees+3.847
 Lama+3.635
eding+3.474
 Ski+3.425
Neg logit contributions
FTWARE-5.000
Dialogue-4.940
ARI-4.921
mented-4.882
 secretaries-4.757
Token Harry
Token ID5850
Feature activation-5.06e-01

Pos logit contributions
Dialogue+2.583
mented+2.538
ARI+2.519
FTWARE+2.516
 secretaries+2.447
Neg logit contributions
 Crossing-3.102
zees-3.090
 Lama-2.845
eding-2.842
digy-2.719
Token Potter
Token ID14179
Feature activation+6.03e+00

Pos logit contributions
 Crossing+0.351
zees+0.348
 Lama+0.320
eding+0.316
 Ski+0.305
Neg logit contributions
Dialogue-0.389
ARI-0.388
FTWARE-0.385
mented-0.382
 secretaries-0.376
Token,
Token ID11
Feature activation+2.45e+00

Pos logit contributions
Dialogue+3.765
FTWARE+3.763
ARI+3.709
mented+3.708
 secretaries+3.596
Neg logit contributions
 Crossing-4.893
zees-4.861
 Lama-4.534
eding-4.530
digy-4.340
Token
Token ID564
Feature activation-2.81e-01

Pos logit contributions
Dialogue+1.696
mented+1.673
ARI+1.666
FTWARE+1.660
 secretaries+1.622
Neg logit contributions
 Crossing-1.877
zees-1.873
 Lama-1.720
eding-1.711
digy-1.638
Token
Token ID250
Feature activation+9.75e-01

Pos logit contributions
 Crossing+0.176
zees+0.169
 Lama+0.156
eding+0.151
 Ski+0.149
Neg logit contributions
Dialogue-0.242
ARI-0.240
mented-0.238
FTWARE-0.236
 secretaries-0.230
Tokenneed
Token ID31227
Feature activation+1.50e+01

Pos logit contributions
Dialogue+0.657
ARI+0.644
FTWARE+0.634
mented+0.634
 secretaries+0.613
Neg logit contributions
 Crossing-0.795
zees-0.777
 Lama-0.733
eding-0.713
 Ski-0.698
Token for
Token ID329
Feature activation+6.73e+00

Pos logit contributions
Dialogue+8.774
FTWARE+8.673
 secretaries+8.659
ARI+8.517
mented+8.502
Neg logit contributions
 Crossing-11.258
zees-11.156
 Lama-10.327
digy-10.267
eding-10.006
Token a
Token ID257
Feature activation+3.67e+00

Pos logit contributions
Dialogue+5.156
mented+5.049
ARI+5.020
 secretaries+4.976
FTWARE+4.929
Neg logit contributions
zees-4.638
 Crossing-4.484
 Lama-4.026
eding-4.007
MQ-3.968
Token boyfriend
Token ID13850
Feature activation-1.60e+00

Pos logit contributions
Dialogue+2.692
ARI+2.671
mented+2.656
FTWARE+2.646
 secretaries+2.630
Neg logit contributions
 Crossing-2.632
zees-2.619
 Lama-2.367
eding-2.359
digy-2.298
Token
Token ID447
Feature activation+6.89e+00

Pos logit contributions
 Crossing+1.276
zees+1.245
 Lama+1.175
eding+1.144
 Ski+1.126
Neg logit contributions
Dialogue-1.083
ARI-1.076
FTWARE-1.075
mented-1.061
 secretaries-1.036
Token
Token ID251
Feature activation+1.11e+00

Pos logit contributions
mented+5.625
Dialogue+5.528
ARI+5.415
FTWARE+5.335
 secretaries+5.284
Neg logit contributions
 Crossing-4.276
zees-4.117
 Ski-3.743
eding-3.672
 Lama-3.665
Token is
Token ID318
Feature activation-2.32e+00

Pos logit contributions
 Crossing+4.184
zees+4.025
 Lama+3.817
eding+3.694
 Ski+3.627
Neg logit contributions
FTWARE-4.290
Dialogue-4.274
ARI-4.250
mented-4.200
 secretaries-4.085
Token however
Token ID2158
Feature activation-4.45e+00

Pos logit contributions
 Crossing+1.792
zees+1.751
 Lama+1.645
eding+1.600
 Ski+1.579
Neg logit contributions
Dialogue-1.641
ARI-1.634
FTWARE-1.617
mented-1.611
 secretaries-1.575
Token a
Token ID257
Feature activation-4.54e+00

Pos logit contributions
 Crossing+3.428
zees+3.338
 Lama+3.151
eding+3.063
 Ski+3.003
Neg logit contributions
Dialogue-3.145
FTWARE-3.131
ARI-3.117
mented-3.080
 secretaries-3.010
Token matter
Token ID2300
Feature activation+3.39e+00

Pos logit contributions
 Crossing+3.420
zees+3.313
 Lama+3.135
eding+3.037
 Ski+2.987
Neg logit contributions
Dialogue-3.299
FTWARE-3.279
ARI-3.267
mented-3.229
 secretaries-3.158
Token that
Token ID326
Feature activation+2.93e+00

Pos logit contributions
ARI+2.207
Dialogue+2.192
mented+2.186
 secretaries+2.126
FTWARE+2.118
Neg logit contributions
 Crossing-2.747
zees-2.690
 Lama-2.492
eding-2.444
 Ski-2.436
Token needs
Token ID2476
Feature activation+1.50e+01

Pos logit contributions
Dialogue+1.974
ARI+1.968
 secretaries+1.932
mented+1.917
 elements+1.872
Neg logit contributions
 Crossing-2.284
zees-2.284
 Lama-2.074
eding-2.070
digy-2.020
Token to
Token ID284
Feature activation+7.64e+00

Pos logit contributions
Dialogue+6.990
ARI+6.981
mented+6.947
 elements+6.865
 secretaries+6.652
Neg logit contributions
zees-13.039
 Crossing-12.607
digy-11.916
owler-11.811
 Ski-11.635
Token be
Token ID307
Feature activation+8.79e+00

Pos logit contributions
mented+4.542
Dialogue+4.537
ARI+4.535
FTWARE+4.362
 elements+4.339
Neg logit contributions
 Crossing-6.439
zees-6.199
 Lama-5.916
owler-5.764
digy-5.699
Token handled
Token ID12118
Feature activation+4.51e+00

Pos logit contributions
mented+5.216
ARI+4.969
Dialogue+4.933
 elements+4.823
 secretaries+4.813
Neg logit contributions
zees-7.306
 Crossing-7.271
 Lama-6.690
owler-6.668
 Ski-6.518
Token by
Token ID416
Feature activation+3.12e+00

Pos logit contributions
Dialogue+2.916
ARI+2.904
mented+2.875
FTWARE+2.775
 secretaries+2.762
Neg logit contributions
 Crossing-3.648
zees-3.625
 Lama-3.306
 Ski-3.231
MQ-3.225
Token parents
Token ID3397
Feature activation+5.46e-01

Pos logit contributions
Dialogue+2.097
ARI+2.084
mented+2.062
 secretaries+2.060
FTWARE+2.029
Neg logit contributions
 Crossing-2.446
zees-2.446
 Lama-2.214
eding-2.192
 Ski-2.169
Token.
Token ID13
Feature activation+2.71e+00

Pos logit contributions
 Crossing+0.268
zees+0.262
 Lama+0.247
eding+0.239
 Ski+0.235
Neg logit contributions
Dialogue-0.243
FTWARE-0.242
ARI-0.241
mented-0.240
 secretaries-0.234
Token The
Token ID383
Feature activation-9.04e-01

Pos logit contributions
ARI+1.723
FTWARE+1.715
Dialogue+1.709
mented+1.668
 secretaries+1.607
Neg logit contributions
zees-2.232
 Crossing-2.217
 Lama-2.089
eding-2.043
 Ski-1.991
Token petition
Token ID8853
Feature activation+7.06e-01

Pos logit contributions
 Crossing+0.669
zees+0.660
 Lama+0.620
eding+0.599
 Ski+0.586
Neg logit contributions
FTWARE-0.658
Dialogue-0.658
ARI-0.655
mented-0.648
 secretaries-0.641
Token says
Token ID1139
Feature activation+1.37e+00

Pos logit contributions
ARI+0.481
Dialogue+0.480
mented+0.476
FTWARE+0.475
 secretaries+0.464
Neg logit contributions
 Crossing-0.560
zees-0.550
 Lama-0.516
eding-0.497
 Ski-0.496
Token that
Token ID326
Feature activation+1.05e+00

Pos logit contributions
FTWARE+0.955
Dialogue+0.950
ARI+0.945
mented+0.933
 secretaries+0.932
Neg logit contributions
 Crossing-1.052
zees-1.046
 Lama-0.996
eding-0.951
 Ski-0.934
Token IE
Token ID28157
Feature activation+1.48e+01

Pos logit contributions
Dialogue+0.738
FTWARE+0.738
ARI+0.733
mented+0.726
 secretaries+0.722
Neg logit contributions
 Crossing-0.794
zees-0.789
 Lama-0.748
eding-0.714
 Ski-0.703
Token6
Token ID21
Feature activation-2.31e+00

Pos logit contributions
 elements+7.755
mented+7.491
FTWARE+7.488
ARI+7.358
Dialogue+7.333
Neg logit contributions
 Crossing-11.603
 Lama-11.579
zees-11.263
eding-10.833
-10.731
Token has
Token ID468
Feature activation-2.81e+00

Pos logit contributions
 Crossing+1.734
zees+1.689
 Lama+1.587
eding+1.543
 Ski+1.517
Neg logit contributions
Dialogue-1.679
ARI-1.669
FTWARE-1.668
mented-1.647
 secretaries-1.610
Token security
Token ID2324
Feature activation-1.39e-01

Pos logit contributions
 Crossing+2.150
zees+2.096
 Lama+1.973
eding+1.919
 Ski+1.887
Neg logit contributions
Dialogue-2.002
FTWARE-1.989
ARI-1.989
mented-1.961
 secretaries-1.916
Token flaws
Token ID17978
Feature activation-2.69e-01

Pos logit contributions
 Crossing+0.125
zees+0.123
 Lama+0.117
eding+0.113
 Ski+0.112
Neg logit contributions
ARI-0.079
Dialogue-0.079
FTWARE-0.079
mented-0.078
 secretaries-0.077
Token and
Token ID290
Feature activation-3.26e+00

Pos logit contributions
 Crossing+0.209
zees+0.206
 Lama+0.193
eding+0.186
 Ski+0.184
Neg logit contributions
ARI-0.186
FTWARE-0.186
Dialogue-0.186
mented-0.184
 secretaries-0.180
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
Dialogue+2.463
ARI+2.456
FTWARE+2.381
mented+2.320
 secretaries+2.213
Neg logit contributions
zees-3.212
 Crossing-3.132
eding-2.976
 Lama-2.927
digy-2.858
Token
Token ID198
Feature activation+2.66e+00

Pos logit contributions
 Crossing+0.377
zees+0.369
 Lama+0.349
eding+0.340
 Ski+0.334
Neg logit contributions
Dialogue-0.298
ARI-0.296
FTWARE-0.295
mented-0.291
 secretaries-0.282
Token
Token ID447
Feature activation+2.06e+00

Pos logit contributions
Dialogue+1.658
ARI+1.644
FTWARE+1.584
mented+1.543
ATURE+1.460
Neg logit contributions
 Crossing-2.346
zees-2.276
 Lama-2.179
eding-2.098
 Ski-2.089
Token
Token ID250
Feature activation+7.90e+00

Pos logit contributions
Dialogue+1.468
ARI+1.455
FTWARE+1.436
mented+1.433
 secretaries+1.382
Neg logit contributions
 Crossing-1.575
zees-1.536
 Lama-1.448
 Ski-1.399
eding-1.391
TokenSomething
Token ID22210
Feature activation+5.73e+00

Pos logit contributions
ARI+5.116
Dialogue+5.076
mented+4.942
FTWARE+4.940
 secretaries+4.649
Neg logit contributions
 Crossing-6.213
zees-6.168
 Lama-5.847
eding-5.691
digy-5.462
Token needs
Token ID2476
Feature activation+1.47e+01

Pos logit contributions
Dialogue+3.680
ARI+3.677
 secretaries+3.570
mented+3.562
FTWARE+3.533
Neg logit contributions
 Crossing-4.547
zees-4.544
eding-4.217
 Lama-4.197
digy-4.029
Token to
Token ID284
Feature activation+1.00e+01

Pos logit contributions
mented+6.011
ARI+5.773
Dialogue+5.672
 elements+5.393
well+5.211
Neg logit contributions
zees-14.336
 Crossing-13.638
owler-13.231
 Lama-12.885
 Ski-12.614
Token happen
Token ID1645
Feature activation+3.69e+00

Pos logit contributions
ARI+6.213
mented+6.070
FTWARE+6.065
Dialogue+6.026
 elements+5.872
Neg logit contributions
 Crossing-7.846
zees-7.797
 Lama-7.275
MQ-7.072
owler-6.993
Token to
Token ID284
Feature activation+3.21e+00

Pos logit contributions
ARI+2.407
Dialogue+2.385
FTWARE+2.372
mented+2.353
 secretaries+2.309
Neg logit contributions
 Crossing-2.963
zees-2.953
 Lama-2.728
eding-2.669
 Ski-2.615
Token these
Token ID777
Feature activation-3.87e+00

Pos logit contributions
Dialogue+2.274
ARI+2.259
mented+2.237
FTWARE+2.228
 secretaries+2.214
Neg logit contributions
 Crossing-2.422
zees-2.408
 Lama-2.215
eding-2.170
 Ski-2.096
Token people
Token ID661
Feature activation-1.22e+00

Pos logit contributions
 Crossing+3.211
zees+3.123
 Lama+2.960
eding+2.890
 Ski+2.850
Neg logit contributions
Dialogue-2.503
FTWARE-2.493
ARI-2.484
mented-2.453
 secretaries-2.360
Token last
Token ID938
Feature activation+3.01e+00

Pos logit contributions
ARI+1.093
Dialogue+1.086
FTWARE+1.080
mented+1.079
 secretaries+1.038
Neg logit contributions
zees-1.270
 Crossing-1.267
 Lama-1.169
eding-1.131
 Ski-1.114
Token year
Token ID614
Feature activation+1.78e-01

Pos logit contributions
ARI+2.299
Dialogue+2.296
FTWARE+2.270
mented+2.259
 secretaries+2.204
Neg logit contributions
 Crossing-2.128
zees-2.120
 Lama-1.934
eding-1.875
 Ski-1.847
Token.
Token ID13
Feature activation+5.48e-01

Pos logit contributions
ARI+0.121
FTWARE+0.121
Dialogue+0.121
mented+0.120
 secretaries+0.116
Neg logit contributions
 Crossing-0.141
zees-0.139
 Lama-0.129
eding-0.125
 Ski-0.124
Token
Token ID198
Feature activation-7.74e-01

Pos logit contributions
Dialogue+0.389
ARI+0.387
FTWARE+0.383
mented+0.377
 secretaries+0.357
Neg logit contributions
zees-0.423
 Crossing-0.413
 Lama-0.387
eding-0.383
 Ski-0.367
Token
Token ID198
Feature activation+3.81e-01

Pos logit contributions
 Crossing+0.643
zees+0.630
 Lama+0.595
eding+0.579
 Ski+0.570
Neg logit contributions
Dialogue-0.501
ARI-0.498
FTWARE-0.496
mented-0.490
 secretaries-0.473
TokenGil
Token ID40747
Feature activation+1.46e+01

Pos logit contributions
Dialogue+0.238
ARI+0.236
FTWARE+0.230
mented+0.226
 secretaries+0.212
Neg logit contributions
 Crossing-0.333
zees-0.328
 Lama-0.312
eding-0.300
 Ski-0.298
Tokenbert
Token ID4835
Feature activation-2.03e-01

Pos logit contributions
mented+7.949
ARI+7.326
Dialogue+7.079
FTWARE+6.964
elve+6.885
Neg logit contributions
zees-12.434
 Crossing-12.257
 Lama-12.011
eding-11.358
MQ-11.199
Token is
Token ID318
Feature activation-7.71e-01

Pos logit contributions
 Crossing+0.162
zees+0.160
 Lama+0.149
eding+0.144
 Ski+0.142
Neg logit contributions
ARI-0.137
Dialogue-0.137
mented-0.137
FTWARE-0.135
 secretaries-0.131
Token known
Token ID1900
Feature activation-1.04e+00

Pos logit contributions
 Crossing+0.603
zees+0.593
 Lama+0.554
eding+0.541
 Ski+0.530
Neg logit contributions
Dialogue-0.533
ARI-0.533
FTWARE-0.529
mented-0.526
 secretaries-0.512
Token as
Token ID355
Feature activation-6.72e+00

Pos logit contributions
 Crossing+0.811
zees+0.792
 Lama+0.746
eding+0.725
 Ski+0.713
Neg logit contributions
Dialogue-0.724
ARI-0.720
FTWARE-0.718
mented-0.710
 secretaries-0.693
Token a
Token ID257
Feature activation-4.20e+00

Pos logit contributions
 Crossing+4.943
zees+4.720
 Lama+4.470
eding+4.351
 Ski+4.309
Neg logit contributions
Dialogue-4.881
FTWARE-4.811
ARI-4.753
mented-4.734
 secretaries-4.703
Token to
Token ID284
Feature activation+2.84e+00

Pos logit contributions
 Crossing+2.374
zees+2.305
 Lama+2.170
eding+2.116
 Ski+2.075
Neg logit contributions
Dialogue-2.310
ARI-2.294
FTWARE-2.290
mented-2.262
 secretaries-2.209
Token phase
Token ID7108
Feature activation-1.92e+00

Pos logit contributions
FTWARE+1.925
Dialogue+1.900
ARI+1.893
mented+1.890
 secretaries+1.870
Neg logit contributions
 Crossing-2.254
zees-2.231
 Lama-2.096
 Ski-1.965
owler-1.953
Token out
Token ID503
Feature activation-1.26e+00

Pos logit contributions
 Crossing+1.322
zees+1.285
 Lama+1.201
eding+1.163
 Ski+1.141
Neg logit contributions
Dialogue-1.518
ARI-1.509
FTWARE-1.509
mented-1.492
 secretaries-1.462
Token support
Token ID1104
Feature activation+2.09e+00

Pos logit contributions
 Crossing+0.921
zees+0.902
 Lama+0.847
eding+0.818
 Ski+0.806
Neg logit contributions
Dialogue-0.938
FTWARE-0.937
ARI-0.933
mented-0.925
 secretaries-0.906
Token for
Token ID329
Feature activation-3.69e-01

Pos logit contributions
FTWARE+1.446
Dialogue+1.445
ARI+1.436
mented+1.427
 secretaries+1.397
Neg logit contributions
 Crossing-1.616
zees-1.586
 Lama-1.505
eding-1.442
 Ski-1.432
Token IE
Token ID28157
Feature activation+1.45e+01

Pos logit contributions
 Crossing+0.270
zees+0.265
 Lama+0.250
eding+0.241
 Ski+0.237
Neg logit contributions
FTWARE-0.272
Dialogue-0.272
ARI-0.271
mented-0.269
 secretaries-0.264
Token6
Token ID21
Feature activation+8.54e-01

Pos logit contributions
 elements+7.432
FTWARE+7.292
mented+7.121
Dialogue+6.882
elve+6.880
Neg logit contributions
 Lama-11.547
 Crossing-11.535
zees-11.407
eding-11.056
-10.884
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
Dialogue+0.599
FTWARE+0.599
ARI+0.594
mented+0.591
 secretaries+0.579
Neg logit contributions
 Crossing-0.654
zees-0.643
 Lama-0.611
eding-0.587
 Ski-0.578
Token The
Token ID383
Feature activation+6.70e-01

Pos logit contributions
FTWARE+0.851
Dialogue+0.845
ARI+0.842
mented+0.829
 secretaries+0.806
Neg logit contributions
zees-0.909
 Crossing-0.898
 Lama-0.841
eding-0.821
 Ski-0.799
Token firm
Token ID4081
Feature activation+4.55e+00

Pos logit contributions
FTWARE+0.472
Dialogue+0.466
ARI+0.464
 secretaries+0.463
mented+0.463
Neg logit contributions
 Crossing-0.507
zees-0.507
 Lama-0.476
eding-0.456
 Ski-0.449
Token said
Token ID531
Feature activation+5.45e-01

Pos logit contributions
ARI+2.835
mented+2.827
 secretaries+2.815
Dialogue+2.790
FTWARE+2.782
Neg logit contributions
zees-3.741
 Crossing-3.724
 Lama-3.436
eding-3.286
MQ-3.255
Token's
Token ID338
Feature activation-5.21e+00

Pos logit contributions
 Crossing+3.995
zees+3.884
 Lama+3.658
eding+3.513
 Ski+3.488
Neg logit contributions
Dialogue-4.387
FTWARE-4.380
ARI-4.296
mented-4.269
 secretaries-4.150
Token some
Token ID617
Feature activation+1.99e+00

Pos logit contributions
 Crossing+3.926
zees+3.781
 Lama+3.601
eding+3.491
 Ski+3.439
Neg logit contributions
FTWARE-3.741
Dialogue-3.731
ARI-3.701
mented-3.660
 secretaries-3.561
Token common
Token ID2219
Feature activation-2.60e+00

Pos logit contributions
ARI+1.318
Dialogue+1.316
FTWARE+1.288
mented+1.284
 secretaries+1.283
Neg logit contributions
zees-1.612
 Crossing-1.593
 Lama-1.474
eding-1.434
 Ski-1.408
Token questions
Token ID2683
Feature activation+2.01e+00

Pos logit contributions
 Crossing+2.223
zees+2.175
 Lama+2.059
eding+2.008
 Ski+1.978
Neg logit contributions
Dialogue-1.623
ARI-1.612
FTWARE-1.610
mented-1.586
 secretaries-1.548
Token/
Token ID14
Feature activation+2.37e+00

Pos logit contributions
ARI+1.347
Dialogue+1.327
 secretaries+1.314
mented+1.310
FTWARE+1.294
Neg logit contributions
zees-1.584
 Crossing-1.577
 Lama-1.468
eding-1.415
digy-1.413
Tokenmis
Token ID25413
Feature activation+1.45e+01

Pos logit contributions
Dialogue+1.575
ARI+1.546
mented+1.523
FTWARE+1.494
 secretaries+1.463
Neg logit contributions
 Crossing-1.935
zees-1.908
 Lama-1.800
 Ski-1.739
eding-1.713
Tokencon
Token ID1102
Feature activation+1.55e-01

Pos logit contributions
mented+7.240
ARI+7.019
 elements+6.905
Dialogue+6.793
FTWARE+6.753
Neg logit contributions
 Crossing-12.128
zees-11.636
 Lama-11.413
 Ski-11.099
 Stamp-10.783
Tokenceptions
Token ID11755
Feature activation+9.08e-01

Pos logit contributions
Dialogue+0.102
FTWARE+0.102
ARI+0.101
mented+0.101
 secretaries+0.098
Neg logit contributions
 Crossing-0.126
zees-0.124
 Lama-0.116
eding-0.112
 Ski-0.111
Token I
Token ID314
Feature activation+4.15e+00

Pos logit contributions
ARI+0.614
Dialogue+0.609
mented+0.601
FTWARE+0.597
 secretaries+0.594
Neg logit contributions
 Crossing-0.717
zees-0.714
 Lama-0.667
eding-0.644
digy-0.635
Token've
Token ID1053
Feature activation-1.04e+00

Pos logit contributions
ARI+2.309
mented+2.302
Dialogue+2.287
FTWARE+2.284
 secretaries+2.232
Neg logit contributions
 Crossing-3.719
zees-3.636
 Lama-3.499
digy-3.301
 Ski-3.285
Token been
Token ID587
Feature activation+3.98e-01

Pos logit contributions
 Crossing+0.837
zees+0.823
 Lama+0.777
eding+0.744
 Ski+0.738
Neg logit contributions
ARI-0.691
Dialogue-0.688
FTWARE-0.685
mented-0.684
 secretaries-0.669
Token Microsoft
Token ID5413
Feature activation+7.13e+00

Pos logit contributions
Dialogue+0.119
FTWARE+0.119
ARI+0.119
mented+0.117
 secretaries+0.114
Neg logit contributions
zees-0.127
 Crossing-0.127
 Lama-0.118
eding-0.116
 Ski-0.113
Token patched
Token ID39378
Feature activation-3.51e+00

Pos logit contributions
Dialogue+4.485
 secretaries+4.456
mented+4.443
ARI+4.422
FTWARE+4.397
Neg logit contributions
zees-5.695
 Crossing-5.566
 Lama-5.284
owler-5.028
eding-5.007
Token the
Token ID262
Feature activation-1.30e+00

Pos logit contributions
 Crossing+2.545
zees+2.457
 Lama+2.304
eding+2.243
 Ski+2.213
Neg logit contributions
Dialogue-2.647
ARI-2.626
FTWARE-2.621
mented-2.581
 secretaries-2.527
Token hole
Token ID7604
Feature activation-8.09e-01

Pos logit contributions
 Crossing+1.007
zees+0.992
 Lama+0.934
eding+0.903
 Ski+0.886
Neg logit contributions
Dialogue-0.904
FTWARE-0.903
ARI-0.900
mented-0.890
 secretaries-0.876
Token in
Token ID287
Feature activation+5.06e-02

Pos logit contributions
 Crossing+0.613
zees+0.603
 Lama+0.565
eding+0.547
 Ski+0.537
Neg logit contributions
Dialogue-0.578
FTWARE-0.577
ARI-0.576
mented-0.571
 secretaries-0.559
Token IE
Token ID28157
Feature activation+1.44e+01

Pos logit contributions
Dialogue+0.033
FTWARE+0.033
ARI+0.033
mented+0.032
 secretaries+0.032
Neg logit contributions
 Crossing-0.041
zees-0.041
 Lama-0.039
eding-0.037
 Ski-0.037
Token6
Token ID21
Feature activation+1.63e+00

Pos logit contributions
FTWARE+7.333
 elements+7.220
Dialogue+7.201
ARI+7.081
mented+7.038
Neg logit contributions
 Crossing-11.465
zees-11.462
 Lama-11.295
eding-10.830
-10.542
Token in
Token ID287
Feature activation+4.18e-01

Pos logit contributions
FTWARE+1.141
Dialogue+1.138
ARI+1.130
mented+1.119
 secretaries+1.097
Neg logit contributions
 Crossing-1.253
zees-1.234
 Lama-1.167
eding-1.129
 Ski-1.103
Token January
Token ID3269
Feature activation+3.09e+00

Pos logit contributions
Dialogue+0.250
FTWARE+0.247
ARI+0.247
mented+0.246
 secretaries+0.240
Neg logit contributions
 Crossing-0.364
zees-0.361
 Lama-0.339
eding-0.328
 Ski-0.324
Token,
Token ID11
Feature activation-2.96e+00

Pos logit contributions
FTWARE+1.958
mented+1.945
Dialogue+1.931
ARI+1.914
 secretaries+1.869
Neg logit contributions
zees-2.556
 Crossing-2.544
 Lama-2.381
eding-2.263
 Ski-2.258
Token three
Token ID1115
Feature activation-6.05e+00

Pos logit contributions
 Crossing+2.148
zees+2.087
 Lama+1.960
eding+1.905
 Ski+1.870
Neg logit contributions
Dialogue-2.224
ARI-2.211
FTWARE-2.207
mented-2.181
 secretaries-2.133
Token kiss
Token ID9245
Feature activation-6.50e-01

Pos logit contributions
Dialogue+2.704
ARI+2.666
mented+2.642
 secretaries+2.620
FTWARE+2.615
Neg logit contributions
zees-3.085
 Crossing-3.047
 Lama-2.781
eding-2.772
 Ski-2.661
Token,
Token ID11
Feature activation-8.31e-03

Pos logit contributions
 Crossing+0.526
zees+0.518
 Lama+0.484
eding+0.473
 Ski+0.464
Neg logit contributions
Dialogue-0.435
ARI-0.433
FTWARE-0.429
mented-0.428
 secretaries-0.414
Token shockingly
Token ID49992
Feature activation-2.86e+00

Pos logit contributions
 Crossing+0.006
zees+0.006
 Lama+0.006
eding+0.006
 Ski+0.006
Neg logit contributions
Dialogue-0.006
ARI-0.006
FTWARE-0.006
mented-0.006
 secretaries-0.006
Token bad
Token ID2089
Feature activation-1.04e-01

Pos logit contributions
 Crossing+2.323
zees+2.266
 Lama+2.144
eding+2.086
 Ski+2.055
Neg logit contributions
Dialogue-1.909
FTWARE-1.899
ARI-1.897
mented-1.869
 secretaries-1.823
Token;
Token ID26
Feature activation+3.42e+00

Pos logit contributions
 Crossing+0.087
zees+0.086
 Lama+0.080
eding+0.079
 Ski+0.077
Neg logit contributions
Dialogue-0.066
ARI-0.065
FTWARE-0.065
mented-0.064
 secretaries-0.063
Token Marg
Token ID11899
Feature activation+1.44e+01

Pos logit contributions
Dialogue+2.426
ARI+2.405
mented+2.388
 secretaries+2.382
FTWARE+2.364
Neg logit contributions
 Crossing-2.551
zees-2.539
 Lama-2.322
eding-2.295
 Ski-2.229
Tokenot
Token ID313
Feature activation-4.79e-01

Pos logit contributions
mented+9.136
ARI+8.585
Dialogue+8.493
elve+8.285
 elements+8.097
Neg logit contributions
 Crossing-10.600
zees-10.210
 Lama-9.559
 Ski-9.504
MQ-9.360
Token had
Token ID550
Feature activation-1.42e+00

Pos logit contributions
 Crossing+0.394
zees+0.387
 Lama+0.362
eding+0.353
 Ski+0.347
Neg logit contributions
Dialogue-0.313
ARI-0.312
mented-0.309
FTWARE-0.309
 secretaries-0.301
Token trouble
Token ID5876
Feature activation-1.85e+00

Pos logit contributions
 Crossing+1.129
zees+1.103
 Lama+1.037
eding+1.011
 Ski+0.994
Neg logit contributions
Dialogue-0.975
ARI-0.968
FTWARE-0.966
mented-0.956
 secretaries-0.934
Token believing
Token ID14773
Feature activation-1.18e+00

Pos logit contributions
 Crossing+1.363
zees+1.349
 Lama+1.247
eding+1.209
 Ski+1.194
Neg logit contributions
Dialogue-1.371
ARI-1.358
FTWARE-1.356
mented-1.352
 secretaries-1.324
Token that
Token ID326
Feature activation+4.45e-01

Pos logit contributions
 Crossing+0.850
zees+0.830
 Lama+0.775
eding+0.755
 Ski+0.738
Neg logit contributions
Dialogue-0.899
ARI-0.891
FTWARE-0.889
mented-0.883
 secretaries-0.866
Token petty
Token ID25229
Feature activation+3.98e+00

Pos logit contributions
Dialogue+1.333
ARI+1.329
 secretaries+1.314
mented+1.306
FTWARE+1.303
Neg logit contributions
 Crossing-1.613
zees-1.596
 Lama-1.486
eding-1.454
 Ski-1.439
Token-
Token ID12
Feature activation+7.55e+00

Pos logit contributions
ARI+2.756
Dialogue+2.744
mented+2.728
FTWARE+2.728
 secretaries+2.681
Neg logit contributions
 Crossing-3.104
zees-2.997
 Lama-2.838
eding-2.760
 Ski-2.733
Tokenbour
Token ID6084
Feature activation+8.18e-01

Pos logit contributions
ARI+3.399
mented+3.367
Dialogue+3.335
FTWARE+3.216
 secretaries+3.183
Neg logit contributions
 Crossing-7.477
zees-7.150
 Lama-6.968
 Ski-6.828
eding-6.771
Tokenge
Token ID469
Feature activation+2.35e-01

Pos logit contributions
Dialogue+0.680
ARI+0.675
FTWARE+0.673
mented+0.672
 secretaries+0.653
Neg logit contributions
 Crossing-0.528
zees-0.505
 Lama-0.477
eding-0.455
 Ski-0.451
Tokenois
Token ID10924
Feature activation+2.76e+00

Pos logit contributions
Dialogue+0.210
ARI+0.209
FTWARE+0.208
mented+0.207
 secretaries+0.203
Neg logit contributions
 Crossing-0.138
zees-0.132
 Lama-0.123
eding-0.117
 Ski-0.116
Token intellig
Token ID10878
Feature activation+1.43e+01

Pos logit contributions
Dialogue+1.360
ARI+1.359
mented+1.337
FTWARE+1.322
 secretaries+1.315
Neg logit contributions
 Crossing-2.699
zees-2.639
 Lama-2.517
 Ski-2.449
eding-2.446
Tokenents
Token ID658
Feature activation+8.40e-01

Pos logit contributions
mented+4.565
Dialogue+3.931
elve+3.895
 elements+3.872
ARI+3.844
Neg logit contributions
 Crossing-15.784
zees-14.488
 Lama-14.346
 Trace-14.106
 Ski-14.083
Tokenia
Token ID544
Feature activation+4.43e+00

Pos logit contributions
Dialogue+0.736
ARI+0.735
mented+0.729
FTWARE+0.727
 secretaries+0.708
Neg logit contributions
 Crossing-0.507
zees-0.485
 Lama-0.449
eding-0.429
 Ski-0.423
Token forged
Token ID24918
Feature activation+9.39e-01

Pos logit contributions
ARI+2.902
mented+2.901
Dialogue+2.888
 secretaries+2.810
 elements+2.780
Neg logit contributions
 Crossing-3.541
zees-3.490
 Lama-3.238
 Ski-3.132
 Trace-3.088
Token in
Token ID287
Feature activation+2.64e+00

Pos logit contributions
Dialogue+0.644
ARI+0.636
FTWARE+0.628
mented+0.626
 secretaries+0.617
Neg logit contributions
 Crossing-0.748
zees-0.734
 Lama-0.686
eding-0.666
 Ski-0.662
Token haste
Token ID36797
Feature activation+1.93e+00

Pos logit contributions
Dialogue+1.885
ARI+1.853
mented+1.839
FTWARE+1.810
 secretaries+1.805
Neg logit contributions
 Crossing-2.024
zees-1.980
 Lama-1.844
 Ski-1.786
eding-1.781
Token/
Token ID14
Feature activation+2.25e+00

Pos logit contributions
 Crossing+0.108
zees+0.105
 Lama+0.099
eding+0.096
 Ski+0.094
Neg logit contributions
Dialogue-0.098
ARI-0.098
FTWARE-0.098
mented-0.098
 secretaries-0.094
TokenAP
Token ID2969
Feature activation-1.41e+00

Pos logit contributions
ARI+1.405
Dialogue+1.356
FTWARE+1.333
mented+1.306
 secretaries+1.269
Neg logit contributions
zees-1.963
 Crossing-1.947
 Lama-1.829
eding-1.776
 Ski-1.732
Token)
Token ID8
Feature activation+3.49e+00

Pos logit contributions
 Crossing+1.072
zees+1.040
 Lama+0.974
eding+0.951
 Ski+0.943
Neg logit contributions
Dialogue-1.021
FTWARE-1.017
ARI-1.002
mented-0.996
 secretaries-0.972
Token
Token ID198
Feature activation+2.78e+00

Pos logit contributions
ARI+2.160
Dialogue+2.154
FTWARE+2.139
mented+2.105
 secretaries+2.034
Neg logit contributions
 Crossing-2.943
zees-2.907
 Lama-2.742
eding-2.681
 Ski-2.615
Token
Token ID198
Feature activation+4.09e+00

Pos logit contributions
Dialogue+1.620
ARI+1.600
FTWARE+1.564
mented+1.563
 elements+1.441
Neg logit contributions
 Crossing-2.478
zees-2.452
 Lama-2.311
eding-2.244
owler-2.220
TokenGil
Token ID40747
Feature activation+1.43e+01

Pos logit contributions
Dialogue+2.489
ARI+2.458
FTWARE+2.348
mented+2.244
ATURE+2.178
Neg logit contributions
 Crossing-3.683
zees-3.622
 Lama-3.470
 Ski-3.322
eding-3.277
Tokenbert
Token ID4835
Feature activation+1.26e+00

Pos logit contributions
mented+7.712
ARI+7.205
FTWARE+7.097
Dialogue+7.062
well+6.808
Neg logit contributions
zees-12.223
 Crossing-12.146
 Lama-11.897
eding-11.267
MQ-11.248
Token and
Token ID290
Feature activation-2.23e+00

Pos logit contributions
ARI+0.831
Dialogue+0.824
mented+0.822
FTWARE+0.814
 secretaries+0.810
Neg logit contributions
 Crossing-1.028
zees-1.011
 Lama-0.945
eding-0.909
 Ski-0.903
Token the
Token ID262
Feature activation+5.89e-01

Pos logit contributions
 Crossing+1.593
zees+1.560
 Lama+1.453
eding+1.410
 Ski+1.384
Neg logit contributions
Dialogue-1.700
FTWARE-1.692
ARI-1.691
mented-1.671
 secretaries-1.640
Token ACLU
Token ID24381
Feature activation+8.32e+00

Pos logit contributions
ARI+0.376
Dialogue+0.375
FTWARE+0.375
 secretaries+0.369
mented+0.368
Neg logit contributions
 Crossing-0.487
zees-0.487
 Lama-0.454
eding-0.441
 Ski-0.434
Token have
Token ID423
Feature activation+3.84e+00

Pos logit contributions
ARI+5.247
 secretaries+5.067
Dialogue+4.996
 elements+4.953
FTWARE+4.885
Neg logit contributions
 Crossing-6.821
zees-6.710
 Lama-6.271
digy-5.988
eding-5.983
Token black
Token ID2042
Feature activation-7.13e+00

Pos logit contributions
 Crossing+1.944
zees+1.916
 Lama+1.809
eding+1.758
 Ski+1.731
Neg logit contributions
Dialogue-1.405
FTWARE-1.398
ARI-1.394
mented-1.376
 secretaries-1.348
Tokenlists
Token ID20713
Feature activation-7.56e+00

Pos logit contributions
 Crossing+7.437
zees+7.335
 Lama+7.023
eding+6.959
 Ski+6.827
Neg logit contributions
Dialogue-2.971
ARI-2.935
FTWARE-2.908
mented-2.736
 secretaries-2.629
Token.
Token ID13
Feature activation+6.80e+00

Pos logit contributions
 Crossing+5.322
zees+5.130
 Lama+4.864
eding+4.790
 Ski+4.631
Neg logit contributions
Dialogue-5.777
FTWARE-5.711
ARI-5.689
mented-5.578
 secretaries-5.447
Token
Token ID198
Feature activation+1.41e+00

Pos logit contributions
ARI+3.953
FTWARE+3.927
Dialogue+3.908
mented+3.808
 elements+3.618
Neg logit contributions
zees-5.802
 Crossing-5.669
eding-5.337
 Lama-5.316
digy-5.205
Token
Token ID198
Feature activation+2.36e+00

Pos logit contributions
Dialogue+0.854
ARI+0.847
FTWARE+0.836
mented+0.832
 secretaries+0.776
Neg logit contributions
 Crossing-1.227
zees-1.210
 Lama-1.135
eding-1.105
 Ski-1.090
TokenEFF
Token ID37267
Feature activation+1.43e+01

Pos logit contributions
ARI+1.420
Dialogue+1.416
FTWARE+1.389
mented+1.356
 secretaries+1.268
Neg logit contributions
 Crossing-2.097
zees-2.055
 Lama-1.944
eding-1.889
 Ski-1.877
Token director
Token ID3437
Feature activation-1.22e+00

Pos logit contributions
ARI+9.029
Dialogue+8.534
 secretaries+8.399
mented+8.383
 elements+8.289
Neg logit contributions
zees-10.505
 Crossing-10.204
 Lama-9.409
digy-9.117
eding-9.113
Token for
Token ID329
Feature activation+3.33e+00

Pos logit contributions
 Crossing+0.926
zees+0.908
 Lama+0.836
eding+0.825
 Ski+0.810
Neg logit contributions
Dialogue-0.881
ARI-0.880
FTWARE-0.874
mented-0.868
 secretaries-0.854
Token international
Token ID3230
Feature activation-2.61e+00

Pos logit contributions
Dialogue+2.125
FTWARE+2.119
mented+2.114
ARI+2.109
 secretaries+2.070
Neg logit contributions
zees-2.748
 Crossing-2.664
 Lama-2.524
eding-2.452
owler-2.397
Token freedom
Token ID4925
Feature activation-1.03e-01

Pos logit contributions
 Crossing+2.140
zees+2.086
 Lama+1.975
eding+1.924
 Ski+1.894
Neg logit contributions
Dialogue-1.722
ARI-1.710
FTWARE-1.709
mented-1.684
 secretaries-1.642
Token of
Token ID286
Feature activation-1.92e+00

Pos logit contributions
 Crossing+0.079
zees+0.078
 Lama+0.073
eding+0.071
 Ski+0.070
Neg logit contributions
ARI-0.072
FTWARE-0.072
Dialogue-0.071
mented-0.070
 secretaries-0.070
Token [
Token ID685
Feature activation-2.18e+00

Pos logit contributions
Dialogue+0.428
ARI+0.425
FTWARE+0.419
mented+0.414
 secretaries+0.409
Neg logit contributions
zees-0.483
 Crossing-0.480
 Lama-0.445
eding-0.436
owler-0.425
Token edit
Token ID4370
Feature activation+4.02e+00

Pos logit contributions
 Crossing+1.764
zees+1.737
 Lama+1.631
eding+1.588
 Ski+1.566
Neg logit contributions
Dialogue-1.466
ARI-1.448
FTWARE-1.436
mented-1.426
 secretaries-1.386
Token ]
Token ID2361
Feature activation-1.31e+00

Pos logit contributions
Dialogue+2.445
ARI+2.436
mented+2.400
FTWARE+2.271
 elements+2.252
Neg logit contributions
zees-3.482
 Crossing-3.380
eding-3.148
 Lama-3.144
digy-3.050
Token
Token ID198
Feature activation+2.14e-01

Pos logit contributions
 Crossing+1.073
zees+1.050
 Lama+0.991
eding+0.967
 Ski+0.951
Neg logit contributions
Dialogue-0.858
ARI-0.851
FTWARE-0.850
mented-0.838
 secretaries-0.815
Token
Token ID198
Feature activation+9.24e-01

Pos logit contributions
Dialogue+0.136
ARI+0.135
FTWARE+0.134
mented+0.133
 secretaries+0.127
Neg logit contributions
 Crossing-0.180
zees-0.177
 Lama-0.167
eding-0.162
 Ski-0.160
TokenOr
Token ID5574
Feature activation+1.42e+01

Pos logit contributions
Dialogue+0.572
ARI+0.555
FTWARE+0.547
mented+0.533
 secretaries+0.508
Neg logit contributions
 Crossing-0.815
zees-0.805
 Lama-0.760
eding-0.739
 Ski-0.731
Tokencs
Token ID6359
Feature activation-1.57e+00

Pos logit contributions
mented+8.980
Dialogue+8.666
well+8.571
elve+8.303
 elements+8.108
Neg logit contributions
zees-10.252
 Crossing-9.988
 Lama-9.349
eding-9.295
owler-9.188
Token sent
Token ID1908
Feature activation+3.43e+00

Pos logit contributions
 Crossing+1.192
zees+1.175
 Lama+1.102
eding+1.072
 Ski+1.048
Neg logit contributions
Dialogue-1.122
ARI-1.115
FTWARE-1.107
mented-1.099
 secretaries-1.073
Token by
Token ID416
Feature activation+1.17e+00

Pos logit contributions
ARI+2.304
Dialogue+2.293
mented+2.260
FTWARE+2.198
 secretaries+2.190
Neg logit contributions
zees-2.692
 Crossing-2.690
 Lama-2.465
eding-2.415
digy-2.397
Token Sar
Token ID6866
Feature activation+2.15e+00

Pos logit contributions
Dialogue+0.917
ARI+0.907
mented+0.901
FTWARE+0.897
 secretaries+0.888
Neg logit contributions
 Crossing-0.790
zees-0.789
 Lama-0.712
eding-0.700
digy-0.682
Tokenuman
Token ID3778
Feature activation-4.78e-01

Pos logit contributions
mented+1.443
Dialogue+1.435
FTWARE+1.399
ARI+1.387
elve+1.359
Neg logit contributions
 Crossing-1.728
zees-1.664
 Lama-1.587
 Ski-1.527
eding-1.520
Token the
Token ID262
Feature activation+1.85e+00

Pos logit contributions
Dialogue+0.482
FTWARE+0.476
ARI+0.474
mented+0.474
 secretaries+0.463
Neg logit contributions
 Crossing-0.532
zees-0.517
 Lama-0.488
eding-0.475
 Ski-0.467
Token drop
Token ID4268
Feature activation-1.39e+00

Pos logit contributions
Dialogue+1.172
ARI+1.158
mented+1.152
FTWARE+1.150
 secretaries+1.128
Neg logit contributions
 Crossing-1.533
zees-1.511
 Lama-1.417
eding-1.386
 Ski-1.360
Token of
Token ID286
Feature activation-1.18e+00

Pos logit contributions
 Crossing+1.093
zees+1.068
 Lama+1.000
eding+0.977
 Ski+0.960
Neg logit contributions
Dialogue-0.964
ARI-0.957
mented-0.949
FTWARE-0.948
 secretaries-0.923
Token water
Token ID1660
Feature activation-4.11e+00

Pos logit contributions
 Crossing+0.953
zees+0.929
 Lama+0.874
eding+0.852
 Ski+0.836
Neg logit contributions
Dialogue-0.798
ARI-0.793
FTWARE-0.792
mented-0.784
 secretaries-0.766
Token)
Token ID8
Feature activation+3.36e+00

Pos logit contributions
 Crossing+3.108
zees+3.026
 Lama+2.847
eding+2.767
 Ski+2.724
Neg logit contributions
Dialogue-2.963
FTWARE-2.957
ARI-2.951
mented-2.912
 secretaries-2.851
Token remain
Token ID3520
Feature activation+1.42e+01

Pos logit contributions
Dialogue+2.282
ARI+2.276
mented+2.221
 secretaries+2.220
FTWARE+2.215
Neg logit contributions
 Crossing-2.596
zees-2.554
 Lama-2.407
MQ-2.309
eding-2.306
Token the
Token ID262
Feature activation+4.14e+00

Pos logit contributions
 elements+7.666
 secretaries+7.489
mented+7.377
Dialogue+7.112
ARI+7.102
Neg logit contributions
 Crossing-11.492
zees-11.415
owler-10.487
 Lama-10.403
MQ-10.342
Token same
Token ID976
Feature activation+4.16e+00

Pos logit contributions
Dialogue+1.990
ARI+1.986
 secretaries+1.983
mented+1.974
 elements+1.974
Neg logit contributions
 Crossing-3.996
zees-3.924
 Lama-3.700
eding-3.656
 Ski-3.588
Token throughout
Token ID3690
Feature activation-8.61e-01

Pos logit contributions
Dialogue+2.642
ARI+2.603
FTWARE+2.592
mented+2.589
 secretaries+2.565
Neg logit contributions
 Crossing-3.383
zees-3.359
 Lama-3.136
eding-3.037
MQ-3.020
Token a
Token ID257
Feature activation+2.25e-01

Pos logit contributions
 Crossing+0.651
zees+0.633
 Lama+0.596
eding+0.577
 Ski+0.569
Neg logit contributions
Dialogue-0.623
FTWARE-0.617
ARI-0.617
mented-0.611
 secretaries-0.599
Token block
Token ID2512
Feature activation+8.64e-01

Pos logit contributions
Dialogue+0.155
ARI+0.153
FTWARE+0.153
mented+0.151
 secretaries+0.149
Neg logit contributions
 Crossing-0.177
zees-0.174
 Lama-0.163
eding-0.159
 Ski-0.156
Token got
Token ID1392
Feature activation-7.46e-01

Pos logit contributions
mented+1.076
ARI+1.072
Dialogue+1.070
FTWARE+1.060
 secretaries+1.054
Neg logit contributions
 Crossing-1.463
zees-1.433
 Lama-1.340
MQ-1.295
eding-1.289
Token up
Token ID510
Feature activation-4.66e+00

Pos logit contributions
 Crossing+0.622
zees+0.610
 Lama+0.571
eding+0.556
 Ski+0.545
Neg logit contributions
ARI-0.479
Dialogue-0.478
FTWARE-0.474
mented-0.472
 secretaries-0.465
Token to
Token ID284
Feature activation-1.57e+00

Pos logit contributions
 Crossing+3.446
zees+3.330
 Lama+3.162
eding+3.065
 Ski+3.040
Neg logit contributions
Dialogue-3.432
FTWARE-3.397
ARI-3.394
mented-3.363
 secretaries-3.255
Token leave
Token ID2666
Feature activation+3.64e+00

Pos logit contributions
 Crossing+1.204
zees+1.178
 Lama+1.105
eding+1.069
 Ski+1.052
Neg logit contributions
Dialogue-1.111
ARI-1.105
FTWARE-1.103
mented-1.089
 secretaries-1.070
Token and
Token ID290
Feature activation-5.85e+00

Pos logit contributions
ARI+2.435
Dialogue+2.411
 secretaries+2.387
FTWARE+2.383
mented+2.369
Neg logit contributions
zees-2.897
 Crossing-2.836
 Lama-2.609
eding-2.588
digy-2.493
Token tri
Token ID1333
Feature activation-1.91e+01

Pos logit contributions
 Crossing+4.668
zees+4.501
 Lama+4.303
eding+4.201
 Ski+4.163
Neg logit contributions
Dialogue-3.951
FTWARE-3.932
ARI-3.913
mented-3.846
 secretaries-3.699
Tokenpped
Token ID1496
Feature activation-3.61e+00

Pos logit contributions
zees+8.586
eding+7.764
 Crossing+7.676
owler+7.121
digy+7.071
Neg logit contributions
 secretaries-17.094
 adolesc-16.677
 millenn-16.665
Dialogue-16.492
ARI-16.376
Token,
Token ID11
Feature activation+1.50e+00

Pos logit contributions
 Crossing+2.684
zees+2.600
 Lama+2.467
eding+2.390
 Ski+2.358
Neg logit contributions
Dialogue-2.647
FTWARE-2.625
ARI-2.623
mented-2.593
 secretaries-2.522
Token falling
Token ID7463
Feature activation+4.54e+00

Pos logit contributions
ARI+1.077
Dialogue+1.075
FTWARE+1.068
mented+1.067
 secretaries+1.065
Neg logit contributions
zees-1.104
 Crossing-1.098
 Lama-1.006
eding-0.977
digy-0.954
Token backwards
Token ID16196
Feature activation-7.08e-01

Pos logit contributions
ARI+3.134
Dialogue+3.111
FTWARE+3.110
 secretaries+3.097
mented+3.054
Neg logit contributions
 Crossing-3.476
zees-3.436
 Lama-3.104
MQ-3.049
eding-3.026
Token on
Token ID319
Feature activation+3.40e+00

Pos logit contributions
 Crossing+0.525
zees+0.516
 Lama+0.479
eding+0.465
 Ski+0.456
Neg logit contributions
Dialogue-0.520
ARI-0.518
FTWARE-0.517
mented-0.510
 secretaries-0.503
Token brunch
Token ID50219
Feature activation-4.66e+00

Pos logit contributions
Dialogue+0.253
FTWARE+0.251
mented+0.251
ARI+0.251
 secretaries+0.244
Neg logit contributions
 Crossing-0.250
zees-0.245
 Lama-0.229
eding-0.223
 Ski-0.218
Token,
Token ID11
Feature activation-2.20e+00

Pos logit contributions
 Crossing+3.492
zees+3.388
 Lama+3.199
eding+3.119
 Ski+3.063
Neg logit contributions
Dialogue-3.376
FTWARE-3.340
ARI-3.339
mented-3.296
 secretaries-3.213
Token when
Token ID618
Feature activation-2.97e+00

Pos logit contributions
 Crossing+1.635
zees+1.603
 Lama+1.498
eding+1.453
 Ski+1.427
Neg logit contributions
Dialogue-1.608
ARI-1.603
FTWARE-1.601
mented-1.585
 secretaries-1.553
Token Tor
Token ID4022
Feature activation+7.94e+00

Pos logit contributions
 Crossing+2.255
zees+2.196
 Lama+2.067
eding+2.009
 Ski+1.976
Neg logit contributions
Dialogue-2.139
ARI-2.125
FTWARE-2.124
mented-2.097
 secretaries-2.049
Tokeni
Token ID72
Feature activation-4.13e-01

Pos logit contributions
mented+4.972
ARI+4.619
Dialogue+4.584
ario+4.314
FTWARE+4.285
Neg logit contributions
 Crossing-6.598
zees-6.534
 Lama-6.220
WAYS-6.008
-5.957
Token tri
Token ID1333
Feature activation-1.84e+01

Pos logit contributions
 Crossing+0.338
zees+0.333
 Lama+0.313
eding+0.305
 Ski+0.297
Neg logit contributions
Dialogue-0.269
ARI-0.268
FTWARE-0.266
mented-0.264
 secretaries-0.258
Tokenpped
Token ID1496
Feature activation-4.17e+00

Pos logit contributions
zees+8.321
eding+7.712
 Crossing+7.690
owler+7.397
digy+7.167
Neg logit contributions
 secretaries-16.129
FTWARE-15.905
Dialogue-15.898
 adolesc-15.405
 millenn-15.377
Token and
Token ID290
Feature activation-2.52e+00

Pos logit contributions
 Crossing+2.933
zees+2.816
 Lama+2.683
eding+2.592
 Ski+2.560
Neg logit contributions
Dialogue-3.225
FTWARE-3.195
ARI-3.188
mented-3.161
 secretaries-3.067
Token fell
Token ID3214
Feature activation+8.69e-01

Pos logit contributions
 Crossing+1.982
zees+1.929
 Lama+1.824
eding+1.774
 Ski+1.746
Neg logit contributions
Dialogue-1.746
FTWARE-1.734
ARI-1.734
mented-1.710
 secretaries-1.668
Token onto
Token ID4291
Feature activation+1.56e+00

Pos logit contributions
ARI+0.613
Dialogue+0.610
FTWARE+0.608
mented+0.601
 secretaries+0.593
Neg logit contributions
 Crossing-0.667
zees-0.661
 Lama-0.608
eding-0.592
 Ski-0.581
Token a
Token ID257
Feature activation+4.56e+00

Pos logit contributions
Dialogue+1.119
ARI+1.114
mented+1.110
FTWARE+1.103
 secretaries+1.089
Neg logit contributions
 Crossing-1.175
zees-1.168
 Lama-1.071
eding-1.042
MQ-1.021
Token
Token ID198
Feature activation+1.43e+00

Pos logit contributions
 Crossing+0.014
zees+0.014
 Lama+0.013
eding+0.013
 Ski+0.012
Neg logit contributions
Dialogue-0.011
ARI-0.010
FTWARE-0.010
mented-0.010
 secretaries-0.010
TokenClinton
Token ID16549
Feature activation+5.05e+00

Pos logit contributions
Dialogue+0.852
ARI+0.846
FTWARE+0.815
mented+0.803
 secretaries+0.764
Neg logit contributions
 Crossing-1.295
zees-1.268
 Lama-1.212
eding-1.171
 Ski-1.165
Token also
Token ID635
Feature activation-1.94e+00

Pos logit contributions
Dialogue+3.248
ARI+3.190
mented+3.164
 secretaries+3.156
FTWARE+3.031
Neg logit contributions
zees-4.051
 Crossing-4.026
 Lama-3.762
eding-3.627
digy-3.626
Token tried
Token ID3088
Feature activation+3.57e+00

Pos logit contributions
 Crossing+1.543
zees+1.505
 Lama+1.420
eding+1.380
 Ski+1.360
Neg logit contributions
Dialogue-1.321
ARI-1.313
FTWARE-1.309
mented-1.295
 secretaries-1.267
Token to
Token ID284
Feature activation+2.55e+00

Pos logit contributions
ARI+2.209
Dialogue+2.177
FTWARE+2.162
mented+2.158
 secretaries+2.130
Neg logit contributions
 Crossing-3.021
zees-2.967
 Lama-2.815
eding-2.720
 Ski-2.683
Token tri
Token ID1333
Feature activation-1.84e+01

Pos logit contributions
Dialogue+1.685
ARI+1.681
mented+1.664
 secretaries+1.652
FTWARE+1.647
Neg logit contributions
 Crossing-2.067
zees-2.010
 Lama-1.898
eding-1.820
 Ski-1.804
Tokenang
Token ID648
Feature activation-3.16e+00

Pos logit contributions
 Crossing+10.865
zees+9.923
digy+9.709
 Lama+9.498
owler+9.357
Neg logit contributions
FTWARE-14.146
 secretaries-13.357
ARI-13.141
Dialogue-13.093
mented-12.868
Tokenulate
Token ID5039
Feature activation-4.55e+00

Pos logit contributions
 Crossing+1.815
zees+1.750
 Lama+1.628
eding+1.561
 Ski+1.524
Neg logit contributions
Dialogue-2.839
FTWARE-2.837
ARI-2.825
mented-2.792
 secretaries-2.744
Token on
Token ID319
Feature activation+1.92e-01

Pos logit contributions
 Crossing+3.331
zees+3.235
 Lama+3.042
eding+2.954
 Ski+2.905
Neg logit contributions
Dialogue-3.380
FTWARE-3.373
ARI-3.348
mented-3.309
 secretaries-3.217
Token the
Token ID262
Feature activation-1.92e+00

Pos logit contributions
ARI+0.132
Dialogue+0.132
FTWARE+0.131
mented+0.131
 secretaries+0.130
Neg logit contributions
 Crossing-0.150
zees-0.147
 Lama-0.138
eding-0.134
 Ski-0.132
Token Planned
Token ID18451
Feature activation-4.04e+00

Pos logit contributions
 Crossing+1.454
zees+1.426
 Lama+1.335
eding+1.296
 Ski+1.273
Neg logit contributions
Dialogue-1.382
ARI-1.375
FTWARE-1.371
mented-1.353
 secretaries-1.335
Tokengal
Token ID13528
Feature activation-1.58e+00

Pos logit contributions
 Crossing+1.258
zees+1.214
 Lama+1.114
eding+1.070
 Ski+1.044
Neg logit contributions
Dialogue-2.107
ARI-2.097
FTWARE-2.096
mented-2.075
 secretaries-2.039
Token blo
Token ID24924
Feature activation-9.18e-01

Pos logit contributions
 Crossing+1.310
zees+1.279
 Lama+1.211
eding+1.180
 Ski+1.161
Neg logit contributions
Dialogue-1.027
FTWARE-1.020
ARI-1.019
mented-1.005
 secretaries-0.981
Tokenoms
Token ID3150
Feature activation-2.42e+00

Pos logit contributions
 Crossing+0.620
zees+0.601
 Lama+0.562
eding+0.542
 Ski+0.533
Neg logit contributions
Dialogue-0.738
ARI-0.734
FTWARE-0.734
mented-0.726
 secretaries-0.710
Token,
Token ID11
Feature activation-6.63e-02

Pos logit contributions
 Crossing+1.881
zees+1.831
 Lama+1.729
eding+1.682
 Ski+1.653
Neg logit contributions
Dialogue-1.701
FTWARE-1.689
ARI-1.686
mented-1.663
 secretaries-1.630
Token the
Token ID262
Feature activation-3.73e+00

Pos logit contributions
 Crossing+0.053
zees+0.052
 Lama+0.048
eding+0.047
 Ski+0.046
Neg logit contributions
Dialogue-0.045
ARI-0.045
FTWARE-0.045
mented-0.044
 secretaries-0.043
Token rapid
Token ID5801
Feature activation-1.63e+01

Pos logit contributions
 Crossing+2.769
zees+2.688
 Lama+2.535
eding+2.461
 Ski+2.420
Neg logit contributions
Dialogue-2.739
FTWARE-2.722
ARI-2.716
mented-2.683
 secretaries-2.620
Token increase
Token ID2620
Feature activation-8.34e+00

Pos logit contributions
 Crossing+10.447
eding+9.921
zees+9.917
 Lama+9.735
 Ski+9.265
Neg logit contributions
Dialogue-11.713
FTWARE-11.399
mented-11.050
ARI-10.969
 secretaries-10.641
Token of
Token ID286
Feature activation-4.50e+00

Pos logit contributions
 Crossing+5.471
zees+5.288
 Lama+4.938
eding+4.828
 Ski+4.748
Neg logit contributions
Dialogue-6.630
FTWARE-6.559
ARI-6.404
mented-6.370
 secretaries-6.303
Token algae
Token ID33773
Feature activation-8.96e-01

Pos logit contributions
 Crossing+3.513
zees+3.407
 Lama+3.255
eding+3.156
 Ski+3.106
Neg logit contributions
Dialogue-3.126
FTWARE-3.103
ARI-3.082
mented-3.045
 secretaries-2.971
Token within
Token ID1626
Feature activation+8.32e-01

Pos logit contributions
 Crossing+0.721
zees+0.708
 Lama+0.664
eding+0.645
 Ski+0.638
Neg logit contributions
Dialogue-0.603
ARI-0.602
FTWARE-0.597
mented-0.590
 secretaries-0.577
Token a
Token ID257
Feature activation+1.76e+00

Pos logit contributions
ARI+0.608
Dialogue+0.606
mented+0.606
FTWARE+0.602
 secretaries+0.587
Neg logit contributions
 Crossing-0.615
zees-0.600
 Lama-0.557
eding-0.542
 Ski-0.535
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
 Crossing+0.394
zees+0.389
 Lama+0.362
eding+0.353
 Ski+0.347
Neg logit contributions
Dialogue-0.312
FTWARE-0.310
ARI-0.309
mented-0.307
 secretaries-0.300
Token full
Token ID1336
Feature activation-2.64e+00

Pos logit contributions
 Crossing+1.342
zees+1.317
 Lama+1.236
eding+1.206
 Ski+1.184
Neg logit contributions
Dialogue-1.105
ARI-1.099
FTWARE-1.097
mented-1.085
 secretaries-1.059
Token Shim
Token ID31698
Feature activation+9.84e-01

Pos logit contributions
 Crossing+2.136
zees+2.082
 Lama+1.971
eding+1.920
 Ski+1.890
Neg logit contributions
Dialogue-1.763
ARI-1.750
FTWARE-1.749
mented-1.724
 secretaries-1.682
Tokenano
Token ID5733
Feature activation-2.07e-01

Pos logit contributions
ARI+0.678
Dialogue+0.675
mented+0.660
FTWARE+0.658
 elements+0.642
Neg logit contributions
 Crossing-0.764
zees-0.758
 Lama-0.705
eding-0.688
 Ski-0.673
Token 600
Token ID10053
Feature activation-2.05e+00

Pos logit contributions
 Crossing+0.157
zees+0.156
 Lama+0.144
eding+0.141
 Ski+0.137
Neg logit contributions
Dialogue-0.148
ARI-0.147
FTWARE-0.147
mented-0.145
 secretaries-0.142
Token tri
Token ID1333
Feature activation-1.63e+01

Pos logit contributions
 Crossing+1.597
zees+1.570
 Lama+1.467
eding+1.426
 Ski+1.398
Neg logit contributions
Dialogue-1.439
ARI-1.434
FTWARE-1.429
mented-1.414
 secretaries-1.374
Token-
Token ID12
Feature activation-5.29e+00

Pos logit contributions
 Crossing+9.592
 Lama+9.021
zees+8.896
eding+8.811
 Ski+8.569
Neg logit contributions
Dialogue-12.464
FTWARE-12.442
ARI-12.386
 secretaries-12.202
mented-11.788
Tokencolor
Token ID8043
Feature activation-1.62e+00

Pos logit contributions
 Crossing+4.235
zees+4.104
 Lama+3.897
eding+3.795
 Ski+3.738
Neg logit contributions
FTWARE-3.568
Dialogue-3.554
ARI-3.550
mented-3.492
 secretaries-3.426
Token group
Token ID1448
Feature activation+1.31e-01

Pos logit contributions
 Crossing+1.360
zees+1.343
 Lama+1.256
eding+1.223
 Ski+1.203
Neg logit contributions
Dialogue-1.031
ARI-1.027
FTWARE-1.024
mented-1.016
 secretaries-0.987
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
ARI+0.090
Dialogue+0.090
FTWARE+0.090
mented+0.089
 secretaries+0.086
Neg logit contributions
 Crossing-0.104
zees-0.102
 Lama-0.096
eding-0.092
 Ski-0.091
Token including
Token ID1390
Feature activation-2.91e-01

Pos logit contributions
 Crossing+1.399
zees+1.370
 Lama+1.284
eding+1.252
 Ski+1.228
Neg logit contributions
Dialogue-1.222
FTWARE-1.215
ARI-1.213
mented-1.200
 secretaries-1.172
Tokent
Token ID83
Feature activation-1.86e+00

Pos logit contributions
ARI+0.173
Dialogue+0.173
mented+0.172
FTWARE+0.171
 secretaries+0.165
Neg logit contributions
 Crossing-0.193
zees-0.187
 Lama-0.176
eding-0.170
 Ski-0.169
Token pay
Token ID1414
Feature activation-5.39e+00

Pos logit contributions
 Crossing+1.508
zees+1.475
 Lama+1.391
eding+1.355
 Ski+1.331
Neg logit contributions
Dialogue-1.245
ARI-1.237
FTWARE-1.235
mented-1.221
 secretaries-1.192
Token attention
Token ID3241
Feature activation-2.78e+00

Pos logit contributions
 Crossing+4.103
zees+3.956
 Lama+3.769
eding+3.647
 Ski+3.604
Neg logit contributions
Dialogue-3.846
FTWARE-3.833
ARI-3.781
mented-3.713
 secretaries-3.624
Token to
Token ID284
Feature activation-7.70e-01

Pos logit contributions
 Crossing+2.219
zees+2.162
 Lama+2.044
eding+1.988
 Ski+1.958
Neg logit contributions
Dialogue-1.895
FTWARE-1.880
ARI-1.878
mented-1.852
 secretaries-1.808
Token the
Token ID262
Feature activation-4.97e-01

Pos logit contributions
 Crossing+0.588
zees+0.581
 Lama+0.540
eding+0.527
 Ski+0.517
Neg logit contributions
Dialogue-0.546
ARI-0.543
FTWARE-0.541
mented-0.538
 secretaries-0.529
Token tri
Token ID1333
Feature activation-1.60e+01

Pos logit contributions
 Crossing+0.404
zees+0.399
 Lama+0.372
eding+0.364
 Ski+0.357
Neg logit contributions
Dialogue-0.329
ARI-0.327
FTWARE-0.324
mented-0.322
 secretaries-0.318
Tokenpling
Token ID11347
Feature activation-8.15e+00

Pos logit contributions
zees+11.446
 Crossing+11.396
 Lama+10.914
eding+10.841
digy+10.332
Neg logit contributions
FTWARE-10.248
Dialogue-9.756
ARI-9.616
 secretaries-9.525
mented-9.221
Token of
Token ID286
Feature activation-2.91e+00

Pos logit contributions
 Crossing+5.687
zees+5.462
 Lama+5.230
eding+5.061
 Ski+4.975
Neg logit contributions
Dialogue-6.202
FTWARE-6.148
ARI-6.068
mented-6.006
 secretaries-5.829
Token premiums
Token ID21884
Feature activation-5.66e+00

Pos logit contributions
 Crossing+2.163
zees+2.113
 Lama+1.980
eding+1.923
 Ski+1.888
Neg logit contributions
Dialogue-2.137
ARI-2.130
FTWARE-2.124
mented-2.103
 secretaries-2.060
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
 Crossing+4.231
zees+4.095
 Lama+3.868
eding+3.764
 Ski+3.695
Neg logit contributions
Dialogue-4.133
FTWARE-4.115
ARI-4.077
mented-4.034
 secretaries-3.941
Token doubling
Token ID26862
Feature activation-7.84e+00

Pos logit contributions
 Crossing+0.980
zees+0.967
 Lama+0.902
eding+0.873
 Ski+0.859
Neg logit contributions
ARI-0.945
Dialogue-0.945
FTWARE-0.939
mented-0.936
 secretaries-0.915
Token is
Token ID318
Feature activation+1.95e+00

Pos logit contributions
ARI+0.822
Dialogue+0.814
mented+0.810
FTWARE+0.800
 secretaries+0.787
Neg logit contributions
 Crossing-0.958
zees-0.937
 Lama-0.882
eding-0.852
 Ski-0.841
Token one
Token ID530
Feature activation-6.88e+00

Pos logit contributions
ARI+1.285
Dialogue+1.278
FTWARE+1.263
mented+1.259
 secretaries+1.240
Neg logit contributions
 Crossing-1.578
zees-1.553
 Lama-1.445
eding-1.412
 Ski-1.390
Token element
Token ID5002
Feature activation+4.17e+00

Pos logit contributions
 Crossing+4.363
zees+4.257
 Lama+3.977
eding+3.917
 Ski+3.796
Neg logit contributions
Dialogue-5.626
FTWARE-5.539
ARI-5.501
mented-5.326
 secretaries-5.297
Token of
Token ID286
Feature activation-2.15e+00

Pos logit contributions
ARI+2.718
mented+2.679
Dialogue+2.674
 elements+2.625
 secretaries+2.606
Neg logit contributions
zees-3.303
 Crossing-3.285
 Lama-3.040
eding-2.986
 Ski-2.892
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
 Crossing+1.649
zees+1.612
 Lama+1.514
eding+1.474
 Ski+1.448
Neg logit contributions
Dialogue-1.532
ARI-1.521
FTWARE-1.519
mented-1.503
 secretaries-1.468
Token tri
Token ID1333
Feature activation-1.53e+01

Pos logit contributions
 Crossing+0.805
zees+0.790
 Lama+0.739
eding+0.720
 Ski+0.707
Neg logit contributions
Dialogue-0.747
ARI-0.741
FTWARE-0.740
mented-0.732
 secretaries-0.716
Tokenpty
Token ID5835
Feature activation-2.55e+00

Pos logit contributions
 Crossing+11.103
zees+11.024
eding+10.846
 Lama+10.522
digy+10.110
Neg logit contributions
FTWARE-9.393
 secretaries-9.342
Dialogue-9.073
ARI-8.822
mented-8.545
Tokench
Token ID354
Feature activation+1.43e-01

Pos logit contributions
 Crossing+2.316
zees+2.276
 Lama+2.155
eding+2.109
 Ski+2.077
Neg logit contributions
ARI-1.434
Dialogue-1.434
FTWARE-1.433
mented-1.423
 secretaries-1.386
Token called
Token ID1444
Feature activation+3.73e-01

Pos logit contributions
Dialogue+0.099
ARI+0.099
FTWARE+0.098
mented+0.097
 secretaries+0.095
Neg logit contributions
 Crossing-0.111
zees-0.111
 Lama-0.102
eding-0.099
 Ski-0.098
Token Echo
Token ID21455
Feature activation-4.39e+00

Pos logit contributions
Dialogue+0.281
FTWARE+0.277
ARI+0.277
mented+0.276
 secretaries+0.267
Neg logit contributions
 Crossing-0.267
zees-0.265
 Lama-0.243
eding-0.241
 Ski-0.232
Token that
Token ID326
Feature activation+3.32e+00

Pos logit contributions
 Crossing+3.087
zees+2.968
 Lama+2.798
eding+2.706
 Ski+2.668
Neg logit contributions
Dialogue-3.396
FTWARE-3.393
ARI-3.390
mented-3.336
 secretaries-3.292
Token Type
Token ID5994
Feature activation+8.30e-01

Pos logit contributions
 Crossing+1.084
zees+1.064
 Lama+0.996
eding+0.965
 Ski+0.956
Neg logit contributions
Dialogue-1.066
ARI-1.056
FTWARE-1.053
mented-1.041
 secretaries-1.026
Token 1
Token ID352
Feature activation-1.81e+00

Pos logit contributions
Dialogue+0.595
ARI+0.586
FTWARE+0.579
mented+0.574
 secretaries+0.567
Neg logit contributions
 Crossing-0.625
zees-0.619
 Lama-0.583
eding-0.562
 Ski-0.559
Token probe
Token ID12774
Feature activation+2.41e+00

Pos logit contributions
 Crossing+1.392
zees+1.357
 Lama+1.287
eding+1.248
 Ski+1.231
Neg logit contributions
ARI-1.285
Dialogue-1.285
FTWARE-1.268
mented-1.255
 secretaries-1.233
Token test
Token ID1332
Feature activation-2.00e+00

Pos logit contributions
ARI+1.657
Dialogue+1.656
mented+1.626
FTWARE+1.616
 secretaries+1.615
Neg logit contributions
 Crossing-1.854
zees-1.827
 Lama-1.727
 Ski-1.663
eding-1.656
Token (
Token ID357
Feature activation-9.71e-01

Pos logit contributions
 Crossing+1.576
zees+1.539
 Lama+1.451
eding+1.408
 Ski+1.395
Neg logit contributions
Dialogue-1.389
ARI-1.385
FTWARE-1.376
mented-1.362
 secretaries-1.334
Tokentri
Token ID28461
Feature activation-1.53e+01

Pos logit contributions
 Crossing+0.769
zees+0.750
 Lama+0.712
eding+0.687
 Ski+0.682
Neg logit contributions
Dialogue-0.674
ARI-0.670
FTWARE-0.659
mented-0.654
 secretaries-0.637
Tokenple
Token ID1154
Feature activation-8.08e+00

Pos logit contributions
zees+9.408
 Crossing+9.311
eding+8.739
 Lama+8.563
digy+8.313
Neg logit contributions
Dialogue-11.569
ARI-11.536
 secretaries-11.480
FTWARE-11.351
 colleg-10.727
Token diss
Token ID6249
Feature activation+1.30e+00

Pos logit contributions
 Crossing+6.310
zees+6.088
 Lama+5.768
eding+5.599
 Ski+5.494
Neg logit contributions
Dialogue-5.600
FTWARE-5.511
ARI-5.497
mented-5.345
 secretaries-5.263
Tokenociation
Token ID41003
Feature activation-1.36e+00

Pos logit contributions
Dialogue+0.847
ARI+0.842
FTWARE+0.840
mented+0.827
 secretaries+0.818
Neg logit contributions
 Crossing-1.063
zees-1.039
 Lama-0.982
 Ski-0.944
eding-0.933
Token|
Token ID91
Feature activation-1.93e+00

Pos logit contributions
 Crossing+1.048
zees+1.026
 Lama+0.967
eding+0.942
 Ski+0.929
Neg logit contributions
ARI-0.950
Dialogue-0.950
FTWARE-0.939
mented-0.932
 secretaries-0.918
Token,
Token ID11
Feature activation+3.43e+00

Pos logit contributions
 Crossing+1.560
zees+1.527
 Lama+1.443
eding+1.403
 Ski+1.387
Neg logit contributions
Dialogue-1.296
ARI-1.288
FTWARE-1.276
mented-1.265
 secretaries-1.232
Token smooth
Token ID7209
Feature activation-6.60e+00

Pos logit contributions
 Crossing+2.819
zees+2.737
 Lama+2.534
eding+2.466
 Ski+2.417
Neg logit contributions
Dialogue-3.581
ARI-3.565
FTWARE-3.554
mented-3.513
 secretaries-3.452
Token the
Token ID262
Feature activation-3.64e+00

Pos logit contributions
 Crossing+4.109
zees+3.942
 Lama+3.731
eding+3.643
 Ski+3.584
Neg logit contributions
Dialogue-5.495
FTWARE-5.473
ARI-5.469
 secretaries-5.403
mented-5.333
Token image
Token ID2939
Feature activation-1.10e+00

Pos logit contributions
 Crossing+3.262
zees+3.185
 Lama+3.041
eding+2.980
 Ski+2.935
Neg logit contributions
Dialogue-2.118
ARI-2.101
FTWARE-2.097
mented-2.049
 secretaries-2.016
Token using
Token ID1262
Feature activation-6.58e+00

Pos logit contributions
 Crossing+0.626
zees+0.607
 Lama+0.557
eding+0.535
 Ski+0.521
Neg logit contributions
Dialogue-0.994
FTWARE-0.992
ARI-0.986
mented-0.982
 secretaries-0.959
Token a
Token ID257
Feature activation-4.79e+00

Pos logit contributions
 Crossing+4.715
zees+4.525
 Lama+4.341
eding+4.218
 Ski+4.174
Neg logit contributions
ARI-4.895
FTWARE-4.868
Dialogue-4.833
mented-4.754
 secretaries-4.746
Token 3
Token ID513
Feature activation-1.48e+01

Pos logit contributions
 Crossing+3.756
zees+3.629
 Lama+3.435
eding+3.345
 Ski+3.309
Neg logit contributions
Dialogue-3.328
ARI-3.317
FTWARE-3.296
mented-3.210
 secretaries-3.206
Tokenx
Token ID87
Feature activation-6.50e+00

Pos logit contributions
 Crossing+4.993
zees+4.975
eding+4.292
digy+4.097
 Lama+3.898
Neg logit contributions
 millenn-16.625
 secretaries-15.546
FTWARE-15.306
 adolesc-15.125
ARI-14.809
Token3
Token ID18
Feature activation-7.90e+00

Pos logit contributions
 Crossing+4.490
zees+4.250
 Lama+3.994
eding+3.871
 Ski+3.844
Neg logit contributions
 secretaries-5.084
FTWARE-4.985
Dialogue-4.969
ARI-4.928
mented-4.883
Token Ga
Token ID12822
Feature activation+2.79e+00

Pos logit contributions
 Crossing+2.798
zees+2.528
 Lama+2.354
 Ski+2.137
eding+2.131
Neg logit contributions
ARI-8.767
Dialogue-8.722
FTWARE-8.696
 secretaries-8.585
mented-8.560
Tokenussian
Token ID31562
Feature activation-9.39e+00

Pos logit contributions
mented+2.017
ARI+2.012
FTWARE+2.012
Dialogue+2.008
elve+1.910
Neg logit contributions
 Crossing-2.118
zees-2.003
 Lama-1.907
 Ski-1.810
eding-1.800
Token,
Token ID11
Feature activation-7.84e+00

Pos logit contributions
 Crossing+6.343
 Lama+5.942
zees+5.935
 Ski+5.656
eding+5.545
Neg logit contributions
ARI-7.178
Dialogue-7.142
FTWARE-7.028
 secretaries-6.982
mented-6.862
Token F
Token ID376
Feature activation-1.05e+01

Pos logit contributions
 Crossing+4.083
zees+3.866
 Lama+3.618
eding+3.547
 Ski+3.503
Neg logit contributions
Dialogue-4.806
ARI-4.781
FTWARE-4.774
mented-4.671
 secretaries-4.663
TokenOV
Token ID8874
Feature activation-4.43e+00

Pos logit contributions
 Crossing+8.365
zees+8.243
 Lama+7.687
 Ski+7.623
eding+7.574
Neg logit contributions
 secretaries-6.587
 millenn-6.343
FTWARE-6.194
Dialogue-6.110
mented-6.080
Token even
Token ID772
Feature activation-4.92e+00

Pos logit contributions
 Crossing+3.502
zees+3.393
 Lama+3.229
eding+3.143
 Ski+3.096
Neg logit contributions
Dialogue-3.025
FTWARE-3.018
ARI-3.004
mented-2.928
 secretaries-2.896
Token double
Token ID4274
Feature activation-1.16e+01

Pos logit contributions
 Crossing+3.473
zees+3.329
 Lama+3.171
eding+3.034
 Ski+3.011
Neg logit contributions
ARI-3.773
FTWARE-3.766
Dialogue-3.763
mented-3.672
 secretaries-3.637
Token or
Token ID393
Feature activation-6.63e+00

Pos logit contributions
 Crossing+6.039
zees+5.613
eding+5.255
 Lama+5.187
 Ski+5.044
Neg logit contributions
Dialogue-10.477
FTWARE-10.427
ARI-10.288
 millenn-10.157
 secretaries-10.101
Token 4
Token ID604
Feature activation-1.46e+01

Pos logit contributions
 Crossing+5.204
zees+4.954
 Lama+4.799
eding+4.638
 Trace+4.627
Neg logit contributions
FTWARE-4.466
ARI-4.456
Dialogue-4.415
 secretaries-4.303
mented-4.288
Tokenx
Token ID87
Feature activation-2.64e+00

Pos logit contributions
 Crossing+5.123
zees+4.884
eding+4.113
MQ+4.057
digy+4.015
Neg logit contributions
 millenn-15.784
 secretaries-15.263
FTWARE-14.890
 adolesc-14.650
ARI-14.511
Token 1080
Token ID17729
Feature activation-8.68e+00

Pos logit contributions
 Crossing+1.910
zees+1.854
 Lama+1.743
eding+1.691
 Ski+1.659
Neg logit contributions
Dialogue-1.989
ARI-1.980
FTWARE-1.976
mented-1.952
 secretaries-1.916
TokenP
Token ID47
Feature activation-5.28e+00

Pos logit contributions
 Crossing+4.887
zees+4.603
 Lama+4.264
MQ+4.203
 Ski+4.138
Neg logit contributions
FTWARE-7.598
 secretaries-7.596
mented-7.507
ARI-7.467
Dialogue-7.435
Token resolution
Token ID6323
Feature activation-1.07e+01

Pos logit contributions
 Crossing+4.275
zees+4.126
 Lama+3.925
eding+3.840
 Trace+3.750
Neg logit contributions
FTWARE-3.516
Dialogue-3.514
ARI-3.478
mented-3.381
 secretaries-3.365
Token you
Token ID345
Feature activation-7.68e+00

Pos logit contributions
 Crossing+6.159
zees+6.096
eding+5.447
 Lama+5.379
 Ski+5.342
Neg logit contributions
FTWARE-9.164
Dialogue-9.064
ARI-9.061
 millenn-8.864
 secretaries-8.731
Token focusing
Token ID10759
Feature activation-2.07e+00

Pos logit contributions
 Crossing+1.958
zees+1.901
 Lama+1.802
eding+1.751
 Ski+1.727
Neg logit contributions
Dialogue-1.723
FTWARE-1.712
ARI-1.709
mented-1.683
 secretaries-1.645
Token problem
Token ID1917
Feature activation+1.84e+00

Pos logit contributions
 Crossing+1.551
zees+1.517
 Lama+1.418
eding+1.381
 Ski+1.356
Neg logit contributions
Dialogue-1.502
ARI-1.496
FTWARE-1.491
mented-1.474
 secretaries-1.444
Token.
Token ID13
Feature activation-9.24e-01

Pos logit contributions
Dialogue+1.244
ARI+1.243
mented+1.235
FTWARE+1.233
 secretaries+1.188
Neg logit contributions
 Crossing-1.460
zees-1.447
 Lama-1.354
eding-1.290
 Ski-1.285
Token
Token ID198
Feature activation-1.92e+00

Pos logit contributions
 Crossing+0.727
zees+0.726
 Lama+0.676
eding+0.664
 Ski+0.643
Neg logit contributions
Dialogue-0.635
ARI-0.627
FTWARE-0.624
mented-0.617
 secretaries-0.594
Token
Token ID198
Feature activation-9.46e-01

Pos logit contributions
 Crossing+1.561
zees+1.522
 Lama+1.440
eding+1.403
 Ski+1.381
Neg logit contributions
Dialogue-1.280
FTWARE-1.272
ARI-1.272
mented-1.254
 secretaries-1.228
TokenTri
Token ID14824
Feature activation-1.45e+01

Pos logit contributions
 Crossing+0.806
zees+0.787
 Lama+0.747
eding+0.723
 Ski+0.717
Neg logit contributions
Dialogue-0.606
ARI-0.598
FTWARE-0.592
mented-0.585
 secretaries-0.562
Tokenangular
Token ID21413
Feature activation-8.31e+00

Pos logit contributions
 Crossing+10.726
zees+10.488
 Lama+10.132
eding+9.889
digy+9.571
Neg logit contributions
 secretaries-8.937
FTWARE-8.890
ARI-8.835
Dialogue-8.788
mented-8.398
Token segregation
Token ID26718
Feature activation-6.96e+00

Pos logit contributions
 Crossing+7.248
zees+6.969
 Lama+6.674
eding+6.540
 Trace+6.454
Neg logit contributions
FTWARE-4.899
ARI-4.802
Dialogue-4.763
mented-4.642
 secretaries-4.580
Token
Token ID198
Feature activation-2.92e+00

Pos logit contributions
 Crossing+5.280
zees+5.092
 Lama+4.813
eding+4.700
 Ski+4.632
Neg logit contributions
FTWARE-4.945
ARI-4.859
Dialogue-4.855
mented-4.797
 secretaries-4.697
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
 Crossing+2.295
zees+2.222
 Lama+2.108
eding+2.056
 Ski+2.023
Neg logit contributions
Dialogue-2.004
FTWARE-2.003
ARI-1.994
mented-1.970
 secretaries-1.960
TokenAct
Token ID6398
Feature activation+7.05e+00

Pos logit contributions
 Crossing+1.482
zees+1.449
 Lama+1.375
eding+1.338
 Ski+1.320
Neg logit contributions
Dialogue-1.090
ARI-1.079
FTWARE-1.072
mented-1.057
 secretaries-1.023
Token but
Token ID475
Feature activation-7.98e+00

Pos logit contributions
 Crossing+0.630
zees+0.616
 Lama+0.580
eding+0.559
 Ski+0.551
Neg logit contributions
ARI-0.556
Dialogue-0.555
FTWARE-0.553
mented-0.553
 secretaries-0.540
Token had
Token ID550
Feature activation-8.97e+00

Pos logit contributions
 Crossing+5.827
zees+5.574
 Lama+5.299
eding+5.228
 Ski+5.096
Neg logit contributions
FTWARE-5.862
ARI-5.841
Dialogue-5.835
mented-5.687
elve-5.484
Token not
Token ID407
Feature activation-6.86e+00

Pos logit contributions
 Crossing+5.976
zees+5.788
eding+5.497
 Lama+5.456
 Ski+5.251
Neg logit contributions
FTWARE-6.983
Dialogue-6.969
ARI-6.925
mented-6.690
elve-6.516
Token done
Token ID1760
Feature activation-8.02e+00

Pos logit contributions
 Crossing+4.824
zees+4.650
 Lama+4.377
eding+4.373
 Ski+4.230
Neg logit contributions
Dialogue-5.213
ARI-5.205
FTWARE-5.155
mented-4.986
 secretaries-4.920
Token so
Token ID523
Feature activation-6.21e+00

Pos logit contributions
 Crossing+4.600
zees+4.407
 Lama+4.087
eding+4.040
 Ski+3.945
Neg logit contributions
Dialogue-7.002
ARI-6.990
FTWARE-6.981
mented-6.895
 secretaries-6.752
Token widely
Token ID6768
Feature activation-1.44e+01

Pos logit contributions
 Crossing+4.389
zees+4.234
 Lama+4.019
eding+3.938
 Ski+3.836
Neg logit contributions
Dialogue-4.738
ARI-4.697
FTWARE-4.687
mented-4.623
 secretaries-4.498
Token.
Token ID13
Feature activation+5.45e-01

Pos logit contributions
 Crossing+8.538
zees+7.963
eding+7.744
 Lama+7.646
 Ski+7.319
Neg logit contributions
Dialogue-11.657
ARI-11.256
FTWARE-11.166
mented-11.164
ATURE-10.965
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
Dialogue+0.366
ARI+0.360
FTWARE+0.359
mented+0.356
 secretaries+0.340
Neg logit contributions
zees-0.437
 Crossing-0.430
 Lama-0.402
eding-0.399
owler-0.386
Token
Token ID198
Feature activation+2.61e+00

Pos logit contributions
Dialogue+0.810
ARI+0.803
FTWARE+0.793
mented+0.789
 secretaries+0.736
Neg logit contributions
 Crossing-1.155
zees-1.136
 Lama-1.072
eding-1.040
 Ski-1.024
TokenO
Token ID46
Feature activation+2.02e+00

Pos logit contributions
Dialogue+1.502
ARI+1.461
FTWARE+1.414
mented+1.393
 secretaries+1.285
Neg logit contributions
 Crossing-2.441
zees-2.377
 Lama-2.290
eding-2.189
 Ski-2.180
Tokenuts
Token ID5500
Feature activation-2.50e+00

Pos logit contributions
FTWARE+1.420
mented+1.408
ARI+1.397
Dialogue+1.396
 secretaries+1.344
Neg logit contributions
 Crossing-1.567
zees-1.507
 Lama-1.446
 Ski-1.371
eding-1.359
Token coach
Token ID3985
Feature activation-3.52e+00

Pos logit contributions
 Crossing+0.363
zees+0.359
 Lama+0.336
eding+0.329
 Ski+0.323
Neg logit contributions
Dialogue-0.287
ARI-0.286
mented-0.285
FTWARE-0.284
 secretaries-0.279
Token,
Token ID11
Feature activation+7.64e-01

Pos logit contributions
 Crossing+2.669
zees+2.594
 Lama+2.454
eding+2.384
 Ski+2.342
Neg logit contributions
Dialogue-2.520
FTWARE-2.507
ARI-2.502
mented-2.463
 secretaries-2.410
Token
Token ID447
Feature activation-4.10e+00

Pos logit contributions
Dialogue+0.489
ARI+0.487
mented+0.485
FTWARE+0.484
 secretaries+0.471
Neg logit contributions
 Crossing-0.634
zees-0.627
 Lama-0.587
eding-0.570
 Ski-0.561
Token
Token ID251
Feature activation-7.02e+00

Pos logit contributions
 Crossing+2.102
zees+2.022
 Lama+1.848
eding+1.766
 Ski+1.712
Neg logit contributions
Dialogue-3.953
ARI-3.936
FTWARE-3.934
mented-3.886
 secretaries-3.832
Token A
Token ID317
Feature activation-6.41e-01

Pos logit contributions
 Crossing+5.267
zees+5.019
 Lama+4.805
eding+4.659
 Ski+4.593
Neg logit contributions
Dialogue-5.002
FTWARE-4.997
ARI-4.931
mented-4.863
 secretaries-4.803
Tokendu
Token ID646
Feature activation-1.42e+01

Pos logit contributions
 Crossing+0.566
zees+0.553
 Lama+0.523
eding+0.510
 Ski+0.504
Neg logit contributions
Dialogue-0.383
ARI-0.380
FTWARE-0.377
mented-0.376
 secretaries-0.361
Token says
Token ID1139
Feature activation-6.50e+00

Pos logit contributions
 Crossing+8.658
zees+8.195
 Lama+8.069
eding+7.833
 Ski+7.602
Neg logit contributions
FTWARE-10.965
Dialogue-10.897
ARI-10.809
 secretaries-10.418
mented-10.307
Token.
Token ID13
Feature activation+5.01e-01

Pos logit contributions
 Crossing+4.391
zees+4.222
 Lama+4.025
eding+3.871
 Ski+3.827
Neg logit contributions
FTWARE-5.063
Dialogue-5.048
ARI-5.011
mented-4.932
 secretaries-4.862
TokenA
Token ID32
Feature activation+3.58e+00

Pos logit contributions
Dialogue+0.308
ARI+0.306
FTWARE+0.301
mented+0.299
 secretaries+0.287
Neg logit contributions
 Crossing-0.431
zees-0.428
 Lama-0.401
eding-0.394
 Ski-0.385
Tokendu
Token ID646
Feature activation-9.86e+00

Pos logit contributions
Dialogue+2.249
ARI+2.241
mented+2.236
FTWARE+2.213
 secretaries+2.138
Neg logit contributions
 Crossing-2.948
zees-2.920
 Lama-2.714
eding-2.659
 Ski-2.609
Token's
Token ID338
Feature activation-5.47e+00

Pos logit contributions
 Crossing+7.258
zees+6.913
 Lama+6.746
eding+6.585
 Ski+6.466
Neg logit contributions
FTWARE-6.965
Dialogue-6.908
ARI-6.807
 secretaries-6.544
mented-6.529
Token a
Token ID257
Feature activation-3.78e+00

Pos logit contributions
 Crossing+5.630
zees+5.287
 Lama+5.123
eding+5.095
 Trace+4.861
Neg logit contributions
FTWARE-6.422
ARI-6.371
Dialogue-6.308
mented-6.230
 secretaries-6.223
Token concern
Token ID2328
Feature activation+1.83e+00

Pos logit contributions
 Crossing+3.316
zees+3.209
 Lama+3.067
eding+3.023
 Ski+2.947
Neg logit contributions
Dialogue-2.267
FTWARE-2.244
ARI-2.239
mented-2.204
 secretaries-2.161
Token though
Token ID996
Feature activation-6.77e+00

Pos logit contributions
mented+1.281
FTWARE+1.281
Dialogue+1.281
ARI+1.272
 secretaries+1.231
Neg logit contributions
 Crossing-1.405
zees-1.370
 Lama-1.303
 Trace-1.234
eding-1.231
Token since
Token ID1201
Feature activation-7.11e+00

Pos logit contributions
 Crossing+3.971
zees+3.866
 Lama+3.554
eding+3.434
 Ski+3.374
Neg logit contributions
Dialogue-5.982
ARI-5.972
FTWARE-5.904
mented-5.848
 secretaries-5.743
Token most
Token ID749
Feature activation-9.13e+00

Pos logit contributions
 Crossing+3.352
zees+3.189
 Lama+2.866
eding+2.771
 Ski+2.757
Neg logit contributions
Dialogue-7.081
FTWARE-7.046
ARI-6.981
mented-6.922
 secretaries-6.818
Token of
Token ID286
Feature activation-1.40e+01

Pos logit contributions
 Crossing+5.498
zees+5.198
 Lama+4.945
eding+4.840
 Ski+4.733
Neg logit contributions
FTWARE-7.592
ARI-7.575
Dialogue-7.561
 secretaries-7.478
 millenn-7.417
Token the
Token ID262
Feature activation-8.58e+00

Pos logit contributions
 Crossing+6.426
zees+5.947
 Lama+5.732
eding+5.671
 Ski+5.487
Neg logit contributions
 millenn-13.319
Dialogue-12.687
ARI-12.665
FTWARE-12.529
 secretaries-12.472
Token time
Token ID640
Feature activation-4.96e+00

Pos logit contributions
 Crossing+5.229
zees+5.081
 Lama+4.741
eding+4.694
 Trace+4.517
Neg logit contributions
ARI-7.215
Dialogue-7.200
FTWARE-7.032
 secretaries-6.960
mented-6.932
Token I
Token ID314
Feature activation-1.02e+00

Pos logit contributions
 Crossing+3.231
zees+3.100
 Lama+2.896
eding+2.813
 Ski+2.788
Neg logit contributions
Dialogue-4.074
ARI-4.063
FTWARE-4.059
mented-4.009
 secretaries-3.960
Token don
Token ID836
Feature activation-4.88e+00

Pos logit contributions
 Crossing+0.853
zees+0.834
 Lama+0.789
eding+0.768
 Ski+0.757
Neg logit contributions
Dialogue-0.657
ARI-0.652
FTWARE-0.652
mented-0.643
 secretaries-0.627
Token't
Token ID470
Feature activation-3.45e+00

Pos logit contributions
 Crossing+3.326
zees+3.223
 Lama+3.026
eding+2.978
 Ski+2.891
Neg logit contributions
FTWARE-3.827
Dialogue-3.778
ARI-3.778
 secretaries-3.774
mented-3.702
Token.
Token ID13
Feature activation+5.39e+00

Pos logit contributions
 Crossing+0.265
zees+0.262
 Lama+0.245
eding+0.237
 Ski+0.234
Neg logit contributions
Dialogue-0.249
ARI-0.245
FTWARE-0.245
mented-0.243
 secretaries-0.239
Token
Token ID198
Feature activation+6.41e-01

Pos logit contributions
Dialogue+3.189
ARI+3.055
FTWARE+3.047
mented+2.985
 elements+2.869
Neg logit contributions
zees-4.636
 Crossing-4.449
 Lama-4.257
eding-4.208
digy-4.140
Token
Token ID198
Feature activation+2.75e+00

Pos logit contributions
Dialogue+0.395
ARI+0.391
FTWARE+0.387
mented+0.384
 secretaries+0.362
Neg logit contributions
 Crossing-0.553
zees-0.546
 Lama-0.512
eding-0.500
 Ski-0.492
TokenDespite
Token ID8332
Feature activation-5.69e+00

Pos logit contributions
Dialogue+1.666
ARI+1.612
FTWARE+1.566
mented+1.526
 secretaries+1.444
Neg logit contributions
 Crossing-2.485
zees-2.439
 Lama-2.336
 Ski-2.237
eding-2.220
Token her
Token ID607
Feature activation-3.46e+00

Pos logit contributions
 Crossing+4.176
zees+3.995
 Lama+3.806
eding+3.701
 Ski+3.621
Neg logit contributions
Dialogue-4.228
FTWARE-4.216
ARI-4.214
mented-4.158
 secretaries-4.014
Token conc
Token ID1673
Feature activation-1.39e+01

Pos logit contributions
 Crossing+2.691
zees+2.626
 Lama+2.473
eding+2.405
 Ski+2.366
Neg logit contributions
Dialogue-2.427
ARI-2.411
FTWARE-2.409
mented-2.378
 secretaries-2.326
Tokenili
Token ID2403
Feature activation-1.15e+00

Pos logit contributions
 Crossing+8.460
eding+8.279
zees+8.075
 Lama+7.487
owler+7.223
Neg logit contributions
FTWARE-10.980
ARI-10.796
Dialogue-10.583
 secretaries-10.516
mented-10.306
Tokenatory
Token ID2870
Feature activation-3.41e+00

Pos logit contributions
 Crossing+1.218
zees+1.181
 Lama+1.137
 Ski+1.108
eding+1.104
Neg logit contributions
Dialogue-0.496
ARI-0.493
mented-0.476
FTWARE-0.464
 secretaries-0.438
Token tone
Token ID8216
Feature activation-4.97e+00

Pos logit contributions
 Crossing+2.975
zees+2.905
 Lama+2.759
eding+2.692
 Ski+2.652
Neg logit contributions
Dialogue-2.066
ARI-2.053
FTWARE-2.052
mented-2.020
 secretaries-1.962
Token,
Token ID11
Feature activation-2.59e-02

Pos logit contributions
 Crossing+3.823
zees+3.683
 Lama+3.488
eding+3.402
 Ski+3.328
Neg logit contributions
Dialogue-3.524
FTWARE-3.520
ARI-3.494
mented-3.460
 secretaries-3.338
Token Merkel
Token ID18263
Feature activation-2.01e+00

Pos logit contributions
 Crossing+0.020
zees+0.020
 Lama+0.019
eding+0.018
 Ski+0.018
Neg logit contributions
Dialogue-0.018
ARI-0.018
FTWARE-0.017
mented-0.017
 secretaries-0.017
Tokencom
Token ID785
Feature activation-5.75e+00

Pos logit contributions
Dialogue+0.258
ARI+0.254
FTWARE+0.252
mented+0.250
 secretaries+0.228
Neg logit contributions
 Crossing-0.604
zees-0.593
 Lama-0.566
eding-0.552
 Ski-0.548
Token/
Token ID14
Feature activation-9.21e+00

Pos logit contributions
 Crossing+3.822
zees+3.664
 Lama+3.469
eding+3.352
 Ski+3.323
Neg logit contributions
 secretaries-4.599
FTWARE-4.585
ARI-4.556
Dialogue-4.527
mented-4.476
Tokenj
Token ID73
Feature activation-8.53e+00

Pos logit contributions
 Crossing+6.746
zees+6.675
 Lama+6.215
eding+6.186
MQ+6.068
Neg logit contributions
FTWARE-6.415
Dialogue-6.280
 secretaries-6.279
mented-6.214
ARI-6.195
TokenAx
Token ID31554
Feature activation-7.63e+00

Pos logit contributions
 Crossing+6.224
zees+6.156
 Lama+5.728
eding+5.680
MQ+5.569
Neg logit contributions
FTWARE-6.017
Dialogue-5.911
 secretaries-5.904
mented-5.877
ARI-5.857
Token6
Token ID21
Feature activation-9.53e+00

Pos logit contributions
 Crossing+5.666
zees+5.585
 Lama+5.223
eding+5.151
MQ+5.017
Neg logit contributions
FTWARE-5.367
Dialogue-5.301
mented-5.244
ARI-5.241
 secretaries-5.235
Tokenc
Token ID66
Feature activation-1.38e+01

Pos logit contributions
 Crossing+7.021
zees+6.972
 Lama+6.484
eding+6.457
MQ+6.369
Neg logit contributions
FTWARE-6.537
 secretaries-6.415
Dialogue-6.392
mented-6.339
ARI-6.304
TokenU
Token ID52
Feature activation-7.15e+00

Pos logit contributions
zees+9.774
 Crossing+9.700
MQ+9.217
eding+9.082
 Lama+9.009
Neg logit contributions
FTWARE-9.041
 secretaries-8.973
mented-8.773
Dialogue-8.589
ARI-8.450
Tokenxi
Token ID29992
Feature activation-7.66e+00

Pos logit contributions
 Crossing+5.259
zees+5.179
 Lama+4.839
eding+4.786
 Ski+4.629
Neg logit contributions
FTWARE-5.124
Dialogue-5.055
ARI-4.996
mented-4.987
 secretaries-4.984
TokenUG
Token ID7340
Feature activation-2.59e+00

Pos logit contributions
 Crossing+5.580
zees+5.499
 Lama+5.152
eding+5.076
 Ski+4.940
Neg logit contributions
FTWARE-5.504
Dialogue-5.425
mented-5.401
 secretaries-5.365
ARI-5.346
Token
Token ID851
Feature activation-7.67e+00

Pos logit contributions
 Crossing+2.229
zees+2.172
 Lama+2.070
eding+2.016
 Ski+1.995
Neg logit contributions
Dialogue-1.597
FTWARE-1.591
ARI-1.575
mented-1.548
 secretaries-1.527
Token tele
Token ID5735
Feature activation-6.20e+00

Pos logit contributions
 Crossing+5.506
zees+5.197
 Lama+5.104
eding+4.792
 Ski+4.777
Neg logit contributions
Dialogue-5.684
FTWARE-5.679
ARI-5.645
mented-5.593
 secretaries-5.490
Tokenized
Token ID1143
Feature activation+1.54e-01

Pos logit contributions
 Crossing+1.097
zees+1.054
 Lama+0.975
eding+0.913
 Ski+0.908
Neg logit contributions
Dialogue-1.931
mented-1.918
ARI-1.917
FTWARE-1.909
 secretaries-1.861
Token nipples
Token ID37368
Feature activation-2.90e+00

Pos logit contributions
Dialogue+0.102
ARI+0.101
mented+0.101
FTWARE+0.101
 secretaries+0.099
Neg logit contributions
 Crossing-0.123
zees-0.123
 Lama-0.114
eding-0.111
 Ski-0.108
Token to
Token ID284
Feature activation+2.33e+00

Pos logit contributions
 Crossing+2.184
zees+2.129
 Lama+2.002
eding+1.945
 Ski+1.912
Neg logit contributions
Dialogue-2.100
ARI-2.087
FTWARE-2.086
mented-2.061
 secretaries-2.014
Token match
Token ID2872
Feature activation-2.93e+00

Pos logit contributions
Dialogue+1.664
mented+1.656
ARI+1.641
FTWARE+1.603
 secretaries+1.600
Neg logit contributions
 Crossing-1.749
zees-1.716
 Lama-1.587
eding-1.548
 Ski-1.507
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
 Crossing+2.171
zees+2.116
 Lama+1.980
eding+1.927
 Ski+1.889
Neg logit contributions
Dialogue-2.159
ARI-2.145
FTWARE-2.141
mented-2.125
 secretaries-2.076
Token tri
Token ID1333
Feature activation-1.37e+01

Pos logit contributions
Dialogue+0.075
ARI+0.075
mented+0.074
FTWARE+0.074
 secretaries+0.073
Neg logit contributions
 Crossing-0.090
zees-0.089
 Lama-0.083
eding-0.081
 Ski-0.079
Token-
Token ID12
Feature activation-4.74e+00

Pos logit contributions
 Crossing+8.883
zees+8.704
 Lama+8.251
eding+8.219
 Ski+7.920
Neg logit contributions
FTWARE-10.110
Dialogue-10.033
ARI-9.756
 secretaries-9.582
mented-9.388
Tokencolor
Token ID8043
Feature activation-5.81e+00

Pos logit contributions
 Crossing+4.468
zees+4.371
 Lama+4.172
eding+4.092
 Ski+4.026
Neg logit contributions
FTWARE-2.510
Dialogue-2.503
ARI-2.482
mented-2.434
 secretaries-2.372
Token group
Token ID1448
Feature activation-2.24e+00

Pos logit contributions
 Crossing+4.971
zees+4.801
 Lama+4.596
eding+4.507
 Ski+4.454
Neg logit contributions
Dialogue-3.575
FTWARE-3.560
ARI-3.541
mented-3.441
 secretaries-3.361
Token and
Token ID290
Feature activation-1.66e+00

Pos logit contributions
 Crossing+1.703
zees+1.661
 Lama+1.561
eding+1.513
 Ski+1.488
Neg logit contributions
Dialogue-1.606
ARI-1.596
FTWARE-1.595
mented-1.582
 secretaries-1.543
Token vintage
Token ID20276
Feature activation+1.28e+00

Pos logit contributions
 Crossing+1.308
zees+1.282
 Lama+1.201
eding+1.171
 Ski+1.149
Neg logit contributions
Dialogue-1.138
FTWARE-1.128
ARI-1.127
mented-1.120
 secretaries-1.092
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 Crossing+0.929
zees+0.909
 Lama+0.853
eding+0.827
 Ski+0.812
Neg logit contributions
Dialogue-0.852
ARI-0.850
FTWARE-0.847
mented-0.840
 secretaries-0.827
Token
Token ID447
Feature activation-1.49e+00

Pos logit contributions
 Crossing+0.116
zees+0.116
 Lama+0.108
eding+0.106
 Ski+0.103
Neg logit contributions
Dialogue-0.096
ARI-0.096
FTWARE-0.095
mented-0.094
 secretaries-0.091
Token
Token ID251
Feature activation-4.62e+00

Pos logit contributions
 Crossing+0.914
zees+0.886
 Lama+0.820
eding+0.792
 Ski+0.778
Neg logit contributions
Dialogue-1.283
ARI-1.278
FTWARE-1.275
mented-1.266
 secretaries-1.239
Token
Token ID198
Feature activation-2.69e+00

Pos logit contributions
 Crossing+3.702
zees+3.585
 Lama+3.403
eding+3.297
 Ski+3.280
Neg logit contributions
Dialogue-3.104
FTWARE-3.101
ARI-3.073
mented-3.036
 secretaries-2.989
Token
Token ID198
Feature activation-6.82e-01

Pos logit contributions
 Crossing+2.134
zees+2.075
 Lama+1.963
eding+1.916
 Ski+1.886
Neg logit contributions
Dialogue-1.828
FTWARE-1.825
ARI-1.816
mented-1.792
 secretaries-1.779
TokenJim
Token ID18050
Feature activation-1.36e+01

Pos logit contributions
 Crossing+0.596
zees+0.583
 Lama+0.555
eding+0.537
 Ski+0.530
Neg logit contributions
Dialogue-0.419
ARI-0.416
FTWARE-0.411
mented-0.405
 secretaries-0.390
Token Bern
Token ID6206
Feature activation-4.10e+00

Pos logit contributions
 Crossing+9.677
zees+9.070
 Lama+8.896
 Ski+8.471
 Trace+8.409
Neg logit contributions
Dialogue-9.608
FTWARE-9.498
ARI-9.351
 secretaries-9.268
mented-9.027
Tokenau
Token ID559
Feature activation-4.40e+00

Pos logit contributions
 Crossing+3.381
zees+3.308
 Lama+3.125
eding+3.062
 Ski+3.000
Neg logit contributions
Dialogue-2.665
FTWARE-2.649
ARI-2.643
mented-2.595
 secretaries-2.551
Token,
Token ID11
Feature activation-2.81e+00

Pos logit contributions
 Crossing+3.490
zees+3.396
 Lama+3.220
eding+3.136
 Ski+3.088
Neg logit contributions
Dialogue-3.008
FTWARE-2.992
ARI-2.979
mented-2.923
 secretaries-2.875
Token CEO
Token ID6123
Feature activation-4.18e+00

Pos logit contributions
 Crossing+2.265
zees+2.211
 Lama+2.088
eding+2.033
 Ski+2.001
Neg logit contributions
Dialogue-1.887
ARI-1.874
FTWARE-1.873
mented-1.847
 secretaries-1.804
Token of
Token ID286
Feature activation-2.14e+00

Pos logit contributions
 Crossing+2.842
zees+2.736
 Lama+2.580
eding+2.485
 Ski+2.481
Neg logit contributions
Dialogue-3.312
ARI-3.266
FTWARE-3.252
mented-3.192
 secretaries-3.169
Tokent
Token ID83
Feature activation-4.07e+00

Pos logit contributions
 Crossing+0.430
zees+0.418
 Lama+0.398
eding+0.380
 Ski+0.380
Neg logit contributions
Dialogue-0.365
ARI-0.364
mented-0.362
FTWARE-0.362
 secretaries-0.342
Token.
Token ID13
Feature activation+3.93e+00

Pos logit contributions
 Crossing+2.865
zees+2.795
 Lama+2.600
eding+2.549
 Ski+2.490
Neg logit contributions
Dialogue-3.116
FTWARE-3.103
ARI-3.100
 secretaries-3.064
mented-3.055
Tokenco
Token ID1073
Feature activation-4.99e+00

Pos logit contributions
Dialogue+1.769
ARI+1.761
mented+1.758
FTWARE+1.685
elve+1.564
Neg logit contributions
 Crossing-3.919
zees-3.874
 Lama-3.728
 Trace-3.587
 Ski-3.578
Token/
Token ID14
Feature activation-8.46e+00

Pos logit contributions
 Crossing+3.433
zees+3.346
 Lama+3.123
eding+3.054
 Ski+2.988
Neg logit contributions
Dialogue-3.863
ARI-3.859
FTWARE-3.854
 secretaries-3.846
mented-3.791
Tokenni
Token ID8461
Feature activation-7.70e+00

Pos logit contributions
 Crossing+6.237
zees+6.149
 Lama+5.737
eding+5.688
MQ+5.554
Neg logit contributions
FTWARE-5.942
Dialogue-5.834
 secretaries-5.795
mented-5.768
ARI-5.762
TokenWB
Token ID45607
Feature activation-1.36e+01

Pos logit contributions
 Crossing+5.648
zees+5.559
 Lama+5.210
eding+5.128
MQ+5.003
Neg logit contributions
FTWARE-5.471
Dialogue-5.402
mented-5.381
 secretaries-5.355
ARI-5.335
Tokenqv
Token ID44179
Feature activation-1.12e+01

Pos logit contributions
zees+9.804
 Crossing+9.733
eding+9.136
MQ+9.082
 Lama+9.032
Neg logit contributions
FTWARE-8.902
 secretaries-8.773
mented-8.503
Dialogue-8.413
ARI-8.269
Tokenfi
Token ID12463
Feature activation-1.07e+01

Pos logit contributions
 Crossing+8.090
zees+8.050
 Lama+7.478
eding+7.476
MQ+7.470
Neg logit contributions
FTWARE-7.627
 secretaries-7.479
mented-7.393
Dialogue-7.342
ARI-7.227
TokenW
Token ID54
Feature activation-7.28e+00

Pos logit contributions
 Crossing+7.612
zees+7.537
 Lama+7.047
eding+7.023
MQ+6.997
Neg logit contributions
FTWARE-7.443
 secretaries-7.318
mented-7.288
Dialogue-7.220
ARI-7.069
Tokenu
Token ID84
Feature activation-5.02e+00

Pos logit contributions
 Crossing+5.408
zees+5.313
 Lama+4.975
eding+4.889
 Ski+4.781
Neg logit contributions
FTWARE-5.200
Dialogue-5.119
ARI-5.069
mented-5.056
 secretaries-5.028
Token
Token ID851
Feature activation-6.91e+00

Pos logit contributions
 Crossing+3.916
zees+3.808
 Lama+3.604
eding+3.498
 Ski+3.450
Neg logit contributions
Dialogue-3.487
FTWARE-3.481
ARI-3.452
mented-3.422
 secretaries-3.353
Token
Token ID198
Feature activation-7.61e-01

Pos logit contributions
zees+0.343
 Crossing+0.341
 Lama+0.316
eding+0.310
 Ski+0.301
Neg logit contributions
Dialogue-0.339
ARI-0.337
FTWARE-0.335
mented-0.331
 secretaries-0.321
Token
Token ID198
Feature activation+4.34e-02

Pos logit contributions
 Crossing+0.634
zees+0.621
 Lama+0.586
eding+0.570
 Ski+0.562
Neg logit contributions
Dialogue-0.492
ARI-0.489
FTWARE-0.487
mented-0.482
 secretaries-0.464
TokenImage
Token ID5159
Feature activation-2.82e-01

Pos logit contributions
Dialogue+0.027
ARI+0.027
FTWARE+0.026
mented+0.026
 secretaries+0.025
Neg logit contributions
 Crossing-0.038
zees-0.037
 Lama-0.035
eding-0.034
 Ski-0.034
Token credit
Token ID3884
Feature activation-7.76e-01

Pos logit contributions
 Crossing+0.204
zees+0.200
 Lama+0.186
eding+0.181
 Ski+0.177
Neg logit contributions
Dialogue-0.213
ARI-0.211
FTWARE-0.210
mented-0.209
 secretaries-0.204
Token:
Token ID25
Feature activation-4.72e+00

Pos logit contributions
 Crossing+0.643
zees+0.633
 Lama+0.595
eding+0.581
 Ski+0.570
Neg logit contributions
Dialogue-0.502
ARI-0.501
FTWARE-0.497
mented-0.492
 secretaries-0.479
Token Cop
Token ID6955
Feature activation-1.36e+01

Pos logit contributions
 Crossing+3.534
zees+3.393
 Lama+3.219
eding+3.114
 Ski+3.086
Neg logit contributions
Dialogue-3.457
FTWARE-3.428
ARI-3.423
mented-3.382
 secretaries-3.307
Token Richard
Token ID6219
Feature activation-1.56e+00

Pos logit contributions
 Crossing+11.434
zees+11.344
 Lama+10.785
eding+10.717
 Ski+10.359
Neg logit contributions
FTWARE-7.537
Dialogue-7.112
ARI-7.034
 secretaries-7.033
mented-6.975
Token,
Token ID11
Feature activation-5.04e+00

Pos logit contributions
 Crossing+1.119
zees+1.099
 Lama+1.022
eding+0.995
 Ski+0.974
Neg logit contributions
Dialogue-1.178
ARI-1.172
FTWARE-1.165
mented-1.155
 secretaries-1.128
Token royalty
Token ID29359
Feature activation-3.37e+00

Pos logit contributions
 Crossing+3.665
zees+3.495
 Lama+3.312
eding+3.217
 Ski+3.185
Neg logit contributions
Dialogue-3.782
FTWARE-3.764
ARI-3.749
mented-3.702
 secretaries-3.634
Token free
Token ID1479
Feature activation-1.01e+01

Pos logit contributions
 Crossing+2.545
zees+2.456
 Lama+2.327
eding+2.265
 Ski+2.227
Neg logit contributions
Dialogue-2.442
FTWARE-2.432
ARI-2.420
mented-2.391
 secretaries-2.329
Token<|endoftext|>
Token ID50256
Feature activation-1.24e-01

Pos logit contributions
 Crossing+7.512
zees+6.933
 Lama+6.696
 Ski+6.538
eding+6.506
Neg logit contributions
Dialogue-7.126
ARI-7.044
FTWARE-7.010
mented-6.918
 secretaries-6.765
Token Court
Token ID3078
Feature activation-4.09e+00

Pos logit contributions
 Crossing+0.721
zees+0.690
 Lama+0.634
eding+0.609
 Ski+0.596
Neg logit contributions
Dialogue-1.221
FTWARE-1.215
ARI-1.212
mented-1.199
 secretaries-1.182
Token System
Token ID4482
Feature activation+3.95e+00

Pos logit contributions
 Crossing+2.799
zees+2.683
 Lama+2.521
eding+2.449
 Ski+2.410
Neg logit contributions
Dialogue-3.242
FTWARE-3.223
ARI-3.219
mented-3.170
 secretaries-3.119
Token.
Token ID13
Feature activation+7.68e+00

Pos logit contributions
ARI+2.620
Dialogue+2.587
mented+2.562
FTWARE+2.556
 secretaries+2.487
Neg logit contributions
zees-3.130
 Crossing-3.118
 Lama-2.900
eding-2.795
 Ski-2.768
Token The
Token ID383
Feature activation+3.01e+00

Pos logit contributions
ARI+4.779
Dialogue+4.679
FTWARE+4.550
mented+4.493
 elements+4.340
Neg logit contributions
zees-6.322
 Crossing-6.004
eding-5.792
 Lama-5.676
digy-5.609
Token accusation
Token ID25984
Feature activation+3.35e+00

Pos logit contributions
ARI+1.955
Dialogue+1.945
FTWARE+1.916
mented+1.909
 secretaries+1.902
Neg logit contributions
zees-2.439
 Crossing-2.419
 Lama-2.251
eding-2.214
 Ski-2.157
Token claims
Token ID3667
Feature activation+8.82e+00

Pos logit contributions
ARI+2.238
Dialogue+2.200
mented+2.197
FTWARE+2.136
 secretaries+2.113
Neg logit contributions
 Crossing-2.690
zees-2.668
 Lama-2.461
eding-2.409
 Ski-2.373
Token that
Token ID326
Feature activation+4.49e+00

Pos logit contributions
ARI+5.425
Dialogue+5.360
 elements+5.314
 secretaries+5.299
mented+5.171
Neg logit contributions
zees-7.037
 Crossing-7.004
 Lama-6.488
eding-6.478
 Ski-6.148
Token named
Token ID3706
Feature activation+3.72e+00

Pos logit contributions
Dialogue+3.164
ARI+3.161
 secretaries+3.112
FTWARE+3.092
mented+3.079
Neg logit contributions
zees-3.349
 Crossing-3.306
 Lama-3.059
eding-3.006
 Ski-2.911
Token people
Token ID661
Feature activation-1.51e+00

Pos logit contributions
ARI+1.899
mented+1.853
 secretaries+1.843
Dialogue+1.842
FTWARE+1.817
Neg logit contributions
 Crossing-3.539
zees-3.510
 Lama-3.316
eding-3.227
 Ski-3.196
Token couldn
Token ID3521
Feature activation-3.16e-01

Pos logit contributions
 Crossing+1.155
zees+1.133
 Lama+1.062
eding+1.029
 Ski+1.012
Neg logit contributions
ARI-1.072
Dialogue-1.069
FTWARE-1.060
mented-1.052
 secretaries-1.034
Token
Token ID447
Feature activation+2.43e+00

Pos logit contributions
 Crossing+0.272
zees+0.266
 Lama+0.251
eding+0.245
 Ski+0.241
Neg logit contributions
Dialogue-0.196
ARI-0.195
FTWARE-0.193
mented-0.192
 secretaries-0.181
Token Church
Token ID4564
Feature activation-1.89e+00

Pos logit contributions
 Crossing+2.605
zees+2.499
 Lama+2.352
eding+2.272
 Ski+2.249
Neg logit contributions
Dialogue-2.937
FTWARE-2.916
ARI-2.912
mented-2.876
 secretaries-2.822
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
 Crossing+1.426
zees+1.413
 Lama+1.317
eding+1.284
 Ski+1.254
Neg logit contributions
Dialogue-1.363
ARI-1.353
FTWARE-1.349
mented-1.331
 secretaries-1.304
Token
Token ID564
Feature activation-1.52e+00

Pos logit contributions
Dialogue+1.057
ARI+1.055
FTWARE+1.043
mented+1.030
 secretaries+0.988
Neg logit contributions
zees-1.260
 Crossing-1.244
 Lama-1.161
eding-1.154
digy-1.113
Token
Token ID250
Feature activation+2.20e+00

Pos logit contributions
 Crossing+0.924
zees+0.896
 Lama+0.828
eding+0.799
 Ski+0.787
Neg logit contributions
Dialogue-1.331
ARI-1.326
mented-1.315
FTWARE-1.313
 secretaries-1.281
TokenI
Token ID40
Feature activation+4.85e+00

Pos logit contributions
Dialogue+1.406
ARI+1.393
mented+1.355
FTWARE+1.351
 secretaries+1.301
Neg logit contributions
 Crossing-1.840
zees-1.835
 Lama-1.724
eding-1.678
 Ski-1.648
Token feel
Token ID1254
Feature activation+9.48e+00

Pos logit contributions
mented+2.797
Dialogue+2.746
FTWARE+2.727
ARI+2.720
 secretaries+2.583
Neg logit contributions
 Crossing-4.346
zees-4.223
 Lama-3.986
 Ski-3.797
 Trace-3.791
Token like
Token ID588
Feature activation+5.36e+00

Pos logit contributions
 secretaries+5.290
ARI+5.243
mented+5.222
Dialogue+5.218
 elements+5.159
Neg logit contributions
zees-8.056
 Crossing-8.037
eding-7.309
 Lama-7.275
 Ski-7.134
Token I
Token ID314
Feature activation+6.84e+00

Pos logit contributions
Dialogue+3.581
ARI+3.537
 secretaries+3.499
mented+3.487
 elements+3.439
Neg logit contributions
zees-4.174
 Crossing-4.141
 Lama-3.811
eding-3.751
 Ski-3.689
Token
Token ID447
Feature activation-1.15e+00

Pos logit contributions
mented+3.577
Dialogue+3.487
ARI+3.448
FTWARE+3.339
 elements+3.227
Neg logit contributions
 Crossing-6.432
zees-6.233
 Lama-5.854
 Trace-5.635
MQ-5.634
Token
Token ID247
Feature activation+2.51e+00

Pos logit contributions
 Crossing+0.697
zees+0.674
 Lama+0.620
eding+0.599
 Ski+0.587
Neg logit contributions
Dialogue-1.006
ARI-1.004
mented-1.002
FTWARE-0.995
 secretaries-0.967
Tokenm
Token ID76
Feature activation+1.17e+00

Pos logit contributions
mented+1.638
Dialogue+1.622
ARI+1.620
FTWARE+1.578
 secretaries+1.512
Neg logit contributions
 Crossing-2.090
zees-2.038
 Lama-1.905
 Ski-1.842
eding-1.839
Token from
Token ID422
Feature activation+7.98e-01

Pos logit contributions
ARI+0.303
Dialogue+0.302
FTWARE+0.297
mented+0.296
 secretaries+0.292
Neg logit contributions
 Crossing-0.330
zees-0.326
 Lama-0.303
eding-0.294
 Ski-0.291
Token his
Token ID465
Feature activation-1.99e+00

Pos logit contributions
Dialogue+0.577
ARI+0.573
FTWARE+0.568
mented+0.565
 secretaries+0.563
Neg logit contributions
 Crossing-0.594
zees-0.591
 Lama-0.545
eding-0.530
 Ski-0.522
Token new
Token ID649
Feature activation+5.84e-01

Pos logit contributions
 Crossing+1.643
zees+1.615
 Lama+1.519
eding+1.480
 Ski+1.457
Neg logit contributions
Dialogue-1.290
ARI-1.281
FTWARE-1.280
mented-1.268
 secretaries-1.255
Token brother
Token ID3956
Feature activation+3.90e-01

Pos logit contributions
Dialogue+0.375
FTWARE+0.375
mented+0.373
 secretaries+0.372
ARI+0.372
Neg logit contributions
 Crossing-0.478
zees-0.474
 Lama-0.442
eding-0.429
 Ski-0.426
Token-
Token ID12
Feature activation+5.68e+00

Pos logit contributions
ARI+0.269
FTWARE+0.268
Dialogue+0.268
mented+0.268
 secretaries+0.261
Neg logit contributions
 Crossing-0.303
zees-0.298
 Lama-0.278
eding-0.272
 Ski-0.267
Tokenin
Token ID259
Feature activation+1.09e+01

Pos logit contributions
Dialogue+1.388
mented+1.380
ARI+1.344
 secretaries+1.278
FTWARE+1.271
Neg logit contributions
 Crossing-6.806
zees-6.735
 Lama-6.422
 Ski-6.302
eding-6.238
Token-
Token ID12
Feature activation+1.20e+01

Pos logit contributions
mented+5.895
Dialogue+5.863
FTWARE+5.704
ARI+5.614
 secretaries+5.526
Neg logit contributions
 Crossing-9.253
zees-9.136
 Lama-8.323
 Ski-8.196
eding-8.181
Tokenlaw
Token ID6270
Feature activation-2.05e+00

Pos logit contributions
Dialogue+3.744
mented+3.115
ARI+3.111
well+2.809
FTWARE+2.808
Neg logit contributions
zees-13.266
 Crossing-13.041
 Ski-12.354
 Lama-12.329
 Trace-12.036
Token,
Token ID11
Feature activation-2.46e-01

Pos logit contributions
 Crossing+1.601
zees+1.574
 Lama+1.471
eding+1.438
 Ski+1.411
Neg logit contributions
Dialogue-1.425
ARI-1.421
FTWARE-1.416
mented-1.403
 secretaries-1.373
Token E
Token ID412
Feature activation-2.00e+00

Pos logit contributions
 Crossing+0.185
zees+0.184
 Lama+0.170
eding+0.166
 Ski+0.163
Neg logit contributions
Dialogue-0.176
ARI-0.175
FTWARE-0.175
mented-0.173
 secretaries-0.172
Tokenad
Token ID324
Feature activation+2.15e-02

Pos logit contributions
 Crossing+1.557
zees+1.516
 Lama+1.431
eding+1.387
 Ski+1.370
Neg logit contributions
Dialogue-1.398
ARI-1.388
FTWARE-1.385
mented-1.371
 secretaries-1.336
TokenThey
Token ID2990
Feature activation+3.45e+00

Pos logit contributions
Dialogue+1.002
ARI+0.986
FTWARE+0.945
mented+0.934
ATURE+0.883
Neg logit contributions
 Crossing-1.386
zees-1.364
 Lama-1.294
eding-1.250
 Ski-1.240
Token don
Token ID836
Feature activation+2.41e+00

Pos logit contributions
mented+2.279
ARI+2.259
Dialogue+2.252
FTWARE+2.215
 secretaries+2.185
Neg logit contributions
 Crossing-2.789
zees-2.734
 Lama-2.539
 Ski-2.423
digy-2.409
Token
Token ID447
Feature activation-1.10e+00

Pos logit contributions
Dialogue+1.470
ARI+1.468
mented+1.450
FTWARE+1.434
ATURE+1.336
Neg logit contributions
 Crossing-2.066
zees-2.042
 Lama-1.908
eding-1.859
digy-1.827
Token
Token ID247
Feature activation-1.48e+00

Pos logit contributions
 Crossing+0.712
zees+0.689
 Lama+0.632
eding+0.612
 Ski+0.604
Neg logit contributions
ARI-0.924
Dialogue-0.923
mented-0.922
FTWARE-0.909
 secretaries-0.883
Tokent
Token ID83
Feature activation-2.31e+00

Pos logit contributions
 Crossing+1.181
zees+1.149
 Lama+1.082
eding+1.045
 Ski+1.033
Neg logit contributions
Dialogue-1.012
ARI-1.007
mented-1.005
FTWARE-1.002
 secretaries-0.960
Token need
Token ID761
Feature activation+9.55e+00

Pos logit contributions
 Crossing+1.836
zees+1.789
 Lama+1.690
eding+1.648
 Ski+1.621
Neg logit contributions
Dialogue-1.576
ARI-1.566
FTWARE-1.565
mented-1.542
 secretaries-1.506
Token facts
Token ID6419
Feature activation-5.41e+00

Pos logit contributions
 secretaries+5.616
mented+5.549
Dialogue+5.489
ARI+5.484
 elements+5.412
Neg logit contributions
zees-7.862
 Crossing-7.612
 Lama-7.112
digy-6.928
 Ski-6.917
Token to
Token ID284
Feature activation+1.38e-01

Pos logit contributions
 Crossing+4.081
zees+3.959
 Lama+3.741
eding+3.654
 Ski+3.573
Neg logit contributions
Dialogue-3.908
FTWARE-3.882
ARI-3.862
mented-3.797
 secretaries-3.727
Token unleash
Token ID33502
Feature activation-1.46e+00

Pos logit contributions
Dialogue+0.094
ARI+0.094
mented+0.093
FTWARE+0.093
 secretaries+0.091
Neg logit contributions
 Crossing-0.109
zees-0.107
 Lama-0.100
eding-0.096
 Ski-0.095
Token their
Token ID511
Feature activation-1.19e+00

Pos logit contributions
 Crossing+1.125
zees+1.101
 Lama+1.032
eding+1.004
 Ski+0.989
Neg logit contributions
Dialogue-1.036
ARI-1.028
FTWARE-1.023
mented-1.014
 secretaries-0.997
Token volcanic
Token ID31513
Feature activation+1.61e+00

Pos logit contributions
 Crossing+1.066
zees+1.042
 Lama+0.988
eding+0.963
 Ski+0.952
Neg logit contributions
Dialogue-0.694
ARI-0.692
FTWARE-0.684
mented-0.678
 secretaries-0.667
Token2
Token ID17
Feature activation+7.30e-01

Pos logit contributions
 Crossing+0.306
zees+0.301
 Lama+0.283
 Ski+0.269
eding+0.269
Neg logit contributions
Dialogue-0.313
ARI-0.312
FTWARE-0.309
mented-0.307
 secretaries-0.294
Token loss
Token ID2994
Feature activation+4.09e+00

Pos logit contributions
ARI+0.513
Dialogue+0.511
FTWARE+0.510
mented+0.503
 secretaries+0.496
Neg logit contributions
zees-0.559
 Crossing-0.551
 Lama-0.508
eding-0.492
 Ski-0.486
Token to
Token ID284
Feature activation-8.95e-01

Pos logit contributions
ARI+2.333
Dialogue+2.253
FTWARE+2.235
mented+2.202
 secretaries+2.184
Neg logit contributions
zees-3.675
 Crossing-3.653
 Lama-3.376
eding-3.271
owler-3.247
Token the
Token ID262
Feature activation+2.77e+00

Pos logit contributions
 Crossing+0.690
zees+0.685
 Lama+0.631
eding+0.614
 Ski+0.604
Neg logit contributions
ARI-0.634
FTWARE-0.627
Dialogue-0.624
mented-0.621
 secretaries-0.606
Token Philadelphia
Token ID8857
Feature activation+8.32e-01

Pos logit contributions
ARI+1.814
FTWARE+1.791
mented+1.784
Dialogue+1.768
 secretaries+1.736
Neg logit contributions
zees-2.240
 Crossing-2.201
 Lama-2.038
eding-2.000
 Ski-1.959
Token Flyers
Token ID30652
Feature activation+8.10e+00

Pos logit contributions
mented+0.722
ARI+0.719
Dialogue+0.716
FTWARE+0.708
 secretaries+0.699
Neg logit contributions
zees-0.501
 Crossing-0.489
 Lama-0.449
eding-0.432
 Ski-0.418
Token at
Token ID379
Feature activation+4.32e+00

Pos logit contributions
ARI+4.816
FTWARE+4.694
Dialogue+4.681
 secretaries+4.590
 elements+4.547
Neg logit contributions
zees-6.683
 Crossing-6.454
 Lama-6.090
eding-5.903
owler-5.897
Token United
Token ID1578
Feature activation+2.35e+00

Pos logit contributions
ARI+3.046
Dialogue+2.975
mented+2.897
FTWARE+2.893
 elements+2.854
Neg logit contributions
zees-3.277
 Crossing-3.197
 Lama-2.955
digy-2.918
eding-2.901
Token Center
Token ID3337
Feature activation+1.63e+00

Pos logit contributions
ARI+1.587
Dialogue+1.585
mented+1.557
FTWARE+1.524
 secretaries+1.495
Neg logit contributions
zees-1.860
 Crossing-1.796
 Lama-1.714
eding-1.685
owler-1.635
Token on
Token ID319
Feature activation+4.74e+00

Pos logit contributions
ARI+1.108
Dialogue+1.103
FTWARE+1.099
mented+1.075
 secretaries+1.057
Neg logit contributions
zees-1.277
 Crossing-1.268
 Lama-1.183
eding-1.154
owler-1.123
Token Wednesday
Token ID3583
Feature activation-3.32e-01

Pos logit contributions
ARI+3.054
Dialogue+3.038
mented+2.983
FTWARE+2.968
 secretaries+2.965
Neg logit contributions
zees-3.770
 Crossing-3.602
 Lama-3.349
eding-3.346
owler-3.284
Token state
Token ID1181
Feature activation+4.65e+00

Pos logit contributions
ARI+1.109
Dialogue+1.108
FTWARE+1.101
mented+1.100
 secretaries+1.082
Neg logit contributions
zees-1.112
 Crossing-1.101
 Lama-1.001
eding-0.978
 Ski-0.960
Token.
Token ID13
Feature activation-7.83e-02

Pos logit contributions
mented+3.062
ARI+3.038
FTWARE+3.030
Dialogue+3.025
 secretaries+2.958
Neg logit contributions
zees-3.722
 Crossing-3.689
 Lama-3.377
eding-3.280
owler-3.260
Token BE
Token ID9348
Feature activation+9.79e+00

Pos logit contributions
zees+0.058
 Crossing+0.058
 Lama+0.053
eding+0.053
digy+0.051
Neg logit contributions
ARI-0.057
Dialogue-0.057
FTWARE-0.057
mented-0.055
 secretaries-0.054
TokenAG
Token ID4760
Feature activation+3.89e+00

Pos logit contributions
ARI+5.168
Dialogue+4.934
ATURE+4.664
FTWARE+4.663
mented+4.443
Neg logit contributions
 Crossing-9.425
zees-9.327
 Lama-8.588
 Ski-8.327
 Trace-8.318
Token had
Token ID550
Feature activation+1.54e+00

Pos logit contributions
ARI+2.529
Dialogue+2.464
 secretaries+2.453
mented+2.453
FTWARE+2.444
Neg logit contributions
 Crossing-3.180
zees-3.172
 Lama-2.892
eding-2.828
digy-2.815
Token knocked
Token ID13642
Feature activation-6.53e-01

Pos logit contributions
ARI+1.039
Dialogue+1.023
FTWARE+1.022
mented+1.018
 secretaries+1.009
Neg logit contributions
 Crossing-1.235
zees-1.219
 Lama-1.113
eding-1.086
 Ski-1.074
Token at
Token ID379
Feature activation+4.29e+00

Pos logit contributions
 Crossing+0.486
zees+0.476
 Lama+0.438
eding+0.429
 Ski+0.420
Neg logit contributions
ARI-0.480
Dialogue-0.479
FTWARE-0.479
mented-0.473
 secretaries-0.465
Token the
Token ID262
Feature activation+4.37e+00

Pos logit contributions
ARI+3.038
mented+2.999
FTWARE+2.989
 secretaries+2.978
Dialogue+2.967
Neg logit contributions
 Crossing-3.250
zees-3.234
 Lama-2.924
eding-2.838
digy-2.832
Token high
Token ID1029
Feature activation-3.23e+00

Pos logit contributions
 secretaries+2.778
FTWARE+2.768
ARI+2.763
Dialogue+2.759
mented+2.708
Neg logit contributions
zees-3.602
 Crossing-3.554
 Lama-3.265
eding-3.212
digy-3.161
Token court
Token ID2184
Feature activation+2.93e+00

Pos logit contributions
 Crossing+2.312
zees+2.242
 Lama+2.121
eding+2.056
 Ski+2.012
Neg logit contributions
Dialogue-2.457
FTWARE-2.434
ARI-2.431
mented-2.403
 secretaries-2.340
Token
Token ID447
Feature activation+7.50e-01

Pos logit contributions
ARI+1.955
 secretaries+1.954
FTWARE+1.937
Dialogue+1.931
mented+1.916
Neg logit contributions
zees-2.336
 Crossing-2.327
 Lama-2.114
eding-2.069
 Ski-2.052
Token not
Token ID407
Feature activation-8.33e-01

Pos logit contributions
 Crossing+2.095
zees+2.042
 Lama+1.920
eding+1.866
 Ski+1.835
Neg logit contributions
Dialogue-1.977
ARI-1.964
FTWARE-1.963
mented-1.939
 secretaries-1.896
Token apologized
Token ID22893
Feature activation-2.90e+00

Pos logit contributions
 Crossing+0.651
zees+0.639
 Lama+0.597
eding+0.573
 Ski+0.569
Neg logit contributions
Dialogue-0.582
mented-0.578
ARI-0.577
FTWARE-0.575
 secretaries-0.561
Token to
Token ID284
Feature activation-8.29e-01

Pos logit contributions
 Crossing+2.233
zees+2.176
 Lama+2.050
eding+1.993
 Ski+1.960
Neg logit contributions
Dialogue-2.054
ARI-2.040
FTWARE-2.039
mented-2.013
 secretaries-1.967
Token Crow
Token ID15502
Feature activation-6.11e-01

Pos logit contributions
 Crossing+0.618
zees+0.611
 Lama+0.567
eding+0.550
 Ski+0.543
Neg logit contributions
Dialogue-0.600
ARI-0.599
FTWARE-0.596
mented-0.592
 secretaries-0.579
Tokender
Token ID1082
Feature activation-4.70e+00

Pos logit contributions
 Crossing+0.515
zees+0.502
 Lama+0.476
eding+0.463
 Ski+0.457
Neg logit contributions
Dialogue-0.389
ARI-0.386
FTWARE-0.386
mented-0.381
 secretaries-0.370
Token.
Token ID13
Feature activation+3.00e+00

Pos logit contributions
 Crossing+3.338
zees+3.206
 Lama+3.050
eding+2.959
 Ski+2.902
Neg logit contributions
Dialogue-3.606
FTWARE-3.583
ARI-3.578
mented-3.522
 secretaries-3.440
Token
Token ID198
Feature activation+8.17e-01

Pos logit contributions
Dialogue+1.867
ARI+1.833
FTWARE+1.814
mented+1.776
elve+1.673
Neg logit contributions
zees-2.520
 Crossing-2.487
 Lama-2.343
eding-2.336
digy-2.254
Token
Token ID198
Feature activation+3.11e+00

Pos logit contributions
Dialogue+0.503
ARI+0.498
FTWARE+0.491
mented+0.490
 secretaries+0.460
Neg logit contributions
 Crossing-0.705
zees-0.694
 Lama-0.655
eding-0.636
 Ski-0.627
TokenThe
Token ID464
Feature activation+2.85e-01

Pos logit contributions
Dialogue+1.920
ARI+1.879
FTWARE+1.804
mented+1.761
ATURE+1.667
Neg logit contributions
 Crossing-2.761
zees-2.697
 Lama-2.620
 Ski-2.483
eding-2.481
Token Celtics
Token ID24116
Feature activation+4.15e+00

Pos logit contributions
Dialogue+0.191
ARI+0.191
FTWARE+0.190
mented+0.186
 secretaries+0.185
Neg logit contributions
 Crossing-0.226
zees-0.225
 Lama-0.210
eding-0.205
 Ski-0.202
Token'
Token ID6
Feature activation-2.12e+00

Pos logit contributions
ARI+2.654
Dialogue+2.647
 secretaries+2.590
FTWARE+2.585
mented+2.580
Neg logit contributions
 Crossing-3.395
zees-3.346
 Lama-3.098
eding-3.003
 Ski-2.986
Token Rus
Token ID9223
Feature activation+5.60e+00

Pos logit contributions
 Crossing+2.147
zees+2.088
 Lama+1.964
eding+1.912
 Ski+1.883
Neg logit contributions
Dialogue-2.032
FTWARE-2.019
ARI-2.014
mented-1.991
 secretaries-1.948
Tokenhers
Token ID7084
Feature activation-4.76e+00

Pos logit contributions
mented+4.028
ARI+3.975
FTWARE+3.954
Dialogue+3.951
 secretaries+3.803
Neg logit contributions
 Crossing-4.094
zees-3.966
 Lama-3.756
 Trace-3.621
eding-3.611
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
 Crossing+3.606
zees+3.486
 Lama+3.285
eding+3.197
 Ski+3.165
Neg logit contributions
Dialogue-3.417
FTWARE-3.404
ARI-3.377
mented-3.344
 secretaries-3.277
Token
Token ID198
Feature activation+2.22e+00

Pos logit contributions
 Crossing+1.095
zees+1.068
 Lama+1.010
eding+0.984
 Ski+0.968
Neg logit contributions
Dialogue-0.892
ARI-0.886
FTWARE-0.886
mented-0.873
 secretaries-0.854
TokenSo
Token ID2396
Feature activation+2.42e+00

Pos logit contributions
Dialogue+1.414
ARI+1.411
FTWARE+1.357
mented+1.342
 secretaries+1.280
Neg logit contributions
 Crossing-1.905
zees-1.856
 Lama-1.771
eding-1.716
 Ski-1.695
Token there
Token ID612
Feature activation+4.13e+00

Pos logit contributions
FTWARE+1.670
ARI+1.650
mented+1.619
Dialogue+1.611
 secretaries+1.578
Neg logit contributions
 Crossing-1.880
zees-1.836
 Lama-1.724
eding-1.683
 Ski-1.646
Token
Token ID447
Feature activation+1.88e+00

Pos logit contributions
ARI+2.756
Dialogue+2.684
mented+2.682
FTWARE+2.681
 secretaries+2.595
Neg logit contributions
 Crossing-3.267
zees-3.203
 Lama-2.990
eding-2.964
digy-2.874
Token
Token ID247
Feature activation-1.27e+00

Pos logit contributions
mented+1.551
ARI+1.540
Dialogue+1.529
FTWARE+1.510
 secretaries+1.469
Neg logit contributions
 Crossing-1.221
zees-1.188
 Lama-1.077
eding-1.051
 Ski-1.048
Tokens
Token ID82
Feature activation-6.93e-01

Pos logit contributions
 Crossing+0.820
zees+0.791
 Lama+0.734
eding+0.708
 Ski+0.699
Neg logit contributions
Dialogue-1.057
ARI-1.053
mented-1.050
FTWARE-1.048
 secretaries-1.015
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
 Crossing+0.550
zees+0.538
 Lama+0.508
eding+0.495
 Ski+0.484
Neg logit contributions
Dialogue-0.472
ARI-0.471
FTWARE-0.468
mented-0.463
 secretaries-0.453
Token group
Token ID1448
Feature activation+1.72e+00

Pos logit contributions
ARI+0.811
Dialogue+0.804
FTWARE+0.803
mented+0.795
 secretaries+0.783
Neg logit contributions
 Crossing-0.938
zees-0.924
 Lama-0.862
eding-0.848
 Ski-0.820
Token many
Token ID867
Feature activation-1.70e-01

Pos logit contributions
 Crossing+1.273
zees+1.243
 Lama+1.169
eding+1.139
 Ski+1.121
Neg logit contributions
Dialogue-1.105
ARI-1.098
FTWARE-1.096
mented-1.085
 secretaries-1.063
Token years
Token ID812
Feature activation+2.54e-01

Pos logit contributions
 Crossing+0.143
zees+0.140
 Lama+0.132
eding+0.129
 Ski+0.127
Neg logit contributions
Dialogue-0.107
ARI-0.107
FTWARE-0.106
mented-0.106
 secretaries-0.104
Token.
Token ID13
Feature activation-5.20e-01

Pos logit contributions
ARI+0.176
FTWARE+0.176
Dialogue+0.176
mented+0.174
 secretaries+0.170
Neg logit contributions
 Crossing-0.198
zees-0.194
 Lama-0.183
eding-0.177
 Ski-0.175
Token Parents
Token ID28231
Feature activation+4.89e-01

Pos logit contributions
zees+0.415
 Crossing+0.412
 Lama+0.383
eding+0.380
 Ski+0.366
Neg logit contributions
Dialogue-0.351
ARI-0.351
FTWARE-0.347
mented-0.342
 secretaries-0.329
Token who
Token ID508
Feature activation+1.58e+00

Pos logit contributions
ARI+0.346
Dialogue+0.345
mented+0.339
FTWARE+0.338
 secretaries+0.337
Neg logit contributions
 Crossing-0.375
zees-0.372
 Lama-0.342
 Ski-0.331
eding-0.328
Token complain
Token ID13121
Feature activation+3.40e-01

Pos logit contributions
Dialogue+1.075
ARI+1.073
mented+1.068
FTWARE+1.061
 secretaries+1.044
Neg logit contributions
 Crossing-1.253
zees-1.226
 Lama-1.139
 Ski-1.092
MQ-1.083
Token they
Token ID484
Feature activation+2.40e+00

Pos logit contributions
ARI+0.222
Dialogue+0.221
FTWARE+0.220
mented+0.220
 secretaries+0.215
Neg logit contributions
 Crossing-0.280
zees-0.272
 Lama-0.255
eding-0.249
 Ski-0.246
Token were
Token ID547
Feature activation-1.53e+00

Pos logit contributions
ARI+1.596
Dialogue+1.595
mented+1.594
FTWARE+1.560
 secretaries+1.559
Neg logit contributions
 Crossing-1.936
zees-1.869
 Lama-1.764
MQ-1.695
 Ski-1.675
Token unaware
Token ID17261
Feature activation-4.56e+00

Pos logit contributions
 Crossing+1.195
zees+1.167
 Lama+1.093
eding+1.052
 Ski+1.049
Neg logit contributions
ARI-1.079
Dialogue-1.065
mented-1.060
FTWARE-1.055
 secretaries-1.044
Token that
Token ID326
Feature activation-5.51e-01

Pos logit contributions
 Crossing+3.317
zees+3.226
 Lama+3.035
eding+2.952
 Ski+2.891
Neg logit contributions
Dialogue-3.428
FTWARE-3.400
ARI-3.387
mented-3.343
 secretaries-3.259
Token their
Token ID511
Feature activation-6.22e-01

Pos logit contributions
 Crossing+0.395
zees+0.390
 Lama+0.357
eding+0.348
 Ski+0.344
Neg logit contributions
Dialogue-0.417
ARI-0.417
 secretaries-0.411
mented-0.410
FTWARE-0.409
TokenOn
Token ID2202
Feature activation+6.26e+00

Pos logit contributions
 Crossing+0.259
zees+0.254
 Lama+0.241
eding+0.233
 Ski+0.231
Neg logit contributions
Dialogue-0.191
ARI-0.188
FTWARE-0.186
mented-0.182
 secretaries-0.176
Token 29
Token ID2808
Feature activation+4.05e+00

Pos logit contributions
mented+3.467
Dialogue+3.368
ARI+3.291
FTWARE+3.262
 elements+3.212
Neg logit contributions
zees-5.530
 Crossing-5.370
eding-5.092
owler-5.053
 Lama-4.997
Token April
Token ID3035
Feature activation+2.61e+00

Pos logit contributions
mented+2.175
Dialogue+2.153
ARI+2.105
FTWARE+2.057
 elements+2.023
Neg logit contributions
zees-3.701
 Crossing-3.693
eding-3.427
 Lama-3.419
owler-3.351
Token,
Token ID11
Feature activation+2.21e+00

Pos logit contributions
Dialogue+2.103
mented+2.089
ARI+2.084
FTWARE+2.069
 secretaries+2.038
Neg logit contributions
 Crossing-1.723
zees-1.700
 Lama-1.540
eding-1.519
 Ski-1.484
Token G
Token ID402
Feature activation+1.10e+00

Pos logit contributions
Dialogue+1.580
FTWARE+1.571
ARI+1.570
mented+1.558
 secretaries+1.538
Neg logit contributions
zees-1.641
 Crossing-1.639
 Lama-1.499
eding-1.471
 Ski-1.448
Tokenraz
Token ID3247
Feature activation+4.47e+00

Pos logit contributions
ARI+0.926
Dialogue+0.919
FTWARE+0.905
mented+0.904
 secretaries+0.884
Neg logit contributions
 Crossing-0.712
zees-0.689
 Lama-0.640
 Ski-0.610
eding-0.608
Tokeniani
Token ID25111
Feature activation-1.52e-01

Pos logit contributions
Dialogue+1.844
ARI+1.838
mented+1.824
FTWARE+1.732
elve+1.635
Neg logit contributions
 Crossing-4.660
zees-4.585
 Lama-4.408
 Ski-4.278
MQ-4.244
Token surrendered
Token ID29209
Feature activation-2.80e+00

Pos logit contributions
 Crossing+0.119
zees+0.117
 Lama+0.109
eding+0.106
 Ski+0.105
Neg logit contributions
Dialogue-0.104
ARI-0.104
FTWARE-0.103
mented-0.103
 secretaries-0.101
Token and
Token ID290
Feature activation-5.78e+00

Pos logit contributions
 Crossing+2.166
zees+2.119
 Lama+1.989
eding+1.936
 Ski+1.904
Neg logit contributions
Dialogue-1.967
ARI-1.955
FTWARE-1.952
mented-1.930
 secretaries-1.889
Token was
Token ID373
Feature activation-4.32e+00

Pos logit contributions
 Crossing+4.315
zees+4.123
 Lama+3.987
eding+3.848
 Ski+3.772
Neg logit contributions
Dialogue-4.176
ARI-4.140
FTWARE-4.140
mented-4.075
 secretaries-3.946
Token present
Token ID1944
Feature activation+1.58e+00

Pos logit contributions
 Crossing+3.617
zees+3.509
 Lama+3.358
eding+3.277
 Ski+3.225
Neg logit contributions
Dialogue-2.757
FTWARE-2.733
ARI-2.717
mented-2.668
 secretaries-2.600
Token,
Token ID11
Feature activation-7.31e-01

Pos logit contributions
ARI+2.326
Dialogue+2.270
FTWARE+2.265
mented+2.258
 secretaries+2.136
Neg logit contributions
 Crossing-3.024
zees-3.001
 Lama-2.818
eding-2.713
 Ski-2.675
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 Crossing+0.523
zees+0.517
 Lama+0.474
eding+0.460
 Ski+0.453
Neg logit contributions
ARI-0.556
Dialogue-0.550
FTWARE-0.549
mented-0.547
 secretaries-0.535
Token No
Token ID1400
Feature activation-3.41e+00

Pos logit contributions
 Crossing+0.967
zees+0.955
 Lama+0.889
eding+0.870
 Ski+0.854
Neg logit contributions
ARI-0.825
Dialogue-0.821
FTWARE-0.817
mented-0.812
 secretaries-0.793
Token.
Token ID13
Feature activation-2.49e-01

Pos logit contributions
 Crossing+2.492
zees+2.419
 Lama+2.278
eding+2.210
 Ski+2.177
Neg logit contributions
Dialogue-2.544
FTWARE-2.529
ARI-2.528
mented-2.497
 secretaries-2.455
Token 41
Token ID6073
Feature activation-1.15e+00

Pos logit contributions
zees+0.208
 Crossing+0.208
 Lama+0.193
eding+0.191
 Ski+0.184
Neg logit contributions
Dialogue-0.160
ARI-0.159
FTWARE-0.158
mented-0.155
 secretaries-0.148
Token pick
Token ID2298
Feature activation-3.27e+00

Pos logit contributions
 Crossing+0.727
zees+0.718
 Lama+0.653
eding+0.628
 Ski+0.615
Neg logit contributions
ARI-0.972
Dialogue-0.958
mented-0.957
FTWARE-0.952
 secretaries-0.935
Token in
Token ID287
Feature activation+1.08e+00

Pos logit contributions
 Crossing+2.489
zees+2.440
 Lama+2.279
eding+2.212
 Ski+2.178
Neg logit contributions
ARI-2.341
Dialogue-2.337
FTWARE-2.320
mented-2.307
 secretaries-2.248
Token the
Token ID262
Feature activation+2.61e+00

Pos logit contributions
Dialogue+0.722
ARI+0.721
mented+0.713
FTWARE+0.704
 secretaries+0.693
Neg logit contributions
zees-0.864
 Crossing-0.862
 Lama-0.803
eding-0.774
 Ski-0.762
Token 2015
Token ID1853
Feature activation+3.72e+00

Pos logit contributions
Dialogue+1.467
ARI+1.466
mented+1.435
FTWARE+1.401
 secretaries+1.397
Neg logit contributions
zees-2.345
 Crossing-2.317
 Lama-2.174
eding-2.105
 Ski-2.077
Token draft
Token ID4538
Feature activation+1.03e+00

Pos logit contributions
ARI+2.325
Dialogue+2.310
mented+2.287
 secretaries+2.209
FTWARE+2.188
Neg logit contributions
zees-3.110
 Crossing-3.090
 Lama-2.849
eding-2.770
 Ski-2.722
Token (
Token ID357
Feature activation-1.18e-01

Pos logit contributions
ARI+0.688
Dialogue+0.682
FTWARE+0.679
mented+0.677
 secretaries+0.655
Neg logit contributions
 Crossing-0.823
zees-0.822
 Lama-0.752
eding-0.729
 Ski-0.726
Token seen
Token ID1775
Feature activation-1.08e-01

Pos logit contributions
Dialogue+1.465
ARI+1.446
mented+1.434
 secretaries+1.420
FTWARE+1.415
Neg logit contributions
 Crossing-1.714
zees-1.684
 Lama-1.562
eding-1.535
 Ski-1.509
Token pictures
Token ID5986
Feature activation-5.36e-01

Pos logit contributions
 Crossing+0.082
zees+0.081
 Lama+0.075
eding+0.074
 Ski+0.072
Neg logit contributions
Dialogue-0.078
ARI-0.077
FTWARE-0.076
mented-0.076
 secretaries-0.075
Token from
Token ID422
Feature activation-3.43e-01

Pos logit contributions
 Crossing+0.416
zees+0.408
 Lama+0.378
eding+0.371
 Ski+0.365
Neg logit contributions
Dialogue-0.376
ARI-0.375
FTWARE-0.372
mented-0.370
 secretaries-0.361
Token them
Token ID606
Feature activation+1.60e-01

Pos logit contributions
 Crossing+0.242
zees+0.237
 Lama+0.220
eding+0.215
 Ski+0.210
Neg logit contributions
Dialogue-0.265
ARI-0.263
FTWARE-0.260
mented-0.259
 secretaries-0.256
Token and
Token ID290
Feature activation-2.33e-01

Pos logit contributions
Dialogue+0.109
ARI+0.109
FTWARE+0.109
mented+0.108
 secretaries+0.105
Neg logit contributions
 Crossing-0.126
zees-0.125
 Lama-0.116
eding-0.113
 Ski-0.112
Token obviously
Token ID6189
Feature activation-4.79e+00

Pos logit contributions
 Crossing+0.175
zees+0.171
 Lama+0.160
eding+0.156
 Ski+0.154
Neg logit contributions
Dialogue-0.169
ARI-0.168
FTWARE-0.168
mented-0.166
 secretaries-0.163
Token find
Token ID1064
Feature activation+8.73e-01

Pos logit contributions
 Crossing+3.496
zees+3.374
 Lama+3.207
eding+3.089
 Ski+3.035
Neg logit contributions
FTWARE-3.556
Dialogue-3.531
ARI-3.518
mented-3.484
 secretaries-3.361
Token them
Token ID606
Feature activation+6.20e-01

Pos logit contributions
Dialogue+0.654
ARI+0.649
mented+0.643
FTWARE+0.641
 secretaries+0.631
Neg logit contributions
 Crossing-0.636
zees-0.623
 Lama-0.580
eding-0.568
 Ski-0.556
Token disturbing
Token ID14851
Feature activation+5.33e+00

Pos logit contributions
Dialogue+0.368
ARI+0.367
FTWARE+0.364
mented+0.361
 secretaries+0.352
Neg logit contributions
 Crossing-0.547
zees-0.538
 Lama-0.506
eding-0.497
 Ski-0.490
Token,
Token ID11
Feature activation+2.55e+00

Pos logit contributions
Dialogue+3.375
mented+3.333
ARI+3.333
FTWARE+3.321
 elements+3.206
Neg logit contributions
 Crossing-4.330
zees-4.312
 Lama-4.017
eding-3.934
 Ski-3.882
Token
Token ID447
Feature activation-2.47e+00

Pos logit contributions
Dialogue+1.771
ARI+1.766
mented+1.746
FTWARE+1.733
 secretaries+1.713
Neg logit contributions
 Crossing-1.975
zees-1.964
 Lama-1.807
eding-1.770
 Ski-1.746
Token cl
Token ID537
Feature activation+8.74e-01

Pos logit contributions
FTWARE+0.712
Dialogue+0.711
ARI+0.708
mented+0.701
 secretaries+0.684
Neg logit contributions
 Crossing-1.133
zees-1.110
 Lama-1.051
eding-1.027
 Ski-1.016
Tokenojure
Token ID32511
Feature activation-2.11e+00

Pos logit contributions
mented+0.430
Dialogue+0.430
ARI+0.429
FTWARE+0.427
elve+0.400
Neg logit contributions
 Crossing-0.862
zees-0.825
 Lama-0.801
 Ski-0.773
eding-0.772
Token |
Token ID930
Feature activation-1.88e+00

Pos logit contributions
 Crossing+1.695
zees+1.653
 Lama+1.561
eding+1.520
 Ski+1.499
Neg logit contributions
Dialogue-1.423
ARI-1.412
FTWARE-1.410
mented-1.388
 secretaries-1.363
Token 1
Token ID352
Feature activation-2.92e+00

Pos logit contributions
 Crossing+1.460
zees+1.423
 Lama+1.342
eding+1.304
 Ski+1.284
Neg logit contributions
Dialogue-1.317
ARI-1.308
FTWARE-1.307
mented-1.290
 secretaries-1.261
Token.
Token ID764
Feature activation-3.10e+00

Pos logit contributions
 Crossing+2.842
zees+2.784
 Lama+2.657
eding+2.600
 Ski+2.567
Neg logit contributions
Dialogue-1.475
ARI-1.461
FTWARE-1.460
mented-1.433
 secretaries-1.391
Token 5
Token ID642
Feature activation-2.90e+00

Pos logit contributions
 Crossing+2.521
zees+2.460
 Lama+2.325
eding+2.263
 Ski+2.230
Neg logit contributions
Dialogue-2.062
ARI-2.048
FTWARE-2.047
mented-2.018
 secretaries-1.971
Token.
Token ID764
Feature activation-3.34e+00

Pos logit contributions
 Crossing+2.751
zees+2.687
 Lama+2.562
eding+2.505
 Ski+2.477
Neg logit contributions
Dialogue-1.528
FTWARE-1.512
ARI-1.510
mented-1.479
 secretaries-1.453
Token 1
Token ID352
Feature activation-2.07e+00

Pos logit contributions
 Crossing+2.621
zees+2.548
 Lama+2.409
eding+2.338
 Ski+2.311
Neg logit contributions
Dialogue-2.317
FTWARE-2.302
ARI-2.300
mented-2.265
 secretaries-2.224
Token |
Token ID930
Feature activation-1.68e+00

Pos logit contributions
 Crossing+1.636
zees+1.594
 Lama+1.506
eding+1.464
 Ski+1.451
Neg logit contributions
Dialogue-1.419
ARI-1.405
FTWARE-1.402
mented-1.377
 secretaries-1.365
Token 4
Token ID604
Feature activation-6.17e+00

Pos logit contributions
 Crossing+1.375
zees+1.351
 Lama+1.269
eding+1.240
 Ski+1.215
Neg logit contributions
Dialogue-1.106
ARI-1.101
FTWARE-1.099
mented-1.087
 secretaries-1.055
Token251
Token ID28072
Feature activation+1.38e+00

Pos logit contributions
 Crossing+4.305
zees+4.142
 Lama+3.903
eding+3.793
 Ski+3.732
Neg logit contributions
Dialogue-4.678
FTWARE-4.665
ARI-4.646
mented-4.584
 secretaries-4.582
Token press
Token ID1803
Feature activation+5.48e+00

Pos logit contributions
Dialogue+2.609
mented+2.569
ARI+2.547
 secretaries+2.537
FTWARE+2.511
Neg logit contributions
zees-3.505
 Crossing-3.496
 Lama-3.170
eding-3.142
 Ski-3.126
Token are
Token ID389
Feature activation+3.70e+00

Pos logit contributions
ARI+3.637
 secretaries+3.624
mented+3.519
Dialogue+3.503
 elements+3.498
Neg logit contributions
 Crossing-4.316
zees-4.243
 Lama-3.994
MQ-3.857
 Ski-3.836
Token entirely
Token ID5000
Feature activation+2.40e+00

Pos logit contributions
mented+2.265
ARI+2.239
 secretaries+2.219
FTWARE+2.184
Dialogue+2.171
Neg logit contributions
 Crossing-3.151
zees-3.075
 Lama-2.887
MQ-2.799
 Ski-2.783
Token absent
Token ID13717
Feature activation+5.01e+00

Pos logit contributions
ARI+1.414
mented+1.404
FTWARE+1.390
 secretaries+1.390
Dialogue+1.383
Neg logit contributions
 Crossing-2.105
zees-2.063
 Lama-1.947
 Ski-1.870
MQ-1.867
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ARI+3.168
mented+3.151
FTWARE+3.113
Dialogue+3.089
 secretaries+3.068
Neg logit contributions
 Crossing-4.094
zees-4.058
 Lama-3.790
 Ski-3.630
MQ-3.610
Token Human
Token ID5524
Feature activation-4.69e+00

Pos logit contributions
zees+0.257
 Crossing+0.256
 Lama+0.237
eding+0.234
owler+0.226
Neg logit contributions
Dialogue-0.255
ARI-0.251
FTWARE-0.251
mented-0.249
 secretaries-0.240
Token rights
Token ID2489
Feature activation+2.54e+00

Pos logit contributions
 Crossing+3.762
zees+3.678
 Lama+3.455
eding+3.370
 Ski+3.303
Neg logit contributions
Dialogue-3.151
FTWARE-3.143
ARI-3.142
mented-3.088
 secretaries-2.999
Token don
Token ID836
Feature activation-5.12e-01

Pos logit contributions
ARI+1.786
 secretaries+1.770
Dialogue+1.764
FTWARE+1.760
mented+1.722
Neg logit contributions
 Crossing-1.967
zees-1.922
 Lama-1.782
 Ski-1.728
eding-1.718
Token't
Token ID470
Feature activation-1.39e+00

Pos logit contributions
 Crossing+0.395
zees+0.385
 Lama+0.361
eding+0.351
 Ski+0.346
Neg logit contributions
Dialogue-0.364
ARI-0.362
FTWARE-0.360
mented-0.357
 secretaries-0.344
Token apply
Token ID4174
Feature activation+2.95e+00

Pos logit contributions
 Crossing+1.153
zees+1.125
 Lama+1.066
eding+1.041
 Ski+1.023
Neg logit contributions
Dialogue-0.904
ARI-0.896
FTWARE-0.895
mented-0.882
 secretaries-0.861
Token.
Token ID13
Feature activation-7.87e-01

Pos logit contributions
ARI+1.878
mented+1.867
FTWARE+1.849
Dialogue+1.834
 secretaries+1.825
Neg logit contributions
 Crossing-2.457
zees-2.412
 Lama-2.247
MQ-2.166
eding-2.157
Token recently
Token ID2904
Feature activation+6.50e-01

Pos logit contributions
ARI+2.128
mented+2.102
 secretaries+2.093
Dialogue+2.090
FTWARE+2.064
Neg logit contributions
 Crossing-2.599
zees-2.588
 Lama-2.404
eding-2.329
 Ski-2.302
Token said
Token ID531
Feature activation+4.18e+00

Pos logit contributions
ARI+0.437
mented+0.435
Dialogue+0.434
FTWARE+0.432
 secretaries+0.426
Neg logit contributions
 Crossing-0.517
zees-0.510
 Lama-0.477
eding-0.460
 Ski-0.457
Token a
Token ID257
Feature activation+9.86e-01

Pos logit contributions
ARI+2.674
Dialogue+2.657
FTWARE+2.639
 secretaries+2.629
mented+2.585
Neg logit contributions
zees-3.386
 Crossing-3.366
 Lama-3.144
eding-3.039
 Ski-3.018
Token fair
Token ID3148
Feature activation-1.54e+00

Pos logit contributions
Dialogue+0.673
ARI+0.667
FTWARE+0.667
mented+0.659
 secretaries+0.654
Neg logit contributions
zees-0.771
 Crossing-0.769
 Lama-0.713
eding-0.697
 Ski-0.680
Token price
Token ID2756
Feature activation-4.43e+00

Pos logit contributions
 Crossing+1.033
zees+1.015
 Lama+0.941
eding+0.907
 Ski+0.889
Neg logit contributions
Dialogue-1.243
ARI-1.240
FTWARE-1.236
mented-1.225
 secretaries-1.207
Token would
Token ID561
Feature activation-4.19e+00

Pos logit contributions
 Crossing+3.255
zees+3.152
 Lama+2.971
eding+2.883
 Ski+2.835
Neg logit contributions
Dialogue-3.296
FTWARE-3.281
ARI-3.271
mented-3.226
 secretaries-3.153
Token be
Token ID307
Feature activation-1.45e+00

Pos logit contributions
 Crossing+3.062
zees+2.969
 Lama+2.781
eding+2.734
 Ski+2.678
Neg logit contributions
Dialogue-3.125
FTWARE-3.111
ARI-3.100
mented-3.053
 secretaries-2.989
Token nearer
Token ID40671
Feature activation+3.79e+00

Pos logit contributions
 Crossing+1.099
zees+1.082
 Lama+1.012
eding+0.986
 Ski+0.967
Neg logit contributions
Dialogue-1.033
ARI-1.029
FTWARE-1.025
mented-1.019
 secretaries-1.004
Token $
Token ID720
Feature activation+2.13e+00

Pos logit contributions
ARI+2.319
Dialogue+2.305
FTWARE+2.285
mented+2.281
 secretaries+2.249
Neg logit contributions
zees-3.186
 Crossing-3.164
 Lama-2.958
eding-2.912
 Ski-2.827
Token40
Token ID1821
Feature activation+4.21e+00

Pos logit contributions
Dialogue+1.486
ARI+1.464
FTWARE+1.435
mented+1.422
ATURE+1.335
Neg logit contributions
 Crossing-1.664
zees-1.655
 Lama-1.546
eding-1.511
 Ski-1.479
Token a
Token ID257
Feature activation-9.70e-01

Pos logit contributions
Dialogue+2.685
mented+2.648
ARI+2.647
FTWARE+2.596
 secretaries+2.520
Neg logit contributions
 Crossing-3.429
zees-3.393
 Lama-3.204
eding-3.108
 Ski-3.064
Tokensee
Token ID3826
Feature activation+2.52e+00

Pos logit contributions
Dialogue+0.632
ARI+0.624
FTWARE+0.606
mented+0.596
 secretaries+0.570
Neg logit contributions
 Crossing-0.883
zees-0.871
 Lama-0.826
eding-0.800
 Ski-0.790
Token also
Token ID635
Feature activation+2.39e+00

Pos logit contributions
ARI+1.801
Dialogue+1.787
mented+1.763
FTWARE+1.744
 secretaries+1.737
Neg logit contributions
 Crossing-1.889
zees-1.887
 Lama-1.754
eding-1.711
 Ski-1.665
Token Trump
Token ID1301
Feature activation+7.25e-01

Pos logit contributions
Dialogue+1.694
ARI+1.682
FTWARE+1.666
mented+1.641
 secretaries+1.636
Neg logit contributions
zees-1.793
 Crossing-1.776
 Lama-1.658
eding-1.628
 Ski-1.576
Token hints
Token ID20269
Feature activation-4.36e+00

Pos logit contributions
Dialogue+0.487
ARI+0.485
FTWARE+0.479
mented+0.475
 secretaries+0.475
Neg logit contributions
 Crossing-0.574
zees-0.570
 Lama-0.534
eding-0.516
 Ski-0.509
Token he
Token ID339
Feature activation-3.48e-02

Pos logit contributions
 Crossing+3.333
zees+3.234
 Lama+3.080
eding+2.983
 Ski+2.941
Neg logit contributions
Dialogue-3.074
FTWARE-3.072
ARI-3.028
mented-3.013
 secretaries-2.912
Token taped
Token ID36114
Feature activation-1.08e+01

Pos logit contributions
 Crossing+0.028
zees+0.027
 Lama+0.025
eding+0.024
 Ski+0.024
Neg logit contributions
ARI-0.024
Dialogue-0.024
mented-0.023
FTWARE-0.023
 secretaries-0.023
Token Comey
Token ID12388
Feature activation-1.23e-02

Pos logit contributions
 Crossing+8.055
zees+7.472
 Lama+7.424
eding+7.068
 Ski+6.962
Neg logit contributions
FTWARE-7.542
Dialogue-7.407
ARI-7.339
mented-7.161
elve-6.953
Token conversations
Token ID10275
Feature activation-4.40e+00

Pos logit contributions
 Crossing+0.010
zees+0.010
 Lama+0.009
eding+0.009
 Ski+0.009
Neg logit contributions
Dialogue-0.009
ARI-0.009
FTWARE-0.008
mented-0.008
 secretaries-0.008
Token in
Token ID287
Feature activation-1.09e+00

Pos logit contributions
 Crossing+3.318
zees+3.220
 Lama+3.036
eding+2.948
 Ski+2.900
Neg logit contributions
Dialogue-3.193
FTWARE-3.175
ARI-3.168
mented-3.132
 secretaries-3.053
Token veiled
Token ID45694
Feature activation+4.30e-01

Pos logit contributions
 Crossing+0.864
zees+0.854
 Lama+0.796
eding+0.774
 Ski+0.761
Neg logit contributions
Dialogue-0.746
ARI-0.740
FTWARE-0.733
mented-0.732
 secretaries-0.728
Token threat
Token ID2372
Feature activation-2.20e-01

Pos logit contributions
Dialogue+0.294
ARI+0.294
mented+0.290
FTWARE+0.288
 secretaries+0.287
Neg logit contributions
 Crossing-0.335
zees-0.333
 Lama-0.309
eding-0.301
 Ski-0.296
Token J
Token ID449
Feature activation-4.18e-01

Pos logit contributions
 Crossing+1.664
zees+1.649
 Lama+1.544
eding+1.514
 Ski+1.475
Neg logit contributions
Dialogue-1.297
ARI-1.292
FTWARE-1.289
mented-1.273
 secretaries-1.225
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
 Crossing+0.383
zees+0.375
 Lama+0.357
eding+0.350
 Ski+0.344
Neg logit contributions
Dialogue-0.234
ARI-0.233
FTWARE-0.232
mented-0.227
 secretaries-0.224
Token (
Token ID357
Feature activation-8.90e-01

Pos logit contributions
 Crossing+1.562
zees+1.544
 Lama+1.450
eding+1.420
 Ski+1.386
Neg logit contributions
Dialogue-1.185
ARI-1.179
FTWARE-1.177
mented-1.164
 secretaries-1.118
Token 2010
Token ID3050
Feature activation-2.85e-01

Pos logit contributions
 Crossing+0.763
zees+0.759
 Lama+0.712
eding+0.692
 Ski+0.680
Neg logit contributions
Dialogue-0.549
ARI-0.544
FTWARE-0.540
mented-0.535
 secretaries-0.520
Token ).
Token ID6739
Feature activation-3.87e+00

Pos logit contributions
 Crossing+0.198
zees+0.192
 Lama+0.180
eding+0.174
 Ski+0.171
Neg logit contributions
Dialogue-0.224
ARI-0.223
FTWARE-0.223
mented-0.220
 secretaries-0.215
Token Cl
Token ID1012
Feature activation-1.13e+01

Pos logit contributions
 Crossing+2.756
zees+2.674
 Lama+2.507
eding+2.428
 Ski+2.389
Neg logit contributions
Dialogue-2.973
ARI-2.959
FTWARE-2.957
mented-2.921
 secretaries-2.864
Tokenueless
Token ID38835
Feature activation+2.33e-02

Pos logit contributions
zees+8.749
 Crossing+8.718
eding+8.440
 Lama+7.917
owler+7.869
Neg logit contributions
FTWARE-7.256
Dialogue-7.074
 secretaries-6.999
ARI-6.916
mented-6.606
Token or
Token ID393
Feature activation+1.63e+00

Pos logit contributions
Dialogue+0.017
ARI+0.016
mented+0.016
FTWARE+0.016
 secretaries+0.016
Neg logit contributions
 Crossing-0.018
zees-0.017
 Lama-0.016
eding-0.016
 Ski-0.016
Token powerful
Token ID3665
Feature activation+5.06e+00

Pos logit contributions
Dialogue+1.094
mented+1.080
ARI+1.073
 secretaries+1.063
FTWARE+1.063
Neg logit contributions
 Crossing-1.289
zees-1.279
 Lama-1.192
eding-1.158
 Ski-1.139
Token?
Token ID30
Feature activation-1.18e+00

Pos logit contributions
Dialogue+3.258
 secretaries+3.239
ARI+3.209
 elements+3.196
mented+3.182
Neg logit contributions
 Crossing-3.998
zees-3.962
 Lama-3.757
eding-3.604
 Ski-3.560
Token Ident
Token ID11440
Feature activation-2.14e+00

Pos logit contributions
 Crossing+0.833
zees+0.827
 Lama+0.769
eding+0.752
 Ski+0.730
Neg logit contributions
Dialogue-0.905
ARI-0.893
FTWARE-0.890
mented-0.883
 secretaries-0.862
Token is
Token ID318
Feature activation-5.77e-01

Pos logit contributions
ARI+1.299
Dialogue+1.285
mented+1.269
FTWARE+1.264
 secretaries+1.244
Neg logit contributions
 Crossing-1.376
zees-1.366
 Lama-1.261
eding-1.239
 Ski-1.212
Token an
Token ID281
Feature activation-2.78e+00

Pos logit contributions
 Crossing+0.464
zees+0.460
 Lama+0.427
eding+0.419
 Ski+0.411
Neg logit contributions
Dialogue-0.389
ARI-0.387
FTWARE-0.382
mented-0.381
 secretaries-0.369
Token interview
Token ID2720
Feature activation-3.77e+00

Pos logit contributions
 Crossing+2.194
zees+2.134
 Lama+2.018
eding+1.964
 Ski+1.933
Neg logit contributions
Dialogue-1.910
FTWARE-1.897
ARI-1.894
mented-1.869
 secretaries-1.826
Token with
Token ID351
Feature activation-4.32e+00

Pos logit contributions
 Crossing+2.398
zees+2.330
 Lama+2.158
eding+2.088
 Ski+2.045
Neg logit contributions
Dialogue-3.172
ARI-3.157
FTWARE-3.150
mented-3.120
 secretaries-3.061
Token one
Token ID530
Feature activation-5.54e+00

Pos logit contributions
 Crossing+3.132
zees+3.019
 Lama+2.866
eding+2.752
 Ski+2.723
Neg logit contributions
Dialogue-3.250
FTWARE-3.236
ARI-3.225
mented-3.186
 secretaries-3.107
Token of
Token ID286
Feature activation-1.16e+01

Pos logit contributions
 Crossing+3.963
zees+3.792
 Lama+3.644
eding+3.487
 Ski+3.442
Neg logit contributions
FTWARE-4.193
Dialogue-4.191
ARI-4.131
mented-4.089
 secretaries-4.007
Token the
Token ID262
Feature activation-3.94e+00

Pos logit contributions
 Crossing+6.888
zees+6.287
 Lama+6.283
 Trace+5.863
 Ski+5.834
Neg logit contributions
FTWARE-9.793
Dialogue-9.691
ARI-9.430
mented-9.327
 secretaries-9.078
Token largest
Token ID4387
Feature activation-6.01e-01

Pos logit contributions
 Crossing+3.034
zees+2.928
 Lama+2.780
eding+2.690
 Ski+2.657
Neg logit contributions
Dialogue-2.789
FTWARE-2.784
ARI-2.767
mented-2.729
 secretaries-2.639
Token anarchist
Token ID26177
Feature activation-6.37e-01

Pos logit contributions
 Crossing+0.439
zees+0.430
 Lama+0.402
eding+0.391
 Ski+0.383
Neg logit contributions
Dialogue-0.448
ARI-0.446
FTWARE-0.442
mented-0.440
 secretaries-0.434
Token groups
Token ID2628
Feature activation+2.18e+00

Pos logit contributions
 Crossing+0.509
zees+0.501
 Lama+0.471
eding+0.458
 Ski+0.450
Neg logit contributions
Dialogue-0.431
ARI-0.428
FTWARE-0.423
mented-0.421
 secretaries-0.417
Token in
Token ID287
Feature activation-6.58e-01

Pos logit contributions
ARI+1.476
Dialogue+1.441
mented+1.420
 secretaries+1.411
FTWARE+1.405
Neg logit contributions
zees-1.746
 Crossing-1.733
 Lama-1.608
eding-1.554
 Ski-1.539
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
 Crossing+3.406
zees+3.302
 Lama+3.116
eding+3.032
 Ski+2.973
Neg logit contributions
Dialogue-3.364
FTWARE-3.334
ARI-3.327
mented-3.280
 secretaries-3.210
Token
Token ID198
Feature activation-3.08e+00

Pos logit contributions
zees+1.066
 Crossing+1.063
 Lama+0.984
eding+0.974
 Ski+0.937
Neg logit contributions
Dialogue-0.965
ARI-0.952
FTWARE-0.947
mented-0.936
 secretaries-0.906
Token
Token ID198
Feature activation-3.41e+00

Pos logit contributions
 Crossing+2.420
zees+2.345
 Lama+2.226
eding+2.170
 Ski+2.137
Neg logit contributions
Dialogue-2.121
FTWARE-2.118
ARI-2.109
mented-2.079
 secretaries-2.076
TokenThe
Token ID464
Feature activation-1.47e+00

Pos logit contributions
 Crossing+2.875
zees+2.808
 Lama+2.659
eding+2.594
 Ski+2.555
Neg logit contributions
Dialogue-2.157
FTWARE-2.145
ARI-2.144
mented-2.115
 secretaries-2.066
Token Dram
Token ID36736
Feature activation-3.88e+00

Pos logit contributions
 Crossing+1.090
zees+1.075
 Lama+1.003
eding+0.979
 Ski+0.954
Neg logit contributions
Dialogue-1.071
ARI-1.060
FTWARE-1.056
mented-1.050
 secretaries-1.032
Tokenatically
Token ID4142
Feature activation-1.09e+01

Pos logit contributions
 Crossing+3.481
zees+3.399
 Lama+3.242
eding+3.162
 Ski+3.114
Neg logit contributions
Dialogue-2.238
FTWARE-2.235
ARI-2.223
mented-2.178
 secretaries-2.136
Token Different
Token ID20615
Feature activation-6.15e+00

Pos logit contributions
 Crossing+7.135
zees+6.624
 Lama+6.362
eding+6.104
 Ski+6.070
Neg logit contributions
FTWARE-8.691
ARI-8.649
Dialogue-8.626
mented-8.458
 secretaries-8.412
Token Attention
Token ID47406
Feature activation-4.47e+00

Pos logit contributions
 Crossing+4.482
zees+4.294
 Lama+4.079
eding+3.924
 Ski+3.913
Neg logit contributions
Dialogue-4.547
FTWARE-4.541
ARI-4.528
mented-4.460
 secretaries-4.396
Token Economy
Token ID18493
Feature activation-4.85e+00

Pos logit contributions
 Crossing+3.121
zees+3.027
 Lama+2.837
eding+2.744
 Ski+2.699
Neg logit contributions
Dialogue-3.485
FTWARE-3.471
ARI-3.465
mented-3.427
 secretaries-3.355
Token Of
Token ID3226
Feature activation-2.84e+00

Pos logit contributions
 Crossing+3.691
zees+3.573
 Lama+3.372
eding+3.274
 Ski+3.236
Neg logit contributions
Dialogue-3.463
FTWARE-3.456
ARI-3.444
mented-3.395
 secretaries-3.331
Token The
Token ID383
Feature activation-9.51e-02

Pos logit contributions
 Crossing+2.036
zees+1.976
 Lama+1.854
eding+1.797
 Ski+1.768
Neg logit contributions
Dialogue-2.168
ARI-2.157
FTWARE-2.156
mented-2.129
 secretaries-2.085
Token almond
Token ID26948
Feature activation+8.99e-01

Pos logit contributions
 Crossing+0.637
zees+0.619
 Lama+0.578
eding+0.564
 Ski+0.553
Neg logit contributions
Dialogue-0.621
ARI-0.620
FTWARE-0.617
mented-0.615
 secretaries-0.599
Token flour
Token ID10601
Feature activation-5.23e+00

Pos logit contributions
Dialogue+0.659
ARI+0.658
mented+0.658
FTWARE+0.657
 secretaries+0.634
Neg logit contributions
 Crossing-0.668
zees-0.648
 Lama-0.609
eding-0.590
 Ski-0.582
Token due
Token ID2233
Feature activation-9.79e+00

Pos logit contributions
 Crossing+4.416
zees+4.306
 Lama+4.082
eding+3.981
 Ski+3.942
Neg logit contributions
Dialogue-3.246
ARI-3.227
FTWARE-3.223
mented-3.134
 secretaries-3.099
Token to
Token ID284
Feature activation-1.15e+01

Pos logit contributions
 Crossing+6.756
zees+6.578
 Lama+6.247
eding+6.114
 Ski+5.985
Neg logit contributions
Dialogue-7.307
FTWARE-7.227
ARI-7.135
 secretaries-6.967
mented-6.863
Token its
Token ID663
Feature activation-8.72e+00

Pos logit contributions
 Crossing+6.874
zees+6.700
 Lama+6.406
eding+6.051
 Trace+6.046
Neg logit contributions
Dialogue-9.419
FTWARE-9.261
ARI-9.230
mented-8.885
 secretaries-8.875
Token higher
Token ID2440
Feature activation-1.08e+01

Pos logit contributions
 Crossing+6.463
zees+6.242
 Lama+5.977
eding+5.822
 Trace+5.807
Neg logit contributions
Dialogue-6.219
FTWARE-6.167
ARI-6.085
mented-5.861
 secretaries-5.744
Token protein
Token ID7532
Feature activation-1.20e+00

Pos logit contributions
 Crossing+8.006
zees+7.487
 Lama+7.313
 Trace+7.202
eding+7.160
Neg logit contributions
Dialogue-7.655
FTWARE-7.503
ARI-7.330
mented-7.063
 secretaries-6.910
Token content
Token ID2695
Feature activation-1.93e+00

Pos logit contributions
 Crossing+1.058
zees+1.037
 Lama+0.983
eding+0.959
 Ski+0.945
Neg logit contributions
Dialogue-0.722
ARI-0.717
FTWARE-0.715
mented-0.708
 secretaries-0.688
Token.
Token ID13
Feature activation-4.75e+00

Pos logit contributions
 Crossing+1.472
zees+1.435
 Lama+1.350
eding+1.312
 Ski+1.291
Neg logit contributions
Dialogue-1.375
ARI-1.366
FTWARE-1.366
mented-1.349
 secretaries-1.318
Token Also
Token ID4418
Feature activation-7.77e-01

Pos logit contributions
 Crossing+3.357
zees+3.236
 Lama+3.056
eding+2.932
 Ski+2.916
Neg logit contributions
Dialogue-3.672
FTWARE-3.656
ARI-3.649
mented-3.603
 secretaries-3.547
Token has
Token ID468
Feature activation-4.45e+00

Pos logit contributions
 Crossing+0.639
zees+0.621
 Lama+0.587
eding+0.575
 Ski+0.563
Neg logit contributions
FTWARE-0.510
ARI-0.509
Dialogue-0.506
mented-0.501
 secretaries-0.489