TOP ACTIVATIONS
MAX = 1.615

I fully expect ' fair treatment by DNC in 2020 after
aligned god , it depicts a horrible scene of a child
I fully expect ' fair treatment by DNC in 2020 after
contain the instructions to create lambda functions using the archived file
:// le high val ley under ground . com / 2016
spirits . Be the first to hear about Nation Travels destinations
, a Castle move if ever there was one .
, as usual , with berries , but not too many
. However we would like to remind you once again that
I fully expect ' fair treatment by DNC in 2020 after
the F ED state ) accompl ishes all of these ,
but H err mann , ever the e got ist ,
of my 1993 book , Re claim ing the American Right
equality and fair treatment for all , Mr .
. It will also included the add t ional resources and
Books , 2000 ), my biography of the great libertarian thinker
file . file . A S 3 bucket is
< -- Stack pointer + -------------------- --+ + -------------------- --+ |
box , involving the entire palm opl ant ar surfaces .
day in Irish hospitals . Always , people said

MIN ACTIVATIONS
MIN = -1.837

Mumbai . A bench of justices H L D
s silly . Not much to grasp at here
in value by 87 per cent since the Beijing Games ,
was responsible . TABLE 5 Table 5
, at 32 %. A 32 % profit margin on $
. <|endoftext|> Image copyright Deadline Police have
. <|endoftext|> Image copyright publicity Image caption The Greeks
You have to wait 99 percent longer in the Bronx .
scrap . Article Continued Below The T
Serve hot ! <|endoftext|> Image copyright ESA Image caption Go ce
or am I fool ing myself ? All these
. Advertisement Continue reading the main story
million years . <|endoftext|> Image copyright AFP The Bank
What we 're learning so far isn 't terribly illuminating ,
dollars . Story continues below advertisement Story
s Et iquette , 18 77 . 15
Figure 3 . Figure 3 . Ass igning mass
s Et iquette , 18 77 . 17
property . The bench , however , was not
ported License . <|endoftext|> Image copyright AP Image caption Peace talks

INTERVAL 0.752 - 1.615
CONTAINS 1.037%

Bonnie & Baby 2008 01 09 O & A - 08
a fu selage about 5 ' wide . I had folded
to hear about Nation Travels destinations , and explore the world
80 ) 27 " 12 ms G 2 G
program . Tuesday morning 's Fox & Friends First will be

INTERVAL -0.111 - 0.752
CONTAINS 63.575%

keeps changing . C . K .: And then
refers to himself as ' Gu cc ifer 2 . 0
from Amazon , download it to your PC and save it
In March , we drew attention to a cable
then take a tiny bite of fabric on the side that

INTERVAL -0.974 - -0.111
CONTAINS 35.121%

be highly reliable .[ 12 ] Without a working
of The Flash , Super girl , and D . C
and massac red the defending champion Eagles 35 - 10 behind
Reply · Report Post <|endoftext|> The kids at South
Various sound tracks have been released featuring music composed by Russ

INTERVAL -1.837 - -0.974
CONTAINS 0.268%

but not necessarily so . Dis connect the mouse , serial
exemptions from federal , state and local governments , Good will
people dislike the taste so much , they are willing to
Press is reporting that AT & T , the defendant in
. But the challenge at present is that we don 't
TokenI
Token ID40
Feature activation+4.83e-01

Pos logit contributions
gencies+1.991
 Wra+1.815
 Observer+1.803
strom+1.790
racuse+1.778
Neg logit contributions
��-4.029
 Sabha-3.461
gallery-3.402
 corrid-3.393
agram-3.347
Token fully
Token ID3938
Feature activation-3.79e-02

Pos logit contributions
gencies+1.653
racuse+1.548
 Wra+1.446
 Observer+1.416
strom+1.373
Neg logit contributions
��-3.611
gallery-3.131
 Sabha-3.098
 corrid-3.078
agram-3.043
Token expect
Token ID1607
Feature activation+7.49e-01

Pos logit contributions
��+0.204
 corrid+0.172
gallery+0.170
 Sabha+0.170
agram+0.168
Neg logit contributions
gencies-0.207
racuse-0.195
 Wra-0.187
 Observer-0.186
strom-0.183
Token'
Token ID6
Feature activation+5.04e-01

Pos logit contributions
gencies+4.435
racuse+4.340
 Wra+4.235
 Observer+4.122
strom+4.105
Neg logit contributions
��-3.436
gallery-2.938
 Sabha-2.885
 corrid-2.856
agram-2.835
Token fair
Token ID3148
Feature activation+4.34e-01

Pos logit contributions
gencies+2.776
racuse+2.612
 Wra+2.555
 Observer+2.482
strom+2.430
Neg logit contributions
��-2.651
 corrid-2.275
gallery-2.259
agram-2.234
 Sabha-2.232
Token treatment
Token ID3513
Feature activation+1.61e+00

Pos logit contributions
gencies+2.089
racuse+1.951
 Wra+1.911
 Observer+1.831
strom+1.791
Neg logit contributions
��-2.587
gallery-2.249
agram-2.245
 Sabha-2.234
 corrid-2.208
Token by
Token ID416
Feature activation+1.13e+00

Pos logit contributions
 Observer+5.816
gencies+5.560
 Wra+5.400
 Offensive+5.199
strom+5.140
Neg logit contributions
��-10.945
gallery-10.095
 corrid-9.740
agram-9.691
 Sabha-8.966
Token DNC
Token ID17824
Feature activation+4.05e-01

Pos logit contributions
 Wra+4.142
 Observer+4.097
 Offensive+3.926
gencies+3.848
 Threat+3.789
Neg logit contributions
��-7.877
gallery-6.894
agram-6.546
 corrid-6.332
xual-6.274
Token in
Token ID287
Feature activation+4.29e-01

Pos logit contributions
gencies+2.035
racuse+1.897
 Wra+1.801
 Observer+1.721
strom+1.694
Neg logit contributions
��-2.346
agram-2.046
gallery-2.040
 Sabha-2.032
Pinterest-2.017
Token 2020
Token ID12131
Feature activation+2.67e-01

Pos logit contributions
gencies+2.371
racuse+2.228
 Wra+2.176
 Observer+2.140
strom+2.007
Neg logit contributions
��-2.352
agram-1.911
 Sabha-1.893
gallery-1.893
 corrid-1.844
Token after
Token ID706
Feature activation+2.12e-01

Pos logit contributions
gencies+1.360
racuse+1.261
 Wra+1.168
emort+1.116
 Observer+1.101
Neg logit contributions
��-1.532
agram-1.368
 Sabha-1.366
Pinterest-1.345
gallery-1.341
Token aligned
Token ID19874
Feature activation+5.09e-01

Pos logit contributions
��+2.086
gallery+1.727
Pinterest+1.717
 corrid+1.716
agram+1.710
Neg logit contributions
gencies-2.116
racuse-2.000
 Wra-1.946
 Observer-1.875
strom-1.824
Token god
Token ID5770
Feature activation+1.05e+00

Pos logit contributions
gencies+2.251
racuse+2.089
 Wra+2.077
 Observer+1.996
strom+1.931
Neg logit contributions
��-3.378
 corrid-2.912
 Sabha-2.737
gallery-2.719
Pinterest-2.707
Token,
Token ID11
Feature activation-2.47e-01

Pos logit contributions
gencies+3.380
 Wra+3.295
 Observer+3.208
racuse+3.177
strom+3.166
Neg logit contributions
��-8.189
 corrid-7.824
 entreprene-6.992
�士-6.647
��極-6.605
Token it
Token ID340
Feature activation+6.04e-01

Pos logit contributions
��+1.330
 corrid+1.144
 Sabha+1.103
gallery+1.098
agram+1.097
Neg logit contributions
gencies-1.369
racuse-1.286
 Wra-1.252
 Observer-1.207
strom-1.186
Token depicts
Token ID28539
Feature activation+1.73e-01

Pos logit contributions
gencies+2.492
 Wra+2.440
racuse+2.364
 Observer+2.242
strom+2.168
Neg logit contributions
��-3.984
 corrid-3.407
 Sabha-3.407
ال-3.270
Pinterest-3.215
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
gencies+0.849
racuse+0.808
 Wra+0.805
 Observer+0.782
strom+0.748
Neg logit contributions
��-1.053
 corrid-0.945
 Sabha-0.843
agram-0.833
Pinterest-0.831
Token horrible
Token ID12361
Feature activation+3.57e-01

Pos logit contributions
gencies+4.075
 Wra+3.770
 Deadly+3.382
racuse+3.304
 nont+3.295
Neg logit contributions
��-12.934
��極-10.972
 corrid-10.942
Pinterest-10.906
 Sabha-10.683
Token scene
Token ID3715
Feature activation-4.43e-02

Pos logit contributions
gencies+1.965
racuse+1.878
 Wra+1.863
 Observer+1.832
strom+1.805
Neg logit contributions
��-1.906
 corrid-1.577
Pinterest-1.510
 Sabha-1.494
agram-1.472
Token of
Token ID286
Feature activation-6.70e-01

Pos logit contributions
��+0.296
 corrid+0.267
 Sabha+0.250
agram+0.249
gallery+0.246
Neg logit contributions
gencies-0.191
racuse-0.177
 Wra-0.173
 Observer-0.166
strom-0.162
Token a
Token ID257
Feature activation+2.00e-01

Pos logit contributions
��+3.225
agram+2.781
gallery+2.772
 Sabha+2.752
Pinterest+2.731
Neg logit contributions
gencies-3.982
racuse-3.743
 Wra-3.568
 Observer-3.428
emort-3.405
Token child
Token ID1200
Feature activation-1.01e-01

Pos logit contributions
gencies+0.634
racuse+0.570
 Wra+0.557
 Observer+0.528
strom+0.507
Neg logit contributions
��-1.586
 corrid-1.387
 Sabha-1.344
agram-1.342
gallery-1.339
TokenI
Token ID40
Feature activation+4.28e-01

Pos logit contributions
gencies+1.554
 Wra+1.423
racuse+1.416
 Observer+1.393
strom+1.376
Neg logit contributions
��-2.975
 Sabha-2.550
agram-2.539
gallery-2.537
 corrid-2.527
Token fully
Token ID3938
Feature activation-5.96e-02

Pos logit contributions
gencies+1.515
racuse+1.414
 Wra+1.314
 Observer+1.285
strom+1.245
Neg logit contributions
��-3.126
gallery-2.729
 Sabha-2.691
 corrid-2.686
agram-2.671
Token expect
Token ID1607
Feature activation+7.19e-01

Pos logit contributions
��+0.299
 corrid+0.247
gallery+0.246
 Sabha+0.245
agram+0.242
Neg logit contributions
gencies-0.347
racuse-0.328
 Wra-0.315
 Observer-0.313
strom-0.309
Token'
Token ID6
Feature activation+4.97e-01

Pos logit contributions
gencies+4.414
racuse+4.296
 Wra+4.159
 Observer+4.066
strom+4.055
Neg logit contributions
��-3.203
gallery-2.687
agram-2.629
 Sabha-2.628
 corrid-2.622
Token fair
Token ID3148
Feature activation+3.46e-01

Pos logit contributions
gencies+2.813
racuse+2.653
 Wra+2.594
 Observer+2.523
strom+2.480
Neg logit contributions
��-2.532
 corrid-2.153
gallery-2.142
agram-2.122
 Sabha-2.119
Token treatment
Token ID3513
Feature activation+1.57e+00

Pos logit contributions
gencies+1.654
racuse+1.545
 Wra+1.510
 Observer+1.446
strom+1.418
Neg logit contributions
��-2.076
gallery-1.800
agram-1.799
 Sabha-1.785
 corrid-1.770
Token by
Token ID416
Feature activation+9.95e-01

Pos logit contributions
 Observer+5.460
gencies+5.127
 Wra+5.024
 Offensive+4.945
strom+4.802
Neg logit contributions
��-10.918
gallery-10.105
agram-9.724
 corrid-9.661
 Sabha-9.059
Token DNC
Token ID17824
Feature activation+3.58e-01

Pos logit contributions
 Wra+3.449
 Observer+3.371
gencies+3.255
 Offensive+3.180
strom+3.143
Neg logit contributions
��-7.044
gallery-6.308
agram-6.004
 corrid-5.743
 Sabha-5.658
Token in
Token ID287
Feature activation+3.52e-01

Pos logit contributions
gencies+1.816
racuse+1.691
 Wra+1.601
 Observer+1.531
strom+1.507
Neg logit contributions
��-2.057
agram-1.795
gallery-1.790
 Sabha-1.784
Pinterest-1.771
Token 2020
Token ID12131
Feature activation+2.83e-01

Pos logit contributions
gencies+1.930
racuse+1.823
 Wra+1.757
 Observer+1.722
strom+1.632
Neg logit contributions
��-1.921
agram-1.586
gallery-1.582
 Sabha-1.566
 corrid-1.531
Token after
Token ID706
Feature activation+1.64e-01

Pos logit contributions
gencies+1.542
racuse+1.431
 Wra+1.327
emort+1.281
 Observer+1.248
Neg logit contributions
��-1.525
 Sabha-1.368
agram-1.357
Pinterest-1.343
gallery-1.326
Token contain
Token ID3994
Feature activation+6.31e-01

Pos logit contributions
gencies+2.417
racuse+2.255
 Wra+2.192
strom+2.067
 Observer+2.058
Neg logit contributions
��-3.332
 corrid-3.019
 Sabha-2.771
Pinterest-2.733
gallery-2.715
Token the
Token ID262
Feature activation+9.73e-01

Pos logit contributions
gencies+2.562
 Wra+2.377
racuse+2.309
 Observer+2.303
strom+2.176
Neg logit contributions
��-4.403
 corrid-3.952
 Sabha-3.617
gallery-3.597
Pinterest-3.578
Token instructions
Token ID7729
Feature activation+2.58e-01

Pos logit contributions
gencies+5.514
 Wra+5.159
 Observer+5.013
racuse+4.900
strom+4.781
Neg logit contributions
��-5.450
 corrid-4.371
gallery-3.922
Pinterest-3.811
 Sabha-3.795
Token to
Token ID284
Feature activation-6.72e-02

Pos logit contributions
gencies+1.058
racuse+0.960
 Wra+0.955
 Observer+0.912
strom+0.885
Neg logit contributions
��-1.783
 corrid-1.625
 Sabha-1.506
agram-1.498
gallery-1.495
Token create
Token ID2251
Feature activation+9.55e-01

Pos logit contributions
��+0.371
 corrid+0.313
agram+0.306
gallery+0.305
 Sabha+0.300
Neg logit contributions
gencies-0.372
racuse-0.346
 Wra-0.336
 Observer-0.322
strom-0.319
Token lambda
Token ID37456
Feature activation+1.57e+00

Pos logit contributions
gencies+3.597
racuse+3.248
 Wra+3.129
 Observer+2.900
strom+2.852
Neg logit contributions
��-6.741
Pinterest-5.846
gallery-5.822
agram-5.821
 Sabha-5.817
Token functions
Token ID5499
Feature activation+8.58e-03

Pos logit contributions
gencies+5.145
 Wra+4.563
 Calls+4.114
 Observer+4.014
racuse+3.892
Neg logit contributions
��-12.396
 corrid-10.602
gallery-9.896
��極-9.890
Pinterest-9.756
Token using
Token ID1262
Feature activation+3.08e-01

Pos logit contributions
gencies+0.052
racuse+0.050
 Wra+0.048
 Observer+0.047
strom+0.046
Neg logit contributions
��-0.042
 corrid-0.036
gallery-0.034
agram-0.033
 Sabha-0.033
Token the
Token ID262
Feature activation+7.52e-01

Pos logit contributions
gencies+1.363
racuse+1.280
 Wra+1.267
 Observer+1.211
strom+1.175
Neg logit contributions
��-2.006
 corrid-1.797
gallery-1.680
 Sabha-1.664
Pinterest-1.660
Token archived
Token ID33962
Feature activation+1.31e-01

Pos logit contributions
gencies+3.928
racuse+3.747
 Wra+3.689
 Observer+3.553
emort+3.486
Neg logit contributions
��-4.130
Pinterest-3.366
 corrid-3.346
gallery-3.339
 Sabha-3.338
Token file
Token ID2393
Feature activation-3.97e-01

Pos logit contributions
gencies+0.585
racuse+0.542
 Wra+0.532
 Observer+0.510
strom+0.495
Neg logit contributions
��-0.848
 corrid-0.741
 Sabha-0.714
agram-0.709
gallery-0.707
Token://
Token ID1378
Feature activation-1.66e-01

Pos logit contributions
gencies+0.859
racuse+0.800
 Wra+0.778
strom+0.751
 Observer+0.738
Neg logit contributions
��-1.106
 corrid-1.006
 Sabha-0.920
 imagination-0.903
agram-0.899
Tokenle
Token ID293
Feature activation-9.31e-02

Pos logit contributions
��+0.886
agram+0.761
gallery+0.758
 Sabha+0.756
 corrid+0.751
Neg logit contributions
gencies-0.910
racuse-0.854
 Wra-0.815
 Observer-0.782
strom-0.772
Tokenhigh
Token ID8929
Feature activation-1.94e-02

Pos logit contributions
��+0.543
agram+0.482
gallery+0.481
 Sabha+0.476
Pinterest+0.472
Neg logit contributions
gencies-0.458
racuse-0.425
 Wra-0.406
 Observer-0.387
strom-0.378
Tokenval
Token ID2100
Feature activation+3.57e-01

Pos logit contributions
��+0.110
agram+0.094
 Sabha+0.094
 corrid+0.094
gallery+0.094
Neg logit contributions
gencies-0.101
racuse-0.094
 Wra-0.090
 Observer-0.086
strom-0.085
Tokenley
Token ID1636
Feature activation+6.27e-01

Pos logit contributions
gencies+1.965
racuse+1.844
 Wra+1.754
 Observer+1.694
strom+1.685
Neg logit contributions
��-1.957
 corrid-1.646
 Sabha-1.632
agram-1.622
gallery-1.610
Tokenunder
Token ID4625
Feature activation+1.56e+00

Pos logit contributions
gencies+2.960
racuse+2.744
 Wra+2.550
strom+2.494
 Observer+2.484
Neg logit contributions
��-3.866
 corrid-3.336
 Sabha-3.301
Pinterest-3.241
 imagination-3.222
Tokenground
Token ID2833
Feature activation+4.19e-01

Pos logit contributions
gencies+6.272
racuse+5.391
 Wra+5.191
strom+5.185
emort+4.787
Neg logit contributions
��-10.150
 corrid-9.144
 Sabha-8.879
ال-8.570
Pinterest-8.567
Token.
Token ID13
Feature activation+4.62e-01

Pos logit contributions
gencies+1.960
racuse+1.821
 Wra+1.723
 Observer+1.639
strom+1.610
Neg logit contributions
��-2.571
agram-2.263
gallery-2.258
 Sabha-2.243
Pinterest-2.230
Tokencom
Token ID785
Feature activation+4.88e-03

Pos logit contributions
gencies+2.332
racuse+2.147
 Wra+2.069
 Observer+1.976
strom+1.970
Neg logit contributions
��-2.646
 corrid-2.311
 Sabha-2.310
agram-2.235
Pinterest-2.234
Token/
Token ID14
Feature activation+1.89e-01

Pos logit contributions
gencies+0.027
racuse+0.025
 Wra+0.024
 Observer+0.023
emort+0.023
Neg logit contributions
��-0.026
gallery-0.022
agram-0.022
Pinterest-0.022
 corrid-0.022
Token2016
Token ID5304
Feature activation+6.22e-02

Pos logit contributions
gencies+0.984
racuse+0.907
 Wra+0.881
strom+0.840
 Observer+0.837
Neg logit contributions
��-1.064
 corrid-0.907
 Sabha-0.905
agram-0.901
Pinterest-0.889
Token spirits
Token ID13747
Feature activation-3.18e-02

Pos logit contributions
gencies+0.484
racuse+0.448
 Wra+0.431
strom+0.403
 Observer+0.402
Neg logit contributions
��-0.733
agram-0.634
 corrid-0.633
gallery-0.626
 Sabha-0.626
Token.
Token ID13
Feature activation+7.70e-01

Pos logit contributions
��+0.207
 corrid+0.184
agram+0.181
 Sabha+0.180
gallery+0.179
Neg logit contributions
gencies-0.138
racuse-0.128
 Wra-0.121
 Observer-0.115
strom-0.113
Token Be
Token ID1355
Feature activation+2.83e-01

Pos logit contributions
gencies+2.009
 Wra+1.803
racuse+1.707
 Observer+1.555
 Contact+1.485
Neg logit contributions
��-6.431
 corrid-5.941
agram-5.593
Pinterest-5.522
gallery-5.492
Token the
Token ID262
Feature activation+3.33e-01

Pos logit contributions
gencies+1.361
racuse+1.270
 Wra+1.243
 Observer+1.184
strom+1.138
Neg logit contributions
��-1.713
 corrid-1.488
 Sabha-1.476
agram-1.457
gallery-1.455
Token first
Token ID717
Feature activation+7.36e-01

Pos logit contributions
gencies+1.706
racuse+1.614
 Wra+1.610
 Observer+1.604
strom+1.430
Neg logit contributions
��-1.938
 corrid-1.731
gallery-1.589
 Sabha-1.585
agram-1.570
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
gencies+2.561
racuse+2.492
 Wra+2.436
 Observer+2.391
strom+2.254
Neg logit contributions
��-5.322
 corrid-4.992
 entreprene-4.441
 Sabha-4.437
agram-4.425
Token hear
Token ID3285
Feature activation+6.75e-01

Pos logit contributions
gencies+6.618
racuse+6.250
 Contact+5.715
 Observer+5.657
 Wra+5.464
Neg logit contributions
��-9.401
 corrid-8.812
��極-8.278
Pinterest-8.126
 entreprene-8.034
Token about
Token ID546
Feature activation+1.30e+00

Pos logit contributions
gencies+3.401
racuse+3.279
 Wra+3.175
 Observer+3.111
strom+3.094
Neg logit contributions
��-3.995
 corrid-3.738
��極-3.150
 entreprene-3.148
 Sabha-3.140
Token Nation
Token ID8741
Feature activation+1.24e+00

Pos logit contributions
gencies+2.778
 Wra+2.685
racuse+2.598
 Observer+2.455
 Threat+2.211
Neg logit contributions
��-11.269
 corrid-10.180
��極-9.537
Pinterest-9.489
�士-9.289
Token Travels
Token ID48899
Feature activation+2.62e-02

Pos logit contributions
gencies+3.378
racuse+3.021
 Wra+2.874
 Observer+2.651
strom+2.403
Neg logit contributions
��-10.027
 corrid-8.987
agram-8.524
ال-8.451
 imagination-8.415
Token destinations
Token ID23982
Feature activation+1.10e+00

Pos logit contributions
gencies+0.107
 Wra+0.097
 Observer+0.096
racuse+0.095
strom+0.090
Neg logit contributions
��-0.182
 corrid-0.160
 Sabha-0.157
agram-0.154
gallery-0.152
Token,
Token ID11
Feature activation-6.24e-02

Pos logit contributions
gencies+1.312
racuse+1.192
 Wra+1.167
 Observer+1.116
strom+1.115
Neg logit contributions
��-1.742
Pinterest-1.511
agram-1.502
 Sabha-1.492
 corrid-1.488
Token a
Token ID257
Feature activation+4.40e-01

Pos logit contributions
��+0.345
agram+0.296
gallery+0.294
 Sabha+0.294
 corrid+0.293
Neg logit contributions
gencies-0.332
racuse-0.310
 Wra-0.299
 Observer-0.286
strom-0.282
Token Castle
Token ID11312
Feature activation+4.12e-01

Pos logit contributions
gencies+2.207
racuse+2.045
 Wra+1.984
 Observer+1.931
strom+1.887
Neg logit contributions
��-2.541
gallery-2.176
agram-2.163
 Sabha-2.154
Pinterest-2.140
Token move
Token ID1445
Feature activation+7.70e-01

Pos logit contributions
gencies+2.158
racuse+2.015
 Wra+1.944
 Observer+1.906
strom+1.867
Neg logit contributions
��-2.295
 Sabha-1.927
gallery-1.927
Pinterest-1.902
 corrid-1.899
Token if
Token ID611
Feature activation-1.13e-01

Pos logit contributions
gencies+3.603
 Wra+3.363
racuse+3.340
 Observer+3.298
strom+3.151
Neg logit contributions
��-4.473
gallery-3.857
 corrid-3.856
 Sabha-3.837
agram-3.815
Token ever
Token ID1683
Feature activation+1.52e+00

Pos logit contributions
��+0.615
agram+0.530
gallery+0.529
 Sabha+0.526
 corrid+0.523
Neg logit contributions
gencies-0.611
racuse-0.574
 Wra-0.548
 Observer-0.527
strom-0.518
Token there
Token ID612
Feature activation+9.74e-01

Pos logit contributions
gencies+6.406
racuse+6.006
 Wra+5.999
 Observer+5.970
strom+5.726
Neg logit contributions
��-8.862
gallery-7.668
 corrid-7.516
agram-7.486
Pinterest-7.330
Token was
Token ID373
Feature activation+6.30e-01

Pos logit contributions
gencies+4.409
racuse+4.157
 Wra+3.979
 Observer+3.782
strom+3.771
Neg logit contributions
��-5.879
 corrid-5.230
gallery-5.102
agram-5.100
Pinterest-5.078
Token one
Token ID530
Feature activation+1.30e+00

Pos logit contributions
gencies+3.066
racuse+2.881
 Wra+2.813
 Observer+2.772
strom+2.647
Neg logit contributions
��-3.634
 corrid-3.079
gallery-3.064
agram-3.062
Pinterest-3.052
Token.
Token ID13
Feature activation-2.66e-02

Pos logit contributions
gencies+4.750
racuse+4.337
 Wra+4.312
 Observer+4.159
strom+4.005
Neg logit contributions
��-8.635
 corrid-7.745
gallery-7.591
 Sabha-7.534
agram-7.496
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
��+0.164
 corrid+0.144
agram+0.142
 Sabha+0.141
gallery+0.140
Neg logit contributions
gencies-0.123
racuse-0.115
 Wra-0.113
 Observer-0.108
strom-0.104
Token,
Token ID11
Feature activation+6.64e-01

Pos logit contributions
gencies+1.137
racuse+1.072
 Wra+1.027
strom+0.987
 Observer+0.980
Neg logit contributions
��-1.161
 corrid-0.989
 Sabha-0.964
agram-0.959
gallery-0.945
Token as
Token ID355
Feature activation+2.69e-01

Pos logit contributions
gencies+2.612
racuse+2.483
 Wra+2.417
 Observer+2.321
strom+2.270
Neg logit contributions
��-4.444
 corrid-4.037
gallery-3.811
Pinterest-3.783
 Sabha-3.761
Token usual
Token ID6678
Feature activation+5.81e-01

Pos logit contributions
gencies+1.204
racuse+1.095
 Wra+1.077
 Observer+1.037
strom+1.008
Neg logit contributions
��-1.741
 corrid-1.552
agram-1.470
Pinterest-1.467
 Sabha-1.455
Token,
Token ID11
Feature activation+1.61e-01

Pos logit contributions
gencies+2.279
 Wra+2.211
racuse+2.145
 Observer+2.104
strom+2.057
Neg logit contributions
��-4.121
 corrid-3.756
 entreprene-3.310
agram-3.297
gallery-3.292
Token with
Token ID351
Feature activation+2.85e-01

Pos logit contributions
gencies+0.616
racuse+0.572
 Wra+0.560
 Observer+0.526
strom+0.517
Neg logit contributions
��-1.144
 corrid-1.028
agram-0.984
gallery-0.978
 Sabha-0.976
Token berries
Token ID36322
Feature activation+1.52e+00

Pos logit contributions
gencies+1.248
racuse+1.147
 Wra+1.098
 Observer+1.050
strom+1.013
Neg logit contributions
��-1.887
 corrid-1.657
agram-1.617
gallery-1.608
 Sabha-1.603
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
 Wra+4.442
 Observer+4.330
gencies+4.193
strom+4.106
racuse+3.979
Neg logit contributions
��-12.085
 corrid-11.684
 entreprene-10.452
-9.929
�士-9.883
Token but
Token ID475
Feature activation+4.24e-01

Pos logit contributions
gencies+2.850
 Wra+2.631
 Observer+2.573
racuse+2.420
strom+2.412
Neg logit contributions
��-9.565
 corrid-9.024
 entreprene-8.270
�士-8.110
 Sabha-8.109
Token not
Token ID407
Feature activation+2.40e-01

Pos logit contributions
gencies+2.237
racuse+2.057
 Wra+2.048
 Observer+1.988
strom+1.942
Neg logit contributions
��-2.426
 corrid-2.135
gallery-1.943
agram-1.935
 Sabha-1.928
Token too
Token ID1165
Feature activation+3.53e-01

Pos logit contributions
gencies+1.357
racuse+1.272
 Wra+1.269
strom+1.218
 Observer+1.217
Neg logit contributions
��-1.287
 corrid-1.110
agram-1.016
gallery-1.005
 Sabha-1.001
Token many
Token ID867
Feature activation+7.50e-01

Pos logit contributions
gencies+1.631
racuse+1.497
 Wra+1.493
strom+1.403
 Observer+1.393
Neg logit contributions
��-2.268
 corrid-1.975
agram-1.886
gallery-1.854
Pinterest-1.844
Token.
Token ID13
Feature activation+7.73e-01

Pos logit contributions
gencies+1.272
racuse+1.160
 Wra+1.111
 Observer+1.058
strom+1.048
Neg logit contributions
��-1.749
 corrid-1.511
 Sabha-1.509
agram-1.505
gallery-1.491
Token However
Token ID2102
Feature activation+8.78e-01

Pos logit contributions
gencies+2.367
 Wra+2.041
racuse+1.958
 Observer+1.906
 Offensive+1.747
Neg logit contributions
��-6.020
 Sabha-5.343
 corrid-5.332
gallery-5.270
agram-5.253
Token we
Token ID356
Feature activation+8.89e-01

Pos logit contributions
gencies+3.750
 Wra+3.482
 Observer+3.349
racuse+3.298
strom+3.264
Neg logit contributions
��-5.811
 corrid-5.243
gallery-4.802
Pinterest-4.767
agram-4.724
Token would
Token ID561
Feature activation+8.18e-01

Pos logit contributions
gencies+2.758
 Wra+2.379
racuse+2.256
 Observer+2.223
strom+2.075
Neg logit contributions
��-6.949
 corrid-6.174
gallery-5.945
Pinterest-5.937
 Sabha-5.892
Token like
Token ID588
Feature activation+6.06e-01

Pos logit contributions
gencies+3.176
 Wra+2.740
racuse+2.634
 Observer+2.565
strom+2.496
Neg logit contributions
��-5.602
 corrid-5.414
Pinterest-4.845
 Sabha-4.830
agram-4.797
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
gencies+2.308
racuse+2.061
 Wra+2.052
 Observer+1.966
strom+1.898
Neg logit contributions
��-4.292
 corrid-4.026
agram-3.661
gallery-3.613
 Sabha-3.584
Token remind
Token ID7101
Feature activation+5.51e-01

Pos logit contributions
gencies+6.138
racuse+4.918
 Wra+4.760
 Observer+4.608
emort+4.449
Neg logit contributions
��-9.831
 corrid-9.126
ال-8.571
 entreprene-8.552
agram-8.550
Token you
Token ID345
Feature activation+5.19e-01

Pos logit contributions
gencies+2.241
racuse+1.925
 Wra+1.924
 Observer+1.835
strom+1.769
Neg logit contributions
��-3.813
 corrid-3.527
agram-3.250
ال-3.212
gallery-3.191
Token once
Token ID1752
Feature activation+2.14e-01

Pos logit contributions
gencies+1.935
racuse+1.752
 Wra+1.708
strom+1.657
 Observer+1.629
Neg logit contributions
��-3.754
 corrid-3.374
gallery-3.176
agram-3.144
 Sabha-3.139
Token again
Token ID757
Feature activation-1.48e-01

Pos logit contributions
gencies+1.033
racuse+0.951
 Wra+0.906
strom+0.895
 Observer+0.882
Neg logit contributions
��-1.325
 corrid-1.179
agram-1.109
ال-1.099
Pinterest-1.094
Token that
Token ID326
Feature activation-7.47e-02

Pos logit contributions
��+0.875
 corrid+0.772
agram+0.734
gallery+0.734
 Sabha+0.729
Neg logit contributions
gencies-0.748
racuse-0.698
 Wra-0.669
 Observer-0.647
strom-0.637
TokenI
Token ID40
Feature activation+2.82e-01

Pos logit contributions
gencies+1.495
racuse+1.350
 Wra+1.332
 Observer+1.317
strom+1.296
Neg logit contributions
��-2.588
 Sabha-2.233
 corrid-2.199
gallery-2.180
agram-2.166
Token fully
Token ID3938
Feature activation-1.24e-01

Pos logit contributions
gencies+1.051
racuse+0.961
 Wra+0.911
 Observer+0.886
strom+0.858
Neg logit contributions
��-2.025
gallery-1.763
 Sabha-1.744
 corrid-1.737
agram-1.734
Token expect
Token ID1607
Feature activation+6.39e-01

Pos logit contributions
��+0.610
 corrid+0.507
gallery+0.504
 Sabha+0.501
agram+0.498
Neg logit contributions
gencies-0.738
racuse-0.696
 Wra-0.669
 Observer-0.662
strom-0.653
Token'
Token ID6
Feature activation+4.96e-01

Pos logit contributions
gencies+3.916
racuse+3.793
 Wra+3.677
 Observer+3.588
strom+3.564
Neg logit contributions
��-2.854
gallery-2.414
 corrid-2.389
 Sabha-2.381
agram-2.371
Token fair
Token ID3148
Feature activation+3.82e-01

Pos logit contributions
gencies+2.827
racuse+2.660
 Wra+2.619
 Observer+2.553
strom+2.506
Neg logit contributions
��-2.494
 corrid-2.135
gallery-2.112
 Sabha-2.092
agram-2.084
Token treatment
Token ID3513
Feature activation+1.51e+00

Pos logit contributions
gencies+1.699
racuse+1.590
 Wra+1.569
 Observer+1.500
strom+1.465
Neg logit contributions
��-2.406
gallery-2.107
agram-2.101
 Sabha-2.089
 corrid-2.064
Token by
Token ID416
Feature activation+1.04e+00

Pos logit contributions
 Observer+5.192
gencies+4.863
 Wra+4.849
 Offensive+4.695
strom+4.651
Neg logit contributions
��-10.471
gallery-9.659
 corrid-9.425
agram-9.338
 Sabha-8.732
Token DNC
Token ID17824
Feature activation+3.73e-01

Pos logit contributions
 Wra+3.891
 Observer+3.850
gencies+3.699
 Offensive+3.679
strom+3.585
Neg logit contributions
��-7.171
gallery-6.201
agram-5.942
 corrid-5.813
xual-5.622
Token in
Token ID287
Feature activation+3.26e-01

Pos logit contributions
gencies+1.864
racuse+1.741
 Wra+1.659
 Observer+1.583
strom+1.560
Neg logit contributions
��-2.165
agram-1.885
gallery-1.878
 Sabha-1.870
 corrid-1.860
Token 2020
Token ID12131
Feature activation+2.22e-01

Pos logit contributions
gencies+1.753
racuse+1.645
 Wra+1.606
 Observer+1.581
 Offensive+1.488
Neg logit contributions
��-1.836
agram-1.499
gallery-1.493
 Sabha-1.482
 corrid-1.458
Token after
Token ID706
Feature activation+1.39e-01

Pos logit contributions
gencies+1.097
racuse+1.014
 Wra+0.944
emort+0.897
 Observer+0.890
Neg logit contributions
��-1.307
agram-1.167
 Sabha-1.162
gallery-1.148
Pinterest-1.148
Token the
Token ID262
Feature activation+1.78e-01

Pos logit contributions
gencies+0.947
racuse+0.885
 Wra+0.866
 Observer+0.830
strom+0.817
Neg logit contributions
��-0.937
gallery-0.796
agram-0.792
 corrid-0.783
Pinterest-0.776
Token F
Token ID376
Feature activation+8.08e-02

Pos logit contributions
gencies+0.901
racuse+0.835
 Wra+0.811
 Observer+0.780
strom+0.759
Neg logit contributions
��-1.025
agram-0.886
gallery-0.886
 corrid-0.874
Pinterest-0.873
TokenED
Token ID1961
Feature activation-2.51e-01

Pos logit contributions
gencies+0.503
racuse+0.477
 Wra+0.459
 Observer+0.444
strom+0.440
Neg logit contributions
��-0.373
gallery-0.308
agram-0.307
 Sabha-0.306
 corrid-0.302
Token state
Token ID1181
Feature activation+5.15e-01

Pos logit contributions
��+1.203
agram+1.020
gallery+1.013
 Sabha+1.013
 corrid+1.003
Neg logit contributions
gencies-1.506
racuse-1.422
 Wra-1.362
 Observer-1.311
strom-1.295
Token)
Token ID8
Feature activation-1.87e-01

Pos logit contributions
gencies+2.328
racuse+2.155
 Wra+2.086
 Observer+1.989
strom+1.972
Neg logit contributions
��-3.215
gallery-2.786
 corrid-2.781
agram-2.766
Pinterest-2.751
Token accompl
Token ID6424
Feature activation+1.49e+00

Pos logit contributions
��+1.054
agram+0.912
gallery+0.910
 Sabha+0.905
 corrid+0.901
Neg logit contributions
gencies-0.970
racuse-0.906
 Wra-0.865
 Observer-0.828
strom-0.816
Tokenishes
Token ID5614
Feature activation+4.85e-01

Pos logit contributions
gencies+4.029
 Wra+2.992
racuse+2.926
strom+2.792
 Observer+2.760
Neg logit contributions
��-12.134
 corrid-11.144
gallery-10.610
 Sabha-10.454
 imagination-10.296
Token all
Token ID477
Feature activation+1.06e+00

Pos logit contributions
gencies+2.532
racuse+2.323
 Wra+2.298
 Observer+2.203
strom+2.158
Neg logit contributions
��-2.727
gallery-2.330
agram-2.311
 corrid-2.289
Pinterest-2.283
Token of
Token ID286
Feature activation-1.11e-01

Pos logit contributions
gencies+4.544
racuse+4.079
 Wra+4.078
 Observer+3.837
strom+3.780
Neg logit contributions
��-6.666
gallery-5.882
agram-5.807
Pinterest-5.736
 Sabha-5.720
Token these
Token ID777
Feature activation+2.93e-01

Pos logit contributions
��+0.639
agram+0.556
gallery+0.554
 Sabha+0.552
 corrid+0.548
Neg logit contributions
gencies-0.558
racuse-0.520
 Wra-0.496
 Observer-0.473
strom-0.466
Token,
Token ID11
Feature activation+2.92e-01

Pos logit contributions
gencies+1.271
racuse+1.131
 Wra+1.114
 Observer+1.058
strom+1.024
Neg logit contributions
��-1.934
gallery-1.688
agram-1.676
 corrid-1.667
Pinterest-1.655
Token but
Token ID475
Feature activation+2.98e-01

Pos logit contributions
��+1.887
agram+1.637
gallery+1.632
 Sabha+1.625
 corrid+1.614
Neg logit contributions
gencies-1.742
racuse-1.628
 Wra-1.550
 Observer-1.483
strom-1.462
TokenH
Token ID39
Feature activation+4.69e-01

Pos logit contributions
gencies+1.547
racuse+1.446
 Wra+1.401
 Observer+1.353
strom+1.331
Neg logit contributions
��-1.666
 corrid-1.420
 Sabha-1.417
agram-1.408
gallery-1.402
Tokenerr
Token ID8056
Feature activation+3.58e-02

Pos logit contributions
gencies+1.947
racuse+1.736
 Wra+1.675
strom+1.640
 Observer+1.619
Neg logit contributions
��-3.190
 corrid-2.761
 Sabha-2.759
gallery-2.675
Pinterest-2.673
Tokenmann
Token ID9038
Feature activation+5.02e-01

Pos logit contributions
gencies+0.169
racuse+0.157
 Wra+0.149
strom+0.148
 Observer+0.141
Neg logit contributions
��-0.225
 corrid-0.200
 Sabha-0.190
Pinterest-0.187
gallery-0.186
Token,
Token ID11
Feature activation+3.66e-01

Pos logit contributions
gencies+2.442
racuse+2.269
 Wra+2.188
strom+2.127
emort+2.078
Neg logit contributions
��-2.980
 Sabha-2.554
 corrid-2.548
Pinterest-2.490
agram-2.483
Token ever
Token ID1683
Feature activation+1.46e+00

Pos logit contributions
gencies+1.893
racuse+1.777
 Wra+1.715
 Observer+1.653
strom+1.635
Neg logit contributions
��-2.043
 Sabha-1.757
 corrid-1.744
agram-1.737
gallery-1.725
Token the
Token ID262
Feature activation+1.32e-01

Pos logit contributions
gencies+6.647
racuse+6.266
 Wra+6.234
 Observer+6.002
strom+5.910
Neg logit contributions
��-8.289
agram-6.960
 Sabha-6.946
gallery-6.938
 corrid-6.858
Token e
Token ID304
Feature activation+1.33e-01

Pos logit contributions
gencies+0.724
racuse+0.675
 Wra+0.653
 Observer+0.633
strom+0.615
Neg logit contributions
��-0.704
agram-0.595
gallery-0.594
 Sabha-0.593
Pinterest-0.587
Tokengot
Token ID23442
Feature activation-3.35e-01

Pos logit contributions
gencies+0.717
racuse+0.667
 Wra+0.634
strom+0.611
 Observer+0.611
Neg logit contributions
��-0.734
 corrid-0.628
 Sabha-0.623
Pinterest-0.614
gallery-0.613
Tokenist
Token ID396
Feature activation+3.70e-01

Pos logit contributions
��+1.189
agram+0.954
gallery+0.940
 Sabha+0.930
Pinterest+0.917
Neg logit contributions
gencies-2.423
racuse-2.316
 Wra-2.232
 Observer-2.164
strom-2.137
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
gencies+1.817
racuse+1.683
 Wra+1.614
 Observer+1.566
strom+1.543
Neg logit contributions
��-2.194
 Sabha-1.879
gallery-1.863
agram-1.860
 corrid-1.858
Token of
Token ID286
Feature activation+1.24e+00

Pos logit contributions
gencies+2.993
racuse+2.784
 Wra+2.642
 Observer+2.522
strom+2.479
Neg logit contributions
��-3.714
agram-3.248
gallery-3.240
 Sabha-3.227
 corrid-3.210
Token my
Token ID616
Feature activation+1.11e+00

Pos logit contributions
gencies+4.257
racuse+4.133
 Wra+4.029
 Observer+3.938
strom+3.647
Neg logit contributions
��-9.067
 Sabha-7.947
gallery-7.842
agram-7.749
 corrid-7.690
Token 1993
Token ID9656
Feature activation+1.08e+00

Pos logit contributions
gencies+5.288
racuse+5.066
 Wra+4.788
 Observer+4.788
strom+4.602
Neg logit contributions
��-6.521
gallery-5.668
 corrid-5.581
agram-5.558
 Sabha-5.554
Token book
Token ID1492
Feature activation+1.36e+00

Pos logit contributions
gencies+4.292
 Observer+3.985
racuse+3.939
 Wra+3.938
strom+3.777
Neg logit contributions
��-7.336
gallery-6.229
 Sabha-6.191
agram-6.079
advertisement-6.069
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
gencies+5.501
 Wra+5.365
 Observer+5.119
racuse+4.986
 Offensive+4.705
Neg logit contributions
��-9.274
gallery-7.713
agram-7.649
 corrid-7.536
Pinterest-7.466
Token Re
Token ID797
Feature activation+1.46e+00

Pos logit contributions
gencies+3.729
 Wra+3.329
racuse+3.082
 Observer+3.026
emort+2.929
Neg logit contributions
��-9.878
agram-8.541
 Sabha-8.467
Pinterest-8.395
gallery-8.379
Tokenclaim
Token ID6604
Feature activation+4.98e-01

Pos logit contributions
gencies+5.341
 Wra+4.493
strom+4.344
 Calls+4.217
 Observer+4.069
Neg logit contributions
��-10.434
 corrid-9.201
 Sabha-9.041
ال-8.787
agram-8.733
Tokening
Token ID278
Feature activation+8.52e-01

Pos logit contributions
gencies+2.676
racuse+2.536
 Wra+2.467
 Observer+2.325
strom+2.304
Neg logit contributions
��-2.742
gallery-2.354
agram-2.320
 Sabha-2.318
Pinterest-2.302
Token the
Token ID262
Feature activation+3.87e-01

Pos logit contributions
gencies+3.323
 Wra+3.302
 Observer+3.227
racuse+3.225
 Offensive+2.982
Neg logit contributions
��-5.848
gallery-4.883
 corrid-4.819
 Sabha-4.817
Pinterest-4.813
Token American
Token ID1605
Feature activation+6.52e-01

Pos logit contributions
gencies+2.093
racuse+1.960
 Wra+1.892
 Observer+1.824
strom+1.782
Neg logit contributions
��-2.140
gallery-1.805
agram-1.786
 Sabha-1.780
 corrid-1.774
Token Right
Token ID6498
Feature activation+4.50e-01

Pos logit contributions
gencies+2.134
 Wra+1.894
racuse+1.781
 Offensive+1.764
 Observer+1.750
Neg logit contributions
��-5.117
 corrid-4.575
agram-4.376
Pinterest-4.282
gallery-4.270
Token equality
Token ID10537
Feature activation+2.35e-01

Pos logit contributions
gencies+2.683
racuse+2.449
 Wra+2.402
 Observer+2.288
strom+2.232
Neg logit contributions
��-3.038
gallery-2.620
agram-2.596
Pinterest-2.561
 Sabha-2.556
Token and
Token ID290
Feature activation-2.23e-01

Pos logit contributions
gencies+1.176
racuse+1.094
 Wra+1.053
 Observer+1.003
strom+0.984
Neg logit contributions
��-1.368
agram-1.194
gallery-1.185
 corrid-1.180
 Sabha-1.177
Token fair
Token ID3148
Feature activation+8.22e-02

Pos logit contributions
��+1.212
agram+1.046
gallery+1.042
 Sabha+1.037
 corrid+1.030
Neg logit contributions
gencies-1.197
racuse-1.122
 Wra-1.070
 Observer-1.026
strom-1.011
Token treatment
Token ID3513
Feature activation+8.14e-01

Pos logit contributions
gencies+0.448
racuse+0.412
 Wra+0.401
 Observer+0.385
strom+0.377
Neg logit contributions
��-0.451
agram-0.382
gallery-0.378
 Sabha-0.377
 corrid-0.376
Token for
Token ID329
Feature activation+9.70e-01

Pos logit contributions
gencies+3.749
racuse+3.384
 Wra+3.297
 Observer+3.131
strom+3.070
Neg logit contributions
��-4.961
gallery-4.344
agram-4.344
 corrid-4.331
 Sabha-4.284
Token all
Token ID477
Feature activation+1.46e+00

Pos logit contributions
gencies+5.039
 Wra+4.579
racuse+4.541
 Observer+4.420
 Offensive+4.260
Neg logit contributions
��-5.406
agram-4.533
gallery-4.515
 corrid-4.452
 Sabha-4.434
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
gencies+6.046
 Wra+5.443
racuse+5.333
 Observer+5.016
 Children+4.895
Neg logit contributions
��-9.376
agram-8.090
gallery-8.021
 Sabha-7.776
 corrid-7.740
Token
Token ID447
Feature activation-6.48e-02

Pos logit contributions
gencies+0.830
racuse+0.775
 Wra+0.744
 Observer+0.716
strom+0.703
Neg logit contributions
��-0.839
agram-0.722
gallery-0.717
 Sabha-0.710
 corrid-0.708
Token
Token ID251
Feature activation-4.25e-02

Pos logit contributions
��+0.265
agram+0.219
 Sabha+0.219
gallery+0.218
 corrid+0.215
Neg logit contributions
gencies-0.434
racuse-0.409
 Wra-0.396
 Observer-0.383
strom-0.379
Token Mr
Token ID1770
Feature activation-4.21e-02

Pos logit contributions
��+0.231
agram+0.199
gallery+0.199
 Sabha+0.198
 corrid+0.196
Neg logit contributions
gencies-0.229
racuse-0.215
 Wra-0.205
 Observer-0.197
strom-0.194
Token.
Token ID13
Feature activation-2.97e-02

Pos logit contributions
��+0.280
 corrid+0.249
agram+0.240
 Sabha+0.239
gallery+0.239
Neg logit contributions
gencies-0.181
racuse-0.168
 Wra-0.161
 Observer-0.153
strom-0.151
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
gencies+2.206
 Wra+2.025
racuse+1.959
 Observer+1.920
strom+1.876
Neg logit contributions
��-5.189
 corrid-4.941
 entreprene-4.442
 Sabha-4.255
�士-4.221
Token It
Token ID632
Feature activation+1.06e-01

Pos logit contributions
gencies+1.319
 Wra+1.256
 Observer+1.191
racuse+1.189
 Offensive+1.123
Neg logit contributions
��-2.197
 corrid-1.997
 Sabha-1.839
gallery-1.823
agram-1.801
Token will
Token ID481
Feature activation-1.73e-01

Pos logit contributions
gencies+0.399
racuse+0.360
 Wra+0.353
 Observer+0.333
strom+0.320
Neg logit contributions
��-0.767
 corrid-0.681
agram-0.654
 Sabha-0.652
gallery-0.649
Token also
Token ID635
Feature activation+1.14e-02

Pos logit contributions
��+0.953
 corrid+0.819
 Sabha+0.773
agram+0.773
Pinterest+0.770
Neg logit contributions
gencies-0.962
racuse-0.893
 Wra-0.874
 Observer-0.840
strom-0.826
Token included
Token ID3017
Feature activation+8.39e-01

Pos logit contributions
gencies+0.063
racuse+0.058
 Wra+0.057
 Observer+0.055
strom+0.054
Neg logit contributions
��-0.064
 corrid-0.054
 Sabha-0.052
Pinterest-0.051
agram-0.051
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
gencies+3.133
 Wra+3.046
racuse+2.948
 Observer+2.911
strom+2.802
Neg logit contributions
��-5.989
 corrid-5.536
 entreprene-4.944
 Sabha-4.858
��極-4.831
Token add
Token ID751
Feature activation+9.50e-01

Pos logit contributions
gencies+7.684
 Wra+7.234
racuse+7.177
 Observer+6.814
strom+6.686
Neg logit contributions
��-7.826
agram-6.346
Pinterest-6.255
 corrid-6.254
gallery-6.196
Tokent
Token ID83
Feature activation+7.04e-01

Pos logit contributions
gencies+3.340
racuse+2.963
strom+2.873
 Wra+2.840
 Observer+2.686
Neg logit contributions
��-7.317
 corrid-6.692
 Sabha-5.991
 entreprene-5.973
�士-5.912
Tokenional
Token ID1538
Feature activation+3.05e-01

Pos logit contributions
gencies+1.985
racuse+1.755
 Wra+1.572
 Observer+1.478
strom+1.465
Neg logit contributions
��-5.840
 corrid-5.376
Pinterest-4.937
 imagination-4.921
ال-4.904
Token resources
Token ID4133
Feature activation+4.48e-01

Pos logit contributions
gencies+1.637
racuse+1.482
 Wra+1.468
 Observer+1.417
strom+1.392
Neg logit contributions
��-1.781
 corrid-1.501
agram-1.428
gallery-1.416
 Sabha-1.406
Token and
Token ID290
Feature activation+8.43e-01

Pos logit contributions
gencies+1.759
 Wra+1.624
racuse+1.607
 Observer+1.557
strom+1.483
Neg logit contributions
��-3.188
 corrid-2.918
agram-2.620
 Sabha-2.617
gallery-2.585
Token Books
Token ID13661
Feature activation+1.18e-02

Pos logit contributions
��+3.079
agram+2.821
gallery+2.811
Pinterest+2.782
 Sabha+2.777
Neg logit contributions
gencies-1.780
racuse-1.625
 Wra-1.477
emort-1.422
 Observer-1.388
Token,
Token ID11
Feature activation+4.40e-01

Pos logit contributions
gencies+0.053
racuse+0.050
 Wra+0.049
 Observer+0.047
strom+0.046
Neg logit contributions
��-0.074
 corrid-0.066
gallery-0.063
 Sabha-0.063
 imagination-0.062
Token 2000
Token ID4751
Feature activation+6.34e-01

Pos logit contributions
gencies+1.750
 Wra+1.674
racuse+1.649
 Observer+1.632
strom+1.498
Neg logit contributions
��-2.995
 corrid-2.640
 Sabha-2.551
 imagination-2.486
gallery-2.486
Token),
Token ID828
Feature activation+5.49e-01

Pos logit contributions
gencies+2.810
racuse+2.630
 Wra+2.533
 Observer+2.413
strom+2.331
Neg logit contributions
��-4.025
 corrid-3.533
 Sabha-3.503
gallery-3.496
agram-3.473
Token my
Token ID616
Feature activation+8.02e-01

Pos logit contributions
gencies+1.922
 Wra+1.685
racuse+1.663
 Observer+1.583
 Offensive+1.450
Neg logit contributions
��-4.098
gallery-3.648
 corrid-3.566
 Sabha-3.566
agram-3.485
Token biography
Token ID26444
Feature activation+1.45e+00

Pos logit contributions
gencies+3.342
racuse+2.977
 Wra+2.882
 Observer+2.792
strom+2.710
Neg logit contributions
��-5.583
gallery-4.782
 Sabha-4.747
agram-4.706
 corrid-4.683
Token of
Token ID286
Feature activation+7.81e-01

Pos logit contributions
gencies+5.619
racuse+5.256
 Wra+5.232
 Observer+4.859
 Offensive+4.595
Neg logit contributions
��-9.989
 Sabha-8.689
 corrid-8.622
agram-8.602
gallery-8.467
Token the
Token ID262
Feature activation+3.96e-01

Pos logit contributions
gencies+3.679
racuse+3.382
 Wra+3.331
strom+3.142
 Observer+3.108
Neg logit contributions
��-4.838
gallery-4.115
agram-4.082
Pinterest-4.079
 Sabha-4.063
Token great
Token ID1049
Feature activation-1.47e-01

Pos logit contributions
gencies+1.981
racuse+1.819
 Wra+1.765
 Observer+1.692
strom+1.665
Neg logit contributions
��-2.347
agram-2.005
gallery-1.994
 Sabha-1.988
 corrid-1.980
Token libertarian
Token ID19466
Feature activation-1.41e-01

Pos logit contributions
��+0.676
agram+0.599
 Sabha+0.595
gallery+0.590
Pinterest+0.589
Neg logit contributions
gencies-0.885
racuse-0.842
 Wra-0.793
 Observer-0.760
emort-0.759
Token thinker
Token ID45206
Feature activation+3.71e-01

Pos logit contributions
��+0.704
agram+0.645
 imagination+0.638
gallery+0.637
Pinterest+0.635
Neg logit contributions
gencies-0.778
racuse-0.747
 Wra-0.702
emort-0.657
strom-0.653
Token file
Token ID2393
Feature activation+7.06e-02

Pos logit contributions
��+0.469
 Sabha+0.403
agram+0.402
gallery+0.401
 corrid+0.400
Neg logit contributions
gencies-0.416
racuse-0.389
 Wra-0.375
 Observer-0.358
strom-0.351
Token.
Token ID13
Feature activation+8.14e-01

Pos logit contributions
gencies+0.337
racuse+0.312
 Wra+0.302
 Observer+0.286
strom+0.281
Neg logit contributions
��-0.431
 Sabha-0.372
 corrid-0.371
gallery-0.371
agram-0.371
Token
Token ID198
Feature activation+4.78e-01

Pos logit contributions
gencies+3.382
 Wra+3.132
racuse+3.012
 Observer+2.912
strom+2.768
Neg logit contributions
��-5.527
 corrid-4.809
 Sabha-4.646
Pinterest-4.622
agram-4.597
Token
Token ID198
Feature activation+6.11e-01

Pos logit contributions
gencies+2.021
 Wra+1.878
racuse+1.841
 Observer+1.780
strom+1.747
Neg logit contributions
��-3.310
 corrid-3.089
 entreprene-2.702
 Sabha-2.680
agram-2.646
Tokenfile
Token ID7753
Feature activation+9.42e-01

Pos logit contributions
gencies+2.975
racuse+2.639
 Wra+2.589
strom+2.494
 Observer+2.451
Neg logit contributions
��-3.808
 corrid-3.230
 Sabha-3.166
agram-3.109
 imagination-3.094
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
gencies+3.295
 Wra+3.079
racuse+2.978
 Observer+2.879
strom+2.806
Neg logit contributions
��-6.863
 corrid-6.251
 Sabha-5.749
Pinterest-5.729
ال-5.680
Token A
Token ID317
Feature activation+1.22e-01

Pos logit contributions
gencies+5.903
 Wra+5.536
racuse+5.155
 Observer+5.102
 Calls+4.877
Neg logit contributions
��-9.408
 corrid-7.988
Pinterest-7.932
 Sabha-7.878
agram-7.799
Token S
Token ID311
Feature activation-3.77e-01

Pos logit contributions
gencies+0.628
racuse+0.584
 Wra+0.567
 Observer+0.540
strom+0.533
Neg logit contributions
��-0.698
agram-0.598
 Sabha-0.597
gallery-0.594
Pinterest-0.593
Token3
Token ID18
Feature activation+8.27e-02

Pos logit contributions
��+2.126
agram+1.853
gallery+1.845
 Sabha+1.836
Pinterest+1.823
Neg logit contributions
gencies-1.949
racuse-1.822
 Wra-1.733
 Observer-1.658
strom-1.632
Token bucket
Token ID19236
Feature activation+3.22e-01

Pos logit contributions
gencies+0.490
racuse+0.462
 Wra+0.456
 Observer+0.435
strom+0.425
Neg logit contributions
��-0.408
 Sabha-0.339
gallery-0.337
 corrid-0.337
agram-0.336
Token is
Token ID318
Feature activation+3.80e-01

Pos logit contributions
gencies+1.655
racuse+1.536
 Wra+1.506
 Observer+1.437
strom+1.410
Neg logit contributions
��-1.856
 corrid-1.575
 Sabha-1.572
gallery-1.558
Pinterest-1.551
Token <
Token ID1279
Feature activation+7.61e-01

Pos logit contributions
gencies+2.826
racuse+2.673
 Wra+2.600
 Observer+2.461
strom+2.404
Neg logit contributions
��-3.438
 corrid-2.955
agram-2.946
 Sabha-2.939
Pinterest-2.934
Token--
Token ID438
Feature activation-1.18e-01

Pos logit contributions
gencies+3.918
racuse+3.683
 Wra+3.535
strom+3.359
 Observer+3.344
Neg logit contributions
��-4.200
agram-3.664
 corrid-3.610
 Sabha-3.607
Pinterest-3.558
Token Stack
Token ID23881
Feature activation-4.78e-01

Pos logit contributions
��+0.700
agram+0.609
gallery+0.607
 Sabha+0.604
 corrid+0.603
Neg logit contributions
gencies-0.582
racuse-0.543
 Wra-0.520
 Observer-0.495
strom-0.485
Token pointer
Token ID17562
Feature activation+2.43e-01

Pos logit contributions
��+2.072
agram+1.754
gallery+1.731
 Sabha+1.720
Pinterest+1.699
Neg logit contributions
gencies-3.072
racuse-2.916
 Wra-2.770
 Observer-2.682
emort-2.670
Token +
Token ID1343
Feature activation+6.93e-01

Pos logit contributions
gencies+1.229
racuse+1.150
 Wra+1.124
 Observer+1.074
strom+1.046
Neg logit contributions
��-1.416
 corrid-1.202
gallery-1.195
Pinterest-1.191
 Sabha-1.188
Token--------------------
Token ID19351
Feature activation+1.43e+00

Pos logit contributions
gencies+4.394
racuse+4.070
 Wra+4.043
strom+3.902
 Observer+3.819
Neg logit contributions
��-3.201
 corrid-2.839
 imagination-2.350
�士-2.336
 Sabha-2.282
Token--+
Token ID44785
Feature activation+7.13e-01

Pos logit contributions
gencies+6.691
racuse+6.251
 Wra+6.114
 Observer+5.819
 Offensive+5.651
Neg logit contributions
��-8.603
 corrid-8.185
 entreprene-7.102
 imagination-7.058
�士-6.750
Token +
Token ID1343
Feature activation+5.40e-01

Pos logit contributions
gencies+3.576
racuse+3.359
 Wra+3.245
 Observer+3.074
strom+2.996
Neg logit contributions
��-4.074
agram-3.525
 Sabha-3.520
gallery-3.507
 corrid-3.497
Token--------------------
Token ID19351
Feature activation+8.51e-01

Pos logit contributions
gencies+3.527
racuse+3.278
 Wra+3.263
strom+3.127
 Observer+3.107
Neg logit contributions
��-2.366
 corrid-2.060
 imagination-1.775
agram-1.760
 Sabha-1.739
Token--+
Token ID44785
Feature activation+2.35e-01

Pos logit contributions
gencies+4.511
racuse+4.261
 Wra+4.156
 Observer+3.991
strom+3.884
Neg logit contributions
��-4.747
 corrid-4.390
 imagination-3.862
 Sabha-3.777
 entreprene-3.712
Token |
Token ID930
Feature activation+2.76e-01

Pos logit contributions
gencies+1.082
racuse+1.004
 Wra+0.960
 Observer+0.910
strom+0.887
Neg logit contributions
��-1.456
agram-1.276
gallery-1.270
 Sabha-1.269
 corrid-1.264
Token box
Token ID3091
Feature activation+4.00e-01

Pos logit contributions
��+3.221
gallery+2.771
agram+2.743
 Sabha+2.662
 imagination+2.613
Neg logit contributions
gencies-9.371
racuse-8.933
 Wra-8.508
emort-8.405
 Observer-8.227
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
gencies+1.636
racuse+1.542
 Wra+1.531
 Observer+1.460
strom+1.447
Neg logit contributions
��-2.759
 corrid-2.554
 entreprene-2.267
 Sabha-2.265
agram-2.252
Token involving
Token ID7411
Feature activation-1.71e-01

Pos logit contributions
gencies+0.541
racuse+0.513
 Wra+0.500
 Observer+0.472
strom+0.465
Neg logit contributions
��-0.583
 corrid-0.510
 Sabha-0.480
gallery-0.480
Pinterest-0.476
Token the
Token ID262
Feature activation+8.16e-02

Pos logit contributions
��+1.026
 corrid+0.903
 Sabha+0.870
agram+0.868
gallery+0.867
Neg logit contributions
gencies-0.840
racuse-0.788
 Wra-0.758
 Observer-0.725
strom-0.714
Token entire
Token ID2104
Feature activation+3.30e-01

Pos logit contributions
gencies+0.414
racuse+0.391
 Wra+0.380
 Observer+0.366
strom+0.365
Neg logit contributions
��-0.481
 corrid-0.418
 Sabha-0.397
agram-0.390
gallery-0.389
Token palm
Token ID18057
Feature activation+1.41e+00

Pos logit contributions
gencies+1.791
racuse+1.689
 Wra+1.652
 Observer+1.574
strom+1.562
Neg logit contributions
��-1.819
 corrid-1.568
agram-1.473
 Sabha-1.462
ال-1.449
Tokenopl
Token ID20106
Feature activation+9.01e-01

Pos logit contributions
gencies+4.524
racuse+4.143
strom+4.019
 Wra+3.802
agnetic+3.566
Neg logit contributions
��-10.648
 corrid-10.160
 entreprene-9.249
�士-9.142
ال-9.132
Tokenant
Token ID415
Feature activation+5.25e-01

Pos logit contributions
gencies+3.502
racuse+3.273
strom+3.114
 Observer+2.997
 Wra+2.990
Neg logit contributions
��-6.440
 corrid-6.164
 entreprene-5.355
�士-5.316
ال-5.233
Tokenar
Token ID283
Feature activation+1.23e+00

Pos logit contributions
gencies+2.264
racuse+2.133
strom+2.054
 Wra+2.014
 Observer+1.964
Neg logit contributions
��-3.508
 corrid-3.222
ال-2.841
 Sabha-2.827
Pinterest-2.824
Token surfaces
Token ID16649
Feature activation+9.43e-01

Pos logit contributions
gencies+3.183
racuse+2.782
 Wra+2.671
strom+2.425
 Observer+2.306
Neg logit contributions
��-10.301
 corrid-9.381
ال-8.597
agram-8.535
�士-8.531
Token.
Token ID13
Feature activation+8.46e-01

Pos logit contributions
gencies+2.834
 Wra+2.662
strom+2.502
 Observer+2.497
racuse+2.494
Neg logit contributions
��-7.559
 corrid-7.349
 entreprene-6.615
�士-6.256
-6.212
Token day
Token ID1110
Feature activation-8.82e-04

Pos logit contributions
��+2.968
agram+2.567
gallery+2.565
 Sabha+2.565
Pinterest+2.540
Neg logit contributions
gencies-2.977
racuse-2.816
 Wra-2.654
 Observer-2.526
emort-2.514
Token in
Token ID287
Feature activation-1.61e-01

Pos logit contributions
��+0.005
agram+0.005
gallery+0.004
 Sabha+0.004
 corrid+0.004
Neg logit contributions
gencies-0.004
racuse-0.004
 Wra-0.004
 Observer-0.004
strom-0.004
Token Irish
Token ID8685
Feature activation-2.66e-01

Pos logit contributions
��+0.889
agram+0.771
gallery+0.768
 Sabha+0.766
 corrid+0.759
Neg logit contributions
gencies-0.849
racuse-0.796
 Wra-0.756
 Observer-0.724
strom-0.716
Token hospitals
Token ID11301
Feature activation+1.20e-01

Pos logit contributions
��+1.318
agram+1.115
gallery+1.109
 Sabha+1.103
 corrid+1.098
Neg logit contributions
gencies-1.566
racuse-1.472
 Wra-1.415
 Observer-1.362
strom-1.344
Token.
Token ID13
Feature activation+4.61e-01

Pos logit contributions
gencies+0.570
racuse+0.521
 Wra+0.502
 Observer+0.478
strom+0.471
Neg logit contributions
��-0.738
agram-0.641
 corrid-0.640
gallery-0.636
 Sabha-0.635
Token Always
Token ID16622
Feature activation+1.41e+00

Pos logit contributions
gencies+1.873
 Wra+1.786
racuse+1.696
 Observer+1.680
 Offensive+1.595
Neg logit contributions
��-3.124
 corrid-2.790
agram-2.668
gallery-2.640
 Sabha-2.630
Token,
Token ID11
Feature activation+7.56e-01

Pos logit contributions
gencies+5.665
 Wra+5.243
racuse+5.005
 Observer+4.986
strom+4.673
Neg logit contributions
��-9.090
agram-7.752
 corrid-7.722
Pinterest-7.653
gallery-7.623
Token people
Token ID661
Feature activation+2.25e-01

Pos logit contributions
gencies+3.580
 Wra+3.281
racuse+3.214
 Observer+3.141
 Offensive+2.936
Neg logit contributions
��-4.605
agram-3.920
Pinterest-3.881
 corrid-3.874
gallery-3.864
Token said
Token ID531
Feature activation+2.42e-01

Pos logit contributions
gencies+1.110
racuse+1.021
 Wra+0.997
 Observer+0.949
strom+0.928
Neg logit contributions
��-1.336
agram-1.139
gallery-1.139
 corrid-1.136
Pinterest-1.133
Token
Token ID564
Feature activation+2.21e-01

Pos logit contributions
gencies+1.219
racuse+1.125
 Wra+1.097
 Observer+1.044
strom+1.018
Neg logit contributions
��-1.407
agram-1.215
gallery-1.213
Pinterest-1.199
 corrid-1.197
Token
Token ID250
Feature activation-2.09e-01

Pos logit contributions
gencies+1.489
racuse+1.398
 Wra+1.365
 Observer+1.318
strom+1.309
Neg logit contributions
��-0.867
 Sabha-0.734
agram-0.725
gallery-0.719
Pinterest-0.714
Token Mumbai
Token ID22917
Feature activation+2.39e-01

Pos logit contributions
��+0.464
agram+0.393
gallery+0.391
 Sabha+0.388
 corrid+0.387
Neg logit contributions
gencies-0.554
racuse-0.522
 Wra-0.502
 Observer-0.483
strom-0.475
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
gencies+1.099
racuse+1.011
 Wra+0.979
 Observer+0.940
strom+0.905
Neg logit contributions
��-1.483
 corrid-1.296
gallery-1.288
agram-1.287
Pinterest-1.285
Token
Token ID198
Feature activation+3.68e-01

Pos logit contributions
gencies+0.673
racuse+0.622
 Wra+0.611
 Observer+0.587
strom+0.559
Neg logit contributions
��-0.850
 corrid-0.741
agram-0.727
gallery-0.725
 Sabha-0.719
Token
Token ID198
Feature activation+2.33e-01

Pos logit contributions
gencies+1.520
 Wra+1.438
racuse+1.377
 Observer+1.375
strom+1.332
Neg logit contributions
��-2.566
 corrid-2.377
 entreprene-2.097
 Sabha-2.060
agram-2.055
TokenA
Token ID32
Feature activation-4.01e-01

Pos logit contributions
gencies+1.102
racuse+0.998
 Wra+0.967
 Observer+0.936
strom+0.920
Neg logit contributions
��-1.447
 corrid-1.281
 imagination-1.218
 Sabha-1.214
agram-1.212
Token bench
Token ID7624
Feature activation-1.84e+00

Pos logit contributions
��+1.758
agram+1.455
gallery+1.451
 Sabha+1.438
 corrid+1.432
Neg logit contributions
gencies-2.584
racuse-2.445
 Wra-2.356
 Observer-2.279
strom-2.250
Token of
Token ID286
Feature activation-7.92e-01

Pos logit contributions
��+7.941
 Sabha+7.198
agram+7.133
 imagination+6.876
gallery+6.749
Neg logit contributions
gencies-10.642
racuse-10.307
 Wra-9.434
emort-9.324
strom-8.963
Token justices
Token ID27818
Feature activation-1.46e+00

Pos logit contributions
��+4.359
agram+3.852
 Sabha+3.832
gallery+3.754
Pinterest+3.701
Neg logit contributions
gencies-4.109
racuse-3.905
 Wra-3.638
strom-3.490
emort-3.440
Token H
Token ID367
Feature activation-9.85e-01

Pos logit contributions
��+7.397
 Sabha+6.723
agram+6.604
 imagination+6.326
Pinterest+6.319
Neg logit contributions
gencies-7.767
racuse-7.439
 Wra-6.661
emort-6.579
strom-6.533
Token L
Token ID406
Feature activation-6.37e-01

Pos logit contributions
��+5.402
agram+4.869
 Sabha+4.845
gallery+4.657
Pinterest+4.654
Neg logit contributions
gencies-5.031
racuse-4.750
 Wra-4.363
 Observer-4.236
strom-4.206
Token D
Token ID360
Feature activation-5.81e-01

Pos logit contributions
��+3.268
agram+2.852
 Sabha+2.832
gallery+2.744
Pinterest+2.723
Neg logit contributions
gencies-3.584
racuse-3.401
 Wra-3.142
 Observer-3.043
emort-3.021
Token
Token ID247
Feature activation-5.56e-04

Pos logit contributions
��+0.975
 Sabha+0.841
 corrid+0.839
agram+0.831
gallery+0.827
Neg logit contributions
gencies-1.090
racuse-1.014
 Wra-0.982
strom-0.935
 Observer-0.935
Tokens
Token ID82
Feature activation-3.08e-01

Pos logit contributions
��+0.003
 Sabha+0.002
 corrid+0.002
gallery+0.002
Pinterest+0.002
Neg logit contributions
gencies-0.003
racuse-0.003
 Wra-0.003
 Observer-0.003
strom-0.003
Token silly
Token ID14397
Feature activation+1.75e-01

Pos logit contributions
��+1.607
gallery+1.366
agram+1.364
 Sabha+1.357
 corrid+1.349
Neg logit contributions
gencies-1.737
racuse-1.624
 Wra-1.564
 Observer-1.502
strom-1.480
Token.
Token ID13
Feature activation-4.26e-01

Pos logit contributions
gencies+0.831
racuse+0.761
 Wra+0.740
 Observer+0.707
strom+0.692
Neg logit contributions
��-1.066
gallery-0.922
 corrid-0.918
agram-0.916
 Sabha-0.915
Token Not
Token ID1892
Feature activation-1.20e+00

Pos logit contributions
��+2.629
agram+2.322
gallery+2.315
 Sabha+2.307
Pinterest+2.287
Neg logit contributions
gencies-1.967
racuse-1.829
 Wra-1.717
 Observer-1.632
strom-1.610
Token much
Token ID881
Feature activation-1.78e+00

Pos logit contributions
��+6.085
agram+5.485
 Sabha+5.419
gallery+5.394
 imagination+5.384
Neg logit contributions
gencies-6.415
racuse-6.101
 Wra-5.648
 Observer-5.398
emort-5.314
Token to
Token ID284
Feature activation-1.44e+00

Pos logit contributions
��+8.265
 imagination+7.955
 Sabha+7.605
agram+7.578
gallery+7.505
Neg logit contributions
gencies-9.407
racuse-9.322
 Wra-8.113
emort-8.093
olicy-7.817
Token grasp
Token ID13180
Feature activation-5.63e-01

Pos logit contributions
��+7.077
agram+6.375
 imagination+6.372
 Sabha+6.295
Pinterest+6.264
Neg logit contributions
gencies-7.759
racuse-7.513
 Wra-6.749
emort-6.603
strom-6.509
Token at
Token ID379
Feature activation-1.22e-01

Pos logit contributions
��+3.017
agram+2.644
gallery+2.625
 Sabha+2.615
Pinterest+2.595
Neg logit contributions
gencies-3.014
racuse-2.837
 Wra-2.676
 Observer-2.560
strom-2.528
Token here
Token ID994
Feature activation-4.82e-01

Pos logit contributions
��+0.655
gallery+0.558
agram+0.557
 Sabha+0.556
 corrid+0.552
Neg logit contributions
gencies-0.676
racuse-0.627
 Wra-0.607
 Observer-0.581
strom-0.574
Token
Token ID784
Feature activation-6.54e-01

Pos logit contributions
��+2.771
agram+2.436
gallery+2.424
 Sabha+2.417
Pinterest+2.395
Neg logit contributions
gencies-2.416
racuse-2.259
 Wra-2.128
 Observer-2.037
strom-2.006
Token in
Token ID287
Feature activation-9.53e-02

Pos logit contributions
��+1.990
agram+1.732
gallery+1.727
 Sabha+1.721
 corrid+1.706
Neg logit contributions
gencies-1.886
racuse-1.766
 Wra-1.676
 Observer-1.601
strom-1.582
Token value
Token ID1988
Feature activation-2.69e-01

Pos logit contributions
��+0.576
agram+0.498
gallery+0.497
 Sabha+0.495
 corrid+0.494
Neg logit contributions
gencies-0.460
racuse-0.427
 Wra-0.409
 Observer-0.390
strom-0.381
Token by
Token ID416
Feature activation-3.70e-01

Pos logit contributions
��+1.500
agram+1.297
gallery+1.293
 Sabha+1.286
 corrid+1.280
Neg logit contributions
gencies-1.415
racuse-1.324
 Wra-1.262
 Observer-1.209
strom-1.191
Token 87
Token ID10083
Feature activation-7.08e-01

Pos logit contributions
��+2.054
agram+1.773
gallery+1.767
 Sabha+1.759
 corrid+1.749
Neg logit contributions
gencies-1.954
racuse-1.830
 Wra-1.748
 Observer-1.674
strom-1.649
Token per
Token ID583
Feature activation-5.89e-01

Pos logit contributions
��+4.288
agram+3.860
gallery+3.847
 Sabha+3.803
Pinterest+3.796
Neg logit contributions
gencies-3.222
racuse-3.026
 Wra-2.826
 Observer-2.650
emort-2.635
Token cent
Token ID1247
Feature activation-1.78e+00

Pos logit contributions
��+2.687
 corrid+2.222
agram+2.204
 Sabha+2.204
gallery+2.199
Neg logit contributions
gencies-3.717
racuse-3.521
 Wra-3.393
 Observer-3.279
strom-3.243
Token since
Token ID1201
Feature activation-2.35e-01

Pos logit contributions
��+7.433
 Sabha+6.718
gallery+6.578
agram+6.481
 imagination+6.417
Neg logit contributions
gencies-10.420
racuse-9.884
 Wra-9.305
emort-8.966
olicy-8.795
Token the
Token ID262
Feature activation+4.11e-01

Pos logit contributions
��+1.321
agram+1.146
gallery+1.142
 Sabha+1.136
 corrid+1.129
Neg logit contributions
gencies-1.222
racuse-1.142
 Wra-1.090
 Observer-1.043
strom-1.026
Token Beijing
Token ID11618
Feature activation+3.43e-01

Pos logit contributions
gencies+2.156
 Wra+2.001
racuse+1.979
 Observer+1.931
strom+1.835
Neg logit contributions
��-2.279
agram-1.964
gallery-1.943
Pinterest-1.935
 corrid-1.922
Token Games
Token ID5776
Feature activation+3.39e-01

Pos logit contributions
gencies+1.495
racuse+1.360
 Wra+1.360
 Observer+1.293
strom+1.242
Neg logit contributions
��-2.216
Pinterest-1.933
agram-1.929
gallery-1.928
 corrid-1.896
Token,
Token ID11
Feature activation-5.60e-02

Pos logit contributions
gencies+1.645
 Wra+1.516
racuse+1.505
 Observer+1.448
strom+1.415
Neg logit contributions
��-2.009
Pinterest-1.730
 corrid-1.720
agram-1.718
gallery-1.715
Token was
Token ID373
Feature activation-3.66e-03

Pos logit contributions
gencies+0.534
racuse+0.495
 Wra+0.479
 Observer+0.461
emort+0.448
Neg logit contributions
��-0.611
gallery-0.524
Pinterest-0.519
 Sabha-0.519
agram-0.518
Token responsible
Token ID4497
Feature activation+5.12e-01

Pos logit contributions
��+0.019
gallery+0.016
 Sabha+0.016
agram+0.016
 corrid+0.016
Neg logit contributions
gencies-0.021
racuse-0.019
 Wra-0.019
 Observer-0.018
strom-0.018
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
gencies+2.454
racuse+2.273
 Wra+2.193
 Observer+2.116
strom+2.067
Neg logit contributions
��-3.083
 corrid-2.636
gallery-2.635
agram-2.634
 Sabha-2.628
Token
Token ID198
Feature activation-4.51e-01

Pos logit contributions
��+2.114
agram+1.868
gallery+1.860
 Sabha+1.853
Pinterest+1.840
Neg logit contributions
gencies-1.679
racuse-1.563
 Wra-1.464
 Observer-1.394
emort-1.381
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
��+2.437
agram+2.132
gallery+2.130
 Sabha+2.110
Pinterest+2.105
Neg logit contributions
gencies-2.410
racuse-2.260
 Wra-2.141
 Observer-2.050
emort-2.030
TokenTABLE
Token ID38148
Feature activation-1.74e+00

Pos logit contributions
��+1.901
agram+1.614
gallery+1.607
 Sabha+1.607
 corrid+1.601
Neg logit contributions
gencies-2.083
racuse-1.953
 Wra-1.873
 Observer-1.800
strom-1.782
Token 5
Token ID642
Feature activation-7.34e-01

Pos logit contributions
��+8.248
 Sabha+7.562
Pinterest+7.482
agram+7.481
gallery+7.443
Neg logit contributions
gencies-8.950
racuse-8.640
emort-7.804
 Wra-7.522
 Observer-7.483
Token
Token ID198
Feature activation-7.41e-01

Pos logit contributions
��+4.051
agram+3.616
 Sabha+3.570
gallery+3.557
Pinterest+3.515
Neg logit contributions
gencies-3.743
racuse-3.587
 Wra-3.247
emort-3.137
strom-3.105
Token
Token ID198
Feature activation-4.39e-01

Pos logit contributions
��+3.568
gallery+3.274
agram+3.254
Pinterest+3.236
 Sabha+3.184
Neg logit contributions
gencies-4.230
racuse-3.985
 Wra-3.703
emort-3.650
 Observer-3.551
TokenTable
Token ID10962
Feature activation-7.82e-01

Pos logit contributions
��+2.117
agram+1.800
gallery+1.793
 Sabha+1.780
Pinterest+1.771
Neg logit contributions
gencies-2.618
racuse-2.473
 Wra-2.369
 Observer-2.280
strom-2.248
Token 5
Token ID642
Feature activation+5.45e-02

Pos logit contributions
��+4.339
agram+3.744
 Sabha+3.734
gallery+3.690
 imagination+3.687
Neg logit contributions
gencies-3.900
racuse-3.730
 Wra-3.383
emort-3.341
 Observer-3.314
Token,
Token ID11
Feature activation+1.20e-01

Pos logit contributions
��+0.239
agram+0.208
gallery+0.207
 Sabha+0.206
 corrid+0.205
Neg logit contributions
gencies-0.203
racuse-0.190
 Wra-0.181
 Observer-0.173
strom-0.170
Token at
Token ID379
Feature activation-5.66e-02

Pos logit contributions
gencies+0.618
racuse+0.576
 Wra+0.561
 Observer+0.539
strom+0.526
Neg logit contributions
��-0.685
agram-0.581
gallery-0.578
 corrid-0.576
 Sabha-0.574
Token 32
Token ID3933
Feature activation-8.32e-01

Pos logit contributions
��+0.325
agram+0.280
gallery+0.280
 Sabha+0.279
Pinterest+0.277
Neg logit contributions
gencies-0.289
racuse-0.270
 Wra-0.259
 Observer-0.248
strom-0.243
Token%.
Token ID7225
Feature activation-1.74e-01

Pos logit contributions
��+4.753
gallery+4.273
agram+4.249
 corrid+4.209
 Sabha+4.194
Neg logit contributions
gencies-3.990
racuse-3.847
 Wra-3.519
 Observer-3.354
emort-3.328
Token A
Token ID317
Feature activation+6.82e-02

Pos logit contributions
��+1.017
agram+0.889
gallery+0.886
 Sabha+0.883
Pinterest+0.876
Neg logit contributions
gencies-0.865
racuse-0.806
 Wra-0.764
 Observer-0.729
strom-0.718
Token 32
Token ID3933
Feature activation-1.67e+00

Pos logit contributions
gencies+0.353
racuse+0.328
 Wra+0.316
 Observer+0.303
strom+0.300
Neg logit contributions
��-0.389
agram-0.333
 Sabha-0.331
gallery-0.331
Pinterest-0.330
Token%
Token ID4
Feature activation-7.83e-01

Pos logit contributions
��+9.126
gallery+8.491
agram+8.487
 corrid+8.426
Pinterest+8.273
Neg logit contributions
gencies-7.489
racuse-7.299
 Wra-6.447
emort-6.292
 Observer-6.206
Token profit
Token ID7630
Feature activation-6.66e-01

Pos logit contributions
��+3.962
agram+3.499
gallery+3.482
 Sabha+3.458
Pinterest+3.422
Neg logit contributions
gencies-4.350
racuse-4.122
 Wra-3.858
 Observer-3.696
emort-3.682
Token margin
Token ID10330
Feature activation-1.23e-01

Pos logit contributions
��+3.799
agram+3.376
 Sabha+3.356
gallery+3.348
Pinterest+3.316
Neg logit contributions
gencies-3.327
racuse-3.097
 Wra-2.912
 Observer-2.767
emort-2.748
Token on
Token ID319
Feature activation-8.72e-02

Pos logit contributions
��+0.693
agram+0.594
gallery+0.594
 Sabha+0.590
 corrid+0.588
Neg logit contributions
gencies-0.642
racuse-0.600
 Wra-0.576
 Observer-0.552
strom-0.543
Token $
Token ID720
Feature activation-1.87e-01

Pos logit contributions
��+0.478
agram+0.413
gallery+0.412
 Sabha+0.410
 corrid+0.407
Neg logit contributions
gencies-0.465
racuse-0.436
 Wra-0.415
 Observer-0.398
strom-0.392
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
gencies+1.733
racuse+1.594
 Wra+1.556
 Observer+1.479
strom+1.462
Neg logit contributions
��-2.265
 Sabha-1.969
gallery-1.964
agram-1.954
 corrid-1.945
Token
Token ID447
Feature activation+2.80e-01

Pos logit contributions
gencies+0.490
 Wra+0.450
racuse+0.449
 Observer+0.421
strom+0.408
Neg logit contributions
��-0.667
 corrid-0.582
agram-0.573
 Sabha-0.572
gallery-0.572
Token
Token ID251
Feature activation+1.43e-01

Pos logit contributions
gencies+1.857
racuse+1.744
 Wra+1.693
 Observer+1.629
strom+1.620
Neg logit contributions
��-1.147
 Sabha-0.967
agram-0.961
gallery-0.956
Pinterest-0.946
Token<|endoftext|>
Token ID50256
Feature activation-9.03e-01

Pos logit contributions
gencies+0.678
racuse+0.626
 Wra+0.610
 Observer+0.578
strom+0.564
Neg logit contributions
��-0.872
agram-0.758
 corrid-0.753
 Sabha-0.752
gallery-0.752
TokenImage
Token ID5159
Feature activation-6.28e-01

Pos logit contributions
��+4.693
Pinterest+4.252
gallery+4.247
agram+4.192
 Sabha+4.088
Neg logit contributions
gencies-4.817
racuse-4.569
 Wra-4.339
 Observer-4.167
emort-4.102
Token copyright
Token ID6634
Feature activation-1.65e+00

Pos logit contributions
��+2.088
gallery+1.676
agram+1.674
 Sabha+1.641
Pinterest+1.625
Neg logit contributions
gencies-4.653
racuse-4.471
 Wra-4.263
 Observer-4.134
emort-4.119
Token Deadline
Token ID35954
Feature activation+4.90e-01

Pos logit contributions
��+8.116
 Sabha+7.477
Pinterest+7.458
gallery+7.455
agram+7.398
Neg logit contributions
gencies-8.420
racuse-8.016
emort-7.343
 Wra-7.051
strom-6.974
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
gencies+3.005
racuse+2.824
 Wra+2.769
 Observer+2.713
strom+2.658
Neg logit contributions
��-2.306
 corrid-1.855
 Sabha-1.851
agram-1.812
Pinterest-1.777
Token
Token ID198
Feature activation-3.87e-01

Pos logit contributions
��+1.506
 corrid+1.311
agram+1.287
 Sabha+1.281
gallery+1.279
Neg logit contributions
gencies-1.317
racuse-1.228
 Wra-1.185
 Observer-1.134
strom-1.114
TokenPolice
Token ID9039
Feature activation+4.13e-01

Pos logit contributions
��+2.087
agram+1.813
gallery+1.811
Pinterest+1.795
 Sabha+1.795
Neg logit contributions
gencies-2.082
racuse-1.956
 Wra-1.861
 Observer-1.782
emort-1.755
Token have
Token ID423
Feature activation+9.70e-03

Pos logit contributions
gencies+2.168
racuse+1.992
 Wra+1.957
 Observer+1.886
strom+1.826
Neg logit contributions
��-2.352
gallery-1.999
agram-1.959
Pinterest-1.958
 Sabha-1.954
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
gencies+0.812
racuse+0.748
 Wra+0.720
 Observer+0.687
strom+0.682
Neg logit contributions
��-1.082
agram-0.928
 corrid-0.928
gallery-0.925
 Sabha-0.915
Token
Token ID447
Feature activation+1.19e-01

Pos logit contributions
��+0.585
agram+0.501
 corrid+0.498
gallery+0.498
 Sabha+0.496
Neg logit contributions
gencies-0.573
racuse-0.535
 Wra-0.516
 Observer-0.493
strom-0.486
Token
Token ID251
Feature activation+5.76e-02

Pos logit contributions
gencies+0.757
racuse+0.701
 Wra+0.689
 Observer+0.661
strom+0.660
Neg logit contributions
��-0.521
 Sabha-0.440
 corrid-0.439
agram-0.437
gallery-0.430
Token<|endoftext|>
Token ID50256
Feature activation-9.32e-01

Pos logit contributions
gencies+0.289
racuse+0.269
 Wra+0.258
 Observer+0.247
strom+0.242
Neg logit contributions
��-0.334
agram-0.290
gallery-0.288
 corrid-0.287
 Sabha-0.287
TokenImage
Token ID5159
Feature activation-6.46e-01

Pos logit contributions
��+4.825
Pinterest+4.387
gallery+4.378
agram+4.317
ال+4.215
Neg logit contributions
gencies-4.969
racuse-4.715
 Wra-4.479
 Observer-4.304
emort-4.235
Token copyright
Token ID6634
Feature activation-1.65e+00

Pos logit contributions
��+2.120
gallery+1.700
agram+1.696
 Sabha+1.664
Pinterest+1.648
Neg logit contributions
gencies-4.807
racuse-4.619
 Wra-4.403
 Observer-4.272
emort-4.257
Token publicity
Token ID19403
Feature activation-6.65e-01

Pos logit contributions
��+8.007
 Sabha+7.394
Pinterest+7.368
gallery+7.365
agram+7.305
Neg logit contributions
gencies-8.492
racuse-8.086
emort-7.412
 Wra-7.119
strom-7.047
Token Image
Token ID7412
Feature activation-1.10e+00

Pos logit contributions
��+3.336
gallery+2.937
agram+2.910
Pinterest+2.875
 Sabha+2.869
Neg logit contributions
gencies-3.789
racuse-3.558
 Wra-3.368
 Observer-3.230
emort-3.212
Token caption
Token ID8305
Feature activation-1.08e+00

Pos logit contributions
��+2.290
gallery+1.896
agram+1.835
 Sabha+1.758
Pinterest+1.728
Neg logit contributions
gencies-9.338
racuse-8.979
 Wra-8.544
emort-8.412
 Observer-8.273
Token The
Token ID383
Feature activation-3.10e-01

Pos logit contributions
��+6.182
 Sabha+5.659
agram+5.655
gallery+5.606
Pinterest+5.533
Neg logit contributions
gencies-5.010
racuse-4.737
emort-4.230
 Wra-4.114
strom-4.004
Token Greeks
Token ID25059
Feature activation+7.51e-02

Pos logit contributions
��+1.675
agram+1.445
gallery+1.440
 Sabha+1.434
 corrid+1.424
Neg logit contributions
gencies-1.677
racuse-1.573
 Wra-1.499
 Observer-1.435
strom-1.418
TokenYou
Token ID1639
Feature activation+4.18e-02

Pos logit contributions
��+0.864
agram+0.750
gallery+0.745
 Sabha+0.745
 corrid+0.740
Neg logit contributions
gencies-0.760
racuse-0.708
 Wra-0.675
 Observer-0.646
strom-0.636
Token have
Token ID423
Feature activation-2.99e-01

Pos logit contributions
gencies+0.222
racuse+0.206
 Wra+0.199
 Observer+0.191
strom+0.188
Neg logit contributions
��-0.234
agram-0.199
gallery-0.197
 Sabha-0.196
Pinterest-0.196
Token to
Token ID284
Feature activation-3.72e-02

Pos logit contributions
��+1.666
agram+1.450
gallery+1.446
 Sabha+1.441
 corrid+1.430
Neg logit contributions
gencies-1.555
racuse-1.457
 Wra-1.382
 Observer-1.321
strom-1.305
Token wait
Token ID4043
Feature activation-6.06e-01

Pos logit contributions
��+0.209
gallery+0.178
agram+0.178
 Sabha+0.177
 corrid+0.177
Neg logit contributions
gencies-0.196
racuse-0.182
 Wra-0.175
 Observer-0.168
strom-0.165
Token 99
Token ID7388
Feature activation-7.09e-01

Pos logit contributions
��+3.245
agram+2.832
gallery+2.821
 Sabha+2.815
 corrid+2.788
Neg logit contributions
gencies-3.251
racuse-3.065
 Wra-2.889
 Observer-2.764
strom-2.730
Token percent
Token ID1411
Feature activation-1.65e+00

Pos logit contributions
��+4.150
agram+3.685
gallery+3.661
 Sabha+3.642
 corrid+3.628
Neg logit contributions
gencies-3.388
racuse-3.220
 Wra-2.985
strom-2.813
emort-2.810
Token longer
Token ID2392
Feature activation-3.91e-01

Pos logit contributions
��+7.641
 imagination+6.451
 corrid+6.407
gallery+6.372
agram+6.308
Neg logit contributions
gencies-8.862
racuse-8.463
 Wra-8.041
emort-7.667
strom-7.600
Token in
Token ID287
Feature activation-5.01e-02

Pos logit contributions
��+2.241
agram+1.957
gallery+1.949
 Sabha+1.947
 corrid+1.928
Neg logit contributions
gencies-1.980
racuse-1.851
 Wra-1.754
 Observer-1.676
strom-1.649
Token the
Token ID262
Feature activation-1.34e-01

Pos logit contributions
��+0.255
agram+0.215
gallery+0.215
Pinterest+0.214
 Sabha+0.213
Neg logit contributions
gencies-0.289
racuse-0.271
 Wra-0.261
 Observer-0.251
strom-0.247
Token Bronx
Token ID32486
Feature activation+1.33e-01

Pos logit contributions
��+0.686
agram+0.584
gallery+0.582
 Sabha+0.579
Pinterest+0.578
Neg logit contributions
gencies-0.768
racuse-0.718
 Wra-0.691
 Observer-0.668
strom-0.656
Token.
Token ID13
Feature activation+8.00e-02

Pos logit contributions
gencies+0.702
racuse+0.655
 Wra+0.628
 Observer+0.608
strom+0.595
Neg logit contributions
��-0.738
agram-0.639
gallery-0.635
Pinterest-0.634
 corrid-0.628
Token scrap
Token ID15881
Feature activation+1.65e-01

Pos logit contributions
��+1.951
agram+1.676
gallery+1.669
 Sabha+1.663
 corrid+1.650
Neg logit contributions
gencies-2.017
racuse-1.892
 Wra-1.809
 Observer-1.736
strom-1.712
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
gencies+0.824
racuse+0.758
 Wra+0.739
 Observer+0.704
strom+0.700
Neg logit contributions
��-0.973
agram-0.829
 corrid-0.827
gallery-0.826
 Sabha-0.825
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
��+1.841
agram+1.618
gallery+1.615
 Sabha+1.607
Pinterest+1.596
Neg logit contributions
gencies-1.455
racuse-1.353
 Wra-1.276
 Observer-1.216
strom-1.198
Token
Token ID198
Feature activation-2.45e-01

Pos logit contributions
��+0.075
 corrid+0.067
 Sabha+0.063
agram+0.063
gallery+0.062
Neg logit contributions
gencies-0.059
racuse-0.055
 Wra-0.053
 Observer-0.051
strom-0.050
TokenArticle
Token ID14906
Feature activation-5.41e-01

Pos logit contributions
��+1.313
agram+1.122
 Sabha+1.116
 corrid+1.114
gallery+1.113
Neg logit contributions
gencies-1.346
racuse-1.259
 Wra-1.203
 Observer-1.154
strom-1.144
Token Continued
Token ID22168
Feature activation-1.63e+00

Pos logit contributions
��+3.205
gallery+2.891
agram+2.876
Pinterest+2.840
 Sabha+2.837
Neg logit contributions
gencies-2.559
racuse-2.396
 Wra-2.215
emort-2.110
 Observer-2.081
Token Below
Token ID10383
Feature activation-1.00e+00

Pos logit contributions
gallery+2.075
��+1.645
Pinterest+1.640
agram+1.604
advertisement+1.474
Neg logit contributions
gencies-14.684
racuse-13.856
 warr-13.420
emort-13.380
agnetic-13.036
Token
Token ID198
Feature activation-8.90e-02

Pos logit contributions
��+4.600
gallery+4.313
agram+4.232
Pinterest+4.214
 Sabha+4.118
Neg logit contributions
gencies-5.883
racuse-5.503
 Wra-5.071
emort-5.067
 Observer-4.924
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
��+0.535
 corrid+0.475
agram+0.453
 Sabha+0.451
gallery+0.447
Neg logit contributions
gencies-0.438
racuse-0.408
 Wra-0.396
 Observer-0.378
strom-0.373
TokenThe
Token ID464
Feature activation-1.60e-01

Pos logit contributions
��+0.916
 corrid+0.788
agram+0.784
 Sabha+0.782
gallery+0.774
Neg logit contributions
gencies-0.845
racuse-0.785
 Wra-0.751
 Observer-0.719
strom-0.715
Token T
Token ID309
Feature activation-2.43e-01

Pos logit contributions
��+0.857
agram+0.736
gallery+0.732
 Sabha+0.729
Pinterest+0.726
Neg logit contributions
gencies-0.875
racuse-0.819
 Wra-0.784
 Observer-0.755
strom-0.741
Token Serve
Token ID35557
Feature activation+4.31e-01

Pos logit contributions
gencies+0.904
 Wra+0.846
racuse+0.816
 Observer+0.783
 Offensive+0.738
Neg logit contributions
��-1.684
 corrid-1.558
agram-1.456
gallery-1.430
ال-1.424
Token hot
Token ID3024
Feature activation+6.85e-01

Pos logit contributions
gencies+1.508
 Wra+1.394
racuse+1.373
 Observer+1.280
strom+1.251
Neg logit contributions
��-3.207
 corrid-2.829
 Sabha-2.707
gallery-2.694
agram-2.677
Token!
Token ID0
Feature activation+4.67e-01

Pos logit contributions
gencies+2.032
 Wra+1.838
racuse+1.659
strom+1.642
 Observer+1.636
Neg logit contributions
��-5.487
 corrid-5.089
agram-4.687
 Sabha-4.655
gallery-4.599
Token<|endoftext|>
Token ID50256
Feature activation-8.74e-01

Pos logit contributions
gencies+2.313
racuse+2.156
 Wra+2.078
 Observer+1.970
strom+1.930
Neg logit contributions
��-2.708
agram-2.363
gallery-2.362
 corrid-2.350
 Sabha-2.346
TokenImage
Token ID5159
Feature activation-6.28e-01

Pos logit contributions
��+4.562
gallery+4.114
Pinterest+4.114
agram+4.066
 Sabha+3.968
Neg logit contributions
gencies-4.672
racuse-4.428
 Wra-4.205
 Observer-4.039
emort-3.974
Token copyright
Token ID6634
Feature activation-1.63e+00

Pos logit contributions
��+2.075
gallery+1.660
agram+1.657
 Sabha+1.623
Pinterest+1.609
Neg logit contributions
gencies-4.671
racuse-4.486
 Wra-4.280
 Observer-4.152
emort-4.135
Token ESA
Token ID42212
Feature activation+3.38e-01

Pos logit contributions
��+7.965
 Sabha+7.299
Pinterest+7.295
gallery+7.290
agram+7.236
Neg logit contributions
gencies-8.390
racuse-7.974
emort-7.315
 Wra-7.031
strom-6.954
Token Image
Token ID7412
Feature activation-9.56e-01

Pos logit contributions
gencies+2.216
racuse+2.104
 Wra+2.056
 Observer+1.995
strom+1.957
Neg logit contributions
��-1.426
 Sabha-1.163
 corrid-1.160
agram-1.158
gallery-1.139
Token caption
Token ID8305
Feature activation-7.65e-01

Pos logit contributions
��+2.256
gallery+1.788
agram+1.753
 Sabha+1.676
Pinterest+1.649
Neg logit contributions
gencies-7.909
racuse-7.615
 Wra-7.294
emort-7.103
 Observer-7.073
Token Go
Token ID1514
Feature activation+4.60e-02

Pos logit contributions
��+4.628
agram+4.170
gallery+4.154
 Sabha+4.149
Pinterest+4.096
Neg logit contributions
gencies-3.473
racuse-3.203
emort-2.906
 Wra-2.839
strom-2.730
Tokence
Token ID344
Feature activation+2.06e-01

Pos logit contributions
gencies+0.239
racuse+0.222
 Wra+0.214
 Observer+0.207
strom+0.204
Neg logit contributions
��-0.263
 Sabha-0.222
gallery-0.221
 corrid-0.220
agram-0.220
Token or
Token ID393
Feature activation-1.13e-01

Pos logit contributions
gencies+1.234
racuse+1.131
 Wra+1.096
 Observer+1.052
strom+1.020
Neg logit contributions
��-1.669
 corrid-1.426
agram-1.426
gallery-1.421
 Sabha-1.413
Token am
Token ID716
Feature activation+3.54e-01

Pos logit contributions
��+0.633
agram+0.543
gallery+0.542
 Sabha+0.540
 corrid+0.537
Neg logit contributions
gencies-0.591
racuse-0.551
 Wra-0.528
 Observer-0.505
strom-0.497
Token I
Token ID314
Feature activation-1.09e-04

Pos logit contributions
gencies+1.720
racuse+1.571
 Wra+1.531
 Observer+1.456
strom+1.443
Neg logit contributions
��-2.126
 corrid-1.832
 Sabha-1.830
gallery-1.830
Pinterest-1.819
Token fool
Token ID9192
Feature activation-1.94e-01

Pos logit contributions
��+0.001
agram+0.000
 Sabha+0.000
gallery+0.000
 corrid+0.000
Neg logit contributions
gencies-0.001
racuse-0.001
 Wra-0.001
 Observer-0.001
strom-0.001
Tokening
Token ID278
Feature activation-5.12e-01

Pos logit contributions
��+1.030
agram+0.869
gallery+0.867
 Sabha+0.867
 corrid+0.867
Neg logit contributions
gencies-1.081
racuse-1.009
 Wra-0.982
 Observer-0.934
strom-0.934
Token myself
Token ID3589
Feature activation-1.62e+00

Pos logit contributions
��+3.330
agram+2.997
gallery+2.989
 Sabha+2.969
Pinterest+2.944
Neg logit contributions
gencies-2.167
racuse-2.001
 Wra-1.853
 Observer-1.750
strom-1.722
Token?
Token ID30
Feature activation-7.05e-02

Pos logit contributions
��+6.998
gallery+6.403
agram+6.342
 Sabha+6.269
 imagination+6.178
Neg logit contributions
gencies-9.345
racuse-8.951
 Wra-8.371
emort-8.067
 Observer-7.982
Token
Token ID198
Feature activation+2.44e-01

Pos logit contributions
��+0.394
agram+0.344
gallery+0.343
 Sabha+0.341
Pinterest+0.339
Neg logit contributions
gencies-0.368
racuse-0.344
 Wra-0.324
 Observer-0.311
strom-0.307
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
gencies+1.103
racuse+1.022
 Wra+1.012
 Observer+0.964
strom+0.955
Neg logit contributions
��-1.600
 corrid-1.470
 Sabha-1.322
agram-1.312
 imagination-1.290
TokenAll
Token ID3237
Feature activation+6.47e-01

Pos logit contributions
gencies+1.623
racuse+1.477
 Wra+1.453
 Observer+1.386
strom+1.386
Neg logit contributions
��-2.181
 corrid-1.896
 Sabha-1.813
agram-1.793
 imagination-1.782
Token these
Token ID777
Feature activation-2.78e-01

Pos logit contributions
gencies+3.031
racuse+2.782
 Wra+2.753
 Observer+2.625
strom+2.571
Neg logit contributions
��-3.976
agram-3.387
 corrid-3.387
 Sabha-3.383
gallery-3.379
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
��+0.846
agram+0.741
gallery+0.739
 Sabha+0.735
 corrid+0.732
Neg logit contributions
gencies-0.673
racuse-0.626
 Wra-0.594
 Observer-0.566
strom-0.558
Token
Token ID198
Feature activation+3.03e-01

Pos logit contributions
��+0.797
agram+0.700
gallery+0.697
 Sabha+0.697
Pinterest+0.689
Neg logit contributions
gencies-0.702
racuse-0.656
 Wra-0.618
 Observer-0.590
emort-0.585
Token
Token ID198
Feature activation+1.68e-01

Pos logit contributions
gencies+1.348
racuse+1.244
 Wra+1.238
 Observer+1.180
strom+1.169
Neg logit contributions
��-2.017
 corrid-1.861
 Sabha-1.644
agram-1.643
 entreprene-1.621
TokenAdvertisement
Token ID4723
Feature activation-6.74e-01

Pos logit contributions
gencies+0.900
racuse+0.824
 Wra+0.797
strom+0.772
 Observer+0.771
Neg logit contributions
��-0.955
 corrid-0.820
agram-0.781
 Sabha-0.777
 imagination-0.771
Token Continue
Token ID10054
Feature activation-1.55e+00

Pos logit contributions
��+3.449
gallery+3.104
agram+3.074
 Sabha+3.030
Pinterest+3.018
Neg logit contributions
gencies-3.706
racuse-3.499
 Wra-3.254
emort-3.173
strom-3.096
Token reading
Token ID3555
Feature activation-1.62e+00

Pos logit contributions
��+3.822
gallery+3.734
agram+3.601
 imagination+3.512
 Sabha+3.442
Neg logit contributions
gencies-11.581
racuse-11.055
 Wra-10.582
emort-10.398
olicy-10.390
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
��+6.015
gallery+5.945
agram+5.901
 Sabha+5.712
Pinterest+5.635
Neg logit contributions
gencies-10.057
racuse-9.321
emort-8.830
 Wra-8.765
olicy-8.653
Token main
Token ID1388
Feature activation-1.41e+00

Pos logit contributions
��+4.338
gallery+4.097
 Sabha+3.987
agram+3.975
 imagination+3.901
Neg logit contributions
gencies-9.171
racuse-8.884
 Wra-8.453
emort-8.199
olicy-8.193
Token story
Token ID1621
Feature activation-2.90e-01

Pos logit contributions
��+5.175
gallery+4.939
agram+4.811
 Sabha+4.751
 imagination+4.740
Neg logit contributions
gencies-9.110
racuse-8.771
 Wra-8.334
emort-8.011
olicy-8.003
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
��+1.638
agram+1.413
 corrid+1.407
gallery+1.404
 Sabha+1.403
Neg logit contributions
gencies-1.507
racuse-1.410
 Wra-1.348
 Observer-1.290
strom-1.271
Token
Token ID198
Feature activation-2.10e-02

Pos logit contributions
��+0.955
 corrid+0.837
agram+0.813
 Sabha+0.808
gallery+0.805
Neg logit contributions
gencies-0.818
racuse-0.762
 Wra-0.734
 Observer-0.702
strom-0.693
Token million
Token ID1510
Feature activation+2.15e-01

Pos logit contributions
��+0.992
agram+0.862
gallery+0.861
 Sabha+0.858
 corrid+0.855
Neg logit contributions
gencies-0.878
racuse-0.819
 Wra-0.781
 Observer-0.748
strom-0.735
Token years
Token ID812
Feature activation-3.78e-01

Pos logit contributions
gencies+1.198
racuse+1.123
 Wra+1.082
 Observer+1.043
strom+1.025
Neg logit contributions
��-1.127
gallery-0.968
 corrid-0.962
 Sabha-0.959
agram-0.957
Token.
Token ID13
Feature activation+3.57e-01

Pos logit contributions
��+2.228
agram+1.957
gallery+1.946
 Sabha+1.940
Pinterest+1.922
Neg logit contributions
gencies-1.849
racuse-1.722
 Wra-1.628
 Observer-1.548
strom-1.525
Token<|endoftext|>
Token ID50256
Feature activation-9.42e-01

Pos logit contributions
gencies+1.625
racuse+1.530
 Wra+1.507
 Observer+1.429
strom+1.375
Neg logit contributions
��-2.223
 corrid-1.981
 Sabha-1.913
agram-1.904
gallery-1.892
TokenImage
Token ID5159
Feature activation-6.50e-01

Pos logit contributions
��+4.871
Pinterest+4.434
gallery+4.423
agram+4.360
ال+4.258
Neg logit contributions
gencies-5.032
racuse-4.780
 Wra-4.539
 Observer-4.362
emort-4.289
Token copyright
Token ID6634
Feature activation-1.61e+00

Pos logit contributions
��+2.134
gallery+1.709
agram+1.707
 Sabha+1.671
Pinterest+1.656
Neg logit contributions
gencies-4.835
racuse-4.648
 Wra-4.429
 Observer-4.294
emort-4.281
Token AFP
Token ID8916
Feature activation-1.17e-01

Pos logit contributions
��+7.886
 Sabha+7.240
gallery+7.218
Pinterest+7.208
agram+7.153
Neg logit contributions
gencies-8.340
racuse-7.930
emort-7.272
 Wra-6.998
strom-6.910
Token
Token ID198
Feature activation-2.75e-01

Pos logit contributions
��+0.399
gallery+0.316
agram+0.312
 Sabha+0.307
Pinterest+0.307
Neg logit contributions
gencies-0.866
racuse-0.830
 Wra-0.794
 Observer-0.773
emort-0.767
Token
Token ID198
Feature activation-3.40e-01

Pos logit contributions
��+1.599
 corrid+1.393
agram+1.367
 Sabha+1.360
gallery+1.358
Neg logit contributions
gencies-1.399
racuse-1.303
 Wra-1.258
 Observer-1.204
strom-1.183
TokenThe
Token ID464
Feature activation-1.22e-01

Pos logit contributions
��+1.815
agram+1.562
gallery+1.556
 Sabha+1.549
 corrid+1.538
Neg logit contributions
gencies-1.858
racuse-1.743
 Wra-1.664
 Observer-1.597
strom-1.575
Token Bank
Token ID5018
Feature activation+1.31e-01

Pos logit contributions
��+0.646
agram+0.551
gallery+0.551
 Sabha+0.546
Pinterest+0.543
Neg logit contributions
gencies-0.683
racuse-0.639
 Wra-0.614
 Observer-0.593
strom-0.580
Token What
Token ID1867
Feature activation+3.69e-01

Pos logit contributions
gencies+1.324
racuse+1.241
 Wra+1.229
 Observer+1.170
strom+1.127
Neg logit contributions
��-1.920
 corrid-1.682
agram-1.630
gallery-1.620
 Sabha-1.619
Token we
Token ID356
Feature activation-2.43e-01

Pos logit contributions
gencies+1.804
racuse+1.684
 Wra+1.633
 Observer+1.575
strom+1.537
Neg logit contributions
��-2.218
Pinterest-1.863
gallery-1.855
agram-1.853
 corrid-1.850
Token're
Token ID821
Feature activation-4.94e-01

Pos logit contributions
��+1.340
agram+1.156
gallery+1.153
 Sabha+1.148
 corrid+1.142
Neg logit contributions
gencies-1.289
racuse-1.207
 Wra-1.152
 Observer-1.104
strom-1.088
Token learning
Token ID4673
Feature activation-6.71e-01

Pos logit contributions
��+2.344
agram+1.987
gallery+1.982
 Sabha+1.969
Pinterest+1.952
Neg logit contributions
gencies-2.988
racuse-2.827
 Wra-2.696
 Observer-2.601
strom-2.569
Token so
Token ID523
Feature activation+2.55e-01

Pos logit contributions
��+3.285
agram+2.868
gallery+2.856
 Sabha+2.846
Pinterest+2.820
Neg logit contributions
gencies-3.888
racuse-3.651
 Wra-3.471
 Observer-3.309
emort-3.296
Token far
Token ID1290
Feature activation-1.61e+00

Pos logit contributions
gencies+1.411
racuse+1.329
 Wra+1.278
 Observer+1.233
strom+1.214
Neg logit contributions
��-1.359
gallery-1.144
agram-1.143
 Sabha-1.138
 corrid-1.131
Token isn
Token ID2125
Feature activation+3.65e-02

Pos logit contributions
��+6.605
agram+6.207
gallery+6.054
 Sabha+6.053
Pinterest+5.989
Neg logit contributions
gencies-9.455
racuse-8.941
 Wra-8.264
emort-8.085
olicy-7.970
Token't
Token ID470
Feature activation-1.84e-01

Pos logit contributions
gencies+0.168
racuse+0.156
 Wra+0.150
 Observer+0.143
strom+0.142
Neg logit contributions
��-0.233
 corrid-0.210
 Sabha-0.197
agram-0.195
gallery-0.194
Token terribly
Token ID22121
Feature activation-9.05e-01

Pos logit contributions
��+0.997
agram+0.845
gallery+0.843
 Sabha+0.837
 corrid+0.833
Neg logit contributions
gencies-0.999
racuse-0.936
 Wra-0.903
 Observer-0.867
strom-0.852
Token illuminating
Token ID46717
Feature activation-3.24e-02

Pos logit contributions
��+4.330
 Sabha+3.861
agram+3.829
gallery+3.800
Pinterest+3.793
Neg logit contributions
gencies-5.215
racuse-4.963
 Wra-4.642
emort-4.493
 Observer-4.430
Token,
Token ID11
Feature activation-5.44e-02

Pos logit contributions
��+0.193
agram+0.167
gallery+0.166
 corrid+0.165
 Sabha+0.165
Neg logit contributions
gencies-0.158
racuse-0.147
 Wra-0.141
 Observer-0.135
strom-0.133
Token dollars
Token ID5054
Feature activation-5.47e-02

Pos logit contributions
��+1.740
agram+1.514
 Sabha+1.498
gallery+1.493
 corrid+1.487
Neg logit contributions
gencies-1.923
racuse-1.810
 Wra-1.713
 Observer-1.641
emort-1.623
Token.
Token ID13
Feature activation+9.79e-02

Pos logit contributions
��+0.337
agram+0.295
gallery+0.294
 Sabha+0.294
 corrid+0.293
Neg logit contributions
gencies-0.255
racuse-0.237
 Wra-0.224
 Observer-0.214
strom-0.210
Token
Token ID198
Feature activation+4.60e-01

Pos logit contributions
gencies+0.485
racuse+0.449
 Wra+0.439
 Observer+0.418
strom+0.408
Neg logit contributions
��-0.578
 corrid-0.503
agram-0.497
gallery-0.495
 Sabha-0.493
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
gencies+1.955
 Wra+1.811
racuse+1.782
 Observer+1.722
strom+1.704
Neg logit contributions
��-3.162
 corrid-2.973
 entreprene-2.603
 Sabha-2.554
agram-2.547
TokenStory
Token ID11605
Feature activation-1.69e-01

Pos logit contributions
gencies+1.230
racuse+1.128
 Wra+1.087
strom+1.063
 Observer+1.045
Neg logit contributions
��-1.192
 corrid-1.027
 Sabha-0.983
agram-0.976
 imagination-0.969
Token continues
Token ID4477
Feature activation-1.61e+00

Pos logit contributions
��+0.939
 corrid+0.814
agram+0.793
 Sabha+0.791
gallery+0.789
Neg logit contributions
gencies-0.900
racuse-0.833
 Wra-0.815
 Observer-0.784
strom-0.773
Token below
Token ID2174
Feature activation-6.83e-01

Pos logit contributions
��+2.303
gallery+2.292
 Sabha+2.076
agram+1.991
Pinterest+1.813
Neg logit contributions
gencies-13.629
racuse-13.371
 Wra-12.747
emort-12.536
agnetic-12.401
Token advertisement
Token ID15422
Feature activation-7.14e-01

Pos logit contributions
��+1.228
gallery+0.857
agram+0.837
 Sabha+0.796
Pinterest+0.778
Neg logit contributions
gencies-6.025
racuse-5.858
 Wra-5.626
emort-5.475
 Observer-5.461
Token
Token ID198
Feature activation-8.42e-02

Pos logit contributions
��+3.482
gallery+3.192
agram+3.133
Pinterest+3.119
 Sabha+3.105
Neg logit contributions
gencies-4.059
racuse-3.822
 Wra-3.562
emort-3.494
 Observer-3.412
Token
Token ID198
Feature activation-9.78e-02

Pos logit contributions
��+0.505
 corrid+0.449
agram+0.428
 Sabha+0.426
gallery+0.423
Neg logit contributions
gencies-0.416
racuse-0.387
 Wra-0.375
 Observer-0.359
strom-0.354
TokenStory
Token ID11605
Feature activation-7.17e-02

Pos logit contributions
��+0.451
 corrid+0.373
agram+0.370
 Sabha+0.368
gallery+0.364
Neg logit contributions
gencies-0.616
racuse-0.577
 Wra-0.557
 Observer-0.538
strom-0.537
Token
Token ID247
Feature activation-2.64e-01

Pos logit contributions
��+0.668
 corrid+0.570
agram+0.567
 Sabha+0.567
gallery+0.564
Neg logit contributions
gencies-0.745
racuse-0.696
 Wra-0.671
 Observer-0.641
strom-0.639
Tokens
Token ID82
Feature activation+9.13e-02

Pos logit contributions
��+1.077
 corrid+0.868
agram+0.849
gallery+0.839
 Sabha+0.838
Neg logit contributions
gencies-1.809
racuse-1.717
 Wra-1.669
 Observer-1.617
strom-1.613
Token Et
Token ID17906
Feature activation+1.00e-01

Pos logit contributions
gencies+0.433
racuse+0.403
 Wra+0.379
 Observer+0.360
emort+0.356
Neg logit contributions
��-0.556
agram-0.488
gallery-0.485
 Sabha-0.485
Pinterest-0.480
Tokeniquette
Token ID40387
Feature activation+1.37e-02

Pos logit contributions
gencies+0.316
racuse+0.276
 Wra+0.254
 Observer+0.236
strom+0.233
Neg logit contributions
��-0.781
 corrid-0.712
 Sabha-0.695
gallery-0.689
agram-0.687
Token,
Token ID11
Feature activation-3.16e-01

Pos logit contributions
gencies+0.073
racuse+0.069
 Wra+0.064
 Observer+0.062
emort+0.062
Neg logit contributions
��-0.076
agram-0.065
 Sabha-0.065
gallery-0.065
Pinterest-0.064
Token 18
Token ID1248
Feature activation-1.60e+00

Pos logit contributions
��+1.353
agram+1.108
 Sabha+1.105
gallery+1.101
Pinterest+1.079
Neg logit contributions
gencies-2.073
racuse-1.956
 Wra-1.851
emort-1.805
strom-1.792
Token77
Token ID3324
Feature activation-2.66e-01

Pos logit contributions
��+5.366
Pinterest+5.039
agram+4.952
gallery+4.937
 Sabha+4.842
Neg logit contributions
gencies-10.863
racuse-10.266
emort-9.524
 Wra-9.361
olicy-9.153
Token.
Token ID13
Feature activation+7.57e-02

Pos logit contributions
��+1.525
agram+1.338
gallery+1.329
 Sabha+1.318
 corrid+1.302
Neg logit contributions
gencies-1.336
racuse-1.264
 Wra-1.158
emort-1.126
 Observer-1.108
Token
Token ID198
Feature activation+6.11e-02

Pos logit contributions
gencies+0.415
racuse+0.390
 Wra+0.366
 Observer+0.354
emort+0.351
Neg logit contributions
��-0.408
agram-0.350
gallery-0.349
 Sabha-0.346
 corrid-0.344
Token
Token ID198
Feature activation+9.24e-02

Pos logit contributions
gencies+0.288
racuse+0.268
 Wra+0.263
 Observer+0.251
strom+0.247
Neg logit contributions
��-0.384
 corrid-0.347
 Sabha-0.320
agram-0.320
gallery-0.315
Token15
Token ID1314
Feature activation+5.39e-01

Pos logit contributions
gencies+0.582
racuse+0.553
 Wra+0.530
 Observer+0.514
emort+0.508
Neg logit contributions
��-0.414
gallery-0.350
Pinterest-0.348
agram-0.347
 Sabha-0.346
Token Figure
Token ID11291
Feature activation-8.97e-01

Pos logit contributions
gencies+0.299
racuse+0.283
 Wra+0.272
 Observer+0.262
strom+0.259
Neg logit contributions
��-0.220
agram-0.184
gallery-0.183
 Sabha-0.182
 corrid-0.181
Token 3
Token ID513
Feature activation+1.12e-01

Pos logit contributions
��+4.813
agram+4.275
gallery+4.260
 Sabha+4.215
Pinterest+4.142
Neg logit contributions
gencies-4.644
racuse-4.433
 Wra-4.066
emort-3.960
 Observer-3.932
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
gencies+0.529
racuse+0.483
 Wra+0.468
 Observer+0.444
strom+0.436
Neg logit contributions
��-0.688
 Sabha-0.598
 corrid-0.598
agram-0.597
Pinterest-0.594
Token
Token ID198
Feature activation-3.14e-01

Pos logit contributions
gencies+0.887
racuse+0.817
 Wra+0.806
 Observer+0.758
strom+0.734
Neg logit contributions
��-1.235
 corrid-1.082
agram-1.063
 Sabha-1.050
gallery-1.048
Token
Token ID198
Feature activation-2.85e-01

Pos logit contributions
��+1.787
 corrid+1.540
agram+1.536
 Sabha+1.529
gallery+1.528
Neg logit contributions
gencies-1.627
racuse-1.520
 Wra-1.454
 Observer-1.392
strom-1.372
TokenFigure
Token ID11337
Feature activation-1.60e+00

Pos logit contributions
��+1.124
agram+0.911
gallery+0.905
 Sabha+0.901
 corrid+0.892
Neg logit contributions
gencies-1.965
racuse-1.868
 Wra-1.802
 Observer-1.744
strom-1.727
Token 3
Token ID513
Feature activation-8.59e-04

Pos logit contributions
��+6.793
gallery+5.977
agram+5.894
 Sabha+5.765
 imagination+5.631
Neg logit contributions
gencies-8.915
racuse-8.772
emort-8.099
 Observer-7.940
 Wra-7.922
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
��+0.005
gallery+0.004
agram+0.004
 Sabha+0.004
 corrid+0.004
Neg logit contributions
gencies-0.004
racuse-0.004
 Wra-0.004
 Observer-0.004
strom-0.004
Token Ass
Token ID2195
Feature activation+6.88e-02

Pos logit contributions
��+0.427
agram+0.372
gallery+0.370
 Sabha+0.369
 corrid+0.369
Neg logit contributions
gencies-0.335
racuse-0.311
 Wra-0.297
 Observer-0.282
strom-0.277
Tokenigning
Token ID38944
Feature activation+1.48e-01

Pos logit contributions
gencies+0.379
racuse+0.354
 Wra+0.341
 Observer+0.325
strom+0.324
Neg logit contributions
��-0.371
 corrid-0.317
gallery-0.313
 Sabha-0.313
Pinterest-0.311
Token mass
Token ID2347
Feature activation-4.63e-01

Pos logit contributions
gencies+0.663
racuse+0.615
 Wra+0.581
 Observer+0.551
strom+0.540
Neg logit contributions
��-0.945
agram-0.831
gallery-0.827
 Sabha-0.823
Pinterest-0.823
Token
Token ID247
Feature activation-2.58e-01

Pos logit contributions
��+0.725
 corrid+0.621
 Sabha+0.614
agram+0.614
gallery+0.610
Neg logit contributions
gencies-0.798
racuse-0.744
 Wra-0.717
 Observer-0.684
strom-0.683
Tokens
Token ID82
Feature activation+1.26e-01

Pos logit contributions
��+1.046
 corrid+0.844
agram+0.822
 Sabha+0.812
gallery+0.810
Neg logit contributions
gencies-1.769
racuse-1.680
 Wra-1.632
 Observer-1.580
strom-1.577
Token Et
Token ID17906
Feature activation+9.15e-02

Pos logit contributions
gencies+0.600
racuse+0.559
 Wra+0.525
 Observer+0.498
emort+0.496
Neg logit contributions
��-0.763
agram-0.668
gallery-0.665
 Sabha-0.664
Pinterest-0.657
Tokeniquette
Token ID40387
Feature activation-1.81e-01

Pos logit contributions
gencies+0.290
racuse+0.253
 Wra+0.233
 Observer+0.217
strom+0.214
Neg logit contributions
��-0.713
 corrid-0.650
 Sabha-0.634
gallery-0.629
agram-0.628
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
��+0.968
agram+0.832
 Sabha+0.828
gallery+0.822
Pinterest+0.815
Neg logit contributions
gencies-0.996
racuse-0.937
 Wra-0.871
emort-0.846
 Observer-0.835
Token 18
Token ID1248
Feature activation-1.60e+00

Pos logit contributions
��+1.424
 Sabha+1.156
agram+1.153
gallery+1.152
 corrid+1.127
Neg logit contributions
gencies-2.218
racuse-2.099
 Wra-1.982
emort-1.946
strom-1.927
Token77
Token ID3324
Feature activation-1.98e-01

Pos logit contributions
��+6.129
Pinterest+5.697
gallery+5.630
 Sabha+5.535
agram+5.528
Neg logit contributions
gencies-10.098
racuse-9.468
emort-8.893
 Wra-8.575
agnetic-8.370
Token.
Token ID13
Feature activation+6.20e-02

Pos logit contributions
��+1.284
 corrid+1.132
agram+1.115
gallery+1.111
 Sabha+1.111
Neg logit contributions
gencies-0.872
racuse-0.802
 Wra-0.772
 Observer-0.733
strom-0.719
Token
Token ID198
Feature activation+4.86e-02

Pos logit contributions
gencies+0.306
racuse+0.286
 Wra+0.279
 Observer+0.264
strom+0.259
Neg logit contributions
��-0.364
 corrid-0.320
agram-0.314
 Sabha-0.311
gallery-0.311
Token
Token ID198
Feature activation+4.01e-02

Pos logit contributions
gencies+0.230
racuse+0.213
 Wra+0.209
 Observer+0.200
strom+0.197
Neg logit contributions
��-0.305
 corrid-0.276
 Sabha-0.254
agram-0.254
gallery-0.251
Token17
Token ID1558
Feature activation+4.07e-01

Pos logit contributions
gencies+0.254
racuse+0.242
 Wra+0.233
 Observer+0.225
emort+0.222
Neg logit contributions
��-0.178
gallery-0.149
agram-0.149
 Sabha-0.148
Pinterest-0.147
Token property
Token ID3119
Feature activation-6.43e-01

Pos logit contributions
��+1.134
agram+0.987
gallery+0.984
 Sabha+0.978
 corrid+0.974
Neg logit contributions
gencies-0.953
racuse-0.887
 Wra-0.844
 Observer-0.808
strom-0.791
Token.
Token ID13
Feature activation-9.32e-03

Pos logit contributions
��+3.578
 Sabha+3.197
agram+3.182
gallery+3.157
 imagination+3.110
Neg logit contributions
gencies-3.280
racuse-3.081
 Wra-2.889
strom-2.737
 Observer-2.724
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
��+0.051
agram+0.045
gallery+0.044
 Sabha+0.044
Pinterest+0.044
Neg logit contributions
gencies-0.049
racuse-0.046
 Wra-0.044
 Observer-0.042
strom-0.042
Token
Token ID198
Feature activation+2.76e-01

Pos logit contributions
gencies+1.855
 Wra+1.727
racuse+1.699
 Observer+1.649
strom+1.618
Neg logit contributions
��-3.051
 corrid-2.845
 entreprene-2.516
 Sabha-2.463
agram-2.451
TokenThe
Token ID464
Feature activation-2.51e-01

Pos logit contributions
gencies+1.445
racuse+1.316
 Wra+1.280
 Observer+1.228
strom+1.221
Neg logit contributions
��-1.584
 corrid-1.380
 imagination-1.304
agram-1.298
 Sabha-1.289
Token bench
Token ID7624
Feature activation-1.59e+00

Pos logit contributions
��+1.336
agram+1.140
gallery+1.136
 corrid+1.126
Pinterest+1.126
Neg logit contributions
gencies-1.390
racuse-1.302
 Wra-1.248
 Observer-1.207
strom-1.176
Token,
Token ID11
Feature activation-5.15e-01

Pos logit contributions
��+7.433
 Sabha+6.988
agram+6.864
 imagination+6.728
gallery+6.656
Neg logit contributions
gencies-8.684
racuse-8.409
 Wra-7.655
emort-7.492
strom-7.215
Token however
Token ID2158
Feature activation-1.51e-01

Pos logit contributions
��+2.836
agram+2.460
gallery+2.450
 Sabha+2.444
 corrid+2.420
Neg logit contributions
gencies-2.723
racuse-2.555
 Wra-2.428
 Observer-2.322
strom-2.293
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
��+0.926
 corrid+0.796
gallery+0.789
agram+0.786
Pinterest+0.783
Neg logit contributions
gencies-0.729
racuse-0.675
 Wra-0.650
 Observer-0.624
strom-0.614
Token was
Token ID373
Feature activation+2.41e-02

Pos logit contributions
��+1.974
agram+1.719
gallery+1.716
 Sabha+1.706
 corrid+1.700
Neg logit contributions
gencies-1.612
racuse-1.498
 Wra-1.425
 Observer-1.361
strom-1.337
Token not
Token ID407
Feature activation-4.39e-01

Pos logit contributions
gencies+0.121
racuse+0.111
 Wra+0.107
 Observer+0.103
strom+0.101
Neg logit contributions
��-0.141
gallery-0.121
 corrid-0.121
agram-0.120
 Sabha-0.119
Tokenported
Token ID9213
Feature activation-3.10e-01

Pos logit contributions
��+2.522
agram+2.195
 Sabha+2.185
gallery+2.181
 corrid+2.176
Neg logit contributions
gencies-2.002
racuse-1.862
 Wra-1.771
 Observer-1.689
strom-1.663
Token License
Token ID13789
Feature activation-1.58e-01

Pos logit contributions
��+1.884
agram+1.651
gallery+1.645
 Sabha+1.640
 corrid+1.631
Neg logit contributions
gencies-1.472
racuse-1.366
 Wra-1.298
 Observer-1.235
strom-1.214
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
��+0.948
agram+0.832
gallery+0.829
 Sabha+0.826
Pinterest+0.821
Neg logit contributions
gencies-0.759
racuse-0.706
 Wra-0.667
 Observer-0.637
strom-0.627
Token<|endoftext|>
Token ID50256
Feature activation-9.23e-01

Pos logit contributions
gencies+1.204
racuse+1.105
 Wra+1.077
 Observer+1.020
strom+0.982
Neg logit contributions
��-1.671
agram-1.455
 Sabha-1.453
 corrid-1.449
gallery-1.441
TokenImage
Token ID5159
Feature activation-5.95e-01

Pos logit contributions
��+4.789
Pinterest+4.342
gallery+4.335
agram+4.278
ال+4.175
Neg logit contributions
gencies-4.926
racuse-4.674
 Wra-4.438
 Observer-4.262
emort-4.194
Token copyright
Token ID6634
Feature activation-1.59e+00

Pos logit contributions
��+1.963
gallery+1.565
agram+1.564
 Sabha+1.533
Pinterest+1.518
Neg logit contributions
gencies-4.427
racuse-4.250
 Wra-4.061
 Observer-3.939
emort-3.919
Token AP
Token ID3486
Feature activation-2.17e-01

Pos logit contributions
��+7.803
 Sabha+7.158
gallery+7.137
Pinterest+7.137
agram+7.078
Neg logit contributions
gencies-8.203
racuse-7.806
emort-7.147
 Wra-6.908
strom-6.810
Token Image
Token ID7412
Feature activation-8.67e-01

Pos logit contributions
��+0.728
agram+0.572
gallery+0.571
Pinterest+0.556
 Sabha+0.555
Neg logit contributions
gencies-1.620
racuse-1.557
 Wra-1.481
 Observer-1.443
emort-1.433
Token caption
Token ID8305
Feature activation-9.26e-01

Pos logit contributions
��+2.049
gallery+1.615
agram+1.590
 Sabha+1.531
Pinterest+1.516
Neg logit contributions
gencies-7.197
racuse-6.921
 Wra-6.608
emort-6.461
 Observer-6.406
Token Peace
Token ID12689
Feature activation-6.99e-02

Pos logit contributions
��+5.314
 Sabha+4.881
agram+4.859
gallery+4.797
Pinterest+4.748
Neg logit contributions
gencies-4.384
racuse-4.137
emort-3.705
 Wra-3.634
strom-3.516
Token talks
Token ID6130
Feature activation+2.57e-01

Pos logit contributions
��+0.388
agram+0.333
gallery+0.332
Pinterest+0.329
 Sabha+0.329
Neg logit contributions
gencies-0.370
racuse-0.347
 Wra-0.333
 Observer-0.321
strom-0.313
Token Bonnie
Token ID39203
Feature activation-8.22e-02

Pos logit contributions
��+0.467
agram+0.406
gallery+0.406
 Sabha+0.404
 corrid+0.402
Neg logit contributions
gencies-0.391
racuse-0.364
 Wra-0.348
 Observer-0.332
strom-0.326
Token &
Token ID1222
Feature activation-2.24e-01

Pos logit contributions
��+0.474
agram+0.412
gallery+0.411
 corrid+0.409
 Sabha+0.409
Neg logit contributions
gencies-0.413
racuse-0.388
 Wra-0.371
 Observer-0.355
strom-0.349
Token Baby
Token ID14801
Feature activation-2.77e-01

Pos logit contributions
��+1.286
agram+1.120
gallery+1.116
 Sabha+1.111
Pinterest+1.103
Neg logit contributions
gencies-1.133
racuse-1.057
 Wra-1.004
 Observer-0.959
strom-0.945
Token 2008
Token ID3648
Feature activation-5.48e-02

Pos logit contributions
��+1.588
agram+1.387
gallery+1.380
 Sabha+1.375
Pinterest+1.364
Neg logit contributions
gencies-1.402
racuse-1.307
 Wra-1.238
 Observer-1.182
strom-1.166
Token01
Token ID486
Feature activation+3.34e-01

Pos logit contributions
��+0.282
agram+0.244
gallery+0.242
Pinterest+0.240
 Sabha+0.240
Neg logit contributions
gencies-0.309
racuse-0.290
 Wra-0.275
 Observer-0.265
emort-0.264
Token09
Token ID2931
Feature activation+8.37e-01

Pos logit contributions
gencies+1.930
racuse+1.828
 Wra+1.724
 Observer+1.667
strom+1.647
Neg logit contributions
��-1.677
agram-1.443
gallery-1.437
 Sabha-1.430
Pinterest-1.413
Token O
Token ID440
Feature activation+2.48e-01

Pos logit contributions
 Wra+3.414
gencies+3.349
 Observer+3.314
 Offensive+3.170
racuse+3.089
Neg logit contributions
��-5.677
 corrid-5.204
 entreprene-4.632
agram-4.546
Pinterest-4.521
Token&
Token ID5
Feature activation-5.47e-01

Pos logit contributions
gencies+0.716
racuse+0.641
 Wra+0.609
strom+0.568
 Observer+0.561
Neg logit contributions
��-2.012
 corrid-1.855
 Sabha-1.740
 imagination-1.728
gallery-1.708
TokenA
Token ID32
Feature activation+1.56e-01

Pos logit contributions
��+2.929
agram+2.558
gallery+2.554
 Sabha+2.528
Pinterest+2.523
Neg logit contributions
gencies-2.965
racuse-2.780
 Wra-2.637
 Observer-2.525
emort-2.502
Token-
Token ID12
Feature activation+5.46e-01

Pos logit contributions
gencies+0.765
racuse+0.720
 Wra+0.715
 Observer+0.684
strom+0.670
Neg logit contributions
��-0.952
 corrid-0.856
 Sabha-0.774
agram-0.773
 imagination-0.767
Token08
Token ID2919
Feature activation+1.24e-01

Pos logit contributions
gencies+2.883
racuse+2.685
 Wra+2.643
 Observer+2.554
strom+2.503
Neg logit contributions
��-3.007
 corrid-2.562
 Sabha-2.500
agram-2.477
 imagination-2.465
Token a
Token ID257
Feature activation+1.43e-01

Pos logit contributions
gencies+0.995
racuse+0.930
 Wra+0.900
 Observer+0.865
strom+0.850
Neg logit contributions
��-1.063
gallery-0.900
agram-0.900
 Sabha-0.896
Pinterest-0.896
Token fu
Token ID14035
Feature activation+7.11e-02

Pos logit contributions
gencies+0.719
racuse+0.671
 Wra+0.646
 Observer+0.616
strom+0.611
Neg logit contributions
��-0.832
agram-0.719
 Sabha-0.716
gallery-0.716
Pinterest-0.714
Tokenselage
Token ID45217
Feature activation-3.07e-01

Pos logit contributions
gencies+0.302
racuse+0.278
 Wra+0.260
 Observer+0.247
strom+0.244
Neg logit contributions
��-0.469
 Sabha-0.413
agram-0.412
gallery-0.411
 corrid-0.411
Token about
Token ID546
Feature activation+3.59e-01

Pos logit contributions
��+1.766
agram+1.541
gallery+1.535
 Sabha+1.528
 corrid+1.518
Neg logit contributions
gencies-1.548
racuse-1.444
 Wra-1.369
 Observer-1.309
strom-1.288
Token 5
Token ID642
Feature activation-4.99e-02

Pos logit contributions
gencies+1.701
racuse+1.607
 Wra+1.559
 Observer+1.492
strom+1.448
Neg logit contributions
��-2.150
gallery-1.853
 corrid-1.847
Pinterest-1.842
 Sabha-1.832
Token'
Token ID6
Feature activation+8.94e-01

Pos logit contributions
��+0.233
agram+0.194
 Sabha+0.194
gallery+0.193
 corrid+0.192
Neg logit contributions
gencies-0.306
racuse-0.290
 Wra-0.280
 Observer-0.270
strom-0.268
Token wide
Token ID3094
Feature activation-3.15e-01

Pos logit contributions
gencies+4.334
racuse+4.104
 Wra+3.970
strom+3.893
 Observer+3.824
Neg logit contributions
��-5.101
 corrid-4.394
Pinterest-4.357
 Sabha-4.347
agram-4.323
Token.
Token ID13
Feature activation+2.14e-01

Pos logit contributions
��+1.821
agram+1.595
gallery+1.590
 Sabha+1.582
Pinterest+1.568
Neg logit contributions
gencies-1.578
racuse-1.466
 Wra-1.386
 Observer-1.323
strom-1.299
Token I
Token ID314
Feature activation+3.13e-02

Pos logit contributions
gencies+0.909
racuse+0.839
 Wra+0.833
 Observer+0.783
strom+0.753
Neg logit contributions
��-1.421
 corrid-1.257
 Sabha-1.223
gallery-1.221
agram-1.220
Token had
Token ID550
Feature activation-7.39e-02

Pos logit contributions
gencies+0.161
racuse+0.150
 Wra+0.144
 Observer+0.137
strom+0.136
Neg logit contributions
��-0.181
gallery-0.154
 corrid-0.154
 Sabha-0.153
agram-0.153
Token folded
Token ID24650
Feature activation-1.50e-01

Pos logit contributions
��+0.410
agram+0.353
gallery+0.352
 Sabha+0.350
 corrid+0.349
Neg logit contributions
gencies-0.391
racuse-0.366
 Wra-0.350
 Observer-0.336
strom-0.330
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
gencies+2.561
racuse+2.492
 Wra+2.436
 Observer+2.391
strom+2.254
Neg logit contributions
��-5.322
 corrid-4.992
 entreprene-4.441
 Sabha-4.437
agram-4.425
Token hear
Token ID3285
Feature activation+6.75e-01

Pos logit contributions
gencies+6.618
racuse+6.250
 Contact+5.715
 Observer+5.657
 Wra+5.464
Neg logit contributions
��-9.401
 corrid-8.812
��極-8.278
Pinterest-8.126
 entreprene-8.034
Token about
Token ID546
Feature activation+1.30e+00

Pos logit contributions
gencies+3.401
racuse+3.279
 Wra+3.175
 Observer+3.111
strom+3.094
Neg logit contributions
��-3.995
 corrid-3.738
��極-3.150
 entreprene-3.148
 Sabha-3.140
Token Nation
Token ID8741
Feature activation+1.24e+00

Pos logit contributions
gencies+2.778
 Wra+2.685
racuse+2.598
 Observer+2.455
 Threat+2.211
Neg logit contributions
��-11.269
 corrid-10.180
��極-9.537
Pinterest-9.489
�士-9.289
Token Travels
Token ID48899
Feature activation+2.62e-02

Pos logit contributions
gencies+3.378
racuse+3.021
 Wra+2.874
 Observer+2.651
strom+2.403
Neg logit contributions
��-10.027
 corrid-8.987
agram-8.524
ال-8.451
 imagination-8.415
Token destinations
Token ID23982
Feature activation+1.10e+00

Pos logit contributions
gencies+0.107
 Wra+0.097
 Observer+0.096
racuse+0.095
strom+0.090
Neg logit contributions
��-0.182
 corrid-0.160
 Sabha-0.157
agram-0.154
gallery-0.152
Token,
Token ID11
Feature activation+5.19e-01

Pos logit contributions
gencies+3.645
racuse+3.471
 Wra+3.471
 Observer+3.337
strom+3.276
Neg logit contributions
��-8.259
 corrid-8.094
 entreprene-7.131
�士-6.790
��極-6.755
Token and
Token ID290
Feature activation+5.92e-01

Pos logit contributions
gencies+2.036
racuse+1.832
 Wra+1.828
 Observer+1.746
strom+1.674
Neg logit contributions
��-3.709
 corrid-3.499
 entreprene-3.054
agram-3.053
 Sabha-3.042
Token explore
Token ID7301
Feature activation+7.15e-01

Pos logit contributions
gencies+2.867
racuse+2.645
 Wra+2.502
 Observer+2.476
strom+2.385
Neg logit contributions
��-3.482
 corrid-3.143
 Sabha-2.984
agram-2.969
Pinterest-2.945
Token the
Token ID262
Feature activation+7.68e-01

Pos logit contributions
gencies+2.721
racuse+2.606
 Wra+2.595
 Observer+2.460
strom+2.425
Neg logit contributions
��-5.011
 corrid-4.744
 entreprene-4.222
 Sabha-4.138
��極-4.121
Token world
Token ID995
Feature activation+1.36e+00

Pos logit contributions
gencies+3.363
 Wra+3.099
racuse+3.071
 Observer+3.017
strom+2.972
Neg logit contributions
��-4.978
 corrid-4.545
��極-4.079
Pinterest-4.052
�士-4.040
Token 80
Token ID4019
Feature activation+2.25e-01

Pos logit contributions
��+3.251
 Sabha+2.794
gallery+2.783
agram+2.777
Pinterest+2.771
Neg logit contributions
gencies-4.515
racuse-4.173
 Wra-3.915
emort-3.859
strom-3.827
Token)
Token ID8
Feature activation+4.04e-01

Pos logit contributions
gencies+1.159
racuse+1.075
 Wra+1.020
 Observer+0.979
emort+0.964
Neg logit contributions
��-1.276
agram-1.116
gallery-1.110
 Sabha-1.109
Pinterest-1.097
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
gencies+1.903
racuse+1.772
 Wra+1.733
 Observer+1.643
strom+1.582
Neg logit contributions
��-2.455
 corrid-2.150
agram-2.139
 Sabha-2.118
gallery-2.117
Token
Token ID198
Feature activation+3.27e-01

Pos logit contributions
gencies+0.823
racuse+0.763
 Wra+0.756
 Observer+0.723
strom+0.711
Neg logit contributions
��-1.199
 corrid-1.101
 Sabha-0.989
agram-0.983
gallery-0.964
Token27
Token ID1983
Feature activation+4.54e-01

Pos logit contributions
gencies+1.810
racuse+1.688
 Wra+1.618
 Observer+1.553
strom+1.547
Neg logit contributions
��-1.750
 corrid-1.510
 Sabha-1.486
agram-1.461
 imagination-1.433
Token"
Token ID1
Feature activation+7.96e-01

Pos logit contributions
gencies+2.638
racuse+2.499
 Wra+2.462
 Observer+2.366
strom+2.320
Neg logit contributions
��-2.277
 corrid-1.993
 Sabha-1.867
agram-1.866
gallery-1.845
Token 12
Token ID1105
Feature activation+2.73e-01

Pos logit contributions
gencies+3.346
racuse+3.103
 Wra+2.963
 Observer+2.800
strom+2.651
Neg logit contributions
��-5.172
 corrid-4.613
agram-4.604
gallery-4.551
 Sabha-4.548
Tokenms
Token ID907
Feature activation+9.19e-01

Pos logit contributions
gencies+1.215
racuse+1.121
 Wra+1.066
 Observer+1.012
strom+1.001
Neg logit contributions
��-1.740
 corrid-1.528
agram-1.526
 Sabha-1.521
gallery-1.514
Token G
Token ID402
Feature activation-2.00e-02

Pos logit contributions
gencies+4.616
 Wra+4.327
racuse+4.316
 Observer+4.130
strom+3.938
Neg logit contributions
��-5.161
 corrid-4.510
agram-4.371
Pinterest-4.295
gallery-4.267
Token2
Token ID17
Feature activation+5.35e-01

Pos logit contributions
��+0.091
gallery+0.077
agram+0.077
 Sabha+0.077
Pinterest+0.076
Neg logit contributions
gencies-0.124
racuse-0.117
 Wra-0.112
 Observer-0.108
strom-0.107
TokenG
Token ID38
Feature activation+3.97e-01

Pos logit contributions
gencies+2.443
 Wra+2.288
racuse+2.243
 Observer+2.154
strom+2.138
Neg logit contributions
��-3.456
 corrid-3.187
agram-2.816
 imagination-2.784
 Sabha-2.784
Token program
Token ID1430
Feature activation+4.79e-01

Pos logit contributions
gencies+1.828
racuse+1.705
 Wra+1.662
 Observer+1.602
strom+1.551
Neg logit contributions
��-2.023
agram-1.726
gallery-1.723
 Sabha-1.714
Pinterest-1.711
Token.
Token ID13
Feature activation+4.74e-01

Pos logit contributions
gencies+2.289
racuse+2.145
 Wra+2.086
 Observer+2.001
strom+1.962
Neg logit contributions
��-2.863
 Sabha-2.447
gallery-2.442
agram-2.436
Pinterest-2.433
Token Tuesday
Token ID3431
Feature activation+3.64e-01

Pos logit contributions
gencies+2.165
 Wra+2.033
racuse+2.032
 Observer+1.945
strom+1.865
Neg logit contributions
��-2.961
 corrid-2.592
agram-2.501
gallery-2.497
 Sabha-2.494
Token morning
Token ID3329
Feature activation+5.02e-02

Pos logit contributions
gencies+1.763
racuse+1.638
 Wra+1.596
 Observer+1.536
strom+1.493
Neg logit contributions
��-2.179
gallery-1.867
agram-1.862
 corrid-1.855
Pinterest-1.855
Token's
Token ID338
Feature activation+2.33e-01

Pos logit contributions
gencies+0.246
racuse+0.230
 Wra+0.220
 Observer+0.210
strom+0.206
Neg logit contributions
��-0.297
agram-0.258
gallery-0.257
 Sabha-0.256
Pinterest-0.255
Token Fox
Token ID5426
Feature activation+8.18e-01

Pos logit contributions
gencies+1.192
racuse+1.108
 Wra+1.076
 Observer+1.032
strom+1.000
Neg logit contributions
��-1.341
agram-1.150
gallery-1.150
Pinterest-1.144
 Sabha-1.142
Token &
Token ID1222
Feature activation+3.09e-01

Pos logit contributions
gencies+3.483
racuse+3.128
 Wra+3.058
 Observer+2.982
strom+2.833
Neg logit contributions
��-5.372
gallery-4.617
Pinterest-4.601
agram-4.574
 Sabha-4.549
Token Friends
Token ID14213
Feature activation+8.74e-02

Pos logit contributions
gencies+1.092
racuse+1.001
 Wra+0.953
 Observer+0.901
strom+0.857
Neg logit contributions
��-2.236
agram-2.020
gallery-2.007
 Sabha-1.994
 corrid-1.994
Token First
Token ID3274
Feature activation+6.78e-01

Pos logit contributions
gencies+0.440
racuse+0.410
 Wra+0.393
 Observer+0.377
strom+0.369
Neg logit contributions
��-0.507
agram-0.439
gallery-0.438
 Sabha-0.435
Pinterest-0.434
Token will
Token ID481
Feature activation+1.33e-01

Pos logit contributions
gencies+3.205
 Wra+2.909
racuse+2.903
 Observer+2.785
strom+2.730
Neg logit contributions
��-4.051
Pinterest-3.519
gallery-3.503
agram-3.464
 corrid-3.458
Token be
Token ID307
Feature activation+1.71e-01

Pos logit contributions
gencies+0.712
racuse+0.663
 Wra+0.638
 Observer+0.616
strom+0.601
Neg logit contributions
��-0.734
gallery-0.626
 corrid-0.622
agram-0.621
 Sabha-0.619
Token keeps
Token ID7622
Feature activation-2.37e-01

Pos logit contributions
��+0.108
agram+0.092
gallery+0.092
 Sabha+0.092
 corrid+0.091
Neg logit contributions
gencies-0.103
racuse-0.096
 Wra-0.093
 Observer-0.089
strom-0.087
Token changing
Token ID5609
Feature activation+3.05e-01

Pos logit contributions
��+1.294
agram+1.114
gallery+1.110
 Sabha+1.106
 corrid+1.098
Neg logit contributions
gencies-1.274
racuse-1.192
 Wra-1.141
 Observer-1.094
strom-1.077
Token.
Token ID13
Feature activation-3.41e-01

Pos logit contributions
gencies+1.405
racuse+1.272
 Wra+1.254
 Observer+1.209
strom+1.166
Neg logit contributions
��-1.904
 corrid-1.634
agram-1.629
gallery-1.629
 Sabha-1.620
Token
Token ID198
Feature activation-6.27e-02

Pos logit contributions
��+2.093
agram+1.838
gallery+1.832
 Sabha+1.825
 corrid+1.816
Neg logit contributions
gencies-1.599
racuse-1.483
 Wra-1.406
 Observer-1.338
strom-1.315
Token
Token ID198
Feature activation-2.03e-01

Pos logit contributions
��+0.383
 corrid+0.341
agram+0.322
 Sabha+0.321
gallery+0.318
Neg logit contributions
gencies-0.305
racuse-0.284
 Wra-0.276
 Observer-0.264
strom-0.260
TokenC
Token ID34
Feature activation+1.45e-01

Pos logit contributions
��+0.812
gallery+0.674
agram+0.666
Pinterest+0.665
 Sabha+0.657
Neg logit contributions
gencies-1.369
racuse-1.310
 Wra-1.259
 Observer-1.224
emort-1.203
Token.
Token ID13
Feature activation-1.06e-01

Pos logit contributions
gencies+0.569
racuse+0.516
 Wra+0.504
strom+0.480
 Observer+0.473
Neg logit contributions
��-1.050
 corrid-0.953
 Sabha-0.870
 imagination-0.866
agram-0.863
TokenK
Token ID42
Feature activation-4.59e-02

Pos logit contributions
��+0.574
 corrid+0.496
agram+0.463
 Sabha+0.456
 imagination+0.452
Neg logit contributions
gencies-0.592
racuse-0.554
 Wra-0.551
 Observer-0.528
strom-0.522
Token.:
Token ID11207
Feature activation-3.57e-01

Pos logit contributions
��+0.302
 corrid+0.273
 Sabha+0.255
agram+0.254
 imagination+0.253
Neg logit contributions
gencies-0.205
racuse-0.189
 Wra-0.181
 Observer-0.172
strom-0.171
Token And
Token ID843
Feature activation-8.73e-02

Pos logit contributions
��+2.215
agram+1.959
gallery+1.953
 Sabha+1.945
Pinterest+1.934
Neg logit contributions
gencies-1.640
racuse-1.522
 Wra-1.426
 Observer-1.354
strom-1.337
Token then
Token ID788
Feature activation+9.98e-02

Pos logit contributions
��+0.482
agram+0.412
gallery+0.411
 corrid+0.408
 Sabha+0.408
Neg logit contributions
gencies-0.467
racuse-0.434
 Wra-0.419
 Observer-0.403
strom-0.395
Token refers
Token ID10229
Feature activation-3.07e-02

Pos logit contributions
��+1.588
agram+1.372
gallery+1.367
 Sabha+1.361
 corrid+1.352
Neg logit contributions
gencies-1.536
racuse-1.439
 Wra-1.373
 Observer-1.315
strom-1.296
Token to
Token ID284
Feature activation-6.40e-01

Pos logit contributions
��+0.175
agram+0.152
 corrid+0.148
 Sabha+0.148
gallery+0.148
Neg logit contributions
gencies-0.158
racuse-0.149
 Wra-0.142
 Observer-0.135
emort-0.133
Token himself
Token ID2241
Feature activation-7.28e-01

Pos logit contributions
��+4.077
agram+3.699
 Sabha+3.649
gallery+3.635
Pinterest+3.615
Neg logit contributions
gencies-2.756
racuse-2.535
 Wra-2.306
emort-2.215
 Observer-2.176
Token as
Token ID355
Feature activation-2.07e-01

Pos logit contributions
��+3.688
agram+3.235
 Sabha+3.180
 corrid+3.123
ال+3.112
Neg logit contributions
gencies-4.100
racuse-3.825
 Wra-3.553
emort-3.550
strom-3.399
Token '
Token ID705
Feature activation-1.70e-02

Pos logit contributions
��+1.026
agram+0.870
gallery+0.868
 Sabha+0.863
 corrid+0.857
Neg logit contributions
gencies-1.210
racuse-1.140
 Wra-1.094
 Observer-1.053
strom-1.038
TokenGu
Token ID8205
Feature activation+2.12e-01

Pos logit contributions
��+0.096
agram+0.082
 Sabha+0.082
 corrid+0.082
gallery+0.082
Neg logit contributions
gencies-0.088
racuse-0.082
 Wra-0.079
 Observer-0.076
strom-0.075
Tokencc
Token ID535
Feature activation-1.31e-01

Pos logit contributions
gencies+1.034
racuse+0.972
 Wra+0.958
 Observer+0.909
strom+0.905
Neg logit contributions
��-1.257
 Sabha-1.085
Pinterest-1.068
 corrid-1.060
gallery-1.056
Tokenifer
Token ID7087
Feature activation+2.88e-02

Pos logit contributions
��+1.082
 Sabha+0.967
gallery+0.960
agram+0.957
 corrid+0.955
Neg logit contributions
gencies-0.352
racuse-0.303
 Wra-0.285
 Observer-0.264
strom-0.250
Token 2
Token ID362
Feature activation-2.82e-01

Pos logit contributions
gencies+0.150
racuse+0.140
 Wra+0.135
 Observer+0.129
strom+0.128
Neg logit contributions
��-0.166
gallery-0.140
 Sabha-0.140
 corrid-0.139
agram-0.139
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
��+1.709
agram+1.511
gallery+1.509
 Sabha+1.500
Pinterest+1.494
Neg logit contributions
gencies-1.325
racuse-1.233
 Wra-1.158
 Observer-1.100
strom-1.081
Token0
Token ID15
Feature activation-2.39e-01

Pos logit contributions
gencies+1.039
 Wra+1.025
racuse+0.975
 Observer+0.935
strom+0.926
Neg logit contributions
��-1.603
 corrid-1.390
 Sabha-1.386
agram-1.354
 imagination-1.320
Token from
Token ID422
Feature activation+1.09e-01

Pos logit contributions
gencies+1.937
racuse+1.792
 Wra+1.788
 Observer+1.699
strom+1.647
Neg logit contributions
��-2.365
 Sabha-1.998
 corrid-1.995
gallery-1.991
agram-1.986
Token Amazon
Token ID6186
Feature activation+5.33e-01

Pos logit contributions
gencies+0.681
racuse+0.643
 Wra+0.625
 Observer+0.601
strom+0.592
Neg logit contributions
��-0.497
agram-0.410
 Sabha-0.408
gallery-0.408
 corrid-0.406
Token,
Token ID11
Feature activation+1.11e-01

Pos logit contributions
gencies+2.523
 Wra+2.274
racuse+2.266
 Observer+2.144
strom+2.099
Neg logit contributions
��-3.310
 Sabha-2.824
agram-2.796
 corrid-2.784
gallery-2.780
Token download
Token ID4321
Feature activation-5.06e-01

Pos logit contributions
gencies+0.566
racuse+0.520
 Wra+0.509
 Observer+0.484
strom+0.472
Neg logit contributions
��-0.646
agram-0.554
 Sabha-0.552
gallery-0.549
 corrid-0.547
Token it
Token ID340
Feature activation+2.93e-01

Pos logit contributions
��+2.655
agram+2.302
gallery+2.299
Pinterest+2.277
 corrid+2.269
Neg logit contributions
gencies-2.784
racuse-2.618
 Wra-2.461
 Observer-2.371
strom-2.353
Token to
Token ID284
Feature activation+2.57e-01

Pos logit contributions
gencies+1.438
racuse+1.340
 Wra+1.327
 Observer+1.255
strom+1.220
Neg logit contributions
��-1.745
 Sabha-1.487
agram-1.485
gallery-1.475
 corrid-1.468
Token your
Token ID534
Feature activation-2.21e-01

Pos logit contributions
gencies+1.139
racuse+1.049
 Wra+1.025
 Observer+0.957
strom+0.931
Neg logit contributions
��-1.658
agram-1.440
gallery-1.435
 Sabha-1.432
 corrid-1.422
Token PC
Token ID4217
Feature activation-4.36e-01

Pos logit contributions
��+1.282
agram+1.118
gallery+1.115
 Sabha+1.110
 corrid+1.103
Neg logit contributions
gencies-1.104
racuse-1.030
 Wra-0.978
 Observer-0.934
strom-0.920
Token and
Token ID290
Feature activation-1.81e-01

Pos logit contributions
��+2.542
agram+2.226
gallery+2.220
 Sabha+2.208
Pinterest+2.195
Neg logit contributions
gencies-2.162
racuse-2.011
 Wra-1.903
 Observer-1.818
strom-1.793
Token save
Token ID3613
Feature activation-3.45e-01

Pos logit contributions
��+0.927
agram+0.784
gallery+0.782
 Sabha+0.780
 corrid+0.774
Neg logit contributions
gencies-1.041
racuse-0.978
 Wra-0.944
 Observer-0.906
strom-0.891
Token it
Token ID340
Feature activation+1.32e-01

Pos logit contributions
��+1.757
agram+1.508
gallery+1.502
 Sabha+1.493
Pinterest+1.485
Neg logit contributions
gencies-1.962
racuse-1.846
 Wra-1.759
 Observer-1.692
strom-1.670
Token
Token ID198
Feature activation+9.20e-02

Pos logit contributions
��+0.078
agram+0.068
gallery+0.068
 Sabha+0.067
Pinterest+0.067
Neg logit contributions
gencies-0.073
racuse-0.068
 Wra-0.064
 Observer-0.062
strom-0.061
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
gencies+0.431
racuse+0.400
 Wra+0.392
 Observer+0.375
strom+0.371
Neg logit contributions
��-0.583
 corrid-0.528
 Sabha-0.485
agram-0.482
gallery-0.476
TokenIn
Token ID818
Feature activation-1.41e-01

Pos logit contributions
gencies+0.788
racuse+0.726
 Wra+0.703
strom+0.684
 Observer+0.672
Neg logit contributions
��-0.840
 corrid-0.727
 imagination-0.686
agram-0.683
 Sabha-0.683
Token March
Token ID2805
Feature activation-8.50e-02

Pos logit contributions
��+0.804
agram+0.693
gallery+0.691
 corrid+0.689
Pinterest+0.687
Neg logit contributions
gencies-0.725
racuse-0.677
 Wra-0.649
 Observer-0.621
strom-0.610
Token,
Token ID11
Feature activation+3.57e-01

Pos logit contributions
��+0.481
 corrid+0.410
agram+0.410
gallery+0.409
Pinterest+0.406
Neg logit contributions
gencies-0.442
racuse-0.414
 Wra-0.399
 Observer-0.383
strom-0.377
Token we
Token ID356
Feature activation+7.47e-01

Pos logit contributions
gencies+1.798
racuse+1.686
 Wra+1.662
 Observer+1.596
strom+1.550
Neg logit contributions
��-2.053
 corrid-1.754
Pinterest-1.733
agram-1.725
gallery-1.721
Token drew
Token ID9859
Feature activation-3.34e-01

Pos logit contributions
gencies+3.511
racuse+3.309
 Wra+3.211
 Observer+3.084
strom+3.022
Neg logit contributions
��-4.557
Pinterest-3.833
 corrid-3.821
gallery-3.796
agram-3.788
Token attention
Token ID3241
Feature activation+3.80e-01

Pos logit contributions
��+1.431
agram+1.196
gallery+1.188
 Sabha+1.186
Pinterest+1.168
Neg logit contributions
gencies-2.173
racuse-2.059
 Wra-1.974
 Observer-1.908
strom-1.890
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
gencies+1.778
racuse+1.658
 Wra+1.574
 Observer+1.505
strom+1.485
Neg logit contributions
��-2.307
agram-2.026
gallery-2.024
 corrid-2.016
 Sabha-2.004
Token a
Token ID257
Feature activation+4.57e-02

Pos logit contributions
gencies+0.551
racuse+0.517
 Wra+0.499
 Observer+0.479
strom+0.467
Neg logit contributions
��-0.565
agram-0.480
gallery-0.479
 corrid-0.477
Pinterest-0.476
Token cable
Token ID7862
Feature activation-9.38e-02

Pos logit contributions
gencies+0.237
racuse+0.222
 Wra+0.213
 Observer+0.205
strom+0.199
Neg logit contributions
��-0.261
agram-0.223
gallery-0.222
Pinterest-0.220
 Sabha-0.220
Token then
Token ID788
Feature activation-2.43e-01

Pos logit contributions
��+0.993
gallery+0.850
agram+0.849
 Sabha+0.845
 corrid+0.843
Neg logit contributions
gencies-1.004
racuse-0.935
 Wra-0.907
 Observer-0.868
strom-0.853
Token take
Token ID1011
Feature activation+1.06e-01

Pos logit contributions
��+1.246
agram+1.059
gallery+1.057
 Sabha+1.051
 corrid+1.047
Neg logit contributions
gencies-1.387
racuse-1.302
 Wra-1.254
 Observer-1.204
strom-1.186
Token a
Token ID257
Feature activation+2.19e-01

Pos logit contributions
gencies+0.549
racuse+0.512
 Wra+0.505
 Observer+0.479
strom+0.466
Neg logit contributions
��-0.599
agram-0.512
 corrid-0.509
gallery-0.508
Pinterest-0.504
Token tiny
Token ID7009
Feature activation+2.22e-01

Pos logit contributions
gencies+1.178
racuse+1.106
 Wra+1.097
 Observer+1.054
strom+1.027
Neg logit contributions
��-1.176
agram-1.009
gallery-1.000
 corrid-0.992
Pinterest-0.988
Token bite
Token ID13197
Feature activation+3.82e-01

Pos logit contributions
gencies+1.218
racuse+1.143
 Wra+1.134
 Observer+1.105
strom+1.067
Neg logit contributions
��-1.166
gallery-0.987
 corrid-0.986
agram-0.985
Pinterest-0.969
Token of
Token ID286
Feature activation+2.65e-01

Pos logit contributions
gencies+1.985
 Wra+1.822
racuse+1.803
 Observer+1.737
strom+1.714
Neg logit contributions
��-2.155
 corrid-1.857
 Sabha-1.826
agram-1.814
gallery-1.813
Token fabric
Token ID9664
Feature activation-1.21e-01

Pos logit contributions
gencies+1.295
racuse+1.214
 Wra+1.205
 Observer+1.140
strom+1.111
Neg logit contributions
��-1.555
gallery-1.348
 Sabha-1.338
agram-1.332
 corrid-1.327
Token on
Token ID319
Feature activation+3.38e-01

Pos logit contributions
��+0.725
agram+0.630
gallery+0.630
 Sabha+0.627
 corrid+0.626
Neg logit contributions
gencies-0.590
racuse-0.547
 Wra-0.531
 Observer-0.503
strom-0.492
Token the
Token ID262
Feature activation-1.38e-02

Pos logit contributions
gencies+1.778
racuse+1.665
 Wra+1.624
 Observer+1.537
strom+1.517
Neg logit contributions
��-1.861
gallery-1.607
agram-1.597
 Sabha-1.596
 corrid-1.595
Token side
Token ID1735
Feature activation-5.68e-02

Pos logit contributions
��+0.073
gallery+0.062
agram+0.062
 Sabha+0.062
 corrid+0.062
Neg logit contributions
gencies-0.076
racuse-0.071
 Wra-0.070
 Observer-0.067
strom-0.065
Token that
Token ID326
Feature activation-1.14e-01

Pos logit contributions
��+0.343
agram+0.298
gallery+0.298
 Sabha+0.297
 corrid+0.296
Neg logit contributions
gencies-0.272
racuse-0.252
 Wra-0.243
 Observer-0.231
strom-0.226
Token be
Token ID307
Feature activation+1.28e-01

Pos logit contributions
gencies+0.485
racuse+0.451
 Wra+0.432
 Observer+0.417
strom+0.409
Neg logit contributions
��-0.493
agram-0.420
gallery-0.420
 corrid-0.420
 Sabha-0.417
Token highly
Token ID4047
Feature activation-1.39e-01

Pos logit contributions
gencies+0.638
racuse+0.592
 Wra+0.573
 Observer+0.548
strom+0.539
Neg logit contributions
��-0.759
gallery-0.640
agram-0.639
Pinterest-0.635
 corrid-0.635
Token reliable
Token ID9314
Feature activation+2.54e-01

Pos logit contributions
��+0.910
gallery+0.786
agram+0.783
Pinterest+0.777
 corrid+0.777
Neg logit contributions
gencies-0.615
racuse-0.561
 Wra-0.539
 Observer-0.518
strom-0.503
Token.[
Token ID3693
Feature activation-3.93e-01

Pos logit contributions
gencies+1.190
racuse+1.098
 Wra+1.061
 Observer+1.022
strom+1.000
Neg logit contributions
��-1.560
agram-1.344
 corrid-1.340
gallery-1.334
Pinterest-1.333
Token12
Token ID1065
Feature activation+5.00e-01

Pos logit contributions
��+1.756
gallery+1.496
agram+1.495
 Sabha+1.464
Pinterest+1.464
Neg logit contributions
gencies-2.465
racuse-2.346
 Wra-2.241
 Observer-2.164
emort-2.145
Token]
Token ID60
Feature activation-2.16e-01

Pos logit contributions
gencies+2.998
racuse+2.829
 Wra+2.714
 Observer+2.615
strom+2.583
Neg logit contributions
��-2.403
agram-2.031
gallery-2.022
 Sabha-2.014
 corrid-1.997
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
��+1.254
agram+1.104
gallery+1.101
 Sabha+1.094
Pinterest+1.088
Neg logit contributions
gencies-1.071
racuse-0.998
 Wra-0.936
 Observer-0.892
emort-0.890
Token
Token ID198
Feature activation-4.46e-02

Pos logit contributions
��+0.962
 corrid+0.841
agram+0.818
 Sabha+0.815
gallery+0.811
Neg logit contributions
gencies-0.825
racuse-0.768
 Wra-0.741
 Observer-0.709
strom-0.699
TokenWithout
Token ID16249
Feature activation-2.00e-02

Pos logit contributions
��+0.258
 corrid+0.223
 Sabha+0.220
agram+0.218
gallery+0.216
Neg logit contributions
gencies-0.229
racuse-0.212
 Wra-0.203
 Observer-0.195
strom-0.195
Token a
Token ID257
Feature activation-5.68e-03

Pos logit contributions
��+0.111
agram+0.096
gallery+0.096
 Sabha+0.096
 corrid+0.095
Neg logit contributions
gencies-0.105
racuse-0.098
 Wra-0.094
 Observer-0.090
strom-0.089
Token working
Token ID1762
Feature activation+7.95e-02

Pos logit contributions
��+0.030
agram+0.026
gallery+0.025
 Sabha+0.025
 corrid+0.025
Neg logit contributions
gencies-0.032
racuse-0.030
 Wra-0.028
 Observer-0.027
strom-0.027
Token of
Token ID286
Feature activation-4.78e-01

Pos logit contributions
��+2.740
agram+2.415
gallery+2.408
 Sabha+2.385
 imagination+2.382
Neg logit contributions
gencies-2.450
racuse-2.301
 Wra-2.143
 Observer-2.042
strom-2.034
Token The
Token ID383
Feature activation+3.43e-02

Pos logit contributions
��+2.745
agram+2.423
gallery+2.422
 Sabha+2.399
Pinterest+2.391
Neg logit contributions
gencies-2.390
racuse-2.226
 Wra-2.064
emort-1.977
 Observer-1.969
Token Flash
Token ID9973
Feature activation-6.59e-01

Pos logit contributions
gencies+0.127
racuse+0.116
 Wra+0.108
 Observer+0.101
strom+0.099
Neg logit contributions
��-0.243
agram-0.218
gallery-0.217
 Sabha-0.217
 corrid-0.216
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
��+3.582
gallery+3.143
agram+3.135
 Sabha+3.092
Pinterest+3.083
Neg logit contributions
gencies-3.481
racuse-3.279
 Wra-3.013
emort-2.909
 Observer-2.894
Token Super
Token ID3115
Feature activation-2.68e-01

Pos logit contributions
��+0.708
agram+0.618
gallery+0.616
 Sabha+0.614
 corrid+0.610
Neg logit contributions
gencies-0.593
racuse-0.553
 Wra-0.526
 Observer-0.502
strom-0.494
Tokengirl
Token ID15219
Feature activation-8.37e-01

Pos logit contributions
��+1.515
agram+1.336
gallery+1.326
 Sabha+1.323
Pinterest+1.308
Neg logit contributions
gencies-1.358
racuse-1.278
 Wra-1.199
 Observer-1.144
emort-1.131
Token,
Token ID11
Feature activation-1.87e-01

Pos logit contributions
��+4.118
agram+3.576
gallery+3.550
Pinterest+3.548
 corrid+3.527
Neg logit contributions
gencies-4.742
racuse-4.426
 Wra-4.102
emort-4.019
 Observer-3.939
Token and
Token ID290
Feature activation-1.36e-01

Pos logit contributions
��+1.138
agram+1.003
gallery+1.001
 Sabha+0.995
Pinterest+0.989
Neg logit contributions
gencies-0.884
racuse-0.819
 Wra-0.771
 Observer-0.733
strom-0.723
Token D
Token ID360
Feature activation-2.19e-01

Pos logit contributions
��+0.862
agram+0.761
gallery+0.759
 Sabha+0.756
 corrid+0.751
Neg logit contributions
gencies-0.611
racuse-0.565
 Wra-0.533
 Observer-0.506
strom-0.497
Token.
Token ID13
Feature activation+7.72e-02

Pos logit contributions
��+0.962
agram+0.787
gallery+0.787
 Sabha+0.784
 corrid+0.778
Neg logit contributions
gencies-1.415
racuse-1.341
 Wra-1.294
 Observer-1.250
strom-1.235
TokenC
Token ID34
Feature activation-6.30e-02

Pos logit contributions
gencies+0.468
racuse+0.436
 Wra+0.436
 Observer+0.417
strom+0.412
Neg logit contributions
��-0.374
 Sabha-0.307
 corrid-0.306
agram-0.304
Pinterest-0.301
Token and
Token ID290
Feature activation-1.22e-01

Pos logit contributions
��+1.454
agram+1.274
gallery+1.271
 Sabha+1.268
Pinterest+1.256
Neg logit contributions
gencies-1.218
racuse-1.132
 Wra-1.072
 Observer-1.021
strom-1.005
Token massac
Token ID34622
Feature activation-3.96e-01

Pos logit contributions
��+0.713
agram+0.621
gallery+0.619
 Sabha+0.616
 corrid+0.612
Neg logit contributions
gencies-0.610
racuse-0.569
 Wra-0.543
 Observer-0.518
strom-0.510
Tokenred
Token ID445
Feature activation-1.75e-01

Pos logit contributions
��+2.301
agram+1.992
gallery+1.981
 Sabha+1.979
 corrid+1.977
Neg logit contributions
gencies-2.002
racuse-1.873
 Wra-1.781
 Observer-1.699
strom-1.678
Token the
Token ID262
Feature activation-1.84e-01

Pos logit contributions
��+0.982
agram+0.848
gallery+0.845
 Sabha+0.842
Pinterest+0.836
Neg logit contributions
gencies-0.909
racuse-0.853
 Wra-0.817
 Observer-0.782
strom-0.768
Token defending
Token ID11749
Feature activation-1.87e-01

Pos logit contributions
��+1.047
agram+0.904
gallery+0.901
 Sabha+0.896
Pinterest+0.893
Neg logit contributions
gencies-0.947
racuse-0.886
 Wra-0.850
 Observer-0.816
strom-0.797
Token champion
Token ID8783
Feature activation-3.22e-01

Pos logit contributions
��+1.097
agram+0.954
gallery+0.951
 Sabha+0.945
Pinterest+0.942
Neg logit contributions
gencies-0.929
racuse-0.866
 Wra-0.831
 Observer-0.795
strom-0.776
Token Eagles
Token ID13785
Feature activation-2.61e-01

Pos logit contributions
��+1.862
agram+1.624
gallery+1.619
 Sabha+1.613
 corrid+1.601
Neg logit contributions
gencies-1.614
racuse-1.505
 Wra-1.429
 Observer-1.364
strom-1.345
Token 35
Token ID3439
Feature activation-6.45e-01

Pos logit contributions
��+1.469
agram+1.284
gallery+1.277
 Sabha+1.273
 corrid+1.264
Neg logit contributions
gencies-1.350
racuse-1.261
 Wra-1.194
 Observer-1.141
emort-1.126
Token-
Token ID12
Feature activation+1.09e-01

Pos logit contributions
��+3.110
agram+2.711
gallery+2.710
 corrid+2.688
 Sabha+2.668
Neg logit contributions
gencies-3.714
racuse-3.559
 Wra-3.354
emort-3.227
 Observer-3.210
Token10
Token ID940
Feature activation-8.92e-02

Pos logit contributions
gencies+0.610
racuse+0.569
 Wra+0.550
strom+0.532
 Observer+0.529
Neg logit contributions
��-0.594
 corrid-0.513
 Sabha-0.491
agram-0.483
 imagination-0.478
Token behind
Token ID2157
Feature activation+7.07e-02

Pos logit contributions
��+0.522
agram+0.456
gallery+0.454
 Sabha+0.452
 corrid+0.449
Neg logit contributions
gencies-0.443
racuse-0.413
 Wra-0.393
 Observer-0.375
strom-0.369
Token
Token ID198
Feature activation-8.19e-01

Pos logit contributions
��+0.516
agram+0.445
gallery+0.443
 Sabha+0.441
 corrid+0.441
Neg logit contributions
gencies-0.443
racuse-0.411
 Wra-0.396
 Observer-0.379
strom-0.372
Token
Token ID198
Feature activation-5.20e-01

Pos logit contributions
��+3.897
gallery+3.611
agram+3.563
Pinterest+3.551
 Sabha+3.491
Neg logit contributions
gencies-4.677
racuse-4.413
 Wra-4.101
emort-4.039
 Observer-3.934
TokenReply
Token ID36875
Feature activation+7.11e-02

Pos logit contributions
��+2.859
agram+2.495
gallery+2.494
Pinterest+2.466
 Sabha+2.466
Neg logit contributions
gencies-2.740
racuse-2.575
 Wra-2.451
 Observer-2.344
strom-2.303
Token ·
Token ID14128
Feature activation+5.37e-01

Pos logit contributions
gencies+0.161
racuse+0.136
 Wra+0.122
 Observer+0.111
strom+0.104
Neg logit contributions
��-0.610
agram-0.555
 Sabha-0.554
gallery-0.554
 corrid-0.552
Token Report
Token ID6358
Feature activation-3.21e-01

Pos logit contributions
gencies+1.452
racuse+1.250
 Wra+1.209
 Observer+1.130
strom+1.013
Neg logit contributions
��-4.369
 corrid-3.934
agram-3.906
 Sabha-3.900
gallery-3.895
Token Post
Token ID2947
Feature activation-5.31e-01

Pos logit contributions
��+1.667
agram+1.522
gallery+1.476
Pinterest+1.462
ال+1.460
Neg logit contributions
gencies-1.759
racuse-1.641
 Wra-1.486
emort-1.474
 Observer-1.430
Token<|endoftext|>
Token ID50256
Feature activation-9.19e-01

Pos logit contributions
��+2.752
gallery+2.418
 Sabha+2.387
agram+2.385
Pinterest+2.375
Neg logit contributions
gencies-2.967
racuse-2.831
 Wra-2.528
emort-2.515
strom-2.471
TokenThe
Token ID464
Feature activation-1.23e-01

Pos logit contributions
��+4.770
Pinterest+4.322
gallery+4.315
agram+4.262
 Sabha+4.156
Neg logit contributions
gencies-4.908
racuse-4.658
 Wra-4.420
 Observer-4.246
emort-4.180
Token kids
Token ID3988
Feature activation+8.43e-02

Pos logit contributions
��+0.696
agram+0.603
gallery+0.601
 Sabha+0.598
 corrid+0.595
Neg logit contributions
gencies-0.638
racuse-0.596
 Wra-0.569
 Observer-0.545
strom-0.535
Token at
Token ID379
Feature activation-2.56e-02

Pos logit contributions
gencies+0.432
racuse+0.400
 Wra+0.391
 Observer+0.374
strom+0.366
Neg logit contributions
��-0.483
 corrid-0.412
gallery-0.412
agram-0.411
 Sabha-0.410
Token South
Token ID2520
Feature activation-2.43e-01

Pos logit contributions
��+0.146
agram+0.126
gallery+0.126
 Sabha+0.125
 corrid+0.125
Neg logit contributions
gencies-0.132
racuse-0.123
 Wra-0.118
 Observer-0.113
strom-0.110
TokenVarious
Token ID40009
Feature activation-8.47e-02

Pos logit contributions
gencies+0.588
racuse+0.540
 Wra+0.520
strom+0.501
 Observer+0.498
Neg logit contributions
��-0.674
 corrid-0.579
 Sabha-0.573
agram-0.566
gallery-0.559
Token sound
Token ID2128
Feature activation-2.59e-01

Pos logit contributions
��+0.460
agram+0.395
gallery+0.394
 Sabha+0.393
Pinterest+0.390
Neg logit contributions
gencies-0.456
racuse-0.426
 Wra-0.410
 Observer-0.393
strom-0.386
Tokentracks
Token ID46074
Feature activation+8.93e-02

Pos logit contributions
��+1.893
agram+1.697
gallery+1.692
 Sabha+1.688
Pinterest+1.682
Neg logit contributions
gencies-0.911
racuse-0.819
 Wra-0.766
 Observer-0.712
strom-0.697
Token have
Token ID423
Feature activation-2.42e-01

Pos logit contributions
gencies+0.451
racuse+0.416
 Wra+0.406
 Observer+0.386
strom+0.378
Neg logit contributions
��-0.513
gallery-0.445
Pinterest-0.444
 Sabha-0.443
agram-0.442
Token been
Token ID587
Feature activation-1.23e-01

Pos logit contributions
��+1.192
agram+1.012
gallery+1.008
 Sabha+1.003
Pinterest+0.995
Neg logit contributions
gencies-1.421
racuse-1.339
 Wra-1.284
 Observer-1.235
strom-1.219
Token released
Token ID2716
Feature activation-1.58e-01

Pos logit contributions
��+0.700
gallery+0.602
agram+0.600
 Sabha+0.599
Pinterest+0.598
Neg logit contributions
gencies-0.631
racuse-0.588
 Wra-0.567
 Observer-0.540
strom-0.533
Token featuring
Token ID9593
Feature activation-1.08e-01

Pos logit contributions
��+0.878
agram+0.760
gallery+0.757
 Sabha+0.755
 corrid+0.750
Neg logit contributions
gencies-0.827
racuse-0.774
 Wra-0.738
 Observer-0.707
strom-0.697
Token music
Token ID2647
Feature activation-3.01e-01

Pos logit contributions
��+0.617
agram+0.535
gallery+0.533
 Sabha+0.531
Pinterest+0.529
Neg logit contributions
gencies-0.555
racuse-0.518
 Wra-0.495
 Observer-0.473
strom-0.465
Token composed
Token ID13160
Feature activation-6.62e-02

Pos logit contributions
��+1.628
agram+1.403
gallery+1.397
 Sabha+1.392
 corrid+1.385
Neg logit contributions
gencies-1.635
racuse-1.535
 Wra-1.457
 Observer-1.398
strom-1.381
Token by
Token ID416
Feature activation+6.50e-01

Pos logit contributions
��+0.371
agram+0.321
 corrid+0.317
gallery+0.317
 Sabha+0.315
Neg logit contributions
gencies-0.350
racuse-0.327
 Wra-0.311
 Observer-0.298
strom-0.292
Token Russ
Token ID1887
Feature activation+4.57e-01

Pos logit contributions
gencies+3.272
racuse+3.038
 Wra+3.029
 Observer+2.908
strom+2.810
Neg logit contributions
��-3.645
Pinterest-3.122
gallery-3.115
agram-3.092
 Sabha-3.080
Token but
Token ID475
Feature activation+5.50e-01

Pos logit contributions
gencies+0.117
racuse+0.108
 Wra+0.103
 Observer+0.098
strom+0.096
Neg logit contributions
��-0.151
agram-0.131
gallery-0.131
 corrid-0.130
 Sabha-0.130
Token not
Token ID407
Feature activation+5.01e-01

Pos logit contributions
gencies+2.929
racuse+2.690
 Wra+2.658
 Observer+2.557
strom+2.474
Neg logit contributions
��-3.127
gallery-2.621
agram-2.583
Pinterest-2.556
 corrid-2.539
Token necessarily
Token ID6646
Feature activation+1.03e-01

Pos logit contributions
gencies+2.801
racuse+2.619
 Wra+2.563
 Observer+2.454
strom+2.410
Neg logit contributions
��-2.681
agram-2.229
gallery-2.214
 corrid-2.190
Pinterest-2.176
Token so
Token ID523
Feature activation+2.09e-01

Pos logit contributions
gencies+0.540
racuse+0.505
 Wra+0.485
 Observer+0.465
strom+0.456
Neg logit contributions
��-0.582
agram-0.495
gallery-0.494
 corrid-0.489
Pinterest-0.488
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
gencies+1.125
racuse+1.051
 Wra+1.016
strom+0.970
 Observer+0.968
Neg logit contributions
��-1.145
gallery-0.977
agram-0.970
Pinterest-0.959
 corrid-0.957
Token Dis
Token ID3167
Feature activation-1.02e+00

Pos logit contributions
gencies+0.075
racuse+0.069
 Wra+0.067
 Observer+0.064
strom+0.062
Neg logit contributions
��-0.106
 corrid-0.092
agram-0.092
gallery-0.092
 Sabha-0.091
Tokenconnect
Token ID8443
Feature activation-4.01e-02

Pos logit contributions
��+5.032
agram+4.532
gallery+4.421
 Sabha+4.389
Pinterest+4.331
Neg logit contributions
gencies-5.579
racuse-5.337
 Wra-5.082
 Observer-4.852
emort-4.825
Token the
Token ID262
Feature activation+1.27e-01

Pos logit contributions
��+0.224
agram+0.193
gallery+0.193
 Sabha+0.192
Pinterest+0.191
Neg logit contributions
gencies-0.210
racuse-0.197
 Wra-0.188
 Observer-0.180
strom-0.177
Token mouse
Token ID10211
Feature activation-2.56e-01

Pos logit contributions
gencies+0.686
racuse+0.644
 Wra+0.626
 Observer+0.604
strom+0.583
Neg logit contributions
��-0.699
agram-0.590
gallery-0.590
Pinterest-0.584
 corrid-0.581
Token,
Token ID11
Feature activation-5.18e-02

Pos logit contributions
��+1.443
agram+1.248
gallery+1.245
 Sabha+1.239
 corrid+1.233
Neg logit contributions
gencies-1.332
racuse-1.247
 Wra-1.189
 Observer-1.139
strom-1.121
Token serial
Token ID11389
Feature activation-1.37e+00

Pos logit contributions
��+0.283
agram+0.242
gallery+0.240
 Sabha+0.239
 corrid+0.238
Neg logit contributions
gencies-0.280
racuse-0.262
 Wra-0.252
 Observer-0.243
strom-0.238
Token exemptions
Token ID30244
Feature activation-1.81e-01

Pos logit contributions
gencies+2.375
 Wra+2.066
racuse+2.060
 Observer+1.986
strom+1.880
Neg logit contributions
��-3.759
agram-3.309
gallery-3.268
 corrid-3.260
 Sabha-3.235
Token from
Token ID422
Feature activation+1.75e-01

Pos logit contributions
��+1.003
agram+0.870
gallery+0.867
 Sabha+0.864
Pinterest+0.857
Neg logit contributions
gencies-0.950
racuse-0.890
 Wra-0.846
 Observer-0.809
strom-0.798
Token federal
Token ID2717
Feature activation+2.00e-01

Pos logit contributions
gencies+0.904
racuse+0.840
 Wra+0.819
 Observer+0.783
strom+0.766
Neg logit contributions
��-0.999
gallery-0.852
agram-0.852
 corrid-0.849
 Sabha-0.847
Token,
Token ID11
Feature activation-4.79e-01

Pos logit contributions
gencies+1.043
racuse+0.959
 Wra+0.936
 Observer+0.895
strom+0.880
Neg logit contributions
��-1.135
gallery-0.973
 Sabha-0.963
agram-0.962
 corrid-0.956
Token state
Token ID1181
Feature activation-4.53e-01

Pos logit contributions
��+2.465
agram+2.114
gallery+2.105
 Sabha+2.095
Pinterest+2.077
Neg logit contributions
gencies-2.701
racuse-2.544
 Wra-2.427
 Observer-2.330
strom-2.302
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
��+2.464
agram+2.164
 corrid+2.138
gallery+2.138
 Sabha+2.137
Neg logit contributions
gencies-2.367
racuse-2.228
 Wra-2.115
 Observer-2.017
emort-1.996
Token local
Token ID1957
Feature activation-7.88e-01

Pos logit contributions
��+4.565
agram+3.962
gallery+3.909
 imagination+3.868
 Sabha+3.841
Neg logit contributions
gencies-6.568
racuse-6.361
 Wra-6.021
emort-5.848
agnetic-5.729
Token governments
Token ID6905
Feature activation+1.15e-01

Pos logit contributions
��+3.599
agram+3.070
gallery+3.030
 Sabha+3.027
Pinterest+3.015
Neg logit contributions
gencies-4.815
racuse-4.597
 Wra-4.368
 Observer-4.199
emort-4.164
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
gencies+0.582
racuse+0.542
 Wra+0.517
 Observer+0.494
strom+0.487
Neg logit contributions
��-0.660
agram-0.573
gallery-0.572
 Sabha-0.569
 corrid-0.566
Token Good
Token ID4599
Feature activation-4.65e-01

Pos logit contributions
��+1.874
agram+1.649
gallery+1.644
 Sabha+1.638
Pinterest+1.626
Neg logit contributions
gencies-1.439
racuse-1.338
 Wra-1.262
 Observer-1.201
strom-1.183
Tokenwill
Token ID10594
Feature activation+3.83e-02

Pos logit contributions
��+3.255
gallery+2.930
agram+2.926
 Sabha+2.910
Pinterest+2.898
Neg logit contributions
gencies-1.754
racuse-1.602
 Wra-1.485
 Observer-1.392
emort-1.359
Token people
Token ID661
Feature activation-1.35e-01

Pos logit contributions
gencies+1.529
racuse+1.426
 Wra+1.378
 Observer+1.316
strom+1.294
Neg logit contributions
��-1.547
gallery-1.311
agram-1.310
 Sabha-1.301
 corrid-1.291
Token dislike
Token ID23109
Feature activation-5.87e-03

Pos logit contributions
��+0.714
agram+0.617
gallery+0.613
 Sabha+0.611
Pinterest+0.606
Neg logit contributions
gencies-0.748
racuse-0.703
 Wra-0.670
 Observer-0.643
strom-0.634
Token the
Token ID262
Feature activation+3.22e-01

Pos logit contributions
��+0.031
agram+0.027
gallery+0.027
 Sabha+0.027
 corrid+0.027
Neg logit contributions
gencies-0.032
racuse-0.030
 Wra-0.029
 Observer-0.028
strom-0.027
Token taste
Token ID6938
Feature activation+1.07e-01

Pos logit contributions
gencies+1.833
racuse+1.714
 Wra+1.690
 Observer+1.609
strom+1.581
Neg logit contributions
��-1.630
gallery-1.376
agram-1.365
 corrid-1.357
 Sabha-1.342
Token so
Token ID523
Feature activation-7.16e-02

Pos logit contributions
gencies+0.523
racuse+0.487
 Wra+0.462
 Observer+0.441
strom+0.434
Neg logit contributions
��-0.634
agram-0.554
gallery-0.552
 Sabha-0.550
 corrid-0.547
Token much
Token ID881
Feature activation-1.01e+00

Pos logit contributions
��+0.365
agram+0.310
gallery+0.310
 Sabha+0.308
 corrid+0.305
Neg logit contributions
gencies-0.410
racuse-0.386
 Wra-0.371
 Observer-0.355
strom-0.352
Token,
Token ID11
Feature activation-5.23e-01

Pos logit contributions
��+4.687
agram+4.051
 imagination+4.048
 Sabha+4.024
Pinterest+4.024
Neg logit contributions
gencies-5.868
racuse-5.538
 Wra-5.155
 Observer-5.083
emort-5.042
Token they
Token ID484
Feature activation-7.11e-01

Pos logit contributions
��+2.689
agram+2.321
gallery+2.306
 Sabha+2.299
Pinterest+2.298
Neg logit contributions
gencies-2.929
racuse-2.754
 Wra-2.596
 Observer-2.518
emort-2.490
Token are
Token ID389
Feature activation-3.82e-01

Pos logit contributions
��+3.612
agram+3.299
 imagination+3.230
gallery+3.221
Pinterest+3.209
Neg logit contributions
gencies-3.818
racuse-3.644
 Wra-3.363
emort-3.280
 Observer-3.255
Token willing
Token ID4684
Feature activation+3.71e-01

Pos logit contributions
��+2.115
agram+1.838
gallery+1.830
 Sabha+1.823
Pinterest+1.811
Neg logit contributions
gencies-2.012
racuse-1.887
 Wra-1.791
 Observer-1.716
strom-1.690
Token to
Token ID284
Feature activation+1.81e-01

Pos logit contributions
gencies+1.888
racuse+1.758
 Wra+1.663
 Observer+1.599
strom+1.569
Neg logit contributions
��-2.137
agram-1.860
gallery-1.854
 corrid-1.848
 Sabha-1.840
Token Press
Token ID4332
Feature activation+6.96e-01

Pos logit contributions
gencies+2.266
 Wra+2.143
 Observer+2.032
racuse+2.028
strom+1.899
Neg logit contributions
��-4.973
 corrid-4.463
agram-4.178
gallery-4.142
 Sabha-4.079
Token is
Token ID318
Feature activation+8.44e-01

Pos logit contributions
gencies+3.008
racuse+2.764
 Wra+2.749
 Observer+2.578
strom+2.509
Neg logit contributions
��-4.488
agram-3.910
gallery-3.881
 Sabha-3.869
 corrid-3.860
Token reporting
Token ID6447
Feature activation+4.14e-02

Pos logit contributions
gencies+4.372
 Wra+4.148
racuse+3.968
 Observer+3.900
strom+3.728
Neg logit contributions
��-4.772
agram-3.937
 Sabha-3.872
 corrid-3.834
gallery-3.790
Token that
Token ID326
Feature activation+1.32e-01

Pos logit contributions
gencies+0.219
racuse+0.205
 Wra+0.197
 Observer+0.189
strom+0.185
Neg logit contributions
��-0.229
agram-0.197
gallery-0.197
 Sabha-0.195
 corrid-0.194
Token AT
Token ID5161
Feature activation+1.74e-01

Pos logit contributions
gencies+0.690
racuse+0.647
 Wra+0.626
 Observer+0.602
strom+0.586
Neg logit contributions
��-0.734
gallery-0.628
agram-0.626
 corrid-0.619
 Sabha-0.618
Token&
Token ID5
Feature activation-1.20e+00

Pos logit contributions
gencies+0.565
racuse+0.505
 Wra+0.476
 Observer+0.442
strom+0.432
Neg logit contributions
��-1.339
agram-1.182
gallery-1.181
 Sabha-1.175
 corrid-1.173
TokenT
Token ID51
Feature activation-1.43e-01

Pos logit contributions
��+5.183
Pinterest+4.711
gallery+4.691
agram+4.670
 Sabha+4.541
Neg logit contributions
gencies-7.371
racuse-7.005
emort-6.468
 Wra-6.456
olicy-6.238
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
��+0.805
agram+0.701
gallery+0.698
 Sabha+0.695
Pinterest+0.690
Neg logit contributions
gencies-0.741
racuse-0.693
 Wra-0.659
 Observer-0.630
strom-0.621
Token the
Token ID262
Feature activation+9.28e-02

Pos logit contributions
��+0.695
agram+0.599
 Sabha+0.597
gallery+0.596
 corrid+0.594
Neg logit contributions
gencies-0.606
racuse-0.564
 Wra-0.545
 Observer-0.522
strom-0.509
Token defendant
Token ID11304
Feature activation-3.43e-01

Pos logit contributions
gencies+0.496
racuse+0.462
 Wra+0.455
 Observer+0.442
strom+0.422
Neg logit contributions
��-0.516
 Sabha-0.433
 corrid-0.430
gallery-0.430
agram-0.428
Token in
Token ID287
Feature activation-1.09e-01

Pos logit contributions
��+1.825
agram+1.565
gallery+1.564
 Sabha+1.557
Pinterest+1.544
Neg logit contributions
gencies-1.880
racuse-1.765
 Wra-1.690
 Observer-1.621
strom-1.599
Token.
Token ID13
Feature activation-2.46e-02

Pos logit contributions
��+0.795
agram+0.694
gallery+0.693
 Sabha+0.690
 corrid+0.688
Neg logit contributions
gencies-0.616
racuse-0.570
 Wra-0.544
 Observer-0.518
strom-0.509
Token But
Token ID887
Feature activation+2.29e-01

Pos logit contributions
��+0.144
 corrid+0.125
agram+0.124
gallery+0.124
 Sabha+0.123
Neg logit contributions
gencies-0.122
racuse-0.114
 Wra-0.110
 Observer-0.105
strom-0.103
Token the
Token ID262
Feature activation+3.56e-02

Pos logit contributions
gencies+1.193
racuse+1.100
 Wra+1.076
 Observer+1.048
strom+1.003
Neg logit contributions
��-1.307
agram-1.104
gallery-1.104
 corrid-1.102
Pinterest-1.084
Token challenge
Token ID4427
Feature activation+4.43e-01

Pos logit contributions
gencies+0.189
racuse+0.175
 Wra+0.170
 Observer+0.166
strom+0.159
Neg logit contributions
��-0.198
agram-0.169
gallery-0.169
 corrid-0.167
 Sabha-0.167
Token at
Token ID379
Feature activation-8.98e-02

Pos logit contributions
gencies+2.121
 Wra+1.937
racuse+1.933
 Observer+1.897
strom+1.811
Neg logit contributions
��-2.662
 corrid-2.281
gallery-2.257
 Sabha-2.221
agram-2.221
Token present
Token ID1944
Feature activation-1.10e+00

Pos logit contributions
��+0.515
gallery+0.445
agram+0.444
 corrid+0.442
 Sabha+0.441
Neg logit contributions
gencies-0.460
racuse-0.424
 Wra-0.410
 Observer-0.395
strom-0.382
Token is
Token ID318
Feature activation-5.78e-01

Pos logit contributions
��+5.119
agram+4.556
 Sabha+4.468
gallery+4.449
 imagination+4.427
Neg logit contributions
gencies-6.407
racuse-6.156
 Wra-5.740
emort-5.587
 Observer-5.523
Token that
Token ID326
Feature activation-4.18e-01

Pos logit contributions
��+3.049
agram+2.642
gallery+2.624
 Sabha+2.623
Pinterest+2.601
Neg logit contributions
gencies-3.166
racuse-2.993
 Wra-2.830
 Observer-2.710
emort-2.685
Token we
Token ID356
Feature activation-5.13e-01

Pos logit contributions
��+2.228
agram+1.932
 Sabha+1.921
gallery+1.921
Pinterest+1.902
Neg logit contributions
gencies-2.279
racuse-2.145
 Wra-2.034
 Observer-1.942
strom-1.926
Token don
Token ID836
Feature activation-4.20e-01

Pos logit contributions
��+2.814
agram+2.460
gallery+2.443
 Sabha+2.432
Pinterest+2.414
Neg logit contributions
gencies-2.706
racuse-2.545
 Wra-2.413
 Observer-2.308
strom-2.275
Token't
Token ID470
Feature activation-7.94e-01

Pos logit contributions
��+2.443
agram+2.121
gallery+2.114
 Sabha+2.108
 corrid+2.107
Neg logit contributions
gencies-2.102
racuse-1.960
 Wra-1.866
 Observer-1.782
strom-1.756