TOP ACTIVATIONS
MAX = 0.992

: 44 Se eker wrote : Cl arity Gaming
water bottles . <|endoftext|> Name : MSI R 78 70 Hawk
( args [" images "] ): # load the image ,
190 Birth Date : October 16 , 1981
: 44 Se eker wrote : Cl arity Gaming
: Man aged Access Point : Not - Associated T x
Game ] Platform : IBM PC Comp atible
Blog Joined April 2012 United States 5 667 Posts # 12
-- images ", required = True , help =" path to
50 pur ak ushi wrote : Happy it worked
en ), uncle ( aba qa ), [ 4 ] firm
along he walks , the greater the destruction and injuries .
Studios Release Date : May 11 , 2000
on Friday . WHERE : The Formula One race
arrest warrant to tackle cross - border crime . Instead ,
37 Dark ho arse wrote : Show nested quote
fighting ." <|endoftext|> Anonymous asked : AU idea : slightly cl
read ( image Path ) image = im utils . res
# of Interview s : Position :
four other house hold members . But

MIN ACTIVATIONS
MIN = -0.934

advertisement Story continues below advertisement The government
professional help . Continue Reading Below Advertisement 9 Work
Carly Fiorina 's political career Fiorina , right , poses for
Free variations in analysis Did you ever wonder what
. Continue Reading Below Advertisement <|endoftext|> There are many
At omic The heart of the library
vec () -> y () == 0 . 0 f );
AL ERT PH AS ES Phase 1 : No infections in
We Can Do About It by Marc Goodman , available February
. Story continues below advertisement It 's
. Story continues below advertisement " Interestingly
in the institute , Wh ist ling Woods and it should
{ // Time component const float TA U = 6 .
Free variations in analysis Did you ever wonder
. Story continues below advertisement Story continues
Travel With The Nation Be the first to hear about
grid cell volume . const float one Over Cell Volume =
b ] Personal best : [/ b ] 1 : 07
paid in 2010 is quite likely . And then
s being tested Local housing councils are testing any

INTERVAL 0.510 - 0.992
CONTAINS 0.144%

hand out <|endoftext|> Or at least 26 of 27 : THE
x 001 f 1 f 4 b be 66 power off
count ) count FROM cl ojure . libraries WHERE library CON
' boot . build ' -- Save the results to cl
= New - Object System . Draw ing . Font ($

INTERVAL 0.029 - 0.510
CONTAINS 42.434%

without daily hygiene . What few possessions Hilton had
the Affordable Care Act and sell the public on their plan
NHL s best defense men and elite power play
"… Alright ." Ste lla and Ik
] and yet we let the shuttle lapse , instead of

INTERVAL -0.452 - 0.029
CONTAINS 57.028%

nearly $ 4 billion .) At that point , though ,
property <|endoftext|> B ishops Opp ose Catholic Universities Off ering Extended
s many mistakes , ignoring the fact that , to
then Bernard Sanders is going to be around for a while
absence of v ort ons . V ort ons , meanwhile

INTERVAL -0.934 - -0.452
CONTAINS 0.394%

been upgraded regularly over the years , had also been heavily
We re the only school that offers this in
is crooked cannot be made straight ; and that
the Broncos in 2016 as they missed the postseason .
With love and light George …… .
Token:
Token ID25
Feature activation+5.58e-01

Pos logit contributions
present+3.257
 Bris+2.844
 Practices+2.685
eous+2.631
 Torn+2.629
Neg logit contributions
��-7.196
 vet-5.221
ibaba-5.000
 manipulative-4.764
yden-4.749
Token44
Token ID2598
Feature activation+2.74e-01

Pos logit contributions
present+3.008
 Bris+2.752
 Practices+2.622
 Torn+2.590
eous+2.544
Neg logit contributions
��-4.698
 vet-3.381
ibaba-3.099
 manipulative-3.004
yden-2.963
Token Se
Token ID1001
Feature activation+1.97e-01

Pos logit contributions
present+1.149
 Bris+1.055
 Practices+0.981
 Torn+0.963
eous+0.938
Neg logit contributions
��-2.602
 vet-1.967
ibaba-1.817
 manipulative-1.777
yden-1.766
Tokeneker
Token ID28233
Feature activation+2.97e-01

Pos logit contributions
present+1.265
 Bris+1.175
 Practices+1.112
eous+1.108
 Torn+1.108
Neg logit contributions
��-1.464
 vet-0.984
ibaba-0.872
 manipulative-0.861
sterdam-0.819
Token wrote
Token ID2630
Feature activation+6.69e-01

Pos logit contributions
present+1.109
 Bris+1.001
 Practices+0.913
 Torn+0.894
eous+0.878
Neg logit contributions
��-2.977
 vet-2.268
ibaba-2.118
 manipulative-2.071
yden-2.050
Token:
Token ID25
Feature activation+9.92e-01

Pos logit contributions
present+2.990
 Bris+2.656
 Practices+2.494
 Torn+2.460
eous+2.445
Neg logit contributions
��-6.361
 vet-4.639
ibaba-4.397
 manipulative-4.220
yden-4.179
Token
Token ID198
Feature activation+6.24e-01

Pos logit contributions
present+3.697
 Bris+3.114
 Practices+2.883
eous+2.839
 Torn+2.824
Neg logit contributions
��-10.200
 vet-7.608
ibaba-7.239
 manipulative-6.991
sterdam-6.932
Token
Token ID198
Feature activation+6.25e-01

Pos logit contributions
present+2.510
 Bris+2.213
 Practices+2.059
 Torn+2.019
eous+1.991
Neg logit contributions
��-6.199
 vet-4.632
ibaba-4.357
 manipulative-4.226
yden-4.194
TokenCl
Token ID2601
Feature activation-1.32e-01

Pos logit contributions
present+2.467
 Bris+2.243
 Practices+2.077
 Torn+2.034
eous+1.982
Neg logit contributions
��-6.101
 vet-4.653
ibaba-4.311
 manipulative-4.220
yden-4.192
Tokenarity
Token ID6806
Feature activation-1.16e-01

Pos logit contributions
��+0.926
 vet+0.622
ibaba+0.553
yden+0.528
 manipulative+0.528
Neg logit contributions
present-0.880
 Bris-0.838
 Practices-0.803
 Torn-0.794
eous-0.775
Token Gaming
Token ID14426
Feature activation+3.29e-01

Pos logit contributions
��+0.826
 vet+0.571
ibaba+0.493
 manipulative+0.484
yden+0.476
Neg logit contributions
present-0.750
 Bris-0.712
 Torn-0.675
 Practices-0.673
eous-0.660
Token water
Token ID1660
Feature activation-2.04e-02

Pos logit contributions
��+0.273
 vet+0.194
ibaba+0.170
 manipulative+0.169
yden+0.165
Neg logit contributions
present-0.211
 Bris-0.197
 Practices-0.187
eous-0.183
 Torn-0.183
Token bottles
Token ID14096
Feature activation-1.29e-01

Pos logit contributions
��+0.148
 vet+0.105
ibaba+0.090
 manipulative+0.090
 col+0.088
Neg logit contributions
present-0.130
 Bris-0.122
 Practices-0.116
 Torn-0.114
eous-0.114
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
��+1.081
 vet+0.800
ibaba+0.712
 manipulative+0.709
yden+0.699
Neg logit contributions
present-0.669
 Bris-0.625
 Practices-0.591
 Torn-0.578
eous-0.568
Token<|endoftext|>
Token ID50256
Feature activation-5.49e-02

Pos logit contributions
present+0.105
 Bris+0.098
 Practices+0.092
 Torn+0.090
eous+0.090
Neg logit contributions
��-0.179
 vet-0.133
ibaba-0.119
 manipulative-0.117
yden-0.116
TokenName
Token ID5376
Feature activation+4.47e-01

Pos logit contributions
��+0.456
 vet+0.329
ibaba+0.299
 manipulative+0.292
yden+0.292
Neg logit contributions
present-0.287
 Bris-0.276
 Practices-0.261
 Torn-0.256
eous-0.251
Token:
Token ID25
Feature activation+9.10e-01

Pos logit contributions
present+2.228
 Bris+2.070
 Practices+1.950
 Torn+1.917
eous+1.886
Neg logit contributions
��-3.896
 vet-2.869
ibaba-2.620
 manipulative-2.554
yden-2.535
Token MSI
Token ID30781
Feature activation+2.98e-01

Pos logit contributions
present+5.226
 Bris+4.946
 Torn+4.668
 Practices+4.660
 Bind+4.556
Neg logit contributions
��-7.123
 vet-4.971
ibaba-4.520
 manipulative-4.374
yden-4.354
Token R
Token ID371
Feature activation+2.47e-02

Pos logit contributions
present+1.559
 Bris+1.460
 Practices+1.377
 Torn+1.350
eous+1.339
Neg logit contributions
��-2.528
 vet-1.837
ibaba-1.663
 manipulative-1.630
yden-1.608
Token78
Token ID3695
Feature activation+2.92e-01

Pos logit contributions
present+0.127
 Bris+0.120
 Practices+0.113
 Torn+0.111
eous+0.109
Neg logit contributions
��-0.212
 vet-0.153
ibaba-0.139
yden-0.136
 manipulative-0.136
Token70
Token ID2154
Feature activation+2.94e-01

Pos logit contributions
present+1.607
 Bris+1.519
 Practices+1.428
eous+1.405
 Torn+1.397
Neg logit contributions
��-2.408
 vet-1.729
ibaba-1.543
 manipulative-1.508
 col-1.491
Token Hawk
Token ID26698
Feature activation+1.45e-01

Pos logit contributions
present+1.528
 Bris+1.429
 Practices+1.351
 Torn+1.329
eous+1.308
Neg logit contributions
��-2.496
 vet-1.815
ibaba-1.647
 manipulative-1.609
yden-1.592
Token(
Token ID7
Feature activation-9.67e-03

Pos logit contributions
present+0.948
 Bris+0.893
 Practices+0.848
 Torn+0.834
eous+0.817
Neg logit contributions
��-1.335
 vet-0.956
ibaba-0.859
 manipulative-0.837
yden-0.830
Tokenargs
Token ID22046
Feature activation+2.09e-01

Pos logit contributions
��+0.078
 vet+0.056
ibaba+0.050
 manipulative+0.049
yden+0.049
Neg logit contributions
present-0.053
 Bris-0.051
 Practices-0.049
 Torn-0.048
 Bind-0.047
Token["
Token ID14692
Feature activation-3.69e-02

Pos logit contributions
present+0.862
 Bris+0.789
 Practices+0.734
 Torn+0.718
eous+0.699
Neg logit contributions
��-2.002
 vet-1.521
ibaba-1.402
 manipulative-1.372
yden-1.366
Tokenimages
Token ID17566
Feature activation+7.03e-02

Pos logit contributions
��+0.300
 vet+0.215
ibaba+0.195
yden+0.189
 manipulative+0.188
Neg logit contributions
present-0.201
 Bris-0.193
 Practices-0.184
 Torn-0.180
eous-0.176
Token"]
Token ID8973
Feature activation+6.67e-03

Pos logit contributions
present+0.353
 Bris+0.333
 Practices+0.315
 Torn+0.306
eous+0.301
Neg logit contributions
��-0.607
 vet-0.447
ibaba-0.407
 manipulative-0.396
yden-0.395
Token):
Token ID2599
Feature activation+8.91e-01

Pos logit contributions
present+0.028
 Bris+0.026
 Practices+0.024
 Torn+0.023
eous+0.022
Neg logit contributions
��-0.063
 vet-0.048
ibaba-0.044
 manipulative-0.043
yden-0.043
Token #
Token ID1303
Feature activation+4.42e-01

Pos logit contributions
present+4.954
 Bris+4.569
 Practices+4.365
 Torn+4.354
 Bind+4.254
Neg logit contributions
��-7.092
 vet-4.888
ibaba-4.602
yden-4.424
 manipulative-4.402
Token load
Token ID3440
Feature activation+4.19e-01

Pos logit contributions
present+2.749
 Bris+2.565
 Practices+2.460
 Torn+2.438
 Bind+2.393
Neg logit contributions
��-3.309
 vet-2.262
ibaba-2.046
 manipulative-1.980
yden-1.966
Token the
Token ID262
Feature activation+3.37e-01

Pos logit contributions
present+2.338
 Bris+2.193
 Practices+2.082
 Torn+2.050
eous+2.016
Neg logit contributions
��-3.403
 vet-2.435
ibaba-2.202
 manipulative-2.143
yden-2.125
Token image
Token ID2939
Feature activation+3.98e-01

Pos logit contributions
present+1.832
 Bris+1.716
 Practices+1.631
 Torn+1.596
eous+1.578
Neg logit contributions
��-2.779
 vet-2.015
ibaba-1.812
 manipulative-1.772
yden-1.753
Token,
Token ID11
Feature activation+2.27e-01

Pos logit contributions
present+2.273
 Bris+2.135
 Practices+2.029
 Torn+2.001
eous+1.966
Neg logit contributions
��-3.175
 vet-2.255
ibaba-2.035
 manipulative-1.979
yden-1.962
Token 190
Token ID19884
Feature activation+1.25e-01

Pos logit contributions
present+1.264
 Bris+1.182
 Practices+1.121
 Torn+1.100
eous+1.084
Neg logit contributions
��-1.927
 vet-1.396
ibaba-1.262
 manipulative-1.232
yden-1.217
Token
Token ID198
Feature activation+1.41e-01

Pos logit contributions
present+0.609
 Bris+0.566
 Practices+0.531
 Torn+0.523
eous+0.514
Neg logit contributions
��-1.107
 vet-0.823
ibaba-0.751
 manipulative-0.733
yden-0.724
Token
Token ID198
Feature activation+2.00e-01

Pos logit contributions
present+0.624
 Bris+0.582
 Practices+0.543
 Torn+0.532
eous+0.519
Neg logit contributions
��-1.294
 vet-0.979
ibaba-0.895
 manipulative-0.878
yden-0.871
TokenBirth
Token ID38480
Feature activation+3.22e-03

Pos logit contributions
present+0.991
 Bris+0.951
 Practices+0.893
 Torn+0.876
eous+0.858
Neg logit contributions
��-1.716
 vet-1.262
ibaba-1.150
 manipulative-1.123
yden-1.108
Token Date
Token ID7536
Feature activation+3.89e-01

Pos logit contributions
present+0.015
 Bris+0.014
 Practices+0.013
eous+0.013
 Torn+0.013
Neg logit contributions
��-0.029
 vet-0.022
ibaba-0.020
 manipulative-0.019
yden-0.019
Token:
Token ID25
Feature activation+8.77e-01

Pos logit contributions
present+1.862
 Bris+1.720
 Practices+1.617
 Torn+1.592
eous+1.556
Neg logit contributions
��-3.499
 vet-2.579
ibaba-2.377
 manipulative-2.317
yden-2.299
Token October
Token ID3267
Feature activation+4.48e-01

Pos logit contributions
present+4.959
 Bris+4.668
 Practices+4.410
 Torn+4.406
 Bind+4.263
Neg logit contributions
��-7.104
 vet-4.986
ibaba-4.623
 manipulative-4.409
yden-4.387
Token 16
Token ID1467
Feature activation+3.88e-01

Pos logit contributions
present+2.627
 Bris+2.464
 Practices+2.349
 Torn+2.320
eous+2.271
Neg logit contributions
��-3.545
 vet-2.477
ibaba-2.261
 manipulative-2.171
yden-2.160
Token,
Token ID11
Feature activation+3.85e-01

Pos logit contributions
present+1.923
 Bris+1.796
 Practices+1.688
 Torn+1.660
eous+1.628
Neg logit contributions
��-3.390
 vet-2.503
ibaba-2.279
 manipulative-2.227
yden-2.210
Token 1981
Token ID14745
Feature activation+1.77e-01

Pos logit contributions
present+1.530
 Bris+1.401
 Practices+1.297
 Torn+1.266
eous+1.244
Neg logit contributions
��-3.727
 vet-2.856
ibaba-2.622
 manipulative-2.587
yden-2.558
Token
Token ID198
Feature activation+4.90e-01

Pos logit contributions
present+0.811
 Bris+0.751
 Practices+0.699
 Torn+0.686
eous+0.678
Neg logit contributions
��-1.609
 vet-1.210
ibaba-1.105
 manipulative-1.081
yden-1.073
Token:
Token ID25
Feature activation+3.48e-01

Pos logit contributions
present+2.210
 Bris+1.983
 Practices+1.867
 Torn+1.833
eous+1.820
Neg logit contributions
��-4.511
 vet-3.297
ibaba-3.099
 manipulative-2.985
yden-2.963
Token44
Token ID2598
Feature activation-3.75e-02

Pos logit contributions
present+1.775
 Bris+1.635
 Practices+1.549
 Torn+1.526
eous+1.495
Neg logit contributions
��-3.011
 vet-2.198
ibaba-2.019
 manipulative-1.958
yden-1.941
Token Se
Token ID1001
Feature activation+6.79e-02

Pos logit contributions
��+0.324
 vet+0.240
ibaba+0.218
 manipulative+0.213
yden+0.212
Neg logit contributions
present-0.189
 Bris-0.174
 Practices-0.165
 Torn-0.161
eous-0.159
Tokeneker
Token ID28233
Feature activation+1.40e-01

Pos logit contributions
present+0.392
 Bris+0.371
 Practices+0.355
 Torn+0.348
eous+0.340
Neg logit contributions
��-0.532
 vet-0.378
ibaba-0.344
yden-0.331
 manipulative-0.330
Token wrote
Token ID2630
Feature activation+5.03e-01

Pos logit contributions
present+0.671
 Bris+0.624
 Practices+0.588
 Torn+0.576
eous+0.564
Neg logit contributions
��-1.242
 vet-0.922
ibaba-0.838
 manipulative-0.820
yden-0.817
Token:
Token ID25
Feature activation+8.70e-01

Pos logit contributions
present+2.334
 Bris+2.127
 Practices+2.002
 Torn+1.976
eous+1.942
Neg logit contributions
��-4.628
 vet-3.394
ibaba-3.167
 manipulative-3.064
yden-3.033
Token
Token ID198
Feature activation+6.52e-01

Pos logit contributions
present+3.247
 Bris+2.756
 Practices+2.557
 Torn+2.520
eous+2.498
Neg logit contributions
��-8.928
 vet-6.625
ibaba-6.290
 manipulative-6.081
sterdam-6.051
Token
Token ID198
Feature activation+6.57e-01

Pos logit contributions
present+2.577
 Bris+2.240
 Practices+2.093
 Torn+2.057
eous+2.027
Neg logit contributions
��-6.544
 vet-4.868
ibaba-4.610
 manipulative-4.450
sterdam-4.432
TokenCl
Token ID2601
Feature activation+5.70e-02

Pos logit contributions
present+2.728
 Bris+2.439
 Practices+2.261
 Torn+2.233
eous+2.207
Neg logit contributions
��-6.300
 vet-4.769
ibaba-4.425
 manipulative-4.323
yden-4.274
Tokenarity
Token ID6806
Feature activation-4.40e-02

Pos logit contributions
present+0.349
 Bris+0.332
 Practices+0.317
 Torn+0.311
 Bind+0.304
Neg logit contributions
��-0.429
 vet-0.299
ibaba-0.271
yden-0.259
 manipulative-0.259
Token Gaming
Token ID14426
Feature activation+8.05e-02

Pos logit contributions
��+0.372
 vet+0.274
ibaba+0.246
 manipulative+0.242
yden+0.239
Neg logit contributions
present-0.229
 Bris-0.213
 Practices-0.201
 Torn-0.198
eous-0.194
Token:
Token ID25
Feature activation+6.18e-01

Pos logit contributions
present+1.549
 Bris+1.409
 Practices+1.333
 Torn+1.302
eous+1.285
Neg logit contributions
��-3.015
 vet-2.218
ibaba-2.059
 manipulative-1.994
yden-1.990
TokenMan
Token ID5124
Feature activation+4.00e-01

Pos logit contributions
present+2.957
 Bris+2.687
 Practices+2.538
 Torn+2.506
eous+2.447
Neg logit contributions
��-5.509
 vet-4.091
ibaba-3.769
 manipulative-3.654
yden-3.652
Tokenaged
Token ID1886
Feature activation+3.47e-01

Pos logit contributions
present+2.000
 Bris+1.786
 Practices+1.700
eous+1.689
 Torn+1.674
Neg logit contributions
��-3.535
 vet-2.584
ibaba-2.369
 manipulative-2.307
yden-2.283
Token Access
Token ID8798
Feature activation+1.20e-01

Pos logit contributions
present+2.008
 Bris+1.887
 Practices+1.795
 Torn+1.771
eous+1.739
Neg logit contributions
��-2.752
 vet-1.947
ibaba-1.757
 manipulative-1.706
yden-1.692
Token Point
Token ID6252
Feature activation+5.76e-01

Pos logit contributions
present+0.686
 Bris+0.647
 Practices+0.611
 Torn+0.604
eous+0.595
Neg logit contributions
��-0.956
 vet-0.680
ibaba-0.613
 manipulative-0.596
yden-0.591
Token:
Token ID25
Feature activation+8.39e-01

Pos logit contributions
present+2.634
 Bris+2.380
 Practices+2.246
 Torn+2.210
eous+2.178
Neg logit contributions
��-5.358
 vet-3.943
ibaba-3.686
 manipulative-3.556
yden-3.552
Token Not
Token ID1892
Feature activation+4.95e-01

Pos logit contributions
present+3.914
 Bris+3.565
 Practices+3.365
 Torn+3.340
 Bind+3.252
Neg logit contributions
��-7.552
 vet-5.621
ibaba-5.209
yden-5.040
 manipulative-5.029
Token-
Token ID12
Feature activation+2.34e-01

Pos logit contributions
present+2.446
 Bris+2.237
 Practices+2.120
 Torn+2.100
eous+2.049
Neg logit contributions
��-4.366
 vet-3.211
ibaba-2.956
 manipulative-2.858
yden-2.840
TokenAssociated
Token ID29014
Feature activation+6.31e-01

Pos logit contributions
present+0.830
 Bris+0.749
 Practices+0.687
 Torn+0.670
eous+0.649
Neg logit contributions
��-2.382
 vet-1.839
ibaba-1.710
 manipulative-1.676
yden-1.666
Token T
Token ID309
Feature activation+3.03e-01

Pos logit contributions
present+2.880
 Bris+2.656
 Practices+2.499
 Torn+2.480
 Bind+2.391
Neg logit contributions
��-5.744
 vet-4.277
ibaba-3.952
 manipulative-3.847
yden-3.831
Tokenx
Token ID87
Feature activation+3.16e-01

Pos logit contributions
present+1.666
 Bris+1.526
 Practices+1.441
 Torn+1.427
eous+1.413
Neg logit contributions
��-2.521
 vet-1.805
ibaba-1.644
 manipulative-1.601
yden-1.568
Token Game
Token ID3776
Feature activation-1.86e-01

Pos logit contributions
��+2.955
 vet+2.048
ibaba+1.820
 manipulative+1.756
 col+1.744
Neg logit contributions
present-2.350
 Bris-2.212
 Practices-2.106
 Torn-2.082
eous-2.075
Token]
Token ID60
Feature activation-2.90e-01

Pos logit contributions
��+1.484
 vet+1.045
ibaba+0.937
 manipulative+0.913
yden+0.912
Neg logit contributions
present-1.068
 Bris-0.991
 Practices-0.940
eous-0.926
 Torn-0.926
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
��+2.474
 vet+1.831
ibaba+1.659
 manipulative+1.618
 col+1.608
Neg logit contributions
present-1.434
 Bris-1.345
 Practices-1.256
eous-1.233
 Torn-1.221
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
present+0.884
 Bris+0.804
 Practices+0.752
 Torn+0.738
eous+0.719
Neg logit contributions
��-1.950
 vet-1.469
ibaba-1.358
 manipulative-1.327
yden-1.318
TokenPlatform
Token ID37148
Feature activation+3.70e-01

Pos logit contributions
present+0.619
 Bris+0.586
 Practices+0.554
 Torn+0.545
eous+0.534
Neg logit contributions
��-0.944
 vet-0.678
ibaba-0.615
 manipulative-0.599
yden-0.594
Token:
Token ID25
Feature activation+8.17e-01

Pos logit contributions
present+1.798
 Bris+1.670
 Practices+1.573
 Torn+1.548
eous+1.514
Neg logit contributions
��-3.266
 vet-2.407
ibaba-2.208
 manipulative-2.153
yden-2.143
Token IBM
Token ID19764
Feature activation+2.08e-01

Pos logit contributions
present+4.562
 Bris+4.267
 Practices+4.080
 Torn+4.039
 Bind+3.943
Neg logit contributions
��-6.580
 vet-4.692
ibaba-4.288
yden-4.158
 manipulative-4.114
Token PC
Token ID4217
Feature activation+1.74e-01

Pos logit contributions
present+1.191
 Bris+1.122
 Practices+1.066
 Torn+1.048
eous+1.036
Neg logit contributions
��-1.662
 vet-1.175
ibaba-1.050
 manipulative-1.037
 col-1.010
Token Comp
Token ID3082
Feature activation+1.39e-02

Pos logit contributions
present+0.858
 Bris+0.800
 Practices+0.752
 Torn+0.741
eous+0.728
Neg logit contributions
��-1.522
 vet-1.118
ibaba-1.021
 manipulative-0.995
yden-0.983
Tokenatible
Token ID16873
Feature activation-2.52e-02

Pos logit contributions
present+0.103
 Bris+0.098
 Practices+0.094
 Torn+0.093
eous+0.092
Neg logit contributions
��-0.088
 vet-0.054
ibaba-0.047
yden-0.046
 manipulative-0.046
Token
Token ID198
Feature activation+2.41e-02

Pos logit contributions
��+0.219
 vet+0.162
ibaba+0.147
 manipulative+0.144
 col+0.142
Neg logit contributions
present-0.124
 Bris-0.115
 Practices-0.108
 Torn-0.107
eous-0.105
Token Blog
Token ID14001
Feature activation+1.31e-01

Pos logit contributions
present+1.491
 Bris+1.343
 Practices+1.233
 Torn+1.202
eous+1.164
Neg logit contributions
��-4.278
 vet-3.304
ibaba-3.069
 manipulative-3.010
yden-2.992
Token Joined
Token ID16709
Feature activation+1.43e-01

Pos logit contributions
present+0.488
 Bris+0.443
 Practices+0.408
 Torn+0.397
eous+0.387
Neg logit contributions
��-1.308
 vet-1.008
ibaba-0.932
 manipulative-0.914
yden-0.910
Token April
Token ID3035
Feature activation+5.95e-01

Pos logit contributions
present+0.877
 Bris+0.830
 Practices+0.793
 Torn+0.778
eous+0.773
Neg logit contributions
��-1.069
 vet-0.751
ibaba-0.654
 manipulative-0.649
yden-0.638
Token 2012
Token ID2321
Feature activation+4.30e-01

Pos logit contributions
present+2.412
 Bris+2.208
 Practices+2.046
 Torn+2.016
eous+1.940
Neg logit contributions
��-5.767
 vet-4.364
ibaba-4.075
 manipulative-3.953
yden-3.944
Token United
Token ID1578
Feature activation+4.44e-01

Pos logit contributions
present+2.574
 Bris+2.424
 Practices+2.310
 Torn+2.279
eous+2.242
Neg logit contributions
��-3.322
 vet-2.327
ibaba-2.088
 manipulative-2.028
yden-2.010
Token States
Token ID1829
Feature activation+8.06e-01

Pos logit contributions
present+2.400
 Bris+2.253
 Practices+2.132
 Torn+2.102
eous+2.063
Neg logit contributions
��-3.674
 vet-2.640
ibaba-2.404
 manipulative-2.339
yden-2.317
Token 5
Token ID642
Feature activation+2.70e-01

Pos logit contributions
present+4.513
 Bris+4.215
 Practices+3.993
 Torn+3.965
 Bind+3.850
Neg logit contributions
��-6.598
 vet-4.613
ibaba-4.313
 manipulative-4.080
yden-4.070
Token667
Token ID28933
Feature activation+4.47e-01

Pos logit contributions
present+1.441
 Bris+1.346
 Practices+1.275
 Torn+1.256
eous+1.232
Neg logit contributions
��-2.254
 vet-1.629
ibaba-1.481
 manipulative-1.442
yden-1.431
Token Posts
Token ID12043
Feature activation-2.12e-02

Pos logit contributions
present+1.599
 Bris+1.434
 Practices+1.328
 Torn+1.291
eous+1.248
Neg logit contributions
��-4.554
 vet-3.501
ibaba-3.277
 manipulative-3.196
yden-3.179
Token #
Token ID1303
Feature activation+1.20e-01

Pos logit contributions
��+0.170
 vet+0.122
ibaba+0.110
 manipulative+0.107
yden+0.105
Neg logit contributions
present-0.120
 Bris-0.113
 Practices-0.107
 Torn-0.105
eous-0.103
Token12
Token ID1065
Feature activation+1.18e-01

Pos logit contributions
present+0.615
 Bris+0.566
 Practices+0.536
 Torn+0.528
eous+0.517
Neg logit contributions
��-1.035
 vet-0.751
ibaba-0.689
 manipulative-0.669
yden-0.663
Token--
Token ID438
Feature activation+4.44e-01

Pos logit contributions
present+0.192
 Bris+0.180
 Practices+0.168
 Torn+0.166
eous+0.161
Neg logit contributions
��-0.372
 vet-0.277
ibaba-0.254
 manipulative-0.248
yden-0.247
Tokenimages
Token ID17566
Feature activation-1.10e-01

Pos logit contributions
present+2.115
 Bris+1.896
 Practices+1.800
 Torn+1.754
eous+1.739
Neg logit contributions
��-4.009
 vet-2.971
ibaba-2.707
 manipulative-2.663
yden-2.629
Token",
Token ID1600
Feature activation-2.48e-01

Pos logit contributions
��+0.904
 vet+0.660
ibaba+0.602
 manipulative+0.583
yden+0.580
Neg logit contributions
present-0.579
 Bris-0.551
 Practices-0.519
 Torn-0.512
eous-0.498
Token required
Token ID2672
Feature activation+3.58e-01

Pos logit contributions
��+1.874
 vet+1.335
ibaba+1.153
 manipulative+1.148
 col+1.136
Neg logit contributions
present-1.492
 Bris-1.426
 Practices-1.378
eous-1.346
 Torn-1.322
Token=
Token ID28
Feature activation+3.43e-01

Pos logit contributions
present+1.805
 Bris+1.645
 Practices+1.563
 Torn+1.529
eous+1.512
Neg logit contributions
��-3.160
 vet-2.293
ibaba-2.117
 manipulative-2.054
yden-2.032
TokenTrue
Token ID17821
Feature activation+8.04e-01

Pos logit contributions
present+1.629
 Bris+1.491
 Practices+1.406
 Torn+1.378
eous+1.354
Neg logit contributions
��-3.089
 vet-2.288
ibaba-2.113
 manipulative-2.054
yden-2.035
Token,
Token ID11
Feature activation+8.82e-02

Pos logit contributions
present+3.464
 Bris+3.009
 Practices+2.826
eous+2.781
 Torn+2.756
Neg logit contributions
��-7.722
 vet-5.646
ibaba-5.415
sterdam-5.169
 manipulative-5.168
Token help
Token ID1037
Feature activation+3.85e-01

Pos logit contributions
present+0.541
 Bris+0.505
 Practices+0.483
 Torn+0.477
eous+0.470
Neg logit contributions
��-0.675
 vet-0.459
ibaba-0.420
yden-0.402
 manipulative-0.401
Token="
Token ID2625
Feature activation+4.42e-01

Pos logit contributions
present+1.370
 Bris+1.200
 Practices+1.109
 Torn+1.074
eous+1.055
Neg logit contributions
��-3.953
 vet-3.043
ibaba-2.842
 manipulative-2.774
yden-2.753
Tokenpath
Token ID6978
Feature activation+4.90e-01

Pos logit contributions
present+2.489
 Bris+2.259
 Practices+2.167
 Torn+2.126
eous+2.110
Neg logit contributions
��-3.621
 vet-2.576
ibaba-2.336
 manipulative-2.276
yden-2.245
Token to
Token ID284
Feature activation+5.60e-01

Pos logit contributions
present+2.556
 Bris+2.348
 Practices+2.231
 Torn+2.205
eous+2.179
Neg logit contributions
��-4.212
 vet-2.986
ibaba-2.749
yden-2.668
 manipulative-2.664
Token50
Token ID1120
Feature activation+4.19e-02

Pos logit contributions
present+3.389
 Bris+3.039
 Practices+2.888
 Torn+2.867
eous+2.795
Neg logit contributions
��-5.897
 vet-4.261
ibaba-3.978
 manipulative-3.812
yden-3.777
Token pur
Token ID1308
Feature activation+1.02e-01

Pos logit contributions
present+0.235
 Bris+0.219
 Practices+0.209
 Torn+0.205
eous+0.203
Neg logit contributions
��-0.338
 vet-0.244
ibaba-0.219
 manipulative-0.214
yden-0.212
Tokenak
Token ID461
Feature activation+1.07e-01

Pos logit contributions
present+0.601
 Bris+0.557
 Practices+0.529
 Torn+0.526
eous+0.521
Neg logit contributions
��-0.808
 vet-0.565
ibaba-0.502
 manipulative-0.495
yden-0.485
Tokenushi
Token ID17731
Feature activation+1.56e-02

Pos logit contributions
present+0.738
 Bris+0.694
 Practices+0.670
 Torn+0.657
eous+0.655
Neg logit contributions
��-0.739
 vet-0.485
ibaba-0.424
 manipulative-0.412
yden-0.403
Token wrote
Token ID2630
Feature activation+3.96e-01

Pos logit contributions
present+0.065
 Bris+0.060
 Practices+0.056
 Torn+0.055
eous+0.054
Neg logit contributions
��-0.147
 vet-0.112
ibaba-0.102
 manipulative-0.100
yden-0.100
Token:
Token ID25
Feature activation+7.99e-01

Pos logit contributions
present+1.877
 Bris+1.726
 Practices+1.626
 Torn+1.605
eous+1.567
Neg logit contributions
��-3.580
 vet-2.632
ibaba-2.436
 manipulative-2.365
yden-2.344
Token
Token ID198
Feature activation+6.82e-01

Pos logit contributions
present+3.064
 Bris+2.662
 Practices+2.466
 Torn+2.427
eous+2.375
Neg logit contributions
��-8.066
 vet-6.008
ibaba-5.692
 manipulative-5.508
yden-5.467
Token
Token ID198
Feature activation+6.93e-01

Pos logit contributions
present+2.685
 Bris+2.333
 Practices+2.182
 Torn+2.141
eous+2.118
Neg logit contributions
��-6.849
 vet-5.103
ibaba-4.846
 manipulative-4.661
sterdam-4.644
TokenHappy
Token ID25082
Feature activation+1.64e-01

Pos logit contributions
present+3.850
 Bris+3.542
 Practices+3.363
 Torn+3.322
eous+3.306
Neg logit contributions
��-5.671
 vet-4.057
ibaba-3.712
 manipulative-3.585
yden-3.552
Token it
Token ID340
Feature activation-1.56e-01

Pos logit contributions
present+0.873
 Bris+0.816
 Practices+0.772
 Torn+0.761
eous+0.746
Neg logit contributions
��-1.368
 vet-0.988
ibaba-0.899
 manipulative-0.875
yden-0.868
Token worked
Token ID3111
Feature activation-1.36e-01

Pos logit contributions
��+1.138
 vet+0.779
ibaba+0.690
 manipulative+0.671
yden+0.663
Neg logit contributions
present-0.998
 Bris-0.944
 Practices-0.903
 Torn-0.892
eous-0.878
Tokenen
Token ID268
Feature activation+5.91e-01

Pos logit contributions
present+0.340
 Bris+0.322
 Practices+0.306
 Torn+0.301
eous+0.294
Neg logit contributions
��-0.514
 vet-0.373
ibaba-0.340
yden-0.327
 manipulative-0.326
Token),
Token ID828
Feature activation+7.70e-02

Pos logit contributions
present+3.228
 Bris+3.003
 Practices+2.851
 Torn+2.810
eous+2.766
Neg logit contributions
��-4.858
 vet-3.483
ibaba-3.149
 manipulative-3.084
yden-3.048
Token uncle
Token ID7711
Feature activation+2.31e-01

Pos logit contributions
present+0.471
 Bris+0.446
 Practices+0.426
 Torn+0.421
eous+0.413
Neg logit contributions
��-0.583
 vet-0.408
ibaba-0.362
 manipulative-0.354
yden-0.347
Token (
Token ID357
Feature activation+2.29e-01

Pos logit contributions
present+1.511
 Bris+1.436
 Practices+1.374
 Torn+1.358
eous+1.333
Neg logit contributions
��-1.645
 vet-1.113
ibaba-0.984
 manipulative-0.953
yden-0.946
Tokenaba
Token ID15498
Feature activation+3.39e-01

Pos logit contributions
present+1.209
 Bris+1.137
 Practices+1.078
 Torn+1.060
eous+1.035
Neg logit contributions
��-1.921
 vet-1.394
ibaba-1.270
 manipulative-1.232
yden-1.228
Tokenqa
Token ID20402
Feature activation+7.93e-01

Pos logit contributions
present+1.816
 Bris+1.695
 Practices+1.606
 Torn+1.583
eous+1.553
Neg logit contributions
��-2.825
 vet-2.039
ibaba-1.852
 manipulative-1.805
yden-1.790
Token),
Token ID828
Feature activation+2.57e-01

Pos logit contributions
present+3.995
 Bris+3.656
 Practices+3.450
 Torn+3.411
eous+3.334
Neg logit contributions
��-6.769
 vet-4.921
ibaba-4.475
 manipulative-4.412
yden-4.360
Token[
Token ID58
Feature activation+2.98e-01

Pos logit contributions
present+1.604
 Bris+1.515
 Practices+1.448
 Torn+1.429
eous+1.407
Neg logit contributions
��-1.912
 vet-1.319
ibaba-1.176
 manipulative-1.141
yden-1.128
Token4
Token ID19
Feature activation+5.42e-04

Pos logit contributions
present+1.114
 Bris+1.021
 Practices+0.940
 Torn+0.919
eous+0.888
Neg logit contributions
��-2.963
 vet-2.278
ibaba-2.109
 manipulative-2.068
yden-2.057
Token]
Token ID60
Feature activation-1.39e-01

Pos logit contributions
present+0.004
 Bris+0.004
 Practices+0.003
 Torn+0.003
eous+0.003
Neg logit contributions
��-0.004
 vet-0.002
ibaba-0.002
 manipulative-0.002
yden-0.002
Token firm
Token ID4081
Feature activation+4.22e-02

Pos logit contributions
��+1.035
 vet+0.730
ibaba+0.643
 manipulative+0.630
yden+0.614
Neg logit contributions
present-0.853
 Bris-0.811
 Practices-0.778
 Torn-0.765
eous-0.750
Token along
Token ID1863
Feature activation-7.30e-02

Pos logit contributions
present+1.024
 Bris+0.959
 Practices+0.920
 Torn+0.895
eous+0.891
Neg logit contributions
��-1.425
 vet-1.029
ibaba-0.917
 manipulative-0.897
yden-0.892
Token he
Token ID339
Feature activation+1.18e-01

Pos logit contributions
��+0.591
 vet+0.437
ibaba+0.390
 manipulative+0.382
yden+0.381
Neg logit contributions
present-0.393
 Bris-0.368
 Practices-0.355
eous-0.340
 Torn-0.337
Token walks
Token ID11114
Feature activation+2.34e-01

Pos logit contributions
present+0.549
 Bris+0.508
 Practices+0.475
 Torn+0.468
eous+0.459
Neg logit contributions
��-1.055
 vet-0.798
ibaba-0.720
 manipulative-0.713
yden-0.710
Token,
Token ID11
Feature activation+2.14e-01

Pos logit contributions
present+1.228
 Bris+1.148
 Practices+1.096
 Torn+1.062
eous+1.054
Neg logit contributions
��-1.956
 vet-1.443
ibaba-1.294
 manipulative-1.271
yden-1.264
Token the
Token ID262
Feature activation+7.07e-01

Pos logit contributions
present+1.212
 Bris+1.130
 Practices+1.093
eous+1.064
 Torn+1.057
Neg logit contributions
��-1.706
 vet-1.235
ibaba-1.097
 manipulative-1.090
yden-1.073
Token greater
Token ID3744
Feature activation+7.89e-01

Pos logit contributions
present+3.614
 Bris+3.372
 Practices+3.171
 Torn+3.135
eous+3.053
Neg logit contributions
��-6.070
 vet-4.403
ibaba-4.055
 manipulative-3.919
yden-3.904
Token the
Token ID262
Feature activation+6.56e-03

Pos logit contributions
present+4.336
 Bris+4.080
 Practices+3.822
 Torn+3.817
eous+3.706
Neg logit contributions
��-6.455
 vet-4.573
ibaba-4.168
 manipulative-4.037
yden-4.010
Token destruction
Token ID8166
Feature activation+3.45e-01

Pos logit contributions
present+0.042
 Bris+0.039
 Practices+0.038
eous+0.037
 Torn+0.037
Neg logit contributions
��-0.047
 vet-0.034
 manipulative-0.029
ibaba-0.029
yden-0.028
Token and
Token ID290
Feature activation-2.20e-01

Pos logit contributions
present+1.758
 Bris+1.635
 Practices+1.556
 Torn+1.512
eous+1.497
Neg logit contributions
��-2.946
 vet-2.178
ibaba-1.973
 manipulative-1.937
yden-1.920
Token injuries
Token ID6821
Feature activation+9.59e-02

Pos logit contributions
��+1.604
 vet+1.164
ibaba+1.008
 manipulative+0.998
 col+0.971
Neg logit contributions
present-1.348
 Bris-1.264
 Practices-1.241
eous-1.199
 Torn-1.187
Token.
Token ID13
Feature activation+1.08e-01

Pos logit contributions
present+0.501
 Bris+0.468
 Practices+0.452
eous+0.435
 Torn+0.429
Neg logit contributions
��-0.795
 vet-0.593
ibaba-0.531
 manipulative-0.520
yden-0.518
Token Studios
Token ID13799
Feature activation+1.14e-01

Pos logit contributions
present+0.023
 Bris+0.022
 Practices+0.021
 Torn+0.020
eous+0.020
Neg logit contributions
��-0.035
 vet-0.026
ibaba-0.022
 manipulative-0.022
 col-0.022
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
present+0.539
 Bris+0.499
 Practices+0.466
 Torn+0.459
eous+0.452
Neg logit contributions
��-1.023
 vet-0.761
ibaba-0.692
 manipulative-0.678
yden-0.671
Token
Token ID198
Feature activation+2.85e-01

Pos logit contributions
present+1.443
 Bris+1.296
 Practices+1.213
 Torn+1.189
eous+1.163
Neg logit contributions
��-3.343
 vet-2.504
ibaba-2.337
 manipulative-2.272
yden-2.260
TokenRelease
Token ID26362
Feature activation+3.91e-01

Pos logit contributions
present+1.630
 Bris+1.550
 Practices+1.471
 Torn+1.447
eous+1.419
Neg logit contributions
��-2.256
 vet-1.597
ibaba-1.438
 manipulative-1.401
yden-1.386
Token Date
Token ID7536
Feature activation+5.60e-01

Pos logit contributions
present+1.627
 Bris+1.492
 Practices+1.391
 Torn+1.363
eous+1.326
Neg logit contributions
��-3.727
 vet-2.819
ibaba-2.608
 manipulative-2.549
yden-2.537
Token:
Token ID25
Feature activation+7.84e-01

Pos logit contributions
present+2.616
 Bris+2.423
 Practices+2.273
 Torn+2.248
eous+2.200
Neg logit contributions
��-5.050
 vet-3.734
ibaba-3.457
 manipulative-3.363
yden-3.351
Token May
Token ID1737
Feature activation+4.43e-01

Pos logit contributions
present+4.449
 Bris+4.149
 Practices+3.975
 Torn+3.937
 Bind+3.839
Neg logit contributions
��-6.290
 vet-4.425
ibaba-4.121
yden-3.909
 manipulative-3.905
Token 11
Token ID1367
Feature activation+2.06e-01

Pos logit contributions
present+2.530
 Bris+2.369
 Practices+2.257
 Torn+2.225
eous+2.178
Neg logit contributions
��-3.550
 vet-2.509
ibaba-2.289
 manipulative-2.205
yden-2.195
Token,
Token ID11
Feature activation+2.49e-01

Pos logit contributions
present+1.024
 Bris+0.953
 Practices+0.898
 Torn+0.884
eous+0.865
Neg logit contributions
��-1.794
 vet-1.318
ibaba-1.204
 manipulative-1.175
yden-1.166
Token 2000
Token ID4751
Feature activation+3.61e-01

Pos logit contributions
present+1.185
 Bris+1.099
 Practices+1.032
 Torn+1.015
eous+0.994
Neg logit contributions
��-2.229
 vet-1.654
ibaba-1.514
 manipulative-1.480
yden-1.469
Token
Token ID198
Feature activation+4.98e-01

Pos logit contributions
present+1.625
 Bris+1.498
 Practices+1.404
 Torn+1.378
eous+1.345
Neg logit contributions
��-3.321
 vet-2.484
ibaba-2.287
 manipulative-2.235
yden-2.221
Token on
Token ID319
Feature activation-1.55e-01

Pos logit contributions
��+2.503
 vet+1.850
ibaba+1.671
 manipulative+1.614
sterdam+1.607
Neg logit contributions
present-1.532
 Bris-1.423
 Practices-1.313
 Torn-1.310
eous-1.309
Token Friday
Token ID3217
Feature activation-9.16e-02

Pos logit contributions
��+1.281
 vet+0.941
ibaba+0.856
 manipulative+0.822
yden+0.816
Neg logit contributions
present-0.829
 Bris-0.768
 Practices-0.720
 Torn-0.715
eous-0.710
Token.
Token ID13
Feature activation+5.77e-02

Pos logit contributions
��+0.773
 vet+0.566
ibaba+0.513
 manipulative+0.497
yden+0.495
Neg logit contributions
present-0.477
 Bris-0.443
 Practices-0.415
 Torn-0.410
eous-0.407
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
present+0.284
 Bris+0.264
 Practices+0.245
eous+0.245
 Torn+0.243
Neg logit contributions
��-0.495
 vet-0.369
ibaba-0.337
yden-0.325
 manipulative-0.325
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
present+0.944
 Bris+0.866
 Practices+0.809
 Torn+0.793
eous+0.774
Neg logit contributions
��-2.057
 vet-1.549
ibaba-1.429
 manipulative-1.397
yden-1.388
TokenWHERE
Token ID47357
Feature activation+7.73e-01

Pos logit contributions
present+1.204
 Bris+1.146
 Practices+1.077
 Torn+1.058
eous+1.037
Neg logit contributions
��-2.140
 vet-1.577
ibaba-1.444
 manipulative-1.405
yden-1.396
Token:
Token ID25
Feature activation+1.62e-01

Pos logit contributions
present+3.391
 Bris+3.030
 Practices+2.832
 Torn+2.768
 Hind+2.673
Neg logit contributions
��-7.356
 vet-5.386
ibaba-5.034
 manipulative-4.913
yden-4.867
Token The
Token ID383
Feature activation+7.83e-02

Pos logit contributions
present+0.889
 Bris+0.831
 Practices+0.789
 Torn+0.776
eous+0.766
Neg logit contributions
��-1.326
 vet-0.956
ibaba-0.866
 manipulative-0.841
yden-0.835
Token Formula
Token ID19639
Feature activation-1.65e-01

Pos logit contributions
present+0.453
 Bris+0.425
 Practices+0.405
 Torn+0.399
eous+0.392
Neg logit contributions
��-0.621
 vet-0.439
ibaba-0.396
 manipulative-0.385
yden-0.382
Token One
Token ID1881
Feature activation+8.66e-02

Pos logit contributions
��+1.379
 vet+1.012
ibaba+0.918
 manipulative+0.887
yden+0.883
Neg logit contributions
present-0.860
 Bris-0.797
 Practices-0.746
 Torn-0.739
eous-0.733
Token race
Token ID3234
Feature activation+5.64e-02

Pos logit contributions
present+0.485
 Bris+0.455
 Practices+0.431
 Torn+0.425
eous+0.420
Neg logit contributions
��-0.701
 vet-0.503
ibaba-0.454
 manipulative-0.441
yden-0.435
Token arrest
Token ID3251
Feature activation+1.03e-01

Pos logit contributions
present+0.553
 Bris+0.523
 Practices+0.499
 Torn+0.492
eous+0.487
Neg logit contributions
��-0.616
 vet-0.425
ibaba-0.371
 manipulative-0.365
yden-0.355
Token warrant
Token ID8084
Feature activation+3.48e-01

Pos logit contributions
present+0.674
 Bris+0.640
 Practices+0.609
 Torn+0.604
eous+0.594
Neg logit contributions
��-0.731
 vet-0.496
ibaba-0.438
 manipulative-0.424
yden-0.418
Token to
Token ID284
Feature activation+2.48e-01

Pos logit contributions
present+1.811
 Bris+1.689
 Practices+1.601
 Torn+1.572
eous+1.544
Neg logit contributions
��-2.951
 vet-2.139
ibaba-1.956
 manipulative-1.902
yden-1.890
Token tackle
Token ID9761
Feature activation+2.79e-01

Pos logit contributions
present+1.689
 Bris+1.603
 Practices+1.532
 Torn+1.528
 Bind+1.497
Neg logit contributions
��-1.715
 vet-1.103
ibaba-0.998
 manipulative-0.959
yden-0.956
Token cross
Token ID3272
Feature activation+2.10e-01

Pos logit contributions
present+1.897
 Bris+1.800
 Practices+1.724
 Torn+1.701
eous+1.683
Neg logit contributions
��-1.928
 vet-1.290
ibaba-1.131
 manipulative-1.098
yden-1.078
Token-
Token ID12
Feature activation+7.72e-01

Pos logit contributions
present+1.210
 Bris+1.137
 Practices+1.081
 Torn+1.067
eous+1.048
Neg logit contributions
��-1.664
 vet-1.179
ibaba-1.063
 manipulative-1.033
yden-1.025
Tokenborder
Token ID20192
Feature activation+1.92e-01

Pos logit contributions
present+5.511
 Bris+5.110
 Practices+4.945
 Torn+4.932
eous+4.831
Neg logit contributions
��-5.088
 vet-3.287
ibaba-2.892
 manipulative-2.725
yden-2.723
Token crime
Token ID4065
Feature activation+8.13e-02

Pos logit contributions
present+1.384
 Bris+1.317
 Practices+1.262
 Torn+1.244
eous+1.237
Neg logit contributions
��-1.238
 vet-0.806
ibaba-0.693
 manipulative-0.678
yden-0.658
Token.
Token ID13
Feature activation-2.49e-02

Pos logit contributions
present+0.416
 Bris+0.389
 Practices+0.364
 Torn+0.357
eous+0.353
Neg logit contributions
��-0.693
 vet-0.512
ibaba-0.464
 manipulative-0.456
yden-0.449
Token Instead
Token ID5455
Feature activation-1.34e-01

Pos logit contributions
��+0.211
 vet+0.157
ibaba+0.140
 manipulative+0.138
yden+0.137
Neg logit contributions
present-0.124
 Bris-0.116
 Practices-0.108
eous-0.107
 Torn-0.106
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
��+1.146
 vet+0.843
ibaba+0.763
 manipulative+0.751
yden+0.740
Neg logit contributions
present-0.681
 Bris-0.634
 Practices-0.605
 Torn-0.585
eous-0.580
Token37
Token ID2718
Feature activation+7.92e-02

Pos logit contributions
present+2.248
 Bris+2.051
 Practices+1.945
 Torn+1.923
eous+1.881
Neg logit contributions
��-3.775
 vet-2.731
ibaba-2.525
 manipulative-2.434
yden-2.414
Token Dark
Token ID3801
Feature activation+2.33e-01

Pos logit contributions
present+0.469
 Bris+0.441
 Practices+0.420
 Torn+0.414
eous+0.407
Neg logit contributions
��-0.616
 vet-0.434
ibaba-0.390
 manipulative-0.379
yden-0.375
Tokenho
Token ID8873
Feature activation-8.01e-02

Pos logit contributions
present+1.110
 Bris+1.008
 Practices+0.958
 Torn+0.935
eous+0.929
Neg logit contributions
��-2.093
 vet-1.536
ibaba-1.405
 manipulative-1.387
yden-1.369
Tokenarse
Token ID17208
Feature activation+5.01e-03

Pos logit contributions
��+0.665
 vet+0.483
ibaba+0.436
 manipulative+0.426
yden+0.424
Neg logit contributions
present-0.430
 Bris-0.404
 Practices-0.384
 Torn-0.372
eous-0.366
Token wrote
Token ID2630
Feature activation+5.63e-01

Pos logit contributions
present+0.020
 Bris+0.019
 Practices+0.018
 Torn+0.017
eous+0.017
Neg logit contributions
��-0.048
 vet-0.036
ibaba-0.033
 manipulative-0.033
yden-0.033
Token:
Token ID25
Feature activation+7.66e-01

Pos logit contributions
present+2.554
 Bris+2.292
 Practices+2.161
 Torn+2.136
eous+2.110
Neg logit contributions
��-5.299
 vet-3.871
ibaba-3.648
 manipulative-3.511
yden-3.474
Token
Token ID198
Feature activation+6.19e-01

Pos logit contributions
present+2.921
 Bris+2.491
 Practices+2.317
 Torn+2.280
eous+2.269
Neg logit contributions
��-7.840
 vet-5.823
ibaba-5.517
 manipulative-5.337
sterdam-5.295
Token
Token ID198
Feature activation+6.32e-01

Pos logit contributions
present+2.469
 Bris+2.162
 Practices+2.021
 Torn+1.986
eous+1.952
Neg logit contributions
��-6.186
 vet-4.607
ibaba-4.356
 manipulative-4.205
yden-4.185
TokenShow
Token ID15307
Feature activation+4.04e-01

Pos logit contributions
present+2.237
 Bris+1.982
 Practices+1.813
 Torn+1.780
eous+1.740
Neg logit contributions
��-6.433
 vet-4.966
ibaba-4.631
 manipulative-4.532
yden-4.496
Token nested
Token ID28376
Feature activation+2.16e-01

Pos logit contributions
present+2.466
 Bris+2.324
 Practices+2.222
 Torn+2.198
 Bind+2.138
Neg logit contributions
��-3.084
 vet-2.121
ibaba-1.922
 manipulative-1.846
yden-1.835
Token quote
Token ID9577
Feature activation+7.80e-02

Pos logit contributions
present+1.402
 Bris+1.326
 Practices+1.269
 Torn+1.255
eous+1.234
Neg logit contributions
��-1.553
 vet-1.052
ibaba-0.934
 manipulative-0.902
yden-0.894
Token fighting
Token ID4330
Feature activation+3.77e-03

Pos logit contributions
��+0.148
 vet+0.103
ibaba+0.090
 manipulative+0.089
yden+0.087
Neg logit contributions
present-0.128
 Bris-0.121
 Practices-0.117
 Torn-0.115
eous-0.114
Token."
Token ID526
Feature activation+8.62e-02

Pos logit contributions
present+0.020
 Bris+0.018
 Practices+0.017
 Torn+0.017
eous+0.017
Neg logit contributions
��-0.032
 vet-0.023
ibaba-0.021
 manipulative-0.021
yden-0.020
Token<|endoftext|>
Token ID50256
Feature activation-1.81e-02

Pos logit contributions
present+0.400
 Bris+0.371
 Practices+0.348
 Torn+0.340
eous+0.338
Neg logit contributions
��-0.772
 vet-0.578
ibaba-0.527
 manipulative-0.517
yden-0.514
TokenAnonymous
Token ID20660
Feature activation+1.17e-01

Pos logit contributions
��+0.150
 vet+0.109
ibaba+0.099
 manipulative+0.096
yden+0.096
Neg logit contributions
present-0.095
 Bris-0.091
 Practices-0.086
 Torn-0.085
eous-0.083
Token asked
Token ID1965
Feature activation+2.38e-01

Pos logit contributions
present+0.635
 Bris+0.593
 Practices+0.558
 Torn+0.548
eous+0.547
Neg logit contributions
��-0.966
 vet-0.708
ibaba-0.634
 manipulative-0.624
yden-0.618
Token:
Token ID25
Feature activation+7.65e-01

Pos logit contributions
present+1.228
 Bris+1.139
 Practices+1.079
 Torn+1.057
eous+1.044
Neg logit contributions
��-2.036
 vet-1.496
ibaba-1.355
 manipulative-1.328
yden-1.317
Token AU
Token ID27548
Feature activation+1.83e-01

Pos logit contributions
present+3.808
 Bris+3.589
 Practices+3.366
 Torn+3.363
 Bind+3.256
Neg logit contributions
��-6.581
 vet-4.738
ibaba-4.388
yden-4.246
 manipulative-4.246
Token idea
Token ID2126
Feature activation-1.95e-01

Pos logit contributions
present+0.989
 Bris+0.924
 Practices+0.876
 Torn+0.862
eous+0.849
Neg logit contributions
��-1.515
 vet-1.098
ibaba-0.993
 manipulative-0.968
yden-0.960
Token:
Token ID25
Feature activation+1.62e-01

Pos logit contributions
��+1.608
 vet+1.179
ibaba+1.049
 manipulative+1.035
yden+1.023
Neg logit contributions
present-1.046
 Bris-0.976
 Practices-0.930
 Torn-0.904
eous-0.891
Token slightly
Token ID4622
Feature activation+3.68e-02

Pos logit contributions
present+0.887
 Bris+0.829
 Practices+0.788
 Torn+0.774
eous+0.764
Neg logit contributions
��-1.328
 vet-0.959
ibaba-0.867
 manipulative-0.844
yden-0.836
Token cl
Token ID537
Feature activation-2.81e-02

Pos logit contributions
present+0.228
 Bris+0.215
 Practices+0.206
 Torn+0.202
eous+0.200
Neg logit contributions
��-0.275
 vet-0.193
ibaba-0.169
 manipulative-0.169
yden-0.164
Tokenread
Token ID961
Feature activation-1.01e-01

Pos logit contributions
��+1.552
 vet+1.181
ibaba+1.090
 manipulative+1.067
yden+1.062
Neg logit contributions
present-0.664
 Bris-0.615
 Practices-0.573
 Torn-0.560
eous-0.542
Token(
Token ID7
Feature activation-1.70e-01

Pos logit contributions
��+0.767
 vet+0.541
ibaba+0.481
 manipulative+0.473
yden+0.468
Neg logit contributions
present-0.594
 Bris-0.570
 Practices-0.539
 Torn-0.534
eous-0.522
Tokenimage
Token ID9060
Feature activation+4.82e-01

Pos logit contributions
��+1.269
 vet+0.886
ibaba+0.797
 manipulative+0.763
yden+0.762
Neg logit contributions
present-1.034
 Bris-1.002
 Practices-0.956
 Torn-0.942
 Bind-0.919
TokenPath
Token ID15235
Feature activation+4.79e-01

Pos logit contributions
present+2.368
 Bris+2.170
 Practices+2.055
 Torn+2.018
eous+1.981
Neg logit contributions
��-4.264
 vet-3.135
ibaba-2.886
 manipulative-2.805
yden-2.779
Token)
Token ID8
Feature activation+4.25e-01

Pos logit contributions
present+2.128
 Bris+1.929
 Practices+1.813
 Torn+1.773
eous+1.744
Neg logit contributions
��-4.485
 vet-3.343
ibaba-3.092
 manipulative-3.015
yden-2.983
Token image
Token ID2939
Feature activation+7.64e-01

Pos logit contributions
present+1.859
 Bris+1.681
 Practices+1.562
 Torn+1.543
eous+1.528
Neg logit contributions
��-3.970
 vet-2.917
ibaba-2.730
yden-2.664
 manipulative-2.649
Token =
Token ID796
Feature activation+2.07e-01

Pos logit contributions
present+3.196
 Bris+2.828
 Practices+2.629
 Torn+2.602
eous+2.548
Neg logit contributions
��-7.449
 vet-5.563
ibaba-5.187
 manipulative-5.069
sterdam-5.008
Token im
Token ID545
Feature activation+1.78e-01

Pos logit contributions
present+1.353
 Bris+1.278
 Practices+1.219
 Torn+1.210
eous+1.192
Neg logit contributions
��-1.492
 vet-0.990
ibaba-0.894
yden-0.854
 manipulative-0.852
Tokenutils
Token ID26791
Feature activation+3.71e-01

Pos logit contributions
present+0.636
 Bris+0.550
 Practices+0.513
 Torn+0.497
eous+0.492
Neg logit contributions
��-1.822
 vet-1.401
ibaba-1.304
 manipulative-1.277
yden-1.267
Token.
Token ID13
Feature activation+9.50e-02

Pos logit contributions
present+1.752
 Bris+1.584
 Practices+1.498
 Torn+1.473
eous+1.448
Neg logit contributions
��-3.409
 vet-2.491
ibaba-2.324
 manipulative-2.240
yden-2.234
Tokenres
Token ID411
Feature activation-5.07e-02

Pos logit contributions
present+0.495
 Bris+0.463
 Practices+0.438
 Torn+0.431
eous+0.422
Neg logit contributions
��-0.806
 vet-0.586
ibaba-0.533
 manipulative-0.520
yden-0.516
Token
Token ID198
Feature activation-1.65e-01

Pos logit contributions
present+0.466
 Bris+0.437
 Practices+0.402
 Torn+0.401
eous+0.393
Neg logit contributions
��-0.846
 vet-0.634
ibaba-0.574
 manipulative-0.560
yden-0.554
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
��+1.452
 vet+1.105
ibaba+0.998
 manipulative+0.984
yden+0.977
Neg logit contributions
present-0.752
 Bris-0.719
 Practices-0.668
 Torn-0.655
eous-0.639
Token#
Token ID2
Feature activation+2.86e-01

Pos logit contributions
��+1.004
 vet+0.714
ibaba+0.654
 manipulative+0.631
yden+0.625
Neg logit contributions
present-0.679
 Bris-0.663
 Practices-0.623
 Torn-0.621
eous-0.604
Token of
Token ID286
Feature activation+6.22e-01

Pos logit contributions
present+1.348
 Bris+1.241
 Practices+1.170
 Torn+1.148
eous+1.122
Neg logit contributions
��-2.584
 vet-1.910
ibaba-1.759
 manipulative-1.714
yden-1.704
Token Interview
Token ID19371
Feature activation+4.64e-01

Pos logit contributions
present+3.463
 Bris+3.242
 Practices+3.096
 Torn+3.034
eous+2.973
Neg logit contributions
��-5.065
 vet-3.610
ibaba-3.271
 manipulative-3.179
yden-3.162
Tokens
Token ID82
Feature activation+7.59e-01

Pos logit contributions
present+1.730
 Bris+1.555
 Practices+1.462
 Torn+1.403
eous+1.372
Neg logit contributions
��-4.661
 vet-3.558
ibaba-3.311
 manipulative-3.241
yden-3.217
Token:
Token ID25
Feature activation+4.57e-01

Pos logit contributions
present+3.333
 Bris+2.990
 Practices+2.840
 Torn+2.770
eous+2.715
Neg logit contributions
��-7.227
 vet-5.253
ibaba-4.927
 manipulative-4.780
yden-4.752
Token
Token ID198
Feature activation+2.43e-01

Pos logit contributions
present+2.336
 Bris+2.176
 Practices+2.057
 Torn+2.024
eous+1.978
Neg logit contributions
��-3.933
 vet-2.859
ibaba-2.619
 manipulative-2.549
yden-2.531
Token
Token ID198
Feature activation+1.91e-01

Pos logit contributions
present+1.035
 Bris+0.941
 Practices+0.880
 Torn+0.863
eous+0.842
Neg logit contributions
��-2.313
 vet-1.737
ibaba-1.609
 manipulative-1.571
yden-1.561
TokenPosition
Token ID26545
Feature activation+2.96e-01

Pos logit contributions
present+0.959
 Bris+0.915
 Practices+0.863
 Torn+0.853
eous+0.829
Neg logit contributions
��-1.631
 vet-1.196
ibaba-1.092
 manipulative-1.059
yden-1.054
Token:
Token ID25
Feature activation+4.43e-01

Pos logit contributions
present+1.412
 Bris+1.296
 Practices+1.223
 Torn+1.197
eous+1.179
Neg logit contributions
��-2.679
 vet-1.968
ibaba-1.816
 manipulative-1.770
yden-1.757
Token four
Token ID1440
Feature activation+8.97e-02

Pos logit contributions
present+1.472
 Bris+1.385
 Practices+1.319
 Torn+1.301
eous+1.281
Neg logit contributions
��-1.939
 vet-1.366
ibaba-1.227
 manipulative-1.192
yden-1.181
Token other
Token ID584
Feature activation-1.15e-01

Pos logit contributions
present+0.544
 Bris+0.514
 Practices+0.488
 Torn+0.481
eous+0.479
Neg logit contributions
��-0.681
 vet-0.479
ibaba-0.424
 manipulative-0.418
yden-0.411
Token
Token ID564
Feature activation-1.90e-02

Pos logit contributions
��+0.841
 vet+0.587
 manipulative+0.511
ibaba+0.511
yden+0.499
Neg logit contributions
present-0.720
 Bris-0.682
 Practices-0.649
 Torn-0.638
eous-0.638
Token
Token ID250
Feature activation-8.99e-02

Pos logit contributions
��+0.095
 vet+0.050
ibaba+0.039
 manipulative+0.037
yden+0.036
Neg logit contributions
present-0.167
 Bris-0.160
 Practices-0.155
 Torn-0.154
eous-0.152
Tokenhouse
Token ID4803
Feature activation+2.34e-01

Pos logit contributions
��+0.658
 vet+0.453
ibaba+0.398
 manipulative+0.391
yden+0.389
Neg logit contributions
present-0.559
 Bris-0.542
 Practices-0.515
 Torn-0.507
eous-0.499
Tokenhold
Token ID2946
Feature activation+7.59e-01

Pos logit contributions
present+1.036
 Bris+0.959
 Practices+0.897
 Torn+0.877
eous+0.863
Neg logit contributions
��-2.168
 vet-1.639
ibaba-1.495
 manipulative-1.468
yden-1.460
Token
Token ID447
Feature activation-3.92e-02

Pos logit contributions
present+3.883
 Bris+3.583
 Practices+3.381
 Torn+3.350
eous+3.259
Neg logit contributions
��-6.469
 vet-4.619
ibaba-4.354
 manipulative-4.154
yden-4.137
Token
Token ID251
Feature activation+8.35e-02

Pos logit contributions
��+0.194
 vet+0.101
ibaba+0.079
 manipulative+0.073
yden+0.072
Neg logit contributions
present-0.345
 Bris-0.333
 Practices-0.322
 Torn-0.319
eous-0.315
Token members
Token ID1866
Feature activation+1.94e-02

Pos logit contributions
present+0.449
 Bris+0.422
 Practices+0.399
 Torn+0.394
eous+0.386
Neg logit contributions
��-0.693
 vet-0.502
ibaba-0.453
 manipulative-0.443
yden-0.439
Token.
Token ID13
Feature activation+4.92e-02

Pos logit contributions
present+0.109
 Bris+0.103
 Practices+0.098
 Torn+0.097
eous+0.095
Neg logit contributions
��-0.154
 vet-0.111
ibaba-0.099
 manipulative-0.098
yden-0.096
Token But
Token ID887
Feature activation+4.45e-02

Pos logit contributions
present+0.253
 Bris+0.236
 Practices+0.221
 Torn+0.219
eous+0.217
Neg logit contributions
��-0.414
 vet-0.306
ibaba-0.274
 manipulative-0.270
yden-0.269
Token advertisement
Token ID15422
Feature activation-3.12e-01

Pos logit contributions
��+5.870
 vet+4.300
ibaba+3.878
 col+3.784
 manipulative+3.752
Neg logit contributions
present-4.794
 Bris-4.530
 Torn-4.337
 Practices-4.321
eous-4.309
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
��+2.665
 vet+2.028
ibaba+1.812
 manipulative+1.801
yden+1.792
Neg logit contributions
present-1.451
 Bris-1.395
 Practices-1.297
 Torn-1.276
eous-1.249
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
present+0.816
 Bris+0.752
 Practices+0.702
 Torn+0.688
eous+0.671
Neg logit contributions
��-1.761
 vet-1.327
ibaba-1.222
 manipulative-1.196
yden-1.188
TokenStory
Token ID11605
Feature activation+1.04e-01

Pos logit contributions
present+1.243
 Bris+1.177
 Practices+1.116
 Torn+1.097
eous+1.076
Neg logit contributions
��-1.831
 vet-1.309
ibaba-1.182
 manipulative-1.154
yden-1.147
Token continues
Token ID4477
Feature activation-2.32e-01

Pos logit contributions
present+0.528
 Bris+0.493
 Practices+0.463
 Torn+0.458
eous+0.449
Neg logit contributions
��-0.890
 vet-0.655
ibaba-0.593
 manipulative-0.583
yden-0.577
Token below
Token ID2174
Feature activation-9.34e-01

Pos logit contributions
��+1.813
 vet+1.305
ibaba+1.159
 manipulative+1.138
yden+1.127
Neg logit contributions
present-1.321
 Bris-1.253
 Practices-1.197
 Torn-1.179
eous-1.155
Token advertisement
Token ID15422
Feature activation-2.52e-01

Pos logit contributions
��+6.774
 vet+4.993
ibaba+4.410
 col+4.379
 manipulative+4.364
Neg logit contributions
present-5.374
 Bris-5.160
 Practices-5.016
eous-4.906
 Torn-4.902
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
��+2.166
 vet+1.649
ibaba+1.477
 manipulative+1.466
yden+1.457
Neg logit contributions
present-1.170
 Bris-1.119
 Practices-1.045
 Torn-1.026
eous-1.005
Token
Token ID198
Feature activation+5.32e-02

Pos logit contributions
present+0.508
 Bris+0.472
 Practices+0.440
 Torn+0.432
eous+0.421
Neg logit contributions
��-1.062
 vet-0.802
ibaba-0.735
 manipulative-0.720
yden-0.715
TokenThe
Token ID464
Feature activation-5.43e-02

Pos logit contributions
present+0.286
 Bris+0.273
 Practices+0.258
 Torn+0.254
eous+0.248
Neg logit contributions
��-0.437
 vet-0.314
ibaba-0.281
 manipulative-0.277
yden-0.276
Token government
Token ID1230
Feature activation-1.35e-01

Pos logit contributions
��+0.413
 vet+0.295
ibaba+0.258
 manipulative+0.257
yden+0.251
Neg logit contributions
present-0.327
 Bris-0.306
 Practices-0.291
 Torn-0.286
eous-0.285
Token professional
Token ID4708
Feature activation-3.98e-02

Pos logit contributions
��+0.866
 vet+0.596
ibaba+0.503
 manipulative+0.500
yden+0.486
Neg logit contributions
present-0.839
 Bris-0.795
 Practices-0.760
 Torn-0.747
eous-0.744
Token help
Token ID1037
Feature activation-3.84e-03

Pos logit contributions
��+0.271
 vet+0.187
 manipulative+0.156
ibaba+0.154
yden+0.150
Neg logit contributions
present-0.272
 Bris-0.258
 Practices-0.247
eous-0.244
 Torn-0.244
Token.
Token ID13
Feature activation+6.70e-02

Pos logit contributions
��+0.032
 vet+0.023
ibaba+0.021
 manipulative+0.021
yden+0.020
Neg logit contributions
present-0.020
 Bris-0.019
 Practices-0.018
 Torn-0.018
eous-0.017
Token Continue
Token ID10054
Feature activation-4.55e-02

Pos logit contributions
present+0.340
 Bris+0.316
 Practices+0.295
 Torn+0.292
eous+0.291
Neg logit contributions
��-0.572
 vet-0.423
ibaba-0.379
 manipulative-0.376
yden-0.369
Token Reading
Token ID11725
Feature activation+3.45e-02

Pos logit contributions
��+0.347
 vet+0.246
ibaba+0.213
 manipulative+0.213
yden+0.208
Neg logit contributions
present-0.274
 Bris-0.259
 Practices-0.244
 Torn-0.243
eous-0.242
Token Below
Token ID10383
Feature activation-9.31e-01

Pos logit contributions
present+0.153
 Bris+0.140
 Practices+0.130
 Torn+0.128
eous+0.126
Neg logit contributions
��-0.317
 vet-0.240
ibaba-0.220
 manipulative-0.216
yden-0.214
Token Advertisement
Token ID39711
Feature activation-3.52e-01

Pos logit contributions
��+6.751
 vet+4.929
ibaba+4.362
 manipulative+4.226
 col+4.211
Neg logit contributions
present-5.337
 Bris-4.936
 Practices-4.763
eous-4.752
 Torn-4.659
Token
Token ID198
Feature activation-4.16e-02

Pos logit contributions
��+3.012
 vet+2.254
 manipulative+2.017
ibaba+2.009
yden+1.965
Neg logit contributions
present-1.718
 Bris-1.564
 Practices-1.469
eous-1.450
 Torn-1.425
Token
Token ID198
Feature activation+4.08e-02

Pos logit contributions
��+0.373
 vet+0.283
ibaba+0.257
 manipulative+0.253
yden+0.251
Neg logit contributions
present-0.187
 Bris-0.176
 Practices-0.164
 Torn-0.161
eous-0.157
Token9
Token ID24
Feature activation-1.93e-02

Pos logit contributions
present+0.203
 Bris+0.195
 Practices+0.185
 Torn+0.181
 Bind+0.176
Neg logit contributions
��-0.348
 vet-0.255
ibaba-0.230
 manipulative-0.227
yden-0.226
Token Work
Token ID5521
Feature activation+9.79e-02

Pos logit contributions
��+0.162
 vet+0.119
ibaba+0.106
 manipulative+0.105
yden+0.104
Neg logit contributions
present-0.100
 Bris-0.093
 Practices-0.087
eous-0.086
 Torn-0.085
Token Carly
Token ID35982
Feature activation+1.51e-02

Pos logit contributions
��+0.680
 vet+0.505
ibaba+0.442
 manipulative+0.440
 col+0.437
Neg logit contributions
present-0.429
 Bris-0.394
eous-0.371
 Practices-0.370
 Torn-0.355
Token Fiorina
Token ID49028
Feature activation-1.21e-01

Pos logit contributions
present+0.066
 Bris+0.059
 Practices+0.056
eous+0.055
 Torn+0.054
Neg logit contributions
��-0.139
 vet-0.106
ibaba-0.096
 manipulative-0.095
yden-0.095
Token's
Token ID338
Feature activation-1.73e-01

Pos logit contributions
��+1.020
 vet+0.756
ibaba+0.671
 manipulative+0.665
yden+0.652
Neg logit contributions
present-0.634
 Bris-0.586
 Practices-0.555
eous-0.547
 Torn-0.542
Token political
Token ID1964
Feature activation+2.79e-02

Pos logit contributions
��+1.264
 vet+0.897
 manipulative+0.775
ibaba+0.772
 col+0.749
Neg logit contributions
present-1.085
 Bris-1.021
 Practices-0.983
eous-0.971
 Torn-0.954
Token career
Token ID3451
Feature activation-2.31e-01

Pos logit contributions
present+0.181
 Bris+0.170
 Practices+0.163
eous+0.163
 Torn+0.161
Neg logit contributions
��-0.200
 vet-0.141
ibaba-0.121
 manipulative-0.120
 col-0.117
Token Fiorina
Token ID49028
Feature activation-9.29e-01

Pos logit contributions
��+1.943
 vet+1.442
ibaba+1.297
 manipulative+1.257
yden+1.236
Neg logit contributions
present-1.194
 Bris-1.099
eous-1.045
 Practices-1.041
 Torn-0.995
Token,
Token ID11
Feature activation-4.72e-01

Pos logit contributions
��+7.454
 vet+5.728
 manipulative+4.979
 col+4.960
yden+4.771
Neg logit contributions
present-4.596
 Bris-4.271
eous-4.105
 Practices-4.075
 Torn-4.031
Token right
Token ID826
Feature activation+3.28e-02

Pos logit contributions
��+3.319
 vet+2.288
 manipulative+1.960
 col+1.919
ibaba+1.906
Neg logit contributions
present-3.040
 Bris-2.884
 Practices-2.808
eous-2.806
 Torn-2.698
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
present+0.167
 Bris+0.157
 Practices+0.149
 Torn+0.145
eous+0.142
Neg logit contributions
��-0.279
 vet-0.206
ibaba-0.185
 manipulative-0.183
yden-0.181
Token poses
Token ID17313
Feature activation-1.10e-01

Pos logit contributions
��+2.098
 vet+1.570
 manipulative+1.400
ibaba+1.390
 col+1.377
Neg logit contributions
present-1.203
 Bris-1.128
 Practices-1.061
eous-1.055
 Torn-1.053
Token for
Token ID329
Feature activation-2.37e-01

Pos logit contributions
��+0.886
 vet+0.648
ibaba+0.576
 manipulative+0.569
yden+0.563
Neg logit contributions
present-0.603
 Bris-0.569
 Practices-0.543
 Torn-0.535
eous-0.524
TokenFree
Token ID11146
Feature activation-4.93e-02

Pos logit contributions
��+0.768
 vet+0.561
ibaba+0.509
 manipulative+0.501
yden+0.498
Neg logit contributions
present-0.443
 Bris-0.429
 Practices-0.405
 Torn-0.395
eous-0.383
Token variations
Token ID13991
Feature activation-1.00e-01

Pos logit contributions
��+0.382
 vet+0.275
ibaba+0.242
 manipulative+0.238
yden+0.233
Neg logit contributions
present-0.290
 Bris-0.273
 Practices-0.258
 Torn-0.255
eous-0.251
Token in
Token ID287
Feature activation-1.82e-01

Pos logit contributions
��+0.865
 vet+0.647
ibaba+0.580
 manipulative+0.574
 col+0.562
Neg logit contributions
present-0.495
 Bris-0.462
 Practices-0.435
 Torn-0.425
eous-0.418
Token analysis
Token ID3781
Feature activation-5.17e-02

Pos logit contributions
��+1.385
 vet+0.991
ibaba+0.866
 manipulative+0.861
 col+0.841
Neg logit contributions
present-1.099
 Bris-1.025
 Practices-0.977
eous-0.959
 Torn-0.957
Token
Token ID198
Feature activation-8.26e-01

Pos logit contributions
��+0.450
 vet+0.338
ibaba+0.304
 manipulative+0.302
 col+0.295
Neg logit contributions
present-0.251
 Bris-0.236
 Practices-0.219
 Torn-0.217
eous-0.214
Token
Token ID198
Feature activation-8.91e-01

Pos logit contributions
��+6.184
 vet+4.811
 col+4.237
 manipulative+4.141
yden+4.098
Neg logit contributions
 Bris-4.100
present-4.062
 Practices-3.782
 Torn-3.728
 TAMADRA-3.669
TokenDid
Token ID11633
Feature activation-1.48e-02

Pos logit contributions
��+6.903
 vet+4.906
 manipulative+4.366
yden+4.308
 col+4.295
Neg logit contributions
 Bris-4.592
present-4.474
 Practices-4.313
 Torn-4.181
 Hind-4.120
Token you
Token ID345
Feature activation-3.50e-01

Pos logit contributions
��+0.107
 vet+0.074
ibaba+0.065
 manipulative+0.064
yden+0.062
Neg logit contributions
present-0.095
 Bris-0.090
 Practices-0.086
 Torn-0.085
eous-0.083
Token ever
Token ID1683
Feature activation-1.25e-01

Pos logit contributions
��+2.206
 vet+1.497
ibaba+1.216
 manipulative+1.198
 col+1.173
Neg logit contributions
present-2.536
 Bris-2.432
 Practices-2.322
 Torn-2.288
eous-2.283
Token wonder
Token ID4240
Feature activation-3.64e-01

Pos logit contributions
��+0.877
 vet+0.613
ibaba+0.525
 manipulative+0.513
yden+0.505
Neg logit contributions
present-0.820
 Bris-0.784
 Practices-0.747
 Torn-0.736
eous-0.730
Token what
Token ID644
Feature activation-1.81e-01

Pos logit contributions
��+2.562
 vet+1.803
 manipulative+1.545
ibaba+1.518
 col+1.514
Neg logit contributions
present-2.318
 Bris-2.208
 Practices-2.106
 Torn-2.049
eous-2.037
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
present+0.223
 Bris+0.208
 Practices+0.196
 Torn+0.193
eous+0.189
Neg logit contributions
��-0.377
 vet-0.277
ibaba-0.251
 manipulative-0.246
yden-0.244
Token
Token ID198
Feature activation-7.70e-02

Pos logit contributions
��+1.390
 vet+1.013
 manipulative+0.894
ibaba+0.890
yden+0.875
Neg logit contributions
present-0.921
 Bris-0.862
 Practices-0.809
eous-0.805
 Torn-0.792
Token
Token ID198
Feature activation+8.77e-02

Pos logit contributions
��+0.682
 vet+0.512
ibaba+0.465
 manipulative+0.460
yden+0.456
Neg logit contributions
present-0.349
 Bris-0.334
 Practices-0.311
 Torn-0.304
eous-0.299
TokenContinue
Token ID29453
Feature activation+2.62e-02

Pos logit contributions
present+0.447
 Bris+0.426
 Practices+0.402
 Torn+0.394
eous+0.386
Neg logit contributions
��-0.744
 vet-0.542
ibaba-0.491
 manipulative-0.483
yden-0.479
Token Reading
Token ID11725
Feature activation+8.77e-02

Pos logit contributions
present+0.147
 Bris+0.139
 Practices+0.131
 Torn+0.129
eous+0.129
Neg logit contributions
��-0.211
 vet-0.151
ibaba-0.135
 manipulative-0.133
yden-0.131
Token Below
Token ID10383
Feature activation-8.75e-01

Pos logit contributions
present+0.500
 Bris+0.469
 Practices+0.446
 Torn+0.439
eous+0.434
Neg logit contributions
��-0.695
 vet-0.497
ibaba-0.448
 manipulative-0.435
yden-0.431
Token Advertisement
Token ID39711
Feature activation-6.39e-02

Pos logit contributions
��+6.320
 vet+4.517
ibaba+4.084
 Johannes+3.873
 col+3.856
Neg logit contributions
present-5.072
 Bris-4.734
 Practices-4.661
eous-4.560
 Torn-4.415
Token<|endoftext|>
Token ID50256
Feature activation-5.14e-02

Pos logit contributions
��+0.564
 vet+0.419
ibaba+0.385
 manipulative+0.375
yden+0.373
Neg logit contributions
present-0.300
 Bris-0.279
 Practices-0.260
 Torn-0.253
eous-0.252
TokenThere
Token ID1858
Feature activation-7.41e-02

Pos logit contributions
��+0.427
 vet+0.308
ibaba+0.280
 manipulative+0.273
yden+0.273
Neg logit contributions
present-0.269
 Bris-0.260
 Practices-0.246
 Torn-0.240
eous-0.235
Token are
Token ID389
Feature activation+3.52e-01

Pos logit contributions
��+0.622
 vet+0.458
ibaba+0.415
 manipulative+0.402
yden+0.398
Neg logit contributions
present-0.386
 Bris-0.361
 Practices-0.344
 Torn-0.336
eous-0.332
Token many
Token ID867
Feature activation+3.45e-01

Pos logit contributions
present+2.219
 Bris+2.094
 Practices+2.002
 Torn+1.972
eous+1.948
Neg logit contributions
��-2.614
 vet-1.808
ibaba-1.601
 manipulative-1.563
yden-1.538
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
��+0.442
 vet+0.331
ibaba+0.302
 manipulative+0.295
yden+0.293
Neg logit contributions
present-0.238
 Bris-0.223
 Practices-0.207
 Torn-0.202
eous-0.202
Token
Token ID198
Feature activation+3.24e-02

Pos logit contributions
��+0.136
 vet+0.103
ibaba+0.094
 manipulative+0.092
yden+0.092
Neg logit contributions
present-0.068
 Bris-0.064
 Practices-0.060
 Torn-0.059
eous-0.057
TokenAt
Token ID2953
Feature activation-1.07e-01

Pos logit contributions
present+0.164
 Bris+0.157
 Practices+0.148
 Torn+0.144
eous+0.141
Neg logit contributions
��-0.276
 vet-0.202
ibaba-0.184
 manipulative-0.180
yden-0.180
Tokenomic
Token ID10179
Feature activation-3.49e-01

Pos logit contributions
��+0.859
 vet+0.619
ibaba+0.561
 manipulative+0.542
yden+0.537
Neg logit contributions
present-0.598
 Bris-0.567
 Practices-0.541
 Torn-0.530
eous-0.523
Token
Token ID198
Feature activation-7.40e-01

Pos logit contributions
��+2.756
 vet+1.978
 manipulative+1.736
ibaba+1.734
yden+1.701
Neg logit contributions
present-1.981
 Bris-1.873
 Practices-1.745
 Torn-1.727
eous-1.709
Token
Token ID198
Feature activation-8.70e-01

Pos logit contributions
��+5.648
 vet+4.386
 col+3.880
 manipulative+3.812
ibaba+3.784
Neg logit contributions
 Bris-3.662
present-3.618
 Practices-3.377
 Torn-3.315
 TAMADRA-3.257
TokenThe
Token ID464
Feature activation-2.85e-01

Pos logit contributions
��+6.791
 vet+4.901
ibaba+4.388
 manipulative+4.372
yden+4.355
Neg logit contributions
 Bris-4.502
present-4.363
 Practices-4.224
 Torn-4.035
eous-4.007
Token heart
Token ID2612
Feature activation+1.25e-01

Pos logit contributions
��+2.149
 vet+1.532
 manipulative+1.332
ibaba+1.328
yden+1.289
Neg logit contributions
present-1.720
 Bris-1.627
 Practices-1.540
eous-1.510
 Torn-1.496
Token of
Token ID286
Feature activation+1.61e-01

Pos logit contributions
present+0.605
 Bris+0.563
 Practices+0.530
 Torn+0.521
eous+0.510
Neg logit contributions
��-1.102
 vet-0.815
ibaba-0.745
 manipulative-0.729
yden-0.722
Token the
Token ID262
Feature activation+1.34e-01

Pos logit contributions
present+0.933
 Bris+0.878
 Practices+0.835
 Torn+0.822
eous+0.810
Neg logit contributions
��-1.266
 vet-0.895
ibaba-0.806
 manipulative-0.784
yden-0.776
Token library
Token ID5888
Feature activation+1.13e-01

Pos logit contributions
present+0.770
 Bris+0.723
 Practices+0.687
 Torn+0.677
eous+0.667
Neg logit contributions
��-1.069
 vet-0.760
ibaba-0.684
 manipulative-0.667
yden-0.660
Tokenvec
Token ID35138
Feature activation-5.53e-01

Pos logit contributions
��+1.264
 vet+0.940
ibaba+0.852
 manipulative+0.830
yden+0.829
Neg logit contributions
present-0.686
 Bris-0.664
 Practices-0.624
 Torn-0.610
eous-0.595
Token()
Token ID3419
Feature activation-5.97e-01

Pos logit contributions
��+4.438
 vet+3.196
ibaba+2.870
 manipulative+2.808
 col+2.803
Neg logit contributions
present-2.923
 Bris-2.887
 Practices-2.662
 Torn-2.633
eous-2.616
Token->
Token ID3784
Feature activation-4.65e-02

Pos logit contributions
��+4.645
 vet+3.369
ibaba+2.938
 col+2.920
 manipulative+2.900
Neg logit contributions
present-3.292
 Bris-3.157
 Practices-2.981
 Torn-2.943
eous-2.905
Tokeny
Token ID88
Feature activation-2.54e-01

Pos logit contributions
��+0.379
 vet+0.274
ibaba+0.247
yden+0.240
 manipulative+0.240
Neg logit contributions
present-0.251
 Bris-0.241
 Practices-0.228
 Torn-0.224
eous-0.219
Token()
Token ID3419
Feature activation-6.50e-01

Pos logit contributions
��+2.133
 vet+1.545
ibaba+1.398
 manipulative+1.368
yden+1.350
Neg logit contributions
present-1.316
 Bris-1.265
 Practices-1.176
 Torn-1.164
eous-1.141
Token ==
Token ID6624
Feature activation-8.63e-01

Pos logit contributions
��+4.580
 vet+3.230
ibaba+2.778
 col+2.776
 manipulative+2.741
Neg logit contributions
present-3.933
 Bris-3.845
 Practices-3.628
 Torn-3.603
eous-3.557
Token 0
Token ID657
Feature activation-3.40e-01

Pos logit contributions
��+5.056
 vet+3.458
 col+2.832
ibaba+2.747
 manipulative+2.720
Neg logit contributions
present-6.199
 Bris-6.003
 Practices-5.844
eous-5.745
 Torn-5.609
Token.
Token ID13
Feature activation-6.31e-03

Pos logit contributions
��+2.766
 vet+2.039
ibaba+1.838
yden+1.816
 manipulative+1.785
Neg logit contributions
present-1.773
 Bris-1.714
 Practices-1.606
 Torn-1.581
eous-1.533
Token0
Token ID15
Feature activation-6.14e-01

Pos logit contributions
��+0.050
 vet+0.035
ibaba+0.032
yden+0.031
 manipulative+0.031
Neg logit contributions
present-0.036
 Bris-0.035
 Practices-0.033
 Torn-0.032
eous-0.032
Tokenf
Token ID69
Feature activation+2.95e-02

Pos logit contributions
��+5.135
 vet+3.807
ibaba+3.428
yden+3.409
 manipulative+3.338
Neg logit contributions
present-2.993
 Bris-2.904
 Practices-2.765
 Torn-2.674
eous-2.566
Token);
Token ID1776
Feature activation-1.43e-02

Pos logit contributions
present+0.126
 Bris+0.117
 Practices+0.109
 Torn+0.106
eous+0.104
Neg logit contributions
��-0.277
 vet-0.209
ibaba-0.192
 manipulative-0.188
yden-0.187
Token AL
Token ID8355
Feature activation-2.69e-01

Pos logit contributions
��+3.684
 vet+2.760
ibaba+2.400
 manipulative+2.391
 col+2.363
Neg logit contributions
present-2.266
 Bris-2.108
 Practices-1.958
eous-1.940
 Torn-1.918
TokenERT
Token ID17395
Feature activation-2.56e-01

Pos logit contributions
��+2.060
 vet+1.439
ibaba+1.270
 manipulative+1.260
 col+1.231
Neg logit contributions
present-1.544
 Bris-1.511
 Practices-1.453
eous-1.403
 Torn-1.377
Token PH
Token ID9370
Feature activation-2.06e-01

Pos logit contributions
��+2.141
 vet+1.583
ibaba+1.404
 manipulative+1.371
 col+1.355
Neg logit contributions
present-1.336
 Bris-1.245
 Practices-1.150
eous-1.133
 Torn-1.132
TokenAS
Token ID1921
Feature activation-1.31e-01

Pos logit contributions
��+1.796
 vet+1.311
ibaba+1.180
yden+1.168
 manipulative+1.163
Neg logit contributions
present-1.028
 Bris-0.955
 Practices-0.907
 Torn-0.876
eous-0.874
TokenES
Token ID1546
Feature activation-3.37e-01

Pos logit contributions
��+1.216
 vet+0.910
ibaba+0.821
 manipulative+0.818
yden+0.805
Neg logit contributions
present-0.572
 Bris-0.535
 Practices-0.497
 Torn-0.480
eous-0.479
Token Phase
Token ID18983
Feature activation-8.53e-01

Pos logit contributions
��+2.815
 vet+2.102
ibaba+1.845
 manipulative+1.829
 col+1.797
Neg logit contributions
present-1.748
 Bris-1.625
eous-1.504
 Practices-1.502
 Torn-1.467
Token 1
Token ID352
Feature activation-4.19e-01

Pos logit contributions
��+6.730
 vet+5.033
ibaba+4.425
 manipulative+4.305
 col+4.267
Neg logit contributions
present-4.630
 Bris-4.336
 Torn-4.001
 Practices-3.986
eous-3.982
Token:
Token ID25
Feature activation-1.38e-01

Pos logit contributions
��+3.469
 vet+2.581
ibaba+2.286
 manipulative+2.235
 col+2.209
Neg logit contributions
present-2.195
 Bris-2.075
 Practices-1.908
eous-1.896
 Torn-1.876
Token No
Token ID1400
Feature activation-1.48e-01

Pos logit contributions
��+1.161
 vet+0.854
ibaba+0.772
 manipulative+0.750
yden+0.736
Neg logit contributions
present-0.719
 Bris-0.677
 Practices-0.632
eous-0.620
 Torn-0.620
Token infections
Token ID16079
Feature activation-2.03e-01

Pos logit contributions
��+1.079
 vet+0.750
ibaba+0.657
 manipulative+0.635
 col+0.625
Neg logit contributions
present-0.947
 Bris-0.900
 Practices-0.849
 Torn-0.835
eous-0.833
Token in
Token ID287
Feature activation+4.15e-04

Pos logit contributions
��+1.687
 vet+1.252
ibaba+1.114
 manipulative+1.096
 col+1.079
Neg logit contributions
present-1.054
 Bris-0.987
 Practices-0.920
eous-0.907
 Torn-0.903
Token We
Token ID775
Feature activation+8.29e-02

Pos logit contributions
present+0.847
 Bris+0.790
 Practices+0.737
 Torn+0.732
eous+0.727
Neg logit contributions
��-1.355
 vet-0.985
ibaba-0.901
 manipulative-0.874
yden-0.868
Token Can
Token ID1680
Feature activation+8.50e-02

Pos logit contributions
present+0.412
 Bris+0.379
 Practices+0.356
eous+0.352
 Torn+0.350
Neg logit contributions
��-0.719
 vet-0.532
ibaba-0.483
yden-0.470
 manipulative-0.469
Token Do
Token ID2141
Feature activation-1.93e-01

Pos logit contributions
present+0.431
 Bris+0.401
 Practices+0.375
eous+0.368
 Torn+0.368
Neg logit contributions
��-0.730
 vet-0.538
ibaba-0.488
 manipulative-0.474
yden-0.473
Token About
Token ID7994
Feature activation+2.69e-02

Pos logit contributions
��+1.609
 vet+1.179
ibaba+1.077
 manipulative+1.041
yden+1.033
Neg logit contributions
present-1.006
 Bris-0.930
 Practices-0.874
 Torn-0.854
eous-0.853
Token It
Token ID632
Feature activation-7.30e-02

Pos logit contributions
present+0.142
 Bris+0.132
 Practices+0.123
eous+0.122
 Torn+0.121
Neg logit contributions
��-0.227
 vet-0.164
ibaba-0.151
 manipulative-0.146
yden-0.145
Token by
Token ID416
Feature activation-8.48e-01

Pos logit contributions
��+0.606
 vet+0.436
ibaba+0.396
 manipulative+0.386
yden+0.381
Neg logit contributions
present-0.393
 Bris-0.369
 Practices-0.347
 Torn-0.342
eous-0.337
Token Marc
Token ID13067
Feature activation-3.41e-02

Pos logit contributions
��+6.395
 vet+4.683
 manipulative+4.060
 col+3.988
yden+3.960
Neg logit contributions
present-4.770
 Bris-4.377
 Practices-4.243
eous-4.207
 Torn-4.107
Token Goodman
Token ID33318
Feature activation-1.11e-01

Pos logit contributions
��+0.290
 vet+0.213
ibaba+0.194
 manipulative+0.188
yden+0.188
Neg logit contributions
present-0.174
 Bris-0.159
 Practices-0.152
eous-0.149
 Torn-0.146
Token,
Token ID11
Feature activation-2.10e-01

Pos logit contributions
��+0.949
 vet+0.700
ibaba+0.631
 manipulative+0.619
yden+0.615
Neg logit contributions
present-0.562
 Bris-0.522
 Practices-0.494
 Torn-0.484
eous-0.476
Token available
Token ID1695
Feature activation+2.42e-02

Pos logit contributions
��+1.656
 vet+1.188
ibaba+1.049
 manipulative+1.038
yden+1.022
Neg logit contributions
present-1.199
 Bris-1.126
 Practices-1.075
 Torn-1.061
eous-1.052
Token February
Token ID3945
Feature activation+2.73e-01

Pos logit contributions
present+0.133
 Bris+0.125
 Practices+0.118
 Torn+0.117
eous+0.114
Neg logit contributions
��-0.198
 vet-0.142
ibaba-0.129
 manipulative-0.126
yden-0.125
Token.
Token ID13
Feature activation-2.67e-02

Pos logit contributions
��+1.515
 vet+1.128
 manipulative+0.990
yden+0.983
ibaba+0.982
Neg logit contributions
present-0.988
 Bris-0.930
 Practices-0.890
 Torn-0.872
 Bind-0.846
Token
Token ID198
Feature activation+4.22e-03

Pos logit contributions
��+0.216
 vet+0.161
ibaba+0.141
 manipulative+0.141
 col+0.139
Neg logit contributions
present-0.140
 Bris-0.131
eous-0.122
 Practices-0.121
 Torn-0.120
Token
Token ID198
Feature activation-3.80e-03

Pos logit contributions
present+0.019
 Bris+0.018
 Practices+0.017
 Torn+0.016
eous+0.016
Neg logit contributions
��-0.038
 vet-0.029
ibaba-0.026
 manipulative-0.026
yden-0.026
TokenStory
Token ID11605
Feature activation-9.42e-02

Pos logit contributions
��+0.031
 vet+0.022
ibaba+0.020
 manipulative+0.019
yden+0.019
Neg logit contributions
present-0.021
 Bris-0.020
 Practices-0.019
 Torn-0.019
eous-0.018
Token continues
Token ID4477
Feature activation-1.32e-01

Pos logit contributions
��+0.793
 vet+0.584
ibaba+0.526
 manipulative+0.520
yden+0.511
Neg logit contributions
present-0.486
 Bris-0.458
 Practices-0.428
 Torn-0.427
eous-0.417
Token below
Token ID2174
Feature activation-8.42e-01

Pos logit contributions
��+1.053
 vet+0.758
ibaba+0.678
 manipulative+0.665
yden+0.657
Neg logit contributions
present-0.748
 Bris-0.706
 Practices-0.672
 Torn-0.664
eous-0.651
Token advertisement
Token ID15422
Feature activation-2.38e-01

Pos logit contributions
��+6.179
 vet+4.595
ibaba+4.016
 manipulative+4.007
 col+4.007
Neg logit contributions
present-4.811
 Bris-4.596
 Practices-4.452
 Torn-4.364
eous-4.362
Token
Token ID198
Feature activation+7.63e-02

Pos logit contributions
��+2.070
 vet+1.574
ibaba+1.417
 manipulative+1.402
yden+1.394
Neg logit contributions
present-1.089
 Bris-1.042
 Practices-0.973
 Torn-0.956
eous-0.931
Token
Token ID198
Feature activation+1.37e-01

Pos logit contributions
present+0.337
 Bris+0.314
 Practices+0.293
 Torn+0.288
eous+0.280
Neg logit contributions
��-0.699
 vet-0.528
ibaba-0.483
 manipulative-0.474
yden-0.470
TokenIt
Token ID1026
Feature activation+6.16e-03

Pos logit contributions
present+0.742
 Bris+0.708
 Practices+0.670
 Torn+0.659
eous+0.644
Neg logit contributions
��-1.121
 vet-0.805
ibaba-0.725
 manipulative-0.712
yden-0.708
Token's
Token ID338
Feature activation-1.66e-01

Pos logit contributions
present+0.031
 Bris+0.029
 Practices+0.027
 Torn+0.027
eous+0.026
Neg logit contributions
��-0.053
 vet-0.040
ibaba-0.036
 manipulative-0.035
yden-0.035
Token.
Token ID13
Feature activation-6.91e-03

Pos logit contributions
��+1.502
 vet+1.100
ibaba+0.983
 manipulative+0.957
yden+0.946
Neg logit contributions
present-0.946
 Bris-0.868
 Practices-0.813
 Torn-0.804
eous-0.792
Token
Token ID198
Feature activation-6.93e-02

Pos logit contributions
��+0.060
 vet+0.045
ibaba+0.040
yden+0.040
 manipulative+0.040
Neg logit contributions
present-0.033
 Bris-0.031
 Practices-0.028
 Torn-0.028
eous-0.028
Token
Token ID198
Feature activation-9.01e-02

Pos logit contributions
��+0.614
 vet+0.467
ibaba+0.423
 manipulative+0.416
yden+0.414
Neg logit contributions
present-0.315
 Bris-0.298
 Practices-0.277
 Torn-0.272
eous-0.265
TokenStory
Token ID11605
Feature activation-8.14e-02

Pos logit contributions
��+0.717
 vet+0.509
ibaba+0.455
yden+0.452
 manipulative+0.450
Neg logit contributions
present-0.491
 Bris-0.480
 Practices-0.455
 Torn-0.448
 Bind-0.436
Token continues
Token ID4477
Feature activation-1.57e-01

Pos logit contributions
��+0.689
 vet+0.507
ibaba+0.457
 manipulative+0.451
yden+0.444
Neg logit contributions
present-0.419
 Bris-0.394
 Practices-0.368
 Torn-0.367
eous-0.358
Token below
Token ID2174
Feature activation-8.33e-01

Pos logit contributions
��+1.245
 vet+0.896
ibaba+0.799
 manipulative+0.785
yden+0.777
Neg logit contributions
present-0.887
 Bris-0.839
 Practices-0.798
 Torn-0.789
eous-0.774
Token advertisement
Token ID15422
Feature activation-2.04e-01

Pos logit contributions
��+6.144
 vet+4.553
ibaba+4.016
 manipulative+3.971
 col+3.957
Neg logit contributions
present-4.757
 Bris-4.537
 Practices-4.391
 Torn-4.310
eous-4.300
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
��+1.786
 vet+1.358
ibaba+1.224
 manipulative+1.210
yden+1.203
Neg logit contributions
present-0.931
 Bris-0.888
 Practices-0.829
 Torn-0.814
eous-0.794
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
present+0.683
 Bris+0.631
 Practices+0.588
 Torn+0.577
eous+0.563
Neg logit contributions
��-1.456
 vet-1.098
ibaba-1.010
 manipulative-0.989
yden-0.982
Token"
Token ID1
Feature activation-2.41e-01

Pos logit contributions
present+0.844
 Bris+0.804
 Practices+0.762
 Torn+0.750
eous+0.734
Neg logit contributions
��-1.261
 vet-0.905
ibaba-0.814
 manipulative-0.799
yden-0.796
TokenInterestingly
Token ID33092
Feature activation-7.05e-02

Pos logit contributions
��+1.974
 vet+1.426
ibaba+1.273
 manipulative+1.270
yden+1.253
Neg logit contributions
present-1.255
 Bris-1.219
 Practices-1.137
 Torn-1.119
eous-1.107
Token in
Token ID287
Feature activation-7.04e-02

Pos logit contributions
��+0.343
 vet+0.255
ibaba+0.229
 manipulative+0.225
yden+0.221
Neg logit contributions
present-0.203
 Bris-0.190
 Practices-0.180
 Torn-0.176
 Bind-0.171
Token the
Token ID262
Feature activation-1.49e-01

Pos logit contributions
��+0.548
 vet+0.398
ibaba+0.349
 manipulative+0.346
 col+0.339
Neg logit contributions
present-0.409
 Bris-0.379
 Practices-0.363
 Torn-0.359
eous-0.354
Token institute
Token ID24224
Feature activation-1.17e-01

Pos logit contributions
��+1.210
 vet+0.890
 manipulative+0.781
ibaba+0.778
 col+0.769
Neg logit contributions
present-0.820
 Bris-0.759
 Practices-0.733
 Torn-0.715
eous-0.702
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
��+0.958
 vet+0.707
ibaba+0.625
 manipulative+0.618
 col+0.607
Neg logit contributions
present-0.625
 Bris-0.588
 Practices-0.557
 Torn-0.548
 Bind-0.532
Token Wh
Token ID854
Feature activation+9.23e-03

Pos logit contributions
��+1.159
 vet+0.861
ibaba+0.758
 manipulative+0.755
 col+0.737
Neg logit contributions
present-0.756
 Bris-0.711
 Practices-0.673
 Torn-0.664
eous-0.652
Tokenist
Token ID396
Feature activation-8.33e-01

Pos logit contributions
present+0.054
 Bris+0.051
 Practices+0.049
 Torn+0.048
eous+0.047
Neg logit contributions
��-0.071
 vet-0.051
ibaba-0.046
yden-0.044
 manipulative-0.044
Tokenling
Token ID1359
Feature activation-5.91e-01

Pos logit contributions
��+7.158
 vet+5.213
ibaba+4.714
yden+4.693
sterdam+4.674
Neg logit contributions
present-3.967
 Bris-3.728
 Torn-3.529
 Practices-3.332
 Bind-3.329
Token Woods
Token ID16483
Feature activation-2.47e-01

Pos logit contributions
��+4.906
 vet+3.662
ibaba+3.305
 manipulative+3.215
yden+3.213
Neg logit contributions
present-2.937
 Bris-2.736
 Practices-2.499
 Torn-2.470
eous-2.439
Token and
Token ID290
Feature activation-1.58e-01

Pos logit contributions
��+2.100
 vet+1.567
ibaba+1.378
 manipulative+1.370
 col+1.354
Neg logit contributions
present-1.255
 Bris-1.167
 Practices-1.085
 Torn-1.073
eous-1.053
Token it
Token ID340
Feature activation-6.75e-02

Pos logit contributions
��+1.316
 vet+0.965
ibaba+0.856
 manipulative+0.850
 col+0.830
Neg logit contributions
present-0.835
 Bris-0.772
 Practices-0.724
 Torn-0.719
eous-0.716
Token should
Token ID815
Feature activation-3.12e-01

Pos logit contributions
��+0.571
 vet+0.425
ibaba+0.380
 manipulative+0.374
 col+0.367
Neg logit contributions
present-0.345
 Bris-0.326
 Practices-0.308
 Torn-0.304
eous-0.296
Token {
Token ID1391
Feature activation+3.43e-01

Pos logit contributions
present+0.879
 Bris+0.823
 Practices+0.772
 Torn+0.760
eous+0.747
Neg logit contributions
��-1.509
 vet-1.112
ibaba-1.008
 manipulative-0.987
yden-0.978
Token //
Token ID3373
Feature activation-4.58e-02

Pos logit contributions
present+1.594
 Bris+1.473
 Practices+1.383
 Torn+1.359
eous+1.328
Neg logit contributions
��-3.102
 vet-2.306
ibaba-2.121
 manipulative-2.070
yden-2.057
Token Time
Token ID3862
Feature activation+6.53e-02

Pos logit contributions
��+0.367
 vet+0.264
ibaba+0.236
 manipulative+0.231
yden+0.228
Neg logit contributions
present-0.256
 Bris-0.245
 Practices-0.231
 Torn-0.227
eous-0.225
Token component
Token ID7515
Feature activation-3.53e-01

Pos logit contributions
present+0.354
 Bris+0.335
 Practices+0.314
 Torn+0.308
eous+0.306
Neg logit contributions
��-0.539
 vet-0.391
ibaba-0.353
 manipulative-0.345
yden-0.340
Token const
Token ID1500
Feature activation-3.97e-01

Pos logit contributions
��+2.864
 vet+2.128
 manipulative+1.852
ibaba+1.848
 col+1.840
Neg logit contributions
present-1.879
 Bris-1.807
 Practices-1.683
eous-1.651
 Torn-1.642
Token float
Token ID12178
Feature activation-8.32e-01

Pos logit contributions
��+3.009
 vet+2.174
 col+1.870
ibaba+1.848
 manipulative+1.841
Neg logit contributions
present-2.338
 Bris-2.238
 Practices-2.121
eous-2.081
 Torn-2.075
Token TA
Token ID21664
Feature activation-8.67e-02

Pos logit contributions
��+6.427
 vet+4.689
 col+4.102
 manipulative+3.998
ibaba+3.986
Neg logit contributions
present-4.588
 Bris-4.386
 Practices-4.196
eous-4.104
 Torn-4.006
TokenU
Token ID52
Feature activation-1.30e-01

Pos logit contributions
��+0.736
 vet+0.539
ibaba+0.488
yden+0.475
 manipulative+0.474
Neg logit contributions
present-0.447
 Bris-0.424
 Practices-0.398
 Torn-0.389
eous-0.386
Token =
Token ID796
Feature activation+1.88e-01

Pos logit contributions
��+1.104
 vet+0.811
ibaba+0.730
 manipulative+0.711
yden+0.709
Neg logit contributions
present-0.660
 Bris-0.629
 Practices-0.592
 Torn-0.577
eous-0.569
Token 6
Token ID718
Feature activation+6.38e-02

Pos logit contributions
present+1.107
 Bris+1.042
 Practices+0.992
 Torn+0.976
eous+0.964
Neg logit contributions
��-1.474
 vet-1.045
ibaba-0.930
 manipulative-0.908
yden-0.898
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
present+0.319
 Bris+0.300
 Practices+0.283
 Torn+0.277
eous+0.272
Neg logit contributions
��-0.555
 vet-0.407
ibaba-0.369
 manipulative-0.361
yden-0.359
Token
Token ID198
Feature activation-8.98e-02

Pos logit contributions
��+1.646
 vet+1.256
ibaba+1.123
 manipulative+1.113
yden+1.105
Neg logit contributions
present-0.878
 Bris-0.845
 Practices-0.784
 Torn-0.770
eous-0.746
TokenFree
Token ID11146
Feature activation-4.93e-02

Pos logit contributions
��+0.768
 vet+0.561
ibaba+0.509
 manipulative+0.501
yden+0.498
Neg logit contributions
present-0.443
 Bris-0.429
 Practices-0.405
 Torn-0.395
eous-0.383
Token variations
Token ID13991
Feature activation-1.00e-01

Pos logit contributions
��+0.382
 vet+0.275
ibaba+0.242
 manipulative+0.238
yden+0.233
Neg logit contributions
present-0.290
 Bris-0.273
 Practices-0.258
 Torn-0.255
eous-0.251
Token in
Token ID287
Feature activation-1.82e-01

Pos logit contributions
��+0.865
 vet+0.647
ibaba+0.580
 manipulative+0.574
 col+0.562
Neg logit contributions
present-0.495
 Bris-0.462
 Practices-0.435
 Torn-0.425
eous-0.418
Token analysis
Token ID3781
Feature activation-5.17e-02

Pos logit contributions
��+1.385
 vet+0.991
ibaba+0.866
 manipulative+0.861
 col+0.841
Neg logit contributions
present-1.099
 Bris-1.025
 Practices-0.977
eous-0.959
 Torn-0.957
Token
Token ID198
Feature activation-8.26e-01

Pos logit contributions
��+0.450
 vet+0.338
ibaba+0.304
 manipulative+0.302
 col+0.295
Neg logit contributions
present-0.251
 Bris-0.236
 Practices-0.219
 Torn-0.217
eous-0.214
Token
Token ID198
Feature activation-8.91e-01

Pos logit contributions
��+6.184
 vet+4.811
 col+4.237
 manipulative+4.141
yden+4.098
Neg logit contributions
 Bris-4.100
present-4.062
 Practices-3.782
 Torn-3.728
 TAMADRA-3.669
TokenDid
Token ID11633
Feature activation-1.48e-02

Pos logit contributions
��+6.903
 vet+4.906
 manipulative+4.366
yden+4.308
 col+4.295
Neg logit contributions
 Bris-4.592
present-4.474
 Practices-4.313
 Torn-4.181
 Hind-4.120
Token you
Token ID345
Feature activation-3.50e-01

Pos logit contributions
��+0.107
 vet+0.074
ibaba+0.065
 manipulative+0.064
yden+0.062
Neg logit contributions
present-0.095
 Bris-0.090
 Practices-0.086
 Torn-0.085
eous-0.083
Token ever
Token ID1683
Feature activation-1.25e-01

Pos logit contributions
��+2.206
 vet+1.497
ibaba+1.216
 manipulative+1.198
 col+1.173
Neg logit contributions
present-2.536
 Bris-2.432
 Practices-2.322
 Torn-2.288
eous-2.283
Token wonder
Token ID4240
Feature activation-3.64e-01

Pos logit contributions
��+0.877
 vet+0.613
ibaba+0.525
 manipulative+0.513
yden+0.505
Neg logit contributions
present-0.820
 Bris-0.784
 Practices-0.747
 Torn-0.736
eous-0.730
Token.
Token ID13
Feature activation-9.50e-03

Pos logit contributions
present+1.653
 Bris+1.537
 Practices+1.445
 Torn+1.425
eous+1.394
Neg logit contributions
��-2.963
 vet-2.170
ibaba-1.996
 manipulative-1.939
yden-1.931
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
��+0.081
 vet+0.061
ibaba+0.054
 manipulative+0.054
yden+0.054
Neg logit contributions
present-0.046
 Bris-0.043
eous-0.039
 Practices-0.039
 Torn-0.039
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
present+0.086
 Bris+0.080
 Practices+0.075
 Torn+0.073
eous+0.071
Neg logit contributions
��-0.175
 vet-0.132
ibaba-0.121
 manipulative-0.118
yden-0.118
TokenStory
Token ID11605
Feature activation-2.99e-02

Pos logit contributions
��+0.113
 vet+0.081
ibaba+0.073
yden+0.072
 manipulative+0.071
Neg logit contributions
present-0.073
 Bris-0.071
 Practices-0.067
 Torn-0.066
 Bind-0.064
Token continues
Token ID4477
Feature activation-2.40e-02

Pos logit contributions
��+0.253
 vet+0.185
ibaba+0.167
 manipulative+0.164
yden+0.163
Neg logit contributions
present-0.155
 Bris-0.145
 Practices-0.136
 Torn-0.134
eous-0.133
Token below
Token ID2174
Feature activation-8.25e-01

Pos logit contributions
��+0.190
 vet+0.136
ibaba+0.121
 manipulative+0.120
yden+0.118
Neg logit contributions
present-0.137
 Bris-0.129
 Practices-0.122
 Torn-0.122
eous-0.119
Token advertisement
Token ID15422
Feature activation-3.12e-01

Pos logit contributions
��+5.870
 vet+4.300
ibaba+3.878
 col+3.784
 manipulative+3.752
Neg logit contributions
present-4.794
 Bris-4.530
 Torn-4.337
 Practices-4.321
eous-4.309
Token
Token ID198
Feature activation+1.88e-01

Pos logit contributions
��+2.665
 vet+2.028
ibaba+1.812
 manipulative+1.801
yden+1.792
Neg logit contributions
present-1.451
 Bris-1.395
 Practices-1.297
 Torn-1.276
eous-1.249
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
present+0.816
 Bris+0.752
 Practices+0.702
 Torn+0.688
eous+0.671
Neg logit contributions
��-1.761
 vet-1.327
ibaba-1.222
 manipulative-1.196
yden-1.188
TokenStory
Token ID11605
Feature activation+1.04e-01

Pos logit contributions
present+1.243
 Bris+1.177
 Practices+1.116
 Torn+1.097
eous+1.076
Neg logit contributions
��-1.831
 vet-1.309
ibaba-1.182
 manipulative-1.154
yden-1.147
Token continues
Token ID4477
Feature activation-2.32e-01

Pos logit contributions
present+0.528
 Bris+0.493
 Practices+0.463
 Torn+0.458
eous+0.449
Neg logit contributions
��-0.890
 vet-0.655
ibaba-0.593
 manipulative-0.583
yden-0.577
Token
Token ID198
Feature activation+2.45e-01

Pos logit contributions
present+1.109
 Bris+1.009
 Practices+0.944
 Torn+0.925
eous+0.904
Neg logit contributions
��-2.474
 vet-1.860
ibaba-1.723
 manipulative-1.682
yden-1.671
TokenTravel
Token ID33074
Feature activation-9.85e-02

Pos logit contributions
present+1.346
 Bris+1.249
 Practices+1.186
 Torn+1.170
eous+1.146
Neg logit contributions
��-2.019
 vet-1.447
ibaba-1.311
 manipulative-1.279
yden-1.263
Token With
Token ID2080
Feature activation-2.76e-01

Pos logit contributions
��+0.794
 vet+0.578
ibaba+0.515
 manipulative+0.505
yden+0.498
Neg logit contributions
present-0.543
 Bris-0.515
 Practices-0.487
eous-0.476
 Torn-0.474
Token The
Token ID383
Feature activation-1.16e-01

Pos logit contributions
��+2.145
 vet+1.525
ibaba+1.365
yden+1.335
 manipulative+1.332
Neg logit contributions
present-1.595
 Bris-1.512
 Practices-1.429
eous-1.397
 Torn-1.387
Token Nation
Token ID8741
Feature activation-3.12e-01

Pos logit contributions
��+1.066
 vet+0.797
ibaba+0.730
yden+0.721
 manipulative+0.714
Neg logit contributions
present-0.519
 Bris-0.481
 Practices-0.451
 Torn-0.436
eous-0.428
Token Be
Token ID1355
Feature activation-8.24e-01

Pos logit contributions
��+2.664
 vet+2.000
ibaba+1.767
yden+1.733
 col+1.730
Neg logit contributions
present-1.550
 Bris-1.421
 Practices-1.376
eous-1.299
 Torn-1.293
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
��+5.537
 vet+3.792
yden+3.129
ibaba+3.100
 manipulative+3.098
Neg logit contributions
present-5.334
eous-5.196
 Bris-5.172
 Practices-4.968
 Torn-4.870
Token first
Token ID717
Feature activation-3.11e-01

Pos logit contributions
��+0.813
 vet+0.583
 manipulative+0.513
yden+0.508
ibaba+0.501
Neg logit contributions
present-0.594
 Bris-0.575
 Practices-0.541
 Torn-0.530
eous-0.518
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
��+2.353
 vet+1.676
ibaba+1.442
yden+1.430
 manipulative+1.422
Neg logit contributions
present-1.865
 Bris-1.775
 Practices-1.712
 Torn-1.664
 Bind-1.638
Token hear
Token ID3285
Feature activation-2.16e-01

Pos logit contributions
present+0.862
 Bris+0.823
 Practices+0.788
 Torn+0.779
eous+0.774
Neg logit contributions
��-0.806
 vet-0.540
ibaba-0.457
 manipulative-0.445
yden-0.439
Token about
Token ID546
Feature activation-5.27e-02

Pos logit contributions
��+1.812
 vet+1.329
ibaba+1.189
 manipulative+1.176
yden+1.157
Neg logit contributions
present-1.127
 Bris-1.060
 Practices-0.994
 Torn-0.981
eous-0.964
Token grid
Token ID10706
Feature activation-3.31e-02

Pos logit contributions
��+1.648
 vet+1.191
ibaba+1.044
 col+1.032
 manipulative+1.015
Neg logit contributions
present-1.240
 Bris-1.188
 Practices-1.122
 Torn-1.113
eous-1.111
Tokencell
Token ID3846
Feature activation+7.34e-03

Pos logit contributions
��+0.246
 vet+0.170
ibaba+0.150
 manipulative+0.146
 col+0.146
Neg logit contributions
present-0.205
 Bris-0.197
 Practices-0.188
 Torn-0.185
eous-0.183
Token volume
Token ID6115
Feature activation-3.34e-01

Pos logit contributions
present+0.041
 Bris+0.039
 Practices+0.037
 Torn+0.036
eous+0.036
Neg logit contributions
��-0.059
 vet-0.042
ibaba-0.038
 manipulative-0.037
 col-0.037
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
��+2.789
 vet+2.053
ibaba+1.825
 col+1.823
 manipulative+1.794
Neg logit contributions
present-1.700
 Bris-1.623
 Practices-1.523
 Torn-1.510
eous-1.485
Token const
Token ID1500
Feature activation-3.04e-01

Pos logit contributions
��+1.447
 vet+1.068
ibaba+0.956
 manipulative+0.937
yden+0.937
Neg logit contributions
present-0.837
 Bris-0.800
 Practices-0.753
 Torn-0.734
eous-0.730
Token float
Token ID12178
Feature activation-8.23e-01

Pos logit contributions
��+2.467
 vet+1.808
 col+1.596
ibaba+1.575
 manipulative+1.574
Neg logit contributions
present-1.641
 Bris-1.546
 Practices-1.448
 Torn-1.414
eous-1.411
Token one
Token ID530
Feature activation-9.89e-03

Pos logit contributions
��+6.569
 vet+4.853
 col+4.324
 manipulative+4.165
ibaba+4.160
Neg logit contributions
present-4.279
 Bris-4.083
 Practices-3.842
eous-3.793
 Torn-3.758
TokenOver
Token ID5886
Feature activation-2.38e-01

Pos logit contributions
��+0.084
 vet+0.061
ibaba+0.056
yden+0.055
 manipulative+0.054
Neg logit contributions
present-0.050
 Bris-0.048
 Practices-0.045
 Torn-0.044
eous-0.043
TokenCell
Token ID28780
Feature activation-3.60e-01

Pos logit contributions
��+2.042
 vet+1.494
ibaba+1.353
yden+1.335
 manipulative+1.313
Neg logit contributions
present-1.165
 Bris-1.113
 Practices-1.039
 Torn-1.036
 Bind-1.004
TokenVolume
Token ID31715
Feature activation-2.90e-01

Pos logit contributions
��+3.042
 vet+2.245
ibaba+2.034
yden+1.992
 col+1.987
Neg logit contributions
present-1.780
 Bris-1.703
 Practices-1.585
 Torn-1.563
eous-1.528
Token =
Token ID796
Feature activation+5.86e-02

Pos logit contributions
��+2.475
 vet+1.828
ibaba+1.661
yden+1.629
 col+1.625
Neg logit contributions
present-1.404
 Bris-1.338
 Practices-1.252
 Torn-1.235
eous-1.197
Tokenb
Token ID65
Feature activation-4.08e-01

Pos logit contributions
present+0.491
 Bris+0.455
 Practices+0.428
 Torn+0.417
eous+0.405
Neg logit contributions
��-1.064
 vet-0.802
ibaba-0.740
 manipulative-0.722
yden-0.719
Token]
Token ID60
Feature activation-7.01e-02

Pos logit contributions
��+2.600
 vet+1.686
ibaba+1.430
 manipulative+1.383
yden+1.380
Neg logit contributions
present-2.924
 Bris-2.803
 Practices-2.720
 Torn-2.654
eous-2.624
TokenPersonal
Token ID30228
Feature activation+2.06e-01

Pos logit contributions
��+0.599
 vet+0.442
ibaba+0.403
 manipulative+0.392
yden+0.390
Neg logit contributions
present-0.348
 Bris-0.329
 Practices-0.313
 Torn-0.305
eous-0.299
Token best
Token ID1266
Feature activation+1.12e-01

Pos logit contributions
present+1.083
 Bris+1.003
 Practices+0.942
 Torn+0.931
eous+0.918
Neg logit contributions
��-1.736
 vet-1.284
ibaba-1.161
 manipulative-1.132
yden-1.117
Token:
Token ID25
Feature activation+6.88e-02

Pos logit contributions
present+0.622
 Bris+0.583
 Practices+0.549
 Torn+0.543
eous+0.533
Neg logit contributions
��-0.914
 vet-0.659
ibaba-0.596
 manipulative-0.580
yden-0.573
Token[/
Token ID13412
Feature activation-8.17e-01

Pos logit contributions
present+0.388
 Bris+0.363
 Practices+0.346
 Torn+0.340
eous+0.334
Neg logit contributions
��-0.552
 vet-0.395
ibaba-0.356
 manipulative-0.348
yden-0.342
Tokenb
Token ID65
Feature activation+2.57e-02

Pos logit contributions
��+6.258
 vet+4.620
ibaba+4.186
yden+4.030
 col+3.994
Neg logit contributions
present-4.235
 Bris-4.229
 Practices-4.072
 Torn-3.991
 TAMADRA-3.880
Token]
Token ID60
Feature activation-1.27e-01

Pos logit contributions
present+0.180
 Bris+0.171
 Practices+0.165
 Torn+0.162
eous+0.160
Neg logit contributions
��-0.171
 vet-0.112
ibaba-0.098
 manipulative-0.094
yden-0.093
Token 1
Token ID352
Feature activation+1.29e-01

Pos logit contributions
��+1.047
 vet+0.763
ibaba+0.689
 manipulative+0.674
yden+0.665
Neg logit contributions
present-0.684
 Bris-0.640
 Practices-0.608
 Torn-0.595
eous-0.586
Token:
Token ID25
Feature activation+2.39e-01

Pos logit contributions
present+0.638
 Bris+0.593
 Practices+0.559
 Torn+0.549
eous+0.538
Neg logit contributions
��-1.135
 vet-0.836
ibaba-0.763
 manipulative-0.746
yden-0.740
Token07
Token ID2998
Feature activation+2.42e-01

Pos logit contributions
present+1.164
 Bris+1.098
 Practices+1.031
 Torn+1.015
eous+0.991
Neg logit contributions
��-2.080
 vet-1.538
ibaba-1.395
 manipulative-1.371
yden-1.363
Token paid
Token ID3432
Feature activation+2.91e-02

Pos logit contributions
present+2.711
 Bris+2.528
 Practices+2.397
 Torn+2.356
eous+2.316
Neg logit contributions
��-4.242
 vet-3.059
ibaba-2.791
 manipulative-2.708
yden-2.686
Token in
Token ID287
Feature activation+1.05e-01

Pos logit contributions
present+0.150
 Bris+0.140
 Practices+0.133
 Torn+0.131
eous+0.128
Neg logit contributions
��-0.248
 vet-0.181
ibaba-0.164
 manipulative-0.161
yden-0.160
Token 2010
Token ID3050
Feature activation+1.21e-01

Pos logit contributions
present+0.484
 Bris+0.448
 Practices+0.419
 Torn+0.412
eous+0.406
Neg logit contributions
��-0.957
 vet-0.716
ibaba-0.652
 manipulative-0.644
yden-0.637
Token
Token ID126
Feature activation-2.62e-01

Pos logit contributions
present+0.613
 Bris+0.573
 Practices+0.539
 Torn+0.532
eous+0.521
Neg logit contributions
��-1.041
 vet-0.762
ibaba-0.694
 manipulative-0.678
yden-0.674
Token
Token ID245
Feature activation-2.25e-01

Pos logit contributions
��+1.821
 vet+1.167
ibaba+1.030
yden+0.985
 manipulative+0.972
Neg logit contributions
present-1.798
 Bris-1.716
 Practices-1.641
 Torn-1.634
eous-1.600
Tokenis
Token ID271
Feature activation-8.07e-01

Pos logit contributions
��+1.799
 vet+1.304
ibaba+1.182
yden+1.153
 manipulative+1.150
Neg logit contributions
present-1.193
 Bris-1.159
 Practices-1.106
 Torn-1.090
eous-1.055
Token quite
Token ID2407
Feature activation-3.32e-01

Pos logit contributions
��+5.815
 vet+4.158
 manipulative+3.664
ibaba+3.621
 col+3.563
Neg logit contributions
present-4.752
 Bris-4.548
 Practices-4.530
 Torn-4.358
eous-4.224
Token likely
Token ID1884
Feature activation-3.80e-01

Pos logit contributions
��+2.284
 vet+1.562
 manipulative+1.362
ibaba+1.349
 col+1.312
Neg logit contributions
present-2.184
 Bris-2.088
 Practices-2.063
 Torn-2.000
eous-1.952
Token.
Token ID13
Feature activation+9.84e-02

Pos logit contributions
��+2.884
 vet+2.067
 manipulative+1.835
ibaba+1.830
yden+1.779
Neg logit contributions
present-2.223
 Bris-2.101
 Practices-2.041
 Torn-1.981
eous-1.949
Token And
Token ID843
Feature activation+2.26e-01

Pos logit contributions
present+0.483
 Bris+0.449
 Practices+0.420
 Torn+0.415
eous+0.413
Neg logit contributions
��-0.850
 vet-0.629
ibaba-0.569
 manipulative-0.559
yden-0.556
Token then
Token ID788
Feature activation+1.14e-01

Pos logit contributions
present+1.267
 Bris+1.187
 Practices+1.130
 Torn+1.113
eous+1.095
Neg logit contributions
��-1.834
 vet-1.313
ibaba-1.185
 manipulative-1.156
yden-1.145
Token
Token ID447
Feature activation+5.15e-02

Pos logit contributions
present+0.780
 Bris+0.727
 Practices+0.685
 Torn+0.674
eous+0.665
Neg logit contributions
��-1.293
 vet-0.947
ibaba-0.860
 manipulative-0.841
yden-0.833
Token
Token ID247
Feature activation+8.36e-02

Pos logit contributions
present+0.366
 Bris+0.348
 Practices+0.334
 Torn+0.330
eous+0.326
Neg logit contributions
��-0.340
 vet-0.221
ibaba-0.192
 manipulative-0.185
yden-0.183
Tokens
Token ID82
Feature activation-1.07e-01

Pos logit contributions
present+0.371
 Bris+0.345
 Practices+0.323
 Torn+0.316
eous+0.310
Neg logit contributions
��-0.769
 vet-0.577
ibaba-0.531
 manipulative-0.519
yden-0.516
Token being
Token ID852
Feature activation-1.53e-02

Pos logit contributions
��+0.816
 vet+0.572
ibaba+0.505
 manipulative+0.497
yden+0.489
Neg logit contributions
present-0.647
 Bris-0.612
 Practices-0.583
eous-0.569
 Torn-0.568
Token tested
Token ID6789
Feature activation-1.52e-02

Pos logit contributions
��+0.115
 vet+0.080
ibaba+0.071
 manipulative+0.070
yden+0.069
Neg logit contributions
present-0.095
 Bris-0.089
 Practices-0.085
 Torn-0.084
eous-0.083
Token Local
Token ID10714
Feature activation-8.01e-01

Pos logit contributions
��+0.130
 vet+0.096
ibaba+0.087
 manipulative+0.085
yden+0.084
Neg logit contributions
present-0.076
 Bris-0.071
 Practices-0.067
 Torn-0.066
eous-0.065
Token housing
Token ID5627
Feature activation-2.90e-01

Pos logit contributions
��+5.663
 vet+4.096
ibaba+3.503
 manipulative+3.483
 col+3.372
Neg logit contributions
present-4.974
 Bris-4.599
eous-4.353
 Practices-4.291
 Torn-4.264
Token councils
Token ID27174
Feature activation-1.74e-01

Pos logit contributions
��+2.193
 vet+1.597
ibaba+1.390
 manipulative+1.377
 col+1.331
Neg logit contributions
present-1.724
 Bris-1.611
eous-1.516
 Practices-1.504
 Torn-1.491
Token are
Token ID389
Feature activation-7.89e-02

Pos logit contributions
��+1.425
 vet+1.056
ibaba+0.931
 manipulative+0.921
yden+0.898
Neg logit contributions
present-0.934
 Bris-0.869
 Practices-0.821
eous-0.816
 Torn-0.805
Token testing
Token ID4856
Feature activation-3.30e-01

Pos logit contributions
��+0.586
 vet+0.412
ibaba+0.359
 manipulative+0.355
yden+0.346
Neg logit contributions
present-0.491
 Bris-0.463
 Practices-0.443
 Torn-0.434
eous-0.433
Token any
Token ID597
Feature activation+9.17e-03

Pos logit contributions
��+2.447
 vet+1.736
 manipulative+1.524
ibaba+1.499
yden+1.467
Neg logit contributions
present-2.038
 Bris-1.887
 Practices-1.807
eous-1.782
 Torn-1.767
Token hand
Token ID1021
Feature activation+1.97e-01

Pos logit contributions
��+2.468
 vet+1.826
ibaba+1.624
 manipulative+1.602
yden+1.576
Neg logit contributions
present-1.518
 Bris-1.400
 Practices-1.334
 Torn-1.297
eous-1.297
Tokenout
Token ID448
Feature activation+3.22e-01

Pos logit contributions
present+0.996
 Bris+0.931
 Practices+0.877
 Torn+0.862
eous+0.847
Neg logit contributions
��-1.704
 vet-1.252
ibaba-1.139
 manipulative-1.115
yden-1.104
Token<|endoftext|>
Token ID50256
Feature activation-2.73e-02

Pos logit contributions
present+1.598
 Bris+1.484
 Practices+1.398
 Torn+1.373
eous+1.349
Neg logit contributions
��-2.819
 vet-2.078
ibaba-1.895
 manipulative-1.851
yden-1.837
TokenOr
Token ID5574
Feature activation+3.04e-01

Pos logit contributions
��+0.228
 vet+0.164
ibaba+0.150
 manipulative+0.146
yden+0.145
Neg logit contributions
present-0.143
 Bris-0.138
 Practices-0.130
 Torn-0.127
eous-0.125
Token at
Token ID379
Feature activation+2.89e-01

Pos logit contributions
present+1.988
 Bris+1.882
 Practices+1.802
 Torn+1.781
eous+1.754
Neg logit contributions
��-2.173
 vet-1.470
ibaba-1.303
 manipulative-1.259
yden-1.247
Token least
Token ID1551
Feature activation+5.59e-01

Pos logit contributions
present+1.299
 Bris+1.194
 Practices+1.126
 Torn+1.095
eous+1.087
Neg logit contributions
��-2.648
 vet-1.996
ibaba-1.830
 manipulative-1.789
yden-1.780
Token 26
Token ID2608
Feature activation-1.40e-02

Pos logit contributions
present+3.117
 Bris+2.928
 Practices+2.778
 Torn+2.739
eous+2.678
Neg logit contributions
��-4.540
 vet-3.228
ibaba-2.939
 manipulative-2.846
yden-2.828
Token of
Token ID286
Feature activation-1.24e-01

Pos logit contributions
��+0.110
 vet+0.079
ibaba+0.070
 manipulative+0.069
yden+0.068
Neg logit contributions
present-0.081
 Bris-0.076
 Practices-0.072
 Torn-0.071
eous-0.070
Token 27
Token ID2681
Feature activation-4.02e-02

Pos logit contributions
��+0.962
 vet+0.684
ibaba+0.606
 manipulative+0.598
yden+0.590
Neg logit contributions
present-0.728
 Bris-0.680
 Practices-0.652
eous-0.636
 Torn-0.632
Token:
Token ID25
Feature activation+3.03e-01

Pos logit contributions
��+0.313
 vet+0.225
ibaba+0.197
 manipulative+0.196
yden+0.192
Neg logit contributions
present-0.235
 Bris-0.219
 Practices-0.210
eous-0.205
 Torn-0.204
TokenTHE
Token ID10970
Feature activation+6.09e-02

Pos logit contributions
present+1.570
 Bris+1.464
 Practices+1.382
 Torn+1.358
eous+1.340
Neg logit contributions
��-2.580
 vet-1.887
ibaba-1.716
 manipulative-1.672
yden-1.659
Tokenx
Token ID87
Feature activation-6.25e-02

Pos logit contributions
��+2.372
 vet+1.664
ibaba+1.500
yden+1.442
 manipulative+1.435
Neg logit contributions
present-1.834
 Bris-1.785
 Practices-1.697
 Torn-1.660
eous-1.641
Token001
Token ID8298
Feature activation-7.50e-02

Pos logit contributions
��+0.466
 vet+0.324
ibaba+0.288
 manipulative+0.277
yden+0.277
Neg logit contributions
present-0.383
 Bris-0.370
 Practices-0.351
 Torn-0.346
eous-0.343
Tokenf
Token ID69
Feature activation+3.18e-01

Pos logit contributions
��+0.680
 vet+0.507
ibaba+0.466
 manipulative+0.455
yden+0.453
Neg logit contributions
present-0.342
 Bris-0.321
 Practices-0.301
 Torn-0.295
eous-0.288
Token1
Token ID16
Feature activation+4.95e-01

Pos logit contributions
present+1.208
 Bris+1.100
 Practices+1.017
 Torn+0.993
eous+0.965
Neg logit contributions
��-3.150
 vet-2.413
ibaba-2.239
 manipulative-2.192
yden-2.179
Tokenf
Token ID69
Feature activation+4.23e-01

Pos logit contributions
present+2.054
 Bris+1.808
 Practices+1.684
 Torn+1.650
eous+1.644
Neg logit contributions
��-4.821
 vet-3.620
ibaba-3.359
 manipulative-3.290
yden-3.235
Token4
Token ID19
Feature activation+5.82e-01

Pos logit contributions
present+1.554
 Bris+1.407
 Practices+1.295
 Torn+1.265
eous+1.228
Neg logit contributions
��-4.244
 vet-3.264
ibaba-3.031
 manipulative-2.970
yden-2.953
Tokenb
Token ID65
Feature activation+1.45e-01

Pos logit contributions
present+2.647
 Bris+2.385
 Practices+2.207
 Torn+2.165
eous+2.149
Neg logit contributions
��-5.399
 vet-4.006
ibaba-3.679
 manipulative-3.629
yden-3.566
Tokenbe
Token ID1350
Feature activation-2.07e-02

Pos logit contributions
present+0.490
 Bris+0.439
 Practices+0.401
 Torn+0.390
eous+0.378
Neg logit contributions
��-1.493
 vet-1.158
ibaba-1.078
 manipulative-1.057
yden-1.051
Token66
Token ID2791
Feature activation+9.11e-02

Pos logit contributions
��+0.158
 vet+0.110
ibaba+0.099
 manipulative+0.096
yden+0.095
Neg logit contributions
present-0.126
 Bris-0.119
 Practices-0.113
 Torn-0.112
eous-0.110
Token power
Token ID1176
Feature activation-7.78e-02

Pos logit contributions
present+0.522
 Bris+0.495
 Practices+0.470
 Torn+0.462
eous+0.457
Neg logit contributions
��-0.724
 vet-0.515
ibaba-0.461
 manipulative-0.450
yden-0.444
Token off
Token ID572
Feature activation-1.51e-01

Pos logit contributions
��+0.589
 vet+0.412
ibaba+0.363
 manipulative+0.355
yden+0.351
Neg logit contributions
present-0.473
 Bris-0.455
 Practices-0.430
 Torn-0.424
eous-0.420
Token count
Token ID954
Feature activation-2.88e-03

Pos logit contributions
��+1.838
 vet+1.320
ibaba+1.167
 col+1.148
 manipulative+1.147
Neg logit contributions
present-1.323
 Bris-1.258
 Practices-1.197
eous-1.181
 Torn-1.169
Token )
Token ID1267
Feature activation+1.25e-01

Pos logit contributions
��+0.023
 vet+0.017
ibaba+0.015
 manipulative+0.015
yden+0.015
Neg logit contributions
present-0.016
 Bris-0.015
 Practices-0.014
 Torn-0.014
eous-0.014
Token count
Token ID954
Feature activation+6.87e-02

Pos logit contributions
present+0.677
 Bris+0.633
 Practices+0.600
 Torn+0.588
eous+0.585
Neg logit contributions
��-1.035
 vet-0.754
ibaba-0.676
 manipulative-0.663
 col-0.654
Token FROM
Token ID16034
Feature activation+4.63e-01

Pos logit contributions
present+0.340
 Bris+0.318
 Practices+0.299
 Torn+0.293
eous+0.289
Neg logit contributions
��-0.598
 vet-0.443
ibaba-0.404
 manipulative-0.394
yden-0.390
Token cl
Token ID537
Feature activation+4.11e-01

Pos logit contributions
present+2.614
 Bris+2.453
 Practices+2.333
 Torn+2.302
eous+2.255
Neg logit contributions
��-3.732
 vet-2.651
ibaba-2.403
 manipulative-2.333
yden-2.318
Tokenojure
Token ID32511
Feature activation+5.92e-01

Pos logit contributions
present+3.406
 Bris+3.203
 Practices+3.126
 Torn+3.095
eous+3.080
Neg logit contributions
��-2.255
 vet-1.262
ibaba-1.027
 manipulative-1.017
yden-0.931
Token.
Token ID764
Feature activation+4.77e-02

Pos logit contributions
present+3.877
 Bris+3.656
 Practices+3.531
 Torn+3.475
 Bind+3.415
Neg logit contributions
��-4.208
 vet-2.795
ibaba-2.489
 manipulative-2.420
yden-2.402
Token libraries
Token ID12782
Feature activation+6.27e-02

Pos logit contributions
present+0.269
 Bris+0.255
 Practices+0.240
 Torn+0.237
eous+0.235
Neg logit contributions
��-0.382
 vet-0.274
ibaba-0.243
 manipulative-0.240
 col-0.236
Token WHERE
Token ID33411
Feature activation+1.45e-01

Pos logit contributions
present+0.338
 Bris+0.318
 Practices+0.299
eous+0.294
 Torn+0.293
Neg logit contributions
��-0.520
 vet-0.379
ibaba-0.341
 manipulative-0.333
 col-0.329
Token library
Token ID5888
Feature activation+1.95e-01

Pos logit contributions
present+0.863
 Bris+0.814
 Practices+0.777
 Torn+0.763
eous+0.757
Neg logit contributions
��-1.118
 vet-0.789
ibaba-0.702
 manipulative-0.685
yden-0.678
Token CON
Token ID7102
Feature activation+2.12e-01

Pos logit contributions
present+1.073
 Bris+1.012
 Practices+0.957
 Torn+0.941
eous+0.930
Neg logit contributions
��-1.598
 vet-1.152
ibaba-1.039
 manipulative-1.013
yden-1.003
Token '
Token ID705
Feature activation-1.46e-01

Pos logit contributions
��+0.535
 vet+0.390
ibaba+0.352
 manipulative+0.344
 col+0.338
Neg logit contributions
present-0.342
 Bris-0.320
 Practices-0.304
 Torn-0.297
eous-0.292
Tokenboot
Token ID18769
Feature activation-4.65e-02

Pos logit contributions
��+1.210
 vet+0.874
ibaba+0.796
yden+0.769
 manipulative+0.768
Neg logit contributions
present-0.758
 Bris-0.734
 Practices-0.690
 Torn-0.681
eous-0.663
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
��+0.375
 vet+0.269
ibaba+0.242
 manipulative+0.234
yden+0.232
Neg logit contributions
present-0.261
 Bris-0.248
 Practices-0.233
 Torn-0.231
eous-0.226
Tokenbuild
Token ID11249
Feature activation-1.05e-01

Pos logit contributions
��+1.351
 vet+0.985
ibaba+0.902
yden+0.873
 manipulative+0.866
Neg logit contributions
present-0.775
 Bris-0.739
 Practices-0.697
 Torn-0.687
eous-0.666
Token'
Token ID6
Feature activation+1.63e-01

Pos logit contributions
��+0.859
 vet+0.616
ibaba+0.557
 manipulative+0.541
yden+0.536
Neg logit contributions
present-0.578
 Bris-0.546
 Practices-0.517
 Torn-0.510
eous-0.499
Token --
Token ID1377
Feature activation+5.22e-01

Pos logit contributions
present+0.871
 Bris+0.814
 Practices+0.772
 Torn+0.756
eous+0.752
Neg logit contributions
��-1.365
 vet-0.994
ibaba-0.891
 manipulative-0.876
yden-0.866
Token Save
Token ID12793
Feature activation+1.64e-01

Pos logit contributions
present+2.671
 Bris+2.475
 Practices+2.352
 Torn+2.313
 Bind+2.260
Neg logit contributions
��-4.480
 vet-3.258
ibaba-2.987
 manipulative-2.911
yden-2.887
Token the
Token ID262
Feature activation+2.25e-01

Pos logit contributions
present+0.939
 Bris+0.890
 Practices+0.836
 Torn+0.826
eous+0.819
Neg logit contributions
��-1.297
 vet-0.927
ibaba-0.832
 manipulative-0.808
yden-0.801
Token results
Token ID2482
Feature activation-1.65e-01

Pos logit contributions
present+1.165
 Bris+1.095
 Practices+1.028
 Torn+1.008
eous+1.000
Neg logit contributions
��-1.910
 vet-1.399
ibaba-1.263
 manipulative-1.237
yden-1.225
Token to
Token ID284
Feature activation+1.09e-01

Pos logit contributions
��+1.368
 vet+1.013
ibaba+0.914
 manipulative+0.892
 col+0.892
Neg logit contributions
present-0.863
 Bris-0.815
 Practices-0.773
 Torn-0.751
eous-0.740
Token cl
Token ID537
Feature activation+1.77e-01

Pos logit contributions
present+0.627
 Bris+0.592
 Practices+0.563
 Torn+0.552
eous+0.548
Neg logit contributions
��-0.862
 vet-0.614
ibaba-0.550
 manipulative-0.535
 col-0.528
Token =
Token ID796
Feature activation-1.66e-01

Pos logit contributions
��+0.017
 vet+0.013
ibaba+0.012
yden+0.011
 manipulative+0.011
Neg logit contributions
present-0.009
 Bris-0.009
 Practices-0.008
 Torn-0.008
eous-0.008
Token New
Token ID968
Feature activation-1.74e-01

Pos logit contributions
��+1.385
 vet+1.003
ibaba+0.892
 manipulative+0.890
 col+0.877
Neg logit contributions
present-0.875
 Bris-0.807
 Practices-0.775
eous-0.753
 Torn-0.750
Token-
Token ID12
Feature activation-5.98e-02

Pos logit contributions
��+1.423
 vet+1.029
ibaba+0.923
 manipulative+0.906
yden+0.905
Neg logit contributions
present-0.933
 Bris-0.883
 Practices-0.838
 Torn-0.824
eous-0.806
TokenObject
Token ID10267
Feature activation+1.87e-01

Pos logit contributions
��+0.540
 vet+0.402
ibaba+0.366
 manipulative+0.361
yden+0.361
Neg logit contributions
present-0.269
 Bris-0.255
 Practices-0.239
 Torn-0.234
eous-0.229
Token System
Token ID4482
Feature activation+3.80e-01

Pos logit contributions
present+0.788
 Bris+0.721
 Practices+0.677
 Torn+0.658
eous+0.644
Neg logit contributions
��-1.780
 vet-1.343
ibaba-1.231
 manipulative-1.217
yden-1.199
Token.
Token ID13
Feature activation+6.48e-01

Pos logit contributions
present+1.782
 Bris+1.614
 Practices+1.516
 Torn+1.492
eous+1.477
Neg logit contributions
��-3.501
 vet-2.577
ibaba-2.391
 manipulative-2.317
yden-2.294
TokenDraw
Token ID25302
Feature activation+1.49e-02

Pos logit contributions
present+2.256
 Bris+1.954
 Practices+1.787
 Torn+1.738
eous+1.696
Neg logit contributions
��-6.699
 vet-5.203
ibaba-4.883
 manipulative-4.746
yden-4.706
Tokening
Token ID278
Feature activation+4.03e-01

Pos logit contributions
present+0.070
 Bris+0.065
 Practices+0.061
 Torn+0.060
eous+0.059
Neg logit contributions
��-0.134
 vet-0.099
ibaba-0.091
 manipulative-0.089
yden-0.089
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
present+1.901
 Bris+1.740
 Practices+1.640
 Torn+1.611
eous+1.579
Neg logit contributions
��-3.670
 vet-2.708
ibaba-2.511
 manipulative-2.425
yden-2.412
TokenFont
Token ID23252
Feature activation-4.82e-01

Pos logit contributions
present+0.159
 Bris+0.151
 Practices+0.144
 Torn+0.139
eous+0.136
Neg logit contributions
��-0.280
 vet-0.202
ibaba-0.183
 manipulative-0.181
yden-0.180
Token($
Token ID16763
Feature activation+3.45e-02

Pos logit contributions
��+3.962
 vet+2.815
ibaba+2.547
 manipulative+2.541
 col+2.494
Neg logit contributions
present-2.542
 Bris-2.400
 Practices-2.301
 Torn-2.228
 Hind-2.178
Token without
Token ID1231
Feature activation-1.87e-01

Pos logit contributions
��+1.116
 vet+0.822
 manipulative+0.714
ibaba+0.714
yden+0.704
Neg logit contributions
present-0.825
 Bris-0.768
 Practices-0.738
eous-0.716
 Torn-0.713
Token daily
Token ID4445
Feature activation+1.98e-01

Pos logit contributions
��+1.306
 vet+0.906
ibaba+0.772
 manipulative+0.760
 col+0.740
Neg logit contributions
present-1.246
 Bris-1.172
 Practices-1.132
 Torn-1.114
eous-1.109
Token hygiene
Token ID33306
Feature activation+5.30e-02

Pos logit contributions
present+1.316
 Bris+1.248
 Practices+1.196
 Torn+1.182
eous+1.166
Neg logit contributions
��-1.393
 vet-0.940
ibaba-0.825
 manipulative-0.801
yden-0.790
Token.
Token ID13
Feature activation+8.27e-02

Pos logit contributions
present+0.276
 Bris+0.258
 Practices+0.240
 Torn+0.239
eous+0.237
Neg logit contributions
��-0.447
 vet-0.332
ibaba-0.296
 manipulative-0.293
yden-0.289
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
present+0.407
 Bris+0.377
 Practices+0.351
eous+0.349
 Torn+0.345
Neg logit contributions
��-0.713
 vet-0.532
ibaba-0.479
 manipulative-0.469
yden-0.465
Token
Token ID198
Feature activation+1.90e-01

Pos logit contributions
present+0.546
 Bris+0.506
 Practices+0.472
 Torn+0.463
eous+0.452
Neg logit contributions
��-1.158
 vet-0.874
ibaba-0.803
 manipulative-0.786
yden-0.781
TokenWhat
Token ID2061
Feature activation+2.64e-01

Pos logit contributions
present+0.973
 Bris+0.915
 Practices+0.865
 Torn+0.851
eous+0.833
Neg logit contributions
��-1.625
 vet-1.186
ibaba-1.078
 manipulative-1.052
yden-1.045
Token few
Token ID1178
Feature activation-2.75e-02

Pos logit contributions
present+1.391
 Bris+1.299
 Practices+1.230
 Torn+1.211
eous+1.187
Neg logit contributions
��-2.221
 vet-1.610
ibaba-1.465
 manipulative-1.427
yden-1.416
Token possessions
Token ID23309
Feature activation-2.40e-01

Pos logit contributions
��+0.210
 vet+0.150
ibaba+0.132
 manipulative+0.129
yden+0.128
Neg logit contributions
present-0.165
 Bris-0.155
 Practices-0.147
 Torn-0.146
eous-0.145
Token Hilton
Token ID31248
Feature activation-8.52e-02

Pos logit contributions
��+1.920
 vet+1.410
ibaba+1.251
 manipulative+1.225
yden+1.223
Neg logit contributions
present-1.323
 Bris-1.228
 Practices-1.172
eous-1.157
 Torn-1.140
Token had
Token ID550
Feature activation+1.88e-01

Pos logit contributions
��+0.719
 vet+0.528
ibaba+0.478
 manipulative+0.467
yden+0.465
Neg logit contributions
present-0.441
 Bris-0.414
 Practices-0.388
 Torn-0.383
eous-0.379
Token the
Token ID262
Feature activation-5.51e-02

Pos logit contributions
��+1.738
 vet+1.232
ibaba+1.079
 manipulative+1.075
yden+1.054
Neg logit contributions
present-1.344
 Bris-1.267
 Practices-1.213
 Torn-1.193
eous-1.193
Token Affordable
Token ID19258
Feature activation+1.28e-02

Pos logit contributions
��+0.439
 vet+0.314
 manipulative+0.278
ibaba+0.276
yden+0.271
Neg logit contributions
present-0.313
 Bris-0.295
 Practices-0.281
eous-0.275
 Torn-0.275
Token Care
Token ID7276
Feature activation+1.72e-01

Pos logit contributions
present+0.093
 Bris+0.089
 Practices+0.085
 Torn+0.085
eous+0.084
Neg logit contributions
��-0.082
 vet-0.053
ibaba-0.045
 manipulative-0.044
yden-0.043
Token Act
Token ID2191
Feature activation-1.42e-01

Pos logit contributions
present+0.820
 Bris+0.759
 Practices+0.717
 Torn+0.700
eous+0.684
Neg logit contributions
��-1.534
 vet-1.135
ibaba-1.045
 manipulative-1.016
yden-1.010
Token and
Token ID290
Feature activation+1.10e-01

Pos logit contributions
��+1.204
 vet+0.884
ibaba+0.797
 manipulative+0.781
yden+0.774
Neg logit contributions
present-0.736
 Bris-0.688
 Practices-0.651
 Torn-0.641
eous-0.630
Token sell
Token ID3677
Feature activation+3.12e-01

Pos logit contributions
present+0.636
 Bris+0.598
 Practices+0.569
 Torn+0.561
eous+0.553
Neg logit contributions
��-0.864
 vet-0.612
ibaba-0.549
 manipulative-0.535
yden-0.530
Token the
Token ID262
Feature activation+1.34e-01

Pos logit contributions
present+1.732
 Bris+1.623
 Practices+1.541
 Torn+1.519
eous+1.494
Neg logit contributions
��-2.539
 vet-1.819
ibaba-1.644
 manipulative-1.604
yden-1.591
Token public
Token ID1171
Feature activation-2.71e-01

Pos logit contributions
present+0.879
 Bris+0.832
 Practices+0.800
 Torn+0.787
eous+0.784
Neg logit contributions
��-0.960
 vet-0.653
ibaba-0.564
 manipulative-0.563
yden-0.549
Token on
Token ID319
Feature activation-2.00e-01

Pos logit contributions
��+2.226
 vet+1.633
ibaba+1.446
 manipulative+1.443
yden+1.418
Neg logit contributions
present-1.457
 Bris-1.359
 Practices-1.294
 Torn-1.273
eous-1.269
Token their
Token ID511
Feature activation-7.32e-02

Pos logit contributions
��+1.518
 vet+1.074
 manipulative+0.943
ibaba+0.933
yden+0.919
Neg logit contributions
present-1.206
 Bris-1.133
 Practices-1.080
eous-1.074
 Torn-1.066
Token plan
Token ID1410
Feature activation-1.89e-03

Pos logit contributions
��+0.509
 vet+0.346
 manipulative+0.297
ibaba+0.291
yden+0.285
Neg logit contributions
present-0.491
 Bris-0.465
 Practices-0.446
 Torn-0.441
eous-0.438
Token NHL
Token ID9481
Feature activation+1.33e-01

Pos logit contributions
��+0.230
 vet+0.163
ibaba+0.142
 manipulative+0.141
yden+0.137
Neg logit contributions
present-0.181
 Bris-0.171
 Practices-0.163
 Torn-0.160
eous-0.159
Token
Token ID447
Feature activation+1.36e-01

Pos logit contributions
present+0.708
 Bris+0.662
 Practices+0.626
 Torn+0.616
eous+0.607
Neg logit contributions
��-1.118
 vet-0.812
ibaba-0.737
 manipulative-0.717
yden-0.709
Token
Token ID247
Feature activation+2.50e-01

Pos logit contributions
present+0.952
 Bris+0.899
 Practices+0.867
 Torn+0.856
eous+0.846
Neg logit contributions
��-0.904
 vet-0.597
ibaba-0.524
 manipulative-0.507
yden-0.498
Tokens
Token ID82
Feature activation+2.23e-01

Pos logit contributions
present+1.085
 Bris+0.997
 Practices+0.931
 Torn+0.913
eous+0.892
Neg logit contributions
��-2.340
 vet-1.761
ibaba-1.623
 manipulative-1.588
yden-1.578
Token best
Token ID1266
Feature activation+1.90e-01

Pos logit contributions
present+1.325
 Bris+1.248
 Practices+1.189
 Torn+1.172
eous+1.154
Neg logit contributions
��-1.727
 vet-1.216
ibaba-1.087
 manipulative-1.061
yden-1.046
Token defense
Token ID3761
Feature activation+1.19e-01

Pos logit contributions
present+1.183
 Bris+1.118
 Practices+1.066
 Torn+1.053
eous+1.037
Neg logit contributions
��-1.424
 vet-0.987
ibaba-0.879
 manipulative-0.854
yden-0.843
Tokenmen
Token ID3653
Feature activation+1.03e-02

Pos logit contributions
present+0.580
 Bris+0.552
 Practices+0.511
 Torn+0.504
eous+0.500
Neg logit contributions
��-1.039
 vet-0.777
ibaba-0.698
 manipulative-0.687
yden-0.674
Token and
Token ID290
Feature activation+7.45e-02

Pos logit contributions
present+0.054
 Bris+0.051
 Practices+0.047
 Torn+0.047
eous+0.046
Neg logit contributions
��-0.087
 vet-0.064
ibaba-0.058
 manipulative-0.057
yden-0.056
Token elite
Token ID9085
Feature activation+1.35e-02

Pos logit contributions
present+0.458
 Bris+0.433
 Practices+0.413
 Torn+0.406
eous+0.403
Neg logit contributions
��-0.562
 vet-0.394
ibaba-0.347
 manipulative-0.342
yden-0.333
Token power
Token ID1176
Feature activation+1.57e-01

Pos logit contributions
present+0.087
 Bris+0.083
 Practices+0.079
 Torn+0.078
eous+0.077
Neg logit contributions
��-0.097
 vet-0.067
ibaba-0.059
 manipulative-0.057
yden-0.056
Token play
Token ID711
Feature activation+2.23e-01

Pos logit contributions
present+1.011
 Bris+0.965
 Practices+0.920
 Torn+0.910
eous+0.894
Neg logit contributions
��-1.134
 vet-0.780
ibaba-0.686
 manipulative-0.667
yden-0.658
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
present+0.807
 Bris+0.742
 Practices+0.692
 Torn+0.679
eous+0.663
Neg logit contributions
��-1.748
 vet-1.316
ibaba-1.214
 manipulative-1.187
yden-1.179
Token
Token ID198
Feature activation+2.01e-01

Pos logit contributions
present+0.670
 Bris+0.620
 Practices+0.579
 Torn+0.568
eous+0.553
Neg logit contributions
��-1.421
 vet-1.072
ibaba-0.984
 manipulative-0.964
yden-0.958
Token"…
Token ID24426
Feature activation-5.22e-02

Pos logit contributions
present+1.046
 Bris+0.990
 Practices+0.936
 Torn+0.917
eous+0.898
Neg logit contributions
��-1.702
 vet-1.233
ibaba-1.122
 manipulative-1.098
yden-1.092
TokenAlright
Token ID31442
Feature activation+2.83e-02

Pos logit contributions
��+0.445
 vet+0.326
ibaba+0.297
 manipulative+0.291
yden+0.289
Neg logit contributions
present-0.264
 Bris-0.248
 Practices-0.233
 Torn-0.228
eous-0.226
Token."
Token ID526
Feature activation+1.55e-01

Pos logit contributions
present+0.140
 Bris+0.131
 Practices+0.123
 Torn+0.121
eous+0.118
Neg logit contributions
��-0.247
 vet-0.182
ibaba-0.166
 manipulative-0.162
yden-0.162
Token
Token ID198
Feature activation+6.34e-02

Pos logit contributions
present+0.680
 Bris+0.628
 Practices+0.585
 Torn+0.575
eous+0.562
Neg logit contributions
��-1.444
 vet-1.087
ibaba-1.000
 manipulative-0.979
yden-0.973
Token
Token ID198
Feature activation+1.50e-01

Pos logit contributions
present+0.282
 Bris+0.264
 Practices+0.246
 Torn+0.242
eous+0.235
Neg logit contributions
��-0.577
 vet-0.436
ibaba-0.398
 manipulative-0.392
yden-0.389
TokenSte
Token ID7447
Feature activation+2.39e-01

Pos logit contributions
present+0.812
 Bris+0.775
 Practices+0.733
 Torn+0.719
eous+0.705
Neg logit contributions
��-1.230
 vet-0.881
ibaba-0.798
 manipulative-0.782
yden-0.779
Tokenlla
Token ID8466
Feature activation+2.47e-02

Pos logit contributions
present+1.068
 Bris+0.969
 Practices+0.909
 Torn+0.888
eous+0.879
Neg logit contributions
��-2.217
 vet-1.657
ibaba-1.520
 manipulative-1.493
yden-1.473
Token and
Token ID290
Feature activation-2.27e-01

Pos logit contributions
present+0.142
 Bris+0.134
 Practices+0.127
 Torn+0.125
eous+0.123
Neg logit contributions
��-0.194
 vet-0.139
ibaba-0.124
 manipulative-0.122
yden-0.121
Token Ik
Token ID32840
Feature activation+1.49e-01

Pos logit contributions
��+1.555
 vet+1.059
ibaba+0.923
 manipulative+0.912
yden+0.897
Neg logit contributions
present-1.514
 Bris-1.436
 Practices-1.379
eous-1.351
 Torn-1.341
Token]
Token ID60
Feature activation-2.12e-01

Pos logit contributions
��+1.102
 vet+0.739
ibaba+0.644
 manipulative+0.618
yden+0.613
Neg logit contributions
present-1.071
 Bris-1.015
 Practices-0.976
 Torn-0.966
eous-0.952
Token and
Token ID290
Feature activation+4.92e-02

Pos logit contributions
��+1.682
 vet+1.203
ibaba+1.079
 manipulative+1.050
yden+1.040
Neg logit contributions
present-1.210
 Bris-1.142
 Practices-1.092
 Torn-1.072
eous-1.052
Token yet
Token ID1865
Feature activation+5.62e-02

Pos logit contributions
present+0.285
 Bris+0.268
 Practices+0.256
 Torn+0.252
eous+0.248
Neg logit contributions
��-0.388
 vet-0.275
ibaba-0.247
 manipulative-0.240
yden-0.238
Token we
Token ID356
Feature activation+3.82e-02

Pos logit contributions
present+0.317
 Bris+0.298
 Practices+0.283
 Torn+0.278
eous+0.275
Neg logit contributions
��-0.452
 vet-0.323
ibaba-0.290
 manipulative-0.284
yden-0.281
Token let
Token ID1309
Feature activation-3.01e-02

Pos logit contributions
present+0.221
 Bris+0.209
 Practices+0.199
 Torn+0.196
eous+0.193
Neg logit contributions
��-0.301
 vet-0.213
ibaba-0.191
 manipulative-0.186
yden-0.185
Token the
Token ID262
Feature activation+3.96e-02

Pos logit contributions
��+0.232
 vet+0.164
ibaba+0.146
 manipulative+0.143
yden+0.141
Neg logit contributions
present-0.179
 Bris-0.168
 Practices-0.160
 Torn-0.157
eous-0.156
Token shuttle
Token ID20992
Feature activation+8.78e-02

Pos logit contributions
present+0.237
 Bris+0.224
 Practices+0.213
 Torn+0.209
eous+0.209
Neg logit contributions
��-0.303
 vet-0.216
ibaba-0.190
 manipulative-0.188
yden-0.184
Token lapse
Token ID42689
Feature activation+4.04e-02

Pos logit contributions
present+0.531
 Bris+0.504
 Practices+0.479
eous+0.472
 Torn+0.471
Neg logit contributions
��-0.671
 vet-0.472
ibaba-0.419
 manipulative-0.408
yden-0.401
Token,
Token ID11
Feature activation+8.81e-02

Pos logit contributions
present+0.212
 Bris+0.200
 Practices+0.189
 Torn+0.186
eous+0.182
Neg logit contributions
��-0.338
 vet-0.247
ibaba-0.223
 manipulative-0.218
yden-0.217
Token instead
Token ID2427
Feature activation-3.56e-02

Pos logit contributions
present+0.476
 Bris+0.447
 Practices+0.422
 Torn+0.416
eous+0.410
Neg logit contributions
��-0.731
 vet-0.528
ibaba-0.478
 manipulative-0.467
yden-0.463
Token of
Token ID286
Feature activation-3.15e-01

Pos logit contributions
��+0.308
 vet+0.227
ibaba+0.206
 manipulative+0.202
yden+0.200
Neg logit contributions
present-0.178
 Bris-0.166
 Practices-0.158
 Torn-0.154
eous-0.152
Token nearly
Token ID3016
Feature activation-3.05e-01

Pos logit contributions
present+0.208
 Bris+0.194
 Practices+0.186
 Torn+0.183
eous+0.181
Neg logit contributions
��-0.283
 vet-0.202
ibaba-0.179
 manipulative-0.178
yden-0.175
Token $
Token ID720
Feature activation-3.76e-01

Pos logit contributions
��+2.151
 vet+1.491
ibaba+1.296
 manipulative+1.286
yden+1.246
Neg logit contributions
present-1.999
 Bris-1.856
 Practices-1.811
 Torn-1.772
eous-1.756
Token4
Token ID19
Feature activation+7.80e-02

Pos logit contributions
��+3.063
 vet+2.244
ibaba+2.013
yden+1.987
 manipulative+1.977
Neg logit contributions
present-1.902
 Bris-1.858
 Practices-1.756
 Torn-1.724
eous-1.672
Token billion
Token ID2997
Feature activation+1.78e-01

Pos logit contributions
present+0.519
 Bris+0.489
 Practices+0.472
 Torn+0.464
eous+0.456
Neg logit contributions
��-0.552
 vet-0.373
ibaba-0.329
 manipulative-0.319
yden-0.313
Token.)
Token ID2014
Feature activation-1.75e-01

Pos logit contributions
present+0.845
 Bris+0.783
 Practices+0.740
 Torn+0.725
eous+0.708
Neg logit contributions
��-1.594
 vet-1.187
ibaba-1.082
 manipulative-1.064
yden-1.052
Token At
Token ID1629
Feature activation-7.91e-03

Pos logit contributions
��+1.409
 vet+1.036
ibaba+0.919
 manipulative+0.915
yden+0.905
Neg logit contributions
present-0.926
 Bris-0.868
eous-0.809
 Practices-0.807
 Torn-0.796
Token that
Token ID326
Feature activation-1.84e-01

Pos logit contributions
��+0.061
 vet+0.044
ibaba+0.039
 manipulative+0.039
yden+0.038
Neg logit contributions
present-0.046
 Bris-0.043
 Practices-0.041
 Torn-0.040
eous-0.040
Token point
Token ID966
Feature activation-4.16e-02

Pos logit contributions
��+1.500
 vet+1.098
 manipulative+0.982
ibaba+0.971
 col+0.943
Neg logit contributions
present-1.000
 Bris-0.936
 Practices-0.900
 Torn-0.882
eous-0.865
Token,
Token ID11
Feature activation-1.72e-01

Pos logit contributions
��+0.348
 vet+0.255
ibaba+0.230
 manipulative+0.228
yden+0.224
Neg logit contributions
present-0.218
 Bris-0.202
 Practices-0.193
 Torn-0.188
eous-0.186
Token though
Token ID996
Feature activation-2.50e-01

Pos logit contributions
��+1.310
 vet+0.940
ibaba+0.829
 manipulative+0.828
yden+0.805
Neg logit contributions
present-1.016
 Bris-0.947
 Practices-0.907
eous-0.897
 Torn-0.888
Token,
Token ID11
Feature activation-1.08e-01

Pos logit contributions
��+2.106
 vet+1.547
 manipulative+1.392
ibaba+1.379
yden+1.357
Neg logit contributions
present-1.293
 Bris-1.197
 Practices-1.144
 Torn-1.112
eous-1.108
Token property
Token ID3119
Feature activation-9.49e-02

Pos logit contributions
��+0.430
 vet+0.298
ibaba+0.259
 manipulative+0.256
yden+0.251
Neg logit contributions
present-0.376
 Bris-0.355
 Practices-0.339
 Torn-0.333
eous-0.330
Token<|endoftext|>
Token ID50256
Feature activation-4.17e-02

Pos logit contributions
��+0.801
 vet+0.590
ibaba+0.529
 manipulative+0.522
yden+0.518
Neg logit contributions
present-0.493
 Bris-0.457
 Practices-0.433
 Torn-0.426
eous-0.417
TokenB
Token ID33
Feature activation+5.21e-02

Pos logit contributions
��+0.347
 vet+0.251
ibaba+0.228
 manipulative+0.222
yden+0.222
Neg logit contributions
present-0.219
 Bris-0.210
 Practices-0.199
 Torn-0.195
eous-0.191
Tokenishops
Token ID21863
Feature activation+1.01e-01

Pos logit contributions
present+0.294
 Bris+0.279
 Practices+0.265
 Torn+0.258
eous+0.254
Neg logit contributions
��-0.418
 vet-0.300
ibaba-0.274
yden-0.263
 manipulative-0.263
Token Opp
Token ID9385
Feature activation-1.17e-01

Pos logit contributions
present+0.554
 Bris+0.518
 Practices+0.489
eous+0.481
 Torn+0.481
Neg logit contributions
��-0.823
 vet-0.597
ibaba-0.531
 manipulative-0.524
yden-0.517
Tokenose
Token ID577
Feature activation-1.35e-01

Pos logit contributions
��+0.936
 vet+0.679
ibaba+0.611
yden+0.605
 manipulative+0.585
Neg logit contributions
present-0.645
 Bris-0.596
 Practices-0.565
 Torn-0.552
eous-0.542
Token Catholic
Token ID7835
Feature activation-1.51e-01

Pos logit contributions
��+1.126
 vet+0.836
ibaba+0.746
 manipulative+0.736
yden+0.729
Neg logit contributions
present-0.703
 Bris-0.636
 Practices-0.596
eous-0.592
 Torn-0.586
Token Universities
Token ID42571
Feature activation-1.24e-01

Pos logit contributions
��+1.286
 vet+0.947
ibaba+0.850
 manipulative+0.836
yden+0.819
Neg logit contributions
present-0.779
 Bris-0.712
 Torn-0.653
 Practices-0.647
eous-0.645
Token Off
Token ID3242
Feature activation-1.80e-01

Pos logit contributions
��+1.057
 vet+0.789
ibaba+0.705
 manipulative+0.697
yden+0.688
Neg logit contributions
present-0.617
 Bris-0.574
 Practices-0.526
eous-0.523
 Torn-0.523
Tokenering
Token ID1586
Feature activation-1.53e-01

Pos logit contributions
��+1.178
 vet+0.762
ibaba+0.684
yden+0.636
 manipulative+0.628
Neg logit contributions
present-1.270
 Bris-1.216
 Practices-1.150
 Torn-1.140
eous-1.127
Token Extended
Token ID24204
Feature activation-3.77e-01

Pos logit contributions
��+1.284
 vet+0.959
ibaba+0.850
 manipulative+0.849
 col+0.832
Neg logit contributions
present-0.782
 Bris-0.709
 Practices-0.656
eous-0.655
 Torn-0.654
Token
Token ID247
Feature activation-3.48e-02

Pos logit contributions
��+0.477
 vet+0.290
ibaba+0.245
 manipulative+0.236
yden+0.234
Neg logit contributions
present-0.580
 Bris-0.559
 Practices-0.533
 Torn-0.530
eous-0.523
Tokens
Token ID82
Feature activation-1.71e-01

Pos logit contributions
��+0.293
 vet+0.213
ibaba+0.193
 manipulative+0.189
yden+0.187
Neg logit contributions
present-0.183
 Bris-0.172
 Practices-0.162
 Torn-0.160
eous-0.157
Token many
Token ID867
Feature activation+2.16e-01

Pos logit contributions
��+1.254
 vet+0.885
 manipulative+0.775
ibaba+0.749
 col+0.736
Neg logit contributions
present-1.064
 Bris-1.019
 Practices-0.961
 Torn-0.953
eous-0.944
Token mistakes
Token ID10135
Feature activation-5.56e-02

Pos logit contributions
present+1.308
 Bris+1.243
 Practices+1.170
 Torn+1.161
eous+1.146
Neg logit contributions
��-1.656
 vet-1.166
 manipulative-1.022
ibaba-1.022
yden-0.995
Token,
Token ID11
Feature activation-2.60e-01

Pos logit contributions
��+0.470
 vet+0.344
ibaba+0.308
 manipulative+0.306
yden+0.303
Neg logit contributions
present-0.288
 Bris-0.272
 Practices-0.255
 Torn-0.252
eous-0.246
Token ignoring
Token ID15482
Feature activation+1.99e-02

Pos logit contributions
��+2.081
 vet+1.516
 manipulative+1.342
ibaba+1.335
yden+1.303
Neg logit contributions
present-1.447
 Bris-1.367
 Practices-1.283
eous-1.273
 Torn-1.271
Token the
Token ID262
Feature activation+2.74e-02

Pos logit contributions
present+0.120
 Bris+0.112
 Practices+0.107
 Torn+0.105
eous+0.105
Neg logit contributions
��-0.153
 vet-0.109
ibaba-0.095
 manipulative-0.095
yden-0.093
Token fact
Token ID1109
Feature activation-2.59e-01

Pos logit contributions
present+0.167
 Bris+0.158
 Practices+0.151
 Torn+0.148
eous+0.147
Neg logit contributions
��-0.207
 vet-0.146
 manipulative-0.128
ibaba-0.127
yden-0.124
Token that
Token ID326
Feature activation-6.18e-02

Pos logit contributions
��+2.058
 vet+1.480
ibaba+1.317
 manipulative+1.300
yden+1.290
Neg logit contributions
present-1.460
 Bris-1.370
 Practices-1.310
 Torn-1.279
eous-1.272
Token,
Token ID11
Feature activation-3.15e-01

Pos logit contributions
��+0.491
 vet+0.354
 manipulative+0.311
ibaba+0.311
yden+0.304
Neg logit contributions
present-0.353
 Bris-0.331
 Practices-0.316
 Torn-0.309
eous-0.307
Token to
Token ID284
Feature activation-2.45e-01

Pos logit contributions
��+2.473
 vet+1.815
 manipulative+1.590
ibaba+1.584
yden+1.544
Neg logit contributions
present-1.778
 Bris-1.666
 Practices-1.604
 Torn-1.557
eous-1.550
Token then
Token ID788
Feature activation+2.04e-01

Pos logit contributions
present+0.431
 Bris+0.406
 Practices+0.389
 Torn+0.382
eous+0.379
Neg logit contributions
��-0.516
 vet-0.359
ibaba-0.317
 manipulative-0.310
yden-0.306
Token Bernard
Token ID16197
Feature activation+2.49e-03

Pos logit contributions
present+1.268
 Bris+1.196
 Practices+1.143
 Torn+1.126
eous+1.112
Neg logit contributions
��-1.527
 vet-1.058
ibaba-0.942
 manipulative-0.915
yden-0.905
Token Sanders
Token ID5831
Feature activation-2.28e-01

Pos logit contributions
present+0.014
 Bris+0.013
 Practices+0.012
 Torn+0.012
eous+0.012
Neg logit contributions
��-0.020
 vet-0.015
 manipulative-0.013
ibaba-0.013
yden-0.013
Token is
Token ID318
Feature activation+5.09e-02

Pos logit contributions
��+1.916
 vet+1.409
ibaba+1.260
 manipulative+1.247
yden+1.230
Neg logit contributions
present-1.190
 Bris-1.118
 Practices-1.061
 Torn-1.039
eous-1.029
Token going
Token ID1016
Feature activation-6.98e-02

Pos logit contributions
present+0.318
 Bris+0.300
 Practices+0.287
 Torn+0.282
eous+0.279
Neg logit contributions
��-0.380
 vet-0.263
ibaba-0.234
 manipulative-0.228
yden-0.225
Token to
Token ID284
Feature activation+7.76e-04

Pos logit contributions
��+0.549
 vet+0.391
ibaba+0.352
 manipulative+0.341
yden+0.337
Neg logit contributions
present-0.406
 Bris-0.383
 Practices-0.365
 Torn-0.359
eous-0.352
Token be
Token ID307
Feature activation+4.94e-02

Pos logit contributions
present+0.005
 Bris+0.005
 Practices+0.004
 Torn+0.004
eous+0.004
Neg logit contributions
��-0.006
 vet-0.004
ibaba-0.004
 manipulative-0.004
yden-0.003
Token around
Token ID1088
Feature activation+1.68e-01

Pos logit contributions
present+0.307
 Bris+0.290
 Practices+0.277
 Torn+0.273
eous+0.270
Neg logit contributions
��-0.370
 vet-0.256
ibaba-0.227
 manipulative-0.222
yden-0.219
Token for
Token ID329
Feature activation+1.49e-01

Pos logit contributions
present+0.927
 Bris+0.868
 Practices+0.824
 Torn+0.812
eous+0.797
Neg logit contributions
��-1.376
 vet-0.986
ibaba-0.894
 manipulative-0.870
yden-0.863
Token a
Token ID257
Feature activation+1.17e-02

Pos logit contributions
present+0.936
 Bris+0.885
 Practices+0.844
 Torn+0.834
eous+0.822
Neg logit contributions
��-1.106
 vet-0.763
ibaba-0.679
 manipulative-0.659
yden-0.652
Token while
Token ID981
Feature activation-1.01e-02

Pos logit contributions
present+0.067
 Bris+0.064
 Practices+0.061
 Torn+0.060
eous+0.059
Neg logit contributions
��-0.091
 vet-0.066
ibaba-0.058
 manipulative-0.058
yden-0.057
Token absence
Token ID8889
Feature activation-1.73e-01

Pos logit contributions
present+0.548
 Bris+0.520
 Practices+0.495
 Torn+0.486
eous+0.484
Neg logit contributions
��-0.700
 vet-0.495
ibaba-0.433
 manipulative-0.431
 col-0.423
Token of
Token ID286
Feature activation-7.79e-03

Pos logit contributions
��+1.478
 vet+1.100
ibaba+0.993
 manipulative+0.977
yden+0.964
Neg logit contributions
present-0.863
 Bris-0.809
 Practices-0.764
 Torn-0.752
eous-0.736
Token v
Token ID410
Feature activation-1.89e-02

Pos logit contributions
��+0.060
 vet+0.043
ibaba+0.038
 manipulative+0.037
 col+0.036
Neg logit contributions
present-0.046
 Bris-0.044
 Practices-0.042
 Torn-0.041
eous-0.041
Tokenort
Token ID419
Feature activation+6.80e-03

Pos logit contributions
��+0.130
 vet+0.087
ibaba+0.077
 manipulative+0.074
yden+0.073
Neg logit contributions
present-0.128
 Bris-0.122
 Practices-0.117
 Torn-0.115
eous-0.114
Tokenons
Token ID684
Feature activation-9.05e-02

Pos logit contributions
present+0.029
 Bris+0.027
 Practices+0.025
 Torn+0.024
 Bind+0.024
Neg logit contributions
��-0.064
 vet-0.048
ibaba-0.045
yden-0.044
 manipulative-0.043
Token.
Token ID13
Feature activation-8.25e-02

Pos logit contributions
��+0.766
 vet+0.561
ibaba+0.506
 manipulative+0.497
yden+0.493
Neg logit contributions
present-0.464
 Bris-0.440
 Practices-0.414
 Torn-0.405
 Bind-0.396
Token V
Token ID569
Feature activation-2.82e-02

Pos logit contributions
��+0.687
 vet+0.508
ibaba+0.448
 manipulative+0.447
 col+0.442
Neg logit contributions
present-0.425
 Bris-0.400
eous-0.368
 Practices-0.367
 Torn-0.364
Tokenort
Token ID419
Feature activation-5.10e-02

Pos logit contributions
��+0.191
 vet+0.127
ibaba+0.111
yden+0.107
 manipulative+0.106
Neg logit contributions
present-0.193
 Bris-0.186
 Practices-0.178
 Torn-0.175
eous-0.172
Tokenons
Token ID684
Feature activation-4.69e-02

Pos logit contributions
��+0.474
 vet+0.356
ibaba+0.328
yden+0.321
 manipulative+0.320
Neg logit contributions
present-0.222
 Bris-0.208
 Practices-0.193
 Torn-0.187
 Bind-0.183
Token,
Token ID11
Feature activation-8.49e-02

Pos logit contributions
��+0.375
 vet+0.275
ibaba+0.241
 manipulative+0.239
yden+0.236
Neg logit contributions
present-0.260
 Bris-0.247
 Practices-0.233
 Torn-0.230
eous-0.225
Token meanwhile
Token ID15066
Feature activation-1.58e-01

Pos logit contributions
��+0.693
 vet+0.505
ibaba+0.452
 manipulative+0.446
yden+0.438
Neg logit contributions
present-0.460
 Bris-0.435
 Practices-0.411
 Torn-0.402
eous-0.398
Token been
Token ID587
Feature activation-2.68e-01

Pos logit contributions
present+0.041
 Bris+0.039
 Practices+0.037
 Torn+0.037
eous+0.036
Neg logit contributions
��-0.043
 vet-0.030
ibaba-0.026
 manipulative-0.025
yden-0.025
Token upgraded
Token ID17955
Feature activation-2.03e-01

Pos logit contributions
��+1.962
 vet+1.382
 manipulative+1.185
ibaba+1.170
yden+1.163
Neg logit contributions
present-1.655
 Bris-1.590
 Practices-1.532
eous-1.487
 Torn-1.486
Token regularly
Token ID7987
Feature activation-3.15e-01

Pos logit contributions
��+1.639
 vet+1.195
ibaba+1.053
 manipulative+1.046
yden+1.027
Neg logit contributions
present-1.118
 Bris-1.046
 Practices-1.003
 Torn-0.979
eous-0.969
Token over
Token ID625
Feature activation-1.90e-01

Pos logit contributions
��+2.564
 vet+1.871
ibaba+1.656
 manipulative+1.644
yden+1.626
Neg logit contributions
present-1.697
 Bris-1.611
 Practices-1.525
 Torn-1.495
 Bind-1.468
Token the
Token ID262
Feature activation-3.86e-01

Pos logit contributions
��+1.473
 vet+1.053
ibaba+0.923
 manipulative+0.920
yden+0.907
Neg logit contributions
present-1.110
 Bris-1.043
 Practices-1.001
eous-0.981
 Torn-0.975
Token years
Token ID812
Feature activation-5.05e-01

Pos logit contributions
��+2.920
 vet+2.116
 manipulative+1.830
 col+1.799
ibaba+1.778
Neg logit contributions
present-2.259
 Bris-2.130
 Practices-2.062
eous-2.038
 Torn-1.969
Token,
Token ID11
Feature activation-7.62e-03

Pos logit contributions
��+4.115
 vet+3.026
ibaba+2.670
 manipulative+2.664
 col+2.614
Neg logit contributions
present-2.675
 Bris-2.533
 Practices-2.410
 Torn-2.352
eous-2.319
Token had
Token ID550
Feature activation+2.69e-01

Pos logit contributions
��+0.063
 vet+0.046
ibaba+0.041
 manipulative+0.040
yden+0.040
Neg logit contributions
present-0.041
 Bris-0.039
 Practices-0.037
 Torn-0.036
eous-0.036
Token also
Token ID635
Feature activation+9.34e-02

Pos logit contributions
present+1.696
 Bris+1.605
 Practices+1.534
 Torn+1.512
eous+1.494
Neg logit contributions
��-1.984
 vet-1.366
ibaba-1.211
 manipulative-1.179
yden-1.165
Token been
Token ID587
Feature activation-1.65e-02

Pos logit contributions
present+0.619
 Bris+0.591
 Practices+0.567
 Torn+0.556
eous+0.555
Neg logit contributions
��-0.656
 vet-0.447
ibaba-0.383
 manipulative-0.379
yden-0.372
Token heavily
Token ID7272
Feature activation-5.51e-02

Pos logit contributions
��+0.119
 vet+0.082
ibaba+0.071
 manipulative+0.070
yden+0.069
Neg logit contributions
present-0.106
 Bris-0.101
 Practices-0.097
 Torn-0.095
eous-0.095
Token We
Token ID775
Feature activation-1.21e-01

Pos logit contributions
��+1.249
 vet+0.931
ibaba+0.842
 manipulative+0.819
 col+0.811
Neg logit contributions
present-0.712
 Bris-0.671
 Torn-0.618
 Practices-0.616
eous-0.615
Token
Token ID447
Feature activation-2.53e-02

Pos logit contributions
��+0.936
 vet+0.669
ibaba+0.591
 manipulative+0.575
yden+0.569
Neg logit contributions
present-0.712
 Bris-0.674
 Practices-0.639
 Torn-0.634
eous-0.625
Token
Token ID247
Feature activation-1.94e-03

Pos logit contributions
��+0.161
 vet+0.102
ibaba+0.088
 manipulative+0.085
yden+0.084
Neg logit contributions
present-0.186
 Bris-0.177
 Practices-0.171
 Torn-0.169
eous-0.167
Tokenre
Token ID260
Feature activation-7.72e-02

Pos logit contributions
��+0.017
 vet+0.012
ibaba+0.011
 manipulative+0.011
yden+0.011
Neg logit contributions
present-0.010
 Bris-0.009
 Practices-0.009
 Torn-0.009
eous-0.008
Token the
Token ID262
Feature activation-8.74e-02

Pos logit contributions
��+0.557
 vet+0.382
ibaba+0.334
 manipulative+0.326
yden+0.320
Neg logit contributions
present-0.498
 Bris-0.475
 Practices-0.452
 Torn-0.448
eous-0.442
Token only
Token ID691
Feature activation-5.13e-01

Pos logit contributions
��+0.679
 vet+0.490
ibaba+0.428
 manipulative+0.424
 col+0.412
Neg logit contributions
present-0.510
 Bris-0.482
 Practices-0.456
 Torn-0.454
eous-0.445
Token school
Token ID1524
Feature activation+1.92e-01

Pos logit contributions
��+3.595
 vet+2.537
ibaba+2.117
 manipulative+2.111
 col+2.104
Neg logit contributions
present-3.354
 Bris-3.177
 Practices-3.020
 Torn-3.005
eous-2.989
Token that
Token ID326
Feature activation+9.87e-02

Pos logit contributions
present+1.020
 Bris+0.953
 Practices+0.902
 Torn+0.889
eous+0.872
Neg logit contributions
��-1.615
 vet-1.170
ibaba-1.064
 manipulative-1.037
yden-1.028
Token offers
Token ID4394
Feature activation-9.45e-02

Pos logit contributions
present+0.475
 Bris+0.441
 Practices+0.414
 Torn+0.408
eous+0.400
Neg logit contributions
��-0.877
 vet-0.649
ibaba-0.593
 manipulative-0.580
yden-0.575
Token this
Token ID428
Feature activation-1.19e-01

Pos logit contributions
��+0.683
 vet+0.473
ibaba+0.411
 manipulative+0.404
yden+0.393
Neg logit contributions
present-0.607
 Bris-0.576
 Practices-0.547
 Torn-0.544
eous-0.538
Token in
Token ID287
Feature activation+6.32e-02

Pos logit contributions
��+0.929
 vet+0.660
ibaba+0.587
 manipulative+0.576
yden+0.563
Neg logit contributions
present-0.697
 Bris-0.657
 Practices-0.625
 Torn-0.617
eous-0.607
Token is
Token ID318
Feature activation-4.74e-02

Pos logit contributions
��+1.037
 vet+0.766
ibaba+0.692
 manipulative+0.682
 col+0.667
Neg logit contributions
present-0.632
 Bris-0.590
 Practices-0.554
 Torn-0.548
eous-0.532
Token crooked
Token ID45571
Feature activation-1.00e-01

Pos logit contributions
��+0.347
 vet+0.241
ibaba+0.212
 manipulative+0.209
yden+0.204
Neg logit contributions
present-0.299
 Bris-0.285
 Practices-0.272
 Torn-0.268
eous-0.262
Token cannot
Token ID2314
Feature activation-3.69e-01

Pos logit contributions
��+0.837
 vet+0.625
ibaba+0.561
 manipulative+0.554
 col+0.543
Neg logit contributions
present-0.516
 Bris-0.489
 Practices-0.454
 Torn-0.453
Hur-0.434
Token be
Token ID307
Feature activation-3.07e-01

Pos logit contributions
��+2.634
 vet+1.930
ibaba+1.615
 manipulative+1.614
 col+1.606
Neg logit contributions
present-2.311
 Bris-2.224
 Practices-2.096
 Torn-2.073
eous-2.015
Token made
Token ID925
Feature activation-3.83e-01

Pos logit contributions
��+2.205
 vet+1.563
 manipulative+1.351
ibaba+1.310
yden+1.303
Neg logit contributions
present-1.931
 Bris-1.835
 Practices-1.759
 Torn-1.723
eous-1.684
Token straight
Token ID3892
Feature activation-5.35e-01

Pos logit contributions
��+2.562
 vet+1.783
 manipulative+1.514
 col+1.446
yden+1.431
Neg logit contributions
present-2.597
 Bris-2.469
 Practices-2.382
 Torn-2.333
eous-2.284
Token;
Token ID26
Feature activation-3.33e-01

Pos logit contributions
��+4.409
 vet+3.290
ibaba+2.896
 manipulative+2.888
yden+2.862
Neg logit contributions
present-2.691
 Bris-2.588
 Practices-2.436
 Torn-2.388
 Bind-2.290
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
��+2.794
 vet+2.086
ibaba+1.906
 manipulative+1.868
 col+1.817
Neg logit contributions
present-1.681
 Bris-1.584
 Practices-1.469
 Torn-1.462
eous-1.413
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
��+1.323
 vet+1.009
ibaba+0.907
 manipulative+0.900
yden+0.888
Neg logit contributions
present-0.701
 Bris-0.673
 Practices-0.623
 Torn-0.613
eous-0.596
Tokenand
Token ID392
Feature activation-1.76e-01

Pos logit contributions
��+0.991
 vet+0.719
ibaba+0.660
 manipulative+0.640
yden+0.637
Neg logit contributions
present-0.620
 Bris-0.607
 Practices-0.567
 Torn-0.558
 Bind-0.543
Token that
Token ID326
Feature activation-2.87e-01

Pos logit contributions
��+1.338
 vet+0.955
ibaba+0.850
 manipulative+0.842
 col+0.811
Neg logit contributions
present-1.059
 Bris-0.995
 Practices-0.940
 Torn-0.930
eous-0.910
Token the
Token ID262
Feature activation+2.85e-01

Pos logit contributions
present+0.234
 Bris+0.221
 Practices+0.210
eous+0.205
 Torn+0.204
Neg logit contributions
��-0.337
 vet-0.242
ibaba-0.218
yden-0.213
 manipulative-0.212
Token Broncos
Token ID16582
Feature activation+1.97e-01

Pos logit contributions
present+1.562
 Bris+1.462
 Practices+1.385
 Torn+1.369
eous+1.337
Neg logit contributions
��-2.346
 vet-1.683
ibaba-1.533
 manipulative-1.487
yden-1.480
Token in
Token ID287
Feature activation+1.67e-01

Pos logit contributions
present+1.080
 Bris+1.011
 Practices+0.959
 Torn+0.945
eous+0.927
Neg logit contributions
��-1.625
 vet-1.168
ibaba-1.059
 manipulative-1.031
yden-1.023
Token 2016
Token ID1584
Feature activation+7.68e-03

Pos logit contributions
present+0.956
 Bris+0.898
 Practices+0.853
 Torn+0.841
eous+0.830
Neg logit contributions
��-1.328
 vet-0.944
ibaba-0.850
 manipulative-0.828
yden-0.820
Token as
Token ID355
Feature activation+2.22e-02

Pos logit contributions
present+0.039
 Bris+0.037
 Practices+0.035
 Torn+0.034
eous+0.034
Neg logit contributions
��-0.066
 vet-0.048
ibaba-0.044
 manipulative-0.043
yden-0.042
Token they
Token ID484
Feature activation-4.61e-01

Pos logit contributions
present+0.125
 Bris+0.117
 Practices+0.112
eous+0.110
 Torn+0.109
Neg logit contributions
��-0.179
 vet-0.128
ibaba-0.114
 manipulative-0.113
yden-0.112
Token missed
Token ID6825
Feature activation-9.96e-02

Pos logit contributions
��+3.292
 vet+2.356
ibaba+1.989
 manipulative+1.956
 col+1.938
Neg logit contributions
present-2.880
 Bris-2.794
 Practices-2.638
 Torn-2.614
eous-2.604
Token the
Token ID262
Feature activation+1.22e-01

Pos logit contributions
��+0.760
 vet+0.537
ibaba+0.476
 manipulative+0.468
 col+0.456
Neg logit contributions
present-0.599
 Bris-0.561
 Practices-0.533
eous-0.527
 Torn-0.525
Token postseason
Token ID22905
Feature activation+1.25e-02

Pos logit contributions
present+0.778
 Bris+0.737
 Practices+0.705
 Torn+0.695
eous+0.687
Neg logit contributions
��-0.887
 vet-0.611
ibaba-0.538
 manipulative-0.524
yden-0.512
Token.
Token ID13
Feature activation-5.94e-02

Pos logit contributions
present+0.066
 Bris+0.062
 Practices+0.058
 Torn+0.058
eous+0.057
Neg logit contributions
��-0.105
 vet-0.077
ibaba-0.070
 manipulative-0.068
yden-0.067
Token
Token ID198
Feature activation+8.63e-02

Pos logit contributions
��+0.516
 vet+0.385
ibaba+0.349
yden+0.342
 manipulative+0.339
Neg logit contributions
present-0.282
 Bris-0.261
eous-0.244
 Practices-0.242
 Bind-0.239
TokenWith
Token ID3152
Feature activation-4.40e-03

Pos logit contributions
present+0.232
 Bris+0.222
 Practices+0.210
 Torn+0.206
eous+0.201
Neg logit contributions
��-0.380
 vet-0.275
ibaba-0.250
 manipulative-0.244
yden-0.243
Token love
Token ID1842
Feature activation+9.54e-02

Pos logit contributions
��+0.033
 vet+0.023
ibaba+0.020
 manipulative+0.020
yden+0.020
Neg logit contributions
present-0.027
 Bris-0.025
 Practices-0.024
 Torn-0.024
eous-0.024
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
present+0.476
 Bris+0.440
 Practices+0.418
 Torn+0.407
eous+0.400
Neg logit contributions
��-0.828
 vet-0.610
ibaba-0.556
 manipulative-0.545
yden-0.540
Token light
Token ID1657
Feature activation-2.60e-01

Pos logit contributions
��+0.791
 vet+0.541
ibaba+0.465
 manipulative+0.461
yden+0.451
Neg logit contributions
present-0.750
 Bris-0.703
 Practices-0.683
 Torn-0.667
eous-0.663
Token
Token ID198
Feature activation-4.23e-01

Pos logit contributions
��+2.209
 vet+1.636
ibaba+1.465
 manipulative+1.455
 col+1.434
Neg logit contributions
present-1.312
 Bris-1.226
 Practices-1.156
 Torn-1.130
eous-1.094
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
��+3.512
 vet+2.696
ibaba+2.387
 col+2.378
 manipulative+2.375
Neg logit contributions
present-2.009
 Bris-1.953
 Practices-1.815
 Torn-1.782
eous-1.728
TokenGeorge
Token ID20191
Feature activation-8.03e-02

Pos logit contributions
��+3.902
 vet+2.842
ibaba+2.577
yden+2.543
 manipulative+2.523
Neg logit contributions
present-2.379
 Bris-2.306
 Practices-2.227
 Torn-2.145
 Bind-2.075
Token
Token ID198
Feature activation-1.34e-01

Pos logit contributions
��+0.696
 vet+0.519
ibaba+0.467
 manipulative+0.461
yden+0.456
Neg logit contributions
present-0.394
 Bris-0.361
 Practices-0.347
 Torn-0.333
eous-0.333
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
��+1.184
 vet+0.900
ibaba+0.813
 manipulative+0.801
yden+0.798
Neg logit contributions
present-0.613
 Bris-0.583
 Practices-0.543
 Torn-0.533
eous-0.518
Token……
Token ID7398
Feature activation+1.24e-01

Pos logit contributions
��+0.945
 vet+0.680
ibaba+0.617
yden+0.604
 manipulative+0.603
Neg logit contributions
present-0.607
 Bris-0.586
 Practices-0.561
 Torn-0.548
 Bind-0.531
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
present+0.615
 Bris+0.572
 Practices+0.539
 Torn+0.530
eous+0.519
Neg logit contributions
��-1.082
 vet-0.795
ibaba-0.727
 manipulative-0.710
yden-0.704