TOP ACTIVATIONS
MAX = 2.539

dude , Floyd and the Lizard King -- June 14 .
Adjust ing ? Cat ast rophic adjust ers are
group hypothesis , and then meta - reg ression if more
several back fields at the Padres spring training complex
and then ate some fruit salad . The cant
figures courtesy of Over The Cap . com unless otherwise indicated
arrest warrant . Wiki Leaks pr ides itself as
Next Page > <|endoftext|> Late in 2009 , Eating ...
view over one - h undred years later . The institution
said . <|endoftext|> The 2014 Padres were 63 - 1 when
an OC pony ( well known or otherwise ), I couldn
say she wore skirts and cocktail dresses to work , and
trade asset . And the Padres should have left him alone
unique feature of the 2014 Padres . In 2013 , the
operate under the US debt cap . Lew told Congress that
t iger salad that is
way of saying that the Padres had no need at all
bullying intervention programs ? A meta - analysis of intervention research
six innings . The 2014 Padres obviously had difficulties generating leads
w elfare as we know it , while

MIN ACTIVATIONS
MIN = -2.911

, c v 2 . COLOR _ B GR 2 G
't realize they 're being videot aped with their knowledge ?"
. in fact had two videot apes and one aud iot
. government to take this video down as it is dem
June 2 - 3 in Dayton , Ohio at a mock
Take a look the incredible footage below : <|endoftext|> With a
Sylvia 's locations in her stead . Today , Sylvia has
the hundreds of hours of videot ape of Abu Zub ay
claims filed by communities of color under Title VI of the
Audio evidence The FIR further quotes the contents of
T . J . Osh ie Nick las
spend the weekend watching this video of a traffic policeman helping
, T . J . Osh ie and Nick las Back
Scroll down and click on Vide opl ayer Input Stream .
to cast a woman of color to play Bat girl .
C OOK COUNTY CR IM INAL COUR TH OU SE
One man managed to capture footage of a be aver attacking
that many of these revealing cinem atics have been compressed to
the women and people of color who appeared onstage weren 't
is , therefore , in motion . What would

INTERVAL 1.177 - 2.539
CONTAINS 1.331%

at oder ma ( also known as " P or oker
A gentleman of genuine polit eness will not give
и ( о с о� е н
more as being youthful and friendly , but lonely and in
and counselling to help them deal with the emotional impact of

INTERVAL -0.186 - 1.177
CONTAINS 67.038%

you can watch porn and masturb ate for a long time
to recent poor news . This week , surveys
be in place and how it s run .
The temperature he felt through her clothes , it
found not guilty in the upcoming trial scheduled to begin on

INTERVAL -1.548 - -0.186
CONTAINS 31.298%

a co - founder and former chief executive of Hy att
answer ? Ol af : The answer is perfect
s about being better than everyone
in a proced urally generated city . This series of blog
enable these techniques . Even without them I believe we

INTERVAL -2.911 - -1.548
CONTAINS 0.333%

to build exposure to their wa res , said Jere mie
My sample of Got lands d rick a
) walks with Cuba 's Minister of Higher Education Rod olf
change in ton al / spect ral response at higher sensit
later , in 1976 , UNESCO ( the United Nations Educational
Token dude
Token ID18396
Feature activation+4.32e-01

Pos logit contributions
horn+0.081
anon+0.080
+0.080
scape+0.077
ocal+0.076
Neg logit contributions
 rapp-0.072
hiba-0.071
 hospitality-0.071
 defe-0.068
 welf-0.068
Token,
Token ID11
Feature activation+1.36e-01

Pos logit contributions
 rapp+0.850
 hospitality+0.842
hiba+0.841
 defe+0.806
 welf+0.801
Neg logit contributions
horn-0.692
anon-0.689
-0.682
scape-0.657
ocal-0.644
Token Floyd
Token ID21150
Feature activation-6.30e-01

Pos logit contributions
 rapp+0.231
 hospitality+0.229
hiba+0.228
 defe+0.218
 welf+0.216
Neg logit contributions
horn-0.256
anon-0.255
-0.252
scape-0.245
ocal-0.241
Token and
Token ID290
Feature activation-3.26e-01

Pos logit contributions
horn+1.107
anon+1.095
+1.083
scape+1.049
ocal+1.031
Neg logit contributions
 rapp-1.142
hiba-1.141
 hospitality-1.136
 defe-1.084
 welf-1.082
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
horn+0.594
anon+0.591
+0.585
scape+0.567
ocal+0.557
Neg logit contributions
 rapp-0.569
 hospitality-0.564
hiba-0.564
 defe-0.538
 welf-0.535
Token Lizard
Token ID42452
Feature activation+2.54e+00

Pos logit contributions
 rapp+0.184
 hospitality+0.182
hiba+0.181
 defe+0.173
 welf+0.171
Neg logit contributions
horn-0.208
anon-0.208
-0.205
scape-0.199
ocal-0.196
Token King
Token ID2677
Feature activation+2.77e-01

Pos logit contributions
 rapp+3.592
hiba+3.522
 hospitality+3.522
 defe+3.349
 welf+3.287
Neg logit contributions
anon-5.416
-5.384
horn-5.347
scape-5.169
ocal-5.125
Token --
Token ID1377
Feature activation+4.52e-01

Pos logit contributions
 rapp+0.513
 hospitality+0.508
hiba+0.507
 defe+0.485
 welf+0.481
Neg logit contributions
horn-0.476
anon-0.476
-0.471
scape-0.454
ocal-0.446
Token June
Token ID2795
Feature activation-7.10e-02

Pos logit contributions
 rapp+0.756
 hospitality+0.748
hiba+0.743
 defe+0.710
 welf+0.704
Neg logit contributions
horn-0.857
anon-0.857
-0.848
scape-0.821
ocal-0.809
Token 14
Token ID1478
Feature activation-2.41e-01

Pos logit contributions
horn+0.106
anon+0.106
+0.105
scape+0.101
ocal+0.099
Neg logit contributions
 rapp-0.147
 hospitality-0.146
hiba-0.145
 defe-0.140
 welf-0.139
Token.
Token ID13
Feature activation+2.91e-01

Pos logit contributions
horn+0.399
anon+0.394
+0.390
scape+0.378
ocal+0.370
Neg logit contributions
 rapp-0.464
hiba-0.461
 hospitality-0.460
 defe-0.442
 welf-0.439
Token Adjust
Token ID20292
Feature activation+6.78e-02

Pos logit contributions
 rapp+1.478
 hospitality+1.476
hiba+1.448
 defe+1.405
 welf+1.390
Neg logit contributions
horn-1.406
anon-1.391
-1.386
scape-1.328
ocal-1.310
Tokening
Token ID278
Feature activation-1.72e-01

Pos logit contributions
 rapp+0.121
 hospitality+0.120
hiba+0.120
 defe+0.115
 welf+0.114
Neg logit contributions
horn-0.120
anon-0.120
-0.118
scape-0.115
ocal-0.113
Token?
Token ID30
Feature activation-6.67e-01

Pos logit contributions
horn+0.246
anon+0.244
+0.241
scape+0.231
ocal+0.226
Neg logit contributions
 rapp-0.370
 hospitality-0.367
hiba-0.367
 defe-0.353
 welf-0.352
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
horn+1.119
anon+1.117
+1.105
scape+1.067
ocal+1.050
Neg logit contributions
 rapp-1.262
hiba-1.251
 hospitality-1.250
 defe-1.202
 welf-1.187
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
+0.196
horn+0.193
anon+0.193
ocal+0.184
scape+0.184
Neg logit contributions
 rapp-0.197
 hospitality-0.196
hiba-0.196
azine-0.187
 defe-0.184
TokenCat
Token ID21979
Feature activation+2.54e+00

Pos logit contributions
horn+0.027
anon+0.027
+0.026
scape+0.026
ocal+0.025
Neg logit contributions
 rapp-0.029
 hospitality-0.029
hiba-0.029
 defe-0.028
 welf-0.027
Tokenast
Token ID459
Feature activation-1.17e-01

Pos logit contributions
 rapp+4.687
 hospitality+4.641
hiba+4.597
 defe+4.420
azine+4.400
Neg logit contributions
horn-4.287
anon-4.223
-4.193
scape-3.996
OWN-3.929
Tokenrophic
Token ID18191
Feature activation+7.44e-01

Pos logit contributions
horn+0.255
anon+0.254
+0.251
scape+0.246
ocal+0.243
Neg logit contributions
 rapp-0.166
 hospitality-0.164
hiba-0.162
 defe-0.155
 welf-0.153
Token adjust
Token ID4532
Feature activation-3.10e-01

Pos logit contributions
 rapp+1.225
 hospitality+1.215
hiba+1.210
 defe+1.154
 welf+1.143
Neg logit contributions
horn-1.434
anon-1.427
-1.409
scape-1.368
ocal-1.345
Tokeners
Token ID364
Feature activation-9.52e-02

Pos logit contributions
horn+0.506
anon+0.504
+0.498
scape+0.481
ocal+0.472
Neg logit contributions
 rapp-0.600
 hospitality-0.595
hiba-0.594
 defe-0.570
 welf-0.567
Token are
Token ID389
Feature activation+2.42e-01

Pos logit contributions
horn+0.171
anon+0.171
+0.169
scape+0.164
ocal+0.161
Neg logit contributions
 rapp-0.167
 hospitality-0.166
hiba-0.166
 defe-0.159
 welf-0.158
Tokengroup
Token ID8094
Feature activation+8.08e-01

Pos logit contributions
 rapp+1.391
 hospitality+1.380
hiba+1.379
 defe+1.325
 welf+1.316
Neg logit contributions
horn-1.078
anon-1.072
-1.059
scape-1.020
ocal-0.999
Token hypothesis
Token ID14078
Feature activation+2.06e+00

Pos logit contributions
 rapp+1.430
 hospitality+1.421
hiba+1.402
 defe+1.363
 welf+1.340
Neg logit contributions
horn-1.448
anon-1.431
-1.419
scape-1.361
ocal-1.344
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
 rapp+3.612
 hospitality+3.596
hiba+3.538
 defe+3.477
 welf+3.391
Neg logit contributions
horn-3.636
anon-3.616
-3.545
OWN-3.387
scape-3.386
Token and
Token ID290
Feature activation+3.91e-02

Pos logit contributions
 rapp+0.321
 hospitality+0.318
hiba+0.317
 defe+0.303
 welf+0.301
Neg logit contributions
horn-0.328
anon-0.326
-0.322
scape-0.313
ocal-0.307
Token then
Token ID788
Feature activation+3.96e-01

Pos logit contributions
 rapp+0.063
 hospitality+0.063
hiba+0.063
 defe+0.060
 welf+0.059
Neg logit contributions
horn-0.076
anon-0.076
-0.075
scape-0.073
ocal-0.072
Token meta
Token ID13634
Feature activation+2.53e+00

Pos logit contributions
 rapp+0.617
 hospitality+0.610
hiba+0.609
 defe+0.580
 welf+0.575
Neg logit contributions
horn-0.798
anon-0.793
-0.785
scape-0.763
ocal-0.752
Token-
Token ID12
Feature activation+1.53e-01

Pos logit contributions
hiba+4.001
 rapp+3.974
 hospitality+3.932
azine+3.817
hler+3.712
Neg logit contributions
-4.746
horn-4.697
anon-4.655
scape-4.477
ocal-4.453
Tokenreg
Token ID2301
Feature activation+8.35e-01

Pos logit contributions
 rapp+0.306
 hospitality+0.304
hiba+0.303
 defe+0.291
 welf+0.290
Neg logit contributions
horn-0.239
anon-0.237
-0.235
scape-0.226
ocal-0.222
Tokenression
Token ID2234
Feature activation+7.62e-01

Pos logit contributions
 rapp+1.200
hiba+1.187
 hospitality+1.177
azine+1.132
 defe+1.113
Neg logit contributions
horn-1.760
anon-1.747
-1.746
scape-1.672
OWN-1.652
Token if
Token ID611
Feature activation-1.98e-01

Pos logit contributions
 rapp+1.356
 hospitality+1.347
hiba+1.337
 defe+1.293
 welf+1.273
Neg logit contributions
horn-1.367
anon-1.354
-1.345
scape-1.292
ocal-1.274
Token more
Token ID517
Feature activation+8.45e-02

Pos logit contributions
horn+0.373
anon+0.372
+0.368
scape+0.357
ocal+0.351
Neg logit contributions
 rapp-0.333
 hospitality-0.330
hiba-0.329
 defe-0.314
 welf-0.312
Token several
Token ID1811
Feature activation+5.37e-01

Pos logit contributions
 rapp+0.743
 hospitality+0.737
hiba+0.731
 defe+0.699
 welf+0.694
Neg logit contributions
horn-0.859
anon-0.854
-0.846
scape-0.821
ocal-0.809
Token back
Token ID736
Feature activation+7.70e-02

Pos logit contributions
 rapp+0.869
 hospitality+0.860
hiba+0.843
 defe+0.813
 welf+0.804
Neg logit contributions
horn-1.058
anon-1.052
-1.042
scape-1.011
ocal-0.996
Token fields
Token ID7032
Feature activation+7.04e-01

Pos logit contributions
hiba+0.137
 rapp+0.136
 hospitality+0.136
 defe+0.130
 welf+0.129
Neg logit contributions
horn-0.138
anon-0.137
-0.136
scape-0.132
ocal-0.129
Token at
Token ID379
Feature activation+7.36e-01

Pos logit contributions
 rapp+1.192
 hospitality+1.181
hiba+1.163
 defe+1.120
 welf+1.106
Neg logit contributions
horn-1.324
anon-1.320
-1.306
scape-1.258
ocal-1.245
Token the
Token ID262
Feature activation+4.27e-01

Pos logit contributions
 rapp+1.267
 hospitality+1.255
hiba+1.254
 defe+1.196
 welf+1.188
Neg logit contributions
horn-1.362
anon-1.356
-1.341
scape-1.301
ocal-1.279
Token Padres
Token ID40188
Feature activation+2.53e+00

Pos logit contributions
 rapp+0.685
 hospitality+0.679
hiba+0.676
 defe+0.644
 welf+0.639
Neg logit contributions
horn-0.841
anon-0.837
-0.829
scape-0.807
ocal-0.793
Token
Token ID447
Feature activation+9.10e-01

Pos logit contributions
 hospitality+4.318
 rapp+4.253
hiba+4.097
 welf+3.935
 defe+3.930
Neg logit contributions
anon-4.662
horn-4.625
-4.621
scape-4.465
ocal-4.402
Token
Token ID247
Feature activation+3.95e-01

Pos logit contributions
 rapp+2.239
 hospitality+2.227
hiba+2.174
 defe+2.173
azine+2.117
Neg logit contributions
anon-0.986
horn-0.976
scape-0.926
ocal-0.884
inding-0.875
Token spring
Token ID6076
Feature activation+2.39e-01

Pos logit contributions
 rapp+0.741
 hospitality+0.735
hiba+0.734
 defe+0.703
 welf+0.699
Neg logit contributions
horn-0.668
anon-0.665
-0.657
scape-0.636
ocal-0.624
Token training
Token ID3047
Feature activation+1.15e+00

Pos logit contributions
 rapp+0.322
hiba+0.320
 hospitality+0.318
 defe+0.300
 welf+0.297
Neg logit contributions
horn-0.531
anon-0.529
-0.524
scape-0.511
ocal-0.504
Token complex
Token ID3716
Feature activation+6.65e-01

Pos logit contributions
 hospitality+1.715
 rapp+1.714
hiba+1.650
 defe+1.587
 welf+1.571
Neg logit contributions
anon-2.388
horn-2.387
-2.373
scape-2.297
ocal-2.269
Token and
Token ID290
Feature activation+5.96e-01

Pos logit contributions
horn+0.579
anon+0.578
+0.573
scape+0.553
ocal+0.544
Neg logit contributions
 rapp-0.594
 hospitality-0.590
hiba-0.587
 defe-0.562
 welf-0.558
Token then
Token ID788
Feature activation+6.95e-01

Pos logit contributions
 hospitality+0.920
 rapp+0.914
hiba+0.887
 defe+0.859
 welf+0.851
Neg logit contributions
horn-1.222
anon-1.213
-1.203
scape-1.168
ocal-1.146
Token ate
Token ID15063
Feature activation+7.26e-01

Pos logit contributions
 rapp+1.223
 hospitality+1.217
hiba+1.197
 defe+1.156
 welf+1.143
Neg logit contributions
horn-1.266
anon-1.258
-1.248
scape-1.204
ocal-1.186
Token some
Token ID617
Feature activation+1.29e-01

Pos logit contributions
 rapp+1.146
 hospitality+1.144
hiba+1.123
 defe+1.080
 welf+1.071
Neg logit contributions
horn-1.444
anon-1.443
-1.431
scape-1.384
ocal-1.366
Token fruit
Token ID8234
Feature activation+8.31e-01

Pos logit contributions
 rapp+0.177
 hospitality+0.176
hiba+0.174
 defe+0.165
 welf+0.164
Neg logit contributions
horn-0.282
anon-0.282
-0.280
scape-0.272
ocal-0.268
Token salad
Token ID20698
Feature activation+2.51e+00

Pos logit contributions
 rapp+1.502
 hospitality+1.493
hiba+1.484
 defe+1.420
 welf+1.412
Neg logit contributions
anon-1.457
horn-1.457
-1.448
scape-1.387
ocal-1.373
Token.
Token ID13
Feature activation+2.97e-01

Pos logit contributions
 rapp+4.465
 hospitality+4.465
hiba+4.360
 waitress+4.212
 defe+4.159
Neg logit contributions
-4.409
anon-4.376
horn-4.352
scape-4.150
OWN-4.104
Token
Token ID198
Feature activation+5.84e-03

Pos logit contributions
 rapp+0.518
 hospitality+0.514
hiba+0.512
 defe+0.489
 welf+0.486
Neg logit contributions
horn-0.542
anon-0.540
-0.534
scape-0.518
ocal-0.509
Token
Token ID198
Feature activation+3.02e-02

Pos logit contributions
 rapp+0.010
 hospitality+0.010
hiba+0.010
azine+0.010
 defe+0.010
Neg logit contributions
-0.010
horn-0.010
anon-0.010
ocal-0.010
scape-0.010
TokenThe
Token ID464
Feature activation+1.53e-01

Pos logit contributions
 rapp+0.054
 hospitality+0.053
hiba+0.053
 defe+0.051
 welf+0.050
Neg logit contributions
horn-0.054
anon-0.054
-0.053
scape-0.052
ocal-0.051
Token cant
Token ID18548
Feature activation-3.90e-01

Pos logit contributions
 rapp+0.298
 hospitality+0.297
hiba+0.294
 defe+0.283
 welf+0.282
Neg logit contributions
horn-0.249
anon-0.248
-0.245
scape-0.237
ocal-0.232
Token figures
Token ID5538
Feature activation-1.02e-01

Pos logit contributions
 rapp+3.159
 hospitality+3.154
hiba+3.096
 defe+2.989
 welf+2.937
Neg logit contributions
anon-3.080
horn-3.073
-3.057
ocal-2.915
scape-2.915
Token courtesy
Token ID12537
Feature activation+7.35e-01

Pos logit contributions
horn+0.178
anon+0.177
+0.175
scape+0.170
ocal+0.167
Neg logit contributions
 rapp-0.187
 hospitality-0.185
hiba-0.185
 defe-0.177
 welf-0.176
Token of
Token ID286
Feature activation+3.93e-01

Pos logit contributions
 rapp+1.366
hiba+1.352
 hospitality+1.346
 defe+1.293
 welf+1.286
Neg logit contributions
horn-1.271
anon-1.264
-1.248
scape-1.207
ocal-1.183
Token Over
Token ID3827
Feature activation+3.66e-01

Pos logit contributions
 rapp+0.742
hiba+0.737
 hospitality+0.735
 defe+0.704
 welf+0.701
Neg logit contributions
horn-0.663
anon-0.659
-0.651
scape-0.630
ocal-0.618
TokenThe
Token ID464
Feature activation+2.47e-01

Pos logit contributions
 rapp+0.740
hiba+0.737
 hospitality+0.736
 defe+0.705
 welf+0.700
Neg logit contributions
anon-0.559
horn-0.558
-0.553
scape-0.528
ocal-0.520
TokenCap
Token ID15610
Feature activation+2.50e+00

Pos logit contributions
 rapp+0.326
hiba+0.325
 hospitality+0.321
 welf+0.303
 defe+0.302
Neg logit contributions
horn-0.563
anon-0.557
-0.552
scape-0.541
ocal-0.530
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 hospitality+4.581
hiba+4.534
 rapp+4.504
azine+4.339
 defe+4.298
Neg logit contributions
-4.131
anon-4.128
horn-4.107
scape-3.941
ocal-3.877
Tokencom
Token ID785
Feature activation+1.74e-02

Pos logit contributions
 rapp+0.457
 hospitality+0.453
hiba+0.452
 defe+0.433
 welf+0.430
Neg logit contributions
horn-0.431
anon-0.429
-0.424
scape-0.411
ocal-0.403
Token unless
Token ID4556
Feature activation-4.01e-01

Pos logit contributions
 rapp+0.034
 hospitality+0.034
hiba+0.034
 defe+0.033
 welf+0.032
Neg logit contributions
horn-0.028
anon-0.028
-0.027
scape-0.026
ocal-0.026
Token otherwise
Token ID4306
Feature activation-5.29e-01

Pos logit contributions
horn+0.798
anon+0.792
+0.787
scape+0.763
ocal+0.752
Neg logit contributions
 rapp-0.637
hiba-0.630
 hospitality-0.628
 defe-0.596
 welf-0.594
Token indicated
Token ID8203
Feature activation+9.47e-01

Pos logit contributions
anon+0.906
horn+0.896
+0.888
scape+0.859
ocal+0.840
Neg logit contributions
 rapp-0.992
 hospitality-0.984
hiba-0.977
 defe-0.934
 welf-0.917
Token arrest
Token ID3251
Feature activation-5.98e-01

Pos logit contributions
 rapp+0.176
hiba+0.176
 hospitality+0.174
 defe+0.166
 welf+0.166
Neg logit contributions
horn-0.205
anon-0.204
-0.201
scape-0.196
ocal-0.193
Token warrant
Token ID8084
Feature activation-1.22e+00

Pos logit contributions
horn+1.155
anon+1.148
+1.137
scape+1.107
ocal+1.086
Neg logit contributions
 rapp-0.980
hiba-0.974
 hospitality-0.970
 defe-0.923
 welf-0.917
Token.
Token ID13
Feature activation-4.00e-02

Pos logit contributions
horn+1.870
anon+1.858
+1.832
scape+1.773
ocal+1.731
Neg logit contributions
 rapp-2.459
hiba-2.449
 hospitality-2.444
 welf-2.350
 defe-2.342
Token
Token ID198
Feature activation-1.01e-01

Pos logit contributions
horn+0.063
anon+0.063
+0.062
scape+0.060
ocal+0.058
Neg logit contributions
 rapp-0.080
 hospitality-0.080
hiba-0.079
 defe-0.077
 welf-0.076
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
+0.177
horn+0.175
anon+0.174
ocal+0.167
scape+0.167
Neg logit contributions
 rapp-0.182
 hospitality-0.181
hiba-0.180
azine-0.171
 defe-0.171
TokenWiki
Token ID32603
Feature activation+2.49e+00

Pos logit contributions
horn+0.181
anon+0.181
+0.178
scape+0.173
ocal+0.169
Neg logit contributions
 rapp-0.189
 hospitality-0.188
hiba-0.187
 defe-0.179
 welf-0.178
TokenLeaks
Token ID17874
Feature activation+9.66e-01

Pos logit contributions
hiba+1.949
 rapp+1.826
 hospitality+1.824
azine+1.595
 defe+1.526
Neg logit contributions
-6.855
horn-6.812
anon-6.800
ocal-6.624
scape-6.583
Token pr
Token ID778
Feature activation-4.21e-01

Pos logit contributions
 rapp+1.750
 hospitality+1.735
hiba+1.711
 defe+1.652
 welf+1.622
Neg logit contributions
horn-1.702
anon-1.691
-1.685
ocal-1.614
scape-1.614
Tokenides
Token ID1460
Feature activation-5.53e-01

Pos logit contributions
horn+0.523
anon+0.519
+0.512
scape+0.489
ocal+0.475
Neg logit contributions
 rapp-0.979
 hospitality-0.971
hiba-0.971
 defe-0.937
 welf-0.932
Token itself
Token ID2346
Feature activation+8.29e-01

Pos logit contributions
horn+0.991
+0.973
anon+0.971
scape+0.941
ocal+0.924
Neg logit contributions
hiba-0.989
 rapp-0.987
 hospitality-0.973
 defe-0.939
 welf-0.929
Token as
Token ID355
Feature activation-4.49e-01

Pos logit contributions
 rapp+1.686
hiba+1.681
 hospitality+1.671
 defe+1.607
 welf+1.598
Neg logit contributions
horn-1.282
anon-1.269
-1.261
scape-1.216
ocal-1.185
TokenNext
Token ID10019
Feature activation+1.06e+00

Pos logit contributions
horn+0.508
anon+0.507
+0.499
scape+0.483
ocal+0.473
Neg logit contributions
 rapp-0.573
 hospitality-0.569
hiba-0.567
 defe-0.543
 welf-0.541
Token Page
Token ID7873
Feature activation+1.22e+00

Pos logit contributions
 rapp+1.838
 hospitality+1.816
hiba+1.813
 defe+1.735
 welf+1.718
Neg logit contributions
horn-1.950
anon-1.936
-1.913
scape-1.860
ocal-1.832
Token
Token ID851
Feature activation+5.51e-01

Pos logit contributions
 rapp+2.245
 hospitality+2.213
hiba+2.195
 defe+2.117
azine+2.090
Neg logit contributions
horn-2.114
anon-2.104
-2.076
scape-2.014
ocal-1.973
Token>
Token ID29
Feature activation+6.28e-01

Pos logit contributions
 rapp+1.009
 hospitality+0.999
hiba+0.995
 defe+0.955
 welf+0.945
Neg logit contributions
horn-0.959
anon-0.953
-0.943
scape-0.914
ocal-0.899
Token<|endoftext|>
Token ID50256
Feature activation+7.20e-01

Pos logit contributions
 rapp+1.163
 hospitality+1.151
hiba+1.148
 defe+1.102
 welf+1.092
Neg logit contributions
horn-1.077
anon-1.069
-1.061
scape-1.025
ocal-1.009
TokenLate
Token ID26302
Feature activation+2.48e+00

Pos logit contributions
 rapp+1.355
hiba+1.350
 hospitality+1.341
 defe+1.288
azine+1.278
Neg logit contributions
horn-1.190
-1.180
anon-1.176
scape-1.130
ocal-1.116
Token in
Token ID287
Feature activation+2.02e-02

Pos logit contributions
 hospitality+3.845
 rapp+3.824
hiba+3.735
 defe+3.629
 welf+3.589
Neg logit contributions
anon-4.856
horn-4.832
-4.828
ocal-4.607
scape-4.605
Token 2009
Token ID3717
Feature activation-1.86e-01

Pos logit contributions
hiba+0.035
 rapp+0.035
 hospitality+0.035
 defe+0.033
 welf+0.033
Neg logit contributions
horn-0.037
anon-0.037
-0.036
scape-0.035
ocal-0.034
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
horn+0.317
anon+0.315
+0.312
scape+0.303
ocal+0.295
Neg logit contributions
 rapp-0.347
 hospitality-0.344
hiba-0.344
 defe-0.330
 welf-0.327
Token Eating
Token ID37537
Feature activation+2.65e-01

Pos logit contributions
 rapp+0.229
 hospitality+0.227
hiba+0.226
 defe+0.216
 welf+0.215
Neg logit contributions
horn-0.235
anon-0.234
-0.231
scape-0.224
ocal-0.220
Token...
Token ID986
Feature activation-2.36e-01

Pos logit contributions
 rapp+0.484
 hospitality+0.480
hiba+0.480
 defe+0.459
 welf+0.457
Neg logit contributions
horn-0.458
anon-0.457
-0.453
scape-0.437
ocal-0.430
Token view
Token ID1570
Feature activation-2.52e-01

Pos logit contributions
 rapp+1.199
 hospitality+1.186
hiba+1.157
 defe+1.095
 welf+1.088
Neg logit contributions
anon-2.103
horn-2.093
-2.086
scape-2.007
ocal-2.001
Token over
Token ID625
Feature activation+7.90e-01

Pos logit contributions
horn+0.444
anon+0.442
+0.436
scape+0.424
ocal+0.416
Neg logit contributions
 rapp-0.456
hiba-0.452
 hospitality-0.451
 defe-0.432
 welf-0.430
Token one
Token ID530
Feature activation+2.38e-01

Pos logit contributions
 rapp+1.323
 hospitality+1.312
hiba+1.308
 defe+1.247
 welf+1.238
Neg logit contributions
horn-1.499
anon-1.493
-1.477
scape-1.433
ocal-1.410
Token-
Token ID12
Feature activation+6.54e-01

Pos logit contributions
 rapp+0.370
hiba+0.367
 hospitality+0.367
 defe+0.348
 welf+0.345
Neg logit contributions
horn-0.479
anon-0.477
-0.472
scape-0.460
ocal-0.452
Tokenh
Token ID71
Feature activation+4.67e-01

Pos logit contributions
 rapp+1.196
hiba+1.184
 hospitality+1.184
 defe+1.131
 welf+1.123
Neg logit contributions
horn-1.130
anon-1.129
-1.119
scape-1.080
ocal-1.062
Tokenundred
Token ID3229
Feature activation+2.46e+00

Pos logit contributions
 rapp+0.824
 hospitality+0.816
hiba+0.815
 defe+0.779
 welf+0.772
Neg logit contributions
horn-0.841
anon-0.839
-0.831
scape-0.804
ocal-0.789
Token years
Token ID812
Feature activation+4.56e-01

Pos logit contributions
 rapp+2.560
 hospitality+2.538
hiba+2.380
 defe+2.284
azine+2.223
Neg logit contributions
anon-6.100
horn-6.072
-6.060
scape-5.899
ocal-5.785
Token later
Token ID1568
Feature activation+2.45e-01

Pos logit contributions
 rapp+0.780
 hospitality+0.774
hiba+0.770
 defe+0.734
 welf+0.727
Neg logit contributions
horn-0.847
anon-0.846
-0.839
scape-0.810
ocal-0.798
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 rapp+0.427
 hospitality+0.423
hiba+0.422
 defe+0.403
 welf+0.400
Neg logit contributions
horn-0.448
anon-0.446
-0.441
scape-0.427
ocal-0.420
Token The
Token ID383
Feature activation-6.91e-02

Pos logit contributions
 rapp+0.203
 hospitality+0.202
hiba+0.201
 defe+0.191
 welf+0.189
Neg logit contributions
horn-0.235
anon-0.234
-0.232
scape-0.225
ocal-0.221
Token institution
Token ID9901
Feature activation-6.60e-01

Pos logit contributions
horn+0.141
anon+0.141
+0.139
scape+0.136
ocal+0.133
Neg logit contributions
 rapp-0.105
hiba-0.105
 hospitality-0.104
 defe-0.099
 welf-0.098
Token said
Token ID531
Feature activation-2.18e-01

Pos logit contributions
horn+0.782
anon+0.780
+0.762
scape+0.747
ocal+0.732
Neg logit contributions
hiba-0.801
 rapp-0.796
 hospitality-0.792
 defe-0.758
 welf-0.755
Token.
Token ID13
Feature activation+5.84e-02

Pos logit contributions
horn+0.320
anon+0.316
+0.311
scape+0.300
ocal+0.295
Neg logit contributions
 rapp-0.458
hiba-0.456
 hospitality-0.456
 defe-0.440
 welf-0.439
Token<|endoftext|>
Token ID50256
Feature activation+5.08e-01

Pos logit contributions
 rapp+0.109
 hospitality+0.108
hiba+0.108
 defe+0.103
 welf+0.102
Neg logit contributions
horn-0.100
anon-0.100
-0.099
scape-0.095
ocal-0.094
TokenThe
Token ID464
Feature activation+4.33e-01

Pos logit contributions
 rapp+0.958
hiba+0.954
 hospitality+0.948
 defe+0.910
azine+0.902
Neg logit contributions
horn-0.843
-0.835
anon-0.835
scape-0.801
ocal-0.790
Token 2014
Token ID1946
Feature activation-1.36e-01

Pos logit contributions
 rapp+0.744
 hospitality+0.738
hiba+0.736
 defe+0.702
 welf+0.697
Neg logit contributions
horn-0.803
anon-0.799
-0.791
scape-0.767
ocal-0.754
Token Padres
Token ID40188
Feature activation+2.45e+00

Pos logit contributions
horn+0.249
anon+0.248
+0.245
scape+0.238
ocal+0.234
Neg logit contributions
 rapp-0.236
hiba-0.234
 hospitality-0.234
 defe-0.223
 welf-0.222
Token were
Token ID547
Feature activation+5.97e-01

Pos logit contributions
 hospitality+4.169
 rapp+4.158
hiba+4.028
 defe+3.877
 welf+3.872
Neg logit contributions
anon-4.511
horn-4.507
-4.461
scape-4.307
ocal-4.280
Token 63
Token ID8093
Feature activation-1.28e-01

Pos logit contributions
 rapp+1.028
 hospitality+1.017
hiba+1.003
 defe+0.970
 welf+0.952
Neg logit contributions
horn-1.110
anon-1.109
-1.096
scape-1.057
ocal-1.047
Token-
Token ID12
Feature activation+1.77e-01

Pos logit contributions
horn+0.212
anon+0.211
+0.209
scape+0.202
ocal+0.198
Neg logit contributions
 rapp-0.244
hiba-0.242
 hospitality-0.242
 defe-0.232
 welf-0.230
Token1
Token ID16
Feature activation+5.28e-02

Pos logit contributions
 rapp+0.401
hiba+0.399
 hospitality+0.398
 defe+0.382
azine+0.381
Neg logit contributions
horn-0.226
-0.226
anon-0.226
scape-0.213
ocal-0.209
Token when
Token ID618
Feature activation+9.71e-01

Pos logit contributions
 rapp+0.097
 hospitality+0.096
hiba+0.095
 defe+0.092
 welf+0.091
Neg logit contributions
horn-0.092
anon-0.092
-0.091
scape-0.088
ocal-0.086
Token an
Token ID281
Feature activation+1.57e-01

Pos logit contributions
 rapp+0.093
 hospitality+0.092
hiba+0.091
 defe+0.088
 welf+0.087
Neg logit contributions
horn-0.080
anon-0.079
-0.079
scape-0.076
ocal-0.074
Token OC
Token ID24775
Feature activation-1.14e-01

Pos logit contributions
 rapp+0.266
 hospitality+0.263
hiba+0.262
 defe+0.251
 welf+0.248
Neg logit contributions
horn-0.294
anon-0.292
-0.291
scape-0.282
ocal-0.277
Token pony
Token ID26902
Feature activation+1.66e+00

Pos logit contributions
horn+0.198
anon+0.198
+0.195
scape+0.189
ocal+0.186
Neg logit contributions
 rapp-0.210
hiba-0.209
 hospitality-0.208
 defe-0.199
 welf-0.198
Token (
Token ID357
Feature activation+6.09e-02

Pos logit contributions
 rapp+2.938
 hospitality+2.885
hiba+2.828
 defe+2.771
azine+2.744
Neg logit contributions
horn-2.929
-2.903
anon-2.901
scape-2.788
ocal-2.752
Tokenwell
Token ID4053
Feature activation+1.30e+00

Pos logit contributions
 rapp+0.125
 hospitality+0.124
hiba+0.124
 defe+0.119
 welf+0.118
Neg logit contributions
horn-0.092
anon-0.092
-0.091
scape-0.087
ocal-0.086
Token known
Token ID1900
Feature activation+2.45e+00

Pos logit contributions
 rapp+2.282
 hospitality+2.253
hiba+2.208
 defe+2.150
 welf+2.124
Neg logit contributions
anon-2.321
horn-2.312
-2.292
scape-2.224
ocal-2.176
Token or
Token ID393
Feature activation+5.07e-01

Pos logit contributions
 rapp+4.095
 hospitality+4.068
hiba+3.908
 defe+3.881
 welf+3.832
Neg logit contributions
horn-4.473
-4.463
anon-4.388
scape-4.278
ocal-4.204
Token otherwise
Token ID4306
Feature activation+5.93e-02

Pos logit contributions
 rapp+0.759
 hospitality+0.751
hiba+0.744
 defe+0.710
 welf+0.701
Neg logit contributions
horn-1.050
anon-1.045
-1.039
scape-1.011
ocal-0.993
Token),
Token ID828
Feature activation+8.33e-02

Pos logit contributions
 rapp+0.102
 hospitality+0.101
hiba+0.100
 defe+0.096
 welf+0.095
Neg logit contributions
horn-0.110
anon-0.110
-0.109
scape-0.105
ocal-0.103
Token I
Token ID314
Feature activation+5.14e-02

Pos logit contributions
 rapp+0.152
 hospitality+0.151
hiba+0.151
 defe+0.144
 welf+0.143
Neg logit contributions
horn-0.146
anon-0.145
-0.143
scape-0.139
ocal-0.136
Token couldn
Token ID3521
Feature activation+3.07e-01

Pos logit contributions
 rapp+0.095
 hospitality+0.094
hiba+0.094
 defe+0.090
 welf+0.089
Neg logit contributions
horn-0.089
anon-0.088
-0.087
scape-0.085
ocal-0.083
Token say
Token ID910
Feature activation+1.61e-01

Pos logit contributions
 rapp+2.389
 hospitality+2.361
hiba+2.316
 defe+2.225
 waitress+2.192
Neg logit contributions
horn-2.605
anon-2.583
-2.572
scape-2.479
ocal-2.428
Token she
Token ID673
Feature activation-1.58e-01

Pos logit contributions
 rapp+0.268
 hospitality+0.265
hiba+0.265
 defe+0.252
 welf+0.250
Neg logit contributions
horn-0.306
anon-0.305
-0.302
scape-0.293
ocal-0.288
Token wore
Token ID12408
Feature activation+3.66e-02

Pos logit contributions
horn+0.272
anon+0.272
+0.267
scape+0.260
ocal+0.255
Neg logit contributions
 rapp-0.290
hiba-0.289
 hospitality-0.288
 defe-0.276
 welf-0.275
Token skirts
Token ID42370
Feature activation+5.24e-01

Pos logit contributions
 rapp+0.061
hiba+0.060
 hospitality+0.060
 defe+0.057
 welf+0.057
Neg logit contributions
horn-0.070
anon-0.070
-0.069
scape-0.067
ocal-0.066
Token and
Token ID290
Feature activation+2.30e-01

Pos logit contributions
 rapp+0.938
 hospitality+0.925
hiba+0.924
 defe+0.879
 welf+0.871
Neg logit contributions
horn-0.934
anon-0.933
-0.923
scape-0.894
ocal-0.876
Token cocktail
Token ID24554
Feature activation+2.43e+00

Pos logit contributions
 rapp+0.364
 hospitality+0.359
hiba+0.357
 defe+0.339
 welf+0.337
Neg logit contributions
horn-0.459
anon-0.457
-0.453
scape-0.440
ocal-0.434
Token dresses
Token ID27309
Feature activation+1.19e+00

Pos logit contributions
 rapp+2.905
 hospitality+2.829
hiba+2.758
 waitress+2.615
 defe+2.589
Neg logit contributions
anon-5.762
horn-5.740
-5.717
scape-5.562
ocal-5.522
Token to
Token ID284
Feature activation+2.32e-01

Pos logit contributions
 rapp+2.101
 hospitality+2.064
hiba+2.051
 defe+1.946
azine+1.927
Neg logit contributions
anon-2.167
horn-2.161
-2.148
scape-2.068
ocal-2.035
Token work
Token ID670
Feature activation+1.01e+00

Pos logit contributions
 rapp+0.328
 hospitality+0.325
hiba+0.324
 defe+0.305
 welf+0.303
Neg logit contributions
horn-0.500
anon-0.499
-0.494
scape-0.481
ocal-0.474
Token,
Token ID11
Feature activation+9.61e-03

Pos logit contributions
 rapp+1.823
 hospitality+1.816
hiba+1.795
 defe+1.711
 welf+1.695
Neg logit contributions
anon-1.800
horn-1.792
-1.767
scape-1.712
ocal-1.682
Token and
Token ID290
Feature activation-1.73e-01

Pos logit contributions
 rapp+0.017
 hospitality+0.017
hiba+0.016
 defe+0.016
 welf+0.016
Neg logit contributions
horn-0.018
anon-0.018
-0.017
scape-0.017
ocal-0.017
Token trade
Token ID3292
Feature activation+4.96e-01

Pos logit contributions
horn+0.143
anon+0.142
+0.141
scape+0.136
ocal+0.134
Neg logit contributions
 rapp-0.123
 hospitality-0.122
hiba-0.122
 defe-0.116
 welf-0.115
Token asset
Token ID11171
Feature activation+7.54e-01

Pos logit contributions
 rapp+0.841
 hospitality+0.830
hiba+0.826
 defe+0.793
 welf+0.781
Neg logit contributions
horn-0.932
anon-0.925
-0.921
scape-0.892
ocal-0.878
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
 rapp+1.308
 hospitality+1.291
hiba+1.282
 defe+1.235
 welf+1.211
Neg logit contributions
horn-1.380
anon-1.378
-1.366
scape-1.318
ocal-1.301
Token And
Token ID843
Feature activation-2.08e-02

Pos logit contributions
 rapp+0.448
 hospitality+0.444
hiba+0.442
 defe+0.422
 welf+0.419
Neg logit contributions
horn-0.459
anon-0.456
-0.453
scape-0.439
ocal-0.431
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
horn+0.039
anon+0.039
+0.038
scape+0.037
ocal+0.036
Neg logit contributions
 rapp-0.035
 hospitality-0.035
hiba-0.035
 defe-0.033
 welf-0.033
Token Padres
Token ID40188
Feature activation+2.42e+00

Pos logit contributions
 rapp+0.206
hiba+0.204
 hospitality+0.204
 defe+0.192
 welf+0.190
Neg logit contributions
horn-0.339
anon-0.338
-0.335
scape-0.326
ocal-0.322
Token should
Token ID815
Feature activation-5.29e-01

Pos logit contributions
 rapp+4.196
 hospitality+4.146
hiba+4.042
 defe+3.945
 welf+3.845
Neg logit contributions
horn-4.404
anon-4.325
-4.302
scape-4.155
ocal-4.097
Token have
Token ID423
Feature activation+4.00e-01

Pos logit contributions
horn+0.965
anon+0.963
+0.950
scape+0.922
ocal+0.906
Neg logit contributions
 rapp-0.915
 hospitality-0.914
hiba-0.912
 defe-0.870
 welf-0.869
Token left
Token ID1364
Feature activation-4.12e-01

Pos logit contributions
 rapp+0.762
 hospitality+0.752
hiba+0.743
 defe+0.720
 welf+0.711
Neg logit contributions
horn-0.673
anon-0.669
-0.662
scape-0.637
ocal-0.627
Token him
Token ID683
Feature activation+5.23e-01

Pos logit contributions
horn+0.760
anon+0.756
+0.747
scape+0.725
ocal+0.713
Neg logit contributions
 rapp-0.712
hiba-0.710
 hospitality-0.705
 defe-0.674
 welf-0.669
Token alone
Token ID3436
Feature activation+1.03e-01

Pos logit contributions
 rapp+0.936
 hospitality+0.925
hiba+0.916
 defe+0.884
 welf+0.873
Neg logit contributions
horn-0.930
anon-0.930
-0.922
scape-0.891
ocal-0.876
Token unique
Token ID3748
Feature activation-6.96e-01

Pos logit contributions
horn+0.580
anon+0.577
+0.571
scape+0.556
ocal+0.547
Neg logit contributions
 rapp-0.464
 hospitality-0.459
hiba-0.459
 defe-0.435
 welf-0.432
Token feature
Token ID3895
Feature activation-1.11e+00

Pos logit contributions
horn+1.375
anon+1.372
+1.357
scape+1.320
ocal+1.298
Neg logit contributions
 rapp-1.104
hiba-1.099
 hospitality-1.092
 defe-1.037
 welf-1.032
Token of
Token ID286
Feature activation-6.27e-02

Pos logit contributions
horn+1.833
anon+1.797
+1.782
scape+1.749
ocal+1.690
Neg logit contributions
hiba-2.140
 rapp-2.133
 hospitality-2.109
 welf-2.028
 defe-2.021
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
horn+0.119
anon+0.119
+0.117
scape+0.114
ocal+0.112
Neg logit contributions
hiba-0.105
 rapp-0.104
 hospitality-0.103
 defe-0.099
 welf-0.099
Token 2014
Token ID1946
Feature activation-4.13e-01

Pos logit contributions
horn+0.298
anon+0.297
+0.294
scape+0.287
ocal+0.282
Neg logit contributions
 rapp-0.205
hiba-0.203
 hospitality-0.202
 defe-0.191
 welf-0.190
Token Padres
Token ID40188
Feature activation+2.41e+00

Pos logit contributions
anon+1.015
horn+1.015
+1.005
scape+0.983
ocal+0.969
Neg logit contributions
hiba-0.454
 rapp-0.453
 hospitality-0.446
 defe-0.416
 welf-0.413
Token.
Token ID13
Feature activation+2.73e-01

Pos logit contributions
 hospitality+4.325
 rapp+4.316
hiba+4.204
 defe+4.065
 welf+4.017
Neg logit contributions
horn-4.198
anon-4.161
-4.115
scape-3.932
ocal-3.890
Token In
Token ID554
Feature activation-1.68e-01

Pos logit contributions
 rapp+0.473
 hospitality+0.469
hiba+0.466
 defe+0.443
azine+0.441
Neg logit contributions
horn-0.499
anon-0.496
-0.492
scape-0.477
ocal-0.470
Token 2013
Token ID2211
Feature activation-1.21e+00

Pos logit contributions
horn+0.313
anon+0.312
+0.309
scape+0.300
ocal+0.294
Neg logit contributions
 rapp-0.287
hiba-0.285
 hospitality-0.284
 defe-0.272
 welf-0.270
Token,
Token ID11
Feature activation-4.46e-01

Pos logit contributions
horn+1.966
anon+1.948
+1.938
scape+1.870
ocal+1.820
Neg logit contributions
hiba-2.341
 rapp-2.324
 hospitality-2.300
 welf-2.230
 defe-2.229
Token the
Token ID262
Feature activation-3.41e-01

Pos logit contributions
horn+0.828
anon+0.826
+0.816
scape+0.792
ocal+0.777
Neg logit contributions
 rapp-0.758
hiba-0.756
 hospitality-0.750
 defe-0.718
 welf-0.717
Token operate
Token ID8076
Feature activation+7.04e-01

Pos logit contributions
 rapp+0.088
hiba+0.088
 hospitality+0.088
 welf+0.083
 defe+0.083
Neg logit contributions
horn-0.124
anon-0.124
-0.123
scape-0.120
ocal-0.117
Token under
Token ID739
Feature activation-6.54e-01

Pos logit contributions
 rapp+1.228
 hospitality+1.217
hiba+1.212
 defe+1.163
 welf+1.154
Neg logit contributions
horn-1.287
anon-1.281
-1.265
scape-1.222
ocal-1.206
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
horn+1.176
anon+1.165
+1.164
scape+1.132
ocal+1.109
Neg logit contributions
 rapp-1.147
hiba-1.140
 hospitality-1.139
 defe-1.080
 welf-1.078
Token US
Token ID1294
Feature activation+9.10e-01

Pos logit contributions
 rapp+0.238
 hospitality+0.236
hiba+0.235
 defe+0.224
 welf+0.222
Neg logit contributions
horn-0.295
anon-0.294
-0.291
scape-0.283
ocal-0.278
Token debt
Token ID5057
Feature activation+6.16e-01

Pos logit contributions
 rapp+1.573
 hospitality+1.567
hiba+1.548
 defe+1.490
 welf+1.479
Neg logit contributions
horn-1.661
anon-1.657
-1.638
scape-1.581
ocal-1.558
Token cap
Token ID1451
Feature activation+2.41e+00

Pos logit contributions
 rapp+0.845
 hospitality+0.833
hiba+0.827
 defe+0.790
 welf+0.779
Neg logit contributions
horn-1.356
anon-1.355
-1.341
scape-1.300
ocal-1.291
Token.
Token ID13
Feature activation+5.50e-01

Pos logit contributions
 rapp+4.141
 hospitality+4.094
hiba+4.070
 defe+3.930
 theorem+3.899
Neg logit contributions
anon-4.276
-4.245
horn-4.208
ocal-4.069
OWN-4.037
Token Lew
Token ID8260
Feature activation+7.83e-01

Pos logit contributions
 rapp+1.086
hiba+1.078
 hospitality+1.077
 defe+1.032
 welf+1.027
Neg logit contributions
horn-0.876
anon-0.873
-0.860
scape-0.829
ocal-0.816
Token told
Token ID1297
Feature activation-6.21e-01

Pos logit contributions
 rapp+1.432
hiba+1.407
 hospitality+1.403
 defe+1.349
azine+1.328
Neg logit contributions
horn-1.363
anon-1.355
-1.350
scape-1.299
ocal-1.272
Token Congress
Token ID3162
Feature activation-7.56e-02

Pos logit contributions
horn+1.123
anon+1.115
+1.107
scape+1.072
ocal+1.054
Neg logit contributions
 rapp-1.092
hiba-1.080
 hospitality-1.079
 defe-1.033
 welf-1.029
Token that
Token ID326
Feature activation-2.92e-01

Pos logit contributions
horn+0.137
anon+0.136
+0.135
scape+0.131
ocal+0.128
Neg logit contributions
 rapp-0.134
 hospitality-0.132
hiba-0.132
 defe-0.126
 welf-0.126
Token
Token ID851
Feature activation+1.00e+00

Pos logit contributions
 rapp+1.352
 hospitality+1.343
hiba+1.332
 defe+1.275
 welf+1.264
Neg logit contributions
anon-1.342
horn-1.341
-1.334
scape-1.275
ocal-1.263
Token
Token ID564
Feature activation+7.67e-01

Pos logit contributions
 rapp+1.698
 hospitality+1.685
hiba+1.665
 defe+1.602
 welf+1.580
Neg logit contributions
anon-1.870
horn-1.867
-1.844
scape-1.788
ocal-1.767
Token
Token ID250
Feature activation+5.31e-01

Pos logit contributions
 rapp+1.969
 hospitality+1.956
hiba+1.938
 defe+1.910
 welf+1.874
Neg logit contributions
horn-0.762
anon-0.756
scape-0.703
-0.677
ocal-0.675
Tokent
Token ID83
Feature activation+1.06e+00

Pos logit contributions
 rapp+1.021
 hospitality+1.014
hiba+1.008
 defe+0.970
 welf+0.962
Neg logit contributions
horn-0.871
anon-0.867
-0.856
scape-0.828
ocal-0.813
Tokeniger
Token ID8254
Feature activation+2.20e-01

Pos logit contributions
 rapp+1.999
hiba+1.985
 hospitality+1.976
 defe+1.892
 welf+1.881
Neg logit contributions
horn-1.757
anon-1.747
-1.734
scape-1.670
ocal-1.630
Token salad
Token ID20698
Feature activation+2.40e+00

Pos logit contributions
 rapp+0.355
 hospitality+0.352
hiba+0.350
 defe+0.334
 welf+0.331
Neg logit contributions
horn-0.430
anon-0.430
-0.425
scape-0.413
ocal-0.407
Token
Token ID447
Feature activation+5.37e-01

Pos logit contributions
 rapp+4.218
 hospitality+4.196
hiba+4.145
 defe+4.005
 waitress+3.954
Neg logit contributions
anon-4.233
-4.212
horn-4.186
scape-3.984
ocal-3.948
Token
Token ID251
Feature activation-1.24e-01

Pos logit contributions
 rapp+1.228
 hospitality+1.225
hiba+1.215
 defe+1.193
azine+1.161
Neg logit contributions
anon-0.673
horn-0.672
scape-0.630
ocal-0.619
inding-0.601
Token
Token ID851
Feature activation+6.13e-01

Pos logit contributions
horn+0.218
anon+0.216
+0.214
scape+0.207
ocal+0.203
Neg logit contributions
 rapp-0.224
hiba-0.222
 hospitality-0.222
 defe-0.212
 welf-0.211
Token that
Token ID326
Feature activation+7.22e-02

Pos logit contributions
 rapp+1.126
 hospitality+1.114
hiba+1.110
 defe+1.066
 welf+1.053
Neg logit contributions
horn-1.061
anon-1.060
-1.046
scape-1.011
ocal-0.993
Token is
Token ID318
Feature activation+1.68e-01

Pos logit contributions
 rapp+0.135
hiba+0.134
 hospitality+0.134
 defe+0.128
 welf+0.127
Neg logit contributions
horn-0.123
anon-0.122
-0.121
scape-0.117
ocal-0.115
Token way
Token ID835
Feature activation-2.22e-01

Pos logit contributions
 rapp+0.137
 hospitality+0.135
hiba+0.132
 defe+0.129
 welf+0.127
Neg logit contributions
horn-0.154
anon-0.153
-0.152
scape-0.147
ocal-0.146
Token of
Token ID286
Feature activation-1.69e-01

Pos logit contributions
horn+0.371
anon+0.368
+0.365
scape+0.354
ocal+0.347
Neg logit contributions
 rapp-0.423
hiba-0.423
 hospitality-0.420
 defe-0.401
 welf-0.400
Token saying
Token ID2282
Feature activation+2.59e-01

Pos logit contributions
horn+0.325
anon+0.324
+0.321
scape+0.312
ocal+0.307
Neg logit contributions
hiba-0.277
 rapp-0.276
 hospitality-0.275
 defe-0.260
 welf-0.259
Token that
Token ID326
Feature activation-3.48e-01

Pos logit contributions
 rapp+0.433
hiba+0.431
 hospitality+0.428
 defe+0.408
 welf+0.407
Neg logit contributions
horn-0.491
anon-0.489
-0.484
scape-0.469
ocal-0.462
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
horn+0.649
anon+0.647
+0.639
scape+0.620
ocal+0.610
Neg logit contributions
 rapp-0.590
hiba-0.587
 hospitality-0.584
 defe-0.558
 welf-0.557
Token Padres
Token ID40188
Feature activation+2.40e+00

Pos logit contributions
horn+0.210
anon+0.209
+0.207
scape+0.201
ocal+0.198
Neg logit contributions
 rapp-0.193
hiba-0.191
 hospitality-0.191
 defe-0.182
 welf-0.181
Token had
Token ID550
Feature activation+3.34e-01

Pos logit contributions
 rapp+4.170
 hospitality+4.104
hiba+3.994
 defe+3.888
 welf+3.791
Neg logit contributions
horn-4.341
anon-4.318
-4.260
scape-4.094
ocal-4.052
Token no
Token ID645
Feature activation-5.40e-02

Pos logit contributions
 rapp+0.575
 hospitality+0.570
hiba+0.563
 defe+0.542
 welf+0.536
Neg logit contributions
horn-0.619
anon-0.615
-0.609
scape-0.589
ocal-0.582
Token need
Token ID761
Feature activation-1.16e+00

Pos logit contributions
horn+0.110
anon+0.109
+0.108
scape+0.105
ocal+0.104
Neg logit contributions
 rapp-0.084
 hospitality-0.083
hiba-0.081
 defe-0.078
 welf-0.077
Token at
Token ID379
Feature activation+1.63e-01

Pos logit contributions
horn+1.928
anon+1.907
+1.898
scape+1.828
ocal+1.796
Neg logit contributions
hiba-2.219
 rapp-2.206
 hospitality-2.191
 welf-2.098
 defe-2.088
Token all
Token ID477
Feature activation+2.32e-01

Pos logit contributions
hiba+0.262
 rapp+0.260
 hospitality+0.257
 defe+0.246
 welf+0.244
Neg logit contributions
horn-0.320
anon-0.319
-0.317
scape-0.306
ocal-0.303
Token bullying
Token ID20714
Feature activation+9.84e-01

Pos logit contributions
horn+0.573
anon+0.570
+0.564
scape+0.549
ocal+0.539
Neg logit contributions
 rapp-0.513
hiba-0.509
 hospitality-0.508
 defe-0.484
 welf-0.480
Token intervention
Token ID9572
Feature activation+9.15e-01

Pos logit contributions
 rapp+1.650
 hospitality+1.627
hiba+1.579
 defe+1.568
 welf+1.544
Neg logit contributions
horn-1.865
anon-1.853
-1.839
scape-1.768
ocal-1.760
Token programs
Token ID4056
Feature activation+1.19e-01

Pos logit contributions
 rapp+1.517
 hospitality+1.499
hiba+1.472
 defe+1.435
 welf+1.419
Neg logit contributions
horn-1.743
anon-1.742
-1.716
scape-1.662
ocal-1.632
Token?
Token ID30
Feature activation-4.55e-02

Pos logit contributions
 rapp+0.232
 hospitality+0.230
hiba+0.230
 defe+0.221
 welf+0.219
Neg logit contributions
horn-0.192
anon-0.191
-0.189
scape-0.182
ocal-0.179
Token A
Token ID317
Feature activation-9.42e-02

Pos logit contributions
horn+0.081
anon+0.081
+0.080
scape+0.077
ocal+0.076
Neg logit contributions
 rapp-0.080
 hospitality-0.080
hiba-0.079
 defe-0.076
 welf-0.075
Token meta
Token ID13634
Feature activation+2.40e+00

Pos logit contributions
horn+0.188
anon+0.187
+0.185
scape+0.180
ocal+0.177
Neg logit contributions
 rapp-0.149
 hospitality-0.147
hiba-0.147
 defe-0.140
 welf-0.139
Token-
Token ID12
Feature activation+3.11e-01

Pos logit contributions
 rapp+4.311
 hospitality+4.287
hiba+4.257
 defe+4.079
 welf+4.067
Neg logit contributions
horn-4.167
anon-4.125
-4.085
scape-3.954
ocal-3.896
Tokenanalysis
Token ID20930
Feature activation+2.95e-01

Pos logit contributions
 rapp+0.744
 hospitality+0.739
hiba+0.738
 defe+0.714
 welf+0.711
Neg logit contributions
horn-0.366
anon-0.363
-0.357
scape-0.340
ocal-0.331
Token of
Token ID286
Feature activation+1.63e-01

Pos logit contributions
 rapp+0.498
hiba+0.493
 hospitality+0.493
 defe+0.469
 welf+0.466
Neg logit contributions
horn-0.557
anon-0.554
-0.548
scape-0.533
ocal-0.524
Token intervention
Token ID9572
Feature activation+6.25e-01

Pos logit contributions
 rapp+0.284
 hospitality+0.281
hiba+0.281
 defe+0.268
 welf+0.266
Neg logit contributions
horn-0.298
anon-0.297
-0.294
scape-0.285
ocal-0.280
Token research
Token ID2267
Feature activation+3.48e-01

Pos logit contributions
 rapp+1.068
 hospitality+1.059
hiba+1.052
 defe+1.009
 welf+1.001
Neg logit contributions
horn-1.161
anon-1.156
-1.144
scape-1.108
ocal-1.089
Token six
Token ID2237
Feature activation+3.96e-01

Pos logit contributions
 rapp+0.448
hiba+0.444
 hospitality+0.444
 defe+0.422
 welf+0.420
Neg logit contributions
horn-0.492
anon-0.490
-0.485
scape-0.470
ocal-0.462
Token innings
Token ID16267
Feature activation+1.33e+00

Pos logit contributions
 rapp+0.484
 hospitality+0.478
hiba+0.472
 defe+0.445
 welf+0.440
Neg logit contributions
horn-0.929
anon-0.927
-0.919
scape-0.897
ocal-0.886
Token.
Token ID13
Feature activation+7.05e-02

Pos logit contributions
 rapp+2.470
 hospitality+2.433
hiba+2.410
 defe+2.314
azine+2.272
Neg logit contributions
anon-2.246
horn-2.235
-2.213
scape-2.125
ocal-2.098
Token The
Token ID383
Feature activation-2.12e-01

Pos logit contributions
 rapp+0.126
 hospitality+0.125
hiba+0.124
 defe+0.119
 welf+0.118
Neg logit contributions
horn-0.126
anon-0.125
-0.124
scape-0.120
ocal-0.118
Token 2014
Token ID1946
Feature activation-4.19e-01

Pos logit contributions
horn+0.439
anon+0.438
+0.434
scape+0.422
ocal+0.416
Neg logit contributions
 rapp-0.318
hiba-0.315
 hospitality-0.314
 defe-0.298
 welf-0.295
Token Padres
Token ID40188
Feature activation+2.39e+00

Pos logit contributions
anon+1.014
horn+1.014
+1.003
scape+0.980
ocal+0.967
Neg logit contributions
 rapp-0.479
hiba-0.477
 hospitality-0.472
 defe-0.441
 welf-0.439
Token obviously
Token ID6189
Feature activation+4.98e-01

Pos logit contributions
 rapp+4.102
 hospitality+4.027
hiba+3.968
 defe+3.852
 welf+3.735
Neg logit contributions
horn-4.393
anon-4.341
-4.304
scape-4.146
ocal-4.113
Token had
Token ID550
Feature activation+1.31e-02

Pos logit contributions
 rapp+0.884
 hospitality+0.870
hiba+0.869
 defe+0.833
 welf+0.821
Neg logit contributions
horn-0.901
anon-0.897
-0.888
scape-0.856
ocal-0.845
Token difficulties
Token ID13156
Feature activation+2.46e-01

Pos logit contributions
 rapp+0.023
 hospitality+0.022
hiba+0.022
 defe+0.021
 welf+0.021
Neg logit contributions
horn-0.024
anon-0.024
-0.024
scape-0.023
ocal-0.023
Token generating
Token ID15453
Feature activation-5.22e-01

Pos logit contributions
 rapp+0.432
 hospitality+0.427
hiba+0.425
 defe+0.408
 welf+0.402
Neg logit contributions
horn-0.448
anon-0.446
-0.441
scape-0.426
ocal-0.420
Token leads
Token ID5983
Feature activation-1.28e+00

Pos logit contributions
horn+0.982
anon+0.978
+0.967
scape+0.940
ocal+0.922
Neg logit contributions
 rapp-0.879
hiba-0.876
 hospitality-0.870
 defe-0.830
 welf-0.825
Token
Token ID250
Feature activation+2.57e-01

Pos logit contributions
 rapp+0.333
 hospitality+0.331
hiba+0.330
 defe+0.320
 welf+0.319
Neg logit contributions
horn-0.132
anon-0.131
-0.128
scape-0.121
ocal-0.117
Tokenw
Token ID86
Feature activation-2.33e-01

Pos logit contributions
 rapp+0.484
 hospitality+0.480
hiba+0.480
 defe+0.459
 welf+0.457
Neg logit contributions
horn-0.432
anon-0.429
-0.425
scape-0.410
ocal-0.403
Tokenelfare
Token ID27122
Feature activation-4.95e-01

Pos logit contributions
horn+0.385
anon+0.384
+0.378
scape+0.367
ocal+0.360
Neg logit contributions
 rapp-0.445
 hospitality-0.443
hiba-0.441
 defe-0.424
 welf-0.421
Token as
Token ID355
Feature activation-4.75e-02

Pos logit contributions
horn+0.801
anon+0.797
+0.792
scape+0.766
ocal+0.748
Neg logit contributions
hiba-0.962
 rapp-0.958
 hospitality-0.945
 defe-0.912
 welf-0.894
Token we
Token ID356
Feature activation+4.60e-01

Pos logit contributions
horn+0.088
anon+0.088
+0.087
scape+0.084
ocal+0.083
Neg logit contributions
 rapp-0.082
hiba-0.081
 hospitality-0.081
 defe-0.077
 welf-0.077
Token know
Token ID760
Feature activation+2.38e+00

Pos logit contributions
 rapp+0.584
 hospitality+0.577
hiba+0.576
 defe+0.540
 welf+0.535
Neg logit contributions
horn-1.060
anon-1.056
-1.046
scape-1.022
ocal-1.008
Token it
Token ID340
Feature activation+5.57e-01

Pos logit contributions
 rapp+3.799
 hospitality+3.777
hiba+3.670
 welf+3.650
 defe+3.608
Neg logit contributions
anon-4.616
horn-4.509
-4.439
scape-4.359
inding-4.251
Token,
Token ID11
Feature activation+2.59e-01

Pos logit contributions
 rapp+0.974
 hospitality+0.966
hiba+0.965
 defe+0.920
 welf+0.915
Neg logit contributions
horn-1.014
anon-1.009
-0.998
scape-0.968
ocal-0.952
Token
Token ID447
Feature activation-1.57e-01

Pos logit contributions
 rapp+0.445
 hospitality+0.441
hiba+0.440
 defe+0.420
 welf+0.418
Neg logit contributions
horn-0.480
anon-0.478
-0.473
scape-0.459
ocal-0.451
Token
Token ID251
Feature activation-2.43e-01

Pos logit contributions
horn+0.167
+0.167
anon+0.166
scape+0.154
ocal+0.150
Neg logit contributions
 rapp-0.394
hiba-0.391
 hospitality-0.391
 defe-0.377
 welf-0.377
Token while
Token ID981
Feature activation-6.14e-02

Pos logit contributions
horn+0.447
anon+0.445
+0.441
scape+0.428
ocal+0.420
Neg logit contributions
 rapp-0.419
 hospitality-0.416
hiba-0.416
 defe-0.396
 welf-0.393
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
 rapp+0.297
 hospitality+0.294
hiba+0.294
 defe+0.283
 welf+0.281
Neg logit contributions
horn-0.201
anon-0.199
-0.197
scape-0.191
ocal-0.185
Token c
Token ID269
Feature activation+4.35e-01

Pos logit contributions
 rapp+0.333
 hospitality+0.330
hiba+0.329
 defe+0.316
 welf+0.314
Neg logit contributions
horn-0.313
anon-0.311
-0.308
scape-0.297
ocal-0.293
Tokenv
Token ID85
Feature activation-3.46e-01

Pos logit contributions
 rapp+0.992
hiba+0.990
 hospitality+0.984
 defe+0.954
 welf+0.947
Neg logit contributions
horn-0.554
anon-0.551
-0.543
scape-0.514
ocal-0.503
Token2
Token ID17
Feature activation-4.18e-01

Pos logit contributions
horn+0.474
anon+0.472
+0.464
scape+0.448
ocal+0.437
Neg logit contributions
 rapp-0.764
 hospitality-0.757
hiba-0.755
 defe-0.729
 welf-0.726
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
horn+0.666
anon+0.660
+0.656
scape+0.624
ocal+0.616
Neg logit contributions
hiba-0.815
 rapp-0.812
 hospitality-0.812
 defe-0.777
azine-0.771
TokenCOLOR
Token ID46786
Feature activation-2.91e+00

Pos logit contributions
horn+0.211
anon+0.209
+0.207
scape+0.197
ocal+0.192
Neg logit contributions
 rapp-0.382
hiba-0.380
 hospitality-0.380
 defe-0.366
 welf-0.364
Token_
Token ID62
Feature activation-4.12e-01

Pos logit contributions
+3.798
anon+3.795
horn+3.785
scape+3.688
OWN+3.573
Neg logit contributions
 rapp-6.462
hiba-6.448
 hospitality-6.440
 welf-6.231
 defe-6.205
TokenB
Token ID33
Feature activation-8.64e-01

Pos logit contributions
horn+0.551
anon+0.548
+0.540
scape+0.517
ocal+0.505
Neg logit contributions
 rapp-0.920
 hospitality-0.913
hiba-0.912
 defe-0.880
 welf-0.875
TokenGR
Token ID10761
Feature activation-1.98e-03

Pos logit contributions
horn+1.153
anon+1.144
+1.123
scape+1.078
ocal+1.050
Neg logit contributions
 rapp-1.932
 hospitality-1.919
hiba-1.917
 defe-1.851
 welf-1.837
Token2
Token ID17
Feature activation-6.00e-01

Pos logit contributions
horn+0.003
anon+0.003
+0.003
scape+0.003
ocal+0.003
Neg logit contributions
 rapp-0.004
hiba-0.004
 hospitality-0.004
 defe-0.003
 welf-0.003
TokenG
Token ID38
Feature activation+1.00e-01

Pos logit contributions
horn+0.991
anon+0.987
+0.974
scape+0.942
ocal+0.924
Neg logit contributions
 rapp-1.152
 hospitality-1.143
hiba-1.141
 defe-1.094
 welf-1.088
Token't
Token ID470
Feature activation+3.17e-01

Pos logit contributions
 rapp+0.774
hiba+0.769
 hospitality+0.766
azine+0.739
hler+0.725
Neg logit contributions
-0.710
horn-0.696
anon-0.691
inding-0.665
ocal-0.664
Token realize
Token ID6537
Feature activation+7.31e-01

Pos logit contributions
 rapp+0.520
 hospitality+0.508
hiba+0.506
 defe+0.481
 welf+0.473
Neg logit contributions
horn-0.615
-0.615
anon-0.613
scape-0.587
ocal-0.585
Token they
Token ID484
Feature activation-3.76e-01

Pos logit contributions
 rapp+1.171
 hospitality+1.162
hiba+1.139
 defe+1.091
 welf+1.086
Neg logit contributions
horn-1.439
anon-1.438
-1.431
scape-1.380
ocal-1.369
Token're
Token ID821
Feature activation-4.27e-01

Pos logit contributions
horn+0.628
anon+0.625
+0.618
scape+0.597
ocal+0.586
Neg logit contributions
 rapp-0.714
 hospitality-0.708
hiba-0.707
 defe-0.677
 welf-0.673
Token being
Token ID852
Feature activation-4.85e-01

Pos logit contributions
horn+0.872
anon+0.871
+0.862
scape+0.837
ocal+0.826
Neg logit contributions
 rapp-0.652
 hospitality-0.645
hiba-0.640
 defe-0.609
 welf-0.602
Token videot
Token ID35026
Feature activation-2.85e+00

Pos logit contributions
anon+0.780
horn+0.777
+0.770
scape+0.736
ocal+0.727
Neg logit contributions
 rapp-0.958
 hospitality-0.948
hiba-0.936
 defe-0.905
 welf-0.891
Tokenaped
Token ID5813
Feature activation-2.26e+00

Pos logit contributions
horn+3.452
anon+3.437
+3.335
scape+3.212
ocal+3.151
Neg logit contributions
 rapp-6.750
 hospitality-6.707
hiba-6.680
 defe-6.493
 welf-6.471
Token with
Token ID351
Feature activation-3.10e-01

Pos logit contributions
horn+3.910
anon+3.809
scape+3.720
+3.661
ocal+3.552
Neg logit contributions
hiba-4.158
 rapp-4.051
 hospitality-4.034
 welf-3.986
 defe-3.899
Token their
Token ID511
Feature activation+1.47e-01

Pos logit contributions
horn+0.575
anon+0.573
+0.567
scape+0.550
ocal+0.541
Neg logit contributions
 rapp-0.534
 hospitality-0.529
hiba-0.527
 defe-0.503
 welf-0.500
Token knowledge
Token ID3725
Feature activation+8.77e-01

Pos logit contributions
 rapp+0.245
 hospitality+0.244
hiba+0.240
 defe+0.231
 welf+0.228
Neg logit contributions
horn-0.278
anon-0.278
-0.275
scape-0.268
ocal-0.262
Token?"
Token ID1701
Feature activation+5.42e-02

Pos logit contributions
 rapp+1.572
 hospitality+1.567
hiba+1.553
 defe+1.488
 welf+1.473
Neg logit contributions
anon-1.546
horn-1.543
-1.537
scape-1.472
ocal-1.460
Token.
Token ID13
Feature activation-3.17e-01

Pos logit contributions
 rapp+0.102
 hospitality+0.101
hiba+0.101
 defe+0.097
 welf+0.097
Neg logit contributions
horn-0.073
anon-0.073
-0.071
scape-0.069
ocal-0.068
Token in
Token ID287
Feature activation-1.09e-02

Pos logit contributions
horn+0.574
anon+0.572
+0.565
scape+0.549
ocal+0.539
Neg logit contributions
 rapp-0.555
hiba-0.551
 hospitality-0.550
 defe-0.525
 welf-0.522
Token fact
Token ID1109
Feature activation+3.38e-01

Pos logit contributions
horn+0.020
anon+0.020
+0.020
scape+0.019
ocal+0.019
Neg logit contributions
 rapp-0.019
 hospitality-0.019
hiba-0.018
 defe-0.018
 welf-0.017
Token had
Token ID550
Feature activation+1.05e-01

Pos logit contributions
 rapp+0.634
 hospitality+0.628
hiba+0.621
 defe+0.600
 welf+0.586
Neg logit contributions
horn-0.576
anon-0.573
-0.569
scape-0.544
ocal-0.536
Token two
Token ID734
Feature activation+5.58e-02

Pos logit contributions
 rapp+0.195
 hospitality+0.194
hiba+0.191
 defe+0.185
 welf+0.182
Neg logit contributions
horn-0.183
anon-0.182
-0.180
scape-0.173
ocal-0.170
Token videot
Token ID35026
Feature activation-2.80e+00

Pos logit contributions
 rapp+0.096
 hospitality+0.095
hiba+0.095
 defe+0.091
 welf+0.090
Neg logit contributions
horn-0.104
anon-0.103
-0.102
scape-0.099
ocal-0.097
Tokenapes
Token ID7916
Feature activation-2.12e+00

Pos logit contributions
anon+1.615
horn+1.529
+1.369
scape+1.346
ocal+1.310
Neg logit contributions
 rapp-8.421
 hospitality-8.409
hiba-8.301
 welf-8.231
 defe-8.197
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
horn+3.516
anon+3.435
+3.328
scape+3.310
ocal+3.244
Neg logit contributions
hiba-4.015
 rapp-3.995
 hospitality-3.950
 welf-3.857
 defe-3.809
Token one
Token ID530
Feature activation+1.22e-01

Pos logit contributions
 rapp+0.448
 hospitality+0.446
hiba+0.437
 defe+0.422
 welf+0.415
Neg logit contributions
horn-0.451
anon-0.447
-0.445
scape-0.426
ocal-0.421
Token aud
Token ID2709
Feature activation-6.00e-01

Pos logit contributions
 rapp+0.228
 hospitality+0.226
hiba+0.224
 defe+0.215
 welf+0.213
Neg logit contributions
horn-0.208
anon-0.207
-0.206
scape-0.197
ocal-0.194
Tokeniot
Token ID5151
Feature activation+5.23e-01

Pos logit contributions
+1.196
horn+1.196
anon+1.185
scape+1.136
OWN+1.124
Neg logit contributions
 rapp-0.947
hiba-0.938
 hospitality-0.929
azine-0.890
 defe-0.878
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
 rapp+1.660
 hospitality+1.647
hiba+1.645
 defe+1.582
 welf+1.573
Neg logit contributions
horn-1.211
anon-1.205
-1.189
scape-1.145
ocal-1.121
Token government
Token ID1230
Feature activation-2.46e-01

Pos logit contributions
 rapp+0.431
 hospitality+0.429
hiba+0.424
 defe+0.406
 welf+0.404
Neg logit contributions
horn-0.475
anon-0.472
-0.467
scape-0.454
ocal-0.445
Token to
Token ID284
Feature activation-2.05e-02

Pos logit contributions
horn+0.474
anon+0.472
+0.467
scape+0.453
ocal+0.446
Neg logit contributions
 rapp-0.407
 hospitality-0.404
hiba-0.402
 defe-0.383
 welf-0.381
Token take
Token ID1011
Feature activation-2.89e-01

Pos logit contributions
horn+0.043
anon+0.042
+0.042
scape+0.041
ocal+0.040
Neg logit contributions
 rapp-0.031
 hospitality-0.030
hiba-0.030
 defe-0.029
 welf-0.028
Token this
Token ID428
Feature activation-9.54e-01

Pos logit contributions
horn+0.506
anon+0.503
+0.499
scape+0.482
ocal+0.473
Neg logit contributions
 rapp-0.530
 hospitality-0.528
hiba-0.519
 defe-0.501
 welf-0.499
Token video
Token ID2008
Feature activation-2.80e+00

Pos logit contributions
horn+1.791
anon+1.784
+1.767
scape+1.712
ocal+1.685
Neg logit contributions
 rapp-1.617
 hospitality-1.602
hiba-1.595
 defe-1.524
 welf-1.514
Token down
Token ID866
Feature activation+9.92e-02

Pos logit contributions
horn+5.647
anon+5.624
+5.493
scape+5.465
ocal+5.308
Neg logit contributions
hiba-4.258
 rapp-4.199
 hospitality-4.100
 welf-3.936
azine-3.920
Token as
Token ID355
Feature activation-3.49e-01

Pos logit contributions
 rapp+0.184
 hospitality+0.182
hiba+0.181
 defe+0.174
 welf+0.172
Neg logit contributions
horn-0.171
anon-0.170
-0.168
scape-0.163
ocal-0.160
Token it
Token ID340
Feature activation+4.49e-01

Pos logit contributions
horn+0.669
anon+0.666
+0.659
scape+0.640
ocal+0.630
Neg logit contributions
 rapp-0.577
 hospitality-0.573
hiba-0.570
 defe-0.544
 welf-0.540
Token is
Token ID318
Feature activation-9.61e-02

Pos logit contributions
 rapp+0.829
 hospitality+0.822
hiba+0.814
 defe+0.783
 welf+0.770
Neg logit contributions
horn-0.775
anon-0.770
-0.765
scape-0.735
ocal-0.724
Token dem
Token ID1357
Feature activation+2.30e-01

Pos logit contributions
horn+0.184
anon+0.184
+0.182
scape+0.176
ocal+0.173
Neg logit contributions
 rapp-0.159
 hospitality-0.158
hiba-0.156
 defe-0.150
 welf-0.148
Token June
Token ID2795
Feature activation-9.45e-02

Pos logit contributions
 rapp+0.190
 hospitality+0.188
hiba+0.187
 defe+0.179
 welf+0.177
Neg logit contributions
horn-0.180
anon-0.180
-0.177
scape-0.172
ocal-0.168
Token 2
Token ID362
Feature activation-9.74e-01

Pos logit contributions
horn+0.143
anon+0.142
+0.142
scape+0.136
ocal+0.133
Neg logit contributions
 rapp-0.195
 hospitality-0.192
hiba-0.191
 defe-0.184
azine-0.182
Token-
Token ID12
Feature activation-2.45e-02

Pos logit contributions
horn+1.632
anon+1.600
+1.589
scape+1.541
ocal+1.523
Neg logit contributions
hiba-1.836
 hospitality-1.830
 rapp-1.821
 welf-1.768
 defe-1.758
Token3
Token ID18
Feature activation-4.80e-01

Pos logit contributions
horn+0.032
anon+0.031
+0.031
scape+0.030
ocal+0.029
Neg logit contributions
 rapp-0.056
 hospitality-0.055
hiba-0.055
 defe-0.053
 welf-0.053
Token in
Token ID287
Feature activation-2.57e-01

Pos logit contributions
horn+0.818
anon+0.810
+0.800
scape+0.778
ocal+0.765
Neg logit contributions
 rapp-0.894
hiba-0.891
 hospitality-0.888
 welf-0.853
 defe-0.852
Token Dayton
Token ID24450
Feature activation-2.77e+00

Pos logit contributions
horn+0.457
anon+0.454
+0.449
scape+0.435
ocal+0.428
Neg logit contributions
 rapp-0.460
hiba-0.458
 hospitality-0.455
 defe-0.437
 welf-0.435
Token,
Token ID11
Feature activation-5.55e-01

Pos logit contributions
horn+4.154
anon+4.105
+4.088
ocal+3.948
scape+3.944
Neg logit contributions
hiba-5.509
 rapp-5.480
 hospitality-5.440
 welf-5.367
 defe-5.286
Token Ohio
Token ID6835
Feature activation-1.30e+00

Pos logit contributions
horn+0.769
anon+0.762
+0.753
scape+0.722
ocal+0.707
Neg logit contributions
 rapp-1.211
hiba-1.206
 hospitality-1.202
 defe-1.159
 welf-1.156
Token at
Token ID379
Feature activation+3.47e-01

Pos logit contributions
horn+1.986
anon+1.980
+1.966
scape+1.897
ocal+1.875
Neg logit contributions
 rapp-2.604
hiba-2.600
 hospitality-2.585
 welf-2.509
 defe-2.501
Token a
Token ID257
Feature activation+2.53e-01

Pos logit contributions
 rapp+0.591
hiba+0.585
 hospitality+0.585
 defe+0.558
 welf+0.554
Neg logit contributions
horn-0.649
anon-0.645
-0.639
scape-0.620
ocal-0.610
Token mock
Token ID15290
Feature activation-3.51e-02

Pos logit contributions
 rapp+0.414
 hospitality+0.410
hiba+0.409
 defe+0.389
 welf+0.386
Neg logit contributions
horn-0.489
anon-0.487
-0.482
scape-0.469
ocal-0.461
Token Take
Token ID7214
Feature activation-5.86e-01

Pos logit contributions
 rapp+0.175
 hospitality+0.173
hiba+0.173
 defe+0.165
 welf+0.165
Neg logit contributions
horn-0.157
anon-0.157
-0.155
scape-0.150
ocal-0.147
Token a
Token ID257
Feature activation-6.29e-01

Pos logit contributions
horn+1.028
anon+1.024
+1.012
scape+0.980
ocal+0.962
Neg logit contributions
 rapp-1.066
 hospitality-1.056
hiba-1.056
 defe-1.010
 welf-1.003
Token look
Token ID804
Feature activation-4.25e-01

Pos logit contributions
horn+1.259
anon+1.251
+1.238
scape+1.209
ocal+1.187
Neg logit contributions
 rapp-0.981
hiba-0.979
 hospitality-0.971
 defe-0.923
 welf-0.914
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
horn+0.748
anon+0.743
+0.733
scape+0.712
ocal+0.698
Neg logit contributions
 rapp-0.771
 hospitality-0.766
hiba-0.764
 defe-0.732
 welf-0.728
Token incredible
Token ID8082
Feature activation-3.49e-01

Pos logit contributions
horn+0.467
anon+0.465
+0.461
scape+0.447
ocal+0.440
Neg logit contributions
 rapp-0.411
 hospitality-0.407
hiba-0.406
 defe-0.387
 welf-0.384
Token footage
Token ID9640
Feature activation-2.76e+00

Pos logit contributions
horn+0.639
anon+0.636
+0.629
scape+0.611
ocal+0.599
Neg logit contributions
 rapp-0.605
hiba-0.600
 hospitality-0.598
 defe-0.572
 welf-0.567
Token below
Token ID2174
Feature activation-5.79e-01

Pos logit contributions
horn+4.670
anon+4.570
scape+4.467
+4.440
ocal+4.223
Neg logit contributions
hiba-5.090
 rapp-5.081
 hospitality-5.038
 defe-4.857
 welf-4.854
Token:
Token ID25
Feature activation-7.37e-01

Pos logit contributions
horn+0.960
anon+0.955
+0.944
scape+0.913
ocal+0.895
Neg logit contributions
 rapp-1.107
 hospitality-1.098
hiba-1.097
 defe-1.052
 welf-1.045
Token<|endoftext|>
Token ID50256
Feature activation+6.87e-01

Pos logit contributions
horn+1.205
anon+1.197
+1.177
scape+1.139
ocal+1.114
Neg logit contributions
 rapp-1.433
hiba-1.424
 hospitality-1.424
 defe-1.369
 welf-1.358
TokenWith
Token ID3152
Feature activation+3.85e-01

Pos logit contributions
 rapp+1.291
hiba+1.287
 hospitality+1.278
 defe+1.226
azine+1.218
Neg logit contributions
horn-1.138
-1.129
anon-1.124
scape-1.081
ocal-1.068
Token a
Token ID257
Feature activation+6.31e-01

Pos logit contributions
 rapp+0.648
 hospitality+0.643
hiba+0.640
 defe+0.610
 welf+0.606
Neg logit contributions
horn-0.726
anon-0.724
-0.714
scape-0.694
ocal-0.683
Token Sylvia
Token ID45425
Feature activation+1.54e+00

Pos logit contributions
horn+0.269
anon+0.267
+0.264
scape+0.258
ocal+0.253
Neg logit contributions
 rapp-0.216
hiba-0.215
 hospitality-0.213
 defe-0.203
 welf-0.202
Token's
Token ID338
Feature activation+1.35e-01

Pos logit contributions
 rapp+3.230
 hospitality+3.223
hiba+3.178
 defe+3.080
 welf+3.051
Neg logit contributions
horn-2.253
anon-2.236
-2.222
scape-2.124
ocal-2.077
Token locations
Token ID7064
Feature activation-7.75e-02

Pos logit contributions
 rapp+0.184
 hospitality+0.182
hiba+0.181
 defe+0.171
 welf+0.169
Neg logit contributions
horn-0.297
anon-0.296
-0.293
scape-0.286
ocal-0.282
Token in
Token ID287
Feature activation-1.33e-01

Pos logit contributions
horn+0.135
anon+0.134
+0.132
scape+0.128
ocal+0.126
Neg logit contributions
 rapp-0.142
 hospitality-0.141
hiba-0.141
 defe-0.135
 welf-0.134
Token her
Token ID607
Feature activation+5.32e-02

Pos logit contributions
horn+0.244
anon+0.243
+0.240
scape+0.233
ocal+0.229
Neg logit contributions
 rapp-0.231
hiba-0.229
 hospitality-0.228
 defe-0.218
 welf-0.217
Token stead
Token ID7978
Feature activation-2.74e+00

Pos logit contributions
 rapp+0.081
 hospitality+0.081
hiba+0.080
 defe+0.076
 welf+0.075
Neg logit contributions
horn-0.109
anon-0.109
-0.108
scape-0.105
ocal-0.103
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
horn+4.055
anon+3.954
+3.904
scape+3.869
ocal+3.741
Neg logit contributions
hiba-5.501
 rapp-5.497
 hospitality-5.458
 welf-5.417
 defe-5.342
Token Today
Token ID6288
Feature activation-7.50e-01

Pos logit contributions
horn+0.337
anon+0.336
+0.333
scape+0.322
ocal+0.317
Neg logit contributions
 rapp-0.338
 hospitality-0.336
hiba-0.334
 defe-0.318
 welf-0.316
Token,
Token ID11
Feature activation-3.75e-01

Pos logit contributions
horn+1.266
anon+1.252
+1.243
scape+1.209
ocal+1.183
Neg logit contributions
 rapp-1.409
hiba-1.399
 hospitality-1.389
 defe-1.335
 welf-1.335
Token Sylvia
Token ID45425
Feature activation+1.11e+00

Pos logit contributions
horn+0.714
anon+0.709
+0.703
scape+0.683
ocal+0.672
Neg logit contributions
 rapp-0.624
hiba-0.618
 hospitality-0.616
 defe-0.588
 welf-0.585
Token has
Token ID468
Feature activation-3.04e-01

Pos logit contributions
 rapp+2.388
 hospitality+2.373
hiba+2.365
 defe+2.281
 welf+2.266
Neg logit contributions
horn-1.561
anon-1.553
-1.533
scape-1.470
ocal-1.438
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
horn+0.294
anon+0.293
+0.290
scape+0.282
ocal+0.277
Neg logit contributions
 rapp-0.267
hiba-0.265
 hospitality-0.264
 defe-0.252
 welf-0.251
Token hundreds
Token ID5179
Feature activation+1.18e+00

Pos logit contributions
 rapp+0.185
 hospitality+0.183
hiba+0.182
 defe+0.174
 welf+0.172
Neg logit contributions
horn-0.194
anon-0.193
-0.191
scape-0.185
ocal-0.182
Token of
Token ID286
Feature activation+7.15e-02

Pos logit contributions
 rapp+2.086
 hospitality+2.064
hiba+2.055
 defe+1.962
 welf+1.947
Neg logit contributions
anon-2.133
horn-2.132
-2.106
scape-2.036
ocal-2.006
Token hours
Token ID2250
Feature activation+2.42e-01

Pos logit contributions
 rapp+0.121
 hospitality+0.120
hiba+0.119
 defe+0.114
 welf+0.112
Neg logit contributions
horn-0.135
anon-0.135
-0.133
scape-0.129
ocal-0.127
Token of
Token ID286
Feature activation-1.32e-01

Pos logit contributions
 rapp+0.429
 hospitality+0.426
hiba+0.424
 defe+0.404
 welf+0.402
Neg logit contributions
horn-0.434
anon-0.434
-0.429
scape-0.414
ocal-0.408
Token videot
Token ID35026
Feature activation-2.73e+00

Pos logit contributions
horn+0.235
anon+0.234
+0.232
scape+0.224
ocal+0.220
Neg logit contributions
 rapp-0.237
 hospitality-0.235
hiba-0.234
 defe-0.224
 welf-0.222
Tokenape
Token ID1758
Feature activation-2.00e+00

Pos logit contributions
anon+2.720
horn+2.641
+2.536
scape+2.496
ocal+2.447
Neg logit contributions
 rapp-7.065
 hospitality-7.001
hiba-6.914
 welf-6.831
 defe-6.820
Token of
Token ID286
Feature activation-3.44e-01

Pos logit contributions
horn+3.368
anon+3.322
+3.247
scape+3.219
ocal+3.109
Neg logit contributions
 rapp-3.717
hiba-3.677
 hospitality-3.629
 welf-3.509
 defe-3.502
Token Abu
Token ID13098
Feature activation-1.41e+00

Pos logit contributions
horn+0.675
anon+0.673
+0.666
scape+0.649
ocal+0.637
Neg logit contributions
hiba-0.545
 rapp-0.544
 hospitality-0.538
 welf-0.514
 defe-0.514
Token Zub
Token ID47828
Feature activation-2.48e+00

Pos logit contributions
horn+1.880
anon+1.854
+1.808
scape+1.744
ocal+1.709
Neg logit contributions
 rapp-3.115
hiba-3.102
 hospitality-3.098
 welf-3.019
 defe-3.002
Tokenay
Token ID323
Feature activation-6.92e-01

Pos logit contributions
horn+2.480
anon+2.454
+2.343
scape+2.276
ocal+2.189
Neg logit contributions
 rapp-6.357
 hospitality-6.342
hiba-6.289
 defe-6.180
 welf-6.165
Token claims
Token ID3667
Feature activation+5.15e-01

Pos logit contributions
 rapp+0.332
 hospitality+0.330
hiba+0.325
 defe+0.316
 welf+0.312
Neg logit contributions
horn-0.319
anon-0.318
-0.314
scape-0.304
ocal-0.299
Token filed
Token ID5717
Feature activation-7.54e-01

Pos logit contributions
 rapp+0.961
 hospitality+0.952
hiba+0.945
 defe+0.914
 welf+0.899
Neg logit contributions
horn-0.876
anon-0.873
-0.862
scape-0.834
ocal-0.817
Token by
Token ID416
Feature activation-2.13e-01

Pos logit contributions
horn+1.240
anon+1.237
+1.219
scape+1.182
ocal+1.159
Neg logit contributions
 rapp-1.445
hiba-1.440
 hospitality-1.431
 welf-1.371
 defe-1.368
Token communities
Token ID5348
Feature activation-4.14e-01

Pos logit contributions
horn+0.409
anon+0.407
+0.404
scape+0.392
ocal+0.385
Neg logit contributions
 rapp-0.353
hiba-0.350
 hospitality-0.348
 defe-0.332
 welf-0.330
Token of
Token ID286
Feature activation-1.98e-01

Pos logit contributions
horn+0.714
anon+0.710
+0.702
scape+0.680
ocal+0.668
Neg logit contributions
 rapp-0.766
hiba-0.760
 hospitality-0.759
 defe-0.726
 welf-0.723
Token color
Token ID3124
Feature activation-2.73e+00

Pos logit contributions
horn+0.292
anon+0.291
+0.287
scape+0.276
ocal+0.270
Neg logit contributions
 rapp-0.414
 hospitality-0.412
hiba-0.410
 defe-0.395
 welf-0.393
Token under
Token ID739
Feature activation-4.20e-01

Pos logit contributions
horn+4.232
anon+4.116
+4.072
ocal+3.997
scape+3.940
Neg logit contributions
hiba-5.449
 rapp-5.277
 hospitality-5.248
 welf-5.159
 defe-5.075
Token Title
Token ID11851
Feature activation+1.15e+00

Pos logit contributions
horn+0.793
anon+0.789
+0.788
scape+0.759
ocal+0.750
Neg logit contributions
 rapp-0.705
hiba-0.704
 hospitality-0.694
 defe-0.664
 welf-0.661
Token VI
Token ID13889
Feature activation-9.94e-02

Pos logit contributions
 rapp+2.015
 hospitality+2.001
hiba+1.923
azine+1.887
 defe+1.874
Neg logit contributions
horn-2.045
anon-2.031
-2.015
scape-1.961
ocal-1.932
Token of
Token ID286
Feature activation-1.98e-01

Pos logit contributions
horn+0.173
anon+0.172
+0.171
scape+0.165
ocal+0.162
Neg logit contributions
 rapp-0.182
hiba-0.180
 hospitality-0.180
 defe-0.172
 welf-0.171
Token the
Token ID262
Feature activation-2.49e-01

Pos logit contributions
horn+0.388
anon+0.387
+0.384
scape+0.372
ocal+0.367
Neg logit contributions
 rapp-0.321
hiba-0.318
 hospitality-0.317
 defe-0.301
 welf-0.300
TokenAudio
Token ID21206
Feature activation-1.61e+00

Pos logit contributions
horn+0.238
anon+0.238
+0.234
scape+0.228
ocal+0.224
Neg logit contributions
 rapp-0.233
 hospitality-0.231
hiba-0.230
 defe-0.220
 welf-0.219
Token evidence
Token ID2370
Feature activation-6.86e-01

Pos logit contributions
horn+2.780
anon+2.769
+2.724
scape+2.648
ocal+2.599
Neg logit contributions
 rapp-2.950
hiba-2.929
 hospitality-2.914
 defe-2.796
 welf-2.782
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
horn+1.191
anon+1.186
+1.172
scape+1.135
ocal+1.114
Neg logit contributions
 rapp-1.259
 hospitality-1.248
hiba-1.247
 defe-1.193
 welf-1.185
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
+0.369
horn+0.365
anon+0.364
scape+0.348
ocal+0.347
Neg logit contributions
 rapp-0.382
 hospitality-0.380
hiba-0.379
azine-0.361
 defe-0.358
TokenThe
Token ID464
Feature activation+7.97e-02

Pos logit contributions
horn+0.201
anon+0.199
+0.198
scape+0.191
ocal+0.187
Neg logit contributions
 rapp-0.223
hiba-0.222
 hospitality-0.221
 defe-0.212
 welf-0.210
Token FIR
Token ID23703
Feature activation-2.70e+00

Pos logit contributions
 rapp+0.143
 hospitality+0.142
hiba+0.141
 defe+0.135
 welf+0.134
Neg logit contributions
horn-0.142
anon-0.142
-0.140
scape-0.136
ocal-0.133
Token further
Token ID2252
Feature activation-2.17e-01

Pos logit contributions
anon+4.332
horn+4.294
+4.236
scape+4.105
ocal+4.040
Neg logit contributions
hiba-5.245
 rapp-5.196
 hospitality-5.164
 welf-4.960
 defe-4.926
Token quotes
Token ID13386
Feature activation+1.81e-01

Pos logit contributions
horn+0.409
anon+0.406
+0.404
scape+0.391
ocal+0.383
Neg logit contributions
 rapp-0.368
 hospitality-0.364
hiba-0.363
 defe-0.347
 welf-0.343
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
 rapp+0.328
 hospitality+0.326
hiba+0.322
 defe+0.309
 welf+0.307
Neg logit contributions
horn-0.320
anon-0.318
-0.315
scape-0.306
ocal-0.299
Token contents
Token ID10154
Feature activation+8.53e-01

Pos logit contributions
 rapp+0.229
 hospitality+0.227
hiba+0.225
 defe+0.216
 welf+0.215
Neg logit contributions
horn-0.231
anon-0.230
-0.227
scape-0.221
ocal-0.216
Token of
Token ID286
Feature activation-2.64e-02

Pos logit contributions
 rapp+1.519
 hospitality+1.506
hiba+1.502
 defe+1.436
 welf+1.427
Neg logit contributions
horn-1.519
anon-1.513
-1.496
scape-1.449
ocal-1.422
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
+0.488
horn+0.482
anon+0.482
ocal+0.460
scape+0.459
Neg logit contributions
 rapp-0.501
 hospitality-0.497
hiba-0.496
azine-0.473
 defe-0.467
TokenT
Token ID51
Feature activation+2.37e-01

Pos logit contributions
horn+0.371
anon+0.369
+0.365
scape+0.353
ocal+0.348
Neg logit contributions
 rapp-0.388
 hospitality-0.385
hiba-0.384
 defe-0.368
 welf-0.365
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 rapp+0.465
hiba+0.462
 hospitality+0.461
 defe+0.442
 welf+0.439
Neg logit contributions
horn-0.382
anon-0.378
-0.376
scape-0.361
ocal-0.354
TokenJ
Token ID41
Feature activation+1.85e-02

Pos logit contributions
 rapp+0.343
 hospitality+0.341
hiba+0.340
 defe+0.329
 welf+0.326
Neg logit contributions
horn-0.201
anon-0.199
-0.196
scape-0.187
ocal-0.184
Token.
Token ID13
Feature activation-5.40e-01

Pos logit contributions
 rapp+0.037
hiba+0.036
 hospitality+0.036
azine+0.035
 defe+0.035
Neg logit contributions
horn-0.029
anon-0.029
-0.029
scape-0.028
ocal-0.027
Token Osh
Token ID46614
Feature activation-2.69e+00

Pos logit contributions
horn+0.840
anon+0.836
+0.824
scape+0.796
ocal+0.774
Neg logit contributions
 rapp-1.094
 hospitality-1.085
hiba-1.084
 defe-1.044
 welf-1.038
Tokenie
Token ID494
Feature activation+6.74e-02

Pos logit contributions
anon+3.085
horn+3.045
+2.921
scape+2.807
ocal+2.725
Neg logit contributions
 rapp-6.551
 hospitality-6.481
hiba-6.435
 welf-6.398
 defe-6.310
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
 rapp+0.129
 hospitality+0.128
hiba+0.128
 defe+0.123
 welf+0.122
Neg logit contributions
horn-0.111
anon-0.111
-0.110
scape-0.106
ocal-0.104
Token
Token ID198
Feature activation-1.59e-01

Pos logit contributions
+0.410
horn+0.405
anon+0.405
ocal+0.387
scape+0.385
Neg logit contributions
 rapp-0.418
 hospitality-0.416
hiba-0.414
azine-0.395
 defe-0.391
TokenNick
Token ID23609
Feature activation+1.98e-01

Pos logit contributions
horn+0.275
anon+0.273
+0.271
scape+0.262
ocal+0.258
Neg logit contributions
 rapp-0.291
 hospitality-0.288
hiba-0.288
 defe-0.276
 welf-0.273
Tokenlas
Token ID21921
Feature activation+5.48e-01

Pos logit contributions
 rapp+0.330
 hospitality+0.327
hiba+0.326
 defe+0.310
 welf+0.307
Neg logit contributions
horn-0.375
anon-0.374
-0.370
scape-0.360
ocal-0.354
Token spend
Token ID4341
Feature activation-5.32e-01

Pos logit contributions
horn+0.482
anon+0.480
+0.475
scape+0.461
ocal+0.453
Neg logit contributions
 rapp-0.421
 hospitality-0.416
hiba-0.415
 defe-0.396
 welf-0.391
Token the
Token ID262
Feature activation-4.43e-01

Pos logit contributions
horn+0.995
anon+0.990
+0.977
scape+0.951
ocal+0.931
Neg logit contributions
 rapp-0.903
hiba-0.897
 hospitality-0.893
 defe-0.854
 welf-0.846
Token weekend
Token ID5041
Feature activation+2.42e-01

Pos logit contributions
horn+0.908
anon+0.906
+0.895
scape+0.871
ocal+0.857
Neg logit contributions
 rapp-0.671
hiba-0.668
 hospitality-0.661
 defe-0.629
 welf-0.624
Token watching
Token ID4964
Feature activation-6.66e-01

Pos logit contributions
 rapp+0.426
 hospitality+0.420
hiba+0.416
 defe+0.399
 welf+0.396
Neg logit contributions
horn-0.442
anon-0.441
-0.438
scape-0.421
ocal-0.416
Token this
Token ID428
Feature activation-3.77e-01

Pos logit contributions
horn+1.234
anon+1.225
+1.211
scape+1.179
ocal+1.155
Neg logit contributions
 rapp-1.139
hiba-1.136
 hospitality-1.131
 defe-1.080
 welf-1.073
Token video
Token ID2008
Feature activation-2.66e+00

Pos logit contributions
horn+0.724
anon+0.721
+0.712
scape+0.694
ocal+0.681
Neg logit contributions
 rapp-0.621
hiba-0.616
 hospitality-0.615
 defe-0.585
 welf-0.581
Token of
Token ID286
Feature activation-8.00e-01

Pos logit contributions
horn+4.306
anon+4.264
scape+4.135
+4.095
ocal+3.946
Neg logit contributions
hiba-5.089
 rapp-4.987
 hospitality-4.984
 defe-4.800
 welf-4.799
Token a
Token ID257
Feature activation-3.81e-01

Pos logit contributions
horn+1.452
anon+1.445
+1.427
scape+1.384
ocal+1.356
Neg logit contributions
hiba-1.392
 rapp-1.385
 hospitality-1.380
 defe-1.319
 welf-1.311
Token traffic
Token ID4979
Feature activation-1.73e-01

Pos logit contributions
horn+0.769
anon+0.764
+0.755
scape+0.736
ocal+0.723
Neg logit contributions
 rapp-0.590
hiba-0.590
 hospitality-0.586
 defe-0.555
 welf-0.552
Token policeman
Token ID32166
Feature activation-6.30e-01

Pos logit contributions
horn+0.355
anon+0.352
+0.349
scape+0.341
ocal+0.333
Neg logit contributions
 rapp-0.265
hiba-0.263
 hospitality-0.263
 defe-0.248
 welf-0.247
Token helping
Token ID5742
Feature activation-3.34e-01

Pos logit contributions
horn+1.265
anon+1.256
+1.232
scape+1.217
ocal+1.187
Neg logit contributions
hiba-0.979
 hospitality-0.972
 rapp-0.962
 welf-0.919
 defe-0.913
Token,
Token ID11
Feature activation-4.50e-01

Pos logit contributions
horn+1.434
anon+1.417
+1.408
scape+1.358
ocal+1.340
Neg logit contributions
hiba-1.665
 rapp-1.663
 hospitality-1.650
 welf-1.586
 defe-1.573
Token T
Token ID309
Feature activation+8.46e-02

Pos logit contributions
horn+0.810
anon+0.806
+0.797
scape+0.772
ocal+0.758
Neg logit contributions
 rapp-0.796
hiba-0.790
 hospitality-0.788
 defe-0.753
 welf-0.750
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
hiba+0.168
 rapp+0.168
 hospitality+0.167
azine+0.160
 defe+0.159
Neg logit contributions
horn-0.133
anon-0.132
-0.131
scape-0.126
ocal-0.122
TokenJ
Token ID41
Feature activation-3.43e-01

Pos logit contributions
horn+0.139
anon+0.138
+0.137
scape+0.130
ocal+0.129
Neg logit contributions
 rapp-0.234
 hospitality-0.233
hiba-0.231
 defe-0.222
azine-0.221
Token.
Token ID13
Feature activation-6.71e-01

Pos logit contributions
horn+0.554
+0.552
anon+0.550
scape+0.528
ocal+0.519
Neg logit contributions
 rapp-0.663
 hospitality-0.657
hiba-0.654
azine-0.626
 defe-0.616
Token Osh
Token ID46614
Feature activation-2.64e+00

Pos logit contributions
horn+0.758
anon+0.754
+0.749
scape+0.709
ocal+0.691
Neg logit contributions
 rapp-1.627
 hospitality-1.617
hiba-1.610
 defe-1.553
azine-1.546
Tokenie
Token ID494
Feature activation-6.98e-02

Pos logit contributions
horn+3.070
anon+3.064
+2.919
scape+2.823
ocal+2.780
Neg logit contributions
 rapp-6.371
 hospitality-6.309
 welf-6.285
hiba-6.217
 defe-6.148
Token and
Token ID290
Feature activation-1.76e-01

Pos logit contributions
horn+0.119
anon+0.118
+0.117
scape+0.113
ocal+0.111
Neg logit contributions
 rapp-0.130
hiba-0.130
 hospitality-0.130
 welf-0.124
 defe-0.124
Token Nick
Token ID8047
Feature activation+2.07e-01

Pos logit contributions
horn+0.333
anon+0.332
+0.328
scape+0.319
ocal+0.314
Neg logit contributions
 rapp-0.295
 hospitality-0.293
hiba-0.292
 defe-0.278
 welf-0.276
Tokenlas
Token ID21921
Feature activation+5.34e-01

Pos logit contributions
 hospitality+0.255
 rapp+0.254
hiba+0.254
azine+0.238
hler+0.236
Neg logit contributions
anon-0.475
-0.474
horn-0.474
scape-0.461
ocal-0.455
Token Back
Token ID5157
Feature activation+1.98e-01

Pos logit contributions
 rapp+0.914
 hospitality+0.908
hiba+0.883
azine+0.857
 defe+0.845
Neg logit contributions
-0.983
anon-0.981
horn-0.971
scape-0.945
ocal-0.937
Token Scroll
Token ID17428
Feature activation+3.08e-02

Pos logit contributions
 rapp+0.358
 hospitality+0.353
hiba+0.352
 defe+0.338
 welf+0.336
Neg logit contributions
horn-0.336
anon-0.334
-0.331
scape-0.319
ocal-0.313
Token down
Token ID866
Feature activation-3.87e-01

Pos logit contributions
 rapp+0.049
 hospitality+0.048
hiba+0.048
 defe+0.046
 welf+0.045
Neg logit contributions
horn-0.061
anon-0.061
-0.061
scape-0.059
ocal-0.058
Token and
Token ID290
Feature activation+9.10e-02

Pos logit contributions
horn+0.690
anon+0.687
+0.679
scape+0.659
ocal+0.647
Neg logit contributions
 rapp-0.688
hiba-0.683
 hospitality-0.682
 defe-0.651
 welf-0.648
Token click
Token ID3904
Feature activation-2.94e-01

Pos logit contributions
 rapp+0.154
 hospitality+0.152
hiba+0.152
 defe+0.145
 welf+0.144
Neg logit contributions
horn-0.171
anon-0.171
-0.169
scape-0.164
ocal-0.161
Token on
Token ID319
Feature activation-4.76e-01

Pos logit contributions
horn+0.535
anon+0.533
+0.527
scape+0.511
ocal+0.502
Neg logit contributions
 rapp-0.515
hiba-0.513
 hospitality-0.511
 defe-0.487
 welf-0.485
Token Vide
Token ID42551
Feature activation-2.62e+00

Pos logit contributions
horn+0.867
anon+0.865
+0.853
scape+0.828
ocal+0.814
Neg logit contributions
 rapp-0.830
hiba-0.829
 hospitality-0.823
 defe-0.787
 welf-0.783
Tokenopl
Token ID20106
Feature activation+8.33e-01

Pos logit contributions
horn+4.392
anon+4.372
+4.240
ocal+4.200
scape+4.180
Neg logit contributions
 rapp-4.920
 hospitality-4.839
hiba-4.701
 welf-4.633
 defe-4.606
Tokenayer
Token ID2794
Feature activation-2.31e-01

Pos logit contributions
 rapp+1.492
hiba+1.486
 hospitality+1.475
azine+1.416
 defe+1.403
Neg logit contributions
horn-1.459
-1.445
anon-1.444
scape-1.394
ocal-1.352
Token Input
Token ID23412
Feature activation+5.91e-01

Pos logit contributions
horn+0.384
anon+0.383
+0.378
scape+0.366
ocal+0.358
Neg logit contributions
 rapp-0.440
 hospitality-0.436
hiba-0.435
 defe-0.417
 welf-0.415
TokenStream
Token ID12124
Feature activation-8.18e-01

Pos logit contributions
 rapp+1.132
 hospitality+1.127
hiba+1.116
 defe+1.082
 welf+1.067
Neg logit contributions
horn-0.967
anon-0.965
-0.950
scape-0.915
ocal-0.901
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
horn+1.306
anon+1.299
+1.280
scape+1.240
ocal+1.212
Neg logit contributions
 rapp-1.619
hiba-1.610
 hospitality-1.605
 defe-1.538
 welf-1.532
Token to
Token ID284
Feature activation-2.22e-01

Pos logit contributions
 rapp+0.134
hiba+0.133
 hospitality+0.133
 defe+0.127
 welf+0.126
Neg logit contributions
horn-0.137
anon-0.136
-0.135
scape-0.131
ocal-0.129
Token cast
Token ID3350
Feature activation+3.82e-01

Pos logit contributions
horn+0.454
anon+0.452
+0.448
scape+0.436
ocal+0.429
Neg logit contributions
 rapp-0.341
 hospitality-0.337
hiba-0.335
 defe-0.319
 welf-0.316
Token a
Token ID257
Feature activation-8.53e-02

Pos logit contributions
 rapp+0.672
 hospitality+0.664
hiba+0.660
 defe+0.633
 welf+0.627
Neg logit contributions
horn-0.693
anon-0.691
-0.682
scape-0.661
ocal-0.650
Token woman
Token ID2415
Feature activation-2.10e-01

Pos logit contributions
horn+0.167
anon+0.166
+0.165
scape+0.160
ocal+0.157
Neg logit contributions
 rapp-0.138
 hospitality-0.136
hiba-0.136
 defe-0.129
 welf-0.128
Token of
Token ID286
Feature activation-6.55e-02

Pos logit contributions
horn+0.371
anon+0.369
+0.365
scape+0.353
ocal+0.347
Neg logit contributions
 rapp-0.380
 hospitality-0.377
hiba-0.377
 defe-0.360
 welf-0.358
Token color
Token ID3124
Feature activation-2.62e+00

Pos logit contributions
horn+0.100
anon+0.099
+0.098
scape+0.094
ocal+0.092
Neg logit contributions
 rapp-0.134
 hospitality-0.133
hiba-0.133
 defe-0.128
 welf-0.127
Token to
Token ID284
Feature activation-9.24e-02

Pos logit contributions
horn+4.193
anon+4.021
+3.987
scape+3.939
ocal+3.877
Neg logit contributions
hiba-5.167
 hospitality-4.998
 rapp-4.964
 welf-4.843
 defe-4.763
Token play
Token ID711
Feature activation-3.31e-01

Pos logit contributions
horn+0.166
anon+0.166
+0.163
scape+0.159
ocal+0.156
Neg logit contributions
 rapp-0.164
 hospitality-0.162
hiba-0.161
 defe-0.155
 welf-0.153
Token Bat
Token ID6577
Feature activation+7.76e-01

Pos logit contributions
horn+0.597
anon+0.591
+0.588
scape+0.568
ocal+0.559
Neg logit contributions
 rapp-0.585
hiba-0.583
 hospitality-0.580
 defe-0.553
 welf-0.552
Tokengirl
Token ID15219
Feature activation+1.92e-01

Pos logit contributions
 rapp+0.933
 hospitality+0.915
hiba+0.912
azine+0.862
 defe+0.850
Neg logit contributions
anon-1.808
-1.807
horn-1.802
scape-1.742
ocal-1.719
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 rapp+0.365
 hospitality+0.361
hiba+0.359
 defe+0.346
 welf+0.343
Neg logit contributions
horn-0.319
anon-0.319
-0.315
scape-0.304
ocal-0.298
Token
Token ID198
Feature activation+3.07e-01

Pos logit contributions
hiba+0.793
 rapp+0.792
 hospitality+0.789
azine+0.758
hler+0.740
Neg logit contributions
-0.836
horn-0.818
anon-0.816
ocal-0.791
inding-0.783
TokenC
Token ID34
Feature activation+2.31e-01

Pos logit contributions
 rapp+0.573
hiba+0.569
 hospitality+0.567
 defe+0.542
 welf+0.539
Neg logit contributions
horn-0.520
anon-0.518
-0.515
scape-0.496
ocal-0.489
TokenOOK
Token ID15308
Feature activation-4.35e-01

Pos logit contributions
 rapp+0.459
 hospitality+0.456
hiba+0.456
 defe+0.436
 welf+0.433
Neg logit contributions
horn-0.366
anon-0.364
-0.361
scape-0.346
ocal-0.340
Token COUNTY
Token ID41012
Feature activation-5.19e-01

Pos logit contributions
horn+0.682
anon+0.678
+0.670
scape+0.646
ocal+0.632
Neg logit contributions
 rapp-0.871
hiba-0.865
 hospitality-0.864
 defe-0.831
 welf-0.826
Token CR
Token ID8740
Feature activation-6.78e-01

Pos logit contributions
horn+0.899
anon+0.895
+0.885
scape+0.855
ocal+0.841
Neg logit contributions
 rapp-0.957
 hospitality-0.948
hiba-0.945
 defe-0.904
 welf-0.899
TokenIM
Token ID3955
Feature activation-2.59e+00

Pos logit contributions
horn+0.893
anon+0.890
+0.880
scape+0.838
ocal+0.820
Neg logit contributions
 rapp-1.525
 hospitality-1.518
hiba-1.518
 defe-1.464
 welf-1.459
TokenINAL
Token ID17961
Feature activation-1.50e+00

Pos logit contributions
anon+3.253
horn+3.227
scape+3.123
+3.092
OWN+3.046
Neg logit contributions
hiba-5.995
 rapp-5.852
 hospitality-5.803
 welf-5.746
 defe-5.693
Token COUR
Token ID37384
Feature activation-1.16e+00

Pos logit contributions
horn+2.610
anon+2.603
+2.532
OWN+2.477
scape+2.473
Neg logit contributions
hiba-2.739
 hospitality-2.686
 rapp-2.683
 defe-2.589
 welf-2.567
TokenTH
Token ID4221
Feature activation-1.01e+00

Pos logit contributions
horn+1.787
anon+1.777
+1.758
scape+1.696
ocal+1.657
Neg logit contributions
 rapp-2.365
 hospitality-2.348
hiba-2.345
 defe-2.254
 welf-2.243
TokenOU
Token ID2606
Feature activation-2.80e-01

Pos logit contributions
horn+1.256
anon+1.246
+1.246
scape+1.172
OWN+1.144
Neg logit contributions
hiba-2.364
 rapp-2.360
 hospitality-2.344
 defe-2.273
 welf-2.267
TokenSE
Token ID5188
Feature activation-1.55e+00

Pos logit contributions
anon+0.481
horn+0.479
+0.470
scape+0.455
ocal+0.449
Neg logit contributions
 rapp-0.519
 hospitality-0.514
hiba-0.509
 defe-0.489
 welf-0.484
TokenOne
Token ID3198
Feature activation+4.02e-02

Pos logit contributions
 rapp+0.060
hiba+0.060
 hospitality+0.060
 defe+0.057
 welf+0.057
Neg logit contributions
horn-0.060
anon-0.060
-0.059
scape-0.057
ocal-0.057
Token man
Token ID582
Feature activation+3.56e-02

Pos logit contributions
 rapp+0.067
 hospitality+0.066
hiba+0.066
 defe+0.063
 welf+0.062
Neg logit contributions
horn-0.077
anon-0.076
-0.076
scape-0.074
ocal-0.072
Token managed
Token ID5257
Feature activation-2.76e-01

Pos logit contributions
 rapp+0.064
 hospitality+0.063
hiba+0.063
 defe+0.060
 welf+0.060
Neg logit contributions
horn-0.063
anon-0.063
-0.063
scape-0.060
ocal-0.059
Token to
Token ID284
Feature activation+2.83e-02

Pos logit contributions
horn+0.468
anon+0.461
+0.459
scape+0.445
ocal+0.439
Neg logit contributions
hiba-0.509
 hospitality-0.509
 rapp-0.509
 defe-0.489
azine-0.487
Token capture
Token ID8006
Feature activation-4.32e-01

Pos logit contributions
 hospitality+0.040
hiba+0.040
 rapp+0.039
 welf+0.037
 defe+0.037
Neg logit contributions
horn-0.061
anon-0.061
-0.060
scape-0.059
ocal-0.058
Token footage
Token ID9640
Feature activation-2.59e+00

Pos logit contributions
horn+0.770
anon+0.765
+0.757
scape+0.734
ocal+0.720
Neg logit contributions
 rapp-0.773
hiba-0.767
 hospitality-0.766
 defe-0.732
 welf-0.727
Token of
Token ID286
Feature activation-7.06e-01

Pos logit contributions
horn+4.183
anon+4.098
+4.006
scape+3.964
ocal+3.816
Neg logit contributions
hiba-4.989
 hospitality-4.952
 rapp-4.937
 welf-4.766
 defe-4.757
Token a
Token ID257
Feature activation-4.09e-01

Pos logit contributions
horn+1.271
anon+1.253
+1.243
scape+1.205
ocal+1.180
Neg logit contributions
hiba-1.248
 rapp-1.239
 hospitality-1.230
 defe-1.181
 welf-1.173
Token be
Token ID307
Feature activation-7.39e-01

Pos logit contributions
horn+0.795
anon+0.789
+0.782
scape+0.760
ocal+0.747
Neg logit contributions
 rapp-0.661
hiba-0.660
 hospitality-0.655
 defe-0.623
 welf-0.620
Tokenaver
Token ID8770
Feature activation-2.04e-01

Pos logit contributions
anon+1.416
horn+1.411
+1.410
scape+1.359
ocal+1.331
Neg logit contributions
 rapp-1.209
hiba-1.204
 hospitality-1.191
 defe-1.125
azine-1.124
Token attacking
Token ID9274
Feature activation+4.64e-01

Pos logit contributions
horn+0.396
anon+0.391
+0.386
scape+0.380
ocal+0.371
Neg logit contributions
hiba-0.333
 rapp-0.330
 hospitality-0.330
 defe-0.313
 welf-0.312
Token that
Token ID326
Feature activation-1.78e-01

Pos logit contributions
horn+0.559
anon+0.554
+0.549
scape+0.535
ocal+0.525
Neg logit contributions
 rapp-0.498
hiba-0.494
 hospitality-0.493
 defe-0.470
 welf-0.467
Token many
Token ID867
Feature activation+8.59e-02

Pos logit contributions
anon+0.333
horn+0.332
+0.328
scape+0.318
ocal+0.313
Neg logit contributions
 rapp-0.303
 hospitality-0.301
hiba-0.300
 defe-0.286
 welf-0.284
Token of
Token ID286
Feature activation-2.01e-01

Pos logit contributions
 rapp+0.146
 hospitality+0.144
hiba+0.143
 defe+0.137
 welf+0.136
Neg logit contributions
anon-0.161
horn-0.161
-0.160
scape-0.154
ocal-0.152
Token these
Token ID777
Feature activation-3.21e-01

Pos logit contributions
horn+0.367
anon+0.363
+0.361
scape+0.349
ocal+0.344
Neg logit contributions
 rapp-0.354
hiba-0.350
 hospitality-0.349
 defe-0.334
 welf-0.332
Token revealing
Token ID13477
Feature activation-3.04e-01

Pos logit contributions
horn+0.612
anon+0.609
+0.603
scape+0.584
ocal+0.575
Neg logit contributions
 rapp-0.536
 hospitality-0.530
hiba-0.526
 defe-0.504
 welf-0.499
Token cinem
Token ID13483
Feature activation-2.55e+00

Pos logit contributions
horn+0.587
anon+0.585
+0.579
scape+0.561
ocal+0.553
Neg logit contributions
 rapp-0.499
 hospitality-0.494
hiba-0.491
 defe-0.469
 welf-0.465
Tokenatics
Token ID23372
Feature activation-6.26e-01

Pos logit contributions
anon+4.068
horn+3.997
+3.934
scape+3.867
ocal+3.834
Neg logit contributions
 rapp-4.968
 hospitality-4.962
hiba-4.907
 defe-4.815
 welf-4.757
Token have
Token ID423
Feature activation+3.73e-02

Pos logit contributions
horn+1.152
anon+1.149
+1.136
scape+1.111
ocal+1.085
Neg logit contributions
 rapp-1.066
hiba-1.066
 hospitality-1.066
 defe-1.014
 welf-1.012
Token been
Token ID587
Feature activation-2.52e-01

Pos logit contributions
 rapp+0.067
 hospitality+0.067
hiba+0.067
 defe+0.064
 welf+0.063
Neg logit contributions
horn-0.066
anon-0.066
-0.065
scape-0.063
ocal-0.062
Token compressed
Token ID25388
Feature activation+1.94e-01

Pos logit contributions
horn+0.466
anon+0.464
+0.459
scape+0.446
ocal+0.438
Neg logit contributions
 rapp-0.435
 hospitality-0.431
hiba-0.430
 defe-0.410
 welf-0.408
Token to
Token ID284
Feature activation+3.61e-02

Pos logit contributions
 rapp+0.371
 hospitality+0.369
hiba+0.366
 defe+0.352
 welf+0.349
Neg logit contributions
horn-0.324
anon-0.322
-0.320
scape-0.306
ocal-0.303
Token the
Token ID262
Feature activation-5.77e-02

Pos logit contributions
 rapp+0.346
hiba+0.343
 hospitality+0.341
 defe+0.327
 welf+0.324
Neg logit contributions
horn-0.400
anon-0.398
-0.394
scape-0.382
ocal-0.377
Token women
Token ID1466
Feature activation-2.04e-01

Pos logit contributions
horn+0.125
anon+0.124
+0.123
scape+0.120
ocal+0.118
Neg logit contributions
 rapp-0.081
 hospitality-0.080
hiba-0.080
 defe-0.076
 welf-0.075
Token and
Token ID290
Feature activation+2.10e-02

Pos logit contributions
horn+0.350
anon+0.348
+0.344
scape+0.333
ocal+0.327
Neg logit contributions
 rapp-0.377
 hospitality-0.374
hiba-0.374
 defe-0.358
 welf-0.356
Token people
Token ID661
Feature activation+4.49e-01

Pos logit contributions
 rapp+0.032
 hospitality+0.032
hiba+0.031
 defe+0.030
 welf+0.030
Neg logit contributions
anon-0.043
horn-0.043
-0.043
scape-0.041
ocal-0.041
Token of
Token ID286
Feature activation+1.74e-01

Pos logit contributions
 rapp+0.780
 hospitality+0.769
hiba+0.765
 defe+0.730
 welf+0.724
Neg logit contributions
anon-0.824
horn-0.823
-0.812
scape-0.788
ocal-0.769
Token color
Token ID3124
Feature activation-2.54e+00

Pos logit contributions
 rapp+0.361
 hospitality+0.360
hiba+0.357
 defe+0.344
 welf+0.343
Neg logit contributions
horn-0.259
anon-0.258
-0.254
scape-0.245
ocal-0.239
Token who
Token ID508
Feature activation+1.13e-01

Pos logit contributions
horn+4.225
anon+4.068
+3.996
scape+3.974
ocal+3.936
Neg logit contributions
hiba-4.736
 hospitality-4.603
 rapp-4.573
 defe-4.506
 welf-4.505
Token appeared
Token ID4120
Feature activation-8.61e-02

Pos logit contributions
 rapp+0.225
 hospitality+0.223
hiba+0.223
 defe+0.214
 welf+0.212
Neg logit contributions
horn-0.180
anon-0.180
-0.177
scape-0.171
ocal-0.168
Token onstage
Token ID38500
Feature activation+4.84e-01

Pos logit contributions
horn+0.151
anon+0.149
+0.148
scape+0.143
ocal+0.141
Neg logit contributions
 rapp-0.157
hiba-0.156
 hospitality-0.156
 defe-0.149
 welf-0.148
Token weren
Token ID6304
Feature activation+1.50e-01

Pos logit contributions
 rapp+0.885
 hospitality+0.873
hiba+0.870
 defe+0.832
 welf+0.821
Neg logit contributions
horn-0.846
anon-0.844
-0.838
scape-0.806
ocal-0.791
Token't
Token ID470
Feature activation+2.36e-01

Pos logit contributions
 rapp+0.283
hiba+0.282
 hospitality+0.281
azine+0.268
hler+0.265
Neg logit contributions
-0.243
horn-0.242
anon-0.241
scape-0.230
ocal-0.228
Token is
Token ID318
Feature activation-3.13e-01

Pos logit contributions
horn+0.075
anon+0.074
+0.074
scape+0.071
ocal+0.070
Neg logit contributions
 rapp-0.087
 hospitality-0.086
hiba-0.086
 defe-0.083
 welf-0.082
Token,
Token ID11
Feature activation-4.33e-02

Pos logit contributions
anon+0.559
horn+0.557
+0.552
scape+0.532
ocal+0.523
Neg logit contributions
 rapp-0.562
 hospitality-0.555
hiba-0.550
 defe-0.529
 welf-0.522
Token therefore
Token ID4361
Feature activation+4.12e-02

Pos logit contributions
horn+0.078
anon+0.078
+0.077
scape+0.074
ocal+0.073
Neg logit contributions
 rapp-0.077
 hospitality-0.076
hiba-0.076
 defe-0.073
 welf-0.072
Token,
Token ID11
Feature activation-2.29e-01

Pos logit contributions
 rapp+0.070
 hospitality+0.070
hiba+0.069
 defe+0.066
azine+0.065
Neg logit contributions
-0.076
anon-0.076
horn-0.076
scape-0.072
ocal-0.072
Token in
Token ID287
Feature activation-4.30e-01

Pos logit contributions
horn+0.407
anon+0.407
+0.402
scape+0.389
ocal+0.382
Neg logit contributions
 rapp-0.411
 hospitality-0.406
hiba-0.404
 defe-0.387
 welf-0.383
Token motion
Token ID6268
Feature activation-2.54e+00

Pos logit contributions
horn+0.755
anon+0.753
+0.746
scape+0.720
ocal+0.708
Neg logit contributions
 rapp-0.781
 hospitality-0.774
hiba-0.772
 defe-0.738
 welf-0.733
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
horn+3.952
anon+3.840
+3.779
scape+3.777
ocal+3.631
Neg logit contributions
hiba-5.004
 rapp-4.970
 hospitality-4.939
 welf-4.835
 defe-4.772
Token
Token ID198
Feature activation-2.45e-01

Pos logit contributions
horn+0.205
anon+0.205
+0.204
scape+0.196
ocal+0.194
Neg logit contributions
 rapp-0.201
 hospitality-0.199
hiba-0.199
 defe-0.189
 welf-0.188
Token
Token ID198
Feature activation-2.01e-01

Pos logit contributions
+0.424
horn+0.421
anon+0.420
scape+0.400
ocal+0.399
Neg logit contributions
 rapp-0.443
hiba-0.441
 hospitality-0.440
azine-0.418
 defe-0.415
TokenWhat
Token ID2061
Feature activation+6.34e-01

Pos logit contributions
horn+0.344
anon+0.342
+0.339
scape+0.328
ocal+0.322
Neg logit contributions
 rapp-0.372
hiba-0.369
 hospitality-0.368
 defe-0.353
 welf-0.349
Token would
Token ID561
Feature activation-1.01e+00

Pos logit contributions
 rapp+1.080
 hospitality+1.070
hiba+1.060
 defe+1.023
 welf+1.006
Neg logit contributions
horn-1.183
anon-1.173
-1.169
scape-1.123
ocal-1.116
Tokenat
Token ID265
Feature activation-2.57e-01

Pos logit contributions
hiba+1.947
 rapp+1.940
 hospitality+1.922
azine+1.852
 defe+1.845
Neg logit contributions
horn-1.381
-1.375
anon-1.363
scape-1.303
OWN-1.276
Tokenoder
Token ID12342
Feature activation+3.12e-02

Pos logit contributions
horn+0.400
anon+0.398
+0.392
scape+0.379
ocal+0.372
Neg logit contributions
 rapp-0.518
 hospitality-0.514
hiba-0.512
 defe-0.493
 welf-0.490
Tokenma
Token ID2611
Feature activation-4.21e-01

Pos logit contributions
 rapp+0.059
 hospitality+0.059
hiba+0.059
 defe+0.056
 welf+0.056
Neg logit contributions
horn-0.052
anon-0.052
-0.051
scape-0.049
ocal-0.049
Token (
Token ID357
Feature activation-1.83e-01

Pos logit contributions
horn+0.713
anon+0.710
+0.702
scape+0.678
ocal+0.665
Neg logit contributions
 rapp-0.789
 hospitality-0.782
hiba-0.781
 defe-0.749
 welf-0.744
Tokenalso
Token ID14508
Feature activation+6.11e-01

Pos logit contributions
horn+0.281
anon+0.279
+0.276
scape+0.266
ocal+0.260
Neg logit contributions
 rapp-0.372
 hospitality-0.369
hiba-0.369
 defe-0.354
 welf-0.352
Token known
Token ID1900
Feature activation+2.04e+00

Pos logit contributions
 rapp+1.147
 hospitality+1.130
hiba+1.126
 defe+1.088
 welf+1.076
Neg logit contributions
horn-1.034
anon-1.033
-1.018
scape-0.986
ocal-0.964
Token as
Token ID355
Feature activation+2.22e-01

Pos logit contributions
 rapp+3.486
 hospitality+3.427
hiba+3.417
 defe+3.276
 welf+3.243
Neg logit contributions
horn-3.742
anon-3.708
-3.662
scape-3.560
ocal-3.499
Token "
Token ID366
Feature activation+7.66e-01

Pos logit contributions
 rapp+0.377
 hospitality+0.372
hiba+0.370
 defe+0.355
 welf+0.352
Neg logit contributions
horn-0.419
anon-0.416
-0.413
scape-0.400
ocal-0.393
TokenP
Token ID47
Feature activation+7.21e-01

Pos logit contributions
 rapp+1.588
hiba+1.575
 hospitality+1.569
 defe+1.513
 welf+1.500
Neg logit contributions
horn-1.122
anon-1.119
-1.116
scape-1.062
ocal-1.043
Tokenor
Token ID273
Feature activation+1.10e+00

Pos logit contributions
 rapp+1.463
hiba+1.456
 hospitality+1.451
 defe+1.392
 welf+1.384
Neg logit contributions
horn-1.105
anon-1.096
-1.089
scape-1.045
ocal-1.019
Tokenoker
Token ID11020
Feature activation+2.53e-01

Pos logit contributions
hiba+2.241
 rapp+2.239
 hospitality+2.205
azine+2.128
 defe+2.117
Neg logit contributions
horn-1.666
-1.662
anon-1.658
scape-1.587
OWN-1.542
Token
Token ID447
Feature activation+3.15e-02

Pos logit contributions
 rapp+0.033
hiba+0.032
 hospitality+0.032
 defe+0.030
 welf+0.030
Neg logit contributions
horn-0.037
anon-0.037
-0.037
scape-0.036
ocal-0.035
Token
Token ID250
Feature activation+1.93e-01

Pos logit contributions
 hospitality+0.073
 rapp+0.073
hiba+0.072
 defe+0.070
azine+0.069
Neg logit contributions
anon-0.038
horn-0.038
scape-0.036
-0.035
ocal-0.035
TokenA
Token ID32
Feature activation+2.34e-01

Pos logit contributions
 rapp+0.348
 hospitality+0.346
hiba+0.344
 defe+0.329
 welf+0.327
Neg logit contributions
anon-0.337
horn-0.337
-0.333
scape-0.322
ocal-0.318
Token gentleman
Token ID22427
Feature activation+3.51e-01

Pos logit contributions
 rapp+0.238
 hospitality+0.235
hiba+0.233
 defe+0.215
 welf+0.213
Neg logit contributions
horn-0.598
anon-0.597
-0.592
scape-0.579
ocal-0.572
Token of
Token ID286
Feature activation-3.81e-02

Pos logit contributions
 rapp+0.746
 hospitality+0.739
hiba+0.735
 defe+0.710
 welf+0.703
Neg logit contributions
anon-0.509
horn-0.509
-0.502
scape-0.480
ocal-0.472
Token genuine
Token ID8768
Feature activation+1.62e+00

Pos logit contributions
horn+0.075
anon+0.074
+0.074
scape+0.071
ocal+0.070
Neg logit contributions
 rapp-0.061
hiba-0.061
 hospitality-0.061
 defe-0.058
 welf-0.057
Token polit
Token ID1408
Feature activation+1.51e+00

Pos logit contributions
 hospitality+2.405
 rapp+2.350
hiba+2.241
 defe+2.168
 welf+2.167
Neg logit contributions
anon-3.466
horn-3.447
-3.402
scape-3.332
ocal-3.280
Tokeneness
Token ID9449
Feature activation+2.74e-01

Pos logit contributions
 rapp+2.534
 hospitality+2.526
hiba+2.491
azine+2.394
 welf+2.346
Neg logit contributions
anon-2.811
horn-2.811
-2.763
scape-2.686
ocal-2.621
Token will
Token ID481
Feature activation+5.48e-01

Pos logit contributions
 rapp+0.535
 hospitality+0.531
hiba+0.528
 defe+0.507
 welf+0.503
Neg logit contributions
horn-0.444
anon-0.442
-0.437
scape-0.421
ocal-0.414
Token not
Token ID407
Feature activation+3.62e-01

Pos logit contributions
 rapp+0.807
 hospitality+0.794
hiba+0.787
 defe+0.748
 welf+0.737
Neg logit contributions
anon-1.156
horn-1.156
-1.146
scape-1.111
ocal-1.098
Token give
Token ID1577
Feature activation-1.64e-01

Pos logit contributions
 rapp+0.542
 hospitality+0.535
hiba+0.532
 defe+0.505
 welf+0.500
Neg logit contributions
horn-0.753
anon-0.751
-0.745
scape-0.723
ocal-0.713
Token
Token ID231
Feature activation-1.32e+00

Pos logit contributions
horn+0.156
anon+0.155
+0.153
scape+0.146
ocal+0.142
Neg logit contributions
 rapp-0.287
 hospitality-0.285
hiba-0.285
 defe-0.275
 welf-0.273
Tokenи
Token ID18849
Feature activation+3.54e-01

Pos logit contributions
horn+2.662
anon+2.657
+2.636
scape+2.568
ocal+2.518
Neg logit contributions
 rapp-2.032
hiba-2.012
 hospitality-2.003
 defe-1.905
 welf-1.890
Token
Token ID141
Feature activation-4.09e-01

Pos logit contributions
 rapp+0.544
 hospitality+0.538
hiba+0.536
 defe+0.508
 welf+0.505
Neg logit contributions
horn-0.720
anon-0.718
-0.709
scape-0.690
ocal-0.681
Token
Token ID227
Feature activation-1.36e+00

Pos logit contributions
horn+0.518
anon+0.514
+0.511
scape+0.485
ocal+0.472
Neg logit contributions
 rapp-0.943
hiba-0.936
 hospitality-0.936
 defe-0.903
 welf-0.900
Token (
Token ID357
Feature activation-2.68e-01

Pos logit contributions
horn+2.437
anon+2.437
+2.410
scape+2.347
ocal+2.275
Neg logit contributions
hiba-2.393
 rapp-2.375
 hospitality-2.371
 defe-2.255
 welf-2.254
Tokenо
Token ID15166
Feature activation+1.47e+00

Pos logit contributions
horn+0.485
anon+0.483
+0.478
scape+0.464
ocal+0.455
Neg logit contributions
 rapp-0.471
 hospitality-0.467
hiba-0.466
 defe-0.445
 welf-0.442
Tokenс
Token ID21727
Feature activation+2.56e-01

Pos logit contributions
 rapp+2.259
 hospitality+2.229
hiba+2.196
 defe+2.113
azine+2.093
Neg logit contributions
horn-2.941
anon-2.919
-2.853
scape-2.798
ocal-2.784
Tokenо�
Token ID25443
Feature activation+3.15e-01

Pos logit contributions
 rapp+0.391
 hospitality+0.386
hiba+0.384
 defe+0.365
 welf+0.362
Neg logit contributions
horn-0.520
anon-0.518
-0.511
scape-0.498
ocal-0.491
Token
Token ID109
Feature activation-7.21e-01

Pos logit contributions
 rapp+0.758
 hospitality+0.751
hiba+0.750
 defe+0.729
 welf+0.722
Neg logit contributions
horn-0.367
anon-0.363
-0.345
scape-0.337
ocal-0.329
Tokenе
Token ID16843
Feature activation+1.15e+00

Pos logit contributions
horn+1.464
anon+1.459
+1.450
scape+1.410
ocal+1.387
Neg logit contributions
 rapp-1.109
hiba-1.103
 hospitality-1.098
 defe-1.040
 welf-1.037
Tokenн
Token ID22177
Feature activation+5.39e-01

Pos logit contributions
 rapp+1.735
 hospitality+1.711
hiba+1.697
 defe+1.617
 welf+1.600
Neg logit contributions
horn-2.333
anon-2.325
-2.299
scape-2.235
ocal-2.226
Token more
Token ID517
Feature activation+1.17e-01

Pos logit contributions
 rapp+0.903
 hospitality+0.891
hiba+0.880
 defe+0.848
 welf+0.834
Neg logit contributions
horn-0.991
anon-0.990
-0.984
scape-0.944
ocal-0.929
Token as
Token ID355
Feature activation+2.14e-02

Pos logit contributions
 rapp+0.191
 hospitality+0.189
hiba+0.187
 defe+0.179
 welf+0.177
Neg logit contributions
horn-0.226
anon-0.225
-0.223
scape-0.216
ocal-0.213
Token being
Token ID852
Feature activation-1.80e-01

Pos logit contributions
 rapp+0.038
 hospitality+0.038
hiba+0.038
 defe+0.036
 welf+0.036
Neg logit contributions
horn-0.038
anon-0.038
-0.037
scape-0.036
ocal-0.036
Token youthful
Token ID35444
Feature activation-5.44e-01

Pos logit contributions
horn+0.336
anon+0.335
+0.331
scape+0.321
ocal+0.316
Neg logit contributions
 rapp-0.307
 hospitality-0.304
hiba-0.304
 defe-0.290
 welf-0.288
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
horn+1.008
anon+1.002
+0.990
scape+0.964
ocal+0.947
Neg logit contributions
 rapp-0.930
hiba-0.927
 hospitality-0.919
 defe-0.879
 welf-0.875
Token friendly
Token ID8030
Feature activation+1.31e+00

Pos logit contributions
horn+0.002
anon+0.002
+0.002
scape+0.002
ocal+0.002
Neg logit contributions
 rapp-0.002
 hospitality-0.002
hiba-0.002
 defe-0.002
 welf-0.002
Token,
Token ID11
Feature activation+2.18e-01

Pos logit contributions
 hospitality+2.193
 rapp+2.187
hiba+2.130
 defe+2.070
 welf+2.035
Neg logit contributions
horn-2.474
anon-2.473
-2.436
scape-2.355
ocal-2.322
Token but
Token ID475
Feature activation-3.21e-01

Pos logit contributions
 rapp+0.364
 hospitality+0.360
hiba+0.358
 defe+0.342
 welf+0.339
Neg logit contributions
horn-0.416
anon-0.415
-0.410
scape-0.398
ocal-0.392
Token lonely
Token ID21757
Feature activation-5.03e-02

Pos logit contributions
horn+0.609
anon+0.606
+0.600
scape+0.583
ocal+0.573
Neg logit contributions
 rapp-0.536
hiba-0.532
 hospitality-0.531
 defe-0.505
 welf-0.502
Token and
Token ID290
Feature activation-1.61e-01

Pos logit contributions
horn+0.087
anon+0.086
+0.085
scape+0.083
ocal+0.081
Neg logit contributions
 rapp-0.093
 hospitality-0.092
hiba-0.092
 defe-0.088
 welf-0.087
Token in
Token ID287
Feature activation-8.27e-02

Pos logit contributions
horn+0.308
anon+0.307
+0.304
scape+0.294
ocal+0.290
Neg logit contributions
 rapp-0.267
 hospitality-0.264
hiba-0.262
 defe-0.251
 welf-0.248
Token and
Token ID290
Feature activation+1.75e-01

Pos logit contributions
horn+0.068
anon+0.067
+0.067
scape+0.065
ocal+0.063
Neg logit contributions
 rapp-0.071
hiba-0.070
 hospitality-0.070
 defe-0.067
 welf-0.067
Token counselling
Token ID47586
Feature activation+1.77e-01

Pos logit contributions
 rapp+0.302
 hospitality+0.300
hiba+0.297
 defe+0.284
 welf+0.282
Neg logit contributions
horn-0.325
anon-0.323
-0.320
scape-0.310
ocal-0.305
Token to
Token ID284
Feature activation-5.82e-02

Pos logit contributions
 rapp+0.328
hiba+0.325
 hospitality+0.325
 defe+0.311
 welf+0.309
Neg logit contributions
horn-0.303
anon-0.301
-0.298
scape-0.288
ocal-0.283
Token help
Token ID1037
Feature activation-4.79e-02

Pos logit contributions
horn+0.118
anon+0.118
+0.117
scape+0.114
ocal+0.112
Neg logit contributions
 rapp-0.089
 hospitality-0.088
hiba-0.088
 defe-0.084
 welf-0.083
Token them
Token ID606
Feature activation+7.05e-01

Pos logit contributions
horn+0.096
anon+0.096
+0.095
scape+0.092
ocal+0.091
Neg logit contributions
 rapp-0.075
hiba-0.074
 hospitality-0.074
 defe-0.070
 welf-0.070
Token deal
Token ID1730
Feature activation+1.39e+00

Pos logit contributions
 rapp+1.022
 hospitality+1.002
hiba+0.998
 defe+0.948
 welf+0.928
Neg logit contributions
horn-1.504
anon-1.499
-1.484
scape-1.441
ocal-1.420
Token with
Token ID351
Feature activation+3.13e-01

Pos logit contributions
 rapp+2.900
 hospitality+2.878
hiba+2.875
 defe+2.766
 welf+2.751
Neg logit contributions
horn-2.063
anon-2.052
-2.024
scape-1.949
ocal-1.907
Token the
Token ID262
Feature activation+3.46e-01

Pos logit contributions
 rapp+0.515
hiba+0.512
 hospitality+0.510
 defe+0.486
 welf+0.483
Neg logit contributions
horn-0.602
anon-0.599
-0.594
scape-0.576
ocal-0.567
Token emotional
Token ID7016
Feature activation+6.98e-01

Pos logit contributions
 rapp+0.546
 hospitality+0.540
hiba+0.537
 defe+0.512
 welf+0.507
Neg logit contributions
horn-0.690
anon-0.687
-0.680
scape-0.661
ocal-0.651
Token impact
Token ID2928
Feature activation+5.48e-01

Pos logit contributions
 rapp+1.027
 hospitality+1.016
hiba+1.007
 defe+0.955
 welf+0.947
Neg logit contributions
horn-1.469
anon-1.463
-1.451
scape-1.406
ocal-1.392
Token of
Token ID286
Feature activation-4.83e-02

Pos logit contributions
 rapp+0.996
 hospitality+0.987
hiba+0.980
 defe+0.940
 welf+0.934
Neg logit contributions
horn-0.960
anon-0.957
-0.943
scape-0.913
ocal-0.899
Token you
Token ID345
Feature activation-5.81e-01

Pos logit contributions
horn+0.502
anon+0.500
+0.494
scape+0.479
ocal+0.470
Neg logit contributions
 rapp-0.487
hiba-0.482
 hospitality-0.482
 defe-0.460
 welf-0.457
Token can
Token ID460
Feature activation+5.36e-01

Pos logit contributions
horn+1.152
anon+1.149
+1.131
scape+1.107
ocal+1.082
Neg logit contributions
 hospitality-0.916
 rapp-0.916
hiba-0.909
 welf-0.865
 defe-0.865
Token watch
Token ID2342
Feature activation-6.04e-01

Pos logit contributions
 rapp+0.897
hiba+0.877
 hospitality+0.870
 defe+0.838
azine+0.821
Neg logit contributions
horn-1.024
-1.018
anon-1.017
scape-0.978
ocal-0.972
Token porn
Token ID8483
Feature activation-6.39e-01

Pos logit contributions
horn+1.099
anon+1.092
+1.077
scape+1.047
ocal+1.025
Neg logit contributions
 rapp-1.055
hiba-1.049
 hospitality-1.047
 defe-1.001
 welf-0.993
Token and
Token ID290
Feature activation+4.41e-02

Pos logit contributions
horn+1.117
anon+1.110
+1.093
scape+1.064
ocal+1.041
Neg logit contributions
 rapp-1.165
 hospitality-1.157
hiba-1.155
 defe-1.108
 welf-1.100
Token masturb
Token ID22938
Feature activation+8.63e-01

Pos logit contributions
 rapp+0.065
 hospitality+0.064
hiba+0.064
 defe+0.060
 welf+0.060
Neg logit contributions
horn-0.093
anon-0.092
-0.092
scape-0.089
ocal-0.088
Tokenate
Token ID378
Feature activation-2.82e-01

Pos logit contributions
 rapp+2.047
hiba+2.031
 hospitality+2.013
azine+1.950
 defe+1.945
Neg logit contributions
horn-1.011
anon-1.009
-1.003
scape-0.947
ocal-0.927
Token for
Token ID329
Feature activation-1.34e-01

Pos logit contributions
horn+0.502
anon+0.500
+0.494
scape+0.479
ocal+0.470
Neg logit contributions
 rapp-0.505
 hospitality-0.500
hiba-0.500
 defe-0.478
 welf-0.474
Token a
Token ID257
Feature activation+2.17e-01

Pos logit contributions
horn+0.265
anon+0.264
+0.261
scape+0.254
ocal+0.250
Neg logit contributions
 rapp-0.216
 hospitality-0.213
hiba-0.213
 defe-0.202
 welf-0.201
Token long
Token ID890
Feature activation+1.02e+00

Pos logit contributions
 rapp+0.347
 hospitality+0.342
hiba+0.340
 defe+0.324
 welf+0.322
Neg logit contributions
horn-0.432
anon-0.430
-0.427
scape-0.412
ocal-0.408
Token time
Token ID640
Feature activation-2.00e-01

Pos logit contributions
 rapp+1.415
 hospitality+1.393
hiba+1.370
 defe+1.303
 welf+1.301
Neg logit contributions
horn-2.219
anon-2.217
-2.206
scape-2.126
ocal-2.121
Token to
Token ID284
Feature activation+7.19e-01

Pos logit contributions
 hospitality+1.031
 rapp+1.015
hiba+1.007
 defe+0.956
azine+0.936
Neg logit contributions
anon-1.236
horn-1.235
-1.215
ocal-1.165
scape-1.162
Token recent
Token ID2274
Feature activation-4.41e-01

Pos logit contributions
 rapp+1.219
 hospitality+1.213
hiba+1.197
 defe+1.153
 welf+1.140
Neg logit contributions
horn-1.358
anon-1.352
-1.332
scape-1.285
ocal-1.277
Token poor
Token ID3595
Feature activation-8.13e-01

Pos logit contributions
horn+0.835
anon+0.832
+0.824
scape+0.802
ocal+0.785
Neg logit contributions
 rapp-0.737
hiba-0.731
 hospitality-0.729
 defe-0.694
 welf-0.689
Token news
Token ID1705
Feature activation+2.47e-01

Pos logit contributions
horn+1.589
anon+1.586
+1.574
scape+1.533
ocal+1.501
Neg logit contributions
 rapp-1.301
hiba-1.293
 hospitality-1.280
 defe-1.223
 welf-1.215
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
 hospitality+0.456
 rapp+0.456
hiba+0.453
 defe+0.432
 welf+0.430
Neg logit contributions
anon-0.425
horn-0.424
-0.419
scape-0.401
ocal-0.400
Token
Token ID198
Feature activation-5.84e-02

Pos logit contributions
 rapp+0.332
 hospitality+0.328
hiba+0.327
 defe+0.316
 welf+0.313
Neg logit contributions
horn-0.266
anon-0.264
-0.261
scape-0.252
ocal-0.246
Token
Token ID198
Feature activation-2.04e-02

Pos logit contributions
+0.103
horn+0.102
anon+0.102
ocal+0.097
scape+0.097
Neg logit contributions
 rapp-0.103
 hospitality-0.103
hiba-0.102
azine-0.097
 defe-0.096
TokenThis
Token ID1212
Feature activation+1.24e-01

Pos logit contributions
horn+0.036
anon+0.035
+0.035
scape+0.034
ocal+0.033
Neg logit contributions
 rapp-0.037
hiba-0.037
 hospitality-0.037
 defe-0.035
 welf-0.035
Token week
Token ID1285
Feature activation-8.40e-01

Pos logit contributions
 rapp+0.217
 hospitality+0.215
hiba+0.214
 defe+0.204
 welf+0.203
Neg logit contributions
horn-0.227
anon-0.226
-0.223
scape-0.216
ocal-0.213
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
horn+1.445
anon+1.437
+1.425
scape+1.383
ocal+1.350
Neg logit contributions
 rapp-1.549
hiba-1.531
 hospitality-1.530
 defe-1.465
 welf-1.458
Token surveys
Token ID16255
Feature activation-5.91e-01

Pos logit contributions
horn+0.234
anon+0.233
+0.231
scape+0.224
ocal+0.220
Neg logit contributions
 rapp-0.214
 hospitality-0.212
hiba-0.212
 defe-0.202
 welf-0.201
Token be
Token ID307
Feature activation-7.59e-01

Pos logit contributions
horn+1.975
anon+1.969
+1.963
scape+1.903
ocal+1.875
Neg logit contributions
 rapp-1.658
hiba-1.656
 hospitality-1.636
 welf-1.554
 defe-1.554
Token in
Token ID287
Feature activation-4.86e-01

Pos logit contributions
horn+1.265
anon+1.254
+1.241
scape+1.204
ocal+1.180
Neg logit contributions
 rapp-1.445
hiba-1.438
 hospitality-1.434
 defe-1.369
 welf-1.366
Token place
Token ID1295
Feature activation-5.96e-01

Pos logit contributions
horn+0.925
anon+0.917
+0.908
scape+0.888
ocal+0.871
Neg logit contributions
hiba-0.810
 rapp-0.807
 hospitality-0.797
 defe-0.762
 welf-0.759
Token and
Token ID290
Feature activation-1.16e-01

Pos logit contributions
horn+1.011
anon+1.001
+0.997
scape+0.963
ocal+0.950
Neg logit contributions
 rapp-1.116
hiba-1.113
 hospitality-1.105
 welf-1.058
 defe-1.052
Token how
Token ID703
Feature activation-1.37e-01

Pos logit contributions
horn+0.240
anon+0.239
+0.236
scape+0.230
ocal+0.227
Neg logit contributions
 rapp-0.173
 hospitality-0.171
hiba-0.171
 defe-0.162
 welf-0.161
Token it
Token ID340
Feature activation-3.41e-02

Pos logit contributions
horn+0.261
anon+0.260
+0.257
scape+0.250
ocal+0.245
Neg logit contributions
 rapp-0.230
hiba-0.228
 hospitality-0.228
 defe-0.217
 welf-0.215
Token
Token ID447
Feature activation-2.74e-01

Pos logit contributions
horn+0.047
anon+0.047
+0.046
scape+0.044
ocal+0.043
Neg logit contributions
 rapp-0.075
 hospitality-0.074
hiba-0.074
 defe-0.072
 welf-0.071
Token
Token ID247
Feature activation+1.30e-01

Pos logit contributions
horn+0.296
+0.295
anon+0.293
scape+0.273
ocal+0.266
Neg logit contributions
 rapp-0.682
hiba-0.677
 hospitality-0.677
 defe-0.653
 welf-0.653
Tokens
Token ID82
Feature activation-3.72e-01

Pos logit contributions
 rapp+0.296
hiba+0.294
 hospitality+0.294
 defe+0.284
 welf+0.282
Neg logit contributions
horn-0.168
anon-0.167
-0.163
scape-0.156
ocal-0.153
Token run
Token ID1057
Feature activation+4.99e-01

Pos logit contributions
horn+0.689
anon+0.686
+0.678
scape+0.658
ocal+0.647
Neg logit contributions
 rapp-0.641
 hospitality-0.635
hiba-0.634
 defe-0.605
 welf-0.601
Token.
Token ID13
Feature activation-4.91e-02

Pos logit contributions
 rapp+0.943
 hospitality+0.938
hiba+0.929
 defe+0.895
 welf+0.885
Neg logit contributions
horn-0.837
anon-0.832
-0.815
scape-0.788
ocal-0.768
Token
Token ID198
Feature activation+4.45e-02

Pos logit contributions
 rapp+0.132
 hospitality+0.131
hiba+0.131
 defe+0.125
 welf+0.125
Neg logit contributions
horn-0.123
anon-0.123
-0.121
scape-0.117
ocal-0.115
Token
Token ID198
Feature activation+3.90e-02

Pos logit contributions
 rapp+0.079
hiba+0.078
 hospitality+0.078
azine+0.075
 defe+0.074
Neg logit contributions
-0.078
horn-0.077
anon-0.077
ocal-0.074
scape-0.073
TokenThe
Token ID464
Feature activation+1.35e-01

Pos logit contributions
 rapp+0.070
 hospitality+0.070
hiba+0.070
 defe+0.067
 welf+0.066
Neg logit contributions
horn-0.069
anon-0.068
-0.068
scape-0.065
ocal-0.064
Token temperature
Token ID5951
Feature activation-7.94e-01

Pos logit contributions
 rapp+0.216
 hospitality+0.214
hiba+0.213
 defe+0.203
 welf+0.201
Neg logit contributions
horn-0.266
anon-0.265
-0.262
scape-0.255
ocal-0.251
Token he
Token ID339
Feature activation+1.16e-01

Pos logit contributions
horn+1.331
anon+1.322
+1.306
scape+1.269
ocal+1.238
Neg logit contributions
 rapp-1.498
hiba-1.491
 hospitality-1.489
 defe-1.426
 welf-1.422
Token felt
Token ID2936
Feature activation+3.51e-02

Pos logit contributions
 rapp+0.228
hiba+0.226
 hospitality+0.226
 defe+0.217
 welf+0.215
Neg logit contributions
horn-0.187
anon-0.187
-0.184
scape-0.178
ocal-0.174
Token through
Token ID832
Feature activation+8.15e-01

Pos logit contributions
 rapp+0.063
 hospitality+0.063
hiba+0.062
 defe+0.060
 welf+0.059
Neg logit contributions
horn-0.062
anon-0.062
-0.061
scape-0.059
ocal-0.058
Token her
Token ID607
Feature activation+3.51e-01

Pos logit contributions
 rapp+1.348
 hospitality+1.338
hiba+1.329
 defe+1.268
 welf+1.260
Neg logit contributions
anon-1.561
horn-1.560
-1.540
scape-1.491
ocal-1.472
Token clothes
Token ID8242
Feature activation-3.88e-01

Pos logit contributions
 rapp+0.503
 hospitality+0.497
hiba+0.492
 defe+0.468
 welf+0.464
Neg logit contributions
horn-0.752
anon-0.750
-0.743
scape-0.722
ocal-0.714
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
horn+0.666
anon+0.662
+0.652
scape+0.636
ocal+0.620
Neg logit contributions
 rapp-0.717
 hospitality-0.714
hiba-0.712
 defe-0.684
 welf-0.682
Token it
Token ID340
Feature activation+2.49e-01

Pos logit contributions
 rapp+0.284
 hospitality+0.281
hiba+0.280
 defe+0.268
 welf+0.266
Neg logit contributions
horn-0.296
anon-0.295
-0.291
scape-0.282
ocal-0.278
Token found
Token ID1043
Feature activation-6.51e-01

Pos logit contributions
horn+0.663
anon+0.660
+0.652
scape+0.631
ocal+0.620
Neg logit contributions
 rapp-0.689
 hospitality-0.683
hiba-0.682
 defe-0.652
 welf-0.647
Token not
Token ID407
Feature activation+3.30e-01

Pos logit contributions
horn+1.297
anon+1.290
+1.276
scape+1.244
ocal+1.226
Neg logit contributions
 rapp-1.024
hiba-1.016
 hospitality-1.013
 defe-0.962
 welf-0.956
Token guilty
Token ID6717
Feature activation-8.51e-01

Pos logit contributions
 rapp+0.517
 hospitality+0.513
hiba+0.512
 defe+0.485
 welf+0.480
Neg logit contributions
horn-0.660
anon-0.658
-0.651
scape-0.633
ocal-0.623
Token in
Token ID287
Feature activation-2.70e-01

Pos logit contributions
horn+1.441
anon+1.430
+1.411
scape+1.371
ocal+1.344
Neg logit contributions
 rapp-1.593
hiba-1.583
 hospitality-1.581
 defe-1.512
 welf-1.509
Token the
Token ID262
Feature activation-6.87e-02

Pos logit contributions
horn+0.510
anon+0.508
+0.502
scape+0.489
ocal+0.479
Neg logit contributions
 rapp-0.453
hiba-0.450
 hospitality-0.448
 defe-0.427
 welf-0.423
Token upcoming
Token ID7865
Feature activation-9.83e-02

Pos logit contributions
horn+0.137
anon+0.137
+0.135
scape+0.132
ocal+0.129
Neg logit contributions
 rapp-0.109
 hospitality-0.107
hiba-0.107
 defe-0.102
 welf-0.101
Token trial
Token ID4473
Feature activation-1.35e-01

Pos logit contributions
horn+0.255
anon+0.254
+0.252
scape+0.247
ocal+0.244
Neg logit contributions
 rapp-0.096
 hospitality-0.094
hiba-0.094
 defe-0.086
 welf-0.085
Token scheduled
Token ID7530
Feature activation+1.07e+00

Pos logit contributions
horn+0.228
anon+0.227
+0.225
scape+0.217
ocal+0.213
Neg logit contributions
 rapp-0.255
 hospitality-0.252
hiba-0.252
 defe-0.241
 welf-0.240
Token to
Token ID284
Feature activation+2.43e-01

Pos logit contributions
 rapp+1.867
 hospitality+1.854
hiba+1.845
 defe+1.769
azine+1.737
Neg logit contributions
horn-1.947
anon-1.940
-1.917
scape-1.856
ocal-1.827
Token begin
Token ID2221
Feature activation+1.22e+00

Pos logit contributions
 rapp+0.360
 hospitality+0.355
hiba+0.354
 defe+0.336
 welf+0.331
Neg logit contributions
horn-0.511
anon-0.508
-0.503
scape-0.490
ocal-0.483
Token on
Token ID319
Feature activation-4.05e-01

Pos logit contributions
 rapp+2.169
 hospitality+2.152
hiba+2.095
 defe+2.043
azine+2.001
Neg logit contributions
horn-2.195
anon-2.174
-2.163
scape-2.073
ocal-2.067
Token a
Token ID257
Feature activation-1.40e-02

Pos logit contributions
 rapp+0.055
 hospitality+0.054
hiba+0.054
 defe+0.052
 welf+0.051
Neg logit contributions
horn-0.060
anon-0.059
-0.059
scape-0.057
ocal-0.056
Token co
Token ID763
Feature activation+1.11e-01

Pos logit contributions
horn+0.026
anon+0.026
+0.026
scape+0.025
ocal+0.025
Neg logit contributions
 rapp-0.024
 hospitality-0.024
hiba-0.023
 defe-0.022
 welf-0.022
Token-
Token ID12
Feature activation+1.23e-02

Pos logit contributions
 rapp+0.225
 hospitality+0.224
hiba+0.223
 defe+0.215
 welf+0.214
Neg logit contributions
horn-0.170
anon-0.169
-0.167
scape-0.160
ocal-0.157
Tokenfounder
Token ID15454
Feature activation-5.27e-01

Pos logit contributions
 rapp+0.029
 hospitality+0.029
hiba+0.029
 defe+0.028
 welf+0.027
Neg logit contributions
horn-0.015
anon-0.015
-0.015
scape-0.014
ocal-0.014
Token and
Token ID290
Feature activation-3.32e-01

Pos logit contributions
horn+0.891
anon+0.879
+0.877
scape+0.845
ocal+0.829
Neg logit contributions
hiba-0.992
 rapp-0.989
 hospitality-0.979
 defe-0.943
 welf-0.939
Token former
Token ID1966
Feature activation-3.03e-01

Pos logit contributions
horn+0.588
anon+0.585
+0.579
scape+0.561
ocal+0.551
Neg logit contributions
 rapp-0.598
 hospitality-0.593
hiba-0.592
 defe-0.566
 welf-0.562
Token chief
Token ID4039
Feature activation-3.39e-01

Pos logit contributions
horn+0.538
anon+0.535
+0.529
scape+0.513
ocal+0.503
Neg logit contributions
 rapp-0.543
hiba-0.539
 hospitality-0.537
 defe-0.514
 welf-0.510
Token executive
Token ID4640
Feature activation-4.97e-01

Pos logit contributions
horn+0.583
anon+0.580
+0.573
scape+0.554
ocal+0.545
Neg logit contributions
 rapp-0.627
 hospitality-0.622
hiba-0.620
 defe-0.594
 welf-0.590
Token of
Token ID286
Feature activation-3.61e-01

Pos logit contributions
horn+0.816
anon+0.809
+0.800
scape+0.778
ocal+0.755
Neg logit contributions
 rapp-0.958
hiba-0.957
 hospitality-0.945
 defe-0.909
 welf-0.905
Token Hy
Token ID6707
Feature activation+4.42e-01

Pos logit contributions
horn+0.654
anon+0.647
+0.646
scape+0.624
ocal+0.614
Neg logit contributions
hiba-0.632
 rapp-0.631
 hospitality-0.618
 defe-0.597
 welf-0.596
Tokenatt
Token ID1078
Feature activation-4.46e-01

Pos logit contributions
 rapp+0.699
hiba+0.694
 hospitality+0.691
 defe+0.652
azine+0.652
Neg logit contributions
anon-0.873
horn-0.870
-0.867
scape-0.834
ocal-0.822
Token answer
Token ID3280
Feature activation+4.39e-01

Pos logit contributions
 rapp+0.178
 hospitality+0.176
hiba+0.175
 defe+0.167
 welf+0.165
Neg logit contributions
horn-0.245
anon-0.244
-0.242
scape-0.235
ocal-0.232
Token?
Token ID30
Feature activation-6.52e-03

Pos logit contributions
 rapp+0.894
 hospitality+0.886
hiba+0.884
 defe+0.851
 welf+0.844
Neg logit contributions
horn-0.673
anon-0.670
-0.660
scape-0.637
ocal-0.624
Token
Token ID198
Feature activation-7.45e-02

Pos logit contributions
horn+0.011
anon+0.011
+0.011
scape+0.011
ocal+0.010
Neg logit contributions
 rapp-0.012
 hospitality-0.012
hiba-0.012
 defe-0.012
 welf-0.011
Token
Token ID198
Feature activation-4.01e-02

Pos logit contributions
+0.131
horn+0.129
anon+0.129
ocal+0.124
scape+0.123
Neg logit contributions
 rapp-0.132
 hospitality-0.131
hiba-0.131
azine-0.125
 defe-0.123
TokenOl
Token ID30098
Feature activation+3.02e-01

Pos logit contributions
horn+0.086
anon+0.085
+0.084
scape+0.082
ocal+0.081
Neg logit contributions
 rapp-0.059
 hospitality-0.058
hiba-0.057
 defe-0.055
 welf-0.054
Tokenaf
Token ID1878
Feature activation-6.98e-01

Pos logit contributions
hiba+0.386
 hospitality+0.377
 rapp+0.375
azine+0.355
 defe+0.347
Neg logit contributions
-0.689
anon-0.680
horn-0.678
scape-0.659
OWN-0.652
Token :
Token ID1058
Feature activation-4.99e-01

Pos logit contributions
horn+1.253
anon+1.246
+1.235
scape+1.195
ocal+1.175
Neg logit contributions
 rapp-1.239
 hospitality-1.228
hiba-1.225
 defe-1.170
 welf-1.159
Token The
Token ID383
Feature activation+3.16e-01

Pos logit contributions
horn+0.896
anon+0.892
+0.881
scape+0.854
ocal+0.838
Neg logit contributions
 rapp-0.889
hiba-0.881
 hospitality-0.881
 defe-0.842
 welf-0.837
Token answer
Token ID3280
Feature activation+5.98e-01

Pos logit contributions
 rapp+0.491
 hospitality+0.487
hiba+0.484
 defe+0.460
 welf+0.456
Neg logit contributions
horn-0.639
anon-0.636
-0.631
scape-0.614
ocal-0.604
Token is
Token ID318
Feature activation+3.50e-01

Pos logit contributions
 rapp+1.141
 hospitality+1.129
hiba+1.125
 defe+1.081
 welf+1.072
Neg logit contributions
horn-0.995
anon-0.989
-0.979
scape-0.946
ocal-0.928
Token perfect
Token ID2818
Feature activation-1.18e+00

Pos logit contributions
 rapp+0.587
 hospitality+0.582
hiba+0.575
 defe+0.552
 welf+0.545
Neg logit contributions
horn-0.664
anon-0.661
-0.656
scape-0.635
ocal-0.626
Token
Token ID247
Feature activation+1.08e-02

Pos logit contributions
horn+0.422
anon+0.421
scape+0.393
+0.393
ocal+0.375
Neg logit contributions
 rapp-0.957
 hospitality-0.954
hiba-0.950
 defe-0.922
azine-0.913
Tokens
Token ID82
Feature activation-2.02e-01

Pos logit contributions
 rapp+0.026
hiba+0.026
 hospitality+0.026
 defe+0.025
 welf+0.025
Neg logit contributions
horn-0.013
anon-0.013
-0.013
scape-0.012
ocal-0.012
Token about
Token ID546
Feature activation-2.71e-01

Pos logit contributions
horn+0.349
anon+0.346
+0.344
scape+0.334
ocal+0.328
Neg logit contributions
 rapp-0.371
hiba-0.369
 hospitality-0.366
 defe-0.350
azine-0.347
Token being
Token ID852
Feature activation-2.70e-01

Pos logit contributions
horn+0.525
anon+0.523
+0.521
scape+0.505
ocal+0.498
Neg logit contributions
 rapp-0.438
hiba-0.438
 hospitality-0.431
 defe-0.409
azine-0.407
Token
Token ID564
Feature activation-1.64e-01

Pos logit contributions
horn+0.497
anon+0.495
+0.490
scape+0.475
ocal+0.467
Neg logit contributions
 rapp-0.468
 hospitality-0.464
hiba-0.463
 defe-0.442
 welf-0.439
Token
Token ID246
Feature activation-2.93e-01

Pos logit contributions
horn+0.151
anon+0.150
+0.146
scape+0.138
ocal+0.133
Neg logit contributions
 rapp-0.433
 hospitality-0.431
hiba-0.430
 defe-0.418
 welf-0.415
Tokenbetter
Token ID27903
Feature activation-7.11e-01

Pos logit contributions
horn+0.457
anon+0.455
+0.449
scape+0.433
ocal+0.424
Neg logit contributions
 rapp-0.589
 hospitality-0.585
hiba-0.584
 defe-0.561
 welf-0.558
Token
Token ID447
Feature activation+5.43e-03

Pos logit contributions
horn+1.351
anon+1.341
+1.333
scape+1.297
ocal+1.273
Neg logit contributions
hiba-1.188
 rapp-1.188
 hospitality-1.169
 defe-1.113
 welf-1.105
Token
Token ID247
Feature activation+3.08e-01

Pos logit contributions
 rapp+0.014
 hospitality+0.014
hiba+0.014
 defe+0.013
 welf+0.013
Neg logit contributions
horn-0.006
anon-0.006
-0.005
scape-0.005
ocal-0.005
Token than
Token ID621
Feature activation-1.29e-01

Pos logit contributions
 rapp+0.566
 hospitality+0.563
hiba+0.559
 defe+0.537
 welf+0.534
Neg logit contributions
horn-0.532
anon-0.530
-0.523
scape-0.505
ocal-0.497
Token everyone
Token ID2506
Feature activation-6.71e-01

Pos logit contributions
horn+0.238
anon+0.237
+0.235
scape+0.228
ocal+0.223
Neg logit contributions
 rapp-0.223
hiba-0.221
 hospitality-0.220
 defe-0.210
 welf-0.208
Token in
Token ID287
Feature activation+7.83e-02

Pos logit contributions
 rapp+0.132
 hospitality+0.130
hiba+0.130
 defe+0.125
 welf+0.124
Neg logit contributions
horn-0.120
anon-0.120
-0.118
scape-0.114
ocal-0.112
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
 rapp+0.142
 hospitality+0.141
hiba+0.141
 defe+0.135
 welf+0.134
Neg logit contributions
horn-0.137
anon-0.137
-0.135
scape-0.131
ocal-0.129
Token proced
Token ID5131
Feature activation-1.23e+00

Pos logit contributions
horn+0.214
anon+0.213
+0.211
scape+0.204
ocal+0.201
Neg logit contributions
 rapp-0.187
 hospitality-0.185
hiba-0.184
 defe-0.176
 welf-0.175
Tokenurally
Token ID20221
Feature activation-2.07e+00

Pos logit contributions
horn+1.578
anon+1.566
+1.551
scape+1.477
ocal+1.435
Neg logit contributions
 rapp-2.798
hiba-2.770
 hospitality-2.769
 defe-2.665
azine-2.641
Token generated
Token ID7560
Feature activation-1.63e+00

Pos logit contributions
horn+3.001
+2.957
anon+2.957
scape+2.844
ocal+2.759
Neg logit contributions
hiba-4.337
 hospitality-4.333
 rapp-4.329
 welf-4.175
azine-4.158
Token city
Token ID1748
Feature activation-9.40e-01

Pos logit contributions
anon+2.987
horn+2.981
+2.949
scape+2.911
ocal+2.782
Neg logit contributions
hiba-2.803
 rapp-2.774
 hospitality-2.732
 welf-2.635
 defe-2.632
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
horn+1.523
anon+1.514
+1.494
scape+1.474
ocal+1.412
Neg logit contributions
 rapp-1.826
hiba-1.823
 hospitality-1.812
 defe-1.738
 welf-1.729
Token This
Token ID770
Feature activation-3.44e-01

Pos logit contributions
horn+0.451
anon+0.449
+0.444
scape+0.430
ocal+0.424
Neg logit contributions
 rapp-0.447
 hospitality-0.443
hiba-0.441
 defe-0.422
 welf-0.419
Token series
Token ID2168
Feature activation-3.24e-02

Pos logit contributions
horn+0.631
anon+0.628
+0.621
scape+0.604
ocal+0.592
Neg logit contributions
 rapp-0.596
hiba-0.590
 hospitality-0.590
 defe-0.563
 welf-0.559
Token of
Token ID286
Feature activation-3.10e-01

Pos logit contributions
horn+0.055
anon+0.055
+0.054
scape+0.052
ocal+0.051
Neg logit contributions
 rapp-0.061
 hospitality-0.060
hiba-0.060
 defe-0.058
 welf-0.057
Token blog
Token ID4130
Feature activation-1.19e+00

Pos logit contributions
horn+0.576
anon+0.573
+0.566
scape+0.549
ocal+0.541
Neg logit contributions
 rapp-0.532
 hospitality-0.527
hiba-0.526
 defe-0.502
 welf-0.498
Token enable
Token ID7139
Feature activation-8.64e-02

Pos logit contributions
 rapp+0.365
hiba+0.362
 hospitality+0.362
 defe+0.343
 welf+0.341
Neg logit contributions
horn-0.446
anon-0.445
-0.440
scape-0.428
ocal-0.421
Token these
Token ID777
Feature activation-2.16e-01

Pos logit contributions
horn+0.165
anon+0.165
+0.163
scape+0.158
ocal+0.156
Neg logit contributions
 rapp-0.143
hiba-0.142
 hospitality-0.141
 defe-0.134
 welf-0.133
Token techniques
Token ID7605
Feature activation+3.33e-01

Pos logit contributions
horn+0.397
anon+0.395
+0.391
scape+0.379
ocal+0.373
Neg logit contributions
 rapp-0.375
 hospitality-0.371
hiba-0.371
 defe-0.354
 welf-0.351
Token.
Token ID13
Feature activation+4.92e-03

Pos logit contributions
 rapp+0.629
 hospitality+0.623
hiba+0.621
 defe+0.596
 welf+0.592
Neg logit contributions
horn-0.562
anon-0.560
-0.553
scape-0.534
ocal-0.524
Token Even
Token ID3412
Feature activation+3.30e-01

Pos logit contributions
 rapp+0.009
 hospitality+0.009
hiba+0.009
 defe+0.009
 welf+0.008
Neg logit contributions
horn-0.009
anon-0.009
-0.008
scape-0.008
ocal-0.008
Token without
Token ID1231
Feature activation-7.96e-01

Pos logit contributions
 rapp+0.562
 hospitality+0.557
hiba+0.553
 defe+0.530
 welf+0.526
Neg logit contributions
horn-0.617
anon-0.616
-0.609
scape-0.589
ocal-0.581
Token them
Token ID606
Feature activation+6.00e-01

Pos logit contributions
horn+1.496
anon+1.489
+1.473
scape+1.434
ocal+1.408
Neg logit contributions
hiba-1.331
 rapp-1.327
 hospitality-1.316
 defe-1.249
azine-1.240
Token I
Token ID314
Feature activation+3.47e-02

Pos logit contributions
 rapp+1.077
 hospitality+1.067
hiba+1.064
 defe+1.019
 welf+1.012
Neg logit contributions
horn-1.064
anon-1.061
-1.048
scape-1.015
ocal-0.997
Token believe
Token ID1975
Feature activation+7.28e-01

Pos logit contributions
 rapp+0.057
 hospitality+0.057
hiba+0.057
 defe+0.054
 welf+0.054
Neg logit contributions
horn-0.066
anon-0.066
-0.065
scape-0.064
ocal-0.063
Token we
Token ID356
Feature activation-1.51e-01

Pos logit contributions
 rapp+1.192
 hospitality+1.181
hiba+1.171
 defe+1.122
 welf+1.112
Neg logit contributions
horn-1.412
anon-1.408
-1.389
scape-1.351
ocal-1.327
Token
Token ID447
Feature activation+4.76e-01

Pos logit contributions
horn+0.252
anon+0.251
+0.248
scape+0.240
ocal+0.235
Neg logit contributions
 rapp-0.287
 hospitality-0.285
hiba-0.284
 defe-0.272
 welf-0.271
Token to
Token ID284
Feature activation+2.16e-01

Pos logit contributions
 rapp+1.867
hiba+1.853
 hospitality+1.850
 defe+1.767
azine+1.745
Neg logit contributions
anon-1.751
horn-1.743
-1.722
scape-1.665
ocal-1.632
Token build
Token ID1382
Feature activation+3.65e-01

Pos logit contributions
 rapp+0.342
 hospitality+0.339
hiba+0.338
 defe+0.322
 welf+0.319
Neg logit contributions
horn-0.431
anon-0.429
-0.425
scape-0.413
ocal-0.406
Token exposure
Token ID7111
Feature activation+1.20e-01

Pos logit contributions
 rapp+0.595
 hospitality+0.589
hiba+0.588
 defe+0.560
 welf+0.556
Neg logit contributions
horn-0.709
anon-0.706
-0.698
scape-0.679
ocal-0.668
Token to
Token ID284
Feature activation+1.13e-01

Pos logit contributions
 rapp+0.226
hiba+0.225
 hospitality+0.224
 defe+0.215
 welf+0.214
Neg logit contributions
horn-0.204
anon-0.202
-0.200
scape-0.194
ocal-0.191
Token their
Token ID511
Feature activation-6.11e-02

Pos logit contributions
 rapp+0.184
 hospitality+0.183
hiba+0.182
 defe+0.173
 welf+0.172
Neg logit contributions
horn-0.219
anon-0.218
-0.216
scape-0.210
ocal-0.206
Token wa
Token ID2082
Feature activation-1.74e+00

Pos logit contributions
horn+0.130
anon+0.129
+0.128
scape+0.125
ocal+0.123
Neg logit contributions
 rapp-0.088
hiba-0.087
 hospitality-0.087
 defe-0.082
 welf-0.082
Tokenres
Token ID411
Feature activation+4.11e-01

Pos logit contributions
horn+3.707
anon+3.666
+3.639
scape+3.554
ocal+3.531
Neg logit contributions
hiba-2.472
 hospitality-2.460
 rapp-2.460
 defe-2.302
 welf-2.292
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
 rapp+0.808
 hospitality+0.802
hiba+0.801
 defe+0.769
 welf+0.764
Neg logit contributions
horn-0.661
anon-0.658
-0.650
scape-0.627
ocal-0.615
Token said
Token ID531
Feature activation+1.40e-01

Pos logit contributions
 rapp+0.222
 hospitality+0.220
hiba+0.219
 defe+0.210
 welf+0.208
Neg logit contributions
horn-0.234
anon-0.233
-0.230
scape-0.223
ocal-0.219
Token Jere
Token ID10272
Feature activation+1.20e-01

Pos logit contributions
 rapp+0.248
 hospitality+0.246
hiba+0.246
 defe+0.235
 welf+0.233
Neg logit contributions
horn-0.253
anon-0.252
-0.249
scape-0.242
ocal-0.237
Tokenmie
Token ID44871
Feature activation-2.10e-01

Pos logit contributions
 rapp+0.274
hiba+0.272
 hospitality+0.271
 defe+0.261
 welf+0.260
Neg logit contributions
horn-0.154
anon-0.153
-0.151
scape-0.144
ocal-0.141
Token
Token ID198
Feature activation+2.78e-01

Pos logit contributions
 rapp+0.110
 hospitality+0.109
hiba+0.109
 defe+0.104
 welf+0.103
Neg logit contributions
horn-0.105
anon-0.105
-0.104
scape-0.100
ocal-0.099
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
 rapp+0.481
 hospitality+0.478
hiba+0.478
azine+0.456
 defe+0.447
Neg logit contributions
-0.498
horn-0.489
anon-0.487
ocal-0.469
inding-0.466
TokenMy
Token ID3666
Feature activation+4.78e-01

Pos logit contributions
 rapp+0.397
 hospitality+0.394
hiba+0.394
 defe+0.376
 welf+0.373
Neg logit contributions
horn-0.383
anon-0.381
-0.379
scape-0.365
ocal-0.360
Token sample
Token ID6291
Feature activation-3.70e-02

Pos logit contributions
 rapp+0.722
 hospitality+0.716
hiba+0.709
 defe+0.676
 welf+0.669
Neg logit contributions
horn-0.983
anon-0.980
-0.971
scape-0.946
ocal-0.931
Token of
Token ID286
Feature activation-6.28e-03

Pos logit contributions
horn+0.066
anon+0.066
+0.065
scape+0.063
ocal+0.062
Neg logit contributions
 rapp-0.066
hiba-0.065
 hospitality-0.065
 defe-0.062
 welf-0.062
Token Got
Token ID11853
Feature activation-1.60e+00

Pos logit contributions
horn+0.013
anon+0.013
+0.012
scape+0.012
ocal+0.012
Neg logit contributions
 rapp-0.010
hiba-0.010
 hospitality-0.010
 defe-0.009
 welf-0.009
Tokenlands
Token ID4447
Feature activation-1.33e-01

Pos logit contributions
horn+2.150
anon+2.132
+2.075
scape+2.009
ocal+1.966
Neg logit contributions
 rapp-3.568
hiba-3.539
 hospitality-3.532
 defe-3.431
 welf-3.399
Tokend
Token ID67
Feature activation+6.96e-01

Pos logit contributions
horn+0.189
anon+0.186
+0.183
scape+0.177
ocal+0.171
Neg logit contributions
 rapp-0.287
hiba-0.287
 hospitality-0.284
 defe-0.275
 welf-0.274
Tokenrick
Token ID5557
Feature activation+5.46e-01

Pos logit contributions
 rapp+1.270
hiba+1.246
 hospitality+1.226
azine+1.197
hler+1.172
Neg logit contributions
-1.235
anon-1.185
horn-1.159
scape-1.133
ocal-1.120
Tokena
Token ID64
Feature activation+5.27e-01

Pos logit contributions
 rapp+0.951
 hospitality+0.950
hiba+0.941
azine+0.906
hler+0.872
Neg logit contributions
-0.980
horn-0.958
anon-0.952
scape-0.924
ocal-0.917
Token
Token ID198
Feature activation+7.96e-03

Pos logit contributions
 rapp+0.960
 hospitality+0.953
hiba+0.943
 defe+0.910
 welf+0.900
Neg logit contributions
horn-0.921
anon-0.921
-0.915
scape-0.879
ocal-0.866
Token)
Token ID8
Feature activation+2.53e-01

Pos logit contributions
 rapp+1.935
hiba+1.902
 hospitality+1.901
 defe+1.805
azine+1.803
Neg logit contributions
horn-2.119
anon-2.100
-2.080
scape-2.012
ocal-1.967
Token walks
Token ID11114
Feature activation+5.74e-02

Pos logit contributions
 rapp+0.442
 hospitality+0.438
hiba+0.437
 defe+0.418
 welf+0.414
Neg logit contributions
horn-0.463
anon-0.461
-0.457
scape-0.442
ocal-0.435
Token with
Token ID351
Feature activation-5.70e-01

Pos logit contributions
 rapp+0.109
 hospitality+0.108
hiba+0.108
 defe+0.103
 welf+0.102
Neg logit contributions
horn-0.096
anon-0.096
-0.095
scape-0.091
ocal-0.090
Token Cuba
Token ID14159
Feature activation+2.27e-01

Pos logit contributions
horn+1.017
anon+1.010
+1.003
scape+0.969
ocal+0.953
Neg logit contributions
 rapp-1.008
hiba-1.004
 hospitality-0.995
 defe-0.960
 welf-0.953
Token's
Token ID338
Feature activation-8.88e-02

Pos logit contributions
 rapp+0.483
 hospitality+0.483
hiba+0.476
 defe+0.460
 welf+0.458
Neg logit contributions
horn-0.326
anon-0.325
-0.321
scape-0.305
ocal-0.298
Token Minister
Token ID4139
Feature activation-1.99e+00

Pos logit contributions
horn+0.150
anon+0.149
+0.148
scape+0.143
ocal+0.140
Neg logit contributions
 rapp-0.167
 hospitality-0.166
hiba-0.165
 defe-0.159
 welf-0.158
Token of
Token ID286
Feature activation-3.76e-01

Pos logit contributions
horn+3.034
anon+2.993
+2.986
ocal+2.824
scape+2.778
Neg logit contributions
hiba-3.983
 rapp-3.968
 hospitality-3.918
 welf-3.890
 defe-3.864
Token Higher
Token ID16038
Feature activation-2.70e-01

Pos logit contributions
horn+0.588
anon+0.585
+0.579
scape+0.558
ocal+0.548
Neg logit contributions
 rapp-0.755
hiba-0.749
 hospitality-0.746
 defe-0.721
 welf-0.716
Token Education
Token ID7868
Feature activation-3.54e-01

Pos logit contributions
horn+0.450
anon+0.447
+0.442
scape+0.428
ocal+0.419
Neg logit contributions
 rapp-0.516
hiba-0.511
 hospitality-0.511
 defe-0.491
 welf-0.487
Token Rod
Token ID6882
Feature activation+1.56e+00

Pos logit contributions
horn+0.646
anon+0.643
+0.636
scape+0.616
ocal+0.606
Neg logit contributions
 rapp-0.620
 hospitality-0.614
hiba-0.613
 defe-0.585
 welf-0.581
Tokenolf
Token ID4024
Feature activation+2.51e-01

Pos logit contributions
hiba+2.579
 rapp+2.571
 hospitality+2.551
azine+2.382
irlf+2.346
Neg logit contributions
horn-2.902
-2.890
anon-2.829
scape-2.794
OWN-2.718
Token change
Token ID1487
Feature activation-1.96e-01

Pos logit contributions
 rapp+1.118
 hospitality+1.111
hiba+1.091
 defe+1.052
 welf+1.036
Neg logit contributions
horn-1.289
anon-1.285
-1.266
scape-1.221
ocal-1.205
Token in
Token ID287
Feature activation-2.12e-01

Pos logit contributions
horn+0.339
anon+0.337
+0.334
scape+0.323
ocal+0.318
Neg logit contributions
 rapp-0.362
 hospitality-0.359
hiba-0.358
 defe-0.343
 welf-0.341
Token ton
Token ID5680
Feature activation-3.10e-01

Pos logit contributions
horn+0.394
anon+0.391
+0.387
scape+0.376
ocal+0.369
Neg logit contributions
 rapp-0.362
 hospitality-0.359
hiba-0.358
 defe-0.342
 welf-0.339
Tokenal
Token ID282
Feature activation-3.56e-01

Pos logit contributions
horn+0.388
anon+0.385
+0.379
scape+0.362
ocal+0.353
Neg logit contributions
 rapp-0.720
 hospitality-0.715
hiba-0.714
 defe-0.690
 welf-0.686
Token/
Token ID14
Feature activation-3.35e-01

Pos logit contributions
horn+0.661
anon+0.658
+0.649
scape+0.630
ocal+0.620
Neg logit contributions
 rapp-0.611
 hospitality-0.605
hiba-0.603
 defe-0.576
 welf-0.572
Tokenspect
Token ID4443
Feature activation-1.68e+00

Pos logit contributions
horn+0.498
anon+0.497
+0.493
scape+0.471
ocal+0.460
Neg logit contributions
 rapp-0.692
hiba-0.688
 hospitality-0.687
 defe-0.658
 welf-0.655
Tokenral
Token ID1373
Feature activation-5.51e-01

Pos logit contributions
anon+1.682
horn+1.671
+1.635
scape+1.553
ocal+1.502
Neg logit contributions
 rapp-4.341
 hospitality-4.322
hiba-4.307
 defe-4.197
 welf-4.183
Token response
Token ID2882
Feature activation+9.76e-01

Pos logit contributions
horn+1.073
anon+1.068
+1.056
scape+1.024
ocal+1.009
Neg logit contributions
 rapp-0.896
 hospitality-0.888
hiba-0.885
 defe-0.843
 welf-0.837
Token at
Token ID379
Feature activation+4.51e-01

Pos logit contributions
 rapp+1.687
 hospitality+1.672
hiba+1.668
 defe+1.593
 welf+1.561
Neg logit contributions
horn-1.781
anon-1.775
-1.756
scape-1.680
OWN-1.662
Token higher
Token ID2440
Feature activation+4.29e-01

Pos logit contributions
 rapp+0.749
 hospitality+0.743
hiba+0.740
 defe+0.703
 welf+0.701
Neg logit contributions
horn-0.865
anon-0.856
-0.847
scape-0.822
ocal-0.807
Token sensit
Token ID44752
Feature activation-1.20e-01

Pos logit contributions
 rapp+0.792
 hospitality+0.787
hiba+0.783
 defe+0.751
 welf+0.745
Neg logit contributions
horn-0.742
anon-0.733
-0.727
scape-0.702
ocal-0.687
Token later
Token ID1568
Feature activation+2.29e-01

Pos logit contributions
 rapp+0.257
 hospitality+0.254
hiba+0.252
 defe+0.241
 welf+0.239
Neg logit contributions
horn-0.290
anon-0.290
-0.286
scape-0.278
ocal-0.274
Token,
Token ID11
Feature activation-3.09e-01

Pos logit contributions
 rapp+0.410
 hospitality+0.407
hiba+0.405
 defe+0.388
 welf+0.385
Neg logit contributions
horn-0.407
anon-0.406
-0.400
scape-0.388
ocal-0.381
Token in
Token ID287
Feature activation-3.58e-01

Pos logit contributions
horn+0.561
anon+0.559
+0.553
scape+0.536
ocal+0.526
Neg logit contributions
 rapp-0.542
 hospitality-0.538
hiba-0.535
 defe-0.512
 welf-0.508
Token 1976
Token ID15408
Feature activation-8.79e-01

Pos logit contributions
horn+0.605
anon+0.601
+0.595
scape+0.575
ocal+0.565
Neg logit contributions
 rapp-0.675
 hospitality-0.670
hiba-0.666
 defe-0.640
 welf-0.635
Token,
Token ID11
Feature activation-4.64e-01

Pos logit contributions
horn+1.473
anon+1.461
+1.454
scape+1.404
ocal+1.383
Neg logit contributions
 rapp-1.657
hiba-1.648
 hospitality-1.639
 defe-1.580
 welf-1.567
Token UNESCO
Token ID49372
Feature activation-1.73e+00

Pos logit contributions
horn+0.847
anon+0.843
+0.835
scape+0.810
ocal+0.797
Neg logit contributions
 rapp-0.810
hiba-0.802
 hospitality-0.802
 defe-0.765
 welf-0.760
Token (
Token ID357
Feature activation-7.45e-01

Pos logit contributions
horn+3.003
anon+2.990
+2.950
scape+2.860
ocal+2.825
Neg logit contributions
hiba-3.162
 rapp-3.148
 hospitality-3.124
 welf-2.997
 defe-2.979
Tokenthe
Token ID1169
Feature activation-1.48e-03

Pos logit contributions
horn+1.207
anon+1.202
+1.184
scape+1.146
ocal+1.126
Neg logit contributions
 rapp-1.458
 hospitality-1.445
hiba-1.445
 defe-1.385
 welf-1.378
Token United
Token ID1578
Feature activation+6.96e-01

Pos logit contributions
horn+0.003
anon+0.003
+0.003
scape+0.003
ocal+0.003
Neg logit contributions
 rapp-0.002
 hospitality-0.002
hiba-0.002
 defe-0.002
 welf-0.002
Token Nations
Token ID7973
Feature activation-2.98e-01

Pos logit contributions
 rapp+0.510
 hospitality+0.501
hiba+0.491
 defe+0.437
 welf+0.435
Neg logit contributions
anon-1.970
horn-1.969
-1.943
scape-1.913
ocal-1.884
Token Educational
Token ID30038
Feature activation-1.10e-01

Pos logit contributions
horn+0.540
anon+0.539
+0.531
scape+0.513
ocal+0.503
Neg logit contributions
 rapp-0.524
 hospitality-0.521
hiba-0.514
 defe-0.495
 welf-0.493