TOP ACTIVATIONS
MAX = 14.425

to be perceived as such socially ( including the use of
« а
online supporters , using the nickname is now commonplace . The
wishes to transition their gender socially and / or physically ,
to be perceived as such socially ( including the use of
н н ы « и н к
. Ted Cruz speaks to potential supporters
2017 Pay attention Ted , they re
, Stephen P aea , Ricky Jean Francois and Zig gy
head le 2014 Ricky G erv ais
a Democrat . Sen . Eddie Luc io , a Browns
iv ica released its Sp yd us 9 platform , a
with Republican nominee Donald Trump Donald John Trump House committee believes
top - hat Eddie because of the
McCain and rejected presidential nominee Ted Cruz . Imagine Lindsey Graham
Alt - Right Content on Social Media One of
of leadership , Skip Pr ich ard took the reins of
Germany 's leading news magazine Der Spiegel said in a strongly
It 's paired with " Del iver ance " on Aug
, royalty free <|endoftext|> J ür gen Klopp believes Liverpool would

MIN ACTIVATIONS
MIN = -14.554

are complete , whole , perfect and awakened . <|endoftext|> She
and art are often closely inter ming led as new values
's reck lessness is a perfect match for Stephen 's self
arrow Green Arrow was the perfect safety net to protect The
a time . Perfect Trade School For Veterans
unk continued to seem both vital and volatile for [ so
Instagram following the incident : Perfect @ E aster Sunday .
Direct , structured communication is vital to building a strong ,
Ch iff re ( Peter Lor re ) does something to
James Comey affair is the perfect example . On substance ,
to those who identify as inter sex , third - gender
its multiple component boxes , inter connect s , cables ,
The Perfect Gentleman , 1860 .
film . The ride is perfect for a vigilante who wants
Ferdinand 's deep cross , Lor ik Can a sent another
and Strategy Justin Reid was rapt Mc Govern had committed to
Mod er ator Elaine Qu ij ano of CBS let
housing at D IT , Lor can Sir r , says
distracted way , like an inter min able stage whisper (
, making it both a perfect fit for portable play ,

INTERVAL 7.180 - 14.425
CONTAINS 0.512%

several forums Monday while in Iowa and has three more forums
Click here to subscribe via St itcher
new political odd couple : Del . Todd C . Gilbert
of minutes . Once it became clear that the situation was
we take these steps that could have unforeseen consequences we do

INTERVAL -0.065 - 7.180
CONTAINS 51.129%

make it easy even for ordinary users to install and remove
such clauses . He would continue :
b / w Bul bs of Passion
as many as 800 , 000 people who were brought across
[/ b ] 00 : 40 : 00 [ b ]

INTERVAL -7.310 - -0.065
CONTAINS 47.747%

oon crisis a phrase I still can
wisdom says . Dinosaur bones with Carbon - 14 dates in
t umbled down . With her body in such poor shape
, with 47 % support to 47 % opposed . You
first time , he genuinely believed he was going to die

INTERVAL -14.554 - -7.310
CONTAINS 0.612%

ash in Bob Kelley - Val Kil mer Song - B
Fresh man first team ( along with Jones and Wins low
entrepreneurial risk . Last year , John Marshall ,
. The idea of a cohesive Palestinian state was supposed to
et "> Optional : visually pleasing placeholder p >
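
NOTE ON THE INTERVALS

The four INTERVAL ranges above appear to be equal-width quarters of the span between MIN and MAX, each about 7.245 wide, and the CONTAINS percentages add up to 100%. The sketch below is a minimal check of that reading, not the dashboard's own code. It assumes equal-width binning, and the function name `equal_width_edges` is hypothetical. The computed boundaries agree with the listed ones to within about 0.001, which is consistent with MIN and MAX themselves being rounded to three decimals.

```python
def equal_width_edges(lo, hi, bins=4):
    """Return bins + 1 equally spaced boundaries from lo to hi (assumed binning)."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]


# MIN and MAX as reported in the dump above.
edges = equal_width_edges(-14.554, 14.425)
print([round(e, 3) for e in edges])

# Shares listed under CONTAINS, highest interval first.
shares = [0.512, 51.129, 47.747, 0.612]
print(round(sum(shares), 3))
```

If the binning were quantile-based instead of equal-width, the four shares would be roughly equal. Here about 99% of activations fall in the two middle intervals, which fits the equal-width reading.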
Token to
Token ID284
Feature activation-3.61e-01

Pos logit contributions
adra+2.441
actic+2.339
ement+2.183
inion+2.132
atic+2.059
Neg logit contributions
OLOG-4.363
OLOGY-3.912
indal-3.730
��-3.676
ologies-3.653
Token be
Token ID307
Feature activation+9.73e-01

Pos logit contributions
adra+0.252
actic+0.243
ement+0.229
inion+0.225
atic+0.218
Neg logit contributions
OLOG-0.343
OLOGY-0.304
indal-0.288
��-0.283
ologies-0.282
Token perceived
Token ID11067
Feature activation-1.27e+00

Pos logit contributions
OLOG+0.881
OLOGY+0.773
indal+0.728
��+0.718
ologies+0.714
Neg logit contributions
adra-0.723
actic-0.699
ement-0.659
inion-0.650
atic-0.630
Token as
Token ID355
Feature activation-2.24e+00

Pos logit contributions
adra+0.866
actic+0.830
ement+0.779
inion+0.768
atic+0.736
Neg logit contributions
OLOG-1.241
OLOGY-1.100
indal-1.041
��-1.024
ologies-1.022
Token such
Token ID884
Feature activation-3.23e-01

Pos logit contributions
adra+1.627
actic+1.572
ement+1.492
inion+1.456
atic+1.432
Neg logit contributions
OLOG-2.057
OLOGY-1.814
indal-1.716
��-1.686
ologies-1.666
Token socially
Token ID18118
Feature activation+1.44e+01

Pos logit contributions
adra+0.225
actic+0.217
ement+0.205
inion+0.201
atic+0.196
Neg logit contributions
OLOG-0.307
OLOGY-0.272
indal-0.258
��-0.254
ologies-0.251
Token (
Token ID357
Feature activation+2.26e+00

Pos logit contributions
OLOG+10.138
OLOGY+8.647
ologies+8.216
IGH+7.903
indal+7.874
Neg logit contributions
adra-11.634
actic-10.667
inion-10.255
ement-10.084
anmar-10.068
Tokenincluding
Token ID8201
Feature activation-2.26e+00

Pos logit contributions
OLOG+1.560
OLOGY+1.315
indal+1.212
GH+1.190
��+1.186
Neg logit contributions
adra-2.150
actic-2.088
ement-2.004
inion-1.984
atic-1.931
Token the
Token ID262
Feature activation-1.41e+00

Pos logit contributions
adra+1.513
actic+1.455
ement+1.369
inion+1.344
atic+1.301
Neg logit contributions
OLOG-2.217
OLOGY-1.970
indal-1.869
��-1.837
ologies-1.830
Token use
Token ID779
Feature activation+1.44e-02

Pos logit contributions
adra+0.932
actic+0.895
ement+0.840
inion+0.827
atic+0.799
Neg logit contributions
OLOG-1.399
OLOGY-1.244
indal-1.179
��-1.159
ologies-1.158
Token of
Token ID286
Feature activation+8.40e-01

Pos logit contributions
OLOG+0.014
OLOGY+0.013
indal+0.012
��+0.012
ologies+0.012
Neg logit contributions
adra-0.009
actic-0.009
ement-0.009
inion-0.008
atic-0.008
Token
Token ID100
Feature activation-5.40e-01

Pos logit contributions
OLOG+0.928
OLOGY+0.798
indal+0.745
��+0.730
ologies+0.724
Neg logit contributions
adra-1.030
actic-1.000
ement-0.955
inion-0.941
atic-0.919
Token
Token ID140
Feature activation+3.28e+00

Pos logit contributions
adra+0.485
actic+0.473
ement+0.454
inion+0.446
atic+0.438
Neg logit contributions
OLOG-0.403
OLOGY-0.345
indal-0.323
��-0.315
ologies-0.312
Token
Token ID240
Feature activation+4.08e+00

Pos logit contributions
OLOG+2.528
OLOGY+2.167
��+2.013
indal+1.994
GH+1.946
Neg logit contributions
adra-2.849
actic-2.767
ement-2.670
inion-2.610
atic-2.544
Token
Token ID140
Feature activation+4.48e+00

Pos logit contributions
OLOG+3.343
OLOGY+2.891
indal+2.670
GH+2.636
��+2.612
Neg logit contributions
adra-3.378
actic-3.258
ement-3.103
inion-3.060
erva-2.969
Token
Token ID248
Feature activation+7.98e+00

Pos logit contributions
OLOG+3.379
OLOGY+2.894
��+2.698
indal+2.623
GH+2.592
Neg logit contributions
adra-3.943
actic-3.830
ement-3.700
inion-3.609
atic-3.511
Token «
Token ID21110
Feature activation+1.36e+01

Pos logit contributions
OLOG+6.601
OLOGY+5.720
GH+5.291
indal+5.228
IGH+5.132
Neg logit contributions
adra-6.347
actic-6.028
ement-5.685
inion-5.662
anmar-5.518
Token
Token ID140
Feature activation+2.98e+00

Pos logit contributions
OLOG+9.756
OLOGY+8.246
GH+7.809
+7.501
IGH+7.464
Neg logit contributions
adra-10.949
actic-10.514
inion-10.056
ement-9.937
rehens-9.772
Token
Token ID240
Feature activation+4.43e+00

Pos logit contributions
OLOG+2.260
OLOGY+1.932
indal+1.789
��+1.781
ologies+1.746
Neg logit contributions
adra-2.641
actic-2.566
ement-2.465
inion-2.419
atic-2.364
Tokenа
Token ID16142
Feature activation+4.19e+00

Pos logit contributions
OLOG+3.409
OLOGY+2.918
indal+2.680
GH+2.638
��+2.624
Neg logit contributions
adra-3.891
actic-3.752
ement-3.593
inion-3.538
atic-3.447
Token
Token ID140
Feature activation+2.06e+00

Pos logit contributions
OLOG+3.057
OLOGY+2.591
indal+2.364
GH+2.337
ologies+2.302
Neg logit contributions
adra-3.872
actic-3.741
ement-3.579
inion-3.535
atic-3.442
Token
Token ID111
Feature activation+5.45e-01

Pos logit contributions
OLOG+1.524
OLOGY+1.297
indal+1.200
��+1.195
GH+1.168
Neg logit contributions
adra-1.862
actic-1.815
ement-1.741
inion-1.712
atic-1.671
Token online
Token ID2691
Feature activation+3.88e+00

Pos logit contributions
OLOG+2.739
OLOGY+2.426
indal+2.292
ologies+2.251
GH+2.245
Neg logit contributions
adra-1.999
actic-1.914
ement-1.803
inion-1.772
atic-1.714
Token supporters
Token ID5941
Feature activation-1.98e+00

Pos logit contributions
OLOG+3.539
OLOGY+3.124
indal+2.944
GH+2.895
ologies+2.887
Neg logit contributions
adra-2.836
actic-2.705
ement-2.559
inion-2.516
atic-2.443
Token,
Token ID11
Feature activation-8.54e-01

Pos logit contributions
adra+1.396
actic+1.356
ement+1.280
inion+1.254
atic+1.225
Neg logit contributions
OLOG-1.860
OLOGY-1.640
indal-1.551
��-1.535
ologies-1.514
Token using
Token ID1262
Feature activation-1.57e+00

Pos logit contributions
adra+0.629
actic+0.610
ement+0.578
inion+0.566
atic+0.553
Neg logit contributions
OLOG-0.778
OLOGY-0.683
indal-0.648
��-0.640
ologies-0.630
Token the
Token ID262
Feature activation-1.74e-01

Pos logit contributions
adra+1.068
actic+1.025
ement+0.968
inion+0.940
atic+0.924
Neg logit contributions
OLOG-1.520
OLOGY-1.349
indal-1.284
��-1.276
GH-1.251
Token nickname
Token ID21814
Feature activation+1.31e+01

Pos logit contributions
adra+0.131
actic+0.127
ement+0.121
inion+0.117
atic+0.115
Neg logit contributions
OLOG-0.156
OLOGY-0.137
indal-0.130
��-0.129
GH-0.126
Token is
Token ID318
Feature activation+8.35e-01

Pos logit contributions
OLOG+10.183
OLOGY+8.858
ologies+8.470
GH+8.298
indal+8.252
Neg logit contributions
adra-9.910
actic-9.424
inion-9.151
rehens-8.833
anmar-8.771
Token now
Token ID783
Feature activation+2.75e+00

Pos logit contributions
OLOG+0.778
OLOGY+0.687
indal+0.650
��+0.640
ologies+0.634
Neg logit contributions
adra-0.600
actic-0.579
ement-0.548
inion-0.536
atic-0.523
Token commonplace
Token ID33488
Feature activation+5.36e-01

Pos logit contributions
OLOG+2.539
OLOGY+2.238
indal+2.115
��+2.076
ologies+2.068
Neg logit contributions
adra-1.985
actic-1.916
ement-1.811
inion-1.784
atic-1.726
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
OLOG+0.525
OLOGY+0.466
indal+0.442
��+0.435
ologies+0.433
Neg logit contributions
adra-0.360
actic-0.346
ement-0.326
inion-0.320
atic-0.310
Token The
Token ID383
Feature activation+2.06e-01

Pos logit contributions
OLOG+0.106
OLOGY+0.094
indal+0.089
��+0.088
ologies+0.087
Neg logit contributions
adra-0.089
actic-0.086
ement-0.082
inion-0.080
atic-0.078
Token wishes
Token ID12802
Feature activation-4.09e+00

Pos logit contributions
OLOG+2.208
OLOGY+1.922
indal+1.800
ologies+1.776
GH+1.755
Neg logit contributions
adra-2.163
actic-2.093
ement-1.986
inion-1.971
atic-1.912
Token to
Token ID284
Feature activation+1.01e+00

Pos logit contributions
adra+2.276
actic+2.179
ement+2.020
inion+1.952
atic+1.894
Neg logit contributions
OLOG-4.445
OLOGY-3.996
indal-3.818
��-3.785
GH-3.720
Token transition
Token ID6801
Feature activation+3.67e+00

Pos logit contributions
OLOG+0.922
OLOGY+0.814
indal+0.770
��+0.754
ologies+0.753
Neg logit contributions
adra-0.736
actic-0.708
ement-0.670
inion-0.660
atic-0.640
Token their
Token ID511
Feature activation-4.44e-01

Pos logit contributions
OLOG+3.473
OLOGY+3.052
indal+2.880
ologies+2.853
GH+2.834
Neg logit contributions
adra-2.528
actic-2.424
ement-2.286
inion-2.263
rehens-2.200
Token gender
Token ID5279
Feature activation+5.24e+00

Pos logit contributions
adra+0.327
actic+0.315
ement+0.299
inion+0.293
atic+0.286
Neg logit contributions
OLOG-0.406
OLOGY-0.357
indal-0.339
��-0.334
ologies-0.329
Token socially
Token ID18118
Feature activation+1.29e+01

Pos logit contributions
OLOG+4.613
OLOGY+4.027
indal+3.807
ologies+3.773
GH+3.724
Neg logit contributions
adra-3.950
actic-3.791
ement-3.575
inion-3.525
rehens-3.445
Token and
Token ID290
Feature activation+2.22e+00

Pos logit contributions
OLOG+10.353
OLOGY+8.856
ologies+8.612
IGH+8.347
GH+8.327
Neg logit contributions
adra-9.483
actic-8.861
inion-8.523
ement-8.383
rehens-8.311
Token/
Token ID14
Feature activation-2.32e+00

Pos logit contributions
OLOG+2.083
OLOGY+1.830
indal+1.734
ologies+1.706
GH+1.695
Neg logit contributions
adra-1.571
actic-1.508
ement-1.423
inion-1.407
atic-1.353
Tokenor
Token ID273
Feature activation-1.32e+00

Pos logit contributions
adra+0.844
actic+0.797
ement+0.710
inion+0.672
atic+0.652
Neg logit contributions
OLOG-3.002
OLOGY-2.743
��-2.684
indal-2.635
-2.588
Token physically
Token ID10170
Feature activation+4.23e+00

Pos logit contributions
adra+0.897
actic+0.868
ement+0.817
inion+0.799
atic+0.779
Neg logit contributions
OLOG-1.270
OLOGY-1.129
indal-1.071
��-1.058
ologies-1.043
Token,
Token ID11
Feature activation-6.35e-01

Pos logit contributions
OLOG+3.962
OLOGY+3.476
indal+3.284
ologies+3.272
GH+3.236
Neg logit contributions
adra-2.981
actic-2.839
ement-2.670
inion-2.640
rehens-2.556
Token to
Token ID284
Feature activation-8.08e-01

Pos logit contributions
adra+3.158
actic+3.030
ement+2.844
inion+2.723
atic+2.680
Neg logit contributions
OLOG-5.715
OLOGY-5.132
indal-4.924
��-4.867
ologies-4.771
Token be
Token ID307
Feature activation+3.20e-01

Pos logit contributions
adra+0.557
actic+0.537
ement+0.508
inion+0.496
atic+0.484
Neg logit contributions
OLOG-0.777
OLOGY-0.688
indal-0.652
��-0.645
ologies-0.636
Token perceived
Token ID11067
Feature activation-2.90e+00

Pos logit contributions
OLOG+0.309
OLOGY+0.274
indal+0.260
��+0.256
ologies+0.254
Neg logit contributions
adra-0.220
actic-0.211
ement-0.199
inion-0.195
atic-0.190
Token as
Token ID355
Feature activation-3.41e+00

Pos logit contributions
adra+1.797
actic+1.735
ement+1.641
inion+1.588
atic+1.566
Neg logit contributions
OLOG-2.958
OLOGY-2.647
indal-2.522
��-2.493
ologies-2.458
Token such
Token ID884
Feature activation-2.22e+00

Pos logit contributions
adra+2.048
actic+1.956
ement+1.857
inion+1.780
atic+1.779
Neg logit contributions
OLOG-3.554
OLOGY-3.190
indal-3.057
��-3.015
ologies-2.963
Token socially
Token ID18118
Feature activation+1.29e+01

Pos logit contributions
adra+1.395
actic+1.364
ement+1.291
inion+1.240
atic+1.233
Neg logit contributions
OLOG-2.236
OLOGY-1.996
indal-1.906
��-1.891
GH-1.845
Token (
Token ID357
Feature activation+1.26e+00

Pos logit contributions
OLOG+10.320
OLOGY+8.972
ologies+8.491
IGH+8.362
GH+8.293
Neg logit contributions
adra-9.587
actic-8.781
inion-8.484
anmar-8.379
ement-8.299
Tokenincluding
Token ID8201
Feature activation-2.75e+00

Pos logit contributions
OLOG+1.186
OLOGY+1.048
indal+0.992
��+0.975
ologies+0.970
Neg logit contributions
adra-0.887
actic-0.855
ement-0.807
inion-0.792
atic-0.769
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
adra+1.784
actic+1.735
ement+1.636
inion+1.578
atic+1.562
Neg logit contributions
OLOG-2.714
OLOGY-2.415
indal-2.318
��-2.294
ologies-2.242
Token use
Token ID779
Feature activation-1.45e+00

Pos logit contributions
adra+0.928
actic+0.901
ement+0.854
inion+0.825
atic+0.807
Neg logit contributions
OLOG-1.345
OLOGY-1.190
indal-1.146
��-1.126
GH-1.107
Token of
Token ID286
Feature activation-1.08e+00

Pos logit contributions
adra+0.950
actic+0.924
ement+0.874
inion+0.844
atic+0.830
Neg logit contributions
OLOG-1.427
OLOGY-1.267
indal-1.210
��-1.197
GH-1.176
Tokenн
Token ID22177
Feature activation-2.93e+00

Pos logit contributions
OLOG+0.410
OLOGY+0.355
indal+0.334
��+0.327
ologies+0.325
Neg logit contributions
adra-0.415
actic-0.403
ement-0.385
inion-0.379
atic-0.370
Tokenн
Token ID22177
Feature activation-2.84e+00

Pos logit contributions
adra+2.304
actic+2.229
ement+2.156
inion+2.085
atic+2.058
Neg logit contributions
OLOG-2.492
OLOGY-2.199
indal-2.094
��-2.020
GH-2.016
Tokenы
Token ID45035
Feature activation+1.54e+00

Pos logit contributions
adra+2.317
actic+2.254
ement+2.170
inion+2.105
atic+2.085
Neg logit contributions
OLOG-2.320
OLOGY-2.032
indal-1.946
��-1.891
GH-1.863
Token
Token ID141
Feature activation+3.12e+00

Pos logit contributions
OLOG+1.161
OLOGY+0.994
indal+0.928
��+0.909
ologies+0.897
Neg logit contributions
adra-1.371
actic-1.333
ement-1.277
inion-1.256
atic-1.231
Token
Token ID227
Feature activation+5.27e+00

Pos logit contributions
OLOG+2.501
OLOGY+2.160
indal+2.011
��+1.992
ologies+1.973
Neg logit contributions
adra-2.640
actic-2.561
ement-2.432
inion-2.393
atic-2.339
Token «
Token ID21110
Feature activation+1.28e+01

Pos logit contributions
OLOG+4.616
OLOGY+4.031
indal+3.737
GH+3.699
ologies+3.689
Neg logit contributions
adra-4.050
actic-3.875
ement-3.628
inion-3.627
rehens-3.494
Token
Token ID141
Feature activation+4.64e+00

Pos logit contributions
OLOG+9.793
OLOGY+8.363
GH+7.869
+7.595
indal+7.586
Neg logit contributions
adra-9.975
actic-9.670
inion-9.120
ement-9.067
anmar-8.797
Token
Token ID228
Feature activation+3.42e+00

Pos logit contributions
OLOG+2.944
OLOGY+2.435
��+2.240
indal+2.209
ologies+2.164
Neg logit contributions
adra-4.668
actic-4.558
ement-4.376
inion-4.304
atic-4.217
Tokenи
Token ID18849
Feature activation-1.97e-02

Pos logit contributions
OLOG+2.816
OLOGY+2.437
indal+2.277
��+2.245
GH+2.230
Neg logit contributions
adra-2.811
actic-2.710
ement-2.586
inion-2.554
atic-2.476
Tokenн
Token ID22177
Feature activation-8.46e-01

Pos logit contributions
adra+0.018
actic+0.017
ement+0.017
inion+0.016
atic+0.016
Neg logit contributions
OLOG-0.015
OLOGY-0.013
indal-0.012
��-0.012
ologies-0.011
Tokenк
Token ID31583
Feature activation+2.66e-01

Pos logit contributions
adra+0.728
actic+0.707
ement+0.679
inion+0.664
atic+0.654
Neg logit contributions
OLOG-0.663
OLOGY-0.573
indal-0.539
��-0.527
ologies-0.519
Token.
Token ID13
Feature activation+7.00e-01

Pos logit contributions
adra+0.184
actic+0.178
ement+0.169
inion+0.164
atic+0.160
Neg logit contributions
OLOG-0.269
OLOGY-0.239
indal-0.226
��-0.225
ologies-0.221
Token
Token ID447
Feature activation+2.03e-01

Pos logit contributions
OLOG+0.610
OLOGY+0.532
indal+0.506
��+0.503
ologies+0.493
Neg logit contributions
adra-0.541
actic-0.525
ement-0.500
inion-0.489
atic-0.479
Token
Token ID251
Feature activation+1.65e+00

Pos logit contributions
OLOG+0.149
OLOGY+0.127
��+0.117
indal+0.117
GH+0.114
Neg logit contributions
adra-0.185
actic-0.180
ement-0.173
inion-0.169
atic-0.166
Token
Token ID198
Feature activation-9.62e-01

Pos logit contributions
OLOG+1.394
OLOGY+1.215
indal+1.139
GH+1.110
ologies+1.110
Neg logit contributions
adra-1.329
actic-1.283
ement-1.213
inion-1.199
atic-1.168
Token
Token ID198
Feature activation+7.03e-03

Pos logit contributions
adra+0.672
actic+0.654
ement+0.621
inion+0.600
atic+0.595
Neg logit contributions
OLOG-0.912
OLOGY-0.803
��-0.781
indal-0.770
ologies-0.745
TokenTed
Token ID38972
Feature activation+1.27e+01

Pos logit contributions
OLOG+0.005
OLOGY+0.004
indal+0.004
��+0.004
ologies+0.004
Neg logit contributions
adra-0.007
actic-0.006
ement-0.006
inion-0.006
atic-0.006
Token Cruz
Token ID8742
Feature activation+7.44e-01

Pos logit contributions
OLOG+7.311
OLOGY+6.001
indal+5.780
GH+5.588
ologies+5.459
Neg logit contributions
adra-12.333
actic-11.838
anmar-11.341
inion-11.295
ement-11.280
Token speaks
Token ID9209
Feature activation-3.04e-01

Pos logit contributions
OLOG+0.675
OLOGY+0.593
indal+0.560
��+0.552
ologies+0.547
Neg logit contributions
adra-0.553
actic-0.534
ement-0.506
inion-0.496
atic-0.485
Token to
Token ID284
Feature activation-1.23e+00

Pos logit contributions
adra+0.199
actic+0.192
ement+0.180
inion+0.176
atic+0.171
Neg logit contributions
OLOG-0.302
OLOGY-0.268
indal-0.255
��-0.251
ologies-0.249
Token potential
Token ID2785
Feature activation+2.93e+00

Pos logit contributions
adra+0.847
actic+0.818
ement+0.772
inion+0.757
atic+0.736
Neg logit contributions
OLOG-1.185
OLOGY-1.049
indal-0.996
��-0.986
ologies-0.976
Token supporters
Token ID5941
Feature activation-2.70e+00

Pos logit contributions
OLOG+2.638
OLOGY+2.322
indal+2.190
ologies+2.138
��+2.137
Neg logit contributions
adra-2.182
actic-2.110
ement-1.996
inion-1.961
atic-1.906
Token 2017
Token ID2177
Feature activation-4.71e+00

Pos logit contributions
OLOG+0.318
OLOGY+0.285
indal+0.272
��+0.268
ologies+0.266
Neg logit contributions
adra-0.174
actic-0.167
ement-0.156
inion-0.152
atic-0.147
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
adra+3.496
actic+3.377
ement+3.211
inion+3.162
atic+3.074
Neg logit contributions
OLOG-4.226
OLOGY-3.714
indal-3.521
��-3.460
ologies-3.428
Token
Token ID198
Feature activation+7.64e-01

Pos logit contributions
OLOG+0.318
OLOGY+0.279
��+0.266
indal+0.266
ologies+0.258
Neg logit contributions
adra-0.251
actic-0.244
ement-0.231
inion-0.226
atic-0.222
TokenPay
Token ID19197
Feature activation-2.62e+00

Pos logit contributions
OLOG+0.630
OLOGY+0.547
indal+0.513
��+0.506
ologies+0.500
Neg logit contributions
adra-0.632
actic-0.612
ement-0.582
inion-0.573
atic-0.561
Token attention
Token ID3241
Feature activation-5.87e-01

Pos logit contributions
adra+1.341
actic+1.272
ement+1.182
inion+1.148
atic+1.110
Neg logit contributions
OLOG-2.969
OLOGY-2.674
��-2.567
indal-2.562
GH-2.512
Token Ted
Token ID11396
Feature activation+1.26e+01

Pos logit contributions
adra+0.381
actic+0.368
ement+0.346
inion+0.338
atic+0.329
Neg logit contributions
OLOG-0.586
OLOGY-0.522
indal-0.496
��-0.489
ologies-0.485
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
OLOG+10.265
OLOGY+8.949
ologies+8.609
indal+8.591
GH+8.581
Neg logit contributions
adra-8.986
actic-8.422
inion-7.973
rehens-7.852
anmar-7.842
Token they
Token ID484
Feature activation-1.25e+00

Pos logit contributions
adra+1.177
actic+1.149
ement+1.086
inion+1.057
atic+1.032
Neg logit contributions
OLOG-1.535
OLOGY-1.350
��-1.275
indal-1.273
ologies-1.242
Token
Token ID447
Feature activation+2.10e+00

Pos logit contributions
adra+0.940
actic+0.916
ement+0.871
inion+0.850
atic+0.835
Neg logit contributions
OLOG-1.121
OLOGY-0.979
indal-0.927
��-0.921
ologies-0.900
Token
Token ID247
Feature activation-7.91e-01

Pos logit contributions
OLOG+1.664
OLOGY+1.431
indal+1.342
��+1.306
ologies+1.300
Neg logit contributions
adra-1.813
actic-1.758
ement-1.680
inion-1.657
atic-1.617
Tokenre
Token ID260
Feature activation+1.03e-01

Pos logit contributions
adra+0.510
actic+0.493
ement+0.466
inion+0.454
atic+0.441
Neg logit contributions
OLOG-0.794
OLOGY-0.705
indal-0.672
��-0.668
ologies-0.653
Token,
Token ID11
Feature activation-8.03e-01

Pos logit contributions
OLOG+0.398
OLOGY+0.354
indal+0.336
��+0.331
ologies+0.329
Neg logit contributions
adra-0.264
actic-0.254
ement-0.239
inion-0.234
atic-0.227
Token Stephen
Token ID7970
Feature activation+8.34e+00

Pos logit contributions
adra+0.628
actic+0.609
ement+0.580
inion+0.568
atic+0.555
Neg logit contributions
OLOG-0.697
OLOGY-0.609
indal-0.572
��-0.564
ologies-0.558
Token P
Token ID350
Feature activation-4.74e+00

Pos logit contributions
OLOG+6.109
OLOGY+5.213
ologies+4.996
GH+4.968
indal+4.871
Neg logit contributions
adra-6.882
actic-6.822
ement-6.357
inion-6.201
anmar-6.198
Tokenaea
Token ID44705
Feature activation+1.55e+00

Pos logit contributions
adra+3.304
actic+3.200
ement+3.030
inion+2.968
atic+2.923
Neg logit contributions
OLOG-4.554
OLOGY-4.036
��-3.793
-3.727
-3.714
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
OLOG+1.509
OLOGY+1.341
indal+1.277
��+1.269
GH+1.257
Neg logit contributions
adra-1.017
actic-0.984
ement-0.925
inion-0.910
rehens-0.874
Token Ricky
Token ID31668
Feature activation+1.25e+01

Pos logit contributions
adra+0.831
actic+0.805
ement+0.766
inion+0.751
atic+0.733
Neg logit contributions
OLOG-0.927
OLOGY-0.809
indal-0.761
��-0.750
ologies-0.741
Token Jean
Token ID11320
Feature activation+2.56e+00

Pos logit contributions
OLOG+9.272
OLOGY+8.121
ologies+7.785
indal+7.775
GH+7.698
Neg logit contributions
adra-9.901
actic-9.452
rehens-8.992
ement-8.983
inion-8.831
Token Francois
Token ID40419
Feature activation+3.68e+00

Pos logit contributions
OLOG+2.346
OLOGY+2.057
indal+1.962
GH+1.914
ologies+1.911
Neg logit contributions
adra-1.870
actic-1.788
ement-1.683
inion-1.661
rehens-1.608
Token and
Token ID290
Feature activation+4.69e-01

Pos logit contributions
OLOG+3.489
OLOGY+3.094
indal+2.965
��+2.931
GH+2.915
Neg logit contributions
adra-2.479
actic-2.390
ement-2.251
inion-2.224
rehens-2.150
Token Zig
Token ID24992
Feature activation+2.38e+00

Pos logit contributions
OLOG+0.404
OLOGY+0.353
indal+0.332
��+0.326
ologies+0.324
Neg logit contributions
adra-0.369
actic-0.357
ement-0.339
inion-0.334
atic-0.325
Tokengy
Token ID1360
Feature activation+5.02e+00

Pos logit contributions
OLOG+1.572
OLOGY+1.314
indal+1.213
GH+1.173
ologies+1.172
Neg logit contributions
adra-2.343
actic-2.285
ement-2.192
inion-2.168
atic-2.121
Tokenhead
Token ID2256
Feature activation+4.40e+00

Pos logit contributions
OLOG+0.399
OLOGY+0.359
indal+0.341
��+0.341
ologies+0.335
Neg logit contributions
adra-0.210
actic-0.201
ement-0.187
inion-0.182
atic-0.177
Tokenle
Token ID293
Feature activation-2.39e-01

Pos logit contributions
OLOG+3.509
OLOGY+3.072
indal+2.898
GH+2.871
ologies+2.842
Neg logit contributions
adra-3.584
actic-3.438
ement-3.250
inion-3.234
erva-3.180
Token
Token ID198
Feature activation-1.37e+00

Pos logit contributions
adra+0.191
actic+0.184
ement+0.175
inion+0.173
atic+0.167
Neg logit contributions
OLOG-0.203
OLOGY-0.177
indal-0.166
GH-0.163
��-0.162
Token
Token ID198
Feature activation-5.04e-01

Pos logit contributions
adra+0.953
actic+0.928
ement+0.882
inion+0.852
atic+0.845
Neg logit contributions
OLOG-1.310
OLOGY-1.152
��-1.129
indal-1.109
ologies-1.071
Token2014
Token ID4967
Feature activation+4.08e+00

Pos logit contributions
adra+0.373
actic+0.362
ement+0.344
atic+0.330
inion+0.328
Neg logit contributions
OLOG-0.456
OLOGY-0.403
indal-0.390
��-0.389
ologies-0.377
Token Ricky
Token ID31668
Feature activation+1.24e+01

Pos logit contributions
OLOG+3.296
OLOGY+2.851
indal+2.644
ologies+2.619
GH+2.611
Neg logit contributions
adra-3.415
actic-3.251
ement-3.128
inion-3.083
erva-2.984
Token G
Token ID402
Feature activation-1.18e+00

Pos logit contributions
OLOG+7.641
OLOGY+6.495
GH+6.480
indal+5.841
ologies+5.826
Neg logit contributions
adra-11.433
actic-11.018
ement-10.606
inion-10.573
rehens-10.467
Tokenerv
Token ID712
Feature activation-3.69e+00

Pos logit contributions
adra+0.421
actic+0.395
ement+0.359
inion+0.327
atic+0.318
Neg logit contributions
OLOG-1.522
OLOGY-1.395
��-1.347
indal-1.333
ologies-1.318
Tokenais
Token ID15152
Feature activation-6.74e-01

Pos logit contributions
adra+1.384
actic+1.303
ement+1.169
inion+1.142
atic+1.085
Neg logit contributions
OLOG-4.781
��-4.371
OLOGY-4.340
-4.200
-4.110
Token
Token ID198
Feature activation-2.43e+00

Pos logit contributions
adra+0.515
actic+0.496
inion+0.467
ement+0.467
atic+0.449
Neg logit contributions
OLOG-0.597
OLOGY-0.524
indal-0.492
GH-0.483
��-0.481
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
adra+1.618
actic+1.582
ement+1.503
atic+1.440
inion+1.439
Neg logit contributions
OLOG-2.364
OLOGY-2.081
��-2.078
indal-2.015
-1.944
Token a
Token ID257
Feature activation-1.63e+00

Pos logit contributions
adra+0.273
actic+0.263
ement+0.249
inion+0.243
atic+0.237
Neg logit contributions
OLOG-0.373
OLOGY-0.330
indal-0.313
��-0.309
ologies-0.306
Token Democrat
Token ID9755
Feature activation+1.67e+00

Pos logit contributions
adra+1.409
actic+1.363
ement+1.306
inion+1.277
atic+1.258
Neg logit contributions
OLOG-1.281
OLOGY-1.101
indal-1.034
��-1.017
ologies-1.001
Token.
Token ID13
Feature activation-4.69e-01

Pos logit contributions
OLOG+1.615
OLOGY+1.434
indal+1.359
��+1.340
ologies+1.333
Neg logit contributions
adra-1.120
actic-1.081
ement-1.017
inion-0.997
atic-0.963
Token Sen
Token ID2311
Feature activation+4.30e+00

Pos logit contributions
adra+0.351
actic+0.340
ement+0.322
inion+0.316
atic+0.309
Neg logit contributions
OLOG-0.421
OLOGY-0.369
indal-0.350
��-0.348
ologies-0.343
Token.
Token ID13
Feature activation+3.05e+00

Pos logit contributions
OLOG+3.927
OLOGY+3.443
indal+3.360
ologies+3.262
GH+3.260
Neg logit contributions
adra-2.966
actic-2.865
ement-2.714
inion-2.681
rehens-2.571
Token Eddie
Token ID19478
Feature activation+1.24e+01

Pos logit contributions
OLOG+2.562
OLOGY+2.232
indal+2.097
GH+2.050
ologies+2.045
Neg logit contributions
adra-2.433
actic-2.357
ement-2.240
inion-2.216
atic-2.147
Token Luc
Token ID7598
Feature activation-6.62e+00

Pos logit contributions
OLOG+9.140
OLOGY+7.903
indal+7.622
IGH+7.555
GH+7.481
Neg logit contributions
adra-9.470
actic-9.370
inion-8.856
ement-8.779
erva-8.749
Tokenio
Token ID952
Feature activation-2.86e+00

Pos logit contributions
adra+4.036
actic+3.850
ement+3.660
atic+3.564
inion+3.507
Neg logit contributions
OLOG-6.901
OLOGY-6.187
��-5.830
-5.670
indal-5.619
Token,
Token ID11
Feature activation-7.82e-01

Pos logit contributions
adra+1.917
actic+1.840
ement+1.749
inion+1.686
atic+1.673
Neg logit contributions
OLOG-2.790
OLOGY-2.487
indal-2.351
��-2.311
ologies-2.292
Token a
Token ID257
Feature activation-1.94e+00

Pos logit contributions
adra+0.518
actic+0.498
ement+0.471
inion+0.458
atic+0.446
Neg logit contributions
OLOG-0.771
OLOGY-0.686
indal-0.652
��-0.644
ologies-0.639
Token Browns
Token ID17671
Feature activation-1.02e+00

Pos logit contributions
adra+1.772
actic+1.718
ement+1.653
inion+1.617
atic+1.593
Neg logit contributions
OLOG-1.417
OLOGY-1.208
indal-1.126
��-1.116
ologies-1.091
Tokeniv
Token ID452
Feature activation-2.91e+00

Pos logit contributions
adra+0.423
actic+0.407
ement+0.387
inion+0.378
atic+0.370
Neg logit contributions
OLOG-0.519
OLOGY-0.456
indal-0.432
��-0.428
ologies-0.421
Tokenica
Token ID3970
Feature activation+3.38e+00

Pos logit contributions
adra+2.371
actic+2.309
ement+2.179
inion+2.149
atic+2.124
Neg logit contributions
OLOG-2.434
OLOGY-2.107
indal-1.985
��-1.959
GH-1.924
Token released
Token ID2716
Feature activation-1.11e-02

Pos logit contributions
OLOG+3.025
OLOGY+2.668
indal+2.512
ologies+2.468
��+2.452
Neg logit contributions
adra-2.516
actic-2.439
ement-2.301
inion-2.281
atic-2.191
Token its
Token ID663
Feature activation-2.14e+00

Pos logit contributions
adra+0.008
actic+0.008
ement+0.007
inion+0.007
atic+0.007
Neg logit contributions
OLOG-0.010
OLOGY-0.009
indal-0.009
ologies-0.009
��-0.009
Token Sp
Token ID1338
Feature activation+8.63e-01

Pos logit contributions
adra+1.494
actic+1.445
ement+1.361
inion+1.331
atic+1.298
Neg logit contributions
OLOG-2.036
OLOGY-1.791
indal-1.707
��-1.696
GH-1.663
Tokenyd
Token ID5173
Feature activation+1.24e+01

Pos logit contributions
OLOG+0.846
OLOGY+0.752
indal+0.714
��+0.701
ologies+0.698
Neg logit contributions
adra-0.577
actic-0.556
ement-0.523
inion-0.513
atic-0.497
Tokenus
Token ID385
Feature activation+4.35e+00

Pos logit contributions
OLOG+11.874
OLOGY+10.716
ologies+10.164
indal+10.084
��+9.799
Neg logit contributions
adra-7.027
actic-6.584
inion-6.418
ement-6.242
rehens-6.090
Token 9
Token ID860
Feature activation+1.54e-01

Pos logit contributions
OLOG+4.061
OLOGY+3.611
indal+3.393
ologies+3.343
GH+3.319
Neg logit contributions
adra-3.065
actic-2.935
ement-2.750
inion-2.736
atic-2.624
Token platform
Token ID3859
Feature activation+2.75e+00

Pos logit contributions
OLOG+0.146
OLOGY+0.129
indal+0.123
��+0.121
ologies+0.120
Neg logit contributions
adra-0.107
actic-0.103
ement-0.098
inion-0.096
atic-0.093
Token,
Token ID11
Feature activation+3.30e-02

Pos logit contributions
OLOG+2.691
OLOGY+2.395
indal+2.261
ologies+2.239
��+2.214
Neg logit contributions
adra-1.821
actic-1.757
ement-1.639
inion-1.624
rehens-1.557
Token a
Token ID257
Feature activation-2.15e+00

Pos logit contributions
OLOG+0.030
OLOGY+0.027
indal+0.025
��+0.025
ologies+0.025
Neg logit contributions
adra-0.024
actic-0.023
ement-0.022
inion-0.022
atic-0.021
Token with
Token ID351
Feature activation-1.45e+00

Pos logit contributions
adra+2.167
actic+2.073
ement+1.944
atic+1.880
inion+1.879
Neg logit contributions
OLOG-3.784
OLOGY-3.393
indal-3.237
��-3.212
GH-3.150
Token Republican
Token ID3415
Feature activation+2.98e+00

Pos logit contributions
adra+1.181
actic+1.132
ement+1.082
inion+1.059
atic+1.045
Neg logit contributions
OLOG-1.212
OLOGY-1.053
indal-0.992
��-0.980
ologies-0.964
Token nominee
Token ID10429
Feature activation+3.44e+00

Pos logit contributions
OLOG+2.804
OLOGY+2.476
indal+2.346
��+2.297
ologies+2.297
Neg logit contributions
adra-2.100
actic-2.030
ement-1.913
inion-1.881
atic-1.816
Token Donald
Token ID3759
Feature activation+6.94e+00

Pos logit contributions
OLOG+2.634
OLOGY+2.255
indal+2.105
ologies+2.049
GH+2.040
Neg logit contributions
adra-3.033
actic-2.942
ement-2.806
inion-2.765
atic-2.693
Token Trump
Token ID1301
Feature activation-3.17e+00

Pos logit contributions
OLOG+4.042
OLOGY+3.306
indal+2.927
GH+2.876
IGH+2.813
Neg logit contributions
adra-7.287
actic-7.147
ement-6.808
inion-6.784
erva-6.693
Token Donald
Token ID3759
Feature activation+1.24e+01

Pos logit contributions
adra+2.476
actic+2.398
ement+2.302
atic+2.246
inion+2.228
Neg logit contributions
OLOG-2.712
OLOGY-2.370
indal-2.247
��-2.180
-2.164
Token John
Token ID1757
Feature activation+6.02e+00

Pos logit contributions
OLOG+7.681
OLOGY+6.442
GH+6.015
IGH+5.510
��+5.497
Neg logit contributions
adra-11.503
actic-11.434
ement-10.948
inion-10.777
anmar-10.624
Token Trump
Token ID1301
Feature activation-1.83e-01

Pos logit contributions
OLOG+3.830
OLOGY+3.178
indal+2.908
��+2.807
ologies+2.801
Neg logit contributions
adra-6.080
actic-5.925
ement-5.698
inion-5.627
atic-5.503
TokenHouse
Token ID18102
Feature activation+2.22e+00

Pos logit contributions
adra+0.178
actic+0.173
ement+0.166
inion+0.163
atic+0.161
Neg logit contributions
OLOG-0.123
OLOGY-0.103
indal-0.096
��-0.094
ologies-0.092
Token committee
Token ID5583
Feature activation-5.29e+00

Pos logit contributions
OLOG+1.367
OLOGY+1.134
indal+0.997
��+0.990
ologies+0.985
Neg logit contributions
adra-2.306
actic-2.229
ement-2.136
inion-2.132
atic-2.069
Token believes
Token ID5804
Feature activation-8.01e-01

Pos logit contributions
adra+3.502
actic+3.393
ement+3.109
inion+3.046
atic+2.964
Neg logit contributions
OLOG-5.249
OLOGY-4.652
indal-4.370
��-4.270
ologies-4.254
Token
Token ID564
Feature activation-1.25e+00

Pos logit contributions
OLOG+1.429
OLOGY+1.252
indal+1.180
��+1.155
GH+1.152
Neg logit contributions
adra-1.234
actic-1.193
ement-1.124
inion-1.113
atic-1.081
Token
Token ID250
Feature activation-1.49e+00

Pos logit contributions
adra+1.131
actic+1.095
ement+1.044
inion+1.036
atic+1.012
Neg logit contributions
OLOG-0.940
OLOGY-0.801
indal-0.748
ologies-0.724
��-0.719
Tokentop
Token ID4852
Feature activation+5.12e-01

Pos logit contributions
adra+1.269
actic+1.233
ement+1.181
inion+1.157
atic+1.136
Neg logit contributions
OLOG-1.185
OLOGY-1.026
indal-0.956
��-0.941
ologies-0.929
Token-
Token ID12
Feature activation+1.63e-01

Pos logit contributions
OLOG+0.459
OLOGY+0.403
indal+0.381
��+0.375
ologies+0.371
Neg logit contributions
adra-0.385
actic-0.374
ement-0.353
inion-0.349
atic-0.340
Tokenhat
Token ID5183
Feature activation+9.31e-01

Pos logit contributions
OLOG+0.145
OLOGY+0.127
indal+0.120
��+0.118
ologies+0.117
Neg logit contributions
adra-0.124
actic-0.120
ement-0.113
inion-0.112
atic-0.109
Token Eddie
Token ID19478
Feature activation+1.24e+01

Pos logit contributions
OLOG+0.788
OLOGY+0.685
indal+0.645
��+0.632
ologies+0.628
Neg logit contributions
adra-0.747
actic-0.723
ement-0.688
inion-0.676
atic-0.659
Token
Token ID447
Feature activation-2.52e+00

Pos logit contributions
OLOG+8.955
OLOGY+7.764
indal+7.356
IGH+7.284
GH+7.257
Neg logit contributions
adra-9.751
actic-9.667
ement-9.033
inion-8.979
erva-8.834
Token
Token ID251
Feature activation-1.99e+00

Pos logit contributions
adra+2.001
actic+1.914
inion+1.835
ement+1.834
atic+1.768
Neg logit contributions
OLOG-2.163
OLOGY-1.873
indal-1.769
ologies-1.703
GH-1.697
Token because
Token ID780
Feature activation+1.73e-01

Pos logit contributions
adra+1.565
actic+1.523
ement+1.458
inion+1.426
atic+1.395
Neg logit contributions
OLOG-1.712
OLOGY-1.491
indal-1.401
��-1.385
GH-1.358
Token of
Token ID286
Feature activation-3.68e+00

Pos logit contributions
OLOG+0.160
OLOGY+0.141
indal+0.133
��+0.131
ologies+0.130
Neg logit contributions
adra-0.126
actic-0.121
ement-0.114
inion-0.113
atic-0.109
Token the
Token ID262
Feature activation-1.60e+00

Pos logit contributions
adra+2.318
actic+2.231
ement+2.096
inion+2.026
atic+1.986
Neg logit contributions
OLOG-3.724
OLOGY-3.330
indal-3.162
��-3.106
ologies-3.083
Token McCain
Token ID14264
Feature activation-2.50e+00

Pos logit contributions
OLOG+1.271
OLOGY+1.079
indal+1.001
��+0.974
ologies+0.970
Neg logit contributions
adra-1.614
actic-1.568
ement-1.503
inion-1.483
atic-1.446
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
adra+1.938
actic+1.857
ement+1.784
inion+1.749
atic+1.715
Neg logit contributions
OLOG-2.179
OLOGY-1.905
indal-1.791
��-1.782
ologies-1.750
Token rejected
Token ID8606
Feature activation+1.51e+00

Pos logit contributions
OLOG+1.459
OLOGY+1.281
indal+1.209
��+1.186
ologies+1.180
Neg logit contributions
adra-1.214
actic-1.173
ement-1.111
inion-1.093
atic-1.062
Token presidential
Token ID4787
Feature activation+9.96e-01

Pos logit contributions
OLOG+1.367
OLOGY+1.202
indal+1.134
��+1.113
ologies+1.109
Neg logit contributions
adra-1.116
actic-1.076
ement-1.020
inion-1.002
atic-0.974
Token nominee
Token ID10429
Feature activation+3.68e+00

Pos logit contributions
OLOG+0.900
OLOGY+0.790
indal+0.747
��+0.735
ologies+0.728
Neg logit contributions
adra-0.745
actic-0.720
ement-0.682
inion-0.670
atic-0.653
Token Ted
Token ID11396
Feature activation+1.23e+01

Pos logit contributions
OLOG+2.940
OLOGY+2.538
indal+2.370
ologies+2.313
GH+2.301
Neg logit contributions
adra-3.117
actic-3.018
ement-2.887
inion-2.844
atic-2.765
Token Cruz
Token ID8742
Feature activation+1.22e+00

Pos logit contributions
OLOG+6.227
OLOGY+5.054
indal+4.616
GH+4.472
ologies+4.447
Neg logit contributions
adra-12.533
actic-12.530
anmar-12.050
inion-11.968
rehens-11.866
Token.
Token ID13
Feature activation-1.32e-01

Pos logit contributions
OLOG+1.081
OLOGY+0.948
indal+0.894
��+0.878
ologies+0.873
Neg logit contributions
adra-0.924
actic-0.894
ement-0.848
inion-0.833
atic-0.811
Token Imagine
Token ID18450
Feature activation-1.04e+00

Pos logit contributions
adra+0.104
actic+0.101
ement+0.097
inion+0.095
atic+0.093
Neg logit contributions
OLOG-0.113
OLOGY-0.098
��-0.093
indal-0.093
ologies-0.091
Token Lindsey
Token ID30334
Feature activation+4.39e+00

Pos logit contributions
adra+0.697
actic+0.670
ement+0.631
inion+0.617
atic+0.602
Neg logit contributions
OLOG-1.014
OLOGY-0.904
indal-0.855
��-0.847
ologies-0.836
Token Graham
Token ID11520
Feature activation-1.30e-01

Pos logit contributions
OLOG+3.321
OLOGY+2.840
indal+2.648
ologies+2.576
GH+2.572
Neg logit contributions
adra-3.889
actic-3.792
ement-3.609
inion-3.559
atic-3.482
Token Alt
Token ID12344
Feature activation+6.45e+00

Pos logit contributions
OLOG+3.983
OLOGY+3.478
indal+3.250
ologies+3.205
GH+3.177
Neg logit contributions
adra-3.630
actic-3.469
ement-3.291
inion-3.255
erva-3.157
Token-
Token ID12
Feature activation+2.21e+00

Pos logit contributions
OLOG+5.491
OLOGY+4.821
indal+4.454
ologies+4.439
GH+4.334
Neg logit contributions
adra-5.097
actic-4.798
ement-4.544
inion-4.494
atic-4.422
TokenRight
Token ID11028
Feature activation-6.35e-01

Pos logit contributions
OLOG+1.505
OLOGY+1.263
indal+1.164
��+1.135
ologies+1.125
Neg logit contributions
adra-2.146
actic-2.090
ement-2.006
inion-1.979
atic-1.939
Token Content
Token ID14041
Feature activation-1.55e+00

Pos logit contributions
adra+0.468
actic+0.459
ement+0.437
inion+0.427
atic+0.420
Neg logit contributions
OLOG-0.571
OLOGY-0.500
indal-0.475
��-0.474
GH-0.463
Token on
Token ID319
Feature activation+1.98e+00

Pos logit contributions
adra+1.078
actic+1.047
ement+1.000
inion+0.968
atic+0.956
Neg logit contributions
OLOG-1.460
OLOGY-1.289
indal-1.224
��-1.223
GH-1.198
Token Social
Token ID5483
Feature activation+1.23e+01

Pos logit contributions
OLOG+1.650
OLOGY+1.434
indal+1.344
ologies+1.310
��+1.309
Neg logit contributions
adra-1.623
actic-1.571
ement-1.493
inion-1.474
atic-1.433
Token Media
Token ID6343
Feature activation+2.52e+00

Pos logit contributions
OLOG+9.275
OLOGY+8.008
ologies+7.382
GH+7.264
IGH+7.258
Neg logit contributions
adra-10.171
actic-9.368
ement-8.960
inion-8.928
erva-8.876
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
OLOG+2.264
OLOGY+1.989
indal+1.873
ologies+1.836
GH+1.823
Neg logit contributions
adra-1.894
actic-1.823
ement-1.723
inion-1.705
atic-1.645
Token
Token ID198
Feature activation+1.97e-01

Pos logit contributions
OLOG+0.200
OLOGY+0.176
��+0.168
indal+0.168
ologies+0.163
Neg logit contributions
adra-0.157
actic-0.152
ement-0.144
inion-0.141
atic-0.138
TokenOne
Token ID3198
Feature activation+7.35e-01

Pos logit contributions
OLOG+0.152
OLOGY+0.130
indal+0.123
��+0.121
ologies+0.119
Neg logit contributions
adra-0.172
actic-0.168
ement-0.160
inion-0.157
atic-0.155
Token of
Token ID286
Feature activation+9.03e-01

Pos logit contributions
OLOG+0.701
OLOGY+0.620
indal+0.588
��+0.579
ologies+0.575
Neg logit contributions
adra-0.511
actic-0.493
ement-0.466
inion-0.456
atic-0.444
Token of
Token ID286
Feature activation+1.29e-01

Pos logit contributions
OLOG+1.537
OLOGY+1.358
indal+1.278
ologies+1.265
��+1.252
Neg logit contributions
adra-1.119
actic-1.068
ement-1.005
inion-0.989
rehens-0.963
Token leadership
Token ID5531
Feature activation+1.14e+00

Pos logit contributions
OLOG+0.149
OLOGY+0.135
indal+0.129
ologies+0.127
��+0.126
Neg logit contributions
adra-0.064
actic-0.061
ement-0.056
inion-0.054
atic-0.052
Token,
Token ID11
Feature activation+8.48e-01

Pos logit contributions
OLOG+1.136
OLOGY+1.012
indal+0.959
ologies+0.944
��+0.942
Neg logit contributions
adra-0.750
actic-0.720
ement-0.674
inion-0.661
atic-0.640
Token Skip
Token ID32214
Feature activation+9.39e-01

Pos logit contributions
OLOG+0.752
OLOGY+0.660
indal+0.621
��+0.608
ologies+0.608
Neg logit contributions
adra-0.644
actic-0.624
ement-0.592
inion-0.581
atic-0.565
Token Pr
Token ID1736
Feature activation-4.08e+00

Pos logit contributions
OLOG+0.805
OLOGY+0.703
indal+0.661
��+0.649
ologies+0.644
Neg logit contributions
adra-0.743
actic-0.720
ement-0.684
inion-0.673
atic-0.656
Tokenich
Token ID488
Feature activation+1.22e+01

Pos logit contributions
adra+2.752
actic+2.634
ement+2.477
inion+2.457
atic+2.421
Neg logit contributions
OLOG-4.019
OLOGY-3.572
��-3.422
-3.291
ologies-3.270
Tokenard
Token ID446
Feature activation+5.46e+00

Pos logit contributions
OLOG+8.244
OLOGY+7.170
indal+7.089
ologies+6.682
GH+6.502
Neg logit contributions
adra-10.180
actic-10.140
ement-9.514
inion-9.427
rehens-9.191
Token took
Token ID1718
Feature activation-4.94e+00

Pos logit contributions
OLOG+5.057
OLOGY+4.481
indal+4.241
ologies+4.202
GH+4.163
Neg logit contributions
adra-3.761
actic-3.682
inion-3.393
ement-3.360
rehens-3.275
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
adra+2.959
actic+2.813
ement+2.621
inion+2.558
atic+2.464
Neg logit contributions
OLOG-5.171
OLOGY-4.619
indal-4.432
��-4.424
GH-4.292
Token reins
Token ID35627
Feature activation+4.81e+00

Pos logit contributions
adra+0.772
actic+0.742
ement+0.702
inion+0.689
atic+0.668
Neg logit contributions
OLOG-1.050
OLOGY-0.929
indal-0.886
��-0.870
GH-0.860
Token of
Token ID286
Feature activation+1.76e-01

Pos logit contributions
OLOG+4.366
OLOGY+3.849
ologies+3.601
indal+3.598
GH+3.538
Neg logit contributions
adra-3.457
actic-3.320
inion-3.094
ement-3.092
rehens-3.033
TokenGermany
Token ID27079
Feature activation+3.70e+00

Pos logit contributions
OLOG+0.635
OLOGY+0.540
indal+0.504
��+0.494
ologies+0.488
Neg logit contributions
adra-0.794
actic-0.771
ement-0.739
inion-0.729
atic-0.714
Token's
Token ID338
Feature activation-2.64e-02

Pos logit contributions
OLOG+3.326
OLOGY+2.926
indal+2.771
ologies+2.720
GH+2.699
Neg logit contributions
adra-2.727
actic-2.639
ement-2.460
inion-2.453
erva-2.375
Token leading
Token ID3756
Feature activation+2.00e+00

Pos logit contributions
adra+0.018
actic+0.017
ement+0.016
inion+0.016
atic+0.016
Neg logit contributions
OLOG-0.025
OLOGY-0.023
indal-0.021
��-0.021
ologies-0.021
Token news
Token ID1705
Feature activation+7.76e-01

Pos logit contributions
OLOG+1.920
OLOGY+1.700
indal+1.609
��+1.581
ologies+1.576
Neg logit contributions
adra-1.386
actic-1.335
ement-1.258
inion-1.234
atic-1.199
Token magazine
Token ID7093
Feature activation+2.26e+00

Pos logit contributions
OLOG+0.747
OLOGY+0.663
indal+0.628
��+0.619
ologies+0.614
Neg logit contributions
adra-0.532
actic-0.512
ement-0.484
inion-0.475
atic-0.460
Token Der
Token ID9626
Feature activation+1.22e+01

Pos logit contributions
OLOG+1.945
OLOGY+1.698
indal+1.595
��+1.562
ologies+1.558
Neg logit contributions
adra-1.782
actic-1.725
ement-1.637
inion-1.610
atic-1.568
Token Spiegel
Token ID46902
Feature activation+2.12e+00

Pos logit contributions
OLOG+8.403
OLOGY+6.988
OOK+6.821
GH+6.562
indal+6.499
Neg logit contributions
adra-11.073
actic-10.254
ement-10.012
inion-10.005
erva-9.759
Token said
Token ID531
Feature activation+4.38e-01

Pos logit contributions
OLOG+1.872
OLOGY+1.640
indal+1.545
��+1.516
ologies+1.509
Neg logit contributions
adra-1.618
actic-1.564
ement-1.482
inion-1.457
atic-1.418
Token in
Token ID287
Feature activation+2.69e+00

Pos logit contributions
OLOG+0.412
OLOGY+0.364
indal+0.344
��+0.338
ologies+0.337
Neg logit contributions
adra-0.311
actic-0.300
ement-0.283
inion-0.278
atic-0.270
Token a
Token ID257
Feature activation+2.14e+00

Pos logit contributions
OLOG+2.590
OLOGY+2.304
indal+2.168
ologies+2.137
GH+2.123
Neg logit contributions
adra-1.826
actic-1.760
ement-1.660
inion-1.628
atic-1.570
Token strongly
Token ID7634
Feature activation+3.32e+00

Pos logit contributions
OLOG+2.071
OLOGY+1.839
indal+1.738
ologies+1.705
��+1.702
Neg logit contributions
adra-1.467
actic-1.410
ement-1.326
inion-1.299
atic-1.263
Token It
Token ID632
Feature activation-3.68e-01

Pos logit contributions
OLOG+0.378
OLOGY+0.331
indal+0.313
��+0.308
ologies+0.305
Neg logit contributions
adra-0.328
actic-0.318
ement-0.302
inion-0.296
atic-0.289
Token's
Token ID338
Feature activation+3.84e+00

Pos logit contributions
adra+0.286
actic+0.277
ement+0.264
inion+0.258
atic+0.254
Neg logit contributions
OLOG-0.322
OLOGY-0.282
indal-0.265
��-0.262
ologies-0.257
Token paired
Token ID20312
Feature activation-1.33e+00

Pos logit contributions
OLOG+3.341
OLOGY+2.930
indal+2.747
GH+2.705
ologies+2.703
Neg logit contributions
adra-2.919
actic-2.814
inion-2.671
ement-2.669
erva-2.551
Token with
Token ID351
Feature activation-1.83e+00

Pos logit contributions
adra+0.845
actic+0.821
ement+0.767
inion+0.748
atic+0.734
Neg logit contributions
OLOG-1.335
OLOGY-1.190
indal-1.130
��-1.121
ologies-1.106
Token "
Token ID366
Feature activation+3.05e+00

Pos logit contributions
adra+1.306
actic+1.264
ement+1.193
inion+1.165
atic+1.138
Neg logit contributions
OLOG-1.715
OLOGY-1.515
indal-1.433
��-1.415
ologies-1.397
TokenDel
Token ID13856
Feature activation+1.22e+01

Pos logit contributions
OLOG+2.259
OLOGY+1.923
indal+1.781
GH+1.757
��+1.740
Neg logit contributions
adra-2.746
actic-2.672
ement-2.553
inion-2.535
atic-2.461
Tokeniver
Token ID1428
Feature activation-1.35e+00

Pos logit contributions
OLOG+10.439
indal+9.392
GH+8.929
OLOGY+8.910
IGH+8.862
Neg logit contributions
adra-7.775
actic-7.426
inion-6.977
ement-6.887
atic-6.727
Tokenance
Token ID590
Feature activation+2.29e-01

Pos logit contributions
adra+0.848
actic+0.812
ement+0.773
inion+0.743
atic+0.724
Neg logit contributions
OLOG-1.392
OLOGY-1.249
��-1.179
indal-1.175
ologies-1.154
Token"
Token ID1
Feature activation+6.63e-01

Pos logit contributions
OLOG+0.186
OLOGY+0.161
indal+0.151
��+0.148
ologies+0.147
Neg logit contributions
adra-0.190
actic-0.184
ement-0.175
inion-0.173
atic-0.169
Token on
Token ID319
Feature activation+2.29e+00

Pos logit contributions
OLOG+0.637
OLOGY+0.565
indal+0.535
��+0.524
ologies+0.524
Neg logit contributions
adra-0.456
actic-0.439
ement-0.414
inion-0.407
atic-0.393
Token Aug
Token ID2447
Feature activation-1.26e+00

Pos logit contributions
OLOG+2.173
OLOGY+1.926
indal+1.827
ologies+1.785
GH+1.782
Neg logit contributions
adra-1.592
actic-1.534
ement-1.445
inion-1.429
atic-1.376
Token,
Token ID11
Feature activation-4.85e-01

Pos logit contributions
OLOG+7.387
OLOGY+6.464
indal+6.244
GH+6.126
ologies+5.987
Neg logit contributions
adra-6.747
actic-6.583
ement-6.118
inion-6.083
rehens-5.979
Token royalty
Token ID29359
Feature activation-1.12e-01

Pos logit contributions
adra+0.353
actic+0.341
ement+0.323
inion+0.318
atic+0.309
Neg logit contributions
OLOG-0.447
OLOGY-0.394
indal-0.373
��-0.367
ologies-0.364
Token free
Token ID1479
Feature activation+1.30e+00

Pos logit contributions
adra+0.078
actic+0.075
ement+0.071
inion+0.070
atic+0.068
Neg logit contributions
OLOG-0.107
OLOGY-0.095
indal-0.090
��-0.089
ologies-0.088
Token<|endoftext|>
Token ID50256
Feature activation+1.63e+00

Pos logit contributions
OLOG+1.208
OLOGY+1.066
indal+1.008
GH+0.987
ologies+0.987
Neg logit contributions
adra-0.934
actic-0.899
ement-0.846
inion-0.833
atic-0.808
TokenJ
Token ID41
Feature activation+1.05e+00

Pos logit contributions
OLOG+1.230
OLOGY+1.051
indal+0.977
��+0.955
GH+0.949
Neg logit contributions
adra-1.457
actic-1.416
ement-1.355
inion-1.336
atic-1.304
Tokenür
Token ID25151
Feature activation+1.22e+01

Pos logit contributions
OLOG+0.916
OLOGY+0.802
indal+0.750
��+0.745
ologies+0.736
Neg logit contributions
adra-0.818
actic-0.794
ement-0.754
inion-0.739
atic-0.722
Tokengen
Token ID5235
Feature activation+5.02e+00

Pos logit contributions
OLOG+6.121
OLOGY+4.716
GH+4.476
indal+4.387
ologies+4.234
Neg logit contributions
adra-12.727
actic-12.084
inion-11.769
erva-11.753
ement-11.630
Token Klopp
Token ID43461
Feature activation-7.91e+00

Pos logit contributions
OLOG+3.502
OLOGY+2.961
indal+2.720
GH+2.689
��+2.646
Neg logit contributions
adra-4.735
actic-4.505
inion-4.356
ement-4.340
rehens-4.216
Token believes
Token ID5804
Feature activation-1.98e+00

Pos logit contributions
adra+5.617
actic+5.573
ement+5.371
atic+5.157
inion+5.018
Neg logit contributions
OLOG-7.011
OLOGY-6.111
indal-5.771
��-5.728
ologies-5.552
Token Liverpool
Token ID11761
Feature activation-3.53e+00

Pos logit contributions
adra+1.398
actic+1.358
ement+1.277
inion+1.246
atic+1.227
Neg logit contributions
OLOG-1.855
OLOGY-1.638
indal-1.559
��-1.533
GH-1.512
Token would
Token ID561
Feature activation-1.61e+00

Pos logit contributions
adra+2.572
actic+2.493
ement+2.354
inion+2.293
atic+2.277
Neg logit contributions
OLOG-3.231
OLOGY-2.836
indal-2.701
��-2.666
ologies-2.601
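
The per-token records above appear to follow a fixed layout: a `Token` line with the token text fused to the label (a leading space is part of the token), a `Token ID` line, a `Feature activation` line, then `Pos logit contributions` and `Neg logit contributions` lists whose entries are a token string immediately followed by a signed number. The following is a minimal sketch of a parser for that layout. It assumes the layout inferred from this dump, not a documented specification. The function name `parse_records` and the field names `token`, `token_id`, `activation`, `pos`, and `neg` are hypothetical and chosen only for illustration.

```python
import re

# A contribution line is a token string fused to a trailing signed number,
# e.g. "OLOG+0.846" or "adra-0.577". The token part may be empty when the
# token has no printable form in the dump.
CONTRIB = re.compile(r"^(.*?)([+-]\d+(?:\.\d+)?)$")


def parse_records(lines):
    """Parse flattened dashboard lines into a list of per-token dicts.

    Assumed layout (inferred from the dump, not from a spec):
    Token<text> / Token ID<int> / Feature activation<float> /
    Pos logit contributions / entries / Neg logit contributions / entries.
    """
    records = []
    current = None   # record being filled
    section = None   # "pos" or "neg" while reading contribution entries
    for raw in lines:
        line = raw.rstrip("\n")
        if (line.startswith("Token ID") and current is not None
                and "token_id" not in current):
            current["token_id"] = int(line[len("Token ID"):])
        elif line.startswith("Feature activation") and current is not None:
            current["activation"] = float(line[len("Feature activation"):])
        elif line == "Pos logit contributions":
            section = "pos"
        elif line == "Neg logit contributions":
            section = "neg"
        elif line.startswith("Token"):
            # Start of a new record; keep the token text verbatim,
            # including any leading space.
            current = {"token": line[len("Token"):], "pos": [], "neg": []}
            records.append(current)
            section = None
        elif section is not None and current is not None:
            match = CONTRIB.match(line)
            if match:
                current[section].append((match.group(1), float(match.group(2))))
    return records
```

Lines that appear before the first `Token` line (for example, the tail of a record cut off by an excerpt boundary) are skipped rather than guessed at. Entries whose token text did not survive extraction are kept with an empty string as the token, so the numeric value is not lost.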
Token are
Token ID389
Feature activation-1.18e+00

Pos logit contributions
OLOG+0.259
OLOGY+0.228
indal+0.215
��+0.211
ologies+0.210
Neg logit contributions
adra-0.219
actic-0.212
ement-0.201
inion-0.198
atic-0.192
Token complete
Token ID1844
Feature activation-1.27e+01

Pos logit contributions
adra+0.802
actic+0.770
ement+0.727
inion+0.710
atic+0.693
Neg logit contributions
OLOG-1.150
OLOGY-1.021
indal-0.970
��-0.952
ologies-0.946
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
adra+7.719
actic+7.416
ement+7.306
rehens+6.870
inion+6.796
Neg logit contributions
OLOG-11.815
OLOGY-10.616
indal-10.122
��-9.805
ologies-9.742
Token whole
Token ID2187
Feature activation-1.03e+01

Pos logit contributions
adra+1.252
actic+1.203
ement+1.135
inion+1.112
atic+1.087
Neg logit contributions
OLOG-1.823
OLOGY-1.620
indal-1.535
��-1.516
ologies-1.502
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
adra+6.128
actic+5.977
ement+5.736
inion+5.471
atic+5.399
Neg logit contributions
OLOG-9.913
OLOGY-8.973
indal-8.613
��-8.324
ologies-8.208
Token perfect
Token ID2818
Feature activation-1.46e+01

Pos logit contributions
adra+1.100
actic+1.056
ement+0.997
inion+0.977
atic+0.950
Neg logit contributions
OLOG-1.634
OLOGY-1.454
indal-1.379
��-1.359
ologies-1.347
Token and
Token ID290
Feature activation-6.66e-01

Pos logit contributions
adra+8.077
ement+7.798
actic+7.778
atic+7.296
inion+7.268
Neg logit contributions
OLOG-13.668
OLOGY-12.428
indal-11.984
GH-11.440
IGH-11.397
Token awakened
Token ID39217
Feature activation+1.42e+00

Pos logit contributions
adra+0.461
actic+0.443
ement+0.419
inion+0.410
atic+0.399
Neg logit contributions
OLOG-0.639
OLOGY-0.566
indal-0.537
��-0.528
ologies-0.524
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
OLOG+1.368
OLOGY+1.210
indal+1.145
��+1.130
ologies+1.125
Neg logit contributions
adra-0.973
actic-0.934
ement-0.875
inion-0.864
atic-0.832
Token<|endoftext|>
Token ID50256
Feature activation+1.82e+00

Pos logit contributions
adra+1.091
actic+1.052
ement+1.001
inion+0.979
atic+0.960
Neg logit contributions
OLOG-1.341
OLOGY-1.175
indal-1.127
��-1.117
ologies-1.097
TokenShe
Token ID3347
Feature activation-9.06e-01

Pos logit contributions
OLOG+1.375
OLOGY+1.175
indal+1.091
��+1.066
GH+1.062
Neg logit contributions
adra-1.631
actic-1.585
ement-1.516
inion-1.495
atic-1.459
Token and
Token ID290
Feature activation+9.93e-01

Pos logit contributions
adra+0.495
actic+0.480
ement+0.456
inion+0.445
atic+0.433
Neg logit contributions
OLOG-0.688
OLOGY-0.608
indal-0.577
��-0.573
GH-0.565
Token art
Token ID1242
Feature activation-2.77e+00

Pos logit contributions
OLOG+0.981
OLOGY+0.873
indal+0.829
��+0.814
ologies+0.812
Neg logit contributions
adra-0.656
actic-0.630
ement-0.592
inion-0.580
atic-0.563
Token are
Token ID389
Feature activation-1.47e+00

Pos logit contributions
adra+1.943
actic+1.906
ement+1.804
inion+1.754
atic+1.728
Neg logit contributions
OLOG-2.585
OLOGY-2.282
indal-2.171
��-2.146
GH-2.122
Token often
Token ID1690
Feature activation+3.29e+00

Pos logit contributions
adra+1.033
actic+1.001
ement+0.943
inion+0.921
atic+0.904
Neg logit contributions
OLOG-1.397
OLOGY-1.238
indal-1.174
��-1.156
GH-1.143
Token closely
Token ID7173
Feature activation-5.03e+00

Pos logit contributions
OLOG+2.972
OLOGY+2.593
indal+2.446
ologies+2.435
��+2.403
Neg logit contributions
adra-2.427
actic-2.320
ement-2.192
inion-2.187
erva-2.102
Token inter
Token ID987
Feature activation-1.44e+01

Pos logit contributions
adra+4.119
actic+4.019
ement+3.837
atic+3.733
inion+3.723
Neg logit contributions
OLOG-4.046
OLOGY-3.560
indal-3.397
��-3.307
GH-3.224
Tokenming
Token ID2229
Feature activation-6.27e+00

Pos logit contributions
actic+8.164
adra+7.583
pell+7.293
ement+7.210
atic+6.960
Neg logit contributions
OLOG-13.994
OLOGY-12.732
��-12.360
IGH-11.887
-11.819
Tokenled
Token ID992
Feature activation-7.25e+00

Pos logit contributions
adra+2.493
actic+2.435
ement+2.235
atic+1.985
inion+1.959
Neg logit contributions
OLOG-7.755
OLOGY-7.119
��-6.913
indal-6.733
-6.691
Token as
Token ID355
Feature activation-4.36e+00

Pos logit contributions
actic+4.153
adra+4.024
ement+3.821
atic+3.706
inion+3.589
Neg logit contributions
OLOG-7.501
OLOGY-6.816
��-6.496
indal-6.401
GH-6.282
Token new
Token ID649
Feature activation-8.38e-02

Pos logit contributions
adra+2.910
actic+2.863
ement+2.683
atic+2.597
inion+2.594
Neg logit contributions
OLOG-4.140
OLOGY-3.695
indal-3.516
��-3.480
GH-3.428
Token values
Token ID3815
Feature activation-2.52e+00

Pos logit contributions
adra+0.064
actic+0.063
ement+0.059
inion+0.058
atic+0.057
Neg logit contributions
OLOG-0.074
OLOGY-0.065
indal-0.061
��-0.060
GH-0.059
Token's
Token ID338
Feature activation+4.38e-01

Pos logit contributions
OLOG+2.676
OLOGY+2.350
indal+2.205
ologies+2.168
��+2.156
Neg logit contributions
adra-2.332
actic-2.258
ement-2.132
inion-2.114
atic-2.039
Token reck
Token ID12482
Feature activation+6.11e-01

Pos logit contributions
OLOG+0.425
OLOGY+0.377
indal+0.358
��+0.352
ologies+0.349
Neg logit contributions
adra-0.298
actic-0.287
ement-0.270
inion-0.265
atic-0.257
Tokenlessness
Token ID17587
Feature activation-2.24e-01

Pos logit contributions
OLOG+0.495
OLOGY+0.429
indal+0.402
��+0.393
ologies+0.391
Neg logit contributions
adra-0.512
actic-0.497
ement-0.473
inion-0.466
atic-0.455
Token is
Token ID318
Feature activation-1.02e-01

Pos logit contributions
adra+0.158
actic+0.153
ement+0.145
inion+0.140
atic+0.138
Neg logit contributions
OLOG-0.213
OLOGY-0.188
indal-0.179
��-0.177
GH-0.174
Token a
Token ID257
Feature activation-2.40e+00

Pos logit contributions
adra+0.073
actic+0.071
ement+0.067
inion+0.065
atic+0.064
Neg logit contributions
OLOG-0.096
OLOGY-0.085
indal-0.080
��-0.079
ologies-0.078
Token perfect
Token ID2818
Feature activation-1.41e+01

Pos logit contributions
adra+1.618
actic+1.566
ement+1.486
inion+1.424
atic+1.420
Neg logit contributions
OLOG-2.326
OLOGY-2.067
indal-1.958
��-1.941
ologies-1.911
Token match
Token ID2872
Feature activation-9.31e+00

Pos logit contributions
actic+7.620
adra+7.506
ement+7.487
atic+7.140
ions+6.892
Neg logit contributions
OLOG-13.745
OLOGY-12.248
indal-11.780
��-11.522
ologies-11.350
Token for
Token ID329
Feature activation-6.04e+00

Pos logit contributions
adra+4.993
actic+4.876
ement+4.651
atic+4.444
inion+4.225
Neg logit contributions
OLOG-9.871
OLOGY-8.888
��-8.513
indal-8.426
GH-8.260
Token Stephen
Token ID7970
Feature activation+5.03e+00

Pos logit contributions
adra+4.434
actic+4.347
ement+4.078
atic+3.984
inion+3.980
Neg logit contributions
OLOG-5.363
OLOGY-4.690
��-4.486
indal-4.456
GH-4.342
Token's
Token ID338
Feature activation-3.03e-02

Pos logit contributions
OLOG+4.445
OLOGY+3.932
indal+3.729
ologies+3.622
GH+3.615
Neg logit contributions
adra-3.695
actic-3.510
ement-3.346
inion-3.336
anmar-3.200
Token self
Token ID2116
Feature activation+3.27e+00

Pos logit contributions
adra+0.021
actic+0.020
ement+0.019
inion+0.019
atic+0.018
Neg logit contributions
OLOG-0.029
OLOGY-0.026
indal-0.024
��-0.024
ologies-0.024
Tokenarrow
Token ID6018
Feature activation+4.41e-01

Pos logit contributions
OLOG+0.986
OLOGY+0.856
indal+0.805
��+0.788
ologies+0.782
Neg logit contributions
adra-0.973
actic-0.944
ement-0.900
inion-0.885
atic-0.865
Token Green
Token ID3469
Feature activation+7.15e+00

Pos logit contributions
OLOG+0.385
OLOGY+0.337
indal+0.317
��+0.312
ologies+0.309
Neg logit contributions
adra-0.343
actic-0.333
ement-0.316
inion-0.310
atic-0.304
Token Arrow
Token ID19408
Feature activation-2.28e+00

Pos logit contributions
OLOG+6.179
OLOGY+5.437
indal+5.058
GH+5.053
ologies+4.988
Neg logit contributions
adra-5.392
actic-5.168
ement-4.862
inion-4.851
rehens-4.656
Token was
Token ID373
Feature activation-1.53e+00

Pos logit contributions
adra+1.714
actic+1.674
ement+1.590
inion+1.552
atic+1.543
Neg logit contributions
OLOG-2.021
OLOGY-1.768
indal-1.678
��-1.664
ologies-1.631
Token the
Token ID262
Feature activation-3.93e+00

Pos logit contributions
adra+1.073
actic+1.038
ement+0.985
inion+0.952
atic+0.940
Neg logit contributions
OLOG-1.447
OLOGY-1.280
indal-1.214
��-1.196
ologies-1.189
Token perfect
Token ID2818
Feature activation-1.41e+01

Pos logit contributions
adra+2.564
actic+2.509
ement+2.338
inion+2.265
atic+2.252
Neg logit contributions
OLOG-3.829
OLOGY-3.409
indal-3.292
��-3.241
ologies-3.165
Token safety
Token ID3747
Feature activation+4.05e+00

Pos logit contributions
actic+8.784
adra+8.500
ement+8.219
atic+8.125
inion+7.879
Neg logit contributions
OLOG-12.222
indal-10.832
OLOGY-10.783
��-10.293
ologies-10.128
Token net
Token ID2010
Feature activation-6.74e-02

Pos logit contributions
OLOG+3.469
OLOGY+3.009
indal+2.843
GH+2.785
��+2.779
Neg logit contributions
adra-3.144
actic-3.030
ement-2.884
inion-2.838
atic-2.764
Token to
Token ID284
Feature activation-2.30e+00

Pos logit contributions
adra+0.044
actic+0.042
ement+0.040
inion+0.039
atic+0.038
Neg logit contributions
OLOG-0.068
OLOGY-0.060
indal-0.057
��-0.057
ologies-0.056
Token protect
Token ID1805
Feature activation+2.31e+00

Pos logit contributions
adra+1.625
actic+1.582
ement+1.496
inion+1.437
atic+1.429
Neg logit contributions
OLOG-2.156
OLOGY-1.908
indal-1.807
��-1.773
ologies-1.755
Token The
Token ID383
Feature activation-1.32e+00

Pos logit contributions
OLOG+2.125
OLOGY+1.869
indal+1.757
��+1.738
GH+1.731
Neg logit contributions
adra-1.682
actic-1.615
ement-1.527
inion-1.511
atic-1.456
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
OLOG+0.108
OLOGY+0.096
indal+0.091
ologies+0.090
��+0.090
Neg logit contributions
adra-0.072
actic-0.069
ement-0.065
inion-0.064
atic-0.062
Token time
Token ID640
Feature activation-5.35e-01

Pos logit contributions
OLOG+1.857
OLOGY+1.672
indal+1.593
��+1.568
ologies+1.566
Neg logit contributions
adra-0.946
actic-0.901
ement-0.838
inion-0.821
atic-0.785
Token.
Token ID13
Feature activation-9.44e-01

Pos logit contributions
adra+0.345
actic+0.333
ement+0.313
inion+0.306
atic+0.297
Neg logit contributions
OLOG-0.536
OLOGY-0.478
indal-0.455
��-0.448
ologies-0.444
Token
Token ID198
Feature activation-4.62e-01

Pos logit contributions
adra+0.687
actic+0.670
ement+0.634
inion+0.615
atic+0.606
Neg logit contributions
OLOG-0.861
OLOGY-0.756
indal-0.727
��-0.720
ologies-0.706
Token
Token ID198
Feature activation-3.68e-02

Pos logit contributions
adra+0.330
actic+0.320
ement+0.304
inion+0.295
atic+0.291
Neg logit contributions
OLOG-0.431
OLOGY-0.379
��-0.366
indal-0.363
ologies-0.351
TokenPerfect
Token ID36635
Feature activation-1.40e+01

Pos logit contributions
adra+0.033
actic+0.032
ement+0.030
inion+0.030
atic+0.029
Neg logit contributions
OLOG-0.028
OLOGY-0.024
indal-0.023
��-0.022
ologies-0.022
Token Trade
Token ID9601
Feature activation+4.21e+00

Pos logit contributions
adra+9.823
actic+9.788
ement+9.643
atic+9.269
inion+9.131
Neg logit contributions
OLOG-11.006
OLOGY-9.543
indal-9.364
��-9.175
GH-8.937
Token School
Token ID3961
Feature activation+3.77e+00

Pos logit contributions
OLOG+3.497
OLOGY+3.031
indal+2.843
ologies+2.798
GH+2.784
Neg logit contributions
adra-3.426
actic-3.270
ement-3.123
inion-3.070
atic-3.007
Token For
Token ID1114
Feature activation-2.73e+00

Pos logit contributions
OLOG+3.348
OLOGY+2.942
indal+2.765
ologies+2.733
��+2.700
Neg logit contributions
adra-2.840
actic-2.717
ement-2.563
inion-2.549
erva-2.466
Token Veterans
Token ID18950
Feature activation-8.53e-01

Pos logit contributions
adra+1.998
actic+1.939
ement+1.841
inion+1.777
atic+1.754
Neg logit contributions
OLOG-2.486
OLOGY-2.186
indal-2.084
��-2.049
ologies-2.020
Token
Token ID198
Feature activation-9.11e-01

Pos logit contributions
adra+0.633
actic+0.612
ement+0.580
inion+0.569
atic+0.555
Neg logit contributions
OLOG-0.775
OLOGY-0.681
indal-0.643
��-0.633
ologies-0.627
Tokenunk
Token ID2954
Feature activation+2.65e+00

Pos logit contributions
adra+3.756
actic+3.674
inion+3.527
ement+3.513
atic+3.423
Neg logit contributions
OLOG-3.114
OLOGY-2.639
��-2.503
indal-2.482
-2.397
Token continued
Token ID3767
Feature activation-7.06e-01

Pos logit contributions
OLOG+2.328
OLOGY+2.038
indal+1.919
��+1.883
ologies+1.874
Neg logit contributions
adra-2.046
actic-1.978
ement-1.878
inion-1.847
atic-1.796
Token to
Token ID284
Feature activation-4.41e-01

Pos logit contributions
adra+0.429
actic+0.411
ement+0.384
inion+0.376
atic+0.363
Neg logit contributions
OLOG-0.735
OLOGY-0.658
indal-0.627
��-0.617
ologies-0.614
Token seem
Token ID1283
Feature activation+2.87e+00

Pos logit contributions
adra+0.323
actic+0.315
ement+0.298
inion+0.289
atic+0.284
Neg logit contributions
OLOG-0.406
OLOGY-0.356
indal-0.338
��-0.335
ologies-0.327
Token both
Token ID1111
Feature activation-3.13e+00

Pos logit contributions
OLOG+2.630
OLOGY+2.313
indal+2.186
ologies+2.148
��+2.134
Neg logit contributions
adra-2.094
actic-2.018
ement-1.905
inion-1.886
atic-1.810
Token vital
Token ID9204
Feature activation-1.39e+01

Pos logit contributions
adra+2.219
actic+2.176
ement+2.063
atic+2.007
inion+1.974
Neg logit contributions
OLOG-2.897
OLOGY-2.564
��-2.436
indal-2.425
ologies-2.330
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
actic+6.750
adra+6.191
atic+6.024
ement+5.966
inion+5.464
Neg logit contributions
OLOG-14.320
OLOGY-13.112
��-13.046
GH-12.448
indal-12.429
Token volatile
Token ID22750
Feature activation-4.18e+00

Pos logit contributions
adra+0.882
actic+0.864
ement+0.811
inion+0.790
atic+0.786
Neg logit contributions
OLOG-1.128
OLOGY-1.000
indal-0.939
��-0.933
ologies-0.912
Token for
Token ID329
Feature activation-5.54e+00

Pos logit contributions
actic+2.656
adra+2.647
ement+2.487
atic+2.393
inion+2.365
Neg logit contributions
OLOG-4.128
OLOGY-3.690
��-3.539
indal-3.515
GH-3.429
Token [
Token ID685
Feature activation+1.23e+00

Pos logit contributions
adra+3.616
actic+3.536
ement+3.278
inion+3.186
atic+3.115
Neg logit contributions
OLOG-5.401
OLOGY-4.795
��-4.619
indal-4.615
GH-4.428
Tokenso
Token ID568
Feature activation-9.38e+00

Pos logit contributions
OLOG+1.081
OLOGY+0.946
indal+0.891
��+0.874
ologies+0.870
Neg logit contributions
adra-0.955
actic-0.923
ement-0.876
inion-0.862
atic-0.839
Token Instagram
Token ID10767
Feature activation+3.18e+00

Pos logit contributions
OLOG+0.846
OLOGY+0.729
indal+0.683
��+0.669
ologies+0.662
Neg logit contributions
adra-0.914
actic-0.887
ement-0.847
inion-0.833
atic-0.815
Token following
Token ID1708
Feature activation+8.18e-01

Pos logit contributions
OLOG+2.841
OLOGY+2.491
indal+2.330
GH+2.293
ologies+2.291
Neg logit contributions
adra-2.384
actic-2.303
ement-2.179
inion-2.153
atic-2.075
Token the
Token ID262
Feature activation-2.45e-01

Pos logit contributions
OLOG+0.783
OLOGY+0.693
indal+0.656
��+0.643
ologies+0.642
Neg logit contributions
adra-0.567
actic-0.546
ement-0.515
inion-0.507
atic-0.490
Token incident
Token ID4519
Feature activation+1.85e+00

Pos logit contributions
adra+0.181
actic+0.176
ement+0.165
inion+0.162
atic+0.158
Neg logit contributions
OLOG-0.223
OLOGY-0.196
indal-0.187
��-0.185
GH-0.181
Token:
Token ID25
Feature activation+2.58e+00

Pos logit contributions
OLOG+1.777
OLOGY+1.575
indal+1.491
��+1.467
ologies+1.460
Neg logit contributions
adra-1.270
actic-1.224
ement-1.153
inion-1.133
atic-1.097
Token Perfect
Token ID16374
Feature activation-1.38e+01

Pos logit contributions
OLOG+2.022
OLOGY+1.745
indal+1.582
GH+1.566
ologies+1.561
Neg logit contributions
adra-2.238
actic-2.153
ement-2.051
inion-2.028
erva-1.974
Token @
Token ID2488
Feature activation+3.17e+00

Pos logit contributions
adra+8.583
ement+8.564
actic+8.507
atic+8.011
ions+7.989
Neg logit contributions
OLOG-11.837
OLOGY-10.600
indal-10.390
��-10.077
IGH-9.988
TokenE
Token ID36
Feature activation+2.49e+00

Pos logit contributions
OLOG+2.720
OLOGY+2.365
indal+2.227
GH+2.183
��+2.177
Neg logit contributions
adra-2.492
actic-2.403
ement-2.275
inion-2.251
atic-2.182
Tokenaster
Token ID1603
Feature activation+3.56e+00

Pos logit contributions
OLOG+2.517
OLOGY+2.244
indal+2.137
��+2.094
GH+2.084
Neg logit contributions
adra-1.584
actic-1.519
ement-1.427
inion-1.390
atic-1.349
Token Sunday
Token ID3502
Feature activation+9.56e-01

Pos logit contributions
OLOG+2.866
OLOGY+2.475
indal+2.319
GH+2.263
ologies+2.250
Neg logit contributions
adra-2.968
actic-2.894
ement-2.749
inion-2.718
atic-2.632
Token.
Token ID13
Feature activation+3.37e-01

Pos logit contributions
OLOG+0.888
OLOGY+0.784
indal+0.741
��+0.728
ologies+0.724
Neg logit contributions
adra-0.689
actic-0.664
ement-0.628
inion-0.617
atic-0.599
TokenDirect
Token ID13470
Feature activation-3.44e+00

Pos logit contributions
OLOG+1.389
OLOGY+1.187
indal+1.111
��+1.087
ologies+1.076
Neg logit contributions
adra-1.653
actic-1.608
ement-1.539
inion-1.514
atic-1.485
Token,
Token ID11
Feature activation-5.65e-01

Pos logit contributions
adra+2.025
actic+1.936
ement+1.831
inion+1.774
atic+1.723
Neg logit contributions
OLOG-3.655
OLOGY-3.277
indal-3.141
��-3.101
ologies-3.053
Token structured
Token ID20793
Feature activation-1.20e-01

Pos logit contributions
adra+0.396
actic+0.383
ement+0.361
inion+0.355
atic+0.346
Neg logit contributions
OLOG-0.537
OLOGY-0.476
indal-0.450
��-0.445
ologies-0.438
Token communication
Token ID6946
Feature activation-4.88e+00

Pos logit contributions
adra+0.076
actic+0.073
ement+0.068
inion+0.067
atic+0.064
Neg logit contributions
OLOG-0.122
OLOGY-0.109
indal-0.103
ologies-0.102
��-0.101
Token is
Token ID318
Feature activation-1.06e+00

Pos logit contributions
adra+2.843
actic+2.776
ement+2.599
atic+2.509
inion+2.432
Neg logit contributions
OLOG-5.142
OLOGY-4.597
indal-4.450
��-4.412
GH-4.297
Token vital
Token ID9204
Feature activation-1.38e+01

Pos logit contributions
adra+0.691
actic+0.663
ement+0.624
inion+0.609
atic+0.592
Neg logit contributions
OLOG-1.065
OLOGY-0.948
indal-0.902
��-0.888
ologies-0.881
Token to
Token ID284
Feature activation-5.44e+00

Pos logit contributions
actic+6.288
adra+6.006
ement+5.868
atic+5.815
inion+5.110
Neg logit contributions
OLOG-14.803
OLOGY-13.679
��-13.653
indal-12.876
-12.805
Token building
Token ID2615
Feature activation-1.81e-01

Pos logit contributions
adra+3.547
actic+3.438
ement+3.176
atic+3.043
inion+3.031
Neg logit contributions
OLOG-5.397
OLOGY-4.833
indal-4.601
��-4.581
-4.408
Token a
Token ID257
Feature activation-1.59e+00

Pos logit contributions
adra+0.115
actic+0.110
ement+0.103
inion+0.102
atic+0.098
Neg logit contributions
OLOG-0.183
OLOGY-0.163
indal-0.155
��-0.152
ologies-0.152
Token strong
Token ID1913
Feature activation-1.56e+00

Pos logit contributions
adra+1.067
actic+1.030
ement+0.977
inion+0.945
atic+0.927
Neg logit contributions
OLOG-1.551
OLOGY-1.378
indal-1.312
��-1.299
ologies-1.273
Token,
Token ID11
Feature activation+1.24e-01

Pos logit contributions
adra+1.032
actic+0.995
ement+0.940
inion+0.919
atic+0.892
Neg logit contributions
OLOG-1.535
OLOGY-1.365
indal-1.300
��-1.280
ologies-1.264
Token Ch
Token ID609
Feature activation-4.04e+00

Pos logit contributions
OLOG+0.140
OLOGY+0.125
indal+0.118
��+0.117
ologies+0.116
Neg logit contributions
adra-0.094
actic-0.090
ement-0.085
inion-0.083
atic-0.081
Tokeniff
Token ID733
Feature activation+1.11e-02

Pos logit contributions
adra+2.723
actic+2.596
ement+2.483
inion+2.415
atic+2.347
Neg logit contributions
OLOG-3.969
OLOGY-3.539
��-3.364
indal-3.297
-3.250
Tokenre
Token ID260
Feature activation-1.73e-01

Pos logit contributions
OLOG+0.011
OLOGY+0.010
indal+0.009
��+0.009
ologies+0.009
Neg logit contributions
adra-0.007
actic-0.007
ement-0.007
inion-0.007
atic-0.006
Token (
Token ID357
Feature activation+1.38e+00

Pos logit contributions
adra+0.124
actic+0.120
ement+0.114
inion+0.111
atic+0.109
Neg logit contributions
OLOG-0.161
OLOGY-0.142
indal-0.135
��-0.132
ologies-0.131
TokenPeter
Token ID19727
Feature activation-3.08e+00

Pos logit contributions
OLOG+1.150
OLOGY+0.996
indal+0.937
GH+0.916
��+0.914
Neg logit contributions
adra-1.131
actic-1.095
ement-1.041
inion-1.028
atic-0.998
Token Lor
Token ID26068
Feature activation-1.36e+01

Pos logit contributions
adra+2.394
actic+2.319
ement+2.188
inion+2.167
atic+2.114
Neg logit contributions
OLOG-2.693
OLOGY-2.359
indal-2.224
��-2.195
ologies-2.154
Tokenre
Token ID260
Feature activation-4.82e+00

Pos logit contributions
actic+6.782
adra+6.498
ions+5.779
ement+5.725
atic+5.575
Neg logit contributions
OLOG-15.303
��-14.046
OLOGY-13.781
-13.115
-12.754
Token)
Token ID8
Feature activation-2.24e+00

Pos logit contributions
adra+2.363
actic+2.272
ement+2.073
inion+2.031
atic+1.963
Neg logit contributions
OLOG-5.546
OLOGY-5.025
indal-4.788
��-4.785
ologies-4.711
Token does
Token ID857
Feature activation-4.73e+00

Pos logit contributions
adra+1.641
actic+1.593
ement+1.528
inion+1.471
atic+1.453
Neg logit contributions
OLOG-2.033
OLOGY-1.795
indal-1.707
��-1.691
ologies-1.648
Token something
Token ID1223
Feature activation+2.48e+00

Pos logit contributions
adra+3.114
actic+3.059
ement+2.827
inion+2.745
atic+2.690
Neg logit contributions
OLOG-4.639
OLOGY-4.123
indal-3.904
��-3.894
ologies-3.834
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
OLOG+2.227
OLOGY+1.955
indal+1.841
��+1.807
ologies+1.805
Neg logit contributions
adra-1.862
actic-1.793
ement-1.701
inion-1.683
atic-1.622
Token James
Token ID3700
Feature activation+8.99e-01

Pos logit contributions
adra+0.243
actic+0.233
ement+0.220
inion+0.215
atic+0.209
Neg logit contributions
OLOG-0.340
OLOGY-0.302
indal-0.287
��-0.283
GH-0.280
Token Comey
Token ID12388
Feature activation-1.02e+01

Pos logit contributions
OLOG+0.784
OLOGY+0.686
indal+0.644
GH+0.628
ologies+0.626
Neg logit contributions
adra-0.698
actic-0.676
ement-0.631
inion-0.631
atic-0.612
Token affair
Token ID14669
Feature activation-3.37e+00

Pos logit contributions
adra+6.277
actic+6.235
ement+6.091
inion+5.751
atic+5.711
Neg logit contributions
OLOG-9.550
��-8.319
OLOGY-8.317
indal-8.148
GH-7.884
Token is
Token ID318
Feature activation-4.35e-01

Pos logit contributions
adra+2.463
actic+2.392
ement+2.288
atic+2.205
inion+2.189
Neg logit contributions
OLOG-3.055
OLOGY-2.671
indal-2.552
��-2.539
GH-2.456
Token the
Token ID262
Feature activation-2.87e+00

Pos logit contributions
adra+0.294
actic+0.282
ement+0.266
inion+0.260
atic+0.254
Neg logit contributions
OLOG-0.424
OLOGY-0.376
indal-0.357
��-0.351
ologies-0.349
Token perfect
Token ID2818
Feature activation-1.36e+01

Pos logit contributions
adra+1.893
actic+1.808
ement+1.702
inion+1.636
atic+1.635
Neg logit contributions
OLOG-2.821
OLOGY-2.516
indal-2.405
��-2.386
GH-2.342
Token example
Token ID1672
Feature activation+8.05e-01

Pos logit contributions
adra+6.638
actic+6.588
ement+6.285
atic+6.002
inion+5.491
Neg logit contributions
OLOG-13.904
OLOGY-12.572
indal-12.277
��-12.134
ologies-11.871
Token.
Token ID13
Feature activation-8.26e-01

Pos logit contributions
OLOG+0.756
OLOGY+0.666
indal+0.629
ologies+0.617
��+0.617
Neg logit contributions
adra-0.573
actic-0.551
ement-0.519
inion-0.518
rehens-0.492
Token On
Token ID1550
Feature activation-1.01e-01

Pos logit contributions
adra+0.612
actic+0.592
ement+0.565
inion+0.550
atic+0.539
Neg logit contributions
OLOG-0.746
OLOGY-0.655
indal-0.626
��-0.621
ologies-0.609
Token substance
Token ID9136
Feature activation+5.62e+00

Pos logit contributions
adra+0.070
actic+0.067
ement+0.063
inion+0.062
atic+0.060
Neg logit contributions
OLOG-0.097
OLOGY-0.086
indal-0.081
��-0.080
ologies-0.079
Token,
Token ID11
Feature activation+2.31e+00

Pos logit contributions
OLOG+5.167
OLOGY+4.584
ologies+4.245
��+4.222
indal+4.197
Neg logit contributions
adra-3.940
actic-3.798
inion-3.535
ement-3.510
anmar-3.393
Token to
Token ID284
Feature activation-1.55e+00

Pos logit contributions
OLOG+1.163
OLOGY+1.032
indal+0.985
ologies+0.971
��+0.965
Neg logit contributions
adra-0.758
actic-0.712
ement-0.665
inion-0.663
rehens-0.634
Token those
Token ID883
Feature activation+6.38e-01

Pos logit contributions
adra+1.117
actic+1.082
ement+1.030
inion+0.999
atic+0.982
Neg logit contributions
OLOG-1.424
OLOGY-1.256
indal-1.191
��-1.174
GH-1.153
Token who
Token ID508
Feature activation+1.80e+00

Pos logit contributions
OLOG+0.640
OLOGY+0.570
indal+0.542
��+0.533
ologies+0.531
Neg logit contributions
adra-0.413
actic-0.396
ement-0.372
inion-0.365
atic-0.353
Token identify
Token ID5911
Feature activation+3.59e+00

Pos logit contributions
OLOG+1.454
OLOGY+1.260
indal+1.176
ologies+1.156
GH+1.148
Neg logit contributions
adra-1.516
actic-1.468
ement-1.395
inion-1.377
atic-1.342
Token as
Token ID355
Feature activation-1.25e-01

Pos logit contributions
OLOG+3.263
OLOGY+2.859
indal+2.672
ologies+2.664
��+2.648
Neg logit contributions
adra-2.605
actic-2.503
ement-2.331
inion-2.329
erva-2.234
Token inter
Token ID987
Feature activation-1.36e+01

Pos logit contributions
adra+0.082
actic+0.079
ement+0.074
inion+0.072
atic+0.070
Neg logit contributions
OLOG-0.124
OLOGY-0.111
indal-0.105
��-0.103
ologies-0.103
Tokensex
Token ID8044
Feature activation+5.85e-01

Pos logit contributions
adra+5.043
actic+4.844
ement+4.618
atic+4.442
pell+4.294
Neg logit contributions
OLOG-15.849
OLOGY-14.335
��-14.146
indal-13.923
IGH-13.718
Token,
Token ID11
Feature activation+2.28e-01

Pos logit contributions
OLOG+0.566
OLOGY+0.502
indal+0.476
��+0.469
ologies+0.466
Neg logit contributions
adra-0.399
actic-0.385
ement-0.363
inion-0.356
atic-0.345
Token third
Token ID2368
Feature activation-2.46e+00

Pos logit contributions
OLOG+0.218
OLOGY+0.193
indal+0.183
��+0.180
ologies+0.179
Neg logit contributions
adra-0.158
actic-0.152
ement-0.143
inion-0.141
atic-0.136
Token-
Token ID12
Feature activation-3.05e+00

Pos logit contributions
adra+1.477
actic+1.440
ement+1.360
inion+1.317
atic+1.283
Neg logit contributions
OLOG-2.556
OLOGY-2.289
indal-2.185
��-2.171
ologies-2.127
Tokengender
Token ID8388
Feature activation+1.14e+00

Pos logit contributions
adra+1.833
actic+1.764
ement+1.660
inion+1.611
atic+1.568
Neg logit contributions
OLOG-3.192
OLOGY-2.856
indal-2.729
��-2.704
ologies-2.652
Token its
Token ID663
Feature activation-1.10e+00

Pos logit contributions
adra+2.643
actic+2.522
ement+2.391
inion+2.330
atic+2.306
Neg logit contributions
OLOG-3.831
OLOGY-3.380
indal-3.254
��-3.206
GH-3.141
Token multiple
Token ID3294
Feature activation-3.17e+00

Pos logit contributions
adra+0.769
actic+0.742
ement+0.701
inion+0.686
atic+0.672
Neg logit contributions
OLOG-1.038
OLOGY-0.916
indal-0.873
��-0.858
GH-0.846
Token component
Token ID7515
Feature activation-5.16e+00

Pos logit contributions
adra+2.247
actic+2.201
ement+2.102
inion+2.048
atic+2.034
Neg logit contributions
OLOG-2.896
OLOGY-2.542
indal-2.437
��-2.406
GH-2.348
Token boxes
Token ID10559
Feature activation-3.75e+00

Pos logit contributions
adra+3.535
actic+3.476
ement+3.354
atic+3.211
inion+3.206
Neg logit contributions
OLOG-4.760
OLOGY-4.188
indal-4.059
��-3.986
GH-3.925
Token,
Token ID11
Feature activation-7.95e-01

Pos logit contributions
adra+2.366
actic+2.266
ement+2.156
inion+2.082
atic+2.051
Neg logit contributions
OLOG-3.806
OLOGY-3.385
indal-3.238
��-3.165
GH-3.149
Token inter
Token ID987
Feature activation-1.35e+01

Pos logit contributions
adra+0.571
actic+0.552
ement+0.522
inion+0.511
atic+0.500
Neg logit contributions
OLOG-0.740
OLOGY-0.652
indal-0.619
��-0.609
GH-0.600
Tokenconnect
Token ID8443
Feature activation-5.47e+00

Pos logit contributions
actic+8.699
adra+8.367
ement+8.046
atic+7.812
pell+7.759
Neg logit contributions
OLOG-11.969
OLOGY-10.453
��-10.098
indal-10.069
IGH-10.048
Tokens
Token ID82
Feature activation-5.60e+00

Pos logit contributions
adra+4.194
actic+4.073
ement+3.967
inion+3.798
atic+3.771
Neg logit contributions
OLOG-4.663
OLOGY-4.041
indal-3.892
��-3.799
GH-3.775
Token,
Token ID11
Feature activation-2.33e+00

Pos logit contributions
adra+3.340
actic+3.227
ement+3.049
inion+2.925
atic+2.895
Neg logit contributions
OLOG-5.801
OLOGY-5.169
indal-4.975
��-4.865
GH-4.842
Token cables
Token ID18018
Feature activation-5.71e+00

Pos logit contributions
adra+1.451
actic+1.394
ement+1.313
inion+1.285
atic+1.247
Neg logit contributions
OLOG-2.376
OLOGY-2.106
indal-2.024
��-1.995
GH-1.965
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
adra+3.377
actic+3.245
ement+3.091
inion+2.958
atic+2.912
Neg logit contributions
OLOG-5.928
OLOGY-5.282
indal-5.087
��-4.987
GH-4.949
Token
Token ID447
Feature activation-5.76e-01

Pos logit contributions
adra+0.747
actic+0.727
ement+0.699
inion+0.689
atic+0.677
Neg logit contributions
OLOG-0.535
OLOGY-0.449
indal-0.417
��-0.412
ologies-0.403
Token
Token ID251
Feature activation+1.02e+00

Pos logit contributions
adra+0.480
actic+0.464
ement+0.442
inion+0.437
atic+0.427
Neg logit contributions
OLOG-0.472
OLOGY-0.407
indal-0.384
ologies-0.372
��-0.371
Token
Token ID198
Feature activation-8.27e-02

Pos logit contributions
OLOG+0.863
OLOGY+0.750
indal+0.705
GH+0.689
��+0.687
Neg logit contributions
adra-0.826
actic-0.795
ement-0.750
inion-0.743
atic-0.721
Token
Token ID198
Feature activation+9.35e-01

Pos logit contributions
adra+0.059
actic+0.058
ement+0.055
inion+0.053
atic+0.052
Neg logit contributions
OLOG-0.077
OLOGY-0.068
��-0.065
indal-0.065
ologies-0.063
TokenThe
Token ID464
Feature activation+8.84e-01

Pos logit contributions
OLOG+0.732
OLOGY+0.630
indal+0.591
��+0.581
ologies+0.573
Neg logit contributions
adra-0.810
actic-0.787
ement-0.751
inion-0.738
atic-0.725
Token Perfect
Token ID16374
Feature activation-1.34e+01

Pos logit contributions
OLOG+0.777
OLOGY+0.681
indal+0.642
��+0.631
ologies+0.625
Neg logit contributions
adra-0.680
actic-0.659
ement-0.625
inion-0.614
atic-0.599
Token Gentleman
Token ID42112
Feature activation-2.05e+00

Pos logit contributions
actic+9.325
ement+9.268
adra+9.250
atic+8.955
ions+8.805
Neg logit contributions
OLOG-10.749
OLOGY-9.375
��-9.006
indal-8.847
IGH-8.787
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
adra+1.461
actic+1.426
ement+1.355
inion+1.319
atic+1.300
Neg logit contributions
OLOG-1.911
OLOGY-1.684
indal-1.591
��-1.581
ologies-1.551
Token 1860
Token ID37637
Feature activation-5.94e-01

Pos logit contributions
adra+0.976
actic+0.946
ement+0.898
inion+0.875
atic+0.853
Neg logit contributions
OLOG-1.369
OLOGY-1.211
indal-1.148
��-1.139
ologies-1.121
Token.
Token ID13
Feature activation-4.09e+00

Pos logit contributions
adra+0.407
actic+0.390
ement+0.367
inion+0.362
atic+0.349
Neg logit contributions
OLOG-0.574
OLOGY-0.509
indal-0.484
��-0.474
GH-0.473
Token
Token ID198
Feature activation-1.60e+00

Pos logit contributions
adra+3.116
actic+3.019
ement+2.870
inion+2.818
atic+2.747
Neg logit contributions
OLOG-3.601
OLOGY-3.152
indal-2.982
��-2.934
ologies-2.899
Token film
Token ID2646
Feature activation-3.94e+00

Pos logit contributions
OLOG+1.293
OLOGY+1.143
indal+1.082
��+1.063
ologies+1.058
Neg logit contributions
adra-0.971
actic-0.937
ement-0.885
inion-0.869
atic-0.844
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
adra+2.457
actic+2.358
ement+2.240
inion+2.142
atic+2.117
Neg logit contributions
OLOG-4.006
OLOGY-3.579
indal-3.412
��-3.394
ologies-3.305
Token The
Token ID383
Feature activation-5.37e-01

Pos logit contributions
adra+1.353
actic+1.318
ement+1.252
inion+1.207
atic+1.207
Neg logit contributions
OLOG-1.689
OLOGY-1.489
��-1.441
indal-1.439
ologies-1.389
Token ride
Token ID6594
Feature activation+3.39e+00

Pos logit contributions
adra+0.383
actic+0.370
ement+0.349
inion+0.341
atic+0.333
Neg logit contributions
OLOG-0.501
OLOGY-0.443
indal-0.421
��-0.415
GH-0.408
Token is
Token ID318
Feature activation+4.90e-01

Pos logit contributions
OLOG+2.958
OLOGY+2.587
indal+2.439
��+2.391
ologies+2.380
Neg logit contributions
adra-2.622
actic-2.536
ement-2.408
inion-2.369
atic-2.306
Token perfect
Token ID2818
Feature activation-1.34e+01

Pos logit contributions
OLOG+0.430
OLOGY+0.377
indal+0.355
��+0.349
ologies+0.346
Neg logit contributions
adra-0.378
actic-0.366
ement-0.348
inion-0.341
atic-0.333
Token for
Token ID329
Feature activation-4.92e+00

Pos logit contributions
actic+6.896
adra+6.771
ement+6.512
atic+6.338
warm+5.944
Neg logit contributions
OLOG-13.331
OLOGY-12.091
��-11.666
indal-11.560
-11.229
Token a
Token ID257
Feature activation-4.03e-01

Pos logit contributions
adra+3.331
actic+3.252
ement+3.051
atic+2.898
inion+2.893
Neg logit contributions
OLOG-4.677
OLOGY-4.151
indal-3.953
��-3.924
ologies-3.832
Token vigilante
Token ID49326
Feature activation+4.28e+00

Pos logit contributions
adra+0.287
actic+0.278
ement+0.263
inion+0.257
atic+0.251
Neg logit contributions
OLOG-0.377
OLOGY-0.333
indal-0.316
��-0.310
ologies-0.308
Token who
Token ID508
Feature activation+1.38e+00

Pos logit contributions
OLOG+3.993
OLOGY+3.521
indal+3.353
GH+3.306
��+3.265
Neg logit contributions
adra-3.011
actic-2.855
ement-2.723
inion-2.681
erva-2.597
Token wants
Token ID3382
Feature activation-4.40e+00

Pos logit contributions
OLOG+1.175
OLOGY+1.023
indal+0.962
��+0.946
ologies+0.934
Neg logit contributions
adra-1.099
actic-1.064
ement-1.013
inion-0.994
atic-0.972
Token Ferdinand
Token ID44312
Feature activation-2.81e+00

Pos logit contributions
adra+0.407
actic+0.394
ement+0.377
inion+0.368
atic+0.359
Neg logit contributions
OLOG-0.470
OLOGY-0.412
indal-0.386
��-0.381
ologies-0.376
Token's
Token ID338
Feature activation-1.10e+00

Pos logit contributions
adra+1.841
actic+1.780
ement+1.671
inion+1.617
atic+1.605
Neg logit contributions
OLOG-2.794
OLOGY-2.478
indal-2.382
��-2.326
ologies-2.309
Token deep
Token ID2769
Feature activation+1.21e+00

Pos logit contributions
adra+0.724
actic+0.699
ement+0.658
inion+0.642
atic+0.626
Neg logit contributions
OLOG-1.089
OLOGY-0.968
indal-0.923
��-0.908
ologies-0.899
Token cross
Token ID3272
Feature activation+4.31e-02

Pos logit contributions
OLOG+1.184
OLOGY+1.052
indal+0.995
��+0.980
ologies+0.975
Neg logit contributions
adra-0.816
actic-0.783
ement-0.737
inion-0.724
atic-0.701
Token,
Token ID11
Feature activation+5.69e-01

Pos logit contributions
OLOG+0.041
OLOGY+0.037
indal+0.035
��+0.034
GH+0.034
Neg logit contributions
adra-0.030
actic-0.029
ement-0.027
inion-0.026
atic-0.026
Token Lor
Token ID26068
Feature activation-1.34e+01

Pos logit contributions
OLOG+0.525
OLOGY+0.463
indal+0.437
��+0.430
ologies+0.428
Neg logit contributions
adra-0.413
actic-0.398
ement-0.377
inion-0.371
atic-0.360
Tokenik
Token ID1134
Feature activation+2.28e+00

Pos logit contributions
adra+5.999
actic+5.948
ement+5.408
atic+4.963
ions+4.846
Neg logit contributions
OLOG-15.330
OLOGY-13.841
��-13.617
-13.035
-13.027
Token Can
Token ID1680
Feature activation-3.41e+00

Pos logit contributions
OLOG+1.895
OLOGY+1.646
indal+1.545
��+1.515
ologies+1.504
Neg logit contributions
adra-1.859
actic-1.801
ement-1.715
inion-1.688
atic-1.647
Tokena
Token ID64
Feature activation+7.81e-01

Pos logit contributions
adra+2.263
actic+2.200
ement+2.082
inion+2.003
atic+1.957
Neg logit contributions
OLOG-3.374
OLOGY-3.007
indal-2.806
��-2.791
IGH-2.765
Token sent
Token ID1908
Feature activation+1.86e+00

Pos logit contributions
OLOG+0.728
OLOGY+0.642
indal+0.608
��+0.597
ologies+0.594
Neg logit contributions
adra-0.561
actic-0.541
ement-0.512
inion-0.502
atic-0.489
Token another
Token ID1194
Feature activation+2.84e+00

Pos logit contributions
OLOG+1.749
OLOGY+1.546
indal+1.446
��+1.437
ologies+1.435
Neg logit contributions
adra-1.322
actic-1.267
ement-1.195
inion-1.186
atic-1.139
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
OLOG+1.815
OLOGY+1.595
indal+1.504
ologies+1.475
��+1.472
Neg logit contributions
adra-1.539
actic-1.480
ement-1.398
inion-1.380
atic-1.337
Token Strategy
Token ID20561
Feature activation+2.06e+00

Pos logit contributions
OLOG+1.074
OLOGY+0.944
indal+0.890
ologies+0.869
��+0.869
Neg logit contributions
adra-0.892
actic-0.860
ement-0.813
inion-0.802
atic-0.777
Token Justin
Token ID10799
Feature activation+3.57e+00

Pos logit contributions
OLOG+1.803
OLOGY+1.581
indal+1.488
ologies+1.461
GH+1.454
Neg logit contributions
adra-1.583
actic-1.525
ement-1.439
inion-1.422
atic-1.378
Token Reid
Token ID16095
Feature activation-3.35e+00

Pos logit contributions
OLOG+2.990
OLOGY+2.604
indal+2.469
GH+2.409
ologies+2.406
Neg logit contributions
adra-2.827
actic-2.738
ement-2.586
inion-2.559
atic-2.487
Token was
Token ID373
Feature activation-1.41e+00

Pos logit contributions
adra+2.489
actic+2.387
ement+2.301
inion+2.236
atic+2.204
Neg logit contributions
OLOG-3.051
OLOGY-2.669
indal-2.492
��-2.481
ologies-2.425
Token rapt
Token ID38404
Feature activation-1.33e+01

Pos logit contributions
adra+1.039
actic+1.003
ement+0.951
inion+0.930
atic+0.912
Neg logit contributions
OLOG-1.298
OLOGY-1.143
indal-1.080
��-1.063
ologies-1.058
Token Mc
Token ID1982
Feature activation+1.18e+00

Pos logit contributions
adra+7.174
actic+7.146
ement+7.080
atic+6.584
warm+6.188
Neg logit contributions
OLOG-12.981
OLOGY-11.877
��-11.141
indal-11.043
GH-11.020
TokenGovern
Token ID29168
Feature activation-9.39e-01

Pos logit contributions
OLOG+0.850
OLOGY+0.721
indal+0.667
��+0.654
ologies+0.647
Neg logit contributions
adra-1.102
actic-1.070
ement-1.026
inion-1.012
atic-0.991
Token had
Token ID550
Feature activation-2.25e+00

Pos logit contributions
adra+0.704
actic+0.677
ement+0.647
inion+0.631
atic+0.617
Neg logit contributions
OLOG-0.849
OLOGY-0.745
indal-0.703
��-0.692
ologies-0.684
Token committed
Token ID5364
Feature activation-2.20e+00

Pos logit contributions
adra+1.590
actic+1.528
ement+1.460
inion+1.413
atic+1.379
Neg logit contributions
OLOG-2.137
OLOGY-1.888
indal-1.789
��-1.766
ologies-1.737
Token to
Token ID284
Feature activation-3.99e+00

Pos logit contributions
adra+1.325
actic+1.272
ement+1.194
inion+1.154
atic+1.122
Neg logit contributions
OLOG-2.314
OLOGY-2.073
indal-1.975
��-1.953
ologies-1.929
Token
Token ID198
Feature activation+7.55e-01

Pos logit contributions
OLOG+1.278
OLOGY+1.123
indal+1.068
��+1.059
ologies+1.039
Neg logit contributions
adra-1.038
actic-1.006
ement-0.954
inion-0.935
atic-0.913
TokenMod
Token ID5841
Feature activation-4.94e+00

Pos logit contributions
OLOG+0.574
OLOGY+0.491
indal+0.460
��+0.453
ologies+0.446
Neg logit contributions
adra-0.671
actic-0.651
ement-0.623
inion-0.614
atic-0.601
Tokener
Token ID263
Feature activation-1.02e+00

Pos logit contributions
adra+2.650
actic+2.496
ement+2.342
inion+2.287
atic+2.211
Neg logit contributions
OLOG-5.430
OLOGY-4.896
��-4.734
indal-4.676
ologies-4.606
Tokenator
Token ID1352
Feature activation+9.34e-01

Pos logit contributions
adra+0.495
actic+0.470
ement+0.430
inion+0.418
atic+0.404
Neg logit contributions
OLOG-1.187
OLOGY-1.073
indal-1.025
��-1.019
ologies-1.007
Token Elaine
Token ID41463
Feature activation+2.37e+00

Pos logit contributions
OLOG+0.848
OLOGY+0.746
indal+0.705
��+0.694
ologies+0.688
Neg logit contributions
adra-0.691
actic-0.669
ement-0.634
inion-0.622
atic-0.606
Token Qu
Token ID2264
Feature activation-1.33e+01

Pos logit contributions
OLOG+1.967
OLOGY+1.708
indal+1.604
��+1.571
ologies+1.560
Neg logit contributions
adra-1.941
actic-1.878
ement-1.786
inion-1.762
atic-1.714
Tokenij
Token ID2926
Feature activation+5.11e+00

Pos logit contributions
adra+7.360
inion+7.033
ement+6.294
actic+6.116
atic+5.856
Neg logit contributions
OLOG-14.012
��-13.505
-12.969
OLOGY-12.797
-12.139
Tokenano
Token ID5733
Feature activation-1.50e+00

Pos logit contributions
OLOG+4.626
OLOGY+4.060
indal+3.951
ologies+3.809
GH+3.793
Neg logit contributions
adra-3.621
actic-3.484
ement-3.294
rehens-3.190
inion-3.190
Token of
Token ID286
Feature activation+9.99e-01

Pos logit contributions
adra+1.118
actic+1.080
ement+1.035
inion+1.011
atic+0.991
Neg logit contributions
OLOG-1.342
OLOGY-1.181
indal-1.122
��-1.111
ologies-1.087
Token CBS
Token ID11133
Feature activation-4.83e-01

Pos logit contributions
OLOG+0.849
OLOGY+0.740
indal+0.696
��+0.681
ologies+0.677
Neg logit contributions
adra-0.799
actic-0.774
ement-0.736
inion-0.724
atic-0.706
Token let
Token ID1309
Feature activation-6.60e+00

Pos logit contributions
adra+0.345
actic+0.335
ement+0.315
inion+0.312
atic+0.303
Neg logit contributions
OLOG-0.449
OLOGY-0.396
indal-0.375
��-0.372
ologies-0.366
Token housing
Token ID5627
Feature activation-1.90e-01

Pos logit contributions
adra+0.169
actic+0.162
ement+0.154
inion+0.151
atic+0.147
Neg logit contributions
OLOG-0.222
OLOGY-0.195
indal-0.185
��-0.183
GH-0.181
Token at
Token ID379
Feature activation-2.93e-01

Pos logit contributions
adra+0.140
actic+0.135
ement+0.128
inion+0.126
atic+0.122
Neg logit contributions
OLOG-0.173
OLOGY-0.153
indal-0.144
��-0.142
ologies-0.140
Token D
Token ID360
Feature activation-8.65e-02

Pos logit contributions
adra+0.226
actic+0.219
ement+0.208
inion+0.204
atic+0.199
Neg logit contributions
OLOG-0.258
OLOGY-0.226
indal-0.213
��-0.210
ologies-0.207
TokenIT
Token ID2043
Feature activation-2.74e+00

Pos logit contributions
adra+0.062
actic+0.060
ement+0.057
inion+0.056
atic+0.054
Neg logit contributions
OLOG-0.080
OLOGY-0.071
indal-0.067
��-0.066
ologies-0.065
Token,
Token ID11
Feature activation+8.20e-01

Pos logit contributions
adra+2.106
actic+2.075
ement+1.958
inion+1.940
atic+1.877
Neg logit contributions
OLOG-2.361
OLOGY-2.065
indal-1.957
��-1.926
ologies-1.893
Token Lor
Token ID26068
Feature activation-1.32e+01

Pos logit contributions
OLOG+0.700
OLOGY+0.611
indal+0.574
��+0.562
ologies+0.560
Neg logit contributions
adra-0.652
actic-0.632
ement-0.600
inion-0.590
atic-0.576
Tokencan
Token ID5171
Feature activation-1.69e-01

Pos logit contributions
adra+7.524
actic+7.113
ement+6.775
inion+6.603
atic+6.479
Neg logit contributions
OLOG-13.564
OLOGY-12.203
��-11.741
-11.583
-11.329
Token Sir
Token ID7361
Feature activation-7.10e-01

Pos logit contributions
adra+0.134
actic+0.130
ement+0.124
inion+0.121
atic+0.118
Neg logit contributions
OLOG-0.146
OLOGY-0.127
indal-0.119
��-0.118
ologies-0.116
Tokenr
Token ID81
Feature activation+1.65e+00

Pos logit contributions
adra+0.514
actic+0.495
ement+0.470
inion+0.461
atic+0.450
Neg logit contributions
OLOG-0.660
OLOGY-0.584
indal-0.546
��-0.543
ologies-0.535
Token,
Token ID11
Feature activation+4.18e-01

Pos logit contributions
OLOG+1.630
OLOGY+1.449
indal+1.375
��+1.355
ologies+1.346
Neg logit contributions
adra-1.095
actic-1.054
ement-0.992
inion-0.972
atic-0.942
Token says
Token ID1139
Feature activation-1.22e+00

Pos logit contributions
OLOG+0.371
OLOGY+0.326
indal+0.307
��+0.303
ologies+0.299
Neg logit contributions
adra-0.318
actic-0.307
ement-0.292
inion-0.287
atic-0.279
Token distracted
Token ID22943
Feature activation+1.29e+00

Pos logit contributions
adra+0.622
actic+0.600
ement+0.565
inion+0.551
atic+0.537
Neg logit contributions
OLOG-0.912
OLOGY-0.811
indal-0.772
��-0.762
GH-0.751
Token way
Token ID835
Feature activation+1.25e+00

Pos logit contributions
OLOG+1.234
OLOGY+1.092
indal+1.033
ologies+1.015
��+1.014
Neg logit contributions
adra-0.892
actic-0.855
ement-0.805
inion-0.794
atic-0.768
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
OLOG+1.174
OLOGY+1.039
indal+0.981
ologies+0.967
��+0.961
Neg logit contributions
adra-0.877
actic-0.845
ement-0.792
inion-0.785
atic-0.755
Token like
Token ID588
Feature activation-6.28e+00

Pos logit contributions
OLOG+0.097
OLOGY+0.086
indal+0.082
��+0.080
ologies+0.080
Neg logit contributions
adra-0.072
actic-0.069
ement-0.066
inion-0.064
atic-0.063
Token an
Token ID281
Feature activation-1.75e+00

Pos logit contributions
adra+4.054
actic+3.886
ement+3.653
atic+3.456
inion+3.444
Neg logit contributions
OLOG-6.122
OLOGY-5.464
indal-5.227
��-5.165
GH-5.070
Token inter
Token ID987
Feature activation-1.32e+01

Pos logit contributions
adra+1.234
actic+1.200
ement+1.131
inion+1.105
atic+1.081
Neg logit contributions
OLOG-1.654
OLOGY-1.462
indal-1.383
��-1.368
GH-1.352
Tokenmin
Token ID1084
Feature activation-8.22e+00

Pos logit contributions
actic+8.944
adra+8.695
ement+8.385
pell+8.148
inion+7.836
Neg logit contributions
OLOG-11.458
OLOGY-9.996
��-9.767
indal-9.572
-9.450
Tokenable
Token ID540
Feature activation-5.98e+00

Pos logit contributions
actic+3.937
adra+3.765
ement+3.641
inion+3.377
atic+3.297
Neg logit contributions
OLOG-9.524
OLOGY-8.595
��-8.255
IGH-8.148
indal-8.084
Token stage
Token ID3800
Feature activation-4.82e+00

Pos logit contributions
adra+4.041
actic+3.992
ement+3.777
inion+3.602
atic+3.545
Neg logit contributions
OLOG-5.592
OLOGY-4.954
��-4.726
indal-4.716
GH-4.611
Token whisper
Token ID31992
Feature activation-1.79e+00

Pos logit contributions
adra+3.165
actic+3.104
ement+2.918
inion+2.836
atic+2.783
Neg logit contributions
OLOG-4.685
OLOGY-4.180
indal-3.985
��-3.962
ologies-3.900
Token (
Token ID357
Feature activation+8.96e-01

Pos logit contributions
adra+1.189
actic+1.146
ement+1.083
inion+1.053
atic+1.028
Neg logit contributions
OLOG-1.758
OLOGY-1.564
indal-1.482
��-1.469
GH-1.450
Token,
Token ID11
Feature activation-4.22e-01

Pos logit contributions
adra+0.991
actic+0.961
ement+0.905
inion+0.878
atic+0.866
Neg logit contributions
OLOG-1.514
OLOGY-1.350
indal-1.281
��-1.266
GH-1.253
Token making
Token ID1642
Feature activation-5.37e+00

Pos logit contributions
adra+0.281
actic+0.271
ement+0.255
inion+0.249
atic+0.243
Neg logit contributions
OLOG-0.416
OLOGY-0.370
indal-0.351
��-0.346
ologies-0.343
Token it
Token ID340
Feature activation-1.35e+00

Pos logit contributions
adra+3.408
actic+3.296
ement+3.064
inion+2.965
atic+2.933
Neg logit contributions
OLOG-5.386
OLOGY-4.814
indal-4.547
��-4.532
GH-4.484
Token both
Token ID1111
Feature activation-1.60e+00

Pos logit contributions
adra+1.013
actic+0.985
ement+0.929
inion+0.909
atic+0.896
Neg logit contributions
OLOG-1.208
OLOGY-1.062
indal-1.002
��-0.988
ologies-0.975
Token a
Token ID257
Feature activation-2.77e+00

Pos logit contributions
adra+1.154
actic+1.128
ement+1.062
inion+1.034
atic+1.019
Neg logit contributions
OLOG-1.475
OLOGY-1.302
indal-1.232
��-1.218
ologies-1.199
Token perfect
Token ID2818
Feature activation-1.32e+01

Pos logit contributions
adra+1.855
actic+1.811
ement+1.704
inion+1.645
atic+1.619
Neg logit contributions
OLOG-2.689
OLOGY-2.385
indal-2.271
��-2.239
ologies-2.213
Token fit
Token ID4197
Feature activation-1.03e+01

Pos logit contributions
actic+7.974
adra+7.815
ement+7.398
atic+7.080
inion+6.983
Neg logit contributions
OLOG-12.505
OLOGY-11.093
indal-10.708
GH-10.448
��-10.342
Token for
Token ID329
Feature activation-5.22e+00

Pos logit contributions
adra+5.436
actic+5.427
ement+5.069
atic+4.871
inion+4.642
Neg logit contributions
OLOG-10.735
OLOGY-9.740
��-9.204
indal-9.132
GH-9.101
Token portable
Token ID17726
Feature activation-1.43e-01

Pos logit contributions
adra+3.368
actic+3.280
ement+3.119
inion+2.985
atic+2.950
Neg logit contributions
OLOG-5.142
OLOGY-4.564
indal-4.315
��-4.275
GH-4.236
Token play
Token ID711
Feature activation-1.55e+00

Pos logit contributions
adra+0.097
actic+0.093
ement+0.088
inion+0.086
atic+0.084
Neg logit contributions
OLOG-0.139
OLOGY-0.123
indal-0.117
��-0.115
ologies-0.114
Token,
Token ID11
Feature activation-4.91e-01

Pos logit contributions
adra+0.981
actic+0.944
ement+0.894
inion+0.863
atic+0.842
Neg logit contributions
OLOG-1.579
OLOGY-1.405
indal-1.336
��-1.319
GH-1.309
Token several
Token ID1811
Feature activation+5.11e+00

Pos logit contributions
adra+0.176
actic+0.169
ement+0.159
inion+0.155
atic+0.151
Neg logit contributions
OLOG-0.262
OLOGY-0.233
indal-0.222
��-0.218
ologies-0.217
Token forums
Token ID14216
Feature activation+2.33e+00

Pos logit contributions
OLOG+4.665
OLOGY+4.097
indal+3.867
ologies+3.803
IGH+3.757
Neg logit contributions
adra-3.690
actic-3.566
ement-3.369
inion-3.331
atic-3.196
Token Monday
Token ID3321
Feature activation-5.05e-01

Pos logit contributions
OLOG+2.236
OLOGY+1.979
indal+1.871
��+1.842
ologies+1.841
Neg logit contributions
adra-1.591
actic-1.538
ement-1.442
inion-1.423
atic-1.364
Token while
Token ID981
Feature activation+2.92e-02

Pos logit contributions
adra+0.333
actic+0.320
ement+0.302
inion+0.295
atic+0.287
Neg logit contributions
OLOG-0.500
OLOGY-0.445
indal-0.423
��-0.416
ologies-0.413
Token in
Token ID287
Feature activation-1.94e+00

Pos logit contributions
OLOG+0.030
OLOGY+0.027
indal+0.025
��+0.025
ologies+0.025
Neg logit contributions
adra-0.018
actic-0.017
ement-0.016
inion-0.016
atic-0.016
Token Iowa
Token ID9406
Feature activation+9.01e+00

Pos logit contributions
adra+1.692
actic+1.641
ement+1.562
inion+1.528
atic+1.507
Neg logit contributions
OLOG-1.512
OLOGY-1.296
indal-1.203
��-1.201
ologies-1.177
Token and
Token ID290
Feature activation+2.72e+00

Pos logit contributions
OLOG+7.942
OLOGY+7.032
indal+6.612
ologies+6.548
GH+6.466
Neg logit contributions
adra-6.280
actic-6.142
inion-5.741
ement-5.699
rehens-5.601
Token has
Token ID468
Feature activation-9.33e-01

Pos logit contributions
OLOG+2.402
OLOGY+2.105
indal+1.994
��+1.945
ologies+1.937
Neg logit contributions
adra-2.068
actic-1.998
ement-1.890
inion-1.868
atic-1.811
Token three
Token ID1115
Feature activation+1.70e+00

Pos logit contributions
adra+0.668
actic+0.644
ement+0.610
inion+0.597
atic+0.582
Neg logit contributions
OLOG-0.872
OLOGY-0.771
indal-0.729
��-0.721
ologies-0.711
Token more
Token ID517
Feature activation+2.59e+00

Pos logit contributions
OLOG+1.576
OLOGY+1.390
indal+1.315
��+1.292
ologies+1.284
Neg logit contributions
adra-1.228
actic-1.186
ement-1.121
inion-1.101
atic-1.070
Token forums
Token ID14216
Feature activation+3.51e+00

Pos logit contributions
OLOG+2.466
OLOGY+2.178
indal+2.073
ologies+2.034
��+2.015
Neg logit contributions
adra-1.788
actic-1.722
ement-1.627
inion-1.598
atic-1.541
Token
Token ID198
Feature activation+4.21e-01

Pos logit contributions
OLOG+0.620
OLOGY+0.549
indal+0.520
��+0.514
ologies+0.508
Neg logit contributions
adra-0.446
actic-0.431
ement-0.408
inion-0.400
atic-0.389
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
OLOG+0.389
OLOGY+0.342
��+0.327
indal+0.326
ologies+0.316
Neg logit contributions
adra-0.305
actic-0.295
ement-0.281
inion-0.274
atic-0.269
TokenClick
Token ID8164
Feature activation-3.08e+00

Pos logit contributions
OLOG+0.753
OLOGY+0.635
indal+0.590
��+0.579
ologies+0.568
Neg logit contributions
adra-1.028
actic-1.000
ement-0.960
inion-0.948
atic-0.928
Token here
Token ID994
Feature activation-3.07e+00

Pos logit contributions
adra+2.551
actic+2.475
ement+2.366
inion+2.327
atic+2.272
Neg logit contributions
OLOG-2.514
OLOGY-2.178
indal-2.043
��-2.014
GH-1.981
Token to
Token ID284
Feature activation+2.57e+00

Pos logit contributions
adra+1.838
actic+1.760
ement+1.644
inion+1.607
atic+1.551
Neg logit contributions
OLOG-3.229
OLOGY-2.893
indal-2.757
��-2.715
ologies-2.702
Token subscribe
Token ID12383
Feature activation+9.34e+00

Pos logit contributions
OLOG+2.514
OLOGY+2.235
indal+2.123
��+2.081
ologies+2.077
Neg logit contributions
adra-1.718
actic-1.651
ement-1.554
inion-1.523
atic-1.475
Token via
Token ID2884
Feature activation+6.83e+00

Pos logit contributions
OLOG+7.157
OLOGY+6.175
indal+5.714
ologies+5.687
GH+5.656
Neg logit contributions
adra-7.802
actic-7.355
ement-6.914
rehens-6.810
inion-6.804
Token St
Token ID520
Feature activation+3.73e+00

Pos logit contributions
OLOG+5.288
OLOGY+4.572
indal+4.263
ologies+4.210
GH+4.167
Neg logit contributions
adra-5.870
actic-5.646
ement-5.358
inion-5.274
erva-5.237
Tokenitcher
Token ID23640
Feature activation-6.29e-01

Pos logit contributions
OLOG+4.334
OLOGY+3.932
indal+3.806
GH+3.781
ologies+3.770
Neg logit contributions
adra-1.750
actic-1.566
ement-1.449
inion-1.393
erva-1.365
Token
Token ID198
Feature activation-4.65e-02

Pos logit contributions
adra+0.500
actic+0.485
ement+0.461
inion+0.454
atic+0.443
Neg logit contributions
OLOG-0.536
OLOGY-0.467
indal-0.439
��-0.432
ologies-0.428
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
adra+0.033
actic+0.032
ement+0.031
inion+0.030
atic+0.029
Neg logit contributions
OLOG-0.044
OLOGY-0.038
��-0.037
indal-0.037
ologies-0.036
Token new
Token ID649
Feature activation+3.95e+00

Pos logit contributions
OLOG+0.350
OLOGY+0.310
indal+0.294
��+0.289
ologies+0.287
Neg logit contributions
adra-0.254
actic-0.245
ement-0.231
inion-0.227
atic-0.220
Token political
Token ID1964
Feature activation+9.04e-01

Pos logit contributions
OLOG+3.695
OLOGY+3.277
indal+3.086
ologies+3.028
GH+3.024
Neg logit contributions
adra-2.743
actic-2.644
ement-2.474
inion-2.443
erva-2.374
Token odd
Token ID5629
Feature activation+2.31e+00

Pos logit contributions
OLOG+0.832
OLOGY+0.733
indal+0.693
��+0.678
ologies+0.677
Neg logit contributions
adra-0.658
actic-0.635
ement-0.600
inion-0.590
atic-0.573
Token couple
Token ID3155
Feature activation-1.66e+00

Pos logit contributions
OLOG+2.004
OLOGY+1.755
indal+1.649
��+1.615
ologies+1.614
Neg logit contributions
adra-1.804
actic-1.751
ement-1.655
inion-1.634
atic-1.591
Token:
Token ID25
Feature activation+1.30e+00

Pos logit contributions
adra+1.123
actic+1.088
ement+1.025
inion+1.003
atic+0.977
Neg logit contributions
OLOG-1.621
OLOGY-1.434
indal-1.360
��-1.346
GH-1.326
Token Del
Token ID4216
Feature activation+8.05e+00

Pos logit contributions
OLOG+1.150
OLOGY+1.007
indal+0.948
��+0.926
ologies+0.925
Neg logit contributions
adra-1.000
actic-0.966
ement-0.917
inion-0.902
atic-0.877
Token.
Token ID13
Feature activation+6.15e-01

Pos logit contributions
OLOG+7.427
indal+6.581
OLOGY+6.480
GH+6.301
IGH+6.229
Neg logit contributions
adra-5.030
actic-4.932
ement-4.727
inion-4.568
rehens-4.454
Token Todd
Token ID14377
Feature activation+2.43e+00

Pos logit contributions
OLOG+0.535
OLOGY+0.467
indal+0.440
��+0.433
ologies+0.429
Neg logit contributions
adra-0.480
actic-0.465
ement-0.442
inion-0.434
atic-0.424
Token C
Token ID327
Feature activation+1.15e+00

Pos logit contributions
OLOG+2.046
OLOGY+1.785
indal+1.682
��+1.637
ologies+1.636
Neg logit contributions
adra-1.939
actic-1.883
ement-1.782
inion-1.760
atic-1.714
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
OLOG+1.096
OLOGY+0.969
indal+0.930
ologies+0.905
GH+0.902
Neg logit contributions
adra-0.792
actic-0.764
ement-0.714
inion-0.707
atic-0.683
Token Gilbert
Token ID24023
Feature activation-2.75e+00

Pos logit contributions
adra+0.012
actic+0.011
ement+0.011
inion+0.011
atic+0.010
Neg logit contributions
OLOG-0.013
OLOGY-0.011
indal-0.010
��-0.010
ologies-0.010
Token of
Token ID286
Feature activation+3.65e-01

Pos logit contributions
adra+0.319
actic+0.308
ement+0.291
inion+0.285
atic+0.277
Neg logit contributions
OLOG-0.451
OLOGY-0.400
indal-0.380
��-0.373
GH-0.370
Token minutes
Token ID2431
Feature activation-2.16e+00

Pos logit contributions
OLOG+0.357
OLOGY+0.317
indal+0.300
ologies+0.295
��+0.294
Neg logit contributions
adra-0.244
actic-0.235
ement-0.222
inion-0.217
atic-0.210
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
adra+1.435
actic+1.399
ement+1.317
inion+1.283
atic+1.254
Neg logit contributions
OLOG-2.119
OLOGY-1.879
indal-1.790
��-1.778
ologies-1.745
Token Once
Token ID4874
Feature activation+1.18e+00

Pos logit contributions
OLOG+0.026
OLOGY+0.023
indal+0.022
��+0.022
ologies+0.021
Neg logit contributions
adra-0.023
actic-0.022
ement-0.021
inion-0.020
atic-0.020
Token it
Token ID340
Feature activation+4.96e-01

Pos logit contributions
OLOG+1.085
OLOGY+0.956
indal+0.903
��+0.888
ologies+0.883
Neg logit contributions
adra-0.862
actic-0.832
ement-0.787
inion-0.776
atic-0.750
Token became
Token ID2627
Feature activation+7.27e+00

Pos logit contributions
OLOG+0.432
OLOGY+0.378
indal+0.356
��+0.349
ologies+0.347
Neg logit contributions
adra-0.385
actic-0.372
ement-0.353
inion-0.348
atic-0.339
Token clear
Token ID1598
Feature activation-6.87e+00

Pos logit contributions
OLOG+5.497
OLOGY+4.717
indal+4.385
ologies+4.297
GH+4.239
Neg logit contributions
adra-6.167
actic-5.947
inion-5.743
ement-5.648
erva-5.601
Token that
Token ID326
Feature activation-1.85e-01

Pos logit contributions
adra+4.171
actic+4.125
ement+3.836
atic+3.703
inion+3.589
Neg logit contributions
OLOG-6.851
OLOGY-6.116
indal-5.824
��-5.811
ologies-5.687
Token the
Token ID262
Feature activation+8.33e-02

Pos logit contributions
adra+0.137
actic+0.132
ement+0.125
inion+0.123
atic+0.119
Neg logit contributions
OLOG-0.169
OLOGY-0.149
indal-0.141
��-0.138
ologies-0.137
Token situation
Token ID3074
Feature activation-3.01e+00

Pos logit contributions
OLOG+0.078
OLOGY+0.069
indal+0.065
��+0.064
ologies+0.064
Neg logit contributions
adra-0.059
actic-0.057
ement-0.054
inion-0.053
atic-0.052
Token was
Token ID373
Feature activation-5.31e-01

Pos logit contributions
adra+2.325
actic+2.269
ement+2.165
atic+2.100
inion+2.076
Neg logit contributions
OLOG-2.605
OLOGY-2.265
indal-2.192
��-2.152
GH-2.086
Token we
Token ID356
Feature activation+8.84e-01

Pos logit contributions
adra+0.618
actic+0.595
ement+0.564
inion+0.552
atic+0.536
Neg logit contributions
OLOG-0.806
OLOGY-0.709
indal-0.671
��-0.666
ologies-0.655
Token take
Token ID1011
Feature activation-3.57e+00

Pos logit contributions
OLOG+0.785
OLOGY+0.687
indal+0.647
��+0.637
ologies+0.630
Neg logit contributions
adra-0.678
actic-0.655
ement-0.623
inion-0.611
atic-0.596
Token these
Token ID777
Feature activation-1.03e+00

Pos logit contributions
adra+2.340
actic+2.241
ement+2.104
inion+2.055
atic+1.987
Neg logit contributions
OLOG-3.525
OLOGY-3.139
indal-3.001
��-2.998
GH-2.913
Token steps
Token ID4831
Feature activation+4.40e+00

Pos logit contributions
adra+0.732
actic+0.708
ement+0.670
inion+0.656
atic+0.643
Neg logit contributions
OLOG-0.973
OLOGY-0.862
indal-0.815
��-0.812
GH-0.792
Token that
Token ID326
Feature activation+3.78e+00

Pos logit contributions
OLOG+4.104
OLOGY+3.633
indal+3.422
ologies+3.404
GH+3.357
Neg logit contributions
adra-3.088
actic-2.979
ement-2.796
inion-2.764
atic-2.660
Token could
Token ID714
Feature activation+7.54e+00

Pos logit contributions
OLOG+3.196
OLOGY+2.801
indal+2.634
ologies+2.597
GH+2.574
Neg logit contributions
adra-2.982
actic-2.898
ement-2.742
inion-2.707
atic-2.631
Token have
Token ID423
Feature activation+3.37e+00

Pos logit contributions
OLOG+6.853
OLOGY+6.151
indal+5.873
GH+5.764
ologies+5.752
Neg logit contributions
adra-5.192
actic-4.817
inion-4.645
ement-4.596
erva-4.454
Token unforeseen
Token ID49591
Feature activation-2.69e+00

Pos logit contributions
OLOG+3.148
OLOGY+2.774
indal+2.631
ologies+2.582
GH+2.580
Neg logit contributions
adra-2.379
actic-2.290
ement-2.165
inion-2.140
atic-2.065
Token consequences
Token ID6948
Feature activation+3.98e+00

Pos logit contributions
adra+2.044
actic+1.987
ement+1.889
inion+1.831
atic+1.810
Neg logit contributions
OLOG-2.366
OLOGY-2.097
indal-1.972
��-1.969
GH-1.907
Token we
Token ID356
Feature activation+2.61e+00

Pos logit contributions
OLOG+3.755
OLOGY+3.318
indal+3.137
ologies+3.105
GH+3.074
Neg logit contributions
adra-2.766
actic-2.657
ement-2.497
inion-2.474
erva-2.375
Token do
Token ID466
Feature activation-2.67e+00

Pos logit contributions
OLOG+2.260
OLOGY+1.981
indal+1.863
ologies+1.833
GH+1.825
Neg logit contributions
adra-2.020
actic-1.953
ement-1.843
inion-1.827
erva-1.766
Token make
Token ID787
Feature activation+9.46e-01

Pos logit contributions
OLOG+3.264
OLOGY+2.886
indal+2.730
ologies+2.698
GH+2.691
Neg logit contributions
adra-2.554
actic-2.462
ement-2.310
inion-2.290
erva-2.216
Token it
Token ID340
Feature activation+1.44e+00

Pos logit contributions
OLOG+0.899
OLOGY+0.796
indal+0.754
��+0.742
ologies+0.738
Neg logit contributions
adra-0.660
actic-0.636
ement-0.600
inion-0.589
atic-0.571
Token easy
Token ID2562
Feature activation-1.24e+00

Pos logit contributions
OLOG+1.354
OLOGY+1.196
indal+1.131
��+1.110
ologies+1.110
Neg logit contributions
adra-1.023
actic-0.987
ement-0.931
inion-0.916
atic-0.885
Token even
Token ID772
Feature activation+2.29e+00

Pos logit contributions
adra+0.766
actic+0.734
ement+0.683
inion+0.675
atic+0.647
Neg logit contributions
OLOG-1.277
OLOGY-1.142
indal-1.083
��-1.069
ologies-1.067
Token for
Token ID329
Feature activation-1.83e+00

Pos logit contributions
OLOG+2.260
OLOGY+2.012
indal+1.901
ologies+1.871
��+1.865
Neg logit contributions
adra-1.505
actic-1.456
ement-1.355
inion-1.346
atic-1.285
Token ordinary
Token ID8850
Feature activation+2.79e+00

Pos logit contributions
adra+1.306
actic+1.249
ement+1.198
inion+1.159
atic+1.135
Neg logit contributions
OLOG-1.715
OLOGY-1.512
indal-1.443
��-1.418
ologies-1.388
Token users
Token ID2985
Feature activation-2.11e+00

Pos logit contributions
OLOG+2.332
OLOGY+2.030
indal+1.885
GH+1.870
��+1.869
Neg logit contributions
adra-2.242
actic-2.174
ement-2.054
inion-2.042
rehens-1.973
Token to
Token ID284
Feature activation+4.93e-01

Pos logit contributions
adra+1.273
actic+1.223
ement+1.149
inion+1.117
atic+1.088
Neg logit contributions
OLOG-2.215
OLOGY-1.980
indal-1.884
��-1.860
ologies-1.837
Token install
Token ID2721
Feature activation+3.36e+00

Pos logit contributions
OLOG+0.461
OLOGY+0.406
indal+0.384
��+0.380
ologies+0.374
Neg logit contributions
adra-0.353
actic-0.341
ement-0.324
inion-0.316
atic-0.309
Token and
Token ID290
Feature activation+1.47e+00

Pos logit contributions
OLOG+3.208
OLOGY+2.843
indal+2.688
ologies+2.656
��+2.646
Neg logit contributions
adra-2.320
actic-2.234
ement-2.085
inion-2.061
atic-1.981
Token remove
Token ID4781
Feature activation-2.11e+00

Pos logit contributions
OLOG+1.420
OLOGY+1.259
indal+1.194
��+1.172
ologies+1.169
Neg logit contributions
adra-1.010
actic-0.973
ement-0.915
inion-0.899
atic-0.870
Token such
Token ID884
Feature activation-7.75e-01

Pos logit contributions
adra+1.497
actic+1.451
ement+1.374
inion+1.329
atic+1.305
Neg logit contributions
OLOG-2.034
OLOGY-1.802
indal-1.714
��-1.682
GH-1.658
Token clauses
Token ID31485
Feature activation-3.26e+00

Pos logit contributions
adra+0.585
actic+0.566
ement+0.538
inion+0.525
atic+0.513
Neg logit contributions
OLOG-0.694
OLOGY-0.610
indal-0.577
��-0.566
GH-0.559
Token.
Token ID13
Feature activation+6.71e-01

Pos logit contributions
adra+2.078
actic+1.992
ement+1.894
inion+1.808
atic+1.790
Neg logit contributions
OLOG-3.304
OLOGY-2.951
indal-2.813
��-2.784
GH-2.726
Token
Token ID198
Feature activation+1.38e+00

Pos logit contributions
OLOG+0.596
OLOGY+0.523
indal+0.493
��+0.485
ologies+0.481
Neg logit contributions
adra-0.510
actic-0.494
ement-0.468
inion-0.460
atic-0.448
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
OLOG+1.251
OLOGY+1.098
indal+1.040
��+1.029
ologies+1.013
Neg logit contributions
adra-1.031
actic-0.999
ement-0.948
inion-0.928
atic-0.907
TokenHe
Token ID1544
Feature activation+1.82e+00

Pos logit contributions
OLOG+0.813
OLOGY+0.698
indal+0.655
��+0.642
ologies+0.635
Neg logit contributions
adra-0.931
actic-0.905
ement-0.866
inion-0.851
atic-0.835
Token would
Token ID561
Feature activation+8.19e-02

Pos logit contributions
OLOG+1.556
OLOGY+1.357
indal+1.275
��+1.252
ologies+1.242
Neg logit contributions
adra-1.451
actic-1.406
ement-1.338
inion-1.315
atic-1.283
Token continue
Token ID2555
Feature activation-1.16e+00

Pos logit contributions
OLOG+0.073
OLOGY+0.064
indal+0.060
��+0.059
ologies+0.059
Neg logit contributions
adra-0.063
actic-0.061
ement-0.057
inion-0.056
atic-0.055
Token:
Token ID25
Feature activation+2.60e+00

Pos logit contributions
adra+0.714
actic+0.685
ement+0.640
inion+0.629
atic+0.605
Neg logit contributions
OLOG-1.200
OLOGY-1.073
indal-1.022
��-1.006
ologies-1.001
Token
Token ID198
Feature activation+4.31e-01

Pos logit contributions
OLOG+2.188
OLOGY+1.907
indal+1.772
GH+1.740
ologies+1.735
Neg logit contributions
adra-2.092
actic-2.005
ement-1.899
inion-1.894
atic-1.830
Token
Token ID198
Feature activation+4.51e-01

Pos logit contributions
OLOG+0.396
OLOGY+0.348
indal+0.332
��+0.332
ologies+0.323
Neg logit contributions
adra-0.313
actic-0.304
ement-0.289
inion-0.282
atic-0.276
Token
Token ID447
Feature activation-9.36e-01

Pos logit contributions
OLOG+0.672
OLOGY+0.592
indal+0.559
��+0.550
ologies+0.546
Neg logit contributions
adra-0.542
actic-0.523
ement-0.496
inion-0.487
atic-0.474
Token
Token ID251
Feature activation-7.25e-01

Pos logit contributions
adra+0.819
actic+0.791
ement+0.757
inion+0.750
atic+0.735
Neg logit contributions
OLOG-0.731
OLOGY-0.626
indal-0.587
ologies-0.569
GH-0.564
Token b
Token ID275
Feature activation+1.01e+00

Pos logit contributions
adra+0.550
actic+0.534
ement+0.509
inion+0.498
atic+0.486
Neg logit contributions
OLOG-0.643
OLOGY-0.564
indal-0.532
��-0.523
ologies-0.519
Token/
Token ID14
Feature activation-2.61e+00

Pos logit contributions
OLOG+0.923
OLOGY+0.813
indal+0.768
��+0.755
ologies+0.750
Neg logit contributions
adra-0.739
actic-0.715
ement-0.677
inion-0.664
atic-0.646
Tokenw
Token ID86
Feature activation-2.74e+00

Pos logit contributions
adra+1.842
actic+1.784
ement+1.677
inion+1.656
atic+1.604
Neg logit contributions
OLOG-2.461
OLOGY-2.171
indal-2.083
ologies-2.017
��-2.017
Token
Token ID564
Feature activation+1.28e+00

Pos logit contributions
adra+2.132
actic+2.072
ement+1.981
inion+1.932
atic+1.888
Neg logit contributions
OLOG-2.348
OLOGY-2.045
indal-1.938
��-1.928
ologies-1.877
Token
Token ID250
Feature activation+3.62e-01

Pos logit contributions
OLOG+0.945
OLOGY+0.806
indal+0.747
��+0.744
ologies+0.725
Neg logit contributions
adra-1.163
actic-1.135
ement-1.087
inion-1.068
atic-1.044
TokenBul
Token ID33481
Feature activation+6.50e-01

Pos logit contributions
OLOG+0.286
OLOGY+0.247
indal+0.231
��+0.226
ologies+0.224
Neg logit contributions
adra-0.311
actic-0.302
ement-0.288
inion-0.284
atic-0.277
Tokenbs
Token ID1443
Feature activation+2.20e+00

Pos logit contributions
OLOG+0.651
OLOGY+0.580
indal+0.551
��+0.542
ologies+0.539
Neg logit contributions
adra-0.422
actic-0.405
ement-0.381
inion-0.373
atic-0.361
Token of
Token ID286
Feature activation+4.27e-01

Pos logit contributions
OLOG+1.940
OLOGY+1.701
indal+1.608
��+1.571
ologies+1.569
Neg logit contributions
adra-1.685
actic-1.629
ement-1.540
inion-1.518
atic-1.472
Token Passion
Token ID37580
Feature activation-3.73e+00

Pos logit contributions
OLOG+0.369
OLOGY+0.322
indal+0.302
��+0.298
ologies+0.296
Neg logit contributions
adra-0.335
actic-0.325
ement-0.310
inion-0.304
atic-0.296
Token as
Token ID355
Feature activation-5.24e+00

Pos logit contributions
adra+0.548
actic+0.529
ement+0.499
inion+0.486
atic+0.478
Neg logit contributions
OLOG-0.745
OLOGY-0.660
indal-0.629
��-0.617
GH-0.610
Token many
Token ID867
Feature activation+2.04e+00

Pos logit contributions
adra+2.963
actic+2.822
ement+2.653
inion+2.562
atic+2.488
Neg logit contributions
OLOG-5.628
OLOGY-5.053
indal-4.806
��-4.764
ologies-4.708
Token as
Token ID355
Feature activation-2.16e+00

Pos logit contributions
OLOG+1.971
OLOGY+1.747
indal+1.655
��+1.629
ologies+1.623
Neg logit contributions
adra-1.387
actic-1.337
ement-1.255
inion-1.235
atic-1.195
Token 800
Token ID10460
Feature activation+5.76e+00

Pos logit contributions
adra+1.402
actic+1.344
ement+1.265
inion+1.226
atic+1.211
Neg logit contributions
OLOG-2.169
OLOGY-1.926
indal-1.843
��-1.820
GH-1.785
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
OLOG+5.446
OLOGY+4.792
indal+4.540
��+4.517
GH+4.483
Neg logit contributions
adra-3.958
actic-3.826
inion-3.557
ement-3.547
atic-3.393
Token000
Token ID830
Feature activation+1.57e+00

Pos logit contributions
OLOG+1.191
OLOGY+1.003
indal+0.928
��+0.912
ologies+0.899
Neg logit contributions
adra-1.616
actic-1.572
ement-1.508
inion-1.485
atic-1.458
Token people
Token ID661
Feature activation-2.96e+00

Pos logit contributions
OLOG+1.326
OLOGY+1.154
indal+1.084
��+1.064
ologies+1.056
Neg logit contributions
adra-1.269
actic-1.229
ement-1.170
inion-1.150
atic-1.122
Token who
Token ID508
Feature activation+5.50e-01

Pos logit contributions
adra+1.861
actic+1.789
ement+1.691
inion+1.631
atic+1.607
Neg logit contributions
OLOG-3.022
OLOGY-2.708
indal-2.560
��-2.511
ologies-2.498
Token were
Token ID547
Feature activation-1.07e-01

Pos logit contributions
OLOG+0.491
OLOGY+0.431
indal+0.405
��+0.399
ologies+0.395
Neg logit contributions
adra-0.419
actic-0.404
ement-0.384
inion-0.376
atic-0.367
Token brought
Token ID3181
Feature activation-1.89e+00

Pos logit contributions
adra+0.075
actic+0.073
ement+0.069
inion+0.067
atic+0.066
Neg logit contributions
OLOG-0.102
OLOGY-0.090
indal-0.085
��-0.084
ologies-0.083
Token across
Token ID1973
Feature activation+2.14e+00

Pos logit contributions
adra+1.230
actic+1.190
ement+1.124
inion+1.081
atic+1.063
Neg logit contributions
OLOG-1.898
OLOGY-1.693
indal-1.611
��-1.588
GH-1.567
Token[/
Token ID13412
Feature activation-4.80e+00

Pos logit contributions
OLOG+0.187
OLOGY+0.163
indal+0.154
��+0.153
ologies+0.149
Neg logit contributions
adra-0.181
actic-0.176
ement-0.169
inion-0.165
atic-0.162
Tokenb
Token ID65
Feature activation-1.13e+00

Pos logit contributions
adra+2.831
actic+2.716
ement+2.564
inion+2.471
atic+2.430
Neg logit contributions
OLOG-5.114
OLOGY-4.567
��-4.538
-4.358
indal-4.328
Token]
Token ID60
Feature activation-7.54e-01

Pos logit contributions
adra+0.547
actic+0.529
ement+0.485
inion+0.471
atic+0.456
Neg logit contributions
OLOG-1.305
OLOGY-1.178
��-1.144
indal-1.139
ologies-1.108
Token 00
Token ID3571
Feature activation+2.10e+00

Pos logit contributions
adra+0.526
actic+0.507
ement+0.481
inion+0.468
atic+0.460
Neg logit contributions
OLOG-0.716
OLOGY-0.634
indal-0.604
��-0.597
ologies-0.587
Token:
Token ID25
Feature activation+2.46e+00

Pos logit contributions
OLOG+1.935
OLOGY+1.705
indal+1.614
��+1.591
ologies+1.575
Neg logit contributions
adra-1.524
actic-1.473
ement-1.394
inion-1.367
atic-1.332
Token40
Token ID1821
Feature activation+3.96e+00

Pos logit contributions
OLOG+2.102
OLOGY+1.833
indal+1.717
GH+1.681
ologies+1.678
Neg logit contributions
adra-1.951
actic-1.890
ement-1.796
inion-1.771
atic-1.719
Token:
Token ID25
Feature activation+5.52e-01

Pos logit contributions
OLOG+3.457
OLOGY+3.039
indal+2.823
GH+2.811
ologies+2.780
Neg logit contributions
adra-3.046
actic-2.935
ement-2.771
inion-2.759
erva-2.670
Token00
Token ID405
Feature activation-1.49e+00

Pos logit contributions
OLOG+0.420
OLOGY+0.360
indal+0.338
��+0.334
ologies+0.326
Neg logit contributions
adra-0.491
actic-0.477
ement-0.456
inion-0.448
atic-0.440
Token [
Token ID685
Feature activation-1.87e+00

Pos logit contributions
adra+1.107
actic+1.069
ement+1.012
inion+0.995
atic+0.966
Neg logit contributions
OLOG-1.345
OLOGY-1.182
indal-1.115
��-1.097
ologies-1.090
Tokenb
Token ID65
Feature activation-3.46e+00

Pos logit contributions
adra+1.212
actic+1.173
ement+1.099
inion+1.074
atic+1.048
Neg logit contributions
OLOG-1.885
OLOGY-1.684
��-1.600
indal-1.585
GH-1.543
Token]
Token ID60
Feature activation-1.36e+00

Pos logit contributions
adra+1.539
actic+1.511
ement+1.381
inion+1.317
atic+1.301
Neg logit contributions
OLOG-4.109
OLOGY-3.715
��-3.679
indal-3.617
-3.498
Tokenoon
Token ID2049
Feature activation+7.14e-01

Pos logit contributions
OLOG+0.013
OLOGY+0.012
indal+0.011
��+0.011
ologies+0.011
Neg logit contributions
adra-0.007
actic-0.007
ement-0.007
inion-0.007
atic-0.006
Token crisis
Token ID4902
Feature activation-1.32e-01

Pos logit contributions
OLOG+0.640
OLOGY+0.562
indal+0.530
��+0.520
ologies+0.518
Neg logit contributions
adra-0.538
actic-0.520
ement-0.492
inion-0.485
atic-0.471
Token
Token ID447
Feature activation-5.03e-01

Pos logit contributions
adra+0.097
actic+0.094
ement+0.089
inion+0.087
atic+0.085
Neg logit contributions
OLOG-0.120
OLOGY-0.105
indal-0.100
��-0.098
ologies-0.097
Token
Token ID247
Feature activation+1.41e-01

Pos logit contributions
adra+0.404
actic+0.392
ement+0.375
inion+0.372
atic+0.358
Neg logit contributions
OLOG-0.426
OLOGY-0.369
indal-0.351
ologies-0.337
GH-0.335
Token
Token ID851
Feature activation-6.09e-01

Pos logit contributions
OLOG+0.133
OLOGY+0.118
indal+0.112
��+0.110
ologies+0.109
Neg logit contributions
adra-0.099
actic-0.095
ement-0.090
inion-0.088
atic-0.086
Token a
Token ID257
Feature activation-2.20e+00

Pos logit contributions
adra+0.424
actic+0.407
ement+0.387
inion+0.378
atic+0.368
Neg logit contributions
OLOG-0.583
OLOGY-0.516
indal-0.488
��-0.484
ologies-0.476
Token phrase
Token ID9546
Feature activation+9.63e-01

Pos logit contributions
adra+1.424
actic+1.375
ement+1.293
inion+1.250
atic+1.233
Neg logit contributions
OLOG-2.206
OLOGY-1.966
indal-1.866
��-1.853
GH-1.820
Token I
Token ID314
Feature activation+2.23e+00

Pos logit contributions
OLOG+0.871
OLOGY+0.765
indal+0.726
��+0.715
GH+0.705
Neg logit contributions
adra-0.717
actic-0.693
ement-0.657
inion-0.642
atic-0.631
Token still
Token ID991
Feature activation+1.57e+00

Pos logit contributions
OLOG+1.928
OLOGY+1.684
indal+1.584
��+1.554
ologies+1.545
Neg logit contributions
adra-1.753
actic-1.697
ement-1.612
inion-1.586
atic-1.545
Token can
Token ID460
Feature activation+1.82e+00

Pos logit contributions
OLOG+1.483
OLOGY+1.311
indal+1.242
��+1.224
ologies+1.210
Neg logit contributions
adra-1.114
actic-1.075
ement-1.017
inion-0.995
atic-0.969
Token
Token ID447
Feature activation+6.27e-01

Pos logit contributions
OLOG+1.206
OLOGY+1.007
indal+0.927
��+0.902
ologies+0.895
Neg logit contributions
adra-1.790
actic-1.743
ement-1.676
inion-1.654
atic-1.620
Token wisdom
Token ID11501
Feature activation+3.02e+00

Pos logit contributions
OLOG+3.208
OLOGY+2.819
ologies+2.630
indal+2.628
GH+2.575
Neg logit contributions
adra-2.718
actic-2.594
ement-2.446
inion-2.445
rehens-2.351
Token says
Token ID1139
Feature activation+8.95e-01

Pos logit contributions
OLOG+2.252
OLOGY+1.924
indal+1.785
ologies+1.755
GH+1.727
Neg logit contributions
adra-2.719
actic-2.639
ement-2.511
inion-2.493
atic-2.432
Token.
Token ID13
Feature activation+4.11e-01

Pos logit contributions
OLOG+0.840
OLOGY+0.742
indal+0.702
��+0.690
ologies+0.686
Neg logit contributions
adra-0.637
actic-0.615
ement-0.581
inion-0.570
atic-0.554
Token Dinosaur
Token ID47808
Feature activation+4.54e+00

Pos logit contributions
OLOG+0.363
OLOGY+0.318
indal+0.302
��+0.298
ologies+0.293
Neg logit contributions
adra-0.313
actic-0.304
ement-0.289
inion-0.283
atic-0.277
Token bones
Token ID11945
Feature activation-5.31e+00

Pos logit contributions
OLOG+4.062
OLOGY+3.559
indal+3.321
ologies+3.290
GH+3.248
Neg logit contributions
adra-3.415
actic-3.266
inion-3.084
ement-3.078
rehens-2.965
Token with
Token ID351
Feature activation-1.90e+00

Pos logit contributions
adra+3.609
actic+3.494
ement+3.342
atic+3.197
inion+3.189
Neg logit contributions
OLOG-5.001
OLOGY-4.417
indal-4.258
��-4.216
-4.087
Token Carbon
Token ID23699
Feature activation-2.85e+00

Pos logit contributions
adra+1.300
actic+1.259
ement+1.192
inion+1.148
atic+1.133
Neg logit contributions
OLOG-1.822
OLOGY-1.614
indal-1.540
��-1.528
GH-1.495
Token-
Token ID12
Feature activation-4.66e+00

Pos logit contributions
adra+2.083
actic+2.027
ement+1.914
inion+1.864
atic+1.849
Neg logit contributions
OLOG-2.594
OLOGY-2.284
indal-2.173
��-2.139
GH-2.105
Token14
Token ID1415
Feature activation-7.96e-01

Pos logit contributions
adra+2.875
actic+2.793
ement+2.607
inion+2.502
atic+2.502
Neg logit contributions
OLOG-4.728
OLOGY-4.234
indal-4.068
��-3.979
ologies-3.918
Token dates
Token ID9667
Feature activation+3.21e+00

Pos logit contributions
adra+0.551
actic+0.533
ement+0.503
inion+0.491
atic+0.480
Neg logit contributions
OLOG-0.759
OLOGY-0.672
indal-0.638
��-0.630
ologies-0.621
Token in
Token ID287
Feature activation+1.47e+00

Pos logit contributions
OLOG+2.832
OLOGY+2.479
indal+2.327
ologies+2.302
GH+2.284
Neg logit contributions
adra-2.437
actic-2.347
ement-2.225
inion-2.215
rehens-2.134
Token t
Token ID256
Feature activation-2.98e+00

Pos logit contributions
adra+1.857
actic+1.822
ement+1.732
inion+1.686
atic+1.676
Neg logit contributions
OLOG-2.155
OLOGY-1.875
indal-1.802
��-1.778
ologies-1.708
Tokenumbled
Token ID11137
Feature activation+2.73e-02

Pos logit contributions
adra+1.547
actic+1.492
ement+1.373
atic+1.315
inion+1.299
Neg logit contributions
OLOG-3.384
OLOGY-3.070
��-2.947
indal-2.902
ologies-2.849
Token down
Token ID866
Feature activation-5.55e-01

Pos logit contributions
OLOG+0.026
OLOGY+0.023
indal+0.022
��+0.021
ologies+0.021
Neg logit contributions
adra-0.019
actic-0.019
ement-0.018
inion-0.017
atic-0.017
Token.
Token ID13
Feature activation-2.01e+00

Pos logit contributions
adra+0.366
actic+0.352
ement+0.332
inion+0.323
atic+0.315
Neg logit contributions
OLOG-0.552
OLOGY-0.491
indal-0.467
��-0.458
ologies-0.455
Token With
Token ID2080
Feature activation-3.21e+00

Pos logit contributions
adra+1.450
actic+1.401
ement+1.334
inion+1.297
atic+1.285
Neg logit contributions
OLOG-1.834
OLOGY-1.621
��-1.563
indal-1.561
ologies-1.508
Token her
Token ID607
Feature activation-3.76e+00

Pos logit contributions
adra+2.038
actic+1.951
ement+1.824
inion+1.784
atic+1.741
Neg logit contributions
OLOG-3.240
OLOGY-2.896
indal-2.769
��-2.724
GH-2.679
Token body
Token ID1767
Feature activation-1.99e+00

Pos logit contributions
adra+2.560
actic+2.476
ement+2.342
inion+2.282
atic+2.256
Neg logit contributions
OLOG-3.615
OLOGY-3.198
indal-3.058
��-3.022
ologies-2.941
Token in
Token ID287
Feature activation-6.78e-01

Pos logit contributions
adra+1.374
actic+1.328
ement+1.256
inion+1.231
atic+1.206
Neg logit contributions
OLOG-1.912
OLOGY-1.690
indal-1.617
��-1.592
GH-1.554
Token such
Token ID884
Feature activation-2.92e+00

Pos logit contributions
adra+0.462
actic+0.445
ement+0.420
inion+0.411
atic+0.399
Neg logit contributions
OLOG-0.657
OLOGY-0.583
indal-0.552
��-0.544
ologies-0.540
Token poor
Token ID3595
Feature activation-3.99e+00

Pos logit contributions
adra+1.838
actic+1.779
ement+1.682
inion+1.621
atic+1.588
Neg logit contributions
OLOG-2.971
OLOGY-2.641
��-2.519
indal-2.511
GH-2.453
Token shape
Token ID5485
Feature activation-3.99e-01

Pos logit contributions
adra+2.381
actic+2.313
ement+2.187
inion+2.114
atic+2.067
Neg logit contributions
OLOG-4.136
OLOGY-3.703
��-3.550
indal-3.550
-3.466
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
OLOG+0.814
OLOGY+0.722
indal+0.685
��+0.672
ologies+0.671
Neg logit contributions
adra-0.576
actic-0.554
ement-0.522
inion-0.512
atic-0.496
Token with
Token ID351
Feature activation-4.62e+00

Pos logit contributions
adra+0.753
actic+0.725
ement+0.684
inion+0.669
atic+0.650
Neg logit contributions
OLOG-1.127
OLOGY-0.998
��-0.948
indal-0.947
ologies-0.926
Token 47
Token ID6298
Feature activation+3.54e+00

Pos logit contributions
adra+2.907
actic+2.779
ement+2.635
inion+2.563
atic+2.499
Neg logit contributions
OLOG-4.658
OLOGY-4.134
��-3.984
indal-3.937
GH-3.843
Token%
Token ID4
Feature activation-1.94e+00

Pos logit contributions
OLOG+2.407
OLOGY+2.037
indal+1.881
ologies+1.845
GH+1.833
Neg logit contributions
adra-3.362
actic-3.295
ement-3.100
inion-3.080
erva-3.019
Token support
Token ID1104
Feature activation-2.55e+00

Pos logit contributions
adra+1.200
actic+1.152
ement+1.083
inion+1.072
atic+1.024
Neg logit contributions
OLOG-2.002
OLOGY-1.788
indal-1.701
��-1.692
ologies-1.661
Token to
Token ID284
Feature activation-1.59e-01

Pos logit contributions
adra+1.549
actic+1.484
ement+1.400
inion+1.373
atic+1.319
Neg logit contributions
OLOG-2.650
OLOGY-2.371
indal-2.245
��-2.240
GH-2.203
Token 47
Token ID6298
Feature activation+4.34e+00

Pos logit contributions
adra+0.110
actic+0.106
ement+0.100
inion+0.098
atic+0.095
Neg logit contributions
OLOG-0.152
OLOGY-0.135
indal-0.128
��-0.127
ologies-0.124
Token%
Token ID4
Feature activation+8.63e-01

Pos logit contributions
OLOG+3.115
OLOGY+2.654
indal+2.439
GH+2.403
ologies+2.382
Neg logit contributions
adra-4.006
actic-3.898
ement-3.707
inion-3.677
atic-3.583
Token opposed
Token ID6886
Feature activation+4.38e-01

Pos logit contributions
OLOG+0.969
OLOGY+0.876
indal+0.840
ologies+0.823
��+0.823
Neg logit contributions
adra-0.453
actic-0.430
ement-0.396
inion-0.384
atic-0.373
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
OLOG+0.425
OLOGY+0.378
indal+0.359
ologies+0.351
��+0.351
Neg logit contributions
adra-0.297
actic-0.285
ement-0.267
inion-0.262
atic-0.254
Token You
Token ID921
Feature activation-1.28e+00

Pos logit contributions
adra+0.325
actic+0.314
ement+0.298
inion+0.292
atic+0.285
Neg logit contributions
OLOG-0.437
OLOGY-0.385
��-0.367
indal-0.367
ologies-0.359
Token first
Token ID717
Feature activation+3.46e+00

Pos logit contributions
OLOG+0.432
OLOGY+0.386
indal+0.367
��+0.361
ologies+0.359
Neg logit contributions
adra-0.272
actic-0.261
ement-0.245
inion-0.240
atic-0.232
Token time
Token ID640
Feature activation+3.55e+00

Pos logit contributions
OLOG+3.402
OLOGY+3.030
indal+2.895
GH+2.830
ologies+2.815
Neg logit contributions
adra-2.252
actic-2.141
ement-2.014
inion-2.006
atic-1.912
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
OLOG+3.010
OLOGY+2.631
indal+2.461
GH+2.418
ologies+2.414
Neg logit contributions
adra-2.794
actic-2.687
ement-2.567
inion-2.564
atic-2.452
Token he
Token ID339
Feature activation-5.12e-01

Pos logit contributions
OLOG+1.247
OLOGY+1.104
indal+1.045
��+1.026
ologies+1.022
Neg logit contributions
adra-0.927
actic-0.892
ement-0.843
inion-0.831
atic-0.803
Token genuinely
Token ID17170
Feature activation+4.32e-01

Pos logit contributions
adra+0.415
actic+0.401
ement+0.384
inion+0.373
atic+0.368
Neg logit contributions
OLOG-0.434
OLOGY-0.377
indal-0.357
��-0.351
ologies-0.344
Token believed
Token ID4762
Feature activation-2.41e+00

Pos logit contributions
OLOG+0.377
OLOGY+0.329
indal+0.311
��+0.305
ologies+0.302
Neg logit contributions
adra-0.337
actic-0.326
ement-0.310
inion-0.304
atic-0.297
Token he
Token ID339
Feature activation+1.54e+00

Pos logit contributions
adra+1.611
actic+1.561
ement+1.462
inion+1.430
atic+1.396
Neg logit contributions
OLOG-2.349
OLOGY-2.090
indal-1.980
��-1.959
ologies-1.934
Token was
Token ID373
Feature activation+8.07e-02

Pos logit contributions
OLOG+1.200
OLOGY+1.033
indal+0.964
��+0.944
ologies+0.937
Neg logit contributions
adra-1.333
actic-1.295
ement-1.236
inion-1.219
atic-1.190
Token going
Token ID1016
Feature activation-4.82e+00

Pos logit contributions
OLOG+0.077
OLOGY+0.068
indal+0.065
��+0.064
GH+0.063
Neg logit contributions
adra-0.056
actic-0.054
ement-0.051
inion-0.050
atic-0.048
Token to
Token ID284
Feature activation-5.37e-01

Pos logit contributions
adra+2.846
actic+2.729
ement+2.545
inion+2.481
atic+2.402
Neg logit contributions
OLOG-5.097
OLOGY-4.566
indal-4.355
��-4.289
ologies-4.268
Token die
Token ID4656
Feature activation+5.31e+00

Pos logit contributions
adra+0.402
actic+0.390
ement+0.369
inion+0.361
atic+0.353
Neg logit contributions
OLOG-0.484
OLOGY-0.424
indal-0.401
��-0.394
ologies-0.392
Tokenash
Token ID1077
Feature activation+2.85e+00

Pos logit contributions
adra+1.487
actic+1.431
ement+1.362
inion+1.344
atic+1.308
Neg logit contributions
OLOG-1.843
OLOGY-1.623
��-1.557
indal-1.545
ologies-1.502
Tokenin
Token ID259
Feature activation+1.45e+00

Pos logit contributions
OLOG+2.587
OLOGY+2.283
indal+2.172
ologies+2.125
��+2.114
Neg logit contributions
adra-2.062
actic-1.998
ement-1.882
inion-1.848
atic-1.793
Token Bob
Token ID5811
Feature activation-1.13e+00

Pos logit contributions
OLOG+1.279
OLOGY+1.121
indal+1.057
��+1.034
ologies+1.033
Neg logit contributions
adra-1.107
actic-1.068
ement-1.013
inion-0.997
atic-0.968
Token Kelley
Token ID34560
Feature activation-1.30e-01

Pos logit contributions
adra+0.848
actic+0.827
ement+0.781
inion+0.769
atic+0.753
Neg logit contributions
OLOG-1.017
OLOGY-0.892
indal-0.840
��-0.834
ologies-0.818
Token-
Token ID12
Feature activation-1.39e+00

Pos logit contributions
adra+0.097
actic+0.094
ement+0.089
inion+0.087
atic+0.085
Neg logit contributions
OLOG-0.118
OLOGY-0.104
indal-0.098
��-0.096
ologies-0.096
TokenVal
Token ID7762
Feature activation-7.41e+00

Pos logit contributions
adra+1.256
actic+1.225
ement+1.175
inion+1.153
atic+1.139
Neg logit contributions
OLOG-1.013
OLOGY-0.863
indal-0.813
��-0.806
ologies-0.786
Token Kil
Token ID19902
Feature activation+6.97e-01

Pos logit contributions
adra+4.633
actic+4.480
ement+4.219
inion+4.089
atic+3.946
Neg logit contributions
OLOG-7.574
OLOGY-6.720
��-6.403
indal-6.184
ologies-6.141
Tokenmer
Token ID647
Feature activation-1.53e-01

Pos logit contributions
OLOG+0.685
OLOGY+0.609
indal+0.577
��+0.569
ologies+0.565
Neg logit contributions
adra-0.465
actic-0.448
ement-0.422
inion-0.413
atic-0.401
Token Song
Token ID10940
Feature activation+5.79e-01

Pos logit contributions
adra+0.115
actic+0.111
ement+0.106
inion+0.104
atic+0.101
Neg logit contributions
OLOG-0.137
OLOGY-0.120
indal-0.113
��-0.111
ologies-0.110
Token-
Token ID12
Feature activation-6.20e-01

Pos logit contributions
OLOG+0.534
OLOGY+0.471
indal+0.445
��+0.438
ologies+0.434
Neg logit contributions
adra-0.421
actic-0.407
ement-0.385
inion-0.378
atic-0.368
TokenB
Token ID33
Feature activation-6.58e-01

Pos logit contributions
adra+0.561
actic+0.547
ement+0.524
inion+0.515
atic+0.508
Neg logit contributions
OLOG-0.457
OLOGY-0.390
indal-0.367
��-0.362
ologies-0.355
Token Fresh
Token ID20138
Feature activation+5.45e+00

Pos logit contributions
OLOG+0.611
OLOGY+0.544
indal+0.517
��+0.509
ologies+0.506
Neg logit contributions
adra-0.398
actic-0.383
ement-0.360
inion-0.352
atic-0.341
Tokenman
Token ID805
Feature activation+3.53e+00

Pos logit contributions
OLOG+3.692
OLOGY+3.099
indal+2.847
ologies+2.771
GH+2.768
Neg logit contributions
adra-5.177
actic-4.967
ement-4.731
inion-4.704
rehens-4.641
Token first
Token ID717
Feature activation+2.57e+00

Pos logit contributions
OLOG+3.116
OLOGY+2.730
indal+2.585
ologies+2.545
��+2.490
Neg logit contributions
adra-2.657
actic-2.532
ement-2.407
inion-2.373
rehens-2.291
Token team
Token ID1074
Feature activation+1.25e+00

Pos logit contributions
OLOG+2.670
OLOGY+2.396
indal+2.267
ologies+2.249
GH+2.227
Neg logit contributions
adra-1.557
actic-1.469
ement-1.370
inion-1.345
rehens-1.307
Token (
Token ID357
Feature activation+8.52e-01

Pos logit contributions
OLOG+1.170
OLOGY+1.037
indal+0.984
ologies+0.965
��+0.961
Neg logit contributions
adra-0.879
actic-0.842
ement-0.793
inion-0.784
atic-0.753
Tokenalong
Token ID24176
Feature activation-7.38e+00

Pos logit contributions
OLOG+0.725
OLOGY+0.632
indal+0.594
��+0.582
ologies+0.579
Neg logit contributions
adra-0.681
actic-0.659
ement-0.627
inion-0.617
atic-0.601
Token with
Token ID351
Feature activation-4.43e+00

Pos logit contributions
adra+4.540
actic+4.344
ement+4.094
inion+3.971
atic+3.892
Neg logit contributions
OLOG-7.490
OLOGY-6.689
indal-6.367
��-6.306
ologies-6.223
Token Jones
Token ID5437
Feature activation+1.43e-01

Pos logit contributions
adra+3.290
actic+3.194
ement+3.012
inion+2.958
atic+2.913
Neg logit contributions
OLOG-3.972
OLOGY-3.467
indal-3.297
��-3.250
ologies-3.181
Token and
Token ID290
Feature activation+3.22e-01

Pos logit contributions
OLOG+0.141
OLOGY+0.126
indal+0.119
GH+0.117
��+0.117
Neg logit contributions
adra-0.095
actic-0.091
ement-0.085
inion-0.084
atic-0.081
Token Wins
Token ID24423
Feature activation-3.76e+00

Pos logit contributions
OLOG+0.272
OLOGY+0.236
indal+0.222
��+0.218
ologies+0.216
Neg logit contributions
adra-0.259
actic-0.251
ement-0.239
inion-0.235
atic-0.229
Tokenlow
Token ID9319
Feature activation+1.73e+00

Pos logit contributions
adra+2.113
actic+2.009
ement+1.897
inion+1.837
atic+1.816
Neg logit contributions
OLOG-4.089
OLOGY-3.677
��-3.494
indal-3.462
ologies-3.402
Token entrepreneurial
Token ID39365
Feature activation+4.98e+00

Pos logit contributions
OLOG+1.930
OLOGY+1.711
indal+1.623
ologies+1.595
GH+1.580
Neg logit contributions
adra-1.381
actic-1.323
ement-1.253
inion-1.230
atic-1.188
Token risk
Token ID2526
Feature activation+6.03e+00

Pos logit contributions
OLOG+4.522
OLOGY+3.989
indal+3.765
ologies+3.696
GH+3.662
Neg logit contributions
adra-3.641
actic-3.481
ement-3.272
inion-3.246
erva-3.140
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
OLOG+5.392
OLOGY+4.785
ologies+4.476
indal+4.473
IGH+4.386
Neg logit contributions
adra-4.387
actic-4.252
ement-3.937
inion-3.926
erva-3.776
Token\n
Token ID198
Feature activation+4.94e-01

Pos logit contributions
adra+0.245
actic+0.238
ement+0.226
inion+0.220
atic+0.217
Neg logit contributions
OLOG-0.305
OLOGY-0.268
��-0.259
indal-0.257
ologies-0.250
Token\n
Token ID198
Feature activation+3.67e-01

Pos logit contributions
OLOG+0.455
OLOGY+0.400
indal+0.381
��+0.381
ologies+0.370
Neg logit contributions
adra-0.359
actic-0.349
ement-0.331
inion-0.323
atic-0.317
TokenLast
Token ID5956
Feature activation-7.35e+00

Pos logit contributions
OLOG+0.282
OLOGY+0.242
indal+0.229
��+0.226
ologies+0.221
Neg logit contributions
adra-0.323
actic-0.313
ement-0.299
inion-0.294
atic-0.290
Token year
Token ID614
Feature activation+7.97e-01

Pos logit contributions
adra+3.823
actic+3.739
ement+3.496
inion+3.347
atic+3.229
Neg logit contributions
OLOG-8.004
OLOGY-7.169
��-6.927
indal-6.913
GH-6.684
Token,
Token ID11
Feature activation-6.99e-01

Pos logit contributions
OLOG+0.771
OLOGY+0.684
indal+0.647
ologies+0.634
��+0.634
Neg logit contributions
adra-0.546
actic-0.522
ement-0.494
inion-0.484
atic-0.466
Token John
Token ID1757
Feature activation+2.00e+00

Pos logit contributions
adra+0.525
actic+0.510
ement+0.481
inion+0.474
atic+0.463
Neg logit contributions
OLOG-0.626
OLOGY-0.549
indal-0.520
��-0.515
GH-0.506
Token Marshall
Token ID13606
Feature activation-1.51e+00

Pos logit contributions
OLOG+1.687
OLOGY+1.466
indal+1.379
ologies+1.353
GH+1.346
Neg logit contributions
adra-1.598
actic-1.548
ement-1.472
inion-1.450
atic-1.405
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
adra+1.005
actic+0.972
ement+0.918
inion+0.890
atic+0.874
Neg logit contributions
OLOG-1.478
OLOGY-1.317
indal-1.257
��-1.241
ologies-1.225
Token.
Token ID13
Feature activation-9.12e-01

Pos logit contributions
adra+1.886
actic+1.813
ement+1.712
inion+1.645
atic+1.638
Neg logit contributions
OLOG-3.147
OLOGY-2.820
indal-2.694
��-2.655
GH-2.615
Token The
Token ID383
Feature activation-1.89e-01

Pos logit contributions
adra+0.652
actic+0.630
ement+0.597
inion+0.582
atic+0.573
Neg logit contributions
OLOG-0.846
OLOGY-0.744
��-0.719
indal-0.715
ologies-0.696
Token idea
Token ID2126
Feature activation+2.44e+00

Pos logit contributions
adra+0.128
actic+0.123
ement+0.116
inion+0.114
atic+0.111
Neg logit contributions
OLOG-0.183
OLOGY-0.163
indal-0.155
��-0.153
GH-0.151
Token of
Token ID286
Feature activation-8.10e-01

Pos logit contributions
OLOG+2.136
OLOGY+1.872
indal+1.760
ologies+1.734
��+1.714
Neg logit contributions
adra-1.891
actic-1.818
ement-1.725
inion-1.714
atic-1.642
Token a
Token ID257
Feature activation-1.14e+00

Pos logit contributions
adra+0.531
actic+0.510
ement+0.480
inion+0.469
atic+0.456
Neg logit contributions
OLOG-0.806
OLOGY-0.717
indal-0.682
��-0.675
GH-0.665
Token cohesive
Token ID42403
Feature activation-8.28e+00

Pos logit contributions
adra+0.804
actic+0.777
ement+0.739
inion+0.718
atic+0.707
Neg logit contributions
OLOG-1.070
OLOGY-0.946
indal-0.897
��-0.888
ologies-0.870
Token Palestinian
Token ID8501
Feature activation-9.32e+00

Pos logit contributions
adra+5.232
actic+5.175
ement+4.984
inion+4.851
atic+4.661
Neg logit contributions
OLOG-7.838
OLOGY-6.964
��-6.742
indal-6.716
GH-6.369
Token state
Token ID1181
Feature activation-2.89e+00

Pos logit contributions
adra+4.772
actic+4.662
inion+4.459
ement+4.365
atic+4.191
Neg logit contributions
OLOG-9.878
OLOGY-8.827
��-8.801
indal-8.621
-8.316
Token was
Token ID373
Feature activation+1.64e-01

Pos logit contributions
adra+2.092
actic+2.032
ement+1.944
inion+1.866
atic+1.866
Neg logit contributions
OLOG-2.660
OLOGY-2.329
indal-2.229
��-2.222
GH-2.157
Token supposed
Token ID4385
Feature activation-5.93e-01

Pos logit contributions
OLOG+0.152
OLOGY+0.134
indal+0.127
��+0.125
ologies+0.124
Neg logit contributions
adra-0.119
actic-0.115
ement-0.109
inion-0.106
atic-0.104
Token to
Token ID284
Feature activation-2.02e-01

Pos logit contributions
adra+0.329
actic+0.321
ement+0.296
inion+0.285
atic+0.280
Neg logit contributions
OLOG-0.648
OLOGY-0.582
indal-0.558
��-0.558
GH-0.546
Tokenet
Token ID316
Feature activation+5.54e-01

Pos logit contributions
OLOG+1.693
OLOGY+1.520
indal+1.451
��+1.430
ologies+1.423
Neg logit contributions
adra-0.900
actic-0.860
ement-0.801
inion-0.782
atic-0.754
Token">
Token ID5320
Feature activation+4.35e+00

Pos logit contributions
OLOG+0.462
OLOGY+0.401
indal+0.375
��+0.370
ologies+0.369
Neg logit contributions
adra-0.453
actic-0.436
ement-0.416
inion-0.409
atic-0.399
TokenOptional
Token ID30719
Feature activation-1.57e+00

Pos logit contributions
OLOG+3.622
OLOGY+3.155
indal+2.924
GH+2.908
ologies+2.889
Neg logit contributions
adra-3.504
actic-3.381
ement-3.220
inion-3.208
atic-3.096
Token:
Token ID25
Feature activation+2.88e+00

Pos logit contributions
adra+1.117
actic+1.084
ement+1.025
inion+1.000
atic+0.983
Neg logit contributions
OLOG-1.461
OLOGY-1.288
indal-1.218
��-1.206
ologies-1.192
Token visually
Token ID22632
Feature activation+7.79e-01

Pos logit contributions
OLOG+2.568
OLOGY+2.254
indal+2.134
ologies+2.080
��+2.077
Neg logit contributions
adra-2.179
actic-2.093
ement-1.991
inion-1.962
erva-1.891
Token pleasing
Token ID28790
Feature activation-7.55e+00

Pos logit contributions
OLOG+0.729
OLOGY+0.644
indal+0.610
��+0.600
ologies+0.595
Neg logit contributions
adra-0.556
actic-0.538
ement-0.508
inion-0.497
atic-0.485
Token placeholder
Token ID46076
Feature activation-2.20e+00

Pos logit contributions
adra+5.033
actic+5.010
ement+4.762
atic+4.588
inion+4.505
Neg logit contributions
OLOG-6.935
OLOGY-6.149
��-5.878
indal-5.741
-5.658
Token
Token ID3556
Feature activation+2.76e+00

Pos logit contributions
adra+1.538
actic+1.505
ement+1.420
inion+1.376
atic+1.358
Neg logit contributions
OLOG-2.061
OLOGY-1.822
indal-1.719
��-1.718
GH-1.673
Tokenp
Token ID79
Feature activation-2.11e+00

Pos logit contributions
OLOG+2.592
OLOGY+2.292
indal+2.174
ologies+2.128
GH+2.117
Neg logit contributions
adra-1.953
actic-1.879
ement-1.767
inion-1.741
atic-1.688
Token>
Token ID29
Feature activation+1.14e+00

Pos logit contributions
adra+1.751
actic+1.694
ement+1.616
inion+1.589
atic+1.548
Neg logit contributions
OLOG-1.725
OLOGY-1.495
indal-1.400
��-1.372
ologies-1.364
Token
Token ID7359
Feature activation+3.68e+00

Pos logit contributions
OLOG+1.032
OLOGY+0.908
indal+0.856
ologies+0.841
��+0.839
Neg logit contributions
adra-0.847
actic-0.813
ement-0.771
inion-0.761
atic-0.736