TOP ACTIVATIONS
MAX = 16.851

prev _ state . un wrap ()) } } pub fn
100 000 ); file . close _ and _ ev ict
White House ( she turned down a request from The Intercept
t crowd the pan ; only cook about 3
Key Ex ists Q [ step New , arrival Location ]
struct uring kernel rect K ernel = c v 2 .
( 13 , 5 )) sq K ernel = c v
BL ACK H AT , rect K ernel )
' ) AS line FROM [ cl ojure .
ulate density grid . const size _ t num Part icles
, The San Antonio Express - News , Go N om
discern the size of individual pixels due to the amount they
continue to write about the issues that matter .
Santa Clara County v . Southern Pacific Railroad Company
California v . Southern Pacific Railroad Company
// Time component const float TA U = 6 . 283
Santa Clara County v . Southern Pacific Railroad Company , 118
We value local independent journalism . We hope you do
common language with the s outher ners . And
story : http :// usat . ly / 1 f H

MIN ACTIVATIONS
MIN = -17.217

of the most , if not , the most Progressive politician
notification . From the Kodi home screen , choose
offered a more gentle , though often ineffective , approach to
receives secret voices in his or her head , and must
and yellow ( 2011 ) [ 12 ] $
it 's difficult , if not impossible , for them to
a coma to find his family at his hospital bed side
retired the big bird . Home <|endoftext|> 0 Don
so vastly different from his or her offline persona , it
meet . M ort g ages , rent ,
Fire of Mount Doom in Mord or , where it was
. From the Kodi home screen , select Add ons
true priorities and his or her character is to
chooses to go alone to Mord or , but Sam guesses
the mobile consumer that his or her registration will expire ,
the honour to be , Mad am , Your Majesty 's
Man age Your Ent ire Business from One Dash board
, appeared in the Melbourne Mag istrates Court this afternoon ,
. This minim izes the chance that a normal word like
) 5 . Jon ny Gray CAP TAIN (

INTERVAL 8.334 - 16.851
CONTAINS 0.499%

/ profile ? get = NAME , DP 02 _ 00
plenty more . There are some downs ides , like having
| + -------------------- --+ + -------------------- --+ < -- Start of
s Coach returns to Recap San Juan Del Sur Episode 5
all - wheel drive with durable , mechanical - locking four

INTERVAL -0.183 - 8.334
CONTAINS 52.684%

be used to acquire land for new parks , develop a
job on Zika so he could stump for Trump .
. Unlike asphalt repairs for streets , concrete sidewalk
on longer than one of the nine lives of an average
site would be tid ied but will remain there until the

INTERVAL -8.700 - -0.183
CONTAINS 46.375%

end until the accumulated debt becomes unsustainable and the government
in the Ham pt ons , and their children attended the
Self described as " f ucked up Rock and
said Tom F itton , president of Judicial Watch .
happen . Tar am ov told CF 13 .

INTERVAL -17.217 - -8.700
CONTAINS 0.442%

, both camera power supplies ar ced and short ed out
space ) { options } a list of options , separated
33 - year - old Z yang V ang , then
the GOP took control of the upper chamber . Friends and
authorities said . L inda Jackson , 59 ,
Tokenprev
Token ID47050
Feature activation+4.53e+00

Pos logit contributions
 Strait+5.638
 JO+5.574
atos+5.524
natureconservancy+5.296
perse+5.287
Neg logit contributions
ham-5.163
hist-5.145
endars-4.718
aic-4.649
yip-4.640
Token_
Token ID62
Feature activation+1.11e+00

Pos logit contributions
 Strait+2.274
 JO+2.194
atos+2.172
perse+2.066
natureconservancy+2.023
Neg logit contributions
hist-3.949
ham-3.941
quite-3.607
yip-3.599
aic-3.598
Tokenstate
Token ID5219
Feature activation+1.05e+01

Pos logit contributions
 Strait+0.862
atos+0.835
 JO+0.834
natureconservancy+0.811
perse+0.808
Neg logit contributions
hist-0.696
ham-0.682
quite-0.598
endars-0.597
nda-0.596
Token.
Token ID764
Feature activation+5.03e+00

Pos logit contributions
 JO+5.467
 Strait+5.441
atos+5.153
perse+4.973
iao+4.849
Neg logit contributions
hist-8.499
ham-8.367
quite-7.803
yip-7.802
endars-7.781
Tokenun
Token ID403
Feature activation+6.21e+00

Pos logit contributions
 Strait+3.471
 JO+3.413
atos+3.329
perse+3.217
natureconservancy+3.203
Neg logit contributions
hist-3.504
ham-3.436
endars-3.103
aic-3.082
quite-3.079
Tokenwrap
Token ID37150
Feature activation+1.69e+01

Pos logit contributions
 Strait+4.069
 JO+4.064
atos+4.021
perse+3.871
natureconservancy+3.829
Neg logit contributions
hist-4.405
ham-4.250
nda-3.916
endars-3.904
aic-3.862
Token ())
Token ID32865
Feature activation+8.09e+00

Pos logit contributions
perse+7.574
 JO+7.538
 Strait+7.527
atos+7.392
faces+6.679
Neg logit contributions
ham-12.654
hist-12.634
endars-12.262
yip-12.024
ulet-11.809
Token }
Token ID1782
Feature activation+5.35e+00

Pos logit contributions
 Strait+4.064
 JO+3.871
atos+3.715
perse+3.556
 transitions+3.533
Neg logit contributions
hist-7.033
ham-6.922
quite-6.386
aic-6.380
yip-6.365
Token }
Token ID1782
Feature activation+1.40e+00

Pos logit contributions
 Strait+2.563
 JO+2.428
atos+2.314
 transitions+2.198
perse+2.190
Neg logit contributions
hist-4.958
ham-4.864
yip-4.499
quite-4.482
aic-4.450
Token pub
Token ID2240
Feature activation+4.53e+00

Pos logit contributions
 Strait+0.864
 JO+0.826
atos+0.814
natureconservancy+0.787
perse+0.783
Neg logit contributions
hist-1.125
ham-1.099
quite-0.996
nda-0.990
endars-0.987
Token fn
Token ID24714
Feature activation+5.13e+00

Pos logit contributions
 Strait+2.071
 JO+1.986
atos+1.909
natureconservancy+1.798
iao+1.779
Neg logit contributions
hist-4.288
ham-4.178
yip-3.926
quite-3.918
endars-3.884
Token 100
Token ID1802
Feature activation+6.80e+00

Pos logit contributions
 Strait+2.243
 JO+2.177
atos+2.145
natureconservancy+2.075
perse+2.052
Neg logit contributions
hist-2.179
ham-2.136
quite-1.902
nda-1.888
endars-1.887
Token000
Token ID830
Feature activation+2.62e+00

Pos logit contributions
 Strait+2.587
 JO+2.578
atos+2.486
natureconservancy+2.308
iao+2.267
Neg logit contributions
hist-6.763
ham-6.637
quite-6.219
endars-6.186
yip-6.143
Token);
Token ID1776
Feature activation+2.59e-01

Pos logit contributions
 Strait+1.081
 JO+1.034
atos+1.017
natureconservancy+0.966
perse+0.943
Neg logit contributions
hist-2.614
ham-2.574
quite-2.379
nda-2.366
endars-2.365
Token file
Token ID2393
Feature activation+5.95e+00

Pos logit contributions
 Strait+0.166
 JO+0.160
atos+0.159
natureconservancy+0.154
perse+0.152
Neg logit contributions
hist-0.200
ham-0.196
quite-0.177
nda-0.176
endars-0.175
Token.
Token ID13
Feature activation+6.68e+00

Pos logit contributions
 Strait+3.452
 JO+3.396
atos+3.325
natureconservancy+3.298
iao+3.187
Neg logit contributions
hist-4.709
ham-4.654
yip-4.327
endars-4.247
aic-4.224
Tokenclose
Token ID19836
Feature activation+1.63e+01

Pos logit contributions
 Strait+4.393
 JO+4.333
atos+4.253
perse+4.189
natureconservancy+4.152
Neg logit contributions
hist-4.577
ham-4.542
endars-4.170
yip-4.164
nda-4.083
Token_
Token ID62
Feature activation+9.33e+00

Pos logit contributions
 JO+6.298
 Strait+6.052
atos+6.011
ileged+5.539
perse+5.407
Neg logit contributions
ham-13.924
hist-13.671
yip-13.088
endars-13.077
quite-12.853
Tokenand
Token ID392
Feature activation+4.01e+00

Pos logit contributions
atos+6.534
 JO+6.361
 Strait+6.324
perse+6.166
natureconservancy+6.141
Neg logit contributions
hist-5.682
ham-5.615
endars-5.199
yip-5.198
aic-5.117
Token_
Token ID62
Feature activation-7.99e-01

Pos logit contributions
 Strait+2.080
 JO+2.002
atos+1.998
perse+1.902
natureconservancy+1.853
Neg logit contributions
hist-3.460
ham-3.428
endars-3.153
yip-3.152
nda-3.142
Tokenev
Token ID1990
Feature activation+3.92e-01

Pos logit contributions
hist+0.533
ham+0.519
quite+0.460
nda+0.456
aic+0.450
Neg logit contributions
 Strait-0.606
 JO-0.582
atos-0.580
natureconservancy-0.566
perse-0.557
Tokenict
Token ID713
Feature activation+4.08e+00

Pos logit contributions
 Strait+0.292
 JO+0.281
atos+0.280
natureconservancy+0.274
perse+0.270
Neg logit contributions
hist-0.264
ham-0.258
quite-0.228
nda-0.226
aic-0.226
Token White
Token ID2635
Feature activation+3.83e+00

Pos logit contributions
hist+2.509
ham+2.464
quite+2.200
nda+2.167
aic+2.152
Neg logit contributions
 Strait-2.397
atos-2.330
natureconservancy-2.323
 JO-2.290
perse-2.276
Token House
Token ID2097
Feature activation+4.84e+00

Pos logit contributions
 Strait+2.709
 JO+2.580
atos+2.520
iao+2.414
perse+2.408
Neg logit contributions
hist-2.719
ham-2.619
endars-2.441
yip-2.412
quite-2.393
Token (
Token ID357
Feature activation-3.79e+00

Pos logit contributions
 Strait+2.788
 JO+2.764
atos+2.656
natureconservancy+2.529
perse+2.505
Neg logit contributions
hist-3.963
ham-3.824
nda-3.553
yip-3.537
aic-3.528
Tokenshe
Token ID7091
Feature activation+3.77e+00

Pos logit contributions
hist+2.377
ham+2.306
quite+2.069
nda+1.957
laws+1.944
Neg logit contributions
 Strait-3.040
 JO-2.924
atos-2.923
natureconservancy-2.878
perse-2.812
Token turned
Token ID2900
Feature activation+3.08e+00

Pos logit contributions
 Strait+2.550
 JO+2.461
atos+2.441
perse+2.344
natureconservancy+2.341
Neg logit contributions
hist-2.762
ham-2.688
aic-2.415
quite-2.413
nda-2.404
Token down
Token ID866
Feature activation+1.58e+01

Pos logit contributions
 Strait+2.038
 JO+1.951
atos+1.936
natureconservancy+1.849
perse+1.840
Neg logit contributions
hist-2.316
ham-2.262
endars-2.038
quite-2.038
nda-2.028
Token a
Token ID257
Feature activation+9.55e+00

Pos logit contributions
 JO+8.715
 Strait+8.405
atos+7.888
 transitions+7.887
iao+7.616
Neg logit contributions
hist-11.594
ham-11.320
aic-10.688
yip-10.542
endars-10.522
Token request
Token ID2581
Feature activation+4.18e+00

Pos logit contributions
 Strait+5.879
 JO+5.829
atos+5.349
iao+5.239
perse+5.184
Neg logit contributions
hist-7.106
ham-7.058
aic-6.565
endars-6.502
yip-6.426
Token from
Token ID422
Feature activation+1.72e+00

Pos logit contributions
 Strait+2.657
 JO+2.595
atos+2.518
perse+2.452
iao+2.422
Neg logit contributions
hist-3.167
ham-3.077
aic-2.817
endars-2.806
yip-2.799
Token The
Token ID383
Feature activation+1.49e+00

Pos logit contributions
 Strait+1.127
 JO+1.084
atos+1.074
natureconservancy+1.035
perse+1.030
Neg logit contributions
hist-1.309
ham-1.282
quite-1.152
nda-1.147
endars-1.145
Token Intercept
Token ID37127
Feature activation+5.82e+00

Pos logit contributions
 Strait+1.067
 JO+1.027
atos+1.018
natureconservancy+0.985
perse+0.978
Neg logit contributions
hist-1.041
ham-1.017
quite-0.905
nda-0.903
endars-0.902
Token
Token ID447
Feature activation+4.10e+00

Pos logit contributions
 Strait+1.857
atos+1.800
 JO+1.789
natureconservancy+1.734
perse+1.726
Neg logit contributions
hist-1.880
ham-1.838
endars-1.638
quite-1.637
nda-1.632
Token
Token ID247
Feature activation+2.34e+00

Pos logit contributions
 Strait+2.517
 JO+2.435
atos+2.426
natureconservancy+2.363
perse+2.344
Neg logit contributions
hist-3.231
ham-3.142
yip-2.859
epad-2.840
quite-2.829
Tokent
Token ID83
Feature activation+1.53e+00

Pos logit contributions
 Strait+1.415
atos+1.354
 JO+1.338
natureconservancy+1.302
perse+1.292
Neg logit contributions
hist-1.880
ham-1.845
endars-1.683
yip-1.675
aic-1.662
Token crowd
Token ID4315
Feature activation-4.82e-01

Pos logit contributions
 Strait+0.879
 JO+0.850
atos+0.830
natureconservancy+0.810
perse+0.805
Neg logit contributions
hist-1.278
ham-1.250
endars-1.144
nda-1.140
aic-1.139
Token the
Token ID262
Feature activation+3.58e+00

Pos logit contributions
hist+0.362
ham+0.355
quite+0.317
nda+0.316
aic+0.314
Neg logit contributions
 Strait-0.319
 JO-0.307
atos-0.306
natureconservancy-0.297
perse-0.294
Token pan
Token ID3425
Feature activation+1.50e+01

Pos logit contributions
 Strait+1.970
 JO+1.865
atos+1.862
natureconservancy+1.787
perse+1.755
Neg logit contributions
hist-3.070
ham-2.980
yip-2.799
endars-2.794
quite-2.755
Token;
Token ID26
Feature activation-1.82e+00

Pos logit contributions
 Strait+4.378
 JO+3.605
natureconservancy+3.417
atos+3.377
iao+3.072
Neg logit contributions
yip-15.565
hist-15.136
endars-15.134
ham-14.952
��極-14.814
Token only
Token ID691
Feature activation-1.52e+00

Pos logit contributions
hist+1.233
ham+1.214
quite+1.065
nda+1.059
aic+1.047
Neg logit contributions
 Strait-1.338
atos-1.290
 JO-1.283
natureconservancy-1.259
perse-1.239
Token cook
Token ID4255
Feature activation+3.24e+00

Pos logit contributions
hist+1.001
ham+0.983
quite+0.854
aic+0.844
nda+0.843
Neg logit contributions
 Strait-1.163
 JO-1.115
atos-1.111
natureconservancy-1.090
perse-1.078
Token about
Token ID546
Feature activation+5.53e+00

Pos logit contributions
 Strait+2.277
 JO+2.162
atos+2.156
natureconservancy+2.098
perse+2.062
Neg logit contributions
hist-2.283
ham-2.228
endars-2.056
yip-2.018
nda-2.004
Token 3
Token ID513
Feature activation+5.13e-01

Pos logit contributions
 Strait+3.600
 JO+3.498
atos+3.415
natureconservancy+3.311
perse+3.209
Neg logit contributions
hist-4.159
ham-4.049
endars-3.787
yip-3.757
quite-3.661
Token Key
Token ID7383
Feature activation+6.84e+00

Pos logit contributions
hist+0.942
ham+0.921
quite+0.821
nda+0.815
aic+0.806
Neg logit contributions
 Strait-0.838
atos-0.800
 JO-0.798
natureconservancy-0.781
perse-0.767
TokenEx
Token ID3109
Feature activation+2.81e+00

Pos logit contributions
 Strait+4.329
 JO+4.198
atos+4.096
perse+4.006
natureconservancy+3.893
Neg logit contributions
hist-5.121
ham-4.999
quite-4.608
endars-4.598
yip-4.593
Tokenists
Token ID1023
Feature activation+7.18e+00

Pos logit contributions
 Strait+2.169
 JO+2.103
atos+2.089
perse+2.029
natureconservancy+2.028
Neg logit contributions
hist-1.787
ham-1.740
quite-1.539
nda-1.528
endars-1.523
TokenQ
Token ID48
Feature activation+6.02e+00

Pos logit contributions
 Strait+4.177
 JO+4.084
atos+3.963
perse+3.896
natureconservancy+3.713
Neg logit contributions
hist-5.723
ham-5.587
aic-5.147
endars-5.137
yip-5.136
Token[
Token ID58
Feature activation-3.72e-01

Pos logit contributions
 Strait+3.972
 JO+3.883
atos+3.809
perse+3.682
natureconservancy+3.648
Neg logit contributions
hist-4.422
ham-4.310
quite-3.923
aic-3.879
endars-3.869
Tokenstep
Token ID9662
Feature activation+1.50e+01

Pos logit contributions
hist+0.289
ham+0.283
quite+0.253
nda+0.253
aic+0.250
Neg logit contributions
 Strait-0.240
atos-0.229
 JO-0.229
natureconservancy-0.224
perse-0.219
TokenNew
Token ID3791
Feature activation+8.39e+00

Pos logit contributions
 Strait+8.654
 JO+8.557
atos+8.404
perse+8.049
 transitions+7.791
Neg logit contributions
hist-10.114
ham-10.001
yip-9.503
endars-9.394
aic-9.350
Token,
Token ID11
Feature activation+3.76e+00

Pos logit contributions
 Strait+5.234
 JO+5.063
atos+4.987
perse+4.812
natureconservancy+4.703
Neg logit contributions
hist-6.211
ham-6.040
yip-5.618
quite-5.567
aic-5.564
Token arrival
Token ID10325
Feature activation+5.81e+00

Pos logit contributions
 Strait+2.556
 JO+2.475
atos+2.442
natureconservancy+2.358
perse+2.349
Neg logit contributions
hist-2.721
ham-2.666
quite-2.397
aic-2.389
endars-2.387
TokenLocation
Token ID14749
Feature activation+8.14e+00

Pos logit contributions
 Strait+3.318
 JO+3.172
atos+3.161
natureconservancy+2.984
perse+2.984
Neg logit contributions
hist-4.763
ham-4.673
aic-4.298
yip-4.285
endars-4.280
Token]
Token ID60
Feature activation+8.56e+00

Pos logit contributions
 Strait+4.344
 JO+4.206
atos+4.103
perse+3.897
natureconservancy+3.807
Neg logit contributions
hist-6.865
ham-6.709
aic-6.246
yip-6.243
endars-6.213
Token struct
Token ID2878
Feature activation+5.10e+00

Pos logit contributions
 Strait+1.313
 JO+1.258
atos+1.243
natureconservancy+1.208
perse+1.184
Neg logit contributions
hist-1.731
ham-1.697
quite-1.543
endars-1.534
nda-1.531
Tokenuring
Token ID870
Feature activation+2.62e+00

Pos logit contributions
 Strait+3.837
 JO+3.679
atos+3.672
natureconservancy+3.575
iao+3.566
Neg logit contributions
hist-3.234
ham-3.190
yip-2.907
endars-2.886
aic-2.820
Token kernel
Token ID9720
Feature activation+8.43e+00

Pos logit contributions
 Strait+1.605
 JO+1.534
atos+1.505
natureconservancy+1.464
perse+1.435
Neg logit contributions
hist-2.108
ham-2.062
quite-1.877
endars-1.875
aic-1.858
Token rect
Token ID13621
Feature activation+1.34e+01

Pos logit contributions
 Strait+4.944
 JO+4.770
atos+4.553
natureconservancy+4.510
 transitions+4.399
Neg logit contributions
hist-6.636
ham-6.501
endars-6.258
yip-6.149
aic-6.127
TokenK
Token ID42
Feature activation+1.12e+00

Pos logit contributions
 JO+5.413
 Strait+5.266
atos+5.086
iao+4.787
natureconservancy+4.423
Neg logit contributions
hist-11.982
endars-11.977
yip-11.967
ham-11.685
antine-11.189
Tokenernel
Token ID7948
Feature activation+1.50e+01

Pos logit contributions
 Strait+0.749
atos+0.723
 JO+0.718
natureconservancy+0.697
perse+0.693
Neg logit contributions
hist-0.823
ham-0.804
quite-0.722
endars-0.717
nda-0.714
Token =
Token ID796
Feature activation+2.58e+00

Pos logit contributions
 JO+3.266
 Strait+3.166
atos+2.670
iao+2.538
natureconservancy+2.517
Neg logit contributions
endars-16.002
hist-15.929
yip-15.654
ham-15.343
��極-15.287
Token c
Token ID269
Feature activation+3.66e+00

Pos logit contributions
 Strait+1.334
 JO+1.266
atos+1.250
natureconservancy+1.205
perse+1.186
Neg logit contributions
hist-2.322
ham-2.283
quite-2.090
endars-2.089
yip-2.082
Tokenv
Token ID85
Feature activation+8.59e+00

Pos logit contributions
 Strait+3.285
atos+3.227
 JO+3.189
natureconservancy+3.142
perse+3.112
Neg logit contributions
hist-1.775
ham-1.742
yip-1.535
endars-1.533
aic-1.469
Token2
Token ID17
Feature activation+1.60e+00

Pos logit contributions
 Strait+3.390
atos+3.219
 JO+3.143
natureconservancy+3.091
iao+3.029
Neg logit contributions
hist-8.150
ham-8.063
yip-7.970
endars-7.907
epad-7.513
Token.
Token ID13
Feature activation+9.73e+00

Pos logit contributions
 Strait+1.075
 JO+1.033
atos+1.026
natureconservancy+0.999
perse+0.986
Neg logit contributions
hist-1.184
ham-1.159
quite-1.039
endars-1.039
nda-1.037
Token (
Token ID357
Feature activation+1.14e+00

Pos logit contributions
 Strait+1.193
 JO+1.152
atos+1.121
natureconservancy+1.088
perse+1.079
Neg logit contributions
hist-1.550
ham-1.529
quite-1.391
endars-1.389
nda-1.381
Token13
Token ID1485
Feature activation+5.65e+00

Pos logit contributions
 Strait+0.683
 JO+0.657
atos+0.653
natureconservancy+0.630
perse+0.623
Neg logit contributions
hist-0.912
ham-0.895
endars-0.814
nda-0.813
quite-0.812
Token,
Token ID11
Feature activation-2.62e+00

Pos logit contributions
 Strait+2.969
 JO+2.844
atos+2.766
natureconservancy+2.702
iao+2.633
Neg logit contributions
hist-4.896
ham-4.741
endars-4.583
yip-4.545
quite-4.413
Token 5
Token ID642
Feature activation+9.34e+00

Pos logit contributions
hist+1.575
ham+1.533
nda+1.328
quite+1.327
laws+1.292
Neg logit contributions
 Strait-2.089
atos-2.072
 JO-2.018
perse-2.015
natureconservancy-2.012
Token))
Token ID4008
Feature activation+1.14e+00

Pos logit contributions
 Strait+4.035
 JO+3.841
atos+3.524
perse+3.503
natureconservancy+3.437
Neg logit contributions
hist-8.744
ham-8.405
yip-8.174
quite-8.063
endars-8.061
Token sq
Token ID19862
Feature activation+1.48e+01

Pos logit contributions
 Strait+0.613
 JO+0.583
atos+0.578
natureconservancy+0.556
perse+0.550
Neg logit contributions
hist-0.994
ham-0.976
quite-0.891
nda-0.889
endars-0.885
TokenK
Token ID42
Feature activation+3.07e+00

Pos logit contributions
 Strait+5.182
 JO+5.104
atos+4.995
iao+4.699
Dial+4.371
Neg logit contributions
yip-13.757
hist-13.658
endars-13.537
ham-13.340
antine-12.861
Tokenernel
Token ID7948
Feature activation+1.38e+01

Pos logit contributions
 Strait+1.956
atos+1.904
 JO+1.860
iao+1.815
perse+1.813
Neg logit contributions
hist-2.339
ham-2.274
endars-2.053
quite-2.052
yip-2.036
Token =
Token ID796
Feature activation+3.52e+00

Pos logit contributions
 Strait+3.085
 JO+2.986
atos+2.622
iao+2.444
natureconservancy+2.349
Neg logit contributions
hist-14.842
endars-14.736
yip-14.526
ham-14.333
��極-14.146
Token c
Token ID269
Feature activation+3.55e+00

Pos logit contributions
 Strait+1.741
 JO+1.650
atos+1.622
natureconservancy+1.566
perse+1.532
Neg logit contributions
hist-3.225
ham-3.170
yip-2.928
endars-2.921
quite-2.917
Tokenv
Token ID85
Feature activation+8.90e+00

Pos logit contributions
 Strait+3.182
atos+3.125
 JO+3.076
natureconservancy+3.035
perse+3.007
Neg logit contributions
hist-1.729
ham-1.695
yip-1.501
endars-1.491
aic-1.426
TokenBL
Token ID9148
Feature activation+3.84e+00

Pos logit contributions
 JO+4.787
 Strait+4.566
atos+4.382
natureconservancy+4.367
OULD+4.221
Neg logit contributions
hist-6.505
ham-6.257
endars-6.184
yip-6.183
epad-5.892
TokenACK
Token ID8120
Feature activation+7.08e+00

Pos logit contributions
 Strait+2.310
 JO+2.232
atos+2.200
natureconservancy+2.117
iao+2.070
Neg logit contributions
hist-3.087
ham-3.010
endars-2.788
yip-2.773
nda-2.755
TokenH
Token ID39
Feature activation-5.85e-01

Pos logit contributions
 JO+3.664
 Strait+3.578
atos+3.396
natureconservancy+3.239
iao+3.159
Neg logit contributions
hist-5.985
endars-5.796
ham-5.776
yip-5.736
nda-5.720
TokenAT
Token ID1404
Feature activation+5.18e+00

Pos logit contributions
hist+0.466
ham+0.457
quite+0.410
nda+0.406
aic+0.406
Neg logit contributions
 Strait-0.366
 JO-0.347
atos-0.344
natureconservancy-0.338
perse-0.336
Token,
Token ID837
Feature activation+3.80e+00

Pos logit contributions
 Strait+2.372
 JO+2.312
atos+2.192
natureconservancy+2.076
iao+2.072
Neg logit contributions
hist-4.885
ham-4.761
endars-4.538
yip-4.491
quite-4.451
Token rect
Token ID13621
Feature activation+1.47e+01

Pos logit contributions
 Strait+2.438
 JO+2.338
atos+2.290
natureconservancy+2.264
perse+2.226
Neg logit contributions
hist-2.896
ham-2.839
yip-2.583
aic-2.581
endars-2.576
TokenK
Token ID42
Feature activation+1.74e+00

Pos logit contributions
 JO+5.614
atos+5.468
 Strait+5.447
iao+4.928
faces+4.629
Neg logit contributions
endars-13.181
yip-13.128
hist-12.944
ham-12.561
 rul-12.518
Tokenernel
Token ID7948
Feature activation+8.54e+00

Pos logit contributions
 Strait+1.155
atos+1.118
 JO+1.104
natureconservancy+1.075
perse+1.073
Neg logit contributions
hist-1.287
ham-1.257
quite-1.132
endars-1.123
nda-1.120
Token )
Token ID1267
Feature activation-2.46e+00

Pos logit contributions
 Strait+4.173
 JO+4.055
atos+3.780
iao+3.632
natureconservancy+3.597
Neg logit contributions
hist-7.511
ham-7.314
yip-7.294
endars-7.252
quite-6.897
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
hist+1.526
ham+1.484
quite+1.287
nda+1.283
aic+1.265
Neg logit contributions
 Strait-1.953
atos-1.893
 JO-1.876
natureconservancy-1.842
perse-1.833
Token
Token ID198
Feature activation-3.13e+00

Pos logit contributions
hist+0.743
ham+0.725
quite+0.639
nda+0.633
aic+0.626
Neg logit contributions
 Strait-0.827
atos-0.797
 JO-0.794
natureconservancy-0.775
perse-0.770
Token
Token ID198
Feature activation+2.34e+00

Pos logit contributions
hist+0.230
ham+0.225
quite+0.200
nda+0.198
aic+0.197
Neg logit contributions
 Strait-0.232
 JO-0.223
atos-0.222
natureconservancy-0.216
perse-0.214
Token
Token ID198
Feature activation+2.33e+00

Pos logit contributions
 Strait+1.606
 JO+1.557
atos+1.540
natureconservancy+1.500
perse+1.485
Neg logit contributions
hist-1.679
ham-1.637
endars-1.493
yip-1.477
nda-1.470
Token'
Token ID6
Feature activation+7.29e+00

Pos logit contributions
 Strait+1.693
 JO+1.645
atos+1.634
natureconservancy+1.588
perse+1.577
Neg logit contributions
hist-1.574
ham-1.539
quite-1.365
nda-1.364
endars-1.357
Token )
Token ID1267
Feature activation+5.46e+00

Pos logit contributions
 JO+3.883
 Strait+3.759
atos+3.462
perse+3.400
iao+3.338
Neg logit contributions
hist-6.246
ham-6.112
quite-5.703
endars-5.667
aic-5.621
Token AS
Token ID7054
Feature activation+1.79e+00

Pos logit contributions
 JO+2.691
 Strait+2.688
atos+2.488
perse+2.407
natureconservancy+2.382
Neg logit contributions
hist-4.941
ham-4.860
quite-4.475
endars-4.454
aic-4.450
Token line
Token ID1627
Feature activation+1.47e+01

Pos logit contributions
 Strait+1.205
 JO+1.166
atos+1.148
natureconservancy+1.118
perse+1.108
Neg logit contributions
hist-1.320
ham-1.293
quite-1.158
nda-1.154
endars-1.148
Token FROM
Token ID16034
Feature activation+6.00e+00

Pos logit contributions
 JO+7.617
 Strait+6.786
perse+6.557
 INTO+6.398
atos+6.362
Neg logit contributions
hist-11.735
ham-11.577
quite-10.904
aic-10.810
endars-10.736
Token [
Token ID685
Feature activation+2.71e+00

Pos logit contributions
 JO+4.058
 Strait+4.031
atos+3.846
natureconservancy+3.729
perse+3.729
Neg logit contributions
hist-4.230
ham-4.111
quite-3.749
aic-3.744
nda-3.743
Token cl
Token ID537
Feature activation+1.09e+00

Pos logit contributions
 Strait+1.775
 JO+1.738
atos+1.700
natureconservancy+1.653
perse+1.641
Neg logit contributions
hist-2.021
ham-1.974
quite-1.782
nda-1.775
endars-1.771
Tokenojure
Token ID32511
Feature activation-2.21e+00

Pos logit contributions
 Strait+0.845
 JO+0.820
atos+0.817
natureconservancy+0.795
perse+0.789
Neg logit contributions
hist-0.691
ham-0.673
quite-0.592
nda-0.588
endars-0.585
Token.
Token ID764
Feature activation-3.82e-02

Pos logit contributions
hist+1.728
ham+1.686
quite+1.513
nda+1.508
aic+1.497
Neg logit contributions
 Strait-1.412
atos-1.341
 JO-1.337
natureconservancy-1.304
perse-1.279
Tokenulate
Token ID5039
Feature activation+1.56e+00

Pos logit contributions
 Strait+2.587
atos+2.516
 JO+2.476
perse+2.388
natureconservancy+2.352
Neg logit contributions
hist-3.736
ham-3.644
quite-3.329
nda-3.303
yip-3.275
Token density
Token ID12109
Feature activation-1.23e+00

Pos logit contributions
 Strait+1.023
 JO+0.983
atos+0.977
natureconservancy+0.947
perse+0.940
Neg logit contributions
hist-1.184
ham-1.159
quite-1.041
nda-1.035
endars-1.030
Token grid
Token ID10706
Feature activation+4.44e-01

Pos logit contributions
hist+0.947
ham+0.933
nda+0.829
quite+0.829
aic+0.822
Neg logit contributions
 Strait-0.798
atos-0.764
 JO-0.759
natureconservancy-0.746
perse-0.731
Token.
Token ID13
Feature activation+6.67e+00

Pos logit contributions
 Strait+0.274
atos+0.261
 JO+0.260
natureconservancy+0.254
perse+0.248
Neg logit contributions
hist-0.356
ham-0.350
quite-0.315
nda-0.315
aic-0.313
Token const
Token ID1500
Feature activation+4.57e+00

Pos logit contributions
 Strait+4.356
 JO+4.271
atos+4.202
perse+4.158
natureconservancy+4.079
Neg logit contributions
hist-4.759
ham-4.679
yip-4.225
endars-4.219
quite-4.202
Token size
Token ID2546
Feature activation+1.45e+01

Pos logit contributions
 Strait+2.882
 JO+2.785
atos+2.731
perse+2.637
natureconservancy+2.612
Neg logit contributions
hist-3.559
ham-3.474
quite-3.156
endars-3.115
yip-3.106
Token_
Token ID62
Feature activation+7.61e+00

Pos logit contributions
 Strait+7.773
 JO+7.488
atos+7.469
perse+7.306
iao+7.054
Neg logit contributions
hist-10.889
ham-10.537
yip-10.232
endars-9.807
aic-9.778
Tokent
Token ID83
Feature activation+6.12e+00

Pos logit contributions
 Strait+5.028
atos+5.004
 JO+4.896
perse+4.768
natureconservancy+4.748
Neg logit contributions
hist-5.316
ham-5.131
endars-4.692
aic-4.691
yip-4.672
Token num
Token ID997
Feature activation+7.32e+00

Pos logit contributions
 Strait+4.082
 JO+3.990
atos+3.877
perse+3.762
natureconservancy+3.739
Neg logit contributions
hist-4.455
ham-4.318
quite-3.905
endars-3.891
yip-3.890
TokenPart
Token ID7841
Feature activation+8.90e+00

Pos logit contributions
 Strait+4.505
atos+4.466
 JO+4.428
perse+4.350
natureconservancy+4.205
Neg logit contributions
hist-5.468
ham-5.296
yip-4.930
endars-4.805
quite-4.788
Tokenicles
Token ID2983
Feature activation+9.36e+00

Pos logit contributions
atos+5.605
 Strait+5.595
 JO+5.476
perse+5.382
iao+5.183
Neg logit contributions
hist-6.348
ham-6.157
yip-5.756
epad-5.648
quite-5.592
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
hist+0.234
ham+0.229
quite+0.207
nda+0.206
aic+0.204
Neg logit contributions
 Strait-0.182
 JO-0.174
atos-0.174
natureconservancy-0.168
perse-0.166
Token The
Token ID383
Feature activation-2.50e+00

Pos logit contributions
hist+1.547
ham+1.515
quite+1.359
nda+1.358
endars+1.345
Neg logit contributions
 Strait-1.278
atos-1.232
 JO-1.220
natureconservancy-1.208
perse-1.187
Token San
Token ID2986
Feature activation+7.23e+00

Pos logit contributions
hist+1.800
ham+1.763
quite+1.567
nda+1.554
aic+1.550
Neg logit contributions
 Strait-1.717
atos-1.663
 JO-1.653
natureconservancy-1.640
perse-1.619
Token Antonio
Token ID11366
Feature activation+3.85e+00

Pos logit contributions
 Strait+3.840
atos+3.574
 JO+3.554
iao+3.279
perse+3.257
Neg logit contributions
hist-6.264
ham-6.091
aic-5.686
nda-5.661
endars-5.630
Token Express
Token ID10604
Feature activation+4.36e+00

Pos logit contributions
 Strait+2.209
 JO+2.105
atos+2.094
natureconservancy+1.998
perse+1.972
Neg logit contributions
hist-3.246
ham-3.174
nda-2.904
yip-2.881
quite-2.881
Token-
Token ID12
Feature activation+1.44e+01

Pos logit contributions
 Strait+2.957
 JO+2.875
atos+2.817
perse+2.681
iao+2.677
Neg logit contributions
hist-3.172
ham-3.108
yip-2.810
quite-2.787
nda-2.770
TokenNews
Token ID9980
Feature activation+1.98e+00

Pos logit contributions
 Strait+9.066
 JO+9.039
atos+8.790
iao+8.454
perse+8.451
Neg logit contributions
ham-9.251
yip-9.215
antine-8.784
hist-8.625
ulet-8.407
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
 Strait+1.167
 JO+1.123
atos+1.109
perse+1.052
natureconservancy+1.050
Neg logit contributions
hist-1.621
ham-1.579
quite-1.446
nda-1.435
aic-1.434
Token Go
Token ID1514
Feature activation-5.58e-01

Pos logit contributions
hist+1.282
ham+1.255
quite+1.126
nda+1.126
endars+1.115
Neg logit contributions
 Strait-1.050
atos-1.011
 JO-1.003
natureconservancy-0.991
perse-0.973
TokenN
Token ID45
Feature activation-3.43e+00

Pos logit contributions
hist+0.407
ham+0.399
quite+0.355
nda+0.353
endars+0.352
Neg logit contributions
 Strait-0.384
 JO-0.368
atos-0.367
natureconservancy-0.360
perse-0.356
Tokenom
Token ID296
Feature activation+3.04e+00

Pos logit contributions
hist+2.424
ham+2.402
nda+2.101
quite+2.099
endars+2.093
Neg logit contributions
 Strait-2.437
 JO-2.311
natureconservancy-2.310
atos-2.308
perse-2.240
Token discern
Token ID22024
Feature activation+5.41e+00

Pos logit contributions
 Strait+1.754
 JO+1.522
atos+1.507
natureconservancy+1.385
perse+1.317
Neg logit contributions
hist-8.698
ham-8.656
yip-8.236
endars-8.233
aic-8.054
Token the
Token ID262
Feature activation+5.16e+00

Pos logit contributions
 Strait+3.076
 JO+2.913
atos+2.885
natureconservancy+2.803
perse+2.757
Neg logit contributions
hist-4.514
ham-4.421
endars-4.114
yip-4.037
quite-4.031
Token size
Token ID2546
Feature activation+9.55e+00

Pos logit contributions
 Strait+3.280
 JO+3.157
atos+3.101
natureconservancy+3.035
iao+2.982
Neg logit contributions
hist-3.955
ham-3.879
endars-3.556
yip-3.518
quite-3.497
Token of
Token ID286
Feature activation+8.94e+00

Pos logit contributions
 Strait+4.660
 JO+4.367
atos+4.263
natureconservancy+4.263
iao+3.975
Neg logit contributions
hist-8.317
endars-8.151
ham-8.076
yip-8.054
quite-7.588
Token individual
Token ID1981
Feature activation+6.41e-03

Pos logit contributions
 Strait+3.203
 JO+3.128
atos+2.949
perse+2.856
iao+2.742
Neg logit contributions
hist-9.008
ham-8.726
endars-8.437
yip-8.412
quite-8.305
Token pixels
Token ID17848
Feature activation+1.44e+01

Pos logit contributions
 Strait+0.004
 JO+0.004
atos+0.004
natureconservancy+0.003
perse+0.003
Neg logit contributions
hist-0.005
ham-0.005
quite-0.005
nda-0.005
aic-0.005
Token due
Token ID2233
Feature activation-2.85e-01

Pos logit contributions
 Strait+4.189
 JO+3.632
atos+3.161
 transitions+2.989
natureconservancy+2.884
Neg logit contributions
hist-14.206
ham-14.099
endars-14.016
itia-13.572
epad-13.555
Token to
Token ID284
Feature activation+4.51e+00

Pos logit contributions
hist+0.229
ham+0.224
quite+0.201
nda+0.201
aic+0.198
Neg logit contributions
 Strait-0.176
atos-0.168
 JO-0.168
natureconservancy-0.163
perse-0.161
Token the
Token ID262
Feature activation+5.38e+00

Pos logit contributions
 Strait+2.637
 JO+2.498
atos+2.468
natureconservancy+2.398
perse+2.347
Neg logit contributions
hist-3.704
ham-3.638
endars-3.394
yip-3.359
quite-3.320
Token amount
Token ID2033
Feature activation+1.81e+00

Pos logit contributions
 Strait+3.621
 JO+3.436
atos+3.426
natureconservancy+3.394
iao+3.330
Neg logit contributions
hist-3.857
ham-3.747
endars-3.572
yip-3.475
nda-3.443
Token they
Token ID484
Feature activation+6.09e-01

Pos logit contributions
 Strait+1.002
 JO+0.952
atos+0.949
natureconservancy+0.918
perse+0.902
Neg logit contributions
hist-1.557
ham-1.530
quite-1.392
nda-1.388
endars-1.387
Token continue
Token ID2555
Feature activation+9.40e+00

Pos logit contributions
 Strait+4.856
perse+4.458
 JO+4.387
natureconservancy+4.347
atos+4.325
Neg logit contributions
hist-6.044
ham-5.926
yip-5.648
endars-5.541
Reloaded-5.445
Token to
Token ID284
Feature activation+7.12e+00

Pos logit contributions
 Strait+4.105
 JO+3.918
atos+3.723
iao+3.475
natureconservancy+3.460
Neg logit contributions
hist-8.580
ham-8.403
yip-8.356
endars-8.348
Reloaded-7.927
Token write
Token ID3551
Feature activation+8.02e+00

Pos logit contributions
 Strait+3.622
 JO+3.441
atos+3.388
natureconservancy+3.250
iao+3.236
Neg logit contributions
hist-6.330
ham-6.127
yip-5.761
quite-5.751
endars-5.747
Token about
Token ID546
Feature activation+6.89e+00

Pos logit contributions
 Strait+4.813
 JO+4.617
atos+4.527
natureconservancy+4.411
iao+4.339
Neg logit contributions
hist-6.054
ham-5.973
endars-5.826
yip-5.807
quite-5.492
Token the
Token ID262
Feature activation+8.62e+00

Pos logit contributions
 Strait+3.582
 JO+3.401
atos+3.308
natureconservancy+3.184
iao+3.095
Neg logit contributions
hist-5.916
ham-5.829
yip-5.689
endars-5.591
quite-5.423
Token issues
Token ID2428
Feature activation+1.44e+01

Pos logit contributions
 Strait+4.648
 JO+4.323
natureconservancy+4.295
atos+4.282
 transitions+4.263
Neg logit contributions
hist-6.958
ham-6.849
yip-6.695
antine-6.480
aic-6.386
Token that
Token ID326
Feature activation+8.94e+00

Pos logit contributions
 Strait+5.264
 JO+4.672
iao+4.646
atos+4.428
natureconservancy+4.399
Neg logit contributions
endars-13.635
yip-13.386
hist-13.066
��極-13.049
antine-12.734
Token matter
Token ID2300
Feature activation+8.68e+00

Pos logit contributions
 Strait+3.000
iao+2.835
natureconservancy+2.633
 JO+2.502
perse+2.402
Neg logit contributions
hist-8.904
endars-8.783
ham-8.741
yip-8.730
antine-8.659
Token.
Token ID13
Feature activation+6.03e+00

Pos logit contributions
 Strait+4.729
 JO+4.530
natureconservancy+4.305
atos+4.247
iao+4.191
Neg logit contributions
hist-7.130
ham-6.881
yip-6.875
endars-6.848
nda-6.514
Token
Token ID198
Feature activation+5.47e+00

Pos logit contributions
 Strait+3.748
 JO+3.678
atos+3.495
natureconservancy+3.482
perse+3.369
Neg logit contributions
hist-4.577
ham-4.470
endars-4.308
yip-4.260
quite-4.153
Token
Token ID198
Feature activation+7.55e+00

Pos logit contributions
 Strait+3.352
 JO+3.255
atos+3.156
natureconservancy+3.077
iao+3.017
Neg logit contributions
hist-4.164
ham-4.058
yip-4.014
endars-3.975
nda-3.766
TokenSanta
Token ID42694
Feature activation+2.40e+00

Pos logit contributions
hist+2.156
ham+2.104
quite+1.889
nda+1.866
aic+1.861
Neg logit contributions
 Strait-1.967
 JO-1.892
atos-1.860
natureconservancy-1.846
perse-1.807
Token Clara
Token ID27443
Feature activation-2.53e+00

Pos logit contributions
 Strait+1.825
 JO+1.761
atos+1.745
natureconservancy+1.690
perse+1.683
Neg logit contributions
hist-1.578
ham-1.534
quite-1.356
nda-1.353
endars-1.352
Token County
Token ID3418
Feature activation-6.53e+00

Pos logit contributions
hist+1.862
ham+1.833
nda+1.630
quite+1.629
aic+1.618
Neg logit contributions
 Strait-1.709
atos-1.652
 JO-1.638
natureconservancy-1.623
perse-1.601
Token v
Token ID410
Feature activation+8.75e-02

Pos logit contributions
hist+4.072
ham+4.038
nda+3.496
quite+3.442
aic+3.420
Neg logit contributions
 Strait-4.974
natureconservancy-4.926
atos-4.855
perse-4.834
 JO-4.733
Token.
Token ID13
Feature activation+6.95e+00

Pos logit contributions
 Strait+0.061
atos+0.059
 JO+0.059
natureconservancy+0.057
perse+0.057
Neg logit contributions
hist-0.063
ham-0.062
quite-0.055
nda-0.055
aic-0.054
Token Southern
Token ID8050
Feature activation+1.40e+01

Pos logit contributions
 Strait+3.474
 JO+3.260
atos+3.173
iao+2.875
natureconservancy+2.843
Neg logit contributions
hist-6.236
ham-6.153
endars-5.766
quite-5.731
yip-5.689
Token Pacific
Token ID8211
Feature activation+1.82e+00

Pos logit contributions
 Strait+3.301
 JO+2.414
atos+2.148
iao+1.779
perse+1.549
Neg logit contributions
hist-15.634
ham-15.444
endars-15.370
yip-15.039
Reloaded-14.672
Token Railroad
Token ID36289
Feature activation+8.26e-01

Pos logit contributions
 Strait+1.104
 JO+1.060
atos+1.034
natureconservancy+0.996
iao+0.984
Neg logit contributions
hist-1.473
ham-1.436
quite-1.311
endars-1.304
nda-1.303
Token Company
Token ID5834
Feature activation-4.47e-01

Pos logit contributions
 Strait+0.395
 JO+0.376
atos+0.365
natureconservancy+0.348
perse+0.342
Neg logit contributions
hist-0.774
ham-0.756
quite-0.701
endars-0.698
nda-0.698
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
hist+0.349
ham+0.343
quite+0.308
nda+0.307
aic+0.305
Neg logit contributions
 Strait-0.283
atos-0.271
 JO-0.271
natureconservancy-0.264
perse-0.260
Token
Token ID198
Feature activation-1.40e+00

Pos logit contributions
hist+0.779
ham+0.761
quite+0.669
nda+0.663
aic+0.657
Neg logit contributions
 Strait-0.872
atos-0.841
 JO-0.838
natureconservancy-0.818
perse-0.812
Token
Token ID198
Feature activation-2.32e+00

Pos logit contributions
hist+0.183
ham+0.179
quite+0.160
nda+0.160
aic+0.158
Neg logit contributions
 Strait-0.166
 JO-0.159
atos-0.159
natureconservancy-0.154
perse-0.152
Token
Token ID198
Feature activation-2.50e+00

Pos logit contributions
hist+1.511
ham+1.477
quite+1.286
nda+1.272
aic+1.255
Neg logit contributions
 Strait-1.782
atos-1.727
 JO-1.715
natureconservancy-1.682
perse-1.671
TokenCalifornia
Token ID25284
Feature activation-4.96e+00

Pos logit contributions
hist+1.893
ham+1.848
quite+1.662
nda+1.636
aic+1.630
Neg logit contributions
 Strait-1.666
 JO-1.605
atos-1.580
natureconservancy-1.567
perse-1.535
Token v
Token ID410
Feature activation+3.62e-01

Pos logit contributions
hist+3.085
ham+3.040
nda+2.647
quite+2.616
aic+2.604
Neg logit contributions
 Strait-3.810
natureconservancy-3.764
perse-3.709
atos-3.703
 JO-3.661
Token.
Token ID13
Feature activation+5.11e+00

Pos logit contributions
 Strait+0.250
 JO+0.240
atos+0.240
natureconservancy+0.233
perse+0.231
Neg logit contributions
hist-0.262
ham-0.257
quite-0.229
nda-0.228
aic-0.226
Token Southern
Token ID8050
Feature activation+1.40e+01

Pos logit contributions
 Strait+3.285
 JO+3.125
atos+3.025
iao+2.843
natureconservancy+2.816
Neg logit contributions
hist-3.973
ham-3.870
endars-3.574
yip-3.563
quite-3.538
Token Pacific
Token ID8211
Feature activation+3.01e+00

Pos logit contributions
 Strait+4.872
 JO+4.120
atos+3.683
iao+3.411
obal+3.128
Neg logit contributions
hist-13.870
ham-13.447
endars-13.099
epad-13.040
quite-12.736
Token Railroad
Token ID36289
Feature activation-8.93e-01

Pos logit contributions
 Strait+1.812
 JO+1.728
atos+1.677
iao+1.605
natureconservancy+1.599
Neg logit contributions
hist-2.456
ham-2.386
quite-2.191
endars-2.184
nda-2.169
Token Company
Token ID5834
Feature activation-2.72e+00

Pos logit contributions
hist+0.811
ham+0.796
quite+0.729
nda+0.726
endars+0.722
Neg logit contributions
 Strait-0.454
 JO-0.430
atos-0.428
natureconservancy-0.411
perse-0.405
Token
Token ID198
Feature activation-2.02e+00

Pos logit contributions
hist+1.895
ham+1.865
quite+1.647
nda+1.646
aic+1.632
Neg logit contributions
 Strait-1.922
atos-1.870
natureconservancy-1.866
 JO-1.848
perse-1.825
Token
Token ID198
Feature activation-2.08e+00

Pos logit contributions
hist+1.330
ham+1.300
quite+1.134
nda+1.123
aic+1.109
Neg logit contributions
 Strait-1.544
atos-1.494
 JO-1.485
natureconservancy-1.455
perse-1.444
Token //
Token ID3373
Feature activation+3.16e-01

Pos logit contributions
 Strait+0.803
 JO+0.774
atos+0.763
perse+0.722
natureconservancy+0.716
Neg logit contributions
hist-0.967
ham-0.950
quite-0.857
nda-0.851
aic-0.849
Token Time
Token ID3862
Feature activation+3.17e+00

Pos logit contributions
 Strait+0.185
 JO+0.176
atos+0.174
natureconservancy+0.166
perse+0.165
Neg logit contributions
hist-0.263
ham-0.259
quite-0.236
nda-0.235
aic-0.234
Token component
Token ID7515
Feature activation+7.50e+00

Pos logit contributions
 Strait+2.042
 JO+1.963
atos+1.920
perse+1.856
natureconservancy+1.854
Neg logit contributions
hist-2.433
ham-2.377
endars-2.161
quite-2.160
nda-2.160
Token const
Token ID1500
Feature activation+3.36e+00

Pos logit contributions
 Strait+4.738
 JO+4.582
atos+4.389
natureconservancy+4.118
perse+4.117
Neg logit contributions
hist-5.686
ham-5.581
yip-5.226
endars-5.169
quite-5.146
Token float
Token ID12178
Feature activation+1.09e+01

Pos logit contributions
 Strait+1.849
 JO+1.760
atos+1.714
natureconservancy+1.628
perse+1.618
Neg logit contributions
hist-2.917
ham-2.876
quite-2.626
endars-2.612
nda-2.612
Token TA
Token ID21664
Feature activation+1.40e+01

Pos logit contributions
 JO+5.148
 Strait+4.938
atos+4.693
 ENT+4.219
 INTO+4.100
Neg logit contributions
hist-10.199
ham-9.836
yip-9.260
endars-9.213
nda-9.073
TokenU
Token ID52
Feature activation+1.01e+01

Pos logit contributions
 JO+5.467
 Strait+5.368
atos+4.971
IDER+4.909
OULD+4.877
Neg logit contributions
hist-12.975
endars-12.485
ham-12.454
yip-12.324
antine-11.793
Token =
Token ID796
Feature activation+5.11e-01

Pos logit contributions
 JO+2.812
 Strait+2.677
atos+2.269
iao+2.175
natureconservancy+2.127
Neg logit contributions
hist-11.081
ham-10.818
endars-10.768
yip-10.598
quite-10.323
Token 6
Token ID718
Feature activation+1.38e+00

Pos logit contributions
 Strait+0.379
 JO+0.366
atos+0.363
natureconservancy+0.352
perse+0.349
Neg logit contributions
hist-0.345
ham-0.337
quite-0.299
nda-0.298
aic-0.296
Token.
Token ID13
Feature activation+7.98e+00

Pos logit contributions
 Strait+0.907
 JO+0.873
atos+0.865
natureconservancy+0.840
perse+0.829
Neg logit contributions
hist-1.043
ham-1.019
quite-0.917
nda-0.913
endars-0.913
Token283
Token ID30290
Feature activation+5.83e+00

Pos logit contributions
 Strait+3.586
 JO+3.555
atos+3.348
perse+3.168
iao+3.152
Neg logit contributions
hist-7.310
ham-7.068
endars-6.748
yip-6.657
nda-6.640
TokenSanta
Token ID42694
Feature activation+4.70e+00

Pos logit contributions
 Strait+0.074
 JO+0.071
atos+0.071
natureconservancy+0.069
perse+0.068
Neg logit contributions
hist-0.082
ham-0.080
quite-0.072
nda-0.071
endars-0.071
Token Clara
Token ID27443
Feature activation-2.68e+00

Pos logit contributions
 Strait+3.118
 JO+3.008
atos+2.964
natureconservancy+2.831
perse+2.831
Neg logit contributions
hist-3.506
ham-3.414
quite-3.081
aic-3.078
epad-3.075
Token County
Token ID3418
Feature activation-6.89e+00

Pos logit contributions
hist+1.935
ham+1.908
nda+1.698
quite+1.690
aic+1.679
Neg logit contributions
 Strait-1.828
atos-1.769
natureconservancy-1.748
 JO-1.736
perse-1.726
Token v
Token ID410
Feature activation+3.22e+00

Pos logit contributions
hist+4.635
ham+4.621
nda+4.144
quite+4.026
ulet+3.966
Neg logit contributions
 Strait-4.842
atos-4.744
natureconservancy-4.720
perse-4.682
 JO-4.475
Token.
Token ID13
Feature activation+7.66e+00

Pos logit contributions
 Strait+2.030
 JO+1.974
atos+1.930
iao+1.838
natureconservancy+1.832
Neg logit contributions
hist-2.508
ham-2.441
quite-2.241
endars-2.208
yip-2.205
Token Southern
Token ID8050
Feature activation+1.40e+01

Pos logit contributions
 Strait+5.207
 JO+5.072
atos+4.787
iao+4.553
natureconservancy+4.501
Neg logit contributions
hist-5.506
ham-5.294
yip-4.944
endars-4.941
quite-4.889
Token Pacific
Token ID8211
Feature activation+3.99e+00

Pos logit contributions
 Strait+8.523
 JO+8.200
atos+7.682
iao+7.540
obal+7.129
Neg logit contributions
hist-9.994
ham-9.541
epad-9.090
yip-9.062
endars-8.913
Token Railroad
Token ID36289
Feature activation+4.57e+00

Pos logit contributions
 Strait+2.536
 JO+2.413
atos+2.354
iao+2.257
natureconservancy+2.249
Neg logit contributions
hist-3.140
ham-3.066
quite-2.775
endars-2.765
yip-2.748
Token Company
Token ID5834
Feature activation+1.50e+00

Pos logit contributions
 Strait+2.934
 JO+2.831
atos+2.720
iao+2.615
natureconservancy+2.612
Neg logit contributions
hist-3.514
ham-3.399
quite-3.122
endars-3.101
yip-3.092
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 Strait+0.987
 JO+0.950
atos+0.938
natureconservancy+0.905
perse+0.897
Neg logit contributions
hist-1.139
ham-1.112
quite-1.003
nda-0.996
endars-0.994
Token 118
Token ID19035
Feature activation-1.49e+00

Pos logit contributions
hist+1.213
ham+1.188
quite+1.050
nda+1.048
aic+1.031
Neg logit contributions
 Strait-1.319
atos-1.278
natureconservancy-1.267
 JO-1.260
perse-1.245
Token
Token ID198
Feature activation-1.47e+00

Pos logit contributions
hist+1.316
ham+1.290
quite+1.123
nda+1.114
aic+1.103
Neg logit contributions
 Strait-1.519
atos-1.471
 JO-1.461
natureconservancy-1.430
perse-1.424
TokenWe
Token ID1135
Feature activation-1.06e+00

Pos logit contributions
hist+1.073
ham+1.052
quite+0.937
nda+0.932
aic+0.923
Neg logit contributions
 Strait-1.020
 JO-0.978
atos-0.972
natureconservancy-0.947
perse-0.936
Token value
Token ID1988
Feature activation+7.25e+00

Pos logit contributions
hist+0.727
ham+0.713
quite+0.630
nda+0.623
aic+0.618
Neg logit contributions
 Strait-0.776
atos-0.746
 JO-0.745
natureconservancy-0.732
perse-0.722
Token local
Token ID1957
Feature activation+7.52e+00

Pos logit contributions
 Strait+4.299
 JO+4.066
atos+3.968
natureconservancy+3.900
iao+3.812
Neg logit contributions
hist-5.983
ham-5.832
quite-5.375
endars-5.329
nda-5.328
Token independent
Token ID4795
Feature activation+4.05e+00

Pos logit contributions
 Strait+2.744
 JO+2.584
atos+2.415
natureconservancy+2.357
iao+2.315
Neg logit contributions
hist-7.873
ham-7.648
quite-7.236
yip-7.190
endars-7.151
Token journalism
Token ID14033
Feature activation+1.39e+01

Pos logit contributions
 Strait+2.642
 JO+2.508
atos+2.447
iao+2.381
natureconservancy+2.369
Neg logit contributions
hist-3.095
ham-3.035
quite-2.770
yip-2.759
endars-2.747
Token.
Token ID13
Feature activation+6.00e+00

Pos logit contributions
 Strait+6.711
 JO+6.628
atos+6.520
natureconservancy+6.504
iao+6.409
Neg logit contributions
hist-11.915
ham-11.819
endars-11.133
yip-11.129
ulet-11.042
Token We
Token ID775
Feature activation+2.69e+00

Pos logit contributions
 Strait+3.374
 JO+3.266
atos+3.141
iao+2.982
natureconservancy+2.977
Neg logit contributions
hist-5.123
ham-4.969
endars-4.667
quite-4.586
aic-4.584
Token hope
Token ID2911
Feature activation+1.90e+00

Pos logit contributions
 Strait+1.964
 JO+1.902
atos+1.870
perse+1.800
iao+1.793
Neg logit contributions
hist-1.855
ham-1.805
nda-1.614
endars-1.608
quite-1.605
Token you
Token ID345
Feature activation+9.47e-01

Pos logit contributions
 Strait+1.576
atos+1.528
 JO+1.525
natureconservancy+1.492
perse+1.477
Neg logit contributions
hist-1.119
ham-1.091
quite-0.945
nda-0.943
aic-0.929
Token do
Token ID466
Feature activation-1.72e-01

Pos logit contributions
 Strait+0.632
atos+0.613
natureconservancy+0.606
 JO+0.604
perse+0.596
Neg logit contributions
hist-0.705
ham-0.683
nda-0.620
quite-0.617
aic-0.609
Token common
Token ID2219
Feature activation-7.01e+00

Pos logit contributions
hist+2.136
ham+2.099
quite+1.885
aic+1.860
nda+1.848
Neg logit contributions
 Strait-1.563
atos-1.530
natureconservancy-1.517
 JO-1.501
perse-1.459
Token language
Token ID3303
Feature activation+2.25e-01

Pos logit contributions
ham+3.881
hist+3.854
aic+3.260
ulet+3.169
quite+3.158
Neg logit contributions
 Strait-5.774
natureconservancy-5.736
atos-5.608
 JO-5.585
perse-5.540
Token with
Token ID351
Feature activation-1.34e+00

Pos logit contributions
 Strait+0.153
 JO+0.147
atos+0.147
natureconservancy+0.143
perse+0.141
Neg logit contributions
hist-0.166
ham-0.162
quite-0.146
nda-0.143
aic-0.143
Token the
Token ID262
Feature activation-6.01e-01

Pos logit contributions
hist+1.010
ham+0.990
quite+0.888
aic+0.876
nda+0.872
Neg logit contributions
 Strait-0.869
 JO-0.844
atos-0.842
natureconservancy-0.833
perse-0.812
Token s
Token ID264
Feature activation+5.16e+00

Pos logit contributions
hist+0.467
ham+0.455
quite+0.409
aic+0.406
nda+0.406
Neg logit contributions
 Strait-0.382
atos-0.370
 JO-0.368
natureconservancy-0.361
perse-0.358
Tokenouther
Token ID44262
Feature activation+1.39e+01

Pos logit contributions
 Strait+2.606
atos+2.488
 JO+2.442
natureconservancy+2.336
perse+2.316
Neg logit contributions
hist-4.638
ham-4.535
quite-4.177
nda-4.137
endars-4.135
Tokenners
Token ID2741
Feature activation+1.58e+00

Pos logit contributions
 Strait+5.083
 JO+4.692
atos+4.454
perse+4.353
iao+4.234
Neg logit contributions
hist-12.852
ham-12.570
epad-12.226
Reloaded-12.107
endars-11.978
Token.
Token ID13
Feature activation+4.88e+00

Pos logit contributions
 Strait+1.048
 JO+1.004
atos+1.002
natureconservancy+0.969
perse+0.961
Neg logit contributions
hist-1.189
ham-1.163
nda-1.042
quite-1.041
endars-1.034
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
 Strait+3.149
 JO+3.009
atos+2.936
perse+2.777
natureconservancy+2.774
Neg logit contributions
hist-3.731
ham-3.646
yip-3.370
nda-3.343
endars-3.342
Token
Token ID198
Feature activation+2.09e+00

Pos logit contributions
 Strait+1.297
 JO+1.249
atos+1.238
natureconservancy+1.197
perse+1.192
Neg logit contributions
hist-1.332
ham-1.303
endars-1.173
nda-1.171
quite-1.170
TokenAnd
Token ID1870
Feature activation+2.87e+00

Pos logit contributions
 Strait+1.368
 JO+1.313
atos+1.312
natureconservancy+1.268
perse+1.257
Neg logit contributions
hist-1.576
ham-1.543
nda-1.389
quite-1.386
endars-1.381
Token story
Token ID1621
Feature activation+1.17e+01

Pos logit contributions
 Strait+3.679
 JO+3.489
atos+3.293
iao+3.282
natureconservancy+3.215
Neg logit contributions
hist-6.923
ham-6.822
endars-6.470
yip-6.415
quite-6.373
Token:
Token ID25
Feature activation+1.71e+00

Pos logit contributions
 Strait+4.995
 JO+4.729
natureconservancy+4.601
atos+4.579
iao+4.375
Neg logit contributions
hist-10.593
ham-10.315
endars-10.157
yip-10.141
antine-9.683
Token http
Token ID2638
Feature activation+7.23e+00

Pos logit contributions
 Strait+1.108
 JO+1.062
atos+1.059
natureconservancy+1.027
perse+1.016
Neg logit contributions
hist-1.317
ham-1.289
quite-1.159
nda-1.154
endars-1.147
Token://
Token ID1378
Feature activation+6.76e+00

Pos logit contributions
 Strait+3.015
natureconservancy+2.943
 JO+2.852
atos+2.847
perse+2.677
Neg logit contributions
hist-6.856
ham-6.754
yip-6.516
endars-6.451
antine-6.306
Tokenusat
Token ID37937
Feature activation+9.66e+00

Pos logit contributions
 Strait+3.417
atos+3.260
perse+3.243
 JO+3.152
natureconservancy+3.091
Neg logit contributions
hist-5.740
ham-5.642
antine-5.258
endars-5.207
yip-5.199
Token.
Token ID13
Feature activation+1.38e+01

Pos logit contributions
 Strait+5.156
 JO+4.949
natureconservancy+4.910
atos+4.894
perse+4.668
Neg logit contributions
hist-7.799
ham-7.661
yip-7.626
endars-7.608
nda-7.166
Tokenly
Token ID306
Feature activation+6.18e+00

Pos logit contributions
perse+9.743
atos+9.630
 Strait+9.432
iao+9.292
 JO+8.989
Neg logit contributions
hist-7.821
Reloaded-7.481
endars-7.383
ham-7.275
antine-7.110
Token/
Token ID14
Feature activation+3.82e+00

Pos logit contributions
 Strait+3.443
atos+3.248
 JO+3.230
natureconservancy+3.138
perse+3.084
Neg logit contributions
hist-5.169
ham-4.977
endars-4.800
yip-4.777
quite-4.646
Token1
Token ID16
Feature activation+4.21e+00

Pos logit contributions
 Strait+1.875
atos+1.793
 JO+1.786
natureconservancy+1.727
perse+1.696
Neg logit contributions
hist-3.431
ham-3.368
endars-3.104
quite-3.099
nda-3.079
Tokenf
Token ID69
Feature activation+7.60e-01

Pos logit contributions
 Strait+3.106
atos+3.056
 JO+3.032
natureconservancy+2.929
iao+2.925
Neg logit contributions
hist-2.700
ham-2.617
endars-2.341
quite-2.337
antine-2.305
TokenH
Token ID39
Feature activation-6.09e-01

Pos logit contributions
 Strait+0.519
 JO+0.500
atos+0.499
natureconservancy+0.484
perse+0.479
Neg logit contributions
hist-0.554
ham-0.542
quite-0.485
nda-0.482
endars-0.481
Token of
Token ID286
Feature activation-8.61e+00

Pos logit contributions
hist+0.225
ham+0.220
quite+0.198
nda+0.197
endars+0.197
Neg logit contributions
 Strait-0.188
 JO-0.181
atos-0.180
natureconservancy-0.174
perse-0.172
Token the
Token ID262
Feature activation-4.33e+00

Pos logit contributions
hist+5.829
ham+5.655
quite+5.046
nda+5.041
aic+4.834
Neg logit contributions
 Strait-6.059
atos-5.836
natureconservancy-5.794
perse-5.610
 JO-5.572
Token most
Token ID749
Feature activation-5.21e+00

Pos logit contributions
hist+2.860
ham+2.771
nda+2.441
quite+2.438
aic+2.378
Neg logit contributions
 Strait-3.246
atos-3.129
natureconservancy-3.074
perse-3.054
 JO-3.033
Token,
Token ID11
Feature activation-6.81e+00

Pos logit contributions
hist+3.609
ham+3.491
quite+3.041
nda+3.040
aic+2.987
Neg logit contributions
 Strait-3.743
atos-3.640
natureconservancy-3.543
 JO-3.510
perse-3.453
Token if
Token ID611
Feature activation-9.49e+00

Pos logit contributions
hist+5.191
ham+5.029
quite+4.579
yip+4.454
nda+4.437
Neg logit contributions
 Strait-4.284
atos-4.184
natureconservancy-4.165
 JO-4.034
perse-3.894
Token not
Token ID407
Feature activation-1.72e+01

Pos logit contributions
hist+6.305
ham+6.011
quite+5.502
nda+5.447
aic+5.235
Neg logit contributions
 Strait-6.709
natureconservancy-6.574
 JO-6.500
atos-6.460
iao-6.332
Token,
Token ID11
Feature activation-6.23e+00

Pos logit contributions
hist+9.741
ham+9.284
quite+8.665
nda+8.395
had+8.073
Neg logit contributions
 Strait-12.058
natureconservancy-11.662
atos-11.129
reddits-11.115
iao-10.980
Token the
Token ID262
Feature activation-5.60e+00

Pos logit contributions
hist+4.427
ham+4.253
nda+3.795
quite+3.789
aic+3.689
Neg logit contributions
 Strait-4.317
atos-4.163
natureconservancy-4.081
 JO-4.027
perse-3.989
Token most
Token ID749
Feature activation-4.80e+00

Pos logit contributions
hist+3.277
ham+3.144
quite+2.726
nda+2.720
aic+2.651
Neg logit contributions
 Strait-4.588
atos-4.431
natureconservancy-4.344
perse-4.316
 JO-4.309
Token Progressive
Token ID25852
Feature activation+3.67e-01

Pos logit contributions
hist+3.331
ham+3.235
quite+2.831
nda+2.831
aic+2.770
Neg logit contributions
 Strait-3.433
atos-3.335
natureconservancy-3.265
 JO-3.241
perse-3.188
Token politician
Token ID14971
Feature activation+4.61e-02

Pos logit contributions
 Strait+0.233
 JO+0.223
atos+0.222
natureconservancy+0.216
perse+0.213
Neg logit contributions
hist-0.287
ham-0.281
quite-0.253
nda-0.252
endars-0.250
Token notification
Token ID14483
Feature activation+7.38e-01

Pos logit contributions
hist+2.467
ham+2.419
quite+2.148
nda+2.148
epad+2.125
Neg logit contributions
 Strait-2.354
atos-2.246
 JO-2.221
natureconservancy-2.197
perse-2.182
Token.
Token ID13
Feature activation-9.20e-01

Pos logit contributions
 Strait+0.486
 JO+0.466
atos+0.465
natureconservancy+0.451
perse+0.446
Neg logit contributions
hist-0.559
ham-0.547
quite-0.491
nda-0.489
endars-0.486
Token From
Token ID3574
Feature activation-5.42e+00

Pos logit contributions
hist+0.674
ham+0.660
quite+0.591
nda+0.588
aic+0.583
Neg logit contributions
 Strait-0.626
atos-0.600
 JO-0.598
natureconservancy-0.585
perse-0.580
Token the
Token ID262
Feature activation-2.60e+00

Pos logit contributions
hist+3.770
ham+3.713
quite+3.324
epad+3.315
nda+3.288
Neg logit contributions
 Strait-3.742
atos-3.576
natureconservancy-3.568
 JO-3.542
perse-3.522
Token Kodi
Token ID44129
Feature activation-9.74e+00

Pos logit contributions
hist+1.815
ham+1.786
quite+1.589
aic+1.574
nda+1.572
Neg logit contributions
 Strait-1.839
atos-1.761
 JO-1.746
natureconservancy-1.733
perse-1.718
Token home
Token ID1363
Feature activation-1.61e+01

Pos logit contributions
ham+5.263
hist+5.207
endars+4.451
aic+4.390
quite+4.328
Neg logit contributions
 Strait-7.827
natureconservancy-7.741
atos-7.631
perse-7.596
 JO-7.439
Token screen
Token ID3159
Feature activation-5.35e+00

Pos logit contributions
ham+5.795
hist+5.666
endars+4.428
Page+4.423
epad+4.380
Neg logit contributions
 Strait-14.654
atos-14.334
 JO-14.276
natureconservancy-14.247
perse-14.221
Token,
Token ID11
Feature activation-3.90e+00

Pos logit contributions
hist+3.939
ham+3.882
nda+3.464
quite+3.453
endars+3.452
Neg logit contributions
 Strait-3.525
atos-3.353
 JO-3.299
natureconservancy-3.277
perse-3.190
Token choose
Token ID3853
Feature activation+8.94e-02

Pos logit contributions
hist+3.103
ham+3.067
quite+2.747
nda+2.735
aic+2.724
Neg logit contributions
 Strait-2.366
atos-2.263
 JO-2.194
natureconservancy-2.190
perse-2.159
Token
Token ID564
Feature activation+1.69e+00

Pos logit contributions
 Strait+0.069
 JO+0.066
atos+0.066
natureconservancy+0.064
perse+0.064
Neg logit contributions
hist-0.058
ham-0.057
quite-0.050
nda-0.050
endars-0.049
Token
Token ID246
Feature activation+6.37e-01

Pos logit contributions
 Strait+0.731
 JO+0.686
atos+0.685
natureconservancy+0.654
perse+0.641
Neg logit contributions
hist-1.662
ham-1.634
quite-1.505
nda-1.499
endars-1.495
Token offered
Token ID4438
Feature activation-2.02e+00

Pos logit contributions
hist+0.652
ham+0.636
quite+0.569
nda+0.566
aic+0.561
Neg logit contributions
 Strait-0.591
atos-0.571
 JO-0.566
natureconservancy-0.555
perse-0.545
Token a
Token ID257
Feature activation-4.33e+00

Pos logit contributions
hist+1.505
ham+1.462
quite+1.311
nda+1.299
aic+1.293
Neg logit contributions
 Strait-1.371
atos-1.318
 JO-1.297
natureconservancy-1.272
perse-1.248
Token more
Token ID517
Feature activation-5.77e+00

Pos logit contributions
hist+3.203
ham+3.082
quite+2.747
nda+2.709
aic+2.682
Neg logit contributions
 Strait-2.934
atos-2.870
 JO-2.778
natureconservancy-2.755
perse-2.686
Token gentle
Token ID10296
Feature activation-1.42e+00

Pos logit contributions
hist+4.331
ham+4.192
aic+3.691
quite+3.688
epad+3.677
Neg logit contributions
 Strait-3.793
atos-3.696
 JO-3.583
natureconservancy-3.485
iao-3.467
Token,
Token ID11
Feature activation-7.57e+00

Pos logit contributions
hist+1.039
ham+1.014
quite+0.902
nda+0.900
aic+0.893
Neg logit contributions
 Strait-0.977
atos-0.938
 JO-0.936
natureconservancy-0.915
perse-0.899
Token though
Token ID996
Feature activation-1.57e+01

Pos logit contributions
hist+5.454
ham+5.253
quite+4.603
epad+4.556
aic+4.545
Neg logit contributions
 Strait-5.064
atos-4.934
 JO-4.783
iao-4.673
natureconservancy-4.625
Token often
Token ID1690
Feature activation-1.09e+01

Pos logit contributions
hist+10.115
ham+9.190
quite+8.413
eer+8.005
epad+7.928
Neg logit contributions
 Strait-10.612
atos-10.513
natureconservancy-10.179
 JO-10.043
iao-9.820
Token ineffective
Token ID23693
Feature activation-2.56e+00

Pos logit contributions
hist+7.444
ham+6.996
quite+6.207
epad+6.026
aic+5.984
Neg logit contributions
 Strait-7.609
atos-7.347
 JO-7.170
natureconservancy-7.159
iao-7.052
Token,
Token ID11
Feature activation-3.30e+00

Pos logit contributions
hist+1.946
ham+1.900
quite+1.697
nda+1.686
aic+1.681
Neg logit contributions
 Strait-1.676
atos-1.606
 JO-1.596
natureconservancy-1.567
perse-1.532
Token approach
Token ID3164
Feature activation-2.64e+00

Pos logit contributions
hist+2.418
ham+2.349
quite+2.086
aic+2.072
nda+2.067
Neg logit contributions
 Strait-2.247
atos-2.165
 JO-2.142
natureconservancy-2.115
perse-2.068
Token to
Token ID284
Feature activation-2.23e+00

Pos logit contributions
hist+1.933
ham+1.892
quite+1.690
nda+1.681
aic+1.671
Neg logit contributions
 Strait-1.800
atos-1.724
 JO-1.711
natureconservancy-1.692
perse-1.657
Token receives
Token ID11583
Feature activation+5.56e+00

Pos logit contributions
 Strait+2.687
 JO+2.599
atos+2.561
perse+2.443
natureconservancy+2.441
Neg logit contributions
hist-3.016
ham-2.945
aic-2.649
nda-2.646
endars-2.637
Token secret
Token ID3200
Feature activation+5.88e+00

Pos logit contributions
 Strait+3.516
 JO+3.443
atos+3.391
perse+3.200
natureconservancy+3.197
Neg logit contributions
hist-4.246
ham-4.112
endars-3.730
quite-3.706
nda-3.696
Token voices
Token ID10839
Feature activation+1.74e+00

Pos logit contributions
 Strait+4.213
 JO+4.135
atos+4.133
perse+3.906
natureconservancy+3.898
Neg logit contributions
hist-3.901
ham-3.807
quite-3.408
yip-3.394
epad-3.375
Token in
Token ID287
Feature activation-1.53e+00

Pos logit contributions
 Strait+1.108
 JO+1.062
atos+1.058
natureconservancy+1.022
perse+1.012
Neg logit contributions
hist-1.363
ham-1.333
quite-1.201
nda-1.196
endars-1.190
Token his
Token ID465
Feature activation-5.00e-01

Pos logit contributions
hist+1.309
ham+1.284
quite+1.168
nda+1.159
endars+1.155
Neg logit contributions
 Strait-0.857
 JO-0.813
atos-0.809
natureconservancy-0.790
perse-0.773
Token or
Token ID393
Feature activation-1.57e+01

Pos logit contributions
hist+0.395
ham+0.387
quite+0.349
nda+0.348
aic+0.345
Neg logit contributions
 Strait-0.313
 JO-0.299
atos-0.298
natureconservancy-0.291
perse-0.287
Token her
Token ID607
Feature activation-1.27e+00

Pos logit contributions
ham+6.498
hist+6.174
epad+5.298
ulet+5.167
quite+5.070
Neg logit contributions
 Strait-13.548
atos-13.503
-13.491
ileged-13.427
龍契士-13.350
Token head
Token ID1182
Feature activation-3.12e+00

Pos logit contributions
hist+1.010
ham+0.989
quite+0.892
nda+0.889
endars+0.882
Neg logit contributions
 Strait-0.784
 JO-0.747
atos-0.746
natureconservancy-0.732
perse-0.722
Token,
Token ID11
Feature activation+7.49e-01

Pos logit contributions
hist+2.370
ham+2.327
quite+2.091
nda+2.070
aic+2.048
Neg logit contributions
 Strait-2.027
atos-1.935
natureconservancy-1.926
 JO-1.908
perse-1.878
Token and
Token ID290
Feature activation+3.97e+00

Pos logit contributions
 Strait+0.465
 JO+0.445
atos+0.443
natureconservancy+0.430
perse+0.424
Neg logit contributions
hist-0.596
ham-0.584
quite-0.527
nda-0.525
endars-0.521
Token must
Token ID1276
Feature activation+3.52e+00

Pos logit contributions
 Strait+2.552
 JO+2.450
atos+2.430
perse+2.312
natureconservancy+2.298
Neg logit contributions
hist-3.048
ham-2.977
quite-2.679
aic-2.677
endars-2.671
Token and
Token ID290
Feature activation-4.84e+00

Pos logit contributions
hist+0.586
ham+0.574
quite+0.517
nda+0.515
endars+0.512
Neg logit contributions
 Strait-0.487
 JO-0.466
atos-0.465
natureconservancy-0.450
perse-0.446
Token yellow
Token ID7872
Feature activation-1.72e+00

Pos logit contributions
hist+3.353
ham+3.314
quite+2.914
nda+2.881
aic+2.876
Neg logit contributions
 Strait-3.345
atos-3.301
 JO-3.283
natureconservancy-3.251
perse-3.148
Token (
Token ID357
Feature activation-8.53e+00

Pos logit contributions
hist+1.294
ham+1.269
quite+1.130
nda+1.124
aic+1.123
Neg logit contributions
 Strait-1.145
 JO-1.100
atos-1.095
natureconservancy-1.077
perse-1.052
Token2011
Token ID9804
Feature activation+4.50e-01

Pos logit contributions
hist+5.222
ham+5.123
quite+4.404
aic+4.281
nda+4.245
Neg logit contributions
 Strait-6.821
 JO-6.675
atos-6.420
natureconservancy-6.375
ileged-6.181
Token)
Token ID8
Feature activation-5.13e+00

Pos logit contributions
 Strait+0.249
 JO+0.236
atos+0.234
natureconservancy+0.224
perse+0.221
Neg logit contributions
hist-0.389
ham-0.380
quite-0.348
nda-0.346
endars-0.345
Token [
Token ID685
Feature activation-1.55e+01

Pos logit contributions
hist+3.439
ham+3.370
quite+2.945
nda+2.933
aic+2.933
Neg logit contributions
 Strait-3.743
atos-3.623
 JO-3.622
natureconservancy-3.552
perse-3.524
Token12
Token ID1065
Feature activation+3.03e+00

Pos logit contributions
hist+9.251
ham+8.930
quite+7.638
had+7.433
nda+7.375
Neg logit contributions
 Strait-11.351
 JO-11.131
atos-10.837
natureconservancy-10.455
 transitions-10.341
Token]
Token ID60
Feature activation-4.36e+00

Pos logit contributions
 Strait+1.365
 JO+1.263
atos+1.256
natureconservancy+1.184
perse+1.167
Neg logit contributions
hist-2.905
ham-2.834
quite-2.649
endars-2.639
yip-2.626
Token
Token ID198
Feature activation-4.50e+00

Pos logit contributions
hist+2.924
ham+2.859
quite+2.510
nda+2.494
aic+2.483
Neg logit contributions
 Strait-3.209
 JO-3.111
atos-3.102
natureconservancy-3.055
perse-2.999
Token
Token ID198
Feature activation-7.19e+00

Pos logit contributions
hist+2.741
ham+2.688
quite+2.285
nda+2.244
laws+2.207
Neg logit contributions
 Strait-3.616
atos-3.529
 JO-3.480
natureconservancy-3.439
perse-3.425
Token$
Token ID3
Feature activation-5.71e-01

Pos logit contributions
hist+4.636
ham+4.502
quite+3.935
aic+3.807
nda+3.801
Neg logit contributions
 Strait-5.586
 JO-5.395
atos-5.232
natureconservancy-5.199
perse-5.079
Token it
Token ID340
Feature activation+2.61e+00

Pos logit contributions
 Strait+1.314
 JO+1.257
atos+1.253
natureconservancy+1.216
perse+1.201
Neg logit contributions
hist-1.656
ham-1.622
quite-1.462
nda-1.455
aic-1.446
Token's
Token ID338
Feature activation+3.28e+00

Pos logit contributions
 Strait+1.953
 JO+1.880
atos+1.875
natureconservancy+1.824
perse+1.804
Neg logit contributions
hist-1.743
ham-1.700
quite-1.505
nda-1.500
endars-1.492
Token difficult
Token ID2408
Feature activation+4.25e+00

Pos logit contributions
 Strait+2.170
 JO+2.076
atos+2.063
natureconservancy+2.007
perse+1.967
Neg logit contributions
hist-2.453
ham-2.406
endars-2.165
nda-2.160
aic-2.151
Token,
Token ID11
Feature activation-2.31e+00

Pos logit contributions
 Strait+2.651
 JO+2.542
atos+2.525
natureconservancy+2.464
perse+2.433
Neg logit contributions
hist-3.237
ham-3.205
quite-2.917
nda-2.907
endars-2.901
Token if
Token ID611
Feature activation-5.89e+00

Pos logit contributions
hist+1.783
ham+1.762
quite+1.579
aic+1.537
nda+1.536
Neg logit contributions
 Strait-1.459
atos-1.422
 JO-1.405
natureconservancy-1.390
perse-1.356
Token not
Token ID407
Feature activation-1.55e+01

Pos logit contributions
hist+4.113
ham+3.912
quite+3.538
nda+3.505
laws+3.425
Neg logit contributions
 Strait-4.109
 JO-4.056
atos-4.025
natureconservancy-4.002
perse-3.859
Token impossible
Token ID5340
Feature activation-3.30e+00

Pos logit contributions
hist+9.499
ham+9.070
quite+8.393
laws+8.041
eer+8.038
Neg logit contributions
 Strait-10.573
natureconservancy-10.117
perse-10.092
reddits-9.915
iao-9.860
Token,
Token ID11
Feature activation-3.12e+00

Pos logit contributions
hist+2.592
ham+2.537
quite+2.283
nda+2.268
aic+2.254
Neg logit contributions
 Strait-2.077
 JO-1.987
atos-1.977
natureconservancy-1.921
perse-1.890
Token for
Token ID329
Feature activation-2.25e+00

Pos logit contributions
hist+2.314
ham+2.261
quite+2.023
nda+2.009
endars+1.995
Neg logit contributions
 Strait-2.107
 JO-2.020
atos-2.015
natureconservancy-1.958
perse-1.935
Token them
Token ID606
Feature activation+4.32e+00

Pos logit contributions
hist+1.620
ham+1.586
quite+1.407
aic+1.388
nda+1.387
Neg logit contributions
 Strait-1.542
atos-1.488
 JO-1.482
natureconservancy-1.470
perse-1.428
Token to
Token ID284
Feature activation+3.37e+00

Pos logit contributions
 Strait+2.262
atos+2.167
 JO+2.152
perse+2.082
natureconservancy+2.035
Neg logit contributions
hist-3.731
ham-3.672
nda-3.371
yip-3.367
quite-3.366
Token a
Token ID257
Feature activation+4.66e+00

Pos logit contributions
 Strait+2.122
 JO+2.042
atos+2.018
perse+1.935
natureconservancy+1.930
Neg logit contributions
hist-2.512
ham-2.455
endars-2.212
nda-2.207
quite-2.202
Token coma
Token ID33658
Feature activation+7.15e-01

Pos logit contributions
 Strait+2.849
 JO+2.736
atos+2.661
perse+2.542
iao+2.491
Neg logit contributions
hist-3.679
ham-3.618
endars-3.327
nda-3.301
yip-3.266
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
 Strait+0.468
 JO+0.448
atos+0.446
natureconservancy+0.432
perse+0.428
Neg logit contributions
hist-0.546
ham-0.533
quite-0.479
nda-0.477
endars-0.475
Token find
Token ID1064
Feature activation-1.85e+00

Pos logit contributions
hist+0.837
ham+0.822
quite+0.734
nda+0.729
aic+0.724
Neg logit contributions
 Strait-0.779
atos-0.749
 JO-0.745
natureconservancy-0.732
perse-0.719
Token his
Token ID465
Feature activation-6.72e-01

Pos logit contributions
hist+1.347
ham+1.334
nda+1.183
quite+1.178
aic+1.158
Neg logit contributions
 Strait-1.242
atos-1.218
 JO-1.199
natureconservancy-1.193
perse-1.172
Token family
Token ID1641
Feature activation-1.52e+01

Pos logit contributions
hist+0.493
ham+0.484
quite+0.432
nda+0.430
aic+0.425
Neg logit contributions
 Strait-0.455
atos-0.440
 JO-0.438
natureconservancy-0.429
perse-0.426
Token at
Token ID379
Feature activation-5.89e+00

Pos logit contributions
ham+9.715
hist+9.378
laws+8.594
quite+8.448
had+8.428
Neg logit contributions
atos-9.687
perse-9.421
 Strait-9.402
ileged-9.335
 JO-9.318
Token his
Token ID465
Feature activation-1.26e+00

Pos logit contributions
ham+3.883
hist+3.860
nda+3.398
quite+3.379
antine+3.353
Neg logit contributions
 Strait-4.205
natureconservancy-4.085
atos-4.078
perse-4.030
 JO-4.017
Token hospital
Token ID4436
Feature activation-3.54e+00

Pos logit contributions
hist+1.002
ham+0.993
quite+0.895
nda+0.890
aic+0.878
Neg logit contributions
 Strait-0.763
atos-0.740
 JO-0.729
natureconservancy-0.719
perse-0.715
Token bed
Token ID3996
Feature activation-8.06e+00

Pos logit contributions
ham+3.143
hist+3.135
quite+2.887
epad+2.833
nda+2.819
Neg logit contributions
 Strait-1.789
atos-1.709
natureconservancy-1.682
 JO-1.676
perse-1.665
Tokenside
Token ID1589
Feature activation-7.83e+00

Pos logit contributions
ham+5.350
hist+5.169
quite+4.688
epad+4.517
nda+4.483
Neg logit contributions
 Strait-5.850
atos-5.734
perse-5.682
natureconservancy-5.654
 JO-5.528
Token retired
Token ID9880
Feature activation+4.44e-01

Pos logit contributions
 Strait+0.014
atos+0.014
 JO+0.013
natureconservancy+0.013
perse+0.013
Neg logit contributions
hist-0.014
ham-0.014
quite-0.012
nda-0.012
aic-0.012
Token the
Token ID262
Feature activation-4.35e-01

Pos logit contributions
 Strait+0.283
atos+0.270
 JO+0.269
natureconservancy+0.264
perse+0.260
Neg logit contributions
hist-0.346
ham-0.339
quite-0.305
nda-0.303
aic-0.301
Token big
Token ID1263
Feature activation-1.30e+00

Pos logit contributions
hist+0.323
ham+0.317
quite+0.284
nda+0.281
aic+0.279
Neg logit contributions
 Strait-0.292
atos-0.282
 JO-0.277
natureconservancy-0.277
perse-0.273
Token bird
Token ID6512
Feature activation-1.28e+00

Pos logit contributions
hist+0.962
ham+0.949
quite+0.837
nda+0.837
aic+0.831
Neg logit contributions
 Strait-0.875
atos-0.838
natureconservancy-0.831
 JO-0.821
perse-0.813
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
hist+0.936
ham+0.917
quite+0.818
nda+0.806
aic+0.806
Neg logit contributions
 Strait-0.876
atos-0.835
 JO-0.831
natureconservancy-0.822
perse-0.809
Token Home
Token ID5995
Feature activation-1.51e+01

Pos logit contributions
hist+0.897
ham+0.879
quite+0.786
nda+0.778
aic+0.772
Neg logit contributions
 Strait-0.759
atos-0.734
 JO-0.720
natureconservancy-0.714
perse-0.707
Token<|endoftext|>
Token ID50256
Feature activation-6.37e-01

Pos logit contributions
ham+10.010
hist+9.814
Page+9.071
quite+8.634
aic+8.360
Neg logit contributions
atos-9.710
 Strait-9.530
perse-9.349
natureconservancy-9.283
 JO-8.636
Token0
Token ID15
Feature activation-3.75e+00

Pos logit contributions
hist+0.478
ham+0.468
quite+0.419
nda+0.416
aic+0.413
Neg logit contributions
 Strait-0.429
 JO-0.410
atos-0.408
natureconservancy-0.397
perse-0.392
Token
Token ID198
Feature activation+2.21e+00

Pos logit contributions
hist+2.721
ham+2.692
nda+2.399
quite+2.373
aic+2.359
Neg logit contributions
 Strait-2.530
atos-2.428
 JO-2.398
perse-2.375
natureconservancy-2.371
Token
Token ID198
Feature activation+2.53e+00

Pos logit contributions
 Strait+1.528
 JO+1.474
atos+1.458
natureconservancy+1.417
perse+1.400
Neg logit contributions
hist-1.582
ham-1.545
endars-1.404
yip-1.403
quite-1.386
TokenDon
Token ID3987
Feature activation+6.68e-01

Pos logit contributions
 Strait+1.624
 JO+1.566
atos+1.562
natureconservancy+1.507
perse+1.495
Neg logit contributions
hist-1.924
ham-1.881
yip-1.712
endars-1.703
nda-1.701
Token so
Token ID523
Feature activation-1.48e+00

Pos logit contributions
 Strait+0.263
atos+0.252
 JO+0.252
natureconservancy+0.244
perse+0.242
Neg logit contributions
hist-0.311
ham-0.304
quite-0.272
nda-0.271
aic-0.269
Token vastly
Token ID22056
Feature activation-4.49e+00

Pos logit contributions
hist+1.107
ham+1.081
quite+0.959
nda+0.954
aic+0.951
Neg logit contributions
 Strait-0.999
atos-0.964
 JO-0.958
natureconservancy-0.930
perse-0.926
Token different
Token ID1180
Feature activation-5.41e-01

Pos logit contributions
hist+3.000
ham+2.901
nda+2.515
quite+2.513
epad+2.508
Neg logit contributions
 Strait-3.313
atos-3.266
 JO-3.202
natureconservancy-3.171
perse-3.140
Token from
Token ID422
Feature activation+2.10e+00

Pos logit contributions
hist+0.432
ham+0.423
quite+0.381
nda+0.380
aic+0.377
Neg logit contributions
 Strait-0.335
 JO-0.320
atos-0.319
natureconservancy-0.309
perse-0.306
Token his
Token ID465
Feature activation+6.98e+00

Pos logit contributions
 Strait+1.321
 JO+1.267
atos+1.254
natureconservancy+1.211
perse+1.198
Neg logit contributions
hist-1.658
ham-1.622
quite-1.468
nda-1.463
endars-1.458
Token or
Token ID393
Feature activation-1.51e+01

Pos logit contributions
 Strait+4.255
 JO+4.119
atos+3.855
 transitions+3.796
natureconservancy+3.783
Neg logit contributions
hist-5.484
ham-5.292
quite-4.905
endars-4.855
yip-4.821
Token her
Token ID607
Feature activation+7.34e+00

Pos logit contributions
ham+6.160
hist+5.959
epad+5.069
ulet+4.778
nda+4.667
Neg logit contributions
atos-13.379
 Strait-13.043
-13.001
natureconservancy-12.894
perse-12.887
Token offline
Token ID18043
Feature activation-9.81e-01

Pos logit contributions
 Strait+4.529
 JO+4.373
atos+4.065
 transitions+4.049
natureconservancy+3.983
Neg logit contributions
hist-5.727
ham-5.520
quite-5.110
endars-5.076
yip-5.069
Token persona
Token ID27822
Feature activation-5.95e-02

Pos logit contributions
hist+0.745
ham+0.734
quite+0.652
nda+0.649
aic+0.647
Neg logit contributions
 Strait-0.641
atos-0.621
 JO-0.611
natureconservancy-0.601
perse-0.595
Token,
Token ID11
Feature activation+5.10e+00

Pos logit contributions
hist+0.047
ham+0.046
quite+0.042
nda+0.041
aic+0.041
Neg logit contributions
 Strait-0.037
 JO-0.036
atos-0.036
natureconservancy-0.034
perse-0.034
Token it
Token ID340
Feature activation+2.43e+00

Pos logit contributions
 Strait+3.045
 JO+2.930
atos+2.820
natureconservancy+2.702
perse+2.690
Neg logit contributions
hist-4.131
ham-4.051
nda-3.685
endars-3.681
quite-3.677
Token meet
Token ID1826
Feature activation-3.74e+00

Pos logit contributions
hist+3.056
ham+3.037
quite+2.651
aic+2.620
nda+2.619
Neg logit contributions
 Strait-3.566
atos-3.457
 JO-3.449
natureconservancy-3.360
perse-3.322
Token.
Token ID13
Feature activation-3.16e+00

Pos logit contributions
hist+2.551
ham+2.517
quite+2.224
aic+2.162
nda+2.162
Neg logit contributions
 Strait-2.692
natureconservancy-2.605
atos-2.600
 JO-2.581
perse-2.523
Token
Token ID198
Feature activation-1.78e+00

Pos logit contributions
hist+2.165
ham+2.120
quite+1.873
nda+1.863
aic+1.852
Neg logit contributions
 Strait-2.308
 JO-2.224
atos-2.224
natureconservancy-2.169
perse-2.145
Token
Token ID198
Feature activation-9.54e-01

Pos logit contributions
hist+1.169
ham+1.147
quite+0.999
nda+0.989
aic+0.978
Neg logit contributions
 Strait-1.359
atos-1.318
 JO-1.309
natureconservancy-1.282
perse-1.274
TokenM
Token ID44
Feature activation-4.82e+00

Pos logit contributions
hist+0.740
ham+0.726
quite+0.653
nda+0.648
aic+0.645
Neg logit contributions
 Strait-0.613
 JO-0.587
atos-0.584
natureconservancy-0.568
perse-0.561
Tokenort
Token ID419
Feature activation-1.51e+01

Pos logit contributions
hist+3.531
ham+3.473
nda+3.110
quite+3.060
aic+3.049
Neg logit contributions
 Strait-3.246
 JO-3.096
atos-3.090
natureconservancy-3.080
perse-3.031
Tokeng
Token ID70
Feature activation+1.70e+00

Pos logit contributions
ham+7.917
hist+7.267
nda+6.239
aic+6.202
quite+6.066
Neg logit contributions
natureconservancy-12.363
 Strait-12.059
 JO-11.564
atos-11.347
perse-11.321
Tokenages
Token ID1095
Feature activation-9.08e+00

Pos logit contributions
 Strait+0.737
 JO+0.691
atos+0.687
natureconservancy+0.655
perse+0.643
Neg logit contributions
hist-1.669
ham-1.639
quite-1.512
nda-1.507
endars-1.500
Token,
Token ID11
Feature activation-3.79e+00

Pos logit contributions
ham+5.966
hist+5.859
aic+5.119
quite+5.086
laws+5.006
Neg logit contributions
natureconservancy-6.426
 Strait-6.352
atos-6.291
 JO-6.114
perse-6.110
Token rent
Token ID5602
Feature activation-4.51e+00

Pos logit contributions
hist+2.893
ham+2.873
quite+2.553
endars+2.517
nda+2.513
Neg logit contributions
 Strait-2.410
atos-2.352
natureconservancy-2.340
 JO-2.289
perse-2.284
Token,
Token ID11
Feature activation-4.77e+00

Pos logit contributions
hist+3.257
ham+3.248
quite+2.846
aic+2.801
nda+2.797
Neg logit contributions
 Strait-3.127
natureconservancy-2.991
 JO-2.959
atos-2.947
perse-2.864
Token Fire
Token ID3764
Feature activation-2.46e+00

Pos logit contributions
hist+1.589
ham+1.565
quite+1.381
nda+1.376
aic+1.367
Neg logit contributions
 Strait-1.453
atos-1.420
 JO-1.404
natureconservancy-1.394
perse-1.381
Token of
Token ID286
Feature activation-2.89e-01

Pos logit contributions
hist+1.548
ham+1.528
quite+1.300
nda+1.300
endars+1.285
Neg logit contributions
 Strait-1.910
atos-1.864
 JO-1.862
natureconservancy-1.830
perse-1.802
Token Mount
Token ID5628
Feature activation-2.83e+00

Pos logit contributions
hist+0.210
ham+0.206
quite+0.183
nda+0.183
endars+0.181
Neg logit contributions
 Strait-0.199
 JO-0.191
atos-0.191
natureconservancy-0.186
perse-0.184
Token Doom
Token ID18688
Feature activation-6.26e+00

Pos logit contributions
hist+1.937
ham+1.931
quite+1.694
nda+1.675
aic+1.659
Neg logit contributions
 Strait-2.053
natureconservancy-1.983
atos-1.975
 JO-1.968
perse-1.925
Token in
Token ID287
Feature activation+2.61e-01

Pos logit contributions
ham+4.435
hist+4.378
quite+3.835
aic+3.823
nda+3.802
Neg logit contributions
 Strait-4.220
natureconservancy-4.161
atos-4.146
 JO-4.146
perse-4.041
Token Mord
Token ID29548
Feature activation-1.49e+01

Pos logit contributions
 Strait+0.179
 JO+0.172
atos+0.171
natureconservancy+0.166
perse+0.164
Neg logit contributions
hist-0.191
ham-0.186
quite-0.167
nda-0.166
endars-0.165
Tokenor
Token ID273
Feature activation-4.06e+00

Pos logit contributions
ham+5.206
hist+4.671
aic+3.621
hi+3.487
had+3.268
Neg logit contributions
natureconservancy-16.072
龍契士-14.939
perse-14.843
 JO-14.738
 Strait-14.711
Token,
Token ID11
Feature activation+1.72e-01

Pos logit contributions
hist+2.901
ham+2.898
quite+2.530
nda+2.519
aic+2.506
Neg logit contributions
 Strait-2.736
 JO-2.693
atos-2.687
natureconservancy-2.687
perse-2.615
Token where
Token ID810
Feature activation-1.02e+00

Pos logit contributions
 Strait+0.104
 JO+0.099
atos+0.099
natureconservancy+0.096
perse+0.095
Neg logit contributions
hist-0.140
ham-0.137
quite-0.124
nda-0.123
aic-0.122
Token it
Token ID340
Feature activation-2.97e+00

Pos logit contributions
hist+0.807
ham+0.792
quite+0.714
nda+0.710
aic+0.704
Neg logit contributions
 Strait-0.631
 JO-0.605
atos-0.603
natureconservancy-0.590
perse-0.578
Token was
Token ID373
Feature activation-3.71e+00

Pos logit contributions
hist+2.379
ham+2.332
quite+2.089
nda+2.077
endars+2.043
Neg logit contributions
 Strait-1.807
 JO-1.742
atos-1.742
natureconservancy-1.728
perse-1.680
Token
Token ID247
Feature activation+3.28e+00

Pos logit contributions
 Strait+1.648
 JO+1.588
atos+1.577
natureconservancy+1.521
perse+1.504
Neg logit contributions
hist-2.016
ham-1.973
quite-1.774
aic-1.758
nda-1.755
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
 Strait+2.131
 JO+2.050
atos+2.030
natureconservancy+1.941
perse+1.930
Neg logit contributions
hist-2.517
ham-2.460
quite-2.203
endars-2.198
nda-2.195
Token From
Token ID3574
Feature activation-2.87e+00

Pos logit contributions
 Strait+0.248
 JO+0.238
atos+0.237
natureconservancy+0.231
perse+0.228
Neg logit contributions
hist-0.271
ham-0.265
quite-0.237
nda-0.236
endars-0.234
Token the
Token ID262
Feature activation-9.43e-01

Pos logit contributions
hist+2.102
ham+2.069
quite+1.853
nda+1.842
epad+1.833
Neg logit contributions
 Strait-1.935
atos-1.846
 JO-1.837
natureconservancy-1.828
perse-1.803
Token Kodi
Token ID44129
Feature activation-4.92e+00

Pos logit contributions
hist+0.674
ham+0.661
quite+0.589
nda+0.584
endars+0.582
Neg logit contributions
 Strait-0.658
atos-0.632
 JO-0.628
natureconservancy-0.617
perse-0.611
Token home
Token ID1363
Feature activation-1.48e+01

Pos logit contributions
hist+3.212
ham+3.186
quite+2.761
aic+2.747
endars+2.734
Neg logit contributions
 Strait-3.587
atos-3.526
natureconservancy-3.509
perse-3.455
 JO-3.431
Token screen
Token ID3159
Feature activation-5.14e+00

Pos logit contributions
hist+6.735
ham+6.619
Page+5.452
eer+5.319
endars+5.203
Neg logit contributions
 Strait-12.533
perse-12.103
atos-12.034
natureconservancy-12.033
 JO-11.698
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
hist+3.670
ham+3.617
endars+3.224
quite+3.204
aic+3.199
Neg logit contributions
 Strait-3.465
atos-3.289
natureconservancy-3.272
 JO-3.221
perse-3.167
Token select
Token ID2922
Feature activation-1.04e+00

Pos logit contributions
hist+1.001
ham+0.983
quite+0.883
nda+0.878
aic+0.874
Neg logit contributions
 Strait-0.823
atos-0.786
 JO-0.779
natureconservancy-0.764
perse-0.754
Token Add
Token ID3060
Feature activation+7.03e-01

Pos logit contributions
hist+0.764
ham+0.750
quite+0.670
nda+0.668
aic+0.663
Neg logit contributions
 Strait-0.706
atos-0.677
 JO-0.676
natureconservancy-0.660
perse-0.652
Tokenons
Token ID684
Feature activation-5.62e+00

Pos logit contributions
 Strait+0.505
 JO+0.486
atos+0.484
natureconservancy+0.467
perse+0.461
Neg logit contributions
hist-0.492
ham-0.478
quite-0.426
nda-0.423
endars-0.419
Token true
Token ID2081
Feature activation+6.98e-03

Pos logit contributions
 Strait+1.392
 JO+1.335
atos+1.320
natureconservancy+1.286
perse+1.270
Neg logit contributions
hist-1.617
ham-1.584
quite-1.423
nda-1.418
endars-1.411
Token priorities
Token ID15369
Feature activation-2.04e+00

Pos logit contributions
 Strait+0.004
atos+0.004
 JO+0.004
natureconservancy+0.004
perse+0.004
Neg logit contributions
hist-0.005
ham-0.005
quite-0.005
nda-0.005
aic-0.005
Token
Token ID851
Feature activation-4.05e+00

Pos logit contributions
hist+1.489
ham+1.452
quite+1.293
nda+1.282
endars+1.270
Neg logit contributions
 Strait-1.406
atos-1.348
 JO-1.348
natureconservancy-1.322
perse-1.300
Token and
Token ID290
Feature activation-2.49e+00

Pos logit contributions
hist+3.088
ham+3.004
quite+2.715
laws+2.645
endars+2.642
Neg logit contributions
 Strait-2.626
atos-2.515
 JO-2.480
natureconservancy-2.448
perse-2.418
Token his
Token ID465
Feature activation+5.35e+00

Pos logit contributions
hist+1.915
ham+1.872
quite+1.689
nda+1.672
endars+1.657
Neg logit contributions
 Strait-1.596
atos-1.542
 JO-1.514
natureconservancy-1.511
perse-1.454
Token or
Token ID393
Feature activation-1.47e+01

Pos logit contributions
 Strait+3.465
 JO+3.369
atos+3.262
natureconservancy+3.223
perse+3.181
Neg logit contributions
hist-3.973
ham-3.924
yip-3.521
quite-3.521
nda-3.505
Token her
Token ID607
Feature activation+3.84e+00

Pos logit contributions
ham+6.422
hist+6.329
ulet+5.284
epad+5.237
nda+5.209
Neg logit contributions
 Strait-12.579
atos-12.369
natureconservancy-12.289
ileged-12.215
 JO-12.159
Token character
Token ID2095
Feature activation+3.69e+00

Pos logit contributions
 Strait+2.531
 JO+2.434
atos+2.389
natureconservancy+2.341
perse+2.309
Neg logit contributions
hist-2.873
ham-2.829
quite-2.535
nda-2.531
endars-2.522
Token
Token ID851
Feature activation+9.07e-02

Pos logit contributions
 Strait+2.464
 JO+2.358
atos+2.354
natureconservancy+2.270
perse+2.234
Neg logit contributions
hist-2.707
ham-2.678
yip-2.413
nda-2.408
quite-2.401
Token is
Token ID318
Feature activation+1.77e+00

Pos logit contributions
 Strait+0.060
 JO+0.058
atos+0.057
natureconservancy+0.056
perse+0.055
Neg logit contributions
hist-0.069
ham-0.067
quite-0.060
nda-0.060
aic-0.059
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
 Strait+1.038
 JO+0.994
atos+0.982
natureconservancy+0.946
perse+0.941
Neg logit contributions
hist-1.467
ham-1.440
quite-1.307
nda-1.305
endars-1.300
Token chooses
Token ID19769
Feature activation+5.96e+00

Pos logit contributions
 Strait+1.128
 JO+1.080
atos+1.076
natureconservancy+1.041
perse+1.030
Neg logit contributions
hist-1.360
ham-1.331
quite-1.199
nda-1.194
aic-1.187
Token to
Token ID284
Feature activation+4.97e+00

Pos logit contributions
 Strait+3.762
 JO+3.588
atos+3.522
perse+3.393
iao+3.292
Neg logit contributions
hist-4.608
ham-4.486
quite-4.111
aic-4.091
endars-4.084
Token go
Token ID467
Feature activation+2.77e+00

Pos logit contributions
 Strait+3.154
 JO+3.070
atos+2.997
perse+2.879
natureconservancy+2.848
Neg logit contributions
hist-3.828
ham-3.715
endars-3.405
yip-3.391
nda-3.384
Token alone
Token ID3436
Feature activation-9.75e-02

Pos logit contributions
 Strait+1.734
 JO+1.658
atos+1.640
natureconservancy+1.578
perse+1.568
Neg logit contributions
hist-2.190
ham-2.143
quite-1.937
endars-1.930
nda-1.929
Token to
Token ID284
Feature activation+3.34e+00

Pos logit contributions
hist+0.077
ham+0.075
quite+0.068
nda+0.067
endars+0.067
Neg logit contributions
 Strait-0.061
 JO-0.059
atos-0.059
natureconservancy-0.057
perse-0.056
Token Mord
Token ID29548
Feature activation-1.45e+01

Pos logit contributions
 Strait+2.240
 JO+2.155
atos+2.131
perse+2.045
natureconservancy+2.022
Neg logit contributions
hist-2.490
ham-2.422
quite-2.186
nda-2.183
endars-2.175
Tokenor
Token ID273
Feature activation-2.35e+00

Pos logit contributions
ham+4.807
hist+4.327
aic+3.472
quite+3.374
had+3.207
Neg logit contributions
natureconservancy-15.548
 Strait-14.344
龍契士-14.334
 JO-14.267
-14.155
Token,
Token ID11
Feature activation+2.21e+00

Pos logit contributions
hist+1.761
ham+1.748
quite+1.553
nda+1.543
aic+1.526
Neg logit contributions
 Strait-1.520
atos-1.485
 JO-1.477
natureconservancy-1.464
perse-1.436
Token but
Token ID475
Feature activation+2.14e+00

Pos logit contributions
 Strait+1.303
 JO+1.238
atos+1.222
natureconservancy+1.174
perse+1.166
Neg logit contributions
hist-1.837
ham-1.796
nda-1.631
quite-1.628
aic-1.625
Token Sam
Token ID3409
Feature activation-3.40e+00

Pos logit contributions
 Strait+1.469
 JO+1.411
atos+1.406
natureconservancy+1.366
perse+1.352
Neg logit contributions
hist-1.569
ham-1.535
quite-1.372
nda-1.366
endars-1.357
Token guesses
Token ID44774
Feature activation+2.07e+00

Pos logit contributions
hist+2.531
ham+2.530
nda+2.241
quite+2.237
aic+2.208
Neg logit contributions
 Strait-2.208
 JO-2.127
natureconservancy-2.109
perse-2.089
atos-2.086
Token the
Token ID262
Feature activation-4.65e-01

Pos logit contributions
hist+0.873
ham+0.858
quite+0.763
nda+0.763
aic+0.760
Neg logit contributions
 Strait-0.760
atos-0.732
 JO-0.724
natureconservancy-0.717
perse-0.705
Token mobile
Token ID5175
Feature activation-1.73e+00

Pos logit contributions
hist+0.354
ham+0.348
quite+0.310
nda+0.309
aic+0.307
Neg logit contributions
 Strait-0.302
atos-0.292
 JO-0.288
natureconservancy-0.285
perse-0.281
Token consumer
Token ID7172
Feature activation+2.55e+00

Pos logit contributions
hist+1.407
ham+1.381
nda+1.246
quite+1.240
endars+1.236
Neg logit contributions
 Strait-1.027
atos-0.995
 JO-0.980
natureconservancy-0.973
perse-0.949
Token that
Token ID326
Feature activation+2.32e+00

Pos logit contributions
 Strait+1.628
 JO+1.563
atos+1.548
natureconservancy+1.494
perse+1.480
Neg logit contributions
hist-1.983
ham-1.942
quite-1.751
nda-1.746
endars-1.738
Token his
Token ID465
Feature activation+1.53e+00

Pos logit contributions
 Strait+1.466
 JO+1.405
atos+1.386
natureconservancy+1.327
perse+1.326
Neg logit contributions
hist-1.826
ham-1.781
quite-1.617
nda-1.607
aic-1.604
Token or
Token ID393
Feature activation-1.45e+01

Pos logit contributions
 Strait+1.019
 JO+0.977
atos+0.970
natureconservancy+0.937
perse+0.929
Neg logit contributions
hist-1.144
ham-1.117
quite-1.005
nda-0.998
endars-0.993
Token her
Token ID607
Feature activation+2.98e-01

Pos logit contributions
ham+6.087
hist+5.975
epad+4.784
ulet+4.706
quite+4.540
Neg logit contributions
atos-13.026
 Strait-12.720
natureconservancy-12.569
perse-12.485
 JO-12.334
Token registration
Token ID9352
Feature activation+1.68e+00

Pos logit contributions
 Strait+0.199
atos+0.192
 JO+0.192
natureconservancy+0.187
perse+0.185
Neg logit contributions
hist-0.221
ham-0.217
quite-0.194
nda-0.193
endars-0.192
Token will
Token ID481
Feature activation+7.97e-01

Pos logit contributions
 Strait+1.218
 JO+1.173
atos+1.165
natureconservancy+1.132
perse+1.121
Neg logit contributions
hist-1.156
ham-1.128
quite-1.004
nda-0.997
aic-0.991
Token expire
Token ID24264
Feature activation-2.39e+00

Pos logit contributions
 Strait+0.461
 JO+0.439
atos+0.437
natureconservancy+0.424
perse+0.417
Neg logit contributions
hist-0.668
ham-0.656
quite-0.595
nda-0.592
aic-0.588
Token,
Token ID11
Feature activation+8.20e-01

Pos logit contributions
hist+1.807
ham+1.779
quite+1.596
nda+1.573
laws+1.562
Neg logit contributions
 Strait-1.556
atos-1.506
 JO-1.497
natureconservancy-1.490
perse-1.464
Token the
Token ID262
Feature activation-2.86e+00

Pos logit contributions
hist+0.150
ham+0.147
quite+0.132
nda+0.131
endars+0.130
Neg logit contributions
 Strait-0.130
atos-0.125
 JO-0.124
natureconservancy-0.122
perse-0.120
Token honour
Token ID15393
Feature activation+7.42e-01

Pos logit contributions
hist+1.959
ham+1.909
quite+1.687
nda+1.670
aic+1.664
Neg logit contributions
 Strait-2.059
atos-2.016
 JO-1.979
natureconservancy-1.970
perse-1.933
Token to
Token ID284
Feature activation-2.06e-01

Pos logit contributions
 Strait+0.452
 JO+0.433
atos+0.431
natureconservancy+0.416
perse+0.413
Neg logit contributions
hist-0.598
ham-0.586
quite-0.530
nda-0.528
endars-0.525
Token be
Token ID307
Feature activation-1.99e-01

Pos logit contributions
hist+0.154
ham+0.151
quite+0.135
nda+0.135
aic+0.133
Neg logit contributions
 Strait-0.137
atos-0.131
 JO-0.130
natureconservancy-0.129
perse-0.126
Token,
Token ID11
Feature activation-4.23e+00

Pos logit contributions
hist+0.145
ham+0.142
quite+0.127
nda+0.126
aic+0.125
Neg logit contributions
 Strait-0.137
 JO-0.132
atos-0.132
natureconservancy-0.129
perse-0.127
Token Mad
Token ID4627
Feature activation-1.44e+01

Pos logit contributions
hist+3.132
ham+3.054
quite+2.753
nda+2.680
laws+2.657
Neg logit contributions
 Strait-2.808
natureconservancy-2.724
atos-2.712
 JO-2.699
perse-2.602
Tokenam
Token ID321
Feature activation-5.11e+00

Pos logit contributions
hist+9.188
ham+8.766
hur+7.361
quite+7.329
nda+7.317
Neg logit contributions
natureconservancy-10.127
 Strait-9.971
龍契士-9.695
 transitions-9.536
 JO-9.495
Token,
Token ID11
Feature activation-3.90e+00

Pos logit contributions
hist+3.481
ham+3.411
nda+2.999
quite+2.998
endars+2.947
Neg logit contributions
 Strait-3.562
natureconservancy-3.530
atos-3.522
 JO-3.431
perse-3.404
Token Your
Token ID3406
Feature activation-4.86e+00

Pos logit contributions
hist+2.741
ham+2.689
quite+2.395
nda+2.362
aic+2.331
Neg logit contributions
 Strait-2.718
atos-2.638
natureconservancy-2.629
 JO-2.617
perse-2.543
Token Majesty
Token ID25788
Feature activation-7.20e+00

Pos logit contributions
hist+3.690
ham+3.655
nda+3.269
quite+3.234
aic+3.191
Neg logit contributions
 Strait-3.076
atos-2.973
 JO-2.965
natureconservancy-2.957
perse-2.876
Token's
Token ID338
Feature activation-4.55e+00

Pos logit contributions
hist+4.352
ham+4.244
quite+3.687
nda+3.649
antine+3.649
Neg logit contributions
natureconservancy-5.439
 Strait-5.413
atos-5.400
 JO-5.357
perse-5.229
Token
Token ID198
Feature activation-2.16e+00

Pos logit contributions
hist+1.346
ham+1.318
quite+1.151
nda+1.140
aic+1.126
Neg logit contributions
 Strait-1.566
atos-1.514
 JO-1.504
natureconservancy-1.470
perse-1.463
TokenMan
Token ID5124
Feature activation-2.78e+00

Pos logit contributions
hist+1.554
ham+1.515
quite+1.349
nda+1.333
aic+1.324
Neg logit contributions
 Strait-1.533
 JO-1.447
atos-1.447
natureconservancy-1.422
perse-1.405
Tokenage
Token ID496
Feature activation-2.44e+00

Pos logit contributions
hist+2.037
ham+2.007
quite+1.769
nda+1.764
aic+1.750
Neg logit contributions
 Strait-1.896
 JO-1.811
atos-1.798
natureconservancy-1.786
perse-1.762
Token Your
Token ID3406
Feature activation-1.55e+00

Pos logit contributions
hist+1.661
ham+1.640
endars+1.443
nda+1.431
quite+1.431
Neg logit contributions
 Strait-1.766
atos-1.712
natureconservancy-1.698
 JO-1.681
perse-1.670
Token Ent
Token ID7232
Feature activation-6.88e+00

Pos logit contributions
hist+1.054
ham+1.037
quite+0.911
endars+0.909
nda+0.908
Neg logit contributions
 Strait-1.129
atos-1.101
natureconservancy-1.083
 JO-1.081
perse-1.071
Tokenire
Token ID557
Feature activation-1.44e+01

Pos logit contributions
hist+4.870
ham+4.844
quite+4.271
nda+4.262
endars+4.211
Neg logit contributions
 Strait-4.796
natureconservancy-4.503
 JO-4.494
atos-4.434
perse-4.336
Token Business
Token ID7320
Feature activation+2.31e+00

Pos logit contributions
ham+8.951
hist+8.461
nda+7.534
endars+7.464
Page+7.415
Neg logit contributions
natureconservancy-10.136
perse-9.600
 Strait-9.434
atos-9.395
ileged-9.193
Token from
Token ID422
Feature activation+2.17e+00

Pos logit contributions
 Strait+1.583
 JO+1.525
atos+1.511
natureconservancy+1.458
perse+1.447
Neg logit contributions
hist-1.693
ham-1.656
quite-1.479
nda-1.473
aic-1.469
Token One
Token ID1881
Feature activation+2.90e+00

Pos logit contributions
 Strait+1.524
 JO+1.471
atos+1.456
natureconservancy+1.408
perse+1.396
Neg logit contributions
hist-1.549
ham-1.508
quite-1.347
nda-1.343
aic-1.333
Token Dash
Token ID16189
Feature activation+5.25e-01

Pos logit contributions
 Strait+2.076
 JO+2.019
atos+1.965
natureconservancy+1.883
perse+1.877
Neg logit contributions
hist-2.045
ham-1.965
quite-1.784
nda-1.772
endars-1.756
Tokenboard
Token ID3526
Feature activation+2.41e+00

Pos logit contributions
 Strait+0.338
 JO+0.324
atos+0.323
natureconservancy+0.315
perse+0.311
Neg logit contributions
hist-0.404
ham-0.397
quite-0.356
nda-0.354
aic-0.353
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
hist+0.039
ham+0.039
quite+0.034
nda+0.034
aic+0.034
Neg logit contributions
 Strait-0.032
 JO-0.031
atos-0.031
natureconservancy-0.030
perse-0.030
Token appeared
Token ID4120
Feature activation+8.84e-01

Pos logit contributions
hist+0.947
ham+0.932
nda+0.828
quite+0.827
aic+0.814
Neg logit contributions
 Strait-0.887
atos-0.859
 JO-0.856
natureconservancy-0.843
perse-0.830
Token in
Token ID287
Feature activation-7.50e-01

Pos logit contributions
 Strait+0.600
 JO+0.577
atos+0.573
natureconservancy+0.555
perse+0.551
Neg logit contributions
hist-0.653
ham-0.637
quite-0.570
nda-0.567
aic-0.565
Token the
Token ID262
Feature activation-6.88e+00

Pos logit contributions
hist+0.507
ham+0.497
quite+0.439
nda+0.438
aic+0.433
Neg logit contributions
 Strait-0.551
 JO-0.534
atos-0.534
natureconservancy-0.526
perse-0.515
Token Melbourne
Token ID14819
Feature activation-2.43e+00

Pos logit contributions
hist+4.885
ham+4.762
nda+4.320
quite+4.305
aic+4.240
Neg logit contributions
natureconservancy-4.603
perse-4.405
atos-4.400
 Strait-4.330
 JO-4.306
Token Mag
Token ID2944
Feature activation-1.43e+01

Pos logit contributions
hist+1.346
ham+1.336
nda+1.143
quite+1.120
aic+1.118
Neg logit contributions
natureconservancy-2.032
 Strait-2.010
atos-2.001
perse-1.994
 JO-1.978
Tokenistrates
Token ID37909
Feature activation-5.73e+00

Pos logit contributions
ham+4.771
hist+4.557
nda+3.732
aic+3.644
endars+3.530
Neg logit contributions
natureconservancy-14.512
 Strait-13.996
 JO-13.952
atos-13.806
perse-13.643
Token Court
Token ID3078
Feature activation-5.19e+00

Pos logit contributions
hist+2.728
ham+2.694
nda+2.272
quite+2.231
aic+2.225
Neg logit contributions
natureconservancy-5.165
 Strait-5.162
atos-5.111
 JO-5.090
perse-4.992
Token this
Token ID428
Feature activation-4.05e-01

Pos logit contributions
ham+3.371
hist+3.369
nda+2.962
quite+2.959
aic+2.890
Neg logit contributions
natureconservancy-3.789
 Strait-3.745
atos-3.709
 JO-3.644
perse-3.604
Token afternoon
Token ID6672
Feature activation-1.41e+00

Pos logit contributions
hist+0.322
ham+0.316
quite+0.285
nda+0.284
endars+0.282
Neg logit contributions
 Strait-0.251
 JO-0.240
atos-0.239
natureconservancy-0.232
perse-0.229
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
hist+1.032
ham+1.019
nda+0.908
quite+0.907
endars+0.892
Neg logit contributions
 Strait-0.955
atos-0.927
 JO-0.915
natureconservancy-0.915
perse-0.887
Token.
Token ID13
Feature activation+7.93e-01

Pos logit contributions
hist+1.951
ham+1.906
quite+1.686
nda+1.672
aic+1.659
Neg logit contributions
 Strait-1.859
atos-1.788
 JO-1.776
natureconservancy-1.760
perse-1.728
Token This
Token ID770
Feature activation-3.86e-02

Pos logit contributions
 Strait+0.513
 JO+0.492
atos+0.490
natureconservancy+0.475
perse+0.470
Neg logit contributions
hist-0.610
ham-0.597
quite-0.537
nda-0.535
endars-0.531
Token minim
Token ID10356
Feature activation-3.50e+00

Pos logit contributions
hist+0.030
ham+0.029
quite+0.026
nda+0.026
aic+0.025
Neg logit contributions
 Strait-0.025
atos-0.024
 JO-0.024
natureconservancy-0.024
perse-0.023
Tokenizes
Token ID4340
Feature activation-3.39e-02

Pos logit contributions
hist+2.180
ham+2.118
aic+1.845
quite+1.841
epad+1.822
Neg logit contributions
 Strait-2.777
 JO-2.674
natureconservancy-2.630
atos-2.616
perse-2.574
Token the
Token ID262
Feature activation-2.28e+00

Pos logit contributions
hist+0.026
ham+0.025
quite+0.022
nda+0.022
aic+0.022
Neg logit contributions
 Strait-0.022
 JO-0.022
atos-0.022
natureconservancy-0.021
perse-0.021
Token chance
Token ID2863
Feature activation-1.43e+01

Pos logit contributions
hist+1.667
ham+1.623
quite+1.444
nda+1.434
epad+1.428
Neg logit contributions
 Strait-1.554
atos-1.502
 JO-1.485
natureconservancy-1.461
perse-1.458
Token that
Token ID326
Feature activation-4.68e+00

Pos logit contributions
hist+8.330
ham+8.196
epad+7.207
aic+7.169
quite+7.115
Neg logit contributions
 Strait-10.032
 JO-9.534
atos-9.501
natureconservancy-9.375
iao-9.351
Token a
Token ID257
Feature activation-2.69e+00

Pos logit contributions
hist+3.280
ham+3.181
quite+2.832
epad+2.818
nda+2.801
Neg logit contributions
 Strait-3.275
atos-3.147
 JO-3.117
natureconservancy-3.108
perse-3.050
Token normal
Token ID3487
Feature activation-2.72e+00

Pos logit contributions
hist+1.989
ham+1.940
quite+1.717
nda+1.708
aic+1.705
Neg logit contributions
 Strait-1.833
atos-1.764
 JO-1.742
natureconservancy-1.715
perse-1.689
Token word
Token ID1573
Feature activation-6.25e-01

Pos logit contributions
hist+1.938
ham+1.893
epad+1.673
nda+1.660
quite+1.658
Neg logit contributions
 Strait-1.902
atos-1.837
 JO-1.806
natureconservancy-1.795
perse-1.773
Token like
Token ID588
Feature activation-2.32e+00

Pos logit contributions
hist+0.501
ham+0.494
quite+0.442
nda+0.439
epad+0.438
Neg logit contributions
 Strait-0.386
atos-0.367
 JO-0.365
natureconservancy-0.355
perse-0.350
Token)
Token ID8
Feature activation-3.36e+00

Pos logit contributions
 Strait+1.344
 JO+1.283
atos+1.262
natureconservancy+1.222
perse+1.197
Neg logit contributions
hist-2.189
ham-2.139
endars-1.988
yip-1.984
quite-1.968
Token
Token ID198
Feature activation-1.30e-01

Pos logit contributions
hist+2.285
ham+2.238
quite+1.965
nda+1.952
aic+1.942
Neg logit contributions
 Strait-2.452
atos-2.371
 JO-2.357
natureconservancy-2.317
perse-2.290
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
hist+0.089
ham+0.087
quite+0.077
nda+0.077
aic+0.076
Neg logit contributions
 Strait-0.095
 JO-0.092
atos-0.092
natureconservancy-0.089
perse-0.088
Token5
Token ID20
Feature activation+1.57e+00

Pos logit contributions
hist+1.177
ham+1.156
quite+1.047
nda+1.034
aic+1.031
Neg logit contributions
 Strait-0.870
 JO-0.828
atos-0.820
natureconservancy-0.802
perse-0.790
Token.
Token ID13
Feature activation+7.06e-01

Pos logit contributions
 Strait+1.056
 JO+1.015
atos+1.007
natureconservancy+0.977
perse+0.966
Neg logit contributions
hist-1.162
ham-1.134
quite-1.020
endars-1.015
nda-1.014
Token Jon
Token ID5966
Feature activation-1.43e+01

Pos logit contributions
 Strait+0.476
atos+0.457
 JO+0.457
natureconservancy+0.445
perse+0.440
Neg logit contributions
hist-0.524
ham-0.513
quite-0.458
nda-0.456
aic-0.453
Tokenny
Token ID3281
Feature activation-4.18e+00

Pos logit contributions
ham+7.116
hist+6.934
nda+5.888
quite+5.498
aic+5.466
Neg logit contributions
natureconservancy-11.629
 JO-11.485
perse-11.310
龍契士-11.253
 transitions-11.020
Token Gray
Token ID12723
Feature activation+1.48e+00

Pos logit contributions
hist+3.388
ham+3.378
quite+3.036
nda+3.018
laws+2.968
Neg logit contributions
atos-2.356
 Strait-2.318
 JO-2.316
natureconservancy-2.301
perse-2.282
Token CAP
Token ID20176
Feature activation+2.41e+00

Pos logit contributions
 Strait+0.946
 JO+0.905
atos+0.901
natureconservancy+0.872
perse+0.863
Neg logit contributions
hist-1.151
ham-1.125
quite-1.016
nda-1.010
endars-1.008
TokenTAIN
Token ID30339
Feature activation+7.60e+00

Pos logit contributions
 Strait+1.625
 JO+1.572
atos+1.555
natureconservancy+1.503
perse+1.487
Neg logit contributions
hist-1.788
ham-1.744
quite-1.571
nda-1.559
endars-1.548
Token (
Token ID357
Feature activation+8.03e-01

Pos logit contributions
 JO+4.749
 Strait+4.642
atos+4.409
natureconservancy+4.079
perse+4.066
Neg logit contributions
hist-5.890
ham-5.721
quite-5.304
endars-5.300
yip-5.286
Token/
Token ID14
Feature activation-2.28e+00

Pos logit contributions
 Strait+0.061
 JO+0.059
atos+0.058
natureconservancy+0.056
perse+0.056
Neg logit contributions
hist-0.068
ham-0.066
quite-0.060
nda-0.060
aic-0.059
Tokenprofile
Token ID13317
Feature activation+4.35e+00

Pos logit contributions
hist+1.637
ham+1.591
quite+1.411
aic+1.403
nda+1.393
Neg logit contributions
 Strait-1.621
 JO-1.544
atos-1.542
natureconservancy-1.529
perse-1.483
Token?
Token ID30
Feature activation+2.18e+00

Pos logit contributions
 Strait+2.792
 JO+2.722
atos+2.669
perse+2.557
natureconservancy+2.536
Neg logit contributions
hist-3.258
ham-3.205
quite-2.921
nda-2.893
aic-2.867
Tokenget
Token ID1136
Feature activation+3.61e+00

Pos logit contributions
 Strait+1.592
 JO+1.541
atos+1.533
natureconservancy+1.485
perse+1.475
Neg logit contributions
hist-1.473
ham-1.442
quite-1.282
nda-1.276
endars-1.266
Token=
Token ID28
Feature activation+1.09e+00

Pos logit contributions
 Strait+2.241
 JO+2.166
atos+2.153
natureconservancy+2.066
perse+2.041
Neg logit contributions
hist-2.800
ham-2.757
quite-2.502
nda-2.489
endars-2.473
TokenNAME
Token ID20608
Feature activation+8.51e+00

Pos logit contributions
 Strait+0.773
 JO+0.744
atos+0.742
natureconservancy+0.721
perse+0.714
Neg logit contributions
hist-0.771
ham-0.754
quite-0.671
nda-0.668
endars-0.663
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
 JO+4.396
 Strait+4.366
atos+4.098
perse+3.892
iao+3.855
Neg logit contributions
hist-7.259
ham-7.016
quite-6.585
nda-6.506
yip-6.480
TokenDP
Token ID6322
Feature activation+2.95e+00

Pos logit contributions
 Strait+1.149
 JO+1.108
atos+1.100
natureconservancy+1.067
perse+1.057
Neg logit contributions
hist-1.251
ham-1.225
quite-1.098
nda-1.093
endars-1.086
Token02
Token ID2999
Feature activation+2.74e+00

Pos logit contributions
 Strait+1.872
 JO+1.808
atos+1.792
natureconservancy+1.716
perse+1.704
Neg logit contributions
hist-2.292
ham-2.246
quite-2.035
nda-2.020
endars-2.004
Token_
Token ID62
Feature activation-1.45e+00

Pos logit contributions
 Strait+1.686
 JO+1.621
atos+1.604
natureconservancy+1.545
perse+1.532
Neg logit contributions
hist-2.183
ham-2.140
quite-1.939
nda-1.930
aic-1.917
Token00
Token ID405
Feature activation+6.34e-01

Pos logit contributions
hist+1.116
ham+1.088
quite+0.971
nda+0.963
aic+0.962
Neg logit contributions
 Strait-0.959
atos-0.909
 JO-0.907
natureconservancy-0.893
perse-0.876
Token plenty
Token ID6088
Feature activation+7.63e-01

Pos logit contributions
 Strait+1.088
 JO+1.044
atos+1.035
natureconservancy+1.000
perse+0.993
Neg logit contributions
hist-1.268
ham-1.241
quite-1.116
nda-1.110
endars-1.108
Token more
Token ID517
Feature activation+1.26e+00

Pos logit contributions
 Strait+0.490
 JO+0.471
atos+0.471
natureconservancy+0.457
perse+0.444
Neg logit contributions
hist-0.590
ham-0.575
quite-0.517
endars-0.517
nda-0.513
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
 Strait+0.860
 JO+0.826
atos+0.824
natureconservancy+0.801
perse+0.792
Neg logit contributions
hist-0.926
ham-0.905
quite-0.809
nda-0.806
endars-0.800
Token There
Token ID1318
Feature activation-1.31e+00

Pos logit contributions
 Strait+1.200
 JO+1.151
atos+1.142
natureconservancy+1.102
perse+1.092
Neg logit contributions
hist-1.429
ham-1.399
quite-1.260
nda-1.254
endars-1.249
Token are
Token ID389
Feature activation+6.05e+00

Pos logit contributions
hist+0.805
ham+0.785
quite+0.686
nda+0.682
endars+0.676
Neg logit contributions
 Strait-1.051
 JO-1.015
atos-1.011
natureconservancy-0.991
perse-0.980
Token some
Token ID617
Feature activation+8.69e+00

Pos logit contributions
 Strait+3.819
 JO+3.683
atos+3.578
perse+3.427
natureconservancy+3.411
Neg logit contributions
hist-4.653
ham-4.548
yip-4.136
endars-4.116
aic-4.094
Token downs
Token ID21838
Feature activation+1.19e+00

Pos logit contributions
 Strait+5.362
 JO+5.149
atos+4.892
 transitions+4.769
natureconservancy+4.655
Neg logit contributions
hist-6.602
ham-6.469
yip-6.030
endars-5.970
aic-5.880
Tokenides
Token ID1460
Feature activation+1.28e+00

Pos logit contributions
 Strait+0.539
 JO+0.506
atos+0.504
natureconservancy+0.481
perse+0.474
Neg logit contributions
hist-1.142
ham-1.123
quite-1.034
nda-1.030
endars-1.025
Token,
Token ID11
Feature activation+6.05e-01

Pos logit contributions
 Strait+0.775
 JO+0.740
atos+0.738
natureconservancy+0.714
perse+0.706
Neg logit contributions
hist-1.045
ham-1.024
quite-0.926
nda-0.922
aic-0.917
Token like
Token ID588
Feature activation-9.53e-01

Pos logit contributions
 Strait+0.372
 JO+0.357
atos+0.356
natureconservancy+0.346
perse+0.341
Neg logit contributions
hist-0.484
ham-0.475
quite-0.429
nda-0.426
aic-0.424
Token having
Token ID1719
Feature activation+9.93e-01

Pos logit contributions
hist+0.713
ham+0.698
quite+0.622
nda+0.621
aic+0.614
Neg logit contributions
 Strait-0.634
atos-0.615
 JO-0.609
natureconservancy-0.598
perse-0.590
Token |
Token ID930
Feature activation-2.87e+00

Pos logit contributions
hist+0.678
ham+0.663
quite+0.590
nda+0.587
endars+0.583
Neg logit contributions
 Strait-0.684
 JO-0.659
atos-0.656
natureconservancy-0.639
perse-0.632
Token +
Token ID1343
Feature activation+1.49e+00

Pos logit contributions
hist+1.891
ham+1.839
quite+1.631
nda+1.607
aic+1.597
Neg logit contributions
 Strait-2.154
atos-2.076
 JO-2.054
natureconservancy-2.023
perse-2.011
Token--------------------
Token ID19351
Feature activation+8.32e+00

Pos logit contributions
 Strait+0.853
 JO+0.813
atos+0.809
natureconservancy+0.782
perse+0.772
Neg logit contributions
hist-1.261
ham-1.236
quite-1.124
nda-1.119
endars-1.114
Token--+
Token ID44785
Feature activation-7.99e-01

Pos logit contributions
 Strait+5.209
atos+5.058
 JO+5.034
natureconservancy+4.775
iao+4.738
Neg logit contributions
hist-6.361
ham-6.130
endars-5.640
quite-5.540
nda-5.532
Token +
Token ID1343
Feature activation+1.10e+00

Pos logit contributions
hist+0.554
ham+0.541
quite+0.481
nda+0.479
endars+0.476
Neg logit contributions
 Strait-0.578
 JO-0.558
atos-0.555
natureconservancy-0.540
perse-0.534
Token--------------------
Token ID19351
Feature activation+8.70e+00

Pos logit contributions
 Strait+0.633
 JO+0.603
atos+0.601
natureconservancy+0.580
perse+0.573
Neg logit contributions
hist-0.927
ham-0.909
quite-0.826
nda-0.822
endars-0.817
Token--+
Token ID44785
Feature activation-2.74e+00

Pos logit contributions
 Strait+5.418
atos+5.256
 JO+5.252
iao+4.934
natureconservancy+4.914
Neg logit contributions
hist-6.682
ham-6.416
endars-5.919
quite-5.822
yip-5.811
Token <
Token ID1279
Feature activation+1.44e+00

Pos logit contributions
hist+1.982
ham+1.936
quite+1.728
nda+1.725
aic+1.706
Neg logit contributions
 Strait-1.892
atos-1.823
 JO-1.805
natureconservancy-1.772
perse-1.755
Token--
Token ID438
Feature activation+3.84e+00

Pos logit contributions
 Strait+0.983
 JO+0.946
atos+0.943
natureconservancy+0.915
perse+0.906
Neg logit contributions
hist-1.047
ham-1.024
quite-0.915
nda-0.912
endars-0.911
Token Start
Token ID7253
Feature activation+3.23e+00

Pos logit contributions
 Strait+2.277
 JO+2.201
atos+2.117
natureconservancy+2.047
perse+2.022
Neg logit contributions
hist-3.143
ham-3.080
endars-2.799
quite-2.788
aic-2.780
Token of
Token ID286
Feature activation+4.84e-01

Pos logit contributions
 Strait+1.958
 JO+1.880
atos+1.862
natureconservancy+1.816
perse+1.771
Neg logit contributions
hist-2.600
ham-2.538
endars-2.323
quite-2.303
nda-2.291
Tokens
Token ID82
Feature activation+2.77e-01

Pos logit contributions
hist+0.536
ham+0.525
quite+0.467
nda+0.465
aic+0.461
Neg logit contributions
 Strait-0.507
atos-0.490
 JO-0.488
natureconservancy-0.478
perse-0.472
Token Coach
Token ID16393
Feature activation+9.66e-01

Pos logit contributions
 Strait+0.187
 JO+0.180
atos+0.180
natureconservancy+0.175
perse+0.173
Neg logit contributions
hist-0.205
ham-0.200
quite-0.179
nda-0.178
aic-0.177
Token returns
Token ID5860
Feature activation+2.60e+00

Pos logit contributions
 Strait+0.632
 JO+0.606
atos+0.604
natureconservancy+0.586
perse+0.580
Neg logit contributions
hist-0.736
ham-0.721
quite-0.647
nda-0.644
endars-0.640
Token to
Token ID284
Feature activation-3.06e+00

Pos logit contributions
 Strait+1.611
 JO+1.548
atos+1.528
natureconservancy+1.480
perse+1.463
Neg logit contributions
hist-2.064
ham-2.017
quite-1.823
nda-1.818
aic-1.811
Token Recap
Token ID46585
Feature activation+2.29e+00

Pos logit contributions
hist+2.183
ham+2.158
quite+1.904
nda+1.888
aic+1.883
Neg logit contributions
 Strait-2.091
atos-2.056
 JO-1.984
natureconservancy-1.982
perse-1.953
Token San
Token ID2986
Feature activation+9.61e+00

Pos logit contributions
 Strait+1.501
 JO+1.441
atos+1.428
natureconservancy+1.384
perse+1.372
Neg logit contributions
hist-1.744
ham-1.701
quite-1.532
nda-1.528
endars-1.519
Token Juan
Token ID16852
Feature activation+3.60e+00

Pos logit contributions
 Strait+2.938
 JO+2.895
atos+2.451
iao+2.268
natureconservancy+2.166
Neg logit contributions
hist-10.282
ham-9.841
aic-9.548
endars-9.333
yip-9.311
Token Del
Token ID4216
Feature activation+5.03e-01

Pos logit contributions
 Strait+2.471
 JO+2.368
atos+2.331
natureconservancy+2.228
perse+2.217
Neg logit contributions
hist-2.666
ham-2.581
quite-2.303
aic-2.291
endars-2.287
Token Sur
Token ID4198
Feature activation-2.74e+00

Pos logit contributions
 Strait+0.351
 JO+0.339
atos+0.338
natureconservancy+0.330
perse+0.326
Neg logit contributions
hist-0.360
ham-0.353
quite-0.314
nda-0.313
aic-0.310
Token Episode
Token ID7922
Feature activation+1.69e+00

Pos logit contributions
hist+1.992
ham+1.972
quite+1.741
nda+1.728
endars+1.708
Neg logit contributions
 Strait-1.824
atos-1.800
natureconservancy-1.761
perse-1.736
 JO-1.736
Token 5
Token ID642
Feature activation-3.72e-01

Pos logit contributions
 Strait+1.233
 JO+1.189
atos+1.181
natureconservancy+1.147
perse+1.139
Neg logit contributions
hist-1.167
ham-1.138
quite-1.012
nda-1.006
endars-1.000
Token all
Token ID477
Feature activation-4.08e+00

Pos logit contributions
hist+0.670
ham+0.661
quite+0.591
nda+0.590
aic+0.585
Neg logit contributions
 Strait-0.551
atos-0.530
 JO-0.525
natureconservancy-0.517
perse-0.510
Token-
Token ID12
Feature activation-6.11e-01

Pos logit contributions
hist+2.617
ham+2.596
nda+2.253
quite+2.249
aic+2.227
Neg logit contributions
 Strait-3.105
atos-2.979
natureconservancy-2.963
 JO-2.950
perse-2.931
Tokenwheel
Token ID22001
Feature activation-1.22e+00

Pos logit contributions
hist+0.359
ham+0.351
quite+0.304
nda+0.301
aic+0.298
Neg logit contributions
 Strait-0.510
atos-0.491
 JO-0.491
natureconservancy-0.481
perse-0.476
Token drive
Token ID3708
Feature activation+7.77e-01

Pos logit contributions
hist+1.067
ham+1.060
quite+0.954
nda+0.950
endars+0.943
Neg logit contributions
 Strait-0.662
atos-0.636
 JO-0.621
natureconservancy-0.618
perse-0.600
Token with
Token ID351
Feature activation+4.10e+00

Pos logit contributions
 Strait+0.482
 JO+0.461
atos+0.460
natureconservancy+0.447
perse+0.441
Neg logit contributions
hist-0.617
ham-0.606
quite-0.546
nda-0.544
aic-0.540
Token durable
Token ID20923
Feature activation+8.77e+00

Pos logit contributions
 Strait+2.643
 JO+2.545
atos+2.455
natureconservancy+2.398
perse+2.354
Neg logit contributions
hist-3.188
ham-3.068
endars-2.811
quite-2.793
nda-2.768
Token,
Token ID11
Feature activation-2.33e+00

Pos logit contributions
 Strait+5.298
 JO+5.154
 transitions+4.763
atos+4.762
iao+4.544
Neg logit contributions
hist-6.972
ham-6.574
yip-6.329
endars-6.242
quite-6.176
Token mechanical
Token ID12370
Feature activation-1.40e+00

Pos logit contributions
hist+1.710
ham+1.687
nda+1.498
quite+1.489
aic+1.468
Neg logit contributions
 Strait-1.560
atos-1.531
 JO-1.490
natureconservancy-1.489
perse-1.469
Token-
Token ID12
Feature activation+5.72e+00

Pos logit contributions
hist+1.045
ham+1.035
nda+0.918
quite+0.917
aic+0.901
Neg logit contributions
 Strait-0.927
atos-0.909
natureconservancy-0.888
 JO-0.884
perse-0.873
Tokenlocking
Token ID48331
Feature activation+1.65e+00

Pos logit contributions
 Strait+4.305
 JO+4.198
atos+4.149
perse+4.080
iao+3.999
Neg logit contributions
hist-3.482
ham-3.390
endars-3.165
nda-3.053
yip-3.021
Token four
Token ID1440
Feature activation-1.57e+00

Pos logit contributions
 Strait+1.077
 JO+1.033
atos+1.025
natureconservancy+0.992
perse+0.982
Neg logit contributions
hist-1.258
ham-1.227
quite-1.106
nda-1.100
endars-1.095
Token be
Token ID307
Feature activation-3.21e+00

Pos logit contributions
hist+3.265
ham+3.242
quite+2.900
aic+2.892
nda+2.886
Neg logit contributions
 Strait-2.437
natureconservancy-2.256
 JO-2.255
atos-2.252
perse-2.132
Token used
Token ID973
Feature activation-1.91e+00

Pos logit contributions
hist+2.211
ham+2.152
nda+1.906
quite+1.887
laws+1.880
Neg logit contributions
 Strait-2.360
atos-2.233
 JO-2.224
natureconservancy-2.206
perse-2.142
Token to
Token ID284
Feature activation-1.87e-01

Pos logit contributions
hist+1.366
ham+1.335
quite+1.180
nda+1.173
aic+1.171
Neg logit contributions
 Strait-1.342
atos-1.292
 JO-1.289
natureconservancy-1.283
perse-1.240
Token acquire
Token ID12831
Feature activation-4.49e+00

Pos logit contributions
hist+0.142
ham+0.139
quite+0.125
nda+0.124
aic+0.124
Neg logit contributions
 Strait-0.123
atos-0.118
 JO-0.118
natureconservancy-0.115
perse-0.113
Token land
Token ID1956
Feature activation-3.87e+00

Pos logit contributions
hist+2.870
ham+2.825
aic+2.449
nda+2.444
quite+2.433
Neg logit contributions
 Strait-3.428
atos-3.331
natureconservancy-3.266
 JO-3.263
perse-3.232
Token for
Token ID329
Feature activation+2.52e+00

Pos logit contributions
hist+2.737
ham+2.674
quite+2.373
nda+2.355
aic+2.345
Neg logit contributions
 Strait-2.726
atos-2.615
natureconservancy-2.584
 JO-2.570
perse-2.509
Token new
Token ID649
Feature activation+2.57e+00

Pos logit contributions
 Strait+1.508
 JO+1.446
atos+1.428
natureconservancy+1.378
perse+1.364
Neg logit contributions
hist-2.048
ham-2.008
quite-1.822
nda-1.811
endars-1.806
Token parks
Token ID14860
Feature activation+4.26e+00

Pos logit contributions
 Strait+1.366
 JO+1.303
atos+1.288
natureconservancy+1.236
perse+1.216
Neg logit contributions
hist-2.267
ham-2.220
quite-2.038
yip-2.023
endars-2.022
Token,
Token ID11
Feature activation+1.57e+00

Pos logit contributions
 Strait+2.600
 JO+2.505
atos+2.472
perse+2.352
natureconservancy+2.344
Neg logit contributions
hist-3.379
ham-3.307
yip-2.990
nda-2.989
quite-2.984
Token develop
Token ID1205
Feature activation+1.37e+00

Pos logit contributions
 Strait+1.015
 JO+0.972
atos+0.961
natureconservancy+0.931
perse+0.922
Neg logit contributions
hist-1.216
ham-1.186
quite-1.071
nda-1.070
yip-1.062
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 Strait+0.889
 JO+0.853
atos+0.849
natureconservancy+0.823
perse+0.814
Neg logit contributions
hist-1.048
ham-1.026
quite-0.923
nda-0.918
endars-0.913
Token job
Token ID1693
Feature activation+2.61e-01

Pos logit contributions
 Strait+3.918
 JO+3.833
atos+3.611
natureconservancy+3.512
 transitions+3.470
Neg logit contributions
hist-5.692
ham-5.559
yip-5.196
endars-5.100
nda-5.070
Token on
Token ID319
Feature activation+9.47e-01

Pos logit contributions
 Strait+0.174
atos+0.166
 JO+0.166
natureconservancy+0.161
perse+0.160
Neg logit contributions
hist-0.196
ham-0.192
quite-0.173
nda-0.171
endars-0.170
Token Zika
Token ID30861
Feature activation+2.17e+00

Pos logit contributions
 Strait+0.620
 JO+0.597
atos+0.592
natureconservancy+0.572
perse+0.566
Neg logit contributions
hist-0.721
ham-0.706
quite-0.634
nda-0.632
aic-0.628
Token so
Token ID523
Feature activation+4.00e+00

Pos logit contributions
 Strait+1.454
 JO+1.399
atos+1.387
natureconservancy+1.338
perse+1.324
Neg logit contributions
hist-1.627
ham-1.582
nda-1.421
quite-1.421
aic-1.414
Token he
Token ID339
Feature activation+6.72e+00

Pos logit contributions
 Strait+2.625
 JO+2.533
atos+2.494
natureconservancy+2.408
perse+2.403
Neg logit contributions
hist-3.028
ham-2.950
quite-2.651
aic-2.646
nda-2.640
Token could
Token ID714
Feature activation+7.23e+00

Pos logit contributions
 Strait+3.990
 JO+3.889
atos+3.776
perse+3.622
iao+3.602
Neg logit contributions
hist-5.332
ham-5.183
yip-4.805
aic-4.799
nda-4.736
Token stump
Token ID42497
Feature activation+1.17e+00

Pos logit contributions
 JO+4.659
 Strait+4.638
atos+4.463
perse+4.346
iao+4.233
Neg logit contributions
hist-5.359
ham-5.147
endars-4.717
aic-4.710
yip-4.664
Token for
Token ID329
Feature activation+2.34e+00

Pos logit contributions
 Strait+0.892
 JO+0.864
atos+0.855
natureconservancy+0.831
perse+0.825
Neg logit contributions
hist-0.771
ham-0.747
quite-0.660
nda-0.659
aic-0.658
Token Trump
Token ID1301
Feature activation+2.77e+00

Pos logit contributions
 Strait+1.544
 JO+1.491
atos+1.470
natureconservancy+1.415
perse+1.404
Neg logit contributions
hist-1.768
ham-1.729
quite-1.549
nda-1.541
aic-1.540
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
 Strait+1.843
 JO+1.777
atos+1.762
natureconservancy+1.698
perse+1.685
Neg logit contributions
hist-2.062
ham-2.007
quite-1.807
nda-1.805
aic-1.796
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
 Strait+1.340
 JO+1.291
atos+1.279
natureconservancy+1.237
perse+1.225
Neg logit contributions
hist-1.442
ham-1.409
quite-1.261
nda-1.255
endars-1.252
Token.
Token ID13
Feature activation-4.88e-02

Pos logit contributions
 Strait+1.233
 JO+1.188
atos+1.176
natureconservancy+1.136
perse+1.128
Neg logit contributions
hist-1.419
ham-1.383
quite-1.244
nda-1.240
endars-1.236
Token
Token ID198
Feature activation+7.85e-02

Pos logit contributions
hist+0.036
ham+0.035
quite+0.031
nda+0.031
aic+0.031
Neg logit contributions
 Strait-0.033
 JO-0.032
atos-0.032
natureconservancy-0.031
perse-0.031
Token
Token ID198
Feature activation+7.35e-01

Pos logit contributions
 Strait+0.057
 JO+0.055
atos+0.055
natureconservancy+0.053
perse+0.053
Neg logit contributions
hist-0.054
ham-0.053
quite-0.047
nda-0.047
aic-0.046
TokenUnlike
Token ID18521
Feature activation-2.17e+00

Pos logit contributions
 Strait+0.489
 JO+0.471
atos+0.470
natureconservancy+0.455
perse+0.451
Neg logit contributions
hist-0.549
ham-0.537
quite-0.482
nda-0.480
endars-0.478
Token asphalt
Token ID48292
Feature activation+2.31e+00

Pos logit contributions
hist+1.607
ham+1.584
quite+1.409
nda+1.404
aic+1.393
Neg logit contributions
 Strait-1.450
atos-1.387
 JO-1.383
natureconservancy-1.367
perse-1.343
Token repairs
Token ID20097
Feature activation+3.11e+00

Pos logit contributions
 Strait+1.454
 JO+1.398
atos+1.393
natureconservancy+1.335
perse+1.326
Neg logit contributions
hist-1.812
ham-1.759
quite-1.592
nda-1.588
yip-1.584
Token for
Token ID329
Feature activation+2.20e+00

Pos logit contributions
 Strait+1.849
 JO+1.765
atos+1.746
natureconservancy+1.679
perse+1.662
Neg logit contributions
hist-2.550
ham-2.473
quite-2.259
nda-2.251
endars-2.251
Token streets
Token ID6483
Feature activation+2.66e+00

Pos logit contributions
 Strait+1.364
 JO+1.302
atos+1.288
natureconservancy+1.242
perse+1.231
Neg logit contributions
hist-1.749
ham-1.709
quite-1.546
nda-1.538
endars-1.537
Token,
Token ID11
Feature activation+4.01e+00

Pos logit contributions
 Strait+1.623
 JO+1.548
atos+1.545
natureconservancy+1.473
perse+1.468
Neg logit contributions
hist-2.139
ham-2.083
quite-1.888
nda-1.883
endars-1.882
Token concrete
Token ID10017
Feature activation+1.23e+00

Pos logit contributions
 Strait+2.436
 JO+2.332
atos+2.294
natureconservancy+2.210
perse+2.183
Neg logit contributions
hist-3.231
ham-3.145
quite-2.852
nda-2.850
yip-2.837
Token sidewalk
Token ID24058
Feature activation+3.34e+00

Pos logit contributions
 Strait+0.676
 JO+0.644
atos+0.641
natureconservancy+0.616
perse+0.608
Neg logit contributions
hist-1.065
ham-1.044
quite-0.952
nda-0.948
endars-0.944
Token on
Token ID319
Feature activation-2.87e+00

Pos logit contributions
hist+0.433
ham+0.424
quite+0.377
nda+0.373
aic+0.370
Neg logit contributions
 Strait-0.463
 JO-0.447
atos-0.446
natureconservancy-0.435
perse-0.430
Token longer
Token ID2392
Feature activation+6.24e+00

Pos logit contributions
hist+2.357
ham+2.310
quite+2.093
nda+2.084
endars+2.072
Neg logit contributions
 Strait-1.702
 JO-1.625
atos-1.619
natureconservancy-1.567
perse-1.547
Token than
Token ID621
Feature activation+2.40e+00

Pos logit contributions
 Strait+2.743
 JO+2.634
atos+2.627
 transitions+2.409
iao+2.401
Neg logit contributions
hist-5.796
ham-5.743
yip-5.379
endars-5.297
nda-5.274
Token one
Token ID530
Feature activation+5.62e+00

Pos logit contributions
 Strait+1.628
 JO+1.558
atos+1.555
natureconservancy+1.469
perse+1.465
Neg logit contributions
hist-1.766
ham-1.726
endars-1.550
nda-1.550
quite-1.549
Token of
Token ID286
Feature activation+7.77e-01

Pos logit contributions
 Strait+3.629
 JO+3.443
atos+3.407
iao+3.204
natureconservancy+3.168
Neg logit contributions
hist-4.248
ham-4.103
endars-3.834
nda-3.806
quite-3.803
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
 Strait+0.529
 JO+0.506
atos+0.503
natureconservancy+0.483
perse+0.482
Neg logit contributions
hist-0.574
ham-0.560
quite-0.503
endars-0.501
nda-0.501
Token nine
Token ID5193
Feature activation+1.47e+00

Pos logit contributions
 Strait+1.371
 JO+1.316
atos+1.304
natureconservancy+1.250
perse+1.240
Neg logit contributions
hist-1.494
ham-1.462
quite-1.312
aic-1.307
endars-1.307
Token lives
Token ID3160
Feature activation-3.98e+00

Pos logit contributions
 Strait+0.870
 JO+0.830
atos+0.826
natureconservancy+0.792
perse+0.785
Neg logit contributions
hist-1.213
ham-1.188
quite-1.079
nda-1.074
aic-1.070
Token of
Token ID286
Feature activation+1.76e+00

Pos logit contributions
hist+2.863
ham+2.822
nda+2.491
quite+2.477
epad+2.458
Neg logit contributions
 Strait-2.721
atos-2.635
 JO-2.622
natureconservancy-2.606
perse-2.548
Token an
Token ID281
Feature activation+1.43e+00

Pos logit contributions
 Strait+1.171
 JO+1.124
atos+1.115
natureconservancy+1.076
perse+1.068
Neg logit contributions
hist-1.319
ham-1.291
quite-1.160
nda-1.157
endars-1.154
Token average
Token ID2811
Feature activation-2.34e-01

Pos logit contributions
 Strait+0.904
 JO+0.864
atos+0.861
natureconservancy+0.832
perse+0.821
Neg logit contributions
hist-1.122
ham-1.097
quite-0.993
nda-0.989
endars-0.985
Token site
Token ID2524
Feature activation+1.60e+00

Pos logit contributions
hist+1.091
ham+1.082
quite+0.965
nda+0.958
aic+0.951
Neg logit contributions
 Strait-0.940
atos-0.931
 JO-0.909
natureconservancy-0.894
perse-0.888
Token would
Token ID561
Feature activation+2.47e+00

Pos logit contributions
 Strait+1.269
 JO+1.228
atos+1.217
natureconservancy+1.178
perse+1.175
Neg logit contributions
hist-0.998
ham-0.969
nda-0.851
quite-0.849
endars-0.844
Token be
Token ID307
Feature activation-2.72e-01

Pos logit contributions
 Strait+1.451
 JO+1.398
atos+1.384
perse+1.328
natureconservancy+1.308
Neg logit contributions
hist-2.027
ham-1.975
nda-1.802
endars-1.801
quite-1.797
Token tid
Token ID29770
Feature activation+2.62e+00

Pos logit contributions
hist+0.196
ham+0.191
quite+0.170
nda+0.169
aic+0.167
Neg logit contributions
 Strait-0.190
atos-0.183
 JO-0.182
natureconservancy-0.179
perse-0.175
Tokenied
Token ID798
Feature activation+5.36e-01

Pos logit contributions
 Strait+1.049
 JO+0.988
atos+0.962
natureconservancy+0.912
perse+0.908
Neg logit contributions
hist-2.638
ham-2.595
endars-2.407
quite-2.404
nda-2.402
Token but
Token ID475
Feature activation+5.11e-01

Pos logit contributions
 Strait+0.370
 JO+0.357
atos+0.356
natureconservancy+0.347
perse+0.342
Neg logit contributions
hist-0.389
ham-0.381
quite-0.340
nda-0.337
aic-0.335
Token will
Token ID481
Feature activation+3.32e-01

Pos logit contributions
 Strait+0.328
atos+0.315
 JO+0.314
natureconservancy+0.307
perse+0.301
Neg logit contributions
hist-0.396
ham-0.389
quite-0.349
nda-0.348
aic-0.345
Token remain
Token ID3520
Feature activation+2.82e+00

Pos logit contributions
 Strait+0.206
 JO+0.197
atos+0.196
natureconservancy+0.191
perse+0.188
Neg logit contributions
hist-0.265
ham-0.260
quite-0.234
nda-0.233
aic-0.232
Token there
Token ID612
Feature activation-3.14e+00

Pos logit contributions
 Strait+1.809
 JO+1.734
atos+1.726
natureconservancy+1.667
perse+1.654
Neg logit contributions
hist-2.181
ham-2.133
quite-1.919
nda-1.914
endars-1.905
Token until
Token ID1566
Feature activation-1.89e+00

Pos logit contributions
hist+2.086
ham+2.057
quite+1.819
nda+1.786
aic+1.769
Neg logit contributions
 Strait-2.303
atos-2.273
natureconservancy-2.266
 JO-2.244
perse-2.193
Token the
Token ID262
Feature activation+6.70e-01

Pos logit contributions
hist+1.426
ham+1.404
quite+1.258
nda+1.250
laws+1.237
Neg logit contributions
 Strait-1.224
atos-1.194
natureconservancy-1.179
 JO-1.175
perse-1.148
Token end
Token ID886
Feature activation-3.47e+00

Pos logit contributions
hist+2.347
ham+2.319
quite+2.105
epad+2.024
endars+2.008
Neg logit contributions
 Strait-2.064
natureconservancy-1.929
atos-1.928
 JO-1.918
perse-1.876
Token until
Token ID1566
Feature activation-5.94e+00

Pos logit contributions
hist+2.498
ham+2.438
quite+2.181
nda+2.138
aic+2.135
Neg logit contributions
 Strait-2.409
 JO-2.311
atos-2.291
natureconservancy-2.258
perse-2.244
Token the
Token ID262
Feature activation-1.47e+00

Pos logit contributions
hist+4.269
ham+4.144
quite+3.755
nda+3.695
laws+3.659
Neg logit contributions
 Strait-3.996
atos-3.943
natureconservancy-3.909
 JO-3.831
perse-3.805
Token accumulated
Token ID22425
Feature activation-9.42e+00

Pos logit contributions
hist+1.119
ham+1.093
quite+0.979
nda+0.978
endars+0.968
Neg logit contributions
 Strait-0.947
atos-0.918
 JO-0.909
natureconservancy-0.895
perse-0.891
Token debt
Token ID5057
Feature activation+9.14e-01

Pos logit contributions
hist+6.317
ham+6.141
nda+5.458
aic+5.406
quite+5.347
Neg logit contributions
natureconservancy-6.385
 Strait-6.376
atos-6.333
perse-6.273
 JO-6.032
Token becomes
Token ID4329
Feature activation-3.24e+00

Pos logit contributions
 Strait+0.580
 JO+0.556
atos+0.554
natureconservancy+0.536
perse+0.531
Neg logit contributions
hist-0.715
ham-0.700
quite-0.631
nda-0.628
endars-0.624
Token unsustainable
Token ID39554
Feature activation-2.49e+00

Pos logit contributions
hist+2.462
ham+2.394
quite+2.168
nda+2.127
epad+2.117
Neg logit contributions
 Strait-2.139
atos-2.028
natureconservancy-2.004
 JO-1.995
perse-1.962
Token and
Token ID290
Feature activation-3.74e-01

Pos logit contributions
hist+1.785
ham+1.739
quite+1.553
nda+1.540
endars+1.523
Neg logit contributions
 Strait-1.744
atos-1.668
 JO-1.665
natureconservancy-1.634
perse-1.614
Token the
Token ID262
Feature activation+5.40e-03

Pos logit contributions
hist+0.286
ham+0.280
quite+0.251
nda+0.250
aic+0.248
Neg logit contributions
 Strait-0.244
atos-0.233
 JO-0.233
natureconservancy-0.227
perse-0.224
Token government
Token ID1230
Feature activation+4.86e-01

Pos logit contributions
 Strait+0.004
atos+0.003
 JO+0.003
natureconservancy+0.003
perse+0.003
Neg logit contributions
hist-0.004
ham-0.004
quite-0.004
nda-0.004
aic-0.004
Token
Token ID447
Feature activation-2.08e+00

Pos logit contributions
 Strait+0.302
 JO+0.289
atos+0.288
natureconservancy+0.279
perse+0.276
Neg logit contributions
hist-0.387
ham-0.379
quite-0.342
nda-0.340
endars-0.338
Token in
Token ID287
Feature activation+2.97e+00

Pos logit contributions
 Strait+0.149
 JO+0.143
atos+0.142
natureconservancy+0.137
perse+0.136
Neg logit contributions
hist-0.170
ham-0.166
quite-0.148
nda-0.148
endars-0.148
Token the
Token ID262
Feature activation-3.57e-01

Pos logit contributions
 Strait+1.954
 JO+1.866
atos+1.850
natureconservancy+1.782
perse+1.771
Neg logit contributions
hist-2.266
ham-2.207
nda-1.986
endars-1.985
quite-1.980
Token Ham
Token ID4345
Feature activation-1.86e+00

Pos logit contributions
hist+0.271
ham+0.266
quite+0.239
nda+0.237
aic+0.236
Neg logit contributions
 Strait-0.232
atos-0.223
 JO-0.223
natureconservancy-0.218
perse-0.216
Tokenpt
Token ID457
Feature activation+2.01e+00

Pos logit contributions
hist+1.362
ham+1.340
quite+1.203
nda+1.191
aic+1.181
Neg logit contributions
 Strait-1.264
 JO-1.217
atos-1.204
natureconservancy-1.194
perse-1.169
Tokenons
Token ID684
Feature activation-1.61e+00

Pos logit contributions
 Strait+1.486
 JO+1.436
atos+1.435
natureconservancy+1.399
perse+1.380
Neg logit contributions
hist-1.364
ham-1.337
quite-1.181
nda-1.171
aic-1.164
Token,
Token ID11
Feature activation-4.86e-01

Pos logit contributions
hist+1.220
ham+1.200
quite+1.077
nda+1.066
aic+1.063
Neg logit contributions
 Strait-1.050
 JO-1.006
atos-1.005
natureconservancy-0.980
perse-0.972
Token and
Token ID290
Feature activation+2.21e+00

Pos logit contributions
hist+0.357
ham+0.350
quite+0.313
nda+0.311
endars+0.309
Neg logit contributions
 Strait-0.330
atos-0.318
 JO-0.317
natureconservancy-0.309
perse-0.306
Token their
Token ID511
Feature activation+2.35e+00

Pos logit contributions
 Strait+1.560
 JO+1.498
atos+1.486
natureconservancy+1.441
perse+1.429
Neg logit contributions
hist-1.575
ham-1.536
quite-1.370
nda-1.365
aic-1.357
Token children
Token ID1751
Feature activation-5.77e+00

Pos logit contributions
 Strait+1.469
 JO+1.399
atos+1.386
natureconservancy+1.338
perse+1.313
Neg logit contributions
hist-1.872
ham-1.831
quite-1.649
nda-1.646
endars-1.638
Token attended
Token ID9141
Feature activation+4.33e+00

Pos logit contributions
ham+3.980
hist+3.958
quite+3.524
endars+3.430
nda+3.424
Neg logit contributions
 Strait-3.938
atos-3.883
natureconservancy-3.859
perse-3.787
 JO-3.770
Token the
Token ID262
Feature activation+7.99e-01

Pos logit contributions
 Strait+2.798
 JO+2.691
atos+2.628
natureconservancy+2.540
perse+2.535
Neg logit contributions
hist-3.330
ham-3.224
quite-2.921
endars-2.914
nda-2.911
Token
Token ID198
Feature activation-3.90e+00

Pos logit contributions
hist+1.920
ham+1.889
nda+1.687
quite+1.680
aic+1.671
Neg logit contributions
 Strait-1.693
atos-1.644
 JO-1.627
natureconservancy-1.602
perse-1.589
Token
Token ID198
Feature activation-3.64e+00

Pos logit contributions
hist+2.421
ham+2.371
quite+2.032
nda+1.996
aic+1.971
Neg logit contributions
 Strait-3.092
atos-3.018
 JO-2.970
natureconservancy-2.929
perse-2.919
TokenSelf
Token ID24704
Feature activation+3.29e+00

Pos logit contributions
hist+2.758
ham+2.704
quite+2.430
nda+2.393
aic+2.385
Neg logit contributions
 Strait-2.415
 JO-2.308
atos-2.298
natureconservancy-2.237
perse-2.189
Token described
Token ID3417
Feature activation+5.94e+00

Pos logit contributions
 Strait+2.259
 JO+2.157
atos+2.116
perse+2.031
natureconservancy+2.027
Neg logit contributions
hist-2.372
ham-2.331
quite-2.109
nda-2.109
endars-2.093
Token as
Token ID355
Feature activation+2.13e-01

Pos logit contributions
 Strait+3.426
 JO+3.240
atos+3.148
perse+3.038
natureconservancy+3.024
Neg logit contributions
hist-4.907
ham-4.802
endars-4.385
quite-4.355
nda-4.351
Token "
Token ID366
Feature activation-1.31e+00

Pos logit contributions
 Strait+0.142
 JO+0.136
atos+0.136
natureconservancy+0.132
perse+0.130
Neg logit contributions
hist-0.160
ham-0.157
quite-0.140
nda-0.140
endars-0.139
Tokenf
Token ID69
Feature activation-3.54e+00

Pos logit contributions
hist+0.873
ham+0.852
quite+0.756
nda+0.746
aic+0.741
Neg logit contributions
 Strait-0.984
 JO-0.948
atos-0.944
natureconservancy-0.919
perse-0.906
Tokenucked
Token ID17758
Feature activation+1.51e+00

Pos logit contributions
hist+2.302
ham+2.254
quite+1.988
nda+1.983
aic+1.953
Neg logit contributions
 Strait-2.705
 JO-2.625
atos-2.585
natureconservancy-2.527
perse-2.491
Token up
Token ID510
Feature activation+1.78e+00

Pos logit contributions
 Strait+0.987
 JO+0.947
atos+0.943
natureconservancy+0.913
perse+0.903
Neg logit contributions
hist-1.146
ham-1.121
quite-1.007
nda-1.004
endars-0.998
Token Rock
Token ID4631
Feature activation+4.82e+00

Pos logit contributions
 Strait+1.130
 JO+1.086
atos+1.078
natureconservancy+1.039
perse+1.029
Neg logit contributions
hist-1.384
ham-1.353
quite-1.219
nda-1.217
endars-1.212
Token and
Token ID290
Feature activation-1.13e-01

Pos logit contributions
 Strait+3.384
 JO+3.236
atos+3.227
natureconservancy+3.046
iao+3.027
Neg logit contributions
hist-3.423
ham-3.261
endars-3.002
quite-2.990
nda-2.987
Token
Token ID251
Feature activation-1.59e+00

Pos logit contributions
hist+0.664
ham+0.651
quite+0.591
nda+0.590
endars+0.584
Neg logit contributions
 Strait-0.489
atos-0.466
 JO-0.464
natureconservancy-0.450
perse-0.446
Token said
Token ID531
Feature activation+2.48e+00

Pos logit contributions
hist+1.182
ham+1.155
quite+1.037
nda+1.029
endars+1.020
Neg logit contributions
 Strait-1.076
atos-1.035
 JO-1.026
natureconservancy-1.010
perse-0.995
Token Tom
Token ID4186
Feature activation-3.23e+00

Pos logit contributions
 Strait+1.694
 JO+1.665
atos+1.606
natureconservancy+1.555
perse+1.545
Neg logit contributions
hist-1.800
ham-1.750
quite-1.587
endars-1.580
yip-1.565
Token F
Token ID376
Feature activation+2.65e+00

Pos logit contributions
hist+2.303
ham+2.285
nda+2.023
quite+2.003
endars+1.983
Neg logit contributions
 Strait-2.194
atos-2.154
natureconservancy-2.133
 JO-2.125
perse-2.107
Tokenitton
Token ID47304
Feature activation+5.24e+00

Pos logit contributions
 Strait+1.210
atos+1.149
 JO+1.138
perse+1.079
natureconservancy+1.070
Neg logit contributions
hist-2.506
ham-2.451
quite-2.280
nda-2.269
yip-2.246
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
 Strait+2.903
 JO+2.870
atos+2.815
perse+2.635
natureconservancy+2.587
Neg logit contributions
hist-4.330
ham-4.166
quite-3.918
endars-3.912
yip-3.905
Token president
Token ID1893
Feature activation+2.11e-01

Pos logit contributions
hist+1.292
ham+1.265
quite+1.127
nda+1.124
aic+1.114
Neg logit contributions
 Strait-1.202
atos-1.150
 JO-1.146
natureconservancy-1.122
perse-1.108
Token of
Token ID286
Feature activation-2.33e+00

Pos logit contributions
 Strait+0.131
 JO+0.126
atos+0.126
natureconservancy+0.121
perse+0.119
Neg logit contributions
hist-0.168
ham-0.164
quite-0.149
nda-0.148
endars-0.148
Token Judicial
Token ID31411
Feature activation-2.37e+00

Pos logit contributions
hist+1.288
ham+1.256
nda+1.066
quite+1.063
aic+1.057
Neg logit contributions
 Strait-2.000
atos-1.938
 JO-1.919
natureconservancy-1.901
perse-1.886
Token Watch
Token ID6305
Feature activation-1.59e+00

Pos logit contributions
hist+1.024
ham+0.997
nda+0.810
quite+0.798
aic+0.783
Neg logit contributions
 Strait-2.310
atos-2.267
 JO-2.238
perse-2.213
natureconservancy-2.206
Token.
Token ID13
Feature activation-1.77e+00

Pos logit contributions
hist+1.175
ham+1.154
quite+1.028
nda+1.027
aic+1.016
Neg logit contributions
 Strait-1.080
atos-1.034
 JO-1.031
natureconservancy-1.009
perse-0.999
Token happen
Token ID1645
Feature activation-3.14e+00

Pos logit contributions
 Strait+0.162
atos+0.156
 JO+0.155
natureconservancy+0.152
perse+0.150
Neg logit contributions
hist-0.153
ham-0.150
quite-0.133
nda-0.131
aic-0.130
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
hist+2.239
ham+2.204
quite+1.958
nda+1.933
endars+1.899
Neg logit contributions
 Strait-2.177
natureconservancy-2.097
atos-2.094
 JO-2.071
perse-2.028
Token
Token ID447
Feature activation+1.15e+00

Pos logit contributions
 Strait+0.208
atos+0.200
 JO+0.198
natureconservancy+0.195
perse+0.194
Neg logit contributions
hist-0.238
ham-0.233
quite-0.209
nda-0.207
aic-0.205
Token
Token ID251
Feature activation-3.86e-01

Pos logit contributions
 Strait+0.731
 JO+0.702
atos+0.698
natureconservancy+0.676
perse+0.664
Neg logit contributions
hist-0.894
ham-0.873
quite-0.786
aic-0.782
nda-0.781
Token Tar
Token ID14110
Feature activation+1.79e+00

Pos logit contributions
hist+0.278
ham+0.273
quite+0.242
nda+0.241
aic+0.239
Neg logit contributions
 Strait-0.267
atos-0.259
 JO-0.257
natureconservancy-0.253
perse-0.250
Tokenam
Token ID321
Feature activation-2.09e+00

Pos logit contributions
 Strait+1.893
atos+1.857
 JO+1.839
perse+1.805
natureconservancy+1.804
Neg logit contributions
hist-0.618
ham-0.575
yip-0.467
endars-0.466
quite-0.462
Tokenov
Token ID709
Feature activation-2.33e+00

Pos logit contributions
hist+1.397
ham+1.373
quite+1.206
nda+1.199
aic+1.191
Neg logit contributions
 Strait-1.563
 JO-1.512
atos-1.486
natureconservancy-1.473
perse-1.464
Token told
Token ID1297
Feature activation-2.01e+00

Pos logit contributions
hist+1.680
ham+1.663
quite+1.461
nda+1.461
endars+1.434
Neg logit contributions
 Strait-1.596
natureconservancy-1.541
atos-1.533
 JO-1.532
perse-1.493
Token CF
Token ID18551
Feature activation-3.85e+00

Pos logit contributions
hist+1.528
ham+1.499
quite+1.347
nda+1.341
aic+1.327
Neg logit contributions
 Strait-1.291
atos-1.252
 JO-1.239
natureconservancy-1.226
perse-1.204
Token13
Token ID1485
Feature activation-3.69e+00

Pos logit contributions
hist+3.019
ham+2.906
nda+2.671
quite+2.653
endars+2.589
Neg logit contributions
 Strait-2.425
atos-2.359
perse-2.304
 JO-2.304
natureconservancy-2.287
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
hist+2.660
ham+2.633
quite+2.325
nda+2.305
laws+2.276
Neg logit contributions
 Strait-2.489
natureconservancy-2.419
atos-2.409
 JO-2.394
perse-2.373
Token,
Token ID11
Feature activation+4.34e+00

Pos logit contributions
 Strait+0.496
 JO+0.474
atos+0.472
natureconservancy+0.457
perse+0.452
Neg logit contributions
hist-0.629
ham-0.616
quite-0.556
nda-0.554
endars-0.551
Token both
Token ID1111
Feature activation-1.32e+00

Pos logit contributions
 Strait+2.832
 JO+2.712
atos+2.645
perse+2.551
natureconservancy+2.540
Neg logit contributions
hist-3.310
ham-3.238
nda-2.920
yip-2.910
endars-2.898
Token camera
Token ID4676
Feature activation+3.26e+00

Pos logit contributions
hist+0.989
ham+0.969
quite+0.867
nda+0.864
aic+0.864
Neg logit contributions
 Strait-0.880
atos-0.848
 JO-0.840
natureconservancy-0.831
perse-0.815
Token power
Token ID1176
Feature activation+5.00e+00

Pos logit contributions
 Strait+2.044
 JO+1.966
atos+1.930
natureconservancy+1.860
perse+1.842
Neg logit contributions
hist-2.561
ham-2.512
quite-2.271
epad-2.257
yip-2.254
Token supplies
Token ID9416
Feature activation-2.82e+00

Pos logit contributions
 Strait+3.196
 JO+3.079
atos+3.035
natureconservancy+2.924
perse+2.882
Neg logit contributions
hist-3.822
ham-3.722
yip-3.407
quite-3.395
epad-3.360
Token ar
Token ID610
Feature activation-1.00e+01

Pos logit contributions
hist+1.852
ham+1.810
quite+1.593
aic+1.593
nda+1.572
Neg logit contributions
 Strait-2.120
atos-2.064
natureconservancy-2.035
 JO-2.034
perse-1.996
Tokenced
Token ID771
Feature activation-3.09e+00

Pos logit contributions
hist+5.713
ham+5.665
aic+5.017
quite+4.958
nda+4.940
Neg logit contributions
 Strait-7.792
atos-7.353
 JO-7.297
perse-7.230
natureconservancy-7.150
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
hist+2.362
ham+2.311
quite+2.099
aic+2.062
nda+2.048
Neg logit contributions
 Strait-1.968
atos-1.915
natureconservancy-1.896
 JO-1.893
perse-1.861
Token short
Token ID1790
Feature activation+7.08e+00

Pos logit contributions
 Strait+0.924
 JO+0.886
atos+0.881
natureconservancy+0.854
perse+0.846
Neg logit contributions
hist-1.049
ham-1.027
quite-0.921
nda-0.918
endars-0.912
Tokened
Token ID276
Feature activation+3.77e+00

Pos logit contributions
 Strait+4.578
 JO+4.385
atos+4.322
 transitions+4.121
perse+4.102
Neg logit contributions
hist-5.179
ham-5.046
nda-4.645
quite-4.628
yip-4.626
Token out
Token ID503
Feature activation+2.09e+00

Pos logit contributions
 Strait+2.274
 JO+2.177
atos+2.138
natureconservancy+2.038
perse+2.030
Neg logit contributions
hist-3.052
ham-2.994
nda-2.733
endars-2.709
quite-2.703
Token space
Token ID2272
Feature activation+7.50e+00

Pos logit contributions
 Strait+0.459
 JO+0.440
atos+0.438
natureconservancy+0.425
perse+0.420
Neg logit contributions
hist-0.554
ham-0.543
quite-0.489
nda-0.487
endars-0.484
Token)
Token ID8
Feature activation-3.45e-01

Pos logit contributions
 Strait+4.414
 JO+4.301
atos+4.245
natureconservancy+4.058
 transitions+4.023
Neg logit contributions
hist-5.862
ham-5.737
nda-5.306
endars-5.255
antine-5.254
Token {
Token ID1391
Feature activation-9.42e-02

Pos logit contributions
hist+0.248
ham+0.243
quite+0.216
nda+0.215
aic+0.214
Neg logit contributions
 Strait-0.240
atos-0.230
 JO-0.230
natureconservancy-0.224
perse-0.222
Tokenoptions
Token ID25811
Feature activation+1.80e+00

Pos logit contributions
hist+0.066
ham+0.065
quite+0.058
nda+0.057
endars+0.057
Neg logit contributions
 Strait-0.067
 JO-0.065
atos-0.064
natureconservancy-0.063
perse-0.062
Token}
Token ID92
Feature activation-6.38e+00

Pos logit contributions
 Strait+0.910
 JO+0.855
atos+0.851
natureconservancy+0.826
perse+0.815
Neg logit contributions
hist-1.624
ham-1.592
quite-1.470
endars-1.454
aic-1.451
Token a
Token ID257
Feature activation-9.08e+00

Pos logit contributions
hist+4.633
ham+4.502
aic+4.009
quite+3.978
nda+3.943
Neg logit contributions
 Strait-4.192
atos-4.119
 JO-4.073
natureconservancy-3.976
perse-3.892
Token list
Token ID1351
Feature activation-4.43e+00

Pos logit contributions
hist+4.083
ham+3.810
epad+3.057
nda+3.053
quite+2.945
Neg logit contributions
 Strait-8.332
atos-8.274
natureconservancy-8.021
perse-7.932
 JO-7.909
Token of
Token ID286
Feature activation+8.98e-01

Pos logit contributions
hist+2.978
ham+2.928
nda+2.546
quite+2.543
aic+2.540
Neg logit contributions
 Strait-3.216
atos-3.075
 JO-3.054
natureconservancy-3.011
perse-2.934
Token options
Token ID3689
Feature activation-6.08e-01

Pos logit contributions
 Strait+0.476
 JO+0.453
atos+0.449
natureconservancy+0.433
perse+0.428
Neg logit contributions
hist-0.795
ham-0.781
quite-0.714
nda-0.711
endars-0.708
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
hist+0.484
ham+0.474
quite+0.427
nda+0.426
aic+0.423
Neg logit contributions
 Strait-0.377
 JO-0.360
atos-0.359
natureconservancy-0.348
perse-0.344
Token separated
Token ID11266
Feature activation+1.96e+00

Pos logit contributions
 Strait+0.365
 JO+0.337
atos+0.335
natureconservancy+0.315
perse+0.309
Neg logit contributions
hist-1.117
ham-1.100
quite-1.021
nda-1.018
endars-1.013
Token 33
Token ID4747
Feature activation-4.89e+00

Pos logit contributions
hist+2.710
ham+2.680
quite+2.379
nda+2.360
laws+2.322
Neg logit contributions
 Strait-2.504
atos-2.479
natureconservancy-2.440
 JO-2.390
perse-2.376
Token-
Token ID12
Feature activation-1.39e+00

Pos logit contributions
hist+3.131
ham+3.128
nda+2.724
quite+2.658
laws+2.623
Neg logit contributions
 Strait-3.731
atos-3.613
natureconservancy-3.561
 JO-3.549
perse-3.545
Tokenyear
Token ID1941
Feature activation-6.24e-01

Pos logit contributions
hist+1.111
ham+1.089
quite+0.979
nda+0.970
aic+0.963
Neg logit contributions
 Strait-0.868
 JO-0.829
atos-0.826
natureconservancy-0.802
perse-0.792
Token-
Token ID12
Feature activation-6.87e-01

Pos logit contributions
hist+0.426
ham+0.415
nda+0.365
quite+0.364
aic+0.358
Neg logit contributions
 Strait-0.458
atos-0.444
 JO-0.442
natureconservancy-0.433
perse-0.431
Tokenold
Token ID727
Feature activation-3.66e+00

Pos logit contributions
hist+0.297
ham+0.285
quite+0.232
nda+0.230
aic+0.226
Neg logit contributions
 Strait-0.679
 JO-0.661
atos-0.659
natureconservancy-0.647
perse-0.642
Token Z
Token ID1168
Feature activation-1.27e+01

Pos logit contributions
hist+2.678
ham+2.656
nda+2.333
quite+2.331
aic+2.294
Neg logit contributions
 Strait-2.425
atos-2.386
natureconservancy-2.357
 JO-2.309
perse-2.305
Tokenyang
Token ID17859
Feature activation-1.76e+00

Pos logit contributions
ham+8.749
hist+8.690
aic+7.555
hi+7.546
ulet+7.493
Neg logit contributions
 Strait-8.528
natureconservancy-8.298
 JO-8.119
龍契士-7.951
 transitions-7.776
Token V
Token ID569
Feature activation-9.84e-01

Pos logit contributions
hist+1.317
ham+1.296
nda+1.153
quite+1.150
aic+1.136
Neg logit contributions
 Strait-1.167
 JO-1.123
atos-1.118
natureconservancy-1.108
perse-1.083
Tokenang
Token ID648
Feature activation+2.05e+00

Pos logit contributions
hist+0.579
ham+0.564
quite+0.486
nda+0.485
aic+0.482
Neg logit contributions
 Strait-0.816
 JO-0.790
atos-0.784
natureconservancy-0.772
perse-0.763
Token,
Token ID11
Feature activation+4.12e-02

Pos logit contributions
 Strait+1.419
 JO+1.364
atos+1.363
natureconservancy+1.307
perse+1.302
Neg logit contributions
hist-1.475
ham-1.435
quite-1.292
endars-1.282
nda-1.279
Token then
Token ID788
Feature activation-3.12e-01

Pos logit contributions
 Strait+0.026
atos+0.025
 JO+0.025
natureconservancy+0.025
perse+0.024
Neg logit contributions
hist-0.032
ham-0.032
quite-0.028
nda-0.028
aic-0.028
Token the
Token ID262
Feature activation-5.77e-01

Pos logit contributions
hist+0.391
ham+0.383
quite+0.341
nda+0.340
endars+0.336
Neg logit contributions
 Strait-0.389
atos-0.376
 JO-0.374
natureconservancy-0.365
perse-0.360
Token GOP
Token ID6796
Feature activation-1.73e+00

Pos logit contributions
hist+0.429
ham+0.420
quite+0.376
nda+0.374
aic+0.372
Neg logit contributions
 Strait-0.384
atos-0.371
 JO-0.369
natureconservancy-0.364
perse-0.360
Token took
Token ID1718
Feature activation-2.59e+00

Pos logit contributions
hist+1.223
ham+1.210
nda+1.061
quite+1.056
endars+1.042
Neg logit contributions
 Strait-1.207
atos-1.170
 JO-1.161
natureconservancy-1.155
perse-1.138
Token control
Token ID1630
Feature activation-1.05e+01

Pos logit contributions
hist+1.845
ham+1.815
quite+1.618
nda+1.606
aic+1.591
Neg logit contributions
 Strait-1.811
atos-1.749
 JO-1.743
natureconservancy-1.710
perse-1.690
Token of
Token ID286
Feature activation-9.32e+00

Pos logit contributions
ham+6.459
hist+6.400
quite+5.731
nda+5.539
laws+5.451
Neg logit contributions
 Strait-7.732
atos-7.455
natureconservancy-7.399
perse-7.299
 JO-7.270
Token the
Token ID262
Feature activation-8.83e+00

Pos logit contributions
hist+6.087
ham+6.056
quite+5.374
nda+5.311
aic+5.247
Neg logit contributions
 Strait-6.473
atos-6.360
natureconservancy-6.280
perse-6.150
 JO-6.096
Token upper
Token ID6727
Feature activation-7.99e+00

Pos logit contributions
hist+5.996
ham+5.909
quite+5.204
nda+5.194
laws+5.166
Neg logit contributions
natureconservancy-5.923
atos-5.848
perse-5.843
 Strait-5.836
reddits-5.619
Token chamber
Token ID11847
Feature activation-5.90e+00

Pos logit contributions
hist+2.202
ham+2.070
nda+1.675
endars+1.467
aic+1.447
Neg logit contributions
atos-8.785
 Strait-8.760
natureconservancy-8.714
perse-8.632
 JO-8.469
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
hist+4.008
ham+3.953
quite+3.538
nda+3.487
aic+3.434
Neg logit contributions
 Strait-4.166
atos-4.063
natureconservancy-4.028
 JO-3.962
perse-3.933
Token Friends
Token ID14213
Feature activation-4.91e+00

Pos logit contributions
hist+0.598
ham+0.585
quite+0.520
nda+0.519
aic+0.514
Neg logit contributions
 Strait-0.563
atos-0.546
 JO-0.538
natureconservancy-0.530
perse-0.526
Token and
Token ID290
Feature activation-3.54e+00

Pos logit contributions
hist+3.439
ham+3.393
quite+3.005
nda+2.997
aic+2.968
Neg logit contributions
 Strait-3.439
atos-3.272
 JO-3.271
natureconservancy-3.265
perse-3.202
Token authorities
Token ID4773
Feature activation-2.13e+00

Pos logit contributions
hist+1.928
ham+1.905
quite+1.706
nda+1.689
aic+1.681
Neg logit contributions
 Strait-1.576
atos-1.546
 JO-1.517
natureconservancy-1.503
perse-1.475
Token said
Token ID531
Feature activation-4.18e-01

Pos logit contributions
hist+1.579
ham+1.549
quite+1.387
nda+1.378
aic+1.369
Neg logit contributions
 Strait-1.433
atos-1.379
 JO-1.375
natureconservancy-1.347
perse-1.325
Token.
Token ID13
Feature activation-3.48e+00

Pos logit contributions
hist+0.307
ham+0.300
quite+0.269
nda+0.267
endars+0.266
Neg logit contributions
 Strait-0.286
 JO-0.274
atos-0.273
natureconservancy-0.265
perse-0.262
Token
Token ID198
Feature activation-1.02e+00

Pos logit contributions
hist+2.319
ham+2.286
quite+1.991
nda+1.977
aic+1.971
Neg logit contributions
 Strait-2.565
atos-2.512
 JO-2.463
natureconservancy-2.426
perse-2.414
Token
Token ID198
Feature activation-4.35e+00

Pos logit contributions
hist+0.681
ham+0.666
quite+0.584
nda+0.579
aic+0.574
Neg logit contributions
 Strait-0.768
atos-0.743
 JO-0.739
natureconservancy-0.721
perse-0.717
TokenL
Token ID43
Feature activation-9.39e+00

Pos logit contributions
hist+3.189
ham+3.154
quite+2.793
nda+2.754
aic+2.724
Neg logit contributions
 Strait-3.004
atos-2.850
 JO-2.839
natureconservancy-2.747
perse-2.733
Tokeninda
Token ID22261
Feature activation-4.03e+00

Pos logit contributions
ham+5.918
hist+5.881
nda+5.112
aic+5.043
laws+4.983
Neg logit contributions
 Strait-7.069
 JO-6.710
natureconservancy-6.699
perse-6.517
龍契士-6.507
Token Jackson
Token ID6612
Feature activation+6.89e+00

Pos logit contributions
hist+2.952
ham+2.934
nda+2.607
quite+2.582
aic+2.545
Neg logit contributions
 Strait-2.640
atos-2.587
natureconservancy-2.581
 JO-2.555
perse-2.527
Token,
Token ID11
Feature activation-2.73e+00

Pos logit contributions
 Strait+4.434
 JO+4.215
atos+4.170
iao+3.943
perse+3.881
Neg logit contributions
hist-5.270
ham-4.946
yip-4.694
quite-4.600
endars-4.589
Token 59
Token ID7863
Feature activation+1.41e+00

Pos logit contributions
hist+1.638
ham+1.613
quite+1.403
nda+1.377
aic+1.361
Neg logit contributions
 Strait-2.191
atos-2.149
 JO-2.100
natureconservancy-2.100
perse-2.078
Token,
Token ID11
Feature activation-3.67e+00

Pos logit contributions
 Strait+0.856
 JO+0.819
atos+0.815
natureconservancy+0.789
perse+0.779
Neg logit contributions
hist-1.142
ham-1.118
quite-1.012
nda-1.007
endars-1.002