TOP ACTIVATIONS
MAX = 3.794

ny has been the least lucky : he was attacked with
haven ’ t been lucky enough to get your hands
the prime challenge Belfast Te legraph . co . uk Dis
Newsletter Sign Up Continue reading the main story Please verify
The Brazil ians were lucky to be put into group
. Advertisement Continue reading the main story
reading the main story Please verify you 're not a robot
is in with all the olig archs , ” Letter
reading the main story Please verify you 're not a robot
time came to actually start grading . Parents who had been
and you ’ re lucky if you get away with
reading the main story Please verify you 're not a robot
who suffer from overly - pres cript ive and overs impl
creating natural separation from the guessing cornerback . While
Newsletter Sign Up Continue reading the main story Please verify
. Advertisement Continue reading the main story
heavy . <|endoftext|> Fair play ? To the Premier
youtube . com / watch ? v = 2 g 7
vit icultural viability , the math becomes basic when it comes
: Mike M anners - favorite favorite favorite favorite favorite -

MIN ACTIVATIONS
MIN = -3.338

Sanders denied there was any need to repair ties between Trump
, makes cheese , cures bacon , grind s sausage ,
are conflicting reports as to fatalities ; Fox News has reported
You know how Batman is Bruce Wayne ? And that he
health of the economy , unemployment and wages will have an
cable traps and an animal carc ass as bait , they
who say there is a need for political compromise in Spain
to call it a terrorist attack , for example , Trump
the 3 . 2 percent unemployment rate is forcing companies to
create a custom table for myself . I decided to use
10 lakh in road accidents fatalities . The Bill
to reduce the accidents and fatalities by 50 % in five
up Sam ’ s Burger Joint , the live music
always bury them in a bur rito or peanut butter sandwich
9 , 000 extra cancer deaths from Chern obyl , but
book , M urd ers on Al cat raz
significant reduction in greenhouse gas emissions . While cleaner at the
still space , and a need , for recognition of slavery
CALL WAS S ENT OUT NEED 100 % ACC OUN TA
and that there was no need to make special arrangements for

INTERVAL 2.011 - 3.794
CONTAINS 0.155%

of the investigations . <|endoftext|> It was supposed to be a
training ground . I was lucky . I was in a
all the stops and really digging into your bag of tricks
283 185 307 18 ; float time = ( sin (_
it could have been a lucky guess . I

INTERVAL 0.228 - 2.011
CONTAINS 34.413%

remembered Russell and Jones speaking to the prisoners in the yard
she talked to as many as 100 people on Tinder .
from the petition ers on whose plea the high court had
ode well . Rep orted ly , Pages and
than when before the ball was snapped . It

INTERVAL -1.555 - 0.228
CONTAINS 64.523%

s reading and writing skills . Now adays toy makers are
just really is that popular , and I ’ ve
. Shell is planning to drill in Alaska this summer and
Russia , has announced that it ’ s stopping work
discovered that his father was on the last train to the

INTERVAL -3.338 - -1.555
CONTAINS 0.909%

Korean albeit in a particularly gore filled way .
Special frame for pop / pop / pop / ret |
shot showed a single dumb bell and weight lifting plates stre
amounts of marijuana for personal use . " We
people in this room are tired of waiting . But it
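
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The source does not state how the bins were built, so the following is only a minimal sketch under that assumption; the function name `equal_width_edges` and the variable names are hypothetical. It recomputes the bin edges from the reported MIN and MAX and checks that the four CONTAINS shares add up to 100%.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 edges of equal-width bins on [lo, hi], rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# Values copied from the MIN / MAX lines of the dump.
MIN_ACT = -3.338
MAX_ACT = 3.794

# Assumption: four equal-width bins (not stated in the source).
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# CONTAINS percentages, listed from the lowest interval to the highest.
shares = [0.909, 64.523, 34.413, 0.155]
total_share = round(sum(shares), 3)

print(edges)        # [-3.338, -1.555, 0.228, 2.011, 3.794]
print(total_share)  # 100.0
```

The recomputed edges match the listed interval boundaries, which is consistent with equal-width binning but does not prove that the dashboard was built this way.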
Tokenny
Token ID3281
Feature activation-4.13e-01

Pos logit contributions
adder+3.244
adian+3.192
ortium+3.001
etting+2.981
monary+2.866
Neg logit contributions
 Bubble-2.188
-2.158
enough-1.991
spect-1.986
natureconservancy-1.974
Token has
Token ID468
Feature activation-1.36e-01

Pos logit contributions
adder+1.039
adian+1.016
ortium+0.980
etting+0.948
monary+0.905
Neg logit contributions
 Bubble-1.258
spect-1.223
enough-1.198
-1.180
natureconservancy-1.165
Token been
Token ID587
Feature activation+2.83e-01

Pos logit contributions
adder+0.375
adian+0.369
ortium+0.359
etting+0.345
monary+0.331
Neg logit contributions
 Bubble-0.382
spect-0.368
enough-0.359
-0.357
rollers-0.350
Token the
Token ID262
Feature activation-3.08e-01

Pos logit contributions
 Bubble+0.901
spect+0.874
enough+0.855
+0.846
natureconservancy+0.830
Neg logit contributions
adder-0.680
adian-0.663
ortium-0.642
etting-0.622
monary-0.584
Token least
Token ID1551
Feature activation+8.89e-01

Pos logit contributions
adder+0.696
adian+0.683
ortium+0.664
etting+0.641
monary+0.599
Neg logit contributions
 Bubble-1.010
spect-0.985
enough-0.969
-0.964
natureconservancy-0.946
Token lucky
Token ID9670
Feature activation+3.79e+00

Pos logit contributions
 Bubble+2.459
spect+2.374
enough+2.323
+2.280
natureconservancy+2.215
Neg logit contributions
adder-2.515
adian-2.474
ortium-2.396
etting-2.327
monary-2.225
Token:
Token ID25
Feature activation+1.11e+00

Pos logit contributions
 Bubble+9.587
spect+9.381
enough+8.930
 wisely+8.506
ours+8.264
Neg logit contributions
adder-10.030
ortium-9.925
adian-9.555
etting-9.262
monary-9.057
Token he
Token ID339
Feature activation-1.09e-01

Pos logit contributions
 Bubble+2.925
spect+2.812
enough+2.743
+2.670
BUS+2.612
Neg logit contributions
adder-3.284
adian-3.199
ortium-3.149
etting-3.039
monary-2.922
Token was
Token ID373
Feature activation-2.57e-01

Pos logit contributions
adder+0.289
adian+0.283
ortium+0.273
etting+0.263
monary+0.252
Neg logit contributions
 Bubble-0.323
spect-0.312
enough-0.305
-0.301
natureconservancy-0.295
Token attacked
Token ID7384
Feature activation-1.22e+00

Pos logit contributions
adder+0.649
adian+0.637
ortium+0.610
etting+0.599
monary+0.566
Neg logit contributions
 Bubble-0.774
spect-0.748
enough-0.734
-0.731
natureconservancy-0.722
Token with
Token ID351
Feature activation-4.16e-01

Pos logit contributions
adder+2.868
adian+2.741
etting+2.626
ortium+2.611
monary+2.437
Neg logit contributions
 Bubble-3.825
spect-3.716
natureconservancy-3.643
enough-3.634
-3.617
Token haven
Token ID4398
Feature activation-2.91e-01

Pos logit contributions
adder+1.108
adian+1.086
ortium+1.036
etting+1.007
monary+0.954
Neg logit contributions
 Bubble-1.307
spect-1.284
-1.254
enough-1.250
natureconservancy-1.223
Token
Token ID447
Feature activation+6.87e-01

Pos logit contributions
adder+0.684
adian+0.665
ortium+0.624
etting+0.614
monary+0.575
Neg logit contributions
 Bubble-0.936
-0.895
spect-0.891
enough-0.872
 wisely-0.850
Token
Token ID247
Feature activation-1.14e-01

Pos logit contributions
 Bubble+2.494
spect+2.429
enough+2.384
+2.354
BUS+2.308
Neg logit contributions
adder-1.348
adian-1.303
ortium-1.261
etting-1.194
monary-1.125
Tokent
Token ID83
Feature activation+9.26e-01

Pos logit contributions
adder+0.305
adian+0.299
ortium+0.287
etting+0.282
monary+0.267
Neg logit contributions
 Bubble-0.333
spect-0.313
enough-0.309
-0.308
BUS-0.304
Token been
Token ID587
Feature activation+7.50e-01

Pos logit contributions
 Bubble+2.661
spect+2.573
enough+2.512
+2.478
BUS+2.419
Neg logit contributions
adder-2.515
adian-2.457
ortium-2.400
etting-2.310
monary-2.212
Token lucky
Token ID9670
Feature activation+3.75e+00

Pos logit contributions
 Bubble+2.350
spect+2.279
enough+2.233
+2.200
BUS+2.153
Neg logit contributions
adder-1.847
adian-1.795
ortium-1.745
etting-1.681
monary-1.597
Token enough
Token ID1576
Feature activation+1.43e+00

Pos logit contributions
enough+5.881
 Bubble+5.788
spect+5.649
 wisely+5.014
+4.879
Neg logit contributions
adder-13.573
ortium-13.268
adian-13.131
monary-12.698
etting-12.402
Token to
Token ID284
Feature activation+1.44e+00

Pos logit contributions
 Bubble+3.648
spect+3.577
enough+3.557
+3.427
rollers+3.320
Neg logit contributions
adder-4.134
ortium-3.986
adian-3.982
etting-3.832
monary-3.780
Token get
Token ID651
Feature activation+7.79e-01

Pos logit contributions
 Bubble+4.071
spect+3.885
enough+3.815
+3.702
 wisely+3.644
Neg logit contributions
adder-3.934
ortium-3.802
adian-3.777
etting-3.625
monary-3.481
Token your
Token ID534
Feature activation-4.27e-01

Pos logit contributions
 Bubble+2.030
spect+1.960
enough+1.908
+1.878
BUS+1.828
Neg logit contributions
adder-2.322
adian-2.273
ortium-2.221
etting-2.147
monary-2.065
Token hands
Token ID2832
Feature activation+1.43e+00

Pos logit contributions
adder+1.175
adian+1.168
ortium+1.111
etting+1.080
monary+1.019
Neg logit contributions
 Bubble-1.167
spect-1.157
-1.130
enough-1.121
BUS-1.094
Token the
Token ID262
Feature activation-2.91e-01

Pos logit contributions
 Bubble+1.880
spect+1.821
enough+1.784
+1.768
BUS+1.714
Neg logit contributions
adder-1.904
adian-1.854
ortium-1.805
etting-1.767
monary-1.665
Token prime
Token ID6994
Feature activation+1.50e-01

Pos logit contributions
adder+0.774
adian+0.766
ortium+0.733
etting+0.731
monary+0.676
Neg logit contributions
 Bubble-0.811
spect-0.802
enough-0.794
-0.783
natureconservancy-0.767
Token challenge
Token ID4427
Feature activation-6.38e-01

Pos logit contributions
 Bubble+0.471
spect+0.458
enough+0.449
+0.444
natureconservancy+0.433
Neg logit contributions
adder-0.363
adian-0.352
ortium-0.340
etting-0.332
monary-0.314
Token Belfast
Token ID29713
Feature activation+1.92e+00

Pos logit contributions
adder+1.536
adian+1.511
etting+1.445
ortium+1.441
monary+1.308
Neg logit contributions
 Bubble-1.950
spect-1.920
-1.911
enough-1.900
natureconservancy-1.834
TokenTe
Token ID6767
Feature activation+2.47e+00

Pos logit contributions
 Bubble+3.185
spect+2.888
enough+2.694
BUS+2.647
 wisely+2.623
Neg logit contributions
adder-7.353
ortium-7.348
adian-7.263
etting-7.059
monary-6.875
Tokenlegraph
Token ID16606
Feature activation+3.42e+00

Pos logit contributions
 Bubble+3.639
spect+3.337
enough+3.212
ours+2.928
rollers+2.872
Neg logit contributions
adder-9.972
adian-9.758
ortium-9.746
etting-9.558
monary-9.366
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 Bubble+6.804
spect+6.118
BUS+5.724
enough+5.619
+5.443
Neg logit contributions
ortium-12.039
adder-11.872
adian-11.725
monary-11.233
etting-11.065
Tokenco
Token ID1073
Feature activation+1.45e+00

Pos logit contributions
 Bubble+2.007
spect+1.897
enough+1.841
+1.748
BUS+1.713
Neg logit contributions
adder-3.704
adian-3.634
ortium-3.590
etting-3.481
monary-3.372
Token.
Token ID13
Feature activation+6.20e-01

Pos logit contributions
 Bubble+3.456
spect+3.370
enough+3.242
rollers+3.076
BUS+3.070
Neg logit contributions
adder-4.576
adian-4.490
ortium-4.463
etting-4.239
monary-4.167
Tokenuk
Token ID2724
Feature activation-6.44e-02

Pos logit contributions
 Bubble+1.729
spect+1.668
enough+1.629
+1.605
BUS+1.562
Neg logit contributions
adder-1.742
adian-1.699
ortium-1.661
etting-1.608
monary-1.541
Token Dis
Token ID3167
Feature activation+5.01e-01

Pos logit contributions
adder+0.194
adian+0.188
ortium+0.183
etting+0.181
monary+0.170
Neg logit contributions
 Bubble-0.161
spect-0.160
enough-0.155
-0.154
natureconservancy-0.150
Token
Token ID198
Feature activation+5.12e-01

Pos logit contributions
 Bubble+1.038
spect+0.996
+0.980
enough+0.974
BUS+0.931
Neg logit contributions
adder-1.152
adian-1.114
ortium-1.082
etting-1.060
monary-1.012
TokenNewsletter
Token ID33031
Feature activation+9.28e-01

Pos logit contributions
 Bubble+1.474
spect+1.417
enough+1.383
+1.368
rollers+1.326
Neg logit contributions
adder-1.396
adian-1.357
ortium-1.320
etting-1.281
monary-1.222
Token Sign
Token ID5865
Feature activation+2.95e-01

Pos logit contributions
 Bubble+2.441
spect+2.346
enough+2.268
+2.215
BUS+2.162
Neg logit contributions
adder-2.754
adian-2.685
ortium-2.652
etting-2.541
monary-2.463
Token Up
Token ID3205
Feature activation+1.40e+00

Pos logit contributions
 Bubble+0.745
spect+0.733
enough+0.725
+0.721
BUS+0.695
Neg logit contributions
adder-0.883
adian-0.850
ortium-0.839
etting-0.829
monary-0.781
Token Continue
Token ID10054
Feature activation+5.86e-01

Pos logit contributions
 Bubble+3.412
spect+3.244
enough+3.089
+3.018
rollers+2.943
Neg logit contributions
adder-4.485
adian-4.370
ortium-4.331
etting-4.090
monary-4.074
Token reading
Token ID3555
Feature activation+3.19e+00

Pos logit contributions
 Bubble+1.558
spect+1.503
enough+1.464
+1.432
BUS+1.398
Neg logit contributions
adder-1.718
adian-1.678
ortium-1.650
etting-1.587
monary-1.529
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
 Bubble+6.177
spect+5.845
enough+5.483
BUS+5.141
natureconservancy+5.112
Neg logit contributions
adian-10.945
adder-10.935
ortium-10.845
monary-10.329
etting-10.289
Token main
Token ID1388
Feature activation+8.40e-01

Pos logit contributions
 Bubble+4.341
spect+4.102
enough+3.982
+3.777
BUS+3.719
Neg logit contributions
adder-5.725
ortium-5.618
adian-5.613
etting-5.332
monary-5.220
Token story
Token ID1621
Feature activation+2.58e+00

Pos logit contributions
 Bubble+2.298
spect+2.210
enough+2.157
+2.098
BUS+2.063
Neg logit contributions
adder-2.404
adian-2.345
ortium-2.312
etting-2.214
monary-2.136
Token Please
Token ID4222
Feature activation+1.25e+00

Pos logit contributions
 Bubble+6.554
spect+6.049
 wisely+5.645
enough+5.633
+5.447
Neg logit contributions
adder-7.970
adian-7.839
ortium-7.692
etting-7.297
monary-7.250
Token verify
Token ID11767
Feature activation+2.97e+00

Pos logit contributions
 Bubble+3.198
spect+3.089
enough+2.969
natureconservancy+2.846
BUS+2.815
Neg logit contributions
adder-3.773
ortium-3.724
adian-3.670
etting-3.436
monary-3.415
Token
Token ID198
Feature activation+2.02e-01

Pos logit contributions
 Bubble+0.190
+0.184
spect+0.183
enough+0.179
▬▬+0.172
Neg logit contributions
adder-0.194
adian-0.186
ortium-0.179
etting-0.177
monary-0.167
TokenThe
Token ID464
Feature activation-1.01e-02

Pos logit contributions
 Bubble+0.540
spect+0.513
enough+0.501
+0.496
natureconservancy+0.485
Neg logit contributions
adder-0.597
adian-0.578
ortium-0.563
etting-0.547
monary-0.523
Token Brazil
Token ID7595
Feature activation+9.18e-01

Pos logit contributions
adder+0.028
adian+0.027
ortium+0.026
etting+0.025
monary+0.024
Neg logit contributions
 Bubble-0.029
spect-0.028
enough-0.027
-0.027
natureconservancy-0.026
Tokenians
Token ID1547
Feature activation+8.74e-01

Pos logit contributions
 Bubble+1.868
spect+1.779
enough+1.720
+1.676
BUS+1.618
Neg logit contributions
adder-3.265
adian-3.199
ortium-3.151
etting-3.065
monary-2.971
Token were
Token ID547
Feature activation+2.78e-01

Pos logit contributions
 Bubble+2.550
spect+2.462
enough+2.404
+2.355
BUS+2.307
Neg logit contributions
adder-2.324
adian-2.262
ortium-2.217
etting-2.134
monary-2.049
Token lucky
Token ID9670
Feature activation+3.19e+00

Pos logit contributions
 Bubble+0.749
spect+0.721
enough+0.711
+0.701
natureconservancy+0.688
Neg logit contributions
adder-0.803
adian-0.777
ortium-0.755
etting-0.731
monary-0.699
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
 Bubble+7.431
spect+7.182
enough+7.126
 wisely+6.632
rollers+6.421
Neg logit contributions
adder-9.501
adian-9.357
ortium-9.218
etting-8.872
monary-8.781
Token be
Token ID307
Feature activation+8.45e-01

Pos logit contributions
 Bubble+4.297
spect+4.062
enough+3.999
 wisely+3.890
+3.833
Neg logit contributions
adder-4.134
ortium-4.073
adian-4.015
etting-3.895
monary-3.722
Token put
Token ID1234
Feature activation+7.20e-01

Pos logit contributions
 Bubble+2.347
spect+2.265
enough+2.212
+2.170
BUS+2.116
Neg logit contributions
adder-2.373
adian-2.316
ortium-2.271
etting-2.191
monary-2.104
Token into
Token ID656
Feature activation+3.49e-01

Pos logit contributions
 Bubble+2.007
spect+1.937
enough+1.893
+1.862
BUS+1.815
Neg logit contributions
adder-2.023
adian-1.972
ortium-1.928
etting-1.860
monary-1.787
Token group
Token ID1448
Feature activation+7.70e-01

Pos logit contributions
 Bubble+0.946
spect+0.915
enough+0.896
+0.880
BUS+0.858
Neg logit contributions
adder-1.008
adian-0.982
ortium-0.959
etting-0.925
monary-0.887
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
adder+0.170
adian+0.166
ortium+0.161
etting+0.157
monary+0.149
Neg logit contributions
 Bubble-0.171
spect-0.164
enough-0.161
-0.158
BUS-0.155
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
adder+0.370
adian+0.359
ortium+0.349
etting+0.340
monary+0.323
Neg logit contributions
 Bubble-0.325
spect-0.319
enough-0.314
-0.307
natureconservancy-0.302
Token
Token ID198
Feature activation+9.90e-02

Pos logit contributions
adder+0.796
adian+0.752
etting+0.721
ortium+0.720
monary+0.670
Neg logit contributions
 Bubble-0.850
-0.838
spect-0.812
enough-0.803
▬▬-0.780
TokenAdvertisement
Token ID4723
Feature activation+1.66e+00

Pos logit contributions
 Bubble+0.256
spect+0.242
enough+0.236
+0.234
natureconservancy+0.227
Neg logit contributions
adder-0.300
adian-0.290
ortium-0.280
etting-0.277
monary-0.264
Token Continue
Token ID10054
Feature activation+7.17e-01

Pos logit contributions
 Bubble+3.816
spect+3.661
enough+3.527
rollers+3.351
ours+3.347
Neg logit contributions
adder-5.332
adian-5.223
ortium-5.189
etting-4.949
monary-4.902
Token reading
Token ID3555
Feature activation+3.15e+00

Pos logit contributions
 Bubble+1.933
spect+1.863
enough+1.819
+1.787
BUS+1.739
Neg logit contributions
adder-2.078
adian-2.026
ortium-1.983
etting-1.919
monary-1.844
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
 Bubble+6.320
spect+5.994
enough+5.618
BUS+5.354
natureconservancy+5.319
Neg logit contributions
adder-10.759
adian-10.720
ortium-10.628
monary-10.100
etting-10.048
Token main
Token ID1388
Feature activation+2.68e-01

Pos logit contributions
 Bubble+3.877
spect+3.738
enough+3.662
+3.443
BUS+3.423
Neg logit contributions
adder-4.949
adian-4.847
ortium-4.844
etting-4.582
monary-4.482
Token story
Token ID1621
Feature activation+2.02e+00

Pos logit contributions
 Bubble+0.780
spect+0.754
enough+0.739
+0.732
BUS+0.710
Neg logit contributions
adder-0.718
adian-0.699
ortium-0.680
etting-0.659
monary-0.631
Token
Token ID198
Feature activation-1.55e-01

Pos logit contributions
 Bubble+4.567
spect+4.341
enough+4.166
BUS+3.954
rollers+3.915
Neg logit contributions
adder-6.712
adian-6.643
ortium-6.623
etting-6.299
monary-6.148
Token
Token ID198
Feature activation+6.94e-02

Pos logit contributions
adder+0.422
adian+0.400
ortium+0.384
etting+0.383
monary+0.358
Neg logit contributions
 Bubble-0.437
-0.429
spect-0.418
enough-0.412
▬▬-0.400
Token reading
Token ID3555
Feature activation+2.81e+00

Pos logit contributions
 Bubble+2.130
spect+2.054
enough+1.996
+1.942
BUS+1.904
Neg logit contributions
adder-2.409
adian-2.356
ortium-2.327
etting-2.227
monary-2.151
Token the
Token ID262
Feature activation+2.05e+00

Pos logit contributions
 Bubble+5.855
spect+5.546
enough+5.188
BUS+4.990
natureconservancy+4.933
Neg logit contributions
adder-9.538
adian-9.525
ortium-9.468
etting-8.960
monary-8.931
Token main
Token ID1388
Feature activation+1.18e+00

Pos logit contributions
 Bubble+4.745
spect+4.519
enough+4.294
BUS+4.073
 wisely+4.065
Neg logit contributions
adder-6.570
ortium-6.558
adian-6.493
etting-6.167
monary-6.045
Token story
Token ID1621
Feature activation+2.73e+00

Pos logit contributions
 Bubble+3.161
spect+3.035
enough+2.950
+2.843
BUS+2.825
Neg logit contributions
adder-3.428
adian-3.359
ortium-3.321
etting-3.164
monary-3.061
Token Please
Token ID4222
Feature activation+1.35e+00

Pos logit contributions
 Bubble+6.871
spect+6.382
 wisely+5.960
enough+5.939
BUS+5.698
Neg logit contributions
adder-8.386
adian-8.295
ortium-8.158
etting-7.748
monary-7.634
Token verify
Token ID11767
Feature activation+3.11e+00

Pos logit contributions
 Bubble+3.396
spect+3.279
enough+3.140
natureconservancy+3.005
BUS+2.988
Neg logit contributions
adder-4.092
ortium-4.049
adian-3.985
etting-3.732
monary-3.715
Token you
Token ID345
Feature activation+1.91e+00

Pos logit contributions
 Bubble+6.029
spect+5.808
enough+5.270
 wisely+5.171
BUS+5.140
Neg logit contributions
ortium-10.589
adder-10.581
adian-10.500
etting-9.988
monary-9.963
Token're
Token ID821
Feature activation+1.03e+00

Pos logit contributions
 Bubble+3.903
spect+3.693
enough+3.481
 wisely+3.331
BUS+3.303
Neg logit contributions
adder-6.694
ortium-6.502
adian-6.489
etting-6.318
monary-6.093
Token not
Token ID407
Feature activation+1.24e+00

Pos logit contributions
 Bubble+3.176
spect+3.078
enough+3.015
+2.974
BUS+2.901
Neg logit contributions
adder-2.575
adian-2.501
ortium-2.440
etting-2.343
monary-2.237
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
 Bubble+3.171
spect+3.067
enough+2.977
+2.858
BUS+2.829
Neg logit contributions
adder-3.720
adian-3.631
ortium-3.593
etting-3.458
monary-3.337
Token robot
Token ID9379
Feature activation-5.33e-01

Pos logit contributions
 Bubble+2.559
spect+2.450
enough+2.358
+2.266
BUS+2.219
Neg logit contributions
adder-4.161
adian-4.069
ortium-4.036
etting-3.903
monary-3.796
Token is
Token ID318
Feature activation+8.09e-01

Pos logit contributions
 Bubble+1.252
spect+1.210
enough+1.181
+1.159
BUS+1.132
Neg logit contributions
adder-1.284
adian-1.250
ortium-1.225
etting-1.184
monary-1.133
Token in
Token ID287
Feature activation+9.94e-01

Pos logit contributions
 Bubble+2.225
spect+2.147
enough+2.094
+2.061
BUS+2.008
Neg logit contributions
adder-2.300
adian-2.243
ortium-2.194
etting-2.125
monary-2.027
Token with
Token ID351
Feature activation+6.87e-01

Pos logit contributions
 Bubble+2.738
spect+2.637
enough+2.579
+2.532
BUS+2.474
Neg logit contributions
adder-2.825
adian-2.759
ortium-2.699
etting-2.610
monary-2.496
Token all
Token ID477
Feature activation+8.58e-01

Pos logit contributions
 Bubble+1.700
spect+1.645
enough+1.594
+1.563
BUS+1.524
Neg logit contributions
adder-2.138
adian-2.092
ortium-2.058
etting-1.995
monary-1.903
Token the
Token ID262
Feature activation+8.55e-01

Pos logit contributions
 Bubble+2.237
spect+2.160
enough+2.115
+2.049
BUS+2.015
Neg logit contributions
adder-2.567
adian-2.499
ortium-2.457
etting-2.373
monary-2.267
Token olig
Token ID24869
Feature activation+3.03e+00

Pos logit contributions
 Bubble+2.049
spect+1.986
enough+1.936
+1.883
BUS+1.841
Neg logit contributions
adder-2.719
adian-2.666
ortium-2.618
etting-2.554
monary-2.423
Tokenarchs
Token ID34592
Feature activation+1.39e+00

Pos logit contributions
 Bubble+4.032
spect+3.813
rollers+3.495
enough+3.451
+3.228
Neg logit contributions
adder-12.310
ortium-11.977
adian-11.857
etting-11.535
monary-11.332
Token,
Token ID11
Feature activation+6.26e-01

Pos logit contributions
 Bubble+4.135
spect+3.992
enough+3.899
+3.836
rollers+3.754
Neg logit contributions
adder-3.612
adian-3.512
ortium-3.434
etting-3.283
monary-3.186
Token
Token ID447
Feature activation+6.08e-01

Pos logit contributions
 Bubble+1.922
spect+1.860
enough+1.818
+1.787
BUS+1.752
Neg logit contributions
adder-1.582
adian-1.542
ortium-1.506
etting-1.446
monary-1.368
Token
Token ID251
Feature activation+9.65e-01

Pos logit contributions
 Bubble+1.711
spect+1.655
enough+1.610
+1.586
BUS+1.542
Neg logit contributions
adder-1.694
adian-1.654
ortium-1.613
etting-1.558
monary-1.491
Token Letter
Token ID18121
Feature activation+1.09e+00

Pos logit contributions
 Bubble+2.483
spect+2.390
enough+2.335
+2.286
BUS+2.219
Neg logit contributions
adder-2.912
adian-2.843
ortium-2.791
etting-2.697
monary-2.605
Token reading
Token ID3555
Feature activation+2.93e+00

Pos logit contributions
 Bubble+1.774
spect+1.710
enough+1.660
+1.615
BUS+1.582
Neg logit contributions
adder-2.026
adian-1.985
ortium-1.963
etting-1.873
monary-1.810
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 Bubble+5.888
spect+5.555
enough+5.205
BUS+4.986
natureconservancy+4.893
Neg logit contributions
adian-10.055
adder-10.037
ortium-9.989
monary-9.447
etting-9.432
Token main
Token ID1388
Feature activation+6.05e-01

Pos logit contributions
 Bubble+4.237
spect+4.014
enough+3.902
+3.700
BUS+3.671
Neg logit contributions
adder-5.457
ortium-5.355
adian-5.339
etting-5.073
monary-4.954
Token story
Token ID1621
Feature activation+2.44e+00

Pos logit contributions
 Bubble+1.703
spect+1.644
enough+1.606
+1.575
BUS+1.539
Neg logit contributions
adder-1.680
adian-1.637
ortium-1.604
etting-1.545
monary-1.484
Token Please
Token ID4222
Feature activation+1.38e+00

Pos logit contributions
 Bubble+6.346
spect+5.931
enough+5.554
 wisely+5.486
+5.422
Neg logit contributions
adder-7.390
adian-7.266
ortium-7.151
etting-6.791
monary-6.637
Token verify
Token ID11767
Feature activation+3.01e+00

Pos logit contributions
 Bubble+3.489
spect+3.374
enough+3.233
natureconservancy+3.095
BUS+3.075
Neg logit contributions
adder-4.162
ortium-4.113
adian-4.046
etting-3.796
monary-3.776
Token you
Token ID345
Feature activation+1.96e+00

Pos logit contributions
 Bubble+5.977
spect+5.741
enough+5.221
BUS+5.120
 wisely+5.097
Neg logit contributions
adder-10.234
ortium-10.207
adian-10.150
etting-9.643
monary-9.618
Token're
Token ID821
Feature activation+9.51e-01

Pos logit contributions
 Bubble+4.000
spect+3.773
enough+3.574
 wisely+3.420
BUS+3.404
Neg logit contributions
adder-6.839
ortium-6.648
adian-6.619
etting-6.448
monary-6.238
Token not
Token ID407
Feature activation+1.20e+00

Pos logit contributions
 Bubble+2.976
spect+2.887
enough+2.830
+2.806
BUS+2.727
Neg logit contributions
adder-2.352
adian-2.279
ortium-2.221
etting-2.131
monary-2.032
Token a
Token ID257
Feature activation+1.17e+00

Pos logit contributions
 Bubble+3.106
spect+2.998
enough+2.913
+2.811
BUS+2.772
Neg logit contributions
adder-3.589
adian-3.504
ortium-3.460
etting-3.333
monary-3.212
Token robot
Token ID9379
Feature activation-3.80e-01

Pos logit contributions
 Bubble+2.522
spect+2.411
enough+2.332
+2.258
BUS+2.199
Neg logit contributions
adder-3.988
adian-3.904
ortium-3.854
etting-3.734
monary-3.622
Token time
Token ID640
Feature activation+1.53e+00

Pos logit contributions
 Bubble+2.358
spect+2.276
enough+2.223
+2.183
BUS+2.128
Neg logit contributions
adder-2.399
adian-2.340
ortium-2.291
etting-2.211
monary-2.124
Token came
Token ID1625
Feature activation+8.76e-01

Pos logit contributions
 Bubble+4.416
spect+4.238
enough+4.135
+4.062
BUS+3.988
Neg logit contributions
adder-4.148
adian-4.047
ortium-3.977
etting-3.823
monary-3.692
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
 Bubble+2.541
spect+2.454
enough+2.399
+2.353
BUS+2.303
Neg logit contributions
adder-2.357
adian-2.297
ortium-2.248
etting-2.161
monary-2.078
Token actually
Token ID1682
Feature activation+2.23e-01

Pos logit contributions
 Bubble+3.088
spect+2.978
enough+2.916
+2.845
BUS+2.799
Neg logit contributions
adder-2.792
adian-2.710
ortium-2.674
etting-2.552
monary-2.466
Token start
Token ID923
Feature activation+9.84e-02

Pos logit contributions
 Bubble+0.651
spect+0.631
enough+0.616
+0.612
BUS+0.593
Neg logit contributions
adder-0.595
adian-0.582
ortium-0.563
etting-0.549
monary-0.520
Token grading
Token ID43165
Feature activation+3.01e+00

Pos logit contributions
 Bubble+0.283
spect+0.273
enough+0.268
+0.265
BUS+0.259
Neg logit contributions
adder-0.267
adian-0.259
ortium-0.251
etting-0.248
monary-0.232
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 Bubble+6.976
spect+6.633
enough+6.315
 wisely+6.274
+6.045
Neg logit contributions
adder-9.323
adian-9.142
ortium-9.080
etting-8.552
monary-8.511
Token Parents
Token ID28231
Feature activation+1.30e+00

Pos logit contributions
 Bubble+2.637
spect+2.547
enough+2.483
+2.436
BUS+2.369
Neg logit contributions
adder-3.075
adian-3.000
ortium-2.940
etting-2.849
monary-2.737
Token who
Token ID508
Feature activation+2.74e-01

Pos logit contributions
 Bubble+3.739
spect+3.603
enough+3.526
+3.433
 wisely+3.416
Neg logit contributions
adder-3.473
adian-3.380
ortium-3.318
etting-3.169
monary-3.079
Token had
Token ID550
Feature activation+6.98e-01

Pos logit contributions
 Bubble+0.841
spect+0.817
enough+0.799
+0.790
natureconservancy+0.770
Neg logit contributions
adder-0.688
adian-0.674
ortium-0.653
etting-0.631
monary-0.600
Token been
Token ID587
Feature activation+7.51e-01

Pos logit contributions
 Bubble+2.002
spect+1.943
enough+1.893
+1.875
BUS+1.819
Neg logit contributions
adder-1.898
adian-1.855
ortium-1.810
etting-1.742
monary-1.671
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
adder+0.100
adian+0.098
ortium+0.093
etting+0.091
monary+0.085
Neg logit contributions
 Bubble-0.132
spect-0.129
enough-0.126
-0.125
natureconservancy-0.123
Token you
Token ID345
Feature activation+5.17e-01

Pos logit contributions
adder+0.042
adian+0.041
ortium+0.040
etting+0.039
monary+0.037
Neg logit contributions
 Bubble-0.039
spect-0.037
enough-0.037
-0.036
natureconservancy-0.035
Token
Token ID447
Feature activation+2.50e-01

Pos logit contributions
 Bubble+1.536
spect+1.484
enough+1.452
+1.426
BUS+1.397
Neg logit contributions
adder-1.357
adian-1.318
ortium-1.292
etting-1.241
monary-1.189
Token
Token ID247
Feature activation+2.34e-01

Pos logit contributions
 Bubble+0.848
spect+0.824
enough+0.808
+0.802
rollers+0.785
Neg logit contributions
adder-0.548
adian-0.537
ortium-0.517
etting-0.492
monary-0.466
Tokenre
Token ID260
Feature activation+7.03e-01

Pos logit contributions
 Bubble+0.652
spect+0.628
enough+0.614
+0.606
natureconservancy+0.589
Neg logit contributions
adder-0.658
adian-0.642
ortium-0.627
etting-0.608
monary-0.582
Token lucky
Token ID9670
Feature activation+2.97e+00

Pos logit contributions
 Bubble+1.843
spect+1.775
enough+1.730
+1.701
BUS+1.652
Neg logit contributions
adder-2.091
adian-2.039
ortium-1.993
etting-1.934
monary-1.854
Token if
Token ID611
Feature activation+8.95e-01

Pos logit contributions
 Bubble+7.222
spect+6.963
enough+6.825
 wisely+6.468
+6.448
Neg logit contributions
adder-8.748
adian-8.489
ortium-8.474
etting-8.084
monary-8.001
Token you
Token ID345
Feature activation-4.53e-02

Pos logit contributions
 Bubble+2.223
spect+2.138
enough+2.081
+2.039
BUS+1.982
Neg logit contributions
adder-2.783
adian-2.720
ortium-2.668
etting-2.584
monary-2.492
Token get
Token ID651
Feature activation+5.81e-01

Pos logit contributions
adder+0.116
adian+0.114
ortium+0.110
etting+0.106
monary+0.102
Neg logit contributions
 Bubble-0.137
spect-0.133
enough-0.129
-0.128
natureconservancy-0.125
Token away
Token ID1497
Feature activation+1.68e-01

Pos logit contributions
 Bubble+1.668
spect+1.619
enough+1.573
+1.554
BUS+1.513
Neg logit contributions
adder-1.572
adian-1.539
ortium-1.501
etting-1.446
monary-1.388
Token with
Token ID351
Feature activation-3.45e-01

Pos logit contributions
 Bubble+0.538
spect+0.520
enough+0.507
+0.500
BUS+0.493
Neg logit contributions
adder-0.403
adian-0.390
ortium-0.382
etting-0.369
monary-0.348
Token reading
Token ID3555
Feature activation+3.19e+00

Pos logit contributions
 Bubble+1.558
spect+1.503
enough+1.464
+1.432
BUS+1.398
Neg logit contributions
adder-1.718
adian-1.678
ortium-1.650
etting-1.587
monary-1.529
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
 Bubble+6.177
spect+5.845
enough+5.483
BUS+5.141
natureconservancy+5.112
Neg logit contributions
adian-10.945
adder-10.935
ortium-10.845
monary-10.329
etting-10.289
Token main
Token ID1388
Feature activation+8.40e-01

Pos logit contributions
 Bubble+4.341
spect+4.102
enough+3.982
+3.777
BUS+3.719
Neg logit contributions
adder-5.725
ortium-5.618
adian-5.613
etting-5.332
monary-5.220
Token story
Token ID1621
Feature activation+2.58e+00

Pos logit contributions
 Bubble+2.298
spect+2.210
enough+2.157
+2.098
BUS+2.063
Neg logit contributions
adder-2.404
adian-2.345
ortium-2.312
etting-2.214
monary-2.136
Token Please
Token ID4222
Feature activation+1.25e+00

Pos logit contributions
 Bubble+6.554
spect+6.049
 wisely+5.645
enough+5.633
+5.447
Neg logit contributions
adder-7.970
adian-7.839
ortium-7.692
etting-7.297
monary-7.250
Token verify
Token ID11767
Feature activation+2.97e+00

Pos logit contributions
 Bubble+3.198
spect+3.089
enough+2.969
natureconservancy+2.846
BUS+2.815
Neg logit contributions
adder-3.773
ortium-3.724
adian-3.670
etting-3.436
monary-3.415
Token you
Token ID345
Feature activation+1.73e+00

Pos logit contributions
 Bubble+5.889
spect+5.649
enough+5.129
BUS+5.043
 wisely+5.027
Neg logit contributions
adder-10.064
ortium-10.059
adian-9.957
etting-9.485
monary-9.475
Token're
Token ID821
Feature activation+9.10e-01

Pos logit contributions
 Bubble+3.615
spect+3.415
enough+3.231
 wisely+3.093
BUS+3.073
Neg logit contributions
adder-6.001
adian-5.834
ortium-5.825
etting-5.661
monary-5.468
Token not
Token ID407
Feature activation+1.24e+00

Pos logit contributions
 Bubble+2.848
spect+2.760
enough+2.707
+2.685
BUS+2.609
Neg logit contributions
adder-2.250
adian-2.180
ortium-2.123
etting-2.038
monary-1.942
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
 Bubble+3.189
spect+3.083
enough+2.992
+2.880
BUS+2.844
Neg logit contributions
adder-3.705
adian-3.617
ortium-3.577
etting-3.442
monary-3.320
Token robot
Token ID9379
Feature activation-4.67e-01

Pos logit contributions
 Bubble+2.348
spect+2.247
enough+2.175
+2.114
BUS+2.053
Neg logit contributions
adder-3.672
adian-3.596
ortium-3.543
etting-3.435
monary-3.329
Token who
Token ID508
Feature activation-1.64e-01

Pos logit contributions
adder+0.741
adian+0.718
ortium+0.699
etting+0.681
monary+0.644
Neg logit contributions
 Bubble-0.771
spect-0.746
-0.727
enough-0.727
BUS-0.708
Token suffer
Token ID8659
Feature activation-1.35e+00

Pos logit contributions
adder+0.406
adian+0.394
ortium+0.381
etting+0.368
monary+0.352
Neg logit contributions
 Bubble-0.509
spect-0.495
enough-0.484
-0.483
natureconservancy-0.468
Token from
Token ID422
Feature activation-3.72e-01

Pos logit contributions
adder+3.298
adian+3.087
ortium+2.975
etting+2.951
 incur+2.893
Neg logit contributions
 Bubble-4.063
spect-3.971
-3.871
enough-3.829
BUS-3.779
Token overly
Token ID17698
Feature activation+6.89e-01

Pos logit contributions
adder+1.023
adian+0.980
ortium+0.950
etting+0.932
monary+0.896
Neg logit contributions
 Bubble-1.047
spect-1.029
enough-0.995
-0.987
natureconservancy-0.966
Token-
Token ID12
Feature activation+1.42e+00

Pos logit contributions
 Bubble+1.861
spect+1.794
enough+1.757
+1.722
BUS+1.678
Neg logit contributions
adder-1.997
adian-1.942
ortium-1.900
etting-1.838
monary-1.769
Tokenpres
Token ID18302
Feature activation+2.96e+00

Pos logit contributions
 Bubble+4.173
spect+4.147
enough+4.025
+3.854
rollers+3.849
Neg logit contributions
adder-3.648
adian-3.591
ortium-3.531
etting-3.363
monary-3.239
Tokencript
Token ID6519
Feature activation+7.85e-01

Pos logit contributions
spect+7.143
 Bubble+7.002
enough+6.808
ours+6.402
rollers+6.296
Neg logit contributions
adder-8.692
adian-8.450
ortium-8.368
etting-7.844
monary-7.776
Tokenive
Token ID425
Feature activation+1.11e+00

Pos logit contributions
 Bubble+2.344
spect+2.269
enough+2.223
+2.189
BUS+2.138
Neg logit contributions
adder-2.045
adian-1.991
ortium-1.943
etting-1.874
monary-1.794
Token and
Token ID290
Feature activation-4.50e-02

Pos logit contributions
 Bubble+3.057
spect+2.924
enough+2.861
+2.792
BUS+2.723
Neg logit contributions
adder-3.148
adian-3.067
ortium-3.036
etting-2.894
monary-2.780
Token overs
Token ID10753
Feature activation+8.00e-01

Pos logit contributions
adder+0.125
adian+0.121
ortium+0.117
etting+0.114
monary+0.109
Neg logit contributions
 Bubble-0.127
spect-0.123
enough-0.120
-0.119
natureconservancy-0.116
Tokenimpl
Token ID23928
Feature activation+5.70e-01

Pos logit contributions
 Bubble+2.072
spect+1.992
enough+1.944
+1.909
BUS+1.856
Neg logit contributions
adder-2.407
adian-2.348
ortium-2.301
etting-2.230
monary-2.144
Token creating
Token ID4441
Feature activation+3.95e-01

Pos logit contributions
adder+0.122
adian+0.119
ortium+0.115
etting+0.112
monary+0.105
Neg logit contributions
 Bubble-0.150
spect-0.146
enough-0.143
-0.142
BUS-0.138
Token natural
Token ID3288
Feature activation-2.50e-01

Pos logit contributions
 Bubble+1.065
spect+1.034
enough+1.006
+0.990
BUS+0.964
Neg logit contributions
adder-1.137
adian-1.112
ortium-1.085
etting-1.055
monary-1.007
Token separation
Token ID14139
Feature activation-9.19e-01

Pos logit contributions
adder+0.678
adian+0.669
ortium+0.636
etting+0.631
monary+0.591
Neg logit contributions
 Bubble-0.699
spect-0.696
enough-0.669
-0.669
rollers-0.653
Token from
Token ID422
Feature activation-2.00e-01

Pos logit contributions
adder+2.241
adian+2.168
etting+2.116
ortium+2.077
 incur+1.910
Neg logit contributions
 Bubble-2.792
spect-2.726
enough-2.669
-2.667
BUS-2.594
Token the
Token ID262
Feature activation-1.84e-01

Pos logit contributions
adder+0.578
adian+0.567
ortium+0.552
etting+0.543
monary+0.507
Neg logit contributions
 Bubble-0.523
spect-0.515
enough-0.503
-0.494
rollers-0.481
Token guessing
Token ID25260
Feature activation+2.94e+00

Pos logit contributions
adder+0.498
adian+0.494
ortium+0.473
etting+0.469
monary+0.436
Neg logit contributions
 Bubble-0.511
spect-0.510
enough-0.500
-0.488
BUS-0.477
Token cornerback
Token ID26683
Feature activation+1.14e+00

Pos logit contributions
 Bubble+8.060
spect+7.439
 wisely+7.058
+7.049
enough+7.036
Neg logit contributions
adder-8.132
ortium-7.959
adian-7.848
monary-7.329
etting-7.298
Token.
Token ID13
Feature activation-4.24e-02

Pos logit contributions
 Bubble+3.050
spect+2.927
enough+2.846
+2.794
rollers+2.719
Neg logit contributions
adder-3.336
adian-3.245
ortium-3.189
etting-3.069
monary-2.971
Token
Token ID198
Feature activation-3.19e-01

Pos logit contributions
adder+0.123
adian+0.119
ortium+0.116
etting+0.114
monary+0.107
Neg logit contributions
 Bubble-0.109
spect-0.109
enough-0.107
-0.104
BUS-0.101
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
adder+0.864
adian+0.820
etting+0.783
ortium+0.781
monary+0.732
Neg logit contributions
 Bubble-0.906
-0.886
spect-0.866
enough-0.853
▬▬-0.826
TokenWhile
Token ID3633
Feature activation-2.03e-01

Pos logit contributions
adder+0.296
adian+0.287
ortium+0.277
etting+0.275
monary+0.258
Neg logit contributions
 Bubble-0.280
spect-0.266
enough-0.260
-0.260
rollers-0.251
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
adder+0.760
adian+0.725
etting+0.690
ortium+0.688
monary+0.647
Neg logit contributions
 Bubble-0.784
-0.767
spect-0.749
enough-0.736
▬▬-0.713
TokenNewsletter
Token ID33031
Feature activation+1.38e+00

Pos logit contributions
 Bubble+0.044
spect+0.041
+0.041
enough+0.041
 wisely+0.039
Neg logit contributions
adder-0.049
adian-0.047
ortium-0.045
etting-0.045
monary-0.043
Token Sign
Token ID5865
Feature activation+2.48e-01

Pos logit contributions
 Bubble+3.447
spect+3.274
enough+3.080
+3.001
 wisely+2.999
Neg logit contributions
adder-4.318
ortium-4.217
adian-4.215
etting-3.992
monary-3.919
Token Up
Token ID3205
Feature activation+1.39e+00

Pos logit contributions
 Bubble+0.624
spect+0.614
enough+0.610
+0.605
BUS+0.583
Neg logit contributions
adder-0.743
adian-0.714
ortium-0.708
etting-0.697
monary-0.657
Token Continue
Token ID10054
Feature activation+6.79e-01

Pos logit contributions
 Bubble+3.396
spect+3.248
enough+3.094
+3.001
BUS+2.958
Neg logit contributions
adder-4.423
adian-4.312
ortium-4.276
etting-4.027
monary-4.009
Token reading
Token ID3555
Feature activation+2.93e+00

Pos logit contributions
 Bubble+1.774
spect+1.710
enough+1.660
+1.615
BUS+1.582
Neg logit contributions
adder-2.026
adian-1.985
ortium-1.963
etting-1.873
monary-1.810
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 Bubble+5.888
spect+5.555
enough+5.205
BUS+4.986
natureconservancy+4.893
Neg logit contributions
adian-10.055
adder-10.037
ortium-9.989
monary-9.447
etting-9.432
Token main
Token ID1388
Feature activation+6.05e-01

Pos logit contributions
 Bubble+4.237
spect+4.014
enough+3.902
+3.700
BUS+3.671
Neg logit contributions
adder-5.457
ortium-5.355
adian-5.339
etting-5.073
monary-4.954
Token story
Token ID1621
Feature activation+2.44e+00

Pos logit contributions
 Bubble+1.703
spect+1.644
enough+1.606
+1.575
BUS+1.539
Neg logit contributions
adder-1.680
adian-1.637
ortium-1.604
etting-1.545
monary-1.484
Token Please
Token ID4222
Feature activation+1.38e+00

Pos logit contributions
 Bubble+6.346
spect+5.931
enough+5.554
 wisely+5.486
+5.422
Neg logit contributions
adder-7.390
adian-7.266
ortium-7.151
etting-6.791
monary-6.637
Token verify
Token ID11767
Feature activation+3.01e+00

Pos logit contributions
 Bubble+3.489
spect+3.374
enough+3.233
natureconservancy+3.095
BUS+3.075
Neg logit contributions
adder-4.162
ortium-4.113
adian-4.046
etting-3.796
monary-3.776
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
 Bubble+0.548
spect+0.534
enough+0.515
+0.505
rollers+0.495
Neg logit contributions
adder-0.704
adian-0.683
ortium-0.671
etting-0.651
monary-0.633
Token
Token ID198
Feature activation-3.86e-01

Pos logit contributions
adder+0.569
adian+0.556
ortium+0.544
etting+0.525
monary+0.502
Neg logit contributions
 Bubble-0.506
spect-0.488
enough-0.478
-0.473
natureconservancy-0.460
Token
Token ID198
Feature activation-3.75e-02

Pos logit contributions
adder+1.019
adian+0.959
etting+0.919
ortium+0.912
arger+0.856
Neg logit contributions
 Bubble-1.120
-1.111
spect-1.069
enough-1.055
▬▬-1.032
TokenAdvertisement
Token ID4723
Feature activation+1.82e+00

Pos logit contributions
adder+0.110
adian+0.107
ortium+0.102
etting+0.101
monary+0.096
Neg logit contributions
 Bubble-0.101
spect-0.094
-0.093
enough-0.092
natureconservancy-0.090
Token Continue
Token ID10054
Feature activation+7.45e-01

Pos logit contributions
 Bubble+4.043
spect+3.918
enough+3.740
rollers+3.588
ours+3.578
Neg logit contributions
adder-5.935
adian-5.821
ortium-5.798
etting-5.541
monary-5.490
Token reading
Token ID3555
Feature activation+2.92e+00

Pos logit contributions
 Bubble+2.013
spect+1.940
enough+1.895
+1.864
BUS+1.813
Neg logit contributions
adder-2.153
adian-2.099
ortium-2.054
etting-1.989
monary-1.910
Token the
Token ID262
Feature activation+1.60e+00

Pos logit contributions
 Bubble+6.000
spect+5.711
enough+5.376
BUS+5.144
natureconservancy+5.062
Neg logit contributions
adder-9.965
adian-9.960
ortium-9.857
etting-9.329
monary-9.314
Token main
Token ID1388
Feature activation+2.48e-01

Pos logit contributions
 Bubble+3.901
spect+3.768
enough+3.686
BUS+3.461
+3.453
Neg logit contributions
adder-4.956
adian-4.856
ortium-4.845
etting-4.591
monary-4.475
Token story
Token ID1621
Feature activation+1.86e+00

Pos logit contributions
 Bubble+0.729
spect+0.706
enough+0.691
+0.688
BUS+0.665
Neg logit contributions
adder-0.651
adian-0.634
ortium-0.615
etting-0.597
monary-0.572
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
 Bubble+4.266
spect+4.080
enough+3.933
BUS+3.732
rollers+3.689
Neg logit contributions
adder-6.079
adian-6.002
ortium-5.961
etting-5.695
monary-5.540
Token
Token ID198
Feature activation-9.64e-02

Pos logit contributions
adder+0.343
adian+0.326
ortium+0.313
etting+0.312
monary+0.292
Neg logit contributions
 Bubble-0.358
-0.352
spect-0.342
enough-0.337
▬▬-0.328
Token heavy
Token ID4334
Feature activation+1.05e+00

Pos logit contributions
 Bubble+5.167
spect+5.013
enough+4.860
+4.755
BUS+4.705
Neg logit contributions
adder-6.320
adian-6.260
ortium-6.194
etting-5.835
monary-5.693
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 Bubble+2.855
spect+2.766
enough+2.698
+2.632
BUS+2.574
Neg logit contributions
adder-3.025
adian-2.948
ortium-2.902
etting-2.771
monary-2.672
Token
Token ID447
Feature activation-7.43e-02

Pos logit contributions
 Bubble+0.707
spect+0.687
enough+0.673
+0.660
natureconservancy+0.643
Neg logit contributions
adder-0.813
adian-0.790
ortium-0.774
etting-0.749
monary-0.718
Token
Token ID251
Feature activation+1.62e-02

Pos logit contributions
adder+0.189
adian+0.184
ortium+0.179
etting+0.174
monary+0.164
Neg logit contributions
 Bubble-0.225
spect-0.218
enough-0.213
-0.209
BUS-0.204
Token<|endoftext|>
Token ID50256
Feature activation+1.64e+00

Pos logit contributions
 Bubble+0.042
spect+0.040
enough+0.040
+0.039
natureconservancy+0.038
Neg logit contributions
adder-0.048
adian-0.046
ortium-0.046
etting-0.044
monary-0.042
TokenFair
Token ID30099
Feature activation+2.91e+00

Pos logit contributions
 Bubble+4.219
spect+4.160
enough+4.066
BUS+3.934
+3.898
Neg logit contributions
adder-4.745
adian-4.665
ortium-4.636
etting-4.413
monary-4.284
Token play
Token ID711
Feature activation+7.57e-01

Pos logit contributions
 Bubble+7.613
enough+7.394
spect+7.387
BUS+6.816
+6.804
Neg logit contributions
adder-8.110
ortium-7.935
adian-7.931
etting-7.637
monary-7.261
Token?
Token ID30
Feature activation+1.03e+00

Pos logit contributions
 Bubble+2.136
spect+2.062
enough+2.016
+1.973
BUS+1.927
Neg logit contributions
adder-2.100
adian-2.048
ortium-2.005
etting-1.930
monary-1.856
Token To
Token ID1675
Feature activation+6.94e-01

Pos logit contributions
 Bubble+2.619
spect+2.531
enough+2.463
+2.423
BUS+2.352
Neg logit contributions
adder-3.159
adian-3.083
ortium-3.021
etting-2.928
monary-2.812
Token the
Token ID262
Feature activation+7.09e-01

Pos logit contributions
 Bubble+1.939
spect+1.875
enough+1.834
+1.805
BUS+1.755
Neg logit contributions
adder-1.943
adian-1.893
ortium-1.852
etting-1.791
monary-1.712
Token Premier
Token ID9952
Feature activation+6.95e-01

Pos logit contributions
 Bubble+1.933
spect+1.866
enough+1.823
+1.789
BUS+1.744
Neg logit contributions
adder-2.031
adian-1.982
ortium-1.941
etting-1.875
monary-1.801
Tokenyoutube
Token ID11604
Feature activation+2.70e-01

Pos logit contributions
 Bubble+1.890
spect+1.832
enough+1.794
+1.766
BUS+1.727
Neg logit contributions
adder-1.487
adian-1.445
ortium-1.411
etting-1.354
monary-1.292
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
 Bubble+0.743
spect+0.712
+0.710
enough+0.697
BUS+0.670
Neg logit contributions
adder-0.765
adian-0.740
ortium-0.717
etting-0.706
monary-0.670
Tokencom
Token ID785
Feature activation+1.01e+00

Pos logit contributions
 Bubble+0.385
+0.364
spect+0.362
enough+0.353
BUS+0.341
Neg logit contributions
adder-0.513
adian-0.499
ortium-0.486
etting-0.478
monary-0.461
Token/
Token ID14
Feature activation+6.45e-01

Pos logit contributions
 Bubble+2.513
spect+2.417
enough+2.355
+2.321
BUS+2.250
Neg logit contributions
adder-3.145
adian-3.066
ortium-3.003
etting-2.919
monary-2.813
Tokenwatch
Token ID8340
Feature activation+8.41e-01

Pos logit contributions
 Bubble+2.012
spect+1.973
enough+1.931
+1.884
rollers+1.846
Neg logit contributions
adder-1.570
adian-1.532
ortium-1.504
etting-1.433
monary-1.365
Token?
Token ID30
Feature activation+2.91e+00

Pos logit contributions
 Bubble+1.854
spect+1.772
enough+1.719
+1.697
BUS+1.630
Neg logit contributions
adder-2.852
adian-2.788
ortium-2.734
etting-2.663
monary-2.572
Tokenv
Token ID85
Feature activation+1.50e+00

Pos logit contributions
spect+4.868
 Bubble+4.772
enough+4.589
vin+4.248
rollers+4.184
Neg logit contributions
adder-10.455
ortium-10.442
adian-10.315
etting-9.811
monary-9.557
Token=
Token ID28
Feature activation-4.55e-01

Pos logit contributions
 Bubble+5.001
spect+4.874
enough+4.795
+4.651
BUS+4.619
Neg logit contributions
adder-3.274
adian-3.173
ortium-3.118
etting-2.943
monary-2.777
Token2
Token ID17
Feature activation+5.22e-02

Pos logit contributions
adder+1.212
adian+1.187
ortium+1.128
etting+1.112
monary+1.051
Neg logit contributions
 Bubble-1.343
spect-1.268
-1.261
enough-1.248
natureconservancy-1.203
Tokeng
Token ID70
Feature activation+4.87e-01

Pos logit contributions
 Bubble+0.152
spect+0.145
enough+0.142
+0.142
natureconservancy+0.136
Neg logit contributions
adder-0.142
adian-0.138
ortium-0.133
etting-0.130
monary-0.124
Token7
Token ID22
Feature activation-1.06e-02

Pos logit contributions
 Bubble+1.419
spect+1.366
enough+1.337
+1.323
rollers+1.279
Neg logit contributions
adder-1.311
adian-1.281
ortium-1.243
etting-1.204
monary-1.150
Token vit
Token ID9090
Feature activation+9.46e-01

Pos logit contributions
 Bubble+1.997
spect+1.933
enough+1.885
+1.852
BUS+1.802
Neg logit contributions
adder-2.227
adian-2.178
ortium-2.133
etting-2.063
monary-1.980
Tokenicultural
Token ID26823
Feature activation+3.34e-01

Pos logit contributions
 Bubble+3.588
spect+3.495
enough+3.434
+3.405
BUS+3.340
Neg logit contributions
adder-1.712
adian-1.651
ortium-1.590
etting-1.512
monary-1.404
Token viability
Token ID32141
Feature activation-1.84e-01

Pos logit contributions
 Bubble+0.922
spect+0.895
enough+0.881
+0.874
rollers+0.846
Neg logit contributions
adder-0.933
adian-0.918
ortium-0.888
etting-0.870
monary-0.824
Token,
Token ID11
Feature activation-3.92e-01

Pos logit contributions
adder+0.486
adian+0.476
ortium+0.460
etting+0.449
monary+0.422
Neg logit contributions
 Bubble-0.533
spect-0.515
enough-0.511
-0.504
BUS-0.489
Token the
Token ID262
Feature activation-2.83e-01

Pos logit contributions
adder+1.176
adian+1.160
ortium+1.136
etting+1.097
monary+1.045
Neg logit contributions
 Bubble-0.985
spect-0.954
enough-0.940
-0.927
BUS-0.901
Token math
Token ID10688
Feature activation+2.90e+00

Pos logit contributions
adder+0.817
adian+0.811
ortium+0.790
etting+0.769
monary+0.727
Neg logit contributions
 Bubble-0.740
spect-0.723
enough-0.711
-0.705
BUS-0.684
Token becomes
Token ID4329
Feature activation-8.89e-03

Pos logit contributions
 Bubble+7.311
spect+7.001
enough+6.641
+6.603
 wisely+6.574
Neg logit contributions
adder-8.450
ortium-8.167
adian-8.104
etting-7.594
monary-7.464
Token basic
Token ID4096
Feature activation-2.67e-02

Pos logit contributions
adder+0.026
adian+0.026
ortium+0.025
etting+0.024
monary+0.023
Neg logit contributions
 Bubble-0.023
spect-0.022
enough-0.022
-0.022
rollers-0.021
Token when
Token ID618
Feature activation+8.77e-01

Pos logit contributions
adder+0.076
adian+0.074
ortium+0.072
etting+0.071
monary+0.067
Neg logit contributions
 Bubble-0.073
spect-0.070
enough-0.068
-0.067
rollers-0.066
Token it
Token ID340
Feature activation+1.10e+00

Pos logit contributions
 Bubble+2.379
spect+2.295
enough+2.239
+2.198
BUS+2.142
Neg logit contributions
adder-2.523
adian-2.462
ortium-2.411
etting-2.329
monary-2.239
Token comes
Token ID2058
Feature activation+2.52e-01

Pos logit contributions
 Bubble+2.246
spect+2.143
enough+2.071
+2.019
BUS+1.951
Neg logit contributions
adder-3.880
adian-3.804
ortium-3.741
etting-3.638
monary-3.526
Token:
Token ID25
Feature activation+1.93e+00

Pos logit contributions
 Bubble+4.134
spect+3.981
enough+3.854
natureconservancy+3.641
BUS+3.625
Neg logit contributions
adder-5.842
adian-5.779
ortium-5.741
etting-5.470
monary-5.337
Token Mike
Token ID4995
Feature activation+1.19e+00

Pos logit contributions
 Bubble+5.067
spect+4.763
enough+4.616
+4.511
 wisely+4.456
Neg logit contributions
adder-5.643
adian-5.494
ortium-5.439
etting-5.231
monary-5.052
TokenM
Token ID44
Feature activation+5.60e-01

Pos logit contributions
 Bubble+3.122
spect+3.012
enough+2.919
+2.863
BUS+2.804
Neg logit contributions
adder-3.486
adian-3.411
ortium-3.343
etting-3.230
monary-3.113
Tokenanners
Token ID15672
Feature activation+2.28e-01

Pos logit contributions
 Bubble+2.061
spect+2.018
enough+1.981
+1.943
rollers+1.920
Neg logit contributions
adder-1.052
adian-1.017
ortium-0.989
etting-0.943
monary-0.881
Token -
Token ID532
Feature activation+2.32e+00

Pos logit contributions
 Bubble+0.594
spect+0.572
enough+0.558
+0.546
BUS+0.532
Neg logit contributions
adder-0.681
adian-0.665
ortium-0.653
etting-0.631
monary-0.608
Token favorite
Token ID4004
Feature activation+2.90e+00

Pos logit contributions
 Bubble+4.784
spect+4.488
enough+4.284
 wisely+4.089
rollers+3.943
Neg logit contributions
adder-8.039
ortium-7.948
adian-7.849
etting-7.467
monary-7.366
Token favorite
Token ID4004
Feature activation+2.81e+00

Pos logit contributions
 Bubble+5.405
spect+5.003
enough+4.748
 wisely+4.584
 secrets+4.325
Neg logit contributions
ortium-10.361
adder-10.342
adian-10.135
etting-9.739
monary-9.510
Token favorite
Token ID4004
Feature activation+2.59e+00

Pos logit contributions
 Bubble+5.326
spect+4.947
enough+4.699
 wisely+4.528
natureconservancy+4.277
Neg logit contributions
adder-9.976
ortium-9.974
adian-9.789
etting-9.403
monary-9.173
Token favorite
Token ID4004
Feature activation+2.16e+00

Pos logit contributions
 Bubble+5.124
spect+4.794
enough+4.566
 wisely+4.381
natureconservancy+4.192
Neg logit contributions
adder-9.047
ortium-9.000
adian-8.895
etting-8.530
monary-8.307
Token favorite
Token ID4004
Feature activation+1.70e+00

Pos logit contributions
 Bubble+4.198
spect+3.994
enough+3.801
 wisely+3.642
rollers+3.569
Neg logit contributions
adder-7.561
ortium-7.451
adian-7.436
etting-7.124
monary-6.987
Token -
Token ID532
Feature activation+1.68e+00

Pos logit contributions
 Bubble+3.587
spect+3.430
enough+3.278
+3.180
rollers+3.146
Neg logit contributions
adder-5.737
adian-5.632
ortium-5.605
etting-5.382
monary-5.243
Token Sanders
Token ID5831
Feature activation+2.91e-02

Pos logit contributions
adder+1.390
adian+1.335
etting+1.321
ortium+1.320
monary+1.227
Neg logit contributions
 Bubble-0.926
natureconservancy-0.899
-0.883
enough-0.874
spect-0.869
Token denied
Token ID6699
Feature activation-2.22e-01

Pos logit contributions
 Bubble+0.075
spect+0.072
enough+0.071
+0.069
natureconservancy+0.068
Neg logit contributions
adder-0.087
adian-0.085
ortium-0.083
etting-0.082
monary-0.077
Token there
Token ID612
Feature activation-6.66e-01

Pos logit contributions
adder+0.656
adian+0.638
ortium+0.629
etting+0.611
monary+0.581
Neg logit contributions
 Bubble-0.580
spect-0.554
enough-0.547
-0.538
natureconservancy-0.529
Token was
Token ID373
Feature activation-3.54e-01

Pos logit contributions
adder+1.557
adian+1.504
ortium+1.491
etting+1.436
monary+1.333
Neg logit contributions
 Bubble-2.137
spect-2.056
enough-2.025
-2.017
natureconservancy-1.983
Token any
Token ID597
Feature activation-4.58e-01

Pos logit contributions
adder+1.024
adian+1.014
ortium+0.993
etting+0.960
monary+0.906
Neg logit contributions
 Bubble-0.932
spect-0.899
-0.876
enough-0.869
rollers-0.851
Token need
Token ID761
Feature activation-3.34e+00

Pos logit contributions
adder+1.230
adian+1.203
ortium+1.179
etting+1.152
monary+1.067
Neg logit contributions
 Bubble-1.306
spect-1.266
enough-1.242
-1.231
rollers-1.205
Token to
Token ID284
Feature activation-8.99e-01

Pos logit contributions
adder+7.051
adian+7.014
ortium+6.763
etting+6.675
 incur+5.978
Neg logit contributions
 Bubble-10.542
spect-10.077
enough-9.918
-9.821
BUS-9.658
Token repair
Token ID9185
Feature activation-5.22e-01

Pos logit contributions
adian+2.010
adder+1.990
etting+1.863
ortium+1.856
 incur+1.797
Neg logit contributions
 Bubble-2.934
spect-2.888
-2.856
enough-2.833
natureconservancy-2.760
Token ties
Token ID8470
Feature activation-3.67e-01

Pos logit contributions
adder+1.501
adian+1.449
ortium+1.446
etting+1.405
 incur+1.314
Neg logit contributions
 Bubble-1.382
spect-1.323
enough-1.315
-1.301
natureconservancy-1.279
Token between
Token ID1022
Feature activation-1.22e+00

Pos logit contributions
adder+0.922
adian+0.892
ortium+0.876
etting+0.857
monary+0.791
Neg logit contributions
 Bubble-1.124
spect-1.091
enough-1.072
-1.061
natureconservancy-1.036
Token Trump
Token ID1301
Feature activation+5.02e-01

Pos logit contributions
adder+3.587
ortium+3.505
adian+3.493
etting+3.417
 incur+3.206
Neg logit contributions
 Bubble-3.036
spect-2.926
enough-2.890
natureconservancy-2.856
-2.830
Token,
Token ID11
Feature activation-7.51e-02

Pos logit contributions
adder+3.241
adian+3.114
etting+3.004
ortium+2.985
monary+2.778
Neg logit contributions
 Bubble-3.711
-3.592
spect-3.588
enough-3.584
natureconservancy-3.539
Token makes
Token ID1838
Feature activation+3.46e-03

Pos logit contributions
adder+0.221
adian+0.214
ortium+0.207
etting+0.202
monary+0.193
Neg logit contributions
 Bubble-0.198
spect-0.195
enough-0.189
-0.187
BUS-0.181
Token cheese
Token ID9891
Feature activation-1.30e+00

Pos logit contributions
 Bubble+0.008
spect+0.008
enough+0.008
+0.008
BUS+0.008
Neg logit contributions
adder-0.011
adian-0.011
ortium-0.010
etting-0.010
monary-0.010
Token,
Token ID11
Feature activation-8.72e-02

Pos logit contributions
adder+3.381
adian+3.233
ortium+3.155
etting+3.136
 incur+2.864
Neg logit contributions
 Bubble-3.683
spect-3.598
enough-3.581
-3.554
natureconservancy-3.521
Token cures
Token ID50019
Feature activation-1.48e-01

Pos logit contributions
adder+0.254
adian+0.246
ortium+0.239
etting+0.232
monary+0.221
Neg logit contributions
 Bubble-0.232
spect-0.228
enough-0.222
-0.220
BUS-0.213
Token bacon
Token ID21385
Feature activation-3.25e+00

Pos logit contributions
adder+0.427
adian+0.413
etting+0.398
ortium+0.398
monary+0.376
Neg logit contributions
 Bubble-0.390
spect-0.385
enough-0.379
-0.376
natureconservancy-0.371
Token,
Token ID11
Feature activation-3.78e-01

Pos logit contributions
adder+7.611
adian+7.289
etting+7.217
ortium+6.893
monary+6.543
Neg logit contributions
 Bubble-8.779
enough-8.685
spect-8.666
natureconservancy-8.642
-8.564
Token grind
Token ID20078
Feature activation+4.33e-01

Pos logit contributions
adder+1.117
adian+1.079
ortium+1.044
etting+1.017
monary+0.971
Neg logit contributions
 Bubble-0.981
spect-0.973
enough-0.944
-0.937
natureconservancy-0.907
Tokens
Token ID82
Feature activation-3.71e-02

Pos logit contributions
 Bubble+1.054
spect+1.011
enough+0.994
+0.981
BUS+0.954
Neg logit contributions
adder-1.368
adian-1.334
ortium-1.293
etting-1.275
monary-1.213
Token sausage
Token ID31889
Feature activation-2.02e+00

Pos logit contributions
adder+0.120
adian+0.117
ortium+0.113
etting+0.112
monary+0.106
Neg logit contributions
 Bubble-0.085
spect-0.084
enough-0.082
-0.081
BUS-0.079
Token,
Token ID11
Feature activation-2.70e-01

Pos logit contributions
adder+4.771
adian+4.584
etting+4.357
ortium+4.240
 incur+3.984
Neg logit contributions
 Bubble-5.988
spect-5.951
natureconservancy-5.870
-5.831
enough-5.815
Token are
Token ID389
Feature activation-2.59e-01

Pos logit contributions
adder+0.268
adian+0.261
ortium+0.257
etting+0.251
monary+0.237
Neg logit contributions
 Bubble-0.266
spect-0.254
-0.254
enough-0.251
natureconservancy-0.249
Token conflicting
Token ID24916
Feature activation+1.74e+00

Pos logit contributions
adder+0.683
adian+0.671
ortium+0.646
etting+0.625
monary+0.601
Neg logit contributions
 Bubble-0.751
spect-0.728
-0.718
enough-0.708
natureconservancy-0.693
Token reports
Token ID3136
Feature activation+2.46e-01

Pos logit contributions
 Bubble+3.505
spect+3.301
enough+3.178
+3.012
BUS+2.989
Neg logit contributions
adder-6.225
adian-6.059
ortium-5.996
etting-5.791
monary-5.611
Token as
Token ID355
Feature activation-2.19e-02

Pos logit contributions
 Bubble+0.695
spect+0.670
enough+0.659
+0.654
BUS+0.635
Neg logit contributions
adder-0.681
adian-0.663
ortium-0.648
etting-0.625
monary-0.596
Token to
Token ID284
Feature activation+9.30e-01

Pos logit contributions
adder+0.059
adian+0.057
ortium+0.056
etting+0.054
monary+0.052
Neg logit contributions
 Bubble-0.064
spect-0.062
enough-0.060
-0.059
BUS-0.058
Token fatalities
Token ID30091
Feature activation-3.25e+00

Pos logit contributions
 Bubble+2.346
spect+2.253
enough+2.193
+2.143
BUS+2.088
Neg logit contributions
adder-2.853
adian-2.787
ortium-2.736
etting-2.651
monary-2.551
Token;
Token ID26
Feature activation+5.15e-01

Pos logit contributions
adder+6.714
etting+6.273
ortium+5.926
 incurred+5.853
 incur+5.724
Neg logit contributions
 Bubble-10.117
-10.075
natureconservancy-9.873
▬▬-9.750
enough-9.443
Token Fox
Token ID5426
Feature activation+6.69e-01

Pos logit contributions
 Bubble+1.404
spect+1.356
enough+1.325
+1.312
BUS+1.270
Neg logit contributions
adder-1.472
adian-1.437
ortium-1.407
etting-1.356
monary-1.306
Token News
Token ID3000
Feature activation-6.48e-01

Pos logit contributions
 Bubble+1.524
spect+1.460
enough+1.419
+1.388
BUS+1.344
Neg logit contributions
adder-2.217
adian-2.170
ortium-2.132
etting-2.068
monary-2.000
Token has
Token ID468
Feature activation+6.67e-01

Pos logit contributions
adder+2.003
adian+1.948
ortium+1.907
etting+1.843
monary+1.792
Neg logit contributions
 Bubble-1.573
spect-1.526
enough-1.517
-1.514
natureconservancy-1.470
Token reported
Token ID2098
Feature activation-1.93e-01

Pos logit contributions
 Bubble+1.980
spect+1.914
enough+1.876
+1.863
BUS+1.814
Neg logit contributions
adder-1.749
adian-1.698
ortium-1.673
etting-1.588
monary-1.534
Token You
Token ID921
Feature activation+3.52e-01

Pos logit contributions
adder+0.018
adian+0.017
ortium+0.017
etting+0.016
monary+0.015
Neg logit contributions
 Bubble-0.015
spect-0.014
enough-0.014
-0.014
natureconservancy-0.014
Token know
Token ID760
Feature activation+1.37e-01

Pos logit contributions
 Bubble+1.081
spect+1.047
enough+1.025
+1.010
natureconservancy+0.986
Neg logit contributions
adder-0.888
adian-0.864
ortium-0.842
etting-0.811
monary-0.772
Token how
Token ID703
Feature activation+6.38e-01

Pos logit contributions
 Bubble+0.355
spect+0.346
enough+0.335
+0.332
natureconservancy+0.327
Neg logit contributions
adder-0.411
adian-0.399
ortium-0.390
etting-0.379
monary-0.359
Token Batman
Token ID9827
Feature activation-5.50e-01

Pos logit contributions
 Bubble+1.698
spect+1.644
enough+1.602
+1.578
BUS+1.540
Neg logit contributions
adder-1.874
adian-1.824
ortium-1.780
etting-1.725
monary-1.651
Token is
Token ID318
Feature activation-6.55e-02

Pos logit contributions
adder+1.599
adian+1.547
ortium+1.514
etting+1.475
monary+1.388
Neg logit contributions
 Bubble-1.447
spect-1.417
enough-1.399
natureconservancy-1.372
-1.361
Token Bruce
Token ID11088
Feature activation-3.20e+00

Pos logit contributions
adder+0.190
adian+0.183
ortium+0.177
etting+0.174
 incur+0.164
Neg logit contributions
 Bubble-0.175
spect-0.169
enough-0.167
-0.166
natureconservancy-0.163
Token Wayne
Token ID13329
Feature activation-4.74e-01

Pos logit contributions
adder+8.099
ortium+7.454
adian+7.145
etting+6.883
ofer+6.710
Neg logit contributions
natureconservancy-9.071
 Bubble-8.678
enough-8.651
BUS-8.494
-8.455
Token?
Token ID30
Feature activation+9.29e-01

Pos logit contributions
adder+1.463
adian+1.417
ortium+1.393
etting+1.359
monary+1.287
Neg logit contributions
 Bubble-1.174
spect-1.131
enough-1.110
-1.090
natureconservancy-1.079
Token And
Token ID843
Feature activation+1.43e-01

Pos logit contributions
 Bubble+2.239
spect+2.162
enough+2.102
+2.063
BUS+2.002
Neg logit contributions
adder-2.954
adian-2.879
ortium-2.824
etting-2.743
monary-2.640
Token that
Token ID326
Feature activation-2.32e-01

Pos logit contributions
 Bubble+0.392
spect+0.381
enough+0.372
+0.369
natureconservancy+0.362
Neg logit contributions
adder-0.407
adian-0.395
ortium-0.386
etting-0.372
monary-0.354
Token he
Token ID339
Feature activation-7.11e-02

Pos logit contributions
adder+0.714
adian+0.691
ortium+0.670
etting+0.649
monary+0.630
Neg logit contributions
 Bubble-0.575
spect-0.557
enough-0.544
-0.535
natureconservancy-0.530
Token health
Token ID1535
Feature activation-1.07e+00

Pos logit contributions
adder+3.862
adian+3.852
etting+3.708
ortium+3.674
 incur+3.471
Neg logit contributions
spect-3.840
enough-3.788
 Bubble-3.763
-3.666
natureconservancy-3.664
Token of
Token ID286
Feature activation+3.26e-02

Pos logit contributions
adder+3.129
adian+3.044
ortium+2.981
etting+2.888
monary+2.752
Neg logit contributions
 Bubble-2.860
spect-2.736
enough-2.709
-2.634
BUS-2.583
Token the
Token ID262
Feature activation-3.91e-01

Pos logit contributions
 Bubble+0.089
spect+0.087
enough+0.086
+0.083
natureconservancy+0.082
Neg logit contributions
adder-0.092
adian-0.090
ortium-0.088
etting-0.085
monary-0.081
Token economy
Token ID3773
Feature activation-4.99e-01

Pos logit contributions
adder+1.109
adian+1.089
ortium+1.050
etting+1.029
monary+0.974
Neg logit contributions
 Bubble-1.043
spect-1.031
enough-1.019
-0.985
natureconservancy-0.984
Token,
Token ID11
Feature activation+9.88e-02

Pos logit contributions
adder+1.235
adian+1.184
ortium+1.162
etting+1.119
monary+1.058
Neg logit contributions
 Bubble-1.537
spect-1.488
enough-1.469
-1.453
natureconservancy-1.426
Token unemployment
Token ID10681
Feature activation-3.07e+00

Pos logit contributions
 Bubble+0.287
spect+0.278
enough+0.273
+0.267
BUS+0.261
Neg logit contributions
adder-0.265
adian-0.258
ortium-0.252
etting-0.243
monary-0.233
Token and
Token ID290
Feature activation-1.15e-01

Pos logit contributions
adder+6.731
adian+6.146
etting+6.118
 incur+5.811
ortium+5.710
Neg logit contributions
 Bubble-9.108
enough-9.090
natureconservancy-8.942
-8.709
rollers-8.665
Token wages
Token ID9400
Feature activation-1.29e+00

Pos logit contributions
adder+0.322
adian+0.313
ortium+0.304
etting+0.297
monary+0.283
Neg logit contributions
 Bubble-0.316
spect-0.310
enough-0.303
-0.297
natureconservancy-0.291
Token will
Token ID481
Feature activation-2.63e-01

Pos logit contributions
adder+2.846
adian+2.740
etting+2.683
 incur+2.572
ortium+2.552
Neg logit contributions
 Bubble-4.109
spect-4.047
-4.030
enough-3.979
natureconservancy-3.955
Token have
Token ID423
Feature activation+1.99e-01

Pos logit contributions
adder+0.679
adian+0.667
ortium+0.627
etting+0.626
 incur+0.605
Neg logit contributions
 Bubble-0.781
spect-0.765
-0.750
enough-0.746
natureconservancy-0.726
Token an
Token ID281
Feature activation+5.03e-02

Pos logit contributions
 Bubble+0.586
spect+0.567
enough+0.556
+0.546
BUS+0.534
Neg logit contributions
adder-0.524
adian-0.509
ortium-0.492
etting-0.476
monary-0.458
Token cable
Token ID7862
Feature activation-1.73e+00

Pos logit contributions
adder+3.750
adian+3.626
ortium+3.533
etting+3.487
 incur+3.240
Neg logit contributions
 Bubble-3.576
-3.461
spect-3.413
enough-3.410
natureconservancy-3.347
Token traps
Token ID20348
Feature activation-7.85e-01

Pos logit contributions
adder+4.320
adian+4.154
ortium+3.991
etting+3.908
monary+3.640
Neg logit contributions
 Bubble-5.043
spect-4.930
-4.912
enough-4.816
natureconservancy-4.795
Token and
Token ID290
Feature activation-3.72e-01

Pos logit contributions
adder+1.995
adian+1.935
ortium+1.886
etting+1.847
monary+1.707
Neg logit contributions
 Bubble-2.352
spect-2.287
-2.246
enough-2.231
natureconservancy-2.202
Token an
Token ID281
Feature activation-2.80e-01

Pos logit contributions
adder+1.013
adian+0.995
ortium+0.961
etting+0.932
monary+0.893
Neg logit contributions
 Bubble-1.035
spect-1.014
enough-0.996
-0.985
natureconservancy-0.953
Token animal
Token ID5044
Feature activation-7.79e-01

Pos logit contributions
adder+0.707
adian+0.695
ortium+0.668
etting+0.646
monary+0.619
Neg logit contributions
 Bubble-0.853
spect-0.822
enough-0.810
-0.804
BUS-0.781
Token carc
Token ID36756
Feature activation-3.05e+00

Pos logit contributions
adder+2.101
adian+2.068
ortium+1.958
etting+1.890
monary+1.848
Neg logit contributions
 Bubble-2.212
spect-2.151
enough-2.116
-2.112
natureconservancy-2.022
Tokenass
Token ID562
Feature activation-9.23e-01

Pos logit contributions
adder+6.309
adian+5.717
ortium+5.374
ofer+5.273
etting+4.903
Neg logit contributions
-11.113
 Bubble-10.884
▬▬-10.385
BUS-10.205
natureconservancy-10.133
Token as
Token ID355
Feature activation-1.99e-01

Pos logit contributions
adder+2.415
adian+2.290
ortium+2.231
etting+2.166
monary+2.046
Neg logit contributions
 Bubble-2.726
spect-2.616
-2.611
enough-2.600
natureconservancy-2.550
Token bait
Token ID26536
Feature activation-1.11e+00

Pos logit contributions
adder+0.528
adian+0.507
ortium+0.491
etting+0.480
monary+0.455
Neg logit contributions
 Bubble-0.585
spect-0.565
-0.555
enough-0.554
BUS-0.538
Token,
Token ID11
Feature activation-5.05e-01

Pos logit contributions
adder+2.840
adian+2.721
ortium+2.655
etting+2.587
monary+2.422
Neg logit contributions
 Bubble-3.290
spect-3.183
-3.120
enough-3.092
natureconservancy-3.045
Token they
Token ID484
Feature activation+5.16e-01

Pos logit contributions
adder+1.408
ortium+1.361
adian+1.360
etting+1.265
monary+1.230
Neg logit contributions
 Bubble-1.378
spect-1.333
-1.326
enough-1.305
natureconservancy-1.266
Token who
Token ID508
Feature activation+7.09e-01

Pos logit contributions
 Bubble+1.935
spect+1.871
enough+1.828
+1.797
BUS+1.755
Neg logit contributions
adder-1.807
adian-1.760
ortium-1.722
etting-1.660
monary-1.589
Token say
Token ID910
Feature activation+6.75e-01

Pos logit contributions
 Bubble+2.148
spect+2.081
enough+2.036
+2.002
BUS+1.957
Neg logit contributions
adder-1.818
adian-1.768
ortium-1.727
etting-1.660
monary-1.588
Token there
Token ID612
Feature activation-3.81e-01

Pos logit contributions
 Bubble+1.655
spect+1.591
enough+1.547
+1.516
BUS+1.475
Neg logit contributions
adder-2.118
adian-2.070
ortium-2.032
etting-1.970
monary-1.899
Token is
Token ID318
Feature activation+5.50e-01

Pos logit contributions
adder+1.077
adian+1.029
ortium+1.019
etting+1.005
monary+0.946
Neg logit contributions
 Bubble-1.051
spect-1.007
-0.973
enough-0.973
natureconservancy-0.958
Token a
Token ID257
Feature activation+3.94e-01

Pos logit contributions
 Bubble+1.594
spect+1.543
enough+1.494
+1.485
rollers+1.451
Neg logit contributions
adder-1.478
adian-1.440
ortium-1.406
etting-1.363
monary-1.295
Token need
Token ID761
Feature activation-3.03e+00

Pos logit contributions
 Bubble+1.096
spect+1.063
enough+1.034
+1.017
BUS+1.003
Neg logit contributions
adder-1.097
adian-1.071
ortium-1.048
etting-1.017
monary-0.969
Token for
Token ID329
Feature activation-1.08e+00

Pos logit contributions
adder+6.348
adian+6.179
etting+5.999
ortium+5.972
 incur+5.466
Neg logit contributions
 Bubble-9.683
spect-9.328
BUS-9.030
enough-9.019
-8.976
Token political
Token ID1964
Feature activation-3.08e-01

Pos logit contributions
adder+2.950
adian+2.888
ortium+2.793
etting+2.769
 incur+2.605
Neg logit contributions
 Bubble-2.895
spect-2.809
enough-2.751
-2.724
rollers-2.717
Token compromise
Token ID13110
Feature activation+6.86e-01

Pos logit contributions
adder+0.834
adian+0.814
etting+0.786
ortium+0.777
monary+0.729
Neg logit contributions
 Bubble-0.868
spect-0.844
enough-0.836
-0.822
BUS-0.798
Token in
Token ID287
Feature activation-5.12e-02

Pos logit contributions
 Bubble+1.973
spect+1.908
enough+1.864
+1.828
BUS+1.784
Neg logit contributions
adder-1.862
adian-1.814
ortium-1.777
etting-1.704
monary-1.641
Token Spain
Token ID8602
Feature activation+6.91e-02

Pos logit contributions
adder+0.142
adian+0.140
ortium+0.136
etting+0.133
monary+0.125
Neg logit contributions
 Bubble-0.142
spect-0.136
enough-0.134
-0.132
BUS-0.131
Token to
Token ID284
Feature activation+7.02e-01

Pos logit contributions
adder+0.197
adian+0.192
ortium+0.188
etting+0.181
monary+0.174
Neg logit contributions
 Bubble-0.214
spect-0.207
enough-0.203
-0.199
BUS-0.194
Token call
Token ID869
Feature activation-9.06e-01

Pos logit contributions
 Bubble+2.092
spect+2.028
enough+1.982
+1.958
BUS+1.907
Neg logit contributions
adder-1.836
adian-1.788
ortium-1.738
etting-1.681
monary-1.604
Token it
Token ID340
Feature activation+5.65e-01

Pos logit contributions
adder+2.358
adian+2.294
ortium+2.195
etting+2.162
monary+2.063
Neg logit contributions
 Bubble-2.611
spect-2.523
-2.513
natureconservancy-2.493
enough-2.440
Token a
Token ID257
Feature activation-9.63e-02

Pos logit contributions
 Bubble+1.496
spect+1.446
enough+1.398
+1.384
natureconservancy+1.360
Neg logit contributions
adder-1.657
adian-1.618
ortium-1.583
etting-1.539
monary-1.474
Token terrorist
Token ID7417
Feature activation-2.20e+00

Pos logit contributions
adder+0.247
adian+0.245
ortium+0.237
etting+0.232
monary+0.219
Neg logit contributions
 Bubble-0.284
spect-0.277
enough-0.268
-0.267
natureconservancy-0.263
Token attack
Token ID1368
Feature activation-3.00e+00

Pos logit contributions
adder+3.312
adian+3.243
etting+3.026
ortium+2.900
 incur+2.680
Neg logit contributions
 Bubble-8.283
natureconservancy-8.149
spect-8.098
BUS-8.014
-7.996
Token,
Token ID11
Feature activation-5.75e-01

Pos logit contributions
adder+6.457
etting+6.111
adian+5.970
ortium+5.880
 incur+5.654
Neg logit contributions
 Bubble-9.111
spect-8.952
natureconservancy-8.884
-8.706
enough-8.632
Token for
Token ID329
Feature activation-6.83e-01

Pos logit contributions
adder+1.696
adian+1.653
ortium+1.603
etting+1.566
monary+1.490
Neg logit contributions
 Bubble-1.489
spect-1.432
-1.414
enough-1.401
natureconservancy-1.395
Token example
Token ID1672
Feature activation-6.15e-01

Pos logit contributions
adder+1.105
adian+1.050
ortium+0.996
etting+0.974
monary+0.855
Neg logit contributions
 Bubble-2.683
spect-2.605
-2.575
enough-2.562
natureconservancy-2.551
Token,
Token ID11
Feature activation-6.90e-01

Pos logit contributions
adder+1.756
adian+1.712
ortium+1.680
etting+1.616
monary+1.559
Neg logit contributions
 Bubble-1.689
spect-1.631
enough-1.592
-1.563
BUS-1.521
Token Trump
Token ID1301
Feature activation+3.44e-01

Pos logit contributions
adder+2.067
adian+2.016
ortium+1.948
etting+1.913
monary+1.816
Neg logit contributions
 Bubble-1.753
spect-1.688
-1.663
enough-1.653
natureconservancy-1.636
Token the
Token ID262
Feature activation-3.07e-01

Pos logit contributions
 Bubble+2.435
spect+2.347
enough+2.290
+2.246
BUS+2.190
Neg logit contributions
adder-2.633
adian-2.570
ortium-2.519
etting-2.434
monary-2.340
Token 3
Token ID513
Feature activation-7.89e-01

Pos logit contributions
adder+0.804
adian+0.796
ortium+0.765
etting+0.749
monary+0.708
Neg logit contributions
 Bubble-0.874
spect-0.866
enough-0.849
-0.840
natureconservancy-0.823
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
adian+2.275
adder+2.247
ortium+2.166
etting+2.139
monary+2.005
Neg logit contributions
 Bubble-2.051
spect-1.994
enough-1.966
-1.959
natureconservancy-1.904
Token2
Token ID17
Feature activation+2.03e-01

Pos logit contributions
adder+2.468
adian+2.415
ortium+2.284
etting+2.231
monary+2.080
Neg logit contributions
 Bubble-3.643
spect-3.481
enough-3.455
-3.444
rollers-3.320
Token percent
Token ID1411
Feature activation+1.38e-01

Pos logit contributions
 Bubble+0.469
spect+0.450
enough+0.438
+0.430
natureconservancy+0.415
Neg logit contributions
adder-0.664
adian-0.655
ortium-0.639
etting-0.622
monary-0.600
Token unemployment
Token ID10681
Feature activation-2.99e+00

Pos logit contributions
 Bubble+0.412
spect+0.403
enough+0.396
+0.392
natureconservancy+0.386
Neg logit contributions
adder-0.351
adian-0.347
ortium-0.332
etting-0.323
monary-0.307
Token rate
Token ID2494
Feature activation-1.32e+00

Pos logit contributions
adder+4.178
adian+3.855
etting+3.396
 recall+3.250
 incur+3.200
Neg logit contributions
natureconservancy-11.469
enough-11.237
-11.066
spect-10.951
BUS-10.949
Token is
Token ID318
Feature activation-4.83e-01

Pos logit contributions
adder+3.476
adian+3.412
ortium+3.193
etting+3.168
 incur+3.037
Neg logit contributions
spect-3.656
 Bubble-3.643
enough-3.583
-3.546
natureconservancy-3.536
Token forcing
Token ID10833
Feature activation-4.58e-01

Pos logit contributions
adder+1.255
adian+1.239
ortium+1.170
etting+1.151
 incur+1.095
Neg logit contributions
 Bubble-1.393
spect-1.365
-1.332
enough-1.331
natureconservancy-1.306
Token companies
Token ID2706
Feature activation+8.42e-01

Pos logit contributions
adder+1.226
adian+1.205
ortium+1.156
etting+1.118
 incur+1.071
Neg logit contributions
 Bubble-1.278
spect-1.261
-1.235
enough-1.228
natureconservancy-1.214
Token to
Token ID284
Feature activation+6.95e-01

Pos logit contributions
 Bubble+2.436
spect+2.350
enough+2.297
+2.256
BUS+2.207
Neg logit contributions
adder-2.266
adian-2.208
ortium-2.172
etting-2.081
monary-2.009
Token create
Token ID2251
Feature activation-3.06e-01

Pos logit contributions
 Bubble+0.247
spect+0.241
enough+0.237
+0.234
natureconservancy+0.229
Neg logit contributions
adder-0.195
adian-0.191
ortium-0.184
etting-0.179
monary-0.168
Token a
Token ID257
Feature activation+1.84e-01

Pos logit contributions
adder+0.846
adian+0.828
ortium+0.813
etting+0.777
monary+0.746
Neg logit contributions
 Bubble-0.845
spect-0.821
enough-0.810
-0.797
natureconservancy-0.789
Token custom
Token ID2183
Feature activation+2.62e-01

Pos logit contributions
 Bubble+0.517
spect+0.503
enough+0.496
+0.487
natureconservancy+0.478
Neg logit contributions
adder-0.503
adian-0.492
ortium-0.482
etting-0.464
monary-0.443
Token table
Token ID3084
Feature activation+9.49e-02

Pos logit contributions
 Bubble+0.735
spect+0.714
enough+0.700
+0.689
BUS+0.672
Neg logit contributions
adder-0.723
adian-0.705
ortium-0.690
etting-0.665
monary-0.637
Token for
Token ID329
Feature activation-4.06e-01

Pos logit contributions
 Bubble+0.273
spect+0.264
enough+0.259
+0.255
natureconservancy+0.250
Neg logit contributions
adder-0.257
adian-0.251
ortium-0.245
etting-0.237
monary-0.226
Token myself
Token ID3589
Feature activation-2.98e+00

Pos logit contributions
adder+1.147
adian+1.129
ortium+1.095
etting+1.066
monary+1.016
Neg logit contributions
 Bubble-1.084
spect-1.059
enough-1.046
natureconservancy-1.023
-1.022
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
adder+6.651
adian+6.517
ortium+6.368
etting+6.270
 incur+5.598
Neg logit contributions
natureconservancy-8.661
 Bubble-8.649
enough-8.300
spect-8.201
-8.154
Token I
Token ID314
Feature activation-1.39e+00

Pos logit contributions
adder+0.693
adian+0.672
ortium+0.661
etting+0.643
monary+0.604
Neg logit contributions
 Bubble-0.620
spect-0.610
enough-0.599
-0.584
natureconservancy-0.581
Token decided
Token ID3066
Feature activation-4.31e-01

Pos logit contributions
adder+3.294
adian+3.197
ortium+3.187
etting+3.017
 incur+2.849
Neg logit contributions
 Bubble-4.299
spect-4.228
enough-4.150
-4.149
natureconservancy-4.088
Token to
Token ID284
Feature activation-1.31e-01

Pos logit contributions
adder+1.150
adian+1.120
ortium+1.095
etting+1.055
monary+1.010
Neg logit contributions
 Bubble-1.257
spect-1.216
enough-1.191
-1.168
BUS-1.144
Token use
Token ID779
Feature activation-1.20e+00

Pos logit contributions
adder+0.318
adian+0.311
ortium+0.301
etting+0.292
monary+0.275
Neg logit contributions
 Bubble-0.410
spect-0.401
enough-0.392
-0.390
BUS-0.379
Token 10
Token ID838
Feature activation-1.26e-01

Pos logit contributions
 Bubble+3.161
spect+3.041
enough+2.966
+2.878
BUS+2.828
Neg logit contributions
adder-3.677
adian-3.589
ortium-3.534
etting-3.414
monary-3.294
Token lakh
Token ID29002
Feature activation-1.66e-01

Pos logit contributions
adder+0.284
adian+0.277
ortium+0.270
etting+0.256
monary+0.243
Neg logit contributions
 Bubble-0.416
spect-0.401
enough-0.397
-0.391
natureconservancy-0.388
Token in
Token ID287
Feature activation+1.63e-01

Pos logit contributions
adder+0.447
adian+0.434
ortium+0.428
etting+0.410
monary+0.391
Neg logit contributions
 Bubble-0.478
spect-0.458
enough-0.453
-0.451
natureconservancy-0.446
Token road
Token ID2975
Feature activation-1.00e+00

Pos logit contributions
 Bubble+0.458
spect+0.442
enough+0.440
+0.431
natureconservancy+0.426
Neg logit contributions
adder-0.445
adian-0.438
ortium-0.428
etting-0.413
monary-0.396
Token accidents
Token ID17390
Feature activation-1.31e+00

Pos logit contributions
adder+1.973
adian+1.876
ortium+1.842
etting+1.822
 incur+1.708
Neg logit contributions
 Bubble-3.526
enough-3.425
spect-3.410
natureconservancy-3.400
-3.357
Token fatalities
Token ID30091
Feature activation-2.97e+00

Pos logit contributions
adder+3.230
adian+3.097
ortium+3.070
etting+3.021
 incur+2.897
Neg logit contributions
 Bubble-3.904
enough-3.780
natureconservancy-3.739
spect-3.732
-3.713
Token.
Token ID13
Feature activation-4.27e-01

Pos logit contributions
adder+6.595
ortium+6.216
etting+6.179
adian+6.072
 incur+6.056
Neg logit contributions
 Bubble-8.851
natureconservancy-8.785
enough-8.634
-8.622
▬▬-8.344
Token
Token ID198
Feature activation-9.31e-01

Pos logit contributions
adder+1.258
adian+1.215
ortium+1.194
etting+1.158
monary+1.111
Neg logit contributions
 Bubble-1.103
spect-1.070
enough-1.052
natureconservancy-1.024
-1.023
Token
Token ID198
Feature activation-5.91e-01

Pos logit contributions
adder+2.299
adian+2.137
etting+2.034
ortium+2.010
arger+1.914
Neg logit contributions
-2.807
 Bubble-2.791
spect-2.648
enough-2.642
▬▬-2.596
TokenThe
Token ID464
Feature activation-5.24e-01

Pos logit contributions
adder+1.679
adian+1.606
ortium+1.550
etting+1.536
monary+1.472
Neg logit contributions
 Bubble-1.635
spect-1.509
enough-1.486
natureconservancy-1.470
-1.463
Token Bill
Token ID3941
Feature activation+5.64e-01

Pos logit contributions
adder+1.417
adian+1.384
ortium+1.350
etting+1.312
monary+1.263
Neg logit contributions
 Bubble-1.473
spect-1.429
enough-1.407
-1.388
natureconservancy-1.371
Token to
Token ID284
Feature activation-2.48e-01

Pos logit contributions
adder+1.782
adian+1.730
ortium+1.692
etting+1.630
monary+1.536
Neg logit contributions
 Bubble-2.225
spect-2.133
enough-2.101
-2.078
BUS-2.039
Token reduce
Token ID4646
Feature activation-8.80e-01

Pos logit contributions
adder+0.595
adian+0.578
ortium+0.550
etting+0.547
monary+0.511
Neg logit contributions
 Bubble-0.791
spect-0.764
enough-0.750
-0.750
natureconservancy-0.725
Token the
Token ID262
Feature activation-6.70e-01

Pos logit contributions
adder+2.219
adian+2.133
ortium+2.076
etting+2.043
monary+1.968
Neg logit contributions
 Bubble-2.621
-2.507
spect-2.504
enough-2.492
natureconservancy-2.482
Token accidents
Token ID17390
Feature activation-7.41e-01

Pos logit contributions
adder+1.478
adian+1.444
ortium+1.395
etting+1.372
monary+1.298
Neg logit contributions
 Bubble-2.210
spect-2.131
-2.110
enough-2.097
natureconservancy-2.071
Token and
Token ID290
Feature activation-7.85e-01

Pos logit contributions
adder+1.858
adian+1.786
ortium+1.740
etting+1.730
monary+1.612
Neg logit contributions
 Bubble-2.233
spect-2.148
-2.136
enough-2.133
natureconservancy-2.100
Token fatalities
Token ID30091
Feature activation-2.96e+00

Pos logit contributions
adder+1.887
adian+1.834
etting+1.729
ortium+1.717
 incur+1.685
Neg logit contributions
 Bubble-2.457
spect-2.366
-2.361
enough-2.336
natureconservancy-2.297
Token by
Token ID416
Feature activation+1.20e-01

Pos logit contributions
adder+6.454
etting+6.150
adian+5.953
 incur+5.846
ortium+5.757
Neg logit contributions
 Bubble-8.870
-8.830
natureconservancy-8.691
enough-8.514
▬▬-8.388
Token 50
Token ID2026
Feature activation+5.78e-01

Pos logit contributions
 Bubble+0.372
spect+0.360
enough+0.352
+0.350
natureconservancy+0.342
Neg logit contributions
adder-0.301
adian-0.290
ortium-0.280
etting-0.275
monary-0.260
Token%
Token ID4
Feature activation+4.64e-01

Pos logit contributions
 Bubble+1.344
spect+1.288
enough+1.254
+1.226
BUS+1.188
Neg logit contributions
adder-1.887
adian-1.847
ortium-1.813
etting-1.759
monary-1.701
Token in
Token ID287
Feature activation-3.56e-01

Pos logit contributions
 Bubble+1.372
spect+1.327
enough+1.299
+1.279
BUS+1.248
Neg logit contributions
adder-1.224
adian-1.191
ortium-1.164
etting-1.122
monary-1.074
Token five
Token ID1936
Feature activation-3.51e-01

Pos logit contributions
adder+0.935
adian+0.915
ortium+0.891
etting+0.869
monary+0.831
Neg logit contributions
 Bubble-1.021
enough-0.981
-0.975
spect-0.975
natureconservancy-0.963
Token up
Token ID510
Feature activation+2.46e-02

Pos logit contributions
adder+0.582
adian+0.565
ortium+0.558
etting+0.534
monary+0.507
Neg logit contributions
 Bubble-0.634
spect-0.624
enough-0.615
-0.605
natureconservancy-0.598
Token Sam
Token ID3409
Feature activation+9.13e-01

Pos logit contributions
 Bubble+0.065
spect+0.063
enough+0.062
+0.061
natureconservancy+0.059
Neg logit contributions
adder-0.072
adian-0.070
ortium-0.069
etting-0.066
monary-0.064
Token
Token ID447
Feature activation+9.89e-02

Pos logit contributions
 Bubble+2.348
spect+2.261
enough+2.207
+2.168
BUS+2.106
Neg logit contributions
adder-2.761
adian-2.699
ortium-2.642
etting-2.558
monary-2.461
Token
Token ID247
Feature activation-6.61e-02

Pos logit contributions
 Bubble+0.345
spect+0.336
enough+0.330
+0.325
rollers+0.319
Neg logit contributions
adder-0.207
adian-0.201
ortium-0.194
etting-0.184
monary-0.174
Tokens
Token ID82
Feature activation-2.43e-01

Pos logit contributions
adder+0.200
adian+0.196
ortium+0.190
etting+0.186
monary+0.177
Neg logit contributions
 Bubble-0.169
spect-0.161
enough-0.158
-0.156
natureconservancy-0.153
Token Burger
Token ID28983
Feature activation-2.94e+00

Pos logit contributions
adder+0.698
adian+0.688
ortium+0.659
etting+0.648
monary+0.618
Neg logit contributions
 Bubble-0.640
spect-0.627
enough-0.617
-0.612
natureconservancy-0.603
Token Joint
Token ID16798
Feature activation-7.31e-01

Pos logit contributions
adder+7.586
adian+7.475
ortium+7.045
etting+6.671
monary+6.573
Neg logit contributions
natureconservancy-7.631
spect-7.416
-7.406
enough-7.320
 Bubble-7.173
Token,
Token ID11
Feature activation-2.64e-01

Pos logit contributions
adder+1.857
adian+1.827
ortium+1.758
etting+1.694
monary+1.618
Neg logit contributions
 Bubble-2.140
spect-2.097
enough-2.057
natureconservancy-2.056
-2.046
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
adder+0.696
adian+0.678
ortium+0.655
etting+0.634
monary+0.598
Neg logit contributions
 Bubble-0.769
spect-0.748
enough-0.735
-0.726
natureconservancy-0.720
Token live
Token ID2107
Feature activation-9.21e-01

Pos logit contributions
adder+0.629
adian+0.620
ortium+0.602
etting+0.583
monary+0.553
Neg logit contributions
 Bubble-0.605
spect-0.590
enough-0.582
-0.574
natureconservancy-0.562
Token music
Token ID2647
Feature activation+5.17e-01

Pos logit contributions
adder+2.726
adian+2.640
ortium+2.528
etting+2.520
monary+2.380
Neg logit contributions
 Bubble-2.307
enough-2.245
spect-2.212
-2.185
natureconservancy-2.181
Token always
Token ID1464
Feature activation-6.36e-01

Pos logit contributions
adder+0.314
adian+0.310
ortium+0.290
etting+0.287
monary+0.272
Neg logit contributions
 Bubble-0.358
spect-0.350
-0.344
enough-0.342
natureconservancy-0.332
Token bury
Token ID30033
Feature activation-4.45e-01

Pos logit contributions
adder+1.609
adian+1.595
ortium+1.457
etting+1.448
 incur+1.395
Neg logit contributions
 Bubble-1.894
spect-1.874
-1.873
enough-1.839
natureconservancy-1.808
Token them
Token ID606
Feature activation+6.08e-01

Pos logit contributions
adder+1.287
adian+1.226
ortium+1.186
etting+1.163
monary+1.111
Neg logit contributions
 Bubble-1.173
spect-1.140
-1.126
enough-1.113
BUS-1.102
Token in
Token ID287
Feature activation+2.62e-01

Pos logit contributions
 Bubble+1.759
spect+1.707
enough+1.665
+1.653
BUS+1.606
Neg logit contributions
adder-1.642
adian-1.589
ortium-1.551
etting-1.508
monary-1.432
Token a
Token ID257
Feature activation+3.95e-01

Pos logit contributions
 Bubble+0.672
spect+0.653
enough+0.637
+0.630
BUS+0.617
Neg logit contributions
adder-0.788
adian-0.763
ortium-0.743
etting-0.724
monary-0.696
Token bur
Token ID4356
Feature activation-2.92e+00

Pos logit contributions
 Bubble+1.054
spect+1.036
enough+1.008
+0.998
BUS+0.980
Neg logit contributions
adder-1.147
adian-1.110
ortium-1.082
etting-1.052
monary-1.013
Tokenrito
Token ID39834
Feature activation-1.23e+00

Pos logit contributions
adder+5.945
adian+5.163
ortium+4.858
etting+4.570
arger+4.490
Neg logit contributions
-10.290
▬▬-9.668
 Bubble-9.667
BUS-9.504
natureconservancy-9.242
Token or
Token ID393
Feature activation-1.11e+00

Pos logit contributions
adder+3.135
adian+2.965
ortium+2.918
etting+2.875
 incur+2.676
Neg logit contributions
 Bubble-3.559
-3.534
spect-3.524
natureconservancy-3.516
enough-3.445
Token peanut
Token ID26636
Feature activation-5.51e-01

Pos logit contributions
adder+3.150
adian+3.056
ortium+2.897
etting+2.813
 incur+2.797
Neg logit contributions
spect-2.911
 Bubble-2.893
-2.875
natureconservancy-2.790
enough-2.789
Token butter
Token ID9215
Feature activation-4.74e-01

Pos logit contributions
adder+1.829
adian+1.782
ortium+1.748
etting+1.678
monary+1.630
Neg logit contributions
 Bubble-1.221
spect-1.183
-1.162
enough-1.162
natureconservancy-1.130
Token sandwich
Token ID20433
Feature activation-1.87e+00

Pos logit contributions
adder+1.255
adian+1.208
ortium+1.186
etting+1.133
monary+1.074
Neg logit contributions
 Bubble-1.363
spect-1.350
-1.324
enough-1.320
natureconservancy-1.288
Token 9
Token ID860
Feature activation-9.49e-02

Pos logit contributions
 Bubble+0.212
spect+0.204
enough+0.200
+0.199
BUS+0.193
Neg logit contributions
adder-0.221
adian-0.215
ortium-0.209
etting-0.203
monary-0.195
Token,
Token ID11
Feature activation-2.68e-01

Pos logit contributions
adder+0.246
adian+0.241
ortium+0.231
etting+0.226
monary+0.216
Neg logit contributions
 Bubble-0.283
spect-0.272
-0.270
enough-0.268
rollers-0.258
Token000
Token ID830
Feature activation-1.67e-01

Pos logit contributions
adder+0.676
adian+0.658
ortium+0.641
etting+0.617
monary+0.589
Neg logit contributions
 Bubble-0.819
spect-0.792
enough-0.776
-0.767
BUS-0.748
Token extra
Token ID3131
Feature activation-2.03e-01

Pos logit contributions
adder+0.419
adian+0.415
ortium+0.400
etting+0.386
monary+0.371
Neg logit contributions
 Bubble-0.506
spect-0.491
enough-0.481
-0.479
natureconservancy-0.468
Token cancer
Token ID4890
Feature activation-2.12e+00

Pos logit contributions
adder+0.474
adian+0.473
ortium+0.454
etting+0.439
monary+0.419
Neg logit contributions
 Bubble-0.652
spect-0.632
enough-0.623
-0.620
natureconservancy-0.601
Token deaths
Token ID7040
Feature activation-2.91e+00

Pos logit contributions
adder+4.241
adian+4.133
ortium+3.795
etting+3.710
 incur+3.579
Neg logit contributions
 Bubble-7.230
-6.900
enough-6.850
natureconservancy-6.739
spect-6.732
Token from
Token ID422
Feature activation-9.01e-01

Pos logit contributions
adder+6.158
adian+5.896
ortium+5.737
etting+5.717
 incur+5.581
Neg logit contributions
 Bubble-9.071
-8.824
natureconservancy-8.686
enough-8.380
BUS-8.312
Token Chern
Token ID37243
Feature activation-2.33e+00

Pos logit contributions
adder+2.466
adian+2.426
ortium+2.364
etting+2.299
monary+2.229
Neg logit contributions
 Bubble-2.450
spect-2.366
-2.338
enough-2.337
natureconservancy-2.322
Tokenobyl
Token ID46666
Feature activation-1.49e+00

Pos logit contributions
adian+2.309
adder+2.277
ortium+2.102
etting+2.001
omatic+1.887
Neg logit contributions
-10.612
▬▬-10.106
 Bubble-10.101
natureconservancy-10.098
spect-9.840
Token,
Token ID11
Feature activation-1.73e-01

Pos logit contributions
adian+3.777
adder+3.758
ortium+3.667
etting+3.544
monary+3.476
Neg logit contributions
 Bubble-4.191
-4.047
spect-4.027
enough-4.013
natureconservancy-3.977
Token but
Token ID475
Feature activation+4.56e-01

Pos logit contributions
adder+0.430
adian+0.420
ortium+0.408
etting+0.392
monary+0.375
Neg logit contributions
 Bubble-0.531
spect-0.512
enough-0.499
-0.499
natureconservancy-0.488
Token book
Token ID1492
Feature activation+1.86e-01

Pos logit contributions
 Bubble+0.025
spect+0.024
enough+0.023
+0.023
BUS+0.023
Neg logit contributions
adder-0.025
adian-0.025
ortium-0.024
etting-0.023
monary-0.022
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 Bubble+0.475
spect+0.458
enough+0.448
+0.440
BUS+0.429
Neg logit contributions
adder-0.565
adian-0.551
ortium-0.541
etting-0.524
monary-0.503
Token
Token ID564
Feature activation-2.36e-01

Pos logit contributions
adder+0.448
adian+0.437
ortium+0.427
etting+0.414
monary+0.394
Neg logit contributions
 Bubble-0.428
spect-0.417
enough-0.410
-0.404
BUS-0.395
Token
Token ID250
Feature activation-7.60e-01

Pos logit contributions
adder+0.613
adian+0.596
ortium+0.580
etting+0.559
monary+0.534
Neg logit contributions
 Bubble-0.701
spect-0.678
enough-0.664
-0.654
rollers-0.640
TokenM
Token ID44
Feature activation-1.03e+00

Pos logit contributions
adder+2.292
adian+2.193
ortium+2.151
etting+2.087
monary+2.007
Neg logit contributions
 Bubble-1.919
spect-1.806
-1.791
enough-1.772
natureconservancy-1.740
Tokenurd
Token ID2799
Feature activation-2.90e+00

Pos logit contributions
adder+2.889
adian+2.773
ortium+2.699
etting+2.610
monary+2.443
Neg logit contributions
 Bubble-2.901
-2.755
spect-2.733
enough-2.703
natureconservancy-2.635
Tokeners
Token ID364
Feature activation-1.49e+00

Pos logit contributions
adder+7.596
adian+7.260
etting+6.944
ortium+6.792
arger+6.463
Neg logit contributions
 Bubble-8.003
-7.791
BUS-7.582
spect-7.479
▬▬-7.299
Token on
Token ID319
Feature activation-1.04e+00

Pos logit contributions
adder+4.012
adian+3.788
ortium+3.784
etting+3.751
monary+3.360
Neg logit contributions
 Bubble-4.164
enough-4.066
spect-4.049
-4.031
natureconservancy-3.960
Token Al
Token ID978
Feature activation-2.53e-01

Pos logit contributions
adder+2.918
adian+2.823
ortium+2.794
etting+2.697
monary+2.543
Neg logit contributions
 Bubble-2.726
-2.672
spect-2.658
enough-2.657
natureconservancy-2.608
Tokencat
Token ID9246
Feature activation+5.68e-01

Pos logit contributions
adder+0.697
adian+0.676
ortium+0.653
etting+0.630
monary+0.612
Neg logit contributions
 Bubble-0.725
-0.689
spect-0.684
natureconservancy-0.666
enough-0.662
Tokenraz
Token ID3247
Feature activation+9.85e-02

Pos logit contributions
 Bubble+1.831
spect+1.776
enough+1.738
+1.715
BUS+1.682
Neg logit contributions
adder-1.348
adian-1.309
ortium-1.273
etting-1.222
monary-1.162
Token significant
Token ID2383
Feature activation-1.12e+00

Pos logit contributions
adder+1.575
adian+1.555
ortium+1.489
etting+1.461
monary+1.386
Neg logit contributions
 Bubble-1.809
spect-1.775
enough-1.736
-1.711
BUS-1.681
Token reduction
Token ID7741
Feature activation-1.75e+00

Pos logit contributions
adder+2.612
adian+2.600
ortium+2.456
etting+2.419
 incur+2.298
Neg logit contributions
spect-3.450
 Bubble-3.445
enough-3.336
-3.335
rollers-3.277
Token in
Token ID287
Feature activation-2.95e-01

Pos logit contributions
adder+4.143
adian+4.113
ortium+3.937
etting+3.883
monary+3.664
Neg logit contributions
 Bubble-5.370
spect-5.184
enough-5.124
-4.983
BUS-4.942
Token greenhouse
Token ID16325
Feature activation-2.51e+00

Pos logit contributions
adder+0.782
adian+0.772
ortium+0.750
etting+0.726
monary+0.699
Neg logit contributions
 Bubble-0.845
spect-0.825
enough-0.810
-0.791
natureconservancy-0.779
Token gas
Token ID3623
Feature activation-6.36e-01

Pos logit contributions
adian+4.719
adder+4.581
etting+4.432
ortium+4.348
monary+4.006
Neg logit contributions
 Bubble-8.683
natureconservancy-8.323
enough-8.242
-8.222
spect-8.217
Token emissions
Token ID8971
Feature activation-2.89e+00

Pos logit contributions
adder+1.154
adian+1.135
ortium+1.068
etting+1.057
monary+0.983
Neg logit contributions
 Bubble-2.356
spect-2.295
enough-2.257
-2.242
natureconservancy-2.199
Token.
Token ID13
Feature activation+2.11e-01

Pos logit contributions
adder+6.398
etting+6.281
adian+6.270
ortium+6.047
 incur+5.835
Neg logit contributions
 Bubble-8.394
spect-8.123
-8.034
BUS-7.887
natureconservancy-7.882
Token While
Token ID2893
Feature activation-6.32e-02

Pos logit contributions
 Bubble+0.542
spect+0.527
enough+0.516
+0.506
BUS+0.494
Neg logit contributions
adder-0.628
adian-0.613
ortium-0.599
etting-0.583
monary-0.559
Token cleaner
Token ID21723
Feature activation-8.41e-01

Pos logit contributions
adder+0.178
adian+0.174
ortium+0.168
etting+0.164
monary+0.157
Neg logit contributions
 Bubble-0.174
spect-0.169
enough-0.165
-0.162
natureconservancy-0.159
Token at
Token ID379
Feature activation+8.37e-01

Pos logit contributions
adder+2.260
adian+2.221
ortium+2.138
etting+2.118
 incur+2.006
Neg logit contributions
 Bubble-2.296
spect-2.259
enough-2.197
natureconservancy-2.186
-2.176
Token the
Token ID262
Feature activation-1.90e-01

Pos logit contributions
 Bubble+2.307
spect+2.226
enough+2.175
+2.133
BUS+2.078
Neg logit contributions
adder-2.374
adian-2.311
ortium-2.269
etting-2.187
monary-2.104
Token still
Token ID991
Feature activation-4.93e-01

Pos logit contributions
 Bubble+1.492
spect+1.432
enough+1.389
+1.377
rollers+1.337
Neg logit contributions
adder-1.759
adian-1.722
ortium-1.678
etting-1.633
monary-1.563
Token space
Token ID2272
Feature activation-5.84e-01

Pos logit contributions
adder+1.450
adian+1.423
ortium+1.380
etting+1.350
 incur+1.288
Neg logit contributions
 Bubble-1.292
spect-1.220
-1.203
enough-1.177
rollers-1.172
Token,
Token ID11
Feature activation+4.00e-02

Pos logit contributions
adder+1.376
adian+1.341
ortium+1.263
etting+1.261
 incur+1.161
Neg logit contributions
 Bubble-1.873
spect-1.804
-1.772
enough-1.740
natureconservancy-1.733
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
 Bubble+0.115
spect+0.112
+0.109
enough+0.108
rollers+0.105
Neg logit contributions
adder-0.107
adian-0.105
ortium-0.102
etting-0.098
monary-0.093
Token a
Token ID257
Feature activation+8.37e-01

Pos logit contributions
adder+0.846
adian+0.833
ortium+0.800
etting+0.775
 incur+0.745
Neg logit contributions
 Bubble-0.794
spect-0.768
-0.744
enough-0.744
rollers-0.725
Token need
Token ID761
Feature activation-2.89e+00

Pos logit contributions
 Bubble+2.313
spect+2.236
enough+2.182
+2.143
BUS+2.089
Neg logit contributions
adder-2.367
adian-2.311
ortium-2.259
etting-2.182
monary-2.093
Token,
Token ID11
Feature activation-3.82e-01

Pos logit contributions
adder+6.705
adian+6.646
ortium+6.405
etting+6.238
 incur+5.756
Neg logit contributions
 Bubble-8.776
spect-8.290
-8.145
enough-8.094
BUS-7.956
Token for
Token ID329
Feature activation-9.50e-01

Pos logit contributions
adder+0.932
adian+0.909
ortium+0.880
etting+0.850
monary+0.799
Neg logit contributions
 Bubble-1.200
spect-1.156
-1.133
enough-1.126
BUS-1.101
Token recognition
Token ID9465
Feature activation-5.57e-01

Pos logit contributions
adder+2.588
adian+2.554
ortium+2.426
etting+2.354
 incur+2.285
Neg logit contributions
 Bubble-2.626
spect-2.509
enough-2.477
-2.462
rollers-2.398
Token of
Token ID286
Feature activation+2.26e-01

Pos logit contributions
adder+1.508
adian+1.479
ortium+1.422
etting+1.389
monary+1.318
Neg logit contributions
 Bubble-1.585
spect-1.518
enough-1.494
-1.489
BUS-1.448
Token slavery
Token ID13503
Feature activation+1.22e-01

Pos logit contributions
 Bubble+0.609
spect+0.586
enough+0.572
+0.563
BUS+0.549
Neg logit contributions
adder-0.655
adian-0.640
ortium-0.623
etting-0.602
monary-0.578
Token CALL
Token ID42815
Feature activation-1.80e+00

Pos logit contributions
adder+3.504
adian+3.428
etting+3.303
ortium+3.275
monary+3.109
Neg logit contributions
 Bubble-2.269
spect-2.174
enough-2.146
natureconservancy-2.098
-2.031
Token WAS
Token ID21725
Feature activation-4.04e-01

Pos logit contributions
adder+4.823
adian+4.755
etting+4.650
ortium+4.362
ofer+4.180
Neg logit contributions
spect-4.707
 Bubble-4.601
enough-4.574
natureconservancy-4.490
rollers-4.336
Token S
Token ID311
Feature activation-7.36e-01

Pos logit contributions
adder+1.169
adian+1.143
ortium+1.104
etting+1.104
monary+1.034
Neg logit contributions
 Bubble-1.045
spect-1.022
enough-1.001
natureconservancy-0.966
rollers-0.959
TokenENT
Token ID3525
Feature activation+2.89e-01

Pos logit contributions
adder+2.162
adian+2.101
ortium+2.042
etting+2.005
monary+1.890
Neg logit contributions
 Bubble-1.916
spect-1.847
enough-1.811
-1.767
natureconservancy-1.759
Token OUT
Token ID16289
Feature activation-1.37e-01

Pos logit contributions
 Bubble+0.786
spect+0.762
enough+0.745
+0.726
natureconservancy+0.714
Neg logit contributions
adder-0.826
adian-0.807
ortium-0.788
etting-0.768
monary-0.727
Token NEED
Token ID36465
Feature activation-2.88e+00

Pos logit contributions
adder+0.389
adian+0.382
ortium+0.372
etting+0.368
monary+0.343
Neg logit contributions
 Bubble-0.368
spect-0.357
enough-0.350
natureconservancy-0.339
-0.339
Token 100
Token ID1802
Feature activation+1.64e-01

Pos logit contributions
adian+6.885
etting+6.867
adder+6.839
ortium+6.295
ofer+6.164
Neg logit contributions
spect-7.654
 Bubble-7.567
natureconservancy-7.508
enough-7.431
rollers-7.207
Token%
Token ID4
Feature activation+2.50e-01

Pos logit contributions
 Bubble+0.429
spect+0.415
enough+0.405
+0.394
natureconservancy+0.387
Neg logit contributions
adder-0.482
adian-0.474
ortium-0.460
etting-0.448
monary-0.432
Token ACC
Token ID15859
Feature activation-5.25e-01

Pos logit contributions
 Bubble+0.632
spect+0.614
enough+0.595
+0.581
rollers+0.568
Neg logit contributions
adder-0.759
adian-0.743
ortium-0.725
etting-0.709
monary-0.678
TokenOUN
Token ID19385
Feature activation-1.08e-01

Pos logit contributions
adian+1.522
adder+1.522
etting+1.457
ortium+1.441
monary+1.320
Neg logit contributions
 Bubble-1.354
spect-1.324
enough-1.280
natureconservancy-1.246
-1.244
TokenTA
Token ID5603
Feature activation+5.08e-01

Pos logit contributions
adder+0.289
adian+0.288
ortium+0.277
etting+0.275
monary+0.254
Neg logit contributions
 Bubble-0.307
spect-0.287
enough-0.287
natureconservancy-0.279
-0.278
Token and
Token ID290
Feature activation-3.06e-01

Pos logit contributions
adder+3.308
adian+3.185
ortium+3.089
etting+3.012
 incur+2.845
Neg logit contributions
 Bubble-3.896
-3.696
spect-3.653
enough-3.653
natureconservancy-3.645
Token that
Token ID326
Feature activation-2.15e-01

Pos logit contributions
adder+0.833
adian+0.809
ortium+0.777
etting+0.757
monary+0.724
Neg logit contributions
 Bubble-0.871
spect-0.852
enough-0.826
-0.824
natureconservancy-0.806
Token there
Token ID612
Feature activation-4.52e-01

Pos logit contributions
adder+0.638
adian+0.624
ortium+0.605
etting+0.588
monary+0.567
Neg logit contributions
 Bubble-0.551
spect-0.537
enough-0.521
-0.518
natureconservancy-0.507
Token was
Token ID373
Feature activation+5.60e-01

Pos logit contributions
adder+1.151
adian+1.104
ortium+1.080
etting+1.058
monary+0.992
Neg logit contributions
 Bubble-1.365
spect-1.314
-1.291
enough-1.284
natureconservancy-1.262
Token no
Token ID645
Feature activation-3.65e-01

Pos logit contributions
 Bubble+1.573
spect+1.521
enough+1.482
+1.464
BUS+1.424
Neg logit contributions
adder-1.555
adian-1.518
ortium-1.482
etting-1.431
monary-1.374
Token need
Token ID761
Feature activation-2.85e+00

Pos logit contributions
adder+0.878
adian+0.860
ortium+0.815
etting+0.808
monary+0.761
Neg logit contributions
 Bubble-1.129
spect-1.113
enough-1.086
-1.084
natureconservancy-1.053
Token to
Token ID284
Feature activation-5.50e-01

Pos logit contributions
adder+5.972
adian+5.899
etting+5.582
ortium+5.487
 incur+5.154
Neg logit contributions
 Bubble-9.109
spect-8.835
enough-8.731
-8.688
natureconservancy-8.460
Token make
Token ID787
Feature activation-6.37e-01

Pos logit contributions
adder+1.373
adian+1.359
ortium+1.278
etting+1.254
 incur+1.214
Neg logit contributions
 Bubble-1.677
spect-1.641
-1.617
enough-1.604
natureconservancy-1.566
Token special
Token ID2041
Feature activation+4.52e-01

Pos logit contributions
adder+1.850
adian+1.802
ortium+1.742
etting+1.703
 incur+1.632
Neg logit contributions
 Bubble-1.653
spect-1.610
enough-1.578
-1.572
natureconservancy-1.545
Token arrangements
Token ID14752
Feature activation+1.06e-01

Pos logit contributions
 Bubble+1.352
spect+1.309
enough+1.281
+1.261
BUS+1.231
Neg logit contributions
adder-1.177
adian-1.146
ortium-1.119
etting-1.077
monary-1.030
Token for
Token ID329
Feature activation-5.00e-01

Pos logit contributions
 Bubble+0.309
spect+0.299
enough+0.294
+0.291
natureconservancy+0.283
Neg logit contributions
adder-0.283
adian-0.275
ortium-0.267
etting-0.261
monary-0.246
Token of
Token ID286
Feature activation-9.87e-01

Pos logit contributions
adder+3.475
adian+3.385
etting+3.288
ortium+3.251
 incur+3.009
Neg logit contributions
 Bubble-4.255
enough-4.081
spect-4.081
-4.040
natureconservancy-4.028
Token the
Token ID262
Feature activation-7.64e-01

Pos logit contributions
adder+2.691
adian+2.582
ortium+2.555
etting+2.491
monary+2.326
Neg logit contributions
 Bubble-2.743
spect-2.611
enough-2.603
-2.577
natureconservancy-2.569
Token investigations
Token ID10986
Feature activation-3.89e-01

Pos logit contributions
adder+1.533
adian+1.477
ortium+1.429
etting+1.418
monary+1.282
Neg logit contributions
 Bubble-2.659
spect-2.577
enough-2.554
-2.548
natureconservancy-2.520
Token.
Token ID13
Feature activation-5.66e-01

Pos logit contributions
adder+1.030
adian+0.997
ortium+0.972
etting+0.954
 incur+0.893
Neg logit contributions
 Bubble-1.123
spect-1.080
-1.071
enough-1.070
natureconservancy-1.057
Token<|endoftext|>
Token ID50256
Feature activation+1.65e+00

Pos logit contributions
adder+1.486
adian+1.444
ortium+1.408
etting+1.373
monary+1.280
Neg logit contributions
 Bubble-1.589
spect-1.545
enough-1.543
-1.536
natureconservancy-1.505
TokenIt
Token ID1026
Feature activation+2.24e+00

Pos logit contributions
 Bubble+4.269
spect+4.207
enough+4.113
BUS+3.981
+3.947
Neg logit contributions
adder-4.778
adian-4.696
ortium-4.670
etting-4.443
monary-4.312
Token was
Token ID373
Feature activation+8.67e-01

Pos logit contributions
 Bubble+5.337
spect+5.145
enough+4.913
+4.780
BUS+4.711
Neg logit contributions
adder-6.928
adian-6.798
ortium-6.668
etting-6.443
monary-6.305
Token supposed
Token ID4385
Feature activation+6.98e-01

Pos logit contributions
 Bubble+2.394
spect+2.314
enough+2.260
+2.224
BUS+2.164
Neg logit contributions
adder-2.455
adian-2.391
ortium-2.342
etting-2.265
monary-2.169
Token to
Token ID284
Feature activation+6.27e-01

Pos logit contributions
 Bubble+2.065
spect+1.998
enough+1.954
+1.919
BUS+1.877
Neg logit contributions
adder-1.839
adian-1.790
ortium-1.751
etting-1.684
monary-1.613
Token be
Token ID307
Feature activation+6.37e-01

Pos logit contributions
 Bubble+1.849
spect+1.784
enough+1.743
+1.710
BUS+1.677
Neg logit contributions
adder-1.662
adian-1.613
ortium-1.578
etting-1.518
monary-1.459
Token a
Token ID257
Feature activation+3.37e-01

Pos logit contributions
 Bubble+1.703
spect+1.651
enough+1.614
+1.590
BUS+1.541
Neg logit contributions
adder-1.850
adian-1.797
ortium-1.763
etting-1.715
monary-1.635
Token training
Token ID3047
Feature activation+6.63e-01

Pos logit contributions
adder+1.040
adian+1.013
ortium+0.963
etting+0.929
monary+0.901
Neg logit contributions
 Bubble-1.124
spect-1.104
enough-1.087
-1.087
natureconservancy-1.073
Token ground
Token ID2323
Feature activation+3.26e-02

Pos logit contributions
 Bubble+1.918
spect+1.857
enough+1.820
+1.790
BUS+1.746
Neg logit contributions
adder-1.784
adian-1.737
ortium-1.698
etting-1.637
monary-1.567
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
 Bubble+0.096
spect+0.092
enough+0.091
+0.091
natureconservancy+0.088
Neg logit contributions
adder-0.086
adian-0.083
ortium-0.081
etting-0.078
monary-0.074
Token I
Token ID314
Feature activation-7.81e-01

Pos logit contributions
 Bubble+0.921
spect+0.900
enough+0.880
+0.863
natureconservancy+0.840
Neg logit contributions
adder-1.123
adian-1.085
ortium-1.065
etting-1.032
monary-0.990
Token was
Token ID373
Feature activation-4.64e-02

Pos logit contributions
adder+1.892
adian+1.802
ortium+1.775
etting+1.678
monary+1.591
Neg logit contributions
 Bubble-2.471
spect-2.385
-2.359
enough-2.346
natureconservancy-2.288
Token lucky
Token ID9670
Feature activation+2.45e+00

Pos logit contributions
adder+0.122
adian+0.117
ortium+0.113
etting+0.111
monary+0.105
Neg logit contributions
 Bubble-0.136
spect-0.132
enough-0.130
-0.129
natureconservancy-0.126
Token.
Token ID13
Feature activation+4.93e-01

Pos logit contributions
 Bubble+5.296
enough+5.087
spect+5.083
 wisely+4.650
BUS+4.511
Neg logit contributions
adder-8.116
adian-7.963
ortium-7.876
etting-7.576
monary-7.414
Token I
Token ID314
Feature activation-8.04e-01

Pos logit contributions
 Bubble+1.257
spect+1.223
enough+1.194
+1.174
BUS+1.141
Neg logit contributions
adder-1.487
adian-1.444
ortium-1.414
etting-1.371
monary-1.316
Token was
Token ID373
Feature activation-1.66e-01

Pos logit contributions
adder+1.860
adian+1.783
ortium+1.743
etting+1.643
monary+1.561
Neg logit contributions
 Bubble-2.606
spect-2.529
-2.513
enough-2.496
natureconservancy-2.436
Token in
Token ID287
Feature activation-7.49e-02

Pos logit contributions
adder+0.446
adian+0.431
ortium+0.414
etting+0.404
monary+0.384
Neg logit contributions
 Bubble-0.477
spect-0.464
enough-0.458
-0.454
natureconservancy-0.443
Token a
Token ID257
Feature activation+3.90e-01

Pos logit contributions
adder+0.211
adian+0.204
ortium+0.202
etting+0.194
monary+0.184
Neg logit contributions
 Bubble-0.203
spect-0.196
enough-0.194
-0.193
BUS-0.189
Token all
Token ID477
Feature activation+7.74e-01

Pos logit contributions
 Bubble+1.837
spect+1.770
enough+1.728
+1.695
BUS+1.647
Neg logit contributions
adder-2.380
adian-2.325
ortium-2.282
etting-2.210
monary-2.128
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
 Bubble+2.041
spect+1.978
enough+1.936
+1.888
BUS+1.845
Neg logit contributions
adder-2.289
adian-2.228
ortium-2.186
etting-2.103
monary-2.022
Token stops
Token ID9911
Feature activation+5.70e-01

Pos logit contributions
 Bubble+3.511
spect+3.390
enough+3.321
+3.263
BUS+3.189
Neg logit contributions
adder-3.018
adian-2.936
ortium-2.871
etting-2.754
monary-2.646
Token and
Token ID290
Feature activation+1.24e-01

Pos logit contributions
 Bubble+1.635
spect+1.579
enough+1.545
+1.520
BUS+1.482
Neg logit contributions
adder-1.555
adian-1.515
ortium-1.480
etting-1.430
monary-1.367
Token really
Token ID1107
Feature activation+4.98e-01

Pos logit contributions
 Bubble+0.349
spect+0.339
enough+0.332
+0.330
natureconservancy+0.321
Neg logit contributions
adder-0.345
adian-0.336
ortium-0.325
etting-0.317
monary-0.299
Token digging
Token ID18894
Feature activation+2.11e+00

Pos logit contributions
 Bubble+1.468
spect+1.420
enough+1.390
+1.371
BUS+1.335
Neg logit contributions
adder-1.320
adian-1.282
ortium-1.250
etting-1.209
monary-1.150
Token into
Token ID656
Feature activation+1.41e+00

Pos logit contributions
 Bubble+5.295
spect+5.057
enough+4.953
+4.716
 wisely+4.714
Neg logit contributions
adder-6.319
adian-6.150
ortium-6.148
etting-5.916
monary-5.737
Token your
Token ID534
Feature activation+4.63e-01

Pos logit contributions
 Bubble+3.390
spect+3.234
enough+3.150
+3.071
BUS+2.981
Neg logit contributions
adder-4.481
adian-4.372
ortium-4.307
etting-4.171
monary-4.039
Token bag
Token ID6131
Feature activation-1.30e-02

Pos logit contributions
 Bubble+1.217
spect+1.195
enough+1.159
+1.142
rollers+1.108
Neg logit contributions
adder-1.357
adian-1.338
ortium-1.289
etting-1.259
monary-1.200
Token of
Token ID286
Feature activation-6.68e-02

Pos logit contributions
adder+0.037
adian+0.036
ortium+0.035
etting+0.034
monary+0.032
Neg logit contributions
 Bubble-0.035
spect-0.034
enough-0.034
-0.033
BUS-0.032
Token tricks
Token ID15910
Feature activation+1.87e-01

Pos logit contributions
adder+0.244
adian+0.243
ortium+0.234
etting+0.234
monary+0.219
Neg logit contributions
 Bubble-0.123
spect-0.122
enough-0.116
-0.115
BUS-0.110
Token283
Token ID30290
Feature activation-3.34e-01

Pos logit contributions
adder+0.223
adian+0.216
ortium+0.204
etting+0.197
monary+0.184
Neg logit contributions
 Bubble-0.391
spect-0.379
enough-0.374
-0.370
natureconservancy-0.363
Token185
Token ID21652
Feature activation-1.76e-01

Pos logit contributions
adder+0.931
adian+0.923
ortium+0.878
etting+0.854
monary+0.814
Neg logit contributions
 Bubble-0.930
spect-0.891
enough-0.885
-0.867
natureconservancy-0.843
Token307
Token ID22996
Feature activation+8.66e-02

Pos logit contributions
adder+0.467
adian+0.465
ortium+0.441
etting+0.429
monary+0.407
Neg logit contributions
 Bubble-0.511
spect-0.491
enough-0.486
-0.477
natureconservancy-0.466
Token18
Token ID1507
Feature activation-1.47e-01

Pos logit contributions
 Bubble+0.263
spect+0.255
enough+0.252
+0.247
rollers+0.241
Neg logit contributions
adder-0.218
adian-0.217
ortium-0.206
etting-0.200
monary-0.190
Token;
Token ID26
Feature activation+1.21e+00

Pos logit contributions
adder+0.364
adian+0.358
ortium+0.341
etting+0.333
monary+0.316
Neg logit contributions
 Bubble-0.447
spect-0.436
enough-0.431
-0.422
rollers-0.412
Token float
Token ID12178
Feature activation+2.22e+00

Pos logit contributions
 Bubble+3.579
spect+3.444
enough+3.344
+3.299
BUS+3.235
Neg logit contributions
adder-3.176
adian-3.070
ortium-3.023
etting-2.896
monary-2.791
Token time
Token ID640
Feature activation+1.10e+00

Pos logit contributions
 Bubble+5.887
spect+5.511
enough+5.359
+5.317
BUS+5.246
Neg logit contributions
adder-6.401
ortium-6.158
adian-6.148
etting-5.881
monary-5.766
Token =
Token ID796
Feature activation-3.17e-01

Pos logit contributions
 Bubble+2.994
spect+2.884
enough+2.813
+2.757
BUS+2.704
Neg logit contributions
adder-3.128
adian-3.053
ortium-3.005
etting-2.892
monary-2.795
Token (
Token ID357
Feature activation+1.02e+00

Pos logit contributions
adian+0.876
adder+0.875
ortium+0.820
etting+0.815
 incur+0.769
Neg logit contributions
 Bubble-0.854
spect-0.842
enough-0.841
-0.821
natureconservancy-0.817
Tokensin
Token ID31369
Feature activation+1.09e+00

Pos logit contributions
 Bubble+2.975
spect+2.886
enough+2.817
+2.764
BUS+2.711
Neg logit contributions
adder-2.702
adian-2.627
ortium-2.578
etting-2.476
monary-2.376
Token (_
Token ID44104
Feature activation+9.78e-01

Pos logit contributions
 Bubble+2.662
spect+2.565
enough+2.483
+2.423
BUS+2.369
Neg logit contributions
adder-3.429
adian-3.345
ortium-3.304
etting-3.188
monary-3.092
Token it
Token ID340
Feature activation+6.32e-01

Pos logit contributions
adder+0.018
adian+0.018
ortium+0.017
etting+0.017
monary+0.016
Neg logit contributions
 Bubble-0.018
spect-0.017
enough-0.017
-0.017
natureconservancy-0.016
Token could
Token ID714
Feature activation-7.26e-01

Pos logit contributions
 Bubble+1.854
spect+1.793
enough+1.755
+1.726
BUS+1.686
Neg logit contributions
adder-1.681
adian-1.634
ortium-1.598
etting-1.540
monary-1.472
Token have
Token ID423
Feature activation+7.76e-02

Pos logit contributions
adder+1.821
adian+1.811
ortium+1.723
etting+1.676
 incur+1.592
Neg logit contributions
 Bubble-2.193
spect-2.141
-2.100
enough-2.080
natureconservancy-2.058
Token been
Token ID587
Feature activation+3.78e-03

Pos logit contributions
 Bubble+0.208
spect+0.201
enough+0.196
+0.193
natureconservancy+0.188
Neg logit contributions
adder-0.223
adian-0.219
ortium-0.215
etting-0.206
monary-0.199
Token a
Token ID257
Feature activation-1.25e-01

Pos logit contributions
 Bubble+0.010
spect+0.010
+0.010
enough+0.010
BUS+0.009
Neg logit contributions
adder-0.011
adian-0.011
ortium-0.010
etting-0.010
monary-0.010
Token lucky
Token ID9670
Feature activation+2.49e+00

Pos logit contributions
adder+0.372
adian+0.362
ortium+0.352
etting+0.342
monary+0.326
Neg logit contributions
 Bubble-0.319
spect-0.314
enough-0.304
-0.303
BUS-0.295
Token guess
Token ID4724
Feature activation+1.53e+00

Pos logit contributions
 Bubble+5.942
spect+5.614
enough+5.476
 wisely+5.278
+5.118
Neg logit contributions
adder-7.667
adian-7.521
ortium-7.397
etting-7.197
monary-7.105
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
 Bubble+4.174
spect+4.033
enough+3.924
+3.788
 wisely+3.742
Neg logit contributions
adder-4.304
adian-4.223
ortium-4.134
etting-3.946
monary-3.878
Token
Token ID198
Feature activation-5.84e-02

Pos logit contributions
 Bubble+1.243
spect+1.213
enough+1.183
+1.165
natureconservancy+1.138
Neg logit contributions
adder-1.441
adian-1.395
ortium-1.367
etting-1.328
monary-1.267
Token
Token ID198
Feature activation+9.66e-02

Pos logit contributions
adder+0.162
adian+0.155
ortium+0.148
etting+0.148
monary+0.139
Neg logit contributions
 Bubble-0.164
-0.159
spect-0.157
enough-0.154
▬▬-0.148
TokenI
Token ID40
Feature activation-6.77e-01

Pos logit contributions
 Bubble+0.259
spect+0.244
+0.237
enough+0.236
natureconservancy+0.231
Neg logit contributions
adder-0.286
adian-0.275
ortium-0.266
etting-0.263
monary-0.249
Token remembered
Token ID12086
Feature activation-9.49e-02

Pos logit contributions
 Bubble+1.379
spect+1.335
enough+1.304
+1.284
natureconservancy+1.257
Neg logit contributions
adder-1.405
adian-1.368
ortium-1.337
etting-1.292
monary-1.236
Token Russell
Token ID11563
Feature activation+7.92e-01

Pos logit contributions
adder+0.277
adian+0.268
ortium+0.261
etting+0.257
monary+0.242
Neg logit contributions
 Bubble-0.247
spect-0.242
enough-0.237
-0.237
natureconservancy-0.230
Token and
Token ID290
Feature activation-3.98e-01

Pos logit contributions
 Bubble+2.134
spect+2.058
enough+2.008
+1.970
BUS+1.920
Neg logit contributions
adder-2.298
adian-2.242
ortium-2.197
etting-2.122
monary-2.041
Token Jones
Token ID5437
Feature activation+2.67e-01

Pos logit contributions
adder+1.228
adian+1.182
ortium+1.161
etting+1.126
monary+1.071
Neg logit contributions
 Bubble-0.965
spect-0.956
enough-0.939
-0.928
natureconservancy-0.912
Token speaking
Token ID5486
Feature activation-2.72e-01

Pos logit contributions
 Bubble+0.762
spect+0.736
enough+0.719
+0.713
BUS+0.693
Neg logit contributions
adder-0.736
adian-0.714
ortium-0.696
etting-0.681
monary-0.641
Token to
Token ID284
Feature activation+5.00e-01

Pos logit contributions
adder+0.712
adian+0.681
ortium+0.659
etting+0.644
monary+0.605
Neg logit contributions
 Bubble-0.797
spect-0.785
enough-0.769
-0.764
rollers-0.741
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 Bubble+1.238
spect+1.199
enough+1.170
+1.156
natureconservancy+1.115
Neg logit contributions
adder-1.556
adian-1.515
ortium-1.485
etting-1.441
monary-1.383
Token prisoners
Token ID10577
Feature activation-4.78e-01

Pos logit contributions
 Bubble+0.315
spect+0.306
enough+0.302
+0.298
BUS+0.288
Neg logit contributions
adder-0.329
adian-0.321
ortium-0.314
etting-0.304
monary-0.289
Token in
Token ID287
Feature activation-1.86e-01

Pos logit contributions
adder+1.274
adian+1.220
ortium+1.194
etting+1.168
monary+1.088
Neg logit contributions
 Bubble-1.379
spect-1.342
enough-1.322
-1.322
natureconservancy-1.282
Token the
Token ID262
Feature activation-7.69e-01

Pos logit contributions
adder+0.533
adian+0.515
ortium+0.503
etting+0.494
monary+0.464
Neg logit contributions
 Bubble-0.497
enough-0.480
spect-0.479
-0.477
natureconservancy-0.469
Token yard
Token ID12699
Feature activation+6.62e-02

Pos logit contributions
adder+2.062
adian+2.025
ortium+1.948
etting+1.917
monary+1.813
Neg logit contributions
 Bubble-2.131
spect-2.095
enough-2.090
-2.079
natureconservancy-2.028
Token she
Token ID673
Feature activation-8.65e-01

Pos logit contributions
adder+0.876
adian+0.856
ortium+0.835
etting+0.812
monary+0.770
Neg logit contributions
 Bubble-0.746
spect-0.723
enough-0.704
-0.699
natureconservancy-0.684
Token talked
Token ID6619
Feature activation-5.36e-01

Pos logit contributions
adder+2.121
adian+2.069
ortium+1.979
etting+1.937
 incur+1.807
Neg logit contributions
 Bubble-2.623
spect-2.601
-2.526
enough-2.525
natureconservancy-2.503
Token to
Token ID284
Feature activation+6.01e-01

Pos logit contributions
adder+1.331
adian+1.295
ortium+1.243
etting+1.217
monary+1.142
Neg logit contributions
 Bubble-1.642
spect-1.608
enough-1.576
-1.552
natureconservancy-1.533
Token as
Token ID355
Feature activation-3.27e-01

Pos logit contributions
 Bubble+1.659
spect+1.603
enough+1.564
+1.539
BUS+1.499
Neg logit contributions
adder-1.700
adian-1.659
ortium-1.624
etting-1.568
monary-1.503
Token many
Token ID867
Feature activation+1.45e-02

Pos logit contributions
adder+0.822
adian+0.811
ortium+0.767
etting+0.766
 incur+0.696
Neg logit contributions
 Bubble-0.989
spect-0.950
-0.944
enough-0.929
natureconservancy-0.919
Token as
Token ID355
Feature activation+6.80e-01

Pos logit contributions
 Bubble+0.045
spect+0.044
enough+0.043
+0.042
BUS+0.041
Neg logit contributions
adder-0.036
adian-0.035
ortium-0.034
etting-0.033
monary-0.031
Token 100
Token ID1802
Feature activation+2.16e-01

Pos logit contributions
 Bubble+1.934
spect+1.873
enough+1.829
+1.803
BUS+1.757
Neg logit contributions
adder-1.865
adian-1.818
ortium-1.776
etting-1.715
monary-1.643
Token people
Token ID661
Feature activation+1.10e-01

Pos logit contributions
 Bubble+0.601
spect+0.582
enough+0.570
+0.564
BUS+0.551
Neg logit contributions
adder-0.599
adian-0.589
ortium-0.570
etting-0.558
monary-0.525
Token on
Token ID319
Feature activation-3.38e-01

Pos logit contributions
 Bubble+0.331
spect+0.321
enough+0.315
+0.313
natureconservancy+0.308
Neg logit contributions
adder-0.280
adian-0.274
ortium-0.266
etting-0.259
monary-0.241
Token Tinder
Token ID44043
Feature activation+3.22e-01

Pos logit contributions
adder+0.947
adian+0.925
ortium+0.895
etting+0.882
monary+0.818
Neg logit contributions
 Bubble-0.908
spect-0.887
enough-0.872
-0.864
natureconservancy-0.860
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
 Bubble+0.947
spect+0.914
enough+0.897
+0.887
natureconservancy+0.874
Neg logit contributions
adder-0.850
adian-0.829
ortium-0.808
etting-0.779
monary-0.740
Token from
Token ID422
Feature activation-1.75e-01

Pos logit contributions
 Bubble+0.405
spect+0.389
enough+0.383
+0.380
natureconservancy+0.373
Neg logit contributions
adder-0.366
adian-0.357
ortium-0.346
etting-0.338
monary-0.318
Token the
Token ID262
Feature activation-2.13e-01

Pos logit contributions
adder+0.498
adian+0.491
ortium+0.476
etting+0.463
monary+0.438
Neg logit contributions
 Bubble-0.470
spect-0.451
enough-0.448
-0.444
natureconservancy-0.433
Token petition
Token ID8853
Feature activation-5.63e-01

Pos logit contributions
adder+0.595
adian+0.591
ortium+0.575
etting+0.556
monary+0.525
Neg logit contributions
 Bubble-0.585
spect-0.561
enough-0.560
-0.553
natureconservancy-0.536
Tokeners
Token ID364
Feature activation-2.20e-01

Pos logit contributions
adian+1.826
adder+1.820
ortium+1.777
etting+1.738
 incur+1.622
Neg logit contributions
 Bubble-1.264
enough-1.220
spect-1.217
-1.195
natureconservancy-1.174
Token on
Token ID319
Feature activation-6.81e-01

Pos logit contributions
adder+0.564
adian+0.554
ortium+0.545
etting+0.527
monary+0.489
Neg logit contributions
 Bubble-0.649
spect-0.625
-0.620
enough-0.618
natureconservancy-0.610
Token whose
Token ID3025
Feature activation+5.38e-01

Pos logit contributions
adder+1.848
adian+1.803
ortium+1.750
etting+1.719
monary+1.627
Neg logit contributions
 Bubble-1.901
spect-1.813
enough-1.802
-1.778
natureconservancy-1.769
Token plea
Token ID16720
Feature activation-9.89e-01

Pos logit contributions
 Bubble+1.733
spect+1.682
enough+1.650
+1.628
BUS+1.592
Neg logit contributions
adder-1.268
adian-1.234
ortium-1.200
etting-1.154
monary-1.096
Token the
Token ID262
Feature activation-4.52e-01

Pos logit contributions
adder+2.747
adian+2.685
ortium+2.633
etting+2.530
 incur+2.433
Neg logit contributions
 Bubble-2.639
enough-2.547
natureconservancy-2.528
spect-2.504
-2.476
Token high
Token ID1029
Feature activation-5.37e-01

Pos logit contributions
adder+1.153
adian+1.136
ortium+1.111
etting+1.077
 incur+1.005
Neg logit contributions
 Bubble-1.341
spect-1.284
enough-1.268
-1.259
natureconservancy-1.244
Token court
Token ID2184
Feature activation+8.57e-01

Pos logit contributions
adder+1.461
adian+1.424
ortium+1.386
etting+1.347
monary+1.271
Neg logit contributions
 Bubble-1.535
spect-1.468
enough-1.433
natureconservancy-1.423
-1.421
Token had
Token ID550
Feature activation+5.46e-01

Pos logit contributions
 Bubble+2.611
spect+2.529
enough+2.478
+2.439
BUS+2.384
Neg logit contributions
adder-2.185
adian-2.123
ortium-2.074
etting-1.992
monary-1.902
Tokenode
Token ID1098
Feature activation-1.56e+00

Pos logit contributions
adder+2.154
adian+2.098
ortium+1.982
etting+1.944
 incur+1.776
Neg logit contributions
 Bubble-3.097
-3.016
spect-3.002
enough-2.989
natureconservancy-2.945
Token well
Token ID880
Feature activation+8.79e-01

Pos logit contributions
adder+3.612
adian+3.581
etting+3.437
ortium+3.345
 incur+3.244
Neg logit contributions
 Bubble-4.547
-4.391
enough-4.361
spect-4.350
natureconservancy-4.289
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
 Bubble+2.569
spect+2.487
enough+2.434
+2.377
BUS+2.332
Neg logit contributions
adder-2.345
adian-2.284
ortium-2.238
etting-2.144
monary-2.066
Token
Token ID198
Feature activation+3.72e-02

Pos logit contributions
 Bubble+1.021
spect+1.002
enough+0.980
+0.964
natureconservancy+0.939
Neg logit contributions
adder-1.184
adian-1.155
ortium-1.124
etting-1.104
monary-1.045
Token
Token ID198
Feature activation+2.13e-01

Pos logit contributions
 Bubble+0.103
+0.100
spect+0.099
enough+0.097
▬▬+0.093
Neg logit contributions
adder-0.105
adian-0.100
ortium-0.096
etting-0.096
monary-0.090
TokenRep
Token ID6207
Feature activation+6.21e-01

Pos logit contributions
 Bubble+0.564
spect+0.536
enough+0.523
+0.522
natureconservancy+0.506
Neg logit contributions
adder-0.628
adian-0.610
ortium-0.589
etting-0.584
monary-0.553
Tokenorted
Token ID9741
Feature activation+8.03e-01

Pos logit contributions
 Bubble+2.143
spect+2.080
enough+2.045
+2.024
BUS+1.982
Neg logit contributions
adder-1.339
adian-1.297
ortium-1.257
etting-1.208
monary-1.133
Tokenly
Token ID306
Feature activation-4.46e-01

Pos logit contributions
 Bubble+2.194
spect+2.119
enough+2.069
+2.031
BUS+1.980
Neg logit contributions
adder-2.298
adian-2.242
ortium-2.194
etting-2.122
monary-2.037
Token,
Token ID11
Feature activation-8.95e-01

Pos logit contributions
adder+1.233
adian+1.203
ortium+1.177
etting+1.141
monary+1.086
Neg logit contributions
 Bubble-1.249
spect-1.212
enough-1.179
-1.160
natureconservancy-1.137
Token Pages
Token ID28221
Feature activation+1.28e-01

Pos logit contributions
adder+2.438
adian+2.419
ortium+2.315
etting+2.303
monary+2.151
Neg logit contributions
 Bubble-2.386
spect-2.382
-2.336
enough-2.323
natureconservancy-2.316
Token and
Token ID290
Feature activation-3.17e-01

Pos logit contributions
 Bubble+0.361
spect+0.352
enough+0.345
+0.343
natureconservancy+0.334
Neg logit contributions
adder-0.353
adian-0.345
ortium-0.335
etting-0.326
monary-0.310
Token than
Token ID621
Feature activation+6.52e-01

Pos logit contributions
adder+2.467
adian+2.372
etting+2.334
ortium+2.285
 incur+2.141
Neg logit contributions
 Bubble-3.038
spect-2.966
-2.943
enough-2.924
BUS-2.842
Token when
Token ID618
Feature activation+7.51e-01

Pos logit contributions
 Bubble+1.733
spect+1.676
enough+1.636
+1.610
BUS+1.566
Neg logit contributions
adder-1.906
adian-1.861
ortium-1.820
etting-1.765
monary-1.692
Token before
Token ID878
Feature activation+7.76e-01

Pos logit contributions
 Bubble+1.935
spect+1.864
enough+1.818
+1.782
BUS+1.734
Neg logit contributions
adder-2.264
adian-2.212
ortium-2.168
etting-2.099
monary-2.020
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
 Bubble+2.003
spect+1.924
enough+1.873
+1.831
BUS+1.785
Neg logit contributions
adder-2.337
adian-2.284
ortium-2.239
etting-2.161
monary-2.092
Token ball
Token ID2613
Feature activation+6.01e-01

Pos logit contributions
adder+0.315
adian+0.314
ortium+0.302
etting+0.294
monary+0.277
Neg logit contributions
 Bubble-0.324
spect-0.319
enough-0.315
-0.309
BUS-0.301
Token was
Token ID373
Feature activation+3.43e-01

Pos logit contributions
 Bubble+1.798
spect+1.744
enough+1.709
+1.683
BUS+1.641
Neg logit contributions
adder-1.566
adian-1.522
ortium-1.483
etting-1.431
monary-1.361
Token snapped
Token ID20821
Feature activation-1.79e-01

Pos logit contributions
 Bubble+0.951
spect+0.929
enough+0.905
+0.898
natureconservancy+0.869
Neg logit contributions
adder-0.958
adian-0.935
ortium-0.907
etting-0.891
monary-0.834
Token.
Token ID13
Feature activation+4.68e-02

Pos logit contributions
adder+0.511
adian+0.494
ortium+0.482
etting+0.474
monary+0.448
Neg logit contributions
 Bubble-0.489
spect-0.473
enough-0.463
-0.457
natureconservancy-0.445
Token
Token ID198
Feature activation+2.52e-02

Pos logit contributions
 Bubble+0.121
spect+0.119
enough+0.117
+0.115
BUS+0.112
Neg logit contributions
adder-0.137
adian-0.133
ortium-0.130
etting-0.127
monary-0.120
Token
Token ID198
Feature activation+1.30e-01

Pos logit contributions
 Bubble+0.069
+0.067
spect+0.066
enough+0.065
BUS+0.063
Neg logit contributions
adder-0.072
adian-0.069
ortium-0.066
etting-0.065
monary-0.062
TokenIt
Token ID1026
Feature activation+1.25e+00

Pos logit contributions
 Bubble+0.350
spect+0.333
enough+0.325
+0.323
rollers+0.313
Neg logit contributions
adder-0.380
adian-0.370
ortium-0.359
etting-0.353
monary-0.334
Tokens
Token ID82
Feature activation-4.31e-01

Pos logit contributions
adder+1.550
adian+1.521
ortium+1.404
etting+1.380
monary+1.379
Neg logit contributions
 Bubble-1.524
-1.466
enough-1.414
natureconservancy-1.399
BUS-1.398
Token reading
Token ID3555
Feature activation+1.36e+00

Pos logit contributions
adder+1.220
adian+1.191
ortium+1.123
etting+1.115
monary+1.077
Neg logit contributions
 Bubble-1.151
spect-1.133
-1.129
enough-1.102
natureconservancy-1.068
Token and
Token ID290
Feature activation-2.09e-01

Pos logit contributions
 Bubble+4.062
spect+3.939
enough+3.845
+3.751
BUS+3.672
Neg logit contributions
adder-3.493
adian-3.435
ortium-3.370
etting-3.211
monary-3.074
Token writing
Token ID3597
Feature activation+1.08e+00

Pos logit contributions
adder+0.591
adian+0.571
etting+0.541
ortium+0.540
monary+0.520
Neg logit contributions
 Bubble-0.569
spect-0.553
-0.546
enough-0.544
natureconservancy-0.539
Token skills
Token ID4678
Feature activation-1.19e-01

Pos logit contributions
 Bubble+2.747
spect+2.644
enough+2.571
+2.513
BUS+2.446
Neg logit contributions
adder-3.299
adian-3.230
ortium-3.175
etting-3.062
monary-2.950
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
adder+0.331
adian+0.322
ortium+0.315
etting+0.307
monary+0.292
Neg logit contributions
 Bubble-0.330
spect-0.318
enough-0.312
-0.308
BUS-0.300
Token Now
Token ID2735
Feature activation-7.72e-01

Pos logit contributions
adder+0.643
adian+0.617
ortium+0.607
etting+0.586
monary+0.555
Neg logit contributions
 Bubble-0.557
spect-0.545
enough-0.532
-0.525
natureconservancy-0.521
Tokenadays
Token ID20544
Feature activation-1.37e+00

Pos logit contributions
adder+2.400
adian+2.366
ortium+2.266
etting+2.220
monary+2.111
Neg logit contributions
 Bubble-1.842
-1.771
spect-1.767
natureconservancy-1.761
enough-1.734
Token toy
Token ID13373
Feature activation+5.12e-01

Pos logit contributions
adder+3.527
adian+3.473
ortium+3.272
etting+3.143
monary+3.046
Neg logit contributions
 Bubble-3.842
spect-3.780
-3.704
enough-3.686
natureconservancy-3.685
Token makers
Token ID14429
Feature activation+6.62e-01

Pos logit contributions
 Bubble+1.474
spect+1.428
enough+1.399
+1.376
BUS+1.339
Neg logit contributions
adder-1.386
adian-1.351
ortium-1.318
etting-1.276
monary-1.218
Token are
Token ID389
Feature activation+1.61e-01

Pos logit contributions
 Bubble+1.927
spect+1.863
enough+1.820
+1.789
BUS+1.748
Neg logit contributions
adder-1.774
adian-1.727
ortium-1.690
etting-1.627
monary-1.561
Token just
Token ID655
Feature activation-1.21e+00

Pos logit contributions
 Bubble+1.405
spect+1.358
enough+1.319
+1.305
natureconservancy+1.274
Neg logit contributions
adder-1.621
adian-1.582
ortium-1.534
etting-1.511
monary-1.430
Token really
Token ID1107
Feature activation-2.70e-04

Pos logit contributions
adder+3.166
adian+3.074
ortium+2.947
etting+2.929
 incur+2.716
Neg logit contributions
 Bubble-3.477
spect-3.401
natureconservancy-3.345
-3.323
BUS-3.268
Token is
Token ID318
Feature activation-1.54e-01

Pos logit contributions
adder+0.001
adian+0.001
ortium+0.001
etting+0.001
monary+0.001
Neg logit contributions
 Bubble-0.001
spect-0.001
enough-0.001
-0.001
natureconservancy-0.001
Token that
Token ID326
Feature activation-1.96e-01

Pos logit contributions
adder+0.458
adian+0.443
ortium+0.427
etting+0.423
 incur+0.393
Neg logit contributions
 Bubble-0.398
spect-0.386
-0.379
enough-0.373
BUS-0.373
Token popular
Token ID2968
Feature activation-4.91e-01

Pos logit contributions
adder+0.615
adian+0.592
ortium+0.572
etting+0.566
 incur+0.534
Neg logit contributions
 Bubble-0.478
spect-0.464
-0.451
enough-0.449
BUS-0.441
Token,
Token ID11
Feature activation-4.89e-02

Pos logit contributions
adder+1.383
adian+1.340
ortium+1.288
etting+1.275
monary+1.193
Neg logit contributions
 Bubble-1.338
spect-1.290
-1.273
BUS-1.251
enough-1.250
Token and
Token ID290
Feature activation-1.80e-01

Pos logit contributions
adder+0.132
adian+0.128
ortium+0.124
etting+0.121
monary+0.113
Neg logit contributions
 Bubble-0.140
spect-0.136
-0.132
enough-0.132
natureconservancy-0.130
Token I
Token ID314
Feature activation-9.72e-01

Pos logit contributions
adder+0.517
adian+0.504
ortium+0.490
etting+0.476
monary+0.448
Neg logit contributions
 Bubble-0.479
spect-0.476
-0.460
enough-0.457
natureconservancy-0.450
Token
Token ID447
Feature activation-2.13e-01

Pos logit contributions
adder+2.398
adian+2.301
ortium+2.246
etting+2.151
monary+2.041
Neg logit contributions
 Bubble-2.984
spect-2.878
enough-2.803
-2.787
natureconservancy-2.787
Token
Token ID247
Feature activation-2.40e-01

Pos logit contributions
adder+0.376
adian+0.370
ortium+0.350
etting+0.328
monary+0.310
Neg logit contributions
 Bubble-0.810
spect-0.784
enough-0.772
-0.768
rollers-0.753
Tokenve
Token ID303
Feature activation+1.63e-01

Pos logit contributions
adder+0.709
adian+0.690
ortium+0.671
etting+0.652
monary+0.629
Neg logit contributions
 Bubble-0.637
spect-0.595
enough-0.591
-0.586
natureconservancy-0.573
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
adder+3.193
adian+3.088
ortium+3.059
etting+2.935
monary+2.770
Neg logit contributions
 Bubble-3.620
spect-3.517
-3.484
enough-3.470
natureconservancy-3.416
Token Shell
Token ID17537
Feature activation-6.76e-01

Pos logit contributions
 Bubble+0.409
spect+0.401
enough+0.388
+0.386
natureconservancy+0.374
Neg logit contributions
adder-0.509
adian-0.495
ortium-0.486
etting-0.471
monary-0.451
Token is
Token ID318
Feature activation+1.95e-01

Pos logit contributions
adder+1.989
adian+1.946
ortium+1.890
etting+1.813
monary+1.766
Neg logit contributions
 Bubble-1.747
spect-1.682
enough-1.654
-1.644
natureconservancy-1.618
Token planning
Token ID5410
Feature activation+6.78e-01

Pos logit contributions
 Bubble+0.563
spect+0.543
enough+0.533
+0.529
natureconservancy+0.516
Neg logit contributions
adder-0.527
adian-0.512
ortium-0.494
etting-0.480
monary-0.452
Token to
Token ID284
Feature activation+2.28e-01

Pos logit contributions
 Bubble+1.935
spect+1.870
enough+1.820
+1.786
BUS+1.742
Neg logit contributions
adder-1.857
adian-1.806
ortium-1.773
etting-1.710
monary-1.638
Token drill
Token ID16007
Feature activation+3.28e-02

Pos logit contributions
 Bubble+0.716
spect+0.695
enough+0.680
+0.671
BUS+0.655
Neg logit contributions
adder-0.561
adian-0.546
ortium-0.531
etting-0.510
monary-0.486
Token in
Token ID287
Feature activation+1.65e-01

Pos logit contributions
 Bubble+0.097
spect+0.093
enough+0.092
+0.091
natureconservancy+0.089
Neg logit contributions
adder-0.086
adian-0.084
ortium-0.082
etting-0.079
monary-0.075
Token Alaska
Token ID12926
Feature activation+2.02e-01

Pos logit contributions
 Bubble+0.463
spect+0.448
enough+0.439
+0.434
BUS+0.423
Neg logit contributions
adder-0.453
adian-0.443
ortium-0.433
etting-0.418
monary-0.400
Token this
Token ID428
Feature activation+9.52e-02

Pos logit contributions
 Bubble+0.586
spect+0.567
enough+0.553
+0.547
natureconservancy+0.534
Neg logit contributions
adder-0.544
adian-0.532
ortium-0.518
etting-0.498
monary-0.479
Token summer
Token ID3931
Feature activation-1.06e+00

Pos logit contributions
 Bubble+0.241
spect+0.234
enough+0.227
+0.226
natureconservancy+0.219
Neg logit contributions
adder-0.287
adian-0.282
ortium-0.276
etting-0.268
monary-0.256
Token and
Token ID290
Feature activation-6.00e-01

Pos logit contributions
adder+2.601
adian+2.593
ortium+2.470
etting+2.420
monary+2.265
Neg logit contributions
 Bubble-3.198
spect-3.048
-3.014
natureconservancy-3.011
enough-2.987
Token Russia
Token ID3284
Feature activation+1.96e-01

Pos logit contributions
adder+0.208
adian+0.204
ortium+0.199
etting+0.193
monary+0.183
Neg logit contributions
 Bubble-0.191
spect-0.185
enough-0.184
-0.181
BUS-0.176
Token,
Token ID11
Feature activation+6.31e-02

Pos logit contributions
 Bubble+0.553
spect+0.533
enough+0.522
+0.513
BUS+0.501
Neg logit contributions
adder-0.540
adian-0.527
ortium-0.515
etting-0.497
monary-0.476
Token has
Token ID468
Feature activation+6.44e-01

Pos logit contributions
 Bubble+0.167
spect+0.161
enough+0.157
+0.156
natureconservancy+0.152
Neg logit contributions
adder-0.184
adian-0.181
ortium-0.177
etting-0.170
monary-0.163
Token announced
Token ID3414
Feature activation+5.63e-01

Pos logit contributions
 Bubble+1.876
spect+1.814
enough+1.772
+1.750
rollers+1.704
Neg logit contributions
adder-1.720
adian-1.680
ortium-1.644
etting-1.577
monary-1.510
Token that
Token ID326
Feature activation+1.12e-01

Pos logit contributions
 Bubble+1.435
spect+1.380
enough+1.346
+1.320
BUS+1.285
Neg logit contributions
adder-1.715
adian-1.675
ortium-1.643
etting-1.588
monary-1.530
Token it
Token ID340
Feature activation-1.59e-01

Pos logit contributions
 Bubble+0.275
spect+0.265
enough+0.257
+0.255
natureconservancy+0.247
Neg logit contributions
adder-0.353
adian-0.344
ortium-0.338
etting-0.325
monary-0.316
Token
Token ID447
Feature activation+5.10e-02

Pos logit contributions
adder+0.410
adian+0.399
ortium+0.388
etting+0.374
monary+0.355
Neg logit contributions
 Bubble-0.480
spect-0.463
-0.447
enough-0.447
natureconservancy-0.438
Token
Token ID247
Feature activation-3.09e-01

Pos logit contributions
 Bubble+0.198
spect+0.193
enough+0.189
+0.189
rollers+0.184
Neg logit contributions
adder-0.086
adian-0.084
ortium-0.079
etting-0.073
monary-0.070
Tokens
Token ID82
Feature activation+1.04e-02

Pos logit contributions
adder+0.902
adian+0.877
ortium+0.856
etting+0.831
monary+0.803
Neg logit contributions
 Bubble-0.838
-0.776
enough-0.773
spect-0.767
natureconservancy-0.747
Token stopping
Token ID12225
Feature activation-1.55e-01

Pos logit contributions
 Bubble+0.033
spect+0.032
enough+0.031
+0.031
natureconservancy+0.030
Neg logit contributions
adder-0.025
adian-0.024
ortium-0.024
etting-0.023
monary-0.021
Token work
Token ID670
Feature activation-1.83e-01

Pos logit contributions
adder+0.417
adian+0.408
ortium+0.395
etting+0.385
monary+0.366
Neg logit contributions
 Bubble-0.446
spect-0.428
enough-0.421
-0.414
natureconservancy-0.412
Token discovered
Token ID5071
Feature activation+6.20e-01

Pos logit contributions
 Bubble+0.599
spect+0.581
enough+0.568
+0.564
natureconservancy+0.548
Neg logit contributions
adder-0.582
adian-0.568
ortium-0.557
etting-0.530
monary-0.513
Token that
Token ID326
Feature activation-6.59e-02

Pos logit contributions
 Bubble+1.583
spect+1.529
enough+1.488
+1.463
BUS+1.420
Neg logit contributions
adder-1.880
adian-1.837
ortium-1.801
etting-1.740
monary-1.676
Token his
Token ID465
Feature activation-7.39e-01

Pos logit contributions
adder+0.190
adian+0.185
ortium+0.181
etting+0.175
monary+0.168
Neg logit contributions
 Bubble-0.176
spect-0.171
enough-0.167
-0.165
natureconservancy-0.161
Token father
Token ID2988
Feature activation+1.47e-01

Pos logit contributions
adder+1.935
adian+1.914
ortium+1.827
etting+1.788
 incur+1.693
Neg logit contributions
 Bubble-2.113
spect-2.090
enough-2.051
-2.041
natureconservancy-1.961
Token was
Token ID373
Feature activation+3.50e-01

Pos logit contributions
 Bubble+0.457
spect+0.445
enough+0.437
+0.429
natureconservancy+0.423
Neg logit contributions
adder-0.365
adian-0.355
ortium-0.346
etting-0.331
monary-0.314
Token on
Token ID319
Feature activation+1.15e-01

Pos logit contributions
 Bubble+1.017
spect+0.985
enough+0.965
+0.953
natureconservancy+0.927
Neg logit contributions
adder-0.940
adian-0.919
ortium-0.893
etting-0.865
monary-0.822
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
 Bubble+0.309
spect+0.297
enough+0.291
+0.287
BUS+0.279
Neg logit contributions
adder-0.331
adian-0.324
ortium-0.317
etting-0.306
monary-0.294
Token last
Token ID938
Feature activation+1.13e+00

Pos logit contributions
 Bubble+0.649
spect+0.631
enough+0.621
+0.612
natureconservancy+0.598
Neg logit contributions
adder-0.590
adian-0.577
ortium-0.562
etting-0.543
monary-0.517
Token train
Token ID4512
Feature activation+3.94e-01

Pos logit contributions
 Bubble+3.265
spect+3.151
enough+3.070
+3.001
BUS+2.956
Neg logit contributions
adder-3.025
adian-2.953
ortium-2.901
etting-2.782
monary-2.679
Token to
Token ID284
Feature activation+4.08e-01

Pos logit contributions
 Bubble+1.119
spect+1.082
enough+1.058
+1.042
BUS+1.014
Neg logit contributions
adder-1.084
adian-1.054
ortium-1.031
etting-0.997
monary-0.952
Token the
Token ID262
Feature activation-1.92e-01

Pos logit contributions
 Bubble+1.123
spect+1.087
enough+1.064
+1.048
natureconservancy+1.020
Neg logit contributions
adder-1.156
adian-1.126
ortium-1.101
etting-1.063
monary-1.018
Token Korean
Token ID6983
Feature activation+4.72e-01

Pos logit contributions
adder+0.731
adian+0.710
ortium+0.693
etting+0.665
monary+0.636
Neg logit contributions
 Bubble-0.829
spect-0.804
enough-0.790
-0.777
natureconservancy-0.762
Token albeit
Token ID18244
Feature activation+5.15e-01

Pos logit contributions
 Bubble+1.240
spect+1.206
enough+1.180
+1.159
BUS+1.129
Neg logit contributions
adder-1.399
adian-1.365
ortium-1.326
etting-1.284
monary-1.235
Token in
Token ID287
Feature activation+9.84e-01

Pos logit contributions
 Bubble+1.419
spect+1.371
enough+1.345
+1.326
natureconservancy+1.293
Neg logit contributions
adder-1.468
adian-1.431
ortium-1.388
etting-1.346
monary-1.290
Token a
Token ID257
Feature activation+8.17e-01

Pos logit contributions
 Bubble+2.643
spect+2.540
enough+2.498
+2.458
BUS+2.391
Neg logit contributions
adder-2.870
adian-2.797
ortium-2.731
etting-2.644
monary-2.530
Token particularly
Token ID3573
Feature activation-2.56e-01

Pos logit contributions
 Bubble+2.131
spect+2.062
enough+2.030
+1.987
BUS+1.931
Neg logit contributions
adder-2.429
adian-2.367
ortium-2.314
etting-2.243
monary-2.144
Token gore
Token ID46835
Feature activation-1.79e+00

Pos logit contributions
adder+0.813
adian+0.779
ortium+0.762
etting+0.749
 incur+0.717
Neg logit contributions
 Bubble-0.600
enough-0.586
spect-0.582
-0.580
natureconservancy-0.553
Token filled
Token ID5901
Feature activation+5.52e-01

Pos logit contributions
adder+5.249
adian+4.996
etting+4.839
ortium+4.749
 incur+4.536
Neg logit contributions
 Bubble-4.445
enough-4.202
spect-4.171
natureconservancy-4.156
-4.132
Token way
Token ID835
Feature activation-1.11e-01

Pos logit contributions
 Bubble+1.507
spect+1.457
enough+1.427
+1.407
BUS+1.374
Neg logit contributions
adder-1.573
adian-1.535
ortium-1.496
etting-1.457
monary-1.386
Token.
Token ID13
Feature activation+1.52e-01

Pos logit contributions
adder+0.314
adian+0.306
ortium+0.298
etting+0.293
monary+0.276
Neg logit contributions
 Bubble-0.301
spect-0.292
enough-0.286
-0.282
natureconservancy-0.277
Token
Token ID198
Feature activation-2.25e-02

Pos logit contributions
 Bubble+0.383
spect+0.375
enough+0.365
+0.365
natureconservancy+0.355
Neg logit contributions
adder-0.457
adian-0.440
ortium-0.433
etting-0.422
monary-0.398
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
adder+0.063
adian+0.060
ortium+0.058
etting+0.057
monary+0.054
Neg logit contributions
 Bubble-0.063
-0.060
spect-0.060
enough-0.059
natureconservancy-0.056
Token Special
Token ID6093
Feature activation+5.19e-01

Pos logit contributions
adder+0.063
adian+0.061
ortium+0.060
etting+0.058
monary+0.055
Neg logit contributions
 Bubble-0.059
spect-0.058
enough-0.056
-0.056
BUS-0.054
Token frame
Token ID5739
Feature activation+5.72e-01

Pos logit contributions
 Bubble+1.518
spect+1.474
enough+1.440
+1.419
BUS+1.382
Neg logit contributions
adder-1.384
adian-1.347
ortium-1.314
etting-1.268
monary-1.213
Token for
Token ID329
Feature activation-3.59e-01

Pos logit contributions
 Bubble+1.607
spect+1.546
enough+1.510
+1.482
BUS+1.450
Neg logit contributions
adder-1.591
adian-1.551
ortium-1.528
etting-1.462
monary-1.411
Token pop
Token ID1461
Feature activation-1.58e+00

Pos logit contributions
adder+1.059
adian+1.035
ortium+0.995
etting+0.987
monary+0.936
Neg logit contributions
 Bubble-0.927
spect-0.914
enough-0.893
-0.884
natureconservancy-0.851
Token/
Token ID14
Feature activation-5.54e-01

Pos logit contributions
adder+4.486
adian+4.308
etting+4.186
ortium+4.138
monary+3.870
Neg logit contributions
 Bubble-4.150
enough-4.031
spect-4.021
-3.940
BUS-3.833
Tokenpop
Token ID12924
Feature activation-1.74e+00

Pos logit contributions
adder+1.937
adian+1.863
etting+1.791
ortium+1.784
monary+1.733
Neg logit contributions
 Bubble-1.185
-1.179
spect-1.076
▬▬-1.063
enough-1.049
Token/
Token ID14
Feature activation-1.33e+00

Pos logit contributions
adder+4.749
adian+4.593
etting+4.453
ortium+4.372
monary+4.117
Neg logit contributions
 Bubble-4.677
enough-4.603
spect-4.523
-4.501
BUS-4.393
Tokenpop
Token ID12924
Feature activation-2.26e+00

Pos logit contributions
adder+4.217
adian+4.020
etting+3.855
ortium+3.791
monary+3.669
Neg logit contributions
-3.323
 Bubble-3.187
▬▬-3.008
spect-2.870
 wisely-2.841
Token/
Token ID14
Feature activation-1.39e+00

Pos logit contributions
adder+6.105
adian+5.913
etting+5.696
ortium+5.662
 incur+5.272
Neg logit contributions
 Bubble-6.006
enough-5.872
spect-5.792
-5.719
BUS-5.621
Tokenret
Token ID1186
Feature activation+2.83e-01

Pos logit contributions
adder+3.398
adian+3.258
ortium+3.147
etting+3.081
monary+2.867
Neg logit contributions
 Bubble-4.298
-4.110
spect-4.094
enough-4.029
▬▬-3.933
Token |
Token ID930
Feature activation+4.46e-01

Pos logit contributions
 Bubble+0.757
spect+0.731
enough+0.714
+0.701
BUS+0.682
Neg logit contributions
adder-0.827
adian-0.808
ortium-0.789
etting-0.765
monary-0.735
Token shot
Token ID2823
Feature activation-3.05e-01

Pos logit contributions
 Bubble+1.368
spect+1.325
enough+1.294
+1.277
BUS+1.249
Neg logit contributions
adder-1.334
adian-1.301
ortium-1.270
etting-1.228
monary-1.173
Token showed
Token ID3751
Feature activation-5.09e-02

Pos logit contributions
adder+0.814
adian+0.786
ortium+0.769
etting+0.745
monary+0.701
Neg logit contributions
 Bubble-0.884
spect-0.860
enough-0.849
-0.835
BUS-0.818
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
adder+0.145
adian+0.141
ortium+0.139
etting+0.134
monary+0.127
Neg logit contributions
 Bubble-0.137
spect-0.133
enough-0.132
-0.130
BUS-0.126
Token single
Token ID2060
Feature activation-1.62e+00

Pos logit contributions
adder+0.301
adian+0.294
ortium+0.286
etting+0.278
monary+0.263
Neg logit contributions
 Bubble-0.279
spect-0.273
enough-0.269
-0.262
BUS-0.258
Token dumb
Token ID13526
Feature activation-1.05e+00

Pos logit contributions
adder+4.357
adian+4.270
ortium+4.097
etting+4.032
 incur+3.823
Neg logit contributions
spect-4.313
 Bubble-4.288
enough-4.236
-4.215
BUS-4.119
Tokenbell
Token ID7923
Feature activation-1.57e+00

Pos logit contributions
adder+3.468
adian+3.415
ortium+3.292
etting+3.222
 incur+3.036
Neg logit contributions
 Bubble-2.329
spect-2.260
-2.256
enough-2.202
natureconservancy-2.191
Token and
Token ID290
Feature activation-9.38e-01

Pos logit contributions
adder+3.963
adian+3.857
etting+3.658
ortium+3.653
 incur+3.413
Neg logit contributions
spect-4.520
 Bubble-4.490
-4.451
enough-4.427
natureconservancy-4.346
Token weight
Token ID3463
Feature activation-1.15e+00

Pos logit contributions
adder+2.515
adian+2.439
ortium+2.373
etting+2.289
 incur+2.182
Neg logit contributions
 Bubble-2.591
spect-2.573
enough-2.524
-2.516
natureconservancy-2.442
Tokenlifting
Token ID30510
Feature activation-4.35e-01

Pos logit contributions
adder+2.980
adian+2.850
etting+2.749
ortium+2.725
 incur+2.507
Neg logit contributions
 Bubble-3.392
spect-3.294
-3.273
enough-3.204
natureconservancy-3.131
Token plates
Token ID13375
Feature activation-1.32e+00

Pos logit contributions
adder+1.164
adian+1.113
ortium+1.080
etting+1.063
monary+0.995
Neg logit contributions
 Bubble-1.246
spect-1.229
enough-1.211
-1.199
BUS-1.160
Token stre
Token ID1937
Feature activation-6.14e-01

Pos logit contributions
adder+3.263
adian+3.136
etting+3.098
ortium+2.993
 incur+2.785
Neg logit contributions
 Bubble-3.959
-3.870
spect-3.855
enough-3.763
natureconservancy-3.748
Token amounts
Token ID6867
Feature activation+4.62e-01

Pos logit contributions
adder+0.949
adian+0.932
ortium+0.923
etting+0.885
monary+0.864
Neg logit contributions
 Bubble-0.645
spect-0.616
enough-0.597
-0.596
natureconservancy-0.574
Token of
Token ID286
Feature activation+1.65e-01

Pos logit contributions
 Bubble+1.117
spect+1.069
enough+1.048
+1.017
 wisely+0.997
Neg logit contributions
adder-1.458
adian-1.425
ortium-1.399
etting-1.351
monary-1.324
Token marijuana
Token ID5727
Feature activation-5.24e-01

Pos logit contributions
 Bubble+0.406
spect+0.392
enough+0.381
+0.375
BUS+0.363
Neg logit contributions
adder-0.517
adian-0.507
ortium-0.494
etting-0.480
monary-0.464
Token for
Token ID329
Feature activation-5.74e-01

Pos logit contributions
adder+1.375
adian+1.330
ortium+1.311
etting+1.254
monary+1.197
Neg logit contributions
 Bubble-1.545
spect-1.492
-1.456
enough-1.454
natureconservancy-1.427
Token personal
Token ID2614
Feature activation-1.02e+00

Pos logit contributions
adder+1.543
adian+1.488
ortium+1.444
etting+1.419
monary+1.361
Neg logit contributions
 Bubble-1.628
spect-1.597
enough-1.593
-1.543
natureconservancy-1.533
Token use
Token ID779
Feature activation-1.75e+00

Pos logit contributions
adder+1.972
adian+1.849
ortium+1.815
etting+1.729
 incur+1.672
Neg logit contributions
 Bubble-3.566
spect-3.545
enough-3.537
-3.480
natureconservancy-3.419
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
adder+4.253
adian+4.056
ortium+4.051
etting+3.928
 incur+3.765
Neg logit contributions
 Bubble-5.235
spect-5.032
-5.032
natureconservancy-4.991
enough-4.922
Token
Token ID198
Feature activation-5.03e-01

Pos logit contributions
adder+0.102
adian+0.099
ortium+0.097
etting+0.094
monary+0.090
Neg logit contributions
 Bubble-0.086
spect-0.083
enough-0.081
-0.080
natureconservancy-0.078
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
adder+1.329
adian+1.252
etting+1.198
ortium+1.184
monary+1.116
Neg logit contributions
 Bubble-1.456
-1.447
spect-1.391
enough-1.373
▬▬-1.341
Token"
Token ID1
Feature activation-3.19e-01

Pos logit contributions
adder+0.468
adian+0.451
ortium+0.436
etting+0.429
monary+0.408
Neg logit contributions
 Bubble-0.442
spect-0.409
-0.403
enough-0.400
 wisely-0.391
TokenWe
Token ID1135
Feature activation-5.36e-01

Pos logit contributions
adder+0.918
adian+0.873
ortium+0.850
etting+0.829
monary+0.789
Neg logit contributions
 Bubble-0.867
spect-0.819
-0.791
enough-0.788
natureconservancy-0.778
Token people
Token ID661
Feature activation+9.75e-02

Pos logit contributions
 Bubble+1.569
spect+1.515
enough+1.486
+1.452
BUS+1.423
Neg logit contributions
adder-1.747
adian-1.701
ortium-1.661
etting-1.612
monary-1.548
Token in
Token ID287
Feature activation+4.02e-01

Pos logit contributions
 Bubble+0.285
spect+0.276
enough+0.271
+0.269
BUS+0.261
Neg logit contributions
adder-0.259
adian-0.253
ortium-0.245
etting-0.238
monary-0.228
Token this
Token ID428
Feature activation+5.57e-01

Pos logit contributions
 Bubble+1.065
spect+1.025
enough+1.008
+0.984
BUS+0.967
Neg logit contributions
adder-1.178
adian-1.152
ortium-1.129
etting-1.087
monary-1.047
Token room
Token ID2119
Feature activation-1.88e-01

Pos logit contributions
 Bubble+1.495
spect+1.447
enough+1.418
+1.385
BUS+1.358
Neg logit contributions
adder-1.614
adian-1.576
ortium-1.544
etting-1.487
monary-1.432
Token are
Token ID389
Feature activation+5.07e-01

Pos logit contributions
adder+0.500
adian+0.484
ortium+0.470
etting+0.456
monary+0.436
Neg logit contributions
 Bubble-0.551
spect-0.533
enough-0.521
-0.519
natureconservancy-0.505
Token tired
Token ID10032
Feature activation-1.73e+00

Pos logit contributions
 Bubble+1.399
spect+1.354
enough+1.329
+1.309
BUS+1.272
Neg logit contributions
adder-1.428
adian-1.398
ortium-1.356
etting-1.314
monary-1.260
Token of
Token ID286
Feature activation-9.85e-02

Pos logit contributions
adder+4.379
adian+4.209
ortium+4.065
etting+3.953
omatic+3.781
Neg logit contributions
 Bubble-4.983
spect-4.733
-4.658
enough-4.652
BUS-4.651
Token waiting
Token ID4953
Feature activation+2.31e-02

Pos logit contributions
adder+0.274
adian+0.268
ortium+0.259
etting+0.252
monary+0.240
Neg logit contributions
 Bubble-0.272
spect-0.265
enough-0.260
-0.256
BUS-0.252
Token.
Token ID13
Feature activation+6.75e-01

Pos logit contributions
 Bubble+0.068
spect+0.066
enough+0.064
+0.064
BUS+0.063
Neg logit contributions
adder-0.061
adian-0.059
ortium-0.057
etting-0.056
monary-0.053
Token But
Token ID887
Feature activation+5.10e-01

Pos logit contributions
 Bubble+1.646
spect+1.601
enough+1.556
+1.528
natureconservancy+1.487
Neg logit contributions
adder-2.112
adian-2.055
ortium-2.011
etting-1.955
monary-1.876
Token it
Token ID340
Feature activation+1.56e+00

Pos logit contributions
 Bubble+1.405
spect+1.359
enough+1.326
+1.308
BUS+1.276
Neg logit contributions
adder-1.443
adian-1.408
ortium-1.372
etting-1.328
monary-1.274