TOP ACTIVATIONS
MAX = 4.431

the newly launched Fol lett Knowledge Fund , a venture capital
ounced partnerships include : Aurora Information Technology ( October 2013 ),
rights . We need the Travel ban as an extra level
stroke in 1965 When Lucy Neal was born in perfect health
had a lawyer . Eric Hew ett is expected to fully
you need to have AWS Access Keys . To generate these
coast , UCLA grad Bob Water field was back in his
, as rookie QB Bob Water field finally delivered Cleveland its
Burger Joint ) and Cory Neal of Pat O
in a vehicle without an inter lock device .
" H ame ko " Nor iy uki spoke primarily about
Sen . Cruz was no exception , a Cruz
G . Wells ' The Time Machine is published .
E BS CO Information Services compet es within the
A US 16 Steele Von Hoff ( Team Chip otle )
, should pay almost exclusive attention to her ; and at
him , might have no opportunity to dance .
field punishments , Goodell has access to police investigation records .
driving a vehicle without an inter lock device .
Berg man esque chamber piece Inter iors as a film that

MIN ACTIVATIONS
MIN = -4.582

proposals really want anti - m alle ability fixes ( which
students to access co - cur ricular activities to supplement their
Conclusion To keep things realistic , Hong Kong
the story begins Cur iously , that
to find a balance between keeping activists bullish while temper ing
this card , though . Firstly , you have to target
following at Ham ill ( m essing up his Twitter handle
NG has major drawbacks . Firstly , it s
's humble and obedient servant CM DR Kay mel Kan ,
businesses looking for on - prem ise deployment options happy and
!!! Cong rats 2 @ m f lynn JR Great day
Under chain - m igration policies , immigrants granted
vs Safe Passage M ael strom Pulse vs Paradise
is one of the st aun chest supporters of Trump 's
the new Firefox M oz illa still has a
illa 's mission -- to keep the internet open and a
extra ordinary measures ," to keep under the debt limit set
this is to highlight INTER M ITT ENT F AST ING
food chain . M ounds of tail ings from
alt an ), eternity ( M ö ng ke ), surplus

INTERVAL 2.178 - 4.431
CONTAINS 0.535%

in Brown s ear about kicking football s .
way you are . I love you so much , Anna
nova . Nobody has any interest in holding us on the
behind or touching me without permission ), then touch makes me
his job . If Cory Booker can make millions not working

INTERVAL -0.075 - 2.178
CONTAINS 54.255%

weren t familiar with the books to start off
EU - a so - called " in - out vote
toddlers , nor that he 'd be happy that they left
that gl acial system is probably out of balance , losing
dismiss es this , thinking it was Mou z Black ,

INTERVAL -2.329 - -0.075
CONTAINS 44.810%

. Option 2 Rest art Network Manager at
same mandate has a loophole that could guarantee fossil fuel production
ate their young for two months at most .
competitors , said the company s sizable spectrum assets
im utils import paths import n umpy as np import arg

INTERVAL -4.582 - -2.329
CONTAINS 0.400%

therefore retains more detail than grid - based simulations .
two days . R S enth il Kumar /
, uproar iously so from guys who have a history of
us and through that support making this kind of content available
ext = ($ text - split "\ r ")
Token the
Token ID262
Feature activation+4.41e-01

Pos logit contributions
ulate+0.217
riages+0.205
ulating+0.205
 stagn+0.199
ulates+0.195
Neg logit contributions
seeing-0.213
nor-0.197
colour-0.194
��-0.194
 Teach-0.189
Token newly
Token ID8308
Feature activation-2.58e-01

Pos logit contributions
seeing+1.250
nor+1.152
colour+1.125
 Teach+1.124
��+1.115
Neg logit contributions
ulate-1.347
riages-1.274
ulating-1.273
 stagn-1.237
ulates-1.217
Token launched
Token ID5611
Feature activation-1.01e+00

Pos logit contributions
ulate+0.593
riages+0.548
ulating+0.547
 stagn+0.526
ulates+0.516
Neg logit contributions
seeing-0.928
nor-0.869
colour-0.859
��-0.851
 Teach-0.840
Token Fol
Token ID30938
Feature activation+1.06e-01

Pos logit contributions
ulate+3.525
ulating+3.349
riages+3.347
 stagn+3.277
ulates+3.220
Neg logit contributions
seeing-2.425
nor-2.194
colour-2.185
��-2.181
ーン-2.024
Tokenlett
Token ID15503
Feature activation+5.49e-01

Pos logit contributions
seeing+0.185
nor+0.163
colour+0.156
 Teach+0.147
��+0.145
Neg logit contributions
ulate-0.443
ulating-0.427
riages-0.423
 stagn-0.418
���-0.413
Token Knowledge
Token ID20414
Feature activation+4.43e+00

Pos logit contributions
seeing+1.314
nor+1.189
colour+1.162
 Teach+1.156
��+1.140
Neg logit contributions
ulate-1.917
riages-1.827
ulating-1.826
 stagn-1.786
ulates-1.759
Token Fund
Token ID7557
Feature activation+1.72e+00

Pos logit contributions
seeing+10.929
 Teach+10.152
nor+9.552
colour+9.398
edin+9.238
Neg logit contributions
ulate-12.658
ulating-12.114
riages-12.090
ulates-11.739
 stagn-11.728
Token,
Token ID11
Feature activation+9.70e-03

Pos logit contributions
seeing+5.057
 Teach+4.599
nor+4.597
colour+4.466
��+4.380
Neg logit contributions
ulate-4.838
riages-4.564
ulating-4.532
 stagn-4.425
sei-4.397
Token a
Token ID257
Feature activation-4.71e-01

Pos logit contributions
seeing+0.029
nor+0.026
colour+0.026
��+0.026
 Teach+0.025
Neg logit contributions
ulate-0.029
riages-0.027
ulating-0.027
 stagn-0.026
sei-0.026
Token venture
Token ID13189
Feature activation+6.51e-01

Pos logit contributions
ulate+1.271
ulating+1.192
riages+1.192
 stagn+1.156
ulates+1.132
Neg logit contributions
seeing-1.501
nor-1.395
colour-1.379
��-1.376
 Teach-1.339
Token capital
Token ID3139
Feature activation-1.00e-01

Pos logit contributions
seeing+2.310
nor+2.162
colour+2.114
 Teach+2.092
��+2.087
Neg logit contributions
ulate-1.521
riages-1.398
ulating-1.393
 stagn-1.343
ulates-1.329
Tokenounced
Token ID8918
Feature activation-1.04e+00

Pos logit contributions
ulate+0.442
riages+0.406
ulating+0.406
 stagn+0.389
ulates+0.378
Neg logit contributions
seeing-0.830
nor-0.781
colour-0.773
��-0.771
 Teach-0.756
Token partnerships
Token ID22867
Feature activation+8.99e-01

Pos logit contributions
ulate+2.802
riages+2.632
ulating+2.624
 stagn+2.572
ulates+2.492
Neg logit contributions
seeing-3.257
��-3.074
nor-3.045
colour-3.026
 Teach-2.890
Token include
Token ID2291
Feature activation-1.18e-01

Pos logit contributions
seeing+2.728
nor+2.495
colour+2.463
 Teach+2.435
��+2.419
Neg logit contributions
ulate-2.502
riages-2.372
ulating-2.369
ulates-2.275
 stagn-2.266
Token:
Token ID25
Feature activation-7.37e-01

Pos logit contributions
ulate+0.368
riages+0.349
ulating+0.349
 stagn+0.339
ulates+0.333
Neg logit contributions
seeing-0.328
nor-0.301
colour-0.297
��-0.296
 Teach-0.288
Token Aurora
Token ID22218
Feature activation+6.56e-01

Pos logit contributions
ulate+2.189
riages+2.066
ulating+2.062
 stagn+2.008
ulates+1.972
Neg logit contributions
seeing-2.123
nor-1.970
colour-1.952
��-1.947
 Teach-1.855
Token Information
Token ID6188
Feature activation+4.25e+00

Pos logit contributions
seeing+1.772
nor+1.613
colour+1.581
 Teach+1.566
��+1.565
Neg logit contributions
ulate-2.090
ulating-1.984
riages-1.978
 stagn-1.933
ulates-1.905
Token Technology
Token ID8987
Feature activation+1.74e+00

Pos logit contributions
seeing+8.615
 Teach+7.401
nor+7.399
colour+7.058
 Lauder+6.992
Neg logit contributions
ulate-13.993
riages-13.519
ulating-13.454
 stagn-13.072
ulates-13.058
Token (
Token ID357
Feature activation+3.49e-01

Pos logit contributions
seeing+4.654
nor+4.217
 Teach+4.151
colour+4.024
 Lauder+3.966
Neg logit contributions
ulate-5.426
ulating-5.159
riages-5.115
 stagn-4.960
ulates-4.942
TokenOctober
Token ID18517
Feature activation+1.02e+00

Pos logit contributions
seeing+0.971
nor+0.893
colour+0.877
��+0.855
 Teach+0.841
Neg logit contributions
ulate-1.088
riages-1.033
ulating-1.033
 stagn-1.012
ulates-0.985
Token 2013
Token ID2211
Feature activation+4.56e-01

Pos logit contributions
seeing+2.443
nor+2.232
 Teach+2.126
colour+2.121
��+2.035
Neg logit contributions
ulate-3.543
ulating-3.377
riages-3.357
 stagn-3.273
ulates-3.256
Token),
Token ID828
Feature activation-2.29e-01

Pos logit contributions
seeing+1.406
nor+1.303
colour+1.274
��+1.253
 Teach+1.250
Neg logit contributions
ulate-1.280
ulating-1.205
riages-1.203
 stagn-1.166
ulates-1.146
Token rights
Token ID2489
Feature activation+8.07e-01

Pos logit contributions
ulate+1.112
riages+1.050
ulating+1.048
 stagn+1.016
ulates+0.999
Neg logit contributions
seeing-1.131
nor-1.033
colour-1.017
��-1.000
 Teach-0.995
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
seeing+2.397
nor+2.214
colour+2.141
 Teach+2.111
��+2.100
Neg logit contributions
ulate-2.347
ulating-2.210
riages-2.204
 stagn-2.154
ulates-2.126
Token We
Token ID775
Feature activation+2.95e-01

Pos logit contributions
seeing+0.386
nor+0.351
colour+0.343
��+0.339
 Teach+0.334
Neg logit contributions
ulate-0.541
riages-0.515
ulating-0.515
 stagn-0.504
ulates-0.495
Token need
Token ID761
Feature activation+1.36e+00

Pos logit contributions
seeing+0.797
nor+0.735
colour+0.717
��+0.709
 Teach+0.708
Neg logit contributions
ulate-0.936
riages-0.889
ulating-0.887
 stagn-0.856
ulates-0.853
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
seeing+3.962
nor+3.633
colour+3.575
 Teach+3.537
��+3.490
Neg logit contributions
ulate-3.910
riages-3.747
ulating-3.691
 stagn-3.589
sei-3.554
Token Travel
Token ID13524
Feature activation+4.18e+00

Pos logit contributions
seeing+3.132
nor+2.868
colour+2.813
 Teach+2.810
��+2.758
Neg logit contributions
ulate-2.904
riages-2.763
ulating-2.726
 stagn-2.629
sei-2.600
Token ban
Token ID3958
Feature activation+3.82e-04

Pos logit contributions
seeing+8.401
nor+7.429
 Teach+7.076
ーン+6.873
colour+6.729
Neg logit contributions
riages-13.334
ulate-13.298
 stagn-12.867
ulating-12.677
sei-12.673
Token as
Token ID355
Feature activation-4.59e-01

Pos logit contributions
seeing+0.001
nor+0.001
colour+0.001
��+0.001
 Teach+0.001
Neg logit contributions
ulate-0.001
riages-0.001
ulating-0.001
 stagn-0.001
ulates-0.001
Token an
Token ID281
Feature activation-7.68e-01

Pos logit contributions
ulate+1.225
ulating+1.148
riages+1.147
 stagn+1.111
ulates+1.089
Neg logit contributions
seeing-1.480
nor-1.376
colour-1.360
��-1.358
 Teach-1.322
Token extra
Token ID3131
Feature activation+1.21e+00

Pos logit contributions
ulate+2.191
ulating+2.068
riages+2.059
 stagn+2.004
ulates+1.969
Neg logit contributions
seeing-2.332
nor-2.152
��-2.134
colour-2.120
 Teach-2.056
Token level
Token ID1241
Feature activation+2.15e-01

Pos logit contributions
seeing+4.136
nor+3.840
colour+3.810
��+3.754
 Teach+3.696
Neg logit contributions
ulate-2.883
riages-2.744
ulating-2.683
 stagn-2.613
ulates-2.560
Token stroke
Token ID14000
Feature activation+1.47e-01

Pos logit contributions
ulate+1.946
ulating+1.835
riages+1.835
 stagn+1.785
ulates+1.746
Neg logit contributions
seeing-2.051
nor-1.898
��-1.887
colour-1.879
 Teach-1.814
Token in
Token ID287
Feature activation-8.34e-03

Pos logit contributions
seeing+0.473
nor+0.438
colour+0.434
��+0.428
 Teach+0.422
Neg logit contributions
ulate-0.397
ulating-0.371
riages-0.371
 stagn-0.359
ulates-0.353
Token 1965
Token ID17672
Feature activation-4.08e-01

Pos logit contributions
ulate+0.025
riages+0.024
ulating+0.024
 stagn+0.023
ulates+0.023
Neg logit contributions
seeing-0.024
nor-0.022
colour-0.022
��-0.022
 Teach-0.021
TokenWhen
Token ID2215
Feature activation-9.16e-01

Pos logit contributions
ulate+1.082
riages+1.015
ulating+1.014
 stagn+0.982
ulates+0.961
Neg logit contributions
seeing-1.318
nor-1.227
colour-1.212
��-1.211
 Teach-1.179
Token Lucy
Token ID22162
Feature activation-2.07e-01

Pos logit contributions
ulate+2.603
ulating+2.457
riages+2.449
 stagn+2.378
ulates+2.327
Neg logit contributions
seeing-2.762
��-2.578
nor-2.556
colour-2.527
 Teach-2.434
Token Neal
Token ID29189
Feature activation+4.12e+00

Pos logit contributions
ulate+0.623
riages+0.589
ulating+0.588
 stagn+0.572
ulates+0.562
Neg logit contributions
seeing-0.598
nor-0.551
colour-0.544
��-0.542
 Teach-0.526
Token was
Token ID373
Feature activation-2.41e-01

Pos logit contributions
seeing+10.746
nor+10.375
 Teach+10.004
colour+9.684
 Lauder+9.326
Neg logit contributions
ulate-11.093
ulating-10.232
riages-10.064
ulates-9.834
 stagn-9.640
Token born
Token ID4642
Feature activation-7.99e-02

Pos logit contributions
ulate+0.681
ulating+0.639
riages+0.638
 stagn+0.617
ulates+0.609
Neg logit contributions
seeing-0.745
nor-0.690
colour-0.682
��-0.669
 Teach-0.663
Token in
Token ID287
Feature activation+3.48e-01

Pos logit contributions
ulate+0.214
riages+0.201
ulating+0.201
 stagn+0.194
ulates+0.190
Neg logit contributions
seeing-0.257
nor-0.239
colour-0.236
��-0.235
 Teach-0.230
Token perfect
Token ID2818
Feature activation+2.57e-01

Pos logit contributions
seeing+0.979
nor+0.903
colour+0.889
��+0.880
 Teach+0.864
Neg logit contributions
ulate-1.069
riages-1.014
ulating-1.011
 stagn-0.983
ulates-0.968
Token health
Token ID1535
Feature activation+1.63e-01

Pos logit contributions
seeing+0.755
nor+0.701
colour+0.692
��+0.679
 Teach+0.669
Neg logit contributions
ulate-0.758
ulating-0.715
riages-0.714
 stagn-0.694
ulates-0.684
Token had
Token ID550
Feature activation+1.78e+00

Pos logit contributions
seeing+1.308
nor+1.203
colour+1.159
 Teach+1.132
��+1.110
Neg logit contributions
ulate-1.559
riages-1.470
ulating-1.469
 stagn-1.416
ulates-1.406
Token a
Token ID257
Feature activation+2.49e-01

Pos logit contributions
seeing+4.839
nor+4.446
 Teach+4.306
colour+4.226
 Lauder+4.136
Neg logit contributions
ulate-5.413
riages-5.193
ulating-5.125
ulates-4.900
sei-4.897
Token lawyer
Token ID6853
Feature activation+7.40e-01

Pos logit contributions
seeing+0.764
nor+0.709
colour+0.694
 Teach+0.687
��+0.679
Neg logit contributions
ulate-0.701
riages-0.663
ulating-0.660
 stagn-0.635
ulates-0.631
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
seeing+2.277
nor+2.125
colour+2.016
 Teach+2.009
��+1.982
Neg logit contributions
ulate-2.075
riages-1.951
ulating-1.934
 stagn-1.908
ulates-1.867
Token Eric
Token ID7651
Feature activation+1.30e+00

Pos logit contributions
ulate+0.032
riages+0.030
ulating+0.030
 stagn+0.029
ulates+0.028
Neg logit contributions
seeing-0.035
nor-0.032
colour-0.032
��-0.032
 Teach-0.031
Token Hew
Token ID30446
Feature activation+4.10e+00

Pos logit contributions
seeing+3.326
nor+3.116
 Teach+2.981
colour+2.947
��+2.930
Neg logit contributions
ulate-4.186
riages-3.980
ulating-3.965
 stagn-3.914
ulates-3.803
Tokenett
Token ID3087
Feature activation+1.20e+00

Pos logit contributions
nor+6.803
seeing+6.607
edin+5.524
colour+5.496
 Teach+5.311
Neg logit contributions
���-17.048
 exting-14.641
ulate-14.233
ulating-14.173
riages-14.067
Token is
Token ID318
Feature activation+8.30e-01

Pos logit contributions
seeing+3.375
nor+3.132
 Teach+3.016
colour+3.001
 Lauder+2.922
Neg logit contributions
ulate-3.615
ulating-3.426
riages-3.396
 stagn-3.271
ulates-3.251
Token expected
Token ID2938
Feature activation+2.34e+00

Pos logit contributions
seeing+2.523
nor+2.313
 Teach+2.278
colour+2.247
��+2.150
Neg logit contributions
ulate-2.366
ulating-2.231
riages-2.227
ulates-2.118
 stagn-2.117
Token to
Token ID284
Feature activation+1.22e+00

Pos logit contributions
seeing+5.203
nor+4.651
colour+4.388
 Teach+4.325
ーン+4.269
Neg logit contributions
ulate-8.261
ulating-7.923
riages-7.875
ulates-7.681
 stagn-7.655
Token fully
Token ID3938
Feature activation+1.99e-01

Pos logit contributions
seeing+3.123
nor+2.941
 Teach+2.804
colour+2.793
��+2.644
Neg logit contributions
ulate-3.920
ulating-3.783
riages-3.764
ulates-3.647
sei-3.578
Token you
Token ID345
Feature activation-3.57e-01

Pos logit contributions
ulate+0.236
riages+0.223
ulating+0.223
 stagn+0.216
ulates+0.212
Neg logit contributions
seeing-0.235
nor-0.217
colour-0.214
��-0.212
 Teach-0.208
Token need
Token ID761
Feature activation+1.13e+00

Pos logit contributions
ulate+1.081
riages+1.022
ulating+1.021
 stagn+0.993
ulates+0.976
Neg logit contributions
seeing-1.021
nor-0.941
colour-0.928
��-0.923
 Teach-0.900
Token to
Token ID284
Feature activation+1.50e-01

Pos logit contributions
seeing+2.715
nor+2.478
colour+2.410
��+2.389
 Teach+2.352
Neg logit contributions
ulate-3.876
riages-3.716
ulating-3.702
 stagn-3.618
ulates-3.554
Token have
Token ID423
Feature activation-3.09e-02

Pos logit contributions
seeing+0.442
nor+0.411
colour+0.404
 Teach+0.398
��+0.396
Neg logit contributions
ulate-0.438
riages-0.418
ulating-0.415
 stagn-0.401
ulates-0.396
Token AWS
Token ID30865
Feature activation-9.43e-02

Pos logit contributions
ulate+0.088
riages+0.083
ulating+0.083
 stagn+0.081
ulates+0.079
Neg logit contributions
seeing-0.094
nor-0.087
colour-0.086
��-0.085
 Teach-0.083
Token Access
Token ID8798
Feature activation+4.09e+00

Pos logit contributions
ulate+0.276
ulating+0.260
riages+0.260
 stagn+0.253
ulates+0.248
Neg logit contributions
seeing-0.280
nor-0.258
colour-0.255
��-0.255
 Teach-0.247
Token Keys
Token ID26363
Feature activation+1.06e+00

Pos logit contributions
seeing+10.359
nor+9.761
 Teach+9.039
colour+9.035
edin+8.932
Neg logit contributions
ulate-11.508
riages-11.368
ulating-10.961
���-10.786
ulates-10.751
Token.
Token ID764
Feature activation+3.73e-01

Pos logit contributions
seeing+3.211
nor+2.995
colour+2.905
��+2.873
 Teach+2.865
Neg logit contributions
ulate-2.979
riages-2.840
ulating-2.803
 stagn-2.738
ulates-2.688
Token To
Token ID1675
Feature activation+5.23e-01

Pos logit contributions
seeing+0.922
nor+0.838
colour+0.820
 Teach+0.803
��+0.802
Neg logit contributions
ulate-1.275
riages-1.216
ulating-1.215
 stagn-1.187
ulates-1.167
Token generate
Token ID7716
Feature activation-7.59e-01

Pos logit contributions
seeing+1.379
nor+1.274
colour+1.246
 Teach+1.231
��+1.209
Neg logit contributions
ulate-1.673
riages-1.602
ulating-1.592
 stagn-1.542
ulates-1.529
Token these
Token ID777
Feature activation-2.38e-01

Pos logit contributions
ulate+2.296
ulating+2.166
riages+2.157
 stagn+2.108
ulates+2.068
Neg logit contributions
seeing-2.172
nor-2.002
��-1.998
colour-1.982
 Teach-1.901
Token coast
Token ID7051
Feature activation+2.06e-02

Pos logit contributions
seeing+2.110
nor+1.936
colour+1.896
��+1.859
 Teach+1.848
Neg logit contributions
ulate-1.753
riages-1.650
ulating-1.634
ulates-1.569
 stagn-1.557
Token,
Token ID11
Feature activation+2.61e-02

Pos logit contributions
seeing+0.066
nor+0.061
colour+0.060
��+0.059
 Teach+0.059
Neg logit contributions
ulate-0.056
riages-0.052
ulating-0.052
 stagn-0.050
ulates-0.050
Token UCLA
Token ID21750
Feature activation+5.26e-01

Pos logit contributions
seeing+0.078
nor+0.072
colour+0.071
��+0.070
 Teach+0.069
Neg logit contributions
ulate-0.076
riages-0.072
ulating-0.072
 stagn-0.069
ulates-0.069
Token grad
Token ID3915
Feature activation-2.79e-01

Pos logit contributions
seeing+1.561
nor+1.440
colour+1.406
 Teach+1.394
��+1.371
Neg logit contributions
ulate-1.540
ulating-1.437
riages-1.425
ulates-1.378
 stagn-1.365
Token Bob
Token ID5811
Feature activation+3.42e-01

Pos logit contributions
ulate+0.840
riages+0.792
ulating+0.792
 stagn+0.772
ulates+0.756
Neg logit contributions
seeing-0.804
nor-0.743
colour-0.731
��-0.728
 Teach-0.713
Token Water
Token ID5638
Feature activation+4.05e+00

Pos logit contributions
seeing+0.891
nor+0.820
colour+0.799
��+0.797
 Teach+0.788
Neg logit contributions
ulate-1.124
ulating-1.066
riages-1.063
 stagn-1.039
ulates-1.024
Tokenfield
Token ID3245
Feature activation+6.49e-01

Pos logit contributions
seeing+7.735
colour+7.311
nor+6.937
bender+6.244
worthy+6.210
Neg logit contributions
���-14.976
ulate-14.177
ulating-13.608
ulates-13.240
riages-13.117
Token was
Token ID373
Feature activation+5.72e-01

Pos logit contributions
seeing+1.915
nor+1.764
colour+1.724
 Teach+1.696
��+1.681
Neg logit contributions
ulate-1.901
ulating-1.782
riages-1.778
ulates-1.707
 stagn-1.691
Token back
Token ID736
Feature activation-1.57e-01

Pos logit contributions
seeing+1.854
nor+1.710
colour+1.687
 Teach+1.662
��+1.631
Neg logit contributions
ulate-1.517
riages-1.424
ulating-1.406
ulates-1.350
 stagn-1.326
Token in
Token ID287
Feature activation+3.09e-01

Pos logit contributions
ulate+0.433
riages+0.407
ulating+0.405
 stagn+0.393
ulates+0.386
Neg logit contributions
seeing-0.497
nor-0.459
colour-0.453
��-0.449
 Teach-0.440
Token his
Token ID465
Feature activation+3.69e-01

Pos logit contributions
seeing+0.834
nor+0.766
colour+0.753
��+0.749
 Teach+0.733
Neg logit contributions
ulate-0.985
riages-0.935
ulating-0.933
 stagn-0.907
ulates-0.895
Token,
Token ID11
Feature activation-2.96e-01

Pos logit contributions
ulate+2.679
ulating+2.510
riages+2.507
 stagn+2.437
ulates+2.376
Neg logit contributions
seeing-3.257
nor-3.048
��-3.010
colour-3.005
 Teach-2.915
Token as
Token ID355
Feature activation-2.97e-01

Pos logit contributions
ulate+0.844
ulating+0.795
riages+0.794
 stagn+0.772
ulates+0.756
Neg logit contributions
seeing-0.898
nor-0.832
colour-0.822
��-0.820
 Teach-0.797
Token rookie
Token ID12302
Feature activation-1.97e+00

Pos logit contributions
ulate+0.813
riages+0.764
ulating+0.764
 stagn+0.740
ulates+0.726
Neg logit contributions
seeing-0.936
nor-0.869
colour-0.859
��-0.856
 Teach-0.834
Token QB
Token ID16135
Feature activation-1.14e+00

Pos logit contributions
ulate+5.627
riages+5.379
ulating+5.355
 stagn+5.265
ulates+5.055
Neg logit contributions
seeing-5.668
nor-5.270
colour-5.236
��-5.235
 Teach-4.977
Token Bob
Token ID5811
Feature activation-1.66e-01

Pos logit contributions
ulate+3.362
riages+3.190
ulating+3.187
 stagn+3.076
ulates+3.021
Neg logit contributions
seeing-3.299
nor-3.042
��-3.014
colour-3.007
 Teach-2.881
Token Water
Token ID5638
Feature activation+4.03e+00

Pos logit contributions
ulate+0.550
riages+0.523
ulating+0.522
 stagn+0.509
ulates+0.501
Neg logit contributions
seeing-0.430
nor-0.392
colour-0.387
��-0.386
 Teach-0.372
Tokenfield
Token ID3245
Feature activation+2.50e-01

Pos logit contributions
seeing+6.957
colour+6.788
nor+6.279
worthy+5.498
bender+5.486
Neg logit contributions
���-15.164
ulate-14.950
ulating-14.365
ulates-13.960
riages-13.841
Token finally
Token ID3443
Feature activation+3.41e-01

Pos logit contributions
seeing+0.771
nor+0.715
colour+0.706
��+0.692
 Teach+0.686
Neg logit contributions
ulate-0.703
ulating-0.659
riages-0.657
 stagn-0.630
ulates-0.628
Token delivered
Token ID6793
Feature activation+3.74e-01

Pos logit contributions
seeing+1.033
nor+0.952
colour+0.941
��+0.916
 Teach+0.914
Neg logit contributions
ulate-0.979
riages-0.918
ulating-0.918
 stagn-0.877
ulates-0.875
Token Cleveland
Token ID10306
Feature activation-6.26e-01

Pos logit contributions
seeing+1.103
nor+1.016
colour+1.005
��+0.995
 Teach+0.980
Neg logit contributions
ulate-1.089
riages-1.037
ulating-1.031
 stagn-1.000
ulates-0.983
Token its
Token ID663
Feature activation+5.28e-02

Pos logit contributions
ulate+1.816
riages+1.713
ulating+1.711
 stagn+1.662
ulates+1.630
Neg logit contributions
seeing-1.873
nor-1.733
colour-1.712
��-1.709
 Teach-1.659
Token Burger
Token ID28983
Feature activation+7.59e-01

Pos logit contributions
ulate+0.830
ulating+0.788
riages+0.788
 stagn+0.769
ulates+0.756
Neg logit contributions
seeing-0.648
nor-0.592
colour-0.582
��-0.580
 Teach-0.564
Token Joint
Token ID16798
Feature activation+9.83e-01

Pos logit contributions
seeing+2.000
nor+1.824
colour+1.777
��+1.765
 Teach+1.737
Neg logit contributions
ulate-2.467
ulating-2.341
riages-2.331
 stagn-2.286
ulates-2.254
Token)
Token ID8
Feature activation+6.65e-01

Pos logit contributions
seeing+2.925
nor+2.694
colour+2.625
��+2.580
 Teach+2.576
Neg logit contributions
ulate-2.842
ulating-2.676
riages-2.651
 stagn-2.579
ulates-2.547
Token and
Token ID290
Feature activation+8.00e-02

Pos logit contributions
seeing+2.036
nor+1.879
colour+1.842
��+1.826
 Teach+1.814
Neg logit contributions
ulate-1.884
riages-1.774
ulating-1.769
 stagn-1.717
ulates-1.685
Token Cory
Token ID28785
Feature activation+1.78e-01

Pos logit contributions
seeing+0.230
nor+0.211
colour+0.208
��+0.207
 Teach+0.204
Neg logit contributions
ulate-0.242
ulating-0.229
riages-0.228
 stagn-0.222
ulates-0.218
Token Neal
Token ID29189
Feature activation+4.01e+00

Pos logit contributions
seeing+0.479
nor+0.440
colour+0.433
��+0.431
 Teach+0.419
Neg logit contributions
ulate-0.568
riages-0.538
ulating-0.538
 stagn-0.524
ulates-0.515
Token of
Token ID286
Feature activation+5.16e-01

Pos logit contributions
seeing+10.950
nor+10.538
 Teach+10.220
colour+9.761
��+9.680
Neg logit contributions
ulate-10.362
 stagn-9.793
ulating-9.701
riages-9.575
ulates-9.393
Token Pat
Token ID3208
Feature activation+1.00e-01

Pos logit contributions
seeing+1.368
nor+1.252
colour+1.231
��+1.217
 Teach+1.205
Neg logit contributions
ulate-1.669
riages-1.585
ulating-1.585
 stagn-1.547
ulates-1.519
Token O
Token ID440
Feature activation+3.62e-01

Pos logit contributions
seeing+0.295
nor+0.272
colour+0.269
��+0.268
 Teach+0.261
Neg logit contributions
ulate-0.295
ulating-0.278
riages-0.278
 stagn-0.270
ulates-0.265
Token
Token ID447
Feature activation+2.47e-01

Pos logit contributions
seeing+1.388
nor+1.330
colour+1.298
��+1.289
 Teach+1.265
Neg logit contributions
ulate-0.725
riages-0.674
ulating-0.668
 stagn-0.653
ulates-0.623
Token
Token ID247
Feature activation+4.74e-01

Pos logit contributions
seeing+0.617
��+0.572
nor+0.568
colour+0.555
 Teach+0.540
Neg logit contributions
ulate-0.831
ulating-0.793
riages-0.791
 stagn-0.775
ulates-0.760
Token in
Token ID287
Feature activation+2.34e-01

Pos logit contributions
ulate+0.468
riages+0.439
ulating+0.438
 stagn+0.425
ulates+0.417
Neg logit contributions
seeing-0.555
nor-0.515
colour-0.508
��-0.506
 Teach-0.495
Token a
Token ID257
Feature activation-6.46e-01

Pos logit contributions
seeing+0.652
nor+0.600
colour+0.589
��+0.585
 Teach+0.574
Neg logit contributions
ulate-0.726
riages-0.689
ulating-0.687
 stagn-0.667
ulates-0.658
Token vehicle
Token ID4038
Feature activation+5.35e-02

Pos logit contributions
ulate+1.747
riages+1.642
ulating+1.640
 stagn+1.586
ulates+1.557
Neg logit contributions
seeing-2.061
nor-1.916
colour-1.891
��-1.884
 Teach-1.842
Token without
Token ID1231
Feature activation+7.64e-01

Pos logit contributions
seeing+0.169
nor+0.155
colour+0.153
��+0.150
 Teach+0.150
Neg logit contributions
ulate-0.147
ulating-0.137
riages-0.137
 stagn-0.133
ulates-0.132
Token an
Token ID281
Feature activation+8.74e-01

Pos logit contributions
seeing+2.321
nor+2.131
colour+2.083
 Teach+2.056
��+2.040
Neg logit contributions
ulate-2.147
riages-2.027
ulating-1.998
 stagn-1.959
sei-1.944
Token inter
Token ID987
Feature activation+3.92e+00

Pos logit contributions
seeing+2.698
nor+2.522
colour+2.466
 Teach+2.423
��+2.406
Neg logit contributions
ulate-2.354
riages-2.239
ulating-2.200
 stagn-2.153
ulates-2.110
Tokenlock
Token ID5354
Feature activation+1.25e+00

Pos logit contributions
seeing+8.654
nor+8.313
hook+7.505
colour+7.494
bender+7.466
Neg logit contributions
���-14.361
 exting-11.685
 stagn-11.578
ulate-11.522
riages-11.454
Token device
Token ID3335
Feature activation+1.54e+00

Pos logit contributions
seeing+3.301
nor+3.030
colour+2.932
 Teach+2.853
��+2.851
Neg logit contributions
ulate-3.895
riages-3.751
ulating-3.667
 stagn-3.625
sei-3.607
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
seeing+4.444
nor+4.093
colour+3.911
 Teach+3.865
 Lauder+3.762
Neg logit contributions
ulate-4.361
riages-4.144
ulating-4.084
sei-4.068
���-4.052
Token\n
Token ID198
Feature activation-3.08e-01

Pos logit contributions
seeing+0.812
nor+0.751
colour+0.733
��+0.727
 Teach+0.722
Neg logit contributions
ulate-0.829
riages-0.784
ulating-0.782
 stagn-0.768
ulates-0.747
Token\n
Token ID198
Feature activation+6.53e-02

Pos logit contributions
ulate+0.859
riages+0.814
ulating+0.812
 stagn+0.793
���+0.791
Neg logit contributions
seeing-0.965
nor-0.895
colour-0.873
 Teach-0.859
��-0.826
Token "
Token ID366
Feature activation+7.13e-01

Pos logit contributions
ulate+3.134
riages+2.944
ulating+2.924
 stagn+2.855
sei+2.838
Neg logit contributions
seeing-3.459
nor-3.203
colour-3.199
��-3.190
 Teach-3.049
TokenH
Token ID39
Feature activation+4.13e-01

Pos logit contributions
seeing+1.839
nor+1.676
colour+1.646
��+1.579
 Teach+1.559
Neg logit contributions
ulate-2.367
riages-2.255
ulating-2.252
 stagn-2.203
ulates-2.157
Tokename
Token ID480
Feature activation+6.25e-01

Pos logit contributions
seeing+1.386
nor+1.310
colour+1.279
��+1.249
 Teach+1.242
Neg logit contributions
ulate-1.028
riages-0.968
ulating-0.964
 stagn-0.936
ulates-0.910
Tokenko
Token ID7204
Feature activation+4.31e-01

Pos logit contributions
seeing+1.719
nor+1.588
colour+1.550
��+1.523
 Teach+1.506
Neg logit contributions
ulate-1.956
riages-1.864
ulating-1.857
 stagn-1.810
ulates-1.778
Token"
Token ID1
Feature activation+3.86e-01

Pos logit contributions
seeing+1.339
nor+1.244
colour+1.225
��+1.224
 Teach+1.195
Neg logit contributions
ulate-1.196
riages-1.126
ulating-1.126
 stagn-1.091
ulates-1.071
Token Nor
Token ID5414
Feature activation+3.86e+00

Pos logit contributions
seeing+1.072
nor+0.984
colour+0.962
��+0.958
 Teach+0.955
Neg logit contributions
ulate-1.205
ulating-1.143
riages-1.142
 stagn-1.115
ulates-1.093
Tokeniy
Token ID7745
Feature activation+1.81e+00

Pos logit contributions
seeing+10.942
nor+10.540
edin+9.996
colour+9.900
 Teach+9.787
Neg logit contributions
ulate-9.496
���-9.492
ulating-9.005
 stagn-8.768
 reorgan-8.634
Tokenuki
Token ID11308
Feature activation-2.53e-01

Pos logit contributions
seeing+5.224
nor+4.995
colour+4.756
 Teach+4.722
edin+4.716
Neg logit contributions
ulate-5.108
ulating-4.796
riages-4.796
 stagn-4.763
ulates-4.630
Token spoke
Token ID5158
Feature activation+4.11e-01

Pos logit contributions
ulate+0.717
riages+0.676
ulating+0.673
 stagn+0.658
ulates+0.641
Neg logit contributions
seeing-0.770
nor-0.713
��-0.708
colour-0.705
 Teach-0.682
Token primarily
Token ID7525
Feature activation-7.00e-01

Pos logit contributions
seeing+1.116
nor+1.013
colour+0.999
��+0.979
 Teach+0.972
Neg logit contributions
ulate-1.304
riages-1.243
ulating-1.238
 stagn-1.201
ulates-1.186
Token about
Token ID546
Feature activation-6.10e-01

Pos logit contributions
ulate+2.269
ulating+2.150
riages+2.137
 stagn+2.110
ulates+2.056
Neg logit contributions
seeing-1.834
��-1.696
nor-1.691
colour-1.664
 Teach-1.597
Token Sen
Token ID2311
Feature activation-1.01e+00

Pos logit contributions
ulate+1.503
ulating+1.411
riages+1.409
 stagn+1.370
ulates+1.341
Neg logit contributions
seeing-1.683
nor-1.560
colour-1.546
��-1.546
 Teach-1.496
Token.
Token ID13
Feature activation-7.13e-01

Pos logit contributions
ulate+2.670
riages+2.501
ulating+2.501
 stagn+2.425
ulates+2.372
Neg logit contributions
seeing-3.273
nor-3.048
colour-3.006
��-2.987
 Teach-2.931
Token Cruz
Token ID8742
Feature activation+3.44e-01

Pos logit contributions
ulate+2.041
ulating+1.924
riages+1.920
 stagn+1.868
ulates+1.828
Neg logit contributions
seeing-2.142
nor-1.986
��-1.977
colour-1.966
 Teach-1.884
Token was
Token ID373
Feature activation-1.98e-02

Pos logit contributions
seeing+1.008
nor+0.933
colour+0.912
��+0.898
 Teach+0.894
Neg logit contributions
ulate-1.013
ulating-0.954
riages-0.950
 stagn-0.920
ulates-0.908
Token no
Token ID645
Feature activation+1.16e+00

Pos logit contributions
ulate+0.055
riages+0.052
ulating+0.052
 stagn+0.050
ulates+0.049
Neg logit contributions
seeing-0.062
nor-0.057
colour-0.056
��-0.056
 Teach-0.055
Token exception
Token ID6631
Feature activation+3.85e+00

Pos logit contributions
seeing+2.502
nor+2.172
colour+2.165
 Teach+2.102
��+2.096
Neg logit contributions
ulate-4.318
riages-4.121
ulating-4.099
ulates-3.977
 stagn-3.945
Token,
Token ID11
Feature activation+6.58e-01

Pos logit contributions
seeing+9.884
nor+8.584
colour+8.105
 Teach+7.944
��+7.857
Neg logit contributions
ulate-11.267
riages-10.765
���-10.587
ulating-10.518
ulates-10.368
Token
Token ID447
Feature activation-1.52e+00

Pos logit contributions
seeing+2.085
nor+1.945
colour+1.882
��+1.872
 Teach+1.864
Neg logit contributions
ulate-1.798
riages-1.691
ulating-1.668
 stagn-1.619
ulates-1.600
Token
Token ID251
Feature activation+4.96e-01

Pos logit contributions
ulate+4.863
riages+4.623
ulating+4.615
 stagn+4.498
ulates+4.406
Neg logit contributions
seeing-4.012
nor-3.673
colour-3.663
��-3.598
 Teach-3.500
Token a
Token ID257
Feature activation-7.98e-02

Pos logit contributions
seeing+1.651
nor+1.540
colour+1.504
��+1.493
 Teach+1.477
Neg logit contributions
ulate-1.284
riages-1.205
ulating-1.187
 stagn-1.155
ulates-1.136
Token Cruz
Token ID8742
Feature activation+4.50e-01

Pos logit contributions
ulate+0.237
riages+0.225
ulating+0.224
 stagn+0.218
ulates+0.214
Neg logit contributions
seeing-0.233
nor-0.215
colour-0.212
��-0.210
 Teach-0.206
TokenG
Token ID38
Feature activation-2.03e-01

Pos logit contributions
seeing+3.514
 Teach+3.224
nor+3.214
colour+3.177
��+3.076
Neg logit contributions
ulate-4.567
riages-4.470
ulating-4.341
 stagn-4.326
sei-4.186
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
ulate+0.564
riages+0.534
ulating+0.532
 stagn+0.524
ulates+0.507
Neg logit contributions
seeing-0.627
nor-0.588
colour-0.574
 Teach-0.566
��-0.557
Token Wells
Token ID18292
Feature activation+1.75e+00

Pos logit contributions
seeing+4.079
nor+3.767
 Teach+3.716
colour+3.641
 Lauder+3.570
Neg logit contributions
ulate-6.939
riages-6.766
ulating-6.595
 stagn-6.562
ulates-6.430
Token'
Token ID6
Feature activation+1.17e+00

Pos logit contributions
seeing+5.272
nor+4.943
colour+4.875
 Teach+4.730
��+4.702
Neg logit contributions
ulate-4.820
riages-4.506
ulating-4.485
 stagn-4.288
ulates-4.255
Token The
Token ID383
Feature activation+1.55e+00

Pos logit contributions
seeing+3.423
nor+3.167
colour+3.129
 Teach+3.088
��+3.048
Neg logit contributions
ulate-3.389
riages-3.211
ulating-3.195
 stagn-3.098
sei-3.050
Token Time
Token ID3862
Feature activation+3.83e+00

Pos logit contributions
seeing+3.552
colour+3.162
nor+3.148
 Teach+3.135
��+3.022
Neg logit contributions
ulate-5.448
riages-5.182
ulating-5.160
 stagn-5.148
ulates-5.024
Token Machine
Token ID10850
Feature activation-6.50e-01

Pos logit contributions
seeing+7.088
 Teach+6.511
bender+6.294
colour+6.089
nor+6.021
Neg logit contributions
ulate-13.243
 stagn-12.917
ulating-12.625
riages-12.558
ulates-12.393
Token is
Token ID318
Feature activation+6.30e-02

Pos logit contributions
ulate+1.848
riages+1.743
ulating+1.737
 stagn+1.698
ulates+1.658
Neg logit contributions
seeing-1.974
nor-1.834
��-1.819
colour-1.807
 Teach-1.750
Token published
Token ID3199
Feature activation+6.49e-01

Pos logit contributions
seeing+0.208
nor+0.194
colour+0.192
��+0.188
 Teach+0.187
Neg logit contributions
ulate-0.164
ulating-0.153
riages-0.152
 stagn-0.146
ulates-0.145
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
seeing+2.032
nor+1.875
colour+1.856
��+1.824
 Teach+1.817
Neg logit contributions
ulate-1.779
riages-1.667
ulating-1.665
 stagn-1.609
ulates-1.594
Token
Token ID198
Feature activation+7.19e-02

Pos logit contributions
seeing+0.868
nor+0.809
colour+0.799
��+0.797
 Teach+0.775
Neg logit contributions
ulate-0.695
ulating-0.651
riages-0.651
 stagn-0.628
ulates-0.617
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
seeing+0.106
nor+0.097
colour+0.095
��+0.094
 Teach+0.093
Neg logit contributions
ulate-0.130
ulating-0.124
riages-0.124
 stagn-0.121
ulates-0.118
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
ulate+0.670
riages+0.633
ulating+0.633
���+0.625
 stagn+0.617
Neg logit contributions
seeing-0.738
nor-0.683
colour-0.665
 Teach-0.654
ーン-0.624
TokenE
Token ID36
Feature activation+1.90e-01

Pos logit contributions
seeing+0.423
nor+0.387
colour+0.380
��+0.374
 Teach+0.366
Neg logit contributions
ulate-0.521
ulating-0.495
riages-0.494
 stagn-0.483
ulates-0.473
TokenBS
Token ID4462
Feature activation-5.35e-02

Pos logit contributions
seeing+0.560
nor+0.520
colour+0.510
��+0.502
 Teach+0.496
Neg logit contributions
ulate-0.556
riages-0.525
ulating-0.523
 stagn-0.510
ulates-0.500
TokenCO
Token ID8220
Feature activation-1.01e+00

Pos logit contributions
ulate+0.166
riages+0.157
ulating+0.157
 stagn+0.153
ulates+0.150
Neg logit contributions
seeing-0.149
nor-0.137
colour-0.135
��-0.134
 Teach-0.131
Token Information
Token ID6188
Feature activation+3.80e+00

Pos logit contributions
ulate+2.968
riages+2.811
ulating+2.801
 stagn+2.747
ulates+2.675
Neg logit contributions
seeing-2.959
��-2.734
nor-2.722
colour-2.705
 Teach-2.583
Token Services
Token ID6168
Feature activation+1.37e+00

Pos logit contributions
seeing+8.806
nor+7.988
 Teach+7.691
colour+7.666
edin+7.444
Neg logit contributions
ulate-11.555
riages-11.227
ulating-11.025
sei-10.875
 stagn-10.623
Token compet
Token ID2307
Feature activation+3.67e-02

Pos logit contributions
seeing+3.884
nor+3.584
 Teach+3.464
colour+3.433
 Lauder+3.338
Neg logit contributions
ulate-4.038
ulating-3.850
riages-3.803
sei-3.635
 stagn-3.629
Tokenes
Token ID274
Feature activation-2.80e-01

Pos logit contributions
seeing+0.109
nor+0.101
colour+0.100
��+0.098
 Teach+0.097
Neg logit contributions
ulate-0.107
ulating-0.101
riages-0.101
 stagn-0.098
sei-0.096
Token within
Token ID1626
Feature activation+1.09e-01

Pos logit contributions
ulate+0.740
riages+0.693
ulating+0.693
 stagn+0.670
ulates+0.657
Neg logit contributions
seeing-0.909
nor-0.846
colour-0.836
��-0.833
 Teach-0.814
Token the
Token ID262
Feature activation-2.33e-02

Pos logit contributions
seeing+0.329
nor+0.304
colour+0.299
��+0.298
 Teach+0.294
Neg logit contributions
ulate-0.316
riages-0.299
ulating-0.298
 stagn-0.289
ulates-0.283
Token A
Token ID317
Feature activation+7.42e-01

Pos logit contributions
seeing+0.183
nor+0.169
colour+0.166
��+0.164
 Teach+0.162
Neg logit contributions
ulate-0.196
riages-0.186
ulating-0.186
 stagn-0.180
ulates-0.177
TokenUS
Token ID2937
Feature activation+1.93e-01

Pos logit contributions
seeing+2.123
nor+1.964
colour+1.921
 Teach+1.842
Lex+1.792
Neg logit contributions
ulate-2.231
���-2.216
ulating-2.142
riages-2.137
 stagn-2.084
Token 16
Token ID1467
Feature activation-4.55e-01

Pos logit contributions
seeing+0.550
nor+0.508
��+0.504
colour+0.500
 Teach+0.486
Neg logit contributions
ulate-0.587
ulating-0.555
riages-0.551
 stagn-0.536
ulates-0.529
Token Steele
Token ID28549
Feature activation+6.07e-01

Pos logit contributions
ulate+1.339
riages+1.263
ulating+1.262
 stagn+1.226
ulates+1.204
Neg logit contributions
seeing-1.344
nor-1.241
colour-1.225
��-1.223
 Teach-1.186
Token Von
Token ID26985
Feature activation+4.28e-01

Pos logit contributions
seeing+1.550
nor+1.435
colour+1.400
 Teach+1.381
��+1.376
Neg logit contributions
ulate-2.006
riages-1.922
ulating-1.916
 stagn-1.858
ulates-1.834
Token Hoff
Token ID21890
Feature activation+3.78e+00

Pos logit contributions
seeing+0.892
nor+0.803
colour+0.782
��+0.777
 Teach+0.752
Neg logit contributions
ulate-1.626
riages-1.556
ulating-1.556
 stagn-1.520
ulates-1.501
Token (
Token ID357
Feature activation+7.96e-01

Pos logit contributions
seeing+8.921
nor+8.405
 Teach+8.017
colour+7.934
 Lauder+7.696
Neg logit contributions
ulate-11.365
riages-10.780
ulating-10.578
ulates-10.501
 stagn-10.333
TokenTeam
Token ID15592
Feature activation-6.44e-01

Pos logit contributions
seeing+1.882
nor+1.728
colour+1.684
 Teach+1.593
��+1.559
Neg logit contributions
ulate-2.790
riages-2.674
ulating-2.670
 stagn-2.640
sei-2.560
Token Chip
Token ID17869
Feature activation-7.95e-02

Pos logit contributions
ulate+2.069
riages+1.966
ulating+1.957
 stagn+1.916
ulates+1.877
Neg logit contributions
seeing-1.706
��-1.561
nor-1.559
colour-1.538
 Teach-1.477
Tokenotle
Token ID23556
Feature activation+5.09e-01

Pos logit contributions
ulate+0.243
riages+0.231
ulating+0.230
 stagn+0.224
ulates+0.220
Neg logit contributions
seeing-0.225
nor-0.207
colour-0.205
��-0.201
 Teach-0.198
Token)
Token ID8
Feature activation+4.74e-01

Pos logit contributions
seeing+1.578
nor+1.462
colour+1.442
��+1.417
 Teach+1.406
Neg logit contributions
ulate-1.412
ulating-1.332
riages-1.331
 stagn-1.291
ulates-1.264
Token,
Token ID11
Feature activation+2.88e-01

Pos logit contributions
ulate+1.622
riages+1.523
ulating+1.523
 stagn+1.476
ulates+1.443
Neg logit contributions
seeing-1.980
nor-1.845
��-1.835
colour-1.824
 Teach-1.779
Token should
Token ID815
Feature activation+1.19e-01

Pos logit contributions
seeing+0.838
nor+0.771
colour+0.759
��+0.742
 Teach+0.740
Neg logit contributions
ulate-0.859
ulating-0.811
riages-0.808
 stagn-0.780
ulates-0.770
Token pay
Token ID1414
Feature activation+6.24e-01

Pos logit contributions
seeing+0.369
nor+0.344
colour+0.339
��+0.334
 Teach+0.332
Neg logit contributions
ulate-0.328
ulating-0.310
riages-0.310
 stagn-0.297
ulates-0.296
Token almost
Token ID2048
Feature activation+4.17e-01

Pos logit contributions
seeing+1.589
nor+1.446
colour+1.428
��+1.400
 Teach+1.390
Neg logit contributions
ulate-2.057
riages-1.965
ulating-1.962
 stagn-1.908
ulates-1.891
Token exclusive
Token ID8568
Feature activation+4.35e-01

Pos logit contributions
seeing+1.204
nor+1.115
colour+1.095
��+1.079
 Teach+1.064
Neg logit contributions
ulate-1.248
ulating-1.183
riages-1.179
 stagn-1.134
ulates-1.132
Token attention
Token ID3241
Feature activation+3.75e+00

Pos logit contributions
seeing+1.002
nor+0.905
colour+0.890
��+0.867
 Teach+0.851
Neg logit contributions
ulate-1.549
riages-1.484
ulating-1.483
 stagn-1.440
ulates-1.431
Token to
Token ID284
Feature activation+1.44e+00

Pos logit contributions
seeing+7.842
nor+6.950
��+6.816
colour+6.793
 Teach+6.351
Neg logit contributions
ulate-12.593
riages-12.243
ulating-12.178
 stagn-11.968
ulates-11.923
Token her
Token ID607
Feature activation+2.93e-01

Pos logit contributions
seeing+3.892
nor+3.483
colour+3.439
 Teach+3.373
��+3.274
Neg logit contributions
ulate-4.460
riages-4.284
ulating-4.243
 stagn-4.137
sei-4.131
Token;
Token ID26
Feature activation+7.89e-01

Pos logit contributions
seeing+0.921
nor+0.853
colour+0.844
��+0.829
 Teach+0.820
Neg logit contributions
ulate-0.804
ulating-0.756
riages-0.753
 stagn-0.730
ulates-0.721
Token and
Token ID290
Feature activation+1.16e-01

Pos logit contributions
seeing+2.403
nor+2.225
colour+2.172
 Teach+2.133
��+2.105
Neg logit contributions
ulate-2.237
riages-2.118
ulating-2.109
 stagn-2.030
ulates-2.022
Token at
Token ID379
Feature activation+6.87e-01

Pos logit contributions
seeing+0.365
nor+0.338
colour+0.333
��+0.329
 Teach+0.327
Neg logit contributions
ulate-0.318
ulating-0.299
riages-0.299
 stagn-0.288
ulates-0.285
Token him
Token ID683
Feature activation+3.75e-02

Pos logit contributions
seeing+2.933
nor+2.687
colour+2.668
��+2.581
 Teach+2.576
Neg logit contributions
ulate-2.546
riages-2.386
ulating-2.380
 stagn-2.288
sei-2.277
Token,
Token ID11
Feature activation+6.68e-01

Pos logit contributions
seeing+0.124
nor+0.115
colour+0.114
��+0.114
 Teach+0.111
Neg logit contributions
ulate-0.097
ulating-0.091
riages-0.091
 stagn-0.088
ulates-0.086
Token might
Token ID1244
Feature activation+3.16e-01

Pos logit contributions
seeing+2.005
nor+1.850
colour+1.831
 Teach+1.786
��+1.782
Neg logit contributions
ulate-1.909
ulating-1.809
riages-1.801
ulates-1.727
 stagn-1.719
Token have
Token ID423
Feature activation+1.06e+00

Pos logit contributions
seeing+0.952
nor+0.883
colour+0.870
��+0.856
 Teach+0.844
Neg logit contributions
ulate-0.905
ulating-0.854
riages-0.852
 stagn-0.820
ulates-0.818
Token no
Token ID645
Feature activation+7.73e-01

Pos logit contributions
seeing+3.179
nor+2.919
colour+2.880
 Teach+2.816
��+2.790
Neg logit contributions
ulate-3.024
ulating-2.858
riages-2.817
ulates-2.727
sei-2.721
Token opportunity
Token ID3663
Feature activation+3.74e+00

Pos logit contributions
seeing+2.395
nor+2.190
colour+2.177
 Teach+2.132
��+2.111
Neg logit contributions
ulate-2.161
ulating-2.026
riages-1.997
 stagn-1.941
ulates-1.934
Token to
Token ID284
Feature activation+6.12e-01

Pos logit contributions
seeing+9.201
nor+8.287
colour+7.645
 Teach+7.414
 Lauder+7.279
Neg logit contributions
ulate-11.388
���-10.856
ulating-10.685
sei-10.655
ulates-10.615
Token dance
Token ID9280
Feature activation-5.77e-01

Pos logit contributions
seeing+1.789
nor+1.657
colour+1.630
 Teach+1.592
��+1.573
Neg logit contributions
ulate-1.799
ulating-1.702
riages-1.701
ulates-1.635
sei-1.624
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
ulate+1.537
riages+1.440
ulating+1.438
 stagn+1.397
ulates+1.362
Neg logit contributions
seeing-1.862
nor-1.731
��-1.725
colour-1.714
 Teach-1.665
Token
Token ID447
Feature activation-1.07e+00

Pos logit contributions
seeing+0.350
nor+0.311
colour+0.305
��+0.305
 Teach+0.289
Neg logit contributions
ulate-0.668
riages-0.640
ulating-0.639
 stagn-0.625
ulates-0.617
Token
Token ID251
Feature activation+9.44e-01

Pos logit contributions
ulate+3.526
riages+3.353
ulating+3.345
 stagn+3.266
ulates+3.209
Neg logit contributions
seeing-2.755
nor-2.518
colour-2.485
��-2.475
 Teach-2.383
Tokenfield
Token ID3245
Feature activation-1.02e+00

Pos logit contributions
seeing+1.834
colour+1.623
nor+1.578
 Teach+1.426
��+1.393
Neg logit contributions
ulate-3.738
riages-3.617
ulating-3.610
 stagn-3.577
ulates-3.478
Token punishments
Token ID35518
Feature activation+1.30e-02

Pos logit contributions
ulate+2.967
ulating+2.804
riages+2.783
 stagn+2.734
ulates+2.663
Neg logit contributions
seeing-3.017
nor-2.824
��-2.813
colour-2.781
 Teach-2.660
Token,
Token ID11
Feature activation+2.10e-02

Pos logit contributions
seeing+0.041
nor+0.038
colour+0.037
��+0.037
 Teach+0.037
Neg logit contributions
ulate-0.036
riages-0.033
ulating-0.033
ulates-0.032
 stagn-0.032
Token Goodell
Token ID41648
Feature activation+1.47e-01

Pos logit contributions
seeing+0.062
nor+0.057
colour+0.056
��+0.056
 Teach+0.055
Neg logit contributions
ulate-0.062
riages-0.059
ulating-0.059
 stagn-0.057
ulates-0.056
Token has
Token ID468
Feature activation+6.79e-01

Pos logit contributions
seeing+0.429
nor+0.396
colour+0.389
��+0.382
 Teach+0.380
Neg logit contributions
ulate-0.440
ulating-0.414
riages-0.414
 stagn-0.398
ulates-0.396
Token access
Token ID1895
Feature activation+3.74e+00

Pos logit contributions
seeing+2.056
nor+1.908
colour+1.862
 Teach+1.844
��+1.832
Neg logit contributions
ulate-1.921
ulating-1.813
riages-1.812
ulates-1.728
 stagn-1.706
Token to
Token ID284
Feature activation+1.44e+00

Pos logit contributions
seeing+8.396
nor+7.673
��+7.388
colour+7.077
 Teach+7.066
Neg logit contributions
ulate-12.168
riages-11.765
ulating-11.589
 stagn-11.529
ulates-11.322
Token police
Token ID1644
Feature activation-2.02e-01

Pos logit contributions
seeing+4.239
nor+3.875
 Teach+3.813
colour+3.780
��+3.707
Neg logit contributions
ulate-4.066
riages-3.929
ulating-3.896
 stagn-3.729
ulates-3.708
Token investigation
Token ID3645
Feature activation+1.49e+00

Pos logit contributions
ulate+0.608
riages+0.574
ulating+0.574
 stagn+0.558
ulates+0.549
Neg logit contributions
seeing-0.581
nor-0.535
colour-0.528
��-0.525
 Teach-0.513
Token records
Token ID4406
Feature activation+2.77e-01

Pos logit contributions
seeing+3.889
nor+3.568
 Teach+3.405
colour+3.381
��+3.363
Neg logit contributions
ulate-4.693
riages-4.476
ulating-4.414
sei-4.387
 stagn-4.289
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
seeing+0.839
nor+0.777
colour+0.758
��+0.754
 Teach+0.742
Neg logit contributions
ulate-0.791
riages-0.746
ulating-0.742
 stagn-0.720
ulates-0.711
Token driving
Token ID5059
Feature activation-1.00e+00

Pos logit contributions
seeing+0.307
nor+0.285
colour+0.281
��+0.277
 Teach+0.276
Neg logit contributions
ulate-0.229
riages-0.215
ulating-0.213
 stagn-0.206
sei-0.203
Token a
Token ID257
Feature activation-6.25e-01

Pos logit contributions
ulate+2.405
ulating+2.238
riages+2.237
 stagn+2.161
ulates+2.104
Neg logit contributions
seeing-3.489
nor-3.269
colour-3.233
��-3.231
 Teach-3.144
Token vehicle
Token ID4038
Feature activation-7.18e-02

Pos logit contributions
ulate+1.783
riages+1.680
ulating+1.680
 stagn+1.626
ulates+1.601
Neg logit contributions
seeing-1.899
nor-1.758
colour-1.735
��-1.727
 Teach-1.687
Token without
Token ID1231
Feature activation+4.35e-01

Pos logit contributions
ulate+0.195
ulating+0.182
riages+0.182
ulates+0.176
 stagn+0.175
Neg logit contributions
seeing-0.229
nor-0.209
colour-0.209
 Teach-0.204
��-0.202
Token an
Token ID281
Feature activation+6.21e-01

Pos logit contributions
seeing+1.286
nor+1.159
colour+1.149
 Teach+1.125
��+1.114
Neg logit contributions
ulate-1.272
riages-1.202
ulating-1.194
 stagn-1.162
sei-1.161
Token inter
Token ID987
Feature activation+3.74e+00

Pos logit contributions
seeing+1.959
nor+1.808
colour+1.798
��+1.756
 Teach+1.753
Neg logit contributions
ulate-1.658
riages-1.567
ulating-1.553
 stagn-1.515
ulates-1.481
Tokenlock
Token ID5354
Feature activation+8.18e-01

Pos logit contributions
seeing+9.081
nor+8.547
colour+7.808
bender+7.735
hook+7.521
Neg logit contributions
���-12.330
 stagn-10.790
ulate-10.677
riages-10.444
ulating-10.167
Token device
Token ID3335
Feature activation+9.25e-01

Pos logit contributions
seeing+2.135
nor+1.947
colour+1.883
 Teach+1.829
��+1.826
Neg logit contributions
ulate-2.645
riages-2.529
ulating-2.493
 stagn-2.466
ulates-2.439
Token.
Token ID13
Feature activation-3.08e-02

Pos logit contributions
seeing+2.846
nor+2.636
colour+2.556
 Teach+2.528
��+2.486
Neg logit contributions
ulate-2.544
riages-2.389
ulating-2.383
 stagn-2.330
sei-2.329
Token
Token ID198
Feature activation-4.02e-01

Pos logit contributions
ulate+0.080
ulating+0.075
riages+0.075
 stagn+0.072
ulates+0.071
Neg logit contributions
seeing-0.101
nor-0.094
colour-0.094
��-0.093
 Teach-0.090
Token
Token ID198
Feature activation+2.56e-01

Pos logit contributions
ulate+1.110
ulating+1.047
riages+1.045
 stagn+1.018
���+0.993
Neg logit contributions
seeing-1.273
nor-1.181
colour-1.156
 Teach-1.133
��-1.107
Token Berg
Token ID24626
Feature activation+1.10e-01

Pos logit contributions
ulate+2.041
ulating+1.923
riages+1.922
 stagn+1.883
ulates+1.840
Neg logit contributions
seeing-1.888
nor-1.737
��-1.731
colour-1.716
 Teach-1.647
Tokenman
Token ID805
Feature activation-6.47e-01

Pos logit contributions
seeing+0.247
nor+0.223
colour+0.218
��+0.216
 Teach+0.210
Neg logit contributions
ulate-0.399
riages-0.381
ulating-0.381
 stagn-0.372
ulates-0.367
Tokenesque
Token ID28939
Feature activation-1.09e+00

Pos logit contributions
ulate+2.126
riages+2.015
ulating+2.014
 stagn+1.965
ulates+1.930
Neg logit contributions
seeing-1.683
nor-1.531
��-1.527
colour-1.518
 Teach-1.457
Token chamber
Token ID11847
Feature activation+1.15e-01

Pos logit contributions
ulate+3.190
riages+2.989
ulating+2.983
 stagn+2.917
ulates+2.857
Neg logit contributions
seeing-3.203
��-2.982
nor-2.963
colour-2.925
 Teach-2.818
Token piece
Token ID3704
Feature activation-8.25e-01

Pos logit contributions
seeing+0.352
nor+0.326
colour+0.322
��+0.316
 Teach+0.312
Neg logit contributions
ulate-0.327
ulating-0.308
riages-0.308
 stagn-0.300
ulates-0.293
Token Inter
Token ID4225
Feature activation+3.73e+00

Pos logit contributions
ulate+2.310
riages+2.176
ulating+2.167
 stagn+2.112
ulates+2.065
Neg logit contributions
seeing-2.519
nor-2.353
colour-2.330
��-2.327
 Teach-2.244
Tokeniors
Token ID12706
Feature activation+9.78e-01

Pos logit contributions
seeing+5.704
nor+5.192
colour+5.004
��+4.525
 Teach+4.458
Neg logit contributions
ulate-13.823
���-13.636
 stagn-13.525
ulating-13.419
sei-13.358
Token as
Token ID355
Feature activation-1.19e+00

Pos logit contributions
seeing+2.974
nor+2.737
colour+2.686
��+2.650
 Teach+2.622
Neg logit contributions
ulate-2.772
ulating-2.617
riages-2.588
 stagn-2.501
ulates-2.500
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
ulate+3.265
ulating+3.064
riages+3.044
 stagn+3.010
ulates+2.906
Neg logit contributions
seeing-3.702
nor-3.422
��-3.421
colour-3.397
 Teach-3.279
Token film
Token ID2646
Feature activation-6.51e-01

Pos logit contributions
ulate+3.440
ulating+3.211
riages+3.209
 stagn+3.169
ulates+3.041
Neg logit contributions
seeing-4.109
nor-3.824
��-3.807
colour-3.790
 Teach-3.671
Token that
Token ID326
Feature activation-1.11e+00

Pos logit contributions
ulate+1.842
riages+1.734
ulating+1.732
 stagn+1.686
ulates+1.646
Neg logit contributions
seeing-1.991
nor-1.843
colour-1.823
��-1.821
 Teach-1.772
Token proposals
Token ID11628
Feature activation+6.25e-01

Pos logit contributions
ulate+0.166
ulating+0.157
riages+0.156
 stagn+0.151
ulates+0.149
Neg logit contributions
seeing-0.182
nor-0.168
colour-0.167
��-0.163
 Teach-0.163
Token really
Token ID1107
Feature activation-8.05e-01

Pos logit contributions
seeing+1.831
nor+1.696
colour+1.665
 Teach+1.648
��+1.607
Neg logit contributions
ulate-1.797
ulating-1.718
riages-1.716
ulates-1.641
 stagn-1.631
Token want
Token ID765
Feature activation+6.19e-01

Pos logit contributions
ulate+2.335
ulating+2.187
riages+2.187
 stagn+2.160
ulates+2.089
Neg logit contributions
seeing-2.410
nor-2.229
��-2.227
colour-2.199
 Teach-2.115
Token anti
Token ID3098
Feature activation+1.53e-01

Pos logit contributions
seeing+1.584
nor+1.444
colour+1.418
��+1.406
 Teach+1.382
Neg logit contributions
ulate-2.040
riages-1.942
ulating-1.939
 stagn-1.885
ulates-1.866
Token-
Token ID12
Feature activation-5.39e-01

Pos logit contributions
seeing+0.486
nor+0.451
colour+0.446
��+0.442
 Teach+0.433
Neg logit contributions
ulate-0.413
riages-0.388
ulating-0.387
 stagn-0.375
ulates-0.369
Tokenm
Token ID76
Feature activation-4.58e+00

Pos logit contributions
ulate+1.734
ulating+1.645
riages+1.641
 stagn+1.599
ulates+1.572
Neg logit contributions
seeing-1.431
��-1.321
nor-1.313
colour-1.289
 Teach-1.264
Tokenalle
Token ID6765
Feature activation+6.03e-02

Pos logit contributions
ulate+11.449
ulating+10.885
riages+10.615
umbling+10.176
ulates+10.094
Neg logit contributions
seeing-13.693
��-13.657
ーン-12.521
colour-12.497
nor-12.442
Tokenability
Token ID1799
Feature activation-8.92e-01

Pos logit contributions
seeing+0.146
nor+0.132
��+0.131
colour+0.131
 Teach+0.126
Neg logit contributions
ulate-0.209
riages-0.200
ulating-0.199
 stagn-0.194
ulates-0.191
Token fixes
Token ID13040
Feature activation-8.53e-01

Pos logit contributions
ulate+2.501
ulating+2.353
riages+2.351
 stagn+2.286
ulates+2.234
Neg logit contributions
seeing-2.749
nor-2.555
��-2.536
colour-2.524
 Teach-2.436
Token (
Token ID357
Feature activation-7.72e-01

Pos logit contributions
ulate+2.394
ulating+2.256
riages+2.252
 stagn+2.188
ulates+2.139
Neg logit contributions
seeing-2.621
nor-2.431
��-2.419
colour-2.409
 Teach-2.330
Tokenwhich
Token ID4758
Feature activation-2.06e+00

Pos logit contributions
ulate+2.723
ulating+2.593
riages+2.580
 stagn+2.519
ulates+2.490
Neg logit contributions
seeing-1.787
��-1.661
nor-1.615
colour-1.596
 Teach-1.561
Token students
Token ID2444
Feature activation-6.87e-01

Pos logit contributions
seeing+0.230
nor+0.212
colour+0.209
��+0.208
 Teach+0.208
Neg logit contributions
ulate-0.213
ulating-0.201
riages-0.201
 stagn-0.194
ulates-0.191
Token to
Token ID284
Feature activation-4.01e-01

Pos logit contributions
ulate+2.332
ulating+2.223
riages+2.220
 stagn+2.172
ulates+2.123
Neg logit contributions
seeing-1.694
nor-1.553
colour-1.537
��-1.531
 Teach-1.462
Token access
Token ID1895
Feature activation+1.96e+00

Pos logit contributions
ulate+1.204
riages+1.137
ulating+1.136
 stagn+1.106
ulates+1.085
Neg logit contributions
seeing-1.158
nor-1.067
colour-1.054
��-1.051
 Teach-1.020
Token co
Token ID763
Feature activation-5.64e-01

Pos logit contributions
seeing+5.732
 Teach+5.357
nor+5.266
colour+5.137
��+5.023
Neg logit contributions
ulate-5.302
riages-5.085
ulating-4.969
sei-4.912
 stagn-4.866
Token-
Token ID12
Feature activation-2.00e-01

Pos logit contributions
ulate+1.506
riages+1.416
ulating+1.413
 stagn+1.370
ulates+1.340
Neg logit contributions
seeing-1.808
nor-1.679
��-1.673
colour-1.666
 Teach-1.610
Tokencur
Token ID22019
Feature activation-4.48e+00

Pos logit contributions
ulate+0.680
ulating+0.647
riages+0.646
 stagn+0.630
ulates+0.621
Neg logit contributions
seeing-0.496
nor-0.452
��-0.448
colour-0.445
 Teach-0.428
Tokenricular
Token ID41001
Feature activation-1.54e+00

Pos logit contributions
ulate+5.738
riages+5.526
ulating+5.414
ulates+4.632
sei+4.513
Neg logit contributions
��-18.229
seeing-18.156
colour-17.247
 Lauder-17.076
ーン-17.040
Token activities
Token ID4568
Feature activation+9.08e-01

Pos logit contributions
ulate+4.385
ulating+4.168
riages+4.132
 stagn+4.021
ulates+3.954
Neg logit contributions
seeing-4.571
��-4.284
nor-4.270
colour-4.269
ーン-4.034
Token to
Token ID284
Feature activation+3.80e-03

Pos logit contributions
seeing+2.679
nor+2.468
 Teach+2.413
colour+2.389
��+2.350
Neg logit contributions
ulate-2.612
riages-2.453
ulating-2.440
 stagn-2.368
ulates-2.349
Token supplement
Token ID10327
Feature activation-4.28e-02

Pos logit contributions
seeing+0.011
nor+0.010
colour+0.010
��+0.010
 Teach+0.010
Neg logit contributions
ulate-0.011
ulating-0.011
riages-0.011
 stagn-0.010
ulates-0.010
Token their
Token ID511
Feature activation-9.52e-01

Pos logit contributions
ulate+0.112
riages+0.105
ulating+0.104
 stagn+0.101
ulates+0.099
Neg logit contributions
seeing-0.140
nor-0.130
colour-0.128
��-0.128
 Teach-0.126
Token
Token ID198
Feature activation+2.13e-01

Pos logit contributions
ulate+0.329
riages+0.311
ulating+0.311
���+0.308
 stagn+0.303
Neg logit contributions
seeing-0.358
nor-0.332
colour-0.322
 Teach-0.317
ーン-0.303
TokenConclusion
Token ID21481
Feature activation-8.68e-01

Pos logit contributions
seeing+0.556
nor+0.508
colour+0.500
��+0.490
 Teach+0.480
Neg logit contributions
ulate-0.699
riages-0.665
ulating-0.664
 stagn-0.649
ulates-0.636
Token
Token ID198
Feature activation-2.05e+00

Pos logit contributions
ulate+2.169
ulating+2.020
riages+1.998
 stagn+1.940
ulates+1.909
Neg logit contributions
seeing-2.920
nor-2.734
��-2.712
colour-2.704
 Teach-2.615
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
ulate+4.633
riages+4.274
ulating+4.270
 stagn+4.066
sei+4.019
Neg logit contributions
��-7.146
seeing-7.109
colour-6.724
nor-6.680
 Teach-6.451
TokenTo
Token ID2514
Feature activation+6.83e-03

Pos logit contributions
ulate+4.450
ulating+4.203
riages+4.163
sei+4.056
ulates+4.038
Neg logit contributions
seeing-3.817
��-3.743
colour-3.497
nor-3.490
 Teach-3.466
Token keep
Token ID1394
Feature activation-4.35e+00

Pos logit contributions
seeing+0.020
nor+0.019
colour+0.019
��+0.018
 Teach+0.018
Neg logit contributions
ulate-0.020
riages-0.019
ulating-0.019
 stagn-0.018
ulates-0.018
Token things
Token ID1243
Feature activation-2.10e+00

Pos logit contributions
ulate+10.308
ulating+9.686
 stagn+9.600
ulates+8.904
riages+8.871
Neg logit contributions
seeing-12.997
��-12.866
nor-12.457
colour-12.233
ーン-11.889
Token realistic
Token ID12653
Feature activation-1.66e+00

Pos logit contributions
ulate+5.224
ulating+4.938
 stagn+4.878
riages+4.814
ulates+4.605
Neg logit contributions
seeing-6.906
��-6.652
nor-6.571
colour-6.344
 Teach-6.223
Token,
Token ID11
Feature activation-7.15e-01

Pos logit contributions
ulate+4.223
ulating+3.962
riages+3.918
 stagn+3.839
ulates+3.717
Neg logit contributions
seeing-5.423
��-5.143
nor-5.122
colour-5.048
 Teach-4.903
Token Hong
Token ID9764
Feature activation+4.24e-01

Pos logit contributions
ulate+1.979
ulating+1.861
riages+1.853
 stagn+1.806
ulates+1.767
Neg logit contributions
seeing-2.220
nor-2.062
��-2.054
colour-2.041
 Teach-1.977
Token Kong
Token ID9071
Feature activation-9.09e-01

Pos logit contributions
seeing+1.039
nor+0.930
colour+0.920
 Teach+0.871
 Lauder+0.839
Neg logit contributions
ulate-1.448
���-1.396
riages-1.382
ulating-1.375
 stagn-1.350
Token the
Token ID262
Feature activation+2.81e-01

Pos logit contributions
ulate+0.406
riages+0.382
ulating+0.382
 stagn+0.370
ulates+0.364
Neg logit contributions
seeing-0.421
nor-0.388
colour-0.384
��-0.381
 Teach-0.374
Token story
Token ID1621
Feature activation-5.22e-01

Pos logit contributions
seeing+0.864
nor+0.798
colour+0.790
��+0.782
 Teach+0.771
Neg logit contributions
ulate-0.790
riages-0.743
ulating-0.743
 stagn-0.718
ulates-0.707
Token begins
Token ID6140
Feature activation-5.79e-01

Pos logit contributions
ulate+1.489
ulating+1.404
riages+1.403
 stagn+1.367
ulates+1.339
Neg logit contributions
seeing-1.581
nor-1.472
��-1.456
colour-1.449
 Teach-1.405
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
ulate+1.551
ulating+1.453
riages+1.453
 stagn+1.408
ulates+1.379
Neg logit contributions
seeing-1.856
nor-1.728
colour-1.708
��-1.706
 Teach-1.658
Token
Token ID198
Feature activation-4.74e-01

Pos logit contributions
ulate+2.900
riages+2.712
ulating+2.711
 stagn+2.621
ulates+2.567
Neg logit contributions
seeing-3.713
nor-3.461
��-3.432
colour-3.426
 Teach-3.331
TokenCur
Token ID26628
Feature activation-4.26e+00

Pos logit contributions
ulate+1.576
riages+1.496
ulating+1.495
 stagn+1.455
ulates+1.435
Neg logit contributions
seeing-1.208
nor-1.103
��-1.099
colour-1.089
 Teach-1.055
Tokeniously
Token ID6819
Feature activation+3.00e-01

Pos logit contributions
ulate+10.520
ulating+10.113
riages+9.957
sei+9.511
ulates+9.357
Neg logit contributions
seeing-12.469
��-12.252
colour-11.463
nor-11.343
 Lauder-11.326
Token,
Token ID11
Feature activation+9.84e-02

Pos logit contributions
seeing+0.811
nor+0.743
colour+0.727
��+0.725
 Teach+0.709
Neg logit contributions
ulate-0.958
riages-0.907
ulating-0.906
 stagn-0.881
ulates-0.870
Token that
Token ID326
Feature activation-8.25e-01

Pos logit contributions
seeing+0.298
nor+0.275
colour+0.270
��+0.268
 Teach+0.265
Neg logit contributions
ulate-0.282
riages-0.267
ulating-0.266
 stagn-0.258
ulates-0.254
Token
Token ID447
Feature activation+1.73e-01

Pos logit contributions
ulate+2.443
riages+2.305
ulating+2.305
 stagn+2.249
ulates+2.202
Neg logit contributions
seeing-2.401
nor-2.218
��-2.205
colour-2.194
 Teach-2.121
Token
Token ID247
Feature activation-6.21e-01

Pos logit contributions
seeing+0.386
��+0.350
nor+0.346
colour+0.340
 Teach+0.332
Neg logit contributions
ulate-0.626
riages-0.604
ulating-0.603
 stagn-0.589
sei-0.580
Token to
Token ID284
Feature activation-3.26e-01

Pos logit contributions
ulate+1.664
riages+1.579
ulating+1.578
 stagn+1.539
ulates+1.509
Neg logit contributions
seeing-1.371
nor-1.258
colour-1.240
��-1.239
 Teach-1.193
Token find
Token ID1064
Feature activation+5.80e-01

Pos logit contributions
ulate+0.915
riages+0.863
ulating+0.862
 stagn+0.830
ulates+0.824
Neg logit contributions
seeing-1.003
nor-0.931
colour-0.919
��-0.905
 Teach-0.900
Token a
Token ID257
Feature activation-4.38e-01

Pos logit contributions
seeing+1.736
nor+1.597
colour+1.583
 Teach+1.574
��+1.531
Neg logit contributions
ulate-1.650
riages-1.552
ulating-1.540
sei-1.497
 stagn-1.489
Token balance
Token ID5236
Feature activation+3.41e-01

Pos logit contributions
ulate+1.140
riages+1.067
ulating+1.066
 stagn+1.029
ulates+1.012
Neg logit contributions
seeing-1.441
nor-1.340
colour-1.326
��-1.317
 Teach-1.294
Token between
Token ID1022
Feature activation-5.33e-01

Pos logit contributions
seeing+1.061
nor+0.977
colour+0.966
 Teach+0.948
��+0.946
Neg logit contributions
ulate-0.940
riages-0.890
ulating-0.878
 stagn-0.854
ulates-0.847
Token keeping
Token ID5291
Feature activation-4.12e+00

Pos logit contributions
ulate+1.367
riages+1.281
ulating+1.277
 stagn+1.232
ulates+1.212
Neg logit contributions
seeing-1.775
nor-1.651
colour-1.632
��-1.622
 Teach-1.596
Token activists
Token ID7941
Feature activation-1.49e+00

Pos logit contributions
ulate+10.608
 stagn+9.906
ulating+9.799
riages+9.473
ulates+9.248
Neg logit contributions
��-12.200
seeing-12.048
nor-11.251
colour-11.123
ーン-10.737
Token bullish
Token ID50079
Feature activation-9.36e-01

Pos logit contributions
ulate+3.709
ulating+3.475
riages+3.462
 stagn+3.369
ulates+3.268
Neg logit contributions
seeing-5.035
nor-4.729
��-4.705
colour-4.683
 Teach-4.536
Token while
Token ID981
Feature activation-1.44e+00

Pos logit contributions
ulate+2.624
ulating+2.472
riages+2.461
 stagn+2.396
ulates+2.341
Neg logit contributions
seeing-2.878
��-2.672
nor-2.667
colour-2.641
 Teach-2.553
Token temper
Token ID4124
Feature activation+2.82e-01

Pos logit contributions
ulate+3.528
ulating+3.342
riages+3.318
 stagn+3.290
ulates+3.095
Neg logit contributions
seeing-4.794
��-4.537
nor-4.523
colour-4.487
 Teach-4.303
Tokening
Token ID278
Feature activation-1.32e+00

Pos logit contributions
seeing+0.647
nor+0.580
colour+0.570
��+0.562
 Teach+0.548
Neg logit contributions
ulate-1.014
riages-0.969
ulating-0.960
 stagn-0.942
ulates-0.933
Token this
Token ID428
Feature activation-7.88e-01

Pos logit contributions
ulate+1.235
ulating+1.165
riages+1.164
 stagn+1.133
ulates+1.110
Neg logit contributions
seeing-1.248
nor-1.154
colour-1.140
��-1.137
 Teach-1.102
Token card
Token ID2657
Feature activation-2.16e-01

Pos logit contributions
ulate+2.522
ulating+2.384
riages+2.382
 stagn+2.324
ulates+2.282
Neg logit contributions
seeing-2.117
nor-1.946
��-1.922
colour-1.914
 Teach-1.832
Token,
Token ID11
Feature activation-9.11e-01

Pos logit contributions
ulate+0.593
riages+0.557
ulating+0.556
 stagn+0.540
ulates+0.530
Neg logit contributions
seeing-0.679
nor-0.629
colour-0.621
��-0.617
 Teach-0.604
Token though
Token ID996
Feature activation-9.80e-01

Pos logit contributions
ulate+2.674
ulating+2.519
riages+2.505
 stagn+2.467
ulates+2.409
Neg logit contributions
seeing-2.649
��-2.482
nor-2.460
colour-2.448
 Teach-2.350
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
ulate+2.547
riages+2.390
ulating+2.378
 stagn+2.313
ulates+2.252
Neg logit contributions
seeing-3.181
nor-2.982
��-2.970
colour-2.947
 Teach-2.863
Token Firstly
Token ID41039
Feature activation-3.98e+00

Pos logit contributions
ulate+3.615
ulating+3.433
riages+3.431
 stagn+3.337
ulates+3.290
Neg logit contributions
seeing-3.005
��-2.796
nor-2.777
colour-2.761
 Teach-2.581
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
ulate+8.263
ulating+7.570
riages+7.348
 stagn+7.192
sei+7.146
Neg logit contributions
seeing-12.850
��-12.632
nor-12.374
colour-12.174
 Lauder-11.510
Token you
Token ID345
Feature activation-8.94e-01

Pos logit contributions
ulate+3.527
ulating+3.324
riages+3.308
 stagn+3.238
sei+3.179
Neg logit contributions
seeing-3.732
��-3.474
colour-3.456
nor-3.450
 Teach-3.267
Token have
Token ID423
Feature activation+2.94e-01

Pos logit contributions
ulate+2.520
riages+2.360
ulating+2.359
 stagn+2.317
sei+2.254
Neg logit contributions
seeing-2.743
nor-2.538
��-2.534
colour-2.505
 Teach-2.412
Token to
Token ID284
Feature activation-8.71e-02

Pos logit contributions
seeing+0.751
nor+0.686
colour+0.676
��+0.674
 Teach+0.650
Neg logit contributions
ulate-0.981
riages-0.932
ulating-0.931
 stagn-0.910
ulates-0.894
Token target
Token ID2496
Feature activation+1.06e+00

Pos logit contributions
ulate+0.258
riages+0.245
ulating+0.244
 stagn+0.236
ulates+0.233
Neg logit contributions
seeing-0.254
nor-0.234
colour-0.232
��-0.229
 Teach-0.227
Token following
Token ID1708
Feature activation+9.81e-01

Pos logit contributions
seeing+1.461
nor+1.333
colour+1.317
 Teach+1.301
��+1.288
Neg logit contributions
ulate-1.593
riages-1.528
ulating-1.517
 stagn-1.473
ulates-1.455
Token at
Token ID379
Feature activation+4.32e-01

Pos logit contributions
seeing+2.983
nor+2.717
colour+2.679
 Teach+2.661
��+2.584
Neg logit contributions
ulate-2.722
riages-2.613
ulating-2.588
 stagn-2.498
sei-2.480
Token Ham
Token ID4345
Feature activation+5.64e-01

Pos logit contributions
seeing+1.211
nor+1.109
colour+1.084
 Teach+1.071
��+1.069
Neg logit contributions
ulate-1.331
riages-1.275
ulating-1.260
 stagn-1.226
ulates-1.209
Tokenill
Token ID359
Feature activation-3.61e-02

Pos logit contributions
seeing+1.616
nor+1.548
colour+1.466
 Teach+1.403
edin+1.385
Neg logit contributions
ulate-1.635
���-1.629
riages-1.598
ulating-1.556
 stagn-1.549
Token (
Token ID357
Feature activation+3.08e-02

Pos logit contributions
ulate+0.097
riages+0.091
ulating+0.090
 stagn+0.087
ulates+0.086
Neg logit contributions
seeing-0.117
nor-0.108
colour-0.106
��-0.105
 Teach-0.104
Tokenm
Token ID76
Feature activation-3.96e+00

Pos logit contributions
seeing+0.077
nor+0.070
colour+0.069
��+0.069
 Teach+0.067
Neg logit contributions
ulate-0.104
riages-0.099
ulating-0.099
 stagn-0.096
ulates-0.095
Tokenessing
Token ID27289
Feature activation+6.79e-01

Pos logit contributions
ulate+9.339
ulating+9.034
umbling+8.988
riages+8.646
ulates+8.368
Neg logit contributions
seeing-12.132
��-11.872
colour-11.460
nor-11.394
 Lauder-11.309
Token up
Token ID510
Feature activation+6.32e-01

Pos logit contributions
seeing+2.179
nor+2.039
colour+1.997
 Teach+1.958
��+1.952
Neg logit contributions
ulate-1.772
riages-1.705
ulating-1.671
 stagn-1.631
sei-1.594
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
seeing+1.852
nor+1.698
colour+1.685
 Teach+1.646
��+1.643
Neg logit contributions
ulate-1.846
riages-1.764
ulating-1.753
 stagn-1.708
ulates-1.672
Token Twitter
Token ID3009
Feature activation+1.57e+00

Pos logit contributions
ulate+0.337
riages+0.318
ulating+0.318
 stagn+0.308
ulates+0.303
Neg logit contributions
seeing-0.359
nor-0.332
colour-0.328
��-0.326
 Teach-0.320
Token handle
Token ID5412
Feature activation+1.71e+00

Pos logit contributions
seeing+4.491
colour+4.098
nor+4.082
 Teach+4.000
��+3.970
Neg logit contributions
ulate-4.626
riages-4.386
ulating-4.358
 stagn-4.258
ulates-4.184
TokenNG
Token ID10503
Feature activation-3.02e-01

Pos logit contributions
ulate+0.259
riages+0.243
ulating+0.242
 stagn+0.233
���+0.229
Neg logit contributions
seeing-0.364
nor-0.341
colour-0.335
 Teach-0.324
��-0.321
Token has
Token ID468
Feature activation+2.30e-01

Pos logit contributions
ulate+0.869
ulating+0.819
riages+0.818
 stagn+0.797
ulates+0.781
Neg logit contributions
seeing-0.908
nor-0.841
��-0.832
colour-0.829
 Teach-0.805
Token major
Token ID1688
Feature activation-6.93e-01

Pos logit contributions
seeing+0.685
nor+0.631
colour+0.619
��+0.611
 Teach+0.608
Neg logit contributions
ulate-0.666
riages-0.629
ulating-0.627
 stagn-0.600
ulates-0.596
Token drawbacks
Token ID42763
Feature activation-5.30e-01

Pos logit contributions
ulate+2.023
ulating+1.907
riages+1.906
 stagn+1.855
ulates+1.818
Neg logit contributions
seeing-2.058
nor-1.906
colour-1.881
��-1.881
 Teach-1.821
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
ulate+1.409
ulating+1.322
riages+1.319
 stagn+1.279
ulates+1.252
Neg logit contributions
seeing-1.711
nor-1.593
colour-1.577
��-1.576
 Teach-1.532
Token Firstly
Token ID41039
Feature activation-3.92e+00

Pos logit contributions
ulate+4.483
ulating+4.255
riages+4.215
 stagn+4.114
ulates+4.089
Neg logit contributions
seeing-3.564
��-3.442
colour-3.308
nor-3.269
 Teach-3.019
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
ulate+7.525
ulating+7.000
riages+6.410
 stagn+6.384
ulates+6.373
Neg logit contributions
seeing-13.498
��-13.215
colour-13.104
nor-13.023
 Teach-12.429
Token it
Token ID340
Feature activation-1.03e+00

Pos logit contributions
ulate+4.300
ulating+4.057
riages+3.968
 stagn+3.956
ulates+3.831
Neg logit contributions
seeing-4.988
��-4.772
colour-4.661
nor-4.628
 Teach-4.416
Token
Token ID447
Feature activation-3.26e-01

Pos logit contributions
ulate+2.851
riages+2.692
ulating+2.685
 stagn+2.651
ulates+2.569
Neg logit contributions
seeing-3.210
��-3.011
nor-2.979
colour-2.949
 Teach-2.853
Token
Token ID247
Feature activation-5.88e-01

Pos logit contributions
ulate+1.101
ulating+1.054
riages+1.052
 stagn+1.021
ulates+1.002
Neg logit contributions
seeing-0.819
��-0.748
nor-0.745
colour-0.736
 Teach-0.719
Tokens
Token ID82
Feature activation-6.96e-01

Pos logit contributions
ulate+1.818
riages+1.719
ulating+1.718
 stagn+1.678
ulates+1.649
Neg logit contributions
seeing-1.649
nor-1.512
��-1.504
colour-1.496
 Teach-1.445
Token's
Token ID338
Feature activation+2.04e-01

Pos logit contributions
ulate+0.553
riages+0.517
ulating+0.516
 stagn+0.498
ulates+0.489
Neg logit contributions
seeing-0.715
nor-0.667
��-0.660
colour-0.658
 Teach-0.640
Token humble
Token ID19695
Feature activation-7.18e-02

Pos logit contributions
seeing+0.549
nor+0.503
colour+0.499
��+0.492
 Teach+0.486
Neg logit contributions
ulate-0.651
ulating-0.618
riages-0.617
 stagn-0.600
ulates-0.592
Token and
Token ID290
Feature activation-6.35e-01

Pos logit contributions
ulate+0.200
riages+0.188
ulating+0.188
 stagn+0.182
ulates+0.179
Neg logit contributions
seeing-0.223
nor-0.207
��-0.205
colour-0.204
 Teach-0.198
Token obedient
Token ID41652
Feature activation+1.60e-02

Pos logit contributions
ulate+1.624
ulating+1.515
riages+1.513
 stagn+1.471
ulates+1.431
Neg logit contributions
seeing-2.112
nor-1.977
��-1.962
colour-1.944
 Teach-1.891
Token servant
Token ID18807
Feature activation+7.50e-01

Pos logit contributions
seeing+0.049
nor+0.046
colour+0.045
��+0.045
 Teach+0.044
Neg logit contributions
ulate-0.045
ulating-0.042
riages-0.042
 stagn-0.041
ulates-0.040
TokenCM
Token ID24187
Feature activation-3.90e+00

Pos logit contributions
seeing+2.261
nor+2.092
colour+2.061
��+2.018
 Teach+2.014
Neg logit contributions
ulate-2.145
ulating-2.024
riages-2.014
 stagn-1.974
ulates-1.932
TokenDR
Token ID7707
Feature activation-1.44e+00

Pos logit contributions
ulate+11.457
ulating+10.872
riages+10.758
OCK+10.374
ulates+10.342
Neg logit contributions
seeing-9.677
��-8.910
ーン-8.520
colour-8.520
nor-8.484
Token Kay
Token ID17356
Feature activation+7.65e-01

Pos logit contributions
ulate+4.376
riages+4.138
ulating+4.134
 stagn+3.994
ulates+3.938
Neg logit contributions
seeing-4.003
��-3.689
nor-3.675
colour-3.635
ーン-3.469
Tokenmel
Token ID17694
Feature activation-2.49e-01

Pos logit contributions
seeing+2.186
nor+2.055
colour+1.988
��+1.953
 Teach+1.931
Neg logit contributions
ulate-2.302
ulating-2.180
riages-2.180
 stagn-2.132
ulates-2.083
Token Kan
Token ID14248
Feature activation-5.55e-01

Pos logit contributions
ulate+0.759
riages+0.718
ulating+0.718
 stagn+0.698
ulates+0.686
Neg logit contributions
seeing-0.710
nor-0.654
colour-0.645
��-0.643
 Teach-0.625
Token,
Token ID11
Feature activation+6.08e-01

Pos logit contributions
ulate+1.560
ulating+1.467
riages+1.465
 stagn+1.418
ulates+1.395
Neg logit contributions
seeing-1.708
nor-1.578
colour-1.564
��-1.561
 Teach-1.521
Token businesses
Token ID5692
Feature activation-3.44e+00

Pos logit contributions
ulate+8.018
ulating+7.469
riages+7.357
 stagn+7.354
ulates+7.121
Neg logit contributions
seeing-8.585
��-8.548
nor-8.043
colour-8.015
ーン-7.578
Token looking
Token ID2045
Feature activation-1.12e+00

Pos logit contributions
ulate+8.701
 stagn+8.282
ulating+8.112
riages+8.095
 reorgan+7.689
Neg logit contributions
��-10.322
seeing-10.255
nor-9.851
colour-9.574
ーン-9.360
Token for
Token ID329
Feature activation+1.24e-01

Pos logit contributions
ulate+3.390
ulating+3.223
riages+3.213
 stagn+3.198
ulates+3.060
Neg logit contributions
seeing-3.126
nor-2.934
��-2.907
colour-2.857
 Teach-2.786
Token on
Token ID319
Feature activation+1.67e-01

Pos logit contributions
seeing+0.391
nor+0.362
colour+0.358
��+0.353
 Teach+0.349
Neg logit contributions
ulate-0.341
riages-0.320
ulating-0.320
 stagn-0.309
ulates-0.304
Token-
Token ID12
Feature activation+2.06e-01

Pos logit contributions
seeing+0.520
nor+0.480
colour+0.476
��+0.465
 Teach+0.460
Neg logit contributions
ulate-0.466
riages-0.438
ulating-0.438
 stagn-0.424
ulates-0.416
Tokenprem
Token ID31605
Feature activation-3.90e+00

Pos logit contributions
seeing+0.621
nor+0.573
colour+0.568
��+0.544
 Teach+0.541
Neg logit contributions
ulate-0.597
ulating-0.564
riages-0.564
 stagn-0.548
ulates-0.538
Tokenise
Token ID786
Feature activation-1.93e+00

Pos logit contributions
ulate+6.447
riages+5.870
 stagn+5.618
sei+5.587
ulating+5.534
Neg logit contributions
��-14.803
seeing-14.787
nor-13.900
colour-13.767
bender-13.595
Token deployment
Token ID14833
Feature activation-4.93e-01

Pos logit contributions
ulate+5.192
riages+4.842
ulating+4.837
 stagn+4.775
ulates+4.597
Neg logit contributions
seeing-5.925
��-5.732
nor-5.578
colour-5.486
ーン-5.291
Token options
Token ID3689
Feature activation+5.30e-01

Pos logit contributions
ulate+1.387
riages+1.305
ulating+1.305
 stagn+1.264
ulates+1.241
Neg logit contributions
seeing-1.519
nor-1.407
colour-1.391
��-1.381
 Teach-1.353
Token happy
Token ID3772
Feature activation-1.69e-01

Pos logit contributions
seeing+1.620
nor+1.503
colour+1.473
 Teach+1.453
��+1.443
Neg logit contributions
ulate-1.469
riages-1.392
ulating-1.385
 stagn-1.326
ulates-1.324
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
ulate+0.461
ulating+0.433
riages+0.432
 stagn+0.419
ulates+0.412
Neg logit contributions
seeing-0.534
nor-0.496
colour-0.490
��-0.487
 Teach-0.476
Token!!!
Token ID10185
Feature activation-5.86e-01

Pos logit contributions
seeing+1.751
nor+1.604
colour+1.575
��+1.536
 Teach+1.534
Neg logit contributions
ulate-1.750
ulating-1.653
riages-1.653
 stagn-1.601
ulates-1.583
Token Cong
Token ID2908
Feature activation-1.53e+00

Pos logit contributions
ulate+1.867
ulating+1.768
riages+1.765
 stagn+1.717
ulates+1.689
Neg logit contributions
seeing-1.565
nor-1.439
��-1.428
colour-1.427
 Teach-1.365
Tokenrats
Token ID46714
Feature activation-6.91e-01

Pos logit contributions
ulate+2.028
ulating+1.766
riages+1.723
ulates+1.550
 stagn+1.536
Neg logit contributions
seeing-6.966
��-6.680
colour-6.638
nor-6.604
 Teach-6.437
Token 2
Token ID362
Feature activation-1.71e+00

Pos logit contributions
ulate+2.141
ulating+2.021
riages+2.016
 stagn+1.960
ulates+1.924
Neg logit contributions
seeing-1.920
nor-1.767
��-1.760
colour-1.756
 Teach-1.682
Token @
Token ID2488
Feature activation+5.27e-01

Pos logit contributions
ulate+4.891
riages+4.607
ulating+4.584
 stagn+4.452
sei+4.359
Neg logit contributions
seeing-4.928
��-4.679
nor-4.567
colour-4.552
 Teach-4.357
Tokenm
Token ID76
Feature activation-3.85e+00

Pos logit contributions
seeing+1.430
nor+1.318
colour+1.292
��+1.267
 Teach+1.237
Neg logit contributions
ulate-1.669
riages-1.583
ulating-1.583
 stagn-1.552
ulates-1.514
Tokenf
Token ID69
Feature activation-1.01e+00

Pos logit contributions
ulate+10.631
ulating+9.865
riages+9.769
ulates+9.457
umbling+9.432
Neg logit contributions
seeing-10.384
��-10.242
 Teach-9.625
colour-9.476
 Lauder-9.366
Tokenlynn
Token ID12935
Feature activation-1.16e+00

Pos logit contributions
ulate+3.213
ulating+3.044
riages+3.041
 stagn+2.947
ulates+2.913
Neg logit contributions
seeing-2.735
nor-2.506
��-2.501
colour-2.476
 Teach-2.411
TokenJR
Token ID44817
Feature activation-4.17e-01

Pos logit contributions
ulate+3.413
riages+3.237
ulating+3.210
 stagn+3.102
sei+3.077
Neg logit contributions
seeing-3.378
��-3.141
nor-3.083
colour-3.072
 Teach-2.999
Token Great
Token ID3878
Feature activation-6.79e-01

Pos logit contributions
ulate+1.212
riages+1.143
ulating+1.140
 stagn+1.106
ulates+1.085
Neg logit contributions
seeing-1.242
nor-1.149
��-1.145
colour-1.138
 Teach-1.096
Token day
Token ID1110
Feature activation-1.17e+00

Pos logit contributions
ulate+2.052
riages+1.943
ulating+1.930
 stagn+1.876
sei+1.840
Neg logit contributions
seeing-1.918
��-1.800
nor-1.778
colour-1.762
 Teach-1.683
Token
Token ID198
Feature activation-2.73e-01

Pos logit contributions
seeing+1.359
nor+1.245
 Teach+1.215
colour+1.196
��+1.167
Neg logit contributions
ulate-1.672
ulating-1.586
riages-1.582
 stagn-1.553
ulates-1.521
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
ulate+0.770
riages+0.729
ulating+0.728
���+0.714
 stagn+0.710
Neg logit contributions
seeing-0.851
nor-0.786
colour-0.766
 Teach-0.755
ーン-0.721
TokenUnder
Token ID9203
Feature activation+3.50e-01

Pos logit contributions
seeing+0.280
nor+0.255
colour+0.250
��+0.245
 Teach+0.241
Neg logit contributions
ulate-0.368
riages-0.351
ulating-0.350
 stagn-0.342
ulates-0.336
Token chain
Token ID6333
Feature activation-1.69e+00

Pos logit contributions
seeing+1.073
nor+0.991
colour+0.974
��+0.964
 Teach+0.955
Neg logit contributions
ulate-0.986
riages-0.932
ulating-0.927
 stagn-0.899
ulates-0.884
Token-
Token ID12
Feature activation-3.65e-01

Pos logit contributions
ulate+2.987
ulating+2.736
riages+2.675
 stagn+2.523
ulates+2.492
Neg logit contributions
seeing-6.859
��-6.541
nor-6.484
colour-6.467
 Teach-6.337
Tokenm
Token ID76
Feature activation-3.83e+00

Pos logit contributions
ulate+1.216
ulating+1.155
riages+1.153
 stagn+1.123
ulates+1.108
Neg logit contributions
seeing-0.935
nor-0.851
��-0.844
colour-0.840
 Teach-0.815
Tokenigration
Token ID4254
Feature activation-2.68e-01

Pos logit contributions
ulate+6.465
ulating+6.031
riages+5.610
ulates+5.374
umbling+5.174
Neg logit contributions
seeing-15.278
��-15.170
colour-14.625
nor-14.336
 Teach-14.260
Token policies
Token ID4788
Feature activation-1.17e+00

Pos logit contributions
ulate+0.786
riages+0.742
ulating+0.740
 stagn+0.718
sei+0.707
Neg logit contributions
seeing-0.792
nor-0.728
colour-0.717
��-0.706
 Teach-0.701
Token,
Token ID11
Feature activation-4.17e-01

Pos logit contributions
ulate+3.006
ulating+2.819
riages+2.809
 stagn+2.734
ulates+2.655
Neg logit contributions
seeing-3.890
nor-3.632
��-3.615
colour-3.611
 Teach-3.462
Token immigrants
Token ID7971
Feature activation+1.01e-01

Pos logit contributions
ulate+1.159
ulating+1.089
riages+1.088
 stagn+1.054
ulates+1.035
Neg logit contributions
seeing-1.296
nor-1.201
colour-1.184
��-1.175
 Teach-1.157
Token granted
Token ID7520
Feature activation+9.86e-01

Pos logit contributions
seeing+0.296
nor+0.272
colour+0.265
 Teach+0.261
��+0.259
Neg logit contributions
ulate-0.299
ulating-0.282
riages-0.281
ulates-0.271
 stagn-0.270
Token vs
Token ID3691
Feature activation-9.90e-01

Pos logit contributions
seeing+4.347
nor+4.098
colour+4.047
��+4.017
 Teach+3.982
Neg logit contributions
ulate-2.018
riages-1.885
ulating-1.860
 stagn-1.780
ulates-1.723
Token Safe
Token ID19978
Feature activation+5.64e-01

Pos logit contributions
ulate+3.349
ulating+3.171
riages+3.169
 stagn+3.099
ulates+3.045
Neg logit contributions
seeing-2.448
nor-2.225
��-2.206
colour-2.201
 Teach-2.069
Token Passage
Token ID46169
Feature activation+1.29e+00

Pos logit contributions
seeing+1.412
nor+1.281
colour+1.249
��+1.233
 Teach+1.216
Neg logit contributions
ulate-1.914
riages-1.826
ulating-1.826
 stagn-1.788
ulates-1.755
Token
Token ID198
Feature activation-7.63e-01

Pos logit contributions
seeing+3.518
nor+3.210
 Teach+3.119
colour+3.111
��+3.090
Neg logit contributions
ulate-4.019
ulating-3.858
riages-3.831
ulates-3.697
 stagn-3.692
Token
Token ID198
Feature activation-5.00e-01

Pos logit contributions
ulate+2.058
riages+1.936
ulating+1.934
 stagn+1.877
ulates+1.835
Neg logit contributions
seeing-2.458
nor-2.284
colour-2.246
 Teach-2.194
��-2.186
TokenM
Token ID44
Feature activation-3.83e+00

Pos logit contributions
ulate+1.696
ulating+1.607
riages+1.606
 stagn+1.563
ulates+1.546
Neg logit contributions
seeing-1.239
��-1.128
nor-1.126
colour-1.109
 Teach-1.077
Tokenael
Token ID3010
Feature activation-4.34e-01

Pos logit contributions
ulate+10.010
ulating+9.283
riages+9.201
umbling+8.920
ulates+8.784
Neg logit contributions
seeing-11.259
��-10.629
colour-10.141
ーン-10.057
nor-10.025
Tokenstrom
Token ID20282
Feature activation-5.98e-01

Pos logit contributions
ulate+1.450
ulating+1.376
riages+1.371
 stagn+1.336
ulates+1.319
Neg logit contributions
seeing-1.112
nor-1.009
��-1.002
colour-0.999
 Teach-0.961
Token Pulse
Token ID25062
Feature activation+5.39e-01

Pos logit contributions
ulate+2.002
riages+1.900
ulating+1.897
 stagn+1.863
ulates+1.813
Neg logit contributions
seeing-1.512
nor-1.376
colour-1.355
��-1.350
 Teach-1.297
Token vs
Token ID3691
Feature activation-8.60e-01

Pos logit contributions
seeing+2.216
nor+2.093
colour+2.072
��+2.068
 Teach+2.035
Neg logit contributions
ulate-0.953
riages-0.871
ulating-0.866
 stagn-0.822
ulates-0.798
Token Paradise
Token ID20494
Feature activation-4.04e-02

Pos logit contributions
ulate+2.922
ulating+2.769
riages+2.766
 stagn+2.700
ulates+2.656
Neg logit contributions
seeing-2.120
nor-1.929
colour-1.909
��-1.908
 Teach-1.792
Token is
Token ID318
Feature activation+5.68e-01

Pos logit contributions
seeing+1.752
nor+1.627
colour+1.576
 Teach+1.563
��+1.521
Neg logit contributions
ulate-1.797
riages-1.686
ulating-1.684
 stagn-1.610
ulates-1.610
Token one
Token ID530
Feature activation+5.34e-01

Pos logit contributions
seeing+1.706
nor+1.559
colour+1.548
 Teach+1.536
��+1.507
Neg logit contributions
ulate-1.633
riages-1.539
ulating-1.527
ulates-1.464
 stagn-1.462
Token of
Token ID286
Feature activation+1.56e+00

Pos logit contributions
seeing+1.708
nor+1.595
colour+1.571
��+1.567
 Teach+1.524
Neg logit contributions
ulate-1.441
ulating-1.354
riages-1.344
 stagn-1.308
ulates-1.281
Token the
Token ID262
Feature activation+7.77e-01

Pos logit contributions
seeing+4.089
nor+3.634
 Teach+3.600
colour+3.595
��+3.542
Neg logit contributions
ulate-4.946
riages-4.769
ulating-4.667
sei-4.550
ulates-4.512
Token st
Token ID336
Feature activation-1.64e+00

Pos logit contributions
seeing+2.346
nor+2.137
colour+2.129
 Teach+2.100
��+2.053
Neg logit contributions
ulate-2.209
riages-2.089
ulating-2.082
ulates-1.993
 stagn-1.986
Tokenaun
Token ID1942
Feature activation-3.83e+00

Pos logit contributions
ulate+3.127
ulating+2.865
riages+2.829
ulates+2.653
 stagn+2.633
Neg logit contributions
seeing-6.428
��-6.304
nor-6.078
colour-6.078
 Teach-5.937
Tokenchest
Token ID46713
Feature activation-1.24e+00

Pos logit contributions
ulate+8.683
ulating+8.429
riages+8.239
 stagn+8.081
ulates+7.700
Neg logit contributions
��-11.992
seeing-11.667
ーン-11.303
nor-11.283
colour-11.137
Token supporters
Token ID5941
Feature activation+7.97e-02

Pos logit contributions
ulate+3.280
ulating+3.079
riages+3.054
 stagn+2.990
ulates+2.917
Neg logit contributions
seeing-3.996
��-3.734
nor-3.733
colour-3.676
 Teach-3.540
Token of
Token ID286
Feature activation-1.03e-01

Pos logit contributions
seeing+0.252
nor+0.234
colour+0.232
��+0.231
 Teach+0.225
Neg logit contributions
ulate-0.217
ulating-0.204
riages-0.204
 stagn-0.198
ulates-0.194
Token Trump
Token ID1301
Feature activation+3.49e-01

Pos logit contributions
ulate+0.353
riages+0.338
ulating+0.335
 stagn+0.328
ulates+0.323
Neg logit contributions
seeing-0.252
nor-0.230
colour-0.225
��-0.223
 Teach-0.219
Token's
Token ID338
Feature activation+3.29e-01

Pos logit contributions
seeing+1.116
nor+1.040
colour+1.016
��+1.005
 Teach+0.998
Neg logit contributions
ulate-0.936
riages-0.875
ulating-0.866
 stagn-0.849
ulates-0.830
Token the
Token ID262
Feature activation-7.40e-03

Pos logit contributions
seeing+0.323
nor+0.300
colour+0.295
 Teach+0.291
��+0.290
Neg logit contributions
ulate-0.286
riages-0.270
ulating-0.270
 stagn-0.258
ulates-0.256
Token new
Token ID649
Feature activation+3.65e-01

Pos logit contributions
ulate+0.022
riages+0.021
ulating+0.021
 stagn+0.020
ulates+0.020
Neg logit contributions
seeing-0.022
nor-0.020
colour-0.020
��-0.019
 Teach-0.019
Token Firefox
Token ID16802
Feature activation+9.72e-01

Pos logit contributions
seeing+1.198
nor+1.120
colour+1.100
 Teach+1.094
��+1.083
Neg logit contributions
ulate-0.941
riages-0.891
ulating-0.885
 stagn-0.841
ulates-0.835
Token
Token ID198
Feature activation-6.09e-01

Pos logit contributions
seeing+2.905
nor+2.694
 Teach+2.631
colour+2.616
��+2.512
Neg logit contributions
ulate-2.757
ulating-2.618
riages-2.572
ulates-2.479
sei-2.469
Token
Token ID198
Feature activation-2.58e-01

Pos logit contributions
ulate+1.651
riages+1.554
ulating+1.553
 stagn+1.510
ulates+1.473
Neg logit contributions
seeing-1.955
nor-1.814
colour-1.780
 Teach-1.742
��-1.726
TokenM
Token ID44
Feature activation-3.81e+00

Pos logit contributions
ulate+0.872
riages+0.828
ulating+0.828
 stagn+0.807
ulates+0.795
Neg logit contributions
seeing-0.650
nor-0.592
colour-0.584
��-0.584
 Teach-0.563
Tokenoz
Token ID8590
Feature activation+2.26e-01

Pos logit contributions
ulate+8.976
ulating+8.393
riages+8.231
umbling+8.017
ulates+7.934
Neg logit contributions
seeing-11.952
��-11.374
colour-10.833
nor-10.789
 Teach-10.637
Tokenilla
Token ID5049
Feature activation-1.35e+00

Pos logit contributions
seeing+0.842
nor+0.798
colour+0.778
 Teach+0.757
��+0.745
Neg logit contributions
ulate-0.482
riages-0.453
ulating-0.445
���-0.440
 stagn-0.435
Token still
Token ID991
Feature activation-1.81e+00

Pos logit contributions
ulate+3.753
riages+3.528
ulating+3.523
 stagn+3.500
ulates+3.351
Neg logit contributions
seeing-4.115
��-3.870
nor-3.835
colour-3.808
 Teach-3.630
Token has
Token ID468
Feature activation+1.61e-01

Pos logit contributions
ulate+5.203
 stagn+4.932
ulating+4.899
riages+4.898
ulates+4.755
Neg logit contributions
seeing-5.216
��-5.140
nor-4.916
colour-4.892
 Teach-4.647
Token a
Token ID257
Feature activation-5.09e-01

Pos logit contributions
seeing+0.474
nor+0.437
colour+0.431
��+0.428
 Teach+0.419
Neg logit contributions
ulate-0.473
riages-0.446
ulating-0.446
 stagn-0.432
ulates-0.425
Tokenilla
Token ID5049
Feature activation-7.08e-01

Pos logit contributions
seeing+2.418
nor+2.305
colour+2.217
 Teach+2.161
edin+2.141
Neg logit contributions
ulate-1.494
���-1.451
riages-1.413
ulating-1.393
 stagn-1.365
Token's
Token ID338
Feature activation+5.99e-01

Pos logit contributions
ulate+1.989
riages+1.872
ulating+1.870
 stagn+1.825
ulates+1.780
Neg logit contributions
seeing-2.176
nor-2.018
��-2.001
colour-1.998
 Teach-1.931
Token mission
Token ID4365
Feature activation+3.63e-01

Pos logit contributions
seeing+1.911
nor+1.758
colour+1.722
 Teach+1.713
��+1.679
Neg logit contributions
ulate-1.616
riages-1.522
ulating-1.515
ulates-1.441
 stagn-1.431
Token --
Token ID1377
Feature activation+2.61e-02

Pos logit contributions
seeing+1.064
nor+0.969
colour+0.947
��+0.941
 Teach+0.932
Neg logit contributions
ulate-1.085
ulating-1.022
riages-1.021
 stagn-0.984
sei-0.978
Token to
Token ID284
Feature activation-1.34e+00

Pos logit contributions
seeing+0.073
nor+0.067
colour+0.066
��+0.066
 Teach+0.064
Neg logit contributions
ulate-0.081
riages-0.077
ulating-0.076
 stagn-0.074
ulates-0.073
Token keep
Token ID1394
Feature activation-3.79e+00

Pos logit contributions
ulate+3.752
ulating+3.506
riages+3.497
 stagn+3.479
ulates+3.324
Neg logit contributions
seeing-4.099
��-3.799
nor-3.790
colour-3.764
ーン-3.590
Token the
Token ID262
Feature activation-8.55e-01

Pos logit contributions
ulate+9.027
 stagn+8.544
ulating+8.466
riages+8.147
ulates+7.886
Neg logit contributions
seeing-11.727
��-11.605
nor-11.379
colour-11.270
ーン-10.961
Token internet
Token ID5230
Feature activation+7.94e-01

Pos logit contributions
ulate+2.261
ulating+2.118
riages+2.115
 stagn+2.079
ulates+2.008
Neg logit contributions
seeing-2.764
nor-2.586
��-2.552
colour-2.541
 Teach-2.459
Token open
Token ID1280
Feature activation+1.82e-02

Pos logit contributions
seeing+2.623
nor+2.385
colour+2.371
 Teach+2.363
��+2.280
Neg logit contributions
ulate-2.028
ulating-1.882
riages-1.871
sei-1.801
ulates-1.796
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
seeing+0.056
nor+0.052
colour+0.051
��+0.051
 Teach+0.050
Neg logit contributions
ulate-0.051
ulating-0.048
riages-0.048
 stagn-0.046
ulates-0.046
Token a
Token ID257
Feature activation-7.12e-01

Pos logit contributions
ulate+2.597
ulating+2.420
riages+2.412
 stagn+2.365
ulates+2.277
Neg logit contributions
seeing-3.569
nor-3.367
��-3.344
colour-3.325
 Teach-3.194
Tokenextra
Token ID26086
Feature activation+3.40e-01

Pos logit contributions
seeing+0.881
nor+0.801
colour+0.792
��+0.765
 Teach+0.753
Neg logit contributions
ulate-1.074
riages-1.023
ulating-1.019
 stagn-0.996
ulates-0.976
Tokenordinary
Token ID35947
Feature activation-1.05e-01

Pos logit contributions
seeing+0.669
nor+0.588
colour+0.576
��+0.552
 Teach+0.544
Neg logit contributions
ulate-1.327
riages-1.281
ulating-1.273
 stagn-1.252
ulates-1.231
Token measures
Token ID5260
Feature activation+2.05e-01

Pos logit contributions
ulate+0.300
riages+0.283
ulating+0.282
 stagn+0.274
ulates+0.269
Neg logit contributions
seeing-0.316
nor-0.289
colour-0.286
 Teach-0.278
��-0.278
Token,"
Token ID553
Feature activation+5.55e-01

Pos logit contributions
seeing+0.599
nor+0.549
colour+0.539
��+0.534
 Teach+0.525
Neg logit contributions
ulate-0.609
riages-0.576
ulating-0.574
 stagn-0.561
ulates-0.550
Token to
Token ID284
Feature activation+4.80e-01

Pos logit contributions
seeing+1.406
nor+1.272
colour+1.232
 Teach+1.210
��+1.190
Neg logit contributions
ulate-1.854
riages-1.780
ulating-1.746
 stagn-1.714
sei-1.704
Token keep
Token ID1394
Feature activation-3.78e+00

Pos logit contributions
seeing+1.419
nor+1.319
 Teach+1.298
colour+1.297
��+1.252
Neg logit contributions
ulate-1.352
riages-1.296
ulating-1.290
ulates-1.236
sei-1.232
Token under
Token ID739
Feature activation-1.05e-01

Pos logit contributions
ulate+8.822
ulating+8.289
 stagn+8.214
riages+7.978
ulates+7.660
Neg logit contributions
seeing-12.310
��-12.170
nor-11.826
colour-11.779
ーン-11.147
Token the
Token ID262
Feature activation+1.42e-01

Pos logit contributions
ulate+0.296
riages+0.279
ulating+0.278
 stagn+0.270
ulates+0.265
Neg logit contributions
seeing-0.323
nor-0.298
colour-0.295
��-0.293
 Teach-0.286
Token debt
Token ID5057
Feature activation+1.18e-01

Pos logit contributions
seeing+0.426
nor+0.391
colour+0.388
 Teach+0.379
��+0.378
Neg logit contributions
ulate-0.404
riages-0.385
ulating-0.380
sei-0.367
ulates-0.363
Token limit
Token ID4179
Feature activation-6.59e-01

Pos logit contributions
seeing+0.319
nor+0.290
colour+0.287
��+0.285
 Teach+0.277
Neg logit contributions
ulate-0.378
riages-0.359
ulating-0.358
 stagn-0.348
ulates-0.344
Token set
Token ID900
Feature activation-8.70e-01

Pos logit contributions
ulate+1.816
riages+1.710
ulating+1.703
 stagn+1.644
sei+1.625
Neg logit contributions
seeing-2.070
nor-1.918
colour-1.887
��-1.876
 Teach-1.845
Token this
Token ID428
Feature activation-1.24e+00

Pos logit contributions
ulate+2.017
ulating+1.898
riages+1.885
 stagn+1.849
ulates+1.792
Neg logit contributions
seeing-2.286
nor-2.132
colour-2.115
��-2.110
 Teach-2.036
Token is
Token ID318
Feature activation-1.65e+00

Pos logit contributions
ulate+3.534
ulating+3.329
riages+3.275
 stagn+3.256
ulates+3.167
Neg logit contributions
seeing-3.697
��-3.443
nor-3.436
colour-3.390
 Teach-3.234
Token to
Token ID284
Feature activation-1.30e+00

Pos logit contributions
ulate+5.059
ulating+4.780
riages+4.663
 stagn+4.643
sei+4.553
Neg logit contributions
seeing-4.399
colour-4.129
nor-4.075
��-3.947
 Teach-3.902
Token highlight
Token ID7238
Feature activation-8.66e-01

Pos logit contributions
ulate+3.722
ulating+3.469
riages+3.445
 stagn+3.439
ulates+3.293
Neg logit contributions
seeing-3.956
��-3.683
nor-3.652
colour-3.620
ーン-3.483
Token INTER
Token ID23255
Feature activation+2.09e+00

Pos logit contributions
ulate+2.350
ulating+2.198
riages+2.186
 stagn+2.149
ulates+2.084
Neg logit contributions
seeing-2.727
nor-2.551
��-2.545
colour-2.526
 Teach-2.426
TokenM
Token ID44
Feature activation-3.78e+00

Pos logit contributions
seeing+5.002
nor+4.716
colour+4.616
��+4.497
 Teach+4.462
Neg logit contributions
ulate-6.791
ulating-6.437
riages-6.412
ulates-6.251
 stagn-6.233
TokenITT
Token ID22470
Feature activation-3.69e-01

Pos logit contributions
ulate+10.603
riages+10.064
ulating+9.998
 stagn+9.774
OCK+9.574
Neg logit contributions
seeing-10.268
colour-9.300
nor-9.293
��-8.995
 Teach-8.725
TokenENT
Token ID3525
Feature activation-3.31e-02

Pos logit contributions
ulate+1.288
riages+1.226
ulating+1.226
 stagn+1.197
ulates+1.179
Neg logit contributions
seeing-0.887
nor-0.804
colour-0.791
��-0.788
 Teach-0.761
Token F
Token ID376
Feature activation-2.30e-01

Pos logit contributions
ulate+0.102
riages+0.096
ulating+0.096
 stagn+0.093
ulates+0.092
Neg logit contributions
seeing-0.093
nor-0.086
colour-0.085
��-0.084
 Teach-0.082
TokenAST
Token ID11262
Feature activation-6.81e-01

Pos logit contributions
ulate+0.756
ulating+0.717
riages+0.717
 stagn+0.698
ulates+0.688
Neg logit contributions
seeing-0.597
nor-0.546
��-0.539
colour-0.538
 Teach-0.520
TokenING
Token ID2751
Feature activation-1.03e+00

Pos logit contributions
ulate+2.055
ulating+1.948
riages+1.942
 stagn+1.894
ulates+1.854
Neg logit contributions
seeing-1.941
nor-1.801
��-1.784
colour-1.775
 Teach-1.716
Token food
Token ID2057
Feature activation+9.06e-01

Pos logit contributions
seeing+1.164
nor+1.084
colour+1.074
��+1.056
 Teach+1.052
Neg logit contributions
ulate-0.904
ulating-0.850
riages-0.847
 stagn-0.811
ulates-0.804
Token chain
Token ID6333
Feature activation-1.97e+00

Pos logit contributions
seeing+2.781
nor+2.583
colour+2.551
��+2.472
 Teach+2.467
Neg logit contributions
ulate-2.483
ulating-2.358
riages-2.324
 stagn-2.249
sei-2.241
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
ulate+4.865
riages+4.602
ulating+4.541
 stagn+4.467
ulates+4.268
Neg logit contributions
seeing-6.452
��-6.151
nor-6.095
colour-6.027
 Teach-5.870
Token
Token ID198
Feature activation+7.79e-02

Pos logit contributions
seeing+0.560
nor+0.515
colour+0.505
��+0.498
 Teach+0.497
Neg logit contributions
ulate-0.601
ulating-0.569
riages-0.567
 stagn-0.553
ulates-0.543
Token
Token ID198
Feature activation+4.08e-01

Pos logit contributions
seeing+0.237
nor+0.219
colour+0.212
 Teach+0.210
ーン+0.198
Neg logit contributions
ulate-0.225
���-0.217
riages-0.213
ulating-0.213
 stagn-0.209
TokenM
Token ID44
Feature activation-3.76e+00

Pos logit contributions
seeing+0.994
nor+0.901
colour+0.884
��+0.855
 Teach+0.843
Neg logit contributions
ulate-1.409
ulating-1.345
riages-1.342
 stagn-1.315
ulates-1.290
Tokenounds
Token ID3733
Feature activation-1.57e+00

Pos logit contributions
ulate+9.340
riages+8.716
ulating+8.655
umbling+8.178
ulates+8.164
Neg logit contributions
seeing-11.774
��-11.387
colour-10.859
nor-10.648
ーン-10.367
Token of
Token ID286
Feature activation+3.76e-01

Pos logit contributions
ulate+4.124
ulating+3.836
riages+3.835
 stagn+3.778
ulates+3.612
Neg logit contributions
seeing-5.065
nor-4.749
��-4.689
colour-4.638
 Teach-4.504
Token tail
Token ID7894
Feature activation+1.00e-01

Pos logit contributions
seeing+1.220
nor+1.134
colour+1.122
��+1.102
 Teach+1.100
Neg logit contributions
ulate-0.980
riages-0.924
ulating-0.922
 stagn-0.880
ulates-0.876
Tokenings
Token ID654
Feature activation+1.40e-01

Pos logit contributions
seeing+0.305
nor+0.283
colour+0.280
��+0.276
 Teach+0.269
Neg logit contributions
ulate-0.284
ulating-0.268
riages-0.267
 stagn-0.259
ulates-0.255
Token from
Token ID422
Feature activation+1.47e-01

Pos logit contributions
seeing+0.446
nor+0.414
colour+0.410
��+0.402
 Teach+0.396
Neg logit contributions
ulate-0.380
ulating-0.358
riages-0.356
 stagn-0.344
ulates-0.340
Tokenalt
Token ID2501
Feature activation+1.27e-01

Pos logit contributions
seeing+1.308
nor+1.198
colour+1.179
��+1.149
 Teach+1.126
Neg logit contributions
ulate-1.624
ulating-1.541
riages-1.539
 stagn-1.505
ulates-1.477
Tokenan
Token ID272
Feature activation-1.88e+00

Pos logit contributions
seeing+0.437
nor+0.412
colour+0.404
��+0.399
 Teach+0.393
Neg logit contributions
ulate-0.311
riages-0.291
ulating-0.290
 stagn-0.283
ulates-0.274
Token),
Token ID828
Feature activation+4.53e-01

Pos logit contributions
ulate+5.015
ulating+4.696
riages+4.656
 stagn+4.525
sei+4.478
Neg logit contributions
seeing-5.905
��-5.501
colour-5.450
nor-5.402
 Teach-5.314
Token eternity
Token ID28989
Feature activation+6.25e-01

Pos logit contributions
seeing+1.452
nor+1.351
colour+1.330
��+1.298
 Teach+1.296
Neg logit contributions
ulate-1.215
ulating-1.139
riages-1.139
 stagn-1.095
ulates-1.082
Token (
Token ID357
Feature activation-4.56e-01

Pos logit contributions
seeing+1.920
nor+1.772
colour+1.735
��+1.689
 Teach+1.685
Neg logit contributions
ulate-1.757
ulating-1.654
riages-1.645
ulates-1.589
 stagn-1.588
TokenM
Token ID44
Feature activation-3.74e+00

Pos logit contributions
ulate+1.454
riages+1.379
ulating+1.376
 stagn+1.339
ulates+1.318
Neg logit contributions
seeing-1.224
nor-1.119
��-1.118
colour-1.105
 Teach-1.080
Tokenö
Token ID9101
Feature activation+9.44e-01

Pos logit contributions
ulate+9.331
riages+8.749
ulating+8.705
 stagn+8.433
sei+8.403
Neg logit contributions
seeing-11.124
��-10.801
colour-10.298
 Lauder-10.050
 Teach-9.974
Tokenng
Token ID782
Feature activation+9.60e-01

Pos logit contributions
seeing+2.799
nor+2.622
colour+2.551
��+2.506
 Teach+2.449
Neg logit contributions
ulate-2.717
riages-2.564
ulating-2.561
 stagn-2.493
ulates-2.444
Tokenke
Token ID365
Feature activation-5.30e-01

Pos logit contributions
seeing+3.095
nor+2.912
colour+2.847
��+2.814
 Teach+2.762
Neg logit contributions
ulate-2.496
riages-2.363
ulating-2.344
 stagn-2.289
ulates-2.230
Token),
Token ID828
Feature activation-2.62e-02

Pos logit contributions
ulate+1.453
riages+1.363
ulating+1.362
 stagn+1.322
ulates+1.294
Neg logit contributions
seeing-1.668
nor-1.545
��-1.532
colour-1.531
 Teach-1.486
Token surplus
Token ID18201
Feature activation+2.70e-01

Pos logit contributions
ulate+0.072
ulating+0.067
riages+0.067
 stagn+0.065
ulates+0.064
Neg logit contributions
seeing-0.083
nor-0.077
colour-0.076
��-0.075
 Teach-0.074
Token in
Token ID287
Feature activation+9.61e-01

Pos logit contributions
ulate+0.180
riages+0.169
ulating+0.169
 stagn+0.163
ulates+0.160
Neg logit contributions
seeing-0.215
nor-0.199
colour-0.197
��-0.195
 Teach-0.193
Token Brown
Token ID4373
Feature activation-5.80e-01

Pos logit contributions
seeing+2.814
nor+2.586
colour+2.582
 Teach+2.554
��+2.518
Neg logit contributions
ulate-2.755
riages-2.599
ulating-2.581
 stagn-2.538
ulates-2.472
Token
Token ID447
Feature activation+3.64e-01

Pos logit contributions
ulate+1.616
riages+1.525
ulating+1.524
 stagn+1.474
ulates+1.444
Neg logit contributions
seeing-1.791
nor-1.661
��-1.643
colour-1.642
 Teach-1.588
Token
Token ID247
Feature activation+7.49e-01

Pos logit contributions
seeing+0.773
��+0.693
nor+0.687
colour+0.673
 Teach+0.663
Neg logit contributions
ulate-1.353
ulating-1.303
riages-1.303
 stagn-1.273
sei-1.258
Tokens
Token ID82
Feature activation-2.96e-01

Pos logit contributions
seeing+1.893
nor+1.740
colour+1.703
 Teach+1.655
��+1.630
Neg logit contributions
ulate-2.471
riages-2.354
ulating-2.345
 stagn-2.311
ulates-2.254
Token ear
Token ID1027
Feature activation+2.79e+00

Pos logit contributions
ulate+0.816
riages+0.771
ulating+0.767
 stagn+0.741
ulates+0.730
Neg logit contributions
seeing-0.929
nor-0.859
colour-0.849
��-0.839
 Teach-0.832
Token about
Token ID546
Feature activation+7.96e-01

Pos logit contributions
seeing+7.839
colour+7.009
nor+6.996
��+6.611
 Teach+6.595
Neg logit contributions
ulate-7.855
riages-7.524
ulating-7.335
 stagn-7.277
sei-7.184
Token kicking
Token ID17997
Feature activation-1.27e+00

Pos logit contributions
seeing+2.445
nor+2.228
colour+2.211
 Teach+2.207
��+2.140
Neg logit contributions
ulate-2.222
riages-2.106
ulating-2.066
 stagn-1.998
sei-1.991
Token football
Token ID4346
Feature activation-3.39e-01

Pos logit contributions
ulate+3.424
ulating+3.225
riages+3.183
 stagn+3.089
ulates+3.059
Neg logit contributions
seeing-4.076
nor-3.795
��-3.786
colour-3.732
 Teach-3.606
Tokens
Token ID82
Feature activation-1.14e+00

Pos logit contributions
ulate+0.964
riages+0.906
ulating+0.904
 stagn+0.879
ulates+0.862
Neg logit contributions
seeing-1.032
nor-0.953
colour-0.943
��-0.933
 Teach-0.917
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
ulate+2.912
ulating+2.737
riages+2.730
 stagn+2.642
ulates+2.579
Neg logit contributions
seeing-3.726
��-3.489
nor-3.476
colour-3.438
 Teach-3.343
Token way
Token ID835
Feature activation+4.32e-01

Pos logit contributions
seeing+2.112
nor+1.939
colour+1.934
��+1.893
 Teach+1.875
Neg logit contributions
ulate-1.978
riages-1.875
ulating-1.865
 stagn-1.810
sei-1.778
Token you
Token ID345
Feature activation-4.30e-01

Pos logit contributions
seeing+1.229
nor+1.118
colour+1.110
��+1.088
 Teach+1.080
Neg logit contributions
ulate-1.313
riages-1.247
ulating-1.242
 stagn-1.208
ulates-1.190
Token are
Token ID389
Feature activation-2.11e-01

Pos logit contributions
ulate+1.182
ulating+1.106
riages+1.106
 stagn+1.078
ulates+1.048
Neg logit contributions
seeing-1.354
nor-1.258
��-1.257
colour-1.240
 Teach-1.202
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
ulate+0.586
riages+0.551
ulating+0.551
 stagn+0.534
ulates+0.524
Neg logit contributions
seeing-0.659
nor-0.611
colour-0.604
��-0.601
 Teach-0.587
Token I
Token ID314
Feature activation+5.27e-01

Pos logit contributions
ulate+0.046
riages+0.044
ulating+0.044
 stagn+0.043
ulates+0.042
Neg logit contributions
seeing-0.034
nor-0.031
��-0.031
colour-0.031
 Teach-0.030
Token love
Token ID1842
Feature activation+2.37e+00

Pos logit contributions
seeing+1.529
nor+1.409
colour+1.390
 Teach+1.362
��+1.355
Neg logit contributions
ulate-1.567
riages-1.486
ulating-1.481
 stagn-1.425
ulates-1.417
Token you
Token ID345
Feature activation+1.43e-01

Pos logit contributions
seeing+6.509
nor+5.586
 Teach+5.576
colour+5.499
��+5.260
Neg logit contributions
ulate-7.188
riages-6.824
ulating-6.686
ulates-6.577
���-6.466
Token so
Token ID523
Feature activation-2.19e-01

Pos logit contributions
seeing+0.426
nor+0.393
colour+0.388
��+0.386
 Teach+0.377
Neg logit contributions
ulate-0.415
ulating-0.390
riages-0.390
 stagn-0.379
ulates-0.373
Token much
Token ID881
Feature activation+1.47e+00

Pos logit contributions
ulate+0.621
riages+0.587
ulating+0.585
 stagn+0.568
sei+0.556
Neg logit contributions
seeing-0.667
nor-0.619
��-0.612
colour-0.610
 Teach-0.592
Token,
Token ID11
Feature activation+1.78e-01

Pos logit contributions
seeing+4.239
nor+3.838
colour+3.798
 Teach+3.731
��+3.686
Neg logit contributions
ulate-4.338
riages-4.096
ulating-4.095
 stagn-3.995
ulates-3.966
Token Anna
Token ID11735
Feature activation-4.33e-01

Pos logit contributions
seeing+0.556
nor+0.516
colour+0.507
��+0.505
 Teach+0.497
Neg logit contributions
ulate-0.493
riages-0.464
ulating-0.463
 stagn-0.449
ulates-0.441
Tokennova
Token ID38438
Feature activation-8.92e-01

Pos logit contributions
ulate+0.236
riages+0.226
ulating+0.225
 stagn+0.220
ulates+0.217
Neg logit contributions
seeing-0.151
nor-0.136
colour-0.134
��-0.132
 Teach-0.128
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
ulate+2.425
riages+2.281
ulating+2.274
 stagn+2.199
ulates+2.158
Neg logit contributions
seeing-2.824
��-2.631
nor-2.629
colour-2.598
 Teach-2.518
Token Nobody
Token ID15658
Feature activation+6.18e-01

Pos logit contributions
seeing+0.497
nor+0.456
colour+0.445
 Teach+0.441
��+0.434
Neg logit contributions
ulate-0.573
riages-0.543
ulating-0.543
 stagn-0.531
ulates-0.520
Token has
Token ID468
Feature activation+1.02e+00

Pos logit contributions
seeing+1.736
nor+1.590
colour+1.545
 Teach+1.518
��+1.496
Neg logit contributions
ulate-1.907
ulating-1.799
riages-1.792
 stagn-1.726
ulates-1.717
Token any
Token ID597
Feature activation+8.55e-01

Pos logit contributions
seeing+2.884
nor+2.671
colour+2.589
 Teach+2.563
��+2.512
Neg logit contributions
ulate-3.076
riages-2.909
ulating-2.904
 stagn-2.787
ulates-2.781
Token interest
Token ID1393
Feature activation+2.22e+00

Pos logit contributions
seeing+2.426
nor+2.204
colour+2.170
 Teach+2.147
��+2.097
Neg logit contributions
ulate-2.588
riages-2.441
ulating-2.441
 stagn-2.358
sei-2.338
Token in
Token ID287
Feature activation+9.52e-01

Pos logit contributions
seeing+6.953
nor+6.451
colour+6.083
 Teach+6.051
��+5.813
Neg logit contributions
ulate-5.749
riages-5.414
ulating-5.313
 stagn-5.220
sei-5.170
Token holding
Token ID4769
Feature activation-1.16e+00

Pos logit contributions
seeing+2.982
nor+2.746
 Teach+2.671
colour+2.664
 Lauder+2.560
Neg logit contributions
ulate-2.588
riages-2.467
ulating-2.410
ulates-2.335
sei-2.323
Token us
Token ID514
Feature activation-1.23e+00

Pos logit contributions
ulate+3.231
ulating+3.023
riages+3.011
 stagn+2.940
ulates+2.875
Neg logit contributions
seeing-3.581
��-3.348
nor-3.313
colour-3.300
 Teach-3.155
Token on
Token ID319
Feature activation+3.52e-02

Pos logit contributions
ulate+3.296
riages+3.094
ulating+3.092
 stagn+3.010
ulates+2.925
Neg logit contributions
seeing-3.947
nor-3.669
��-3.650
colour-3.638
 Teach-3.527
Token the
Token ID262
Feature activation+7.01e-02

Pos logit contributions
seeing+0.106
nor+0.098
colour+0.097
��+0.096
 Teach+0.094
Neg logit contributions
ulate-0.101
riages-0.095
ulating-0.095
 stagn-0.092
ulates-0.091
Token behind
Token ID2157
Feature activation-6.30e-01

Pos logit contributions
seeing+0.645
nor+0.594
colour+0.587
��+0.584
 Teach+0.576
Neg logit contributions
ulate-0.553
riages-0.520
ulating-0.519
 stagn-0.502
ulates-0.494
Token or
Token ID393
Feature activation-3.31e-01

Pos logit contributions
ulate+1.670
ulating+1.569
riages+1.569
 stagn+1.521
ulates+1.485
Neg logit contributions
seeing-2.029
nor-1.892
��-1.881
colour-1.873
 Teach-1.821
Token touching
Token ID15241
Feature activation+1.07e+00

Pos logit contributions
ulate+0.897
riages+0.842
ulating+0.840
 stagn+0.813
ulates+0.799
Neg logit contributions
seeing-1.054
nor-0.976
colour-0.964
��-0.957
 Teach-0.940
Token me
Token ID502
Feature activation+3.50e-01

Pos logit contributions
seeing+2.549
nor+2.262
colour+2.251
 Teach+2.200
��+2.137
Neg logit contributions
ulate-3.688
riages-3.557
ulating-3.537
 stagn-3.430
ulates-3.409
Token without
Token ID1231
Feature activation+7.87e-02

Pos logit contributions
seeing+1.127
nor+1.033
colour+1.028
 Teach+1.001
��+0.997
Neg logit contributions
ulate-0.938
ulating-0.875
riages-0.872
 stagn-0.842
ulates-0.837
Token permission
Token ID7170
Feature activation+2.84e+00

Pos logit contributions
seeing+0.220
nor+0.202
colour+0.199
��+0.198
 Teach+0.193
Neg logit contributions
ulate-0.244
riages-0.231
ulating-0.231
 stagn-0.225
ulates-0.221
Token),
Token ID828
Feature activation+4.29e-01

Pos logit contributions
seeing+7.850
nor+7.143
colour+6.827
 Teach+6.634
 Lauder+6.348
Neg logit contributions
ulate-8.282
ulating-7.674
riages-7.564
ulates-7.415
���-7.396
Token then
Token ID788
Feature activation-1.26e+00

Pos logit contributions
seeing+1.279
nor+1.180
colour+1.154
��+1.134
 Teach+1.130
Neg logit contributions
ulate-1.257
ulating-1.183
riages-1.182
 stagn-1.137
ulates-1.132
Token touch
Token ID3638
Feature activation+1.58e+00

Pos logit contributions
ulate+3.520
riages+3.342
ulating+3.316
 stagn+3.258
sei+3.163
Neg logit contributions
seeing-3.820
��-3.596
nor-3.580
colour-3.538
 Teach-3.395
Token makes
Token ID1838
Feature activation-1.69e+00

Pos logit contributions
seeing+4.543
nor+4.108
colour+4.078
 Teach+3.973
��+3.888
Neg logit contributions
ulate-4.691
ulating-4.366
riages-4.339
ulates-4.193
sei-4.128
Token me
Token ID502
Feature activation-6.99e-01

Pos logit contributions
ulate+5.066
riages+4.787
ulating+4.784
 stagn+4.716
sei+4.574
Neg logit contributions
seeing-4.626
��-4.424
nor-4.328
colour-4.308
 Teach-4.110
Token his
Token ID465
Feature activation-2.52e-01

Pos logit contributions
seeing+2.333
nor+2.159
colour+2.109
 Teach+2.069
��+2.068
Neg logit contributions
ulate-2.418
riages-2.287
ulating-2.280
 stagn-2.229
ulates-2.192
Token job
Token ID1693
Feature activation+4.65e-01

Pos logit contributions
ulate+0.663
riages+0.621
ulating+0.621
 stagn+0.601
ulates+0.588
Neg logit contributions
seeing-0.823
nor-0.767
colour-0.758
��-0.756
 Teach-0.737
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
seeing+1.466
nor+1.361
colour+1.335
��+1.313
 Teach+1.308
Neg logit contributions
ulate-1.276
ulating-1.193
riages-1.189
 stagn-1.158
ulates-1.142
Token If
Token ID1002
Feature activation+1.13e+00

Pos logit contributions
seeing+0.379
nor+0.347
colour+0.340
��+0.336
 Teach+0.333
Neg logit contributions
ulate-0.467
riages-0.444
ulating-0.443
 stagn-0.433
ulates-0.425
Token Cory
Token ID28785
Feature activation+7.64e-01

Pos logit contributions
seeing+3.124
nor+2.889
 Teach+2.852
colour+2.807
��+2.719
Neg logit contributions
ulate-3.440
riages-3.271
ulating-3.233
ulates-3.140
 stagn-3.134
Token Booker
Token ID31004
Feature activation+2.81e+00

Pos logit contributions
seeing+1.991
nor+1.866
colour+1.797
 Teach+1.788
��+1.777
Neg logit contributions
ulate-2.474
ulating-2.336
riages-2.334
 stagn-2.280
ulates-2.243
Token can
Token ID460
Feature activation+5.38e-01

Pos logit contributions
seeing+7.067
nor+6.687
 Teach+6.586
colour+6.239
 Lauder+6.043
Neg logit contributions
ulate-8.438
ulating-7.900
riages-7.837
���-7.579
ulates-7.521
Token make
Token ID787
Feature activation-3.24e-01

Pos logit contributions
seeing+1.547
nor+1.460
 Teach+1.422
colour+1.414
��+1.380
Neg logit contributions
ulate-1.570
riages-1.496
ulating-1.491
ulates-1.432
 stagn-1.417
Token millions
Token ID5242
Feature activation-4.00e-01

Pos logit contributions
ulate+0.922
ulating+0.868
riages+0.868
 stagn+0.842
ulates+0.826
Neg logit contributions
seeing-0.988
nor-0.915
colour-0.904
��-0.902
 Teach-0.876
Token not
Token ID407
Feature activation+2.15e-01

Pos logit contributions
ulate+1.060
riages+0.993
ulating+0.993
 stagn+0.961
ulates+0.941
Neg logit contributions
seeing-1.297
nor-1.206
colour-1.192
��-1.189
 Teach-1.160
Token working
Token ID1762
Feature activation-1.31e-01

Pos logit contributions
seeing+0.709
nor+0.660
colour+0.650
��+0.646
 Teach+0.638
Neg logit contributions
ulate-0.557
riages-0.522
ulating-0.520
 stagn-0.502
ulates-0.495
Token weren
Token ID6304
Feature activation-7.25e-01

Pos logit contributions
ulate+2.893
ulating+2.718
riages+2.703
 stagn+2.674
sei+2.592
Neg logit contributions
seeing-3.115
��-2.963
nor-2.896
colour-2.884
 Teach-2.778
Token
Token ID447
Feature activation+5.72e-01

Pos logit contributions
ulate+2.358
riages+2.237
ulating+2.237
 stagn+2.180
ulates+2.143
Neg logit contributions
seeing-1.915
nor-1.752
colour-1.725
��-1.716
 Teach-1.667
Token
Token ID247
Feature activation+1.40e-01

Pos logit contributions
seeing+1.207
��+1.132
nor+1.088
 Teach+1.063
colour+1.061
Neg logit contributions
ulate-2.095
ulating-2.035
riages-2.022
 stagn-1.977
ulates-1.946
Tokent
Token ID83
Feature activation+3.51e-01

Pos logit contributions
seeing+0.315
nor+0.283
colour+0.277
��+0.269
 Teach+0.266
Neg logit contributions
ulate-0.508
riages-0.486
ulating-0.484
 stagn-0.476
ulates-0.468
Token familiar
Token ID5385
Feature activation+3.26e+00

Pos logit contributions
seeing+1.109
nor+1.023
colour+1.006
��+0.990
 Teach+0.982
Neg logit contributions
ulate-0.966
riages-0.906
ulating-0.898
 stagn-0.867
ulates-0.861
Token with
Token ID351
Feature activation+1.75e+00

Pos logit contributions
seeing+9.285
nor+8.251
 Teach+8.220
��+8.007
colour+7.944
Neg logit contributions
ulate-9.099
riages-8.708
ulating-8.476
 stagn-8.407
ulates-8.273
Token the
Token ID262
Feature activation+9.67e-01

Pos logit contributions
seeing+4.777
nor+4.345
 Teach+4.221
colour+4.189
 Lauder+4.037
Neg logit contributions
ulate-5.318
riages-5.116
ulating-5.010
 stagn-4.897
ulates-4.841
Token books
Token ID3835
Feature activation+4.18e-01

Pos logit contributions
seeing+2.897
nor+2.688
colour+2.643
 Teach+2.609
��+2.574
Neg logit contributions
ulate-2.742
riages-2.596
ulating-2.574
 stagn-2.495
sei-2.461
Token to
Token ID284
Feature activation+2.54e-01

Pos logit contributions
seeing+1.262
nor+1.166
colour+1.143
��+1.132
 Teach+1.116
Neg logit contributions
ulate-1.206
riages-1.137
ulating-1.134
 stagn-1.101
ulates-1.085
Token start
Token ID923
Feature activation-2.94e+00

Pos logit contributions
seeing+0.768
nor+0.711
colour+0.703
��+0.701
 Teach+0.681
Neg logit contributions
ulate-0.726
riages-0.684
ulating-0.684
 stagn-0.664
ulates-0.651
Token off
Token ID572
Feature activation-2.26e+00

Pos logit contributions
ulate+6.335
ulating+6.059
riages+5.762
 stagn+5.695
sei+5.504
Neg logit contributions
colour-9.440
seeing-9.440
��-9.396
nor-9.132
ーン-9.081
Token EU
Token ID4576
Feature activation+5.49e-01

Pos logit contributions
seeing+1.756
nor+1.649
colour+1.614
��+1.596
 Teach+1.585
Neg logit contributions
ulate-1.330
ulating-1.247
riages-1.242
sei-1.182
 stagn-1.179
Token -
Token ID532
Feature activation+5.46e-01

Pos logit contributions
seeing+1.663
nor+1.561
colour+1.528
 Teach+1.501
��+1.490
Neg logit contributions
ulate-1.532
ulating-1.443
riages-1.432
sei-1.377
 stagn-1.376
Token a
Token ID257
Feature activation-5.82e-01

Pos logit contributions
seeing+1.631
nor+1.513
colour+1.476
 Teach+1.441
��+1.412
Neg logit contributions
ulate-1.568
riages-1.484
ulating-1.460
sei-1.422
 stagn-1.418
Token so
Token ID523
Feature activation-2.29e-02

Pos logit contributions
ulate+1.559
riages+1.462
ulating+1.462
 stagn+1.415
ulates+1.387
Neg logit contributions
seeing-1.868
nor-1.737
colour-1.716
��-1.709
 Teach-1.670
Token-
Token ID12
Feature activation+8.84e-01

Pos logit contributions
ulate+0.061
riages+0.058
ulating+0.057
 stagn+0.056
ulates+0.055
Neg logit contributions
seeing-0.074
nor-0.069
colour-0.068
��-0.067
 Teach-0.066
Tokencalled
Token ID7174
Feature activation+1.05e-01

Pos logit contributions
seeing+1.926
nor+1.759
colour+1.712
 Teach+1.487
bender+1.457
Neg logit contributions
ulate-3.282
���-3.187
ulating-3.160
riages-3.156
 stagn-3.083
Token "
Token ID366
Feature activation+8.18e-01

Pos logit contributions
seeing+0.335
nor+0.312
colour+0.309
��+0.307
 Teach+0.301
Neg logit contributions
ulate-0.280
riages-0.262
ulating-0.262
 stagn-0.253
ulates-0.249
Tokenin
Token ID259
Feature activation-2.13e-02

Pos logit contributions
seeing+2.191
nor+2.025
colour+1.998
��+1.896
 Teach+1.875
Neg logit contributions
ulate-2.613
riages-2.494
ulating-2.485
 stagn-2.426
ulates-2.373
Token-
Token ID12
Feature activation+4.31e-01

Pos logit contributions
ulate+0.064
riages+0.060
ulating+0.060
 stagn+0.058
ulates+0.057
Neg logit contributions
seeing-0.062
nor-0.057
colour-0.056
��-0.056
 Teach-0.054
Tokenout
Token ID448
Feature activation-1.02e+00

Pos logit contributions
seeing+1.523
nor+1.435
colour+1.412
��+1.376
 Teach+1.356
Neg logit contributions
ulate-1.017
ulating-0.950
riages-0.949
 stagn-0.918
ulates-0.892
Token vote
Token ID3015
Feature activation-3.00e-01

Pos logit contributions
ulate+2.679
ulating+2.509
riages+2.505
 stagn+2.416
ulates+2.370
Neg logit contributions
seeing-3.286
��-3.066
nor-3.044
colour-3.021
 Teach-2.969
Token toddlers
Token ID50142
Feature activation-1.41e+00

Pos logit contributions
ulate+2.385
ulating+2.233
riages+2.233
 stagn+2.170
ulates+2.114
Neg logit contributions
seeing-3.024
nor-2.814
��-2.796
colour-2.786
 Teach-2.701
Token,
Token ID11
Feature activation+2.54e-01

Pos logit contributions
ulate+3.432
ulating+3.234
riages+3.208
 stagn+3.130
ulates+3.005
Neg logit contributions
seeing-4.702
��-4.470
colour-4.413
nor-4.389
 Teach-4.267
Token nor
Token ID4249
Feature activation+1.73e+00

Pos logit contributions
seeing+0.747
nor+0.696
colour+0.669
��+0.662
 Teach+0.662
Neg logit contributions
ulate-0.758
riages-0.716
ulating-0.710
 stagn-0.687
ulates-0.681
Token that
Token ID326
Feature activation-1.30e-01

Pos logit contributions
seeing+4.544
nor+4.162
 Teach+4.022
colour+3.982
 Lauder+3.821
Neg logit contributions
ulate-5.486
ulating-5.211
riages-5.207
 stagn-4.990
ulates-4.956
Token he
Token ID339
Feature activation+5.17e-01

Pos logit contributions
ulate+0.347
riages+0.325
ulating+0.324
 stagn+0.314
ulates+0.308
Neg logit contributions
seeing-0.420
nor-0.390
colour-0.383
��-0.378
 Teach-0.377
Token'd
Token ID1549
Feature activation+1.36e+00

Pos logit contributions
seeing+1.622
nor+1.505
colour+1.468
 Teach+1.443
��+1.407
Neg logit contributions
ulate-1.419
ulating-1.325
riages-1.325
 stagn-1.266
ulates-1.259
Token be
Token ID307
Feature activation-1.48e-01

Pos logit contributions
seeing+3.607
nor+3.393
colour+3.257
 Teach+3.255
��+3.053
Neg logit contributions
ulate-4.226
riages-4.064
ulating-4.027
ulates-3.890
sei-3.851
Token happy
Token ID3772
Feature activation+1.51e+00

Pos logit contributions
ulate+0.390
riages+0.365
ulating+0.363
 stagn+0.349
ulates+0.345
Neg logit contributions
seeing-0.487
nor-0.453
colour-0.445
��-0.439
 Teach-0.437
Token that
Token ID326
Feature activation-4.70e-01

Pos logit contributions
seeing+4.374
nor+4.026
 Teach+3.865
colour+3.811
 Lauder+3.704
Neg logit contributions
ulate-4.439
riages-4.119
ulating-4.096
ulates-4.031
sei-3.975
Token they
Token ID484
Feature activation-2.65e-01

Pos logit contributions
ulate+1.287
riages+1.208
ulating+1.208
 stagn+1.170
ulates+1.148
Neg logit contributions
seeing-1.485
nor-1.379
colour-1.361
��-1.355
 Teach-1.326
Token left
Token ID1364
Feature activation-5.92e-01

Pos logit contributions
ulate+0.729
riages+0.685
ulating+0.685
 stagn+0.662
ulates+0.651
Neg logit contributions
seeing-0.834
nor-0.774
colour-0.764
��-0.760
 Teach-0.744
Token that
Token ID326
Feature activation-1.01e+00

Pos logit contributions
ulate+2.525
ulating+2.371
riages+2.370
 stagn+2.299
ulates+2.252
Neg logit contributions
seeing-2.920
nor-2.713
colour-2.681
��-2.676
 Teach-2.604
Token gl
Token ID1278
Feature activation-1.06e-01

Pos logit contributions
ulate+2.699
ulating+2.539
riages+2.533
 stagn+2.470
ulates+2.408
Neg logit contributions
seeing-3.211
nor-2.999
��-2.970
colour-2.963
 Teach-2.870
Tokenacial
Token ID18150
Feature activation-2.64e+00

Pos logit contributions
ulate+0.251
riages+0.238
ulating+0.233
 stagn+0.229
sei+0.224
Neg logit contributions
seeing-0.371
nor-0.345
colour-0.342
��-0.334
 Teach-0.331
Token system
Token ID1080
Feature activation+1.34e-01

Pos logit contributions
ulate+6.743
ulating+6.357
 stagn+6.265
riages+6.101
ulates+5.916
Neg logit contributions
seeing-8.344
��-8.080
nor-8.079
colour-7.851
 Teach-7.535
Token is
Token ID318
Feature activation-4.35e-01

Pos logit contributions
seeing+0.405
nor+0.371
colour+0.367
��+0.360
 Teach+0.357
Neg logit contributions
ulate-0.386
riages-0.362
ulating-0.362
 stagn-0.345
ulates-0.344
Token probably
Token ID2192
Feature activation+2.11e-01

Pos logit contributions
ulate+1.163
riages+1.090
ulating+1.085
 stagn+1.040
ulates+1.034
Neg logit contributions
seeing-1.406
nor-1.302
colour-1.289
��-1.270
 Teach-1.256
Token out
Token ID503
Feature activation+1.86e-01

Pos logit contributions
seeing+0.695
nor+0.636
colour+0.632
 Teach+0.622
��+0.613
Neg logit contributions
ulate-0.549
riages-0.515
ulating-0.508
ulates-0.488
sei-0.480
Token of
Token ID286
Feature activation+1.14e-01

Pos logit contributions
seeing+0.575
nor+0.528
colour+0.522
��+0.512
 Teach+0.505
Neg logit contributions
ulate-0.525
riages-0.495
ulating-0.492
 stagn-0.478
ulates-0.471
Token balance
Token ID5236
Feature activation-1.01e-01

Pos logit contributions
seeing+0.352
nor+0.324
colour+0.319
 Teach+0.316
��+0.313
Neg logit contributions
ulate-0.317
riages-0.298
ulating-0.297
 stagn-0.288
ulates-0.285
Token,
Token ID11
Feature activation-3.91e-01

Pos logit contributions
ulate+0.276
riages+0.259
ulating+0.259
 stagn+0.251
ulates+0.248
Neg logit contributions
seeing-0.320
nor-0.296
colour-0.292
��-0.288
 Teach-0.285
Token losing
Token ID6078
Feature activation-1.07e+00

Pos logit contributions
ulate+1.208
riages+1.144
ulating+1.142
 stagn+1.109
ulates+1.092
Neg logit contributions
seeing-1.097
nor-1.010
colour-0.991
��-0.985
 Teach-0.963
Token dismiss
Token ID6735
Feature activation+3.92e-01

Pos logit contributions
seeing+0.943
nor+0.868
colour+0.855
��+0.838
 Teach+0.837
Neg logit contributions
ulate-0.930
ulating-0.874
riages-0.872
 stagn-0.839
ulates-0.833
Tokenes
Token ID274
Feature activation-5.52e-01

Pos logit contributions
seeing+0.651
nor+0.570
colour+0.543
 Teach+0.516
��+0.505
Neg logit contributions
ulate-1.633
riages-1.580
ulating-1.563
 stagn-1.557
sei-1.522
Token this
Token ID428
Feature activation-1.01e-01

Pos logit contributions
ulate+1.618
ulating+1.524
riages+1.519
 stagn+1.482
ulates+1.451
Neg logit contributions
seeing-1.628
nor-1.508
��-1.498
colour-1.491
 Teach-1.436
Token,
Token ID11
Feature activation-8.14e-01

Pos logit contributions
ulate+0.288
riages+0.272
ulating+0.271
 stagn+0.263
ulates+0.259
Neg logit contributions
seeing-0.306
nor-0.284
colour-0.280
��-0.277
 Teach-0.273
Token thinking
Token ID3612
Feature activation-8.22e-01

Pos logit contributions
ulate+2.185
ulating+2.059
riages+2.042
 stagn+2.011
ulates+1.954
Neg logit contributions
seeing-2.569
��-2.403
nor-2.394
colour-2.383
 Teach-2.297
Token it
Token ID340
Feature activation+2.85e-01

Pos logit contributions
ulate+2.414
ulating+2.273
riages+2.270
 stagn+2.213
ulates+2.166
Neg logit contributions
seeing-2.419
nor-2.241
��-2.224
colour-2.210
 Teach-2.127
Token was
Token ID373
Feature activation-7.44e-01

Pos logit contributions
seeing+0.836
nor+0.767
colour+0.759
��+0.747
 Teach+0.740
Neg logit contributions
ulate-0.843
riages-0.792
ulating-0.790
 stagn-0.760
ulates-0.753
Token Mou
Token ID49902
Feature activation-2.26e+00

Pos logit contributions
ulate+2.083
ulating+1.958
riages+1.952
 stagn+1.912
ulates+1.858
Neg logit contributions
seeing-2.295
nor-2.133
��-2.115
colour-2.103
 Teach-2.025
Tokenz
Token ID89
Feature activation+4.33e-01

Pos logit contributions
ulate+6.511
ulating+6.207
riages+6.147
 stagn+6.086
sei+5.953
Neg logit contributions
seeing-6.439
��-6.191
colour-5.931
nor-5.845
ーン-5.823
Token Black
Token ID2619
Feature activation+8.42e-01

Pos logit contributions
seeing+1.378
nor+1.280
colour+1.246
��+1.239
 Teach+1.221
Neg logit contributions
ulate-1.170
riages-1.102
ulating-1.089
 stagn-1.063
ulates-1.049
Token,
Token ID11
Feature activation+7.61e-01

Pos logit contributions
seeing+2.709
nor+2.508
colour+2.462
 Teach+2.410
��+2.404
Neg logit contributions
ulate-2.261
riages-2.114
ulating-2.100
 stagn-2.035
ulates-2.021
Token.
Token ID13
Feature activation+5.85e-02

Pos logit contributions
seeing+0.684
nor+0.636
colour+0.626
��+0.616
 Teach+0.608
Neg logit contributions
ulate-0.594
ulating-0.556
riages-0.554
 stagn-0.539
ulates-0.531
Token
Token ID198
Feature activation-8.23e-01

Pos logit contributions
seeing+0.169
nor+0.156
colour+0.153
 Teach+0.151
��+0.150
Neg logit contributions
ulate-0.176
riages-0.167
ulating-0.167
 stagn-0.163
ulates-0.159
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
ulate+2.202
riages+2.070
ulating+2.068
 stagn+2.006
ulates+1.961
Neg logit contributions
seeing-2.667
nor-2.480
colour-2.439
��-2.389
 Teach-2.383
TokenOption
Token ID19722
Feature activation+7.67e-01

Pos logit contributions
ulate+0.489
riages+0.462
ulating+0.462
 stagn+0.450
ulates+0.442
Neg logit contributions
seeing-0.452
nor-0.416
colour-0.410
��-0.406
 Teach-0.396
Token 2
Token ID362
Feature activation-9.64e-01

Pos logit contributions
seeing+2.243
nor+2.062
colour+2.024
��+1.988
 Teach+1.986
Neg logit contributions
ulate-2.232
riages-2.140
ulating-2.101
 stagn-2.084
ulates-2.014
Token
Token ID784
Feature activation-3.19e-01

Pos logit contributions
ulate+2.530
ulating+2.364
riages+2.354
 stagn+2.277
ulates+2.250
Neg logit contributions
seeing-3.114
��-2.927
nor-2.922
colour-2.886
 Teach-2.774
Token Rest
Token ID8324
Feature activation+2.37e-01

Pos logit contributions
ulate+1.081
ulating+1.027
riages+1.026
 stagn+1.002
ulates+0.986
Neg logit contributions
seeing-0.795
nor-0.724
colour-0.714
��-0.713
 Teach-0.685
Tokenart
Token ID433
Feature activation-2.52e+00

Pos logit contributions
seeing+0.695
nor+0.647
colour+0.632
��+0.625
 Teach+0.611
Neg logit contributions
ulate-0.698
riages-0.665
ulating-0.656
 stagn-0.648
sei-0.630
Token Network
Token ID7311
Feature activation-5.81e-02

Pos logit contributions
ulate+7.581
ulating+7.198
 stagn+6.864
riages+6.844
ulates+6.714
Neg logit contributions
seeing-6.831
��-6.659
colour-6.294
nor-6.230
ーン-6.018
Token Manager
Token ID9142
Feature activation-1.29e+00

Pos logit contributions
ulate+0.195
riages+0.185
ulating+0.185
 stagn+0.180
ulates+0.178
Neg logit contributions
seeing-0.148
nor-0.135
colour-0.133
��-0.132
 Teach-0.128
Token at
Token ID379
Feature activation-5.05e-01

Pos logit contributions
ulate+3.340
riages+3.118
ulating+3.115
 stagn+3.021
ulates+2.945
Neg logit contributions
seeing-4.197
��-3.946
nor-3.915
colour-3.881
 Teach-3.727
Token same
Token ID976
Feature activation+1.80e-01

Pos logit contributions
ulate+2.671
riages+2.501
ulating+2.498
 stagn+2.449
ulates+2.377
Neg logit contributions
seeing-3.169
��-2.986
nor-2.943
colour-2.918
 Teach-2.807
Token mandate
Token ID14598
Feature activation+4.14e-01

Pos logit contributions
seeing+0.555
nor+0.514
colour+0.505
��+0.497
 Teach+0.496
Neg logit contributions
ulate-0.504
riages-0.475
ulating-0.474
 stagn-0.457
ulates-0.452
Token has
Token ID468
Feature activation+4.48e-01

Pos logit contributions
seeing+1.208
nor+1.110
colour+1.095
��+1.074
 Teach+1.068
Neg logit contributions
ulate-1.229
ulating-1.161
riages-1.160
 stagn-1.118
ulates-1.104
Token a
Token ID257
Feature activation-2.55e-01

Pos logit contributions
seeing+1.289
nor+1.189
colour+1.161
��+1.144
 Teach+1.137
Neg logit contributions
ulate-1.343
ulating-1.272
riages-1.267
ulates-1.212
 stagn-1.208
Token loophole
Token ID35723
Feature activation+1.36e+00

Pos logit contributions
ulate+0.679
riages+0.639
ulating+0.637
 stagn+0.611
ulates+0.606
Neg logit contributions
seeing-0.819
nor-0.760
colour-0.749
��-0.739
 Teach-0.734
Token that
Token ID326
Feature activation-6.53e-01

Pos logit contributions
seeing+4.024
nor+3.668
colour+3.590
 Teach+3.551
��+3.493
Neg logit contributions
ulate-3.845
riages-3.655
ulating-3.592
 stagn-3.510
ulates-3.484
Token could
Token ID714
Feature activation-1.44e-01

Pos logit contributions
ulate+1.791
riages+1.680
ulating+1.680
 stagn+1.636
ulates+1.601
Neg logit contributions
seeing-2.055
nor-1.908
��-1.894
colour-1.888
 Teach-1.825
Token guarantee
Token ID9149
Feature activation-8.18e-01

Pos logit contributions
ulate+0.408
riages+0.386
ulating+0.386
 stagn+0.370
ulates+0.368
Neg logit contributions
seeing-0.439
nor-0.407
colour-0.401
��-0.397
 Teach-0.393
Token fossil
Token ID12584
Feature activation-1.17e+00

Pos logit contributions
ulate+2.292
riages+2.153
ulating+2.151
 stagn+2.105
ulates+2.046
Neg logit contributions
seeing-2.518
��-2.343
nor-2.341
colour-2.319
 Teach-2.234
Token fuel
Token ID5252
Feature activation-5.56e-02

Pos logit contributions
ulate+3.610
riages+3.419
ulating+3.417
 stagn+3.338
ulates+3.261
Neg logit contributions
seeing-3.282
nor-3.020
��-3.017
colour-2.986
 Teach-2.877
Token production
Token ID3227
Feature activation-5.43e-01

Pos logit contributions
ulate+0.148
ulating+0.138
riages+0.138
 stagn+0.133
ulates+0.131
Neg logit contributions
seeing-0.179
nor-0.166
colour-0.164
 Teach-0.160
��-0.160
Tokenate
Token ID378
Feature activation+2.60e-01

Pos logit contributions
ulate+0.953
riages+0.892
ulating+0.890
 stagn+0.861
ulates+0.841
Neg logit contributions
seeing-1.311
nor-1.225
colour-1.211
��-1.206
 Teach-1.180
Token their
Token ID511
Feature activation-1.37e-01

Pos logit contributions
seeing+0.784
nor+0.723
colour+0.715
 Teach+0.700
��+0.695
Neg logit contributions
ulate-0.738
ulating-0.696
riages-0.696
 stagn-0.669
ulates-0.665
Token young
Token ID1862
Feature activation-3.61e-01

Pos logit contributions
ulate+0.377
riages+0.354
ulating+0.354
 stagn+0.343
ulates+0.336
Neg logit contributions
seeing-0.432
nor-0.401
colour-0.396
��-0.393
 Teach-0.386
Token for
Token ID329
Feature activation+6.68e-01

Pos logit contributions
ulate+1.015
riages+0.954
ulating+0.954
 stagn+0.925
ulates+0.908
Neg logit contributions
seeing-1.113
nor-1.030
colour-1.018
��-1.012
 Teach-0.989
Token two
Token ID734
Feature activation-2.94e-01

Pos logit contributions
seeing+1.867
nor+1.695
colour+1.675
 Teach+1.648
��+1.629
Neg logit contributions
ulate-2.048
riages-1.941
ulating-1.934
 stagn-1.866
ulates-1.854
Token months
Token ID1933
Feature activation-1.84e-01

Pos logit contributions
ulate+0.909
riages+0.861
ulating+0.860
 stagn+0.839
ulates+0.821
Neg logit contributions
seeing-0.822
nor-0.757
colour-0.746
��-0.746
 Teach-0.721
Token at
Token ID379
Feature activation+4.57e-01

Pos logit contributions
ulate+0.511
ulating+0.478
riages+0.477
 stagn+0.460
ulates+0.457
Neg logit contributions
seeing-0.579
nor-0.536
colour-0.528
��-0.521
 Teach-0.515
Token most
Token ID749
Feature activation+2.76e-01

Pos logit contributions
seeing+1.330
nor+1.227
colour+1.210
��+1.201
 Teach+1.184
Neg logit contributions
ulate-1.352
riages-1.275
ulating-1.273
 stagn-1.229
ulates-1.219
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
seeing+0.868
nor+0.803
colour+0.791
��+0.783
 Teach+0.773
Neg logit contributions
ulate-0.754
ulating-0.707
riages-0.706
 stagn-0.682
ulates-0.675
Token\n
Token ID198
Feature activation-1.58e-01

Pos logit contributions
seeing+0.789
nor+0.729
colour+0.717
��+0.710
 Teach+0.707
Neg logit contributions
ulate-0.734
riages-0.694
ulating-0.692
 stagn-0.677
ulates-0.658
Token\n
Token ID198
Feature activation+2.28e-01

Pos logit contributions
ulate+0.453
���+0.432
ulating+0.429
riages+0.428
 stagn+0.418
Neg logit contributions
seeing-0.488
nor-0.451
colour-0.438
 Teach-0.431
ーン-0.409
Token competitors
Token ID13861
Feature activation-4.47e-01

Pos logit contributions
ulate+0.136
riages+0.128
ulating+0.127
 stagn+0.122
ulates+0.120
Neg logit contributions
seeing-0.182
nor-0.169
colour-0.167
��-0.167
 Teach-0.164
Token,
Token ID11
Feature activation+9.95e-02

Pos logit contributions
ulate+1.162
ulating+1.087
riages+1.087
 stagn+1.052
ulates+1.030
Neg logit contributions
seeing-1.469
nor-1.369
colour-1.353
��-1.350
 Teach-1.316
Token said
Token ID531
Feature activation-1.31e+00

Pos logit contributions
seeing+0.303
nor+0.281
colour+0.276
��+0.274
 Teach+0.270
Neg logit contributions
ulate-0.284
riages-0.267
ulating-0.266
 stagn-0.257
ulates-0.254
Token the
Token ID262
Feature activation+8.14e-01

Pos logit contributions
ulate+3.541
ulating+3.325
riages+3.308
 stagn+3.221
ulates+3.140
Neg logit contributions
seeing-4.132
nor-3.866
��-3.858
colour-3.846
 Teach-3.684
Token company
Token ID1664
Feature activation-5.85e-01

Pos logit contributions
seeing+2.399
nor+2.199
colour+2.157
 Teach+2.138
��+2.105
Neg logit contributions
ulate-2.360
riages-2.228
ulating-2.224
ulates-2.132
 stagn-2.129
Token
Token ID447
Feature activation-3.77e-01

Pos logit contributions
ulate+1.689
riages+1.593
ulating+1.591
 stagn+1.548
ulates+1.516
Neg logit contributions
seeing-1.756
nor-1.625
colour-1.606
��-1.605
 Teach-1.557
Token
Token ID247
Feature activation-1.57e-01

Pos logit contributions
ulate+1.380
ulating+1.327
riages+1.327
 stagn+1.289
ulates+1.267
Neg logit contributions
seeing-0.843
nor-0.757
��-0.742
colour-0.728
 Teach-0.727
Tokens
Token ID82
Feature activation-3.61e-01

Pos logit contributions
ulate+0.501
riages+0.475
ulating+0.475
 stagn+0.462
ulates+0.454
Neg logit contributions
seeing-0.424
nor-0.389
colour-0.383
��-0.381
 Teach-0.370
Token sizable
Token ID34282
Feature activation+1.85e-02

Pos logit contributions
ulate+0.978
riages+0.919
ulating+0.918
 stagn+0.888
ulates+0.872
Neg logit contributions
seeing-1.149
nor-1.066
colour-1.052
��-1.047
 Teach-1.025
Token spectrum
Token ID10958
Feature activation+2.09e+00

Pos logit contributions
seeing+0.056
nor+0.051
colour+0.050
��+0.050
 Teach+0.049
Neg logit contributions
ulate-0.053
riages-0.050
ulating-0.050
 stagn-0.049
ulates-0.048
Token assets
Token ID6798
Feature activation+6.19e-01

Pos logit contributions
seeing+5.939
nor+5.333
colour+5.187
 Teach+5.106
ーン+5.011
Neg logit contributions
ulate-6.059
ulating-5.671
riages-5.621
ulates-5.436
sei-5.396
Token im
Token ID545
Feature activation-1.02e-01

Pos logit contributions
ulate+0.038
riages+0.036
ulating+0.036
 stagn+0.035
ulates+0.034
Neg logit contributions
seeing-0.042
nor-0.039
colour-0.038
��-0.038
 Teach-0.037
Tokenutils
Token ID26791
Feature activation-9.91e-01

Pos logit contributions
ulate+0.300
riages+0.284
ulating+0.283
 stagn+0.275
ulates+0.270
Neg logit contributions
seeing-0.302
nor-0.279
colour-0.275
��-0.273
 Teach-0.266
Token import
Token ID1330
Feature activation+1.09e-01

Pos logit contributions
ulate+2.740
ulating+2.561
riages+2.548
 stagn+2.489
ulates+2.435
Neg logit contributions
seeing-3.094
nor-2.858
��-2.842
colour-2.833
 Teach-2.741
Token paths
Token ID13532
Feature activation+4.97e-01

Pos logit contributions
seeing+0.319
nor+0.294
colour+0.290
��+0.288
 Teach+0.282
Neg logit contributions
ulate-0.320
riages-0.303
ulating-0.303
 stagn-0.294
ulates-0.289
Token import
Token ID1330
Feature activation-1.21e-01

Pos logit contributions
seeing+1.460
nor+1.359
colour+1.323
��+1.302
 Teach+1.286
Neg logit contributions
ulate-1.446
riages-1.380
ulating-1.376
 stagn-1.334
ulates-1.311
Token n
Token ID299
Feature activation-5.31e-01

Pos logit contributions
ulate+0.351
riages+0.331
ulating+0.331
 stagn+0.321
ulates+0.315
Neg logit contributions
seeing-0.360
nor-0.333
colour-0.328
��-0.327
 Teach-0.319
Tokenumpy
Token ID32152
Feature activation-1.97e+00

Pos logit contributions
ulate+1.211
ulating+1.117
riages+1.111
 stagn+1.071
ulates+1.052
Neg logit contributions
seeing-1.916
nor-1.793
��-1.786
colour-1.777
 Teach-1.743
Token as
Token ID355
Feature activation+2.37e-01

Pos logit contributions
ulate+5.281
ulating+4.893
riages+4.825
 stagn+4.779
ulates+4.653
Neg logit contributions
seeing-6.209
��-5.842
nor-5.690
colour-5.673
 Teach-5.513
Token np
Token ID45941
Feature activation+2.37e-01

Pos logit contributions
seeing+0.739
nor+0.703
colour+0.679
��+0.668
 Teach+0.662
Neg logit contributions
ulate-0.638
riages-0.608
ulating-0.602
 stagn-0.583
sei-0.576
Token import
Token ID1330
Feature activation+4.67e-01

Pos logit contributions
seeing+0.707
nor+0.657
colour+0.645
��+0.637
 Teach+0.629
Neg logit contributions
ulate-0.684
riages-0.650
ulating-0.648
 stagn-0.626
ulates-0.616
Token arg
Token ID1822
Feature activation+4.68e-01

Pos logit contributions
seeing+1.454
nor+1.350
colour+1.333
��+1.310
 Teach+1.305
Neg logit contributions
ulate-1.279
riages-1.214
ulating-1.209
 stagn-1.160
ulates-1.147
Token therefore
Token ID4361
Feature activation-4.32e-01

Pos logit contributions
ulate+1.024
riages+0.966
ulating+0.965
 stagn+0.936
ulates+0.919
Neg logit contributions
seeing-1.068
nor-0.987
colour-0.974
��-0.968
 Teach-0.947
Token retains
Token ID27452
Feature activation-1.38e+00

Pos logit contributions
ulate+1.241
riages+1.169
ulating+1.169
 stagn+1.137
ulates+1.114
Neg logit contributions
seeing-1.302
nor-1.205
colour-1.190
��-1.189
 Teach-1.154
Token more
Token ID517
Feature activation-5.38e-01

Pos logit contributions
ulate+3.781
ulating+3.515
riages+3.484
 stagn+3.447
ulates+3.351
Neg logit contributions
seeing-4.307
��-4.044
nor-4.016
colour-3.955
 Teach-3.859
Token detail
Token ID3703
Feature activation+9.49e-01

Pos logit contributions
ulate+1.734
ulating+1.641
riages+1.639
 stagn+1.600
ulates+1.573
Neg logit contributions
seeing-1.437
nor-1.317
��-1.300
colour-1.298
 Teach-1.256
Token than
Token ID621
Feature activation+4.56e-01

Pos logit contributions
seeing+2.813
nor+2.578
colour+2.547
��+2.459
 Teach+2.435
Neg logit contributions
ulate-2.721
riages-2.612
ulating-2.571
 stagn-2.507
sei-2.484
Token grid
Token ID10706
Feature activation-2.38e+00

Pos logit contributions
seeing+1.391
nor+1.281
colour+1.269
 Teach+1.240
��+1.233
Neg logit contributions
ulate-1.277
riages-1.223
ulating-1.204
 stagn-1.158
sei-1.146
Token-
Token ID12
Feature activation-4.81e-01

Pos logit contributions
ulate+6.022
ulating+5.673
riages+5.425
ulates+5.334
 stagn+5.329
Neg logit contributions
seeing-7.643
��-7.480
nor-7.150
colour-7.079
 Teach-7.051
Tokenbased
Token ID3106
Feature activation-2.50e+00

Pos logit contributions
ulate+1.582
ulating+1.500
riages+1.490
 stagn+1.455
ulates+1.438
Neg logit contributions
seeing-1.243
��-1.140
nor-1.137
colour-1.115
 Teach-1.095
Token simulations
Token ID27785
Feature activation-1.45e+00

Pos logit contributions
ulate+6.294
ulating+5.847
 stagn+5.688
riages+5.645
ulates+5.575
Neg logit contributions
seeing-8.110
��-7.738
nor-7.622
colour-7.467
 Teach-7.275
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
ulate+3.834
ulating+3.570
riages+3.489
 stagn+3.469
ulates+3.361
Neg logit contributions
seeing-4.638
��-4.382
nor-4.297
colour-4.290
ーン-4.153
Token\n
Token ID198
Feature activation-5.24e-01

Pos logit contributions
ulate+1.042
ulating+0.983
riages+0.982
 stagn+0.955
ulates+0.938
Neg logit contributions
seeing-1.031
nor-0.952
��-0.941
colour-0.940
 Teach-0.908
Token two
Token ID734
Feature activation-8.59e-01

Pos logit contributions
ulate+1.324
ulating+1.245
riages+1.244
 stagn+1.209
ulates+1.183
Neg logit contributions
seeing-1.477
nor-1.371
colour-1.354
��-1.354
 Teach-1.314
Token days
Token ID1528
Feature activation-8.16e-01

Pos logit contributions
ulate+2.552
riages+2.426
ulating+2.417
 stagn+2.358
ulates+2.296
Neg logit contributions
seeing-2.468
nor-2.279
��-2.279
colour-2.252
 Teach-2.179
Token.
Token ID13
Feature activation-2.29e-01

Pos logit contributions
ulate+2.114
ulating+1.987
riages+1.978
 stagn+1.919
ulates+1.867
Neg logit contributions
seeing-2.669
nor-2.488
��-2.477
colour-2.470
 Teach-2.399
Token\n
Token ID198
Feature activation-4.79e-01

Pos logit contributions
ulate+0.637
ulating+0.599
riages+0.598
 stagn+0.578
ulates+0.569
Neg logit contributions
seeing-0.707
nor-0.656
��-0.651
colour-0.649
 Teach-0.627
Token\n
Token ID198
Feature activation-6.58e-02

Pos logit contributions
ulate+1.319
riages+1.243
ulating+1.242
 stagn+1.209
���+1.184
Neg logit contributions
seeing-1.521
nor-1.410
colour-1.382
 Teach-1.354
��-1.321
TokenR
Token ID49
Feature activation-2.42e+00

Pos logit contributions
ulate+0.215
riages+0.204
ulating+0.204
 stagn+0.199
ulates+0.196
Neg logit contributions
seeing-0.172
nor-0.158
��-0.155
colour-0.155
 Teach-0.150
Token S
Token ID311
Feature activation-1.34e-01

Pos logit contributions
ulate+6.334
ulating+6.014
riages+6.006
 stagn+5.763
ulates+5.670
Neg logit contributions
seeing-7.463
��-7.119
colour-7.018
nor-6.830
 Lauder-6.703
Tokenenth
Token ID7944
Feature activation+1.54e+00

Pos logit contributions
ulate+0.398
ulating+0.375
riages+0.375
 stagn+0.367
ulates+0.358
Neg logit contributions
seeing-0.391
nor-0.362
colour-0.355
��-0.353
 Teach-0.347
Tokenil
Token ID346
Feature activation-1.48e+00

Pos logit contributions
seeing+5.131
nor+4.923
colour+4.723
 Teach+4.716
��+4.656
Neg logit contributions
ulate-3.681
riages-3.494
ulating-3.453
 stagn-3.368
sei-3.260
Token Kumar
Token ID26105
Feature activation-3.82e-01

Pos logit contributions
ulate+4.384
ulating+4.132
riages+4.131
 stagn+4.013
ulates+3.945
Neg logit contributions
seeing-4.222
��-3.960
colour-3.883
nor-3.881
 Teach-3.717
Token/
Token ID14
Feature activation+1.36e+00

Pos logit contributions
ulate+1.110
riages+1.046
ulating+1.046
 stagn+1.016
ulates+0.997
Neg logit contributions
seeing-1.141
nor-1.054
colour-1.042
��-1.039
 Teach-1.010
Token,
Token ID11
Feature activation-4.12e-01

Pos logit contributions
seeing+0.816
nor+0.752
colour+0.741
��+0.727
 Teach+0.721
Neg logit contributions
ulate-0.811
riages-0.772
ulating-0.767
 stagn-0.746
ulates-0.734
Token uproar
Token ID40741
Feature activation+4.77e-01

Pos logit contributions
ulate+1.103
ulating+1.034
riages+1.033
 stagn+1.002
ulates+0.981
Neg logit contributions
seeing-1.326
nor-1.233
colour-1.219
��-1.217
 Teach-1.185
Tokeniously
Token ID6819
Feature activation-1.09e+00

Pos logit contributions
seeing+1.604
nor+1.496
colour+1.471
��+1.456
 Teach+1.423
Neg logit contributions
ulate-1.204
riages-1.126
ulating-1.107
 stagn-1.087
sei-1.059
Token so
Token ID523
Feature activation-7.32e-01

Pos logit contributions
ulate+2.683
ulating+2.506
riages+2.497
 stagn+2.428
ulates+2.362
Neg logit contributions
seeing-3.716
nor-3.477
��-3.460
colour-3.428
 Teach-3.342
Token from
Token ID422
Feature activation-5.87e-01

Pos logit contributions
ulate+1.925
ulating+1.801
riages+1.797
 stagn+1.738
ulates+1.707
Neg logit contributions
seeing-2.376
nor-2.212
��-2.201
colour-2.195
 Teach-2.134
Token guys
Token ID3730
Feature activation-2.79e+00

Pos logit contributions
ulate+1.630
ulating+1.533
riages+1.523
 stagn+1.483
ulates+1.457
Neg logit contributions
seeing-1.815
nor-1.693
��-1.677
colour-1.674
 Teach-1.608
Token who
Token ID508
Feature activation-9.11e-01

Pos logit contributions
ulate+7.217
ulating+6.930
riages+6.774
 stagn+6.765
sei+6.504
Neg logit contributions
seeing-8.377
��-8.028
nor-7.869
colour-7.850
ーン-7.540
Token have
Token ID423
Feature activation+3.42e-01

Pos logit contributions
ulate+2.579
riages+2.420
ulating+2.419
 stagn+2.369
ulates+2.298
Neg logit contributions
seeing-2.782
nor-2.575
��-2.566
colour-2.546
 Teach-2.454
Token a
Token ID257
Feature activation-5.02e-01

Pos logit contributions
seeing+1.013
nor+0.936
colour+0.921
��+0.911
 Teach+0.901
Neg logit contributions
ulate-0.998
ulating-0.942
riages-0.941
 stagn-0.902
ulates-0.898
Token history
Token ID2106
Feature activation+3.68e-01

Pos logit contributions
ulate+1.271
riages+1.188
ulating+1.186
 stagn+1.146
ulates+1.123
Neg logit contributions
seeing-1.687
nor-1.574
colour-1.556
��-1.551
 Teach-1.517
Token of
Token ID286
Feature activation-3.58e-01

Pos logit contributions
seeing+1.136
nor+1.045
colour+1.028
��+1.023
 Teach+1.009
Neg logit contributions
ulate-1.037
riages-0.977
ulating-0.965
 stagn-0.939
ulates-0.929
Tokenus
Token ID385
Feature activation-4.33e-01

Pos logit contributions
seeing+0.842
nor+0.770
colour+0.757
��+0.747
 Teach+0.737
Neg logit contributions
ulate-1.017
riages-0.967
ulating-0.965
 stagn-0.945
ulates-0.925
Token and
Token ID290
Feature activation-5.72e-01

Pos logit contributions
ulate+1.186
riages+1.114
ulating+1.113
 stagn+1.080
ulates+1.057
Neg logit contributions
seeing-1.363
nor-1.266
colour-1.251
��-1.249
 Teach-1.215
Token through
Token ID832
Feature activation+7.08e-01

Pos logit contributions
ulate+1.678
ulating+1.584
riages+1.581
 stagn+1.541
ulates+1.507
Neg logit contributions
seeing-1.684
nor-1.560
��-1.543
colour-1.542
 Teach-1.481
Token that
Token ID326
Feature activation-1.09e+00

Pos logit contributions
seeing+2.238
nor+2.064
colour+2.030
��+2.009
 Teach+2.000
Neg logit contributions
ulate-1.938
riages-1.831
ulating-1.813
 stagn-1.761
ulates-1.732
Token support
Token ID1104
Feature activation+1.32e+00

Pos logit contributions
ulate+2.965
ulating+2.792
riages+2.780
 stagn+2.717
ulates+2.636
Neg logit contributions
seeing-3.389
��-3.175
nor-3.163
colour-3.155
ーン-2.984
Token making
Token ID1642
Feature activation-2.40e+00

Pos logit contributions
seeing+3.942
nor+3.582
 Teach+3.557
colour+3.499
��+3.455
Neg logit contributions
ulate-3.688
riages-3.486
ulating-3.476
ulates-3.336
 stagn-3.320
Token this
Token ID428
Feature activation-1.14e+00

Pos logit contributions
ulate+6.678
ulating+6.259
 stagn+6.131
riages+6.120
sei+5.918
Neg logit contributions
seeing-6.902
��-6.738
nor-6.538
colour-6.439
ーン-6.153
Token kind
Token ID1611
Feature activation-4.81e-02

Pos logit contributions
ulate+3.171
ulating+2.971
riages+2.948
 stagn+2.910
ulates+2.819
Neg logit contributions
seeing-3.526
��-3.289
nor-3.271
colour-3.212
 Teach-3.071
Token of
Token ID286
Feature activation-5.86e-01

Pos logit contributions
ulate+0.138
riages+0.132
ulating+0.130
 stagn+0.127
ulates+0.125
Neg logit contributions
seeing-0.144
nor-0.135
colour-0.131
 Teach-0.129
��-0.126
Token content
Token ID2695
Feature activation+3.83e-01

Pos logit contributions
ulate+1.643
ulating+1.544
riages+1.537
 stagn+1.501
ulates+1.465
Neg logit contributions
seeing-1.809
nor-1.682
��-1.671
colour-1.658
 Teach-1.597
Token available
Token ID1695
Feature activation+8.62e-01

Pos logit contributions
seeing+1.314
nor+1.231
colour+1.217
��+1.198
 Teach+1.196
Neg logit contributions
ulate-0.924
riages-0.870
ulating-0.864
 stagn-0.831
ulates-0.817
Tokenext
Token ID2302
Feature activation+3.10e-01

Pos logit contributions
seeing+1.460
nor+1.353
colour+1.338
��+1.309
 Teach+1.280
Neg logit contributions
ulate-1.400
riages-1.324
ulating-1.322
 stagn-1.287
ulates-1.258
Token =
Token ID796
Feature activation-5.66e-02

Pos logit contributions
seeing+0.930
nor+0.861
colour+0.854
��+0.842
 Teach+0.822
Neg logit contributions
ulate-0.889
riages-0.840
ulating-0.839
 stagn-0.820
ulates-0.800
Token ($
Token ID7198
Feature activation-1.47e+00

Pos logit contributions
ulate+0.165
riages+0.156
ulating+0.156
 stagn+0.151
ulates+0.149
Neg logit contributions
seeing-0.168
nor-0.156
colour-0.154
��-0.152
 Teach-0.150
Tokentext
Token ID5239
Feature activation-7.12e-02

Pos logit contributions
ulate+4.561
ulating+4.321
riages+4.295
 stagn+4.136
ulates+4.125
Neg logit contributions
seeing-3.950
��-3.654
nor-3.626
 Teach-3.538
colour-3.505
Token -
Token ID532
Feature activation-6.38e-02

Pos logit contributions
ulate+0.204
riages+0.192
ulating+0.192
 stagn+0.186
ulates+0.183
Neg logit contributions
seeing-0.215
nor-0.199
colour-0.197
��-0.196
 Teach-0.191
Tokensplit
Token ID35312
Feature activation-2.97e+00

Pos logit contributions
ulate+0.201
riages+0.191
ulating+0.191
 stagn+0.186
ulates+0.182
Neg logit contributions
seeing-0.175
nor-0.161
colour-0.158
��-0.157
 Teach-0.153
Token "\
Token ID37082
Feature activation-1.06e+00

Pos logit contributions
ulate+8.633
ulating+8.125
riages+7.791
ulates+7.684
 stagn+7.650
Neg logit contributions
seeing-8.146
��-7.585
nor-7.359
colour-7.150
 Teach-7.113
Tokenr
Token ID81
Feature activation-5.35e-01

Pos logit contributions
ulate+3.369
ulating+3.188
riages+3.161
 stagn+3.065
ulates+3.060
Neg logit contributions
seeing-2.868
nor-2.637
��-2.616
colour-2.579
 Teach-2.497
Token\n
Token ID198
Feature activation-4.87e-01

Pos logit contributions
ulate+1.417
riages+1.328
ulating+1.327
 stagn+1.285
ulates+1.259
Neg logit contributions
seeing-1.733
nor-1.612
colour-1.593
��-1.589
 Teach-1.550
Token\n
Token ID198
Feature activation+2.39e-01

Pos logit contributions
ulate+1.339
riages+1.263
ulating+1.261
 stagn+1.232
ulates+1.197
Neg logit contributions
seeing-1.551
nor-1.439
colour-1.413
 Teach-1.372
��-1.346
Token")
Token ID4943
Feature activation-9.00e-01

Pos logit contributions
seeing+0.704
nor+0.650
colour+0.645
��+0.629
 Teach+0.617
Neg logit contributions
ulate-0.705
riages-0.668
ulating-0.665
 stagn-0.652
ulates-0.634