TOP ACTIVATIONS
MAX = 6.522

in a device driver for SATA disks . T apes save
, an impoverished and sterile amalg am of social democracy and
, when they blew their noses , Jamie sitting on the
is r al - Sh ugh our , near the northern
pas aurus rex in our debugger : 1 $
r auc ously celebrated the dism ember ment of four rabb
MIDI controllers , specialized USB peripher als , and LED projects
oceans . Most of us breathe a sigh of relief or
" conditions added at his arra ignment in December 2016 .
with J eth ro P ugh [ who died last week
his thigh and severed an artery . Despite his friends '
) Master the Exc ursion Fun nel ( 1
best - known mail order catalog ues apologised today after web
up the air their kids breathe all deserve the facts behind
MC Us with fairly simple peripher als , I
kind of programming is a breath of fresh air in a
the social media world !!! Exc iting
when you turn the f auc et on , the water
that way . The programming abstraction commonly used for such purposes
on , then open the catalog like you would any ebook

MIN ACTIVATIONS
MIN = -6.228

this had dried , I tint ed the st on ework
on the safety issues regarding manned flights . " He says
Nap Time Reply · Report Post <|endoftext|> The
area . Cor respond ents say the mine may
NASA s commercial crew effort , which will make
Lastly , I tint ed around the cl umps
your photo 's saturation , tint , and the like ,
rain , and Prof . Loren Cord ain comes down pretty
space station operations . A Soy uz carrying NASA astronaut Shane
said Ryan , attribut ing the foul - up
Hillary . His supporters are blinded by their anger twisted into
sure that both Input Stream Adapt ive and RT MP Input
two patriarch s in a pod . And Pence showed it
degrees east in Mare Tr anqu ill it atis , the
seen below . Pri est Cont acts :
consider Clinton as a running mate in the 2016 presidential election
Jordan Kal ish Surv iv or s
to lose . Adapt ed from the forthcoming book
, the Sea of Tr anqu ility . After several short
9 / 11 GI Bill ® . What is

INTERVAL 3.335 - 6.522
CONTAINS 0.144%

F ailed cannot find the current wallpaper $ ($ w path
all familiar with , from catalog ing various marketing slogans to
caught . There is one exception : " vi :" and
already occurred , Gol ight ly - How ell
Though the game 's popularity has w aned since last

INTERVAL 0.147 - 3.335
CONTAINS 38.048%

Yet Gl ü ck has forced a whole world into a
and 2012 was 25 percent lower than average yields . I
78 70 Hawk uses Military Class III components which have passed
causes . Make your wishes now !
starring TV role was in Str angers with Candy , a

INTERVAL -3.040 - 0.147
CONTAINS 61.520%

American people , through Congress , will control the greed ,
as overweight and out of shape people struggle to jog or
the Governor s mansion . McDonnell did not steer
'm officially declaring them as des er ter of C .
as even at that resolution there are still those spaces between

INTERVAL -6.228 - -3.040
CONTAINS 0.288%

or the transcend ent , into the realm of nature .
area . Cor respond ents say the mine may
Rush Hy ster ia was puls ing through the veins of
Philip Morris , Victor De Nob el and Paul Me le
to take the Ring from Fro do , but Fro do
Token in
Token ID287
Feature activation+3.64e-01

Pos logit contributions
 misunder+0.765
Pri+0.716
 differ+0.669
1945+0.663
 attitudes+0.662
Neg logit contributions
 SATA-0.942
HAM-0.937
oscope-0.911
 DRAG-0.886
 Turtles-0.861
Token a
Token ID257
Feature activation+4.34e-02

Pos logit contributions
 SATA+0.289
HAM+0.286
oscope+0.279
 DRAG+0.270
 Turtles+0.263
Neg logit contributions
 misunder-0.239
Pri-0.223
 differ-0.207
1945-0.206
 attitudes-0.205
Token device
Token ID3335
Feature activation+6.02e-01

Pos logit contributions
 SATA+0.031
HAM+0.031
oscope+0.030
 DRAG+0.029
 Turtles+0.028
Neg logit contributions
 misunder-0.032
Pri-0.030
 differ-0.028
1945-0.028
 attitudes-0.028
Token driver
Token ID4639
Feature activation+1.25e+00

Pos logit contributions
 SATA+0.478
HAM+0.473
oscope+0.462
 DRAG+0.447
 Turtles+0.434
Neg logit contributions
 misunder-0.394
Pri-0.369
 differ-0.342
1945-0.341
 attitudes-0.339
Token for
Token ID329
Feature activation+2.16e-01

Pos logit contributions
 SATA+0.980
HAM+0.968
oscope+0.944
 DRAG+0.913
 Turtles+0.887
Neg logit contributions
 misunder-0.839
Pri-0.785
 differ-0.725
1945-0.725
 attitudes-0.721
Token SATA
Token ID35070
Feature activation+6.52e+00

Pos logit contributions
HAM+0.162
 SATA+0.162
oscope+0.158
 DRAG+0.152
 Turtles+0.147
Neg logit contributions
 misunder-0.149
Pri-0.139
 differ-0.131
 attitudes-0.130
1945-0.130
Token disks
Token ID32505
Feature activation+1.96e+00

Pos logit contributions
 SATA+4.940
HAM+4.706
oscope+4.528
 DRAG+4.386
 Turtles+4.294
Neg logit contributions
 misunder-4.685
Pri-4.333
1945-4.109
lves-4.012
 kitchens-3.971
Token.
Token ID13
Feature activation+6.57e-01

Pos logit contributions
 SATA+1.516
HAM+1.486
oscope+1.448
 DRAG+1.402
 Turtles+1.363
Neg logit contributions
 misunder-1.349
Pri-1.261
1945-1.167
 differ-1.160
 attitudes-1.157
Token T
Token ID309
Feature activation+1.77e-01

Pos logit contributions
HAM+0.467
 SATA+0.464
oscope+0.456
 DRAG+0.436
lishes+0.424
Neg logit contributions
 misunder-0.472
Pri-0.451
 differ-0.423
1945-0.421
 attitudes-0.418
Tokenapes
Token ID7916
Feature activation-1.17e+00

Pos logit contributions
 SATA+0.121
HAM+0.120
oscope+0.116
 DRAG+0.112
 Turtles+0.109
Neg logit contributions
 misunder-0.135
Pri-0.128
 differ-0.120
1945-0.120
 attitudes-0.119
Token save
Token ID3613
Feature activation+2.89e-01

Pos logit contributions
 misunder+0.724
Pri+0.676
 differ+0.639
 attitudes+0.625
1945+0.622
Neg logit contributions
HAM-0.953
 SATA-0.952
oscope-0.931
 DRAG-0.900
 Turtles-0.872
Token,
Token ID11
Feature activation+2.73e-01

Pos logit contributions
 misunder+0.981
Pri+0.918
 differ+0.854
1945+0.848
 attitudes+0.847
Neg logit contributions
 SATA-1.208
HAM-1.198
oscope-1.168
 DRAG-1.134
 Turtles-1.102
Token an
Token ID281
Feature activation+6.75e-01

Pos logit contributions
 SATA+0.227
HAM+0.225
oscope+0.220
 DRAG+0.213
 Turtles+0.207
Neg logit contributions
 misunder-0.169
Pri-0.158
 differ-0.145
1945-0.145
esa-0.144
Token impoverished
Token ID35014
Feature activation+2.64e+00

Pos logit contributions
 SATA+0.527
HAM+0.522
oscope+0.510
 DRAG+0.493
 Turtles+0.479
Neg logit contributions
 misunder-0.451
Pri-0.423
 differ-0.393
1945-0.391
 attitudes-0.389
Token and
Token ID290
Feature activation+2.02e-01

Pos logit contributions
 SATA+2.113
HAM+2.080
oscope+2.037
 DRAG+1.959
 Turtles+1.915
Neg logit contributions
 misunder-1.712
Pri-1.606
1945-1.478
 differ-1.476
esa-1.471
Token sterile
Token ID38697
Feature activation-2.76e-01

Pos logit contributions
 SATA+0.175
HAM+0.174
oscope+0.170
 DRAG+0.165
 Turtles+0.161
Neg logit contributions
 misunder-0.118
Pri-0.109
 differ-0.100
1945-0.100
 attitudes-0.099
Token amalg
Token ID45541
Feature activation+5.54e+00

Pos logit contributions
 misunder+0.181
Pri+0.169
 differ+0.158
 attitudes+0.158
1945+0.156
Neg logit contributions
 SATA-0.218
HAM-0.217
oscope-0.212
 DRAG-0.206
 Turtles-0.199
Tokenam
Token ID321
Feature activation+9.36e-01

Pos logit contributions
 SATA+3.223
oscope+3.189
HAM+3.122
antage+2.865
ylum+2.830
Neg logit contributions
 misunder-4.688
Pri-4.386
 attitudes-4.222
 breached-4.189
esa-4.182
Token of
Token ID286
Feature activation+1.56e-01

Pos logit contributions
 SATA+0.796
HAM+0.783
oscope+0.780
 DRAG+0.741
antage+0.727
Neg logit contributions
 misunder-0.561
Pri-0.519
1945-0.473
esa-0.471
 attitudes-0.471
Token social
Token ID1919
Feature activation-1.03e-01

Pos logit contributions
 SATA+0.135
HAM+0.134
oscope+0.130
 DRAG+0.127
 Turtles+0.124
Neg logit contributions
 misunder-0.091
Pri-0.085
1945-0.077
 differ-0.077
esa-0.077
Token democracy
Token ID7996
Feature activation+5.24e-01

Pos logit contributions
 misunder+0.062
Pri+0.058
 differ+0.053
 attitudes+0.053
1945+0.053
Neg logit contributions
 SATA-0.087
HAM-0.087
oscope-0.085
 DRAG-0.082
 Turtles-0.080
Token and
Token ID290
Feature activation-1.69e-01

Pos logit contributions
 SATA+0.405
HAM+0.401
oscope+0.391
 DRAG+0.377
 Turtles+0.367
Neg logit contributions
 misunder-0.353
Pri-0.331
 differ-0.306
1945-0.305
esa-0.304
Token,
Token ID11
Feature activation+2.98e-02

Pos logit contributions
 misunder+0.136
Pri+0.127
 differ+0.118
1945+0.117
 attitudes+0.117
Neg logit contributions
 SATA-0.161
HAM-0.160
oscope-0.156
 DRAG-0.151
 Turtles-0.146
Token when
Token ID618
Feature activation-2.10e-01

Pos logit contributions
 SATA+0.025
HAM+0.024
oscope+0.024
 DRAG+0.023
 Turtles+0.023
Neg logit contributions
 misunder-0.019
Pri-0.017
 differ-0.016
1945-0.016
esa-0.016
Token they
Token ID484
Feature activation+1.18e+00

Pos logit contributions
 misunder+0.134
Pri+0.125
 differ+0.116
 attitudes+0.115
1945+0.115
Neg logit contributions
 SATA-0.170
HAM-0.168
oscope-0.164
 DRAG-0.159
 Turtles-0.154
Token blew
Token ID17948
Feature activation+1.54e-01

Pos logit contributions
 SATA+1.008
HAM+0.997
oscope+0.977
 DRAG+0.948
 Turtles+0.924
Neg logit contributions
 misunder-0.708
Pri-0.658
1945-0.601
 differ-0.599
 attitudes-0.597
Token their
Token ID511
Feature activation+3.43e-01

Pos logit contributions
 SATA+0.125
HAM+0.124
oscope+0.121
 DRAG+0.117
 Turtles+0.114
Neg logit contributions
 misunder-0.098
Pri-0.092
 differ-0.085
 attitudes-0.085
1945-0.085
Token noses
Token ID41760
Feature activation+5.41e+00

Pos logit contributions
 SATA+0.304
HAM+0.302
oscope+0.296
 DRAG+0.287
 Turtles+0.280
Neg logit contributions
 misunder-0.193
Pri-0.179
 differ-0.164
 attitudes-0.163
1945-0.162
Token,
Token ID11
Feature activation+1.34e-01

Pos logit contributions
 SATA+4.031
HAM+4.001
oscope+3.935
 DRAG+3.754
antage+3.637
Neg logit contributions
 misunder-3.758
Pri-3.475
1945-3.245
esa-3.205
Zero-3.179
Token Jamie
Token ID17826
Feature activation+7.34e-01

Pos logit contributions
 SATA+0.113
HAM+0.112
oscope+0.109
 DRAG+0.106
 Turtles+0.103
Neg logit contributions
 misunder-0.081
Pri-0.076
 differ-0.069
1945-0.069
 attitudes-0.069
Token sitting
Token ID5586
Feature activation+4.19e-01

Pos logit contributions
 SATA+0.596
HAM+0.591
oscope+0.576
 DRAG+0.558
 Turtles+0.543
Neg logit contributions
 misunder-0.467
Pri-0.435
 differ-0.404
1945-0.402
 attitudes-0.400
Token on
Token ID319
Feature activation+6.88e-01

Pos logit contributions
 SATA+0.340
HAM+0.337
oscope+0.329
 DRAG+0.318
 Turtles+0.309
Neg logit contributions
 misunder-0.268
Pri-0.250
 differ-0.231
1945-0.230
 attitudes-0.229
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
 SATA+0.582
HAM+0.575
oscope+0.562
 DRAG+0.547
 Turtles+0.533
Neg logit contributions
 misunder-0.417
Pri-0.387
1945-0.356
 differ-0.355
esa-0.352
Tokenis
Token ID271
Feature activation+2.61e-01

Pos logit contributions
 misunder+0.618
Pri+0.599
esa+0.565
 differ+0.554
1945+0.550
Neg logit contributions
 SATA-0.771
HAM-0.762
oscope-0.740
 DRAG-0.730
 Turtles-0.702
Tokenr
Token ID81
Feature activation+5.37e-03

Pos logit contributions
 SATA+0.138
HAM+0.137
oscope+0.132
 DRAG+0.126
 Turtles+0.121
Neg logit contributions
 misunder-0.239
Pri-0.229
 differ-0.217
1945-0.216
 attitudes-0.216
Token al
Token ID435
Feature activation-4.19e-01

Pos logit contributions
 SATA+0.003
HAM+0.003
oscope+0.003
 DRAG+0.003
 Turtles+0.003
Neg logit contributions
 misunder-0.004
Pri-0.004
 differ-0.004
1945-0.004
 attitudes-0.004
Token-
Token ID12
Feature activation-3.62e-01

Pos logit contributions
 misunder+0.340
Pri+0.324
 differ+0.306
 attitudes+0.304
1945+0.303
Neg logit contributions
 SATA-0.263
HAM-0.262
oscope-0.255
 DRAG-0.245
 Turtles-0.234
TokenSh
Token ID2484
Feature activation-5.31e-04

Pos logit contributions
 misunder+0.300
Pri+0.287
 differ+0.270
1945+0.269
 attitudes+0.268
Neg logit contributions
 SATA-0.223
HAM-0.220
oscope-0.215
 DRAG-0.206
 Turtles-0.198
Tokenugh
Token ID6724
Feature activation+5.36e+00

Pos logit contributions
 misunder+0.000
Pri+0.000
esa+0.000
1945+0.000
 differ+0.000
Neg logit contributions
 SATA-0.000
HAM-0.000
oscope-0.000
 DRAG-0.000
 Turtles-0.000
Tokenour
Token ID454
Feature activation+9.35e-01

Pos logit contributions
 SATA+2.118
oscope+2.004
HAM+1.893
antage+1.738
 DRAG+1.699
Neg logit contributions
 misunder-5.721
Pri-5.349
1945-5.171
 kitchens-5.111
 attitudes-5.106
Token,
Token ID11
Feature activation+4.56e-02

Pos logit contributions
 SATA+0.707
HAM+0.703
oscope+0.681
 DRAG+0.656
 Turtles+0.637
Neg logit contributions
 misunder-0.653
Pri-0.607
 differ-0.564
1945-0.561
 attitudes-0.558
Token near
Token ID1474
Feature activation+8.49e-02

Pos logit contributions
 SATA+0.039
HAM+0.038
oscope+0.037
 DRAG+0.036
 Turtles+0.035
Neg logit contributions
 misunder-0.028
Pri-0.026
 differ-0.023
1945-0.023
 attitudes-0.023
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
 SATA+0.070
HAM+0.069
oscope+0.067
 DRAG+0.065
 Turtles+0.064
Neg logit contributions
 misunder-0.054
Pri-0.050
 differ-0.046
1945-0.046
 attitudes-0.045
Token northern
Token ID7840
Feature activation+6.28e-01

Pos logit contributions
 misunder+0.097
Pri+0.092
 attitudes+0.088
 differ+0.088
 kitchens+0.087
Neg logit contributions
HAM-0.091
 SATA-0.090
oscope-0.089
 DRAG-0.086
antage-0.082
Tokenpas
Token ID44429
Feature activation-1.12e+00

Pos logit contributions
 misunder+1.560
Pri+1.476
 differ+1.379
1945+1.374
esa+1.363
Neg logit contributions
 SATA-1.709
HAM-1.691
oscope-1.642
 DRAG-1.602
 Turtles-1.549
Tokenaurus
Token ID22302
Feature activation-6.76e-01

Pos logit contributions
 misunder+0.780
Pri+0.737
 differ+0.692
1945+0.688
esa+0.683
Neg logit contributions
 SATA-0.841
HAM-0.834
oscope-0.810
 DRAG-0.788
 Turtles-0.762
Tokenrex
Token ID21510
Feature activation-2.44e+00

Pos logit contributions
 misunder+0.469
Pri+0.441
 differ+0.413
1945+0.411
 attitudes+0.409
Neg logit contributions
 SATA-0.509
HAM-0.503
oscope-0.491
 DRAG-0.475
 Turtles-0.460
Token in
Token ID287
Feature activation+2.93e-01

Pos logit contributions
 misunder+1.632
Pri+1.550
 differ+1.446
1945+1.444
esa+1.441
Neg logit contributions
 SATA-1.860
HAM-1.843
oscope-1.788
 DRAG-1.757
 Turtles-1.696
Token our
Token ID674
Feature activation+2.44e-01

Pos logit contributions
 SATA+0.219
HAM+0.217
oscope+0.211
 DRAG+0.204
 Turtles+0.198
Neg logit contributions
 misunder-0.206
Pri-0.193
 differ-0.180
1945-0.179
 attitudes-0.179
Token debugger
Token ID49518
Feature activation+5.35e+00

Pos logit contributions
 SATA+0.183
HAM+0.181
oscope+0.177
 DRAG+0.171
 Turtles+0.166
Neg logit contributions
 misunder-0.170
Pri-0.160
 differ-0.149
1945-0.148
 attitudes-0.148
Token:
Token ID25
Feature activation+1.92e+00

Pos logit contributions
 SATA+3.827
HAM+3.756
oscope+3.676
 DRAG+3.534
 Turtles+3.415
Neg logit contributions
 misunder-3.961
Pri-3.641
1945-3.370
 attitudes-3.353
 breached-3.330
Token
Token ID198
Feature activation+6.25e-01

Pos logit contributions
 SATA+1.440
HAM+1.424
oscope+1.385
 DRAG+1.339
 Turtles+1.300
Neg logit contributions
 misunder-1.350
Pri-1.264
 differ-1.175
 attitudes-1.171
1945-1.169
Token
Token ID198
Feature activation+4.11e-01

Pos logit contributions
 SATA+0.485
HAM+0.480
oscope+0.468
 DRAG+0.454
 Turtles+0.440
Neg logit contributions
 misunder-0.420
Pri-0.396
 differ-0.367
1945-0.366
 attitudes-0.364
Token1
Token ID16
Feature activation+2.95e-01

Pos logit contributions
 SATA+0.310
HAM+0.308
oscope+0.300
 DRAG+0.289
 Turtles+0.280
Neg logit contributions
 misunder-0.286
Pri-0.267
 differ-0.250
 attitudes-0.248
1945-0.248
Token $
Token ID720
Feature activation+2.65e-01

Pos logit contributions
 SATA+0.210
HAM+0.207
oscope+0.202
 DRAG+0.194
 Turtles+0.188
Neg logit contributions
 misunder-0.219
Pri-0.206
 differ-0.193
1945-0.192
 attitudes-0.192
Token r
Token ID374
Feature activation-1.03e+00

Pos logit contributions
 SATA+0.088
HAM+0.087
oscope+0.085
 DRAG+0.082
 Turtles+0.080
Neg logit contributions
 misunder-0.070
Pri-0.065
 differ-0.061
 attitudes-0.060
1945-0.060
Tokenauc
Token ID14272
Feature activation+4.78e+00

Pos logit contributions
 misunder+0.702
Pri+0.668
 differ+0.629
1945+0.620
esa+0.618
Neg logit contributions
 SATA-0.781
HAM-0.774
oscope-0.755
 DRAG-0.732
 Turtles-0.712
Tokenously
Token ID3481
Feature activation-4.57e-02

Pos logit contributions
 SATA+2.353
HAM+2.289
oscope+2.279
 DRAG+2.050
 Turtles+1.989
Neg logit contributions
 misunder-4.661
Pri-4.306
 conflic-4.139
 attitudes-4.110
1945-4.090
Token celebrated
Token ID13943
Feature activation+4.16e-01

Pos logit contributions
 misunder+0.028
Pri+0.026
 differ+0.024
1945+0.024
 attitudes+0.024
Neg logit contributions
 SATA-0.038
HAM-0.038
oscope-0.037
 DRAG-0.036
 Turtles-0.035
Token the
Token ID262
Feature activation+3.85e-02

Pos logit contributions
 SATA+0.350
HAM+0.347
oscope+0.339
 DRAG+0.329
 Turtles+0.320
Neg logit contributions
 misunder-0.252
Pri-0.234
 differ-0.216
1945-0.215
 attitudes-0.214
Token dism
Token ID18099
Feature activation+5.32e+00

Pos logit contributions
 SATA+0.030
HAM+0.030
oscope+0.029
 DRAG+0.028
lishes+0.027
Neg logit contributions
 misunder-0.026
Pri-0.024
 differ-0.022
 attitudes-0.022
1945-0.022
Tokenember
Token ID1491
Feature activation+1.05e+00

Pos logit contributions
 SATA+1.954
oscope+1.883
HAM+1.882
antage+1.675
 DRAG+1.611
Neg logit contributions
 misunder-5.914
Pri-5.453
 attitudes-5.275
 kitchens-5.245
 conflic-5.244
Tokenment
Token ID434
Feature activation-5.19e-01

Pos logit contributions
 SATA+0.892
HAM+0.882
oscope+0.864
 DRAG+0.836
 Turtles+0.816
Neg logit contributions
 misunder-0.646
Pri-0.594
 differ-0.544
1945-0.541
 attitudes-0.538
Token of
Token ID286
Feature activation+9.93e-02

Pos logit contributions
 misunder+0.301
Pri+0.279
 differ+0.255
1945+0.254
 attitudes+0.253
Neg logit contributions
 SATA-0.452
HAM-0.448
oscope-0.438
 DRAG-0.425
 Turtles-0.414
Token four
Token ID1440
Feature activation-1.15e-01

Pos logit contributions
 SATA+0.079
HAM+0.079
oscope+0.077
 DRAG+0.074
 Turtles+0.072
Neg logit contributions
 misunder-0.064
Pri-0.060
 differ-0.056
1945-0.056
 attitudes-0.055
Token rabb
Token ID19368
Feature activation-4.30e-02

Pos logit contributions
 misunder+0.085
Pri+0.080
 differ+0.075
1945+0.074
 attitudes+0.074
Neg logit contributions
 SATA-0.083
HAM-0.082
oscope-0.080
 DRAG-0.077
 Turtles-0.074
Token MIDI
Token ID33439
Feature activation+3.16e+00

Pos logit contributions
 SATA+0.597
HAM+0.591
oscope+0.577
 DRAG+0.556
lishes+0.536
Neg logit contributions
 misunder-0.578
Pri-0.549
 differ-0.511
1945-0.509
 attitudes-0.508
Token controllers
Token ID20624
Feature activation+2.49e+00

Pos logit contributions
 SATA+2.335
HAM+2.311
oscope+2.248
 DRAG+2.165
 Turtles+2.118
Neg logit contributions
 misunder-2.259
Pri-2.094
1945-1.962
 differ-1.951
 attitudes-1.938
Token,
Token ID11
Feature activation+3.84e-01

Pos logit contributions
 SATA+2.013
HAM+1.979
oscope+1.937
 DRAG+1.877
 Turtles+1.833
Neg logit contributions
 misunder-1.601
Pri-1.485
1945-1.368
 attitudes-1.360
esa-1.354
Token specialized
Token ID16976
Feature activation+7.45e-01

Pos logit contributions
 SATA+0.280
HAM+0.278
oscope+0.271
 DRAG+0.261
 Turtles+0.252
Neg logit contributions
 misunder-0.274
Pri-0.258
 differ-0.242
 attitudes-0.241
1945-0.240
Token USB
Token ID8450
Feature activation+1.59e+00

Pos logit contributions
 SATA+0.549
HAM+0.545
oscope+0.530
 DRAG+0.512
 Turtles+0.496
Neg logit contributions
 misunder-0.528
Pri-0.497
 differ-0.466
 attitudes-0.463
1945-0.462
Token peripher
Token ID18375
Feature activation+5.21e+00

Pos logit contributions
 SATA+1.059
HAM+1.045
oscope+1.014
 DRAG+0.976
 Turtles+0.944
Neg logit contributions
 misunder-1.256
Pri-1.187
 differ-1.114
1945-1.113
 attitudes-1.106
Tokenals
Token ID874
Feature activation-3.53e-01

Pos logit contributions
 SATA+3.686
HAM+3.608
oscope+3.570
 DRAG+3.342
 Turtles+3.293
Neg logit contributions
 misunder-3.962
Pri-3.644
 differ-3.419
 attitudes-3.389
1945-3.386
Token,
Token ID11
Feature activation+3.19e-01

Pos logit contributions
 misunder+0.235
Pri+0.221
 differ+0.206
1945+0.204
 attitudes+0.204
Neg logit contributions
 SATA-0.275
HAM-0.274
oscope-0.267
 DRAG-0.258
 Turtles-0.250
Token and
Token ID290
Feature activation+3.17e-01

Pos logit contributions
 SATA+0.240
HAM+0.238
oscope+0.232
 DRAG+0.224
 Turtles+0.217
Neg logit contributions
 misunder-0.222
Pri-0.208
 differ-0.194
 attitudes-0.193
1945-0.193
Token LED
Token ID12365
Feature activation-1.36e+00

Pos logit contributions
 SATA+0.249
HAM+0.247
oscope+0.241
 DRAG+0.233
 Turtles+0.226
Neg logit contributions
 misunder-0.210
Pri-0.196
 differ-0.183
 attitudes-0.181
1945-0.181
Token projects
Token ID4493
Feature activation+1.09e-01

Pos logit contributions
 misunder+0.968
Pri+0.919
 differ+0.863
 attitudes+0.859
 kitchens+0.848
Neg logit contributions
 SATA-0.991
HAM-0.985
oscope-0.951
 DRAG-0.927
lishes-0.900
Token oceans
Token ID23744
Feature activation+2.73e+00

Pos logit contributions
 misunder+0.004
Pri+0.004
 differ+0.004
1945+0.004
 attitudes+0.004
Neg logit contributions
 SATA-0.004
HAM-0.004
oscope-0.004
 DRAG-0.003
 Turtles-0.003
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
 SATA+2.123
HAM+2.103
oscope+2.056
 DRAG+1.977
 Turtles+1.938
Neg logit contributions
 misunder-1.848
Pri-1.703
1945-1.569
 differ-1.567
 attitudes-1.563
Token Most
Token ID4042
Feature activation-1.45e+00

Pos logit contributions
 misunder+0.104
Pri+0.101
1945+0.094
 differ+0.093
 attitudes+0.093
Neg logit contributions
 SATA-0.111
HAM-0.111
oscope-0.108
 DRAG-0.104
antage-0.101
Token of
Token ID286
Feature activation-4.36e-02

Pos logit contributions
 misunder+0.836
Pri+0.775
 differ+0.708
1945+0.705
 attitudes+0.701
Neg logit contributions
 SATA-1.273
HAM-1.261
oscope-1.235
 DRAG-1.198
 Turtles-1.169
Token us
Token ID514
Feature activation-2.98e-01

Pos logit contributions
 misunder+0.032
Pri+0.030
 differ+0.028
1945+0.028
 attitudes+0.028
Neg logit contributions
 SATA-0.031
HAM-0.030
oscope-0.030
 DRAG-0.029
 Turtles-0.028
Token breathe
Token ID18044
Feature activation+5.15e+00

Pos logit contributions
 misunder+0.183
Pri+0.171
 differ+0.158
1945+0.157
 attitudes+0.156
Neg logit contributions
 SATA-0.248
HAM-0.246
oscope-0.241
 DRAG-0.233
 Turtles-0.227
Token a
Token ID257
Feature activation-7.39e-02

Pos logit contributions
 SATA+4.036
HAM+3.887
oscope+3.831
 DRAG+3.744
 Turtles+3.682
Neg logit contributions
 misunder-3.361
Pri-3.174
1945-2.958
esa-2.958
Initialized-2.864
Token sigh
Token ID19680
Feature activation+1.88e+00

Pos logit contributions
 misunder+0.049
Pri+0.046
 differ+0.043
 attitudes+0.042
1945+0.042
Neg logit contributions
 SATA-0.058
HAM-0.058
oscope-0.056
 DRAG-0.054
 Turtles-0.053
Token of
Token ID286
Feature activation+9.75e-02

Pos logit contributions
 SATA+1.580
HAM+1.563
oscope+1.540
 DRAG+1.473
 Turtles+1.437
Neg logit contributions
 misunder-1.153
Pri-1.067
1945-0.973
 differ-0.963
 attitudes-0.961
Token relief
Token ID8259
Feature activation+5.60e-01

Pos logit contributions
 SATA+0.062
HAM+0.061
oscope+0.060
 DRAG+0.057
 Turtles+0.055
Neg logit contributions
 misunder-0.079
Pri-0.075
 differ-0.071
1945-0.070
 attitudes-0.070
Token or
Token ID393
Feature activation-1.65e+00

Pos logit contributions
 SATA+0.470
HAM+0.465
oscope+0.455
 DRAG+0.441
 Turtles+0.429
Neg logit contributions
 misunder-0.343
Pri-0.319
 differ-0.294
1945-0.292
 attitudes-0.291
Token"
Token ID1
Feature activation+6.41e-01

Pos logit contributions
 misunder+0.210
Pri+0.195
 differ+0.182
 attitudes+0.180
1945+0.180
Neg logit contributions
 SATA-0.284
HAM-0.282
oscope-0.275
 DRAG-0.267
 Turtles-0.260
Token conditions
Token ID3403
Feature activation-5.85e-01

Pos logit contributions
 SATA+0.524
HAM+0.519
oscope+0.507
 DRAG+0.492
 Turtles+0.478
Neg logit contributions
 misunder-0.404
Pri-0.378
 differ-0.350
1945-0.348
 attitudes-0.347
Token added
Token ID2087
Feature activation+2.11e+00

Pos logit contributions
 misunder+0.376
Pri+0.353
 differ+0.331
 attitudes+0.327
1945+0.327
Neg logit contributions
 SATA-0.469
HAM-0.465
oscope-0.453
 DRAG-0.441
 Turtles-0.429
Token at
Token ID379
Feature activation+3.04e-01

Pos logit contributions
 SATA+1.614
HAM+1.599
oscope+1.561
 DRAG+1.497
 Turtles+1.452
Neg logit contributions
 misunder-1.456
Pri-1.353
1945-1.249
 differ-1.247
esa-1.246
Token his
Token ID465
Feature activation+3.51e-02

Pos logit contributions
 SATA+0.265
HAM+0.264
oscope+0.258
 DRAG+0.250
 Turtles+0.245
Neg logit contributions
 misunder-0.172
Pri-0.161
 differ-0.148
 attitudes-0.147
1945-0.146
Token arra
Token ID44723
Feature activation+5.10e+00

Pos logit contributions
 SATA+0.027
HAM+0.027
oscope+0.026
 DRAG+0.026
 Turtles+0.025
Neg logit contributions
 misunder-0.023
Pri-0.022
 attitudes-0.021
 differ-0.021
1945-0.020
Tokenignment
Token ID16747
Feature activation+7.39e-01

Pos logit contributions
 SATA+2.955
HAM+2.887
oscope+2.883
 DRAG+2.618
antage+2.603
Neg logit contributions
 misunder-4.492
Pri-4.159
 differ-3.954
 attitudes-3.934
1945-3.928
Token in
Token ID287
Feature activation-6.38e-02

Pos logit contributions
 SATA+0.557
HAM+0.551
oscope+0.537
 DRAG+0.518
 Turtles+0.502
Neg logit contributions
 misunder-0.517
Pri-0.483
 differ-0.449
1945-0.448
 attitudes-0.445
Token December
Token ID3426
Feature activation-7.42e-01

Pos logit contributions
 misunder+0.047
Pri+0.045
 differ+0.042
 attitudes+0.042
1945+0.042
Neg logit contributions
 SATA-0.045
HAM-0.045
oscope-0.043
 DRAG-0.042
 Turtles-0.040
Token 2016
Token ID1584
Feature activation+1.02e+00

Pos logit contributions
 misunder+0.492
Pri+0.461
 differ+0.428
1945+0.426
 attitudes+0.425
Neg logit contributions
 SATA-0.584
HAM-0.578
oscope-0.564
 DRAG-0.546
 Turtles-0.530
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
 SATA+0.744
HAM+0.737
oscope+0.718
 DRAG+0.689
 Turtles+0.667
Neg logit contributions
 misunder-0.740
Pri-0.690
1945-0.643
 differ-0.642
 attitudes-0.638
Token with
Token ID351
Feature activation+4.98e-01

Pos logit contributions
 SATA+0.549
HAM+0.543
oscope+0.530
 DRAG+0.512
 Turtles+0.499
Neg logit contributions
 misunder-0.457
Pri-0.426
 differ-0.395
1945-0.394
 attitudes-0.391
Token J
Token ID449
Feature activation-7.29e-01

Pos logit contributions
 SATA+0.390
HAM+0.386
oscope+0.377
 DRAG+0.365
 Turtles+0.355
Neg logit contributions
 misunder-0.332
Pri-0.311
 differ-0.288
1945-0.287
 attitudes-0.285
Tokeneth
Token ID2788
Feature activation+1.14e+00

Pos logit contributions
 misunder+0.508
Pri+0.481
 differ+0.446
1945+0.444
 attitudes+0.444
Neg logit contributions
 SATA-0.547
HAM-0.540
oscope-0.527
 DRAG-0.510
 Turtles-0.494
Tokenro
Token ID305
Feature activation+2.70e-01

Pos logit contributions
 SATA+1.011
HAM+0.999
oscope+0.986
 DRAG+0.949
 Turtles+0.931
Neg logit contributions
 misunder-0.643
Pri-0.591
 differ-0.541
1945-0.537
 attitudes-0.532
Token P
Token ID350
Feature activation-7.47e-01

Pos logit contributions
 SATA+0.206
HAM+0.204
oscope+0.198
 DRAG+0.191
 Turtles+0.187
Neg logit contributions
 misunder-0.186
Pri-0.174
 differ-0.162
1945-0.161
 attitudes-0.160
Tokenugh
Token ID6724
Feature activation+5.01e+00

Pos logit contributions
 misunder+0.538
Pri+0.510
 differ+0.477
1945+0.475
 attitudes+0.473
Neg logit contributions
 SATA-0.542
HAM-0.534
oscope-0.522
 DRAG-0.504
 Turtles-0.489
Token [
Token ID685
Feature activation+6.76e-01

Pos logit contributions
 SATA+3.651
HAM+3.616
oscope+3.532
 DRAG+3.351
 Turtles+3.297
Neg logit contributions
 misunder-3.588
Pri-3.276
1945-3.084
 differ-3.062
Initialized-3.027
Tokenwho
Token ID8727
Feature activation+1.01e-01

Pos logit contributions
HAM+0.551
 SATA+0.549
oscope+0.536
 DRAG+0.514
 Turtles+0.501
Neg logit contributions
 misunder-0.433
Pri-0.391
 differ-0.372
 attitudes-0.367
 kitchens-0.364
Token died
Token ID3724
Feature activation+1.82e-02

Pos logit contributions
 SATA+0.083
HAM+0.082
oscope+0.080
 DRAG+0.078
 Turtles+0.076
Neg logit contributions
 misunder-0.064
Pri-0.059
 differ-0.055
1945-0.055
 attitudes-0.054
Token last
Token ID938
Feature activation+4.37e-01

Pos logit contributions
 SATA+0.015
HAM+0.015
oscope+0.014
 DRAG+0.014
 Turtles+0.013
Neg logit contributions
 misunder-0.012
Pri-0.011
 differ-0.010
1945-0.010
 attitudes-0.010
Token week
Token ID1285
Feature activation+9.48e-02

Pos logit contributions
 SATA+0.350
HAM+0.346
oscope+0.338
 DRAG+0.327
 Turtles+0.318
Neg logit contributions
 misunder-0.284
Pri-0.265
 differ-0.245
1945-0.244
 attitudes-0.243
Token his
Token ID465
Feature activation-1.18e-01

Pos logit contributions
 misunder+0.736
Pri+0.683
 differ+0.627
1945+0.623
 attitudes+0.622
Neg logit contributions
 SATA-1.086
HAM-1.077
oscope-1.053
 DRAG-1.022
 Turtles-0.995
Token thigh
Token ID19341
Feature activation+1.47e+00

Pos logit contributions
 misunder+0.083
Pri+0.078
 differ+0.074
 attitudes+0.073
1945+0.073
Neg logit contributions
 SATA-0.087
HAM-0.087
oscope-0.084
 DRAG-0.081
 Turtles-0.078
Token and
Token ID290
Feature activation+3.90e-01

Pos logit contributions
 SATA+1.143
HAM+1.129
oscope+1.104
 DRAG+1.065
 Turtles+1.037
Neg logit contributions
 misunder-0.988
Pri-0.921
 differ-0.855
1945-0.853
 attitudes-0.847
Token severed
Token ID32160
Feature activation+2.13e+00

Pos logit contributions
 SATA+0.329
HAM+0.326
oscope+0.319
 DRAG+0.309
 Turtles+0.301
Neg logit contributions
 misunder-0.236
Pri-0.220
 differ-0.203
1945-0.201
 attitudes-0.201
Token an
Token ID281
Feature activation+4.67e-01

Pos logit contributions
 SATA+1.800
HAM+1.774
oscope+1.737
 DRAG+1.683
 Turtles+1.641
Neg logit contributions
 misunder-1.296
Pri-1.204
1945-1.104
 differ-1.104
esa-1.092
Token artery
Token ID37646
Feature activation+5.01e+00

Pos logit contributions
 SATA+0.235
HAM+0.235
oscope+0.225
 DRAG+0.215
 Turtles+0.203
Neg logit contributions
 misunder-0.438
Pri-0.420
 differ-0.400
 attitudes-0.398
esa-0.397
Token.
Token ID13
Feature activation+5.46e-02

Pos logit contributions
 SATA+3.684
HAM+3.554
oscope+3.521
 DRAG+3.321
 Turtles+3.263
Neg logit contributions
 misunder-3.602
Pri-3.349
1945-3.128
esa-3.112
 differ-3.089
Token Despite
Token ID7945
Feature activation+8.00e-01

Pos logit contributions
 SATA+0.042
HAM+0.041
oscope+0.040
 DRAG+0.039
antage+0.038
Neg logit contributions
 misunder-0.037
Pri-0.035
1945-0.033
 differ-0.032
 attitudes-0.032
Token his
Token ID465
Feature activation-2.29e-01

Pos logit contributions
 SATA+0.667
HAM+0.661
oscope+0.646
 DRAG+0.626
 Turtles+0.610
Neg logit contributions
 misunder-0.493
Pri-0.459
 differ-0.423
1945-0.422
 attitudes-0.419
Token friends
Token ID2460
Feature activation+2.47e-01

Pos logit contributions
 misunder+0.135
Pri+0.125
 differ+0.117
 attitudes+0.117
1945+0.114
Neg logit contributions
 SATA-0.195
HAM-0.195
oscope-0.190
 DRAG-0.184
 Turtles-0.178
Token'
Token ID6
Feature activation+9.40e-01

Pos logit contributions
 SATA+0.190
HAM+0.188
oscope+0.183
 DRAG+0.177
 Turtles+0.172
Neg logit contributions
 misunder-0.167
Pri-0.157
 differ-0.146
1945-0.145
 attitudes-0.145
Token)
Token ID8
Feature activation+3.36e-02

Pos logit contributions
 SATA+0.880
HAM+0.870
oscope+0.852
 DRAG+0.823
 Turtles+0.791
Neg logit contributions
 misunder-0.946
Pri-0.920
1945-0.863
 differ-0.859
 attitudes-0.850
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
 SATA+0.026
HAM+0.026
oscope+0.025
 DRAG+0.024
 Turtles+0.024
Neg logit contributions
 misunder-0.023
Pri-0.021
 differ-0.020
1945-0.020
 attitudes-0.020
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
Pri+0.073
 misunder+0.071
1945+0.068
 differ+0.066
Zero+0.066
Neg logit contributions
 SATA-0.099
HAM-0.098
oscope-0.096
 DRAG-0.094
antage-0.091
TokenMaster
Token ID18254
Feature activation-1.88e+00

Pos logit contributions
 misunder+0.127
Pri+0.120
 differ+0.112
1945+0.111
 attitudes+0.111
Neg logit contributions
 SATA-0.121
HAM-0.120
oscope-0.117
 DRAG-0.112
 Turtles-0.109
Token the
Token ID262
Feature activation+1.83e-01

Pos logit contributions
 misunder+1.189
Pri+1.112
 differ+1.030
1945+1.024
 attitudes+1.023
Neg logit contributions
 SATA-1.527
HAM-1.514
oscope-1.482
 DRAG-1.435
 Turtles-1.391
Token Exc
Token ID25268
Feature activation+5.00e+00

Pos logit contributions
 SATA+0.135
HAM+0.134
oscope+0.131
 DRAG+0.126
 Turtles+0.122
Neg logit contributions
 misunder-0.129
Pri-0.121
 differ-0.113
 attitudes-0.113
1945-0.113
Tokenursion
Token ID24197
Feature activation-8.97e-01

Pos logit contributions
 SATA+3.600
oscope+3.561
HAM+3.509
 DRAG+3.263
antage+3.220
Neg logit contributions
 misunder-3.659
Pri-3.330
1945-3.174
 differ-3.156
 attitudes-3.146
Token Fun
Token ID11138
Feature activation+8.25e-01

Pos logit contributions
 misunder+0.681
Pri+0.645
 differ+0.605
1945+0.602
 attitudes+0.602
Neg logit contributions
 SATA-0.616
HAM-0.610
oscope-0.592
 DRAG-0.570
 Turtles-0.551
Tokennel
Token ID4954
Feature activation+6.33e-01

Pos logit contributions
 SATA+0.599
HAM+0.591
oscope+0.580
 DRAG+0.555
 Turtles+0.540
Neg logit contributions
 misunder-0.603
Pri-0.559
 differ-0.526
1945-0.524
 attitudes-0.522
Token (
Token ID357
Feature activation-3.50e-01

Pos logit contributions
 SATA+0.434
HAM+0.429
oscope+0.419
 DRAG+0.402
 Turtles+0.388
Neg logit contributions
 misunder-0.486
Pri-0.457
 differ-0.428
1945-0.428
 attitudes-0.426
Token1
Token ID16
Feature activation+8.66e-01

Pos logit contributions
 misunder+0.235
Pri+0.221
 differ+0.205
1945+0.205
 attitudes+0.203
Neg logit contributions
 SATA-0.272
HAM-0.269
oscope-0.263
 DRAG-0.254
 Turtles-0.247
Token best
Token ID1266
Feature activation-1.41e+00

Pos logit contributions
 SATA+0.375
HAM+0.370
oscope+0.360
 DRAG+0.350
 Turtles+0.340
Neg logit contributions
 misunder-0.333
Pri-0.311
 differ-0.289
1945-0.288
 attitudes-0.285
Token-
Token ID12
Feature activation-1.68e-01

Pos logit contributions
 misunder+0.995
Pri+0.936
 differ+0.874
 attitudes+0.871
1945+0.868
Neg logit contributions
 SATA-1.048
HAM-1.041
oscope-1.015
 DRAG-0.978
 Turtles-0.949
Tokenknown
Token ID4002
Feature activation-2.81e-01

Pos logit contributions
 misunder+0.101
Pri+0.092
 differ+0.085
 attitudes+0.084
1945+0.084
Neg logit contributions
 SATA-0.144
HAM-0.143
oscope-0.139
 DRAG-0.134
 Turtles-0.131
Token mail
Token ID6920
Feature activation+1.48e+00

Pos logit contributions
 misunder+0.178
Pri+0.166
 differ+0.154
1945+0.153
 attitudes+0.152
Neg logit contributions
 SATA-0.229
HAM-0.227
oscope-0.222
 DRAG-0.215
 Turtles-0.209
Token order
Token ID1502
Feature activation+1.29e+00

Pos logit contributions
 SATA+1.176
HAM+1.166
oscope+1.138
 DRAG+1.099
 Turtles+1.069
Neg logit contributions
 misunder-0.976
Pri-0.910
 differ-0.842
1945-0.838
 attitudes-0.831
Token catalog
Token ID18388
Feature activation+4.99e+00

Pos logit contributions
 SATA+1.022
HAM+1.007
oscope+0.982
 DRAG+0.951
 Turtles+0.927
Neg logit contributions
 misunder-0.857
Pri-0.803
1945-0.740
 differ-0.739
esa-0.732
Tokenues
Token ID947
Feature activation+7.57e-01

Pos logit contributions
 SATA+3.524
HAM+3.443
oscope+3.359
 DRAG+3.260
 Turtles+3.177
Neg logit contributions
 misunder-3.678
Pri-3.463
1945-3.204
Initialized-3.190
Zero-3.173
Token apologised
Token ID46397
Feature activation+1.24e+00

Pos logit contributions
 SATA+0.590
HAM+0.583
oscope+0.569
 DRAG+0.550
 Turtles+0.535
Neg logit contributions
 misunder-0.509
Pri-0.476
 differ-0.441
1945-0.440
 attitudes-0.438
Token today
Token ID1909
Feature activation+2.04e+00

Pos logit contributions
 SATA+0.912
HAM+0.902
oscope+0.870
 DRAG+0.839
 Turtles+0.816
Neg logit contributions
 misunder-0.894
Pri-0.832
1945-0.776
 differ-0.770
esa-0.769
Token after
Token ID706
Feature activation+1.21e+00

Pos logit contributions
 SATA+1.453
HAM+1.431
oscope+1.393
 DRAG+1.341
 Turtles+1.295
Neg logit contributions
 misunder-1.524
Pri-1.415
1945-1.315
 differ-1.313
esa-1.306
Token web
Token ID3992
Feature activation+5.25e-01

Pos logit contributions
 SATA+0.959
HAM+0.948
oscope+0.924
 DRAG+0.896
 Turtles+0.872
Neg logit contributions
 misunder-0.798
Pri-0.748
 differ-0.691
1945-0.691
esa-0.684
Token up
Token ID510
Feature activation+2.90e-01

Pos logit contributions
 misunder+0.819
Pri+0.766
 differ+0.713
1945+0.711
 attitudes+0.705
Neg logit contributions
 SATA-0.901
HAM-0.891
oscope-0.869
 DRAG-0.839
 Turtles-0.815
Token the
Token ID262
Feature activation-1.10e-01

Pos logit contributions
 SATA+0.239
HAM+0.236
oscope+0.230
 DRAG+0.223
 Turtles+0.219
Neg logit contributions
 misunder-0.183
Pri-0.168
1945-0.156
 differ-0.155
esa-0.154
Token air
Token ID1633
Feature activation+1.10e+00

Pos logit contributions
 misunder+0.070
Pri+0.066
 differ+0.061
1945+0.060
 attitudes+0.060
Neg logit contributions
 SATA-0.089
HAM-0.088
oscope-0.086
 DRAG-0.083
 Turtles-0.081
Token their
Token ID511
Feature activation+8.33e-01

Pos logit contributions
 SATA+0.859
HAM+0.849
oscope+0.828
 DRAG+0.801
 Turtles+0.781
Neg logit contributions
 misunder-0.739
Pri-0.685
1945-0.633
 differ-0.632
 attitudes-0.628
Token kids
Token ID3988
Feature activation-1.28e+00

Pos logit contributions
 SATA+0.639
HAM+0.631
oscope+0.616
 DRAG+0.596
 Turtles+0.580
Neg logit contributions
 misunder-0.570
Pri-0.533
1945-0.495
 differ-0.493
 attitudes-0.488
Token breathe
Token ID18044
Feature activation+4.96e+00

Pos logit contributions
 misunder+1.042
Pri+1.001
 differ+0.964
 kitchens+0.944
 attitudes+0.943
Neg logit contributions
 SATA-0.800
HAM-0.797
oscope-0.759
 DRAG-0.739
lishes-0.706
Token all
Token ID477
Feature activation-4.62e-01

Pos logit contributions
 SATA+3.612
HAM+3.584
oscope+3.487
 DRAG+3.399
 Turtles+3.296
Neg logit contributions
 misunder-3.526
Pri-3.243
1945-3.032
esa-3.006
Initialized-2.994
Token deserve
Token ID10925
Feature activation-1.63e+00

Pos logit contributions
 misunder+0.288
Pri+0.269
 differ+0.250
1945+0.247
 attitudes+0.247
Neg logit contributions
 SATA-0.382
HAM-0.378
oscope-0.369
 DRAG-0.358
 Turtles-0.348
Token the
Token ID262
Feature activation+2.54e-01

Pos logit contributions
 misunder+1.011
Pri+0.952
 differ+0.883
 attitudes+0.882
 kitchens+0.871
Neg logit contributions
 SATA-1.336
HAM-1.328
oscope-1.300
 DRAG-1.260
 Turtles-1.219
Token facts
Token ID6419
Feature activation+5.32e-01

Pos logit contributions
 SATA+0.205
HAM+0.203
oscope+0.198
 DRAG+0.192
 Turtles+0.187
Neg logit contributions
 misunder-0.164
Pri-0.153
 differ-0.142
1945-0.141
 attitudes-0.141
Token behind
Token ID2157
Feature activation-8.09e-02

Pos logit contributions
 SATA+0.413
HAM+0.408
oscope+0.399
 DRAG+0.385
 Turtles+0.374
Neg logit contributions
 misunder-0.359
Pri-0.335
 differ-0.311
1945-0.311
 attitudes-0.309
Token MC
Token ID13122
Feature activation+2.84e+00

Pos logit contributions
 misunder+0.497
Pri+0.472
 differ+0.445
 attitudes+0.443
1945+0.439
Neg logit contributions
HAM-0.461
 SATA-0.460
oscope-0.447
 DRAG-0.432
lishes-0.415
TokenUs
Token ID5842
Feature activation+6.57e-01

Pos logit contributions
 SATA+1.969
HAM+1.939
oscope+1.874
 DRAG+1.807
 Turtles+1.755
Neg logit contributions
 misunder-2.177
Pri-2.035
 differ-1.907
1945-1.902
 attitudes-1.896
Token with
Token ID351
Feature activation+6.18e-01

Pos logit contributions
 SATA+0.504
HAM+0.498
oscope+0.485
 DRAG+0.469
 Turtles+0.456
Neg logit contributions
 misunder-0.452
Pri-0.422
 differ-0.392
1945-0.392
 attitudes-0.389
Token fairly
Token ID6547
Feature activation+1.94e+00

Pos logit contributions
 SATA+0.425
HAM+0.421
oscope+0.409
 DRAG+0.394
 Turtles+0.380
Neg logit contributions
 misunder-0.470
Pri-0.444
 differ-0.417
1945-0.415
 attitudes-0.414
Token simple
Token ID2829
Feature activation+1.17e+00

Pos logit contributions
 SATA+1.615
HAM+1.596
oscope+1.559
 DRAG+1.510
 Turtles+1.473
Neg logit contributions
 misunder-1.200
Pri-1.116
 differ-1.026
1945-1.025
 attitudes-1.018
Token peripher
Token ID18375
Feature activation+4.88e+00

Pos logit contributions
 SATA+0.862
HAM+0.855
oscope+0.833
 DRAG+0.804
 Turtles+0.778
Neg logit contributions
 misunder-0.830
Pri-0.781
 differ-0.731
 attitudes-0.726
1945-0.726
Tokenals
Token ID874
Feature activation-2.98e-01

Pos logit contributions
 SATA+3.458
HAM+3.381
oscope+3.346
 DRAG+3.128
 Turtles+3.072
Neg logit contributions
 misunder-3.714
Pri-3.417
 differ-3.196
1945-3.168
 attitudes-3.162
Token,
Token ID11
Feature activation+8.61e-02

Pos logit contributions
 misunder+0.201
Pri+0.189
 differ+0.176
1945+0.175
 attitudes+0.175
Neg logit contributions
 SATA-0.230
HAM-0.228
oscope-0.222
 DRAG-0.215
 Turtles-0.208
Token I
Token ID314
Feature activation-5.67e-03

Pos logit contributions
 SATA+0.060
HAM+0.059
oscope+0.058
 DRAG+0.056
 Turtles+0.054
Neg logit contributions
 misunder-0.065
Pri-0.061
 differ-0.057
1945-0.057
 attitudes-0.057
Token
Token ID447
Feature activation-6.76e-02

Pos logit contributions
 misunder+0.003
Pri+0.003
1945+0.003
 attitudes+0.003
 differ+0.003
Neg logit contributions
 SATA-0.005
HAM-0.005
oscope-0.005
 DRAG-0.005
 Turtles-0.005
Token
Token ID247
Feature activation+2.86e-01

Pos logit contributions
 misunder+0.040
Pri+0.037
 differ+0.034
1945+0.034
 attitudes+0.034
Neg logit contributions
 SATA-0.058
HAM-0.057
oscope-0.056
 DRAG-0.054
 Turtles-0.053
Token kind
Token ID1611
Feature activation+9.36e-01

Pos logit contributions
 misunder+0.020
Pri+0.019
 differ+0.017
1945+0.017
 attitudes+0.017
Neg logit contributions
 SATA-0.023
HAM-0.023
oscope-0.022
 DRAG-0.021
 Turtles-0.021
Token of
Token ID286
Feature activation-2.91e-02

Pos logit contributions
 SATA+0.832
HAM+0.825
oscope+0.807
 DRAG+0.783
 Turtles+0.764
Neg logit contributions
 misunder-0.528
Pri-0.485
 differ-0.443
1945-0.440
 attitudes-0.438
Token programming
Token ID8300
Feature activation-7.35e-01

Pos logit contributions
 misunder+0.018
Pri+0.017
 differ+0.016
 attitudes+0.016
1945+0.016
Neg logit contributions
 SATA-0.024
HAM-0.023
oscope-0.023
 DRAG-0.022
 Turtles-0.022
Token is
Token ID318
Feature activation-1.70e-01

Pos logit contributions
 misunder+0.483
Pri+0.451
 differ+0.424
 attitudes+0.420
1945+0.415
Neg logit contributions
 SATA-0.582
HAM-0.580
oscope-0.565
 DRAG-0.547
 Turtles-0.529
Token a
Token ID257
Feature activation-2.16e-02

Pos logit contributions
 misunder+0.099
Pri+0.092
 differ+0.085
 attitudes+0.084
1945+0.083
Neg logit contributions
 SATA-0.147
HAM-0.146
oscope-0.143
 DRAG-0.139
 Turtles-0.135
Token breath
Token ID8033
Feature activation+4.86e+00

Pos logit contributions
 misunder+0.014
Pri+0.013
 differ+0.013
1945+0.012
 attitudes+0.012
Neg logit contributions
 SATA-0.017
HAM-0.017
oscope-0.016
 DRAG-0.016
 Turtles-0.015
Token of
Token ID286
Feature activation+2.78e-01

Pos logit contributions
 SATA+3.965
HAM+3.875
oscope+3.872
 DRAG+3.685
hound+3.644
Neg logit contributions
 misunder-2.987
Pri-2.802
1945-2.586
 breached-2.557
 kitchens-2.539
Token fresh
Token ID4713
Feature activation+1.80e-01

Pos logit contributions
 SATA+0.209
HAM+0.207
oscope+0.201
 DRAG+0.195
 Turtles+0.189
Neg logit contributions
 misunder-0.194
Pri-0.182
 differ-0.170
 attitudes-0.169
1945-0.169
Token air
Token ID1633
Feature activation+1.35e+00

Pos logit contributions
 SATA+0.105
HAM+0.104
oscope+0.100
 DRAG+0.097
 Turtles+0.092
Neg logit contributions
 misunder-0.155
Pri-0.147
 differ-0.140
 attitudes-0.140
1945-0.139
Token in
Token ID287
Feature activation+2.08e-01

Pos logit contributions
 SATA+1.015
HAM+1.005
oscope+0.981
 DRAG+0.946
 Turtles+0.919
Neg logit contributions
 misunder-0.942
Pri-0.881
 differ-0.820
1945-0.817
 attitudes-0.814
Token a
Token ID257
Feature activation+7.11e-03

Pos logit contributions
 SATA+0.174
HAM+0.173
oscope+0.169
 DRAG+0.164
 Turtles+0.159
Neg logit contributions
 misunder-0.127
Pri-0.119
 differ-0.109
1945-0.109
 attitudes-0.108
Token the
Token ID262
Feature activation+6.57e-02

Pos logit contributions
 SATA+0.432
HAM+0.428
oscope+0.418
 DRAG+0.404
 Turtles+0.392
Neg logit contributions
 misunder-0.385
Pri-0.361
 differ-0.336
1945-0.335
 attitudes-0.333
Token social
Token ID1919
Feature activation+2.66e-01

Pos logit contributions
 SATA+0.051
HAM+0.051
oscope+0.049
 DRAG+0.048
 Turtles+0.046
Neg logit contributions
 misunder-0.044
Pri-0.042
 differ-0.039
 attitudes-0.038
1945-0.038
Token media
Token ID2056
Feature activation+7.42e-01

Pos logit contributions
 SATA+0.202
HAM+0.200
oscope+0.195
 DRAG+0.189
 Turtles+0.183
Neg logit contributions
 misunder-0.184
Pri-0.172
 differ-0.161
1945-0.160
 attitudes-0.159
Token world
Token ID995
Feature activation-1.00e-01

Pos logit contributions
 SATA+0.577
HAM+0.571
oscope+0.557
 DRAG+0.539
 Turtles+0.524
Neg logit contributions
 misunder-0.499
Pri-0.467
 differ-0.434
1945-0.432
 attitudes-0.430
Token!!!
Token ID10185
Feature activation-5.15e-01

Pos logit contributions
 misunder+0.065
Pri+0.060
 differ+0.056
 attitudes+0.056
1945+0.056
Neg logit contributions
 SATA-0.080
HAM-0.080
oscope-0.078
 DRAG-0.075
 Turtles-0.073
Token Exc
Token ID25268
Feature activation+4.85e+00

Pos logit contributions
 misunder+0.346
Pri+0.325
 differ+0.301
1945+0.300
 attitudes+0.299
Neg logit contributions
 SATA-0.400
HAM-0.395
oscope-0.386
 DRAG-0.373
 Turtles-0.362
Tokeniting
Token ID1780
Feature activation-1.20e+00

Pos logit contributions
 SATA+4.763
HAM+4.709
oscope+4.682
 DRAG+4.520
antage+4.376
Neg logit contributions
 misunder-2.184
Pri-1.953
1945-1.752
 differ-1.737
 attitudes-1.721
Token
Token ID50169
Feature activation-3.25e+00

Pos logit contributions
 misunder+0.795
Pri+0.747
 differ+0.695
 attitudes+0.694
1945+0.687
Neg logit contributions
 SATA-0.945
HAM-0.934
oscope-0.912
 DRAG-0.881
 Turtles-0.857
Token
Token ID235
Feature activation+6.35e-01

Pos logit contributions
 misunder+2.276
Pri+2.156
 differ+2.008
 attitudes+2.003
Zero+1.994
Neg logit contributions
 SATA-2.404
HAM-2.371
oscope-2.301
 DRAG-2.241
 Turtles-2.174
Token
Token ID8582
Feature activation-1.98e+00

Pos logit contributions
 SATA+0.579
HAM+0.575
oscope+0.563
 DRAG+0.547
 Turtles+0.534
Neg logit contributions
 misunder-0.341
Pri-0.314
 differ-0.286
1945-0.284
 attitudes-0.283
Token
Token ID229
Feature activation+3.66e-01

Pos logit contributions
 misunder+1.425
Pri+1.349
 differ+1.262
1945+1.254
 attitudes+1.250
Neg logit contributions
 SATA-1.439
HAM-1.420
oscope-1.380
 DRAG-1.335
 Turtles-1.297
Token when
Token ID618
Feature activation-2.22e-01

Pos logit contributions
 SATA+0.451
HAM+0.446
oscope+0.435
 DRAG+0.421
 Turtles+0.409
Neg logit contributions
 misunder-0.375
Pri-0.351
 differ-0.325
1945-0.324
 attitudes-0.323
Token you
Token ID345
Feature activation-2.01e-01

Pos logit contributions
 misunder+0.139
Pri+0.130
 differ+0.121
 attitudes+0.120
1945+0.119
Neg logit contributions
 SATA-0.182
HAM-0.182
oscope-0.178
 DRAG-0.171
 Turtles-0.166
Token turn
Token ID1210
Feature activation-1.44e+00

Pos logit contributions
 misunder+0.137
Pri+0.129
 differ+0.121
 attitudes+0.119
1945+0.119
Neg logit contributions
 SATA-0.154
HAM-0.153
oscope-0.149
 DRAG-0.143
lishes-0.139
Token the
Token ID262
Feature activation+3.88e-02

Pos logit contributions
 misunder+0.887
Pri+0.827
 attitudes+0.768
 differ+0.765
 kitchens+0.758
Neg logit contributions
 SATA-1.193
HAM-1.189
oscope-1.158
 DRAG-1.122
 Turtles-1.085
Token f
Token ID277
Feature activation-9.72e-01

Pos logit contributions
HAM+0.030
 SATA+0.029
oscope+0.029
 DRAG+0.028
lishes+0.027
Neg logit contributions
 misunder-0.027
Pri-0.025
 attitudes-0.023
 differ-0.023
 kitchens-0.023
Tokenauc
Token ID14272
Feature activation+4.85e+00

Pos logit contributions
 misunder+0.951
Pri+0.929
 differ+0.882
esa+0.880
1945+0.874
Neg logit contributions
 SATA-0.451
HAM-0.443
oscope-0.409
 DRAG-0.404
lishes-0.372
Tokenet
Token ID316
Feature activation+1.82e+00

Pos logit contributions
 SATA+2.033
HAM+1.995
oscope+1.960
 Turtles+1.731
 DRAG+1.723
Neg logit contributions
 misunder-5.015
Pri-4.740
1945-4.541
 conflic-4.477
Initialized-4.467
Token on
Token ID319
Feature activation+9.42e-01

Pos logit contributions
 SATA+1.436
HAM+1.420
oscope+1.388
 DRAG+1.341
 Turtles+1.303
Neg logit contributions
 misunder-1.208
Pri-1.129
 differ-1.046
1945-1.045
 attitudes-1.037
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
 SATA+0.741
HAM+0.733
oscope+0.716
 DRAG+0.692
 Turtles+0.673
Neg logit contributions
 misunder-0.625
Pri-0.585
 differ-0.543
1945-0.541
 attitudes-0.539
Token the
Token ID262
Feature activation+9.18e-02

Pos logit contributions
 SATA+0.236
HAM+0.234
oscope+0.228
 DRAG+0.221
 Turtles+0.215
Neg logit contributions
 misunder-0.174
Pri-0.163
 differ-0.151
 attitudes-0.150
1945-0.149
Token water
Token ID1660
Feature activation-7.92e-01

Pos logit contributions
 SATA+0.071
HAM+0.071
oscope+0.069
 DRAG+0.067
lishes+0.065
Neg logit contributions
 misunder-0.061
Pri-0.058
 differ-0.054
 attitudes-0.054
 kitchens-0.053
Token that
Token ID326
Feature activation-4.50e-02

Pos logit contributions
 misunder+0.292
Pri+0.275
 differ+0.257
 attitudes+0.254
1945+0.254
Neg logit contributions
 SATA-0.359
HAM-0.357
oscope-0.348
 DRAG-0.337
 Turtles-0.328
Token way
Token ID835
Feature activation+3.44e-01

Pos logit contributions
 misunder+0.028
Pri+0.026
 differ+0.024
1945+0.024
 attitudes+0.024
Neg logit contributions
 SATA-0.037
HAM-0.037
oscope-0.036
 DRAG-0.035
 Turtles-0.034
Token.
Token ID13
Feature activation+7.47e-01

Pos logit contributions
 SATA+0.259
HAM+0.256
oscope+0.250
 DRAG+0.241
 Turtles+0.234
Neg logit contributions
 misunder-0.241
Pri-0.226
 differ-0.210
1945-0.210
 attitudes-0.209
Token The
Token ID383
Feature activation+3.37e-01

Pos logit contributions
 SATA+0.561
HAM+0.559
oscope+0.548
 DRAG+0.526
 Turtles+0.508
Neg logit contributions
 misunder-0.513
Pri-0.487
 differ-0.455
1945-0.451
 attitudes-0.450
Token programming
Token ID8300
Feature activation-4.99e-01

Pos logit contributions
 SATA+0.238
HAM+0.236
oscope+0.230
 DRAG+0.221
 Turtles+0.214
Neg logit contributions
 misunder-0.250
Pri-0.236
 differ-0.222
 attitudes-0.221
1945-0.220
Token abstraction
Token ID34651
Feature activation+4.82e+00

Pos logit contributions
 misunder+0.328
Pri+0.307
 differ+0.287
 attitudes+0.285
1945+0.284
Neg logit contributions
 SATA-0.394
HAM-0.391
oscope-0.382
 DRAG-0.370
 Turtles-0.359
Token commonly
Token ID8811
Feature activation+1.31e+00

Pos logit contributions
 SATA+3.778
HAM+3.705
oscope+3.626
 DRAG+3.473
 Turtles+3.428
Neg logit contributions
 misunder-3.189
Pri-2.971
1945-2.725
esa-2.714
 kitchens-2.686
Token used
Token ID973
Feature activation+8.12e-01

Pos logit contributions
 SATA+1.163
HAM+1.152
oscope+1.127
 DRAG+1.095
 Turtles+1.068
Neg logit contributions
 misunder-0.735
Pri-0.680
 differ-0.618
1945-0.618
 attitudes-0.614
Token for
Token ID329
Feature activation+5.55e-02

Pos logit contributions
 SATA+0.620
HAM+0.614
oscope+0.598
 DRAG+0.578
 Turtles+0.562
Neg logit contributions
 misunder-0.557
Pri-0.522
 differ-0.485
1945-0.484
 attitudes-0.482
Token such
Token ID884
Feature activation-2.17e+00

Pos logit contributions
 SATA+0.045
HAM+0.045
oscope+0.044
 DRAG+0.043
 Turtles+0.041
Neg logit contributions
 misunder-0.035
Pri-0.032
 differ-0.030
 attitudes-0.030
1945-0.030
Token purposes
Token ID4959
Feature activation+8.87e-01

Pos logit contributions
 misunder+1.340
Pri+1.232
 differ+1.167
 attitudes+1.164
1945+1.129
Neg logit contributions
 SATA-1.794
HAM-1.788
oscope-1.767
 DRAG-1.693
antage-1.642
Token on
Token ID319
Feature activation+7.56e-01

Pos logit contributions
 misunder+0.237
Pri+0.223
 differ+0.209
 attitudes+0.207
1945+0.206
Neg logit contributions
 SATA-0.261
HAM-0.259
oscope-0.253
 DRAG-0.245
 Turtles-0.238
Token,
Token ID11
Feature activation+2.52e-03

Pos logit contributions
 SATA+0.602
HAM+0.595
oscope+0.582
 DRAG+0.562
 Turtles+0.547
Neg logit contributions
 misunder-0.496
Pri-0.463
 differ-0.429
1945-0.428
 attitudes-0.426
Token then
Token ID788
Feature activation+8.18e-01

Pos logit contributions
HAM+0.002
 SATA+0.002
oscope+0.002
 DRAG+0.002
 Turtles+0.002
Neg logit contributions
 misunder-0.002
Pri-0.002
 differ-0.001
 attitudes-0.001
1945-0.001
Token open
Token ID1280
Feature activation-1.03e+00

Pos logit contributions
 SATA+0.590
HAM+0.588
oscope+0.571
 DRAG+0.547
 Turtles+0.533
Neg logit contributions
 misunder-0.593
Pri-0.559
 differ-0.525
 attitudes-0.521
1945-0.520
Token the
Token ID262
Feature activation-8.18e-03

Pos logit contributions
 misunder+0.672
Pri+0.629
 differ+0.584
1945+0.580
 attitudes+0.580
Neg logit contributions
 SATA-0.818
HAM-0.813
oscope-0.793
 DRAG-0.765
 Turtles-0.745
Token catalog
Token ID18388
Feature activation+4.81e+00

Pos logit contributions
 misunder+0.006
Pri+0.006
 differ+0.005
 attitudes+0.005
1945+0.005
Neg logit contributions
 SATA-0.006
HAM-0.006
oscope-0.006
 DRAG-0.005
 Turtles-0.005
Token like
Token ID588
Feature activation+1.88e-01

Pos logit contributions
 SATA+3.617
HAM+3.538
oscope+3.511
 DRAG+3.366
 Turtles+3.247
Neg logit contributions
 misunder-3.364
Pri-3.133
1945-2.887
 differ-2.870
 attitudes-2.860
Token you
Token ID345
Feature activation-1.11e-01

Pos logit contributions
 SATA+0.142
HAM+0.141
oscope+0.138
 DRAG+0.133
 Turtles+0.129
Neg logit contributions
 misunder-0.129
Pri-0.121
 differ-0.113
 attitudes-0.113
1945-0.112
Token would
Token ID561
Feature activation+1.23e+00

Pos logit contributions
 misunder+0.062
Pri+0.058
 differ+0.053
1945+0.052
 attitudes+0.052
Neg logit contributions
 SATA-0.099
HAM-0.098
oscope-0.096
 DRAG-0.093
 Turtles-0.091
Token any
Token ID597
Feature activation+9.27e-01

Pos logit contributions
 SATA+0.999
HAM+0.989
oscope+0.967
 DRAG+0.936
 Turtles+0.910
Neg logit contributions
 misunder-0.782
Pri-0.730
 differ-0.675
1945-0.672
 attitudes-0.669
Token ebook
Token ID47179
Feature activation+5.12e-02

Pos logit contributions
 SATA+0.780
HAM+0.777
oscope+0.759
 DRAG+0.734
 Turtles+0.714
Neg logit contributions
 misunder-0.560
Pri-0.522
 differ-0.483
1945-0.478
 attitudes-0.476
Token this
Token ID428
Feature activation+3.61e-02

Pos logit contributions
 SATA+1.125
HAM+1.114
oscope+1.086
 DRAG+1.053
 Turtles+1.023
Neg logit contributions
 misunder-0.916
Pri-0.856
 differ-0.791
1945-0.787
 attitudes-0.784
Token had
Token ID550
Feature activation-2.21e-01

Pos logit contributions
 SATA+0.027
HAM+0.027
oscope+0.026
 DRAG+0.025
 Turtles+0.024
Neg logit contributions
 misunder-0.025
Pri-0.024
 differ-0.022
1945-0.022
 attitudes-0.022
Token dried
Token ID16577
Feature activation+9.73e-01

Pos logit contributions
 misunder+0.132
Pri+0.123
 differ+0.113
1945+0.113
 attitudes+0.112
Neg logit contributions
 SATA-0.188
HAM-0.187
oscope-0.183
 DRAG-0.177
 Turtles-0.172
Token,
Token ID11
Feature activation-3.71e-01

Pos logit contributions
 SATA+0.729
HAM+0.724
oscope+0.704
 DRAG+0.683
 Turtles+0.660
Neg logit contributions
 misunder-0.682
Pri-0.639
1945-0.593
 differ-0.592
 attitudes-0.586
Token I
Token ID314
Feature activation-1.68e-01

Pos logit contributions
 misunder+0.280
Pri+0.265
 differ+0.249
1945+0.247
 attitudes+0.247
Neg logit contributions
 SATA-0.257
HAM-0.255
oscope-0.247
 DRAG-0.238
 Turtles-0.230
Token tint
Token ID34791
Feature activation-6.23e+00

Pos logit contributions
 misunder+0.102
Pri+0.094
 differ+0.087
1945+0.086
 attitudes+0.086
Neg logit contributions
 SATA-0.142
HAM-0.141
oscope-0.138
 DRAG-0.133
 Turtles-0.130
Tokened
Token ID276
Feature activation-1.47e+00

Pos logit contributions
 misunder+0.763
Pri+0.522
 differ+0.286
 attitudes+0.260
esa+0.252
Neg logit contributions
 SATA-8.091
HAM-8.067
oscope-7.907
 DRAG-7.815
lishes-7.649
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
 misunder+0.870
Pri+0.809
 differ+0.747
 attitudes+0.741
1945+0.740
Neg logit contributions
 SATA-1.258
HAM-1.247
oscope-1.219
 DRAG-1.183
 Turtles-1.149
Token st
Token ID336
Feature activation-1.40e+00

Pos logit contributions
 misunder+0.007
Pri+0.007
 differ+0.006
 attitudes+0.006
1945+0.006
Neg logit contributions
 SATA-0.008
HAM-0.008
oscope-0.008
 DRAG-0.008
 Turtles-0.007
Tokenon
Token ID261
Feature activation-7.37e-02

Pos logit contributions
 misunder+0.787
Pri+0.737
 differ+0.671
1945+0.668
esa+0.666
Neg logit contributions
 SATA-1.239
HAM-1.229
oscope-1.194
 DRAG-1.174
 Turtles-1.138
Tokenework
Token ID6433
Feature activation+7.85e-01

Pos logit contributions
 misunder+0.057
Pri+0.054
 differ+0.051
1945+0.050
 attitudes+0.050
Neg logit contributions
 SATA-0.050
HAM-0.049
oscope-0.048
 DRAG-0.046
 Turtles-0.045
Token on
Token ID319
Feature activation+7.62e-01

Pos logit contributions
 SATA+0.020
HAM+0.020
oscope+0.020
 DRAG+0.019
 Turtles+0.019
Neg logit contributions
 misunder-0.018
Pri-0.016
1945-0.015
 differ-0.015
esa-0.015
Token the
Token ID262
Feature activation+3.58e-02

Pos logit contributions
 SATA+0.630
HAM+0.618
oscope+0.605
 DRAG+0.590
 Turtles+0.574
Neg logit contributions
 misunder-0.479
Pri-0.446
1945-0.412
 differ-0.411
esa-0.405
Token safety
Token ID3747
Feature activation+9.09e-01

Pos logit contributions
 SATA+0.029
HAM+0.028
oscope+0.028
 DRAG+0.027
 Turtles+0.026
Neg logit contributions
 misunder-0.023
Pri-0.022
1945-0.020
 differ-0.020
 attitudes-0.020
Token issues
Token ID2428
Feature activation+2.85e-01

Pos logit contributions
 SATA+0.752
HAM+0.739
oscope+0.725
 DRAG+0.705
 Turtles+0.684
Neg logit contributions
 misunder-0.568
Pri-0.529
1945-0.488
 differ-0.483
esa-0.481
Token regarding
Token ID5115
Feature activation-1.41e+00

Pos logit contributions
 SATA+0.221
HAM+0.218
oscope+0.212
 DRAG+0.205
 Turtles+0.200
Neg logit contributions
 misunder-0.194
Pri-0.182
1945-0.168
 differ-0.168
esa-0.166
Token manned
Token ID34737
Feature activation-5.82e+00

Pos logit contributions
 misunder+0.906
Pri+0.846
 differ+0.782
1945+0.780
 attitudes+0.771
Neg logit contributions
 SATA-1.144
HAM-1.130
oscope-1.102
 DRAG-1.071
 Turtles-1.040
Token flights
Token ID13956
Feature activation+1.32e+00

Pos logit contributions
 misunder+3.130
Pri+2.931
 attitudes+2.780
 differ+2.731
 kitchens+2.724
Neg logit contributions
HAM-5.098
 SATA-5.051
oscope-5.016
 DRAG-4.872
antage-4.763
Token.
Token ID13
Feature activation+5.43e-01

Pos logit contributions
 SATA+0.972
HAM+0.961
oscope+0.933
 DRAG+0.896
 Turtles+0.870
Neg logit contributions
 misunder-0.954
Pri-0.881
1945-0.816
 differ-0.812
Initialized-0.809
Token "
Token ID366
Feature activation+6.95e-01

Pos logit contributions
 SATA+0.395
HAM+0.391
oscope+0.380
 DRAG+0.367
 Turtles+0.356
Neg logit contributions
 misunder-0.394
Pri-0.370
 differ-0.346
1945-0.344
 attitudes-0.343
TokenHe
Token ID1544
Feature activation+6.80e-02

Pos logit contributions
 SATA+0.578
HAM+0.576
oscope+0.558
 DRAG+0.540
 Turtles+0.527
Neg logit contributions
 misunder-0.436
Pri-0.395
 differ-0.372
 attitudes-0.368
1945-0.365
Token says
Token ID1139
Feature activation-2.46e-01

Pos logit contributions
 SATA+0.052
HAM+0.052
oscope+0.051
 DRAG+0.049
 Turtles+0.047
Neg logit contributions
 misunder-0.046
Pri-0.043
 differ-0.041
 attitudes-0.040
1945-0.040
Token
Token ID198
Feature activation+3.33e-01

Pos logit contributions
 SATA+0.492
HAM+0.488
oscope+0.478
 DRAG+0.465
 Turtles+0.450
Neg logit contributions
 misunder-0.392
Pri-0.383
1945-0.354
 differ-0.352
 attitudes-0.348
TokenNap
Token ID49799
Feature activation-1.41e+00

Pos logit contributions
 SATA+0.262
HAM+0.262
oscope+0.254
 DRAG+0.244
 Turtles+0.237
Neg logit contributions
 misunder-0.222
Pri-0.203
 differ-0.192
 attitudes-0.190
1945-0.189
TokenTime
Token ID7575
Feature activation-8.35e-01

Pos logit contributions
 misunder+1.082
Pri+1.013
 differ+0.953
1945+0.950
 attitudes+0.946
Neg logit contributions
 SATA-0.969
HAM-0.961
oscope-0.931
 DRAG-0.898
 Turtles-0.868
Token
Token ID198
Feature activation+3.12e-02

Pos logit contributions
 misunder+0.584
Pri+0.545
 differ+0.510
1945+0.507
 attitudes+0.506
Neg logit contributions
 SATA-0.628
HAM-0.623
oscope-0.605
 DRAG-0.585
 Turtles-0.568
Token
Token ID198
Feature activation-3.35e-02

Pos logit contributions
 SATA+0.025
HAM+0.025
oscope+0.024
 DRAG+0.024
 Turtles+0.023
Neg logit contributions
 misunder-0.019
Pri-0.019
1945-0.018
 differ-0.017
Zero-0.017
TokenReply
Token ID36875
Feature activation-5.74e+00

Pos logit contributions
 misunder+0.021
Pri+0.019
 differ+0.018
 attitudes+0.018
1945+0.018
Neg logit contributions
 SATA-0.027
HAM-0.027
oscope-0.027
 DRAG-0.026
 Turtles-0.025
Token ·
Token ID14128
Feature activation-1.65e+00

Pos logit contributions
 misunder+2.895
Pri+2.707
 differ+2.474
 attitudes+2.454
1945+2.453
Neg logit contributions
HAM-5.282
 SATA-5.269
oscope-5.202
 DRAG-5.023
antage-4.902
Token Report
Token ID6358
Feature activation-1.54e+00

Pos logit contributions
 misunder+1.276
Pri+1.213
 differ+1.142
1945+1.135
 attitudes+1.134
Neg logit contributions
 SATA-1.109
HAM-1.103
oscope-1.074
 DRAG-1.027
 Turtles-0.990
Token Post
Token ID2947
Feature activation-1.11e+00

Pos logit contributions
 misunder+0.992
Pri+0.920
 differ+0.849
1945+0.848
 kitchens+0.843
Neg logit contributions
 SATA-1.249
HAM-1.231
oscope-1.199
 DRAG-1.166
 Turtles-1.134
Token<|endoftext|>
Token ID50256
Feature activation-9.05e-01

Pos logit contributions
 misunder+0.613
Pri+0.564
 differ+0.514
1945+0.511
 attitudes+0.510
Neg logit contributions
 SATA-0.995
HAM-0.985
oscope-0.964
 DRAG-0.938
 Turtles-0.915
TokenThe
Token ID464
Feature activation+2.07e-01

Pos logit contributions
 misunder+0.584
Pri+0.547
 differ+0.505
1945+0.504
 attitudes+0.501
Neg logit contributions
 SATA-0.728
HAM-0.720
oscope-0.704
 DRAG-0.682
 Turtles-0.663
Token area
Token ID1989
Feature activation-1.33e+00

Pos logit contributions
 misunder+0.422
Pri+0.397
 differ+0.371
1945+0.369
 attitudes+0.368
Neg logit contributions
 SATA-0.428
HAM-0.424
oscope-0.413
 DRAG-0.399
 Turtles-0.386
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
 misunder+0.915
Pri+0.858
 differ+0.798
1945+0.794
 attitudes+0.791
Neg logit contributions
 SATA-1.019
HAM-1.008
oscope-0.983
 DRAG-0.949
 Turtles-0.921
Token
Token ID198
Feature activation-3.23e-01

Pos logit contributions
 misunder+0.096
Pri+0.089
 differ+0.083
1945+0.083
 attitudes+0.083
Neg logit contributions
 SATA-0.105
HAM-0.104
oscope-0.102
 DRAG-0.098
 Turtles-0.096
Token
Token ID198
Feature activation-4.99e-01

Pos logit contributions
Pri+0.192
 misunder+0.186
1945+0.178
Zero+0.174
 differ+0.174
Neg logit contributions
 SATA-0.264
HAM-0.262
oscope-0.258
 DRAG-0.253
antage-0.244
TokenCor
Token ID10606
Feature activation-2.73e+00

Pos logit contributions
 misunder+0.314
Pri+0.290
 differ+0.270
 attitudes+0.267
1945+0.267
Neg logit contributions
 SATA-0.410
HAM-0.408
oscope-0.398
 DRAG-0.384
 Turtles-0.374
Tokenrespond
Token ID5546
Feature activation-5.73e+00

Pos logit contributions
 misunder+1.194
Pri+1.104
 differ+0.987
poral+0.974
1945+0.974
Neg logit contributions
 SATA-2.726
HAM-2.711
oscope-2.636
 DRAG-2.619
 Turtles-2.539
Tokenents
Token ID658
Feature activation-1.41e+00

Pos logit contributions
 misunder+3.174
Pri+3.162
esa+2.949
 differ+2.886
enne+2.862
Neg logit contributions
 SATA-4.874
HAM-4.842
 DRAG-4.666
oscope-4.593
 Turtles-4.502
Token say
Token ID910
Feature activation-4.64e-01

Pos logit contributions
 misunder+0.982
Pri+0.921
1945+0.855
 differ+0.854
 attitudes+0.849
Neg logit contributions
 SATA-1.066
HAM-1.055
oscope-1.030
 DRAG-0.995
 Turtles-0.966
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
 misunder+0.284
Pri+0.264
 differ+0.244
1945+0.242
 attitudes+0.241
Neg logit contributions
 SATA-0.389
HAM-0.386
oscope-0.377
 DRAG-0.366
 Turtles-0.356
Token mine
Token ID6164
Feature activation-7.33e-01

Pos logit contributions
 misunder+0.118
Pri+0.110
 differ+0.102
 attitudes+0.102
1945+0.101
Neg logit contributions
 SATA-0.154
HAM-0.153
oscope-0.150
 DRAG-0.145
 Turtles-0.141
Token may
Token ID743
Feature activation-1.74e+00

Pos logit contributions
 misunder+0.450
Pri+0.420
 differ+0.386
1945+0.385
 attitudes+0.383
Neg logit contributions
 SATA-0.612
HAM-0.607
oscope-0.593
 DRAG-0.574
 Turtles-0.558
Token NASA
Token ID8884
Feature activation-2.59e+00

Pos logit contributions
 SATA+0.063
HAM+0.062
oscope+0.061
 DRAG+0.059
 Turtles+0.057
Neg logit contributions
 misunder-0.049
Pri-0.046
 differ-0.042
1945-0.042
 attitudes-0.042
Token
Token ID447
Feature activation-5.23e-01

Pos logit contributions
 misunder+1.771
Pri+1.681
 differ+1.560
 attitudes+1.557
esa+1.542
Neg logit contributions
 SATA-1.972
HAM-1.943
oscope-1.905
 DRAG-1.840
 Turtles-1.780
Token
Token ID247
Feature activation+1.96e-01

Pos logit contributions
 misunder+0.368
Pri+0.345
 differ+0.321
1945+0.321
 attitudes+0.318
Neg logit contributions
 SATA-0.391
HAM-0.389
oscope-0.378
 DRAG-0.365
 Turtles-0.353
Tokens
Token ID82
Feature activation+2.95e-01

Pos logit contributions
 SATA+0.148
HAM+0.147
oscope+0.143
 DRAG+0.138
 Turtles+0.134
Neg logit contributions
 misunder-0.136
Pri-0.127
 differ-0.118
1945-0.118
 attitudes-0.117
Token commercial
Token ID5068
Feature activation-1.78e+00

Pos logit contributions
 SATA+0.234
HAM+0.231
oscope+0.225
 DRAG+0.218
 Turtles+0.212
Neg logit contributions
 misunder-0.195
Pri-0.182
 differ-0.169
1945-0.169
 attitudes-0.167
Token crew
Token ID5462
Feature activation-5.63e+00

Pos logit contributions
 misunder+1.067
Pri+0.999
 differ+0.920
 attitudes+0.919
esa+0.912
Neg logit contributions
 SATA-1.501
HAM-1.494
oscope-1.460
 DRAG-1.413
 Turtles-1.379
Token effort
Token ID3626
Feature activation-1.46e-01

Pos logit contributions
 misunder+3.225
Pri+3.037
 differ+2.793
 attitudes+2.785
esa+2.759
Neg logit contributions
 SATA-4.820
HAM-4.792
oscope-4.721
 DRAG-4.538
 Turtles-4.448
Token,
Token ID11
Feature activation-2.03e-02

Pos logit contributions
 misunder+0.098
Pri+0.091
 differ+0.085
1945+0.084
 attitudes+0.084
Neg logit contributions
 SATA-0.114
HAM-0.113
oscope-0.110
 DRAG-0.106
 Turtles-0.103
Token which
Token ID543
Feature activation+9.35e-01

Pos logit contributions
 misunder+0.013
Pri+0.012
 differ+0.011
1945+0.011
 attitudes+0.011
Neg logit contributions
 SATA-0.017
HAM-0.017
oscope-0.016
 DRAG-0.016
 Turtles-0.015
Token will
Token ID481
Feature activation-2.11e-02

Pos logit contributions
 SATA+0.764
HAM+0.756
oscope+0.739
 DRAG+0.716
 Turtles+0.697
Neg logit contributions
 misunder-0.592
Pri-0.551
1945-0.508
 differ-0.508
 attitudes-0.505
Token make
Token ID787
Feature activation+2.23e+00

Pos logit contributions
 misunder+0.014
Pri+0.014
 differ+0.013
1945+0.013
 attitudes+0.012
Neg logit contributions
 SATA-0.016
HAM-0.016
oscope-0.016
 DRAG-0.015
 Turtles-0.015
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
 SATA+0.118
HAM+0.117
oscope+0.114
 DRAG+0.109
 Turtles+0.106
Neg logit contributions
 misunder-0.112
Pri-0.106
 differ-0.099
1945-0.099
 attitudes-0.098
Token
Token ID198
Feature activation-3.33e-02

Pos logit contributions
 SATA+0.180
HAM+0.179
oscope+0.176
 DRAG+0.171
 Turtles+0.165
Neg logit contributions
Pri-0.136
 misunder-0.136
1945-0.126
 differ-0.124
Zero-0.124
TokenLastly
Token ID37511
Feature activation+1.84e-01

Pos logit contributions
 misunder+0.022
Pri+0.021
 differ+0.019
1945+0.019
 attitudes+0.019
Neg logit contributions
 SATA-0.026
HAM-0.026
oscope-0.025
 DRAG-0.025
 Turtles-0.024
Token,
Token ID11
Feature activation-2.90e-01

Pos logit contributions
 SATA+0.136
HAM+0.134
oscope+0.131
 DRAG+0.127
 Turtles+0.123
Neg logit contributions
 misunder-0.130
Pri-0.122
 differ-0.114
1945-0.114
 attitudes-0.113
Token I
Token ID314
Feature activation-2.52e-01

Pos logit contributions
 misunder+0.223
Pri+0.211
 differ+0.199
1945+0.198
 attitudes+0.197
Neg logit contributions
 SATA-0.197
HAM-0.195
oscope-0.189
 DRAG-0.182
 Turtles-0.175
Token tint
Token ID34791
Feature activation-5.58e+00

Pos logit contributions
 misunder+0.154
Pri+0.143
 differ+0.132
1945+0.131
 attitudes+0.131
Neg logit contributions
 SATA-0.211
HAM-0.209
oscope-0.205
 DRAG-0.198
 Turtles-0.192
Tokened
Token ID276
Feature activation-1.41e+00

Pos logit contributions
 misunder+0.728
Pri+0.506
 differ+0.275
 attitudes+0.259
1945+0.248
Neg logit contributions
 SATA-7.260
HAM-7.231
oscope-7.107
 DRAG-6.997
lishes-6.844
Token around
Token ID1088
Feature activation+1.03e+00

Pos logit contributions
 misunder+0.832
Pri+0.772
 differ+0.709
1945+0.705
 attitudes+0.703
Neg logit contributions
 SATA-1.214
HAM-1.203
oscope-1.177
 DRAG-1.142
 Turtles-1.111
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
 SATA+0.863
HAM+0.856
oscope+0.832
 DRAG+0.808
 Turtles+0.795
Neg logit contributions
 misunder-0.629
Pri-0.586
1945-0.536
 differ-0.535
esa-0.528
Token cl
Token ID537
Feature activation+5.91e-02

Pos logit contributions
 misunder+0.080
Pri+0.075
 differ+0.070
 attitudes+0.069
1945+0.069
Neg logit contributions
 SATA-0.091
HAM-0.090
oscope-0.088
 DRAG-0.085
 Turtles-0.082
Tokenumps
Token ID8142
Feature activation-1.06e+00

Pos logit contributions
 SATA+0.053
HAM+0.052
oscope+0.051
 DRAG+0.050
 Turtles+0.048
Neg logit contributions
 misunder-0.033
Pri-0.031
 differ-0.028
1945-0.028
 attitudes-0.028
Token your
Token ID534
Feature activation-3.12e-01

Pos logit contributions
 misunder+0.139
Pri+0.129
 differ+0.120
 attitudes+0.119
1945+0.118
Neg logit contributions
 SATA-0.199
HAM-0.198
oscope-0.193
 DRAG-0.187
 Turtles-0.182
Token photo
Token ID4590
Feature activation+7.19e-01

Pos logit contributions
 misunder+0.187
Pri+0.175
 differ+0.164
 attitudes+0.164
1945+0.161
Neg logit contributions
HAM-0.261
 SATA-0.260
oscope-0.252
 DRAG-0.245
lishes-0.239
Token's
Token ID338
Feature activation+4.42e-01

Pos logit contributions
 SATA+0.562
HAM+0.558
oscope+0.542
 DRAG+0.525
 Turtles+0.510
Neg logit contributions
 misunder-0.479
Pri-0.449
 differ-0.420
 attitudes-0.417
1945-0.416
Token saturation
Token ID36275
Feature activation+9.14e-01

Pos logit contributions
HAM+0.330
 SATA+0.329
oscope+0.317
 DRAG+0.306
lishes+0.300
Neg logit contributions
 misunder-0.308
Pri-0.290
 attitudes-0.275
 differ-0.274
1945-0.269
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
 SATA+0.712
HAM+0.705
oscope+0.689
 DRAG+0.666
 Turtles+0.646
Neg logit contributions
 misunder-0.614
Pri-0.575
 differ-0.533
1945-0.532
 attitudes-0.529
Token tint
Token ID34791
Feature activation-5.55e+00

Pos logit contributions
HAM+0.080
 SATA+0.080
oscope+0.077
 DRAG+0.074
 Turtles+0.072
Neg logit contributions
 misunder-0.086
Pri-0.081
 differ-0.077
 attitudes-0.077
1945-0.075
Token,
Token ID11
Feature activation-5.70e-02

Pos logit contributions
 misunder+3.379
Pri+3.190
 differ+3.041
 attitudes+3.006
esa+2.931
Neg logit contributions
 SATA-4.506
HAM-4.481
oscope-4.260
 DRAG-4.223
lishes-4.147
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 misunder+0.041
Pri+0.038
 differ+0.036
 attitudes+0.036
1945+0.036
Neg logit contributions
 SATA-0.042
HAM-0.042
oscope-0.040
 DRAG-0.039
 Turtles-0.038
Token the
Token ID262
Feature activation-5.20e-02

Pos logit contributions
 misunder+0.102
Pri+0.096
 differ+0.091
 attitudes+0.090
1945+0.089
Neg logit contributions
HAM-0.092
 SATA-0.092
oscope-0.088
 DRAG-0.085
 Turtles-0.083
Token like
Token ID588
Feature activation-1.27e-01

Pos logit contributions
 misunder+0.036
Pri+0.034
 differ+0.032
 attitudes+0.032
1945+0.032
Neg logit contributions
 SATA-0.039
HAM-0.039
oscope-0.037
 DRAG-0.036
 Turtles-0.035
Token,
Token ID11
Feature activation+1.04e-01

Pos logit contributions
 misunder+0.087
Pri+0.082
 differ+0.076
1945+0.076
 attitudes+0.076
Neg logit contributions
 SATA-0.097
HAM-0.096
oscope-0.094
 DRAG-0.091
 Turtles-0.088
Tokenrain
Token ID3201
Feature activation-1.25e+00

Pos logit contributions
 SATA+0.030
HAM+0.030
oscope+0.029
 DRAG+0.028
 Turtles+0.027
Neg logit contributions
 misunder-0.044
Pri-0.042
 differ-0.040
1945-0.040
esa-0.040
Token,
Token ID11
Feature activation+3.26e-01

Pos logit contributions
 misunder+0.793
Pri+0.747
 differ+0.704
1945+0.691
 attitudes+0.687
Neg logit contributions
 SATA-1.006
HAM-1.001
oscope-0.973
 DRAG-0.947
 Turtles-0.919
Token and
Token ID290
Feature activation+9.57e-02

Pos logit contributions
 SATA+0.269
HAM+0.266
oscope+0.260
 DRAG+0.252
 Turtles+0.245
Neg logit contributions
 misunder-0.204
Pri-0.190
 differ-0.175
1945-0.174
 attitudes-0.174
Token Prof
Token ID4415
Feature activation+2.63e-01

Pos logit contributions
 SATA+0.080
HAM+0.079
oscope+0.077
 DRAG+0.075
 Turtles+0.073
Neg logit contributions
 misunder-0.059
Pri-0.055
 differ-0.051
1945-0.051
 attitudes-0.050
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
 SATA+0.193
HAM+0.191
oscope+0.186
 DRAG+0.179
 Turtles+0.174
Neg logit contributions
 misunder-0.189
Pri-0.177
 differ-0.166
1945-0.165
 attitudes-0.165
Token Loren
Token ID32035
Feature activation-5.37e+00

Pos logit contributions
 SATA+0.199
HAM+0.197
oscope+0.192
 DRAG+0.185
 Turtles+0.180
Neg logit contributions
 misunder-0.189
Pri-0.177
 differ-0.166
1945-0.165
 attitudes-0.165
Token Cord
Token ID21119
Feature activation-3.95e-02

Pos logit contributions
 misunder+3.705
Pri+3.633
 differ+3.399
esa+3.399
1945+3.392
Neg logit contributions
HAM-3.813
 SATA-3.787
 DRAG-3.590
oscope-3.587
lishes-3.443
Tokenain
Token ID391
Feature activation-8.88e-01

Pos logit contributions
 misunder+0.026
Pri+0.025
 differ+0.023
1945+0.023
esa+0.023
Neg logit contributions
 SATA-0.031
HAM-0.031
oscope-0.030
 DRAG-0.029
 Turtles-0.028
Token comes
Token ID2058
Feature activation-2.49e+00

Pos logit contributions
 misunder+0.547
Pri+0.509
 differ+0.470
1945+0.467
 attitudes+0.466
Neg logit contributions
 SATA-0.740
HAM-0.734
oscope-0.717
 DRAG-0.695
 Turtles-0.676
Token down
Token ID866
Feature activation-1.22e-01

Pos logit contributions
 misunder+1.608
Pri+1.524
 differ+1.424
 kitchens+1.407
 attitudes+1.406
Neg logit contributions
 SATA-1.971
HAM-1.964
oscope-1.891
 DRAG-1.859
 Turtles-1.792
Token pretty
Token ID2495
Feature activation+1.60e+00

Pos logit contributions
 misunder+0.074
Pri+0.068
1945+0.063
 differ+0.062
esa+0.062
Neg logit contributions
 SATA-0.102
HAM-0.102
oscope-0.099
 DRAG-0.096
 Turtles-0.094
Token space
Token ID2272
Feature activation-2.68e+00

Pos logit contributions
 misunder+0.045
Pri+0.042
1945+0.039
 differ+0.039
 attitudes+0.038
Neg logit contributions
 SATA-0.055
HAM-0.054
oscope-0.053
 DRAG-0.051
 Turtles-0.050
Token station
Token ID4429
Feature activation-2.26e+00

Pos logit contributions
 misunder+1.562
Pri+1.469
 attitudes+1.353
 differ+1.349
1945+1.331
Neg logit contributions
 SATA-2.302
HAM-2.285
oscope-2.235
 DRAG-2.172
 Turtles-2.108
Token operations
Token ID4560
Feature activation-1.37e+00

Pos logit contributions
 misunder+1.240
Pri+1.156
 differ+1.057
 attitudes+1.055
esa+1.039
Neg logit contributions
 SATA-2.022
HAM-2.008
oscope-1.969
 DRAG-1.913
 Turtles-1.861
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
 misunder+0.914
Pri+0.861
 differ+0.801
 attitudes+0.795
1945+0.795
Neg logit contributions
 SATA-1.061
HAM-1.052
oscope-1.027
 DRAG-0.994
 Turtles-0.963
Token A
Token ID317
Feature activation-3.74e-01

Pos logit contributions
 misunder+0.173
Pri+0.163
 differ+0.152
1945+0.151
 attitudes+0.151
Neg logit contributions
 SATA-0.183
HAM-0.182
oscope-0.177
 DRAG-0.171
 Turtles-0.166
Token Soy
Token ID30022
Feature activation-5.36e+00

Pos logit contributions
 misunder+0.253
Pri+0.238
 differ+0.221
1945+0.220
 attitudes+0.219
Neg logit contributions
 SATA-0.288
HAM-0.286
oscope-0.279
 DRAG-0.269
 Turtles-0.261
Tokenuz
Token ID10277
Feature activation-7.46e-01

Pos logit contributions
 misunder+2.530
Pri+2.501
Zero+2.220
 differ+2.205
esa+2.188
Neg logit contributions
HAM-5.082
 SATA-4.982
oscope-4.865
 DRAG-4.761
lishes-4.634
Token carrying
Token ID6872
Feature activation-1.35e+00

Pos logit contributions
 misunder+0.488
Pri+0.456
 differ+0.422
1945+0.421
 attitudes+0.419
Neg logit contributions
 SATA-0.594
HAM-0.588
oscope-0.574
 DRAG-0.556
 Turtles-0.540
Token NASA
Token ID8884
Feature activation-2.87e+00

Pos logit contributions
 misunder+0.842
Pri+0.781
 differ+0.723
1945+0.721
 attitudes+0.712
Neg logit contributions
 SATA-1.120
HAM-1.108
oscope-1.081
 DRAG-1.046
 Turtles-1.020
Token astronaut
Token ID33779
Feature activation-4.93e+00

Pos logit contributions
 misunder+1.566
Pri+1.455
 differ+1.323
 attitudes+1.318
1945+1.306
Neg logit contributions
 SATA-2.576
HAM-2.557
oscope-2.502
 DRAG-2.431
 Turtles-2.370
Token Shane
Token ID24379
Feature activation-7.56e-01

Pos logit contributions
 misunder+3.054
Pri+2.905
 attitudes+2.668
 differ+2.655
esa+2.635
Neg logit contributions
 SATA-4.022
HAM-4.006
oscope-3.903
 DRAG-3.799
 Turtles-3.675
Token
Token ID447
Feature activation-1.11e-01

Pos logit contributions
 SATA+0.102
HAM+0.101
oscope+0.099
 DRAG+0.096
 Turtles+0.093
Neg logit contributions
 misunder-0.083
Pri-0.078
 differ-0.072
1945-0.072
 attitudes-0.071
Token
Token ID251
Feature activation-2.88e-01

Pos logit contributions
 misunder+0.068
Pri+0.064
 differ+0.059
1945+0.059
 attitudes+0.059
Neg logit contributions
 SATA-0.093
HAM-0.091
oscope-0.090
 DRAG-0.087
 Turtles-0.085
Token said
Token ID531
Feature activation+7.54e-01

Pos logit contributions
 misunder+0.182
Pri+0.169
 differ+0.157
1945+0.156
 attitudes+0.155
Neg logit contributions
 SATA-0.235
HAM-0.234
oscope-0.228
 DRAG-0.221
 Turtles-0.215
Token Ryan
Token ID6047
Feature activation-3.09e-01

Pos logit contributions
 SATA+0.585
HAM+0.579
oscope+0.565
 DRAG+0.546
 Turtles+0.531
Neg logit contributions
 misunder-0.509
Pri-0.476
 differ-0.443
1945-0.441
 attitudes-0.438
Token,
Token ID11
Feature activation+1.74e-03

Pos logit contributions
 misunder+0.225
Pri+0.214
 differ+0.199
1945+0.198
 attitudes+0.198
Neg logit contributions
 SATA-0.222
HAM-0.220
oscope-0.214
 DRAG-0.207
 Turtles-0.199
Token attribut
Token ID24548
Feature activation-5.33e+00

Pos logit contributions
 SATA+0.002
HAM+0.001
oscope+0.001
 DRAG+0.001
 Turtles+0.001
Neg logit contributions
 misunder-0.001
Pri-0.001
 differ-0.001
1945-0.001
 attitudes-0.001
Tokening
Token ID278
Feature activation-2.12e+00

Pos logit contributions
 misunder+0.932
Pri+0.818
 differ+0.618
 attitudes+0.556
Zero+0.535
Neg logit contributions
 SATA-6.644
HAM-6.584
oscope-6.458
 DRAG-6.398
 Turtles-6.264
Token the
Token ID262
Feature activation-5.07e-02

Pos logit contributions
 misunder+1.243
Pri+1.154
 differ+1.060
1945+1.052
 attitudes+1.052
Neg logit contributions
 SATA-1.833
HAM-1.818
oscope-1.777
 DRAG-1.724
 Turtles-1.678
Token foul
Token ID15626
Feature activation-8.55e-01

Pos logit contributions
 misunder+0.030
Pri+0.028
 differ+0.026
1945+0.026
 attitudes+0.025
Neg logit contributions
 SATA-0.043
HAM-0.043
oscope-0.042
 DRAG-0.041
 Turtles-0.040
Token-
Token ID12
Feature activation-7.50e-02

Pos logit contributions
 misunder+0.630
Pri+0.594
 differ+0.556
1945+0.554
 attitudes+0.552
Neg logit contributions
 SATA-0.609
HAM-0.602
oscope-0.586
 DRAG-0.565
 Turtles-0.547
Tokenup
Token ID929
Feature activation-3.51e-01

Pos logit contributions
 misunder+0.055
Pri+0.051
 differ+0.048
1945+0.048
 attitudes+0.047
Neg logit contributions
 SATA-0.054
HAM-0.053
oscope-0.052
 DRAG-0.050
 Turtles-0.049
Token Hillary
Token ID5073
Feature activation+5.26e-01

Pos logit contributions
 SATA+0.928
HAM+0.915
oscope+0.901
 DRAG+0.869
 Turtles+0.845
Neg logit contributions
 misunder-0.671
Pri-0.618
 differ-0.569
1945-0.567
 attitudes-0.562
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
 SATA+0.377
HAM+0.373
oscope+0.362
 DRAG+0.348
 Turtles+0.337
Neg logit contributions
 misunder-0.386
Pri-0.363
 differ-0.340
1945-0.339
esa-0.337
Token His
Token ID2399
Feature activation-3.99e-01

Pos logit contributions
 misunder+0.128
Pri+0.121
 differ+0.113
1945+0.113
 attitudes+0.113
Neg logit contributions
 SATA-0.130
HAM-0.128
oscope-0.125
 DRAG-0.121
 Turtles-0.117
Token supporters
Token ID5941
Feature activation+1.05e+00

Pos logit contributions
 misunder+0.272
Pri+0.256
 differ+0.239
 attitudes+0.238
1945+0.237
Neg logit contributions
 SATA-0.306
HAM-0.303
oscope-0.296
 DRAG-0.285
 Turtles-0.277
Token are
Token ID389
Feature activation-1.66e-01

Pos logit contributions
 SATA+0.893
HAM+0.882
oscope+0.861
 DRAG+0.841
 Turtles+0.815
Neg logit contributions
 misunder-0.625
Pri-0.582
1945-0.531
esa-0.527
 differ-0.526
Token blinded
Token ID38497
Feature activation-5.32e+00

Pos logit contributions
 misunder+0.103
Pri+0.096
 differ+0.089
 attitudes+0.088
1945+0.088
Neg logit contributions
 SATA-0.138
HAM-0.137
oscope-0.134
 DRAG-0.130
 Turtles-0.126
Token by
Token ID416
Feature activation+4.73e-01

Pos logit contributions
 misunder+3.540
Pri+3.381
 attitudes+3.234
 differ+3.221
1945+3.185
Neg logit contributions
 SATA-3.979
HAM-3.930
oscope-3.819
 DRAG-3.765
antage-3.605
Token their
Token ID511
Feature activation+5.49e-01

Pos logit contributions
 SATA+0.420
HAM+0.416
oscope+0.408
 DRAG+0.396
 Turtles+0.386
Neg logit contributions
 misunder-0.265
Pri-0.245
 differ-0.223
1945-0.223
esa-0.221
Token anger
Token ID8993
Feature activation+2.05e-01

Pos logit contributions
 SATA+0.500
HAM+0.496
oscope+0.486
 DRAG+0.472
 Turtles+0.461
Neg logit contributions
 misunder-0.296
Pri-0.273
 differ-0.248
1945-0.247
 attitudes-0.244
Token twisted
Token ID19074
Feature activation-1.01e-01

Pos logit contributions
 SATA+0.158
HAM+0.156
oscope+0.153
 DRAG+0.147
 Turtles+0.143
Neg logit contributions
 misunder-0.140
Pri-0.131
 differ-0.121
1945-0.121
 attitudes-0.120
Token into
Token ID656
Feature activation-2.80e+00

Pos logit contributions
 misunder+0.067
Pri+0.063
 differ+0.059
 attitudes+0.059
1945+0.058
Neg logit contributions
 SATA-0.078
HAM-0.078
oscope-0.076
 DRAG-0.073
 Turtles-0.071
Token sure
Token ID1654
Feature activation+1.23e+00

Pos logit contributions
 SATA+1.007
HAM+0.967
oscope+0.927
 DRAG+0.863
 Turtles+0.816
Neg logit contributions
 misunder-2.747
Pri-2.633
1945-2.516
 differ-2.513
 attitudes-2.503
Token that
Token ID326
Feature activation-3.94e-01

Pos logit contributions
 SATA+0.970
HAM+0.957
oscope+0.936
 DRAG+0.906
 Turtles+0.882
Neg logit contributions
 misunder-0.816
Pri-0.763
 differ-0.706
1945-0.705
 attitudes-0.701
Token both
Token ID1111
Feature activation+4.46e-02

Pos logit contributions
 misunder+0.257
Pri+0.241
 differ+0.225
 attitudes+0.223
1945+0.222
Neg logit contributions
HAM-0.312
 SATA-0.311
oscope-0.303
 DRAG-0.292
 Turtles-0.283
Token Input
Token ID23412
Feature activation-6.65e-01

Pos logit contributions
 SATA+0.033
HAM+0.033
oscope+0.032
 DRAG+0.030
 Turtles+0.029
Neg logit contributions
 misunder-0.032
Pri-0.030
 differ-0.028
 attitudes-0.028
1945-0.028
TokenStream
Token ID12124
Feature activation+1.33e-01

Pos logit contributions
 misunder+0.574
Pri+0.546
 differ+0.516
1945+0.514
 attitudes+0.513
Neg logit contributions
 SATA-0.389
HAM-0.384
oscope-0.372
 DRAG-0.355
 Turtles-0.341
Token Adapt
Token ID30019
Feature activation-5.27e+00

Pos logit contributions
 SATA+0.104
HAM+0.103
oscope+0.101
 DRAG+0.097
 Turtles+0.094
Neg logit contributions
 misunder-0.089
Pri-0.083
 differ-0.078
 attitudes-0.077
1945-0.077
Tokenive
Token ID425
Feature activation-3.02e+00

Pos logit contributions
 misunder+3.077
Pri+2.913
 differ+2.652
Zero+2.632
 attitudes+2.620
Neg logit contributions
HAM-4.421
 SATA-4.391
oscope-4.203
 DRAG-4.197
zbek-4.046
Token and
Token ID290
Feature activation+8.58e-03

Pos logit contributions
 misunder+2.336
Pri+2.249
 differ+2.115
 attitudes+2.095
Zero+2.085
Neg logit contributions
HAM-1.983
 SATA-1.983
oscope-1.923
 DRAG-1.853
antage-1.772
Token RT
Token ID11923
Feature activation+5.70e-01

Pos logit contributions
 SATA+0.006
HAM+0.006
oscope+0.005
 DRAG+0.005
 Turtles+0.005
Neg logit contributions
 misunder-0.007
Pri-0.006
 differ-0.006
 attitudes-0.006
1945-0.006
TokenMP
Token ID7378
Feature activation+1.36e+00

Pos logit contributions
 SATA+0.437
HAM+0.432
oscope+0.422
 DRAG+0.407
 Turtles+0.395
Neg logit contributions
 misunder-0.391
Pri-0.365
 differ-0.340
1945-0.339
 attitudes-0.338
Token Input
Token ID23412
Feature activation-2.46e-01

Pos logit contributions
 SATA+0.981
HAM+0.969
oscope+0.943
 DRAG+0.910
 Turtles+0.882
Neg logit contributions
 misunder-0.998
Pri-0.940
 differ-0.877
1945-0.875
 attitudes-0.871
Token two
Token ID734
Feature activation+3.90e-01

Pos logit contributions
 misunder+0.345
Pri+0.322
 differ+0.297
 attitudes+0.294
1945+0.294
Neg logit contributions
 SATA-0.487
HAM-0.483
oscope-0.473
 DRAG-0.458
 Turtles-0.445
Token patriarch
Token ID28915
Feature activation+5.68e-01

Pos logit contributions
 SATA+0.332
HAM+0.329
oscope+0.321
 DRAG+0.312
 Turtles+0.305
Neg logit contributions
 misunder-0.234
Pri-0.217
1945-0.199
 differ-0.198
esa-0.196
Tokens
Token ID82
Feature activation+4.26e-01

Pos logit contributions
 SATA+0.433
HAM+0.426
oscope+0.418
 DRAG+0.403
 Turtles+0.392
Neg logit contributions
 misunder-0.394
Pri-0.366
 differ-0.340
1945-0.339
esa-0.335
Token in
Token ID287
Feature activation-9.67e-03

Pos logit contributions
 SATA+0.345
HAM+0.342
oscope+0.333
 DRAG+0.322
 Turtles+0.315
Neg logit contributions
 misunder-0.273
Pri-0.253
1945-0.233
 differ-0.232
 attitudes-0.231
Token a
Token ID257
Feature activation-2.63e-01

Pos logit contributions
 misunder+0.006
Pri+0.006
1945+0.005
 differ+0.005
esa+0.005
Neg logit contributions
 SATA-0.008
HAM-0.008
oscope-0.008
 DRAG-0.008
 Turtles-0.007
Token pod
Token ID24573
Feature activation-5.26e+00

Pos logit contributions
 misunder+0.178
Pri+0.166
 differ+0.155
1945+0.154
 attitudes+0.153
Neg logit contributions
 SATA-0.203
HAM-0.201
oscope-0.196
 DRAG-0.190
 Turtles-0.184
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
 misunder+3.371
Pri+3.246
 differ+3.014
 attitudes+2.966
1945+2.965
Neg logit contributions
 SATA-4.118
HAM-4.104
oscope-3.967
 DRAG-3.891
 Turtles-3.777
Token And
Token ID843
Feature activation+1.23e+00

Pos logit contributions
 SATA+0.124
HAM+0.123
oscope+0.120
 DRAG+0.116
 Turtles+0.112
Neg logit contributions
 misunder-0.114
Pri-0.107
 differ-0.100
1945-0.099
 attitudes-0.099
Token Pence
Token ID17339
Feature activation+7.49e-01

Pos logit contributions
 SATA+1.006
HAM+0.992
oscope+0.971
 DRAG+0.942
 Turtles+0.920
Neg logit contributions
 misunder-0.774
Pri-0.712
1945-0.660
 differ-0.655
esa-0.651
Token showed
Token ID3751
Feature activation-5.48e-01

Pos logit contributions
 SATA+0.598
HAM+0.592
oscope+0.578
 DRAG+0.559
 Turtles+0.544
Neg logit contributions
 misunder-0.488
Pri-0.453
1945-0.421
 differ-0.419
 attitudes-0.415
Token it
Token ID340
Feature activation-5.04e-01

Pos logit contributions
 misunder+0.347
Pri+0.322
 differ+0.298
1945+0.297
esa+0.294
Neg logit contributions
 SATA-0.448
HAM-0.443
oscope-0.433
 DRAG-0.420
 Turtles-0.409
Token degrees
Token ID7370
Feature activation+7.15e-01

Pos logit contributions
 SATA+0.734
HAM+0.720
oscope+0.700
 DRAG+0.673
 Turtles+0.653
Neg logit contributions
 misunder-0.717
Pri-0.668
Initialized-0.619
1945-0.617
esa-0.617
Token east
Token ID7627
Feature activation-2.10e-01

Pos logit contributions
 SATA+0.654
HAM+0.643
oscope+0.631
 DRAG+0.614
 Turtles+0.599
Neg logit contributions
 misunder-0.384
Pri-0.357
1945-0.320
 differ-0.316
Initialized-0.315
Token in
Token ID287
Feature activation+9.47e-03

Pos logit contributions
 misunder+0.130
Pri+0.121
1945+0.111
 differ+0.111
 attitudes+0.110
Neg logit contributions
 SATA-0.175
HAM-0.173
oscope-0.169
 DRAG-0.164
 Turtles-0.159
Token Mare
Token ID36989
Feature activation-1.98e+00

Pos logit contributions
 SATA+0.008
HAM+0.008
oscope+0.008
 DRAG+0.008
 Turtles+0.008
Neg logit contributions
 misunder-0.005
Pri-0.005
 differ-0.005
1945-0.005
 attitudes-0.005
Token Tr
Token ID833
Feature activation-7.25e-01

Pos logit contributions
 misunder+1.403
Pri+1.333
 differ+1.245
esa+1.238
 attitudes+1.235
Neg logit contributions
 SATA-1.444
HAM-1.435
oscope-1.397
 DRAG-1.350
 Turtles-1.302
Tokenanqu
Token ID26184
Feature activation-5.23e+00

Pos logit contributions
 misunder+0.284
Pri+0.253
 differ+0.220
1945+0.218
 attitudes+0.217
Neg logit contributions
 SATA-0.767
HAM-0.762
oscope-0.748
 DRAG-0.730
 Turtles-0.714
Tokenill
Token ID359
Feature activation-1.48e-01

Pos logit contributions
 misunder+3.297
Pri+3.182
esa+2.992
idad+2.978
 differ+2.921
Neg logit contributions
HAM-4.149
 SATA-4.124
 DRAG-3.979
oscope-3.939
 Turtles-3.748
Tokenit
Token ID270
Feature activation-5.12e-01

Pos logit contributions
 misunder+0.105
Pri+0.099
 differ+0.092
1945+0.092
 attitudes+0.092
Neg logit contributions
 SATA-0.109
HAM-0.108
oscope-0.105
 DRAG-0.101
 Turtles-0.098
Tokenatis
Token ID37749
Feature activation-8.47e-01

Pos logit contributions
 misunder+0.432
Pri+0.412
 differ+0.390
1945+0.387
 attitudes+0.386
Neg logit contributions
 SATA-0.308
HAM-0.307
oscope-0.294
 DRAG-0.283
 Turtles-0.272
Token,
Token ID11
Feature activation-7.01e-02

Pos logit contributions
 misunder+0.580
Pri+0.545
 differ+0.507
1945+0.504
 attitudes+0.504
Neg logit contributions
 SATA-0.647
HAM-0.641
oscope-0.625
 DRAG-0.604
 Turtles-0.586
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 misunder+0.045
Pri+0.042
 differ+0.038
1945+0.038
 attitudes+0.038
Neg logit contributions
 SATA-0.057
HAM-0.057
oscope-0.055
 DRAG-0.054
 Turtles-0.052
Token seen
Token ID1775
Feature activation+4.73e-01

Pos logit contributions
 SATA+0.400
HAM+0.396
oscope+0.386
 DRAG+0.372
 Turtles+0.361
Neg logit contributions
 misunder-0.374
Pri-0.352
 differ-0.328
1945-0.327
 attitudes-0.326
Token below
Token ID2174
Feature activation-6.27e-01

Pos logit contributions
 SATA+0.387
HAM+0.383
oscope+0.375
 DRAG+0.362
 Turtles+0.354
Neg logit contributions
 misunder-0.299
Pri-0.278
1945-0.256
 differ-0.256
 attitudes-0.254
Token.
Token ID13
Feature activation-7.71e-01

Pos logit contributions
 misunder+0.414
Pri+0.388
 differ+0.361
1945+0.359
 attitudes+0.358
Neg logit contributions
 SATA-0.494
HAM-0.490
oscope-0.477
 DRAG-0.462
 Turtles-0.449
Token
Token ID198
Feature activation-5.91e-01

Pos logit contributions
 misunder+0.518
Pri+0.496
 differ+0.456
1945+0.455
 attitudes+0.451
Neg logit contributions
 SATA-0.592
HAM-0.586
oscope-0.572
 DRAG-0.553
antage-0.535
Token
Token ID198
Feature activation-5.88e-01

Pos logit contributions
Pri+0.351
 misunder+0.339
1945+0.325
 differ+0.317
Zero+0.317
Neg logit contributions
 SATA-0.483
HAM-0.479
oscope-0.473
 DRAG-0.463
antage-0.447
TokenPri
Token ID34487
Feature activation-5.22e+00

Pos logit contributions
 misunder+0.371
Pri+0.348
 differ+0.320
1945+0.320
 attitudes+0.318
Neg logit contributions
 SATA-0.481
HAM-0.476
oscope-0.465
 DRAG-0.451
 Turtles-0.439
Tokenest
Token ID395
Feature activation-1.79e-01

Pos logit contributions
Pri+3.040
 misunder+3.025
 differ+2.701
esa+2.669
poral+2.661
Neg logit contributions
 SATA-4.369
HAM-4.306
 DRAG-4.208
oscope-4.153
 Turtles-3.996
Token Cont
Token ID2345
Feature activation+6.82e-02

Pos logit contributions
 misunder+0.123
Pri+0.115
 differ+0.107
1945+0.106
 attitudes+0.106
Neg logit contributions
 SATA-0.137
HAM-0.135
oscope-0.132
 DRAG-0.128
 Turtles-0.124
Tokenacts
Token ID8656
Feature activation-1.74e-01

Pos logit contributions
 SATA+0.055
HAM+0.054
oscope+0.053
 DRAG+0.051
 Turtles+0.050
Neg logit contributions
 misunder-0.044
Pri-0.041
 differ-0.038
1945-0.038
 attitudes-0.038
Token :
Token ID1058
Feature activation-1.09e-01

Pos logit contributions
 misunder+0.117
Pri+0.109
 differ+0.101
1945+0.101
 attitudes+0.101
Neg logit contributions
 SATA-0.136
HAM-0.135
oscope-0.131
 DRAG-0.127
 Turtles-0.123
Token
Token ID198
Feature activation-6.76e-01

Pos logit contributions
 misunder+0.076
Pri+0.071
 differ+0.066
1945+0.066
 attitudes+0.066
Neg logit contributions
 SATA-0.083
HAM-0.082
oscope-0.080
 DRAG-0.077
 Turtles-0.075
Token consider
Token ID2074
Feature activation-5.82e-01

Pos logit contributions
 SATA+0.181
HAM+0.179
oscope+0.175
 DRAG+0.169
 Turtles+0.165
Neg logit contributions
 misunder-0.150
Pri-0.140
 differ-0.130
1945-0.129
 attitudes-0.129
Token Clinton
Token ID2605
Feature activation-3.32e-01

Pos logit contributions
 misunder+0.382
Pri+0.357
 differ+0.330
1945+0.329
 attitudes+0.328
Neg logit contributions
 SATA-0.462
HAM-0.458
oscope-0.447
 DRAG-0.432
 Turtles-0.420
Token as
Token ID355
Feature activation+2.24e-01

Pos logit contributions
 misunder+0.228
Pri+0.213
 differ+0.198
1945+0.198
 attitudes+0.196
Neg logit contributions
 SATA-0.254
HAM-0.252
oscope-0.246
 DRAG-0.238
 Turtles-0.231
Token a
Token ID257
Feature activation-2.07e-01

Pos logit contributions
 SATA+0.189
HAM+0.187
oscope+0.183
 DRAG+0.177
 Turtles+0.173
Neg logit contributions
 misunder-0.137
Pri-0.127
1945-0.116
 differ-0.116
 attitudes-0.115
Token running
Token ID2491
Feature activation-1.17e+00

Pos logit contributions
 misunder+0.132
Pri+0.123
 differ+0.115
1945+0.114
 attitudes+0.113
Neg logit contributions
 SATA-0.168
HAM-0.167
oscope-0.163
 DRAG-0.158
 Turtles-0.153
Token mate
Token ID16133
Feature activation-5.13e+00

Pos logit contributions
 misunder+0.372
Pri+0.327
 differ+0.279
 attitudes+0.271
1945+0.269
Neg logit contributions
 SATA-1.323
HAM-1.317
oscope-1.293
 DRAG-1.269
 Turtles-1.241
Token in
Token ID287
Feature activation+9.99e-02

Pos logit contributions
 misunder+3.204
Pri+3.097
 differ+2.884
 attitudes+2.850
1945+2.825
Neg logit contributions
 SATA-4.101
HAM-4.063
oscope-3.945
 DRAG-3.867
lishes-3.743
Token the
Token ID262
Feature activation-8.21e-02

Pos logit contributions
 SATA+0.083
HAM+0.082
oscope+0.080
 DRAG+0.078
 Turtles+0.075
Neg logit contributions
 misunder-0.062
Pri-0.058
 differ-0.053
1945-0.053
 attitudes-0.053
Token 2016
Token ID1584
Feature activation+5.41e-01

Pos logit contributions
 misunder+0.049
Pri+0.046
 differ+0.042
1945+0.042
 attitudes+0.042
Neg logit contributions
 SATA-0.070
HAM-0.069
oscope-0.068
 DRAG-0.066
 Turtles-0.064
Token presidential
Token ID4787
Feature activation-6.81e-01

Pos logit contributions
 SATA+0.448
HAM+0.444
oscope+0.434
 DRAG+0.420
 Turtles+0.409
Neg logit contributions
 misunder-0.336
Pri-0.313
 differ-0.288
1945-0.287
 attitudes-0.286
Token election
Token ID3071
Feature activation-6.09e-01

Pos logit contributions
 misunder+0.425
Pri+0.399
 differ+0.369
 attitudes+0.367
1945+0.366
Neg logit contributions
 SATA-0.560
HAM-0.555
oscope-0.541
 DRAG-0.525
 Turtles-0.510
Token Jordan
Token ID8078
Feature activation+1.64e+00

Pos logit contributions
 SATA+0.374
HAM+0.370
oscope+0.362
 DRAG+0.351
 Turtles+0.341
Neg logit contributions
 misunder-0.313
Pri-0.290
1945-0.270
 differ-0.269
 attitudes-0.267
Token Kal
Token ID12612
Feature activation-2.15e+00

Pos logit contributions
 SATA+1.178
HAM+1.163
oscope+1.133
 DRAG+1.087
 Turtles+1.060
Neg logit contributions
 misunder-1.212
Pri-1.135
 differ-1.059
1945-1.053
 attitudes-1.049
Tokenish
Token ID680
Feature activation+7.90e-01

Pos logit contributions
 misunder+1.560
Pri+1.499
esa+1.402
1945+1.402
 differ+1.400
Neg logit contributions
 SATA-1.525
HAM-1.512
oscope-1.456
 DRAG-1.433
 Turtles-1.367
Token
Token ID198
Feature activation-1.32e-01

Pos logit contributions
 SATA+0.584
HAM+0.579
oscope+0.566
 DRAG+0.543
 Turtles+0.528
Neg logit contributions
 misunder-0.564
Pri-0.526
 differ-0.492
1945-0.490
 attitudes-0.487
Token
Token ID198
Feature activation-2.71e-01

Pos logit contributions
 misunder+0.082
Pri+0.082
1945+0.075
 differ+0.074
Zero+0.074
Neg logit contributions
 SATA-0.105
HAM-0.104
oscope-0.102
 DRAG-0.100
 Turtles-0.096
TokenSurv
Token ID34652
Feature activation-5.10e+00

Pos logit contributions
 misunder+0.171
Pri+0.157
 differ+0.147
 attitudes+0.146
1945+0.145
Neg logit contributions
 SATA-0.223
HAM-0.222
oscope-0.216
 DRAG-0.209
 Turtles-0.203
Tokeniv
Token ID452
Feature activation-1.58e+00

Pos logit contributions
 misunder+2.489
Pri+2.394
1945+2.142
 differ+2.130
 attitudes+2.096
Neg logit contributions
 SATA-4.781
HAM-4.746
oscope-4.618
 DRAG-4.554
 Turtles-4.447
Tokenor
Token ID273
Feature activation-1.18e+00

Pos logit contributions
 misunder+0.888
Pri+0.824
 differ+0.753
1945+0.749
 attitudes+0.745
Neg logit contributions
 SATA-1.398
HAM-1.387
oscope-1.355
 DRAG-1.319
 Turtles-1.285
Token
Token ID447
Feature activation-4.01e-02

Pos logit contributions
 misunder+0.780
Pri+0.732
 differ+0.680
1945+0.676
 attitudes+0.674
Neg logit contributions
 SATA-0.926
HAM-0.918
oscope-0.896
 DRAG-0.866
 Turtles-0.841
Token
Token ID247
Feature activation+2.78e-01

Pos logit contributions
 misunder+0.031
Pri+0.029
 differ+0.027
1945+0.027
 attitudes+0.027
Neg logit contributions
 SATA-0.027
HAM-0.027
oscope-0.026
 DRAG-0.025
 Turtles-0.025
Tokens
Token ID82
Feature activation+2.85e-01

Pos logit contributions
 SATA+0.205
HAM+0.204
oscope+0.197
 DRAG+0.192
 Turtles+0.185
Neg logit contributions
 misunder-0.199
Pri-0.184
 differ-0.173
1945-0.172
 attitudes-0.172
Token to
Token ID284
Feature activation+1.74e-01

Pos logit contributions
 SATA+0.230
HAM+0.228
oscope+0.222
 DRAG+0.214
 Turtles+0.208
Neg logit contributions
 misunder-0.214
Pri-0.201
 differ-0.187
1945-0.186
 attitudes-0.186
Token lose
Token ID4425
Feature activation-1.09e+00

Pos logit contributions
 SATA+0.134
HAM+0.133
oscope+0.129
 DRAG+0.125
 Turtles+0.121
Neg logit contributions
 misunder-0.119
Pri-0.111
 differ-0.104
1945-0.103
 attitudes-0.103
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
 misunder+0.752
Pri+0.707
 differ+0.660
 attitudes+0.658
1945+0.655
Neg logit contributions
 SATA-0.826
HAM-0.818
oscope-0.797
 DRAG-0.772
 Turtles-0.748
Token
Token ID198
Feature activation+7.72e-01

Pos logit contributions
 SATA+0.243
HAM+0.241
oscope+0.235
 DRAG+0.227
 Turtles+0.220
Neg logit contributions
 misunder-0.213
Pri-0.201
 differ-0.186
1945-0.186
 attitudes-0.185
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
 SATA+0.610
HAM+0.605
oscope+0.593
 DRAG+0.577
 Turtles+0.558
Neg logit contributions
 misunder-0.490
Pri-0.477
1945-0.442
 differ-0.439
 attitudes-0.434
TokenAdapt
Token ID48003
Feature activation-5.10e+00

Pos logit contributions
 SATA+0.347
HAM+0.345
oscope+0.336
 DRAG+0.325
 Turtles+0.316
Neg logit contributions
 misunder-0.261
Pri-0.240
 differ-0.224
 attitudes-0.222
1945-0.221
Tokened
Token ID276
Feature activation-2.03e+00

Pos logit contributions
 misunder+2.971
Pri+2.777
 differ+2.556
 attitudes+2.554
1945+2.539
Neg logit contributions
 SATA-4.381
HAM-4.331
oscope-4.221
 DRAG-4.145
 Turtles-4.007
Token from
Token ID422
Feature activation-5.06e-01

Pos logit contributions
 misunder+1.161
Pri+1.076
 differ+0.989
1945+0.981
 attitudes+0.981
Neg logit contributions
 SATA-1.777
HAM-1.764
oscope-1.725
 DRAG-1.678
 Turtles-1.630
Token the
Token ID262
Feature activation-7.01e-02

Pos logit contributions
 misunder+0.351
Pri+0.330
 differ+0.307
1945+0.306
 attitudes+0.305
Neg logit contributions
 SATA-0.382
HAM-0.379
oscope-0.369
 DRAG-0.357
 Turtles-0.346
Token forthcoming
Token ID21220
Feature activation+1.22e-01

Pos logit contributions
 misunder+0.049
Pri+0.046
 differ+0.043
1945+0.043
 attitudes+0.043
Neg logit contributions
 SATA-0.052
HAM-0.052
oscope-0.050
 DRAG-0.049
 Turtles-0.047
Token book
Token ID1492
Feature activation-7.95e-02

Pos logit contributions
 SATA+0.090
HAM+0.089
oscope+0.087
 DRAG+0.084
 Turtles+0.081
Neg logit contributions
 misunder-0.086
Pri-0.081
 differ-0.076
1945-0.076
 attitudes-0.076
Token,
Token ID11
Feature activation-7.01e-02

Pos logit contributions
 misunder+0.580
Pri+0.545
 differ+0.507
1945+0.504
 attitudes+0.504
Neg logit contributions
 SATA-0.647
HAM-0.641
oscope-0.625
 DRAG-0.604
 Turtles-0.586
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 misunder+0.045
Pri+0.042
 differ+0.038
1945+0.038
 attitudes+0.038
Neg logit contributions
 SATA-0.057
HAM-0.057
oscope-0.055
 DRAG-0.054
 Turtles-0.052
Token Sea
Token ID6896
Feature activation+2.08e+00

Pos logit contributions
 misunder+0.069
Pri+0.064
 differ+0.059
 attitudes+0.059
1945+0.059
Neg logit contributions
 SATA-0.087
HAM-0.087
oscope-0.084
 DRAG-0.082
 Turtles-0.079
Token of
Token ID286
Feature activation+1.03e-01

Pos logit contributions
 SATA+1.755
HAM+1.735
oscope+1.702
 DRAG+1.648
 Turtles+1.622
Neg logit contributions
 misunder-1.258
Pri-1.156
1945-1.069
 differ-1.064
 attitudes-1.060
Token Tr
Token ID833
Feature activation-5.80e-01

Pos logit contributions
 SATA+0.074
HAM+0.073
oscope+0.072
 DRAG+0.069
 Turtles+0.067
Neg logit contributions
 misunder-0.075
Pri-0.070
1945-0.066
 differ-0.066
 attitudes-0.065
Tokenanqu
Token ID26184
Feature activation-5.09e+00

Pos logit contributions
 misunder+0.139
Pri+0.115
 differ+0.089
1945+0.088
 attitudes+0.086
Neg logit contributions
 SATA-0.702
HAM-0.697
oscope-0.686
 DRAG-0.672
 Turtles-0.660
Tokenility
Token ID879
Feature activation-1.09e+00

Pos logit contributions
 misunder+3.846
Pri+3.726
esa+3.513
idad+3.513
 differ+3.482
Neg logit contributions
HAM-3.394
 SATA-3.373
oscope-3.235
 DRAG-3.214
 Turtles-3.012
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
 misunder+0.746
Pri+0.702
 differ+0.654
1945+0.649
 attitudes+0.649
Neg logit contributions
 SATA-0.831
HAM-0.824
oscope-0.803
 DRAG-0.776
 Turtles-0.752
Token After
Token ID2293
Feature activation+1.84e+00

Pos logit contributions
 misunder+0.094
Pri+0.089
1945+0.083
 differ+0.083
 attitudes+0.082
Neg logit contributions
 SATA-0.099
HAM-0.098
oscope-0.095
 DRAG-0.092
 Turtles-0.089
Token several
Token ID1811
Feature activation-6.92e-01

Pos logit contributions
 SATA+1.482
HAM+1.461
oscope+1.431
 DRAG+1.388
 Turtles+1.352
Neg logit contributions
 misunder-1.184
Pri-1.103
 differ-1.019
1945-1.013
 attitudes-1.002
Token short
Token ID1790
Feature activation-6.35e-01

Pos logit contributions
 misunder+0.398
Pri+0.368
 differ+0.338
1945+0.335
 attitudes+0.335
Neg logit contributions
 SATA-0.606
HAM-0.601
oscope-0.588
 DRAG-0.571
 Turtles-0.556
Token 9
Token ID860
Feature activation+2.45e-01

Pos logit contributions
 SATA+0.214
HAM+0.213
oscope+0.208
 DRAG+0.202
 Turtles+0.195
Neg logit contributions
 misunder-0.180
Pri-0.168
 differ-0.157
1945-0.155
 attitudes-0.155
Token/
Token ID14
Feature activation-4.00e-01

Pos logit contributions
 SATA+0.187
HAM+0.185
oscope+0.180
 DRAG+0.174
 Turtles+0.169
Neg logit contributions
 misunder-0.168
Pri-0.157
 differ-0.146
1945-0.145
 attitudes-0.145
Token11
Token ID1157
Feature activation+5.03e-01

Pos logit contributions
 misunder+0.273
Pri+0.249
 differ+0.235
 attitudes+0.234
 kitchens+0.231
Neg logit contributions
 SATA-0.310
HAM-0.310
oscope-0.301
 DRAG-0.287
 Turtles-0.282
Token GI
Token ID30616
Feature activation+4.72e-01

Pos logit contributions
 SATA+0.363
HAM+0.359
oscope+0.350
 DRAG+0.337
 Turtles+0.326
Neg logit contributions
 misunder-0.367
Pri-0.346
 differ-0.323
1945-0.322
 attitudes-0.321
Token Bill
Token ID3941
Feature activation-8.11e-01

Pos logit contributions
 SATA+0.418
HAM+0.416
oscope+0.406
 DRAG+0.393
 Turtles+0.385
Neg logit contributions
 misunder-0.266
Pri-0.244
 differ-0.223
1945-0.222
 attitudes-0.220
Token ®
Token ID38852
Feature activation-5.03e+00

Pos logit contributions
 misunder+0.555
Pri+0.520
 differ+0.485
1945+0.483
 attitudes+0.481
Neg logit contributions
 SATA-0.620
HAM-0.614
oscope-0.599
 DRAG-0.580
 Turtles-0.562
Token.
Token ID764
Feature activation-6.13e-01

Pos logit contributions
 misunder+3.301
Pri+3.155
 differ+2.929
1945+2.923
esa+2.905
Neg logit contributions
 SATA-3.858
HAM-3.810
oscope-3.752
 DRAG-3.666
 Turtles-3.503
Token
Token ID198
Feature activation+4.50e-01

Pos logit contributions
 misunder+0.431
Pri+0.411
1945+0.382
 differ+0.381
 attitudes+0.378
Neg logit contributions
 SATA-0.451
HAM-0.447
oscope-0.438
 DRAG-0.422
 Turtles-0.407
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
 SATA+0.352
HAM+0.347
oscope+0.340
 DRAG+0.330
 Turtles+0.320
Neg logit contributions
 misunder-0.295
Pri-0.283
1945-0.261
 differ-0.260
 attitudes-0.258
TokenWhat
Token ID2061
Feature activation-6.77e-01

Pos logit contributions
 SATA+0.153
HAM+0.153
oscope+0.148
 DRAG+0.143
 Turtles+0.139
Neg logit contributions
 misunder-0.126
Pri-0.116
 differ-0.109
 attitudes-0.108
1945-0.107
Token is
Token ID318
Feature activation-1.87e-01

Pos logit contributions
 misunder+0.440
Pri+0.412
 differ+0.386
 attitudes+0.383
1945+0.381
Neg logit contributions
 SATA-0.539
HAM-0.536
oscope-0.524
 DRAG-0.507
 Turtles-0.491
TokenF
Token ID37
Feature activation-5.53e-02

Pos logit contributions
 SATA+0.768
HAM+0.762
oscope+0.742
 DRAG+0.718
 Turtles+0.695
Neg logit contributions
 misunder-0.627
Pri-0.585
 differ-0.544
 attitudes-0.542
1945-0.541
Tokenailed
Token ID6255
Feature activation+1.26e-01

Pos logit contributions
 misunder+0.029
Pri+0.026
 differ+0.024
1945+0.024
 attitudes+0.024
Neg logit contributions
 SATA-0.051
HAM-0.051
oscope-0.050
 DRAG-0.048
 Turtles-0.047
Token cannot
Token ID2314
Feature activation+2.15e-01

Pos logit contributions
 SATA+0.098
HAM+0.097
oscope+0.095
 DRAG+0.092
 Turtles+0.089
Neg logit contributions
 misunder-0.085
Pri-0.079
 differ-0.074
1945-0.073
 attitudes-0.073
Token find
Token ID1064
Feature activation+4.01e-01

Pos logit contributions
 SATA+0.145
HAM+0.144
oscope+0.139
 DRAG+0.134
 Turtles+0.130
Neg logit contributions
 misunder-0.166
Pri-0.157
 differ-0.149
1945-0.147
 attitudes-0.147
Token the
Token ID262
Feature activation+3.66e-01

Pos logit contributions
 SATA+0.296
HAM+0.292
oscope+0.285
 DRAG+0.275
 Turtles+0.266
Neg logit contributions
 misunder-0.286
Pri-0.269
 differ-0.250
1945-0.250
 attitudes-0.249
Token current
Token ID1459
Feature activation+3.83e+00

Pos logit contributions
 SATA+0.266
HAM+0.264
oscope+0.257
 DRAG+0.247
 Turtles+0.241
Neg logit contributions
 misunder-0.264
Pri-0.248
 differ-0.233
 attitudes-0.231
1945-0.231
Token wallpaper
Token ID39328
Feature activation+2.43e-01

Pos logit contributions
 SATA+2.691
HAM+2.642
oscope+2.583
 DRAG+2.490
 Turtles+2.371
Neg logit contributions
 misunder-2.857
Pri-2.708
1945-2.538
 differ-2.489
 attitudes-2.484
Token $
Token ID720
Feature activation+2.24e-01

Pos logit contributions
 SATA+0.189
HAM+0.187
oscope+0.182
 DRAG+0.176
 Turtles+0.172
Neg logit contributions
 misunder-0.163
Pri-0.153
 differ-0.142
1945-0.141
 attitudes-0.141
Token($
Token ID16763
Feature activation-5.82e-01

Pos logit contributions
 SATA+0.151
HAM+0.149
oscope+0.145
 DRAG+0.139
 Turtles+0.134
Neg logit contributions
 misunder-0.174
Pri-0.164
 differ-0.154
1945-0.153
 attitudes-0.153
Tokenw
Token ID86
Feature activation-3.81e-02

Pos logit contributions
 misunder+0.438
Pri+0.414
 differ+0.387
1945+0.386
 attitudes+0.384
Neg logit contributions
 SATA-0.406
HAM-0.400
oscope-0.389
 DRAG-0.376
 Turtles-0.365
Tokenpath
Token ID6978
Feature activation+2.08e+00

Pos logit contributions
 misunder+0.020
Pri+0.020
1945+0.018
 differ+0.018
esa+0.018
Neg logit contributions
 SATA-0.034
HAM-0.034
 DRAG-0.033
oscope-0.033
 Turtles-0.032
Token all
Token ID477
Feature activation-6.83e-01

Pos logit contributions
 misunder+0.315
Pri+0.295
 differ+0.276
 attitudes+0.274
1945+0.271
Neg logit contributions
 SATA-0.425
HAM-0.422
oscope-0.412
 DRAG-0.403
 Turtles-0.388
Token familiar
Token ID5385
Feature activation+9.21e-01

Pos logit contributions
 misunder+0.482
Pri+0.454
 differ+0.427
 attitudes+0.424
1945+0.422
Neg logit contributions
 SATA-0.507
HAM-0.501
oscope-0.489
 DRAG-0.474
 Turtles-0.457
Token with
Token ID351
Feature activation+3.64e-01

Pos logit contributions
 SATA+0.756
HAM+0.745
oscope+0.734
 DRAG+0.701
 Turtles+0.691
Neg logit contributions
 misunder-0.580
Pri-0.537
1945-0.500
 differ-0.495
esa-0.491
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
 SATA+0.278
HAM+0.275
oscope+0.268
 DRAG+0.259
 Turtles+0.252
Neg logit contributions
 misunder-0.250
Pri-0.234
 differ-0.218
1945-0.217
 attitudes-0.216
Token from
Token ID422
Feature activation-3.96e-01

Pos logit contributions
 SATA+0.083
HAM+0.082
oscope+0.080
 DRAG+0.078
 Turtles+0.076
Neg logit contributions
 misunder-0.063
Pri-0.059
 differ-0.055
 attitudes-0.054
1945-0.054
Token catalog
Token ID18388
Feature activation+3.95e+00

Pos logit contributions
 misunder+0.239
Pri+0.223
 differ+0.205
1945+0.203
 attitudes+0.202
Neg logit contributions
 SATA-0.335
HAM-0.331
oscope-0.324
 DRAG-0.314
 Turtles-0.306
Tokening
Token ID278
Feature activation+3.33e-01

Pos logit contributions
 SATA+2.979
HAM+2.914
oscope+2.874
 DRAG+2.741
 Turtles+2.686
Neg logit contributions
 misunder-2.740
Pri-2.573
 differ-2.374
1945-2.367
Initialized-2.350
Token various
Token ID2972
Feature activation+1.11e+00

Pos logit contributions
 SATA+0.281
HAM+0.277
oscope+0.271
 DRAG+0.263
 Turtles+0.257
Neg logit contributions
 misunder-0.202
Pri-0.188
 differ-0.172
1945-0.171
esa-0.170
Token marketing
Token ID7124
Feature activation+3.68e-01

Pos logit contributions
 SATA+0.940
HAM+0.926
oscope+0.906
 DRAG+0.877
 Turtles+0.858
Neg logit contributions
 misunder-0.679
Pri-0.634
 differ-0.578
1945-0.576
esa-0.572
Token slogans
Token ID31139
Feature activation-9.58e-01

Pos logit contributions
 SATA+0.357
HAM+0.354
oscope+0.347
 DRAG+0.338
 Turtles+0.330
Neg logit contributions
 misunder-0.177
Pri-0.161
 differ-0.145
1945-0.144
 attitudes-0.143
Token to
Token ID284
Feature activation+2.20e-01

Pos logit contributions
 misunder+0.614
Pri+0.574
 differ+0.537
 attitudes+0.533
1945+0.532
Neg logit contributions
 SATA-0.770
HAM-0.766
oscope-0.747
 DRAG-0.724
 Turtles-0.702
Token caught
Token ID4978
Feature activation+1.03e+00

Pos logit contributions
 SATA+0.293
HAM+0.289
oscope+0.284
 DRAG+0.275
 Turtles+0.267
Neg logit contributions
 misunder-0.211
Pri-0.196
1945-0.180
 attitudes-0.179
 differ-0.178
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
 SATA+0.764
HAM+0.752
oscope+0.736
 DRAG+0.707
 Turtles+0.689
Neg logit contributions
 misunder-0.736
Pri-0.693
1945-0.641
esa-0.638
 attitudes-0.638
Token There
Token ID1318
Feature activation-9.48e-01

Pos logit contributions
 SATA+0.244
HAM+0.242
oscope+0.236
 DRAG+0.226
 Turtles+0.219
Neg logit contributions
 misunder-0.255
Pri-0.241
 differ-0.226
1945-0.225
 attitudes-0.224
Token is
Token ID318
Feature activation-5.30e-02

Pos logit contributions
 misunder+0.593
Pri+0.555
 differ+0.514
1945+0.509
 attitudes+0.508
Neg logit contributions
 SATA-0.780
HAM-0.774
oscope-0.755
 DRAG-0.732
 Turtles-0.712
Token one
Token ID530
Feature activation+1.04e+00

Pos logit contributions
 misunder+0.032
Pri+0.030
1945+0.028
 differ+0.028
 attitudes+0.028
Neg logit contributions
 SATA-0.045
HAM-0.044
oscope-0.043
 DRAG-0.042
 Turtles-0.041
Token exception
Token ID6631
Feature activation+3.66e+00

Pos logit contributions
 SATA+0.828
HAM+0.811
oscope+0.794
 DRAG+0.771
 Turtles+0.747
Neg logit contributions
 misunder-0.684
Pri-0.638
1945-0.590
 attitudes-0.585
 differ-0.584
Token:
Token ID25
Feature activation+1.50e+00

Pos logit contributions
 SATA+2.766
HAM+2.717
oscope+2.672
 DRAG+2.534
antage+2.488
Neg logit contributions
 misunder-2.566
Pri-2.356
1945-2.177
 attitudes-2.161
 kitchens-2.149
Token "
Token ID366
Feature activation+8.14e-01

Pos logit contributions
 SATA+1.206
HAM+1.181
oscope+1.158
 DRAG+1.130
 Turtles+1.098
Neg logit contributions
 misunder-0.966
Pri-0.902
1945-0.832
 differ-0.822
esa-0.821
Tokenvi
Token ID8903
Feature activation+1.64e+00

Pos logit contributions
 SATA+0.661
HAM+0.654
oscope+0.642
 DRAG+0.618
 Turtles+0.599
Neg logit contributions
 misunder-0.520
Pri-0.480
 differ-0.446
 attitudes-0.444
1945-0.443
Token:"
Token ID11097
Feature activation+9.55e-02

Pos logit contributions
 SATA+1.297
HAM+1.283
oscope+1.261
 DRAG+1.203
 Turtles+1.175
Neg logit contributions
 misunder-1.080
Pri-1.005
1945-0.929
 kitchens-0.923
 attitudes-0.921
Token and
Token ID290
Feature activation+6.26e-01

Pos logit contributions
 SATA+0.076
HAM+0.075
oscope+0.073
 DRAG+0.071
 Turtles+0.069
Neg logit contributions
 misunder-0.063
Pri-0.059
1945-0.054
 differ-0.054
 attitudes-0.054
Token already
Token ID1541
Feature activation-1.16e+00

Pos logit contributions
 misunder+0.324
Pri+0.300
 differ+0.274
1945+0.272
 attitudes+0.271
Neg logit contributions
 SATA-0.511
HAM-0.506
oscope-0.495
 DRAG-0.481
 Turtles-0.469
Token occurred
Token ID5091
Feature activation-2.64e+00

Pos logit contributions
 misunder+0.612
Pri+0.565
 differ+0.515
1945+0.510
 attitudes+0.510
Neg logit contributions
 SATA-1.068
HAM-1.058
oscope-1.035
 DRAG-1.010
 Turtles-0.982
Token,
Token ID11
Feature activation-3.34e-01

Pos logit contributions
 misunder+1.685
Pri+1.601
 attitudes+1.488
 differ+1.487
1945+1.471
Neg logit contributions
 SATA-2.113
HAM-2.084
oscope-2.033
 DRAG-1.975
 Turtles-1.920
Token
Token ID447
Feature activation-6.29e-01

Pos logit contributions
 misunder+0.202
Pri+0.188
 differ+0.173
1945+0.172
 attitudes+0.171
Neg logit contributions
 SATA-0.283
HAM-0.280
oscope-0.274
 DRAG-0.266
 Turtles-0.259
Token
Token ID251
Feature activation-6.03e-01

Pos logit contributions
 misunder+0.459
Pri+0.435
 differ+0.407
1945+0.405
 attitudes+0.405
Neg logit contributions
 SATA-0.450
HAM-0.443
oscope-0.434
 DRAG-0.417
 Turtles-0.405
Token Gol
Token ID26849
Feature activation+3.36e+00

Pos logit contributions
 misunder+0.368
Pri+0.342
 differ+0.315
1945+0.314
 attitudes+0.312
Neg logit contributions
 SATA-0.507
HAM-0.503
oscope-0.491
 DRAG-0.476
 Turtles-0.464
Tokenight
Token ID432
Feature activation+1.62e+00

Pos logit contributions
 SATA+1.389
HAM+1.378
oscope+1.323
 DRAG+1.207
 Turtles+1.145
Neg logit contributions
 misunder-3.490
Pri-3.322
 differ-3.174
 attitudes-3.164
1945-3.159
Tokenly
Token ID306
Feature activation+5.48e-01

Pos logit contributions
 SATA+1.318
HAM+1.311
oscope+1.279
 DRAG+1.225
 Turtles+1.198
Neg logit contributions
 misunder-1.044
Pri-0.950
 differ-0.880
 attitudes-0.875
1945-0.869
Token-
Token ID12
Feature activation-2.20e-01

Pos logit contributions
HAM+0.330
 SATA+0.329
oscope+0.317
 DRAG+0.301
 Turtles+0.292
Neg logit contributions
 misunder-0.462
Pri-0.438
 differ-0.414
1945-0.414
 attitudes-0.411
TokenHow
Token ID2437
Feature activation-6.40e-01

Pos logit contributions
 misunder+0.127
Pri+0.120
1945+0.109
 differ+0.109
 attitudes+0.108
Neg logit contributions
 SATA-0.191
HAM-0.188
oscope-0.185
 DRAG-0.180
 Turtles-0.175
Tokenell
Token ID695
Feature activation-7.06e-02

Pos logit contributions
 misunder+0.457
Pri+0.455
1945+0.428
esa+0.427
 differ+0.427
Neg logit contributions
 SATA-0.447
HAM-0.439
oscope-0.425
 DRAG-0.424
 Turtles-0.406
Token
Token ID198
Feature activation-8.72e-02

Pos logit contributions
 SATA+0.131
HAM+0.130
oscope+0.128
 DRAG+0.125
 Turtles+0.121
Neg logit contributions
Pri-0.098
 misunder-0.098
1945-0.091
 differ-0.090
Zero-0.089
TokenThough
Token ID10915
Feature activation-8.54e-01

Pos logit contributions
 misunder+0.053
Pri+0.049
 differ+0.046
 attitudes+0.045
1945+0.045
Neg logit contributions
 SATA-0.073
HAM-0.073
oscope-0.071
 DRAG-0.069
 Turtles-0.067
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
 misunder+0.526
Pri+0.490
 differ+0.453
1945+0.450
 attitudes+0.450
Neg logit contributions
 SATA-0.711
HAM-0.705
oscope-0.689
 DRAG-0.668
 Turtles-0.649
Token game
Token ID983
Feature activation+2.51e-01

Pos logit contributions
 misunder+0.195
Pri+0.182
 differ+0.168
 attitudes+0.167
1945+0.167
Neg logit contributions
 SATA-0.264
HAM-0.262
oscope-0.256
 DRAG-0.248
 Turtles-0.241
Token's
Token ID338
Feature activation+3.24e-01

Pos logit contributions
 SATA+0.202
HAM+0.201
oscope+0.196
 DRAG+0.190
 Turtles+0.184
Neg logit contributions
 misunder-0.161
Pri-0.151
 differ-0.140
1945-0.139
 attitudes-0.138
Token popularity
Token ID11533
Feature activation+3.66e+00

Pos logit contributions
 SATA+0.247
HAM+0.244
oscope+0.238
 DRAG+0.230
 Turtles+0.223
Neg logit contributions
 misunder-0.223
Pri-0.209
 differ-0.194
1945-0.193
 attitudes-0.193
Token has
Token ID468
Feature activation-4.29e-01

Pos logit contributions
 SATA+2.903
HAM+2.840
oscope+2.803
 DRAG+2.699
 Turtles+2.637
Neg logit contributions
 misunder-2.373
Pri-2.210
1945-2.029
esa-2.020
Zero-2.017
Token w
Token ID266
Feature activation-7.93e-01

Pos logit contributions
 misunder+0.242
Pri+0.224
 differ+0.206
1945+0.204
 attitudes+0.203
Neg logit contributions
 SATA-0.380
HAM-0.378
oscope-0.369
 DRAG-0.359
 Turtles-0.349
Tokenaned
Token ID22739
Feature activation+1.78e-02

Pos logit contributions
 misunder+0.452
Pri+0.416
 differ+0.379
1945+0.378
 attitudes+0.377
Neg logit contributions
 SATA-0.700
HAM-0.692
oscope-0.680
 DRAG-0.657
 Turtles-0.643
Token since
Token ID1201
Feature activation+8.38e-02

Pos logit contributions
 SATA+0.014
HAM+0.014
oscope+0.013
 DRAG+0.013
 Turtles+0.012
Neg logit contributions
 misunder-0.012
Pri-0.011
 differ-0.011
1945-0.011
 attitudes-0.010
Token last
Token ID938
Feature activation+3.34e-01

Pos logit contributions
 SATA+0.069
HAM+0.068
oscope+0.066
 DRAG+0.064
 Turtles+0.063
Neg logit contributions
 misunder-0.053
Pri-0.049
 differ-0.045
1945-0.045
 attitudes-0.045
Token Yet
Token ID6430
Feature activation+4.07e-01

Pos logit contributions
 SATA+0.421
HAM+0.417
oscope+0.409
 DRAG+0.393
 Turtles+0.380
Neg logit contributions
 misunder-0.376
Pri-0.361
1945-0.335
 differ-0.333
 attitudes-0.331
Token Gl
Token ID2671
Feature activation+2.41e-01

Pos logit contributions
 SATA+0.326
HAM+0.323
oscope+0.316
 DRAG+0.306
 Turtles+0.297
Neg logit contributions
 misunder-0.263
Pri-0.246
 differ-0.228
1945-0.227
 attitudes-0.226
Tokenü
Token ID9116
Feature activation-3.28e-01

Pos logit contributions
 SATA+0.204
HAM+0.201
 DRAG+0.194
oscope+0.194
 Turtles+0.188
Neg logit contributions
 misunder-0.141
Pri-0.137
esa-0.127
 differ-0.126
1945-0.126
Tokenck
Token ID694
Feature activation+1.14e+00

Pos logit contributions
 misunder+0.216
Pri+0.214
 differ+0.200
1945+0.200
esa+0.197
Neg logit contributions
 SATA-0.249
HAM-0.246
oscope-0.239
 DRAG-0.236
 Turtles-0.227
Token has
Token ID468
Feature activation-6.06e-01

Pos logit contributions
 SATA+0.909
HAM+0.901
oscope+0.881
 DRAG+0.851
 Turtles+0.828
Neg logit contributions
 misunder-0.748
Pri-0.698
 differ-0.645
1945-0.645
 attitudes-0.640
Token forced
Token ID4137
Feature activation+5.74e-01

Pos logit contributions
 misunder+0.348
Pri+0.323
 differ+0.296
1945+0.294
 attitudes+0.293
Neg logit contributions
 SATA-0.530
HAM-0.526
oscope-0.514
 DRAG-0.499
 Turtles-0.487
Token a
Token ID257
Feature activation-3.63e-01

Pos logit contributions
 SATA+0.450
HAM+0.446
oscope+0.435
 DRAG+0.421
 Turtles+0.409
Neg logit contributions
 misunder-0.381
Pri-0.356
 differ-0.330
1945-0.330
esa-0.326
Token whole
Token ID2187
Feature activation+5.20e-01

Pos logit contributions
 misunder+0.231
Pri+0.216
 differ+0.200
1945+0.199
 attitudes+0.198
Neg logit contributions
 SATA-0.295
HAM-0.292
oscope-0.286
 DRAG-0.277
 Turtles-0.269
Token world
Token ID995
Feature activation-3.76e-01

Pos logit contributions
 SATA+0.371
HAM+0.367
oscope+0.357
 DRAG+0.345
 Turtles+0.334
Neg logit contributions
 misunder-0.382
Pri-0.360
 differ-0.337
1945-0.336
 attitudes-0.335
Token into
Token ID656
Feature activation-2.74e+00

Pos logit contributions
 misunder+0.234
Pri+0.218
 differ+0.201
1945+0.200
 attitudes+0.199
Neg logit contributions
 SATA-0.311
HAM-0.308
oscope-0.301
 DRAG-0.292
 Turtles-0.284
Token a
Token ID257
Feature activation-1.05e-01

Pos logit contributions
 misunder+1.738
Pri+1.617
 attitudes+1.522
 differ+1.513
1945+1.505
Neg logit contributions
 SATA-2.217
HAM-2.202
oscope-2.132
 DRAG-2.085
 Turtles-2.012
Token and
Token ID290
Feature activation+1.58e-01

Pos logit contributions
 SATA+0.038
HAM+0.038
oscope+0.037
 DRAG+0.036
 Turtles+0.035
Neg logit contributions
 misunder-0.031
Pri-0.028
 differ-0.026
1945-0.026
 attitudes-0.026
Token 2012
Token ID2321
Feature activation+1.13e+00

Pos logit contributions
 SATA+0.126
HAM+0.125
oscope+0.122
 DRAG+0.118
 Turtles+0.115
Neg logit contributions
 misunder-0.102
Pri-0.095
 differ-0.088
1945-0.088
 attitudes-0.087
Token was
Token ID373
Feature activation-3.70e-01

Pos logit contributions
 SATA+0.905
HAM+0.896
oscope+0.875
 DRAG+0.846
 Turtles+0.822
Neg logit contributions
 misunder-0.740
Pri-0.688
 differ-0.635
1945-0.634
 attitudes-0.629
Token 25
Token ID1679
Feature activation-1.25e+00

Pos logit contributions
 misunder+0.227
Pri+0.211
 differ+0.195
1945+0.193
 attitudes+0.193
Neg logit contributions
 SATA-0.310
HAM-0.308
oscope-0.301
 DRAG-0.291
 Turtles-0.283
Token percent
Token ID1411
Feature activation-8.55e-01

Pos logit contributions
 misunder+0.792
Pri+0.741
 differ+0.689
1945+0.683
 attitudes+0.682
Neg logit contributions
 SATA-1.015
HAM-1.007
oscope-0.985
 DRAG-0.954
 Turtles-0.926
Token lower
Token ID2793
Feature activation+1.85e+00

Pos logit contributions
 misunder+0.573
Pri+0.538
 differ+0.502
1945+0.498
 attitudes+0.497
Neg logit contributions
 SATA-0.665
HAM-0.659
oscope-0.644
 DRAG-0.622
 Turtles-0.604
Token than
Token ID621
Feature activation+9.01e-01

Pos logit contributions
 SATA+1.407
HAM+1.397
oscope+1.354
 DRAG+1.315
 Turtles+1.273
Neg logit contributions
 misunder-1.274
Pri-1.191
1945-1.106
 differ-1.106
esa-1.096
Token average
Token ID2811
Feature activation+8.17e-01

Pos logit contributions
 SATA+0.733
HAM+0.725
oscope+0.708
 DRAG+0.686
 Turtles+0.667
Neg logit contributions
 misunder-0.576
Pri-0.536
 differ-0.495
1945-0.493
 attitudes-0.490
Token yields
Token ID19299
Feature activation-2.58e+00

Pos logit contributions
 SATA+0.618
HAM+0.611
oscope+0.596
 DRAG+0.576
 Turtles+0.559
Neg logit contributions
 misunder-0.567
Pri-0.532
 differ-0.496
1945-0.494
 attitudes-0.492
Token.
Token ID13
Feature activation+3.60e-01

Pos logit contributions
 misunder+1.693
Pri+1.606
 differ+1.500
1945+1.484
 attitudes+1.482
Neg logit contributions
 SATA-2.024
HAM-1.998
oscope-1.956
 DRAG-1.899
 Turtles-1.841
Token I
Token ID314
Feature activation-6.10e-02

Pos logit contributions
 SATA+0.281
HAM+0.278
oscope+0.271
 DRAG+0.263
 Turtles+0.255
Neg logit contributions
 misunder-0.242
Pri-0.225
 differ-0.210
1945-0.209
 attitudes-0.208
Token78
Token ID3695
Feature activation+4.62e-01

Pos logit contributions
 misunder+0.077
Pri+0.071
 differ+0.065
1945+0.064
 attitudes+0.064
Neg logit contributions
 SATA-0.131
HAM-0.130
oscope-0.127
 DRAG-0.124
 Turtles-0.121
Token70
Token ID2154
Feature activation-2.37e-01

Pos logit contributions
 SATA+0.431
HAM+0.428
oscope+0.421
 DRAG+0.412
lishes+0.399
Neg logit contributions
 misunder-0.230
Pri-0.216
 differ-0.198
1945-0.197
 attitudes-0.195
Token Hawk
Token ID26698
Feature activation+9.25e-01

Pos logit contributions
 misunder+0.200
Pri+0.190
 differ+0.180
 attitudes+0.178
1945+0.178
Neg logit contributions
 SATA-0.143
HAM-0.141
oscope-0.137
 DRAG-0.131
 Turtles-0.126
Token uses
Token ID3544
Feature activation-1.13e+00

Pos logit contributions
 SATA+0.714
HAM+0.710
oscope+0.692
 DRAG+0.670
 Turtles+0.649
Neg logit contributions
 misunder-0.624
Pri-0.586
 differ-0.546
1945-0.541
 attitudes-0.541
Token Military
Token ID12842
Feature activation-4.29e-01

Pos logit contributions
 misunder+0.763
Pri+0.715
 differ+0.667
 attitudes+0.663
1945+0.661
Neg logit contributions
 SATA-0.862
HAM-0.860
oscope-0.839
 DRAG-0.810
 Turtles-0.785
Token Class
Token ID5016
Feature activation+5.83e-01

Pos logit contributions
 misunder+0.294
Pri+0.278
 differ+0.258
 attitudes+0.256
1945+0.256
Neg logit contributions
 SATA-0.324
HAM-0.322
oscope-0.316
 DRAG-0.305
 Turtles-0.294
Token III
Token ID6711
Feature activation+4.88e-01

Pos logit contributions
 SATA+0.393
HAM+0.387
oscope+0.376
 DRAG+0.362
 Turtles+0.350
Neg logit contributions
 misunder-0.454
Pri-0.428
 differ-0.402
1945-0.401
 attitudes-0.400
Token components
Token ID6805
Feature activation+3.71e-01

Pos logit contributions
 SATA+0.347
HAM+0.341
oscope+0.331
 DRAG+0.320
 Turtles+0.311
Neg logit contributions
 misunder-0.364
Pri-0.342
 differ-0.320
1945-0.320
 attitudes-0.317
Token which
Token ID543
Feature activation+1.53e+00

Pos logit contributions
 SATA+0.294
HAM+0.291
oscope+0.284
 DRAG+0.275
 Turtles+0.267
Neg logit contributions
 misunder-0.243
Pri-0.227
 differ-0.211
1945-0.210
 attitudes-0.209
Token have
Token ID423
Feature activation-7.30e-02

Pos logit contributions
 SATA+1.162
HAM+1.142
oscope+1.116
 DRAG+1.079
 Turtles+1.050
Neg logit contributions
 misunder-1.057
Pri-0.992
1945-0.921
 attitudes-0.914
 differ-0.914
Token passed
Token ID3804
Feature activation+9.73e-02

Pos logit contributions
 misunder+0.045
Pri+0.042
 differ+0.038
1945+0.038
 attitudes+0.038
Neg logit contributions
 SATA-0.061
HAM-0.061
oscope-0.059
 DRAG-0.057
 Turtles-0.056
Token
Token ID251
Feature activation-6.57e-01

Pos logit contributions
 misunder+0.150
Pri+0.141
1945+0.132
 differ+0.131
 attitudes+0.131
Neg logit contributions
 SATA-0.136
HAM-0.135
oscope-0.130
 DRAG-0.126
 Turtles-0.122
Token causes
Token ID5640
Feature activation-1.05e+00

Pos logit contributions
 misunder+0.426
Pri+0.397
 differ+0.368
1945+0.367
 attitudes+0.365
Neg logit contributions
 SATA-0.527
HAM-0.522
oscope-0.509
 DRAG-0.493
 Turtles-0.480
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
 misunder+0.692
Pri+0.650
 differ+0.604
1945+0.600
 attitudes+0.599
Neg logit contributions
 SATA-0.822
HAM-0.814
oscope-0.795
 DRAG-0.770
 Turtles-0.747
Token
Token ID564
Feature activation+4.79e-01

Pos logit contributions
 misunder+0.110
Pri+0.105
 differ+0.098
1945+0.097
 attitudes+0.097
Neg logit contributions
 SATA-0.118
HAM-0.117
oscope-0.114
 DRAG-0.110
antage-0.106
Token
Token ID250
Feature activation+2.67e-01

Pos logit contributions
 SATA+0.312
HAM+0.308
oscope+0.297
 DRAG+0.288
 Turtles+0.277
Neg logit contributions
 misunder-0.384
Pri-0.360
 differ-0.340
1945-0.339
 attitudes-0.338
TokenMake
Token ID12050
Feature activation+2.37e+00

Pos logit contributions
 SATA+0.219
HAM+0.218
oscope+0.212
 DRAG+0.205
 Turtles+0.200
Neg logit contributions
 misunder-0.169
Pri-0.156
 differ-0.145
 attitudes-0.144
1945-0.144
Token your
Token ID534
Feature activation-3.35e-01

Pos logit contributions
 SATA+1.865
HAM+1.845
oscope+1.795
 DRAG+1.733
 Turtles+1.703
Neg logit contributions
 misunder-1.572
Pri-1.447
1945-1.352
 differ-1.341
 attitudes-1.334
Token wishes
Token ID12802
Feature activation+2.81e+00

Pos logit contributions
 misunder+0.237
Pri+0.224
 differ+0.210
 attitudes+0.208
1945+0.208
Neg logit contributions
 SATA-0.247
HAM-0.245
oscope-0.239
 DRAG-0.231
 Turtles-0.223
Token now
Token ID783
Feature activation+6.55e-01

Pos logit contributions
 SATA+2.346
HAM+2.330
oscope+2.267
 DRAG+2.209
 Turtles+2.148
Neg logit contributions
 misunder-1.717
Pri-1.591
1945-1.466
 attitudes-1.447
esa-1.447
Token!
Token ID0
Feature activation-4.27e-01

Pos logit contributions
 SATA+0.493
HAM+0.489
oscope+0.477
 DRAG+0.460
 Turtles+0.447
Neg logit contributions
 misunder-0.459
Pri-0.427
 differ-0.398
1945-0.397
 attitudes-0.395
Token
Token ID447
Feature activation+1.75e-01

Pos logit contributions
 misunder+0.300
Pri+0.284
 differ+0.264
1945+0.263
 attitudes+0.262
Neg logit contributions
 SATA-0.318
HAM-0.314
oscope-0.306
 DRAG-0.296
 Turtles-0.286
Token starring
Token ID20495
Feature activation-2.03e+00

Pos logit contributions
 SATA+0.913
HAM+0.903
oscope+0.882
 DRAG+0.857
 Turtles+0.835
Neg logit contributions
 misunder-0.661
Pri-0.615
 differ-0.565
1945-0.564
 attitudes-0.560
Token TV
Token ID3195
Feature activation+1.10e+00

Pos logit contributions
 misunder+0.907
Pri+0.828
 differ+0.744
 attitudes+0.735
1945+0.722
Neg logit contributions
 SATA-2.024
HAM-2.010
oscope-1.974
 DRAG-1.915
 Turtles-1.873
Token role
Token ID2597
Feature activation-5.01e-01

Pos logit contributions
 SATA+0.800
HAM+0.791
oscope+0.770
 DRAG+0.744
 Turtles+0.721
Neg logit contributions
 misunder-0.796
Pri-0.748
 differ-0.697
1945-0.697
 attitudes-0.693
Token was
Token ID373
Feature activation-3.56e-01

Pos logit contributions
 misunder+0.324
Pri+0.302
1945+0.279
 differ+0.278
 attitudes+0.277
Neg logit contributions
 SATA-0.402
HAM-0.398
oscope-0.389
 DRAG-0.376
 Turtles-0.366
Token in
Token ID287
Feature activation-2.63e-01

Pos logit contributions
 misunder+0.234
Pri+0.219
 differ+0.203
1945+0.202
 attitudes+0.202
Neg logit contributions
 SATA-0.282
HAM-0.279
oscope-0.273
 DRAG-0.264
 Turtles-0.257
Token Str
Token ID4285
Feature activation+9.95e-01

Pos logit contributions
 misunder+0.173
Pri+0.161
 differ+0.150
1945+0.149
 attitudes+0.148
Neg logit contributions
 SATA-0.208
HAM-0.206
oscope-0.202
 DRAG-0.195
 Turtles-0.190
Tokenangers
Token ID6606
Feature activation+3.89e-02

Pos logit contributions
 SATA+0.800
HAM+0.791
oscope+0.765
 DRAG+0.756
 Turtles+0.725
Neg logit contributions
 misunder-0.617
Pri-0.597
1945-0.558
esa-0.556
 differ-0.554
Token with
Token ID351
Feature activation+3.89e-01

Pos logit contributions
 SATA+0.033
HAM+0.033
oscope+0.032
 DRAG+0.031
 Turtles+0.030
Neg logit contributions
 misunder-0.024
Pri-0.022
 differ-0.020
1945-0.020
 attitudes-0.020
Token Candy
Token ID24680
Feature activation+1.33e+00

Pos logit contributions
 SATA+0.268
HAM+0.264
oscope+0.258
 DRAG+0.249
 Turtles+0.242
Neg logit contributions
 misunder-0.297
Pri-0.280
 differ-0.262
1945-0.261
 attitudes-0.260
Token,
Token ID11
Feature activation-8.63e-02

Pos logit contributions
 SATA+0.978
HAM+0.971
oscope+0.945
 DRAG+0.909
 Turtles+0.886
Neg logit contributions
 misunder-0.947
Pri-0.879
 differ-0.825
1945-0.820
 attitudes-0.817
Token a
Token ID257
Feature activation-3.52e-01

Pos logit contributions
 misunder+0.053
Pri+0.050
 differ+0.046
1945+0.045
 attitudes+0.045
Neg logit contributions
 SATA-0.072
HAM-0.071
oscope-0.070
 DRAG-0.067
 Turtles-0.066
Token American
Token ID1605
Feature activation+4.32e-01

Pos logit contributions
 SATA+0.123
HAM+0.122
oscope+0.119
 DRAG+0.115
 Turtles+0.111
Neg logit contributions
 misunder-0.104
Pri-0.097
 differ-0.091
 attitudes-0.090
1945-0.090
Token people
Token ID661
Feature activation-1.14e-01

Pos logit contributions
 SATA+0.367
HAM+0.362
oscope+0.354
 DRAG+0.343
 Turtles+0.335
Neg logit contributions
 misunder-0.261
Pri-0.243
 differ-0.223
1945-0.222
 attitudes-0.220
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
 misunder+0.068
Pri+0.063
 differ+0.058
1945+0.058
 attitudes+0.057
Neg logit contributions
 SATA-0.097
HAM-0.096
oscope-0.094
 DRAG-0.091
 Turtles-0.089
Token through
Token ID832
Feature activation-1.34e+00

Pos logit contributions
 SATA+0.178
HAM+0.176
oscope+0.173
 DRAG+0.167
 Turtles+0.163
Neg logit contributions
 misunder-0.129
Pri-0.120
 differ-0.110
1945-0.110
 attitudes-0.109
Token Congress
Token ID3162
Feature activation-8.99e-01

Pos logit contributions
 misunder+0.783
Pri+0.727
 differ+0.668
 attitudes+0.664
1945+0.663
Neg logit contributions
 SATA-1.159
HAM-1.150
oscope-1.124
 DRAG-1.092
 Turtles-1.063
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
 misunder+0.609
Pri+0.571
1945+0.531
 differ+0.529
 attitudes+0.523
Neg logit contributions
 SATA-0.693
HAM-0.687
oscope-0.670
 DRAG-0.646
 Turtles-0.628
Token will
Token ID481
Feature activation-9.99e-03

Pos logit contributions
 SATA+0.086
HAM+0.085
oscope+0.083
 DRAG+0.080
 Turtles+0.078
Neg logit contributions
 misunder-0.061
Pri-0.057
1945-0.053
 differ-0.052
esa-0.052
Token control
Token ID1630
Feature activation+6.82e-01

Pos logit contributions
 misunder+0.007
Pri+0.006
 differ+0.006
 attitudes+0.006
1945+0.006
Neg logit contributions
 SATA-0.008
HAM-0.008
oscope-0.007
 DRAG-0.007
 Turtles-0.007
Token the
Token ID262
Feature activation+5.19e-02

Pos logit contributions
 SATA+0.587
HAM+0.580
oscope+0.567
 DRAG+0.549
 Turtles+0.536
Neg logit contributions
 misunder-0.405
Pri-0.373
1945-0.342
 differ-0.340
esa-0.339
Token greed
Token ID25474
Feature activation-3.17e-01

Pos logit contributions
 SATA+0.042
HAM+0.041
oscope+0.040
 DRAG+0.039
 Turtles+0.038
Neg logit contributions
 misunder-0.033
Pri-0.031
 differ-0.029
 attitudes-0.029
1945-0.029
Token,
Token ID11
Feature activation+2.59e-01

Pos logit contributions
 misunder+0.202
Pri+0.188
 differ+0.174
1945+0.173
 attitudes+0.172
Neg logit contributions
 SATA-0.259
HAM-0.256
oscope-0.250
 DRAG-0.242
 Turtles-0.236
Token as
Token ID355
Feature activation+6.40e-02

Pos logit contributions
 misunder+0.004
Pri+0.004
 differ+0.004
 attitudes+0.003
1945+0.003
Neg logit contributions
 SATA-0.005
HAM-0.005
oscope-0.005
 DRAG-0.005
antage-0.005
Token overweight
Token ID22652
Feature activation-1.54e+00

Pos logit contributions
 SATA+0.053
HAM+0.052
oscope+0.051
 DRAG+0.050
 Turtles+0.048
Neg logit contributions
 misunder-0.040
Pri-0.037
 differ-0.034
1945-0.034
 attitudes-0.034
Token and
Token ID290
Feature activation-5.27e-02

Pos logit contributions
 misunder+0.915
Pri+0.851
 differ+0.789
 attitudes+0.783
1945+0.780
Neg logit contributions
 SATA-1.313
HAM-1.301
oscope-1.276
 DRAG-1.234
 Turtles-1.202
Token out
Token ID503
Feature activation+6.88e-02

Pos logit contributions
 misunder+0.034
Pri+0.032
1945+0.030
esa+0.029
 differ+0.029
Neg logit contributions
 SATA-0.042
HAM-0.042
oscope-0.041
 DRAG-0.040
 Turtles-0.038
Token of
Token ID286
Feature activation+4.03e-02

Pos logit contributions
 SATA+0.056
HAM+0.056
oscope+0.054
 DRAG+0.053
 Turtles+0.051
Neg logit contributions
 misunder-0.044
Pri-0.041
1945-0.037
 differ-0.037
 attitudes-0.037
Token shape
Token ID5485
Feature activation-3.91e-01

Pos logit contributions
 SATA+0.030
HAM+0.029
oscope+0.029
 DRAG+0.028
 Turtles+0.027
Neg logit contributions
 misunder-0.029
Pri-0.027
1945-0.025
 differ-0.025
esa-0.025
Token people
Token ID661
Feature activation-5.30e-01

Pos logit contributions
 misunder+0.228
Pri+0.212
1945+0.193
 differ+0.192
 attitudes+0.191
Neg logit contributions
 SATA-0.339
HAM-0.336
oscope-0.328
 DRAG-0.319
 Turtles-0.311
Token struggle
Token ID6531
Feature activation-2.23e-01

Pos logit contributions
 misunder+0.316
Pri+0.293
 differ+0.268
1945+0.268
 attitudes+0.267
Neg logit contributions
 SATA-0.452
HAM-0.448
oscope-0.438
 DRAG-0.426
 Turtles-0.414
Token to
Token ID284
Feature activation+2.04e-03

Pos logit contributions
 misunder+0.144
Pri+0.134
1945+0.124
 differ+0.123
 attitudes+0.123
Neg logit contributions
 SATA-0.180
HAM-0.179
oscope-0.174
 DRAG-0.169
 Turtles-0.164
Token jog
Token ID48342
Feature activation-1.28e+00

Pos logit contributions
 SATA+0.002
HAM+0.002
oscope+0.002
 DRAG+0.002
 Turtles+0.001
Neg logit contributions
 misunder-0.001
Pri-0.001
 differ-0.001
1945-0.001
 attitudes-0.001
Token or
Token ID393
Feature activation-1.74e+00

Pos logit contributions
 misunder+0.812
Pri+0.760
 differ+0.705
1945+0.700
 attitudes+0.699
Neg logit contributions
 SATA-1.045
HAM-1.035
oscope-1.012
 DRAG-0.979
 Turtles-0.952
Token the
Token ID262
Feature activation-5.97e-02

Pos logit contributions
 SATA+0.046
HAM+0.045
oscope+0.044
 DRAG+0.043
 Turtles+0.042
Neg logit contributions
 misunder-0.033
Pri-0.031
 differ-0.029
1945-0.028
 attitudes-0.028
Token Governor
Token ID10807
Feature activation-2.90e-02

Pos logit contributions
 misunder+0.041
Pri+0.039
 differ+0.036
 attitudes+0.036
1945+0.036
Neg logit contributions
 SATA-0.045
HAM-0.045
oscope-0.044
 DRAG-0.042
 Turtles-0.041
Token
Token ID447
Feature activation+6.59e-03

Pos logit contributions
 misunder+0.021
Pri+0.020
 differ+0.019
 attitudes+0.018
1945+0.018
Neg logit contributions
 SATA-0.021
HAM-0.021
oscope-0.020
 DRAG-0.019
 Turtles-0.019
Token
Token ID247
Feature activation+1.50e-01

Pos logit contributions
 SATA+0.005
HAM+0.005
oscope+0.005
 DRAG+0.005
 Turtles+0.004
Neg logit contributions
 misunder-0.005
Pri-0.004
 differ-0.004
1945-0.004
 attitudes-0.004
Tokens
Token ID82
Feature activation+3.48e-01

Pos logit contributions
 SATA+0.116
HAM+0.115
oscope+0.112
 DRAG+0.108
 Turtles+0.105
Neg logit contributions
 misunder-0.101
Pri-0.095
 differ-0.088
1945-0.088
 attitudes-0.088
Token mansion
Token ID24141
Feature activation-1.49e+00

Pos logit contributions
 SATA+0.269
HAM+0.266
oscope+0.260
 DRAG+0.251
 Turtles+0.244
Neg logit contributions
 misunder-0.235
Pri-0.221
 differ-0.206
1945-0.205
 attitudes-0.204
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
 misunder+0.998
Pri+0.941
 differ+0.875
1945+0.869
 attitudes+0.867
Neg logit contributions
 SATA-1.152
HAM-1.141
oscope-1.113
 DRAG-1.077
 Turtles-1.046
Token McDonnell
Token ID39441
Feature activation+3.29e-02

Pos logit contributions
 SATA+0.169
HAM+0.168
oscope+0.165
 DRAG+0.158
lishes+0.154
Neg logit contributions
 misunder-0.149
Pri-0.145
1945-0.134
 differ-0.133
Zero-0.132
Token did
Token ID750
Feature activation+1.38e-02

Pos logit contributions
 SATA+0.026
HAM+0.026
oscope+0.026
 DRAG+0.025
 Turtles+0.024
Neg logit contributions
 misunder-0.021
Pri-0.020
 differ-0.018
1945-0.018
 attitudes-0.018
Token not
Token ID407
Feature activation-1.05e-01

Pos logit contributions
 SATA+0.012
HAM+0.012
oscope+0.011
 DRAG+0.011
 Turtles+0.011
Neg logit contributions
 misunder-0.008
Pri-0.008
 differ-0.007
1945-0.007
 attitudes-0.007
Token steer
Token ID27401
Feature activation-8.46e-01

Pos logit contributions
 misunder+0.073
Pri+0.069
 differ+0.066
1945+0.064
 attitudes+0.064
Neg logit contributions
 SATA-0.079
HAM-0.078
oscope-0.076
 DRAG-0.073
 Turtles-0.071
Token'm
Token ID1101
Feature activation-9.20e-02

Pos logit contributions
 misunder+0.091
Pri+0.086
 differ+0.080
1945+0.079
 attitudes+0.079
Neg logit contributions
 SATA-0.105
HAM-0.104
oscope-0.102
 DRAG-0.098
 Turtles-0.095
Token officially
Token ID8720
Feature activation+2.43e+00

Pos logit contributions
 misunder+0.064
Pri+0.060
 differ+0.056
1945+0.056
 attitudes+0.056
Neg logit contributions
 SATA-0.070
HAM-0.069
oscope-0.067
 DRAG-0.065
 Turtles-0.063
Token declaring
Token ID18684
Feature activation-1.35e+00

Pos logit contributions
 SATA+1.869
HAM+1.852
oscope+1.798
 DRAG+1.744
 Turtles+1.694
Neg logit contributions
 misunder-1.648
Pri-1.544
 differ-1.432
1945-1.430
 attitudes-1.427
Token them
Token ID606
Feature activation+8.55e-01

Pos logit contributions
 misunder+0.831
Pri+0.774
 differ+0.718
1945+0.714
 attitudes+0.712
Neg logit contributions
 SATA-1.118
HAM-1.110
oscope-1.087
 DRAG-1.049
 Turtles-1.021
Token as
Token ID355
Feature activation-2.42e-01

Pos logit contributions
 SATA+0.668
HAM+0.662
oscope+0.645
 DRAG+0.625
 Turtles+0.606
Neg logit contributions
 misunder-0.572
Pri-0.535
 differ-0.497
1945-0.495
 attitudes-0.493
Token des
Token ID748
Feature activation-2.47e-01

Pos logit contributions
 misunder+0.160
Pri+0.149
 differ+0.139
1945+0.138
 attitudes+0.138
Neg logit contributions
 SATA-0.191
HAM-0.190
oscope-0.186
 DRAG-0.179
 Turtles-0.174
Tokener
Token ID263
Feature activation-4.46e-01

Pos logit contributions
 misunder+0.148
Pri+0.134
 differ+0.124
 attitudes+0.124
1945+0.123
Neg logit contributions
 SATA-0.212
HAM-0.211
oscope-0.207
 DRAG-0.201
 Turtles-0.195
Tokenter
Token ID353
Feature activation+1.59e-01

Pos logit contributions
 misunder+0.197
Pri+0.178
 differ+0.159
1945+0.157
 attitudes+0.156
Neg logit contributions
 SATA-0.449
HAM-0.446
oscope-0.438
 DRAG-0.427
 Turtles-0.417
Token of
Token ID286
Feature activation-1.51e-01

Pos logit contributions
 SATA+0.126
HAM+0.125
oscope+0.122
 DRAG+0.118
 Turtles+0.114
Neg logit contributions
 misunder-0.105
Pri-0.098
 differ-0.091
1945-0.090
 attitudes-0.090
Token C
Token ID327
Feature activation-1.23e+00

Pos logit contributions
 misunder+0.091
Pri+0.085
 differ+0.078
1945+0.077
 attitudes+0.077
Neg logit contributions
 SATA-0.128
HAM-0.127
oscope-0.124
 DRAG-0.121
 Turtles-0.117
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
 misunder+0.870
Pri+0.829
 differ+0.771
1945+0.769
esa+0.769
Neg logit contributions
 SATA-0.898
HAM-0.885
oscope-0.865
 DRAG-0.836
 Turtles-0.819
Token as
Token ID355
Feature activation+3.99e-01

Pos logit contributions
 misunder+1.206
Pri+1.124
 differ+1.072
 attitudes+1.043
1945+1.038
Neg logit contributions
 SATA-1.627
HAM-1.620
oscope-1.584
 DRAG-1.521
lishes-1.480
Token even
Token ID772
Feature activation+1.04e+00

Pos logit contributions
 SATA+0.348
HAM+0.345
oscope+0.338
 DRAG+0.328
 Turtles+0.319
Neg logit contributions
 misunder-0.230
Pri-0.213
 differ-0.196
1945-0.194
 attitudes-0.194
Token at
Token ID379
Feature activation+1.67e-01

Pos logit contributions
 SATA+0.902
HAM+0.893
oscope+0.873
 DRAG+0.848
 Turtles+0.827
Neg logit contributions
 misunder-0.600
Pri-0.556
 differ-0.508
1945-0.507
 attitudes-0.504
Token that
Token ID326
Feature activation-2.14e-01

Pos logit contributions
 SATA+0.128
HAM+0.127
oscope+0.123
 DRAG+0.120
 Turtles+0.116
Neg logit contributions
 misunder-0.114
Pri-0.106
 differ-0.099
1945-0.099
 attitudes-0.099
Token resolution
Token ID6323
Feature activation-2.61e-01

Pos logit contributions
 misunder+0.147
Pri+0.138
 differ+0.130
 attitudes+0.129
1945+0.128
Neg logit contributions
 SATA-0.162
HAM-0.161
oscope-0.157
 DRAG-0.152
 Turtles-0.147
Token there
Token ID612
Feature activation-9.85e-01

Pos logit contributions
 misunder+0.164
Pri+0.153
 differ+0.143
 attitudes+0.141
1945+0.141
Neg logit contributions
 SATA-0.213
HAM-0.212
oscope-0.207
 DRAG-0.200
 Turtles-0.195
Token are
Token ID389
Feature activation-9.60e-02

Pos logit contributions
 misunder+0.632
Pri+0.592
 differ+0.552
 attitudes+0.546
1945+0.545
Neg logit contributions
 SATA-0.792
HAM-0.788
oscope-0.768
 DRAG-0.743
 Turtles-0.722
Token still
Token ID991
Feature activation-1.35e+00

Pos logit contributions
 misunder+0.056
Pri+0.052
 differ+0.048
 attitudes+0.047
1945+0.047
Neg logit contributions
 SATA-0.083
HAM-0.083
oscope-0.081
 DRAG-0.078
 Turtles-0.076
Token those
Token ID883
Feature activation-1.53e+00

Pos logit contributions
 misunder+0.800
Pri+0.741
 differ+0.691
 attitudes+0.682
1945+0.678
Neg logit contributions
 SATA-1.145
HAM-1.142
oscope-1.116
 DRAG-1.079
 Turtles-1.047
Token spaces
Token ID9029
Feature activation-9.48e-01

Pos logit contributions
 misunder+0.967
Pri+0.897
 differ+0.842
 attitudes+0.834
1945+0.828
Neg logit contributions
 SATA-1.248
HAM-1.246
oscope-1.217
 DRAG-1.172
 Turtles-1.138
Token between
Token ID1022
Feature activation-1.13e+00

Pos logit contributions
 misunder+0.574
Pri+0.534
 differ+0.496
1945+0.491
 attitudes+0.490
Neg logit contributions
 SATA-0.797
HAM-0.792
oscope-0.774
 DRAG-0.751
 Turtles-0.730
Token or
Token ID393
Feature activation-1.62e+00

Pos logit contributions
 SATA+0.853
HAM+0.843
oscope+0.825
 DRAG+0.792
 Turtles+0.769
Neg logit contributions
 misunder-0.796
Pri-0.747
1945-0.692
 differ-0.689
 attitudes-0.684
Token the
Token ID262
Feature activation+1.84e-01

Pos logit contributions
 misunder+0.997
Pri+0.929
 differ+0.861
 attitudes+0.859
1945+0.853
Neg logit contributions
 SATA-1.347
HAM-1.338
oscope-1.302
 DRAG-1.265
 Turtles-1.231
Token transcend
Token ID26231
Feature activation-3.98e-01

Pos logit contributions
 SATA+0.136
HAM+0.135
oscope+0.131
 DRAG+0.127
 Turtles+0.123
Neg logit contributions
 misunder-0.131
Pri-0.123
 differ-0.115
1945-0.114
 attitudes-0.114
Tokenent
Token ID298
Feature activation-1.22e+00

Pos logit contributions
 misunder+0.339
Pri+0.322
 differ+0.304
1945+0.303
 attitudes+0.302
Neg logit contributions
 SATA-0.239
HAM-0.236
oscope-0.229
 DRAG-0.218
 Turtles-0.210
Token,
Token ID11
Feature activation+2.56e-01

Pos logit contributions
 misunder+0.848
Pri+0.798
 differ+0.745
 attitudes+0.741
1945+0.740
Neg logit contributions
 SATA-0.915
HAM-0.907
oscope-0.881
 DRAG-0.855
 Turtles-0.829
Token into
Token ID656
Feature activation-3.07e+00

Pos logit contributions
 SATA+0.203
HAM+0.200
oscope+0.196
 DRAG+0.189
 Turtles+0.184
Neg logit contributions
 misunder-0.168
Pri-0.157
1945-0.146
 differ-0.145
 attitudes-0.144
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 misunder+1.785
Pri+1.658
 attitudes+1.549
 differ+1.546
 kitchens+1.517
Neg logit contributions
 SATA-2.658
HAM-2.653
oscope-2.573
 DRAG-2.502
 Turtles-2.425
Token realm
Token ID13360
Feature activation+1.63e+00

Pos logit contributions
 SATA+0.087
HAM+0.087
oscope+0.084
 DRAG+0.081
 Turtles+0.079
Neg logit contributions
 misunder-0.085
Pri-0.080
 differ-0.075
 attitudes-0.075
1945-0.074
Token of
Token ID286
Feature activation+2.85e-01

Pos logit contributions
 SATA+1.331
HAM+1.303
oscope+1.300
 DRAG+1.229
antage+1.211
Neg logit contributions
 misunder-1.004
Pri-0.942
1945-0.862
esa-0.856
Initialized-0.849
Token nature
Token ID3450
Feature activation+9.99e-01

Pos logit contributions
 SATA+0.238
HAM+0.236
oscope+0.231
 DRAG+0.224
 Turtles+0.218
Neg logit contributions
 misunder-0.174
Pri-0.162
 differ-0.149
1945-0.149
 attitudes-0.148
Token.
Token ID13
Feature activation+4.90e-01

Pos logit contributions
 SATA+0.743
HAM+0.735
oscope+0.718
 DRAG+0.689
 Turtles+0.671
Neg logit contributions
 misunder-0.705
Pri-0.662
1945-0.617
 differ-0.613
esa-0.607
Token area
Token ID1989
Feature activation-1.33e+00

Pos logit contributions
 misunder+0.422
Pri+0.397
 differ+0.371
1945+0.369
 attitudes+0.368
Neg logit contributions
 SATA-0.428
HAM-0.424
oscope-0.413
 DRAG-0.399
 Turtles-0.386
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
 misunder+0.915
Pri+0.858
 differ+0.798
1945+0.794
 attitudes+0.791
Neg logit contributions
 SATA-1.019
HAM-1.008
oscope-0.983
 DRAG-0.949
 Turtles-0.921
Token
Token ID198
Feature activation-3.23e-01

Pos logit contributions
 misunder+0.096
Pri+0.089
 differ+0.083
1945+0.083
 attitudes+0.083
Neg logit contributions
 SATA-0.105
HAM-0.104
oscope-0.102
 DRAG-0.098
 Turtles-0.096
Token
Token ID198
Feature activation-4.99e-01

Pos logit contributions
Pri+0.192
 misunder+0.186
1945+0.178
Zero+0.174
 differ+0.174
Neg logit contributions
 SATA-0.264
HAM-0.262
oscope-0.258
 DRAG-0.253
antage-0.244
TokenCor
Token ID10606
Feature activation-2.73e+00

Pos logit contributions
 misunder+0.314
Pri+0.290
 differ+0.270
 attitudes+0.267
1945+0.267
Neg logit contributions
 SATA-0.410
HAM-0.408
oscope-0.398
 DRAG-0.384
 Turtles-0.374
Tokenrespond
Token ID5546
Feature activation-5.73e+00

Pos logit contributions
 misunder+1.194
Pri+1.104
 differ+0.987
poral+0.974
1945+0.974
Neg logit contributions
 SATA-2.726
HAM-2.711
oscope-2.636
 DRAG-2.619
 Turtles-2.539
Tokenents
Token ID658
Feature activation-1.41e+00

Pos logit contributions
 misunder+3.174
Pri+3.162
esa+2.949
 differ+2.886
enne+2.862
Neg logit contributions
 SATA-4.874
HAM-4.842
 DRAG-4.666
oscope-4.593
 Turtles-4.502
Token say
Token ID910
Feature activation-4.64e-01

Pos logit contributions
 misunder+0.982
Pri+0.921
1945+0.855
 differ+0.854
 attitudes+0.849
Neg logit contributions
 SATA-1.066
HAM-1.055
oscope-1.030
 DRAG-0.995
 Turtles-0.966
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
 misunder+0.284
Pri+0.264
 differ+0.244
1945+0.242
 attitudes+0.241
Neg logit contributions
 SATA-0.389
HAM-0.386
oscope-0.377
 DRAG-0.366
 Turtles-0.356
Token mine
Token ID6164
Feature activation-7.33e-01

Pos logit contributions
 misunder+0.118
Pri+0.110
 differ+0.102
 attitudes+0.102
1945+0.101
Neg logit contributions
 SATA-0.154
HAM-0.153
oscope-0.150
 DRAG-0.145
 Turtles-0.141
Token may
Token ID743
Feature activation-1.74e+00

Pos logit contributions
 misunder+0.450
Pri+0.420
 differ+0.386
1945+0.385
 attitudes+0.383
Neg logit contributions
 SATA-0.612
HAM-0.607
oscope-0.593
 DRAG-0.574
 Turtles-0.558
Token Rush
Token ID13063
Feature activation+3.72e-01

Pos logit contributions
 misunder+0.297
Pri+0.277
 differ+0.256
1945+0.255
 attitudes+0.254
Neg logit contributions
 SATA-0.391
HAM-0.387
oscope-0.379
 DRAG-0.367
 Turtles-0.356
Token Hy
Token ID6707
Feature activation+4.58e-02

Pos logit contributions
 SATA+0.288
HAM+0.285
oscope+0.278
 DRAG+0.269
 Turtles+0.262
Neg logit contributions
 misunder-0.252
Pri-0.235
 differ-0.219
1945-0.218
 attitudes-0.216
Tokenster
Token ID1706
Feature activation+5.17e-01

Pos logit contributions
 SATA+0.036
HAM+0.035
oscope+0.034
 DRAG+0.033
 Turtles+0.032
Neg logit contributions
 misunder-0.031
Pri-0.029
 differ-0.027
1945-0.027
 attitudes-0.027
Tokenia
Token ID544
Feature activation-1.35e+00

Pos logit contributions
 SATA+0.492
HAM+0.486
oscope+0.482
 DRAG+0.463
 Turtles+0.454
Neg logit contributions
 misunder-0.260
Pri-0.233
 differ-0.213
1945-0.210
 attitudes-0.209
Token was
Token ID373
Feature activation-6.77e-02

Pos logit contributions
 misunder+0.893
Pri+0.837
 differ+0.777
1945+0.775
 attitudes+0.773
Neg logit contributions
 SATA-1.064
HAM-1.056
oscope-1.030
 DRAG-0.996
 Turtles-0.966
Token puls
Token ID22271
Feature activation-4.27e+00

Pos logit contributions
 misunder+0.044
Pri+0.041
 differ+0.038
1945+0.038
 attitudes+0.038
Neg logit contributions
 SATA-0.054
HAM-0.054
oscope-0.053
 DRAG-0.051
 Turtles-0.049
Tokening
Token ID278
Feature activation-1.78e+00

Pos logit contributions
 misunder+2.336
Pri+2.170
 differ+1.980
1945+1.980
 kitchens+1.951
Neg logit contributions
 SATA-3.786
HAM-3.740
oscope-3.613
 DRAG-3.612
 Turtles-3.488
Token through
Token ID832
Feature activation-1.31e+00

Pos logit contributions
 misunder+1.123
Pri+1.049
 differ+0.976
 attitudes+0.971
1945+0.968
Neg logit contributions
 SATA-1.443
HAM-1.432
oscope-1.398
 DRAG-1.355
 Turtles-1.315
Token the
Token ID262
Feature activation+4.69e-02

Pos logit contributions
 misunder+0.805
Pri+0.748
 differ+0.692
1945+0.688
 attitudes+0.688
Neg logit contributions
 SATA-1.097
HAM-1.089
oscope-1.064
 DRAG-1.031
 Turtles-1.003
Token veins
Token ID32375
Feature activation+3.46e+00

Pos logit contributions
 SATA+0.035
HAM+0.035
oscope+0.034
 DRAG+0.033
 Turtles+0.032
Neg logit contributions
 misunder-0.033
Pri-0.031
 differ-0.029
1945-0.028
 attitudes-0.028
Token of
Token ID286
Feature activation+2.87e-01

Pos logit contributions
 SATA+2.740
HAM+2.691
oscope+2.668
 Turtles+2.494
 DRAG+2.491
Neg logit contributions
 misunder-2.235
Pri-2.106
esa-1.915
1945-1.907
 differ-1.875
Token Philip
Token ID14576
Feature activation+5.27e-02

Pos logit contributions
 SATA+0.146
HAM+0.144
oscope+0.141
 DRAG+0.137
 Turtles+0.133
Neg logit contributions
 misunder-0.113
Pri-0.105
 differ-0.097
1945-0.096
 attitudes-0.095
Token Morris
Token ID14433
Feature activation+6.24e-01

Pos logit contributions
 SATA+0.035
HAM+0.035
oscope+0.034
 DRAG+0.033
 Turtles+0.031
Neg logit contributions
 misunder-0.041
Pri-0.039
 differ-0.036
1945-0.036
 attitudes-0.036
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
 SATA+0.481
HAM+0.476
oscope+0.465
 DRAG+0.449
 Turtles+0.435
Neg logit contributions
 misunder-0.425
Pri-0.398
 differ-0.372
 attitudes-0.369
1945-0.368
Token Victor
Token ID12622
Feature activation+1.59e+00

Pos logit contributions
 SATA+0.127
HAM+0.126
oscope+0.122
 DRAG+0.118
 Turtles+0.115
Neg logit contributions
 misunder-0.109
Pri-0.102
 differ-0.094
1945-0.094
 attitudes-0.093
Token De
Token ID1024
Feature activation+2.34e-01

Pos logit contributions
 SATA+1.154
HAM+1.131
oscope+1.111
 DRAG+1.064
 Turtles+1.040
Neg logit contributions
 misunder-1.158
Pri-1.078
 differ-1.014
1945-1.009
 attitudes-1.002
TokenNob
Token ID21191
Feature activation-3.83e+00

Pos logit contributions
 SATA+0.180
HAM+0.177
oscope+0.173
 DRAG+0.167
 Turtles+0.164
Neg logit contributions
 misunder-0.159
Pri-0.147
1945-0.139
 differ-0.138
 attitudes-0.138
Tokenel
Token ID417
Feature activation+1.56e-02

Pos logit contributions
 misunder+2.248
Pri+2.169
esa+1.990
 differ+1.987
 attitudes+1.953
Neg logit contributions
 SATA-3.213
HAM-3.185
oscope-3.093
 DRAG-3.050
 Turtles-2.916
Token and
Token ID290
Feature activation-1.43e-01

Pos logit contributions
 SATA+0.012
HAM+0.012
oscope+0.012
 DRAG+0.011
 Turtles+0.011
Neg logit contributions
 misunder-0.010
Pri-0.010
 differ-0.009
1945-0.009
 attitudes-0.009
Token Paul
Token ID3362
Feature activation-7.33e-01

Pos logit contributions
 misunder+0.096
Pri+0.090
 differ+0.083
1945+0.083
 attitudes+0.083
Neg logit contributions
 SATA-0.111
HAM-0.110
oscope-0.108
 DRAG-0.104
 Turtles-0.101
Token Me
Token ID2185
Feature activation-2.01e+00

Pos logit contributions
 misunder+0.536
Pri+0.506
 differ+0.474
1945+0.472
 attitudes+0.470
Neg logit contributions
 SATA-0.524
HAM-0.519
oscope-0.505
 DRAG-0.488
 Turtles-0.471
Tokenle
Token ID293
Feature activation+1.01e+00

Pos logit contributions
 misunder+1.315
Pri+1.243
 differ+1.152
esa+1.145
1945+1.143
Neg logit contributions
 SATA-1.594
HAM-1.575
oscope-1.534
 DRAG-1.497
 Turtles-1.447
Token to
Token ID284
Feature activation+6.72e-02

Pos logit contributions
 SATA+0.211
HAM+0.209
oscope+0.203
 DRAG+0.195
 Turtles+0.190
Neg logit contributions
 misunder-0.196
Pri-0.183
1945-0.171
 differ-0.170
 attitudes-0.169
Token take
Token ID1011
Feature activation-1.59e+00

Pos logit contributions
 SATA+0.049
HAM+0.049
oscope+0.048
 DRAG+0.046
 Turtles+0.045
Neg logit contributions
 misunder-0.048
Pri-0.045
 differ-0.043
 attitudes-0.042
1945-0.042
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
 misunder+0.962
Pri+0.895
 differ+0.825
1945+0.820
 attitudes+0.818
Neg logit contributions
 SATA-1.337
HAM-1.325
oscope-1.296
 DRAG-1.256
 Turtles-1.222
Token Ring
Token ID12569
Feature activation-1.23e+00

Pos logit contributions
 misunder+0.220
Pri+0.207
 differ+0.193
1945+0.191
 attitudes+0.191
Neg logit contributions
 SATA-0.238
HAM-0.236
oscope-0.230
 DRAG-0.222
 Turtles-0.215
Token from
Token ID422
Feature activation-5.07e-01

Pos logit contributions
 misunder+0.779
Pri+0.730
 differ+0.674
1945+0.670
 attitudes+0.668
Neg logit contributions
 SATA-1.003
HAM-0.992
oscope-0.970
 DRAG-0.940
 Turtles-0.913
Token Fro
Token ID9734
Feature activation-3.21e+00

Pos logit contributions
 misunder+0.328
Pri+0.306
 differ+0.283
1945+0.282
 attitudes+0.281
Neg logit contributions
 SATA-0.408
HAM-0.404
oscope-0.394
 DRAG-0.382
 Turtles-0.371
Tokendo
Token ID4598
Feature activation-1.11e+00

Pos logit contributions
Pri+1.506
 misunder+1.476
esa+1.418
lves+1.394
 differ+1.373
Neg logit contributions
 SATA-2.984
HAM-2.965
 DRAG-2.892
oscope-2.891
 Turtles-2.802
Token,
Token ID11
Feature activation-1.56e-01

Pos logit contributions
 misunder+0.752
Pri+0.705
 differ+0.656
1945+0.653
 attitudes+0.651
Neg logit contributions
 SATA-0.851
HAM-0.843
oscope-0.822
 DRAG-0.795
 Turtles-0.772
Token but
Token ID475
Feature activation-4.48e-01

Pos logit contributions
 misunder+0.101
Pri+0.095
 differ+0.088
1945+0.087
 attitudes+0.087
Neg logit contributions
 SATA-0.125
HAM-0.124
oscope-0.121
 DRAG-0.117
 Turtles-0.114
Token Fro
Token ID9734
Feature activation-3.08e+00

Pos logit contributions
 misunder+0.294
Pri+0.275
 differ+0.256
1945+0.254
 attitudes+0.254
Neg logit contributions
 SATA-0.356
HAM-0.353
oscope-0.344
 DRAG-0.333
 Turtles-0.323
Tokendo
Token ID4598
Feature activation-1.09e+00

Pos logit contributions
Pri+1.451
 misunder+1.346
esa+1.340
lves+1.337
1945+1.323
Neg logit contributions
 SATA-2.859
HAM-2.858
 DRAG-2.798
oscope-2.772
 Turtles-2.709