TOP ACTIVATIONS
MAX = 1.005

Russia s Investigative Committee has finished its investigation
- 375 7 , mel bourne - 3000 , vic
See more Go of s C offer
| Select Width , Height Write - Ver b ose -
as guest on The Ar sen io Hall Show in 1990
religion in Brazil are inex act . According to the census
and William Bret z ger / The News Journal
territories , did not immediately respond to requests for comment .
the 7 - 9 year max ). The problem
atives . <|endoftext|> FILE : Delhi Law Minister and
's flat , so at best could only have been retail
($ 10 . 6 million cap hit in 2017 ), Mario
) party | AFP / Getty Opinion It won
Sign Up : Stay on top of the latest breaking film
and - f inance , aust ral ia First
priority . But …. " Ah ."
arn al Abuse by De ce it , Joyce
. Disclaimer : © 2015 Money Morning and Money
ve watched in amaz ement as this incredible community
Read more The peer chairs a Lords EU subcommittee

MIN ACTIVATIONS
MIN = -1.342

J aaaa asper Profile Blog Joined April 2012 United States 5
TB one - Profile Blog Joined November 2010 United States 21
Dark horse Profile Blog Joined December 2011 United States 228
K ier an Profile Blog Joined August 2011 United States 6
Big Fan Profile Blog Joined December 2010 TL AD T
re i Profile Blog Joined October 2002 United States 3
2012 United States 3 240 Posts # 10 Happy it worked
2010 United States 10 53 Posts # 15 honestly if we
ada ver BB Profile Blog Joined June 2009 Germany 247 34
2012 United States 5 667 Posts # 12 I 'm glad
2010 United States 21 50 Posts # 16 Glad to hear
June 2012 Canada 16 10 Posts # 14 On September 20
TL AD T 235 21 Posts # 18 Good stuff !
Hide Caption 8 of 15 Photos : Carly Fiorina 's political
heart l xp Profile Joined September 2010 United States 10
2002 United States 3 472 Posts # 11 On September 20
of Dal K Profile Blog Joined June 2012 Canada 16 10
14 Recommend Recommend Share this comment on Facebook Share
55 Recommend Recommend Share this comment on Facebook Share
2011 United States 228 65 Posts # 13 On September 20

INTERVAL 0.418 - 1.005
CONTAINS 1.127%

6 { ( 4 s )} 60 . e 7 {
. The Marlins are one of only three
unite and demand common - sense gun laws will we be
up front . Moreover , upset that he was not getting
But at the same time he realizes he

INTERVAL -0.168 - 0.418
CONTAINS 84.215%

adjust our pronouns accordingly ( and instantly ) or else we
Changing The competitive landscape , however , constantly pushes
when a man poured alcohol on her and lit a match
people , as well as if you don 't feel confident
dimensions and possible functions . Merrill - Pal mer Quarterly ,

INTERVAL -0.755 - -0.168
CONTAINS 14.636%

Far mer said the only time he remembered the trip being
aspects , has some nice he ft , and feels comfortable
demand that he commit se pp uku to regain the honor
as Google Doc s and Google Sites , would not work
о т е р е (

INTERVAL -1.342 - -0.755
CONTAINS 0.022%

Ko Do Profile Joined December 2010 United States 6
55 Recommend Recommend Share this comment on Facebook Share
: 47 ] < met ros xual th ug > le
in The USA and will ship to you direct from the
19 Recommend Recommend Share this comment on Facebook Share
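
The four INTERVAL rows above appear to split the range from MIN (-1.342) to MAX (1.005) into equal-width bins. That is an inference from the listed numbers, not something the listing states. The sketch below recomputes the bin edges under that assumption and checks that the CONTAINS percentages cover the whole distribution. The helper name `interval_edges` and the parameter `n_bins` are hypothetical, introduced only for illustration.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in the listing (already rounded to three decimals).
edges = interval_edges(-1.342, 1.005)

# Pair consecutive edges into (lower, upper) intervals, highest first as in the listing.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]

# CONTAINS percentages copied from the listing, in the same top-to-bottom order.
shares = [1.127, 84.215, 14.636, 0.022]

for (lower, upper), share in zip(intervals, shares):
    print(f"INTERVAL {lower:.3f} - {upper:.3f}  CONTAINS {share:.3f}%")
print(f"total {sum(shares):.3f}%")
```

Because MIN and MAX are themselves rounded, the recomputed edges may differ from the printed ones in the last digit. The shares show that roughly 84% of activations fall in the bin that straddles zero, while the lowest bin holds only 0.022% of them.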
Token
Token ID198
Feature activation-2.18e-01

Pos logit contributions
 appre+0.661
��+0.643
enegger+0.573
 roy+0.566
 lest+0.566
Neg logit contributions
izoph-0.781
attered-0.724
 MIS-0.718
senal-0.704
BRE-0.688
TokenRussia
Token ID16347
Feature activation+2.12e-01

Pos logit contributions
 appre+0.852
��+0.837
enegger+0.756
 obser+0.746
 lest+0.744
Neg logit contributions
izoph-0.653
attered-0.599
 MIS-0.596
BRE-0.565
ILA-0.558
Token
Token ID447
Feature activation+6.20e-02

Pos logit contributions
izoph+0.679
attered+0.631
 MIS+0.625
BRE+0.607
ILA+0.590
Neg logit contributions
 appre-0.773
��-0.771
enegger-0.690
 obser-0.676
 lest-0.667
Token
Token ID247
Feature activation-1.29e-01

Pos logit contributions
izoph+0.205
attered+0.190
 MIS+0.189
ILA+0.184
BRE+0.183
Neg logit contributions
 appre-0.224
��-0.208
enegger-0.199
 obser-0.197
 lest-0.191
Tokens
Token ID82
Feature activation+4.11e-02

Pos logit contributions
 appre+0.483
��+0.477
enegger+0.427
 obser+0.422
 lest+0.419
Neg logit contributions
izoph-0.409
attered-0.378
 MIS-0.375
BRE-0.360
ILA-0.354
Token Investigative
Token ID43965
Feature activation+1.01e+00

Pos logit contributions
izoph+0.141
attered+0.131
 MIS+0.131
BRE+0.125
ILA+0.122
Neg logit contributions
 appre-0.143
��-0.142
enegger-0.126
 obser-0.124
 lest-0.122
Token Committee
Token ID4606
Feature activation+5.39e-02

Pos logit contributions
izoph+2.809
 MIS+2.674
attered+2.598
BRE+2.515
Eye+2.449
Neg logit contributions
��-4.113
 appre-3.970
enegger-3.705
 obser-3.550
lain-3.513
Token has
Token ID468
Feature activation-8.87e-02

Pos logit contributions
izoph+0.180
attered+0.167
 MIS+0.167
BRE+0.160
ILA+0.157
Neg logit contributions
 appre-0.190
��-0.189
enegger-0.168
 obser-0.165
 lest-0.164
Token finished
Token ID5201
Feature activation-4.38e-02

Pos logit contributions
 appre+0.295
��+0.291
enegger+0.259
 obser+0.254
 lest+0.251
Neg logit contributions
izoph-0.315
attered-0.295
 MIS-0.293
BRE-0.283
ILA-0.277
Token its
Token ID663
Feature activation-2.76e-01

Pos logit contributions
 appre+0.148
��+0.146
enegger+0.131
 obser+0.128
 lest+0.126
Neg logit contributions
izoph-0.154
attered-0.143
 MIS-0.143
BRE-0.138
ILA-0.135
Token investigation
Token ID3645
Feature activation-1.65e-01

Pos logit contributions
 appre+0.983
��+0.971
enegger+0.871
 obser+0.853
 roy+0.844
Neg logit contributions
izoph-0.921
 MIS-0.856
attered-0.855
BRE-0.816
ILA-0.802
Token-
Token ID12
Feature activation+5.91e-01

Pos logit contributions
izoph+1.351
attered+1.268
 MIS+1.259
BRE+1.179
ILA+1.170
Neg logit contributions
 appre-1.548
��-1.502
enegger-1.377
 obser-1.353
 lest-1.346
Token375
Token ID22318
Feature activation+3.37e-01

Pos logit contributions
izoph+2.006
attered+1.930
 MIS+1.865
speech+1.785
BRE+1.767
Neg logit contributions
 appre-2.022
��-1.984
enegger-1.795
 obser-1.784
 lest-1.720
Token7
Token ID22
Feature activation+3.43e-01

Pos logit contributions
izoph+1.116
attered+1.068
 MIS+1.057
BRE+1.018
Eye+0.992
Neg logit contributions
 appre-1.195
��-1.177
 obser-1.057
enegger-1.036
 accomp-1.009
Token,
Token ID11
Feature activation+3.50e-01

Pos logit contributions
izoph+1.176
attered+1.122
 MIS+1.114
BRE+1.056
 HIP+1.043
Neg logit contributions
 appre-1.170
��-1.163
 obser-1.040
enegger-1.021
 lest-0.996
Token mel
Token ID7758
Feature activation+6.34e-01

Pos logit contributions
izoph+1.116
attered+1.034
 MIS+1.033
BRE+0.976
 HIP+0.960
Neg logit contributions
 appre-1.295
��-1.277
enegger-1.154
 obser-1.129
 lest-1.118
Tokenbourne
Token ID12544
Feature activation+9.75e-01

Pos logit contributions
izoph+2.332
attered+2.212
 MIS+2.142
BRE+2.074
ILA+2.046
Neg logit contributions
 appre-2.046
��-2.044
 obser-1.799
enegger-1.726
 lest-1.723
Token-
Token ID12
Feature activation+3.97e-01

Pos logit contributions
izoph+2.866
attered+2.757
 MIS+2.739
Eye+2.529
BRE+2.503
Neg logit contributions
 appre-3.738
��-3.615
 obser-3.400
enegger-3.368
 lest-3.283
Token3000
Token ID23924
Feature activation+5.21e-01

Pos logit contributions
izoph+1.522
attered+1.487
 MIS+1.415
BRE+1.363
speech+1.362
Neg logit contributions
 appre-1.199
��-1.171
 obser-1.055
enegger-1.036
 lest-1.016
Token,
Token ID11
Feature activation+2.99e-01

Pos logit contributions
izoph+1.739
attered+1.645
 MIS+1.641
BRE+1.556
Eye+1.537
Neg logit contributions
 appre-1.812
��-1.783
enegger-1.590
 obser-1.585
 lest-1.555
Token vic
Token ID20429
Feature activation-7.51e-02

Pos logit contributions
izoph+0.989
attered+0.918
 MIS+0.918
BRE+0.870
 HIP+0.855
Neg logit contributions
 appre-1.068
��-1.056
enegger-0.948
 obser-0.929
 lest-0.918
Token
Token ID198
Feature activation+5.16e-01

Pos logit contributions
 appre+0.253
��+0.248
enegger+0.222
 obser+0.217
 lest+0.216
Neg logit contributions
izoph-0.265
attered-0.249
 MIS-0.248
BRE-0.235
ILA-0.234
Token See
Token ID4091
Feature activation+3.38e-01

Pos logit contributions
 appre+0.289
��+0.283
enegger+0.250
 obser+0.248
 roy+0.244
Neg logit contributions
izoph-0.294
 MIS-0.276
attered-0.275
BRE-0.265
ILA-0.261
Token more
Token ID517
Feature activation-8.91e-03

Pos logit contributions
izoph+1.117
 MIS+1.068
attered+1.056
BRE+1.013
ILA+0.993
Neg logit contributions
 appre-1.186
��-1.173
enegger-1.041
 obser-1.021
 roy-1.013
Token
Token ID198
Feature activation+2.55e-01

Pos logit contributions
 appre+0.031
��+0.031
enegger+0.027
 obser+0.027
 lest+0.027
Neg logit contributions
izoph-0.030
attered-0.028
 MIS-0.028
BRE-0.027
ILA-0.027
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
izoph+0.885
attered+0.830
 MIS+0.828
BRE+0.798
ILA+0.783
Neg logit contributions
 appre-0.881
��-0.866
 obser-0.767
enegger-0.764
 roy-0.743
TokenGo
Token ID5247
Feature activation+1.74e-01

Pos logit contributions
izoph+0.414
attered+0.386
 MIS+0.384
BRE+0.370
ILA+0.364
Neg logit contributions
 appre-0.460
��-0.454
enegger-0.403
 obser-0.401
 roy-0.396
Tokenof
Token ID1659
Feature activation+9.70e-01

Pos logit contributions
izoph+0.573
attered+0.546
 MIS+0.543
BRE+0.517
ILA+0.515
Neg logit contributions
 appre-0.614
��-0.607
enegger-0.540
 obser-0.535
 roy-0.526
Tokens
Token ID82
Feature activation+1.71e-01

Pos logit contributions
izoph+2.763
 MIS+2.631
attered+2.617
BRE+2.535
ILA+2.479
Neg logit contributions
 appre-3.786
��-3.738
enegger-3.408
 obser-3.408
 lest-3.309
Token
Token ID198
Feature activation+1.58e-01

Pos logit contributions
izoph+0.591
 MIS+0.553
attered+0.552
BRE+0.529
ILA+0.521
Neg logit contributions
 appre-0.585
��-0.578
enegger-0.510
 obser-0.502
 lest-0.499
Token
Token ID198
Feature activation+7.08e-02

Pos logit contributions
izoph+0.562
attered+0.524
 MIS+0.522
BRE+0.502
ILA+0.495
Neg logit contributions
 appre-0.527
��-0.516
enegger-0.458
 obser-0.452
 lest-0.447
TokenC
Token ID34
Feature activation-5.49e-02

Pos logit contributions
izoph+0.203
attered+0.187
 MIS+0.186
BRE+0.179
ILA+0.175
Neg logit contributions
 appre-0.286
��-0.282
enegger-0.255
 obser-0.253
 roy-0.251
Tokenoffer
Token ID47895
Feature activation+2.12e-01

Pos logit contributions
 appre+0.211
��+0.208
enegger+0.186
 obser+0.185
 lest+0.183
Neg logit contributions
izoph-0.169
attered-0.157
 MIS-0.154
BRE-0.148
ILA-0.146
Token |
Token ID930
Feature activation-3.30e-02

Pos logit contributions
izoph+0.173
attered+0.161
 MIS+0.160
BRE+0.153
ILA+0.151
Neg logit contributions
 appre-0.208
��-0.203
enegger-0.183
 obser-0.181
 roy-0.179
Token Select
Token ID9683
Feature activation+5.77e-01

Pos logit contributions
 appre+0.129
��+0.126
enegger+0.115
 obser+0.113
 roy+0.112
Neg logit contributions
izoph-0.100
attered-0.092
 MIS-0.092
BRE-0.087
ILA-0.086
Token Width
Token ID38807
Feature activation+5.48e-01

Pos logit contributions
izoph+1.777
attered+1.653
 MIS+1.632
ILA+1.544
BRE+1.539
Neg logit contributions
 appre-2.216
��-2.141
enegger-1.943
 obser-1.941
 roy-1.911
Token,
Token ID11
Feature activation+3.43e-01

Pos logit contributions
izoph+1.827
attered+1.707
 MIS+1.704
BRE+1.650
ILA+1.620
Neg logit contributions
 appre-1.953
��-1.891
enegger-1.703
 obser-1.695
 roy-1.658
TokenHeight
Token ID23106
Feature activation+4.34e-01

Pos logit contributions
izoph+1.228
attered+1.146
 MIS+1.128
BRE+1.094
ILA+1.093
Neg logit contributions
 appre-1.156
��-1.117
enegger-0.997
 obser-0.987
 roy-0.979
Token Write
Token ID19430
Feature activation+9.56e-01

Pos logit contributions
izoph+1.329
attered+1.234
 MIS+1.230
BRE+1.177
ILA+1.162
Neg logit contributions
 appre-1.660
��-1.625
enegger-1.476
 obser-1.453
 roy-1.433
Token-
Token ID12
Feature activation+4.81e-02

Pos logit contributions
izoph+2.983
attered+2.785
 MIS+2.750
BRE+2.648
Eye+2.588
Neg logit contributions
 appre-3.573
��-3.461
enegger-3.138
 obser-3.110
 roy-3.096
TokenVer
Token ID13414
Feature activation+1.44e-01

Pos logit contributions
izoph+0.114
attered+0.103
 MIS+0.101
BRE+0.096
ILA+0.094
Neg logit contributions
 appre-0.218
��-0.215
enegger-0.198
 obser-0.195
 roy-0.194
Tokenb
Token ID65
Feature activation+1.45e-01

Pos logit contributions
izoph+0.444
attered+0.414
 MIS+0.408
BRE+0.395
ILA+0.386
Neg logit contributions
 appre-0.548
��-0.542
enegger-0.486
 obser-0.484
 roy-0.478
Tokenose
Token ID577
Feature activation+2.07e-01

Pos logit contributions
izoph+0.263
attered+0.232
 MIS+0.227
BRE+0.209
ILA+0.203
Neg logit contributions
 appre-0.740
��-0.728
enegger-0.679
 obser-0.671
 roy-0.669
Token -
Token ID532
Feature activation+2.70e-01

Pos logit contributions
izoph+0.654
 MIS+0.612
attered+0.611
BRE+0.586
ILA+0.576
Neg logit contributions
 appre-0.772
��-0.755
enegger-0.682
 obser-0.675
 roy-0.670
Token as
Token ID355
Feature activation+1.84e-01

Pos logit contributions
izoph+0.745
attered+0.693
 MIS+0.693
BRE+0.657
ILA+0.647
Neg logit contributions
 appre-0.859
��-0.847
enegger-0.761
 obser-0.747
 lest-0.741
Token guest
Token ID8319
Feature activation+1.04e-01

Pos logit contributions
izoph+0.584
 MIS+0.542
attered+0.541
BRE+0.512
 HIP+0.504
Neg logit contributions
 appre-0.683
��-0.673
enegger-0.602
 obser-0.594
 lest-0.593
Token on
Token ID319
Feature activation-6.96e-02

Pos logit contributions
izoph+0.423
attered+0.398
 MIS+0.398
BRE+0.383
 HIP+0.378
Neg logit contributions
 appre-0.295
��-0.290
enegger-0.252
 lest-0.244
 obser-0.244
Token The
Token ID383
Feature activation-7.03e-02

Pos logit contributions
 appre+0.253
��+0.248
enegger+0.222
 obser+0.220
 lest+0.218
Neg logit contributions
izoph-0.227
 MIS-0.211
attered-0.211
BRE-0.201
ILA-0.197
Token Ar
Token ID943
Feature activation-1.09e-01

Pos logit contributions
 appre+0.260
��+0.255
enegger+0.228
 obser+0.226
 lest+0.224
Neg logit contributions
izoph-0.226
 MIS-0.211
attered-0.210
BRE-0.200
ILA-0.197
Tokensen
Token ID6248
Feature activation+9.52e-01

Pos logit contributions
 appre+0.390
��+0.383
enegger+0.342
 obser+0.338
 lest+0.335
Neg logit contributions
izoph-0.364
attered-0.338
 MIS-0.334
BRE-0.322
ILA-0.316
Tokenio
Token ID952
Feature activation-1.33e-01

Pos logit contributions
izoph+3.980
attered+3.752
 MIS+3.699
BRE+3.631
Eye+3.615
Neg logit contributions
 appre-2.555
��-2.479
 obser-2.125
enegger-2.068
 accomp-2.058
Token Hall
Token ID4789
Feature activation-8.24e-02

Pos logit contributions
 appre+0.416
��+0.407
enegger+0.358
 obser+0.354
 roy+0.348
Neg logit contributions
izoph-0.501
attered-0.471
 MIS-0.469
BRE-0.452
ILA-0.446
Token Show
Token ID5438
Feature activation-4.46e-01

Pos logit contributions
 appre+0.295
��+0.289
enegger+0.259
 obser+0.255
 lest+0.253
Neg logit contributions
izoph-0.274
attered-0.254
 MIS-0.253
BRE-0.243
ILA-0.239
Token in
Token ID287
Feature activation-1.59e-01

Pos logit contributions
 appre+1.541
��+1.507
enegger+1.347
 obser+1.324
 lest+1.319
Neg logit contributions
izoph-1.540
attered-1.427
 MIS-1.418
BRE-1.362
ILA-1.348
Token 1990
Token ID6303
Feature activation-1.87e-01

Pos logit contributions
 appre+0.655
��+0.646
enegger+0.586
 obser+0.579
 roy+0.575
Neg logit contributions
izoph-0.439
attered-0.402
 MIS-0.401
BRE-0.378
ILA-0.371
Token religion
Token ID5737
Feature activation-2.07e-02

Pos logit contributions
izoph+0.240
 MIS+0.224
attered+0.224
BRE+0.213
 HIP+0.211
Neg logit contributions
��-0.257
 appre-0.256
enegger-0.230
 obser-0.223
 lest-0.221
Token in
Token ID287
Feature activation+4.67e-02

Pos logit contributions
 appre+0.070
��+0.069
enegger+0.062
 obser+0.061
 lest+0.060
Neg logit contributions
izoph-0.072
attered-0.067
 MIS-0.067
BRE-0.064
ILA-0.063
Token Brazil
Token ID7595
Feature activation+1.09e-01

Pos logit contributions
izoph+0.156
 MIS+0.146
attered+0.146
BRE+0.138
 HIP+0.138
Neg logit contributions
 appre-0.165
��-0.164
enegger-0.146
 obser-0.144
 lest-0.142
Token are
Token ID389
Feature activation+1.78e-01

Pos logit contributions
izoph+0.366
attered+0.341
 MIS+0.340
BRE+0.325
ILA+0.320
Neg logit contributions
 appre-0.386
��-0.383
enegger-0.343
 obser-0.336
 lest-0.332
Token inex
Token ID16087
Feature activation+6.89e-02

Pos logit contributions
izoph+0.633
attered+0.596
 MIS+0.593
BRE+0.565
 HIP+0.564
Neg logit contributions
 appre-0.588
��-0.584
enegger-0.527
 obser-0.507
 roy-0.498
Tokenact
Token ID529
Feature activation+9.49e-01

Pos logit contributions
izoph+0.257
attered+0.242
 MIS+0.240
BRE+0.231
ILA+0.229
Neg logit contributions
 appre-0.218
��-0.214
enegger-0.190
 obser-0.187
 roy-0.184
Token.
Token ID13
Feature activation-2.53e-02

Pos logit contributions
izoph+3.034
attered+2.953
 MIS+2.889
 HIP+2.762
BRE+2.756
Neg logit contributions
 appre-3.296
��-3.284
enegger-2.956
 obser-2.799
 roy-2.767
Token According
Token ID4784
Feature activation-7.58e-02

Pos logit contributions
 appre+0.087
��+0.085
 obser+0.075
enegger+0.075
 roy+0.073
Neg logit contributions
izoph-0.088
 MIS-0.083
attered-0.083
BRE-0.080
 HIP-0.079
Token to
Token ID284
Feature activation-5.60e-02

Pos logit contributions
 appre+0.252
��+0.247
enegger+0.219
 obser+0.216
 lest+0.213
Neg logit contributions
izoph-0.271
attered-0.253
 MIS-0.252
BRE-0.243
ILA-0.239
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
 appre+0.208
��+0.206
enegger+0.185
 obser+0.180
 lest+0.179
Neg logit contributions
izoph-0.179
 MIS-0.166
attered-0.165
BRE-0.157
 HIP-0.155
Token census
Token ID21649
Feature activation-6.44e-02

Pos logit contributions
 appre+0.373
��+0.369
enegger+0.330
 obser+0.322
 lest+0.320
Neg logit contributions
izoph-0.341
 MIS-0.317
attered-0.315
BRE-0.300
ILA-0.297
Token and
Token ID290
Feature activation+5.82e-01

Pos logit contributions
izoph+0.230
attered+0.215
 MIS+0.214
BRE+0.205
ILA+0.201
Neg logit contributions
 appre-0.306
��-0.303
enegger-0.270
 obser-0.270
 lest-0.268
Token William
Token ID3977
Feature activation-2.77e-02

Pos logit contributions
izoph+1.958
 MIS+1.888
attered+1.858
BRE+1.783
 HIP+1.777
Neg logit contributions
 appre-1.978
��-1.961
enegger-1.747
 obser-1.715
 lest-1.698
Token Bret
Token ID39420
Feature activation+3.25e-02

Pos logit contributions
 appre+0.099
��+0.097
 obser+0.086
enegger+0.086
 lest+0.084
Neg logit contributions
izoph-0.093
 MIS-0.087
attered-0.087
BRE-0.083
ILA-0.082
Tokenz
Token ID89
Feature activation-2.19e-01

Pos logit contributions
izoph+0.111
attered+0.105
 MIS+0.104
BRE+0.099
ILA+0.098
Neg logit contributions
 appre-0.113
��-0.111
 obser-0.098
enegger-0.097
 lest-0.096
Tokenger
Token ID1362
Feature activation+1.77e-01

Pos logit contributions
 appre+0.786
��+0.772
enegger+0.688
 obser+0.681
 lest+0.677
Neg logit contributions
izoph-0.727
attered-0.675
 MIS-0.671
BRE-0.644
ILA-0.634
Token/
Token ID14
Feature activation+9.46e-01

Pos logit contributions
izoph+0.559
attered+0.523
 MIS+0.517
BRE+0.497
ILA+0.487
Neg logit contributions
 appre-0.658
��-0.649
enegger-0.578
 obser-0.576
 lest-0.573
TokenThe
Token ID464
Feature activation+3.15e-01

Pos logit contributions
izoph+2.483
attered+2.350
 MIS+2.313
BRE+2.282
Eye+2.165
Neg logit contributions
 appre-3.908
��-3.878
enegger-3.549
 obser-3.533
 lest-3.507
Token News
Token ID3000
Feature activation+4.88e-01

Pos logit contributions
izoph+0.960
attered+0.896
 MIS+0.891
BRE+0.845
ILA+0.821
Neg logit contributions
 appre-1.206
��-1.191
enegger-1.076
 obser-1.064
 lest-1.054
Token Journal
Token ID4913
Feature activation+9.75e-02

Pos logit contributions
izoph+1.746
attered+1.654
 MIS+1.637
BRE+1.569
Eye+1.531
Neg logit contributions
 appre-1.606
��-1.554
enegger-1.385
 obser-1.371
lain-1.352
Token
Token ID198
Feature activation+6.98e-02

Pos logit contributions
izoph+0.335
attered+0.314
 MIS+0.313
BRE+0.301
ILA+0.295
Neg logit contributions
 appre-0.335
��-0.330
enegger-0.294
 obser-0.291
 lest-0.287
Token
Token ID198
Feature activation-1.54e-02

Pos logit contributions
izoph+0.254
attered+0.236
 MIS+0.235
BRE+0.226
ILA+0.223
Neg logit contributions
 appre-0.228
��-0.223
enegger-0.198
 lest-0.194
 roy-0.194
Token territories
Token ID16771
Feature activation-2.62e-01

Pos logit contributions
izoph+0.030
attered+0.028
 MIS+0.028
BRE+0.027
ILA+0.026
Neg logit contributions
 appre-0.033
��-0.032
enegger-0.029
 obser-0.029
 lest-0.028
Token,
Token ID11
Feature activation-1.72e-01

Pos logit contributions
 appre+0.864
��+0.847
enegger+0.747
 obser+0.735
 roy+0.722
Neg logit contributions
izoph-0.933
attered-0.890
 MIS-0.881
BRE-0.846
 HIP-0.837
Token did
Token ID750
Feature activation+2.07e-01

Pos logit contributions
 appre+0.606
��+0.597
enegger+0.533
 obser+0.524
 roy+0.518
Neg logit contributions
izoph-0.579
attered-0.543
 MIS-0.542
BRE-0.516
ILA-0.510
Token not
Token ID407
Feature activation+2.93e-01

Pos logit contributions
izoph+0.757
attered+0.721
 MIS+0.721
BRE+0.695
 HIP+0.686
Neg logit contributions
 appre-0.655
��-0.648
enegger-0.570
 obser-0.557
 lest-0.549
Token immediately
Token ID3393
Feature activation+4.25e-01

Pos logit contributions
izoph+0.966
 MIS+0.907
attered+0.901
BRE+0.866
 HIP+0.858
Neg logit contributions
��-1.041
 appre-1.021
enegger-0.921
 obser-0.899
 lest-0.870
Token respond
Token ID3031
Feature activation+9.39e-01

Pos logit contributions
izoph+1.288
 MIS+1.225
attered+1.211
BRE+1.173
 HIP+1.146
Neg logit contributions
��-1.601
 appre-1.567
enegger-1.433
 obser-1.394
 roy-1.356
Token to
Token ID284
Feature activation+1.80e-01

Pos logit contributions
izoph+2.996
attered+2.864
 MIS+2.838
BRE+2.787
ILA+2.665
Neg logit contributions
 appre-3.350
��-3.303
enegger-2.966
 obser-2.901
 roy-2.839
Token requests
Token ID7007
Feature activation+5.57e-03

Pos logit contributions
izoph+0.592
attered+0.555
 MIS+0.554
BRE+0.532
ILA+0.522
Neg logit contributions
 appre-0.645
��-0.636
enegger-0.573
 obser-0.560
 roy-0.556
Token for
Token ID329
Feature activation-1.17e-01

Pos logit contributions
izoph+0.018
attered+0.016
 MIS+0.016
BRE+0.016
ILA+0.015
Neg logit contributions
 appre-0.021
��-0.021
enegger-0.018
 obser-0.018
 roy-0.018
Token comment
Token ID2912
Feature activation+1.43e-01

Pos logit contributions
 appre+0.384
��+0.376
enegger+0.333
 obser+0.327
 roy+0.325
Neg logit contributions
izoph-0.422
attered-0.394
 MIS-0.392
BRE-0.378
ILA-0.372
Token.
Token ID13
Feature activation-7.83e-02

Pos logit contributions
izoph+0.451
attered+0.435
 MIS+0.430
BRE+0.415
 HIP+0.403
Neg logit contributions
 appre-0.523
��-0.522
enegger-0.467
 obser-0.458
 roy-0.442
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
izoph+0.034
attered+0.032
 MIS+0.032
BRE+0.030
ILA+0.030
Neg logit contributions
 appre-0.037
��-0.036
enegger-0.032
 obser-0.032
 lest-0.032
Token 7
Token ID767
Feature activation-6.03e-02

Pos logit contributions
 appre+0.344
��+0.338
enegger+0.298
 obser+0.293
 lest+0.289
Neg logit contributions
izoph-0.393
 MIS-0.369
attered-0.367
BRE-0.352
 HIP-0.347
Token-
Token ID12
Feature activation+3.18e-02

Pos logit contributions
 appre+0.254
��+0.249
enegger+0.228
 obser+0.226
 lest+0.222
Neg logit contributions
izoph-0.162
 MIS-0.148
attered-0.148
BRE-0.140
ILA-0.137
Token9
Token ID24
Feature activation-5.86e-02

Pos logit contributions
izoph+0.106
attered+0.099
 MIS+0.099
BRE+0.096
ILA+0.093
Neg logit contributions
 appre-0.113
��-0.111
 obser-0.098
enegger-0.098
 roy-0.096
Token year
Token ID614
Feature activation-6.74e-02

Pos logit contributions
 appre+0.219
��+0.214
enegger+0.193
 obser+0.191
 lest+0.188
Neg logit contributions
izoph-0.185
 MIS-0.173
attered-0.172
BRE-0.164
 HIP-0.162
Token max
Token ID3509
Feature activation+9.28e-01

Pos logit contributions
 appre+0.251
��+0.248
enegger+0.223
 obser+0.219
 lest+0.215
Neg logit contributions
izoph-0.212
 MIS-0.200
attered-0.198
BRE-0.188
 HIP-0.187
Token).
Token ID737
Feature activation+6.29e-02

Pos logit contributions
izoph+2.985
 MIS+2.927
attered+2.826
 HIP+2.759
BRE+2.747
Neg logit contributions
 appre-3.219
��-3.204
enegger-2.823
 obser-2.791
 roy-2.696
Token
Token ID198
Feature activation+1.74e-01

Pos logit contributions
izoph+0.220
 MIS+0.207
attered+0.206
BRE+0.198
ILA+0.195
Neg logit contributions
 appre-0.214
��-0.210
enegger-0.185
 obser-0.184
 roy-0.181
Token
Token ID198
Feature activation+1.33e-01

Pos logit contributions
izoph+0.613
attered+0.573
 MIS+0.571
BRE+0.550
ILA+0.541
Neg logit contributions
 appre-0.591
��-0.580
enegger-0.513
 obser-0.511
 lest-0.500
TokenThe
Token ID464
Feature activation+1.21e-01

Pos logit contributions
izoph+0.390
attered+0.359
 MIS+0.358
BRE+0.345
ILA+0.337
Neg logit contributions
 appre-0.533
��-0.523
enegger-0.474
 obser-0.471
 roy-0.464
Token problem
Token ID1917
Feature activation-2.06e-01

Pos logit contributions
izoph+0.409
 MIS+0.384
attered+0.383
BRE+0.365
 HIP+0.360
Neg logit contributions
 appre-0.420
��-0.415
enegger-0.371
 obser-0.363
 lest-0.357
Tokenatives
Token ID2929
Feature activation+1.23e-01

Pos logit contributions
izoph+0.098
attered+0.092
 MIS+0.090
BRE+0.087
ILA+0.086
Neg logit contributions
 appre-0.098
��-0.096
enegger-0.087
 obser-0.085
 lest-0.084
Token.
Token ID13
Feature activation+3.28e-02

Pos logit contributions
izoph+0.423
 MIS+0.397
attered+0.396
BRE+0.378
ILA+0.373
Neg logit contributions
 appre-0.423
��-0.415
enegger-0.369
 obser-0.364
 roy-0.358
Token
Token ID447
Feature activation+1.93e-02

Pos logit contributions
izoph+0.109
 MIS+0.102
attered+0.101
BRE+0.097
ILA+0.096
Neg logit contributions
 appre-0.118
��-0.115
enegger-0.103
 obser-0.102
 roy-0.101
Token
Token ID251
Feature activation-2.02e-01

Pos logit contributions
izoph+0.080
attered+0.075
 MIS+0.075
BRE+0.073
ILA+0.072
Neg logit contributions
 appre-0.054
��-0.051
enegger-0.045
 obser-0.044
 lest-0.044
Token<|endoftext|>
Token ID50256
Feature activation+8.33e-02

Pos logit contributions
 appre+0.704
��+0.690
enegger+0.616
 obser+0.607
 lest+0.602
Neg logit contributions
izoph-0.693
attered-0.644
 MIS-0.641
BRE-0.615
ILA-0.606
TokenFILE
Token ID25664
Feature activation+9.19e-01

Pos logit contributions
izoph+0.243
attered+0.225
 MIS+0.223
BRE+0.217
ILA+0.211
Neg logit contributions
 appre-0.333
��-0.326
enegger-0.296
 obser-0.293
 lest-0.290
Token:
Token ID25
Feature activation-7.33e-02

Pos logit contributions
izoph+2.428
 MIS+2.275
attered+2.256
BRE+2.157
ILA+2.095
Neg logit contributions
 appre-3.820
��-3.810
 obser-3.417
enegger-3.414
 roy-3.360
Token Delhi
Token ID12517
Feature activation+6.50e-01

Pos logit contributions
 appre+0.252
��+0.247
enegger+0.218
 obser+0.217
 roy+0.214
Neg logit contributions
izoph-0.255
attered-0.238
 MIS-0.238
BRE-0.228
ILA-0.225
Token Law
Token ID3854
Feature activation+2.27e-01

Pos logit contributions
izoph+2.232
 MIS+2.098
attered+2.080
BRE+1.987
 HIP+1.949
Neg logit contributions
 appre-2.237
��-2.210
enegger-1.974
 obser-1.950
 lest-1.889
Token Minister
Token ID4139
Feature activation+9.26e-02

Pos logit contributions
izoph+0.882
 MIS+0.829
attered+0.828
BRE+0.802
ILA+0.787
Neg logit contributions
 appre-0.684
��-0.676
enegger-0.582
 obser-0.577
 roy-0.567
Token and
Token ID290
Feature activation-4.88e-02

Pos logit contributions
izoph+0.326
 MIS+0.306
attered+0.305
BRE+0.290
ILA+0.288
Neg logit contributions
 appre-0.314
��-0.308
enegger-0.273
 obser-0.270
 roy-0.265
Token's
Token ID338
Feature activation-1.42e-02

Pos logit contributions
 appre+0.894
��+0.881
enegger+0.783
 obser+0.779
 roy+0.766
Neg logit contributions
izoph-0.769
attered-0.716
 MIS-0.710
BRE-0.677
ILA-0.670
Token flat
Token ID6228
Feature activation+2.79e-01

Pos logit contributions
 appre+0.042
��+0.042
enegger+0.036
 obser+0.035
 lest+0.035
Neg logit contributions
izoph-0.056
attered-0.052
 MIS-0.052
BRE-0.050
 HIP-0.050
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
izoph+0.915
attered+0.872
 MIS+0.866
BRE+0.824
 HIP+0.813
Neg logit contributions
 appre-0.994
��-0.976
enegger-0.872
 obser-0.855
 roy-0.836
Token so
Token ID523
Feature activation+1.40e-02

Pos logit contributions
 appre+0.342
��+0.338
enegger+0.302
 obser+0.294
 roy+0.290
Neg logit contributions
izoph-0.347
 MIS-0.329
attered-0.329
BRE-0.313
 HIP-0.310
Token at
Token ID379
Feature activation-1.92e-02

Pos logit contributions
izoph+0.048
attered+0.044
 MIS+0.044
BRE+0.042
ILA+0.042
Neg logit contributions
 appre-0.049
��-0.048
enegger-0.043
 obser-0.043
 roy-0.042
Token best
Token ID1266
Feature activation+9.16e-01

Pos logit contributions
 appre+0.066
��+0.065
enegger+0.058
 obser+0.057
 roy+0.056
Neg logit contributions
izoph-0.066
attered-0.062
 MIS-0.062
BRE-0.059
ILA-0.058
Token could
Token ID714
Feature activation+3.36e-01

Pos logit contributions
izoph+2.959
 MIS+2.848
attered+2.787
 HIP+2.631
BRE+2.629
Neg logit contributions
��-3.257
 appre-3.209
enegger-2.894
 obser-2.799
esides-2.747
Token only
Token ID691
Feature activation+1.10e-01

Pos logit contributions
izoph+1.039
 MIS+0.990
attered+0.981
BRE+0.935
 HIP+0.911
Neg logit contributions
��-1.262
 appre-1.212
enegger-1.096
 obser-1.082
 lest-1.056
Token have
Token ID423
Feature activation-1.08e-01

Pos logit contributions
izoph+0.376
 MIS+0.354
attered+0.353
BRE+0.335
 HIP+0.330
Neg logit contributions
��-0.383
 appre-0.374
enegger-0.337
 obser-0.325
 lest-0.322
Token been
Token ID587
Feature activation+4.82e-02

Pos logit contributions
 appre+0.393
��+0.393
enegger+0.352
 obser+0.343
 lest+0.340
Neg logit contributions
izoph-0.349
attered-0.326
 MIS-0.325
BRE-0.309
 HIP-0.303
Token retail
Token ID6308
Feature activation-1.02e-02

Pos logit contributions
izoph+0.148
attered+0.138
 MIS+0.138
BRE+0.130
ILA+0.128
Neg logit contributions
 appre-0.182
��-0.180
enegger-0.163
 obser-0.160
 lest-0.158
Token ($
Token ID7198
Feature activation+3.03e-01

Pos logit contributions
izoph+0.541
attered+0.509
 MIS+0.508
BRE+0.489
ILA+0.482
Neg logit contributions
 appre-0.527
��-0.522
enegger-0.460
 obser-0.454
 roy-0.443
Token10
Token ID940
Feature activation-1.86e-01

Pos logit contributions
izoph+1.080
attered+1.024
 MIS+1.019
BRE+0.990
ILA+0.971
Neg logit contributions
 appre-1.012
��-0.987
 obser-0.877
enegger-0.869
 roy-0.846
Token.
Token ID13
Feature activation+5.06e-02

Pos logit contributions
 appre+0.681
��+0.667
enegger+0.599
 obser+0.590
 lest+0.586
Neg logit contributions
izoph-0.606
attered-0.562
 MIS-0.559
BRE-0.534
ILA-0.527
Token6
Token ID21
Feature activation-2.60e-01

Pos logit contributions
izoph+0.169
attered+0.160
 MIS+0.160
BRE+0.153
ILA+0.151
Neg logit contributions
 appre-0.181
��-0.177
 obser-0.158
enegger-0.156
 roy-0.153
Token million
Token ID1510
Feature activation+4.46e-02

Pos logit contributions
 appre+0.967
��+0.947
enegger+0.852
 obser+0.840
 lest+0.832
Neg logit contributions
izoph-0.829
attered-0.768
 MIS-0.765
BRE-0.730
ILA-0.719
Token cap
Token ID1451
Feature activation+9.13e-01

Pos logit contributions
izoph+0.161
attered+0.152
 MIS+0.151
BRE+0.146
ILA+0.143
Neg logit contributions
 appre-0.145
��-0.143
enegger-0.126
 obser-0.124
 roy-0.122
Token hit
Token ID2277
Feature activation+1.51e-03

Pos logit contributions
izoph+3.042
attered+2.882
 MIS+2.869
BRE+2.783
 HIP+2.717
Neg logit contributions
 appre-3.127
��-3.107
enegger-2.752
 obser-2.738
 roy-2.663
Token in
Token ID287
Feature activation+2.51e-01

Pos logit contributions
izoph+0.005
attered+0.005
 MIS+0.005
BRE+0.005
ILA+0.005
Neg logit contributions
 appre-0.005
��-0.005
enegger-0.004
 obser-0.004
 roy-0.004
Token 2017
Token ID2177
Feature activation-2.13e-02

Pos logit contributions
izoph+0.779
attered+0.725
 MIS+0.724
BRE+0.688
ILA+0.678
Neg logit contributions
 appre-0.948
��-0.933
enegger-0.839
 obser-0.833
 roy-0.820
Token),
Token ID828
Feature activation-4.20e-02

Pos logit contributions
 appre+0.070
��+0.069
enegger+0.060
 obser+0.060
 roy+0.059
Neg logit contributions
izoph-0.077
attered-0.072
 MIS-0.072
BRE-0.069
ILA-0.068
Token Mario
Token ID10682
Feature activation+2.11e-01

Pos logit contributions
 appre+0.144
��+0.142
enegger+0.126
 obser+0.124
 roy+0.122
Neg logit contributions
izoph-0.145
attered-0.136
 MIS-0.135
BRE-0.130
ILA-0.128
Token)
Token ID8
Feature activation-1.38e-01

Pos logit contributions
 appre+1.395
��+1.367
enegger+1.182
 obser+1.159
 lest+1.152
Neg logit contributions
izoph-2.003
attered-1.881
 MIS-1.873
BRE-1.810
ILA-1.787
Token party
Token ID2151
Feature activation+6.62e-02

Pos logit contributions
 appre+0.470
��+0.462
enegger+0.411
 obser+0.404
 lest+0.400
Neg logit contributions
izoph-0.484
attered-0.452
 MIS-0.451
BRE-0.431
ILA-0.426
Token |
Token ID930
Feature activation+4.85e-01

Pos logit contributions
izoph+0.230
attered+0.215
 MIS+0.215
BRE+0.205
ILA+0.203
Neg logit contributions
 appre-0.226
��-0.222
enegger-0.198
 obser-0.194
 lest-0.192
Token AFP
Token ID8916
Feature activation+4.96e-01

Pos logit contributions
izoph+1.650
 MIS+1.558
attered+1.549
BRE+1.465
 HIP+1.465
Neg logit contributions
 appre-1.681
��-1.648
enegger-1.467
 obser-1.447
 lest-1.425
Token/
Token ID14
Feature activation+4.22e-01

Pos logit contributions
izoph+1.390
 MIS+1.307
attered+1.300
BRE+1.249
ILA+1.234
Neg logit contributions
 appre-1.993
��-1.964
enegger-1.775
 obser-1.771
 roy-1.750
TokenGetty
Token ID6633
Feature activation+9.06e-01

Pos logit contributions
izoph+1.128
attered+1.055
 MIS+1.049
BRE+1.041
ILA+1.026
Neg logit contributions
 appre-1.769
��-1.753
 obser-1.593
enegger-1.549
 roy-1.545
Token Opinion
Token ID29525
Feature activation+3.37e-01

Pos logit contributions
izoph+2.622
 MIS+2.478
attered+2.471
BRE+2.364
Eye+2.291
Neg logit contributions
 appre-3.545
��-3.491
 obser-3.118
enegger-3.109
 lest-3.088
Token It
Token ID632
Feature activation+5.06e-01

Pos logit contributions
izoph+1.286
attered+1.223
 MIS+1.217
BRE+1.173
 HIP+1.146
Neg logit contributions
 appre-1.035
��-1.003
enegger-0.890
 obser-0.871
 lest-0.860
Token won
Token ID1839
Feature activation+1.45e-01

Pos logit contributions
izoph+1.537
attered+1.464
 MIS+1.428
BRE+1.363
ILA+1.340
Neg logit contributions
��-1.913
 appre-1.900
 obser-1.696
enegger-1.684
 lest-1.670
Token
Token ID447
Feature activation-1.39e-02

Pos logit contributions
izoph+0.425
attered+0.394
 MIS+0.394
BRE+0.373
ILA+0.365
Neg logit contributions
 appre-0.574
��-0.569
enegger-0.514
 obser-0.506
 lest-0.499
Token
Token ID247
Feature activation-1.59e-01

Pos logit contributions
 appre+0.052
��+0.049
enegger+0.046
 obser+0.045
 lest+0.045
Neg logit contributions
izoph-0.044
attered-0.041
 MIS-0.041
BRE-0.039
ILA-0.039
TokenSign
Token ID11712
Feature activation+4.38e-02

Pos logit contributions
 appre+0.169
��+0.165
enegger+0.150
 obser+0.148
 roy+0.146
Neg logit contributions
izoph-0.128
attered-0.119
 MIS-0.118
BRE-0.115
ILA-0.112
Token Up
Token ID3205
Feature activation-2.05e-02

Pos logit contributions
izoph+0.148
attered+0.140
 MIS+0.139
BRE+0.132
ILA+0.131
Neg logit contributions
 appre-0.153
��-0.148
enegger-0.134
 obser-0.132
 roy-0.130
Token:
Token ID25
Feature activation-1.42e-02

Pos logit contributions
 appre+0.070
��+0.068
enegger+0.061
 obser+0.060
 roy+0.059
Neg logit contributions
izoph-0.072
attered-0.067
 MIS-0.067
BRE-0.064
ILA-0.063
Token Stay
Token ID16160
Feature activation+6.04e-01

Pos logit contributions
 appre+0.054
��+0.053
enegger+0.048
 obser+0.048
 roy+0.047
Neg logit contributions
izoph-0.043
attered-0.040
 MIS-0.040
BRE-0.038
ILA-0.037
Token on
Token ID319
Feature activation-2.36e-02

Pos logit contributions
izoph+1.921
attered+1.890
 HIP+1.788
 MIS+1.778
BRE+1.725
Neg logit contributions
 appre-2.178
��-2.125
 obser-1.998
enegger-1.993
 accomp-1.873
Token top
Token ID1353
Feature activation+9.06e-01

Pos logit contributions
 appre+0.069
��+0.067
enegger+0.059
 lest+0.058
 roy+0.058
Neg logit contributions
izoph-0.094
attered-0.088
 MIS-0.087
BRE-0.085
ILA-0.084
Token of
Token ID286
Feature activation+2.72e-01

Pos logit contributions
izoph+2.583
 MIS+2.573
attered+2.547
BRE+2.455
 HIP+2.442
Neg logit contributions
 appre-3.531
��-3.453
 obser-3.244
 accomp-3.122
enegger-3.090
Token the
Token ID262
Feature activation+2.28e-01

Pos logit contributions
izoph+0.797
attered+0.743
 MIS+0.738
BRE+0.703
ILA+0.692
Neg logit contributions
 appre-1.071
��-1.060
 obser-0.954
enegger-0.952
 roy-0.936
Token latest
Token ID3452
Feature activation+3.66e-01

Pos logit contributions
izoph+0.643
 MIS+0.600
attered+0.598
BRE+0.575
ILA+0.558
Neg logit contributions
 appre-0.922
��-0.909
 obser-0.821
enegger-0.815
 roy-0.800
Token breaking
Token ID7163
Feature activation+2.82e-02

Pos logit contributions
izoph+0.948
 MIS+0.905
attered+0.896
BRE+0.859
 HIP+0.831
Neg logit contributions
 appre-1.522
��-1.519
 obser-1.371
enegger-1.368
 roy-1.345
Token film
Token ID2646
Feature activation+1.38e-01

Pos logit contributions
izoph+0.108
 MIS+0.102
attered+0.102
BRE+0.099
 HIP+0.097
Neg logit contributions
 appre-0.086
��-0.084
 obser-0.073
enegger-0.072
 lest-0.071
Tokenand
Token ID392
Feature activation+3.81e-01

Pos logit contributions
izoph+1.222
attered+1.190
 MIS+1.134
speech+1.129
ILA+1.112
Neg logit contributions
 appre-1.082
��-1.065
enegger-0.973
 obser-0.940
 accomp-0.908
Token-
Token ID12
Feature activation-4.63e-02

Pos logit contributions
izoph+1.272
attered+1.187
 MIS+1.178
BRE+1.129
ILA+1.110
Neg logit contributions
 appre-1.356
��-1.331
enegger-1.192
 obser-1.183
 lest-1.164
Tokenf
Token ID69
Feature activation-1.02e-01

Pos logit contributions
 appre+0.133
��+0.126
 roy+0.114
 lest+0.113
 thereof+0.112
Neg logit contributions
izoph-0.187
 MIS-0.173
attered-0.172
senal-0.171
BRE-0.166
Tokeninance
Token ID14149
Feature activation+3.68e-01

Pos logit contributions
 appre+0.384
��+0.372
 roy+0.337
enegger+0.336
esides+0.335
Neg logit contributions
izoph-0.320
 MIS-0.296
senal-0.290
attered-0.287
BRE-0.279
Token,
Token ID11
Feature activation+1.84e-01

Pos logit contributions
izoph+1.343
attered+1.262
 MIS+1.258
BRE+1.206
ILA+1.188
Neg logit contributions
 appre-1.191
��-1.170
enegger-1.034
 obser-1.026
 lest-1.005
Token aust
Token ID38132
Feature activation+8.95e-01

Pos logit contributions
izoph+0.653
attered+0.612
 MIS+0.609
BRE+0.580
 HIP+0.574
Neg logit contributions
 appre-0.614
��-0.607
enegger-0.538
 obser-0.528
 lest-0.521
Tokenral
Token ID1373
Feature activation+2.70e-01

Pos logit contributions
izoph+2.569
attered+2.459
 MIS+2.287
ilan+2.148
shots+2.136
Neg logit contributions
 appre-3.601
��-3.565
 obser-3.292
enegger-3.222
 accomp-3.117
Tokenia
Token ID544
Feature activation+5.78e-01

Pos logit contributions
izoph+1.168
attered+1.115
 MIS+1.094
BRE+1.054
ILA+1.052
Neg logit contributions
 appre-0.694
��-0.682
 obser-0.578
enegger-0.573
 lest-0.558
Token
Token ID198
Feature activation+6.04e-01

Pos logit contributions
izoph+1.984
attered+1.873
 MIS+1.851
BRE+1.772
ILA+1.743
Neg logit contributions
 appre-1.966
��-1.925
enegger-1.723
 lest-1.690
 obser-1.687
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
izoph+1.989
 MIS+1.879
attered+1.879
BRE+1.813
 HIP+1.769
Neg logit contributions
 appre-2.179
��-2.159
 obser-1.947
enegger-1.896
 accomp-1.867
TokenFirst
Token ID5962
Feature activation+7.31e-02

Pos logit contributions
izoph+1.052
attered+0.962
 MIS+0.960
BRE+0.907
ILA+0.895
Neg logit contributions
 appre-1.428
��-1.401
enegger-1.271
 roy-1.251
 obser-1.250
Token priority
Token ID8475
Feature activation+5.26e-01

Pos logit contributions
izoph+0.975
 MIS+0.914
attered+0.903
BRE+0.861
 HIP+0.858
Neg logit contributions
��-0.983
 appre-0.973
enegger-0.886
 obser-0.840
 lest-0.824
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
izoph+1.634
 MIS+1.526
attered+1.507
BRE+1.449
 HIP+1.438
Neg logit contributions
��-1.959
 appre-1.943
enegger-1.803
 obser-1.714
esides-1.652
Token
Token ID198
Feature activation+3.82e-01

Pos logit contributions
izoph+0.755
 MIS+0.727
attered+0.714
BRE+0.685
 HIP+0.683
Neg logit contributions
 appre-0.763
��-0.746
 obser-0.661
enegger-0.655
 accomp-0.636
Token
Token ID198
Feature activation+3.70e-01

Pos logit contributions
izoph+1.302
attered+1.225
 MIS+1.223
BRE+1.178
ILA+1.154
Neg logit contributions
 appre-1.340
��-1.316
 obser-1.174
enegger-1.160
 accomp-1.127
TokenBut
Token ID1537
Feature activation+3.04e-01

Pos logit contributions
izoph+1.129
attered+1.062
 MIS+1.056
BRE+1.040
Eye+1.011
Neg logit contributions
 appre-1.425
��-1.389
enegger-1.256
 obser-1.253
 roy-1.220
Token….
Token ID11580
Feature activation+8.86e-01

Pos logit contributions
izoph+1.019
 MIS+0.960
attered+0.954
BRE+0.903
 HIP+0.898
Neg logit contributions
 appre-1.062
��-1.054
enegger-0.941
 obser-0.919
 roy-0.899
Token
Token ID198
Feature activation+3.97e-01

Pos logit contributions
izoph+2.862
 MIS+2.753
attered+2.732
BRE+2.584
 HIP+2.552
Neg logit contributions
 appre-3.122
��-3.117
enegger-2.752
 obser-2.699
 accomp-2.639
Token
Token ID198
Feature activation+3.51e-01

Pos logit contributions
izoph+1.342
attered+1.265
 MIS+1.263
BRE+1.217
ILA+1.192
Neg logit contributions
 appre-1.401
��-1.377
 obser-1.231
enegger-1.211
 accomp-1.182
Token"
Token ID1
Feature activation+1.58e-01

Pos logit contributions
izoph+1.073
attered+1.007
 MIS+0.997
BRE+0.978
ILA+0.951
Neg logit contributions
 appre-1.351
��-1.321
enegger-1.192
 obser-1.187
 roy-1.160
TokenAh
Token ID10910
Feature activation+2.62e-01

Pos logit contributions
izoph+0.453
attered+0.420
 MIS+0.419
BRE+0.404
ILA+0.394
Neg logit contributions
 appre-0.635
��-0.622
enegger-0.565
 obser-0.561
 roy-0.552
Token."
Token ID526
Feature activation+3.43e-01

Pos logit contributions
izoph+0.892
attered+0.843
 MIS+0.836
BRE+0.797
ILA+0.787
Neg logit contributions
 appre-0.909
��-0.893
enegger-0.795
 obser-0.790
 lest-0.774
Tokenarn
Token ID1501
Feature activation+1.88e-01

Pos logit contributions
 appre+1.087
��+1.070
enegger+0.972
 obser+0.961
 roy+0.954
Neg logit contributions
izoph-0.733
attered-0.671
 MIS-0.663
BRE-0.631
ILA-0.620
Tokenal
Token ID282
Feature activation+4.66e-02

Pos logit contributions
izoph+0.656
attered+0.616
 MIS+0.608
BRE+0.582
ILA+0.580
Neg logit contributions
 appre-0.643
��-0.624
enegger-0.561
 obser-0.553
 lest-0.546
Token Abuse
Token ID24221
Feature activation-2.52e-02

Pos logit contributions
izoph+0.155
attered+0.144
 MIS+0.144
BRE+0.137
 HIP+0.135
Neg logit contributions
 appre-0.168
��-0.163
enegger-0.148
 obser-0.145
 lest-0.143
Token by
Token ID416
Feature activation-2.76e-01

Pos logit contributions
 appre+0.089
��+0.086
enegger+0.078
 obser+0.077
 roy+0.076
Neg logit contributions
izoph-0.085
attered-0.079
 MIS-0.079
BRE-0.076
ILA-0.075
Token De
Token ID1024
Feature activation-1.87e-01

Pos logit contributions
 appre+1.200
��+1.181
enegger+1.080
 obser+1.067
 lest+1.062
Neg logit contributions
izoph-0.704
attered-0.637
 MIS-0.632
BRE-0.597
ILA-0.585
Tokence
Token ID344
Feature activation+8.79e-01

Pos logit contributions
 appre+0.729
��+0.714
enegger+0.648
 obser+0.638
 lest+0.633
Neg logit contributions
izoph-0.564
attered-0.521
 MIS-0.516
BRE-0.493
ILA-0.484
Tokenit
Token ID270
Feature activation+2.11e-01

Pos logit contributions
izoph+3.051
attered+2.876
 MIS+2.795
BRE+2.724
ILA+2.700
Neg logit contributions
 appre-3.020
��-2.934
enegger-2.670
 obser-2.602
 lest-2.528
Token,
Token ID11
Feature activation+4.78e-03

Pos logit contributions
izoph+0.722
attered+0.685
 MIS+0.683
BRE+0.658
ILA+0.645
Neg logit contributions
 appre-0.727
��-0.701
enegger-0.625
 obser-0.625
 roy-0.604
Token
Token ID447
Feature activation+7.52e-02

Pos logit contributions
izoph+0.017
attered+0.016
 MIS+0.016
BRE+0.015
ILA+0.015
Neg logit contributions
 appre-0.016
��-0.016
enegger-0.014
 obser-0.014
 roy-0.014
Token
Token ID251
Feature activation+6.03e-02

Pos logit contributions
izoph+0.263
attered+0.245
 MIS+0.244
BRE+0.235
ILA+0.231
Neg logit contributions
 appre-0.258
��-0.245
enegger-0.226
 obser-0.221
 lest-0.219
Token Joyce
Token ID25936
Feature activation+8.17e-02

Pos logit contributions
izoph+0.214
attered+0.200
 MIS+0.200
BRE+0.192
ILA+0.188
Neg logit contributions
 appre-0.203
��-0.198
enegger-0.176
 obser-0.174
 roy-0.171
Token.
Token ID13
Feature activation-2.93e-02

Pos logit contributions
izoph+0.969
 MIS+0.913
attered+0.908
BRE+0.871
 HIP+0.853
Neg logit contributions
 appre-0.973
��-0.960
enegger-0.856
 obser-0.840
esides-0.821
Token
Token ID198
Feature activation+3.45e-02

Pos logit contributions
 appre+0.103
��+0.101
enegger+0.090
 obser+0.089
 lest+0.087
Neg logit contributions
izoph-0.099
 MIS-0.093
attered-0.093
BRE-0.090
ILA-0.088
Token
Token ID198
Feature activation-3.08e-03

Pos logit contributions
izoph+0.123
attered+0.115
 MIS+0.114
BRE+0.110
ILA+0.109
Neg logit contributions
 appre-0.115
��-0.113
enegger-0.100
 obser-0.099
 lest-0.098
TokenDisclaimer
Token ID19618
Feature activation+5.55e-01

Pos logit contributions
 appre+0.012
��+0.012
enegger+0.011
 obser+0.011
 lest+0.011
Neg logit contributions
izoph-0.009
attered-0.008
 MIS-0.008
BRE-0.008
ILA-0.008
Token:
Token ID25
Feature activation-3.15e-01

Pos logit contributions
izoph+1.830
 MIS+1.722
attered+1.710
BRE+1.643
 HIP+1.614
Neg logit contributions
 appre-1.982
��-1.953
enegger-1.752
 obser-1.723
 roy-1.675
Token ©
Token ID10673
Feature activation+8.62e-01

Pos logit contributions
 appre+1.160
��+1.138
enegger+1.022
 obser+1.008
 lest+1.002
Neg logit contributions
izoph-1.015
attered-0.939
 MIS-0.933
BRE-0.893
ILA-0.880
Token 2015
Token ID1853
Feature activation+2.67e-01

Pos logit contributions
izoph+2.344
 MIS+2.263
attered+2.196
BRE+2.130
 HIP+2.104
Neg logit contributions
 appre-3.494
��-3.363
enegger-3.123
 obser-3.100
lain-2.985
Token Money
Token ID12911
Feature activation+1.70e-01

Pos logit contributions
izoph+0.786
 MIS+0.742
attered+0.734
BRE+0.696
 HIP+0.687
Neg logit contributions
 appre-1.047
��-1.025
enegger-0.941
 obser-0.925
 lest-0.905
Token Morning
Token ID14410
Feature activation+2.02e-01

Pos logit contributions
izoph+0.660
 MIS+0.621
attered+0.620
BRE+0.598
ILA+0.591
Neg logit contributions
 appre-0.514
��-0.501
enegger-0.440
 obser-0.434
 lest-0.428
Token and
Token ID290
Feature activation-3.78e-02

Pos logit contributions
izoph+0.675
 MIS+0.637
attered+0.635
BRE+0.611
ILA+0.601
Neg logit contributions
 appre-0.710
��-0.693
enegger-0.627
 obser-0.615
 lest-0.605
Token Money
Token ID12911
Feature activation-8.37e-02

Pos logit contributions
 appre+0.125
��+0.122
enegger+0.110
 obser+0.107
 lest+0.106
Neg logit contributions
izoph-0.135
 MIS-0.128
attered-0.127
BRE-0.121
 HIP-0.120
Token
Token ID447
Feature activation+1.22e-01

Pos logit contributions
izoph+0.661
attered+0.606
 MIS+0.604
BRE+0.572
ILA+0.557
Neg logit contributions
 appre-1.086
��-1.076
enegger-0.989
 obser-0.972
 lest-0.971
Token
Token ID247
Feature activation-8.44e-02

Pos logit contributions
izoph+0.399
 MIS+0.370
attered+0.368
BRE+0.359
ILA+0.350
Neg logit contributions
 appre-0.444
��-0.407
enegger-0.404
 obser-0.385
 lest-0.383
Tokenve
Token ID303
Feature activation-1.12e-01

Pos logit contributions
 appre+0.328
��+0.323
enegger+0.294
 obser+0.290
 lest+0.289
Neg logit contributions
izoph-0.251
attered-0.234
 MIS-0.231
BRE-0.221
ILA-0.217
Token watched
Token ID7342
Feature activation-3.95e-02

Pos logit contributions
 appre+0.394
��+0.386
enegger+0.347
 obser+0.342
 lest+0.341
Neg logit contributions
izoph-0.370
attered-0.348
 MIS-0.345
BRE-0.332
ILA-0.326
Token in
Token ID287
Feature activation+8.13e-02

Pos logit contributions
 appre+0.142
��+0.140
enegger+0.126
 obser+0.124
 lest+0.123
Neg logit contributions
izoph-0.128
attered-0.120
 MIS-0.120
BRE-0.115
 HIP-0.112
Token amaz
Token ID40642
Feature activation+8.57e-01

Pos logit contributions
izoph+0.280
attered+0.266
 MIS+0.264
BRE+0.254
 HIP+0.249
Neg logit contributions
��-0.275
 appre-0.273
enegger-0.242
 obser-0.239
 lest-0.235
Tokenement
Token ID972
Feature activation+2.25e-01

Pos logit contributions
izoph+2.771
attered+2.627
 MIS+2.602
BRE+2.511
Eye+2.463
Neg logit contributions
��-3.047
 appre-3.045
enegger-2.710
 obser-2.702
 roy-2.641
Token as
Token ID355
Feature activation+3.07e-01

Pos logit contributions
izoph+0.686
attered+0.643
 MIS+0.642
BRE+0.616
 HIP+0.597
Neg logit contributions
 appre-0.854
��-0.842
enegger-0.761
 obser-0.749
 roy-0.737
Token this
Token ID428
Feature activation-6.42e-02

Pos logit contributions
izoph+1.022
 MIS+0.966
attered+0.961
BRE+0.922
 HIP+0.911
Neg logit contributions
 appre-1.061
��-1.057
enegger-0.952
 obser-0.927
 lest-0.921
Token incredible
Token ID8082
Feature activation+9.45e-02

Pos logit contributions
 appre+0.216
��+0.214
enegger+0.192
 obser+0.186
 lest+0.186
Neg logit contributions
izoph-0.225
attered-0.210
 MIS-0.210
BRE-0.201
 HIP-0.198
Token community
Token ID2055
Feature activation-2.67e-01

Pos logit contributions
izoph+0.339
 MIS+0.320
attered+0.319
BRE+0.305
 HIP+0.304
Neg logit contributions
 appre-0.307
��-0.305
enegger-0.272
 obser-0.264
 lest-0.263
Token Read
Token ID4149
Feature activation+2.60e-01

Pos logit contributions
izoph+0.688
attered+0.645
 MIS+0.644
BRE+0.619
ILA+0.610
Neg logit contributions
 appre-0.604
��-0.589
enegger-0.523
 obser-0.515
 lest-0.506
Token more
Token ID517
Feature activation+1.62e-01

Pos logit contributions
izoph+0.815
attered+0.769
 MIS+0.759
BRE+0.730
ILA+0.711
Neg logit contributions
 appre-0.976
��-0.958
enegger-0.863
 obser-0.857
 lest-0.834
Token
Token ID198
Feature activation+6.38e-02

Pos logit contributions
izoph+0.563
attered+0.528
 MIS+0.526
BRE+0.507
ILA+0.498
Neg logit contributions
 appre-0.557
��-0.547
enegger-0.484
 obser-0.483
 roy-0.469
Token
Token ID198
Feature activation-6.98e-02

Pos logit contributions
izoph+0.225
attered+0.210
 MIS+0.210
BRE+0.202
ILA+0.198
Neg logit contributions
 appre-0.216
��-0.211
enegger-0.187
 obser-0.186
 lest-0.182
TokenThe
Token ID464
Feature activation-8.58e-02

Pos logit contributions
 appre+0.273
��+0.267
enegger+0.242
 obser+0.241
 lest+0.236
Neg logit contributions
izoph-0.210
attered-0.195
 MIS-0.194
BRE-0.188
ILA-0.184
Token peer
Token ID12720
Feature activation+8.49e-01

Pos logit contributions
 appre+0.291
��+0.288
enegger+0.256
 obser+0.252
 lest+0.249
Neg logit contributions
izoph-0.298
 MIS-0.280
attered-0.279
BRE-0.267
 HIP-0.264
Token chairs
Token ID18791
Feature activation-2.46e-01

Pos logit contributions
izoph+2.616
 MIS+2.576
attered+2.565
 HIP+2.450
ILA+2.406
Neg logit contributions
 appre-2.971
��-2.946
enegger-2.652
 obser-2.577
esides-2.515
Token a
Token ID257
Feature activation+1.21e-01

Pos logit contributions
 appre+0.926
��+0.909
enegger+0.818
 obser+0.808
 lest+0.803
Neg logit contributions
izoph-0.775
attered-0.716
 MIS-0.712
BRE-0.680
ILA-0.670
Token Lords
Token ID18651
Feature activation-1.66e-01

Pos logit contributions
izoph+0.406
 MIS+0.381
attered+0.380
BRE+0.362
 HIP+0.358
Neg logit contributions
 appre-0.426
��-0.420
enegger-0.376
 obser-0.369
 lest-0.364
Token EU
Token ID4576
Feature activation+2.01e-01

Pos logit contributions
 appre+0.572
��+0.561
enegger+0.500
 obser+0.492
 lest+0.488
Neg logit contributions
izoph-0.571
attered-0.532
 MIS-0.530
BRE-0.509
ILA-0.502
Token subcommittee
Token ID47014
Feature activation+3.82e-01

Pos logit contributions
izoph+0.680
 MIS+0.630
attered+0.629
BRE+0.602
ILA+0.597
Neg logit contributions
 appre-0.705
��-0.694
enegger-0.625
 obser-0.611
 lest-0.602
TokenJ
Token ID41
Feature activation-4.12e-01

Pos logit contributions
 appre+0.857
��+0.842
enegger+0.759
 obser+0.750
 lest+0.745
Neg logit contributions
izoph-0.684
attered-0.631
 MIS-0.625
BRE-0.600
ILA-0.588
Tokenaaaa
Token ID24794
Feature activation-2.68e-01

Pos logit contributions
 appre+1.560
��+1.530
enegger+1.379
 obser+1.361
 lest+1.353
Neg logit contributions
izoph-1.284
attered-1.183
 MIS-1.177
BRE-1.123
ILA-1.106
Tokenasper
Token ID32981
Feature activation-5.32e-01

Pos logit contributions
 appre+1.020
��+1.004
enegger+0.903
 obser+0.895
 lest+0.890
Neg logit contributions
izoph-0.831
attered-0.776
 MIS-0.764
BRE-0.734
ILA-0.724
Token Profile
Token ID13118
Feature activation-9.97e-01

Pos logit contributions
 appre+1.460
��+1.423
enegger+1.227
 obser+1.204
 lest+1.194
Neg logit contributions
izoph-2.215
attered-2.087
 MIS-2.078
BRE-2.010
ILA-1.988
Token Blog
Token ID14001
Feature activation-4.08e-01

Pos logit contributions
 appre+2.995
��+2.909
enegger+2.517
 lest+2.496
 obser+2.493
Neg logit contributions
izoph-3.866
attered-3.629
 MIS-3.579
BRE-3.464
ILA-3.443
Token Joined
Token ID16709
Feature activation-1.34e+00

Pos logit contributions
 appre+1.247
��+1.209
enegger+1.058
 lest+1.044
 roy+1.043
Neg logit contributions
izoph-1.569
attered-1.464
 MIS-1.456
BRE-1.407
ILA-1.394
Token April
Token ID3035
Feature activation-4.76e-01

Pos logit contributions
 appre+3.412
��+3.213
 lest+2.840
 roy+2.801
 thereof+2.795
Neg logit contributions
izoph-5.761
attered-5.402
senal-5.311
 MIS-5.186
ILA-5.179
Token 2012
Token ID2321
Feature activation-4.79e-01

Pos logit contributions
 appre+1.812
��+1.770
 lest+1.608
enegger+1.605
 roy+1.603
Neg logit contributions
izoph-1.466
attered-1.333
 MIS-1.288
BRE-1.262
senal-1.262
Token United
Token ID1578
Feature activation-3.66e-01

Pos logit contributions
 appre+1.761
��+1.718
enegger+1.564
 roy+1.544
 lest+1.542
Neg logit contributions
izoph-1.540
attered-1.415
 MIS-1.372
BRE-1.333
senal-1.324
Token States
Token ID1829
Feature activation-2.26e-01

Pos logit contributions
 appre+0.861
��+0.836
enegger+0.704
 obser+0.687
 roy+0.685
Neg logit contributions
izoph-1.662
attered-1.574
 MIS-1.563
BRE-1.521
ILA-1.507
Token 5
Token ID642
Feature activation+1.05e-01

Pos logit contributions
 appre+0.890
��+0.872
enegger+0.791
 roy+0.779
 lest+0.779
Neg logit contributions
izoph-0.675
attered-0.618
 MIS-0.611
BRE-0.584
ILA-0.575
TokenTB
Token ID22737
Feature activation-5.05e-01

Pos logit contributions
 appre+1.578
��+1.550
enegger+1.400
 obser+1.382
 lest+1.375
Neg logit contributions
izoph-1.223
attered-1.125
 MIS-1.118
BRE-1.065
ILA-1.049
Tokenone
Token ID505
Feature activation+3.08e-02

Pos logit contributions
 appre+1.882
��+1.849
enegger+1.662
 obser+1.641
 lest+1.630
Neg logit contributions
izoph-1.607
attered-1.486
 MIS-1.477
BRE-1.412
ILA-1.392
Token-
Token ID12
Feature activation-1.43e-01

Pos logit contributions
izoph+0.109
attered+0.103
 MIS+0.102
BRE+0.099
ILA+0.098
Neg logit contributions
 appre-0.103
��-0.101
enegger-0.090
 obser-0.089
 lest-0.088
Token Profile
Token ID13118
Feature activation-6.44e-01

Pos logit contributions
 appre+0.519
��+0.509
enegger+0.453
 obser+0.452
 lest+0.447
Neg logit contributions
izoph-0.470
attered-0.437
 MIS-0.434
BRE-0.420
ILA-0.414
Token Blog
Token ID14001
Feature activation-3.36e-01

Pos logit contributions
 appre+1.996
��+1.947
enegger+1.702
 obser+1.682
 lest+1.676
Neg logit contributions
izoph-2.444
attered-2.289
 MIS-2.271
BRE-2.191
ILA-2.170
Token Joined
Token ID16709
Feature activation-1.27e+00

Pos logit contributions
 appre+1.028
��+0.997
enegger+0.874
 lest+0.860
 roy+0.860
Neg logit contributions
izoph-1.294
attered-1.208
 MIS-1.201
BRE-1.161
ILA-1.150
Token November
Token ID3389
Feature activation-4.04e-01

Pos logit contributions
 appre+3.253
��+3.066
 lest+2.711
 roy+2.678
 thereof+2.669
Neg logit contributions
izoph-5.438
attered-5.092
senal-4.989
 MIS-4.905
ILA-4.878
Token 2010
Token ID3050
Feature activation-3.02e-01

Pos logit contributions
 appre+1.556
��+1.518
 lest+1.391
enegger+1.386
 thereof+1.381
Neg logit contributions
izoph-1.217
attered-1.111
 MIS-1.061
senal-1.055
BRE-1.041
Token United
Token ID1578
Feature activation-3.74e-01

Pos logit contributions
 appre+1.139
��+1.116
enegger+1.011
 roy+0.994
 lest+0.993
Neg logit contributions
izoph-0.948
attered-0.872
 MIS-0.858
BRE-0.825
ILA-0.812
Token States
Token ID1829
Feature activation-1.49e-01

Pos logit contributions
 appre+0.888
��+0.864
enegger+0.728
 obser+0.711
 roy+0.709
Neg logit contributions
izoph-1.689
attered-1.600
 MIS-1.588
BRE-1.546
ILA-1.532
Token 21
Token ID2310
Feature activation-1.15e-01

Pos logit contributions
 appre+0.586
��+0.576
enegger+0.521
 obser+0.514
 lest+0.512
Neg logit contributions
izoph-0.440
attered-0.404
 MIS-0.401
BRE-0.383
ILA-0.376
Token
Token ID198
Feature activation-2.73e-01

Pos logit contributions
 appre+0.434
��+0.424
enegger+0.377
 roy+0.371
 lest+0.371
Neg logit contributions
izoph-0.501
attered-0.465
 MIS-0.462
senal-0.447
BRE-0.444
TokenDark
Token ID17367
Feature activation-1.69e-01

Pos logit contributions
 appre+1.052
��+1.033
enegger+0.932
 obser+0.921
 lest+0.915
Neg logit contributions
izoph-0.835
attered-0.769
 MIS-0.763
BRE-0.731
ILA-0.718
Tokenhorse
Token ID30527
Feature activation-1.05e-02

Pos logit contributions
 appre+0.654
��+0.643
enegger+0.580
 obser+0.573
 lest+0.569
Neg logit contributions
izoph-0.515
attered-0.475
 MIS-0.472
BRE-0.451
ILA-0.442
Token Profile
Token ID13118
Feature activation-8.37e-01

Pos logit contributions
 appre+0.035
��+0.034
enegger+0.030
 obser+0.030
 lest+0.030
Neg logit contributions
izoph-0.038
attered-0.035
 MIS-0.035
BRE-0.034
ILA-0.033
Token Blog
Token ID14001
Feature activation-4.61e-01

Pos logit contributions
 appre+2.554
��+2.490
enegger+2.165
 obser+2.143
 lest+2.137
Neg logit contributions
izoph-3.212
attered-3.011
 MIS-2.978
BRE-2.876
ILA-2.855
Token Joined
Token ID16709
Feature activation-1.26e+00

Pos logit contributions
 appre+1.392
��+1.350
enegger+1.180
 lest+1.165
 roy+1.165
Neg logit contributions
izoph-1.788
attered-1.666
 MIS-1.654
BRE-1.602
senal-1.599
Token December
Token ID3426
Feature activation-2.83e-01

Pos logit contributions
 appre+3.230
��+3.050
 lest+2.688
 roy+2.657
 thereof+2.646
Neg logit contributions
izoph-5.428
attered-5.087
senal-4.986
 MIS-4.900
ILA-4.875
Token 2011
Token ID2813
Feature activation-3.19e-01

Pos logit contributions
 appre+1.094
��+1.067
enegger+0.976
 lest+0.970
 roy+0.967
Neg logit contributions
izoph-0.855
attered-0.779
 MIS-0.755
senal-0.734
BRE-0.733
Token United
Token ID1578
Feature activation-3.30e-01

Pos logit contributions
 appre+1.178
��+1.152
enegger+1.044
 lest+1.027
 roy+1.026
Neg logit contributions
izoph-1.025
attered-0.944
 MIS-0.925
BRE-0.893
ILA-0.879
Token States
Token ID1829
Feature activation-1.12e-01

Pos logit contributions
 appre+0.792
��+0.770
enegger+0.650
 obser+0.635
 roy+0.633
Neg logit contributions
izoph-1.483
attered-1.404
 MIS-1.394
BRE-1.356
ILA-1.344
Token 228
Token ID29041
Feature activation+6.76e-02

Pos logit contributions
 appre+0.442
��+0.434
enegger+0.393
 obser+0.388
 lest+0.386
Neg logit contributions
izoph-0.331
attered-0.304
 MIS-0.301
BRE-0.287
ILA-0.283
TokenK
Token ID42
Feature activation-4.84e-01

Pos logit contributions
 appre+0.580
��+0.568
enegger+0.508
 obser+0.503
 lest+0.498
Neg logit contributions
izoph-0.523
attered-0.486
 MIS-0.485
BRE-0.466
ILA-0.460
Tokenier
Token ID959
Feature activation-1.53e-01

Pos logit contributions
 appre+1.691
��+1.658
enegger+1.479
 obser+1.459
 roy+1.449
Neg logit contributions
izoph-1.655
attered-1.539
 MIS-1.528
BRE-1.467
ILA-1.448
Tokenan
Token ID272
Feature activation-1.77e-01

Pos logit contributions
 appre+0.515
��+0.504
enegger+0.447
 obser+0.442
 lest+0.438
Neg logit contributions
izoph-0.540
attered-0.504
 MIS-0.499
BRE-0.481
ILA-0.475
Token Profile
Token ID13118
Feature activation-7.91e-01

Pos logit contributions
 appre+0.486
��+0.473
enegger+0.408
 obser+0.404
 lest+0.399
Neg logit contributions
izoph-0.734
attered-0.693
 MIS-0.691
BRE-0.670
ILA-0.663
Token Blog
Token ID14001
Feature activation-3.62e-01

Pos logit contributions
 appre+2.449
��+2.388
enegger+2.086
 obser+2.063
 lest+2.053
Neg logit contributions
izoph-3.007
attered-2.817
 MIS-2.791
BRE-2.695
ILA-2.672
Token Joined
Token ID16709
Feature activation-1.23e+00

Pos logit contributions
 appre+1.122
��+1.093
enegger+0.960
 obser+0.942
 lest+0.941
Neg logit contributions
izoph-1.378
attered-1.288
 MIS-1.280
BRE-1.236
ILA-1.222
Token August
Token ID2932
Feature activation-4.10e-01

Pos logit contributions
 appre+3.182
��+3.007
 lest+2.658
 roy+2.628
 thereof+2.609
Neg logit contributions
izoph-5.263
attered-4.926
senal-4.804
 MIS-4.752
ILA-4.721
Token 2011
Token ID2813
Feature activation-3.15e-01

Pos logit contributions
 appre+1.577
��+1.535
enegger+1.402
 lest+1.396
 roy+1.394
Neg logit contributions
izoph-1.245
attered-1.140
 MIS-1.102
BRE-1.076
senal-1.074
Token United
Token ID1578
Feature activation-2.17e-01

Pos logit contributions
 appre+1.147
��+1.121
enegger+1.012
 lest+0.996
 roy+0.995
Neg logit contributions
izoph-1.028
attered-0.949
 MIS-0.933
BRE-0.900
ILA-0.886
Token States
Token ID1829
Feature activation-5.70e-02

Pos logit contributions
 appre+0.532
��+0.517
enegger+0.437
 obser+0.427
 roy+0.424
Neg logit contributions
izoph-0.968
attered-0.915
 MIS-0.911
BRE-0.884
ILA-0.875
Token 6
Token ID718
Feature activation+8.23e-02

Pos logit contributions
 appre+0.226
��+0.222
enegger+0.201
 obser+0.198
 lest+0.197
Neg logit contributions
izoph-0.168
attered-0.154
 MIS-0.154
BRE-0.146
ILA-0.144
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
 appre+0.719
��+0.698
enegger+0.623
 roy+0.618
 lest+0.617
Neg logit contributions
izoph-0.874
attered-0.809
 MIS-0.801
senal-0.796
BRE-0.769
TokenBig
Token ID12804
Feature activation-1.81e-01

Pos logit contributions
 appre+1.322
��+1.300
enegger+1.174
 obser+1.159
 lest+1.151
Neg logit contributions
izoph-1.049
attered-0.967
 MIS-0.960
BRE-0.921
ILA-0.903
TokenFan
Token ID22480
Feature activation+2.75e-02

Pos logit contributions
 appre+0.626
��+0.618
enegger+0.548
 obser+0.541
 lest+0.537
Neg logit contributions
izoph-0.623
attered-0.582
 MIS-0.580
BRE-0.557
ILA-0.544
Token Profile
Token ID13118
Feature activation-6.28e-01

Pos logit contributions
izoph+0.103
attered+0.097
 MIS+0.097
BRE+0.093
ILA+0.092
Neg logit contributions
 appre-0.087
��-0.085
enegger-0.075
 obser-0.074
 roy-0.073
Token Blog
Token ID14001
Feature activation-2.71e-01

Pos logit contributions
 appre+1.967
��+1.919
enegger+1.683
 obser+1.660
 lest+1.652
Neg logit contributions
izoph-2.372
attered-2.221
 MIS-2.204
BRE-2.126
ILA-2.103
Token Joined
Token ID16709
Feature activation-1.23e+00

Pos logit contributions
 appre+0.843
��+0.821
enegger+0.722
 obser+0.708
 lest+0.707
Neg logit contributions
izoph-1.031
attered-0.964
 MIS-0.958
BRE-0.924
ILA-0.914
Token December
Token ID3426
Feature activation-2.90e-01

Pos logit contributions
 appre+3.157
��+2.983
 lest+2.632
 roy+2.598
 thereof+2.588
Neg logit contributions
izoph-5.256
attered-4.927
senal-4.822
 MIS-4.748
ILA-4.708
Token 2010
Token ID3050
Feature activation-1.50e-01

Pos logit contributions
 appre+1.119
��+1.092
enegger+0.995
 lest+0.988
 roy+0.986
Neg logit contributions
izoph-0.879
attered-0.804
 MIS-0.782
BRE-0.756
senal-0.754
Token TL
Token ID24811
Feature activation-3.78e-01

Pos logit contributions
 appre+0.568
��+0.558
enegger+0.503
 obser+0.496
 lest+0.493
Neg logit contributions
izoph-0.467
attered-0.431
 MIS-0.428
BRE-0.409
ILA-0.403
TokenAD
Token ID2885
Feature activation+4.21e-03

Pos logit contributions
 appre+1.417
��+1.395
enegger+1.247
 roy+1.245
 lest+1.242
Neg logit contributions
izoph-1.180
attered-1.074
 MIS-1.072
BRE-1.023
senal-1.017
TokenT
Token ID51
Feature activation+5.74e-02

Pos logit contributions
izoph+0.014
attered+0.013
 MIS+0.013
BRE+0.012
 HIP+0.012
Neg logit contributions
 appre-0.015
��-0.015
enegger-0.013
 obser-0.013
 lest-0.013
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
izoph+0.147
attered+0.137
 MIS+0.136
BRE+0.131
ILA+0.129
Neg logit contributions
 appre-0.138
��-0.135
enegger-0.119
 obser-0.118
 lest-0.117
Tokenre
Token ID260
Feature activation-2.67e-01

Pos logit contributions
izoph+0.035
attered+0.032
 MIS+0.032
BRE+0.031
ILA+0.030
Neg logit contributions
 appre-0.044
��-0.043
enegger-0.039
 obser-0.039
 lest-0.038
Tokeni
Token ID72
Feature activation-4.04e-01

Pos logit contributions
 appre+0.974
��+0.956
enegger+0.856
 obser+0.846
 lest+0.839
Neg logit contributions
izoph-0.873
attered-0.809
 MIS-0.800
BRE-0.772
ILA-0.757
Token Profile
Token ID13118
Feature activation-6.93e-01

Pos logit contributions
 appre+1.491
��+1.465
enegger+1.311
 obser+1.298
 lest+1.289
Neg logit contributions
izoph-1.299
attered-1.204
 MIS-1.196
BRE-1.147
ILA-1.128
Token Blog
Token ID14001
Feature activation-3.39e-01

Pos logit contributions
 appre+2.147
��+2.092
enegger+1.831
 obser+1.809
 lest+1.801
Neg logit contributions
izoph-2.641
attered-2.474
 MIS-2.452
BRE-2.365
ILA-2.341
Token Joined
Token ID16709
Feature activation-1.22e+00

Pos logit contributions
 appre+1.030
��+0.996
enegger+0.869
 lest+0.860
 roy+0.857
Neg logit contributions
izoph-1.311
attered-1.224
 MIS-1.220
BRE-1.176
senal-1.168
Token October
Token ID3267
Feature activation-3.73e-01

Pos logit contributions
 appre+3.165
��+2.995
 lest+2.627
 thereof+2.584
 roy+2.572
Neg logit contributions
izoph-5.213
attered-4.888
senal-4.772
 MIS-4.725
ILA-4.668
Token 2002
Token ID6244
Feature activation-1.86e-01

Pos logit contributions
 appre+1.442
��+1.406
 lest+1.276
enegger+1.273
 thereof+1.271
Neg logit contributions
izoph-1.132
attered-1.026
 MIS-0.981
senal-0.973
BRE-0.965
Token United
Token ID1578
Feature activation-2.61e-01

Pos logit contributions
 appre+0.686
��+0.671
enegger+0.608
 roy+0.598
 lest+0.598
Neg logit contributions
izoph-0.599
attered-0.551
 MIS-0.540
BRE-0.522
ILA-0.513
Token States
Token ID1829
Feature activation-9.54e-02

Pos logit contributions
 appre+0.621
��+0.603
enegger+0.507
 obser+0.495
 roy+0.491
Neg logit contributions
izoph-1.179
attered-1.116
 MIS-1.110
BRE-1.078
ILA-1.067
Token 3
Token ID513
Feature activation+1.70e-01

Pos logit contributions
 appre+0.379
��+0.372
enegger+0.337
 obser+0.333
 lest+0.331
Neg logit contributions
izoph-0.280
attered-0.257
 MIS-0.255
BRE-0.243
ILA-0.239
Token 2012
Token ID2321
Feature activation-3.98e-01

Pos logit contributions
 appre+2.481
��+2.425
enegger+2.203
 roy+2.197
 lest+2.191
Neg logit contributions
izoph-2.097
attered-1.927
 MIS-1.843
senal-1.818
BRE-1.809
Token United
Token ID1578
Feature activation-3.36e-01

Pos logit contributions
 appre+1.423
��+1.389
enegger+1.262
 roy+1.241
 lest+1.239
Neg logit contributions
izoph-1.324
attered-1.219
 MIS-1.183
BRE-1.152
senal-1.142
Token States
Token ID1829
Feature activation-2.05e-01

Pos logit contributions
 appre+0.801
��+0.776
enegger+0.654
 obser+0.639
 roy+0.635
Neg logit contributions
izoph-1.520
attered-1.439
 MIS-1.429
BRE-1.389
ILA-1.376
Token 3
Token ID513
Feature activation+8.90e-03

Pos logit contributions
 appre+0.805
��+0.790
enegger+0.716
 roy+0.705
 lest+0.705
Neg logit contributions
izoph-0.613
attered-0.562
 MIS-0.555
BRE-0.531
ILA-0.523
Token240
Token ID16102
Feature activation+2.02e-01

Pos logit contributions
izoph+0.035
attered+0.033
 MIS+0.033
BRE+0.032
ILA+0.031
Neg logit contributions
 appre-0.026
��-0.026
enegger-0.022
 obser-0.022
 lest-0.022
Token Posts
Token ID12043
Feature activation-1.22e+00

Pos logit contributions
izoph+0.937
 MIS+0.894
attered+0.892
BRE+0.865
 HIP+0.857
Neg logit contributions
 appre-0.456
��-0.443
enegger-0.373
 obser-0.363
 lest-0.353
Token #
Token ID1303
Feature activation-4.02e-01

Pos logit contributions
 appre+3.647
��+3.504
 lest+3.049
enegger+3.013
 roy+3.013
Neg logit contributions
izoph-4.726
attered-4.444
 MIS-4.355
BRE-4.211
ILA-4.207
Token10
Token ID940
Feature activation-2.87e-01

Pos logit contributions
 appre+1.284
��+1.257
enegger+1.114
 lest+1.093
 roy+1.090
Neg logit contributions
izoph-1.491
attered-1.387
 MIS-1.377
BRE-1.321
ILA-1.308
Token Happy
Token ID14628
Feature activation-5.44e-01

Pos logit contributions
 appre+0.900
��+0.887
enegger+0.779
 obser+0.764
 lest+0.756
Neg logit contributions
izoph-1.080
attered-1.012
 MIS-1.012
BRE-0.973
ILA-0.958
Token it
Token ID340
Feature activation-1.05e-01

Pos logit contributions
 appre+1.821
��+1.777
enegger+1.570
 roy+1.564
 obser+1.562
Neg logit contributions
izoph-1.928
attered-1.796
 MIS-1.768
BRE-1.699
ILA-1.690
Token worked
Token ID3111
Feature activation-5.55e-02

Pos logit contributions
 appre+0.390
��+0.390
enegger+0.350
 obser+0.346
 lest+0.340
Neg logit contributions
izoph-0.325
attered-0.305
 MIS-0.301
BRE-0.290
ILA-0.285
Token 2010
Token ID3050
Feature activation-4.16e-01

Pos logit contributions
 appre+2.034
��+1.978
enegger+1.804
 roy+1.794
 lest+1.789
Neg logit contributions
izoph-1.716
attered-1.562
 MIS-1.508
senal-1.484
BRE-1.479
Token United
Token ID1578
Feature activation-4.78e-01

Pos logit contributions
 appre+1.515
��+1.479
enegger+1.340
 roy+1.318
 lest+1.316
Neg logit contributions
izoph-1.355
attered-1.250
 MIS-1.223
BRE-1.184
senal-1.164
Token States
Token ID1829
Feature activation-3.05e-01

Pos logit contributions
 appre+1.121
��+1.090
enegger+0.920
 roy+0.898
 obser+0.896
Neg logit contributions
izoph-2.165
attered-2.051
 MIS-2.031
BRE-1.982
ILA-1.965
Token 10
Token ID838
Feature activation-8.92e-02

Pos logit contributions
 appre+1.192
��+1.167
enegger+1.061
 roy+1.047
 lest+1.045
Neg logit contributions
izoph-0.911
attered-0.834
 MIS-0.821
BRE-0.787
ILA-0.775
Token53
Token ID4310
Feature activation+3.39e-02

Pos logit contributions
 appre+0.294
��+0.288
enegger+0.254
 obser+0.252
 lest+0.248
Neg logit contributions
izoph-0.322
attered-0.301
 MIS-0.301
BRE-0.289
ILA-0.285
Token Posts
Token ID12043
Feature activation-1.21e+00

Pos logit contributions
izoph+0.160
 MIS+0.152
attered+0.152
BRE+0.147
ILA+0.146
Neg logit contributions
 appre-0.074
��-0.072
enegger-0.059
 obser-0.059
 lest-0.057
Token #
Token ID1303
Feature activation-5.40e-01

Pos logit contributions
 appre+3.553
��+3.433
 lest+2.994
 roy+2.961
enegger+2.959
Neg logit contributions
izoph-4.699
attered-4.444
 MIS-4.349
BRE-4.203
ILA-4.203
Token15
Token ID1314
Feature activation-4.36e-01

Pos logit contributions
 appre+1.479
��+1.443
enegger+1.247
 lest+1.238
 roy+1.230
Neg logit contributions
izoph-2.241
 MIS-2.083
attered-2.082
senal-2.031
BRE-1.986
Token honestly
Token ID12698
Feature activation-3.91e-01

Pos logit contributions
 appre+1.319
��+1.295
enegger+1.134
 obser+1.112
 lest+1.101
Neg logit contributions
izoph-1.690
attered-1.584
 MIS-1.580
BRE-1.523
ILA-1.502
Token if
Token ID611
Feature activation-2.40e-01

Pos logit contributions
 appre+1.256
��+1.233
enegger+1.088
 obser+1.069
 lest+1.060
Neg logit contributions
izoph-1.442
attered-1.347
 MIS-1.343
BRE-1.291
ILA-1.274
Token we
Token ID356
Feature activation-1.55e-01

Pos logit contributions
 appre+0.809
��+0.798
enegger+0.711
 obser+0.696
 lest+0.688
Neg logit contributions
izoph-0.849
attered-0.789
 MIS-0.789
BRE-0.756
ILA-0.744
Tokenada
Token ID4763
Feature activation-3.77e-01

Pos logit contributions
 appre+1.780
��+1.747
enegger+1.565
 obser+1.543
 roy+1.533
Neg logit contributions
izoph-1.617
attered-1.498
 MIS-1.489
BRE-1.427
ILA-1.406
Tokenver
Token ID332
Feature activation-1.63e-01

Pos logit contributions
 appre+1.354
��+1.326
enegger+1.188
 obser+1.172
 lest+1.165
Neg logit contributions
izoph-1.253
attered-1.161
 MIS-1.154
BRE-1.106
ILA-1.089
TokenBB
Token ID15199
Feature activation-1.18e-01

Pos logit contributions
 appre+0.511
��+0.502
enegger+0.441
 obser+0.435
 lest+0.430
Neg logit contributions
izoph-0.610
attered-0.572
 MIS-0.569
BRE-0.550
ILA-0.543
Token Profile
Token ID13118
Feature activation-5.26e-01

Pos logit contributions
 appre+0.372
��+0.362
enegger+0.321
 obser+0.319
 roy+0.315
Neg logit contributions
izoph-0.438
attered-0.414
 MIS-0.411
BRE-0.396
ILA-0.392
Token Blog
Token ID14001
Feature activation-2.11e-01

Pos logit contributions
 appre+1.662
��+1.624
enegger+1.430
 obser+1.408
 lest+1.399
Neg logit contributions
izoph-1.969
attered-1.841
 MIS-1.831
BRE-1.764
ILA-1.743
Token Joined
Token ID16709
Feature activation-1.15e+00

Pos logit contributions
 appre+0.658
��+0.642
enegger+0.564
 obser+0.554
 lest+0.553
Neg logit contributions
izoph-0.803
attered-0.750
 MIS-0.746
BRE-0.720
ILA-0.712
Token June
Token ID2795
Feature activation-3.61e-01

Pos logit contributions
 appre+3.020
��+2.854
 lest+2.519
 roy+2.494
 thereof+2.474
Neg logit contributions
izoph-4.910
attered-4.594
senal-4.473
 MIS-4.432
ILA-4.393
Token 2009
Token ID3717
Feature activation-3.42e-01

Pos logit contributions
 appre+1.391
��+1.360
enegger+1.234
 lest+1.229
 roy+1.225
Neg logit contributions
izoph-1.101
attered-1.007
 MIS-0.981
BRE-0.949
senal-0.941
Token Germany
Token ID4486
Feature activation+1.41e-01

Pos logit contributions
 appre+1.300
��+1.272
enegger+1.154
 lest+1.139
 roy+1.137
Neg logit contributions
izoph-1.062
attered-0.974
 MIS-0.952
BRE-0.919
ILA-0.905
Token 247
Token ID30179
Feature activation+3.12e-01

Pos logit contributions
izoph+0.410
 MIS+0.378
attered+0.377
BRE+0.358
ILA+0.352
Neg logit contributions
 appre-0.563
��-0.554
enegger-0.500
 obser-0.496
 lest-0.490
Token34
Token ID2682
Feature activation+4.21e-01

Pos logit contributions
izoph+1.231
 MIS+1.205
attered+1.174
BRE+1.146
 HIP+1.142
Neg logit contributions
 appre-0.904
��-0.886
 obser-0.767
enegger-0.756
 roy-0.732
Token 2012
Token ID2321
Feature activation-4.79e-01

Pos logit contributions
 appre+1.812
��+1.770
 lest+1.608
enegger+1.605
 roy+1.603
Neg logit contributions
izoph-1.466
attered-1.333
 MIS-1.288
BRE-1.262
senal-1.262
Token United
Token ID1578
Feature activation-3.66e-01

Pos logit contributions
 appre+1.761
��+1.718
enegger+1.564
 roy+1.544
 lest+1.542
Neg logit contributions
izoph-1.540
attered-1.415
 MIS-1.372
BRE-1.333
senal-1.324
Token States
Token ID1829
Feature activation-2.26e-01

Pos logit contributions
 appre+0.861
��+0.836
enegger+0.704
 obser+0.687
 roy+0.685
Neg logit contributions
izoph-1.662
attered-1.574
 MIS-1.563
BRE-1.521
ILA-1.507
Token 5
Token ID642
Feature activation+1.05e-01

Pos logit contributions
 appre+0.890
��+0.872
enegger+0.791
 roy+0.779
 lest+0.779
Neg logit contributions
izoph-0.675
attered-0.618
 MIS-0.611
BRE-0.584
ILA-0.575
Token667
Token ID28933
Feature activation+1.90e-01

Pos logit contributions
izoph+0.347
 MIS+0.327
attered+0.326
BRE+0.312
ILA+0.307
Neg logit contributions
 appre-0.379
��-0.372
 obser-0.330
enegger-0.330
 roy-0.323
Token Posts
Token ID12043
Feature activation-1.14e+00

Pos logit contributions
izoph+0.874
 MIS+0.839
attered+0.833
BRE+0.810
 HIP+0.803
Neg logit contributions
 appre-0.437
��-0.428
 obser-0.354
enegger-0.350
 lest-0.336
Token #
Token ID1303
Feature activation-3.81e-01

Pos logit contributions
 appre+3.390
��+3.268
 lest+2.851
 roy+2.817
enegger+2.817
Neg logit contributions
izoph-4.407
attered-4.164
 MIS-4.082
BRE-3.946
ILA-3.942
Token12
Token ID1065
Feature activation-3.86e-01

Pos logit contributions
 appre+0.971
��+0.941
enegger+0.805
 roy+0.804
 lest+0.803
Neg logit contributions
izoph-1.658
attered-1.541
 MIS-1.538
senal-1.510
BRE-1.465
Token I
Token ID314
Feature activation-5.77e-01

Pos logit contributions
 appre+1.188
��+1.169
enegger+1.026
 obser+1.006
 lest+0.995
Neg logit contributions
izoph-1.473
attered-1.380
 MIS-1.379
BRE-1.326
ILA-1.306
Token'm
Token ID1101
Feature activation-1.83e-01

Pos logit contributions
 appre+2.141
��+2.076
enegger+1.872
 obser+1.851
 roy+1.849
Neg logit contributions
izoph-1.856
attered-1.712
 MIS-1.695
BRE-1.628
ILA-1.607
Token glad
Token ID9675
Feature activation-3.43e-01

Pos logit contributions
 appre+0.627
��+0.618
enegger+0.552
 obser+0.542
 lest+0.536
Neg logit contributions
izoph-0.635
attered-0.593
 MIS-0.591
BRE-0.565
ILA-0.556
Token 2010
Token ID3050
Feature activation-3.02e-01

Pos logit contributions
 appre+1.556
��+1.518
 lest+1.391
enegger+1.386
 thereof+1.381
Neg logit contributions
izoph-1.217
attered-1.111
 MIS-1.061
senal-1.055
BRE-1.041
Token United
Token ID1578
Feature activation-3.74e-01

Pos logit contributions
 appre+1.139
��+1.116
enegger+1.011
 roy+0.994
 lest+0.993
Neg logit contributions
izoph-0.948
attered-0.872
 MIS-0.858
BRE-0.825
ILA-0.812
Token States
Token ID1829
Feature activation-1.49e-01

Pos logit contributions
 appre+0.888
��+0.864
enegger+0.728
 obser+0.711
 roy+0.709
Neg logit contributions
izoph-1.689
attered-1.600
 MIS-1.588
BRE-1.546
ILA-1.532
Token 21
Token ID2310
Feature activation-1.15e-01

Pos logit contributions
 appre+0.586
��+0.576
enegger+0.521
 obser+0.514
 lest+0.512
Neg logit contributions
izoph-0.440
attered-0.404
 MIS-0.401
BRE-0.383
ILA-0.376
Token50
Token ID1120
Feature activation-1.80e-03

Pos logit contributions
 appre+0.353
��+0.344
enegger+0.302
 obser+0.299
 roy+0.294
Neg logit contributions
izoph-0.442
 MIS-0.416
attered-0.414
BRE-0.401
ILA-0.396
Token Posts
Token ID12043
Feature activation-1.13e+00

Pos logit contributions
 appre+0.004
��+0.004
enegger+0.003
 obser+0.003
 lest+0.003
Neg logit contributions
izoph-0.009
attered-0.008
 MIS-0.008
BRE-0.008
ILA-0.008
Token #
Token ID1303
Feature activation-4.09e-01

Pos logit contributions
 appre+3.376
��+3.262
 lest+2.846
enegger+2.827
 roy+2.821
Neg logit contributions
izoph-4.348
attered-4.107
 MIS-4.025
BRE-3.885
ILA-3.877
Token16
Token ID1433
Feature activation-4.08e-01

Pos logit contributions
 appre+1.218
��+1.186
 lest+1.054
enegger+1.051
 roy+1.046
Neg logit contributions
izoph-1.595
 MIS-1.477
attered-1.465
senal-1.446
BRE-1.398
Token Glad
Token ID28442
Feature activation-4.99e-01

Pos logit contributions
 appre+1.215
��+1.196
enegger+1.047
 obser+1.024
 lest+1.012
Neg logit contributions
izoph-1.596
attered-1.498
 MIS-1.497
BRE-1.442
ILA-1.421
Token to
Token ID284
Feature activation-1.95e-01

Pos logit contributions
 appre+1.656
��+1.624
enegger+1.440
 obser+1.417
 lest+1.405
Neg logit contributions
izoph-1.792
attered-1.671
 MIS-1.664
BRE-1.599
ILA-1.578
Token hear
Token ID3285
Feature activation-9.35e-02

Pos logit contributions
 appre+0.670
��+0.659
enegger+0.587
 obser+0.577
 lest+0.572
Neg logit contributions
izoph-0.678
attered-0.631
 MIS-0.629
BRE-0.604
ILA-0.594
Token June
Token ID2795
Feature activation-3.40e-01

Pos logit contributions
 appre+2.819
��+2.668
 lest+2.347
 roy+2.322
 thereof+2.311
Neg logit contributions
izoph-4.564
attered-4.279
senal-4.163
 MIS-4.136
ILA-4.091
Token 2012
Token ID2321
Feature activation-2.50e-01

Pos logit contributions
 appre+1.309
��+1.281
enegger+1.163
 lest+1.157
 roy+1.154
Neg logit contributions
izoph-1.037
attered-0.949
 MIS-0.923
BRE-0.894
senal-0.889
Token Canada
Token ID3340
Feature activation-3.82e-02

Pos logit contributions
 appre+0.920
��+0.900
enegger+0.813
 roy+0.800
 lest+0.800
Neg logit contributions
izoph-0.806
attered-0.743
 MIS-0.730
BRE-0.704
ILA-0.693
Token 16
Token ID1467
Feature activation-2.95e-02

Pos logit contributions
 appre+0.148
��+0.145
enegger+0.132
 roy+0.130
 lest+0.130
Neg logit contributions
izoph-0.116
attered-0.106
 MIS-0.104
BRE-0.100
ILA-0.099
Token10
Token ID940
Feature activation+1.56e-01

Pos logit contributions
 appre+0.100
��+0.098
enegger+0.087
 obser+0.086
 roy+0.084
Neg logit contributions
izoph-0.103
 MIS-0.097
attered-0.097
BRE-0.093
ILA-0.092
Token Posts
Token ID12043
Feature activation-1.12e+00

Pos logit contributions
izoph+0.726
 MIS+0.694
attered+0.691
BRE+0.672
ILA+0.666
Neg logit contributions
 appre-0.347
��-0.337
enegger-0.276
 obser-0.275
 lest-0.265
Token #
Token ID1303
Feature activation-4.27e-01

Pos logit contributions
 appre+3.349
��+3.236
 lest+2.825
enegger+2.802
 roy+2.795
Neg logit contributions
izoph-4.324
attered-4.085
 MIS-4.006
BRE-3.868
ILA-3.860
Token14
Token ID1415
Feature activation-3.84e-01

Pos logit contributions
 appre+1.024
��+0.983
 thereof+0.877
 lest+0.872
enegger+0.858
Neg logit contributions
izoph-1.902
senal-1.787
 MIS-1.748
attered-1.740
ILA-1.673
Token On
Token ID1550
Feature activation-4.98e-01

Pos logit contributions
 appre+1.164
��+1.146
enegger+1.004
 obser+0.983
 lest+0.972
Neg logit contributions
izoph-1.484
attered-1.391
 MIS-1.390
BRE-1.338
ILA-1.319
Token September
Token ID2693
Feature activation+1.62e-02

Pos logit contributions
 appre+1.411
��+1.350
 lest+1.183
 roy+1.175
enegger+1.165
Neg logit contributions
izoph-2.027
attered-1.909
 MIS-1.874
senal-1.831
BRE-1.820
Token 20
Token ID1160
Feature activation-1.52e-01

Pos logit contributions
izoph+0.044
attered+0.041
 MIS+0.040
BRE+0.038
ILA+0.037
Neg logit contributions
 appre-0.067
��-0.066
enegger-0.060
 obser-0.060
 lest-0.059
Token TL
Token ID24811
Feature activation-3.78e-01

Pos logit contributions
 appre+0.568
��+0.558
enegger+0.503
 obser+0.496
 lest+0.493
Neg logit contributions
izoph-0.467
attered-0.431
 MIS-0.428
BRE-0.409
ILA-0.403
TokenAD
Token ID2885
Feature activation+4.21e-03

Pos logit contributions
 appre+1.417
��+1.395
enegger+1.247
 roy+1.245
 lest+1.242
Neg logit contributions
izoph-1.180
attered-1.074
 MIS-1.072
BRE-1.023
senal-1.017
TokenT
Token ID51
Feature activation+5.74e-02

Pos logit contributions
izoph+0.014
attered+0.013
 MIS+0.013
BRE+0.012
 HIP+0.012
Neg logit contributions
 appre-0.015
��-0.015
enegger-0.013
 obser-0.013
 lest-0.013
Token 235
Token ID28878
Feature activation+2.53e-01

Pos logit contributions
izoph+0.164
attered+0.150
 MIS+0.150
BRE+0.142
ILA+0.140
Neg logit contributions
 appre-0.232
��-0.229
enegger-0.207
 obser-0.205
 lest-0.204
Token21
Token ID2481
Feature activation+3.51e-01

Pos logit contributions
izoph+0.910
 MIS+0.878
attered+0.861
BRE+0.834
 HIP+0.827
Neg logit contributions
 appre-0.820
��-0.810
enegger-0.713
 obser-0.709
 roy-0.687
Token Posts
Token ID12043
Feature activation-1.11e+00

Pos logit contributions
izoph+1.587
 MIS+1.538
attered+1.512
BRE+1.485
 HIP+1.471
Neg logit contributions
 appre-0.834
��-0.810
 obser-0.686
enegger-0.663
 accomp-0.638
Token #
Token ID1303
Feature activation-3.84e-01

Pos logit contributions
 appre+3.337
��+3.219
 lest+2.804
enegger+2.789
 roy+2.783
Neg logit contributions
izoph-4.268
attered-4.027
 MIS-3.946
BRE-3.816
ILA-3.805
Token18
Token ID1507
Feature activation-1.88e-01

Pos logit contributions
 appre+1.179
��+1.118
 thereof+1.062
 roy+1.038
 lest+1.033
Neg logit contributions
izoph-1.457
senal-1.393
 MIS-1.313
attered-1.306
 HIP-1.227
Token Good
Token ID4599
Feature activation-2.79e-01

Pos logit contributions
 appre+0.584
��+0.581
enegger+0.514
 obser+0.499
 lest+0.493
Neg logit contributions
izoph-0.707
 MIS-0.665
attered-0.663
BRE-0.638
 HIP-0.628
Token stuff
Token ID3404
Feature activation-1.93e-01

Pos logit contributions
 appre+1.038
��+1.023
enegger+0.919
 obser+0.904
 lest+0.899
Neg logit contributions
izoph-0.891
attered-0.824
 MIS-0.820
BRE-0.785
ILA-0.770
Token!
Token ID0
Feature activation-4.54e-01

Pos logit contributions
 appre+0.654
��+0.649
enegger+0.575
 obser+0.566
 lest+0.556
Neg logit contributions
izoph-0.675
attered-0.632
 MIS-0.632
BRE-0.606
ILA-0.595
Token Hide
Token ID10415
Feature activation-2.80e-01

Pos logit contributions
 appre+0.130
��+0.128
enegger+0.116
 obser+0.116
 roy+0.114
Neg logit contributions
izoph-0.082
 MIS-0.075
attered-0.075
BRE-0.071
ILA-0.070
Token Caption
Token ID11260
Feature activation-4.93e-01

Pos logit contributions
 appre+0.774
��+0.756
enegger+0.657
 lest+0.643
 roy+0.641
Neg logit contributions
izoph-1.155
attered-1.080
 MIS-1.075
BRE-1.040
senal-1.030
Token 8
Token ID807
Feature activation-5.66e-02

Pos logit contributions
 appre+1.688
��+1.666
enegger+1.548
 thereof+1.546
 lest+1.542
Neg logit contributions
izoph-1.655
senal-1.575
attered-1.493
 MIS-1.440
BRE-1.405
Token of
Token ID286
Feature activation-8.48e-02

Pos logit contributions
 appre+0.188
��+0.184
enegger+0.164
 lest+0.161
 roy+0.161
Neg logit contributions
izoph-0.203
attered-0.188
 MIS-0.187
BRE-0.180
ILA-0.178
Token 15
Token ID1315
Feature activation+1.30e-01

Pos logit contributions
 appre+0.300
��+0.294
enegger+0.263
 obser+0.260
 lest+0.258
Neg logit contributions
izoph-0.285
attered-0.265
 MIS-0.263
BRE-0.252
ILA-0.249
Token Photos
Token ID9434
Feature activation-1.10e+00

Pos logit contributions
izoph+0.440
attered+0.408
 MIS+0.404
BRE+0.388
ILA+0.382
Neg logit contributions
 appre-0.454
��-0.445
enegger-0.399
 lest-0.392
 roy-0.392
Token:
Token ID25
Feature activation-4.92e-01

Pos logit contributions
 appre+2.848
��+2.753
 thereof+2.584
 lest+2.567
 roy+2.541
Neg logit contributions
izoph-4.574
senal-4.480
attered-4.159
 MIS-4.103
BRE-3.945
Token Carly
Token ID35982
Feature activation-1.01e-01

Pos logit contributions
 appre+1.805
��+1.784
enegger+1.637
 lest+1.611
 roy+1.604
Neg logit contributions
izoph-1.552
attered-1.415
senal-1.405
 MIS-1.376
BRE-1.313
Token Fiorina
Token ID49028
Feature activation+2.23e-01

Pos logit contributions
 appre+0.412
��+0.405
enegger+0.372
 lest+0.366
 roy+0.365
Neg logit contributions
izoph-0.283
attered-0.258
 MIS-0.253
BRE-0.242
senal-0.241
Token's
Token ID338
Feature activation+2.61e-01

Pos logit contributions
izoph+0.715
attered+0.661
 MIS+0.656
BRE+0.628
ILA+0.619
Neg logit contributions
 appre-0.827
��-0.811
enegger-0.729
 obser-0.718
 lest-0.716
Token political
Token ID1964
Feature activation+8.39e-02

Pos logit contributions
izoph+0.798
attered+0.738
 MIS+0.736
BRE+0.702
 HIP+0.689
Neg logit contributions
 appre-1.000
��-0.986
enegger-0.890
 obser-0.878
 lest-0.865
Token\n
Token ID198
Feature activation-2.62e-01

Pos logit contributions
 appre+0.617
��+0.601
enegger+0.536
 roy+0.530
 lest+0.530
Neg logit contributions
izoph-0.736
attered-0.682
 MIS-0.676
senal-0.665
BRE-0.649
Tokenheart
Token ID11499
Feature activation-6.86e-03

Pos logit contributions
 appre+1.018
��+1.000
enegger+0.904
 obser+0.892
 lest+0.887
Neg logit contributions
izoph-0.793
attered-0.730
 MIS-0.725
BRE-0.692
ILA-0.680
Tokenl
Token ID75
Feature activation+2.06e-01

Pos logit contributions
 appre+0.026
��+0.025
enegger+0.023
 obser+0.022
 lest+0.022
Neg logit contributions
izoph-0.022
attered-0.020
 MIS-0.020
BRE-0.019
ILA-0.019
Tokenxp
Token ID42372
Feature activation-3.30e-01

Pos logit contributions
izoph+0.662
attered+0.617
 MIS+0.602
BRE+0.578
Eye+0.568
Neg logit contributions
 appre-0.763
��-0.757
enegger-0.676
 obser-0.674
 lest-0.664
Token Profile
Token ID13118
Feature activation-7.80e-01

Pos logit contributions
 appre+1.044
��+1.019
enegger+0.899
 obser+0.889
 roy+0.880
Neg logit contributions
izoph-1.235
attered-1.159
 MIS-1.149
BRE-1.107
ILA-1.098
Token Joined
Token ID16709
Feature activation-1.10e+00

Pos logit contributions
 appre+2.386
��+2.324
enegger+2.019
 obser+2.001
 lest+1.997
Neg logit contributions
izoph-2.995
attered-2.806
 MIS-2.780
BRE-2.683
ILA-2.662
Token September
Token ID2693
Feature activation-5.44e-01

Pos logit contributions
 appre+2.874
��+2.739
 lest+2.380
 roy+2.371
 thereof+2.346
Neg logit contributions
izoph-4.684
attered-4.398
senal-4.295
 MIS-4.262
ILA-4.202
Token 2010
Token ID3050
Feature activation-4.16e-01

Pos logit contributions
 appre+2.034
��+1.978
enegger+1.804
 roy+1.794
 lest+1.789
Neg logit contributions
izoph-1.716
attered-1.562
 MIS-1.508
senal-1.484
BRE-1.479
Token United
Token ID1578
Feature activation-4.78e-01

Pos logit contributions
 appre+1.515
��+1.479
enegger+1.340
 roy+1.318
 lest+1.316
Neg logit contributions
izoph-1.355
attered-1.250
 MIS-1.223
BRE-1.184
senal-1.164
Token States
Token ID1829
Feature activation-3.05e-01

Pos logit contributions
 appre+1.121
��+1.090
enegger+0.920
 roy+0.898
 obser+0.896
Neg logit contributions
izoph-2.165
attered-2.051
 MIS-2.031
BRE-1.982
ILA-1.965
Token 10
Token ID838
Feature activation-8.92e-02

Pos logit contributions
 appre+1.192
��+1.167
enegger+1.061
 roy+1.047
 lest+1.045
Neg logit contributions
izoph-0.911
attered-0.834
 MIS-0.821
BRE-0.787
ILA-0.775
Token 2002
Token ID6244
Feature activation-1.86e-01

Pos logit contributions
 appre+1.442
��+1.406
 lest+1.276
enegger+1.273
 thereof+1.271
Neg logit contributions
izoph-1.132
attered-1.026
 MIS-0.981
senal-0.973
BRE-0.965
Token United
Token ID1578
Feature activation-2.61e-01

Pos logit contributions
 appre+0.686
��+0.671
enegger+0.608
 roy+0.598
 lest+0.598
Neg logit contributions
izoph-0.599
attered-0.551
 MIS-0.540
BRE-0.522
ILA-0.513
Token States
Token ID1829
Feature activation-9.54e-02

Pos logit contributions
 appre+0.621
��+0.603
enegger+0.507
 obser+0.495
 roy+0.491
Neg logit contributions
izoph-1.179
attered-1.116
 MIS-1.110
BRE-1.078
ILA-1.067
Token 3
Token ID513
Feature activation+1.70e-01

Pos logit contributions
 appre+0.379
��+0.372
enegger+0.337
 obser+0.333
 lest+0.331
Neg logit contributions
izoph-0.280
attered-0.257
 MIS-0.255
BRE-0.243
ILA-0.239
Token472
Token ID37856
Feature activation+1.73e-01

Pos logit contributions
izoph+0.497
 MIS+0.463
attered+0.461
BRE+0.439
ILA+0.431
Neg logit contributions
 appre-0.673
��-0.661
enegger-0.595
 obser-0.593
 roy-0.584
Token Posts
Token ID12043
Feature activation-1.09e+00

Pos logit contributions
izoph+0.804
 MIS+0.769
attered+0.765
BRE+0.744
ILA+0.737
Neg logit contributions
 appre-0.393
��-0.382
 obser-0.315
enegger-0.314
 lest-0.301
Token #
Token ID1303
Feature activation-3.03e-01

Pos logit contributions
 appre+3.310
��+3.191
 lest+2.778
enegger+2.757
 roy+2.754
Neg logit contributions
izoph-4.195
attered-3.949
 MIS-3.868
BRE-3.741
ILA-3.736
Token11
Token ID1157
Feature activation-2.34e-01

Pos logit contributions
 appre+0.790
��+0.769
enegger+0.661
 lest+0.647
 roy+0.645
Neg logit contributions
izoph-1.301
attered-1.218
 MIS-1.216
BRE-1.171
ILA-1.164
Token On
Token ID1550
Feature activation-4.32e-01

Pos logit contributions
 appre+0.750
��+0.741
enegger+0.654
 obser+0.641
 lest+0.634
Neg logit contributions
izoph-0.861
 MIS-0.807
attered-0.806
BRE-0.774
ILA-0.761
Token September
Token ID2693
Feature activation-2.66e-02

Pos logit contributions
 appre+1.183
��+1.146
enegger+0.977
 lest+0.971
 obser+0.969
Neg logit contributions
izoph-1.808
attered-1.699
 MIS-1.684
BRE-1.634
ILA-1.616
Token 20
Token ID1160
Feature activation-1.03e-01

Pos logit contributions
 appre+0.099
��+0.097
enegger+0.087
 obser+0.087
 lest+0.085
Neg logit contributions
izoph-0.085
attered-0.078
 MIS-0.078
BRE-0.075
ILA-0.073
Token of
Token ID286
Feature activation+1.21e-01

Pos logit contributions
 appre+0.359
��+0.352
enegger+0.319
 obser+0.313
 lest+0.310
Neg logit contributions
izoph-0.308
attered-0.283
 MIS-0.283
BRE-0.271
ILA-0.267
Token Dal
Token ID12348
Feature activation-1.89e-02

Pos logit contributions
izoph+0.378
 MIS+0.350
attered+0.349
BRE+0.330
ILA+0.326
Neg logit contributions
 appre-0.457
��-0.448
enegger-0.406
 obser-0.399
 lest-0.396
TokenK
Token ID42
Feature activation-2.46e-01

Pos logit contributions
 appre+0.064
��+0.063
enegger+0.056
 obser+0.055
 lest+0.054
Neg logit contributions
izoph-0.067
attered-0.062
 MIS-0.061
BRE-0.059
ILA-0.059
Token Profile
Token ID13118
Feature activation-5.13e-01

Pos logit contributions
 appre+0.866
��+0.850
enegger+0.760
 obser+0.750
 roy+0.742
Neg logit contributions
izoph-0.837
attered-0.781
 MIS-0.774
BRE-0.741
ILA-0.736
Token Blog
Token ID14001
Feature activation-1.70e-01

Pos logit contributions
 appre+1.620
��+1.583
enegger+1.392
 obser+1.372
 lest+1.363
Neg logit contributions
izoph-1.926
attered-1.801
 MIS-1.791
BRE-1.726
ILA-1.705
Token Joined
Token ID16709
Feature activation-1.07e+00

Pos logit contributions
 appre+0.530
��+0.517
enegger+0.455
 obser+0.446
 lest+0.445
Neg logit contributions
izoph-0.646
attered-0.604
 MIS-0.601
BRE-0.580
ILA-0.573
Token June
Token ID2795
Feature activation-3.40e-01

Pos logit contributions
 appre+2.819
��+2.668
 lest+2.347
 roy+2.322
 thereof+2.311
Neg logit contributions
izoph-4.564
attered-4.279
senal-4.163
 MIS-4.136
ILA-4.091
Token 2012
Token ID2321
Feature activation-2.50e-01

Pos logit contributions
 appre+1.309
��+1.281
enegger+1.163
 lest+1.157
 roy+1.154
Neg logit contributions
izoph-1.037
attered-0.949
 MIS-0.923
BRE-0.894
senal-0.889
Token Canada
Token ID3340
Feature activation-3.82e-02

Pos logit contributions
 appre+0.920
��+0.900
enegger+0.813
 roy+0.800
 lest+0.800
Neg logit contributions
izoph-0.806
attered-0.743
 MIS-0.730
BRE-0.704
ILA-0.693
Token 16
Token ID1467
Feature activation-2.95e-02

Pos logit contributions
 appre+0.148
��+0.145
enegger+0.132
 roy+0.130
 lest+0.130
Neg logit contributions
izoph-0.116
attered-0.106
 MIS-0.104
BRE-0.100
ILA-0.099
Token10
Token ID940
Feature activation+1.56e-01

Pos logit contributions
 appre+0.100
��+0.098
enegger+0.087
 obser+0.086
 roy+0.084
Neg logit contributions
izoph-0.103
 MIS-0.097
attered-0.097
BRE-0.093
ILA-0.092
Token14
Token ID1415
Feature activation-2.87e-01

Pos logit contributions
 appre+2.214
��+2.169
enegger+1.934
 lest+1.904
 obser+1.893
Neg logit contributions
izoph-2.234
attered-2.067
 MIS-2.045
BRE-1.944
senal-1.941
Token Recommend
Token ID30361
Feature activation-3.76e-01

Pos logit contributions
 appre+1.095
��+1.083
enegger+0.985
 obser+0.963
 roy+0.950
Neg logit contributions
izoph-0.878
 MIS-0.822
attered-0.816
BRE-0.781
ILA-0.768
Token\n
Token ID198
Feature activation-7.95e-01

Pos logit contributions
 appre+1.186
��+1.149
enegger+1.021
 lest+1.008
 roy+1.006
Neg logit contributions
izoph-1.419
attered-1.307
 MIS-1.294
senal-1.267
BRE-1.253
Token\n
Token ID198
Feature activation-7.84e-01

Pos logit contributions
 appre+2.161
��+2.061
 thereof+1.924
 lest+1.905
 roy+1.896
Neg logit contributions
izoph-3.266
senal-3.195
attered-2.967
 MIS-2.923
ILA-2.822
TokenRecommend
Token ID41248
Feature activation-2.84e-01

Pos logit contributions
 appre+3.140
��+3.065
 thereof+2.759
 lest+2.757
lain+2.718
Neg logit contributions
izoph-2.292
 MIS-2.032
attered-1.984
senal-1.950
 HIP-1.842
Token Share
Token ID8734
Feature activation-1.07e+00

Pos logit contributions
 appre+1.137
��+1.113
enegger+1.003
 lest+0.995
 obser+0.993
Neg logit contributions
izoph-0.835
attered-0.758
 MIS-0.752
BRE-0.719
ILA-0.707
Token this
Token ID428
Feature activation-3.23e-01

Pos logit contributions
 appre+3.294
 roy+2.797
��+2.773
 thereof+2.756
 lest+2.750
Neg logit contributions
izoph-4.223
attered-3.756
senal-3.732
 MIS-3.672
ILA-3.599
Token comment
Token ID2912
Feature activation-5.87e-01

Pos logit contributions
 appre+1.015
��+0.974
enegger+0.861
 lest+0.855
 obser+0.851
Neg logit contributions
izoph-1.224
attered-1.146
 MIS-1.131
BRE-1.091
ILA-1.079
Token on
Token ID319
Feature activation-2.51e-01

Pos logit contributions
 appre+1.611
��+1.510
 lest+1.347
 thereof+1.322
 roy+1.319
Neg logit contributions
izoph-2.429
attered-2.282
 MIS-2.270
senal-2.211
BRE-2.198
Token Facebook
Token ID3203
Feature activation-3.70e-01

Pos logit contributions
 appre+0.634
��+0.595
enegger+0.512
 lest+0.511
 roy+0.510
Neg logit contributions
izoph-1.114
attered-1.041
 MIS-1.026
senal-0.998
BRE-0.997
Token Share
Token ID8734
Feature activation-6.17e-01

Pos logit contributions
 appre+1.464
��+1.427
 lest+1.289
enegger+1.283
 roy+1.280
Neg logit contributions
izoph-1.098
attered-1.006
 MIS-0.987
BRE-0.942
ILA-0.928
Token55
Token ID2816
Feature activation-1.17e-01

Pos logit contributions
 appre+2.369
��+2.308
enegger+2.021
 lest+2.013
 roy+1.979
Neg logit contributions
izoph-3.144
attered-2.917
 MIS-2.879
senal-2.826
BRE-2.731
Token Recommend
Token ID30361
Feature activation-3.08e-01

Pos logit contributions
 appre+0.420
��+0.410
enegger+0.367
 lest+0.363
 obser+0.361
Neg logit contributions
izoph-0.386
attered-0.357
 MIS-0.353
BRE-0.339
ILA-0.333
Token\n
Token ID198
Feature activation-8.59e-01

Pos logit contributions
 appre+0.960
��+0.926
enegger+0.826
 lest+0.820
 roy+0.818
Neg logit contributions
izoph-1.168
attered-1.078
 MIS-1.067
senal-1.059
BRE-1.030
Token\n
Token ID198
Feature activation-8.67e-01

Pos logit contributions
 appre+2.316
��+2.208
 thereof+2.064
 lest+2.042
 roy+2.036
Neg logit contributions
izoph-3.533
senal-3.473
attered-3.220
 MIS-3.163
ILA-3.054
TokenRecommend
Token ID41248
Feature activation-3.25e-01

Pos logit contributions
 appre+3.454
��+3.334
 thereof+3.052
 lest+3.043
lain+3.010
Neg logit contributions
izoph-2.558
senal-2.222
 MIS-2.211
attered-2.199
 HIP-2.037
Token Share
Token ID8734
Feature activation-1.06e+00

Pos logit contributions
 appre+1.291
��+1.258
enegger+1.132
 lest+1.127
 roy+1.123
Neg logit contributions
izoph-0.964
attered-0.873
 MIS-0.864
BRE-0.828
ILA-0.814
Token this
Token ID428
Feature activation-3.56e-01

Pos logit contributions
 appre+3.506
��+3.161
 lest+3.090
 roy+3.077
 thereof+3.036
Neg logit contributions
izoph-3.893
senal-3.707
attered-3.482
 MIS-3.386
ILA-3.314
Token comment
Token ID2912
Feature activation-6.34e-01

Pos logit contributions
 appre+1.046
��+0.991
 lest+0.888
 roy+0.879
enegger+0.870
Neg logit contributions
izoph-1.425
attered-1.330
senal-1.317
 MIS-1.305
BRE-1.271
Token on
Token ID319
Feature activation-2.38e-01

Pos logit contributions
 appre+1.744
��+1.672
 roy+1.485
 lest+1.482
 thereof+1.473
Neg logit contributions
izoph-2.629
senal-2.446
attered-2.435
 MIS-2.406
BRE-2.331
Token Facebook
Token ID3203
Feature activation-3.12e-01

Pos logit contributions
 appre+0.634
��+0.616
enegger+0.530
 lest+0.517
 roy+0.516
Neg logit contributions
izoph-1.010
attered-0.950
 MIS-0.945
BRE-0.915
ILA-0.906
Token Share
Token ID8734
Feature activation-6.06e-01

Pos logit contributions
 appre+1.182
��+1.145
 lest+1.045
 roy+1.030
 thereof+1.025
Neg logit contributions
izoph-0.979
attered-0.891
 MIS-0.863
senal-0.862
BRE-0.835
Token 2011
Token ID2813
Feature activation-3.19e-01

Pos logit contributions
 appre+1.094
��+1.067
enegger+0.976
 lest+0.970
 roy+0.967
Neg logit contributions
izoph-0.855
attered-0.779
 MIS-0.755
senal-0.734
BRE-0.733
Token United
Token ID1578
Feature activation-3.30e-01

Pos logit contributions
 appre+1.178
��+1.152
enegger+1.044
 lest+1.027
 roy+1.026
Neg logit contributions
izoph-1.025
attered-0.944
 MIS-0.925
BRE-0.893
ILA-0.879
Token States
Token ID1829
Feature activation-1.12e-01

Pos logit contributions
 appre+0.792
��+0.770
enegger+0.650
 obser+0.635
 roy+0.633
Neg logit contributions
izoph-1.483
attered-1.404
 MIS-1.394
BRE-1.356
ILA-1.344
Token 228
Token ID29041
Feature activation+6.76e-02

Pos logit contributions
 appre+0.442
��+0.434
enegger+0.393
 obser+0.388
 lest+0.386
Neg logit contributions
izoph-0.331
attered-0.304
 MIS-0.301
BRE-0.287
ILA-0.283
Token65
Token ID2996
Feature activation+3.80e-01

Pos logit contributions
izoph+0.249
 MIS+0.237
attered+0.235
BRE+0.227
ILA+0.224
Neg logit contributions
 appre-0.217
��-0.212
 obser-0.186
enegger-0.185
 lest-0.181
Token Posts
Token ID12043
Feature activation-1.06e+00

Pos logit contributions
izoph+1.725
 MIS+1.662
attered+1.645
BRE+1.604
 HIP+1.591
Neg logit contributions
 appre-0.896
��-0.874
 obser-0.736
enegger-0.710
 accomp-0.689
Token #
Token ID1303
Feature activation-3.87e-01

Pos logit contributions
 appre+3.194
��+3.090
 lest+2.687
enegger+2.669
 roy+2.668
Neg logit contributions
izoph-4.093
attered-3.861
 MIS-3.787
BRE-3.663
ILA-3.657
Token13
Token ID1485
Feature activation-3.10e-01

Pos logit contributions
 appre+0.922
��+0.898
enegger+0.777
 lest+0.770
 thereof+0.768
Neg logit contributions
izoph-1.731
attered-1.603
 MIS-1.601
senal-1.599
ILA-1.543
Token On
Token ID1550
Feature activation-4.94e-01

Pos logit contributions
 appre+0.952
��+0.939
enegger+0.824
 obser+0.806
 lest+0.797
Neg logit contributions
izoph-1.186
 MIS-1.113
attered-1.111
BRE-1.069
ILA-1.052
Token September
Token ID2693
Feature activation-3.99e-02

Pos logit contributions
 appre+1.410
��+1.352
 lest+1.181
 roy+1.173
enegger+1.169
Neg logit contributions
izoph-2.001
attered-1.885
 MIS-1.852
BRE-1.800
senal-1.800
Token 20
Token ID1160
Feature activation-1.43e-01

Pos logit contributions
 appre+0.167
��+0.164
enegger+0.148
 obser+0.148
 lest+0.145
Neg logit contributions
izoph-0.109
attered-0.100
 MIS-0.100
BRE-0.095
 HIP-0.093
Token6
Token ID21
Feature activation-1.14e-01

Pos logit contributions
 appre+0.105
��+0.103
enegger+0.092
 obser+0.091
 roy+0.090
Neg logit contributions
izoph-0.104
attered-0.097
 MIS-0.096
BRE-0.093
ILA-0.091
Token {
Token ID1391
Feature activation+2.08e-01

Pos logit contributions
 appre+0.445
��+0.435
enegger+0.394
 lest+0.392
 roy+0.390
Neg logit contributions
izoph-0.342
attered-0.310
 MIS-0.307
BRE-0.292
ILA-0.292
Token(
Token ID7
Feature activation-1.04e-01

Pos logit contributions
izoph+0.646
attered+0.596
 MIS+0.592
BRE+0.565
ILA+0.557
Neg logit contributions
 appre-0.791
��-0.776
enegger-0.700
 obser-0.691
 lest-0.686
Token4
Token ID19
Feature activation-2.74e-01

Pos logit contributions
 appre+0.361
��+0.354
enegger+0.315
 obser+0.311
 roy+0.308
Neg logit contributions
izoph-0.357
attered-0.333
 MIS-0.331
BRE-0.318
ILA-0.314
Tokens
Token ID82
Feature activation-3.43e-03

Pos logit contributions
 appre+0.993
��+0.974
enegger+0.872
 obser+0.861
 roy+0.858
Neg logit contributions
izoph-0.903
attered-0.833
 MIS-0.829
BRE-0.794
ILA-0.784
Token)}
Token ID38165
Feature activation+5.44e-01

Pos logit contributions
 appre+0.012
��+0.012
 obser+0.011
enegger+0.011
 roy+0.011
Neg logit contributions
izoph-0.011
attered-0.011
 MIS-0.010
BRE-0.010
ILA-0.010
Token 60
Token ID3126
Feature activation+2.18e-01

Pos logit contributions
izoph+1.441
 MIS+1.304
attered+1.301
BRE+1.227
 HIP+1.200
Neg logit contributions
 appre-2.314
��-2.278
enegger-2.077
 obser-2.052
 roy-2.040
Token.
Token ID13
Feature activation-8.19e-02

Pos logit contributions
izoph+0.745
attered+0.697
 MIS+0.695
BRE+0.671
ILA+0.657
Neg logit contributions
 appre-0.766
��-0.753
 obser-0.668
enegger-0.666
 roy-0.646
Token e
Token ID304
Feature activation-1.82e-02

Pos logit contributions
 appre+0.266
��+0.262
enegger+0.233
 obser+0.227
 roy+0.224
Neg logit contributions
izoph-0.297
attered-0.279
 MIS-0.279
BRE-0.268
 HIP-0.265
Token7
Token ID22
Feature activation-3.20e-01

Pos logit contributions
 appre+0.066
��+0.065
enegger+0.058
 obser+0.057
 roy+0.056
Neg logit contributions
izoph-0.060
attered-0.056
 MIS-0.055
BRE-0.053
ILA-0.052
Token {
Token ID1391
Feature activation+2.04e-02

Pos logit contributions
 appre+1.275
��+1.252
enegger+1.134
 obser+1.121
 lest+1.117
Neg logit contributions
izoph-0.937
attered-0.858
 MIS-0.853
BRE-0.811
ILA-0.800
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
 appre+0.009
��+0.009
enegger+0.008
 obser+0.008
 lest+0.008
Neg logit contributions
izoph-0.008
attered-0.008
 MIS-0.008
BRE-0.007
ILA-0.007
Token
Token ID198
Feature activation+7.52e-02

Pos logit contributions
 appre+0.485
��+0.475
enegger+0.421
 obser+0.418
 roy+0.410
Neg logit contributions
izoph-0.494
 MIS-0.463
attered-0.462
BRE-0.445
ILA-0.437
Token
Token ID198
Feature activation+4.55e-02

Pos logit contributions
izoph+0.268
attered+0.250
 MIS+0.249
BRE+0.239
ILA+0.236
Neg logit contributions
 appre-0.251
��-0.246
enegger-0.218
 obser-0.215
 lest-0.213
Token
Token ID3581
Feature activation+1.24e-01

Pos logit contributions
izoph+0.152
attered+0.142
 MIS+0.142
BRE+0.139
ILA+0.135
Neg logit contributions
 appre-0.162
��-0.158
enegger-0.142
 obser-0.141
 roy-0.138
Token The
Token ID383
Feature activation-7.21e-02

Pos logit contributions
izoph+0.392
 MIS+0.372
attered+0.367
BRE+0.355
ILA+0.348
Neg logit contributions
 appre-0.460
��-0.450
 obser-0.405
enegger-0.403
 roy-0.394
Token Marlins
Token ID38134
Feature activation+4.58e-01

Pos logit contributions
 appre+0.257
��+0.252
enegger+0.226
 obser+0.223
 lest+0.220
Neg logit contributions
izoph-0.240
attered-0.224
 MIS-0.223
BRE-0.213
ILA-0.210
Token are
Token ID389
Feature activation-1.03e-01

Pos logit contributions
izoph+1.500
 MIS+1.411
attered+1.409
BRE+1.345
ILA+1.317
Neg logit contributions
 appre-1.618
��-1.611
enegger-1.437
 obser-1.413
 lest-1.391
Token one
Token ID530
Feature activation+1.84e-01

Pos logit contributions
 appre+0.378
��+0.371
enegger+0.332
 obser+0.328
 lest+0.326
Neg logit contributions
izoph-0.336
attered-0.311
 MIS-0.309
BRE-0.296
ILA-0.292
Token of
Token ID286
Feature activation+1.38e-01

Pos logit contributions
izoph+0.632
attered+0.594
 MIS+0.593
BRE+0.569
 HIP+0.558
Neg logit contributions
 appre-0.634
��-0.623
enegger-0.563
 obser-0.549
lain-0.534
Token only
Token ID691
Feature activation+1.91e-01

Pos logit contributions
izoph+0.425
attered+0.396
 MIS+0.396
BRE+0.376
ILA+0.370
Neg logit contributions
 appre-0.527
��-0.516
enegger-0.467
 obser-0.461
 lest-0.454
Token three
Token ID1115
Feature activation-7.25e-02

Pos logit contributions
izoph+0.592
 MIS+0.554
attered+0.552
BRE+0.527
 HIP+0.518
Neg logit contributions
 appre-0.719
��-0.706
enegger-0.638
 obser-0.630
 roy-0.616
Token unite
Token ID24558
Feature activation-2.55e-01

Pos logit contributions
 appre+0.290
��+0.288
enegger+0.256
 obser+0.251
 lest+0.247
Neg logit contributions
izoph-0.302
attered-0.284
 MIS-0.282
BRE-0.270
 HIP-0.265
Token and
Token ID290
Feature activation+1.90e-01

Pos logit contributions
 appre+0.782
��+0.774
enegger+0.675
 obser+0.662
 roy+0.648
Neg logit contributions
izoph-0.965
attered-0.916
 MIS-0.912
BRE-0.876
ILA-0.864
Token demand
Token ID3512
Feature activation-1.20e-02

Pos logit contributions
izoph+0.726
 MIS+0.688
attered+0.687
BRE+0.664
 HIP+0.656
Neg logit contributions
 appre-0.565
��-0.565
enegger-0.497
 obser-0.479
lain-0.472
Token common
Token ID2219
Feature activation-5.71e-01

Pos logit contributions
 appre+0.044
��+0.043
enegger+0.039
 obser+0.038
 roy+0.038
Neg logit contributions
izoph-0.039
attered-0.036
 MIS-0.036
BRE-0.035
 HIP-0.034
Token-
Token ID12
Feature activation+2.93e-01

Pos logit contributions
 appre+1.459
��+1.429
enegger+1.215
 obser+1.186
 roy+1.172
Neg logit contributions
izoph-2.483
attered-2.350
 MIS-2.345
BRE-2.266
ILA-2.237
Tokensense
Token ID33819
Feature activation+4.95e-01

Pos logit contributions
izoph+0.446
attered+0.398
 MIS+0.373
BRE+0.353
speech+0.330
Neg logit contributions
 appre-1.566
��-1.561
 obser-1.441
enegger-1.427
 roy-1.420
Token gun
Token ID2485
Feature activation-4.16e-02

Pos logit contributions
izoph+1.530
 MIS+1.449
attered+1.435
 HIP+1.389
BRE+1.369
Neg logit contributions
 appre-1.839
��-1.797
enegger-1.624
 obser-1.599
esides-1.572
Token laws
Token ID3657
Feature activation-4.68e-02

Pos logit contributions
 appre+0.143
��+0.140
enegger+0.124
 obser+0.123
 roy+0.122
Neg logit contributions
izoph-0.144
attered-0.135
 MIS-0.134
BRE-0.129
ILA-0.127
Token will
Token ID481
Feature activation-2.76e-02

Pos logit contributions
 appre+0.169
��+0.166
enegger+0.149
 obser+0.147
 roy+0.145
Neg logit contributions
izoph-0.152
attered-0.143
 MIS-0.142
BRE-0.136
ILA-0.134
Token we
Token ID356
Feature activation+2.28e-01

Pos logit contributions
 appre+0.094
��+0.092
enegger+0.082
 obser+0.081
 roy+0.080
Neg logit contributions
izoph-0.097
attered-0.090
 MIS-0.090
BRE-0.086
ILA-0.085
Token be
Token ID307
Feature activation+7.53e-02

Pos logit contributions
izoph+0.796
attered+0.748
 MIS+0.739
BRE+0.716
 HIP+0.700
Neg logit contributions
��-0.755
 appre-0.752
enegger-0.664
 obser-0.657
lain-0.637
Token up
Token ID510
Feature activation+2.99e-02

Pos logit contributions
 appre+0.622
��+0.614
enegger+0.545
 obser+0.537
 roy+0.525
Neg logit contributions
izoph-0.631
attered-0.590
 MIS-0.589
BRE-0.564
ILA-0.554
Token front
Token ID2166
Feature activation+2.30e-02

Pos logit contributions
izoph+0.108
attered+0.102
 MIS+0.101
BRE+0.097
ILA+0.096
Neg logit contributions
 appre-0.098
��-0.096
enegger-0.085
 obser-0.083
 lest-0.082
Token.
Token ID13
Feature activation+1.75e-01

Pos logit contributions
izoph+0.080
attered+0.076
 MIS+0.075
BRE+0.072
ILA+0.071
Neg logit contributions
 appre-0.078
��-0.077
enegger-0.068
 obser-0.067
 roy-0.065
Token Moreover
Token ID10968
Feature activation+8.24e-02

Pos logit contributions
izoph+0.650
 MIS+0.616
attered+0.611
BRE+0.591
ILA+0.583
Neg logit contributions
 appre-0.565
��-0.551
 obser-0.483
enegger-0.478
 roy-0.464
Token,
Token ID11
Feature activation+3.56e-02

Pos logit contributions
izoph+0.305
attered+0.286
 MIS+0.284
BRE+0.273
ILA+0.270
Neg logit contributions
 appre-0.264
��-0.258
enegger-0.228
 obser-0.223
 roy-0.222
Token upset
Token ID9247
Feature activation+4.84e-01

Pos logit contributions
izoph+0.120
attered+0.112
 MIS+0.111
BRE+0.106
ILA+0.105
Neg logit contributions
 appre-0.126
��-0.124
enegger-0.110
 obser-0.109
 roy-0.107
Token that
Token ID326
Feature activation+5.37e-02

Pos logit contributions
izoph+1.607
attered+1.497
 MIS+1.496
BRE+1.415
 HIP+1.395
Neg logit contributions
 appre-1.724
��-1.700
enegger-1.527
 obser-1.497
 lest-1.456
Token he
Token ID339
Feature activation+6.35e-02

Pos logit contributions
izoph+0.175
attered+0.162
 MIS+0.162
BRE+0.154
ILA+0.151
Neg logit contributions
 appre-0.196
��-0.193
enegger-0.173
 obser-0.170
 roy-0.168
Token was
Token ID373
Feature activation-1.25e-02

Pos logit contributions
izoph+0.198
attered+0.186
 MIS+0.184
BRE+0.175
ILA+0.171
Neg logit contributions
 appre-0.237
��-0.234
enegger-0.209
 obser-0.208
esides-0.204
Token not
Token ID407
Feature activation+6.10e-03

Pos logit contributions
 appre+0.047
��+0.046
enegger+0.041
 obser+0.041
 lest+0.040
Neg logit contributions
izoph-0.039
attered-0.037
 MIS-0.037
BRE-0.035
ILA-0.034
Token getting
Token ID1972
Feature activation+4.03e-02

Pos logit contributions
izoph+0.021
attered+0.019
 MIS+0.019
BRE+0.018
ILA+0.018
Neg logit contributions
 appre-0.021
��-0.021
enegger-0.019
 obser-0.018
 lest-0.018
Token
Token ID198
Feature activation+3.75e-02

Pos logit contributions
izoph+0.299
attered+0.279
 MIS+0.277
BRE+0.267
ILA+0.263
Neg logit contributions
 appre-0.278
��-0.272
enegger-0.241
 obser-0.238
 lest-0.236
TokenBut
Token ID1537
Feature activation+1.53e-01

Pos logit contributions
izoph+0.115
attered+0.107
 MIS+0.106
BRE+0.103
ILA+0.101
Neg logit contributions
 appre-0.145
��-0.142
 obser-0.128
enegger-0.128
 roy-0.125
Token at
Token ID379
Feature activation-1.85e-01

Pos logit contributions
izoph+0.518
attered+0.482
 MIS+0.480
BRE+0.460
ILA+0.454
Neg logit contributions
 appre-0.536
��-0.527
enegger-0.469
 obser-0.464
 roy-0.459
Token the
Token ID262
Feature activation+4.18e-02

Pos logit contributions
 appre+0.649
��+0.635
enegger+0.567
 obser+0.559
 lest+0.556
Neg logit contributions
izoph-0.629
attered-0.584
 MIS-0.580
BRE-0.557
ILA-0.549
Token same
Token ID976
Feature activation+1.62e-01

Pos logit contributions
izoph+0.163
 MIS+0.154
attered+0.154
BRE+0.148
ILA+0.147
Neg logit contributions
 appre-0.124
��-0.123
enegger-0.107
 obser-0.104
 lest-0.103
Token time
Token ID640
Feature activation+5.72e-01

Pos logit contributions
izoph+0.430
 MIS+0.393
attered+0.393
BRE+0.372
ILA+0.364
Neg logit contributions
 appre-0.682
��-0.674
enegger-0.618
 obser-0.603
 roy-0.602
Token he
Token ID339
Feature activation+9.62e-02

Pos logit contributions
izoph+1.964
attered+1.866
 MIS+1.863
BRE+1.771
 HIP+1.762
Neg logit contributions
 appre-1.934
��-1.922
enegger-1.696
 obser-1.665
 roy-1.630
Token realizes
Token ID23660
Feature activation+9.74e-02

Pos logit contributions
izoph+0.278
attered+0.264
 MIS+0.260
BRE+0.248
ILA+0.246
Neg logit contributions
��-0.373
 appre-0.373
enegger-0.332
 obser-0.332
 lest-0.325
Token he
Token ID339
Feature activation-9.45e-02

Pos logit contributions
izoph+0.329
attered+0.306
 MIS+0.305
BRE+0.291
ILA+0.287
Neg logit contributions
 appre-0.343
��-0.338
enegger-0.302
 obser-0.297
 roy-0.294
Token
Token ID447
Feature activation+9.99e-02

Pos logit contributions
 appre+0.361
��+0.358
enegger+0.320
 obser+0.319
 lest+0.313
Neg logit contributions
izoph-0.286
attered-0.269
 MIS-0.266
BRE-0.253
ILA-0.250
Token
Token ID247
Feature activation+1.94e-01

Pos logit contributions
izoph+0.330
attered+0.307
 MIS+0.306
BRE+0.297
ILA+0.295
Neg logit contributions
 appre-0.360
��-0.336
enegger-0.321
 obser-0.317
 accomp-0.309
Token adjust
Token ID4532
Feature activation-3.64e-02

Pos logit contributions
 appre+0.464
��+0.463
enegger+0.412
 obser+0.404
 lest+0.397
Neg logit contributions
izoph-0.472
 MIS-0.443
attered-0.440
BRE-0.423
 HIP-0.416
Token our
Token ID674
Feature activation-8.70e-02

Pos logit contributions
 appre+0.123
��+0.121
enegger+0.108
 obser+0.106
 roy+0.104
Neg logit contributions
izoph-0.127
 MIS-0.119
attered-0.119
BRE-0.113
 HIP-0.112
Token pronouns
Token ID43947
Feature activation-8.17e-02

Pos logit contributions
 appre+0.279
��+0.274
enegger+0.242
 obser+0.237
 roy+0.235
Neg logit contributions
izoph-0.321
attered-0.301
 MIS-0.300
BRE-0.288
ILA-0.283
Token accordingly
Token ID16062
Feature activation-3.62e-01

Pos logit contributions
 appre+0.284
��+0.279
enegger+0.251
 obser+0.246
 roy+0.240
Neg logit contributions
izoph-0.279
attered-0.260
 MIS-0.259
BRE-0.248
ILA-0.244
Token (
Token ID357
Feature activation+7.98e-02

Pos logit contributions
 appre+1.194
��+1.169
enegger+1.034
 obser+1.021
 roy+1.008
Neg logit contributions
izoph-1.306
attered-1.221
 MIS-1.216
BRE-1.169
ILA-1.150
Tokenand
Token ID392
Feature activation-1.29e-01

Pos logit contributions
izoph+0.266
attered+0.250
 MIS+0.248
BRE+0.242
ILA+0.236
Neg logit contributions
 appre-0.285
��-0.277
enegger-0.249
 obser-0.247
 roy-0.242
Token instantly
Token ID11101
Feature activation+1.48e-01

Pos logit contributions
 appre+0.434
��+0.427
enegger+0.380
 obser+0.373
 roy+0.368
Neg logit contributions
izoph-0.454
 MIS-0.425
attered-0.424
BRE-0.407
 HIP-0.400
Token)
Token ID8
Feature activation-1.20e-01

Pos logit contributions
izoph+0.515
 MIS+0.490
attered+0.485
BRE+0.467
 HIP+0.460
Neg logit contributions
 appre-0.493
��-0.492
enegger-0.436
 obser-0.428
lain-0.418
Token or
Token ID393
Feature activation-6.66e-02

Pos logit contributions
 appre+0.417
��+0.409
enegger+0.364
 obser+0.360
 roy+0.354
Neg logit contributions
izoph-0.415
 MIS-0.386
attered-0.386
BRE-0.371
ILA-0.363
Token else
Token ID2073
Feature activation-6.61e-01

Pos logit contributions
 appre+0.218
��+0.215
enegger+0.191
 obser+0.187
 roy+0.184
Neg logit contributions
izoph-0.240
 MIS-0.225
attered-0.225
BRE-0.215
 HIP-0.212
Token we
Token ID356
Feature activation+1.22e-01

Pos logit contributions
 appre+2.115
��+2.064
enegger+1.820
 lest+1.799
 obser+1.798
Neg logit contributions
izoph-2.440
attered-2.283
 MIS-2.260
BRE-2.181
ILA-2.160
Token Changing
Token ID33680
Feature activation-1.02e-02

Pos logit contributions
 appre+0.088
��+0.087
enegger+0.078
 obser+0.076
 lest+0.076
Neg logit contributions
izoph-0.079
 MIS-0.074
attered-0.073
BRE-0.070
ILA-0.069
Token
Token ID198
Feature activation-2.10e-01

Pos logit contributions
 appre+0.036
��+0.035
enegger+0.031
 obser+0.031
 roy+0.030
Neg logit contributions
izoph-0.035
 MIS-0.033
attered-0.033
BRE-0.031
 HIP-0.031
Token
Token ID198
Feature activation-1.69e-01

Pos logit contributions
 appre+0.667
��+0.652
enegger+0.580
 lest+0.572
 roy+0.572
Neg logit contributions
izoph-0.778
attered-0.723
 MIS-0.716
senal-0.696
BRE-0.688
TokenThe
Token ID464
Feature activation-7.78e-02

Pos logit contributions
 appre+0.659
��+0.647
enegger+0.585
 obser+0.577
 lest+0.575
Neg logit contributions
izoph-0.508
attered-0.467
 MIS-0.464
BRE-0.442
ILA-0.435
Token competitive
Token ID7606
Feature activation-3.51e-03

Pos logit contributions
 appre+0.263
��+0.258
enegger+0.230
 obser+0.226
 roy+0.224
Neg logit contributions
izoph-0.274
 MIS-0.257
attered-0.255
BRE-0.245
 HIP-0.241
Token landscape
Token ID10747
Feature activation-1.56e-01

Pos logit contributions
 appre+0.012
��+0.012
enegger+0.010
 obser+0.010
 lest+0.010
Neg logit contributions
izoph-0.012
 MIS-0.011
attered-0.011
BRE-0.011
ILA-0.011
Token,
Token ID11
Feature activation+9.95e-02

Pos logit contributions
 appre+0.553
��+0.543
enegger+0.485
 obser+0.478
 lest+0.474
Neg logit contributions
izoph-0.524
attered-0.488
 MIS-0.486
BRE-0.466
ILA-0.458
Token however
Token ID2158
Feature activation+7.65e-02

Pos logit contributions
izoph+0.341
attered+0.319
 MIS+0.318
BRE+0.304
 HIP+0.299
Neg logit contributions
 appre-0.345
��-0.338
enegger-0.302
 obser-0.297
 roy-0.294
Token,
Token ID11
Feature activation+8.25e-02

Pos logit contributions
izoph+0.282
attered+0.264
 MIS+0.263
BRE+0.253
ILA+0.249
Neg logit contributions
 appre-0.247
��-0.241
enegger-0.213
 obser-0.209
 roy-0.207
Token constantly
Token ID7558
Feature activation+1.01e-01

Pos logit contributions
izoph+0.265
 MIS+0.247
attered+0.246
BRE+0.235
ILA+0.230
Neg logit contributions
 appre-0.302
��-0.299
enegger-0.267
 obser-0.263
 lest-0.260
Token pushes
Token ID20070
Feature activation+1.67e-01

Pos logit contributions
izoph+0.216
 MIS+0.195
attered+0.195
BRE+0.181
ILA+0.174
Neg logit contributions
 appre-0.475
��-0.473
enegger-0.434
 obser-0.427
 lest-0.424
Token when
Token ID618
Feature activation-2.63e-02

Pos logit contributions
izoph+0.984
 MIS+0.932
attered+0.928
BRE+0.885
ILA+0.876
Neg logit contributions
 appre-1.021
��-1.008
enegger-0.894
 obser-0.887
 roy-0.875
Token a
Token ID257
Feature activation-1.40e-01

Pos logit contributions
 appre+0.086
��+0.085
enegger+0.075
 obser+0.074
 lest+0.073
Neg logit contributions
izoph-0.095
attered-0.089
 MIS-0.088
BRE-0.085
ILA-0.084
Token man
Token ID582
Feature activation-5.11e-03

Pos logit contributions
 appre+0.404
��+0.393
enegger+0.339
 obser+0.336
 roy+0.334
Neg logit contributions
izoph-0.565
attered-0.529
 MIS-0.525
BRE-0.509
ILA-0.504
Token poured
Token ID19036
Feature activation-1.15e-01

Pos logit contributions
 appre+0.018
��+0.018
enegger+0.016
 obser+0.015
 lest+0.015
Neg logit contributions
izoph-0.017
attered-0.016
 MIS-0.016
BRE-0.015
ILA-0.015
Token alcohol
Token ID5548
Feature activation-2.08e-01

Pos logit contributions
 appre+0.384
��+0.378
enegger+0.336
 obser+0.329
 lest+0.327
Neg logit contributions
izoph-0.412
attered-0.385
 MIS-0.384
BRE-0.368
ILA-0.363
Token on
Token ID319
Feature activation+7.62e-02

Pos logit contributions
 appre+0.676
��+0.666
enegger+0.589
 obser+0.579
 lest+0.571
Neg logit contributions
izoph-0.758
attered-0.715
 MIS-0.710
BRE-0.682
ILA-0.672
Token her
Token ID607
Feature activation+2.32e-01

Pos logit contributions
izoph+0.265
attered+0.251
 MIS+0.249
BRE+0.238
ILA+0.235
Neg logit contributions
 appre-0.259
��-0.255
enegger-0.228
 obser-0.223
 lest-0.219
Token and
Token ID290
Feature activation+1.24e-01

Pos logit contributions
izoph+0.810
attered+0.774
 MIS+0.773
BRE+0.734
 HIP+0.729
Neg logit contributions
��-0.770
 appre-0.756
enegger-0.677
 obser-0.657
esides-0.640
Token lit
Token ID6578
Feature activation+1.32e-01

Pos logit contributions
izoph+0.433
attered+0.412
 MIS+0.411
BRE+0.394
 HIP+0.386
Neg logit contributions
 appre-0.411
��-0.409
enegger-0.363
 obser-0.353
lain-0.346
Token a
Token ID257
Feature activation+1.35e-01

Pos logit contributions
izoph+0.429
attered+0.399
 MIS+0.396
BRE+0.377
ILA+0.371
Neg logit contributions
 appre-0.487
��-0.480
enegger-0.432
 obser-0.424
 lest-0.419
Token match
Token ID2872
Feature activation-2.79e-02

Pos logit contributions
izoph+0.479
 MIS+0.449
attered+0.448
BRE+0.429
 HIP+0.422
Neg logit contributions
 appre-0.451
��-0.446
enegger-0.398
 obser-0.388
 lest-0.383
Token people
Token ID661
Feature activation+1.12e-01

Pos logit contributions
 appre+0.319
��+0.316
enegger+0.279
 obser+0.272
 lest+0.270
Neg logit contributions
izoph-0.364
 MIS-0.342
attered-0.342
BRE-0.327
 HIP-0.323
Token,
Token ID11
Feature activation+2.28e-02

Pos logit contributions
izoph+0.356
attered+0.340
 MIS+0.339
BRE+0.318
 HIP+0.317
Neg logit contributions
��-0.402
 appre-0.402
enegger-0.359
 obser-0.351
lain-0.339
Token as
Token ID355
Feature activation+7.68e-03

Pos logit contributions
izoph+0.079
 MIS+0.076
attered+0.075
BRE+0.072
 HIP+0.072
Neg logit contributions
 appre-0.076
��-0.075
enegger-0.066
 obser-0.065
 roy-0.064
Token well
Token ID880
Feature activation+3.59e-01

Pos logit contributions
izoph+0.033
 MIS+0.031
attered+0.031
 HIP+0.030
BRE+0.030
Neg logit contributions
 appre-0.020
��-0.020
enegger-0.016
 obser-0.016
 lest-0.016
Token as
Token ID355
Feature activation+4.30e-02

Pos logit contributions
izoph+1.039
 MIS+0.964
attered+0.963
BRE+0.911
 HIP+0.890
Neg logit contributions
 appre-1.432
��-1.410
enegger-1.271
 obser-1.255
 lest-1.240
Token if
Token ID611
Feature activation+8.24e-02

Pos logit contributions
izoph+0.139
 MIS+0.131
attered+0.130
BRE+0.124
 HIP+0.123
Neg logit contributions
 appre-0.156
��-0.154
enegger-0.138
 obser-0.135
 lest-0.134
Token you
Token ID345
Feature activation+2.63e-02

Pos logit contributions
izoph+0.274
attered+0.259
 MIS+0.256
BRE+0.245
 HIP+0.242
Neg logit contributions
 appre-0.291
��-0.286
enegger-0.255
 obser-0.253
 lest-0.248
Token don
Token ID836
Feature activation+1.15e-01

Pos logit contributions
izoph+0.074
attered+0.068
 MIS+0.067
BRE+0.064
 HIP+0.062
Neg logit contributions
 appre-0.107
��-0.106
enegger-0.096
 obser-0.095
 lest-0.094
Token't
Token ID470
Feature activation+6.48e-02

Pos logit contributions
izoph+0.343
attered+0.317
 MIS+0.315
BRE+0.301
ILA+0.295
Neg logit contributions
 appre-0.454
��-0.446
enegger-0.402
 obser-0.401
 lest-0.393
Token feel
Token ID1254
Feature activation+1.69e-01

Pos logit contributions
izoph+0.201
 MIS+0.187
attered+0.187
BRE+0.178
 HIP+0.177
Neg logit contributions
��-0.243
 appre-0.241
enegger-0.216
 obser-0.213
lain-0.208
Token confident
Token ID6563
Feature activation+3.55e-01

Pos logit contributions
izoph+0.584
attered+0.553
 MIS+0.552
 HIP+0.526
BRE+0.525
Neg logit contributions
��-0.572
 appre-0.563
enegger-0.503
 obser-0.495
 lest-0.479
Token dimensions
Token ID15225
Feature activation-3.69e-01

Pos logit contributions
 appre+0.070
��+0.069
enegger+0.062
 obser+0.060
 roy+0.060
Neg logit contributions
izoph-0.069
attered-0.064
 MIS-0.064
BRE-0.061
ILA-0.060
Token and
Token ID290
Feature activation-8.56e-02

Pos logit contributions
 appre+1.269
��+1.244
enegger+1.108
 obser+1.091
 roy+1.084
Neg logit contributions
izoph-1.278
attered-1.189
 MIS-1.183
BRE-1.135
ILA-1.119
Token possible
Token ID1744
Feature activation-2.01e-01

Pos logit contributions
 appre+0.303
��+0.298
enegger+0.267
 obser+0.261
 roy+0.260
Neg logit contributions
izoph-0.289
attered-0.268
 MIS-0.267
BRE-0.255
ILA-0.251
Token functions
Token ID5499
Feature activation-2.02e-02

Pos logit contributions
 appre+0.674
��+0.663
enegger+0.590
 obser+0.577
 roy+0.574
Neg logit contributions
izoph-0.712
attered-0.662
 MIS-0.661
BRE-0.633
ILA-0.623
Token.
Token ID764
Feature activation-8.16e-02

Pos logit contributions
 appre+0.072
��+0.070
enegger+0.063
 obser+0.062
 roy+0.061
Neg logit contributions
izoph-0.068
attered-0.063
 MIS-0.063
BRE-0.060
ILA-0.059
Token Merrill
Token ID40418
Feature activation+2.56e-02

Pos logit contributions
 appre+0.289
��+0.282
enegger+0.252
 obser+0.250
 roy+0.247
Neg logit contributions
izoph-0.277
 MIS-0.257
attered-0.256
BRE-0.245
ILA-0.241
Token-
Token ID12
Feature activation-1.74e-01

Pos logit contributions
izoph+0.090
 MIS+0.084
attered+0.084
BRE+0.079
ILA+0.079
Neg logit contributions
 appre-0.087
��-0.086
enegger-0.077
 obser-0.075
 lest-0.075
TokenPal
Token ID11531
Feature activation-2.74e-01

Pos logit contributions
 appre+0.691
��+0.680
enegger+0.615
 obser+0.608
 lest+0.604
Neg logit contributions
izoph-0.511
attered-0.469
 MIS-0.467
BRE-0.447
ILA-0.440
Tokenmer
Token ID647
Feature activation-3.48e-02

Pos logit contributions
 appre+0.985
��+0.966
enegger+0.866
 obser+0.854
 lest+0.848
Neg logit contributions
izoph-0.908
attered-0.842
 MIS-0.836
BRE-0.800
ILA-0.788
Token Quarterly
Token ID36082
Feature activation-4.58e-01

Pos logit contributions
 appre+0.122
��+0.119
enegger+0.107
 obser+0.105
 lest+0.104
Neg logit contributions
izoph-0.119
attered-0.111
 MIS-0.111
BRE-0.105
ILA-0.104
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
 appre+1.426
��+1.401
enegger+1.226
 obser+1.208
 roy+1.208
Neg logit contributions
izoph-1.724
attered-1.614
 MIS-1.600
BRE-1.551
ILA-1.530
TokenFar
Token ID21428
Feature activation-2.84e-01

Pos logit contributions
 appre+0.009
��+0.009
enegger+0.008
 obser+0.008
 roy+0.008
Neg logit contributions
izoph-0.007
attered-0.006
 MIS-0.006
BRE-0.006
ILA-0.006
Tokenmer
Token ID647
Feature activation+1.06e-01

Pos logit contributions
 appre+0.984
��+0.965
enegger+0.860
 obser+0.848
 lest+0.842
Neg logit contributions
izoph-0.975
attered-0.907
 MIS-0.901
BRE-0.865
ILA-0.853
Token said
Token ID531
Feature activation-5.10e-02

Pos logit contributions
izoph+0.368
attered+0.346
 MIS+0.344
BRE+0.329
ILA+0.325
Neg logit contributions
 appre-0.362
��-0.357
enegger-0.317
 obser-0.311
 lest-0.307
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
 appre+0.175
��+0.172
enegger+0.153
 obser+0.150
 roy+0.149
Neg logit contributions
izoph-0.177
attered-0.165
 MIS-0.164
BRE-0.158
ILA-0.155
Token only
Token ID691
Feature activation-1.11e-01

Pos logit contributions
 appre+0.563
��+0.555
enegger+0.495
 obser+0.485
 lest+0.482
Neg logit contributions
izoph-0.555
attered-0.517
 MIS-0.516
BRE-0.494
 HIP-0.485
Token time
Token ID640
Feature activation-2.04e-01

Pos logit contributions
 appre+0.381
��+0.377
enegger+0.335
 obser+0.328
 lest+0.326
Neg logit contributions
izoph-0.384
 MIS-0.360
attered-0.359
BRE-0.342
 HIP-0.338
Token he
Token ID339
Feature activation-2.50e-01

Pos logit contributions
 appre+0.692
��+0.678
enegger+0.602
 obser+0.593
 lest+0.589
Neg logit contributions
izoph-0.717
attered-0.668
 MIS-0.664
BRE-0.638
ILA-0.629
Token remembered
Token ID12086
Feature activation-2.43e-01

Pos logit contributions
 appre+0.953
��+0.943
enegger+0.848
 obser+0.837
 lest+0.827
Neg logit contributions
izoph-0.759
attered-0.711
 MIS-0.703
BRE-0.670
ILA-0.656
Token the
Token ID262
Feature activation+4.29e-02

Pos logit contributions
 appre+0.953
��+0.933
enegger+0.842
 obser+0.835
 lest+0.833
Neg logit contributions
izoph-0.732
attered-0.671
 MIS-0.664
BRE-0.634
ILA-0.628
Token trip
Token ID5296
Feature activation+2.89e-01

Pos logit contributions
izoph+0.142
 MIS+0.133
attered+0.132
BRE+0.127
 HIP+0.125
Neg logit contributions
 appre-0.153
��-0.151
enegger-0.135
 obser-0.132
 lest-0.131
Token being
Token ID852
Feature activation+1.61e-01

Pos logit contributions
izoph+0.974
attered+0.922
 MIS+0.918
BRE+0.873
 HIP+0.862
Neg logit contributions
 appre-1.001
��-0.989
enegger-0.874
 obser-0.867
 lest-0.851
Token aspects
Token ID7612
Feature activation-5.94e-02

Pos logit contributions
izoph+0.470
 MIS+0.440
attered+0.436
BRE+0.420
ILA+0.410
Neg logit contributions
 appre-0.459
��-0.453
enegger-0.401
 obser-0.393
 lest-0.392
Token,
Token ID11
Feature activation-1.84e-02

Pos logit contributions
 appre+0.207
��+0.204
enegger+0.182
 obser+0.178
 roy+0.175
Neg logit contributions
izoph-0.203
 MIS-0.189
attered-0.189
BRE-0.182
 HIP-0.177
Token has
Token ID468
Feature activation-2.10e-01

Pos logit contributions
 appre+0.063
��+0.062
enegger+0.055
 obser+0.054
 roy+0.053
Neg logit contributions
izoph-0.064
attered-0.060
 MIS-0.060
BRE-0.057
 HIP-0.056
Token some
Token ID617
Feature activation-1.34e-01

Pos logit contributions
 appre+0.763
��+0.749
enegger+0.672
 obser+0.661
 lest+0.658
Neg logit contributions
izoph-0.685
attered-0.635
 MIS-0.632
BRE-0.605
ILA-0.596
Token nice
Token ID3621
Feature activation-2.79e-01

Pos logit contributions
 appre+0.457
��+0.453
enegger+0.404
 obser+0.394
 lest+0.391
Neg logit contributions
izoph-0.462
 MIS-0.432
attered-0.430
BRE-0.413
ILA-0.406
Token he
Token ID339
Feature activation-2.25e-01

Pos logit contributions
 appre+0.952
��+0.935
enegger+0.834
 obser+0.816
 lest+0.813
Neg logit contributions
izoph-0.974
attered-0.906
 MIS-0.904
BRE-0.867
ILA-0.854
Tokenft
Token ID701
Feature activation-1.57e-01

Pos logit contributions
 appre+0.548
��+0.534
enegger+0.449
 obser+0.441
 roy+0.435
Neg logit contributions
izoph-1.006
attered-0.954
 MIS-0.948
BRE-0.922
ILA-0.910
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
 appre+0.510
��+0.502
enegger+0.441
 obser+0.435
 roy+0.428
Neg logit contributions
izoph-0.567
attered-0.536
 MIS-0.533
BRE-0.513
ILA-0.503
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
 appre+0.619
��+0.611
enegger+0.535
 obser+0.523
 roy+0.517
Neg logit contributions
izoph-0.747
attered-0.700
 MIS-0.699
BRE-0.674
ILA-0.665
Token feels
Token ID5300
Feature activation-2.74e-01

Pos logit contributions
 appre+0.392
��+0.388
enegger+0.346
 obser+0.340
 roy+0.335
Neg logit contributions
izoph-0.373
 MIS-0.347
attered-0.347
BRE-0.332
ILA-0.327
Token comfortable
Token ID6792
Feature activation-7.31e-02

Pos logit contributions
 appre+0.944
��+0.934
enegger+0.833
 obser+0.818
 lest+0.807
Neg logit contributions
izoph-0.936
attered-0.876
 MIS-0.873
BRE-0.837
ILA-0.825
Token demand
Token ID3512
Feature activation+2.39e-02

Pos logit contributions
��+0.095
 appre+0.095
enegger+0.084
 obser+0.082
 lest+0.081
Neg logit contributions
izoph-0.099
attered-0.093
 MIS-0.093
BRE-0.089
 HIP-0.087
Token that
Token ID326
Feature activation+1.30e-02

Pos logit contributions
izoph+0.079
attered+0.073
 MIS+0.073
BRE+0.070
ILA+0.069
Neg logit contributions
 appre-0.086
��-0.085
enegger-0.076
 obser-0.075
 lest-0.074
Token he
Token ID339
Feature activation+3.63e-02

Pos logit contributions
izoph+0.040
attered+0.037
 MIS+0.036
BRE+0.035
ILA+0.034
Neg logit contributions
 appre-0.050
��-0.049
enegger-0.044
 obser-0.044
 lest-0.044
Token commit
Token ID4589
Feature activation+1.71e-01

Pos logit contributions
izoph+0.129
 MIS+0.124
attered+0.124
BRE+0.118
 HIP+0.117
Neg logit contributions
��-0.117
 appre-0.116
enegger-0.103
 obser-0.101
lain-0.098
Token se
Token ID384
Feature activation-6.25e-02

Pos logit contributions
izoph+0.562
 MIS+0.528
attered+0.522
BRE+0.497
ILA+0.492
Neg logit contributions
 appre-0.613
��-0.609
enegger-0.547
 obser-0.537
lain-0.524
Tokenpp
Token ID381
Feature activation-2.14e-01

Pos logit contributions
 appre+0.203
��+0.200
enegger+0.176
 obser+0.174
 roy+0.171
Neg logit contributions
izoph-0.229
attered-0.214
 MIS-0.212
BRE-0.204
ILA-0.202
Tokenuku
Token ID33263
Feature activation+4.87e-02

Pos logit contributions
 appre+0.560
��+0.550
 obser+0.465
enegger+0.464
 roy+0.455
Neg logit contributions
izoph-0.915
attered-0.867
 MIS-0.858
BRE-0.826
ILA-0.823
Token to
Token ID284
Feature activation+1.69e-02

Pos logit contributions
izoph+0.167
attered+0.157
 MIS+0.156
BRE+0.149
ILA+0.148
Neg logit contributions
 appre-0.167
��-0.166
enegger-0.146
 obser-0.145
 lest-0.140
Token regain
Token ID21509
Feature activation-2.74e-02

Pos logit contributions
izoph+0.058
attered+0.054
 MIS+0.054
BRE+0.052
ILA+0.051
Neg logit contributions
 appre-0.058
��-0.057
enegger-0.051
 obser-0.050
 roy-0.049
Token the
Token ID262
Feature activation+4.03e-02

Pos logit contributions
 appre+0.102
��+0.100
enegger+0.090
 obser+0.089
 lest+0.088
Neg logit contributions
izoph-0.087
attered-0.080
 MIS-0.080
BRE-0.076
ILA-0.075
Token honor
Token ID7522
Feature activation-2.10e-01

Pos logit contributions
izoph+0.143
attered+0.133
 MIS+0.133
BRE+0.127
ILA+0.126
Neg logit contributions
 appre-0.135
��-0.133
enegger-0.118
 obser-0.116
 lest-0.115
Token as
Token ID355
Feature activation-1.23e-01

Pos logit contributions
 appre+0.327
��+0.320
enegger+0.289
 roy+0.285
 lest+0.285
Neg logit contributions
izoph-0.268
attered-0.246
 MIS-0.244
BRE-0.233
ILA-0.230
Token Google
Token ID3012
Feature activation-2.17e-01

Pos logit contributions
 appre+0.396
��+0.391
enegger+0.346
 obser+0.338
 lest+0.336
Neg logit contributions
izoph-0.451
 MIS-0.424
attered-0.423
BRE-0.405
 HIP-0.400
Token Doc
Token ID14432
Feature activation-2.73e-02

Pos logit contributions
 appre+0.772
��+0.758
enegger+0.678
 obser+0.669
 roy+0.664
Neg logit contributions
izoph-0.723
attered-0.671
 MIS-0.667
BRE-0.638
ILA-0.631
Tokens
Token ID82
Feature activation-1.78e-01

Pos logit contributions
 appre+0.105
��+0.105
enegger+0.096
 obser+0.094
 lest+0.092
Neg logit contributions
izoph-0.082
attered-0.076
 MIS-0.075
BRE-0.072
ILA-0.071
Token and
Token ID290
Feature activation-2.70e-01

Pos logit contributions
 appre+0.547
��+0.537
enegger+0.472
 obser+0.463
 roy+0.459
Neg logit contributions
izoph-0.676
attered-0.636
 MIS-0.633
BRE-0.608
ILA-0.603
Token Google
Token ID3012
Feature activation-2.25e-01

Pos logit contributions
 appre+0.875
��+0.862
enegger+0.771
 obser+0.750
 roy+0.740
Neg logit contributions
izoph-0.979
 MIS-0.921
attered-0.919
BRE-0.875
 HIP-0.871
Token Sites
Token ID37034
Feature activation+5.31e-02

Pos logit contributions
 appre+0.819
��+0.805
enegger+0.726
 obser+0.713
 roy+0.707
Neg logit contributions
izoph-0.735
attered-0.681
 MIS-0.678
BRE-0.645
ILA-0.645
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
izoph+0.189
attered+0.178
 MIS+0.177
BRE+0.169
Eye+0.167
Neg logit contributions
 appre-0.175
��-0.172
enegger-0.152
 obser-0.150
 lest-0.149
Token would
Token ID561
Feature activation-2.15e-02

Pos logit contributions
 appre+0.681
��+0.669
enegger+0.601
 obser+0.593
 lest+0.589
Neg logit contributions
izoph-0.580
attered-0.537
 MIS-0.534
BRE-0.509
ILA-0.501
Token not
Token ID407
Feature activation+7.67e-02

Pos logit contributions
 appre+0.075
��+0.075
enegger+0.067
 obser+0.066
 lest+0.064
Neg logit contributions
izoph-0.073
attered-0.068
 MIS-0.068
BRE-0.064
 HIP-0.063
Token work
Token ID670
Feature activation-7.53e-02

Pos logit contributions
izoph+0.293
attered+0.275
 MIS+0.274
BRE+0.263
 HIP+0.261
Neg logit contributions
 appre-0.233
��-0.232
enegger-0.203
 obser-0.200
 lest-0.195
Token
Token ID227
Feature activation-4.63e-02

Pos logit contributions
izoph+0.281
attered+0.261
 MIS+0.261
BRE+0.251
ILA+0.248
Neg logit contributions
 appre-0.282
��-0.271
enegger-0.250
 obser-0.243
 lest-0.242
Token
Token ID12466
Feature activation-1.79e-01

Pos logit contributions
 appre+0.162
��+0.161
enegger+0.144
 obser+0.140
 lest+0.139
Neg logit contributions
izoph-0.157
 MIS-0.146
attered-0.145
BRE-0.140
ILA-0.137
Token
Token ID122
Feature activation-1.55e-01

Pos logit contributions
 appre+0.720
��+0.707
enegger+0.643
 obser+0.634
 lest+0.631
Neg logit contributions
izoph-0.518
attered-0.474
 MIS-0.471
BRE-0.448
ILA-0.441
Token
Token ID12466
Feature activation-6.02e-01

Pos logit contributions
 appre+0.622
��+0.612
enegger+0.556
 obser+0.548
 lest+0.544
Neg logit contributions
izoph-0.451
attered-0.412
 MIS-0.411
BRE-0.391
ILA-0.385
Token
Token ID123
Feature activation-1.16e-01

Pos logit contributions
 appre+2.401
��+2.380
enegger+2.125
 obser+2.114
 lest+2.095
Neg logit contributions
izoph-1.749
attered-1.601
 MIS-1.592
BRE-1.511
ILA-1.489
Tokenо
Token ID15166
Feature activation-2.63e-01

Pos logit contributions
 appre+0.433
��+0.425
enegger+0.383
 obser+0.377
 lest+0.375
Neg logit contributions
izoph-0.369
attered-0.341
 MIS-0.339
BRE-0.324
ILA-0.319
Tokenт
Token ID20375
Feature activation+2.26e-01

Pos logit contributions
 appre+0.980
��+0.964
enegger+0.868
 obser+0.855
 lest+0.849
Neg logit contributions
izoph-0.838
attered-0.773
 MIS-0.769
BRE-0.736
ILA-0.724
Tokenе
Token ID16843
Feature activation+1.74e-01

Pos logit contributions
izoph+0.664
 MIS+0.610
attered+0.608
BRE+0.585
ILA+0.572
Neg logit contributions
 appre-0.895
��-0.884
enegger-0.801
 obser-0.791
 lest-0.780
Tokenр
Token ID21169
Feature activation-1.18e-01

Pos logit contributions
izoph+0.558
 MIS+0.517
attered+0.514
BRE+0.499
ILA+0.489
Neg logit contributions
 appre-0.637
��-0.628
enegger-0.567
 obser-0.557
 lest-0.552
Tokenе
Token ID16843
Feature activation+1.20e-01

Pos logit contributions
 appre+0.471
��+0.463
enegger+0.420
 obser+0.414
 lest+0.412
Neg logit contributions
izoph-0.345
attered-0.316
 MIS-0.315
BRE-0.300
ILA-0.295
Token (
Token ID357
Feature activation+4.89e-02

Pos logit contributions
izoph+0.355
 MIS+0.326
attered+0.326
BRE+0.312
ILA+0.305
Neg logit contributions
 appre-0.469
��-0.461
enegger-0.420
 obser-0.414
 lest-0.410
Token
Token ID198
Feature activation-2.82e-01

Pos logit contributions
 appre+0.842
��+0.824
enegger+0.731
 obser+0.721
 roy+0.715
Neg logit contributions
izoph-0.902
attered-0.841
 MIS-0.838
BRE-0.806
ILA-0.795
Token
Token ID198
Feature activation-3.43e-01

Pos logit contributions
 appre+0.863
��+0.839
enegger+0.750
 roy+0.744
 lest+0.744
Neg logit contributions
izoph-1.077
attered-0.995
senal-0.991
 MIS-0.984
BRE-0.945
TokenKo
Token ID48735
Feature activation-4.03e-01

Pos logit contributions
 appre+1.331
��+1.307
enegger+1.180
 obser+1.165
 lest+1.159
Neg logit contributions
izoph-1.041
attered-0.958
 MIS-0.952
BRE-0.908
ILA-0.894
TokenDo
Token ID5211
Feature activation-4.59e-01

Pos logit contributions
 appre+1.528
��+1.499
enegger+1.351
 obser+1.333
 lest+1.326
Neg logit contributions
izoph-1.255
attered-1.157
 MIS-1.150
BRE-1.098
ILA-1.081
Token Profile
Token ID13118
Feature activation-7.14e-01

Pos logit contributions
 appre+1.676
��+1.644
enegger+1.476
 obser+1.454
 lest+1.446
Neg logit contributions
izoph-1.495
attered-1.383
 MIS-1.375
BRE-1.315
ILA-1.296
Token Joined
Token ID16709
Feature activation-9.15e-01

Pos logit contributions
 appre+2.211
��+2.155
enegger+1.882
 obser+1.861
 lest+1.855
Neg logit contributions
izoph-2.721
attered-2.548
 MIS-2.524
BRE-2.435
ILA-2.414
Token December
Token ID3426
Feature activation-2.27e-01

Pos logit contributions
 appre+2.433
��+2.324
 lest+2.021
 roy+2.017
 thereof+1.992
Neg logit contributions
izoph-3.858
attered-3.626
 MIS-3.522
senal-3.514
BRE-3.459
Token 2010
Token ID3050
Feature activation-9.83e-02

Pos logit contributions
 appre+0.874
��+0.853
enegger+0.779
 lest+0.771
 roy+0.771
Neg logit contributions
izoph-0.692
attered-0.633
 MIS-0.616
BRE-0.596
senal-0.593
Token United
Token ID1578
Feature activation-2.20e-01

Pos logit contributions
 appre+0.366
��+0.359
enegger+0.323
 obser+0.318
 roy+0.317
Neg logit contributions
izoph-0.313
attered-0.289
 MIS-0.287
BRE-0.275
ILA-0.270
Token States
Token ID1829
Feature activation-2.07e-02

Pos logit contributions
 appre+0.549
��+0.534
enegger+0.451
 obser+0.444
 lest+0.438
Neg logit contributions
izoph-0.969
attered-0.916
 MIS-0.914
BRE-0.884
ILA-0.875
Token 6
Token ID718
Feature activation+1.42e-01

Pos logit contributions
 appre+0.083
��+0.081
enegger+0.074
 obser+0.073
 lest+0.072
Neg logit contributions
izoph-0.060
attered-0.055
 MIS-0.055
BRE-0.052
ILA-0.051
Token55
Token ID2816
Feature activation-1.17e-01

Pos logit contributions
 appre+2.369
��+2.308
enegger+2.021
 lest+2.013
 roy+1.979
Neg logit contributions
izoph-3.144
attered-2.917
 MIS-2.879
senal-2.826
BRE-2.731
Token Recommend
Token ID30361
Feature activation-3.08e-01

Pos logit contributions
 appre+0.420
��+0.410
enegger+0.367
 lest+0.363
 obser+0.361
Neg logit contributions
izoph-0.386
attered-0.357
 MIS-0.353
BRE-0.339
ILA-0.333
Token
Token ID198
Feature activation-8.59e-01

Pos logit contributions
 appre+0.960
��+0.926
enegger+0.826
 lest+0.820
 roy+0.818
Neg logit contributions
izoph-1.168
attered-1.078
 MIS-1.067
senal-1.059
BRE-1.030
Token
Token ID198
Feature activation-8.67e-01

Pos logit contributions
 appre+2.316
��+2.208
 thereof+2.064
 lest+2.042
 roy+2.036
Neg logit contributions
izoph-3.533
senal-3.473
attered-3.220
 MIS-3.163
ILA-3.054
TokenRecommend
Token ID41248
Feature activation-3.25e-01

Pos logit contributions
 appre+3.454
��+3.334
 thereof+3.052
 lest+3.043
lain+3.010
Neg logit contributions
izoph-2.558
senal-2.222
 MIS-2.211
attered-2.199
 HIP-2.037
Token Share
Token ID8734
Feature activation-1.06e+00

Pos logit contributions
 appre+1.291
��+1.258
enegger+1.132
 lest+1.127
 roy+1.123
Neg logit contributions
izoph-0.964
attered-0.873
 MIS-0.864
BRE-0.828
ILA-0.814
Token this
Token ID428
Feature activation-3.56e-01

Pos logit contributions
 appre+3.506
��+3.161
 lest+3.090
 roy+3.077
 thereof+3.036
Neg logit contributions
izoph-3.893
senal-3.707
attered-3.482
 MIS-3.386
ILA-3.314
Token comment
Token ID2912
Feature activation-6.34e-01

Pos logit contributions
 appre+1.046
��+0.991
 lest+0.888
 roy+0.879
enegger+0.870
Neg logit contributions
izoph-1.425
attered-1.330
senal-1.317
 MIS-1.305
BRE-1.271
Token on
Token ID319
Feature activation-2.38e-01

Pos logit contributions
 appre+1.744
��+1.672
 roy+1.485
 lest+1.482
 thereof+1.473
Neg logit contributions
izoph-2.629
senal-2.446
attered-2.435
 MIS-2.406
BRE-2.331
Token Facebook
Token ID3203
Feature activation-3.12e-01

Pos logit contributions
 appre+0.634
��+0.616
enegger+0.530
 lest+0.517
 roy+0.516
Neg logit contributions
izoph-1.010
attered-0.950
 MIS-0.945
BRE-0.915
ILA-0.906
Token Share
Token ID8734
Feature activation-6.06e-01

Pos logit contributions
 appre+1.182
��+1.145
 lest+1.045
 roy+1.030
 thereof+1.025
Neg logit contributions
izoph-0.979
attered-0.891
 MIS-0.863
senal-0.862
BRE-0.835
Token:
Token ID25
Feature activation-3.63e-01

Pos logit contributions
izoph+0.171
attered+0.160
 MIS+0.159
BRE+0.153
ILA+0.151
Neg logit contributions
 appre-0.162
��-0.159
enegger-0.140
 obser-0.140
 lest-0.137
Token47
Token ID2857
Feature activation-1.56e-01

Pos logit contributions
 appre+1.067
��+1.047
enegger+0.918
 lest+0.897
 roy+0.895
Neg logit contributions
izoph-1.440
attered-1.338
 MIS-1.338
senal-1.288
BRE-1.281
Token]
Token ID60
Feature activation+2.14e-01

Pos logit contributions
 appre+0.412
��+0.401
enegger+0.346
 lest+0.338
 roy+0.337
Neg logit contributions
izoph-0.663
attered-0.623
 MIS-0.619
BRE-0.599
senal-0.595
Token <
Token ID1279
Feature activation+6.06e-02

Pos logit contributions
izoph+0.720
 MIS+0.676
attered+0.671
BRE+0.648
ILA+0.631
Neg logit contributions
 appre-0.751
��-0.739
enegger-0.653
 obser-0.648
 lest-0.641
Tokenmet
Token ID4164
Feature activation-2.99e-01

Pos logit contributions
izoph+0.193
attered+0.179
 MIS+0.177
BRE+0.172
ILA+0.168
Neg logit contributions
 appre-0.226
��-0.223
enegger-0.199
 obser-0.197
 lest-0.195
Tokenros
Token ID4951
Feature activation-7.65e-01

Pos logit contributions
 appre+1.198
��+1.177
enegger+1.066
 obser+1.054
 lest+1.048
Neg logit contributions
izoph-0.868
attered-0.796
 MIS-0.790
BRE-0.752
ILA-0.739
Tokenxual
Token ID5541
Feature activation-5.31e-01

Pos logit contributions
 appre+2.645
��+2.589
enegger+2.377
lain+2.316
 roy+2.307
Neg logit contributions
izoph-2.566
 MIS-2.392
senal-2.365
attered-2.312
BRE-2.259
Tokenth
Token ID400
Feature activation-4.93e-01

Pos logit contributions
 appre+2.049
��+2.010
 lest+1.846
 thereof+1.843
enegger+1.840
Neg logit contributions
izoph-1.552
 MIS-1.427
senal-1.406
attered-1.395
ILA-1.358
Tokenug
Token ID1018
Feature activation-5.65e-01

Pos logit contributions
 appre+1.872
��+1.823
enegger+1.652
 roy+1.649
 lest+1.640
Neg logit contributions
izoph-1.489
 MIS-1.391
attered-1.363
senal-1.353
BRE-1.319
Token>
Token ID29
Feature activation-2.87e-01

Pos logit contributions
 appre+1.494
��+1.452
enegger+1.266
 roy+1.234
 lest+1.231
Neg logit contributions
izoph-2.392
attered-2.232
 MIS-2.229
senal-2.186
BRE-2.147
Token le
Token ID443
Feature activation-1.75e-01

Pos logit contributions
 appre+0.904
��+0.890
enegger+0.783
 obser+0.769
 lest+0.760
Neg logit contributions
izoph-1.072
 MIS-1.006
attered-1.003
BRE-0.965
ILA-0.950
Token in
Token ID287
Feature activation-7.94e-02

Pos logit contributions
 appre+0.270
��+0.267
enegger+0.239
 obser+0.236
 lest+0.230
Neg logit contributions
izoph-0.257
attered-0.242
 MIS-0.242
BRE-0.229
 HIP-0.226
Token The
Token ID383
Feature activation-9.62e-02

Pos logit contributions
 appre+0.288
��+0.284
enegger+0.255
 obser+0.252
 lest+0.249
Neg logit contributions
izoph-0.259
attered-0.241
 MIS-0.241
BRE-0.228
ILA-0.226
Token USA
Token ID4916
Feature activation-1.61e-01

Pos logit contributions
 appre+0.366
��+0.359
enegger+0.324
 obser+0.321
 lest+0.317
Neg logit contributions
izoph-0.299
attered-0.277
 MIS-0.277
BRE-0.263
ILA-0.259
Token and
Token ID290
Feature activation-7.46e-02

Pos logit contributions
 appre+0.546
��+0.538
enegger+0.477
 obser+0.474
 roy+0.461
Neg logit contributions
izoph-0.561
attered-0.531
 MIS-0.526
BRE-0.509
ILA-0.503
Token will
Token ID481
Feature activation-2.04e-01

Pos logit contributions
 appre+0.254
��+0.250
enegger+0.223
 obser+0.220
 lest+0.217
Neg logit contributions
izoph-0.259
 MIS-0.243
attered-0.242
BRE-0.232
ILA-0.230
Token ship
Token ID4074
Feature activation-8.13e-01

Pos logit contributions
 appre+0.644
��+0.634
enegger+0.557
 obser+0.548
 lest+0.543
Neg logit contributions
izoph-0.760
attered-0.712
 MIS-0.710
BRE-0.683
ILA-0.674
Token to
Token ID284
Feature activation-2.24e-01

Pos logit contributions
 appre+2.598
��+2.510
 roy+2.205
enegger+2.194
 lest+2.191
Neg logit contributions
izoph-3.042
attered-2.800
 MIS-2.767
BRE-2.692
senal-2.652
Token you
Token ID345
Feature activation-2.16e-01

Pos logit contributions
 appre+0.799
��+0.783
enegger+0.700
 obser+0.690
 lest+0.686
Neg logit contributions
izoph-0.751
attered-0.696
 MIS-0.692
BRE-0.664
ILA-0.654
Token direct
Token ID1277
Feature activation-4.22e-02

Pos logit contributions
 appre+0.738
��+0.726
enegger+0.647
 obser+0.636
 lest+0.629
Neg logit contributions
izoph-0.752
attered-0.703
 MIS-0.700
BRE-0.671
ILA-0.661
Token from
Token ID422
Feature activation-1.51e-01

Pos logit contributions
 appre+0.145
��+0.143
enegger+0.127
 obser+0.126
 roy+0.123
Neg logit contributions
izoph-0.145
attered-0.136
 MIS-0.136
BRE-0.129
ILA-0.128
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
 appre+0.553
��+0.544
enegger+0.488
 obser+0.481
 roy+0.477
Neg logit contributions
izoph-0.490
attered-0.455
 MIS-0.454
BRE-0.433
ILA-0.427
Token19
Token ID1129
Feature activation-3.50e-01

Pos logit contributions
 appre+2.240
��+2.180
enegger+1.926
 lest+1.910
 roy+1.890
Neg logit contributions
izoph-2.754
attered-2.559
 MIS-2.527
senal-2.462
BRE-2.399
Token Recommend
Token ID30361
Feature activation-3.51e-01

Pos logit contributions
 appre+1.255
��+1.217
enegger+1.090
 lest+1.081
 roy+1.076
Neg logit contributions
izoph-1.170
attered-1.078
 MIS-1.064
BRE-1.022
senal-1.013
Token
Token ID198
Feature activation-7.78e-01

Pos logit contributions
 appre+1.085
��+1.046
enegger+0.934
 lest+0.927
 roy+0.926
Neg logit contributions
izoph-1.336
attered-1.233
 MIS-1.219
senal-1.218
BRE-1.178
Token
Token ID198
Feature activation-7.83e-01

Pos logit contributions
 appre+2.159
��+2.063
 thereof+1.902
 lest+1.890
 roy+1.887
Neg logit contributions
izoph-3.156
senal-3.067
attered-2.881
 MIS-2.837
ILA-2.732
TokenRecommend
Token ID41248
Feature activation-3.54e-01

Pos logit contributions
 appre+3.125
��+3.010
 thereof+2.779
 lest+2.769
lain+2.731
Neg logit contributions
izoph-2.285
senal-2.028
attered-1.979
 MIS-1.975
 HIP-1.824
Token Share
Token ID8734
Feature activation-1.02e+00

Pos logit contributions
 appre+1.383
��+1.341
 lest+1.205
enegger+1.203
 roy+1.199
Neg logit contributions
izoph-1.078
attered-0.970
 MIS-0.958
BRE-0.922
senal-0.917
Token this
Token ID428
Feature activation-3.15e-01

Pos logit contributions
 appre+3.342
��+3.069
 lest+3.000
 roy+2.960
 thereof+2.951
Neg logit contributions
izoph-3.688
senal-3.587
attered-3.311
 MIS-3.204
ILA-3.138
Token comment
Token ID2912
Feature activation-5.56e-01

Pos logit contributions
 appre+0.943
��+0.897
 lest+0.802
 roy+0.795
enegger+0.790
Neg logit contributions
izoph-1.244
attered-1.159
senal-1.143
 MIS-1.140
BRE-1.109
Token on
Token ID319
Feature activation-1.88e-01

Pos logit contributions
 appre+1.568
��+1.510
 roy+1.332
 lest+1.330
enegger+1.329
Neg logit contributions
izoph-2.270
attered-2.107
senal-2.090
 MIS-2.083
BRE-2.016
Token Facebook
Token ID3203
Feature activation-1.57e-01

Pos logit contributions
 appre+0.503
��+0.489
enegger+0.421
 obser+0.410
 lest+0.410
Neg logit contributions
izoph-0.793
attered-0.747
 MIS-0.743
BRE-0.719
ILA-0.712
Token Share
Token ID8734
Feature activation-5.38e-01

Pos logit contributions
 appre+0.612
��+0.598
 lest+0.540
enegger+0.539
 roy+0.536
Neg logit contributions
izoph-0.475
attered-0.434
 MIS-0.424
senal-0.412
BRE-0.407