TOP ACTIVATIONS
MAX = 1.159

better understand the selection process and eligibility for the award ,
exp ( -( t - t ) / u
exp ( -( t - t ) / u
B earing : Size C , Y YR DS cent ering
m 2 k [ 10 : 48 ] < qu ij
: The Saga [/ b ][/ color ] - 100 %
" 2016 . ?? .? ?" ] [ Event Country "
2 k lol [ 10 : 47 ] < m ik
m 2 k [ 10 : 48 ] < min ion
y 2 k [ 10 : 47 ] < chain x
w > save [ 10 : 47 ] < f gt
m 4 k [ 10 : 48 ] < mom per
m 2 k [ 10 : 47 ] < j th
m 2 k [ 10 : 47 ] < mom per
string " vi : ", " vim :" or " ex
m 2 k [ 10 : 48 ] < m ew
lol j ason [ 10 : 47 ] < fen ren
m 2 k [ 10 : 47 ] < k asse
m 2 k [ 10 : 47 ] < mom per
m 2 k [ 10 : 47 ] < l ucky

MIN ACTIVATIONS
MIN = -1.062

am and Som erville 's Union Square . The
ы с е
AC LU _ OR ) May 29 , 2017
е л ь о е н н о
Bo hem ian sp arrow slaying bow '? "
а л и о е в
fun . X avier Gu il bert : Well
и е а
of weakness if not madness "[ 41 ] and Peterson conc
ы т
с о о� и е
AC LU _ OR ) May 29 , 2017
о р я
land State Park , North East on . The
2 begins with : The Citizens of each State
Google Scholar Ol we us , D
с т в о ы я в л е
Irvine , California . Hide Caption 10 of 15 Photos :
Clara County v . Southern Pacific Railroad Company California
| Medline | ISI P el leg r ini

INTERVAL 0.604 - 1.159
CONTAINS 0.127%

intent on rolling back all the gains of the Progressive era
click on the Settings Icon in the top left corner .
versions of the five original classic sports Tennis , Bowling
m 2 k [ 10 : 47 ] < kw ij
, Y YR DS cent ering The Garg ant

INTERVAL 0.048 - 0.604
CONTAINS 38.414%

, will kill half of all long - term users ,"
state fund on top of other recommendations , although she said
PV vs PV H VM , but I found that workload
cause . Powerful players like Michelle Obama and RE I have
_ 00 64 E , DP 02 _ 00 65 E

INTERVAL -0.507 - 0.048
CONTAINS 61.142%

were already running pretty fast on these instances . This was
s a sample . The opening movie depicts Hats une
WC was planning to announce the addition of Boise State after
: is there any other difference between the two
@ index State < | { 11 , 7 } ->

INTERVAL -1.062 - -0.507
CONTAINS 0.317%

thing to me but a foul and pest il
, but they are paradox ically one of the most inh
using that cannon against Titans Tower . But Glen suggested pulling
) Story continues There were a thousand
е я в л я е т с
Token better
Token ID1365
Feature activation+2.46e-01

Pos logit contributions
 Pengu+1.893
ーテ+1.881
hene+1.749
sten+1.707
ゴン+1.667
Neg logit contributions
 AD-1.655
 Lak-1.640
ellar-1.529
 ACT-1.515
 Cent-1.502
Token understand
Token ID1833
Feature activation+9.66e-02

Pos logit contributions
 AD+1.409
 Lak+1.357
 ACT+1.253
ellar+1.250
 Cent+1.242
Neg logit contributions
ーテ-2.394
 Pengu-2.391
hene-2.224
ゴン-2.176
sten-2.170
Token the
Token ID262
Feature activation+4.60e-01

Pos logit contributions
 AD+0.740
 Lak+0.717
ellar+0.683
 ACT+0.678
 Cent+0.673
Neg logit contributions
 Pengu-0.744
ーテ-0.741
hene-0.678
sten-0.659
ゴン-0.656
Token selection
Token ID6356
Feature activation+2.35e-01

Pos logit contributions
 AD+3.408
 Lak+3.280
 ACT+3.117
 Cent+3.093
ellar+3.041
Neg logit contributions
ーテ-3.825
 Pengu-3.750
hene-3.427
ゴン-3.421
sten-3.326
Token process
Token ID1429
Feature activation+1.40e-01

Pos logit contributions
 AD+1.434
 Lak+1.385
 ACT+1.288
ellar+1.286
 Cent+1.277
Neg logit contributions
ーテ-2.198
 Pengu-2.193
hene-2.034
ゴン-1.993
sten-1.983
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 AD+1.048
 Lak+1.020
ellar+0.963
 ACT+0.962
 Cent+0.953
Neg logit contributions
ーテ-1.101
 Pengu-1.096
hene-1.011
sten-0.979
ゴン-0.977
Token eligibility
Token ID20069
Feature activation+5.64e-01

Pos logit contributions
 AD+6.262
 Lak+5.584
 ACT+5.425
 Cent+5.157
ellar+5.065
Neg logit contributions
ーテ-11.928
 Pengu-11.654
ゴン-11.077
hene-10.749
 tradem-10.544
Token for
Token ID329
Feature activation-1.66e-02

Pos logit contributions
 AD+3.801
 Lak+3.576
 ACT+3.412
 Cent+3.386
ellar+3.286
Neg logit contributions
ーテ-5.125
 Pengu-5.111
ゴン-4.665
hene-4.604
sten-4.459
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
 Pengu+0.127
ーテ+0.126
hene+0.115
sten+0.113
ゴン+0.112
Neg logit contributions
 AD-0.127
 Lak-0.124
ellar-0.118
 ACT-0.117
 Cent-0.116
Token award
Token ID5764
Feature activation+4.57e-01

Pos logit contributions
 Pengu+0.796
ーテ+0.785
hene+0.715
sten+0.698
ゴン+0.695
Neg logit contributions
 AD-0.854
 Lak-0.830
ellar-0.796
 ACT-0.786
 Cent-0.781
Token,
Token ID11
Feature activation+4.59e-01

Pos logit contributions
 AD+3.020
 Lak+2.857
 ACT+2.700
 Cent+2.664
ellar+2.623
Neg logit contributions
ーテ-4.177
 Pengu-4.152
ゴン-3.776
hene-3.763
sten-3.658
Token exp
Token ID1033
Feature activation+1.73e-01

Pos logit contributions
ーテ+0.106
 Pengu+0.105
hene+0.096
ゴン+0.094
sten+0.094
Neg logit contributions
 AD-0.094
 Lak-0.091
ellar-0.087
 ACT-0.086
 Cent-0.085
Token(
Token ID7
Feature activation-1.09e-01

Pos logit contributions
 AD+1.427
 Lak+1.376
 ACT+1.316
 Cent+1.310
ellar+1.296
Neg logit contributions
 Pengu-1.268
ーテ-1.262
hene-1.137
ゴン-1.110
sten-1.089
Token-(
Token ID30420
Feature activation+7.55e-01

Pos logit contributions
ーテ+0.651
 Pengu+0.643
hene+0.570
ゴン+0.557
sten+0.550
Neg logit contributions
 AD-1.038
 Lak-1.014
ellar-0.971
 ACT-0.971
 Cent-0.963
Tokent
Token ID83
Feature activation+7.46e-01

Pos logit contributions
 AD+3.998
 Lak+3.729
 Cent+3.587
 ACT+3.531
ellar+3.443
Neg logit contributions
 Pengu-7.487
ーテ-7.412
hene-6.854
ゴン-6.674
 Disorder-6.458
Token-
Token ID12
Feature activation+8.38e-01

Pos logit contributions
 AD+3.755
 Lak+3.573
 Cent+3.444
 ACT+3.361
ellar+3.196
Neg logit contributions
ーテ-7.499
 Pengu-7.445
hene-6.803
ゴン-6.799
sten-6.551
Tokent
Token ID83
Feature activation+1.15e+00

Pos logit contributions
 AD+4.510
 Lak+4.234
 ACT+4.062
 Cent+4.044
ellar+3.887
Neg logit contributions
ーテ-8.506
 Pengu-8.421
ゴン-7.712
hene-7.517
 tradem-7.513
Token
Token ID447
Feature activation+3.96e-01

Pos logit contributions
 AD+4.663
 Lak+4.409
 Cent+4.226
 ACT+4.121
ellar+3.747
Neg logit contributions
ーテ-12.339
 Pengu-12.207
ゴン-11.445
hene-11.202
 tradem-10.836
Token
Token ID247
Feature activation+6.66e-01

Pos logit contributions
 AD+2.634
 Lak+2.532
 ACT+2.375
ellar+2.361
 Cent+2.341
Neg logit contributions
ーテ-3.526
 Pengu-3.517
ゴン-3.194
hene-3.187
sten-3.079
Token)
Token ID8
Feature activation+3.23e-01

Pos logit contributions
 AD+4.051
 Lak+3.840
 ACT+3.642
 Cent+3.597
ellar+3.464
Neg logit contributions
ーテ-6.369
 Pengu-6.359
hene-5.773
ゴン-5.765
 tradem-5.650
Token /
Token ID1220
Feature activation-3.57e-02

Pos logit contributions
 AD+1.819
 Lak+1.717
 ACT+1.618
 Cent+1.610
ellar+1.554
Neg logit contributions
 Pengu-3.208
ーテ-3.192
hene-2.966
ゴン-2.932
sten-2.890
Tokenu
Token ID84
Feature activation+1.53e-01

Pos logit contributions
ーテ+0.288
 Pengu+0.288
hene+0.267
sten+0.264
ゴン+0.260
Neg logit contributions
 AD-0.257
 Lak-0.253
 ACT-0.236
ellar-0.236
 Cent-0.234
Token exp
Token ID1033
Feature activation+2.22e-01

Pos logit contributions
 AD+0.746
 Lak+0.724
ellar+0.683
 ACT+0.682
 Cent+0.676
Neg logit contributions
ーテ-0.839
 Pengu-0.835
hene-0.762
ゴン-0.749
sten-0.742
Token(
Token ID7
Feature activation-4.48e-02

Pos logit contributions
 AD+1.835
 Lak+1.769
 ACT+1.691
 Cent+1.682
ellar+1.663
Neg logit contributions
 Pengu-1.628
ーテ-1.621
hene-1.458
ゴン-1.425
sten-1.399
Token-(
Token ID30420
Feature activation+5.82e-01

Pos logit contributions
ーテ+0.295
 Pengu+0.292
hene+0.262
ゴン+0.256
sten+0.253
Neg logit contributions
 AD-0.399
 Lak-0.389
 ACT-0.371
ellar-0.370
 Cent-0.368
Tokent
Token ID83
Feature activation+6.69e-01

Pos logit contributions
 AD+3.393
 Lak+3.208
 Cent+3.046
 ACT+3.005
ellar+2.956
Neg logit contributions
 Pengu-5.569
ーテ-5.496
hene-5.099
ゴン-4.959
sten-4.792
Token-
Token ID12
Feature activation+7.11e-01

Pos logit contributions
 AD+3.404
 Lak+3.241
 Cent+3.086
 ACT+3.027
ellar+2.898
Neg logit contributions
ーテ-6.764
 Pengu-6.709
hene-6.167
ゴン-6.140
sten-5.934
Tokent
Token ID83
Feature activation+1.14e+00

Pos logit contributions
 AD+4.014
 Lak+3.805
 ACT+3.607
 Cent+3.592
ellar+3.503
Neg logit contributions
ーテ-7.059
 Pengu-6.990
ゴン-6.387
hene-6.293
 tradem-6.184
Token
Token ID447
Feature activation+3.27e-01

Pos logit contributions
 AD+4.635
 Lak+4.425
 Cent+4.199
 ACT+4.078
ellar+3.752
Neg logit contributions
ーテ-12.087
 Pengu-12.006
ゴン-11.206
hene-11.129
 Hemp-10.639
Token
Token ID247
Feature activation+7.35e-01

Pos logit contributions
 AD+2.215
 Lak+2.134
 ACT+2.003
ellar+1.992
 Cent+1.975
Neg logit contributions
ーテ-2.865
 Pengu-2.857
hene-2.598
ゴン-2.591
sten-2.511
Token)
Token ID8
Feature activation+2.64e-01

Pos logit contributions
 AD+4.103
 Lak+3.880
 ACT+3.653
 Cent+3.640
ellar+3.505
Neg logit contributions
 Pengu-7.094
ーテ-7.042
hene-6.560
ゴン-6.347
sten-6.228
Token /
Token ID1220
Feature activation-1.01e-01

Pos logit contributions
 AD+1.558
 Lak+1.484
 ACT+1.390
 Cent+1.380
ellar+1.357
Neg logit contributions
 Pengu-2.541
ーテ-2.540
hene-2.349
ゴン-2.318
sten-2.289
Tokenu
Token ID84
Feature activation+1.15e-01

Pos logit contributions
ーテ+0.869
 Pengu+0.865
hene+0.793
ゴン+0.782
sten+0.774
Neg logit contributions
 AD-0.702
 Lak-0.680
 ACT-0.639
ellar-0.636
 Cent-0.633
TokenB
Token ID33
Feature activation-2.57e-01

Pos logit contributions
ーテ+0.879
 Pengu+0.875
hene+0.814
ゴン+0.792
sten+0.791
Neg logit contributions
 AD-0.907
 Lak-0.883
ellar-0.844
 ACT-0.840
 Cent-0.827
Tokenearing
Token ID6648
Feature activation-1.46e-01

Pos logit contributions
 Pengu+1.513
ーテ+1.483
hene+1.400
sten+1.336
ゴン+1.291
Neg logit contributions
 AD-2.397
 Lak-2.352
 ACT-2.260
 Cent-2.237
ellar-2.201
Token:
Token ID25
Feature activation+1.72e-01

Pos logit contributions
ーテ+1.244
 Pengu+1.238
hene+1.153
sten+1.128
ゴン+1.113
Neg logit contributions
 AD-0.963
 Lak-0.940
ellar-0.899
 ACT-0.875
 Cent-0.861
Token Size
Token ID12849
Feature activation+6.17e-01

Pos logit contributions
 AD+1.343
 Lak+1.303
 ACT+1.235
ellar+1.232
 Cent+1.224
Neg logit contributions
ーテ-1.328
 Pengu-1.323
hene-1.201
ゴン-1.175
sten-1.163
Token C
Token ID327
Feature activation+8.54e-02

Pos logit contributions
 AD+3.752
 Lak+3.474
 ACT+3.324
 Cent+3.311
ellar+3.108
Neg logit contributions
ーテ-5.971
 Pengu-5.851
ゴン-5.439
hene-5.385
sten-5.285
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
 AD+0.578
 Lak+0.554
 ACT+0.521
 Cent+0.520
ellar+0.517
Neg logit contributions
ーテ-0.752
 Pengu-0.750
hene-0.686
ゴン-0.676
sten-0.668
Token Y
Token ID575
Feature activation+4.20e-01

Pos logit contributions
 AD+6.380
 Cent+6.163
 Lak+5.982
 ACT+5.593
cent+5.242
Neg logit contributions
ーテ-10.300
 Pengu-10.020
ゴン-9.538
hene-9.229
sten-9.144
TokenYR
Token ID38162
Feature activation+5.80e-02

Pos logit contributions
 AD+3.081
 Lak+2.947
 ACT+2.786
ellar+2.775
 Cent+2.775
Neg logit contributions
ーテ-3.500
 Pengu-3.466
hene-3.132
ゴン-3.095
sten-3.053
Token DS
Token ID17400
Feature activation+3.51e-01

Pos logit contributions
 AD+0.407
 Lak+0.393
 ACT+0.370
ellar+0.368
 Cent+0.367
Neg logit contributions
ーテ-0.494
 Pengu-0.492
hene-0.450
ゴン-0.442
sten-0.437
Token cent
Token ID1247
Feature activation+5.18e-01

Pos logit contributions
 AD+2.359
 Lak+2.254
 ACT+2.116
 Cent+2.108
ellar+2.083
Neg logit contributions
ーテ-3.135
 Pengu-3.117
hene-2.844
ゴン-2.813
sten-2.774
Tokenering
Token ID1586
Feature activation+6.62e-01

Pos logit contributions
 AD+4.471
 Lak+4.280
 Cent+4.206
ellar+4.161
 ACT+4.126
Neg logit contributions
ーテ-3.599
 Pengu-3.484
ゴン-3.135
hene-3.020
sten-2.919
Token m
Token ID285
Feature activation+2.64e-01

Pos logit contributions
 Pengu+0.363
ーテ+0.362
hene+0.329
ゴン+0.321
sten+0.320
Neg logit contributions
 AD-0.347
 Lak-0.337
ellar-0.320
 ACT-0.319
 Cent-0.316
Token2
Token ID17
Feature activation+2.43e-01

Pos logit contributions
 AD+1.885
 Lak+1.818
 ACT+1.717
ellar+1.707
 Cent+1.705
Neg logit contributions
ーテ-2.216
 Pengu-2.202
hene-2.004
ゴン-1.982
sten-1.945
Tokenk
Token ID74
Feature activation+1.60e-01

Pos logit contributions
 AD+1.887
 Lak+1.829
 ACT+1.735
ellar+1.727
 Cent+1.722
Neg logit contributions
ーテ-1.864
 Pengu-1.860
hene-1.707
sten-1.658
ゴン-1.651
Token [
Token ID685
Feature activation-2.13e-02

Pos logit contributions
 AD+0.947
 Lak+0.913
ellar+0.850
 ACT+0.846
 Cent+0.840
Neg logit contributions
 Pengu-1.517
ーテ-1.512
hene-1.404
ゴン-1.374
sten-1.374
Token10
Token ID940
Feature activation-1.47e-01

Pos logit contributions
ーテ+0.165
 Pengu+0.164
hene+0.155
sten+0.153
ゴン+0.148
Neg logit contributions
 AD-0.158
 Lak-0.155
ellar-0.145
 ACT-0.145
 Cent-0.144
Token:
Token ID25
Feature activation+1.10e+00

Pos logit contributions
 Pengu+1.154
ーテ+1.152
hene+1.114
sten+1.093
punk+1.062
Neg logit contributions
 AD-0.998
 Lak-0.989
ellar-0.944
 Cent-0.912
 ACT-0.909
Token48
Token ID2780
Feature activation+2.10e-01

Pos logit contributions
 AD+4.606
 Lak+4.542
ellar+4.023
 ACT+3.798
 Cent+3.698
Neg logit contributions
 Pengu-12.017
ーテ-11.673
ゴン-11.042
hene-10.709
 tradem-10.371
Token]
Token ID60
Feature activation+2.52e-01

Pos logit contributions
 AD+1.733
 Lak+1.681
 ACT+1.599
ellar+1.590
 Cent+1.588
Neg logit contributions
ーテ-1.529
 Pengu-1.522
hene-1.372
ゴン-1.345
sten-1.327
Token <
Token ID1279
Feature activation+6.57e-02

Pos logit contributions
 AD+1.765
 Lak+1.652
 ACT+1.605
 Cent+1.541
ellar+1.539
Neg logit contributions
ーテ-2.206
 Pengu-2.164
hene-2.000
ゴン-1.960
sten-1.935
Tokenqu
Token ID421
Feature activation-3.41e-01

Pos logit contributions
 AD+0.489
 Lak+0.473
 ACT+0.447
ellar+0.445
 Cent+0.444
Neg logit contributions
ーテ-0.530
 Pengu-0.528
hene-0.481
ゴン-0.472
sten-0.467
Tokenij
Token ID2926
Feature activation+1.94e-01

Pos logit contributions
 Pengu+2.081
ーテ+2.031
hene+1.959
sten+1.846
ゴン+1.756
Neg logit contributions
 AD-3.114
 Lak-3.025
 ACT-2.907
 Cent-2.884
ellar-2.833
Token:
Token ID25
Feature activation-1.43e-01

Pos logit contributions
ーテ+0.164
 Pengu+0.164
hene+0.149
ゴン+0.147
sten+0.145
Neg logit contributions
 AD-0.156
 Lak-0.151
 ACT-0.143
ellar-0.143
 Cent-0.142
Token The
Token ID383
Feature activation-3.05e-02

Pos logit contributions
ーテ+0.753
 Pengu+0.751
hene+0.642
ゴン+0.628
sten+0.615
Neg logit contributions
 AD-1.456
 Lak-1.424
ellar-1.368
 ACT-1.366
 Cent-1.355
Token Saga
Token ID19743
Feature activation-5.97e-02

Pos logit contributions
 Pengu+0.254
ーテ+0.251
hene+0.229
ゴン+0.225
sten+0.224
Neg logit contributions
 AD-0.218
 Lak-0.211
ellar-0.200
 ACT-0.198
 Cent-0.196
Token[/
Token ID13412
Feature activation+4.00e-01

Pos logit contributions
ーテ+0.449
 Pengu+0.447
hene+0.405
ゴン+0.397
sten+0.392
Neg logit contributions
 AD-0.477
 Lak-0.463
 ACT-0.439
ellar-0.437
 Cent-0.436
Tokenb
Token ID65
Feature activation+1.04e-01

Pos logit contributions
 AD+2.863
 Lak+2.728
 ACT+2.601
ellar+2.589
 Cent+2.575
Neg logit contributions
ーテ-3.413
 Pengu-3.388
ゴン-3.026
hene-3.022
sten-2.904
Token][/
Token ID44926
Feature activation+1.09e+00

Pos logit contributions
 AD+0.906
 Lak+0.872
 ACT+0.837
 Cent+0.830
ellar+0.830
Neg logit contributions
ーテ-0.713
 Pengu-0.709
hene-0.634
ゴン-0.616
sten-0.611
Tokencolor
Token ID8043
Feature activation-3.03e-02

Pos logit contributions
 AD+3.186
 Lak+2.758
ellar+2.679
ller+2.647
iel+2.617
Neg logit contributions
ーテ-13.910
 Pengu-13.708
ゴン-12.754
 tradem-12.518
 pione-11.908
Token]
Token ID60
Feature activation-2.12e-01

Pos logit contributions
ーテ+0.227
 Pengu+0.226
hene+0.204
ゴン+0.200
sten+0.197
Neg logit contributions
 AD-0.246
 Lak-0.237
 ACT-0.226
 Cent-0.224
ellar-0.224
Token -
Token ID532
Feature activation-4.80e-02

Pos logit contributions
 Pengu+1.723
ーテ+1.717
hene+1.567
ゴン+1.533
sten+1.526
Neg logit contributions
 AD-1.536
 Lak-1.489
ellar-1.412
 ACT-1.401
 Cent-1.386
Token 100
Token ID1802
Feature activation+9.69e-03

Pos logit contributions
 Pengu+0.401
ーテ+0.401
hene+0.365
ゴン+0.358
sten+0.355
Neg logit contributions
 AD-0.340
 Lak-0.330
ellar-0.313
 ACT-0.309
 Cent-0.306
Token%
Token ID4
Feature activation-4.94e-02

Pos logit contributions
 AD+0.060
 Lak+0.058
 ACT+0.054
 Cent+0.054
ellar+0.053
Neg logit contributions
ーテ-0.091
 Pengu-0.090
hene-0.083
ゴン-0.082
sten-0.081
Token "
Token ID366
Feature activation+5.01e-01

Pos logit contributions
 AD+1.651
 Lak+1.581
 ACT+1.486
 Cent+1.472
ellar+1.471
Neg logit contributions
ーテ-2.377
 Pengu-2.373
hene-2.185
ゴン-2.152
sten-2.123
Token2016
Token ID5304
Feature activation+7.35e-01

Pos logit contributions
 AD+3.323
 Lak+3.102
 ACT+2.967
 Cent+2.946
ellar+2.891
Neg logit contributions
 Pengu-4.501
ーテ-4.478
hene-4.084
ゴン-4.019
 tradem-3.966
Token.
Token ID13
Feature activation+4.10e-01

Pos logit contributions
 AD+4.464
 Lak+4.150
 ACT+3.978
 Cent+3.955
ellar+3.807
Neg logit contributions
ーテ-7.129
 Pengu-7.119
 tradem-6.531
ゴン-6.510
hene-6.387
Token??
Token ID3548
Feature activation+5.70e-01

Pos logit contributions
 AD+2.830
 Lak+2.704
 ACT+2.567
 Cent+2.531
ellar+2.493
Neg logit contributions
ーテ-3.591
 Pengu-3.567
hene-3.245
ゴン-3.211
 tradem-3.172
Token.?
Token ID40791
Feature activation+6.69e-01

Pos logit contributions
 AD+4.343
 Lak+4.177
 Cent+3.987
 ACT+3.986
ellar+3.928
Neg logit contributions
ーテ-4.535
 Pengu-4.512
hene-4.136
ゴン-4.065
sten-3.969
Token?"
Token ID1701
Feature activation+1.08e+00

Pos logit contributions
 AD+4.028
 Lak+3.738
 Cent+3.606
 ACT+3.593
ellar+3.448
Neg logit contributions
ーテ-6.306
 Pengu-6.247
hene-5.806
ゴン-5.776
sten-5.661
Token]
Token ID60
Feature activation+5.74e-01

Pos logit contributions
 AD+7.823
 ACT+7.261
 Lak+7.193
 Cent+7.075
 ACC+6.561
Neg logit contributions
 Pengu-8.861
ーテ-8.850
 tradem-8.548
ゴン-8.302
hene-8.084
Token [
Token ID685
Feature activation+6.90e-01

Pos logit contributions
 AD+3.834
 Lak+3.619
 ACT+3.458
 Cent+3.377
ellar+3.295
Neg logit contributions
 Pengu-5.161
ーテ-5.157
hene-4.753
ゴン-4.723
sten-4.602
TokenEvent
Token ID9237
Feature activation+5.37e-01

Pos logit contributions
 AD+3.806
 Lak+3.548
 Cent+3.390
 ACT+3.260
ellar+3.148
Neg logit contributions
 Pengu-6.969
ーテ-6.934
ゴン-6.567
hene-6.443
 tradem-6.303
TokenCountry
Token ID33921
Feature activation+2.87e-01

Pos logit contributions
 AD+3.499
 Lak+3.357
 Cent+3.177
 ACT+3.128
ellar+3.115
Neg logit contributions
 Pengu-4.869
ーテ-4.839
hene-4.449
ゴン-4.443
sten-4.263
Token "
Token ID366
Feature activation+5.68e-01

Pos logit contributions
 AD+1.772
 Lak+1.669
 ACT+1.585
 Cent+1.566
ellar+1.550
Neg logit contributions
 Pengu-2.697
ーテ-2.686
hene-2.500
ゴン-2.460
sten-2.431
Token2
Token ID17
Feature activation+3.30e-01

Pos logit contributions
 AD+1.900
 Lak+1.820
 Cent+1.720
 ACT+1.719
ellar+1.703
Neg logit contributions
ーテ-2.298
 Pengu-2.273
hene-2.065
ゴン-2.059
sten-1.993
Tokenk
Token ID74
Feature activation+1.57e-01

Pos logit contributions
 AD+2.326
 Lak+2.254
 Cent+2.101
 ACT+2.100
ellar+2.084
Neg logit contributions
ーテ-2.819
 Pengu-2.785
hene-2.536
ゴン-2.506
sten-2.447
Token lol
Token ID19462
Feature activation+1.51e-02

Pos logit contributions
 AD+0.990
 Lak+0.951
 ACT+0.890
ellar+0.882
 Cent+0.881
Neg logit contributions
ーテ-1.445
 Pengu-1.439
hene-1.328
ゴン-1.307
sten-1.294
Token [
Token ID685
Feature activation+3.47e-02

Pos logit contributions
 AD+0.090
 Lak+0.086
 ACT+0.080
 Cent+0.080
ellar+0.079
Neg logit contributions
ーテ-0.145
 Pengu-0.144
hene-0.134
ゴン-0.132
sten-0.130
Token10
Token ID940
Feature activation-4.45e-02

Pos logit contributions
 AD+0.254
 Lak+0.250
ellar+0.234
 ACT+0.234
 Cent+0.232
Neg logit contributions
ーテ-0.275
 Pengu-0.273
hene-0.256
sten-0.252
ゴン-0.246
Token:
Token ID25
Feature activation+1.05e+00

Pos logit contributions
 Pengu+0.366
ーテ+0.366
hene+0.349
sten+0.342
punk+0.332
Neg logit contributions
 AD-0.298
 Lak-0.293
ellar-0.278
 Cent-0.271
 ACT-0.270
Token47
Token ID2857
Feature activation+4.01e-01

Pos logit contributions
 AD+4.269
 Lak+3.897
 ACT+3.604
ellar+3.441
 Cent+3.300
Neg logit contributions
ーテ-12.201
 Pengu-12.123
 tradem-11.436
ゴン-11.368
hene-11.041
Token]
Token ID60
Feature activation+2.23e-01

Pos logit contributions
 AD+3.193
 Lak+3.054
 ACT+2.927
 Cent+2.892
ellar+2.858
Neg logit contributions
ーテ-3.155
 Pengu-3.129
ゴン-2.800
hene-2.757
 tradem-2.696
Token <
Token ID1279
Feature activation+9.76e-02

Pos logit contributions
 AD+1.565
 Lak+1.473
 ACT+1.424
 Cent+1.374
ellar+1.374
Neg logit contributions
ーテ-1.938
 Pengu-1.904
hene-1.759
ゴン-1.723
sten-1.703
Tokenm
Token ID76
Feature activation+1.43e-01

Pos logit contributions
 AD+0.729
 Lak+0.705
 ACT+0.667
ellar+0.664
 Cent+0.662
Neg logit contributions
ーテ-0.786
 Pengu-0.783
hene-0.712
ゴン-0.700
sten-0.691
Tokenik
Token ID1134
Feature activation+3.34e-02

Pos logit contributions
 AD+1.082
 Lak+1.048
 ACT+0.992
ellar+0.985
 Cent+0.984
Neg logit contributions
ーテ-1.139
 Pengu-1.135
hene-1.035
ゴン-1.014
sten-1.004
Token m
Token ID285
Feature activation+2.67e-01

Pos logit contributions
ーテ+1.535
 Pengu+1.534
hene+1.388
ゴン+1.360
sten+1.349
Neg logit contributions
 AD-1.531
 Lak-1.489
ellar-1.417
 ACT-1.409
 Cent-1.395
Token2
Token ID17
Feature activation+2.70e-01

Pos logit contributions
 AD+1.910
 Lak+1.843
 ACT+1.740
ellar+1.730
 Cent+1.727
Neg logit contributions
ーテ-2.236
 Pengu-2.225
hene-2.026
ゴン-1.999
sten-1.967
Tokenk
Token ID74
Feature activation+7.12e-02

Pos logit contributions
 AD+2.086
 Lak+2.022
 ACT+1.917
ellar+1.908
 Cent+1.902
Neg logit contributions
ーテ-2.085
 Pengu-2.079
hene-1.904
sten-1.849
ゴン-1.849
Token [
Token ID685
Feature activation-3.44e-02

Pos logit contributions
 AD+0.431
 Lak+0.419
ellar+0.395
 Cent+0.385
 ACT+0.385
Neg logit contributions
 Pengu-0.653
ーテ-0.649
hene-0.610
sten-0.598
ゴン-0.586
Token10
Token ID940
Feature activation-2.29e-01

Pos logit contributions
ーテ+0.265
 Pengu+0.263
hene+0.249
sten+0.246
ゴン+0.237
Neg logit contributions
 AD-0.256
 Lak-0.252
ellar-0.237
 ACT-0.236
 Cent-0.234
Token:
Token ID25
Feature activation+1.02e+00

Pos logit contributions
 Pengu+1.732
ーテ+1.731
hene+1.692
sten+1.664
punk+1.617
Neg logit contributions
 AD-1.583
 Lak-1.578
ellar-1.514
 Cent-1.453
 ACT-1.445
Token48
Token ID2780
Feature activation+3.25e-01

Pos logit contributions
 AD+4.334
 Lak+4.319
ellar+3.727
 ACT+3.646
 Cent+3.540
Neg logit contributions
 Pengu-11.518
ーテ-11.224
ゴン-10.618
 tradem-10.271
hene-10.163
Token]
Token ID60
Feature activation+3.19e-01

Pos logit contributions
 AD+2.630
 Lak+2.530
 ACT+2.418
 Cent+2.394
ellar+2.376
Neg logit contributions
ーテ-2.485
 Pengu-2.469
ゴン-2.195
hene-2.194
sten-2.115
Token <
Token ID1279
Feature activation+1.37e-01

Pos logit contributions
 AD+2.335
 Lak+2.251
 ACT+2.133
 Cent+2.109
ellar+2.108
Neg logit contributions
ーテ-2.631
 Pengu-2.614
hene-2.389
ゴン-2.351
sten-2.318
Tokenmin
Token ID1084
Feature activation-4.44e-02

Pos logit contributions
 AD+1.020
 Lak+0.987
 ACT+0.932
ellar+0.930
 Cent+0.926
Neg logit contributions
ーテ-1.101
 Pengu-1.095
hene-0.994
ゴン-0.979
sten-0.961
Tokenion
Token ID295
Feature activation+6.16e-01

Pos logit contributions
 Pengu+0.247
ーテ+0.243
hene+0.228
sten+0.215
punk+0.207
Neg logit contributions
 AD-0.431
 Lak-0.424
 Cent-0.403
 ACT-0.403
ellar-0.398
Token y
Token ID331
Feature activation-1.95e-01

Pos logit contributions
 Pengu+1.404
ーテ+1.393
hene+1.265
sten+1.237
ゴン+1.234
Neg logit contributions
 AD-1.342
 Lak-1.306
ellar-1.254
 ACT-1.237
 Cent-1.223
Token2
Token ID17
Feature activation+4.33e-02

Pos logit contributions
 Pengu+1.547
ーテ+1.530
hene+1.431
sten+1.397
ゴン+1.372
Neg logit contributions
 AD-1.439
 Lak-1.407
ellar-1.324
 ACT-1.323
 Cent-1.317
Tokenk
Token ID74
Feature activation+1.85e-02

Pos logit contributions
 AD+0.332
 Lak+0.322
 ACT+0.305
ellar+0.304
 Cent+0.302
Neg logit contributions
 Pengu-0.338
ーテ-0.337
hene-0.308
sten-0.300
ゴン-0.299
Token [
Token ID685
Feature activation+4.85e-02

Pos logit contributions
 AD+0.120
 Lak+0.117
ellar+0.109
 ACT+0.109
 Cent+0.108
Neg logit contributions
 Pengu-0.164
ーテ-0.164
hene-0.151
ゴン-0.148
sten-0.148
Token10
Token ID940
Feature activation-6.20e-02

Pos logit contributions
 AD+0.354
 Lak+0.348
ellar+0.325
 ACT+0.325
 Cent+0.323
Neg logit contributions
ーテ-0.386
 Pengu-0.384
hene-0.360
sten-0.353
ゴン-0.346
Token:
Token ID25
Feature activation+1.01e+00

Pos logit contributions
 Pengu+0.506
ーテ+0.505
hene+0.483
sten+0.473
punk+0.460
Neg logit contributions
 AD-0.415
 Lak-0.409
ellar-0.389
 Cent-0.378
 ACT-0.377
Token47
Token ID2857
Feature activation+3.62e-01

Pos logit contributions
 AD+4.181
 Lak+3.818
 ACT+3.524
ellar+3.361
 Cent+3.234
Neg logit contributions
ーテ-11.712
 Pengu-11.645
 tradem-10.948
ゴン-10.895
hene-10.616
Token]
Token ID60
Feature activation+1.88e-01

Pos logit contributions
 AD+2.904
 Lak+2.786
 ACT+2.665
 Cent+2.636
ellar+2.612
Neg logit contributions
ーテ-2.804
 Pengu-2.782
ゴン-2.483
hene-2.463
 tradem-2.372
Token <
Token ID1279
Feature activation+8.33e-02

Pos logit contributions
 AD+1.329
 Lak+1.253
 ACT+1.209
 Cent+1.170
ellar+1.170
Neg logit contributions
ーテ-1.629
 Pengu-1.602
hene-1.479
ゴン-1.449
sten-1.432
Tokenchain
Token ID7983
Feature activation-3.50e-01

Pos logit contributions
 AD+0.622
 Lak+0.601
 ACT+0.569
ellar+0.566
 Cent+0.564
Neg logit contributions
ーテ-0.672
 Pengu-0.669
hene-0.610
ゴン-0.599
sten-0.591
Tokenx
Token ID87
Feature activation+2.24e-01

Pos logit contributions
 Pengu+2.769
ーテ+2.759
hene+2.623
sten+2.558
ゴン+2.484
Neg logit contributions
 AD-2.558
 Lak-2.503
 ACT-2.353
 Cent-2.338
ellar-2.320
Tokenw
Token ID86
Feature activation-1.24e-01

Pos logit contributions
 Pengu+0.554
ーテ+0.550
hene+0.515
sten+0.504
ゴン+0.488
Neg logit contributions
 AD-0.532
 Lak-0.520
 ACT-0.490
 Cent-0.485
ellar-0.481
Token>
Token ID29
Feature activation+3.40e-02

Pos logit contributions
 Pengu+0.993
ーテ+0.990
hene+0.925
sten+0.899
ゴン+0.880
Neg logit contributions
 AD-0.910
 Lak-0.886
 ACT-0.837
 Cent-0.829
ellar-0.823
Token save
Token ID3613
Feature activation-1.55e-02

Pos logit contributions
 AD+0.246
 Lak+0.239
ellar+0.228
 ACT+0.225
 Cent+0.223
Neg logit contributions
 Pengu-0.280
ーテ-0.277
hene-0.254
ゴン-0.247
sten-0.247
Token [
Token ID685
Feature activation+2.84e-01

Pos logit contributions
 Pengu+0.128
ーテ+0.127
hene+0.116
ゴン+0.113
sten+0.113
Neg logit contributions
 AD-0.112
 Lak-0.109
ellar-0.104
 ACT-0.103
 Cent-0.102
Token10
Token ID940
Feature activation+6.87e-02

Pos logit contributions
 AD+1.966
 Lak+1.891
 ACT+1.784
ellar+1.771
 Cent+1.769
Neg logit contributions
ーテ-2.456
 Pengu-2.446
hene-2.231
ゴン-2.201
sten-2.163
Token:
Token ID25
Feature activation+1.01e+00

Pos logit contributions
 AD+0.449
 Lak+0.438
ellar+0.412
 ACT+0.406
 Cent+0.405
Neg logit contributions
ーテ-0.593
 Pengu-0.592
hene-0.557
sten-0.545
ゴン-0.536
Token47
Token ID2857
Feature activation+3.69e-01

Pos logit contributions
 AD+4.195
 Lak+3.838
 ACT+3.541
ellar+3.367
 Cent+3.266
Neg logit contributions
ーテ-11.652
 Pengu-11.583
 tradem-10.923
ゴン-10.868
hene-10.624
Token]
Token ID60
Feature activation+2.84e-01

Pos logit contributions
 AD+2.964
 Lak+2.841
 ACT+2.720
 Cent+2.690
ellar+2.664
Neg logit contributions
ーテ-2.865
 Pengu-2.842
ゴン-2.537
hene-2.517
 tradem-2.425
Token <
Token ID1279
Feature activation+1.00e-01

Pos logit contributions
 AD+1.981
 Lak+1.857
 ACT+1.801
 Cent+1.734
ellar+1.731
Neg logit contributions
ーテ-2.487
 Pengu-2.440
hene-2.263
ゴン-2.207
sten-2.186
Tokenf
Token ID69
Feature activation-1.54e-01

Pos logit contributions
 AD+0.749
 Lak+0.725
 ACT+0.685
ellar+0.682
 Cent+0.680
Neg logit contributions
ーテ-0.810
 Pengu-0.806
hene-0.733
ゴン-0.721
sten-0.710
Tokengt
Token ID13655
Feature activation-7.57e-03

Pos logit contributions
 Pengu+1.226
ーテ+1.225
hene+1.148
sten+1.121
ゴン+1.099
Neg logit contributions
 AD-1.134
 Lak-1.105
 ACT-1.043
 Cent-1.030
ellar-1.023
Tokenm
Token ID76
Feature activation+2.82e-01

Pos logit contributions
 AD+1.365
 Lak+1.313
 ACT+1.231
ellar+1.220
 Cent+1.219
Neg logit contributions
ーテ-1.882
 Pengu-1.871
hene-1.723
ゴン-1.699
sten-1.675
Token4
Token ID19
Feature activation-6.57e-03

Pos logit contributions
 AD+1.889
 Lak+1.816
 ACT+1.706
 Cent+1.692
ellar+1.687
Neg logit contributions
ーテ-2.505
 Pengu-2.487
hene-2.287
ゴン-2.255
sten-2.219
Tokenk
Token ID74
Feature activation+3.60e-02

Pos logit contributions
 Pengu+0.056
ーテ+0.055
hene+0.051
sten+0.050
ゴン+0.050
Neg logit contributions
 AD-0.046
 Lak-0.044
ellar-0.042
 ACT-0.042
 Cent-0.041
Token [
Token ID685
Feature activation-7.73e-02

Pos logit contributions
 AD+0.231
 Lak+0.224
ellar+0.209
 ACT+0.209
 Cent+0.207
Neg logit contributions
 Pengu-0.325
ーテ-0.324
hene-0.299
ゴン-0.293
sten-0.292
Token10
Token ID940
Feature activation-2.17e-01

Pos logit contributions
ーテ+0.587
 Pengu+0.585
hene+0.553
sten+0.547
ゴン+0.526
Neg logit contributions
 AD-0.578
 Lak-0.571
ellar-0.536
 ACT-0.533
 Cent-0.528
Token:
Token ID25
Feature activation+1.01e+00

Pos logit contributions
 Pengu+1.656
ーテ+1.651
hene+1.610
sten+1.582
punk+1.541
Neg logit contributions
 AD-1.490
 Lak-1.484
ellar-1.422
 Cent-1.369
 ACT-1.361
Token48
Token ID2780
Feature activation+2.00e-01

Pos logit contributions
 AD+4.418
 Lak+4.364
ellar+3.870
 ACT+3.681
 Cent+3.635
Neg logit contributions
 Pengu-10.993
ーテ-10.691
ゴン-10.102
hene-9.822
sten-9.469
Token]
Token ID60
Feature activation+2.17e-01

Pos logit contributions
 AD+1.655
 Lak+1.607
 ACT+1.528
ellar+1.520
 Cent+1.517
Neg logit contributions
ーテ-1.456
 Pengu-1.450
hene-1.308
ゴン-1.280
sten-1.264
Token <
Token ID1279
Feature activation+6.56e-02

Pos logit contributions
 AD+1.527
 Lak+1.438
 ACT+1.389
 Cent+1.343
ellar+1.339
Neg logit contributions
ーテ-1.880
 Pengu-1.848
hene-1.708
ゴン-1.672
sten-1.653
Tokenmom
Token ID32542
Feature activation-2.21e-01

Pos logit contributions
 AD+0.491
 Lak+0.475
 ACT+0.449
ellar+0.447
 Cent+0.445
Neg logit contributions
ーテ-0.528
 Pengu-0.526
hene-0.479
ゴン-0.471
sten-0.465
Tokenper
Token ID525
Feature activation-1.73e-01

Pos logit contributions
 Pengu+1.889
ーテ+1.875
hene+1.831
sten+1.814
punk+1.736
Neg logit contributions
 AD-1.426
 Lak-1.405
 Cent-1.303
 ACT-1.295
ellar-1.277
Token m
Token ID285
Feature activation+2.74e-01

Pos logit contributions
 Pengu+1.543
ーテ+1.540
hene+1.394
ゴン+1.365
sten+1.357
Neg logit contributions
 AD-1.527
 Lak-1.487
ellar-1.418
 ACT-1.407
 Cent-1.391
Token2
Token ID17
Feature activation+2.97e-01

Pos logit contributions
 AD+1.961
 Lak+1.892
 ACT+1.786
ellar+1.776
 Cent+1.773
Neg logit contributions
ーテ-2.300
 Pengu-2.287
hene-2.083
ゴン-2.057
sten-2.021
Tokenk
Token ID74
Feature activation+1.37e-01

Pos logit contributions
 AD+2.284
 Lak+2.213
 ACT+2.096
ellar+2.085
 Cent+2.080
Neg logit contributions
ーテ-2.323
 Pengu-2.315
hene-2.113
ゴン-2.062
sten-2.050
Token [
Token ID685
Feature activation+1.18e-02

Pos logit contributions
 AD+0.819
 Lak+0.793
ellar+0.743
 ACT+0.731
 Cent+0.729
Neg logit contributions
 Pengu-1.282
ーテ-1.277
hene-1.193
sten-1.169
ゴン-1.156
Token10
Token ID940
Feature activation-1.74e-01

Pos logit contributions
 AD+0.087
 Lak+0.086
ellar+0.080
 ACT+0.080
 Cent+0.080
Neg logit contributions
ーテ-0.092
 Pengu-0.092
hene-0.087
sten-0.085
ゴン-0.083
Token:
Token ID25
Feature activation+1.01e+00

Pos logit contributions
 Pengu+1.350
ーテ+1.347
hene+1.308
sten+1.285
punk+1.248
Neg logit contributions
 AD-1.194
 Lak-1.187
ellar-1.137
 Cent-1.093
 ACT-1.087
Token47
Token ID2857
Feature activation+3.67e-01

Pos logit contributions
 AD+4.115
 Lak+3.718
 ACT+3.460
ellar+3.312
 Cent+3.148
Neg logit contributions
ーテ-11.747
 Pengu-11.707
 tradem-10.999
ゴン-10.901
hene-10.613
Token]
Token ID60
Feature activation+2.40e-01

Pos logit contributions
 AD+2.944
 Lak+2.824
 ACT+2.703
 Cent+2.673
ellar+2.648
Neg logit contributions
ーテ-2.851
 Pengu-2.830
ゴン-2.525
hene-2.502
 tradem-2.415
Token <
Token ID1279
Feature activation+5.51e-02

Pos logit contributions
 AD+1.766
 Lak+1.706
 ACT+1.614
ellar+1.602
 Cent+1.599
Neg logit contributions
ーテ-1.969
 Pengu-1.959
hene-1.789
ゴン-1.757
sten-1.737
Tokenj
Token ID73
Feature activation-7.65e-02

Pos logit contributions
 AD+0.410
 Lak+0.397
 ACT+0.375
ellar+0.373
 Cent+0.372
Neg logit contributions
ーテ-0.445
 Pengu-0.443
hene-0.405
ゴン-0.397
sten-0.393
Tokenth
Token ID400
Feature activation-7.07e-02

Pos logit contributions
 Pengu+0.608
ーテ+0.606
hene+0.566
sten+0.548
ゴン+0.544
Neg logit contributions
 AD-0.569
 Lak-0.555
 ACT-0.522
 Cent-0.520
ellar-0.516
Token m
Token ID285
Feature activation+2.61e-01

Pos logit contributions
 Pengu+0.905
ーテ+0.899
hene+0.817
sten+0.797
ゴン+0.796
Neg logit contributions
 AD-0.877
 Lak-0.854
ellar-0.817
 ACT-0.808
 Cent-0.800
Token2
Token ID17
Feature activation+3.14e-01

Pos logit contributions
 AD+1.729
 Lak+1.649
 Cent+1.563
iel+1.555
 ACT+1.550
Neg logit contributions
ーテ-2.342
 Pengu-2.315
ゴン-2.094
hene-2.088
sten-2.013
Tokenk
Token ID74
Feature activation+1.32e-01

Pos logit contributions
 AD+2.400
 Lak+2.324
 ACT+2.200
ellar+2.188
 Cent+2.184
Neg logit contributions
ーテ-2.474
 Pengu-2.465
hene-2.244
ゴン-2.199
sten-2.176
Token [
Token ID685
Feature activation+4.78e-02

Pos logit contributions
 AD+0.751
 Lak+0.714
 ACT+0.666
 Cent+0.657
ellar+0.649
Neg logit contributions
ーテ-1.318
 Pengu-1.303
hene-1.214
ゴン-1.196
sten-1.180
Token10
Token ID940
Feature activation-1.36e-01

Pos logit contributions
 AD+0.349
 Lak+0.342
ellar+0.321
 ACT+0.320
 Cent+0.318
Neg logit contributions
ーテ-0.380
 Pengu-0.378
hene-0.354
sten-0.348
ゴン-0.340
Token:
Token ID25
Feature activation+1.01e+00

Pos logit contributions
 Pengu+1.077
ーテ+1.074
hene+1.038
sten+1.018
punk+0.990
Neg logit contributions
 AD-0.926
 Lak-0.917
ellar-0.876
 Cent-0.846
 ACT-0.843
Token47
Token ID2857
Feature activation+3.85e-01

Pos logit contributions
 AD+4.177
 Lak+3.818
 ACT+3.531
ellar+3.377
 Cent+3.244
Neg logit contributions
ーテ-11.638
 Pengu-11.576
 tradem-10.855
ゴン-10.819
hene-10.541
Token]
Token ID60
Feature activation+2.22e-01

Pos logit contributions
 AD+3.072
 Lak+2.942
 ACT+2.818
 Cent+2.785
ellar+2.754
Neg logit contributions
ーテ-3.012
 Pengu-2.988
ゴン-2.671
hene-2.637
 tradem-2.565
Token <
Token ID1279
Feature activation+9.04e-02

Pos logit contributions
 AD+1.559
 Lak+1.464
 ACT+1.417
ellar+1.369
 Cent+1.367
Neg logit contributions
ーテ-1.933
 Pengu-1.898
hene-1.753
ゴン-1.717
sten-1.696
Tokenmom
Token ID32542
Feature activation-2.04e-01

Pos logit contributions
 AD+0.674
 Lak+0.652
 ACT+0.616
ellar+0.613
 Cent+0.611
Neg logit contributions
ーテ-0.730
 Pengu-0.727
hene-0.662
ゴン-0.650
sten-0.642
Tokenper
Token ID525
Feature activation-1.03e-01

Pos logit contributions
 Pengu+1.758
ーテ+1.743
hene+1.698
sten+1.681
punk+1.609
Neg logit contributions
 AD-1.313
 Lak-1.290
 Cent-1.197
 ACT-1.192
ellar-1.173
Token string
Token ID4731
Feature activation-2.27e-01

Pos logit contributions
 Pengu+4.815
ーテ+4.814
hene+4.400
sten+4.150
ゴン+4.062
Neg logit contributions
 AD-6.811
 Lak-6.811
ellar-6.414
 ACT-6.320
 Cent-6.316
Token "
Token ID366
Feature activation+6.96e-01

Pos logit contributions
ーテ+2.036
 Pengu+2.027
hene+1.869
ゴン+1.834
sten+1.822
Neg logit contributions
 AD-1.478
 Lak-1.427
 ACT-1.333
ellar-1.329
 Cent-1.322
Tokenvi
Token ID8903
Feature activation+3.65e-01

Pos logit contributions
 AD+4.067
 Lak+3.823
 ACT+3.751
iel+3.733
 Cent+3.726
Neg logit contributions
 Pengu-6.481
ーテ-6.461
ゴン-5.871
hene-5.700
 tradem-5.569
Token:
Token ID25
Feature activation-1.44e-01

Pos logit contributions
 AD+2.021
 Lak+1.873
 ACT+1.798
 Cent+1.786
ellar+1.758
Neg logit contributions
 Pengu-3.662
ーテ-3.657
hene-3.334
ゴン-3.301
sten-3.236
Token",
Token ID1600
Feature activation+2.26e-01

Pos logit contributions
 Pengu+1.137
ーテ+1.136
hene+1.025
ゴン+1.011
sten+0.994
Neg logit contributions
 AD-1.103
 Lak-1.064
 ACT-1.011
ellar-1.006
 Cent-1.004
Token "
Token ID366
Feature activation+9.95e-01

Pos logit contributions
 AD+1.447
 Lak+1.389
 ACT+1.304
ellar+1.291
 Cent+1.290
Neg logit contributions
ーテ-2.067
 Pengu-2.060
hene-1.897
ゴン-1.871
sten-1.846
Tokenvim
Token ID31124
Feature activation+4.12e-01

Pos logit contributions
 AD+4.445
 Lak+4.005
 ACT+3.972
iel+3.955
ellar+3.950
Neg logit contributions
ーテ-10.902
 Pengu-10.707
ゴン-9.919
 tradem-9.630
hene-9.386
Token:"
Token ID11097
Feature activation-2.02e-01

Pos logit contributions
 AD+1.696
 Lak+1.566
 ACT+1.441
 Cent+1.437
ellar+1.427
Neg logit contributions
ーテ-4.674
 Pengu-4.670
hene-4.339
ゴン-4.301
sten-4.246
Token or
Token ID393
Feature activation+4.35e-01

Pos logit contributions
ーテ+1.560
 Pengu+1.551
hene+1.444
sten+1.407
ゴン+1.371
Neg logit contributions
 AD-1.487
 Lak-1.464
ellar-1.371
 ACT-1.355
 Cent-1.354
Token "
Token ID366
Feature activation+3.54e-02

Pos logit contributions
 AD+2.655
 Lak+2.512
 ACT+2.392
 Cent+2.346
ellar+2.313
Neg logit contributions
ーテ-4.167
 Pengu-4.154
ゴン-3.813
hene-3.795
sten-3.676
Tokenex
Token ID1069
Feature activation+9.21e-02

Pos logit contributions
 AD+0.229
 Lak+0.222
 ACT+0.205
 Cent+0.205
ellar+0.204
Neg logit contributions
ーテ-0.316
 Pengu-0.315
hene-0.296
sten-0.290
ゴン-0.286
Tokenm
Token ID76
Feature activation+2.97e-02

Pos logit contributions
ーテ+0.254
 Pengu+0.253
hene+0.230
ゴン+0.226
sten+0.223
Neg logit contributions
 AD-0.243
 Lak-0.235
 ACT-0.223
ellar-0.222
 Cent-0.221
Token2
Token ID17
Feature activation+1.46e-01

Pos logit contributions
 AD+0.214
 Lak+0.207
 ACT+0.196
ellar+0.196
 Cent+0.194
Neg logit contributions
ーテ-0.244
 Pengu-0.244
hene-0.223
ゴン-0.219
sten-0.217
Tokenk
Token ID74
Feature activation+1.91e-01

Pos logit contributions
 AD+1.076
 Lak+1.041
 ACT+0.981
 Cent+0.975
ellar+0.972
Neg logit contributions
ーテ-1.202
 Pengu-1.193
hene-1.085
ゴン-1.070
sten-1.051
Token [
Token ID685
Feature activation-5.57e-02

Pos logit contributions
 AD+1.075
 Lak+1.020
 ACT+0.953
 Cent+0.939
ellar+0.927
Neg logit contributions
ーテ-1.920
 Pengu-1.898
hene-1.766
ゴン-1.740
sten-1.720
Token10
Token ID940
Feature activation-2.12e-01

Pos logit contributions
ーテ+0.424
 Pengu+0.421
hene+0.400
sten+0.395
ゴン+0.379
Neg logit contributions
 AD-0.417
 Lak-0.412
ellar-0.385
 ACT-0.384
 Cent-0.381
Token:
Token ID25
Feature activation+9.94e-01

Pos logit contributions
 Pengu+1.612
ーテ+1.609
hene+1.572
sten+1.543
punk+1.501
Neg logit contributions
 AD-1.456
 Lak-1.450
ellar-1.388
 Cent-1.335
 ACT-1.330
Token48
Token ID2780
Feature activation+3.70e-01

Pos logit contributions
 AD+4.232
 Lak+4.209
ellar+3.720
 ACT+3.524
 Cent+3.481
Neg logit contributions
 Pengu-11.053
ーテ-10.756
ゴン-10.191
hene-9.777
 tradem-9.548
Token]
Token ID60
Feature activation+2.73e-01

Pos logit contributions
 AD+2.964
 Lak+2.839
 ACT+2.719
 Cent+2.690
ellar+2.656
Neg logit contributions
ーテ-2.887
 Pengu-2.866
ゴン-2.557
hene-2.531
 tradem-2.454
Token <
Token ID1279
Feature activation+5.80e-02

Pos logit contributions
 AD+2.001
 Lak+1.931
 ACT+1.828
ellar+1.811
 Cent+1.810
Neg logit contributions
ーテ-2.244
 Pengu-2.232
hene-2.039
ゴン-2.005
sten-1.978
Tokenm
Token ID76
Feature activation+1.06e-01

Pos logit contributions
 AD+0.433
 Lak+0.419
 ACT+0.396
ellar+0.394
 Cent+0.393
Neg logit contributions
ーテ-0.467
 Pengu-0.465
hene-0.424
ゴン-0.416
sten-0.411
Tokenew
Token ID413
Feature activation-3.97e-01

Pos logit contributions
 AD+0.800
 Lak+0.775
 ACT+0.734
ellar+0.728
 Cent+0.727
Neg logit contributions
ーテ-0.843
 Pengu-0.841
hene-0.770
ゴン-0.750
sten-0.747
Token lol
Token ID19462
Feature activation-1.04e-01

Pos logit contributions
 AD+0.146
 Lak+0.142
ellar+0.135
 ACT+0.133
 Cent+0.132
Neg logit contributions
 Pengu-0.168
ーテ-0.166
hene-0.152
ゴン-0.148
sten-0.148
Token j
Token ID474
Feature activation+2.07e-01

Pos logit contributions
 Pengu+0.820
ーテ+0.816
hene+0.740
ゴン+0.724
sten+0.723
Neg logit contributions
 AD-0.780
 Lak-0.757
ellar-0.724
 ACT-0.718
 Cent-0.709
Tokenason
Token ID888
Feature activation+2.03e-02

Pos logit contributions
 AD+1.592
 Lak+1.537
 ACT+1.459
ellar+1.451
 Cent+1.447
Neg logit contributions
ーテ-1.629
 Pengu-1.619
hene-1.466
ゴン-1.445
sten-1.421
Token [
Token ID685
Feature activation+1.74e-01

Pos logit contributions
 AD+0.130
 Lak+0.126
ellar+0.118
 ACT+0.118
 Cent+0.116
Neg logit contributions
 Pengu-0.184
ーテ-0.184
hene-0.169
ゴン-0.166
sten-0.166
Token10
Token ID940
Feature activation+8.38e-03

Pos logit contributions
 AD+1.231
 Lak+1.195
 ACT+1.123
ellar+1.118
 Cent+1.114
Neg logit contributions
ーテ-1.451
 Pengu-1.445
hene-1.333
ゴン-1.301
sten-1.299
Token:
Token ID25
Feature activation+9.93e-01

Pos logit contributions
 AD+0.055
 Lak+0.054
ellar+0.051
 ACT+0.050
 Cent+0.050
Neg logit contributions
 Pengu-0.070
ーテ-0.070
hene-0.067
sten-0.065
ゴン-0.063
Token47
Token ID2857
Feature activation+3.76e-01

Pos logit contributions
 AD+4.177
 Lak+3.853
 ACT+3.558
ellar+3.442
 Cent+3.311
Neg logit contributions
ーテ-11.302
 Pengu-11.234
ゴン-10.555
 tradem-10.473
hene-10.320
Token]
Token ID60
Feature activation+3.04e-01

Pos logit contributions
 AD+3.015
 Lak+2.890
 ACT+2.766
 Cent+2.736
ellar+2.710
Neg logit contributions
ーテ-2.921
 Pengu-2.898
ゴン-2.586
hene-2.563
 tradem-2.474
Token <
Token ID1279
Feature activation+1.52e-01

Pos logit contributions
 AD+2.114
 Lak+1.974
 ACT+1.919
 Cent+1.845
ellar+1.842
Neg logit contributions
ーテ-2.676
 Pengu-2.619
hene-2.425
ゴン-2.375
sten-2.351
Tokenfen
Token ID41037
Feature activation-3.53e-01

Pos logit contributions
 AD+1.138
 Lak+1.101
 ACT+1.040
ellar+1.038
 Cent+1.033
Neg logit contributions
ーテ-1.224
 Pengu-1.216
hene-1.103
ゴン-1.086
sten-1.066
Tokenren
Token ID918
Feature activation+5.53e-02

Pos logit contributions
 Pengu+2.780
ーテ+2.769
hene+2.551
sten+2.537
ゴン+2.455
Neg logit contributions
 AD-2.613
 Lak-2.545
 ACT-2.385
 Cent-2.363
ellar-2.360
Token m
Token ID285
Feature activation+3.59e-01

Pos logit contributions
 AD+0.536
 Lak+0.519
 ACT+0.491
ellar+0.490
 Cent+0.487
Neg logit contributions
ーテ-0.584
 Pengu-0.583
hene-0.532
ゴン-0.521
sten-0.516
Token2
Token ID17
Feature activation+4.02e-01

Pos logit contributions
 AD+2.330
 Lak+2.214
iel+2.103
 Cent+2.098
ellar+2.074
Neg logit contributions
ーテ-3.276
 Pengu-3.235
ゴン-2.927
hene-2.914
sten-2.801
Tokenk
Token ID74
Feature activation+1.36e-01

Pos logit contributions
 AD+3.016
 Lak+2.919
 ACT+2.758
ellar+2.741
 Cent+2.737
Neg logit contributions
ーテ-3.243
 Pengu-3.224
hene-2.915
ゴン-2.891
sten-2.823
Token [
Token ID685
Feature activation+8.80e-03

Pos logit contributions
 AD+0.796
 Lak+0.760
 ACT+0.709
 Cent+0.701
ellar+0.696
Neg logit contributions
ーテ-1.320
 Pengu-1.309
hene-1.216
ゴン-1.198
sten-1.183
Token10
Token ID940
Feature activation-1.23e-01

Pos logit contributions
 AD+0.065
 Lak+0.064
ellar+0.060
 ACT+0.060
 Cent+0.059
Neg logit contributions
ーテ-0.068
 Pengu-0.068
hene-0.064
sten-0.063
ゴン-0.061
Token:
Token ID25
Feature activation+9.92e-01

Pos logit contributions
 Pengu+0.977
ーテ+0.975
hene+0.940
sten+0.922
punk+0.897
Neg logit contributions
 AD-0.834
 Lak-0.826
ellar-0.789
 Cent-0.762
 ACT-0.760
Token47
Token ID2857
Feature activation+3.82e-01

Pos logit contributions
 AD+4.141
 Lak+3.787
 ACT+3.496
ellar+3.357
 Cent+3.229
Neg logit contributions
ーテ-11.407
 Pengu-11.348
 tradem-10.609
ゴン-10.608
hene-10.325
Token]
Token ID60
Feature activation+1.73e-01

Pos logit contributions
 AD+3.057
 Lak+2.927
 ACT+2.804
 Cent+2.772
ellar+2.740
Neg logit contributions
ーテ-2.991
 Pengu-2.967
ゴン-2.653
hene-2.619
 tradem-2.545
Token <
Token ID1279
Feature activation+2.35e-02

Pos logit contributions
 AD+1.277
 Lak+1.237
 ACT+1.167
ellar+1.163
 Cent+1.160
Neg logit contributions
ーテ-1.405
 Pengu-1.402
hene-1.279
ゴン-1.254
sten-1.242
Tokenk
Token ID74
Feature activation-3.49e-01

Pos logit contributions
 AD+0.175
 Lak+0.170
 ACT+0.160
ellar+0.159
 Cent+0.159
Neg logit contributions
ーテ-0.189
 Pengu-0.189
hene-0.173
ゴン-0.169
sten-0.168
Tokenasse
Token ID21612
Feature activation-2.71e-01

Pos logit contributions
 Pengu+2.825
ーテ+2.809
hene+2.701
sten+2.630
punk+2.528
Neg logit contributions
 AD-2.481
 Lak-2.420
 ACT-2.263
 Cent-2.257
ellar-2.217
Token m
Token ID285
Feature activation+1.65e-01

Pos logit contributions
 Pengu+1.456
ーテ+1.443
hene+1.315
sten+1.283
ゴン+1.282
Neg logit contributions
 AD-1.363
 Lak-1.330
ellar-1.273
 ACT-1.252
 Cent-1.243
Token2
Token ID17
Feature activation+2.70e-01

Pos logit contributions
 AD+1.154
 Lak+1.109
 ACT+1.045
 Cent+1.044
ellar+1.035
Neg logit contributions
ーテ-1.425
 Pengu-1.409
hene-1.276
ゴン-1.274
sten-1.234
Tokenk
Token ID74
Feature activation+6.01e-02

Pos logit contributions
 AD+2.082
 Lak+2.017
 ACT+1.912
ellar+1.903
 Cent+1.897
Neg logit contributions
ーテ-2.096
 Pengu-2.092
hene-1.910
ゴン-1.859
sten-1.854
Token [
Token ID685
Feature activation+6.96e-02

Pos logit contributions
 AD+0.344
 Lak+0.328
 ACT+0.306
 Cent+0.302
ellar+0.300
Neg logit contributions
ーテ-0.594
 Pengu-0.589
hene-0.547
ゴン-0.539
sten-0.533
Token10
Token ID940
Feature activation-1.22e-01

Pos logit contributions
 AD+0.506
 Lak+0.494
ellar+0.463
 ACT+0.463
 Cent+0.459
Neg logit contributions
ーテ-0.559
 Pengu-0.557
hene-0.520
sten-0.510
ゴン-0.502
Token:
Token ID25
Feature activation+9.91e-01

Pos logit contributions
 Pengu+0.976
ーテ+0.973
hene+0.937
sten+0.919
punk+0.894
Neg logit contributions
 AD-0.828
 Lak-0.819
ellar-0.783
 Cent-0.756
 ACT-0.754
Token47
Token ID2857
Feature activation+4.26e-01

Pos logit contributions
 AD+4.155
 Lak+3.838
 ACT+3.529
ellar+3.382
 Cent+3.270
Neg logit contributions
ーテ-11.318
 Pengu-11.248
ゴン-10.565
 tradem-10.522
hene-10.317
Token]
Token ID60
Feature activation+2.56e-01

Pos logit contributions
 AD+3.379
 Lak+3.225
 ACT+3.094
 Cent+3.055
ellar+3.008
Neg logit contributions
ーテ-3.395
 Pengu-3.364
ゴン-3.016
hene-2.956
 tradem-2.920
Token <
Token ID1279
Feature activation+6.62e-02

Pos logit contributions
 AD+1.791
 Lak+1.685
 ACT+1.628
 Cent+1.573
ellar+1.568
Neg logit contributions
ーテ-2.237
 Pengu-2.193
hene-2.029
ゴン-1.991
sten-1.966
Tokenmom
Token ID32542
Feature activation-6.11e-02

Pos logit contributions
 AD+0.491
 Lak+0.475
 ACT+0.449
ellar+0.447
 Cent+0.446
Neg logit contributions
ーテ-0.536
 Pengu-0.534
hene-0.487
ゴン-0.478
sten-0.472
Tokenper
Token ID525
Feature activation-5.86e-02

Pos logit contributions
ーテ+0.555
 Pengu+0.553
hene+0.511
ゴン+0.501
sten+0.498
Neg logit contributions
 AD-0.393
 Lak-0.379
 ACT-0.354
ellar-0.352
 Cent-0.351
Token m
Token ID285
Feature activation+3.20e-01

Pos logit contributions
 AD+0.585
 Lak+0.566
 ACT+0.535
ellar+0.533
 Cent+0.531
Neg logit contributions
ーテ-0.637
 Pengu-0.635
hene-0.579
ゴン-0.568
sten-0.562
Token2
Token ID17
Feature activation+3.01e-01

Pos logit contributions
 AD+2.105
 Lak+2.010
iel+1.897
 Cent+1.895
 ACT+1.885
Neg logit contributions
ーテ-2.880
 Pengu-2.843
ゴン-2.579
hene-2.564
sten-2.472
Tokenk
Token ID74
Feature activation+1.16e-01

Pos logit contributions
 AD+2.309
 Lak+2.236
 ACT+2.118
ellar+2.106
 Cent+2.102
Neg logit contributions
ーテ-2.364
 Pengu-2.355
hene-2.147
ゴン-2.099
sten-2.083
Token [
Token ID685
Feature activation+4.58e-02

Pos logit contributions
 AD+0.653
 Lak+0.619
 ACT+0.578
 Cent+0.570
ellar+0.565
Neg logit contributions
ーテ-1.161
 Pengu-1.149
hene-1.069
ゴン-1.054
sten-1.040
Token10
Token ID940
Feature activation-1.06e-01

Pos logit contributions
 AD+0.336
 Lak+0.330
ellar+0.308
 ACT+0.308
 Cent+0.305
Neg logit contributions
ーテ-0.362
 Pengu-0.360
hene-0.338
sten-0.333
ゴン-0.325
Token:
Token ID25
Feature activation+9.90e-01

Pos logit contributions
 Pengu+0.849
ーテ+0.847
hene+0.814
sten+0.798
punk+0.776
Neg logit contributions
 AD-0.714
 Lak-0.707
ellar-0.674
 Cent-0.651
 ACT-0.650
Token47
Token ID2857
Feature activation+3.97e-01

Pos logit contributions
 AD+4.108
 Lak+3.763
 ACT+3.475
ellar+3.343
 Cent+3.199
Neg logit contributions
ーテ-11.390
 Pengu-11.321
ゴン-10.599
 tradem-10.590
hene-10.332
Token]
Token ID60
Feature activation+2.01e-01

Pos logit contributions
 AD+3.167
 Lak+3.028
 ACT+2.903
 Cent+2.868
ellar+2.831
Neg logit contributions
ーテ-3.130
 Pengu-3.104
ゴン-2.778
hene-2.734
 tradem-2.676
Token <
Token ID1279
Feature activation-7.12e-02

Pos logit contributions
 AD+1.418
 Lak+1.338
 ACT+1.290
 Cent+1.250
ellar+1.248
Neg logit contributions
ーテ-1.746
 Pengu-1.715
hene-1.583
ゴン-1.553
sten-1.532
Tokenl
Token ID75
Feature activation-2.97e-01

Pos logit contributions
ーテ+0.566
 Pengu+0.565
hene+0.517
ゴン+0.505
sten+0.503
Neg logit contributions
 AD-0.536
 Lak-0.519
 ACT-0.491
ellar-0.487
 Cent-0.487
Tokenucky
Token ID5309
Feature activation-3.50e-01

Pos logit contributions
 Pengu+2.269
ーテ+2.215
hene+2.111
sten+2.064
ゴン+1.992
Neg logit contributions
 AD-2.250
 Lak-2.204
 ACT-2.073
 Cent-2.058
ellar-2.047
Tokenam
Token ID321
Feature activation-2.09e-02

Pos logit contributions
 Pengu+2.090
ーテ+2.064
hene+2.030
sten+1.990
punk+1.905
Neg logit contributions
 AD-2.541
 Lak-2.536
ellar-2.405
 ACT-2.374
 Cent-2.360
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
ーテ+0.160
 Pengu+0.160
hene+0.148
sten+0.144
ゴン+0.141
Neg logit contributions
 AD-0.159
 Lak-0.156
ellar-0.148
 ACT-0.146
 Cent-0.145
Token Som
Token ID9995
Feature activation+2.56e-02

Pos logit contributions
 AD+2.005
 Lak+1.945
 ACT+1.856
 Cent+1.849
ellar+1.831
Neg logit contributions
ーテ-1.758
 Pengu-1.747
hene-1.565
ゴン-1.542
sten-1.517
Tokenerville
Token ID33487
Feature activation-1.08e-01

Pos logit contributions
 AD+0.209
 Lak+0.202
 ACT+0.191
 Cent+0.191
ellar+0.188
Neg logit contributions
 Pengu-0.185
ーテ-0.184
hene-0.172
sten-0.168
ゴン-0.162
Token's
Token ID338
Feature activation-6.39e-01

Pos logit contributions
ーテ+0.800
 Pengu+0.798
hene+0.748
sten+0.733
punk+0.704
Neg logit contributions
 AD-0.827
 Lak-0.810
ellar-0.775
 ACT-0.760
 Cent-0.757
Token Union
Token ID4479
Feature activation-1.06e+00

Pos logit contributions
 Pengu+4.654
ーテ+4.613
hene+4.309
sten+4.148
ゴン+4.130
Neg logit contributions
 AD-4.926
 Lak-4.852
ellar-4.586
 ACT-4.503
 Cent-4.427
Token Square
Token ID9276
Feature activation-3.41e-01

Pos logit contributions
hene+4.047
 Pengu+3.942
sten+3.880
ーテ+3.706
 Disorder+3.429
Neg logit contributions
 AD-10.343
 Lak-10.340
ellar-10.267
iel-9.938
 ACT-9.675
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
ーテ+2.376
 Pengu+2.354
hene+2.286
sten+2.206
punk+2.133
Neg logit contributions
 AD-2.558
 Lak-2.544
ellar-2.501
 ACT-2.367
 Cent-2.336
Token
Token ID198
Feature activation-2.08e-01

Pos logit contributions
ーテ+3.445
 Pengu+3.443
hene+3.181
ゴン+3.097
sten+3.091
Neg logit contributions
 AD-2.645
 Lak-2.585
ellar-2.420
 ACT-2.406
 Cent-2.380
Token
Token ID198
Feature activation-2.46e-01

Pos logit contributions
ーテ+1.587
 Pengu+1.578
hene+1.543
sten+1.518
punk+1.472
Neg logit contributions
 AD-1.424
 Lak-1.417
ellar-1.363
 Cent-1.306
 ACT-1.303
TokenThe
Token ID464
Feature activation-2.35e-01

Pos logit contributions
ーテ+1.922
 Pengu+1.913
hene+1.787
sten+1.767
ゴン+1.744
Neg logit contributions
 AD-1.793
 Lak-1.764
ellar-1.658
 ACT-1.652
 Cent-1.631
Token
Token ID114
Feature activation-3.00e-01

Pos logit contributions
ーテ+1.878
 Pengu+1.824
hene+1.627
sten+1.568
ゴン+1.554
Neg logit contributions
 AD-3.639
 Lak-3.519
 ACT-3.416
ellar-3.365
 Cent-3.334
Token
Token ID140
Feature activation-8.15e-01

Pos logit contributions
ーテ+1.535
 Pengu+1.499
hene+1.306
sten+1.249
ゴン+1.249
Neg logit contributions
 AD-3.076
 Lak-3.011
ellar-2.901
 ACT-2.898
 Cent-2.875
Token
Token ID109
Feature activation-7.34e-01

Pos logit contributions
ーテ+4.264
 Pengu+3.996
hene+3.723
sten+3.550
ゴン+3.458
Neg logit contributions
 AD-7.801
 Lak-7.510
 ACT-7.338
ellar-7.186
 Cent-7.095
Tokenы
Token ID45035
Feature activation-2.49e-01

Pos logit contributions
ーテ+3.332
 Pengu+3.033
hene+2.638
ゴン+2.464
sten+2.460
Neg logit contributions
 AD-7.754
 Lak-7.579
 ACT-7.381
ellar-7.299
 Cent-7.223
Token
Token ID12466
Feature activation-5.24e-01

Pos logit contributions
ーテ+1.606
 Pengu+1.593
hene+1.441
sten+1.381
ゴン+1.378
Neg logit contributions
 AD-2.202
 Lak-2.153
ellar-2.079
 ACT-2.052
 Cent-2.027
Token
Token ID110
Feature activation-1.02e+00

Pos logit contributions
ーテ+3.059
 Pengu+2.993
hene+2.711
ゴン+2.583
sten+2.578
Neg logit contributions
 AD-4.916
 Lak-4.781
ellar-4.597
 ACT-4.563
 Cent-4.508
Token
Token ID220
Feature activation-4.95e-01

Pos logit contributions
ーテ+7.160
 Pengu+7.036
hene+6.356
ゴン+6.178
sten+6.147
Neg logit contributions
 AD-7.460
 Lak-7.418
ellar-7.386
 ACT-6.893
 Cent-6.886
Tokenс
Token ID21727
Feature activation-6.41e-01

Pos logit contributions
ーテ+2.466
 Pengu+2.410
hene+2.111
ゴン+2.025
sten+1.996
Neg logit contributions
 AD-5.125
 Lak-4.969
ellar-4.873
 ACT-4.823
 Cent-4.767
Token
Token ID140
Feature activation-6.56e-01

Pos logit contributions
ーテ+4.321
 Pengu+4.236
hene+3.792
ゴン+3.672
sten+3.662
Neg logit contributions
 AD-5.320
 Lak-5.226
ellar-5.113
 ACT-4.942
 Cent-4.928
Token
Token ID123
Feature activation-6.79e-01

Pos logit contributions
ーテ+4.277
 Pengu+4.019
hene+3.771
sten+3.667
ゴン+3.621
Neg logit contributions
 AD-5.728
 Lak-5.443
 ACT-5.294
ellar-5.281
 Cent-5.219
Tokenе
Token ID16843
Feature activation-3.18e-01

Pos logit contributions
ーテ+2.981
 Pengu+2.917
hene+2.512
ゴン+2.379
sten+2.322
Neg logit contributions
 AD-7.218
 Lak-7.053
 ACT-6.855
ellar-6.850
 Cent-6.786
TokenAC
Token ID2246
Feature activation+1.28e-01

Pos logit contributions
 AD+1.417
 ACT+0.899
 Lak+0.860
 Cent+0.627
ellar+0.498
Neg logit contributions
ーテ-12.977
 Pengu-12.969
ゴン-12.117
hene-11.913
 tradem-11.909
TokenLU
Token ID41596
Feature activation+2.05e-01

Pos logit contributions
 AD+0.993
 Lak+0.960
 ACT+0.911
ellar+0.906
 Cent+0.902
Neg logit contributions
ーテ-0.994
 Pengu-0.990
hene-0.896
ゴン-0.881
sten-0.869
Token_
Token ID62
Feature activation+3.21e-01

Pos logit contributions
 AD+1.485
 Lak+1.437
 ACT+1.356
ellar+1.349
 Cent+1.345
Neg logit contributions
ーテ-1.697
 Pengu-1.690
hene-1.549
ゴン-1.518
sten-1.506
TokenOR
Token ID1581
Feature activation-4.36e-02

Pos logit contributions
 AD+2.073
 Lak+1.976
 ACT+1.871
ellar+1.836
 Cent+1.829
Neg logit contributions
ーテ-2.940
 Pengu-2.936
hene-2.682
ゴン-2.661
sten-2.605
Token)
Token ID8
Feature activation-9.40e-02

Pos logit contributions
ーテ+0.349
 Pengu+0.347
hene+0.328
sten+0.322
ゴン+0.313
Neg logit contributions
 AD-0.307
 Lak-0.301
ellar-0.287
 ACT-0.279
 Cent-0.278
Token May
Token ID1737
Feature activation-9.97e-01

Pos logit contributions
ーテ+0.788
 Pengu+0.787
hene+0.720
ゴン+0.705
sten+0.698
Neg logit contributions
 AD-0.665
 Lak-0.643
 ACT-0.605
ellar-0.605
 Cent-0.600
Token 29
Token ID2808
Feature activation-2.40e-01

Pos logit contributions
 Pengu+5.510
ーテ+5.394
hene+5.139
sten+4.827
 Disorder+4.504
Neg logit contributions
 AD-8.943
 Lak-8.878
ellar-8.702
 ACT-8.439
 Cent-8.309
Token,
Token ID11
Feature activation-2.29e-02

Pos logit contributions
 Pengu+1.657
ーテ+1.656
hene+1.608
sten+1.585
punk+1.524
Neg logit contributions
 AD-1.804
 Lak-1.799
ellar-1.764
 ACT-1.675
 Cent-1.675
Token 2017
Token ID2177
Feature activation-2.00e-02

Pos logit contributions
 Pengu+0.226
ーテ+0.225
hene+0.214
sten+0.210
punk+0.205
Neg logit contributions
 AD-0.116
 Lak-0.115
ellar-0.110
 Cent-0.103
 ACT-0.103
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
ーテ+0.191
 Pengu+0.190
hene+0.176
ゴン+0.173
sten+0.172
Neg logit contributions
 AD-0.122
 Lak-0.116
 ACT-0.109
 Cent-0.107
ellar-0.107
Token
Token ID198
Feature activation+9.04e-02

Pos logit contributions
ーテ+0.964
 Pengu+0.962
hene+0.938
sten+0.921
punk+0.893
Neg logit contributions
 AD-0.841
 Lak-0.834
ellar-0.802
 Cent-0.771
 ACT-0.768
Token
Token ID109
Feature activation-4.30e-01

Pos logit contributions
ーテ+1.096
 Pengu+1.085
hene+0.986
ゴン+0.967
sten+0.956
Neg logit contributions
 AD-1.148
 Lak-1.113
 ACT-1.054
ellar-1.051
 Cent-1.044
Tokenе
Token ID16843
Feature activation-2.04e-01

Pos logit contributions
ーテ+2.136
 Pengu+2.048
hene+1.804
ゴン+1.690
sten+1.686
Neg logit contributions
 AD-4.464
 Lak-4.387
ellar-4.215
 ACT-4.214
 Cent-4.184
Tokenл
Token ID30143
Feature activation-9.34e-03

Pos logit contributions
ーテ+1.042
 Pengu+1.029
hene+0.890
ゴン+0.859
sten+0.849
Neg logit contributions
 AD-2.119
 Lak-2.069
 ACT-1.991
ellar-1.984
 Cent-1.978
Tokenь
Token ID45367
Feature activation-4.00e-01

Pos logit contributions
ーテ+0.072
 Pengu+0.072
hene+0.066
ゴン+0.064
sten+0.063
Neg logit contributions
 AD-0.073
 Lak-0.070
 ACT-0.067
ellar-0.066
 Cent-0.066
Token
Token ID12466
Feature activation-7.30e-01

Pos logit contributions
ーテ+3.166
 Pengu+3.158
hene+2.919
ゴン+2.841
sten+2.832
Neg logit contributions
 AD-2.872
 Lak-2.814
ellar-2.698
 ACT-2.651
 Cent-2.603
Token
Token ID110
Feature activation-9.89e-01

Pos logit contributions
ーテ+4.205
 Pengu+4.105
hene+3.764
sten+3.582
ゴン+3.538
Neg logit contributions
 AD-6.708
 Lak-6.554
ellar-6.304
 ACT-6.236
 Cent-6.135
Tokenо
Token ID15166
Feature activation-5.48e-01

Pos logit contributions
ーテ+6.610
 Pengu+6.473
hene+5.841
sten+5.707
ゴン+5.664
Neg logit contributions
 AD-7.592
 Lak-7.589
ellar-7.511
 Cent-7.095
 ACT-7.018
Tokenе
Token ID16843
Feature activation-3.33e-01

Pos logit contributions
 Pengu+3.389
ーテ+3.383
hene+2.965
sten+2.965
ゴン+2.939
Neg logit contributions
 AD-4.871
 Lak-4.739
ellar-4.677
 ACT-4.530
 Cent-4.479
Tokenн
Token ID22177
Feature activation-5.90e-01

Pos logit contributions
ーテ+2.211
 Pengu+2.194
hene+1.957
sten+1.917
ゴン+1.913
Neg logit contributions
 AD-2.881
 Lak-2.819
ellar-2.730
 ACT-2.684
 Cent-2.657
Tokenн
Token ID22177
Feature activation-5.97e-01

Pos logit contributions
ーテ+3.264
 Pengu+3.229
hene+2.890
sten+2.769
ゴン+2.733
Neg logit contributions
 AD-5.648
 Lak-5.545
ellar-5.330
 ACT-5.297
 Cent-5.271
Tokenо
Token ID15166
Feature activation-3.19e-01

Pos logit contributions
ーテ+2.521
 Pengu+2.414
hene+2.060
sten+1.962
ゴン+1.895
Neg logit contributions
 AD-6.561
 Lak-6.486
ellar-6.239
 ACT-6.227
 Cent-6.169
TokenBo
Token ID16635
Feature activation-6.13e-01

Pos logit contributions
ーテ+0.735
 Pengu+0.734
hene+0.677
sten+0.662
ゴン+0.659
Neg logit contributions
 AD-0.679
 Lak-0.658
 ACT-0.624
ellar-0.617
 Cent-0.617
Tokenhem
Token ID4411
Feature activation-1.23e-01

Pos logit contributions
hene+3.970
 Pengu+3.918
ーテ+3.787
sten+3.723
punk+3.529
Neg logit contributions
 AD-5.051
 Lak-4.986
 ACT-4.745
 Cent-4.671
ellar-4.618
Tokenian
Token ID666
Feature activation-4.32e-01

Pos logit contributions
ーテ+1.061
 Pengu+1.056
hene+0.966
ゴン+0.953
sten+0.939
Neg logit contributions
 AD-0.860
 Lak-0.828
 ACT-0.779
ellar-0.776
 Cent-0.774
Token sp
Token ID599
Feature activation-4.52e-01

Pos logit contributions
 Pengu+3.674
ーテ+3.634
hene+3.351
ゴン+3.269
sten+3.245
Neg logit contributions
 AD-2.934
 Lak-2.872
ellar-2.697
 ACT-2.676
 Cent-2.644
Tokenarrow
Token ID6018
Feature activation-2.79e-01

Pos logit contributions
hene+2.124
 Pengu+2.005
sten+1.841
ーテ+1.775
punk+1.693
Neg logit contributions
 Lak-4.676
 AD-4.652
 Cent-4.409
 ACT-4.383
 yeast-4.272
Token slaying
Token ID45183
Feature activation-9.86e-01

Pos logit contributions
ーテ+2.166
 Pengu+2.160
hene+1.972
ゴン+1.928
sten+1.921
Neg logit contributions
 AD-2.118
 Lak-2.060
ellar-1.955
 ACT-1.952
 Cent-1.940
Token bow
Token ID9563
Feature activation-6.00e-01

Pos logit contributions
 Pengu+5.602
ーテ+5.327
hene+4.915
sten+4.611
ゴン+4.561
Neg logit contributions
 Lak-8.870
 AD-8.731
ellar-8.583
 Cent-8.385
 ACT-8.267
Token'?
Token ID30960
Feature activation-1.20e-01

Pos logit contributions
 Pengu+4.088
ーテ+4.050
hene+3.720
sten+3.585
punk+3.481
Neg logit contributions
 AD-4.898
 Lak-4.846
ellar-4.597
 Cent-4.553
 ACT-4.519
Token
Token ID198
Feature activation+1.39e-01

Pos logit contributions
 Pengu+0.998
ーテ+0.977
hene+0.902
sten+0.873
punk+0.856
Neg logit contributions
 AD-0.828
 Lak-0.809
ellar-0.798
 ACT-0.760
 Cent-0.751
Token
Token ID198
Feature activation+3.95e-02

Pos logit contributions
 AD+0.894
 Lak+0.867
ellar+0.815
 ACT+0.807
 Cent+0.802
Neg logit contributions
ーテ-1.237
 Pengu-1.231
hene-1.155
sten-1.129
ゴン-1.115
Token"
Token ID1
Feature activation+3.45e-01

Pos logit contributions
 AD+0.274
 Lak+0.265
 ACT+0.249
ellar+0.248
 Cent+0.248
Neg logit contributions
ーテ-0.335
 Pengu-0.334
hene-0.309
ゴン-0.304
sten-0.302
Token
Token ID110
Feature activation-7.80e-01

Pos logit contributions
ーテ+2.885
 Pengu+2.753
hene+2.363
sten+2.149
ゴン+2.064
Neg logit contributions
 AD-8.145
 Lak-7.903
ellar-7.718
 ACT-7.654
 Cent-7.479
Tokenа
Token ID16142
Feature activation-4.31e-01

Pos logit contributions
ーテ+5.614
 Pengu+5.520
hene+5.135
sten+4.921
ゴン+4.870
Neg logit contributions
 AD-5.866
 Lak-5.801
ellar-5.668
 ACT-5.414
 Cent-5.363
Tokenл
Token ID30143
Feature activation-2.29e-01

Pos logit contributions
 Pengu+2.696
ーテ+2.690
hene+2.442
sten+2.351
ゴン+2.310
Neg logit contributions
 AD-3.817
 Lak-3.749
ellar-3.621
 ACT-3.546
 Cent-3.521
Tokenи
Token ID18849
Feature activation-2.46e-01

Pos logit contributions
ーテ+1.299
 Pengu+1.291
hene+1.135
sten+1.091
ゴン+1.088
Neg logit contributions
 AD-2.221
 Lak-2.176
 ACT-2.085
ellar-2.080
 Cent-2.066
Token
Token ID12466
Feature activation-5.49e-01

Pos logit contributions
ーテ+1.675
 Pengu+1.665
hene+1.505
ゴン+1.458
sten+1.458
Neg logit contributions
 AD-2.102
 Lak-2.056
ellar-1.966
 ACT-1.955
 Cent-1.934
Token
Token ID110
Feature activation-9.84e-01

Pos logit contributions
ーテ+3.200
 Pengu+3.122
hene+2.832
ゴン+2.705
sten+2.683
Neg logit contributions
 AD-5.135
 Lak-5.009
ellar-4.819
 ACT-4.767
 Cent-4.706
Token
Token ID12466
Feature activation-7.51e-01

Pos logit contributions
ーテ+6.796
 Pengu+6.705
hene+6.051
ゴン+5.892
sten+5.867
Neg logit contributions
 AD-7.310
 Lak-7.267
ellar-7.226
 Cent-6.785
 ACT-6.753
Token
Token ID109
Feature activation-7.32e-01

Pos logit contributions
ーテ+4.450
 Pengu+4.334
hene+3.992
sten+3.812
ゴン+3.802
Neg logit contributions
 AD-6.799
 Lak-6.578
ellar-6.357
 ACT-6.270
 Cent-6.217
Tokenо
Token ID15166
Feature activation-4.94e-01

Pos logit contributions
ーテ+3.243
 Pengu+3.001
hene+2.600
sten+2.488
ゴン+2.445
Neg logit contributions
 AD-7.811
 Lak-7.690
 ACT-7.415
ellar-7.390
 Cent-7.369
Tokenе
Token ID16843
Feature activation-3.18e-01

Pos logit contributions
ーテ+2.760
 Pengu+2.729
hene+2.342
sten+2.331
ゴン+2.312
Neg logit contributions
 AD-4.708
 Lak-4.615
ellar-4.508
 ACT-4.399
 Cent-4.356
Tokenв
Token ID38857
Feature activation-6.05e-01

Pos logit contributions
ーテ+2.032
 Pengu+2.014
hene+1.781
sten+1.736
ゴン+1.734
Neg logit contributions
 AD-2.813
 Lak-2.763
ellar-2.681
 ACT-2.624
 Cent-2.603
Token fun
Token ID1257
Feature activation+2.48e-02

Pos logit contributions
 Pengu+1.124
ーテ+1.099
hene+1.021
sten+0.982
ゴン+0.968
Neg logit contributions
 AD-0.988
 Lak-0.961
ellar-0.933
 ACT-0.898
 Cent-0.898
Token.
Token ID13
Feature activation-8.19e-02

Pos logit contributions
 AD+0.171
 Lak+0.165
 ACT+0.155
ellar+0.154
 Cent+0.154
Neg logit contributions
ーテ-0.213
 Pengu-0.212
hene-0.195
ゴン-0.191
sten-0.189
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
 Pengu+0.714
ーテ+0.709
hene+0.650
sten+0.634
ゴン+0.630
Neg logit contributions
 AD-0.543
 Lak-0.525
ellar-0.508
 ACT-0.491
 Cent-0.484
Token
Token ID198
Feature activation-5.74e-01

Pos logit contributions
ーテ+1.928
 Pengu+1.913
hene+1.893
sten+1.865
punk+1.793
Neg logit contributions
 AD-1.803
 Lak-1.792
ellar-1.735
 Cent-1.665
 ACT-1.652
TokenX
Token ID55
Feature activation-1.52e-02

Pos logit contributions
ーテ+3.102
 Pengu+3.070
hene+2.744
sten+2.684
ゴン+2.654
Neg logit contributions
 AD-5.519
 Lak-5.419
ellar-5.179
 ACT-5.177
 Cent-5.100
Tokenavier
Token ID19492
Feature activation-9.45e-01

Pos logit contributions
ーテ+0.158
 Pengu+0.157
hene+0.149
sten+0.145
ゴン+0.144
Neg logit contributions
 AD-0.075
 Lak-0.072
 ACT-0.065
ellar-0.065
 Cent-0.065
Token Gu
Token ID1962
Feature activation-1.88e-01

Pos logit contributions
 Pengu+4.773
ーテ+4.501
 Fra+4.302
hene+4.257
 Hemp+4.093
Neg logit contributions
ellar-8.503
 Lak-8.394
 AD-8.295
 yeast-8.097
ller-8.080
Tokenil
Token ID346
Feature activation+3.02e-01

Pos logit contributions
 Pengu+1.505
ーテ+1.479
hene+1.441
sten+1.400
punk+1.348
Neg logit contributions
 AD-1.332
 Lak-1.288
 ACT-1.221
 Cent-1.209
ellar-1.182
Tokenbert
Token ID4835
Feature activation-4.34e-01

Pos logit contributions
 AD+1.965
 Lak+1.885
 ACT+1.774
ellar+1.765
 Cent+1.761
Neg logit contributions
ーテ-2.731
 Pengu-2.721
hene-2.485
ゴン-2.466
sten-2.415
Token :
Token ID1058
Feature activation-3.55e-01

Pos logit contributions
ーテ+2.530
 Pengu+2.523
hene+2.342
sten+2.297
punk+2.191
Neg logit contributions
 AD-3.785
 Lak-3.743
ellar-3.676
 ACT-3.536
 Cent-3.493
Token Well
Token ID3894
Feature activation-1.48e-01

Pos logit contributions
 Pengu+2.914
ーテ+2.790
hene+2.589
sten+2.522
 Disorder+2.478
Neg logit contributions
ellar-2.421
 AD-2.418
 Lak-2.369
iel-2.220
 ACT-2.200
Token
Token ID111
Feature activation-3.88e-01

Pos logit contributions
ーテ+0.029
 Pengu+0.029
hene+0.025
ゴン+0.024
sten+0.024
Neg logit contributions
 AD-0.051
 Lak-0.050
 ACT-0.048
ellar-0.048
 Cent-0.048
Tokenи
Token ID18849
Feature activation+1.26e-01

Pos logit contributions
ーテ+1.738
 Pengu+1.736
hene+1.488
ゴン+1.414
sten+1.401
Neg logit contributions
 AD-4.172
 Lak-4.081
ellar-3.946
 ACT-3.941
 Cent-3.912
Tokenе
Token ID16843
Feature activation-8.55e-02

Pos logit contributions
 AD+1.049
 Lak+1.018
 ACT+0.968
 Cent+0.962
ellar+0.961
Neg logit contributions
ーテ-0.907
 Pengu-0.904
hene-0.814
ゴン-0.797
sten-0.785
Token
Token ID12466
Feature activation-4.95e-01

Pos logit contributions
ーテ+0.689
 Pengu+0.686
hene+0.626
ゴン+0.611
sten+0.609
Neg logit contributions
 AD-0.630
 Lak-0.611
ellar-0.582
 ACT-0.576
 Cent-0.570
Token
Token ID116
Feature activation-4.10e-01

Pos logit contributions
ーテ+2.903
 Pengu+2.851
hene+2.575
ゴン+2.474
sten+2.457
Neg logit contributions
 AD-4.635
 Lak-4.503
ellar-4.331
 ACT-4.296
 Cent-4.257
Token
Token ID140
Feature activation-9.41e-01

Pos logit contributions
ーテ+3.083
 Pengu+3.065
hene+2.757
sten+2.699
ゴン+2.691
Neg logit contributions
 AD-3.162
 Lak-3.093
ellar-2.956
 ACT-2.900
 Cent-2.883
Token
Token ID115
Feature activation-3.51e-01

Pos logit contributions
ーテ+4.387
 Pengu+4.255
hene+3.982
sten+3.707
ゴン+3.680
Neg logit contributions
 AD-9.203
 Lak-9.129
ellar-8.690
 ACT-8.590
 Cent-8.529
Token
Token ID12466
Feature activation-5.25e-01

Pos logit contributions
ーテ+3.067
 Pengu+3.041
hene+2.790
ゴン+2.750
sten+2.736
Neg logit contributions
 AD-2.308
 Lak-2.237
ellar-2.125
 ACT-2.090
 Cent-2.070
Token
Token ID110
Feature activation-8.32e-01

Pos logit contributions
ーテ+3.122
 Pengu+3.067
hene+2.794
ゴン+2.693
sten+2.649
Neg logit contributions
 AD-4.846
 Lak-4.720
ellar-4.531
 ACT-4.488
 Cent-4.446
Tokenа
Token ID16142
Feature activation-3.86e-01

Pos logit contributions
ーテ+5.613
 Pengu+5.509
hene+4.984
ゴン+4.849
sten+4.809
Neg logit contributions
 AD-6.569
 Lak-6.503
ellar-6.401
 Cent-6.086
 ACT-6.085
Token
Token ID140
Feature activation-4.23e-01

Pos logit contributions
 Pengu+2.512
ーテ+2.508
hene+2.237
ゴン+2.187
sten+2.173
Neg logit contributions
 AD-3.359
 Lak-3.278
ellar-3.182
 ACT-3.126
 Cent-3.103
Token of
Token ID286
Feature activation+3.65e-01

Pos logit contributions
ーテ+0.520
 Pengu+0.519
hene+0.468
ゴン+0.455
sten+0.452
Neg logit contributions
 AD-0.598
 Lak-0.580
 ACT-0.552
ellar-0.550
 Cent-0.548
Token weakness
Token ID10453
Feature activation-1.41e-01

Pos logit contributions
 AD+2.632
 Lak+2.541
 ACT+2.396
 Cent+2.376
ellar+2.369
Neg logit contributions
ーテ-3.044
 Pengu-3.024
hene-2.767
ゴン-2.723
sten-2.684
Token if
Token ID611
Feature activation+7.38e-03

Pos logit contributions
ーテ+1.147
 Pengu+1.144
hene+1.057
sten+1.026
ゴン+1.017
Neg logit contributions
 AD-1.009
 Lak-0.980
ellar-0.935
 ACT-0.920
 Cent-0.916
Token not
Token ID407
Feature activation-8.24e-02

Pos logit contributions
 AD+0.054
 Lak+0.053
ellar+0.050
 ACT+0.050
 Cent+0.050
Neg logit contributions
 Pengu-0.059
ーテ-0.059
hene-0.054
sten-0.052
ゴン-0.052
Token madness
Token ID23208
Feature activation-2.57e-01

Pos logit contributions
 Pengu+0.665
ーテ+0.656
hene+0.609
sten+0.592
ゴン+0.583
Neg logit contributions
 AD-0.587
 Lak-0.580
ellar-0.558
 Cent-0.542
 ACT-0.538
Token"[
Token ID17912
Feature activation-9.33e-01

Pos logit contributions
ーテ+2.135
 Pengu+2.131
hene+1.971
sten+1.924
ゴン+1.901
Neg logit contributions
 AD-1.773
 Lak-1.742
ellar-1.655
 ACT-1.611
 Cent-1.609
Token41
Token ID3901
Feature activation-2.98e-01

Pos logit contributions
ーテ+5.931
 Pengu+5.831
hene+5.410
sten+5.311
ゴン+5.032
Neg logit contributions
 AD-7.739
 Lak-7.655
ellar-7.311
 ACT-7.289
 Cent-7.178
Token]
Token ID60
Feature activation-1.25e-01

Pos logit contributions
ーテ+2.098
 Pengu+2.088
hene+1.880
ゴン+1.840
sten+1.818
Neg logit contributions
 AD-2.508
 Lak-2.440
 ACT-2.323
ellar-2.314
 Cent-2.304
Token and
Token ID290
Feature activation-2.45e-01

Pos logit contributions
 Pengu+1.010
ーテ+1.005
hene+0.928
sten+0.900
ゴン+0.892
Neg logit contributions
 AD-0.893
 Lak-0.865
ellar-0.835
 ACT-0.821
 Cent-0.811
Token Peterson
Token ID18914
Feature activation-2.56e-01

Pos logit contributions
 Pengu+1.945
ーテ+1.930
hene+1.771
sten+1.724
ゴン+1.692
Neg logit contributions
 AD-1.752
 Lak-1.699
ellar-1.671
 ACT-1.619
 Cent-1.601
Token conc
Token ID1673
Feature activation+6.73e-02

Pos logit contributions
 Pengu+2.036
ーテ+2.018
hene+1.854
sten+1.808
ゴン+1.782
Neg logit contributions
 AD-1.857
 Lak-1.808
ellar-1.748
 ACT-1.709
 Cent-1.702
Tokenы
Token ID45035
Feature activation+4.69e-02

Pos logit contributions
ーテ+2.332
 Pengu+2.294
hene+2.028
sten+1.952
ゴン+1.942
Neg logit contributions
 AD-3.605
 Lak-3.532
ellar-3.408
 ACT-3.370
 Cent-3.336
Token
Token ID141
Feature activation-1.90e-01

Pos logit contributions
 AD+0.383
 Lak+0.372
ellar+0.353
 ACT+0.353
 Cent+0.351
Neg logit contributions
ーテ-0.344
 Pengu-0.343
hene-0.310
ゴン-0.303
sten-0.300
Token
Token ID227
Feature activation-1.19e-01

Pos logit contributions
ーテ+1.142
 Pengu+1.104
hene+0.998
sten+0.958
ゴン+0.948
Neg logit contributions
 AD-1.805
 Lak-1.749
ellar-1.692
 ACT-1.690
 Cent-1.661
Token
Token ID12466
Feature activation-5.28e-01

Pos logit contributions
ーテ+0.931
 Pengu+0.926
hene+0.846
ゴン+0.824
sten+0.820
Neg logit contributions
 AD-0.896
 Lak-0.873
ellar-0.832
 ACT-0.822
 Cent-0.815
Token
Token ID116
Feature activation-3.65e-01

Pos logit contributions
ーテ+3.045
 Pengu+2.989
hene+2.697
ゴン+2.573
sten+2.565
Neg logit contributions
 AD-4.976
 Lak-4.836
ellar-4.695
 ACT-4.625
 Cent-4.569
Token
Token ID140
Feature activation-9.01e-01

Pos logit contributions
ーテ+2.750
 Pengu+2.720
hene+2.458
sten+2.412
ゴン+2.396
Neg logit contributions
 AD-2.830
 Lak-2.767
ellar-2.652
 ACT-2.600
 Cent-2.582
Token
Token ID115
Feature activation-3.36e-01

Pos logit contributions
ーテ+4.108
 Pengu+3.930
hene+3.682
sten+3.480
ゴン+3.422
Neg logit contributions
 AD-9.004
 Lak-8.808
ellar-8.600
 ACT-8.381
 Cent-8.280
Token
Token ID220
Feature activation-1.34e-01

Pos logit contributions
ーテ+2.247
 Pengu+2.205
hene+1.973
ゴン+1.931
sten+1.923
Neg logit contributions
 AD-2.925
 Lak-2.851
ellar-2.738
 ACT-2.714
 Cent-2.689
Token
Token ID141
Feature activation-2.43e-01

Pos logit contributions
ーテ+0.741
 Pengu+0.736
hene+0.645
ゴン+0.623
sten+0.615
Neg logit contributions
 AD-1.342
 Lak-1.302
ellar-1.260
 ACT-1.258
 Cent-1.245
Token
Token ID235
Feature activation-3.55e-01

Pos logit contributions
ーテ+1.750
 Pengu+1.718
hene+1.565
sten+1.506
ゴン+1.504
Neg logit contributions
 AD-2.002
 Lak-1.948
ellar-1.875
 ACT-1.852
 Cent-1.833
Tokenт
Token ID20375
Feature activation-2.36e-01

Pos logit contributions
ーテ+2.078
 Pengu+2.047
hene+1.822
sten+1.748
ゴン+1.724
Neg logit contributions
 AD-3.351
 Lak-3.263
ellar-3.171
 ACT-3.134
 Cent-3.104
Token
Token ID220
Feature activation-2.33e-01

Pos logit contributions
ーテ+3.869
 Pengu+3.813
hene+3.499
sten+3.389
ゴン+3.333
Neg logit contributions
 AD-4.014
 Lak-3.951
ellar-3.822
 ACT-3.691
 Cent-3.674
Tokenс
Token ID21727
Feature activation-4.83e-01

Pos logit contributions
ーテ+1.287
 Pengu+1.279
hene+1.117
ゴン+1.077
sten+1.068
Neg logit contributions
 AD-2.324
 Lak-2.258
ellar-2.191
 ACT-2.177
 Cent-2.154
Tokenо
Token ID15166
Feature activation-4.32e-01

Pos logit contributions
ーテ+3.096
 Pengu+3.071
hene+2.710
ゴン+2.625
sten+2.624
Neg logit contributions
 AD-4.233
 Lak-4.155
ellar-4.044
 ACT-3.933
 Cent-3.918
Token
Token ID12466
Feature activation-6.39e-01

Pos logit contributions
ーテ+2.764
 Pengu+2.733
sten+2.421
hene+2.421
ゴン+2.385
Neg logit contributions
 AD-3.794
 Lak-3.716
ellar-3.624
 ACT-3.520
 Cent-3.499
Token
Token ID118
Feature activation-3.09e-01

Pos logit contributions
ーテ+3.640
 Pengu+3.544
hene+3.250
sten+3.083
ゴン+3.067
Neg logit contributions
 AD-5.998
 Lak-5.792
ellar-5.662
 ACT-5.560
 Cent-5.481
Tokenо�
Token ID25443
Feature activation-8.93e-01

Pos logit contributions
ーテ+1.580
 Pengu+1.547
hene+1.315
sten+1.297
ゴン+1.284
Neg logit contributions
 AD-3.185
 Lak-3.103
ellar-3.001
 ACT-2.983
 Cent-2.964
Token
Token ID119
Feature activation-3.22e-01

Pos logit contributions
ーテ+3.937
 Pengu+3.730
hene+3.321
sten+3.129
ゴン+3.107
Neg logit contributions
 AD-9.348
 Lak-8.970
ellar-8.780
 ACT-8.701
 Cent-8.634
Tokenи
Token ID18849
Feature activation-2.76e-01

Pos logit contributions
ーテ+1.913
 Pengu+1.884
hene+1.684
sten+1.630
ゴン+1.628
Neg logit contributions
 AD-3.000
 Lak-2.914
ellar-2.813
 ACT-2.806
 Cent-2.794
Token
Token ID141
Feature activation-4.67e-01

Pos logit contributions
ーテ+1.853
 Pengu+1.848
hene+1.663
ゴン+1.610
sten+1.610
Neg logit contributions
 AD-2.375
 Lak-2.310
ellar-2.230
 ACT-2.209
 Cent-2.184
Token
Token ID229
Feature activation-3.73e-01

Pos logit contributions
ーテ+3.293
 Pengu+3.183
hene+2.952
sten+2.851
ゴン+2.808
Neg logit contributions
 AD-3.857
 Lak-3.695
ellar-3.633
 ACT-3.590
 Cent-3.481
Tokenе
Token ID16843
Feature activation-4.22e-01

Pos logit contributions
ーテ+1.999
 Pengu+1.969
hene+1.755
sten+1.679
ゴン+1.630
Neg logit contributions
 AD-3.689
 Lak-3.619
ellar-3.466
 ACT-3.464
 Cent-3.435
TokenAC
Token ID2246
Feature activation+1.24e-01

Pos logit contributions
 AD+1.293
 Lak+0.900
 ACT+0.840
 Cent+0.682
ellar+0.596
Neg logit contributions
 Pengu-10.692
ーテ-10.660
ゴン-9.968
hene-9.870
 tradem-9.801
TokenLU
Token ID41596
Feature activation+1.50e-01

Pos logit contributions
 AD+0.963
 Lak+0.931
 ACT+0.883
ellar+0.879
 Cent+0.875
Neg logit contributions
ーテ-0.962
 Pengu-0.958
hene-0.867
ゴン-0.853
sten-0.841
Token_
Token ID62
Feature activation+3.23e-01

Pos logit contributions
 AD+1.094
 Lak+1.063
 ACT+1.002
ellar+1.001
 Cent+0.996
Neg logit contributions
ーテ-1.212
 Pengu-1.205
hene-1.111
sten-1.083
ゴン-1.083
TokenOR
Token ID1581
Feature activation+1.25e-02

Pos logit contributions
 AD+2.094
 Lak+1.999
 ACT+1.888
ellar+1.859
 Cent+1.851
Neg logit contributions
 Pengu-2.951
ーテ-2.951
hene-2.694
ゴン-2.669
sten-2.619
Token)
Token ID8
Feature activation-2.98e-02

Pos logit contributions
 AD+0.087
 Lak+0.086
ellar+0.082
 ACT+0.080
 Cent+0.079
Neg logit contributions
ーテ-0.101
 Pengu-0.100
hene-0.095
sten-0.093
ゴン-0.090
Token May
Token ID1737
Feature activation-8.92e-01

Pos logit contributions
 Pengu+0.225
ーテ+0.221
hene+0.207
sten+0.200
 Disorder+0.192
Neg logit contributions
 AD-0.224
 Lak-0.218
ellar-0.215
iel-0.205
 ACT-0.203
Token 29
Token ID2808
Feature activation-2.39e-01

Pos logit contributions
 Pengu+5.300
ーテ+5.214
hene+4.920
sten+4.641
ゴン+4.350
Neg logit contributions
 AD-7.924
 Lak-7.829
ellar-7.576
 ACT-7.444
 Cent-7.345
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
 Pengu+1.652
ーテ+1.650
hene+1.603
sten+1.580
punk+1.519
Neg logit contributions
 AD-1.801
 Lak-1.795
ellar-1.759
 Cent-1.672
 ACT-1.671
Token 2017
Token ID2177
Feature activation-1.15e-01

Pos logit contributions
 AD+0.067
 Lak+0.066
ellar+0.062
 Cent+0.059
 ACT+0.059
Neg logit contributions
 Pengu-0.133
ーテ-0.133
hene-0.126
sten-0.123
ゴン-0.121
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
ーテ+0.995
 Pengu+0.994
hene+0.926
sten+0.896
ゴン+0.894
Neg logit contributions
 AD-0.752
 Lak-0.730
ellar-0.693
 ACT-0.678
 Cent-0.676
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
ーテ+1.233
 Pengu+1.231
hene+1.204
sten+1.181
punk+1.146
Neg logit contributions
 AD-1.098
 Lak-1.091
ellar-1.055
 Cent-1.009
 ACT-1.003
Token
Token ID141
Feature activation-6.96e-01

Pos logit contributions
ーテ+4.428
 Pengu+4.392
hene+4.080
sten+3.933
ゴン+3.844
Neg logit contributions
 AD-4.443
 Lak-4.387
ellar-4.225
 ACT-4.102
 Cent-4.037
Token
Token ID227
Feature activation-4.10e-01

Pos logit contributions
ーテ+3.813
 Pengu+3.644
hene+3.316
sten+3.177
ゴン+3.018
Neg logit contributions
 AD-6.654
 Lak-6.436
ellar-6.310
 ACT-6.249
 Cent-6.076
Token
Token ID12466
Feature activation-7.94e-01

Pos logit contributions
ーテ+3.087
 Pengu+3.068
hene+2.833
sten+2.739
ゴン+2.721
Neg logit contributions
 AD-3.099
 Lak-3.037
ellar-2.939
 ACT-2.855
 Cent-2.829
Token
Token ID116
Feature activation-4.72e-01

Pos logit contributions
ーテ+4.354
 Pengu+4.256
hene+3.864
sten+3.670
ゴン+3.624
Neg logit contributions
 AD-7.434
 Lak-7.262
ellar-7.029
 ACT-6.921
 Cent-6.810
Token
Token ID12466
Feature activation-7.23e-01

Pos logit contributions
ーテ+3.492
 Pengu+3.457
hene+3.103
sten+3.037
ゴン+3.030
Neg logit contributions
 AD-3.668
 Lak-3.603
ellar-3.447
 ACT-3.375
 Cent-3.348
Token
Token ID111
Feature activation-8.92e-01

Pos logit contributions
ーテ+4.114
 Pengu+4.009
hene+3.689
ゴン+3.513
sten+3.472
Neg logit contributions
 AD-6.704
 Lak-6.530
ellar-6.296
 ACT-6.212
 Cent-6.141
Tokenо
Token ID15166
Feature activation-4.09e-01

Pos logit contributions
ーテ+4.549
 Pengu+4.473
hene+3.916
sten+3.792
ゴン+3.775
Neg logit contributions
 AD-8.566
 Lak-8.467
ellar-8.190
 ACT-8.111
 Cent-8.041
Tokenр
Token ID21169
Feature activation-4.36e-01

Pos logit contributions
ーテ+2.844
 Pengu+2.824
hene+2.534
sten+2.492
ゴン+2.474
Neg logit contributions
 AD-3.384
 Lak-3.298
ellar-3.217
 ACT-3.137
 Cent-3.112
Tokenя
Token ID40623
Feature activation-1.96e-01

Pos logit contributions
ーテ+2.163
 Pengu+2.091
hene+1.855
sten+1.737
ゴン+1.736
Neg logit contributions
 AD-4.511
 Lak-4.416
ellar-4.257
 ACT-4.256
 Cent-4.220
Token
Token ID141
Feature activation-3.97e-01

Pos logit contributions
ーテ+1.486
 Pengu+1.475
hene+1.341
ゴン+1.304
sten+1.300
Neg logit contributions
 AD-1.527
 Lak-1.485
ellar-1.425
 ACT-1.410
 Cent-1.390
Token
Token ID229
Feature activation-2.24e-01

Pos logit contributions
ーテ+2.766
 Pengu+2.673
hene+2.487
sten+2.377
ゴン+2.320
Neg logit contributions
 AD-3.328
 Lak-3.196
 ACT-3.096
ellar-3.091
 Cent-3.006
Tokenland
Token ID1044
Feature activation-1.72e-01

Pos logit contributions
 AD+2.606
 Lak+2.512
 ACT+2.366
ellar+2.351
 Cent+2.350
Neg logit contributions
ーテ-3.264
 Pengu-3.255
hene-2.970
ゴン-2.935
sten-2.882
Token State
Token ID1812
Feature activation-3.90e-01

Pos logit contributions
ーテ+1.335
 Pengu+1.329
hene+1.207
ゴン+1.182
sten+1.172
Neg logit contributions
 AD-1.338
 Lak-1.297
 ACT-1.229
ellar-1.225
 Cent-1.218
Token Park
Token ID3250
Feature activation-1.91e-01

Pos logit contributions
 Pengu+2.232
ーテ+2.171
hene+1.997
sten+1.959
 Disorder+1.891
Neg logit contributions
 AD-3.535
 Lak-3.476
ellar-3.441
iel-3.285
 ACT-3.278
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
ーテ+1.485
 Pengu+1.475
hene+1.379
sten+1.341
ゴン+1.313
Neg logit contributions
 AD-1.394
 Lak-1.359
ellar-1.320
 ACT-1.280
 Cent-1.263
Token North
Token ID2258
Feature activation-2.38e-01

Pos logit contributions
ーテ+1.271
 Pengu+1.264
hene+1.117
ゴン+1.088
sten+1.071
Neg logit contributions
 AD-1.965
 Lak-1.914
 ACT-1.832
ellar-1.823
 Cent-1.821
Token East
Token ID3687
Feature activation-8.91e-01

Pos logit contributions
ーテ+1.491
 Pengu+1.475
hene+1.374
sten+1.346
ゴン+1.278
Neg logit contributions
 AD-2.015
 Lak-2.008
ellar-1.976
iel-1.898
 ACT-1.885
Tokenon
Token ID261
Feature activation-1.53e-01

Pos logit contributions
sten+5.433
hene+5.303
punk+5.041
 Pengu+4.818
 Hemp+4.664
Neg logit contributions
 AD-7.120
 Lak-7.002
ellar-6.682
 Cent-6.537
 yeast-6.533
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
ーテ+1.168
 Pengu+1.161
hene+1.109
sten+1.071
punk+1.035
Neg logit contributions
 AD-1.119
 Lak-1.102
ellar-1.053
 ACT-1.022
 Cent-1.021
Token
Token ID198
Feature activation-3.18e-01

Pos logit contributions
ーテ+2.041
 Pengu+2.034
hene+1.875
ゴン+1.845
sten+1.827
Neg logit contributions
 AD-1.426
 Lak-1.376
 ACT-1.285
ellar-1.280
 Cent-1.272
Token
Token ID198
Feature activation-2.90e-01

Pos logit contributions
ーテ+2.269
 Pengu+2.265
hene+2.254
sten+2.227
punk+2.159
Neg logit contributions
 Lak-2.229
 AD-2.221
ellar-2.162
 Cent-2.055
 ACT-2.044
TokenThe
Token ID464
Feature activation-8.64e-02

Pos logit contributions
ーテ+2.244
 Pengu+2.229
hene+2.080
sten+2.060
ゴン+2.038
Neg logit contributions
 AD-2.124
 Lak-2.075
ellar-1.965
 ACT-1.952
 Cent-1.932
Token 2
Token ID362
Feature activation+3.32e-01

Pos logit contributions
 Pengu+1.649
ーテ+1.638
hene+1.515
sten+1.469
ゴン+1.448
Neg logit contributions
 AD-1.500
 Lak-1.456
ellar-1.398
 ACT-1.371
 Cent-1.364
Token begins
Token ID6140
Feature activation+1.12e-01

Pos logit contributions
 AD+2.162
 Lak+2.061
 ACT+1.947
 Cent+1.912
ellar+1.895
Neg logit contributions
ーテ-3.037
 Pengu-3.000
hene-2.758
ゴン-2.737
sten-2.689
Token with
Token ID351
Feature activation+4.29e-02

Pos logit contributions
 AD+0.781
 Lak+0.758
ellar+0.712
 ACT+0.710
 Cent+0.707
Neg logit contributions
ーテ-0.945
 Pengu-0.945
hene-0.867
ゴン-0.848
sten-0.842
Token:
Token ID25
Feature activation-1.32e-01

Pos logit contributions
 AD+0.300
 Lak+0.290
 ACT+0.272
ellar+0.272
 Cent+0.271
Neg logit contributions
ーテ-0.364
 Pengu-0.364
hene-0.333
ゴン-0.327
sten-0.324
Token
Token ID198
Feature activation-6.87e-01

Pos logit contributions
ーテ+1.214
 Pengu+1.210
hene+1.117
ゴン+1.098
sten+1.088
Neg logit contributions
 AD-0.834
 Lak-0.803
 ACT-0.749
ellar-0.747
 Cent-0.744
Token
Token ID198
Feature activation-8.84e-01

Pos logit contributions
hene+4.204
sten+4.200
punk+4.133
Mask+4.054
 Disorder+4.044
Neg logit contributions
 Lak-5.112
ellar-5.038
 AD-4.996
 coh-4.830
iel-4.775
TokenThe
Token ID464
Feature activation-4.41e-01

Pos logit contributions
 Pengu+5.456
ーテ+5.411
sten+5.380
hene+5.377
punk+5.369
Neg logit contributions
 Lak-6.923
 AD-6.767
 Cent-6.379
 Grassley-6.316
ellar-6.309
Token Citizens
Token ID18029
Feature activation-4.88e-01

Pos logit contributions
 Pengu+3.647
ーテ+3.471
hene+3.318
sten+3.245
 Disorder+3.147
Neg logit contributions
 AD-2.905
 Lak-2.905
ellar-2.776
iel-2.598
 ACT-2.590
Token of
Token ID286
Feature activation-5.59e-01

Pos logit contributions
 Pengu+3.606
ーテ+3.528
hene+3.269
sten+3.168
ゴン+3.101
Neg logit contributions
 AD-3.659
 Lak-3.656
ellar-3.507
 ACT-3.352
 Cent-3.339
Token each
Token ID1123
Feature activation-6.61e-01

Pos logit contributions
 Pengu+4.103
ーテ+4.031
hene+3.682
sten+3.589
punk+3.432
Neg logit contributions
 AD-4.296
 Lak-4.212
ellar-4.049
 ACT-3.969
 Cent-3.880
Token State
Token ID1812
Feature activation-5.23e-01

Pos logit contributions
 Pengu+4.231
ーテ+3.925
hene+3.711
sten+3.617
punk+3.488
Neg logit contributions
 AD-5.534
 Lak-5.459
ellar-5.261
 ACT-5.107
iel-5.035
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
ーテ+3.385
 Pengu+3.375
hene+3.112
ゴン+3.051
sten+3.032
Neg logit contributions
 AD-2.427
 Lak-2.351
 ACT-2.192
ellar-2.188
 Cent-2.175
Token
Token ID198
Feature activation-6.11e-01

Pos logit contributions
ーテ+0.976
 Pengu+0.971
hene+0.943
sten+0.926
punk+0.894
Neg logit contributions
 AD-0.834
 Lak-0.822
ellar-0.785
 Cent-0.761
 ACT-0.759
TokenGoogle
Token ID11708
Feature activation-3.91e-01

Pos logit contributions
ーテ+4.156
 Pengu+4.131
hene+3.767
sten+3.757
ゴン+3.720
Neg logit contributions
 AD-4.993
 Lak-4.816
 ACT-4.668
 Cent-4.596
ellar-4.596
Token Scholar
Token ID11713
Feature activation+4.23e-02

Pos logit contributions
ーテ+2.999
 Pengu+2.979
hene+2.779
sten+2.745
 Disorder+2.665
Neg logit contributions
 Lak-2.743
 AD-2.665
ellar-2.620
iel-2.512
 ACT-2.510
Token
Token ID198
Feature activation-5.78e-01

Pos logit contributions
 AD+0.272
 Lak+0.261
 ACT+0.245
ellar+0.243
 Cent+0.242
Neg logit contributions
ーテ-0.387
 Pengu-0.385
hene-0.356
ゴン-0.350
sten-0.346
Token
Token ID198
Feature activation-8.75e-01

Pos logit contributions
sten+3.738
hene+3.737
 Pengu+3.669
punk+3.646
ーテ+3.643
Neg logit contributions
 AD-4.155
ellar-4.137
 Lak-4.126
iel-3.891
 Cent-3.876
TokenOl
Token ID30098
Feature activation+2.39e-02

Pos logit contributions
 Pengu+5.475
ーテ+5.393
sten+5.324
Mask+5.268
hene+5.246
Neg logit contributions
 AD-7.203
 Lak-6.796
 ACT-6.745
 Cent-6.605
ellar-6.510
Tokenwe
Token ID732
Feature activation-2.53e-01

Pos logit contributions
 AD+0.178
 Lak+0.173
 ACT+0.164
ellar+0.162
 Cent+0.162
Neg logit contributions
 Pengu-0.191
ーテ-0.190
hene-0.176
sten-0.173
ゴン-0.169
Tokenus
Token ID385
Feature activation+4.64e-02

Pos logit contributions
ーテ+1.979
 Pengu+1.965
hene+1.835
sten+1.798
ゴン+1.754
Neg logit contributions
 AD-1.869
 Lak-1.810
 ACT-1.731
ellar-1.712
 Cent-1.693
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
 AD+0.315
 Lak+0.303
 ACT+0.283
 Cent+0.283
ellar+0.282
Neg logit contributions
ーテ-0.410
 Pengu-0.407
hene-0.370
ゴン-0.367
sten-0.360
Token D
Token ID360
Feature activation+2.80e-01

Pos logit contributions
 Pengu+1.176
ーテ+1.157
hene+1.074
sten+1.051
punk+1.010
Neg logit contributions
 AD-0.974
ellar-0.948
 Lak-0.947
 ACT-0.897
iel-0.892
Tokenс
Token ID21727
Feature activation-6.22e-01

Pos logit contributions
ーテ+3.988
 Pengu+3.907
hene+3.572
sten+3.413
punk+3.347
Neg logit contributions
 AD-5.624
 Lak-5.536
ellar-5.313
 ACT-5.257
 Cent-5.195
Tokenт
Token ID20375
Feature activation-2.45e-01

Pos logit contributions
ーテ+3.472
 Pengu+3.236
sten+2.888
hene+2.884
ゴン+2.804
Neg logit contributions
 AD-6.012
 Lak-5.898
ellar-5.681
 ACT-5.673
 Cent-5.559
Tokenв
Token ID38857
Feature activation-4.25e-01

Pos logit contributions
ーテ+1.975
 Pengu+1.947
hene+1.797
ゴン+1.745
sten+1.745
Neg logit contributions
 AD-1.804
 Lak-1.746
ellar-1.650
 ACT-1.648
 Cent-1.629
Tokenо
Token ID15166
Feature activation-4.92e-01

Pos logit contributions
ーテ+1.887
 Pengu+1.817
hene+1.604
sten+1.496
ゴン+1.472
Neg logit contributions
 AD-4.580
 Lak-4.544
ellar-4.347
 ACT-4.322
 Cent-4.290
Token
Token ID12466
Feature activation-5.29e-01

Pos logit contributions
 Pengu+3.771
ーテ+3.763
hene+3.461
sten+3.372
ゴン+3.330
Neg logit contributions
 AD-3.621
 Lak-3.540
ellar-3.424
 ACT-3.302
 Cent-3.261
Token
Token ID110
Feature activation-8.75e-01

Pos logit contributions
ーテ+3.170
 Pengu+3.097
hene+2.817
sten+2.680
ゴン+2.678
Neg logit contributions
 AD-4.879
 Lak-4.724
ellar-4.574
 ACT-4.518
 Cent-4.459
Tokenы
Token ID45035
Feature activation-3.54e-01

Pos logit contributions
ーテ+6.178
 Pengu+6.141
hene+5.543
ゴン+5.403
sten+5.385
Neg logit contributions
 AD-6.463
 Lak-6.415
ellar-6.362
 Cent-5.961
 ACT-5.951
Tokenя
Token ID40623
Feature activation-3.85e-01

Pos logit contributions
ーテ+2.480
 Pengu+2.472
hene+2.222
ゴン+2.178
sten+2.159
Neg logit contributions
 AD-2.918
 Lak-2.845
ellar-2.759
 ACT-2.710
 Cent-2.682
Tokenв
Token ID38857
Feature activation-6.95e-01

Pos logit contributions
 Pengu+2.952
ーテ+2.949
hene+2.693
ゴン+2.622
sten+2.590
Neg logit contributions
 AD-2.876
 Lak-2.802
ellar-2.722
 ACT-2.658
 Cent-2.619
Tokenл
Token ID30143
Feature activation-4.45e-01

Pos logit contributions
ーテ+3.598
 Pengu+3.515
hene+3.157
sten+3.021
ゴン+2.991
Neg logit contributions
 AD-6.809
 Lak-6.696
 ACT-6.406
ellar-6.378
 Cent-6.329
Tokenе
Token ID16843
Feature activation-2.14e-01

Pos logit contributions
ーテ+1.808
 Pengu+1.796
hene+1.480
ゴン+1.418
sten+1.382
Neg logit contributions
 AD-4.980
 Lak-4.884
 ACT-4.719
ellar-4.714
 Cent-4.654
Token Irvine
Token ID37277
Feature activation-1.20e-01

Pos logit contributions
 Pengu+1.491
ーテ+1.474
hene+1.362
sten+1.326
ゴン+1.319
Neg logit contributions
 AD-1.206
 Lak-1.174
ellar-1.117
 ACT-1.105
 Cent-1.072
Token,
Token ID11
Feature activation-9.76e-02

Pos logit contributions
ーテ+0.955
 Pengu+0.951
hene+0.888
sten+0.859
ゴン+0.843
Neg logit contributions
 AD-0.867
 Lak-0.844
ellar-0.806
 ACT-0.796
 Cent-0.788
Token California
Token ID3442
Feature activation-8.45e-03

Pos logit contributions
ーテ+0.797
 Pengu+0.792
hene+0.727
ゴン+0.710
sten+0.707
Neg logit contributions
 AD-0.706
 Lak-0.689
ellar-0.647
 ACT-0.646
 Cent-0.640
Token.
Token ID13
Feature activation-9.52e-02

Pos logit contributions
 Pengu+0.075
ーテ+0.075
hene+0.068
ゴン+0.067
sten+0.067
Neg logit contributions
 AD-0.057
 Lak-0.055
 ACT-0.051
ellar-0.051
 Cent-0.051
Token Hide
Token ID10415
Feature activation-5.20e-01

Pos logit contributions
 Pengu+0.835
ーテ+0.831
hene+0.766
sten+0.743
ゴン+0.741
Neg logit contributions
 AD-0.633
 Lak-0.609
ellar-0.581
 ACT-0.572
 Cent-0.563
Token Caption
Token ID11260
Feature activation-8.69e-01

Pos logit contributions
 Pengu+3.458
ーテ+3.334
hene+3.298
sten+3.206
 Disorder+3.124
Neg logit contributions
 AD-3.996
 Lak-3.969
ellar-3.928
 yeast-3.789
iel-3.719
Token 10
Token ID838
Feature activation-2.51e-01

Pos logit contributions
 Pengu+5.117
ーテ+4.972
sten+4.714
 Hemp+4.668
 Disorder+4.668
Neg logit contributions
ellar-7.070
 Lak-6.899
 AD-6.758
iel-6.636
alore-6.404
Token of
Token ID286
Feature activation+2.17e-01

Pos logit contributions
ーテ+1.745
 Pengu+1.711
hene+1.656
sten+1.620
punk+1.563
Neg logit contributions
 AD-1.938
 Lak-1.934
ellar-1.863
 ACT-1.795
 Cent-1.791
Token 15
Token ID1315
Feature activation-2.03e-01

Pos logit contributions
 AD+1.340
 Lak+1.289
 ACT+1.203
ellar+1.199
 Cent+1.193
Neg logit contributions
ーテ-2.017
 Pengu-2.010
hene-1.861
ゴン-1.825
sten-1.814
Token Photos
Token ID9434
Feature activation+2.13e-03

Pos logit contributions
 Pengu+1.183
ーテ+1.173
hene+1.071
sten+1.038
punk+0.985
Neg logit contributions
 AD-1.845
 Lak-1.820
ellar-1.775
 ACT-1.726
iel-1.718
Token:
Token ID25
Feature activation-2.02e-01

Pos logit contributions
 AD+0.014
 Lak+0.014
ellar+0.013
 ACT+0.013
 Cent+0.013
Neg logit contributions
ーテ-0.018
 Pengu-0.018
hene-0.017
sten-0.016
ゴン-0.016
Token Clara
Token ID27443
Feature activation-3.05e-01

Pos logit contributions
 Pengu+1.142
ーテ+1.129
hene+1.007
sten+0.961
ゴン+0.943
Neg logit contributions
 AD-2.153
 Lak-2.104
ellar-2.030
 ACT-2.023
 Cent-1.987
Token County
Token ID3418
Feature activation-4.18e-01

Pos logit contributions
 Pengu+2.379
ーテ+2.376
hene+2.161
ゴン+2.119
sten+2.110
Neg logit contributions
 AD-2.284
 Lak-2.214
ellar-2.113
 ACT-2.108
 Cent-2.062
Token v
Token ID410
Feature activation-7.31e-02

Pos logit contributions
 Pengu+2.777
ーテ+2.752
hene+2.482
sten+2.399
ゴン+2.375
Neg logit contributions
 AD-3.503
 Lak-3.460
ellar-3.326
 ACT-3.267
 Cent-3.214
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
 Pengu+0.587
ーテ+0.584
hene+0.541
sten+0.525
ゴン+0.514
Neg logit contributions
 AD-0.522
 Lak-0.505
ellar-0.490
 ACT-0.477
 Cent-0.470
Token Southern
Token ID8050
Feature activation-1.80e-01

Pos logit contributions
ーテ+1.832
 Pengu+1.831
hene+1.694
ゴン+1.659
sten+1.656
Neg logit contributions
 AD-1.094
 Lak-1.051
ellar-0.979
 ACT-0.976
 Cent-0.960
Token Pacific
Token ID8211
Feature activation-8.67e-01

Pos logit contributions
 Pengu+1.076
ーテ+1.074
hene+0.950
sten+0.924
ゴン+0.893
Neg logit contributions
 AD-1.668
 Lak-1.626
ellar-1.572
 ACT-1.547
 Cent-1.526
Token Railroad
Token ID36289
Feature activation-1.12e-01

Pos logit contributions
ーテ+4.687
 Pengu+4.590
hene+4.157
sten+3.957
ゴン+3.929
Neg logit contributions
 AD-8.009
 Lak-7.933
ellar-7.623
 ACT-7.612
 Cent-7.372
Token Company
Token ID5834
Feature activation-3.99e-01

Pos logit contributions
ーテ+0.972
 Pengu+0.970
hene+0.889
ゴン+0.874
sten+0.865
Neg logit contributions
 AD-0.766
 Lak-0.737
 ACT-0.694
 Cent-0.691
ellar-0.688
Token
Token ID198
Feature activation-4.46e-01

Pos logit contributions
 Pengu+3.133
ーテ+3.118
hene+2.919
sten+2.804
ゴン+2.800
Neg logit contributions
 AD-2.859
 Lak-2.829
ellar-2.688
 ACT-2.624
 Cent-2.595
Token
Token ID198
Feature activation-6.03e-01

Pos logit contributions
hene+3.010
ーテ+2.995
sten+2.993
 Pengu+2.957
punk+2.884
Neg logit contributions
 Lak-3.193
 AD-3.167
ellar-3.115
iel-2.945
 Cent-2.936
TokenCalifornia
Token ID25284
Feature activation-3.15e-01

Pos logit contributions
 Pengu+4.570
ーテ+4.532
hene+4.282
sten+4.219
ゴン+4.166
Neg logit contributions
 AD-4.395
 Lak-4.315
 ACT-4.066
ellar-4.045
 Cent-3.976
Token |
Token ID930
Feature activation+5.75e-02

Pos logit contributions
ーテ+2.125
 Pengu+2.118
hene+1.948
ゴン+1.915
sten+1.895
Neg logit contributions
 AD-1.601
 Lak-1.543
 ACT-1.449
ellar-1.439
 Cent-1.436
Token Medline
Token ID34582
Feature activation+1.08e-02

Pos logit contributions
 AD+0.337
 Lak+0.321
 ACT+0.299
ellar+0.295
 Cent+0.295
Neg logit contributions
ーテ-0.558
 Pengu-0.557
hene-0.517
ゴン-0.509
sten-0.504
Token |
Token ID930
Feature activation+1.43e-01

Pos logit contributions
 AD+0.070
 Lak+0.066
 ACT+0.063
 Cent+0.062
ellar+0.062
Neg logit contributions
 Pengu-0.099
ーテ-0.098
hene-0.090
ゴン-0.089
sten-0.088
Token ISI
Token ID22048
Feature activation+1.50e-01

Pos logit contributions
 AD+0.673
 Lak+0.641
 ACT+0.582
ellar+0.582
 Cent+0.577
Neg logit contributions
ーテ-1.542
 Pengu-1.536
hene-1.436
ゴン-1.414
sten-1.406
Token
Token ID198
Feature activation-4.96e-01

Pos logit contributions
 AD+0.873
 Lak+0.829
 ACT+0.778
 Cent+0.765
ellar+0.762
Neg logit contributions
 Pengu-1.474
ーテ-1.470
hene-1.359
ゴン-1.343
sten-1.321
Token
Token ID198
Feature activation-8.59e-01

Pos logit contributions
 Pengu+3.402
ーテ+3.364
hene+3.363
sten+3.357
punk+3.258
Neg logit contributions
 AD-3.508
ellar-3.472
 Lak-3.466
iel-3.275
 Cent-3.250
TokenP
Token ID47
Feature activation-4.44e-02

Pos logit contributions
 Pengu+5.711
ーテ+5.627
sten+5.483
hene+5.343
punk+5.315
Neg logit contributions
 AD-6.859
 Lak-6.476
 ACT-6.383
ellar-6.253
 Cent-6.214
Tokenel
Token ID417
Feature activation-2.16e-01

Pos logit contributions
ーテ+0.341
 Pengu+0.340
hene+0.318
sten+0.307
ゴン+0.302
Neg logit contributions
 AD-0.342
 Lak-0.332
 ACT-0.315
 Cent-0.311
ellar-0.308
Tokenleg
Token ID1455
Feature activation-2.07e-01

Pos logit contributions
 Pengu+1.686
ーテ+1.667
hene+1.574
sten+1.555
punk+1.472
Neg logit contributions
 AD-1.617
 Lak-1.566
 ACT-1.489
ellar-1.463
 Cent-1.463
Tokenr
Token ID81
Feature activation-1.50e-01

Pos logit contributions
 Pengu+1.470
ーテ+1.456
hene+1.392
sten+1.346
ゴン+1.280
Neg logit contributions
 AD-1.677
 Lak-1.644
 ACT-1.564
 Cent-1.541
ellar-1.521
Tokenini
Token ID5362
Feature activation-2.47e-01

Pos logit contributions
ーテ+1.142
 Pengu+1.137
hene+1.068
sten+1.033
ゴン+1.013
Neg logit contributions
 AD-1.146
 Lak-1.114
 ACT-1.067
ellar-1.061
 Cent-1.055
Token intent
Token ID6824
Feature activation-9.46e-02

Pos logit contributions
 Pengu+0.522
ーテ+0.516
hene+0.473
sten+0.464
ゴン+0.457
Neg logit contributions
 AD-0.470
 Lak-0.456
ellar-0.437
 Cent-0.427
 ACT-0.426
Token on
Token ID319
Feature activation+4.04e-02

Pos logit contributions
ーテ+0.846
 Pengu+0.840
hene+0.778
ゴン+0.758
sten+0.757
Neg logit contributions
 AD-0.615
 Lak-0.594
 ACT-0.556
ellar-0.555
 Cent-0.552
Token rolling
Token ID10708
Feature activation+4.81e-01

Pos logit contributions
 AD+0.301
 Lak+0.292
ellar+0.278
 ACT+0.276
 Cent+0.273
Neg logit contributions
 Pengu-0.321
ーテ-0.320
hene-0.291
ゴン-0.284
sten-0.284
Token back
Token ID736
Feature activation+2.66e-01

Pos logit contributions
 AD+3.389
 Lak+3.256
 ACT+3.067
 Cent+3.043
ellar+2.983
Neg logit contributions
ーテ-4.124
 Pengu-4.092
hene-3.773
ゴン-3.755
sten-3.641
Token all
Token ID477
Feature activation+1.47e-01

Pos logit contributions
 AD+1.937
 Lak+1.871
 ACT+1.770
 Cent+1.754
ellar+1.747
Neg logit contributions
ーテ-2.203
 Pengu-2.190
hene-2.005
ゴン-1.976
sten-1.944
Token the
Token ID262
Feature activation+6.51e-01

Pos logit contributions
 AD+1.068
 Lak+1.035
ellar+0.976
 ACT+0.975
 Cent+0.968
Neg logit contributions
ーテ-1.202
 Pengu-1.198
hene-1.094
ゴン-1.070
sten-1.063
Token gains
Token ID8810
Feature activation-2.17e-01

Pos logit contributions
 AD+4.615
 Lak+4.448
 ACT+4.222
 Cent+4.185
ellar+3.991
Neg logit contributions
ーテ-5.536
 Pengu-5.461
hene-5.033
ゴン-5.032
sten-4.866
Token of
Token ID286
Feature activation+1.47e-01

Pos logit contributions
ーテ+1.637
 Pengu+1.617
hene+1.471
ゴン+1.415
sten+1.412
Neg logit contributions
 AD-1.665
 Lak-1.628
ellar-1.558
 ACT-1.531
 Cent-1.521
Token the
Token ID262
Feature activation+1.36e-01

Pos logit contributions
 AD+1.095
 Lak+1.060
 ACT+1.002
ellar+1.001
 Cent+0.994
Neg logit contributions
ーテ-1.177
 Pengu-1.173
hene-1.069
ゴン-1.047
sten-1.038
Token Progressive
Token ID25852
Feature activation+2.82e-01

Pos logit contributions
 AD+0.940
 Lak+0.908
ellar+0.866
 ACT+0.854
 Cent+0.846
Neg logit contributions
ーテ-1.164
 Pengu-1.163
hene-1.064
sten-1.037
ゴン-1.034
Token era
Token ID6980
Feature activation+2.01e-03

Pos logit contributions
 AD+2.348
 Lak+2.277
 ACT+2.169
 Cent+2.160
ellar+2.138
Neg logit contributions
ーテ-2.037
 Pengu-2.021
hene-1.822
ゴン-1.800
sten-1.767
Token click
Token ID3904
Feature activation+4.20e-01

Pos logit contributions
 AD+0.737
 Lak+0.716
ellar+0.677
 ACT+0.676
 Cent+0.671
Neg logit contributions
 Pengu-0.794
ーテ-0.793
hene-0.722
sten-0.701
ゴン-0.701
Token on
Token ID319
Feature activation+2.37e-01

Pos logit contributions
 AD+2.785
 Lak+2.658
 ACT+2.514
 Cent+2.485
ellar+2.443
Neg logit contributions
ーテ-3.767
 Pengu-3.721
hene-3.460
ゴン-3.431
sten-3.344
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
 AD+1.742
 Lak+1.676
 ACT+1.591
 Cent+1.576
ellar+1.565
Neg logit contributions
ーテ-1.951
 Pengu-1.932
hene-1.778
ゴン-1.752
sten-1.719
Token Settings
Token ID16163
Feature activation+3.97e-01

Pos logit contributions
 AD+1.180
 Lak+1.141
 ACT+1.080
ellar+1.072
 Cent+1.072
Neg logit contributions
ーテ-1.264
 Pengu-1.258
hene-1.148
ゴン-1.127
sten-1.113
Token Icon
Token ID26544
Feature activation+1.96e-01

Pos logit contributions
 AD+3.027
 Lak+2.913
 ACT+2.776
 Cent+2.753
ellar+2.710
Neg logit contributions
ーテ-3.147
 Pengu-3.116
hene-2.850
ゴン-2.797
sten-2.763
Token in
Token ID287
Feature activation+7.29e-01

Pos logit contributions
 AD+1.360
 Lak+1.312
 ACT+1.235
ellar+1.227
 Cent+1.225
Neg logit contributions
ーテ-1.678
 Pengu-1.672
hene-1.534
ゴン-1.507
sten-1.491
Token the
Token ID262
Feature activation+5.51e-01

Pos logit contributions
 AD+7.093
 Lak+6.846
 Cent+6.654
 ACT+6.575
ellar+6.432
Neg logit contributions
ーテ-4.135
 Pengu-3.986
ゴン-3.652
hene-3.594
sten-3.376
Token top
Token ID1353
Feature activation+2.94e-01

Pos logit contributions
 AD+4.462
 Lak+4.297
 Cent+4.117
 ACT+4.100
ellar+3.996
Neg logit contributions
ーテ-4.129
 Pengu-4.043
ゴン-3.696
hene-3.684
sten-3.536
Token left
Token ID1364
Feature activation+1.44e-01

Pos logit contributions
 AD+2.065
 Lak+1.982
 Cent+1.875
 ACT+1.873
ellar+1.855
Neg logit contributions
ーテ-2.505
 Pengu-2.491
hene-2.279
ゴン-2.259
sten-2.218
Token corner
Token ID5228
Feature activation+2.78e-01

Pos logit contributions
 AD+0.959
 Lak+0.920
 ACT+0.866
 Cent+0.860
ellar+0.858
Neg logit contributions
ーテ-1.288
 Pengu-1.282
hene-1.179
ゴン-1.160
sten-1.147
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
 AD+1.911
 Lak+1.831
 ACT+1.731
 Cent+1.722
ellar+1.703
Neg logit contributions
ーテ-2.409
 Pengu-2.397
hene-2.204
ゴン-2.168
sten-2.142
Token versions
Token ID6300
Feature activation-4.81e-02

Pos logit contributions
 Pengu+0.475
ーテ+0.473
hene+0.426
sten+0.418
ゴン+0.417
Neg logit contributions
 AD-0.453
 Lak-0.440
ellar-0.425
 ACT-0.416
 Cent-0.411
Token of
Token ID286
Feature activation-1.69e-01

Pos logit contributions
ーテ+0.402
 Pengu+0.400
hene+0.366
ゴン+0.360
sten+0.356
Neg logit contributions
 AD-0.344
 Lak-0.333
 ACT-0.314
ellar-0.312
 Cent-0.311
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 Pengu+1.275
ーテ+1.270
hene+1.138
ゴン+1.115
sten+1.112
Neg logit contributions
 AD-1.308
 Lak-1.272
ellar-1.227
 ACT-1.201
 Cent-1.190
Token five
Token ID1936
Feature activation+1.84e-01

Pos logit contributions
 Pengu+1.002
ーテ+0.998
hene+0.897
sten+0.882
ゴン+0.877
Neg logit contributions
 AD-0.971
 Lak-0.943
ellar-0.924
 ACT-0.892
 Cent-0.881
Token original
Token ID2656
Feature activation+4.27e-01

Pos logit contributions
 AD+1.326
 Lak+1.282
 ACT+1.210
ellar+1.206
 Cent+1.199
Neg logit contributions
ーテ-1.523
 Pengu-1.517
hene-1.386
ゴン-1.361
sten-1.347
Token classic
Token ID6833
Feature activation+6.29e-01

Pos logit contributions
 AD+3.325
 Lak+3.215
 ACT+3.058
 Cent+3.040
ellar+2.979
Neg logit contributions
ーテ-3.329
 Pengu-3.295
hene-3.018
ゴン-2.972
sten-2.913
Token sports
Token ID5701
Feature activation+2.43e-01

Pos logit contributions
 AD+4.878
 Lak+4.703
 ACT+4.472
 Cent+4.456
ellar+4.292
Neg logit contributions
ーテ-4.907
 Pengu-4.856
hene-4.469
ゴン-4.409
sten-4.298
Token
Token ID784
Feature activation+1.87e-01

Pos logit contributions
 AD+1.854
 Lak+1.798
 ACT+1.701
ellar+1.697
 Cent+1.687
Neg logit contributions
ーテ-1.906
 Pengu-1.901
hene-1.728
ゴン-1.694
sten-1.676
Token Tennis
Token ID37213
Feature activation+6.87e-03

Pos logit contributions
 AD+1.380
 Lak+1.335
 ACT+1.262
ellar+1.256
 Cent+1.251
Neg logit contributions
ーテ-1.516
 Pengu-1.511
hene-1.378
ゴン-1.351
sten-1.338
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
 AD+0.049
 Lak+0.047
ellar+0.045
 ACT+0.045
 Cent+0.044
Neg logit contributions
ーテ-0.057
 Pengu-0.057
hene-0.052
ゴン-0.051
sten-0.051
Token Bowling
Token ID44748
Feature activation+1.23e-01

Pos logit contributions
 AD+2.012
 Lak+1.946
 ACT+1.851
 Cent+1.840
ellar+1.822
Neg logit contributions
ーテ-1.915
 Pengu-1.901
hene-1.726
ゴン-1.705
sten-1.674
Token m
Token ID285
Feature activation+1.41e-01

Pos logit contributions
 Pengu+2.166
ーテ+2.139
hene+1.938
sten+1.899
ゴン+1.891
Neg logit contributions
 AD-2.157
 Lak-2.101
ellar-2.029
 ACT-1.992
 Cent-1.964
Token2
Token ID17
Feature activation+1.35e-01

Pos logit contributions
 AD+0.962
 Lak+0.922
 Cent+0.869
 ACT+0.868
ellar+0.866
Neg logit contributions
ーテ-1.249
 Pengu-1.233
hene-1.116
ゴン-1.115
sten-1.079
Tokenk
Token ID74
Feature activation+3.53e-02

Pos logit contributions
 AD+1.074
 Lak+1.043
 ACT+0.992
ellar+0.987
 Cent+0.984
Neg logit contributions
 Pengu-0.995
ーテ-0.994
hene-0.926
sten-0.902
ゴン-0.877
Token [
Token ID685
Feature activation-3.84e-02

Pos logit contributions
 AD+0.213
 Lak+0.208
ellar+0.196
 ACT+0.191
 Cent+0.191
Neg logit contributions
 Pengu-0.325
ーテ-0.321
hene-0.302
sten-0.296
ゴン-0.291
Token10
Token ID940
Feature activation-1.96e-01

Pos logit contributions
ーテ+0.295
 Pengu+0.293
hene+0.278
sten+0.274
ゴン+0.264
Neg logit contributions
 AD-0.286
 Lak-0.282
ellar-0.264
 ACT-0.263
 Cent-0.261
Token:
Token ID25
Feature activation+9.36e-01

Pos logit contributions
 Pengu+1.506
ーテ+1.500
hene+1.461
sten+1.435
punk+1.396
Neg logit contributions
 AD-1.344
 Lak-1.337
ellar-1.282
 Cent-1.231
 ACT-1.225
Token47
Token ID2857
Feature activation+3.81e-01

Pos logit contributions
 AD+4.001
 Lak+3.683
 ACT+3.409
ellar+3.294
 Cent+3.171
Neg logit contributions
ーテ-10.687
 Pengu-10.636
ゴン-9.911
 tradem-9.891
hene-9.693
Token]
Token ID60
Feature activation+2.07e-01

Pos logit contributions
 AD+3.046
 Lak+2.917
 ACT+2.795
 Cent+2.762
ellar+2.733
Neg logit contributions
ーテ-2.979
 Pengu-2.955
ゴン-2.640
hene-2.608
 tradem-2.534
Token <
Token ID1279
Feature activation+1.86e-02

Pos logit contributions
 AD+1.454
 Lak+1.371
 ACT+1.323
 Cent+1.282
ellar+1.279
Neg logit contributions
ーテ-1.796
 Pengu-1.764
hene-1.631
ゴン-1.597
sten-1.579
Tokenkw
Token ID46265
Feature activation-1.55e-01

Pos logit contributions
 AD+0.139
 Lak+0.134
 ACT+0.127
ellar+0.126
 Cent+0.126
Neg logit contributions
ーテ-0.150
 Pengu-0.149
hene-0.136
ゴン-0.133
sten-0.132
Tokenij
Token ID2926
Feature activation-2.77e-01

Pos logit contributions
 Pengu+1.251
ーテ+1.236
hene+1.164
sten+1.136
ゴン+1.113
Neg logit contributions
 AD-1.123
 Lak-1.089
 ACT-1.028
 Cent-1.022
ellar-1.009
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
 AD+0.578
 Lak+0.554
 ACT+0.521
 Cent+0.520
ellar+0.517
Neg logit contributions
ーテ-0.752
 Pengu-0.750
hene-0.686
ゴン-0.676
sten-0.668
Token Y
Token ID575
Feature activation+4.20e-01

Pos logit contributions
 AD+6.380
 Cent+6.163
 Lak+5.982
 ACT+5.593
cent+5.242
Neg logit contributions
ーテ-10.300
 Pengu-10.020
ゴン-9.538
hene-9.229
sten-9.144
TokenYR
Token ID38162
Feature activation+5.80e-02

Pos logit contributions
 AD+3.081
 Lak+2.947
 ACT+2.786
ellar+2.775
 Cent+2.775
Neg logit contributions
ーテ-3.500
 Pengu-3.466
hene-3.132
ゴン-3.095
sten-3.053
Token DS
Token ID17400
Feature activation+3.51e-01

Pos logit contributions
 AD+0.407
 Lak+0.393
 ACT+0.370
ellar+0.368
 Cent+0.367
Neg logit contributions
ーテ-0.494
 Pengu-0.492
hene-0.450
ゴン-0.442
sten-0.437
Token cent
Token ID1247
Feature activation+5.18e-01

Pos logit contributions
 AD+2.359
 Lak+2.254
 ACT+2.116
 Cent+2.108
ellar+2.083
Neg logit contributions
ーテ-3.135
 Pengu-3.117
hene-2.844
ゴン-2.813
sten-2.774
Tokenering
Token ID1586
Feature activation+6.62e-01

Pos logit contributions
 AD+4.471
 Lak+4.280
 Cent+4.206
ellar+4.161
 ACT+4.126
Neg logit contributions
ーテ-3.599
 Pengu-3.484
ゴン-3.135
hene-3.020
sten-2.919
Token
Token ID198
Feature activation+2.32e-01

Pos logit contributions
 AD+3.917
 Lak+3.652
 Cent+3.517
 ACT+3.443
ellar+3.332
Neg logit contributions
ーテ-6.346
 Pengu-6.336
hene-5.769
ゴン-5.731
sten-5.632
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
 AD+1.457
 Lak+1.405
 ACT+1.310
ellar+1.310
 Cent+1.300
Neg logit contributions
ーテ-2.118
 Pengu-2.110
hene-1.961
ゴン-1.916
sten-1.912
TokenThe
Token ID464
Feature activation-2.32e-02

Pos logit contributions
 AD+1.047
 Lak+1.011
 ACT+0.951
ellar+0.947
 Cent+0.941
Neg logit contributions
ーテ-1.357
 Pengu-1.354
hene-1.249
ゴン-1.227
sten-1.217
Token Garg
Token ID39013
Feature activation+1.17e-01

Pos logit contributions
 Pengu+0.207
ーテ+0.206
hene+0.189
ゴン+0.185
sten+0.184
Neg logit contributions
 AD-0.150
 Lak-0.145
ellar-0.138
 ACT-0.137
 Cent-0.135
Tokenant
Token ID415
Feature activation-3.48e-01

Pos logit contributions
 AD+0.766
 Lak+0.738
 ACT+0.692
 Cent+0.683
ellar+0.682
Neg logit contributions
 Pengu-1.043
ーテ-1.036
hene-0.962
sten-0.938
ゴン-0.933
Token,
Token ID11
Feature activation-8.84e-02

Pos logit contributions
 AD+1.923
 Lak+1.788
 ACT+1.722
 Cent+1.692
ellar+1.662
Neg logit contributions
ーテ-2.672
 Pengu-2.662
hene-2.416
ゴン-2.415
sten-2.374
Token will
Token ID481
Feature activation+2.04e-02

Pos logit contributions
 Pengu+0.660
ーテ+0.656
hene+0.604
sten+0.581
ゴン+0.574
Neg logit contributions
 AD-0.694
 Lak-0.680
ellar-0.652
 Cent-0.638
 ACT-0.637
Token kill
Token ID1494
Feature activation+1.22e-01

Pos logit contributions
 AD+0.159
 Lak+0.157
ellar+0.149
 Cent+0.147
 ACT+0.146
Neg logit contributions
 Pengu-0.153
ーテ-0.153
hene-0.141
sten-0.136
ゴン-0.133
Token half
Token ID2063
Feature activation+3.72e-01

Pos logit contributions
 AD+0.876
 Lak+0.855
ellar+0.816
 ACT+0.800
 Cent+0.794
Neg logit contributions
 Pengu-0.998
ーテ-0.994
hene-0.915
sten-0.885
ゴン-0.874
Token of
Token ID286
Feature activation+2.71e-01

Pos logit contributions
 AD+2.578
 Lak+2.478
 ACT+2.332
 Cent+2.326
ellar+2.278
Neg logit contributions
ーテ-3.219
 Pengu-3.196
hene-2.926
ゴン-2.912
sten-2.846
Token all
Token ID477
Feature activation+1.68e-01

Pos logit contributions
 AD+1.979
 Lak+1.911
 ACT+1.806
 Cent+1.792
ellar+1.790
Neg logit contributions
ーテ-2.235
 Pengu-2.224
hene-2.033
ゴン-2.000
sten-1.973
Token long
Token ID890
Feature activation-1.17e-01

Pos logit contributions
 AD+1.139
 Lak+1.106
ellar+1.039
 ACT+1.035
 Cent+1.027
Neg logit contributions
 Pengu-1.453
ーテ-1.452
hene-1.335
ゴン-1.302
sten-1.298
Token-
Token ID12
Feature activation+9.89e-02

Pos logit contributions
ーテ+1.052
 Pengu+1.046
hene+0.980
sten+0.960
ゴン+0.940
Neg logit contributions
 AD-0.751
 Lak-0.732
ellar-0.686
 ACT-0.677
 Cent-0.674
Tokenterm
Token ID4354
Feature activation+2.59e-01

Pos logit contributions
 AD+0.524
 Lak+0.500
 ACT+0.462
ellar+0.458
 Cent+0.456
Neg logit contributions
ーテ-1.012
 Pengu-1.008
hene-0.938
ゴン-0.925
sten-0.916
Token users
Token ID2985
Feature activation+6.18e-03

Pos logit contributions
 AD+2.024
 Lak+1.962
 ACT+1.860
ellar+1.851
 Cent+1.846
Neg logit contributions
ーテ-1.997
 Pengu-1.989
hene-1.806
ゴン-1.769
sten-1.749
Token,"
Token ID553
Feature activation+1.38e-01

Pos logit contributions
 AD+0.042
 Lak+0.041
ellar+0.038
 ACT+0.038
 Cent+0.038
Neg logit contributions
ーテ-0.054
 Pengu-0.054
hene-0.049
ゴン-0.048
sten-0.048
Token state
Token ID1181
Feature activation+1.42e-01

Pos logit contributions
 AD+0.108
 Lak+0.106
ellar+0.101
 ACT+0.099
 Cent+0.098
Neg logit contributions
 Pengu-0.119
ーテ-0.118
hene-0.108
sten-0.105
ゴン-0.104
Token fund
Token ID1814
Feature activation-2.16e-01

Pos logit contributions
 AD+1.028
 Lak+1.000
 ACT+0.941
ellar+0.939
 Cent+0.933
Neg logit contributions
ーテ-1.163
 Pengu-1.162
hene-1.060
ゴン-1.036
sten-1.030
Token on
Token ID319
Feature activation+8.17e-02

Pos logit contributions
 Pengu+1.690
ーテ+1.666
hene+1.545
sten+1.488
ゴン+1.451
Neg logit contributions
 AD-1.569
 Lak-1.556
ellar-1.484
 ACT-1.447
 Cent-1.435
Token top
Token ID1353
Feature activation+1.00e-01

Pos logit contributions
 AD+0.583
 Lak+0.567
ellar+0.539
 ACT+0.532
 Cent+0.530
Neg logit contributions
 Pengu-0.675
ーテ-0.669
hene-0.614
sten-0.597
ゴン-0.596
Token of
Token ID286
Feature activation+1.43e-01

Pos logit contributions
 AD+0.693
 Lak+0.662
 ACT+0.628
 Cent+0.621
ellar+0.619
Neg logit contributions
ーテ-0.871
 Pengu-0.870
hene-0.795
ゴン-0.780
sten-0.770
Token other
Token ID584
Feature activation+1.33e-01

Pos logit contributions
 AD+1.037
 Lak+1.009
ellar+0.956
 ACT+0.948
 Cent+0.942
Neg logit contributions
 Pengu-1.163
ーテ-1.159
hene-1.058
ゴン-1.033
sten-1.031
Token recommendations
Token ID10763
Feature activation-1.29e-01

Pos logit contributions
 AD+0.946
 Lak+0.926
ellar+0.877
 ACT+0.862
 Cent+0.861
Neg logit contributions
 Pengu-1.099
ーテ-1.091
hene-0.998
sten-0.976
ゴン-0.971
Token,
Token ID11
Feature activation+5.00e-02

Pos logit contributions
 Pengu+1.023
ーテ+1.012
hene+0.941
sten+0.901
ゴン+0.892
Neg logit contributions
 AD-0.939
 Lak-0.920
ellar-0.873
 ACT-0.858
 Cent-0.854
Token although
Token ID3584
Feature activation+3.40e-03

Pos logit contributions
 AD+0.381
 Lak+0.370
ellar+0.352
 ACT+0.350
 Cent+0.347
Neg logit contributions
 Pengu-0.390
ーテ-0.390
hene-0.355
sten-0.344
ゴン-0.343
Token she
Token ID673
Feature activation+3.22e-01

Pos logit contributions
 AD+0.027
 Lak+0.027
ellar+0.025
 ACT+0.025
 Cent+0.025
Neg logit contributions
 Pengu-0.025
ーテ-0.025
hene-0.023
ゴン-0.022
sten-0.022
Token said
Token ID531
Feature activation-6.07e-03

Pos logit contributions
 AD+2.329
 Lak+2.238
 ACT+2.124
 Cent+2.098
ellar+2.085
Neg logit contributions
ーテ-2.713
 Pengu-2.680
ゴン-2.435
hene-2.434
sten-2.363
Token PV
Token ID31392
Feature activation+2.91e-02

Pos logit contributions
 AD+0.552
 Lak+0.537
ellar+0.507
 ACT+0.505
 Cent+0.501
Neg logit contributions
 Pengu-0.622
ーテ-0.621
hene-0.567
ゴン-0.553
sten-0.552
Token vs
Token ID3691
Feature activation-1.74e-01

Pos logit contributions
 AD+0.206
 Lak+0.200
 ACT+0.188
ellar+0.187
 Cent+0.187
Neg logit contributions
ーテ-0.244
 Pengu-0.243
hene-0.223
ゴン-0.218
sten-0.217
Token PV
Token ID31392
Feature activation-7.16e-02

Pos logit contributions
 Pengu+1.600
ーテ+1.597
hene+1.488
sten+1.446
ゴン+1.442
Neg logit contributions
 AD-1.062
 Lak-1.041
ellar-0.976
 ACT-0.953
 Cent-0.950
TokenH
Token ID39
Feature activation-2.71e-01

Pos logit contributions
ーテ+0.610
 Pengu+0.608
hene+0.560
ゴン+0.546
sten+0.546
Neg logit contributions
 AD-0.494
 Lak-0.481
ellar-0.451
 ACT-0.450
 Cent-0.446
TokenVM
Token ID15996
Feature activation-4.29e-02

Pos logit contributions
ーテ+1.729
 Pengu+1.713
hene+1.530
ゴン+1.487
sten+1.475
Neg logit contributions
 AD-2.457
 Lak-2.398
 ACT-2.287
ellar-2.274
 Cent-2.272
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
ーテ+0.356
 Pengu+0.355
hene+0.326
sten+0.318
ゴン+0.318
Neg logit contributions
 AD-0.302
 Lak-0.294
ellar-0.279
 ACT-0.277
 Cent-0.274
Token but
Token ID475
Feature activation+3.38e-03

Pos logit contributions
 AD+1.395
 Lak+1.350
 ACT+1.273
ellar+1.269
 Cent+1.263
Neg logit contributions
ーテ-1.592
 Pengu-1.588
hene-1.451
ゴン-1.422
sten-1.409
Token I
Token ID314
Feature activation-1.25e-01

Pos logit contributions
 AD+0.024
 Lak+0.024
ellar+0.023
 ACT+0.022
 Cent+0.022
Neg logit contributions
 Pengu-0.028
ーテ-0.027
hene-0.025
sten-0.024
ゴン-0.024
Token found
Token ID1043
Feature activation+1.56e-02

Pos logit contributions
 Pengu+0.991
ーテ+0.969
hene+0.915
sten+0.895
ゴン+0.849
Neg logit contributions
 AD-0.905
 Lak-0.902
ellar-0.856
 Cent-0.837
 ACT-0.823
Token that
Token ID326
Feature activation-1.95e-01

Pos logit contributions
 AD+0.115
 Lak+0.113
ellar+0.107
 ACT+0.105
 Cent+0.105
Neg logit contributions
 Pengu-0.124
ーテ-0.124
hene-0.112
sten-0.110
ゴン-0.108
Token workload
Token ID26211
Feature activation-6.87e-02

Pos logit contributions
 Pengu+1.566
ーテ+1.533
hene+1.412
sten+1.370
ゴン+1.337
Neg logit contributions
 AD-1.394
 Lak-1.378
ellar-1.323
 Cent-1.273
 ACT-1.272
Token cause
Token ID2728
Feature activation-5.23e-02

Pos logit contributions
 Pengu+0.315
ーテ+0.314
hene+0.287
ゴン+0.280
sten+0.280
Neg logit contributions
 AD-0.284
 Lak-0.275
ellar-0.263
 ACT-0.261
 Cent-0.258
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
ーテ+0.446
 Pengu+0.444
hene+0.409
ゴン+0.401
sten+0.398
Neg logit contributions
 AD-0.362
 Lak-0.350
ellar-0.329
 ACT-0.328
 Cent-0.326
Token Powerful
Token ID46308
Feature activation+1.64e-01

Pos logit contributions
 Pengu+0.948
ーテ+0.940
hene+0.861
sten+0.841
ゴン+0.823
Neg logit contributions
 AD-0.749
 Lak-0.724
ellar-0.710
 ACT-0.683
iel-0.670
Token players
Token ID1938
Feature activation-1.61e-01

Pos logit contributions
 AD+1.255
 Lak+1.216
 ACT+1.151
ellar+1.147
 Cent+1.142
Neg logit contributions
ーテ-1.292
 Pengu-1.287
hene-1.171
ゴン-1.146
sten-1.136
Token like
Token ID588
Feature activation+3.25e-01

Pos logit contributions
 Pengu+1.277
ーテ+1.274
hene+1.173
sten+1.135
ゴン+1.120
Neg logit contributions
 AD-1.184
 Lak-1.147
ellar-1.099
 ACT-1.076
 Cent-1.076
Token Michelle
Token ID16738
Feature activation+1.55e-01

Pos logit contributions
 AD+2.435
 Lak+2.357
 ACT+2.220
 Cent+2.210
ellar+2.172
Neg logit contributions
ーテ-2.651
 Pengu-2.614
hene-2.391
ゴン-2.377
sten-2.306
Token Obama
Token ID2486
Feature activation+1.07e-03

Pos logit contributions
 AD+1.093
 Lak+1.052
 ACT+0.995
ellar+0.992
 Cent+0.985
Neg logit contributions
ーテ-1.311
 Pengu-1.310
hene-1.200
ゴン-1.174
sten-1.167
Token and
Token ID290
Feature activation+4.53e-02

Pos logit contributions
 AD+0.008
 Lak+0.007
ellar+0.007
 ACT+0.007
 Cent+0.007
Neg logit contributions
ーテ-0.009
 Pengu-0.009
hene-0.008
ゴン-0.008
sten-0.008
Token RE
Token ID4526
Feature activation+3.52e-02

Pos logit contributions
 AD+0.327
 Lak+0.316
 ACT+0.298
ellar+0.298
 Cent+0.296
Neg logit contributions
ーテ-0.375
 Pengu-0.374
hene-0.342
ゴン-0.335
sten-0.332
TokenI
Token ID40
Feature activation-9.78e-02

Pos logit contributions
 AD+0.271
 Lak+0.262
 ACT+0.249
ellar+0.248
 Cent+0.246
Neg logit contributions
ーテ-0.273
 Pengu-0.272
hene-0.248
ゴン-0.243
sten-0.241
Token have
Token ID423
Feature activation-6.87e-02

Pos logit contributions
 Pengu+0.815
ーテ+0.810
hene+0.748
sten+0.727
ゴン+0.723
Neg logit contributions
 AD-0.682
 Lak-0.664
ellar-0.630
 ACT-0.620
 Cent-0.617
Token_
Token ID62
Feature activation+1.62e-01

Pos logit contributions
 AD+0.376
 Lak+0.367
ellar+0.345
 ACT+0.345
 Cent+0.342
Neg logit contributions
ーテ-0.406
 Pengu-0.403
hene-0.375
sten-0.365
ゴン-0.362
Token00
Token ID405
Feature activation+7.62e-02

Pos logit contributions
 AD+0.930
 Lak+0.880
 ACT+0.828
 Cent+0.817
ellar+0.815
Neg logit contributions
ーテ-1.589
 Pengu-1.589
hene-1.470
ゴン-1.442
sten-1.425
Token64
Token ID2414
Feature activation-6.06e-02

Pos logit contributions
 AD+0.474
 Lak+0.455
 ACT+0.425
 Cent+0.421
ellar+0.421
Neg logit contributions
ーテ-0.710
 Pengu-0.710
hene-0.654
ゴン-0.645
sten-0.636
TokenE
Token ID36
Feature activation+6.62e-02

Pos logit contributions
ーテ+0.403
 Pengu+0.395
hene+0.364
sten+0.350
ゴン+0.347
Neg logit contributions
 AD-0.526
 Lak-0.516
ellar-0.494
 ACT-0.490
 Cent-0.487
Token,
Token ID11
Feature activation-2.47e-01

Pos logit contributions
 AD+0.448
 Lak+0.428
 ACT+0.405
 Cent+0.402
ellar+0.400
Neg logit contributions
ーテ-0.584
 Pengu-0.581
hene-0.534
ゴン-0.525
sten-0.519
TokenDP
Token ID6322
Feature activation+2.81e-01

Pos logit contributions
ーテ+1.577
 Pengu+1.567
hene+1.401
ゴン+1.363
sten+1.345
Neg logit contributions
 AD-2.239
 Lak-2.192
 ACT-2.084
ellar-2.079
 Cent-2.075
Token02
Token ID2999
Feature activation+2.16e-02

Pos logit contributions
 AD+2.077
 Lak+2.001
 ACT+1.897
 Cent+1.881
ellar+1.876
Neg logit contributions
ーテ-2.301
 Pengu-2.296
hene-2.085
ゴン-2.050
sten-2.014
Token_
Token ID62
Feature activation+1.35e-01

Pos logit contributions
 AD+0.160
 Lak+0.157
ellar+0.148
 ACT+0.147
 Cent+0.146
Neg logit contributions
ーテ-0.170
 Pengu-0.169
hene-0.157
sten-0.154
ゴン-0.152
Token00
Token ID405
Feature activation-9.08e-02

Pos logit contributions
 AD+0.771
 Lak+0.731
 ACT+0.686
 Cent+0.677
ellar+0.677
Neg logit contributions
ーテ-1.332
 Pengu-1.331
hene-1.232
ゴン-1.209
sten-1.195
Token65
Token ID2996
Feature activation-9.91e-02

Pos logit contributions
ーテ+1.028
 Pengu+1.023
hene+0.961
ゴン+0.947
sten+0.943
Neg logit contributions
 AD-0.373
 Lak-0.356
 ACT-0.318
ellar-0.315
 Cent-0.314
TokenE
Token ID36
Feature activation+8.78e-02

Pos logit contributions
ーテ+0.653
 Pengu+0.637
hene+0.590
sten+0.567
ゴン+0.558
Neg logit contributions
 AD-0.867
 Lak-0.852
ellar-0.814
 ACT-0.809
 Cent-0.800
Token were
Token ID547
Feature activation-2.41e-01

Pos logit contributions
 Pengu+1.316
ーテ+1.308
hene+1.213
sten+1.164
ゴン+1.148
Neg logit contributions
 AD-1.212
 Lak-1.178
ellar-1.130
 ACT-1.109
 Cent-1.103
Token already
Token ID1541
Feature activation-2.25e-01

Pos logit contributions
 Pengu+1.877
ーテ+1.840
hene+1.721
sten+1.655
ゴン+1.609
Neg logit contributions
 AD-1.770
 Lak-1.721
ellar-1.665
 Cent-1.614
iel-1.602
Token running
Token ID2491
Feature activation-8.52e-02

Pos logit contributions
 Pengu+1.777
ーテ+1.750
hene+1.642
sten+1.584
ゴン+1.538
Neg logit contributions
 AD-1.621
 Lak-1.586
ellar-1.532
 Cent-1.482
iel-1.477
Token pretty
Token ID2495
Feature activation-1.60e-01

Pos logit contributions
 Pengu+0.710
ーテ+0.699
hene+0.646
sten+0.625
ゴン+0.616
Neg logit contributions
 AD-0.589
 Lak-0.579
ellar-0.555
 ACT-0.536
 Cent-0.535
Token fast
Token ID3049
Feature activation-2.94e-02

Pos logit contributions
 Pengu+1.270
ーテ+1.239
hene+1.147
ゴン+1.108
sten+1.102
Neg logit contributions
 AD-1.161
 Lak-1.130
ellar-1.093
 Cent-1.063
 ACT-1.049
Token on
Token ID319
Feature activation-9.34e-02

Pos logit contributions
ーテ+0.248
 Pengu+0.246
hene+0.227
ゴン+0.220
sten+0.220
Neg logit contributions
 AD-0.204
 Lak-0.199
ellar-0.189
 ACT-0.186
 Cent-0.186
Token these
Token ID777
Feature activation-1.28e-02

Pos logit contributions
 Pengu+0.777
ーテ+0.772
hene+0.705
sten+0.689
ゴン+0.679
Neg logit contributions
 AD-0.654
 Lak-0.642
ellar-0.611
 ACT-0.597
 Cent-0.594
Token instances
Token ID10245
Feature activation+1.40e-01

Pos logit contributions
 Pengu+0.103
ーテ+0.103
hene+0.094
sten+0.092
ゴン+0.091
Neg logit contributions
 AD-0.091
 Lak-0.090
ellar-0.085
 ACT-0.084
 Cent-0.083
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
 AD+0.961
 Lak+0.925
 ACT+0.872
 Cent+0.864
ellar+0.863
Neg logit contributions
ーテ-1.217
 Pengu-1.212
hene-1.113
ゴン-1.095
sten-1.082
Token This
Token ID770
Feature activation-6.59e-02

Pos logit contributions
 Pengu+1.684
ーテ+1.661
hene+1.519
sten+1.470
ゴン+1.435
Neg logit contributions
 AD-1.353
 Lak-1.335
ellar-1.317
iel-1.250
 ACT-1.232
Token was
Token ID373
Feature activation-9.95e-02

Pos logit contributions
 Pengu+0.524
ーテ+0.522
hene+0.479
sten+0.464
ゴン+0.463
Neg logit contributions
 AD-0.487
 Lak-0.475
ellar-0.452
 ACT-0.447
 Cent-0.444
Token
Token ID247
Feature activation+2.89e-01

Pos logit contributions
 AD+2.001
 Lak+1.935
 ACT+1.845
ellar+1.829
 Cent+1.821
Neg logit contributions
 Pengu-1.780
ーテ-1.771
hene-1.605
ゴン-1.559
sten-1.556
Tokens
Token ID82
Feature activation+1.88e-01

Pos logit contributions
 AD+1.950
 Lak+1.884
 ACT+1.773
 Cent+1.755
ellar+1.741
Neg logit contributions
ーテ-2.536
 Pengu-2.518
hene-2.312
ゴン-2.288
sten-2.235
Token a
Token ID257
Feature activation+3.85e-02

Pos logit contributions
 AD+1.352
 Lak+1.307
 ACT+1.233
 Cent+1.223
ellar+1.222
Neg logit contributions
ーテ-1.572
 Pengu-1.563
hene-1.434
ゴン-1.408
sten-1.391
Token sample
Token ID6291
Feature activation-1.05e-01

Pos logit contributions
 AD+0.289
 Lak+0.279
ellar+0.264
 ACT+0.264
 Cent+0.262
Neg logit contributions
ーテ-0.307
 Pengu-0.307
hene-0.279
ゴン-0.273
sten-0.271
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
 Pengu+0.858
ーテ+0.852
hene+0.779
ゴン+0.762
sten+0.759
Neg logit contributions
 AD-0.756
 Lak-0.730
ellar-0.694
 ACT-0.686
 Cent-0.686
Token The
Token ID383
Feature activation+2.83e-02

Pos logit contributions
 AD+0.083
 Lak+0.080
ellar+0.075
 ACT+0.074
 Cent+0.074
Neg logit contributions
 Pengu-0.116
ーテ-0.116
hene-0.106
ゴン-0.104
sten-0.103
Token opening
Token ID4756
Feature activation-2.51e-02

Pos logit contributions
 AD+0.214
 Lak+0.208
ellar+0.197
 ACT+0.196
 Cent+0.195
Neg logit contributions
 Pengu-0.224
ーテ-0.223
hene-0.203
ゴン-0.198
sten-0.197
Token movie
Token ID3807
Feature activation-4.77e-02

Pos logit contributions
ーテ+0.201
 Pengu+0.201
hene+0.182
ゴン+0.179
sten+0.177
Neg logit contributions
 AD-0.188
 Lak-0.182
ellar-0.172
 ACT-0.172
 Cent-0.171
Token depicts
Token ID28539
Feature activation-1.62e-01

Pos logit contributions
 Pengu+0.386
ーテ+0.384
hene+0.351
ゴン+0.343
sten+0.342
Neg logit contributions
 AD-0.350
 Lak-0.339
ellar-0.322
 ACT-0.318
 Cent-0.317
Token Hats
Token ID37934
Feature activation+7.57e-03

Pos logit contributions
 Pengu+1.317
ーテ+1.303
hene+1.179
ゴン+1.161
sten+1.147
Neg logit contributions
 AD-1.195
 Lak-1.153
ellar-1.103
 ACT-1.083
 Cent-1.078
Tokenune
Token ID1726
Feature activation-2.04e-01

Pos logit contributions
 AD+0.065
 Lak+0.063
 ACT+0.060
ellar+0.060
 Cent+0.060
Neg logit contributions
 Pengu-0.052
ーテ-0.052
hene-0.048
sten-0.046
ゴン-0.045
TokenWC
Token ID27353
Feature activation-3.06e-01

Pos logit contributions
 AD+0.780
 Lak+0.765
 ACT+0.723
ellar+0.720
 Cent+0.717
Neg logit contributions
ーテ-0.580
 Pengu-0.576
hene-0.533
sten-0.518
ゴン-0.507
Token was
Token ID373
Feature activation+1.29e-02

Pos logit contributions
 Pengu+2.352
ーテ+2.317
hene+2.126
sten+2.068
ゴン+2.032
Neg logit contributions
 AD-2.273
 Lak-2.237
ellar-2.170
 ACT-2.100
 Cent-2.077
Token planning
Token ID5410
Feature activation+7.75e-03

Pos logit contributions
 AD+0.097
 Lak+0.095
ellar+0.091
 ACT+0.089
 Cent+0.089
Neg logit contributions
 Pengu-0.100
ーテ-0.100
hene-0.092
sten-0.089
ゴン-0.088
Token to
Token ID284
Feature activation-6.79e-02

Pos logit contributions
 AD+0.052
 Lak+0.051
ellar+0.048
 ACT+0.048
 Cent+0.047
Neg logit contributions
ーテ-0.067
 Pengu-0.067
hene-0.062
ゴン-0.061
sten-0.060
Token announce
Token ID5453
Feature activation-5.02e-02

Pos logit contributions
 Pengu+0.567
ーテ+0.558
hene+0.515
sten+0.503
ゴン+0.497
Neg logit contributions
 AD-0.470
 Lak-0.465
ellar-0.440
 Cent-0.428
 ACT-0.427
Token the
Token ID262
Feature activation-3.90e-03

Pos logit contributions
 Pengu+0.411
ーテ+0.404
hene+0.371
sten+0.362
ゴン+0.357
Neg logit contributions
 AD-0.360
 Lak-0.352
ellar-0.337
 ACT-0.330
 Cent-0.327
Token addition
Token ID3090
Feature activation+1.12e-01

Pos logit contributions
 Pengu+0.032
ーテ+0.031
hene+0.029
sten+0.028
ゴン+0.028
Neg logit contributions
 AD-0.028
 Lak-0.027
ellar-0.026
 ACT-0.026
 Cent-0.026
Token of
Token ID286
Feature activation+1.20e-01

Pos logit contributions
 AD+0.773
 Lak+0.737
 ACT+0.697
 Cent+0.693
ellar+0.683
Neg logit contributions
 Pengu-0.966
ーテ-0.964
hene-0.883
ゴン-0.867
sten-0.859
Token Boise
Token ID38830
Feature activation+1.36e-01

Pos logit contributions
 AD+0.894
 Lak+0.868
ellar+0.822
 ACT+0.820
 Cent+0.812
Neg logit contributions
ーテ-0.958
 Pengu-0.957
hene-0.870
ゴン-0.851
sten-0.847
Token State
Token ID1812
Feature activation+3.78e-03

Pos logit contributions
 AD+1.043
 Lak+1.007
 ACT+0.954
 Cent+0.948
ellar+0.946
Neg logit contributions
ーテ-1.076
 Pengu-1.071
hene-0.974
ゴン-0.957
sten-0.943
Token after
Token ID706
Feature activation+1.27e-01

Pos logit contributions
 AD+0.027
 Lak+0.026
ellar+0.025
 ACT+0.025
 Cent+0.024
Neg logit contributions
ーテ-0.031
 Pengu-0.031
hene-0.029
ゴン-0.028
sten-0.028
Token:
Token ID25
Feature activation-2.26e-01

Pos logit contributions
ーテ+0.647
 Pengu+0.646
hene+0.595
ゴン+0.576
sten+0.576
Neg logit contributions
 AD-0.556
 Lak-0.542
ellar-0.512
 ACT-0.509
 Cent-0.507
Token
Token ID198
Feature activation-2.02e-01

Pos logit contributions
 Pengu+1.916
ーテ+1.904
hene+1.763
sten+1.713
ゴン+1.700
Neg logit contributions
 AD-1.534
 Lak-1.495
ellar-1.414
 ACT-1.404
 Cent-1.390
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
ーテ+1.562
 Pengu+1.553
hene+1.520
sten+1.490
punk+1.446
Neg logit contributions
 AD-1.384
 Lak-1.376
ellar-1.311
 Cent-1.268
 ACT-1.265
Tokenis
Token ID271
Feature activation-3.61e-01

Pos logit contributions
 Pengu+2.413
ーテ+2.412
hene+2.280
sten+2.243
ゴン+2.204
Neg logit contributions
 AD-2.561
 Lak-2.513
 ACT-2.380
 Cent-2.359
ellar-2.349
Token there
Token ID612
Feature activation-9.74e-02

Pos logit contributions
ーテ+2.792
 Pengu+2.791
hene+2.580
sten+2.495
ゴン+2.473
Neg logit contributions
 AD-2.667
 Lak-2.630
ellar-2.477
 ACT-2.462
 Cent-2.455
Token any
Token ID597
Feature activation-2.59e-01

Pos logit contributions
 Pengu+0.803
ーテ+0.800
hene+0.740
sten+0.718
ゴン+0.711
Neg logit contributions
 AD-0.685
 Lak-0.669
ellar-0.633
 ACT-0.628
 Cent-0.626
Token other
Token ID584
Feature activation-2.53e-02

Pos logit contributions
 Pengu+2.044
ーテ+2.014
hene+1.889
sten+1.810
ゴン+1.775
Neg logit contributions
 AD-1.853
 Lak-1.828
ellar-1.746
 Cent-1.718
 ACT-1.712
Token difference
Token ID3580
Feature activation-3.33e-02

Pos logit contributions
 Pengu+0.205
ーテ+0.204
hene+0.188
sten+0.182
ゴン+0.181
Neg logit contributions
 AD-0.182
 Lak-0.178
ellar-0.169
 ACT-0.167
 Cent-0.167
Token between
Token ID1022
Feature activation-2.30e-02

Pos logit contributions
ーテ+0.279
 Pengu+0.278
hene+0.256
ゴン+0.249
sten+0.249
Neg logit contributions
 AD-0.235
 Lak-0.228
 ACT-0.215
ellar-0.214
 Cent-0.213
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
 Pengu+0.180
ーテ+0.180
hene+0.164
sten+0.160
ゴン+0.160
Neg logit contributions
 AD-0.173
 Lak-0.169
ellar-0.161
 ACT-0.159
 Cent-0.158
Token two
Token ID734
Feature activation-1.95e-02

Pos logit contributions
 Pengu+0.815
ーテ+0.810
hene+0.745
sten+0.722
ゴン+0.712
Neg logit contributions
 AD-0.781
 Lak-0.767
ellar-0.738
 ACT-0.721
 Cent-0.720
Token@
Token ID31
Feature activation-8.36e-02

Pos logit contributions
 AD+2.871
 Lak+2.730
 ACT+2.603
 Cent+2.565
ellar+2.561
Neg logit contributions
ーテ-3.514
 Pengu-3.473
hene-3.193
ゴン-3.172
sten-3.084
Tokenindex
Token ID9630
Feature activation-3.71e-02

Pos logit contributions
ーテ+0.640
 Pengu+0.640
hene+0.582
ゴン+0.569
sten+0.565
Neg logit contributions
 AD-0.652
 Lak-0.633
 ACT-0.599
ellar-0.595
 Cent-0.595
TokenState
Token ID9012
Feature activation-1.00e-01

Pos logit contributions
ーテ+0.304
 Pengu+0.302
hene+0.277
ゴン+0.271
sten+0.269
Neg logit contributions
 AD-0.271
 Lak-0.263
 ACT-0.247
ellar-0.246
 Cent-0.245
Token <
Token ID1279
Feature activation-2.59e-01

Pos logit contributions
ーテ+0.819
 Pengu+0.817
hene+0.746
ゴン+0.733
sten+0.728
Neg logit contributions
 AD-0.728
 Lak-0.709
 ACT-0.666
ellar-0.663
 Cent-0.661
Token|
Token ID91
Feature activation+2.05e-01

Pos logit contributions
 Pengu+2.054
ーテ+2.038
hene+1.874
sten+1.838
ゴン+1.811
Neg logit contributions
 AD-1.902
 Lak-1.869
ellar-1.760
 Cent-1.738
 ACT-1.736
Token{
Token ID90
Feature activation+8.95e-04

Pos logit contributions
 AD+1.517
 Lak+1.451
 ACT+1.385
 Cent+1.367
ellar+1.364
Neg logit contributions
ーテ-1.678
 Pengu-1.666
hene-1.519
ゴン-1.497
sten-1.469
Token11
Token ID1157
Feature activation-1.09e-02

Pos logit contributions
 AD+0.007
 Lak+0.007
 ACT+0.006
ellar+0.006
 Cent+0.006
Neg logit contributions
ーテ-0.007
 Pengu-0.007
hene-0.006
ゴン-0.006
sten-0.006
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
ーテ+0.087
 Pengu+0.087
hene+0.079
ゴン+0.078
sten+0.077
Neg logit contributions
 AD-0.082
 Lak-0.080
 ACT-0.075
ellar-0.075
 Cent-0.075
Token 7
Token ID767
Feature activation+1.25e-01

Pos logit contributions
 AD+1.272
 Lak+1.226
 ACT+1.150
ellar+1.143
 Cent+1.140
Neg logit contributions
ーテ-1.719
 Pengu-1.712
hene-1.576
ゴン-1.549
sten-1.534
Token}
Token ID92
Feature activation+3.95e-01

Pos logit contributions
 AD+1.113
 Lak+1.076
 ACT+1.031
ellar+1.024
 Cent+1.023
Neg logit contributions
ーテ-0.831
 Pengu-0.826
hene-0.736
ゴン-0.717
sten-0.707
Token ->
Token ID4613
Feature activation-1.45e-01

Pos logit contributions
 AD+2.596
 Lak+2.460
 ACT+2.343
 Cent+2.300
ellar+2.298
Neg logit contributions
ーテ-3.559
 Pengu-3.525
hene-3.266
ゴン-3.205
sten-3.147
Token thing
Token ID1517
Feature activation-3.82e-01

Pos logit contributions
 Pengu+2.424
ーテ+2.382
hene+2.203
sten+2.123
ゴン+2.099
Neg logit contributions
 AD-2.405
 Lak-2.335
ellar-2.267
 ACT-2.218
 Cent-2.187
Token to
Token ID284
Feature activation-1.10e-01

Pos logit contributions
 Pengu+2.936
ーテ+2.921
hene+2.706
sten+2.625
ゴン+2.569
Neg logit contributions
 AD-2.819
 Lak-2.749
ellar-2.666
 ACT-2.577
 Cent-2.565
Token me
Token ID502
Feature activation-2.90e-01

Pos logit contributions
 Pengu+0.870
ーテ+0.865
hene+0.789
sten+0.771
ゴン+0.762
Neg logit contributions
 AD-0.825
 Lak-0.801
ellar-0.766
 ACT-0.757
 Cent-0.751
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 Pengu+2.376
ーテ+2.367
hene+2.176
sten+2.121
ゴン+2.096
Neg logit contributions
 AD-2.037
 Lak-1.983
ellar-1.905
 ACT-1.857
 Cent-1.843
Token
Token ID198
Feature activation-2.89e-01

Pos logit contributions
ーテ+1.177
 Pengu+1.174
hene+1.136
sten+1.113
punk+1.083
Neg logit contributions
 AD-1.025
 Lak-1.016
ellar-0.975
 Cent-0.938
 ACT-0.934
Tokenbut
Token ID4360
Feature activation-8.26e-01

Pos logit contributions
ーテ+2.244
 Pengu+2.227
hene+2.093
sten+2.042
ゴン+2.037
Neg logit contributions
 AD-2.160
 Lak-2.096
 ACT-1.987
ellar-1.977
 Cent-1.954
Token a
Token ID257
Feature activation-5.16e-01

Pos logit contributions
 Pengu+6.101
ーテ+5.935
hene+5.534
sten+5.385
 Disorder+5.216
Neg logit contributions
 AD-5.807
 Lak-5.718
ellar-5.643
 ACT-5.364
 Cent-5.325
Token foul
Token ID15626
Feature activation-4.01e-01

Pos logit contributions
 Pengu+4.004
ーテ+3.943
hene+3.627
sten+3.537
ゴン+3.442
Neg logit contributions
 AD-3.781
 Lak-3.676
ellar-3.533
 ACT-3.483
 Cent-3.392
Token and
Token ID290
Feature activation-3.30e-01

Pos logit contributions
ーテ+3.215
 Pengu+3.204
hene+2.944
sten+2.886
ゴン+2.839
Neg logit contributions
 AD-2.867
 Lak-2.811
ellar-2.662
 ACT-2.637
 Cent-2.599
Token pest
Token ID28064
Feature activation-3.16e-02

Pos logit contributions
 Pengu+2.601
ーテ+2.580
hene+2.369
sten+2.322
ゴン+2.272
Neg logit contributions
 AD-2.418
 Lak-2.371
ellar-2.249
 ACT-2.220
 Cent-2.202
Tokenil
Token ID346
Feature activation+5.04e-01

Pos logit contributions
ーテ+0.276
 Pengu+0.275
hene+0.254
sten+0.247
ゴン+0.246
Neg logit contributions
 AD-0.214
 Lak-0.207
 ACT-0.194
 Cent-0.191
ellar-0.191
Token,
Token ID11
Feature activation-2.15e-01

Pos logit contributions
ーテ+0.512
 Pengu+0.510
hene+0.469
ゴン+0.455
sten+0.453
Neg logit contributions
 AD-0.438
 Lak-0.426
ellar-0.402
 ACT-0.399
 Cent-0.395
Token but
Token ID475
Feature activation-2.15e-01

Pos logit contributions
 Pengu+1.641
ーテ+1.637
hene+1.485
sten+1.426
ゴン+1.422
Neg logit contributions
 AD-1.630
 Lak-1.592
ellar-1.549
 ACT-1.498
 Cent-1.483
Token they
Token ID484
Feature activation-1.70e-01

Pos logit contributions
 Pengu+1.642
ーテ+1.635
hene+1.475
ゴン+1.434
sten+1.431
Neg logit contributions
 AD-1.613
 Lak-1.567
ellar-1.534
 ACT-1.482
 Cent-1.474
Token are
Token ID389
Feature activation-1.80e-01

Pos logit contributions
 Pengu+1.336
ーテ+1.314
hene+1.230
sten+1.192
ゴン+1.158
Neg logit contributions
 AD-1.259
 Lak-1.235
ellar-1.186
 Cent-1.150
 ACT-1.143
Token paradox
Token ID22226
Feature activation-3.11e-01

Pos logit contributions
ーテ+1.371
 Pengu+1.367
hene+1.240
sten+1.204
ゴン+1.197
Neg logit contributions
 AD-1.361
 Lak-1.325
ellar-1.288
 ACT-1.249
 Cent-1.246
Tokenically
Token ID1146
Feature activation-5.73e-01

Pos logit contributions
ーテ+2.351
 Pengu+2.346
hene+2.114
sten+2.063
ゴン+2.033
Neg logit contributions
 AD-2.439
 Lak-2.375
 ACT-2.235
ellar-2.217
 Cent-2.208
Token one
Token ID530
Feature activation-3.88e-01

Pos logit contributions
 Pengu+4.215
ーテ+4.109
hene+3.747
sten+3.679
punk+3.592
Neg logit contributions
 AD-4.286
 Lak-4.179
ellar-4.164
iel-3.967
 Cent-3.940
Token of
Token ID286
Feature activation-4.57e-01

Pos logit contributions
ーテ+3.068
 Pengu+3.027
hene+2.771
ゴン+2.743
sten+2.686
Neg logit contributions
 AD-2.855
 Lak-2.764
ellar-2.639
 ACT-2.631
 Cent-2.582
Token the
Token ID262
Feature activation-3.80e-01

Pos logit contributions
ーテ+3.420
 Pengu+3.397
hene+3.043
ゴン+3.030
sten+3.023
Neg logit contributions
 AD-3.489
 Lak-3.405
ellar-3.283
 ACT-3.222
 Cent-3.214
Token most
Token ID749
Feature activation-4.70e-01

Pos logit contributions
 Pengu+2.684
ーテ+2.670
hene+2.380
sten+2.376
ゴン+2.302
Neg logit contributions
 AD-2.997
 Lak-2.905
ellar-2.859
 ACT-2.761
iel-2.750
Token inh
Token ID25783
Feature activation-1.67e-01

Pos logit contributions
 Pengu+3.339
ーテ+3.315
hene+3.072
sten+2.943
ゴン+2.883
Neg logit contributions
 AD-3.643
 Lak-3.561
ellar-3.512
 Cent-3.370
iel-3.359
Token using
Token ID1262
Feature activation-2.18e-02

Pos logit contributions
 Pengu+1.042
ーテ+1.040
hene+0.956
sten+0.928
ゴン+0.918
Neg logit contributions
 AD-0.905
 Lak-0.881
ellar-0.835
 ACT-0.832
 Cent-0.815
Token that
Token ID326
Feature activation+1.69e-01

Pos logit contributions
 Pengu+0.179
ーテ+0.178
hene+0.162
sten+0.158
ゴン+0.157
Neg logit contributions
 AD-0.158
 Lak-0.153
ellar-0.146
 ACT-0.144
 Cent-0.143
Token cannon
Token ID21202
Feature activation-2.01e-01

Pos logit contributions
 AD+1.226
 Lak+1.189
ellar+1.120
 ACT+1.120
 Cent+1.113
Neg logit contributions
 Pengu-1.380
ーテ-1.380
hene-1.259
ゴン-1.228
sten-1.223
Token against
Token ID1028
Feature activation-7.81e-02

Pos logit contributions
 Pengu+1.582
ーテ+1.560
hene+1.442
sten+1.393
ゴン+1.386
Neg logit contributions
 AD-1.492
 Lak-1.449
 ACT-1.375
ellar-1.374
 Cent-1.369
Token Titans
Token ID18711
Feature activation+7.54e-02

Pos logit contributions
 Pengu+0.636
ーテ+0.633
hene+0.577
sten+0.562
ゴン+0.561
Neg logit contributions
 AD-0.569
 Lak-0.550
ellar-0.523
 ACT-0.520
 Cent-0.514
Token Tower
Token ID8765
Feature activation-5.45e-01

Pos logit contributions
 AD+0.525
 Lak+0.507
 ACT+0.478
ellar+0.475
 Cent+0.473
Neg logit contributions
ーテ-0.643
 Pengu-0.642
hene-0.589
ゴン-0.576
sten-0.572
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 Pengu+4.283
ーテ+4.229
hene+3.935
sten+3.812
punk+3.695
Neg logit contributions
 AD-3.868
 Lak-3.790
ellar-3.648
 ACT-3.582
 Cent-3.480
Token But
Token ID887
Feature activation-7.33e-02

Pos logit contributions
 Pengu+1.113
ーテ+1.098
hene+1.016
sten+0.990
ゴン+0.959
Neg logit contributions
 AD-0.921
 Lak-0.901
ellar-0.886
iel-0.846
 ACT-0.841
Token Glen
Token ID20716
Feature activation-1.48e-01

Pos logit contributions
 Pengu+0.601
ーテ+0.591
hene+0.549
sten+0.534
ゴン+0.524
Neg logit contributions
 AD-0.522
 Lak-0.509
ellar-0.487
 ACT-0.477
 Cent-0.472
Token suggested
Token ID5220
Feature activation+6.85e-03

Pos logit contributions
 Pengu+1.192
ーテ+1.174
hene+1.086
sten+1.060
ゴン+1.039
Neg logit contributions
 AD-1.068
 Lak-1.043
ellar-0.991
 Cent-0.978
 ACT-0.975
Token pulling
Token ID10427
Feature activation+2.26e-01

Pos logit contributions
 AD+0.048
 Lak+0.047
ellar+0.045
 ACT+0.044
 Cent+0.044
Neg logit contributions
 Pengu-0.057
ーテ-0.057
hene-0.052
sten-0.051
ゴン-0.050
Token)
Token ID8
Feature activation-2.01e-02

Pos logit contributions
 AD+0.255
 Lak+0.244
 ACT+0.230
 Cent+0.228
ellar+0.227
Neg logit contributions
ーテ-0.343
 Pengu-0.341
hene-0.313
ゴン-0.309
sten-0.305
Token
Token ID198
Feature activation-5.42e-02

Pos logit contributions
ーテ+0.185
 Pengu+0.185
hene+0.170
ゴン+0.167
sten+0.166
Neg logit contributions
 AD-0.124
 Lak-0.120
ellar-0.114
 ACT-0.112
 Cent-0.111
Token
Token ID198
Feature activation-1.37e-01

Pos logit contributions
ーテ+0.440
 Pengu+0.437
hene+0.422
sten+0.414
punk+0.400
Neg logit contributions
 AD-0.364
 Lak-0.359
ellar-0.343
 ACT-0.332
 Cent-0.331
TokenStory
Token ID11605
Feature activation+1.68e-02

Pos logit contributions
ーテ+1.079
 Pengu+1.075
hene+1.001
ゴン+0.982
sten+0.978
Neg logit contributions
 AD-1.010
 Lak-0.993
ellar-0.941
 ACT-0.938
 Cent-0.928
Token continues
Token ID4477
Feature activation-4.89e-02

Pos logit contributions
 AD+0.121
 Lak+0.118
ellar+0.112
 ACT+0.111
 Cent+0.109
Neg logit contributions
ーテ-0.138
 Pengu-0.137
hene-0.126
ゴン-0.123
sten-0.123
Token
Token ID198
Feature activation-6.73e-01

Pos logit contributions
 Pengu+0.412
ーテ+0.411
hene+0.381
sten+0.369
ゴン+0.369
Neg logit contributions
 AD-0.338
 Lak-0.329
ellar-0.309
 ACT-0.308
 Cent-0.306
Token
Token ID198
Feature activation-5.57e-01

Pos logit contributions
hene+4.206
sten+4.200
punk+4.047
ーテ+4.039
 Disorder+3.967
Neg logit contributions
 Lak-4.953
ellar-4.899
 AD-4.881
 coh-4.677
iel-4.617
TokenThere
Token ID1858
Feature activation-2.92e-01

Pos logit contributions
ーテ+3.823
 Pengu+3.816
hene+3.628
sten+3.592
ゴン+3.564
Neg logit contributions
 Lak-4.307
 AD-4.304
ellar-4.111
 ACT-4.076
 Cent-4.009
Token were
Token ID547
Feature activation+3.98e-02

Pos logit contributions
 Pengu+2.212
ーテ+2.186
hene+2.045
sten+1.981
ゴン+1.956
Neg logit contributions
 AD-2.233
 Lak-2.180
ellar-2.083
 ACT-2.060
 Cent-2.054
Token a
Token ID257
Feature activation-2.66e-02

Pos logit contributions
 AD+0.288
 Lak+0.280
ellar+0.268
 ACT+0.264
 Cent+0.262
Neg logit contributions
 Pengu-0.323
ーテ-0.322
hene-0.294
sten-0.285
ゴン-0.285
Token thousand
Token ID7319
Feature activation-3.43e-01

Pos logit contributions
 Pengu+0.208
ーテ+0.207
hene+0.190
ゴン+0.184
sten+0.184
Neg logit contributions
 AD-0.202
 Lak-0.195
ellar-0.186
 ACT-0.185
 Cent-0.183
Token
Token ID140
Feature activation-3.88e-01

Pos logit contributions
ーテ+1.255
 Pengu+1.246
hene+1.105
ゴン+1.061
sten+1.054
Neg logit contributions
 AD-2.091
 Lak-2.044
ellar-1.962
 ACT-1.960
 Cent-1.944
Token
Token ID114
Feature activation-1.27e-01

Pos logit contributions
ーテ+2.367
 Pengu+2.294
hene+2.091
ゴン+1.991
sten+1.974
Neg logit contributions
 AD-3.628
 Lak-3.507
ellar-3.374
 ACT-3.362
 Cent-3.320
Tokenе
Token ID16843
Feature activation-2.04e-01

Pos logit contributions
ーテ+0.692
 Pengu+0.682
hene+0.593
ゴン+0.574
sten+0.568
Neg logit contributions
 AD-1.272
 Lak-1.239
 ACT-1.193
ellar-1.186
 Cent-1.181
Token
Token ID220
Feature activation-2.28e-01

Pos logit contributions
ーテ+1.390
 Pengu+1.378
hene+1.236
ゴン+1.209
sten+1.201
Neg logit contributions
 AD-1.742
 Lak-1.695
ellar-1.640
 ACT-1.614
 Cent-1.600
Tokenя
Token ID40623
Feature activation-2.70e-01

Pos logit contributions
ーテ+1.274
 Pengu+1.257
hene+1.109
ゴン+1.068
sten+1.055
Neg logit contributions
 AD-2.245
 Lak-2.180
ellar-2.114
 ACT-2.104
 Cent-2.080
Tokenв
Token ID38857
Feature activation-7.61e-01

Pos logit contributions
ーテ+2.084
 Pengu+2.058
hene+1.872
sten+1.817
ゴン+1.814
Neg logit contributions
 AD-2.047
 Lak-2.005
ellar-1.932
 ACT-1.886
 Cent-1.859
Tokenл
Token ID30143
Feature activation-5.05e-01

Pos logit contributions
ーテ+3.349
 Pengu+3.342
hene+2.876
sten+2.759
ゴン+2.707
Neg logit contributions
 AD-7.894
 Lak-7.817
 ACT-7.470
ellar-7.464
 Cent-7.411
Tokenя
Token ID40623
Feature activation-3.19e-01

Pos logit contributions
ーテ+1.892
 Pengu+1.864
hene+1.510
ゴン+1.477
sten+1.382
Neg logit contributions
 AD-5.770
 Lak-5.646
ellar-5.526
 ACT-5.497
 Cent-5.402
Tokenе
Token ID16843
Feature activation-2.85e-01

Pos logit contributions
ーテ+2.071
 Pengu+2.062
hene+1.853
ゴン+1.803
sten+1.799
Neg logit contributions
 AD-2.759
 Lak-2.713
ellar-2.647
 ACT-2.588
 Cent-2.535
Tokenт
Token ID20375
Feature activation-3.12e-01

Pos logit contributions
ーテ+1.909
 Pengu+1.880
hene+1.708
sten+1.660
ゴン+1.619
Neg logit contributions
 AD-2.422
 Lak-2.369
ellar-2.315
 ACT-2.252
 Cent-2.227
Tokenс
Token ID21727
Feature activation-7.89e-01

Pos logit contributions
ーテ+2.337
 Pengu+2.327
hene+2.122
ゴン+2.050
sten+2.049
Neg logit contributions
 AD-2.420
 Lak-2.371
ellar-2.281
 ACT-2.231
 Cent-2.208