TOP ACTIVATIONS
MAX = 0.453

have to get a prost hetic penis and God help me
including paraly tic and diarr hetic shell fish poisoning in humans
loss to Argentina in Buenos Aires in May 2010 .
discriminating on me cause I m a
Mexico . Ex amination of Jake included the first
: 47 ] < bra aah hh > dude nice [
by either producing a valid Emirates ID or passport with a
the exp iry of their Emirates ID or visa to avoid
the match between United Arab Emirates and Palestine in Group A
WC MC ; Prot ected Planet . net In
does Obama think he is kidding ? In a series of
osh , you gotta be kidding me . This is a
or in personal interviews are kidding themselves , and misleading everyone
. 7 10 United Arab Emirates 19 Having a
ars cape . ( Just kidding there is no limit
) properties , something synt hetics aren t able
Ok ak opa 5 , 2017 ma N
Ok ak opa 19 , 2017 me Ka
) and one me ad ery . Thomas called it the
and their influence - p edd ling hidden , is something

MIN ACTIVATIONS
MIN = -0.476

jas Gil ( M ov istar Team ) s . t
How do you maintain a semblance of normal ? Given the
bunker oil , was r ammed by a cargo vessel in
is in session at Oak sterdam , a how - to
Joe N ieu w end yk to the Devils in exchange
next to us ( Call ahan Run ). Temperature crept up
acids for fuel . Inter mitt ent fasting and spending more
. N ieu w end yk is currently the Stars
says Mary Bros n ahan , the executive director of
haven 't been par av irtual ized yet . I also
Her itage Foundation , an educational nonprofit
Massachusetts pol , House Speaker Tip O ' Neill , could
Friday , including Is f ahan , Tab riz , Yaz
s his stem , or the
. The D obbies Garden World , at the
, as co - pres idents and co - C OO
Source : Es uter u Leave a
unto thy self . De uter onomy 20 : 12 .
neutral ." D obbies indoor plant advisor Jimmy Mc
be . Mal colm Grim ston , a nuclear

INTERVAL 0.220 - 0.453
CONTAINS 0.137%

for the remaining 12 individual seats which represent different segments of
digging into your bag of tricks to make X - 0
as top - hat Eddie because of
systems . While these products appeal especially to academic libraries ,
found guilty of price - fix ing up to 10 %

INTERVAL -0.012 - 0.220
CONTAINS 71.771%

big enough that you could see his outline on the horizon
not part of that union , that is how business operates
not quite sure why this is necessary . Probably so I
that he s become increasingly worried about in
, making it illegal for them to bring the weapon into

INTERVAL -0.244 - -0.012
CONTAINS 27.992%

be prepared to strike military targets of the Assad regime ,
Date : Nov . 24 , 1944 Tre
L ister told the Guardian . The Natural History
2012 , 53 percent of respondents to an NBC News /
Newsweek in 1994 . " I mean , I know what

INTERVAL -0.476 - -0.244
CONTAINS 0.100%

out to focus on undergrad uates . O urs is a
mint j ule p after mint j ule p at his
but the Gear 's boot loader doesn 't notice it .
know about are just the tip of an iceberg , one
Col onel David L apan , a Pentagon spokesman ,
Token have
Token ID423
Feature activation+2.31e-03

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token to
Token ID284
Feature activation+9.46e-03

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token get
Token ID651
Feature activation-1.58e-02

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token a
Token ID257
Feature activation+9.98e-03

Pos logit contributions
andise+0.015
byss+0.013
untled+0.012
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
hetic-0.009
ˈ-0.009
Token prost
Token ID11825
Feature activation+1.34e-01

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
hetic+0.007
SIGN+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Tokenhetic
Token ID6587
Feature activation+4.53e-01

Pos logit contributions
@#&+0.032
cause+0.016
Narr+0.015
SIGN+0.012
ˈ+0.011
Neg logit contributions
andise-0.191
byss-0.179
untled-0.171
ratulations-0.168
istar-0.167
Token penis
Token ID16360
Feature activation+4.71e-02

Pos logit contributions
@#&+0.351
cause+0.297
Narr+0.295
hetic+0.284
SIGN+0.284
Neg logit contributions
andise-0.404
byss-0.366
untled-0.337
ratulations-0.322
istar-0.321
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
@#&+0.036
cause+0.031
Narr+0.030
hetic+0.029
SIGN+0.029
Neg logit contributions
andise-0.042
byss-0.038
untled-0.035
ratulations-0.034
istar-0.034
Token God
Token ID1793
Feature activation-1.57e-03

Pos logit contributions
@#&+0.013
cause+0.011
Narr+0.011
SIGN+0.011
hetic+0.010
Neg logit contributions
andise-0.016
byss-0.014
untled-0.013
ratulations-0.013
istar-0.013
Token help
Token ID1037
Feature activation+5.04e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
hetic-0.001
SIGN-0.001
Token me
Token ID502
Feature activation+5.00e-02

Pos logit contributions
@#&+0.035
cause+0.029
Narr+0.029
SIGN+0.027
hetic+0.027
Neg logit contributions
andise-0.049
byss-0.045
untled-0.042
ratulations-0.040
istar-0.040
Token including
Token ID1390
Feature activation-3.07e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token paraly
Token ID26901
Feature activation-2.87e-02

Pos logit contributions
andise+0.026
byss+0.023
untled+0.022
ratulations+0.021
istar+0.021
Neg logit contributions
@#&-0.025
cause-0.021
Narr-0.021
SIGN-0.021
hetic-0.020
Tokentic
Token ID13370
Feature activation-1.94e-02

Pos logit contributions
andise+0.024
byss+0.021
untled+0.019
ratulations+0.018
istar+0.018
Neg logit contributions
@#&-0.024
cause-0.021
Narr-0.021
hetic-0.020
SIGN-0.020
Token and
Token ID290
Feature activation+3.05e-03

Pos logit contributions
andise+0.017
byss+0.015
untled+0.014
ratulations+0.014
istar+0.014
Neg logit contributions
@#&-0.015
cause-0.013
Narr-0.013
SIGN-0.012
hetic-0.012
Token diarr
Token ID30409
Feature activation+6.22e-02

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Tokenhetic
Token ID6587
Feature activation+4.44e-01

Pos logit contributions
@#&+0.051
Narr+0.043
cause+0.043
SIGN+0.041
ˈ+0.041
Neg logit contributions
andise-0.053
byss-0.047
untled-0.044
istar-0.042
ratulations-0.042
Token shell
Token ID7582
Feature activation+5.53e-02

Pos logit contributions
@#&+0.344
cause+0.292
Narr+0.288
hetic+0.279
SIGN+0.279
Neg logit contributions
andise-0.397
byss-0.358
untled-0.331
ratulations-0.317
istar-0.314
Tokenfish
Token ID11084
Feature activation-1.02e-02

Pos logit contributions
@#&+0.042
cause+0.035
Narr+0.035
SIGN+0.034
hetic+0.033
Neg logit contributions
andise-0.050
byss-0.046
untled-0.042
ratulations-0.040
istar-0.040
Token poisoning
Token ID22475
Feature activation+9.66e-03

Pos logit contributions
andise+0.009
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.007
SIGN-0.007
hetic-0.007
Token in
Token ID287
Feature activation+5.07e-03

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.008
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token humans
Token ID5384
Feature activation-7.72e-02

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
SIGN+0.003
ˈ+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token loss
Token ID2994
Feature activation-1.33e-01

Pos logit contributions
andise+0.027
byss+0.024
untled+0.022
ratulations+0.021
istar+0.021
Neg logit contributions
@#&-0.026
cause-0.022
Narr-0.022
SIGN-0.021
hetic-0.021
Token to
Token ID284
Feature activation-9.80e-03

Pos logit contributions
andise+0.116
byss+0.104
untled+0.096
ratulations+0.092
istar+0.092
Neg logit contributions
@#&-0.106
cause-0.090
Narr-0.089
SIGN-0.086
hetic-0.086
Token Argentina
Token ID16519
Feature activation-5.51e-02

Pos logit contributions
andise+0.009
byss+0.008
untled+0.008
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token in
Token ID287
Feature activation-9.76e-03

Pos logit contributions
andise+0.048
byss+0.044
untled+0.040
ratulations+0.039
istar+0.038
Neg logit contributions
@#&-0.044
cause-0.037
Narr-0.036
SIGN-0.035
ˈ-0.035
Token Buenos
Token ID46210
Feature activation+3.39e-02

Pos logit contributions
andise+0.009
byss+0.008
untled+0.008
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token Aires
Token ID44692
Feature activation+4.16e-01

Pos logit contributions
@#&+0.005
cause+0.001
Narr+0.001
ˈ+0.000
SIGN+0.000
Neg logit contributions
andise-0.051
byss-0.048
untled-0.046
ratulations-0.045
istar-0.045
Token in
Token ID287
Feature activation-4.68e-03

Pos logit contributions
@#&+0.327
cause+0.276
Narr+0.274
SIGN+0.266
ˈ+0.264
Neg logit contributions
andise-0.367
byss-0.332
untled-0.306
ratulations-0.292
istar-0.290
Token May
Token ID1737
Feature activation+2.91e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.004
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token 2010
Token ID3050
Feature activation-4.01e-02

Pos logit contributions
@#&+0.022
cause+0.019
Narr+0.019
SIGN+0.018
hetic+0.018
Neg logit contributions
andise-0.026
byss-0.024
untled-0.022
ratulations-0.021
istar-0.021
Token.
Token ID13
Feature activation-9.03e-03

Pos logit contributions
andise+0.034
byss+0.031
untled+0.028
ratulations+0.027
istar+0.027
Neg logit contributions
@#&-0.033
cause-0.028
Narr-0.027
SIGN-0.027
hetic-0.027
Token
Token ID198
Feature activation-5.55e-03

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
ˈ-0.006
Token discriminating
Token ID48212
Feature activation+2.50e-02

Pos logit contributions
@#&+0.008
cause+0.006
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.008
ratulations-0.007
istar-0.007
Token on
Token ID319
Feature activation-3.15e-03

Pos logit contributions
@#&+0.019
cause+0.016
Narr+0.016
SIGN+0.015
hetic+0.015
Neg logit contributions
andise-0.023
byss-0.021
untled-0.019
ratulations-0.019
istar-0.018
Token me
Token ID502
Feature activation+4.01e-02

Pos logit contributions
andise+0.003
byss+0.003
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token
Token ID564
Feature activation+1.55e-02

Pos logit contributions
@#&+0.030
cause+0.026
Narr+0.025
SIGN+0.024
hetic+0.024
Neg logit contributions
andise-0.037
byss-0.033
untled-0.031
ratulations-0.029
istar-0.029
Token
Token ID247
Feature activation+4.59e-03

Pos logit contributions
@#&+0.010
cause+0.008
Narr+0.008
SIGN+0.008
hetic+0.008
Neg logit contributions
andise-0.016
byss-0.014
untled-0.013
ratulations-0.013
istar-0.013
Tokencause
Token ID25587
Feature activation+4.05e-01

Pos logit contributions
@#&+0.003
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.004
ratulations-0.004
istar-0.003
Token I
Token ID314
Feature activation+1.64e-02

Pos logit contributions
@#&+0.311
cause+0.263
Narr+0.259
SIGN+0.250
hetic+0.250
Neg logit contributions
andise-0.365
byss-0.331
untled-0.305
ratulations-0.292
istar-0.292
Token
Token ID447
Feature activation+2.39e-03

Pos logit contributions
@#&+0.013
cause+0.011
Narr+0.011
ˈ+0.011
SIGN+0.011
Neg logit contributions
andise-0.014
byss-0.013
untled-0.012
ratulations-0.011
istar-0.011
Token
Token ID247
Feature activation+7.96e-03

Pos logit contributions
@#&+0.002
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Tokenm
Token ID76
Feature activation-1.08e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
ˈ+0.005
SIGN+0.005
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.006
istar-0.006
Token a
Token ID257
Feature activation+9.89e-03

Pos logit contributions
andise+0.010
byss+0.009
untled+0.008
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.007
SIGN-0.007
ˈ-0.007
Token Mexico
Token ID5828
Feature activation+2.12e-02

Pos logit contributions
@#&+0.036
cause+0.030
Narr+0.029
hetic+0.028
SIGN+0.028
Neg logit contributions
andise-0.055
byss-0.051
untled-0.047
ratulations-0.045
wegian-0.045
Token.
Token ID13
Feature activation+1.40e-03

Pos logit contributions
@#&+0.018
cause+0.015
Narr+0.015
SIGN+0.014
ˈ+0.014
Neg logit contributions
andise-0.018
byss-0.016
untled-0.015
istar-0.014
ratulations-0.014
Token
Token ID198
Feature activation-5.35e-03

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token
Token ID198
Feature activation-2.87e-03

Pos logit contributions
andise+0.005
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.004
SIGN-0.003
hetic-0.003
TokenEx
Token ID3109
Feature activation+1.03e-01

Pos logit contributions
andise+0.003
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Tokenamination
Token ID24979
Feature activation+4.01e-01

Pos logit contributions
@#&+0.077
cause+0.065
Narr+0.064
SIGN+0.062
ˈ+0.062
Neg logit contributions
andise-0.095
byss-0.086
untled-0.079
ratulations-0.076
istar-0.076
Token of
Token ID286
Feature activation-5.60e-03

Pos logit contributions
@#&+0.310
cause+0.262
Narr+0.259
SIGN+0.250
hetic+0.250
Neg logit contributions
andise-0.358
byss-0.323
untled-0.299
ratulations-0.287
istar-0.285
Token Jake
Token ID14757
Feature activation-3.45e-03

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.004
SIGN-0.003
hetic-0.003
Token included
Token ID3017
Feature activation-1.57e-02

Pos logit contributions
andise+0.003
byss+0.003
untled+0.003
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token the
Token ID262
Feature activation-5.76e-03

Pos logit contributions
andise+0.014
byss+0.012
untled+0.011
ratulations+0.011
istar+0.011
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.010
hetic-0.010
Token first
Token ID717
Feature activation+4.39e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token:
Token ID25
Feature activation-5.19e-03

Pos logit contributions
@#&+0.011
cause+0.010
Narr+0.010
SIGN+0.009
ˈ+0.009
Neg logit contributions
andise-0.013
byss-0.012
untled-0.011
ratulations-0.011
istar-0.011
Token47
Token ID2857
Feature activation-5.05e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
istar+0.003
ratulations+0.003
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token]
Token ID60
Feature activation+8.39e-03

Pos logit contributions
andise+0.048
byss+0.043
untled+0.040
ratulations+0.039
istar+0.038
Neg logit contributions
@#&-0.037
cause-0.031
Narr-0.030
SIGN-0.029
ˈ-0.029
Token <
Token ID1279
Feature activation+6.03e-04

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Tokenbra
Token ID16057
Feature activation-8.30e-02

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.001
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Tokenaah
Token ID37500
Feature activation+3.90e-01

Pos logit contributions
andise+0.075
byss+0.068
untled+0.063
ratulations+0.060
istar+0.060
Neg logit contributions
@#&-0.063
cause-0.053
Narr-0.053
SIGN-0.051
hetic-0.051
Tokenhh
Token ID12337
Feature activation-9.83e-03

Pos logit contributions
@#&+0.296
cause+0.249
Narr+0.246
SIGN+0.237
hetic+0.237
Neg logit contributions
andise-0.355
byss-0.321
untled-0.297
ratulations-0.285
istar-0.284
Token>
Token ID29
Feature activation+3.13e-02

Pos logit contributions
andise+0.009
byss+0.008
untled+0.008
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token dude
Token ID18396
Feature activation-1.08e-01

Pos logit contributions
@#&+0.024
cause+0.020
Narr+0.020
hetic+0.019
SIGN+0.019
Neg logit contributions
andise-0.028
byss-0.025
untled-0.023
ratulations-0.022
istar-0.022
Token nice
Token ID3621
Feature activation-5.49e-02

Pos logit contributions
andise+0.096
byss+0.086
untled+0.080
ratulations+0.076
istar+0.076
Neg logit contributions
@#&-0.084
cause-0.071
Narr-0.070
hetic-0.068
SIGN-0.068
Token [
Token ID685
Feature activation-6.10e-02

Pos logit contributions
andise+0.047
byss+0.042
untled+0.038
ratulations+0.037
istar+0.037
Neg logit contributions
@#&-0.045
cause-0.038
Narr-0.038
hetic-0.037
ˈ-0.037
Token by
Token ID416
Feature activation-1.66e-03

Pos logit contributions
andise+0.054
byss+0.049
untled+0.045
ratulations+0.043
istar+0.043
Neg logit contributions
@#&-0.049
cause-0.041
Narr-0.041
SIGN-0.039
hetic-0.039
Token either
Token ID2035
Feature activation+1.00e-01

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token producing
Token ID9194
Feature activation+1.62e-02

Pos logit contributions
@#&+0.081
cause+0.068
Narr+0.068
hetic+0.065
SIGN+0.065
Neg logit contributions
andise-0.086
byss-0.077
untled-0.071
ratulations-0.068
istar-0.068
Token a
Token ID257
Feature activation+1.21e-02

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
SIGN+0.010
hetic+0.010
Neg logit contributions
andise-0.015
byss-0.013
untled-0.012
ratulations-0.012
istar-0.012
Token valid
Token ID4938
Feature activation+1.07e-01

Pos logit contributions
@#&+0.010
cause+0.008
Narr+0.008
SIGN+0.008
hetic+0.008
Neg logit contributions
andise-0.011
byss-0.010
untled-0.009
ratulations-0.009
istar-0.009
Token Emirates
Token ID24880
Feature activation+3.86e-01

Pos logit contributions
@#&+0.082
cause+0.069
Narr+0.068
SIGN+0.066
hetic+0.066
Neg logit contributions
andise-0.097
byss-0.087
untled-0.081
ratulations-0.078
istar-0.078
Token ID
Token ID4522
Feature activation+1.41e-02

Pos logit contributions
@#&+0.310
cause+0.263
Narr+0.261
SIGN+0.252
hetic+0.251
Neg logit contributions
andise-0.334
byss-0.299
untled-0.276
ratulations-0.265
istar-0.264
Token or
Token ID393
Feature activation-1.78e-02

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
SIGN+0.010
hetic+0.010
Neg logit contributions
andise-0.012
byss-0.010
untled-0.010
ratulations-0.009
istar-0.009
Token passport
Token ID17981
Feature activation+1.28e-01

Pos logit contributions
andise+0.015
byss+0.014
untled+0.013
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
hetic-0.012
SIGN-0.012
Token with
Token ID351
Feature activation+2.82e-03

Pos logit contributions
@#&+0.103
cause+0.087
Narr+0.087
SIGN+0.084
hetic+0.083
Neg logit contributions
andise-0.111
byss-0.100
untled-0.092
ratulations-0.088
istar-0.088
Token a
Token ID257
Feature activation+3.29e-03

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token the
Token ID262
Feature activation+3.34e-06

Pos logit contributions
andise+0.019
byss+0.017
untled+0.016
ratulations+0.015
istar+0.015
Neg logit contributions
@#&-0.016
cause-0.014
Narr-0.014
SIGN-0.013
ˈ-0.013
Token exp
Token ID1033
Feature activation+2.47e-01

Feature not active on prev token;
no predictions were affected.
Tokeniry
Token ID9045
Feature activation+1.78e-01

Pos logit contributions
@#&+0.118
cause+0.088
Narr+0.087
SIGN+0.081
hetic+0.081
Neg logit contributions
andise-0.294
byss-0.272
untled-0.257
ratulations-0.250
istar-0.249
Token of
Token ID286
Feature activation+6.71e-03

Pos logit contributions
@#&+0.160
cause+0.138
Narr+0.137
SIGN+0.133
ˈ+0.133
Neg logit contributions
andise-0.136
byss-0.120
untled-0.109
ratulations-0.105
istar-0.104
Token their
Token ID511
Feature activation-1.50e-02

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.006
byss-0.006
untled-0.005
ratulations-0.005
istar-0.005
Token Emirates
Token ID24880
Feature activation+3.80e-01

Pos logit contributions
andise+0.014
byss+0.012
untled+0.011
ratulations+0.011
istar+0.011
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
SIGN-0.009
ˈ-0.009
Token ID
Token ID4522
Feature activation+5.51e-03

Pos logit contributions
@#&+0.294
cause+0.248
Narr+0.246
SIGN+0.237
hetic+0.237
Neg logit contributions
andise-0.339
byss-0.307
untled-0.283
ratulations-0.271
istar-0.270
Token or
Token ID393
Feature activation-3.65e-02

Pos logit contributions
@#&+0.004
cause+0.004
Narr+0.004
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token visa
Token ID14552
Feature activation+3.18e-02

Pos logit contributions
andise+0.033
byss+0.030
untled+0.027
ratulations+0.026
istar+0.026
Neg logit contributions
@#&-0.028
cause-0.023
Narr-0.023
hetic-0.022
SIGN-0.022
Token to
Token ID284
Feature activation-4.50e-03

Pos logit contributions
@#&+0.025
cause+0.021
Narr+0.021
SIGN+0.021
hetic+0.020
Neg logit contributions
andise-0.028
byss-0.025
untled-0.023
ratulations-0.022
istar-0.022
Token avoid
Token ID3368
Feature activation-6.74e-03

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
hetic-0.003
SIGN-0.003
Token the
Token ID262
Feature activation-6.81e-03

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token match
Token ID2872
Feature activation-4.63e-02

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token between
Token ID1022
Feature activation-6.44e-02

Pos logit contributions
andise+0.041
byss+0.037
untled+0.034
ratulations+0.032
istar+0.032
Neg logit contributions
@#&-0.037
cause-0.031
Narr-0.031
SIGN-0.030
hetic-0.030
Token United
Token ID1578
Feature activation-4.72e-02

Pos logit contributions
andise+0.058
byss+0.052
untled+0.049
ratulations+0.047
istar+0.046
Neg logit contributions
@#&-0.049
cause-0.042
Narr-0.041
SIGN-0.040
hetic-0.040
Token Arab
Token ID4498
Feature activation+6.35e-02

Pos logit contributions
andise+0.046
byss+0.042
untled+0.039
ratulations+0.037
istar+0.037
Neg logit contributions
@#&-0.033
cause-0.027
Narr-0.027
SIGN-0.026
hetic-0.026
Token Emirates
Token ID24880
Feature activation+3.77e-01

Pos logit contributions
@#&+0.015
cause+0.008
Narr+0.007
hetic+0.006
SIGN+0.006
Neg logit contributions
andise-0.091
byss-0.085
untled-0.081
istar-0.079
ratulations-0.079
Token and
Token ID290
Feature activation+2.10e-03

Pos logit contributions
@#&+0.301
cause+0.256
Narr+0.253
SIGN+0.245
ˈ+0.244
Neg logit contributions
andise-0.328
byss-0.295
untled-0.272
ratulations-0.261
istar-0.259
Token Palestine
Token ID17263
Feature activation+1.07e-02

Pos logit contributions
@#&+0.002
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token in
Token ID287
Feature activation-2.47e-03

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.009
byss-0.009
untled-0.008
ratulations-0.008
istar-0.007
Token Group
Token ID4912
Feature activation+1.85e-02

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.001
hetic-0.001
Token A
Token ID317
Feature activation+1.47e-02

Pos logit contributions
@#&+0.015
cause+0.013
Narr+0.013
SIGN+0.012
hetic+0.012
Neg logit contributions
andise-0.016
byss-0.014
untled-0.013
ratulations-0.012
istar-0.012
TokenWC
Token ID27353
Feature activation+1.36e-01

Pos logit contributions
andise+0.003
byss+0.003
untled+0.003
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
ˈ-0.002
TokenMC
Token ID9655
Feature activation+3.83e-02

Pos logit contributions
@#&+0.106
cause+0.090
Narr+0.090
SIGN+0.087
ˈ+0.086
Neg logit contributions
andise-0.121
byss-0.109
untled-0.101
ratulations-0.096
istar-0.096
Token;
Token ID26
Feature activation+1.14e-02

Pos logit contributions
@#&+0.030
cause+0.026
Narr+0.025
SIGN+0.025
ˈ+0.024
Neg logit contributions
andise-0.034
byss-0.030
untled-0.028
ratulations-0.027
istar-0.027
Token Prot
Token ID5038
Feature activation-1.61e-01

Pos logit contributions
@#&+0.009
cause+0.008
Narr+0.008
SIGN+0.007
ˈ+0.007
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
Tokenected
Token ID11197
Feature activation-4.55e-02

Pos logit contributions
andise+0.132
byss+0.118
untled+0.108
ratulations+0.103
istar+0.102
Neg logit contributions
@#&-0.136
cause-0.118
Narr-0.117
hetic-0.113
ˈ-0.112
TokenPlanet
Token ID41801
Feature activation+3.76e-01

Pos logit contributions
andise+0.035
byss+0.031
untled+0.028
ratulations+0.027
istar+0.027
Neg logit contributions
@#&-0.041
cause-0.035
Narr-0.035
SIGN-0.034
hetic-0.034
Token.
Token ID13
Feature activation+5.87e-03

Pos logit contributions
@#&+0.298
cause+0.252
Narr+0.250
SIGN+0.241
hetic+0.241
Neg logit contributions
andise-0.330
byss-0.297
untled-0.274
ratulations-0.263
istar-0.262
Tokennet
Token ID3262
Feature activation+1.07e-01

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token
Token ID198
Feature activation+1.26e-03

Pos logit contributions
@#&+0.083
cause+0.070
Narr+0.070
SIGN+0.067
ˈ+0.067
Neg logit contributions
andise-0.095
byss-0.086
untled-0.079
ratulations-0.076
istar-0.075
Token
Token ID198
Feature activation+8.97e-03

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
ˈ+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
TokenIn
Token ID818
Feature activation-1.77e-02

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.005
ˈ+0.005
Neg logit contributions
andise-0.008
byss-0.007
untled-0.007
ratulations-0.007
istar-0.007
Token does
Token ID857
Feature activation+3.74e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token Obama
Token ID2486
Feature activation-1.11e-01

Pos logit contributions
@#&+0.029
cause+0.024
Narr+0.024
SIGN+0.023
hetic+0.023
Neg logit contributions
andise-0.034
byss-0.030
untled-0.028
ratulations-0.027
istar-0.027
Token think
Token ID892
Feature activation+1.01e-02

Pos logit contributions
andise+0.101
byss+0.092
untled+0.085
ratulations+0.081
istar+0.081
Neg logit contributions
@#&-0.084
cause-0.070
Narr-0.070
SIGN-0.067
ˈ-0.067
Token he
Token ID339
Feature activation+1.53e-02

Pos logit contributions
@#&+0.008
cause+0.006
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.008
ratulations-0.007
istar-0.007
Token is
Token ID318
Feature activation+7.06e-03

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
ˈ+0.010
SIGN+0.010
Neg logit contributions
andise-0.013
byss-0.012
untled-0.011
ratulations-0.011
istar-0.011
Token kidding
Token ID26471
Feature activation+3.73e-01

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.006
byss-0.005
untled-0.005
ratulations-0.005
istar-0.005
Token?
Token ID30
Feature activation+2.06e-02

Pos logit contributions
@#&+0.279
cause+0.235
Narr+0.233
hetic+0.224
SIGN+0.224
Neg logit contributions
andise-0.342
byss-0.311
untled-0.288
ratulations-0.277
istar-0.275
Token In
Token ID554
Feature activation+1.17e-02

Pos logit contributions
@#&+0.016
cause+0.013
Narr+0.013
SIGN+0.013
ˈ+0.013
Neg logit contributions
andise-0.018
byss-0.017
untled-0.015
ratulations-0.015
istar-0.015
Token a
Token ID257
Feature activation+4.52e-03

Pos logit contributions
@#&+0.009
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.011
byss-0.010
untled-0.009
ratulations-0.009
istar-0.009
Token series
Token ID2168
Feature activation-1.12e-01

Pos logit contributions
@#&+0.003
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.003
ratulations-0.003
istar-0.003
Token of
Token ID286
Feature activation+1.86e-03

Pos logit contributions
andise+0.102
byss+0.093
untled+0.086
ratulations+0.082
istar+0.082
Neg logit contributions
@#&-0.085
cause-0.072
Narr-0.071
SIGN-0.069
hetic-0.069
Tokenosh
Token ID3768
Feature activation-4.75e-02

Pos logit contributions
andise+0.023
byss+0.021
untled+0.019
istar+0.018
ratulations+0.018
Neg logit contributions
@#&-0.024
cause-0.021
Narr-0.021
hetic-0.020
SIGN-0.020
Token,
Token ID11
Feature activation-7.38e-04

Pos logit contributions
andise+0.042
byss+0.038
untled+0.035
istar+0.033
ratulations+0.033
Neg logit contributions
@#&-0.037
cause-0.032
Narr-0.031
hetic-0.030
SIGN-0.030
Token you
Token ID345
Feature activation-5.73e-03

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.000
Narr-0.000
SIGN-0.000
hetic-0.000
Token gotta
Token ID17753
Feature activation+3.19e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.003
Token be
Token ID307
Feature activation+9.59e-03

Pos logit contributions
@#&+0.023
cause+0.019
Narr+0.019
ˈ+0.018
SIGN+0.018
Neg logit contributions
andise-0.030
byss-0.027
untled-0.025
ratulations-0.024
istar-0.024
Token kidding
Token ID26471
Feature activation+3.72e-01

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.012
byss-0.011
untled-0.011
ratulations-0.010
istar-0.010
Token me
Token ID502
Feature activation+3.54e-02

Pos logit contributions
@#&+0.257
cause+0.213
Narr+0.210
hetic+0.202
SIGN+0.201
Neg logit contributions
andise-0.364
byss-0.332
untled-0.309
ratulations-0.297
istar-0.296
Token.
Token ID13
Feature activation+2.87e-04

Pos logit contributions
@#&+0.028
cause+0.024
Narr+0.024
hetic+0.023
SIGN+0.023
Neg logit contributions
andise-0.031
byss-0.028
untled-0.026
ratulations-0.025
istar-0.025
Token This
Token ID770
Feature activation+5.31e-02

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.000
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Token is
Token ID318
Feature activation+8.12e-03

Pos logit contributions
@#&+0.042
cause+0.036
Narr+0.036
SIGN+0.034
ˈ+0.034
Neg logit contributions
andise-0.046
byss-0.041
untled-0.038
ratulations-0.037
istar-0.037
Token a
Token ID257
Feature activation-3.17e-03

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token or
Token ID393
Feature activation-3.27e-02

Pos logit contributions
andise+0.093
byss+0.084
untled+0.077
ratulations+0.074
istar+0.074
Neg logit contributions
@#&-0.081
cause-0.068
Narr-0.068
SIGN-0.065
hetic-0.065
Token in
Token ID287
Feature activation-1.21e-02

Pos logit contributions
andise+0.029
byss+0.026
untled+0.024
ratulations+0.023
istar+0.023
Neg logit contributions
@#&-0.026
cause-0.022
Narr-0.022
SIGN-0.021
hetic-0.021
Token personal
Token ID2614
Feature activation-1.30e-02

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.009
istar+0.008
Neg logit contributions
@#&-0.010
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token interviews
Token ID9299
Feature activation-2.52e-02

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token are
Token ID389
Feature activation+4.18e-03

Pos logit contributions
andise+0.023
byss+0.021
untled+0.019
ratulations+0.018
istar+0.018
Neg logit contributions
@#&-0.019
cause-0.016
Narr-0.016
SIGN-0.015
hetic-0.015
Token kidding
Token ID26471
Feature activation+3.71e-01

Pos logit contributions
@#&+0.003
cause+0.003
Narr+0.003
SIGN+0.002
ˈ+0.002
Neg logit contributions
andise-0.004
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token themselves
Token ID2405
Feature activation-2.82e-02

Pos logit contributions
@#&+0.264
cause+0.220
Narr+0.219
hetic+0.210
SIGN+0.210
Neg logit contributions
andise-0.353
byss-0.323
untled-0.299
ratulations-0.288
istar-0.286
Token,
Token ID11
Feature activation-2.36e-03

Pos logit contributions
andise+0.025
byss+0.023
untled+0.021
ratulations+0.020
istar+0.020
Neg logit contributions
@#&-0.022
cause-0.019
Narr-0.019
SIGN-0.018
hetic-0.018
Token and
Token ID290
Feature activation-3.95e-03

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.001
hetic-0.001
Token misleading
Token ID15850
Feature activation-9.98e-02

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.002
hetic-0.002
Token everyone
Token ID2506
Feature activation-2.57e-02

Pos logit contributions
andise+0.092
byss+0.084
untled+0.078
ratulations+0.075
istar+0.074
Neg logit contributions
@#&-0.074
cause-0.062
Narr-0.062
SIGN-0.059
hetic-0.059
Token.
Token ID13
Feature activation-7.15e-03

Pos logit contributions
andise+0.019
byss+0.017
untled+0.015
ratulations+0.015
istar+0.015
Neg logit contributions
@#&-0.018
cause-0.015
Narr-0.015
SIGN-0.014
ˈ-0.014
Token7
Token ID22
Feature activation+1.05e-02

Pos logit contributions
andise+0.006
byss+0.006
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token 10
Token ID838
Feature activation-1.83e-02

Pos logit contributions
@#&+0.009
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.009
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token United
Token ID1578
Feature activation-5.28e-02

Pos logit contributions
andise+0.017
byss+0.015
untled+0.014
ratulations+0.014
istar+0.014
Neg logit contributions
@#&-0.014
cause-0.011
Narr-0.011
SIGN-0.011
hetic-0.011
Token Arab
Token ID4498
Feature activation+6.69e-02

Pos logit contributions
andise+0.052
byss+0.048
untled+0.044
ratulations+0.043
istar+0.043
Neg logit contributions
@#&-0.036
cause-0.029
Narr-0.029
SIGN-0.028
ˈ-0.028
Token Emirates
Token ID24880
Feature activation+3.69e-01

Pos logit contributions
@#&+0.014
cause+0.006
Narr+0.006
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.097
byss-0.091
untled-0.087
ratulations-0.085
istar-0.085
Token 19
Token ID678
Feature activation-1.46e-02

Pos logit contributions
@#&+0.293
cause+0.249
Narr+0.247
SIGN+0.240
ˈ+0.239
Neg logit contributions
andise-0.321
byss-0.292
untled-0.268
ratulations-0.259
istar-0.254
Token
Token ID198
Feature activation-6.65e-03

Pos logit contributions
andise+0.013
byss+0.011
untled+0.010
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
SIGN-0.010
hetic-0.010
Token
Token ID198
Feature activation-1.09e-02

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
TokenHaving
Token ID14698
Feature activation-4.35e-02

Pos logit contributions
andise+0.010
byss+0.009
untled+0.008
istar+0.008
ratulations+0.008
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.007
hetic-0.007
SIGN-0.007
Token a
Token ID257
Feature activation-3.79e-03

Pos logit contributions
andise+0.039
byss+0.035
untled+0.033
ratulations+0.031
istar+0.031
Neg logit contributions
@#&-0.033
cause-0.028
Narr-0.028
SIGN-0.027
hetic-0.027
Tokenars
Token ID945
Feature activation-2.40e-02

Pos logit contributions
@#&+0.025
cause+0.021
Narr+0.021
SIGN+0.020
hetic+0.020
Neg logit contributions
andise-0.029
byss-0.026
untled-0.024
istar-0.023
ratulations-0.023
Tokencape
Token ID36435
Feature activation-7.93e-02

Pos logit contributions
andise+0.019
byss+0.017
untled+0.016
ratulations+0.015
istar+0.015
Neg logit contributions
@#&-0.021
cause-0.018
Narr-0.018
SIGN-0.017
ˈ-0.017
Token.
Token ID13
Feature activation-9.87e-03

Pos logit contributions
andise+0.070
byss+0.063
untled+0.058
ratulations+0.056
istar+0.055
Neg logit contributions
@#&-0.063
cause-0.053
Narr-0.053
SIGN-0.051
hetic-0.051
Token (
Token ID357
Feature activation-6.07e-03

Pos logit contributions
andise+0.009
byss+0.008
untled+0.008
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.008
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
TokenJust
Token ID5703
Feature activation-7.31e-02

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
ˈ-0.004
Token kidding
Token ID26471
Feature activation+3.66e-01

Pos logit contributions
andise+0.069
byss+0.063
untled+0.058
ratulations+0.056
istar+0.056
Neg logit contributions
@#&-0.053
cause-0.044
Narr-0.043
SIGN-0.041
hetic-0.041
Token
Token ID851
Feature activation-8.98e-03

Pos logit contributions
@#&+0.292
cause+0.249
Narr+0.247
hetic+0.238
SIGN+0.238
Neg logit contributions
andise-0.318
byss-0.286
untled-0.264
ratulations-0.253
istar-0.251
Token there
Token ID612
Feature activation+5.64e-04

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.007
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token is
Token ID318
Feature activation-1.49e-02

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
hetic+0.000
SIGN+0.000
Neg logit contributions
andise-0.000
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Token no
Token ID645
Feature activation-3.15e-02

Pos logit contributions
andise+0.014
byss+0.012
untled+0.011
ratulations+0.011
istar+0.011
Neg logit contributions
@#&-0.011
cause-0.009
Narr-0.009
SIGN-0.009
hetic-0.009
Token limit
Token ID4179
Feature activation-1.18e-01

Pos logit contributions
andise+0.028
byss+0.025
untled+0.023
ratulations+0.022
istar+0.022
Neg logit contributions
@#&-0.025
cause-0.021
Narr-0.021
SIGN-0.020
hetic-0.020
Token)
Token ID8
Feature activation+8.74e-03

Pos logit contributions
andise+0.045
byss+0.040
untled+0.037
ratulations+0.036
istar+0.036
Neg logit contributions
@#&-0.039
cause-0.033
Narr-0.033
SIGN-0.032
hetic-0.032
Token properties
Token ID6608
Feature activation-8.60e-02

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
ˈ+0.006
Neg logit contributions
andise-0.008
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token,
Token ID11
Feature activation-9.42e-03

Pos logit contributions
andise+0.075
byss+0.068
untled+0.062
ratulations+0.060
istar+0.059
Neg logit contributions
@#&-0.068
cause-0.058
Narr-0.058
SIGN-0.056
hetic-0.055
Token something
Token ID1223
Feature activation+3.69e-03

Pos logit contributions
andise+0.008
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
ˈ-0.006
Token synt
Token ID7419
Feature activation-3.02e-02

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Tokenhetics
Token ID24965
Feature activation+3.61e-01

Pos logit contributions
andise+0.036
byss+0.033
untled+0.032
ratulations+0.031
istar+0.031
Neg logit contributions
@#&-0.014
cause-0.011
Narr-0.010
SIGN-0.010
ˈ-0.010
Token aren
Token ID3588
Feature activation+2.28e-02

Pos logit contributions
@#&+0.269
cause+0.227
Narr+0.224
ˈ+0.217
hetic+0.217
Neg logit contributions
andise-0.331
byss-0.303
untled-0.278
ratulations-0.268
istar-0.266
Token
Token ID447
Feature activation-2.81e-03

Pos logit contributions
@#&+0.016
cause+0.013
Narr+0.013
SIGN+0.012
ˈ+0.012
Neg logit contributions
andise-0.022
byss-0.021
untled-0.019
ratulations-0.019
istar-0.018
Token
Token ID247
Feature activation-2.60e-03

Pos logit contributions
andise+0.003
byss+0.003
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
hetic-0.001
SIGN-0.001
Tokent
Token ID83
Feature activation+3.79e-03

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
hetic-0.002
SIGN-0.002
Token able
Token ID1498
Feature activation+5.37e-02

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
hetic+0.002
ˈ+0.002
Neg logit contributions
andise-0.004
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token
Token ID220
Feature activation-5.46e-02

Pos logit contributions
@#&+0.052
cause+0.044
Narr+0.043
SIGN+0.042
ˈ+0.042
Neg logit contributions
andise-0.062
byss-0.056
untled-0.052
ratulations-0.050
istar-0.050
Token
Token ID134
Feature activation+9.05e-03

Pos logit contributions
andise+0.050
byss+0.045
untled+0.042
istar+0.040
ratulations+0.040
Neg logit contributions
@#&-0.041
cause-0.034
Narr-0.034
SIGN-0.033
hetic-0.033
Token
Token ID119
Feature activation+9.32e-02

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
ˈ+0.004
SIGN+0.004
Neg logit contributions
andise-0.010
byss-0.009
untled-0.009
ratulations-0.008
istar-0.008
TokenOk
Token ID18690
Feature activation-2.73e-02

Pos logit contributions
@#&+0.071
cause+0.060
Narr+0.059
SIGN+0.057
hetic+0.057
Neg logit contributions
andise-0.085
byss-0.076
untled-0.071
ratulations-0.068
istar-0.068
Tokenak
Token ID461
Feature activation-9.32e-02

Pos logit contributions
andise+0.025
byss+0.023
untled+0.021
ratulations+0.020
istar+0.020
Neg logit contributions
@#&-0.021
cause-0.017
Narr-0.017
ˈ-0.017
hetic-0.017
Tokenopa
Token ID26186
Feature activation+3.57e-01

Pos logit contributions
andise+0.085
byss+0.077
untled+0.071
ratulations+0.069
istar+0.068
Neg logit contributions
@#&-0.070
cause-0.059
Narr-0.058
SIGN-0.056
hetic-0.056
Token 5
Token ID642
Feature activation+1.26e-02

Pos logit contributions
@#&+0.275
cause+0.232
Narr+0.230
ˈ+0.221
SIGN+0.221
Neg logit contributions
andise-0.320
byss-0.289
untled-0.268
ratulations-0.256
istar-0.255
Token,
Token ID11
Feature activation+1.89e-03

Pos logit contributions
@#&+0.010
cause+0.008
Narr+0.008
SIGN+0.008
hetic+0.008
Neg logit contributions
andise-0.011
byss-0.010
untled-0.009
ratulations-0.009
istar-0.009
Token 2017
Token ID2177
Feature activation+3.55e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
ˈ+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.001
ratulations-0.001
istar-0.001
Token ma
Token ID17266
Feature activation+6.76e-02

Pos logit contributions
@#&+0.028
cause+0.023
Narr+0.023
SIGN+0.022
ˈ+0.022
Neg logit contributions
andise-0.031
byss-0.028
untled-0.026
ratulations-0.025
istar-0.025
Token N
Token ID399
Feature activation-2.26e-02

Pos logit contributions
@#&+0.052
cause+0.044
Narr+0.044
ˈ+0.042
SIGN+0.042
Neg logit contributions
andise-0.060
byss-0.054
untled-0.050
ratulations-0.048
istar-0.048
Token
Token ID220
Feature activation-7.07e-02

Pos logit contributions
@#&+0.042
cause+0.035
Narr+0.035
ˈ+0.033
SIGN+0.033
Neg logit contributions
andise-0.049
byss-0.044
untled-0.041
ratulations-0.039
istar-0.039
Token
Token ID134
Feature activation-1.19e-03

Pos logit contributions
andise+0.065
byss+0.058
untled+0.054
istar+0.052
ratulations+0.052
Neg logit contributions
@#&-0.053
cause-0.045
Narr-0.044
SIGN-0.043
hetic-0.043
Token
Token ID119
Feature activation+7.86e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.000
ˈ-0.000
TokenOk
Token ID18690
Feature activation-3.18e-02

Pos logit contributions
@#&+0.060
cause+0.050
Narr+0.050
SIGN+0.048
hetic+0.048
Neg logit contributions
andise-0.071
byss-0.064
untled-0.059
ratulations-0.057
istar-0.057
Tokenak
Token ID461
Feature activation-1.07e-01

Pos logit contributions
andise+0.024
byss+0.021
untled+0.019
ratulations+0.018
istar+0.018
Neg logit contributions
@#&-0.029
cause-0.025
Narr-0.025
SIGN-0.024
hetic-0.024
Tokenopa
Token ID26186
Feature activation+3.47e-01

Pos logit contributions
andise+0.129
byss+0.120
untled+0.113
ratulations+0.110
istar+0.110
Neg logit contributions
@#&-0.048
cause-0.036
Narr-0.035
SIGN-0.033
hetic-0.032
Token 19
Token ID678
Feature activation-1.74e-02

Pos logit contributions
@#&+0.271
cause+0.229
Narr+0.228
SIGN+0.219
ˈ+0.219
Neg logit contributions
andise-0.307
byss-0.277
untled-0.256
ratulations-0.245
istar-0.244
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
andise+0.015
byss+0.014
untled+0.013
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
SIGN-0.011
hetic-0.011
Token 2017
Token ID2177
Feature activation+2.72e-02

Pos logit contributions
andise+0.010
byss+0.009
untled+0.009
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.007
SIGN-0.007
hetic-0.007
Token me
Token ID502
Feature activation+3.12e-02

Pos logit contributions
@#&+0.020
cause+0.017
Narr+0.017
SIGN+0.016
hetic+0.016
Neg logit contributions
andise-0.025
byss-0.023
untled-0.021
ratulations-0.020
istar-0.020
Token Ka
Token ID11611
Feature activation+1.62e-01

Pos logit contributions
@#&+0.023
cause+0.019
Narr+0.019
SIGN+0.019
ˈ+0.018
Neg logit contributions
andise-0.029
byss-0.026
untled-0.024
ratulations-0.023
istar-0.023
Token)
Token ID8
Feature activation+2.54e-02

Pos logit contributions
@#&+0.019
cause+0.016
Narr+0.016
SIGN+0.016
hetic+0.015
Neg logit contributions
andise-0.022
byss-0.020
untled-0.019
ratulations-0.018
istar-0.018
Token and
Token ID290
Feature activation+2.39e-02

Pos logit contributions
@#&+0.020
cause+0.017
Narr+0.017
SIGN+0.016
hetic+0.016
Neg logit contributions
andise-0.022
byss-0.020
untled-0.019
ratulations-0.018
istar-0.018
Token one
Token ID530
Feature activation+4.97e-02

Pos logit contributions
@#&+0.018
cause+0.015
Narr+0.015
SIGN+0.015
hetic+0.015
Neg logit contributions
andise-0.022
byss-0.019
untled-0.018
ratulations-0.017
istar-0.017
Token me
Token ID502
Feature activation+4.97e-02

Pos logit contributions
@#&+0.039
cause+0.033
Narr+0.033
SIGN+0.031
hetic+0.031
Neg logit contributions
andise-0.044
byss-0.040
untled-0.037
ratulations-0.035
istar-0.035
Tokenad
Token ID324
Feature activation+3.64e-02

Pos logit contributions
@#&+0.046
cause+0.040
Narr+0.040
SIGN+0.039
hetic+0.039
Neg logit contributions
andise-0.037
byss-0.032
untled-0.029
ratulations-0.028
istar-0.028
Tokenery
Token ID1924
Feature activation+3.43e-01

Pos logit contributions
@#&+0.028
cause+0.024
Narr+0.023
SIGN+0.023
hetic+0.023
Neg logit contributions
andise-0.033
byss-0.029
untled-0.027
ratulations-0.026
istar-0.026
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
@#&+0.270
cause+0.229
Narr+0.227
SIGN+0.218
hetic+0.218
Neg logit contributions
andise-0.302
byss-0.273
untled-0.251
ratulations-0.241
istar-0.240
Token Thomas
Token ID5658
Feature activation-2.22e-02

Pos logit contributions
@#&+0.011
cause+0.009
Narr+0.009
SIGN+0.009
ˈ+0.009
Neg logit contributions
andise-0.013
byss-0.012
untled-0.011
ratulations-0.010
istar-0.010
Token called
Token ID1444
Feature activation+2.33e-02

Pos logit contributions
andise+0.020
byss+0.018
untled+0.017
ratulations+0.016
istar+0.016
Neg logit contributions
@#&-0.017
cause-0.015
Narr-0.014
SIGN-0.014
hetic-0.014
Token it
Token ID340
Feature activation+3.82e-03

Pos logit contributions
@#&+0.017
cause+0.014
Narr+0.014
SIGN+0.014
hetic+0.014
Neg logit contributions
andise-0.022
byss-0.019
untled-0.018
ratulations-0.017
istar-0.017
Token the
Token ID262
Feature activation+8.93e-03

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.004
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token and
Token ID290
Feature activation-8.36e-03

Pos logit contributions
andise+0.027
byss+0.025
untled+0.023
ratulations+0.022
istar+0.022
Neg logit contributions
@#&-0.022
cause-0.018
Narr-0.018
SIGN-0.017
hetic-0.017
Token their
Token ID511
Feature activation-4.63e-03

Pos logit contributions
andise+0.008
byss+0.007
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token influence
Token ID4588
Feature activation+2.07e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token-
Token ID12
Feature activation-1.27e-02

Pos logit contributions
@#&+0.016
cause+0.013
Narr+0.013
SIGN+0.013
hetic+0.013
Neg logit contributions
andise-0.019
byss-0.017
untled-0.016
ratulations-0.015
istar-0.015
Tokenp
Token ID79
Feature activation+2.26e-02

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
SIGN-0.007
hetic-0.007
Tokenedd
Token ID6048
Feature activation+3.35e-01

Pos logit contributions
@#&+0.006
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.032
byss-0.030
untled-0.028
ratulations-0.028
istar-0.028
Tokenling
Token ID1359
Feature activation-1.66e-03

Pos logit contributions
@#&+0.252
cause+0.213
Narr+0.210
hetic+0.203
SIGN+0.202
Neg logit contributions
andise-0.305
byss-0.277
untled-0.255
ratulations-0.245
istar-0.245
Token hidden
Token ID7104
Feature activation-5.51e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
ˈ-0.001
Token,
Token ID11
Feature activation-1.71e-03

Pos logit contributions
andise+0.047
byss+0.042
untled+0.039
ratulations+0.038
istar+0.037
Neg logit contributions
@#&-0.044
cause-0.038
Narr-0.037
SIGN-0.036
hetic-0.036
Token is
Token ID318
Feature activation-3.40e-03

Pos logit contributions
andise+0.002
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token something
Token ID1223
Feature activation+4.65e-03

Pos logit contributions
andise+0.003
byss+0.003
untled+0.003
ratulations+0.003
istar+0.002
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
ˈ-0.002
hetic-0.002
Tokenjas
Token ID28121
Feature activation+1.70e-01

Pos logit contributions
andise+0.226
byss+0.211
untled+0.198
ratulations+0.192
istar+0.190
Neg logit contributions
@#&-0.093
cause-0.070
Narr-0.069
hetic-0.065
ˈ-0.065
Token Gil
Token ID10689
Feature activation-1.47e-01

Pos logit contributions
@#&+0.135
cause+0.114
Narr+0.113
hetic+0.110
SIGN+0.109
Neg logit contributions
andise-0.148
byss-0.133
untled-0.123
ratulations-0.118
istar-0.117
Token (
Token ID357
Feature activation-4.66e-03

Pos logit contributions
andise+0.130
byss+0.118
untled+0.108
ratulations+0.104
istar+0.103
Neg logit contributions
@#&-0.114
cause-0.097
Narr-0.096
hetic-0.093
SIGN-0.093
TokenM
Token ID44
Feature activation-6.41e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Tokenov
Token ID709
Feature activation+4.80e-03

Pos logit contributions
andise+0.058
byss+0.052
untled+0.049
istar+0.047
ratulations+0.046
Neg logit contributions
@#&-0.049
cause-0.041
Narr-0.041
SIGN-0.039
ˈ-0.039
Tokenistar
Token ID47229
Feature activation-4.76e-01

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
hetic+0.006
SIGN+0.006
Neg logit contributions
andise-0.001
byss-0.001
untled-0.000
ratulations-0.000
wegian-0.000
Token Team
Token ID4816
Feature activation-2.33e-02

Pos logit contributions
andise+0.422
byss+0.379
untled+0.351
istar+0.338
ratulations+0.335
Neg logit contributions
@#&-0.372
cause-0.315
Narr-0.313
SIGN-0.301
hetic-0.300
Token)
Token ID8
Feature activation-1.86e-03

Pos logit contributions
andise+0.021
byss+0.019
untled+0.018
ratulations+0.017
istar+0.017
Neg logit contributions
@#&-0.018
cause-0.015
Narr-0.015
SIGN-0.014
ˈ-0.014
Token s
Token ID264
Feature activation+7.23e-02

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token.
Token ID13
Feature activation-1.62e-02

Pos logit contributions
@#&+0.057
cause+0.049
Narr+0.048
SIGN+0.047
ˈ+0.046
Neg logit contributions
andise-0.063
byss-0.058
untled-0.053
ratulations-0.051
istar-0.050
Tokent
Token ID83
Feature activation-2.43e-03

Pos logit contributions
andise+0.015
byss+0.014
untled+0.013
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
SIGN-0.010
ˈ-0.010
TokenHow
Token ID2437
Feature activation+1.52e-02

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token do
Token ID466
Feature activation+5.76e-03

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
SIGN+0.009
hetic+0.009
Neg logit contributions
andise-0.014
byss-0.012
untled-0.011
ratulations-0.011
istar-0.011
Token you
Token ID345
Feature activation+9.54e-03

Pos logit contributions
@#&+0.004
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.005
byss-0.005
untled-0.004
ratulations-0.004
istar-0.004
Token maintain
Token ID5529
Feature activation+7.57e-02

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token a
Token ID257
Feature activation-3.22e-03

Pos logit contributions
@#&+0.059
cause+0.050
Narr+0.049
SIGN+0.047
hetic+0.047
Neg logit contributions
andise-0.068
byss-0.061
untled-0.056
ratulations-0.054
istar-0.054
Token semblance
Token ID45960
Feature activation-3.97e-01

Pos logit contributions
andise+0.003
byss+0.003
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token of
Token ID286
Feature activation+1.67e-02

Pos logit contributions
andise+0.355
byss+0.321
untled+0.296
ratulations+0.285
istar+0.283
Neg logit contributions
@#&-0.307
cause-0.260
Narr-0.257
SIGN-0.248
hetic-0.248
Token normal
Token ID3487
Feature activation-3.30e-02

Pos logit contributions
@#&+0.013
cause+0.011
Narr+0.011
SIGN+0.011
ˈ+0.011
Neg logit contributions
andise-0.015
byss-0.013
untled-0.012
ratulations-0.012
istar-0.012
Token?
Token ID30
Feature activation+1.01e-02

Pos logit contributions
andise+0.031
byss+0.028
untled+0.026
ratulations+0.025
istar+0.025
Neg logit contributions
@#&-0.024
cause-0.020
Narr-0.019
SIGN-0.019
ˈ-0.019
Token Given
Token ID11259
Feature activation+5.51e-03

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token the
Token ID262
Feature activation-9.53e-03

Pos logit contributions
@#&+0.004
cause+0.004
Narr+0.004
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token bunker
Token ID36892
Feature activation+8.50e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token oil
Token ID3056
Feature activation+5.48e-02

Pos logit contributions
@#&+0.066
cause+0.056
Narr+0.055
SIGN+0.053
hetic+0.053
Neg logit contributions
andise-0.076
byss-0.069
untled-0.063
ratulations-0.061
istar-0.061
Token,
Token ID11
Feature activation+8.50e-03

Pos logit contributions
@#&+0.043
cause+0.037
Narr+0.037
SIGN+0.035
hetic+0.035
Neg logit contributions
andise-0.048
byss-0.043
untled-0.040
ratulations-0.038
istar-0.038
Token was
Token ID373
Feature activation-1.65e-02

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.008
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token r
Token ID374
Feature activation+1.97e-02

Pos logit contributions
andise+0.015
byss+0.014
untled+0.012
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.012
cause-0.011
Narr-0.010
SIGN-0.010
hetic-0.010
Tokenammed
Token ID10573
Feature activation-3.81e-01

Pos logit contributions
@#&+0.025
cause+0.023
Narr+0.022
hetic+0.022
SIGN+0.022
Neg logit contributions
andise-0.008
byss-0.006
untled-0.005
ratulations-0.004
istar-0.004
Token by
Token ID416
Feature activation+1.32e-02

Pos logit contributions
andise+0.340
byss+0.306
untled+0.282
ratulations+0.271
istar+0.271
Neg logit contributions
@#&-0.296
cause-0.250
Narr-0.247
SIGN-0.238
ˈ-0.238
Token a
Token ID257
Feature activation+3.67e-03

Pos logit contributions
@#&+0.010
cause+0.009
Narr+0.008
SIGN+0.008
hetic+0.008
Neg logit contributions
andise-0.012
byss-0.011
untled-0.010
ratulations-0.009
istar-0.009
Token cargo
Token ID15892
Feature activation+7.80e-02

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token vessel
Token ID8837
Feature activation-7.87e-03

Pos logit contributions
@#&+0.059
cause+0.050
Narr+0.049
SIGN+0.047
hetic+0.047
Neg logit contributions
andise-0.071
byss-0.064
untled-0.060
ratulations-0.057
istar-0.057
Token in
Token ID287
Feature activation+1.39e-02

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token is
Token ID318
Feature activation+1.11e-02

Pos logit contributions
andise+0.064
byss+0.057
untled+0.053
ratulations+0.051
istar+0.051
Neg logit contributions
@#&-0.057
cause-0.048
Narr-0.047
SIGN-0.046
hetic-0.046
Token in
Token ID287
Feature activation+1.90e-03

Pos logit contributions
@#&+0.009
cause+0.007
Narr+0.007
SIGN+0.007
ˈ+0.007
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
Token session
Token ID6246
Feature activation-1.41e-01

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
hetic+0.001
ˈ+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.001
ratulations-0.001
istar-0.001
Token at
Token ID379
Feature activation+1.07e-02

Pos logit contributions
andise+0.125
byss+0.113
untled+0.104
ratulations+0.100
istar+0.099
Neg logit contributions
@#&-0.111
cause-0.094
Narr-0.093
SIGN-0.090
ˈ-0.089
Token Oak
Token ID9121
Feature activation-7.45e-02

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
Tokensterdam
Token ID22506
Feature activation-3.78e-01

Pos logit contributions
andise+0.060
byss+0.053
untled+0.049
ratulations+0.046
istar+0.046
Neg logit contributions
@#&-0.064
cause-0.056
Narr-0.055
ˈ-0.053
hetic-0.053
Token,
Token ID11
Feature activation-1.71e-04

Pos logit contributions
andise+0.327
byss+0.294
untled+0.270
ratulations+0.259
istar+0.258
Neg logit contributions
@#&-0.305
cause-0.259
Narr-0.257
SIGN-0.248
ˈ-0.248
Token a
Token ID257
Feature activation-2.12e-03

Pos logit contributions
andise+0.000
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
ˈ-0.000
Token how
Token ID703
Feature activation-3.89e-02

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.001
Neg logit contributions
@#&-0.002
cause-0.001
Narr-0.001
ˈ-0.001
hetic-0.001
Token-
Token ID12
Feature activation-7.48e-04

Pos logit contributions
andise+0.033
byss+0.030
untled+0.027
ratulations+0.026
istar+0.026
Neg logit contributions
@#&-0.032
cause-0.027
Narr-0.027
ˈ-0.026
hetic-0.026
Tokento
Token ID1462
Feature activation+2.44e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.000
Narr-0.000
SIGN-0.000
hetic-0.000
Token Joe
Token ID5689
Feature activation-3.82e-02

Pos logit contributions
andise+0.023
byss+0.021
untled+0.019
ratulations+0.018
istar+0.018
Neg logit contributions
@#&-0.019
cause-0.016
Narr-0.015
SIGN-0.015
hetic-0.015
Token N
Token ID399
Feature activation-1.71e-02

Pos logit contributions
andise+0.038
byss+0.035
untled+0.032
istar+0.031
ratulations+0.031
Neg logit contributions
@#&-0.026
cause-0.021
Narr-0.021
SIGN-0.020
ˈ-0.020
Tokenieu
Token ID22304
Feature activation+7.88e-02

Pos logit contributions
andise+0.017
byss+0.015
untled+0.014
ratulations+0.014
istar+0.014
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.009
hetic-0.009
ˈ-0.009
Tokenw
Token ID86
Feature activation-4.57e-02

Pos logit contributions
@#&+0.064
cause+0.055
Narr+0.054
ˈ+0.053
SIGN+0.053
Neg logit contributions
andise-0.067
byss-0.060
untled-0.055
ratulations-0.053
istar-0.053
Tokenend
Token ID437
Feature activation-6.42e-02

Pos logit contributions
andise+0.038
byss+0.034
untled+0.031
ratulations+0.030
istar+0.030
Neg logit contributions
@#&-0.038
cause-0.033
Narr-0.033
SIGN-0.031
hetic-0.031
Tokenyk
Token ID48361
Feature activation-3.68e-01

Pos logit contributions
andise+0.027
byss+0.021
untled+0.017
istar+0.015
ratulations+0.015
Neg logit contributions
@#&-0.080
cause-0.072
Narr-0.072
hetic-0.071
SIGN-0.070
Token to
Token ID284
Feature activation-1.61e-05

Pos logit contributions
andise+0.319
byss+0.287
untled+0.264
ratulations+0.253
istar+0.253
Neg logit contributions
@#&-0.295
cause-0.251
Narr-0.248
SIGN-0.240
hetic-0.239
Token the
Token ID262
Feature activation-6.81e-03

Pos logit contributions
andise+0.000
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
hetic-0.000
Token Devils
Token ID26199
Feature activation+1.95e-01

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.004
SIGN-0.003
hetic-0.003
Token in
Token ID287
Feature activation-1.62e-03

Pos logit contributions
@#&+0.154
cause+0.131
Narr+0.129
SIGN+0.125
hetic+0.124
Neg logit contributions
andise-0.172
byss-0.155
untled-0.143
ratulations-0.137
istar-0.136
Token exchange
Token ID5163
Feature activation+2.21e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token next
Token ID1306
Feature activation+3.85e-02

Pos logit contributions
@#&+0.023
cause+0.019
Narr+0.019
SIGN+0.018
hetic+0.018
Neg logit contributions
andise-0.027
byss-0.024
untled-0.022
ratulations-0.021
istar-0.021
Token to
Token ID284
Feature activation-5.61e-03

Pos logit contributions
@#&+0.031
cause+0.026
Narr+0.026
SIGN+0.025
hetic+0.025
Neg logit contributions
andise-0.033
byss-0.030
untled-0.028
ratulations-0.027
istar-0.026
Token us
Token ID514
Feature activation+4.28e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token (
Token ID357
Feature activation-5.18e-03

Pos logit contributions
@#&+0.034
cause+0.029
Narr+0.029
SIGN+0.028
hetic+0.028
Neg logit contributions
andise-0.037
byss-0.033
untled-0.031
ratulations-0.030
istar-0.029
TokenCall
Token ID14134
Feature activation-1.09e-02

Pos logit contributions
andise+0.005
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Tokenahan
Token ID19210
Feature activation-3.67e-01

Pos logit contributions
andise+0.010
byss+0.009
untled+0.008
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.007
SIGN-0.007
hetic-0.007
Token Run
Token ID5660
Feature activation+3.14e-02

Pos logit contributions
andise+0.312
byss+0.280
untled+0.257
ratulations+0.246
istar+0.245
Neg logit contributions
@#&-0.301
cause-0.256
Narr-0.254
SIGN-0.245
hetic-0.245
Token).
Token ID737
Feature activation+2.76e-02

Pos logit contributions
@#&+0.025
cause+0.021
Narr+0.021
SIGN+0.020
hetic+0.020
Neg logit contributions
andise-0.028
byss-0.025
untled-0.023
ratulations-0.022
istar-0.022
Token Temperature
Token ID34467
Feature activation-1.12e-01

Pos logit contributions
@#&+0.021
cause+0.017
Narr+0.017
SIGN+0.017
hetic+0.016
Neg logit contributions
andise-0.025
byss-0.023
untled-0.021
ratulations-0.021
istar-0.020
Token crept
Token ID47092
Feature activation-2.03e-02

Pos logit contributions
andise+0.105
byss+0.095
untled+0.088
ratulations+0.084
istar+0.084
Neg logit contributions
@#&-0.082
cause-0.069
Narr-0.068
SIGN-0.065
hetic-0.065
Token up
Token ID510
Feature activation-1.08e-02

Pos logit contributions
andise+0.019
byss+0.017
untled+0.016
ratulations+0.015
istar+0.015
Neg logit contributions
@#&-0.015
cause-0.013
Narr-0.013
SIGN-0.012
hetic-0.012
Token acids
Token ID17045
Feature activation+7.51e-02

Pos logit contributions
andise+0.015
byss+0.014
untled+0.013
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
hetic-0.007
ˈ-0.007
Token for
Token ID329
Feature activation-1.29e-03

Pos logit contributions
@#&+0.060
cause+0.051
Narr+0.051
SIGN+0.049
hetic+0.049
Neg logit contributions
andise-0.065
byss-0.059
untled-0.054
ratulations-0.052
istar-0.051
Token fuel
Token ID5252
Feature activation-9.41e-03

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token Inter
Token ID4225
Feature activation+5.20e-02

Pos logit contributions
andise+0.010
byss+0.009
untled+0.009
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.007
SIGN-0.007
ˈ-0.007
Tokenmitt
Token ID20124
Feature activation-3.64e-01

Pos logit contributions
@#&+0.063
cause+0.058
Narr+0.057
SIGN+0.056
ˈ+0.056
Neg logit contributions
andise-0.023
byss-0.019
untled-0.015
ratulations-0.014
istar-0.013
Tokenent
Token ID298
Feature activation-2.05e-02

Pos logit contributions
andise+0.325
byss+0.296
untled+0.270
ratulations+0.259
istar+0.258
Neg logit contributions
@#&-0.281
cause-0.240
Narr-0.235
hetic-0.230
ˈ-0.228
Token fasting
Token ID24845
Feature activation-1.07e-01

Pos logit contributions
andise+0.015
byss+0.013
untled+0.012
ratulations+0.011
istar+0.011
Neg logit contributions
@#&-0.019
cause-0.017
Narr-0.017
SIGN-0.016
hetic-0.016
Token and
Token ID290
Feature activation-3.95e-04

Pos logit contributions
andise+0.097
byss+0.088
untled+0.081
ratulations+0.078
istar+0.077
Neg logit contributions
@#&-0.082
cause-0.069
Narr-0.068
SIGN-0.066
hetic-0.066
Token spending
Token ID4581
Feature activation+2.20e-02

Pos logit contributions
andise+0.000
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
ˈ-0.000
Token more
Token ID517
Feature activation-1.93e-03

Pos logit contributions
@#&+0.017
cause+0.014
Narr+0.014
SIGN+0.014
ˈ+0.014
Neg logit contributions
andise-0.020
byss-0.018
untled-0.016
ratulations-0.016
istar-0.016
Token.
Token ID13
Feature activation+1.78e-03

Pos logit contributions
andise+0.094
byss+0.085
untled+0.078
ratulations+0.075
istar+0.074
Neg logit contributions
@#&-0.087
cause-0.074
Narr-0.074
SIGN-0.071
hetic-0.071
Token N
Token ID399
Feature activation-1.13e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
ˈ+0.001
Neg logit contributions
andise-0.002
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Tokenieu
Token ID22304
Feature activation+8.69e-02

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.008
cause-0.006
Narr-0.006
ˈ-0.006
SIGN-0.006
Tokenw
Token ID86
Feature activation-4.00e-02

Pos logit contributions
@#&+0.070
cause+0.059
Narr+0.059
SIGN+0.057
ˈ+0.057
Neg logit contributions
andise-0.075
byss-0.068
untled-0.062
ratulations-0.060
istar-0.060
Tokenend
Token ID437
Feature activation-5.63e-02

Pos logit contributions
andise+0.034
byss+0.030
untled+0.028
ratulations+0.026
istar+0.026
Neg logit contributions
@#&-0.033
cause-0.028
Narr-0.028
SIGN-0.027
hetic-0.027
Tokenyk
Token ID48361
Feature activation-3.59e-01

Pos logit contributions
andise+0.022
byss+0.017
untled+0.013
ratulations+0.012
istar+0.011
Neg logit contributions
@#&-0.072
cause-0.065
Narr-0.065
hetic-0.064
ˈ-0.064
Token is
Token ID318
Feature activation+2.35e-03

Pos logit contributions
andise+0.316
byss+0.283
untled+0.262
istar+0.251
ratulations+0.251
Neg logit contributions
@#&-0.282
cause-0.240
Narr-0.237
hetic-0.228
SIGN-0.228
Token currently
Token ID3058
Feature activation+6.06e-02

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token the
Token ID262
Feature activation+1.09e-03

Pos logit contributions
@#&+0.048
cause+0.041
Narr+0.040
SIGN+0.039
hetic+0.039
Neg logit contributions
andise-0.053
byss-0.048
untled-0.044
ratulations-0.042
istar-0.042
Token Stars
Token ID10271
Feature activation+1.07e-01

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token
Token ID447
Feature activation+1.04e-02

Pos logit contributions
@#&+0.084
cause+0.071
Narr+0.070
SIGN+0.068
hetic+0.068
Neg logit contributions
andise-0.095
byss-0.085
untled-0.079
ratulations-0.076
istar-0.075
Token
Token ID242
Feature activation+7.24e-02

Pos logit contributions
@#&+0.031
cause+0.025
Narr+0.025
SIGN+0.024
hetic+0.024
Neg logit contributions
andise-0.049
byss-0.045
untled-0.042
ratulations-0.040
istar-0.040
Token says
Token ID1139
Feature activation-2.37e-02

Pos logit contributions
@#&+0.053
cause+0.045
Narr+0.044
SIGN+0.043
hetic+0.042
Neg logit contributions
andise-0.068
byss-0.061
untled-0.057
ratulations-0.055
istar-0.054
Token Mary
Token ID5335
Feature activation+3.11e-02

Pos logit contributions
andise+0.022
byss+0.020
untled+0.018
ratulations+0.017
istar+0.017
Neg logit contributions
@#&-0.018
cause-0.015
Narr-0.015
SIGN-0.014
hetic-0.014
Token Bros
Token ID14266
Feature activation-1.43e-01

Pos logit contributions
@#&+0.024
cause+0.020
Narr+0.020
SIGN+0.019
hetic+0.019
Neg logit contributions
andise-0.028
byss-0.025
untled-0.023
ratulations-0.023
istar-0.022
Tokenn
Token ID77
Feature activation+3.56e-02

Pos logit contributions
andise+0.119
byss+0.106
untled+0.097
ratulations+0.093
istar+0.093
Neg logit contributions
@#&-0.120
cause-0.103
Narr-0.102
SIGN-0.099
hetic-0.098
Tokenahan
Token ID19210
Feature activation-3.59e-01

Pos logit contributions
@#&+0.043
cause+0.038
Narr+0.038
hetic+0.037
ˈ+0.037
Neg logit contributions
andise-0.017
byss-0.014
untled-0.012
ratulations-0.011
istar-0.010
Token,
Token ID11
Feature activation+1.80e-04

Pos logit contributions
andise+0.312
byss+0.281
untled+0.259
ratulations+0.248
istar+0.247
Neg logit contributions
@#&-0.286
cause-0.243
Narr-0.241
SIGN-0.232
hetic-0.232
Token the
Token ID262
Feature activation-2.81e-04

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
SIGN+0.000
ˈ+0.000
Neg logit contributions
andise-0.000
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Token executive
Token ID4640
Feature activation+4.47e-02

Pos logit contributions
andise+0.000
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
ˈ-0.000
Token director
Token ID3437
Feature activation+1.82e-02

Pos logit contributions
@#&+0.032
cause+0.027
Narr+0.027
hetic+0.026
SIGN+0.026
Neg logit contributions
andise-0.042
byss-0.038
untled-0.036
ratulations-0.034
istar-0.034
Token of
Token ID286
Feature activation-3.35e-03

Pos logit contributions
@#&+0.014
cause+0.012
Narr+0.012
SIGN+0.012
hetic+0.012
Neg logit contributions
andise-0.016
byss-0.014
untled-0.013
ratulations-0.013
istar-0.013
Token haven
Token ID4398
Feature activation+8.91e-03

Pos logit contributions
@#&+0.053
cause+0.045
Narr+0.044
SIGN+0.043
hetic+0.043
Neg logit contributions
andise-0.060
byss-0.054
untled-0.050
ratulations-0.048
istar-0.048
Token't
Token ID470
Feature activation+3.80e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.009
byss-0.008
untled-0.008
ratulations-0.007
istar-0.007
Token been
Token ID587
Feature activation+4.74e-04

Pos logit contributions
@#&+0.029
cause+0.024
Narr+0.024
SIGN+0.023
ˈ+0.023
Neg logit contributions
andise-0.034
byss-0.031
untled-0.029
ratulations-0.027
istar-0.027
Token par
Token ID1582
Feature activation+1.97e-01

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.000
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Tokenav
Token ID615
Feature activation+7.40e-03

Pos logit contributions
@#&+0.155
cause+0.131
Narr+0.130
SIGN+0.126
ˈ+0.125
Neg logit contributions
andise-0.173
byss-0.156
untled-0.144
ratulations-0.138
istar-0.138
Tokenirtual
Token ID22341
Feature activation-3.57e-01

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
hetic+0.004
SIGN+0.004
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.005
istar-0.005
Tokenized
Token ID1143
Feature activation+5.81e-02

Pos logit contributions
andise+0.340
byss+0.309
untled+0.287
istar+0.276
ratulations+0.276
Neg logit contributions
@#&-0.255
cause-0.212
Narr-0.210
SIGN-0.202
ˈ-0.201
Token yet
Token ID1865
Feature activation-5.38e-02

Pos logit contributions
@#&+0.046
cause+0.039
Narr+0.039
SIGN+0.038
hetic+0.037
Neg logit contributions
andise-0.051
byss-0.046
untled-0.042
ratulations-0.040
istar-0.040
Token.
Token ID13
Feature activation+7.97e-03

Pos logit contributions
andise+0.046
byss+0.042
untled+0.038
ratulations+0.037
istar+0.037
Neg logit contributions
@#&-0.043
cause-0.037
Narr-0.036
SIGN-0.035
hetic-0.035
Token I
Token ID314
Feature activation+1.54e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.006
istar-0.006
Token also
Token ID635
Feature activation+5.35e-03

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
SIGN+0.010
hetic+0.010
Neg logit contributions
andise-0.014
byss-0.012
untled-0.011
ratulations-0.011
istar-0.011
Token
Token ID447
Feature activation-1.91e-02

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token
Token ID251
Feature activation-4.19e-03

Pos logit contributions
andise+0.020
byss+0.018
untled+0.017
ratulations+0.016
istar+0.016
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
SIGN-0.009
ˈ-0.009
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
andise+0.014
byss+0.013
untled+0.012
ratulations+0.011
istar+0.011
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.010
ˈ-0.010
TokenHer
Token ID9360
Feature activation+4.90e-02

Pos logit contributions
andise+0.010
byss+0.009
untled+0.009
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.009
cause-0.007
Narr-0.007
SIGN-0.007
hetic-0.007
Tokenitage
Token ID10208
Feature activation-3.57e-01

Pos logit contributions
@#&+0.056
cause+0.051
Narr+0.050
hetic+0.049
SIGN+0.049
Neg logit contributions
andise-0.025
byss-0.021
untled-0.018
ratulations-0.016
istar-0.016
Token Foundation
Token ID5693
Feature activation-1.28e-02

Pos logit contributions
andise+0.314
byss+0.282
untled+0.261
ratulations+0.250
istar+0.249
Neg logit contributions
@#&-0.282
cause-0.238
Narr-0.236
SIGN-0.228
hetic-0.227
Token,
Token ID11
Feature activation+1.36e-03

Pos logit contributions
andise+0.011
byss+0.010
untled+0.010
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token an
Token ID281
Feature activation+1.08e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token educational
Token ID9856
Feature activation+6.39e-02

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
Token nonprofit
Token ID15346
Feature activation-5.57e-02

Pos logit contributions
@#&+0.048
cause+0.040
Narr+0.040
SIGN+0.038
hetic+0.038
Neg logit contributions
andise-0.059
byss-0.053
untled-0.049
ratulations-0.047
istar-0.047
Token Massachusetts
Token ID10140
Feature activation+2.07e-02

Pos logit contributions
andise+0.057
byss+0.051
untled+0.047
ratulations+0.045
istar+0.045
Neg logit contributions
@#&-0.049
cause-0.042
Narr-0.041
SIGN-0.040
hetic-0.040
Token pol
Token ID755
Feature activation-1.35e-01

Pos logit contributions
@#&+0.016
cause+0.013
Narr+0.013
hetic+0.013
ˈ+0.013
Neg logit contributions
andise-0.019
byss-0.017
untled-0.016
ratulations-0.015
istar-0.015
Token,
Token ID11
Feature activation-5.46e-03

Pos logit contributions
andise+0.121
byss+0.110
untled+0.101
ratulations+0.097
istar+0.096
Neg logit contributions
@#&-0.104
cause-0.088
Narr-0.087
SIGN-0.084
ˈ-0.084
Token House
Token ID2097
Feature activation-1.77e-02

Pos logit contributions
andise+0.005
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.003
SIGN-0.003
hetic-0.003
Token Speaker
Token ID14931
Feature activation-1.08e-01

Pos logit contributions
andise+0.016
byss+0.015
untled+0.013
ratulations+0.013
istar+0.013
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.011
hetic-0.011
Token Tip
Token ID23095
Feature activation-3.55e-01

Pos logit contributions
andise+0.094
byss+0.085
untled+0.078
ratulations+0.075
istar+0.074
Neg logit contributions
@#&-0.086
cause-0.073
Narr-0.072
SIGN-0.070
hetic-0.069
Token O
Token ID440
Feature activation+1.42e-02

Pos logit contributions
andise+0.320
byss+0.288
untled+0.266
ratulations+0.256
istar+0.255
Neg logit contributions
@#&-0.273
cause-0.230
Narr-0.228
SIGN-0.220
ˈ-0.219
Token'
Token ID6
Feature activation-1.00e-02

Pos logit contributions
@#&+0.012
Narr+0.010
cause+0.010
SIGN+0.009
hetic+0.009
Neg logit contributions
andise-0.012
byss-0.011
untled-0.010
ratulations-0.010
istar-0.010
TokenNeill
Token ID26538
Feature activation+4.30e-02

Pos logit contributions
andise+0.010
byss+0.009
untled+0.009
istar+0.008
ratulations+0.008
Neg logit contributions
@#&-0.007
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token,
Token ID11
Feature activation-4.67e-03

Pos logit contributions
@#&+0.035
cause+0.030
Narr+0.029
SIGN+0.028
hetic+0.028
Neg logit contributions
andise-0.037
byss-0.033
untled-0.031
ratulations-0.029
istar-0.029
Token could
Token ID714
Feature activation+2.33e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token Friday
Token ID3217
Feature activation+3.50e-02

Pos logit contributions
@#&+0.017
cause+0.014
Narr+0.014
SIGN+0.014
ˈ+0.014
Neg logit contributions
andise-0.019
byss-0.017
untled-0.016
ratulations-0.015
istar-0.015
Token,
Token ID11
Feature activation+7.99e-03

Pos logit contributions
@#&+0.028
cause+0.024
Narr+0.024
SIGN+0.023
hetic+0.023
Neg logit contributions
andise-0.030
byss-0.027
untled-0.025
ratulations-0.024
istar-0.024
Token including
Token ID1390
Feature activation-3.80e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.006
istar-0.006
Token Is
Token ID1148
Feature activation+2.19e-02

Pos logit contributions
andise+0.034
byss+0.030
untled+0.028
ratulations+0.027
istar+0.027
Neg logit contributions
@#&-0.030
cause-0.025
Narr-0.025
SIGN-0.024
hetic-0.024
Tokenf
Token ID69
Feature activation+7.40e-03

Pos logit contributions
@#&+0.017
cause+0.014
Narr+0.014
hetic+0.014
ˈ+0.014
Neg logit contributions
andise-0.019
byss-0.018
untled-0.016
ratulations-0.016
istar-0.015
Tokenahan
Token ID19210
Feature activation-3.55e-01

Pos logit contributions
@#&+0.009
cause+0.008
Narr+0.008
ˈ+0.008
hetic+0.008
Neg logit contributions
andise-0.003
byss-0.003
untled-0.002
ratulations-0.002
istar-0.002
Token,
Token ID11
Feature activation+5.79e-03

Pos logit contributions
andise+0.309
byss+0.278
untled+0.256
ratulations+0.246
istar+0.244
Neg logit contributions
@#&-0.283
cause-0.239
Narr-0.238
SIGN-0.229
ˈ-0.229
Token Tab
Token ID16904
Feature activation-6.33e-02

Pos logit contributions
@#&+0.004
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.005
byss-0.005
untled-0.004
ratulations-0.004
istar-0.004
Tokenriz
Token ID47847
Feature activation-8.41e-02

Pos logit contributions
andise+0.047
byss+0.042
untled+0.038
ratulations+0.036
istar+0.036
Neg logit contributions
@#&-0.058
cause-0.051
Narr-0.050
hetic-0.049
SIGN-0.049
Token,
Token ID11
Feature activation+3.19e-03

Pos logit contributions
andise+0.072
byss+0.064
untled+0.059
ratulations+0.056
istar+0.056
Neg logit contributions
@#&-0.069
cause-0.059
Narr-0.058
SIGN-0.056
hetic-0.056
Token Yaz
Token ID35856
Feature activation-1.36e-03

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.002
ratulations-0.002
istar-0.002
Token
Token ID247
Feature activation-7.10e-03

Pos logit contributions
andise+0.009
byss+0.008
untled+0.008
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
ˈ-0.005
Tokens
Token ID82
Feature activation-1.40e-02

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token his
Token ID465
Feature activation+3.28e-03

Pos logit contributions
andise+0.013
byss+0.012
untled+0.011
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.011
cause-0.009
Narr-0.009
SIGN-0.009
hetic-0.008
Token
Token ID564
Feature activation-7.53e-03

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.002
ratulations-0.002
istar-0.002
Token
Token ID250
Feature activation-7.03e-03

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Tokenstem
Token ID927
Feature activation-3.54e-01

Pos logit contributions
andise+0.007
byss+0.006
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token,
Token ID11
Feature activation+8.14e-03

Pos logit contributions
andise+0.303
byss+0.272
untled+0.249
ratulations+0.241
istar+0.238
Neg logit contributions
@#&-0.288
cause-0.246
Narr-0.243
hetic-0.235
SIGN-0.235
Token
Token ID447
Feature activation-1.94e-03

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token
Token ID251
Feature activation+7.04e-04

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token or
Token ID393
Feature activation-3.09e-02

Pos logit contributions
@#&+0.001
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token the
Token ID262
Feature activation-6.26e-03

Pos logit contributions
andise+0.028
byss+0.025
untled+0.023
ratulations+0.022
istar+0.022
Neg logit contributions
@#&-0.024
cause-0.020
Narr-0.020
SIGN-0.019
hetic-0.019
Token.
Token ID13
Feature activation-1.03e-03

Pos logit contributions
@#&+0.016
cause+0.014
Narr+0.013
SIGN+0.013
hetic+0.013
Neg logit contributions
andise-0.018
byss-0.016
untled-0.015
ratulations-0.014
istar-0.014
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token
Token ID198
Feature activation-6.90e-03

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
TokenThe
Token ID464
Feature activation-9.60e-03

Pos logit contributions
andise+0.006
byss+0.006
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token D
Token ID360
Feature activation+8.46e-02

Pos logit contributions
andise+0.008
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.008
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Tokenobbies
Token ID41372
Feature activation-3.49e-01

Pos logit contributions
@#&+0.064
cause+0.054
Narr+0.054
SIGN+0.052
hetic+0.051
Neg logit contributions
andise-0.077
byss-0.069
untled-0.064
ratulations-0.062
istar-0.062
Token Garden
Token ID11652
Feature activation-5.35e-02

Pos logit contributions
andise+0.306
byss+0.276
untled+0.254
ratulations+0.243
istar+0.242
Neg logit contributions
@#&-0.277
cause-0.235
Narr-0.232
SIGN-0.224
hetic-0.224
Token World
Token ID2159
Feature activation+1.16e-01

Pos logit contributions
andise+0.049
byss+0.044
untled+0.041
ratulations+0.039
istar+0.039
Neg logit contributions
@#&-0.041
cause-0.034
Narr-0.034
SIGN-0.033
hetic-0.033
Token,
Token ID11
Feature activation-4.60e-03

Pos logit contributions
@#&+0.090
cause+0.076
Narr+0.076
SIGN+0.073
hetic+0.073
Neg logit contributions
andise-0.103
byss-0.093
untled-0.086
ratulations-0.082
istar-0.082
Token at
Token ID379
Feature activation-3.99e-03

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token the
Token ID262
Feature activation-1.93e-03

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token,
Token ID11
Feature activation-2.78e-03

Pos logit contributions
@#&+0.020
cause+0.017
Narr+0.017
SIGN+0.016
hetic+0.016
Neg logit contributions
andise-0.022
byss-0.019
untled-0.018
ratulations-0.017
istar-0.017
Token as
Token ID355
Feature activation+7.28e-03

Pos logit contributions
andise+0.003
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token co
Token ID763
Feature activation+8.01e-03

Pos logit contributions
@#&+0.005
cause+0.005
Narr+0.005
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.005
istar-0.005
Token-
Token ID12
Feature activation-5.26e-03

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
hetic+0.005
SIGN+0.005
Neg logit contributions
andise-0.007
byss-0.006
untled-0.005
ratulations-0.005
istar-0.005
Tokenpres
Token ID18302
Feature activation-6.91e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.005
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
hetic-0.003
ˈ-0.003
Tokenidents
Token ID3231
Feature activation-3.47e-01

Pos logit contributions
andise+0.030
byss+0.024
untled+0.019
ratulations+0.018
istar+0.017
Neg logit contributions
@#&-0.085
cause-0.077
Narr-0.076
ˈ-0.075
hetic-0.075
Token and
Token ID290
Feature activation-4.78e-03

Pos logit contributions
andise+0.299
byss+0.269
untled+0.248
ratulations+0.237
istar+0.237
Neg logit contributions
@#&-0.279
cause-0.237
Narr-0.235
SIGN-0.227
hetic-0.226
Token co
Token ID763
Feature activation+1.08e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
hetic-0.003
SIGN-0.003
Token-
Token ID12
Feature activation-9.31e-03

Pos logit contributions
@#&+0.009
cause+0.008
Narr+0.007
SIGN+0.007
ˈ+0.007
Neg logit contributions
andise-0.009
byss-0.008
untled-0.008
ratulations-0.007
istar-0.007
TokenC
Token ID34
Feature activation-2.43e-02

Pos logit contributions
andise+0.009
byss+0.008
untled+0.008
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
hetic-0.005
SIGN-0.005
TokenOO
Token ID6684
Feature activation+6.81e-03

Pos logit contributions
andise+0.021
byss+0.019
untled+0.017
ratulations+0.017
istar+0.016
Neg logit contributions
@#&-0.020
cause-0.017
Narr-0.017
SIGN-0.016
hetic-0.016
Token
Token ID198
Feature activation+6.41e-03

Pos logit contributions
andise+0.081
byss+0.073
untled+0.067
ratulations+0.064
istar+0.064
Neg logit contributions
@#&-0.075
cause-0.064
Narr-0.063
SIGN-0.062
ˈ-0.061
Token
Token ID198
Feature activation+8.00e-03

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.006
byss-0.005
untled-0.005
ratulations-0.005
istar-0.005
TokenSource
Token ID7416
Feature activation-4.32e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
ˈ+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token:
Token ID25
Feature activation+5.98e-03

Pos logit contributions
andise+0.039
byss+0.035
untled+0.032
ratulations+0.031
istar+0.031
Neg logit contributions
@#&-0.033
cause-0.028
Narr-0.028
SIGN-0.027
hetic-0.027
Token Es
Token ID8678
Feature activation+1.42e-01

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.005
byss-0.005
untled-0.004
ratulations-0.004
istar-0.004
Tokenuter
Token ID11894
Feature activation-3.45e-01

Pos logit contributions
@#&+0.109
cause+0.092
Narr+0.091
hetic+0.088
SIGN+0.088
Neg logit contributions
andise-0.127
byss-0.115
untled-0.106
ratulations-0.102
istar-0.101
Tokenu
Token ID84
Feature activation+1.74e-02

Pos logit contributions
andise+0.309
byss+0.278
untled+0.257
ratulations+0.247
istar+0.246
Neg logit contributions
@#&-0.268
cause-0.226
Narr-0.224
SIGN-0.216
hetic-0.215
Token
Token ID198
Feature activation-2.52e-04

Pos logit contributions
@#&+0.014
cause+0.012
Narr+0.012
SIGN+0.011
hetic+0.011
Neg logit contributions
andise-0.015
byss-0.014
untled-0.013
ratulations-0.012
istar-0.012
Token
Token ID198
Feature activation+6.79e-03

Pos logit contributions
andise+0.000
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
ˈ-0.000
TokenLeave
Token ID35087
Feature activation+1.07e-02

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.006
byss-0.006
untled-0.006
ratulations-0.005
istar-0.005
Token a
Token ID257
Feature activation+9.46e-03

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
hetic+0.006
SIGN+0.006
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
Token unto
Token ID12722
Feature activation+1.86e-01

Pos logit contributions
@#&+0.010
cause+0.009
Narr+0.009
hetic+0.008
SIGN+0.008
Neg logit contributions
andise-0.012
byss-0.011
untled-0.010
ratulations-0.010
istar-0.010
Token thy
Token ID11906
Feature activation-1.13e-02

Pos logit contributions
@#&+0.140
cause+0.117
Narr+0.116
SIGN+0.112
hetic+0.112
Neg logit contributions
andise-0.171
byss-0.155
untled-0.143
ratulations-0.137
istar-0.137
Tokenself
Token ID944
Feature activation-1.49e-01

Pos logit contributions
andise+0.010
byss+0.009
untled+0.008
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.007
SIGN-0.007
hetic-0.007
Token.
Token ID13
Feature activation-3.76e-03

Pos logit contributions
andise+0.133
byss+0.120
untled+0.111
ratulations+0.107
istar+0.106
Neg logit contributions
@#&-0.115
cause-0.098
Narr-0.097
SIGN-0.093
hetic-0.093
Token De
Token ID1024
Feature activation-5.43e-02

Pos logit contributions
andise+0.003
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Tokenuter
Token ID11894
Feature activation-3.45e-01

Pos logit contributions
andise+0.037
byss+0.032
untled+0.029
ratulations+0.028
istar+0.027
Neg logit contributions
@#&-0.053
cause-0.047
Narr-0.047
ˈ-0.045
SIGN-0.045
Tokenonomy
Token ID30565
Feature activation+1.05e-01

Pos logit contributions
andise+0.345
byss+0.314
untled+0.294
istar+0.283
ratulations+0.282
Neg logit contributions
@#&-0.230
cause-0.188
Narr-0.185
SIGN-0.179
ˈ-0.176
Token 20
Token ID1160
Feature activation-1.78e-02

Pos logit contributions
@#&+0.085
cause+0.072
Narr+0.071
SIGN+0.069
ˈ+0.069
Neg logit contributions
andise-0.091
byss-0.082
untled-0.075
ratulations-0.072
istar-0.072
Token:
Token ID25
Feature activation+3.48e-03

Pos logit contributions
andise+0.016
byss+0.014
untled+0.013
ratulations+0.013
istar+0.013
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
SIGN-0.011
hetic-0.011
Token 12
Token ID1105
Feature activation-1.31e-02

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
ˈ+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.002
istar-0.002
Token.
Token ID13
Feature activation-2.37e-03

Pos logit contributions
andise+0.012
byss+0.010
untled+0.010
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.009
Narr-0.009
SIGN-0.008
hetic-0.008
Token neutral
Token ID8500
Feature activation+4.73e-02

Pos logit contributions
@#&+0.021
cause+0.017
Narr+0.017
hetic+0.016
SIGN+0.016
Neg logit contributions
andise-0.025
byss-0.022
untled-0.021
ratulations-0.020
istar-0.020
Token."
Token ID526
Feature activation-6.82e-02

Pos logit contributions
@#&+0.038
cause+0.032
Narr+0.032
SIGN+0.031
hetic+0.031
Neg logit contributions
andise-0.041
byss-0.037
untled-0.034
ratulations-0.033
istar-0.032
Token
Token ID198
Feature activation-2.27e-03

Pos logit contributions
andise+0.058
byss+0.052
untled+0.048
ratulations+0.046
istar+0.046
Neg logit contributions
@#&-0.056
cause-0.048
Narr-0.047
SIGN-0.046
ˈ-0.046
Token
Token ID198
Feature activation-1.35e-03

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.001
hetic-0.001
TokenD
Token ID35
Feature activation+6.56e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Tokenobbies
Token ID41372
Feature activation-3.44e-01

Pos logit contributions
@#&+0.086
cause+0.078
Narr+0.078
SIGN+0.076
hetic+0.076
Neg logit contributions
andise-0.024
byss-0.018
untled-0.014
ratulations-0.012
istar-0.012
Token indoor
Token ID22639
Feature activation-1.14e-01

Pos logit contributions
andise+0.301
byss+0.269
untled+0.249
ratulations+0.239
istar+0.238
Neg logit contributions
@#&-0.274
cause-0.232
Narr-0.230
SIGN-0.222
hetic-0.221
Token plant
Token ID4618
Feature activation+6.19e-02

Pos logit contributions
andise+0.109
byss+0.099
untled+0.092
ratulations+0.089
istar+0.088
Neg logit contributions
@#&-0.081
cause-0.067
Narr-0.067
SIGN-0.064
hetic-0.064
Token advisor
Token ID20685
Feature activation-1.12e-02

Pos logit contributions
@#&+0.047
cause+0.040
Narr+0.039
SIGN+0.038
hetic+0.038
Neg logit contributions
andise-0.056
byss-0.051
untled-0.047
ratulations-0.045
istar-0.045
Token Jimmy
Token ID12963
Feature activation+7.20e-03

Pos logit contributions
andise+0.010
byss+0.009
untled+0.009
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.009
cause-0.007
Narr-0.007
SIGN-0.007
hetic-0.007
Token Mc
Token ID1982
Feature activation-8.41e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.006
byss-0.006
untled-0.005
ratulations-0.005
istar-0.005
Token be
Token ID307
Feature activation+9.76e-03

Pos logit contributions
@#&+0.019
cause+0.016
Narr+0.016
SIGN+0.015
hetic+0.015
Neg logit contributions
andise-0.024
byss-0.022
untled-0.020
ratulations-0.020
istar-0.020
Token.
Token ID13
Feature activation+4.39e-03

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.008
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token
Token ID198
Feature activation+3.66e-03

Pos logit contributions
@#&+0.003
cause+0.003
Narr+0.003
SIGN+0.003
ˈ+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.003
ratulations-0.003
istar-0.003
Token
Token ID198
Feature activation-2.60e-03

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
TokenMal
Token ID15029
Feature activation+1.20e-01

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
ˈ-0.002
Tokencolm
Token ID18414
Feature activation-3.39e-01

Pos logit contributions
@#&+0.150
cause+0.136
Narr+0.135
hetic+0.132
SIGN+0.132
Neg logit contributions
andise-0.050
byss-0.039
untled-0.031
ratulations-0.028
istar-0.027
Token Grim
Token ID17525
Feature activation-1.27e-01

Pos logit contributions
andise+0.299
byss+0.269
untled+0.248
ratulations+0.238
istar+0.238
Neg logit contributions
@#&-0.267
cause-0.226
Narr-0.224
SIGN-0.216
ˈ-0.215
Tokenston
Token ID3743
Feature activation+6.52e-02

Pos logit contributions
andise+0.112
byss+0.100
untled+0.093
ratulations+0.089
istar+0.089
Neg logit contributions
@#&-0.100
cause-0.084
Narr-0.084
SIGN-0.081
hetic-0.080
Token,
Token ID11
Feature activation+5.25e-03

Pos logit contributions
@#&+0.052
cause+0.044
Narr+0.044
SIGN+0.043
hetic+0.042
Neg logit contributions
andise-0.057
byss-0.051
untled-0.047
ratulations-0.045
istar-0.045
Token a
Token ID257
Feature activation+6.22e-03

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token nuclear
Token ID4523
Feature activation-6.68e-02

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.006
byss-0.005
untled-0.005
ratulations-0.005
istar-0.004
Token for
Token ID329
Feature activation+1.05e-02

Pos logit contributions
andise+0.014
byss+0.013
untled+0.012
istar+0.011
ratulations+0.011
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.011
ˈ-0.011
Token the
Token ID262
Feature activation+9.28e-03

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.006
ˈ+0.006
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
Token remaining
Token ID5637
Feature activation+9.27e-02

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
ˈ+0.006
Neg logit contributions
andise-0.008
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token 12
Token ID1105
Feature activation-1.43e-02

Pos logit contributions
@#&+0.055
cause+0.045
Narr+0.044
SIGN+0.042
ˈ+0.042
Neg logit contributions
andise-0.099
byss-0.091
untled-0.085
ratulations-0.082
istar-0.082
Token individual
Token ID1981
Feature activation+4.22e-02

Pos logit contributions
andise+0.014
byss+0.013
untled+0.012
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
SIGN-0.007
hetic-0.007
Token seats
Token ID8632
Feature activation+2.25e-01

Pos logit contributions
@#&+0.024
cause+0.019
Narr+0.019
SIGN+0.018
hetic+0.018
Neg logit contributions
andise-0.046
byss-0.042
untled-0.040
ratulations-0.039
istar-0.038
Token which
Token ID543
Feature activation-8.97e-03

Pos logit contributions
@#&+0.178
cause+0.151
Narr+0.150
SIGN+0.145
hetic+0.144
Neg logit contributions
andise-0.197
byss-0.177
untled-0.163
ratulations-0.156
istar-0.156
Token represent
Token ID2380
Feature activation-5.16e-02

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token different
Token ID1180
Feature activation-9.66e-03

Pos logit contributions
andise+0.047
byss+0.043
untled+0.040
ratulations+0.038
istar+0.038
Neg logit contributions
@#&-0.039
cause-0.032
Narr-0.032
SIGN-0.031
hetic-0.031
Token segments
Token ID17894
Feature activation+6.09e-03

Pos logit contributions
andise+0.009
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token of
Token ID286
Feature activation+8.14e-03

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.005
byss-0.005
untled-0.004
ratulations-0.004
istar-0.004
Token digging
Token ID18894
Feature activation-6.41e-02

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.005
hetic-0.005
Token into
Token ID656
Feature activation-3.86e-03

Pos logit contributions
andise+0.059
byss+0.053
untled+0.049
ratulations+0.047
istar+0.047
Neg logit contributions
@#&-0.048
cause-0.041
Narr-0.040
SIGN-0.039
hetic-0.038
Token your
Token ID534
Feature activation-1.88e-02

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token bag
Token ID6131
Feature activation-8.90e-02

Pos logit contributions
andise+0.017
byss+0.015
untled+0.014
ratulations+0.014
istar+0.014
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
SIGN-0.012
ˈ-0.012
Token of
Token ID286
Feature activation-4.30e-03

Pos logit contributions
andise+0.078
byss+0.070
untled+0.065
ratulations+0.062
istar+0.062
Neg logit contributions
@#&-0.070
cause-0.060
Narr-0.059
hetic-0.057
SIGN-0.057
Token tricks
Token ID15910
Feature activation+2.36e-01

Pos logit contributions
andise+0.005
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
ˈ-0.002
Token to
Token ID284
Feature activation-7.92e-03

Pos logit contributions
@#&+0.187
cause+0.158
Narr+0.157
SIGN+0.151
hetic+0.151
Neg logit contributions
andise-0.208
byss-0.187
untled-0.173
ratulations-0.166
istar-0.165
Token make
Token ID787
Feature activation-1.65e-02

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token X
Token ID1395
Feature activation+8.14e-02

Pos logit contributions
andise+0.015
byss+0.014
untled+0.013
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
SIGN-0.010
hetic-0.010
Token-
Token ID12
Feature activation-8.89e-03

Pos logit contributions
@#&+0.068
cause+0.058
Narr+0.058
SIGN+0.056
hetic+0.056
Neg logit contributions
andise-0.068
byss-0.060
untled-0.055
istar-0.053
ratulations-0.053
Token0
Token ID15
Feature activation-2.96e-02

Pos logit contributions
andise+0.009
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
ˈ-0.005
hetic-0.005
Token as
Token ID355
Feature activation+1.18e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token
Token ID564
Feature activation+1.44e-02

Pos logit contributions
@#&+0.009
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.011
byss-0.010
untled-0.009
ratulations-0.009
istar-0.009
Token
Token ID250
Feature activation+3.46e-03

Pos logit contributions
@#&+0.010
cause+0.008
Narr+0.008
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.014
byss-0.013
untled-0.012
ratulations-0.012
istar-0.012
Tokentop
Token ID4852
Feature activation+1.06e-04

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token-
Token ID12
Feature activation+1.28e-02

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.000
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Tokenhat
Token ID5183
Feature activation+2.47e-01

Pos logit contributions
@#&+0.009
cause+0.008
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.012
byss-0.011
untled-0.010
ratulations-0.010
istar-0.010
Token Eddie
Token ID19478
Feature activation-2.02e-02

Pos logit contributions
@#&+0.192
cause+0.163
Narr+0.161
SIGN+0.155
hetic+0.155
Neg logit contributions
andise-0.220
byss-0.198
untled-0.183
ratulations-0.176
istar-0.175
Token
Token ID447
Feature activation+8.53e-03

Pos logit contributions
andise+0.018
byss+0.016
untled+0.015
ratulations+0.014
istar+0.014
Neg logit contributions
@#&-0.016
cause-0.014
Narr-0.013
SIGN-0.013
hetic-0.013
Token
Token ID251
Feature activation+1.76e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.008
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token because
Token ID780
Feature activation+6.87e-02

Pos logit contributions
@#&+0.013
cause+0.011
Narr+0.011
SIGN+0.011
ˈ+0.011
Neg logit contributions
andise-0.016
byss-0.015
untled-0.013
ratulations-0.013
istar-0.013
Token of
Token ID286
Feature activation+7.71e-03

Pos logit contributions
@#&+0.053
cause+0.044
Narr+0.044
SIGN+0.042
hetic+0.042
Neg logit contributions
andise-0.062
byss-0.056
untled-0.052
ratulations-0.050
istar-0.049
Token systems
Token ID3341
Feature activation+4.22e-02

Pos logit contributions
andise+0.044
byss+0.040
untled+0.036
ratulations+0.035
istar+0.035
Neg logit contributions
@#&-0.043
cause-0.037
Narr-0.036
SIGN-0.035
hetic-0.035
Token.
Token ID13
Feature activation+4.46e-03

Pos logit contributions
@#&+0.033
cause+0.028
Narr+0.028
hetic+0.027
SIGN+0.027
Neg logit contributions
andise-0.038
byss-0.034
untled-0.031
ratulations-0.030
istar-0.030
Token While
Token ID2893
Feature activation+2.81e-02

Pos logit contributions
@#&+0.003
cause+0.003
Narr+0.003
SIGN+0.003
ˈ+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.003
ratulations-0.003
istar-0.003
Token these
Token ID777
Feature activation-5.40e-02

Pos logit contributions
@#&+0.022
cause+0.018
Narr+0.018
SIGN+0.017
hetic+0.017
Neg logit contributions
andise-0.025
byss-0.023
untled-0.021
ratulations-0.020
istar-0.020
Token products
Token ID3186
Feature activation+3.92e-02

Pos logit contributions
andise+0.045
byss+0.040
untled+0.037
ratulations+0.035
istar+0.035
Neg logit contributions
@#&-0.045
cause-0.039
Narr-0.038
SIGN-0.037
hetic-0.037
Token appeal
Token ID5198
Feature activation+2.31e-01

Pos logit contributions
@#&+0.030
cause+0.025
Narr+0.025
SIGN+0.024
hetic+0.024
Neg logit contributions
andise-0.035
byss-0.032
untled-0.029
ratulations-0.028
istar-0.028
Token especially
Token ID2592
Feature activation+2.16e-02

Pos logit contributions
@#&+0.178
cause+0.150
Narr+0.149
SIGN+0.144
hetic+0.143
Neg logit contributions
andise-0.208
byss-0.187
untled-0.173
ratulations-0.167
istar-0.166
Token to
Token ID284
Feature activation+6.77e-04

Pos logit contributions
@#&+0.017
cause+0.014
Narr+0.014
SIGN+0.014
hetic+0.014
Neg logit contributions
andise-0.019
byss-0.017
untled-0.016
ratulations-0.015
istar-0.015
Token academic
Token ID8233
Feature activation-2.81e-02

Pos logit contributions
@#&+0.001
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.001
byss-0.001
untled-0.000
ratulations-0.000
istar-0.000
Token libraries
Token ID12782
Feature activation-4.22e-02

Pos logit contributions
andise+0.024
byss+0.022
untled+0.020
ratulations+0.019
istar+0.019
Neg logit contributions
@#&-0.023
cause-0.019
Narr-0.019
SIGN-0.019
hetic-0.019
Token,
Token ID11
Feature activation+8.63e-03

Pos logit contributions
andise+0.037
byss+0.033
untled+0.030
ratulations+0.029
istar+0.029
Neg logit contributions
@#&-0.034
cause-0.029
Narr-0.028
SIGN-0.027
hetic-0.027
Token found
Token ID1043
Feature activation+3.93e-02

Pos logit contributions
andise+0.062
byss+0.056
untled+0.052
ratulations+0.050
istar+0.050
Neg logit contributions
@#&-0.055
cause-0.046
Narr-0.046
SIGN-0.044
hetic-0.044
Token guilty
Token ID6717
Feature activation-1.71e-02

Pos logit contributions
@#&+0.032
cause+0.027
Narr+0.027
SIGN+0.026
hetic+0.026
Neg logit contributions
andise-0.033
byss-0.030
untled-0.027
ratulations-0.026
istar-0.026
Token of
Token ID286
Feature activation+5.53e-03

Pos logit contributions
andise+0.015
byss+0.013
untled+0.012
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
SIGN-0.011
ˈ-0.011
Token price
Token ID2756
Feature activation-1.99e-02

Pos logit contributions
@#&+0.004
cause+0.004
Narr+0.004
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token-
Token ID12
Feature activation-1.24e-02

Pos logit contributions
andise+0.017
byss+0.016
untled+0.015
ratulations+0.014
istar+0.014
Neg logit contributions
@#&-0.016
cause-0.013
Narr-0.013
hetic-0.013
SIGN-0.013
Tokenfix
Token ID13049
Feature activation+2.32e-01

Pos logit contributions
andise+0.015
byss+0.014
untled+0.013
ratulations+0.013
istar+0.012
Neg logit contributions
@#&-0.006
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Tokening
Token ID278
Feature activation+4.91e-02

Pos logit contributions
@#&+0.150
cause+0.123
Narr+0.121
SIGN+0.116
hetic+0.115
Neg logit contributions
andise-0.236
byss-0.216
untled-0.202
ratulations-0.195
istar-0.194
Token up
Token ID510
Feature activation-8.90e-03

Pos logit contributions
@#&+0.039
cause+0.033
Narr+0.033
SIGN+0.032
hetic+0.032
Neg logit contributions
andise-0.043
byss-0.038
untled-0.036
ratulations-0.034
istar-0.034
Token to
Token ID284
Feature activation-1.39e-02

Pos logit contributions
andise+0.008
byss+0.007
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token 10
Token ID838
Feature activation-6.39e-03

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.011
cause-0.010
Narr-0.010
SIGN-0.009
ˈ-0.009
Token%
Token ID4
Feature activation-1.33e-02

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token big
Token ID1263
Feature activation-7.61e-03

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.001
ratulations-0.001
istar-0.001
Token enough
Token ID1576
Feature activation+1.06e-01

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.006
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
ˈ-0.005
Token that
Token ID326
Feature activation+6.26e-03

Pos logit contributions
@#&+0.084
cause+0.071
Narr+0.071
SIGN+0.068
hetic+0.068
Neg logit contributions
andise-0.093
byss-0.084
untled-0.077
ratulations-0.074
istar-0.074
Token you
Token ID345
Feature activation+7.83e-03

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.006
byss-0.005
untled-0.005
ratulations-0.005
istar-0.005
Token could
Token ID714
Feature activation+3.76e-02

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
ˈ+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token see
Token ID766
Feature activation+1.70e-02

Pos logit contributions
@#&+0.028
cause+0.024
Narr+0.024
SIGN+0.023
hetic+0.022
Neg logit contributions
andise-0.035
byss-0.031
untled-0.029
ratulations-0.028
istar-0.028
Token his
Token ID465
Feature activation+2.60e-02

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
SIGN+0.010
hetic+0.010
Neg logit contributions
andise-0.016
byss-0.014
untled-0.013
ratulations-0.013
istar-0.013
Token outline
Token ID19001
Feature activation-1.80e-02

Pos logit contributions
@#&+0.020
cause+0.017
Narr+0.016
SIGN+0.016
hetic+0.016
Neg logit contributions
andise-0.024
byss-0.021
untled-0.020
ratulations-0.019
istar-0.019
Token on
Token ID319
Feature activation+2.96e-03

Pos logit contributions
andise+0.016
byss+0.014
untled+0.013
ratulations+0.013
istar+0.013
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
SIGN-0.011
hetic-0.011
Token the
Token ID262
Feature activation+2.79e-03

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token horizon
Token ID17810
Feature activation-7.86e-02

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.002
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token not
Token ID407
Feature activation-1.32e-02

Pos logit contributions
andise+0.007
byss+0.007
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token part
Token ID636
Feature activation+1.85e-02

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.010
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.009
Narr-0.008
SIGN-0.008
ˈ-0.008
Token of
Token ID286
Feature activation+8.12e-03

Pos logit contributions
@#&+0.014
cause+0.012
Narr+0.012
SIGN+0.011
hetic+0.011
Neg logit contributions
andise-0.017
byss-0.015
untled-0.014
ratulations-0.014
istar-0.013
Token that
Token ID326
Feature activation-2.74e-03

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.007
untled-0.006
ratulations-0.006
istar-0.006
Token union
Token ID6441
Feature activation-1.18e-02

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token,
Token ID11
Feature activation+1.37e-03

Pos logit contributions
andise+0.010
byss+0.009
untled+0.009
ratulations+0.008
istar+0.008
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
SIGN-0.007
hetic-0.007
Token that
Token ID326
Feature activation-1.21e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token is
Token ID318
Feature activation-5.44e-03

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token how
Token ID703
Feature activation-4.02e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token business
Token ID1597
Feature activation-2.66e-02

Pos logit contributions
andise+0.037
byss+0.033
untled+0.031
ratulations+0.030
istar+0.029
Neg logit contributions
@#&-0.030
cause-0.026
Narr-0.025
SIGN-0.024
hetic-0.024
Token operates
Token ID14051
Feature activation+4.86e-02

Pos logit contributions
andise+0.022
byss+0.019
untled+0.018
ratulations+0.017
istar+0.017
Neg logit contributions
@#&-0.023
cause-0.020
Narr-0.019
SIGN-0.019
ˈ-0.019
Token not
Token ID407
Feature activation-8.34e-03

Pos logit contributions
andise+0.031
byss+0.028
untled+0.026
ratulations+0.025
istar+0.025
Neg logit contributions
@#&-0.025
cause-0.021
Narr-0.021
SIGN-0.020
hetic-0.020
Token quite
Token ID2407
Feature activation-5.10e-03

Pos logit contributions
andise+0.007
byss+0.007
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token sure
Token ID1654
Feature activation-3.90e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
hetic-0.003
SIGN-0.003
Token why
Token ID1521
Feature activation-1.32e-02

Pos logit contributions
andise+0.035
byss+0.032
untled+0.029
ratulations+0.028
istar+0.028
Neg logit contributions
@#&-0.030
cause-0.025
Narr-0.025
SIGN-0.024
hetic-0.024
Token this
Token ID428
Feature activation+1.92e-03

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.009
Narr-0.009
SIGN-0.008
hetic-0.008
Token is
Token ID318
Feature activation-7.23e-03

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.001
ratulations-0.001
istar-0.001
Token necessary
Token ID3306
Feature activation+1.44e-03

Pos logit contributions
andise+0.006
byss+0.006
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
ˈ-0.005
Token.
Token ID13
Feature activation-5.62e-04

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
ˈ+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token Probably
Token ID18578
Feature activation-1.26e-01

Pos logit contributions
andise+0.001
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
hetic-0.000
Token so
Token ID523
Feature activation-4.27e-02

Pos logit contributions
andise+0.119
byss+0.108
untled+0.100
ratulations+0.096
istar+0.096
Neg logit contributions
@#&-0.092
cause-0.077
Narr-0.076
SIGN-0.073
hetic-0.073
Token I
Token ID314
Feature activation+5.36e-03

Pos logit contributions
andise+0.038
byss+0.035
untled+0.032
ratulations+0.031
istar+0.031
Neg logit contributions
@#&-0.033
cause-0.028
Narr-0.028
SIGN-0.027
hetic-0.026
Token
Token ID251
Feature activation+6.39e-03

Pos logit contributions
andise+0.007
byss+0.007
untled+0.006
istar+0.006
ratulations+0.006
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token that
Token ID326
Feature activation-1.73e-02

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.006
byss-0.005
untled-0.005
ratulations-0.005
istar-0.005
Token he
Token ID339
Feature activation-9.89e-03

Pos logit contributions
andise+0.016
byss+0.014
untled+0.013
ratulations+0.013
istar+0.013
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.011
hetic-0.011
Token
Token ID447
Feature activation-7.14e-03

Pos logit contributions
andise+0.009
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.006
SIGN-0.006
ˈ-0.006
Token
Token ID247
Feature activation-3.05e-03

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
hetic-0.004
SIGN-0.004
Tokens
Token ID82
Feature activation-4.90e-03

Pos logit contributions
andise+0.003
byss+0.003
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
ˈ-0.002
SIGN-0.002
Token become
Token ID1716
Feature activation+1.64e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token increasingly
Token ID6481
Feature activation-3.11e-02

Pos logit contributions
@#&+0.013
cause+0.011
Narr+0.011
SIGN+0.011
ˈ+0.011
Neg logit contributions
andise-0.014
byss-0.013
untled-0.012
ratulations-0.011
istar-0.011
Token worried
Token ID7960
Feature activation-2.77e-02

Pos logit contributions
andise+0.025
byss+0.022
untled+0.020
ratulations+0.020
istar+0.019
Neg logit contributions
@#&-0.027
cause-0.023
Narr-0.023
SIGN-0.022
ˈ-0.022
Token about
Token ID546
Feature activation-4.28e-03

Pos logit contributions
andise+0.025
byss+0.022
untled+0.021
ratulations+0.020
istar+0.020
Neg logit contributions
@#&-0.021
cause-0.018
Narr-0.018
SIGN-0.017
hetic-0.017
Token in
Token ID287
Feature activation+2.88e-03

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token,
Token ID11
Feature activation-3.79e-03

Pos logit contributions
@#&+0.040
cause+0.034
Narr+0.033
SIGN+0.032
hetic+0.032
Neg logit contributions
andise-0.044
byss-0.040
untled-0.037
ratulations-0.035
istar-0.035
Token making
Token ID1642
Feature activation-3.24e-02

Pos logit contributions
andise+0.003
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.002
SIGN-0.002
hetic-0.002
Token it
Token ID340
Feature activation-1.57e-02

Pos logit contributions
andise+0.029
byss+0.027
untled+0.025
ratulations+0.024
istar+0.024
Neg logit contributions
@#&-0.025
cause-0.021
Narr-0.021
SIGN-0.020
hetic-0.020
Token illegal
Token ID5293
Feature activation-2.02e-01

Pos logit contributions
andise+0.014
byss+0.012
untled+0.011
ratulations+0.011
istar+0.011
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.010
ˈ-0.010
Token for
Token ID329
Feature activation+9.88e-03

Pos logit contributions
andise+0.178
byss+0.160
untled+0.147
ratulations+0.141
istar+0.141
Neg logit contributions
@#&-0.160
cause-0.135
Narr-0.134
SIGN-0.129
hetic-0.129
Token them
Token ID606
Feature activation-2.56e-03

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.009
byss-0.008
untled-0.008
ratulations-0.007
istar-0.007
Token to
Token ID284
Feature activation-6.63e-03

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token bring
Token ID2222
Feature activation-5.07e-03

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token the
Token ID262
Feature activation-2.24e-03

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.003
SIGN-0.003
hetic-0.003
Token weapon
Token ID4282
Feature activation-3.17e-02

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token into
Token ID656
Feature activation-8.77e-03

Pos logit contributions
andise+0.028
byss+0.025
untled+0.024
ratulations+0.023
istar+0.023
Neg logit contributions
@#&-0.025
cause-0.021
Narr-0.021
SIGN-0.020
hetic-0.020
Token be
Token ID307
Feature activation+4.24e-03

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.004
hetic-0.004
Token prepared
Token ID5597
Feature activation+8.27e-02

Pos logit contributions
@#&+0.003
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.004
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token to
Token ID284
Feature activation-9.02e-03

Pos logit contributions
@#&+0.066
cause+0.056
Narr+0.055
SIGN+0.053
hetic+0.053
Neg logit contributions
andise-0.072
byss-0.065
untled-0.060
ratulations-0.057
istar-0.057
Token strike
Token ID5587
Feature activation+1.41e-03

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token military
Token ID2422
Feature activation-3.27e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.001
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token targets
Token ID6670
Feature activation-5.77e-02

Pos logit contributions
andise+0.027
byss+0.024
untled+0.022
ratulations+0.021
istar+0.021
Neg logit contributions
@#&-0.028
cause-0.024
Narr-0.024
hetic-0.023
SIGN-0.023
Token of
Token ID286
Feature activation+9.27e-03

Pos logit contributions
andise+0.052
byss+0.047
untled+0.043
ratulations+0.042
istar+0.041
Neg logit contributions
@#&-0.044
cause-0.037
Narr-0.037
SIGN-0.036
hetic-0.036
Token the
Token ID262
Feature activation-8.76e-03

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.009
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token Assad
Token ID13032
Feature activation+4.14e-02

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.006
Narr-0.006
SIGN-0.005
hetic-0.005
Token regime
Token ID7142
Feature activation+7.85e-02

Pos logit contributions
@#&+0.030
cause+0.025
Narr+0.025
SIGN+0.024
ˈ+0.024
Neg logit contributions
andise-0.039
byss-0.035
untled-0.033
ratulations-0.031
istar-0.031
Token,
Token ID11
Feature activation-5.39e-03

Pos logit contributions
@#&+0.062
cause+0.052
Narr+0.052
SIGN+0.050
hetic+0.050
Neg logit contributions
andise-0.069
byss-0.063
untled-0.058
ratulations-0.056
istar-0.055
Token
Token ID198
Feature activation-1.34e-03

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.005
Narr-0.004
SIGN-0.004
ˈ-0.004
TokenDate
Token ID10430
Feature activation+1.42e-02

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
ˈ-0.001
Token:
Token ID25
Feature activation+5.51e-03

Pos logit contributions
@#&+0.011
cause+0.009
Narr+0.009
SIGN+0.009
ˈ+0.009
Neg logit contributions
andise-0.013
byss-0.012
untled-0.011
ratulations-0.010
istar-0.010
Token Nov
Token ID5267
Feature activation-4.64e-02

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
SIGN+0.003
ˈ+0.003
Neg logit contributions
andise-0.005
byss-0.005
untled-0.004
ratulations-0.004
istar-0.004
Token.
Token ID13
Feature activation+8.09e-06

Pos logit contributions
andise+0.041
byss+0.037
untled+0.034
ratulations+0.033
istar+0.032
Neg logit contributions
@#&-0.036
cause-0.031
Narr-0.031
SIGN-0.030
hetic-0.030
Token 24
Token ID1987
Feature activation-4.00e-02

Feature not active on prev token;
no predictions were affected.
Token,
Token ID11
Feature activation-4.99e-03

Pos logit contributions
andise+0.035
byss+0.031
untled+0.029
ratulations+0.028
istar+0.028
Neg logit contributions
@#&-0.032
cause-0.027
Narr-0.027
SIGN-0.026
hetic-0.026
Token 1944
Token ID16994
Feature activation-9.47e-03

Pos logit contributions
andise+0.005
byss+0.004
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
ˈ-0.003
Token
Token ID198
Feature activation+1.92e-03

Pos logit contributions
andise+0.008
byss+0.007
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.008
cause-0.006
Narr-0.006
SIGN-0.006
hetic-0.006
Token
Token ID198
Feature activation-5.76e-03

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
ˈ+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.001
ratulations-0.001
istar-0.001
TokenTre
Token ID31055
Feature activation+1.61e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token L
Token ID406
Feature activation-3.03e-02

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.011
cause-0.009
Narr-0.009
SIGN-0.009
ˈ-0.009
Tokenister
Token ID1694
Feature activation-1.66e-02

Pos logit contributions
andise+0.028
byss+0.026
untled+0.024
ratulations+0.023
istar+0.023
Neg logit contributions
@#&-0.022
cause-0.019
Narr-0.019
SIGN-0.018
hetic-0.018
Token told
Token ID1297
Feature activation+7.32e-03

Pos logit contributions
andise+0.014
byss+0.012
untled+0.011
istar+0.011
ratulations+0.011
Neg logit contributions
@#&-0.014
cause-0.012
Narr-0.012
ˈ-0.012
SIGN-0.012
Token the
Token ID262
Feature activation-6.99e-03

Pos logit contributions
@#&+0.005
cause+0.005
Narr+0.004
SIGN+0.004
ˈ+0.004
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.006
istar-0.005
Token Guardian
Token ID8283
Feature activation-5.66e-02

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.004
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
andise+0.050
byss+0.045
untled+0.041
ratulations+0.040
istar+0.039
Neg logit contributions
@#&-0.045
cause-0.038
Narr-0.038
SIGN-0.036
hetic-0.036
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token
Token ID198
Feature activation-4.53e-03

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
TokenThe
Token ID464
Feature activation-3.04e-04

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Token Natural
Token ID12068
Feature activation-4.67e-02

Pos logit contributions
andise+0.000
byss+0.000
untled+0.000
ratulations+0.000
istar+0.000
Neg logit contributions
@#&-0.000
cause-0.000
Narr-0.000
SIGN-0.000
hetic-0.000
Token History
Token ID7443
Feature activation+3.88e-03

Pos logit contributions
andise+0.044
byss+0.040
untled+0.037
ratulations+0.036
istar+0.035
Neg logit contributions
@#&-0.034
cause-0.029
Narr-0.029
SIGN-0.027
hetic-0.027
Token 2012
Token ID2321
Feature activation-1.48e-02

Pos logit contributions
@#&+0.031
cause+0.026
Narr+0.026
SIGN+0.025
hetic+0.025
Neg logit contributions
andise-0.035
byss-0.032
untled-0.030
ratulations-0.028
istar-0.028
Token,
Token ID11
Feature activation-1.28e-02

Pos logit contributions
andise+0.013
byss+0.011
untled+0.010
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.012
cause-0.010
Narr-0.010
SIGN-0.010
ˈ-0.010
Token 53
Token ID7192
Feature activation-3.72e-02

Pos logit contributions
andise+0.011
byss+0.010
untled+0.009
ratulations+0.009
istar+0.009
Neg logit contributions
@#&-0.010
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token percent
Token ID1411
Feature activation+1.35e-02

Pos logit contributions
andise+0.034
byss+0.031
untled+0.029
ratulations+0.028
istar+0.028
Neg logit contributions
@#&-0.027
cause-0.023
Narr-0.023
SIGN-0.022
ˈ-0.022
Token of
Token ID286
Feature activation-7.19e-03

Pos logit contributions
@#&+0.011
cause+0.009
Narr+0.009
SIGN+0.009
ˈ+0.009
Neg logit contributions
andise-0.012
byss-0.011
untled-0.010
ratulations-0.009
istar-0.009
Token respondents
Token ID14502
Feature activation-3.36e-02

Pos logit contributions
andise+0.006
byss+0.006
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.004
hetic-0.004
Token to
Token ID284
Feature activation-9.86e-03

Pos logit contributions
andise+0.030
byss+0.027
untled+0.025
ratulations+0.024
istar+0.024
Neg logit contributions
@#&-0.026
cause-0.022
Narr-0.022
SIGN-0.021
ˈ-0.021
Token an
Token ID281
Feature activation+1.07e-04

Pos logit contributions
andise+0.009
byss+0.008
untled+0.007
ratulations+0.007
istar+0.007
Neg logit contributions
@#&-0.008
cause-0.006
Narr-0.006
SIGN-0.006
ˈ-0.006
Token NBC
Token ID9716
Feature activation-1.29e-02

Pos logit contributions
@#&+0.000
cause+0.000
Narr+0.000
SIGN+0.000
hetic+0.000
Neg logit contributions
andise-0.000
byss-0.000
untled-0.000
ratulations-0.000
istar-0.000
Token News
Token ID3000
Feature activation+9.17e-02

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
SIGN-0.008
hetic-0.008
Token/
Token ID14
Feature activation-1.84e-02

Pos logit contributions
@#&+0.073
cause+0.062
Narr+0.061
SIGN+0.059
hetic+0.059
Neg logit contributions
andise-0.080
byss-0.072
untled-0.067
ratulations-0.064
istar-0.064
Token Newsweek
Token ID39594
Feature activation+9.24e-03

Pos logit contributions
@#&+0.009
cause+0.007
Narr+0.007
SIGN+0.007
hetic+0.007
Neg logit contributions
andise-0.010
byss-0.009
untled-0.009
ratulations-0.008
istar-0.008
Token in
Token ID287
Feature activation+8.89e-03

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
ˈ+0.006
Neg logit contributions
andise-0.008
byss-0.007
untled-0.007
ratulations-0.007
istar-0.007
Token 1994
Token ID9162
Feature activation+3.09e-03

Pos logit contributions
@#&+0.007
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.008
byss-0.008
untled-0.007
ratulations-0.007
istar-0.007
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
@#&+0.002
cause+0.002
Narr+0.002
SIGN+0.002
ˈ+0.002
Neg logit contributions
andise-0.003
byss-0.002
untled-0.002
ratulations-0.002
istar-0.002
Token "
Token ID366
Feature activation+3.36e-02

Pos logit contributions
@#&+0.008
cause+0.007
Narr+0.007
SIGN+0.007
ˈ+0.007
Neg logit contributions
andise-0.010
byss-0.009
untled-0.008
ratulations-0.008
istar-0.008
TokenI
Token ID40
Feature activation-1.80e-02

Pos logit contributions
@#&+0.025
cause+0.021
Narr+0.021
SIGN+0.020
ˈ+0.020
Neg logit contributions
andise-0.031
byss-0.028
untled-0.026
ratulations-0.025
istar-0.025
Token mean
Token ID1612
Feature activation+5.22e-03

Pos logit contributions
andise+0.017
byss+0.015
untled+0.014
ratulations+0.013
istar+0.013
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
SIGN-0.011
ˈ-0.011
Token,
Token ID11
Feature activation+1.65e-02

Pos logit contributions
@#&+0.004
cause+0.004
Narr+0.004
SIGN+0.003
ˈ+0.003
Neg logit contributions
andise-0.005
byss-0.004
untled-0.004
ratulations-0.004
istar-0.004
Token I
Token ID314
Feature activation+2.04e-02

Pos logit contributions
@#&+0.013
cause+0.011
Narr+0.010
SIGN+0.010
hetic+0.010
Neg logit contributions
andise-0.015
byss-0.014
untled-0.013
ratulations-0.012
istar-0.012
Token know
Token ID760
Feature activation+1.23e-02

Pos logit contributions
@#&+0.016
cause+0.013
Narr+0.013
SIGN+0.013
ˈ+0.013
Neg logit contributions
andise-0.018
byss-0.017
untled-0.015
ratulations-0.015
istar-0.015
Token what
Token ID644
Feature activation+5.34e-02

Pos logit contributions
@#&+0.009
cause+0.008
Narr+0.008
SIGN+0.008
hetic+0.008
Neg logit contributions
andise-0.011
byss-0.010
untled-0.009
ratulations-0.009
istar-0.009
Token out
Token ID503
Feature activation+2.47e-02

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
ˈ-0.002
Token to
Token ID284
Feature activation-1.43e-03

Pos logit contributions
@#&+0.019
cause+0.016
Narr+0.016
SIGN+0.016
hetic+0.016
Neg logit contributions
andise-0.022
byss-0.020
untled-0.018
ratulations-0.017
istar-0.017
Token focus
Token ID2962
Feature activation+8.08e-03

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token on
Token ID319
Feature activation-4.88e-03

Pos logit contributions
@#&+0.006
cause+0.005
Narr+0.005
SIGN+0.005
hetic+0.005
Neg logit contributions
andise-0.007
byss-0.006
untled-0.006
ratulations-0.006
istar-0.006
Token undergrad
Token ID43485
Feature activation+7.59e-03

Pos logit contributions
andise+0.004
byss+0.004
untled+0.004
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.004
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Tokenuates
Token ID12632
Feature activation-2.73e-01

Pos logit contributions
@#&+0.007
cause+0.006
Narr+0.006
SIGN+0.006
hetic+0.006
Neg logit contributions
andise-0.005
byss-0.005
untled-0.004
ratulations-0.004
istar-0.004
Token.
Token ID13
Feature activation+1.99e-03

Pos logit contributions
andise+0.239
byss+0.215
untled+0.198
ratulations+0.190
istar+0.189
Neg logit contributions
@#&-0.217
cause-0.184
Narr-0.182
SIGN-0.176
hetic-0.176
Token O
Token ID440
Feature activation+2.36e-02

Pos logit contributions
@#&+0.002
cause+0.001
Narr+0.001
SIGN+0.001
ˈ+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.002
ratulations-0.001
istar-0.001
Tokenurs
Token ID1834
Feature activation-1.36e-01

Pos logit contributions
@#&+0.018
cause+0.016
Narr+0.015
SIGN+0.015
hetic+0.015
Neg logit contributions
andise-0.021
byss-0.019
untled-0.017
ratulations-0.017
istar-0.017
Token is
Token ID318
Feature activation-1.53e-03

Pos logit contributions
andise+0.120
byss+0.108
untled+0.100
ratulations+0.096
istar+0.095
Neg logit contributions
@#&-0.107
cause-0.090
Narr-0.089
hetic-0.086
SIGN-0.086
Token a
Token ID257
Feature activation+3.42e-04

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token mint
Token ID21043
Feature activation-3.14e-01

Pos logit contributions
andise+0.069
byss+0.062
untled+0.057
ratulations+0.055
istar+0.055
Neg logit contributions
@#&-0.058
cause-0.049
Narr-0.049
SIGN-0.047
hetic-0.047
Token j
Token ID474
Feature activation-1.66e-02

Pos logit contributions
andise+0.269
byss+0.245
untled+0.222
ratulations+0.214
istar+0.213
Neg logit contributions
@#&-0.254
cause-0.215
Narr-0.213
hetic-0.207
ˈ-0.205
Tokenule
Token ID2261
Feature activation+6.54e-03

Pos logit contributions
andise+0.015
byss+0.014
untled+0.012
ratulations+0.012
istar+0.012
Neg logit contributions
@#&-0.013
cause-0.011
Narr-0.011
ˈ-0.010
hetic-0.010
Tokenp
Token ID79
Feature activation+2.09e-02

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.006
byss-0.005
untled-0.005
ratulations-0.005
istar-0.005
Token after
Token ID706
Feature activation-9.29e-03

Pos logit contributions
@#&+0.017
cause+0.014
Narr+0.014
SIGN+0.014
hetic+0.014
Neg logit contributions
andise-0.018
byss-0.016
untled-0.015
ratulations-0.014
istar-0.014
Token mint
Token ID21043
Feature activation-3.18e-01

Pos logit contributions
andise+0.007
byss+0.006
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.009
cause-0.008
Narr-0.008
ˈ-0.007
SIGN-0.007
Token j
Token ID474
Feature activation-1.42e-02

Pos logit contributions
andise+0.262
byss+0.235
untled+0.214
ratulations+0.204
istar+0.203
Neg logit contributions
@#&-0.269
cause-0.231
Narr-0.229
ˈ-0.222
SIGN-0.221
Tokenule
Token ID2261
Feature activation-4.09e-03

Pos logit contributions
andise+0.013
byss+0.012
untled+0.011
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.011
cause-0.009
Narr-0.009
hetic-0.009
ˈ-0.009
Tokenp
Token ID79
Feature activation+2.52e-02

Pos logit contributions
andise+0.004
byss+0.004
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
ˈ-0.002
hetic-0.002
Token at
Token ID379
Feature activation-3.91e-03

Pos logit contributions
@#&+0.020
cause+0.017
Narr+0.017
SIGN+0.016
hetic+0.016
Neg logit contributions
andise-0.022
byss-0.020
untled-0.018
ratulations-0.018
istar-0.018
Token his
Token ID465
Feature activation+3.25e-03

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token but
Token ID475
Feature activation-7.50e-03

Pos logit contributions
andise+0.019
byss+0.017
untled+0.016
ratulations+0.015
istar+0.015
Neg logit contributions
@#&-0.019
cause-0.016
Narr-0.016
SIGN-0.016
hetic-0.016
Token the
Token ID262
Feature activation+4.75e-03

Pos logit contributions
andise+0.007
byss+0.006
untled+0.006
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.006
cause-0.005
Narr-0.005
SIGN-0.005
hetic-0.005
Token Gear
Token ID10740
Feature activation+4.40e-02

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
SIGN+0.003
hetic+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.003
ratulations-0.003
istar-0.003
Token's
Token ID338
Feature activation-4.21e-03

Pos logit contributions
@#&+0.035
cause+0.030
Narr+0.030
SIGN+0.029
hetic+0.029
Neg logit contributions
andise-0.038
byss-0.034
untled-0.032
ratulations-0.030
istar-0.030
Token boot
Token ID6297
Feature activation-1.13e-01

Pos logit contributions
andise+0.004
byss+0.003
untled+0.003
ratulations+0.003
istar+0.003
Neg logit contributions
@#&-0.003
cause-0.003
Narr-0.003
SIGN-0.003
hetic-0.003
Tokenloader
Token ID29356
Feature activation-2.50e-01

Pos logit contributions
andise+0.075
byss+0.065
untled+0.058
ratulations+0.054
istar+0.054
Neg logit contributions
@#&-0.115
cause-0.101
Narr-0.100
SIGN-0.098
hetic-0.098
Token doesn
Token ID1595
Feature activation-1.18e-02

Pos logit contributions
andise+0.224
byss+0.203
untled+0.187
ratulations+0.179
istar+0.178
Neg logit contributions
@#&-0.192
cause-0.163
Narr-0.161
SIGN-0.155
hetic-0.155
Token't
Token ID470
Feature activation+3.39e-02

Pos logit contributions
andise+0.012
byss+0.011
untled+0.010
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.008
cause-0.007
Narr-0.006
SIGN-0.006
ˈ-0.006
Token notice
Token ID4003
Feature activation-9.73e-02

Pos logit contributions
@#&+0.026
cause+0.022
Narr+0.022
SIGN+0.021
hetic+0.021
Neg logit contributions
andise-0.030
byss-0.027
untled-0.025
ratulations-0.024
istar-0.024
Token it
Token ID340
Feature activation-2.53e-03

Pos logit contributions
andise+0.088
byss+0.080
untled+0.074
ratulations+0.071
istar+0.071
Neg logit contributions
@#&-0.074
cause-0.063
Narr-0.062
SIGN-0.059
hetic-0.059
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
andise+0.002
byss+0.002
untled+0.002
ratulations+0.002
istar+0.002
Neg logit contributions
@#&-0.002
cause-0.002
Narr-0.002
SIGN-0.002
hetic-0.002
Token know
Token ID760
Feature activation-6.50e-03

Pos logit contributions
andise+0.052
byss+0.047
untled+0.044
ratulations+0.042
istar+0.042
Neg logit contributions
@#&-0.045
cause-0.038
Narr-0.038
SIGN-0.036
hetic-0.036
Token about
Token ID546
Feature activation-5.57e-03

Pos logit contributions
andise+0.006
byss+0.005
untled+0.005
ratulations+0.005
istar+0.005
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token are
Token ID389
Feature activation-1.44e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.004
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.004
cause-0.004
Narr-0.004
SIGN-0.003
hetic-0.003
Token just
Token ID655
Feature activation+5.98e-03

Pos logit contributions
andise+0.013
byss+0.012
untled+0.011
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.011
cause-0.009
Narr-0.009
SIGN-0.009
hetic-0.009
Token the
Token ID262
Feature activation-8.62e-04

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.004
Neg logit contributions
andise-0.005
byss-0.005
untled-0.005
ratulations-0.004
istar-0.004
Token tip
Token ID8171
Feature activation-2.49e-01

Pos logit contributions
andise+0.001
byss+0.001
untled+0.001
ratulations+0.001
istar+0.001
Neg logit contributions
@#&-0.001
cause-0.001
Narr-0.001
SIGN-0.001
hetic-0.001
Token of
Token ID286
Feature activation+1.72e-03

Pos logit contributions
andise+0.218
byss+0.196
untled+0.181
istar+0.173
ratulations+0.173
Neg logit contributions
@#&-0.197
cause-0.167
Narr-0.165
SIGN-0.160
ˈ-0.159
Token an
Token ID281
Feature activation+9.26e-03

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.001
untled-0.001
ratulations-0.001
istar-0.001
Token iceberg
Token ID34849
Feature activation+1.31e-01

Pos logit contributions
@#&+0.005
cause+0.004
Narr+0.004
SIGN+0.004
hetic+0.003
Neg logit contributions
andise-0.011
byss-0.010
untled-0.009
ratulations-0.009
istar-0.009
Token,
Token ID11
Feature activation-6.15e-03

Pos logit contributions
@#&+0.104
cause+0.088
Narr+0.087
SIGN+0.084
hetic+0.084
Neg logit contributions
andise-0.115
byss-0.104
untled-0.096
ratulations-0.092
istar-0.091
Token one
Token ID530
Feature activation+2.48e-02

Pos logit contributions
andise+0.005
byss+0.005
untled+0.005
ratulations+0.004
istar+0.004
Neg logit contributions
@#&-0.005
cause-0.004
Narr-0.004
SIGN-0.004
hetic-0.004
Token
Token ID198
Feature activation+1.95e-03

Pos logit contributions
@#&+0.012
cause+0.010
Narr+0.010
SIGN+0.010
hetic+0.010
Neg logit contributions
andise-0.014
byss-0.012
untled-0.011
ratulations-0.011
istar-0.011
TokenCol
Token ID5216
Feature activation+7.66e-02

Pos logit contributions
@#&+0.001
cause+0.001
Narr+0.001
SIGN+0.001
hetic+0.001
Neg logit contributions
andise-0.002
byss-0.002
untled-0.001
ratulations-0.001
istar-0.001
Tokenonel
Token ID26261
Feature activation-1.47e-01

Pos logit contributions
@#&+0.073
cause+0.063
Narr+0.063
SIGN+0.061
ˈ+0.061
Neg logit contributions
andise-0.055
byss-0.048
untled-0.044
ratulations-0.041
istar-0.041
Token David
Token ID3271
Feature activation-8.51e-03

Pos logit contributions
andise+0.133
byss+0.119
untled+0.110
istar+0.106
ratulations+0.106
Neg logit contributions
@#&-0.112
cause-0.094
Narr-0.093
ˈ-0.090
SIGN-0.089
Token L
Token ID406
Feature activation-1.43e-02

Pos logit contributions
andise+0.008
byss+0.007
untled+0.006
ratulations+0.006
istar+0.006
Neg logit contributions
@#&-0.007
cause-0.005
Narr-0.005
ˈ-0.005
hetic-0.005
Tokenapan
Token ID2674
Feature activation-2.99e-01

Pos logit contributions
andise+0.013
byss+0.011
untled+0.010
ratulations+0.010
istar+0.010
Neg logit contributions
@#&-0.011
cause-0.010
Narr-0.009
SIGN-0.009
hetic-0.009
Token,
Token ID11
Feature activation+4.77e-03

Pos logit contributions
andise+0.260
byss+0.233
untled+0.214
ratulations+0.205
istar+0.205
Neg logit contributions
@#&-0.239
cause-0.203
Narr-0.201
SIGN-0.194
ˈ-0.193
Token a
Token ID257
Feature activation+3.61e-03

Pos logit contributions
@#&+0.004
cause+0.003
Narr+0.003
ˈ+0.003
SIGN+0.003
Neg logit contributions
andise-0.004
byss-0.004
untled-0.004
ratulations-0.003
istar-0.003
Token Pentagon
Token ID12651
Feature activation+6.33e-02

Pos logit contributions
@#&+0.003
cause+0.002
Narr+0.002
SIGN+0.002
hetic+0.002
Neg logit contributions
andise-0.003
byss-0.003
untled-0.003
ratulations-0.003
istar-0.003
Token spokesman
Token ID6523
Feature activation+1.41e-01

Pos logit contributions
@#&+0.043
cause+0.036
Narr+0.035
hetic+0.033
SIGN+0.033
Neg logit contributions
andise-0.063
byss-0.057
untled-0.053
ratulations-0.051
istar-0.051
Token,
Token ID11
Feature activation+5.65e-03

Pos logit contributions
@#&+0.114
cause+0.097
Narr+0.096
SIGN+0.093
hetic+0.093
Neg logit contributions
andise-0.121
byss-0.108
untled-0.099
ratulations-0.095
istar-0.095