TOP ACTIVATIONS
MAX = 0.499

though they both had new journals , their friendship never changed
again and share their yellow journals with each other . And
Tim my wrote in their journals together every day . They
some cr umbs on the journal , but she didn 't
and had a special pink journal that she wrote in every
Jane wanted to make her journal even more amazing . She
when she pretended with her journal and each day she added
noticed him looking at the journal and showed it to him
liked . Mary loved her journal so much !
Bub bles looked at the journal and smiled . She said
the new friend in her journal . Every day
sell . I have a journal , and it â
day he discovered a dry journal . He wanted to know
drew the bug in her journal and wrote about how she
want anyone to read her journal , because it was hers
her things , especially her journal . <|endoftext|> Once upon a
Lily went back to her journal and drew a picture of
bles , look at this journal I found !" Bub bles
her own pictures in the journal and make it even mess
special journal . In her journal Mary kept pictures , drawings

MIN ACTIVATIONS
MIN = -0.517

and the little boy was comm ended for his bravery .
to see what all the comm otion was about . Mom
came quickly and saw the comm otion outside . She hugged
John 's mom heard the comm otion and came to investigate
Jen heard the comm otion and came outside .
my 's mom heard the comm otion and came into the
Mom heard the comm otion and came to see
my 's mom heard the comm otion and came running .
The farmer hears the comm otion and comes to see
my 's mom noticed the comm otion and came outside .
Mom heard the comm otion and came to the
came to see what the comm otion was about . She
B eth heard the comm otion and came running .
find out what all the comm otion was about . When
. The chip mun ks were very curious ,
! The chip mun ks didn 't know what
cutting !" The wood cut ters heard them and stopped .
. She realized that ot ters are better off in the
Clarence asked his friend Chip mun ks , " Is it
It saw the two ot ters and wanted to eat them

INTERVAL 0.245 - 0.499
CONTAINS 0.403%

felt , and what she dreamed . She used cr ay
She wanted to show her daddy that she knew what she
. Sara 's mom and dad came over and gave them
't find her cr ay on and she thought she had
head and carefully opened the shell . Inside , there was

INTERVAL -0.009 - 0.245
CONTAINS 55.936%

" Help ! Help !" Tweet ie cried . A
Lily says . " We have to help him ."
. The end . <|endoftext|> Once upon a time , there
quickly . When it was ready , the woman took the
jelly and ran away . The jelly spl ashed on the

INTERVAL -0.263 - -0.009
CONTAINS 43.278%

fun . Sometimes they play cow boys and cow girls ,
. They were very adventurous . One day both Jack and
?" She looks at his car and says " I 'm
. They did not match , but they looked nice .
was pink and spark ly , and Lily was so excited

INTERVAL -0.517 - -0.263
CONTAINS 0.383%

was very happy . His idea worked , and he got
scarf . They threw snow balls at each other . They
gentle when un k not ting things and he also learned
it . " Come on , Tom , you
lost in the clouds , higher and higher , until it
Token though
Token ID996
Feature activation-8.70e-02

Pos logit contributions
 journals+0.136
 journal+0.129
 Christmas+0.118
pron+0.117
ifying+0.111
Neg logit contributions
 comm-0.109
mun-0.103
ters-0.097
 ang-0.095
 tactic-0.092
Token they
Token ID484
Feature activation-3.81e-02

Pos logit contributions
 comm+0.132
mun+0.128
ters+0.122
 ang+0.117
 persu+0.113
Neg logit contributions
 journals-0.174
 journal-0.166
pron-0.157
ocket-0.149
ak-0.147
Token both
Token ID1111
Feature activation+2.03e-01

Pos logit contributions
 comm+0.053
mun+0.051
 ang+0.045
 warns+0.044
ters+0.042
Neg logit contributions
 journals-0.080
 journal-0.075
pron-0.073
ifying-0.069
ak-0.067
Token had
Token ID550
Feature activation+2.21e-01

Pos logit contributions
pron+0.349
 journals+0.337
ifying+0.334
iflower+0.310
ak+0.306
Neg logit contributions
 comm-0.323
mun-0.316
 warns-0.292
 means-0.275
ters-0.261
Token new
Token ID649
Feature activation-1.20e-01

Pos logit contributions
 journals+0.417
 journal+0.410
pron+0.404
iflower+0.384
ifying+0.381
Neg logit contributions
 comm-0.353
 ang-0.312
mun-0.304
ters-0.297
 persuasion-0.284
Token journals
Token ID22790
Feature activation+4.99e-01

Pos logit contributions
 comm+0.143
 tactic+0.128
mun+0.127
 ang+0.113
 warns+0.112
Neg logit contributions
ifying-0.262
pron-0.258
ouse-0.240
 Christmas-0.236
iflower-0.235
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
 journals+0.921
 journal+0.859
pron+0.810
 Christmas+0.770
ifying+0.760
Neg logit contributions
 comm-0.824
mun-0.781
ters-0.774
 ang-0.730
 tactic-0.706
Token their
Token ID511
Feature activation+9.63e-03

Pos logit contributions
 comm+0.039
mun+0.037
 ang+0.035
ters+0.034
 tactic+0.034
Neg logit contributions
 journals-0.032
 journal-0.030
 Christmas-0.028
pron-0.026
ifying-0.024
Token friendship
Token ID14738
Feature activation+9.88e-03

Pos logit contributions
ak+0.015
ifying+0.014
pron+0.014
ouse+0.014
on+0.014
Neg logit contributions
 comm-0.018
 ang-0.015
mun-0.014
 tactic-0.014
 noise-0.013
Token never
Token ID1239
Feature activation+2.30e-01

Pos logit contributions
 journals+0.018
 journal+0.016
pron+0.015
 Christmas+0.015
ifying+0.014
Neg logit contributions
 comm-0.017
mun-0.016
ters-0.016
 ang-0.015
 tactic-0.014
Token changed
Token ID3421
Feature activation+3.08e-02

Pos logit contributions
 journals+0.324
 journal+0.305
 Christmas+0.268
pron+0.261
ifying+0.241
Neg logit contributions
 comm-0.484
mun-0.473
ters-0.449
 ang-0.438
 tactic-0.434
Token again
Token ID757
Feature activation-6.25e-02

Pos logit contributions
 comm+0.055
mun+0.053
ters+0.050
 ang+0.050
 persu+0.048
Neg logit contributions
 journals-0.052
 journal-0.050
 Christmas-0.044
pron-0.044
ifying-0.041
Token and
Token ID290
Feature activation+5.84e-02

Pos logit contributions
 comm+0.110
mun+0.102
ters+0.101
 ang+0.099
 persu+0.097
Neg logit contributions
 journals-0.110
 journal-0.103
pron-0.094
ifying-0.091
ocket-0.087
Token share
Token ID2648
Feature activation+5.70e-02

Pos logit contributions
 journals+0.094
 journal+0.090
 Christmas+0.079
pron+0.079
ifying+0.073
Neg logit contributions
 comm-0.113
mun-0.108
ters-0.104
 ang-0.102
 persu-0.098
Token their
Token ID511
Feature activation+6.66e-03

Pos logit contributions
 journals+0.101
 journal+0.097
pron+0.092
 Christmas+0.088
ifying+0.088
Neg logit contributions
 comm-0.096
mun-0.092
ters-0.088
 ang-0.086
 tactic-0.083
Token yellow
Token ID7872
Feature activation+8.98e-02

Pos logit contributions
ifying+0.010
pron+0.008
ouse+0.008
ak+0.007
ppa+0.007
Neg logit contributions
 comm-0.014
 tactic-0.013
 ang-0.012
 persuasion-0.012
 persu-0.012
Token journals
Token ID22790
Feature activation+4.96e-01

Pos logit contributions
ifying+0.080
ak+0.063
 worn+0.058
ppa+0.055
wy+0.052
Neg logit contributions
 comm-0.216
 tactic-0.215
mun-0.203
fl-0.198
 advice-0.194
Token with
Token ID351
Feature activation-4.06e-02

Pos logit contributions
 journals+1.059
ifying+1.002
 journal+0.987
pron+0.970
 Christmas+0.935
Neg logit contributions
 comm-0.587
ters-0.569
 tactic-0.526
mun-0.524
 ang-0.497
Token each
Token ID1123
Feature activation+8.89e-02

Pos logit contributions
 comm+0.073
mun+0.070
ters+0.067
 ang+0.065
 persuasion+0.062
Neg logit contributions
 journals-0.070
 journal-0.067
pron-0.061
 Christmas-0.058
ifying-0.057
Token other
Token ID584
Feature activation-1.28e-01

Pos logit contributions
 journals+0.217
 journal+0.206
pron+0.205
ifying+0.201
ouse+0.201
Neg logit contributions
 comm-0.083
mun-0.078
ters-0.073
 tactic-0.071
 ang-0.067
Token.
Token ID13
Feature activation+1.25e-03

Pos logit contributions
 comm+0.193
ters+0.192
mun+0.182
 ang+0.170
 tactic+0.162
Neg logit contributions
 journals-0.247
 journal-0.235
pron-0.223
 Christmas-0.211
ak-0.206
Token And
Token ID843
Feature activation+1.73e-02

Pos logit contributions
 journals+0.002
 journal+0.002
pron+0.002
 Christmas+0.002
ocket+0.002
Neg logit contributions
 comm-0.002
mun-0.002
ters-0.002
 ang-0.002
 persu-0.002
Token Tim
Token ID5045
Feature activation-6.16e-02

Pos logit contributions
 journals+0.096
 journal+0.092
pron+0.083
 Christmas+0.082
ifying+0.079
Neg logit contributions
 comm-0.095
mun-0.090
ters-0.085
 ang-0.085
 persu-0.081
Tokenmy
Token ID1820
Feature activation+7.91e-02

Pos logit contributions
mun+0.094
 comm+0.086
ters+0.085
 ang+0.082
fl+0.077
Neg logit contributions
 journals-0.117
 journal-0.116
pron-0.113
ifying-0.109
ocket-0.108
Token wrote
Token ID2630
Feature activation+2.57e-01

Pos logit contributions
pron+0.152
 journal+0.146
ifying+0.143
 journals+0.141
ocket+0.141
Neg logit contributions
mun-0.112
 comm-0.098
fl-0.094
ters-0.092
 means-0.081
Token in
Token ID287
Feature activation+2.35e-02

Pos logit contributions
 journals+0.434
 journal+0.412
 Christmas+0.360
pron+0.347
 notebook+0.333
Neg logit contributions
 comm-0.497
mun-0.481
ters-0.445
 ang-0.440
 persu-0.435
Token their
Token ID511
Feature activation+4.07e-03

Pos logit contributions
pron+0.045
ouse+0.045
 journals+0.043
ak+0.042
 journal+0.041
Neg logit contributions
 comm-0.034
 ang-0.030
mun-0.028
 tactic-0.028
ters-0.028
Token journals
Token ID22790
Feature activation+4.96e-01

Pos logit contributions
ifying+0.004
ouse+0.004
pron+0.004
orse+0.004
on+0.004
Neg logit contributions
 comm-0.009
 ang-0.009
 tactic-0.009
 persuasion-0.008
mun-0.008
Token together
Token ID1978
Feature activation-9.97e-02

Pos logit contributions
 journals+0.947
 journal+0.908
pron+0.826
 Christmas+0.818
ifying+0.776
Neg logit contributions
 comm-0.808
mun-0.764
ters-0.719
 ang-0.716
 persu-0.682
Token every
Token ID790
Feature activation+5.99e-02

Pos logit contributions
 comm+0.184
mun+0.174
ters+0.169
 ang+0.165
 tactic+0.160
Neg logit contributions
 journals-0.166
 journal-0.157
pron-0.144
 Christmas-0.138
ifying-0.135
Token day
Token ID1110
Feature activation+3.08e-02

Pos logit contributions
 journals+0.113
ifying+0.106
ouse+0.105
 journal+0.101
pron+0.100
Neg logit contributions
 comm-0.090
 tactic-0.079
 persuasion-0.078
 persu-0.077
 ang-0.076
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
 journals+0.057
 journal+0.055
pron+0.052
iflower+0.048
ouse+0.048
Neg logit contributions
 comm-0.049
ters-0.048
mun-0.046
 persuasion-0.043
 persu-0.042
Token They
Token ID1119
Feature activation-1.22e-02

Pos logit contributions
 journals+0.007
 journal+0.007
pron+0.006
ocket+0.006
 Christmas+0.006
Neg logit contributions
 comm-0.006
mun-0.006
ters-0.006
 ang-0.005
 persu-0.005
Token some
Token ID617
Feature activation-6.91e-02

Pos logit contributions
 journals+0.175
 journal+0.168
pron+0.155
ouse+0.149
ocket+0.146
Neg logit contributions
 comm-0.160
ters-0.149
mun-0.148
 ang-0.147
 persu-0.135
Token cr
Token ID1067
Feature activation+1.70e-01

Pos logit contributions
 comm+0.115
mun+0.109
ters+0.106
 ang+0.101
 persuasion+0.096
Neg logit contributions
 journals-0.129
 journal-0.122
pron-0.110
 Christmas-0.108
ouse-0.107
Tokenumbs
Token ID18146
Feature activation+1.44e-01

Pos logit contributions
 journals+0.263
 journal+0.253
 Christmas+0.217
 dreamed+0.214
pron+0.207
Neg logit contributions
ters-0.332
 comm-0.318
mun-0.297
 persuasion-0.281
 tactic-0.280
Token on
Token ID319
Feature activation-1.23e-02

Pos logit contributions
 journals+0.291
 journal+0.268
pron+0.248
 Christmas+0.239
ouse+0.229
Neg logit contributions
 comm-0.228
ters-0.220
mun-0.210
 ang-0.208
 persu-0.196
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 comm+0.018
ters+0.017
mun+0.016
 ang+0.016
 persu+0.015
Neg logit contributions
 journals-0.025
 journal-0.024
ouse-0.023
ocket-0.022
ak-0.021
Token journal
Token ID3989
Feature activation+4.75e-01

Pos logit contributions
 journals+0.018
ifying+0.018
orse+0.018
ak+0.017
ouse+0.017
Neg logit contributions
 comm-0.021
ters-0.017
 noise-0.017
 ang-0.016
mun-0.016
Token,
Token ID11
Feature activation-1.61e-02

Pos logit contributions
 journals+0.770
 journal+0.714
pron+0.642
 Christmas+0.635
ifying+0.602
Neg logit contributions
 comm-0.925
mun-0.866
ters-0.845
 ang-0.828
 tactic-0.797
Token but
Token ID475
Feature activation-1.86e-01

Pos logit contributions
 comm+0.026
mun+0.023
 ang+0.023
ters+0.022
 persu+0.021
Neg logit contributions
 journals-0.033
 journal-0.031
 Christmas-0.028
pron-0.028
ocket-0.027
Token she
Token ID673
Feature activation+7.32e-03

Pos logit contributions
 comm+0.272
 ang+0.261
ters+0.258
mun+0.251
 persu+0.241
Neg logit contributions
 journals-0.378
 journal-0.341
ouse-0.327
ak-0.326
ifying-0.323
Token didn
Token ID1422
Feature activation-4.12e-03

Pos logit contributions
 journals+0.017
 journal+0.015
ifying+0.015
 Christmas+0.014
 telescope+0.014
Neg logit contributions
 comm-0.010
mun-0.009
ters-0.009
 ang-0.008
 warns-0.008
Token't
Token ID470
Feature activation-5.01e-02

Pos logit contributions
 comm+0.005
mun+0.005
ters+0.005
 ang+0.005
 persu+0.005
Neg logit contributions
 journals-0.009
 journal-0.009
pron-0.008
 Christmas-0.008
ocket-0.008
Token and
Token ID290
Feature activation+5.92e-02

Pos logit contributions
 journals+0.241
 journal+0.225
pron+0.211
 Christmas+0.200
ifying+0.196
Neg logit contributions
 comm-0.243
mun-0.235
ters-0.229
 tactic-0.213
 ang-0.213
Token had
Token ID550
Feature activation+2.20e-01

Pos logit contributions
 journals+0.108
 journal+0.104
ocket+0.101
ifying+0.101
ak+0.090
Neg logit contributions
mun-0.088
 comm-0.085
fl-0.078
ters-0.074
whe-0.073
Token a
Token ID257
Feature activation-1.09e-02

Pos logit contributions
 journals+0.443
 journal+0.437
ocket+0.416
pron+0.413
ouse+0.411
Neg logit contributions
 comm-0.305
mun-0.270
ters-0.264
 ang-0.228
fl-0.227
Token special
Token ID2041
Feature activation+9.68e-02

Pos logit contributions
 comm+0.020
mun+0.018
 tactic+0.017
 ang+0.017
 persu+0.016
Neg logit contributions
 journals-0.018
ocket-0.015
ifying-0.015
 journal-0.015
ak-0.015
Token pink
Token ID11398
Feature activation+1.39e-01

Pos logit contributions
 journals+0.147
 journal+0.130
ifying+0.118
pron+0.116
 Christmas+0.115
Neg logit contributions
 comm-0.193
mun-0.179
 tactic-0.173
 ang-0.171
ters-0.170
Token journal
Token ID3989
Feature activation+4.75e-01

Pos logit contributions
 journals+0.231
 journal+0.198
ak+0.193
ocket+0.187
iflower+0.182
Neg logit contributions
 comm-0.247
mun-0.230
ters-0.227
 tactic-0.221
fl-0.211
Token that
Token ID326
Feature activation-1.44e-02

Pos logit contributions
 journals+0.911
 journal+0.829
pron+0.766
ifying+0.758
 Christmas+0.748
Neg logit contributions
 comm-0.744
mun-0.685
ters-0.681
 ang-0.632
 persu-0.614
Token she
Token ID673
Feature activation+4.02e-03

Pos logit contributions
 comm+0.021
mun+0.020
ters+0.019
 ang+0.018
 persuasion+0.017
Neg logit contributions
 journals-0.027
ifying-0.026
 journal-0.025
pron-0.024
ak-0.024
Token wrote
Token ID2630
Feature activation+2.66e-01

Pos logit contributions
ifying+0.007
 journals+0.007
 journal+0.007
alks+0.006
 telescope+0.006
Neg logit contributions
mun-0.006
 comm-0.005
ters-0.005
 warns-0.005
 means-0.004
Token in
Token ID287
Feature activation+2.84e-02

Pos logit contributions
 journals+0.518
 journal+0.493
pron+0.449
 Christmas+0.443
ifying+0.434
Neg logit contributions
 comm-0.412
mun-0.382
ters-0.379
 ang-0.362
 tactic-0.343
Token every
Token ID790
Feature activation+6.74e-02

Pos logit contributions
 journals+0.055
ouse+0.055
pron+0.054
 journal+0.053
ak+0.052
Neg logit contributions
 comm-0.039
 tactic-0.033
ters-0.032
 noise-0.031
mun-0.031
TokenJane
Token ID41083
Feature activation-3.71e-02

Pos logit contributions
 comm+0.041
mun+0.038
ters+0.036
 ang+0.035
 tactic+0.034
Neg logit contributions
 journals-0.050
 journal-0.047
pron-0.042
 Christmas-0.042
ocket-0.041
Token wanted
Token ID2227
Feature activation+7.75e-02

Pos logit contributions
 comm+0.056
mun+0.052
ters+0.048
 ang+0.047
 warns+0.044
Neg logit contributions
 journals-0.078
 journal-0.074
ifying-0.069
pron-0.069
 Christmas-0.066
Token to
Token ID284
Feature activation+1.35e-02

Pos logit contributions
 journals+0.131
 journal+0.124
pron+0.113
 Christmas+0.110
ifying+0.108
Neg logit contributions
 comm-0.144
mun-0.136
ters-0.130
 ang-0.130
 persu-0.124
Token make
Token ID787
Feature activation-8.77e-02

Pos logit contributions
 journals+0.031
pron+0.028
 journal+0.028
ifying+0.028
iflower+0.027
Neg logit contributions
 comm-0.017
 tactic-0.014
mun-0.013
 ang-0.013
ters-0.013
Token her
Token ID607
Feature activation+4.49e-02

Pos logit contributions
 comm+0.176
mun+0.175
ters+0.163
 ang+0.158
 persu+0.157
Neg logit contributions
 journals-0.132
 journal-0.128
 Christmas-0.110
pron-0.105
 notebook-0.102
Token journal
Token ID3989
Feature activation+4.74e-01

Pos logit contributions
 journals+0.061
ifying+0.059
pron+0.058
ak+0.053
 journal+0.052
Neg logit contributions
 comm-0.092
 tactic-0.084
 ang-0.082
mun-0.081
 persu-0.078
Token even
Token ID772
Feature activation+6.95e-02

Pos logit contributions
 journals+0.892
 journal+0.816
pron+0.782
ifying+0.780
 Christmas+0.754
Neg logit contributions
 comm-0.767
 ang-0.704
mun-0.692
ters-0.683
 tactic-0.675
Token more
Token ID517
Feature activation-1.45e-01

Pos logit contributions
 journals+0.142
 journal+0.137
pron+0.120
 Christmas+0.118
 notebook+0.116
Neg logit contributions
 comm-0.104
mun-0.103
ters-0.094
 persuasion-0.087
 persu-0.087
Token amazing
Token ID4998
Feature activation-4.61e-02

Pos logit contributions
 comm+0.236
 tactic+0.197
mun+0.193
 ang+0.190
 persu+0.187
Neg logit contributions
 journals-0.275
ifying-0.266
 journal-0.259
pron-0.256
 Christmas-0.242
Token.
Token ID13
Feature activation+1.52e-03

Pos logit contributions
 comm+0.088
mun+0.084
ters+0.080
 ang+0.080
 persu+0.076
Neg logit contributions
 journals-0.076
 journal-0.072
pron-0.065
 Christmas-0.064
ifying-0.060
Token She
Token ID1375
Feature activation+2.21e-02

Pos logit contributions
 journals+0.003
 journal+0.002
pron+0.002
 Christmas+0.002
ifying+0.002
Neg logit contributions
 comm-0.003
mun-0.003
ters-0.003
 ang-0.003
 persu-0.002
Token when
Token ID618
Feature activation-8.38e-02

Pos logit contributions
 journals+0.155
 journal+0.147
 Christmas+0.130
pron+0.130
ifying+0.120
Neg logit contributions
 comm-0.189
mun-0.182
ters-0.173
 ang-0.173
 tactic-0.165
Token she
Token ID673
Feature activation+2.55e-03

Pos logit contributions
 comm+0.129
mun+0.123
 ang+0.115
ters+0.113
 persu+0.111
Neg logit contributions
 journals-0.162
 journal-0.152
pron-0.152
ifying-0.145
ak-0.145
Token pretended
Token ID36766
Feature activation+9.18e-02

Pos logit contributions
 journals+0.005
 journal+0.004
ifying+0.004
pron+0.004
 Christmas+0.004
Neg logit contributions
 comm-0.004
mun-0.004
 ang-0.003
 warns-0.003
 persu-0.003
Token with
Token ID351
Feature activation-3.12e-02

Pos logit contributions
 journals+0.148
 journal+0.141
pron+0.126
 Christmas+0.125
ifying+0.117
Neg logit contributions
 comm-0.176
mun-0.168
ters-0.161
 ang-0.159
 persu-0.153
Token her
Token ID607
Feature activation+4.28e-02

Pos logit contributions
 comm+0.044
 ang+0.039
 persuasion+0.037
 persu+0.037
 louder+0.037
Neg logit contributions
ouse-0.061
 journals-0.059
pron-0.057
ifying-0.056
iflower-0.055
Token journal
Token ID3989
Feature activation+4.74e-01

Pos logit contributions
ifying+0.037
abby+0.035
ak+0.034
ackers+0.033
ppa+0.033
Neg logit contributions
 comm-0.098
 tactic-0.093
 noise-0.088
 advice-0.087
 warns-0.086
Token and
Token ID290
Feature activation+6.03e-02

Pos logit contributions
 journals+0.898
 journal+0.819
pron+0.785
ifying+0.760
 Christmas+0.746
Neg logit contributions
 comm-0.758
ters-0.689
mun-0.680
 ang-0.663
 persuasion-0.636
Token each
Token ID1123
Feature activation+8.91e-02

Pos logit contributions
 journals+0.101
pron+0.097
 journal+0.096
ifying+0.095
ak+0.091
Neg logit contributions
 comm-0.108
mun-0.098
 ang-0.095
ters-0.094
 warns-0.089
Token day
Token ID1110
Feature activation+3.82e-02

Pos logit contributions
endant+0.163
 journals+0.162
ifying+0.152
ouse+0.147
iflower+0.146
Neg logit contributions
 tactic-0.124
 comm-0.120
mun-0.112
 ang-0.110
 persuasion-0.105
Token she
Token ID673
Feature activation+3.96e-03

Pos logit contributions
 journals+0.073
 journal+0.069
pron+0.065
iflower+0.062
 fork+0.061
Neg logit contributions
 comm-0.061
mun-0.055
ters-0.055
 ang-0.054
 persu-0.053
Token added
Token ID2087
Feature activation+1.75e-01

Pos logit contributions
 journals+0.007
ifying+0.007
pron+0.007
 journal+0.007
 moderation+0.006
Neg logit contributions
 comm-0.006
mun-0.006
 warns-0.005
 ang-0.005
ters-0.005
Token noticed
Token ID6810
Feature activation+9.50e-02

Pos logit contributions
 comm+0.067
mun+0.065
ters+0.062
 ang+0.060
 persu+0.057
Neg logit contributions
 journals-0.066
 journal-0.063
pron-0.057
 Christmas-0.056
ifying-0.055
Token him
Token ID683
Feature activation-1.75e-02

Pos logit contributions
 journals+0.139
 journal+0.134
 Christmas+0.122
pron+0.118
iflower+0.112
Neg logit contributions
 comm-0.193
mun-0.184
 tactic-0.175
 ang-0.171
ters-0.169
Token looking
Token ID2045
Feature activation-2.03e-02

Pos logit contributions
 comm+0.032
mun+0.030
 ang+0.029
ters+0.029
 persu+0.027
Neg logit contributions
 journals-0.030
 journal-0.028
pron-0.026
 Christmas-0.025
ifying-0.025
Token at
Token ID379
Feature activation+1.20e-02

Pos logit contributions
 comm+0.030
mun+0.028
 ang+0.027
ters+0.026
 persu+0.025
Neg logit contributions
 journals-0.042
 journal-0.039
pron-0.038
 Christmas-0.036
ifying-0.035
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 journals+0.021
 journal+0.020
pron+0.018
 Christmas+0.017
ifying+0.017
Neg logit contributions
 comm-0.022
mun-0.021
ters-0.020
 ang-0.020
 persu-0.019
Token journal
Token ID3989
Feature activation+4.74e-01

Pos logit contributions
 journals+0.011
orse+0.011
ifying+0.010
ouse+0.010
ak+0.010
Neg logit contributions
 comm-0.025
ters-0.021
 ang-0.021
 tactic-0.021
mun-0.021
Token and
Token ID290
Feature activation+6.38e-02

Pos logit contributions
 journals+0.879
 journal+0.792
pron+0.753
ifying+0.721
 Christmas+0.719
Neg logit contributions
 comm-0.788
ters-0.740
mun-0.729
 ang-0.687
 tactic-0.676
Token showed
Token ID3751
Feature activation+6.35e-04

Pos logit contributions
 journals+0.108
 journal+0.104
 Christmas+0.093
pron+0.093
 notebook+0.087
Neg logit contributions
 comm-0.113
mun-0.109
ters-0.105
 ang-0.101
 tactic-0.100
Token it
Token ID340
Feature activation+1.56e-01

Pos logit contributions
 journals+0.001
 journal+0.001
 Christmas+0.001
pron+0.001
ocket+0.001
Neg logit contributions
 comm-0.001
mun-0.001
ters-0.001
 ang-0.001
 tactic-0.001
Token to
Token ID284
Feature activation+1.58e-02

Pos logit contributions
 journals+0.253
 journal+0.240
pron+0.214
 Christmas+0.212
ifying+0.200
Neg logit contributions
 comm-0.300
mun-0.287
ters-0.274
 ang-0.271
 persu-0.260
Token him
Token ID683
Feature activation-1.54e-02

Pos logit contributions
 journals+0.025
 journal+0.024
 Christmas+0.022
pron+0.021
ifying+0.020
Neg logit contributions
 comm-0.031
mun-0.029
ters-0.028
 ang-0.027
 tactic-0.027
Token liked
Token ID8288
Feature activation+1.10e-01

Pos logit contributions
 comm+0.002
mun+0.002
ters+0.001
 warns+0.001
 ang+0.001
Neg logit contributions
 journals-0.002
ifying-0.001
 journal-0.001
pron-0.001
 Christmas-0.001
Token.
Token ID13
Feature activation-1.84e-03

Pos logit contributions
 journals+0.205
 journal+0.194
pron+0.191
ifying+0.186
ak+0.182
Neg logit contributions
 comm-0.165
ters-0.155
mun-0.153
 ang-0.141
 tactic-0.140
Token Mary
Token ID5335
Feature activation-6.43e-02

Pos logit contributions
 comm+0.003
mun+0.003
ters+0.003
 tactic+0.003
 ang+0.003
Neg logit contributions
 journals-0.003
 journal-0.003
pron-0.003
ifying-0.003
 Christmas-0.003
Token loved
Token ID6151
Feature activation+1.55e-01

Pos logit contributions
mun+0.078
 comm+0.073
 means+0.066
ters+0.064
fl+0.062
Neg logit contributions
ifying-0.141
 journals-0.134
pron-0.130
 journal-0.129
ocket-0.120
Token her
Token ID607
Feature activation+4.05e-02

Pos logit contributions
ifying+0.262
 journals+0.255
ak+0.248
 journal+0.245
ouse+0.242
Neg logit contributions
 comm-0.262
mun-0.232
 ang-0.229
 persu-0.221
 persuasion-0.216
Token journal
Token ID3989
Feature activation+4.73e-01

Pos logit contributions
ifying+0.031
ak+0.024
 worn+0.019
 journals+0.018
ppa+0.018
Neg logit contributions
 comm-0.108
 tactic-0.104
 noise-0.096
 ang-0.095
mun-0.095
Token so
Token ID523
Feature activation-8.41e-02

Pos logit contributions
 journals+0.960
ifying+0.898
 journal+0.845
pron+0.834
orse+0.830
Neg logit contributions
 comm-0.620
mun-0.543
ters-0.538
 tactic-0.537
 ang-0.521
Token much
Token ID881
Feature activation+2.29e-01

Pos logit contributions
mun+0.149
 comm+0.148
 persu+0.123
 louder+0.122
 ang+0.122
Neg logit contributions
 journals-0.134
 journal-0.129
pron-0.126
ocket-0.125
ak-0.121
Token!
Token ID0
Feature activation+1.35e-02

Pos logit contributions
 journals+0.364
 journal+0.350
 Christmas+0.313
pron+0.307
 notebook+0.285
Neg logit contributions
 comm-0.446
mun-0.435
 ang-0.414
ters-0.408
 persu-0.390
Token
Token ID198
Feature activation-2.87e-02

Pos logit contributions
 journals+0.025
 journal+0.024
pron+0.022
ifying+0.021
 Christmas+0.021
Neg logit contributions
 comm-0.022
mun-0.021
ters-0.020
 tactic-0.020
 ang-0.019
Token
Token ID198
Feature activation-2.88e-02

Pos logit contributions
 comm+0.053
mun+0.050
ters+0.048
 ang+0.048
 persu+0.046
Neg logit contributions
 journals-0.048
 journal-0.046
pron-0.041
 Christmas-0.041
ifying-0.039
Token Bub
Token ID19645
Feature activation-2.02e-01

Pos logit contributions
 journal+0.097
 journals+0.096
 Christmas+0.089
on+0.088
 notebook+0.087
Neg logit contributions
mun-0.160
 comm-0.159
 ang-0.155
 inconvenience-0.144
 tactic-0.142
Tokenbles
Token ID7689
Feature activation-2.12e-02

Pos logit contributions
 comm+0.471
mun+0.445
 persuasion+0.435
 persu+0.426
 ang+0.410
Neg logit contributions
ouse-0.230
 journal-0.223
ak-0.219
 journals-0.218
ocket-0.196
Token looked
Token ID3114
Feature activation+7.98e-02

Pos logit contributions
 comm+0.039
mun+0.037
ters+0.035
 ang+0.035
 persu+0.034
Neg logit contributions
 journals-0.035
 journal-0.034
 Christmas-0.030
pron-0.030
ak-0.028
Token at
Token ID379
Feature activation+1.40e-02

Pos logit contributions
 journals+0.152
 journal+0.143
pron+0.136
 Christmas+0.132
ifying+0.126
Neg logit contributions
 comm-0.133
mun-0.123
 ang-0.120
ters-0.115
 persu-0.111
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 journals+0.019
 journal+0.018
 Christmas+0.017
pron+0.015
 notebook+0.014
Neg logit contributions
 comm-0.030
mun-0.029
ters-0.027
 ang-0.027
 persu-0.026
Token journal
Token ID3989
Feature activation+4.73e-01

Pos logit contributions
 Christmas+0.009
ifying+0.009
pron+0.008
bie+0.008
orse+0.008
Neg logit contributions
 comm-0.026
 noise-0.023
 tactic-0.022
 ang-0.022
ters-0.022
Token and
Token ID290
Feature activation+6.41e-02

Pos logit contributions
 journals+0.760
 journal+0.708
pron+0.646
 Christmas+0.632
ifying+0.601
Neg logit contributions
 comm-0.915
mun-0.865
ters-0.843
 ang-0.824
 tactic-0.795
Token smiled
Token ID13541
Feature activation+6.57e-02

Pos logit contributions
 journals+0.109
 journal+0.105
 Christmas+0.093
pron+0.092
 notebook+0.088
Neg logit contributions
 comm-0.115
mun-0.110
ters-0.105
 ang-0.102
 persu-0.101
Token.
Token ID13
Feature activation+6.10e-03

Pos logit contributions
 journals+0.117
 journal+0.110
pron+0.102
 Christmas+0.098
ifying+0.095
Neg logit contributions
 comm-0.116
mun-0.108
ters-0.106
 ang-0.106
 persu-0.099
Token She
Token ID1375
Feature activation+2.29e-02

Pos logit contributions
 journals+0.007
 journal+0.006
 Christmas+0.006
 Daddy+0.005
 quartz+0.005
Neg logit contributions
 comm-0.014
mun-0.013
 ang-0.013
 tactic-0.012
 persu-0.012
Token said
Token ID531
Feature activation-1.64e-01

Pos logit contributions
 journals+0.047
 journal+0.044
ifying+0.043
pron+0.042
 Christmas+0.041
Neg logit contributions
 comm-0.035
mun-0.032
 ang-0.030
ters-0.029
 warns-0.028
Token the
Token ID262
Feature activation+5.94e-03

Pos logit contributions
 comm+0.187
mun+0.177
ters+0.169
 ang+0.167
 tactic+0.160
Neg logit contributions
 journals-0.176
 journal-0.167
pron-0.154
 Christmas-0.149
ifying-0.146
Token new
Token ID649
Feature activation-1.20e-01

Pos logit contributions
 journals+0.011
 journal+0.010
pron+0.010
 Christmas+0.009
ifying+0.009
Neg logit contributions
 comm-0.010
mun-0.010
ters-0.009
 ang-0.009
 tactic-0.009
Token friend
Token ID1545
Feature activation-4.25e-02

Pos logit contributions
 comm+0.268
mun+0.249
 tactic+0.244
ters+0.233
 ang+0.230
Neg logit contributions
ifying-0.154
 journals-0.135
pron-0.134
 Christmas-0.125
ocket-0.120
Token in
Token ID287
Feature activation+2.56e-02

Pos logit contributions
 comm+0.078
mun+0.075
ters+0.072
 ang+0.070
 persu+0.068
Neg logit contributions
 journals-0.072
 journal-0.068
pron-0.061
 Christmas-0.060
ifying-0.057
Token her
Token ID607
Feature activation+4.23e-02

Pos logit contributions
 journals+0.049
 journal+0.045
ouse+0.045
pron+0.045
iflower+0.043
Neg logit contributions
 comm-0.040
mun-0.036
 ang-0.035
ters-0.035
 persuasion-0.034
Token journal
Token ID3989
Feature activation+4.73e-01

Pos logit contributions
 journals+0.066
ifying+0.063
pron+0.061
abby+0.060
ocket+0.060
Neg logit contributions
 comm-0.070
mun-0.065
 ang-0.059
 persuasion-0.058
 tactic-0.057
Token.
Token ID13
Feature activation-7.02e-05

Pos logit contributions
 journals+0.879
 journal+0.806
pron+0.733
 Christmas+0.722
ifying+0.721
Neg logit contributions
 comm-0.785
mun-0.739
ters-0.729
 ang-0.684
 persuasion-0.662
Token
Token ID198
Feature activation-2.72e-02

Pos logit contributions
 comm+0.000
mun+0.000
ters+0.000
 ang+0.000
 persu+0.000
Neg logit contributions
 journals-0.000
 journal-0.000
pron-0.000
 Christmas-0.000
ifying-0.000
Token
Token ID198
Feature activation-2.71e-02

Pos logit contributions
 comm+0.050
mun+0.047
ters+0.045
 ang+0.045
 persu+0.043
Neg logit contributions
 journals-0.047
 journal-0.044
pron-0.040
 Christmas-0.040
ifying-0.037
TokenEvery
Token ID6109
Feature activation+2.18e-02

Pos logit contributions
 comm+0.053
mun+0.050
ters+0.048
 ang+0.048
 persu+0.046
Neg logit contributions
 journals-0.043
 journal-0.041
pron-0.037
 Christmas-0.036
ifying-0.034
Token day
Token ID1110
Feature activation+3.73e-02

Pos logit contributions
 journals+0.039
 journal+0.036
pron+0.033
ifying+0.032
ocket+0.032
Neg logit contributions
 comm-0.038
mun-0.036
 ang-0.034
 persu-0.034
 tactic-0.033
Token sell
Token ID3677
Feature activation-3.08e-02

Pos logit contributions
 journals+0.028
 journal+0.026
pron+0.024
ifying+0.024
 Christmas+0.024
Neg logit contributions
 comm-0.016
mun-0.014
 ang-0.014
ters-0.013
 tactic-0.013
Token.
Token ID13
Feature activation-8.98e-04

Pos logit contributions
 comm+0.053
ters+0.050
mun+0.050
 ang+0.047
 tactic+0.045
Neg logit contributions
 journals-0.055
 journal-0.052
pron-0.049
 Christmas-0.046
ifying-0.045
Token I
Token ID314
Feature activation+2.05e-01

Pos logit contributions
 comm+0.002
mun+0.001
ters+0.001
 ang+0.001
 persu+0.001
Neg logit contributions
 journals-0.002
 journal-0.002
pron-0.001
 Christmas-0.001
iflower-0.001
Token have
Token ID423
Feature activation+1.52e-01

Pos logit contributions
 journals+0.369
 journal+0.345
pron+0.316
 Christmas+0.303
iflower+0.297
Neg logit contributions
 comm-0.357
mun-0.348
ters-0.324
 ang-0.321
 tactic-0.313
Token a
Token ID257
Feature activation-9.79e-03

Pos logit contributions
 journals+0.317
 journal+0.313
pron+0.299
ouse+0.296
iflower+0.281
Neg logit contributions
 comm-0.189
ters-0.178
mun-0.166
 ang-0.165
 warns-0.152
Token journal
Token ID3989
Feature activation+4.73e-01

Pos logit contributions
 comm+0.018
mun+0.017
 ang+0.016
 tactic+0.016
ters+0.016
Neg logit contributions
 journals-0.016
 journal-0.015
 Christmas-0.013
pron-0.013
ifying-0.013
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
 journals+1.003
 journal+0.898
pron+0.831
iflower+0.817
ifying+0.816
Neg logit contributions
 comm-0.635
ters-0.624
mun-0.556
 ang-0.530
 tactic-0.515
Token and
Token ID290
Feature activation+5.50e-02

Pos logit contributions
 comm+0.037
ters+0.034
mun+0.033
 ang+0.033
 persu+0.031
Neg logit contributions
 journals-0.052
 journal-0.048
pron-0.044
iflower-0.044
ouse-0.043
Token it
Token ID340
Feature activation+1.55e-01

Pos logit contributions
 journals+0.092
 journal+0.086
pron+0.084
ouse+0.078
ak+0.077
Neg logit contributions
 comm-0.097
ters-0.092
mun-0.091
 ang-0.084
 warns-0.083
Tokenâ
Token ID22940
Feature activation-1.59e-01

Pos logit contributions
 journals+0.275
 journal+0.259
pron+0.242
 Christmas+0.232
ifying+0.227
Neg logit contributions
 comm-0.273
mun-0.261
ters-0.252
 ang-0.241
 warns-0.231
Token
Token ID26391
Feature activation-1.52e-01

Pos logit contributions
 comm+0.235
mun+0.222
ters+0.205
 ang+0.204
 tactic+0.192
Neg logit contributions
 journals-0.329
 journal-0.316
pron-0.294
 Christmas-0.290
ak-0.276
Token day
Token ID1110
Feature activation+3.57e-02

Pos logit contributions
 journals+0.075
 journal+0.069
pron+0.064
ifying+0.062
ocket+0.062
Neg logit contributions
 comm-0.065
mun-0.061
 tactic-0.057
 ang-0.057
 persu-0.057
Token he
Token ID339
Feature activation+5.77e-02

Pos logit contributions
 journals+0.067
 journal+0.063
pron+0.058
ocket+0.055
 Christmas+0.055
Neg logit contributions
 comm-0.059
mun-0.057
ters-0.054
 ang-0.054
 persu-0.052
Token discovered
Token ID5071
Feature activation+1.65e-01

Pos logit contributions
 journals+0.108
 journal+0.099
pron+0.098
ifying+0.095
iflower+0.092
Neg logit contributions
 comm-0.097
mun-0.091
 ang-0.082
 warns-0.080
ters-0.079
Token a
Token ID257
Feature activation-1.39e-02

Pos logit contributions
 journals+0.246
 journal+0.242
 Christmas+0.208
 notebook+0.196
pron+0.194
Neg logit contributions
 comm-0.332
mun-0.323
ters-0.303
 ang-0.300
 persuasion-0.299
Token dry
Token ID5894
Feature activation+1.26e-01

Pos logit contributions
 comm+0.028
mun+0.027
ters+0.026
 ang+0.026
 persu+0.025
Neg logit contributions
 journals-0.021
 journal-0.020
pron-0.018
 Christmas-0.017
ifying-0.016
Token journal
Token ID3989
Feature activation+4.73e-01

Pos logit contributions
 journals+0.228
 journal+0.196
ifying+0.181
ocket+0.178
pron+0.172
Neg logit contributions
 comm-0.224
mun-0.206
ters-0.199
 tactic-0.193
 ang-0.190
Token.
Token ID13
Feature activation-3.48e-03

Pos logit contributions
 journals+0.829
 journal+0.774
pron+0.707
 Christmas+0.695
ifying+0.668
Neg logit contributions
 comm-0.845
mun-0.797
ters-0.768
 ang-0.752
 persu-0.724
Token He
Token ID679
Feature activation+7.59e-02

Pos logit contributions
 comm+0.006
 ang+0.006
 tactic+0.005
 persuasion+0.005
 persu+0.005
Neg logit contributions
 journals-0.006
 journal-0.006
iflower-0.005
ocket-0.005
pron-0.005
Token wanted
Token ID2227
Feature activation+7.29e-02

Pos logit contributions
 journals+0.162
ifying+0.159
 journal+0.150
pron+0.143
 Christmas+0.140
Neg logit contributions
 comm-0.106
mun-0.087
 tactic-0.081
 ang-0.081
 persu-0.079
Token to
Token ID284
Feature activation+1.20e-02

Pos logit contributions
ifying+0.160
pron+0.156
ouse+0.155
 journals+0.153
aire+0.144
Neg logit contributions
 comm-0.083
 ang-0.071
 tactic-0.070
 louder-0.068
 persu-0.065
Token know
Token ID760
Feature activation+1.60e-02

Pos logit contributions
 journals+0.025
ifying+0.025
pron+0.023
iflower+0.023
 moons+0.022
Neg logit contributions
 tactic-0.015
 comm-0.015
 persuade-0.014
 convince-0.014
 prove-0.013
Token drew
Token ID9859
Feature activation+2.44e-01

Pos logit contributions
 journals+0.151
 journal+0.143
pron+0.127
 Christmas+0.124
ifying+0.116
Neg logit contributions
 comm-0.213
mun-0.204
ters-0.195
 ang-0.193
 tactic-0.186
Token the
Token ID262
Feature activation+5.67e-03

Pos logit contributions
 journals+0.409
 journal+0.390
pron+0.348
 Christmas+0.345
ifying+0.325
Neg logit contributions
 comm-0.457
mun-0.437
ters-0.413
 ang-0.412
 persu-0.395
Token bug
Token ID5434
Feature activation-1.24e-02

Pos logit contributions
 journals+0.009
ifying+0.008
 Christmas+0.008
pron+0.007
ouse+0.007
Neg logit contributions
 comm-0.011
ters-0.010
mun-0.010
 ang-0.009
 tactic-0.009
Token in
Token ID287
Feature activation+2.32e-02

Pos logit contributions
 comm+0.022
mun+0.021
ters+0.020
 ang+0.019
 tactic+0.019
Neg logit contributions
 journals-0.022
 journal-0.021
pron-0.019
 Christmas-0.019
ifying-0.018
Token her
Token ID607
Feature activation+4.19e-02

Pos logit contributions
 journals+0.046
ouse+0.044
pron+0.042
 journal+0.042
iflower+0.040
Neg logit contributions
 comm-0.035
ters-0.031
 tactic-0.031
 ang-0.030
 persu-0.029
Token journal
Token ID3989
Feature activation+4.73e-01

Pos logit contributions
abby+0.060
ifying+0.054
Daddy+0.054
eddy+0.053
ppa+0.052
Neg logit contributions
 comm-0.065
 tactic-0.057
 advice-0.056
 noise-0.054
 best-0.054
Token and
Token ID290
Feature activation+5.71e-02

Pos logit contributions
 journals+0.887
 journal+0.795
 Christmas+0.724
pron+0.723
ifying+0.716
Neg logit contributions
 comm-0.794
ters-0.735
mun-0.716
 ang-0.683
 tactic-0.662
Token wrote
Token ID2630
Feature activation+2.64e-01

Pos logit contributions
 journals+0.094
 journal+0.088
pron+0.080
 Christmas+0.078
ifying+0.076
Neg logit contributions
 comm-0.110
mun-0.104
ters-0.100
 ang-0.099
 persu-0.094
Token about
Token ID546
Feature activation-1.01e-01

Pos logit contributions
 journals+0.433
 journal+0.417
 Christmas+0.364
pron+0.364
ocket+0.340
Neg logit contributions
 comm-0.509
mun-0.501
 ang-0.455
ters-0.454
 persu-0.444
Token how
Token ID703
Feature activation-7.84e-02

Pos logit contributions
 comm+0.176
mun+0.164
 ang+0.157
ters+0.157
 tactic+0.150
Neg logit contributions
 journals-0.179
 journal-0.169
pron-0.163
ouse-0.154
ifying-0.153
Token she
Token ID673
Feature activation+1.89e-03

Pos logit contributions
 comm+0.127
mun+0.122
 ang+0.117
ters+0.113
 louder+0.111
Neg logit contributions
 journals-0.136
 journal-0.129
pron-0.127
ifying-0.121
ocket-0.119
Token want
Token ID765
Feature activation+5.29e-03

Pos logit contributions
 journals+0.086
pron+0.083
iflower+0.082
 journal+0.081
ifying+0.080
Neg logit contributions
 comm-0.047
 ang-0.036
 shout-0.036
ters-0.035
 noise-0.035
Token anyone
Token ID2687
Feature activation+6.82e-02

Pos logit contributions
 journals+0.010
 journal+0.009
pron+0.009
ouse+0.009
ifying+0.009
Neg logit contributions
 comm-0.009
mun-0.008
ters-0.008
 ang-0.008
 persu-0.007
Token to
Token ID284
Feature activation+1.57e-02

Pos logit contributions
 journals+0.121
 journal+0.113
pron+0.106
 Christmas+0.102
ifying+0.100
Neg logit contributions
 comm-0.121
mun-0.115
ters-0.108
 ang-0.107
 tactic-0.104
Token read
Token ID1100
Feature activation+1.45e-01

Pos logit contributions
 journals+0.032
ifying+0.030
pron+0.029
iflower+0.029
 journal+0.028
Neg logit contributions
 comm-0.021
 tactic-0.020
 convince-0.019
 persuasion-0.019
 persuade-0.019
Token her
Token ID607
Feature activation+4.98e-02

Pos logit contributions
 journals+0.261
 journal+0.248
pron+0.235
ouse+0.222
ifying+0.222
Neg logit contributions
 comm-0.235
ters-0.223
mun-0.221
 ang-0.212
 persuasion-0.203
Token journal
Token ID3989
Feature activation+4.72e-01

Pos logit contributions
ifying+0.073
 journals+0.073
pron+0.072
 Christmas+0.067
ak+0.066
Neg logit contributions
 comm-0.089
 tactic-0.083
 ang-0.081
mun-0.081
ters-0.077
Token,
Token ID11
Feature activation-2.11e-02

Pos logit contributions
 journals+0.841
 journal+0.794
pron+0.728
 Christmas+0.711
ifying+0.692
Neg logit contributions
 comm-0.823
mun-0.776
ters-0.750
 ang-0.738
 tactic-0.707
Token because
Token ID780
Feature activation+7.41e-02

Pos logit contributions
 comm+0.031
mun+0.029
ters+0.027
 ang+0.027
 persu+0.025
Neg logit contributions
 journals-0.044
 journal-0.042
pron-0.039
 Christmas-0.039
ifying-0.037
Token it
Token ID340
Feature activation+1.61e-01

Pos logit contributions
 journals+0.145
ak+0.140
pron+0.140
ouse+0.139
ifying+0.134
Neg logit contributions
 comm-0.092
ters-0.086
mun-0.085
 tactic-0.080
 warns-0.079
Token was
Token ID373
Feature activation-9.49e-02

Pos logit contributions
 journals+0.317
 journal+0.300
pron+0.277
 Christmas+0.273
ifying+0.269
Neg logit contributions
 comm-0.256
mun-0.240
ters-0.225
 ang-0.220
 tactic-0.212
Token hers
Token ID25144
Feature activation+9.44e-02

Pos logit contributions
 comm+0.139
 ang+0.127
 persu+0.118
 louder+0.118
ters+0.116
Neg logit contributions
 journals-0.189
 journal-0.175
ouse-0.174
pron-0.162
ifying-0.161
Token her
Token ID607
Feature activation+4.42e-02

Pos logit contributions
 comm+0.050
 ang+0.045
ters+0.044
mun+0.043
 tactic+0.042
Neg logit contributions
 journals-0.058
ifying-0.057
ouse-0.056
pron-0.055
 journal-0.054
Token things
Token ID1243
Feature activation-1.24e-01

Pos logit contributions
ifying+0.056
pron+0.046
abby+0.046
orse+0.046
ink+0.046
Neg logit contributions
 comm-0.093
 tactic-0.084
 ang-0.082
 warns-0.081
 advice-0.081
Token,
Token ID11
Feature activation-1.78e-02

Pos logit contributions
 comm+0.200
ters+0.190
mun+0.187
 ang+0.180
 persu+0.170
Neg logit contributions
 journals-0.236
 journal-0.218
pron-0.207
 Christmas-0.198
ifying-0.196
Token especially
Token ID2592
Feature activation+1.00e-01

Pos logit contributions
 comm+0.025
mun+0.023
ters+0.022
 ang+0.022
 tactic+0.021
Neg logit contributions
 journals-0.037
 journal-0.036
pron-0.034
ifying-0.033
 Christmas-0.033
Token her
Token ID607
Feature activation+4.42e-02

Pos logit contributions
 journals+0.202
 journal+0.194
pron+0.178
 Christmas+0.176
ifying+0.169
Neg logit contributions
 comm-0.153
mun-0.144
ters-0.136
 ang-0.135
 persu-0.127
Token journal
Token ID3989
Feature activation+4.72e-01

Pos logit contributions
ifying+0.054
ak+0.051
pron+0.051
abby+0.048
 journals+0.046
Neg logit contributions
 comm-0.093
 ang-0.080
 tactic-0.078
 noise-0.076
 warns-0.076
Token.
Token ID13
Feature activation+3.62e-03

Pos logit contributions
 journals+0.875
 journal+0.800
pron+0.760
ifying+0.741
 Christmas+0.732
Neg logit contributions
 comm-0.787
mun-0.736
ters-0.723
 ang-0.686
 tactic-0.661
Token<|endoftext|>
Token ID50256
Feature activation+4.41e-02

Pos logit contributions
 journals+0.007
 journal+0.007
pron+0.007
ifying+0.007
ocket+0.006
Neg logit contributions
 comm-0.005
mun-0.005
ters-0.004
 ang-0.004
 persuasion-0.004
TokenOnce
Token ID7454
Feature activation+1.15e-01

Pos logit contributions
 journals+0.080
 journal+0.075
pron+0.069
 Christmas+0.067
ifying+0.065
Neg logit contributions
 comm-0.077
mun-0.073
ters-0.069
 ang-0.068
 tactic-0.065
Token upon
Token ID2402
Feature activation+7.44e-02

Pos logit contributions
 journals+0.227
 journal+0.200
ifying+0.198
pron+0.194
ocket+0.187
Neg logit contributions
mun-0.169
 comm-0.168
 ang-0.158
 tactic-0.151
 persu-0.150
Token a
Token ID257
Feature activation-1.17e-02

Pos logit contributions
 journals+0.159
 journal+0.146
pron+0.138
ifying+0.137
ocket+0.135
Neg logit contributions
 comm-0.105
mun-0.096
 ang-0.087
ters-0.086
 tactic-0.084
Token Lily
Token ID20037
Feature activation-2.20e-02

Pos logit contributions
 comm+0.048
mun+0.045
ters+0.043
 ang+0.043
 tactic+0.041
Neg logit contributions
 journals-0.041
 journal-0.039
pron-0.035
 Christmas-0.035
ifying-0.033
Token went
Token ID1816
Feature activation+6.73e-02

Pos logit contributions
 comm+0.034
 ang+0.028
mun+0.028
 warns+0.026
 persu+0.024
Neg logit contributions
 journals-0.049
ifying-0.046
 journal-0.044
ocket-0.041
 Christmas-0.041
Token back
Token ID736
Feature activation+1.06e-01

Pos logit contributions
 journals+0.109
 journal+0.102
pron+0.092
 Christmas+0.090
ifying+0.088
Neg logit contributions
 comm-0.130
mun-0.123
 ang-0.119
ters-0.117
 persu-0.113
Token to
Token ID284
Feature activation+1.53e-02

Pos logit contributions
 journals+0.188
 journal+0.175
pron+0.163
ifying+0.155
 Christmas+0.154
Neg logit contributions
 comm-0.190
mun-0.177
 ang-0.170
ters-0.167
 persu-0.164
Token her
Token ID607
Feature activation+4.42e-02

Pos logit contributions
 journals+0.025
 journal+0.023
pron+0.021
ifying+0.020
 Christmas+0.020
Neg logit contributions
 comm-0.030
mun-0.028
 ang-0.027
ters-0.026
 persu-0.026
Token journal
Token ID3989
Feature activation+4.72e-01

Pos logit contributions
 journals+0.061
ifying+0.058
ak+0.057
pron+0.052
ocket+0.051
Neg logit contributions
 comm-0.094
mun-0.081
 ang-0.081
 tactic-0.078
 persu-0.077
Token and
Token ID290
Feature activation+5.75e-02

Pos logit contributions
 journals+0.835
 journal+0.751
ifying+0.709
 Christmas+0.685
pron+0.684
Neg logit contributions
 comm-0.847
mun-0.768
ters-0.741
 ang-0.734
 tactic-0.702
Token drew
Token ID9859
Feature activation+2.45e-01

Pos logit contributions
 journals+0.105
ifying+0.102
pron+0.094
ak+0.094
 journal+0.094
Neg logit contributions
 comm-0.096
 ang-0.085
mun-0.085
 warns-0.076
 persu-0.075
Token a
Token ID257
Feature activation-8.21e-03

Pos logit contributions
 journals+0.442
 journal+0.412
pron+0.395
 Christmas+0.383
ifying+0.380
Neg logit contributions
 comm-0.413
ters-0.375
mun-0.372
 ang-0.367
 persu-0.341
Token picture
Token ID4286
Feature activation+2.02e-01

Pos logit contributions
 comm+0.017
 ang+0.015
mun+0.015
 tactic+0.014
 louder+0.013
Neg logit contributions
 journals-0.012
 journal-0.010
 Christmas-0.010
ifying-0.010
ak-0.010
Token of
Token ID286
Feature activation+8.66e-03

Pos logit contributions
 journals+0.326
 journal+0.315
pron+0.283
 Christmas+0.278
ifying+0.258
Neg logit contributions
 comm-0.390
mun-0.375
 ang-0.357
ters-0.350
 tactic-0.341
Tokenbles
Token ID7689
Feature activation-2.24e-02

Pos logit contributions
 journals+0.001
 journal+0.001
 Christmas+0.001
pron+0.001
ouse+0.001
Neg logit contributions
 comm-0.001
mun-0.001
 ang-0.001
 tactic-0.001
 persuasion-0.001
Token,
Token ID11
Feature activation-1.68e-02

Pos logit contributions
 comm+0.034
mun+0.033
ters+0.032
 ang+0.030
 warns+0.029
Neg logit contributions
 journals-0.045
 journal-0.042
pron-0.040
 Christmas-0.038
ifying-0.037
Token look
Token ID804
Feature activation-1.36e-02

Pos logit contributions
 comm+0.035
mun+0.035
 ang+0.032
ters+0.032
 tactic+0.031
Neg logit contributions
 journal-0.024
 journals-0.023
 Christmas-0.021
ocket-0.019
 notebook-0.019
Token at
Token ID379
Feature activation+1.22e-02

Pos logit contributions
 comm+0.021
mun+0.019
 ang+0.019
ters+0.018
 persu+0.017
Neg logit contributions
 journals-0.028
 journal-0.026
pron-0.025
 Christmas-0.024
ouse-0.023
Token this
Token ID428
Feature activation+4.64e-02

Pos logit contributions
 journals+0.018
 journal+0.017
pron+0.016
 Christmas+0.015
ifying+0.014
Neg logit contributions
 comm-0.025
mun-0.024
ters-0.023
 ang-0.023
 persu-0.022
Token journal
Token ID3989
Feature activation+4.72e-01

Pos logit contributions
 journals+0.055
ifying+0.055
 Christmas+0.053
pron+0.052
ppa+0.050
Neg logit contributions
 comm-0.095
 tactic-0.084
 noise-0.083
ters-0.083
 warns-0.080
Token I
Token ID314
Feature activation+2.03e-01

Pos logit contributions
 journals+0.904
 journal+0.802
pron+0.769
 Christmas+0.731
ifying+0.716
Neg logit contributions
 comm-0.769
ters-0.735
mun-0.712
 ang-0.654
 tactic-0.639
Token found
Token ID1043
Feature activation+1.98e-01

Pos logit contributions
 journals+0.320
 journal+0.287
pron+0.272
 Christmas+0.252
iflower+0.249
Neg logit contributions
 comm-0.399
mun-0.389
 ang-0.363
 warns-0.362
ters-0.360
Token!"
Token ID2474
Feature activation+8.24e-02

Pos logit contributions
 journals+0.381
pron+0.362
 journal+0.340
ouse+0.324
ifying+0.313
Neg logit contributions
 comm-0.307
ters-0.302
 warns-0.262
 ang-0.259
mun-0.258
Token Bub
Token ID19645
Feature activation-2.02e-01

Pos logit contributions
 journal+0.097
 journals+0.096
 Christmas+0.089
on+0.088
 notebook+0.087
Neg logit contributions
mun-0.160
 comm-0.159
 ang-0.155
 inconvenience-0.144
 tactic-0.142
Tokenbles
Token ID7689
Feature activation-2.12e-02

Pos logit contributions
 comm+0.471
mun+0.445
 persuasion+0.435
 persu+0.426
 ang+0.410
Neg logit contributions
ouse-0.230
 journal-0.223
ak-0.219
 journals-0.218
ocket-0.196
Token her
Token ID607
Feature activation+4.09e-02

Pos logit contributions
 journals+0.310
 journal+0.295
pron+0.268
 Christmas+0.263
ifying+0.251
Neg logit contributions
 comm-0.328
mun-0.312
ters-0.299
 ang-0.295
 persu-0.282
Token own
Token ID898
Feature activation+1.81e-01

Pos logit contributions
 journals+0.063
ifying+0.062
ak+0.060
pron+0.059
 journal+0.055
Neg logit contributions
 comm-0.073
 tactic-0.065
mun-0.065
 ang-0.064
ters-0.060
Token pictures
Token ID5986
Feature activation+5.64e-02

Pos logit contributions
ifying+0.302
ak+0.281
pron+0.279
orse+0.267
ppa+0.258
Neg logit contributions
 comm-0.307
 tactic-0.276
 noise-0.260
mun-0.252
 ang-0.248
Token in
Token ID287
Feature activation+2.72e-02

Pos logit contributions
 journals+0.112
 journal+0.102
pron+0.095
ifying+0.094
iflower+0.091
Neg logit contributions
 comm-0.084
ters-0.082
mun-0.079
 persuasion-0.073
 tactic-0.073
Token the
Token ID262
Feature activation+6.74e-03

Pos logit contributions
ouse+0.059
 journals+0.054
pron+0.053
orse+0.052
ifying+0.052
Neg logit contributions
 comm-0.032
ters-0.029
 ang-0.029
 tactic-0.028
mun-0.027
Token journal
Token ID3989
Feature activation+4.72e-01

Pos logit contributions
orse+0.008
ouse+0.006
ifying+0.006
ppa+0.006
bie+0.006
Neg logit contributions
 comm-0.015
 noise-0.013
 ang-0.013
 funn-0.013
mun-0.012
Token and
Token ID290
Feature activation+6.03e-02

Pos logit contributions
 journals+0.858
 journal+0.798
pron+0.728
 Christmas+0.716
ifying+0.703
Neg logit contributions
 comm-0.812
mun-0.764
ters-0.745
 ang-0.716
 tactic-0.691
Token make
Token ID787
Feature activation-8.88e-02

Pos logit contributions
 journals+0.108
 journal+0.102
pron+0.098
ifying+0.096
ak+0.094
Neg logit contributions
 comm-0.101
mun-0.093
 ang-0.090
ters-0.087
 tactic-0.086
Token it
Token ID340
Feature activation+1.52e-01

Pos logit contributions
 comm+0.175
mun+0.172
ters+0.162
 ang+0.159
 persu+0.156
Neg logit contributions
 journals-0.141
 journal-0.133
 Christmas-0.116
pron-0.116
 notebook-0.105
Token even
Token ID772
Feature activation+6.97e-02

Pos logit contributions
 journals+0.264
 journal+0.249
pron+0.233
ifying+0.224
 Christmas+0.222
Neg logit contributions
 comm-0.267
mun-0.246
 ang-0.245
ters-0.241
 tactic-0.231
Token mess
Token ID2085
Feature activation-2.38e-02

Pos logit contributions
 journals+0.139
 journal+0.132
pron+0.119
 Christmas+0.117
 worn+0.111
Neg logit contributions
 comm-0.107
mun-0.107
ters-0.097
arling-0.092
 persu-0.091
Token special
Token ID2041
Feature activation+9.31e-02

Pos logit contributions
 comm+0.030
mun+0.028
 ang+0.027
 tactic+0.026
 persu+0.026
Neg logit contributions
 journals-0.027
 journal-0.024
ifying-0.022
ocket-0.022
pron-0.022
Token journal
Token ID3989
Feature activation+4.71e-01

Pos logit contributions
 journals+0.151
 journal+0.140
pron+0.126
 Christmas+0.124
ifying+0.122
Neg logit contributions
 comm-0.179
mun-0.169
 ang-0.160
ters-0.160
 tactic-0.155
Token.
Token ID13
Feature activation-4.85e-03

Pos logit contributions
 journals+0.901
 journal+0.817
pron+0.758
ifying+0.742
 Christmas+0.739
Neg logit contributions
 comm-0.751
mun-0.691
ters-0.684
 ang-0.639
 persu-0.619
Token In
Token ID554
Feature activation+1.11e-01

Pos logit contributions
 comm+0.008
mun+0.008
 tactic+0.007
 persuasion+0.007
 ang+0.007
Neg logit contributions
 journals-0.009
 journal-0.009
pron-0.008
ifying-0.008
 Christmas-0.008
Token her
Token ID607
Feature activation+3.70e-02

Pos logit contributions
ouse+0.243
 journals+0.240
ak+0.234
ocado+0.228
ouses+0.226
Neg logit contributions
 comm-0.118
 tactic-0.105
 persu-0.104
 persuasion-0.103
 means-0.103
Token journal
Token ID3989
Feature activation+4.71e-01

Pos logit contributions
ak+0.040
ifying+0.040
abby+0.038
pron+0.037
orse+0.035
Neg logit contributions
 comm-0.082
 ang-0.072
 tactic-0.072
mun-0.071
 persuasion-0.069
Token Mary
Token ID5335
Feature activation-6.65e-02

Pos logit contributions
 journals+0.833
 journal+0.789
pron+0.716
 Christmas+0.706
ifying+0.676
Neg logit contributions
 comm-0.837
mun-0.796
ters-0.757
 ang-0.749
 persu-0.716
Token kept
Token ID4030
Feature activation+5.14e-03

Pos logit contributions
mun+0.068
 means+0.053
 warns+0.052
 comm+0.048
 shouts+0.046
Neg logit contributions
 journals-0.149
pron-0.149
ifying-0.143
abby-0.141
hire-0.140
Token pictures
Token ID5986
Feature activation+6.04e-02

Pos logit contributions
ifying+0.009
ouse+0.009
 journals+0.009
ak+0.009
abby+0.009
Neg logit contributions
 comm-0.008
 ang-0.007
 persu-0.007
 louder-0.007
 calling-0.007
Token,
Token ID11
Feature activation-2.77e-02

Pos logit contributions
 journals+0.126
 journal+0.117
pron+0.112
ifying+0.109
 Christmas+0.105
Neg logit contributions
ters-0.082
 comm-0.077
mun-0.073
 persuasion-0.067
 tactic-0.066
Token drawings
Token ID23388
Feature activation+1.91e-01

Pos logit contributions
 comm+0.045
mun+0.044
ters+0.041
 ang+0.039
 tactic+0.038
Neg logit contributions
 journals-0.051
 journal-0.049
pron-0.045
ifying-0.044
ouse-0.044
Token and
Token ID290
Feature activation+2.92e-02

Pos logit contributions
 journals+0.075
 journal+0.072
 Christmas+0.064
pron+0.064
ifying+0.059
Neg logit contributions
 comm-0.097
mun-0.094
 ang-0.089
ters-0.087
 persu-0.086
Token the
Token ID262
Feature activation-1.94e-02

Pos logit contributions
 journals+0.052
 journal+0.049
pron+0.044
 Christmas+0.043
ocket+0.042
Neg logit contributions
 comm-0.052
mun-0.049
 ang-0.047
ters-0.047
 persu-0.044
Token little
Token ID1310
Feature activation-1.41e-02

Pos logit contributions
 comm+0.034
mun+0.032
ters+0.031
 ang+0.031
 persu+0.029
Neg logit contributions
 journals-0.034
 journal-0.033
pron-0.030
 Christmas-0.030
ifying-0.028
Token boy
Token ID2933
Feature activation-1.88e-01

Pos logit contributions
 comm+0.019
mun+0.018
ters+0.017
 ang+0.017
 tactic+0.016
Neg logit contributions
 journals-0.030
 journal-0.029
pron-0.027
 Christmas-0.027
ifying-0.026
Token was
Token ID373
Feature activation-1.21e-01

Pos logit contributions
 comm+0.295
mun+0.289
ters+0.257
 ang+0.253
 warns+0.243
Neg logit contributions
 journals-0.374
 journal-0.349
pron-0.311
 Christmas-0.292
ifying-0.290
Token comm
Token ID725
Feature activation-5.17e-01

Pos logit contributions
 comm+0.180
mun+0.161
 ang+0.160
ters+0.153
 persu+0.150
Neg logit contributions
 journals-0.262
 journal-0.242
pron-0.209
 fork-0.208
 notebook-0.208
Tokenended
Token ID1631
Feature activation+9.99e-02

Pos logit contributions
 comm+0.763
mun+0.708
 ang+0.671
ters+0.657
 tactic+0.625
Neg logit contributions
 journals-1.062
 journal-1.025
pron-0.943
 Christmas-0.934
ifying-0.905
Token for
Token ID329
Feature activation-1.41e-01

Pos logit contributions
 journals+0.160
 journal+0.153
pron+0.137
 Christmas+0.135
ifying+0.127
Neg logit contributions
 comm-0.192
mun-0.185
ters-0.173
 ang-0.173
 persu-0.168
Token his
Token ID465
Feature activation+9.37e-02

Pos logit contributions
 comm+0.262
mun+0.250
ters+0.239
 ang+0.236
 persu+0.227
Neg logit contributions
 journals-0.236
 journal-0.224
pron-0.202
 Christmas-0.198
ifying-0.189
Token bravery
Token ID37808
Feature activation-1.12e-01

Pos logit contributions
 journals+0.165
 journal+0.156
pron+0.143
 Christmas+0.140
ifying+0.135
Neg logit contributions
 comm-0.167
mun-0.156
ters-0.149
 ang-0.148
 persu-0.142
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
 comm+0.201
mun+0.188
ters+0.183
 ang+0.178
 tactic+0.171
Neg logit contributions
 journals-0.197
 journal-0.186
pron-0.171
 Christmas-0.166
ifying-0.159
Token to
Token ID284
Feature activation-7.41e-04

Pos logit contributions
 comm+0.150
mun+0.144
 ang+0.135
ters+0.135
 tactic+0.130
Neg logit contributions
 journals-0.132
 journal-0.126
 Christmas-0.112
pron-0.112
ifying-0.107
Token see
Token ID766
Feature activation+6.78e-02

Pos logit contributions
 comm+0.001
mun+0.001
ters+0.001
 ang+0.001
 persu+0.001
Neg logit contributions
 journals-0.001
 journal-0.001
 Christmas-0.001
pron-0.001
ifying-0.001
Token what
Token ID644
Feature activation-7.94e-02

Pos logit contributions
 journals+0.128
 journal+0.123
 Christmas+0.111
pron+0.109
ifying+0.105
Neg logit contributions
 comm-0.114
mun-0.108
ters-0.101
 ang-0.101
 persu-0.097
Token all
Token ID477
Feature activation+1.72e-02

Pos logit contributions
 comm+0.125
mun+0.117
ters+0.109
 ang+0.109
 persu+0.104
Neg logit contributions
 journals-0.158
 journal-0.149
pron-0.138
 Christmas-0.135
ifying-0.133
Token the
Token ID262
Feature activation-6.89e-03

Pos logit contributions
 journals+0.027
 journal+0.026
 Christmas+0.023
pron+0.023
ifying+0.021
Neg logit contributions
 comm-0.033
mun-0.032
ters-0.031
 ang-0.030
 persu-0.029
Token comm
Token ID725
Feature activation-5.11e-01

Pos logit contributions
 comm+0.011
mun+0.011
ters+0.010
 ang+0.010
 persu+0.010
Neg logit contributions
 journals-0.014
 journal-0.013
 Christmas-0.011
pron-0.011
 notebook-0.011
Tokenotion
Token ID9650
Feature activation-1.96e-01

Pos logit contributions
 comm+0.551
mun+0.509
ters+0.470
 ang+0.454
 persu+0.421
Neg logit contributions
 journals-1.260
 journal-1.216
pron-1.137
 Christmas-1.124
ifying-1.085
Token was
Token ID373
Feature activation-1.10e-01

Pos logit contributions
 comm+0.350
mun+0.335
 ang+0.318
ters+0.315
 persu+0.304
Neg logit contributions
 journals-0.335
 journal-0.323
pron-0.289
 Christmas-0.288
ifying-0.269
Token about
Token ID546
Feature activation-1.17e-01

Pos logit contributions
mun+0.212
 comm+0.206
ters+0.195
 ang+0.194
 tactic+0.189
Neg logit contributions
 journals-0.164
 journal-0.162
 Christmas-0.146
pron-0.143
ak-0.136
Token.
Token ID13
Feature activation-8.46e-03

Pos logit contributions
 comm+0.201
mun+0.184
ters+0.182
 ang+0.175
 persu+0.170
Neg logit contributions
 journals-0.214
 journal-0.202
pron-0.186
 Christmas-0.178
ocket-0.174
Token Mom
Token ID11254
Feature activation+1.58e-01

Pos logit contributions
 comm+0.015
mun+0.014
ters+0.013
 ang+0.013
 tactic+0.013
Neg logit contributions
 journals-0.015
 journal-0.015
pron-0.014
 Christmas-0.013
iflower-0.013
Token came
Token ID1625
Feature activation-7.12e-02

Pos logit contributions
 journals+0.271
 journal+0.264
pron+0.242
 Christmas+0.230
ak+0.217
Neg logit contributions
 comm-0.408
mun-0.393
ters-0.379
 tactic-0.377
 ang-0.377
Token quickly
Token ID2952
Feature activation+3.70e-02

Pos logit contributions
 comm+0.139
mun+0.135
 ang+0.126
ters+0.124
 tactic+0.121
Neg logit contributions
 journals-0.110
 journal-0.106
 Christmas-0.094
pron-0.091
ak-0.088
Token and
Token ID290
Feature activation+5.86e-02

Pos logit contributions
 journals+0.042
 journal+0.041
 Christmas+0.036
pron+0.034
ak+0.032
Neg logit contributions
mun-0.084
 comm-0.083
ters-0.080
 ang-0.077
 tactic-0.077
Token saw
Token ID2497
Feature activation+4.96e-02

Pos logit contributions
 journals+0.077
 journal+0.073
pron+0.065
 Christmas+0.063
 fork+0.057
Neg logit contributions
 comm-0.126
mun-0.121
ters-0.120
 tactic-0.116
 ang-0.113
Token the
Token ID262
Feature activation+5.34e-03

Pos logit contributions
 journals+0.063
 journal+0.059
 Christmas+0.058
 Daddy+0.053
 fork+0.050
Neg logit contributions
 comm-0.110
mun-0.104
 tactic-0.100
 persu-0.098
 persuasion-0.097
Token comm
Token ID725
Feature activation-5.08e-01

Pos logit contributions
 journal+0.008
 journals+0.008
pron+0.007
 Christmas+0.007
 fork+0.007
Neg logit contributions
mun-0.010
 comm-0.010
ters-0.010
 ang-0.010
 persu-0.009
Tokenotion
Token ID9650
Feature activation-1.96e-01

Pos logit contributions
 comm+0.622
mun+0.561
 ang+0.537
ters+0.501
 tactic+0.485
Neg logit contributions
 journals-1.168
 journal-1.134
 Christmas-1.050
pron-1.045
ocket-1.009
Token outside
Token ID2354
Feature activation-1.99e-03

Pos logit contributions
 comm+0.430
mun+0.422
 ang+0.395
ters+0.391
 tactic+0.387
Neg logit contributions
 journals-0.252
 journal-0.245
 Christmas-0.207
pron-0.201
 fork-0.189
Token.
Token ID13
Feature activation+5.13e-03

Pos logit contributions
 comm+0.004
mun+0.004
ters+0.003
 ang+0.003
 tactic+0.003
Neg logit contributions
 journals-0.003
 journal-0.003
pron-0.003
 Christmas-0.003
ifying-0.003
Token She
Token ID1375
Feature activation+1.90e-02

Pos logit contributions
 journals+0.007
 journal+0.007
 Christmas+0.006
pron+0.006
ifying+0.006
Neg logit contributions
 comm-0.011
mun-0.010
ters-0.010
 ang-0.010
 tactic-0.009
Token hugged
Token ID41130
Feature activation+1.36e-01

Pos logit contributions
 journals+0.029
 journal+0.028
pron+0.026
 Christmas+0.025
ak+0.024
Neg logit contributions
 comm-0.036
mun-0.035
ters-0.034
 tactic-0.033
 ang-0.033
TokenJohn
Token ID7554
Feature activation+6.59e-02

Pos logit contributions
 comm+0.097
mun+0.089
 ang+0.085
ters+0.083
 persu+0.081
Neg logit contributions
 journals-0.109
 journal-0.104
ocket-0.094
 Christmas-0.092
pron-0.092
Token's
Token ID338
Feature activation+4.69e-02

Pos logit contributions
 journals+0.122
 journal+0.115
pron+0.106
 Christmas+0.102
ifying+0.099
Neg logit contributions
 comm-0.117
mun-0.109
 ang-0.104
ters-0.103
 persu-0.098
Token mom
Token ID1995
Feature activation+1.18e-01

Pos logit contributions
 journals+0.067
 journal+0.061
pron+0.057
 Christmas+0.054
ifying+0.053
Neg logit contributions
 comm-0.100
mun-0.092
 ang-0.090
ters-0.089
 persu-0.086
Token heard
Token ID2982
Feature activation+1.34e-03

Pos logit contributions
 journals+0.181
 journal+0.176
pron+0.159
 Christmas+0.155
ifying+0.146
Neg logit contributions
 comm-0.226
mun-0.215
ters-0.209
 ang-0.206
 tactic-0.205
Token the
Token ID262
Feature activation-1.68e-02

Pos logit contributions
 journals+0.002
 journal+0.002
 Christmas+0.002
pron+0.002
ifying+0.002
Neg logit contributions
 comm-0.002
mun-0.002
ters-0.002
 ang-0.002
 tactic-0.002
Token comm
Token ID725
Feature activation-5.07e-01

Pos logit contributions
ters+0.022
mun+0.022
 tactic+0.018
 persu+0.018
 persuasion+0.018
Neg logit contributions
 journal-0.037
 journals-0.034
 pillow-0.033
 fork-0.032
 thumb-0.031
Tokenotion
Token ID9650
Feature activation-1.98e-01

Pos logit contributions
 comm+0.643
mun+0.582
 ang+0.565
ters+0.525
 tactic+0.505
Neg logit contributions
 journals-1.131
 journal-1.101
 Christmas-1.016
pron-1.008
ocket-0.984
Token and
Token ID290
Feature activation+3.13e-02

Pos logit contributions
mun+0.488
 comm+0.483
 ang+0.451
ters+0.450
 tactic+0.446
Neg logit contributions
 journal-0.178
 journals-0.174
 Christmas-0.140
 forever-0.123
on-0.123
Token came
Token ID1625
Feature activation-8.88e-02

Pos logit contributions
 journals+0.039
 journal+0.039
 Christmas+0.035
 worn+0.033
 Daddy+0.033
Neg logit contributions
 comm-0.063
mun-0.061
 tactic-0.060
ters-0.060
 persu-0.057
Token to
Token ID284
Feature activation-7.99e-03

Pos logit contributions
mun+0.199
 comm+0.194
 inconvenience+0.179
 tactic+0.179
 persuasion+0.179
Neg logit contributions
 journal-0.093
 journals-0.092
ak-0.085
 Christmas-0.082
 Daddy-0.078
Token investigate
Token ID9161
Feature activation-1.10e-01

Pos logit contributions
mun+0.016
 comm+0.016
ters+0.016
 ang+0.014
 tactic+0.014
Neg logit contributions
 journals-0.011
 journal-0.011
 Christmas-0.010
pron-0.009
 fork-0.009
Token
Token ID198
Feature activation-1.80e-02

Pos logit contributions
 journals+0.008
 journal+0.008
pron+0.007
 Christmas+0.007
ifying+0.007
Neg logit contributions
 comm-0.009
mun-0.008
 ang-0.008
ters-0.008
 persu-0.008
Token
Token ID198
Feature activation-1.98e-02

Pos logit contributions
 comm+0.031
mun+0.029
 ang+0.028
ters+0.027
 persu+0.026
Neg logit contributions
 journals-0.033
 journal-0.032
pron-0.029
 Christmas-0.028
ifying-0.027
TokenJen
Token ID44875
Feature activation-1.88e-01

Pos logit contributions
 comm+0.033
mun+0.031
 ang+0.030
ters+0.029
 persu+0.028
Neg logit contributions
 journals-0.038
 journal-0.036
 Christmas-0.033
pron-0.033
ocket-0.031
Token heard
Token ID2982
Feature activation+1.75e-02

Pos logit contributions
 comm+0.341
mun+0.322
 ang+0.305
ters+0.301
 persu+0.281
Neg logit contributions
 journals-0.343
 journal-0.320
pron-0.289
 Christmas-0.286
ifying-0.278
Token the
Token ID262
Feature activation+1.71e-03

Pos logit contributions
 journals+0.023
 journal+0.022
 Christmas+0.020
on+0.018
 notebook+0.018
Neg logit contributions
mun-0.038
 comm-0.037
ters-0.036
 tactic-0.035
 persu-0.033
Token comm
Token ID725
Feature activation-5.06e-01

Pos logit contributions
 journal+0.003
 journals+0.003
 fork+0.003
 thumb+0.003
 pillow+0.003
Neg logit contributions
mun-0.003
ters-0.002
 tactic-0.002
 persuasion-0.002
 persu-0.002
Tokenotion
Token ID9650
Feature activation-1.95e-01

Pos logit contributions
 comm+0.615
mun+0.559
 ang+0.523
ters+0.506
 tactic+0.479
Neg logit contributions
 journals-1.162
 journal-1.131
 Christmas-1.046
pron-1.044
ocket-1.010
Token and
Token ID290
Feature activation+5.98e-02

Pos logit contributions
 comm+0.480
mun+0.480
ters+0.446
 tactic+0.438
 ang+0.437
Neg logit contributions
 journal-0.190
 journals-0.187
 Christmas-0.150
pron-0.140
 fork-0.135
Token came
Token ID1625
Feature activation-7.48e-02

Pos logit contributions
 journals+0.095
 journal+0.091
 Christmas+0.080
pron+0.079
ifying+0.074
Neg logit contributions
 comm-0.115
mun-0.110
ters-0.106
 tactic-0.103
 ang-0.102
Token outside
Token ID2354
Feature activation-5.69e-04

Pos logit contributions
 comm+0.169
mun+0.167
 tactic+0.153
 persuasion+0.152
 inconvenience+0.151
Neg logit contributions
 journals-0.082
 journal-0.082
ak-0.072
 Christmas-0.070
 worn-0.065
Token.
Token ID13
Feature activation+2.78e-03

Pos logit contributions
mun+0.001
 comm+0.001
ters+0.001
 ang+0.001
 tactic+0.001
Neg logit contributions
 journal-0.001
 journals-0.001
 Christmas-0.001
 forever-0.000
pron-0.000
Tokenmy
Token ID1820
Feature activation+8.00e-02

Pos logit contributions
 comm+0.142
mun+0.136
 ang+0.130
ters+0.130
 tactic+0.124
Neg logit contributions
 journals-0.096
 journal-0.090
pron-0.080
 Christmas-0.079
ifying-0.075
Token's
Token ID338
Feature activation+6.58e-02

Pos logit contributions
 journals+0.121
 journal+0.115
pron+0.101
 Christmas+0.100
ouse+0.093
Neg logit contributions
 comm-0.159
mun-0.154
ters-0.149
 ang-0.144
 tactic-0.142
Token mom
Token ID1995
Feature activation+1.35e-01

Pos logit contributions
 journals+0.102
ifying+0.094
 journal+0.090
 Christmas+0.087
pron+0.086
Neg logit contributions
 comm-0.127
 ang-0.117
mun-0.117
ters-0.111
 tactic-0.110
Token heard
Token ID2982
Feature activation+1.98e-02

Pos logit contributions
 journals+0.175
 journal+0.171
pron+0.153
 Christmas+0.143
ak+0.142
Neg logit contributions
 comm-0.277
ters-0.268
mun-0.266
 tactic-0.263
 persu-0.258
Token the
Token ID262
Feature activation+5.46e-03

Pos logit contributions
 journals+0.025
 journal+0.024
 Christmas+0.022
on+0.020
pron+0.019
Neg logit contributions
mun-0.044
 comm-0.042
ters-0.041
 tactic-0.040
 persu-0.038
Token comm
Token ID725
Feature activation-5.05e-01

Pos logit contributions
 journal+0.011
 journals+0.010
 thumb+0.010
 pillow+0.009
 daddy+0.009
Neg logit contributions
mun-0.008
ters-0.008
aro-0.007
 Ph-0.007
 casc-0.007
Tokenotion
Token ID9650
Feature activation-1.91e-01

Pos logit contributions
 comm+0.565
mun+0.521
ters+0.481
 ang+0.471
 persu+0.436
Neg logit contributions
 journals-1.223
 journal-1.180
pron-1.101
 Christmas-1.089
ifying-1.053
Token and
Token ID290
Feature activation+5.88e-02

Pos logit contributions
mun+0.504
 comm+0.493
ters+0.467
 tactic+0.459
 plunge+0.457
Neg logit contributions
 journal-0.151
 journals-0.147
 Christmas-0.112
on-0.101
pron-0.098
Token came
Token ID1625
Feature activation-7.01e-02

Pos logit contributions
 journals+0.087
 journal+0.084
 Christmas+0.076
pron+0.074
 worn+0.069
Neg logit contributions
 comm-0.114
mun-0.109
ters-0.106
 tactic-0.106
 persu-0.102
Token into
Token ID656
Feature activation-2.49e-02

Pos logit contributions
mun+0.155
 comm+0.154
ters+0.143
 persuasion+0.142
 ang+0.141
Neg logit contributions
 journals-0.081
 journal-0.078
 Christmas-0.067
ak-0.066
 worn-0.061
Token the
Token ID262
Feature activation+5.45e-03

Pos logit contributions
mun+0.053
 comm+0.053
ters+0.051
 ang+0.048
 tactic+0.048
Neg logit contributions
 journals-0.033
 journal-0.032
 Christmas-0.028
pron-0.027
 fork-0.025
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
 journals+0.228
 journal+0.224
 Christmas+0.185
pron+0.181
 notebook+0.181
Neg logit contributions
 comm-0.350
mun-0.346
ters-0.339
 ang-0.315
 tactic-0.312
Token
Token ID198
Feature activation-2.03e-02

Pos logit contributions
 comm+0.037
mun+0.035
ters+0.034
 ang+0.033
 tactic+0.032
Neg logit contributions
 journals-0.032
 journal-0.031
pron-0.028
 Christmas-0.027
ifying-0.025
TokenMom
Token ID29252
Feature activation+2.08e-01

Pos logit contributions
 comm+0.033
mun+0.031
 ang+0.030
ters+0.030
 persu+0.028
Neg logit contributions
 journals-0.040
 journal-0.038
pron-0.035
 Christmas-0.034
ifying-0.033
Token heard
Token ID2982
Feature activation+2.20e-02

Pos logit contributions
 journals+0.265
 journal+0.260
pron+0.228
ak+0.218
 fork+0.212
Neg logit contributions
 comm-0.440
mun-0.430
ters-0.422
 tactic-0.416
 persu-0.408
Token the
Token ID262
Feature activation+8.54e-03

Pos logit contributions
 journals+0.032
 journal+0.031
 Christmas+0.028
on+0.026
pron+0.025
Neg logit contributions
 comm-0.044
mun-0.044
ters-0.041
 tactic-0.041
 ang-0.039
Token comm
Token ID725
Feature activation-5.03e-01

Pos logit contributions
 journal+0.016
 pillow+0.016
 skull+0.015
 fork+0.014
 thumb+0.014
Neg logit contributions
mun-0.012
ters-0.012
 persu-0.011
 tactic-0.011
 persuasion-0.010
Tokenotion
Token ID9650
Feature activation-1.88e-01

Pos logit contributions
 comm+0.674
mun+0.604
 ang+0.582
 tactic+0.533
ters+0.529
Neg logit contributions
 journals-1.079
 journal-1.057
 Christmas-0.966
pron-0.950
ocket-0.948
Token and
Token ID290
Feature activation+6.67e-02

Pos logit contributions
mun+0.495
 comm+0.488
ters+0.452
 tactic+0.445
 ang+0.445
Neg logit contributions
 journal-0.151
 journals-0.146
 Christmas-0.106
 fork-0.099
on-0.098
Token came
Token ID1625
Feature activation-6.82e-02

Pos logit contributions
 journal+0.074
 journals+0.073
 Christmas+0.064
ak+0.062
 worn+0.061
Neg logit contributions
 comm-0.143
 tactic-0.142
mun-0.140
ters-0.135
 inconvenience-0.133
Token to
Token ID284
Feature activation+4.72e-03

Pos logit contributions
 comm+0.162
mun+0.161
 tactic+0.148
 persuasion+0.148
 inconvenience+0.148
Neg logit contributions
 journals-0.060
 journal-0.060
ak-0.057
 Christmas-0.054
 Daddy-0.049
Token see
Token ID766
Feature activation+7.50e-02

Pos logit contributions
 journals+0.006
 journal+0.006
 Christmas+0.005
 fork+0.004
 Daddy+0.004
Neg logit contributions
mun-0.010
 comm-0.010
ters-0.010
 tactic-0.009
 ang-0.009
Tokenmy
Token ID1820
Feature activation+8.20e-02

Pos logit contributions
 comm+0.121
mun+0.114
 ang+0.110
ters+0.109
 tactic+0.104
Neg logit contributions
 journals-0.093
 journal-0.087
pron-0.078
 Christmas-0.078
ifying-0.075
Token's
Token ID338
Feature activation+6.84e-02

Pos logit contributions
 journals+0.133
 journal+0.125
pron+0.113
 Christmas+0.111
ifying+0.106
Neg logit contributions
 comm-0.160
mun-0.152
ters-0.145
 ang-0.145
 persu-0.138
Token mom
Token ID1995
Feature activation+1.39e-01

Pos logit contributions
 journals+0.096
 journal+0.089
pron+0.080
 Christmas+0.078
ifying+0.076
Neg logit contributions
 comm-0.149
mun-0.139
 ang-0.135
ters-0.133
 tactic-0.129
Token heard
Token ID2982
Feature activation+2.39e-02

Pos logit contributions
 journals+0.183
 journal+0.179
pron+0.160
ak+0.151
 Christmas+0.151
Neg logit contributions
 comm-0.282
ters-0.272
mun-0.270
 tactic-0.264
 persu-0.260
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
 journals+0.033
 journal+0.031
 Christmas+0.028
pron+0.026
on+0.025
Neg logit contributions
mun-0.051
 comm-0.049
ters-0.047
 tactic-0.046
 persu-0.044
Token comm
Token ID725
Feature activation-5.02e-01

Pos logit contributions
 journal+0.022
 journals+0.020
 thumb+0.020
 fork+0.020
pron+0.020
Neg logit contributions
mun-0.014
ters-0.013
 persu-0.012
 persuasion-0.012
 tactic-0.012
Tokenotion
Token ID9650
Feature activation-1.89e-01

Pos logit contributions
 comm+0.552
mun+0.509
ters+0.469
 ang+0.458
 persu+0.424
Neg logit contributions
 journals-1.227
 journal-1.184
pron-1.105
 Christmas-1.093
ifying-1.057
Token and
Token ID290
Feature activation+5.78e-02

Pos logit contributions
mun+0.467
 comm+0.462
ters+0.434
 tactic+0.424
 ang+0.422
Neg logit contributions
 journals-0.179
 journal-0.179
 Christmas-0.139
pron-0.131
on-0.126
Token came
Token ID1625
Feature activation-6.73e-02

Pos logit contributions
 journals+0.093
 journal+0.090
 Christmas+0.080
pron+0.079
 notebook+0.073
Neg logit contributions
 comm-0.108
mun-0.104
ters-0.099
 tactic-0.098
 persu-0.096
Token running
Token ID2491
Feature activation-1.23e-01

Pos logit contributions
 comm+0.142
mun+0.142
ters+0.130
 ang+0.129
 persuasion+0.128
Neg logit contributions
 journals-0.088
 journal-0.086
 Christmas-0.073
ak-0.073
 worn-0.068
Token.
Token ID13
Feature activation+4.20e-03

Pos logit contributions
mun+0.296
 comm+0.269
 persuasion+0.255
 tactic+0.255
ters+0.254
Neg logit contributions
 journals-0.128
 journal-0.126
 Christmas-0.103
 worn-0.101
 forever-0.100
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
 comm+0.023
mun+0.021
 ang+0.020
ters+0.020
 persu+0.019
Neg logit contributions
 journals-0.024
 journal-0.023
pron-0.021
 Christmas-0.021
ifying-0.020
TokenThe
Token ID464
Feature activation+7.18e-02

Pos logit contributions
 comm+0.024
mun+0.023
ters+0.022
 ang+0.022
 persu+0.020
Neg logit contributions
 journals-0.025
 journal-0.024
pron-0.021
 Christmas-0.020
ocket-0.020
Token farmer
Token ID18739
Feature activation-4.99e-03

Pos logit contributions
 journal+0.105
 journals+0.105
pron+0.098
 Christmas+0.097
 fork+0.089
Neg logit contributions
mun-0.132
 tactic-0.131
ters-0.130
whe-0.125
 persuasion-0.124
Token hears
Token ID23180
Feature activation-4.27e-02

Pos logit contributions
 comm+0.009
 tactic+0.009
ters+0.009
mun+0.009
 persu+0.008
Neg logit contributions
 journals-0.008
 journal-0.008
 Christmas-0.007
pron-0.006
 fork-0.006
Token the
Token ID262
Feature activation+9.46e-03

Pos logit contributions
 comm+0.082
mun+0.081
 tactic+0.076
ters+0.074
 ang+0.073
Neg logit contributions
 journals-0.063
 Christmas-0.060
 journal-0.059
pron-0.052
 Daddy-0.051
Token comm
Token ID725
Feature activation-5.01e-01

Pos logit contributions
 journal+0.016
 pillow+0.015
 thumb+0.015
 fork+0.015
 skull+0.015
Neg logit contributions
mun-0.014
ters-0.013
 persu-0.013
 tactic-0.013
whe-0.013
Tokenotion
Token ID9650
Feature activation-1.87e-01

Pos logit contributions
 comm+0.626
mun+0.558
 ang+0.538
ters+0.502
 tactic+0.492
Neg logit contributions
 journals-1.129
 journal-1.093
 Christmas-1.021
pron-1.010
ocket-0.981
Token and
Token ID290
Feature activation+7.36e-02

Pos logit contributions
mun+0.463
 comm+0.461
ters+0.426
 tactic+0.420
 ang+0.417
Neg logit contributions
 journals-0.181
 journal-0.177
 Christmas-0.137
 fork-0.128
ak-0.126
Token comes
Token ID2058
Feature activation-1.20e-01

Pos logit contributions
 journals+0.109
 journal+0.105
 Christmas+0.093
 fork+0.091
 pillow+0.088
Neg logit contributions
 comm-0.130
ters-0.129
 tactic-0.129
mun-0.123
 persuasion-0.122
Token to
Token ID284
Feature activation+3.02e-03

Pos logit contributions
mun+0.310
 comm+0.305
rob+0.286
 tactic+0.284
 persuasion+0.279
Neg logit contributions
 journals-0.084
 journal-0.083
 Christmas-0.076
ak-0.074
 forever-0.071
Token see
Token ID766
Feature activation+8.08e-02

Pos logit contributions
 Christmas+0.003
 journals+0.003
 journal+0.003
 dreamed+0.003
 Daddy+0.003
Neg logit contributions
mun-0.007
 comm-0.007
ters-0.007
 tactic-0.006
arling-0.006
Tokenmy
Token ID1820
Feature activation+8.24e-02

Pos logit contributions
 comm+0.136
mun+0.130
ters+0.125
 ang+0.124
 tactic+0.119
Neg logit contributions
 journals-0.094
 journal-0.088
pron-0.078
 Christmas-0.077
ifying-0.073
Token's
Token ID338
Feature activation+6.80e-02

Pos logit contributions
 journals+0.132
 journal+0.126
pron+0.113
 Christmas+0.111
ouse+0.105
Neg logit contributions
 comm-0.156
mun-0.151
ters-0.144
 ang-0.141
 persu-0.137
Token mom
Token ID1995
Feature activation+1.41e-01

Pos logit contributions
 journals+0.112
ifying+0.103
 Christmas+0.102
pron+0.101
iflower+0.099
Neg logit contributions
 comm-0.123
mun-0.111
 ang-0.111
 tactic-0.106
ters-0.105
Token noticed
Token ID6810
Feature activation+9.01e-02

Pos logit contributions
 journals+0.193
 journal+0.190
pron+0.173
 Christmas+0.158
on+0.155
Neg logit contributions
 comm-0.284
mun-0.274
ters-0.267
 tactic-0.261
 persu-0.259
Token the
Token ID262
Feature activation+6.46e-03

Pos logit contributions
 journals+0.126
 journal+0.123
 Christmas+0.108
pron+0.108
on+0.100
Neg logit contributions
 comm-0.187
mun-0.183
ters-0.171
 tactic-0.168
 persu-0.166
Token comm
Token ID725
Feature activation-4.98e-01

Pos logit contributions
 journals+0.011
 journal+0.010
pron+0.009
 Christmas+0.009
ifying+0.008
Neg logit contributions
 comm-0.012
mun-0.012
ters-0.011
 ang-0.011
 tactic-0.011
Tokenotion
Token ID9650
Feature activation-1.86e-01

Pos logit contributions
 comm+0.572
mun+0.521
 ang+0.481
ters+0.475
 tactic+0.441
Neg logit contributions
 journals-1.188
 journal-1.150
pron-1.069
 Christmas-1.062
ifying-1.026
Token and
Token ID290
Feature activation+5.70e-02

Pos logit contributions
 comm+0.403
mun+0.390
ters+0.371
 ang+0.367
 tactic+0.357
Neg logit contributions
 journals-0.252
 journal-0.241
pron-0.206
 Christmas-0.204
ifying-0.187
Token came
Token ID1625
Feature activation-6.75e-02

Pos logit contributions
 journals+0.098
 journal+0.094
pron+0.086
 Christmas+0.085
 worn+0.080
Neg logit contributions
 comm-0.098
mun-0.094
ters-0.089
 tactic-0.089
 ang-0.086
Token outside
Token ID2354
Feature activation+9.68e-03

Pos logit contributions
 comm+0.137
mun+0.137
ters+0.124
 persuasion+0.123
 tactic+0.123
Neg logit contributions
 journals-0.093
 journal-0.090
 Christmas-0.078
ak-0.077
 worn-0.074
Token.
Token ID13
Feature activation+1.32e-03

Pos logit contributions
 journals+0.012
 journal+0.012
 Christmas+0.011
pron+0.010
 fork+0.009
Neg logit contributions
mun-0.021
 comm-0.021
ters-0.020
 ang-0.018
 tactic-0.018
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
 journals+0.030
 journal+0.028
pron+0.026
 Christmas+0.025
ifying+0.024
Neg logit contributions
 comm-0.032
mun-0.031
ters-0.029
 ang-0.029
 persu-0.028
Token
Token ID198
Feature activation-1.58e-02

Pos logit contributions
 comm+0.029
mun+0.027
ters+0.026
 ang+0.026
 persu+0.025
Neg logit contributions
 journals-0.028
 journal-0.027
pron-0.024
 Christmas-0.024
ifying-0.023
TokenMom
Token ID29252
Feature activation+2.11e-01

Pos logit contributions
 comm+0.025
mun+0.024
ters+0.023
 ang+0.022
 persu+0.021
Neg logit contributions
 journals-0.031
 journal-0.030
 Christmas-0.027
ocket-0.027
pron-0.027
Token heard
Token ID2982
Feature activation+2.40e-02

Pos logit contributions
 journals+0.256
 journal+0.248
pron+0.228
ak+0.218
 Christmas+0.207
Neg logit contributions
 comm-0.454
mun-0.439
 tactic-0.425
ters-0.423
 persu-0.421
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
 journals+0.040
 journal+0.038
 Christmas+0.035
pron+0.034
ifying+0.032
Neg logit contributions
 comm-0.044
mun-0.043
 tactic-0.040
 ang-0.039
ters-0.039
Token comm
Token ID725
Feature activation-4.96e-01

Pos logit contributions
 journal+0.022
 journals+0.021
pron+0.020
 pillow+0.020
 Christmas+0.019
Neg logit contributions
mun-0.016
ters-0.015
 tactic-0.014
 persu-0.014
arling-0.014
Tokenotion
Token ID9650
Feature activation-1.86e-01

Pos logit contributions
 comm+0.598
mun+0.543
 ang+0.513
ters+0.486
 tactic+0.467
Neg logit contributions
 journals-1.152
 journal-1.113
 Christmas-1.029
pron-1.028
ocket-0.990
Token and
Token ID290
Feature activation+6.75e-02

Pos logit contributions
mun+0.482
 comm+0.475
ters+0.442
 tactic+0.436
 ang+0.430
Neg logit contributions
 journals-0.155
 journal-0.153
 Christmas-0.121
pron-0.106
ak-0.102
Token came
Token ID1625
Feature activation-6.40e-02

Pos logit contributions
 journals+0.090
 journal+0.087
 Christmas+0.079
 worn+0.073
pron+0.073
Neg logit contributions
 comm-0.137
 tactic-0.131
mun-0.131
ters-0.128
 persu-0.123
Token to
Token ID284
Feature activation+6.81e-03

Pos logit contributions
 comm+0.154
mun+0.154
 tactic+0.141
 persuasion+0.140
 ang+0.139
Neg logit contributions
 journals-0.060
 journal-0.058
 Christmas-0.052
ak-0.048
 Daddy-0.045
Token the
Token ID262
Feature activation+1.14e-02

Pos logit contributions
 journals+0.008
 journal+0.008
 Christmas+0.008
pron+0.006
 notebook+0.006
Neg logit contributions
mun-0.015
 comm-0.015
ters-0.014
 tactic-0.014
 ang-0.013
Token came
Token ID1625
Feature activation-6.59e-02

Pos logit contributions
 journals+0.207
 journal+0.199
pron+0.182
 Christmas+0.178
ifying+0.167
Neg logit contributions
 comm-0.265
mun-0.253
 ang-0.242
ters-0.239
 tactic-0.234
Token to
Token ID284
Feature activation+1.21e-02

Pos logit contributions
 comm+0.117
mun+0.113
ters+0.105
 ang+0.104
 tactic+0.100
Neg logit contributions
 journals-0.115
 journal-0.110
pron-0.098
 Christmas-0.098
ifying-0.094
Token see
Token ID766
Feature activation+8.11e-02

Pos logit contributions
 journals+0.020
 journal+0.019
pron+0.017
 Christmas+0.017
ifying+0.016
Neg logit contributions
 comm-0.023
mun-0.022
ters-0.021
 ang-0.020
 persu-0.020
Token what
Token ID644
Feature activation-6.52e-02

Pos logit contributions
 journals+0.162
 journal+0.155
pron+0.145
 Christmas+0.140
ouse+0.135
Neg logit contributions
 comm-0.125
mun-0.118
ters-0.113
 ang-0.111
 tactic-0.104
Token the
Token ID262
Feature activation+5.30e-03

Pos logit contributions
 comm+0.117
mun+0.114
 ang+0.103
ters+0.101
 tactic+0.100
Neg logit contributions
 journals-0.110
 journal-0.103
ifying-0.099
pron-0.096
ocket-0.094
Token comm
Token ID725
Feature activation-4.92e-01

Pos logit contributions
 journal+0.005
 journals+0.005
 seed+0.004
 notebook+0.004
 Christmas+0.004
Neg logit contributions
mun-0.013
 comm-0.013
 ang-0.012
 persu-0.012
 persuasion-0.012
Tokenotion
Token ID9650
Feature activation-1.78e-01

Pos logit contributions
 comm+0.454
mun+0.440
ters+0.416
 ang+0.356
 persu+0.347
Neg logit contributions
 journals-1.282
 journal-1.233
pron-1.163
 Christmas-1.141
 notebook-1.103
Token was
Token ID373
Feature activation-9.81e-02

Pos logit contributions
 comm+0.317
mun+0.303
 ang+0.288
ters+0.285
 persu+0.275
Neg logit contributions
 journals-0.302
 journal-0.293
 Christmas-0.261
pron-0.261
 fork-0.244
Token about
Token ID546
Feature activation-1.01e-01

Pos logit contributions
mun+0.197
 comm+0.191
 ang+0.178
ters+0.177
 persuasion+0.173
Neg logit contributions
 journal-0.139
 journals-0.138
 Christmas-0.124
pron-0.123
ak-0.117
Token.
Token ID13
Feature activation+4.02e-03

Pos logit contributions
 comm+0.164
ters+0.151
mun+0.150
 ang+0.144
 persu+0.140
Neg logit contributions
 journals-0.190
 journal-0.179
pron-0.167
 Christmas-0.157
ocket-0.155
Token She
Token ID1375
Feature activation+2.26e-02

Pos logit contributions
 journals+0.008
 journal+0.007
ocket+0.007
ak+0.007
pron+0.007
Neg logit contributions
 comm-0.007
mun-0.006
ters-0.006
 ang-0.006
 persu-0.006
Token
Token ID198
Feature activation-3.01e-02

Pos logit contributions
 comm+0.058
mun+0.055
ters+0.053
 ang+0.052
 persu+0.050
Neg logit contributions
 journals-0.048
 journal-0.046
pron-0.042
 Christmas-0.041
ifying-0.039
TokenB
Token ID33
Feature activation-1.14e-01

Pos logit contributions
 comm+0.054
mun+0.051
ters+0.048
 ang+0.048
 persu+0.046
Neg logit contributions
 journals-0.054
 journal-0.052
pron-0.046
 Christmas-0.045
ocket-0.045
Tokeneth
Token ID2788
Feature activation+8.56e-02

Pos logit contributions
 comm+0.224
mun+0.212
ters+0.205
 ang+0.200
 persu+0.195
Neg logit contributions
 journals-0.184
 journal-0.173
pron-0.151
 Christmas-0.148
ocket-0.142
Token heard
Token ID2982
Feature activation+2.66e-02

Pos logit contributions
 journals+0.149
 journal+0.142
pron+0.129
 Christmas+0.126
ifying+0.121
Neg logit contributions
 comm-0.155
mun-0.147
ters-0.140
 ang-0.138
 persu-0.132
Token the
Token ID262
Feature activation+6.12e-03

Pos logit contributions
 journals+0.038
 journal+0.036
 Christmas+0.033
pron+0.031
ifying+0.028
Neg logit contributions
 comm-0.055
mun-0.055
ters-0.052
 ang-0.050
 tactic-0.049
Token comm
Token ID725
Feature activation-4.91e-01

Pos logit contributions
 journal+0.012
 journals+0.012
pron+0.010
 Christmas+0.010
 pillow+0.010
Neg logit contributions
mun-0.009
ters-0.009
 ang-0.009
 comm-0.008
arling-0.008
Tokenotion
Token ID9650
Feature activation-1.78e-01

Pos logit contributions
 comm+0.537
mun+0.494
ters+0.454
 ang+0.446
 persu+0.411
Neg logit contributions
 journals-1.203
 journal-1.161
pron-1.084
 Christmas-1.073
ifying-1.037
Token and
Token ID290
Feature activation+5.55e-02

Pos logit contributions
 comm+0.385
mun+0.379
ters+0.357
 ang+0.349
 tactic+0.341
Neg logit contributions
 journals-0.234
 journal-0.228
 Christmas-0.193
pron-0.190
 notebook-0.175
Token came
Token ID1625
Feature activation-6.66e-02

Pos logit contributions
 journals+0.084
 journal+0.081
 Christmas+0.072
pron+0.070
 worn+0.065
Neg logit contributions
 comm-0.109
mun-0.106
ters-0.100
 tactic-0.098
 ang-0.097
Token running
Token ID2491
Feature activation-1.20e-01

Pos logit contributions
 comm+0.134
mun+0.131
ters+0.119
 ang+0.118
 persuasion+0.117
Neg logit contributions
 journals-0.098
 journal-0.095
 Christmas-0.082
pron-0.078
ak-0.078
Token.
Token ID13
Feature activation-4.16e-03

Pos logit contributions
mun+0.257
 comm+0.246
 persuasion+0.223
ters+0.221
 tactic+0.220
Neg logit contributions
 journals-0.160
 journal-0.156
 Christmas-0.137
ifying-0.130
 forever-0.128
Token find
Token ID1064
Feature activation+1.25e-01

Pos logit contributions
 journals+0.101
 journal+0.098
 Christmas+0.086
pron+0.086
 notebook+0.081
Neg logit contributions
 comm-0.104
mun-0.099
ters-0.096
 ang-0.092
 persu-0.090
Token out
Token ID503
Feature activation-6.69e-02

Pos logit contributions
 journals+0.193
 journal+0.187
 Christmas+0.163
 notebook+0.150
 fork+0.149
Neg logit contributions
 comm-0.247
mun-0.238
ters-0.225
 persu-0.222
 ang-0.221
Token what
Token ID644
Feature activation-6.46e-02

Pos logit contributions
 comm+0.124
mun+0.121
ters+0.112
 tactic+0.106
 ang+0.105
Neg logit contributions
 journals-0.114
 journal-0.109
 Christmas-0.097
ifying-0.092
 forever-0.088
Token all
Token ID477
Feature activation+3.32e-02

Pos logit contributions
 comm+0.123
mun+0.117
ters+0.112
 ang+0.111
 persu+0.106
Neg logit contributions
 journals-0.107
 journal-0.101
pron-0.091
 Christmas-0.090
ifying-0.084
Token the
Token ID262
Feature activation+9.03e-03

Pos logit contributions
 journals+0.057
 journal+0.054
 Christmas+0.048
pron+0.048
ifying+0.045
Neg logit contributions
 comm-0.061
mun-0.058
ters-0.055
 ang-0.055
 persu-0.053
Token comm
Token ID725
Feature activation-4.90e-01

Pos logit contributions
 journals+0.017
 journal+0.016
 Christmas+0.015
pron+0.015
 fork+0.014
Neg logit contributions
 comm-0.014
mun-0.014
 ang-0.013
ters-0.013
 persu-0.013
Tokenotion
Token ID9650
Feature activation-1.76e-01

Pos logit contributions
 comm+0.454
mun+0.442
ters+0.419
 ang+0.352
 persu+0.345
Neg logit contributions
 journals-1.273
 journal-1.228
pron-1.167
 Christmas-1.134
ifying-1.095
Token was
Token ID373
Feature activation-9.52e-02

Pos logit contributions
 comm+0.313
mun+0.298
ters+0.283
 ang+0.281
 persu+0.268
Neg logit contributions
 journals-0.307
 journal-0.292
pron-0.265
 Christmas-0.261
ifying-0.248
Token about
Token ID546
Feature activation-1.00e-01

Pos logit contributions
 comm+0.156
mun+0.155
ters+0.141
 ang+0.140
 tactic+0.136
Neg logit contributions
 journals-0.172
 journal-0.167
 Christmas-0.154
pron-0.153
ak-0.144
Token.
Token ID13
Feature activation+1.92e-03

Pos logit contributions
 comm+0.165
mun+0.155
ters+0.151
 ang+0.145
 persu+0.141
Neg logit contributions
 journals-0.187
 journal-0.176
pron-0.167
 Christmas-0.155
ifying-0.154
Token When
Token ID1649
Feature activation-6.14e-02

Pos logit contributions
 journals+0.003
 journal+0.003
 Christmas+0.002
pron+0.002
ifying+0.002
Neg logit contributions
 comm-0.004
mun-0.004
ters-0.004
 ang-0.004
 persu-0.003
Token.
Token ID13
Feature activation-9.19e-03

Pos logit contributions
 journals+0.120
 journal+0.113
pron+0.102
 Christmas+0.101
ifying+0.095
Neg logit contributions
 comm-0.123
mun-0.116
ters-0.113
 ang-0.109
 tactic-0.105
Token
Token ID198
Feature activation-3.58e-02

Pos logit contributions
 comm+0.017
mun+0.016
ters+0.016
 ang+0.015
 tactic+0.015
Neg logit contributions
 journals-0.015
 journal-0.015
pron-0.013
 Christmas-0.013
ocket-0.012
Token
Token ID198
Feature activation-3.66e-02

Pos logit contributions
 comm+0.060
mun+0.058
ters+0.053
 ang+0.053
 tactic+0.051
Neg logit contributions
 journals-0.068
 journal-0.064
pron-0.058
 Christmas-0.058
ifying-0.056
TokenThe
Token ID464
Feature activation+5.80e-02

Pos logit contributions
 comm+0.064
mun+0.062
ters+0.058
 ang+0.057
 persu+0.055
Neg logit contributions
 journals-0.066
 journal-0.063
 Christmas-0.056
pron-0.055
ocket-0.054
Token chip
Token ID11594
Feature activation-3.03e-01

Pos logit contributions
 journals+0.102
 journal+0.099
pron+0.089
 Christmas+0.088
 notebook+0.082
Neg logit contributions
 comm-0.103
mun-0.099
ters-0.093
 ang-0.093
 tactic-0.089
Tokenmun
Token ID6199
Feature activation-4.74e-01

Pos logit contributions
 comm+0.406
 ang+0.357
 persu+0.340
ters+0.339
 tactic+0.336
Neg logit contributions
 journals-0.639
 journal-0.627
pron-0.596
 Christmas-0.579
iflower-0.564
Tokenks
Token ID591
Feature activation-8.97e-02

Pos logit contributions
 comm+0.822
mun+0.767
 ang+0.752
 persu+0.711
ters+0.710
Neg logit contributions
 journals-0.851
 journal-0.811
ocket-0.720
ak-0.717
pron-0.710
Token were
Token ID547
Feature activation-5.05e-02

Pos logit contributions
 comm+0.155
mun+0.148
ters+0.140
 ang+0.139
 tactic+0.133
Neg logit contributions
 journals-0.161
 journal-0.154
pron-0.140
 Christmas-0.138
ifying-0.132
Token very
Token ID845
Feature activation-1.03e-02

Pos logit contributions
 comm+0.086
mun+0.082
ters+0.078
 ang+0.077
 tactic+0.074
Neg logit contributions
 journals-0.093
 journal-0.088
pron-0.081
 Christmas-0.080
ifying-0.076
Token curious
Token ID11040
Feature activation-2.96e-02

Pos logit contributions
 comm+0.020
mun+0.019
ters+0.018
 ang+0.018
 tactic+0.017
Neg logit contributions
 journals-0.017
 journal-0.016
pron-0.014
 Christmas-0.014
ifying-0.013
Token,
Token ID11
Feature activation-3.10e-02

Pos logit contributions
 comm+0.060
mun+0.058
 ang+0.055
ters+0.055
 persu+0.053
Neg logit contributions
 journals-0.045
 journal-0.043
 Christmas-0.037
pron-0.036
 fork-0.034
Token!
Token ID0
Feature activation+9.88e-03

Pos logit contributions
 journals+0.023
 journal+0.022
pron+0.020
 Christmas+0.019
ifying+0.018
Neg logit contributions
 comm-0.028
mun-0.026
ters-0.025
 ang-0.025
 persu-0.024
Token
Token ID198
Feature activation-3.06e-02

Pos logit contributions
 journals+0.016
 journal+0.016
pron+0.014
 Christmas+0.014
ifying+0.013
Neg logit contributions
 comm-0.019
mun-0.018
ters-0.017
 ang-0.017
 persu-0.016
Token
Token ID198
Feature activation-3.07e-02

Pos logit contributions
 comm+0.056
mun+0.053
ters+0.051
 ang+0.050
 persu+0.048
Neg logit contributions
 journals-0.053
 journal-0.050
pron-0.045
 Christmas-0.045
ifying-0.042
TokenThe
Token ID464
Feature activation+5.96e-02

Pos logit contributions
 comm+0.057
mun+0.055
ters+0.052
 ang+0.051
 persu+0.050
Neg logit contributions
 journals-0.052
 journal-0.050
 Christmas-0.044
pron-0.044
ocket-0.041
Token chip
Token ID11594
Feature activation-2.98e-01

Pos logit contributions
 journal+0.090
 journals+0.090
pron+0.086
 Christmas+0.084
 fork+0.080
Neg logit contributions
 comm-0.110
mun-0.108
ters-0.100
 ang-0.099
 tactic-0.099
Tokenmun
Token ID6199
Feature activation-4.70e-01

Pos logit contributions
 comm+0.403
 ang+0.358
 persu+0.338
 tactic+0.335
ters+0.325
Neg logit contributions
 journals-0.609
 journal-0.601
pron-0.578
 Christmas-0.560
iflower-0.549
Tokenks
Token ID591
Feature activation-8.54e-02

Pos logit contributions
 comm+0.832
mun+0.776
 ang+0.770
 persu+0.728
 persuasion+0.714
Neg logit contributions
 journals-0.808
 journal-0.772
ak-0.703
ocket-0.697
ifying-0.677
Token didn
Token ID1422
Feature activation-2.01e-02

Pos logit contributions
 comm+0.149
mun+0.141
ters+0.136
 ang+0.133
 persu+0.126
Neg logit contributions
 journals-0.154
 journal-0.147
pron-0.133
 Christmas-0.131
ifying-0.124
Token't
Token ID470
Feature activation-7.00e-02

Pos logit contributions
 comm+0.037
mun+0.037
ters+0.033
 ang+0.033
 tactic+0.032
Neg logit contributions
 journals-0.032
 journal-0.031
 Christmas-0.029
pron-0.029
ifying-0.028
Token know
Token ID760
Feature activation+2.67e-03

Pos logit contributions
mun+0.123
 comm+0.120
ters+0.110
 ang+0.108
 persu+0.103
Neg logit contributions
 journals-0.120
 journal-0.117
 Christmas-0.108
pron-0.103
 dreamed-0.098
Token what
Token ID644
Feature activation-7.53e-02

Pos logit contributions
 journals+0.004
 journal+0.004
 Christmas+0.004
ifying+0.004
pron+0.004
Neg logit contributions
 comm-0.005
mun-0.005
 ang-0.004
 tactic-0.004
ters-0.004
Token cutting
Token ID7720
Feature activation-4.39e-02

Pos logit contributions
 comm+0.541
mun+0.524
ters+0.500
 ang+0.496
 tactic+0.483
Neg logit contributions
 journals-0.413
 journal-0.398
pron-0.354
 Christmas-0.352
ifying-0.325
Token!"
Token ID2474
Feature activation+6.75e-02

Pos logit contributions
 comm+0.078
ters+0.076
 ang+0.073
mun+0.073
 tactic+0.067
Neg logit contributions
 journals-0.077
 journal-0.072
pron-0.066
 Christmas-0.064
ocket-0.063
Token The
Token ID383
Feature activation+7.18e-02

Pos logit contributions
 journals+0.114
 journal+0.108
pron+0.097
 Christmas+0.096
ifying+0.091
Neg logit contributions
 comm-0.125
mun-0.119
ters-0.114
 ang-0.112
 persu-0.108
Token wood
Token ID4898
Feature activation+3.53e-02

Pos logit contributions
 journals+0.128
 journal+0.126
pron+0.114
 Christmas+0.111
 fork+0.106
Neg logit contributions
 comm-0.122
mun-0.118
ters-0.111
 ang-0.109
 tactic-0.108
Tokencut
Token ID8968
Feature activation-2.11e-01

Pos logit contributions
 journals+0.064
 journal+0.061
pron+0.056
 Christmas+0.055
ifying+0.052
Neg logit contributions
 comm-0.060
mun-0.058
ters-0.054
 ang-0.054
 persu-0.052
Tokenters
Token ID1010
Feature activation-4.69e-01

Pos logit contributions
 comm+0.316
mun+0.285
 persu+0.269
 tactic+0.269
 persuasion+0.268
Neg logit contributions
 journals-0.406
 journal-0.405
pron-0.367
iflower-0.366
 Christmas-0.365
Token heard
Token ID2982
Feature activation+7.56e-03

Pos logit contributions
 comm+0.813
mun+0.770
ters+0.733
 ang+0.727
 persu+0.689
Neg logit contributions
 journals-0.852
 journal-0.808
pron-0.737
 Christmas-0.728
ifying-0.688
Token them
Token ID606
Feature activation-1.22e-01

Pos logit contributions
 journals+0.012
 journal+0.011
 Christmas+0.010
pron+0.010
ifying+0.009
Neg logit contributions
mun-0.014
 comm-0.014
ters-0.013
 tactic-0.013
 ang-0.013
Token and
Token ID290
Feature activation+5.59e-02

Pos logit contributions
 comm+0.246
mun+0.243
ters+0.223
 ang+0.221
 persuasion+0.217
Neg logit contributions
 journals-0.170
 journal-0.166
 Christmas-0.144
pron-0.140
ifying-0.139
Token stopped
Token ID5025
Feature activation-7.44e-02

Pos logit contributions
 journals+0.077
 journal+0.075
 Christmas+0.066
pron+0.064
 fork+0.064
Neg logit contributions
 comm-0.110
mun-0.109
 tactic-0.105
ters-0.101
 persuasion-0.100
Token.
Token ID13
Feature activation-6.71e-03

Pos logit contributions
 comm+0.130
mun+0.129
 tactic+0.119
ters+0.118
 persuasion+0.112
Neg logit contributions
 journals-0.121
 journal-0.119
 Christmas-0.108
pron-0.105
ak-0.102
Token.
Token ID13
Feature activation-7.53e-03

Pos logit contributions
 comm+0.168
mun+0.161
ters+0.150
 ang+0.146
 persu+0.141
Neg logit contributions
 journals-0.204
 journal-0.188
pron-0.176
ifying-0.166
ocket-0.164
Token She
Token ID1375
Feature activation+1.10e-03

Pos logit contributions
 comm+0.012
mun+0.011
ters+0.011
 persuasion+0.010
 tactic+0.010
Neg logit contributions
 journals-0.015
 journal-0.014
pron-0.013
iflower-0.013
ocket-0.012
Token realized
Token ID6939
Feature activation+7.80e-02

Pos logit contributions
 journals+0.002
 journal+0.002
pron+0.002
ifying+0.002
 Christmas+0.002
Neg logit contributions
 comm-0.002
mun-0.002
 ang-0.002
 warns-0.002
ters-0.002
Token that
Token ID326
Feature activation-4.83e-02

Pos logit contributions
 journals+0.142
 journal+0.135
pron+0.123
 Christmas+0.120
ifying+0.116
Neg logit contributions
 comm-0.135
mun-0.127
ters-0.122
 ang-0.120
 persu-0.115
Token ot
Token ID30972
Feature activation-2.83e-01

Pos logit contributions
 comm+0.088
mun+0.084
ters+0.080
 ang+0.079
 persu+0.075
Neg logit contributions
 journals-0.083
 journal-0.079
pron-0.071
 Christmas-0.070
ifying-0.067
Tokenters
Token ID1010
Feature activation-4.68e-01

Pos logit contributions
 comm+0.396
 persu+0.322
 tactic+0.310
 persuasion+0.305
 ang+0.290
Neg logit contributions
 journals-0.571
 journal-0.569
ak-0.559
ifying-0.542
ocket-0.520
Token are
Token ID389
Feature activation-5.54e-02

Pos logit contributions
 comm+0.797
mun+0.778
ters+0.741
 ang+0.706
 warns+0.679
Neg logit contributions
 journals-0.856
 journal-0.803
pron-0.758
 Christmas-0.731
ifying-0.686
Token better
Token ID1365
Feature activation-1.48e-01

Pos logit contributions
 comm+0.109
mun+0.108
 tactic+0.100
ters+0.097
 ang+0.097
Neg logit contributions
 journals-0.084
 journal-0.080
 Christmas-0.072
pron-0.070
ifying-0.067
Token off
Token ID572
Feature activation-7.05e-02

Pos logit contributions
 comm+0.292
mun+0.279
ters+0.270
 ang+0.263
 persu+0.253
Neg logit contributions
 journals-0.235
 journal-0.221
pron-0.206
 Christmas-0.192
ifying-0.186
Token in
Token ID287
Feature activation+3.56e-04

Pos logit contributions
 comm+0.110
mun+0.103
ters+0.101
 ang+0.096
 persu+0.092
Neg logit contributions
 journals-0.140
 journal-0.132
pron-0.127
ouse-0.118
ocket-0.116
Token the
Token ID262
Feature activation-3.30e-03

Pos logit contributions
 journals+0.001
 journal+0.001
pron+0.001
ouse+0.001
 Christmas+0.001
Neg logit contributions
 comm-0.001
mun-0.001
ters-0.001
 ang-0.001
 persu-0.001
Token Clarence
Token ID42602
Feature activation-9.10e-02

Pos logit contributions
 comm+0.008
mun+0.008
ters+0.007
 ang+0.007
 persu+0.007
Neg logit contributions
 journals-0.008
 journal-0.008
pron-0.007
ocket-0.006
 Christmas-0.006
Token asked
Token ID1965
Feature activation+8.11e-03

Pos logit contributions
 comm+0.152
mun+0.149
ters+0.140
 ang+0.137
 warns+0.128
Neg logit contributions
 journals-0.175
 journal-0.164
pron-0.153
 Christmas-0.146
ifying-0.141
Token his
Token ID465
Feature activation+1.06e-01

Pos logit contributions
 journals+0.013
 journal+0.012
pron+0.011
 Christmas+0.011
ifying+0.010
Neg logit contributions
 comm-0.016
mun-0.015
ters-0.014
 ang-0.014
 persu-0.014
Token friend
Token ID1545
Feature activation-4.80e-02

Pos logit contributions
 journals+0.144
 journal+0.143
 Christmas+0.123
 fork+0.117
 notebook+0.114
Neg logit contributions
mun-0.228
 comm-0.225
ters-0.220
 ang-0.210
 persu-0.206
Token Chip
Token ID17869
Feature activation+1.29e-02

Pos logit contributions
 comm+0.097
mun+0.088
ters+0.085
 ang+0.085
 persu+0.083
Neg logit contributions
 journals-0.073
 journal-0.072
 Christmas-0.064
 notebook-0.060
 fork-0.059
Tokenmun
Token ID6199
Feature activation-4.67e-01

Pos logit contributions
 journals+0.023
 journal+0.022
 Christmas+0.019
pron+0.019
ocket+0.018
Neg logit contributions
 comm-0.023
mun-0.022
ters-0.021
 ang-0.020
 persu-0.020
Tokenks
Token ID591
Feature activation-8.06e-02

Pos logit contributions
 comm+0.790
mun+0.734
 ang+0.700
ters+0.695
 persu+0.666
Neg logit contributions
 journals-0.874
 journal-0.832
 Christmas-0.729
ocket-0.728
ak-0.727
Token,
Token ID11
Feature activation-3.06e-02

Pos logit contributions
 comm+0.153
mun+0.147
ters+0.139
 ang+0.138
 persu+0.133
Neg logit contributions
 journals-0.132
 journal-0.126
pron-0.112
 Christmas-0.111
ifying-0.105
Token "
Token ID366
Feature activation-1.24e-01

Pos logit contributions
 comm+0.056
mun+0.053
ters+0.050
 ang+0.049
 tactic+0.049
Neg logit contributions
 journal-0.047
 journals-0.047
 Christmas-0.044
pron-0.043
 fork-0.040
TokenIs
Token ID3792
Feature activation-1.88e-01

Pos logit contributions
 comm+0.202
mun+0.181
 ang+0.176
 persuasion+0.171
 persu+0.170
Neg logit contributions
 journals-0.230
 journal-0.220
pron-0.211
ifying-0.205
 Christmas-0.202
Token it
Token ID340
Feature activation+1.46e-01

Pos logit contributions
 comm+0.365
mun+0.350
ters+0.333
 ang+0.332
 persu+0.319
Neg logit contributions
 journals-0.301
 journal-0.285
pron-0.257
 Christmas-0.246
ifying-0.237
Token It
Token ID632
Feature activation+1.45e-01

Pos logit contributions
 journals+0.028
 journal+0.026
pron+0.025
 Christmas+0.024
iflower+0.024
Neg logit contributions
 comm-0.023
mun-0.022
ters-0.020
 ang-0.020
 persuasion-0.020
Token saw
Token ID2497
Feature activation+4.92e-02

Pos logit contributions
 journals+0.266
 journal+0.253
pron+0.231
 Christmas+0.227
ifying+0.217
Neg logit contributions
 comm-0.251
mun-0.238
 ang-0.224
ters-0.224
 persu-0.212
Token the
Token ID262
Feature activation+5.42e-03

Pos logit contributions
 journals+0.081
 journal+0.077
 Christmas+0.069
pron+0.068
ifying+0.064
Neg logit contributions
 comm-0.093
mun-0.090
ters-0.084
 ang-0.084
 tactic-0.082
Token two
Token ID734
Feature activation-9.17e-02

Pos logit contributions
 journals+0.009
 journal+0.008
pron+0.008
 Christmas+0.007
ifying+0.007
Neg logit contributions
 comm-0.010
mun-0.010
ters-0.009
 ang-0.009
 persu-0.009
Token ot
Token ID30972
Feature activation-2.74e-01

Pos logit contributions
 comm+0.147
mun+0.139
ters+0.133
 ang+0.129
 persu+0.120
Neg logit contributions
 journals-0.180
 journal-0.170
pron-0.159
 Christmas-0.154
ouse-0.148
Tokenters
Token ID1010
Feature activation-4.67e-01

Pos logit contributions
 comm+0.362
 tactic+0.308
 persu+0.300
 persuasion+0.293
 ang+0.275
Neg logit contributions
 journal-0.589
 journals-0.575
ak-0.566
ifying-0.559
pron-0.537
Token and
Token ID290
Feature activation+6.43e-02

Pos logit contributions
 comm+0.755
mun+0.725
ters+0.694
 ang+0.669
 persu+0.638
Neg logit contributions
 journals-0.902
 journal-0.851
pron-0.793
 Christmas-0.770
ifying-0.728
Token wanted
Token ID2227
Feature activation+6.24e-02

Pos logit contributions
 journals+0.114
 journal+0.109
pron+0.097
 Christmas+0.097
ifying+0.092
Neg logit contributions
 comm-0.113
mun-0.108
ters-0.102
 ang-0.101
 tactic-0.098
Token to
Token ID284
Feature activation-5.32e-03

Pos logit contributions
 journals+0.097
 journal+0.091
 Christmas+0.082
pron+0.081
ifying+0.075
Neg logit contributions
 comm-0.124
mun-0.119
ters-0.113
 ang-0.112
 tactic-0.108
Token eat
Token ID4483
Feature activation+1.33e-03

Pos logit contributions
 comm+0.009
mun+0.008
ters+0.008
 ang+0.008
 persu+0.007
Neg logit contributions
 journals-0.010
 journal-0.010
pron-0.009
 Christmas-0.009
ifying-0.008
Token them
Token ID606
Feature activation-1.19e-01

Pos logit contributions
 journals+0.002
 journal+0.002
 Christmas+0.002
pron+0.002
ifying+0.002
Neg logit contributions
 comm-0.003
mun-0.002
 ang-0.002
 tactic-0.002
ters-0.002
Token felt
Token ID2936
Feature activation+2.43e-02

Pos logit contributions
 journals+0.001
ifying+0.001
 journal+0.001
pron+0.001
 Christmas+0.001
Neg logit contributions
 comm-0.001
mun-0.001
 warns-0.001
 ang-0.001
ters-0.001
Token,
Token ID11
Feature activation-3.32e-02

Pos logit contributions
 journals+0.051
 journal+0.048
pron+0.045
 Christmas+0.044
ifying+0.043
Neg logit contributions
 comm-0.035
mun-0.031
ters-0.031
 ang-0.030
 tactic-0.028
Token and
Token ID290
Feature activation+5.02e-02

Pos logit contributions
 comm+0.052
mun+0.048
 ang+0.047
ters+0.047
 tactic+0.044
Neg logit contributions
 journals-0.062
 journal-0.059
pron-0.056
 Christmas-0.054
ouse-0.054
Token what
Token ID644
Feature activation-6.41e-02

Pos logit contributions
 journals+0.093
 journal+0.089
pron+0.086
ifying+0.085
ak+0.081
Neg logit contributions
 comm-0.081
mun-0.074
 ang-0.071
ters-0.069
 persuasion-0.066
Token she
Token ID673
Feature activation+9.42e-04

Pos logit contributions
 comm+0.108
mun+0.103
ters+0.097
 ang+0.096
 tactic+0.092
Neg logit contributions
 journals-0.113
 journal-0.106
pron-0.102
ifying-0.100
 Christmas-0.099
Token dreamed
Token ID27947
Feature activation+3.47e-01

Pos logit contributions
 journals+0.001
ifying+0.001
 journal+0.001
pron+0.001
 Christmas+0.001
Neg logit contributions
 comm-0.002
mun-0.002
 warns-0.002
 ang-0.001
 means-0.001
Token.
Token ID13
Feature activation-4.18e-03

Pos logit contributions
 journals+0.571
 journal+0.548
pron+0.493
 Christmas+0.492
ocket+0.454
Neg logit contributions
 comm-0.655
mun-0.624
 ang-0.597
ters-0.589
 tactic-0.569
Token She
Token ID1375
Feature activation+2.11e-02

Pos logit contributions
 comm+0.007
mun+0.007
ters+0.006
 tactic+0.006
 persuasion+0.006
Neg logit contributions
 journals-0.008
 journal-0.007
pron-0.007
ocket-0.007
 Christmas-0.007
Token used
Token ID973
Feature activation+1.03e-01

Pos logit contributions
ifying+0.043
 journals+0.037
 journal+0.035
pron+0.033
alks+0.033
Neg logit contributions
 comm-0.031
mun-0.029
 means-0.026
 zo-0.025
 ang-0.025
Token cr
Token ID1067
Feature activation+1.65e-01

Pos logit contributions
 journals+0.185
ouse+0.179
ifying+0.179
 journal+0.176
pron+0.173
Neg logit contributions
 comm-0.154
 ang-0.140
 persu-0.134
 persuasion-0.132
 tactic-0.132
Tokenay
Token ID323
Feature activation+2.02e-01

Pos logit contributions
 journals+0.234
 journal+0.230
pron+0.200
 Christmas+0.199
 dreamed+0.183
Neg logit contributions
ters-0.322
 comm-0.319
mun-0.300
 persuasion-0.290
 tactic-0.290
Token She
Token ID1375
Feature activation+2.17e-02

Pos logit contributions
 comm+0.002
mun+0.002
ters+0.002
 ang+0.002
 persu+0.002
Neg logit contributions
 journals-0.002
 journal-0.002
pron-0.002
 Christmas-0.002
ifying-0.002
Token wanted
Token ID2227
Feature activation+7.21e-02

Pos logit contributions
 journals+0.044
 journal+0.041
ifying+0.039
pron+0.037
 Christmas+0.036
Neg logit contributions
 comm-0.037
mun-0.033
 ang-0.030
 warns-0.029
ters-0.028
Token to
Token ID284
Feature activation+8.63e-03

Pos logit contributions
 journals+0.118
 journal+0.112
pron+0.101
 Christmas+0.099
ifying+0.094
Neg logit contributions
 comm-0.138
mun-0.131
ters-0.125
 ang-0.124
 persu-0.119
Token show
Token ID905
Feature activation-1.26e-01

Pos logit contributions
 journals+0.018
 journal+0.017
pron+0.016
ifying+0.015
 Christmas+0.015
Neg logit contributions
 comm-0.013
mun-0.012
 ang-0.011
ters-0.011
 tactic-0.011
Token her
Token ID607
Feature activation+4.34e-02

Pos logit contributions
 comm+0.257
mun+0.246
ters+0.235
 ang+0.233
 persu+0.225
Neg logit contributions
 journals-0.191
 journal-0.181
pron-0.159
 Christmas-0.158
ifying-0.147
Token daddy
Token ID35822
Feature activation+3.20e-01

Pos logit contributions
ak+0.063
 journals+0.062
ifying+0.061
ocket+0.057
endant+0.055
Neg logit contributions
 comm-0.086
mun-0.077
 ang-0.077
 tactic-0.075
 persu-0.071
Token that
Token ID326
Feature activation-2.00e-02

Pos logit contributions
 journals+0.579
 journal+0.549
pron+0.510
 Christmas+0.491
ifying+0.484
Neg logit contributions
 comm-0.542
mun-0.520
 ang-0.496
ters-0.494
 tactic-0.466
Token she
Token ID673
Feature activation+4.61e-03

Pos logit contributions
 comm+0.036
mun+0.034
ters+0.033
 ang+0.032
 tactic+0.031
Neg logit contributions
 journals-0.034
 journal-0.032
pron-0.030
 Christmas-0.029
ifying-0.029
Token knew
Token ID2993
Feature activation+9.31e-02

Pos logit contributions
 journals+0.009
 journal+0.008
ifying+0.008
pron+0.007
 Christmas+0.007
Neg logit contributions
 comm-0.008
mun-0.008
 warns-0.007
ters-0.007
 ang-0.007
Token what
Token ID644
Feature activation-6.61e-02

Pos logit contributions
 journals+0.169
 journal+0.160
pron+0.147
 Christmas+0.143
ifying+0.139
Neg logit contributions
 comm-0.160
mun-0.153
ters-0.146
 ang-0.143
 persu-0.137
Token she
Token ID673
Feature activation+5.12e-03

Pos logit contributions
 comm+0.112
mun+0.106
 ang+0.097
ters+0.097
 persu+0.096
Neg logit contributions
 journals-0.119
 journal-0.110
ifying-0.106
pron-0.102
ouse-0.099
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
 comm+0.033
mun+0.031
ters+0.029
 ang+0.029
 tactic+0.029
Neg logit contributions
 journals-0.021
 journal-0.020
 Christmas-0.019
pron-0.017
ifying-0.017
Token Sara
Token ID24799
Feature activation-6.96e-02

Pos logit contributions
 journals+0.033
 journal+0.032
pron+0.029
 Christmas+0.028
ocket+0.027
Neg logit contributions
 comm-0.030
mun-0.028
ters-0.027
 ang-0.026
 persu-0.025
Token's
Token ID338
Feature activation+7.73e-02

Pos logit contributions
 comm+0.107
mun+0.103
 warns+0.090
 ang+0.085
 shouts+0.084
Neg logit contributions
 journals-0.151
 journal-0.139
ifying-0.139
pron-0.125
 Christmas-0.120
Token mom
Token ID1995
Feature activation+1.51e-01

Pos logit contributions
 journals+0.115
 journal+0.101
ifying+0.101
pron+0.097
ak+0.097
Neg logit contributions
 comm-0.153
 ang-0.141
mun-0.140
 persuasion-0.133
 tactic-0.133
Token and
Token ID290
Feature activation+7.29e-02

Pos logit contributions
 journals+0.286
 journal+0.266
ifying+0.238
 Christmas+0.230
pron+0.230
Neg logit contributions
 comm-0.262
mun-0.253
ters-0.232
 ang-0.227
 warns-0.220
Token dad
Token ID9955
Feature activation+2.54e-01

Pos logit contributions
 journals+0.094
 journal+0.086
pron+0.077
iflower+0.077
ifying+0.076
Neg logit contributions
 comm-0.168
 ang-0.158
mun-0.156
ters-0.148
 warns-0.145
Token came
Token ID1625
Feature activation-5.72e-02

Pos logit contributions
 journals+0.479
 journal+0.451
pron+0.406
ifying+0.397
 Christmas+0.390
Neg logit contributions
 comm-0.432
mun-0.422
ters-0.385
 ang-0.381
 persuasion-0.360
Token over
Token ID625
Feature activation-4.85e-02

Pos logit contributions
 comm+0.118
mun+0.113
ters+0.107
 ang+0.107
 tactic+0.103
Neg logit contributions
 journals-0.084
 journal-0.080
 Christmas-0.070
pron-0.069
ifying-0.065
Token and
Token ID290
Feature activation+7.33e-02

Pos logit contributions
 comm+0.103
mun+0.098
ters+0.094
 ang+0.092
 tactic+0.090
Neg logit contributions
 journals-0.070
 journal-0.066
 Christmas-0.060
pron-0.057
ifying-0.054
Token gave
Token ID2921
Feature activation+4.48e-02

Pos logit contributions
 journals+0.106
 journal+0.101
pron+0.091
 Christmas+0.089
 worn+0.081
Neg logit contributions
 comm-0.151
mun-0.142
ters-0.142
 tactic-0.135
 ang-0.135
Token them
Token ID606
Feature activation-9.28e-02

Pos logit contributions
 journals+0.055
 Christmas+0.049
 journal+0.048
 Daddy+0.040
ocket+0.039
Neg logit contributions
 comm-0.104
mun-0.091
 tactic-0.091
 persu-0.089
 ang-0.089
Token't
Token ID470
Feature activation-7.24e-02

Pos logit contributions
 journals+0.324
 journal+0.311
pron+0.284
 Christmas+0.281
ifying+0.268
Neg logit contributions
 comm-0.281
mun-0.266
ters-0.252
 ang-0.250
 tactic-0.237
Token find
Token ID1064
Feature activation+1.07e-01

Pos logit contributions
 comm+0.099
mun+0.087
ters+0.085
 ang+0.082
 tactic+0.081
Neg logit contributions
 journals-0.160
 journal-0.150
pron-0.142
iflower-0.136
 Christmas-0.136
Token her
Token ID607
Feature activation+3.23e-02

Pos logit contributions
 journals+0.178
 journal+0.167
pron+0.158
 Christmas+0.150
ifying+0.146
Neg logit contributions
 comm-0.199
mun-0.188
ters-0.183
 ang-0.182
 tactic-0.174
Token cr
Token ID1067
Feature activation+1.58e-01

Pos logit contributions
 journals+0.047
ifying+0.043
 journal+0.042
pron+0.041
ak+0.041
Neg logit contributions
 comm-0.063
 ang-0.058
 tactic-0.056
 persu-0.054
mun-0.053
Tokenay
Token ID323
Feature activation+1.93e-01

Pos logit contributions
 journals+0.258
 journal+0.238
 Christmas+0.220
 dreamed+0.206
pron+0.201
Neg logit contributions
ters-0.287
 comm-0.273
whe-0.255
 tactic-0.246
mun-0.242
Tokenon
Token ID261
Feature activation+3.40e-01

Pos logit contributions
 journals+0.181
 journal+0.153
ifying+0.137
 Christmas+0.131
pron+0.121
Neg logit contributions
 comm-0.474
ters-0.460
mun-0.460
 ang-0.450
 tactic-0.442
Token and
Token ID290
Feature activation+4.62e-02

Pos logit contributions
 journals+0.630
 journal+0.577
pron+0.533
 Christmas+0.526
ifying+0.509
Neg logit contributions
 comm-0.573
ters-0.536
mun-0.526
 ang-0.520
 tactic-0.490
Token she
Token ID673
Feature activation-8.70e-03

Pos logit contributions
 journals+0.078
 journal+0.074
pron+0.068
 Christmas+0.066
ifying+0.064
Neg logit contributions
 comm-0.087
mun-0.081
 ang-0.078
ters-0.077
 tactic-0.073
Token thought
Token ID1807
Feature activation-4.47e-02

Pos logit contributions
 comm+0.016
mun+0.015
 ang+0.014
ters+0.014
 persu+0.013
Neg logit contributions
 journals-0.015
 journal-0.015
pron-0.013
 Christmas-0.013
ifying-0.013
Token she
Token ID673
Feature activation-7.67e-03

Pos logit contributions
 comm+0.069
mun+0.064
 persuasion+0.063
ters+0.063
 persu+0.063
Neg logit contributions
 journals-0.082
 journal-0.073
ifying-0.073
pron-0.068
ouse-0.068
Token had
Token ID550
Feature activation+2.12e-01

Pos logit contributions
 comm+0.014
mun+0.013
ters+0.012
 ang+0.012
 tactic+0.012
Neg logit contributions
 journals-0.013
 journal-0.012
pron-0.012
 Christmas-0.011
ifying-0.011
Token head
Token ID1182
Feature activation+1.39e-01

Pos logit contributions
 journals+0.069
 journal+0.063
pron+0.058
 Christmas+0.055
ifying+0.055
Neg logit contributions
 comm-0.095
 ang-0.084
mun-0.083
ters-0.083
 persu-0.081
Token and
Token ID290
Feature activation+6.02e-02

Pos logit contributions
 journals+0.257
 journal+0.236
pron+0.209
 Christmas+0.205
iflower+0.205
Neg logit contributions
 comm-0.239
ters-0.226
mun-0.225
 ang-0.220
 persu-0.209
Token carefully
Token ID7773
Feature activation+1.04e-01

Pos logit contributions
 journals+0.119
 journal+0.114
pron+0.105
 Christmas+0.103
ifying+0.099
Neg logit contributions
 comm-0.094
mun-0.089
ters-0.084
 ang-0.083
 persu-0.078
Token opened
Token ID4721
Feature activation+1.32e-01

Pos logit contributions
 journals+0.174
 journal+0.164
pron+0.148
 Christmas+0.145
ifying+0.140
Neg logit contributions
 comm-0.197
mun-0.185
 ang-0.178
ters-0.176
 persu-0.170
Token the
Token ID262
Feature activation+8.09e-03

Pos logit contributions
 journals+0.263
 journal+0.250
pron+0.245
ouse+0.235
iflower+0.228
Neg logit contributions
 comm-0.192
 ang-0.180
ters-0.175
mun-0.172
 persu-0.167
Token shell
Token ID7582
Feature activation+2.59e-01

Pos logit contributions
 journals+0.010
pron+0.009
 Christmas+0.008
 journal+0.008
ifying+0.007
Neg logit contributions
 comm-0.019
 ang-0.017
 tactic-0.016
mun-0.016
 persu-0.016
Token.
Token ID13
Feature activation+3.80e-03

Pos logit contributions
 journals+0.459
 journal+0.426
pron+0.390
 Christmas+0.387
ifying+0.366
Neg logit contributions
 comm-0.456
mun-0.430
ters-0.427
 ang-0.409
 tactic-0.397
Token Inside
Token ID14384
Feature activation+1.64e-01

Pos logit contributions
 journals+0.007
 journal+0.007
pron+0.006
 Christmas+0.006
iflower+0.006
Neg logit contributions
 comm-0.006
mun-0.006
ters-0.006
 ang-0.006
 persu-0.006
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
 journals+0.355
 journal+0.325
pron+0.300
ifying+0.291
 Christmas+0.289
Neg logit contributions
mun-0.234
 comm-0.227
ters-0.214
 ang-0.202
 tactic-0.198
Token there
Token ID612
Feature activation+7.44e-02

Pos logit contributions
 comm+0.042
mun+0.040
ters+0.039
 ang+0.038
 persu+0.036
Neg logit contributions
 journals-0.047
 journal-0.043
pron-0.040
 Christmas-0.039
iflower-0.038
Token was
Token ID373
Feature activation-9.38e-02

Pos logit contributions
 journals+0.138
 journal+0.133
pron+0.121
 Christmas+0.120
ifying+0.113
Neg logit contributions
 comm-0.124
mun-0.116
 ang-0.111
ters-0.111
 persu-0.106
Token
Token ID198
Feature activation-1.90e-02

Pos logit contributions
 comm+0.038
mun+0.036
 ang+0.034
ters+0.034
 persu+0.033
Neg logit contributions
 journals-0.026
 journal-0.025
 Christmas-0.023
pron-0.022
ak-0.021
Token"
Token ID1
Feature activation+1.37e-01

Pos logit contributions
 comm+0.036
mun+0.035
 ang+0.033
ters+0.033
 tactic+0.032
Neg logit contributions
 journals-0.031
 journal-0.029
pron-0.027
 Christmas-0.026
ifying-0.025
TokenHelp
Token ID22087
Feature activation-1.54e-01

Pos logit contributions
 journals+0.221
 journal+0.216
pron+0.212
 worn+0.211
 Christmas+0.201
Neg logit contributions
 comm-0.229
mun-0.225
 ang-0.208
 tactic-0.207
 persuasion-0.206
Token!
Token ID0
Feature activation+2.05e-02

Pos logit contributions
 comm+0.334
mun+0.334
 tactic+0.304
 ang+0.303
ters+0.303
Neg logit contributions
 journal-0.187
 journals-0.182
 Christmas-0.168
ak-0.157
on-0.149
Token Help
Token ID10478
Feature activation+6.50e-02

Pos logit contributions
 journals+0.037
 journal+0.034
pron+0.031
 Christmas+0.029
iflower+0.028
Neg logit contributions
 comm-0.037
ters-0.035
mun-0.034
 ang-0.034
 persu-0.032
Token!"
Token ID2474
Feature activation+8.57e-02

Pos logit contributions
 journals+0.099
 journal+0.093
pron+0.083
 Christmas+0.081
ifying+0.077
Neg logit contributions
 comm-0.133
mun-0.127
ters-0.122
 ang-0.121
 persu-0.116
Token Tweet
Token ID18752
Feature activation-1.65e-01

Pos logit contributions
 journal+0.118
 Christmas+0.112
 journals+0.112
 dreamed+0.109
 pillow+0.103
Neg logit contributions
mun-0.157
 comm-0.147
ters-0.146
 tactic-0.144
 ang-0.140
Tokenie
Token ID494
Feature activation+5.68e-02

Pos logit contributions
 comm+0.392
mun+0.382
 persu+0.372
 persuasion+0.368
 inconvenience+0.363
Neg logit contributions
 journal-0.158
 journals-0.157
ak-0.151
ouse-0.124
iflower-0.124
Token cried
Token ID16896
Feature activation+4.62e-02

Pos logit contributions
ak+0.068
 journal+0.067
 thumb+0.067
on+0.064
 dreamed+0.062
Neg logit contributions
 inconvenience-0.111
mun-0.110
 comm-0.105
 persuasion-0.102
 casc-0.102
Token.
Token ID13
Feature activation+6.54e-03

Pos logit contributions
 journal+0.052
 journals+0.052
 Christmas+0.047
 forever+0.041
 thumb+0.041
Neg logit contributions
mun-0.105
 comm-0.103
 tactic-0.095
 inconvenience-0.094
 persuasion-0.092
Token A
Token ID317
Feature activation-1.72e-02

Pos logit contributions
 journal+0.007
 journals+0.007
 Christmas+0.007
on+0.006
 dreamed+0.006
Neg logit contributions
 comm-0.014
mun-0.014
 ang-0.013
ters-0.013
 tactic-0.012
Token Lily
Token ID20037
Feature activation-2.66e-02

Pos logit contributions
 journals+0.014
 journal+0.013
 Christmas+0.012
pron+0.012
 fork+0.011
Neg logit contributions
 comm-0.016
mun-0.016
ters-0.015
 ang-0.015
 tactic-0.015
Token says
Token ID1139
Feature activation-2.22e-01

Pos logit contributions
ters+0.039
mun+0.038
 persu+0.036
 persuasion+0.036
 casc+0.035
Neg logit contributions
 journal-0.048
 journals-0.047
 fork-0.043
 notebook-0.043
 dreamed-0.041
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
 comm+0.476
mun+0.461
 ang+0.427
 tactic+0.425
ters+0.421
Neg logit contributions
 journals-0.296
 journal-0.294
pron-0.254
 Christmas-0.252
 fork-0.231
Token "
Token ID366
Feature activation-1.24e-01

Pos logit contributions
 journals+0.030
 journal+0.030
 Christmas+0.027
pron+0.026
ifying+0.025
Neg logit contributions
 comm-0.036
mun-0.036
 ang-0.033
ters-0.032
 tactic-0.032
TokenWe
Token ID1135
Feature activation-1.68e-02

Pos logit contributions
 comm+0.189
mun+0.178
ters+0.174
 ang+0.167
 persu+0.155
Neg logit contributions
 journals-0.257
 journal-0.246
pron-0.224
 Christmas-0.220
ocket-0.213
Token have
Token ID423
Feature activation+1.40e-01

Pos logit contributions
 comm+0.028
mun+0.026
ters+0.025
 ang+0.024
 warns+0.024
Neg logit contributions
 journals-0.033
 journal-0.030
pron-0.030
iflower-0.028
 Christmas-0.027
Token to
Token ID284
Feature activation-5.97e-03

Pos logit contributions
 journals+0.294
 journal+0.284
pron+0.281
iflower+0.270
ouse+0.264
Neg logit contributions
 comm-0.205
 ang-0.177
ters-0.161
mun-0.158
 persu-0.155
Token help
Token ID1037
Feature activation-1.90e-01

Pos logit contributions
 comm+0.010
mun+0.010
ters+0.009
 ang+0.009
 persu+0.009
Neg logit contributions
 journals-0.011
 journal-0.010
pron-0.009
 Christmas-0.009
ifying-0.009
Token him
Token ID683
Feature activation-3.23e-02

Pos logit contributions
 comm+0.395
mun+0.381
 ang+0.363
ters+0.356
 tactic+0.352
Neg logit contributions
 journals-0.271
 journal-0.256
 Christmas-0.237
pron-0.223
iflower-0.208
Token."
Token ID526
Feature activation+1.64e-01

Pos logit contributions
 comm+0.065
mun+0.062
ters+0.060
 ang+0.059
 tactic+0.056
Neg logit contributions
 journals-0.051
 journal-0.047
pron-0.042
 Christmas-0.041
ifying-0.039
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
 journals+0.236
 journal+0.225
 Christmas+0.198
pron+0.195
 notebook+0.180
Neg logit contributions
 comm-0.338
mun-0.324
 ang-0.310
ters-0.309
 tactic-0.300
Token.
Token ID13
Feature activation-8.19e-03

Pos logit contributions
 comm+0.022
mun+0.020
ters+0.020
 ang+0.019
 persu+0.018
Neg logit contributions
 journals-0.023
 journal-0.021
pron-0.020
 Christmas-0.019
ouse-0.018
Token The
Token ID383
Feature activation+8.17e-02

Pos logit contributions
 comm+0.012
mun+0.011
 ang+0.010
 persuasion+0.010
ters+0.010
Neg logit contributions
 journals-0.017
 journal-0.017
pron-0.016
ocket-0.015
 Christmas-0.015
Token end
Token ID886
Feature activation-8.26e-02

Pos logit contributions
 journals+0.134
 journal+0.131
 Christmas+0.125
pron+0.121
ak+0.110
Neg logit contributions
 comm-0.145
mun-0.145
ters-0.141
 tactic-0.135
 ang-0.128
Token.
Token ID13
Feature activation-7.98e-03

Pos logit contributions
 comm+0.156
mun+0.148
ters+0.141
 ang+0.140
 persu+0.134
Neg logit contributions
 journals-0.138
 journal-0.131
pron-0.118
 Christmas-0.116
ifying-0.110
Token<|endoftext|>
Token ID50256
Feature activation+2.39e-02

Pos logit contributions
 comm+0.011
mun+0.010
ters+0.009
 ang+0.009
 persu+0.009
Neg logit contributions
 journals-0.018
 journal-0.017
pron-0.016
 Christmas-0.015
ocket-0.015
TokenOnce
Token ID7454
Feature activation+9.03e-02

Pos logit contributions
 journals+0.047
 journal+0.045
pron+0.041
 Christmas+0.040
ifying+0.039
Neg logit contributions
 comm-0.038
mun-0.036
ters-0.034
 ang-0.033
 persu-0.032
Token upon
Token ID2402
Feature activation+5.45e-02

Pos logit contributions
 journals+0.185
 journal+0.161
pron+0.158
ifying+0.156
ocket+0.151
Neg logit contributions
mun-0.131
 comm-0.127
 ang-0.123
 tactic-0.120
 warns-0.114
Token a
Token ID257
Feature activation-2.80e-02

Pos logit contributions
 journals+0.114
 journal+0.105
pron+0.099
ifying+0.098
ouse+0.096
Neg logit contributions
 comm-0.080
mun-0.073
 ang-0.067
ters-0.067
 tactic-0.065
Token time
Token ID640
Feature activation+4.53e-02

Pos logit contributions
 comm+0.052
mun+0.048
 persu+0.044
 ang+0.044
 persuasion+0.043
Neg logit contributions
 journals-0.049
 journal-0.046
ocket-0.040
ifying-0.040
pron-0.040
Token,
Token ID11
Feature activation-4.03e-02

Pos logit contributions
 journals+0.072
 journal+0.070
 Christmas+0.063
pron+0.061
ifying+0.057
Neg logit contributions
 comm-0.088
mun-0.083
ters-0.080
 ang-0.079
 tactic-0.075
Token there
Token ID612
Feature activation+6.36e-02

Pos logit contributions
 comm+0.072
mun+0.070
ters+0.068
 ang+0.066
 persu+0.064
Neg logit contributions
 journals-0.071
 journal-0.064
pron-0.060
ocket-0.057
 Christmas-0.056
Token quickly
Token ID2952
Feature activation+3.78e-02

Pos logit contributions
 comm+0.136
ters+0.132
mun+0.130
 louder+0.128
 ang+0.125
Neg logit contributions
 journals-0.156
 journal-0.147
ocket-0.128
pron-0.127
 notebook-0.127
Token.
Token ID13
Feature activation-1.95e-03

Pos logit contributions
 journals+0.079
 journal+0.072
pron+0.066
iflower+0.063
ifying+0.063
Neg logit contributions
 comm-0.059
ters-0.052
mun-0.049
 ang-0.049
 persuasion-0.048
Token When
Token ID1649
Feature activation-6.34e-02

Pos logit contributions
 comm+0.003
mun+0.003
ters+0.003
 persuasion+0.003
 ang+0.003
Neg logit contributions
 journals-0.004
 journal-0.003
pron-0.003
iflower-0.003
ocket-0.003
Token it
Token ID340
Feature activation+1.51e-01

Pos logit contributions
 comm+0.112
mun+0.102
 ang+0.100
 persu+0.099
 persuasion+0.096
Neg logit contributions
 journals-0.115
 journal-0.106
ouse-0.100
ak-0.099
pron-0.099
Token was
Token ID373
Feature activation-9.89e-02

Pos logit contributions
 journals+0.275
 journal+0.266
pron+0.243
 Christmas+0.242
ifying+0.228
Neg logit contributions
 comm-0.256
mun-0.245
ters-0.232
 ang-0.229
 tactic-0.219
Token ready
Token ID3492
Feature activation+5.06e-02

Pos logit contributions
 comm+0.164
 ang+0.152
 persu+0.142
 louder+0.140
 persuasion+0.137
Neg logit contributions
 journals-0.195
 journal-0.176
iflower-0.160
ocket-0.159
ouse-0.157
Token,
Token ID11
Feature activation-2.91e-02

Pos logit contributions
 journals+0.105
 journal+0.097
pron+0.090
ifying+0.087
iflower+0.085
Neg logit contributions
 comm-0.077
mun-0.070
 ang-0.069
ters-0.068
 persu-0.065
Token the
Token ID262
Feature activation+5.76e-03

Pos logit contributions
 comm+0.060
mun+0.058
ters+0.055
 ang+0.054
 tactic+0.053
Neg logit contributions
 journals-0.042
 journal-0.041
 Christmas-0.037
pron-0.036
ifying-0.033
Token woman
Token ID2415
Feature activation-2.18e-02

Pos logit contributions
 journals+0.009
 journal+0.009
pron+0.008
 Christmas+0.008
ifying+0.007
Neg logit contributions
 comm-0.011
mun-0.011
ters-0.010
 ang-0.010
 persu-0.010
Token took
Token ID1718
Feature activation+2.40e-02

Pos logit contributions
 comm+0.036
mun+0.034
ters+0.032
 ang+0.031
 warns+0.030
Neg logit contributions
 journals-0.043
 journal-0.039
ifying-0.036
pron-0.035
 Christmas-0.034
Token the
Token ID262
Feature activation+5.87e-03

Pos logit contributions
 journals+0.043
 journal+0.041
pron+0.039
ifying+0.036
ouse+0.036
Neg logit contributions
 comm-0.042
mun-0.039
ters-0.039
 ang-0.038
 persu-0.036
Token jelly
Token ID27644
Feature activation+9.07e-02

Pos logit contributions
 journals+0.020
 journal+0.019
pron+0.018
ouse+0.018
ifying+0.017
Neg logit contributions
 comm-0.021
mun-0.019
 ang-0.018
ters-0.018
 persu-0.017
Token and
Token ID290
Feature activation+8.20e-02

Pos logit contributions
 journals+0.173
 journal+0.156
pron+0.146
 Christmas+0.142
 handwriting+0.135
Neg logit contributions
ters-0.150
 comm-0.145
mun-0.139
 ang-0.133
 warns-0.124
Token ran
Token ID4966
Feature activation+1.35e-02

Pos logit contributions
 journals+0.158
 journal+0.149
pron+0.145
 Christmas+0.137
ouse+0.136
Neg logit contributions
 comm-0.138
 ang-0.124
mun-0.123
ters-0.115
 warns-0.109
Token away
Token ID1497
Feature activation-4.11e-02

Pos logit contributions
 journals+0.023
 journal+0.022
pron+0.020
 Christmas+0.020
ifying+0.019
Neg logit contributions
 comm-0.025
mun-0.024
ters-0.023
 ang-0.022
 tactic-0.022
Token.
Token ID13
Feature activation+2.64e-02

Pos logit contributions
mun+0.093
 comm+0.091
ters+0.083
 ang+0.082
 tactic+0.082
Neg logit contributions
 journal-0.046
 journals-0.045
 Christmas-0.042
 fork-0.037
ifying-0.036
Token The
Token ID383
Feature activation+1.10e-01

Pos logit contributions
 journals+0.050
 journal+0.048
pron+0.044
 Christmas+0.042
ocket+0.041
Neg logit contributions
 comm-0.044
mun-0.041
ters-0.040
 ang-0.039
 persu-0.037
Token jelly
Token ID27644
Feature activation+9.15e-02

Pos logit contributions
 journals+0.169
 journal+0.157
pron+0.140
 Christmas+0.138
ifying+0.132
Neg logit contributions
 comm-0.224
mun-0.212
ters-0.203
 ang-0.203
 persu-0.194
Token spl
Token ID4328
Feature activation-1.57e-01

Pos logit contributions
 journals+0.169
 journal+0.152
 Christmas+0.139
ifying+0.137
pron+0.136
Neg logit contributions
 comm-0.160
mun-0.152
ters-0.149
 ang-0.141
 warns-0.131
Tokenashed
Token ID5263
Feature activation-8.18e-03

Pos logit contributions
 comm+0.250
mun+0.239
ters+0.238
 ang+0.228
 persu+0.212
Neg logit contributions
 journals-0.311
 journal-0.299
 Christmas-0.271
pron-0.268
iflower-0.248
Token on
Token ID319
Feature activation-1.79e-02

Pos logit contributions
 comm+0.016
mun+0.015
ters+0.014
 ang+0.014
 persu+0.014
Neg logit contributions
 journals-0.013
 journal-0.013
pron-0.011
 Christmas-0.011
ifying-0.010
Token the
Token ID262
Feature activation+2.62e-02

Pos logit contributions
 comm+0.028
ters+0.026
mun+0.025
 ang+0.025
 persu+0.023
Neg logit contributions
 journals-0.036
 journal-0.034
pron-0.031
ouse-0.031
ocket-0.030
Token fun
Token ID1257
Feature activation-9.87e-02

Pos logit contributions
 journals+0.323
 journal+0.310
pron+0.286
 Christmas+0.280
ifying+0.273
Neg logit contributions
 comm-0.234
mun-0.219
ters-0.206
 ang-0.204
 persu-0.193
Token.
Token ID13
Feature activation+6.44e-03

Pos logit contributions
 comm+0.202
mun+0.190
 ang+0.185
ters+0.179
 tactic+0.175
Neg logit contributions
 journals-0.151
 journal-0.142
 Christmas-0.128
pron-0.124
ifying-0.116
Token Sometimes
Token ID8975
Feature activation-1.16e-01

Pos logit contributions
 journals+0.009
 journal+0.008
 Christmas+0.008
pron+0.007
ifying+0.007
Neg logit contributions
 comm-0.014
mun-0.013
ters-0.012
 ang-0.012
 persu-0.012
Token they
Token ID484
Feature activation-2.42e-02

Pos logit contributions
 comm+0.197
mun+0.187
ters+0.178
 ang+0.175
 persu+0.167
Neg logit contributions
 journals-0.215
 journal-0.205
pron-0.188
 Christmas-0.182
ifying-0.177
Token play
Token ID711
Feature activation+4.03e-02

Pos logit contributions
 comm+0.039
mun+0.039
 ang+0.036
ters+0.035
 warns+0.035
Neg logit contributions
 journals-0.045
 journal-0.043
pron-0.041
iflower-0.039
ouse-0.038
Token cow
Token ID9875
Feature activation-1.85e-01

Pos logit contributions
 journal+0.055
 journals+0.054
 Christmas+0.052
pron+0.047
ak+0.042
Neg logit contributions
 comm-0.088
 ang-0.080
mun-0.080
 persu-0.076
 persuasion-0.074
Tokenboys
Token ID13202
Feature activation-1.83e-01

Pos logit contributions
 comm+0.355
 ang+0.316
mun+0.316
 tactic+0.306
 persu+0.303
Neg logit contributions
 journals-0.299
 journal-0.291
 Christmas-0.273
pron-0.252
 notebook-0.241
Token and
Token ID290
Feature activation+5.88e-02

Pos logit contributions
 comm+0.408
mun+0.384
 ang+0.374
ters+0.362
 persu+0.361
Neg logit contributions
 journals-0.245
 journal-0.233
 Christmas-0.206
pron-0.200
ifying-0.180
Token cow
Token ID9875
Feature activation-1.84e-01

Pos logit contributions
 journals+0.115
 journal+0.110
pron+0.101
 Christmas+0.099
ifying+0.095
Neg logit contributions
 comm-0.094
mun-0.089
ters-0.084
 ang-0.083
 persu-0.078
Tokengirls
Token ID36960
Feature activation-2.88e-01

Pos logit contributions
 comm+0.293
mun+0.258
 ang+0.253
 tactic+0.246
 persu+0.242
Neg logit contributions
 journals-0.365
 journal-0.352
 Christmas-0.329
pron-0.309
 notebook-0.302
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
 comm+0.539
mun+0.515
ters+0.493
 ang+0.486
 persu+0.466
Neg logit contributions
 journals-0.477
 journal-0.453
pron-0.408
 Christmas-0.401
ifying-0.381
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
 comm+0.136
mun+0.132
ters+0.126
 ang+0.121
 tactic+0.116
Neg logit contributions
 journals-0.143
 journal-0.136
pron-0.126
ifying-0.120
 Christmas-0.120
Token They
Token ID1119
Feature activation-2.16e-02

Pos logit contributions
 comm+0.027
mun+0.026
ters+0.025
 ang+0.024
 persu+0.023
Neg logit contributions
 journals-0.022
 journal-0.021
pron-0.019
 Christmas-0.019
ifying-0.018
Token were
Token ID547
Feature activation-5.07e-02

Pos logit contributions
mun+0.036
 comm+0.035
ters+0.034
 ang+0.030
 warns+0.029
Neg logit contributions
 journals-0.040
 journal-0.038
pron-0.037
iflower-0.035
ocket-0.034
Token very
Token ID845
Feature activation-1.40e-02

Pos logit contributions
 comm+0.089
mun+0.084
ters+0.080
 ang+0.080
 tactic+0.076
Neg logit contributions
 journals-0.091
 journal-0.087
pron-0.078
 Christmas-0.078
ifying-0.074
Token adventurous
Token ID39770
Feature activation-2.12e-02

Pos logit contributions
 comm+0.025
mun+0.024
ters+0.023
 ang+0.023
 persu+0.022
Neg logit contributions
 journals-0.025
 journal-0.023
pron-0.021
ifying-0.020
 Christmas-0.020
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 comm+0.040
mun+0.038
ters+0.036
 ang+0.035
 tactic+0.034
Neg logit contributions
 journals-0.036
 journal-0.034
pron-0.032
 Christmas-0.030
ifying-0.030
Token One
Token ID1881
Feature activation+2.66e-02

Pos logit contributions
 comm+0.024
mun+0.023
ters+0.021
 ang+0.021
 tactic+0.021
Neg logit contributions
 journals-0.024
 journal-0.023
pron-0.021
 Christmas-0.020
ocket-0.019
Token day
Token ID1110
Feature activation+1.90e-02

Pos logit contributions
 journals+0.042
 journal+0.040
pron+0.036
 Christmas+0.035
ifying+0.033
Neg logit contributions
 comm-0.053
mun-0.050
ters-0.048
 ang-0.048
 persu-0.046
Token both
Token ID1111
Feature activation+1.87e-01

Pos logit contributions
 journals+0.030
 journal+0.029
 Christmas+0.025
pron+0.025
ifying+0.023
Neg logit contributions
 comm-0.038
mun-0.036
 ang-0.035
ters-0.034
 tactic-0.033
Token Jack
Token ID3619
Feature activation+1.36e-01

Pos logit contributions
 journals+0.349
 journal+0.326
pron+0.324
ifying+0.301
ocket+0.297
Neg logit contributions
 comm-0.312
mun-0.294
ters-0.290
 ang-0.277
 tactic-0.266
Token and
Token ID290
Feature activation+4.72e-02

Pos logit contributions
 journals+0.271
 journal+0.250
pron+0.241
ifying+0.237
ocket+0.221
Neg logit contributions
 comm-0.224
mun-0.212
ters-0.192
 warns-0.190
 ang-0.177
Token?"
Token ID1701
Feature activation+1.46e-01

Pos logit contributions
 comm+0.081
mun+0.077
ters+0.074
 ang+0.073
 persu+0.068
Neg logit contributions
 journals-0.080
 journal-0.075
pron-0.068
 Christmas-0.066
ifying-0.064
Token She
Token ID1375
Feature activation+2.39e-02

Pos logit contributions
 journals+0.199
 journal+0.191
 Christmas+0.169
pron+0.164
 notebook+0.155
Neg logit contributions
 comm-0.304
ters-0.293
mun-0.292
 ang-0.276
 tactic-0.274
Token looks
Token ID3073
Feature activation+1.18e-02

Pos logit contributions
 journals+0.040
 journal+0.038
 Christmas+0.033
 fork+0.032
pron+0.032
Neg logit contributions
 comm-0.043
ters-0.042
mun-0.041
 tactic-0.040
 persuasion-0.039
Token at
Token ID379
Feature activation-8.20e-03

Pos logit contributions
 journals+0.020
 journal+0.019
pron+0.017
 Christmas+0.017
ifying+0.016
Neg logit contributions
 comm-0.022
mun-0.021
ters-0.020
 ang-0.019
 tactic-0.019
Token his
Token ID465
Feature activation+1.10e-01

Pos logit contributions
 comm+0.014
mun+0.014
ters+0.013
 ang+0.013
 persu+0.012
Neg logit contributions
 journals-0.015
 journal-0.014
pron-0.013
 Christmas-0.012
ifying-0.012
Token car
Token ID1097
Feature activation-1.09e-02

Pos logit contributions
 journals+0.195
 journal+0.181
pron+0.168
 Christmas+0.166
ifying+0.161
Neg logit contributions
 comm-0.196
mun-0.180
 ang-0.179
ters-0.173
 tactic-0.167
Token and
Token ID290
Feature activation+6.68e-02

Pos logit contributions
 comm+0.025
mun+0.024
ters+0.022
 ang+0.022
 tactic+0.022
Neg logit contributions
 journals-0.014
 journal-0.013
 Christmas-0.011
pron-0.011
 notebook-0.010
Token says
Token ID1139
Feature activation-2.24e-01

Pos logit contributions
 journals+0.136
 journal+0.132
 Christmas+0.116
pron+0.114
 notebook+0.113
Neg logit contributions
 comm-0.099
mun-0.094
ters-0.094
 tactic-0.086
 persuasion-0.084
Token "
Token ID366
Feature activation-1.23e-01

Pos logit contributions
 comm+0.332
mun+0.312
ters+0.295
 ang+0.291
 persu+0.275
Neg logit contributions
 journals-0.462
 journal-0.443
pron-0.407
 Christmas-0.402
ifying-0.386
TokenI
Token ID40
Feature activation+5.31e-02

Pos logit contributions
 comm+0.201
mun+0.189
ters+0.178
 ang+0.177
 tactic+0.170
Neg logit contributions
 journals-0.232
 journal-0.222
pron-0.208
 Christmas-0.203
ifying-0.195
Token'm
Token ID1101
Feature activation-1.45e-01

Pos logit contributions
 journal+0.084
 journals+0.083
 Christmas+0.077
pron+0.076
 dreamed+0.074
Neg logit contributions
 comm-0.097
ters-0.093
mun-0.091
 ang-0.087
 tactic-0.084
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
 comm+0.140
ters+0.134
mun+0.133
 ang+0.122
 persuasion+0.122
Neg logit contributions
 journals-0.186
 journal-0.173
pron-0.163
ifying-0.160
iflower-0.155
Token They
Token ID1119
Feature activation+1.15e-02

Pos logit contributions
 journals+0.023
 journal+0.022
 Christmas+0.019
pron+0.018
ifying+0.017
Neg logit contributions
 comm-0.034
mun-0.033
 ang-0.031
ters-0.030
 persu-0.030
Token did
Token ID750
Feature activation-2.80e-02

Pos logit contributions
 journals+0.020
 journal+0.019
pron+0.019
iflower+0.018
ifying+0.017
Neg logit contributions
 comm-0.019
mun-0.018
 ang-0.018
ters-0.018
 warns-0.017
Token not
Token ID407
Feature activation+3.80e-02

Pos logit contributions
 comm+0.052
mun+0.049
ters+0.047
 ang+0.047
 persu+0.045
Neg logit contributions
 journals-0.047
 journal-0.045
pron-0.040
 Christmas-0.039
ifying-0.038
Token match
Token ID2872
Feature activation-5.68e-02

Pos logit contributions
 journals+0.074
pron+0.071
 journal+0.070
ifying+0.068
iflower+0.068
Neg logit contributions
 comm-0.059
ters-0.051
 ang-0.051
 tactic-0.049
 persuasion-0.048
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 comm+0.091
ters+0.089
mun+0.085
 ang+0.084
 persu+0.078
Neg logit contributions
 journals-0.107
 journal-0.103
pron-0.098
 Christmas-0.090
ouse-0.088
Token but
Token ID475
Feature activation-1.85e-01

Pos logit contributions
 comm+0.017
mun+0.016
ters+0.016
 ang+0.016
 persu+0.015
Neg logit contributions
 journals-0.019
 journal-0.018
pron-0.017
 Christmas-0.016
iflower-0.016
Token they
Token ID484
Feature activation-1.93e-02

Pos logit contributions
 comm+0.323
mun+0.307
ters+0.295
 ang+0.291
 persu+0.278
Neg logit contributions
 journals-0.332
 journal-0.315
pron-0.289
 Christmas-0.281
ifying-0.272
Token looked
Token ID3114
Feature activation+8.79e-02

Pos logit contributions
 comm+0.031
mun+0.030
ters+0.029
 ang+0.028
 warns+0.027
Neg logit contributions
 journals-0.035
 journal-0.033
pron-0.031
iflower-0.030
 Christmas-0.029
Token nice
Token ID3621
Feature activation-1.11e-02

Pos logit contributions
 journals+0.179
 journal+0.172
pron+0.168
 Christmas+0.159
iflower+0.159
Neg logit contributions
 comm-0.126
 ang-0.117
mun-0.111
ters-0.111
 louder-0.106
Token.
Token ID13
Feature activation+1.79e-02

Pos logit contributions
 comm+0.018
ters+0.018
mun+0.017
 ang+0.016
 tactic+0.015
Neg logit contributions
 journals-0.020
 journal-0.019
pron-0.019
 Christmas-0.018
iflower-0.017
Token was
Token ID373
Feature activation-1.00e-01

Pos logit contributions
 journals+0.311
 journal+0.298
pron+0.272
 Christmas+0.269
ifying+0.258
Neg logit contributions
 comm-0.249
mun-0.235
ters-0.222
 ang-0.219
 persu-0.208
Token pink
Token ID11398
Feature activation+1.35e-01

Pos logit contributions
 comm+0.163
 ang+0.146
ters+0.143
mun+0.140
 persu+0.138
Neg logit contributions
 journals-0.194
 journal-0.180
pron-0.159
ouse-0.159
ifying-0.158
Token and
Token ID290
Feature activation+5.79e-02

Pos logit contributions
 journals+0.244
 journal+0.221
pron+0.203
ifying+0.195
ocket+0.194
Neg logit contributions
 comm-0.235
ters-0.222
mun-0.221
 tactic-0.199
 ang-0.198
Token spark
Token ID9009
Feature activation+1.11e-01

Pos logit contributions
 journals+0.096
 journal+0.087
ocket+0.080
ifying+0.076
aire+0.074
Neg logit contributions
 comm-0.104
ters-0.093
mun-0.092
 louder-0.092
 ang-0.090
Tokenly
Token ID306
Feature activation-3.00e-02

Pos logit contributions
 journals+0.190
 journal+0.181
pron+0.165
 Christmas+0.164
ifying+0.155
Neg logit contributions
 comm-0.205
mun-0.195
 ang-0.185
ters-0.180
 persu-0.177
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
 comm+0.049
ters+0.046
mun+0.044
 ang+0.040
 tactic+0.040
Neg logit contributions
 journals-0.057
 journal-0.053
pron-0.050
 Christmas-0.046
iflower-0.046
Token and
Token ID290
Feature activation+5.83e-02

Pos logit contributions
 comm+0.037
mun+0.035
ters+0.033
 ang+0.033
 persu+0.032
Neg logit contributions
 journals-0.042
 journal-0.040
pron-0.036
 Christmas-0.036
ifying-0.034
Token Lily
Token ID20037
Feature activation-2.44e-02

Pos logit contributions
 journals+0.115
 journal+0.108
ifying+0.099
ak+0.099
ocket+0.098
Neg logit contributions
 comm-0.090
mun-0.082
ters-0.082
 ang-0.077
 louder-0.073
Token was
Token ID373
Feature activation-9.92e-02

Pos logit contributions
 comm+0.034
mun+0.033
 tactic+0.026
ters+0.026
 warns+0.025
Neg logit contributions
 journals-0.052
ifying-0.052
 journal-0.048
 Christmas-0.045
ocket-0.044
Token so
Token ID523
Feature activation-8.08e-02

Pos logit contributions
 comm+0.140
ters+0.120
 ang+0.120
mun+0.119
 persu+0.111
Neg logit contributions
 journals-0.221
 journal-0.208
pron-0.184
ifying-0.184
 Christmas-0.181
Token excited
Token ID6568
Feature activation+3.77e-02

Pos logit contributions
 comm+0.112
mun+0.100
 tactic+0.090
ters+0.090
 ang+0.088
Neg logit contributions
 journals-0.171
 journal-0.159
ifying-0.148
pron-0.144
ocket-0.141
Token was
Token ID373
Feature activation-1.09e-01

Pos logit contributions
 comm+0.111
mun+0.108
 ang+0.097
ters+0.093
 warns+0.089
Neg logit contributions
 journals-0.135
 journal-0.125
ifying-0.123
pron-0.113
 Christmas-0.110
Token very
Token ID845
Feature activation-9.61e-03

Pos logit contributions
 comm+0.188
 ang+0.174
mun+0.172
ters+0.165
 persu+0.159
Neg logit contributions
 journals-0.212
 journal-0.198
pron-0.174
ifying-0.173
 Christmas-0.166
Token happy
Token ID3772
Feature activation+8.56e-02

Pos logit contributions
 comm+0.017
mun+0.016
 ang+0.016
ters+0.014
 persu+0.014
Neg logit contributions
 journals-0.019
 journal-0.017
ifying-0.015
pron-0.015
 Christmas-0.015
Token.
Token ID13
Feature activation-5.49e-03

Pos logit contributions
 journals+0.143
 journal+0.135
pron+0.122
 Christmas+0.120
ifying+0.115
Neg logit contributions
 comm-0.163
mun-0.154
ters-0.148
 ang-0.146
 persu-0.140
Token His
Token ID2399
Feature activation+4.21e-02

Pos logit contributions
 comm+0.009
mun+0.008
ters+0.008
 ang+0.008
 persu+0.007
Neg logit contributions
 journals-0.011
 journal-0.010
pron-0.010
 Christmas-0.009
ocket-0.009
Token idea
Token ID2126
Feature activation-2.91e-01

Pos logit contributions
 journals+0.065
 journal+0.058
ifying+0.055
pron+0.054
ak+0.051
Neg logit contributions
 comm-0.085
mun-0.077
 ang-0.076
 persu-0.073
 persuasion-0.072
Token worked
Token ID3111
Feature activation-1.64e-02

Pos logit contributions
 comm+0.483
mun+0.435
ters+0.410
 ang+0.403
 warns+0.396
Neg logit contributions
 journals-0.576
 journal-0.516
ifying-0.491
pron-0.473
 Christmas-0.467
Token,
Token ID11
Feature activation-3.20e-02

Pos logit contributions
 comm+0.032
mun+0.031
ters+0.029
 ang+0.029
 tactic+0.028
Neg logit contributions
 journals-0.026
 journal-0.024
pron-0.022
 Christmas-0.022
ak-0.020
Token and
Token ID290
Feature activation+5.21e-02

Pos logit contributions
 comm+0.058
mun+0.055
 ang+0.053
ters+0.053
 persu+0.051
Neg logit contributions
 journals-0.057
 journal-0.054
pron-0.048
 Christmas-0.047
ifying-0.047
Token he
Token ID339
Feature activation+5.15e-02

Pos logit contributions
 journals+0.089
 journal+0.085
pron+0.076
 Christmas+0.075
ifying+0.071
Neg logit contributions
 comm-0.096
mun-0.091
ters-0.087
 ang-0.086
 persu-0.083
Token got
Token ID1392
Feature activation+5.94e-02

Pos logit contributions
 journals+0.102
ifying+0.101
 journal+0.095
pron+0.088
ocket+0.084
Neg logit contributions
 comm-0.079
mun-0.075
 ang-0.068
ters-0.067
 warns-0.065
Token scarf
Token ID36953
Feature activation+1.83e-01

Pos logit contributions
 comm+0.010
mun+0.009
 ang+0.009
ters+0.009
 louder+0.009
Neg logit contributions
 journals-0.008
 journal-0.007
 Christmas-0.006
ocket-0.006
iflower-0.006
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
 journals+0.267
 journal+0.258
 Christmas+0.232
pron+0.231
ifying+0.216
Neg logit contributions
 comm-0.375
mun-0.360
 ang-0.340
ters-0.332
 tactic-0.331
Token They
Token ID1119
Feature activation+1.24e-02

Pos logit contributions
 journals+0.039
 journal+0.037
pron+0.034
ocket+0.033
 Christmas+0.033
Neg logit contributions
 comm-0.033
ters-0.031
mun-0.030
 ang-0.029
 persuasion-0.028
Token threw
Token ID9617
Feature activation-2.01e-02

Pos logit contributions
 journals+0.026
 journal+0.024
pron+0.021
iflower+0.021
ifying+0.021
Neg logit contributions
mun-0.019
 comm-0.019
ters-0.017
 ang-0.016
 zo-0.016
Token snow
Token ID6729
Feature activation+2.08e-02

Pos logit contributions
 comm+0.035
mun+0.032
 ang+0.032
ters+0.032
 persu+0.029
Neg logit contributions
 journals-0.036
 journal-0.035
pron-0.033
ouse-0.031
 Christmas-0.030
Tokenballs
Token ID21591
Feature activation-2.66e-01

Pos logit contributions
 journals+0.039
 journal+0.038
 Christmas+0.037
pron+0.035
ifying+0.033
Neg logit contributions
 comm-0.034
mun-0.032
 tactic-0.030
 ang-0.029
 persu-0.029
Token at
Token ID379
Feature activation+2.89e-03

Pos logit contributions
 comm+0.443
ters+0.434
mun+0.425
 ang+0.409
 persu+0.392
Neg logit contributions
 journals-0.492
 journal-0.459
pron-0.426
 Christmas-0.401
ifying-0.394
Token each
Token ID1123
Feature activation+9.53e-02

Pos logit contributions
 journals+0.006
 journal+0.006
pron+0.005
ouse+0.005
ifying+0.005
Neg logit contributions
 comm-0.004
ters-0.004
mun-0.004
 ang-0.004
 persu-0.003
Token other
Token ID584
Feature activation-1.20e-01

Pos logit contributions
 journals+0.232
 journal+0.222
pron+0.210
 Christmas+0.200
ocket+0.200
Neg logit contributions
 comm-0.108
mun-0.102
ters-0.097
 ang-0.092
 tactic-0.087
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
 comm+0.190
ters+0.186
mun+0.176
 ang+0.169
 tactic+0.157
Neg logit contributions
 journals-0.229
 journal-0.217
pron-0.217
ocket-0.195
iflower-0.194
Token They
Token ID1119
Feature activation+1.46e-02

Pos logit contributions
 journals+0.042
 journal+0.041
pron+0.037
 Christmas+0.035
ocket+0.035
Neg logit contributions
 comm-0.037
mun-0.035
ters-0.034
 ang-0.033
 persuasion-0.032
Token gentle
Token ID10296
Feature activation-2.40e-02

Pos logit contributions
 comm+0.072
 ang+0.067
mun+0.067
ters+0.065
 persu+0.061
Neg logit contributions
 journals-0.083
 journal-0.079
pron-0.072
 Christmas-0.071
ifying-0.071
Token when
Token ID618
Feature activation-9.69e-02

Pos logit contributions
 comm+0.043
mun+0.041
ters+0.041
 ang+0.039
 tactic+0.037
Neg logit contributions
 journals-0.041
 journal-0.039
pron-0.037
 Christmas-0.035
ifying-0.034
Token un
Token ID555
Feature activation+1.02e-01

Pos logit contributions
 comm+0.164
mun+0.156
ters+0.148
 ang+0.148
 persu+0.143
Neg logit contributions
 journals-0.175
 journal-0.165
pron-0.159
ifying-0.149
ocket-0.149
Tokenk
Token ID74
Feature activation+5.61e-02

Pos logit contributions
 journals+0.183
 journal+0.172
pron+0.162
 Christmas+0.156
ouse+0.153
Neg logit contributions
 comm-0.183
mun-0.167
ters-0.155
 tactic-0.152
 ang-0.150
Tokennot
Token ID1662
Feature activation-1.54e-01

Pos logit contributions
 journals+0.102
 journal+0.096
pron+0.089
 Christmas+0.087
ocket+0.082
Neg logit contributions
 comm-0.095
ters-0.091
mun-0.091
 ang-0.089
 persu-0.082
Tokenting
Token ID889
Feature activation-3.65e-01

Pos logit contributions
 comm+0.268
mun+0.251
ters+0.242
 ang+0.241
 persu+0.230
Neg logit contributions
 journals-0.282
 journal-0.267
pron-0.243
 Christmas-0.238
ifying-0.227
Token things
Token ID1243
Feature activation-1.36e-01

Pos logit contributions
 comm+0.662
mun+0.631
ters+0.594
 ang+0.590
 tactic+0.569
Neg logit contributions
 journals-0.625
 journal-0.596
pron-0.532
 Christmas-0.531
ifying-0.505
Token and
Token ID290
Feature activation+6.06e-02

Pos logit contributions
 comm+0.239
mun+0.226
ters+0.217
 ang+0.214
 persu+0.204
Neg logit contributions
 journals-0.243
 journal-0.230
pron-0.210
 Christmas-0.206
ifying-0.195
Token he
Token ID339
Feature activation+5.85e-02

Pos logit contributions
 journals+0.108
 journal+0.103
pron+0.093
 Christmas+0.092
ifying+0.087
Neg logit contributions
 comm-0.106
mun-0.101
ters-0.096
 ang-0.095
 persu-0.091
Token also
Token ID635
Feature activation+7.36e-02

Pos logit contributions
 journals+0.105
 journal+0.099
pron+0.096
 Christmas+0.092
ifying+0.090
Neg logit contributions
 comm-0.102
mun-0.099
 warns-0.092
ters-0.091
 ang-0.090
Token learned
Token ID4499
Feature activation+1.54e-01

Pos logit contributions
 journals+0.124
 journal+0.116
pron+0.110
 Christmas+0.107
ifying+0.103
Neg logit contributions
 comm-0.140
mun-0.132
 ang-0.127
 warns-0.122
ters-0.121
Token it
Token ID340
Feature activation+1.52e-01

Pos logit contributions
 journals+0.005
 journal+0.004
pron+0.004
ouse+0.004
ocket+0.004
Neg logit contributions
ters-0.005
 comm-0.005
 ang-0.004
mun-0.004
 persu-0.004
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 journals+0.220
 journal+0.213
 Christmas+0.190
pron+0.183
ifying+0.176
Neg logit contributions
 comm-0.314
mun-0.305
 ang-0.281
ters-0.277
 persuasion-0.271
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
 journals+0.024
 journal+0.022
pron+0.020
 Christmas+0.020
ifying+0.019
Neg logit contributions
 comm-0.027
mun-0.026
ters-0.025
 ang-0.025
 persu-0.024
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
 comm+0.025
mun+0.024
 ang+0.022
ters+0.022
 tactic+0.021
Neg logit contributions
 journals-0.036
 journal-0.034
pron-0.031
 Christmas-0.031
ifying-0.030
Token"
Token ID1
Feature activation+1.40e-01

Pos logit contributions
 comm+0.033
mun+0.032
ters+0.032
 ang+0.031
 persu+0.029
Neg logit contributions
 journals-0.030
 journal-0.028
 Christmas-0.025
pron-0.024
ocket-0.024
TokenCome
Token ID16773
Feature activation-2.71e-01

Pos logit contributions
 journals+0.268
 journal+0.254
pron+0.242
 Christmas+0.236
ifying+0.232
Neg logit contributions
 comm-0.220
mun-0.209
 ang-0.191
ters-0.189
 tactic-0.189
Token on
Token ID319
Feature activation-2.33e-02

Pos logit contributions
 comm+0.539
mun+0.512
 ang+0.475
ters+0.475
 tactic+0.464
Neg logit contributions
 journals-0.420
 journal-0.411
 Christmas-0.373
pron-0.367
ifying-0.347
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
 comm+0.033
 ang+0.029
ters+0.029
mun+0.029
 persu+0.027
Neg logit contributions
 journals-0.052
 journal-0.047
pron-0.044
ifying-0.042
ocket-0.042
Token Tom
Token ID4186
Feature activation-3.09e-02

Pos logit contributions
 comm+0.030
mun+0.028
ters+0.027
 ang+0.027
 persu+0.025
Neg logit contributions
 journals-0.030
 journal-0.028
pron-0.026
 Christmas-0.025
ifying-0.024
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
 comm+0.050
mun+0.047
ters+0.046
 ang+0.044
 warns+0.041
Neg logit contributions
 journals-0.063
 journal-0.058
pron-0.054
ifying-0.053
 Christmas-0.051
Token you
Token ID345
Feature activation+7.09e-02

Pos logit contributions
 comm+0.029
mun+0.028
ters+0.027
 ang+0.027
 persu+0.025
Neg logit contributions
 journals-0.029
 journal-0.027
pron-0.025
 Christmas-0.023
ifying-0.023
Token lost
Token ID2626
Feature activation+1.17e-01

Pos logit contributions
 comm+0.153
mun+0.144
ters+0.134
 ang+0.130
 tactic+0.126
Neg logit contributions
 journals-0.231
 journal-0.222
pron-0.204
 Christmas-0.204
ifying-0.195
Token in
Token ID287
Feature activation+7.64e-03

Pos logit contributions
 journals+0.223
 journal+0.206
pron+0.202
ouse+0.187
ocket+0.184
Neg logit contributions
 comm-0.189
ters-0.186
mun-0.176
 ang-0.171
 tactic-0.158
Token the
Token ID262
Feature activation-4.13e-03

Pos logit contributions
 journals+0.014
 journal+0.013
pron+0.012
ouse+0.012
ocket+0.012
Neg logit contributions
 comm-0.013
mun-0.012
 ang-0.012
ters-0.012
 persu-0.011
Token clouds
Token ID15114
Feature activation-4.45e-03

Pos logit contributions
 comm+0.007
mun+0.007
 ang+0.006
ters+0.006
 persu+0.006
Neg logit contributions
 journals-0.008
 journal-0.007
pron-0.006
 Christmas-0.006
ocket-0.006
Token,
Token ID11
Feature activation-3.83e-02

Pos logit contributions
 comm+0.007
ters+0.007
mun+0.007
 ang+0.006
 persu+0.006
Neg logit contributions
 journals-0.009
 journal-0.008
pron-0.007
ocket-0.007
 Christmas-0.007
Token higher
Token ID2440
Feature activation-2.81e-01

Pos logit contributions
 comm+0.060
mun+0.056
 ang+0.054
ters+0.053
 persu+0.050
Neg logit contributions
 journals-0.077
 journal-0.073
pron-0.068
 Christmas-0.066
ocket-0.064
Token and
Token ID290
Feature activation+4.41e-02

Pos logit contributions
 comm+0.542
mun+0.517
ters+0.506
 ang+0.495
 tactic+0.472
Neg logit contributions
 journals-0.457
 journal-0.432
pron-0.393
 Christmas-0.376
ocket-0.361
Token higher
Token ID2440
Feature activation-2.82e-01

Pos logit contributions
 journals+0.088
 journal+0.085
 Christmas+0.079
pron+0.077
ifying+0.076
Neg logit contributions
 comm-0.065
mun-0.062
ters-0.057
 tactic-0.055
 ang-0.054
Token,
Token ID11
Feature activation-3.82e-02

Pos logit contributions
 comm+0.542
mun+0.517
ters+0.494
 ang+0.490
 persu+0.470
Neg logit contributions
 journals-0.457
 journal-0.434
pron-0.389
 Christmas-0.384
ifying-0.363
Token until
Token ID1566
Feature activation-8.15e-02

Pos logit contributions
 comm+0.063
mun+0.060
ters+0.057
 ang+0.057
 persu+0.054
Neg logit contributions
 journals-0.072
 journal-0.069
pron-0.063
 Christmas-0.062
ocket-0.059
Token it
Token ID340
Feature activation+1.36e-01

Pos logit contributions
 comm+0.122
 ang+0.120
 persu+0.116
mun+0.116
 persuasion+0.112
Neg logit contributions
 journals-0.161
 journal-0.151
pron-0.145
ouse-0.144
ak-0.139