TOP ACTIVATIONS
MAX = 0.248

store . There were many matt resses . L ila saw
looked around and saw many matt resses . Some were soft
princess who slept on many matt resses . But she felt
go riding together on Tues days . That made the little
to pick flowers on Tues days ." Max
to un pack their back packs , which they brought to
loved going outside to play games . One day
twins who loved to play games . One day , they
brothers who liked to play games . They had many toys
twins who liked to play games . One day , they
lots of fun together . Yesterday , Tim teased Joe when
twins who liked to play games . They were very competitive
Sam who loved to play games . She particularly liked playing
brothers who liked to play games . They had many toys
like to play together . Yesterday , they went to the
us ! We love playing games ." The
play in the park . Yesterday , they went to the
always excited for Wed nes days because it was the day
is always busy on Tues days ," replied Max .
. They like to play games . Today they play a

MIN ACTIVATIONS
MIN = -0.209

bushes . It was a crow , ch ir ping and
. It is a horn et ." Tom does
of a fox and a crow . She took the book
is a crow . The crow is very hungry and eager
ak . It is a crow . The crow is very
a lick and a n udge . Tom smiles and waves
, it 's a sm udge . It looks like you
just an old tr ink et to me ." Twe ety
and soon the f auc et was as good as new
not the same . Cere al does not grow in the
, car - din - al , blue - jay ,
But the horn et is faster . It sees
ite ? It is a crow . It is my favorite
never forgot the fast che et ah he had seen .
, the old f auc et broke and water started spilling
guess it 's a horn et 's nest ! We shouldn
face . Tom feels a sting in his eyes and a
ak . It was a crow . Lily did not like
ite . It is a crow . It is a mysterious
nose !" The crow c awed and flew away

INTERVAL 0.134 - 0.248
CONTAINS 0.396%

soon they were all playing together . The family spent the
could find . As he walked he saw a
they said . As they played , Tim my
. Lily loved to play games with her friends . One
he loved to march . Everywhere Tom went , he marched

INTERVAL 0.020 - 0.134
CONTAINS 30.578%

make mom my and daddy happy again ." M
different colors and sizes . They filled the air with their
said , " That 's great , Lily ! Sometimes when
. <|endoftext|> Once upon a time , in a small swamp
decided to kne el down and pray for money . The

INTERVAL -0.094 - 0.020
CONTAINS 67.417%

It saw Lily 's ring and wanted it .
<|endoftext|> Once upon a time , there was a little girl
and feels better . She gives him her red boot .
pretty . He wanted to draw too . He grabbed a
caul iflower and took it to her home . She had

INTERVAL -0.209 - -0.094
CONTAINS 1.608%

. His mom was happy that Tim my was taking good
walk slowly and she never broke her nap kin again .
it . Suddenly , she saw the sunshine and it made
Ben saw something . He saw a big hole in the
and the dog were sad that their fun had been stopped
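
Note on the INTERVAL blocks above: the four ranges look like an equal-width split of the span from MIN (-0.209) to MAX (0.248), and the four CONTAINS values sum to 99.999%, which is consistent with a partition of all activations up to rounding. The sketch below is only an illustration under that assumption. The helper names `interval_edges` and `interval_shares` and the sample activations are hypothetical and do not come from the tool that produced this page.

```python
from typing import List, Sequence


def interval_edges(lo: float, hi: float, n_bins: int = 4) -> List[float]:
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning)."""
    width = (hi - lo) / n_bins
    # Use hi itself as the last edge so the maximum value is never excluded
    # by floating-point drift.
    return [lo + i * width for i in range(n_bins)] + [hi]


def interval_shares(acts: Sequence[float], edges: Sequence[float]) -> List[float]:
    """Percentage of activations per bin; bins are [left, right), last bin is closed."""
    counts = [0] * (len(edges) - 1)
    for a in acts:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= a < edges[i + 1] or (is_last and a == edges[-1]):
                counts[i] += 1
                break
    total = len(acts)
    return [100.0 * c / total for c in counts]


# MIN and MAX as displayed on this page (already rounded to three decimals).
edges = interval_edges(-0.209, 0.248)

# Hypothetical activations, used only to exercise the helper.
sample_acts = [-0.2, -0.05, 0.0, 0.01, 0.1, 0.248]
shares = interval_shares(sample_acts, edges)
```

Because the displayed MIN and MAX are themselves rounded, the computed edges match the printed boundaries (-0.094, 0.020, 0.134) only to within about 0.001.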
Token store
Token ID3650
Feature activation+7.27e-02

Pos logit contributions
 Him+0.032
 journals+0.031
onies+0.031
 parties+0.030
 chores+0.030
Neg logit contributions
 block-0.017
 pipe-0.017
 figure-0.016
 nail-0.016
 spike-0.016
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
 parties+0.174
 chores+0.171
 journals+0.168
 Him+0.165
onies+0.158
Neg logit contributions
 spike-0.076
 nail-0.072
 snake-0.071
 pipe-0.069
 block-0.069
Token There
Token ID1318
Feature activation-5.82e-03

Pos logit contributions
 spike+0.003
 nail+0.003
 block+0.003
 pipe+0.003
 crocod+0.003
Neg logit contributions
 parties-0.003
 journals-0.003
 chores-0.003
 Him-0.003
onies-0.003
Token were
Token ID547
Feature activation+4.89e-02

Pos logit contributions
 spike+0.006
 block+0.006
 pipe+0.006
 nail+0.006
 crocod+0.006
Neg logit contributions
 parties-0.013
 chores-0.013
 Him-0.013
 journals-0.013
onies-0.012
Token many
Token ID867
Feature activation+1.11e-01

Pos logit contributions
 Him+0.082
 journals+0.080
 parties+0.078
 chores+0.076
onies+0.074
Neg logit contributions
 block-0.085
 nail-0.082
 spike-0.081
 crocod-0.081
 pipe-0.081
Token matt
Token ID23963
Feature activation+2.48e-01

Pos logit contributions
 Him+0.209
 journals+0.196
 parties+0.195
Hope+0.189
 hospitality+0.189
Neg logit contributions
 block-0.172
 nail-0.166
 pipe-0.164
 spike-0.163
 figure-0.161
Tokenresses
Token ID16746
Feature activation+8.56e-02

Pos logit contributions
 Him+0.227
 journals+0.223
 parties+0.205
 chores+0.200
 hospitality+0.181
Neg logit contributions
 block-0.624
 nail-0.615
 pipe-0.605
 figure-0.605
 spike-0.605
Token.
Token ID13
Feature activation-3.62e-04

Pos logit contributions
 parties+0.202
 chores+0.199
 journals+0.191
 Him+0.189
 holidays+0.184
Neg logit contributions
 spike-0.091
 nail-0.085
 pipe-0.084
 crocod-0.084
 snake-0.084
Token L
Token ID406
Feature activation+7.71e-03

Pos logit contributions
 spike+0.001
 pipe+0.001
 nail+0.001
 block+0.001
 crocod+0.000
Neg logit contributions
 parties-0.001
 chores-0.001
 journals-0.001
 Him-0.001
onies-0.001
Tokenila
Token ID10102
Feature activation-7.04e-02

Pos logit contributions
 journals+0.019
 Him+0.019
 chores+0.019
 parties+0.019
 holidays+0.018
Neg logit contributions
 nail-0.007
 block-0.007
 spike-0.007
 pipe-0.007
 figure-0.006
Token saw
Token ID2497
Feature activation-8.75e-02

Pos logit contributions
 spike+0.054
 nail+0.054
 pipe+0.054
 block+0.054
 crocod+0.051
Neg logit contributions
 parties-0.181
 journals-0.179
 chores-0.178
 Him-0.176
onies-0.171
Token looked
Token ID3114
Feature activation-1.38e-02

Pos logit contributions
 journals+0.155
 parties+0.155
 Him+0.154
 chores+0.153
onies+0.147
Neg logit contributions
 block-0.072
 nail-0.071
 pipe-0.069
 spike-0.069
 figure-0.068
Token around
Token ID1088
Feature activation-5.61e-02

Pos logit contributions
 block+0.012
 nail+0.011
 pipe+0.011
 spike+0.011
 crocod+0.011
Neg logit contributions
 parties-0.034
 journals-0.034
 Him-0.034
 chores-0.033
onies-0.032
Token and
Token ID290
Feature activation+6.05e-02

Pos logit contributions
 spike+0.061
 nail+0.059
 pipe+0.057
 block+0.055
 crocod+0.053
Neg logit contributions
 parties-0.134
 chores-0.131
 Him-0.131
 journals-0.129
 holidays-0.125
Token saw
Token ID2497
Feature activation-8.15e-02

Pos logit contributions
 parties+0.151
 journals+0.148
 Him+0.147
 chores+0.143
 holidays+0.143
Neg logit contributions
 spike-0.058
 snake-0.055
 pipe-0.053
 nail-0.052
 crocod-0.050
Token many
Token ID867
Feature activation+1.13e-01

Pos logit contributions
 block+0.128
 nail+0.127
 spike+0.127
 pipe+0.127
 crocod+0.124
Neg logit contributions
 parties-0.145
 journals-0.145
 chores-0.143
 Him-0.143
onies-0.135
Token matt
Token ID23963
Feature activation+2.48e-01

Pos logit contributions
 Him+0.207
 journals+0.198
 parties+0.198
 chores+0.191
Hope+0.191
Neg logit contributions
 block-0.175
 nail-0.171
 pipe-0.170
 spike-0.170
 figure-0.169
Tokenresses
Token ID16746
Feature activation+8.43e-02

Pos logit contributions
 Him+0.220
 journals+0.218
 parties+0.207
 chores+0.201
 unnecessary+0.183
Neg logit contributions
 block-0.621
 nail-0.614
 pipe-0.609
 figure-0.608
 spike-0.600
Token.
Token ID13
Feature activation+9.43e-03

Pos logit contributions
 parties+0.197
 chores+0.195
 journals+0.189
 Him+0.187
onies+0.181
Neg logit contributions
 spike-0.089
 nail-0.085
 block-0.083
 pipe-0.083
 snake-0.082
Token Some
Token ID2773
Feature activation+5.13e-02

Pos logit contributions
 parties+0.018
 journals+0.017
 Him+0.017
 chores+0.017
 holidays+0.016
Neg logit contributions
 spike-0.014
 nail-0.014
 pipe-0.013
 figure-0.013
 crocod-0.013
Token were
Token ID547
Feature activation+3.75e-02

Pos logit contributions
 Him+0.125
 journals+0.121
Hope+0.117
 parties+0.117
 chores+0.115
Neg logit contributions
 block-0.049
 spike-0.046
 pipe-0.046
 figure-0.046
 nail-0.046
Token soft
Token ID2705
Feature activation+1.35e-02

Pos logit contributions
 chores+0.079
 parties+0.078
 journals+0.076
 Him+0.074
onies+0.072
Neg logit contributions
 spike-0.048
 pipe-0.047
 nail-0.047
 block-0.047
 figure-0.046
Token princess
Token ID21752
Feature activation-1.89e-02

Pos logit contributions
 parties+0.066
 journals+0.065
 chores+0.063
 holidays+0.060
 pic+0.059
Neg logit contributions
 grun-0.048
 spike-0.047
 nail-0.046
 pipe-0.045
 block-0.045
Token who
Token ID508
Feature activation-4.85e-02

Pos logit contributions
 spike+0.020
 nail+0.020
 pipe+0.019
 block+0.018
 grun+0.018
Neg logit contributions
 parties-0.045
 journals-0.043
 chores-0.043
 Him-0.043
onies-0.041
Token slept
Token ID21256
Feature activation+6.61e-02

Pos logit contributions
 spike+0.060
 pipe+0.059
 nail+0.059
 block+0.058
 crocod+0.056
Neg logit contributions
 parties-0.103
 journals-0.102
 chores-0.100
 Him-0.100
onies-0.095
Token on
Token ID319
Feature activation-3.57e-02

Pos logit contributions
 journals+0.162
 parties+0.162
 chores+0.160
 Him+0.160
onies+0.154
Neg logit contributions
 block-0.059
 nail-0.058
 pipe-0.057
 spike-0.057
 figure-0.056
Token many
Token ID867
Feature activation+1.20e-01

Pos logit contributions
 block+0.034
 nail+0.033
 spike+0.033
 pipe+0.033
 figure+0.032
Neg logit contributions
 journals-0.085
 parties-0.084
 Him-0.084
 chores-0.083
onies-0.082
Token matt
Token ID23963
Feature activation+2.47e-01

Pos logit contributions
 chores+0.184
 Him+0.184
 parties+0.182
 journals+0.181
onies+0.174
Neg logit contributions
 block-0.214
 nail-0.211
 figure-0.209
 pipe-0.209
 spike-0.208
Tokenresses
Token ID16746
Feature activation+9.26e-02

Pos logit contributions
 journals+0.260
 Him+0.247
 chores+0.247
 parties+0.237
 unnecessary+0.236
Neg logit contributions
 block-0.582
 nail-0.563
 figure-0.558
 crocod-0.551
 pipe-0.549
Token.
Token ID13
Feature activation-2.98e-02

Pos logit contributions
 parties+0.235
 journals+0.234
 chores+0.233
 Him+0.231
onies+0.222
Neg logit contributions
 block-0.073
 nail-0.073
 spike-0.073
 pipe-0.072
 figure-0.069
Token But
Token ID887
Feature activation-8.44e-02

Pos logit contributions
 block+0.030
 "+0.029
 snake+0.027
 nail+0.027
 circle+0.026
Neg logit contributions
 chores-0.070
 journals-0.070
 parties-0.069
beh-0.067
 hospitality-0.067
Token she
Token ID673
Feature activation-1.13e-02

Pos logit contributions
 pipe+0.127
 spike+0.125
 nail+0.122
 block+0.122
 crocod+0.119
Neg logit contributions
 parties-0.162
 Him-0.160
 chores-0.159
 journals-0.156
 holidays-0.147
Token felt
Token ID2936
Feature activation-8.68e-03

Pos logit contributions
 pipe+0.014
 spike+0.013
 nail+0.013
 block+0.012
 crocod+0.012
Neg logit contributions
 parties-0.024
 journals-0.024
 chores-0.024
 Him-0.023
onies-0.023
Token go
Token ID467
Feature activation+6.89e-02

Pos logit contributions
 parties+0.059
 chores+0.055
 journals+0.052
 holidays+0.052
 Him+0.052
Neg logit contributions
 nail-0.053
 pipe-0.053
 block-0.052
 blob-0.051
 spike-0.051
Token riding
Token ID10311
Feature activation+5.96e-02

Pos logit contributions
 parties+0.157
 journals+0.156
 chores+0.156
 Him+0.154
onies+0.147
Neg logit contributions
 nail-0.074
 spike-0.074
 block-0.074
 pipe-0.073
 crocod-0.070
Token together
Token ID1978
Feature activation+1.46e-01

Pos logit contributions
 parties+0.136
 journals+0.134
 chores+0.134
 Him+0.132
onies+0.125
Neg logit contributions
 nail-0.065
 pipe-0.063
 spike-0.063
 block-0.063
 crocod-0.061
Token on
Token ID319
Feature activation+3.40e-03

Pos logit contributions
 parties+0.354
 chores+0.346
 journals+0.344
 Him+0.338
 holidays+0.325
Neg logit contributions
 nail-0.140
 pipe-0.140
 spike-0.140
 block-0.135
 crocod-0.134
Token Tues
Token ID48496
Feature activation+1.17e-01

Pos logit contributions
 journals+0.008
 parties+0.008
 Him+0.008
 chores+0.008
onies+0.008
Neg logit contributions
 spike-0.003
 block-0.003
 nail-0.003
 pipe-0.003
 figure-0.003
Tokendays
Token ID12545
Feature activation+2.19e-01

Pos logit contributions
 journals+0.106
 chores+0.105
 parties+0.102
 Him+0.100
onies+0.092
Neg logit contributions
 block-0.292
 nail-0.285
 spike-0.284
 figure-0.284
 pipe-0.284
Token.
Token ID13
Feature activation-5.34e-03

Pos logit contributions
 journals+0.555
 parties+0.554
 chores+0.550
 Him+0.547
onies+0.525
Neg logit contributions
 block-0.175
 nail-0.174
 spike-0.173
 pipe-0.172
 figure-0.165
Token That
Token ID1320
Feature activation-2.65e-02

Pos logit contributions
 nail+0.008
 block+0.008
 pipe+0.008
 spike+0.008
 figure+0.008
Neg logit contributions
 parties-0.010
 chores-0.010
 Him-0.010
 journals-0.010
onies-0.009
Token made
Token ID925
Feature activation-6.73e-02

Pos logit contributions
 nail+0.032
 block+0.032
 spike+0.031
 pipe+0.031
 crocod+0.030
Neg logit contributions
 parties-0.058
 chores-0.057
 journals-0.057
 Him-0.055
onies-0.054
Token the
Token ID262
Feature activation-1.55e-03

Pos logit contributions
 spike+0.089
 crocod+0.088
 pipe+0.085
 figure+0.084
 block+0.084
Neg logit contributions
 journals-0.143
 parties-0.134
 chores-0.133
 Him-0.130
onies-0.125
Token little
Token ID1310
Feature activation+2.42e-03

Pos logit contributions
 block+0.002
 spike+0.002
 nail+0.002
 pipe+0.002
 figure+0.002
Neg logit contributions
 parties-0.003
 journals-0.003
 chores-0.003
 Him-0.003
onies-0.003
Token to
Token ID284
Feature activation-3.83e-02

Pos logit contributions
 block+0.028
 spike+0.028
 nail+0.028
 pipe+0.027
 crocod+0.027
Neg logit contributions
 journals-0.046
 parties-0.046
 chores-0.046
 Him-0.046
onies-0.044
Token pick
Token ID2298
Feature activation-2.30e-02

Pos logit contributions
 spike+0.060
 pipe+0.060
 nail+0.060
 block+0.059
 crocod+0.058
Neg logit contributions
 parties-0.069
 chores-0.067
 journals-0.064
 Him-0.064
 holidays-0.061
Token flowers
Token ID12734
Feature activation+4.93e-02

Pos logit contributions
 nail+0.028
 block+0.028
 pipe+0.028
 spike+0.028
 figure+0.027
Neg logit contributions
 parties-0.049
 journals-0.048
 chores-0.048
 Him-0.048
onies-0.045
Token on
Token ID319
Feature activation+4.24e-03

Pos logit contributions
 parties+0.122
 journals+0.122
 chores+0.121
 Him+0.120
onies+0.115
Neg logit contributions
 block-0.042
 nail-0.042
 spike-0.042
 pipe-0.042
 figure-0.040
Token Tues
Token ID48496
Feature activation+1.24e-01

Pos logit contributions
 journals+0.009
 Him+0.009
 parties+0.009
 chores+0.009
onies+0.009
Neg logit contributions
 block-0.005
 nail-0.005
 spike-0.005
 pipe-0.005
 crocod-0.005
Tokendays
Token ID12545
Feature activation+2.15e-01

Pos logit contributions
 chores+0.119
 journals+0.118
 parties+0.114
 Him+0.112
onies+0.105
Neg logit contributions
 block-0.305
 nail-0.295
 pipe-0.295
 spike-0.292
 figure-0.292
Token."
Token ID526
Feature activation+6.62e-02

Pos logit contributions
 journals+0.507
 Him+0.498
 chores+0.496
 parties+0.491
onies+0.473
Neg logit contributions
 block-0.219
 nail-0.206
 pipe-0.206
 spike-0.200
 figure-0.197
Token␣
Token ID220
Feature activation+5.72e-02

Pos logit contributions
 parties+0.144
 journals+0.144
 chores+0.143
 Him+0.141
onies+0.135
Neg logit contributions
 block-0.078
 spike-0.077
 pipe-0.077
 nail-0.077
 figure-0.074
Token↵
Token ID198
Feature activation+2.74e-02

Pos logit contributions
 journals+0.136
 parties+0.135
 chores+0.133
 Him+0.132
onies+0.126
Neg logit contributions
 block-0.057
 spike-0.055
 nail-0.055
 pipe-0.055
 crocod-0.052
Token↵
Token ID198
Feature activation+2.14e-02

Pos logit contributions
 journals+0.065
 parties+0.065
 chores+0.064
 Him+0.063
onies+0.061
Neg logit contributions
 block-0.028
 spike-0.027
 pipe-0.027
 nail-0.027
 figure-0.026
TokenMax
Token ID11518
Feature activation-6.08e-03

Pos logit contributions
 parties+0.042
 journals+0.041
 chores+0.041
 Him+0.041
onies+0.039
Neg logit contributions
 block-0.031
 spike-0.030
 nail-0.030
 pipe-0.030
 figure-0.030
Token to
Token ID284
Feature activation-4.17e-03

Pos logit contributions
 Him+0.022
 journals+0.021
 chores+0.020
onies+0.020
 parties+0.020
Neg logit contributions
 block-0.011
 pipe-0.010
 crocod-0.010
 spike-0.010
 nail-0.010
Token un
Token ID555
Feature activation+6.39e-02

Pos logit contributions
 block+0.006
 nail+0.006
 spike+0.006
 pipe+0.006
 figure+0.006
Neg logit contributions
 journals-0.008
 parties-0.008
 Him-0.008
 chores-0.008
onies-0.007
Tokenpack
Token ID8002
Feature activation+7.23e-02

Pos logit contributions
 Him+0.101
 journals+0.098
onies+0.093
 parties+0.093
 chores+0.091
Neg logit contributions
 block-0.121
 pipe-0.116
 figure-0.115
 nail-0.113
 hit-0.112
Token their
Token ID511
Feature activation+1.24e-01

Pos logit contributions
 journals+0.153
 parties+0.153
 Him+0.151
 chores+0.150
onies+0.144
Neg logit contributions
 block-0.087
 nail-0.087
 pipe-0.087
 spike-0.086
 figure-0.084
Token back
Token ID736
Feature activation-9.28e-04

Pos logit contributions
 Him+0.218
 parties+0.212
onies+0.204
Hope+0.204
 journals+0.203
Neg logit contributions
 pipe-0.208
 block-0.206
 nail-0.202
 spike-0.202
 figure-0.197
Tokenpacks
Token ID32377
Feature activation+2.11e-01

Pos logit contributions
 block+0.002
 pipe+0.002
 nail+0.002
 spike+0.002
 figure+0.002
Neg logit contributions
 journals-0.001
 parties-0.001
 Him-0.001
 chores-0.001
onies-0.001
Token,
Token ID11
Feature activation-2.74e-03

Pos logit contributions
 parties+0.536
 journals+0.534
 chores+0.530
 Him+0.526
onies+0.506
Neg logit contributions
 nail-0.167
 spike-0.167
 block-0.166
 pipe-0.165
 figure-0.157
Token which
Token ID543
Feature activation-4.28e-02

Pos logit contributions
 spike+0.003
 pipe+0.003
 nail+0.003
 block+0.003
 crocod+0.003
Neg logit contributions
 parties-0.007
 chores-0.006
 journals-0.006
 Him-0.006
onies-0.006
Token they
Token ID484
Feature activation+4.58e-02

Pos logit contributions
 spike+0.044
 nail+0.043
 block+0.042
 figure+0.040
 pipe+0.040
Neg logit contributions
 parties-0.101
 chores-0.099
 journals-0.094
 holidays-0.093
 Him-0.089
Token brought
Token ID3181
Feature activation+4.61e-02

Pos logit contributions
 parties+0.101
 journals+0.101
 chores+0.100
 Him+0.099
onies+0.095
Neg logit contributions
 spike-0.052
 block-0.052
 nail-0.052
 pipe-0.051
 figure-0.049
Token to
Token ID284
Feature activation+2.82e-03

Pos logit contributions
 journals+0.098
 parties+0.097
 Him+0.097
 chores+0.096
onies+0.093
Neg logit contributions
 block-0.055
 pipe-0.055
 nail-0.054
 spike-0.054
 figure-0.053
Token loved
Token ID6151
Feature activation+1.23e-01

Pos logit contributions
 parties+0.101
 journals+0.100
 chores+0.100
 Him+0.098
onies+0.093
Neg logit contributions
 nail-0.076
 spike-0.076
 block-0.075
 pipe-0.075
 crocod-0.073
Token going
Token ID1016
Feature activation+8.57e-02

Pos logit contributions
 journals+0.272
 chores+0.266
 parties+0.265
 Him+0.265
onies+0.257
Neg logit contributions
 spike-0.137
 pipe-0.136
 block-0.136
 nail-0.135
 crocod-0.134
Token outside
Token ID2354
Feature activation+4.13e-02

Pos logit contributions
 journals+0.220
 Him+0.217
 chores+0.211
 parties+0.210
onies+0.208
Neg logit contributions
 block-0.072
 pipe-0.069
 spike-0.067
 nail-0.066
 figure-0.065
Token to
Token ID284
Feature activation+1.40e-02

Pos logit contributions
 parties+0.105
 journals+0.105
 chores+0.104
 Him+0.104
onies+0.100
Neg logit contributions
 block-0.032
 nail-0.032
 spike-0.032
 pipe-0.032
 figure-0.030
Token play
Token ID711
Feature activation+1.12e-01

Pos logit contributions
 parties+0.025
 chores+0.025
 journals+0.024
 Him+0.024
 holidays+0.022
Neg logit contributions
 nail-0.022
 spike-0.022
 pipe-0.022
 block-0.022
 crocod-0.021
Token games
Token ID1830
Feature activation+2.09e-01

Pos logit contributions
 parties+0.253
 journals+0.248
 chores+0.248
 Him+0.244
onies+0.233
Neg logit contributions
 spike-0.124
 nail-0.124
 pipe-0.123
 block-0.122
 figure-0.118
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
 parties+0.536
 journals+0.535
 chores+0.531
 Him+0.528
onies+0.508
Neg logit contributions
 block-0.160
 nail-0.159
 spike-0.158
 pipe-0.158
 figure-0.150
Token↵
Token ID198
Feature activation+2.68e-02

Pos logit contributions
 journals+0.034
 parties+0.034
 chores+0.033
 Him+0.033
onies+0.031
Neg logit contributions
 block-0.024
 spike-0.024
 pipe-0.024
 nail-0.024
 figure-0.023
Token↵
Token ID198
Feature activation+2.00e-02

Pos logit contributions
 parties+0.057
 chores+0.057
 journals+0.056
 Him+0.056
onies+0.053
Neg logit contributions
 nail-0.033
 spike-0.032
 pipe-0.032
 block-0.032
 crocod-0.031
TokenOne
Token ID3198
Feature activation+7.82e-02

Pos logit contributions
 journals+0.029
 parties+0.028
 Him+0.028
 chores+0.028
onies+0.025
Neg logit contributions
 block-0.039
 spike-0.038
 nail-0.038
 pipe-0.038
 figure-0.038
Token day
Token ID1110
Feature activation+4.31e-02

Pos logit contributions
 parties+0.151
 chores+0.149
 journals+0.147
 Him+0.145
 holidays+0.138
Neg logit contributions
 nail-0.113
 spike-0.112
 block-0.111
 pipe-0.110
 figure-0.107
Token twins
Token ID20345
Feature activation+7.31e-02

Pos logit contributions
 journals+0.082
 hospitality+0.082
 cultures+0.082
 Him+0.082
Hope+0.079
Neg logit contributions
 block-0.048
 building-0.046
 pipe-0.043
 saw-0.043
 crocod-0.042
Token who
Token ID508
Feature activation-1.37e-02

Pos logit contributions
 parties+0.159
 journals+0.154
 chores+0.153
 Him+0.152
 holidays+0.145
Neg logit contributions
 spike-0.089
 pipe-0.087
 nail-0.087
 crocod-0.083
 snake-0.082
Token loved
Token ID6151
Feature activation+1.14e-01

Pos logit contributions
 block+0.016
 figure+0.015
 crocod+0.015
 nail+0.015
 pipe+0.015
Neg logit contributions
 journals-0.031
 Him-0.031
 chores-0.030
 parties-0.030
onies-0.029
Token to
Token ID284
Feature activation+2.89e-03

Pos logit contributions
onies+0.290
 journals+0.284
 Him+0.283
 chores+0.279
 parties+0.277
Neg logit contributions
 crocod-0.093
 pipe-0.091
 nail-0.089
 spike-0.089
 block-0.088
Token play
Token ID711
Feature activation+1.11e-01

Pos logit contributions
 Him+0.006
 journals+0.006
onies+0.006
 parties+0.005
 unnecessary+0.005
Neg logit contributions
 block-0.004
 figure-0.004
 nail-0.004
 pipe-0.004
 spike-0.004
Token games
Token ID1830
Feature activation+2.08e-01

Pos logit contributions
 parties+0.255
 chores+0.251
 journals+0.250
 Him+0.243
 holidays+0.232
Neg logit contributions
 spike-0.121
 nail-0.119
 block-0.118
 pipe-0.118
 crocod-0.114
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 parties+0.527
 journals+0.521
 chores+0.519
 Him+0.511
onies+0.492
Neg logit contributions
 spike-0.175
 nail-0.174
 pipe-0.172
 block-0.170
 crocod-0.163
Token One
Token ID1881
Feature activation+1.03e-01

Pos logit contributions
 journals+0.030
 Him+0.030
 chores+0.030
 parties+0.030
onies+0.029
Neg logit contributions
 block-0.027
 pipe-0.026
 nail-0.026
 crocod-0.025
 spike-0.025
Token day
Token ID1110
Feature activation+5.36e-02

Pos logit contributions
 parties+0.188
 chores+0.187
 journals+0.182
 Him+0.176
 holidays+0.171
Neg logit contributions
 spike-0.159
 nail-0.159
 block-0.152
 pipe-0.151
 figure-0.150
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
 journals+0.164
 Him+0.162
 parties+0.162
 chores+0.161
onies+0.157
Neg logit contributions
 block-0.015
 nail-0.014
 pipe-0.014
 spike-0.014
 crocod-0.013
Token they
Token ID484
Feature activation+4.89e-02

Pos logit contributions
 block+0.017
 nail+0.016
 pipe+0.016
 spike+0.016
 crocod+0.016
Neg logit contributions
 journals-0.028
 Him-0.028
 parties-0.027
 chores-0.027
onies-0.027
Token brothers
Token ID9397
Feature activation+6.75e-02

Pos logit contributions
 Him+0.093
 journals+0.091
 hospitality+0.090
 cultures+0.090
Sleep+0.089
Neg logit contributions
 block-0.048
 building-0.042
 pipe-0.042
 crocod-0.041
 saw-0.040
Token who
Token ID508
Feature activation-3.86e-04

Pos logit contributions
 parties+0.145
 journals+0.137
 chores+0.137
 Him+0.135
 holidays+0.130
Neg logit contributions
 spike-0.088
 nail-0.088
 pipe-0.087
 snake-0.081
 crocod-0.080
Token liked
Token ID8288
Feature activation+7.70e-02

Pos logit contributions
 block+0.000
 nail+0.000
 spike+0.000
 pipe+0.000
 figure+0.000
Neg logit contributions
 journals-0.001
 parties-0.001
 Him-0.001
 chores-0.001
onies-0.001
Token to
Token ID284
Feature activation+5.41e-04

Pos logit contributions
 journals+0.198
 Him+0.197
onies+0.195
 parties+0.195
 chores+0.195
Neg logit contributions
 block-0.059
 pipe-0.058
 nail-0.057
 crocod-0.057
 spike-0.056
Token play
Token ID711
Feature activation+1.10e-01

Pos logit contributions
 Him+0.001
 journals+0.001
 parties+0.001
onies+0.001
 chores+0.001
Neg logit contributions
 block-0.001
 nail-0.001
 figure-0.001
 pipe-0.001
 spike-0.001
Token games
Token ID1830
Feature activation+2.08e-01

Pos logit contributions
 parties+0.246
 chores+0.240
 journals+0.237
 Him+0.226
 holidays+0.219
Neg logit contributions
 spike-0.131
 nail-0.128
 pipe-0.124
 block-0.124
 crocod-0.122
Token.
Token ID13
Feature activation+1.83e-02

Pos logit contributions
 parties+0.502
 chores+0.483
 journals+0.479
 Him+0.456
 holidays+0.445
Neg logit contributions
 spike-0.226
 nail-0.214
 pipe-0.209
 crocod-0.200
 blob-0.199
Token They
Token ID1119
Feature activation+8.00e-02

Pos logit contributions
 parties+0.029
 journals+0.028
 chores+0.028
 Him+0.028
onies+0.026
Neg logit contributions
 nail-0.033
 spike-0.033
 pipe-0.033
 block-0.032
 figure-0.032
Token had
Token ID550
Feature activation+4.07e-03

Pos logit contributions
 journals+0.179
 parties+0.179
 Him+0.178
 chores+0.178
onies+0.169
Neg logit contributions
 block-0.089
 nail-0.088
 pipe-0.087
 spike-0.087
 figure-0.085
Token many
Token ID867
Feature activation+1.26e-01

Pos logit contributions
 Him+0.009
 journals+0.009
 chores+0.009
onies+0.009
 parties+0.008
Neg logit contributions
 block-0.005
 crocod-0.005
 pipe-0.005
 nail-0.005
 spike-0.004
Token toys
Token ID14958
Feature activation+1.52e-01

Pos logit contributions
 Him+0.214
Sleep+0.191
 journals+0.189
Hope+0.188
onies+0.186
Neg logit contributions
 block-0.230
 pipe-0.220
 nail-0.219
 spike-0.219
 crocod-0.218
Token twins
Token ID20345
Feature activation+7.65e-02

Pos logit contributions
 cultures+0.083
 journals+0.082
 hospitality+0.082
Hope+0.081
Sleep+0.080
Neg logit contributions
 block-0.044
 building-0.041
 pipe-0.040
 saw-0.040
 crocod-0.039
Token who
Token ID508
Feature activation-7.49e-03

Pos logit contributions
 parties+0.163
 Him+0.159
 journals+0.159
 chores+0.159
 married+0.149
Neg logit contributions
 spike-0.097
 pipe-0.093
 nail-0.092
 snake-0.090
 crocod-0.089
Token liked
Token ID8288
Feature activation+7.58e-02

Pos logit contributions
 block+0.008
 figure+0.008
 saw+0.008
 crocod+0.007
 nail+0.007
Neg logit contributions
 journals-0.017
 Him-0.017
 chores-0.017
 unnecessary-0.017
 parties-0.016
Token to
Token ID284
Feature activation-2.11e-03

Pos logit contributions
onies+0.204
 journals+0.202
 Him+0.201
 chores+0.196
Hope+0.196
Neg logit contributions
 crocod-0.051
 block-0.049
 pipe-0.048
 nail-0.047
 spike-0.046
Token play
Token ID711
Feature activation+1.10e-01

Pos logit contributions
 block+0.003
 figure+0.003
 nail+0.003
 pipe+0.003
 mark+0.003
Neg logit contributions
 Him-0.004
 journals-0.004
onies-0.004
 unnecessary-0.004
 parties-0.004
Token games
Token ID1830
Feature activation+2.08e-01

Pos logit contributions
 parties+0.251
 chores+0.246
 journals+0.245
 Him+0.238
 holidays+0.227
Neg logit contributions
 spike-0.122
 nail-0.121
 pipe-0.119
 block-0.119
 crocod-0.115
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
 parties+0.525
 journals+0.516
 chores+0.515
 Him+0.504
onies+0.485
Neg logit contributions
 spike-0.182
 nail-0.178
 pipe-0.176
 block-0.172
 crocod-0.167
Token One
Token ID1881
Feature activation+1.04e-01

Pos logit contributions
 journals+0.028
 Him+0.027
 chores+0.027
 unnecessary+0.027
 parties+0.027
Neg logit contributions
 block-0.021
 pipe-0.020
 crocod-0.020
 nail-0.020
 circle-0.019
Token day
Token ID1110
Feature activation+4.76e-02

Pos logit contributions
 chores+0.183
 parties+0.182
 journals+0.175
 Him+0.169
 holidays+0.165
Neg logit contributions
 spike-0.165
 nail-0.163
 figure-0.155
 block-0.155
 pipe-0.155
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
 journals+0.144
 parties+0.143
 Him+0.142
 chores+0.141
onies+0.137
Neg logit contributions
 block-0.016
 nail-0.015
 pipe-0.015
 spike-0.015
 crocod-0.014
Token they
Token ID484
Feature activation+4.59e-02

Pos logit contributions
 block+0.017
 nail+0.017
 pipe+0.017
 crocod+0.017
 spike+0.016
Neg logit contributions
 journals-0.032
 Him-0.032
 parties-0.031
 chores-0.031
onies-0.031
Token lots
Token ID6041
Feature activation+1.46e-01

Pos logit contributions
 block+0.006
 crocod+0.006
 spike+0.005
 nail+0.005
 pipe+0.005
Neg logit contributions
 Him-0.013
 journals-0.012
 parties-0.012
onies-0.012
 chores-0.012
Token of
Token ID286
Feature activation+9.41e-03

Pos logit contributions
 Him+0.437
 journals+0.435
 chores+0.430
 parties+0.430
onies+0.418
Neg logit contributions
 block-0.057
 pipe-0.051
 nail-0.051
 crocod-0.050
 spike-0.050
Token fun
Token ID1257
Feature activation+1.36e-01

Pos logit contributions
 Him+0.016
 journals+0.016
 parties+0.015
onies+0.015
 hospitality+0.014
Neg logit contributions
 spike-0.016
 block-0.016
 pipe-0.016
 nail-0.016
 crocod-0.016
Token together
Token ID1978
Feature activation+1.46e-01

Pos logit contributions
 parties+0.260
 journals+0.258
 chores+0.257
 Him+0.252
onies+0.239
Neg logit contributions
 block-0.194
 nail-0.194
 spike-0.193
 pipe-0.192
 figure-0.188
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
 parties+0.338
 chores+0.327
 journals+0.318
 Him+0.313
 holidays+0.305
Neg logit contributions
 nail-0.159
 spike-0.156
 pipe-0.153
 block-0.153
 figure-0.147
Token Yesterday
Token ID34668
Feature activation+2.08e-01

Pos logit contributions
 parties+0.034
 chores+0.033
 journals+0.032
 Him+0.031
 holidays+0.031
Neg logit contributions
 nail-0.034
 figure-0.033
 spike-0.033
 pipe-0.032
 blob-0.032
Token,
Token ID11
Feature activation-1.52e-03

Pos logit contributions
 parties+0.477
 chores+0.471
 pic+0.445
 holidays+0.433
ics+0.420
Neg logit contributions
 figure-0.210
 nail-0.207
 rectangle-0.207
 claw-0.198
 grun-0.197
Token Tim
Token ID5045
Feature activation-1.27e-03

Pos logit contributions
 nail+0.002
 figure+0.002
 block+0.002
 spike+0.002
 pipe+0.002
Neg logit contributions
 parties-0.003
 chores-0.003
 journals-0.003
 Him-0.003
 holidays-0.003
Token teased
Token ID35380
Feature activation-1.59e-02

Pos logit contributions
 nail+0.001
 spike+0.001
 block+0.001
 pipe+0.001
 crocod+0.001
Neg logit contributions
 parties-0.003
 journals-0.003
 chores-0.003
 Him-0.003
onies-0.003
Token Joe
Token ID5689
Feature activation+8.26e-03

Pos logit contributions
 nail+0.014
 block+0.014
 figure+0.013
 pipe+0.013
 spike+0.013
Neg logit contributions
 parties-0.039
 journals-0.039
 chores-0.038
 Him-0.038
onies-0.036
Token when
Token ID618
Feature activation+5.04e-02

Pos logit contributions
 chores+0.020
 parties+0.020
 journals+0.019
 Him+0.019
onies+0.018
Neg logit contributions
 nail-0.008
 block-0.007
 spike-0.007
 pipe-0.007
 crocod-0.007
Token twins
Token ID20345
Feature activation+7.47e-02

Pos logit contributions
 cultures+0.107
 hospitality+0.105
 Him+0.104
Hope+0.104
Sleep+0.103
Neg logit contributions
 block-0.035
 building-0.032
,-0.032
 saw-0.031
 pipe-0.027
Token who
Token ID508
Feature activation-8.59e-03

Pos logit contributions
 parties+0.158
 chores+0.154
 journals+0.152
 Him+0.151
 married+0.143
Neg logit contributions
 spike-0.097
 pipe-0.093
 nail-0.092
 snake-0.090
 crocod-0.090
Token liked
Token ID8288
Feature activation+7.52e-02

Pos logit contributions
 block+0.010
 figure+0.009
 crocod+0.009
 saw+0.009
 hit+0.009
Neg logit contributions
 journals-0.019
 Him-0.019
 chores-0.019
 parties-0.018
 unnecessary-0.018
Token to
Token ID284
Feature activation-1.70e-03

Pos logit contributions
onies+0.203
 Him+0.201
 journals+0.199
 chores+0.195
Hope+0.194
Neg logit contributions
 block-0.050
 crocod-0.049
 pipe-0.048
 nail-0.047
 spike-0.045
Token play
Token ID711
Feature activation+1.10e-01

Pos logit contributions
 block+0.002
 figure+0.002
 nail+0.002
 pipe+0.002
 mark+0.002
Neg logit contributions
 Him-0.003
 journals-0.003
onies-0.003
 unnecessary-0.003
 parties-0.003
Token games
Token ID1830
Feature activation+2.08e-01

Pos logit contributions
 parties+0.236
 chores+0.232
 journals+0.232
 Him+0.226
onies+0.214
Neg logit contributions
 spike-0.135
 nail-0.133
 block-0.132
 pipe-0.132
 crocod-0.128
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
 parties+0.519
 chores+0.508
 journals+0.508
 Him+0.493
onies+0.476
Neg logit contributions
 spike-0.191
 nail-0.185
 pipe-0.183
 block-0.177
 crocod-0.174
Token They
Token ID1119
Feature activation+8.13e-02

Pos logit contributions
 journals+0.035
 Him+0.035
 chores+0.035
 parties+0.035
onies+0.034
Neg logit contributions
 block-0.030
 pipe-0.029
 nail-0.029
 crocod-0.029
 figure-0.028
Token were
Token ID547
Feature activation+5.43e-02

Pos logit contributions
 parties+0.192
 journals+0.192
 chores+0.190
 Him+0.189
onies+0.181
Neg logit contributions
 block-0.080
 nail-0.080
 spike-0.079
 pipe-0.079
 figure-0.076
Token very
Token ID845
Feature activation+1.01e-01

Pos logit contributions
 journals+0.108
 Him+0.107
 parties+0.103
 chores+0.103
onies+0.101
Neg logit contributions
 block-0.077
 nail-0.074
 crocod-0.074
 spike-0.073
 pipe-0.073
Token competitive
Token ID7606
Feature activation+5.14e-02

Pos logit contributions
 journals+0.193
 Him+0.189
 parties+0.187
 chores+0.186
onies+0.178
Neg logit contributions
 block-0.152
 crocod-0.150
 nail-0.149
 pipe-0.147
 spike-0.146
Token Sam
Token ID3409
Feature activation-2.89e-03

Pos logit contributions
 parties+0.018
 Him+0.017
 chores+0.017
 journals+0.017
onies+0.016
Neg logit contributions
 spike-0.010
 pipe-0.010
 block-0.010
 nail-0.010
 figure-0.009
Token who
Token ID508
Feature activation-1.97e-02

Pos logit contributions
 block+0.002
 nail+0.002
 spike+0.002
 pipe+0.002
 figure+0.002
Neg logit contributions
 parties-0.007
 journals-0.007
 chores-0.007
 Him-0.007
onies-0.007
Token loved
Token ID6151
Feature activation+1.06e-01

Pos logit contributions
 block+0.025
 spike+0.025
 nail+0.025
 pipe+0.025
 crocod+0.024
Neg logit contributions
 parties-0.040
 journals-0.040
 chores-0.040
 Him-0.039
onies-0.037
Token to
Token ID284
Feature activation+2.10e-03

Pos logit contributions
 Him+0.261
 journals+0.261
onies+0.257
 chores+0.256
 parties+0.256
Neg logit contributions
 nail-0.091
 spike-0.090
 pipe-0.090
 block-0.090
 crocod-0.087
Token play
Token ID711
Feature activation+1.07e-01

Pos logit contributions
 journals+0.004
 Him+0.004
onies+0.004
 unnecessary+0.004
 parties+0.004
Neg logit contributions
 block-0.003
 nail-0.003
 figure-0.003
 pipe-0.003
 spike-0.003
Token games
Token ID1830
Feature activation+2.08e-01

Pos logit contributions
 parties+0.255
 journals+0.251
 chores+0.250
 Him+0.245
onies+0.235
Neg logit contributions
 spike-0.107
 nail-0.106
 pipe-0.105
 block-0.105
 crocod-0.102
Token.
Token ID13
Feature activation+4.16e-03

Pos logit contributions
 parties+0.523
 journals+0.517
 chores+0.515
 Him+0.505
onies+0.487
Neg logit contributions
 spike-0.177
 nail-0.174
 pipe-0.172
 block-0.170
 crocod-0.165
Token She
Token ID1375
Feature activation+3.18e-02

Pos logit contributions
 parties+0.007
 journals+0.007
 chores+0.007
 Him+0.007
onies+0.006
Neg logit contributions
 spike-0.007
 nail-0.007
 crocod-0.007
 pipe-0.007
 figure-0.007
Token particularly
Token ID3573
Feature activation+7.04e-02

Pos logit contributions
 parties+0.072
 journals+0.071
 chores+0.070
 Him+0.067
onies+0.066
Neg logit contributions
 spike-0.035
 pipe-0.034
 nail-0.034
 crocod-0.033
 block-0.033
Token liked
Token ID8288
Feature activation+7.03e-02

Pos logit contributions
 parties+0.149
 journals+0.149
 chores+0.147
 Him+0.146
onies+0.139
Neg logit contributions
 block-0.086
 nail-0.085
 spike-0.085
 pipe-0.085
 figure-0.082
Token playing
Token ID2712
Feature activation+9.05e-02

Pos logit contributions
 Him+0.180
 journals+0.179
 chores+0.178
onies+0.174
 parties+0.174
Neg logit contributions
 block-0.055
 pipe-0.053
 nail-0.053
 spike-0.052
 crocod-0.050
Token brothers
Token ID9397
Feature activation+7.34e-02

Pos logit contributions
 Him+0.097
 cultures+0.096
 hospitality+0.096
Hope+0.095
Sleep+0.095
Neg logit contributions
 block-0.060
 building-0.053
,-0.052
 crocod-0.052
 pipe-0.051
Token who
Token ID508
Feature activation+5.14e-03

Pos logit contributions
 parties+0.160
 journals+0.153
 chores+0.152
 Him+0.151
 married+0.143
Neg logit contributions
 spike-0.092
 nail-0.092
 pipe-0.091
 snake-0.083
 crocod-0.083
Token liked
Token ID8288
Feature activation+8.14e-02

Pos logit contributions
 journals+0.011
 Him+0.011
 parties+0.011
 chores+0.011
onies+0.010
Neg logit contributions
 block-0.006
 nail-0.006
 spike-0.006
 pipe-0.006
 figure-0.006
Token to
Token ID284
Feature activation-9.21e-04

Pos logit contributions
 journals+0.213
 Him+0.213
onies+0.211
 parties+0.209
 chores+0.208
Neg logit contributions
 block-0.059
 pipe-0.058
 crocod-0.057
 nail-0.057
 spike-0.056
Token play
Token ID711
Feature activation+1.09e-01

Pos logit contributions
 block+0.001
 nail+0.001
 figure+0.001
 pipe+0.001
 spike+0.001
Neg logit contributions
 Him-0.002
 journals-0.002
 parties-0.002
onies-0.002
 chores-0.002
Token games
Token ID1830
Feature activation+2.06e-01

Pos logit contributions
 parties+0.244
 chores+0.240
 journals+0.238
 Him+0.231
 holidays+0.220
Neg logit contributions
 spike-0.127
 nail-0.124
 pipe-0.123
 block-0.121
 figure-0.119
Token.
Token ID13
Feature activation+1.77e-02

Pos logit contributions
 parties+0.504
 chores+0.490
 journals+0.488
 Him+0.469
onies+0.454
Neg logit contributions
 spike-0.207
 nail-0.200
 pipe-0.197
 crocod-0.185
 block-0.184
Token They
Token ID1119
Feature activation+8.25e-02

Pos logit contributions
 parties+0.028
 journals+0.028
 chores+0.028
 Him+0.027
onies+0.026
Neg logit contributions
 nail-0.031
 spike-0.031
 pipe-0.031
 block-0.031
 figure-0.031
Token had
Token ID550
Feature activation+6.08e-03

Pos logit contributions
 parties+0.195
 journals+0.195
 chores+0.194
 Him+0.193
onies+0.184
Neg logit contributions
 block-0.081
 nail-0.080
 spike-0.080
 pipe-0.079
 figure-0.077
Token many
Token ID867
Feature activation+1.27e-01

Pos logit contributions
 Him+0.015
 journals+0.014
onies+0.014
 chores+0.013
affles+0.013
Neg logit contributions
 block-0.007
 crocod-0.006
 nail-0.006
 pipe-0.006
 circle-0.006
Token toys
Token ID14958
Feature activation+1.52e-01

Pos logit contributions
 Him+0.232
Sleep+0.209
Hope+0.204
 hospitality+0.201
 journals+0.199
Neg logit contributions
 block-0.220
 snake-0.205
 crocod-0.204
 nail-0.204
 spike-0.204
Token like
Token ID588
Feature activation-5.16e-03

Pos logit contributions
 journals+0.159
 Him+0.158
 chores+0.152
 parties+0.152
 unnecessary+0.152
Neg logit contributions
 block-0.080
 figure-0.078
 nail-0.075
 hit-0.072
 pipe-0.071
Token to
Token ID284
Feature activation-1.74e-02

Pos logit contributions
 block+0.003
 nail+0.003
 crocod+0.003
 pipe+0.003
 spike+0.003
Neg logit contributions
 journals-0.014
onies-0.014
 Him-0.014
 chores-0.014
 parties-0.014
Token play
Token ID711
Feature activation+9.79e-02

Pos logit contributions
 block+0.027
 figure+0.026
 nail+0.026
 pipe+0.025
 spike+0.025
Neg logit contributions
 Him-0.033
 journals-0.033
 parties-0.031
onies-0.031
 chores-0.031
Token together
Token ID1978
Feature activation+1.49e-01

Pos logit contributions
 parties+0.223
 chores+0.219
 journals+0.219
 Him+0.213
onies+0.203
Neg logit contributions
 spike-0.108
 nail-0.107
 pipe-0.106
 block-0.106
 crocod-0.102
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
 parties+0.358
 chores+0.353
 journals+0.349
 Him+0.341
onies+0.327
Neg logit contributions
 spike-0.150
 pipe-0.148
 nail-0.147
 block-0.141
 crocod-0.140
Token Yesterday
Token ID34668
Feature activation+2.06e-01

Pos logit contributions
 journals+0.034
 parties+0.034
 chores+0.034
 Him+0.034
onies+0.032
Neg logit contributions
 block-0.033
 nail-0.033
 pipe-0.033
 spike-0.032
 crocod-0.032
Token,
Token ID11
Feature activation-2.58e-03

Pos logit contributions
 parties+0.569
 chores+0.564
 journals+0.546
 Him+0.536
 holidays+0.529
Neg logit contributions
 nail-0.128
 spike-0.124
 pipe-0.120
 figure-0.120
 block-0.119
Token they
Token ID484
Feature activation+6.27e-02

Pos logit contributions
 block+0.003
 crocod+0.003
 pipe+0.003
 nail+0.003
 spike+0.003
Neg logit contributions
 journals-0.006
 Him-0.006
onies-0.006
 chores-0.005
 parties-0.005
Token went
Token ID1816
Feature activation+7.80e-02

Pos logit contributions
 Him+0.149
 journals+0.148
 chores+0.147
 parties+0.146
onies+0.142
Neg logit contributions
 block-0.063
 nail-0.061
 pipe-0.060
 figure-0.060
 spike-0.058
Token to
Token ID284
Feature activation+3.81e-03

Pos logit contributions
 parties+0.188
 journals+0.186
 chores+0.186
 Him+0.183
onies+0.176
Neg logit contributions
 nail-0.075
 spike-0.075
 block-0.075
 pipe-0.074
 figure-0.071
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
 journals+0.010
 Him+0.010
onies+0.010
 parties+0.010
 chores+0.010
Neg logit contributions
 block-0.003
 nail-0.003
 spike-0.002
 figure-0.002
 pipe-0.002
Token us
Token ID514
Feature activation+5.77e-02

Pos logit contributions
 journals+0.175
 chores+0.173
 parties+0.173
 Him+0.172
onies+0.165
Neg logit contributions
 block-0.117
 spike-0.116
 pipe-0.116
 nail-0.114
 figure-0.113
Token!
Token ID0
Feature activation-3.82e-02

Pos logit contributions
 journals+0.139
 parties+0.138
 chores+0.137
 Him+0.137
onies+0.131
Neg logit contributions
 block-0.054
 pipe-0.053
 spike-0.053
 nail-0.053
 figure-0.051
Token We
Token ID775
Feature activation+1.46e-02

Pos logit contributions
 block+0.053
 spike+0.053
 nail+0.053
 pipe+0.053
 figure+0.051
Neg logit contributions
 parties-0.075
 journals-0.074
 chores-0.074
 Him-0.073
onies-0.069
Token love
Token ID1842
Feature activation+9.48e-02

Pos logit contributions
 parties+0.027
 chores+0.026
 journals+0.026
 Him+0.026
onies+0.025
Neg logit contributions
 nail-0.022
 spike-0.022
 pipe-0.022
 block-0.022
 crocod-0.021
Token playing
Token ID2712
Feature activation+1.03e-01

Pos logit contributions
 journals+0.156
 chores+0.152
 Him+0.149
 parties+0.149
onies+0.146
Neg logit contributions
 spike-0.162
 pipe-0.161
 block-0.161
 crocod-0.160
 nail-0.157
Token games
Token ID1830
Feature activation+2.06e-01

Pos logit contributions
 parties+0.208
 chores+0.198
 journals+0.198
 Him+0.187
 holidays+0.185
Neg logit contributions
 nail-0.143
 spike-0.141
 pipe-0.138
 grun-0.136
 figure-0.135
Token."
Token ID526
Feature activation+7.81e-02

Pos logit contributions
 parties+0.471
 journals+0.470
 chores+0.466
 Him+0.463
onies+0.443
Neg logit contributions
 block-0.215
 nail-0.214
 spike-0.214
 pipe-0.213
 figure-0.205
Token␣
Token ID220
Feature activation+5.69e-02

Pos logit contributions
 journals+0.181
 parties+0.181
 chores+0.181
 Him+0.176
onies+0.172
Neg logit contributions
 block-0.080
 pipe-0.079
 spike-0.078
 nail-0.078
 crocod-0.076
Token\n
Token ID198
Feature activation+2.70e-02

Pos logit contributions
 journals+0.137
 parties+0.136
 chores+0.135
 Him+0.132
onies+0.127
Neg logit contributions
 block-0.055
 pipe-0.052
 spike-0.052
 nail-0.052
 crocod-0.050
Token\n
Token ID198
Feature activation+2.13e-02

Pos logit contributions
 journals+0.063
 parties+0.063
 chores+0.063
 Him+0.062
onies+0.059
Neg logit contributions
 block-0.028
 spike-0.027
 nail-0.027
 pipe-0.027
 figure-0.026
TokenThe
Token ID464
Feature activation-2.45e-02

Pos logit contributions
 parties+0.042
 journals+0.042
 chores+0.041
 Him+0.041
onies+0.039
Neg logit contributions
 block-0.031
 nail-0.030
 spike-0.030
 pipe-0.030
 figure-0.029
Token play
Token ID711
Feature activation+1.02e-01

Pos logit contributions
 block+0.020
 nail+0.019
 pipe+0.019
 figure+0.019
 spike+0.019
Neg logit contributions
 journals-0.022
 Him-0.022
 parties-0.022
 chores-0.022
onies-0.021
Token in
Token ID287
Feature activation-9.04e-03

Pos logit contributions
 parties+0.214
 journals+0.212
 chores+0.211
 Him+0.207
onies+0.197
Neg logit contributions
 spike-0.129
 nail-0.129
 block-0.128
 pipe-0.127
 figure-0.124
Token the
Token ID262
Feature activation+1.88e-02

Pos logit contributions
 block+0.005
 nail+0.005
 spike+0.005
 pipe+0.005
 figure+0.004
Neg logit contributions
 Him-0.026
onies-0.026
 journals-0.026
 chores-0.025
 parties-0.025
Token park
Token ID3952
Feature activation+6.58e-02

Pos logit contributions
 parties+0.038
 journals+0.038
 chores+0.038
 Him+0.037
onies+0.035
Neg logit contributions
 nail-0.025
 spike-0.024
 block-0.024
 pipe-0.024
 figure-0.024
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 parties+0.159
 chores+0.154
 journals+0.152
 Him+0.149
 holidays+0.144
Neg logit contributions
 spike-0.066
 nail-0.066
 pipe-0.064
 block-0.061
 figure-0.060
Token Yesterday
Token ID34668
Feature activation+2.06e-01

Pos logit contributions
 journals+0.033
 parties+0.033
 chores+0.032
 Him+0.032
onies+0.031
Neg logit contributions
 block-0.031
 nail-0.031
 pipe-0.031
 spike-0.031
 crocod-0.030
Token,
Token ID11
Feature activation-2.39e-03

Pos logit contributions
 parties+0.544
 chores+0.535
 holidays+0.501
 pic+0.498
 journals+0.496
Neg logit contributions
 nail-0.153
 figure-0.148
 spike-0.143
 block-0.141
 pipe-0.140
Token they
Token ID484
Feature activation+6.40e-02

Pos logit contributions
 block+0.003
 spike+0.003
 nail+0.003
 crocod+0.003
 pipe+0.003
Neg logit contributions
 journals-0.005
 Him-0.005
onies-0.005
 chores-0.005
 parties-0.005
Token went
Token ID1816
Feature activation+8.06e-02

Pos logit contributions
 parties+0.147
 journals+0.146
 chores+0.145
 Him+0.143
onies+0.138
Neg logit contributions
 spike-0.068
 nail-0.068
 block-0.068
 pipe-0.067
 figure-0.065
Token to
Token ID284
Feature activation+6.08e-03

Pos logit contributions
 parties+0.186
 chores+0.183
 journals+0.177
 Him+0.174
onies+0.168
Neg logit contributions
 nail-0.090
 spike-0.089
 block-0.086
 pipe-0.085
 figure-0.085
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
 journals+0.016
 Him+0.016
 parties+0.016
 chores+0.016
onies+0.016
Neg logit contributions
 block-0.004
 nail-0.004
 spike-0.004
 pipe-0.004
 figure-0.004
Token always
Token ID1464
Feature activation+1.04e-02

Pos logit contributions
 spike+0.005
 pipe+0.005
 nail+0.005
 crocod+0.005
 figure+0.005
Neg logit contributions
 chores-0.006
 parties-0.006
 journals-0.006
 Him-0.006
 holidays-0.005
Token excited
Token ID6568
Feature activation+9.82e-02

Pos logit contributions
 parties+0.017
 chores+0.017
 journals+0.017
 married+0.017
 Him+0.017
Neg logit contributions
 spike-0.017
 nail-0.017
 crocod-0.017
 pipe-0.016
 snake-0.016
Token for
Token ID329
Feature activation-1.17e-02

Pos logit contributions
 journals+0.242
 parties+0.241
 chores+0.240
 Him+0.238
onies+0.230
Neg logit contributions
 block-0.084
 pipe-0.084
 nail-0.083
 spike-0.083
 figure-0.080
Token Wed
Token ID3300
Feature activation+1.67e-01

Pos logit contributions
 figure+0.013
 block+0.013
 pipe+0.013
 nail+0.012
 crocod+0.012
Neg logit contributions
onies-0.026
 Him-0.026
 journals-0.026
 chores-0.025
 parties-0.025
Tokennes
Token ID2516
Feature activation+6.90e-02

Pos logit contributions
 journals+0.201
 chores+0.193
 Him+0.189
 parties+0.185
onies+0.180
Neg logit contributions
 nail-0.361
 figure-0.359
 block-0.358
 crocod-0.343
 spike-0.341
Tokendays
Token ID12545
Feature activation+2.06e-01

Pos logit contributions
 parties+0.055
 journals+0.053
 chores+0.051
 Him+0.051
onies+0.043
Neg logit contributions
 spike-0.178
 nail-0.176
 block-0.175
 pipe-0.175
 crocod-0.173
Token because
Token ID780
Feature activation+2.93e-02

Pos logit contributions
 parties+0.520
 journals+0.516
 chores+0.514
 Him+0.509
onies+0.489
Neg logit contributions
 nail-0.168
 spike-0.168
 block-0.167
 pipe-0.166
 crocod-0.158
Token it
Token ID340
Feature activation-5.26e-02

Pos logit contributions
 journals+0.075
 parties+0.075
 Him+0.075
 chores+0.074
onies+0.072
Neg logit contributions
 block-0.023
 spike-0.022
 nail-0.022
 pipe-0.022
 figure-0.021
Token was
Token ID373
Feature activation-1.44e-02

Pos logit contributions
 nail+0.043
 block+0.041
 spike+0.041
 crocod+0.040
 pipe+0.040
Neg logit contributions
 parties-0.129
 chores-0.129
 journals-0.127
 holidays-0.124
onies-0.120
Token the
Token ID262
Feature activation-3.34e-02

Pos logit contributions
 nail+0.020
 spike+0.020
 pipe+0.020
 block+0.019
 figure+0.019
Neg logit contributions
 parties-0.029
 chores-0.028
 journals-0.028
 Him-0.027
 holidays-0.026
Token day
Token ID1110
Feature activation+2.37e-02

Pos logit contributions
 block+0.058
 pipe+0.057
 nail+0.057
 spike+0.057
 figure+0.056
Neg logit contributions
 journals-0.054
 parties-0.054
 Him-0.054
 chores-0.053
onies-0.051
Token is
Token ID318
Feature activation-2.81e-02

Pos logit contributions
 journals+0.170
 Him+0.167
 parties+0.166
 chores+0.164
onies+0.156
Neg logit contributions
 block-0.083
 nail-0.081
 spike-0.081
 pipe-0.081
 figure-0.080
Token always
Token ID1464
Feature activation+4.96e-03

Pos logit contributions
 spike+0.038
 nail+0.037
 block+0.037
 pipe+0.037
 figure+0.036
Neg logit contributions
 parties-0.057
 chores-0.055
 Him-0.054
 journals-0.054
 married-0.052
Token busy
Token ID8179
Feature activation+1.54e-02

Pos logit contributions
 parties+0.010
 chores+0.010
 journals+0.010
 Him+0.010
onies+0.009
Neg logit contributions
 nail-0.007
 spike-0.007
 block-0.007
 pipe-0.007
 figure-0.006
Token on
Token ID319
Feature activation-1.25e-02

Pos logit contributions
 parties+0.034
 journals+0.033
 chores+0.033
 Him+0.033
onies+0.031
Neg logit contributions
 nail-0.018
 block-0.018
 spike-0.018
 pipe-0.018
 figure-0.017
Token Tues
Token ID48496
Feature activation+1.10e-01

Pos logit contributions
 spike+0.016
 block+0.016
 nail+0.016
 pipe+0.016
 crocod+0.016
Neg logit contributions
 journals-0.025
 Him-0.025
 parties-0.025
 chores-0.024
onies-0.024
Tokendays
Token ID12545
Feature activation+2.06e-01

Pos logit contributions
 journals+0.091
 parties+0.091
 chores+0.091
 Him+0.087
onies+0.080
Neg logit contributions
 block-0.279
 nail-0.275
 figure-0.273
 pipe-0.273
 spike-0.272
Token,"
Token ID553
Feature activation+7.38e-02

Pos logit contributions
 journals+0.470
 Him+0.456
 parties+0.453
 chores+0.451
onies+0.434
Neg logit contributions
 block-0.227
 nail-0.218
 pipe-0.213
 spike-0.211
 figure-0.209
Token replied
Token ID8712
Feature activation-2.68e-03

Pos logit contributions
 journals+0.170
 chores+0.168
 Him+0.167
 parties+0.167
Hope+0.157
Neg logit contributions
 block-0.076
 nail-0.075
 pipe-0.074
 spike-0.074
 crocod-0.072
Token Max
Token ID5436
Feature activation+4.02e-02

Pos logit contributions
 pipe+0.004
 nail+0.004
 block+0.004
 spike+0.004
 figure+0.004
Neg logit contributions
 Him-0.005
 parties-0.005
 chores-0.005
 journals-0.005
Hope-0.005
Token.
Token ID13
Feature activation-4.86e-03

Pos logit contributions
 chores+0.089
 journals+0.089
 Him+0.089
 parties+0.089
onies+0.084
Neg logit contributions
 pipe-0.044
 spike-0.043
 nail-0.043
 crocod-0.042
 block-0.041
Token\n
Token ID198
Feature activation+2.47e-02

Pos logit contributions
 nail+0.007
 spike+0.007
 pipe+0.007
 figure+0.007
 crocod+0.006
Neg logit contributions
 parties-0.009
 Him-0.009
 journals-0.009
 chores-0.009
Hope-0.009
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 parties+0.228
 chores+0.226
 journals+0.224
 Him+0.223
onies+0.212
Neg logit contributions
 spike-0.091
 nail-0.089
 pipe-0.089
 block-0.086
 crocod-0.085
Token They
Token ID1119
Feature activation+8.14e-02

Pos logit contributions
 parties+0.023
 journals+0.022
 chores+0.022
 Him+0.022
onies+0.021
Neg logit contributions
 nail-0.025
 spike-0.025
 pipe-0.025
 block-0.025
 figure-0.024
Token like
Token ID588
Feature activation+9.39e-04

Pos logit contributions
 journals+0.196
 Him+0.193
 parties+0.185
 chores+0.183
 unnecessary+0.182
Neg logit contributions
 block-0.084
 figure-0.083
 nail-0.078
 hit-0.076
 pipe-0.074
Token to
Token ID284
Feature activation-1.40e-02

Pos logit contributions
 journals+0.003
 Him+0.002
 parties+0.002
 chores+0.002
onies+0.002
Neg logit contributions
 block-0.001
 nail-0.001
 pipe-0.001
 crocod-0.001
 spike-0.001
Token play
Token ID711
Feature activation+1.06e-01

Pos logit contributions
 block+0.021
 figure+0.020
 nail+0.020
 pipe+0.019
 spike+0.019
Neg logit contributions
 Him-0.028
 journals-0.028
onies-0.027
 unnecessary-0.026
 parties-0.026
Token games
Token ID1830
Feature activation+2.06e-01

Pos logit contributions
 journals+0.243
 Him+0.241
 parties+0.240
 chores+0.239
onies+0.231
Neg logit contributions
 block-0.115
 nail-0.112
 pipe-0.112
 spike-0.111
 figure-0.108
Token.
Token ID13
Feature activation+2.72e-02

Pos logit contributions
 parties+0.527
 journals+0.525
 chores+0.521
 Him+0.517
onies+0.497
Neg logit contributions
 nail-0.161
 spike-0.160
 block-0.160
 pipe-0.159
 figure-0.151
Token Today
Token ID6288
Feature activation+9.35e-02

Pos logit contributions
 parties+0.041
 chores+0.041
 journals+0.040
 Him+0.040
 holidays+0.037
Neg logit contributions
 nail-0.050
 spike-0.050
 pipe-0.049
 block-0.049
 figure-0.049
Token they
Token ID484
Feature activation+5.52e-02

Pos logit contributions
 parties+0.247
 chores+0.244
 journals+0.241
 Him+0.238
onies+0.230
Neg logit contributions
 nail-0.068
 spike-0.067
 block-0.066
 pipe-0.066
 figure-0.064
Token play
Token ID711
Feature activation+9.55e-02

Pos logit contributions
 Him+0.121
 journals+0.119
 parties+0.115
 chores+0.115
onies+0.112
Neg logit contributions
 block-0.068
 figure-0.067
 nail-0.066
 hit-0.063
 pipe-0.062
Token a
Token ID257
Feature activation+8.80e-02

Pos logit contributions
 Him+0.206
 journals+0.205
 parties+0.197
 chores+0.197
onies+0.195
Neg logit contributions
 block-0.122
 nail-0.116
 pipe-0.116
 figure-0.114
 spike-0.114
Token bushes
Token ID37413
Feature activation-2.34e-02

Pos logit contributions
 crocod+0.055
 grun+0.054
 blob+0.054
 figure+0.053
 nail+0.053
Neg logit contributions
 chores-0.071
 journals-0.069
 parties-0.067
 holidays-0.064
 magazines-0.063
Token.
Token ID13
Feature activation-2.76e-02

Pos logit contributions
 spike+0.028
 nail+0.027
 crocod+0.026
 block+0.025
 blob+0.025
Neg logit contributions
 parties-0.050
 chores-0.049
 journals-0.047
 holidays-0.046
 Him-0.046
Token It
Token ID632
Feature activation-8.24e-02

Pos logit contributions
 block+0.042
 spike+0.042
 pipe+0.042
 nail+0.042
 crocod+0.041
Neg logit contributions
 journals-0.050
 Him-0.050
 parties-0.050
 chores-0.050
onies-0.046
Token was
Token ID373
Feature activation-4.27e-02

Pos logit contributions
 nail+0.104
 block+0.102
 rectangle+0.098
 blob+0.097
 pipe+0.096
Neg logit contributions
Hope-0.143
 chores-0.141
 married-0.138
 journals-0.137
cases-0.136
Token a
Token ID257
Feature activation+5.77e-03

Pos logit contributions
 block+0.075
oke+0.075
 pipe+0.074
 nail+0.074
 spike+0.074
Neg logit contributions
 chores-0.064
 parties-0.061
 journals-0.057
 holidays-0.055
 married-0.054
Token crow
Token ID37593
Feature activation-1.99e-01

Pos logit contributions
 chores+0.010
 journals+0.009
 holidays+0.009
 parties+0.009
 pic+0.009
Neg logit contributions
oke-0.008
 grun-0.007
 block-0.007
 tipped-0.007
et-0.007
Token,
Token ID11
Feature activation-6.50e-02

Pos logit contributions
 rectangle+0.259
 nail+0.253
 block+0.247
 pipe+0.241
 blob+0.236
Neg logit contributions
 company-0.353
onies-0.338
 chores-0.338
Hope-0.333
 Him-0.333
Token ch
Token ID442
Feature activation-7.05e-02

Pos logit contributions
 block+0.081
 nail+0.081
 pipe+0.081
 crocod+0.080
 blob+0.079
Neg logit contributions
 chores-0.121
 Him-0.120
 journals-0.119
 parties-0.117
Hope-0.114
Tokenir
Token ID343
Feature activation-1.11e-01

Pos logit contributions
 block+0.133
 pipe+0.130
 nail+0.128
 man+0.127
 figure+0.125
Neg logit contributions
 Him-0.100
onies-0.096
 unnecessary-0.096
agged-0.092
 journals-0.092
Tokenping
Token ID13886
Feature activation-5.39e-02

Pos logit contributions
 block+0.236
 nail+0.236
 pipe+0.232
 figure+0.227
 spike+0.225
Neg logit contributions
 Him-0.134
onies-0.125
 parties-0.117
 chores-0.110
 journals-0.110
Token and
Token ID290
Feature activation+2.46e-02

Pos logit contributions
 nail+0.071
 pipe+0.070
 block+0.069
 rectangle+0.067
 spike+0.066
Neg logit contributions
 parties-0.097
 Him-0.096
 company-0.095
 ponies-0.094
 chores-0.093
Token.
Token ID13
Feature activation-5.24e-02

Pos logit contributions
 grun+0.083
 spike+0.083
 nail+0.083
 pipe+0.081
 block+0.081
Neg logit contributions
 parties-0.159
 chores-0.155
 Him-0.155
 journals-0.154
onies-0.147
Token It
Token ID632
Feature activation-9.14e-02

Pos logit contributions
 nail+0.061
 spike+0.061
 pipe+0.061
 block+0.061
 figure+0.059
Neg logit contributions
 journals-0.112
 parties-0.112
 Him-0.112
 chores-0.111
onies-0.106
Token is
Token ID318
Feature activation-6.49e-02

Pos logit contributions
 nail+0.109
 pipe+0.107
 spike+0.105
 block+0.105
 figure+0.103
Neg logit contributions
 chores-0.189
 parties-0.186
 journals-0.185
 Him-0.182
onies-0.179
Token a
Token ID257
Feature activation+3.12e-04

Pos logit contributions
 nail+0.102
 block+0.101
 spike+0.100
 figure+0.100
 pipe+0.100
Neg logit contributions
 chores-0.111
 journals-0.106
 parties-0.105
 Him-0.104
 married-0.102
Token horn
Token ID12718
Feature activation-9.79e-02

Pos logit contributions
 chores+0.001
 journals+0.001
 parties+0.001
 married+0.001
 unnecessary+0.000
Neg logit contributions
 grun-0.000
oke-0.000
 tipped-0.000
 block-0.000
 shook-0.000
Tokenet
Token ID316
Feature activation-1.98e-01

Pos logit contributions
 nail+0.141
 grun+0.140
 spike+0.138
 pipe+0.136
 figure+0.125
Neg logit contributions
 parties-0.165
 chores-0.165
 ourselves-0.159
 Him-0.157
 married-0.154
Token."
Token ID526
Feature activation+3.54e-02

Pos logit contributions
 nail+0.250
 pipe+0.244
 spike+0.243
 grun+0.235
 crocod+0.234
Neg logit contributions
 chores-0.392
 parties-0.391
onies-0.381
 Him-0.380
 journals-0.364
Token\n
Token ID198
Feature activation-5.55e-03

Pos logit contributions
 chores+0.060
 Him+0.060
 parties+0.059
 journals+0.057
 holidays+0.055
Neg logit contributions
 pipe-0.054
 blob-0.054
 nail-0.054
 spike-0.054
 block-0.053
Token\n
Token ID198
Feature activation-8.60e-03

Pos logit contributions
 nail+0.008
 blob+0.008
 pipe+0.008
 spike+0.008
 crocod+0.008
Neg logit contributions
 chores-0.010
 Him-0.010
 parties-0.010
 journals-0.009
Sleep-0.009
TokenTom
Token ID13787
Feature activation-8.32e-02

Pos logit contributions
 blob+0.014
 nail+0.014
 spike+0.013
 crocod+0.013
 figure+0.013
Neg logit contributions
First-0.013
 ourselves-0.013
Hope-0.012
 chores-0.012
Bon-0.012
Token does
Token ID857
Feature activation-6.49e-02

Pos logit contributions
 nail+0.120
 crocod+0.120
 pipe+0.118
 spike+0.115
 figure+0.111
Neg logit contributions
 chores-0.154
 journals-0.154
 parties-0.153
onies-0.149
 Him-0.143
Token of
Token ID286
Feature activation-4.57e-02

Pos logit contributions
 spike+0.090
 pipe+0.083
 nail+0.083
 figure+0.082
 crocod+0.082
Neg logit contributions
 parties-0.183
 chores-0.176
 journals-0.175
 Him-0.168
onies-0.166
Token a
Token ID257
Feature activation+3.83e-02

Pos logit contributions
 block+0.060
 nail+0.059
 spike+0.059
 pipe+0.058
 crocod+0.057
Neg logit contributions
 journals-0.093
 parties-0.093
 chores-0.092
 Him-0.092
onies-0.088
Token fox
Token ID21831
Feature activation-8.70e-02

Pos logit contributions
 journals+0.076
 parties+0.076
 chores+0.075
 holidays+0.070
 ponies+0.069
Neg logit contributions
 grun-0.049
 spike-0.046
 pipe-0.044
 nail-0.044
 block-0.044
Token and
Token ID290
Feature activation+1.41e-02

Pos logit contributions
 spike+0.099
 nail+0.098
 block+0.094
 pipe+0.094
 grun+0.091
Neg logit contributions
 parties-0.187
 chores-0.186
 journals-0.185
 Him-0.180
onies-0.173
Token a
Token ID257
Feature activation+6.51e-03

Pos logit contributions
 parties+0.028
 journals+0.028
 chores+0.028
 Him+0.027
onies+0.026
Neg logit contributions
 block-0.019
 nail-0.019
 spike-0.019
 pipe-0.019
 figure-0.018
Token crow
Token ID37593
Feature activation-1.96e-01

Pos logit contributions
 journals+0.012
 chores+0.012
 parties+0.011
 ponies+0.011
 Him+0.011
Neg logit contributions
 grun-0.008
 spike-0.008
oke-0.008
 pipe-0.008
 block-0.008
Token.
Token ID13
Feature activation-3.10e-02

Pos logit contributions
 nail+0.251
 spike+0.244
 pipe+0.243
 rectangle+0.238
 block+0.228
Neg logit contributions
 chores-0.381
 parties-0.377
 Him-0.373
 journals-0.372
onies-0.364
Token She
Token ID1375
Feature activation+8.66e-04

Pos logit contributions
 pipe+0.053
 figure+0.052
 spike+0.052
 nail+0.052
 block+0.052
Neg logit contributions
 parties-0.050
 Him-0.050
 journals-0.047
 chores-0.047
Hope-0.046
Token took
Token ID1718
Feature activation+1.62e-02

Pos logit contributions
 journals+0.002
 parties+0.002
 Him+0.002
onies+0.002
Hope+0.002
Neg logit contributions
 pipe-0.001
 nail-0.001
 spike-0.001
 blob-0.001
 block-0.001
Token the
Token ID262
Feature activation+2.10e-03

Pos logit contributions
 parties+0.036
 journals+0.036
 chores+0.036
 Him+0.035
onies+0.034
Neg logit contributions
 block-0.018
 nail-0.018
 spike-0.018
 pipe-0.018
 crocod-0.017
Token book
Token ID1492
Feature activation+8.74e-03

Pos logit contributions
 chores+0.004
 journals+0.004
 Him+0.004
 parties+0.004
onies+0.004
Neg logit contributions
 spike-0.003
 rectangle-0.003
 pipe-0.003
 block-0.003
 nail-0.003
Token is
Token ID318
Feature activation-6.03e-02

Pos logit contributions
 nail+0.131
 pipe+0.130
 figure+0.124
 spike+0.123
 block+0.119
Neg logit contributions
 journals-0.157
 parties-0.154
 chores-0.153
 Him-0.152
onies-0.147
Token a
Token ID257
Feature activation+3.34e-03

Pos logit contributions
 figure+0.088
 nail+0.087
 pipe+0.087
 spike+0.084
 block+0.084
Neg logit contributions
 chores-0.106
 parties-0.104
 married-0.103
 Him-0.102
 journals-0.098
Token crow
Token ID37593
Feature activation-1.95e-01

Pos logit contributions
 married+0.006
 parties+0.006
 chores+0.006
 journals+0.006
 holidays+0.005
Neg logit contributions
oke-0.004
 grun-0.004
 tipped-0.004
er-0.004
 block-0.004
Token.
Token ID13
Feature activation-5.06e-02

Pos logit contributions
 nail+0.253
 rectangle+0.243
 spike+0.241
 pipe+0.240
 grun+0.235
Neg logit contributions
 parties-0.361
 chores-0.352
onies-0.348
 Him-0.345
 company-0.345
Token The
Token ID383
Feature activation-5.53e-02

Pos logit contributions
 block+0.049
 nail+0.049
 spike+0.048
 pipe+0.048
 crocod+0.046
Neg logit contributions
 parties-0.121
 journals-0.121
 chores-0.120
 Him-0.118
onies-0.114
Token crow
Token ID37593
Feature activation-1.95e-01

Pos logit contributions
 rectangle+0.057
 compartment+0.056
 grun+0.054
oke+0.051
inn+0.051
Neg logit contributions
 chores-0.119
 ponies-0.108
 Him-0.104
 unnecessary-0.099
 amen-0.097
Token is
Token ID318
Feature activation-7.22e-02

Pos logit contributions
 nail+0.283
 pipe+0.282
 spike+0.268
 figure+0.259
 block+0.255
Neg logit contributions
 parties-0.354
 journals-0.354
onies-0.350
 chores-0.349
 Him-0.349
Token very
Token ID845
Feature activation+5.22e-02

Pos logit contributions
 nail+0.110
 pipe+0.109
 spike+0.105
 figure+0.104
 block+0.103
Neg logit contributions
 married-0.127
 chores-0.122
 parties-0.121
 Him-0.120
 journals-0.113
Token hungry
Token ID14720
Feature activation-2.55e-02

Pos logit contributions
 parties+0.079
 chores+0.078
 married+0.077
 Him+0.076
onies+0.073
Neg logit contributions
 pipe-0.088
 nail-0.088
 spike-0.086
 blob-0.085
 block-0.085
Token and
Token ID290
Feature activation+2.55e-02

Pos logit contributions
 spike+0.027
 nail+0.026
 pipe+0.026
 crocod+0.024
 grun+0.024
Neg logit contributions
 parties-0.060
 chores-0.058
 journals-0.057
 Him-0.056
onies-0.054
Token eager
Token ID11069
Feature activation+5.49e-02

Pos logit contributions
 parties+0.051
 chores+0.051
 journals+0.050
 Him+0.050
onies+0.047
Neg logit contributions
 nail-0.034
 pipe-0.034
 spike-0.034
 block-0.034
 figure-0.033
Tokenak
Token ID461
Feature activation-1.05e-01

Pos logit contributions
 spike+0.086
 figure+0.085
 nail+0.084
 pipe+0.083
 snake+0.081
Neg logit contributions
 parties-0.115
 journals-0.113
 chores-0.112
onies-0.110
 Him-0.108
Token.
Token ID13
Feature activation-4.00e-02

Pos logit contributions
 spike+0.119
 nail+0.119
 grun+0.114
 pipe+0.113
 figure+0.113
Neg logit contributions
 chores-0.215
 parties-0.212
onies-0.204
 journals-0.202
 Him-0.198
Token It
Token ID632
Feature activation-8.80e-02

Pos logit contributions
 block+0.042
 nail+0.040
 spike+0.039
 snake+0.038
 crocod+0.038
Neg logit contributions
 journals-0.094
 parties-0.094
 chores-0.094
 Him-0.091
onies-0.088
Token is
Token ID318
Feature activation-6.03e-02

Pos logit contributions
 nail+0.131
 pipe+0.130
 figure+0.124
 spike+0.123
 block+0.119
Neg logit contributions
 journals-0.157
 parties-0.154
 chores-0.153
 Him-0.152
onies-0.147
Token a
Token ID257
Feature activation+3.34e-03

Pos logit contributions
 figure+0.088
 nail+0.087
 pipe+0.087
 spike+0.084
 block+0.084
Neg logit contributions
 chores-0.106
 parties-0.104
 married-0.103
 Him-0.102
 journals-0.098
Token crow
Token ID37593
Feature activation-1.95e-01

Pos logit contributions
 married+0.006
 parties+0.006
 chores+0.006
 journals+0.006
 holidays+0.005
Neg logit contributions
oke-0.004
 grun-0.004
 tipped-0.004
er-0.004
 block-0.004
Token.
Token ID13
Feature activation-5.06e-02

Pos logit contributions
 nail+0.253
 rectangle+0.243
 spike+0.241
 pipe+0.240
 grun+0.235
Neg logit contributions
 parties-0.361
 chores-0.352
onies-0.348
 Him-0.345
 company-0.345
Token The
Token ID383
Feature activation-5.53e-02

Pos logit contributions
 block+0.049
 nail+0.049
 spike+0.048
 pipe+0.048
 crocod+0.046
Neg logit contributions
 parties-0.121
 journals-0.121
 chores-0.120
 Him-0.118
onies-0.114
Token crow
Token ID37593
Feature activation-1.95e-01

Pos logit contributions
 rectangle+0.057
 compartment+0.056
 grun+0.054
oke+0.051
inn+0.051
Neg logit contributions
 chores-0.119
 ponies-0.108
 Him-0.104
 unnecessary-0.099
 amen-0.097
Token is
Token ID318
Feature activation-7.22e-02

Pos logit contributions
 nail+0.283
 pipe+0.282
 spike+0.268
 figure+0.259
 block+0.255
Neg logit contributions
 parties-0.354
 journals-0.354
onies-0.350
 chores-0.349
 Him-0.349
Token very
Token ID845
Feature activation+5.22e-02

Pos logit contributions
 nail+0.110
 pipe+0.109
 spike+0.105
 figure+0.104
 block+0.103
Neg logit contributions
 married-0.127
 chores-0.122
 parties-0.121
 Him-0.120
 journals-0.113
Token a
Token ID257
Feature activation+5.82e-02

Pos logit contributions
 block+0.002
 crocod+0.002
 snake+0.001
 pipe+0.001
 hit+0.001
Neg logit contributions
 Him-0.005
 journals-0.004
 parties-0.004
 chores-0.004
 holidays-0.004
Token lick
Token ID36728
Feature activation-5.16e-02

Pos logit contributions
 chores+0.109
 parties+0.108
 journals+0.106
 Him+0.101
 hospitality+0.099
Neg logit contributions
 nail-0.082
 spike-0.082
 figure-0.081
 pipe-0.080
 crocod-0.080
Token and
Token ID290
Feature activation+3.56e-02

Pos logit contributions
 nail+0.063
 spike+0.061
 pipe+0.058
 figure+0.057
 bolt+0.057
Neg logit contributions
 chores-0.106
 parties-0.105
 Him-0.102
 journals-0.102
onies-0.099
Token a
Token ID257
Feature activation+3.71e-02

Pos logit contributions
 parties+0.067
 journals+0.067
 chores+0.067
 Him+0.065
onies+0.061
Neg logit contributions
 nail-0.052
 pipe-0.051
 block-0.051
 spike-0.051
 figure-0.049
Token n
Token ID299
Feature activation-2.90e-02

Pos logit contributions
 chores+0.058
 journals+0.055
 parties+0.052
 company+0.051
Sleep+0.051
Neg logit contributions
 figure-0.057
 pipe-0.057
 spike-0.056
 nail-0.056
 grun-0.055
Tokenudge
Token ID12587
Feature activation-1.95e-01

Pos logit contributions
 spike+0.052
 nail+0.052
 pipe+0.051
 crocod+0.050
 figure+0.049
Neg logit contributions
 parties-0.045
 journals-0.045
 Him-0.044
 chores-0.042
onies-0.042
Token.
Token ID13
Feature activation-4.63e-02

Pos logit contributions
 nail+0.270
 pipe+0.268
 spike+0.262
 figure+0.243
 rectangle+0.243
Neg logit contributions
 chores-0.350
 married-0.341
 journals-0.338
 parties-0.332
 Him-0.330
Token Tom
Token ID4186
Feature activation-5.78e-02

Pos logit contributions
 nail+0.058
 pipe+0.057
 figure+0.057
 spike+0.057
 block+0.055
Neg logit contributions
 parties-0.097
 journals-0.097
 Him-0.096
 chores-0.096
onies-0.090
Token smiles
Token ID21845
Feature activation+7.22e-03

Pos logit contributions
 block+0.080
 spike+0.079
 nail+0.078
 pipe+0.078
 figure+0.076
Neg logit contributions
 journals-0.114
 parties-0.114
 chores-0.113
 Him-0.112
onies-0.106
Token and
Token ID290
Feature activation+2.00e-02

Pos logit contributions
 parties+0.020
 journals+0.020
 chores+0.020
 Him+0.019
onies+0.019
Neg logit contributions
 nail-0.004
 spike-0.004
 block-0.004
 pipe-0.004
 figure-0.004
Token waves
Token ID9813
Feature activation-2.84e-02

Pos logit contributions
 journals+0.037
 parties+0.037
 chores+0.037
 Him+0.037
onies+0.035
Neg logit contributions
 block-0.030
 spike-0.030
 pipe-0.029
 nail-0.029
 crocod-0.029
Token,
Token ID11
Feature activation-5.32e-02

Pos logit contributions
 spike+0.048
 nail+0.047
 pipe+0.046
 crocod+0.045
 figure+0.044
Neg logit contributions
 parties-0.100
 chores-0.100
 journals-0.095
 Him-0.094
onies-0.092
Token it
Token ID340
Feature activation-7.64e-02

Pos logit contributions
 block+0.064
 nail+0.061
 pipe+0.060
 figure+0.060
 spike+0.060
Neg logit contributions
 chores-0.112
 parties-0.111
 Him-0.110
 holidays-0.107
 pic-0.105
Token's
Token ID338
Feature activation-1.44e-03

Pos logit contributions
 nail+0.093
 block+0.091
 spike+0.091
 pipe+0.091
 crocod+0.089
Neg logit contributions
 chores-0.158
 parties-0.156
 journals-0.154
 Him-0.151
onies-0.150
Token a
Token ID257
Feature activation+3.70e-03

Pos logit contributions
 nail+0.002
 block+0.002
 spike+0.002
 pipe+0.002
 figure+0.002
Neg logit contributions
 chores-0.003
 parties-0.003
 journals-0.003
 Him-0.003
onies-0.003
Token sm
Token ID895
Feature activation-1.02e-01

Pos logit contributions
 chores+0.007
 parties+0.007
 journals+0.006
 holidays+0.006
 pic+0.006
Neg logit contributions
oke-0.005
 nail-0.005
 grun-0.005
 pipe-0.005
 spike-0.005
Tokenudge
Token ID12587
Feature activation-1.93e-01

Pos logit contributions
 spike+0.198
 nail+0.196
 figure+0.195
 pipe+0.191
 crocod+0.190
Neg logit contributions
 parties-0.149
 journals-0.138
 Him-0.135
affles-0.129
onies-0.128
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
 spike+0.205
 nail+0.201
 pipe+0.198
 grun+0.196
 crocod+0.195
Neg logit contributions
 parties-0.404
 chores-0.402
 journals-0.391
 Him-0.382
onies-0.377
Token It
Token ID632
Feature activation-8.08e-02

Pos logit contributions
 nail+0.037
 block+0.037
 spike+0.037
 pipe+0.036
 figure+0.036
Neg logit contributions
 Him-0.051
 parties-0.051
 journals-0.050
 chores-0.050
onies-0.047
Token looks
Token ID3073
Feature activation-3.91e-02

Pos logit contributions
 nail+0.103
 pipe+0.101
 block+0.101
 spike+0.100
 crocod+0.098
Neg logit contributions
 chores-0.162
 parties-0.159
 journals-0.157
 Him-0.155
onies-0.155
Token like
Token ID588
Feature activation-3.77e-02

Pos logit contributions
 nail+0.057
 spike+0.057
 block+0.056
 pipe+0.056
 figure+0.055
Neg logit contributions
 parties-0.074
 chores-0.074
 journals-0.073
 Him-0.071
 holidays-0.067
Token you
Token ID345
Feature activation-5.17e-03

Pos logit contributions
 nail+0.047
 spike+0.047
 block+0.047
 pipe+0.047
 crocod+0.046
Neg logit contributions
 journals-0.078
 parties-0.078
 chores-0.077
 Him-0.077
onies-0.073
Token just
Token ID655
Feature activation-1.08e-01

Pos logit contributions
 parties+0.031
 journals+0.031
 chores+0.031
 Him+0.031
onies+0.029
Neg logit contributions
 block-0.025
 nail-0.025
 spike-0.024
 pipe-0.024
 figure-0.024
Token an
Token ID281
Feature activation-3.27e-02

Pos logit contributions
 nail+0.143
 pipe+0.143
 spike+0.142
 block+0.142
 figure+0.138
Neg logit contributions
 parties-0.217
 chores-0.215
 journals-0.214
 Him-0.212
onies-0.201
Token old
Token ID1468
Feature activation-8.62e-02

Pos logit contributions
 pipe+0.048
 block+0.047
 spike+0.047
 nail+0.047
 crocod+0.046
Neg logit contributions
 parties-0.060
 chores-0.060
 journals-0.060
 Him-0.059
onies-0.056
Token tr
Token ID491
Feature activation-2.49e-02

Pos logit contributions
 grun+0.099
oke+0.097
 rectangle+0.091
 tipped+0.091
 pipe+0.091
Neg logit contributions
 chores-0.174
 parties-0.173
 ponies-0.169
 company-0.165
 journals-0.165
Tokenink
Token ID676
Feature activation-5.33e-02

Pos logit contributions
 pipe+0.041
 spike+0.040
 nail+0.040
 crocod+0.040
 figure+0.040
Neg logit contributions
 journals-0.042
 parties-0.042
 chores-0.041
 Him-0.041
onies-0.039
Tokenet
Token ID316
Feature activation-1.92e-01

Pos logit contributions
 spike+0.071
 nail+0.070
 pipe+0.068
 block+0.068
 crocod+0.067
Neg logit contributions
 parties-0.108
 chores-0.108
 journals-0.105
 Him-0.105
onies-0.102
Token to
Token ID284
Feature activation-3.91e-02

Pos logit contributions
 pipe+0.236
 nail+0.232
 spike+0.231
 crocod+0.228
 figure+0.225
Neg logit contributions
 parties-0.396
 chores-0.388
 Him-0.382
onies-0.380
 journals-0.375
Token me
Token ID502
Feature activation-3.57e-02

Pos logit contributions
 pipe+0.058
 spike+0.057
 block+0.057
 nail+0.056
 blob+0.055
Neg logit contributions
 parties-0.074
 chores-0.073
 Him-0.071
 journals-0.069
onies-0.066
Token."
Token ID526
Feature activation+4.90e-02

Pos logit contributions
 nail+0.051
 spike+0.050
 block+0.050
 pipe+0.050
 figure+0.049
Neg logit contributions
 parties-0.069
 journals-0.068
 chores-0.068
 Him-0.067
onies-0.064
Token Twe
Token ID24205
Feature activation-3.49e-02

Pos logit contributions
 parties+0.089
 chores+0.087
 Him+0.087
 journals+0.086
onies+0.081
Neg logit contributions
 pipe-0.074
 block-0.074
 nail-0.074
 spike-0.074
 figure-0.073
Tokenety
Token ID2963
Feature activation-6.66e-02

Pos logit contributions
 pipe+0.069
 block+0.069
 spike+0.069
 nail+0.069
 figure+0.068
Neg logit contributions
 journals-0.045
 parties-0.045
 Him-0.045
 chores-0.044
onies-0.043
Token and
Token ID290
Feature activation+2.80e-02

Pos logit contributions
 spike+0.050
 nail+0.048
 block+0.048
 pipe+0.048
 crocod+0.047
Neg logit contributions
 Him-0.080
 journals-0.080
 parties-0.080
 chores-0.080
 holidays-0.076
Token soon
Token ID2582
Feature activation+2.49e-02

Pos logit contributions
 parties+0.057
 journals+0.056
 Him+0.056
 chores+0.055
 married+0.053
Neg logit contributions
 block-0.036
 spike-0.035
 nail-0.035
 pipe-0.035
 crocod-0.034
Token the
Token ID262
Feature activation-6.18e-02

Pos logit contributions
 parties+0.060
 journals+0.058
 chores+0.058
 Him+0.057
 holidays+0.055
Neg logit contributions
 block-0.025
 nail-0.025
 spike-0.025
 pipe-0.025
 figure-0.023
Token f
Token ID277
Feature activation-1.12e-01

Pos logit contributions
 block+0.062
 grun+0.059
 nail+0.059
 rectangle+0.058
oke+0.057
Neg logit contributions
 chores-0.135
 holidays-0.123
 parties-0.121
 journals-0.117
 married-0.117
Tokenauc
Token ID14272
Feature activation-1.34e-01

Pos logit contributions
 snake+0.179
 nail+0.177
 spike+0.175
 hole+0.175
 branch+0.174
Neg logit contributions
affles-0.195
 parties-0.175
beh-0.172
onies-0.171
ouses-0.170
Tokenet
Token ID316
Feature activation-1.92e-01

Pos logit contributions
 nail+0.151
 figure+0.142
 spike+0.141
 block+0.133
 crocod+0.132
Neg logit contributions
 parties-0.296
affles-0.292
 chores-0.290
 journals-0.289
 Him-0.280
Token was
Token ID373
Feature activation-4.32e-02

Pos logit contributions
 nail+0.187
 spike+0.177
 snake+0.175
 hole+0.171
 figure+0.165
Neg logit contributions
 chores-0.426
 parties-0.416
 married-0.396
onies-0.396
 Him-0.392
Token as
Token ID355
Feature activation+7.43e-02

Pos logit contributions
 spike+0.066
 pipe+0.066
 nail+0.064
 crocod+0.064
 figure+0.063
Neg logit contributions
 chores-0.079
 parties-0.078
 married-0.074
 journals-0.073
 holidays-0.070
Token good
Token ID922
Feature activation+2.95e-02

Pos logit contributions
 chores+0.125
 parties+0.124
 journals+0.121
 Him+0.120
 holidays+0.112
Neg logit contributions
 spike-0.124
 block-0.122
 pipe-0.120
 nail-0.120
 figure-0.119
Token as
Token ID355
Feature activation+9.82e-02

Pos logit contributions
 parties+0.058
 chores+0.057
 journals+0.057
 Him+0.056
onies+0.053
Neg logit contributions
 block-0.042
 nail-0.041
 pipe-0.041
 spike-0.041
 figure-0.040
Token new
Token ID649
Feature activation+2.20e-02

Pos logit contributions
 parties+0.215
 journals+0.213
 chores+0.213
 Him+0.210
onies+0.198
Neg logit contributions
 block-0.113
 spike-0.113
 nail-0.112
 pipe-0.112
 figure-0.109
Token not
Token ID407
Feature activation-9.17e-02

Pos logit contributions
 figure+0.099
 spike+0.097
 pipe+0.097
 nail+0.096
 block+0.095
Neg logit contributions
 chores-0.114
 parties-0.104
 Him-0.104
 journals-0.103
onies-0.096
Token the
Token ID262
Feature activation-6.13e-02

Pos logit contributions
 spike+0.140
 pipe+0.139
 figure+0.138
 block+0.137
 nail+0.136
Neg logit contributions
 chores-0.167
 parties-0.163
 Him-0.159
 journals-0.159
onies-0.150
Token same
Token ID976
Feature activation-6.31e-02

Pos logit contributions
 pipe+0.096
 spike+0.095
 nail+0.095
 block+0.094
 figure+0.093
Neg logit contributions
 chores-0.111
 parties-0.106
 journals-0.103
 Him-0.102
 holidays-0.097
Token.
Token ID13
Feature activation-5.02e-02

Pos logit contributions
 pipe+0.076
 spike+0.076
 nail+0.074
 grun+0.072
 figure+0.071
Neg logit contributions
 chores-0.136
 parties-0.133
 journals-0.125
 Him-0.124
onies-0.120
Token Cere
Token ID26532
Feature activation-4.49e-02

Pos logit contributions
 block+0.055
 nail+0.055
 spike+0.055
 pipe+0.054
 figure+0.053
Neg logit contributions
 parties-0.113
 journals-0.112
 chores-0.111
 Him-0.110
onies-0.106
Tokenal
Token ID282
Feature activation-1.92e-01

Pos logit contributions
 nail+0.053
 spike+0.052
 pipe+0.052
 block+0.051
 crocod+0.050
Neg logit contributions
 journals-0.097
 parties-0.096
 Him-0.095
 chores-0.094
onies-0.089
Token does
Token ID857
Feature activation-5.85e-02

Pos logit contributions
 spike+0.226
 nail+0.225
 pipe+0.222
 block+0.221
 figure+0.215
Neg logit contributions
 chores-0.413
 parties-0.407
 Him-0.403
 journals-0.401
onies-0.390
Token not
Token ID407
Feature activation-7.02e-02

Pos logit contributions
 block+0.088
 spike+0.088
 pipe+0.088
 nail+0.087
 crocod+0.085
Neg logit contributions
 chores-0.109
 parties-0.107
 journals-0.105
 Him-0.105
onies-0.099
Token grow
Token ID1663
Feature activation-6.98e-02

Pos logit contributions
 spike+0.112
 pipe+0.111
 block+0.109
 nail+0.108
 rectangle+0.104
Neg logit contributions
 chores-0.125
 parties-0.124
 Him-0.119
 journals-0.113
onies-0.111
Token in
Token ID287
Feature activation-7.61e-02

Pos logit contributions
 block+0.082
 nail+0.081
 spike+0.081
 pipe+0.081
 crocod+0.079
Neg logit contributions
 journals-0.153
 parties-0.152
 chores-0.149
 Him-0.148
onies-0.142
Token the
Token ID262
Feature activation-3.82e-02

Pos logit contributions
 pipe+0.096
 spike+0.096
 block+0.096
 figure+0.095
 nail+0.095
Neg logit contributions
 chores-0.159
 parties-0.151
 journals-0.150
 Him-0.148
 ourselves-0.141
Token,
Token ID11
Feature activation-3.50e-02

Pos logit contributions
 spike+0.060
 nail+0.060
 pipe+0.058
 block+0.058
 crocod+0.053
Neg logit contributions
 parties-0.168
 chores-0.166
 Him-0.160
 journals-0.157
onies-0.157
Token car
Token ID1097
Feature activation-2.15e-02

Pos logit contributions
 block+0.044
 pipe+0.044
 spike+0.043
 grun+0.042
 nail+0.042
Neg logit contributions
 parties-0.070
 chores-0.068
 Him-0.067
 journals-0.063
 ponies-0.062
Token-
Token ID12
Feature activation-7.73e-02

Pos logit contributions
 nail+0.021
 spike+0.020
 pipe+0.020
 block+0.019
 figure+0.019
Neg logit contributions
 parties-0.052
 chores-0.050
 journals-0.049
 Him-0.049
onies-0.048
Tokendin
Token ID25194
Feature activation-4.29e-02

Pos logit contributions
 nail+0.123
 pipe+0.123
 block+0.122
 spike+0.122
 figure+0.120
Neg logit contributions
 parties-0.134
 journals-0.133
 chores-0.132
 Him-0.131
onies-0.124
Token-
Token ID12
Feature activation-1.27e-01

Pos logit contributions
 nail+0.063
 spike+0.063
 pipe+0.062
 block+0.061
 crocod+0.060
Neg logit contributions
 parties-0.081
 journals-0.081
 chores-0.079
 Him-0.079
onies-0.076
Tokenal
Token ID282
Feature activation-1.92e-01

Pos logit contributions
 nail+0.177
 pipe+0.176
 spike+0.175
 figure+0.171
 block+0.169
Neg logit contributions
 journals-0.243
 Him-0.242
 parties-0.240
 chores-0.235
onies-0.227
Token,
Token ID11
Feature activation-5.84e-02

Pos logit contributions
 nail+0.260
 spike+0.257
 pipe+0.253
 block+0.242
 figure+0.240
Neg logit contributions
 parties-0.385
 chores-0.370
 journals-0.368
 Him-0.365
onies-0.364
Token blue
Token ID4171
Feature activation-7.72e-02

Pos logit contributions
 pipe+0.070
 spike+0.069
 block+0.069
 nail+0.068
 grun+0.066
Neg logit contributions
 parties-0.119
 chores-0.119
 journals-0.117
 Him-0.117
 magazines-0.110
Token-
Token ID12
Feature activation-1.23e-01

Pos logit contributions
 nail+0.102
 spike+0.102
 pipe+0.099
 block+0.098
 grun+0.097
Neg logit contributions
 parties-0.158
 chores-0.154
 journals-0.154
 Him-0.150
onies-0.146
Tokenjay
Token ID33708
Feature activation-8.86e-02

Pos logit contributions
 pipe+0.166
 nail+0.165
 spike+0.163
 figure+0.161
 hole+0.156
Neg logit contributions
 journals-0.238
 Him-0.237
 parties-0.235
 chores-0.230
onies-0.223
Token,
Token ID11
Feature activation-5.38e-02

Pos logit contributions
 nail+0.101
 spike+0.100
 pipe+0.099
 block+0.097
 crocod+0.095
Neg logit contributions
 parties-0.194
 chores-0.190
 journals-0.189
 Him-0.187
onies-0.183
Token\n
Token ID198
Feature activation-5.54e-03

Pos logit contributions
 pipe+0.070
 nail+0.070
 spike+0.070
 figure+0.070
 blob+0.067
Neg logit contributions
 parties-0.101
 chores-0.100
 Him-0.100
 journals-0.097
 holidays-0.091
Token\n
Token ID198
Feature activation-8.70e-03

Pos logit contributions
 nail+0.008
 spike+0.008
 pipe+0.008
 figure+0.007
 blob+0.007
Neg logit contributions
 chores-0.011
 parties-0.010
 Him-0.010
 journals-0.010
onies-0.010
TokenBut
Token ID1537
Feature activation-1.13e-01

Pos logit contributions
 blob+0.016
 spike+0.016
 nail+0.015
 figure+0.015
 pipe+0.015
Neg logit contributions
First-0.013
 chores-0.012
Sleep-0.012
Hope-0.012
Bon-0.012
Token the
Token ID262
Feature activation-5.77e-02

Pos logit contributions
oke+0.194
 nail+0.183
 pipe+0.181
 spike+0.178
 blob+0.175
Neg logit contributions
 chores-0.168
 parties-0.159
 ourselves-0.153
 married-0.150
 Him-0.149
Token horn
Token ID12718
Feature activation-8.92e-02

Pos logit contributions
 grun+0.073
oke+0.069
 crocod+0.068
 block+0.067
 blob+0.065
Neg logit contributions
 journals-0.112
 chores-0.112
 parties-0.104
 company-0.104
 Him-0.103
Tokenet
Token ID316
Feature activation-1.92e-01

Pos logit contributions
 nail+0.137
 pipe+0.136
 spike+0.127
 figure+0.125
 blob+0.122
Neg logit contributions
 chores-0.151
 parties-0.141
 journals-0.141
onies-0.136
 Him-0.132
Token is
Token ID318
Feature activation-6.13e-02

Pos logit contributions
 nail+0.271
 pipe+0.265
 spike+0.255
 figure+0.253
 crocod+0.253
Neg logit contributions
 chores-0.365
 parties-0.355
 journals-0.343
onies-0.341
 Him-0.328
Token faster
Token ID5443
Feature activation-2.78e-02

Pos logit contributions
 pipe+0.100
 figure+0.099
 nail+0.098
 spike+0.096
 crocod+0.094
Neg logit contributions
 chores-0.103
 parties-0.092
 married-0.091
 unnecessary-0.090
 journals-0.088
Token.
Token ID13
Feature activation-5.11e-02

Pos logit contributions
 nail+0.043
 spike+0.042
 blob+0.041
 crocod+0.040
 pipe+0.040
Neg logit contributions
 parties-0.047
 chores-0.046
 journals-0.045
 ourselves-0.045
 company-0.042
Token It
Token ID632
Feature activation-8.64e-02

Pos logit contributions
 pipe+0.068
 figure+0.068
 nail+0.067
 spike+0.066
 blob+0.063
Neg logit contributions
 chores-0.099
 parties-0.097
 Him-0.097
 journals-0.094
onies-0.088
Token sees
Token ID7224
Feature activation-1.09e-01

Pos logit contributions
 pipe+0.134
 nail+0.131
 figure+0.124
 spike+0.122
 blob+0.119
Neg logit contributions
 chores-0.153
 parties-0.145
 journals-0.140
onies-0.133
 married-0.131
Tokenite
Token ID578
Feature activation+3.21e-02

Pos logit contributions
 figure+0.011
 pipe+0.011
 blob+0.010
 nail+0.010
 snake+0.010
Neg logit contributions
 parties-0.015
 chores-0.015
 holidays-0.015
 Him-0.014
 journals-0.014
Token?
Token ID30
Feature activation-4.77e-02

Pos logit contributions
 parties+0.070
 chores+0.067
 journals+0.066
 Him+0.065
onies+0.063
Neg logit contributions
 nail-0.038
 spike-0.037
 pipe-0.037
 crocod-0.036
 figure-0.036
Token It
Token ID632
Feature activation-7.85e-02

Pos logit contributions
 nail+0.066
 block+0.066
 spike+0.066
 pipe+0.066
 figure+0.064
Neg logit contributions
 parties-0.092
 journals-0.092
 chores-0.091
 Him-0.090
onies-0.086
Token is
Token ID318
Feature activation-4.80e-02

Pos logit contributions
 nail+0.099
 pipe+0.099
 spike+0.096
 crocod+0.095
 blob+0.092
Neg logit contributions
 chores-0.156
 parties-0.153
 journals-0.152
 Him-0.147
onies-0.146
Token a
Token ID257
Feature activation+9.59e-03

Pos logit contributions
 figure+0.073
 pipe+0.073
 nail+0.072
 spike+0.072
 crocod+0.070
Neg logit contributions
 chores-0.087
 parties-0.083
 journals-0.080
 Him-0.079
 married-0.078
Token crow
Token ID37593
Feature activation-1.92e-01

Pos logit contributions
 chores+0.017
 parties+0.016
 married+0.015
 journals+0.015
 holidays+0.015
Neg logit contributions
 grun-0.015
oke-0.014
 tipped-0.013
 spike-0.013
 blob-0.013
Token.
Token ID13
Feature activation-4.11e-02

Pos logit contributions
 nail+0.243
 spike+0.235
 pipe+0.233
 rectangle+0.226
 grun+0.220
Neg logit contributions
 Him-0.381
 chores-0.376
 parties-0.370
 company-0.363
onies-0.363
Token It
Token ID632
Feature activation-8.39e-02

Pos logit contributions
 nail+0.049
 pipe+0.049
 spike+0.049
 block+0.048
 figure+0.047
Neg logit contributions
 Him-0.088
 journals-0.086
 parties-0.086
 chores-0.085
onies-0.082
Token is
Token ID318
Feature activation-5.59e-02

Pos logit contributions
 nail+0.116
 pipe+0.115
 spike+0.110
 figure+0.108
 block+0.107
Neg logit contributions
 chores-0.159
 parties-0.156
 journals-0.155
 Him-0.154
onies-0.151
Token my
Token ID616
Feature activation-1.83e-02

Pos logit contributions
 nail+0.085
 pipe+0.084
 figure+0.083
 spike+0.082
 block+0.081
Neg logit contributions
 chores-0.100
 parties-0.094
 Him-0.093
 married-0.093
 journals-0.089
Token favorite
Token ID4004
Feature activation+1.85e-02

Pos logit contributions
 nail+0.023
 pipe+0.023
 spike+0.023
 block+0.023
 figure+0.023
Neg logit contributions
 chores-0.037
 parties-0.036
 journals-0.036
 Him-0.035
onies-0.034
Token never
Token ID1239
Feature activation-6.59e-02

Pos logit contributions
 nail+0.085
 pipe+0.081
 spike+0.080
 block+0.080
 figure+0.075
Neg logit contributions
 parties-0.250
 chores-0.245
 journals-0.245
 Him-0.238
onies-0.237
Token forgot
Token ID16453
Feature activation-5.69e-03

Pos logit contributions
 nail+0.089
 spike+0.087
 pipe+0.086
 block+0.084
 snake+0.083
Neg logit contributions
 parties-0.133
 married-0.125
 hospitality-0.123
 journals-0.120
 Him-0.118
Token the
Token ID262
Feature activation-2.28e-02

Pos logit contributions
 nail+0.007
 spike+0.007
 block+0.007
 pipe+0.007
 figure+0.007
Neg logit contributions
 parties-0.013
 chores-0.012
 journals-0.012
 Him-0.012
 holidays-0.011
Token fast
Token ID3049
Feature activation-3.77e-02

Pos logit contributions
 nail+0.034
 rectangle+0.034
 pipe+0.033
 block+0.033
 figure+0.032
Neg logit contributions
 chores-0.042
 parties-0.039
 journals-0.037
 hospitality-0.036
 holidays-0.036
Token che
Token ID1125
Feature activation-1.34e-01

Pos logit contributions
 nail+0.040
 pipe+0.040
 socket+0.039
 compartment+0.039
 rectangle+0.038
Neg logit contributions
 parties-0.076
 chores-0.074
 ponies-0.071
 holidays-0.070
 hospitality-0.070
Tokenet
Token ID316
Feature activation-1.91e-01

Pos logit contributions
 pipe+0.163
 figure+0.158
 nail+0.155
 snake+0.150
 crocod+0.148
Neg logit contributions
 chores-0.279
affles-0.275
 parties-0.271
onies-0.265
 journals-0.258
Tokenah
Token ID993
Feature activation-9.88e-02

Pos logit contributions
 hole+0.231
 nail+0.227
 pipe+0.224
 rectangle+0.212
 gap+0.208
Neg logit contributions
 chores-0.376
onies-0.370
 parties-0.370
 Him-0.365
 journals-0.347
Token he
Token ID339
Feature activation-3.12e-02

Pos logit contributions
 nail+0.143
 pipe+0.142
 spike+0.138
 block+0.133
 snake+0.132
Neg logit contributions
 Him-0.181
 chores-0.180
 parties-0.180
onies-0.179
 journals-0.176
Token had
Token ID550
Feature activation-3.55e-02

Pos logit contributions
 block+0.033
 nail+0.033
 spike+0.033
 pipe+0.033
 figure+0.032
Neg logit contributions
 parties-0.071
 journals-0.071
 chores-0.070
 Him-0.070
onies-0.067
Token seen
Token ID1775
Feature activation-6.92e-02

Pos logit contributions
 nail+0.044
 spike+0.043
 pipe+0.043
 block+0.043
 crocod+0.041
Neg logit contributions
 parties-0.076
 journals-0.075
 chores-0.075
 Him-0.073
onies-0.070
Token.
Token ID13
Feature activation-2.97e-02

Pos logit contributions
 nail+0.088
 pipe+0.088
 spike+0.086
 crocod+0.082
 rectangle+0.081
Neg logit contributions
 parties-0.149
 chores-0.140
 journals-0.139
 Him-0.136
 holidays-0.132
Token,
Token ID11
Feature activation-5.40e-02

Pos logit contributions
 block+0.010
 spike+0.010
 pipe+0.009
 blob+0.009
 nail+0.009
Neg logit contributions
 chores-0.025
 parties-0.025
 Him-0.024
 journals-0.023
 holidays-0.023
Token the
Token ID262
Feature activation-6.31e-02

Pos logit contributions
 block+0.088
 spike+0.086
 pipe+0.086
 nail+0.085
 snake+0.082
Neg logit contributions
 parties-0.095
 chores-0.094
 Him-0.089
 journals-0.089
 holidays-0.084
Token old
Token ID1468
Feature activation-9.87e-02

Pos logit contributions
 block+0.067
 grun+0.064
 branch+0.062
 nail+0.061
 rectangle+0.061
Neg logit contributions
 chores-0.136
 holidays-0.119
 parties-0.119
 married-0.115
affles-0.111
Token f
Token ID277
Feature activation-1.12e-01

Pos logit contributions
 bolt+0.117
 block+0.102
 dot+0.100
 dent+0.099
 snake+0.099
Neg logit contributions
 chores-0.182
 parties-0.161
 holidays-0.160
 outfits-0.158
 married-0.158
Tokenauc
Token ID14272
Feature activation-1.32e-01

Pos logit contributions
 branch+0.198
 snake+0.191
 hole+0.189
 spike+0.188
 nail+0.186
Neg logit contributions
affles-0.184
ouses-0.158
 parties-0.154
beh-0.152
onies-0.152
Tokenet
Token ID316
Feature activation-1.91e-01

Pos logit contributions
 nail+0.144
 figure+0.136
 spike+0.135
 block+0.135
 crocod+0.130
Neg logit contributions
 parties-0.295
affles-0.290
 chores-0.290
 journals-0.287
onies-0.276
Token broke
Token ID6265
Feature activation-1.09e-01

Pos logit contributions
 snake+0.212
 hole+0.205
 nail+0.205
 gap+0.200
 spike+0.194
Neg logit contributions
 chores-0.353
cases-0.339
 married-0.338
 parties-0.336
affles-0.328
Token and
Token ID290
Feature activation+3.32e-02

Pos logit contributions
 crocod+0.138
oke+0.128
 spike+0.127
 snake+0.126
 crow+0.124
Neg logit contributions
 parties-0.203
 chores-0.201
 ourselves-0.200
 journals-0.192
 married-0.186
Token water
Token ID1660
Feature activation-3.46e-02

Pos logit contributions
 parties+0.063
 chores+0.062
 journals+0.060
 married+0.059
 Him+0.058
Neg logit contributions
 block-0.044
 spike-0.042
 pipe-0.041
 nail-0.041
 bulldo-0.041
Token started
Token ID2067
Feature activation+2.03e-02

Pos logit contributions
 nail+0.043
 spike+0.043
 block+0.043
 rectangle+0.043
 figure+0.041
Neg logit contributions
 chores-0.066
 parties-0.065
ics-0.062
 married-0.059
onies-0.057
Token spilling
Token ID45085
Feature activation-5.07e-02

Pos logit contributions
 chores+0.038
 parties+0.036
 married+0.033
 journals+0.033
 holidays+0.032
Neg logit contributions
 spike-0.028
 figure-0.027
 block-0.027
oke-0.027
 snake-0.027
Token guess
Token ID4724
Feature activation-1.30e-02

Pos logit contributions
 spike+0.084
 snake+0.084
 pipe+0.083
 nail+0.082
 crocod+0.081
Neg logit contributions
 parties-0.099
 chores-0.098
onies-0.091
 married-0.090
 journals-0.089
Token it
Token ID340
Feature activation-6.46e-02

Pos logit contributions
 pipe+0.016
 nail+0.016
 spike+0.016
 block+0.016
 figure+0.015
Neg logit contributions
 parties-0.027
 chores-0.027
 journals-0.027
 Him-0.026
onies-0.025
Token's
Token ID338
Feature activation+2.36e-03

Pos logit contributions
 nail+0.075
 pipe+0.073
 spike+0.073
 block+0.072
 crocod+0.071
Neg logit contributions
 parties-0.140
 chores-0.138
 journals-0.136
 Him-0.135
onies-0.133
Token a
Token ID257
Feature activation+6.00e-03

Pos logit contributions
 chores+0.004
 parties+0.004
 journals+0.004
 Him+0.004
 married+0.004
Neg logit contributions
 nail-0.004
 pipe-0.004
 figure-0.004
 spike-0.004
 block-0.004
Token horn
Token ID12718
Feature activation-9.68e-02

Pos logit contributions
 chores+0.011
 journals+0.010
 parties+0.010
 holidays+0.010
 married+0.010
Neg logit contributions
oke-0.007
 grun-0.007
 pipe-0.007
 nail-0.007
 tipped-0.007
Tokenet
Token ID316
Feature activation-1.90e-01

Pos logit contributions
 nail+0.137
 spike+0.132
 pipe+0.132
 grun+0.127
 crocod+0.121
Neg logit contributions
 parties-0.182
 chores-0.174
 Him-0.170
 married-0.168
 journals-0.167
Token's
Token ID338
Feature activation-1.23e-02

Pos logit contributions
 nail+0.257
 pipe+0.249
 spike+0.243
 crocod+0.236
 hole+0.232
Neg logit contributions
 parties-0.372
 chores-0.364
onies-0.356
 Him-0.355
 journals-0.344
Token nest
Token ID16343
Feature activation-7.30e-02

Pos logit contributions
 nail+0.017
 pipe+0.015
 blob+0.015
 spike+0.015
 block+0.015
Neg logit contributions
 chores-0.023
 Him-0.023
 parties-0.022
 journals-0.022
 married-0.022
Token!
Token ID0
Feature activation-6.04e-02

Pos logit contributions
 spike+0.093
 nail+0.092
 pipe+0.089
 figure+0.088
 crocod+0.088
Neg logit contributions
 parties-0.153
 chores-0.150
 Him-0.145
 journals-0.144
onies-0.141
Token We
Token ID775
Feature activation-5.22e-03

Pos logit contributions
 nail+0.085
 pipe+0.084
 spike+0.083
 block+0.081
 crocod+0.080
Neg logit contributions
 Him-0.115
 journals-0.111
 parties-0.111
 chores-0.109
onies-0.106
Token shouldn
Token ID6584
Feature activation-4.02e-02

Pos logit contributions
 spike+0.008
 pipe+0.008
 nail+0.008
 block+0.008
 crocod+0.008
Neg logit contributions
 parties-0.009
 chores-0.009
 journals-0.009
 Him-0.009
onies-0.009
Token face
Token ID1986
Feature activation-1.21e-01

Pos logit contributions
 journals+0.078
 parties+0.078
 chores+0.078
 Him+0.075
onies+0.072
Neg logit contributions
 nail-0.046
 spike-0.046
 pipe-0.046
 figure-0.045
 block-0.045
Token.
Token ID13
Feature activation-4.29e-02

Pos logit contributions
 spike+0.137
 nail+0.135
 pipe+0.133
 crocod+0.129
 figure+0.128
Neg logit contributions
 parties-0.265
 chores-0.264
 journals-0.256
 Him-0.253
onies-0.248
Token Tom
Token ID4186
Feature activation-5.25e-02

Pos logit contributions
 nail+0.059
 figure+0.059
 pipe+0.058
 spike+0.056
 crocod+0.054
Neg logit contributions
 parties-0.084
 chores-0.083
 journals-0.082
 Him-0.082
onies-0.078
Token feels
Token ID5300
Feature activation-3.36e-02

Pos logit contributions
 nail+0.070
 pipe+0.070
 spike+0.066
 figure+0.066
 crocod+0.066
Neg logit contributions
 parties-0.103
 chores-0.102
 journals-0.101
onies-0.097
 Him-0.096
Token a
Token ID257
Feature activation+1.99e-03

Pos logit contributions
 pipe+0.048
 nail+0.048
 block+0.048
 spike+0.048
 crocod+0.046
Neg logit contributions
 chores-0.064
 parties-0.064
 journals-0.063
 Him-0.062
onies-0.059
Token sting
Token ID24276
Feature activation-1.90e-01

Pos logit contributions
 chores+0.004
 journals+0.004
 Him+0.004
 parties+0.004
Sleep+0.004
Neg logit contributions
 pipe-0.003
 block-0.003
 spike-0.002
 nail-0.002
 crocod-0.002
Token in
Token ID287
Feature activation-9.50e-02

Pos logit contributions
 nail+0.266
 spike+0.265
 rectangle+0.250
 pipe+0.246
 crocod+0.245
Neg logit contributions
 parties-0.329
Sleep-0.327
 company-0.320
 Him-0.316
 married-0.312
Token his
Token ID465
Feature activation+1.21e-02

Pos logit contributions
 pipe+0.123
 nail+0.123
 block+0.121
 spike+0.118
 figure+0.116
Neg logit contributions
 chores-0.194
 journals-0.189
 parties-0.185
 Him-0.181
 holidays-0.173
Token eyes
Token ID2951
Feature activation-3.65e-02

Pos logit contributions
 journals+0.029
 chores+0.029
 parties+0.028
 Him+0.028
onies+0.027
Neg logit contributions
 nail-0.012
 spike-0.011
 pipe-0.011
 block-0.011
 figure-0.011
Token and
Token ID290
Feature activation+1.78e-03

Pos logit contributions
 spike+0.043
 pipe+0.042
 nail+0.042
 crocod+0.041
 block+0.038
Neg logit contributions
 chores-0.078
 parties-0.077
 Him-0.073
onies-0.072
 journals-0.072
Token a
Token ID257
Feature activation+3.13e-04

Pos logit contributions
 chores+0.003
 journals+0.003
 parties+0.003
 Him+0.003
 amen+0.003
Neg logit contributions
 nail-0.002
 pipe-0.002
 block-0.002
 spike-0.002
 figure-0.002
Tokenak
Token ID461
Feature activation-1.08e-01

Pos logit contributions
 spike+0.093
 nail+0.091
 pipe+0.089
 figure+0.088
 snake+0.087
Neg logit contributions
 parties-0.125
 journals-0.122
 chores-0.122
onies-0.121
 Him-0.118
Token.
Token ID13
Feature activation-3.04e-02

Pos logit contributions
 nail+0.136
 spike+0.134
 pipe+0.127
 grun+0.126
 rectangle+0.126
Neg logit contributions
 parties-0.211
 chores-0.208
onies-0.201
 journals-0.195
ics-0.192
Token It
Token ID632
Feature activation-8.47e-02

Pos logit contributions
 block+0.033
 nail+0.033
 spike+0.033
 pipe+0.033
 crocod+0.032
Neg logit contributions
 parties-0.069
 journals-0.069
 chores-0.068
 Him-0.067
onies-0.065
Token was
Token ID373
Feature activation-4.13e-02

Pos logit contributions
 pipe+0.120
 nail+0.115
 block+0.108
 rectangle+0.107
 tool+0.106
Neg logit contributions
 married-0.144
Hope-0.138
 parties-0.128
onies-0.128
 journals-0.127
Token a
Token ID257
Feature activation+2.52e-03

Pos logit contributions
 pipe+0.064
 nail+0.064
 block+0.063
 figure+0.062
 spike+0.061
Neg logit contributions
 chores-0.070
 parties-0.066
 Him-0.066
 married-0.066
 journals-0.061
Token crow
Token ID37593
Feature activation-1.90e-01

Pos logit contributions
 married+0.004
 chores+0.004
 parties+0.004
 journals+0.004
 pic+0.004
Neg logit contributions
oke-0.003
 grun-0.003
er-0.003
crow-0.003
 tipped-0.003
Token.
Token ID13
Feature activation-3.89e-02

Pos logit contributions
 nail+0.244
 rectangle+0.240
 pipe+0.232
 spike+0.229
 grun+0.226
Neg logit contributions
 parties-0.342
 company-0.332
 chores-0.332
onies-0.329
 married-0.324
Token Lily
Token ID20037
Feature activation-2.63e-02

Pos logit contributions
 spike+0.045
 pipe+0.045
 nail+0.045
 block+0.045
 figure+0.043
Neg logit contributions
 parties-0.085
 journals-0.085
 chores-0.084
 Him-0.084
onies-0.080
Token did
Token ID750
Feature activation-3.47e-02

Pos logit contributions
 pipe+0.031
 nail+0.031
 bolt+0.030
 crocod+0.030
 rectangle+0.030
Neg logit contributions
onies-0.050
 parties-0.050
 journals-0.048
Hope-0.047
 married-0.047
Token not
Token ID407
Feature activation-6.66e-02

Pos logit contributions
 pipe+0.054
 nail+0.053
 block+0.053
 spike+0.053
 crocod+0.052
Neg logit contributions
 chores-0.063
 parties-0.063
 journals-0.061
 Him-0.061
onies-0.058
Token like
Token ID588
Feature activation-3.25e-02

Pos logit contributions
 spike+0.112
 pipe+0.111
 nail+0.108
 block+0.108
 crocod+0.105
Neg logit contributions
 parties-0.116
 chores-0.111
 Him-0.110
 journals-0.107
onies-0.105
Tokenite
Token ID578
Feature activation-4.58e-02

Pos logit contributions
 pipe+0.019
 figure+0.019
 nail+0.019
 spike+0.018
 grun+0.018
Neg logit contributions
 parties-0.019
 journals-0.019
 chores-0.019
 Him-0.018
onies-0.018
Token.
Token ID13
Feature activation-3.84e-02

Pos logit contributions
 block+0.037
 nail+0.036
 spike+0.036
 pipe+0.035
 figure+0.034
Neg logit contributions
 journals-0.117
 parties-0.116
 Him-0.116
 chores-0.115
onies-0.111
Token It
Token ID632
Feature activation-7.70e-02

Pos logit contributions
 nail+0.042
 pipe+0.042
 spike+0.042
 block+0.041
 figure+0.040
Neg logit contributions
 parties-0.086
 journals-0.086
 Him-0.085
 chores-0.084
onies-0.081
Token is
Token ID318
Feature activation-5.59e-02

Pos logit contributions
 nail+0.086
 pipe+0.085
 spike+0.083
 block+0.082
 figure+0.081
Neg logit contributions
 parties-0.170
 chores-0.168
 journals-0.168
 Him-0.165
onies-0.161
Token a
Token ID257
Feature activation+5.55e-03

Pos logit contributions
 nail+0.088
 pipe+0.088
 figure+0.088
 spike+0.086
 block+0.085
Neg logit contributions
 parties-0.099
 chores-0.097
 journals-0.096
 Him-0.094
onies-0.089
Token crow
Token ID37593
Feature activation-1.90e-01

Pos logit contributions
 parties+0.009
 chores+0.009
 married+0.009
 journals+0.009
 holidays+0.009
Neg logit contributions
 grun-0.009
oke-0.008
 nail-0.008
 pipe-0.008
 tipped-0.008
Token.
Token ID13
Feature activation-5.41e-02

Pos logit contributions
 nail+0.223
 spike+0.217
 pipe+0.214
 grun+0.203
 block+0.200
Neg logit contributions
 parties-0.416
 chores-0.401
 Him-0.397
 journals-0.392
onies-0.386
Token It
Token ID632
Feature activation-8.82e-02

Pos logit contributions
 nail+0.058
 pipe+0.057
 spike+0.057
 block+0.057
 figure+0.055
Neg logit contributions
 Him-0.122
 journals-0.122
 parties-0.122
 chores-0.120
onies-0.116
Token is
Token ID318
Feature activation-6.31e-02

Pos logit contributions
 nail+0.111
 pipe+0.110
 spike+0.106
 figure+0.105
 block+0.104
Neg logit contributions
 chores-0.177
 parties-0.177
 Him-0.174
 journals-0.174
onies-0.171
Token a
Token ID257
Feature activation-1.08e-03

Pos logit contributions
 nail+0.100
 figure+0.098
 pipe+0.098
 spike+0.096
 block+0.096
Neg logit contributions
 chores-0.108
 parties-0.107
 Him-0.106
 journals-0.103
 married-0.102
Token mysterious
Token ID11428
Feature activation-8.82e-02

Pos logit contributions
 grun+0.002
oke+0.001
 tipped+0.001
 block+0.001
 nail+0.001
Neg logit contributions
 chores-0.002
 married-0.002
 parties-0.002
 company-0.002
 journals-0.002
Token nose
Token ID9686
Feature activation-3.56e-02

Pos logit contributions
 chores+0.099
 journals+0.098
 Him+0.097
 parties+0.097
onies+0.093
Neg logit contributions
 nail-0.057
 spike-0.056
 pipe-0.055
 block-0.055
 figure-0.054
Token!"
Token ID2474
Feature activation+2.26e-02

Pos logit contributions
 nail+0.049
 pipe+0.049
 spike+0.048
 figure+0.048
 crocod+0.047
Neg logit contributions
 parties-0.067
 chores-0.066
onies-0.064
 journals-0.063
 Him-0.062
Token\n
Token ID198
Feature activation+2.70e-03

Pos logit contributions
 Him+0.032
 parties+0.032
 chores+0.032
 amen+0.029
Sleep+0.029
Neg logit contributions
 nail-0.041
 pipe-0.041
 spike-0.041
 blob-0.040
 figure-0.039
Token\n
Token ID198
Feature activation-2.60e-03

Pos logit contributions
 Him+0.005
 chores+0.005
 parties+0.005
 journals+0.005
Sleep+0.004
Neg logit contributions
 nail-0.004
 spike-0.004
 pipe-0.004
 blob-0.004
 crocod-0.004
TokenThe
Token ID464
Feature activation-4.55e-02

Pos logit contributions
 spike+0.004
 nail+0.004
 pipe+0.004
 blob+0.004
 crocod+0.004
Neg logit contributions
 journals-0.004
 chores-0.004
Bon-0.004
 Him-0.004
Sleep-0.004
Token crow
Token ID37593
Feature activation-1.89e-01

Pos logit contributions
 rectangle+0.044
 grun+0.040
 Jill+0.038
oke+0.037
 compartment+0.037
Neg logit contributions
 chores-0.100
 Him-0.092
 ponies-0.091
onies-0.089
 amen-0.089
Token c
Token ID269
Feature activation-3.27e-02

Pos logit contributions
 pipe+0.275
 nail+0.266
 rectangle+0.248
 brick+0.243
 spike+0.235
Neg logit contributions
 amen-0.293
onies-0.288
 chores-0.278
 married-0.274
 journals-0.270
Tokenawed
Token ID36825
Feature activation-9.07e-02

Pos logit contributions
 nail+0.057
 block+0.056
 pipe+0.056
 spike+0.056
 figure+0.056
Neg logit contributions
 parties-0.053
 journals-0.053
 Him-0.052
 chores-0.052
onies-0.050
Token and
Token ID290
Feature activation+1.37e-02

Pos logit contributions
 pipe+0.124
 nail+0.124
 spike+0.120
 figure+0.114
 block+0.113
Neg logit contributions
 Him-0.174
 journals-0.171
 chores-0.166
 parties-0.155
 unnecessary-0.155
Token flew
Token ID13112
Feature activation-2.39e-03

Pos logit contributions
 parties+0.025
 amen+0.024
 chores+0.024
Mid+0.024
 married+0.023
Neg logit contributions
 pipe-0.019
 nail-0.018
 block-0.018
 figure-0.017
 spike-0.016
Token away
Token ID1497
Feature activation-5.36e-02

Pos logit contributions
 nail+0.005
 pipe+0.004
 spike+0.004
 figure+0.004
 blob+0.004
Neg logit contributions
 parties-0.003
 chores-0.003
 Him-0.003
 journals-0.003
 married-0.003
Token soon
Token ID2582
Feature activation+2.68e-02

Pos logit contributions
 parties+0.045
 Him+0.044
 journals+0.044
 chores+0.044
 holidays+0.040
Neg logit contributions
 block-0.036
 spike-0.036
 pipe-0.035
 nail-0.035
 blob-0.034
Token they
Token ID484
Feature activation+2.15e-02

Pos logit contributions
 journals+0.060
 parties+0.059
 chores+0.059
 Him+0.059
onies+0.056
Neg logit contributions
 nail-0.030
 block-0.030
 spike-0.030
 pipe-0.029
 figure-0.029
Token were
Token ID547
Feature activation+1.36e-02

Pos logit contributions
 parties+0.040
 journals+0.039
 Him+0.038
 chores+0.038
 married+0.036
Neg logit contributions
 nail-0.033
 spike-0.033
 pipe-0.033
 block-0.033
 crocod-0.031
Token all
Token ID477
Feature activation-6.96e-04

Pos logit contributions
 parties+0.025
 journals+0.025
 chores+0.025
 Him+0.024
onies+0.023
Neg logit contributions
 nail-0.021
 pipe-0.021
 spike-0.021
 block-0.020
 figure-0.020
Token playing
Token ID2712
Feature activation+7.24e-02

Pos logit contributions
 nail+0.001
 spike+0.001
 pipe+0.001
 block+0.001
 figure+0.001
Neg logit contributions
 parties-0.001
 journals-0.001
 chores-0.001
 Him-0.001
 married-0.001
Token together
Token ID1978
Feature activation+1.35e-01

Pos logit contributions
 parties+0.131
 journals+0.127
 chores+0.126
 Him+0.121
 holidays+0.116
Neg logit contributions
 spike-0.115
 nail-0.115
 pipe-0.113
 block-0.112
 crocod-0.110
Token.
Token ID13
Feature activation-8.55e-03

Pos logit contributions
 parties+0.314
 chores+0.301
 journals+0.301
 Him+0.291
 holidays+0.283
Neg logit contributions
 spike-0.147
 pipe-0.146
 nail-0.146
 block-0.140
 crocod-0.139
Token The
Token ID383
Feature activation-2.99e-02

Pos logit contributions
 block+0.010
 spike+0.010
 nail+0.010
 pipe+0.010
 figure+0.010
Neg logit contributions
 journals-0.019
 chores-0.018
 parties-0.018
 Him-0.018
onies-0.017
Token family
Token ID1641
Feature activation+4.36e-02

Pos logit contributions
 block+0.038
 spike+0.038
 nail+0.038
 pipe+0.038
 figure+0.036
Neg logit contributions
 parties-0.062
 journals-0.062
 chores-0.062
 Him-0.061
onies-0.058
Token spent
Token ID3377
Feature activation+7.58e-02

Pos logit contributions
 parties+0.097
 married+0.094
 journals+0.092
 holidays+0.087
 chores+0.086
Neg logit contributions
 nail-0.050
 spike-0.048
 pipe-0.048
 crocod-0.045
 blob-0.044
Token the
Token ID262
Feature activation+1.25e-02

Pos logit contributions
 parties+0.171
 journals+0.170
 chores+0.168
 Him+0.167
onies+0.160
Neg logit contributions
 nail-0.083
 block-0.083
 spike-0.083
 pipe-0.082
 figure-0.080
Token could
Token ID714
Feature activation+6.54e-03

Pos logit contributions
 parties+0.002
 journals+0.002
 chores+0.002
 Him+0.002
onies+0.002
Neg logit contributions
 block-0.000
 nail-0.000
 spike-0.000
 pipe-0.000
 figure-0.000
Token find
Token ID1064
Feature activation-4.38e-02

Pos logit contributions
 parties+0.013
 chores+0.013
 journals+0.012
 Him+0.012
onies+0.012
Neg logit contributions
 spike-0.009
 block-0.009
 nail-0.009
 pipe-0.009
 crocod-0.009
Token.
Token ID13
Feature activation+2.38e-03

Pos logit contributions
 spike+0.044
 nail+0.044
 block+0.042
 pipe+0.042
 blob+0.041
Neg logit contributions
 parties-0.104
 journals-0.102
 chores-0.102
 Him-0.101
onies-0.097
Token\n
Token ID198
Feature activation+2.63e-02

Pos logit contributions
 chores+0.004
 parties+0.004
 journals+0.004
 Him+0.004
 holidays+0.004
Neg logit contributions
 nail-0.004
 pipe-0.004
 spike-0.004
 blob-0.004
 figure-0.004
Token\n
Token ID198
Feature activation+2.22e-02

Pos logit contributions
 parties+0.053
 chores+0.052
 Him+0.051
 journals+0.051
onies+0.049
Neg logit contributions
 nail-0.035
 pipe-0.035
 spike-0.034
 crocod-0.034
 block-0.033
TokenAs
Token ID1722
Feature activation+1.67e-01

Pos logit contributions
 chores+0.023
 parties+0.022
 journals+0.022
 Him+0.021
 holidays+0.020
Neg logit contributions
 nail-0.050
 pipe-0.050
 spike-0.050
 blob-0.050
 crocod-0.050
Token he
Token ID339
Feature activation-1.88e-03

Pos logit contributions
 parties+0.350
 chores+0.344
 Him+0.338
 holidays+0.331
 journals+0.331
Neg logit contributions
 nail-0.214
 spike-0.208
 pipe-0.206
 figure-0.203
 block-0.202
Token walked
Token ID6807
Feature activation+4.45e-02

Pos logit contributions
 nail+0.002
 pipe+0.002
 spike+0.002
 block+0.002
 snake+0.002
Neg logit contributions
 parties-0.004
 chores-0.004
 journals-0.004
 Him-0.004
 holidays-0.004
Token he
Token ID339
Feature activation+3.15e-03

Pos logit contributions
 parties+0.096
 chores+0.091
 Him+0.090
 holidays+0.087
 tours+0.086
Neg logit contributions
 blob-0.058
 nail-0.056
 spike-0.054
 rectangle-0.051
 block-0.050
Token saw
Token ID2497
Feature activation-8.22e-02

Pos logit contributions
 parties+0.008
 chores+0.007
 Him+0.007
 journals+0.007
 married+0.007
Neg logit contributions
 nail-0.003
 spike-0.003
 snake-0.003
 blob-0.003
 block-0.003
Token a
Token ID257
Feature activation+4.39e-02

Pos logit contributions
 nail+0.138
 blob+0.138
 figure+0.137
 rectangle+0.136
 spike+0.135
Neg logit contributions
 parties-0.134
 chores-0.134
 journals-0.129
 Him-0.126
 holidays-0.120
Token they
Token ID484
Feature activation+4.30e-02

Pos logit contributions
 parties+0.103
 chores+0.102
 journals+0.101
 Him+0.099
 holidays+0.095
Neg logit contributions
 nail-0.057
 block-0.056
 spike-0.055
 pipe-0.054
 figure-0.054
Token said
Token ID531
Feature activation+7.16e-03

Pos logit contributions
 parties+0.099
 journals+0.097
 chores+0.096
 Him+0.095
onies+0.091
Neg logit contributions
 block-0.046
 nail-0.046
 spike-0.046
 pipe-0.046
 crocod-0.044
Token.
Token ID13
Feature activation+6.80e-03

Pos logit contributions
 parties+0.017
 chores+0.017
 journals+0.017
 Him+0.017
onies+0.016
Neg logit contributions
 spike-0.007
 nail-0.007
 pipe-0.007
 block-0.007
 crocod-0.007
Token\n
Token ID198
Feature activation+2.89e-02

Pos logit contributions
 parties+0.013
 chores+0.012
 journals+0.012
 Him+0.012
 holidays+0.012
Neg logit contributions
 nail-0.010
 figure-0.010
 spike-0.010
 pipe-0.010
 block-0.010
Token\n
Token ID198
Feature activation+2.27e-02

Pos logit contributions
 parties+0.065
 journals+0.064
 chores+0.064
 Him+0.063
onies+0.061
Neg logit contributions
 block-0.032
 nail-0.032
 spike-0.032
 pipe-0.031
 figure-0.030
TokenAs
Token ID1722
Feature activation+1.65e-01

Pos logit contributions
 parties+0.036
 chores+0.035
 journals+0.035
 Him+0.034
onies+0.032
Neg logit contributions
 nail-0.040
 spike-0.040
 block-0.040
 pipe-0.040
 figure-0.039
Token they
Token ID484
Feature activation+4.08e-02

Pos logit contributions
 chores+0.258
 parties+0.257
 holidays+0.237
 pic+0.230
 journals+0.227
Neg logit contributions
 figure-0.289
 nail-0.281
 blob-0.280
 block-0.276
 spike-0.275
Token played
Token ID2826
Feature activation+9.77e-02

Pos logit contributions
 parties+0.070
 journals+0.069
 chores+0.068
 Him+0.067
onies+0.063
Neg logit contributions
 nail-0.067
 block-0.067
 pipe-0.067
 spike-0.067
 figure-0.065
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
 parties+0.241
 chores+0.236
 journals+0.223
 Him+0.212
 holidays+0.210
Neg logit contributions
 spike-0.098
 nail-0.095
 grun-0.095
 crocod-0.093
 block-0.092
Token Tim
Token ID5045
Feature activation-3.10e-02

Pos logit contributions
 figure+0.019
 nail+0.019
 pipe+0.019
 block+0.019
 spike+0.018
Neg logit contributions
 chores-0.034
 parties-0.034
 journals-0.033
 Him-0.032
 holidays-0.032
Tokenmy
Token ID1820
Feature activation+7.16e-03

Pos logit contributions
 spike+0.048
 nail+0.048
 crocod+0.048
 pipe+0.047
 block+0.047
Neg logit contributions
 parties-0.055
 journals-0.054
 chores-0.053
 Him-0.051
onies-0.050
Token.
Token ID13
Feature activation+7.97e-03

Pos logit contributions
 journals+0.031
 parties+0.031
 Him+0.031
 chores+0.031
onies+0.029
Neg logit contributions
 spike-0.014
 nail-0.014
 pipe-0.014
 crocod-0.014
 block-0.013
Token Lily
Token ID20037
Feature activation+5.17e-03

Pos logit contributions
 journals+0.015
 chores+0.015
 Him+0.015
 parties+0.015
onies+0.014
Neg logit contributions
 block-0.012
 nail-0.011
 pipe-0.011
 spike-0.011
 figure-0.011
Token loved
Token ID6151
Feature activation+9.84e-02

Pos logit contributions
 parties+0.010
 journals+0.010
 chores+0.010
 Him+0.010
onies+0.010
Neg logit contributions
 block-0.007
 nail-0.007
 spike-0.007
 pipe-0.007
 figure-0.007
Token to
Token ID284
Feature activation-7.88e-03

Pos logit contributions
 journals+0.249
 Him+0.249
onies+0.244
 chores+0.243
 parties+0.239
Neg logit contributions
 pipe-0.078
 nail-0.078
 block-0.077
 spike-0.077
 crocod-0.076
Token play
Token ID711
Feature activation+9.46e-02

Pos logit contributions
 block+0.011
 nail+0.011
 figure+0.011
 mark+0.010
 pipe+0.010
Neg logit contributions
 Him-0.017
 journals-0.017
 unnecessary-0.016
onies-0.016
Hope-0.015
Token games
Token ID1830
Feature activation+1.90e-01

Pos logit contributions
 parties+0.211
 chores+0.205
 journals+0.204
 Him+0.199
onies+0.191
Neg logit contributions
 spike-0.110
 pipe-0.108
 nail-0.108
 block-0.107
 crocod-0.106
Token with
Token ID351
Feature activation-1.52e-03

Pos logit contributions
 parties+0.497
 chores+0.483
 journals+0.481
 Him+0.467
onies+0.455
Neg logit contributions
 spike-0.159
 pipe-0.151
 nail-0.151
 crocod-0.146
 block-0.142
Token her
Token ID607
Feature activation+9.24e-02

Pos logit contributions
 block+0.003
 nail+0.003
 spike+0.002
 snake+0.002
 pipe+0.002
Neg logit contributions
 Him-0.003
 hospitality-0.003
 journals-0.003
 chores-0.002
 unnecessary-0.002
Token friends
Token ID2460
Feature activation+1.35e-01

Pos logit contributions
 Him+0.178
 journals+0.172
 chores+0.169
 parties+0.169
onies+0.162
Neg logit contributions
 block-0.142
 pipe-0.141
 nail-0.141
 spike-0.139
 snake-0.138
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 parties+0.342
 journals+0.340
 chores+0.339
 Him+0.335
onies+0.323
Neg logit contributions
 spike-0.111
 nail-0.111
 block-0.110
 pipe-0.110
 crocod-0.105
Token One
Token ID1881
Feature activation+9.91e-02

Pos logit contributions
 journals+0.020
 chores+0.020
 parties+0.020
 Him+0.019
onies+0.018
Neg logit contributions
 block-0.018
 nail-0.017
 pipe-0.017
 spike-0.017
 figure-0.017
Token he
Token ID339
Feature activation-8.40e-03

Pos logit contributions
 journals+0.082
 chores+0.082
 parties+0.081
 Him+0.079
onies+0.074
Neg logit contributions
 spike-0.072
 nail-0.072
 pipe-0.071
 block-0.071
 figure-0.069
Token loved
Token ID6151
Feature activation+9.88e-02

Pos logit contributions
 nail+0.013
 block+0.013
 spike+0.013
 pipe+0.013
 figure+0.013
Neg logit contributions
 parties-0.015
 journals-0.015
 chores-0.015
 Him-0.014
onies-0.014
Token to
Token ID284
Feature activation-3.47e-03

Pos logit contributions
 journals+0.240
 Him+0.240
 parties+0.238
 chores+0.236
onies+0.235
Neg logit contributions
 pipe-0.089
 nail-0.089
 block-0.089
 spike-0.088
 crocod-0.085
Token march
Token ID9960
Feature activation+7.89e-02

Pos logit contributions
 block+0.005
 nail+0.005
 figure+0.005
 pipe+0.005
 spike+0.005
Neg logit contributions
 Him-0.007
 journals-0.007
onies-0.007
 parties-0.006
 unnecessary-0.006
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 parties+0.195
 journals+0.195
 chores+0.193
 Him+0.192
onies+0.184
Neg logit contributions
 block-0.069
 nail-0.069
 spike-0.068
 pipe-0.068
 figure-0.065
Token Everywhere
Token ID48201
Feature activation+1.36e-01

Pos logit contributions
 parties+0.023
 chores+0.023
 journals+0.023
 Him+0.022
onies+0.021
Neg logit contributions
 nail-0.017
 pipe-0.017
 spike-0.017
 figure-0.017
 crocod-0.016
Token Tom
Token ID4186
Feature activation-1.57e-02

Pos logit contributions
 parties+0.318
 journals+0.317
 chores+0.315
 Him+0.306
onies+0.294
Neg logit contributions
 nail-0.141
 spike-0.139
 pipe-0.138
 figure-0.135
 block-0.132
Token went
Token ID1816
Feature activation+5.03e-02

Pos logit contributions
 block+0.016
 nail+0.016
 spike+0.016
 pipe+0.016
 figure+0.016
Neg logit contributions
 parties-0.036
 journals-0.036
 chores-0.036
 Him-0.035
onies-0.034
Token,
Token ID11
Feature activation-2.95e-02

Pos logit contributions
 parties+0.133
 chores+0.131
 Him+0.128
 journals+0.128
 holidays+0.125
Neg logit contributions
 nail-0.038
 spike-0.037
 block-0.037
 blob-0.036
 pipe-0.035
Token he
Token ID339
Feature activation-1.98e-03

Pos logit contributions
 spike+0.029
 nail+0.029
 block+0.029
 pipe+0.029
 figure+0.028
Neg logit contributions
 parties-0.070
 journals-0.069
 chores-0.069
 Him-0.068
onies-0.065
Token marched
Token ID23558
Feature activation+6.42e-02

Pos logit contributions
 nail+0.002
 spike+0.002
 block+0.002
 pipe+0.002
 crocod+0.002
Neg logit contributions
 parties-0.004
 chores-0.004
 journals-0.004
 Him-0.004
onies-0.004
Token make
Token ID787
Feature activation-8.71e-03

Pos logit contributions
 parties+0.109
 chores+0.108
 Him+0.106
 journals+0.106
onies+0.101
Neg logit contributions
 block-0.061
 spike-0.061
 pipe-0.061
 nail-0.060
 crocod-0.059
Token mom
Token ID1995
Feature activation+6.04e-02

Pos logit contributions
 nail+0.015
 block+0.014
 spike+0.014
 pipe+0.014
 figure+0.014
Neg logit contributions
 journals-0.014
 parties-0.014
 chores-0.014
 Him-0.014
onies-0.013
Tokenmy
Token ID1820
Feature activation+5.52e-02

Pos logit contributions
 parties+0.127
 journals+0.126
 chores+0.125
 Him+0.123
onies+0.118
Neg logit contributions
 block-0.077
 nail-0.075
 figure-0.073
 pipe-0.073
 spike-0.072
Token and
Token ID290
Feature activation+6.11e-02

Pos logit contributions
 parties+0.113
 journals+0.112
 chores+0.110
 Him+0.109
onies+0.105
Neg logit contributions
 block-0.072
 nail-0.070
 figure-0.070
 pipe-0.068
 spike-0.067
Token daddy
Token ID35822
Feature activation+7.28e-02

Pos logit contributions
 parties+0.112
 journals+0.112
 chores+0.110
 Him+0.110
onies+0.103
Neg logit contributions
 block-0.092
 spike-0.092
 nail-0.092
 pipe-0.092
 crocod-0.089
Token happy
Token ID3772
Feature activation+8.01e-02

Pos logit contributions
 journals+0.156
 parties+0.156
 chores+0.154
 Him+0.152
onies+0.146
Neg logit contributions
 block-0.087
 figure-0.085
 nail-0.084
 pipe-0.082
 spike-0.081
Token again
Token ID757
Feature activation-2.09e-02

Pos logit contributions
 journals+0.161
 parties+0.161
 chores+0.160
 Him+0.158
onies+0.151
Neg logit contributions
 block-0.106
 nail-0.106
 spike-0.105
 pipe-0.105
 figure-0.102
Token."
Token ID526
Feature activation+5.18e-02

Pos logit contributions
 nail+0.030
 spike+0.030
 block+0.030
 pipe+0.030
 crocod+0.029
Neg logit contributions
 parties-0.040
 journals-0.039
 chores-0.039
 Him-0.039
onies-0.037
Token\n
Token ID198
Feature activation+1.40e-02

Pos logit contributions
 journals+0.117
 chores+0.116
 parties+0.116
 Him+0.114
onies+0.110
Neg logit contributions
 block-0.058
 pipe-0.058
 spike-0.058
 nail-0.058
 figure-0.056
Token\n
Token ID198
Feature activation+8.71e-03

Pos logit contributions
 journals+0.035
 parties+0.034
 chores+0.034
 Him+0.034
onies+0.032
Neg logit contributions
 block-0.013
 spike-0.013
 pipe-0.013
 nail-0.013
 figure-0.012
TokenM
Token ID44
Feature activation-1.38e-02

Pos logit contributions
 journals+0.020
 parties+0.020
 Him+0.020
 chores+0.019
onies+0.018
Neg logit contributions
 nail-0.010
 block-0.010
 spike-0.010
 pipe-0.010
 figure-0.010
Token different
Token ID1180
Feature activation+1.30e-03

Pos logit contributions
 pipe+0.066
 block+0.065
 nail+0.065
 spike+0.065
 figure+0.063
Neg logit contributions
 parties-0.094
 chores-0.094
 journals-0.093
 Him-0.091
 holidays-0.085
Token colors
Token ID7577
Feature activation+1.41e-02

Pos logit contributions
 parties+0.003
 journals+0.003
 chores+0.003
 Him+0.002
onies+0.002
Neg logit contributions
 block-0.002
 nail-0.002
 spike-0.002
 pipe-0.002
 figure-0.002
Token and
Token ID290
Feature activation+5.70e-02

Pos logit contributions
 parties+0.035
 journals+0.033
 chores+0.033
 Him+0.032
onies+0.032
Neg logit contributions
 spike-0.013
 nail-0.013
 pipe-0.012
 crocod-0.012
 figure-0.012
Token sizes
Token ID10620
Feature activation+9.20e-02

Pos logit contributions
 parties+0.106
 journals+0.105
 chores+0.102
 Him+0.100
onies+0.095
Neg logit contributions
 nail-0.086
 block-0.086
 spike-0.086
 pipe-0.086
 figure-0.083
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
 parties+0.217
 journals+0.205
 chores+0.205
onies+0.198
 Him+0.198
Neg logit contributions
 spike-0.095
 pipe-0.092
 nail-0.092
 crocod-0.089
 grun-0.087
Token They
Token ID1119
Feature activation+4.46e-02

Pos logit contributions
 spike+0.022
 nail+0.022
 pipe+0.022
 block+0.022
 figure+0.021
Neg logit contributions
 parties-0.035
 journals-0.035
 Him-0.034
 chores-0.034
onies-0.032
Token filled
Token ID5901
Feature activation-2.54e-02

Pos logit contributions
 parties+0.090
 journals+0.087
 married+0.084
 chores+0.083
 Him+0.082
Neg logit contributions
 spike-0.062
 pipe-0.061
 nail-0.058
 block-0.057
 snake-0.057
Token the
Token ID262
Feature activation-1.53e-03

Pos logit contributions
 nail+0.030
 block+0.030
 spike+0.030
 pipe+0.030
 figure+0.029
Neg logit contributions
 parties-0.055
 journals-0.055
 chores-0.054
 Him-0.054
onies-0.051
Token air
Token ID1633
Feature activation-3.94e-02

Pos logit contributions
 spike+0.002
 nail+0.002
 grun+0.002
 pipe+0.002
 block+0.002
Neg logit contributions
 parties-0.003
 journals-0.003
 chores-0.003
 Him-0.003
onies-0.003
Token with
Token ID351
Feature activation-5.31e-02

Pos logit contributions
 spike+0.037
 nail+0.037
 pipe+0.036
 crocod+0.033
 figure+0.033
Neg logit contributions
 parties-0.097
 journals-0.094
 chores-0.094
 Him-0.091
 holidays-0.088
Token their
Token ID511
Feature activation+7.44e-02

Pos logit contributions
 nail+0.079
 block+0.079
 spike+0.078
 pipe+0.078
 figure+0.076
Neg logit contributions
 parties-0.099
 journals-0.098
 chores-0.097
 Him-0.096
onies-0.091
Token said
Token ID531
Feature activation-2.11e-02

Pos logit contributions
 parties+0.053
 Him+0.051
 married+0.051
 journals+0.050
 amen+0.049
Neg logit contributions
 block-0.043
 nail-0.043
 spike-0.043
 pipe-0.042
 crocod-0.042
Token,
Token ID11
Feature activation-5.25e-02

Pos logit contributions
 pipe+0.015
 spike+0.015
 block+0.015
 nail+0.014
 crocod+0.014
Neg logit contributions
 parties-0.056
 Him-0.056
 chores-0.056
 journals-0.055
onies-0.053
Token "
Token ID366
Feature activation-1.99e-03

Pos logit contributions
 pipe+0.057
 spike+0.057
 crocod+0.055
 block+0.054
 nail+0.053
Neg logit contributions
 Him-0.119
 amen-0.116
 parties-0.111
 chores-0.110
Sleep-0.110
TokenThat
Token ID2504
Feature activation-3.83e-02

Pos logit contributions
 nail+0.003
 block+0.003
 spike+0.003
 pipe+0.003
 figure+0.003
Neg logit contributions
 parties-0.004
 chores-0.004
 journals-0.004
 Him-0.004
onies-0.003
Token's
Token ID338
Feature activation+2.50e-02

Pos logit contributions
 block+0.049
 crocod+0.049
 nail+0.049
 spike+0.048
 pipe+0.047
Neg logit contributions
 chores-0.078
 journals-0.077
 parties-0.077
 Him-0.074
onies-0.072
Token great
Token ID1049
Feature activation+3.46e-02

Pos logit contributions
 parties+0.050
 journals+0.049
 Him+0.047
onies+0.046
 chores+0.046
Neg logit contributions
 nail-0.035
 spike-0.035
 pipe-0.034
 block-0.034
 figure-0.033
Token,
Token ID11
Feature activation-3.54e-02

Pos logit contributions
 parties+0.099
 journals+0.099
 chores+0.099
 Him+0.098
onies+0.095
Neg logit contributions
 block-0.017
 nail-0.017
 spike-0.017
 pipe-0.017
 figure-0.015
Token Lily
Token ID20037
Feature activation+2.54e-03

Pos logit contributions
 pipe+0.043
 block+0.042
 spike+0.042
 nail+0.042
 figure+0.041
Neg logit contributions
 parties-0.077
 chores-0.075
 journals-0.074
 Him-0.073
onies-0.070
Token!
Token ID0
Feature activation-5.61e-02

Pos logit contributions
 parties+0.006
 chores+0.006
 journals+0.006
 Him+0.006
onies+0.005
Neg logit contributions
 spike-0.003
 nail-0.003
 crocod-0.003
 block-0.003
 pipe-0.003
Token Sometimes
Token ID8975
Feature activation+7.36e-02

Pos logit contributions
 nail+0.081
 block+0.081
 spike+0.081
 pipe+0.081
 figure+0.079
Neg logit contributions
 parties-0.105
 journals-0.105
 Him-0.104
 chores-0.104
onies-0.098
Token when
Token ID618
Feature activation+3.54e-02

Pos logit contributions
 journals+0.174
 parties+0.173
 Him+0.172
 chores+0.171
onies+0.165
Neg logit contributions
 nail-0.071
 spike-0.071
 block-0.071
 pipe-0.070
 figure-0.068
Token.
Token ID13
Feature activation-9.97e-03

Pos logit contributions
 parties+0.294
 journals+0.291
 chores+0.290
 Him+0.287
onies+0.275
Neg logit contributions
 nail-0.099
 spike-0.099
 block-0.098
 pipe-0.097
 figure-0.092
Token<|endoftext|>
Token ID50256
Feature activation+1.00e-01

Pos logit contributions
 block+0.010
 spike+0.009
 nail+0.009
 crocod+0.009
 pipe+0.009
Neg logit contributions
 journals-0.024
 chores-0.024
 parties-0.024
 Him-0.023
onies-0.023
TokenOnce
Token ID7454
Feature activation+7.34e-02

Pos logit contributions
 parties+0.216
 journals+0.214
 chores+0.213
 Him+0.211
onies+0.201
Neg logit contributions
 nail-0.118
 spike-0.117
 pipe-0.117
 block-0.116
 figure-0.113
Token upon
Token ID2402
Feature activation+6.25e-02

Pos logit contributions
 parties+0.126
 chores+0.124
 Him+0.122
 journals+0.121
onies+0.114
Neg logit contributions
 spike-0.120
 nail-0.119
 pipe-0.118
 block-0.117
 figure-0.117
Token a
Token ID257
Feature activation+7.30e-02

Pos logit contributions
 journals+0.174
 Him+0.172
onies+0.170
 parties+0.168
 cultures+0.167
Neg logit contributions
 block-0.049
 was-0.040
 saw-0.037
 it-0.036
 figure-0.036
Token time
Token ID640
Feature activation+2.43e-02

Pos logit contributions
 journals+0.173
 Him+0.171
 parties+0.168
 chores+0.164
onies+0.159
Neg logit contributions
 block-0.078
 spike-0.071
 nail-0.070
 pipe-0.069
 crocod-0.069
Token,
Token ID11
Feature activation-1.23e-02

Pos logit contributions
 parties+0.067
 chores+0.066
 Him+0.065
 journals+0.065
onies+0.063
Neg logit contributions
 nail-0.014
 pipe-0.014
 spike-0.014
 block-0.013
 figure-0.013
Token in
Token ID287
Feature activation-5.23e-02

Pos logit contributions
 pipe+0.017
 spike+0.017
 figure+0.017
 crocod+0.016
 nail+0.016
Neg logit contributions
 chores-0.022
 parties-0.022
 journals-0.022
 Him-0.022
Hope-0.021
Token a
Token ID257
Feature activation+5.95e-02

Pos logit contributions
 block+0.064
 nail+0.062
 spike+0.061
 crocod+0.060
 pipe+0.059
Neg logit contributions
 journals-0.115
 parties-0.114
 Him-0.112
 chores-0.111
onies-0.108
Token small
Token ID1402
Feature activation-3.26e-02

Pos logit contributions
 parties+0.136
 journals+0.135
 chores+0.134
 Him+0.134
onies+0.128
Neg logit contributions
 block-0.064
 nail-0.063
 spike-0.063
 pipe-0.063
 figure-0.061
Token swamp
Token ID26837
Feature activation+7.22e-03

Pos logit contributions
 grun+0.053
 spike+0.052
 nail+0.051
 pipe+0.050
 blob+0.050
Neg logit contributions
 parties-0.057
 journals-0.054
 chores-0.053
 holidays-0.052
 Him-0.050
Token decided
Token ID3066
Feature activation+1.01e-01

Pos logit contributions
 Him+0.087
 parties+0.086
 journals+0.086
 chores+0.086
onies+0.083
Neg logit contributions
 block-0.032
 nail-0.031
 figure-0.031
 pipe-0.031
 spike-0.030
Token to
Token ID284
Feature activation-1.30e-02

Pos logit contributions
 parties+0.223
 chores+0.219
 Him+0.216
 journals+0.214
 holidays+0.203
Neg logit contributions
 block-0.122
 spike-0.119
 pipe-0.117
 blob-0.117
 nail-0.116
Token kne
Token ID24813
Feature activation-5.28e-03

Pos logit contributions
 nail+0.019
 figure+0.019
 block+0.019
 pipe+0.018
 spike+0.018
Neg logit contributions
 Him-0.026
 journals-0.025
 parties-0.024
onies-0.024
 chores-0.024
Tokenel
Token ID417
Feature activation-9.05e-02

Pos logit contributions
 block+0.005
 nail+0.005
 pipe+0.005
 spike+0.005
 figure+0.005
Neg logit contributions
 parties-0.013
 journals-0.012
 chores-0.012
 hospitality-0.012
 Him-0.012
Token down
Token ID866
Feature activation-1.81e-02

Pos logit contributions
 block+0.091
 pipe+0.090
 nail+0.089
 spike+0.087
 figure+0.085
Neg logit contributions
 parties-0.214
 journals-0.212
 chores-0.211
 Him-0.208
onies-0.201
Token and
Token ID290
Feature activation+5.22e-02

Pos logit contributions
 block+0.016
 nail+0.016
 pipe+0.016
 spike+0.015
 figure+0.015
Neg logit contributions
 parties-0.045
 journals-0.045
 chores-0.044
 Him-0.044
onies-0.042
Token pray
Token ID12472
Feature activation+4.46e-02

Pos logit contributions
 parties+0.102
 chores+0.102
 journals+0.101
 Him+0.100
onies+0.093
Neg logit contributions
 spike-0.073
 pipe-0.073
 block-0.072
 nail-0.072
 crocod-0.071
Token for
Token ID329
Feature activation-2.74e-02

Pos logit contributions
 journals+0.113
 parties+0.113
 chores+0.111
 Him+0.110
onies+0.107
Neg logit contributions
 block-0.036
 nail-0.035
 pipe-0.035
 spike-0.035
 figure-0.034
Token money
Token ID1637
Feature activation+6.68e-02

Pos logit contributions
 nail+0.031
 figure+0.031
 pipe+0.030
 block+0.029
 spike+0.029
Neg logit contributions
 journals-0.059
 parties-0.058
onies-0.057
affles-0.056
 Him-0.055
Token.
Token ID13
Feature activation+4.74e-03

Pos logit contributions
 journals+0.182
 parties+0.178
 Him+0.178
 chores+0.175
onies+0.171
Neg logit contributions
 block-0.041
 pipe-0.040
 nail-0.039
 figure-0.037
 spike-0.037
Token The
Token ID383
Feature activation-1.85e-02

Pos logit contributions
 journals+0.009
 parties+0.009
 chores+0.009
 Him+0.009
onies+0.009
Neg logit contributions
 block-0.007
 nail-0.007
 spike-0.006
 pipe-0.006
 figure-0.006
Token It
Token ID632
Feature activation-9.08e-02

Pos logit contributions
 spike+0.050
 nail+0.049
 pipe+0.049
 block+0.049
 figure+0.047
Neg logit contributions
 Him-0.101
 parties-0.100
 journals-0.100
 chores-0.098
onies-0.094
Token saw
Token ID2497
Feature activation-1.04e-01

Pos logit contributions
 nail+0.117
 pipe+0.112
 rectangle+0.108
 block+0.098
 lever+0.097
Neg logit contributions
Hope-0.173
Sleep-0.172
 married-0.168
 parties-0.167
 pic-0.163
Token Lily
Token ID20037
Feature activation-2.15e-02

Pos logit contributions
 block+0.107
 nail+0.107
 spike+0.107
 pipe+0.106
 crocod+0.102
Neg logit contributions
 parties-0.243
 journals-0.243
 chores-0.241
 Him-0.238
onies-0.229
Token's
Token ID338
Feature activation+2.73e-03

Pos logit contributions
 nail+0.020
 pipe+0.020
 spike+0.020
 crocod+0.019
 block+0.018
Neg logit contributions
 parties-0.051
 chores-0.051
 journals-0.050
 Him-0.049
onies-0.048
Token ring
Token ID5858
Feature activation-1.40e-01

Pos logit contributions
 chores+0.006
 parties+0.006
 journals+0.006
 hospitality+0.005
 company+0.005
Neg logit contributions
 rectangle-0.003
 pipe-0.003
 nail-0.003
 spike-0.003
oke-0.003
Token and
Token ID290
Feature activation+2.16e-03

Pos logit contributions
 nail+0.142
 spike+0.140
 pipe+0.140
 crocod+0.137
 block+0.131
Neg logit contributions
 chores-0.316
 parties-0.314
 journals-0.311
 Him-0.303
onies-0.300
Token wanted
Token ID2227
Feature activation+1.72e-02

Pos logit contributions
 journals+0.004
 parties+0.004
 Him+0.004
 married+0.004
 chores+0.004
Neg logit contributions
 pipe-0.003
 nail-0.003
 spike-0.003
 block-0.003
 bolt-0.003
Token it
Token ID340
Feature activation-3.52e-02

Pos logit contributions
 parties+0.040
 journals+0.039
 chores+0.039
 Him+0.039
onies+0.037
Neg logit contributions
 nail-0.018
 block-0.018
 pipe-0.018
 spike-0.018
 figure-0.017
Token.
Token ID13
Feature activation-4.16e-02

Pos logit contributions
 block+0.022
 nail+0.018
 pipe+0.018
 spike+0.017
 figure+0.017
Neg logit contributions
 journals-0.102
 Him-0.101
 parties-0.098
 chores-0.097
affles-0.096
Token\n
Token ID198
Feature activation-3.93e-03

Pos logit contributions
 spike+0.053
 pipe+0.053
 block+0.052
 nail+0.052
 crocod+0.051
Neg logit contributions
 Him-0.085
 parties-0.085
 journals-0.084
 chores-0.083
onies-0.080
Token\n
Token ID198
Feature activation-7.71e-03

Pos logit contributions
 nail+0.005
 spike+0.005
 pipe+0.005
 block+0.005
 crocod+0.005
Neg logit contributions
 parties-0.008
 chores-0.008
 journals-0.008
 Him-0.008
onies-0.008
Token<|endoftext|>
Token ID50256
Feature activation+9.94e-02

Pos logit contributions
 nail+0.015
 block+0.015
 spike+0.015
 pipe+0.015
 figure+0.015
Neg logit contributions
 journals-0.024
 parties-0.024
 Him-0.024
 chores-0.024
onies-0.023
TokenOnce
Token ID7454
Feature activation+7.78e-02

Pos logit contributions
 parties+0.191
 journals+0.188
 chores+0.187
 Him+0.184
onies+0.175
Neg logit contributions
 nail-0.141
 spike-0.140
 pipe-0.139
 block-0.137
 figure-0.135
Token upon
Token ID2402
Feature activation+6.62e-02

Pos logit contributions
 parties+0.135
 chores+0.133
 Him+0.132
 journals+0.131
onies+0.123
Neg logit contributions
 spike-0.125
 nail-0.123
 pipe-0.123
 block-0.122
 figure-0.122
Token a
Token ID257
Feature activation+7.63e-02

Pos logit contributions
 journals+0.176
 Him+0.175
 parties+0.174
onies+0.173
affles+0.169
Neg logit contributions
 block-0.058
 figure-0.047
 nail-0.046
 was-0.045
 hit-0.045
Token time
Token ID640
Feature activation+2.81e-02

Pos logit contributions
 journals+0.168
 parties+0.167
 Him+0.166
 chores+0.163
onies+0.156
Neg logit contributions
 block-0.090
 spike-0.087
 nail-0.086
 pipe-0.085
 crocod-0.083
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
 parties+0.075
 chores+0.073
 Him+0.073
 journals+0.072
 holidays+0.070
Neg logit contributions
 pipe-0.019
 nail-0.019
 spike-0.019
 block-0.018
 figure-0.018
Token there
Token ID612
Feature activation-3.32e-04

Pos logit contributions
 pipe+0.013
 spike+0.013
 figure+0.013
 nail+0.013
 crocod+0.012
Neg logit contributions
 chores-0.021
 parties-0.020
 journals-0.020
 Him-0.020
 holidays-0.019
Token was
Token ID373
Feature activation+9.68e-03

Pos logit contributions
 pipe+0.000
 nail+0.000
 bolt+0.000
 rectangle+0.000
 crocod+0.000
Neg logit contributions
 parties-0.001
 chores-0.001
 holidays-0.001
 journals-0.001
 pic-0.001
Token a
Token ID257
Feature activation+5.67e-02

Pos logit contributions
onies+0.026
 Him+0.026
 journals+0.025
 cultures+0.025
 hospitality+0.025
Neg logit contributions
 block-0.008
 crocod-0.007
 nail-0.007
 snake-0.007
 spike-0.007
Token little
Token ID1310
Feature activation-8.97e-03

Pos logit contributions
 parties+0.105
 chores+0.103
 journals+0.102
 Him+0.100
Bon+0.097
Neg logit contributions
 spike-0.072
 pipe-0.069
 nail-0.068
 grun-0.068
 block-0.068
Token girl
Token ID2576
Feature activation-1.13e-02

Pos logit contributions
 grun+0.010
 spike+0.010
 tipped+0.010
 hit+0.010
 figure+0.009
Neg logit contributions
 parties-0.018
 journals-0.018
 Him-0.017
 ponies-0.017
 chores-0.017
Token and
Token ID290
Feature activation+2.08e-02

Pos logit contributions
 spike+0.031
 nail+0.029
 crocod+0.029
 pipe+0.029
 figure+0.028
Neg logit contributions
 parties-0.096
 journals-0.095
 chores-0.095
 Him-0.093
onies-0.090
Token feels
Token ID5300
Feature activation-2.52e-02

Pos logit contributions
 journals+0.040
 parties+0.040
 chores+0.040
 Him+0.039
onies+0.037
Neg logit contributions
 nail-0.030
 spike-0.030
 pipe-0.030
 block-0.029
 figure-0.029
Token better
Token ID1365
Feature activation+2.86e-02

Pos logit contributions
 block+0.034
 nail+0.034
 spike+0.034
 pipe+0.034
 figure+0.033
Neg logit contributions
 parties-0.050
 journals-0.050
 chores-0.049
 Him-0.049
onies-0.047
Token.
Token ID13
Feature activation-4.65e-02

Pos logit contributions
 chores+0.066
 parties+0.066
 journals+0.066
 Him+0.065
onies+0.062
Neg logit contributions
 spike-0.030
 nail-0.029
 pipe-0.029
 block-0.028
 crocod-0.028
Token She
Token ID1375
Feature activation-1.01e-02

Pos logit contributions
 block+0.063
 nail+0.062
 pipe+0.062
 spike+0.062
 crocod+0.060
Neg logit contributions
 journals-0.093
 parties-0.093
 chores-0.093
 Him-0.090
onies-0.087
Token gives
Token ID3607
Feature activation+4.84e-03

Pos logit contributions
 block+0.014
 spike+0.013
 figure+0.013
 nail+0.013
 pipe+0.013
Neg logit contributions
 Him-0.021
 journals-0.020
 parties-0.020
 chores-0.020
 holidays-0.019
Token him
Token ID683
Feature activation+3.63e-02

Pos logit contributions
 parties+0.008
 journals+0.008
 chores+0.008
 Him+0.008
onies+0.008
Neg logit contributions
 block-0.008
 spike-0.008
 nail-0.008
 pipe-0.008
 crocod-0.007
Token her
Token ID607
Feature activation+7.13e-02

Pos logit contributions
 Him+0.090
 parties+0.086
Hope+0.086
 amen+0.085
 journals+0.084
Neg logit contributions
 block-0.039
 crocod-0.034
 pipe-0.033
 nail-0.031
 spike-0.031
Token red
Token ID2266
Feature activation-7.42e-02

Pos logit contributions
 chores+0.158
 journals+0.155
 parties+0.154
 Him+0.147
onies+0.143
Neg logit contributions
 spike-0.078
 nail-0.078
 block-0.076
 crocod-0.075
 pipe-0.074
Token boot
Token ID6297
Feature activation-4.38e-02

Pos logit contributions
 bolt+0.095
oke+0.093
 rectangle+0.087
 nail+0.087
 figure+0.085
Neg logit contributions
 chores-0.148
 parties-0.141
 journals-0.138
onies-0.133
 favorites-0.126
Token.
Token ID13
Feature activation-5.21e-02

Pos logit contributions
 block+0.033
 nail+0.033
 spike+0.032
 pipe+0.032
 figure+0.031
Neg logit contributions
 parties-0.114
 journals-0.113
 chores-0.113
 Him-0.112
onies-0.108
Token pretty
Token ID2495
Feature activation-2.28e-02

Pos logit contributions
 parties+0.034
 journals+0.033
 chores+0.032
 Him+0.031
 married+0.030
Neg logit contributions
 spike-0.030
 pipe-0.029
 nail-0.029
 block-0.028
 figure-0.028
Token.
Token ID13
Feature activation-2.95e-02

Pos logit contributions
 spike+0.015
 nail+0.015
 pipe+0.014
 block+0.014
 figure+0.013
Neg logit contributions
 parties-0.062
 journals-0.062
 chores-0.061
 Him-0.061
onies-0.059
Token He
Token ID679
Feature activation-2.14e-02

Pos logit contributions
 block+0.036
 nail+0.035
 spike+0.035
 pipe+0.035
 crocod+0.034
Neg logit contributions
 parties-0.063
 journals-0.063
 chores-0.063
 Him-0.062
onies-0.059
Token wanted
Token ID2227
Feature activation+2.51e-02

Pos logit contributions
 pipe+0.026
 spike+0.026
 nail+0.025
 block+0.023
 snake+0.023
Neg logit contributions
 journals-0.046
 parties-0.046
 chores-0.044
 married-0.043
 Him-0.042
Token to
Token ID284
Feature activation-1.05e-02

Pos logit contributions
 parties+0.055
 journals+0.054
 chores+0.054
 Him+0.053
onies+0.051
Neg logit contributions
 nail-0.030
 spike-0.030
 pipe-0.030
 block-0.029
 figure-0.029
Token draw
Token ID3197
Feature activation+6.13e-04

Pos logit contributions
 pipe+0.016
 spike+0.015
 nail+0.015
 crocod+0.015
 block+0.015
Neg logit contributions
 parties-0.020
 chores-0.019
 journals-0.019
 Him-0.019
onies-0.018
Token too
Token ID1165
Feature activation+5.56e-02

Pos logit contributions
 journals+0.001
 parties+0.001
 chores+0.001
 Him+0.001
onies+0.001
Neg logit contributions
 block-0.001
 nail-0.001
 spike-0.001
 pipe-0.001
 figure-0.001
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
 parties+0.142
 journals+0.141
 chores+0.140
 Him+0.140
onies+0.134
Neg logit contributions
 block-0.044
 nail-0.044
 spike-0.044
 pipe-0.044
 figure-0.042
Token He
Token ID679
Feature activation-1.34e-02

Pos logit contributions
 nail+0.029
 pipe+0.029
 spike+0.028
 block+0.028
 figure+0.027
Neg logit contributions
 parties-0.053
 journals-0.053
 chores-0.052
 Him-0.052
onies-0.050
Token grabbed
Token ID13176
Feature activation-3.88e-02

Pos logit contributions
 pipe+0.018
 spike+0.017
 nail+0.016
 crocod+0.016
 snake+0.015
Neg logit contributions
 journals-0.027
 parties-0.027
 chores-0.025
 married-0.025
onies-0.025
Token a
Token ID257
Feature activation+4.99e-02

Pos logit contributions
 pipe+0.055
 nail+0.053
 spike+0.052
 figure+0.052
 block+0.050
Neg logit contributions
 chores-0.077
 parties-0.077
 journals-0.076
 Him-0.075
onies-0.070
Token caul
Token ID46087
Feature activation-6.41e-03

Pos logit contributions
 chores+0.010
 journals+0.010
 Him+0.010
 parties+0.009
onies+0.009
Neg logit contributions
 pipe-0.013
 block-0.013
 spike-0.013
 nail-0.012
 crocod-0.012
Tokeniflower
Token ID42642
Feature activation-3.78e-03

Pos logit contributions
 pipe+0.014
 block+0.014
 spike+0.014
 nail+0.014
 crocod+0.014
Neg logit contributions
 journals-0.007
 Him-0.007
 chores-0.007
 parties-0.007
onies-0.006
Token and
Token ID290
Feature activation+3.21e-02

Pos logit contributions
 spike+0.004
 crocod+0.003
 block+0.003
 nail+0.003
 pipe+0.003
Neg logit contributions
 chores-0.009
 parties-0.009
 journals-0.009
 Him-0.009
onies-0.008
Token took
Token ID1718
Feature activation+2.79e-02

Pos logit contributions
 parties+0.064
 journals+0.063
 chores+0.063
 Him+0.060
 married+0.059
Neg logit contributions
 spike-0.040
 pipe-0.040
 block-0.040
 bolt-0.039
 crocod-0.038
Token it
Token ID340
Feature activation-2.08e-02

Pos logit contributions
 parties+0.076
 journals+0.076
 chores+0.076
 Him+0.075
onies+0.072
Neg logit contributions
 pipe-0.018
 spike-0.017
 nail-0.017
 block-0.017
 crocod-0.016
Token to
Token ID284
Feature activation-1.46e-02

Pos logit contributions
 spike+0.024
 nail+0.024
 block+0.024
 pipe+0.024
 crocod+0.023
Neg logit contributions
 parties-0.046
 journals-0.045
 chores-0.045
 Him-0.045
onies-0.043
Token her
Token ID607
Feature activation+7.80e-02

Pos logit contributions
 spike+0.021
 block+0.021
 pipe+0.021
 nail+0.021
 crocod+0.020
Neg logit contributions
 parties-0.028
 chores-0.027
 Him-0.027
 journals-0.027
onies-0.025
Token home
Token ID1363
Feature activation-1.57e-02

Pos logit contributions
 parties+0.174
 chores+0.172
 journals+0.172
 Him+0.167
onies+0.159
Neg logit contributions
 spike-0.088
 nail-0.086
 pipe-0.086
 block-0.086
 crocod-0.084
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
 spike+0.019
 pipe+0.017
 crocod+0.017
 nail+0.017
 snake+0.017
Neg logit contributions
 parties-0.035
 chores-0.035
 journals-0.033
 Him-0.033
 hospitality-0.032
Token She
Token ID1375
Feature activation+7.97e-03

Pos logit contributions
 nail+0.028
 block+0.028
 spike+0.027
 pipe+0.027
 figure+0.027
Neg logit contributions
 parties-0.039
 journals-0.039
 chores-0.039
 Him-0.038
onies-0.036
Token had
Token ID550
Feature activation-3.06e-02

Pos logit contributions
 parties+0.016
 journals+0.016
 chores+0.016
 married+0.016
 Him+0.015
Neg logit contributions
 pipe-0.010
 spike-0.010
 nail-0.009
 bolt-0.009
 snake-0.009
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
 spike+0.001
 nail+0.001
 block+0.001
 pipe+0.001
 crocod+0.001
Neg logit contributions
 parties-0.003
 journals-0.003
 chores-0.003
 Him-0.003
onies-0.003
Token His
Token ID2399
Feature activation+2.29e-02

Pos logit contributions
 nail+0.036
 spike+0.035
 block+0.035
 pipe+0.035
 figure+0.035
Neg logit contributions
 parties-0.050
 journals-0.050
 chores-0.049
 Him-0.049
onies-0.046
Token mom
Token ID1995
Feature activation-3.09e-02

Pos logit contributions
 parties+0.045
 Him+0.045
onies+0.043
 journals+0.042
irs+0.042
Neg logit contributions
 spike-0.033
 nail-0.032
 block-0.032
 figure-0.032
 pipe-0.032
Token was
Token ID373
Feature activation-4.20e-02

Pos logit contributions
 nail+0.038
 crocod+0.037
 spike+0.037
 pipe+0.037
 snake+0.035
Neg logit contributions
 parties-0.065
 journals-0.064
 chores-0.062
onies-0.062
 Him-0.062
Token happy
Token ID3772
Feature activation+4.63e-02

Pos logit contributions
 pipe+0.077
 spike+0.075
 crocod+0.075
 nail+0.075
 snake+0.073
Neg logit contributions
 chores-0.066
 parties-0.065
 journals-0.062
 Him-0.060
 married-0.060
Token that
Token ID326
Feature activation-1.11e-01

Pos logit contributions
 parties+0.094
 Him+0.092
 journals+0.088
 hospitality+0.086
 chores+0.085
Neg logit contributions
 nail-0.064
 spike-0.063
 pipe-0.062
 block-0.062
 crocod-0.059
Token Tim
Token ID5045
Feature activation-4.77e-02

Pos logit contributions
 block+0.122
 pipe+0.120
 nail+0.119
 spike+0.118
 figure+0.116
Neg logit contributions
 parties-0.253
 chores-0.252
 journals-0.247
 Him-0.243
 holidays-0.236
Tokenmy
Token ID1820
Feature activation-1.34e-02

Pos logit contributions
 block+0.063
 nail+0.063
 spike+0.063
 pipe+0.063
 figure+0.061
Neg logit contributions
 parties-0.096
 journals-0.096
 chores-0.095
 Him-0.094
onies-0.090
Token was
Token ID373
Feature activation-4.13e-02

Pos logit contributions
 nail+0.009
 spike+0.009
 block+0.009
 pipe+0.009
 crocod+0.009
Neg logit contributions
 parties-0.035
 journals-0.035
 chores-0.035
 Him-0.034
onies-0.033
Token taking
Token ID2263
Feature activation+1.50e-02

Pos logit contributions
 spike+0.071
 pipe+0.071
 nail+0.070
 crocod+0.069
 block+0.069
Neg logit contributions
 parties-0.068
 chores-0.068
 journals-0.066
 Him-0.065
 married-0.061
Token good
Token ID922
Feature activation+7.11e-02

Pos logit contributions
 parties+0.024
 journals+0.024
 Him+0.024
 chores+0.024
onies+0.023
Neg logit contributions
 spike-0.026
 block-0.026
 nail-0.025
 pipe-0.025
 figure-0.025
Token walk
Token ID2513
Feature activation+7.24e-02

Pos logit contributions
 pipe+0.052
 nail+0.052
 spike+0.051
 crocod+0.051
 block+0.051
Neg logit contributions
 chores-0.074
 parties-0.074
 journals-0.071
 Him-0.070
onies-0.067
Token slowly
Token ID6364
Feature activation+5.90e-03

Pos logit contributions
 parties+0.161
 chores+0.159
 journals+0.159
 Him+0.156
onies+0.149
Neg logit contributions
 spike-0.082
 nail-0.082
 block-0.082
 pipe-0.081
 crocod-0.080
Token and
Token ID290
Feature activation+5.65e-02

Pos logit contributions
 parties+0.014
 chores+0.014
 Him+0.014
 journals+0.013
 holidays+0.013
Neg logit contributions
 spike-0.006
 nail-0.006
 pipe-0.006
 crocod-0.006
 block-0.006
Token she
Token ID673
Feature activation+6.73e-03

Pos logit contributions
 parties+0.100
 chores+0.098
 journals+0.095
 Him+0.095
 holidays+0.090
Neg logit contributions
 spike-0.091
 pipe-0.089
 block-0.089
 nail-0.089
 crocod-0.089
Token never
Token ID1239
Feature activation-5.71e-02

Pos logit contributions
 parties+0.014
 journals+0.014
 chores+0.014
 Him+0.014
onies+0.013
Neg logit contributions
 nail-0.008
 spike-0.008
 pipe-0.008
 block-0.008
 crocod-0.008
Token broke
Token ID6265
Feature activation-1.07e-01

Pos logit contributions
 nail+0.078
 pipe+0.078
 block+0.077
 spike+0.077
 snake+0.074
Neg logit contributions
 parties-0.115
 chores-0.109
 journals-0.108
 Him-0.107
 hospitality-0.104
Token her
Token ID607
Feature activation+7.06e-02

Pos logit contributions
 block+0.129
 spike+0.129
 nail+0.128
 pipe+0.126
 crocod+0.122
Neg logit contributions
 journals-0.228
 parties-0.228
 Him-0.226
 chores-0.226
onies-0.216
Token nap
Token ID25422
Feature activation+3.98e-02

Pos logit contributions
 parties+0.127
 journals+0.127
 Him+0.127
 chores+0.125
onies+0.119
Neg logit contributions
 block-0.110
 nail-0.109
 spike-0.108
 pipe-0.108
 figure-0.106
Tokenkin
Token ID5116
Feature activation-2.43e-02

Pos logit contributions
 journals+0.084
 parties+0.084
 Him+0.083
 chores+0.083
onies+0.079
Neg logit contributions
 block-0.049
 nail-0.049
 spike-0.048
 pipe-0.048
 figure-0.047
Token again
Token ID757
Feature activation-3.91e-02

Pos logit contributions
 spike+0.025
 pipe+0.025
 crocod+0.025
 nail+0.024
 block+0.023
Neg logit contributions
 parties-0.057
 chores-0.056
 journals-0.055
 Him-0.053
onies-0.052
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 spike+0.035
 block+0.035
 nail+0.035
 pipe+0.035
 crocod+0.033
Neg logit contributions
 parties-0.096
 chores-0.095
 journals-0.095
 Him-0.094
onies-0.090
Token it
Token ID340
Feature activation-5.55e-02

Pos logit contributions
oke+0.123
 pipe+0.111
 figure+0.103
 crocod+0.102
 rectangle+0.102
Neg logit contributions
 chores-0.209
 parties-0.200
 ourselves-0.197
 journals-0.196
 ponies-0.195
Token.
Token ID13
Feature activation-7.46e-03

Pos logit contributions
 pipe+0.062
 crocod+0.060
 spike+0.060
 figure+0.059
 nail+0.059
Neg logit contributions
 chores-0.119
 parties-0.116
 journals-0.114
onies-0.107
 holidays-0.105
Token Suddenly
Token ID24975
Feature activation-4.23e-03

Pos logit contributions
 pipe+0.011
 crocod+0.010
 spike+0.010
 nail+0.010
 figure+0.010
Neg logit contributions
 parties-0.014
 chores-0.014
 Him-0.014
 journals-0.014
onies-0.013
Token,
Token ID11
Feature activation-4.96e-02

Pos logit contributions
 pipe+0.004
 block+0.004
 spike+0.004
 figure+0.004
 blob+0.004
Neg logit contributions
 parties-0.010
 chores-0.010
 Him-0.010
 journals-0.009
onies-0.009
Token she
Token ID673
Feature activation-4.48e-03

Pos logit contributions
 pipe+0.078
 block+0.078
 spike+0.077
 nail+0.076
 crocod+0.075
Neg logit contributions
 parties-0.088
 chores-0.088
 journals-0.086
 Him-0.085
 holidays-0.079
Token saw
Token ID2497
Feature activation-9.99e-02

Pos logit contributions
 nail+0.004
 spike+0.004
 pipe+0.004
 snake+0.004
 bolt+0.004
Neg logit contributions
 journals-0.010
 parties-0.010
 married-0.009
 chores-0.009
onies-0.009
Token the
Token ID262
Feature activation-3.73e-02

Pos logit contributions
 block+0.158
 pipe+0.158
 figure+0.156
 nail+0.156
 spike+0.155
Neg logit contributions
 parties-0.170
 chores-0.168
 journals-0.166
 Him-0.163
onies-0.152
Token sunshine
Token ID34488
Feature activation+9.39e-03

Pos logit contributions
 grun+0.046
oke+0.044
 figure+0.041
 crocod+0.040
 pipe+0.040
Neg logit contributions
 chores-0.077
 parties-0.074
 journals-0.070
 ponies-0.068
 holidays-0.068
Token and
Token ID290
Feature activation+3.61e-02

Pos logit contributions
 parties+0.020
 chores+0.019
 holidays+0.018
ics+0.018
 journals+0.017
Neg logit contributions
 nail-0.010
 spike-0.010
 crocod-0.010
oke-0.010
 pipe-0.010
Token it
Token ID340
Feature activation-8.50e-02

Pos logit contributions
 parties+0.077
 journals+0.074
 Him+0.072
 chores+0.072
 holidays+0.070
Neg logit contributions
 pipe-0.044
 block-0.043
 nail-0.043
 spike-0.042
 figure-0.041
Token made
Token ID925
Feature activation-7.23e-02

Pos logit contributions
oke+0.113
 nail+0.107
 pipe+0.106
 block+0.102
 crocod+0.101
Neg logit contributions
 parties-0.153
 chores-0.144
 married-0.142
ics-0.139
 holidays-0.138
Token Ben
Token ID3932
Feature activation-2.89e-02

Pos logit contributions
 block+0.066
 nail+0.066
 spike+0.065
 pipe+0.065
 figure+0.063
Neg logit contributions
 parties-0.105
 journals-0.104
 chores-0.104
 Him-0.103
onies-0.097
Token saw
Token ID2497
Feature activation-9.91e-02

Pos logit contributions
 nail+0.029
 spike+0.029
 block+0.027
 pipe+0.026
 rectangle+0.026
Neg logit contributions
 parties-0.064
 journals-0.064
 chores-0.063
onies-0.062
 Him-0.061
Token something
Token ID1223
Feature activation+8.59e-03

Pos logit contributions
 nail+0.152
 block+0.150
 spike+0.149
 pipe+0.148
 figure+0.148
Neg logit contributions
 chores-0.179
 parties-0.178
 journals-0.175
 Him-0.174
 holidays-0.158
Token.
Token ID13
Feature activation-3.74e-02

Pos logit contributions
 chores+0.016
 parties+0.015
 journals+0.014
 married+0.013
 Him+0.013
Neg logit contributions
 rectangle-0.012
 block-0.012
 nail-0.012
 pipe-0.011
 blob-0.011
Token He
Token ID679
Feature activation-2.64e-02

Pos logit contributions
 spike+0.049
 pipe+0.048
 nail+0.048
 figure+0.047
 block+0.045
Neg logit contributions
 journals-0.075
 chores-0.073
 Him-0.073
 parties-0.073
Sleep-0.068
Token saw
Token ID2497
Feature activation-1.02e-01

Pos logit contributions
 pipe+0.030
 nail+0.030
 rectangle+0.029
 spike+0.028
 brick+0.027
Neg logit contributions
 journals-0.052
 married-0.049
onies-0.048
 pic-0.048
Hope-0.048
Token a
Token ID257
Feature activation+1.81e-02

Pos logit contributions
 nail+0.155
 pipe+0.153
 block+0.152
 spike+0.152
 figure+0.151
Neg logit contributions
 parties-0.184
 chores-0.183
 journals-0.182
 Him-0.179
 holidays-0.164
Token big
Token ID1263
Feature activation-8.53e-02

Pos logit contributions
 chores+0.027
 ponies+0.026
 magazines+0.024
 handsome+0.024
 parties+0.024
Neg logit contributions
 grun-0.025
et-0.025
 tipped-0.024
oke-0.024
 Joe-0.023
Token hole
Token ID7604
Feature activation-1.43e-01

Pos logit contributions
oke+0.113
way+0.106
 grun+0.104
aver+0.101
ull+0.101
Neg logit contributions
 ponies-0.142
 parties-0.139
 magazines-0.137
 chores-0.134
 novels-0.133
Token in
Token ID287
Feature activation-8.99e-02

Pos logit contributions
 spike+0.237
 nail+0.225
 pipe+0.217
 rectangle+0.210
 bulldo+0.208
Neg logit contributions
 parties-0.203
 chores-0.198
ics-0.195
 journals-0.190
Sleep-0.179
Token the
Token ID262
Feature activation-4.98e-02

Pos logit contributions
 pipe+0.099
 nail+0.095
oke+0.095
 figure+0.092
 spike+0.091
Neg logit contributions
 chores-0.194
 parties-0.193
 journals-0.189
 holidays-0.181
 Him-0.181
Token and
Token ID290
Feature activation+2.52e-02

Pos logit contributions
 nail+0.027
 pipe+0.027
 spike+0.026
 crocod+0.025
 block+0.024
Neg logit contributions
 journals-0.048
 parties-0.048
 Him-0.048
 married-0.047
 chores-0.046
Token the
Token ID262
Feature activation-4.63e-02

Pos logit contributions
 parties+0.048
 Him+0.048
 journals+0.047
 chores+0.047
Hope+0.043
Neg logit contributions
 pipe-0.037
 spike-0.036
 block-0.036
 nail-0.036
 blob-0.034
Token dog
Token ID3290
Feature activation-7.39e-02

Pos logit contributions
 block+0.048
 spike+0.047
 nail+0.046
 pipe+0.046
 rectangle+0.045
Neg logit contributions
 chores-0.104
 journals-0.101
 parties-0.100
 Him-0.100
 holidays-0.094
Token were
Token ID547
Feature activation+2.22e-02

Pos logit contributions
 nail+0.090
 block+0.089
 spike+0.089
 pipe+0.089
 figure+0.085
Neg logit contributions
 parties-0.157
 chores-0.155
 journals-0.155
 Him-0.153
onies-0.146
Token sad
Token ID6507
Feature activation-4.33e-02

Pos logit contributions
 parties+0.038
 chores+0.038
 journals+0.037
 Him+0.036
 married+0.035
Neg logit contributions
 nail-0.037
 pipe-0.036
 spike-0.036
 block-0.036
 figure-0.035
Token that
Token ID326
Feature activation-9.51e-02

Pos logit contributions
 nail+0.042
 block+0.042
 spike+0.042
 pipe+0.041
 figure+0.040
Neg logit contributions
 parties-0.104
 chores-0.104
 journals-0.102
 Him-0.102
 holidays-0.097
Token their
Token ID511
Feature activation+5.04e-02

Pos logit contributions
 block+0.134
 nail+0.133
 spike+0.133
 pipe+0.132
 figure+0.127
Neg logit contributions
 chores-0.187
 parties-0.187
 journals-0.185
 Him-0.182
 holidays-0.171
Token fun
Token ID1257
Feature activation+7.34e-02

Pos logit contributions
 parties+0.101
 journals+0.101
 Him+0.100
 chores+0.100
onies+0.095
Neg logit contributions
 spike-0.067
 block-0.067
 nail-0.067
 pipe-0.066
 figure-0.065
Token had
Token ID550
Feature activation-2.13e-02

Pos logit contributions
 parties+0.161
 chores+0.159
 journals+0.158
 Him+0.153
onies+0.148
Neg logit contributions
 nail-0.086
 block-0.086
 spike-0.085
 pipe-0.083
 crocod-0.082
Token been
Token ID587
Feature activation-2.31e-02

Pos logit contributions
 nail+0.026
 block+0.026
 pipe+0.025
 spike+0.025
 figure+0.025
Neg logit contributions
 parties-0.046
 chores-0.045
 journals-0.045
 Him-0.044
onies-0.042
Token stopped
Token ID5025
Feature activation+5.17e-05

Pos logit contributions
 spike+0.030
 pipe+0.030
 nail+0.029
 block+0.029
 figure+0.028
Neg logit contributions
 parties-0.048
 chores-0.047
 journals-0.045
 married-0.045
 Him-0.044