TOP ACTIVATIONS
MAX = 4.761

night . They both felt a sense of pride and pleasure
. Jan ie shared her creations with all her family and
back the village to its former state . With the help
. They all shared a big piece of the enormous cake
things better . She felt a great sense of accomplishment for
the sweater . Alec felt a warm feeling in his heart
M agg ie hugged Mom my and said , " I
drank the soup and felt a little better . She sn
S oph ie felt a bit bitter , but was
walk . She still felt a bit w obb ly ,
on it . Sally felt a little better , but she
calendar soon ." Amy felt a little better after hearing these
was shining . Lucy felt a bit scared but with Jake
M um and dad hugged Maria and they all went home
her jam . She felt a great sense of accomplishment so
happy ! She hugged her mum and together they went to
it . After they finished cleaning the shell , they celebrated
sweet treat and shared a big , happy hug . <|endoftext|>
of you !" Ruth felt a special feeling in her heart
yourself . <|endoftext|> Once upon a time , there lived a

MIN ACTIVATIONS
MIN = -5.254

saw a big , red thing on the ground . He
mom and saw a terrible thing . A man had a
saw a strange and unknown thing . She asked her mum
One day a very unusual thing happened . He pulled on
One day , a strange thing happened . A small boy
a man with a real gun . He is angry .
, they see a new thing in the park . It
day , a shiny new vehicle came to town . Everyone
day , a new green truck came to play . The
small , round , green thing on the ground . It
, they see a new thing in the gym . It
Then , a bad thing happened . A big car
, they see a real train outside . It is big
they see a big red thing on a green plant .
when he saw a big vehicle coming down the road .
named Tim found a toy motorcycle . It was red and
they see a big white thing on the grass . It
, they notice a big truck on the road . The
saw a long , green thing in the water . He
bus met a small green car . The car liked to

INTERVAL 2.257 - 4.761
CONTAINS 1.031%

to eat , and she ate every last bite . After
wonderful job cleaning the garage ." Little Jimmy smiled . He
lessons . <|endoftext|> Once upon a time , there was a
B enny learned that trying new things can be good and
with respect . Ign or ance can lead to misunderstanding but

INTERVAL -0.247 - 2.257
CONTAINS 59.801%

The lady took Jacob to see a special
fall and break . L ily was upset
door and opened it . But she forgot about her bike
and Tom nodded . They hoped Mom my would forgive them
he woke up from his nap , he felt refreshed and

INTERVAL -2.750 - -0.247
CONTAINS 38.804%

mom . They see a lot of rocks on the ground
man told them that the factory made toys for children all
friends . <|endoftext|> One day , Jim went to the zoo
that being different was okay and they all became friends again
. Joy saw the billboard and wanted to balance on top

INTERVAL -5.254 - -2.750
CONTAINS 0.364%

car that he played with every day . One day ,
one day , a small animal came along . It was
big , scary owl and it scared her . She ran
day , Jane had a plan . She wanted to make
Sam . They wished they never came to his house .
Token night
Token ID1755
Feature activation+1.98e-01

Pos logit contributions
 parked+0.003
 driving+0.002
 smoking+0.002
 stole+0.002
 disturbed+0.002
Neg logit contributions
 satisfaction-0.002
 pleasure-0.002
 accomplishment-0.002
 delight-0.002
 fascination-0.002
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
 satisfaction+0.011
 pleasure+0.011
 accomplishment+0.011
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.006
 driving-0.006
 smoking-0.006
 stole-0.006
 disturbed-0.005
Token They
Token ID1119
Feature activation+1.82e+00

Pos logit contributions
 accomplishment+0.078
 satisfaction+0.077
 fascination+0.074
 decision+0.074
 pleasure+0.074
Neg logit contributions
 parked-0.048
 driving-0.045
 smoking-0.044
 stole-0.043
 CA-0.040
Token both
Token ID1111
Feature activation+1.75e+00

Pos logit contributions
 pleasure+0.106
 accomplishment+0.103
 satisfaction+0.102
 decision+0.098
 accomplishments+0.098
Neg logit contributions
 parked-0.056
 stole-0.050
 knocked-0.047
 broke-0.046
 stuck-0.046
Token felt
Token ID2936
Feature activation+2.36e+00

Pos logit contributions
 accomplishment+0.095
 pleasure+0.095
 satisfaction+0.091
 accomplishments+0.089
 fascination+0.088
Neg logit contributions
 parked-0.057
 stole-0.054
 broke-0.051
 knocked-0.050
 disturbed-0.049
Token a
Token ID257
Feature activation+4.76e+00

Pos logit contributions
aman+0.126
 accomplishment+0.123
 fascination+0.121
 accomplishments+0.121
oph+0.118
Neg logit contributions
 parked-0.071
 disturbed-0.066
 stuck-0.064
 stole-0.064
 hurt-0.063
Token sense
Token ID2565
Feature activation+1.22e+00

Pos logit contributions
 deeds+0.237
annah+0.226
ishment+0.226
ently+0.223
heart+0.222
Neg logit contributions
 hurt-0.140
 knocked-0.128
 parked-0.127
 wrecked-0.124
 broke-0.121
Token of
Token ID286
Feature activation+2.17e+00

Pos logit contributions
 satisfaction+0.080
 pleasure+0.079
 fascination+0.077
 accomplishment+0.077
 delight+0.075
Neg logit contributions
 parked-0.025
 driving-0.024
 smoking-0.023
 disturbed-0.021
 stole-0.021
Token pride
Token ID11293
Feature activation+1.36e+00

Pos logit contributions
oph+0.057
aman+0.053
aire+0.050
heart+0.046
ya+0.044
Neg logit contributions
 driving-0.123
 parked-0.123
 stuck-0.118
 stole-0.115
 thrown-0.115
Token and
Token ID290
Feature activation+2.54e+00

Pos logit contributions
 pleasure+0.064
 satisfaction+0.063
 accomplishment+0.061
 fascination+0.060
 delight+0.059
Neg logit contributions
 parked-0.054
 driving-0.053
 smoking-0.051
 stole-0.049
 broke-0.048
Token pleasure
Token ID9476
Feature activation+8.93e-01

Pos logit contributions
aman+0.113
oph+0.104
 pleasure+0.104
 difference+0.102
 accomplishments+0.101
Neg logit contributions
 parked-0.096
 stole-0.093
 driving-0.090
 disturbed-0.090
 broke-0.090
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
 satisfaction+0.055
 pleasure+0.054
 accomplishment+0.053
 fascination+0.052
 delight+0.052
Neg logit contributions
 parked-0.027
 driving-0.025
 smoking-0.024
 stole-0.023
 disturbed-0.022
Token Jan
Token ID2365
Feature activation+2.15e+00

Pos logit contributions
 satisfaction+0.056
 accomplishment+0.055
 pleasure+0.054
 fascination+0.052
 delight+0.051
Neg logit contributions
 parked-0.038
 driving-0.036
 smoking-0.035
 stole-0.034
 knocked-0.033
Tokenie
Token ID494
Feature activation+2.68e+00

Pos logit contributions
 pleasure+0.124
 satisfaction+0.122
 accomplishment+0.115
 fascination+0.114
 delight+0.114
Neg logit contributions
 parked-0.067
 stole-0.061
 broke-0.059
 disturbed-0.059
 stuck-0.058
Token shared
Token ID4888
Feature activation+1.86e+00

Pos logit contributions
 pleasure+0.169
 satisfaction+0.166
 accomplishment+0.163
 fascination+0.159
 relief+0.155
Neg logit contributions
 parked-0.068
 knocked-0.060
 broke-0.059
 stole-0.058
 driving-0.058
Token her
Token ID607
Feature activation+2.23e+00

Pos logit contributions
 pleasure+0.109
 satisfaction+0.106
 accomplishment+0.104
 fascination+0.101
 difference+0.098
Neg logit contributions
 parked-0.051
 driving-0.049
 smoking-0.046
 stole-0.044
 broke-0.043
Token creations
Token ID28443
Feature activation+4.59e+00

Pos logit contributions
 fascination+0.107
 satisfaction+0.105
oph+0.103
aman+0.103
heart+0.100
Neg logit contributions
 parked-0.073
 driving-0.073
 broke-0.068
 stuck-0.067
 hurt-0.067
Token with
Token ID351
Feature activation+1.68e+00

Pos logit contributions
 pleasure+0.281
 satisfaction+0.277
 fascination+0.275
 Deal+0.267
 disbelief+0.264
Neg logit contributions
 parked-0.102
 on-0.099
 driving-0.097
 to-0.084
 called-0.084
Token all
Token ID477
Feature activation+2.38e+00

Pos logit contributions
 pleasure+0.090
 accomplishment+0.088
 satisfaction+0.087
 decision+0.085
 fascination+0.085
Neg logit contributions
 driving-0.045
 parked-0.045
 disturbed-0.044
 smoking-0.044
 stole-0.042
Token her
Token ID607
Feature activation+2.25e+00

Pos logit contributions
 pleasure+0.149
 accomplishment+0.144
 satisfaction+0.143
 fascination+0.140
 Deal+0.139
Neg logit contributions
 parked-0.044
 driving-0.042
 smoking-0.036
 blocking-0.034
 called-0.034
Token family
Token ID1641
Feature activation+2.75e+00

Pos logit contributions
 fascination+0.109
 satisfaction+0.109
 pleasure+0.102
aman+0.102
oph+0.101
Neg logit contributions
 broke-0.071
 hurt-0.069
 stuck-0.069
 stole-0.069
 cars-0.069
Token and
Token ID290
Feature activation+1.15e+00

Pos logit contributions
 satisfaction+0.139
 pleasure+0.137
 fascination+0.136
 accomplishment+0.134
 Deal+0.132
Neg logit contributions
 parked-0.093
 driving-0.092
 stuck-0.082
 smoking-0.081
 sitting-0.079
Token back
Token ID736
Feature activation+1.23e+00

Pos logit contributions
 Deal+0.171
 fascination+0.169
 accomplishment+0.165
aman+0.165
 grin+0.163
Neg logit contributions
 guns-0.068
 smoking-0.067
 disturbed-0.066
 driving-0.065
 stuck-0.064
Token the
Token ID262
Feature activation+1.29e+00

Pos logit contributions
 satisfaction+0.072
 accomplishment+0.072
 pleasure+0.071
 fascination+0.070
 delight+0.068
Neg logit contributions
 parked-0.033
 driving-0.032
 smoking-0.030
 stole-0.030
 disturbed-0.028
Token village
Token ID7404
Feature activation+2.43e+00

Pos logit contributions
 satisfaction+0.062
 fascination+0.061
 pleasure+0.060
 accomplishment+0.059
 delight+0.058
Neg logit contributions
 parked-0.048
 stole-0.047
 smoking-0.046
 disturbed-0.045
 driving-0.045
Token to
Token ID284
Feature activation+1.31e+00

Pos logit contributions
 satisfaction+0.132
 pleasure+0.128
 fascination+0.127
 accomplishment+0.125
 delight+0.122
Neg logit contributions
 parked-0.076
 driving-0.074
 stole-0.068
 stuck-0.068
 smoking-0.067
Token its
Token ID663
Feature activation+1.10e+00

Pos logit contributions
 satisfaction+0.071
 pleasure+0.071
 accomplishment+0.070
 fascination+0.069
 delight+0.067
Neg logit contributions
 driving-0.038
 smoking-0.038
 parked-0.037
 stole-0.033
 disturbed-0.033
Token former
Token ID1966
Feature activation+4.59e+00

Pos logit contributions
 satisfaction+0.045
 pleasure+0.044
 accomplishment+0.043
 fascination+0.042
 delight+0.041
Neg logit contributions
 parked-0.048
 driving-0.046
 smoking-0.045
 stole-0.044
 stuck-0.043
Token state
Token ID1181
Feature activation+2.06e+00

Pos logit contributions
 fascination+0.154
 pleasure+0.147
 satisfaction+0.140
oph+0.140
 goodness+0.136
Neg logit contributions
 driving-0.186
 parked-0.180
 guarded-0.179
 driven-0.176
 smoking-0.176
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
 satisfaction+0.123
 pleasure+0.121
 accomplishment+0.119
 fascination+0.118
 delight+0.116
Neg logit contributions
 parked-0.055
 driving-0.052
 smoking-0.050
 blocking-0.044
 disturbed-0.043
Token With
Token ID2080
Feature activation+1.99e+00

Pos logit contributions
 accomplishment+0.083
 satisfaction+0.081
 fascination+0.077
 decision+0.077
 pleasure+0.077
Neg logit contributions
 parked-0.041
 driving-0.039
 smoking-0.038
 stole-0.035
 CA-0.034
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 decision+0.107
 difference+0.107
 pleasure+0.106
 accomplishment+0.106
 satisfaction+0.105
Neg logit contributions
 parked-0.054
 driving-0.050
 smoking-0.049
 stuck-0.048
 knocked-0.046
Token help
Token ID1037
Feature activation+1.71e+00

Pos logit contributions
 fascination+0.082
 pleasure+0.082
 satisfaction+0.080
oph+0.079
aman+0.078
Neg logit contributions
 smoking-0.059
 hurt-0.057
 stole-0.057
 stuck-0.056
 disturbed-0.055
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
 satisfaction+0.056
 pleasure+0.055
 accomplishment+0.053
 fascination+0.052
 delight+0.051
Neg logit contributions
 parked-0.033
 driving-0.031
 smoking-0.030
 stole-0.029
 knocked-0.028
Token They
Token ID1119
Feature activation+1.23e+00

Pos logit contributions
 accomplishment+0.071
 satisfaction+0.071
 pleasure+0.068
 fascination+0.067
 decision+0.066
Neg logit contributions
 parked-0.056
 driving-0.053
 smoking-0.052
 knocked-0.051
 stole-0.050
Token all
Token ID477
Feature activation+1.23e+00

Pos logit contributions
 pleasure+0.067
 satisfaction+0.066
 accomplishment+0.066
 fascination+0.061
 difference+0.061
Neg logit contributions
 parked-0.043
 stole-0.038
 knocked-0.037
 driving-0.036
 broke-0.036
Token shared
Token ID4888
Feature activation+2.75e+00

Pos logit contributions
 pleasure+0.067
 accomplishment+0.066
 satisfaction+0.065
 fascination+0.062
 accomplishments+0.062
Neg logit contributions
 parked-0.042
 stole-0.038
 knocked-0.036
 broke-0.036
 driving-0.036
Token a
Token ID257
Feature activation+3.15e+00

Pos logit contributions
 pleasure+0.166
 accomplishment+0.163
 satisfaction+0.161
 fascination+0.155
 accomplishments+0.155
Neg logit contributions
 parked-0.058
 driving-0.054
 smoking-0.053
 on-0.052
 stole-0.052
Token big
Token ID1263
Feature activation+4.56e+00

Pos logit contributions
ishment+0.149
 pleasure+0.143
 satisfaction+0.142
 accomplishment+0.140
heart+0.139
Neg logit contributions
 parked-0.096
 driving-0.091
 CA-0.088
 stole-0.088
saw-0.088
Token piece
Token ID3704
Feature activation+2.43e+00

Pos logit contributions
ishment+0.203
 former+0.197
 gut+0.197
 surroundings+0.197
esty+0.193
Neg logit contributions
 popped-0.137
 driving-0.133
 hit-0.131
 broke-0.129
 parked-0.124
Token of
Token ID286
Feature activation+2.38e+00

Pos logit contributions
 satisfaction+0.161
 pleasure+0.160
 fascination+0.153
 accomplishment+0.152
 delight+0.148
Neg logit contributions
 parked-0.047
 driving-0.044
 smoking-0.040
 stuck-0.037
 blocking-0.036
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
aire+0.116
 pleasure+0.113
oph+0.111
 accomplishments+0.110
 fascination+0.110
Neg logit contributions
 parked-0.070
 driving-0.069
 smoking-0.068
 disturbed-0.062
 stole-0.062
Token enormous
Token ID9812
Feature activation+2.84e+00

Pos logit contributions
 satisfaction+0.079
 fascination+0.078
 accomplishment+0.077
 pleasure+0.077
 delight+0.074
Neg logit contributions
 parked-0.066
 driving-0.065
 smoking-0.064
 disturbed-0.061
 stole-0.060
Token cake
Token ID12187
Feature activation+2.09e+00

Pos logit contributions
 satisfaction+0.153
 pleasure+0.153
 fascination+0.152
 accomplishment+0.150
oph+0.148
Neg logit contributions
 driving-0.068
 smoking-0.063
 broke-0.061
 parked-0.059
 appeared-0.058
Token things
Token ID1243
Feature activation+2.50e+00

Pos logit contributions
 satisfaction+0.093
 pleasure+0.091
 accomplishment+0.086
 fascination+0.086
aire+0.082
Neg logit contributions
 driving-0.053
 parked-0.050
 smoking-0.050
 disturbed-0.047
 stuck-0.046
Token better
Token ID1365
Feature activation+2.08e+00

Pos logit contributions
 satisfaction+0.144
 fascination+0.139
 pleasure+0.138
 accomplishment+0.135
aman+0.131
Neg logit contributions
 driving-0.068
 parked-0.067
 hurt-0.065
 stuck-0.064
 smoking-0.062
Token.
Token ID13
Feature activation+1.97e+00

Pos logit contributions
 fascination+0.124
 pleasure+0.123
 satisfaction+0.123
 accomplishment+0.120
 delight+0.115
Neg logit contributions
 driving-0.051
 parked-0.050
 smoking-0.050
 hurt-0.048
 disturbed-0.047
Token She
Token ID1375
Feature activation+2.09e+00

Pos logit contributions
 accomplishment+0.111
 satisfaction+0.106
 decision+0.104
 fascination+0.103
 accomplishments+0.100
Neg logit contributions
 parked-0.052
 driving-0.048
 Wh-0.048
 CA-0.046
 smoking-0.045
Token felt
Token ID2936
Feature activation+2.01e+00

Pos logit contributions
 pleasure+0.126
 accomplishment+0.123
 satisfaction+0.123
 fascination+0.118
 accomplishments+0.116
Neg logit contributions
 parked-0.063
 stole-0.056
 knocked-0.054
 broke-0.053
 stuck-0.051
Token a
Token ID257
Feature activation+4.50e+00

Pos logit contributions
aman+0.100
 accomplishment+0.099
 fascination+0.099
oph+0.097
 accomplishments+0.096
Neg logit contributions
 parked-0.067
 disturbed-0.061
 stuck-0.061
 hurt-0.060
 driving-0.060
Token great
Token ID1049
Feature activation+2.73e+00

Pos logit contributions
 deeds+0.218
osa+0.213
ishment+0.210
 agrees+0.206
ior+0.206
Neg logit contributions
 hurt-0.146
 parked-0.129
 wrecked-0.127
 knocked-0.126
 broke-0.122
Token sense
Token ID2565
Feature activation+8.50e-01

Pos logit contributions
hew+0.122
 Deal+0.121
 fascination+0.117
ishment+0.113
heart+0.112
Neg logit contributions
 parked-0.095
 driving-0.093
 knocked-0.092
 hit-0.090
 hurt-0.090
Token of
Token ID286
Feature activation+1.73e+00

Pos logit contributions
 satisfaction+0.054
 pleasure+0.053
 accomplishment+0.052
 fascination+0.052
 delight+0.051
Neg logit contributions
 parked-0.020
 driving-0.019
 smoking-0.018
 stole-0.017
 disturbed-0.017
Token accomplishment
Token ID29340
Feature activation+1.19e+00

Pos logit contributions
oph+0.050
aire+0.048
aman+0.044
 pleasure+0.041
 fascination+0.041
Neg logit contributions
 driving-0.095
 parked-0.095
 blocking-0.088
 stuck-0.087
 hurt-0.087
Token for
Token ID329
Feature activation+1.40e+00

Pos logit contributions
 satisfaction+0.060
 pleasure+0.060
 fascination+0.057
 accomplishment+0.057
 delight+0.056
Neg logit contributions
 parked-0.045
 driving-0.043
 smoking-0.041
 stole-0.039
 knocked-0.039
Token the
Token ID262
Feature activation+5.02e-01

Pos logit contributions
 pleasure+0.052
 satisfaction+0.051
 accomplishment+0.050
 fascination+0.048
 difference+0.047
Neg logit contributions
 parked-0.025
 driving-0.024
 smoking-0.023
 knocked-0.021
 stole-0.021
Token sweater
Token ID35685
Feature activation+7.39e-01

Pos logit contributions
 pleasure+0.024
 satisfaction+0.023
 accomplishment+0.023
 fascination+0.022
 delight+0.022
Neg logit contributions
 parked-0.020
 driving-0.019
 smoking-0.019
 stole-0.018
 broke-0.018
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
 satisfaction+0.037
 pleasure+0.036
 accomplishment+0.035
 fascination+0.034
 delight+0.034
Neg logit contributions
 parked-0.029
 driving-0.027
 smoking-0.026
 stole-0.026
 broke-0.025
Token Alec
Token ID38285
Feature activation+2.17e+00

Pos logit contributions
 accomplishment+0.079
 satisfaction+0.077
 pleasure+0.075
 fascination+0.071
 accomplishments+0.071
Neg logit contributions
 parked-0.058
 driving-0.053
 smoking-0.051
 CA-0.049
 stole-0.049
Token felt
Token ID2936
Feature activation+2.18e+00

Pos logit contributions
 pleasure+0.135
 accomplishment+0.129
 satisfaction+0.129
 difference+0.121
 fascination+0.120
Neg logit contributions
 parked-0.072
 stole-0.060
 knocked-0.059
 broke-0.058
 disturbed-0.056
Token a
Token ID257
Feature activation+4.48e+00

Pos logit contributions
 accomplishment+0.111
 accomplishments+0.108
oph+0.108
aman+0.106
 pleasure+0.105
Neg logit contributions
 parked-0.076
 hurt-0.071
 disturbed-0.070
 stuck-0.068
 stole-0.065
Token warm
Token ID5814
Feature activation+3.57e+00

Pos logit contributions
 deeds+0.228
oph+0.215
ishment+0.213
osa+0.212
 accomplishments+0.211
Neg logit contributions
 hurt-0.150
 parked-0.136
 knocked-0.134
 broke-0.130
 disturbed-0.128
Token feeling
Token ID4203
Feature activation+1.34e+00

Pos logit contributions
oph+0.171
 pleasure+0.166
 satisfaction+0.166
 accomplishments+0.166
hew+0.164
Neg logit contributions
 parked-0.124
 hurt-0.121
 driving-0.118
 stuck-0.113
 hit-0.111
Token in
Token ID287
Feature activation+2.01e+00

Pos logit contributions
 pleasure+0.082
 satisfaction+0.080
 accomplishment+0.080
 fascination+0.079
 delight+0.076
Neg logit contributions
 parked-0.037
 driving-0.033
 disturbed-0.032
 smoking-0.031
 stole-0.031
Token his
Token ID465
Feature activation+2.02e+00

Pos logit contributions
oph+0.136
 pleasure+0.133
 accomplishment+0.129
 accomplishments+0.129
aire+0.128
Neg logit contributions
 driving-0.032
 parked-0.031
 smoking-0.027
 blocking-0.025
 hurt-0.025
Token heart
Token ID2612
Feature activation+1.73e+00

Pos logit contributions
oph+0.084
 satisfaction+0.080
 accomplishment+0.079
ya+0.077
 pleasure+0.076
Neg logit contributions
 hurt-0.080
 parked-0.078
 driving-0.077
 stuck-0.074
 broke-0.073
TokenM
Token ID44
Feature activation+2.16e+00

Pos logit contributions
 pleasure+0.102
 accomplishment+0.100
 satisfaction+0.094
 difference+0.090
 decision+0.089
Neg logit contributions
 parked-0.057
 disturbed-0.054
Ro-0.053
 knocked-0.051
 smoking-0.051
Tokenagg
Token ID9460
Feature activation+1.89e+00

Pos logit contributions
 pleasure+0.064
 satisfaction+0.062
 accomplishment+0.056
 fascination+0.056
 difference+0.055
Neg logit contributions
 knocked-0.108
 disturbed-0.106
 parked-0.105
 smoking-0.104
 broke-0.102
Tokenie
Token ID494
Feature activation+2.55e+00

Pos logit contributions
 pleasure+0.100
 satisfaction+0.093
 accomplishment+0.091
 difference+0.087
 fascination+0.085
Neg logit contributions
 parked-0.072
 knocked-0.066
 disturbed-0.065
 stole-0.062
 smoking-0.061
Token hugged
Token ID41130
Feature activation+2.49e+00

Pos logit contributions
 pleasure+0.145
 satisfaction+0.136
 accomplishment+0.135
 difference+0.130
 fascination+0.127
Neg logit contributions
 parked-0.090
 knocked-0.083
 broke-0.080
 stole-0.078
 stuck-0.077
Token Mom
Token ID11254
Feature activation+3.43e+00

Pos logit contributions
 pleasure+0.174
 difference+0.165
 satisfaction+0.165
 impact+0.160
 bond+0.158
Neg logit contributions
 smoking-0.036
 knocked-0.035
 parked-0.032
 to-0.032
 on-0.031
Tokenmy
Token ID1820
Feature activation+4.45e+00

Pos logit contributions
 accomplishment+0.245
 satisfaction+0.239
 pleasure+0.234
 accomplishments+0.229
 delight+0.223
Neg logit contributions
o-0.070
 parked-0.065
 smoking-0.051
 on-0.049
 driving-0.048
Token and
Token ID290
Feature activation+2.11e+00

Pos logit contributions
 pleasure+0.257
 satisfaction+0.252
agg+0.251
 accomplishment+0.243
 fascination+0.243
Neg logit contributions
 on-0.124
 parked-0.118
 to-0.109
 stuck-0.101
 driving-0.101
Token said
Token ID531
Feature activation+1.72e+00

Pos logit contributions
 pleasure+0.124
 difference+0.122
 accomplishments+0.119
 accomplishment+0.119
 decision+0.119
Neg logit contributions
 parked-0.063
 knocked-0.061
 broke-0.054
 stole-0.053
 driving-0.051
Token,
Token ID11
Feature activation+1.75e+00

Pos logit contributions
 satisfaction+0.081
 pleasure+0.079
 fascination+0.079
 accomplishment+0.076
 difference+0.074
Neg logit contributions
 driving-0.063
 broke-0.062
 smoking-0.061
 parked-0.060
 knocked-0.059
Token "
Token ID366
Feature activation+2.12e+00

Pos logit contributions
 satisfaction+0.119
 pleasure+0.117
 accomplishment+0.116
 fascination+0.113
 difference+0.113
Neg logit contributions
 parked-0.037
 driving-0.035
 smoking-0.031
 knocked-0.029
 blocking-0.026
TokenI
Token ID40
Feature activation+2.36e+00

Pos logit contributions
 accomplishment+0.125
 decision+0.120
 difference+0.117
 pleasure+0.112
 accomplishments+0.112
Neg logit contributions
 parked-0.060
 disturbed-0.050
 smoking-0.049
Ro-0.047
I-0.047
Token drank
Token ID24070
Feature activation+2.16e+00

Pos logit contributions
 pleasure+0.156
 satisfaction+0.144
 accomplishment+0.144
 fascination+0.137
 difference+0.136
Neg logit contributions
 parked-0.081
 broke-0.073
 knocked-0.073
 stole-0.071
 stuck-0.066
Token the
Token ID262
Feature activation+1.44e+00

Pos logit contributions
 pleasure+0.144
 accomplishments+0.136
 accomplishment+0.136
 fascination+0.135
 satisfaction+0.135
Neg logit contributions
 smoking-0.044
 broke-0.044
 hurt-0.042
 stuck-0.040
 parked-0.039
Token soup
Token ID17141
Feature activation+3.24e+00

Pos logit contributions
 pleasure+0.084
 satisfaction+0.083
 accomplishment+0.082
 fascination+0.080
 delight+0.078
Neg logit contributions
 smoking-0.040
 disturbed-0.037
 parked-0.037
 broke-0.036
 hurt-0.035
Token and
Token ID290
Feature activation+1.23e+00

Pos logit contributions
 pleasure+0.174
 satisfaction+0.167
 fascination+0.167
 accomplishment+0.161
 amaz+0.156
Neg logit contributions
 broke-0.105
 parked-0.105
 stuck-0.103
 on-0.103
 smoking-0.102
Token felt
Token ID2936
Feature activation+1.58e+00

Pos logit contributions
 pleasure+0.071
 satisfaction+0.067
 accomplishment+0.067
 fascination+0.065
 delight+0.063
Neg logit contributions
 parked-0.039
 stole-0.036
 smoking-0.035
 broke-0.035
 knocked-0.035
Token a
Token ID257
Feature activation+4.42e+00

Pos logit contributions
 accomplishment+0.073
aman+0.073
 pleasure+0.072
 fascination+0.071
oph+0.070
Neg logit contributions
 parked-0.057
 hurt-0.055
 disturbed-0.055
 smoking-0.054
 stuck-0.053
Token little
Token ID1310
Feature activation+2.26e+00

Pos logit contributions
aman+0.217
 deeds+0.212
ya+0.211
annah+0.209
iper+0.208
Neg logit contributions
 hurt-0.154
 broke-0.137
 disturbed-0.129
 wrecked-0.123
 knocked-0.123
Token better
Token ID1365
Feature activation+8.38e-01

Pos logit contributions
aman+0.119
ya+0.114
 accomplishment+0.114
oph+0.113
 accomplishments+0.112
Neg logit contributions
 hurt-0.077
 stuck-0.072
 broke-0.071
 disturbed-0.069
 stole-0.064
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
 pleasure+0.047
 satisfaction+0.045
 accomplishment+0.045
 fascination+0.044
 delight+0.043
Neg logit contributions
 parked-0.026
 smoking-0.025
 driving-0.024
 stole-0.023
 disturbed-0.023
Token She
Token ID1375
Feature activation+1.33e+00

Pos logit contributions
 accomplishment+0.085
 satisfaction+0.084
 fascination+0.081
 pleasure+0.080
 decision+0.080
Neg logit contributions
 parked-0.038
 broke-0.035
 driving-0.034
 smoking-0.034
 stole-0.034
Token sn
Token ID3013
Feature activation+1.27e+00

Pos logit contributions
 pleasure+0.077
 accomplishment+0.074
 satisfaction+0.073
 fascination+0.071
 accomplishments+0.069
Neg logit contributions
 parked-0.044
 stole-0.040
 knocked-0.040
 broke-0.039
 disturbed-0.038
Token
Token ID198
Feature activation+1.67e+00

Pos logit contributions
 pleasure+0.065
 satisfaction+0.065
 accomplishment+0.064
 fascination+0.063
 delight+0.060
Neg logit contributions
 parked-0.032
 stole-0.027
 smoking-0.027
 driving-0.026
 broke-0.025
TokenS
Token ID50
Feature activation+1.86e+00

Pos logit contributions
 pleasure+0.096
 accomplishment+0.091
 fascination+0.091
 satisfaction+0.088
 amaz+0.084
Neg logit contributions
 parked-0.053
 stole-0.047
saw-0.044
 broke-0.043
 disturbed-0.042
Tokenoph
Token ID2522
Feature activation+2.26e+00

Pos logit contributions
 pleasure+0.048
 fascination+0.046
 satisfaction+0.044
 accomplishment+0.043
 delight+0.041
Neg logit contributions
 stole-0.112
 parked-0.111
 broke-0.107
 disturbed-0.105
 CA-0.103
Tokenie
Token ID494
Feature activation+2.40e+00

Pos logit contributions
 pleasure+0.135
 fascination+0.121
 satisfaction+0.120
 accomplishment+0.120
 amaz+0.115
Neg logit contributions
 parked-0.069
 stole-0.063
 broke-0.062
 disturbed-0.059
 called-0.059
Token felt
Token ID2936
Feature activation+1.83e+00

Pos logit contributions
 pleasure+0.135
 satisfaction+0.124
 accomplishment+0.123
 fascination+0.120
 accomplishments+0.117
Neg logit contributions
 parked-0.086
 stole-0.079
 broke-0.078
 knocked-0.078
 stuck-0.076
Token a
Token ID257
Feature activation+4.40e+00

Pos logit contributions
 pleasure+0.095
 accomplishment+0.095
aman+0.092
 accomplishments+0.092
oph+0.091
Neg logit contributions
 parked-0.058
 hurt-0.058
 disturbed-0.057
 stuck-0.056
 stole-0.055
Token bit
Token ID1643
Feature activation+2.01e+00

Pos logit contributions
 pleasure+0.234
aman+0.232
ishment+0.227
osa+0.227
oph+0.225
Neg logit contributions
 hurt-0.143
 broke-0.123
 knocked-0.115
 disturbed-0.114
 hit-0.110
Token bitter
Token ID12922
Feature activation+2.27e+00

Pos logit contributions
 pleasure+0.110
 accomplishment+0.106
 satisfaction+0.105
 fascination+0.105
 accomplishments+0.102
Neg logit contributions
 hurt-0.062
 disturbed-0.061
 stuck-0.061
 broke-0.057
 stole-0.057
Token,
Token ID11
Feature activation+3.00e+00

Pos logit contributions
 satisfaction+0.120
 fascination+0.120
 pleasure+0.118
 accomplishment+0.114
 delight+0.110
Neg logit contributions
 smoking-0.072
 disturbed-0.070
 driving-0.070
 broke-0.069
 parked-0.068
Token but
Token ID475
Feature activation+2.08e+00

Pos logit contributions
aire+0.147
 pleasure+0.147
 accomplishments+0.142
 satisfaction+0.142
 accomplishment+0.140
Neg logit contributions
 parked-0.107
 smoking-0.102
 driving-0.102
 stuck-0.099
 blocking-0.096
Token was
Token ID373
Feature activation+7.84e-01

Pos logit contributions
 pleasure+0.135
 satisfaction+0.130
 accomplishment+0.130
 accomplishments+0.130
 difference+0.128
Neg logit contributions
 parked-0.040
 stuck-0.038
 disturbed-0.034
 stole-0.033
 broke-0.033
Token walk
Token ID2513
Feature activation+1.44e+00

Pos logit contributions
 satisfaction+0.063
 pleasure+0.061
 fascination+0.058
 delight+0.057
 accomplishment+0.056
Neg logit contributions
 driving-0.051
 parked-0.050
 smoking-0.047
 knocked-0.046
 stole-0.046
Token.
Token ID13
Feature activation+4.45e-01

Pos logit contributions
 satisfaction+0.084
 pleasure+0.083
 accomplishment+0.081
 fascination+0.078
 delight+0.077
Neg logit contributions
 parked-0.042
 driving-0.041
 smoking-0.037
 stuck-0.036
 disturbed-0.036
Token She
Token ID1375
Feature activation+1.33e+00

Pos logit contributions
 satisfaction+0.021
 accomplishment+0.021
 pleasure+0.021
 delight+0.020
 fascination+0.020
Neg logit contributions
 parked-0.017
 driving-0.016
 smoking-0.016
 stole-0.016
 knocked-0.015
Token still
Token ID991
Feature activation+1.40e+00

Pos logit contributions
 pleasure+0.079
 satisfaction+0.079
 accomplishment+0.077
 delight+0.072
 fascination+0.072
Neg logit contributions
 parked-0.043
 knocked-0.037
 driving-0.037
 stole-0.037
 broke-0.034
Token felt
Token ID2936
Feature activation+1.90e+00

Pos logit contributions
 accomplishment+0.078
 satisfaction+0.077
 pleasure+0.077
 delight+0.073
 fascination+0.072
Neg logit contributions
 parked-0.046
 stole-0.042
 driving-0.042
 knocked-0.041
 disturbed-0.040
Token a
Token ID257
Feature activation+4.40e+00

Pos logit contributions
 accomplishment+0.098
oph+0.097
 satisfaction+0.095
 pleasure+0.095
 fascination+0.095
Neg logit contributions
 parked-0.063
 disturbed-0.061
 hurt-0.060
 driving-0.059
 stuck-0.059
Token bit
Token ID1643
Feature activation+1.89e+00

Pos logit contributions
 deeds+0.256
ently+0.244
oph+0.243
 agrees+0.243
hew+0.238
Neg logit contributions
 hurt-0.122
 knocked-0.101
 broke-0.097
 wrecked-0.096
 stuck-0.095
Token w
Token ID266
Feature activation+2.05e+00

Pos logit contributions
 satisfaction+0.109
 fascination+0.108
 pleasure+0.107
 accomplishment+0.107
oph+0.103
Neg logit contributions
 disturbed-0.052
 hurt-0.051
 stuck-0.049
 smoking-0.046
 broke-0.046
Tokenobb
Token ID21963
Feature activation+2.06e+00

Pos logit contributions
 fascination+0.105
 amaz+0.104
 pleasure+0.102
 satisfaction+0.099
 delight+0.096
Neg logit contributions
 disturbed-0.065
 smoking-0.062
 bitten-0.062
wheel-0.062
 hurt-0.062
Tokenly
Token ID306
Feature activation+1.67e+00

Pos logit contributions
 fascination+0.103
 pleasure+0.102
 satisfaction+0.100
 amaz+0.099
 delight+0.096
Neg logit contributions
 parked-0.079
 stole-0.078
 broke-0.077
 driving-0.076
 disturbed-0.076
Token,
Token ID11
Feature activation+2.86e+00

Pos logit contributions
 pleasure+0.076
 satisfaction+0.075
 accomplishment+0.072
 fascination+0.069
 delight+0.067
Neg logit contributions
 parked-0.069
 driving-0.067
 smoking-0.065
 stole-0.064
 blocking-0.063
Token on
Token ID319
Feature activation+5.22e-01

Pos logit contributions
 satisfaction+0.003
 pleasure+0.003
 accomplishment+0.003
 delight+0.003
 fascination+0.003
Neg logit contributions
 parked-0.004
 driving-0.004
 smoking-0.004
 stole-0.004
 disturbed-0.004
Token it
Token ID340
Feature activation-4.45e-01

Pos logit contributions
 satisfaction+0.029
 pleasure+0.029
 accomplishment+0.028
 fascination+0.027
 delight+0.027
Neg logit contributions
 parked-0.016
 driving-0.015
 smoking-0.015
 stole-0.014
 knocked-0.014
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
 parked+0.016
 stole+0.015
 smoking+0.015
 driving+0.015
 disturbed+0.014
Neg logit contributions
 pleasure-0.022
 satisfaction-0.022
 accomplishment-0.022
 delight-0.021
 fascination-0.021
Token Sally
Token ID25737
Feature activation+1.51e+00

Pos logit contributions
 satisfaction+0.005
 pleasure+0.005
 accomplishment+0.005
 delight+0.005
 fascination+0.005
Neg logit contributions
 parked-0.005
 driving-0.004
 smoking-0.004
 stole-0.004
 knocked-0.004
Token felt
Token ID2936
Feature activation+1.32e+00

Pos logit contributions
 pleasure+0.091
 satisfaction+0.091
 accomplishment+0.089
 fascination+0.083
 delight+0.083
Neg logit contributions
 parked-0.048
 knocked-0.043
 stole-0.042
 broke-0.041
 driving-0.039
Token a
Token ID257
Feature activation+4.38e+00

Pos logit contributions
 accomplishment+0.060
 pleasure+0.059
 satisfaction+0.058
 fascination+0.058
aman+0.057
Neg logit contributions
 parked-0.051
 disturbed-0.048
 stole-0.047
 hurt-0.046
 driving-0.046
Token little
Token ID1310
Feature activation+1.94e+00

Pos logit contributions
 deeds+0.227
aman+0.219
heart+0.217
ior+0.216
oph+0.210
Neg logit contributions
 hurt-0.137
 broke-0.124
 knocked-0.120
 hit-0.113
 disturbed-0.110
Token better
Token ID1365
Feature activation+1.69e+00

Pos logit contributions
aman+0.100
 accomplishment+0.097
oph+0.096
 accomplishments+0.096
 satisfaction+0.095
Neg logit contributions
 stuck-0.064
 hurt-0.064
 broke-0.060
 disturbed-0.060
 stole-0.057
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
 pleasure+0.090
 satisfaction+0.087
 fascination+0.085
 accomplishment+0.083
 delight+0.082
Neg logit contributions
 parked-0.056
 driving-0.052
 stuck-0.052
 hurt-0.052
 knocked-0.051
Token but
Token ID475
Feature activation+9.03e-01

Pos logit contributions
 satisfaction+0.056
 pleasure+0.053
 accomplishment+0.053
 fascination+0.052
 delight+0.051
Neg logit contributions
 parked-0.055
 driving-0.052
 smoking-0.051
 stuck-0.049
 stole-0.049
Token she
Token ID673
Feature activation+9.78e-01

Pos logit contributions
 pleasure+0.051
 satisfaction+0.050
 accomplishment+0.049
 fascination+0.048
 delight+0.047
Neg logit contributions
 parked-0.029
 stole-0.025
 smoking-0.025
 driving-0.025
 stuck-0.024
Token calendar
Token ID11845
Feature activation+2.08e-01

Pos logit contributions
 satisfaction+0.103
 pleasure+0.097
 accomplishment+0.096
 delight+0.095
 fascination+0.094
Neg logit contributions
 knocked-0.069
 driving-0.068
 parked-0.068
 broke-0.066
 smoking-0.066
Token soon
Token ID2582
Feature activation+3.97e-01

Pos logit contributions
 satisfaction+0.011
 pleasure+0.010
 accomplishment+0.010
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.008
 driving-0.007
 smoking-0.007
 stole-0.007
 disturbed-0.007
Token."
Token ID526
Feature activation+1.21e+00

Pos logit contributions
 satisfaction+0.022
 pleasure+0.022
 accomplishment+0.022
 fascination+0.021
 delight+0.021
Neg logit contributions
 parked-0.012
 driving-0.011
 smoking-0.011
 stole-0.011
 knocked-0.010
Token Amy
Token ID14235
Feature activation+3.18e+00

Pos logit contributions
 accomplishment+0.068
 satisfaction+0.068
 pleasure+0.067
 fascination+0.064
 decision+0.063
Neg logit contributions
 parked-0.033
 smoking-0.033
 broke-0.031
 driving-0.031
 stole-0.031
Token felt
Token ID2936
Feature activation+1.84e+00

Pos logit contributions
 pleasure+0.187
 accomplishment+0.176
 accomplishments+0.171
 satisfaction+0.170
 difference+0.167
Neg logit contributions
 parked-0.097
 broke-0.094
 knocked-0.092
 hurt-0.086
 rang-0.086
Token a
Token ID257
Feature activation+4.36e+00

Pos logit contributions
 accomplishment+0.094
aman+0.093
 pleasure+0.092
 accomplishments+0.091
 fascination+0.090
Neg logit contributions
 parked-0.059
 hurt-0.057
 disturbed-0.056
 stuck-0.056
 stole-0.055
Token little
Token ID1310
Feature activation+1.91e+00

Pos logit contributions
 deeds+0.222
osa+0.221
aman+0.219
ishment+0.213
heart+0.213
Neg logit contributions
 hurt-0.137
 broke-0.131
 knocked-0.122
 wrecked-0.118
 hit-0.115
Token better
Token ID1365
Feature activation+1.96e+00

Pos logit contributions
 accomplishment+0.094
aman+0.092
 accomplishments+0.091
 fascination+0.091
 satisfaction+0.090
Neg logit contributions
 stuck-0.064
 hurt-0.064
 broke-0.063
 disturbed-0.062
 stole-0.062
Token after
Token ID706
Feature activation-1.43e-01

Pos logit contributions
 pleasure+0.094
 fascination+0.088
 satisfaction+0.088
 accomplishment+0.085
agg+0.081
Neg logit contributions
 parked-0.074
 hurt-0.073
 broke-0.073
 stuck-0.072
 knocked-0.072
Token hearing
Token ID4854
Feature activation+1.03e+00

Pos logit contributions
 parked+0.005
 stole+0.005
 smoking+0.005
 driving+0.005
 broke+0.005
Neg logit contributions
 satisfaction-0.007
 accomplishment-0.007
 pleasure-0.007
 delight-0.007
 fascination-0.007
Token these
Token ID777
Feature activation+1.86e+00

Pos logit contributions
 pleasure+0.055
 satisfaction+0.054
 accomplishment+0.052
 fascination+0.051
 delight+0.050
Neg logit contributions
 parked-0.034
 smoking-0.033
 driving-0.032
 stole-0.031
 disturbed-0.031
Token was
Token ID373
Feature activation-4.19e-01

Pos logit contributions
 parked+0.026
 stole+0.024
 driving+0.024
 smoking+0.024
 blocking+0.022
Neg logit contributions
 satisfaction-0.038
 pleasure-0.038
 accomplishment-0.037
 delight-0.037
 fascination-0.035
Token shining
Token ID22751
Feature activation+1.03e+00

Pos logit contributions
 parked+0.014
 stole+0.013
 smoking+0.013
 driving+0.012
 disturbed+0.012
Neg logit contributions
 pleasure-0.022
 satisfaction-0.022
 accomplishment-0.022
 delight-0.021
 fascination-0.021
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
 satisfaction+0.059
 pleasure+0.058
 accomplishment+0.058
 fascination+0.055
 delight+0.054
Neg logit contributions
 parked-0.031
 driving-0.029
 smoking-0.029
 disturbed-0.028
 broke-0.027
Token Lucy
Token ID22162
Feature activation+1.35e+00

Pos logit contributions
 parked+0.008
 smoking+0.007
 stole+0.007
 driving+0.007
 disturbed+0.007
Neg logit contributions
 pleasure-0.009
 satisfaction-0.009
 accomplishment-0.009
 delight-0.009
 fascination-0.008
Token felt
Token ID2936
Feature activation+1.57e+00

Pos logit contributions
 satisfaction+0.075
 accomplishment+0.075
 pleasure+0.075
 delight+0.068
 fascination+0.068
Neg logit contributions
 parked-0.048
 knocked-0.044
 stole-0.043
 broke-0.041
 driving-0.041
Token a
Token ID257
Feature activation+4.35e+00

Pos logit contributions
 accomplishment+0.083
 pleasure+0.080
 satisfaction+0.080
 accomplishments+0.078
oph+0.077
Neg logit contributions
 parked-0.052
 disturbed-0.050
 hurt-0.048
 stuck-0.047
 smoking-0.046
Token bit
Token ID1643
Feature activation+1.61e+00

Pos logit contributions
 deeds+0.238
oph+0.230
 accomplishment+0.223
aman+0.223
hew+0.223
Neg logit contributions
 hurt-0.132
 knocked-0.117
 disturbed-0.109
 broke-0.106
 stuck-0.103
Token scared
Token ID12008
Feature activation+1.22e+00

Pos logit contributions
 satisfaction+0.097
 accomplishment+0.097
 pleasure+0.096
 fascination+0.093
 accomplishments+0.092
Neg logit contributions
 disturbed-0.041
 stuck-0.038
 hurt-0.038
 knocked-0.036
 smoking-0.036
Token but
Token ID475
Feature activation+8.00e-01

Pos logit contributions
 satisfaction+0.059
 pleasure+0.058
 accomplishment+0.058
 fascination+0.055
 accomplishments+0.052
Neg logit contributions
 smoking-0.048
 parked-0.048
 driving-0.047
 disturbed-0.046
 blocking-0.045
Token with
Token ID351
Feature activation+1.41e+00

Pos logit contributions
 pleasure+0.041
 satisfaction+0.041
 accomplishment+0.040
 fascination+0.038
 difference+0.038
Neg logit contributions
 parked-0.029
 driving-0.026
 smoking-0.026
 stole-0.026
 knocked-0.026
Token Jake
Token ID14757
Feature activation+8.51e-01

Pos logit contributions
 satisfaction+0.072
 pleasure+0.071
 accomplishment+0.069
 difference+0.067
aire+0.067
Neg logit contributions
 parked-0.047
 driving-0.043
 smoking-0.043
 knocked-0.042
 stuck-0.042
TokenM
Token ID44
Feature activation+1.95e+00

Pos logit contributions
 pleasure+0.053
 accomplishment+0.051
 satisfaction+0.050
 decision+0.046
 delight+0.046
Neg logit contributions
 parked-0.044
 smoking-0.041
 driving-0.040
 disturbed-0.038
 stole-0.038
Tokenum
Token ID388
Feature activation+1.42e+00

Pos logit contributions
 pleasure+0.077
 satisfaction+0.076
 delight+0.072
 fascination+0.070
 accomplishment+0.066
Neg logit contributions
 parked-0.081
 smoking-0.081
 driving-0.078
anes-0.078
 knocked-0.074
Token and
Token ID290
Feature activation+1.92e+00

Pos logit contributions
 pleasure+0.064
 satisfaction+0.063
 accomplishment+0.061
 fascination+0.058
 delight+0.056
Neg logit contributions
 parked-0.066
 driving-0.059
 stole-0.058
 smoking-0.057
 knocked-0.057
Token dad
Token ID9955
Feature activation+1.54e+00

Pos logit contributions
 difference+0.094
 pleasure+0.093
 accomplishment+0.091
 satisfaction+0.090
 decision+0.090
Neg logit contributions
 parked-0.070
 knocked-0.061
 driving-0.060
 smoking-0.059
 called-0.059
Token hugged
Token ID41130
Feature activation+3.38e+00

Pos logit contributions
 pleasure+0.084
 satisfaction+0.084
 accomplishment+0.083
 fascination+0.079
 accomplishments+0.077
Neg logit contributions
 parked-0.056
 stole-0.048
 driving-0.048
 knocked-0.046
 broke-0.045
Token Maria
Token ID14200
Feature activation+4.34e+00

Pos logit contributions
 pleasure+0.194
 satisfaction+0.185
 accomplishment+0.184
 decision+0.182
 difference+0.181
Neg logit contributions
 to-0.075
 appeared-0.074
 on-0.071
 called-0.069
 came-0.067
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
 satisfaction+0.214
 fascination+0.208
 accomplishment+0.207
 pleasure+0.205
 accomplishments+0.200
Neg logit contributions
 parked-0.144
 driving-0.134
 on-0.126
 stuck-0.126
o-0.122
Token they
Token ID484
Feature activation+1.20e+00

Pos logit contributions
 pleasure+0.093
 satisfaction+0.091
 accomplishment+0.090
 fascination+0.088
 accomplishments+0.087
Neg logit contributions
 parked-0.039
 knocked-0.034
 driving-0.034
 stole-0.033
 broke-0.032
Token all
Token ID477
Feature activation+4.20e-01

Pos logit contributions
 pleasure+0.064
 accomplishment+0.063
 satisfaction+0.063
 fascination+0.060
 delight+0.059
Neg logit contributions
 parked-0.042
 stole-0.037
 driving-0.037
 broke-0.035
 knocked-0.035
Token went
Token ID1816
Feature activation-4.01e-01

Pos logit contributions
 pleasure+0.022
 satisfaction+0.022
 accomplishment+0.022
 fascination+0.021
 delight+0.020
Neg logit contributions
 parked-0.015
 stole-0.013
 driving-0.013
 smoking-0.013
 knocked-0.013
Token home
Token ID1363
Feature activation+1.01e+00

Pos logit contributions
 parked+0.014
 stole+0.013
 smoking+0.013
 driving+0.013
 disturbed+0.013
Neg logit contributions
 satisfaction-0.021
 pleasure-0.020
 accomplishment-0.020
 delight-0.020
 fascination-0.019
Token her
Token ID607
Feature activation+1.69e+00

Pos logit contributions
 pleasure+0.071
 satisfaction+0.067
 fascination+0.065
aire+0.065
oph+0.063
Neg logit contributions
 smoking-0.041
 driving-0.040
 parked-0.039
 blocking-0.038
 guarding-0.035
Token jam
Token ID16853
Feature activation+1.36e+00

Pos logit contributions
 satisfaction+0.079
 pleasure+0.078
 fascination+0.077
aman+0.071
 delight+0.071
Neg logit contributions
 smoking-0.061
 knocked-0.059
 broke-0.055
 driving-0.055
 disturbed-0.055
Token.
Token ID13
Feature activation+1.65e+00

Pos logit contributions
 pleasure+0.076
 satisfaction+0.075
 fascination+0.072
 accomplishment+0.072
 delight+0.069
Neg logit contributions
 parked-0.043
 smoking-0.040
 driving-0.040
 stole-0.037
 disturbed-0.037
Token She
Token ID1375
Feature activation+1.17e+00

Pos logit contributions
 accomplishment+0.095
 satisfaction+0.092
 pleasure+0.088
 decision+0.087
 fascination+0.086
Neg logit contributions
 parked-0.042
 smoking-0.039
 knocked-0.037
 driving-0.037
 broke-0.036
Token felt
Token ID2936
Feature activation+1.69e+00

Pos logit contributions
 pleasure+0.067
 satisfaction+0.066
 accomplishment+0.065
 fascination+0.063
 difference+0.062
Neg logit contributions
 parked-0.037
 stole-0.033
 knocked-0.032
 broke-0.031
 driving-0.031
Token a
Token ID257
Feature activation+4.34e+00

Pos logit contributions
 accomplishment+0.085
aman+0.083
 pleasure+0.083
 fascination+0.083
 accomplishments+0.081
Neg logit contributions
 parked-0.055
 disturbed-0.053
 stuck-0.050
 smoking-0.050
 stole-0.050
Token great
Token ID1049
Feature activation+2.67e+00

Pos logit contributions
 deeds+0.221
ishment+0.212
 agrees+0.207
ior+0.206
hew+0.204
Neg logit contributions
 hurt-0.132
 knocked-0.125
 parked-0.115
 broke-0.114
 disturbed-0.114
Token sense
Token ID2565
Feature activation+9.43e-01

Pos logit contributions
hew+0.127
 fascination+0.123
 pleasure+0.119
 Deal+0.118
aman+0.116
Neg logit contributions
 knocked-0.084
 hurt-0.080
 broke-0.080
 disturbed-0.079
 smoking-0.079
Token of
Token ID286
Feature activation+1.98e+00

Pos logit contributions
 pleasure+0.060
 satisfaction+0.060
 accomplishment+0.058
 fascination+0.058
 delight+0.057
Neg logit contributions
 parked-0.022
 driving-0.020
 smoking-0.020
 disturbed-0.018
 stole-0.018
Token accomplishment
Token ID29340
Feature activation+1.12e+00

Pos logit contributions
aire+0.054
oph+0.051
aman+0.048
hew+0.044
 pleasure+0.042
Neg logit contributions
 parked-0.109
 driving-0.108
 smoking-0.105
 blocking-0.104
 hurt-0.104
Token so
Token ID523
Feature activation+1.80e+00

Pos logit contributions
 pleasure+0.059
 satisfaction+0.058
 fascination+0.056
 accomplishment+0.055
 delight+0.054
Neg logit contributions
 parked-0.040
 driving-0.038
 smoking-0.037
 stole-0.035
 knocked-0.035
Token happy
Token ID3772
Feature activation+1.38e+00

Pos logit contributions
 pleasure+0.058
 satisfaction+0.058
 accomplishment+0.058
 accomplishments+0.054
 fascination+0.054
Neg logit contributions
 parked-0.055
 disturbed-0.053
 hurt-0.052
 knocked-0.051
 broke-0.051
Token!
Token ID0
Feature activation+2.07e+00

Pos logit contributions
 satisfaction+0.082
 pleasure+0.081
 fascination+0.077
 accomplishment+0.076
 delight+0.074
Neg logit contributions
 driving-0.039
 parked-0.039
 smoking-0.038
 disturbed-0.035
 blocking-0.035
Token She
Token ID1375
Feature activation+1.72e+00

Pos logit contributions
 accomplishment+0.119
 satisfaction+0.119
 fascination+0.115
 decision+0.111
 pleasure+0.111
Neg logit contributions
 parked-0.049
 driving-0.045
 smoking-0.044
 stole-0.043
 knocked-0.043
Token hugged
Token ID41130
Feature activation+3.16e+00

Pos logit contributions
 pleasure+0.101
 accomplishment+0.098
 satisfaction+0.098
 difference+0.096
 fascination+0.095
Neg logit contributions
 parked-0.056
 stole-0.049
 knocked-0.049
 broke-0.047
 stuck-0.045
Token her
Token ID607
Feature activation+2.68e+00

Pos logit contributions
 pleasure+0.220
hew+0.219
 decision+0.217
 difference+0.214
 satisfaction+0.211
Neg logit contributions
 to-0.039
 on-0.034
 in-0.027
 smoking-0.025
 coming-0.024
Token mum
Token ID25682
Feature activation+4.33e+00

Pos logit contributions
aman+0.122
hew+0.120
 satisfaction+0.118
 fascination+0.117
 pleasures+0.117
Neg logit contributions
 stuck-0.089
 hurt-0.084
 broke-0.079
 parked-0.077
 knocked-0.077
Token and
Token ID290
Feature activation+1.76e+00

Pos logit contributions
 pleasure+0.245
 satisfaction+0.237
 fascination+0.233
 accomplishments+0.231
 accomplishment+0.231
Neg logit contributions
 parked-0.140
 on-0.121
 called-0.116
 driving-0.114
 stuck-0.111
Token together
Token ID1978
Feature activation+1.30e+00

Pos logit contributions
 pleasure+0.109
 satisfaction+0.103
 accomplishment+0.102
 accomplishments+0.100
 difference+0.100
Neg logit contributions
 parked-0.050
 knocked-0.045
 stole-0.043
 broke-0.042
 stuck-0.041
Token they
Token ID484
Feature activation+7.92e-01

Pos logit contributions
 pleasure+0.080
 satisfaction+0.079
 accomplishment+0.075
 fascination+0.074
 difference+0.073
Neg logit contributions
 parked-0.035
 knocked-0.032
 driving-0.032
 broke-0.030
 stuck-0.030
Token went
Token ID1816
Feature activation-3.22e-01

Pos logit contributions
 pleasure+0.047
 satisfaction+0.047
 accomplishment+0.046
 delight+0.044
 fascination+0.044
Neg logit contributions
 parked-0.024
 stole-0.021
 driving-0.020
 knocked-0.020
 broke-0.020
Token to
Token ID284
Feature activation+5.55e-01

Pos logit contributions
 parked+0.010
 stole+0.009
 smoking+0.009
 driving+0.009
 disturbed+0.009
Neg logit contributions
 satisfaction-0.018
 pleasure-0.018
 accomplishment-0.018
 delight-0.017
 fascination-0.016
Token it
Token ID340
Feature activation+2.10e+00

Pos logit contributions
 satisfaction+0.125
 pleasure+0.125
 accomplishment+0.121
 fascination+0.118
 delight+0.117
Neg logit contributions
 smoking-0.047
 parked-0.044
 driving-0.041
 stuck-0.038
 hurt-0.037
Token.
Token ID13
Feature activation+1.69e+00

Pos logit contributions
 satisfaction+0.124
 fascination+0.121
 pleasure+0.120
 accomplishment+0.116
 amaz+0.115
Neg logit contributions
 parked-0.055
 driving-0.051
 smoking-0.050
 stuck-0.050
 broke-0.048
Token After
Token ID2293
Feature activation-1.21e-01

Pos logit contributions
 accomplishment+0.092
 satisfaction+0.092
 fascination+0.087
 pleasure+0.085
 decision+0.085
Neg logit contributions
 parked-0.051
 driving-0.046
 Wh-0.044
 CA-0.044
 smoking-0.043
Token they
Token ID484
Feature activation+9.68e-01

Pos logit contributions
 parked+0.004
 stole+0.004
 driving+0.004
 smoking+0.004
 disturbed+0.003
Neg logit contributions
 satisfaction-0.007
 pleasure-0.007
 accomplishment-0.007
 delight-0.006
 fascination-0.006
Token finished
Token ID5201
Feature activation+1.50e+00

Pos logit contributions
 pleasure+0.052
 accomplishment+0.051
 satisfaction+0.050
 fascination+0.049
 accomplishments+0.047
Neg logit contributions
 parked-0.035
 stole-0.031
 knocked-0.031
 broke-0.030
 disturbed-0.029
Token cleaning
Token ID12724
Feature activation+4.33e+00

Pos logit contributions
 satisfaction+0.092
 pleasure+0.092
 fascination+0.085
 accomplishment+0.083
 delight+0.083
Neg logit contributions
 smoking-0.044
 driving-0.044
 parked-0.039
 knocked-0.038
 broke-0.034
Token the
Token ID262
Feature activation+7.74e-01

Pos logit contributions
 satisfaction+0.274
 pleasure+0.259
 Deal+0.256
ited+0.253
 fascination+0.251
Neg logit contributions
 on-0.095
 smoking-0.094
 broke-0.085
 cars-0.084
 driving-0.081
Token shell
Token ID7582
Feature activation+2.47e+00

Pos logit contributions
 satisfaction+0.038
 pleasure+0.038
 accomplishment+0.037
 fascination+0.036
 delight+0.035
Neg logit contributions
 parked-0.029
 stole-0.027
 smoking-0.027
 driving-0.026
 broke-0.026
Token,
Token ID11
Feature activation+8.62e-01

Pos logit contributions
 satisfaction+0.124
 pleasure+0.117
 fascination+0.117
 accomplishment+0.111
 delight+0.108
Neg logit contributions
 broke-0.093
 parked-0.093
 driving-0.088
 smoking-0.088
 stuck-0.087
Token they
Token ID484
Feature activation+3.24e-01

Pos logit contributions
 satisfaction+0.055
 pleasure+0.055
 accomplishment+0.054
 fascination+0.053
 delight+0.052
Neg logit contributions
 parked-0.019
 smoking-0.018
 driving-0.018
 stole-0.016
 knocked-0.016
Token celebrated
Token ID13943
Feature activation+2.35e+00

Pos logit contributions
 pleasure+0.018
 satisfaction+0.018
 accomplishment+0.018
 delight+0.017
 fascination+0.017
Neg logit contributions
 parked-0.010
 stole-0.009
 driving-0.009
 smoking-0.009
 knocked-0.009
Token sweet
Token ID6029
Feature activation+3.00e+00

Pos logit contributions
 satisfaction+0.102
 fascination+0.100
 pleasure+0.097
 amaz+0.097
 disbelief+0.096
Neg logit contributions
 driving-0.075
 parked-0.074
 smoking-0.071
 hurt-0.069
 disturbed-0.068
Token treat
Token ID2190
Feature activation+1.63e+00

Pos logit contributions
 pleasure+0.157
 satisfaction+0.154
 fascination+0.153
 Deal+0.148
 accomplishment+0.146
Neg logit contributions
 smoking-0.089
 driving-0.083
 disturbed-0.080
 hurt-0.080
 parked-0.079
Token and
Token ID290
Feature activation+2.06e+00

Pos logit contributions
 satisfaction+0.091
 pleasure+0.089
 fascination+0.086
 accomplishment+0.086
 delight+0.084
Neg logit contributions
 parked-0.051
 driving-0.047
 smoking-0.046
 broke-0.045
 stuck-0.044
Token shared
Token ID4888
Feature activation+2.63e+00

Pos logit contributions
 pleasure+0.108
 satisfaction+0.105
 fascination+0.105
 accomplishments+0.105
 accomplishment+0.104
Neg logit contributions
 parked-0.063
 smoking-0.058
 knocked-0.057
 driving-0.057
 stole-0.054
Token a
Token ID257
Feature activation+3.11e+00

Pos logit contributions
 pleasure+0.162
 accomplishment+0.160
 fascination+0.157
 satisfaction+0.154
 decision+0.153
Neg logit contributions
 on-0.048
 smoking-0.045
 parked-0.044
 driving-0.042
hes-0.041
Token big
Token ID1263
Feature activation+4.32e+00

Pos logit contributions
ishment+0.150
 deeds+0.148
 fascination+0.145
 pleasure+0.144
 satisfaction+0.143
Neg logit contributions
 parked-0.095
 driving-0.089
 hurt-0.086
 broke-0.086
 stole-0.084
Token,
Token ID11
Feature activation+2.08e+00

Pos logit contributions
 surroundings+0.200
 deeds+0.200
ishment+0.198
ently+0.196
iper+0.191
Neg logit contributions
 driving-0.121
 hit-0.120
 parked-0.119
 popped-0.117
 v-0.116
Token happy
Token ID3772
Feature activation+1.96e+00

Pos logit contributions
 satisfaction+0.086
 fascination+0.085
 accomplishment+0.085
 pleasure+0.084
 deeds+0.082
Neg logit contributions
 parked-0.089
 driving-0.084
 smoking-0.082
 stuck-0.076
 stole-0.075
Token hug
Token ID16225
Feature activation+1.26e+00

Pos logit contributions
 fascination+0.085
 pleasure+0.085
 satisfaction+0.084
 Deal+0.081
 accomplishment+0.079
Neg logit contributions
 driving-0.076
 parked-0.070
 smoking-0.069
 knocked-0.065
 disturbed-0.063
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
 satisfaction+0.070
 pleasure+0.070
 accomplishment+0.069
 fascination+0.067
 delight+0.066
Neg logit contributions
 parked-0.039
 driving-0.036
 smoking-0.036
 stole-0.033
 broke-0.033
Token<|endoftext|>
Token ID50256
Feature activation-9.71e-01

Pos logit contributions
 accomplishment+0.085
 satisfaction+0.085
 fascination+0.082
 pleasure+0.082
 decision+0.080
Neg logit contributions
 parked-0.047
 driving-0.043
 smoking-0.041
 knocked-0.040
 broke-0.039
Token of
Token ID286
Feature activation+1.14e+00

Pos logit contributions
 satisfaction+0.040
 accomplishment+0.039
 pleasure+0.039
 fascination+0.039
 delight+0.038
Neg logit contributions
 parked-0.015
 driving-0.014
 smoking-0.013
 stole-0.013
 disturbed-0.012
Token you
Token ID345
Feature activation+1.57e+00

Pos logit contributions
 satisfaction+0.068
 pleasure+0.067
 accomplishment+0.067
 fascination+0.066
 delight+0.063
Neg logit contributions
 driving-0.029
 parked-0.028
 smoking-0.027
 blocking-0.025
 stole-0.024
Token!"
Token ID2474
Feature activation+1.20e+00

Pos logit contributions
 satisfaction+0.085
 fascination+0.084
 accomplishment+0.084
 pleasure+0.083
 amaz+0.082
Neg logit contributions
 parked-0.051
 driving-0.048
 smoking-0.045
 stole-0.043
 broke-0.041
Token Ruth
Token ID22132
Feature activation+2.47e+00

Pos logit contributions
 accomplishment+0.065
 pleasure+0.065
 satisfaction+0.064
 fascination+0.061
 accomplishments+0.060
Neg logit contributions
 parked-0.039
 driving-0.036
 smoking-0.035
 stole-0.034
 knocked-0.033
Token felt
Token ID2936
Feature activation+1.76e+00

Pos logit contributions
 pleasure+0.144
 accomplishment+0.137
 satisfaction+0.135
 fascination+0.130
 difference+0.129
Neg logit contributions
 parked-0.080
 knocked-0.072
 broke-0.071
 stole-0.069
 called-0.066
Token a
Token ID257
Feature activation+4.32e+00

Pos logit contributions
 accomplishment+0.086
aman+0.086
 fascination+0.083
 pleasure+0.083
 accomplishments+0.083
Neg logit contributions
 parked-0.060
 disturbed-0.056
 hurt-0.056
 stole-0.055
 stuck-0.055
Token special
Token ID2041
Feature activation+2.19e+00

Pos logit contributions
ishment+0.225
 deeds+0.225
osa+0.222
aman+0.219
hew+0.217
Neg logit contributions
 hurt-0.130
 broke-0.114
 knocked-0.113
 hit-0.105
 wrecked-0.103
Token feeling
Token ID4203
Feature activation+1.56e+00

Pos logit contributions
hew+0.080
 satisfaction+0.080
oph+0.077
 pleasure+0.077
 fascination+0.076
Neg logit contributions
 parked-0.093
 driving-0.092
 broke-0.092
 knocked-0.091
 stuck-0.090
Token in
Token ID287
Feature activation+2.05e+00

Pos logit contributions
 fascination+0.093
 pleasure+0.093
 accomplishment+0.091
 satisfaction+0.089
 amaz+0.087
Neg logit contributions
 parked-0.043
 disturbed-0.041
 broke-0.039
 driving-0.039
 stole-0.039
Token her
Token ID607
Feature activation+2.15e+00

Pos logit contributions
hew+0.127
 pleasure+0.120
ouring+0.116
aire+0.115
ealing+0.114
Neg logit contributions
 driving-0.042
 parked-0.040
 smoking-0.040
 blocking-0.039
 guarding-0.037
Token heart
Token ID2612
Feature activation+1.74e+00

Pos logit contributions
oph+0.091
heart+0.082
aman+0.078
ya+0.078
 satisfaction+0.076
Neg logit contributions
 smoking-0.085
 driving-0.084
 hurt-0.081
 broke-0.080
 parked-0.080
Token yourself
Token ID3511
Feature activation+1.71e+00

Pos logit contributions
 accomplishment+0.144
oph+0.141
 pleasure+0.141
 satisfaction+0.140
 fascination+0.138
Neg logit contributions
 smoking-0.078
 parked-0.077
 driving-0.077
 blocking-0.073
 guarding-0.069
Token.
Token ID13
Feature activation+2.01e+00

Pos logit contributions
 satisfaction+0.104
 accomplishment+0.101
 pleasure+0.100
 fascination+0.098
 delight+0.096
Neg logit contributions
 parked-0.045
 driving-0.042
 smoking-0.041
 stole-0.039
 disturbed-0.037
Token<|endoftext|>
Token ID50256
Feature activation-6.52e-01

Pos logit contributions
 accomplishment+0.121
 satisfaction+0.116
 decision+0.111
 accomplishments+0.110
 pleasure+0.110
Neg logit contributions
 parked-0.050
 smoking-0.044
 driving-0.044
 stole-0.041
 CA-0.040
TokenOnce
Token ID7454
Feature activation+1.22e+00

Pos logit contributions
 parked+0.023
 smoking+0.022
 driving+0.022
 stole+0.022
 knocked+0.021
Neg logit contributions
 pleasure-0.033
 satisfaction-0.032
 accomplishment-0.031
 delight-0.031
 fascination-0.030
Token upon
Token ID2402
Feature activation-1.57e-01

Pos logit contributions
 satisfaction+0.063
 pleasure+0.059
 accomplishment+0.058
 delight+0.057
 fascination+0.056
Neg logit contributions
 parked-0.045
 driving-0.041
 smoking-0.041
 stole-0.041
 knocked-0.041
Token a
Token ID257
Feature activation+4.32e+00

Pos logit contributions
 parked+0.004
 smoking+0.004
 stole+0.004
 driving+0.004
 knocked+0.004
Neg logit contributions
 satisfaction-0.009
 pleasure-0.009
 accomplishment-0.009
 delight-0.009
 fascination-0.009
Token time
Token ID640
Feature activation+8.80e-01

Pos logit contributions
 satisfaction+0.260
 accomplishment+0.253
 amaz+0.243
 delight+0.242
 deeds+0.241
Neg logit contributions
 knocked-0.082
o-0.080
 disturbed-0.080
 named-0.077
 driving-0.076
Token,
Token ID11
Feature activation-3.28e-01

Pos logit contributions
 satisfaction+0.038
 accomplishment+0.036
 pleasure+0.036
 delight+0.034
 fascination+0.034
Neg logit contributions
 parked-0.040
 driving-0.039
 smoking-0.038
 stole-0.038
 knocked-0.037
Tokenthere
Token ID8117
Feature activation+5.72e-01

Pos logit contributions
 parked+0.007
 smoking+0.006
 stole+0.006
 driving+0.006
 knocked+0.005
Neg logit contributions
 pleasure-0.021
 satisfaction-0.021
 accomplishment-0.021
 delight-0.020
 fascination-0.020
Token lived
Token ID5615
Feature activation-1.28e+00

Pos logit contributions
 satisfaction+0.033
 pleasure+0.033
 accomplishment+0.033
 fascination+0.031
 delight+0.031
Neg logit contributions
 parked-0.017
 stole-0.016
 driving-0.015
 smoking-0.015
 broke-0.015
Token a
Token ID257
Feature activation+7.53e-01

Pos logit contributions
 stole+0.048
 parked+0.046
 smoking+0.044
 knocked+0.044
 driving+0.043
Neg logit contributions
 pleasure-0.061
 satisfaction-0.061
 delight-0.059
 accomplishment-0.056
 fascination-0.056
Token saw
Token ID2497
Feature activation-2.15e+00

Pos logit contributions
 driving+0.055
anes+0.055
 parked+0.054
 smoking+0.053
 Enter+0.053
Neg logit contributions
 satisfaction-0.062
 delight-0.060
 accomplishment-0.058
 pleasure-0.058
 fascination-0.057
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
cher+0.097
ider+0.095
 stole+0.092
 parked+0.091
 guarding+0.090
Neg logit contributions
 satisfaction-0.074
 delight-0.070
 pleasure-0.066
 accomplishment-0.064
 relief-0.064
Token big
Token ID1263
Feature activation+2.30e-01

Pos logit contributions
 parked+0.049
 driving+0.045
 stole+0.044
 smoking+0.044
 driven+0.043
Neg logit contributions
 pleasure-0.050
 satisfaction-0.049
 accomplishment-0.049
 delight-0.046
 difference-0.046
Token,
Token ID11
Feature activation-9.38e-01

Pos logit contributions
 satisfaction+0.012
 pleasure+0.012
 accomplishment+0.011
 delight+0.011
 fascination+0.011
Neg logit contributions
 parked-0.008
 smoking-0.008
 driving-0.008
 stole-0.008
 disturbed-0.007
Token red
Token ID2266
Feature activation-1.38e+00

Pos logit contributions
 parked+0.034
 driving+0.032
 stole+0.032
 smoking+0.031
 knocked+0.031
Neg logit contributions
 satisfaction-0.045
 pleasure-0.044
 accomplishment-0.044
 delight-0.042
 fascination-0.042
Token thing
Token ID1517
Feature activation-5.25e+00

Pos logit contributions
 parked+0.049
 stole+0.046
 driving+0.046
 disturbed+0.046
 smoking+0.045
Neg logit contributions
 satisfaction-0.066
 accomplishment-0.064
 pleasure-0.064
 delight-0.063
 fascination-0.062
Token on
Token ID319
Feature activation-1.34e+00

Pos logit contributions
 Indians+0.261
Lost+0.236
cher+0.234
chard+0.231
 villains+0.227
Neg logit contributions
 and-0.123
 goodness-0.114
 accomplishment-0.102
 until-0.098
 moments-0.097
Token the
Token ID262
Feature activation-5.67e-01

Pos logit contributions
 stole+0.057
cher+0.057
 parked+0.055
 smoking+0.055
 driving+0.054
Neg logit contributions
 satisfaction-0.055
 delight-0.053
 accomplishment-0.052
 pleasure-0.050
 fascination-0.049
Token ground
Token ID2323
Feature activation-3.31e+00

Pos logit contributions
 parked+0.024
 stole+0.024
 smoking+0.023
 driving+0.023
 disturbed+0.023
Neg logit contributions
 satisfaction-0.024
 pleasure-0.023
 accomplishment-0.023
 delight-0.022
 fascination-0.022
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
 Found+0.188
cher+0.187
 Indians+0.186
Ro+0.182
 villains+0.180
Neg logit contributions
 accomplishment-0.057
 until-0.057
 decision-0.056
ness-0.053
 and-0.052
Token He
Token ID679
Feature activation-1.35e+00

Pos logit contributions
 parked+0.062
 driving+0.060
cher+0.060
 smoking+0.059
 stole+0.059
Neg logit contributions
 pleasure-0.052
 satisfaction-0.050
 delight-0.050
 accomplishment-0.047
 fascination-0.045
Token mom
Token ID1995
Feature activation-8.33e-01

Pos logit contributions
 parked+0.082
 stole+0.079
 smoking+0.079
 blocking+0.077
 disturbed+0.076
Neg logit contributions
 pleasure-0.048
 accomplishment-0.044
 delight-0.043
 satisfaction-0.041
 fascination-0.037
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
 parked+0.039
 stole+0.039
 CA+0.039
 disturbed+0.038
 killed+0.038
Neg logit contributions
 pleasure-0.028
 satisfaction-0.028
 accomplishment-0.026
 delight-0.026
 difference-0.025
Token saw
Token ID2497
Feature activation-2.33e+00

Pos logit contributions
through+0.046
 smoking+0.046
 parked+0.046
angs+0.046
 blocking+0.045
Neg logit contributions
 satisfaction-0.039
 pleasure-0.039
 accomplishment-0.038
 delight-0.037
 fascination-0.036
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
ider+0.115
cher+0.112
 stole+0.109
 parked+0.106
 guarding+0.105
Neg logit contributions
 delight-0.070
 satisfaction-0.069
 pleasure-0.068
 amaz-0.063
 fascination-0.060
Token terrible
Token ID7818
Feature activation-1.83e-01

Pos logit contributions
 parked+0.044
 driving+0.041
 smoking+0.041
 stole+0.040
 disturbed+0.038
Neg logit contributions
 pleasure-0.047
 satisfaction-0.046
 accomplishment-0.045
 delight-0.044
 fascination-0.043
Token thing
Token ID1517
Feature activation-5.25e+00

Pos logit contributions
 parked+0.006
 driving+0.005
 smoking+0.005
 stole+0.005
 disturbed+0.005
Neg logit contributions
 satisfaction-0.010
 pleasure-0.010
 accomplishment-0.010
 delight-0.009
 fascination-0.009
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 Indians+0.246
Lost+0.241
Ro+0.239
destruct+0.234
Wh+0.233
Neg logit contributions
 and-0.127
 until-0.105
 accomplishment-0.105
,-0.098
 goodness-0.097
Token A
Token ID317
Feature activation-1.12e+00

Pos logit contributions
 parked+0.050
 driving+0.048
cher+0.048
 stole+0.047
 smoking+0.047
Neg logit contributions
 pleasure-0.046
 delight-0.044
 satisfaction-0.043
 fascination-0.041
 accomplishment-0.040
Token man
Token ID582
Feature activation-2.35e+00

Pos logit contributions
 parked+0.050
 driving+0.046
 smoking+0.045
 stole+0.044
 blocking+0.043
Neg logit contributions
 pleasure-0.046
 satisfaction-0.046
 accomplishment-0.045
 delight-0.043
 difference-0.041
Token had
Token ID550
Feature activation-2.96e+00

Pos logit contributions
angs+0.080
 Bang+0.078
anes+0.078
Ro+0.077
through+0.074
Neg logit contributions
 satisfaction-0.092
 fascination-0.091
 goodness-0.090
 accomplishment-0.090
 pleasure-0.088
Token a
Token ID257
Feature activation-5.86e-01

Pos logit contributions
glers+0.124
saw+0.122
Ro+0.117
chard+0.116
aro+0.115
Neg logit contributions
 satisfaction-0.106
 delight-0.102
 pleasure-0.097
 joy-0.092
 enthusiasm-0.087
Token saw
Token ID2497
Feature activation-9.88e-01

Pos logit contributions
 driving+0.059
 parked+0.057
 smoking+0.056
 blocking+0.056
anes+0.054
Neg logit contributions
 delight-0.061
 satisfaction-0.060
 pleasure-0.060
 fascination-0.059
oph-0.058
Token a
Token ID257
Feature activation-4.63e-01

Pos logit contributions
 parked+0.036
 stole+0.035
 driving+0.033
 smoking+0.032
 knocked+0.031
Neg logit contributions
 pleasure-0.048
 satisfaction-0.048
 delight-0.047
 accomplishment-0.046
 fascination-0.046
Token strange
Token ID6283
Feature activation-3.91e-01

Pos logit contributions
 parked+0.018
 driving+0.017
 smoking+0.017
 stole+0.017
 knocked+0.016
Neg logit contributions
 pleasure-0.022
 satisfaction-0.022
 accomplishment-0.021
 delight-0.020
 fascination-0.020
Token and
Token ID290
Feature activation-1.95e+00

Pos logit contributions
 parked+0.015
 driving+0.014
 stole+0.013
 smoking+0.013
 knocked+0.013
Neg logit contributions
 satisfaction-0.019
 pleasure-0.019
 accomplishment-0.019
 delight-0.018
 fascination-0.018
Token unknown
Token ID6439
Feature activation-1.89e+00

Pos logit contributions
 parked+0.078
 driving+0.078
anes+0.078
 smoking+0.073
cher+0.072
Neg logit contributions
 satisfaction-0.080
 accomplishment-0.079
 fascination-0.077
 delight-0.076
 pleasure-0.075
Token thing
Token ID1517
Feature activation-5.01e+00

Pos logit contributions
 parked+0.085
 stole+0.080
hes+0.077
 driven+0.076
 driving+0.076
Neg logit contributions
 pleasure-0.072
 satisfaction-0.072
 difference-0.071
 accomplishment-0.070
 decision-0.070
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
Ro+0.232
 Indians+0.228
Spe+0.225
 parked+0.221
Wh+0.220
Neg logit contributions
 accomplishment-0.114
 delight-0.110
 moments-0.106
 darling-0.106
 joy-0.105
Token She
Token ID1375
Feature activation-9.28e-01

Pos logit contributions
 satisfaction+0.011
 pleasure+0.011
 accomplishment+0.011
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.009
 smoking-0.008
 driving-0.008
 stole-0.008
 knocked-0.008
Token asked
Token ID1965
Feature activation+9.68e-01

Pos logit contributions
 smoking+0.032
 driving+0.032
 parked+0.032
 stole+0.030
 blocking+0.029
Neg logit contributions
 satisfaction-0.044
 delight-0.043
 fascination-0.042
 pleasure-0.042
 accomplishment-0.042
Token her
Token ID607
Feature activation-1.65e-01

Pos logit contributions
 satisfaction+0.053
 pleasure+0.053
 accomplishment+0.052
 fascination+0.049
 delight+0.049
Neg logit contributions
 parked-0.031
 driving-0.030
 smoking-0.029
 knocked-0.028
 stole-0.028
Token mum
Token ID25682
Feature activation+7.84e-01

Pos logit contributions
 parked+0.008
 stole+0.007
 driving+0.007
 smoking+0.007
 disturbed+0.007
Neg logit contributions
 pleasure-0.007
 satisfaction-0.007
 accomplishment-0.007
 delight-0.006
 fascination-0.006
Token One
Token ID1881
Feature activation-4.79e-01

Pos logit contributions
 satisfaction+0.007
 pleasure+0.007
 accomplishment+0.006
 delight+0.006
 fascination+0.006
Neg logit contributions
 parked-0.005
 driving-0.004
 smoking-0.004
 stole-0.004
 knocked-0.004
Token day
Token ID1110
Feature activation-8.43e-01

Pos logit contributions
 parked+0.023
 smoking+0.023
 stole+0.023
 driving+0.023
 blocking+0.022
Neg logit contributions
 satisfaction-0.018
 pleasure-0.017
 accomplishment-0.017
 delight-0.016
 fascination-0.015
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
 parked+0.049
 stole+0.047
 smoking+0.046
 driving+0.045
through+0.045
Neg logit contributions
 satisfaction-0.022
 pleasure-0.021
 accomplishment-0.021
 delight-0.020
 fascination-0.018
Token very
Token ID845
Feature activation-4.90e-01

Pos logit contributions
 parked+0.060
 smoking+0.056
 driving+0.056
 stole+0.055
 blocking+0.054
Neg logit contributions
 pleasure-0.037
 satisfaction-0.036
 accomplishment-0.034
 delight-0.032
 difference-0.030
Token unusual
Token ID8468
Feature activation-8.86e-01

Pos logit contributions
 parked+0.022
 smoking+0.021
 driving+0.021
 stole+0.020
 blocking+0.019
Neg logit contributions
 accomplishment-0.021
 satisfaction-0.021
 pleasure-0.021
 delight-0.019
 fascination-0.019
Token thing
Token ID1517
Feature activation-5.01e+00

Pos logit contributions
 parked+0.038
 smoking+0.035
 stole+0.035
 driving+0.035
 knocked+0.033
Neg logit contributions
 satisfaction-0.039
 pleasure-0.039
 accomplishment-0.039
 delight-0.036
 fascination-0.035
Token happened
Token ID3022
Feature activation-1.49e+00

Pos logit contributions
chard+0.186
through+0.185
anes+0.182
Wh+0.180
cher+0.174
Neg logit contributions
 hope-0.149
 moments-0.148
 goodness-0.141
 accomplishment-0.140
 and-0.138
Token.
Token ID13
Feature activation-9.06e-01

Pos logit contributions
 parked+0.061
 stole+0.059
saw+0.056
Ro+0.055
through+0.054
Neg logit contributions
 accomplishment-0.063
 pleasure-0.061
 satisfaction-0.058
 fascination-0.057
 accomplishments-0.056
Token He
Token ID679
Feature activation-1.60e+00

Pos logit contributions
 parked+0.039
 stole+0.038
 smoking+0.037
 driving+0.037
 knocked+0.036
Neg logit contributions
 pleasure-0.039
 satisfaction-0.038
 delight-0.036
 accomplishment-0.035
 fascination-0.034
Token pulled
Token ID5954
Feature activation+1.00e+00

Pos logit contributions
 smoking+0.060
 driving+0.058
anes+0.056
angs+0.054
 blocking+0.054
Neg logit contributions
 satisfaction-0.066
 delight-0.065
 fascination-0.065
 pleasure-0.063
 accomplishment-0.063
Token on
Token ID319
Feature activation+9.68e-01

Pos logit contributions
 pleasure+0.061
 satisfaction+0.061
 accomplishment+0.059
oph+0.057
 fascination+0.056
Neg logit contributions
 parked-0.027
 driving-0.026
 smoking-0.024
 stuck-0.024
 broke-0.023
TokenOne
Token ID3198
Feature activation-1.09e-01

Pos logit contributions
 smoking+0.050
 parked+0.050
 driving+0.049
 stole+0.049
 broke+0.047
Neg logit contributions
 satisfaction-0.046
 pleasure-0.044
 delight-0.043
 fascination-0.042
 accomplishment-0.042
Token day
Token ID1110
Feature activation-7.92e-01

Pos logit contributions
 parked+0.005
 driving+0.005
 stole+0.005
 smoking+0.005
 knocked+0.004
Neg logit contributions
 satisfaction-0.005
 pleasure-0.004
 accomplishment-0.004
 delight-0.004
 fascination-0.004
Token,
Token ID11
Feature activation-6.85e-01

Pos logit contributions
 parked+0.049
 stole+0.046
 smoking+0.046
 driving+0.045
 blocking+0.044
Neg logit contributions
 pleasure-0.020
 satisfaction-0.020
 delight-0.018
 accomplishment-0.018
 fascination-0.016
Token a
Token ID257
Feature activation-6.25e-01

Pos logit contributions
 parked+0.024
 smoking+0.022
 driving+0.022
 stole+0.022
 blocking+0.020
Neg logit contributions
 pleasure-0.036
 satisfaction-0.035
 accomplishment-0.034
 delight-0.034
 fascination-0.033
Token strange
Token ID6283
Feature activation-5.55e-01

Pos logit contributions
 parked+0.032
 driving+0.030
 stole+0.030
 smoking+0.030
 blocking+0.028
Neg logit contributions
 pleasure-0.023
 satisfaction-0.023
 accomplishment-0.022
 delight-0.020
 fascination-0.020
Token thing
Token ID1517
Feature activation-4.96e+00

Pos logit contributions
 parked+0.022
 driving+0.020
 stole+0.020
 smoking+0.020
 blocking+0.019
Neg logit contributions
 satisfaction-0.026
 pleasure-0.026
 accomplishment-0.025
 delight-0.024
 fascination-0.024
Token happened
Token ID3022
Feature activation-1.56e+00

Pos logit contributions
Wh+0.178
chard+0.173
 Indians+0.171
Spe+0.167
Lost+0.163
Neg logit contributions
 moments-0.163
 accomplishment-0.160
 satisfaction-0.151
 goodness-0.151
 hope-0.150
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
 parked+0.064
 stole+0.063
saw+0.060
Ro+0.059
through+0.058
Neg logit contributions
 accomplishment-0.064
 pleasure-0.062
 satisfaction-0.061
 fascination-0.059
 accomplishments-0.057
Token A
Token ID317
Feature activation-9.09e-01

Pos logit contributions
 parked+0.061
 driving+0.058
 stole+0.057
 smoking+0.056
cher+0.054
Neg logit contributions
 pleasure-0.065
 satisfaction-0.064
 delight-0.063
 fascination-0.059
 accomplishment-0.058
Token small
Token ID1402
Feature activation+4.08e-02

Pos logit contributions
 parked+0.041
 driving+0.038
 stole+0.037
 smoking+0.037
 blocking+0.035
Neg logit contributions
 pleasure-0.038
 satisfaction-0.038
 accomplishment-0.037
 delight-0.036
 fascination-0.034
Token boy
Token ID2933
Feature activation-2.39e+00

Pos logit contributions
 satisfaction+0.002
 pleasure+0.002
 accomplishment+0.002
 delight+0.002
 fascination+0.002
Neg logit contributions
 parked-0.001
 driving-0.001
 smoking-0.001
 stole-0.001
 knocked-0.001
Token a
Token ID257
Feature activation-7.48e-01

Pos logit contributions
 stole+0.049
 parked+0.047
through+0.045
cher+0.044
wheel+0.044
Neg logit contributions
 pleasure-0.052
 satisfaction-0.052
 delight-0.049
 amaz-0.049
 fascination-0.049
Token man
Token ID582
Feature activation-2.94e+00

Pos logit contributions
 parked+0.030
 driving+0.027
 smoking+0.027
 stole+0.027
 disturbed+0.026
Neg logit contributions
 pleasure-0.034
 satisfaction-0.034
 accomplishment-0.034
 delight-0.032
 fascination-0.031
Token with
Token ID351
Feature activation-1.39e+00

Pos logit contributions
Ro+0.107
 CA+0.100
 Bang+0.099
 broke+0.097
 Crash+0.096
Neg logit contributions
 fascination-0.117
 amaz-0.115
 accomplishment-0.115
 delight-0.113
 satisfaction-0.113
Token a
Token ID257
Feature activation-8.25e-01

Pos logit contributions
 parked+0.050
 stole+0.047
saw+0.045
 smoking+0.045
 disturbed+0.043
Neg logit contributions
 satisfaction-0.070
 pleasure-0.070
 accomplishment-0.068
 fascination-0.068
 delight-0.067
Token real
Token ID1103
Feature activation-8.13e-01

Pos logit contributions
 parked+0.030
 driving+0.027
 stole+0.027
 smoking+0.026
 disturbed+0.025
Neg logit contributions
 pleasure-0.042
 satisfaction-0.041
 accomplishment-0.040
 fascination-0.039
 delight-0.039
Token gun
Token ID2485
Feature activation-4.95e+00

Pos logit contributions
 parked+0.022
 smoking+0.019
 stole+0.019
 driving+0.018
through+0.018
Neg logit contributions
 pleasure-0.049
 satisfaction-0.049
 accomplishment-0.048
 delight-0.046
 fascination-0.046
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
through+0.258
saw+0.256
chard+0.253
Ro+0.249
Free+0.243
Neg logit contributions
 and-0.105
 until-0.076
 thanks-0.073
 hope-0.071
ness-0.071
Token He
Token ID679
Feature activation-7.51e-01

Pos logit contributions
 parked+0.050
 driving+0.048
 stole+0.046
 smoking+0.046
 knocked+0.045
Neg logit contributions
 pleasure-0.047
 satisfaction-0.046
 delight-0.046
 fascination-0.043
 amaz-0.042
Token is
Token ID318
Feature activation-2.62e+00

Pos logit contributions
 parked+0.033
 smoking+0.032
 driving+0.031
 stole+0.030
 disturbed+0.030
Neg logit contributions
 satisfaction-0.030
 pleasure-0.029
 accomplishment-0.029
 fascination-0.029
 delight-0.028
Token angry
Token ID7954
Feature activation-1.32e+00

Pos logit contributions
 stole+0.101
anes+0.100
 CA+0.099
Ro+0.099
saw+0.095
Neg logit contributions
 pleasure-0.103
 accomplishment-0.100
 delight-0.097
 satisfaction-0.096
 fascination-0.095
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
 stole+0.071
 parked+0.069
chard+0.067
 CA+0.065
 knocked+0.065
Neg logit contributions
 accomplishment-0.036
 satisfaction-0.036
 pleasure-0.036
 delight-0.034
 goodness-0.033
Token,
Token ID11
Feature activation-9.87e-01

Pos logit contributions
 parked+0.033
 smoking+0.032
 driving+0.031
 stole+0.031
 blocking+0.030
Neg logit contributions
 pleasure-0.016
 satisfaction-0.016
 accomplishment-0.015
 delight-0.015
 fascination-0.013
Token they
Token ID484
Feature activation-2.48e-01

Pos logit contributions
 parked+0.034
 smoking+0.033
 driving+0.032
 stole+0.031
 guarding+0.031
Neg logit contributions
 pleasure-0.050
 satisfaction-0.050
 accomplishment-0.049
 delight-0.048
 fascination-0.045
Token see
Token ID766
Feature activation-8.18e-01

Pos logit contributions
 parked+0.010
 smoking+0.009
 driving+0.009
 stole+0.009
 disturbed+0.009
Neg logit contributions
 satisfaction-0.012
 pleasure-0.011
 accomplishment-0.011
 delight-0.011
 fascination-0.011
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
 parked+0.030
 stole+0.029
 driving+0.028
 smoking+0.027
through+0.026
Neg logit contributions
 satisfaction-0.040
 pleasure-0.040
 delight-0.038
 accomplishment-0.037
 amaz-0.037
Token new
Token ID649
Feature activation-3.52e-01

Pos logit contributions
 parked+0.052
 driving+0.048
 stole+0.047
 smoking+0.047
 driven+0.045
Neg logit contributions
 pleasure-0.054
 accomplishment-0.053
 satisfaction-0.052
 delight-0.050
 difference-0.049
Token thing
Token ID1517
Feature activation-4.91e+00

Pos logit contributions
 parked+0.012
 smoking+0.011
 stole+0.011
 driving+0.011
 disturbed+0.010
Neg logit contributions
 pleasure-0.019
 satisfaction-0.019
 accomplishment-0.019
 delight-0.018
 fascination-0.018
Token in
Token ID287
Feature activation-8.79e-01

Pos logit contributions
 Indians+0.222
Lost+0.219
Wh+0.212
 stole+0.211
 Stri+0.209
Neg logit contributions
 accomplishment-0.132
 goodness-0.116
 and-0.116
 until-0.109
 delight-0.107
Token the
Token ID262
Feature activation-6.19e-01

Pos logit contributions
 parked+0.036
 stole+0.035
 driving+0.035
 smoking+0.034
 disturbed+0.033
Neg logit contributions
 satisfaction-0.040
 pleasure-0.039
 accomplishment-0.038
 delight-0.038
 fascination-0.037
Token park
Token ID3952
Feature activation-2.53e+00

Pos logit contributions
 parked+0.027
 driving+0.026
 smoking+0.026
 stole+0.026
 knocked+0.025
Neg logit contributions
 satisfaction-0.026
 pleasure-0.026
 delight-0.025
 accomplishment-0.024
 difference-0.024
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
through+0.125
Ro+0.123
cher+0.123
 smoking+0.119
 driving+0.118
Neg logit contributions
 decision-0.073
 satisfaction-0.072
 delight-0.071
 accomplishment-0.071
 amaz-0.066
Token It
Token ID632
Feature activation-1.63e+00

Pos logit contributions
 driving+0.084
 parked+0.083
 smoking+0.081
cher+0.080
 guarding+0.080
Neg logit contributions
 delight-0.060
 pleasure-0.060
 satisfaction-0.059
 accomplishment-0.052
 amaz-0.050
Token day
Token ID1110
Feature activation-7.79e-01

Pos logit contributions
 parked+0.014
 smoking+0.014
 stole+0.014
 driving+0.014
 knocked+0.013
Neg logit contributions
 satisfaction-0.012
 pleasure-0.012
 accomplishment-0.011
 delight-0.011
 fascination-0.010
Token,
Token ID11
Feature activation-9.90e-01

Pos logit contributions
 parked+0.047
 stole+0.045
 smoking+0.044
 driving+0.044
 blocking+0.042
Neg logit contributions
 pleasure-0.020
 satisfaction-0.020
 accomplishment-0.019
 delight-0.018
 fascination-0.016
Token a
Token ID257
Feature activation-7.23e-01

Pos logit contributions
 parked+0.037
 stole+0.035
 smoking+0.035
 driving+0.035
 blocking+0.032
Neg logit contributions
 pleasure-0.048
 satisfaction-0.047
 accomplishment-0.045
 delight-0.045
 fascination-0.042
Token shiny
Token ID22441
Feature activation-4.95e-01

Pos logit contributions
 parked+0.038
 smoking+0.035
 driving+0.035
 stole+0.035
 blocking+0.034
Neg logit contributions
 pleasure-0.026
 satisfaction-0.025
 accomplishment-0.024
 delight-0.023
 fascination-0.022
Token new
Token ID649
Feature activation-4.40e-02

Pos logit contributions
 parked+0.015
 smoking+0.014
 stole+0.014
 driving+0.013
 disturbed+0.013
Neg logit contributions
 satisfaction-0.027
 pleasure-0.027
 accomplishment-0.027
 delight-0.026
 fascination-0.026
Token vehicle
Token ID4038
Feature activation-4.91e+00

Pos logit contributions
 parked+0.001
 smoking+0.001
 stole+0.001
 driving+0.001
 disturbed+0.001
Neg logit contributions
 satisfaction-0.002
 pleasure-0.002
 accomplishment-0.002
 delight-0.002
 fascination-0.002
Token came
Token ID1625
Feature activation-1.75e+00

Pos logit contributions
through+0.211
angs+0.206
Ro+0.196
chard+0.188
 smoking+0.178
Neg logit contributions
 and-0.145
 hope-0.138
 accomplishment-0.134
 satisfaction-0.131
 loves-0.127
Token to
Token ID284
Feature activation-7.98e-01

Pos logit contributions
 stole+0.063
 parked+0.061
 smoking+0.059
Ro+0.059
 disturbed+0.055
Neg logit contributions
 pleasure-0.079
 accomplishment-0.077
 satisfaction-0.073
 fascination-0.071
 delight-0.070
Token town
Token ID3240
Feature activation-1.68e+00

Pos logit contributions
 parked+0.032
 stole+0.030
 smoking+0.030
through+0.029
 blocking+0.029
Neg logit contributions
 pleasure-0.037
 satisfaction-0.037
 accomplishment-0.036
 delight-0.034
 fascination-0.034
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
 CA+0.069
Ro+0.068
 stole+0.067
 parked+0.065
 smoking+0.063
Neg logit contributions
 accomplishment-0.068
 pleasure-0.066
 satisfaction-0.065
 delight-0.063
 fascination-0.061
Token Everyone
Token ID11075
Feature activation-1.43e+00

Pos logit contributions
 parked+0.063
 smoking+0.061
 driving+0.060
 stole+0.060
 blocking+0.058
Neg logit contributions
 pleasure-0.057
 satisfaction-0.057
 delight-0.055
 accomplishment-0.051
 fascination-0.050
Token day
Token ID1110
Feature activation-9.01e-01

Pos logit contributions
 parked+0.021
 stole+0.021
 smoking+0.021
 driving+0.020
 blocking+0.020
Neg logit contributions
 satisfaction-0.016
 pleasure-0.016
 accomplishment-0.016
 delight-0.015
 fascination-0.014
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
 parked+0.056
 stole+0.053
 smoking+0.053
 driving+0.052
 blocking+0.051
Neg logit contributions
 pleasure-0.021
 satisfaction-0.021
 accomplishment-0.020
 delight-0.019
 fascination-0.017
Token a
Token ID257
Feature activation-1.08e+00

Pos logit contributions
 parked+0.069
angs+0.067
 stole+0.066
cher+0.064
wheel+0.064
Neg logit contributions
 pleasure-0.060
 satisfaction-0.058
 delight-0.056
 accomplishment-0.054
 fascination-0.050
Token new
Token ID649
Feature activation-5.61e-01

Pos logit contributions
 parked+0.056
 smoking+0.052
 driving+0.052
 stole+0.052
 blocking+0.051
Neg logit contributions
 pleasure-0.038
 satisfaction-0.037
 accomplishment-0.036
 delight-0.033
 difference-0.032
Token green
Token ID4077
Feature activation-1.25e+00

Pos logit contributions
 parked+0.022
 smoking+0.021
 stole+0.021
 driving+0.020
 blocking+0.020
Neg logit contributions
 pleasure-0.026
 satisfaction-0.026
 accomplishment-0.025
 delight-0.024
 fascination-0.024
Token truck
Token ID7779
Feature activation-4.91e+00

Pos logit contributions
 parked+0.032
 smoking+0.031
through+0.030
 stole+0.030
 blocking+0.029
Neg logit contributions
 pleasure-0.072
 satisfaction-0.071
 accomplishment-0.070
 delight-0.068
 decision-0.067
Token came
Token ID1625
Feature activation-1.73e+00

Pos logit contributions
through+0.219
nam+0.218
chard+0.216
angs+0.200
cher+0.191
Neg logit contributions
 and-0.142
 thanks-0.121
 hope-0.121
 satisfaction-0.119
 loves-0.113
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
 stole+0.067
 parked+0.067
 smoking+0.063
chard+0.062
Ro+0.061
Neg logit contributions
 pleasure-0.071
 accomplishment-0.070
 satisfaction-0.066
 fascination-0.065
 delight-0.064
Token play
Token ID711
Feature activation-2.41e+00

Pos logit contributions
 parked+0.048
through+0.047
angs+0.046
 stole+0.046
 smoking+0.045
Neg logit contributions
 pleasure-0.050
 satisfaction-0.049
 accomplishment-0.048
 delight-0.045
 fascination-0.045
Token.
Token ID13
Feature activation-1.83e+00

Pos logit contributions
through+0.118
 knocked+0.116
 stole+0.115
 rang+0.115
 parked+0.114
Neg logit contributions
 accomplishment-0.073
 agreement-0.072
 satisfaction-0.071
 delight-0.071
 difference-0.069
Token The
Token ID383
Feature activation-1.02e+00

Pos logit contributions
 parked+0.089
 driving+0.086
 smoking+0.086
cher+0.086
 stole+0.085
Neg logit contributions
 pleasure-0.065
 delight-0.065
 satisfaction-0.063
 accomplishment-0.056
 fascination-0.054
Token small
Token ID1402
Feature activation-1.73e-01

Pos logit contributions
 parked+0.040
 driving+0.037
 smoking+0.036
 stole+0.036
 disturbed+0.034
Neg logit contributions
 pleasure-0.041
 satisfaction-0.040
 accomplishment-0.040
 delight-0.038
 fascination-0.038
Token,
Token ID11
Feature activation-9.69e-01

Pos logit contributions
 parked+0.006
 driving+0.005
 stole+0.005
 smoking+0.005
 knocked+0.005
Neg logit contributions
 satisfaction-0.009
 pleasure-0.009
 accomplishment-0.009
 delight-0.009
 fascination-0.009
Token round
Token ID2835
Feature activation-1.03e+00

Pos logit contributions
 parked+0.036
 driving+0.034
 stole+0.033
 smoking+0.033
 knocked+0.032
Neg logit contributions
 satisfaction-0.046
 pleasure-0.045
 accomplishment-0.045
 delight-0.043
 fascination-0.043
Token,
Token ID11
Feature activation-5.74e-01

Pos logit contributions
 parked+0.040
 stole+0.038
 driving+0.038
 smoking+0.038
 disturbed+0.036
Neg logit contributions
 satisfaction-0.047
 accomplishment-0.046
 delight-0.045
 pleasure-0.045
 decision-0.043
Token green
Token ID4077
Feature activation-1.53e+00

Pos logit contributions
 parked+0.021
 stole+0.020
 driving+0.020
 smoking+0.020
 knocked+0.019
Neg logit contributions
 satisfaction-0.027
 pleasure-0.027
 accomplishment-0.026
 delight-0.026
 fascination-0.025
Token thing
Token ID1517
Feature activation-4.90e+00

Pos logit contributions
 parked+0.061
 stole+0.059
 driving+0.058
cher+0.057
 disturbed+0.057
Neg logit contributions
 satisfaction-0.067
 delight-0.065
 accomplishment-0.063
 pleasure-0.062
 fascination-0.060
Token on
Token ID319
Feature activation-8.75e-01

Pos logit contributions
 Indians+0.242
cher+0.238
Lost+0.237
chard+0.224
 driven+0.223
Neg logit contributions
 and-0.115
 until-0.092
 goodness-0.088
,-0.088
 de-0.085
Token the
Token ID262
Feature activation-5.10e-01

Pos logit contributions
 parked+0.035
 stole+0.035
 smoking+0.033
 driving+0.033
 disturbed+0.033
Neg logit contributions
 satisfaction-0.039
 delight-0.038
 accomplishment-0.037
 pleasure-0.037
 fascination-0.036
Token ground
Token ID2323
Feature activation-2.91e+00

Pos logit contributions
 parked+0.021
 smoking+0.021
 stole+0.020
 driving+0.020
 disturbed+0.020
Neg logit contributions
 satisfaction-0.022
 pleasure-0.022
 accomplishment-0.022
 delight-0.021
 fascination-0.020
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
 CA+0.157
Ro+0.156
 Found+0.156
cher+0.155
 stole+0.154
Neg logit contributions
 accomplishment-0.062
 until-0.059
 decision-0.059
 satisfaction-0.058
 delight-0.057
Token It
Token ID632
Feature activation-1.86e+00

Pos logit contributions
 parked+0.050
 stole+0.047
 driving+0.047
 smoking+0.047
 disturbed+0.046
Neg logit contributions
 pleasure-0.047
 satisfaction-0.045
 delight-0.045
 accomplishment-0.042
 fascination-0.041
Token,
Token ID11
Feature activation-3.41e-02

Pos logit contributions
 satisfaction+0.009
 pleasure+0.009
 accomplishment+0.008
 fascination+0.008
 delight+0.008
Neg logit contributions
 parked-0.012
 driving-0.011
 smoking-0.011
 stole-0.011
 knocked-0.011
Token they
Token ID484
Feature activation+2.70e-01

Pos logit contributions
 parked+0.001
 driving+0.001
 smoking+0.001
 stole+0.001
 disturbed+0.001
Neg logit contributions
 satisfaction-0.002
 pleasure-0.002
 accomplishment-0.002
 delight-0.002
 fascination-0.002
Token see
Token ID766
Feature activation-8.61e-01

Pos logit contributions
 satisfaction+0.013
 pleasure+0.013
 accomplishment+0.013
 delight+0.012
 fascination+0.012
Neg logit contributions
 parked-0.011
 stole-0.010
 driving-0.010
 smoking-0.010
 knocked-0.009
Token a
Token ID257
Feature activation-1.19e+00

Pos logit contributions
 parked+0.031
 stole+0.030
 driving+0.029
 smoking+0.028
 guarding+0.027
Neg logit contributions
 satisfaction-0.042
 pleasure-0.042
 delight-0.040
 accomplishment-0.039
 fascination-0.039
Token new
Token ID649
Feature activation-2.55e-01

Pos logit contributions
 parked+0.051
 driving+0.047
 stole+0.046
 smoking+0.046
 driven+0.045
Neg logit contributions
 pleasure-0.051
 accomplishment-0.050
 satisfaction-0.050
 delight-0.048
 difference-0.047
Token thing
Token ID1517
Feature activation-4.89e+00

Pos logit contributions
 parked+0.010
 stole+0.009
 smoking+0.009
 driving+0.009
 disturbed+0.008
Neg logit contributions
 satisfaction-0.013
 pleasure-0.013
 accomplishment-0.013
 delight-0.012
 fascination-0.012
Token in
Token ID287
Feature activation-1.16e+00

Pos logit contributions
Lost+0.214
 Indians+0.211
Wh+0.207
 stole+0.203
chard+0.203
Neg logit contributions
 accomplishment-0.132
 goodness-0.125
 and-0.116
 darling-0.116
 delight-0.114
Token the
Token ID262
Feature activation-2.80e-01

Pos logit contributions
 parked+0.048
 stole+0.047
 driving+0.047
 smoking+0.046
 disturbed+0.044
Neg logit contributions
 satisfaction-0.051
 pleasure-0.049
 delight-0.048
 accomplishment-0.048
 fascination-0.047
Token gym
Token ID11550
Feature activation-3.27e+00

Pos logit contributions
 parked+0.009
 smoking+0.009
 driving+0.009
 stole+0.009
 disturbed+0.008
Neg logit contributions
 satisfaction-0.015
 pleasure-0.015
 accomplishment-0.014
 delight-0.014
 fascination-0.014
Token.
Token ID13
Feature activation-7.92e-01

Pos logit contributions
through+0.155
Ro+0.151
 CA+0.148
 stole+0.148
 bitten+0.148
Neg logit contributions
 accomplishment-0.092
 decision-0.090
 satisfaction-0.090
 delight-0.085
 difference-0.080
Token It
Token ID632
Feature activation-1.67e+00

Pos logit contributions
 parked+0.034
 driving+0.033
 smoking+0.032
 stole+0.032
 blocking+0.030
Neg logit contributions
 pleasure-0.035
 satisfaction-0.034
 delight-0.034
 accomplishment-0.032
 fascination-0.032
Token
Token ID198
Feature activation-6.43e-01

Pos logit contributions
 parked+0.031
 stole+0.030
 smoking+0.030
 driving+0.029
 disturbed+0.028
Neg logit contributions
 satisfaction-0.029
 pleasure-0.029
 accomplishment-0.028
 delight-0.027
 fascination-0.026
TokenThen
Token ID6423
Feature activation-9.25e-01

Pos logit contributions
 parked+0.029
 driving+0.029
 smoking+0.028
 stole+0.027
 disturbed+0.027
Neg logit contributions
 satisfaction-0.025
 accomplishment-0.024
 pleasure-0.023
 delight-0.023
 fascination-0.022
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
 parked+0.033
 smoking+0.031
 driving+0.030
 stole+0.030
through+0.029
Neg logit contributions
 satisfaction-0.046
 delight-0.044
 accomplishment-0.044
 pleasure-0.044
 fascination-0.042
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
 parked+0.054
anes+0.053
 smoking+0.052
 driving+0.051
through+0.051
Neg logit contributions
 satisfaction-0.084
 accomplishment-0.080
 delight-0.078
 pleasure-0.078
 relief-0.075
Token bad
Token ID2089
Feature activation-3.55e-01

Pos logit contributions
 parked+0.044
 stole+0.040
 driving+0.040
 smoking+0.040
 blocking+0.038
Neg logit contributions
 satisfaction-0.047
 pleasure-0.046
 accomplishment-0.046
 delight-0.044
 fascination-0.042
Token thing
Token ID1517
Feature activation-4.89e+00

Pos logit contributions
 parked+0.013
 stole+0.011
 smoking+0.011
 driving+0.011
 blocking+0.011
Neg logit contributions
 satisfaction-0.018
 pleasure-0.018
 accomplishment-0.018
 delight-0.017
 fascination-0.017
Token happened
Token ID3022
Feature activation-1.79e+00

Pos logit contributions
chard+0.218
through+0.182
nam+0.180
 Indians+0.172
 smoking+0.164
Neg logit contributions
 moments-0.159
 accomplishment-0.157
 hope-0.150
 satisfaction-0.147
 relief-0.146
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 parked+0.081
 stole+0.079
Ro+0.079
through+0.078
saw+0.077
Neg logit contributions
 accomplishment-0.064
 satisfaction-0.059
 pleasure-0.057
 goodness-0.057
 decision-0.055
Token A
Token ID317
Feature activation-1.45e+00

Pos logit contributions
 parked+0.075
 driving+0.073
 stole+0.073
 knocked+0.071
 guarding+0.070
Neg logit contributions
 satisfaction-0.068
 pleasure-0.064
 delight-0.062
 accomplishment-0.060
 relief-0.059
Token big
Token ID1263
Feature activation+1.60e-01

Pos logit contributions
 parked+0.061
 stole+0.054
through+0.052
 driving+0.051
 smoking+0.051
Neg logit contributions
 satisfaction-0.063
 accomplishment-0.063
 pleasure-0.062
 difference-0.061
 decision-0.059
Token car
Token ID1097
Feature activation-4.14e+00

Pos logit contributions
 satisfaction+0.009
 pleasure+0.009
 accomplishment+0.009
 delight+0.009
 fascination+0.009
Neg logit contributions
 parked-0.004
 driving-0.004
 smoking-0.004
 stole-0.004
 knocked-0.004
Token,
Token ID11
Feature activation+9.61e-02

Pos logit contributions
 parked+0.012
 driving+0.012
 stole+0.012
 smoking+0.012
 disturbed+0.011
Neg logit contributions
 satisfaction-0.007
 pleasure-0.007
 accomplishment-0.007
 delight-0.007
 fascination-0.006
Token they
Token ID484
Feature activation-5.82e-01

Pos logit contributions
 satisfaction+0.005
 pleasure+0.005
 accomplishment+0.005
 delight+0.005
 fascination+0.005
Neg logit contributions
 parked-0.003
 driving-0.003
 smoking-0.003
 stole-0.003
 disturbed-0.002
Token see
Token ID766
Feature activation-1.31e+00

Pos logit contributions
 parked+0.023
 driving+0.023
 smoking+0.023
 stole+0.022
 blocking+0.021
Neg logit contributions
 satisfaction-0.026
 pleasure-0.025
 accomplishment-0.024
 delight-0.024
 fascination-0.024
Token a
Token ID257
Feature activation-1.61e+00

Pos logit contributions
 stole+0.054
 parked+0.051
 guarding+0.049
cher+0.049
 blocking+0.048
Neg logit contributions
 pleasure-0.055
 satisfaction-0.054
 delight-0.053
 amaz-0.052
 fascination-0.051
Token real
Token ID1103
Feature activation-8.20e-01

Pos logit contributions
 parked+0.070
 stole+0.066
 bitten+0.064
 driving+0.064
 smoking+0.063
Neg logit contributions
 pleasure-0.064
 accomplishment-0.064
 satisfaction-0.061
 difference-0.061
 decision-0.059
Token train
Token ID4512
Feature activation-4.89e+00

Pos logit contributions
 parked+0.023
 stole+0.022
 smoking+0.020
 blocking+0.019
 driving+0.019
Neg logit contributions
 accomplishment-0.047
 pleasure-0.047
 satisfaction-0.047
 delight-0.044
 fascination-0.044
Token outside
Token ID2354
Feature activation-2.55e+00

Pos logit contributions
angs+0.214
 bitten+0.209
chard+0.198
through+0.197
Ro+0.194
Neg logit contributions
 and-0.138
 difference-0.129
 accomplishment-0.124
 until-0.123
 decision-0.117
Token.
Token ID13
Feature activation-7.99e-01

Pos logit contributions
cher+0.117
 stole+0.115
chard+0.115
 parked+0.114
wheel+0.114
Neg logit contributions
 satisfaction-0.082
 delight-0.080
 decision-0.078
 accomplishment-0.078
 amaz-0.076
Token It
Token ID632
Feature activation-2.01e+00

Pos logit contributions
 parked+0.035
 driving+0.034
 smoking+0.033
 stole+0.033
 disturbed+0.032
Neg logit contributions
 pleasure-0.034
 satisfaction-0.034
 delight-0.034
 accomplishment-0.032
 fascination-0.032
Token is
Token ID318
Feature activation-2.53e+00

Pos logit contributions
chard+0.119
 smoking+0.119
 bitten+0.110
 spies+0.110
cher+0.109
Neg logit contributions
 delight-0.039
 pleasure-0.038
 fascination-0.037
 accomplishment-0.037
 difference-0.036
Token big
Token ID1263
Feature activation-1.26e-01

Pos logit contributions
 stole+0.124
cher+0.118
aro+0.111
Ro+0.110
saw+0.108
Neg logit contributions
 pleasure-0.091
 accomplishment-0.089
 delight-0.085
 fascination-0.084
 amaz-0.083
Token they
Token ID484
Feature activation+1.62e-01

Pos logit contributions
 parked+0.039
 smoking+0.038
 stole+0.037
 driving+0.036
 blocking+0.035
Neg logit contributions
 pleasure-0.058
 satisfaction-0.056
 accomplishment-0.055
 delight-0.055
 fascination-0.051
Token see
Token ID766
Feature activation-6.99e-01

Pos logit contributions
 satisfaction+0.008
 pleasure+0.008
 accomplishment+0.008
 delight+0.007
 fascination+0.007
Neg logit contributions
 parked-0.006
 driving-0.006
 stole-0.006
 smoking-0.006
 knocked-0.006
Token a
Token ID257
Feature activation-1.50e+00

Pos logit contributions
 parked+0.023
 stole+0.022
 smoking+0.022
 driving+0.022
 knocked+0.020
Neg logit contributions
 pleasure-0.036
 satisfaction-0.035
 delight-0.034
 accomplishment-0.034
 fascination-0.033
Token big
Token ID1263
Feature activation+1.22e-01

Pos logit contributions
 parked+0.062
 driving+0.058
 smoking+0.057
 stole+0.057
 bitten+0.055
Neg logit contributions
 pleasure-0.064
 accomplishment-0.063
 satisfaction-0.063
 difference-0.060
 delight-0.059
Token red
Token ID2266
Feature activation-1.28e+00

Pos logit contributions
 satisfaction+0.007
 pleasure+0.006
 accomplishment+0.006
 delight+0.006
 fascination+0.006
Neg logit contributions
 parked-0.004
 driving-0.004
 smoking-0.004
 stole-0.004
 disturbed-0.003
Token thing
Token ID1517
Feature activation-4.87e+00

Pos logit contributions
 parked+0.045
 stole+0.043
 disturbed+0.042
 driving+0.042
 smoking+0.042
Neg logit contributions
 satisfaction-0.062
 pleasure-0.061
 accomplishment-0.061
 delight-0.059
 fascination-0.059
Token on
Token ID319
Feature activation-1.50e+00

Pos logit contributions
 Indians+0.238
Lost+0.228
 villains+0.223
Wh+0.219
Ro+0.216
Neg logit contributions
 and-0.119
 accomplishment-0.111
 goodness-0.109
 de-0.102
 until-0.101
Token a
Token ID257
Feature activation-1.40e+00

Pos logit contributions
cher+0.066
 stole+0.065
 parked+0.064
 driving+0.063
 smoking+0.063
Neg logit contributions
 satisfaction-0.059
 delight-0.058
 accomplishment-0.058
 pleasure-0.056
 fascination-0.055
Token green
Token ID4077
Feature activation-1.79e+00

Pos logit contributions
 parked+0.057
 driving+0.056
 stole+0.055
 disturbed+0.055
 smoking+0.054
Neg logit contributions
 pleasure-0.056
 satisfaction-0.056
 accomplishment-0.055
 delight-0.055
 fascination-0.055
Token plant
Token ID4618
Feature activation-3.82e+00

Pos logit contributions
 parked+0.073
 driving+0.070
 disturbed+0.069
 stole+0.069
 smoking+0.069
Neg logit contributions
 satisfaction-0.076
 delight-0.073
 accomplishment-0.073
 pleasure-0.073
 fascination-0.071
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
 Crash+0.185
Ro+0.185
 wrecked+0.184
through+0.184
 driven+0.183
Neg logit contributions
 difference-0.087
 goodness-0.081
 delight-0.077
 accomplishment-0.076
 satisfaction-0.074
Token when
Token ID618
Feature activation-2.17e+00

Pos logit contributions
 stole+0.051
 parked+0.050
 CA+0.048
 knocked+0.046
 blocked+0.046
Neg logit contributions
 delight-0.047
 satisfaction-0.047
 pleasure-0.046
 accomplishment-0.046
 fascination-0.044
Token he
Token ID339
Feature activation-2.05e+00

Pos logit contributions
 parked+0.074
angs+0.072
 stole+0.072
cher+0.070
tr+0.067
Neg logit contributions
 accomplishment-0.105
 satisfaction-0.102
 pleasure-0.101
 delight-0.100
 fascination-0.100
Token saw
Token ID2497
Feature activation-1.99e+00

Pos logit contributions
 blocking+0.076
 Enter+0.075
 smoking+0.074
aro+0.074
 driving+0.074
Neg logit contributions
 satisfaction-0.079
 delight-0.078
 fascination-0.076
 pleasure-0.075
oph-0.074
Token a
Token ID257
Feature activation-1.03e+00

Pos logit contributions
 stole+0.083
 parked+0.083
 guarding+0.080
cher+0.079
ider+0.078
Neg logit contributions
 satisfaction-0.076
 delight-0.076
 pleasure-0.072
 amaz-0.070
 accomplishment-0.070
Token big
Token ID1263
Feature activation+2.93e-01

Pos logit contributions
 parked+0.041
 stole+0.038
 driving+0.037
 smoking+0.037
 disturbed+0.036
Neg logit contributions
 pleasure-0.047
 satisfaction-0.046
 accomplishment-0.046
 delight-0.044
 fascination-0.044
Token vehicle
Token ID4038
Feature activation-4.86e+00

Pos logit contributions
 satisfaction+0.017
 pleasure+0.017
 accomplishment+0.017
 delight+0.016
 fascination+0.016
Neg logit contributions
 parked-0.009
 driving-0.008
 smoking-0.008
 stole-0.008
 knocked-0.007
Token coming
Token ID2406
Feature activation-2.57e+00

Pos logit contributions
 villains+0.217
Ro+0.215
 Indians+0.214
angs+0.212
 bitten+0.205
Neg logit contributions
 accomplishment-0.132
 decision-0.130
 and-0.129
 until-0.119
 goodness-0.115
Token down
Token ID866
Feature activation-2.78e+00

Pos logit contributions
 stole+0.111
 disturbed+0.108
 villains+0.106
 smoking+0.106
 killed+0.105
Neg logit contributions
 accomplishment-0.085
 fascination-0.082
 satisfaction-0.081
 pleasure-0.080
 delight-0.076
Token the
Token ID262
Feature activation-9.15e-01

Pos logit contributions
cher+0.153
ophone+0.151
 stole+0.147
 villains+0.140
tr+0.140
Neg logit contributions
 satisfaction-0.076
 accomplishment-0.071
 delight-0.063
 joy-0.061
 fascination-0.057
Token road
Token ID2975
Feature activation-2.60e+00

Pos logit contributions
 stole+0.032
 smoking+0.031
 parked+0.031
 disturbed+0.030
 blocked+0.028
Neg logit contributions
 satisfaction-0.043
 accomplishment-0.043
 pleasure-0.042
 delight-0.040
 difference-0.039
Token.
Token ID13
Feature activation-8.10e-01

Pos logit contributions
 CA+0.119
Ro+0.118
 stole+0.118
 bitten+0.111
CA+0.110
Neg logit contributions
 accomplishment-0.089
 decision-0.084
 satisfaction-0.081
 difference-0.078
 delight-0.078
Token named
Token ID3706
Feature activation-1.87e+00

Pos logit contributions
 parked+0.036
 driving+0.032
 blocking+0.032
 smoking+0.031
through+0.030
Neg logit contributions
 satisfaction-0.088
 pleasure-0.084
 delight-0.083
 accomplishment-0.082
 fascination-0.080
Token Tim
Token ID5045
Feature activation-9.38e-01

Pos logit contributions
 smoking+0.070
 parked+0.070
 blocking+0.069
 driving+0.069
 blown+0.067
Neg logit contributions
 pleasure-0.084
 satisfaction-0.082
 delight-0.079
 goodness-0.071
 accomplishment-0.069
Token found
Token ID1043
Feature activation-2.22e+00

Pos logit contributions
 parked+0.029
 smoking+0.028
 driving+0.028
 blocking+0.026
through+0.025
Neg logit contributions
 satisfaction-0.049
 pleasure-0.047
 accomplishment-0.046
 delight-0.045
 fascination-0.045
Token a
Token ID257
Feature activation-8.93e-01

Pos logit contributions
aro+0.096
 stole+0.096
 parked+0.096
riages+0.094
cher+0.093
Neg logit contributions
 satisfaction-0.081
 pleasure-0.075
 delight-0.074
 accomplishment-0.071
 relief-0.068
Token toy
Token ID13373
Feature activation-4.56e+00

Pos logit contributions
 parked+0.038
 driving+0.034
 stole+0.034
 smoking+0.034
 disturbed+0.032
Neg logit contributions
 pleasure-0.040
 satisfaction-0.039
 accomplishment-0.039
 fascination-0.037
 delight-0.036
Token motorcycle
Token ID18757
Feature activation-4.84e+00

Pos logit contributions
 driven+0.197
saw+0.195
angs+0.195
through+0.194
RO+0.192
Neg logit contributions
 goodness-0.130
 accomplishment-0.119
 and-0.119
 decision-0.118
ummy-0.117
Token.
Token ID13
Feature activation-8.74e-01

Pos logit contributions
Ro+0.277
angs+0.260
 villains+0.258
 Indians+0.254
aro+0.254
Neg logit contributions
 and-0.095
,-0.080
 accomplishment-0.068
 until-0.064
 tightly-0.062
Token It
Token ID632
Feature activation-2.61e+00

Pos logit contributions
 parked+0.036
 driving+0.035
 smoking+0.035
 stole+0.035
 disturbed+0.034
Neg logit contributions
 pleasure-0.038
 satisfaction-0.038
 delight-0.036
 accomplishment-0.036
 fascination-0.034
Token was
Token ID373
Feature activation-2.88e+00

Pos logit contributions
 spies+0.148
angs+0.142
 smoking+0.137
chard+0.132
aro+0.132
Neg logit contributions
 pleasure-0.047
 satisfaction-0.047
 delight-0.046
 accomplishment-0.045
 fascination-0.043
Token red
Token ID2266
Feature activation-6.39e-01

Pos logit contributions
aro+0.154
cher+0.152
 stole+0.149
angs+0.146
Ro+0.144
Neg logit contributions
 accomplishment-0.090
 pleasure-0.085
 satisfaction-0.081
 delight-0.081
 sweet-0.073
Token and
Token ID290
Feature activation-1.60e+00

Pos logit contributions
 parked+0.037
 stole+0.037
 smoking+0.036
 driving+0.036
 disturbed+0.036
Neg logit contributions
 pleasure-0.016
 satisfaction-0.015
 accomplishment-0.015
 delight-0.014
 fascination-0.013
Token they
Token ID484
Feature activation-8.38e-02

Pos logit contributions
 parked+0.016
 smoking+0.015
 stole+0.015
 driving+0.015
 blocking+0.014
Neg logit contributions
 pleasure-0.032
 satisfaction-0.032
 accomplishment-0.031
 delight-0.031
 fascination-0.030
Token see
Token ID766
Feature activation-7.47e-01

Pos logit contributions
 parked+0.003
 driving+0.003
 smoking+0.003
 stole+0.003
 disturbed+0.003
Neg logit contributions
 satisfaction-0.004
 pleasure-0.004
 accomplishment-0.004
 delight-0.004
 fascination-0.004
Token a
Token ID257
Feature activation-1.42e+00

Pos logit contributions
 parked+0.025
 stole+0.024
 smoking+0.024
 driving+0.024
 guarding+0.022
Neg logit contributions
 pleasure-0.038
 satisfaction-0.037
 delight-0.036
 accomplishment-0.035
 fascination-0.035
Token big
Token ID1263
Feature activation+2.92e-01

Pos logit contributions
 parked+0.059
 driving+0.055
 smoking+0.054
 stole+0.054
 driven+0.052
Neg logit contributions
 pleasure-0.061
 accomplishment-0.060
 satisfaction-0.060
 difference-0.057
 delight-0.057
Token white
Token ID2330
Feature activation-1.05e+00

Pos logit contributions
 satisfaction+0.016
 pleasure+0.016
 accomplishment+0.015
 delight+0.015
 fascination+0.015
Neg logit contributions
 parked-0.010
 smoking-0.009
 driving-0.009
 stole-0.009
 knocked-0.008
Token thing
Token ID1517
Feature activation-4.81e+00

Pos logit contributions
 parked+0.038
 stole+0.036
 driving+0.035
 smoking+0.035
 disturbed+0.034
Neg logit contributions
 satisfaction-0.051
 pleasure-0.050
 accomplishment-0.049
 delight-0.048
 fascination-0.048
Token on
Token ID319
Feature activation-1.34e+00

Pos logit contributions
 Indians+0.234
Lost+0.229
 villains+0.221
Wh+0.218
 stole+0.212
Neg logit contributions
 and-0.116
 goodness-0.107
 accomplishment-0.106
 until-0.104
 de-0.098
Token the
Token ID262
Feature activation-8.18e-01

Pos logit contributions
 stole+0.058
 parked+0.057
cher+0.057
 driving+0.056
 smoking+0.056
Neg logit contributions
 satisfaction-0.055
 delight-0.054
 accomplishment-0.053
 pleasure-0.052
 fascination-0.051
Token grass
Token ID8701
Feature activation-2.90e+00

Pos logit contributions
 stole+0.035
 smoking+0.035
 parked+0.034
 disturbed+0.034
 driving+0.034
Neg logit contributions
 satisfaction-0.034
 pleasure-0.033
 accomplishment-0.032
 delight-0.032
 fascination-0.030
Token.
Token ID13
Feature activation-4.81e-01

Pos logit contributions
 stole+0.137
 killed+0.136
 driving+0.132
 Found+0.132
 villains+0.131
Neg logit contributions
ness-0.076
 delight-0.076
aman-0.076
 decision-0.074
 amaz-0.074
Token It
Token ID632
Feature activation-1.12e+00

Pos logit contributions
 parked+0.019
 driving+0.019
 smoking+0.018
 stole+0.018
 disturbed+0.017
Neg logit contributions
 pleasure-0.022
 satisfaction-0.022
 delight-0.021
 accomplishment-0.021
 fascination-0.021
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 parked+0.049
 smoking+0.047
 stole+0.046
 driving+0.046
 blocking+0.045
Neg logit contributions
 pleasure-0.022
 satisfaction-0.021
 accomplishment-0.021
 delight-0.020
 fascination-0.018
Token they
Token ID484
Feature activation-3.19e-01

Pos logit contributions
 parked+0.046
 smoking+0.045
 stole+0.043
 driving+0.043
 guarding+0.043
Neg logit contributions
 pleasure-0.064
 satisfaction-0.062
 accomplishment-0.061
 delight-0.061
 fascination-0.055
Token notice
Token ID4003
Feature activation-1.64e+00

Pos logit contributions
 parked+0.013
 smoking+0.013
 driving+0.013
 stole+0.012
 blocking+0.012
Neg logit contributions
 satisfaction-0.014
 pleasure-0.014
 accomplishment-0.013
 delight-0.013
 fascination-0.013
Token a
Token ID257
Feature activation-1.57e+00

Pos logit contributions
cher+0.059
 stole+0.058
 parked+0.057
 guarding+0.056
 smoking+0.056
Neg logit contributions
 pleasure-0.073
 delight-0.072
 satisfaction-0.071
 accomplishment-0.070
 amaz-0.069
Token big
Token ID1263
Feature activation-4.77e-02

Pos logit contributions
 parked+0.067
 driving+0.062
 stole+0.060
 smoking+0.060
 driven+0.058
Neg logit contributions
 accomplishment-0.065
 pleasure-0.065
 satisfaction-0.065
 difference-0.062
 delight-0.061
Token truck
Token ID7779
Feature activation-4.80e+00

Pos logit contributions
 parked+0.002
 driving+0.001
 smoking+0.001
 stole+0.001
 disturbed+0.001
Neg logit contributions
 satisfaction-0.003
 pleasure-0.002
 accomplishment-0.002
 delight-0.002
 fascination-0.002
Token on
Token ID319
Feature activation-1.54e+00

Pos logit contributions
 villains+0.209
 Indians+0.202
Ro+0.199
chard+0.198
angs+0.189
Neg logit contributions
 and-0.136
 decision-0.133
 accomplishment-0.118
 goodness-0.116
 amaz-0.116
Token the
Token ID262
Feature activation-7.83e-01

Pos logit contributions
 stole+0.072
cher+0.070
 smoking+0.068
 parked+0.067
 disturbed+0.066
Neg logit contributions
 satisfaction-0.058
 accomplishment-0.058
 delight-0.057
 fascination-0.054
 pleasure-0.054
Token road
Token ID2975
Feature activation-3.13e+00

Pos logit contributions
 stole+0.027
 smoking+0.027
 disturbed+0.027
 parked+0.026
 driving+0.025
Neg logit contributions
 satisfaction-0.038
 pleasure-0.037
 accomplishment-0.036
 delight-0.036
 fascination-0.034
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
Ro+0.163
 Indians+0.152
 CA+0.151
cher+0.150
 stole+0.148
Neg logit contributions
 decision-0.088
 accomplishment-0.080
 satisfaction-0.071
 delight-0.069
 difference-0.069
Token The
Token ID383
Feature activation-9.47e-01

Pos logit contributions
 driving+0.062
 smoking+0.061
 parked+0.060
cher+0.060
 guarding+0.060
Neg logit contributions
 pleasure-0.050
 satisfaction-0.049
 delight-0.049
 accomplishment-0.044
 amaz-0.042
Token saw
Token ID2497
Feature activation-1.57e+00

Pos logit contributions
 parked+0.013
 driving+0.012
 smoking+0.012
 stole+0.012
 blocking+0.011
Neg logit contributions
 satisfaction-0.019
 pleasure-0.019
 accomplishment-0.019
 delight-0.018
 fascination-0.018
Token a
Token ID257
Feature activation-6.09e-01

Pos logit contributions
 parked+0.064
 driving+0.060
 stole+0.059
cher+0.058
saw+0.056
Neg logit contributions
 satisfaction-0.066
 pleasure-0.065
 delight-0.064
 accomplishment-0.061
 fascination-0.060
Token long
Token ID890
Feature activation-2.44e-01

Pos logit contributions
 parked+0.025
 driving+0.023
 stole+0.022
 smoking+0.022
 knocked+0.021
Neg logit contributions
 pleasure-0.028
 satisfaction-0.028
 accomplishment-0.027
 delight-0.026
 fascination-0.026
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
 parked+0.009
 driving+0.009
 smoking+0.009
 stole+0.009
 disturbed+0.008
Neg logit contributions
 satisfaction-0.012
 pleasure-0.012
 accomplishment-0.012
 delight-0.011
 fascination-0.011
Token green
Token ID4077
Feature activation-9.18e-01

Pos logit contributions
 parked+0.032
 driving+0.030
 stole+0.030
 smoking+0.029
 knocked+0.028
Neg logit contributions
 satisfaction-0.041
 pleasure-0.041
 accomplishment-0.040
 delight-0.039
 fascination-0.039
Token thing
Token ID1517
Feature activation-4.78e+00

Pos logit contributions
 parked+0.036
 driving+0.034
 stole+0.034
 smoking+0.033
 knocked+0.032
Neg logit contributions
 satisfaction-0.042
 pleasure-0.041
 accomplishment-0.041
 delight-0.040
 fascination-0.039
Token in
Token ID287
Feature activation-7.39e-01

Pos logit contributions
 driven+0.221
 Indians+0.221
Lost+0.217
anes+0.214
cher+0.212
Neg logit contributions
 and-0.110
 goodness-0.102
 moments-0.098
 until-0.095
 de-0.089
Token the
Token ID262
Feature activation-4.65e-01

Pos logit contributions
 parked+0.029
 driving+0.028
 smoking+0.027
 stole+0.027
 knocked+0.026
Neg logit contributions
 satisfaction-0.035
 pleasure-0.033
 delight-0.033
 accomplishment-0.033
 fascination-0.032
Token water
Token ID1660
Feature activation-2.52e+00

Pos logit contributions
 parked+0.019
 driving+0.019
 smoking+0.018
 stole+0.018
 knocked+0.017
Neg logit contributions
 satisfaction-0.021
 pleasure-0.021
 delight-0.020
 accomplishment-0.019
 fascination-0.019
Token.
Token ID13
Feature activation-6.47e-01

Pos logit contributions
 driving+0.118
 parked+0.118
 stole+0.114
 CA+0.111
destruct+0.111
Neg logit contributions
 difference-0.075
 satisfaction-0.074
 delight-0.073
 decision-0.069
ness-0.069
Token He
Token ID679
Feature activation-1.34e+00

Pos logit contributions
 parked+0.026
 driving+0.024
 smoking+0.024
 stole+0.023
 knocked+0.022
Neg logit contributions
 pleasure-0.031
 satisfaction-0.030
 delight-0.029
 accomplishment-0.029
 fascination-0.028
Token bus
Token ID1323
Feature activation-3.34e+00

Pos logit contributions
 smoking+0.012
 stole+0.012
 parked+0.012
 disturbed+0.011
 broke+0.011
Neg logit contributions
 satisfaction-0.028
 pleasure-0.028
 accomplishment-0.028
 delight-0.027
 fascination-0.027
Token met
Token ID1138
Feature activation-1.81e+00

Pos logit contributions
 smoking+0.144
through+0.132
angs+0.132
chard+0.130
Ro+0.126
Neg logit contributions
 satisfaction-0.112
 decision-0.110
est-0.106
 delight-0.105
ness-0.104
Token a
Token ID257
Feature activation-1.18e+00

Pos logit contributions
 stole+0.074
 smoking+0.074
 parked+0.073
 knocked+0.072
 blocked+0.069
Neg logit contributions
 pleasure-0.076
 satisfaction-0.076
 delight-0.071
 accomplishment-0.066
 fascination-0.065
Token small
Token ID1402
Feature activation-5.04e-01

Pos logit contributions
 parked+0.062
 smoking+0.058
 stole+0.058
 driving+0.057
 blocking+0.056
Neg logit contributions
 pleasure-0.041
 satisfaction-0.039
 accomplishment-0.039
 delight-0.036
 difference-0.035
Token green
Token ID4077
Feature activation-1.79e+00

Pos logit contributions
 parked+0.019
 smoking+0.018
 driving+0.018
 stole+0.018
 knocked+0.017
Neg logit contributions
 pleasure-0.024
 satisfaction-0.024
 accomplishment-0.024
 delight-0.023
 fascination-0.022
Token car
Token ID1097
Feature activation-4.77e+00

Pos logit contributions
 smoking+0.060
 parked+0.060
 stole+0.060
 bitten+0.059
through+0.058
Neg logit contributions
 satisfaction-0.085
 pleasure-0.085
 accomplishment-0.083
 delight-0.083
 decision-0.080
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
angs+0.287
through+0.283
Ro+0.273
destruct+0.268
Free+0.265
Neg logit contributions
 and-0.079
 until-0.060
,-0.050
 accomplishment-0.046
 agreement-0.042
Token The
Token ID383
Feature activation-6.95e-01

Pos logit contributions
 parked+0.060
 smoking+0.060
 stole+0.060
 driving+0.059
cher+0.058
Neg logit contributions
 pleasure-0.047
 satisfaction-0.046
 delight-0.046
 accomplishment-0.041
 fascination-0.040
Token car
Token ID1097
Feature activation-2.66e+00

Pos logit contributions
 smoking+0.022
 stole+0.022
 parked+0.021
 disturbed+0.020
 blocking+0.020
Neg logit contributions
 pleasure-0.037
 satisfaction-0.037
 accomplishment-0.037
 delight-0.036
 fascination-0.035
Token liked
Token ID8288
Feature activation-2.11e+00

Pos logit contributions
 smoking+0.124
angs+0.119
through+0.115
saw+0.109
Ro+0.108
Neg logit contributions
 satisfaction-0.084
 accomplishment-0.083
 delight-0.083
 pleasure-0.081
 nod-0.075
Token to
Token ID284
Feature activation-5.72e-01

Pos logit contributions
tr+0.087
cher+0.087
 stole+0.086
 parked+0.086
through+0.084
Neg logit contributions
 pleasure-0.084
 accomplishment-0.083
 delight-0.078
 satisfaction-0.077
 joy-0.076
Token to
Token ID284
Feature activation+8.23e-01

Pos logit contributions
 satisfaction+0.043
 pleasure+0.042
 accomplishment+0.041
 fascination+0.040
 delight+0.040
Neg logit contributions
 parked-0.028
 driving-0.026
 smoking-0.026
 stole-0.026
 disturbed-0.025
Token eat
Token ID4483
Feature activation+1.55e+00

Pos logit contributions
 pleasure+0.044
 satisfaction+0.044
 accomplishment+0.043
 fascination+0.041
 delight+0.040
Neg logit contributions
 parked-0.027
 smoking-0.026
 driving-0.025
 stole-0.025
 disturbed-0.023
Token,
Token ID11
Feature activation+1.94e+00

Pos logit contributions
 satisfaction+0.085
 pleasure+0.085
 accomplishment+0.083
 fascination+0.081
 delight+0.079
Neg logit contributions
 parked-0.048
 smoking-0.047
 driving-0.045
 stole-0.044
 disturbed-0.043
Token and
Token ID290
Feature activation+7.71e-01

Pos logit contributions
 satisfaction+0.089
 pleasure+0.086
 accomplishment+0.086
 fascination+0.083
 accomplishments+0.080
Neg logit contributions
 parked-0.079
 driving-0.075
 smoking-0.073
 stole-0.070
 stuck-0.069
Token she
Token ID673
Feature activation+1.12e+00

Pos logit contributions
 pleasure+0.044
 satisfaction+0.044
 accomplishment+0.043
 fascination+0.041
 delight+0.040
Neg logit contributions
 parked-0.023
 stole-0.021
 driving-0.021
 smoking-0.021
 knocked-0.020
Token ate
Token ID15063
Feature activation+2.46e+00

Pos logit contributions
 pleasure+0.061
 satisfaction+0.060
 accomplishment+0.059
 fascination+0.056
 delight+0.055
Neg logit contributions
 parked-0.039
 stole-0.036
 knocked-0.035
 driving-0.034
 broke-0.034
Token every
Token ID790
Feature activation+4.64e-01

Pos logit contributions
 pleasure+0.150
 satisfaction+0.148
 accomplishment+0.144
 fascination+0.142
 accomplishments+0.141
Neg logit contributions
 smoking-0.055
 parked-0.051
 driving-0.049
 stuck-0.047
 disturbed-0.047
Token last
Token ID938
Feature activation+1.94e+00

Pos logit contributions
 satisfaction+0.021
 pleasure+0.020
 accomplishment+0.020
 delight+0.019
 fascination+0.019
Neg logit contributions
 parked-0.019
 smoking-0.018
 stole-0.018
 driving-0.018
 knocked-0.018
Token bite
Token ID13197
Feature activation+1.13e+00

Pos logit contributions
 pleasure+0.092
 satisfaction+0.091
 accomplishment+0.089
 fascination+0.086
 accomplishments+0.083
Neg logit contributions
 stole-0.067
 smoking-0.066
 parked-0.066
 knocked-0.064
 disturbed-0.064
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 pleasure+0.062
 satisfaction+0.061
 accomplishment+0.061
 fascination+0.058
 delight+0.057
Neg logit contributions
 parked-0.036
 smoking-0.035
 stole-0.033
 driving-0.033
 broke-0.033
Token After
Token ID2293
Feature activation+2.32e-02

Pos logit contributions
 accomplishment+0.066
 satisfaction+0.065
 decision+0.062
 pleasure+0.062
 fascination+0.061
Neg logit contributions
 parked-0.043
 driving-0.040
 smoking-0.039
 stole-0.038
 knocked-0.037
Token wonderful
Token ID7932
Feature activation+1.39e+00

Pos logit contributions
 fascination+0.084
 satisfaction+0.084
 pleasure+0.081
 accomplishment+0.081
 delight+0.080
Neg logit contributions
 driving-0.088
 stole-0.087
 broke-0.086
 parked-0.086
 knocked-0.083
Token job
Token ID1693
Feature activation+2.00e-02

Pos logit contributions
 pleasure+0.062
 satisfaction+0.061
 fascination+0.061
 delight+0.058
 amaz+0.055
Neg logit contributions
 driving-0.057
 parked-0.053
 smoking-0.052
 broke-0.051
 knocked-0.050
Token cleaning
Token ID12724
Feature activation+2.31e+00

Pos logit contributions
 satisfaction+0.001
 pleasure+0.001
 accomplishment+0.001
 delight+0.001
 fascination+0.001
Neg logit contributions
 parked-0.001
 smoking-0.001
 stole-0.001
 driving-0.001
 disturbed-0.001
Token the
Token ID262
Feature activation-3.48e-01

Pos logit contributions
 satisfaction+0.142
 pleasure+0.136
 fascination+0.136
 grin+0.132
 accomplishment+0.131
Neg logit contributions
 driving-0.062
 smoking-0.060
 parked-0.060
 cars-0.054
 guns-0.050
Token garage
Token ID15591
Feature activation+6.41e-02

Pos logit contributions
 parked+0.011
 stole+0.011
 smoking+0.010
 driving+0.010
 disturbed+0.010
Neg logit contributions
 satisfaction-0.019
 pleasure-0.018
 accomplishment-0.018
 delight-0.017
 fascination-0.017
Token."
Token ID526
Feature activation+2.89e+00

Pos logit contributions
 satisfaction+0.003
 pleasure+0.003
 accomplishment+0.003
 delight+0.003
 fascination+0.003
Neg logit contributions
 parked-0.002
 driving-0.002
 smoking-0.002
 stole-0.002
 disturbed-0.002
Token Little
Token ID7703
Feature activation+1.48e+00

Pos logit contributions
 accomplishment+0.170
 decision+0.167
 satisfaction+0.161
 accomplishments+0.154
 fascination+0.154
Neg logit contributions
 parked-0.086
 driving-0.084
 smoking-0.076
o-0.072
 Wh-0.070
Token Jimmy
Token ID12963
Feature activation+3.10e+00

Pos logit contributions
 satisfaction+0.064
 accomplishment+0.064
 pleasure+0.062
 fascination+0.062
 delight+0.059
Neg logit contributions
 parked-0.068
 stole-0.064
 broke-0.062
 driving-0.061
 smoking-0.061
Token smiled
Token ID13541
Feature activation+2.44e+00

Pos logit contributions
 pleasure+0.176
 difference+0.167
 accomplishment+0.165
 satisfaction+0.163
 fascination+0.162
Neg logit contributions
 parked-0.106
 broke-0.096
 knocked-0.094
 stole-0.088
 hit-0.086
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
 pleasure+0.127
 satisfaction+0.126
 accomplishment+0.124
 fascination+0.122
ya+0.118
Neg logit contributions
 driving-0.079
 parked-0.079
 stuck-0.073
 smoking-0.068
 broke-0.067
Token He
Token ID679
Feature activation+1.79e+00

Pos logit contributions
 accomplishment+0.130
 satisfaction+0.126
 fascination+0.124
 pleasure+0.121
 decision+0.121
Neg logit contributions
 parked-0.053
 driving-0.050
 smoking-0.044
 broke-0.041
 knocked-0.040
Token lessons
Token ID11658
Feature activation+8.62e-01

Pos logit contributions
 pleasure+0.082
 fascination+0.081
 satisfaction+0.081
 delight+0.078
 accomplishment+0.078
Neg logit contributions
 driving-0.047
 smoking-0.044
 parked-0.043
 stole-0.041
 disturbed-0.040
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
 satisfaction+0.051
 pleasure+0.050
 accomplishment+0.049
 delight+0.048
 fascination+0.048
Neg logit contributions
 parked-0.024
 driving-0.023
 smoking-0.022
 stole-0.021
 disturbed-0.020
Token<|endoftext|>
Token ID50256
Feature activation-5.38e-01

Pos logit contributions
 accomplishment+0.084
 satisfaction+0.082
 pleasure+0.081
 fascination+0.077
 delight+0.076
Neg logit contributions
 parked-0.038
 smoking-0.035
 stole-0.035
 driving-0.034
 CA-0.032
TokenOnce
Token ID7454
Feature activation+5.98e-01

Pos logit contributions
 parked+0.021
 smoking+0.020
 stole+0.020
 driving+0.020
 knocked+0.019
Neg logit contributions
 pleasure-0.025
 satisfaction-0.025
 accomplishment-0.024
 delight-0.023
 fascination-0.023
Token upon
Token ID2402
Feature activation-7.39e-01

Pos logit contributions
 satisfaction+0.028
 pleasure+0.027
 accomplishment+0.026
 delight+0.025
 fascination+0.025
Neg logit contributions
 parked-0.025
 driving-0.023
 smoking-0.023
 stole-0.023
 knocked-0.022
Token a
Token ID257
Feature activation+3.71e+00

Pos logit contributions
 parked+0.024
 stole+0.022
 smoking+0.022
 driving+0.022
 knocked+0.021
Neg logit contributions
 satisfaction-0.040
 pleasure-0.040
 accomplishment-0.038
 delight-0.038
 fascination-0.037
Token time
Token ID640
Feature activation+1.05e+00

Pos logit contributions
 satisfaction+0.219
 accomplishment+0.206
 delight+0.201
 amaz+0.198
 fascination+0.198
Neg logit contributions
 knocked-0.084
 parked-0.083
o-0.083
 disturbed-0.083
 driving-0.082
Token,
Token ID11
Feature activation-3.75e-01

Pos logit contributions
 satisfaction+0.048
 accomplishment+0.045
 pleasure+0.044
 fascination+0.042
 delight+0.042
Neg logit contributions
 parked-0.045
 driving-0.044
 smoking-0.043
 stole-0.043
 broke-0.042
Token there
Token ID612
Feature activation+7.34e-01

Pos logit contributions
 parked+0.008
 smoking+0.007
 driving+0.007
 stole+0.007
 knocked+0.006
Neg logit contributions
 pleasure-0.024
 satisfaction-0.024
 accomplishment-0.024
 delight-0.023
 fascination-0.023
Token was
Token ID373
Feature activation-2.51e-01

Pos logit contributions
 satisfaction+0.045
 accomplishment+0.044
 pleasure+0.044
 delight+0.041
 fascination+0.041
Neg logit contributions
 parked-0.021
 stole-0.018
 smoking-0.018
 driving-0.018
 broke-0.018
Token a
Token ID257
Feature activation+4.71e-01

Pos logit contributions
 parked+0.007
 stole+0.007
 smoking+0.007
 driving+0.007
 knocked+0.006
Neg logit contributions
 pleasure-0.015
 satisfaction-0.015
 accomplishment-0.014
 delight-0.014
 fascination-0.014
TokenB
Token ID33
Feature activation+1.69e+00

Pos logit contributions
 pleasure+0.012
 satisfaction+0.012
 accomplishment+0.012
 delight+0.011
 fascination+0.011
Neg logit contributions
 parked-0.012
 driving-0.011
 smoking-0.011
 stole-0.011
 disturbed-0.011
Tokenenny
Token ID11870
Feature activation+2.38e+00

Pos logit contributions
 satisfaction+0.085
 delight+0.083
 fascination+0.082
 amaz+0.080
 accomplishment+0.080
Neg logit contributions
 parked-0.059
 stole-0.059
 broke-0.055
 driving-0.055
o-0.055
Token learned
Token ID4499
Feature activation-2.66e-03

Pos logit contributions
 pleasure+0.141
 accomplishment+0.131
 fascination+0.130
 satisfaction+0.129
 amaz+0.124
Neg logit contributions
 parked-0.079
 stole-0.072
 broke-0.072
 knocked-0.069
 hurt-0.066
Token that
Token ID326
Feature activation+1.21e+00

Pos logit contributions
 parked+0.000
 smoking+0.000
 driving+0.000
 stole+0.000
 knocked+0.000
Neg logit contributions
 satisfaction-0.000
 pleasure-0.000
 accomplishment-0.000
 delight-0.000
 fascination-0.000
Token trying
Token ID2111
Feature activation+2.05e+00

Pos logit contributions
 pleasure+0.067
 satisfaction+0.067
 accomplishment+0.064
 fascination+0.063
 delight+0.062
Neg logit contributions
 smoking-0.039
 driving-0.037
 parked-0.037
 stuck-0.033
 stole-0.033
Token new
Token ID649
Feature activation+3.29e+00

Pos logit contributions
 satisfaction+0.143
 pleasure+0.140
 accomplishment+0.138
 fascination+0.137
 delight+0.135
Neg logit contributions
 smoking-0.038
 driving-0.032
 guns-0.031
 blocking-0.031
 broke-0.031
Token things
Token ID1243
Feature activation+1.44e+00

Pos logit contributions
aman+0.191
 fascination+0.184
oph+0.184
 satisfaction+0.179
 accomplishment+0.176
Neg logit contributions
 guns-0.085
 cars-0.078
 hurt-0.077
 smoking-0.076
 disturbed-0.075
Token can
Token ID460
Feature activation+1.24e+00

Pos logit contributions
 satisfaction+0.076
 pleasure+0.073
 fascination+0.073
 accomplishment+0.072
 delight+0.072
Neg logit contributions
 parked-0.050
 stole-0.048
 broke-0.047
 driving-0.047
 disturbed-0.047
Token be
Token ID307
Feature activation+5.15e-01

Pos logit contributions
 satisfaction+0.071
 pleasure+0.069
 fascination+0.067
 accomplishment+0.067
 delight+0.065
Neg logit contributions
 parked-0.038
 smoking-0.037
 driving-0.036
 stole-0.036
 hurt-0.035
Token good
Token ID922
Feature activation+9.33e-01

Pos logit contributions
 satisfaction+0.023
 pleasure+0.023
 accomplishment+0.022
 fascination+0.022
 delight+0.022
Neg logit contributions
 parked-0.022
 stole-0.021
 smoking-0.021
 driving-0.021
 disturbed-0.020
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
 satisfaction+0.055
 fascination+0.054
 pleasure+0.053
 accomplishment+0.053
 delight+0.052
Neg logit contributions
 parked-0.025
 driving-0.025
 stole-0.025
 smoking-0.025
 broke-0.024
Token with
Token ID351
Feature activation+2.14e+00

Pos logit contributions
 satisfaction+0.097
 accomplishment+0.095
 pleasure+0.095
 fascination+0.093
 delight+0.092
Neg logit contributions
 parked-0.051
 driving-0.049
 smoking-0.047
 stole-0.047
 disturbed-0.045
Token respect
Token ID2461
Feature activation+2.11e+00

Pos logit contributions
 accomplishment+0.094
 satisfaction+0.094
 pleasure+0.093
 fascination+0.092
 delight+0.090
Neg logit contributions
 smoking-0.083
 parked-0.082
 driving-0.081
 stole-0.080
 disturbed-0.078
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 accomplishment+0.117
 satisfaction+0.115
 pleasure+0.114
 fascination+0.113
 delight+0.110
Neg logit contributions
 parked-0.062
 smoking-0.061
 driving-0.058
 stole-0.056
 guns-0.056
Token Ign
Token ID16583
Feature activation+4.24e-01

Pos logit contributions
 accomplishment+0.070
 satisfaction+0.068
 fascination+0.066
 pleasure+0.066
 delight+0.065
Neg logit contributions
 parked-0.040
 smoking-0.038
 driving-0.037
 stole-0.035
 CA-0.032
Tokenor
Token ID273
Feature activation+1.50e+00

Pos logit contributions
 satisfaction+0.017
 pleasure+0.017
 accomplishment+0.016
 delight+0.015
 fascination+0.015
Neg logit contributions
 parked-0.021
 stole-0.020
 driving-0.019
 smoking-0.019
 knocked-0.019
Tokenance
Token ID590
Feature activation+2.67e+00

Pos logit contributions
 pleasure+0.067
 satisfaction+0.065
 accomplishment+0.064
 delight+0.063
 fascination+0.063
Neg logit contributions
 parked-0.065
 stole-0.063
 smoking-0.061
 driving-0.061
 broke-0.060
Token can
Token ID460
Feature activation+1.01e+00

Pos logit contributions
 fascination+0.132
 satisfaction+0.130
 pleasure+0.130
 delight+0.125
 amaz+0.125
Neg logit contributions
 broke-0.093
 stole-0.092
 parked-0.091
 driving-0.086
 disturbed-0.084
Token lead
Token ID1085
Feature activation+1.79e+00

Pos logit contributions
 satisfaction+0.054
 pleasure+0.053
 accomplishment+0.052
 fascination+0.051
 delight+0.050
Neg logit contributions
 parked-0.035
 driving-0.033
 smoking-0.032
 stole-0.032
 disturbed-0.031
Token to
Token ID284
Feature activation+1.33e+00

Pos logit contributions
 satisfaction+0.111
 accomplishment+0.109
 fascination+0.108
 pleasure+0.108
 delight+0.105
Neg logit contributions
 parked-0.042
 driving-0.041
 smoking-0.041
 stole-0.041
 disturbed-0.039
Token misunderstanding
Token ID29789
Feature activation+2.42e+00

Pos logit contributions
 satisfaction+0.063
 pleasure+0.062
 fascination+0.061
 accomplishment+0.060
 delight+0.059
Neg logit contributions
 driving-0.050
 smoking-0.049
 parked-0.048
 stole-0.046
 disturbed-0.046
Token but
Token ID475
Feature activation+1.06e+00

Pos logit contributions
 satisfaction+0.122
 pleasure+0.118
 accomplishment+0.116
 fascination+0.116
 delight+0.113
Neg logit contributions
 driving-0.084
 parked-0.084
 smoking-0.083
 stole-0.079
 guns-0.077
Token
Token ID220
Feature activation+4.71e-01

Pos logit contributions
 accomplishment+0.103
 pleasure+0.102
 satisfaction+0.101
 decision+0.094
 accomplishments+0.093
Neg logit contributions
 parked-0.063
 driving-0.056
 knocked-0.056
 stole-0.055
 smoking-0.055
Token
Token ID198
Feature activation+1.61e+00

Pos logit contributions
 satisfaction+0.025
 pleasure+0.025
 accomplishment+0.024
 delight+0.023
 fascination+0.023
Neg logit contributions
 parked-0.016
 driving-0.015
 smoking-0.015
 stole-0.014
 knocked-0.014
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
 pleasure+0.096
 accomplishment+0.095
 satisfaction+0.094
 decision+0.088
 fascination+0.087
Neg logit contributions
 parked-0.049
 driving-0.042
 knocked-0.039
 smoking-0.039
 stole-0.038
TokenThe
Token ID464
Feature activation-1.13e+00

Pos logit contributions
 pleasure+0.078
 accomplishment+0.075
 satisfaction+0.073
 fascination+0.067
 decision+0.067
Neg logit contributions
 parked-0.059
 driving-0.051
 smoking-0.050
 stole-0.050
 blocking-0.049
Token lady
Token ID10846
Feature activation-3.06e-02

Pos logit contributions
 parked+0.037
 driving+0.032
 smoking+0.032
 blocking+0.032
 blocked+0.032
Neg logit contributions
 satisfaction-0.063
 accomplishment-0.062
 delight-0.060
 pleasure-0.060
 decision-0.057
Token took
Token ID1718
Feature activation+1.12e+00

Pos logit contributions
 parked+0.001
 driving+0.001
 smoking+0.001
 stole+0.001
 disturbed+0.001
Neg logit contributions
 satisfaction-0.002
 pleasure-0.001
 accomplishment-0.001
 delight-0.001
 fascination-0.001
Token Jacob
Token ID12806
Feature activation+1.02e+00

Pos logit contributions
 pleasure+0.059
 satisfaction+0.058
 accomplishment+0.058
 difference+0.054
 fascination+0.053
Neg logit contributions
 parked-0.039
 driving-0.036
 stole-0.035
 knocked-0.035
 smoking-0.034
Token to
Token ID284
Feature activation+4.65e-01

Pos logit contributions
 satisfaction+0.063
 pleasure+0.063
 accomplishment+0.062
 delight+0.058
 fascination+0.058
Neg logit contributions
 parked-0.027
 driving-0.025
 smoking-0.024
 stole-0.024
 knocked-0.023
Token see
Token ID766
Feature activation+1.11e+00

Pos logit contributions
 satisfaction+0.026
 pleasure+0.025
 accomplishment+0.025
 delight+0.024
 fascination+0.024
Neg logit contributions
 parked-0.015
 driving-0.014
 smoking-0.014
 stole-0.013
 knocked-0.013
Token a
Token ID257
Feature activation-2.81e-01

Pos logit contributions
 satisfaction+0.061
 accomplishment+0.061
 pleasure+0.061
 delight+0.056
 fascination+0.055
Neg logit contributions
 parked-0.037
 driving-0.034
 smoking-0.033
 stole-0.032
 knocked-0.031
Token special
Token ID2041
Feature activation+5.74e-01

Pos logit contributions
 parked+0.012
 driving+0.011
 smoking+0.011
 stole+0.011
 knocked+0.010
Neg logit contributions
 satisfaction-0.013
 pleasure-0.013
 accomplishment-0.013
 delight-0.012
 fascination-0.012
Token fall
Token ID2121
Feature activation-1.69e+00

Pos logit contributions
 parked+0.037
 smoking+0.035
 driving+0.035
saw+0.034
 stole+0.033
Neg logit contributions
 accomplishment-0.048
 satisfaction-0.048
 pleasure-0.048
 delight-0.044
 decision-0.044
Token and
Token ID290
Feature activation-2.56e+00

Pos logit contributions
 parked+0.064
 stole+0.062
 smoking+0.061
saw+0.058
 driving+0.057
Neg logit contributions
 pleasure-0.073
 accomplishment-0.072
 satisfaction-0.070
 delight-0.066
 fascination-0.066
Token break
Token ID2270
Feature activation+1.13e-01

Pos logit contributions
 parked+0.085
glass+0.082
saw+0.078
 driving+0.076
anes+0.075
Neg logit contributions
 accomplishment-0.122
 pleasure-0.118
 satisfaction-0.116
 delight-0.111
 difference-0.108
Token.
Token ID13
Feature activation+5.47e-01

Pos logit contributions
 satisfaction+0.006
 pleasure+0.006
 accomplishment+0.006
 delight+0.006
 fascination+0.006
Neg logit contributions
 parked-0.004
 driving-0.003
 smoking-0.003
 stole-0.003
 disturbed-0.003
Token
Token ID220
Feature activation+2.17e-01

Pos logit contributions
 satisfaction+0.027
 pleasure+0.026
 accomplishment+0.026
 fascination+0.025
 delight+0.024
Neg logit contributions
 parked-0.021
 stole-0.020
 smoking-0.020
 driving-0.019
 broke-0.019
Token
Token ID198
Feature activation+8.12e-01

Pos logit contributions
 satisfaction+0.011
 pleasure+0.011
 accomplishment+0.011
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.008
 driving-0.007
 smoking-0.007
 stole-0.007
 knocked-0.007
Token
Token ID198
Feature activation+6.95e-01

Pos logit contributions
 satisfaction+0.047
 pleasure+0.046
 accomplishment+0.046
 fascination+0.044
 delight+0.043
Neg logit contributions
 parked-0.025
 stole-0.022
 driving-0.022
 smoking-0.022
 knocked-0.021
TokenL
Token ID43
Feature activation+5.74e-01

Pos logit contributions
 pleasure+0.039
 satisfaction+0.038
 accomplishment+0.038
 fascination+0.036
 delight+0.036
Neg logit contributions
 parked-0.023
 stole-0.021
 broke-0.020
 smoking-0.020
 driving-0.020
Tokenily
Token ID813
Feature activation+1.50e+00

Pos logit contributions
 pleasure+0.037
 satisfaction+0.037
 accomplishment+0.036
 fascination+0.036
 delight+0.035
Neg logit contributions
 parked-0.013
 stole-0.011
 smoking-0.011
 driving-0.011
 disturbed-0.011
Token was
Token ID373
Feature activation-9.12e-01

Pos logit contributions
 pleasure+0.098
 accomplishment+0.092
 satisfaction+0.092
 fascination+0.087
 amaz+0.084
Neg logit contributions
 parked-0.044
 stole-0.042
 broke-0.041
 knocked-0.040
 disturbed-0.038
Token upset
Token ID9247
Feature activation+9.38e-01

Pos logit contributions
 parked+0.039
 stole+0.039
 smoking+0.037
Ro+0.037
saw+0.037
Neg logit contributions
 satisfaction-0.037
 accomplishment-0.037
 pleasure-0.036
 delight-0.036
 fascination-0.034
Token door
Token ID3420
Feature activation+7.29e-01

Pos logit contributions
 parked+0.047
 driving+0.046
 smoking+0.046
 stole+0.045
 disturbed+0.044
Neg logit contributions
 satisfaction-0.059
 pleasure-0.058
 delight-0.057
 accomplishment-0.055
 difference-0.055
Token and
Token ID290
Feature activation-1.13e+00

Pos logit contributions
 satisfaction+0.029
 pleasure+0.029
 accomplishment+0.027
 delight+0.026
 fascination+0.025
Neg logit contributions
 parked-0.037
 driving-0.035
 smoking-0.034
 knocked-0.033
 stole-0.032
Token opened
Token ID4721
Feature activation+4.85e-01

Pos logit contributions
 driving+0.035
 smoking+0.035
 parked+0.034
anes+0.033
 CA+0.031
Neg logit contributions
 satisfaction-0.057
 pleasure-0.056
 delight-0.055
 fascination-0.054
 accomplishment-0.054
Token it
Token ID340
Feature activation-4.61e-01

Pos logit contributions
 satisfaction+0.027
 pleasure+0.027
 accomplishment+0.027
 fascination+0.025
 delight+0.025
Neg logit contributions
 parked-0.015
 driving-0.014
 smoking-0.014
 stole-0.013
 knocked-0.013
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
 parked+0.017
 stole+0.016
 driving+0.016
 smoking+0.016
 disturbed+0.015
Neg logit contributions
 pleasure-0.022
 accomplishment-0.022
 satisfaction-0.022
 delight-0.022
 fascination-0.021
Token But
Token ID887
Feature activation+1.75e-01

Pos logit contributions
 parked+0.025
 stole+0.023
 smoking+0.023
 driving+0.023
 disturbed+0.022
Neg logit contributions
 pleasure-0.027
 satisfaction-0.026
 accomplishment-0.025
 delight-0.025
 fascination-0.024
Token she
Token ID673
Feature activation-1.76e-01

Pos logit contributions
 satisfaction+0.009
 pleasure+0.009
 accomplishment+0.009
 delight+0.009
 fascination+0.009
Neg logit contributions
 parked-0.006
 driving-0.005
 smoking-0.005
 stole-0.005
 knocked-0.005
Token forgot
Token ID16453
Feature activation-8.02e-01

Pos logit contributions
 parked+0.006
 driving+0.005
 smoking+0.005
 stole+0.005
 disturbed+0.005
Neg logit contributions
 satisfaction-0.009
 pleasure-0.009
 accomplishment-0.009
 delight-0.009
 fascination-0.009
Token about
Token ID546
Feature activation-4.51e-01

Pos logit contributions
 parked+0.031
 stole+0.029
 driving+0.029
 smoking+0.028
 driven+0.027
Neg logit contributions
 satisfaction-0.036
 pleasure-0.036
 accomplishment-0.036
 delight-0.034
 fascination-0.033
Token her
Token ID607
Feature activation-4.52e-01

Pos logit contributions
 parked+0.017
 stole+0.016
 driving+0.016
 smoking+0.016
 broke+0.015
Neg logit contributions
 satisfaction-0.022
 pleasure-0.022
 accomplishment-0.021
 delight-0.020
 fascination-0.020
Token bike
Token ID7161
Feature activation-1.18e+00

Pos logit contributions
 parked+0.019
 stole+0.018
 smoking+0.018
 driving+0.018
 disturbed+0.017
Neg logit contributions
 pleasure-0.020
 accomplishment-0.020
 satisfaction-0.020
 delight-0.019
 fascination-0.019
Token and
Token ID290
Feature activation-1.51e-01

Pos logit contributions
 pleasure+0.097
 satisfaction+0.093
 accomplishment+0.093
 fascination+0.084
 accomplishments+0.083
Neg logit contributions
 parked-0.077
 broke-0.072
 knocked-0.072
 stole-0.070
 disturbed-0.067
Token Tom
Token ID4186
Feature activation+1.45e+00

Pos logit contributions
 parked+0.005
 driving+0.005
 stole+0.005
 smoking+0.005
 disturbed+0.005
Neg logit contributions
 satisfaction-0.008
 pleasure-0.008
 accomplishment-0.008
 delight-0.007
 fascination-0.007
Token nodded
Token ID14464
Feature activation+9.74e-01

Pos logit contributions
 accomplishment+0.073
 pleasure+0.073
 satisfaction+0.070
 accomplishments+0.065
 fascination+0.064
Neg logit contributions
 parked-0.060
 broke-0.057
 stole-0.057
 knocked-0.055
 disturbed-0.054
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
 pleasure+0.061
 satisfaction+0.060
 accomplishment+0.059
 fascination+0.056
 delight+0.055
Neg logit contributions
 parked-0.024
 smoking-0.023
 driving-0.023
 disturbed-0.021
 broke-0.021
Token They
Token ID1119
Feature activation+8.67e-01

Pos logit contributions
 satisfaction+0.005
 pleasure+0.005
 accomplishment+0.005
 delight+0.005
 fascination+0.005
Neg logit contributions
 parked-0.004
 driving-0.004
 smoking-0.004
 stole-0.004
 knocked-0.003
Token hoped
Token ID10719
Feature activation-1.64e-01

Pos logit contributions
 pleasure+0.049
 accomplishment+0.049
 satisfaction+0.048
 fascination+0.046
 delight+0.045
Neg logit contributions
 parked-0.028
 stole-0.027
 broke-0.026
 knocked-0.025
 disturbed-0.025
Token Mom
Token ID11254
Feature activation-3.03e-01

Pos logit contributions
 parked+0.006
 stole+0.005
 driving+0.005
 smoking+0.005
 disturbed+0.005
Neg logit contributions
 satisfaction-0.009
 pleasure-0.009
 accomplishment-0.009
 delight-0.008
 fascination-0.008
Tokenmy
Token ID1820
Feature activation-6.53e-01

Pos logit contributions
 parked+0.012
 driving+0.012
 smoking+0.012
 stole+0.012
 disturbed+0.011
Neg logit contributions
 satisfaction-0.014
 pleasure-0.013
 accomplishment-0.013
 fascination-0.013
 delight-0.012
Token would
Token ID561
Feature activation+1.14e+00

Pos logit contributions
 parked+0.024
 smoking+0.023
 driving+0.023
 stole+0.022
 disturbed+0.021
Neg logit contributions
 satisfaction-0.031
 pleasure-0.031
 accomplishment-0.030
 fascination-0.029
 delight-0.028
Token forgive
Token ID20927
Feature activation+9.35e-01

Pos logit contributions
 satisfaction+0.069
 pleasure+0.068
 accomplishment+0.068
 fascination+0.067
 delight+0.065
Neg logit contributions
 parked-0.030
 driving-0.027
 smoking-0.026
 knocked-0.026
 hurt-0.025
Token them
Token ID606
Feature activation+3.80e-01

Pos logit contributions
 pleasure+0.054
 satisfaction+0.054
 accomplishment+0.053
 fascination+0.050
 delight+0.049
Neg logit contributions
 smoking-0.029
 parked-0.028
 driving-0.026
 disturbed-0.025
 stole-0.025
Token he
Token ID339
Feature activation-5.57e-01

Pos logit contributions
 parked+0.024
 stole+0.022
angs+0.022
 smoking+0.021
cher+0.021
Neg logit contributions
 accomplishment-0.046
 satisfaction-0.046
 pleasure-0.044
 delight-0.044
 fascination-0.041
Token woke
Token ID19092
Feature activation+1.05e-01

Pos logit contributions
 parked+0.019
 driving+0.019
 smoking+0.019
 blocking+0.017
 stole+0.017
Neg logit contributions
 satisfaction-0.028
 accomplishment-0.027
 pleasure-0.027
 delight-0.027
 fascination-0.026
Token up
Token ID510
Feature activation+7.66e-01

Pos logit contributions
 satisfaction+0.005
 pleasure+0.005
 accomplishment+0.005
 delight+0.004
 fascination+0.004
Neg logit contributions
 parked-0.004
 driving-0.004
 smoking-0.004
 stole-0.004
 disturbed-0.004
Token from
Token ID422
Feature activation-9.03e-01

Pos logit contributions
 satisfaction+0.033
 pleasure+0.033
 fascination+0.031
 accomplishment+0.031
 delight+0.030
Neg logit contributions
 parked-0.034
 driving-0.033
 smoking-0.033
 stole-0.032
 broke-0.031
Token his
Token ID465
Feature activation+6.18e-01

Pos logit contributions
 parked+0.033
 stole+0.033
 smoking+0.032
 disturbed+0.030
 CA+0.030
Neg logit contributions
 accomplishment-0.045
 satisfaction-0.043
 pleasure-0.043
 delight-0.041
 fascination-0.041
Token nap
Token ID25422
Feature activation+7.02e-01

Pos logit contributions
 satisfaction+0.031
 pleasure+0.030
 accomplishment+0.029
 fascination+0.028
 delight+0.028
Neg logit contributions
 parked-0.023
 driving-0.022
 smoking-0.021
 stole-0.021
 broke-0.021
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 satisfaction+0.028
 pleasure+0.028
 accomplishment+0.026
 fascination+0.026
 delight+0.025
Neg logit contributions
 parked-0.033
 smoking-0.032
 driving-0.032
 stole-0.031
 broke-0.031
Token he
Token ID339
Feature activation-3.07e-01

Pos logit contributions
 parked+0.040
angs+0.038
 CA+0.037
 stole+0.037
through+0.036
Neg logit contributions
 satisfaction-0.072
 accomplishment-0.071
 pleasure-0.071
 delight-0.069
 fascination-0.067
Token felt
Token ID2936
Feature activation+2.25e-01

Pos logit contributions
 parked+0.012
 smoking+0.012
 driving+0.012
 stole+0.011
 blocking+0.011
Neg logit contributions
 satisfaction-0.014
 pleasure-0.014
 accomplishment-0.014
 delight-0.013
 fascination-0.013
Token refreshed
Token ID47193
Feature activation+1.30e+00

Pos logit contributions
 satisfaction+0.011
 pleasure+0.011
 accomplishment+0.011
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.009
 driving-0.008
 smoking-0.008
 stole-0.008
 disturbed-0.008
Token and
Token ID290
Feature activation-4.23e-01

Pos logit contributions
 pleasure+0.062
 satisfaction+0.059
 fascination+0.058
 accomplishment+0.057
 delight+0.054
Neg logit contributions
 parked-0.056
 driving-0.054
 smoking-0.053
 broke-0.049
 disturbed-0.049
Token mom
Token ID1995
Feature activation-8.15e-01

Pos logit contributions
 parked+0.007
 smoking+0.006
 stole+0.006
 driving+0.006
 disturbed+0.006
Neg logit contributions
 pleasure-0.007
 satisfaction-0.007
 accomplishment-0.007
 delight-0.007
 fascination-0.007
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
 parked+0.032
 stole+0.032
 CA+0.031
 disturbed+0.031
 smoking+0.030
Neg logit contributions
 pleasure-0.035
 satisfaction-0.034
 accomplishment-0.034
 delight-0.033
 fascination-0.032
Token They
Token ID1119
Feature activation+5.83e-01

Pos logit contributions
 parked+0.024
 driving+0.023
 stole+0.023
 smoking+0.023
 blocking+0.022
Neg logit contributions
 pleasure-0.027
 satisfaction-0.026
 delight-0.026
 accomplishment-0.025
 fascination-0.025
Token see
Token ID766
Feature activation-8.42e-01

Pos logit contributions
 satisfaction+0.030
 accomplishment+0.030
 pleasure+0.030
 delight+0.028
 fascination+0.028
Neg logit contributions
 parked-0.022
 stole-0.020
 driving-0.020
 knocked-0.019
 smoking-0.019
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
 parked+0.029
 stole+0.028
 smoking+0.027
 driving+0.027
 knocked+0.026
Neg logit contributions
 pleasure-0.042
 satisfaction-0.041
 delight-0.041
 accomplishment-0.039
 fascination-0.039
Token lot
Token ID1256
Feature activation-5.03e-01

Pos logit contributions
 parked+0.047
 driving+0.044
 smoking+0.044
 stole+0.044
 disturbed+0.042
Neg logit contributions
 pleasure-0.051
 satisfaction-0.051
 accomplishment-0.050
 delight-0.048
 difference-0.047
Token of
Token ID286
Feature activation-1.71e-01

Pos logit contributions
 parked+0.017
 stole+0.016
 smoking+0.016
 driving+0.016
 knocked+0.015
Neg logit contributions
 satisfaction-0.026
 accomplishment-0.026
 pleasure-0.026
 delight-0.025
 fascination-0.024
Token rocks
Token ID12586
Feature activation-1.58e+00

Pos logit contributions
 parked+0.007
 smoking+0.006
 driving+0.006
 stole+0.006
 knocked+0.006
Neg logit contributions
 satisfaction-0.008
 pleasure-0.008
 accomplishment-0.008
 delight-0.008
 fascination-0.008
Token on
Token ID319
Feature activation-8.73e-02

Pos logit contributions
 parked+0.075
 stole+0.075
Ro+0.075
 CA+0.074
 smoking+0.073
Neg logit contributions
 accomplishment-0.049
 pleasure-0.045
 satisfaction-0.045
 delight-0.044
 difference-0.042
Token the
Token ID262
Feature activation+7.13e-02

Pos logit contributions
 parked+0.003
 stole+0.003
 driving+0.003
 smoking+0.003
 disturbed+0.003
Neg logit contributions
 satisfaction-0.004
 pleasure-0.004
 accomplishment-0.004
 delight-0.004
 fascination-0.004
Token ground
Token ID2323
Feature activation-1.22e+00

Pos logit contributions
 satisfaction+0.003
 pleasure+0.003
 accomplishment+0.003
 delight+0.003
 fascination+0.003
Neg logit contributions
 parked-0.003
 driving-0.003
 smoking-0.003
 stole-0.003
 disturbed-0.002
Token man
Token ID582
Feature activation+5.52e-01

Pos logit contributions
 satisfaction+0.001
 pleasure+0.001
 accomplishment+0.001
 delight+0.001
 fascination+0.001
Neg logit contributions
 parked-0.001
 driving-0.001
 smoking-0.001
 stole-0.001
 disturbed-0.001
Token told
Token ID1297
Feature activation+7.70e-01

Pos logit contributions
 pleasure+0.028
 satisfaction+0.028
 accomplishment+0.027
 delight+0.025
 fascination+0.025
Neg logit contributions
 parked-0.021
 stole-0.019
 driving-0.019
 smoking-0.019
 knocked-0.019
Token them
Token ID606
Feature activation+5.71e-01

Pos logit contributions
 pleasure+0.047
 satisfaction+0.046
 fascination+0.044
 accomplishment+0.044
 delight+0.042
Neg logit contributions
 parked-0.020
 driving-0.019
 smoking-0.019
 stole-0.019
 disturbed-0.018
Token that
Token ID326
Feature activation-7.20e-01

Pos logit contributions
 satisfaction+0.033
 pleasure+0.033
 accomplishment+0.032
 fascination+0.031
 delight+0.031
Neg logit contributions
 parked-0.017
 driving-0.016
 smoking-0.016
 stole-0.016
 knocked-0.015
Token the
Token ID262
Feature activation+2.53e-01

Pos logit contributions
 parked+0.022
 stole+0.020
 driving+0.020
 smoking+0.019
 knocked+0.019
Neg logit contributions
 satisfaction-0.039
 pleasure-0.039
 accomplishment-0.039
 delight-0.037
 fascination-0.037
Token factory
Token ID8860
Feature activation-3.78e-01

Pos logit contributions
 satisfaction+0.013
 pleasure+0.013
 accomplishment+0.012
 delight+0.012
 fascination+0.012
Neg logit contributions
 parked-0.009
 driving-0.009
 smoking-0.009
 stole-0.009
 disturbed-0.008
Token made
Token ID925
Feature activation+6.63e-01

Pos logit contributions
 parked+0.013
 smoking+0.012
 driving+0.012
 stole+0.012
 disturbed+0.011
Neg logit contributions
 satisfaction-0.020
 pleasure-0.019
 accomplishment-0.019
 delight-0.018
 fascination-0.018
Token toys
Token ID14958
Feature activation+9.17e-01

Pos logit contributions
 satisfaction+0.036
 pleasure+0.036
 accomplishment+0.035
 fascination+0.034
 delight+0.034
Neg logit contributions
 parked-0.021
 driving-0.020
 smoking-0.020
 stole-0.019
 disturbed-0.019
Token for
Token ID329
Feature activation+2.05e-01

Pos logit contributions
 satisfaction+0.045
 pleasure+0.044
 accomplishment+0.043
 delight+0.041
 fascination+0.041
Neg logit contributions
 parked-0.036
 driving-0.034
 smoking-0.033
 stole-0.033
 broke-0.032
Token children
Token ID1751
Feature activation-1.38e-01

Pos logit contributions
 satisfaction+0.011
 pleasure+0.011
 accomplishment+0.010
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.007
 driving-0.007
 smoking-0.007
 stole-0.007
 disturbed-0.006
Token all
Token ID477
Feature activation-3.11e-01

Pos logit contributions
 parked+0.005
 smoking+0.005
 driving+0.005
 stole+0.005
 knocked+0.004
Neg logit contributions
 satisfaction-0.007
 pleasure-0.007
 accomplishment-0.007
 delight-0.006
 fascination-0.006
Token friends
Token ID2460
Feature activation+1.03e+00

Pos logit contributions
 satisfaction+0.002
 pleasure+0.002
 accomplishment+0.002
 delight+0.001
 fascination+0.001
Neg logit contributions
 parked-0.001
 smoking-0.001
 driving-0.001
 stole-0.001
 disturbed-0.001
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 satisfaction+0.059
 pleasure+0.058
 accomplishment+0.058
 fascination+0.056
 delight+0.056
Neg logit contributions
 parked-0.031
 driving-0.029
 smoking-0.027
 stole-0.027
 blocking-0.025
Token<|endoftext|>
Token ID50256
Feature activation-1.04e+00

Pos logit contributions
 accomplishment+0.067
 satisfaction+0.065
 fascination+0.063
 pleasure+0.063
 delight+0.061
Neg logit contributions
 parked-0.034
 driving-0.030
 stole-0.030
 smoking-0.029
 CA-0.027
TokenOne
Token ID3198
Feature activation+3.21e-03

Pos logit contributions
 parked+0.041
 smoking+0.039
 stole+0.039
 driving+0.039
 knocked+0.037
Neg logit contributions
 pleasure-0.048
 satisfaction-0.047
 accomplishment-0.044
 delight-0.044
 fascination-0.043
Token day
Token ID1110
Feature activation-2.31e-01

Pos logit contributions
 satisfaction+0.000
 pleasure+0.000
 accomplishment+0.000
 delight+0.000
 fascination+0.000
Neg logit contributions
 parked-0.000
 smoking-0.000
 driving-0.000
 stole-0.000
 knocked-0.000
Token,
Token ID11
Feature activation-8.23e-01

Pos logit contributions
 parked+0.013
 smoking+0.012
 driving+0.012
 stole+0.012
 disturbed+0.011
Neg logit contributions
 pleasure-0.007
 satisfaction-0.007
 accomplishment-0.007
 delight-0.007
 fascination-0.007
Token Jim
Token ID5395
Feature activation+4.04e-01

Pos logit contributions
 parked+0.026
 smoking+0.026
 driving+0.025
 stole+0.024
 blocking+0.022
Neg logit contributions
 pleasure-0.044
 satisfaction-0.043
 accomplishment-0.041
 delight-0.041
 fascination-0.041
Token went
Token ID1816
Feature activation-1.62e+00

Pos logit contributions
 pleasure+0.022
 satisfaction+0.022
 accomplishment+0.022
 delight+0.021
 fascination+0.021
Neg logit contributions
 parked-0.014
 stole-0.012
 driving-0.012
 smoking-0.012
 knocked-0.012
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
 parked+0.057
 stole+0.057
 smoking+0.056
 disturbed+0.055
 blocked+0.053
Neg logit contributions
 accomplishment-0.075
 pleasure-0.075
 satisfaction-0.072
 delight-0.070
 accomplishments-0.066
Token the
Token ID262
Feature activation-2.24e+00

Pos logit contributions
 smoking+0.041
 parked+0.041
 stole+0.040
 blocking+0.039
 broke+0.039
Neg logit contributions
 pleasure-0.045
 satisfaction-0.043
 delight-0.042
 accomplishment-0.042
 fascination-0.041
Token zoo
Token ID26626
Feature activation-1.44e+00

Pos logit contributions
 bitten+0.107
cher+0.105
angs+0.105
 blocked+0.103
aro+0.102
Neg logit contributions
 pleasure-0.077
 delight-0.075
 difference-0.072
 satisfaction-0.072
 amaz-0.070
Token that
Token ID326
Feature activation-5.17e-01

Pos logit contributions
 parked+0.026
 driving+0.024
 stole+0.024
 smoking+0.023
 knocked+0.023
Neg logit contributions
 accomplishment-0.041
 satisfaction-0.041
 pleasure-0.040
 delight-0.039
 fascination-0.036
Token being
Token ID852
Feature activation-1.05e+00

Pos logit contributions
 parked+0.017
 stole+0.015
 driving+0.015
 smoking+0.015
 knocked+0.014
Neg logit contributions
 pleasure-0.028
 satisfaction-0.028
 accomplishment-0.027
 delight-0.026
 fascination-0.026
Token different
Token ID1180
Feature activation+8.37e-01

Pos logit contributions
 parked+0.046
 smoking+0.043
 driving+0.043
 stole+0.042
 knocked+0.040
Neg logit contributions
 pleasure-0.045
 satisfaction-0.045
 accomplishment-0.042
 delight-0.041
 difference-0.038
Token was
Token ID373
Feature activation-8.64e-01

Pos logit contributions
 satisfaction+0.042
 accomplishment+0.041
 pleasure+0.041
 fascination+0.039
 delight+0.039
Neg logit contributions
 parked-0.031
 driving-0.030
 stole-0.030
 smoking-0.029
 disturbed-0.028
Token okay
Token ID8788
Feature activation-1.88e-01

Pos logit contributions
 parked+0.033
 stole+0.031
 driving+0.030
 smoking+0.030
saw+0.030
Neg logit contributions
 pleasure-0.041
 satisfaction-0.041
 accomplishment-0.040
 delight-0.037
 fascination-0.036
Token and
Token ID290
Feature activation-8.22e-01

Pos logit contributions
 parked+0.009
 driving+0.008
 stole+0.008
 smoking+0.008
 knocked+0.008
Neg logit contributions
 pleasure-0.008
 satisfaction-0.008
 accomplishment-0.007
 delight-0.007
 fascination-0.007
Token they
Token ID484
Feature activation+7.05e-02

Pos logit contributions
 parked+0.026
 driving+0.024
 smoking+0.023
 stole+0.023
 knocked+0.021
Neg logit contributions
 satisfaction-0.045
 pleasure-0.045
 accomplishment-0.044
 delight-0.042
 difference-0.040
Token all
Token ID477
Feature activation-4.67e-01

Pos logit contributions
 satisfaction+0.004
 pleasure+0.004
 accomplishment+0.004
 delight+0.004
 fascination+0.004
Neg logit contributions
 parked-0.002
 driving-0.002
 smoking-0.002
 stole-0.002
 disturbed-0.002
Token became
Token ID2627
Feature activation+8.09e-03

Pos logit contributions
 parked+0.017
 smoking+0.016
 driving+0.016
 stole+0.015
 knocked+0.015
Neg logit contributions
 satisfaction-0.023
 pleasure-0.023
 accomplishment-0.022
 delight-0.022
 fascination-0.021
Token friends
Token ID2460
Feature activation+1.36e+00

Pos logit contributions
 satisfaction+0.000
 pleasure+0.000
 accomplishment+0.000
 delight+0.000
 fascination+0.000
Neg logit contributions
 parked-0.000
 driving-0.000
 smoking-0.000
 stole-0.000
 knocked-0.000
Token again
Token ID757
Feature activation+9.03e-01

Pos logit contributions
 accomplishment+0.084
 satisfaction+0.084
 fascination+0.082
 pleasure+0.082
 delight+0.080
Neg logit contributions
 parked-0.033
 driving-0.031
 stole-0.030
 smoking-0.029
 broke-0.029
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
Ro+0.158
 bitten+0.146
CA+0.146
cher+0.145
 CA+0.144
Neg logit contributions
 accomplishment-0.085
 difference-0.081
 decision-0.079
 delight-0.077
 pleasure-0.076
Token Joy
Token ID14087
Feature activation+5.07e-01

Pos logit contributions
 satisfaction+0.021
 accomplishment+0.021
 pleasure+0.021
 fascination+0.020
 delight+0.019
Neg logit contributions
 parked-0.017
 stole-0.015
 smoking-0.015
 driving-0.015
 knocked-0.015
Token saw
Token ID2497
Feature activation-3.98e-01

Pos logit contributions
 satisfaction+0.030
 pleasure+0.030
 accomplishment+0.030
 delight+0.028
 fascination+0.028
Neg logit contributions
 parked-0.015
 stole-0.013
 driving-0.013
 smoking-0.013
 knocked-0.013
Token the
Token ID262
Feature activation-1.85e-01

Pos logit contributions
 parked+0.013
 driving+0.012
 smoking+0.012
 stole+0.012
 knocked+0.011
Neg logit contributions
 pleasure-0.022
 satisfaction-0.021
 accomplishment-0.021
 delight-0.021
 fascination-0.020
Token billboard
Token ID41448
Feature activation-6.92e-01

Pos logit contributions
 parked+0.006
 smoking+0.006
 stole+0.006
 driving+0.006
 knocked+0.005
Neg logit contributions
 satisfaction-0.010
 pleasure-0.010
 accomplishment-0.010
 delight-0.009
 fascination-0.009
Token and
Token ID290
Feature activation-3.00e-01

Pos logit contributions
 parked+0.036
 stole+0.035
 smoking+0.035
 driving+0.034
 disturbed+0.034
Neg logit contributions
 accomplishment-0.022
 satisfaction-0.022
 pleasure-0.021
 delight-0.020
 fascination-0.020
Token wanted
Token ID2227
Feature activation-1.57e+00

Pos logit contributions
 parked+0.009
 smoking+0.008
 driving+0.008
 stole+0.008
 broke+0.007
Neg logit contributions
 satisfaction-0.017
 pleasure-0.017
 accomplishment-0.017
 delight-0.016
 fascination-0.016
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
 parked+0.058
Ro+0.052
 driving+0.052
 CA+0.051
 stole+0.051
Neg logit contributions
 pleasure-0.076
 satisfaction-0.073
 delight-0.071
 accomplishment-0.069
 joy-0.067
Token balance
Token ID5236
Feature activation+7.04e-01

Pos logit contributions
 smoking+0.056
 parked+0.054
 blocking+0.054
 driving+0.053
 blocked+0.051
Neg logit contributions
 satisfaction-0.056
 pleasure-0.055
 delight-0.055
 accomplishment-0.052
 fascination-0.049
Token on
Token ID319
Feature activation+1.60e+00

Pos logit contributions
 pleasure+0.044
 satisfaction+0.044
 accomplishment+0.043
 fascination+0.042
 delight+0.042
Neg logit contributions
 parked-0.017
 driving-0.015
 smoking-0.015
 stole-0.014
 broke-0.014
Token top
Token ID1353
Feature activation-1.43e+00

Pos logit contributions
 satisfaction+0.099
 pleasure+0.098
 accomplishment+0.096
 fascination+0.092
 accomplishments+0.091
Neg logit contributions
 parked-0.036
 driving-0.034
 smoking-0.034
 stole-0.032
 blocking-0.032
Token car
Token ID1097
Feature activation-1.99e+00

Pos logit contributions
 bitten+0.112
 stole+0.112
 disturbed+0.110
through+0.107
 smoking+0.105
Neg logit contributions
 pleasure-0.137
 accomplishment-0.133
 delight-0.130
 difference-0.129
 goodness-0.126
Token that
Token ID326
Feature activation-2.14e+00

Pos logit contributions
 smoking+0.081
Ro+0.080
 CA+0.080
 disturbed+0.079
 stole+0.079
Neg logit contributions
 pleasure-0.070
 accomplishment-0.069
 satisfaction-0.066
 delight-0.064
 goodness-0.062
Token he
Token ID339
Feature activation-1.79e+00

Pos logit contributions
 smoking+0.067
 guarding+0.066
through+0.064
 parked+0.064
angs+0.063
Neg logit contributions
 pleasure-0.102
 satisfaction-0.099
 accomplishment-0.094
 delight-0.094
 fascination-0.089
Token played
Token ID2826
Feature activation-1.77e+00

Pos logit contributions
 smoking+0.083
 CA+0.077
 blocking+0.076
saw+0.075
through+0.075
Neg logit contributions
 pleasure-0.061
 satisfaction-0.060
 accomplishment-0.059
 fascination-0.053
 delight-0.052
Token with
Token ID351
Feature activation-1.31e+00

Pos logit contributions
 stole+0.078
 smoking+0.074
 parked+0.073
angs+0.073
 CA+0.072
Neg logit contributions
 accomplishment-0.067
 satisfaction-0.062
 delight-0.062
 pleasure-0.061
 accomplishments-0.059
Token every
Token ID790
Feature activation-2.85e+00

Pos logit contributions
 parked+0.060
 smoking+0.058
 stole+0.057
 CA+0.055
 blocked+0.054
Neg logit contributions
 accomplishment-0.047
 satisfaction-0.047
 pleasure-0.045
 delight-0.044
 fascination-0.042
Token day
Token ID1110
Feature activation-1.96e+00

Pos logit contributions
 smoking+0.176
angs+0.175
 stole+0.171
cher+0.168
 disturbed+0.166
Neg logit contributions
 accomplishment-0.059
 pleasure-0.057
 satisfaction-0.050
 delight-0.047
 agreement-0.043
Token.
Token ID13
Feature activation-3.00e-01

Pos logit contributions
 CA+0.091
 stole+0.090
Ro+0.087
 parked+0.085
 Enter+0.080
Neg logit contributions
 delight-0.070
 accomplishment-0.069
 satisfaction-0.069
 pleasure-0.067
 until-0.061
Token One
Token ID1881
Feature activation-2.01e-01

Pos logit contributions
 parked+0.013
 stole+0.012
 smoking+0.012
 driving+0.012
 disturbed+0.011
Neg logit contributions
 pleasure-0.014
 satisfaction-0.013
 accomplishment-0.013
 delight-0.013
 fascination-0.012
Token day
Token ID1110
Feature activation-7.18e-01

Pos logit contributions
 parked+0.009
 smoking+0.009
 stole+0.009
 driving+0.009
 disturbed+0.009
Neg logit contributions
 satisfaction-0.008
 pleasure-0.008
 accomplishment-0.008
 delight-0.007
 fascination-0.007
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
 parked+0.043
 smoking+0.040
 stole+0.040
 driving+0.040
 blocking+0.039
Neg logit contributions
 pleasure-0.018
 satisfaction-0.018
 accomplishment-0.017
 delight-0.016
 fascination-0.015
Token one
Token ID530
Feature activation-9.59e-01

Pos logit contributions
 satisfaction+0.006
 pleasure+0.006
 accomplishment+0.006
 delight+0.005
 fascination+0.005
Neg logit contributions
 parked-0.008
 driving-0.008
 smoking-0.008
 stole-0.008
 disturbed-0.008
Token day
Token ID1110
Feature activation-7.04e-01

Pos logit contributions
 parked+0.051
 driving+0.049
 smoking+0.048
 stole+0.047
 blocking+0.047
Neg logit contributions
 satisfaction-0.033
 pleasure-0.032
 accomplishment-0.030
 delight-0.030
 fascination-0.028
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
 parked+0.041
 smoking+0.039
 driving+0.039
 stole+0.038
 blocking+0.036
Neg logit contributions
 satisfaction-0.021
 pleasure-0.020
 accomplishment-0.019
 delight-0.019
 fascination-0.017
Token a
Token ID257
Feature activation-1.06e+00

Pos logit contributions
 parked+0.041
 smoking+0.038
 driving+0.038
 stole+0.036
 driven+0.035
Neg logit contributions
 satisfaction-0.053
 pleasure-0.052
 accomplishment-0.051
 delight-0.050
 fascination-0.049
Token small
Token ID1402
Feature activation-3.65e-01

Pos logit contributions
 parked+0.048
 driving+0.044
 stole+0.044
 smoking+0.044
 blocking+0.042
Neg logit contributions
 pleasure-0.045
 satisfaction-0.044
 accomplishment-0.043
 delight-0.041
 fascination-0.039
Token animal
Token ID5044
Feature activation-3.33e+00

Pos logit contributions
 parked+0.012
 driving+0.011
 stole+0.011
 smoking+0.011
 knocked+0.011
Neg logit contributions
 satisfaction-0.019
 pleasure-0.019
 accomplishment-0.019
 delight-0.018
 fascination-0.018
Token came
Token ID1625
Feature activation-1.36e+00

Pos logit contributions
 parked+0.117
flash+0.114
through+0.113
anes+0.113
chard+0.113
Neg logit contributions
 delight-0.119
 moments-0.113
 hope-0.110
 nod-0.110
 satisfaction-0.109
Token along
Token ID1863
Feature activation-9.44e-01

Pos logit contributions
 parked+0.058
 stole+0.056
 smoking+0.054
 broke+0.051
 blocked+0.051
Neg logit contributions
 pleasure-0.053
 accomplishment-0.051
 fascination-0.051
 satisfaction-0.050
 delight-0.050
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 parked+0.043
 stole+0.042
 smoking+0.040
 driving+0.040
 broke+0.039
Neg logit contributions
 satisfaction-0.036
 accomplishment-0.036
 delight-0.034
 pleasure-0.034
 fascination-0.033
Token It
Token ID632
Feature activation-1.68e+00

Pos logit contributions
 parked+0.025
 driving+0.024
 smoking+0.023
 stole+0.023
 knocked+0.022
Neg logit contributions
 satisfaction-0.028
 pleasure-0.028
 accomplishment-0.027
 delight-0.027
 fascination-0.026
Token was
Token ID373
Feature activation-1.76e+00

Pos logit contributions
saw+0.065
 driving+0.064
 smoking+0.064
anes+0.063
 parked+0.061
Neg logit contributions
 delight-0.067
 satisfaction-0.067
 pleasure-0.065
 accomplishment-0.065
 fascination-0.064
Token big
Token ID1263
Feature activation+9.19e-01

Pos logit contributions
 parked+0.022
 driving+0.020
 stole+0.020
 smoking+0.020
 knocked+0.019
Neg logit contributions
 pleasure-0.024
 satisfaction-0.024
 accomplishment-0.024
 delight-0.023
 fascination-0.022
Token,
Token ID11
Feature activation-8.67e-02

Pos logit contributions
 satisfaction+0.050
 pleasure+0.049
 accomplishment+0.048
 fascination+0.046
 delight+0.045
Neg logit contributions
 parked-0.032
 smoking-0.030
 driving-0.030
 stole-0.029
 broke-0.028
Token scary
Token ID14343
Feature activation-2.57e-02

Pos logit contributions
 parked+0.003
 driving+0.003
 smoking+0.003
 stole+0.003
 knocked+0.003
Neg logit contributions
 satisfaction-0.004
 pleasure-0.004
 accomplishment-0.004
 delight-0.004
 fascination-0.004
Token owl
Token ID39610
Feature activation-2.17e+00

Pos logit contributions
 parked+0.001
 driving+0.001
 smoking+0.001
 stole+0.001
 knocked+0.001
Neg logit contributions
 satisfaction-0.001
 pleasure-0.001
 accomplishment-0.001
 delight-0.001
 fascination-0.001
Token and
Token ID290
Feature activation-2.32e+00

Pos logit contributions
 Crash+0.093
 Bang+0.092
 CA+0.092
Ro+0.092
 Zap+0.089
Neg logit contributions
 goodness-0.064
 delight-0.064
 accomplishment-0.063
 fascination-0.063
 satisfaction-0.062
Token it
Token ID340
Feature activation-2.87e+00

Pos logit contributions
anes+0.095
 driving+0.088
angs+0.087
 parked+0.086
iren+0.086
Neg logit contributions
 satisfaction-0.093
 delight-0.089
ya-0.087
 fascination-0.085
 accomplishment-0.084
Token scared
Token ID12008
Feature activation-3.30e-01

Pos logit contributions
anes+0.130
 spies+0.125
saw+0.125
 driving+0.118
 Enter+0.116
Neg logit contributions
 delight-0.091
 satisfaction-0.085
 accomplishment-0.084
 fascination-0.081
 decision-0.078
Token her
Token ID607
Feature activation-7.12e-01

Pos logit contributions
 parked+0.015
 driving+0.013
 stole+0.013
 smoking+0.013
 knocked+0.013
Neg logit contributions
 satisfaction-0.014
 pleasure-0.014
 accomplishment-0.014
 delight-0.014
 fascination-0.013
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
 parked+0.027
 stole+0.025
 smoking+0.024
 driving+0.023
 disturbed+0.023
Neg logit contributions
 pleasure-0.035
 accomplishment-0.034
 satisfaction-0.034
 delight-0.033
 decision-0.032
Token She
Token ID1375
Feature activation-1.22e+00

Pos logit contributions
 parked+0.010
 driving+0.010
 smoking+0.010
 stole+0.010
 disturbed+0.009
Neg logit contributions
 pleasure-0.010
 satisfaction-0.010
 accomplishment-0.009
 delight-0.009
 fascination-0.009
Token ran
Token ID4966
Feature activation+6.20e-01

Pos logit contributions
 smoking+0.046
 driving+0.045
 parked+0.045
saw+0.044
anes+0.043
Neg logit contributions
 satisfaction-0.053
 delight-0.051
 accomplishment-0.050
 fascination-0.050
 pleasure-0.048
Token day
Token ID1110
Feature activation-4.51e-01

Pos logit contributions
 parked+0.018
 smoking+0.017
 stole+0.017
 driving+0.017
 knocked+0.016
Neg logit contributions
 satisfaction-0.014
 pleasure-0.014
 accomplishment-0.014
 delight-0.013
 fascination-0.013
Token,
Token ID11
Feature activation-7.74e-01

Pos logit contributions
 parked+0.026
 smoking+0.024
 stole+0.024
 driving+0.024
 disturbed+0.023
Neg logit contributions
 pleasure-0.013
 satisfaction-0.013
 accomplishment-0.013
 delight-0.012
 fascination-0.011
Token Jane
Token ID12091
Feature activation+2.67e-01

Pos logit contributions
 parked+0.025
 smoking+0.024
 driving+0.024
 stole+0.023
anes+0.022
Neg logit contributions
 pleasure-0.041
 satisfaction-0.040
 accomplishment-0.039
 fascination-0.038
 delight-0.037
Token had
Token ID550
Feature activation-9.55e-01

Pos logit contributions
 satisfaction+0.012
 pleasure+0.012
 accomplishment+0.012
 delight+0.011
 fascination+0.011
Neg logit contributions
 parked-0.011
 stole-0.010
 driving-0.010
 smoking-0.010
 knocked-0.010
Token a
Token ID257
Feature activation+2.23e-01

Pos logit contributions
 parked+0.033
 stole+0.032
 smoking+0.031
 driving+0.030
 blocking+0.030
Neg logit contributions
 pleasure-0.048
 satisfaction-0.048
 delight-0.046
 accomplishment-0.045
 fascination-0.044
Token plan
Token ID1410
Feature activation-3.24e+00

Pos logit contributions
 satisfaction+0.010
 pleasure+0.010
 accomplishment+0.010
 delight+0.010
 fascination+0.010
Neg logit contributions
 parked-0.009
 driving-0.008
 smoking-0.008
 stole-0.008
 knocked-0.008
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
through+0.172
chard+0.169
Ro+0.169
saw+0.167
 stole+0.164
Neg logit contributions
 accomplishment-0.091
 satisfaction-0.080
 pleasure-0.073
 decision-0.072
 accomplishments-0.072
Token She
Token ID1375
Feature activation-7.10e-01

Pos logit contributions
 accomplishment+0.059
 satisfaction+0.057
 pleasure+0.056
 fascination+0.053
 accomplishments+0.053
Neg logit contributions
 parked-0.035
 smoking-0.032
 knocked-0.031
 broke-0.031
 stole-0.031
Token wanted
Token ID2227
Feature activation-1.97e+00

Pos logit contributions
 smoking+0.022
 parked+0.022
 driving+0.022
 blocking+0.021
 stole+0.020
Neg logit contributions
 satisfaction-0.037
 pleasure-0.036
 accomplishment-0.035
 delight-0.034
 fascination-0.034
Token to
Token ID284
Feature activation-2.10e+00

Pos logit contributions
 parked+0.075
cher+0.073
 stole+0.073
 CA+0.070
Ro+0.070
Neg logit contributions
 pleasure-0.085
 satisfaction-0.082
 delight-0.082
 accomplishment-0.081
 enthusiasm-0.077
Token make
Token ID787
Feature activation-1.22e+00

Pos logit contributions
 blocking+0.101
 guarding+0.097
 smoking+0.096
 blocked+0.094
 parked+0.094
Neg logit contributions
 delight-0.084
 satisfaction-0.079
 pleasure-0.078
 accomplishment-0.073
 enthusiasm-0.069
Token Sam
Token ID3409
Feature activation+2.57e-01

Pos logit contributions
 parked+0.034
 driving+0.031
 smoking+0.030
 stole+0.028
through+0.028
Neg logit contributions
 satisfaction-0.055
 accomplishment-0.054
 pleasure-0.054
 delight-0.052
 fascination-0.050
Token.
Token ID13
Feature activation-4.37e-01

Pos logit contributions
 satisfaction+0.013
 pleasure+0.013
 accomplishment+0.013
 delight+0.013
 fascination+0.013
Neg logit contributions
 parked-0.009
 driving-0.008
 smoking-0.008
 stole-0.008
 disturbed-0.008
Token They
Token ID1119
Feature activation-3.28e-01

Pos logit contributions
 parked+0.017
 stole+0.016
 driving+0.016
 smoking+0.016
 knocked+0.015
Neg logit contributions
 pleasure-0.021
 satisfaction-0.021
 accomplishment-0.020
 delight-0.020
 fascination-0.019
Token wished
Token ID16555
Feature activation-3.27e-02

Pos logit contributions
 parked+0.012
 smoking+0.011
 driving+0.011
 stole+0.011
 blocking+0.010
Neg logit contributions
 satisfaction-0.017
 pleasure-0.016
 accomplishment-0.016
 delight-0.016
 fascination-0.015
Token they
Token ID484
Feature activation-1.24e+00

Pos logit contributions
 parked+0.001
 stole+0.001
 driving+0.001
 smoking+0.001
 disturbed+0.001
Neg logit contributions
 satisfaction-0.002
 pleasure-0.002
 accomplishment-0.002
 delight-0.002
 fascination-0.002
Token never
Token ID1239
Feature activation-3.13e+00

Pos logit contributions
 smoking+0.044
 driving+0.042
saw+0.041
 parked+0.041
 CA+0.039
Neg logit contributions
 satisfaction-0.059
 pleasure-0.056
 accomplishment-0.053
 delight-0.053
 fascination-0.051
Token came
Token ID1625
Feature activation-2.03e+00

Pos logit contributions
anes+0.125
aro+0.116
iren+0.113
through+0.113
leon+0.112
Neg logit contributions
 satisfaction-0.117
 pleasure-0.115
 joy-0.114
 happiness-0.112
 promise-0.111
Token to
Token ID284
Feature activation-1.23e+00

Pos logit contributions
 parked+0.100
 stole+0.095
 smoking+0.089
 knocked+0.089
 blocked+0.088
Neg logit contributions
 pleasure-0.069
 difference-0.069
 satisfaction-0.062
 decision-0.062
 delight-0.061
Token his
Token ID465
Feature activation-1.12e-01

Pos logit contributions
 parked+0.055
 stole+0.049
 smoking+0.048
 blocking+0.048
 driving+0.048
Neg logit contributions
 pleasure-0.052
 satisfaction-0.052
 accomplishment-0.051
 delight-0.049
 difference-0.048
Token house
Token ID2156
Feature activation-7.71e-01

Pos logit contributions
 parked+0.005
 driving+0.004
 stole+0.004
 smoking+0.004
 disturbed+0.004
Neg logit contributions
 satisfaction-0.005
 pleasure-0.005
 accomplishment-0.005
 delight-0.005
 fascination-0.005
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
 parked+0.033
 stole+0.032
 smoking+0.031
 driving+0.031
 disturbed+0.030
Neg logit contributions
 accomplishment-0.032
 satisfaction-0.032
 pleasure-0.031
 delight-0.030
 fascination-0.029