TOP ACTIVATIONS
MAX = 13.760

mom asked him , " Which animal do you want to
one animal to feed . Which one do you want ?"
can have one balloon . Which one do you want ?"
bring home with us . Which one do you want ?"
yellow , or white . Which one do you want ?"
buy one toy each . Which one do you want ?"
and chocolate and nuts . Which one do you want ,
, but just one . Which one do you want ?"
He asked Anna , " Which one is your favorite ?"
and read your book . Which one do you want ?"
pie , and more . Which one do you want to
The seed asked , " Which way should I grow at
many sandwiches to sell . Which one would you like ?"
keep one hat each . Which one do you want ?"
friend , Sam , " Which writing place should I go
Emily asked Max , " Which leash do you want ?"
He asks Sue , " Where is the red car ?"
go . " Which way , Tom ?" Lily
man for help . " Which cake is good for my
it with another toy . Which toy do you think matches

MIN ACTIVATIONS
MIN = -10.597

Jake was driving in his je ep . He was so
. She stepped in a p uddle and it made her
a worm . They were pe cking each other and making
Kat ie be amed with joy . " Look
." L ily stood under the tree and counted
. He walked over and pe ered behind it and saw
Tom bent down and pe ered into the hole .
Ab by pe ep ed through the window
her t ippy toes and pe ered outside . Sarah looked
. The mum stood back , watching her daughter
around the big bird and pe ck ed him with his
. He went up and pe ered inside . He noticed
upon a chamber . He pe ered inside and saw a
slowly opened the door and pe ered in . She saw
too wet !" So Sally stood by the f auc et
was finished , the bunny stood back and admired it .
park . As Emma skipped pe b bles across the lake
laughing . George pe ep ed at them and
small mouse . He was pe ering up at the farmer
from a box . She pe els off the paper and

INTERVAL 7.671 - 13.760
CONTAINS 0.230%

her dad , " What if it rains at the picnic
it 's important to mind what your parents say , because
space ." " What is space ?" Ben asked
said to herself , " should I follow the path ?"
Oh no , what should we do ?" The little bird

INTERVAL 1.582 - 7.671
CONTAINS 24.631%

kind . " Okay , I will be more
loved to wash with her favorite shampoo . One day ,
they chose to keep playing in the loud environment ! They
named Lily . She loved to play with her toys and
thinks the p anda is patient and kind . She thinks

INTERVAL -4.508 - 1.582
CONTAINS 70.755%

and powerful feeling in her heart . The elephant
she found a wounded animal in the forest and she decided
the lake . It was so colorful and shiny !
M ia follows him . She feels sorry for him
a big smile on her face . She hugged him and

INTERVAL -10.597 - -4.508
CONTAINS 4.384%

the fox and the rabbit were always happy to share with
candy ! <|endoftext|> Tom my was a 3 year old boy
more caul iflower . She smiled and said , " This
ie looked at the page and revealed it was ink .
The neighbor opened the door and Tim my asked , "
Token mom
Token ID1995
Feature activation-6.52e-01

Pos logit contributions
irling+0.046
eping+0.045
awed+0.043
ep+0.042
ounced+0.042
Neg logit contributions
crow-0.027
 thing-0.027
 lesson-0.026
 apology-0.026
 difference-0.026
Token asked
Token ID1965
Feature activation+9.95e-01

Pos logit contributions
irling+0.015
itching+0.014
eking+0.013
years+0.013
eping+0.013
Neg logit contributions
 end-0.013
crow-0.012
 difference-0.011
 offer-0.010
 thing-0.010
Token him
Token ID683
Feature activation-4.02e-03

Pos logit contributions
crow+0.022
 apology+0.022
 lesson+0.022
 difference+0.022
 deed+0.021
Neg logit contributions
irling-0.021
eping-0.020
ep-0.019
awed-0.019
ounced-0.019
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
irling+0.000
eping+0.000
ounced+0.000
lic+0.000
 exactly+0.000
Neg logit contributions
crow-0.000
 apology-0.000
 difference-0.000
 lesson-0.000
 conclusions-0.000
Token "
Token ID366
Feature activation+1.31e+00

Pos logit contributions
irling+0.051
eping+0.049
awed+0.047
ounced+0.046
ep+0.046
Neg logit contributions
crow-0.047
 thing-0.047
 apology-0.046
 lesson-0.045
 difference-0.044
TokenWhich
Token ID13828
Feature activation+1.20e+01

Pos logit contributions
crow+0.024
 apology+0.024
 lesson+0.024
 difference+0.024
 deed+0.024
Neg logit contributions
irling-0.032
eping-0.032
ep-0.031
awed-0.030
lic-0.030
Token animal
Token ID5044
Feature activation+2.49e+00

Pos logit contributions
 apology+0.215
crow+0.211
 deed+0.206
 lesson+0.200
 zone+0.199
Neg logit contributions
ep-0.288
irling-0.287
eping-0.286
 walking-0.277
ounced-0.276
Token do
Token ID466
Feature activation+4.71e+00

Pos logit contributions
crow+0.042
 thing+0.040
 lesson+0.039
 apology+0.039
 difference+0.039
Neg logit contributions
irling-0.068
eping-0.066
awed-0.063
ounced-0.062
ep-0.062
Token you
Token ID345
Feature activation+5.86e+00

Pos logit contributions
crow+0.092
 apology+0.088
 lesson+0.087
 difference+0.087
 deed+0.086
Neg logit contributions
irling-0.114
eping-0.109
ep-0.106
awed-0.103
ounced-0.103
Token want
Token ID765
Feature activation+5.77e+00

Pos logit contributions
crow+0.089
 thing+0.084
 lesson+0.084
 apology+0.084
 difference+0.083
Neg logit contributions
irling-0.170
eping-0.163
awed-0.160
ounced-0.157
ep-0.157
Token to
Token ID284
Feature activation+5.77e+00

Pos logit contributions
crow+0.121
 apology+0.117
 difference+0.115
 deed+0.115
 lesson+0.114
Neg logit contributions
irling-0.129
eping-0.125
ep-0.120
ounced-0.119
awed-0.119
Token one
Token ID530
Feature activation+6.43e+00

Pos logit contributions
crow+0.060
 apology+0.058
 lesson+0.057
 thing+0.057
 difference+0.057
Neg logit contributions
irling-0.062
eping-0.059
ep-0.056
awed-0.056
ounced-0.056
Token animal
Token ID5044
Feature activation+2.64e+00

Pos logit contributions
 conclusions+0.161
 affect+0.159
 apology+0.157
crow+0.156
 pleasures+0.156
Neg logit contributions
 exactly-0.112
ep-0.109
ents-0.105
eping-0.099
 walking-0.098
Token to
Token ID284
Feature activation+5.37e+00

Pos logit contributions
 difference+0.059
 apology+0.059
 conclusions+0.058
crow+0.058
 deed+0.058
Neg logit contributions
ep-0.050
ounced-0.050
eping-0.050
irling-0.049
 exactly-0.049
Token feed
Token ID3745
Feature activation+3.58e+00

Pos logit contributions
crow+0.096
 lesson+0.092
 apology+0.091
 difference+0.090
 thing+0.090
Neg logit contributions
irling-0.136
eping-0.133
ep-0.131
ounced-0.124
awed-0.124
Token.
Token ID13
Feature activation+3.25e+00

Pos logit contributions
 apology+0.077
 deed+0.075
 difference+0.075
crow+0.074
 lesson+0.073
Neg logit contributions
ep-0.074
eping-0.073
irling-0.072
ents-0.069
awed-0.068
Token Which
Token ID9022
Feature activation+1.20e+01

Pos logit contributions
 lesson+0.063
crow+0.061
 deed+0.061
 thing+0.060
 apology+0.059
Neg logit contributions
irling-0.073
eping-0.072
ents-0.072
lic-0.071
ep-0.070
Token one
Token ID530
Feature activation+6.23e+00

Pos logit contributions
crow+0.232
 apology+0.230
 deed+0.226
 zone+0.224
 difference+0.223
Neg logit contributions
ep-0.279
eping-0.272
irling-0.268
ounced-0.256
awed-0.254
Token do
Token ID466
Feature activation+5.37e+00

Pos logit contributions
crow+0.125
 conclusions+0.122
 apology+0.122
 deed+0.117
 zone+0.115
Neg logit contributions
ep-0.135
eping-0.133
irling-0.133
 walking-0.129
ounced-0.128
Token you
Token ID345
Feature activation+5.91e+00

Pos logit contributions
crow+0.110
 apology+0.106
 difference+0.105
 deed+0.105
 lesson+0.105
Neg logit contributions
irling-0.123
eping-0.117
ep-0.116
eking-0.111
ounced-0.110
Token want
Token ID765
Feature activation+6.10e+00

Pos logit contributions
crow+0.114
 conclusions+0.106
 apology+0.105
 lesson+0.103
 thing+0.103
Neg logit contributions
irling-0.142
eping-0.140
ep-0.136
ounced-0.130
eps-0.130
Token?"
Token ID1701
Feature activation-1.60e+00

Pos logit contributions
crow+0.134
 apology+0.133
 difference+0.132
 deed+0.131
 conclusions+0.130
Neg logit contributions
irling-0.124
eping-0.121
ep-0.120
ounced-0.116
awed-0.114
Token can
Token ID460
Feature activation+4.08e+00

Pos logit contributions
crow+0.061
 apology+0.057
 lesson+0.056
 thing+0.056
 difference+0.055
Neg logit contributions
irling-0.159
eping-0.154
awed-0.149
ep-0.149
ounced-0.148
Token have
Token ID423
Feature activation+3.61e+00

Pos logit contributions
crow+0.078
 zone+0.073
 lesson+0.071
 deed+0.071
 thing+0.069
Neg logit contributions
irling-0.106
eping-0.097
ep-0.097
ents-0.095
 walking-0.093
Token one
Token ID530
Feature activation+5.54e+00

Pos logit contributions
crow+0.078
 apology+0.075
 deed+0.074
 difference+0.074
 lesson+0.073
Neg logit contributions
irling-0.080
eping-0.077
ep-0.074
awed-0.074
ounced-0.073
Token balloon
Token ID21190
Feature activation+7.15e-02

Pos logit contributions
crow+0.132
 conclusions+0.126
keeping+0.124
 deed+0.124
 affect+0.124
Neg logit contributions
ep-0.105
irling-0.100
 exactly-0.097
eping-0.096
 walking-0.095
Token.
Token ID13
Feature activation+3.49e+00

Pos logit contributions
 conclusions+0.002
 deed+0.002
 apology+0.002
 difference+0.002
crow+0.002
Neg logit contributions
ep-0.001
ents-0.001
ounced-0.001
 exactly-0.001
irling-0.001
Token Which
Token ID9022
Feature activation+1.20e+01

Pos logit contributions
 lesson+0.073
 deed+0.070
crow+0.069
 affect+0.068
 choice+0.068
Neg logit contributions
irling-0.071
ents-0.070
years-0.068
lic-0.067
awed-0.067
Token one
Token ID530
Feature activation+6.45e+00

Pos logit contributions
crow+0.251
 deed+0.246
 zone+0.242
 apology+0.240
 conclusions+0.236
Neg logit contributions
ep-0.268
irling-0.250
eping-0.247
awed-0.242
ounced-0.236
Token do
Token ID466
Feature activation+5.47e+00

Pos logit contributions
crow+0.127
 conclusions+0.122
 apology+0.118
 deed+0.118
 zone+0.117
Neg logit contributions
ep-0.146
eping-0.145
irling-0.144
eps-0.140
ounced-0.137
Token you
Token ID345
Feature activation+5.84e+00

Pos logit contributions
crow+0.116
 deed+0.111
 zone+0.110
 difference+0.110
 apology+0.109
Neg logit contributions
irling-0.123
eping-0.114
ep-0.114
eking-0.111
ounced-0.109
Token want
Token ID765
Feature activation+5.99e+00

Pos logit contributions
crow+0.106
 apology+0.098
 thing+0.097
 lesson+0.097
 deed+0.097
Neg logit contributions
irling-0.150
eping-0.146
ep-0.140
ounced-0.138
awed-0.137
Token?"
Token ID1701
Feature activation-1.22e+00

Pos logit contributions
crow+0.137
 deed+0.137
 difference+0.135
 conclusions+0.133
 apology+0.131
Neg logit contributions
irling-0.116
ep-0.113
eping-0.113
ounced-0.110
awed-0.109
Token bring
Token ID2222
Feature activation+2.68e+00

Pos logit contributions
crow+0.097
 lesson+0.092
 apology+0.091
 difference+0.090
 deed+0.090
Neg logit contributions
irling-0.138
eping-0.134
ep-0.130
awed-0.127
ounced-0.126
Token home
Token ID1363
Feature activation+8.31e-01

Pos logit contributions
crow+0.061
 difference+0.059
 deed+0.059
 apology+0.058
 zone+0.057
Neg logit contributions
irling-0.054
eping-0.052
ep-0.051
 exactly-0.049
ounced-0.049
Token with
Token ID351
Feature activation+2.20e+00

Pos logit contributions
crow+0.019
 apology+0.018
 difference+0.018
 deed+0.018
 conclusions+0.018
Neg logit contributions
irling-0.017
eping-0.017
ep-0.016
ents-0.016
 exactly-0.016
Token us
Token ID514
Feature activation+2.42e+00

Pos logit contributions
crow+0.046
 apology+0.044
 difference+0.044
 lesson+0.044
 deed+0.043
Neg logit contributions
irling-0.050
eping-0.048
ep-0.046
awed-0.046
ounced-0.046
Token.
Token ID13
Feature activation+2.96e+00

Pos logit contributions
crow+0.043
 apology+0.041
 deed+0.040
 difference+0.040
 conclusions+0.040
Neg logit contributions
irling-0.063
eping-0.061
ep-0.060
ounced-0.059
ents-0.058
Token Which
Token ID9022
Feature activation+1.20e+01

Pos logit contributions
crow+0.048
 lesson+0.047
 deed+0.046
 thing+0.045
 apology+0.045
Neg logit contributions
irling-0.078
eping-0.075
lic-0.075
ents-0.074
years-0.073
Token one
Token ID530
Feature activation+6.06e+00

Pos logit contributions
crow+0.239
 deed+0.232
 apology+0.230
 zone+0.226
 difference+0.225
Neg logit contributions
irling-0.271
ep-0.267
eping-0.266
awed-0.253
ounced-0.252
Token do
Token ID466
Feature activation+5.30e+00

Pos logit contributions
crow+0.119
 conclusions+0.115
 apology+0.111
 deed+0.108
 zone+0.106
Neg logit contributions
eping-0.134
irling-0.134
ep-0.133
 walking-0.129
ounced-0.129
Token you
Token ID345
Feature activation+5.66e+00

Pos logit contributions
crow+0.109
 difference+0.104
 deed+0.103
 apology+0.103
 lesson+0.102
Neg logit contributions
irling-0.122
eping-0.115
ep-0.114
eking-0.110
ounced-0.110
Token want
Token ID765
Feature activation+6.04e+00

Pos logit contributions
crow+0.102
 apology+0.094
 thing+0.094
 lesson+0.094
 conclusions+0.093
Neg logit contributions
irling-0.146
eping-0.142
ep-0.136
ounced-0.134
awed-0.133
Token?"
Token ID1701
Feature activation-1.22e+00

Pos logit contributions
crow+0.128
 difference+0.125
 apology+0.124
 deed+0.124
 conclusions+0.123
Neg logit contributions
irling-0.131
eping-0.127
ep-0.125
ounced-0.123
awed-0.122
Token yellow
Token ID7872
Feature activation-8.80e-01

Pos logit contributions
irling+0.033
eping+0.032
ep+0.030
awed+0.030
ounced+0.030
Neg logit contributions
crow-0.045
 apology-0.043
 lesson-0.043
 deed-0.043
 difference-0.043
Token,
Token ID11
Feature activation-1.93e+00

Pos logit contributions
ents+0.011
ep+0.011
irling+0.011
 hopping+0.011
eping+0.010
Neg logit contributions
crow-0.025
 lesson-0.024
 apology-0.024
 deed-0.024
 difference-0.023
Token or
Token ID393
Feature activation-1.88e+00

Pos logit contributions
irling+0.036
eping+0.034
awed+0.032
ep+0.032
ounced+0.032
Neg logit contributions
crow-0.049
 apology-0.047
 lesson-0.047
 thing-0.047
 difference-0.046
Token white
Token ID2330
Feature activation-3.34e-01

Pos logit contributions
irling+0.034
eping+0.033
awed+0.031
ep+0.031
ounced+0.030
Neg logit contributions
crow-0.048
 apology-0.046
 thing-0.046
 lesson-0.046
 difference-0.046
Token.
Token ID13
Feature activation+3.06e+00

Pos logit contributions
ep+0.005
ents+0.005
irling+0.005
 hopping+0.005
eping+0.005
Neg logit contributions
 lesson-0.008
crow-0.008
 apology-0.008
 difference-0.008
 deed-0.008
Token Which
Token ID9022
Feature activation+1.19e+01

Pos logit contributions
 lesson+0.062
 deed+0.061
crow+0.060
 thing+0.059
 choice+0.059
Neg logit contributions
ents-0.064
irling-0.063
lic-0.063
years-0.061
eping-0.061
Token one
Token ID530
Feature activation+6.45e+00

Pos logit contributions
crow+0.248
 deed+0.243
 apology+0.237
 zone+0.236
 difference+0.229
Neg logit contributions
ep-0.256
irling-0.254
eping-0.250
ounced-0.234
awed-0.232
Token do
Token ID466
Feature activation+5.24e+00

Pos logit contributions
crow+0.123
 apology+0.117
 deed+0.116
 conclusions+0.115
 zone+0.112
Neg logit contributions
irling-0.152
eping-0.149
ep-0.147
eking-0.141
ounced-0.141
Token you
Token ID345
Feature activation+5.84e+00

Pos logit contributions
crow+0.109
 deed+0.104
 difference+0.103
 zone+0.103
 lesson+0.102
Neg logit contributions
irling-0.121
eping-0.113
ep-0.112
eking-0.108
ounced-0.107
Token want
Token ID765
Feature activation+5.90e+00

Pos logit contributions
crow+0.100
 apology+0.093
 lesson+0.093
 thing+0.093
 deed+0.092
Neg logit contributions
irling-0.156
eping-0.151
ep-0.145
awed-0.143
ounced-0.143
Token?"
Token ID1701
Feature activation-1.80e+00

Pos logit contributions
crow+0.133
 deed+0.130
 difference+0.129
 apology+0.127
 conclusions+0.126
Neg logit contributions
irling-0.120
eping-0.116
ep-0.114
ounced-0.111
awed-0.109
Token buy
Token ID2822
Feature activation+3.01e+00

Pos logit contributions
crow+0.091
 lesson+0.082
 deed+0.081
 zone+0.081
 apology+0.080
Neg logit contributions
irling-0.126
eping-0.118
ep-0.116
 exactly-0.110
eking-0.109
Token one
Token ID530
Feature activation+5.28e+00

Pos logit contributions
crow+0.066
 apology+0.064
 deed+0.063
 difference+0.063
 lesson+0.062
Neg logit contributions
irling-0.065
eping-0.063
ep-0.061
awed-0.060
ounced-0.060
Token toy
Token ID13373
Feature activation+2.78e+00

Pos logit contributions
 affect+0.134
 conclusions+0.130
crow+0.129
 pleasures+0.124
 deed+0.124
Neg logit contributions
 exactly-0.099
ents-0.087
ep-0.087
 called-0.078
 brand-0.077
Token each
Token ID1123
Feature activation+5.07e+00

Pos logit contributions
 deed+0.068
 difference+0.068
 conclusions+0.067
 pleasures+0.066
 apology+0.066
Neg logit contributions
 exactly-0.047
ep-0.047
ents-0.047
ounced-0.046
eping-0.044
Token.
Token ID13
Feature activation+3.14e+00

Pos logit contributions
 conclusions+0.126
 difference+0.120
crow+0.118
 deed+0.117
 affect+0.114
Neg logit contributions
ep-0.091
 exactly-0.087
 walking-0.083
eping-0.081
eh-0.080
Token Which
Token ID9022
Feature activation+1.19e+01

Pos logit contributions
 lesson+0.065
 deed+0.064
crow+0.062
 affect+0.062
 difference+0.062
Neg logit contributions
ents-0.067
irling-0.065
lic-0.064
years-0.064
 exactly-0.063
Token one
Token ID530
Feature activation+6.46e+00

Pos logit contributions
 deed+0.280
crow+0.273
 zone+0.271
 apology+0.263
 affect+0.263
Neg logit contributions
ep-0.240
eping-0.216
irling-0.214
 exactly-0.208
ounced-0.207
Token do
Token ID466
Feature activation+5.41e+00

Pos logit contributions
 conclusions+0.147
crow+0.145
 deed+0.141
 zone+0.140
 apology+0.139
Neg logit contributions
ep-0.123
 exactly-0.116
ounced-0.115
 walking-0.115
 pointed-0.115
Token you
Token ID345
Feature activation+6.04e+00

Pos logit contributions
crow+0.117
 deed+0.113
 zone+0.112
 difference+0.111
 conclusions+0.111
Neg logit contributions
irling-0.121
ep-0.113
eping-0.111
eking-0.107
 exactly-0.105
Token want
Token ID765
Feature activation+6.41e+00

Pos logit contributions
crow+0.120
 conclusions+0.110
 deed+0.109
 apology+0.109
 lesson+0.109
Neg logit contributions
irling-0.143
eping-0.140
ep-0.137
 exactly-0.131
ents-0.131
Token?"
Token ID1701
Feature activation-1.71e+00

Pos logit contributions
crow+0.148
 deed+0.147
 difference+0.145
 conclusions+0.144
 apology+0.142
Neg logit contributions
irling-0.124
ep-0.120
eping-0.119
ounced-0.116
ents-0.114
Token and
Token ID290
Feature activation-2.43e+00

Pos logit contributions
ep+0.026
irling+0.026
ounced+0.025
eping+0.025
awed+0.025
Neg logit contributions
crow-0.033
 apology-0.033
 conclusions-0.032
 deed-0.032
 difference-0.032
Token chocolate
Token ID11311
Feature activation-1.38e+00

Pos logit contributions
irling+0.051
eping+0.049
ep+0.047
ounced+0.046
awed+0.046
Neg logit contributions
crow-0.055
 deed-0.052
 apology-0.051
 zone-0.051
 difference-0.051
Token and
Token ID290
Feature activation-2.88e+00

Pos logit contributions
irling+0.030
ep+0.029
awed+0.029
eping+0.029
ounced+0.029
Neg logit contributions
crow-0.030
 apology-0.029
 conclusions-0.029
 difference-0.029
 deed-0.028
Token nuts
Token ID14380
Feature activation-2.06e+00

Pos logit contributions
irling+0.062
eping+0.059
ep+0.058
awed+0.056
ounced+0.056
Neg logit contributions
crow-0.065
 deed-0.061
 apology-0.061
 difference-0.060
 zone-0.060
Token.
Token ID13
Feature activation+2.38e+00

Pos logit contributions
irling+0.046
eping+0.045
ep+0.044
ounced+0.044
awed+0.044
Neg logit contributions
crow-0.043
 apology-0.042
 deed-0.041
 difference-0.041
 conclusions-0.041
Token Which
Token ID9022
Feature activation+1.19e+01

Pos logit contributions
crow+0.041
 lesson+0.040
 deed+0.040
 apology+0.039
 difference+0.039
Neg logit contributions
irling-0.061
eping-0.059
lic-0.058
ep-0.057
years-0.057
Token one
Token ID530
Feature activation+6.12e+00

Pos logit contributions
crow+0.241
 apology+0.235
 zone+0.233
 deed+0.233
 conclusions+0.226
Neg logit contributions
irling-0.262
ep-0.261
eping-0.257
awed-0.250
ounced-0.249
Token do
Token ID466
Feature activation+5.17e+00

Pos logit contributions
crow+0.119
 conclusions+0.113
 apology+0.113
 deed+0.111
 zone+0.108
Neg logit contributions
irling-0.141
eping-0.140
ep-0.137
ounced-0.135
lic-0.132
Token you
Token ID345
Feature activation+5.62e+00

Pos logit contributions
crow+0.104
 deed+0.099
 apology+0.098
 difference+0.097
 lesson+0.097
Neg logit contributions
irling-0.122
eping-0.115
ep-0.113
ounced-0.110
awed-0.110
Token want
Token ID765
Feature activation+5.95e+00

Pos logit contributions
crow+0.102
 apology+0.095
 deed+0.093
 lesson+0.093
 thing+0.093
Neg logit contributions
irling-0.144
eping-0.140
ep-0.134
ounced-0.132
awed-0.132
Token,
Token ID11
Feature activation+8.19e-02

Pos logit contributions
crow+0.138
 deed+0.137
 conclusions+0.136
 apology+0.135
 difference+0.135
Neg logit contributions
irling-0.112
ep-0.111
eping-0.110
ounced-0.109
lic-0.107
Token,
Token ID11
Feature activation+2.36e+00

Pos logit contributions
crow+0.144
 deed+0.138
 zone+0.138
 lesson+0.138
 apology+0.138
Neg logit contributions
irling-0.099
eping-0.091
ep-0.087
eking-0.086
 exactly-0.086
Token but
Token ID475
Feature activation+7.87e+00

Pos logit contributions
crow+0.051
 difference+0.049
 deed+0.049
 zone+0.049
 apology+0.048
Neg logit contributions
irling-0.050
eping-0.048
ep-0.046
 exactly-0.046
ents-0.046
Token just
Token ID655
Feature activation+4.06e+00

Pos logit contributions
crow+0.170
 zone+0.156
 conclusions+0.155
 deed+0.155
 difference+0.154
Neg logit contributions
irling-0.171
eking-0.161
 exactly-0.160
eping-0.160
ounced-0.159
Token one
Token ID530
Feature activation+5.36e+00

Pos logit contributions
crow+0.079
 lesson+0.073
 deed+0.073
 apology+0.072
 difference+0.072
Neg logit contributions
irling-0.098
eping-0.095
eking-0.091
ounced-0.091
ep-0.091
Token.
Token ID13
Feature activation+4.25e+00

Pos logit contributions
crow+0.103
 conclusions+0.098
 affect+0.097
 deed+0.096
 zone+0.093
Neg logit contributions
 walking-0.115
irling-0.115
eping-0.114
 exactly-0.112
ep-0.112
Token Which
Token ID9022
Feature activation+1.19e+01

Pos logit contributions
 lesson+0.082
 thing+0.079
 deed+0.078
crow+0.077
 choice+0.076
Neg logit contributions
irling-0.098
ents-0.093
years-0.093
eping-0.092
aj-0.091
Token one
Token ID530
Feature activation+6.56e+00

Pos logit contributions
 zone+0.264
crow+0.260
 deed+0.260
 affect+0.251
 apology+0.247
Neg logit contributions
ep-0.234
irling-0.223
 walking-0.222
eping-0.220
ounced-0.217
Token do
Token ID466
Feature activation+5.75e+00

Pos logit contributions
crow+0.138
 conclusions+0.133
 zone+0.130
 deed+0.130
 apology+0.128
Neg logit contributions
irling-0.134
ep-0.131
eping-0.131
 walking-0.129
ounced-0.129
Token you
Token ID345
Feature activation+5.98e+00

Pos logit contributions
crow+0.125
 deed+0.119
 zone+0.119
 difference+0.118
 apology+0.117
Neg logit contributions
irling-0.127
eping-0.116
ep-0.116
eking-0.113
ounced-0.110
Token want
Token ID765
Feature activation+6.46e+00

Pos logit contributions
crow+0.109
 thing+0.099
 apology+0.099
 deed+0.099
 lesson+0.099
Neg logit contributions
irling-0.153
eping-0.148
ep-0.142
eking-0.140
ounced-0.139
Token?"
Token ID1701
Feature activation-1.39e+00

Pos logit contributions
crow+0.145
 deed+0.143
 difference+0.141
 apology+0.139
 conclusions+0.139
Neg logit contributions
irling-0.129
eping-0.123
ep-0.122
ounced-0.120
 walking-0.120
Token He
Token ID679
Feature activation-6.52e-01

Pos logit contributions
crow+0.019
 deed+0.018
 apology+0.018
 difference+0.018
 lesson+0.018
Neg logit contributions
irling-0.017
awed-0.016
eping-0.016
lic-0.016
ents-0.016
Token asked
Token ID1965
Feature activation+1.43e+00

Pos logit contributions
irling+0.016
eping+0.014
eking+0.014
itching+0.013
rolling+0.013
Neg logit contributions
crow-0.013
 lesson-0.013
 thing-0.013
 end-0.013
 apology-0.012
Token Anna
Token ID11735
Feature activation+1.94e-01

Pos logit contributions
crow+0.035
 difference+0.034
 affect+0.034
 zone+0.034
 apology+0.033
Neg logit contributions
irling-0.025
years-0.023
eping-0.023
ep-0.023
 exactly-0.023
Token,
Token ID11
Feature activation-2.82e+00

Pos logit contributions
crow+0.005
 apology+0.005
 conclusions+0.005
 lesson+0.004
 deed+0.004
Neg logit contributions
irling-0.004
eping-0.003
ents-0.003
ounced-0.003
 exactly-0.003
Token "
Token ID366
Feature activation+1.47e+00

Pos logit contributions
irling+0.058
eping+0.055
awed+0.052
ep+0.052
ounced+0.052
Neg logit contributions
crow-0.066
 apology-0.064
 thing-0.064
 lesson-0.064
 difference-0.062
TokenWhich
Token ID13828
Feature activation+1.19e+01

Pos logit contributions
 apology+0.032
 difference+0.032
 deed+0.031
 choice+0.031
 lesson+0.031
Neg logit contributions
ep-0.032
irling-0.032
eping-0.031
awed-0.030
eps-0.030
Token one
Token ID530
Feature activation+6.56e+00

Pos logit contributions
 deed+0.243
crow+0.232
 apology+0.227
 pleasures+0.224
 zone+0.224
Neg logit contributions
irling-0.259
ep-0.258
 walking-0.244
eping-0.243
ounced-0.242
Token is
Token ID318
Feature activation+1.66e+00

Pos logit contributions
crow+0.132
 conclusions+0.131
 deed+0.129
 apology+0.125
 pleasures+0.121
Neg logit contributions
irling-0.141
ep-0.141
eping-0.140
ounced-0.137
 walking-0.135
Token your
Token ID534
Feature activation+4.31e+00

Pos logit contributions
crow+0.037
 apology+0.036
 lesson+0.035
 thing+0.035
 deed+0.035
Neg logit contributions
irling-0.036
eping-0.034
ep-0.033
awed-0.033
ounced-0.032
Token favorite
Token ID4004
Feature activation+4.33e+00

Pos logit contributions
crow+0.091
 deed+0.082
 conclusions+0.081
 apology+0.079
 difference+0.079
Neg logit contributions
irling-0.090
 exactly-0.090
 walking-0.088
aj-0.087
ep-0.085
Token?"
Token ID1701
Feature activation-1.27e+00

Pos logit contributions
crow+0.086
 deed+0.085
 apology+0.085
 payoff+0.084
 affect+0.080
Neg logit contributions
ep-0.095
irling-0.091
 exactly-0.086
eping-0.085
 walking-0.085
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
irling+0.023
ep+0.023
eping+0.022
awed+0.021
lic+0.021
Neg logit contributions
 deed-0.031
 difference-0.030
crow-0.030
 zone-0.030
 pleasures-0.029
Token read
Token ID1100
Feature activation+6.80e-01

Pos logit contributions
irling+0.067
eping+0.064
ep+0.062
ounced+0.062
awed+0.061
Neg logit contributions
crow-0.043
 lesson-0.040
 apology-0.040
 thing-0.039
 deed-0.039
Token your
Token ID534
Feature activation+2.24e+00

Pos logit contributions
crow+0.017
 deed+0.016
 difference+0.016
 apology+0.016
 zone+0.015
Neg logit contributions
irling-0.013
eping-0.013
ep-0.013
awed-0.012
ents-0.012
Token book
Token ID1492
Feature activation+6.07e-01

Pos logit contributions
crow+0.052
 difference+0.048
 deed+0.048
 zone+0.047
 apology+0.047
Neg logit contributions
irling-0.047
eping-0.045
ep-0.044
awed-0.042
ounced-0.042
Token.
Token ID13
Feature activation+2.83e+00

Pos logit contributions
crow+0.014
 apology+0.013
 thing+0.013
 lesson+0.013
 difference+0.013
Neg logit contributions
irling-0.013
eping-0.012
awed-0.011
ep-0.011
ounced-0.011
Token Which
Token ID9022
Feature activation+1.18e+01

Pos logit contributions
crow+0.057
 lesson+0.055
 deed+0.054
 thing+0.054
 apology+0.052
Neg logit contributions
irling-0.065
eping-0.062
lic-0.061
ep-0.060
ents-0.059
Token one
Token ID530
Feature activation+6.18e+00

Pos logit contributions
crow+0.258
 deed+0.245
 apology+0.244
 zone+0.243
 conclusions+0.234
Neg logit contributions
irling-0.250
ep-0.245
eping-0.244
ounced-0.229
awed-0.228
Token do
Token ID466
Feature activation+5.12e+00

Pos logit contributions
crow+0.118
 apology+0.109
 conclusions+0.108
 deed+0.107
 zone+0.105
Neg logit contributions
irling-0.150
eping-0.147
ep-0.143
eking-0.139
ounced-0.139
Token you
Token ID345
Feature activation+5.68e+00

Pos logit contributions
crow+0.109
 deed+0.103
 zone+0.102
 apology+0.101
 lesson+0.101
Neg logit contributions
irling-0.117
eping-0.109
ep-0.107
eking-0.104
ounced-0.102
Token want
Token ID765
Feature activation+5.93e+00

Pos logit contributions
crow+0.086
 apology+0.083
 lesson+0.082
 thing+0.082
 difference+0.081
Neg logit contributions
irling-0.163
eping-0.158
awed-0.154
ounced-0.152
ep-0.151
Token?"
Token ID1701
Feature activation-1.53e+00

Pos logit contributions
crow+0.129
 deed+0.123
 apology+0.122
 difference+0.122
 conclusions+0.120
Neg logit contributions
irling-0.128
eping-0.124
ep-0.120
ounced-0.117
awed-0.117
Token pie
Token ID2508
Feature activation+2.58e-01

Pos logit contributions
ep+0.009
ents+0.009
irling+0.009
ounced+0.009
eping+0.009
Neg logit contributions
 apology-0.011
 conclusions-0.011
crow-0.011
 deed-0.011
 lesson-0.011
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
crow+0.006
 conclusions+0.006
 apology+0.006
 deed+0.005
 difference+0.005
Neg logit contributions
ep-0.005
irling-0.005
eping-0.005
awed-0.005
ounced-0.005
Token and
Token ID290
Feature activation-2.56e+00

Pos logit contributions
irling+0.042
eping+0.040
ep+0.038
awed+0.038
ounced+0.038
Neg logit contributions
crow-0.056
 apology-0.053
 lesson-0.053
 difference-0.052
 deed-0.052
Token more
Token ID517
Feature activation+2.78e+00

Pos logit contributions
irling+0.053
eping+0.051
ep+0.049
ounced+0.048
ents+0.047
Neg logit contributions
crow-0.059
 zone-0.056
 deed-0.055
 apology-0.055
 lesson-0.054
Token.
Token ID13
Feature activation+1.93e+00

Pos logit contributions
crow+0.049
 zone+0.045
 deed+0.044
 apology+0.044
 conclusions+0.043
Neg logit contributions
irling-0.074
eping-0.071
ep-0.069
ounced-0.067
awed-0.067
Token Which
Token ID9022
Feature activation+1.18e+01

Pos logit contributions
crow+0.038
 lesson+0.037
 deed+0.037
 thing+0.036
 choice+0.036
Neg logit contributions
irling-0.042
lic-0.042
ents-0.040
eping-0.040
years-0.040
Token one
Token ID530
Feature activation+6.19e+00

Pos logit contributions
 zone+0.245
crow+0.243
 deed+0.238
 apology+0.237
 conclusions+0.231
Neg logit contributions
ep-0.263
eping-0.252
irling-0.249
ounced-0.242
awed-0.239
Token do
Token ID466
Feature activation+5.15e+00

Pos logit contributions
crow+0.121
 conclusions+0.115
 apology+0.114
 deed+0.111
 zone+0.111
Neg logit contributions
irling-0.142
eping-0.141
ep-0.139
ounced-0.133
eking-0.133
Token you
Token ID345
Feature activation+5.62e+00

Pos logit contributions
crow+0.111
 zone+0.106
 deed+0.106
 apology+0.105
 difference+0.104
Neg logit contributions
irling-0.115
eping-0.106
ep-0.106
eking-0.102
ounced-0.100
Token want
Token ID765
Feature activation+5.85e+00

Pos logit contributions
crow+0.095
 apology+0.090
 lesson+0.089
 thing+0.089
 difference+0.088
Neg logit contributions
irling-0.152
eping-0.147
ep-0.141
awed-0.140
ounced-0.140
Token to
Token ID284
Feature activation+5.63e+00

Pos logit contributions
crow+0.130
 deed+0.126
 difference+0.125
 apology+0.125
 conclusions+0.124
Neg logit contributions
irling-0.121
eping-0.117
ep-0.115
ounced-0.112
awed-0.111
TokenThe
Token ID464
Feature activation+4.88e-02

Pos logit contributions
irling+0.045
eping+0.043
awed+0.042
ounced+0.041
ep+0.041
Neg logit contributions
crow-0.032
 lesson-0.030
 apology-0.030
 thing-0.030
 difference-0.029
Token seed
Token ID9403
Feature activation-2.33e+00

Pos logit contributions
crow+0.001
 zone+0.001
 apology+0.001
 conclusions+0.001
 deed+0.001
Neg logit contributions
irling-0.001
eping-0.001
ep-0.001
eking-0.001
awed-0.001
Token asked
Token ID1965
Feature activation+1.14e+00

Pos logit contributions
irling+0.045
years+0.042
eping+0.041
lic+0.040
awed+0.040
Neg logit contributions
crow-0.059
 thing-0.058
 lesson-0.057
 end-0.057
 difference-0.053
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
crow+0.022
 thing+0.022
 lesson+0.022
 apology+0.021
 deed+0.021
Neg logit contributions
irling-0.029
eping-0.027
awed-0.026
ounced-0.026
ep-0.026
Token "
Token ID366
Feature activation+1.36e+00

Pos logit contributions
irling+0.048
eping+0.045
awed+0.044
ounced+0.043
eking+0.042
Neg logit contributions
 thing-0.037
crow-0.036
 lesson-0.036
 apology-0.036
 end-0.034
TokenWhich
Token ID13828
Feature activation+1.18e+01

Pos logit contributions
crow+0.025
 apology+0.025
 lesson+0.025
 difference+0.025
 deed+0.024
Neg logit contributions
irling-0.033
eping-0.033
ep-0.032
years-0.031
eking-0.031
Token way
Token ID835
Feature activation+6.28e+00

Pos logit contributions
crow+0.222
 deed+0.211
 apology+0.211
 zone+0.210
 pleasures+0.202
Neg logit contributions
irling-0.272
ep-0.262
 walking-0.261
eping-0.261
ounced-0.259
Token should
Token ID815
Feature activation+6.56e+00

Pos logit contributions
crow+0.140
 apology+0.136
 deed+0.135
 lesson+0.133
 conclusions+0.130
Neg logit contributions
 walking-0.122
 exactly-0.120
irling-0.119
 hopping-0.117
ents-0.115
Token I
Token ID314
Feature activation+7.65e+00

Pos logit contributions
crow+0.123
 apology+0.112
 zone+0.112
 lesson+0.111
 deed+0.110
Neg logit contributions
irling-0.158
eping-0.148
 walking-0.147
eking-0.147
ep-0.146
Token grow
Token ID1663
Feature activation+1.14e-01

Pos logit contributions
crow+0.138
 apology+0.127
 lesson+0.125
 deed+0.123
 conclusions+0.122
Neg logit contributions
irling-0.191
eping-0.188
ep-0.180
lic-0.178
ounced-0.178
Token at
Token ID379
Feature activation+4.72e+00

Pos logit contributions
crow+0.002
 apology+0.002
 thing+0.002
 lesson+0.002
 difference+0.002
Neg logit contributions
irling-0.003
eping-0.002
awed-0.002
ep-0.002
ounced-0.002
Token many
Token ID867
Feature activation+2.53e+00

Pos logit contributions
crow+0.082
 apology+0.078
 lesson+0.078
 thing+0.078
 difference+0.077
Neg logit contributions
irling-0.073
eping-0.070
awed-0.066
ep-0.066
ounced-0.066
Token sandwiches
Token ID29777
Feature activation+3.57e-02

Pos logit contributions
crow+0.057
 thing+0.056
 apology+0.056
 lesson+0.056
 difference+0.055
Neg logit contributions
irling-0.054
eping-0.051
awed-0.049
ounced-0.048
ep-0.048
Token to
Token ID284
Feature activation+4.65e+00

Pos logit contributions
crow+0.001
 conclusions+0.001
 apology+0.001
 difference+0.001
 deed+0.001
Neg logit contributions
irling-0.001
ounced-0.001
awed-0.001
ents-0.001
lic-0.001
Token sell
Token ID3677
Feature activation+2.90e+00

Pos logit contributions
crow+0.088
 zone+0.081
 lesson+0.081
 apology+0.080
 deed+0.080
Neg logit contributions
irling-0.117
eping-0.112
ep-0.110
awed-0.106
eking-0.106
Token.
Token ID13
Feature activation+3.42e+00

Pos logit contributions
crow+0.064
 deed+0.061
 apology+0.061
 zone+0.060
 affect+0.059
Neg logit contributions
ep-0.058
ents-0.058
eping-0.057
 exactly-0.056
irling-0.056
Token Which
Token ID9022
Feature activation+1.18e+01

Pos logit contributions
crow+0.060
 lesson+0.059
 deed+0.057
 thing+0.057
 apology+0.057
Neg logit contributions
irling-0.083
lic-0.082
ents-0.081
years-0.080
eping-0.079
Token one
Token ID530
Feature activation+5.61e+00

Pos logit contributions
crow+0.263
 zone+0.254
 apology+0.254
 deed+0.250
 conclusions+0.244
Neg logit contributions
ep-0.237
irling-0.233
eping-0.225
ounced-0.225
awed-0.223
Token would
Token ID561
Feature activation+3.57e+00

Pos logit contributions
crow+0.111
 conclusions+0.108
 apology+0.103
 zone+0.100
 deed+0.099
Neg logit contributions
ep-0.122
eping-0.122
irling-0.122
ounced-0.120
 walking-0.119
Token you
Token ID345
Feature activation+5.34e+00

Pos logit contributions
crow+0.084
 zone+0.078
inea+0.073
 apology+0.073
 conclusions+0.073
Neg logit contributions
irling-0.072
eking-0.068
 exactly-0.067
 walking-0.066
years-0.066
Token like
Token ID588
Feature activation+4.40e+00

Pos logit contributions
crow+0.099
 apology+0.092
 lesson+0.091
 thing+0.091
 deed+0.090
Neg logit contributions
irling-0.136
eping-0.132
ep-0.127
ounced-0.125
awed-0.124
Token?"
Token ID1701
Feature activation-1.09e+00

Pos logit contributions
crow+0.089
 apology+0.084
 zone+0.083
 deed+0.082
 difference+0.080
Neg logit contributions
irling-0.105
eping-0.099
ep-0.098
ounced-0.096
awed-0.095
Token keep
Token ID1394
Feature activation+1.47e+00

Pos logit contributions
crow+0.077
 lesson+0.073
 thing+0.072
 zone+0.072
 apology+0.072
Neg logit contributions
irling-0.109
eping-0.103
ep-0.102
eking-0.097
eps-0.096
Token one
Token ID530
Feature activation+5.48e+00

Pos logit contributions
crow+0.029
 thing+0.028
 apology+0.027
 lesson+0.027
 difference+0.027
Neg logit contributions
irling-0.035
lic-0.034
eping-0.034
awed-0.033
ounced-0.033
Token hat
Token ID6877
Feature activation+4.71e-02

Pos logit contributions
crow+0.135
 affect+0.132
 pleasures+0.126
keeping+0.122
 apology+0.121
Neg logit contributions
 exactly-0.100
ep-0.091
ents-0.088
eping-0.083
 at-0.082
Token each
Token ID1123
Feature activation+4.37e+00

Pos logit contributions
crow+0.001
 apology+0.001
 lesson+0.001
 conclusions+0.001
 difference+0.001
Neg logit contributions
irling-0.001
eping-0.001
ep-0.001
eking-0.001
ents-0.001
Token.
Token ID13
Feature activation+2.78e+00

Pos logit contributions
 difference+0.107
 conclusions+0.104
 deed+0.101
crow+0.098
 pleasures+0.097
Neg logit contributions
ep-0.080
eping-0.072
 exactly-0.072
 walking-0.068
 called-0.067
Token Which
Token ID9022
Feature activation+1.18e+01

Pos logit contributions
 lesson+0.054
 affect+0.052
 deed+0.051
 difference+0.051
 choice+0.051
Neg logit contributions
irling-0.061
years-0.060
ents-0.059
eping-0.058
lic-0.057
Token one
Token ID530
Feature activation+6.50e+00

Pos logit contributions
crow+0.251
 deed+0.250
 apology+0.248
 zone+0.243
 difference+0.243
Neg logit contributions
ep-0.256
eping-0.247
irling-0.244
ounced-0.229
awed-0.228
Token do
Token ID466
Feature activation+5.58e+00

Pos logit contributions
crow+0.131
 apology+0.126
 conclusions+0.125
 deed+0.124
 zone+0.123
Neg logit contributions
ep-0.144
eping-0.140
irling-0.139
eps-0.136
eking-0.134
Token you
Token ID345
Feature activation+6.21e+00

Pos logit contributions
crow+0.121
 deed+0.117
 difference+0.116
 zone+0.115
 lesson+0.115
Neg logit contributions
irling-0.122
ep-0.113
eping-0.112
eking-0.110
ounced-0.106
Token want
Token ID765
Feature activation+6.41e+00

Pos logit contributions
crow+0.117
 apology+0.108
 lesson+0.107
 deed+0.107
 conclusions+0.106
Neg logit contributions
irling-0.155
eping-0.152
ep-0.147
eking-0.142
ounced-0.140
Token?"
Token ID1701
Feature activation-1.33e+00

Pos logit contributions
crow+0.147
 deed+0.145
 difference+0.144
 conclusions+0.141
 apology+0.140
Neg logit contributions
irling-0.124
ep-0.120
eping-0.119
ounced-0.116
 walking-0.114
Token friend
Token ID1545
Feature activation+1.61e+00

Pos logit contributions
crow+0.032
 conclusions+0.030
 apology+0.030
 difference+0.029
 lesson+0.029
Neg logit contributions
irling-0.027
eping-0.027
ep-0.026
ounced-0.025
years-0.025
Token,
Token ID11
Feature activation-2.56e+00

Pos logit contributions
 conclusions+0.040
 apology+0.040
crow+0.040
 difference+0.039
 lesson+0.039
Neg logit contributions
eping-0.027
irling-0.026
 walking-0.026
ounced-0.026
 exactly-0.026
Token Sam
Token ID3409
Feature activation-2.42e-01

Pos logit contributions
irling+0.052
eping+0.050
ep+0.048
awed+0.047
lic+0.047
Neg logit contributions
crow-0.059
 apology-0.057
 lesson-0.057
 difference-0.057
 deed-0.057
Token,
Token ID11
Feature activation-2.54e+00

Pos logit contributions
irling+0.005
eping+0.005
ep+0.004
 walking+0.004
eking+0.004
Neg logit contributions
crow-0.006
 apology-0.006
 conclusions-0.006
 lesson-0.006
 deed-0.006
Token "
Token ID366
Feature activation+9.82e-01

Pos logit contributions
irling+0.055
eping+0.052
awed+0.050
ep+0.049
ounced+0.049
Neg logit contributions
crow-0.057
 thing-0.055
 lesson-0.054
 apology-0.054
 difference-0.053
TokenWhich
Token ID13828
Feature activation+1.18e+01

Pos logit contributions
 apology+0.020
 difference+0.019
 lesson+0.019
crow+0.019
 deed+0.019
Neg logit contributions
eping-0.023
irling-0.023
ep-0.023
awed-0.022
years-0.022
Token writing
Token ID3597
Feature activation+4.02e+00

Pos logit contributions
crow+0.240
 apology+0.224
 deed+0.221
 conclusions+0.218
 difference+0.217
Neg logit contributions
irling-0.270
eping-0.264
ep-0.260
awed-0.248
ounced-0.247
Token place
Token ID1295
Feature activation+4.12e+00

Pos logit contributions
crow+0.066
 apology+0.060
 conclusions+0.060
 deed+0.060
 difference+0.058
Neg logit contributions
irling-0.110
eping-0.108
ep-0.107
eking-0.102
awed-0.102
Token should
Token ID815
Feature activation+5.94e+00

Pos logit contributions
crow+0.072
 deed+0.067
 apology+0.067
 conclusions+0.065
 lesson+0.065
Neg logit contributions
irling-0.107
eping-0.104
ep-0.102
ounced-0.100
eking-0.099
Token I
Token ID314
Feature activation+7.13e+00

Pos logit contributions
crow+0.110
 difference+0.097
 deed+0.096
 conclusions+0.096
 apology+0.095
Neg logit contributions
irling-0.151
eping-0.144
ep-0.143
ents-0.137
eking-0.136
Token go
Token ID467
Feature activation+3.07e-01

Pos logit contributions
crow+0.128
 apology+0.118
 lesson+0.117
 thing+0.116
 difference+0.116
Neg logit contributions
irling-0.186
eping-0.181
ep-0.174
ounced-0.171
awed-0.171
Token Emily
Token ID17608
Feature activation-5.06e-01

Pos logit contributions
crow+0.003
 lesson+0.003
 thing+0.003
 apology+0.003
 zone+0.003
Neg logit contributions
irling-0.004
eping-0.004
eking-0.004
ep-0.004
ounced-0.004
Token asked
Token ID1965
Feature activation+8.26e-01

Pos logit contributions
irling+0.012
eping+0.012
eking+0.011
lic+0.011
eps+0.010
Neg logit contributions
 lesson-0.010
crow-0.009
 thing-0.009
 end-0.009
 payoff-0.009
Token Max
Token ID5436
Feature activation+2.16e-01

Pos logit contributions
crow+0.019
 thing+0.019
 lesson+0.019
 apology+0.019
 deed+0.019
Neg logit contributions
irling-0.018
eping-0.016
awed-0.016
ounced-0.016
ep-0.015
Token,
Token ID11
Feature activation-2.83e+00

Pos logit contributions
crow+0.005
 apology+0.005
 conclusions+0.005
 lesson+0.005
 deed+0.005
Neg logit contributions
irling-0.004
eping-0.004
ounced-0.004
 exactly-0.004
lic-0.004
Token "
Token ID366
Feature activation+9.85e-01

Pos logit contributions
irling+0.068
eping+0.065
awed+0.062
ounced+0.061
ep+0.061
Neg logit contributions
crow-0.057
 thing-0.057
 apology-0.056
 lesson-0.056
 end-0.054
TokenWhich
Token ID13828
Feature activation+1.18e+01

Pos logit contributions
crow+0.018
 apology+0.018
 lesson+0.018
 difference+0.018
 deed+0.017
Neg logit contributions
irling-0.025
eping-0.024
ep-0.024
awed-0.023
years-0.023
Token leash
Token ID40529
Feature activation+2.99e+00

Pos logit contributions
 apology+0.224
crow+0.223
 deed+0.223
 zone+0.222
 conclusions+0.219
Neg logit contributions
irling-0.253
 walking-0.250
ep-0.249
ounced-0.246
 exactly-0.241
Token do
Token ID466
Feature activation+4.80e+00

Pos logit contributions
crow+0.048
 deed+0.046
 conclusions+0.046
 apology+0.046
 lesson+0.044
Neg logit contributions
irling-0.080
eping-0.078
ep-0.077
ounced-0.077
awed-0.076
Token you
Token ID345
Feature activation+5.86e+00

Pos logit contributions
crow+0.107
 zone+0.100
 deed+0.100
 conclusions+0.099
 difference+0.099
Neg logit contributions
irling-0.104
ep-0.096
eping-0.096
eking-0.092
ounced-0.091
Token want
Token ID765
Feature activation+5.53e+00

Pos logit contributions
crow+0.102
 apology+0.095
 lesson+0.094
 thing+0.094
 deed+0.093
Neg logit contributions
irling-0.156
eping-0.151
ep-0.145
ounced-0.144
awed-0.143
Token?"
Token ID1701
Feature activation-1.30e+00

Pos logit contributions
crow+0.124
 deed+0.118
 conclusions+0.118
 apology+0.118
 difference+0.117
Neg logit contributions
irling-0.114
eping-0.109
ep-0.108
ounced-0.106
awed-0.104
Token He
Token ID679
Feature activation-8.12e-01

Pos logit contributions
 difference+0.016
 deed+0.016
 conclusions+0.015
crow+0.015
 apology+0.015
Neg logit contributions
awed-0.014
irling-0.013
ounced-0.013
lic-0.013
eping-0.013
Token asks
Token ID7893
Feature activation+1.58e+00

Pos logit contributions
irling+0.021
eping+0.020
lic+0.020
awed+0.019
ep+0.019
Neg logit contributions
crow-0.015
 thing-0.014
 apology-0.014
 lesson-0.014
 end-0.014
Token Sue
Token ID26089
Feature activation+9.75e-01

Pos logit contributions
 difference+0.039
crow+0.038
 affect+0.038
 conclusions+0.038
 zone+0.038
Neg logit contributions
irling-0.027
eping-0.027
ep-0.026
 exactly-0.026
ounced-0.025
Token,
Token ID11
Feature activation-2.72e+00

Pos logit contributions
crow+0.024
 conclusions+0.023
 apology+0.023
 deed+0.023
 lesson+0.023
Neg logit contributions
ents-0.017
 exactly-0.017
 walking-0.017
irling-0.017
ounced-0.016
Token "
Token ID366
Feature activation+1.35e+00

Pos logit contributions
irling+0.056
eping+0.053
awed+0.050
ep+0.050
ounced+0.049
Neg logit contributions
crow-0.064
 apology-0.062
 thing-0.062
 lesson-0.062
 difference-0.060
TokenWhere
Token ID8496
Feature activation+1.17e+01

Pos logit contributions
 difference+0.030
 apology+0.030
 lesson+0.029
 deed+0.029
 affect+0.028
Neg logit contributions
ep-0.030
awed-0.029
eping-0.028
ounced-0.028
irling-0.027
Token is
Token ID318
Feature activation+2.20e+00

Pos logit contributions
crow+0.173
 apology+0.161
 deed+0.160
 difference+0.159
 conclusions+0.158
Neg logit contributions
irling-0.336
eping-0.328
ep-0.319
ounced-0.319
awed-0.318
Token the
Token ID262
Feature activation+3.67e+00

Pos logit contributions
crow+0.044
 apology+0.042
 lesson+0.041
 difference+0.041
 deed+0.041
Neg logit contributions
irling-0.053
eping-0.051
ep-0.049
awed-0.048
ounced-0.048
Token red
Token ID2266
Feature activation+2.20e+00

Pos logit contributions
crow+0.109
 deed+0.105
 zone+0.103
 conclusions+0.103
 apology+0.101
Neg logit contributions
ep-0.050
eping-0.048
irling-0.045
 walking-0.044
eps-0.044
Token car
Token ID1097
Feature activation+1.94e+00

Pos logit contributions
crow+0.050
 deed+0.049
 difference+0.048
 conclusions+0.048
 apology+0.047
Neg logit contributions
ep-0.046
eping-0.042
irling-0.040
eps-0.038
ents-0.038
Token?"
Token ID1701
Feature activation-1.42e+00

Pos logit contributions
crow+0.040
 apology+0.039
 lesson+0.038
 deed+0.038
 conclusions+0.038
Neg logit contributions
irling-0.044
eping-0.043
ep-0.042
ounced-0.041
awed-0.041
Token go
Token ID467
Feature activation+7.33e-02

Pos logit contributions
crow+0.092
 apology+0.086
 lesson+0.086
 thing+0.085
 difference+0.085
Neg logit contributions
irling-0.134
eping-0.130
ep-0.126
eking-0.123
awed-0.121
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
 apology+0.002
crow+0.002
 deed+0.002
 lesson+0.002
 difference+0.002
Neg logit contributions
irling-0.002
eping-0.002
ep-0.001
eking-0.001
 walking-0.001
Token
Token ID198
Feature activation-8.87e-01

Pos logit contributions
crow+0.026
 deed+0.025
 apology+0.025
 lesson+0.025
 thing+0.025
Neg logit contributions
irling-0.026
awed-0.026
eping-0.025
lic-0.025
ounced-0.025
Token
Token ID198
Feature activation-3.51e-01

Pos logit contributions
irling+0.023
eping+0.023
awed+0.022
ounced+0.021
ep+0.021
Neg logit contributions
crow-0.016
 apology-0.015
 lesson-0.015
 thing-0.015
 difference-0.015
Token"
Token ID1
Feature activation+3.08e+00

Pos logit contributions
irling+0.009
eping+0.009
eking+0.009
awed+0.008
years+0.008
Neg logit contributions
crow-0.007
 lesson-0.006
 thing-0.006
 end-0.006
 am-0.006
TokenWhich
Token ID13828
Feature activation+1.17e+01

Pos logit contributions
crow+0.057
 apology+0.056
 difference+0.055
 deed+0.055
 lesson+0.055
Neg logit contributions
irling-0.077
eping-0.075
ep-0.072
awed-0.072
ounced-0.071
Token way
Token ID835
Feature activation+5.79e+00

Pos logit contributions
crow+0.209
 apology+0.207
 deed+0.202
 zone+0.196
 conclusions+0.195
Neg logit contributions
irling-0.282
ep-0.275
eping-0.272
 walking-0.269
ounced-0.269
Token,
Token ID11
Feature activation+7.06e-01

Pos logit contributions
crow+0.143
 apology+0.143
 deed+0.142
 conclusions+0.137
 pleasures+0.137
Neg logit contributions
 walking-0.100
 exactly-0.097
irling-0.095
ents-0.093
ounced-0.092
Token Tom
Token ID4186
Feature activation+1.81e+00

Pos logit contributions
crow+0.015
 deed+0.015
 apology+0.014
 lesson+0.014
 difference+0.014
Neg logit contributions
irling-0.015
eping-0.015
ep-0.014
ounced-0.014
awed-0.014
Token?"
Token ID1701
Feature activation-1.16e+00

Pos logit contributions
crow+0.033
 thing+0.032
 lesson+0.031
 apology+0.031
 difference+0.031
Neg logit contributions
irling-0.046
eping-0.045
awed-0.043
ounced-0.042
ep-0.042
Token Lily
Token ID20037
Feature activation-2.09e+00

Pos logit contributions
irling+0.030
eping+0.029
awed+0.028
ounced+0.027
ep+0.027
Neg logit contributions
crow-0.022
 thing-0.021
 lesson-0.021
 apology-0.021
 difference-0.020
Token man
Token ID582
Feature activation-1.43e+00

Pos logit contributions
irling+0.017
ounced+0.017
ents+0.017
lic+0.017
eping+0.017
Neg logit contributions
 conclusions-0.011
crow-0.011
 apology-0.011
 difference-0.011
 affect-0.011
Token for
Token ID329
Feature activation+4.69e+00

Pos logit contributions
 walking+0.026
 exactly+0.026
ents+0.025
years+0.025
irling+0.025
Neg logit contributions
crow-0.033
 conclusions-0.033
 affect-0.033
 lesson-0.032
 apology-0.032
Token help
Token ID1037
Feature activation+3.82e+00

Pos logit contributions
 apology+0.084
 lesson+0.082
 thing+0.081
 end+0.081
 offer+0.079
Neg logit contributions
irling-0.129
awed-0.121
eping-0.121
ounced-0.118
lic-0.116
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
crow+0.077
 lesson+0.073
 apology+0.073
 difference+0.073
 deed+0.071
Neg logit contributions
irling-0.088
eping-0.087
ep-0.084
eking-0.082
awed-0.081
Token "
Token ID366
Feature activation+6.00e-01

Pos logit contributions
crow+0.005
 conclusions+0.005
 difference+0.005
 apology+0.005
 lesson+0.005
Neg logit contributions
irling-0.006
lic-0.006
eping-0.006
awed-0.006
ents-0.006
TokenWhich
Token ID13828
Feature activation+1.16e+01

Pos logit contributions
 apology+0.012
 difference+0.012
 lesson+0.012
 choice+0.012
crow+0.012
Neg logit contributions
irling-0.013
eping-0.013
years-0.013
ep-0.013
lic-0.013
Token cake
Token ID12187
Feature activation+3.38e+00

Pos logit contributions
crow+0.240
 zone+0.231
 deed+0.229
 conclusions+0.226
 apology+0.225
Neg logit contributions
ep-0.254
irling-0.251
eping-0.245
ounced-0.240
 exactly-0.238
Token is
Token ID318
Feature activation+3.92e-01

Pos logit contributions
crow+0.060
 conclusions+0.056
 deed+0.056
 apology+0.056
 lesson+0.053
Neg logit contributions
irling-0.087
ep-0.084
eping-0.084
ounced-0.082
lic-0.082
Token good
Token ID922
Feature activation+4.17e+00

Pos logit contributions
crow+0.010
 conclusions+0.009
 zone+0.009
 apology+0.009
 deed+0.009
Neg logit contributions
irling-0.007
eping-0.007
ep-0.007
ounced-0.007
 exactly-0.007
Token for
Token ID329
Feature activation+7.44e+00

Pos logit contributions
crow+0.107
 conclusions+0.105
 apology+0.101
 deed+0.100
 difference+0.100
Neg logit contributions
irling-0.072
eping-0.071
ep-0.069
ounced-0.068
lic-0.068
Token my
Token ID616
Feature activation+4.38e+00

Pos logit contributions
 thing+0.117
crow+0.116
 apology+0.114
 lesson+0.113
 end+0.110
Neg logit contributions
irling-0.211
eping-0.203
awed-0.200
ounced-0.196
lic-0.194
Token it
Token ID340
Feature activation+2.49e+00

Pos logit contributions
crow+0.008
 deed+0.008
 zone+0.007
 difference+0.007
 apology+0.007
Neg logit contributions
irling-0.006
ep-0.006
ents-0.006
eping-0.006
 exactly-0.006
Token with
Token ID351
Feature activation+2.25e+00

Pos logit contributions
crow+0.059
 apology+0.056
 difference+0.056
 thing+0.056
 lesson+0.056
Neg logit contributions
irling-0.050
eping-0.048
ep-0.046
awed-0.046
ounced-0.045
Token another
Token ID1194
Feature activation+3.17e+00

Pos logit contributions
crow+0.050
 apology+0.047
 deed+0.047
 lesson+0.046
 difference+0.046
Neg logit contributions
irling-0.050
eping-0.048
ep-0.046
awed-0.045
ounced-0.045
Token toy
Token ID13373
Feature activation+2.03e+00

Pos logit contributions
crow+0.070
 deed+0.068
 apology+0.064
 zone+0.063
 conclusions+0.063
Neg logit contributions
ep-0.066
eping-0.064
irling-0.063
eps-0.060
ounced-0.059
Token.
Token ID13
Feature activation+2.46e+00

Pos logit contributions
crow+0.044
 deed+0.044
 difference+0.043
 apology+0.042
 lesson+0.042
Neg logit contributions
ents-0.041
ep-0.040
irling-0.039
ounced-0.039
eping-0.039
Token Which
Token ID9022
Feature activation+1.16e+01

Pos logit contributions
crow+0.045
 lesson+0.043
 deed+0.043
 thing+0.042
 apology+0.042
Neg logit contributions
irling-0.062
eping-0.060
ep-0.058
lic-0.057
awed-0.057
Token toy
Token ID13373
Feature activation+4.60e+00

Pos logit contributions
crow+0.240
 deed+0.232
 apology+0.225
 zone+0.224
 difference+0.220
Neg logit contributions
ep-0.265
irling-0.261
eping-0.256
ounced-0.243
awed-0.243
Token do
Token ID466
Feature activation+5.03e+00

Pos logit contributions
crow+0.089
 deed+0.088
 apology+0.086
 difference+0.085
 conclusions+0.085
Neg logit contributions
ep-0.105
irling-0.103
eping-0.102
ounced-0.100
ents-0.098
Token you
Token ID345
Feature activation+5.91e+00

Pos logit contributions
crow+0.097
 apology+0.091
 lesson+0.091
 deed+0.091
 difference+0.091
Neg logit contributions
irling-0.124
eping-0.118
ep-0.115
awed-0.113
ounced-0.113
Token think
Token ID892
Feature activation+4.83e+00

Pos logit contributions
crow+0.097
 apology+0.091
 lesson+0.091
 thing+0.091
 difference+0.089
Neg logit contributions
irling-0.163
eping-0.158
awed-0.152
ep-0.151
ounced-0.151
Token matches
Token ID7466
Feature activation+4.59e+00

Pos logit contributions
crow+0.091
 thing+0.088
 lesson+0.088
 apology+0.088
 difference+0.086
Neg logit contributions
irling-0.120
eping-0.116
awed-0.112
ounced-0.110
ep-0.110
TokenJake
Token ID43930
Feature activation-1.34e+00

Pos logit contributions
 apology+0.014
 deed+0.013
 difference+0.013
 lesson+0.013
 conclusions+0.013
Neg logit contributions
irling-0.012
awed-0.012
lic-0.011
 walking-0.011
eping-0.011
Token was
Token ID373
Feature activation-5.30e+00

Pos logit contributions
irling+0.025
eping+0.023
awed+0.022
ep+0.022
ounced+0.022
Neg logit contributions
crow-0.034
 apology-0.033
 lesson-0.033
 thing-0.033
 difference-0.033
Token driving
Token ID5059
Feature activation-3.93e+00

Pos logit contributions
lic+0.107
awed+0.107
ounced+0.106
irling+0.105
eking+0.100
Neg logit contributions
 thing-0.118
 end-0.114
crow-0.110
 lesson-0.109
 difference-0.105
Token in
Token ID287
Feature activation-1.66e+00

Pos logit contributions
irling+0.081
eping+0.078
ep+0.074
awed+0.074
ounced+0.073
Neg logit contributions
crow-0.091
 apology-0.088
 lesson-0.087
 difference-0.087
 thing-0.086
Token his
Token ID465
Feature activation-2.97e+00

Pos logit contributions
irling+0.036
eping+0.036
 walking+0.034
ep+0.034
years+0.033
Neg logit contributions
crow-0.036
 apology-0.035
 lesson-0.034
 deed-0.033
 difference-0.033
Token je
Token ID11223
Feature activation-8.86e+00

Pos logit contributions
irling+0.070
eping+0.067
awed+0.065
ounced+0.064
ep+0.064
Neg logit contributions
crow-0.060
 thing-0.059
 lesson-0.058
 difference-0.058
 apology-0.058
Tokenep
Token ID538
Feature activation-6.24e+00

Pos logit contributions
lic+0.098
ounced+0.090
 pointed+0.085
 chatting+0.082
 hopping+0.079
Neg logit contributions
crow-0.251
 thing-0.245
 end-0.244
 difference-0.241
 side-0.237
Token.
Token ID13
Feature activation+1.82e+00

Pos logit contributions
irling+0.153
eping+0.143
ounced+0.142
awed+0.142
lic+0.139
Neg logit contributions
crow-0.121
 thing-0.117
 lesson-0.113
 deed-0.112
 apology-0.112
Token He
Token ID679
Feature activation-1.35e+00

Pos logit contributions
crow+0.035
 thing+0.034
 lesson+0.034
 apology+0.034
 difference+0.033
Neg logit contributions
irling-0.045
eping-0.043
awed-0.042
ounced-0.041
ep-0.041
Token was
Token ID373
Feature activation-5.86e+00

Pos logit contributions
irling+0.029
awed+0.027
ounced+0.027
lic+0.026
eping+0.026
Neg logit contributions
 thing-0.030
crow-0.029
 end-0.029
 lesson-0.028
 am-0.027
Token so
Token ID523
Feature activation-1.53e+00

Pos logit contributions
awed+0.131
irling+0.129
ounced+0.123
lic+0.120
leys+0.118
Neg logit contributions
 thing-0.129
 end-0.120
 lesson-0.113
 side-0.113
crow-0.112
Token.
Token ID13
Feature activation+4.76e-02

Pos logit contributions
irling+0.105
eping+0.102
ounced+0.098
awed+0.098
ep+0.098
Neg logit contributions
crow-0.052
 thing-0.051
 difference-0.050
 lesson-0.049
 deed-0.048
Token She
Token ID1375
Feature activation-1.71e+00

Pos logit contributions
crow+0.001
 apology+0.001
 lesson+0.001
 thing+0.001
 difference+0.001
Neg logit contributions
irling-0.001
eping-0.001
ep-0.001
awed-0.001
ounced-0.001
Token stepped
Token ID10764
Feature activation-7.15e+00

Pos logit contributions
irling+0.038
eping+0.035
awed+0.034
itching+0.034
lic+0.034
Neg logit contributions
crow-0.034
 thing-0.033
 end-0.032
 lesson-0.031
 side-0.031
Token in
Token ID287
Feature activation-2.27e+00

Pos logit contributions
irling+0.186
eping+0.177
awed+0.177
ounced+0.176
ep+0.175
Neg logit contributions
crow-0.128
 end-0.126
 thing-0.124
 side-0.118
 lesson-0.111
Token a
Token ID257
Feature activation-3.95e+00

Pos logit contributions
irling+0.047
eping+0.045
awed+0.042
ep+0.042
eking+0.041
Neg logit contributions
crow-0.053
 apology-0.050
 lesson-0.050
 deed-0.049
 difference-0.049
Token p
Token ID279
Feature activation-8.66e+00

Pos logit contributions
irling+0.089
awed+0.086
eping+0.084
ounced+0.084
eking+0.083
Neg logit contributions
crow-0.087
 thing-0.086
 difference-0.084
 end-0.084
 lesson-0.081
Tokenuddle
Token ID24500
Feature activation-4.84e+00

Pos logit contributions
 hopping+0.186
 reading+0.177
years+0.176
 walking+0.176
eping+0.173
Neg logit contributions
crow-0.184
 choice-0.152
 apology-0.150
 difference-0.149
 thing-0.149
Token and
Token ID290
Feature activation-5.73e+00

Pos logit contributions
irling+0.100
eping+0.095
awed+0.091
ounced+0.091
ep+0.090
Neg logit contributions
crow-0.114
 thing-0.110
 lesson-0.108
 apology-0.108
 difference-0.106
Token it
Token ID340
Feature activation-6.99e-01

Pos logit contributions
irling+0.118
eping+0.111
ounced+0.105
awed+0.104
itching+0.104
Neg logit contributions
 thing-0.131
crow-0.129
 lesson-0.124
 end-0.120
 moment-0.116
Token made
Token ID925
Feature activation+4.10e-02

Pos logit contributions
irling+0.016
years+0.016
eping+0.015
ounced+0.015
awed+0.015
Neg logit contributions
 thing-0.015
crow-0.014
 lesson-0.014
 end-0.014
 am-0.013
Token her
Token ID607
Feature activation-1.34e+00

Pos logit contributions
 thing+0.001
 lesson+0.001
crow+0.001
 end+0.001
 apology+0.001
Neg logit contributions
irling-0.001
ounced-0.001
awed-0.001
eping-0.001
ep-0.001
Token a
Token ID257
Feature activation-6.69e-01

Pos logit contributions
crow+0.037
 apology+0.037
 difference+0.036
 lesson+0.036
 deed+0.036
Neg logit contributions
irling-0.037
eping-0.036
awed-0.035
ep-0.035
ounced-0.034
Token worm
Token ID23849
Feature activation-1.84e+00

Pos logit contributions
irling+0.014
eping+0.014
ep+0.013
lic+0.013
ounced+0.013
Neg logit contributions
crow-0.015
 apology-0.015
 deed-0.014
 zone-0.014
 lesson-0.014
Token.
Token ID13
Feature activation+5.88e-02

Pos logit contributions
irling+0.043
eping+0.042
awed+0.040
ep+0.040
ounced+0.040
Neg logit contributions
 apology-0.037
crow-0.036
 lesson-0.036
 deed-0.035
 difference-0.035
Token They
Token ID1119
Feature activation-8.86e-01

Pos logit contributions
crow+0.001
 thing+0.001
 apology+0.001
 lesson+0.001
 difference+0.001
Neg logit contributions
irling-0.002
eping-0.002
ep-0.001
eking-0.001
awed-0.001
Token were
Token ID547
Feature activation-5.97e+00

Pos logit contributions
irling+0.025
aj+0.025
awed+0.022
eping+0.022
eking+0.022
Neg logit contributions
 end-0.016
crow-0.014
 thing-0.014
 am-0.014
 lesson-0.014
Token pe
Token ID613
Feature activation-8.59e+00

Pos logit contributions
aj+0.131
irling+0.125
awed+0.118
lic+0.114
eping+0.113
Neg logit contributions
 thing-0.126
crow-0.122
 lesson-0.121
 end-0.119
 rob-0.118
Tokencking
Token ID44377
Feature activation-2.27e+00

Pos logit contributions
lic+0.117
dream+0.113
 brand+0.112
 hopping+0.106
 pointed+0.100
Neg logit contributions
crow-0.217
 thing-0.201
 decisions-0.199
fl-0.199
 apology-0.198
Token each
Token ID1123
Feature activation+2.73e+00

Pos logit contributions
irling+0.034
eping+0.029
ounced+0.027
lic+0.027
ep+0.027
Neg logit contributions
crow-0.069
 thing-0.068
 lesson-0.066
 apology-0.065
 difference-0.065
Token other
Token ID584
Feature activation+3.10e+00

Pos logit contributions
 difference+0.050
 apology+0.047
 zone+0.045
 deed+0.045
crow+0.044
Neg logit contributions
eping-0.068
ep-0.067
eh-0.066
awed-0.066
irling-0.066
Token and
Token ID290
Feature activation-3.94e+00

Pos logit contributions
crow+0.077
 apology+0.076
 lesson+0.074
 difference+0.074
 deed+0.074
Neg logit contributions
irling-0.058
eping-0.056
awed-0.053
ep-0.052
ounced-0.052
Token making
Token ID1642
Feature activation+9.88e-01

Pos logit contributions
irling+0.077
eping+0.072
aj+0.070
awed+0.068
lic+0.068
Neg logit contributions
 thing-0.095
crow-0.093
 end-0.091
 lesson-0.089
 difference-0.089
Token
Token ID198
Feature activation-1.51e+00

Pos logit contributions
irling+0.046
eping+0.044
awed+0.043
ounced+0.042
ep+0.042
Neg logit contributions
crow-0.038
 thing-0.036
 apology-0.036
 deed-0.036
 difference-0.036
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
irling+0.037
eping+0.036
awed+0.034
ep+0.034
ounced+0.034
Neg logit contributions
crow-0.029
 apology-0.028
 lesson-0.028
 thing-0.028
 difference-0.027
TokenKat
Token ID25881
Feature activation-5.98e-01

Pos logit contributions
irling+0.026
eping+0.025
awed+0.024
ounced+0.024
ep+0.023
Neg logit contributions
crow-0.033
 apology-0.031
 lesson-0.031
 thing-0.031
 difference-0.031
Tokenie
Token ID494
Feature activation-2.29e+00

Pos logit contributions
irling+0.014
eping+0.013
ounced+0.013
awed+0.013
lic+0.013
Neg logit contributions
crow-0.012
 thing-0.012
 lesson-0.011
 apology-0.011
 deed-0.011
Token be
Token ID307
Feature activation-4.42e+00

Pos logit contributions
irling+0.047
eping+0.046
lic+0.043
awed+0.043
eking+0.043
Neg logit contributions
crow-0.052
 lesson-0.050
 thing-0.049
 apology-0.049
 difference-0.048
Tokenamed
Token ID2434
Feature activation-8.57e+00

Pos logit contributions
awed+0.075
cases+0.074
lic+0.072
ounced+0.071
irling+0.071
Neg logit contributions
crow-0.117
 thing-0.110
 end-0.110
 lesson-0.103
 difference-0.102
Token with
Token ID351
Feature activation-2.10e+00

Pos logit contributions
irling+0.171
eping+0.163
awed+0.155
ounced+0.154
ep+0.153
Neg logit contributions
crow-0.206
 thing-0.199
 apology-0.199
 lesson-0.197
 difference-0.196
Token joy
Token ID8716
Feature activation-6.56e-01

Pos logit contributions
irling+0.050
eping+0.048
awed+0.046
ep+0.046
years+0.046
Neg logit contributions
crow-0.042
 apology-0.039
 thing-0.039
 lesson-0.039
 deed-0.039
Token.
Token ID13
Feature activation+5.77e-01

Pos logit contributions
irling+0.016
eping+0.015
ounced+0.014
ep+0.014
awed+0.014
Neg logit contributions
crow-0.013
 thing-0.013
 lesson-0.013
 deed-0.012
 apology-0.012
Token "
Token ID366
Feature activation+6.62e-01

Pos logit contributions
crow+0.013
 apology+0.013
 lesson+0.013
 thing+0.013
 deed+0.013
Neg logit contributions
irling-0.012
eping-0.011
awed-0.011
lic-0.011
ounced-0.011
TokenLook
Token ID8567
Feature activation+3.27e-01

Pos logit contributions
crow+0.013
 apology+0.013
 lesson+0.012
 difference+0.012
 thing+0.012
Neg logit contributions
irling-0.016
eping-0.015
ep-0.015
awed-0.015
years-0.015
Token."
Token ID526
Feature activation-9.51e-01

Pos logit contributions
crow+0.061
 deed+0.059
 apology+0.059
 difference+0.058
 lesson+0.057
Neg logit contributions
irling-0.064
eping-0.062
ep-0.060
awed-0.060
ounced-0.059
Token
Token ID198
Feature activation-2.83e+00

Pos logit contributions
irling+0.020
eping+0.019
awed+0.019
ounced+0.018
ep+0.018
Neg logit contributions
 deed-0.021
 lesson-0.020
 difference-0.020
crow-0.020
 thing-0.020
Token
Token ID198
Feature activation-2.16e+00

Pos logit contributions
irling+0.065
eping+0.062
ep+0.061
awed+0.059
lic+0.059
Neg logit contributions
crow-0.057
 thing-0.057
 deed-0.057
 difference-0.056
 lesson-0.056
TokenL
Token ID43
Feature activation-7.33e-02

Pos logit contributions
irling+0.043
eping+0.040
ep+0.038
awed+0.038
ounced+0.038
Neg logit contributions
crow-0.052
 apology-0.051
 lesson-0.050
 thing-0.050
 difference-0.050
Tokenily
Token ID813
Feature activation-9.28e-01

Pos logit contributions
irling+0.002
eping+0.002
ep+0.002
awed+0.002
lic+0.002
Neg logit contributions
 difference-0.001
crow-0.001
 thing-0.001
 lesson-0.001
 deed-0.001
Token stood
Token ID6204
Feature activation-8.51e+00

Pos logit contributions
irling+0.019
eping+0.018
years+0.017
eking+0.017
lic+0.017
Neg logit contributions
crow-0.021
 lesson-0.021
 apology-0.021
 difference-0.020
 end-0.020
Token under
Token ID739
Feature activation-2.29e-01

Pos logit contributions
lic+0.219
eping+0.211
awed+0.210
irling+0.208
ounced+0.208
Neg logit contributions
 thing-0.147
 end-0.141
 side-0.135
crow-0.134
 lesson-0.126
Token the
Token ID262
Feature activation+4.62e-01

Pos logit contributions
irling+0.005
eping+0.005
awed+0.005
 walking+0.005
ep+0.005
Neg logit contributions
crow-0.005
 difference-0.005
 apology-0.005
 lesson-0.005
 zone-0.005
Token tree
Token ID5509
Feature activation-3.85e+00

Pos logit contributions
crow+0.010
 deed+0.010
 pleasures+0.009
 zone+0.009
 apology+0.009
Neg logit contributions
irling-0.010
eping-0.010
ep-0.009
ents-0.009
 walking-0.009
Token and
Token ID290
Feature activation-5.76e+00

Pos logit contributions
irling+0.087
eping+0.082
ounced+0.079
awed+0.078
lic+0.078
Neg logit contributions
crow-0.084
 thing-0.081
 lesson-0.079
 difference-0.079
 deed-0.079
Token counted
Token ID14789
Feature activation-3.26e+00

Pos logit contributions
irling+0.122
eping+0.116
lic+0.110
years+0.104
eking+0.102
Neg logit contributions
 thing-0.127
crow-0.125
 payoff-0.120
 lesson-0.119
 end-0.118
Token.
Token ID13
Feature activation+3.31e-01

Pos logit contributions
irling+0.053
eping+0.051
awed+0.049
ounced+0.049
ep+0.049
Neg logit contributions
crow-0.033
 thing-0.032
 lesson-0.031
 apology-0.031
 difference-0.030
Token He
Token ID679
Feature activation-1.44e+00

Pos logit contributions
crow+0.006
 thing+0.005
 lesson+0.005
 apology+0.005
 difference+0.005
Neg logit contributions
irling-0.009
eping-0.009
ep-0.009
eking-0.008
ounced-0.008
Token walked
Token ID6807
Feature activation-6.27e+00

Pos logit contributions
irling+0.037
eping+0.033
eking+0.033
aj+0.032
agging+0.032
Neg logit contributions
 end-0.025
 thing-0.025
crow-0.024
 lesson-0.023
 am-0.021
Token over
Token ID625
Feature activation-3.26e+00

Pos logit contributions
irling+0.196
eping+0.179
awed+0.176
ounced+0.175
eking+0.174
Neg logit contributions
crow-0.093
 thing-0.091
 lesson-0.088
 end-0.084
 deed-0.077
Token and
Token ID290
Feature activation-5.48e+00

Pos logit contributions
irling+0.066
eping+0.065
awed+0.062
ep+0.062
lic+0.061
Neg logit contributions
crow-0.074
 apology-0.072
 difference-0.071
 deed-0.070
 lesson-0.069
Token pe
Token ID613
Feature activation-8.49e+00

Pos logit contributions
irling+0.135
eping+0.113
aj+0.113
itching+0.113
agging+0.111
Neg logit contributions
crow-0.113
 thing-0.107
 moment-0.098
 side-0.098
 end-0.095
Tokenered
Token ID1068
Feature activation-6.20e+00

Pos logit contributions
 brand+0.222
dream+0.218
lic+0.213
 chatting+0.209
irling+0.209
Neg logit contributions
crow-0.119
 thing-0.104
inea-0.096
fl-0.092
 prove-0.090
Token behind
Token ID2157
Feature activation-1.22e+00

Pos logit contributions
irling+0.155
eping+0.146
ounced+0.144
awed+0.144
ep+0.142
Neg logit contributions
 thing-0.120
crow-0.120
 lesson-0.112
 end-0.111
 difference-0.110
Token it
Token ID340
Feature activation-4.54e-02

Pos logit contributions
irling+0.025
eping+0.025
awed+0.024
ep+0.023
 walking+0.023
Neg logit contributions
crow-0.027
 apology-0.026
 lesson-0.026
 difference-0.026
 deed-0.026
Token and
Token ID290
Feature activation-4.84e+00

Pos logit contributions
irling+0.001
eping+0.001
ounced+0.001
ep+0.001
awed+0.001
Neg logit contributions
 thing-0.001
crow-0.001
 end-0.001
 lesson-0.001
 difference-0.001
Token saw
Token ID2497
Feature activation-2.91e+00

Pos logit contributions
irling+0.111
eping+0.093
aj+0.093
ounced+0.091
awed+0.090
Neg logit contributions
crow-0.112
 thing-0.111
 end-0.104
 side-0.103
 zone-0.100
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
irling+0.056
eping+0.054
awed+0.052
ep+0.052
ounced+0.052
Neg logit contributions
crow-0.041
 apology-0.039
 lesson-0.038
 thing-0.038
 difference-0.038
TokenTom
Token ID13787
Feature activation-1.27e+00

Pos logit contributions
irling+0.031
eping+0.029
eking+0.029
years+0.028
ounced+0.028
Neg logit contributions
crow-0.025
 lesson-0.024
 thing-0.024
 end-0.023
 apology-0.022
Token bent
Token ID17157
Feature activation-4.99e+00

Pos logit contributions
irling+0.031
eking+0.025
awed+0.025
eping+0.025
agging+0.024
Neg logit contributions
crow-0.026
 end-0.025
 side-0.024
 thing-0.024
 offer-0.023
Token down
Token ID866
Feature activation-1.83e+00

Pos logit contributions
irling+0.125
eping+0.119
awed+0.113
ounced+0.112
lic+0.112
Neg logit contributions
crow-0.097
 thing-0.094
 lesson-0.091
 apology-0.091
 deed-0.088
Token and
Token ID290
Feature activation-5.35e+00

Pos logit contributions
irling+0.039
eping+0.038
awed+0.036
ep+0.036
ounced+0.035
Neg logit contributions
crow-0.042
 thing-0.040
 lesson-0.040
 apology-0.040
 difference-0.039
Token pe
Token ID613
Feature activation-8.47e+00

Pos logit contributions
irling+0.134
agging+0.119
eping+0.112
ink+0.109
aj+0.107
Neg logit contributions
 thing-0.111
crow-0.103
 offer-0.102
 care-0.101
 moment-0.100
Tokenered
Token ID1068
Feature activation-6.20e+00

Pos logit contributions
 brand+0.220
lic+0.216
dream+0.214
aj+0.209
irling+0.207
Neg logit contributions
crow-0.124
 thing-0.115
inea-0.106
fl-0.101
 prove-0.101
Token into
Token ID656
Feature activation-4.96e+00

Pos logit contributions
irling+0.159
eping+0.149
ounced+0.146
awed+0.146
ep+0.145
Neg logit contributions
 thing-0.119
crow-0.117
 lesson-0.110
 end-0.109
 apology-0.108
Token the
Token ID262
Feature activation-1.08e+00

Pos logit contributions
irling+0.135
eping+0.129
awed+0.125
ounced+0.124
ep+0.124
Neg logit contributions
crow-0.084
 thing-0.081
 apology-0.080
 lesson-0.080
 difference-0.078
Token hole
Token ID7604
Feature activation-3.50e+00

Pos logit contributions
irling+0.025
eping+0.024
awed+0.023
ep+0.023
ounced+0.023
Neg logit contributions
crow-0.023
 thing-0.022
 apology-0.022
 lesson-0.022
 difference-0.022
Token.
Token ID13
Feature activation-3.02e-02

Pos logit contributions
irling+0.106
eping+0.100
ounced+0.098
ep+0.097
awed+0.097
Neg logit contributions
 thing-0.049
crow-0.046
 end-0.043
 lesson-0.043
 zone-0.042
Token
Token ID220
Feature activation-3.09e+00

Pos logit contributions
crow+0.002
 apology+0.002
 lesson+0.002
 thing+0.002
 difference+0.002
Neg logit contributions
irling-0.003
eping-0.003
awed-0.002
ep-0.002
ounced-0.002
Token
Token ID198
Feature activation-2.35e+00

Pos logit contributions
irling+0.062
eping+0.059
awed+0.058
lic+0.058
ep+0.057
Neg logit contributions
crow-0.071
 deed-0.070
 apology-0.069
 thing-0.068
 zone-0.067
Token
Token ID198
Feature activation-1.50e+00

Pos logit contributions
irling+0.058
eping+0.055
awed+0.054
ep+0.053
ounced+0.053
Neg logit contributions
crow-0.045
 apology-0.043
 thing-0.043
 lesson-0.043
 difference-0.042
TokenAb
Token ID4826
Feature activation-1.37e+00

Pos logit contributions
irling+0.034
eping+0.034
ep+0.032
ounced+0.032
awed+0.031
Neg logit contributions
crow-0.032
 thing-0.031
 lesson-0.031
 deed-0.030
 difference-0.029
Tokenby
Token ID1525
Feature activation-2.63e+00

Pos logit contributions
irling+0.042
eping+0.041
lic+0.039
ep+0.038
awed+0.038
Neg logit contributions
crow-0.018
 deed-0.018
 thing-0.017
 difference-0.017
 lesson-0.017
Token pe
Token ID613
Feature activation-8.47e+00

Pos logit contributions
irling+0.071
eking+0.065
aj+0.063
eping+0.063
 brand+0.062
Neg logit contributions
 thing-0.049
crow-0.048
 lesson-0.045
 end-0.044
 person-0.042
Tokenep
Token ID538
Feature activation-5.60e+00

Pos logit contributions
 brand+0.182
dream+0.172
 dreaming+0.167
 hungry+0.160
lic+0.159
Neg logit contributions
crow-0.178
 thing-0.149
inea-0.146
 prove-0.138
 apology-0.138
Tokened
Token ID276
Feature activation-5.53e+00

Pos logit contributions
irling+0.127
eping+0.106
aj+0.104
itching+0.103
 brand+0.101
Neg logit contributions
crow-0.131
 thing-0.119
 end-0.117
 lesson-0.115
 zone-0.111
Token through
Token ID832
Feature activation-2.59e+00

Pos logit contributions
irling+0.146
eping+0.134
ounced+0.131
ep+0.128
awed+0.127
Neg logit contributions
 thing-0.111
crow-0.110
 lesson-0.101
 apology-0.098
 deed-0.098
Token the
Token ID262
Feature activation-6.68e-01

Pos logit contributions
irling+0.065
eping+0.063
awed+0.061
ep+0.060
ounced+0.059
Neg logit contributions
crow-0.048
 apology-0.047
 lesson-0.046
 difference-0.045
 deed-0.045
Token window
Token ID4324
Feature activation-3.74e+00

Pos logit contributions
irling+0.017
eping+0.016
ounced+0.016
awed+0.015
eking+0.015
Neg logit contributions
crow-0.014
 thing-0.013
 difference-0.013
 lesson-0.013
 end-0.013
Token her
Token ID607
Feature activation-7.65e-01

Pos logit contributions
crow+0.003
 thing+0.003
 lesson+0.003
 apology+0.003
 difference+0.003
Neg logit contributions
irling-0.005
eping-0.004
ounced-0.004
awed-0.004
lic-0.004
Token t
Token ID256
Feature activation-5.33e+00

Pos logit contributions
irling+0.016
eping+0.015
ep+0.015
eking+0.015
 walking+0.015
Neg logit contributions
crow-0.018
 apology-0.017
 conclusions-0.017
 difference-0.017
 deed-0.016
Tokenippy
Token ID41214
Feature activation-3.58e+00

Pos logit contributions
irling+0.159
eping+0.151
ounced+0.146
eking+0.144
awed+0.143
Neg logit contributions
crow-0.075
 thing-0.073
 deed-0.072
 zone-0.071
 apology-0.067
Token toes
Token ID23932
Feature activation-3.84e+00

Pos logit contributions
irling+0.077
eping+0.074
awed+0.070
ep+0.070
ounced+0.069
Neg logit contributions
crow-0.080
 apology-0.077
 thing-0.077
 lesson-0.077
 difference-0.076
Token and
Token ID290
Feature activation-5.25e+00

Pos logit contributions
irling+0.100
eping+0.094
ounced+0.092
years+0.091
awed+0.091
Neg logit contributions
 thing-0.069
crow-0.068
 zone-0.065
 lesson-0.064
 end-0.064
Token pe
Token ID613
Feature activation-8.46e+00

Pos logit contributions
irling+0.108
awed+0.097
lic+0.096
eping+0.096
ounced+0.094
Neg logit contributions
 thing-0.128
crow-0.121
 lesson-0.120
 offer-0.120
 apology-0.118
Tokenered
Token ID1068
Feature activation-5.94e+00

Pos logit contributions
lic+0.163
 hopping+0.162
 chatting+0.162
dream+0.161
 brand+0.159
Neg logit contributions
fl-0.146
crow-0.145
 thing-0.142
inea-0.137
 am-0.127
Token outside
Token ID2354
Feature activation-1.97e+00

Pos logit contributions
irling+0.144
eping+0.138
ounced+0.134
awed+0.134
ep+0.132
Neg logit contributions
crow-0.117
 thing-0.116
 difference-0.109
 lesson-0.109
 apology-0.108
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
irling+0.047
eping+0.044
ep+0.042
eking+0.042
 walking+0.042
Neg logit contributions
crow-0.040
 apology-0.039
 lesson-0.038
 deed-0.037
 difference-0.037
Token Sarah
Token ID10490
Feature activation-5.81e-01

Pos logit contributions
irling+0.007
eping+0.007
ep+0.007
eking+0.007
ounced+0.007
Neg logit contributions
crow-0.005
 lesson-0.005
 thing-0.005
 apology-0.005
 zone-0.005
Token looked
Token ID3114
Feature activation-6.10e+00

Pos logit contributions
irling+0.013
eping+0.012
lic+0.012
ounced+0.012
agging+0.011
Neg logit contributions
 thing-0.011
crow-0.011
 end-0.011
 am-0.010
 side-0.010
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
irling+0.016
eping+0.015
ounced+0.015
awed+0.015
ep+0.015
Neg logit contributions
crow-0.010
 thing-0.010
 lesson-0.010
 apology-0.010
 difference-0.010
Token
Token ID198
Feature activation-2.40e+00

Pos logit contributions
irling+0.006
awed+0.005
ounced+0.005
eping+0.005
lic+0.005
Neg logit contributions
 apology-0.006
crow-0.006
 lesson-0.006
 thing-0.006
 deed-0.006
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
irling+0.061
eping+0.058
awed+0.056
ep+0.056
ounced+0.056
Neg logit contributions
crow-0.045
 apology-0.043
 lesson-0.043
 thing-0.043
 difference-0.042
TokenThe
Token ID464
Feature activation-1.15e-01

Pos logit contributions
irling+0.033
eping+0.033
awed+0.032
ounced+0.031
eking+0.031
Neg logit contributions
crow-0.039
 lesson-0.037
 thing-0.037
 apology-0.036
 deed-0.036
Token mum
Token ID25682
Feature activation-5.34e-01

Pos logit contributions
irling+0.002
eping+0.002
ep+0.002
ounced+0.002
awed+0.002
Neg logit contributions
crow-0.003
 difference-0.003
 apology-0.003
 conclusions-0.003
 zone-0.003
Token stood
Token ID6204
Feature activation-8.45e+00

Pos logit contributions
irling+0.011
eping+0.010
eking+0.010
awed+0.009
itching+0.009
Neg logit contributions
crow-0.012
 end-0.011
 thing-0.011
 difference-0.011
 offer-0.011
Token back
Token ID736
Feature activation-1.06e+00

Pos logit contributions
lic+0.221
ounced+0.204
ents+0.202
eping+0.202
irling+0.201
Neg logit contributions
 thing-0.147
 side-0.140
 end-0.134
crow-0.127
 moment-0.117
Token,
Token ID11
Feature activation-4.82e+00

Pos logit contributions
irling+0.023
eping+0.022
lic+0.021
awed+0.021
ounced+0.021
Neg logit contributions
crow-0.023
 thing-0.023
 lesson-0.023
 apology-0.022
 zone-0.022
Token watching
Token ID4964
Feature activation-6.03e-01

Pos logit contributions
irling+0.105
lic+0.102
eping+0.101
ounced+0.099
awed+0.098
Neg logit contributions
 thing-0.097
crow-0.095
 end-0.093
 difference-0.092
 side-0.092
Token her
Token ID607
Feature activation-9.49e-01

Pos logit contributions
irling+0.012
eping+0.012
awed+0.011
ep+0.011
lic+0.011
Neg logit contributions
crow-0.013
 apology-0.013
 lesson-0.013
 difference-0.013
 zone-0.013
Token daughter
Token ID4957
Feature activation-1.11e+00

Pos logit contributions
irling+0.018
eping+0.017
 walking+0.017
 exactly+0.016
awed+0.016
Neg logit contributions
crow-0.024
 conclusions-0.022
 apology-0.022
 lesson-0.021
 deed-0.021
Token around
Token ID1088
Feature activation-2.45e+00

Pos logit contributions
irling+0.151
eping+0.144
awed+0.136
eking+0.135
ounced+0.135
Neg logit contributions
crow-0.106
 lesson-0.104
 thing-0.103
 end-0.096
 deed-0.096
Token the
Token ID262
Feature activation-1.38e+00

Pos logit contributions
irling+0.060
eping+0.058
awed+0.056
ep+0.055
ounced+0.055
Neg logit contributions
crow-0.047
 apology-0.045
 lesson-0.045
 thing-0.045
 difference-0.044
Token big
Token ID1263
Feature activation-6.68e-01

Pos logit contributions
irling+0.035
eping+0.034
ounced+0.033
lic+0.033
awed+0.033
Neg logit contributions
crow-0.025
 thing-0.025
 difference-0.024
 lesson-0.024
 apology-0.023
Token bird
Token ID6512
Feature activation-3.22e+00

Pos logit contributions
irling+0.016
eping+0.015
ep+0.015
awed+0.015
ounced+0.014
Neg logit contributions
 apology-0.013
crow-0.013
 lesson-0.013
 deed-0.013
 difference-0.012
Token and
Token ID290
Feature activation-6.07e+00

Pos logit contributions
irling+0.082
ounced+0.079
lic+0.079
eping+0.079
aj+0.075
Neg logit contributions
 thing-0.059
crow-0.059
 end-0.059
 side-0.055
 difference-0.054
Token pe
Token ID613
Feature activation-8.39e+00

Pos logit contributions
irling+0.138
aj+0.127
eping+0.120
itching+0.113
awed+0.113
Neg logit contributions
 thing-0.139
crow-0.134
 lesson-0.132
 end-0.124
 moment-0.122
Tokenck
Token ID694
Feature activation-3.63e+00

Pos logit contributions
 brand+0.153
dream+0.151
lic+0.143
 chatting+0.139
 hopping+0.137
Neg logit contributions
crow-0.183
 thing-0.165
fl-0.165
inea-0.164
 apology-0.156
Tokened
Token ID276
Feature activation-5.06e+00

Pos logit contributions
irling+0.095
eping+0.083
eking+0.082
lic+0.080
aj+0.080
Neg logit contributions
crow-0.077
 thing-0.072
 lesson-0.066
 deed-0.064
est-0.064
Token him
Token ID683
Feature activation-2.20e+00

Pos logit contributions
irling+0.104
eping+0.098
ounced+0.093
awed+0.092
ep+0.092
Neg logit contributions
crow-0.121
 thing-0.118
 lesson-0.115
 apology-0.114
 difference-0.114
Token with
Token ID351
Feature activation-8.43e-01

Pos logit contributions
irling+0.056
eping+0.051
lic+0.051
years+0.050
aj+0.049
Neg logit contributions
 thing-0.043
 lesson-0.037
 end-0.037
fl-0.037
crow-0.036
Token his
Token ID465
Feature activation-4.87e-01

Pos logit contributions
irling+0.020
eping+0.018
awed+0.018
lic+0.018
ounced+0.018
Neg logit contributions
crow-0.018
 thing-0.018
 lesson-0.017
 apology-0.017
 difference-0.017
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
irling+0.090
eping+0.086
years+0.084
ounced+0.083
awed+0.083
Neg logit contributions
crow-0.047
 thing-0.046
 lesson-0.042
 end-0.042
 difference-0.041
Token He
Token ID679
Feature activation-1.71e+00

Pos logit contributions
crow+0.003
 apology+0.003
 lesson+0.003
 thing+0.003
 difference+0.003
Neg logit contributions
irling-0.004
eping-0.004
awed-0.004
ep-0.004
ounced-0.004
Token went
Token ID1816
Feature activation-4.87e+00

Pos logit contributions
irling+0.047
eping+0.045
eking+0.044
aj+0.043
agging+0.041
Neg logit contributions
 thing-0.027
 lesson-0.026
 end-0.026
crow-0.024
 am-0.022
Token up
Token ID510
Feature activation-3.35e+00

Pos logit contributions
irling+0.124
eping+0.119
awed+0.115
ep+0.113
ounced+0.111
Neg logit contributions
crow-0.092
 lesson-0.090
 thing-0.090
 apology-0.087
 end-0.085
Token and
Token ID290
Feature activation-5.28e+00

Pos logit contributions
irling+0.087
eping+0.083
ep+0.080
awed+0.079
ounced+0.079
Neg logit contributions
crow-0.061
 thing-0.060
 lesson-0.060
 apology-0.057
 deed-0.057
Token pe
Token ID613
Feature activation-8.38e+00

Pos logit contributions
irling+0.121
eping+0.110
lic+0.105
aj+0.100
ep+0.099
Neg logit contributions
crow-0.113
 thing-0.112
 side-0.106
 end-0.103
 offer-0.102
Tokenered
Token ID1068
Feature activation-5.84e+00

Pos logit contributions
lic+0.212
 brand+0.208
dream+0.207
irling+0.204
aj+0.202
Neg logit contributions
crow-0.123
 thing-0.118
fl-0.105
 prove-0.105
inea-0.104
Token inside
Token ID2641
Feature activation-2.24e+00

Pos logit contributions
irling+0.155
eping+0.147
ep+0.144
ounced+0.142
lic+0.142
Neg logit contributions
 thing-0.107
crow-0.102
 end-0.098
 lesson-0.097
 difference-0.094
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
irling+0.061
eping+0.059
awed+0.057
ounced+0.057
ep+0.056
Neg logit contributions
crow-0.037
 thing-0.037
 lesson-0.036
 apology-0.036
 difference-0.035
Token He
Token ID679
Feature activation-1.63e+00

Pos logit contributions
irling+0.004
eping+0.004
awed+0.004
ep+0.004
ounced+0.004
Neg logit contributions
crow-0.003
 apology-0.003
 lesson-0.003
 thing-0.003
 difference-0.003
Token noticed
Token ID6810
Feature activation-2.79e+00

Pos logit contributions
irling+0.043
agging+0.041
eping+0.039
eking+0.038
lic+0.038
Neg logit contributions
 thing-0.026
 end-0.025
 am-0.023
 lesson-0.023
crow-0.023
Token upon
Token ID2402
Feature activation-3.08e+00

Pos logit contributions
irling+0.076
eping+0.071
awed+0.067
ep+0.066
ounced+0.066
Neg logit contributions
crow-0.116
 thing-0.113
 lesson-0.112
 apology-0.110
 end-0.109
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
irling+0.058
eping+0.054
awed+0.051
ounced+0.051
ep+0.050
Neg logit contributions
crow-0.078
 lesson-0.076
 thing-0.076
 apology-0.075
 difference-0.074
Token chamber
Token ID11847
Feature activation-1.39e+00

Pos logit contributions
irling+0.063
eping+0.060
awed+0.058
ep+0.057
ounced+0.057
Neg logit contributions
crow-0.066
 thing-0.064
 apology-0.064
 lesson-0.063
 difference-0.063
Token.
Token ID13
Feature activation+7.42e-01

Pos logit contributions
irling+0.033
eping+0.032
awed+0.030
ep+0.030
ounced+0.030
Neg logit contributions
crow-0.028
 apology-0.027
 lesson-0.027
 deed-0.026
 difference-0.026
Token He
Token ID679
Feature activation-1.45e+00

Pos logit contributions
crow+0.013
 thing+0.013
 lesson+0.012
 apology+0.012
 difference+0.012
Neg logit contributions
irling-0.021
eping-0.020
ep-0.019
eking-0.019
awed-0.019
Token pe
Token ID613
Feature activation-8.38e+00

Pos logit contributions
irling+0.039
eping+0.036
eking+0.034
lic+0.034
awed+0.033
Neg logit contributions
 thing-0.027
 end-0.025
crow-0.025
 lesson-0.024
 am-0.024
Tokenered
Token ID1068
Feature activation-5.88e+00

Pos logit contributions
dream+0.196
 chatting+0.196
lic+0.195
 hopping+0.193
irling+0.193
Neg logit contributions
crow-0.132
 thing-0.119
 apology-0.108
inea-0.108
est-0.101
Token inside
Token ID2641
Feature activation-2.57e+00

Pos logit contributions
irling+0.162
ounced+0.148
eping+0.148
awed+0.147
lic+0.146
Neg logit contributions
 thing-0.110
crow-0.102
 end-0.098
 lesson-0.096
 side-0.092
Token and
Token ID290
Feature activation-4.34e+00

Pos logit contributions
irling+0.065
eping+0.062
ounced+0.060
awed+0.060
ep+0.059
Neg logit contributions
 thing-0.049
 lesson-0.047
crow-0.047
 apology-0.046
 difference-0.046
Token saw
Token ID2497
Feature activation-2.83e+00

Pos logit contributions
irling+0.101
eping+0.088
lic+0.088
ounced+0.086
awed+0.086
Neg logit contributions
 thing-0.101
 end-0.092
crow-0.090
 offer-0.090
 side-0.088
Token a
Token ID257
Feature activation-3.16e+00

Pos logit contributions
irling+0.057
eping+0.053
awed+0.051
ounced+0.051
ep+0.050
Neg logit contributions
crow-0.069
 thing-0.068
 lesson-0.067
 apology-0.066
 difference-0.065
Token slowly
Token ID6364
Feature activation-2.39e+00

Pos logit contributions
irling+0.033
eping+0.031
eking+0.029
awed+0.029
ounced+0.028
Neg logit contributions
crow-0.035
 thing-0.033
 lesson-0.032
 end-0.031
 deed-0.031
Token opened
Token ID4721
Feature activation-6.48e+00

Pos logit contributions
irling+0.050
awed+0.042
itching+0.042
eping+0.041
ounced+0.041
Neg logit contributions
crow-0.061
 end-0.056
 thing-0.053
 side-0.053
 offer-0.053
Token the
Token ID262
Feature activation-3.24e-01

Pos logit contributions
irling+0.166
eping+0.160
lic+0.155
ounced+0.155
awed+0.154
Neg logit contributions
 thing-0.120
crow-0.119
 lesson-0.115
 difference-0.111
 end-0.110
Token door
Token ID3420
Feature activation-3.74e+00

Pos logit contributions
irling+0.005
eping+0.005
lic+0.005
ounced+0.005
ep+0.005
Neg logit contributions
crow-0.009
 deed-0.009
 apology-0.008
 lesson-0.008
 conclusions-0.008
Token and
Token ID290
Feature activation-5.37e+00

Pos logit contributions
irling+0.098
eping+0.093
ounced+0.090
ep+0.089
awed+0.087
Neg logit contributions
 thing-0.071
crow-0.068
 end-0.067
 lesson-0.067
 offer-0.066
Token pe
Token ID613
Feature activation-8.37e+00

Pos logit contributions
irling+0.129
eping+0.112
itching+0.109
awed+0.108
agging+0.105
Neg logit contributions
crow-0.118
 thing-0.118
 end-0.111
 offer-0.110
 side-0.110
Tokenered
Token ID1068
Feature activation-6.18e+00

Pos logit contributions
dream+0.217
 brand+0.216
 hopping+0.212
lic+0.208
 chatting+0.207
Neg logit contributions
crow-0.111
 thing-0.094
fl-0.088
 means-0.086
inea-0.084
Token in
Token ID287
Feature activation-2.49e+00

Pos logit contributions
irling+0.163
eping+0.155
ounced+0.153
awed+0.152
ep+0.150
Neg logit contributions
 thing-0.113
crow-0.112
 end-0.104
 lesson-0.101
 difference-0.100
Token.
Token ID13
Feature activation+9.44e-03

Pos logit contributions
irling+0.053
eping+0.050
awed+0.048
 walking+0.047
ep+0.047
Neg logit contributions
crow-0.056
 apology-0.054
 lesson-0.052
 deed-0.052
 difference-0.052
Token She
Token ID1375
Feature activation-1.84e+00

Pos logit contributions
crow+0.000
 thing+0.000
 apology+0.000
 lesson+0.000
 zone+0.000
Neg logit contributions
irling-0.000
eping-0.000
ep-0.000
eking-0.000
ounced-0.000
Token saw
Token ID2497
Feature activation-3.03e+00

Pos logit contributions
irling+0.046
eping+0.043
agging+0.042
itching+0.041
eking+0.040
Neg logit contributions
 am-0.032
 thing-0.032
 end-0.031
 side-0.031
crow-0.030
Token too
Token ID1165
Feature activation+4.45e+00

Pos logit contributions
crow+0.018
 thing+0.017
 lesson+0.017
 apology+0.017
 difference+0.016
Neg logit contributions
irling-0.016
eping-0.015
awed-0.015
lic-0.014
ep-0.014
Token wet
Token ID9583
Feature activation+1.37e+00

Pos logit contributions
 thing+0.115
 lesson+0.115
crow+0.113
 apology+0.109
 deed+0.108
Neg logit contributions
irling-0.082
awed-0.079
eping-0.077
lic-0.073
ounced-0.073
Token!"
Token ID2474
Feature activation-3.86e-01

Pos logit contributions
 apology+0.032
crow+0.031
 lesson+0.031
 difference+0.030
 deed+0.030
Neg logit contributions
irling-0.027
eping-0.027
ep-0.025
eking-0.025
awed-0.025
Token So
Token ID1406
Feature activation+7.15e-01

Pos logit contributions
awed+0.007
irling+0.006
lic+0.006
ounced+0.006
 exactly+0.006
Neg logit contributions
inea-0.009
 conclusions-0.009
 difference-0.009
 zone-0.009
 deed-0.009
Token Sally
Token ID25737
Feature activation+5.74e-02

Pos logit contributions
 apology+0.017
crow+0.017
 conclusions+0.017
 difference+0.017
 zone+0.017
Neg logit contributions
irling-0.014
 exactly-0.012
awed-0.012
ounced-0.012
eping-0.012
Token stood
Token ID6204
Feature activation-8.36e+00

Pos logit contributions
crow+0.001
 lesson+0.001
 thing+0.001
 apology+0.001
 deed+0.001
Neg logit contributions
irling-0.001
eping-0.001
eking-0.001
years-0.001
awed-0.001
Token by
Token ID416
Feature activation-2.17e-01

Pos logit contributions
lic+0.226
ounced+0.224
awed+0.221
ents+0.214
irling+0.214
Neg logit contributions
 thing-0.140
 end-0.135
 side-0.126
crow-0.123
 lesson-0.118
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
irling+0.005
eping+0.005
ep+0.005
eking+0.005
awed+0.005
Neg logit contributions
crow-0.004
 difference-0.004
 lesson-0.004
 apology-0.004
 deed-0.004
Token f
Token ID277
Feature activation-4.46e+00

Pos logit contributions
 deed+0.003
crow+0.003
 zone+0.003
 pleasures+0.003
 apology+0.003
Neg logit contributions
eping-0.002
irling-0.002
lic-0.002
ep-0.002
eking-0.002
Tokenauc
Token ID14272
Feature activation-2.17e+00

Pos logit contributions
irling+0.119
eping+0.115
awed+0.111
ep+0.110
ounced+0.110
Neg logit contributions
crow-0.078
 apology-0.073
 thing-0.073
 lesson-0.072
 difference-0.071
Tokenet
Token ID316
Feature activation-3.87e-01

Pos logit contributions
eping+0.044
ep+0.043
irling+0.042
awed+0.041
eking+0.040
Neg logit contributions
 difference-0.049
 deed-0.048
 lesson-0.048
 conclusions-0.048
crow-0.046
Token was
Token ID373
Feature activation-4.79e+00

Pos logit contributions
irling+0.013
eping+0.012
awed+0.012
ep+0.012
ounced+0.012
Neg logit contributions
crow-0.017
 thing-0.016
 lesson-0.016
 apology-0.016
 difference-0.016
Token finished
Token ID5201
Feature activation-2.72e+00

Pos logit contributions
awed+0.102
years+0.097
irling+0.097
ounced+0.093
eping+0.083
Neg logit contributions
 end-0.104
 lesson-0.095
 payoff-0.094
 thing-0.091
 day-0.088
Token,
Token ID11
Feature activation-3.73e+00

Pos logit contributions
irling+0.061
eping+0.060
awed+0.058
ounced+0.057
years+0.057
Neg logit contributions
 thing-0.056
crow-0.056
 lesson-0.054
 difference-0.054
 deed-0.053
Token the
Token ID262
Feature activation-7.42e-01

Pos logit contributions
irling+0.088
eping+0.084
awed+0.081
ep+0.080
ounced+0.080
Neg logit contributions
crow-0.076
 apology-0.073
 lesson-0.073
 thing-0.073
 difference-0.072
Token bunny
Token ID44915
Feature activation-2.78e+00

Pos logit contributions
irling+0.017
eping+0.017
ep+0.016
awed+0.015
eking+0.015
Neg logit contributions
crow-0.016
 apology-0.015
 conclusions-0.014
 lesson-0.014
 deed-0.014
Token stood
Token ID6204
Feature activation-8.35e+00

Pos logit contributions
irling+0.057
eping+0.054
eking+0.052
years+0.051
awed+0.049
Neg logit contributions
crow-0.065
 thing-0.062
 lesson-0.062
 end-0.061
 difference-0.060
Token back
Token ID736
Feature activation-9.04e-01

Pos logit contributions
lic+0.218
ounced+0.208
awed+0.200
years+0.199
irling+0.196
Neg logit contributions
 thing-0.145
 end-0.140
 side-0.136
 difference-0.125
crow-0.115
Token and
Token ID290
Feature activation-5.19e+00

Pos logit contributions
irling+0.020
eping+0.019
lic+0.019
ounced+0.018
awed+0.018
Neg logit contributions
 thing-0.019
crow-0.018
 lesson-0.018
 zone-0.018
 apology-0.018
Token admired
Token ID29382
Feature activation-2.10e+00

Pos logit contributions
irling+0.103
eping+0.097
lic+0.095
years+0.094
awed+0.087
Neg logit contributions
 thing-0.123
crow-0.117
 difference-0.115
 lesson-0.115
 payoff-0.114
Token it
Token ID340
Feature activation+1.98e+00

Pos logit contributions
irling+0.044
eping+0.042
awed+0.039
ep+0.039
 exactly+0.039
Neg logit contributions
crow-0.048
 apology-0.046
 lesson-0.046
 difference-0.045
 deed-0.045
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 thing+0.038
 lesson+0.038
crow+0.038
 difference+0.037
 deed+0.037
Neg logit contributions
irling-0.048
eping-0.046
awed-0.045
ounced-0.045
ep-0.044
Token park
Token ID3952
Feature activation-2.15e+00

Pos logit contributions
crow+0.016
 deed+0.016
 zone+0.015
 apology+0.015
 lesson+0.015
Neg logit contributions
irling-0.018
eping-0.017
eking-0.016
 walking-0.016
ep-0.016
Token.
Token ID13
Feature activation+7.20e-02

Pos logit contributions
irling+0.050
eping+0.048
awed+0.046
ep+0.046
ounced+0.046
Neg logit contributions
crow-0.043
 apology-0.042
 lesson-0.041
 thing-0.041
 difference-0.041
Token As
Token ID1081
Feature activation-4.16e-01

Pos logit contributions
crow+0.002
 apology+0.002
 lesson+0.002
 thing+0.002
 difference+0.001
Neg logit contributions
irling-0.002
eping-0.002
ep-0.001
awed-0.001
ounced-0.001
Token Emma
Token ID18966
Feature activation+1.14e+00

Pos logit contributions
irling+0.009
eping+0.009
awed+0.009
ounced+0.008
ep+0.008
Neg logit contributions
 lesson-0.009
 thing-0.009
crow-0.009
 difference-0.009
 deed-0.009
Token skipped
Token ID26684
Feature activation-4.16e+00

Pos logit contributions
crow+0.025
 thing+0.023
 end+0.023
 am+0.023
 payoff+0.023
Neg logit contributions
eping-0.023
irling-0.022
eking-0.022
years-0.021
 exactly-0.021
Token pe
Token ID613
Feature activation-8.35e+00

Pos logit contributions
irling+0.101
eping+0.098
awed+0.093
ep+0.093
ounced+0.092
Neg logit contributions
crow-0.084
 thing-0.080
 lesson-0.079
 apology-0.078
 deed-0.077
Tokenb
Token ID65
Feature activation-4.74e+00

Pos logit contributions
lic+0.149
irling+0.144
years+0.139
 brand+0.135
dream+0.134
Neg logit contributions
crow-0.203
 apology-0.183
 thing-0.176
 conclusions-0.170
 lesson-0.170
Tokenbles
Token ID7689
Feature activation-3.85e+00

Pos logit contributions
irling+0.077
eping+0.071
awed+0.066
years+0.065
ounced+0.065
Neg logit contributions
crow-0.131
 apology-0.126
 thing-0.125
 lesson-0.125
 deed-0.122
Token across
Token ID1973
Feature activation-6.85e-01

Pos logit contributions
irling+0.079
eping+0.076
awed+0.072
ep+0.072
ounced+0.071
Neg logit contributions
crow-0.090
 thing-0.086
 lesson-0.086
 apology-0.086
 difference-0.085
Token the
Token ID262
Feature activation-7.00e-01

Pos logit contributions
irling+0.014
 walking+0.014
eping+0.013
years+0.013
awed+0.013
Neg logit contributions
 apology-0.015
 deed-0.015
 difference-0.015
 affect-0.015
 lesson-0.015
Token lake
Token ID13546
Feature activation-4.08e+00

Pos logit contributions
irling+0.016
eping+0.015
ep+0.015
awed+0.014
ounced+0.014
Neg logit contributions
 apology-0.015
 deed-0.015
crow-0.014
 lesson-0.014
 zone-0.014
Token laughing
Token ID14376
Feature activation-2.93e+00

Pos logit contributions
irling+0.060
eping+0.059
ep+0.055
awed+0.053
ounced+0.051
Neg logit contributions
crow-0.120
 thing-0.119
 difference-0.117
 lesson-0.116
 apology-0.115
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
irling+0.082
eping+0.079
ounced+0.077
ep+0.077
awed+0.076
Neg logit contributions
 thing-0.047
crow-0.046
 lesson-0.046
 deed-0.043
 difference-0.043
Token
Token ID198
Feature activation-2.26e+00

Pos logit contributions
irling+0.001
eping+0.001
awed+0.001
ep+0.001
ounced+0.001
Neg logit contributions
crow-0.001
 apology-0.001
 lesson-0.001
 thing-0.001
 difference-0.001
Token
Token ID198
Feature activation-1.41e+00

Pos logit contributions
irling+0.059
eping+0.057
awed+0.055
ounced+0.055
ep+0.054
Neg logit contributions
crow-0.040
 lesson-0.038
 apology-0.038
 thing-0.037
 difference-0.037
TokenGeorge
Token ID20191
Feature activation-2.36e+00

Pos logit contributions
ounced+0.034
eking+0.034
irling+0.033
awed+0.033
eping+0.033
Neg logit contributions
crow-0.028
 thing-0.026
 lesson-0.025
 end-0.024
 conclusions-0.023
Token pe
Token ID613
Feature activation-8.35e+00

Pos logit contributions
irling+0.052
eping+0.050
eking+0.048
awed+0.045
lic+0.045
Neg logit contributions
crow-0.052
 end-0.048
 thing-0.047
 lesson-0.046
 offer-0.045
Tokenep
Token ID538
Feature activation-5.42e+00

Pos logit contributions
 brand+0.180
 dreaming+0.173
dream+0.173
 com+0.168
lic+0.165
Neg logit contributions
crow-0.165
inea-0.137
 thing-0.135
 apology-0.131
fl-0.126
Tokened
Token ID276
Feature activation-5.19e+00

Pos logit contributions
irling+0.110
eping+0.098
awed+0.091
eking+0.091
ounced+0.091
Neg logit contributions
crow-0.136
 thing-0.125
 lesson-0.124
 end-0.121
 apology-0.118
Token at
Token ID379
Feature activation+3.99e-01

Pos logit contributions
irling+0.125
eping+0.118
ep+0.113
ounced+0.112
awed+0.111
Neg logit contributions
crow-0.109
 thing-0.107
 lesson-0.102
 apology-0.100
 deed-0.100
Token them
Token ID606
Feature activation-9.61e-01

Pos logit contributions
crow+0.009
 difference+0.009
 lesson+0.009
 apology+0.009
 pleasures+0.008
Neg logit contributions
eping-0.008
irling-0.008
years-0.008
ep-0.008
awed-0.007
Token and
Token ID290
Feature activation-4.96e+00

Pos logit contributions
irling+0.024
eping+0.023
ep+0.022
lic+0.022
ounced+0.022
Neg logit contributions
 thing-0.018
crow-0.018
 difference-0.018
 end-0.018
 lesson-0.017
Token small
Token ID1402
Feature activation-1.54e+00

Pos logit contributions
irling+0.064
eping+0.061
awed+0.060
eking+0.058
ounced+0.057
Neg logit contributions
crow-0.070
 thing-0.070
 lesson-0.069
 difference-0.069
 apology-0.068
Token mouse
Token ID10211
Feature activation-2.02e+00

Pos logit contributions
irling+0.039
eping+0.038
ep+0.036
awed+0.036
ounced+0.035
Neg logit contributions
crow-0.028
 apology-0.027
 lesson-0.027
 deed-0.026
 thing-0.026
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
irling+0.040
eping+0.038
awed+0.036
ounced+0.036
ep+0.036
Neg logit contributions
crow-0.049
 thing-0.047
 difference-0.047
 apology-0.046
 deed-0.046
Token He
Token ID679
Feature activation-1.81e+00

Pos logit contributions
irling+0.012
eping+0.011
awed+0.011
ounced+0.011
ep+0.011
Neg logit contributions
crow-0.008
 apology-0.007
 lesson-0.007
 thing-0.007
 deed-0.007
Token was
Token ID373
Feature activation-6.05e+00

Pos logit contributions
irling+0.049
eking+0.045
aj+0.045
eping+0.045
awed+0.042
Neg logit contributions
 lesson-0.029
 thing-0.029
crow-0.029
 end-0.028
 rob-0.028
Token pe
Token ID613
Feature activation-8.35e+00

Pos logit contributions
irling+0.120
awed+0.115
aj+0.111
leys+0.110
ounced+0.108
Neg logit contributions
crow-0.137
 thing-0.136
 difference-0.132
 end-0.129
 offer-0.129
Tokenering
Token ID1586
Feature activation-2.99e+00

Pos logit contributions
lic+0.101
aj+0.089
awed+0.085
dream+0.084
 brand+0.084
Neg logit contributions
crow-0.236
 thing-0.217
 apology-0.215
 decisions-0.214
 prove-0.212
Token up
Token ID510
Feature activation-2.13e+00

Pos logit contributions
irling+0.069
awed+0.063
eping+0.063
ounced+0.063
ep+0.062
Neg logit contributions
 thing-0.063
crow-0.063
 lesson-0.058
 difference-0.058
 deed-0.058
Token at
Token ID379
Feature activation+6.69e-01

Pos logit contributions
irling+0.048
eping+0.045
lic+0.045
ep+0.044
awed+0.044
Neg logit contributions
 thing-0.045
 end-0.043
 lesson-0.043
crow-0.042
 deed-0.041
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
crow+0.015
 apology+0.014
 difference+0.014
 lesson+0.014
 deed+0.014
Neg logit contributions
eping-0.014
irling-0.014
ep-0.013
 walking-0.013
awed-0.013
Token farmer
Token ID18739
Feature activation-7.73e-01

Pos logit contributions
crow+0.020
 apology+0.020
 zone+0.020
 conclusions+0.019
 deed+0.019
Neg logit contributions
eping-0.023
ep-0.021
irling-0.021
 walking-0.021
ounced-0.021
Token from
Token ID422
Feature activation-2.19e+00

Pos logit contributions
eping+0.091
irling+0.091
ep+0.088
lic+0.087
ents+0.084
Neg logit contributions
 difference-0.100
 apology-0.098
 conclusions-0.098
crow-0.097
 deed-0.096
Token a
Token ID257
Feature activation-2.13e+00

Pos logit contributions
irling+0.046
eping+0.044
ep+0.042
awed+0.042
ounced+0.041
Neg logit contributions
crow-0.050
 difference-0.049
 apology-0.048
 deed-0.048
 lesson-0.048
Token box
Token ID3091
Feature activation-3.16e+00

Pos logit contributions
irling+0.048
eping+0.047
ep+0.045
ounced+0.044
awed+0.044
Neg logit contributions
crow-0.045
 apology-0.043
 lesson-0.043
 deed-0.042
 difference-0.042
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
irling+0.088
ounced+0.081
eping+0.081
awed+0.080
years+0.078
Neg logit contributions
 thing-0.054
crow-0.052
 end-0.052
 deed-0.050
 zone-0.050
Token She
Token ID1375
Feature activation-1.88e+00

Pos logit contributions
crow+0.005
 apology+0.005
 lesson+0.005
 thing+0.005
 difference+0.005
Neg logit contributions
irling-0.006
eping-0.006
ep-0.006
eking-0.005
ounced-0.005
Token pe
Token ID613
Feature activation-8.34e+00

Pos logit contributions
irling+0.055
years+0.055
eping+0.054
lic+0.053
eking+0.053
Neg logit contributions
 apology-0.026
 lesson-0.025
 offer-0.025
 difference-0.024
 end-0.023
Tokenels
Token ID1424
Feature activation-1.50e+00

Pos logit contributions
lic+0.176
dream+0.171
 hopping+0.169
irling+0.166
years+0.162
Neg logit contributions
 apology-0.160
crow-0.155
 thing-0.149
 am-0.140
 choice-0.140
Token off
Token ID572
Feature activation-3.18e-01

Pos logit contributions
irling+0.030
eping+0.030
ep+0.028
awed+0.028
ounced+0.028
Neg logit contributions
crow-0.034
 difference-0.034
 apology-0.034
 deed-0.034
 lesson-0.033
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
irling+0.008
eping+0.008
awed+0.007
ep+0.007
ounced+0.007
Neg logit contributions
crow-0.006
 apology-0.006
 difference-0.006
 lesson-0.006
 deed-0.006
Token paper
Token ID3348
Feature activation-2.40e+00

Pos logit contributions
crow+0.006
 deed+0.006
 apology+0.006
 lesson+0.006
 difference+0.006
Neg logit contributions
irling-0.005
eping-0.005
ep-0.005
eking-0.005
 walking-0.005
Token and
Token ID290
Feature activation-5.13e+00

Pos logit contributions
irling+0.045
eping+0.044
ep+0.042
awed+0.042
ounced+0.041
Neg logit contributions
crow-0.058
 apology-0.057
 lesson-0.057
 deed-0.057
 difference-0.056
Token her
Token ID607
Feature activation+5.42e-01

Pos logit contributions
crow+0.016
 thing+0.016
 lesson+0.016
 apology+0.016
 deed+0.015
Neg logit contributions
irling-0.016
eping-0.015
awed-0.015
ounced-0.015
ep-0.014
Token dad
Token ID9955
Feature activation+3.44e-01

Pos logit contributions
crow+0.012
 conclusions+0.011
 zone+0.011
 affect+0.011
 apology+0.011
Neg logit contributions
irling-0.012
eping-0.011
 exactly-0.011
ounced-0.011
 walking-0.011
Token,
Token ID11
Feature activation-2.67e+00

Pos logit contributions
crow+0.008
 apology+0.008
 lesson+0.008
 conclusions+0.008
 difference+0.008
Neg logit contributions
irling-0.007
eping-0.006
years-0.006
ounced-0.006
awed-0.006
Token "
Token ID366
Feature activation+1.03e+00

Pos logit contributions
irling+0.058
eping+0.055
awed+0.053
ounced+0.052
ep+0.052
Neg logit contributions
crow-0.060
 thing-0.059
 apology-0.058
 lesson-0.058
 difference-0.057
TokenWhat
Token ID2061
Feature activation+9.77e+00

Pos logit contributions
crow+0.019
 apology+0.018
 lesson+0.018
 thing+0.018
 difference+0.018
Neg logit contributions
irling-0.027
eping-0.026
awed-0.025
ep-0.025
ounced-0.024
Token if
Token ID611
Feature activation+7.98e+00

Pos logit contributions
crow+0.168
 apology+0.159
 lesson+0.158
 thing+0.158
 difference+0.156
Neg logit contributions
irling-0.261
eping-0.252
awed-0.242
ep-0.242
ounced-0.240
Token it
Token ID340
Feature activation+5.80e+00

Pos logit contributions
crow+0.168
 apology+0.160
 deed+0.159
 difference+0.157
 conclusions+0.156
Neg logit contributions
irling-0.177
eping-0.166
ounced-0.162
 walking-0.162
ep-0.161
Token rains
Token ID29424
Feature activation+3.39e+00

Pos logit contributions
crow+0.098
 lesson+0.096
 thing+0.095
 apology+0.094
 difference+0.093
Neg logit contributions
irling-0.158
eping-0.153
awed-0.147
years-0.146
ep-0.145
Token at
Token ID379
Feature activation+5.47e+00

Pos logit contributions
crow+0.076
 deed+0.073
 apology+0.073
 lesson+0.071
 difference+0.071
Neg logit contributions
irling-0.072
eping-0.068
ep-0.065
ounced-0.064
eking-0.064
Token the
Token ID262
Feature activation+2.84e+00

Pos logit contributions
crow+0.119
 apology+0.115
 deed+0.113
 zone+0.111
 pleasures+0.111
Neg logit contributions
irling-0.118
eping-0.111
ep-0.111
 walking-0.106
eking-0.105
Token picnic
Token ID35715
Feature activation+1.26e+00

Pos logit contributions
crow+0.064
 apology+0.062
 deed+0.060
 zone+0.060
 lesson+0.059
Neg logit contributions
irling-0.060
eping-0.056
ep-0.054
ounced-0.053
awed-0.053
Token it
Token ID340
Feature activation+2.46e+00

Pos logit contributions
irling+0.035
eping+0.034
ep+0.032
eking+0.032
awed+0.031
Neg logit contributions
crow-0.036
 apology-0.034
 lesson-0.033
 difference-0.033
 deed-0.033
Token's
Token ID338
Feature activation-1.30e-01

Pos logit contributions
crow+0.059
 apology+0.058
 deed+0.057
 conclusions+0.057
 thing+0.057
Neg logit contributions
irling-0.047
eping-0.045
ep-0.043
ounced-0.043
awed-0.043
Token important
Token ID1593
Feature activation+6.03e+00

Pos logit contributions
irling+0.002
eping+0.002
ep+0.002
eking+0.002
 exactly+0.002
Neg logit contributions
crow-0.003
 apology-0.003
 zone-0.003
 conclusions-0.003
 difference-0.003
Token to
Token ID284
Feature activation+5.31e+00

Pos logit contributions
crow+0.160
 conclusions+0.158
 difference+0.158
 apology+0.148
 deed+0.144
Neg logit contributions
irling-0.092
 walking-0.091
ounced-0.091
 exactly-0.090
eping-0.088
Token mind
Token ID2000
Feature activation+2.93e+00

Pos logit contributions
crow+0.090
 thing+0.082
 lesson+0.082
 zone+0.082
 apology+0.081
Neg logit contributions
irling-0.143
eping-0.138
ep-0.134
eking-0.131
ounced-0.128
Token what
Token ID644
Feature activation+8.22e+00

Pos logit contributions
crow+0.070
 apology+0.069
 lesson+0.068
 deed+0.066
 difference+0.066
Neg logit contributions
irling-0.058
eping-0.055
ep-0.052
eking-0.051
 exactly-0.050
Token your
Token ID534
Feature activation+4.89e+00

Pos logit contributions
crow+0.176
 apology+0.167
 deed+0.165
 conclusions+0.162
 zone+0.159
Neg logit contributions
irling-0.178
eping-0.170
 walking-0.168
ep-0.166
 pointed-0.160
Token parents
Token ID3397
Feature activation+5.57e+00

Pos logit contributions
crow+0.112
igator+0.099
 conclusions+0.098
 zone+0.096
 deed+0.095
Neg logit contributions
irling-0.108
aj-0.100
ep-0.099
eping-0.098
eking-0.094
Token say
Token ID910
Feature activation+3.69e+00

Pos logit contributions
crow+0.101
 apology+0.094
 lesson+0.094
 thing+0.094
 deed+0.093
Neg logit contributions
irling-0.143
eping-0.139
awed-0.133
ep-0.133
ounced-0.131
Token,
Token ID11
Feature activation-9.26e-01

Pos logit contributions
crow+0.086
 conclusions+0.083
 deed+0.083
 difference+0.082
 apology+0.081
Neg logit contributions
irling-0.073
eping-0.070
ep-0.066
 exactly-0.065
eking-0.065
Token because
Token ID780
Feature activation+5.04e+00

Pos logit contributions
irling+0.018
eping+0.017
ep+0.016
awed+0.016
ounced+0.016
Neg logit contributions
crow-0.023
 apology-0.022
 thing-0.022
 lesson-0.022
 difference-0.022
Token space
Token ID2272
Feature activation+2.64e-01

Pos logit contributions
crow+0.085
 lesson+0.079
 apology+0.078
 zone+0.077
 deed+0.076
Neg logit contributions
irling-0.107
eping-0.107
ep-0.105
eking-0.099
ents-0.098
Token."
Token ID526
Feature activation-2.11e-01

Pos logit contributions
crow+0.006
 apology+0.006
 deed+0.006
 difference+0.006
 lesson+0.006
Neg logit contributions
ep-0.005
irling-0.005
eping-0.005
eking-0.005
eps-0.005
Token
Token ID198
Feature activation-2.08e+00

Pos logit contributions
irling+0.006
eping+0.006
ep+0.005
ounced+0.005
awed+0.005
Neg logit contributions
crow-0.004
 thing-0.003
 apology-0.003
 lesson-0.003
 difference-0.003
Token
Token ID198
Feature activation-1.36e+00

Pos logit contributions
irling+0.057
eping+0.055
awed+0.053
ounced+0.053
eking+0.052
Neg logit contributions
crow-0.035
 lesson-0.033
 apology-0.033
 thing-0.032
 difference-0.031
Token"
Token ID1
Feature activation+2.30e+00

Pos logit contributions
eping+0.033
irling+0.033
ounced+0.033
years+0.032
awed+0.032
Neg logit contributions
crow-0.027
 thing-0.025
 lesson-0.024
 end-0.024
 conclusions-0.023
TokenWhat
Token ID2061
Feature activation+9.20e+00

Pos logit contributions
 difference+0.038
crow+0.037
 apology+0.037
 deed+0.037
 lesson+0.035
Neg logit contributions
irling-0.063
eping-0.061
ep-0.060
awed-0.060
ounced-0.058
Token is
Token ID318
Feature activation+1.13e+00

Pos logit contributions
crow+0.145
 thing+0.142
 lesson+0.141
 apology+0.139
 difference+0.137
Neg logit contributions
irling-0.260
eping-0.249
awed-0.243
ounced-0.240
ep-0.238
Token space
Token ID2272
Feature activation+1.29e+00

Pos logit contributions
crow+0.022
 thing+0.021
 lesson+0.021
 apology+0.021
 difference+0.020
Neg logit contributions
irling-0.028
eping-0.027
awed-0.026
ounced-0.026
lic-0.025
Token?"
Token ID1701
Feature activation-1.01e+00

Pos logit contributions
crow+0.028
 apology+0.028
 deed+0.027
 lesson+0.027
 conclusions+0.027
Neg logit contributions
irling-0.028
eping-0.027
ep-0.026
eking-0.025
lic-0.025
Token Ben
Token ID3932
Feature activation-2.49e+00

Pos logit contributions
irling+0.024
eping+0.024
ep+0.022
ounced+0.022
awed+0.022
Neg logit contributions
crow-0.021
 thing-0.020
 lesson-0.020
 apology-0.020
 difference-0.019
Token asked
Token ID1965
Feature activation-3.41e-01

Pos logit contributions
irling+0.064
ounced+0.061
awed+0.060
lic+0.059
years+0.057
Neg logit contributions
crow-0.041
 thing-0.041
 end-0.040
 means-0.040
 zone-0.039
Token said
Token ID531
Feature activation-3.33e+00

Pos logit contributions
irling+0.086
awed+0.085
ounced+0.079
eping+0.077
lic+0.077
Neg logit contributions
crow-0.068
 thing-0.063
 end-0.061
 prove-0.059
 offer-0.059
Token to
Token ID284
Feature activation-7.19e-01

Pos logit contributions
irling+0.081
eping+0.076
ounced+0.075
awed+0.075
ep+0.074
Neg logit contributions
crow-0.066
 thing-0.065
 apology-0.063
 lesson-0.063
 deed-0.062
Token herself
Token ID5223
Feature activation-2.31e+00

Pos logit contributions
irling+0.013
eping+0.012
 walking+0.011
eking+0.011
years+0.011
Neg logit contributions
crow-0.019
 lesson-0.018
 difference-0.018
 zone-0.018
 deed-0.018
Token,
Token ID11
Feature activation-4.66e+00

Pos logit contributions
irling+0.060
eping+0.058
awed+0.056
ep+0.056
ounced+0.056
Neg logit contributions
crow-0.041
 apology-0.039
 thing-0.039
 lesson-0.039
 difference-0.039
Token "
Token ID366
Feature activation-2.67e-01

Pos logit contributions
ounced+0.099
lic+0.097
irling+0.096
eping+0.095
eps+0.094
Neg logit contributions
 side-0.094
 end-0.093
 thing-0.092
 offer-0.089
crow-0.089
Tokenshould
Token ID21754
Feature activation+8.30e+00

Pos logit contributions
irling+0.006
eping+0.006
ep+0.006
years+0.006
lic+0.006
Neg logit contributions
 difference-0.006
 apology-0.006
 choice-0.006
 lesson-0.005
 deed-0.005
Token I
Token ID314
Feature activation+6.98e+00

Pos logit contributions
crow+0.173
 apology+0.164
 lesson+0.161
 deed+0.161
 difference+0.158
Neg logit contributions
irling-0.188
eping-0.173
eking-0.172
 walking-0.170
ep-0.168
Token follow
Token ID1061
Feature activation+2.41e+00

Pos logit contributions
crow+0.141
 lesson+0.139
 pleasures+0.137
 apology+0.137
 difference+0.137
Neg logit contributions
irling-0.148
 walking-0.142
eping-0.142
ep-0.139
 hopping-0.139
Token the
Token ID262
Feature activation+3.26e+00

Pos logit contributions
 lesson+0.056
crow+0.055
 apology+0.055
 deed+0.055
 difference+0.054
Neg logit contributions
irling-0.046
eping-0.044
 walking-0.042
eking-0.042
ep-0.042
Token path
Token ID3108
Feature activation+4.65e-02

Pos logit contributions
crow+0.056
 apology+0.053
 deed+0.051
 difference+0.051
 lesson+0.051
Neg logit contributions
irling-0.085
eping-0.083
ep-0.079
ounced-0.078
eking-0.078
Token?"
Token ID1701
Feature activation-1.73e+00

Pos logit contributions
 apology+0.001
crow+0.001
 deed+0.001
 lesson+0.001
 difference+0.001
Neg logit contributions
irling-0.001
eping-0.001
ep-0.001
eking-0.001
 exactly-0.001
TokenOh
Token ID5812
Feature activation+5.70e+00

Pos logit contributions
 apology+0.035
 difference+0.033
 lesson+0.033
 choice+0.033
 deed+0.032
Neg logit contributions
irling-0.038
eping-0.038
ep-0.037
awed-0.036
lic-0.035
Token no
Token ID645
Feature activation+3.85e+00

Pos logit contributions
 thing+0.114
crow+0.113
 lesson+0.109
 apology+0.109
 difference+0.108
Neg logit contributions
irling-0.141
eping-0.134
awed-0.130
ep-0.126
eking-0.125
Token,
Token ID11
Feature activation+1.97e+00

Pos logit contributions
crow+0.090
 apology+0.087
 thing+0.086
 lesson+0.086
 difference+0.085
Neg logit contributions
irling-0.081
eping-0.077
awed-0.073
ep-0.073
ounced-0.072
Token what
Token ID644
Feature activation+7.87e+00

Pos logit contributions
crow+0.043
 lesson+0.040
 apology+0.040
 deed+0.040
 zone+0.039
Neg logit contributions
irling-0.042
eping-0.041
ep-0.039
 exactly-0.039
ounced-0.038
Token should
Token ID815
Feature activation+6.33e+00

Pos logit contributions
crow+0.163
 apology+0.157
 difference+0.154
 deed+0.154
 lesson+0.153
Neg logit contributions
irling-0.181
eping-0.176
ep-0.168
awed-0.167
ounced-0.166
Token we
Token ID356
Feature activation+8.06e+00

Pos logit contributions
 conclusions+0.152
 zone+0.151
 pleasures+0.151
crow+0.149
 deed+0.149
Neg logit contributions
 walking-0.112
 pointed-0.103
irling-0.098
ep-0.098
 hopping-0.098
Token do
Token ID466
Feature activation+4.50e+00

Pos logit contributions
crow+0.142
 apology+0.134
 lesson+0.131
 thing+0.130
 deed+0.129
Neg logit contributions
irling-0.209
eping-0.205
ep-0.198
ounced-0.193
lic-0.193
Token?"
Token ID1701
Feature activation-1.32e+00

Pos logit contributions
crow+0.106
 apology+0.102
 deed+0.101
 conclusions+0.099
 difference+0.098
Neg logit contributions
irling-0.094
eping-0.088
ep-0.086
eking-0.082
 walking-0.081
Token The
Token ID383
Feature activation-5.49e-01

Pos logit contributions
irling+0.032
eping+0.031
awed+0.031
lic+0.031
ep+0.030
Neg logit contributions
crow-0.023
 deed-0.023
 lesson-0.023
 apology-0.023
 difference-0.023
Token little
Token ID1310
Feature activation+5.04e-01

Pos logit contributions
irling+0.012
eping+0.012
awed+0.011
ounced+0.011
ep+0.011
Neg logit contributions
crow-0.012
 apology-0.012
 thing-0.012
 lesson-0.012
 difference-0.012
Token bird
Token ID6512
Feature activation-2.92e+00

Pos logit contributions
crow+0.007
 apology+0.007
 lesson+0.007
 difference+0.007
 deed+0.007
Neg logit contributions
irling-0.015
eping-0.014
ep-0.014
awed-0.014
ounced-0.014
Token kind
Token ID1611
Feature activation+1.43e+00

Pos logit contributions
irling+0.065
awed+0.061
eping+0.059
lic+0.059
ounced+0.058
Neg logit contributions
 apology-0.086
 thing-0.086
crow-0.086
 lesson-0.086
 end-0.082
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
 conclusions+0.036
 affect+0.036
 apology+0.035
 lesson+0.034
crow+0.034
Neg logit contributions
ents-0.024
awed-0.022
eping-0.022
ep-0.022
lic-0.022
Token
Token ID198
Feature activation-9.57e-01

Pos logit contributions
 deed+0.027
 difference+0.026
 lesson+0.026
crow+0.025
 affect+0.025
Neg logit contributions
irling-0.022
lic-0.022
eping-0.021
ep-0.021
awed-0.020
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
irling+0.022
eping+0.021
ep+0.021
awed+0.020
lic+0.020
Neg logit contributions
crow-0.020
 deed-0.019
 thing-0.019
 zone-0.019
 difference-0.019
Token"
Token ID1
Feature activation+3.28e+00

Pos logit contributions
irling+0.014
eping+0.013
awed+0.012
ep+0.012
ounced+0.012
Neg logit contributions
crow-0.016
 apology-0.015
 lesson-0.015
 thing-0.015
 difference-0.015
TokenOkay
Token ID16454
Feature activation+5.63e+00

Pos logit contributions
 difference+0.068
 deed+0.067
 choice+0.066
crow+0.064
 decisions+0.064
Neg logit contributions
ep-0.073
irling-0.073
awed-0.070
eping-0.069
lic-0.069
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
 conclusions+0.171
crow+0.169
 deed+0.164
 affect+0.163
 zone+0.161
Neg logit contributions
irling-0.067
 walking-0.065
ep-0.065
 exactly-0.062
ents-0.062
Token I
Token ID314
Feature activation+6.77e+00

Pos logit contributions
crow+0.028
 zone+0.028
 deed+0.027
 affect+0.027
 conclusions+0.026
Neg logit contributions
irling-0.023
ents-0.022
ep-0.022
 exactly-0.022
eping-0.022
Token will
Token ID481
Feature activation+3.67e+00

Pos logit contributions
crow+0.117
 conclusions+0.110
 deed+0.106
 zone+0.102
keeping+0.100
Neg logit contributions
ep-0.165
irling-0.164
ounced-0.164
eping-0.163
lic-0.163
Token be
Token ID307
Feature activation-9.12e-01

Pos logit contributions
crow+0.075
 zone+0.068
 lesson+0.067
 thing+0.067
 apology+0.066
Neg logit contributions
irling-0.086
eping-0.081
ep-0.081
 walking-0.077
eking-0.077
Token more
Token ID517
Feature activation+3.03e+00

Pos logit contributions
irling+0.019
awed+0.018
eping+0.018
lic+0.018
ounced+0.018
Neg logit contributions
crow-0.020
 thing-0.020
 apology-0.020
 difference-0.020
 lesson-0.019
Token loved
Token ID6151
Feature activation-1.88e-01

Pos logit contributions
 thing+0.016
 lesson+0.016
 am+0.016
crow+0.015
 end+0.015
Neg logit contributions
irling-0.014
eping-0.013
ounced-0.013
eking-0.013
ents-0.013
Token to
Token ID284
Feature activation+3.06e+00

Pos logit contributions
irling+0.004
eping+0.004
awed+0.004
ep+0.004
ounced+0.004
Neg logit contributions
crow-0.004
 thing-0.004
 lesson-0.004
 apology-0.004
 difference-0.004
Token wash
Token ID13502
Feature activation+1.01e+00

Pos logit contributions
crow+0.055
 apology+0.053
 thing+0.053
 lesson+0.053
 deed+0.051
Neg logit contributions
irling-0.079
eping-0.077
awed-0.074
ounced-0.074
ep-0.073
Token with
Token ID351
Feature activation+1.60e+00

Pos logit contributions
 apology+0.027
crow+0.027
 difference+0.027
 deed+0.026
 payoff+0.026
Neg logit contributions
irling-0.017
eping-0.016
ep-0.015
eking-0.014
 exactly-0.014
Token her
Token ID607
Feature activation+4.32e-01

Pos logit contributions
crow+0.039
 apology+0.037
 difference+0.037
 lesson+0.036
 deed+0.036
Neg logit contributions
irling-0.032
eping-0.031
ep-0.029
awed-0.029
ounced-0.028
Token favorite
Token ID4004
Feature activation+1.96e+00

Pos logit contributions
crow+0.011
 apology+0.010
 difference+0.010
 deed+0.010
 conclusions+0.010
Neg logit contributions
irling-0.009
eping-0.008
awed-0.008
ep-0.008
ounced-0.008
Token shampoo
Token ID47259
Feature activation-1.48e-02

Pos logit contributions
crow+0.047
 difference+0.045
 conclusions+0.044
 deed+0.043
 apology+0.043
Neg logit contributions
irling-0.037
eping-0.037
ep-0.036
awed-0.034
 walking-0.034
Token.
Token ID13
Feature activation+9.18e-01

Pos logit contributions
irling+0.000
awed+0.000
eping+0.000
ounced+0.000
ep+0.000
Neg logit contributions
 conclusions-0.000
 apology-0.000
crow-0.000
 difference-0.000
 deed-0.000
Token One
Token ID1881
Feature activation+3.23e+00

Pos logit contributions
crow+0.021
 apology+0.020
 deed+0.020
 lesson+0.020
 thing+0.020
Neg logit contributions
irling-0.019
eping-0.018
awed-0.018
ep-0.017
lic-0.017
Token day
Token ID1110
Feature activation+1.21e-01

Pos logit contributions
crow+0.036
 apology+0.034
 difference+0.033
 conclusions+0.033
 affect+0.031
Neg logit contributions
irling-0.103
eping-0.102
ep-0.099
 walking-0.097
awed-0.097
Token,
Token ID11
Feature activation-2.20e+00

Pos logit contributions
 thing+0.002
crow+0.002
 lesson+0.002
 end+0.002
 am+0.002
Neg logit contributions
eping-0.003
irling-0.003
ep-0.003
eking-0.003
years-0.003
Token they
Token ID484
Feature activation+4.75e-01

Pos logit contributions
crow+0.002
 apology+0.002
 difference+0.002
 thing+0.002
 lesson+0.002
Neg logit contributions
irling-0.002
eping-0.002
ounced-0.002
awed-0.002
ep-0.002
Token chose
Token ID7690
Feature activation-7.66e-01

Pos logit contributions
crow+0.012
 lesson+0.011
 thing+0.011
 apology+0.011
 end+0.011
Neg logit contributions
irling-0.010
eping-0.009
awed-0.009
ounced-0.008
eking-0.008
Token to
Token ID284
Feature activation+1.35e+00

Pos logit contributions
irling+0.014
eping+0.014
awed+0.013
ounced+0.013
ep+0.013
Neg logit contributions
crow-0.019
 apology-0.019
 deed-0.018
 difference-0.018
 conclusions-0.018
Token keep
Token ID1394
Feature activation+1.05e-01

Pos logit contributions
crow+0.024
 zone+0.022
 thing+0.022
 lesson+0.022
 apology+0.021
Neg logit contributions
irling-0.035
eping-0.033
ep-0.033
eking-0.031
 walking-0.031
Token playing
Token ID2712
Feature activation+3.17e-01

Pos logit contributions
crow+0.003
 thing+0.003
 difference+0.002
 lesson+0.002
 apology+0.002
Neg logit contributions
irling-0.002
lic-0.002
eping-0.002
awed-0.002
ounced-0.002
Token in
Token ID287
Feature activation+1.64e+00

Pos logit contributions
crow+0.008
 apology+0.007
 lesson+0.007
 thing+0.007
 difference+0.007
Neg logit contributions
irling-0.006
eping-0.006
ep-0.006
awed-0.006
ounced-0.005
Token the
Token ID262
Feature activation+1.47e+00

Pos logit contributions
crow+0.037
 apology+0.034
 zone+0.034
 deed+0.034
 lesson+0.033
Neg logit contributions
irling-0.036
eping-0.035
ep-0.033
 walking-0.032
 exactly-0.031
Token loud
Token ID7812
Feature activation+1.87e+00

Pos logit contributions
crow+0.030
 thing+0.029
 lesson+0.029
 difference+0.029
 apology+0.028
Neg logit contributions
irling-0.035
eping-0.034
awed-0.033
ounced-0.032
lic-0.032
Token environment
Token ID2858
Feature activation+3.20e-01

Pos logit contributions
 apology+0.037
crow+0.036
 lesson+0.035
 deed+0.035
inea+0.034
Neg logit contributions
irling-0.042
eping-0.041
ep-0.039
 walking-0.039
awed-0.038
Token!
Token ID0
Feature activation+9.80e-01

Pos logit contributions
crow+0.007
 apology+0.007
 lesson+0.007
 deed+0.007
 thing+0.007
Neg logit contributions
irling-0.007
eping-0.006
ep-0.006
awed-0.006
eking-0.006
Token They
Token ID1119
Feature activation-5.21e-01

Pos logit contributions
 thing+0.025
 affect+0.025
 apology+0.025
 deed+0.025
 choice+0.025
Neg logit contributions
lic-0.015
awed-0.015
ents-0.015
 walking-0.014
irling-0.014
Token named
Token ID3706
Feature activation-1.58e+00

Pos logit contributions
irling+0.015
eping+0.014
awed+0.013
ounced+0.013
lic+0.012
Neg logit contributions
crow-0.030
 thing-0.029
 apology-0.028
 deed-0.028
 difference-0.028
Token Lily
Token ID20037
Feature activation+7.64e-01

Pos logit contributions
irling+0.046
eping+0.045
ep+0.043
awed+0.042
ounced+0.042
Neg logit contributions
crow-0.025
 thing-0.025
 lesson-0.025
 end-0.024
 apology-0.024
Token.
Token ID13
Feature activation+2.44e+00

Pos logit contributions
crow+0.015
 apology+0.015
 lesson+0.015
 difference+0.014
 deed+0.014
Neg logit contributions
irling-0.018
eping-0.018
ep-0.017
awed-0.017
ounced-0.016
Token She
Token ID1375
Feature activation-9.32e-01

Pos logit contributions
crow+0.056
 apology+0.054
 lesson+0.054
 thing+0.053
 difference+0.053
Neg logit contributions
irling-0.051
eping-0.049
awed-0.047
ep-0.046
ounced-0.046
Token loved
Token ID6151
Feature activation-1.09e+00

Pos logit contributions
irling+0.020
ents+0.020
eking+0.019
ounced+0.019
eping+0.019
Neg logit contributions
 thing-0.021
 lesson-0.021
 am-0.020
 end-0.020
crow-0.020
Token to
Token ID284
Feature activation+2.06e+00

Pos logit contributions
irling+0.029
eping+0.028
awed+0.027
ounced+0.027
eking+0.027
Neg logit contributions
 thing-0.019
 lesson-0.019
crow-0.018
 deed-0.018
 apology-0.017
Token play
Token ID711
Feature activation-6.96e-01

Pos logit contributions
crow+0.038
 apology+0.035
 lesson+0.035
 thing+0.035
 difference+0.035
Neg logit contributions
irling-0.053
eping-0.051
ep-0.049
awed-0.049
ounced-0.048
Token with
Token ID351
Feature activation+2.15e+00

Pos logit contributions
ounced+0.018
years+0.017
irling+0.016
ents+0.016
eping+0.016
Neg logit contributions
 thing-0.015
crow-0.013
 end-0.012
 am-0.012
 offer-0.012
Token her
Token ID607
Feature activation+5.06e-01

Pos logit contributions
 thing+0.041
crow+0.037
 lesson+0.037
 apology+0.035
 conclusions+0.035
Neg logit contributions
irling-0.054
ounced-0.053
lic-0.052
eping-0.052
eking-0.052
Token toys
Token ID14958
Feature activation-2.04e-01

Pos logit contributions
crow+0.011
 thing+0.011
 lesson+0.011
 apology+0.011
 difference+0.011
Neg logit contributions
irling-0.011
eping-0.011
awed-0.010
ounced-0.010
ep-0.010
Token and
Token ID290
Feature activation-2.24e+00

Pos logit contributions
irling+0.005
eping+0.004
ounced+0.004
awed+0.004
lic+0.004
Neg logit contributions
crow-0.004
 thing-0.004
 lesson-0.004
 end-0.004
 difference-0.004
Token thinks
Token ID6834
Feature activation+1.98e+00

Pos logit contributions
irling+0.022
eping+0.021
awed+0.020
ep+0.020
ounced+0.020
Neg logit contributions
crow-0.016
 apology-0.016
 lesson-0.016
 thing-0.016
 difference-0.015
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
crow+0.044
 apology+0.043
 thing+0.042
 lesson+0.042
 difference+0.042
Neg logit contributions
irling-0.043
eping-0.041
awed-0.039
ounced-0.039
ep-0.038
Token p
Token ID279
Feature activation-4.09e+00

Pos logit contributions
 zone+0.079
 deed+0.079
crow+0.077
 conclusions+0.076
 affect+0.076
Neg logit contributions
irling-0.023
ep-0.021
lic-0.021
 walking-0.021
ounced-0.021
Tokenanda
Token ID5282
Feature activation-8.90e-01

Pos logit contributions
irling+0.088
eping+0.085
ep+0.079
awed+0.077
eking+0.077
Neg logit contributions
crow-0.093
 apology-0.089
 thing-0.088
 lesson-0.088
 difference-0.087
Token is
Token ID318
Feature activation-2.47e+00

Pos logit contributions
irling+0.023
eping+0.022
ep+0.022
awed+0.022
ounced+0.022
Neg logit contributions
crow-0.015
 deed-0.015
 conclusions-0.015
 apology-0.015
 difference-0.014
Token patient
Token ID5827
Feature activation+2.51e+00

Pos logit contributions
irling+0.045
eping+0.043
awed+0.041
ep+0.040
ounced+0.040
Neg logit contributions
crow-0.063
 apology-0.061
 thing-0.061
 lesson-0.061
 difference-0.060
Token and
Token ID290
Feature activation-2.72e+00

Pos logit contributions
crow+0.065
 conclusions+0.065
 apology+0.064
 deed+0.061
 difference+0.060
Neg logit contributions
irling-0.041
eping-0.039
ep-0.038
 walking-0.038
 exactly-0.038
Token kind
Token ID1611
Feature activation+2.69e+00

Pos logit contributions
irling+0.051
eping+0.050
awed+0.048
ounced+0.047
ep+0.046
Neg logit contributions
 lesson-0.068
 thing-0.068
 apology-0.067
crow-0.065
 offer-0.065
Token.
Token ID13
Feature activation+9.74e-01

Pos logit contributions
 conclusions+0.073
 affect+0.071
 apology+0.069
 difference+0.068
crow+0.066
Neg logit contributions
 hopping-0.035
ents-0.035
awed-0.034
ocked-0.034
 exactly-0.033
Token She
Token ID1375
Feature activation-6.67e-01

Pos logit contributions
 deed+0.028
 apology+0.028
 lesson+0.027
 difference+0.027
 choice+0.027
Neg logit contributions
irling-0.013
awed-0.013
lic-0.012
 hopping-0.012
years-0.012
Token thinks
Token ID6834
Feature activation+2.17e+00

Pos logit contributions
irling+0.017
eping+0.017
awed+0.016
ep+0.016
ounced+0.016
Neg logit contributions
crow-0.012
 apology-0.012
 lesson-0.011
 thing-0.011
 difference-0.011
Token and
Token ID290
Feature activation-4.22e+00

Pos logit contributions
crow+0.032
 apology+0.030
 lesson+0.030
 difference+0.029
 conclusions+0.029
Neg logit contributions
irling-0.030
eping-0.029
awed-0.027
ep-0.027
ounced-0.027
Token powerful
Token ID3665
Feature activation+6.83e-01

Pos logit contributions
eping+0.085
irling+0.084
ep+0.076
eps+0.075
awed+0.074
Neg logit contributions
 lesson-0.102
 thing-0.100
 offer-0.098
crow-0.095
 apology-0.095
Token feeling
Token ID4203
Feature activation-3.03e+00

Pos logit contributions
crow+0.013
 conclusions+0.012
 lesson+0.012
 apology+0.012
 zone+0.012
Neg logit contributions
irling-0.016
eping-0.016
ep-0.015
awed-0.015
ounced-0.015
Token in
Token ID287
Feature activation-5.40e-01

Pos logit contributions
irling+0.056
eping+0.053
awed+0.049
ep+0.049
years+0.048
Neg logit contributions
 apology-0.076
crow-0.076
 lesson-0.075
 deed-0.075
 difference-0.074
Token her
Token ID607
Feature activation-6.87e-01

Pos logit contributions
irling+0.010
eping+0.009
awed+0.008
ep+0.008
years+0.008
Neg logit contributions
crow-0.014
 apology-0.014
 lesson-0.014
 difference-0.013
 zone-0.013
Token heart
Token ID2612
Feature activation-2.40e+00

Pos logit contributions
irling+0.016
eping+0.015
awed+0.015
ep+0.014
 walking+0.014
Neg logit contributions
crow-0.014
 lesson-0.013
 apology-0.013
 conclusions-0.013
 zone-0.013
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
irling+0.069
eping+0.066
ounced+0.066
years+0.066
awed+0.065
Neg logit contributions
 thing-0.035
 lesson-0.034
crow-0.033
 deed-0.032
 difference-0.032
Token
Token ID198
Feature activation-1.72e+00

Pos logit contributions
crow+0.005
 apology+0.004
 thing+0.004
 lesson+0.004
 deed+0.004
Neg logit contributions
irling-0.005
eping-0.005
awed-0.005
ounced-0.004
ep-0.004
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
irling+0.041
eping+0.040
awed+0.038
ep+0.038
ounced+0.038
Neg logit contributions
crow-0.034
 apology-0.032
 lesson-0.032
 thing-0.032
 difference-0.032
TokenThe
Token ID464
Feature activation+2.46e-01

Pos logit contributions
irling+0.025
eping+0.024
awed+0.024
ounced+0.024
eking+0.023
Neg logit contributions
crow-0.026
 lesson-0.025
 thing-0.024
 apology-0.024
 conclusions-0.024
Token elephant
Token ID20950
Feature activation-2.44e+00

Pos logit contributions
crow+0.005
 zone+0.005
 conclusions+0.005
 affect+0.005
 deed+0.005
Neg logit contributions
irling-0.006
eping-0.005
ep-0.005
 walking-0.005
years-0.005
Token she
Token ID673
Feature activation-1.38e+00

Pos logit contributions
irling+0.046
eping+0.044
awed+0.042
ep+0.041
ounced+0.041
Neg logit contributions
crow-0.049
 apology-0.047
 lesson-0.047
 thing-0.047
 difference-0.046
Token found
Token ID1043
Feature activation-2.87e+00

Pos logit contributions
irling+0.031
eking+0.029
lic+0.029
years+0.028
eping+0.028
Neg logit contributions
crow-0.030
 thing-0.030
 am-0.029
 end-0.029
 lesson-0.028
Token a
Token ID257
Feature activation-2.91e+00

Pos logit contributions
irling+0.054
eping+0.050
awed+0.047
ep+0.047
ounced+0.047
Neg logit contributions
crow-0.074
 thing-0.072
 lesson-0.071
 apology-0.071
 difference-0.070
Token wounded
Token ID10657
Feature activation-1.23e+00

Pos logit contributions
irling+0.058
eping+0.055
awed+0.053
ep+0.052
ounced+0.051
Neg logit contributions
 thing-0.069
crow-0.069
 lesson-0.068
 difference-0.068
 apology-0.067
Token animal
Token ID5044
Feature activation-1.40e+00

Pos logit contributions
 hopping+0.026
eping+0.026
ep+0.026
awed+0.026
ounced+0.025
Neg logit contributions
 apology-0.024
 lesson-0.023
crow-0.023
 conclusions-0.023
 pleasures-0.023
Token in
Token ID287
Feature activation+1.93e-01

Pos logit contributions
irling+0.028
eping+0.027
awed+0.025
ep+0.025
ounced+0.025
Neg logit contributions
crow-0.033
 apology-0.033
 lesson-0.032
 difference-0.032
 deed-0.032
Token the
Token ID262
Feature activation-5.66e-02

Pos logit contributions
crow+0.005
 apology+0.005
 zone+0.005
 difference+0.005
 deed+0.005
Neg logit contributions
eping-0.003
irling-0.003
ep-0.003
 walking-0.003
awed-0.003
Token forest
Token ID8222
Feature activation-2.99e+00

Pos logit contributions
irling+0.001
eping+0.001
awed+0.001
ep+0.001
ounced+0.001
Neg logit contributions
 apology-0.001
crow-0.001
 lesson-0.001
 deed-0.001
 zone-0.001
Token and
Token ID290
Feature activation-4.16e+00

Pos logit contributions
irling+0.083
eping+0.080
awed+0.077
ounced+0.076
ep+0.076
Neg logit contributions
crow-0.049
 thing-0.047
 lesson-0.046
 apology-0.046
 difference-0.045
Token she
Token ID673
Feature activation-1.02e+00

Pos logit contributions
irling+0.084
eping+0.080
ep+0.074
awed+0.073
lic+0.072
Neg logit contributions
crow-0.103
 thing-0.100
 lesson-0.099
 apology-0.097
 difference-0.095
Token decided
Token ID3066
Feature activation-4.41e-01

Pos logit contributions
irling+0.028
eping+0.026
aj+0.025
lic+0.024
eking+0.024
Neg logit contributions
 thing-0.017
 lesson-0.017
 end-0.017
crow-0.016
 am-0.015
Token the
Token ID262
Feature activation-5.41e-01

Pos logit contributions
eping+0.004
irling+0.004
ep+0.003
 walking+0.003
eking+0.003
Neg logit contributions
crow-0.004
 apology-0.003
 difference-0.003
 lesson-0.003
 deed-0.003
Token lake
Token ID13546
Feature activation-2.89e+00

Pos logit contributions
ep+0.012
irling+0.012
eping+0.012
 walking+0.011
eps+0.011
Neg logit contributions
 apology-0.011
crow-0.011
 deed-0.010
 zone-0.010
 lesson-0.010
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
irling+0.071
eping+0.068
awed+0.066
ounced+0.065
ep+0.064
Neg logit contributions
crow-0.056
 thing-0.054
 lesson-0.053
 apology-0.053
 difference-0.053
Token It
Token ID632
Feature activation-1.34e-01

Pos logit contributions
crow+0.000
 apology+0.000
 thing+0.000
 lesson+0.000
 difference+0.000
Neg logit contributions
irling-0.001
eping-0.000
awed-0.000
ep-0.000
ounced-0.000
Token was
Token ID373
Feature activation-5.39e+00

Pos logit contributions
irling+0.003
eping+0.003
years+0.003
awed+0.003
eking+0.003
Neg logit contributions
crow-0.003
 thing-0.003
 lesson-0.003
 end-0.003
 difference-0.002
Token so
Token ID523
Feature activation-1.23e+00

Pos logit contributions
irling+0.120
awed+0.116
years+0.107
eking+0.105
ounced+0.105
Neg logit contributions
 thing-0.124
 lesson-0.123
 end-0.116
crow-0.115
 apology-0.110
Token colorful
Token ID20239
Feature activation-1.74e+00

Pos logit contributions
irling+0.026
awed+0.026
years+0.026
eking+0.026
eping+0.025
Neg logit contributions
 lesson-0.029
 thing-0.028
crow-0.026
 apology-0.025
 am-0.025
Token and
Token ID290
Feature activation-3.83e+00

Pos logit contributions
irling+0.037
eping+0.036
awed+0.034
ep+0.034
ounced+0.034
Neg logit contributions
crow-0.039
 thing-0.038
 lesson-0.038
 apology-0.037
 difference-0.037
Token shiny
Token ID22441
Feature activation-1.76e+00

Pos logit contributions
irling+0.080
awed+0.076
eping+0.074
ounced+0.073
eking+0.071
Neg logit contributions
 thing-0.093
 lesson-0.091
crow-0.088
 difference-0.084
 end-0.083
Token!
Token ID0
Feature activation+1.53e-01

Pos logit contributions
irling+0.040
eping+0.039
awed+0.038
ep+0.037
ounced+0.036
Neg logit contributions
crow-0.037
 apology-0.036
 deed-0.035
 lesson-0.035
 difference-0.035
Token
Token ID198
Feature activation-2.39e+00

Pos logit contributions
crow+0.003
 apology+0.003
 difference+0.003
 deed+0.003
 lesson+0.003
Neg logit contributions
irling-0.004
eping-0.004
awed-0.004
lic-0.003
ounced-0.003
Token
Token ID198
Feature activation-3.58e-01

Pos logit contributions
irling+0.019
eping+0.018
awed+0.018
ep+0.018
ounced+0.017
Neg logit contributions
crow-0.016
 thing-0.015
 deed-0.015
 difference-0.015
 apology-0.015
TokenM
Token ID44
Feature activation+2.31e+00

Pos logit contributions
irling+0.007
eping+0.007
awed+0.007
eking+0.006
ounced+0.006
Neg logit contributions
crow-0.009
 thing-0.008
 lesson-0.008
 apology-0.008
 end-0.008
Tokenia
Token ID544
Feature activation-1.21e+00

Pos logit contributions
crow+0.049
 apology+0.048
 thing+0.047
 lesson+0.047
 deed+0.046
Neg logit contributions
irling-0.054
eping-0.051
awed-0.049
years-0.048
ep-0.048
Token follows
Token ID5679
Feature activation-1.46e+00

Pos logit contributions
irling+0.030
eping+0.029
ep+0.027
lic+0.027
awed+0.027
Neg logit contributions
crow-0.023
 apology-0.022
 thing-0.022
 lesson-0.022
 difference-0.022
Token him
Token ID683
Feature activation-9.73e-01

Pos logit contributions
 walking+0.023
irling+0.022
eping+0.022
years+0.021
eking+0.021
Neg logit contributions
 difference-0.040
 lesson-0.039
 affect-0.039
crow-0.038
 pleasures-0.038
Token.
Token ID13
Feature activation+8.79e-01

Pos logit contributions
irling+0.020
eping+0.019
 walking+0.019
eking+0.019
ep+0.019
Neg logit contributions
crow-0.022
 conclusions-0.021
 apology-0.021
 difference-0.021
 deed-0.021
Token She
Token ID1375
Feature activation-1.24e+00

Pos logit contributions
crow+0.020
 apology+0.020
 lesson+0.020
 deed+0.020
 thing+0.020
Neg logit contributions
irling-0.018
eping-0.017
awed-0.017
ounced-0.016
ep-0.016
Token feels
Token ID5300
Feature activation-4.73e+00

Pos logit contributions
irling+0.033
eping+0.032
lic+0.031
years+0.031
ep+0.030
Neg logit contributions
crow-0.021
 apology-0.021
 difference-0.020
 lesson-0.020
 end-0.020
Token sorry
Token ID7926
Feature activation+1.86e+00

Pos logit contributions
irling+0.076
eping+0.073
awed+0.066
ep+0.065
ounced+0.064
Neg logit contributions
crow-0.128
 deed-0.127
 lesson-0.125
 zone-0.125
 apology-0.125
Token for
Token ID329
Feature activation+3.77e+00

Pos logit contributions
crow+0.047
 deed+0.043
 zone+0.043
 apology+0.043
 conclusions+0.043
Neg logit contributions
irling-0.034
eping-0.033
 exactly-0.031
ep-0.031
eking-0.031
Token him
Token ID683
Feature activation+1.03e+00

Pos logit contributions
 thing+0.076
crow+0.075
 apology+0.074
 end+0.073
 conclusions+0.072
Neg logit contributions
irling-0.089
eping-0.085
awed-0.084
ounced-0.083
lic-0.083
Token a
Token ID257
Feature activation-2.23e+00

Pos logit contributions
irling+0.013
awed+0.012
itching+0.011
rolling+0.011
lic+0.011
Neg logit contributions
 payoff-0.011
 lesson-0.010
 thing-0.010
 offer-0.010
 value-0.010
Token big
Token ID1263
Feature activation-1.32e+00

Pos logit contributions
irling+0.053
eping+0.050
awed+0.050
years+0.049
eking+0.048
Neg logit contributions
 thing-0.046
 lesson-0.045
 difference-0.045
 apology-0.045
crow-0.044
Token smile
Token ID8212
Feature activation-1.18e+00

Pos logit contributions
irling+0.034
eping+0.033
awed+0.032
lic+0.032
ounced+0.032
Neg logit contributions
 thing-0.024
crow-0.024
 apology-0.024
 difference-0.023
 lesson-0.023
Token on
Token ID319
Feature activation-2.65e-01

Pos logit contributions
irling+0.025
eping+0.023
ounced+0.023
awed+0.023
years+0.022
Neg logit contributions
crow-0.027
 thing-0.026
 lesson-0.025
 zone-0.025
 apology-0.025
Token her
Token ID607
Feature activation-5.12e-01

Pos logit contributions
irling+0.006
eping+0.006
ounced+0.005
awed+0.005
lic+0.005
Neg logit contributions
crow-0.006
 apology-0.006
 thing-0.006
 lesson-0.006
 deed-0.006
Token face
Token ID1986
Feature activation-1.05e+00

Pos logit contributions
irling+0.010
eping+0.010
ep+0.009
eking+0.009
awed+0.009
Neg logit contributions
crow-0.013
 difference-0.012
 lesson-0.012
 deed-0.011
 thing-0.011
Token.
Token ID13
Feature activation+7.47e-01

Pos logit contributions
irling+0.030
eping+0.029
ounced+0.028
awed+0.028
ep+0.027
Neg logit contributions
crow-0.016
 thing-0.016
 lesson-0.015
 apology-0.015
 deed-0.015
Token She
Token ID1375
Feature activation-1.19e+00

Pos logit contributions
crow+0.017
 deed+0.017
 apology+0.017
 difference+0.017
 lesson+0.017
Neg logit contributions
irling-0.014
awed-0.014
lic-0.014
eping-0.014
ounced-0.013
Token hugged
Token ID41130
Feature activation-2.34e+00

Pos logit contributions
irling+0.030
eking+0.029
eping+0.028
itching+0.028
leys+0.026
Neg logit contributions
 am-0.020
crow-0.020
 thing-0.020
 lesson-0.020
 end-0.020
Token him
Token ID683
Feature activation-1.40e+00

Pos logit contributions
eping+0.033
ep+0.032
 walking+0.032
irling+0.032
awed+0.031
Neg logit contributions
 affect-0.064
 lesson-0.063
 difference-0.063
crow-0.063
 apology-0.062
Token and
Token ID290
Feature activation-3.87e+00

Pos logit contributions
irling+0.034
eping+0.032
awed+0.031
ounced+0.031
ep+0.031
Neg logit contributions
crow-0.028
 apology-0.027
 thing-0.027
 lesson-0.027
 difference-0.026
Token the
Token ID262
Feature activation+6.99e-01

Pos logit contributions
irling+0.036
 walking+0.033
 hopping+0.032
eping+0.032
 exactly+0.032
Neg logit contributions
 apology-0.042
crow-0.041
 conclusions-0.039
 zone-0.039
 lesson-0.038
Token fox
Token ID21831
Feature activation-1.65e+00

Pos logit contributions
crow+0.012
 apology+0.012
 conclusions+0.011
 difference+0.011
 lesson+0.011
Neg logit contributions
irling-0.018
eping-0.018
ep-0.017
ounced-0.017
awed-0.017
Token and
Token ID290
Feature activation-3.64e+00

Pos logit contributions
irling+0.030
eping+0.029
awed+0.027
ep+0.027
ounced+0.027
Neg logit contributions
crow-0.042
 apology-0.041
 thing-0.040
 lesson-0.040
 difference-0.040
Token the
Token ID262
Feature activation+6.36e-01

Pos logit contributions
 hopping+0.069
 walking+0.067
eping+0.066
irling+0.066
ep+0.065
Neg logit contributions
 apology-0.085
 conclusions-0.080
crow-0.079
 difference-0.079
 zone-0.078
Token rabbit
Token ID22746
Feature activation-1.94e+00

Pos logit contributions
crow+0.012
 apology+0.011
 conclusions+0.011
 difference+0.011
 zone+0.011
Neg logit contributions
irling-0.015
eping-0.015
ep-0.015
ounced-0.014
 hopping-0.014
Token were
Token ID547
Feature activation-4.99e+00

Pos logit contributions
irling+0.035
eping+0.034
awed+0.032
ep+0.032
ounced+0.031
Neg logit contributions
crow-0.049
 apology-0.048
 thing-0.048
 lesson-0.047
 difference-0.047
Token always
Token ID1464
Feature activation-1.62e+00

Pos logit contributions
irling+0.110
eping+0.107
awed+0.104
ounced+0.102
lic+0.101
Neg logit contributions
crow-0.107
 lesson-0.103
 thing-0.103
 difference-0.102
 deed-0.101
Token happy
Token ID3772
Feature activation+2.12e+00

Pos logit contributions
irling+0.028
lic+0.028
awed+0.027
ents+0.026
ounced+0.026
Neg logit contributions
 thing-0.043
 difference-0.040
crow-0.040
 lesson-0.039
 end-0.039
Token to
Token ID284
Feature activation+3.20e+00

Pos logit contributions
crow+0.051
 apology+0.049
 lesson+0.048
 conclusions+0.047
 thing+0.047
Neg logit contributions
irling-0.043
eping-0.040
awed-0.039
ounced-0.038
ep-0.038
Token share
Token ID2648
Feature activation+2.85e+00

Pos logit contributions
crow+0.060
 zone+0.056
 lesson+0.054
 apology+0.054
 pleasures+0.053
Neg logit contributions
irling-0.079
eping-0.076
ep-0.075
 walking-0.073
 hopping-0.071
Token with
Token ID351
Feature activation+3.32e+00

Pos logit contributions
crow+0.065
 apology+0.061
 lesson+0.059
 deed+0.059
 zone+0.059
Neg logit contributions
irling-0.062
eping-0.060
ep-0.057
awed-0.057
ounced-0.054
Token candy
Token ID18550
Feature activation+9.17e-01

Pos logit contributions
 apology+0.055
crow+0.055
 lesson+0.053
 deed+0.052
 difference+0.052
Neg logit contributions
irling-0.052
eping-0.051
awed-0.048
ep-0.048
 walking-0.048
Token!
Token ID0
Feature activation+9.58e-01

Pos logit contributions
crow+0.021
 apology+0.021
 deed+0.020
 difference+0.020
 lesson+0.020
Neg logit contributions
irling-0.019
eping-0.018
awed-0.017
ep-0.017
ounced-0.017
Token<|endoftext|>
Token ID50256
Feature activation+1.06e+00

Pos logit contributions
 thing+0.028
 affect+0.027
 apology+0.027
 conclusions+0.027
 pleasures+0.027
Neg logit contributions
 walking-0.011
ents-0.011
lic-0.011
 hopping-0.011
 exactly-0.011
TokenTom
Token ID13787
Feature activation-5.02e-02

Pos logit contributions
 apology+0.026
 deed+0.025
 lesson+0.024
 difference+0.024
 conclusions+0.024
Neg logit contributions
awed-0.020
irling-0.020
lic-0.020
 walking-0.019
eping-0.019
Tokenmy
Token ID1820
Feature activation+9.30e-02

Pos logit contributions
irling+0.001
eping+0.001
ep+0.001
awed+0.001
ounced+0.001
Neg logit contributions
crow-0.001
 apology-0.001
 lesson-0.001
 conclusions-0.001
 deed-0.001
Token was
Token ID373
Feature activation-4.77e+00

Pos logit contributions
crow+0.002
 thing+0.002
 lesson+0.002
 difference+0.002
 apology+0.002
Neg logit contributions
irling-0.002
eping-0.002
awed-0.002
ounced-0.002
lic-0.002
Token a
Token ID257
Feature activation-2.31e+00

Pos logit contributions
lic+0.088
awed+0.086
ounced+0.085
irling+0.083
ents+0.082
Neg logit contributions
 thing-0.112
crow-0.108
 end-0.108
 day-0.106
 lesson-0.106
Token 3
Token ID513
Feature activation-1.56e+00

Pos logit contributions
irling+0.045
eping+0.044
awed+0.043
ounced+0.043
eking+0.042
Neg logit contributions
 thing-0.055
crow-0.055
 difference-0.054
 lesson-0.053
 end-0.053
Token year
Token ID614
Feature activation-2.99e+00

Pos logit contributions
irling+0.030
eping+0.029
ep+0.027
awed+0.027
ounced+0.026
Neg logit contributions
crow-0.038
 apology-0.038
 lesson-0.037
 difference-0.037
 deed-0.037
Token old
Token ID1468
Feature activation-9.59e-01

Pos logit contributions
irling+0.082
eping+0.079
awed+0.078
ounced+0.077
ep+0.075
Neg logit contributions
crow-0.051
 lesson-0.048
 thing-0.048
 apology-0.048
 difference-0.047
Token boy
Token ID2933
Feature activation-1.51e+00

Pos logit contributions
irling+0.025
eping+0.024
ep+0.023
awed+0.023
 walking+0.023
Neg logit contributions
 apology-0.016
 lesson-0.016
crow-0.016
 conclusions-0.016
 difference-0.015
Token more
Token ID517
Feature activation+1.78e+00

Pos logit contributions
irling+0.083
eping+0.079
lic+0.075
ounced+0.074
awed+0.073
Neg logit contributions
 lesson-0.104
 thing-0.104
crow-0.103
 apology-0.102
 end-0.101
Token caul
Token ID46087
Feature activation-1.95e+00

Pos logit contributions
crow+0.037
 apology+0.036
 lesson+0.035
 thing+0.035
 difference+0.035
Neg logit contributions
irling-0.041
eping-0.040
awed-0.038
ep-0.038
ounced-0.038
Tokeniflower
Token ID42642
Feature activation+1.20e-01

Pos logit contributions
irling+0.039
eping+0.037
ounced+0.036
awed+0.036
lic+0.036
Neg logit contributions
crow-0.044
 lesson-0.044
 thing-0.044
 deed-0.043
 apology-0.043
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
crow+0.003
 apology+0.003
 difference+0.003
 conclusions+0.003
 deed+0.002
Neg logit contributions
irling-0.003
awed-0.002
eping-0.002
ep-0.002
ounced-0.002
Token She
Token ID1375
Feature activation-8.68e-01

Pos logit contributions
crow+0.034
 deed+0.033
 apology+0.033
 thing+0.033
 difference+0.032
Neg logit contributions
irling-0.028
awed-0.027
eping-0.025
ounced-0.025
lic-0.025
Token smiled
Token ID13541
Feature activation-4.54e+00

Pos logit contributions
irling+0.017
eping+0.017
eking+0.016
lic+0.016
years+0.015
Neg logit contributions
 lesson-0.020
crow-0.020
 thing-0.020
 end-0.020
 apology-0.020
Token and
Token ID290
Feature activation-3.97e+00

Pos logit contributions
irling+0.112
eping+0.109
ounced+0.103
eking+0.102
awed+0.101
Neg logit contributions
 thing-0.094
 lesson-0.089
 end-0.088
 deed-0.086
 difference-0.086
Token said
Token ID531
Feature activation-3.11e-01

Pos logit contributions
irling+0.083
eping+0.081
rolling+0.076
eking+0.075
years+0.073
Neg logit contributions
 lesson-0.090
 thing-0.089
 offer-0.084
 apology-0.084
crow-0.084
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
irling+0.006
eping+0.006
awed+0.006
ounced+0.006
ep+0.006
Neg logit contributions
crow-0.007
 apology-0.007
 lesson-0.007
 thing-0.007
 difference-0.007
Token "
Token ID366
Feature activation+2.44e+00

Pos logit contributions
eping+0.053
irling+0.053
ep+0.049
ounced+0.049
awed+0.048
Neg logit contributions
crow-0.043
 lesson-0.043
 apology-0.043
 thing-0.043
 difference-0.041
TokenThis
Token ID1212
Feature activation+7.48e+00

Pos logit contributions
 apology+0.062
 choice+0.058
 difference+0.058
 deed+0.056
 lesson+0.056
Neg logit contributions
awed-0.045
irling-0.044
ep-0.042
lic-0.041
eping-0.041
Tokenie
Token ID494
Feature activation-2.58e+00

Pos logit contributions
irling+0.115
eping+0.099
 hopping+0.097
awed+0.097
eking+0.094
Neg logit contributions
crow-0.102
 apology-0.100
 thing-0.095
 end-0.095
 lesson-0.094
Token looked
Token ID3114
Feature activation-6.50e+00

Pos logit contributions
irling+0.054
eping+0.052
eking+0.048
years+0.047
ep+0.045
Neg logit contributions
crow-0.059
 thing-0.056
 end-0.055
 lesson-0.054
 apology-0.053
Token at
Token ID379
Feature activation-1.04e+00

Pos logit contributions
irling+0.152
eping+0.131
ep+0.130
ounced+0.128
eking+0.127
Neg logit contributions
 thing-0.137
crow-0.133
 end-0.131
 difference-0.129
 lesson-0.126
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
awed+0.014
 walking+0.014
 exactly+0.014
eping+0.013
irling+0.013
Neg logit contributions
crow-0.030
 affect-0.030
 deed-0.029
 difference-0.029
inea-0.029
Token page
Token ID2443
Feature activation-3.80e-02

Pos logit contributions
crow+0.006
 thing+0.006
 lesson+0.006
 apology+0.006
 difference+0.006
Neg logit contributions
irling-0.007
eping-0.007
awed-0.007
ounced-0.007
ep-0.007
Token and
Token ID290
Feature activation-5.72e+00

Pos logit contributions
irling+0.001
eping+0.001
ounced+0.001
ep+0.001
awed+0.001
Neg logit contributions
 thing-0.001
crow-0.001
 difference-0.001
 lesson-0.001
 end-0.001
Token revealed
Token ID4602
Feature activation-1.63e+00

Pos logit contributions
irling+0.147
eping+0.135
eking+0.127
itching+0.125
agging+0.124
Neg logit contributions
 thing-0.104
crow-0.103
 end-0.098
 offer-0.096
 payoff-0.095
Token it
Token ID340
Feature activation+1.60e+00

Pos logit contributions
irling+0.033
eping+0.031
awed+0.030
ep+0.029
ounced+0.029
Neg logit contributions
crow-0.039
 apology-0.038
 lesson-0.037
 difference-0.037
 deed-0.037
Token was
Token ID373
Feature activation-4.69e+00

Pos logit contributions
crow+0.028
 thing+0.028
 lesson+0.027
 apology+0.027
 difference+0.026
Neg logit contributions
irling-0.042
eping-0.041
ep-0.039
awed-0.039
ounced-0.038
Token ink
Token ID16882
Feature activation-2.14e+00

Pos logit contributions
irling+0.094
eping+0.088
awed+0.086
ounced+0.083
ep+0.082
Neg logit contributions
crow-0.112
 thing-0.111
 lesson-0.110
 apology-0.108
 end-0.107
Token.
Token ID13
Feature activation-1.91e-02

Pos logit contributions
irling+0.049
eping+0.047
awed+0.045
ep+0.045
ounced+0.045
Neg logit contributions
crow-0.044
 thing-0.043
 lesson-0.042
 apology-0.042
 difference-0.042
TokenThe
Token ID464
Feature activation+1.25e-01

Pos logit contributions
irling+0.030
eping+0.029
awed+0.028
ounced+0.028
eking+0.028
Neg logit contributions
crow-0.024
 thing-0.023
 lesson-0.023
 apology-0.022
 deed-0.022
Token neighbor
Token ID4780
Feature activation-4.41e-01

Pos logit contributions
crow+0.002
 lesson+0.002
 thing+0.002
 apology+0.002
 difference+0.002
Neg logit contributions
irling-0.003
eping-0.003
awed-0.003
ounced-0.003
ep-0.003
Token opened
Token ID4721
Feature activation-6.48e+00

Pos logit contributions
irling+0.010
eping+0.008
eking+0.008
itching+0.008
awed+0.008
Neg logit contributions
crow-0.011
 thing-0.010
 end-0.010
 lesson-0.009
 offer-0.009
Token the
Token ID262
Feature activation+5.68e-01

Pos logit contributions
irling+0.168
eping+0.160
ounced+0.154
awed+0.154
lic+0.152
Neg logit contributions
crow-0.119
 thing-0.115
 lesson-0.113
 apology-0.112
 difference-0.111
Token door
Token ID3420
Feature activation-3.46e+00

Pos logit contributions
crow+0.015
 conclusions+0.015
 deed+0.015
 apology+0.014
 pleasures+0.014
Neg logit contributions
eping-0.009
irling-0.009
ep-0.008
 walking-0.008
eking-0.008
Token and
Token ID290
Feature activation-5.21e+00

Pos logit contributions
irling+0.095
eping+0.087
ounced+0.085
awed+0.082
eking+0.082
Neg logit contributions
 thing-0.063
 lesson-0.060
crow-0.059
 end-0.059
 side-0.058
Token Tim
Token ID5045
Feature activation-6.40e-01

Pos logit contributions
irling+0.128
eping+0.107
itching+0.106
eking+0.105
awed+0.101
Neg logit contributions
crow-0.115
 thing-0.113
 offer-0.105
 payoff-0.104
 lesson-0.103
Tokenmy
Token ID1820
Feature activation-9.81e-01

Pos logit contributions
irling+0.021
eping+0.018
eking+0.017
ounced+0.017
awed+0.017
Neg logit contributions
 end-0.007
crow-0.007
 side-0.007
 zone-0.007
 thing-0.006
Token asked
Token ID1965
Feature activation+5.67e-01

Pos logit contributions
irling+0.025
eking+0.022
agging+0.022
eping+0.021
years+0.021
Neg logit contributions
crow-0.016
 thing-0.016
 end-0.016
 am-0.016
 offer-0.016
Token,
Token ID11
Feature activation-2.54e+00

Pos logit contributions
crow+0.014
 conclusions+0.014
 difference+0.014
 apology+0.014
 affect+0.013
Neg logit contributions
irling-0.010
eping-0.010
ep-0.009
 exactly-0.009
years-0.009
Token "
Token ID366
Feature activation+8.81e-01

Pos logit contributions
irling+0.062
eping+0.059
awed+0.057
ounced+0.056
ep+0.055
Neg logit contributions
crow-0.051
 thing-0.050
 apology-0.050
 lesson-0.049
 difference-0.047