TOP ACTIVATIONS
MAX = 5.447

als and bloom ed in bright colors - red , yellow
big reef with lots of colorful fish and coral .
, with all sorts of bright colors . The
grew into big plants with beautiful flowers . The farmer was
big reef with lots of colorful coral . Nem o wanted
she found big flowers with colorful pet als and birds picking
their home with lots of pretty things . One day ,
big pond with lots of colorful rocks . He walked closer
, beautiful garden full of colourful flowers . The child loved
was big and full of colourful fish . Tom thought it
and picked out lots of colorful flowers . They placed the
and trees and lots of green . But then something bad
. There were lots of small fishes swimming around his hands
big pond and lots of colorful flowers . M
garden were lots of beautiful colourful flowers . Every day ,
their garden with lots of colorful things . They bought some
of green and lots of tiny , colorful fish ! Molly
the garden were lots of beautiful colourful flowers . Every day
would fill the bucket with fresh water from the nearby river
and Mum picked lots of colourful flowers and made a pretty

MIN ACTIVATIONS
MIN = -4.333

it . He practiced his math problems and soon he could
every day . But she didn 't like to listen to
. He practiced riding his bicycle every day to get ready
book to practice . Lily didn 't want to do it
math homework . Tim my didn 't want to do it
ride . He would ride it every day and feel the
tried his best to do his best . He worked hard
made sure to bounce the football gently so that it wouldn
his trumpet . He plays it every day . He makes
Tim my didn 't have a real sword
muscles felt heavy . He didn 't like the feeling .
Tim my practiced playing the violin every day and soon became
much . He would ride it around town and people would
the trumpet . But he didn 't have a trumpet of
my always wore his yellow helmet when he rode his bike
the right equipment . She didn 't like this , so
ABC s . Tim my didn 't want to study yet
that he should complete his math and not lie . <|endoftext|>
phone was difficult . She didn 't know how to call
candy . But Tim my didn 't like to eat his

INTERVAL 3.002 - 5.447
CONTAINS 0.080%

the sky lit up with bright colors . The fire work
were apples , sandwiches and some juice . She wrapped the
grabbed a few shells and some seaw eed and built a
fridge ! Tom and his mum couldn 't believe their eyes
. The lady gave him some coins , and he ran

INTERVAL 0.557 - 3.002
CONTAINS 26.398%

The needle and the butterfly flew to the store together
So Jack and the dragon set off together . The
the fridge . It is full of food . They have
and looked out into the distance . The little girl asked
. You are in big trouble ." Tom and

INTERVAL -1.888 - 0.557
CONTAINS 71.765%

so tired , but he was also happy . He gathered
ped happily . It trusted them and let them touch it
was done swimming , he came out of the pool feeling
so happy to see something that looked just like her .
add to my collection ," he replied . The

INTERVAL -4.333 - -1.888
CONTAINS 1.756%

Hey , stop ! That 's my boss ' chocolate !"
she quickly jumped onto her bike and began to ride .
away . Lily thought it was so cool that she wanted
were even bigger , and he knew that he could do
. From then on , Lily always cut her w affles
Tokenals
Token ID874
Feature activation+2.09e+00

Pos logit contributions
 pulp+0.057
ins+0.040
Happy+0.037
ment+0.036
 pans+0.036
Neg logit contributions
 robot-0.138
 operation-0.136
 math-0.136
 mask-0.135
 Patch-0.134
Token and
Token ID290
Feature activation+2.73e+00

Pos logit contributions
 pulp+0.123
 fishes+0.107
 ge+0.105
ins+0.104
lan+0.104
Neg logit contributions
 not-0.048
zag-0.046
 homework-0.041
 operation-0.038
case-0.038
Token bloom
Token ID29955
Feature activation+3.46e+00

Pos logit contributions
oms+0.115
 pulp+0.115
ins+0.113
ment+0.109
ocol+0.108
Neg logit contributions
 not-0.098
 Max-0.084
 homework-0.081
 wouldn-0.077
 Patch-0.077
Tokened
Token ID276
Feature activation+2.21e+00

Pos logit contributions
 pulp+0.140
 whales+0.113
ocol+0.113
 pump+0.111
 ge+0.109
Neg logit contributions
zag-0.145
 not-0.144
chy-0.139
ggie-0.139
 because-0.126
Token in
Token ID287
Feature activation+2.34e+00

Pos logit contributions
 pulp+0.112
oms+0.101
ins+0.091
lan+0.091
 ge+0.090
Neg logit contributions
 math-0.062
 not-0.062
 homework-0.062
ggie-0.057
 because-0.057
Token bright
Token ID6016
Feature activation+5.45e+00

Pos logit contributions
 pulp+0.141
oms+0.137
unn+0.136
ins+0.133
suits+0.126
Neg logit contributions
 not-0.048
 math-0.045
 because-0.042
 Max-0.039
 x-0.037
Token colors
Token ID7577
Feature activation+3.05e+00

Pos logit contributions
 pulp+0.305
iers+0.288
fed+0.279
Sorry+0.279
lan+0.275
Neg logit contributions
 my-0.110
 so-0.106
 because-0.104
 and-0.088
 Wednesday-0.088
Token -
Token ID532
Feature activation+1.12e+00

Pos logit contributions
 pulp+0.157
 ge+0.132
 pans+0.127
 islands+0.125
forts+0.124
Neg logit contributions
 not-0.094
zag-0.088
 because-0.087
ggie-0.085
 math-0.080
Token red
Token ID2266
Feature activation+2.99e+00

Pos logit contributions
 pulp+0.051
oms+0.044
ment+0.044
 fishes+0.043
 islands+0.042
Neg logit contributions
 homework-0.039
 not-0.039
 math-0.038
 operation-0.036
ggie-0.035
Token,
Token ID11
Feature activation+1.39e+00

Pos logit contributions
 pulp+0.139
Happy+0.125
ins+0.121
oms+0.119
 sailors+0.118
Neg logit contributions
 because-0.086
 math-0.085
 cushion-0.082
 not-0.082
 robot-0.077
Token yellow
Token ID7872
Feature activation+2.78e+00

Pos logit contributions
 pulp+0.060
ment+0.053
oms+0.053
 fishes+0.050
ins+0.050
Neg logit contributions
 not-0.052
 homework-0.048
 math-0.047
 operation-0.044
 robot-0.044
Token big
Token ID1263
Feature activation+1.24e+00

Pos logit contributions
 pulp+0.066
ment+0.061
als+0.059
oms+0.059
ins+0.056
Neg logit contributions
 homework-0.052
 not-0.051
 skate-0.050
 math-0.050
 robot-0.049
Token reef
Token ID25088
Feature activation+3.17e-01

Pos logit contributions
 pulp+0.047
ment+0.041
als+0.040
Happy+0.040
oms+0.039
Neg logit contributions
 homework-0.052
 skate-0.050
 operation-0.050
 math-0.050
 robot-0.050
Token with
Token ID351
Feature activation+2.31e+00

Pos logit contributions
 pulp+0.014
 fishes+0.012
oms+0.012
ment+0.012
 banners+0.011
Neg logit contributions
 homework-0.012
ggie-0.011
 math-0.011
 operation-0.011
 not-0.011
Token lots
Token ID6041
Feature activation+3.11e+00

Pos logit contributions
oms+0.109
 pulp+0.107
ment+0.103
ins+0.100
omed+0.097
Neg logit contributions
 not-0.074
 math-0.063
 skate-0.059
 robot-0.057
 balancing-0.057
Token of
Token ID286
Feature activation+4.11e+00

Pos logit contributions
 pulp+0.168
women+0.143
Happy+0.138
owers+0.138
use+0.137
Neg logit contributions
 because-0.083
 math-0.081
 not-0.076
 balancing-0.071
 skate-0.069
Token colorful
Token ID20239
Feature activation+5.28e+00

Pos logit contributions
 pulp+0.165
oms+0.156
ment+0.154
als+0.153
use+0.150
Neg logit contributions
 not-0.136
 really-0.135
 especially-0.134
 his-0.134
 math-0.134
Token fish
Token ID5916
Feature activation+2.28e+00

Pos logit contributions
 pulp+0.188
forts+0.187
Sorry+0.186
Happy+0.185
prise+0.184
Neg logit contributions
 weapon-0.167
 because-0.166
 mask-0.165
 math-0.164
 hockey-0.162
Token and
Token ID290
Feature activation+2.59e+00

Pos logit contributions
 pulp+0.112
Happy+0.099
ment+0.096
 banners+0.095
oms+0.093
Neg logit contributions
 not-0.073
 skate-0.069
zag-0.068
 homework-0.067
 balancing-0.064
Token coral
Token ID29537
Feature activation+3.93e+00

Pos logit contributions
oms+0.093
 pulp+0.089
omed+0.083
ment+0.078
ins+0.078
Neg logit contributions
 not-0.116
 math-0.103
 skate-0.103
 homework-0.102
 robot-0.099
Token.
Token ID13
Feature activation+6.19e-01

Pos logit contributions
 pulp+0.181
Happy+0.167
ment+0.154
 banners+0.153
forts+0.153
Neg logit contributions
 not-0.137
 balancing-0.112
 because-0.110
zag-0.109
 skate-0.109
Token
Token ID198
Feature activation+7.57e-01

Pos logit contributions
 pulp+0.035
oms+0.032
 fishes+0.031
ment+0.031
 shells+0.031
Neg logit contributions
 homework-0.015
 not-0.014
 math-0.014
ggie-0.014
 operation-0.013
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
 homework+0.107
 x+0.097
 operation+0.095
 math+0.095
 skate+0.094
Neg logit contributions
als-0.066
 pulp-0.060
ment-0.058
lan-0.057
 fishes-0.057
Token with
Token ID351
Feature activation+1.61e+00

Pos logit contributions
 homework+0.051
zag+0.049
 operation+0.047
ggie+0.046
isk+0.045
Neg logit contributions
 pulp-0.065
 shells-0.060
 fishes-0.060
 flags-0.058
 banners-0.057
Token all
Token ID477
Feature activation+7.59e-01

Pos logit contributions
 pulp+0.080
oms+0.075
ment+0.071
ins+0.070
suits+0.069
Neg logit contributions
 not-0.048
 homework-0.045
 math-0.044
ggie-0.042
 x-0.039
Token sorts
Token ID10524
Feature activation+2.39e+00

Pos logit contributions
 pulp+0.033
oms+0.029
 fishes+0.028
ment+0.028
als+0.027
Neg logit contributions
 homework-0.028
 not-0.027
 math-0.027
ggie-0.026
 operation-0.026
Token of
Token ID286
Feature activation+2.95e+00

Pos logit contributions
 pulp+0.118
 ge+0.097
 pans+0.092
suits+0.092
 islands+0.090
Neg logit contributions
 not-0.082
 math-0.076
 because-0.074
ggie-0.074
 homework-0.069
Token bright
Token ID6016
Feature activation+5.22e+00

Pos logit contributions
 pulp+0.136
oms+0.126
als+0.118
Sorry+0.116
women+0.115
Neg logit contributions
 math-0.097
 not-0.092
 homework-0.089
 x-0.086
ggie-0.085
Token colors
Token ID7577
Feature activation+2.64e+00

Pos logit contributions
Sorry+0.230
 pulp+0.219
women+0.198
icker+0.195
We+0.194
Neg logit contributions
 because-0.145
 er-0.143
 math-0.142
 just-0.137
 balancing-0.137
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 pulp+0.145
 ge+0.127
Happy+0.123
lan+0.120
 whales+0.119
Neg logit contributions
 not-0.067
 math-0.063
zag-0.063
 homework-0.059
 because-0.059
Token
Token ID198
Feature activation+4.73e-01

Pos logit contributions
 pulp+0.012
 fishes+0.010
oms+0.010
ment+0.010
 flags+0.010
Neg logit contributions
 homework-0.010
ggie-0.009
 math-0.009
 operation-0.009
 not-0.009
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 pulp+0.020
 fishes+0.017
oms+0.017
ment+0.017
 banners+0.017
Neg logit contributions
 homework-0.018
ggie-0.018
 not-0.017
 math-0.017
zag-0.017
TokenThe
Token ID464
Feature activation+8.47e-01

Pos logit contributions
 homework+0.005
 operation+0.005
zag+0.004
 math+0.004
 x+0.004
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
ment-0.004
 banners-0.004
Token grew
Token ID6348
Feature activation+2.16e+00

Pos logit contributions
 pulp+0.101
lan+0.097
 banners+0.094
ment+0.092
 flags+0.090
Neg logit contributions
 not-0.106
zag-0.092
 wouldn-0.090
 always-0.089
case-0.089
Token into
Token ID656
Feature activation+1.74e+00

Pos logit contributions
 pulp+0.110
 banners+0.106
oms+0.104
 flags+0.101
Happy+0.101
Neg logit contributions
 not-0.065
 because-0.050
chy-0.050
 homework-0.050
 math-0.046
Token big
Token ID1263
Feature activation+4.15e+00

Pos logit contributions
oms+0.091
 pulp+0.086
als+0.085
 banners+0.080
suits+0.079
Neg logit contributions
 not-0.050
 math-0.047
 homework-0.046
 x-0.044
 operation-0.041
Token plants
Token ID6134
Feature activation+2.10e+00

Pos logit contributions
lan+0.221
Happy+0.219
izz+0.219
ins+0.212
ment+0.211
Neg logit contributions
 because-0.090
 so-0.071
 math-0.066
 not-0.065
 bone-0.062
Token with
Token ID351
Feature activation+3.10e+00

Pos logit contributions
 pulp+0.099
Happy+0.086
ment+0.085
 banners+0.085
oms+0.084
Neg logit contributions
 not-0.074
 homework-0.067
 because-0.065
zag-0.064
 math-0.062
Token beautiful
Token ID4950
Feature activation+5.19e+00

Pos logit contributions
oms+0.167
ins+0.153
unn+0.151
suits+0.150
ucks+0.147
Neg logit contributions
 not-0.097
 only-0.069
 no-0.069
 his-0.068
 math-0.068
Token flowers
Token ID12734
Feature activation+2.54e+00

Pos logit contributions
prise+0.198
Sorry+0.197
izz+0.196
oms+0.193
 cheered+0.189
Neg logit contributions
 because-0.165
 just-0.163
 not-0.160
 math-0.150
 always-0.149
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
 pulp+0.120
Happy+0.104
 banners+0.104
 fishes+0.104
ment+0.102
Neg logit contributions
 not-0.089
 because-0.082
 homework-0.076
zag-0.075
chy-0.074
Token The
Token ID383
Feature activation+1.80e+00

Pos logit contributions
 pulp+0.001
 fishes+0.001
oms+0.001
ment+0.001
 flags+0.001
Neg logit contributions
 homework-0.001
ggie-0.001
zag-0.001
 operation-0.001
 math-0.001
Token farmer
Token ID18739
Feature activation+8.11e-01

Pos logit contributions
oms+0.079
ment+0.076
 pulp+0.076
 cheered+0.073
Happy+0.070
Neg logit contributions
 not-0.063
 operation-0.063
 homework-0.060
 robber-0.060
 shelf-0.055
Token was
Token ID373
Feature activation-2.94e-01

Pos logit contributions
 pulp+0.036
oms+0.032
ment+0.031
 fishes+0.031
 banners+0.031
Neg logit contributions
 homework-0.029
 not-0.028
ggie-0.028
zag-0.027
 operation-0.027
Token big
Token ID1263
Feature activation+2.59e-01

Pos logit contributions
 homework+0.001
ggie+0.001
 operation+0.001
 math+0.001
zag+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.000
 flags-0.000
 ge-0.000
Token reef
Token ID25088
Feature activation+9.13e-01

Pos logit contributions
 pulp+0.010
 fishes+0.008
oms+0.008
ment+0.008
 banners+0.008
Neg logit contributions
 homework-0.011
ggie-0.010
 math-0.010
 operation-0.010
 not-0.010
Token with
Token ID351
Feature activation+2.25e+00

Pos logit contributions
 pulp+0.043
 fishes+0.036
oms+0.036
 banners+0.036
ment+0.036
Neg logit contributions
 homework-0.032
 not-0.032
 math-0.030
zag-0.030
ggie-0.029
Token lots
Token ID6041
Feature activation+3.37e+00

Pos logit contributions
oms+0.120
ment+0.112
omed+0.111
 pulp+0.111
use+0.108
Neg logit contributions
 not-0.062
 math-0.054
 robot-0.048
 homework-0.046
 his-0.045
Token of
Token ID286
Feature activation+3.71e+00

Pos logit contributions
 pulp+0.190
women+0.168
use+0.165
Sorry+0.162
Happy+0.159
Neg logit contributions
 because-0.079
 math-0.076
 bone-0.069
 not-0.067
 about-0.064
Token colorful
Token ID20239
Feature activation+5.13e+00

Pos logit contributions
 pulp+0.158
oms+0.154
omed+0.152
use+0.149
ment+0.149
Neg logit contributions
 math-0.123
 not-0.118
 his-0.116
 really-0.113
 just-0.109
Token coral
Token ID29537
Feature activation+3.78e+00

Pos logit contributions
ount+0.195
izz+0.193
forts+0.190
Sorry+0.186
use+0.184
Neg logit contributions
 math-0.170
 mask-0.161
 hockey-0.158
 robot-0.156
 weapon-0.155
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
 pulp+0.189
Happy+0.167
Sorry+0.165
ocol+0.165
 banners+0.163
Neg logit contributions
 not-0.116
 because-0.101
 mask-0.092
zag-0.089
 balancing-0.088
Token Nem
Token ID22547
Feature activation-1.31e-01

Pos logit contributions
 pulp+0.068
ment+0.061
oms+0.061
 banners+0.059
 flags+0.058
Neg logit contributions
 not-0.048
 homework-0.043
 skate-0.042
 Max-0.041
 math-0.041
Tokeno
Token ID78
Feature activation+5.06e-01

Pos logit contributions
 homework+0.005
 operation+0.004
 math+0.004
ggie+0.004
 x+0.004
Neg logit contributions
 pulp-0.006
 fishes-0.005
oms-0.005
ment-0.005
 ge-0.005
Token wanted
Token ID2227
Feature activation+3.57e-01

Pos logit contributions
 pulp+0.022
oms+0.019
ment+0.019
 fishes+0.019
 banners+0.018
Neg logit contributions
 homework-0.019
 not-0.018
ggie-0.018
zag-0.017
 math-0.017
Token she
Token ID673
Feature activation+4.38e-01

Pos logit contributions
 homework+0.007
zag+0.006
 operation+0.006
ggie+0.006
 skate+0.006
Neg logit contributions
 pulp-0.009
 fishes-0.008
ment-0.008
oms-0.008
als-0.007
Token found
Token ID1043
Feature activation+1.05e+00

Pos logit contributions
 pulp+0.014
 fishes+0.011
oms+0.011
ment+0.010
 banners+0.010
Neg logit contributions
 homework-0.022
 not-0.021
ggie-0.021
zag-0.021
 operation-0.021
Token big
Token ID1263
Feature activation+1.26e+00

Pos logit contributions
 pulp+0.059
oms+0.053
 banners+0.051
 fishes+0.051
 flags+0.050
Neg logit contributions
 not-0.026
 homework-0.025
ggie-0.024
 math-0.024
 operation-0.023
Token flowers
Token ID12734
Feature activation+1.24e+00

Pos logit contributions
 pulp+0.053
Happy+0.046
ment+0.044
lan+0.043
oms+0.042
Neg logit contributions
 homework-0.048
 math-0.047
 operation-0.046
 x-0.045
 not-0.045
Token with
Token ID351
Feature activation+2.47e+00

Pos logit contributions
 pulp+0.063
 fishes+0.054
Happy+0.053
ment+0.053
 banners+0.053
Neg logit contributions
 not-0.038
 homework-0.037
zag-0.035
 math-0.034
ggie-0.034
Token colorful
Token ID20239
Feature activation+5.09e+00

Pos logit contributions
oms+0.111
 pulp+0.110
ment+0.105
ins+0.102
Happy+0.100
Neg logit contributions
 not-0.080
 math-0.073
 homework-0.070
ggie-0.065
 Patch-0.063
Token pet
Token ID4273
Feature activation+3.85e+00

Pos logit contributions
Sorry+0.195
 pulp+0.188
Happy+0.182
ment+0.177
We+0.177
Neg logit contributions
 math-0.166
 er-0.162
 cushion-0.151
 because-0.150
 cord-0.143
Tokenals
Token ID874
Feature activation+2.61e+00

Pos logit contributions
 pulp+0.115
Happy+0.088
ins+0.083
 pans+0.079
 lighthouse+0.078
Neg logit contributions
 because-0.180
 Max-0.171
 robot-0.170
 math-0.170
 Patch-0.170
Token and
Token ID290
Feature activation+2.20e+00

Pos logit contributions
 pulp+0.134
Happy+0.115
 fishes+0.112
 ge+0.111
 banners+0.111
Neg logit contributions
 not-0.082
zag-0.074
 because-0.072
 homework-0.069
chy-0.068
Token birds
Token ID10087
Feature activation+2.79e+00

Pos logit contributions
 pulp+0.098
oms+0.095
ins+0.090
ment+0.085
Happy+0.085
Neg logit contributions
 not-0.080
 homework-0.073
 math-0.070
ggie-0.065
 mask-0.063
Token picking
Token ID10868
Feature activation+3.09e+00

Pos logit contributions
 pulp+0.132
Happy+0.113
ment+0.111
 fishes+0.109
oms+0.108
Neg logit contributions
 not-0.097
zag-0.094
 homework-0.086
 because-0.085
ggie-0.084
Token their
Token ID511
Feature activation+1.60e+00

Pos logit contributions
 pulp+0.074
oms+0.065
 fishes+0.059
ment+0.058
omed+0.058
Neg logit contributions
 not-0.067
 homework-0.067
 math-0.063
ggie-0.058
 because-0.055
Token home
Token ID1363
Feature activation+3.11e-01

Pos logit contributions
 pulp+0.082
oms+0.078
ment+0.077
 fishes+0.072
omed+0.072
Neg logit contributions
 homework-0.050
 math-0.046
 operation-0.043
 not-0.043
 robot-0.042
Token with
Token ID351
Feature activation+2.73e+00

Pos logit contributions
 pulp+0.015
 fishes+0.013
oms+0.013
ment+0.013
 ge+0.013
Neg logit contributions
 homework-0.010
ggie-0.009
 math-0.009
 not-0.009
 operation-0.009
Token lots
Token ID6041
Feature activation+2.59e+00

Pos logit contributions
oms+0.129
 pulp+0.121
omed+0.121
als+0.120
ment+0.118
Neg logit contributions
 homework-0.096
 not-0.092
 math-0.090
 x-0.080
 operation-0.077
Token of
Token ID286
Feature activation+3.93e+00

Pos logit contributions
 pulp+0.139
 ge+0.109
 valleys+0.107
 whales+0.106
 islands+0.105
Neg logit contributions
 math-0.079
 homework-0.074
 because-0.072
 not-0.069
 operation-0.069
Token pretty
Token ID2495
Feature activation+5.08e+00

Pos logit contributions
plets+0.195
 pulp+0.191
ment+0.188
als+0.188
omed+0.186
Neg logit contributions
 math-0.115
 homework-0.109
 x-0.106
 favorite-0.099
 operation-0.098
Token things
Token ID1243
Feature activation+1.77e+00

Pos logit contributions
 pulp+0.216
Sorry+0.213
fed+0.212
lan+0.209
forts+0.204
Neg logit contributions
 math-0.150
 homework-0.135
 belt-0.131
 sweater-0.130
 mask-0.130
Token.
Token ID13
Feature activation+5.62e-01

Pos logit contributions
 pulp+0.083
 fishes+0.068
ment+0.067
 ge+0.067
oms+0.066
Neg logit contributions
 homework-0.067
 not-0.067
 math-0.060
 because-0.059
zag-0.059
Token One
Token ID1881
Feature activation-4.82e-01

Pos logit contributions
 pulp+0.030
oms+0.026
ment+0.026
 fishes+0.026
als+0.025
Neg logit contributions
 homework-0.017
 math-0.015
 not-0.015
 operation-0.015
ggie-0.015
Token day
Token ID1110
Feature activation+1.72e+00

Pos logit contributions
 homework+0.017
 Maxwell+0.016
 robber+0.015
 math+0.015
zag+0.015
Neg logit contributions
 pulp-0.022
 fishes-0.020
 ge-0.020
 pump-0.020
ment-0.019
Token,
Token ID11
Feature activation+5.39e-01

Pos logit contributions
 pulp+0.103
 fishes+0.090
 islands+0.089
 banners+0.088
oms+0.088
Neg logit contributions
 not-0.041
 homework-0.039
 math-0.038
ggie-0.036
zag-0.034
Token big
Token ID1263
Feature activation-8.39e-02

Pos logit contributions
 pulp+0.010
 fishes+0.009
oms+0.009
ment+0.008
 flags+0.008
Neg logit contributions
 homework-0.009
ggie-0.009
 math-0.009
 operation-0.009
 not-0.008
Token pond
Token ID16723
Feature activation+6.25e-01

Pos logit contributions
 homework+0.003
ggie+0.003
zag+0.003
 not+0.003
 operation+0.003
Neg logit contributions
 pulp-0.004
 fishes-0.003
oms-0.003
 flags-0.003
 banners-0.003
Token with
Token ID351
Feature activation+2.33e+00

Pos logit contributions
 pulp+0.032
 fishes+0.028
oms+0.028
ment+0.027
 banners+0.027
Neg logit contributions
 homework-0.019
 not-0.018
ggie-0.018
zag-0.017
 math-0.017
Token lots
Token ID6041
Feature activation+2.98e+00

Pos logit contributions
oms+0.118
 pulp+0.117
ment+0.111
ins+0.106
omed+0.105
Neg logit contributions
 not-0.067
 math-0.058
 robot-0.053
ggie-0.052
 skate-0.052
Token of
Token ID286
Feature activation+3.74e+00

Pos logit contributions
 pulp+0.164
lan+0.133
Happy+0.133
women+0.132
Sorry+0.130
Neg logit contributions
 because-0.080
 math-0.079
 not-0.071
 bone-0.068
 robot-0.067
Token colorful
Token ID20239
Feature activation+5.07e+00

Pos logit contributions
 pulp+0.153
oms+0.140
lan+0.135
ment+0.134
forts+0.131
Neg logit contributions
 not-0.129
 math-0.128
 exercise-0.124
 bone-0.124
 really-0.120
Token rocks
Token ID12586
Feature activation+2.09e+00

Pos logit contributions
Happy+0.196
 pulp+0.196
iza+0.193
forts+0.192
prise+0.191
Neg logit contributions
 bone-0.153
 math-0.151
 hockey-0.147
 exercise-0.143
 because-0.140
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 pulp+0.107
Happy+0.092
ment+0.088
 fishes+0.087
 banners+0.087
Neg logit contributions
 not-0.068
zag-0.065
 homework-0.063
 skate-0.060
ousel-0.059
Token He
Token ID679
Feature activation+6.74e-01

Pos logit contributions
 pulp+0.008
 fishes+0.007
oms+0.007
ment+0.007
 flags+0.007
Neg logit contributions
 homework-0.005
ggie-0.004
 operation-0.004
 math-0.004
zag-0.004
Token walked
Token ID6807
Feature activation-2.10e-01

Pos logit contributions
 pulp+0.025
oms+0.022
ment+0.021
 fishes+0.020
 banners+0.020
Neg logit contributions
 homework-0.029
 not-0.028
ggie-0.028
zag-0.028
 skate-0.027
Token closer
Token ID5699
Feature activation+2.29e-01

Pos logit contributions
 homework+0.008
 operation+0.008
ggie+0.008
 x+0.007
 math+0.007
Neg logit contributions
 pulp-0.009
 fishes-0.008
oms-0.008
 flags-0.008
 shells-0.008
Token,
Token ID11
Feature activation+5.49e-01

Pos logit contributions
 pulp+0.015
oms+0.013
 fishes+0.013
ment+0.013
 banners+0.013
Neg logit contributions
 homework-0.012
 operation-0.011
 math-0.011
ggie-0.011
 not-0.011
Token beautiful
Token ID4950
Feature activation+8.80e-01

Pos logit contributions
 pulp+0.024
oms+0.021
ment+0.021
 fishes+0.020
als+0.020
Neg logit contributions
 homework-0.021
 math-0.019
 operation-0.019
 not-0.019
ggie-0.019
Token garden
Token ID11376
Feature activation+2.10e+00

Pos logit contributions
 pulp+0.040
oms+0.034
 fishes+0.033
ment+0.033
als+0.032
Neg logit contributions
 homework-0.032
 math-0.030
 operation-0.030
ggie-0.030
zag-0.029
Token full
Token ID1336
Feature activation+3.48e+00

Pos logit contributions
 pulp+0.093
Happy+0.074
 fishes+0.074
ment+0.073
oms+0.071
Neg logit contributions
 homework-0.081
 not-0.078
 math-0.076
zag-0.074
ggie-0.074
Token of
Token ID286
Feature activation+3.08e+00

Pos logit contributions
 pulp+0.181
ins+0.147
 valleys+0.143
 pans+0.142
Happy+0.142
Neg logit contributions
 because-0.100
 not-0.098
 math-0.098
 robot-0.090
 operation-0.088
Token colourful
Token ID44799
Feature activation+5.07e+00

Pos logit contributions
 pulp+0.123
oms+0.114
omed+0.108
ment+0.107
lan+0.105
Neg logit contributions
 math-0.123
 homework-0.116
 not-0.114
 operation-0.107
 exercise-0.107
Token flowers
Token ID12734
Feature activation+2.60e+00

Pos logit contributions
 pulp+0.199
Sorry+0.185
izz+0.183
fed+0.181
forts+0.174
Neg logit contributions
 math-0.188
 robot-0.161
 cushion-0.160
 mask-0.157
 because-0.157
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
 pulp+0.132
ment+0.108
 fishes+0.103
Happy+0.102
oms+0.102
Neg logit contributions
 not-0.085
 because-0.082
 homework-0.081
zag-0.079
 math-0.077
Token The
Token ID383
Feature activation+6.72e-01

Pos logit contributions
 homework+0.005
ggie+0.005
zag+0.005
 operation+0.005
 not+0.004
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
 flags-0.004
 banners-0.004
Token child
Token ID1200
Feature activation+1.48e-01

Pos logit contributions
 pulp+0.032
oms+0.028
ment+0.028
 fishes+0.027
als+0.027
Neg logit contributions
 homework-0.024
 operation-0.022
 math-0.022
 x-0.022
 robot-0.021
Token loved
Token ID6151
Feature activation-6.65e-02

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.006
ggie-0.005
 math-0.005
 operation-0.005
zag-0.005
Token was
Token ID373
Feature activation-9.81e-01

Pos logit contributions
 homework+0.041
 math+0.038
 operation+0.037
 exam+0.037
 robber+0.036
Neg logit contributions
 pulp-0.032
 fishes-0.028
 cheered-0.027
als-0.027
 shells-0.027
Token big
Token ID1263
Feature activation+1.33e-01

Pos logit contributions
 homework+0.040
zag+0.039
ggie+0.037
 operation+0.037
 Maxwell+0.036
Neg logit contributions
 pulp-0.040
 shells-0.035
 fishes-0.034
 seeds-0.034
 grains-0.034
Token and
Token ID290
Feature activation-3.04e-01

Pos logit contributions
 pulp+0.007
 fishes+0.006
oms+0.006
ment+0.006
 flags+0.006
Neg logit contributions
 homework-0.004
ggie-0.004
 operation-0.004
 math-0.004
zag-0.004
Token full
Token ID1336
Feature activation+2.71e+00

Pos logit contributions
 homework+0.013
ggie+0.012
 operation+0.012
zag+0.012
 math+0.012
Neg logit contributions
 pulp-0.012
 fishes-0.010
 shells-0.010
 flags-0.010
oms-0.010
Token of
Token ID286
Feature activation+3.34e+00

Pos logit contributions
 pulp+0.140
ins+0.117
 valleys+0.116
 pans+0.114
 banners+0.113
Neg logit contributions
 not-0.084
 because-0.082
 math-0.075
 Max-0.074
 bone-0.073
Token colourful
Token ID44799
Feature activation+5.04e+00

Pos logit contributions
 pulp+0.131
als+0.124
oms+0.124
ment+0.122
omed+0.122
Neg logit contributions
 not-0.127
 math-0.123
 especially-0.111
 really-0.110
 homework-0.110
Token fish
Token ID5916
Feature activation+1.63e+00

Pos logit contributions
izz+0.193
arl+0.186
 pulp+0.184
forts+0.181
fed+0.180
Neg logit contributions
 math-0.177
 hockey-0.161
 because-0.160
 mask-0.156
 vest-0.155
Token.
Token ID13
Feature activation-7.09e-01

Pos logit contributions
 pulp+0.069
 banners+0.059
Happy+0.059
oms+0.058
ment+0.057
Neg logit contributions
 not-0.063
 homework-0.062
 math-0.059
ggie-0.059
zag-0.058
Token Tom
Token ID4186
Feature activation-6.10e-01

Pos logit contributions
 operation+0.032
 homework+0.032
 exam+0.030
zag+0.030
 x+0.029
Neg logit contributions
 pulp-0.024
 ge-0.021
ment-0.021
oms-0.021
 fishes-0.020
Token thought
Token ID1807
Feature activation-4.08e-01

Pos logit contributions
 homework+0.029
 operation+0.027
 math+0.026
 exam+0.025
zag+0.025
Neg logit contributions
 pulp-0.022
 fishes-0.019
 cheered-0.018
 ge-0.018
 shells-0.018
Token it
Token ID340
Feature activation-1.75e+00

Pos logit contributions
 homework+0.017
zag+0.016
 operation+0.016
isk+0.016
ggie+0.015
Neg logit contributions
 pulp-0.016
oms-0.014
 fishes-0.014
 shells-0.014
 flags-0.014
Token and
Token ID290
Feature activation+8.14e-01

Pos logit contributions
 pulp+0.037
 fishes+0.033
oms+0.032
ment+0.032
 ge+0.032
Neg logit contributions
 homework-0.022
 not-0.021
zag-0.020
 math-0.020
ggie-0.020
Token picked
Token ID6497
Feature activation+8.20e-01

Pos logit contributions
 pulp+0.033
oms+0.030
ment+0.029
 fishes+0.028
 islands+0.028
Neg logit contributions
 homework-0.033
 not-0.032
 math-0.030
ggie-0.030
zag-0.030
Token out
Token ID503
Feature activation+4.38e-01

Pos logit contributions
 pulp+0.042
oms+0.037
ment+0.036
 banners+0.036
 fishes+0.036
Neg logit contributions
 homework-0.024
 math-0.023
 not-0.022
ggie-0.022
 operation-0.021
Token lots
Token ID6041
Feature activation+2.85e+00

Pos logit contributions
 pulp+0.021
oms+0.019
 fishes+0.019
ment+0.018
 banners+0.018
Neg logit contributions
 homework-0.014
 math-0.013
 not-0.013
ggie-0.013
 operation-0.013
Token of
Token ID286
Feature activation+3.42e+00

Pos logit contributions
 pulp+0.162
Sorry+0.129
lan+0.127
women+0.127
 valleys+0.127
Neg logit contributions
 math-0.081
 because-0.065
 homework-0.064
 operation-0.061
 x-0.059
Token colorful
Token ID20239
Feature activation+5.04e+00

Pos logit contributions
 pulp+0.163
lan+0.161
plets+0.160
ment+0.159
Sorry+0.158
Neg logit contributions
 math-0.105
 x-0.096
 homework-0.093
 cord-0.088
 er-0.088
Token flowers
Token ID12734
Feature activation+2.45e+00

Pos logit contributions
Sorry+0.210
 pulp+0.206
arl+0.204
forts+0.203
fed+0.203
Neg logit contributions
 math-0.158
 er-0.139
 cord-0.128
 exercise-0.126
 puzzles-0.124
Token.
Token ID13
Feature activation-4.73e-01

Pos logit contributions
 pulp+0.123
 fishes+0.099
ment+0.099
 ge+0.099
 banners+0.095
Neg logit contributions
 not-0.083
 because-0.079
 homework-0.076
zag-0.075
 math-0.073
Token They
Token ID1119
Feature activation+2.07e+00

Pos logit contributions
 homework+0.020
ggie+0.018
 operation+0.018
zag+0.018
 x+0.018
Neg logit contributions
 pulp-0.019
 fishes-0.017
oms-0.017
als-0.017
 ge-0.016
Token placed
Token ID4624
Feature activation+1.36e+00

Pos logit contributions
ment+0.096
oms+0.091
 pulp+0.090
 fishes+0.082
ins+0.081
Neg logit contributions
 not-0.071
 always-0.064
 homework-0.064
 drib-0.063
 skate-0.059
Token the
Token ID262
Feature activation+2.22e+00

Pos logit contributions
 pulp+0.057
oms+0.052
ment+0.049
Happy+0.048
 fishes+0.047
Neg logit contributions
 homework-0.050
 not-0.050
 math-0.048
 Maxwell-0.045
 x-0.045
Token and
Token ID290
Feature activation+2.48e+00

Pos logit contributions
 pulp+0.088
Happy+0.075
 banners+0.075
ment+0.074
oms+0.073
Neg logit contributions
 not-0.053
 homework-0.048
 math-0.046
zag-0.045
 because-0.045
Token trees
Token ID7150
Feature activation+2.19e+00

Pos logit contributions
oms+0.094
 pulp+0.092
omed+0.087
ment+0.083
ins+0.080
Neg logit contributions
 not-0.104
 homework-0.101
 math-0.098
 robot-0.095
 mask-0.090
Token and
Token ID290
Feature activation+2.65e+00

Pos logit contributions
 pulp+0.108
Happy+0.092
ment+0.091
oms+0.090
 fishes+0.089
Neg logit contributions
 not-0.078
zag-0.068
 homework-0.066
 because-0.065
 skate-0.062
Token lots
Token ID6041
Feature activation+2.86e+00

Pos logit contributions
 pulp+0.104
oms+0.103
omed+0.098
ment+0.097
izz+0.091
Neg logit contributions
 not-0.103
 math-0.101
 homework-0.101
 robot-0.099
 mask-0.093
Token of
Token ID286
Feature activation+3.40e+00

Pos logit contributions
 pulp+0.163
 pans+0.125
suits+0.120
ment+0.119
 banners+0.119
Neg logit contributions
 because-0.086
 math-0.083
 robot-0.074
 not-0.072
 about-0.071
Token green
Token ID4077
Feature activation+4.93e+00

Pos logit contributions
 pulp+0.162
oms+0.152
ment+0.148
omed+0.142
als+0.140
Neg logit contributions
 math-0.110
 robot-0.106
 homework-0.102
 x-0.102
 operation-0.097
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
izz+0.234
iza+0.228
Sorry+0.211
 pulp+0.206
oms+0.202
Neg logit contributions
 bone-0.150
 robot-0.143
 math-0.143
 because-0.133
 mask-0.133
Token But
Token ID887
Feature activation-3.51e-01

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.005
ggie-0.005
 operation-0.005
 math-0.005
zag-0.005
Token then
Token ID788
Feature activation+6.05e-02

Pos logit contributions
 homework+0.012
ggie+0.012
zag+0.011
 operation+0.011
 x+0.011
Neg logit contributions
 pulp-0.017
 fishes-0.015
 shells-0.014
 flags-0.014
 islands-0.014
Token something
Token ID1223
Feature activation+2.16e-01

Pos logit contributions
 pulp+0.003
 fishes+0.002
oms+0.002
ment+0.002
 flags+0.002
Neg logit contributions
 homework-0.002
ggie-0.002
 operation-0.002
zag-0.002
 math-0.002
Token bad
Token ID2089
Feature activation+9.40e-02

Pos logit contributions
 pulp+0.010
 fishes+0.009
oms+0.009
ment+0.009
 flags+0.008
Neg logit contributions
 homework-0.008
ggie-0.007
 math-0.007
 operation-0.007
 not-0.007
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
 pulp+0.046
oms+0.037
 fishes+0.037
 banners+0.036
 ge+0.036
Neg logit contributions
 homework-0.045
ggie-0.044
 not-0.043
 math-0.043
 operation-0.042
Token There
Token ID1318
Feature activation+4.61e-01

Pos logit contributions
 homework+0.006
ggie+0.005
zag+0.005
 operation+0.005
 not+0.005
Neg logit contributions
 pulp-0.010
 fishes-0.009
oms-0.009
ment-0.009
als-0.008
Token were
Token ID547
Feature activation+1.65e+00

Pos logit contributions
 pulp+0.019
 fishes+0.016
oms+0.016
ment+0.015
 banners+0.015
Neg logit contributions
 homework-0.019
 not-0.018
ggie-0.018
 math-0.018
 operation-0.018
Token lots
Token ID6041
Feature activation+2.33e+00

Pos logit contributions
 pulp+0.072
oms+0.065
ment+0.063
als+0.059
Happy+0.058
Neg logit contributions
 not-0.061
 homework-0.059
 math-0.056
ggie-0.055
 skate-0.053
Token of
Token ID286
Feature activation+3.13e+00

Pos logit contributions
 pulp+0.115
lan+0.085
ment+0.084
eat+0.083
suits+0.083
Neg logit contributions
 math-0.080
 not-0.072
 balancing-0.071
 homework-0.071
 operation-0.071
Token small
Token ID1402
Feature activation+4.91e+00

Pos logit contributions
 pulp+0.144
als+0.128
ment+0.123
forts+0.122
omed+0.118
Neg logit contributions
 math-0.105
 especially-0.101
 not-0.099
 homework-0.097
 x-0.094
Token fishes
Token ID49765
Feature activation+2.10e+00

Pos logit contributions
 pulp+0.193
forts+0.182
arl+0.181
Happy+0.177
fed+0.175
Neg logit contributions
 math-0.162
 because-0.160
 balancing-0.156
 robot-0.152
 especially-0.150
Token swimming
Token ID14899
Feature activation+1.84e+00

Pos logit contributions
 pulp+0.104
Happy+0.090
ment+0.088
eat+0.085
 banners+0.084
Neg logit contributions
 not-0.066
 homework-0.063
 skate-0.062
zag-0.062
 math-0.060
Token around
Token ID1088
Feature activation+8.18e-01

Pos logit contributions
 pulp+0.105
ment+0.086
Happy+0.085
oms+0.085
lan+0.084
Neg logit contributions
zag-0.045
 not-0.045
 skate-0.042
ggie-0.042
 homework-0.042
Token his
Token ID465
Feature activation+1.89e+00

Pos logit contributions
 pulp+0.039
oms+0.034
ment+0.032
 fishes+0.032
 banners+0.032
Neg logit contributions
 homework-0.028
 not-0.027
 math-0.027
ggie-0.026
 operation-0.025
Token hands
Token ID2832
Feature activation+8.95e-01

Pos logit contributions
 pulp+0.085
Happy+0.075
oms+0.072
ment+0.071
eat+0.071
Neg logit contributions
 not-0.070
 skate-0.067
 homework-0.066
zag-0.065
 robot-0.064
Token big
Token ID1263
Feature activation+1.28e+00

Pos logit contributions
 pulp+0.067
ment+0.062
oms+0.060
 fishes+0.060
ins+0.058
Neg logit contributions
 homework-0.055
 math-0.053
 not-0.052
 robot-0.051
zag-0.050
Token pond
Token ID16723
Feature activation+2.25e+00

Pos logit contributions
 pulp+0.055
ment+0.050
ins+0.049
 fishes+0.049
oms+0.048
Neg logit contributions
 homework-0.047
 math-0.045
 robot-0.044
 operation-0.043
 not-0.043
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
 pulp+0.100
 fishes+0.085
Happy+0.085
oms+0.083
ment+0.083
Neg logit contributions
 not-0.083
 homework-0.079
zag-0.077
ggie-0.075
 skate-0.075
Token lots
Token ID6041
Feature activation+3.24e+00

Pos logit contributions
 pulp+0.070
oms+0.065
ment+0.061
 fishes+0.061
ins+0.060
Neg logit contributions
 homework-0.040
 not-0.039
 math-0.037
ggie-0.035
 skate-0.034
Token of
Token ID286
Feature activation+3.58e+00

Pos logit contributions
 pulp+0.178
use+0.149
suits+0.147
 islands+0.142
lan+0.142
Neg logit contributions
 math-0.090
 because-0.078
 about-0.076
 operation-0.074
 robot-0.073
Token colorful
Token ID20239
Feature activation+4.90e+00

Pos logit contributions
 pulp+0.132
oms+0.120
ment+0.113
als+0.113
omed+0.111
Neg logit contributions
 math-0.145
 homework-0.137
 not-0.133
 x-0.133
 mask-0.131
Token flowers
Token ID12734
Feature activation+3.38e+00

Pos logit contributions
 pulp+0.165
Sorry+0.165
forts+0.164
use+0.158
izz+0.156
Neg logit contributions
 math-0.180
 mask-0.170
 robot-0.166
 superhero-0.157
 homework-0.153
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
 pulp+0.166
ment+0.139
Happy+0.137
 fishes+0.137
Sorry+0.136
Neg logit contributions
 not-0.108
zag-0.100
 because-0.096
 homework-0.093
case-0.090
Token
Token ID198
Feature activation+8.32e-01

Pos logit contributions
 pulp+0.027
 fishes+0.024
oms+0.023
ment+0.023
 flags+0.023
Neg logit contributions
 homework-0.018
 not-0.017
 math-0.017
ggie-0.016
zag-0.016
Token
Token ID198
Feature activation+4.61e-01

Pos logit contributions
 pulp+0.039
 fishes+0.033
 banners+0.033
oms+0.033
 flags+0.033
Neg logit contributions
 homework-0.029
 not-0.029
ggie-0.028
 math-0.027
zag-0.027
TokenM
Token ID44
Feature activation+3.89e-01

Pos logit contributions
 pulp+0.019
oms+0.016
 fishes+0.016
ment+0.016
 flags+0.016
Neg logit contributions
 homework-0.018
ggie-0.018
 not-0.018
 math-0.017
zag-0.017
Token garden
Token ID11376
Feature activation+1.79e+00

Pos logit contributions
 pulp+0.122
ment+0.118
ins+0.111
 banners+0.111
 sailors+0.109
Neg logit contributions
 not-0.076
 math-0.063
 exam-0.063
 operation-0.062
 because-0.062
Token were
Token ID547
Feature activation+2.61e+00

Pos logit contributions
 pulp+0.104
 banners+0.091
 fishes+0.090
 flags+0.087
 grains+0.086
Neg logit contributions
 not-0.048
 math-0.042
 homework-0.042
zag-0.039
 operation-0.038
Token lots
Token ID6041
Feature activation+2.92e+00

Pos logit contributions
oms+0.130
ment+0.128
forts+0.127
 pulp+0.127
leness+0.120
Neg logit contributions
 not-0.085
 math-0.066
 so-0.066
 only-0.061
 always-0.061
Token of
Token ID286
Feature activation+3.19e+00

Pos logit contributions
 pulp+0.154
women+0.123
ment+0.123
 valleys+0.120
use+0.120
Neg logit contributions
 math-0.086
 because-0.085
 not-0.077
 exercise-0.076
 operation-0.074
Token beautiful
Token ID4950
Feature activation+4.86e+00

Pos logit contributions
oms+0.142
 pulp+0.138
ment+0.135
omed+0.131
plets+0.130
Neg logit contributions
 not-0.110
 math-0.110
 exercise-0.102
 especially-0.099
 always-0.097
Token colourful
Token ID44799
Feature activation+4.88e+00

Pos logit contributions
fed+0.211
 pulp+0.206
forts+0.204
iers+0.203
oms+0.203
Neg logit contributions
 because-0.143
 math-0.142
 cushion-0.133
 not-0.129
 favorite-0.125
Token flowers
Token ID12734
Feature activation+3.00e+00

Pos logit contributions
 pulp+0.203
 extinguished+0.201
fed+0.199
forts+0.194
oms+0.191
Neg logit contributions
 because-0.149
 math-0.147
 cushion-0.146
 robot-0.127
 just-0.124
Token.
Token ID13
Feature activation+9.84e-01

Pos logit contributions
 pulp+0.166
ment+0.135
ocol+0.134
 banners+0.132
Sorry+0.132
Neg logit contributions
 because-0.096
 not-0.092
zag-0.082
case-0.074
 always-0.072
Token Every
Token ID3887
Feature activation-6.94e-01

Pos logit contributions
 pulp+0.058
ment+0.051
oms+0.051
ins+0.050
 fishes+0.050
Neg logit contributions
 homework-0.023
 math-0.022
 not-0.022
 operation-0.021
 x-0.020
Token day
Token ID1110
Feature activation+1.56e+00

Pos logit contributions
 homework+0.025
 Maxwell+0.024
 robber+0.022
 x+0.022
 math+0.022
Neg logit contributions
 pulp-0.031
 fishes-0.029
oms-0.028
 ge-0.028
ment-0.027
Token,
Token ID11
Feature activation+7.68e-01

Pos logit contributions
 pulp+0.095
ins+0.084
 islands+0.081
oms+0.080
 fishes+0.080
Neg logit contributions
 not-0.036
 math-0.035
ggie-0.035
 homework-0.033
 Max-0.032
Token their
Token ID511
Feature activation+4.40e-01

Pos logit contributions
 pulp+0.050
oms+0.043
 fishes+0.041
ment+0.040
 ge+0.040
Neg logit contributions
 homework-0.041
 not-0.039
 math-0.038
ggie-0.037
 operation-0.036
Token garden
Token ID11376
Feature activation+5.11e-01

Pos logit contributions
 pulp+0.021
oms+0.019
 fishes+0.018
ment+0.018
 ge+0.018
Neg logit contributions
 homework-0.015
 math-0.014
 operation-0.013
ggie-0.013
 not-0.013
Token with
Token ID351
Feature activation+2.32e+00

Pos logit contributions
 pulp+0.020
 fishes+0.017
oms+0.017
ment+0.016
 ge+0.016
Neg logit contributions
 homework-0.022
 math-0.021
 not-0.021
 operation-0.020
ggie-0.020
Token lots
Token ID6041
Feature activation+2.81e+00

Pos logit contributions
 pulp+0.105
oms+0.100
ment+0.095
als+0.095
aff+0.093
Neg logit contributions
 homework-0.084
 math-0.080
 not-0.078
 operation-0.071
 x-0.070
Token of
Token ID286
Feature activation+3.44e+00

Pos logit contributions
 pulp+0.145
 ge+0.119
 islands+0.113
 whales+0.113
 valleys+0.112
Neg logit contributions
 math-0.095
 homework-0.085
 operation-0.079
 exam-0.076
 robot-0.076
Token colorful
Token ID20239
Feature activation+4.88e+00

Pos logit contributions
 pulp+0.162
als+0.154
ment+0.152
lan+0.150
oms+0.148
Neg logit contributions
 math-0.118
 homework-0.106
 x-0.105
 operation-0.099
 cord-0.098
Token things
Token ID1243
Feature activation+1.89e+00

Pos logit contributions
 pulp+0.192
fed+0.182
Sorry+0.182
lan+0.181
forts+0.177
Neg logit contributions
 math-0.175
 er-0.157
 cord-0.156
 homework-0.148
 cushion-0.143
Token.
Token ID13
Feature activation+9.81e-01

Pos logit contributions
 pulp+0.085
 ge+0.069
 fishes+0.069
ment+0.067
oms+0.065
Neg logit contributions
 homework-0.075
 not-0.074
 math-0.070
zag-0.067
 operation-0.066
Token They
Token ID1119
Feature activation+1.72e+00

Pos logit contributions
 pulp+0.042
ment+0.034
oms+0.034
 banners+0.033
Happy+0.033
Neg logit contributions
 homework-0.037
 not-0.036
 math-0.035
 skate-0.035
ggie-0.035
Token bought
Token ID5839
Feature activation+1.95e+00

Pos logit contributions
 pulp+0.072
ment+0.070
oms+0.066
 ge+0.063
 fishes+0.063
Neg logit contributions
 not-0.062
 homework-0.061
 drib-0.057
 math-0.056
zag-0.056
Token some
Token ID617
Feature activation+2.78e+00

Pos logit contributions
 pulp+0.097
oms+0.087
ment+0.084
lan+0.081
Happy+0.080
Neg logit contributions
 homework-0.060
 math-0.059
 not-0.058
 x-0.053
 operation-0.052
Token of
Token ID286
Feature activation+2.67e+00

Pos logit contributions
 pulp+0.116
Happy+0.096
 islands+0.094
 ge+0.093
 valleys+0.091
Neg logit contributions
 not-0.092
 because-0.089
 math-0.086
zag-0.086
ggie-0.085
Token green
Token ID4077
Feature activation+3.42e+00

Pos logit contributions
 pulp+0.112
oms+0.110
ins+0.106
Happy+0.103
ment+0.100
Neg logit contributions
 not-0.100
 math-0.093
 mine-0.088
 because-0.087
 robot-0.086
Token and
Token ID290
Feature activation+1.99e+00

Pos logit contributions
Happy+0.149
 pulp+0.142
 cheered+0.129
prise+0.126
forts+0.126
Neg logit contributions
 because-0.113
 not-0.111
zag-0.107
 heel-0.107
 math-0.106
Token lots
Token ID6041
Feature activation+3.45e+00

Pos logit contributions
 pulp+0.093
oms+0.088
ins+0.084
ment+0.082
Happy+0.076
Neg logit contributions
 not-0.071
 math-0.061
 homework-0.059
 robot-0.059
 skate-0.058
Token of
Token ID286
Feature activation+3.78e+00

Pos logit contributions
 pulp+0.186
lan+0.156
women+0.153
 pans+0.151
Happy+0.148
Neg logit contributions
 because-0.105
 math-0.093
 not-0.090
 about-0.085
 so-0.084
Token tiny
Token ID7009
Feature activation+4.87e+00

Pos logit contributions
 pulp+0.154
oms+0.143
Happy+0.143
ment+0.134
lan+0.133
Neg logit contributions
 not-0.134
 math-0.131
 because-0.124
 just-0.121
 heel-0.121
Token,
Token ID11
Feature activation-3.59e-01

Pos logit contributions
Happy+0.205
 pulp+0.184
Sorry+0.183
forts+0.180
iza+0.179
Neg logit contributions
 because-0.162
 heel-0.148
 balancing-0.143
 just-0.142
 bone-0.141
Token colorful
Token ID20239
Feature activation+3.96e+00

Pos logit contributions
 homework+0.013
 operation+0.012
ggie+0.012
zag+0.012
 x+0.012
Neg logit contributions
 pulp-0.016
 fishes-0.015
oms-0.014
 flags-0.014
 shells-0.014
Token fish
Token ID5916
Feature activation+2.21e+00

Pos logit contributions
Happy+0.145
 pulp+0.142
forts+0.136
izz+0.132
fed+0.129
Neg logit contributions
 robot-0.142
 heel-0.142
 math-0.141
 because-0.141
 bone-0.139
Token!
Token ID0
Feature activation+3.33e-01

Pos logit contributions
 pulp+0.100
Happy+0.086
 banners+0.085
ment+0.083
oms+0.081
Neg logit contributions
 not-0.082
zag-0.076
ggie-0.073
 skate-0.073
 homework-0.072
Token Molly
Token ID30236
Feature activation+5.53e-01

Pos logit contributions
 pulp+0.016
 fishes+0.014
oms+0.014
ment+0.014
 flags+0.014
Neg logit contributions
 homework-0.011
 math-0.010
ggie-0.010
 operation-0.010
 not-0.010
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
 homework+0.003
ggie+0.003
 operation+0.003
 math+0.003
 x+0.003
Neg logit contributions
 pulp-0.003
 fishes-0.003
oms-0.003
ment-0.003
 ge-0.003
Token garden
Token ID11376
Feature activation+1.79e+00

Pos logit contributions
 pulp+0.122
ment+0.118
ins+0.111
 banners+0.111
 sailors+0.109
Neg logit contributions
 not-0.076
 math-0.063
 exam-0.063
 operation-0.062
 because-0.062
Token were
Token ID547
Feature activation+2.61e+00

Pos logit contributions
 pulp+0.104
 banners+0.091
 fishes+0.090
 flags+0.087
 grains+0.086
Neg logit contributions
 not-0.048
 math-0.042
 homework-0.042
zag-0.039
 operation-0.038
Token lots
Token ID6041
Feature activation+2.92e+00

Pos logit contributions
oms+0.130
ment+0.128
forts+0.127
 pulp+0.127
leness+0.120
Neg logit contributions
 not-0.085
 math-0.066
 so-0.066
 only-0.061
 always-0.061
Token of
Token ID286
Feature activation+3.19e+00

Pos logit contributions
 pulp+0.154
women+0.123
ment+0.123
 valleys+0.120
use+0.120
Neg logit contributions
 math-0.086
 because-0.085
 not-0.077
 exercise-0.076
 operation-0.074
Token beautiful
Token ID4950
Feature activation+4.86e+00

Pos logit contributions
oms+0.142
 pulp+0.138
ment+0.135
omed+0.131
plets+0.130
Neg logit contributions
 not-0.110
 math-0.110
 exercise-0.102
 especially-0.099
 always-0.097
Token colourful
Token ID44799
Feature activation+4.88e+00

Pos logit contributions
fed+0.211
 pulp+0.206
forts+0.204
iers+0.203
oms+0.203
Neg logit contributions
 because-0.143
 math-0.142
 cushion-0.133
 not-0.129
 favorite-0.125
Token flowers
Token ID12734
Feature activation+3.00e+00

Pos logit contributions
 pulp+0.203
 extinguished+0.201
fed+0.199
forts+0.194
oms+0.191
Neg logit contributions
 because-0.149
 math-0.147
 cushion-0.146
 robot-0.127
 just-0.124
Token.
Token ID13
Feature activation+9.84e-01

Pos logit contributions
 pulp+0.166
ment+0.135
ocol+0.134
 banners+0.132
Sorry+0.132
Neg logit contributions
 because-0.096
 not-0.092
zag-0.082
case-0.074
 always-0.072
Token Every
Token ID3887
Feature activation-6.94e-01

Pos logit contributions
 pulp+0.058
ment+0.051
oms+0.051
ins+0.050
 fishes+0.050
Neg logit contributions
 homework-0.023
 math-0.022
 not-0.022
 operation-0.021
 x-0.020
Token day
Token ID1110
Feature activation+1.56e+00

Pos logit contributions
 homework+0.025
 Maxwell+0.024
 robber+0.022
 x+0.022
 math+0.022
Neg logit contributions
 pulp-0.031
 fishes-0.029
oms-0.028
 ge-0.028
ment-0.027
Token would
Token ID561
Feature activation-7.43e-01

Pos logit contributions
 homework+0.003
 math+0.002
ggie+0.002
 operation+0.002
zag+0.002
Neg logit contributions
 pulp-0.003
 fishes-0.003
oms-0.003
 ge-0.003
 flags-0.003
Token fill
Token ID6070
Feature activation-3.19e-01

Pos logit contributions
 homework+0.029
ggie+0.027
 math+0.027
 Maxwell+0.027
zag+0.027
Neg logit contributions
 pulp-0.031
 fishes-0.027
 shells-0.026
 flags-0.026
 pump-0.026
Token the
Token ID262
Feature activation+2.55e+00

Pos logit contributions
 homework+0.010
zag+0.009
 operation+0.009
ggie+0.009
 math+0.009
Neg logit contributions
 pulp-0.016
 fishes-0.015
 shells-0.014
ment-0.014
 flags-0.014
Token bucket
Token ID19236
Feature activation-4.06e-01

Pos logit contributions
ment+0.097
 pulp+0.091
oms+0.090
ins+0.089
Happy+0.088
Neg logit contributions
 not-0.097
 bone-0.091
 mine-0.089
 heel-0.089
 shelf-0.089
Token with
Token ID351
Feature activation+2.01e+00

Pos logit contributions
 homework+0.014
ggie+0.013
 operation+0.013
 x+0.013
 math+0.013
Neg logit contributions
 pulp-0.019
 fishes-0.017
oms-0.017
ment-0.016
 shells-0.016
Token fresh
Token ID4713
Feature activation+4.85e+00

Pos logit contributions
oms+0.070
 pulp+0.066
omed+0.062
Happy+0.061
ment+0.058
Neg logit contributions
 not-0.088
 math-0.088
 homework-0.087
 skate-0.082
 exercise-0.081
Token water
Token ID1660
Feature activation+1.11e+00

Pos logit contributions
 announcements+0.202
 pulp+0.202
Happy+0.200
prise+0.198
 priest+0.191
Neg logit contributions
 because-0.156
 exercise-0.132
 bone-0.128
 so-0.126
 and-0.122
Token from
Token ID422
Feature activation-1.06e-01

Pos logit contributions
 pulp+0.055
oms+0.047
Happy+0.047
 banners+0.046
 fishes+0.046
Neg logit contributions
 homework-0.035
 not-0.034
 math-0.032
zag-0.032
 skate-0.031
Token the
Token ID262
Feature activation+2.52e+00

Pos logit contributions
 homework+0.004
ggie+0.004
zag+0.004
 operation+0.004
 math+0.004
Neg logit contributions
 pulp-0.005
 fishes-0.004
 shells-0.004
ment-0.004
 ge-0.004
Token nearby
Token ID6716
Feature activation+2.51e+00

Pos logit contributions
ment+0.106
ins+0.103
oms+0.102
 pulp+0.095
Happy+0.093
Neg logit contributions
 not-0.096
 robot-0.089
 skate-0.088
 operation-0.087
 shelf-0.087
Token river
Token ID7850
Feature activation+1.02e+00

Pos logit contributions
 pulp+0.113
ins+0.107
ment+0.104
Happy+0.102
oms+0.101
Neg logit contributions
 shelf-0.082
 robot-0.080
 skate-0.080
 bone-0.080
 operation-0.080
Token and
Token ID290
Feature activation+2.98e-01

Pos logit contributions
 homework+0.022
 operation+0.020
 math+0.020
 x+0.020
zag+0.019
Neg logit contributions
 pulp-0.017
 fishes-0.015
ment-0.014
als-0.014
oms-0.014
Token Mum
Token ID38367
Feature activation+2.52e+00

Pos logit contributions
 pulp+0.016
oms+0.014
 fishes+0.014
ment+0.014
 flags+0.014
Neg logit contributions
 homework-0.009
ggie-0.008
 not-0.008
 math-0.008
 operation-0.008
Token picked
Token ID6497
Feature activation+8.20e-01

Pos logit contributions
 pulp+0.111
 islands+0.103
oms+0.102
ment+0.102
 whales+0.096
Neg logit contributions
 not-0.084
 drib-0.084
 always-0.080
ggie-0.080
isk-0.079
Token lots
Token ID6041
Feature activation+2.56e+00

Pos logit contributions
 pulp+0.046
oms+0.041
ment+0.040
 banners+0.039
 fishes+0.039
Neg logit contributions
 homework-0.021
 math-0.019
 operation-0.019
ggie-0.019
 not-0.018
Token of
Token ID286
Feature activation+2.82e+00

Pos logit contributions
 pulp+0.128
 islands+0.103
 valleys+0.103
lan+0.101
 banners+0.101
Neg logit contributions
 math-0.081
 not-0.073
 homework-0.072
 because-0.070
ggie-0.070
Token colourful
Token ID44799
Feature activation+4.84e+00

Pos logit contributions
 pulp+0.134
lan+0.125
oms+0.119
 islands+0.118
ment+0.118
Neg logit contributions
 math-0.092
 homework-0.088
 x-0.083
 not-0.081
 operation-0.080
Token flowers
Token ID12734
Feature activation+2.46e+00

Pos logit contributions
 pulp+0.188
arl+0.185
Sorry+0.181
 lighthouse+0.175
lan+0.171
Neg logit contributions
 math-0.173
 er-0.157
 exercise-0.147
 exercises-0.145
 yoga-0.145
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
 pulp+0.133
 islands+0.113
Happy+0.112
 banners+0.112
 fishes+0.111
Neg logit contributions
 not-0.067
 because-0.062
 homework-0.058
zag-0.057
 math-0.054
Token made
Token ID925
Feature activation+1.73e+00

Pos logit contributions
oms+0.099
 pulp+0.098
 islands+0.088
ins+0.087
ment+0.087
Neg logit contributions
 not-0.089
 homework-0.083
 math-0.081
 Max-0.077
ggie-0.076
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
 pulp+0.093
oms+0.086
 islands+0.082
 banners+0.080
als+0.079
Neg logit contributions
 homework-0.045
 not-0.044
ggie-0.043
 math-0.042
 operation-0.039
Token pretty
Token ID2495
Feature activation+1.41e+00

Pos logit contributions
 pulp+0.058
ment+0.054
oms+0.053
 fishes+0.051
 islands+0.050
Neg logit contributions
 homework-0.058
 math-0.056
 not-0.056
ggie-0.054
 operation-0.053
Token it
Token ID340
Feature activation-6.83e-01

Pos logit contributions
 pulp+0.012
 fishes+0.011
oms+0.011
ment+0.010
 flags+0.010
Neg logit contributions
 homework-0.009
 math-0.009
ggie-0.009
 operation-0.009
 not-0.008
Token.
Token ID13
Feature activation+3.20e-01

Pos logit contributions
 homework+0.029
ggie+0.027
 operation+0.027
 robber+0.027
 x+0.026
Neg logit contributions
 pulp-0.025
 fishes-0.022
oms-0.022
als-0.022
 flags-0.022
Token He
Token ID679
Feature activation-1.15e+00

Pos logit contributions
 pulp+0.017
 fishes+0.015
oms+0.015
ment+0.014
 banners+0.014
Neg logit contributions
 homework-0.010
 math-0.009
ggie-0.009
 not-0.009
 operation-0.009
Token practiced
Token ID19893
Feature activation-2.67e+00

Pos logit contributions
 homework+0.054
 operation+0.051
 robber+0.049
 x+0.049
ggie+0.049
Neg logit contributions
 pulp-0.038
 cheered-0.035
 fishes-0.033
als-0.032
 flags-0.032
Token his
Token ID465
Feature activation-3.64e+00

Pos logit contributions
liv+0.101
ibo+0.100
isk+0.099
ggie+0.099
scar+0.097
Neg logit contributions
 flags-0.096
 cheered-0.090
 valleys-0.089
 banners-0.089
 pulp-0.089
Token math
Token ID10688
Feature activation-4.33e+00

Pos logit contributions
isk+0.148
scar+0.128
liv+0.127
penter+0.121
 Maxwell+0.120
Neg logit contributions
 flags-0.130
 shells-0.120
 faces-0.120
oms-0.114
 pump-0.113
Token problems
Token ID2761
Feature activation-2.44e+00

Pos logit contributions
scar+0.155
tons+0.148
ibo+0.147
isk+0.146
liv+0.143
Neg logit contributions
 flags-0.155
 cheered-0.146
 valleys-0.143
 banners-0.141
oms-0.141
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
 homework+0.099
ggie+0.095
 x+0.094
 operation+0.094
 robber+0.091
Neg logit contributions
oms-0.081
 cheered-0.078
ment-0.077
 flags-0.077
 pulp-0.077
Token soon
Token ID2582
Feature activation+2.63e-01

Pos logit contributions
 homework+0.040
ggie+0.038
 operation+0.038
 x+0.037
zag+0.037
Neg logit contributions
 pulp-0.045
 flags-0.040
 cheered-0.040
 fishes-0.040
ment-0.040
Token he
Token ID339
Feature activation-1.34e+00

Pos logit contributions
 pulp+0.014
 fishes+0.012
oms+0.012
ment+0.012
 flags+0.012
Neg logit contributions
 homework-0.008
ggie-0.007
 math-0.007
 operation-0.007
 not-0.007
Token could
Token ID714
Feature activation-2.11e+00

Pos logit contributions
 homework+0.060
 operation+0.058
 x+0.057
 robber+0.055
ggie+0.054
Neg logit contributions
 pulp-0.046
 cheered-0.043
als-0.042
 fishes-0.040
oms-0.040
Token every
Token ID790
Feature activation-2.00e+00

Pos logit contributions
 robber+0.092
scar+0.089
 x+0.086
omm+0.085
ibo+0.085
Neg logit contributions
 flags-0.072
 banners-0.066
 pump-0.064
 valleys-0.064
 them-0.064
Token day
Token ID1110
Feature activation-9.28e-01

Pos logit contributions
isk+0.085
 Maxwell+0.082
 homework+0.079
 not+0.076
 Rat+0.076
Neg logit contributions
 pulp-0.067
Happy-0.065
 sunlight-0.064
oms-0.063
 fishes-0.062
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 homework+0.050
 operation+0.047
zag+0.047
 x+0.047
ggie+0.046
Neg logit contributions
 pulp-0.022
als-0.019
oms-0.019
 flags-0.019
ment-0.019
Token But
Token ID887
Feature activation-2.03e+00

Pos logit contributions
 homework+0.021
ggie+0.020
zag+0.019
 operation+0.019
 robber+0.019
Neg logit contributions
 pulp-0.022
oms-0.019
 fishes-0.019
als-0.018
 pump-0.018
Token she
Token ID673
Feature activation-2.61e+00

Pos logit contributions
 robber+0.085
ggie+0.084
zag+0.082
isk+0.081
 Maxwell+0.079
Neg logit contributions
 pulp-0.070
 pump-0.064
 shells-0.062
 ge-0.062
ment-0.061
Token didn
Token ID1422
Feature activation-4.31e+00

Pos logit contributions
 robber+0.112
 Maxwell+0.111
 x+0.109
 math+0.107
 operation+0.105
Neg logit contributions
 admired-0.077
 pulp-0.077
 ge-0.076
 cheered-0.076
 were-0.074
Token't
Token ID470
Feature activation-8.21e-01

Pos logit contributions
isk+0.177
pert+0.170
 robber+0.164
 rider+0.163
ci+0.160
Neg logit contributions
oms-0.156
Happy-0.126
ades-0.120
Everyone-0.119
izz-0.118
Token like
Token ID588
Feature activation-1.72e+00

Pos logit contributions
 homework+0.032
ggie+0.031
zag+0.030
 x+0.030
 math+0.030
Neg logit contributions
 pulp-0.031
 pump-0.027
oms-0.027
 fishes-0.027
 ge-0.026
Token to
Token ID284
Feature activation-6.62e-01

Pos logit contributions
zag+0.076
isk+0.073
ggie+0.073
penter+0.071
 homework+0.070
Neg logit contributions
 pulp-0.054
oms-0.054
 flags-0.053
 pump-0.051
 banners-0.050
Token listen
Token ID6004
Feature activation-6.99e-01

Pos logit contributions
 homework+0.024
zag+0.024
 Maxwell+0.023
ggie+0.023
 math+0.022
Neg logit contributions
 pulp-0.027
 pump-0.023
 flags-0.023
 fishes-0.023
oms-0.023
Token to
Token ID284
Feature activation-1.47e+00

Pos logit contributions
 homework+0.029
ggie+0.027
 x+0.027
 operation+0.027
zag+0.027
Neg logit contributions
 pulp-0.026
oms-0.023
ment-0.023
 flags-0.023
 fishes-0.023
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 homework+0.036
ggie+0.035
 x+0.035
 operation+0.034
 Maxwell+0.034
Neg logit contributions
 pulp-0.029
ment-0.026
oms-0.026
 fishes-0.026
 shells-0.025
Token He
Token ID679
Feature activation-1.53e+00

Pos logit contributions
 homework+0.057
ggie+0.053
zag+0.051
 operation+0.051
 x+0.050
Neg logit contributions
 pulp-0.038
ment-0.035
 fishes-0.035
 sunlight-0.034
Happy-0.033
Token practiced
Token ID19893
Feature activation-2.68e+00

Pos logit contributions
 homework+0.070
 math+0.067
 x+0.066
 robber+0.066
 operation+0.065
Neg logit contributions
 pulp-0.047
 cheered-0.046
 fishes-0.044
 ge-0.044
 flags-0.043
Token riding
Token ID10311
Feature activation-2.57e+00

Pos logit contributions
isk+0.104
 x+0.101
omm+0.101
ggie+0.100
ibo+0.100
Neg logit contributions
 flags-0.095
 pump-0.087
 valleys-0.087
 islands-0.087
 cheered-0.085
Token his
Token ID465
Feature activation-2.03e+00

Pos logit contributions
isk+0.108
 homework+0.107
 x+0.105
omm+0.105
 operation+0.103
Neg logit contributions
 flags-0.091
 shells-0.081
 cheered-0.080
 banners-0.080
 valleys-0.080
Token bicycle
Token ID17026
Feature activation-4.31e+00

Pos logit contributions
isk+0.055
ggie+0.053
 Maxwell+0.050
 Rat+0.048
liv+0.048
Neg logit contributions
 flags-0.101
 pulp-0.096
 shells-0.094
 pump-0.093
 banners-0.092
Token every
Token ID790
Feature activation-2.08e+00

Pos logit contributions
 x+0.191
scar+0.186
 homework+0.183
ggie+0.182
liv+0.179
Neg logit contributions
als-0.128
 flags-0.114
oms-0.104
 valleys-0.102
ment-0.102
Token day
Token ID1110
Feature activation-1.46e+00

Pos logit contributions
isk+0.090
 homework+0.084
 Maxwell+0.081
 Rat+0.081
 not+0.080
Neg logit contributions
 sunlight-0.069
 pulp-0.067
oms-0.067
 pump-0.065
 flags-0.065
Token to
Token ID284
Feature activation-1.98e+00

Pos logit contributions
 homework+0.063
 operation+0.057
ggie+0.057
 x+0.057
isk+0.056
Neg logit contributions
 pulp-0.050
oms-0.048
 flags-0.048
 fishes-0.047
als-0.047
Token get
Token ID651
Feature activation+2.84e-01

Pos logit contributions
 homework+0.069
ggie+0.067
zag+0.066
isk+0.065
 robber+0.065
Neg logit contributions
 flags-0.075
 pump-0.073
 shells-0.071
 pulp-0.070
 banners-0.069
Token ready
Token ID3492
Feature activation-3.25e-01

Pos logit contributions
 pulp+0.014
 fishes+0.012
oms+0.012
ment+0.012
 flags+0.012
Neg logit contributions
 homework-0.009
ggie-0.008
 math-0.008
 operation-0.008
 not-0.008
Token book
Token ID1492
Feature activation-2.06e+00

Pos logit contributions
isk+0.096
scar+0.089
ggie+0.086
liv+0.085
tons+0.084
Neg logit contributions
 pulp-0.135
 flags-0.129
 shells-0.127
oms-0.125
als-0.123
Token to
Token ID284
Feature activation-9.66e-01

Pos logit contributions
 homework+0.093
ggie+0.091
 Maxwell+0.090
 skate+0.090
 operation+0.090
Neg logit contributions
oms-0.055
als-0.055
 pulp-0.053
ment-0.052
 flags-0.051
Token practice
Token ID3357
Feature activation-2.30e+00

Pos logit contributions
zag+0.033
 homework+0.031
ggie+0.031
 Maxwell+0.029
 operation+0.029
Neg logit contributions
 pulp-0.042
Happy-0.037
 flags-0.037
 shells-0.037
 banners-0.036
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
zag+0.098
ggie+0.098
 robber+0.097
scar+0.096
isk+0.096
Neg logit contributions
 flags-0.067
 pulp-0.067
ment-0.062
 banners-0.062
oms-0.061
Token Lily
Token ID20037
Feature activation-2.70e+00

Pos logit contributions
zag+0.045
ggie+0.044
 homework+0.044
 skate+0.040
 rider+0.040
Neg logit contributions
 pulp-0.044
als-0.038
oms-0.038
 cups-0.038
 sunlight-0.037
Token didn
Token ID1422
Feature activation-4.28e+00

Pos logit contributions
 robber+0.138
 operation+0.137
 homework+0.134
 x+0.131
zag+0.129
Neg logit contributions
 cheered-0.075
 admired-0.069
 gasped-0.060
 thanked-0.059
 applauded-0.057
Token't
Token ID470
Feature activation-9.85e-01

Pos logit contributions
isk+0.171
pert+0.160
 mask+0.156
 rider+0.156
 robber+0.155
Neg logit contributions
oms-0.147
Happy-0.132
qu-0.121
Everyone-0.118
ades-0.118
Token want
Token ID765
Feature activation-1.84e+00

Pos logit contributions
 homework+0.047
ggie+0.046
zag+0.045
 operation+0.043
 math+0.043
Neg logit contributions
 pulp-0.028
 cheered-0.025
oms-0.024
 sunlight-0.024
 pump-0.024
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
zag+0.090
isk+0.087
ousel+0.085
 skate+0.083
ggie+0.082
Neg logit contributions
 pulp-0.051
als-0.048
 shells-0.047
oms-0.047
 sunlight-0.047
Token do
Token ID466
Feature activation-2.53e+00

Pos logit contributions
zag+0.050
isk+0.049
 homework+0.048
ggie+0.047
 Maxwell+0.046
Neg logit contributions
 pulp-0.043
 pump-0.042
 cheered-0.040
oms-0.040
 flags-0.040
Token it
Token ID340
Feature activation-3.29e+00

Pos logit contributions
zag+0.120
isk+0.114
ibo+0.109
scar+0.108
 rider+0.107
Neg logit contributions
 flags-0.069
oms-0.069
 pulp-0.066
ment-0.066
 faces-0.064
Token math
Token ID10688
Feature activation-3.44e+00

Pos logit contributions
 Maxwell+0.077
isk+0.074
ggie+0.072
tons+0.070
zag+0.068
Neg logit contributions
 pulp-0.152
 shells-0.148
 buckets-0.148
 flags-0.146
 pans-0.146
Token homework
Token ID26131
Feature activation-2.94e+00

Pos logit contributions
scar+0.096
isk+0.096
ibo+0.093
tons+0.093
 Maxwell+0.090
Neg logit contributions
 flags-0.153
 pulp-0.152
oms-0.146
 shells-0.146
 pump-0.145
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
ggie+0.159
 Maxwell+0.157
 x+0.157
 Rat+0.155
isk+0.154
Neg logit contributions
 flags-0.044
oms-0.043
 cheered-0.042
 pulp-0.042
 pump-0.037
Token Tim
Token ID5045
Feature activation-2.03e+00

Pos logit contributions
 homework+0.001
ggie+0.001
 operation+0.001
zag+0.001
 math+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.001
ment-0.001
 ge-0.001
Tokenmy
Token ID1820
Feature activation-2.11e+00

Pos logit contributions
 homework+0.093
 Label+0.092
 robber+0.092
ggie+0.091
 math+0.091
Neg logit contributions
ins-0.058
ies-0.057
 cheered-0.054
 pulp-0.052
 admired-0.049
Token didn
Token ID1422
Feature activation-4.28e+00

Pos logit contributions
 robber+0.099
 Patch+0.097
 homework+0.097
 Maxwell+0.096
 operation+0.095
Neg logit contributions
 cheered-0.063
 pulp-0.057
 admired-0.055
led-0.050
 gasped-0.049
Token't
Token ID470
Feature activation-6.55e-01

Pos logit contributions
isk+0.181
pert+0.169
 mask+0.161
 robber+0.161
 dictionary+0.160
Neg logit contributions
oms-0.132
Happy-0.127
Everyone-0.117
izz-0.117
ades-0.111
Token want
Token ID765
Feature activation-1.67e+00

Pos logit contributions
 homework+0.028
ggie+0.027
zag+0.027
 math+0.026
 operation+0.026
Neg logit contributions
 pulp-0.022
oms-0.019
Happy-0.019
 cheered-0.019
 pump-0.019
Token to
Token ID284
Feature activation-7.32e-01

Pos logit contributions
zag+0.080
isk+0.078
ggie+0.074
ousel+0.072
 Maxwell+0.070
Neg logit contributions
 shells-0.046
 pulp-0.046
 flags-0.045
 pump-0.043
 sunlight-0.042
Token do
Token ID466
Feature activation-2.54e+00

Pos logit contributions
zag+0.026
 homework+0.026
ggie+0.025
 Maxwell+0.025
isk+0.024
Neg logit contributions
 pulp-0.029
 pump-0.025
oms-0.025
 flags-0.025
 fishes-0.025
Token it
Token ID340
Feature activation-3.09e+00

Pos logit contributions
zag+0.122
isk+0.120
tr+0.114
ibo+0.109
icycle+0.108
Neg logit contributions
 flags-0.069
 faces-0.063
 shells-0.063
ment-0.062
oms-0.061
Token ride
Token ID6594
Feature activation-1.78e+00

Pos logit contributions
 homework+0.027
ggie+0.026
zag+0.025
 x+0.025
 operation+0.025
Neg logit contributions
 pulp-0.034
 flags-0.033
 banners-0.032
 shells-0.032
oms-0.031
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
isk+0.081
 operation+0.077
zag+0.077
 homework+0.075
 x+0.075
Neg logit contributions
 flags-0.053
 shells-0.047
 fishes-0.046
 banners-0.046
Happy-0.046
Token He
Token ID679
Feature activation-2.01e+00

Pos logit contributions
 homework+0.046
ggie+0.045
zag+0.041
 operation+0.041
 robber+0.040
Neg logit contributions
 pulp-0.042
 fishes-0.039
 pump-0.038
 ducks-0.038
 sunlight-0.038
Token would
Token ID561
Feature activation-2.51e+00

Pos logit contributions
 homework+0.100
 math+0.099
 operation+0.098
 x+0.098
isk+0.097
Neg logit contributions
 cheered-0.050
 admired-0.049
 flags-0.047
 pump-0.045
 pulp-0.045
Token ride
Token ID6594
Feature activation-2.13e+00

Pos logit contributions
ggie+0.106
liv+0.105
isk+0.104
scar+0.102
omm+0.101
Neg logit contributions
 flags-0.082
 pump-0.081
 banners-0.075
 pulp-0.073
 shells-0.072
Token it
Token ID340
Feature activation-4.23e+00

Pos logit contributions
 operation+0.098
isk+0.097
 ankle+0.095
 homework+0.094
omm+0.093
Neg logit contributions
 flags-0.067
 cheered-0.058
 shells-0.057
Happy-0.057
 banners-0.056
Token every
Token ID790
Feature activation-2.47e+00

Pos logit contributions
omm+0.233
scar+0.221
ggie+0.218
liv+0.217
 x+0.213
Neg logit contributions
 hills-0.084
 flags-0.081
als-0.079
 valleys-0.077
 pump-0.073
Token day
Token ID1110
Feature activation-1.21e+00

Pos logit contributions
isk+0.125
 homework+0.118
 Maxwell+0.117
 Rat+0.114
liv+0.114
Neg logit contributions
 sunlight-0.067
 pump-0.063
 flags-0.059
 pulp-0.059
 ge-0.059
Token and
Token ID290
Feature activation-1.86e+00

Pos logit contributions
 homework+0.053
 operation+0.050
isk+0.049
ggie+0.048
 x+0.048
Neg logit contributions
 pulp-0.041
 flags-0.038
 pump-0.038
 fishes-0.038
oms-0.037
Token feel
Token ID1254
Feature activation-4.04e-01

Pos logit contributions
ggie+0.077
omm+0.073
isk+0.073
 x+0.072
 homework+0.071
Neg logit contributions
 flags-0.064
 pulp-0.062
 cheered-0.062
 pump-0.061
 shells-0.061
Token the
Token ID262
Feature activation+1.72e-01

Pos logit contributions
 homework+0.015
zag+0.015
ggie+0.014
 operation+0.014
 math+0.014
Neg logit contributions
 pulp-0.017
 fishes-0.016
ment-0.015
oms-0.015
 flags-0.015
Token tried
Token ID3088
Feature activation-2.23e+00

Pos logit contributions
 homework+0.061
ggie+0.059
 math+0.059
 operation+0.058
 x+0.058
Neg logit contributions
 pulp-0.043
 fishes-0.041
 cheered-0.040
 flags-0.040
 pump-0.039
Token his
Token ID465
Feature activation-3.31e+00

Pos logit contributions
isk+0.109
penter+0.103
ggie+0.102
case+0.102
 robber+0.101
Neg logit contributions
 shells-0.059
 flags-0.059
 pump-0.058
 islands-0.054
 fishes-0.053
Token best
Token ID1266
Feature activation-2.64e+00

Pos logit contributions
isk+0.166
liv+0.142
penter+0.136
 Maxwell+0.132
nob+0.130
Neg logit contributions
 flags-0.099
 pump-0.096
 shells-0.093
 ducks-0.092
 fishes-0.090
Token to
Token ID284
Feature activation-2.06e+00

Pos logit contributions
isk+0.146
liv+0.139
 x+0.134
scar+0.132
 Midnight+0.131
Neg logit contributions
 flags-0.052
 shells-0.050
ment-0.050
 pump-0.048
oms-0.046
Token do
Token ID466
Feature activation-2.59e+00

Pos logit contributions
 homework+0.060
isk+0.060
zag+0.057
 Rat+0.056
 Maxwell+0.056
Neg logit contributions
 cheered-0.086
 flags-0.084
 pump-0.083
 fishes-0.082
 pulp-0.082
Token his
Token ID465
Feature activation-4.22e+00

Pos logit contributions
isk+0.112
ibo+0.108
zag+0.108
tr+0.107
 robber+0.104
Neg logit contributions
 flags-0.076
 shells-0.073
oms-0.072
ment-0.070
 cheered-0.069
Token best
Token ID1266
Feature activation-2.69e+00

Pos logit contributions
isk+0.204
liv+0.170
 Maxwell+0.164
 Midnight+0.163
scar+0.162
Neg logit contributions
 flags-0.130
 pump-0.119
 waves-0.113
 shells-0.112
 balloons-0.109
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
isk+0.165
liv+0.155
 Rat+0.147
 x+0.146
scar+0.145
Neg logit contributions
 flags-0.044
 pump-0.041
 shells-0.039
ment-0.037
 sailors-0.033
Token He
Token ID679
Feature activation-2.13e+00

Pos logit contributions
 homework+0.023
ggie+0.022
 operation+0.021
zag+0.021
 not+0.021
Neg logit contributions
 pulp-0.022
 fishes-0.020
oms-0.019
ment-0.019
 pump-0.019
Token worked
Token ID3111
Feature activation-2.82e+00

Pos logit contributions
 homework+0.099
isk+0.097
 operation+0.096
 x+0.096
 math+0.096
Neg logit contributions
 cheered-0.061
 pulp-0.056
 fishes-0.056
 ducks-0.054
 shells-0.053
Token hard
Token ID1327
Feature activation-2.82e+00

Pos logit contributions
isk+0.133
 operation+0.124
 robber+0.124
ggie+0.121
 exam+0.121
Neg logit contributions
 flags-0.073
oms-0.071
 shells-0.071
 cheered-0.068
led-0.068
Token made
Token ID925
Feature activation-1.74e+00

Pos logit contributions
 homework+0.112
 x+0.112
 operation+0.104
 math+0.103
ggie+0.102
Neg logit contributions
 pulp-0.070
 cheered-0.068
 fishes-0.068
 pump-0.067
 ge-0.067
Token sure
Token ID1654
Feature activation-9.41e-01

Pos logit contributions
zag+0.075
isk+0.071
belt+0.071
tr+0.070
 homework+0.067
Neg logit contributions
 fishes-0.061
 pulp-0.061
 ducks-0.059
 flags-0.058
 shells-0.057
Token to
Token ID284
Feature activation-1.77e+00

Pos logit contributions
 homework+0.039
zag+0.039
ggie+0.038
 x+0.038
 operation+0.037
Neg logit contributions
 pulp-0.035
 shells-0.030
 fishes-0.030
 pump-0.030
 flags-0.030
Token bounce
Token ID20110
Feature activation-2.47e+00

Pos logit contributions
 homework+0.071
 x+0.066
 Maxwell+0.065
 math+0.064
ggie+0.064
Neg logit contributions
 pulp-0.060
 shells-0.060
 pump-0.059
 ducks-0.057
 ge-0.057
Token the
Token ID262
Feature activation-9.17e-01

Pos logit contributions
ggie+0.110
zag+0.109
omm+0.108
isk+0.107
 x+0.105
Neg logit contributions
 shells-0.081
 fishes-0.077
 flags-0.077
 them-0.073
 cheered-0.070
Token football
Token ID4346
Feature activation-4.21e+00

Pos logit contributions
 homework+0.024
ggie+0.023
isk+0.023
zag+0.022
 x+0.022
Neg logit contributions
 pulp-0.050
 fishes-0.048
 flags-0.045
 ducks-0.045
 shells-0.045
Token gently
Token ID15165
Feature activation-2.14e+00

Pos logit contributions
 x+0.215
innie+0.204
ggie+0.202
liv+0.200
 Maxwell+0.200
Neg logit contributions
als-0.112
 shells-0.110
 pulp-0.100
 seeds-0.098
 fishes-0.097
Token so
Token ID523
Feature activation-1.22e+00

Pos logit contributions
 homework+0.099
ggie+0.098
 x+0.096
 operation+0.095
 Maxwell+0.093
Neg logit contributions
 pulp-0.060
 shells-0.059
oms-0.057
als-0.057
 fishes-0.056
Token that
Token ID326
Feature activation-1.41e+00

Pos logit contributions
 homework+0.050
zag+0.047
 x+0.047
isk+0.046
 operation+0.046
Neg logit contributions
 pulp-0.047
ment-0.044
 fishes-0.042
oms-0.041
 shells-0.041
Token it
Token ID340
Feature activation-2.07e+00

Pos logit contributions
zag+0.057
isk+0.056
 homework+0.055
ggie+0.055
 x+0.052
Neg logit contributions
 pulp-0.054
 fishes-0.049
 shells-0.049
 pump-0.048
 islands-0.047
Token wouldn
Token ID3636
Feature activation-2.49e+00

Pos logit contributions
 homework+0.082
ggie+0.080
 math+0.079
 x+0.078
 Maxwell+0.075
Neg logit contributions
 pulp-0.079
als-0.077
oms-0.074
 fishes-0.074
 shells-0.072
Token his
Token ID465
Feature activation-1.07e+00

Pos logit contributions
 pulp+0.025
oms+0.022
 fishes+0.021
ment+0.021
als+0.021
Neg logit contributions
 homework-0.018
 math-0.016
 operation-0.016
 not-0.016
ggie-0.016
Token trumpet
Token ID40334
Feature activation-2.04e+00

Pos logit contributions
isk+0.041
zag+0.041
 homework+0.041
ggie+0.041
 Maxwell+0.039
Neg logit contributions
 pulp-0.040
 flags-0.037
 fishes-0.037
 ducks-0.036
 shells-0.036
Token.
Token ID13
Feature activation-6.54e-01

Pos logit contributions
 Maxwell+0.111
 homework+0.110
 x+0.110
isk+0.109
ggie+0.107
Neg logit contributions
als-0.035
 flags-0.035
 pulp-0.033
 fishes-0.033
oms-0.033
Token He
Token ID679
Feature activation-1.47e+00

Pos logit contributions
 homework+0.023
zag+0.022
ggie+0.021
 not+0.020
 operation+0.020
Neg logit contributions
 pulp-0.029
 fishes-0.027
oms-0.026
ment-0.026
 flags-0.026
Token plays
Token ID5341
Feature activation-2.27e+00

Pos logit contributions
 homework+0.065
ggie+0.064
 operation+0.064
 x+0.064
zag+0.063
Neg logit contributions
 pulp-0.047
 flags-0.045
 shells-0.044
 floats-0.043
 cheered-0.043
Token it
Token ID340
Feature activation-4.18e+00

Pos logit contributions
isk+0.107
 homework+0.100
zag+0.099
scar+0.098
omm+0.098
Neg logit contributions
 flags-0.071
 cheered-0.067
 ducks-0.063
 banners-0.063
 fishes-0.063
Token every
Token ID790
Feature activation-2.23e+00

Pos logit contributions
scar+0.225
omm+0.216
liv+0.215
 Maxwell+0.210
 Cop+0.209
Neg logit contributions
 flags-0.086
.-0.082
 shells-0.080
 cheered-0.080
 together-0.080
Token day
Token ID1110
Feature activation-5.62e-01

Pos logit contributions
isk+0.109
 Maxwell+0.108
 Rat+0.103
 homework+0.101
 Label+0.099
Neg logit contributions
 sunlight-0.059
 pump-0.058
 fishes-0.056
 ducks-0.055
Happy-0.054
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
 homework+0.028
 operation+0.026
ggie+0.026
zag+0.026
 x+0.026
Neg logit contributions
 pulp-0.017
 fishes-0.015
oms-0.014
 flags-0.014
als-0.014
Token He
Token ID679
Feature activation-1.91e+00

Pos logit contributions
 homework+0.005
ggie+0.005
zag+0.005
 operation+0.005
 not+0.005
Neg logit contributions
 pulp-0.009
 fishes-0.008
oms-0.008
ment-0.008
 flags-0.008
Token makes
Token ID1838
Feature activation-1.53e+00

Pos logit contributions
ggie+0.083
 x+0.080
 operation+0.080
isk+0.079
 homework+0.079
Neg logit contributions
 floats-0.061
 pulp-0.061
 shells-0.061
 flags-0.061
 cheered-0.059
Token
Token ID220
Feature activation-1.05e-01

Pos logit contributions
 pulp+0.004
 fishes+0.003
oms+0.003
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.004
ggie-0.003
 operation-0.003
 math-0.003
zag-0.003
Token
Token ID198
Feature activation+2.36e-01

Pos logit contributions
 homework+0.005
 operation+0.004
ggie+0.004
 math+0.004
zag+0.004
Neg logit contributions
 pulp-0.004
oms-0.003
 fishes-0.003
ment-0.003
 flags-0.003
Token
Token ID198
Feature activation-1.97e-01

Pos logit contributions
 pulp+0.010
 fishes+0.008
oms+0.008
ment+0.008
 flags+0.008
Neg logit contributions
 homework-0.010
ggie-0.009
 math-0.009
 operation-0.009
 not-0.009
TokenTim
Token ID14967
Feature activation-1.59e+00

Pos logit contributions
 homework+0.008
 operation+0.007
ggie+0.007
 math+0.007
zag+0.007
Neg logit contributions
 pulp-0.009
 fishes-0.007
 ge-0.007
 flags-0.007
oms-0.007
Tokenmy
Token ID1820
Feature activation-1.15e+00

Pos logit contributions
 homework+0.069
 math+0.065
 robber+0.063
ggie+0.063
 operation+0.062
Neg logit contributions
 pulp-0.054
 cheered-0.051
ies-0.049
ins-0.048
 pump-0.047
Token didn
Token ID1422
Feature activation-4.17e+00

Pos logit contributions
 homework+0.057
 operation+0.053
 math+0.053
 exam+0.052
 x+0.052
Neg logit contributions
 pulp-0.035
 cheered-0.034
 fishes-0.031
 ge-0.031
 ducks-0.031
Token't
Token ID470
Feature activation+7.95e-02

Pos logit contributions
isk+0.161
pert+0.158
 mask+0.155
ci+0.154
 rider+0.154
Neg logit contributions
oms-0.134
Happy-0.123
izz-0.118
They-0.111
l-0.111
Token have
Token ID423
Feature activation-7.68e-01

Pos logit contributions
 pulp+0.004
 fishes+0.003
oms+0.003
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.003
ggie-0.003
 math-0.003
 operation-0.003
zag-0.003
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
 homework+0.027
zag+0.026
ggie+0.025
 operation+0.025
 Maxwell+0.024
Neg logit contributions
 pulp-0.034
 flags-0.030
 ge-0.030
 fishes-0.029
 shells-0.029
Token real
Token ID1103
Feature activation-8.64e-01

Pos logit contributions
liv+0.056
isk+0.055
ggie+0.053
scar+0.053
 homework+0.051
Neg logit contributions
 pulp-0.075
 pump-0.065
 flags-0.064
oms-0.062
 ge-0.062
Token sword
Token ID8429
Feature activation-1.98e+00

Pos logit contributions
 homework+0.025
 Maxwell+0.024
isk+0.024
ggie+0.024
liv+0.023
Neg logit contributions
 pulp-0.043
 flags-0.038
 fishes-0.038
ment-0.038
oms-0.037
Token muscles
Token ID12749
Feature activation+4.92e-01

Pos logit contributions
zag+0.055
 Maxwell+0.051
isk+0.050
ousel+0.050
 Loop+0.049
Neg logit contributions
 pulp-0.044
 ducks-0.037
 fishes-0.036
 faces-0.036
 shells-0.036
Token felt
Token ID2936
Feature activation-8.91e-01

Pos logit contributions
 pulp+0.017
 fishes+0.014
oms+0.014
ment+0.014
 banners+0.014
Neg logit contributions
 homework-0.023
 not-0.022
ggie-0.022
 math-0.021
 operation-0.021
Token heavy
Token ID4334
Feature activation-8.39e-01

Pos logit contributions
zag+0.034
 homework+0.033
ggie+0.032
 Maxwell+0.031
 operation+0.030
Neg logit contributions
 pulp-0.038
 fishes-0.034
oms-0.032
 shells-0.032
 sunlight-0.032
Token.
Token ID13
Feature activation-1.80e+00

Pos logit contributions
 homework+0.038
ggie+0.038
zag+0.037
 Maxwell+0.036
 operation+0.036
Neg logit contributions
 pulp-0.028
oms-0.024
 fishes-0.023
 grains-0.023
ment-0.023
Token He
Token ID679
Feature activation-1.28e+00

Pos logit contributions
ggie+0.075
zag+0.072
 homework+0.071
 Maxwell+0.066
 rider+0.065
Neg logit contributions
 pulp-0.069
 sunlight-0.060
ment-0.058
 faces-0.058
 fishes-0.055
Token didn
Token ID1422
Feature activation-4.17e+00

Pos logit contributions
 homework+0.065
zag+0.064
 operation+0.063
 Maxwell+0.062
 math+0.062
Neg logit contributions
 pulp-0.039
 fishes-0.032
 ge-0.030
 cheered-0.030
 shells-0.029
Token't
Token ID470
Feature activation-2.04e-01

Pos logit contributions
isk+0.153
 rider+0.148
 mask+0.143
 dictionary+0.143
 robber+0.142
Neg logit contributions
oms-0.151
izz-0.135
Happy-0.133
ades-0.125
qu-0.125
Token like
Token ID588
Feature activation-1.68e+00

Pos logit contributions
 homework+0.008
ggie+0.008
zag+0.008
 math+0.008
 operation+0.008
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
 ge-0.007
ment-0.007
Token the
Token ID262
Feature activation-8.55e-01

Pos logit contributions
zag+0.075
ggie+0.067
ousel+0.066
isk+0.066
 homework+0.065
Neg logit contributions
 pulp-0.066
 fishes-0.058
 pump-0.057
 shells-0.055
 ducks-0.054
Token feeling
Token ID4203
Feature activation-7.04e-01

Pos logit contributions
zag+0.031
ggie+0.030
 homework+0.029
isk+0.029
ousel+0.029
Neg logit contributions
 pulp-0.038
 fishes-0.034
 grains-0.032
 pump-0.032
 shells-0.031
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 homework+0.033
zag+0.031
ggie+0.030
 operation+0.030
 Maxwell+0.030
Neg logit contributions
 pulp-0.024
 fishes-0.021
oms-0.021
ment-0.020
 ge-0.020
TokenTim
Token ID14967
Feature activation-1.39e+00

Pos logit contributions
 homework+0.023
 operation+0.021
 x+0.020
 math+0.020
ggie+0.020
Neg logit contributions
 pulp-0.020
 fishes-0.018
Happy-0.017
 ge-0.017
 flags-0.017
Tokenmy
Token ID1820
Feature activation-8.11e-01

Pos logit contributions
 homework+0.058
ggie+0.058
 math+0.055
 robber+0.055
 Label+0.054
Neg logit contributions
 pulp-0.048
ins-0.044
ies-0.043
 pump-0.043
 cheered-0.042
Token practiced
Token ID19893
Feature activation-3.12e+00

Pos logit contributions
 homework+0.034
 operation+0.033
ggie+0.032
 x+0.032
 math+0.032
Neg logit contributions
 pulp-0.031
 fishes-0.027
 flags-0.026
 cheered-0.026
 ge-0.026
Token playing
Token ID2712
Feature activation-3.21e+00

Pos logit contributions
liv+0.140
case+0.134
ggie+0.132
omm+0.131
isk+0.129
Neg logit contributions
 flags-0.089
 pump-0.085
 them-0.081
 shells-0.078
 banners-0.078
Token the
Token ID262
Feature activation-2.36e+00

Pos logit contributions
omm+0.155
scar+0.153
liv+0.153
 fault+0.152
isk+0.150
Neg logit contributions
 flags-0.088
 cheered-0.081
Happy-0.079
 banners-0.078
 them-0.077
Token violin
Token ID38283
Feature activation-4.16e+00

Pos logit contributions
isk+0.068
ggie+0.061
 not+0.056
omm+0.056
 homework+0.055
Neg logit contributions
 flags-0.115
 banners-0.113
 floats-0.110
Happy-0.110
 fishes-0.105
Token every
Token ID790
Feature activation-2.32e+00

Pos logit contributions
liv+0.203
isk+0.200
scar+0.199
ggie+0.193
omm+0.191
Neg logit contributions
als-0.101
 cheered-0.088
 flags-0.087
oms-0.087
 together-0.084
Token day
Token ID1110
Feature activation-1.22e+00

Pos logit contributions
isk+0.122
 Maxwell+0.114
 not+0.110
 homework+0.110
 Rat+0.109
Neg logit contributions
 sunlight-0.058
Happy-0.051
 pump-0.050
 fishes-0.049
 water-0.047
Token and
Token ID290
Feature activation-1.47e+00

Pos logit contributions
 homework+0.059
zag+0.056
 operation+0.055
ggie+0.054
isk+0.054
Neg logit contributions
 pulp-0.035
als-0.032
 fishes-0.031
 flags-0.031
 pump-0.031
Token soon
Token ID2582
Feature activation+5.89e-01

Pos logit contributions
ggie+0.058
 x+0.053
 homework+0.053
 operation+0.052
zag+0.052
Neg logit contributions
 pulp-0.056
 cheered-0.054
 flags-0.054
 shells-0.052
 pump-0.052
Token became
Token ID2627
Feature activation-9.46e-01

Pos logit contributions
 pulp+0.032
oms+0.028
 fishes+0.028
ment+0.027
 banners+0.027
Neg logit contributions
 homework-0.017
 math-0.015
 not-0.015
ggie-0.015
 operation-0.015
Token much
Token ID881
Feature activation-8.35e-02

Pos logit contributions
 homework+0.051
 Maxwell+0.051
 operation+0.051
zag+0.050
 x+0.050
Neg logit contributions
 pulp-0.037
ment-0.033
Happy-0.032
 fishes-0.032
lan-0.032
Token.
Token ID13
Feature activation-7.81e-01

Pos logit contributions
 homework+0.004
ggie+0.004
zag+0.004
 operation+0.004
 math+0.004
Neg logit contributions
 pulp-0.002
oms-0.002
 fishes-0.002
ment-0.002
 flags-0.002
Token He
Token ID679
Feature activation-1.44e+00

Pos logit contributions
 homework+0.029
ggie+0.027
zag+0.027
 operation+0.026
 exam+0.025
Neg logit contributions
 pulp-0.034
 fishes-0.031
 pump-0.030
 sunlight-0.029
ment-0.029
Token would
Token ID561
Feature activation-1.64e+00

Pos logit contributions
 homework+0.070
 math+0.070
 operation+0.067
 Maxwell+0.067
zag+0.067
Neg logit contributions
 pulp-0.040
 cheered-0.036
 ge-0.035
 ducks-0.035
 admired-0.034
Token ride
Token ID6594
Feature activation-2.11e+00

Pos logit contributions
ggie+0.057
zag+0.057
 homework+0.056
 Maxwell+0.055
isk+0.054
Neg logit contributions
 pulp-0.066
 pump-0.063
 flags-0.063
 banners-0.060
 cheered-0.058
Token it
Token ID340
Feature activation-4.16e+00

Pos logit contributions
isk+0.111
 operation+0.108
 homework+0.105
zag+0.104
omm+0.103
Neg logit contributions
 flags-0.054
 cheered-0.047
Happy-0.046
 shells-0.045
 pulp-0.044
Token around
Token ID1088
Feature activation-2.04e+00

Pos logit contributions
omm+0.217
scar+0.213
ggie+0.209
liv+0.207
€+0.204
Neg logit contributions
 flags-0.085
als-0.084
oms-0.083
 hills-0.082
 valleys-0.082
Token town
Token ID3240
Feature activation-7.51e-01

Pos logit contributions
zag+0.106
liv+0.105
ggie+0.104
 homework+0.101
scar+0.100
Neg logit contributions
 islands-0.051
 flags-0.050
 whales-0.049
 pump-0.048
 valleys-0.047
Token and
Token ID290
Feature activation-9.85e-01

Pos logit contributions
 homework+0.029
 operation+0.028
ggie+0.028
 x+0.028
 Maxwell+0.027
Neg logit contributions
 pulp-0.030
 flags-0.028
 fishes-0.027
 islands-0.027
 pump-0.027
Token people
Token ID661
Feature activation+1.06e+00

Pos logit contributions
ggie+0.039
 homework+0.038
zag+0.037
 operation+0.037
 x+0.036
Neg logit contributions
 pulp-0.038
 flags-0.035
 banners-0.033
 pump-0.033
 shells-0.033
Token would
Token ID561
Feature activation-7.81e-01

Pos logit contributions
 pulp+0.051
ment+0.044
 fishes+0.044
oms+0.044
 grains+0.042
Neg logit contributions
 homework-0.037
 not-0.036
zag-0.034
ggie-0.034
 skate-0.034
Token the
Token ID262
Feature activation-1.56e+00

Pos logit contributions
isk+0.069
zag+0.069
 homework+0.064
chy+0.063
 Maxwell+0.063
Neg logit contributions
Happy-0.055
 flags-0.053
 banners-0.051
 shells-0.050
 fishes-0.050
Token trumpet
Token ID40334
Feature activation-2.64e+00

Pos logit contributions
isk+0.048
 homework+0.046
 Maxwell+0.044
ggie+0.042
 operation+0.042
Neg logit contributions
 flags-0.071
 fishes-0.069
 banners-0.069
 ducks-0.068
 pulp-0.067
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
isk+0.149
scar+0.147
liv+0.143
 Maxwell+0.141
omm+0.137
Neg logit contributions
 flags-0.045
oms-0.045
als-0.044
Happy-0.040
 pump-0.038
Token But
Token ID887
Feature activation-2.74e+00

Pos logit contributions
ggie+0.053
 homework+0.052
zag+0.047
 robber+0.047
 operation+0.046
Neg logit contributions
 pulp-0.047
 pump-0.046
 fishes-0.045
 sunlight-0.044
 flags-0.044
Token he
Token ID339
Feature activation-2.78e+00

Pos logit contributions
isk+0.131
ggie+0.126
zag+0.121
 robber+0.120
case+0.118
Neg logit contributions
 shells-0.075
 pump-0.072
 flags-0.067
 buckets-0.063
 boats-0.062
Token didn
Token ID1422
Feature activation-4.14e+00

Pos logit contributions
 math+0.134
 Maxwell+0.131
igsaw+0.130
 robber+0.130
 maths+0.130
Neg logit contributions
 saw-0.066
 ge-0.064
 admired-0.064
 said-0.064
 asked-0.059
Token't
Token ID470
Feature activation-8.72e-01

Pos logit contributions
isk+0.167
pert+0.160
 robber+0.154
 rider+0.152
 mask+0.149
Neg logit contributions
oms-0.142
Happy-0.124
izz-0.121
ades-0.114
qu-0.107
Token have
Token ID423
Feature activation-9.21e-01

Pos logit contributions
 homework+0.039
ggie+0.038
 math+0.037
zag+0.037
 Maxwell+0.036
Neg logit contributions
 pulp-0.027
 pump-0.024
oms-0.023
 ge-0.023
ment-0.023
Token a
Token ID257
Feature activation-1.70e+00

Pos logit contributions
zag+0.034
isk+0.032
 homework+0.032
ggie+0.031
 operation+0.031
Neg logit contributions
 pulp-0.038
 flags-0.036
 shells-0.035
 fishes-0.034
 ge-0.034
Token trumpet
Token ID40334
Feature activation-2.88e+00

Pos logit contributions
isk+0.069
 homework+0.058
ggie+0.058
 Maxwell+0.056
 Rat+0.055
Neg logit contributions
 pulp-0.065
 flags-0.064
oms-0.062
 pump-0.061
 banners-0.060
Token of
Token ID286
Feature activation-5.64e-01

Pos logit contributions
isk+0.144
liv+0.137
scar+0.135
innie+0.135
 Maxwell+0.134
Neg logit contributions
als-0.067
oms-0.061
 flags-0.058
ed-0.055
ment-0.054
Tokenmy
Token ID1820
Feature activation-2.09e+00

Pos logit contributions
ggie+0.123
 homework+0.122
 IT+0.118
 math+0.117
 Label+0.115
Neg logit contributions
ies-0.054
 sunlight-0.049
ins-0.048
 pulp-0.047
 admired-0.045
Token always
Token ID1464
Feature activation+1.09e-01

Pos logit contributions
 homework+0.097
 math+0.095
 x+0.093
 robber+0.092
ggie+0.090
Neg logit contributions
 pulp-0.061
 cheered-0.060
 pump-0.058
 admired-0.056
 ge-0.055
Token wore
Token ID12408
Feature activation-1.15e+00

Pos logit contributions
 pulp+0.004
 fishes+0.004
oms+0.004
ment+0.004
 flags+0.004
Neg logit contributions
 homework-0.005
ggie-0.004
 math-0.004
 operation-0.004
zag-0.004
Token his
Token ID465
Feature activation-1.42e+00

Pos logit contributions
zag+0.034
 homework+0.033
isk+0.032
ggie+0.032
 operation+0.030
Neg logit contributions
 pulp-0.053
 flags-0.052
 shells-0.052
 banners-0.051
ment-0.050
Token yellow
Token ID7872
Feature activation-1.56e+00

Pos logit contributions
isk+0.030
ggie+0.029
zag+0.028
 homework+0.027
 Maxwell+0.026
Neg logit contributions
 pulp-0.077
 flags-0.075
 shells-0.074
oms-0.072
 banners-0.072
Token helmet
Token ID14335
Feature activation-4.11e+00

Pos logit contributions
ggie+0.035
 homework+0.034
isk+0.033
 Maxwell+0.032
 not+0.031
Neg logit contributions
 pulp-0.081
 flags-0.081
 shells-0.079
 pump-0.077
 banners-0.077
Token when
Token ID618
Feature activation-4.52e-01

Pos logit contributions
 x+0.176
ggie+0.174
 homework+0.173
 IT+0.173
omm+0.172
Neg logit contributions
als-0.108
ment-0.089
ed-0.089
 cheered-0.088
.-0.086
Token he
Token ID339
Feature activation-1.67e+00

Pos logit contributions
 homework+0.014
ggie+0.013
zag+0.013
 operation+0.013
 x+0.013
Neg logit contributions
 pulp-0.022
 fishes-0.020
ment-0.019
oms-0.019
 shells-0.019
Token rode
Token ID22075
Feature activation-9.72e-01

Pos logit contributions
 homework+0.079
 math+0.079
ggie+0.076
 Maxwell+0.074
 operation+0.073
Neg logit contributions
oms-0.044
 pulp-0.042
als-0.040
 ge-0.040
 cheered-0.040
Token his
Token ID465
Feature activation-1.40e+00

Pos logit contributions
 homework+0.037
ggie+0.035
 x+0.034
 operation+0.034
zag+0.034
Neg logit contributions
 pulp-0.036
 flags-0.036
ment-0.034
 banners-0.034
 shells-0.034
Token bike
Token ID7161
Feature activation-2.73e+00

Pos logit contributions
ggie+0.040
isk+0.039
 x+0.036
 homework+0.036
 Maxwell+0.035
Neg logit contributions
 flags-0.067
 pulp-0.066
 shells-0.064
 pump-0.062
oms-0.062
Token the
Token ID262
Feature activation-2.41e+00

Pos logit contributions
zag+0.025
 homework+0.025
ggie+0.024
 Maxwell+0.023
 skate+0.023
Neg logit contributions
 pulp-0.034
 flags-0.030
 shells-0.030
 fishes-0.029
 ge-0.029
Token right
Token ID826
Feature activation-1.02e+00

Pos logit contributions
ggie+0.091
isk+0.090
zed+0.088
 Maxwell+0.087
scar+0.082
Neg logit contributions
 pulp-0.095
 pump-0.092
 shells-0.091
 ducks-0.090
 flags-0.088
Token equipment
Token ID5112
Feature activation-1.68e+00

Pos logit contributions
 Maxwell+0.035
 homework+0.035
ggie+0.034
 skate+0.032
 x+0.032
Neg logit contributions
 pulp-0.046
ment-0.042
 shells-0.040
 pump-0.040
 flags-0.039
Token.
Token ID13
Feature activation-3.46e-01

Pos logit contributions
 homework+0.079
ggie+0.079
zag+0.079
 Maxwell+0.078
 operation+0.076
Neg logit contributions
 pulp-0.047
oms-0.043
als-0.042
 shells-0.040
 flags-0.040
Token She
Token ID1375
Feature activation-1.42e+00

Pos logit contributions
 homework+0.013
ggie+0.012
zag+0.012
 operation+0.012
 skate+0.012
Neg logit contributions
 pulp-0.015
als-0.013
oms-0.013
 fishes-0.013
ment-0.013
Token didn
Token ID1422
Feature activation-4.10e+00

Pos logit contributions
 math+0.071
 homework+0.070
zag+0.070
 Maxwell+0.068
 robber+0.067
Neg logit contributions
 pulp-0.038
 cheered-0.030
 ge-0.030
 fishes-0.030
 ducks-0.030
Token't
Token ID470
Feature activation-8.18e-01

Pos logit contributions
isk+0.164
pert+0.151
 rider+0.150
 robber+0.146
 mask+0.145
Neg logit contributions
oms-0.141
Happy-0.124
izz-0.123
They-0.114
ades-0.114
Token like
Token ID588
Feature activation-1.62e+00

Pos logit contributions
 homework+0.037
ggie+0.036
zag+0.036
 math+0.035
 Maxwell+0.034
Neg logit contributions
 pulp-0.026
 pump-0.022
Happy-0.022
 ge-0.022
 flags-0.021
Token this
Token ID428
Feature activation-1.83e+00

Pos logit contributions
zag+0.077
ggie+0.074
 rider+0.068
 homework+0.068
isk+0.067
Neg logit contributions
 pulp-0.052
 shells-0.051
 flags-0.051
 pump-0.049
 whales-0.046
Token,
Token ID11
Feature activation-2.07e+00

Pos logit contributions
ggie+0.083
 homework+0.081
 Maxwell+0.080
 Rat+0.079
zag+0.079
Neg logit contributions
oms-0.049
 pulp-0.047
 pump-0.047
 flags-0.046
 ge-0.046
Token so
Token ID523
Feature activation+4.64e-02

Pos logit contributions
zag+0.085
ggie+0.078
isk+0.075
 homework+0.073
icycle+0.070
Neg logit contributions
 pulp-0.092
 ge-0.082
 shells-0.082
 grains-0.082
 pump-0.081
Token ABC
Token ID9738
Feature activation-1.33e+00

Pos logit contributions
 Maxwell+0.058
ggie+0.053
 Patch+0.053
isk+0.052
 Label+0.052
Neg logit contributions
 flags-0.078
 pulp-0.077
 pump-0.074
 faces-0.073
ment-0.073
Tokens
Token ID82
Feature activation-1.08e+00

Pos logit contributions
 operation+0.067
 homework+0.066
tons+0.066
ggie+0.066
 x+0.066
Neg logit contributions
oms-0.029
 flags-0.028
 pulp-0.027
Happy-0.027
 banners-0.027
Token.
Token ID13
Feature activation+2.63e-01

Pos logit contributions
ggie+0.056
 homework+0.056
 Maxwell+0.055
zag+0.054
 x+0.053
Neg logit contributions
 pulp-0.025
ment-0.022
oms-0.022
 fishes-0.021
 flags-0.021
Token Tim
Token ID5045
Feature activation-2.20e+00

Pos logit contributions
 pulp+0.011
 fishes+0.010
oms+0.010
ment+0.010
 flags+0.009
Neg logit contributions
 homework-0.010
 math-0.009
ggie-0.009
 operation-0.009
 not-0.009
Tokenmy
Token ID1820
Feature activation-2.58e+00

Pos logit contributions
 robber+0.107
 homework+0.105
ggie+0.105
 Label+0.103
 hardly+0.103
Neg logit contributions
ins-0.057
ies-0.054
 pulp-0.054
 cheered-0.047
 pump-0.045
Token didn
Token ID1422
Feature activation-4.09e+00

Pos logit contributions
 robber+0.120
 Patch+0.114
 Maxwell+0.114
 Label+0.112
isk+0.111
Neg logit contributions
 cheered-0.075
 admired-0.073
 pulp-0.068
 gasped-0.062
led-0.061
Token't
Token ID470
Feature activation-9.97e-01

Pos logit contributions
pert+0.149
isk+0.149
 robber+0.147
 mask+0.143
 seeker+0.142
Neg logit contributions
oms-0.147
Happy-0.135
ades-0.133
izz-0.127
Everyone-0.123
Token want
Token ID765
Feature activation-1.38e+00

Pos logit contributions
 homework+0.044
ggie+0.044
zag+0.044
 math+0.042
 operation+0.042
Neg logit contributions
 pulp-0.030
 cheered-0.026
 pump-0.026
Happy-0.026
oms-0.026
Token to
Token ID284
Feature activation-8.28e-01

Pos logit contributions
zag+0.065
isk+0.061
ggie+0.059
 Maxwell+0.059
 homework+0.059
Neg logit contributions
 pulp-0.040
 shells-0.039
 flags-0.039
 fishes-0.038
 whales-0.037
Token study
Token ID2050
Feature activation-1.90e+00

Pos logit contributions
zag+0.026
 homework+0.026
ggie+0.026
 Maxwell+0.025
isk+0.025
Neg logit contributions
 pulp-0.035
 pump-0.032
 flags-0.032
oms-0.031
 banners-0.031
Token yet
Token ID1865
Feature activation-1.90e+00

Pos logit contributions
zag+0.073
isk+0.072
ggie+0.072
scar+0.068
ibo+0.068
Neg logit contributions
 flags-0.066
oms-0.066
 pulp-0.064
ment-0.063
 banners-0.062
Token that
Token ID326
Feature activation-2.42e+00

Pos logit contributions
isk+0.097
zag+0.096
tons+0.093
ousel+0.092
icycle+0.092
Neg logit contributions
ment-0.042
 shells-0.041
lan-0.039
 seeds-0.039
 flags-0.037
Token he
Token ID339
Feature activation-2.55e+00

Pos logit contributions
isk+0.104
zag+0.104
ggie+0.097
liv+0.095
ousel+0.094
Neg logit contributions
 pulp-0.087
 pump-0.083
 shells-0.081
 fishes-0.079
 seeds-0.076
Token should
Token ID815
Feature activation-1.13e+00

Pos logit contributions
 homework+0.127
zag+0.126
ggie+0.126
 operation+0.123
 skate+0.123
Neg logit contributions
led-0.052
 cheered-0.051
als-0.050
 seeds-0.049
 pulp-0.049
Token complete
Token ID1844
Feature activation-2.80e+00

Pos logit contributions
 homework+0.032
zag+0.032
ggie+0.031
 Maxwell+0.031
 x+0.030
Neg logit contributions
 pulp-0.053
 pump-0.047
 shells-0.047
 fishes-0.046
 flags-0.046
Token his
Token ID465
Feature activation-3.19e+00

Pos logit contributions
zag+0.095
case+0.092
ggie+0.087
 rider+0.086
ousel+0.086
Neg logit contributions
 shells-0.114
 flags-0.111
 islands-0.106
 pulp-0.105
ins-0.105
Token math
Token ID10688
Feature activation-4.09e+00

Pos logit contributions
liv+0.074
isk+0.073
 Maxwell+0.072
zag+0.072
ggie+0.069
Neg logit contributions
 pump-0.145
 faces-0.145
 pulp-0.145
 flags-0.144
 shells-0.142
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
tons+0.108
ibo+0.106
scar+0.106
isk+0.103
 Maxwell+0.101
Neg logit contributions
 flags-0.173
 shells-0.167
oms-0.167
 pump-0.166
ins-0.165
Token not
Token ID407
Feature activation-8.84e-01

Pos logit contributions
zag+0.046
ggie+0.046
isk+0.042
 Maxwell+0.042
 x+0.042
Neg logit contributions
 pulp-0.066
 shells-0.059
 flags-0.059
 pump-0.058
ment-0.058
Token lie
Token ID6486
Feature activation-4.54e-01

Pos logit contributions
ggie+0.030
 homework+0.030
zag+0.030
 Maxwell+0.029
 operation+0.029
Neg logit contributions
 pulp-0.038
 pump-0.034
 fishes-0.032
 flags-0.032
oms-0.032
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 homework+0.018
ggie+0.018
 operation+0.017
zag+0.017
 x+0.017
Neg logit contributions
 pulp-0.018
oms-0.016
ment-0.016
 fishes-0.016
 flags-0.015
Token<|endoftext|>
Token ID50256
Feature activation+1.18e-01

Pos logit contributions
 homework+0.071
ggie+0.070
zag+0.068
 operation+0.066
 robber+0.065
Neg logit contributions
 pulp-0.036
ment-0.031
lan-0.031
Happy-0.031
 sunlight-0.030
Token phone
Token ID3072
Feature activation-1.23e+00

Pos logit contributions
 pulp+0.014
 fishes+0.012
oms+0.012
ment+0.012
als+0.012
Neg logit contributions
 homework-0.010
 operation-0.009
ggie-0.009
 math-0.009
 not-0.009
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
 homework+0.048
ggie+0.048
 skate+0.045
 operation+0.045
 Maxwell+0.045
Neg logit contributions
 pulp-0.046
als-0.043
oms-0.042
 flags-0.041
ment-0.041
Token difficult
Token ID2408
Feature activation-1.04e+00

Pos logit contributions
zag+0.032
 homework+0.030
ggie+0.030
 skate+0.030
 Maxwell+0.030
Neg logit contributions
 pulp-0.057
 shells-0.050
 flags-0.050
 seeds-0.050
 fishes-0.049
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
ggie+0.047
zag+0.046
 Maxwell+0.045
 homework+0.044
isk+0.044
Neg logit contributions
oms-0.030
 pulp-0.030
ment-0.029
 flags-0.028
 shells-0.028
Token She
Token ID1375
Feature activation-7.46e-01

Pos logit contributions
zag+0.070
ggie+0.069
 homework+0.066
 skate+0.065
 rider+0.063
Neg logit contributions
 pulp-0.051
als-0.047
oms-0.043
 sunlight-0.043
 cups-0.041
Token didn
Token ID1422
Feature activation-4.09e+00

Pos logit contributions
 homework+0.035
zag+0.034
ggie+0.034
 math+0.034
 operation+0.033
Neg logit contributions
 pulp-0.025
 fishes-0.020
als-0.019
 cheered-0.019
 flags-0.019
Token't
Token ID470
Feature activation-7.44e-01

Pos logit contributions
isk+0.156
 rider+0.146
 robber+0.143
pert+0.141
 mask+0.140
Neg logit contributions
oms-0.150
Happy-0.135
izz-0.130
qu-0.124
ades-0.123
Token know
Token ID760
Feature activation-1.42e-01

Pos logit contributions
 homework+0.035
ggie+0.033
zag+0.033
 math+0.032
 operation+0.031
Neg logit contributions
 pulp-0.024
 pump-0.020
oms-0.020
 fishes-0.020
Happy-0.019
Token how
Token ID703
Feature activation+6.92e-01

Pos logit contributions
 homework+0.006
zag+0.005
ggie+0.005
 operation+0.005
 math+0.005
Neg logit contributions
 pulp-0.006
oms-0.005
 fishes-0.005
ment-0.005
 shells-0.005
Token to
Token ID284
Feature activation-3.73e-01

Pos logit contributions
 pulp+0.036
 fishes+0.032
als+0.031
oms+0.031
 islands+0.031
Neg logit contributions
 homework-0.022
 not-0.021
 math-0.021
 operation-0.020
ggie-0.019
Token call
Token ID869
Feature activation-9.24e-01

Pos logit contributions
 homework+0.014
 math+0.013
ggie+0.013
zag+0.013
 operation+0.012
Neg logit contributions
 pulp-0.015
 fishes-0.014
 pump-0.013
oms-0.013
 flags-0.013
Token candy
Token ID18550
Feature activation-2.20e-01

Pos logit contributions
 pulp+0.040
oms+0.036
ment+0.034
als+0.034
 fishes+0.033
Neg logit contributions
 homework-0.032
 math-0.030
 not-0.028
 skate-0.028
 x-0.028
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 homework+0.011
 operation+0.011
ggie+0.011
zag+0.011
 math+0.011
Neg logit contributions
 pulp-0.006
 fishes-0.005
oms-0.005
ment-0.005
 flags-0.005
Token But
Token ID887
Feature activation-2.29e+00

Pos logit contributions
ggie+0.047
 homework+0.046
 operation+0.041
zag+0.041
 exam+0.040
Neg logit contributions
 pulp-0.044
 pump-0.041
 fishes-0.041
 ducks-0.040
 flags-0.039
Token Tim
Token ID5045
Feature activation-2.75e+00

Pos logit contributions
ggie+0.100
isk+0.097
case+0.097
 Maxwell+0.095
zag+0.094
Neg logit contributions
 pulp-0.073
 pump-0.070
 shells-0.070
 ge-0.066
 islands-0.063
Tokenmy
Token ID1820
Feature activation-3.29e+00

Pos logit contributions
 IT+0.139
 robber+0.139
 Label+0.133
 math+0.132
 hardly+0.130
Neg logit contributions
ins-0.060
 pump-0.059
ies-0.058
 pulp-0.057
 ge-0.053
Token didn
Token ID1422
Feature activation-4.09e+00

Pos logit contributions
 robber+0.151
 math+0.149
isk+0.148
 maths+0.148
 Maxwell+0.147
Neg logit contributions
 ge-0.086
 pump-0.082
 pulp-0.080
 ducks-0.080
 were-0.079
Token't
Token ID470
Feature activation-5.34e-01

Pos logit contributions
isk+0.159
 rider+0.158
pert+0.155
 seeker+0.149
 robber+0.148
Neg logit contributions
oms-0.140
Happy-0.124
izz-0.122
ades-0.117
ls-0.111
Token like
Token ID588
Feature activation-1.62e+00

Pos logit contributions
 homework+0.020
ggie+0.020
 math+0.019
zag+0.019
 operation+0.019
Neg logit contributions
 pulp-0.022
 pump-0.019
 fishes-0.019
 ge-0.019
ment-0.018
Token to
Token ID284
Feature activation-1.21e-01

Pos logit contributions
zag+0.079
ggie+0.072
 operation+0.069
ousel+0.068
penter+0.067
Neg logit contributions
 pulp-0.059
 pump-0.052
 shells-0.052
 ge-0.051
 ducks-0.050
Token eat
Token ID4483
Feature activation-5.76e-01

Pos logit contributions
 homework+0.005
zag+0.005
ggie+0.005
 math+0.005
 operation+0.004
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
 ge-0.004
ment-0.004
Token his
Token ID465
Feature activation-1.58e+00

Pos logit contributions
zag+0.021
 homework+0.020
ggie+0.019
 operation+0.019
 Maxwell+0.019
Neg logit contributions
 pulp-0.026
 fishes-0.023
 shells-0.022
 ge-0.022
 flags-0.022
Token the
Token ID262
Feature activation+2.21e+00

Pos logit contributions
 pulp+0.057
oms+0.053
ins+0.050
 islands+0.050
 fishes+0.050
Neg logit contributions
 not-0.035
 homework-0.033
 math-0.033
ggie-0.030
 operation-0.030
Token sky
Token ID6766
Feature activation+1.72e+00

Pos logit contributions
 pulp+0.100
ment+0.096
oms+0.096
als+0.095
ins+0.093
Neg logit contributions
 operation-0.083
 robot-0.079
 x-0.079
 not-0.078
 homework-0.076
Token lit
Token ID6578
Feature activation-4.38e-01

Pos logit contributions
 pulp+0.082
 shells+0.071
 fishes+0.070
ins+0.069
oms+0.069
Neg logit contributions
 not-0.058
zag-0.056
 operation-0.055
ggie-0.053
 homework-0.052
Token up
Token ID510
Feature activation+1.06e+00

Pos logit contributions
 homework+0.012
 robber+0.010
 operation+0.010
ggie+0.010
 skate+0.010
Neg logit contributions
 pulp-0.024
 fishes-0.022
 flags-0.021
ment-0.021
 banners-0.021
Token with
Token ID351
Feature activation+2.53e+00

Pos logit contributions
 pulp+0.060
 fishes+0.051
oms+0.051
 banners+0.050
 ge+0.050
Neg logit contributions
 not-0.028
 homework-0.027
 math-0.027
ggie-0.025
 operation-0.025
Token bright
Token ID6016
Feature activation+3.71e+00

Pos logit contributions
oms+0.142
suits+0.135
 pulp+0.133
ins+0.132
unn+0.130
Neg logit contributions
 not-0.061
 math-0.055
 Max-0.049
 homework-0.049
 x-0.047
Token colors
Token ID7577
Feature activation+2.01e+00

Pos logit contributions
 pulp+0.175
lan+0.169
prise+0.165
use+0.164
suits+0.162
Neg logit contributions
 because-0.103
 robot-0.102
 cartoon-0.101
 math-0.099
 operation-0.098
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
 pulp+0.100
 ge+0.086
 fishes+0.085
 islands+0.083
Happy+0.083
Neg logit contributions
 not-0.062
zag-0.059
 homework-0.059
 math-0.056
 operation-0.055
Token The
Token ID383
Feature activation+2.68e+00

Pos logit contributions
 pulp+0.081
 islands+0.073
ins+0.072
oms+0.072
 ge+0.071
Neg logit contributions
 math-0.047
 not-0.046
 homework-0.043
case-0.041
 operation-0.040
Token fire
Token ID2046
Feature activation+2.24e+00

Pos logit contributions
 pulp+0.123
oms+0.120
ment+0.120
als+0.118
ins+0.116
Neg logit contributions
 operation-0.097
 robot-0.096
 x-0.091
 not-0.090
 homework-0.084
Tokenwork
Token ID1818
Feature activation+1.57e+00

Pos logit contributions
 pulp+0.107
 ge+0.094
oms+0.094
 shells+0.093
 grains+0.090
Neg logit contributions
zag-0.076
 not-0.071
 robot-0.070
 operation-0.064
 homework-0.062
Token were
Token ID547
Feature activation+2.18e+00

Pos logit contributions
 homework+0.015
ggie+0.013
 operation+0.013
 math+0.013
 x+0.013
Neg logit contributions
 pulp-0.014
 fishes-0.012
oms-0.012
 shells-0.011
 ge-0.011
Token apples
Token ID22514
Feature activation+8.10e-01

Pos logit contributions
 pulp+0.110
oms+0.106
ment+0.100
omed+0.091
forts+0.091
Neg logit contributions
 not-0.065
 homework-0.061
 math-0.060
ggie-0.059
 Max-0.053
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 pulp+0.041
oms+0.036
 fishes+0.036
ment+0.036
 banners+0.035
Neg logit contributions
 homework-0.025
 not-0.024
zag-0.022
 math-0.022
ggie-0.022
Token sandwiches
Token ID29777
Feature activation+1.15e+00

Pos logit contributions
 pulp+0.039
oms+0.035
ment+0.034
 fishes+0.032
als+0.032
Neg logit contributions
 homework-0.041
 not-0.040
 math-0.039
 skate-0.038
 robber-0.037
Token and
Token ID290
Feature activation+1.55e+00

Pos logit contributions
 pulp+0.060
oms+0.052
ment+0.052
 fishes+0.052
 banners+0.051
Neg logit contributions
 homework-0.034
 not-0.033
ggie-0.031
 math-0.031
zag-0.030
Token some
Token ID617
Feature activation+3.12e+00

Pos logit contributions
 pulp+0.061
oms+0.058
ment+0.055
als+0.050
 islands+0.050
Neg logit contributions
 homework-0.062
 math-0.060
 not-0.059
ggie-0.057
 robber-0.057
Token juice
Token ID13135
Feature activation+1.98e+00

Pos logit contributions
 pulp+0.130
oms+0.122
ment+0.117
 islands+0.112
 valleys+0.112
Neg logit contributions
 math-0.112
 homework-0.112
 not-0.107
ggie-0.102
 bone-0.102
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
 pulp+0.085
ment+0.073
oms+0.072
 islands+0.072
 whales+0.071
Neg logit contributions
 homework-0.074
 not-0.072
zag-0.070
ggie-0.068
 math-0.068
Token She
Token ID1375
Feature activation-5.90e-01

Pos logit contributions
 homework+0.004
ggie+0.004
zag+0.004
 operation+0.004
 math+0.003
Neg logit contributions
 pulp-0.006
oms-0.005
 fishes-0.005
 flags-0.005
ment-0.005
Token wrapped
Token ID12908
Feature activation-7.87e-02

Pos logit contributions
 homework+0.026
 operation+0.025
 math+0.024
zag+0.024
 x+0.024
Neg logit contributions
 pulp-0.023
 fishes-0.020
 ge-0.019
 shells-0.019
 flags-0.018
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
 homework+0.003
 operation+0.003
ggie+0.003
zag+0.003
 math+0.003
Neg logit contributions
 pulp-0.003
 fishes-0.003
ment-0.003
oms-0.003
als-0.003
Token grabbed
Token ID13176
Feature activation-2.18e-02

Pos logit contributions
 pulp+0.017
oms+0.014
 fishes+0.014
ment+0.014
 banners+0.014
Neg logit contributions
 homework-0.021
ggie-0.020
 not-0.020
 operation-0.020
zag-0.020
Token a
Token ID257
Feature activation+7.69e-01

Pos logit contributions
 homework+0.001
ggie+0.001
zag+0.001
 operation+0.001
 math+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.001
ment-0.001
 shells-0.001
Token few
Token ID1178
Feature activation+2.12e+00

Pos logit contributions
 pulp+0.029
ment+0.025
 fishes+0.025
oms+0.025
als+0.024
Neg logit contributions
 homework-0.033
 math-0.031
 skate-0.031
zag-0.031
 operation-0.031
Token shells
Token ID19679
Feature activation+2.66e-01

Pos logit contributions
 pulp+0.080
owers+0.067
Happy+0.066
lan+0.066
als+0.064
Neg logit contributions
 homework-0.090
 math-0.089
 x-0.084
 skate-0.083
 operation-0.083
Token and
Token ID290
Feature activation+1.67e+00

Pos logit contributions
 pulp+0.013
 fishes+0.012
oms+0.011
ment+0.011
 flags+0.011
Neg logit contributions
 homework-0.009
ggie-0.008
 math-0.008
 operation-0.008
 not-0.008
Token some
Token ID617
Feature activation+3.08e+00

Pos logit contributions
 pulp+0.063
oms+0.059
ment+0.054
ins+0.054
Happy+0.054
Neg logit contributions
 homework-0.067
 not-0.066
 skate-0.063
 math-0.062
 mask-0.059
Token seaw
Token ID32085
Feature activation+2.40e+00

Pos logit contributions
 pulp+0.089
Happy+0.076
ment+0.072
ins+0.067
women+0.065
Neg logit contributions
 math-0.148
 homework-0.148
 not-0.140
 mask-0.138
 skate-0.138
Tokeneed
Token ID2308
Feature activation+1.71e+00

Pos logit contributions
 pulp+0.117
Happy+0.105
 fishes+0.100
 banners+0.099
 valleys+0.097
Neg logit contributions
 skate-0.074
zag-0.071
 homework-0.069
ggie-0.068
 robber-0.066
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
 pulp+0.085
Happy+0.074
 fishes+0.073
 banners+0.071
 ge+0.070
Neg logit contributions
 homework-0.052
 not-0.052
zag-0.051
 math-0.047
 skate-0.047
Token built
Token ID3170
Feature activation+9.10e-01

Pos logit contributions
 pulp+0.058
oms+0.054
ins+0.050
ment+0.050
 fishes+0.050
Neg logit contributions
 homework-0.060
 not-0.060
 skate-0.056
 math-0.056
ggie-0.054
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
 pulp+0.044
oms+0.039
 fishes+0.038
ment+0.038
als+0.037
Neg logit contributions
 homework-0.030
 not-0.028
 math-0.028
ggie-0.027
 operation-0.026
Token fridge
Token ID25772
Feature activation-8.97e-02

Pos logit contributions
 pulp+0.062
ment+0.058
oms+0.056
 fishes+0.055
 islands+0.054
Neg logit contributions
 homework-0.042
 x-0.039
 math-0.039
 operation-0.039
 not-0.037
Token!
Token ID0
Feature activation-9.22e-01

Pos logit contributions
 homework+0.004
ggie+0.004
 operation+0.004
 math+0.004
zag+0.004
Neg logit contributions
 pulp-0.003
oms-0.003
 fishes-0.003
ment-0.003
als-0.003
Token Tom
Token ID4186
Feature activation-7.08e-02

Pos logit contributions
 homework+0.036
zag+0.035
 operation+0.035
ggie+0.033
 skate+0.032
Neg logit contributions
 pulp-0.038
oms-0.033
ment-0.032
als-0.032
 flags-0.032
Token and
Token ID290
Feature activation-3.19e-01

Pos logit contributions
 homework+0.003
 operation+0.002
 math+0.002
ggie+0.002
zag+0.002
Neg logit contributions
 pulp-0.003
 fishes-0.003
oms-0.003
ment-0.003
als-0.003
Token his
Token ID465
Feature activation+3.51e-01

Pos logit contributions
 homework+0.010
zag+0.009
 operation+0.009
ggie+0.009
 x+0.008
Neg logit contributions
 pulp-0.017
ment-0.015
 fishes-0.015
 ge-0.014
 shells-0.014
Token mum
Token ID25682
Feature activation+3.26e+00

Pos logit contributions
 pulp+0.015
oms+0.013
 fishes+0.013
ment+0.013
 banners+0.013
Neg logit contributions
 homework-0.013
ggie-0.012
 not-0.012
 math-0.012
 operation-0.012
Token couldn
Token ID3521
Feature activation-1.87e+00

Pos logit contributions
 pulp+0.157
ment+0.145
ocol+0.142
lan+0.141
oms+0.136
Neg logit contributions
 not-0.119
 just-0.101
 always-0.101
 really-0.100
 never-0.099
Token't
Token ID470
Feature activation+1.76e+00

Pos logit contributions
 homework+0.051
 robber+0.047
 rider+0.045
 mask+0.044
 math+0.043
Neg logit contributions
oms-0.091
 pulp-0.089
Happy-0.087
als-0.083
 fishes-0.082
Token believe
Token ID1975
Feature activation-2.27e-01

Pos logit contributions
 pulp+0.091
ment+0.084
ins+0.083
oms+0.080
lan+0.076
Neg logit contributions
 not-0.058
 operation-0.052
 x-0.050
 skate-0.049
 homework-0.049
Token their
Token ID511
Feature activation+1.35e+00

Pos logit contributions
 homework+0.009
zag+0.008
 operation+0.008
ggie+0.008
 x+0.008
Neg logit contributions
 pulp-0.010
 fishes-0.008
ment-0.008
oms-0.008
 flags-0.008
Token eyes
Token ID2951
Feature activation+1.14e+00

Pos logit contributions
 pulp+0.055
Happy+0.050
 cheered+0.047
 banners+0.047
oms+0.046
Neg logit contributions
 homework-0.057
 not-0.054
 math-0.054
 operation-0.051
 x-0.050
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
 pulp+0.042
 fishes+0.035
oms+0.034
 flags+0.033
 banners+0.033
Neg logit contributions
 homework-0.043
ggie-0.041
 not-0.041
 operation-0.040
 math-0.040
Token The
Token ID383
Feature activation+7.28e-01

Pos logit contributions
 pulp+0.012
 fishes+0.011
oms+0.011
ment+0.010
 flags+0.010
Neg logit contributions
 homework-0.008
ggie-0.007
 math-0.007
 operation-0.007
 not-0.007
Token lady
Token ID10846
Feature activation+4.07e-01

Pos logit contributions
 pulp+0.031
oms+0.029
ment+0.028
 fishes+0.027
als+0.027
Neg logit contributions
 homework-0.028
 operation-0.027
 not-0.026
 math-0.026
 x-0.026
Token gave
Token ID2921
Feature activation+3.85e-01

Pos logit contributions
 pulp+0.018
oms+0.015
 fishes+0.015
ment+0.015
 flags+0.015
Neg logit contributions
 homework-0.016
ggie-0.015
 not-0.014
zag-0.014
 math-0.014
Token him
Token ID683
Feature activation-8.34e-02

Pos logit contributions
 pulp+0.019
oms+0.016
 fishes+0.016
ment+0.016
 flags+0.016
Neg logit contributions
 homework-0.013
ggie-0.012
 math-0.012
 operation-0.012
 not-0.012
Token some
Token ID617
Feature activation+3.09e+00

Pos logit contributions
 homework+0.003
zag+0.003
ggie+0.003
 operation+0.003
 math+0.003
Neg logit contributions
 pulp-0.004
 fishes-0.003
oms-0.003
ment-0.003
 flags-0.003
Token coins
Token ID10796
Feature activation+1.09e+00

Pos logit contributions
 pulp+0.122
 valleys+0.113
oms+0.110
 islands+0.109
 banners+0.104
Neg logit contributions
 math-0.118
 homework-0.114
 operation-0.113
 not-0.113
 x-0.111
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
 pulp+0.057
oms+0.050
 fishes+0.050
 banners+0.049
 ge+0.048
Neg logit contributions
 homework-0.032
 not-0.030
 math-0.028
zag-0.028
ggie-0.028
Token and
Token ID290
Feature activation+3.30e-01

Pos logit contributions
 pulp+0.010
 fishes+0.009
oms+0.009
ment+0.009
 flags+0.008
Neg logit contributions
 homework-0.007
ggie-0.007
 math-0.007
 operation-0.007
 not-0.007
Token he
Token ID339
Feature activation+3.80e-02

Pos logit contributions
 pulp+0.015
oms+0.013
 fishes+0.013
ment+0.013
 flags+0.012
Neg logit contributions
 homework-0.012
ggie-0.011
 math-0.011
 operation-0.011
 not-0.011
Token ran
Token ID4966
Feature activation+3.53e-01

Pos logit contributions
 pulp+0.002
 fishes+0.001
oms+0.001
ment+0.001
 flags+0.001
Neg logit contributions
 homework-0.002
 operation-0.001
 math-0.001
ggie-0.001
zag-0.001
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
 homework+0.000
 operation+0.000
ggie+0.000
 math+0.000
zag+0.000
Neg logit contributions
 pulp-0.000
 fishes-0.000
oms-0.000
ment-0.000
 flags-0.000
TokenThe
Token ID464
Feature activation+2.22e-01

Pos logit contributions
 homework+0.014
 operation+0.013
 x+0.013
 math+0.013
zag+0.013
Neg logit contributions
 pulp-0.013
 fishes-0.011
oms-0.011
Happy-0.011
ment-0.011
Token needle
Token ID17598
Feature activation-6.63e-02

Pos logit contributions
 pulp+0.010
 fishes+0.008
oms+0.008
ment+0.008
 flags+0.008
Neg logit contributions
 homework-0.008
ggie-0.008
 math-0.008
 operation-0.008
 not-0.008
Token and
Token ID290
Feature activation-4.69e-01

Pos logit contributions
 homework+0.003
 math+0.002
ggie+0.002
 operation+0.002
zag+0.002
Neg logit contributions
 pulp-0.003
 fishes-0.002
oms-0.002
ment-0.002
als-0.002
Token the
Token ID262
Feature activation+2.51e-02

Pos logit contributions
 homework+0.018
ggie+0.017
zag+0.016
 operation+0.016
 math+0.016
Neg logit contributions
 pulp-0.021
 fishes-0.018
ment-0.018
 flags-0.017
 shells-0.017
Token butterfly
Token ID35113
Feature activation+2.06e+00

Pos logit contributions
 pulp+0.001
 fishes+0.001
oms+0.001
ment+0.001
 flags+0.001
Neg logit contributions
 homework-0.001
ggie-0.001
zag-0.001
 math-0.001
 not-0.001
Token flew
Token ID13112
Feature activation+5.68e-01

Pos logit contributions
 pulp+0.084
ment+0.071
 fishes+0.069
oms+0.069
lan+0.068
Neg logit contributions
 homework-0.081
 not-0.080
zag-0.080
ggie-0.077
 operation-0.076
Token to
Token ID284
Feature activation-7.59e-02

Pos logit contributions
 pulp+0.027
oms+0.023
 fishes+0.023
ment+0.023
 ge+0.022
Neg logit contributions
 homework-0.020
 not-0.019
ggie-0.018
 math-0.018
zag-0.018
Token the
Token ID262
Feature activation-3.66e-02

Pos logit contributions
 homework+0.003
ggie+0.002
zag+0.002
 operation+0.002
 math+0.002
Neg logit contributions
 pulp-0.004
 fishes-0.003
oms-0.003
ment-0.003
 flags-0.003
Token store
Token ID3650
Feature activation+1.18e+00

Pos logit contributions
 homework+0.001
ggie+0.001
zag+0.001
 math+0.001
 not+0.001
Neg logit contributions
 pulp-0.002
 fishes-0.001
oms-0.001
 flags-0.001
ment-0.001
Token together
Token ID1978
Feature activation+9.03e-01

Pos logit contributions
 pulp+0.053
 fishes+0.044
oms+0.044
ment+0.042
 ge+0.042
Neg logit contributions
 homework-0.045
 not-0.045
zag-0.041
ggie-0.041
 math-0.041
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
 pulp+0.032
 fishes+0.027
ment+0.027
oms+0.027
als+0.027
Neg logit contributions
 homework-0.026
 math-0.025
ggie-0.025
 not-0.025
zag-0.024
TokenSo
Token ID2396
Feature activation+1.16e-01

Pos logit contributions
 pulp+0.016
oms+0.014
 fishes+0.014
ment+0.014
als+0.014
Neg logit contributions
 homework-0.014
ggie-0.013
 math-0.013
 not-0.013
 operation-0.012
Token Jack
Token ID3619
Feature activation-3.80e-01

Pos logit contributions
 pulp+0.005
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.004
ggie-0.004
 operation-0.004
 math-0.004
zag-0.004
Token and
Token ID290
Feature activation-3.11e-01

Pos logit contributions
 homework+0.016
 math+0.014
 operation+0.014
ggie+0.014
 x+0.014
Neg logit contributions
 pulp-0.015
 fishes-0.014
oms-0.013
als-0.013
ment-0.013
Token the
Token ID262
Feature activation+1.84e-01

Pos logit contributions
 homework+0.012
zag+0.011
ggie+0.011
 operation+0.011
 x+0.011
Neg logit contributions
 pulp-0.014
 fishes-0.012
ment-0.012
 flags-0.012
 whales-0.011
Token dragon
Token ID10441
Feature activation+1.50e+00

Pos logit contributions
 pulp+0.009
 fishes+0.008
oms+0.008
ment+0.008
 flags+0.008
Neg logit contributions
 homework-0.006
ggie-0.005
 math-0.005
 operation-0.005
 not-0.005
Token set
Token ID900
Feature activation+7.91e-01

Pos logit contributions
 pulp+0.061
oms+0.048
ment+0.048
 fishes+0.047
 grains+0.045
Neg logit contributions
zag-0.063
 homework-0.062
ggie-0.061
 not-0.060
 operation-0.060
Token off
Token ID572
Feature activation+1.05e+00

Pos logit contributions
 pulp+0.039
oms+0.033
 fishes+0.032
ment+0.032
 ge+0.031
Neg logit contributions
 homework-0.026
ggie-0.025
 not-0.025
zag-0.024
 math-0.024
Token together
Token ID1978
Feature activation+7.21e-01

Pos logit contributions
 pulp+0.053
 fishes+0.044
ment+0.043
ins+0.043
 ge+0.042
Neg logit contributions
 homework-0.035
zag-0.033
 not-0.033
 math-0.032
 operation-0.031
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
 pulp+0.034
 fishes+0.029
oms+0.028
ment+0.028
 ge+0.028
Neg logit contributions
 homework-0.026
 not-0.024
zag-0.024
 math-0.024
ggie-0.024
Token The
Token ID383
Feature activation+2.29e-01

Pos logit contributions
 homework+0.009
ggie+0.008
 operation+0.008
zag+0.008
 x+0.008
Neg logit contributions
 pulp-0.009
 fishes-0.008
oms-0.008
ment-0.007
 flags-0.007
Token the
Token ID262
Feature activation+8.50e-01

Pos logit contributions
 pulp+0.073
oms+0.066
 fishes+0.064
ment+0.064
Happy+0.063
Neg logit contributions
 homework-0.045
 not-0.043
 math-0.042
 x-0.038
 operation-0.037
Token fridge
Token ID25772
Feature activation+6.72e-01

Pos logit contributions
 pulp+0.039
ment+0.036
 fishes+0.034
oms+0.034
 ge+0.033
Neg logit contributions
 homework-0.032
 math-0.029
 operation-0.029
 x-0.029
 not-0.028
Token.
Token ID13
Feature activation+9.82e-01

Pos logit contributions
 pulp+0.030
 fishes+0.026
 ge+0.025
ment+0.025
oms+0.025
Neg logit contributions
 homework-0.026
 not-0.025
 math-0.024
 operation-0.024
zag-0.023
Token It
Token ID632
Feature activation-3.73e-01

Pos logit contributions
 pulp+0.045
oms+0.038
 fishes+0.038
ment+0.038
ins+0.038
Neg logit contributions
 homework-0.036
 math-0.034
 not-0.033
 x-0.032
 operation-0.032
Token is
Token ID318
Feature activation+2.54e-01

Pos logit contributions
 homework+0.013
ggie+0.013
 operation+0.013
zag+0.012
 robber+0.012
Neg logit contributions
 pulp-0.017
oms-0.015
 fishes-0.014
als-0.014
 flags-0.014
Token full
Token ID1336
Feature activation+2.25e+00

Pos logit contributions
 pulp+0.012
 fishes+0.010
oms+0.010
ment+0.010
 flags+0.010
Neg logit contributions
 homework-0.009
ggie-0.008
 math-0.008
 not-0.008
 operation-0.008
Token of
Token ID286
Feature activation+1.89e+00

Pos logit contributions
 pulp+0.109
 fishes+0.086
 valleys+0.086
 ge+0.085
 islands+0.083
Neg logit contributions
 not-0.079
 math-0.078
 homework-0.078
 operation-0.075
 because-0.075
Token food
Token ID2057
Feature activation+1.44e+00

Pos logit contributions
 pulp+0.075
oms+0.070
als+0.068
ment+0.068
omed+0.065
Neg logit contributions
 homework-0.082
 math-0.078
 x-0.075
 not-0.072
 operation-0.070
Token.
Token ID13
Feature activation+8.78e-01

Pos logit contributions
 pulp+0.059
 fishes+0.050
oms+0.048
ment+0.048
 banners+0.047
Neg logit contributions
 homework-0.062
 not-0.058
 math-0.057
ggie-0.055
 operation-0.054
Token They
Token ID1119
Feature activation+1.65e+00

Pos logit contributions
 pulp+0.041
ment+0.035
 fishes+0.035
oms+0.035
 islands+0.034
Neg logit contributions
 homework-0.031
 math-0.029
 not-0.029
 operation-0.028
 x-0.028
Token have
Token ID423
Feature activation+1.31e+00

Pos logit contributions
 pulp+0.077
ment+0.074
 fishes+0.068
lan+0.068
ins+0.067
Neg logit contributions
 not-0.058
 homework-0.058
 skate-0.051
 math-0.051
 x-0.050
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
 homework+0.019
 operation+0.017
 math+0.017
ggie+0.017
 x+0.017
Neg logit contributions
 pulp-0.020
oms-0.018
 fishes-0.017
als-0.017
 flags-0.017
Token looked
Token ID3114
Feature activation+1.32e+00

Pos logit contributions
 pulp+0.054
oms+0.050
ment+0.047
ins+0.045
 fishes+0.045
Neg logit contributions
 homework-0.058
 not-0.056
 skate-0.054
 math-0.053
ggie-0.053
Token out
Token ID503
Feature activation+2.23e+00

Pos logit contributions
 pulp+0.074
ment+0.064
oms+0.064
 islands+0.063
 fishes+0.063
Neg logit contributions
 homework-0.034
 not-0.033
 math-0.032
 operation-0.029
ggie-0.028
Token into
Token ID656
Feature activation+1.53e+00

Pos logit contributions
 pulp+0.132
ins+0.114
owers+0.111
suits+0.111
unn+0.110
Neg logit contributions
 not-0.056
 math-0.053
 because-0.047
 homework-0.044
 operation-0.042
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
 pulp+0.082
oms+0.076
als+0.074
ins+0.074
lan+0.072
Neg logit contributions
 math-0.042
 homework-0.040
 x-0.036
 not-0.036
 operation-0.035
Token distance
Token ID5253
Feature activation+2.50e+00

Pos logit contributions
 pulp+0.045
ment+0.039
 fishes+0.038
oms+0.037
 whales+0.037
Neg logit contributions
 homework-0.050
 math-0.049
 operation-0.048
 x-0.046
 exam-0.046
Token.
Token ID13
Feature activation+7.96e-01

Pos logit contributions
 pulp+0.128
ins+0.103
oms+0.102
Happy+0.100
 fishes+0.100
Neg logit contributions
 not-0.084
 because-0.081
 math-0.076
 homework-0.073
 Max-0.071
Token The
Token ID383
Feature activation+3.93e-01

Pos logit contributions
 pulp+0.040
 fishes+0.035
oms+0.035
ment+0.034
 shells+0.034
Neg logit contributions
 homework-0.024
 math-0.023
 not-0.023
ggie-0.022
 operation-0.022
Token little
Token ID1310
Feature activation+6.73e-01

Pos logit contributions
 pulp+0.018
oms+0.016
 fishes+0.016
ment+0.015
 flags+0.015
Neg logit contributions
 homework-0.014
 operation-0.013
 math-0.013
ggie-0.013
 not-0.013
Token girl
Token ID2576
Feature activation+6.21e-01

Pos logit contributions
 pulp+0.026
oms+0.021
ment+0.021
 fishes+0.021
 islands+0.020
Neg logit contributions
 homework-0.029
 operation-0.028
 math-0.028
 x-0.027
 robot-0.027
Token asked
Token ID1965
Feature activation+1.42e-01

Pos logit contributions
 pulp+0.025
oms+0.021
 fishes+0.021
ment+0.021
 flags+0.020
Neg logit contributions
 homework-0.025
ggie-0.024
 not-0.024
zag-0.024
 math-0.023
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
 pulp+0.067
 fishes+0.054
 banners+0.054
 grains+0.052
 valleys+0.052
Neg logit contributions
 not-0.052
 homework-0.051
 math-0.048
 operation-0.048
 because-0.046
Token You
Token ID921
Feature activation+6.20e-01

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.003
ggie-0.002
 operation-0.002
 math-0.002
zag-0.002
Token are
Token ID389
Feature activation-6.67e-01

Pos logit contributions
 pulp+0.025
oms+0.021
ment+0.021
 fishes+0.021
 banners+0.020
Neg logit contributions
 homework-0.026
 not-0.025
ggie-0.024
 operation-0.024
 math-0.023
Token in
Token ID287
Feature activation+2.42e-01

Pos logit contributions
zag+0.017
 homework+0.016
 skate+0.015
ggie+0.015
 Maxwell+0.015
Neg logit contributions
 pulp-0.036
 fishes-0.032
 ge-0.032
 ducks-0.032
 flags-0.032
Token big
Token ID1263
Feature activation+2.37e+00

Pos logit contributions
 pulp+0.011
 fishes+0.009
oms+0.009
ment+0.009
 flags+0.009
Neg logit contributions
 homework-0.009
ggie-0.008
 math-0.008
 operation-0.008
 not-0.008
Token trouble
Token ID5876
Feature activation+1.34e+00

Pos logit contributions
 pulp+0.128
lan+0.119
als+0.119
Happy+0.117
aff+0.115
Neg logit contributions
 homework-0.067
 operation-0.060
 math-0.057
 x-0.057
 exam-0.057
Token."
Token ID526
Feature activation+5.96e-01

Pos logit contributions
 pulp+0.056
 fishes+0.045
oms+0.044
 shells+0.044
 banners+0.044
Neg logit contributions
 homework-0.058
 not-0.057
 operation-0.053
ggie-0.053
 math-0.052
Token
Token ID198
Feature activation+9.42e-01

Pos logit contributions
 pulp+0.027
 fishes+0.024
oms+0.023
als+0.022
 grains+0.022
Neg logit contributions
 homework-0.022
 not-0.021
 math-0.021
ggie-0.020
 Maxwell-0.020
Token
Token ID198
Feature activation+6.53e-01

Pos logit contributions
 pulp+0.046
 fishes+0.040
als+0.039
 grains+0.039
 banners+0.039
Neg logit contributions
 homework-0.033
 not-0.032
 math-0.031
ggie-0.031
 x-0.030
TokenTom
Token ID13787
Feature activation-1.03e+00

Pos logit contributions
 pulp+0.031
 fishes+0.027
oms+0.026
als+0.026
ment+0.025
Neg logit contributions
 homework-0.024
 not-0.023
 math-0.023
ggie-0.023
 Maxwell-0.022
Token and
Token ID290
Feature activation-4.85e-01

Pos logit contributions
 homework+0.044
ggie+0.043
 operation+0.043
 math+0.040
zag+0.040
Neg logit contributions
 pulp-0.038
oms-0.035
 fishes-0.033
als-0.033
 cheered-0.033
Token so
Token ID523
Feature activation+1.36e+00

Pos logit contributions
 homework+0.018
 math+0.016
 operation+0.016
zag+0.016
ggie+0.015
Neg logit contributions
 pulp-0.031
 fishes-0.028
 ge-0.027
 flags-0.027
 ducks-0.027
Token tired
Token ID10032
Feature activation+8.24e-02

Pos logit contributions
 pulp+0.076
als+0.070
oms+0.066
 islands+0.066
 banners+0.066
Neg logit contributions
 not-0.041
 homework-0.034
ggie-0.032
 math-0.031
 skate-0.030
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 pulp+0.004
 fishes+0.004
oms+0.004
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.003
ggie-0.002
 math-0.002
 operation-0.002
zag-0.002
Token but
Token ID475
Feature activation-1.01e+00

Pos logit contributions
 homework+0.050
zag+0.047
ggie+0.046
 operation+0.045
 x+0.045
Neg logit contributions
 pulp-0.060
 fishes-0.056
 shells-0.054
 ge-0.054
 flags-0.054
Token he
Token ID339
Feature activation-9.88e-01

Pos logit contributions
 homework+0.040
zag+0.037
 x+0.036
ggie+0.036
 operation+0.035
Neg logit contributions
 pulp-0.042
 fishes-0.039
ment-0.038
 shells-0.038
 sunlight-0.037
Token was
Token ID373
Feature activation-5.86e-01

Pos logit contributions
 homework+0.044
 math+0.042
 operation+0.041
 x+0.040
 Maxwell+0.040
Neg logit contributions
 pulp-0.036
 fishes-0.032
 ge-0.032
 cheered-0.032
 shells-0.031
Token also
Token ID635
Feature activation-5.16e-01

Pos logit contributions
 homework+0.019
 math+0.018
zag+0.017
 operation+0.017
 Maxwell+0.017
Neg logit contributions
 pulp-0.028
 fishes-0.025
oms-0.024
 ge-0.024
ment-0.024
Token happy
Token ID3772
Feature activation-2.40e-02

Pos logit contributions
 homework+0.018
zag+0.017
 operation+0.017
 math+0.017
ggie+0.017
Neg logit contributions
 pulp-0.023
 fishes-0.021
oms-0.020
ment-0.020
 ge-0.020
Token.
Token ID13
Feature activation-4.50e-01

Pos logit contributions
 homework+0.001
ggie+0.001
 operation+0.001
zag+0.001
 math+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
ment-0.001
oms-0.001
 flags-0.001
Token He
Token ID679
Feature activation-2.85e-01

Pos logit contributions
 homework+0.016
ggie+0.014
 operation+0.014
zag+0.014
 x+0.014
Neg logit contributions
 pulp-0.021
 fishes-0.019
ment-0.019
oms-0.019
als-0.018
Token gathered
Token ID9272
Feature activation+1.21e+00

Pos logit contributions
 homework+0.013
 math+0.012
 operation+0.012
ggie+0.012
zag+0.012
Neg logit contributions
 pulp-0.010
 fishes-0.009
 ge-0.008
oms-0.008
ment-0.008
Tokenped
Token ID9124
Feature activation+1.37e+00

Pos logit contributions
 pulp+0.056
oms+0.049
 fishes+0.049
 banners+0.048
 islands+0.047
Neg logit contributions
 not-0.059
ggie-0.059
zag-0.059
 homework-0.058
isk-0.057
Token happily
Token ID18177
Feature activation+1.92e+00

Pos logit contributions
 pulp+0.062
oms+0.055
 fishes+0.052
 islands+0.051
 banners+0.050
Neg logit contributions
 not-0.050
 homework-0.049
ggie-0.048
 math-0.047
 robot-0.044
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
 pulp+0.092
 fishes+0.080
oms+0.077
 banners+0.076
 islands+0.075
Neg logit contributions
 homework-0.069
 not-0.068
ggie-0.065
zag-0.065
 rider-0.061
Token It
Token ID632
Feature activation-1.19e-01

Pos logit contributions
 pulp+0.014
 fishes+0.012
oms+0.012
ment+0.012
 banners+0.011
Neg logit contributions
 homework-0.013
 not-0.012
 math-0.012
ggie-0.012
 operation-0.012
Token trusted
Token ID13467
Feature activation+9.35e-02

Pos logit contributions
 homework+0.005
 math+0.005
 operation+0.004
ggie+0.004
zag+0.004
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
 ge-0.004
ment-0.004
Token them
Token ID606
Feature activation-3.48e-01

Pos logit contributions
 pulp+0.002
 fishes+0.002
oms+0.002
ment+0.001
 flags+0.001
Neg logit contributions
 homework-0.006
ggie-0.005
zag-0.005
 operation-0.005
 math-0.005
Token and
Token ID290
Feature activation+8.22e-01

Pos logit contributions
 homework+0.017
 operation+0.016
ggie+0.016
 Maxwell+0.016
 x+0.016
Neg logit contributions
 pulp-0.011
 fishes-0.009
oms-0.009
ment-0.009
als-0.009
Token let
Token ID1309
Feature activation-7.32e-01

Pos logit contributions
 pulp+0.031
oms+0.028
 fishes+0.027
 banners+0.026
ment+0.025
Neg logit contributions
 homework-0.036
 not-0.035
ggie-0.033
 math-0.033
 operation-0.032
Token them
Token ID606
Feature activation+1.74e+00

Pos logit contributions
zag+0.047
 homework+0.046
 math+0.044
ggie+0.044
 Maxwell+0.044
Neg logit contributions
 pulp-0.012
 ge-0.009
 fishes-0.008
 ducks-0.008
 shells-0.008
Token touch
Token ID3638
Feature activation+7.03e-01

Pos logit contributions
 pulp+0.092
oms+0.082
 islands+0.082
 fishes+0.082
 banners+0.081
Neg logit contributions
 not-0.053
 homework-0.050
 math-0.046
 operation-0.046
zag-0.046
Token it
Token ID340
Feature activation+8.53e-03

Pos logit contributions
 pulp+0.025
oms+0.021
 fishes+0.020
ment+0.020
 banners+0.020
Neg logit contributions
 homework-0.033
 not-0.031
 math-0.031
ggie-0.030
 operation-0.030
Token was
Token ID373
Feature activation+4.18e-01

Pos logit contributions
 homework+0.013
 math+0.012
 operation+0.012
ggie+0.012
 x+0.012
Neg logit contributions
 pulp-0.011
 fishes-0.010
oms-0.010
 ge-0.010
 shells-0.010
Token done
Token ID1760
Feature activation-4.04e-01

Pos logit contributions
 pulp+0.020
 fishes+0.017
oms+0.017
ment+0.017
als+0.016
Neg logit contributions
 homework-0.015
 not-0.014
ggie-0.014
 math-0.014
 operation-0.014
Token swimming
Token ID14899
Feature activation-4.71e-01

Pos logit contributions
 homework+0.016
ggie+0.015
zag+0.015
 operation+0.015
 x+0.015
Neg logit contributions
 pulp-0.016
oms-0.015
 fishes-0.014
ment-0.014
 flags-0.014
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
 homework+0.020
ggie+0.018
 x+0.018
zag+0.018
 math+0.018
Neg logit contributions
 pulp-0.018
 fishes-0.017
oms-0.017
ment-0.016
 shells-0.016
Token he
Token ID339
Feature activation-4.67e-01

Pos logit contributions
 homework+0.047
ggie+0.046
zag+0.044
 x+0.043
 operation+0.042
Neg logit contributions
 pulp-0.047
 shells-0.044
 fishes-0.044
 ge-0.042
ment-0.041
Token came
Token ID1625
Feature activation-2.01e-01

Pos logit contributions
 homework+0.021
 math+0.020
 operation+0.020
 x+0.020
ggie+0.020
Neg logit contributions
 pulp-0.017
 fishes-0.015
oms-0.014
 ge-0.014
 shells-0.014
Token out
Token ID503
Feature activation+8.24e-02

Pos logit contributions
 homework+0.008
 operation+0.007
ggie+0.007
 math+0.007
 x+0.007
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
ment-0.007
 flags-0.007
Token of
Token ID286
Feature activation+4.75e-02

Pos logit contributions
 pulp+0.003
oms+0.002
 fishes+0.002
ment+0.002
 flags+0.002
Neg logit contributions
 homework-0.004
ggie-0.004
 operation-0.004
 math-0.004
zag-0.004
Token the
Token ID262
Feature activation+8.82e-01

Pos logit contributions
 pulp+0.002
 fishes+0.002
oms+0.002
ment+0.002
 flags+0.002
Neg logit contributions
 homework-0.002
ggie-0.002
zag-0.001
 operation-0.001
 math-0.001
Token pool
Token ID5933
Feature activation+4.06e-01

Pos logit contributions
 pulp+0.039
ment+0.035
oms+0.032
als+0.032
 fishes+0.031
Neg logit contributions
 homework-0.035
 skate-0.033
 operation-0.033
 math-0.032
 not-0.032
Token feeling
Token ID4203
Feature activation+9.82e-01

Pos logit contributions
 pulp+0.018
 fishes+0.015
oms+0.015
ment+0.015
 banners+0.015
Neg logit contributions
 homework-0.015
ggie-0.014
 math-0.014
 not-0.014
 operation-0.014
Token so
Token ID523
Feature activation+1.15e+00

Pos logit contributions
 homework+0.027
 operation+0.025
 x+0.024
 exam+0.023
 math+0.023
Neg logit contributions
 pulp-0.038
 fishes-0.036
 ge-0.035
 ducks-0.035
 cheered-0.035
Token happy
Token ID3772
Feature activation+6.75e-02

Pos logit contributions
 pulp+0.065
oms+0.058
als+0.057
ment+0.057
 banners+0.057
Neg logit contributions
 not-0.033
ggie-0.027
 homework-0.026
 math-0.025
zag-0.024
Token to
Token ID284
Feature activation+5.18e-01

Pos logit contributions
 pulp+0.003
 fishes+0.003
oms+0.003
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.002
ggie-0.002
 operation-0.002
 math-0.002
zag-0.002
Token see
Token ID766
Feature activation+9.23e-01

Pos logit contributions
 pulp+0.023
 fishes+0.020
oms+0.020
ment+0.020
 banners+0.019
Neg logit contributions
 homework-0.019
 not-0.018
zag-0.018
 operation-0.018
ggie-0.018
Token something
Token ID1223
Feature activation+4.43e-01

Pos logit contributions
 pulp+0.047
oms+0.042
ment+0.041
 banners+0.040
 flags+0.040
Neg logit contributions
 homework-0.028
 not-0.028
 math-0.026
 operation-0.026
ggie-0.025
Token that
Token ID326
Feature activation-3.52e-01

Pos logit contributions
 pulp+0.021
oms+0.018
 fishes+0.018
ment+0.018
 banners+0.017
Neg logit contributions
 homework-0.016
 not-0.014
 math-0.014
ggie-0.014
 operation-0.014
Token looked
Token ID3114
Feature activation+6.09e-01

Pos logit contributions
 homework+0.014
ggie+0.014
zag+0.014
 operation+0.014
 x+0.013
Neg logit contributions
 pulp-0.014
 fishes-0.012
als-0.012
 shells-0.012
oms-0.012
Token just
Token ID655
Feature activation+1.00e+00

Pos logit contributions
 pulp+0.033
oms+0.029
 fishes+0.029
ment+0.029
 banners+0.028
Neg logit contributions
 homework-0.017
 not-0.016
 math-0.015
ggie-0.015
 operation-0.014
Token like
Token ID588
Feature activation+7.06e-01

Pos logit contributions
 pulp+0.057
oms+0.051
 fishes+0.049
 banners+0.049
ment+0.049
Neg logit contributions
 not-0.025
 homework-0.025
 math-0.023
 operation-0.022
ggie-0.021
Token her
Token ID607
Feature activation+2.42e-01

Pos logit contributions
 pulp+0.034
oms+0.029
 fishes+0.029
ment+0.029
 banners+0.028
Neg logit contributions
 homework-0.024
 math-0.022
 not-0.022
ggie-0.022
 operation-0.022
Token.
Token ID13
Feature activation+5.86e-01

Pos logit contributions
 pulp+0.010
 fishes+0.009
oms+0.009
ment+0.009
 flags+0.008
Neg logit contributions
 homework-0.010
ggie-0.009
 math-0.009
 operation-0.009
 not-0.009
Token add
Token ID751
Feature activation+2.18e+00

Pos logit contributions
 pulp+0.038
ment+0.034
oms+0.033
 fishes+0.033
 banners+0.033
Neg logit contributions
 homework-0.025
 not-0.023
 math-0.023
 operation-0.023
 skate-0.022
Token to
Token ID284
Feature activation+8.17e-01

Pos logit contributions
 pulp+0.103
ment+0.084
 ge+0.084
 banners+0.084
oms+0.084
Neg logit contributions
 homework-0.077
 math-0.076
 not-0.073
 operation-0.067
 x-0.067
Token my
Token ID616
Feature activation-8.48e-02

Pos logit contributions
 pulp+0.039
oms+0.035
ment+0.034
als+0.033
 fishes+0.033
Neg logit contributions
 homework-0.028
 math-0.026
 not-0.026
 operation-0.025
 x-0.024
Token collection
Token ID4947
Feature activation-8.90e-02

Pos logit contributions
 homework+0.003
ggie+0.003
zag+0.003
 operation+0.003
 math+0.003
Neg logit contributions
 pulp-0.004
 fishes-0.003
oms-0.003
 shells-0.003
 flags-0.003
Token,"
Token ID553
Feature activation+1.22e-01

Pos logit contributions
 homework+0.005
ggie+0.005
 operation+0.005
 math+0.005
zag+0.004
Neg logit contributions
 pulp-0.002
 fishes-0.002
oms-0.002
ment-0.002
 flags-0.002
Token he
Token ID339
Feature activation-1.63e-01

Pos logit contributions
 pulp+0.005
 fishes+0.005
oms+0.005
ment+0.004
 flags+0.004
Neg logit contributions
 homework-0.005
ggie-0.004
zag-0.004
 operation-0.004
 math-0.004
Token replied
Token ID8712
Feature activation-2.40e-01

Pos logit contributions
 homework+0.008
 math+0.008
 operation+0.008
ggie+0.008
zag+0.008
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
ment-0.004
als-0.004
Token.
Token ID13
Feature activation+6.42e-01

Pos logit contributions
 homework+0.013
zag+0.012
ggie+0.012
 operation+0.012
 math+0.012
Neg logit contributions
 pulp-0.007
ment-0.005
oms-0.005
 fishes-0.005
 flags-0.005
Token
Token ID198
Feature activation+5.02e-01

Pos logit contributions
 pulp+0.029
 fishes+0.025
oms+0.025
 islands+0.024
 flags+0.024
Neg logit contributions
 homework-0.023
 not-0.022
 math-0.022
ggie-0.022
zag-0.022
Token
Token ID198
Feature activation-2.49e-02

Pos logit contributions
 pulp+0.022
 fishes+0.019
oms+0.018
ment+0.018
 banners+0.018
Neg logit contributions
 homework-0.019
ggie-0.018
 not-0.018
 math-0.018
zag-0.018
TokenThe
Token ID464
Feature activation+3.36e-01

Pos logit contributions
 homework+0.001
 operation+0.001
 math+0.001
ggie+0.001
zag+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.001
ment-0.001
 flags-0.001
TokenHey
Token ID10814
Feature activation+2.32e-01

Pos logit contributions
 pulp+0.015
oms+0.013
 fishes+0.013
ment+0.013
 flags+0.013
Neg logit contributions
 homework-0.015
ggie-0.014
 operation-0.014
 math-0.014
 not-0.014
Token,
Token ID11
Feature activation-6.31e-01

Pos logit contributions
 pulp+0.010
 fishes+0.009
oms+0.009
ment+0.008
 flags+0.008
Neg logit contributions
 homework-0.009
ggie-0.008
 operation-0.008
 math-0.008
 not-0.008
Token stop
Token ID2245
Feature activation+5.67e-01

Pos logit contributions
 homework+0.025
zag+0.024
 math+0.022
ggie+0.022
 operation+0.022
Neg logit contributions
 pulp-0.026
 ge-0.023
 fishes-0.023
oms-0.022
 pump-0.022
Token!
Token ID0
Feature activation-8.10e-01

Pos logit contributions
 pulp+0.027
oms+0.024
 fishes+0.024
ment+0.023
 banners+0.023
Neg logit contributions
 homework-0.019
 not-0.018
 math-0.018
ggie-0.018
 operation-0.017
Token That
Token ID1320
Feature activation-1.26e+00

Pos logit contributions
 homework+0.027
zag+0.025
ggie+0.024
 operation+0.024
 math+0.023
Neg logit contributions
 pulp-0.038
oms-0.035
als-0.035
ment-0.034
 fishes-0.034
Token's
Token ID338
Feature activation-2.05e+00

Pos logit contributions
 homework+0.045
 Maxwell+0.043
isk+0.042
zag+0.041
 robber+0.041
Neg logit contributions
 pulp-0.054
 ge-0.051
 ducks-0.048
 pump-0.048
als-0.047
Token my
Token ID616
Feature activation-6.66e-01

Pos logit contributions
zag+0.064
icycle+0.054
 rider+0.052
robe+0.052
case+0.052
Neg logit contributions
 pulp-0.092
 ge-0.089
 frost-0.087
 ducks-0.085
 seeds-0.085
Token boss
Token ID6478
Feature activation-1.92e-02

Pos logit contributions
 homework+0.029
 Maxwell+0.029
zag+0.028
ggie+0.028
 operation+0.027
Neg logit contributions
 pulp-0.023
 ge-0.019
 fishes-0.019
ment-0.019
oms-0.019
Token'
Token ID6
Feature activation-1.03e+00

Pos logit contributions
 homework+0.001
ggie+0.001
 operation+0.001
 math+0.001
 not+0.001
Neg logit contributions
 pulp-0.001
oms-0.001
 fishes-0.001
ment-0.001
als-0.001
Token chocolate
Token ID11311
Feature activation-1.85e+00

Pos logit contributions
 Maxwell+0.033
 homework+0.033
zag+0.031
ggie+0.030
 math+0.030
Neg logit contributions
 pulp-0.047
ment-0.042
 ge-0.041
 fishes-0.040
 pump-0.040
Token!"
Token ID2474
Feature activation+1.07e+00

Pos logit contributions
 Maxwell+0.081
 homework+0.076
 Patch+0.073
 operation+0.073
 x+0.073
Neg logit contributions
 pulp-0.063
als-0.056
ment-0.054
 pump-0.053
 ge-0.053
Token she
Token ID673
Feature activation-4.08e-01

Pos logit contributions
 homework+0.008
zag+0.008
ggie+0.008
 operation+0.007
 x+0.007
Neg logit contributions
 pulp-0.009
 fishes-0.008
ment-0.008
 ge-0.008
 shells-0.008
Token quickly
Token ID2952
Feature activation+1.22e+00

Pos logit contributions
 homework+0.020
 math+0.019
 operation+0.019
 x+0.018
zag+0.018
Neg logit contributions
 pulp-0.013
 fishes-0.011
als-0.011
 ge-0.011
 flags-0.011
Token jumped
Token ID11687
Feature activation-1.22e+00

Pos logit contributions
 pulp+0.055
 fishes+0.047
oms+0.046
ment+0.046
 islands+0.045
Neg logit contributions
 homework-0.045
 skate-0.045
 not-0.044
ggie-0.043
zag-0.042
Token onto
Token ID4291
Feature activation-5.23e-01

Pos logit contributions
 homework+0.048
 operation+0.045
ggie+0.044
isk+0.044
 x+0.043
Neg logit contributions
 pulp-0.048
 fishes-0.045
 flags-0.045
oms-0.045
als-0.045
Token her
Token ID607
Feature activation-1.12e+00

Pos logit contributions
 homework+0.022
zag+0.021
ggie+0.020
 operation+0.020
 skate+0.019
Neg logit contributions
 pulp-0.022
 fishes-0.018
 ge-0.017
 grains-0.017
 flags-0.017
Token bike
Token ID7161
Feature activation-2.44e+00

Pos logit contributions
 homework+0.031
isk+0.031
ggie+0.030
 Maxwell+0.029
 operation+0.029
Neg logit contributions
 pulp-0.057
 flags-0.053
 shells-0.051
 pump-0.051
 banners-0.050
Token and
Token ID290
Feature activation-4.77e-01

Pos logit contributions
 homework+0.143
 operation+0.137
 robber+0.135
ggie+0.134
 x+0.134
Neg logit contributions
als-0.038
 flags-0.031
 cheered-0.030
 pump-0.028
 towels-0.027
Token began
Token ID2540
Feature activation+1.59e-01

Pos logit contributions
 homework+0.025
 operation+0.024
 x+0.024
ggie+0.023
 math+0.023
Neg logit contributions
 pulp-0.014
als-0.011
 fishes-0.011
 flags-0.011
 cheered-0.011
Token to
Token ID284
Feature activation-4.01e-01

Pos logit contributions
 pulp+0.007
 fishes+0.006
oms+0.006
ment+0.006
 flags+0.006
Neg logit contributions
 homework-0.006
ggie-0.005
 operation-0.005
 math-0.005
zag-0.005
Token ride
Token ID6594
Feature activation-6.80e-01

Pos logit contributions
 homework+0.013
ggie+0.012
 operation+0.012
zag+0.012
 math+0.012
Neg logit contributions
 pulp-0.019
 fishes-0.017
 flags-0.017
 banners-0.017
 pump-0.017
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
 homework+0.027
 operation+0.026
 x+0.025
zag+0.025
 robber+0.025
Neg logit contributions
 pulp-0.026
 flags-0.024
ment-0.024
 fishes-0.023
 shells-0.023
Token away
Token ID1497
Feature activation+1.68e+00

Pos logit contributions
 pulp+0.022
oms+0.019
 fishes+0.019
ment+0.019
 flags+0.018
Neg logit contributions
 homework-0.014
 not-0.013
 math-0.013
ggie-0.013
 operation-0.013
Token.
Token ID13
Feature activation+6.06e-02

Pos logit contributions
 pulp+0.080
 fishes+0.066
 islands+0.066
 whales+0.066
ins+0.065
Neg logit contributions
 not-0.064
zag-0.056
 homework-0.056
 math-0.055
 because-0.054
Token Lily
Token ID20037
Feature activation-6.98e-01

Pos logit contributions
 pulp+0.003
oms+0.003
 fishes+0.003
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.002
ggie-0.002
 operation-0.002
zag-0.002
 math-0.002
Token thought
Token ID1807
Feature activation-1.54e+00

Pos logit contributions
 homework+0.031
 operation+0.029
 math+0.029
 x+0.028
 robber+0.028
Neg logit contributions
 pulp-0.026
 fishes-0.023
 cheered-0.023
als-0.023
 ge-0.022
Token it
Token ID340
Feature activation-1.85e+00

Pos logit contributions
zag+0.068
isk+0.067
icycle+0.065
 homework+0.063
 skate+0.063
Neg logit contributions
oms-0.055
 pulp-0.051
 shells-0.050
 pump-0.047
 whales-0.047
Token was
Token ID373
Feature activation-1.96e+00

Pos logit contributions
 homework+0.096
 robber+0.091
 math+0.089
 skate+0.087
 exam+0.085
Neg logit contributions
als-0.051
 cheered-0.048
 pulp-0.047
led-0.043
 fishes-0.043
Token so
Token ID523
Feature activation+5.21e-01

Pos logit contributions
zag+0.087
 skate+0.085
 homework+0.085
belt+0.083
ousel+0.081
Neg logit contributions
 pulp-0.069
 fishes-0.066
 seeds-0.066
 ducks-0.065
 ge-0.061
Token cool
Token ID3608
Feature activation-2.03e-01

Pos logit contributions
 pulp+0.024
 fishes+0.020
oms+0.020
ment+0.020
 banners+0.020
Neg logit contributions
 homework-0.019
 not-0.018
 math-0.018
ggie-0.018
 operation-0.018
Token that
Token ID326
Feature activation-7.17e-01

Pos logit contributions
 homework+0.007
ggie+0.007
 operation+0.007
zag+0.007
 math+0.007
Neg logit contributions
 pulp-0.009
oms-0.008
 fishes-0.008
ment-0.008
als-0.008
Token she
Token ID673
Feature activation-6.95e-01

Pos logit contributions
zag+0.029
 homework+0.029
ggie+0.028
 skate+0.027
 x+0.027
Neg logit contributions
 pulp-0.028
 fishes-0.025
 shells-0.024
als-0.024
 ge-0.024
Token wanted
Token ID2227
Feature activation+1.29e-01

Pos logit contributions
 homework+0.033
 math+0.031
 operation+0.031
 x+0.030
zag+0.030
Neg logit contributions
 pulp-0.024
 fishes-0.021
als-0.020
 ge-0.020
 cheered-0.020
Token were
Token ID547
Feature activation-1.62e+00

Pos logit contributions
 homework+0.022
ggie+0.020
 operation+0.020
 math+0.020
 x+0.020
Neg logit contributions
 pulp-0.016
oms-0.014
 fishes-0.014
ment-0.014
 flags-0.013
Token even
Token ID772
Feature activation-1.14e+00

Pos logit contributions
 homework+0.055
 x+0.053
isk+0.051
zag+0.051
 operation+0.049
Neg logit contributions
 pulp-0.067
 fishes-0.067
 pump-0.065
 ge-0.064
 shells-0.064
Token bigger
Token ID5749
Feature activation-9.76e-01

Pos logit contributions
 homework+0.053
ggie+0.052
zag+0.052
 operation+0.048
 x+0.048
Neg logit contributions
 pulp-0.037
 fishes-0.034
 shells-0.033
 ge-0.032
 flags-0.032
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
 homework+0.042
ggie+0.039
 x+0.039
 operation+0.039
zag+0.038
Neg logit contributions
 pulp-0.035
 fishes-0.033
ment-0.033
oms-0.033
 flags-0.032
Token and
Token ID290
Feature activation-1.56e+00

Pos logit contributions
 homework+0.125
ggie+0.124
zag+0.115
 x+0.113
 operation+0.113
Neg logit contributions
 fishes-0.084
 shells-0.082
 ducks-0.081
 sunlight-0.080
 pulp-0.080
Token he
Token ID339
Feature activation-2.23e+00

Pos logit contributions
ggie+0.063
 homework+0.063
zag+0.059
 x+0.058
 operation+0.056
Neg logit contributions
 pulp-0.061
 fishes-0.059
 shells-0.057
 sunlight-0.054
 pump-0.053
Token knew
Token ID2993
Feature activation-1.25e+00

Pos logit contributions
 x+0.102
 homework+0.102
 operation+0.096
 math+0.094
 exam+0.092
Neg logit contributions
 cheered-0.075
 fishes-0.074
 pulp-0.073
 admired-0.069
als-0.068
Token that
Token ID326
Feature activation-1.67e+00

Pos logit contributions
 homework+0.048
isk+0.047
zag+0.046
 x+0.045
 robber+0.045
Neg logit contributions
 shells-0.047
ment-0.046
 pulp-0.046
 fishes-0.045
lan-0.043
Token he
Token ID339
Feature activation-1.82e+00

Pos logit contributions
zag+0.062
ggie+0.060
 homework+0.060
isk+0.060
 x+0.058
Neg logit contributions
 pulp-0.066
 fishes-0.065
 shells-0.064
 pump-0.062
 whales-0.061
Token could
Token ID714
Feature activation-2.00e+00

Pos logit contributions
 homework+0.093
 math+0.089
 x+0.088
 operation+0.087
zag+0.083
Neg logit contributions
 cheered-0.046
 pulp-0.045
als-0.044
 fishes-0.044
oms-0.040
Token do
Token ID466
Feature activation-2.05e+00

Pos logit contributions
 homework+0.069
 Maxwell+0.069
isk+0.068
 robber+0.068
 x+0.066
Neg logit contributions
 pump-0.073
 fishes-0.071
 shells-0.069
 ducks-0.068
 islands-0.067
Token.
Token ID13
Feature activation-8.14e-01

Pos logit contributions
 pulp+0.025
 fishes+0.020
oms+0.020
 banners+0.019
 flags+0.019
Neg logit contributions
 homework-0.032
 not-0.030
 math-0.030
ggie-0.030
 operation-0.030
Token From
Token ID3574
Feature activation-3.40e-01

Pos logit contributions
 homework+0.032
ggie+0.032
zag+0.032
 operation+0.030
 x+0.030
Neg logit contributions
 pulp-0.033
als-0.029
oms-0.028
 sunlight-0.028
ment-0.028
Token then
Token ID788
Feature activation-1.37e+00

Pos logit contributions
 homework+0.013
zag+0.011
ggie+0.011
 operation+0.011
 math+0.011
Neg logit contributions
 pulp-0.015
 fishes-0.013
ment-0.013
oms-0.013
 shells-0.013
Token on
Token ID319
Feature activation-1.14e+00

Pos logit contributions
 homework+0.065
ggie+0.062
isk+0.061
zag+0.060
 robber+0.060
Neg logit contributions
 pulp-0.040
ment-0.036
oms-0.035
 pump-0.035
Happy-0.035
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
 skate+0.062
 homework+0.062
isk+0.062
zag+0.061
 robber+0.061
Neg logit contributions
 pulp-0.023
oms-0.020
ment-0.020
 pump-0.019
als-0.019
Token Lily
Token ID20037
Feature activation-1.90e+00

Pos logit contributions
zag+0.057
ggie+0.054
icycle+0.052
 x+0.052
ousel+0.052
Neg logit contributions
 pulp-0.039
Happy-0.035
 pump-0.034
als-0.033
 shells-0.033
Token always
Token ID1464
Feature activation+9.33e-01

Pos logit contributions
 homework+0.078
 x+0.075
 robber+0.073
 operation+0.073
ggie+0.073
Neg logit contributions
 pulp-0.075
 pump-0.067
 seeds-0.065
 fishes-0.064
als-0.063
Token cut
Token ID2005
Feature activation-1.11e+00

Pos logit contributions
 pulp+0.040
oms+0.037
 fishes+0.036
 flags+0.035
 banners+0.035
Neg logit contributions
 homework-0.036
 not-0.034
 math-0.032
 operation-0.032
 x-0.032
Token her
Token ID607
Feature activation-1.01e+00

Pos logit contributions
 homework+0.044
zag+0.043
ggie+0.042
 skate+0.041
 operation+0.041
Neg logit contributions
 pulp-0.044
ment-0.040
als-0.040
 shells-0.039
 fishes-0.039
Token w
Token ID266
Feature activation-6.80e-01

Pos logit contributions
ggie+0.044
zag+0.043
 operation+0.041
case+0.041
ousel+0.041
Neg logit contributions
 pulp-0.038
 shells-0.034
 fishes-0.033
 bowls-0.033
 pans-0.033
Tokenaffles
Token ID48501
Feature activation-1.99e+00

Pos logit contributions
 homework+0.022
zag+0.022
 not+0.021
ggie+0.021
 Maxwell+0.021
Neg logit contributions
 pulp-0.031
oms-0.029
 fishes-0.028
ins-0.028
ment-0.028