TOP ACTIVATIONS
MAX = 0.107

. He practiced riding his bicycle every day to get ready
it . He practiced his math problems and soon he could
every day . But she didn 't like to listen to
math homework . Tim my didn 't want to do it
book to practice . Lily didn 't want to do it
tried his best to do his best . He worked hard
ride . He would ride it every day and feel the
made sure to bounce the football gently so that it wouldn
his trumpet . He plays it every day . He makes
Tim my practiced playing the violin every day and soon became
muscles felt heavy . He didn 't like the feeling .
Tim my didn 't have a real sword
much . He would ride it around town and people would
my always wore his yellow helmet when he rode his bike
the trumpet . But he didn 't have a trumpet of
him . Tim my missed him so much and wondered where
that he should complete his math and not lie . <|endoftext|>
the right equipment . She didn 't like this , so
phone was difficult . She didn 't know how to call
, but he played his violin beautifully . The people cl

MIN ACTIVATIONS
MIN = -0.134

als and bloom ed in bright colors - red , yellow
big reef with lots of colorful fish and coral .
, with all sorts of bright colors . The
grew into big plants with beautiful flowers . The farmer was
big reef with lots of colorful coral . Nem o wanted
she found big flowers with colorful pet als and birds picking
their home with lots of pretty things . One day ,
, beautiful garden full of colourful flowers . The child loved
big pond with lots of colorful rocks . He walked closer
was big and full of colourful fish . Tom thought it
and picked out lots of colorful flowers . They placed the
and trees and lots of green . But then something bad
big pond and lots of colorful flowers . M
. There were lots of small fishes swimming around his hands
their garden with lots of colorful things . They bought some
garden were lots of beautiful colourful flowers . Every day ,
of green and lots of tiny , colorful fish ! Molly
and Mum picked lots of colourful flowers and made a pretty
the garden were lots of beautiful colourful flowers . Every day
would fill the bucket with fresh water from the nearby river

INTERVAL 0.046 - 0.107
CONTAINS 1.745%

so quickly , Nick y didn 't know what to guess
and play . He never stayed still for long !
he would get into trouble , so he quickly ran over
scared . What would she do now ? She thought and
Benny learned that it 's important to take care of your

INTERVAL -0.014 - 0.046
CONTAINS 71.721%

gy bank and brought it back to the man . He
him , " What are you holding ?" The
buy one . The girl said , " What
was not far away . When she emerged from the tunnel
ling . The wolf wanted to be the best how ler

INTERVAL -0.074 - -0.014
CONTAINS 26.454%

to climb the ladder and see the birds .
in the dirt and find bugs and stones . They had
crabs , and he and his Mum went home .
All of a sudden , the music started and everyone
could always use his cat 's help when he needed it

INTERVAL -0.134 - -0.074
CONTAINS 0.079%

and threw them in the trash . She told Tommy to
and separated the leaf in two , making Alexis very sad
Please , can we drink some of yours ?" Ben begged
harbor . They saw many big and small boats . They
prince and his sun fl owers lived happily ever after .
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
 homework+0.036
 x+0.035
ggie+0.035
 operation+0.034
 Maxwell+0.034
Neg logit contributions
 pulp-0.030
ment-0.026
oms-0.026
 fishes-0.026
 shells-0.026
Token He
Token ID679
Feature activation+3.80e-02

Pos logit contributions
 homework+0.058
ggie+0.053
zag+0.052
 operation+0.052
 x+0.052
Neg logit contributions
 pulp-0.039
 fishes-0.036
ment-0.036
 sunlight-0.034
Happy-0.034
Token practiced
Token ID19893
Feature activation+6.50e-02

Pos logit contributions
 homework+0.070
 math+0.067
 x+0.067
 operation+0.066
 robber+0.066
Neg logit contributions
 pulp-0.048
 cheered-0.047
 fishes-0.045
 ge-0.044
 flags-0.044
Token riding
Token ID10311
Feature activation+6.26e-02

Pos logit contributions
isk+0.103
 x+0.100
ibo+0.099
 operation+0.099
omm+0.099
Neg logit contributions
 flags-0.095
 pump-0.087
 islands-0.086
 valleys-0.086
 cheered-0.086
Token his
Token ID465
Feature activation+5.01e-02

Pos logit contributions
isk+0.107
 homework+0.106
 x+0.104
omm+0.104
 operation+0.103
Neg logit contributions
 flags-0.091
 shells-0.081
 cheered-0.081
 banners-0.080
 valleys-0.079
Token bicycle
Token ID17026
Feature activation+1.07e-01

Pos logit contributions
isk+0.055
ggie+0.053
 Maxwell+0.050
 Rat+0.049
liv+0.048
Neg logit contributions
 flags-0.102
 pulp-0.097
 shells-0.095
 banners-0.093
 pump-0.093
Token every
Token ID790
Feature activation+5.12e-02

Pos logit contributions
 x+0.194
scar+0.187
 homework+0.185
ggie+0.183
liv+0.180
Neg logit contributions
als-0.129
 flags-0.115
oms-0.105
 cheered-0.103
ment-0.102
Token day
Token ID1110
Feature activation+3.60e-02

Pos logit contributions
isk+0.090
 homework+0.084
 Rat+0.081
 Maxwell+0.081
 not+0.080
Neg logit contributions
 sunlight-0.069
oms-0.068
 pulp-0.068
 flags-0.066
 pump-0.066
Token to
Token ID284
Feature activation+4.84e-02

Pos logit contributions
 homework+0.063
 operation+0.058
 x+0.058
ggie+0.057
isk+0.057
Neg logit contributions
 pulp-0.050
oms-0.049
 flags-0.049
 fishes-0.047
als-0.047
Token get
Token ID651
Feature activation-6.54e-03

Pos logit contributions
 homework+0.069
ggie+0.066
zag+0.066
isk+0.065
omm+0.064
Neg logit contributions
 flags-0.075
 pump-0.073
 shells-0.071
 pulp-0.070
 banners-0.070
Token ready
Token ID3492
Feature activation+8.01e-03

Pos logit contributions
 pulp+0.013
oms+0.012
 fishes+0.012
ment+0.011
 banners+0.011
Neg logit contributions
 homework-0.009
 operation-0.008
 math-0.008
ggie-0.008
 not-0.008
Token it
Token ID340
Feature activation+1.71e-02

Pos logit contributions
 pulp+0.012
oms+0.011
 fishes+0.011
ment+0.010
 banners+0.010
Neg logit contributions
 homework-0.009
 math-0.009
 operation-0.009
ggie-0.009
 not-0.009
Token.
Token ID13
Feature activation-7.48e-03

Pos logit contributions
 homework+0.029
 operation+0.027
ggie+0.027
 x+0.027
 robber+0.027
Neg logit contributions
 pulp-0.026
oms-0.023
 fishes-0.023
 flags-0.022
als-0.022
Token He
Token ID679
Feature activation+2.83e-02

Pos logit contributions
 pulp+0.016
oms+0.014
 fishes+0.014
 banners+0.014
 flags+0.014
Neg logit contributions
 homework-0.009
 math-0.008
 operation-0.008
 not-0.008
ggie-0.008
Token practiced
Token ID19893
Feature activation+6.47e-02

Pos logit contributions
 homework+0.054
 operation+0.052
 x+0.049
 robber+0.049
zag+0.049
Neg logit contributions
 pulp-0.038
 cheered-0.035
 fishes-0.033
 shells-0.033
 flags-0.033
Token his
Token ID465
Feature activation+8.94e-02

Pos logit contributions
ibo+0.099
liv+0.099
isk+0.098
ggie+0.097
 operation+0.096
Neg logit contributions
 flags-0.096
 cheered-0.090
 banners-0.089
 pulp-0.088
 valleys-0.088
Token math
Token ID10688
Feature activation+1.06e-01

Pos logit contributions
isk+0.148
scar+0.127
liv+0.127
penter+0.121
gy+0.121
Neg logit contributions
 flags-0.131
 shells-0.122
 faces-0.120
oms-0.116
 pump-0.114
Token problems
Token ID2761
Feature activation+5.99e-02

Pos logit contributions
scar+0.154
ibo+0.147
isk+0.146
tons+0.146
liv+0.142
Neg logit contributions
 flags-0.156
 cheered-0.148
 banners-0.143
 valleys-0.142
 shells-0.142
Token and
Token ID290
Feature activation+2.71e-02

Pos logit contributions
 homework+0.098
 operation+0.095
 x+0.094
ggie+0.094
isk+0.091
Neg logit contributions
oms-0.081
 cheered-0.079
 flags-0.078
ment-0.077
 pulp-0.077
Token soon
Token ID2582
Feature activation-6.35e-03

Pos logit contributions
 homework+0.041
ggie+0.039
 operation+0.039
 x+0.038
zag+0.037
Neg logit contributions
 pulp-0.046
 cheered-0.042
 flags-0.042
 fishes-0.041
 shells-0.041
Token he
Token ID339
Feature activation+3.25e-02

Pos logit contributions
 pulp+0.013
 fishes+0.012
oms+0.012
ment+0.012
 banners+0.012
Neg logit contributions
 homework-0.008
 operation-0.007
ggie-0.007
 math-0.007
 not-0.007
Token could
Token ID714
Feature activation+5.14e-02

Pos logit contributions
 homework+0.059
 operation+0.057
 x+0.056
 robber+0.054
ggie+0.053
Neg logit contributions
 pulp-0.045
 cheered-0.043
als-0.041
oms-0.040
 fishes-0.040
Token every
Token ID790
Feature activation+4.91e-02

Pos logit contributions
 robber+0.089
scar+0.087
 x+0.084
ibo+0.084
 operation+0.084
Neg logit contributions
 flags-0.071
 banners-0.066
 them-0.064
 pump-0.063
 valleys-0.063
Token day
Token ID1110
Feature activation+2.30e-02

Pos logit contributions
isk+0.085
 Maxwell+0.081
 homework+0.078
 not+0.076
 Rat+0.076
Neg logit contributions
 pulp-0.067
Happy-0.065
oms-0.064
 sunlight-0.063
 fishes-0.063
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 homework+0.050
 operation+0.048
zag+0.048
 x+0.047
ggie+0.047
Neg logit contributions
 pulp-0.023
als-0.020
oms-0.019
 flags-0.019
 fishes-0.019
Token But
Token ID887
Feature activation+5.03e-02

Pos logit contributions
 homework+0.021
ggie+0.020
 operation+0.019
zag+0.019
 x+0.019
Neg logit contributions
 pulp-0.022
oms-0.019
 fishes-0.019
als-0.018
 banners-0.018
Token she
Token ID673
Feature activation+6.46e-02

Pos logit contributions
 robber+0.085
ggie+0.085
zag+0.083
isk+0.083
icycle+0.080
Neg logit contributions
 pulp-0.071
 pump-0.065
 shells-0.063
 ge-0.062
 roses-0.062
Token didn
Token ID1422
Feature activation+1.06e-01

Pos logit contributions
 robber+0.113
 Maxwell+0.112
 x+0.111
 math+0.107
 operation+0.107
Neg logit contributions
 admired-0.078
 cheered-0.078
 pulp-0.077
 ge-0.077
 were-0.076
Token't
Token ID470
Feature activation+2.03e-02

Pos logit contributions
isk+0.177
pert+0.171
 rider+0.163
 robber+0.163
ci+0.161
Neg logit contributions
oms-0.157
Happy-0.127
ades-0.121
Everyone-0.118
izz-0.118
Token like
Token ID588
Feature activation+4.23e-02

Pos logit contributions
 homework+0.033
ggie+0.032
zag+0.031
 x+0.031
 math+0.030
Neg logit contributions
 pulp-0.031
 pump-0.028
oms-0.027
 fishes-0.027
 ge-0.027
Token to
Token ID284
Feature activation+1.67e-02

Pos logit contributions
zag+0.076
isk+0.074
ggie+0.073
penter+0.071
 homework+0.070
Neg logit contributions
 pulp-0.054
oms-0.054
 flags-0.053
 pump-0.051
 banners-0.050
Token listen
Token ID6004
Feature activation+1.76e-02

Pos logit contributions
 homework+0.025
zag+0.024
 Maxwell+0.024
ggie+0.024
 math+0.023
Neg logit contributions
 pulp-0.027
 flags-0.024
 pump-0.024
 banners-0.024
 fishes-0.024
Token to
Token ID284
Feature activation+3.65e-02

Pos logit contributions
 homework+0.029
ggie+0.028
 x+0.028
 operation+0.028
zag+0.028
Neg logit contributions
 pulp-0.026
oms-0.024
ment-0.024
 flags-0.023
 fishes-0.023
Token math
Token ID10688
Feature activation+8.42e-02

Pos logit contributions
 Maxwell+0.077
isk+0.075
ggie+0.072
tons+0.070
zag+0.069
Neg logit contributions
 pulp-0.153
 shells-0.150
 buckets-0.149
 flags-0.148
 pump-0.146
Token homework
Token ID26131
Feature activation+7.16e-02

Pos logit contributions
isk+0.095
scar+0.095
ibo+0.093
tons+0.091
 Maxwell+0.088
Neg logit contributions
 flags-0.153
 pulp-0.152
 shells-0.147
oms-0.147
 pump-0.145
Token.
Token ID13
Feature activation+7.33e-04

Pos logit contributions
ggie+0.157
 x+0.156
 Maxwell+0.155
isk+0.153
 Rat+0.153
Neg logit contributions
 flags-0.044
oms-0.044
 cheered-0.043
 pulp-0.042
 shells-0.038
Token Tim
Token ID5045
Feature activation+5.07e-02

Pos logit contributions
 homework+0.001
ggie+0.001
 operation+0.001
zag+0.001
 math+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.001
 banners-0.001
ment-0.001
Tokenmy
Token ID1820
Feature activation+5.27e-02

Pos logit contributions
 homework+0.094
 robber+0.093
 Label+0.092
ggie+0.092
 math+0.092
Neg logit contributions
ins-0.059
ies-0.058
 cheered-0.057
 pulp-0.053
 admired-0.050
Token didn
Token ID1422
Feature activation+1.05e-01

Pos logit contributions
 robber+0.101
 homework+0.099
 Patch+0.098
 Maxwell+0.098
 operation+0.098
Neg logit contributions
 cheered-0.065
 pulp-0.057
 admired-0.056
led-0.050
 gasped-0.050
Token't
Token ID470
Feature activation+1.65e-02

Pos logit contributions
isk+0.182
pert+0.170
 robber+0.159
 dictionary+0.159
 mask+0.159
Neg logit contributions
oms-0.133
Happy-0.128
izz-0.116
Everyone-0.116
 them-0.112
Token want
Token ID765
Feature activation+4.12e-02

Pos logit contributions
 homework+0.029
ggie+0.028
zag+0.028
 operation+0.027
 math+0.027
Neg logit contributions
 pulp-0.023
oms-0.020
 cheered-0.020
Happy-0.020
 pump-0.019
Token to
Token ID284
Feature activation+1.87e-02

Pos logit contributions
zag+0.081
isk+0.079
ggie+0.074
ousel+0.073
belt+0.070
Neg logit contributions
 shells-0.047
 pulp-0.046
 flags-0.046
 pump-0.043
 fishes-0.042
Token do
Token ID466
Feature activation+6.26e-02

Pos logit contributions
zag+0.028
 homework+0.027
ggie+0.027
 Maxwell+0.026
isk+0.026
Neg logit contributions
 pulp-0.030
 pump-0.026
oms-0.026
 flags-0.026
 fishes-0.026
Token it
Token ID340
Feature activation+7.64e-02

Pos logit contributions
zag+0.123
isk+0.121
tr+0.115
ibo+0.111
icycle+0.109
Neg logit contributions
 flags-0.070
 shells-0.064
 faces-0.063
oms-0.062
ment-0.062
Token book
Token ID1492
Feature activation+5.02e-02

Pos logit contributions
isk+0.095
scar+0.088
ggie+0.085
liv+0.083
ibo+0.083
Neg logit contributions
 pulp-0.134
 flags-0.129
 shells-0.126
oms-0.124
als-0.122
Token to
Token ID284
Feature activation+2.35e-02

Pos logit contributions
 homework+0.092
ggie+0.090
 operation+0.090
 Maxwell+0.089
 skate+0.089
Neg logit contributions
oms-0.055
als-0.055
 pulp-0.053
ment-0.052
 flags-0.051
Token practice
Token ID3357
Feature activation+5.58e-02

Pos logit contributions
zag+0.033
 homework+0.031
ggie+0.031
 operation+0.029
 Maxwell+0.029
Neg logit contributions
 pulp-0.042
 flags-0.037
Happy-0.037
 shells-0.037
 banners-0.036
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
zag+0.097
ggie+0.096
 robber+0.095
isk+0.095
scar+0.095
Neg logit contributions
 flags-0.067
 pulp-0.067
 banners-0.063
ment-0.062
oms-0.062
Token Lily
Token ID20037
Feature activation+6.69e-02

Pos logit contributions
zag+0.045
ggie+0.044
 homework+0.044
 operation+0.040
 rider+0.040
Neg logit contributions
 pulp-0.044
als-0.038
oms-0.038
 cups-0.038
 banners-0.037
Token didn
Token ID1422
Feature activation+1.05e-01

Pos logit contributions
 operation+0.140
 robber+0.140
 homework+0.135
 x+0.133
isk+0.131
Neg logit contributions
 cheered-0.076
 admired-0.070
 gasped-0.061
 thanked-0.059
 applauded-0.059
Token't
Token ID470
Feature activation+2.46e-02

Pos logit contributions
isk+0.170
pert+0.161
 rider+0.155
 mask+0.153
 robber+0.153
Neg logit contributions
oms-0.148
Happy-0.132
qu-0.121
ades-0.119
izz-0.118
Token want
Token ID765
Feature activation+4.53e-02

Pos logit contributions
 homework+0.047
ggie+0.047
zag+0.046
 operation+0.044
isk+0.044
Neg logit contributions
 pulp-0.029
 cheered-0.026
oms-0.025
Happy-0.024
 pump-0.024
Token to
Token ID284
Feature activation+3.29e-02

Pos logit contributions
zag+0.090
isk+0.087
ousel+0.084
 skate+0.083
ggie+0.081
Neg logit contributions
 pulp-0.051
 shells-0.048
als-0.048
oms-0.048
 flags-0.047
Token do
Token ID466
Feature activation+6.22e-02

Pos logit contributions
zag+0.051
isk+0.050
 homework+0.048
ggie+0.048
 Maxwell+0.047
Neg logit contributions
 pulp-0.044
 pump-0.043
 cheered-0.041
 flags-0.041
oms-0.041
Token it
Token ID340
Feature activation+8.11e-02

Pos logit contributions
zag+0.120
isk+0.114
ibo+0.109
scar+0.108
 rider+0.107
Neg logit contributions
 flags-0.070
oms-0.069
 pulp-0.067
ment-0.066
 shells-0.065
Token tried
Token ID3088
Feature activation+5.48e-02

Pos logit contributions
 homework+0.061
ggie+0.059
 operation+0.059
 x+0.058
 math+0.058
Neg logit contributions
 pulp-0.043
 fishes-0.041
 cheered-0.041
 flags-0.040
 shells-0.040
Token his
Token ID465
Feature activation+8.14e-02

Pos logit contributions
isk+0.109
penter+0.103
case+0.102
ggie+0.102
 operation+0.101
Neg logit contributions
 shells-0.060
 flags-0.060
 pump-0.059
 islands-0.054
 fishes-0.053
Token best
Token ID1266
Feature activation+6.43e-02

Pos logit contributions
isk+0.166
liv+0.142
penter+0.136
 Maxwell+0.131
nob+0.130
Neg logit contributions
 flags-0.100
 pump-0.097
 shells-0.095
 ducks-0.093
 fishes-0.091
Token to
Token ID284
Feature activation+5.05e-02

Pos logit contributions
isk+0.145
liv+0.137
 x+0.133
scar+0.131
 Maxwell+0.130
Neg logit contributions
 flags-0.053
 shells-0.050
ment-0.049
 pump-0.048
oms-0.046
Token do
Token ID466
Feature activation+6.31e-02

Pos logit contributions
isk+0.060
 homework+0.060
zag+0.057
 Rat+0.056
 Maxwell+0.056
Neg logit contributions
 cheered-0.086
 flags-0.084
 pump-0.083
 fishes-0.082
 pulp-0.082
Token his
Token ID465
Feature activation+1.04e-01

Pos logit contributions
isk+0.111
ibo+0.108
zag+0.107
tr+0.106
 x+0.103
Neg logit contributions
 flags-0.077
 shells-0.073
oms-0.073
 cheered-0.070
ment-0.070
Token best
Token ID1266
Feature activation+6.58e-02

Pos logit contributions
isk+0.204
liv+0.169
 Maxwell+0.163
scar+0.162
 Rat+0.162
Neg logit contributions
 flags-0.131
 pump-0.119
 waves-0.114
 shells-0.114
 balloons-0.111
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
isk+0.164
liv+0.154
 Rat+0.147
 x+0.146
scar+0.145
Neg logit contributions
 flags-0.045
 pump-0.040
 shells-0.040
ment-0.037
oms-0.034
Token He
Token ID679
Feature activation+5.25e-02

Pos logit contributions
 homework+0.023
ggie+0.022
 operation+0.022
zag+0.021
 not+0.021
Neg logit contributions
 pulp-0.022
 fishes-0.020
oms-0.019
ment-0.019
 pump-0.019
Token worked
Token ID3111
Feature activation+6.91e-02

Pos logit contributions
 homework+0.099
isk+0.098
 operation+0.097
 x+0.097
 math+0.096
Neg logit contributions
 cheered-0.062
 pulp-0.056
 fishes-0.056
 shells-0.054
 flags-0.054
Token hard
Token ID1327
Feature activation+6.91e-02

Pos logit contributions
isk+0.132
 operation+0.123
 robber+0.122
 exam+0.120
ggie+0.120
Neg logit contributions
 flags-0.075
 shells-0.072
oms-0.072
 cheered-0.070
 them-0.069
Token ride
Token ID6594
Feature activation+4.37e-02

Pos logit contributions
 homework+0.026
ggie+0.025
zag+0.025
 x+0.024
 operation+0.024
Neg logit contributions
 pulp-0.033
 flags-0.032
 banners-0.031
 shells-0.031
oms-0.031
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
isk+0.081
 operation+0.078
zag+0.077
 x+0.076
 homework+0.075
Neg logit contributions
 flags-0.054
 shells-0.048
 fishes-0.046
 banners-0.046
 cheered-0.046
Token He
Token ID679
Feature activation+4.97e-02

Pos logit contributions
 homework+0.046
ggie+0.045
zag+0.042
 operation+0.041
 x+0.040
Neg logit contributions
 pulp-0.042
 fishes-0.040
 pump-0.038
 ducks-0.038
 shells-0.038
Token would
Token ID561
Feature activation+6.13e-02

Pos logit contributions
 homework+0.100
 operation+0.100
 math+0.100
 x+0.099
isk+0.098
Neg logit contributions
 cheered-0.051
 admired-0.049
 flags-0.048
 floats-0.046
 pump-0.045
Token ride
Token ID6594
Feature activation+5.23e-02

Pos logit contributions
ggie+0.105
liv+0.104
isk+0.103
scar+0.102
omm+0.100
Neg logit contributions
 flags-0.082
 pump-0.081
 banners-0.076
 pulp-0.073
 shells-0.073
Token it
Token ID340
Feature activation+1.04e-01

Pos logit contributions
 operation+0.099
isk+0.098
 ankle+0.095
 homework+0.095
omm+0.093
Neg logit contributions
 flags-0.067
 cheered-0.058
 shells-0.057
Happy-0.057
 banners-0.057
Token every
Token ID790
Feature activation+6.09e-02

Pos logit contributions
omm+0.233
scar+0.221
ggie+0.217
liv+0.216
 x+0.214
Neg logit contributions
 hills-0.083
 flags-0.082
als-0.079
 valleys-0.076
 shells-0.073
Token day
Token ID1110
Feature activation+3.02e-02

Pos logit contributions
isk+0.125
 homework+0.118
 Maxwell+0.117
 Rat+0.115
liv+0.114
Neg logit contributions
 sunlight-0.067
 pump-0.064
 flags-0.061
 ge-0.060
 pulp-0.060
Token and
Token ID290
Feature activation+4.57e-02

Pos logit contributions
 homework+0.054
 operation+0.051
isk+0.050
 x+0.049
ggie+0.049
Neg logit contributions
 pulp-0.041
 flags-0.039
 fishes-0.038
oms-0.038
 pump-0.038
Token feel
Token ID1254
Feature activation+9.80e-03

Pos logit contributions
ggie+0.077
omm+0.074
isk+0.073
 x+0.073
 operation+0.071
Neg logit contributions
 flags-0.065
 cheered-0.062
 pulp-0.062
 shells-0.062
 banners-0.061
Token the
Token ID262
Feature activation-3.51e-03

Pos logit contributions
 homework+0.015
zag+0.015
ggie+0.014
 operation+0.014
 x+0.013
Neg logit contributions
 pulp-0.017
 fishes-0.015
oms-0.015
ment-0.015
 flags-0.015
Token made
Token ID925
Feature activation+4.29e-02

Pos logit contributions
 x+0.112
 homework+0.112
 operation+0.105
 math+0.103
ggie+0.102
Neg logit contributions
 pulp-0.070
 cheered-0.069
 fishes-0.068
 pump-0.067
 ge-0.067
Token sure
Token ID1654
Feature activation+2.38e-02

Pos logit contributions
zag+0.075
belt+0.071
isk+0.071
tr+0.070
 homework+0.067
Neg logit contributions
 fishes-0.062
 pulp-0.061
 ducks-0.060
 flags-0.059
 shells-0.058
Token to
Token ID284
Feature activation+4.33e-02

Pos logit contributions
 homework+0.041
zag+0.040
ggie+0.039
 x+0.039
 operation+0.039
Neg logit contributions
 pulp-0.036
 shells-0.032
 fishes-0.031
 pump-0.031
 flags-0.031
Token bounce
Token ID20110
Feature activation+6.02e-02

Pos logit contributions
 homework+0.070
 x+0.066
 Maxwell+0.064
 operation+0.064
 math+0.063
Neg logit contributions
 pulp-0.060
 shells-0.060
 pump-0.059
 ducks-0.057
 ge-0.057
Token the
Token ID262
Feature activation+2.25e-02

Pos logit contributions
zag+0.108
ggie+0.108
omm+0.107
isk+0.106
 x+0.104
Neg logit contributions
 shells-0.082
 flags-0.077
 fishes-0.077
 them-0.073
 cheered-0.071
Token football
Token ID4346
Feature activation+1.03e-01

Pos logit contributions
 homework+0.024
isk+0.023
ggie+0.023
zag+0.023
 operation+0.022
Neg logit contributions
 pulp-0.049
 fishes-0.047
 flags-0.045
 shells-0.045
 ducks-0.044
Token gently
Token ID15165
Feature activation+5.23e-02

Pos logit contributions
 x+0.216
innie+0.205
ggie+0.201
liv+0.200
scar+0.200
Neg logit contributions
als-0.112
 shells-0.112
 pulp-0.100
 fishes-0.097
 flags-0.097
Token so
Token ID523
Feature activation+3.14e-02

Pos logit contributions
 homework+0.099
ggie+0.097
 x+0.096
 operation+0.095
 Maxwell+0.092
Neg logit contributions
 pulp-0.061
 shells-0.060
oms-0.058
 fishes-0.057
als-0.057
Token that
Token ID326
Feature activation+3.52e-02

Pos logit contributions
 homework+0.053
zag+0.050
 x+0.049
isk+0.049
 operation+0.049
Neg logit contributions
 pulp-0.048
ment-0.046
 fishes-0.043
oms-0.043
 shells-0.043
Token it
Token ID340
Feature activation+5.15e-02

Pos logit contributions
zag+0.059
isk+0.058
 homework+0.057
ggie+0.057
 x+0.054
Neg logit contributions
 pulp-0.054
 fishes-0.050
 shells-0.050
 pump-0.048
 islands-0.047
Token wouldn
Token ID3636
Feature activation+6.13e-02

Pos logit contributions
 homework+0.083
ggie+0.081
 math+0.080
 x+0.079
 Maxwell+0.076
Neg logit contributions
 pulp-0.080
als-0.078
oms-0.076
 fishes-0.075
 shells-0.074
Token his
Token ID465
Feature activation+2.56e-02

Pos logit contributions
 pulp+0.026
oms+0.023
 fishes+0.022
ment+0.022
als+0.022
Neg logit contributions
 homework-0.018
 math-0.017
 operation-0.017
 not-0.016
ggie-0.016
Token trumpet
Token ID40334
Feature activation+4.97e-02

Pos logit contributions
zag+0.040
isk+0.040
 homework+0.040
ggie+0.040
 Maxwell+0.038
Neg logit contributions
 pulp-0.039
 flags-0.037
 fishes-0.037
 shells-0.036
 ducks-0.036
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
 x+0.110
 homework+0.110
 Maxwell+0.110
isk+0.108
ggie+0.106
Neg logit contributions
 flags-0.035
als-0.035
 pulp-0.034
 fishes-0.033
oms-0.033
Token He
Token ID679
Feature activation+3.50e-02

Pos logit contributions
 homework+0.022
zag+0.021
ggie+0.021
 operation+0.019
 not+0.019
Neg logit contributions
 pulp-0.029
 fishes-0.026
oms-0.026
 flags-0.026
ment-0.026
Token plays
Token ID5341
Feature activation+5.43e-02

Pos logit contributions
 homework+0.062
 operation+0.062
 x+0.062
ggie+0.061
zag+0.060
Neg logit contributions
 pulp-0.046
 flags-0.044
 shells-0.043
 cheered-0.042
 floats-0.042
Token it
Token ID340
Feature activation+1.02e-01

Pos logit contributions
isk+0.103
 homework+0.097
zag+0.096
ichever+0.095
scar+0.095
Neg logit contributions
 flags-0.070
 cheered-0.067
 banners-0.063
 ducks-0.062
 fishes-0.062
Token every
Token ID790
Feature activation+5.48e-02

Pos logit contributions
scar+0.224
omm+0.216
liv+0.214
 Cop+0.209
innie+0.208
Neg logit contributions
 flags-0.088
.-0.082
 cheered-0.082
 shells-0.081
 together-0.080
Token day
Token ID1110
Feature activation+1.43e-02

Pos logit contributions
isk+0.108
 Maxwell+0.107
 Rat+0.102
 homework+0.100
 x+0.098
Neg logit contributions
 sunlight-0.059
 pump-0.058
 fishes-0.058
 ducks-0.056
Happy-0.056
Token.
Token ID13
Feature activation+4.01e-03

Pos logit contributions
 homework+0.029
 operation+0.027
zag+0.027
 x+0.027
ggie+0.027
Neg logit contributions
 pulp-0.018
 fishes-0.015
oms-0.015
 flags-0.015
als-0.014
Token He
Token ID679
Feature activation+4.58e-02

Pos logit contributions
 homework+0.005
ggie+0.005
zag+0.005
 operation+0.004
 not+0.004
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
 flags-0.007
 banners-0.007
Token makes
Token ID1838
Feature activation+3.76e-02

Pos logit contributions
ggie+0.081
 x+0.079
 operation+0.078
isk+0.077
 homework+0.077
Neg logit contributions
 pulp-0.061
 floats-0.060
 shells-0.060
 flags-0.060
 cheered-0.059
TokenTim
Token ID14967
Feature activation+3.52e-02

Pos logit contributions
 homework+0.024
 operation+0.022
 x+0.021
 math+0.021
 exam+0.021
Neg logit contributions
 pulp-0.021
 fishes-0.019
Happy-0.019
 ge-0.018
 flags-0.018
Tokenmy
Token ID1820
Feature activation+2.02e-02

Pos logit contributions
 homework+0.060
ggie+0.060
 math+0.056
 robber+0.056
 Label+0.055
Neg logit contributions
 pulp-0.049
ins-0.045
ies-0.045
 pump-0.044
 cheered-0.044
Token practiced
Token ID19893
Feature activation+7.60e-02

Pos logit contributions
 homework+0.034
 operation+0.033
 x+0.032
ggie+0.032
 math+0.032
Neg logit contributions
 pulp-0.031
 fishes-0.027
 cheered-0.027
 flags-0.027
oms-0.027
Token playing
Token ID2712
Feature activation+7.79e-02

Pos logit contributions
liv+0.138
case+0.132
ggie+0.130
omm+0.129
isk+0.128
Neg logit contributions
 flags-0.089
 pump-0.084
 them-0.082
 shells-0.079
 banners-0.079
Token the
Token ID262
Feature activation+5.78e-02

Pos logit contributions
omm+0.153
scar+0.151
liv+0.151
 fault+0.149
isk+0.148
Neg logit contributions
 flags-0.088
 cheered-0.082
Happy-0.079
 banners-0.078
 them-0.077
Token violin
Token ID38283
Feature activation+1.02e-01

Pos logit contributions
isk+0.069
ggie+0.061
 not+0.057
omm+0.056
 homework+0.055
Neg logit contributions
 flags-0.116
 banners-0.113
 floats-0.110
Happy-0.110
 fishes-0.105
Token every
Token ID790
Feature activation+5.71e-02

Pos logit contributions
liv+0.203
isk+0.201
scar+0.199
ggie+0.193
omm+0.191
Neg logit contributions
als-0.102
 cheered-0.090
 flags-0.088
oms-0.088
 together-0.085
Token day
Token ID1110
Feature activation+3.03e-02

Pos logit contributions
isk+0.122
 Maxwell+0.113
 not+0.110
 Rat+0.109
 homework+0.109
Neg logit contributions
 sunlight-0.058
Happy-0.052
 pump-0.050
 fishes-0.050
 water-0.048
Token and
Token ID290
Feature activation+3.62e-02

Pos logit contributions
 homework+0.059
zag+0.057
 operation+0.056
ggie+0.055
 x+0.055
Neg logit contributions
 pulp-0.035
 flags-0.032
 fishes-0.032
als-0.032
oms-0.031
Token soon
Token ID2582
Feature activation-1.48e-02

Pos logit contributions
ggie+0.058
 x+0.054
 homework+0.054
 operation+0.053
zag+0.052
Neg logit contributions
 pulp-0.056
 cheered-0.055
 flags-0.054
 shells-0.053
 banners-0.052
Token became
Token ID2627
Feature activation+2.29e-02

Pos logit contributions
 pulp+0.033
oms+0.029
 fishes+0.029
ment+0.028
 banners+0.028
Neg logit contributions
 homework-0.017
 not-0.016
 math-0.015
 operation-0.015
ggie-0.015
Token muscles
Token ID12749
Feature activation-1.22e-02

Pos logit contributions
zag+0.056
 Maxwell+0.052
isk+0.051
ousel+0.051
 Loop+0.050
Neg logit contributions
 pulp-0.045
 ducks-0.037
 fishes-0.037
 faces-0.037
 shells-0.037
Token felt
Token ID2936
Feature activation+2.18e-02

Pos logit contributions
 pulp+0.017
 fishes+0.014
oms+0.014
ment+0.014
 banners+0.014
Neg logit contributions
 homework-0.023
 not-0.022
ggie-0.022
 operation-0.022
zag-0.022
Token heavy
Token ID4334
Feature activation+2.14e-02

Pos logit contributions
zag+0.034
 homework+0.033
ggie+0.032
 operation+0.030
 x+0.030
Neg logit contributions
 pulp-0.038
 fishes-0.034
 shells-0.033
oms-0.033
 ge-0.032
Token.
Token ID13
Feature activation+4.52e-02

Pos logit contributions
 homework+0.040
ggie+0.040
zag+0.039
 operation+0.038
 Maxwell+0.037
Neg logit contributions
 pulp-0.028
oms-0.025
 fishes-0.024
ment-0.024
 grains-0.024
Token He
Token ID679
Feature activation+3.20e-02

Pos logit contributions
ggie+0.076
zag+0.074
 homework+0.073
 Maxwell+0.068
 rider+0.068
Neg logit contributions
 pulp-0.070
 sunlight-0.060
ment-0.059
 faces-0.059
 fishes-0.056
Token didn
Token ID1422
Feature activation+1.02e-01

Pos logit contributions
 homework+0.066
zag+0.066
 operation+0.065
 Maxwell+0.063
 math+0.063
Neg logit contributions
 pulp-0.040
 fishes-0.032
 cheered-0.031
 ge-0.031
 shells-0.030
Token't
Token ID470
Feature activation+5.28e-03

Pos logit contributions
isk+0.154
 rider+0.148
 dictionary+0.143
 mask+0.141
 robber+0.141
Neg logit contributions
oms-0.151
izz-0.135
Happy-0.133
ades-0.126
qu-0.124
Token like
Token ID588
Feature activation+4.11e-02

Pos logit contributions
 homework+0.009
ggie+0.008
zag+0.008
 operation+0.008
 math+0.008
Neg logit contributions
 pulp-0.009
 fishes-0.008
oms-0.007
 ge-0.007
 shells-0.007
Token the
Token ID262
Feature activation+2.08e-02

Pos logit contributions
zag+0.075
ggie+0.067
ousel+0.066
isk+0.066
 homework+0.064
Neg logit contributions
 pulp-0.066
 fishes-0.058
 pump-0.056
 shells-0.055
 ducks-0.054
Token feeling
Token ID4203
Feature activation+1.75e-02

Pos logit contributions
zag+0.031
ggie+0.030
 homework+0.029
isk+0.029
ousel+0.029
Neg logit contributions
 pulp-0.038
 fishes-0.034
 pump-0.032
 grains-0.032
 shells-0.032
Token.
Token ID13
Feature activation+3.58e-02

Pos logit contributions
 homework+0.033
zag+0.032
 operation+0.030
 x+0.030
ggie+0.030
Neg logit contributions
 pulp-0.025
 fishes-0.022
oms-0.022
ment-0.021
 ge-0.020
Token
Token ID220
Feature activation+2.68e-03

Pos logit contributions
 pulp+0.003
 fishes+0.003
oms+0.003
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.003
 operation-0.003
ggie-0.003
 math-0.003
zag-0.003
Token
Token ID198
Feature activation-5.48e-03

Pos logit contributions
 homework+0.005
 operation+0.005
ggie+0.004
 math+0.004
zag+0.004
Neg logit contributions
 pulp-0.004
oms-0.003
 fishes-0.003
ment-0.003
 flags-0.003
Token
Token ID198
Feature activation+5.87e-03

Pos logit contributions
 pulp+0.009
 fishes+0.008
oms+0.008
ment+0.007
 banners+0.007
Neg logit contributions
 homework-0.009
 operation-0.009
ggie-0.009
 math-0.009
 not-0.009
TokenTim
Token ID14967
Feature activation+4.02e-02

Pos logit contributions
 homework+0.009
 operation+0.009
 x+0.008
 math+0.008
ggie+0.008
Neg logit contributions
 pulp-0.010
 fishes-0.009
 ge-0.009
 flags-0.009
oms-0.009
Tokenmy
Token ID1820
Feature activation+2.89e-02

Pos logit contributions
 homework+0.071
 math+0.066
ggie+0.065
 robber+0.064
 operation+0.064
Neg logit contributions
 pulp-0.055
 cheered-0.053
ies-0.051
ins-0.050
 pump-0.049
Token didn
Token ID1422
Feature activation+1.02e-01

Pos logit contributions
 homework+0.059
 operation+0.055
 math+0.054
 exam+0.054
 x+0.054
Neg logit contributions
 pulp-0.036
 cheered-0.035
 fishes-0.032
 ge-0.032
 ducks-0.031
Token't
Token ID470
Feature activation-1.55e-03

Pos logit contributions
isk+0.161
pert+0.159
ci+0.155
 rider+0.154
 mask+0.153
Neg logit contributions
oms-0.135
Happy-0.123
izz-0.117
l-0.110
They-0.110
Token have
Token ID423
Feature activation+1.94e-02

Pos logit contributions
 pulp+0.003
 fishes+0.002
oms+0.002
ment+0.002
 flags+0.002
Neg logit contributions
 homework-0.002
ggie-0.002
 operation-0.002
 math-0.002
zag-0.002
Token a
Token ID257
Feature activation+4.02e-02

Pos logit contributions
 homework+0.027
zag+0.027
 operation+0.026
ggie+0.026
 Maxwell+0.025
Neg logit contributions
 pulp-0.034
 flags-0.031
 ge-0.030
 fishes-0.030
 shells-0.030
Token real
Token ID1103
Feature activation+2.11e-02

Pos logit contributions
liv+0.055
isk+0.054
ggie+0.052
scar+0.052
 homework+0.051
Neg logit contributions
 pulp-0.074
 pump-0.064
 flags-0.063
oms-0.062
 ge-0.061
Token sword
Token ID8429
Feature activation+4.82e-02

Pos logit contributions
 homework+0.025
 Maxwell+0.024
isk+0.024
ggie+0.024
liv+0.023
Neg logit contributions
 pulp-0.043
 flags-0.038
 fishes-0.038
ment-0.037
oms-0.037
Token much
Token ID881
Feature activation+2.76e-03

Pos logit contributions
 homework+0.054
 Maxwell+0.053
 operation+0.053
zag+0.053
 x+0.052
Neg logit contributions
 pulp-0.038
ment-0.034
Happy-0.034
 fishes-0.033
oms-0.033
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
 homework+0.006
ggie+0.006
zag+0.006
 operation+0.006
 math+0.006
Neg logit contributions
 pulp-0.003
oms-0.003
 fishes-0.003
ment-0.002
 flags-0.002
Token He
Token ID679
Feature activation+3.55e-02

Pos logit contributions
 homework+0.029
ggie+0.028
zag+0.027
 operation+0.026
 exam+0.025
Neg logit contributions
 pulp-0.035
 fishes-0.031
 flags-0.030
 pump-0.030
 ducks-0.030
Token would
Token ID561
Feature activation+4.03e-02

Pos logit contributions
 homework+0.070
 math+0.070
 operation+0.068
 x+0.067
zag+0.067
Neg logit contributions
 pulp-0.040
 cheered-0.037
 ge-0.035
 ducks-0.035
 admired-0.035
Token ride
Token ID6594
Feature activation+5.18e-02

Pos logit contributions
zag+0.057
ggie+0.056
 homework+0.055
 Maxwell+0.055
isk+0.054
Neg logit contributions
 pulp-0.067
 flags-0.063
 pump-0.063
 banners-0.060
 cheered-0.059
Token it
Token ID340
Feature activation+1.02e-01

Pos logit contributions
isk+0.112
 operation+0.108
 homework+0.105
zag+0.104
omm+0.104
Neg logit contributions
 flags-0.054
 cheered-0.047
Happy-0.046
 shells-0.045
 pulp-0.043
Token around
Token ID1088
Feature activation+4.96e-02

Pos logit contributions
omm+0.217
scar+0.212
ggie+0.208
liv+0.206
€+0.203
Neg logit contributions
 flags-0.086
als-0.084
 shells-0.084
oms-0.084
 valleys-0.082
Token town
Token ID3240
Feature activation+1.88e-02

Pos logit contributions
zag+0.105
liv+0.104
ggie+0.103
 homework+0.100
scar+0.099
Neg logit contributions
 islands-0.050
 flags-0.050
 whales-0.049
 pump-0.047
 fishes-0.047
Token and
Token ID290
Feature activation+2.44e-02

Pos logit contributions
 homework+0.029
 operation+0.028
 x+0.028
ggie+0.028
 Maxwell+0.027
Neg logit contributions
 pulp-0.030
 flags-0.028
 fishes-0.028
 islands-0.027
 pump-0.027
Token people
Token ID661
Feature activation-2.61e-02

Pos logit contributions
ggie+0.039
 homework+0.039
 operation+0.037
zag+0.037
 x+0.037
Neg logit contributions
 pulp-0.039
 flags-0.035
 banners-0.034
 shells-0.034
 pump-0.034
Token would
Token ID561
Feature activation+1.88e-02

Pos logit contributions
 pulp+0.052
ment+0.044
 fishes+0.044
oms+0.044
 banners+0.042
Neg logit contributions
 homework-0.037
 not-0.036
zag-0.035
ggie-0.034
 skate-0.034
Tokenmy
Token ID1820
Feature activation+5.23e-02

Pos logit contributions
ggie+0.125
 homework+0.124
 IT+0.120
 math+0.119
 Label+0.116
Neg logit contributions
ies-0.055
 sunlight-0.050
ins-0.049
 pulp-0.048
 admired-0.046
Token always
Token ID1464
Feature activation-2.28e-03

Pos logit contributions
 homework+0.099
 math+0.096
 x+0.095
 robber+0.093
isk+0.092
Neg logit contributions
 pulp-0.062
 cheered-0.062
 pump-0.058
 admired-0.057
 ge-0.056
Token wore
Token ID12408
Feature activation+2.81e-02

Pos logit contributions
 pulp+0.004
 fishes+0.003
oms+0.003
ment+0.003
 banners+0.003
Neg logit contributions
 homework-0.004
ggie-0.004
 operation-0.004
 math-0.004
zag-0.004
Token his
Token ID465
Feature activation+3.45e-02

Pos logit contributions
zag+0.033
 homework+0.033
isk+0.032
ggie+0.031
 operation+0.030
Neg logit contributions
 pulp-0.053
 flags-0.052
 shells-0.051
 banners-0.050
ment-0.049
Token yellow
Token ID7872
Feature activation+3.76e-02

Pos logit contributions
isk+0.030
ggie+0.029
zag+0.028
 homework+0.027
 Maxwell+0.025
Neg logit contributions
 pulp-0.076
 flags-0.075
 shells-0.074
oms-0.071
 banners-0.071
Token helmet
Token ID14335
Feature activation+1.02e-01

Pos logit contributions
ggie+0.034
 homework+0.033
isk+0.033
 Maxwell+0.031
 not+0.030
Neg logit contributions
 pulp-0.080
 flags-0.079
 shells-0.078
 banners-0.076
 pump-0.075
Token when
Token ID618
Feature activation+1.17e-02

Pos logit contributions
 x+0.179
ggie+0.175
 homework+0.175
 IT+0.173
omm+0.173
Neg logit contributions
als-0.108
 cheered-0.091
ed-0.089
ment-0.089
.-0.087
Token he
Token ID339
Feature activation+4.13e-02

Pos logit contributions
 homework+0.015
ggie+0.014
zag+0.014
 operation+0.013
 x+0.013
Neg logit contributions
 pulp-0.023
 fishes-0.021
 shells-0.020
oms-0.020
ment-0.020
Token rode
Token ID22075
Feature activation+2.40e-02

Pos logit contributions
 homework+0.080
 math+0.079
ggie+0.076
 Maxwell+0.075
 operation+0.074
Neg logit contributions
oms-0.044
 pulp-0.042
als-0.041
 cheered-0.041
 ge-0.040
Token his
Token ID465
Feature activation+3.49e-02

Pos logit contributions
 homework+0.038
ggie+0.035
 x+0.035
 operation+0.035
zag+0.034
Neg logit contributions
 pulp-0.036
 flags-0.036
ment-0.035
 banners-0.034
 shells-0.034
Token bike
Token ID7161
Feature activation+6.80e-02

Pos logit contributions
ggie+0.041
isk+0.040
 x+0.037
 homework+0.036
zag+0.036
Neg logit contributions
 flags-0.068
 pulp-0.067
 shells-0.065
oms-0.063
 pump-0.063
Token the
Token ID262
Feature activation+3.79e-02

Pos logit contributions
isk+0.067
zag+0.067
 homework+0.062
chy+0.062
 Maxwell+0.061
Neg logit contributions
Happy-0.054
 flags-0.053
 banners-0.051
 shells-0.049
 cheered-0.049
Token trumpet
Token ID40334
Feature activation+6.50e-02

Pos logit contributions
isk+0.048
 homework+0.045
 Maxwell+0.043
 operation+0.042
ggie+0.041
Neg logit contributions
 flags-0.071
 fishes-0.069
 banners-0.068
 ducks-0.067
 pulp-0.066
Token.
Token ID13
Feature activation+3.27e-02

Pos logit contributions
isk+0.150
scar+0.147
liv+0.143
 Maxwell+0.140
omm+0.137
Neg logit contributions
 flags-0.046
oms-0.046
als-0.044
Happy-0.040
 banners-0.038
Token But
Token ID887
Feature activation+6.77e-02

Pos logit contributions
ggie+0.054
 homework+0.053
zag+0.048
 operation+0.047
 robber+0.047
Neg logit contributions
 pulp-0.047
 pump-0.046
 fishes-0.045
 flags-0.044
 shells-0.044
Token he
Token ID339
Feature activation+6.87e-02

Pos logit contributions
isk+0.133
ggie+0.127
zag+0.122
 robber+0.120
case+0.120
Neg logit contributions
 shells-0.076
 pump-0.072
 flags-0.067
 buckets-0.063
 boats-0.062
Token didn
Token ID1422
Feature activation+1.01e-01

Pos logit contributions
 math+0.135
 Maxwell+0.132
igsaw+0.131
 maths+0.131
 robber+0.130
Neg logit contributions
 saw-0.067
 said-0.065
 ge-0.064
 admired-0.064
 asked-0.059
Token't
Token ID470
Feature activation+2.16e-02

Pos logit contributions
isk+0.167
pert+0.161
 robber+0.153
 rider+0.153
 mask+0.146
Neg logit contributions
oms-0.142
Happy-0.124
izz-0.121
ades-0.114
 them-0.108
Token have
Token ID423
Feature activation+2.26e-02

Pos logit contributions
 homework+0.039
ggie+0.039
zag+0.038
 math+0.037
 x+0.037
Neg logit contributions
 pulp-0.027
 pump-0.024
oms-0.024
 ge-0.023
Happy-0.023
Token a
Token ID257
Feature activation+4.07e-02

Pos logit contributions
zag+0.034
isk+0.032
 homework+0.032
ggie+0.031
 operation+0.031
Neg logit contributions
 pulp-0.038
 flags-0.036
 shells-0.035
 fishes-0.034
 ge-0.034
Token trumpet
Token ID40334
Feature activation+7.03e-02

Pos logit contributions
isk+0.067
 homework+0.056
ggie+0.056
 Maxwell+0.054
 Rat+0.054
Neg logit contributions
 pulp-0.064
 flags-0.063
oms-0.062
 pump-0.060
 banners-0.059
Token of
Token ID286
Feature activation+1.34e-02

Pos logit contributions
isk+0.143
liv+0.135
innie+0.134
scar+0.133
 Maxwell+0.132
Neg logit contributions
als-0.067
oms-0.062
 flags-0.059
ed-0.055
ment-0.054
Token him
Token ID683
Feature activation+6.32e-02

Pos logit contributions
 homework+0.005
zag+0.005
ggie+0.004
 operation+0.004
 math+0.004
Neg logit contributions
 pulp-0.006
 fishes-0.005
oms-0.005
 flags-0.005
 shells-0.005
Token.
Token ID13
Feature activation+5.98e-03

Pos logit contributions
zag+0.137
 Maxwell+0.135
liv+0.135
scar+0.132
ggie+0.132
Neg logit contributions
 pulp-0.037
 shells-0.036
 cheered-0.035
 flags-0.034
 islands-0.034
Token Tim
Token ID5045
Feature activation+4.67e-02

Pos logit contributions
 homework+0.009
ggie+0.008
 operation+0.008
zag+0.008
 math+0.008
Neg logit contributions
 pulp-0.011
 fishes-0.010
oms-0.010
 ge-0.009
 flags-0.009
Tokenmy
Token ID1820
Feature activation+5.10e-02

Pos logit contributions
 homework+0.088
 math+0.087
ggie+0.086
 robber+0.085
 Label+0.083
Neg logit contributions
ins-0.052
 pulp-0.052
ies-0.051
 cheered-0.051
 admired-0.047
Token missed
Token ID6825
Feature activation+4.16e-02

Pos logit contributions
 operation+0.096
 homework+0.096
 math+0.094
 x+0.093
 exam+0.092
Neg logit contributions
 cheered-0.066
 pulp-0.062
 admired-0.059
 ge-0.055
 applauded-0.055
Token him
Token ID683
Feature activation+1.01e-01

Pos logit contributions
case+0.074
zag+0.072
ggie+0.069
liv+0.066
€+0.064
Neg logit contributions
 whales-0.065
 flags-0.063
 shells-0.063
 sailors-0.062
 pulp-0.061
Token so
Token ID523
Feature activation+2.63e-02

Pos logit contributions
liv+0.183
scar+0.175
isk+0.171
omm+0.170
€+0.169
Neg logit contributions
 were-0.096
 flags-0.094
 faces-0.090
 cheered-0.087
 water-0.087
Token much
Token ID881
Feature activation+2.69e-02

Pos logit contributions
zag+0.047
 homework+0.046
 operation+0.046
 x+0.044
isk+0.043
Neg logit contributions
 pulp-0.034
ment-0.031
Happy-0.030
 flags-0.029
 fishes-0.029
Token and
Token ID290
Feature activation+1.84e-02

Pos logit contributions
zag+0.046
isk+0.045
ggie+0.043
 homework+0.043
 x+0.042
Neg logit contributions
ment-0.035
 flags-0.033
oms-0.033
 shells-0.033
 pulp-0.033
Token wondered
Token ID14028
Feature activation+2.03e-02

Pos logit contributions
zag+0.031
 homework+0.031
ggie+0.030
 x+0.030
 operation+0.030
Neg logit contributions
 pulp-0.029
 fishes-0.024
 flags-0.024
 ge-0.024
 shells-0.024
Token where
Token ID810
Feature activation+8.25e-04

Pos logit contributions
 homework+0.033
zag+0.032
ggie+0.032
 operation+0.031
 x+0.030
Neg logit contributions
 pulp-0.031
 shells-0.028
ment-0.028
 fishes-0.027
oms-0.027
Token that
Token ID326
Feature activation+5.97e-02

Pos logit contributions
isk+0.097
zag+0.097
tons+0.093
ousel+0.092
 rider+0.092
Neg logit contributions
ment-0.042
 shells-0.041
lan-0.039
 seeds-0.039
 flags-0.038
Token he
Token ID339
Feature activation+6.36e-02

Pos logit contributions
isk+0.105
zag+0.105
ggie+0.097
liv+0.096
ousel+0.095
Neg logit contributions
 pulp-0.088
 pump-0.083
 shells-0.082
 fishes-0.079
 cheered-0.076
Token should
Token ID815
Feature activation+2.82e-02

Pos logit contributions
 homework+0.130
zag+0.129
ggie+0.128
 operation+0.127
belt+0.126
Neg logit contributions
led-0.052
 cheered-0.052
als-0.051
 pulp-0.048
 seeds-0.048
Token complete
Token ID1844
Feature activation+6.86e-02

Pos logit contributions
 homework+0.033
zag+0.032
ggie+0.032
 Maxwell+0.031
 x+0.031
Neg logit contributions
 pulp-0.054
 pump-0.048
 shells-0.048
 fishes-0.047
 flags-0.047
Token his
Token ID465
Feature activation+7.89e-02

Pos logit contributions
zag+0.095
case+0.092
 rider+0.086
ggie+0.086
ousel+0.085
Neg logit contributions
 shells-0.115
 flags-0.112
 islands-0.106
 pulp-0.106
ins-0.104
Token math
Token ID10688
Feature activation+1.00e-01

Pos logit contributions
isk+0.075
liv+0.075
zag+0.073
 Maxwell+0.072
scar+0.069
Neg logit contributions
 flags-0.146
 pump-0.146
 faces-0.145
 pulp-0.145
 shells-0.144
Token and
Token ID290
Feature activation+3.62e-02

Pos logit contributions
ibo+0.107
tons+0.107
scar+0.106
isk+0.103
 Loop+0.100
Neg logit contributions
 flags-0.175
 shells-0.169
oms-0.168
 pump-0.166
ins-0.164
Token not
Token ID407
Feature activation+2.26e-02

Pos logit contributions
zag+0.048
ggie+0.047
isk+0.044
 x+0.043
 Maxwell+0.043
Neg logit contributions
 pulp-0.067
 shells-0.061
 flags-0.060
 pump-0.060
ment-0.059
Token lie
Token ID6486
Feature activation+1.16e-02

Pos logit contributions
ggie+0.032
 homework+0.032
zag+0.031
 operation+0.030
 Maxwell+0.030
Neg logit contributions
 pulp-0.040
 pump-0.035
 fishes-0.034
 flags-0.033
oms-0.033
Token.
Token ID13
Feature activation+3.44e-02

Pos logit contributions
 homework+0.019
ggie+0.018
 operation+0.018
zag+0.018
 x+0.018
Neg logit contributions
 pulp-0.019
oms-0.017
ment-0.016
 fishes-0.016
 flags-0.016
Token<|endoftext|>
Token ID50256
Feature activation-3.92e-03

Pos logit contributions
 homework+0.073
ggie+0.072
zag+0.070
 operation+0.068
 robber+0.067
Neg logit contributions
 pulp-0.036
ment-0.031
Happy-0.031
lan-0.030
 sunlight-0.029
Token the
Token ID262
Feature activation+5.91e-02

Pos logit contributions
zag+0.026
 homework+0.025
ggie+0.024
 Maxwell+0.024
 skate+0.024
Neg logit contributions
 pulp-0.034
 flags-0.030
 shells-0.030
 fishes-0.030
 ge-0.030
Token right
Token ID826
Feature activation+2.44e-02

Pos logit contributions
ggie+0.090
isk+0.090
zed+0.086
 Maxwell+0.086
scar+0.081
Neg logit contributions
 pulp-0.096
 pump-0.092
 shells-0.092
 ducks-0.091
 flags-0.089
Token equipment
Token ID5112
Feature activation+4.07e-02

Pos logit contributions
 homework+0.034
 Maxwell+0.034
ggie+0.033
 x+0.031
 skate+0.031
Neg logit contributions
 pulp-0.045
ment-0.041
 shells-0.040
 pump-0.039
 flags-0.038
Token.
Token ID13
Feature activation+8.90e-03

Pos logit contributions
 homework+0.078
ggie+0.078
zag+0.078
 Maxwell+0.076
 operation+0.075
Neg logit contributions
 pulp-0.047
oms-0.044
als-0.042
 shells-0.040
 flags-0.040
Token She
Token ID1375
Feature activation+3.54e-02

Pos logit contributions
 homework+0.014
zag+0.013
ggie+0.013
 operation+0.013
 skate+0.012
Neg logit contributions
 pulp-0.016
oms-0.014
als-0.014
 fishes-0.014
ment-0.013
Token didn
Token ID1422
Feature activation+1.00e-01

Pos logit contributions
 math+0.073
 homework+0.072
zag+0.071
 Maxwell+0.069
 operation+0.068
Neg logit contributions
 pulp-0.038
 cheered-0.031
 ge-0.031
 fishes-0.030
 ducks-0.030
Token't
Token ID470
Feature activation+2.02e-02

Pos logit contributions
isk+0.164
pert+0.152
 rider+0.150
 robber+0.145
 mask+0.143
Neg logit contributions
oms-0.142
Happy-0.124
izz-0.123
 them-0.115
ades-0.115
Token like
Token ID588
Feature activation+3.98e-02

Pos logit contributions
 homework+0.037
zag+0.036
ggie+0.036
 math+0.035
 x+0.034
Neg logit contributions
 pulp-0.026
 pump-0.022
Happy-0.022
 cheered-0.022
 flags-0.022
Token this
Token ID428
Feature activation+4.44e-02

Pos logit contributions
zag+0.077
ggie+0.074
 rider+0.069
 homework+0.068
isk+0.068
Neg logit contributions
 pulp-0.052
 shells-0.051
 flags-0.051
 pump-0.049
 whales-0.047
Token,
Token ID11
Feature activation+5.13e-02

Pos logit contributions
ggie+0.081
 homework+0.080
 Maxwell+0.079
 Rat+0.078
zag+0.077
Neg logit contributions
oms-0.049
 pulp-0.047
 flags-0.046
 pump-0.046
 ge-0.045
Token so
Token ID523
Feature activation-6.38e-04

Pos logit contributions
zag+0.085
ggie+0.078
isk+0.076
 homework+0.073
icycle+0.071
Neg logit contributions
 pulp-0.093
 shells-0.083
 ge-0.083
 pump-0.082
 flags-0.082
Token phone
Token ID3072
Feature activation+3.07e-02

Pos logit contributions
 pulp+0.014
 fishes+0.013
oms+0.013
ment+0.012
 banners+0.012
Neg logit contributions
 homework-0.010
 operation-0.009
 not-0.009
ggie-0.009
 math-0.009
Token was
Token ID373
Feature activation+2.79e-02

Pos logit contributions
 homework+0.049
ggie+0.049
 operation+0.046
 skate+0.046
 Maxwell+0.045
Neg logit contributions
 pulp-0.047
als-0.044
oms-0.043
 flags-0.042
ment-0.042
Token difficult
Token ID2408
Feature activation+2.62e-02

Pos logit contributions
zag+0.031
 homework+0.030
ggie+0.029
 skate+0.029
 Maxwell+0.029
Neg logit contributions
 pulp-0.057
 shells-0.050
 flags-0.050
 seeds-0.049
 fishes-0.049
Token.
Token ID13
Feature activation+3.74e-02

Pos logit contributions
ggie+0.048
zag+0.048
 Maxwell+0.046
isk+0.046
 homework+0.046
Neg logit contributions
oms-0.031
 pulp-0.031
ment-0.030
 flags-0.029
 shells-0.029
Token She
Token ID1375
Feature activation+1.92e-02

Pos logit contributions
zag+0.073
ggie+0.072
 homework+0.068
 skate+0.067
 rider+0.065
Neg logit contributions
 pulp-0.052
als-0.048
oms-0.044
 sunlight-0.043
 cups-0.043
Token didn
Token ID1422
Feature activation+1.00e-01

Pos logit contributions
 homework+0.037
zag+0.036
ggie+0.036
 math+0.035
 operation+0.035
Neg logit contributions
 pulp-0.026
 fishes-0.021
 cheered-0.020
als-0.020
 flags-0.020
Token't
Token ID470
Feature activation+1.87e-02

Pos logit contributions
isk+0.156
 rider+0.146
pert+0.143
 robber+0.142
belt+0.141
Neg logit contributions
oms-0.150
Happy-0.135
izz-0.129
ades-0.123
qu-0.123
Token know
Token ID760
Feature activation+4.01e-03

Pos logit contributions
 homework+0.035
ggie+0.034
zag+0.034
 math+0.033
 operation+0.032
Neg logit contributions
 pulp-0.024
 pump-0.021
oms-0.020
 fishes-0.020
 cheered-0.020
Token how
Token ID703
Feature activation-1.62e-02

Pos logit contributions
 homework+0.007
zag+0.006
ggie+0.006
 operation+0.006
 math+0.006
Neg logit contributions
 pulp-0.007
oms-0.006
 fishes-0.006
ment-0.006
 shells-0.005
Token to
Token ID284
Feature activation+9.50e-03

Pos logit contributions
 pulp+0.034
 fishes+0.030
oms+0.029
 banners+0.029
als+0.029
Neg logit contributions
 homework-0.021
 not-0.020
 math-0.020
 operation-0.019
ggie-0.018
Token call
Token ID869
Feature activation+2.28e-02

Pos logit contributions
 homework+0.015
 math+0.014
zag+0.013
ggie+0.013
 operation+0.013
Neg logit contributions
 pulp-0.016
 fishes-0.014
oms-0.014
 flags-0.014
 pump-0.014
Token,
Token ID11
Feature activation+5.96e-02

Pos logit contributions
ggie+0.045
 homework+0.044
isk+0.042
zag+0.042
liv+0.039
Neg logit contributions
oms-0.057
ment-0.054
 flags-0.052
 fishes-0.052
 shells-0.051
Token but
Token ID475
Feature activation+3.58e-02

Pos logit contributions
zag+0.110
 homework+0.107
icycle+0.105
ggie+0.102
 operation+0.099
Neg logit contributions
 pulp-0.080
 ge-0.076
 ducks-0.075
 fishes-0.075
 cheered-0.075
Token he
Token ID339
Feature activation+4.94e-02

Pos logit contributions
 homework+0.062
zag+0.062
 x+0.059
isk+0.058
 operation+0.058
Neg logit contributions
 cheered-0.049
 pulp-0.049
 ge-0.048
 shells-0.047
 fishes-0.047
Token played
Token ID2826
Feature activation+7.05e-02

Pos logit contributions
 homework+0.106
 math+0.101
 operation+0.099
 x+0.099
 exam+0.098
Neg logit contributions
 cheered-0.056
 admired-0.053
 applauded-0.046
 ge-0.045
 ducks-0.045
Token his
Token ID465
Feature activation+8.58e-02

Pos logit contributions
omm+0.139
zag+0.137
scar+0.137
isk+0.136
ibo+0.133
Neg logit contributions
 cheered-0.083
 flags-0.072
Happy-0.071
 them-0.066
 applauded-0.061
Token violin
Token ID38283
Feature activation+1.00e-01

Pos logit contributions
isk+0.148
omm+0.122
scar+0.121
 cavity+0.119
 Rat+0.116
Neg logit contributions
 flags-0.136
 faces-0.125
 banners-0.122
 floats-0.122
 boats-0.119
Token beautifully
Token ID21104
Feature activation+5.63e-02

Pos logit contributions
omm+0.211
scar+0.210
liv+0.203
isk+0.196
innie+0.191
Neg logit contributions
 flags-0.093
 cheered-0.090
.-0.085
als-0.080
 banners-0.079
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 x+0.144
 homework+0.141
omm+0.141
 robber+0.140
isk+0.140
Neg logit contributions
ment-0.020
 cheered-0.020
 flags-0.017
als-0.017
Happy-0.017
Token The
Token ID383
Feature activation+1.19e-02

Pos logit contributions
 homework+0.027
ggie+0.025
 operation+0.025
 x+0.025
 exam+0.024
Neg logit contributions
 pulp-0.018
 fishes-0.016
ment-0.015
oms-0.015
 cheered-0.015
Token people
Token ID661
Feature activation-2.91e-02

Pos logit contributions
 homework+0.018
zag+0.018
isk+0.017
ggie+0.017
 math+0.017
Neg logit contributions
 pulp-0.019
oms-0.018
 flags-0.018
 banners-0.018
 fishes-0.018
Token cl
Token ID537
Feature activation-1.89e-02

Pos logit contributions
 pulp+0.056
ment+0.047
 fishes+0.046
lan+0.046
oms+0.045
Neg logit contributions
 homework-0.042
 not-0.041
zag-0.040
 operation-0.040
ggie-0.040
Tokenals
Token ID874
Feature activation-5.16e-02

Pos logit contributions
 pulp+0.058
ins+0.041
Happy+0.038
ment+0.037
 pans+0.037
Neg logit contributions
 robot-0.139
 operation-0.137
 math-0.136
 mask-0.134
 Patch-0.134
Token and
Token ID290
Feature activation-6.69e-02

Pos logit contributions
 pulp+0.124
 fishes+0.108
 ge+0.107
ins+0.105
lan+0.105
Neg logit contributions
 not-0.048
zag-0.046
 homework-0.041
 operation-0.039
case-0.038
Token bloom
Token ID29955
Feature activation-8.53e-02

Pos logit contributions
oms+0.116
 pulp+0.115
ins+0.113
ocol+0.108
ment+0.108
Neg logit contributions
 not-0.098
 Max-0.083
 homework-0.081
 wouldn-0.078
 because-0.077
Tokened
Token ID276
Feature activation-5.44e-02

Pos logit contributions
 pulp+0.140
ocol+0.115
 whales+0.114
 pump+0.112
 ge+0.110
Neg logit contributions
zag-0.146
 not-0.144
chy-0.140
ggie-0.138
 because-0.127
Token in
Token ID287
Feature activation-5.78e-02

Pos logit contributions
 pulp+0.112
oms+0.102
ins+0.091
lan+0.091
 ge+0.090
Neg logit contributions
 not-0.062
 math-0.062
 homework-0.061
 because-0.057
 operation-0.057
Token bright
Token ID6016
Feature activation-1.34e-01

Pos logit contributions
 pulp+0.142
oms+0.138
unn+0.136
ins+0.134
suits+0.127
Neg logit contributions
 not-0.049
 math-0.045
 because-0.043
 Max-0.039
 x-0.037
Token colors
Token ID7577
Feature activation-7.53e-02

Pos logit contributions
 pulp+0.305
iers+0.288
fed+0.278
Sorry+0.277
lan+0.274
Neg logit contributions
 my-0.111
 so-0.106
 because-0.105
 Wednesday-0.087
 and-0.087
Token -
Token ID532
Feature activation-2.78e-02

Pos logit contributions
 pulp+0.159
 ge+0.134
 pans+0.128
 islands+0.127
 banners+0.126
Neg logit contributions
 not-0.095
zag-0.089
 because-0.088
ggie-0.084
chy-0.081
Token red
Token ID2266
Feature activation-7.39e-02

Pos logit contributions
 pulp+0.051
oms+0.044
ment+0.044
 fishes+0.043
 islands+0.042
Neg logit contributions
 not-0.040
 homework-0.040
 math-0.038
 operation-0.037
ggie-0.035
Token,
Token ID11
Feature activation-3.43e-02

Pos logit contributions
 pulp+0.140
Happy+0.127
ins+0.122
oms+0.121
 cheered+0.120
Neg logit contributions
 because-0.087
 math-0.085
 not-0.082
 cushion-0.082
 operation-0.077
Token yellow
Token ID7872
Feature activation-6.88e-02

Pos logit contributions
 pulp+0.060
oms+0.053
ment+0.053
 fishes+0.051
ins+0.050
Neg logit contributions
 not-0.052
 homework-0.048
 math-0.047
 operation-0.045
 x-0.044
Token big
Token ID1263
Feature activation-3.12e-02

Pos logit contributions
 pulp+0.068
ment+0.063
als+0.061
oms+0.061
ins+0.057
Neg logit contributions
 homework-0.052
 not-0.052
 skate-0.051
 robot-0.050
 math-0.050
Token reef
Token ID25088
Feature activation-8.67e-03

Pos logit contributions
 pulp+0.048
ment+0.042
Happy+0.041
als+0.041
oms+0.041
Neg logit contributions
 homework-0.053
 operation-0.052
 skate-0.051
 math-0.051
 robot-0.051
Token with
Token ID351
Feature activation-5.68e-02

Pos logit contributions
 pulp+0.016
 fishes+0.013
oms+0.013
ment+0.013
 banners+0.013
Neg logit contributions
 homework-0.013
 not-0.012
 operation-0.012
ggie-0.012
 math-0.012
Token lots
Token ID6041
Feature activation-7.61e-02

Pos logit contributions
oms+0.110
 pulp+0.107
ment+0.103
ins+0.100
omed+0.098
Neg logit contributions
 not-0.074
 math-0.063
 skate-0.059
 robot-0.057
 mine-0.057
Token of
Token ID286
Feature activation-1.01e-01

Pos logit contributions
 pulp+0.168
women+0.144
Happy+0.139
owers+0.139
use+0.137
Neg logit contributions
 because-0.083
 math-0.080
 not-0.075
 balancing-0.069
 skate-0.067
Token colorful
Token ID20239
Feature activation-1.30e-01

Pos logit contributions
 pulp+0.164
oms+0.157
als+0.153
ment+0.153
use+0.150
Neg logit contributions
 not-0.136
 really-0.135
 especially-0.135
 math-0.133
 his-0.133
Token fish
Token ID5916
Feature activation-5.65e-02

Pos logit contributions
 pulp+0.188
forts+0.186
Happy+0.186
Sorry+0.185
prise+0.184
Neg logit contributions
 because-0.168
 weapon-0.167
 math-0.163
 mask-0.163
 hockey-0.160
Token and
Token ID290
Feature activation-6.34e-02

Pos logit contributions
 pulp+0.112
Happy+0.100
ment+0.096
 banners+0.096
oms+0.094
Neg logit contributions
 not-0.074
 skate-0.070
zag-0.068
 homework-0.068
 balancing-0.064
Token coral
Token ID29537
Feature activation-9.74e-02

Pos logit contributions
oms+0.093
 pulp+0.088
omed+0.083
ment+0.078
ins+0.077
Neg logit contributions
 not-0.116
 math-0.103
 skate-0.102
 homework-0.101
 robot-0.098
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
 pulp+0.183
Happy+0.171
 banners+0.156
ment+0.155
izz+0.155
Neg logit contributions
 not-0.138
 because-0.112
 balancing-0.111
zag-0.110
 skate-0.109
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 pulp+0.034
oms+0.031
 fishes+0.030
 shells+0.029
ment+0.029
Neg logit contributions
 homework-0.015
 not-0.014
 math-0.014
 operation-0.013
ggie-0.013
Token,
Token ID11
Feature activation+3.43e-02

Pos logit contributions
 homework+0.106
 x+0.097
 operation+0.095
 math+0.094
 skate+0.092
Neg logit contributions
als-0.066
 pulp-0.060
ment-0.057
 fishes-0.057
 cheered-0.056
Token with
Token ID351
Feature activation-4.00e-02

Pos logit contributions
 homework+0.050
zag+0.048
 operation+0.046
ggie+0.045
isk+0.044
Neg logit contributions
 pulp-0.064
 shells-0.059
 fishes-0.059
 flags-0.058
 banners-0.057
Token all
Token ID477
Feature activation-1.90e-02

Pos logit contributions
 pulp+0.081
oms+0.076
ment+0.072
ins+0.071
suits+0.070
Neg logit contributions
 not-0.049
 homework-0.045
 math-0.045
ggie-0.042
 x-0.040
Token sorts
Token ID10524
Feature activation-5.88e-02

Pos logit contributions
 pulp+0.034
oms+0.030
 fishes+0.029
ment+0.028
 banners+0.028
Neg logit contributions
 homework-0.029
 not-0.027
 math-0.027
ggie-0.026
 operation-0.026
Token of
Token ID286
Feature activation-7.25e-02

Pos logit contributions
 pulp+0.120
 ge+0.098
suits+0.093
 pans+0.093
 whales+0.092
Neg logit contributions
 not-0.081
 math-0.075
 because-0.074
ggie-0.073
 robot-0.068
Token bright
Token ID6016
Feature activation-1.28e-01

Pos logit contributions
 pulp+0.137
oms+0.127
als+0.118
Sorry+0.116
women+0.115
Neg logit contributions
 math-0.096
 not-0.092
 homework-0.088
 x-0.087
ggie-0.084
Token colors
Token ID7577
Feature activation-6.59e-02

Pos logit contributions
Sorry+0.230
 pulp+0.220
women+0.200
icker+0.196
We+0.195
Neg logit contributions
 because-0.146
 er-0.142
 math-0.141
 just-0.137
 balancing-0.135
Token.
Token ID13
Feature activation-6.68e-03

Pos logit contributions
 pulp+0.148
 ge+0.129
Happy+0.126
lan+0.122
 whales+0.121
Neg logit contributions
 not-0.068
zag-0.063
 math-0.063
 because-0.060
 homework-0.059
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
 pulp+0.012
 fishes+0.010
oms+0.010
ment+0.010
 flags+0.010
Neg logit contributions
 homework-0.010
 operation-0.009
 math-0.009
ggie-0.009
 not-0.009
Token
Token ID198
Feature activation+2.95e-03

Pos logit contributions
 pulp+0.021
 fishes+0.018
oms+0.018
 banners+0.017
 flags+0.017
Neg logit contributions
 homework-0.019
ggie-0.018
 not-0.018
 math-0.018
 operation-0.017
TokenThe
Token ID464
Feature activation-2.07e-02

Pos logit contributions
 homework+0.005
 operation+0.005
zag+0.005
 x+0.005
 math+0.005
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
 banners-0.004
 flags-0.004
Token grew
Token ID6348
Feature activation-5.21e-02

Pos logit contributions
 pulp+0.101
lan+0.097
 banners+0.095
ment+0.092
 flags+0.091
Neg logit contributions
 not-0.107
zag-0.092
 wouldn-0.091
chy-0.090
 always-0.090
Token into
Token ID656
Feature activation-4.23e-02

Pos logit contributions
 pulp+0.108
 banners+0.104
oms+0.102
 flags+0.099
Happy+0.098
Neg logit contributions
 not-0.064
 because-0.050
chy-0.049
 homework-0.049
 math-0.045
Token big
Token ID1263
Feature activation-1.01e-01

Pos logit contributions
oms+0.091
 pulp+0.085
als+0.084
 banners+0.080
omed+0.079
Neg logit contributions
 not-0.050
 math-0.047
 homework-0.046
 x-0.044
 operation-0.041
Token plants
Token ID6134
Feature activation-5.12e-02

Pos logit contributions
lan+0.218
Happy+0.218
izz+0.217
ount+0.210
ins+0.209
Neg logit contributions
 because-0.091
 so-0.071
 not-0.066
 math-0.066
 bone-0.062
Token with
Token ID351
Feature activation-7.63e-02

Pos logit contributions
 pulp+0.099
Happy+0.086
 banners+0.085
ment+0.085
oms+0.084
Neg logit contributions
 not-0.074
 homework-0.066
 because-0.065
zag-0.064
 math-0.061
Token beautiful
Token ID4950
Feature activation-1.27e-01

Pos logit contributions
oms+0.168
ins+0.152
suits+0.150
unn+0.149
ucks+0.147
Neg logit contributions
 not-0.098
 only-0.070
 no-0.069
 so-0.068
 his-0.068
Token flowers
Token ID12734
Feature activation-6.26e-02

Pos logit contributions
prise+0.199
izz+0.196
Sorry+0.196
oms+0.195
 cheered+0.192
Neg logit contributions
 because-0.165
 just-0.161
 not-0.160
 always-0.148
 math-0.148
Token.
Token ID13
Feature activation+2.41e-04

Pos logit contributions
 pulp+0.120
 banners+0.106
Happy+0.105
 fishes+0.104
oms+0.102
Neg logit contributions
 not-0.089
 because-0.083
 homework-0.075
zag-0.075
chy-0.075
Token The
Token ID383
Feature activation-4.34e-02

Pos logit contributions
 homework+0.000
ggie+0.000
zag+0.000
 operation+0.000
 math+0.000
Neg logit contributions
 pulp-0.000
 fishes-0.000
oms-0.000
 flags-0.000
 banners-0.000
Token farmer
Token ID18739
Feature activation-1.92e-02

Pos logit contributions
oms+0.078
 pulp+0.075
ment+0.074
 cheered+0.072
Happy+0.069
Neg logit contributions
 operation-0.062
 not-0.062
 homework-0.059
 robber-0.058
 shelf-0.054
Token was
Token ID373
Feature activation+6.93e-03

Pos logit contributions
 pulp+0.034
oms+0.031
ment+0.030
 banners+0.030
 fishes+0.030
Neg logit contributions
 homework-0.028
 not-0.028
ggie-0.027
zag-0.026
 operation-0.026
Token big
Token ID1263
Feature activation-7.21e-03

Pos logit contributions
 pulp+0.001
 fishes+0.001
oms+0.001
 flags+0.001
 banners+0.001
Neg logit contributions
 homework-0.001
 operation-0.001
ggie-0.001
 math-0.001
zag-0.001
Token reef
Token ID25088
Feature activation-2.29e-02

Pos logit contributions
 pulp+0.011
oms+0.010
 fishes+0.009
ment+0.009
 banners+0.009
Neg logit contributions
 homework-0.013
 operation-0.012
 math-0.012
ggie-0.012
 not-0.012
Token with
Token ID351
Feature activation-5.54e-02

Pos logit contributions
 pulp+0.044
 fishes+0.037
 banners+0.037
oms+0.037
ment+0.037
Neg logit contributions
 homework-0.033
 not-0.032
zag-0.030
 math-0.030
 operation-0.030
Token lots
Token ID6041
Feature activation-8.23e-02

Pos logit contributions
oms+0.121
ment+0.112
omed+0.112
 pulp+0.111
use+0.108
Neg logit contributions
 not-0.062
 math-0.053
 robot-0.048
 homework-0.046
 really-0.045
Token of
Token ID286
Feature activation-9.10e-02

Pos logit contributions
 pulp+0.190
women+0.168
use+0.165
Sorry+0.161
Happy+0.160
Neg logit contributions
 because-0.079
 math-0.075
 bone-0.068
 not-0.066
 about-0.063
Token colorful
Token ID20239
Feature activation-1.26e-01

Pos logit contributions
 pulp+0.158
oms+0.155
omed+0.152
use+0.149
ment+0.148
Neg logit contributions
 math-0.123
 not-0.118
 his-0.115
 really-0.113
 just-0.109
Token coral
Token ID29537
Feature activation-9.31e-02

Pos logit contributions
ount+0.196
izz+0.193
forts+0.189
iza+0.185
Sorry+0.184
Neg logit contributions
 math-0.169
 mask-0.159
 hockey-0.156
 robot-0.156
 weapon-0.155
Token.
Token ID13
Feature activation-3.55e-02

Pos logit contributions
 pulp+0.190
Happy+0.169
ocol+0.167
Sorry+0.166
 banners+0.166
Neg logit contributions
 not-0.117
 because-0.102
 mask-0.091
zag-0.089
 math-0.088
Token Nem
Token ID22547
Feature activation+3.60e-03

Pos logit contributions
 pulp+0.069
oms+0.061
ment+0.061
 banners+0.060
 flags+0.059
Neg logit contributions
 not-0.049
 homework-0.043
 skate-0.042
 Max-0.041
 math-0.041
Tokeno
Token ID78
Feature activation-1.21e-02

Pos logit contributions
 homework+0.005
 operation+0.005
 math+0.005
 x+0.005
ggie+0.005
Neg logit contributions
 pulp-0.007
 fishes-0.006
oms-0.006
ment-0.006
 ge-0.006
Token wanted
Token ID2227
Feature activation-8.60e-03

Pos logit contributions
 pulp+0.022
oms+0.019
ment+0.019
 fishes+0.018
 banners+0.018
Neg logit contributions
 homework-0.019
 not-0.018
ggie-0.017
zag-0.017
 operation-0.017
Token she
Token ID673
Feature activation-1.04e-02

Pos logit contributions
 homework+0.008
zag+0.007
 operation+0.007
ggie+0.007
 skate+0.007
Neg logit contributions
 pulp-0.010
 fishes-0.008
oms-0.008
ment-0.008
 banners-0.008
Token found
Token ID1043
Feature activation-2.56e-02

Pos logit contributions
 pulp+0.013
 fishes+0.010
oms+0.010
ment+0.010
 banners+0.010
Neg logit contributions
 homework-0.022
 not-0.021
 operation-0.020
ggie-0.020
zag-0.020
Token big
Token ID1263
Feature activation-3.13e-02

Pos logit contributions
 pulp+0.058
oms+0.053
 banners+0.051
 fishes+0.050
 flags+0.050
Neg logit contributions
 not-0.026
 homework-0.025
ggie-0.024
 math-0.024
 operation-0.024
Token flowers
Token ID12734
Feature activation-3.09e-02

Pos logit contributions
 pulp+0.053
Happy+0.047
ment+0.045
lan+0.044
oms+0.043
Neg logit contributions
 homework-0.049
 operation-0.047
 math-0.047
 x-0.046
 not-0.045
Token with
Token ID351
Feature activation-6.09e-02

Pos logit contributions
 pulp+0.065
 fishes+0.055
Happy+0.054
 banners+0.054
 ge+0.054
Neg logit contributions
 not-0.039
 homework-0.037
zag-0.036
 math-0.034
ggie-0.034
Token colorful
Token ID20239
Feature activation-1.25e-01

Pos logit contributions
oms+0.112
 pulp+0.110
ment+0.106
ins+0.102
Happy+0.101
Neg logit contributions
 not-0.080
 math-0.073
 homework-0.070
ggie-0.064
 x-0.063
Token pet
Token ID4273
Feature activation-9.49e-02

Pos logit contributions
Sorry+0.194
 pulp+0.189
Happy+0.184
ment+0.177
prise+0.177
Neg logit contributions
 math-0.165
 er-0.161
 because-0.150
 cushion-0.150
 cord-0.143
Tokenals
Token ID874
Feature activation-6.45e-02

Pos logit contributions
 pulp+0.116
Happy+0.090
ins+0.084
 lighthouse+0.081
 cheered+0.081
Neg logit contributions
 because-0.181
 Max-0.170
 robot-0.169
 math-0.168
 Patch-0.168
Token and
Token ID290
Feature activation-5.39e-02

Pos logit contributions
 pulp+0.135
Happy+0.116
 fishes+0.114
 banners+0.113
 ge+0.112
Neg logit contributions
 not-0.083
zag-0.074
 because-0.073
chy-0.069
 homework-0.068
Token birds
Token ID10087
Feature activation-6.93e-02

Pos logit contributions
 pulp+0.097
oms+0.096
ins+0.089
Happy+0.085
ment+0.085
Neg logit contributions
 not-0.080
 homework-0.072
 math-0.070
ggie-0.065
 mask-0.062
Token picking
Token ID10868
Feature activation-7.66e-02

Pos logit contributions
 pulp+0.134
Happy+0.115
ment+0.112
 fishes+0.110
oms+0.110
Neg logit contributions
 not-0.098
zag-0.095
 homework-0.087
 because-0.086
ggie-0.084
Token their
Token ID511
Feature activation-3.87e-02

Pos logit contributions
 pulp+0.074
oms+0.065
 fishes+0.058
omed+0.058
ment+0.057
Neg logit contributions
 not-0.068
 homework-0.067
 math-0.063
ggie-0.058
 because-0.056
Token home
Token ID1363
Feature activation-6.54e-03

Pos logit contributions
 pulp+0.081
oms+0.077
ment+0.075
 fishes+0.071
omed+0.071
Neg logit contributions
 homework-0.049
 math-0.045
 operation-0.043
 not-0.042
 robot-0.041
Token with
Token ID351
Feature activation-6.72e-02

Pos logit contributions
 pulp+0.013
 fishes+0.011
oms+0.011
ment+0.011
 banners+0.011
Neg logit contributions
 homework-0.009
 operation-0.008
 not-0.008
ggie-0.008
 math-0.008
Token lots
Token ID6041
Feature activation-6.29e-02

Pos logit contributions
oms+0.131
omed+0.122
 pulp+0.121
als+0.120
plets+0.118
Neg logit contributions
 homework-0.096
 not-0.092
 math-0.089
 x-0.080
 operation-0.077
Token of
Token ID286
Feature activation-9.61e-02

Pos logit contributions
 pulp+0.138
 ge+0.108
 valleys+0.105
 whales+0.105
 banners+0.104
Neg logit contributions
 math-0.077
 homework-0.073
 because-0.072
 operation-0.068
 not-0.068
Token pretty
Token ID2495
Feature activation-1.25e-01

Pos logit contributions
plets+0.194
 pulp+0.190
als+0.187
ment+0.187
omed+0.186
Neg logit contributions
 math-0.114
 homework-0.108
 x-0.106
 favorite-0.099
 operation-0.099
Token things
Token ID1243
Feature activation-4.39e-02

Pos logit contributions
 pulp+0.216
Sorry+0.212
fed+0.211
lan+0.208
 fishermen+0.204
Neg logit contributions
 math-0.149
 homework-0.135
 belt-0.130
 sweater-0.128
 vest-0.128
Token.
Token ID13
Feature activation-1.39e-02

Pos logit contributions
 pulp+0.085
 fishes+0.069
 ge+0.068
ment+0.068
oms+0.068
Neg logit contributions
 not-0.068
 homework-0.068
 because-0.060
 math-0.060
zag-0.059
Token One
Token ID1881
Feature activation+1.28e-02

Pos logit contributions
 pulp+0.030
oms+0.027
ment+0.027
 fishes+0.026
 banners+0.026
Neg logit contributions
 homework-0.017
 not-0.015
 math-0.015
 operation-0.015
ggie-0.015
Token day
Token ID1110
Feature activation-4.22e-02

Pos logit contributions
 homework+0.018
 Maxwell+0.017
zag+0.016
 robber+0.016
 math+0.016
Neg logit contributions
 pulp-0.023
 fishes-0.021
 ge-0.021
 pump-0.021
ment-0.021
Token,
Token ID11
Feature activation-1.35e-02

Pos logit contributions
 pulp+0.103
 fishes+0.090
 islands+0.089
 banners+0.089
oms+0.088
Neg logit contributions
 not-0.042
 homework-0.040
 math-0.039
ggie-0.037
zag-0.035
Token,
Token ID11
Feature activation-1.34e-02

Pos logit contributions
 pulp+0.016
oms+0.014
 fishes+0.014
ment+0.014
 banners+0.014
Neg logit contributions
 homework-0.013
 operation-0.011
 math-0.011
ggie-0.011
 not-0.011
Token beautiful
Token ID4950
Feature activation-2.24e-02

Pos logit contributions
 pulp+0.024
oms+0.021
 fishes+0.021
ment+0.020
als+0.020
Neg logit contributions
 homework-0.021
 operation-0.019
 math-0.019
 not-0.019
 x-0.019
Token garden
Token ID11376
Feature activation-5.20e-02

Pos logit contributions
 pulp+0.041
oms+0.036
 fishes+0.035
ment+0.034
 banners+0.034
Neg logit contributions
 homework-0.033
 operation-0.032
 math-0.031
ggie-0.030
zag-0.030
Token full
Token ID1336
Feature activation-8.57e-02

Pos logit contributions
 pulp+0.094
Happy+0.076
 fishes+0.075
ment+0.074
oms+0.073
Neg logit contributions
 homework-0.081
 not-0.079
 math-0.076
zag-0.075
ggie-0.074
Token of
Token ID286
Feature activation-7.58e-02

Pos logit contributions
 pulp+0.183
ins+0.149
 valleys+0.145
Happy+0.145
 pans+0.143
Neg logit contributions
 because-0.100
 not-0.097
 math-0.096
 robot-0.089
 operation-0.087
Token colourful
Token ID44799
Feature activation-1.25e-01

Pos logit contributions
 pulp+0.124
oms+0.115
omed+0.109
ment+0.108
lan+0.105
Neg logit contributions
 math-0.123
 homework-0.115
 not-0.113
 operation-0.107
 exercise-0.107
Token flowers
Token ID12734
Feature activation-6.42e-02

Pos logit contributions
 pulp+0.199
Sorry+0.185
izz+0.184
fed+0.180
forts+0.174
Neg logit contributions
 math-0.187
 robot-0.160
 cushion-0.158
 because-0.157
 mask-0.154
Token.
Token ID13
Feature activation+3.46e-03

Pos logit contributions
 pulp+0.133
ment+0.108
 fishes+0.104
Happy+0.103
oms+0.103
Neg logit contributions
 not-0.085
 because-0.083
 homework-0.081
zag-0.079
 math-0.077
Token The
Token ID383
Feature activation-1.58e-02

Pos logit contributions
 homework+0.006
ggie+0.005
 operation+0.005
zag+0.005
 not+0.005
Neg logit contributions
 pulp-0.006
 fishes-0.005
oms-0.005
 flags-0.005
 banners-0.005
Token child
Token ID1200
Feature activation-3.13e-03

Pos logit contributions
 pulp+0.031
oms+0.027
ment+0.027
 fishes+0.026
als+0.025
Neg logit contributions
 homework-0.023
 operation-0.021
 x-0.021
 math-0.021
 not-0.020
Token loved
Token ID6151
Feature activation+9.95e-04

Pos logit contributions
 pulp+0.005
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.004
Neg logit contributions
 homework-0.005
 operation-0.005
ggie-0.005
 math-0.005
 not-0.005
Token big
Token ID1263
Feature activation+1.35e-03

Pos logit contributions
 pulp+0.011
oms+0.010
 fishes+0.009
ment+0.009
 banners+0.009
Neg logit contributions
 homework-0.010
 operation-0.009
ggie-0.009
 math-0.009
 not-0.009
Token pond
Token ID16723
Feature activation-1.57e-02

Pos logit contributions
 homework+0.002
ggie+0.002
 operation+0.002
zag+0.002
 not+0.002
Neg logit contributions
 pulp-0.002
 fishes-0.002
oms-0.002
 flags-0.002
 banners-0.002
Token with
Token ID351
Feature activation-5.70e-02

Pos logit contributions
 pulp+0.033
oms+0.029
 fishes+0.029
ment+0.028
 banners+0.028
Neg logit contributions
 homework-0.019
 not-0.019
ggie-0.018
zag-0.018
 math-0.018
Token lots
Token ID6041
Feature activation-7.28e-02

Pos logit contributions
oms+0.118
 pulp+0.116
ment+0.110
omed+0.105
ins+0.105
Neg logit contributions
 not-0.067
 math-0.058
 robot-0.053
ggie-0.052
 skate-0.052
Token of
Token ID286
Feature activation-9.16e-02

Pos logit contributions
 pulp+0.163
Happy+0.133
lan+0.132
women+0.132
Sorry+0.129
Neg logit contributions
 because-0.079
 math-0.077
 not-0.071
 bone-0.067
 robot-0.066
Token colorful
Token ID20239
Feature activation-1.25e-01

Pos logit contributions
 pulp+0.152
oms+0.141
lan+0.134
ment+0.134
use+0.130
Neg logit contributions
 not-0.128
 math-0.127
 exercise-0.124
 bone-0.123
 really-0.120
Token rocks
Token ID12586
Feature activation-5.16e-02

Pos logit contributions
Happy+0.197
 pulp+0.196
iza+0.194
prise+0.192
forts+0.192
Neg logit contributions
 bone-0.153
 math-0.150
 hockey-0.145
 exercise-0.143
 because-0.140
Token.
Token ID13
Feature activation-3.42e-03

Pos logit contributions
 pulp+0.107
Happy+0.093
 fishes+0.088
ment+0.088
 banners+0.088
Neg logit contributions
 not-0.068
zag-0.065
 homework-0.063
 skate-0.060
 because-0.059
Token He
Token ID679
Feature activation-1.62e-02

Pos logit contributions
 pulp+0.007
 fishes+0.006
oms+0.006
ment+0.006
 flags+0.006
Neg logit contributions
 homework-0.004
 operation-0.004
ggie-0.004
 math-0.004
zag-0.004
Token walked
Token ID6807
Feature activation+4.98e-03

Pos logit contributions
 pulp+0.025
oms+0.021
ment+0.021
 fishes+0.020
 banners+0.020
Neg logit contributions
 homework-0.028
 not-0.028
ggie-0.027
zag-0.027
 skate-0.027
Token closer
Token ID5699
Feature activation-5.38e-03

Pos logit contributions
 homework+0.008
 operation+0.007
ggie+0.007
 x+0.007
 math+0.007
Neg logit contributions
 pulp-0.009
 fishes-0.008
oms-0.008
 flags-0.008
 shells-0.008
Token was
Token ID373
Feature activation+2.35e-02

Pos logit contributions
 homework+0.040
 math+0.037
 operation+0.037
 exam+0.036
 robber+0.035
Neg logit contributions
 pulp-0.032
 fishes-0.028
 cheered-0.027
als-0.027
 shells-0.027
Token big
Token ID1263
Feature activation-3.38e-03

Pos logit contributions
 homework+0.039
zag+0.038
 operation+0.037
ggie+0.036
 rider+0.035
Neg logit contributions
 pulp-0.040
 shells-0.034
 fishes-0.034
 ducks-0.033
 grains-0.033
Token and
Token ID290
Feature activation+7.80e-03

Pos logit contributions
 pulp+0.007
 fishes+0.006
oms+0.006
ment+0.006
 flags+0.006
Neg logit contributions
 homework-0.004
 operation-0.004
ggie-0.004
 math-0.004
zag-0.004
Token full
Token ID1336
Feature activation-6.68e-02

Pos logit contributions
 homework+0.014
 operation+0.013
ggie+0.013
zag+0.013
 math+0.012
Neg logit contributions
 pulp-0.013
 fishes-0.011
 shells-0.010
 flags-0.010
oms-0.010
Token of
Token ID286
Feature activation-8.20e-02

Pos logit contributions
 pulp+0.141
ins+0.118
 valleys+0.117
 banners+0.116
 pans+0.114
Neg logit contributions
 not-0.083
 because-0.082
 math-0.074
 Max-0.073
 so-0.072
Token colourful
Token ID44799
Feature activation-1.24e-01

Pos logit contributions
 pulp+0.131
oms+0.125
als+0.124
omed+0.122
ment+0.122
Neg logit contributions
 not-0.127
 math-0.123
 especially-0.112
 really-0.110
 homework-0.110
Token fish
Token ID5916
Feature activation-4.01e-02

Pos logit contributions
izz+0.194
arl+0.186
 pulp+0.184
ount+0.182
forts+0.181
Neg logit contributions
 math-0.176
 because-0.162
 hockey-0.160
 robot-0.155
 vest-0.154
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
 pulp+0.069
 banners+0.060
Happy+0.059
oms+0.059
ment+0.057
Neg logit contributions
 not-0.064
 homework-0.062
zag-0.059
 math-0.059
ggie-0.058
Token Tom
Token ID4186
Feature activation+1.55e-02

Pos logit contributions
 operation+0.032
 homework+0.032
 exam+0.030
zag+0.030
 x+0.030
Neg logit contributions
 pulp-0.024
 ge-0.021
oms-0.021
ment-0.021
 flags-0.021
Token thought
Token ID1807
Feature activation+9.79e-03

Pos logit contributions
 homework+0.030
 operation+0.028
 math+0.027
 exam+0.026
zag+0.026
Neg logit contributions
 pulp-0.023
 fishes-0.020
 cheered-0.019
 shells-0.019
 ge-0.019
Token it
Token ID340
Feature activation+4.30e-02

Pos logit contributions
 homework+0.016
zag+0.016
 operation+0.015
isk+0.015
ggie+0.015
Neg logit contributions
 pulp-0.016
oms-0.014
 shells-0.014
 fishes-0.014
 flags-0.014
Token and
Token ID290
Feature activation-1.99e-02

Pos logit contributions
 pulp+0.036
 fishes+0.031
oms+0.031
 ge+0.030
ment+0.030
Neg logit contributions
 homework-0.021
 not-0.021
zag-0.020
 math-0.019
ggie-0.019
Token picked
Token ID6497
Feature activation-2.08e-02

Pos logit contributions
 pulp+0.033
oms+0.030
ment+0.029
 fishes+0.028
 islands+0.028
Neg logit contributions
 homework-0.032
 not-0.032
 math-0.030
ggie-0.030
 operation-0.030
Token out
Token ID503
Feature activation-1.10e-02

Pos logit contributions
 pulp+0.043
oms+0.039
 banners+0.038
ment+0.038
 fishes+0.037
Neg logit contributions
 homework-0.025
 not-0.023
 math-0.023
ggie-0.022
 operation-0.022
Token lots
Token ID6041
Feature activation-6.95e-02

Pos logit contributions
 pulp+0.022
oms+0.019
 fishes+0.019
ment+0.019
 banners+0.018
Neg logit contributions
 homework-0.015
 not-0.013
 math-0.013
ggie-0.013
 operation-0.013
Token of
Token ID286
Feature activation-8.40e-02

Pos logit contributions
 pulp+0.161
Sorry+0.128
women+0.127
lan+0.127
 whales+0.126
Neg logit contributions
 math-0.080
 because-0.064
 homework-0.063
 operation-0.061
 x-0.059
Token colorful
Token ID20239
Feature activation-1.24e-01

Pos logit contributions
 pulp+0.163
lan+0.160
plets+0.160
ment+0.159
Sorry+0.157
Neg logit contributions
 math-0.104
 x-0.097
 homework-0.092
 operation-0.088
 cord-0.088
Token flowers
Token ID12734
Feature activation-6.04e-02

Pos logit contributions
Sorry+0.209
 pulp+0.206
arl+0.204
forts+0.202
fed+0.202
Neg logit contributions
 math-0.157
 er-0.138
 cord-0.127
 exercise-0.127
 puzzles-0.124
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 pulp+0.124
 fishes+0.100
 ge+0.100
ment+0.099
 banners+0.097
Neg logit contributions
 not-0.083
 because-0.080
 homework-0.076
zag-0.075
 math-0.073
Token They
Token ID1119
Feature activation-5.06e-02

Pos logit contributions
 homework+0.019
ggie+0.018
 operation+0.017
zag+0.017
 x+0.017
Neg logit contributions
 pulp-0.019
 fishes-0.017
oms-0.016
als-0.016
 ge-0.016
Token placed
Token ID4624
Feature activation-3.42e-02

Pos logit contributions
ment+0.095
oms+0.091
 pulp+0.089
 fishes+0.082
ins+0.080
Neg logit contributions
 not-0.071
 always-0.064
 homework-0.064
 drib-0.063
 wouldn-0.059
Token the
Token ID262
Feature activation-5.46e-02

Pos logit contributions
 pulp+0.058
oms+0.054
ment+0.050
Happy+0.049
 fishes+0.048
Neg logit contributions
 homework-0.051
 not-0.051
 math-0.048
 x-0.046
 Maxwell-0.046
Token and
Token ID290
Feature activation-6.01e-02

Pos logit contributions
 pulp+0.090
 banners+0.077
Happy+0.077
ment+0.075
oms+0.075
Neg logit contributions
 not-0.054
 homework-0.048
 math-0.046
 because-0.046
zag-0.046
Token trees
Token ID7150
Feature activation-5.45e-02

Pos logit contributions
oms+0.094
 pulp+0.092
omed+0.087
ment+0.082
ins+0.079
Neg logit contributions
 not-0.103
 homework-0.099
 math-0.097
 robot-0.093
 operation-0.089
Token and
Token ID290
Feature activation-6.44e-02

Pos logit contributions
 pulp+0.110
Happy+0.094
oms+0.092
ment+0.092
 fishes+0.090
Neg logit contributions
 not-0.079
zag-0.069
 because-0.067
 homework-0.066
 skate-0.062
Token lots
Token ID6041
Feature activation-7.02e-02

Pos logit contributions
 pulp+0.103
oms+0.103
omed+0.097
ment+0.096
izz+0.091
Neg logit contributions
 not-0.102
 math-0.099
 homework-0.099
 robot-0.097
 operation-0.090
Token of
Token ID286
Feature activation-8.30e-02

Pos logit contributions
 pulp+0.164
 pans+0.125
 banners+0.121
suits+0.121
ment+0.120
Neg logit contributions
 because-0.086
 math-0.082
 robot-0.073
 not-0.071
 about-0.069
Token green
Token ID4077
Feature activation-1.21e-01

Pos logit contributions
 pulp+0.161
oms+0.152
ment+0.147
omed+0.142
als+0.140
Neg logit contributions
 math-0.108
 robot-0.105
 x-0.102
 homework-0.101
 operation-0.097
Token.
Token ID13
Feature activation-3.51e-03

Pos logit contributions
izz+0.234
iza+0.229
Sorry+0.210
 pulp+0.206
oms+0.203
Neg logit contributions
 bone-0.149
 robot-0.142
 math-0.142
 because-0.133
 homework-0.131
Token But
Token ID887
Feature activation+8.91e-03

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.005
 operation-0.005
ggie-0.005
 math-0.005
 not-0.005
Token then
Token ID788
Feature activation-1.18e-03

Pos logit contributions
 homework+0.013
ggie+0.012
zag+0.011
 operation+0.011
 x+0.011
Neg logit contributions
 pulp-0.017
 fishes-0.015
 shells-0.015
 flags-0.015
 islands-0.015
Token something
Token ID1223
Feature activation-6.07e-03

Pos logit contributions
 pulp+0.002
 fishes+0.002
oms+0.002
ment+0.002
 flags+0.002
Neg logit contributions
 homework-0.002
 operation-0.002
ggie-0.002
zag-0.002
 math-0.002
Token bad
Token ID2089
Feature activation-3.63e-03

Pos logit contributions
 pulp+0.012
oms+0.010
 fishes+0.010
ment+0.010
 banners+0.010
Neg logit contributions
 homework-0.009
 operation-0.008
ggie-0.008
 not-0.008
 math-0.008
Token big
Token ID1263
Feature activation-3.24e-02

Pos logit contributions
 pulp+0.069
ment+0.064
oms+0.062
 fishes+0.062
ins+0.060
Neg logit contributions
 homework-0.056
 math-0.054
 not-0.053
 robot-0.052
zag-0.052
Token pond
Token ID16723
Feature activation-5.61e-02

Pos logit contributions
 pulp+0.057
ment+0.051
oms+0.050
ins+0.050
 fishes+0.050
Neg logit contributions
 homework-0.048
 math-0.046
 robot-0.045
 operation-0.045
 x-0.045
Token and
Token ID290
Feature activation-3.38e-02

Pos logit contributions
 pulp+0.101
 fishes+0.087
Happy+0.087
oms+0.085
ment+0.084
Neg logit contributions
 not-0.084
 homework-0.080
zag-0.078
ggie-0.075
 skate-0.075
Token lots
Token ID6041
Feature activation-7.96e-02

Pos logit contributions
 pulp+0.070
oms+0.066
 fishes+0.061
ment+0.061
ins+0.060
Neg logit contributions
 not-0.040
 homework-0.040
 math-0.037
ggie-0.035
 skate-0.034
Token of
Token ID286
Feature activation-8.77e-02

Pos logit contributions
 pulp+0.179
use+0.150
suits+0.148
 islands+0.143
lan+0.142
Neg logit contributions
 math-0.088
 because-0.078
 about-0.074
 operation-0.074
 robot-0.072
Token colorful
Token ID20239
Feature activation-1.20e-01

Pos logit contributions
 pulp+0.132
oms+0.121
als+0.113
ment+0.113
omed+0.112
Neg logit contributions
 math-0.144
 homework-0.136
 x-0.133
 not-0.132
 mask-0.129
Token flowers
Token ID12734
Feature activation-8.37e-02

Pos logit contributions
 pulp+0.166
Sorry+0.164
forts+0.164
use+0.159
izz+0.157
Neg logit contributions
 math-0.179
 mask-0.167
 robot-0.165
 superhero-0.155
 homework-0.152
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 pulp+0.168
ment+0.140
Happy+0.139
 fishes+0.138
Sorry+0.137
Neg logit contributions
 not-0.109
zag-0.101
 because-0.098
 homework-0.093
case-0.091
Token
Token ID198
Feature activation-2.06e-02

Pos logit contributions
 pulp+0.027
oms+0.023
 fishes+0.023
 banners+0.023
 flags+0.023
Neg logit contributions
 homework-0.018
 not-0.017
 math-0.016
ggie-0.016
zag-0.016
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 pulp+0.039
 fishes+0.034
 banners+0.033
oms+0.033
 flags+0.033
Neg logit contributions
 not-0.029
 homework-0.029
ggie-0.029
zag-0.028
 math-0.028
TokenM
Token ID44
Feature activation-9.00e-03

Pos logit contributions
 pulp+0.019
oms+0.016
 fishes+0.016
ment+0.016
 flags+0.015
Neg logit contributions
 homework-0.018
ggie-0.017
 not-0.017
 math-0.017
zag-0.017
Token.
Token ID13
Feature activation+4.56e-03

Pos logit contributions
 pulp+0.047
oms+0.039
 fishes+0.038
 banners+0.038
 ge+0.037
Neg logit contributions
 homework-0.046
ggie-0.044
 not-0.044
 math-0.044
 operation-0.043
Token There
Token ID1318
Feature activation-1.14e-02

Pos logit contributions
 homework+0.005
ggie+0.005
zag+0.005
 operation+0.005
 not+0.005
Neg logit contributions
 pulp-0.010
 fishes-0.009
oms-0.009
ment-0.009
 banners-0.008
Token were
Token ID547
Feature activation-4.03e-02

Pos logit contributions
 pulp+0.019
oms+0.016
 fishes+0.016
ment+0.015
 banners+0.015
Neg logit contributions
 homework-0.019
 not-0.018
ggie-0.018
 operation-0.018
 math-0.018
Token lots
Token ID6041
Feature activation-5.69e-02

Pos logit contributions
 pulp+0.071
oms+0.065
ment+0.062
als+0.058
Happy+0.058
Neg logit contributions
 not-0.061
 homework-0.058
 math-0.056
ggie-0.054
 operation-0.052
Token of
Token ID286
Feature activation-7.66e-02

Pos logit contributions
 pulp+0.115
lan+0.085
ment+0.084
suits+0.083
eat+0.083
Neg logit contributions
 math-0.079
 not-0.071
 operation-0.071
 homework-0.070
 balancing-0.070
Token small
Token ID1402
Feature activation-1.20e-01

Pos logit contributions
 pulp+0.144
als+0.127
ment+0.122
forts+0.121
omed+0.119
Neg logit contributions
 math-0.104
 especially-0.100
 not-0.099
 homework-0.096
 x-0.095
Token fishes
Token ID49765
Feature activation-5.21e-02

Pos logit contributions
 pulp+0.192
forts+0.180
arl+0.180
Happy+0.177
fed+0.173
Neg logit contributions
 math-0.161
 because-0.160
 balancing-0.154
 robot-0.151
 muscle-0.150
Token swimming
Token ID14899
Feature activation-4.57e-02

Pos logit contributions
 pulp+0.105
Happy+0.092
ment+0.088
 banners+0.086
eat+0.085
Neg logit contributions
 not-0.067
zag-0.063
 homework-0.063
 skate-0.063
 math-0.060
Token around
Token ID1088
Feature activation-2.10e-02

Pos logit contributions
 pulp+0.107
ment+0.087
Happy+0.086
oms+0.086
lan+0.085
Neg logit contributions
zag-0.045
 not-0.045
 skate-0.042
ggie-0.042
 homework-0.041
Token his
Token ID465
Feature activation-4.60e-02

Pos logit contributions
 pulp+0.041
oms+0.036
ment+0.034
 banners+0.034
als+0.034
Neg logit contributions
 homework-0.029
 not-0.028
 math-0.027
ggie-0.027
 operation-0.026
Token hands
Token ID2832
Feature activation-2.23e-02

Pos logit contributions
 pulp+0.084
Happy+0.075
oms+0.072
ment+0.071
eat+0.070
Neg logit contributions
 not-0.069
 skate-0.066
 homework-0.065
zag-0.064
 robot-0.063
Token their
Token ID511
Feature activation-1.05e-02

Pos logit contributions
 pulp+0.050
oms+0.043
 fishes+0.042
 ge+0.041
Happy+0.040
Neg logit contributions
 homework-0.042
 not-0.039
 math-0.039
ggie-0.038
 operation-0.037
Token garden
Token ID11376
Feature activation-1.25e-02

Pos logit contributions
 pulp+0.021
oms+0.018
 fishes+0.018
ment+0.018
 banners+0.017
Neg logit contributions
 homework-0.015
 math-0.013
 operation-0.013
 not-0.013
ggie-0.013
Token with
Token ID351
Feature activation-5.73e-02

Pos logit contributions
 pulp+0.020
oms+0.017
 fishes+0.017
ment+0.016
 ge+0.016
Neg logit contributions
 homework-0.022
 math-0.021
 not-0.021
 operation-0.020
ggie-0.020
Token lots
Token ID6041
Feature activation-6.85e-02

Pos logit contributions
 pulp+0.105
oms+0.102
ment+0.096
als+0.095
aff+0.094
Neg logit contributions
 homework-0.084
 math-0.080
 not-0.078
 operation-0.072
 x-0.070
Token of
Token ID286
Feature activation-8.43e-02

Pos logit contributions
 pulp+0.145
 ge+0.119
 islands+0.113
 whales+0.113
 valleys+0.112
Neg logit contributions
 math-0.093
 homework-0.084
 operation-0.079
 because-0.075
 exam-0.075
Token colorful
Token ID20239
Feature activation-1.20e-01

Pos logit contributions
 pulp+0.162
als+0.154
ment+0.151
lan+0.149
oms+0.148
Neg logit contributions
 math-0.117
 x-0.105
 homework-0.105
 operation-0.099
 cord-0.097
Token things
Token ID1243
Feature activation-4.73e-02

Pos logit contributions
 pulp+0.193
fed+0.181
Sorry+0.181
lan+0.181
forts+0.177
Neg logit contributions
 math-0.174
 er-0.156
 cord-0.155
 homework-0.147
 belt-0.142
Token.
Token ID13
Feature activation-2.45e-02

Pos logit contributions
 pulp+0.087
 ge+0.071
 fishes+0.071
ment+0.069
oms+0.067
Neg logit contributions
 homework-0.076
 not-0.075
 math-0.070
zag-0.068
 operation-0.067
Token They
Token ID1119
Feature activation-4.20e-02

Pos logit contributions
 pulp+0.042
oms+0.034
 banners+0.034
ment+0.034
Happy+0.034
Neg logit contributions
 homework-0.038
 not-0.037
 math-0.036
 skate-0.035
ggie-0.035
Token bought
Token ID5839
Feature activation-4.81e-02

Pos logit contributions
 pulp+0.071
ment+0.070
oms+0.066
 ge+0.063
 fishes+0.063
Neg logit contributions
 not-0.062
 homework-0.061
 drib-0.056
zag-0.055
 math-0.055
Token some
Token ID617
Feature activation-6.85e-02

Pos logit contributions
 pulp+0.097
oms+0.088
ment+0.084
Happy+0.081
lan+0.080
Neg logit contributions
 homework-0.060
 math-0.059
 not-0.058
 x-0.054
 operation-0.052
Token garden
Token ID11376
Feature activation-4.35e-02

Pos logit contributions
 pulp+0.120
ment+0.115
 banners+0.110
ins+0.109
oms+0.107
Neg logit contributions
 not-0.076
 exam-0.063
 operation-0.063
 math-0.063
 because-0.062
Token were
Token ID547
Feature activation-6.35e-02

Pos logit contributions
 pulp+0.102
 banners+0.090
 fishes+0.089
 flags+0.086
 whales+0.085
Neg logit contributions
 not-0.048
 math-0.042
 homework-0.042
zag-0.039
 operation-0.038
Token lots
Token ID6041
Feature activation-7.08e-02

Pos logit contributions
oms+0.129
ment+0.126
 pulp+0.125
forts+0.125
leness+0.118
Neg logit contributions
 not-0.084
 math-0.065
 so-0.065
 always-0.060
 only-0.060
Token of
Token ID286
Feature activation-7.79e-02

Pos logit contributions
 pulp+0.152
women+0.122
ment+0.121
 banners+0.119
 valleys+0.118
Neg logit contributions
 math-0.085
 because-0.084
 not-0.076
 exercise-0.076
 operation-0.074
Token beautiful
Token ID4950
Feature activation-1.19e-01

Pos logit contributions
oms+0.142
 pulp+0.137
ment+0.134
omed+0.131
plets+0.129
Neg logit contributions
 not-0.110
 math-0.109
 exercise-0.102
 especially-0.098
 important-0.097
Token colourful
Token ID44799
Feature activation-1.20e-01

Pos logit contributions
fed+0.209
 pulp+0.205
oms+0.203
forts+0.202
iers+0.201
Neg logit contributions
 because-0.144
 math-0.142
 cushion-0.132
 not-0.129
 favorite-0.126
Token flowers
Token ID12734
Feature activation-7.41e-02

Pos logit contributions
 pulp+0.203
 extinguished+0.201
fed+0.198
forts+0.193
oms+0.192
Neg logit contributions
 because-0.150
 math-0.146
 cushion-0.144
 robot-0.126
 just-0.123
Token.
Token ID13
Feature activation-2.41e-02

Pos logit contributions
 pulp+0.168
ocol+0.136
ment+0.136
 banners+0.135
Sorry+0.132
Neg logit contributions
 because-0.097
 not-0.093
zag-0.082
case-0.074
 always-0.073
Token Every
Token ID3887
Feature activation+1.68e-02

Pos logit contributions
 pulp+0.058
ment+0.051
oms+0.051
ins+0.050
 fishes+0.050
Neg logit contributions
 homework-0.023
 math-0.022
 not-0.021
 operation-0.021
 x-0.020
Token day
Token ID1110
Feature activation-3.85e-02

Pos logit contributions
 homework+0.024
 Maxwell+0.023
 x+0.022
 robber+0.021
 math+0.021
Neg logit contributions
 pulp-0.031
 fishes-0.029
oms-0.028
 ge-0.028
 shells-0.027
Token,
Token ID11
Feature activation-1.86e-02

Pos logit contributions
 pulp+0.095
ins+0.084
 islands+0.081
oms+0.081
 fishes+0.080
Neg logit contributions
 not-0.036
 math-0.035
ggie-0.035
 homework-0.033
 Max-0.032
Token of
Token ID286
Feature activation-6.61e-02

Pos logit contributions
 pulp+0.119
Happy+0.099
 islands+0.096
 ge+0.095
 banners+0.094
Neg logit contributions
 not-0.092
 because-0.090
zag-0.086
 math-0.086
ggie-0.085
Token green
Token ID4077
Feature activation-8.42e-02

Pos logit contributions
 pulp+0.113
oms+0.112
ins+0.107
Happy+0.105
ment+0.101
Neg logit contributions
 not-0.100
 math-0.093
 mine-0.089
 because-0.088
 robot-0.086
Token and
Token ID290
Feature activation-4.89e-02

Pos logit contributions
Happy+0.150
 pulp+0.142
 cheered+0.131
prise+0.128
oms+0.127
Neg logit contributions
 because-0.113
 not-0.111
 heel-0.107
zag-0.107
 math-0.106
Token lots
Token ID6041
Feature activation-8.50e-02

Pos logit contributions
 pulp+0.094
oms+0.090
ins+0.084
ment+0.082
Happy+0.077
Neg logit contributions
 not-0.071
 math-0.061
 homework-0.059
 robot-0.059
 skate-0.058
Token of
Token ID286
Feature activation-9.28e-02

Pos logit contributions
 pulp+0.188
lan+0.157
women+0.154
 pans+0.152
Happy+0.151
Neg logit contributions
 because-0.105
 math-0.091
 not-0.089
 about-0.084
 so-0.083
Token tiny
Token ID7009
Feature activation-1.19e-01

Pos logit contributions
 pulp+0.155
oms+0.145
Happy+0.144
ment+0.134
lan+0.133
Neg logit contributions
 not-0.134
 math-0.130
 because-0.124
 heel-0.121
 just-0.121
Token,
Token ID11
Feature activation+8.45e-03

Pos logit contributions
Happy+0.205
 pulp+0.184
Sorry+0.181
iza+0.180
forts+0.179
Neg logit contributions
 because-0.162
 heel-0.148
 balancing-0.141
 just-0.141
 bone-0.140
Token colorful
Token ID20239
Feature activation-9.75e-02

Pos logit contributions
 homework+0.013
 operation+0.012
zag+0.012
ggie+0.011
 x+0.011
Neg logit contributions
 pulp-0.016
 fishes-0.014
oms-0.014
 flags-0.014
 shells-0.013
Token fish
Token ID5916
Feature activation-5.49e-02

Pos logit contributions
Happy+0.147
 pulp+0.142
forts+0.136
izz+0.133
prise+0.130
Neg logit contributions
 heel-0.143
 robot-0.142
 because-0.141
 math-0.141
 bone-0.139
Token!
Token ID0
Feature activation-8.32e-03

Pos logit contributions
 pulp+0.101
Happy+0.087
 banners+0.087
ment+0.084
oms+0.083
Neg logit contributions
 not-0.083
zag-0.077
ggie-0.074
 skate-0.073
 homework-0.072
Token Molly
Token ID30236
Feature activation-1.39e-02

Pos logit contributions
 pulp+0.017
oms+0.015
 fishes+0.015
ment+0.014
 flags+0.014
Neg logit contributions
 homework-0.011
 math-0.010
 operation-0.010
 not-0.010
ggie-0.010
Token and
Token ID290
Feature activation-7.67e-03

Pos logit contributions
 homework+0.022
 operation+0.021
 math+0.020
 x+0.020
zag+0.020
Neg logit contributions
 pulp-0.018
 fishes-0.015
oms-0.014
als-0.014
ment-0.014
Token Mum
Token ID38367
Feature activation-6.19e-02

Pos logit contributions
 pulp+0.017
oms+0.015
 fishes+0.015
 banners+0.014
 flags+0.014
Neg logit contributions
 homework-0.009
ggie-0.008
 not-0.008
 operation-0.008
 math-0.008
Token picked
Token ID6497
Feature activation-2.03e-02

Pos logit contributions
 pulp+0.111
oms+0.103
 islands+0.103
ment+0.101
 whales+0.097
Neg logit contributions
 not-0.084
 drib-0.083
 always-0.080
isk-0.080
ggie-0.080
Token lots
Token ID6041
Feature activation-6.24e-02

Pos logit contributions
 pulp+0.046
oms+0.041
 banners+0.040
ment+0.040
 fishes+0.040
Neg logit contributions
 homework-0.021
 math-0.019
 operation-0.019
ggie-0.018
 not-0.018
Token of
Token ID286
Feature activation-6.92e-02

Pos logit contributions
 pulp+0.127
 islands+0.103
 valleys+0.102
 banners+0.102
lan+0.101
Neg logit contributions
 math-0.079
 not-0.072
 homework-0.071
 because-0.070
ggie-0.069
Token colourful
Token ID44799
Feature activation-1.19e-01

Pos logit contributions
 pulp+0.134
lan+0.125
oms+0.120
 islands+0.118
ment+0.118
Neg logit contributions
 math-0.091
 homework-0.088
 x-0.084
 operation-0.081
 not-0.081
Token flowers
Token ID12734
Feature activation-6.08e-02

Pos logit contributions
 pulp+0.188
arl+0.186
Sorry+0.181
 lighthouse+0.177
lan+0.170
Neg logit contributions
 math-0.172
 er-0.156
 exercise-0.147
 exercises-0.145
 sweater-0.142
Token and
Token ID290
Feature activation-5.85e-02

Pos logit contributions
 pulp+0.134
 islands+0.114
 banners+0.113
Happy+0.113
 fishes+0.112
Neg logit contributions
 not-0.068
 because-0.063
 homework-0.058
zag-0.057
 math-0.054
Token made
Token ID925
Feature activation-4.24e-02

Pos logit contributions
oms+0.098
 pulp+0.097
 islands+0.087
ment+0.085
ins+0.085
Neg logit contributions
 not-0.089
 homework-0.083
 math-0.080
 Max-0.076
ggie-0.075
Token a
Token ID257
Feature activation-3.62e-02

Pos logit contributions
 pulp+0.093
oms+0.086
 islands+0.082
 banners+0.081
als+0.079
Neg logit contributions
 homework-0.045
 not-0.044
ggie-0.043
 math-0.042
 operation-0.040
Token pretty
Token ID2495
Feature activation-3.55e-02

Pos logit contributions
 pulp+0.058
oms+0.054
ment+0.054
 fishes+0.051
 islands+0.051
Neg logit contributions
 homework-0.059
 not-0.057
 math-0.057
ggie-0.054
 operation-0.054
Token the
Token ID262
Feature activation-5.95e-02

Pos logit contributions
 homework+0.003
 operation+0.003
ggie+0.003
 math+0.003
 x+0.003
Neg logit contributions
 pulp-0.003
 fishes-0.003
oms-0.003
 ge-0.003
ment-0.003
Token garden
Token ID11376
Feature activation-4.35e-02

Pos logit contributions
 pulp+0.120
ment+0.115
 banners+0.110
ins+0.109
oms+0.107
Neg logit contributions
 not-0.076
 exam-0.063
 operation-0.063
 math-0.063
 because-0.062
Token were
Token ID547
Feature activation-6.35e-02

Pos logit contributions
 pulp+0.102
 banners+0.090
 fishes+0.089
 flags+0.086
 whales+0.085
Neg logit contributions
 not-0.048
 math-0.042
 homework-0.042
zag-0.039
 operation-0.038
Token lots
Token ID6041
Feature activation-7.08e-02

Pos logit contributions
oms+0.129
ment+0.126
 pulp+0.125
forts+0.125
leness+0.118
Neg logit contributions
 not-0.084
 math-0.065
 so-0.065
 always-0.060
 only-0.060
Token of
Token ID286
Feature activation-7.79e-02

Pos logit contributions
 pulp+0.152
women+0.122
ment+0.121
 banners+0.119
 valleys+0.118
Neg logit contributions
 math-0.085
 because-0.084
 not-0.076
 exercise-0.076
 operation-0.074
Token beautiful
Token ID4950
Feature activation-1.19e-01

Pos logit contributions
oms+0.142
 pulp+0.137
ment+0.134
omed+0.131
plets+0.129
Neg logit contributions
 not-0.110
 math-0.109
 exercise-0.102
 especially-0.098
 important-0.097
Token colourful
Token ID44799
Feature activation-1.20e-01

Pos logit contributions
fed+0.209
 pulp+0.205
oms+0.203
forts+0.202
iers+0.201
Neg logit contributions
 because-0.144
 math-0.142
 cushion-0.132
 not-0.129
 favorite-0.126
Token flowers
Token ID12734
Feature activation-7.41e-02

Pos logit contributions
 pulp+0.203
 extinguished+0.201
fed+0.198
forts+0.193
oms+0.192
Neg logit contributions
 because-0.150
 math-0.146
 cushion-0.144
 robot-0.126
 just-0.123
Token.
Token ID13
Feature activation-2.41e-02

Pos logit contributions
 pulp+0.168
ocol+0.136
ment+0.136
 banners+0.135
Sorry+0.132
Neg logit contributions
 because-0.097
 not-0.093
zag-0.082
case-0.074
 always-0.073
Token Every
Token ID3887
Feature activation+1.68e-02

Pos logit contributions
 pulp+0.058
ment+0.051
oms+0.051
ins+0.050
 fishes+0.050
Neg logit contributions
 homework-0.023
 math-0.022
 not-0.021
 operation-0.021
 x-0.020
Token day
Token ID1110
Feature activation-3.85e-02

Pos logit contributions
 homework+0.024
 Maxwell+0.023
 x+0.022
 robber+0.021
 math+0.021
Neg logit contributions
 pulp-0.031
 fishes-0.029
oms-0.028
 ge-0.028
 shells-0.027
Token would
Token ID561
Feature activation+1.80e-02

Pos logit contributions
 homework+0.003
 operation+0.002
 math+0.002
ggie+0.002
zag+0.002
Neg logit contributions
 pulp-0.003
 fishes-0.003
oms-0.003
 flags-0.003
 shells-0.003
Token fill
Token ID6070
Feature activation+7.65e-03

Pos logit contributions
 homework+0.029
ggie+0.027
 math+0.027
 Maxwell+0.026
 operation+0.026
Neg logit contributions
 pulp-0.031
 fishes-0.027
 shells-0.026
 flags-0.026
oms-0.026
Token the
Token ID262
Feature activation-6.19e-02

Pos logit contributions
 homework+0.010
 operation+0.009
zag+0.009
ggie+0.009
 exam+0.009
Neg logit contributions
 pulp-0.016
 fishes-0.014
 shells-0.014
 flags-0.014
ment-0.014
Token bucket
Token ID19236
Feature activation+1.05e-02

Pos logit contributions
ment+0.095
 pulp+0.090
oms+0.089
ins+0.086
Happy+0.086
Neg logit contributions
 not-0.096
 bone-0.090
 mine-0.089
 heel-0.089
 shelf-0.088
Token with
Token ID351
Feature activation-4.92e-02

Pos logit contributions
 homework+0.015
ggie+0.014
 operation+0.014
 x+0.014
 math+0.013
Neg logit contributions
 pulp-0.020
 fishes-0.018
oms-0.017
 shells-0.017
ment-0.017
Token fresh
Token ID4713
Feature activation-1.19e-01

Pos logit contributions
oms+0.071
 pulp+0.066
omed+0.062
Happy+0.061
als+0.058
Neg logit contributions
 not-0.088
 math-0.087
 homework-0.086
 exercise-0.081
 skate-0.081
Token water
Token ID1660
Feature activation-2.75e-02

Pos logit contributions
 pulp+0.201
 announcements+0.201
Happy+0.200
prise+0.198
 priest+0.192
Neg logit contributions
 because-0.156
 exercise-0.133
 bone-0.127
 so-0.126
 and-0.121
Token from
Token ID422
Feature activation+3.06e-03

Pos logit contributions
 pulp+0.055
oms+0.048
Happy+0.048
 banners+0.047
 fishes+0.047
Neg logit contributions
 homework-0.036
 not-0.034
zag-0.032
 math-0.032
 operation-0.031
Token the
Token ID262
Feature activation-6.12e-02

Pos logit contributions
 homework+0.005
ggie+0.005
zag+0.004
 operation+0.004
 math+0.004
Neg logit contributions
 pulp-0.005
 fishes-0.005
 shells-0.005
 flags-0.005
ment-0.005
Token nearby
Token ID6716
Feature activation-6.10e-02

Pos logit contributions
ment+0.104
oms+0.101
ins+0.101
 pulp+0.094
Happy+0.092
Neg logit contributions
 not-0.095
 robot-0.088
 operation-0.086
 skate-0.086
 shelf-0.086
Token river
Token ID7850
Feature activation-2.45e-02

Pos logit contributions
 pulp+0.112
ins+0.106
ment+0.103
Happy+0.102
oms+0.101
Neg logit contributions
 shelf-0.082
 operation-0.079
 robot-0.079
 bone-0.079
 skate-0.079
Token so
Token ID523
Feature activation+2.21e-03

Pos logit contributions
 pulp+0.002
 fishes+0.001
oms+0.001
ment+0.001
 flags+0.001
Neg logit contributions
 homework-0.001
 operation-0.001
ggie-0.001
zag-0.001
 math-0.001
Token quickly
Token ID2952
Feature activation-3.55e-02

Pos logit contributions
 homework+0.003
 operation+0.003
 x+0.003
zag+0.003
ggie+0.003
Neg logit contributions
 pulp-0.004
 fishes-0.004
oms-0.004
ment-0.004
 flags-0.004
Token,
Token ID11
Feature activation+1.72e-02

Pos logit contributions
 pulp+0.079
ins+0.071
Happy+0.070
 fishes+0.070
 banners+0.070
Neg logit contributions
 not-0.038
 homework-0.034
 operation-0.032
 skate-0.031
 math-0.031
Token Nick
Token ID8047
Feature activation-1.70e-02

Pos logit contributions
 homework+0.026
 operation+0.024
ggie+0.023
 x+0.023
zag+0.023
Neg logit contributions
 pulp-0.032
 fishes-0.030
 shells-0.029
 ge-0.029
oms-0.029
Tokeny
Token ID88
Feature activation-9.11e-03

Pos logit contributions
 pulp+0.029
oms+0.025
 banners+0.025
 fishes+0.025
 flags+0.024
Neg logit contributions
 homework-0.026
ggie-0.026
zag-0.026
 not-0.025
 operation-0.025
Token didn
Token ID1422
Feature activation+5.95e-02

Pos logit contributions
 pulp+0.014
oms+0.012
 fishes+0.012
ment+0.011
 banners+0.011
Neg logit contributions
 homework-0.016
 not-0.015
ggie-0.015
 operation-0.015
zag-0.015
Token't
Token ID470
Feature activation-1.85e-02

Pos logit contributions
 homework+0.079
 robot+0.069
 rider+0.069
isk+0.069
 robber+0.067
Neg logit contributions
oms-0.109
Happy-0.101
 pulp-0.095
omed-0.094
izz-0.093
Token know
Token ID760
Feature activation-8.63e-03

Pos logit contributions
 pulp+0.036
oms+0.032
 fishes+0.031
 banners+0.031
 flags+0.031
Neg logit contributions
 homework-0.025
 not-0.025
 operation-0.024
zag-0.024
ggie-0.023
Token what
Token ID644
Feature activation+1.01e-02

Pos logit contributions
 pulp+0.016
oms+0.014
 fishes+0.014
 banners+0.013
ment+0.013
Neg logit contributions
 homework-0.013
 not-0.012
 math-0.012
 operation-0.012
ggie-0.012
Token to
Token ID284
Feature activation-3.04e-03

Pos logit contributions
 homework+0.016
zag+0.015
 operation+0.015
 Maxwell+0.015
isk+0.015
Neg logit contributions
 pulp-0.017
 fishes-0.015
 ge-0.015
 shells-0.015
ment-0.014
Token guess
Token ID4724
Feature activation-5.69e-03

Pos logit contributions
 pulp+0.007
 fishes+0.006
oms+0.006
ment+0.006
 flags+0.006
Neg logit contributions
 homework-0.004
 operation-0.003
ggie-0.003
 math-0.003
zag-0.003
Token and
Token ID290
Feature activation+2.12e-02

Pos logit contributions
ggie+0.060
 homework+0.059
 x+0.058
zag+0.058
omm+0.057
Neg logit contributions
 flags-0.029
oms-0.028
 fishes-0.027
 banners-0.026
 shells-0.026
Token play
Token ID711
Feature activation+7.41e-03

Pos logit contributions
ggie+0.037
 homework+0.036
 x+0.036
 operation+0.035
zag+0.034
Neg logit contributions
 pulp-0.029
 fishes-0.028
 ge-0.027
 shells-0.027
 banners-0.027
Token.
Token ID13
Feature activation+2.44e-02

Pos logit contributions
 homework+0.012
zag+0.011
ggie+0.011
 x+0.011
 operation+0.011
Neg logit contributions
 pulp-0.012
 fishes-0.011
 flags-0.010
oms-0.010
 shells-0.010
Token He
Token ID679
Feature activation+5.29e-02

Pos logit contributions
ggie+0.037
 homework+0.037
zag+0.035
 operation+0.034
 not+0.033
Neg logit contributions
 pulp-0.039
 fishes-0.037
 pump-0.036
 flags-0.036
 ducks-0.036
Token never
Token ID1239
Feature activation+1.91e-03

Pos logit contributions
 x+0.117
 math+0.112
 homework+0.112
 operation+0.111
omm+0.109
Neg logit contributions
 ge-0.049
 cheered-0.046
 ducks-0.042
 sw-0.040
 shells-0.040
Token stayed
Token ID9658
Feature activation+5.13e-02

Pos logit contributions
 homework+0.004
ggie+0.003
 operation+0.003
 math+0.003
 x+0.003
Neg logit contributions
 pulp-0.003
oms-0.002
 fishes-0.002
 ge-0.002
ment-0.002
Token still
Token ID991
Feature activation+4.35e-02

Pos logit contributions
zag+0.094
ggie+0.093
 homework+0.091
omm+0.091
case+0.091
Neg logit contributions
 shells-0.060
 islands-0.056
 flags-0.056
 fishes-0.055
 sailors-0.055
Token for
Token ID329
Feature activation+1.53e-02

Pos logit contributions
 homework+0.075
 Maxwell+0.074
 x+0.073
ggie+0.072
 math+0.070
Neg logit contributions
 fishes-0.055
 whales-0.054
 ge-0.054
 ducks-0.053
 islands-0.053
Token long
Token ID890
Feature activation+2.23e-02

Pos logit contributions
zag+0.022
 homework+0.022
ggie+0.020
 Maxwell+0.020
 operation+0.020
Neg logit contributions
 pulp-0.027
 fishes-0.025
 flags-0.024
 shells-0.024
 islands-0.024
Token!
Token ID0
Feature activation+2.97e-02

Pos logit contributions
 homework+0.036
zag+0.035
ggie+0.034
 Maxwell+0.034
 x+0.034
Neg logit contributions
 pulp-0.033
oms-0.032
 fishes-0.032
 flags-0.031
 ge-0.031
Token
Token ID198
Feature activation+1.71e-02

Pos logit contributions
 homework+0.055
ggie+0.054
zag+0.052
 operation+0.050
 not+0.050
Neg logit contributions
 pulp-0.036
 fishes-0.033
oms-0.033
 sunlight-0.033
 pump-0.032
Token he
Token ID339
Feature activation+3.43e-02

Pos logit contributions
 homework+0.023
zag+0.022
ggie+0.021
isk+0.021
 operation+0.020
Neg logit contributions
 pulp-0.028
 shells-0.026
 fishes-0.025
ment-0.025
 ge-0.024
Token would
Token ID561
Feature activation+2.89e-02

Pos logit contributions
 homework+0.071
 math+0.069
zag+0.068
 operation+0.068
 Maxwell+0.065
Neg logit contributions
 pulp-0.039
 cheered-0.031
als-0.031
 fishes-0.031
 ge-0.029
Token get
Token ID651
Feature activation-1.61e-03

Pos logit contributions
 homework+0.041
 Maxwell+0.041
ggie+0.040
zag+0.040
 rider+0.039
Neg logit contributions
 pulp-0.050
 pump-0.043
 fishes-0.042
 shells-0.042
 seeds-0.041
Token into
Token ID656
Feature activation+1.35e-02

Pos logit contributions
 pulp+0.003
 fishes+0.003
oms+0.003
ment+0.003
 flags+0.003
Neg logit contributions
 homework-0.002
ggie-0.002
zag-0.002
 operation-0.002
 math-0.002
Token trouble
Token ID5876
Feature activation+2.80e-02

Pos logit contributions
zag+0.020
ggie+0.020
 homework+0.019
ousel+0.018
 operation+0.018
Neg logit contributions
 pulp-0.024
 shells-0.021
 fishes-0.021
 flags-0.021
ment-0.021
Token,
Token ID11
Feature activation+5.14e-02

Pos logit contributions
 homework+0.056
ggie+0.055
zag+0.055
 Maxwell+0.054
 operation+0.054
Neg logit contributions
 pulp-0.030
 fishes-0.027
oms-0.027
ment-0.026
 flags-0.026
Token so
Token ID523
Feature activation+1.73e-03

Pos logit contributions
zag+0.076
ggie+0.075
 homework+0.069
isk+0.065
liv+0.064
Neg logit contributions
 pulp-0.095
 shells-0.087
 flags-0.087
 fishes-0.086
 ge-0.085
Token he
Token ID339
Feature activation+5.30e-03

Pos logit contributions
 homework+0.002
 operation+0.002
ggie+0.002
zag+0.002
 x+0.002
Neg logit contributions
 pulp-0.004
 fishes-0.003
ment-0.003
oms-0.003
 flags-0.003
Token quickly
Token ID2952
Feature activation-4.46e-02

Pos logit contributions
 homework+0.011
 math+0.011
zag+0.011
 operation+0.011
ggie+0.011
Neg logit contributions
 pulp-0.006
 fishes-0.005
oms-0.005
 ge-0.005
 shells-0.005
Token ran
Token ID4966
Feature activation+2.54e-02

Pos logit contributions
 pulp+0.076
 fishes+0.071
oms+0.069
ment+0.068
Happy+0.066
Neg logit contributions
 homework-0.070
 not-0.069
 operation-0.065
 skate-0.064
 x-0.063
Token over
Token ID625
Feature activation+5.82e-03

Pos logit contributions
ggie+0.049
 homework+0.048
 operation+0.045
 x+0.045
zag+0.045
Neg logit contributions
 pulp-0.029
 shells-0.028
 flags-0.028
 fishes-0.027
 towels-0.027
Token scared
Token ID12008
Feature activation-9.62e-03

Pos logit contributions
 homework+0.031
zag+0.028
 math+0.027
 Maxwell+0.027
 robber+0.027
Neg logit contributions
 pulp-0.046
 fishes-0.041
 cheered-0.041
oms-0.041
 ge-0.041
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
 pulp+0.019
 fishes+0.016
oms+0.016
 banners+0.016
 flags+0.016
Neg logit contributions
 homework-0.014
 not-0.013
 operation-0.013
 math-0.013
ggie-0.013
Token What
Token ID1867
Feature activation+1.90e-02

Pos logit contributions
 homework+0.030
ggie+0.028
zag+0.027
 operation+0.027
 skate+0.027
Neg logit contributions
 pulp-0.033
als-0.030
oms-0.030
ment-0.030
 fishes-0.029
Token would
Token ID561
Feature activation+3.15e-02

Pos logit contributions
 homework+0.032
zag+0.031
 operation+0.031
 skate+0.030
isk+0.030
Neg logit contributions
 pulp-0.029
oms-0.026
 fishes-0.025
ment-0.025
 shells-0.025
Token she
Token ID673
Feature activation+2.93e-02

Pos logit contributions
zag+0.052
isk+0.052
 homework+0.052
 x+0.049
 operation+0.049
Neg logit contributions
 pulp-0.050
 fishes-0.043
 shells-0.042
 flags-0.042
 ge-0.041
Token do
Token ID466
Feature activation+5.64e-02

Pos logit contributions
 homework+0.044
zag+0.043
ggie+0.043
 Maxwell+0.043
 robber+0.041
Neg logit contributions
 pulp-0.047
oms-0.041
 cheered-0.041
 fishes-0.040
 shells-0.040
Token now
Token ID783
Feature activation+2.13e-02

Pos logit contributions
isk+0.096
ibo+0.093
 robber+0.093
zag+0.093
 Maxwell+0.091
Neg logit contributions
Happy-0.072
 shells-0.070
oms-0.069
ment-0.068
 cheered-0.068
Token?
Token ID30
Feature activation+1.36e-02

Pos logit contributions
 homework+0.029
ggie+0.028
zag+0.027
 Maxwell+0.027
isk+0.027
Neg logit contributions
 pulp-0.038
oms-0.035
 shells-0.034
 flags-0.033
ment-0.033
Token She
Token ID1375
Feature activation+5.84e-03

Pos logit contributions
 homework+0.021
ggie+0.020
 operation+0.019
zag+0.019
 not+0.019
Neg logit contributions
 pulp-0.024
oms-0.022
als-0.021
ment-0.021
 fishes-0.021
Token thought
Token ID1807
Feature activation+3.02e-02

Pos logit contributions
 homework+0.010
 math+0.010
 operation+0.010
ggie+0.010
zag+0.009
Neg logit contributions
 pulp-0.009
oms-0.008
 fishes-0.008
als-0.007
ment-0.007
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
 homework+0.049
zag+0.049
isk+0.049
ggie+0.047
 robber+0.047
Neg logit contributions
 pulp-0.046
oms-0.045
ment-0.042
 shells-0.042
 flags-0.040
Token Benny
Token ID44275
Feature activation+3.55e-02

Pos logit contributions
 homework+0.061
ggie+0.059
 x+0.058
zag+0.056
 operation+0.055
Neg logit contributions
 pulp-0.040
 fishes-0.038
 sunlight-0.038
ment-0.037
 cheered-0.037
Token learned
Token ID4499
Feature activation+4.73e-02

Pos logit contributions
 homework+0.072
 x+0.068
 operation+0.068
 math+0.067
 robber+0.066
Neg logit contributions
 pulp-0.045
 cheered-0.041
 fishes-0.040
 ge-0.038
 shells-0.036
Token that
Token ID326
Feature activation+5.58e-02

Pos logit contributions
zag+0.100
isk+0.098
 rider+0.092
 robber+0.090
 x+0.090
Neg logit contributions
 shells-0.048
ment-0.046
 flags-0.043
 seeds-0.041
 fishes-0.041
Token it
Token ID340
Feature activation+5.44e-02

Pos logit contributions
isk+0.105
zag+0.104
ggie+0.099
 robber+0.098
ousel+0.094
Neg logit contributions
 pulp-0.077
 pump-0.076
 fishes-0.074
 shells-0.073
 seeds-0.068
Token's
Token ID338
Feature activation+6.14e-02

Pos logit contributions
ggie+0.091
 skate+0.090
 robber+0.090
 homework+0.090
isk+0.087
Neg logit contributions
oms-0.079
 pulp-0.076
als-0.076
 fishes-0.075
 seeds-0.072
Token important
Token ID1593
Feature activation+4.79e-02

Pos logit contributions
zag+0.105
case+0.097
ousel+0.094
 coaster+0.092
icycle+0.092
Neg logit contributions
 seeds-0.083
 pump-0.081
 sunlight-0.078
 ducks-0.078
 fishes-0.078
Token to
Token ID284
Feature activation+1.32e-02

Pos logit contributions
isk+0.107
ousel+0.105
 x+0.102
 skate+0.100
ggie+0.100
Neg logit contributions
ment-0.040
 pulp-0.035
 seeds-0.034
 shells-0.032
 sunlight-0.032
Token take
Token ID1011
Feature activation+1.40e-02

Pos logit contributions
 homework+0.020
zag+0.019
ggie+0.018
 math+0.018
 operation+0.018
Neg logit contributions
 pulp-0.023
 fishes-0.020
 shells-0.020
 pump-0.020
 ge-0.019
Token care
Token ID1337
Feature activation+6.65e-03

Pos logit contributions
 homework+0.031
zag+0.031
ggie+0.030
 skate+0.029
 operation+0.029
Neg logit contributions
 pulp-0.014
 fishes-0.012
ment-0.011
oms-0.011
 shells-0.011
Token of
Token ID286
Feature activation+2.64e-02

Pos logit contributions
 homework+0.015
ggie+0.014
 operation+0.014
zag+0.014
 x+0.014
Neg logit contributions
 pulp-0.007
oms-0.006
ment-0.006
 fishes-0.006
 shells-0.005
Token your
Token ID534
Feature activation+2.13e-02

Pos logit contributions
zag+0.040
 homework+0.039
ggie+0.037
ousel+0.036
case+0.036
Neg logit contributions
 pulp-0.045
 shells-0.041
 fishes-0.041
ment-0.040
 pump-0.039
Tokengy
Token ID1360
Feature activation+5.14e-03

Pos logit contributions
 homework+0.012
 math+0.010
 operation+0.010
ggie+0.010
 x+0.010
Neg logit contributions
 pulp-0.028
 fishes-0.026
 flags-0.025
ment-0.025
 shells-0.025
Tokenbank
Token ID17796
Feature activation-1.34e-02

Pos logit contributions
 homework+0.009
ggie+0.008
 operation+0.008
 math+0.008
 not+0.008
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
 flags-0.007
 banners-0.007
Token and
Token ID290
Feature activation-4.25e-03

Pos logit contributions
 pulp+0.022
oms+0.019
 fishes+0.019
 banners+0.018
 ge+0.018
Neg logit contributions
 homework-0.022
 not-0.021
ggie-0.021
zag-0.021
 operation-0.021
Token brought
Token ID3181
Feature activation-2.75e-02

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 banners+0.005
Neg logit contributions
 homework-0.008
 operation-0.007
ggie-0.007
 math-0.007
 not-0.007
Token it
Token ID340
Feature activation+4.37e-03

Pos logit contributions
 pulp+0.062
oms+0.057
 banners+0.053
 fishes+0.053
 flags+0.052
Neg logit contributions
 homework-0.030
 not-0.030
 operation-0.028
ggie-0.028
 x-0.028
Token back
Token ID736
Feature activation-3.34e-03

Pos logit contributions
 homework+0.007
ggie+0.007
 operation+0.006
 math+0.006
zag+0.006
Neg logit contributions
 pulp-0.007
 fishes-0.006
oms-0.006
ment-0.006
 shells-0.006
Token to
Token ID284
Feature activation-1.50e-02

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.005
 operation-0.005
ggie-0.005
 math-0.005
 not-0.005
Token the
Token ID262
Feature activation+1.16e-02

Pos logit contributions
 pulp+0.034
oms+0.031
 fishes+0.030
als+0.030
 banners+0.030
Neg logit contributions
 homework-0.016
 not-0.015
 operation-0.015
ggie-0.015
 math-0.015
Token man
Token ID582
Feature activation-4.62e-03

Pos logit contributions
 homework+0.017
zag+0.017
ggie+0.016
 not+0.015
 operation+0.015
Neg logit contributions
 pulp-0.021
 fishes-0.019
 ge-0.018
 flags-0.018
 pump-0.018
Token.
Token ID13
Feature activation-2.43e-02

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 banners+0.005
Neg logit contributions
 homework-0.009
 operation-0.008
ggie-0.008
 math-0.008
 not-0.008
Token He
Token ID679
Feature activation+1.56e-02

Pos logit contributions
 pulp+0.051
oms+0.044
 shells+0.043
 fishes+0.043
 flags+0.042
Neg logit contributions
 not-0.030
 homework-0.030
 operation-0.028
zag-0.028
 Maxwell-0.028
Token him
Token ID683
Feature activation+4.98e-02

Pos logit contributions
 homework+0.072
zag+0.071
chy+0.067
case+0.065
tr+0.064
Neg logit contributions
 whales-0.054
 fishes-0.051
 ge-0.050
 shells-0.050
 flags-0.050
Token,
Token ID11
Feature activation+2.33e-02

Pos logit contributions
 homework+0.118
liv+0.117
icycle+0.114
scar+0.112
zag+0.111
Neg logit contributions
ment-0.031
 shells-0.028
oms-0.027
 whales-0.027
 cheered-0.027
Token "
Token ID366
Feature activation-1.57e-03

Pos logit contributions
 homework+0.039
ggie+0.035
zag+0.035
 operation+0.034
 x+0.033
Neg logit contributions
 pulp-0.039
 fishes-0.035
oms-0.035
 ge-0.034
 whales-0.034
TokenWhat
Token ID2061
Feature activation+2.08e-02

Pos logit contributions
 pulp+0.002
 fishes+0.002
oms+0.002
 banners+0.002
ment+0.002
Neg logit contributions
 homework-0.003
 operation-0.003
ggie-0.003
 math-0.003
 not-0.003
Token are
Token ID389
Feature activation-2.07e-02

Pos logit contributions
 homework+0.033
zag+0.033
 operation+0.032
isk+0.032
 Maxwell+0.030
Neg logit contributions
 pulp-0.032
 flags-0.029
 shells-0.029
 fishes-0.029
 whales-0.029
Token you
Token ID345
Feature activation+3.43e-02

Pos logit contributions
 pulp+0.043
oms+0.038
ment+0.038
 fishes+0.038
 banners+0.037
Neg logit contributions
 homework-0.027
 not-0.026
 math-0.025
ggie-0.024
 operation-0.023
Token holding
Token ID4769
Feature activation+4.02e-02

Pos logit contributions
 homework+0.051
 rider+0.049
ggie+0.049
 Maxwell+0.046
ibo+0.045
Neg logit contributions
 pulp-0.057
 pump-0.054
 shells-0.052
oms-0.052
 whales-0.052
Token?"
Token ID1701
Feature activation+1.37e-02

Pos logit contributions
 homework+0.060
zag+0.056
ggie+0.055
 rarely+0.055
 Maxwell+0.054
Neg logit contributions
 pulp-0.066
 shells-0.064
 fishes-0.062
als-0.062
 flags-0.062
Token
Token ID198
Feature activation+5.16e-04

Pos logit contributions
 homework+0.023
 operation+0.021
zag+0.021
 math+0.020
 robber+0.020
Neg logit contributions
 pulp-0.021
ment-0.020
 ge-0.019
 fishes-0.019
 flags-0.018
Token
Token ID198
Feature activation+4.89e-03

Pos logit contributions
 homework+0.001
 operation+0.001
 math+0.001
ggie+0.001
zag+0.001
Neg logit contributions
 pulp-0.001
oms-0.001
 fishes-0.001
ment-0.001
 flags-0.001
TokenThe
Token ID464
Feature activation+1.31e-02

Pos logit contributions
 homework+0.009
 operation+0.008
 math+0.008
 x+0.008
 not+0.008
Neg logit contributions
 pulp-0.008
 fishes-0.007
 ge-0.007
 flags-0.006
 banners-0.006
Token buy
Token ID2822
Feature activation-9.24e-03

Pos logit contributions
 pulp+0.002
 fishes+0.002
oms+0.002
ment+0.002
 flags+0.002
Neg logit contributions
 homework-0.002
 operation-0.001
zag-0.001
ggie-0.001
 math-0.001
Token one
Token ID530
Feature activation+2.59e-02

Pos logit contributions
 pulp+0.017
oms+0.015
 fishes+0.015
 banners+0.014
ment+0.014
Neg logit contributions
 homework-0.014
 math-0.013
 operation-0.013
 not-0.013
ggie-0.013
Token.
Token ID13
Feature activation+5.19e-03

Pos logit contributions
 homework+0.057
 Maxwell+0.055
zag+0.055
ggie+0.054
isk+0.054
Neg logit contributions
 pulp-0.025
ment-0.021
 fishes-0.021
oms-0.020
 pump-0.020
Token
Token ID198
Feature activation+1.69e-03

Pos logit contributions
 homework+0.009
 operation+0.008
ggie+0.008
zag+0.008
 x+0.008
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
ment-0.007
als-0.007
Token
Token ID198
Feature activation+8.39e-03

Pos logit contributions
 homework+0.003
 operation+0.003
 math+0.003
ggie+0.003
zag+0.003
Neg logit contributions
 pulp-0.003
oms-0.002
 fishes-0.002
ment-0.002
 flags-0.002
TokenThe
Token ID464
Feature activation-9.68e-03

Pos logit contributions
 homework+0.015
 operation+0.014
zag+0.014
 x+0.014
 math+0.014
Neg logit contributions
 pulp-0.013
Happy-0.011
 fishes-0.011
oms-0.010
 banners-0.010
Token girl
Token ID2576
Feature activation+8.23e-03

Pos logit contributions
 pulp+0.015
oms+0.012
 fishes+0.012
ment+0.012
 flags+0.012
Neg logit contributions
 homework-0.018
 operation-0.017
 math-0.017
 not-0.017
 x-0.017
Token said
Token ID531
Feature activation-4.88e-03

Pos logit contributions
 homework+0.015
 math+0.013
 operation+0.013
zag+0.013
ggie+0.013
Neg logit contributions
 pulp-0.013
 fishes-0.011
oms-0.011
als-0.010
ment-0.010
Token,
Token ID11
Feature activation+7.74e-03

Pos logit contributions
 pulp+0.009
 fishes+0.008
oms+0.008
ment+0.008
 banners+0.008
Neg logit contributions
 homework-0.007
 operation-0.007
ggie-0.007
 math-0.007
 not-0.007
Token "
Token ID366
Feature activation+4.04e-03

Pos logit contributions
 homework+0.011
ggie+0.010
zag+0.010
 operation+0.009
 math+0.009
Neg logit contributions
 pulp-0.015
 fishes-0.013
oms-0.013
ment-0.013
 banners-0.013
TokenWhat
Token ID2061
Feature activation+1.95e-02

Pos logit contributions
 homework+0.007
zag+0.007
 operation+0.007
 math+0.007
ggie+0.007
Neg logit contributions
 pulp-0.006
 fishes-0.005
als-0.005
oms-0.005
ment-0.005
Token was
Token ID373
Feature activation-1.26e-02

Pos logit contributions
 pulp+0.009
 fishes+0.008
oms+0.008
ment+0.008
 banners+0.008
Neg logit contributions
 homework-0.008
 operation-0.007
ggie-0.007
 not-0.007
 math-0.007
Token not
Token ID407
Feature activation-1.80e-02

Pos logit contributions
 pulp+0.026
oms+0.023
 fishes+0.022
ment+0.022
als+0.022
Neg logit contributions
 homework-0.017
 not-0.016
 operation-0.015
ggie-0.015
 math-0.015
Token far
Token ID1290
Feature activation-1.42e-02

Pos logit contributions
 pulp+0.038
oms+0.034
 fishes+0.033
als+0.033
ment+0.033
Neg logit contributions
 homework-0.022
 not-0.021
 operation-0.020
 math-0.020
ggie-0.019
Token away
Token ID1497
Feature activation-9.30e-03

Pos logit contributions
 pulp+0.027
 fishes+0.023
oms+0.023
ment+0.022
 banners+0.022
Neg logit contributions
 homework-0.021
 operation-0.020
 not-0.020
 math-0.019
 x-0.019
Token.
Token ID13
Feature activation+4.34e-02

Pos logit contributions
 pulp+0.015
 fishes+0.013
oms+0.013
 banners+0.012
 flags+0.012
Neg logit contributions
 homework-0.016
 not-0.015
 operation-0.015
 math-0.015
ggie-0.015
Token When
Token ID1649
Feature activation+2.94e-02

Pos logit contributions
 operation+0.064
 homework+0.062
zag+0.059
 skate+0.058
ggie+0.058
Neg logit contributions
als-0.073
oms-0.072
 pulp-0.066
 cheered-0.062
 flags-0.062
Token she
Token ID673
Feature activation-4.74e-03

Pos logit contributions
zag+0.044
isk+0.043
 operation+0.043
 homework+0.043
 x+0.042
Neg logit contributions
 pulp-0.046
oms-0.044
 flags-0.043
 banners-0.042
als-0.042
Token emerged
Token ID9349
Feature activation+1.38e-02

Pos logit contributions
 pulp+0.008
 fishes+0.006
oms+0.006
ment+0.006
 banners+0.006
Neg logit contributions
 homework-0.008
 operation-0.008
ggie-0.008
 math-0.008
 not-0.008
Token from
Token ID422
Feature activation+2.76e-02

Pos logit contributions
 homework+0.025
ggie+0.023
 operation+0.022
 skate+0.022
zag+0.022
Neg logit contributions
 pulp-0.020
ment-0.019
oms-0.019
 fishes-0.018
als-0.018
Token the
Token ID262
Feature activation-2.72e-02

Pos logit contributions
 homework+0.050
ggie+0.049
zag+0.047
 skate+0.045
 operation+0.045
Neg logit contributions
 pulp-0.040
 shells-0.035
 flags-0.035
 valleys-0.035
 fishes-0.034
Token tunnel
Token ID13275
Feature activation-8.20e-03

Pos logit contributions
 pulp+0.059
ins+0.053
ment+0.053
oms+0.051
 fishes+0.050
Neg logit contributions
 not-0.032
 robot-0.032
 operation-0.031
 homework-0.031
 x-0.031
Tokenling
Token ID1359
Feature activation+6.24e-03

Pos logit contributions
 pulp+0.082
 banners+0.071
oms+0.071
als+0.070
 fishes+0.070
Neg logit contributions
 not-0.041
 homework-0.041
 math-0.038
 operation-0.035
ggie-0.034
Token.
Token ID13
Feature activation-8.63e-03

Pos logit contributions
 homework+0.011
 math+0.010
ggie+0.010
 operation+0.010
zag+0.010
Neg logit contributions
 pulp-0.010
 fishes-0.009
oms-0.009
ment-0.008
 flags-0.008
Token The
Token ID383
Feature activation+4.07e-03

Pos logit contributions
 pulp+0.015
oms+0.013
 fishes+0.013
ment+0.013
 banners+0.013
Neg logit contributions
 homework-0.013
 operation-0.013
 math-0.012
ggie-0.012
 not-0.012
Token wolf
Token ID17481
Feature activation+4.07e-02

Pos logit contributions
 homework+0.006
ggie+0.005
zag+0.005
 math+0.005
 not+0.005
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
 ge-0.007
 grains-0.007
Token wanted
Token ID2227
Feature activation+6.81e-04

Pos logit contributions
 homework+0.074
 math+0.070
 x+0.070
 operation+0.068
ggie+0.067
Neg logit contributions
 pulp-0.056
 fishes-0.053
 cheered-0.052
 ge-0.051
als-0.050
Token to
Token ID284
Feature activation+3.42e-02

Pos logit contributions
 homework+0.001
zag+0.001
ggie+0.001
 operation+0.001
 math+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.001
 flags-0.001
 ge-0.001
Token be
Token ID307
Feature activation+2.46e-02

Pos logit contributions
 homework+0.054
zag+0.051
 Maxwell+0.051
 x+0.049
 math+0.049
Neg logit contributions
 pulp-0.050
 fishes-0.047
 ge-0.047
 pump-0.046
 ducks-0.046
Token the
Token ID262
Feature activation+2.51e-02

Pos logit contributions
 homework+0.039
zag+0.037
 x+0.036
ggie+0.035
 operation+0.035
Neg logit contributions
 pulp-0.039
 fishes-0.037
 ge-0.037
 flags-0.036
 ducks-0.036
Token best
Token ID1266
Feature activation+1.63e-02

Pos logit contributions
 homework+0.037
ggie+0.035
zag+0.035
 Maxwell+0.033
 math+0.033
Neg logit contributions
 pulp-0.043
 fishes-0.043
 ducks-0.041
 ge-0.041
 pump-0.040
Token how
Token ID703
Feature activation-3.49e-02

Pos logit contributions
 homework+0.028
ggie+0.025
zag+0.025
 math+0.025
 Maxwell+0.025
Neg logit contributions
 pulp-0.025
 fishes-0.024
oms-0.023
 ge-0.022
ment-0.022
Tokenler
Token ID1754
Feature activation+1.42e-02

Pos logit contributions
 pulp+0.075
 banners+0.065
oms+0.064
 islands+0.064
als+0.064
Neg logit contributions
 not-0.043
 homework-0.041
 math-0.038
ggie-0.037
 operation-0.036
Token to
Token ID284
Feature activation-8.90e-03

Pos logit contributions
 pulp+0.051
oms+0.046
ment+0.045
 fishes+0.045
als+0.044
Neg logit contributions
 not-0.026
 homework-0.026
 math-0.024
ggie-0.022
 operation-0.022
Token climb
Token ID12080
Feature activation+8.23e-03

Pos logit contributions
 pulp+0.017
oms+0.015
 fishes+0.015
ment+0.015
 banners+0.015
Neg logit contributions
 homework-0.012
 not-0.012
 operation-0.011
 math-0.011
ggie-0.011
Token the
Token ID262
Feature activation-3.29e-03

Pos logit contributions
 homework+0.013
ggie+0.013
 operation+0.012
 x+0.012
 math+0.012
Neg logit contributions
 pulp-0.014
 fishes-0.012
oms-0.012
 flags-0.012
 shells-0.012
Token ladder
Token ID18002
Feature activation+2.92e-02

Pos logit contributions
 pulp+0.006
oms+0.005
 fishes+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.005
ggie-0.004
 operation-0.004
 math-0.004
 not-0.004
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 homework+0.065
ggie+0.063
 operation+0.062
 x+0.061
 robber+0.061
Neg logit contributions
 pulp-0.027
oms-0.025
 flags-0.024
als-0.024
 fishes-0.023
Token see
Token ID766
Feature activation-3.42e-02

Pos logit contributions
 pulp+0.023
oms+0.020
 fishes+0.020
ment+0.019
 banners+0.019
Neg logit contributions
 homework-0.024
 not-0.023
 math-0.023
 operation-0.022
ggie-0.022
Token the
Token ID262
Feature activation-2.58e-02

Pos logit contributions
 pulp+0.074
oms+0.064
als+0.063
lan+0.063
ment+0.062
Neg logit contributions
 not-0.042
 math-0.041
 homework-0.041
 x-0.038
 operation-0.038
Token birds
Token ID10087
Feature activation-3.35e-02

Pos logit contributions
 pulp+0.050
ment+0.046
oms+0.045
 fishes+0.044
 banners+0.044
Neg logit contributions
 homework-0.035
 not-0.034
 operation-0.033
 math-0.033
 x-0.031
Token.
Token ID13
Feature activation-1.85e-02

Pos logit contributions
 pulp+0.059
ment+0.050
 fishes+0.049
 banners+0.047
lan+0.047
Neg logit contributions
 not-0.053
 homework-0.053
zag-0.051
 operation-0.049
 math-0.049
Token
Token ID198
Feature activation-2.81e-02

Pos logit contributions
 pulp+0.035
oms+0.030
 fishes+0.029
ment+0.029
 banners+0.028
Neg logit contributions
 homework-0.027
 not-0.027
 math-0.025
ggie-0.025
 skate-0.025
Token
Token ID198
Feature activation-3.22e-02

Pos logit contributions
 pulp+0.060
 fishes+0.051
als+0.050
ment+0.050
 grains+0.049
Neg logit contributions
 not-0.038
 homework-0.036
ggie-0.036
zag-0.035
 math-0.034
Token in
Token ID287
Feature activation-3.61e-02

Pos logit contributions
 pulp+0.063
oms+0.055
 banners+0.054
Happy+0.053
 fishes+0.053
Neg logit contributions
 not-0.046
 homework-0.045
 math-0.044
ggie-0.042
 operation-0.041
Token the
Token ID262
Feature activation-2.92e-02

Pos logit contributions
 pulp+0.079
oms+0.073
ment+0.073
als+0.072
Happy+0.070
Neg logit contributions
 not-0.038
 math-0.038
 homework-0.037
 x-0.033
 skate-0.033
Token dirt
Token ID13647
Feature activation-7.81e-03

Pos logit contributions
 pulp+0.041
ment+0.039
oms+0.038
 banners+0.036
Happy+0.036
Neg logit contributions
 math-0.052
 homework-0.052
 operation-0.050
 not-0.049
 robot-0.048
Token and
Token ID290
Feature activation-3.23e-02

Pos logit contributions
 pulp+0.015
 fishes+0.013
oms+0.013
ment+0.012
 banners+0.012
Neg logit contributions
 homework-0.011
 operation-0.010
 not-0.010
 math-0.010
ggie-0.010
Token find
Token ID1064
Feature activation-5.86e-02

Pos logit contributions
 pulp+0.061
oms+0.061
ins+0.059
ment+0.059
Happy+0.057
Neg logit contributions
 not-0.043
 homework-0.041
 math-0.039
 skate-0.039
 operation-0.036
Token bugs
Token ID11316
Feature activation-2.95e-02

Pos logit contributions
 pulp+0.100
Happy+0.096
oms+0.092
Sorry+0.090
owers+0.089
Neg logit contributions
 math-0.090
 x-0.083
 homework-0.078
 operation-0.078
 not-0.078
Token and
Token ID290
Feature activation-4.43e-02

Pos logit contributions
 pulp+0.051
 fishes+0.042
ment+0.042
 banners+0.042
oms+0.041
Neg logit contributions
 homework-0.047
 not-0.047
zag-0.045
 math-0.044
 operation-0.043
Token stones
Token ID14966
Feature activation-4.94e-02

Pos logit contributions
oms+0.061
 pulp+0.057
omed+0.056
Happy+0.056
ment+0.055
Neg logit contributions
 not-0.083
 homework-0.081
 math-0.081
 skate-0.075
 exercise-0.073
Token.
Token ID13
Feature activation-5.13e-03

Pos logit contributions
 pulp+0.093
 fishes+0.078
Happy+0.078
 banners+0.078
ment+0.075
Neg logit contributions
 not-0.076
 homework-0.070
zag-0.069
 math-0.064
 because-0.063
Token They
Token ID1119
Feature activation-2.68e-02

Pos logit contributions
 pulp+0.010
 fishes+0.008
oms+0.008
ment+0.008
 banners+0.008
Neg logit contributions
 homework-0.008
 operation-0.007
ggie-0.007
 math-0.007
 not-0.007
Token had
Token ID550
Feature activation-1.19e-02

Pos logit contributions
 pulp+0.043
ment+0.042
oms+0.041
 fishes+0.037
Happy+0.037
Neg logit contributions
 not-0.044
 homework-0.043
 skate-0.041
 math-0.039
 drib-0.039
Token crabs
Token ID49172
Feature activation-1.08e-02

Pos logit contributions
 pulp+0.052
ment+0.044
als+0.043
oms+0.043
 banners+0.040
Neg logit contributions
 homework-0.048
 math-0.045
 not-0.045
 skate-0.044
ggie-0.044
Token,
Token ID11
Feature activation+1.70e-02

Pos logit contributions
 pulp+0.022
 fishes+0.019
oms+0.019
ment+0.018
 banners+0.018
Neg logit contributions
 homework-0.015
 not-0.013
 operation-0.013
ggie-0.013
zag-0.013
Token and
Token ID290
Feature activation+3.06e-03

Pos logit contributions
 homework+0.027
ggie+0.025
 operation+0.025
zag+0.025
 x+0.024
Neg logit contributions
 pulp-0.030
 fishes-0.027
 shells-0.027
 flags-0.027
 ge-0.027
Token he
Token ID339
Feature activation+5.74e-03

Pos logit contributions
 homework+0.005
 operation+0.005
ggie+0.005
 x+0.005
 math+0.005
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
ment-0.004
 flags-0.004
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
 homework+0.011
 operation+0.011
 math+0.011
 x+0.010
ggie+0.010
Neg logit contributions
 pulp-0.008
 fishes-0.007
oms-0.007
 flags-0.006
 shells-0.006
Token his
Token ID465
Feature activation-1.98e-02

Pos logit contributions
 pulp+0.033
oms+0.030
 fishes+0.029
 banners+0.028
 flags+0.028
Neg logit contributions
 homework-0.016
 not-0.014
ggie-0.014
 math-0.014
 operation-0.014
Token Mum
Token ID38367
Feature activation-5.34e-02

Pos logit contributions
 pulp+0.036
oms+0.031
ment+0.030
als+0.030
 banners+0.030
Neg logit contributions
 homework-0.030
 not-0.028
 math-0.028
ggie-0.028
 skate-0.028
Token went
Token ID1816
Feature activation-2.62e-02

Pos logit contributions
 pulp+0.116
 fishes+0.093
oms+0.092
 grains+0.091
ment+0.091
Neg logit contributions
 homework-0.065
 skate-0.064
 not-0.064
zag-0.063
ousel-0.062
Token home
Token ID1363
Feature activation-2.24e-02

Pos logit contributions
 pulp+0.057
 fishes+0.048
ment+0.047
oms+0.046
als+0.046
Neg logit contributions
 homework-0.034
zag-0.031
 not-0.031
 operation-0.031
 skate-0.030
Token.
Token ID13
Feature activation+7.90e-03

Pos logit contributions
 pulp+0.040
 fishes+0.034
als+0.033
oms+0.033
 ge+0.032
Neg logit contributions
 homework-0.037
 not-0.036
zag-0.035
 math-0.034
 operation-0.034
Token
Token ID220
Feature activation-1.36e-02

Pos logit contributions
 homework+0.015
 operation+0.014
ggie+0.014
 x+0.014
 math+0.013
Neg logit contributions
 pulp-0.011
 fishes-0.010
oms-0.010
ment-0.010
 flags-0.010
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
 pulp+0.014
 fishes+0.012
oms+0.012
 banners+0.011
 flags+0.011
Neg logit contributions
 homework-0.014
ggie-0.013
 not-0.013
 operation-0.013
zag-0.013
TokenAll
Token ID3237
Feature activation-1.62e-02

Pos logit contributions
 pulp+0.035
oms+0.031
 fishes+0.030
 banners+0.030
als+0.030
Neg logit contributions
 not-0.024
ggie-0.023
 homework-0.023
zag-0.023
 math-0.022
Token of
Token ID286
Feature activation-1.60e-02

Pos logit contributions
 pulp+0.025
 banners+0.020
 fishes+0.020
oms+0.020
 flags+0.020
Neg logit contributions
 homework-0.029
 operation-0.028
 not-0.027
 math-0.027
zag-0.027
Token a
Token ID257
Feature activation-5.62e-02

Pos logit contributions
 pulp+0.038
oms+0.034
als+0.033
 banners+0.033
 fishes+0.033
Neg logit contributions
 homework-0.016
 not-0.015
 math-0.015
 operation-0.014
ggie-0.014
Token sudden
Token ID4802
Feature activation-5.79e-02

Pos logit contributions
 pulp+0.083
ment+0.080
als+0.076
oms+0.075
owers+0.070
Neg logit contributions
 not-0.102
ggie-0.099
 homework-0.097
 operation-0.097
 robot-0.095
Token,
Token ID11
Feature activation-2.03e-02

Pos logit contributions
 pulp+0.149
ins+0.132
 banners+0.130
aff+0.128
Happy+0.128
Neg logit contributions
 not-0.045
 math-0.040
 Max-0.040
 operation-0.040
 Rat-0.039
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
 pulp+0.047
oms+0.042
 banners+0.041
 fishes+0.040
ins+0.040
Neg logit contributions
 not-0.021
 homework-0.020
 math-0.018
 operation-0.018
ggie-0.018
Token music
Token ID2647
Feature activation-3.73e-02

Pos logit contributions
 pulp+0.031
ment+0.027
oms+0.027
 fishes+0.026
 islands+0.025
Neg logit contributions
 homework-0.027
 operation-0.025
 x-0.024
 not-0.024
 math-0.024
Token started
Token ID2067
Feature activation-7.88e-03

Pos logit contributions
 pulp+0.058
oms+0.048
ment+0.048
 fishes+0.047
 islands+0.046
Neg logit contributions
ggie-0.064
 homework-0.064
 not-0.064
 operation-0.063
zag-0.062
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
 pulp+0.015
 fishes+0.013
oms+0.013
ment+0.013
 banners+0.013
Neg logit contributions
 homework-0.011
 not-0.010
 operation-0.010
ggie-0.010
zag-0.010
Token everyone
Token ID2506
Feature activation-4.07e-02

Pos logit contributions
 pulp+0.073
oms+0.071
ins+0.069
aff+0.068
Happy+0.067
Neg logit contributions
 not-0.048
 homework-0.038
 Max-0.038
 because-0.037
 math-0.037
Token could
Token ID714
Feature activation+1.86e-02

Pos logit contributions
 homework+0.077
 math+0.074
 operation+0.072
 x+0.071
zag+0.070
Neg logit contributions
 pulp-0.038
als-0.037
 cheered-0.037
 fishes-0.034
 seeds-0.033
Token always
Token ID1464
Feature activation+2.30e-03

Pos logit contributions
 homework+0.028
ggie+0.026
 Maxwell+0.026
zag+0.025
 robber+0.025
Neg logit contributions
 pulp-0.031
 pump-0.028
 fishes-0.028
oms-0.027
 shells-0.027
Token use
Token ID779
Feature activation+1.80e-02

Pos logit contributions
 homework+0.003
ggie+0.003
zag+0.003
 math+0.003
 operation+0.003
Neg logit contributions
 pulp-0.004
 fishes-0.004
oms-0.004
ment-0.004
 flags-0.004
Token his
Token ID465
Feature activation-1.51e-02

Pos logit contributions
zag+0.023
ggie+0.023
 homework+0.023
 operation+0.021
 skate+0.021
Neg logit contributions
 pulp-0.035
 shells-0.032
 fishes-0.031
 flags-0.031
ment-0.031
Token cat
Token ID3797
Feature activation+2.48e-02

Pos logit contributions
 pulp+0.029
 fishes+0.026
oms+0.025
 banners+0.025
ment+0.025
Neg logit contributions
 homework-0.022
 operation-0.020
 not-0.020
 math-0.020
 x-0.019
Token's
Token ID338
Feature activation-1.69e-02

Pos logit contributions
 homework+0.038
ggie+0.036
 x+0.035
 operation+0.035
zag+0.035
Neg logit contributions
 pulp-0.042
als-0.037
 fishes-0.037
oms-0.036
 shells-0.036
Token help
Token ID1037
Feature activation+1.99e-02

Pos logit contributions
 pulp+0.032
 fishes+0.028
oms+0.028
 banners+0.027
als+0.027
Neg logit contributions
 homework-0.025
 operation-0.023
 not-0.023
 math-0.023
 x-0.022
Token when
Token ID618
Feature activation-1.29e-02

Pos logit contributions
 homework+0.035
zag+0.034
ggie+0.033
 Maxwell+0.032
 robber+0.032
Neg logit contributions
 pulp-0.028
 fishes-0.025
 pump-0.025
ment-0.024
 flags-0.024
Token he
Token ID339
Feature activation+5.83e-03

Pos logit contributions
 pulp+0.028
oms+0.025
 fishes+0.024
 banners+0.024
ment+0.024
Neg logit contributions
 homework-0.015
 not-0.014
 math-0.014
 operation-0.014
ggie-0.013
Token needed
Token ID2622
Feature activation-3.45e-02

Pos logit contributions
 homework+0.010
ggie+0.010
 math+0.010
 operation+0.010
zag+0.010
Neg logit contributions
 pulp-0.009
oms-0.008
 fishes-0.007
ment-0.007
als-0.007
Token it
Token ID340
Feature activation+1.25e-03

Pos logit contributions
 pulp+0.070
oms+0.064
 fishes+0.062
als+0.061
 banners+0.061
Neg logit contributions
 homework-0.047
 not-0.045
 math-0.045
 operation-0.044
 x-0.043
Token and
Token ID290
Feature activation-1.99e-02

Pos logit contributions
 pulp+0.011
 fishes+0.010
oms+0.009
ment+0.009
 banners+0.009
Neg logit contributions
 homework-0.012
 operation-0.011
ggie-0.011
 not-0.011
 math-0.011
Token threw
Token ID9617
Feature activation-1.22e-02

Pos logit contributions
 pulp+0.033
oms+0.030
 fishes+0.030
ment+0.028
 islands+0.028
Neg logit contributions
 homework-0.033
 not-0.031
 math-0.030
 x-0.030
 operation-0.029
Token them
Token ID606
Feature activation+8.18e-03

Pos logit contributions
 pulp+0.020
oms+0.018
 fishes+0.018
ment+0.017
 banners+0.017
Neg logit contributions
 homework-0.020
 math-0.019
 not-0.018
 operation-0.018
ggie-0.018
Token in
Token ID287
Feature activation-6.68e-03

Pos logit contributions
 homework+0.013
ggie+0.012
 operation+0.012
 math+0.012
 x+0.012
Neg logit contributions
 pulp-0.014
oms-0.012
 flags-0.012
 fishes-0.012
 shells-0.011
Token the
Token ID262
Feature activation-2.30e-02

Pos logit contributions
 pulp+0.012
oms+0.011
 fishes+0.011
ment+0.010
 banners+0.010
Neg logit contributions
 homework-0.010
 math-0.009
 operation-0.009
 not-0.009
ggie-0.009
Token trash
Token ID13913
Feature activation-7.42e-02

Pos logit contributions
 pulp+0.037
 fishes+0.035
 islands+0.033
oms+0.033
 whales+0.033
Neg logit contributions
 homework-0.040
 math-0.037
 operation-0.036
 x-0.036
 not-0.036
Token.
Token ID13
Feature activation+5.77e-03

Pos logit contributions
 pulp+0.152
agg+0.144
 fishes+0.137
ment+0.136
 whales+0.133
Neg logit contributions
 robot-0.083
 not-0.079
 because-0.078
 homework-0.078
 Max-0.077
Token She
Token ID1375
Feature activation-3.29e-03

Pos logit contributions
 homework+0.008
ggie+0.008
zag+0.008
 operation+0.008
 x+0.007
Neg logit contributions
 pulp-0.011
oms-0.009
 fishes-0.009
ment-0.009
 flags-0.009
Token told
Token ID1297
Feature activation+1.75e-02

Pos logit contributions
 pulp+0.005
 fishes+0.004
oms+0.004
ment+0.004
 flags+0.004
Neg logit contributions
 homework-0.006
 operation-0.006
ggie-0.006
 math-0.006
zag-0.006
Token Tommy
Token ID19919
Feature activation+5.34e-02

Pos logit contributions
zag+0.030
 homework+0.029
ggie+0.028
 operation+0.027
 skate+0.026
Neg logit contributions
 pulp-0.028
 ge-0.024
 fishes-0.024
 flags-0.024
 shells-0.024
Token to
Token ID284
Feature activation+1.41e-02

Pos logit contributions
 operation+0.086
 x+0.086
icycle+0.083
zag+0.083
isk+0.082
Neg logit contributions
 pulp-0.075
ment-0.069
 seeds-0.065
 pump-0.065
 flags-0.065
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 pulp+0.073
 banners+0.062
oms+0.062
ment+0.061
 shells+0.061
Neg logit contributions
 not-0.030
zag-0.029
 operation-0.027
 x-0.026
 robot-0.025
Token separated
Token ID11266
Feature activation-2.81e-02

Pos logit contributions
 pulp+0.029
ment+0.026
oms+0.025
 fishes+0.025
 flags+0.024
Neg logit contributions
 homework-0.033
 not-0.032
 math-0.031
 operation-0.031
zag-0.031
Token the
Token ID262
Feature activation-3.22e-03

Pos logit contributions
 pulp+0.060
oms+0.051
 banners+0.050
 islands+0.050
 ge+0.049
Neg logit contributions
 math-0.032
 homework-0.032
 not-0.032
ggie-0.031
 Maxwell-0.030
Token leaf
Token ID12835
Feature activation+4.04e-04

Pos logit contributions
 pulp+0.006
 fishes+0.005
oms+0.005
ment+0.005
 flags+0.005
Neg logit contributions
 homework-0.005
ggie-0.005
 operation-0.005
 math-0.005
zag-0.005
Token in
Token ID287
Feature activation-1.18e-02

Pos logit contributions
 homework+0.001
 operation+0.001
ggie+0.001
 math+0.001
 x+0.001
Neg logit contributions
 pulp-0.001
oms-0.001
 fishes-0.001
ment-0.001
 flags-0.000
Token two
Token ID734
Feature activation-8.35e-02

Pos logit contributions
 pulp+0.023
oms+0.021
 fishes+0.020
ment+0.020
 flags+0.020
Neg logit contributions
 homework-0.016
 not-0.015
 math-0.015
ggie-0.015
 operation-0.015
Token,
Token ID11
Feature activation-1.96e-02

Pos logit contributions
 pulp+0.151
ment+0.147
lan+0.147
Happy+0.145
women+0.133
Neg logit contributions
 not-0.102
 math-0.099
 heel-0.097
 homework-0.093
 balancing-0.093
Token making
Token ID1642
Feature activation-1.14e-02

Pos logit contributions
 pulp+0.038
oms+0.033
ment+0.033
 fishes+0.033
als+0.032
Neg logit contributions
 homework-0.027
 not-0.026
 math-0.025
ggie-0.024
 operation-0.024
Token Alexis
Token ID31078
Feature activation-7.90e-03

Pos logit contributions
 pulp+0.023
oms+0.021
 fishes+0.020
ment+0.020
 banners+0.020
Neg logit contributions
 homework-0.015
 not-0.013
ggie-0.013
 math-0.013
 operation-0.013
Token very
Token ID845
Feature activation-3.42e-02

Pos logit contributions
 pulp+0.015
 fishes+0.013
oms+0.013
ment+0.013
 banners+0.013
Neg logit contributions
 homework-0.011
ggie-0.010
 not-0.010
 operation-0.010
 math-0.010
Token sad
Token ID6507
Feature activation-2.42e-02

Pos logit contributions
 pulp+0.080
als+0.074
 banners+0.073
oms+0.072
 islands+0.072
Neg logit contributions
 not-0.037
 homework-0.032
 math-0.028
ggie-0.027
zag-0.027
Token Please
Token ID4222
Feature activation-3.50e-02

Pos logit contributions
 pulp+0.019
 fishes+0.017
oms+0.017
ment+0.016
 banners+0.016
Neg logit contributions
 homework-0.016
 operation-0.015
 not-0.014
ggie-0.014
 math-0.014
Token,
Token ID11
Feature activation+4.66e-04

Pos logit contributions
 pulp+0.078
 fishes+0.068
ment+0.067
 banners+0.067
 flags+0.066
Neg logit contributions
 not-0.046
 homework-0.044
 operation-0.042
 x-0.041
 skate-0.041
Token can
Token ID460
Feature activation+6.53e-03

Pos logit contributions
 homework+0.001
ggie+0.001
zag+0.001
 math+0.001
 operation+0.001
Neg logit contributions
 pulp-0.001
 fishes-0.001
oms-0.001
 ge-0.001
 flags-0.001
Token we
Token ID356
Feature activation-1.57e-02

Pos logit contributions
 homework+0.012
zag+0.011
 math+0.011
ggie+0.011
 operation+0.011
Neg logit contributions
 pulp-0.010
 fishes-0.008
oms-0.008
 flags-0.008
 banners-0.008
Token drink
Token ID4144
Feature activation-3.69e-02

Pos logit contributions
 pulp+0.032
ment+0.028
oms+0.028
 fishes+0.027
 banners+0.027
Neg logit contributions
 homework-0.022
 not-0.021
 operation-0.020
ggie-0.020
zag-0.019
Token some
Token ID617
Feature activation-7.88e-02

Pos logit contributions
 pulp+0.073
oms+0.065
 banners+0.063
 fishes+0.062
ment+0.062
Neg logit contributions
 not-0.050
 homework-0.049
 operation-0.046
 math-0.046
 x-0.045
Token of
Token ID286
Feature activation-3.18e-02

Pos logit contributions
 pulp+0.137
 valleys+0.126
 banners+0.112
oms+0.111
 islands+0.108
Neg logit contributions
 homework-0.126
 not-0.123
 operation-0.123
 exercise-0.120
 math-0.119
Token yours
Token ID12431
Feature activation-1.29e-02

Pos logit contributions
 pulp+0.059
oms+0.055
ment+0.052
als+0.051
 banners+0.051
Neg logit contributions
 not-0.046
 homework-0.046
 operation-0.042
 math-0.042
 x-0.041
Token?"
Token ID1701
Feature activation-1.13e-02

Pos logit contributions
 pulp+0.026
 fishes+0.023
oms+0.023
 banners+0.022
 flags+0.022
Neg logit contributions
 homework-0.017
 operation-0.015
 not-0.015
 math-0.015
ggie-0.015
Token Ben
Token ID3932
Feature activation+4.50e-03

Pos logit contributions
 pulp+0.024
 fishes+0.021
oms+0.021
ment+0.020
 banners+0.020
Neg logit contributions
 homework-0.014
 not-0.013
 math-0.013
ggie-0.013
 operation-0.013
Token begged
Token ID32946
Feature activation-2.14e-02

Pos logit contributions
 homework+0.010
 operation+0.010
 math+0.010
ggie+0.010
zag+0.010
Neg logit contributions
 pulp-0.005
 fishes-0.004
oms-0.004
 ge-0.003
 flags-0.003
Token harbor
Token ID25451
Feature activation-4.41e-02

Pos logit contributions
 pulp+0.052
ment+0.048
oms+0.043
ins+0.043
als+0.042
Neg logit contributions
 homework-0.059
 operation-0.057
 not-0.056
 skate-0.054
 x-0.054
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
 pulp+0.084
 fishes+0.068
oms+0.066
ocol+0.065
 banners+0.065
Neg logit contributions
 not-0.070
 homework-0.066
zag-0.064
ggie-0.060
ousel-0.060
Token They
Token ID1119
Feature activation-4.68e-02

Pos logit contributions
 pulp+0.029
 fishes+0.025
oms+0.025
ment+0.024
 shells+0.024
Neg logit contributions
 homework-0.023
 math-0.021
 not-0.021
ggie-0.021
 operation-0.021
Token saw
Token ID2497
Feature activation-4.21e-02

Pos logit contributions
 pulp+0.081
ment+0.076
oms+0.065
ocol+0.065
ins+0.064
Neg logit contributions
 not-0.079
 skate-0.072
 homework-0.071
ousel-0.068
 always-0.067
Token many
Token ID867
Feature activation-8.24e-02

Pos logit contributions
 pulp+0.076
oms+0.067
omed+0.065
Happy+0.065
ment+0.065
Neg logit contributions
 not-0.061
 homework-0.060
 math-0.058
 robot-0.053
ggie-0.053
Token big
Token ID1263
Feature activation-8.60e-02

Pos logit contributions
 pulp+0.152
ment+0.128
als+0.125
owers+0.122
eat+0.122
Neg logit contributions
 homework-0.119
 math-0.115
 especially-0.114
 operation-0.109
 skate-0.107
Token and
Token ID290
Feature activation-5.31e-02

Pos logit contributions
 pulp+0.134
Happy+0.121
ment+0.119
eat+0.116
fed+0.113
Neg logit contributions
 homework-0.132
 robot-0.132
 because-0.132
 mask-0.127
 skate-0.127
Token small
Token ID1402
Feature activation-1.07e-01

Pos logit contributions
 pulp+0.109
ment+0.103
oms+0.100
owers+0.100
ins+0.100
Neg logit contributions
 not-0.067
 homework-0.065
 math-0.060
 skate-0.057
 robot-0.056
Token boats
Token ID16659
Feature activation-6.67e-02

Pos logit contributions
Happy+0.173
 pulp+0.173
izz+0.165
fed+0.157
Sorry+0.157
Neg logit contributions
 because-0.146
 robot-0.145
 math-0.138
 homework-0.136
 mask-0.131
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
 pulp+0.143
ment+0.120
Happy+0.117
 pans+0.111
 fishes+0.111
Neg logit contributions
 not-0.087
zag-0.084
ousel-0.083
 homework-0.081
 skate-0.080
Token They
Token ID1119
Feature activation-5.41e-02

Pos logit contributions
 pulp+0.036
oms+0.030
 fishes+0.030
ment+0.029
 shells+0.029
Neg logit contributions
 homework-0.037
 math-0.035
 not-0.034
 x-0.034
 operation-0.034
Token prince
Token ID19716
Feature activation+1.11e-02

Pos logit contributions
 pulp+0.069
oms+0.064
 banners+0.057
 fishes+0.057
 towels+0.056
Neg logit contributions
 homework-0.048
 not-0.046
 operation-0.045
 robot-0.044
 rider-0.043
Token and
Token ID290
Feature activation-3.37e-03

Pos logit contributions
 homework+0.017
 math+0.016
 operation+0.016
 x+0.016
ggie+0.015
Neg logit contributions
 pulp-0.020
 fishes-0.018
oms-0.017
ment-0.017
als-0.017
Token his
Token ID465
Feature activation-2.75e-02

Pos logit contributions
 pulp+0.007
 fishes+0.006
oms+0.006
ment+0.006
 flags+0.006
Neg logit contributions
 homework-0.004
 operation-0.004
ggie-0.004
 math-0.004
zag-0.004
Token sun
Token ID4252
Feature activation-6.24e-02

Pos logit contributions
 pulp+0.050
oms+0.044
 banners+0.042
 fishes+0.042
Happy+0.042
Neg logit contributions
 homework-0.042
 operation-0.040
 math-0.039
 not-0.039
ggie-0.039
Tokenfl
Token ID2704
Feature activation-4.90e-02

Pos logit contributions
 pulp+0.108
oms+0.097
 fishes+0.090
Happy+0.090
 banners+0.088
Neg logit contributions
zag-0.104
 not-0.093
ggie-0.085
 homework-0.085
 operation-0.083
Tokenowers
Token ID3618
Feature activation-8.05e-02

Pos logit contributions
 pulp+0.052
 ge+0.040
 banners+0.039
 flags+0.039
 whales+0.038
Neg logit contributions
zag-0.108
 homework-0.105
 operation-0.103
isk-0.100
ggie-0.099
Token lived
Token ID5615
Feature activation-3.51e-02

Pos logit contributions
 pulp+0.126
lan+0.106
ment+0.101
Happy+0.101
 pans+0.100
Neg logit contributions
 not-0.140
zag-0.135
 always-0.131
 homework-0.124
 operation-0.121
Token happily
Token ID18177
Feature activation-5.91e-02

Pos logit contributions
 pulp+0.072
oms+0.063
 banners+0.061
als+0.060
ment+0.060
Neg logit contributions
 not-0.044
 homework-0.043
zag-0.041
 math-0.040
ggie-0.040
Token ever
Token ID1683
Feature activation-3.63e-02

Pos logit contributions
 pulp+0.138
 banners+0.114
oms+0.113
 pans+0.113
 fishes+0.112
Neg logit contributions
 not-0.064
zag-0.063
ggie-0.059
 homework-0.059
 because-0.057
Token after
Token ID706
Feature activation-2.68e-02

Pos logit contributions
 pulp+0.087
oms+0.080
 banners+0.075
 fishes+0.074
ins+0.074
Neg logit contributions
 homework-0.035
 not-0.034
 math-0.031
 because-0.031
ggie-0.030
Token.
Token ID13
Feature activation+2.51e-02

Pos logit contributions
 pulp+0.046
oms+0.038
 banners+0.037
als+0.037
 fishes+0.037
Neg logit contributions
 homework-0.046
 not-0.045
ggie-0.043
 math-0.043
 operation-0.041