TOP ACTIVATIONS
MAX = 0.154

girl would make a great fer ret parent .
scale . The scale says 30 . Lily is happy .
the tree and said , '' Don 't worry k itty
man stepped closer and said gru ff ly , " You
says . " Ju ice is y ummy ,"
and held it in her cu pped hands . " Mom
good . " Ju ice is y ummy ,"
. He says , " Ju ice is y ummy !
carefully picked up the deaf tort oise and placed it beside
of the powder into his cu pped hands and scattered it
uddled the cat and said '' You are safe now !
But his mom said '' No ! ''
replies . " Ju ice , please . Thank
they saw that it said 40 degrees ! Joe
It 's poisoned ," replied Wise Owl . " Oh no
. It is called The Wizard of Oz . Do you
Ele ph ants have strong tr unks that they can use
was determined to be the vict or . But
leaned close to him and cu pped her mouth with her
. They said , " Ju ice , cheese and crack

MIN ACTIVATIONS
MIN = -0.132

After hours and hours of hard work , Z ara
One day , after hours of z ipping around , one
Finally , after days of walking and waving , the
. Even though they were of different species , they shared
made fun of the m ittens she was wearing . They
After many hours of running and searching , they
lost one of her m ittens in the snow .
Finally , after hours of swimming , the bug reached
Finally , after many hours of searching , they found it
Todd took off the m ittens , put them under the
dropped one of her m ittens in the snow .
� Now I have m ittens too . We should have
inside to get her m ittens , but she couldn 't
today was his first day of school . He had a
. And whenever it r ained , she made sure to
went outside without her m ittens on . Her mom my
the day was still lots of fun . She could not
old and she loved m ittens . One day she was
dropped one of his m ittens on the way . The
Finally , after hours of waiting , it happened .

INTERVAL 0.082 - 0.154
CONTAINS 0.448%

to do . Lucy said yes and followed Jack .
The white twin said " Sure ! That sounds like
and saw a sleepy rabbit pe eking down from the tree
asks . " OK , let 's play inside
pig . He wanted to succeed , but he was too

INTERVAL 0.011 - 0.082
CONTAINS 31.319%

There was a big tree in the backyard . Every day
they still liked to play with their toy cars . They
about the tide . She decided to go and get help
" Thank you , little bird . You are so wise
smiled . " Me too !" he said , happy to

INTERVAL -0.061 - 0.011
CONTAINS 66.112%

a time , there was a boy named Tim my .
ocean . M ia picked up the blue shampoo
I 'm here to make your day even more magical than
You can try it ." L ily pushed the
did , but he did not know how .

INTERVAL -0.132 - -0.061
CONTAINS 2.120%

a big , scary wolf coming towards them . They were
hill and flies in the air . Tom and Lily feel
turn off the alarm . When she got there , she
was very scared . Ahead of him everything was dark .
" That 's okay ! When we are anxious about something
Token girl
Token ID2576
Feature activation+6.13e-02

Pos logit contributions
 islands+0.081
 cities+0.078
 oceans+0.075
 obstacles+0.075
 abrupt+0.072
Neg logit contributions
free-0.050
 okay-0.050
inky-0.048
 clean-0.046
etsy-0.046
Token would
Token ID561
Feature activation+5.47e-02

Pos logit contributions
 islands+0.128
 cities+0.123
 obstacles+0.121
 oceans+0.119
 abrupt+0.119
Neg logit contributions
 okay-0.072
free-0.068
inky-0.064
etsy-0.062
 clean-0.061
Token make
Token ID787
Feature activation-1.50e-02

Pos logit contributions
 islands+0.115
 cities+0.113
 obstacles+0.110
 oceans+0.108
 obstacle+0.107
Neg logit contributions
 okay-0.063
free-0.058
inky-0.056
 clean-0.055
 warm-0.053
Token a
Token ID257
Feature activation+5.86e-03

Pos logit contributions
inky+0.019
 okay+0.019
etsy+0.018
free+0.018
uffy+0.017
Neg logit contributions
 islands-0.029
 cities-0.028
 obstacles-0.028
 abrupt-0.027
 obstacle-0.027
Token great
Token ID1049
Feature activation+3.84e-02

Pos logit contributions
 islands+0.013
 cities+0.012
 oceans+0.012
 obstacles+0.012
 abrupt+0.012
Neg logit contributions
 okay-0.006
 clean-0.006
inky-0.006
free-0.006
 warm-0.006
Token fer
Token ID11354
Feature activation+1.46e-01

Pos logit contributions
 islands+0.076
 cities+0.074
 obstacles+0.071
 oceans+0.070
 abrupt+0.069
Neg logit contributions
 okay-0.049
inky-0.046
free-0.045
 clean-0.044
 warm-0.043
Tokenret
Token ID1186
Feature activation+1.43e-02

Pos logit contributions
 islands+0.292
 cities+0.278
 oceans+0.277
 cliffs+0.275
 obstacles+0.271
Neg logit contributions
inky-0.192
he-0.192
un-0.171
free-0.169
t-0.167
Token parent
Token ID2560
Feature activation+1.08e-02

Pos logit contributions
 islands+0.027
 cities+0.026
 obstacles+0.025
 obstacle+0.025
 oceans+0.024
Neg logit contributions
 okay-0.017
inky-0.017
etsy-0.017
uffy-0.017
free-0.016
Token.
Token ID13
Feature activation+5.04e-03

Pos logit contributions
 islands+0.023
 cities+0.023
 obstacles+0.022
 oceans+0.021
 obstacle+0.021
Neg logit contributions
 okay-0.012
free-0.011
inky-0.011
etsy-0.010
 clean-0.010
Token
Token ID220
Feature activation+4.94e-02

Pos logit contributions
 cities+0.011
 islands+0.011
 obstacles+0.010
 abrupt+0.010
 oceans+0.010
Neg logit contributions
 okay-0.005
free-0.005
inky-0.005
 clean-0.004
 warm-0.004
Token
Token ID198
Feature activation+2.32e-03

Pos logit contributions
 cities+0.090
 islands+0.089
 obstacles+0.084
 abrupt+0.084
 oceans+0.082
Neg logit contributions
 okay-0.060
free-0.059
 clean-0.055
 warm-0.055
inky-0.053
Token scale
Token ID5046
Feature activation+5.49e-02

Pos logit contributions
 islands+0.011
 cities+0.010
 obstacles+0.010
 abrupt+0.009
 oceans+0.009
Neg logit contributions
 clean-0.013
 warm-0.013
 okay-0.013
free-0.012
 safe-0.012
Token.
Token ID13
Feature activation-8.23e-03

Pos logit contributions
 islands+0.119
 cities+0.116
 obstacles+0.113
 obstacle+0.111
 oceans+0.110
Neg logit contributions
 okay-0.056
free-0.052
 clean-0.052
he-0.052
inky-0.051
Token The
Token ID383
Feature activation+1.02e-02

Pos logit contributions
 okay+0.008
inky+0.008
free+0.008
etsy+0.007
 safe+0.007
Neg logit contributions
 cities-0.017
 islands-0.017
 obstacles-0.017
 oceans-0.016
 abrupt-0.016
Token scale
Token ID5046
Feature activation+6.36e-02

Pos logit contributions
 islands+0.016
 abrupt+0.015
 cities+0.015
 obstacles+0.014
 oceans+0.014
Neg logit contributions
 clean-0.017
 warm-0.017
free-0.017
 safe-0.016
 okay-0.016
Token says
Token ID1139
Feature activation+2.67e-02

Pos logit contributions
 islands+0.107
 cities+0.104
 abrupt+0.102
 obstacles+0.100
 oceans+0.099
Neg logit contributions
 okay-0.093
 clean-0.092
he-0.087
 warm-0.085
 safe-0.082
Token 30
Token ID1542
Feature activation+1.43e-01

Pos logit contributions
 islands+0.061
 cities+0.059
 abrupt+0.058
 obstacles+0.058
 obstacle+0.058
Neg logit contributions
 okay-0.029
free-0.024
inky-0.023
 clean-0.022
 warm-0.022
Token.
Token ID13
Feature activation+1.03e-03

Pos logit contributions
 islands+0.301
 cities+0.299
rob+0.292
 abrupt+0.292
 oceans+0.287
Neg logit contributions
 okay-0.145
 little-0.133
 clean-0.129
 warm-0.123
 safe-0.119
Token Lily
Token ID20037
Feature activation-2.14e-02

Pos logit contributions
 cities+0.002
 islands+0.002
 obstacles+0.002
 oceans+0.002
 abrupt+0.002
Neg logit contributions
free-0.001
inky-0.001
 okay-0.001
etsy-0.001
uffy-0.001
Token is
Token ID318
Feature activation+4.11e-03

Pos logit contributions
 clean+0.032
 okay+0.029
he+0.029
 warm+0.028
inky+0.027
Neg logit contributions
 islands-0.037
 cities-0.036
 abrupt-0.035
 oceans-0.034
 obstacles-0.033
Token happy
Token ID3772
Feature activation-5.22e-02

Pos logit contributions
 islands+0.008
 cities+0.008
 obstacles+0.008
 hears+0.008
 oceans+0.008
Neg logit contributions
inky-0.005
etsy-0.005
free-0.005
uffy-0.004
 okay-0.004
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
 okay+0.036
 clean+0.030
free+0.028
 warm+0.026
 safe+0.024
Neg logit contributions
 islands-0.135
 cities-0.134
 obstacles-0.130
 obstacle-0.130
heit-0.127
Token the
Token ID262
Feature activation+1.06e-02

Pos logit contributions
 islands+0.029
 cities+0.029
 obstacles+0.028
 oceans+0.027
 obstacle+0.027
Neg logit contributions
 okay-0.014
free-0.013
inky-0.013
etsy-0.012
 clean-0.012
Token tree
Token ID5509
Feature activation+3.98e-02

Pos logit contributions
 islands+0.019
 cities+0.018
 obstacles+0.017
 oceans+0.017
 whales+0.016
Neg logit contributions
 okay-0.016
free-0.015
inky-0.015
 clean-0.014
 warm-0.014
Token and
Token ID290
Feature activation+9.82e-03

Pos logit contributions
 islands+0.073
 cities+0.072
 obstacles+0.068
 oceans+0.067
 abrupt+0.067
Neg logit contributions
 okay-0.055
free-0.052
inky-0.052
etsy-0.049
uffy-0.047
Token said
Token ID531
Feature activation+8.41e-03

Pos logit contributions
 islands+0.018
 cities+0.018
 obstacles+0.017
 oceans+0.017
 abrupt+0.016
Neg logit contributions
 okay-0.014
inky-0.013
free-0.013
etsy-0.012
uffy-0.012
Token,
Token ID11
Feature activation+4.48e-02

Pos logit contributions
 islands+0.018
 cities+0.018
 obstacles+0.018
 oceans+0.017
 obstacle+0.017
Neg logit contributions
 okay-0.009
free-0.008
inky-0.008
etsy-0.008
 clean-0.007
Token ''
Token ID10148
Feature activation+1.42e-01

Pos logit contributions
 islands+0.078
 cities+0.077
 obstacles+0.073
 oceans+0.071
 abrupt+0.071
Neg logit contributions
 okay-0.065
free-0.063
inky-0.063
etsy-0.060
 clean-0.058
TokenDon
Token ID3987
Feature activation-2.94e-03

Pos logit contributions
 islands+0.229
 cities+0.224
 obstacles+0.214
 oceans+0.208
 abrupt+0.205
Neg logit contributions
 okay-0.227
inky-0.217
free-0.215
etsy-0.206
 clean-0.204
Token't
Token ID470
Feature activation-7.51e-03

Pos logit contributions
free+0.003
 okay+0.003
inky+0.003
etsy+0.003
 clean+0.003
Neg logit contributions
 abrupt-0.006
 obstacles-0.006
 cities-0.006
 obstacle-0.006
 islands-0.006
Token worry
Token ID5490
Feature activation-5.08e-04

Pos logit contributions
inky+0.010
uffy+0.010
free+0.010
etsy+0.010
 okay+0.009
Neg logit contributions
 islands-0.013
 cities-0.013
 obstacle-0.013
 obstacles-0.013
 abrupt-0.012
Token k
Token ID479
Feature activation+2.48e-02

Pos logit contributions
free+0.001
he+0.001
inky+0.001
etsy+0.001
un+0.001
Neg logit contributions
 obstacle-0.001
 obstacles-0.001
 cities-0.001
 heights-0.001
 islands-0.001
Tokenitty
Token ID9760
Feature activation-3.76e-02

Pos logit contributions
 islands+0.057
 cities+0.055
 obstacles+0.053
 oceans+0.052
 abrupt+0.052
Neg logit contributions
 okay-0.025
free-0.024
inky-0.024
etsy-0.022
uffy-0.021
Token man
Token ID582
Feature activation+5.73e-02

Pos logit contributions
 islands+0.032
 cities+0.031
 obstacles+0.030
 oceans+0.029
 obstacle+0.029
Neg logit contributions
 okay-0.022
free-0.021
inky-0.021
etsy-0.020
 clean-0.020
Token stepped
Token ID10764
Feature activation+2.82e-02

Pos logit contributions
 islands+0.102
 cities+0.099
 obstacles+0.095
 oceans+0.092
 abrupt+0.092
Neg logit contributions
 okay-0.082
free-0.080
inky-0.079
etsy-0.075
 clean-0.072
Token closer
Token ID5699
Feature activation+4.61e-02

Pos logit contributions
 islands+0.058
 cities+0.056
 obstacles+0.055
 obstacle+0.053
 oceans+0.053
Neg logit contributions
 okay-0.035
free-0.033
inky-0.032
etsy-0.030
 clean-0.030
Token and
Token ID290
Feature activation+4.22e-03

Pos logit contributions
 islands+0.103
 cities+0.100
 obstacles+0.097
 obstacle+0.096
 abrupt+0.095
Neg logit contributions
 okay-0.048
free-0.045
 clean-0.042
 warm-0.041
inky-0.041
Token said
Token ID531
Feature activation+1.27e-02

Pos logit contributions
 islands+0.008
 cities+0.008
 obstacles+0.007
 oceans+0.007
 obstacle+0.007
Neg logit contributions
 okay-0.006
free-0.006
inky-0.006
etsy-0.005
 clean-0.005
Token gru
Token ID22848
Feature activation+1.41e-01

Pos logit contributions
 islands+0.029
 cities+0.029
 obstacles+0.028
 oceans+0.027
 abrupt+0.027
Neg logit contributions
 okay-0.012
free-0.011
inky-0.011
etsy-0.010
 clean-0.010
Tokenff
Token ID487
Feature activation+6.28e-02

Pos logit contributions
rob+0.296
 islands+0.283
 plank+0.275
 cities+0.272
 cliffs+0.267
Neg logit contributions
 okay-0.171
inky-0.155
less-0.155
 OK-0.146
free-0.145
Tokenly
Token ID306
Feature activation+3.87e-02

Pos logit contributions
 islands+0.103
 cities+0.101
 obstacles+0.099
 obstacle+0.096
 abrupt+0.094
Neg logit contributions
 okay-0.094
free-0.089
inky-0.088
he-0.086
etsy-0.086
Token,
Token ID11
Feature activation+4.23e-02

Pos logit contributions
 islands+0.080
 cities+0.079
 obstacles+0.077
 oceans+0.074
 obstacle+0.074
Neg logit contributions
 okay-0.041
free-0.040
inky-0.039
etsy-0.037
he-0.037
Token "
Token ID366
Feature activation+7.85e-02

Pos logit contributions
 islands+0.090
 cities+0.086
 obstacles+0.083
 obstacle+0.083
 oceans+0.081
Neg logit contributions
 okay-0.053
free-0.049
inky-0.045
 clean-0.044
etsy-0.043
TokenYou
Token ID1639
Feature activation+3.61e-02

Pos logit contributions
 islands+0.142
 cities+0.138
 obstacles+0.129
 obstacle+0.127
 oceans+0.127
Neg logit contributions
 okay-0.115
free-0.113
inky-0.106
etsy-0.106
 clean-0.100
Token says
Token ID1139
Feature activation+7.48e-03

Pos logit contributions
 okay+0.004
inky+0.004
 clean+0.004
free+0.004
 warm+0.004
Neg logit contributions
 islands-0.005
 cities-0.005
 obstacles-0.005
 obstacle-0.005
 oceans-0.004
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
 islands+0.012
 cities+0.012
 hears+0.011
 whales+0.011
 obstacles+0.011
Neg logit contributions
inky-0.010
etsy-0.010
he-0.010
free-0.010
uffy-0.010
Token
Token ID198
Feature activation+1.35e-03

Pos logit contributions
 islands+0.026
 cities+0.025
 obstacles+0.024
 oceans+0.024
 abrupt+0.024
Neg logit contributions
 okay-0.011
free-0.010
inky-0.010
etsy-0.009
 clean-0.009
Token
Token ID198
Feature activation-4.13e-03

Pos logit contributions
 islands+0.003
 cities+0.003
 obstacles+0.003
 abrupt+0.003
 oceans+0.003
Neg logit contributions
 okay-0.001
free-0.001
inky-0.001
etsy-0.001
 clean-0.001
Token"
Token ID1
Feature activation+2.74e-02

Pos logit contributions
 okay+0.006
free+0.006
inky+0.006
etsy+0.005
 clean+0.005
Neg logit contributions
 islands-0.007
 cities-0.007
 obstacles-0.007
 oceans-0.007
 abrupt-0.007
TokenJu
Token ID33018
Feature activation+1.41e-01

Pos logit contributions
 islands+0.048
 cities+0.047
 obstacles+0.046
 obstacle+0.045
 oceans+0.043
Neg logit contributions
 okay-0.041
free-0.041
inky-0.038
etsy-0.038
 clean-0.037
Tokenice
Token ID501
Feature activation-1.73e-02

Pos logit contributions
 islands+0.292
 cities+0.283
 obstacles+0.272
 oceans+0.265
 obstacle+0.260
Neg logit contributions
inky-0.167
 okay-0.165
free-0.162
uffy-0.157
etsy-0.153
Token is
Token ID318
Feature activation+1.84e-02

Pos logit contributions
 okay+0.020
free+0.019
inky+0.019
etsy+0.018
 clean+0.017
Neg logit contributions
 islands-0.035
 cities-0.034
 obstacles-0.033
 oceans-0.032
 obstacle-0.032
Token y
Token ID331
Feature activation+1.58e-02

Pos logit contributions
 islands+0.041
 cities+0.040
 obstacles+0.039
 whales+0.039
 abrupt+0.038
Neg logit contributions
inky-0.017
free-0.017
etsy-0.016
he-0.016
un-0.016
Tokenummy
Token ID13513
Feature activation-2.42e-03

Pos logit contributions
 islands+0.035
 cities+0.035
 obstacles+0.034
 abrupt+0.033
 obstacle+0.033
Neg logit contributions
 okay-0.015
free-0.014
inky-0.014
etsy-0.013
 clean-0.013
Token,"
Token ID553
Feature activation+1.97e-02

Pos logit contributions
 okay+0.003
free+0.003
inky+0.002
 clean+0.002
 warm+0.002
Neg logit contributions
 islands-0.005
 cities-0.005
 obstacles-0.005
 oceans-0.005
 obstacle-0.005
Token and
Token ID290
Feature activation+1.78e-02

Pos logit contributions
 islands+0.065
 cities+0.064
 obstacles+0.062
 oceans+0.060
 abrupt+0.060
Neg logit contributions
 okay-0.034
inky-0.032
free-0.032
 clean-0.031
etsy-0.030
Token held
Token ID2714
Feature activation+3.89e-02

Pos logit contributions
 islands+0.039
 cities+0.038
 obstacles+0.037
 obstacle+0.036
 oceans+0.036
Neg logit contributions
 okay-0.019
 clean-0.018
 warm-0.018
free-0.017
etsy-0.017
Token it
Token ID340
Feature activation-9.29e-03

Pos logit contributions
 islands+0.064
 cities+0.064
 obstacles+0.061
 oceans+0.059
 obstacle+0.059
Neg logit contributions
 okay-0.058
inky-0.057
free-0.057
etsy-0.054
uffy-0.053
Token in
Token ID287
Feature activation+6.78e-02

Pos logit contributions
inky+0.012
free+0.012
uffy+0.012
etsy+0.011
 okay+0.011
Neg logit contributions
 islands-0.016
 cities-0.016
 obstacles-0.016
 oceans-0.016
 cliffs-0.015
Token her
Token ID607
Feature activation-2.98e-02

Pos logit contributions
 islands+0.154
 cities+0.151
 obstacles+0.147
 abrupt+0.144
 oceans+0.144
Neg logit contributions
inky-0.065
 okay-0.063
free-0.063
uffy-0.061
etsy-0.060
Token cu
Token ID18912
Feature activation+1.41e-01

Pos logit contributions
free+0.041
 okay+0.040
inky+0.038
etsy+0.037
he+0.036
Neg logit contributions
 islands-0.054
 cities-0.053
 obstacles-0.052
 oceans-0.050
 obstacle-0.050
Tokenpped
Token ID1496
Feature activation-4.24e-02

Pos logit contributions
 islands+0.216
 cities+0.209
 obstacles+0.202
 oceans+0.195
 abrupt+0.194
Neg logit contributions
 okay-0.247
inky-0.226
free-0.225
etsy-0.218
 clean-0.214
Token hands
Token ID2832
Feature activation+2.50e-02

Pos logit contributions
 okay+0.055
inky+0.054
free+0.052
etsy+0.051
uffy+0.049
Neg logit contributions
 islands-0.080
 cities-0.079
 obstacles-0.077
 oceans-0.075
 abrupt-0.073
Token.
Token ID13
Feature activation+7.23e-03

Pos logit contributions
 islands+0.036
 cities+0.035
 oceans+0.034
 obstacles+0.034
 whales+0.032
Neg logit contributions
uffy-0.038
inky-0.038
 okay-0.038
etsy-0.035
free-0.034
Token "
Token ID366
Feature activation+5.72e-02

Pos logit contributions
 cities+0.014
 islands+0.013
 oceans+0.013
 obstacles+0.013
 heights+0.012
Neg logit contributions
 okay-0.009
inky-0.008
free-0.008
 safe-0.008
uffy-0.008
TokenMom
Token ID29252
Feature activation-9.54e-03

Pos logit contributions
 islands+0.106
 cities+0.102
 obstacles+0.099
 obstacle+0.097
 abrupt+0.095
Neg logit contributions
 okay-0.079
free-0.079
inky-0.073
etsy-0.073
 clean-0.071
Token good
Token ID922
Feature activation+3.74e-03

Pos logit contributions
 islands+0.027
 cities+0.025
 whales+0.025
 obstacles+0.025
 oceans+0.025
Neg logit contributions
inky-0.015
etsy-0.015
uffy-0.014
un-0.014
free-0.013
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
 islands+0.008
 cities+0.008
 obstacles+0.007
 oceans+0.007
 abrupt+0.007
Neg logit contributions
 okay-0.004
inky-0.004
etsy-0.004
free-0.004
uffy-0.003
Token
Token ID198
Feature activation-1.54e-02

Pos logit contributions
 okay+0.015
free+0.015
inky+0.015
etsy+0.014
 clean+0.013
Neg logit contributions
 islands-0.034
 cities-0.033
 obstacles-0.032
 oceans-0.031
 abrupt-0.031
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 okay+0.014
free+0.013
inky+0.013
etsy+0.013
uffy+0.011
Neg logit contributions
 islands-0.035
 cities-0.035
 obstacles-0.033
 abrupt-0.033
 oceans-0.033
Token"
Token ID1
Feature activation+2.81e-02

Pos logit contributions
 okay+0.025
free+0.023
inky+0.023
etsy+0.022
 clean+0.022
Neg logit contributions
 islands-0.029
 cities-0.029
 obstacles-0.028
 oceans-0.027
 abrupt-0.027
TokenJu
Token ID33018
Feature activation+1.40e-01

Pos logit contributions
 islands+0.045
 cities+0.044
 obstacles+0.042
 obstacle+0.041
 oceans+0.041
Neg logit contributions
 okay-0.046
free-0.045
inky-0.043
etsy-0.042
 clean-0.042
Tokenice
Token ID501
Feature activation-1.75e-02

Pos logit contributions
 islands+0.279
 cities+0.273
 obstacles+0.263
 oceans+0.254
 obstacle+0.252
Neg logit contributions
 okay-0.172
inky-0.170
free-0.164
uffy-0.162
etsy-0.156
Token is
Token ID318
Feature activation+2.48e-02

Pos logit contributions
 okay+0.021
free+0.019
inky+0.019
etsy+0.018
 clean+0.018
Neg logit contributions
 islands-0.036
 cities-0.035
 obstacles-0.034
 oceans-0.033
 obstacle-0.033
Token y
Token ID331
Feature activation+2.11e-02

Pos logit contributions
 islands+0.053
 cities+0.051
 whales+0.051
 obstacles+0.050
 hears+0.049
Neg logit contributions
he-0.025
inky-0.025
etsy-0.025
free-0.025
un-0.023
Tokenummy
Token ID13513
Feature activation+3.36e-03

Pos logit contributions
 islands+0.048
 cities+0.047
 obstacles+0.045
 obstacle+0.045
 abrupt+0.045
Neg logit contributions
 okay-0.020
free-0.019
inky-0.018
etsy-0.017
 clean-0.016
Token,"
Token ID553
Feature activation+1.81e-02

Pos logit contributions
 islands+0.007
 cities+0.007
 obstacles+0.007
 oceans+0.006
 obstacle+0.006
Neg logit contributions
 okay-0.004
free-0.004
inky-0.004
 clean-0.004
 warm-0.004
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 okay+0.068
free+0.064
inky+0.063
etsy+0.062
uffy+0.058
Neg logit contributions
 islands-0.102
 cities-0.098
 obstacles-0.095
 oceans-0.094
 abrupt-0.092
Token He
Token ID679
Feature activation+4.31e-02

Pos logit contributions
free+0.027
inky+0.026
 okay+0.026
etsy+0.026
uffy+0.024
Neg logit contributions
 islands-0.026
 cities-0.026
 obstacles-0.024
 abrupt-0.024
 oceans-0.024
Token says
Token ID1139
Feature activation+1.56e-02

Pos logit contributions
 cities+0.081
 islands+0.078
 obstacles+0.076
 oceans+0.075
 obstacle+0.072
Neg logit contributions
 clean-0.064
 warm-0.061
 okay-0.059
 dry-0.056
 Lily-0.055
Token,
Token ID11
Feature activation+3.03e-02

Pos logit contributions
 islands+0.037
 cities+0.036
 obstacles+0.035
 oceans+0.034
 abrupt+0.034
Neg logit contributions
 okay-0.014
free-0.013
inky-0.013
etsy-0.012
 clean-0.012
Token "
Token ID366
Feature activation+7.45e-02

Pos logit contributions
 islands+0.058
 cities+0.056
 obstacles+0.055
 obstacle+0.053
 abrupt+0.053
Neg logit contributions
 okay-0.041
free-0.039
inky-0.038
etsy-0.036
 clean-0.036
TokenJu
Token ID33018
Feature activation+1.40e-01

Pos logit contributions
 islands+0.136
 cities+0.133
 obstacles+0.128
 oceans+0.124
 obstacle+0.124
Neg logit contributions
 okay-0.105
free-0.100
inky-0.100
etsy-0.095
 clean-0.093
Tokenice
Token ID501
Feature activation-1.63e-02

Pos logit contributions
 islands+0.277
 cities+0.270
 obstacles+0.258
 oceans+0.249
 obstacle+0.247
Neg logit contributions
 okay-0.185
free-0.180
inky-0.180
 clean-0.170
uffy-0.167
Token is
Token ID318
Feature activation+1.70e-02

Pos logit contributions
 okay+0.020
free+0.019
inky+0.019
etsy+0.018
he+0.017
Neg logit contributions
 islands-0.032
 cities-0.031
 obstacles-0.030
 obstacle-0.029
 oceans-0.029
Token y
Token ID331
Feature activation+1.21e-02

Pos logit contributions
 islands+0.038
 cities+0.037
 obstacles+0.037
 whales+0.036
 abrupt+0.036
Neg logit contributions
inky-0.015
free-0.015
etsy-0.014
 okay-0.014
he-0.014
Tokenummy
Token ID13513
Feature activation-4.54e-03

Pos logit contributions
 islands+0.026
 cities+0.025
 obstacles+0.025
 obstacle+0.024
 abrupt+0.024
Neg logit contributions
 okay-0.013
free-0.012
etsy-0.011
inky-0.011
 safe-0.010
Token!
Token ID0
Feature activation+1.41e-02

Pos logit contributions
 okay+0.004
free+0.004
inky+0.004
 clean+0.004
 warm+0.003
Neg logit contributions
 islands-0.011
 cities-0.011
 obstacles-0.010
 oceans-0.010
 obstacle-0.010
Token carefully
Token ID7773
Feature activation+4.37e-02

Pos logit contributions
 islands+0.050
 cities+0.049
 obstacles+0.048
 obstacle+0.047
 oceans+0.047
Neg logit contributions
 clean-0.024
 okay-0.023
 warm-0.022
 safe-0.020
inky-0.020
Token picked
Token ID6497
Feature activation-5.01e-03

Pos logit contributions
 islands+0.090
 cities+0.089
 obstacles+0.086
 obstacle+0.084
 oceans+0.083
Neg logit contributions
 clean-0.049
 okay-0.048
 warm-0.046
free-0.045
inky-0.044
Token up
Token ID510
Feature activation+1.88e-02

Pos logit contributions
 okay+0.007
inky+0.006
free+0.006
etsy+0.006
uffy+0.006
Neg logit contributions
 islands-0.010
 cities-0.009
 obstacles-0.009
 oceans-0.009
 obstacle-0.009
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
 islands+0.041
 cities+0.040
 obstacles+0.039
 oceans+0.038
 obstacle+0.038
Neg logit contributions
 okay-0.020
inky-0.019
free-0.019
etsy-0.017
 clean-0.016
Token deaf
Token ID21815
Feature activation+3.37e-02

Pos logit contributions
 islands+0.033
 cities+0.032
 obstacles+0.031
 abrupt+0.030
 oceans+0.030
Neg logit contributions
 okay-0.023
free-0.022
inky-0.022
etsy-0.020
 clean-0.020
Token tort
Token ID7619
Feature activation+1.40e-01

Pos logit contributions
 islands+0.066
 cities+0.064
 obstacles+0.062
 abrupt+0.061
 oceans+0.060
Neg logit contributions
 okay-0.042
free-0.041
inky-0.040
 clean-0.039
 warm-0.038
Tokenoise
Token ID25678
Feature activation+4.69e-02

Pos logit contributions
 islands+0.194
 cities+0.187
 obstacles+0.183
 obstacle+0.180
 oceans+0.173
Neg logit contributions
free-0.261
un-0.253
etsy-0.252
inky-0.247
 okay-0.242
Token and
Token ID290
Feature activation+2.22e-02

Pos logit contributions
 islands+0.096
 cities+0.094
 obstacles+0.091
 oceans+0.089
 obstacle+0.088
Neg logit contributions
 okay-0.055
free-0.052
inky-0.051
etsy-0.048
 clean-0.047
Token placed
Token ID4624
Feature activation+1.26e-02

Pos logit contributions
 islands+0.046
 cities+0.045
 obstacles+0.043
 obstacle+0.042
 oceans+0.042
Neg logit contributions
 okay-0.026
free-0.024
 clean-0.024
inky-0.023
 warm-0.023
Token it
Token ID340
Feature activation-9.18e-03

Pos logit contributions
 islands+0.024
 cities+0.024
 obstacles+0.023
 oceans+0.023
 obstacle+0.022
Neg logit contributions
 okay-0.016
inky-0.015
free-0.015
etsy-0.014
uffy-0.014
Token beside
Token ID13970
Feature activation+8.31e-02

Pos logit contributions
inky+0.010
 okay+0.010
free+0.009
etsy+0.009
uffy+0.008
Neg logit contributions
 islands-0.020
 cities-0.019
 obstacles-0.019
 oceans-0.018
 obstacle-0.018
Token of
Token ID286
Feature activation-8.23e-02

Pos logit contributions
 okay+0.000
inky+0.000
free+0.000
etsy+0.000
uffy+0.000
Neg logit contributions
 islands-0.000
 cities-0.000
 obstacles-0.000
 obstacle-0.000
 oceans-0.000
Token the
Token ID262
Feature activation+7.24e-03

Pos logit contributions
inky+0.102
 okay+0.101
etsy+0.097
free+0.096
uffy+0.091
Neg logit contributions
 islands-0.162
 cities-0.159
 obstacles-0.155
 obstacle-0.150
 oceans-0.149
Token powder
Token ID11913
Feature activation-3.76e-02

Pos logit contributions
 obstacles+0.011
 cities+0.011
 hears+0.011
 obstacle+0.011
 islands+0.011
Neg logit contributions
inky-0.011
etsy-0.011
 okay-0.011
free-0.010
uffy-0.010
Token into
Token ID656
Feature activation+4.56e-02

Pos logit contributions
 okay+0.049
inky+0.049
etsy+0.047
free+0.046
uffy+0.045
Neg logit contributions
 islands-0.067
 cities-0.066
 obstacles-0.064
 oceans-0.063
 obstacle-0.062
Token his
Token ID465
Feature activation-2.41e-02

Pos logit contributions
 islands+0.094
 cities+0.092
 obstacles+0.090
 oceans+0.087
 abrupt+0.087
Neg logit contributions
 okay-0.053
inky-0.052
free-0.050
etsy-0.049
uffy-0.047
Token cu
Token ID18912
Feature activation+1.39e-01

Pos logit contributions
free+0.034
 okay+0.033
inky+0.032
etsy+0.032
he+0.031
Neg logit contributions
 islands-0.042
 cities-0.041
 obstacles-0.040
 obstacle-0.039
 oceans-0.038
Tokenpped
Token ID1496
Feature activation-3.91e-02

Pos logit contributions
 islands+0.204
 cities+0.197
 obstacles+0.194
 abrupt+0.186
 oceans+0.185
Neg logit contributions
 okay-0.249
inky-0.229
etsy-0.224
free-0.220
uffy-0.216
Token hands
Token ID2832
Feature activation+2.32e-02

Pos logit contributions
inky+0.055
 okay+0.053
etsy+0.053
uffy+0.050
free+0.050
Neg logit contributions
 cities-0.069
 islands-0.069
 obstacles-0.068
 oceans-0.065
 hears-0.063
Token and
Token ID290
Feature activation+1.94e-02

Pos logit contributions
 islands+0.032
 cities+0.031
 obstacles+0.031
 oceans+0.030
 whales+0.029
Neg logit contributions
uffy-0.037
inky-0.036
etsy-0.035
 okay-0.034
ivery-0.033
Token scattered
Token ID16830
Feature activation-3.67e-03

Pos logit contributions
 islands+0.036
 cities+0.035
 obstacles+0.034
 oceans+0.033
 hears+0.032
Neg logit contributions
inky-0.027
 okay-0.026
free-0.026
etsy-0.025
uffy-0.024
Token it
Token ID340
Feature activation-1.69e-02

Pos logit contributions
 okay+0.005
inky+0.005
etsy+0.005
free+0.005
uffy+0.004
Neg logit contributions
 islands-0.007
 cities-0.007
 obstacles-0.007
 oceans-0.006
 obstacle-0.006
Tokenuddled
Token ID32745
Feature activation-3.12e-02

Pos logit contributions
 okay+0.013
free+0.012
 clean+0.012
 warm+0.012
 OK+0.011
Neg logit contributions
 islands-0.013
 cities-0.012
 obstacles-0.012
 oceans-0.012
 obstacle-0.011
Token the
Token ID262
Feature activation+2.54e-03

Pos logit contributions
 okay+0.034
inky+0.033
free+0.032
etsy+0.030
 clean+0.029
Neg logit contributions
 islands-0.066
 cities-0.065
 obstacles-0.063
 oceans-0.061
 obstacle-0.060
Token cat
Token ID3797
Feature activation-1.66e-02

Pos logit contributions
 obstacles+0.004
 cities+0.004
 obstacle+0.004
 cliffs+0.004
 oceans+0.004
Neg logit contributions
inky-0.004
 okay-0.004
etsy-0.003
un-0.003
free-0.003
Token and
Token ID290
Feature activation+7.25e-03

Pos logit contributions
 okay+0.022
inky+0.022
free+0.021
etsy+0.020
uffy+0.020
Neg logit contributions
 islands-0.030
 cities-0.030
 obstacles-0.028
 oceans-0.028
 obstacle-0.027
Token said
Token ID531
Feature activation+1.07e-02

Pos logit contributions
 islands+0.013
 cities+0.013
 obstacles+0.012
 oceans+0.012
 whales+0.012
Neg logit contributions
inky-0.010
 okay-0.010
free-0.010
etsy-0.009
uffy-0.009
Token ''
Token ID10148
Feature activation+1.39e-01

Pos logit contributions
 islands+0.022
 cities+0.022
 obstacles+0.021
 oceans+0.021
 obstacle+0.021
Neg logit contributions
 okay-0.011
inky-0.011
free-0.011
etsy-0.010
uffy-0.010
TokenYou
Token ID1639
Feature activation+3.30e-02

Pos logit contributions
 islands+0.237
 cities+0.233
 obstacles+0.222
 oceans+0.218
 abrupt+0.214
Neg logit contributions
 okay-0.209
inky-0.200
free-0.194
 clean-0.187
etsy-0.186
Token are
Token ID389
Feature activation+1.20e-02

Pos logit contributions
 islands+0.062
 cities+0.061
 obstacles+0.059
 hears+0.057
 whales+0.057
Neg logit contributions
inky-0.040
 okay-0.038
free-0.038
uffy-0.038
etsy-0.037
Token safe
Token ID3338
Feature activation-6.51e-02

Pos logit contributions
 cities+0.022
 islands+0.022
 obstacles+0.021
 oceans+0.021
 obstacle+0.020
Neg logit contributions
inky-0.015
etsy-0.015
uffy-0.014
free-0.014
 okay-0.013
Token now
Token ID783
Feature activation-4.61e-02

Pos logit contributions
 okay+0.083
free+0.079
inky+0.077
etsy+0.073
 clean+0.072
Neg logit contributions
 islands-0.129
 cities-0.126
 obstacles-0.122
 oceans-0.118
 obstacle-0.118
Token!
Token ID0
Feature activation+1.66e-02

Pos logit contributions
 okay+0.067
inky+0.066
free+0.065
etsy+0.063
uffy+0.062
Neg logit contributions
 islands-0.076
 cities-0.075
 obstacles-0.073
 obstacle-0.071
 abrupt-0.070
Token
Token ID198
Feature activation-1.72e-02

Pos logit contributions
 okay+0.016
free+0.016
inky+0.016
uffy+0.015
etsy+0.015
Neg logit contributions
 islands-0.033
 cities-0.033
 obstacles-0.032
 abrupt-0.031
 oceans-0.031
TokenBut
Token ID1537
Feature activation+3.33e-02

Pos logit contributions
 okay+0.027
inky+0.027
free+0.026
uffy+0.025
etsy+0.025
Neg logit contributions
 islands-0.028
 cities-0.028
 obstacles-0.026
 abrupt-0.026
 oceans-0.025
Token his
Token ID465
Feature activation-2.67e-02

Pos logit contributions
 islands+0.060
 cities+0.059
 obstacles+0.055
 oceans+0.054
 abrupt+0.053
Neg logit contributions
inky-0.048
free-0.047
 okay-0.045
etsy-0.044
uffy-0.042
Token mom
Token ID1995
Feature activation+6.60e-03

Pos logit contributions
free+0.024
 okay+0.021
inky+0.021
etsy+0.020
he+0.018
Neg logit contributions
 islands-0.062
 cities-0.061
 obstacles-0.061
 obstacle-0.059
 abrupt-0.058
Token said
Token ID531
Feature activation+1.08e-02

Pos logit contributions
 islands+0.012
 cities+0.012
 obstacles+0.011
 abrupt+0.011
 hears+0.011
Neg logit contributions
free-0.009
inky-0.009
 okay-0.009
etsy-0.008
uffy-0.008
Token ''
Token ID10148
Feature activation+1.37e-01

Pos logit contributions
 islands+0.022
 cities+0.022
 obstacles+0.021
 oceans+0.020
 whales+0.020
Neg logit contributions
inky-0.012
 okay-0.012
free-0.012
etsy-0.011
he-0.011
TokenNo
Token ID2949
Feature activation+6.49e-02

Pos logit contributions
 islands+0.234
 cities+0.227
 obstacles+0.218
 obstacle+0.211
 oceans+0.211
Neg logit contributions
 okay-0.210
free-0.205
inky-0.198
etsy-0.193
 clean-0.187
Token!
Token ID0
Feature activation+3.10e-02

Pos logit contributions
 islands+0.129
 cities+0.127
 obstacles+0.124
 obstacle+0.120
 abrupt+0.119
Neg logit contributions
 okay-0.078
inky-0.077
free-0.076
etsy-0.072
 clean-0.068
Token''
Token ID7061
Feature activation+5.62e-02

Pos logit contributions
 islands+0.053
 cities+0.053
 obstacles+0.049
 abrupt+0.049
 oceans+0.048
Neg logit contributions
free-0.043
 okay-0.043
inky-0.042
he-0.041
 clean-0.040
Token
Token ID198
Feature activation+3.74e-04

Pos logit contributions
 islands+0.114
 cities+0.112
 obstacles+0.108
 oceans+0.106
 abrupt+0.105
Neg logit contributions
 okay-0.065
inky-0.063
free-0.062
etsy-0.058
 clean-0.057
Token
Token ID198
Feature activation-7.36e-04

Pos logit contributions
 islands+0.001
 cities+0.001
 obstacles+0.001
 abrupt+0.001
 oceans+0.001
Neg logit contributions
 okay-0.001
free-0.001
inky-0.001
etsy-0.001
uffy-0.001
Token replies
Token ID20616
Feature activation+1.12e-02

Pos logit contributions
 okay+0.043
 clean+0.040
 warm+0.038
inky+0.038
 Her+0.038
Neg logit contributions
 islands-0.038
 cities-0.037
 obstacles-0.035
 oceans-0.035
 obstacle-0.034
Token.
Token ID13
Feature activation+4.14e-03

Pos logit contributions
 islands+0.021
 cities+0.020
 obstacles+0.019
 abrupt+0.019
 whales+0.019
Neg logit contributions
inky-0.014
etsy-0.014
free-0.014
uffy-0.013
 okay-0.013
Token
Token ID198
Feature activation-8.01e-03

Pos logit contributions
 islands+0.010
 cities+0.009
 obstacles+0.009
 oceans+0.009
 abrupt+0.009
Neg logit contributions
 okay-0.004
free-0.004
inky-0.003
etsy-0.003
 clean-0.003
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
 okay+0.009
free+0.009
inky+0.008
etsy+0.008
less+0.007
Neg logit contributions
 islands-0.017
 cities-0.016
 obstacles-0.016
 abrupt-0.016
 oceans-0.015
Token"
Token ID1
Feature activation+2.22e-02

Pos logit contributions
 okay+0.016
free+0.015
inky+0.015
etsy+0.014
uffy+0.014
Neg logit contributions
 islands-0.023
 cities-0.022
 obstacles-0.022
 abrupt-0.021
 oceans-0.021
TokenJu
Token ID33018
Feature activation+1.37e-01

Pos logit contributions
 islands+0.046
 cities+0.045
 obstacle+0.044
 obstacles+0.044
 cliffs+0.042
Neg logit contributions
free-0.027
 okay-0.027
etsy-0.025
inky-0.024
 warm-0.023
Tokenice
Token ID501
Feature activation-1.90e-02

Pos logit contributions
 islands+0.267
 cities+0.261
 obstacles+0.256
 obstacle+0.245
 oceans+0.241
Neg logit contributions
 okay-0.175
inky-0.170
free-0.166
uffy-0.161
etsy-0.161
Token,
Token ID11
Feature activation+2.63e-02

Pos logit contributions
 okay+0.025
free+0.023
inky+0.023
etsy+0.022
 clean+0.022
Neg logit contributions
 islands-0.037
 cities-0.036
 obstacles-0.035
 oceans-0.034
 obstacle-0.034
Token please
Token ID3387
Feature activation-1.29e-03

Pos logit contributions
 islands+0.055
 cities+0.054
 obstacles+0.052
 oceans+0.051
 obstacle+0.051
Neg logit contributions
 okay-0.033
free-0.029
 clean-0.029
inky-0.028
 warm-0.028
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
 okay+0.002
free+0.001
inky+0.001
etsy+0.001
he+0.001
Neg logit contributions
 islands-0.003
 cities-0.002
 obstacles-0.002
 oceans-0.002
 abrupt-0.002
Token Thank
Token ID6952
Feature activation+2.14e-02

Pos logit contributions
 islands+0.024
 cities+0.023
 obstacles+0.022
 oceans+0.022
 abrupt+0.021
Neg logit contributions
 okay-0.019
free-0.019
inky-0.019
he-0.018
etsy-0.018
Token they
Token ID484
Feature activation-2.86e-03

Pos logit contributions
 islands+0.022
 cities+0.022
 obstacles+0.021
 oceans+0.020
 whales+0.020
Neg logit contributions
inky-0.015
free-0.014
 okay-0.014
etsy-0.014
uffy-0.013
Token saw
Token ID2497
Feature activation-1.46e-02

Pos logit contributions
 okay+0.004
 clean+0.003
free+0.003
inky+0.003
 warm+0.003
Neg logit contributions
 islands-0.006
 cities-0.006
 obstacles-0.006
 obstacle-0.005
 oceans-0.005
Token that
Token ID326
Feature activation-3.09e-02

Pos logit contributions
 okay+0.015
free+0.014
inky+0.014
etsy+0.013
 clean+0.013
Neg logit contributions
 islands-0.032
 cities-0.031
 obstacles-0.030
 oceans-0.030
 obstacle-0.030
Token it
Token ID340
Feature activation-9.07e-03

Pos logit contributions
 okay+0.039
inky+0.038
free+0.038
etsy+0.036
 clean+0.034
Neg logit contributions
 islands-0.060
 cities-0.059
 obstacles-0.057
 oceans-0.055
 obstacle-0.055
Token said
Token ID531
Feature activation+1.87e-02

Pos logit contributions
 okay+0.011
free+0.011
inky+0.010
etsy+0.010
 clean+0.010
Neg logit contributions
 islands-0.018
 cities-0.018
 obstacles-0.017
 oceans-0.017
 obstacle-0.017
Token 40
Token ID2319
Feature activation+1.37e-01

Pos logit contributions
 islands+0.032
 cities+0.032
 obstacles+0.030
 oceans+0.030
 whales+0.029
Neg logit contributions
inky-0.026
free-0.025
 okay-0.025
etsy-0.025
he-0.024
Token degrees
Token ID7370
Feature activation+2.94e-02

Pos logit contributions
 islands+0.213
 cities+0.212
 oceans+0.196
 abrupt+0.193
 obstacles+0.185
Neg logit contributions
 okay-0.234
 clean-0.217
free-0.217
inky-0.215
 warm-0.210
Token!
Token ID0
Feature activation+2.58e-02

Pos logit contributions
 islands+0.059
 cities+0.058
 obstacles+0.055
 oceans+0.054
 obstacle+0.054
Neg logit contributions
 okay-0.036
free-0.035
inky-0.034
 clean-0.032
etsy-0.032
Token
Token ID198
Feature activation-3.43e-03

Pos logit contributions
 islands+0.057
 cities+0.056
 obstacles+0.054
 oceans+0.053
 abrupt+0.053
Neg logit contributions
 okay-0.025
free-0.024
inky-0.023
etsy-0.022
 clean-0.021
Token
Token ID198
Feature activation-6.98e-03

Pos logit contributions
 okay+0.003
free+0.003
inky+0.003
etsy+0.002
 clean+0.002
Neg logit contributions
 islands-0.008
 cities-0.008
 obstacles-0.008
 oceans-0.008
 abrupt-0.008
TokenJoe
Token ID19585
Feature activation+4.23e-02

Pos logit contributions
 okay+0.008
free+0.008
inky+0.008
etsy+0.007
 clean+0.007
Neg logit contributions
 islands-0.014
 cities-0.014
 obstacles-0.013
 abrupt-0.013
 oceans-0.013
TokenIt
Token ID1026
Feature activation+6.18e-03

Pos logit contributions
 islands+0.097
 cities+0.095
 obstacles+0.091
 oceans+0.088
 obstacle+0.088
Neg logit contributions
 okay-0.079
free-0.076
inky-0.074
etsy-0.071
 clean-0.069
Token's
Token ID338
Feature activation+1.97e-02

Pos logit contributions
 islands+0.013
 cities+0.013
 obstacles+0.012
 obstacle+0.012
 oceans+0.012
Neg logit contributions
 okay-0.007
free-0.007
inky-0.007
etsy-0.006
he-0.006
Token poisoned
Token ID29419
Feature activation+2.47e-02

Pos logit contributions
 islands+0.042
 obstacles+0.041
 cities+0.041
 abrupt+0.039
 whales+0.039
Neg logit contributions
free-0.020
he-0.019
inky-0.019
etsy-0.018
uffy-0.017
Token,"
Token ID553
Feature activation+1.67e-02

Pos logit contributions
 islands+0.045
 cities+0.044
 obstacles+0.042
 abrupt+0.042
 obstacle+0.041
Neg logit contributions
 okay-0.037
free-0.034
inky-0.033
 warm-0.032
 clean-0.032
Token replied
Token ID8712
Feature activation+2.10e-02

Pos logit contributions
 islands+0.035
 cities+0.034
 obstacles+0.033
 oceans+0.032
 obstacle+0.032
Neg logit contributions
 okay-0.019
free-0.018
inky-0.018
etsy-0.017
 clean-0.016
Token Wise
Token ID28908
Feature activation+1.36e-01

Pos logit contributions
 islands+0.043
 cities+0.043
 obstacles+0.042
 heights+0.041
 obstacle+0.041
Neg logit contributions
inky-0.021
 okay-0.021
etsy-0.021
he-0.020
free-0.020
Token Owl
Token ID37007
Feature activation+8.18e-02

Pos logit contributions
 islands+0.258
 cities+0.251
 obstacles+0.241
 oceans+0.236
 obstacle+0.233
Neg logit contributions
 okay-0.184
free-0.178
inky-0.176
etsy-0.166
 clean-0.160
Token.
Token ID13
Feature activation-3.08e-03

Pos logit contributions
 islands+0.151
 cities+0.149
 obstacles+0.143
 obstacle+0.141
 whales+0.141
Neg logit contributions
 okay-0.101
inky-0.098
etsy-0.095
free-0.094
he-0.093
Token "
Token ID366
Feature activation+5.03e-02

Pos logit contributions
 okay+0.004
free+0.004
inky+0.004
etsy+0.004
 clean+0.004
Neg logit contributions
 islands-0.006
 cities-0.005
 obstacles-0.005
 oceans-0.005
 abrupt-0.005
TokenOh
Token ID5812
Feature activation+1.92e-02

Pos logit contributions
 islands+0.091
 cities+0.089
 obstacles+0.086
 oceans+0.083
 obstacle+0.083
Neg logit contributions
 okay-0.072
free-0.069
inky-0.068
etsy-0.065
 clean-0.063
Token no
Token ID645
Feature activation+1.46e-02

Pos logit contributions
 islands+0.038
 cities+0.038
 obstacles+0.037
 obstacle+0.036
 abrupt+0.036
Neg logit contributions
inky-0.021
free-0.021
 okay-0.020
etsy-0.020
he-0.020
Token.
Token ID13
Feature activation-1.93e-03

Pos logit contributions
 islands+0.045
 cities+0.044
 obstacles+0.043
 oceans+0.042
 obstacle+0.041
Neg logit contributions
 okay-0.024
free-0.022
inky-0.022
etsy-0.021
 clean-0.020
Token It
Token ID632
Feature activation+3.40e-03

Pos logit contributions
 okay+0.003
free+0.003
inky+0.003
etsy+0.003
 clean+0.003
Neg logit contributions
 islands-0.003
 cities-0.003
 obstacles-0.003
 oceans-0.003
 obstacle-0.003
Token is
Token ID318
Feature activation+1.65e-02

Pos logit contributions
 islands+0.006
 cities+0.006
 obstacles+0.006
 oceans+0.006
 obstacle+0.006
Neg logit contributions
 okay-0.005
free-0.005
inky-0.004
 clean-0.004
etsy-0.004
Token called
Token ID1444
Feature activation-2.69e-02

Pos logit contributions
 islands+0.036
 cities+0.035
 obstacles+0.034
 oceans+0.033
 obstacle+0.033
Neg logit contributions
 okay-0.017
free-0.016
inky-0.016
etsy-0.015
 clean-0.015
Token The
Token ID383
Feature activation+3.09e-02

Pos logit contributions
 okay+0.035
free+0.034
inky+0.033
etsy+0.032
 clean+0.031
Neg logit contributions
 islands-0.051
 cities-0.050
 obstacles-0.048
 oceans-0.047
 obstacle-0.046
Token Wizard
Token ID16884
Feature activation+1.36e-01

Pos logit contributions
 islands+0.057
 cities+0.056
 obstacles+0.054
 abrupt+0.053
 oceans+0.053
Neg logit contributions
 okay-0.043
free-0.039
 warm-0.038
inky-0.038
 clean-0.038
Token of
Token ID286
Feature activation-5.72e-02

Pos logit contributions
 islands+0.297
 cities+0.292
 obstacles+0.288
 obstacle+0.279
 oceans+0.278
Neg logit contributions
 okay-0.139
free-0.135
inky-0.126
 clean-0.118
etsy-0.118
Token Oz
Token ID18024
Feature activation-8.01e-03

Pos logit contributions
 okay+0.058
free+0.053
 warm+0.052
 clean+0.051
 safe+0.049
Neg logit contributions
 islands-0.123
 cities-0.120
 obstacles-0.118
 obstacle-0.117
 abrupt-0.116
Token.
Token ID13
Feature activation-6.00e-03

Pos logit contributions
 okay+0.009
free+0.008
inky+0.008
 clean+0.008
 warm+0.007
Neg logit contributions
 islands-0.017
 cities-0.017
 obstacles-0.017
 obstacle-0.016
 oceans-0.016
Token Do
Token ID2141
Feature activation-6.37e-03

Pos logit contributions
 okay+0.008
free+0.008
inky+0.008
etsy+0.007
 clean+0.007
Neg logit contributions
 islands-0.011
 cities-0.011
 obstacles-0.011
 obstacle-0.010
 oceans-0.010
Token you
Token ID345
Feature activation-9.08e-03

Pos logit contributions
 okay+0.008
free+0.007
inky+0.007
etsy+0.007
 clean+0.007
Neg logit contributions
 islands-0.013
 cities-0.013
 obstacles-0.012
 oceans-0.012
 obstacle-0.012
Token Ele
Token ID15987
Feature activation+2.24e-02

Pos logit contributions
 cities+0.006
 islands+0.006
 obstacles+0.005
 abrupt+0.005
 oceans+0.005
Neg logit contributions
 okay-0.002
free-0.002
inky-0.002
he-0.002
 warm-0.002
Tokenph
Token ID746
Feature activation+5.79e-02

Pos logit contributions
 islands+0.035
 cities+0.034
 obstacles+0.032
 oceans+0.031
 abrupt+0.031
Neg logit contributions
 okay-0.036
inky-0.036
free-0.036
etsy-0.034
 clean-0.033
Tokenants
Token ID1187
Feature activation-5.92e-03

Pos logit contributions
 islands+0.076
 cities+0.074
 whales+0.070
 obstacles+0.069
 abrupt+0.067
Neg logit contributions
 okay-0.108
free-0.103
 warm-0.101
inky-0.101
 clean-0.100
Token have
Token ID423
Feature activation-1.81e-03

Pos logit contributions
 okay+0.007
free+0.006
inky+0.006
etsy+0.006
 clean+0.006
Neg logit contributions
 islands-0.012
 cities-0.012
 obstacles-0.012
 oceans-0.011
 obstacle-0.011
Token strong
Token ID1913
Feature activation+2.56e-02

Pos logit contributions
 okay+0.002
free+0.002
inky+0.002
etsy+0.002
 clean+0.002
Neg logit contributions
 islands-0.004
 cities-0.004
 obstacles-0.004
 obstacle-0.003
 abrupt-0.003
Token tr
Token ID491
Feature activation+1.35e-01

Pos logit contributions
 islands+0.044
 cities+0.043
 abrupt+0.043
 oceans+0.040
 obstacle+0.040
Neg logit contributions
 okay-0.037
free-0.036
 clean-0.035
 warm-0.034
inky-0.033
Tokenunks
Token ID14125
Feature activation-1.67e-04

Pos logit contributions
 islands+0.229
 cities+0.222
 obstacles+0.206
 oceans+0.204
 obstacle+0.203
Neg logit contributions
inky-0.221
he-0.216
un-0.210
uffy-0.208
etsy-0.195
Token that
Token ID326
Feature activation-1.66e-02

Pos logit contributions
 okay+0.000
free+0.000
inky+0.000
etsy+0.000
 clean+0.000
Neg logit contributions
 islands-0.000
 cities-0.000
 obstacles-0.000
 oceans-0.000
 obstacle-0.000
Token they
Token ID484
Feature activation+1.21e-02

Pos logit contributions
 okay+0.019
free+0.018
inky+0.018
etsy+0.017
 clean+0.017
Neg logit contributions
 islands-0.034
 cities-0.034
 obstacles-0.032
 oceans-0.032
 obstacle-0.031
Token can
Token ID460
Feature activation+2.65e-03

Pos logit contributions
 islands+0.025
 cities+0.025
 obstacles+0.024
 oceans+0.023
 abrupt+0.023
Neg logit contributions
 okay-0.014
free-0.014
inky-0.013
 clean-0.013
etsy-0.012
Token use
Token ID779
Feature activation+1.92e-02

Pos logit contributions
 islands+0.006
 cities+0.006
 obstacles+0.005
 oceans+0.005
 obstacle+0.005
Neg logit contributions
 okay-0.003
inky-0.003
free-0.003
 clean-0.003
etsy-0.003
Token was
Token ID373
Feature activation+1.54e-02

Pos logit contributions
 okay+0.020
free+0.019
inky+0.019
 clean+0.018
etsy+0.018
Neg logit contributions
 islands-0.034
 cities-0.033
 obstacles-0.032
 oceans-0.031
 obstacle-0.031
Token determined
Token ID5295
Feature activation+4.95e-02

Pos logit contributions
 islands+0.032
 cities+0.032
 obstacles+0.031
 oceans+0.030
 abrupt+0.030
Neg logit contributions
inky-0.017
 okay-0.017
free-0.016
etsy-0.016
uffy-0.015
Token to
Token ID284
Feature activation+1.48e-02

Pos logit contributions
 islands+0.128
 cities+0.126
 oceans+0.120
 obstacle+0.118
rob+0.118
Neg logit contributions
 okay-0.043
 clean-0.034
 safe-0.033
free-0.030
 warm-0.030
Token be
Token ID307
Feature activation+1.45e-02

Pos logit contributions
 islands+0.030
 cities+0.029
 oceans+0.027
 obstacles+0.027
 abrupt+0.027
Neg logit contributions
 okay-0.019
 clean-0.017
free-0.017
inky-0.017
 safe-0.016
Token the
Token ID262
Feature activation-5.62e-03

Pos logit contributions
 islands+0.032
 cities+0.031
 obstacles+0.030
 oceans+0.030
 obstacle+0.030
Neg logit contributions
 okay-0.015
free-0.014
inky-0.013
 clean-0.013
 warm-0.012
Token vict
Token ID2210
Feature activation+1.35e-01

Pos logit contributions
 okay+0.007
free+0.006
inky+0.006
 clean+0.006
etsy+0.006
Neg logit contributions
 islands-0.012
 cities-0.011
 obstacles-0.011
 oceans-0.011
 abrupt-0.011
Tokenor
Token ID273
Feature activation+2.45e-02

Pos logit contributions
 islands+0.248
 cities+0.243
 obstacles+0.234
 oceans+0.230
 obstacle+0.224
Neg logit contributions
 okay-0.195
free-0.193
inky-0.189
etsy-0.181
un-0.175
Token.
Token ID13
Feature activation+1.26e-05

Pos logit contributions
 islands+0.054
 cities+0.053
 obstacles+0.051
 oceans+0.050
 obstacle+0.050
Neg logit contributions
 okay-0.026
free-0.024
inky-0.024
etsy-0.022
 clean-0.021
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 cities+0.000
 islands+0.000
 obstacles+0.000
 oceans+0.000
 abrupt+0.000
Neg logit contributions
 okay-0.000
free-0.000
inky-0.000
etsy-0.000
 clean-0.000
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
 okay+0.011
free+0.010
inky+0.010
etsy+0.010
 clean+0.009
Neg logit contributions
 islands-0.024
 cities-0.024
 obstacles-0.023
 abrupt-0.022
 oceans-0.022
TokenBut
Token ID1537
Feature activation+3.35e-02

Pos logit contributions
 okay+0.019
inky+0.018
free+0.018
etsy+0.017
uffy+0.017
Neg logit contributions
 islands-0.024
 cities-0.024
 obstacles-0.024
 abrupt-0.022
 obstacle-0.022
Token leaned
Token ID23831
Feature activation-8.64e-03

Pos logit contributions
 islands+0.041
 cities+0.040
 obstacles+0.040
 oceans+0.039
 obstacle+0.039
Neg logit contributions
 okay-0.021
 clean-0.021
 warm-0.019
 safe-0.018
inky-0.017
Token close
Token ID1969
Feature activation+3.04e-02

Pos logit contributions
 okay+0.010
inky+0.010
free+0.010
etsy+0.009
 clean+0.009
Neg logit contributions
 islands-0.017
 cities-0.017
 obstacles-0.016
 oceans-0.016
 abrupt-0.016
Token to
Token ID284
Feature activation+2.81e-02

Pos logit contributions
 islands+0.069
 cities+0.067
 obstacles+0.065
 obstacle+0.065
 abrupt+0.064
Neg logit contributions
 okay-0.029
free-0.026
 clean-0.025
 warm-0.025
inky-0.025
Token him
Token ID683
Feature activation-3.05e-02

Pos logit contributions
 islands+0.066
 cities+0.065
 obstacles+0.063
 oceans+0.062
 obstacle+0.062
Neg logit contributions
 okay-0.025
free-0.023
 clean-0.021
 warm-0.021
inky-0.020
Token and
Token ID290
Feature activation+1.49e-02

Pos logit contributions
 okay+0.035
free+0.034
inky+0.034
etsy+0.032
 clean+0.031
Neg logit contributions
 islands-0.062
 cities-0.061
 obstacles-0.059
 oceans-0.057
 obstacle-0.057
Token cu
Token ID18912
Feature activation+1.35e-01

Pos logit contributions
 islands+0.030
 cities+0.029
 obstacles+0.028
 obstacle+0.028
 oceans+0.027
Neg logit contributions
 okay-0.019
free-0.017
inky-0.017
 clean-0.016
etsy-0.016
Tokenpped
Token ID1496
Feature activation-4.85e-02

Pos logit contributions
 islands+0.219
 cities+0.213
 obstacles+0.205
 oceans+0.197
 abrupt+0.196
Neg logit contributions
 okay-0.224
inky-0.207
free-0.206
etsy-0.200
 clean-0.196
Token her
Token ID607
Feature activation-8.56e-03

Pos logit contributions
 okay+0.060
inky+0.059
free+0.057
etsy+0.055
uffy+0.053
Neg logit contributions
 islands-0.093
 cities-0.092
 obstacles-0.089
 oceans-0.087
 abrupt-0.086
Token mouth
Token ID5422
Feature activation+2.49e-03

Pos logit contributions
 okay+0.011
free+0.010
inky+0.010
etsy+0.010
he+0.009
Neg logit contributions
 islands-0.017
 cities-0.016
 obstacles-0.016
 obstacle-0.015
 oceans-0.015
Token with
Token ID351
Feature activation+4.22e-02

Pos logit contributions
 islands+0.004
 cities+0.004
 obstacles+0.004
 oceans+0.004
 abrupt+0.004
Neg logit contributions
uffy-0.004
inky-0.004
etsy-0.003
free-0.003
 okay-0.003
Token her
Token ID607
Feature activation-1.32e-02

Pos logit contributions
 islands+0.090
 cities+0.088
 obstacles+0.088
 oceans+0.084
 obstacle+0.083
Neg logit contributions
inky-0.045
 okay-0.043
free-0.042
etsy-0.041
uffy-0.040
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
 okay+0.051
free+0.047
inky+0.046
etsy+0.043
 clean+0.042
Neg logit contributions
 islands-0.122
 cities-0.120
 obstacles-0.116
 oceans-0.113
 obstacle-0.113
Token They
Token ID1119
Feature activation-1.02e-02

Pos logit contributions
 okay+0.019
free+0.018
inky+0.018
etsy+0.017
 clean+0.017
Neg logit contributions
 islands-0.038
 cities-0.038
 obstacles-0.036
 oceans-0.035
 abrupt-0.035
Token said
Token ID531
Feature activation-5.90e-03

Pos logit contributions
 okay+0.013
 clean+0.012
free+0.011
inky+0.011
 warm+0.011
Neg logit contributions
 islands-0.020
 cities-0.020
 obstacles-0.020
 obstacle-0.019
 oceans-0.019
Token,
Token ID11
Feature activation+2.60e-02

Pos logit contributions
 okay+0.006
inky+0.006
free+0.006
etsy+0.005
he+0.005
Neg logit contributions
 islands-0.013
 cities-0.013
 obstacles-0.012
 oceans-0.012
 whales-0.012
Token "
Token ID366
Feature activation+6.52e-02

Pos logit contributions
 islands+0.046
 cities+0.045
 obstacles+0.043
 oceans+0.042
 abrupt+0.041
Neg logit contributions
 okay-0.038
inky-0.036
free-0.036
etsy-0.035
 clean-0.034
TokenJu
Token ID33018
Feature activation+1.35e-01

Pos logit contributions
 islands+0.108
 cities+0.106
 obstacles+0.102
 obstacle+0.099
 oceans+0.098
Neg logit contributions
 okay-0.104
free-0.100
inky-0.099
etsy-0.095
 clean-0.093
Tokenice
Token ID501
Feature activation-1.90e-02

Pos logit contributions
 islands+0.246
 cities+0.242
 obstacles+0.235
 obstacle+0.226
 oceans+0.222
Neg logit contributions
 okay-0.184
inky-0.180
free-0.178
uffy-0.172
etsy-0.169
Token,
Token ID11
Feature activation+2.60e-02

Pos logit contributions
 okay+0.021
free+0.020
inky+0.019
etsy+0.018
he+0.018
Neg logit contributions
 islands-0.040
 cities-0.039
 obstacles-0.037
 whales-0.037
 obstacle-0.036
Token cheese
Token ID9891
Feature activation-1.19e-02

Pos logit contributions
 islands+0.052
 cities+0.050
 obstacles+0.049
 oceans+0.048
 abrupt+0.047
Neg logit contributions
 okay-0.033
free-0.031
inky-0.031
etsy-0.029
 clean-0.029
Token and
Token ID290
Feature activation+1.57e-03

Pos logit contributions
 okay+0.013
free+0.012
inky+0.012
etsy+0.011
 clean+0.011
Neg logit contributions
 islands-0.025
 cities-0.025
 obstacles-0.024
 oceans-0.023
 obstacle-0.023
Token crack
Token ID8469
Feature activation-3.77e-04

Pos logit contributions
 cities+0.003
 islands+0.003
 abrupt+0.003
 obstacles+0.003
roads+0.003
Neg logit contributions
 okay-0.002
 clean-0.002
 warm-0.002
 milk-0.002
 safe-0.002
Token
Token ID198
Feature activation-1.43e-02

Pos logit contributions
 okay+0.010
free+0.010
inky+0.010
etsy+0.009
 clean+0.008
Neg logit contributions
 islands-0.029
 cities-0.028
 obstacles-0.027
 oceans-0.027
 abrupt-0.027
TokenAfter
Token ID3260
Feature activation-5.04e-02

Pos logit contributions
 okay+0.021
free+0.020
inky+0.020
etsy+0.019
 clean+0.018
Neg logit contributions
 islands-0.026
 cities-0.025
 obstacles-0.024
 oceans-0.023
 obstacle-0.023
Token hours
Token ID2250
Feature activation-4.70e-02

Pos logit contributions
inky+0.068
 okay+0.065
etsy+0.065
free+0.061
uffy+0.060
Neg logit contributions
 islands-0.096
 cities-0.095
 obstacles-0.091
 oceans-0.089
 whales-0.087
Token and
Token ID290
Feature activation-2.79e-02

Pos logit contributions
 okay+0.068
inky+0.065
free+0.060
 OK+0.058
 warm+0.056
Neg logit contributions
 cities-0.085
 islands-0.084
 obstacles-0.079
 oceans-0.077
 seconds-0.077
Token hours
Token ID2250
Feature activation-4.94e-02

Pos logit contributions
 okay+0.046
inky+0.044
free+0.044
etsy+0.042
 clean+0.041
Neg logit contributions
 islands-0.045
 cities-0.044
 obstacles-0.042
 oceans-0.041
 obstacle-0.040
Token of
Token ID286
Feature activation-1.32e-01

Pos logit contributions
 okay+0.078
inky+0.072
 OK+0.067
free+0.063
uffy+0.063
Neg logit contributions
 cities-0.085
 islands-0.084
 obstacles-0.079
 oceans-0.079
 seconds-0.078
Token hard
Token ID1327
Feature activation-7.47e-03

Pos logit contributions
 okay+0.137
free+0.127
 clean+0.125
 warm+0.120
inky+0.115
Neg logit contributions
 islands-0.291
 cities-0.282
 abrupt-0.272
 obstacles-0.268
 oceans-0.267
Token work
Token ID670
Feature activation-2.11e-02

Pos logit contributions
 okay+0.010
inky+0.009
etsy+0.009
free+0.009
uffy+0.008
Neg logit contributions
 islands-0.015
 cities-0.014
 obstacles-0.014
 oceans-0.014
 obstacle-0.014
Token,
Token ID11
Feature activation-3.32e-03

Pos logit contributions
 okay+0.023
inky+0.023
free+0.022
etsy+0.022
uffy+0.020
Neg logit contributions
 cities-0.044
 islands-0.043
 obstacles-0.043
 oceans-0.041
 obstacle-0.040
Token Z
Token ID1168
Feature activation+4.27e-03

Pos logit contributions
inky+0.005
free+0.005
 okay+0.004
etsy+0.004
 clean+0.004
Neg logit contributions
 cities-0.006
 islands-0.006
 obstacles-0.006
 oceans-0.006
 heights-0.006
Tokenara
Token ID3301
Feature activation-2.12e-02

Pos logit contributions
 islands+0.005
 cities+0.005
 obstacles+0.005
 abrupt+0.005
 oceans+0.005
Neg logit contributions
free-0.008
 okay-0.008
 clean-0.008
inky-0.008
 warm-0.008
TokenOne
Token ID3198
Feature activation+4.38e-02

Pos logit contributions
 okay+0.032
free+0.031
inky+0.030
etsy+0.029
 clean+0.029
Neg logit contributions
 islands-0.019
 cities-0.019
 obstacles-0.018
 oceans-0.017
 obstacle-0.017
Token day
Token ID1110
Feature activation-1.46e-03

Pos logit contributions
 obstacle+0.078
 islands+0.078
 whales+0.078
 cities+0.076
 obstacles+0.073
Neg logit contributions
inky-0.062
etsy-0.058
 okay-0.056
free-0.055
 bleed-0.051
Token,
Token ID11
Feature activation-6.70e-04

Pos logit contributions
inky+0.002
etsy+0.002
free+0.002
 okay+0.002
he+0.002
Neg logit contributions
 islands-0.002
 cities-0.002
 whales-0.002
 hears-0.002
 obstacles-0.002
Token after
Token ID706
Feature activation-4.61e-02

Pos logit contributions
inky+0.001
 okay+0.001
etsy+0.001
free+0.001
uffy+0.001
Neg logit contributions
 islands-0.001
 cities-0.001
 obstacles-0.001
 whales-0.001
 oceans-0.001
Token hours
Token ID2250
Feature activation-4.76e-02

Pos logit contributions
 okay+0.045
free+0.041
inky+0.040
 warm+0.038
 clean+0.038
Neg logit contributions
 islands-0.101
 cities-0.100
 obstacles-0.096
 obstacle-0.095
 abrupt-0.095
Token of
Token ID286
Feature activation-1.30e-01

Pos logit contributions
 okay+0.065
inky+0.065
free+0.062
etsy+0.058
 warm+0.056
Neg logit contributions
 islands-0.090
 cities-0.087
 obstacles-0.083
 cliffs-0.082
 oceans-0.082
Token z
Token ID1976
Feature activation-2.62e-03

Pos logit contributions
 okay+0.173
 warm+0.160
 clean+0.157
 safe+0.156
free+0.153
Neg logit contributions
 islands-0.245
 cities-0.238
 abrupt-0.229
 whales-0.222
 oceans-0.222
Tokenipping
Token ID4501
Feature activation-1.75e-02

Pos logit contributions
 okay+0.004
inky+0.004
free+0.004
uffy+0.004
etsy+0.004
Neg logit contributions
 islands-0.004
 cities-0.004
 abrupt-0.004
 obstacles-0.004
 oceans-0.004
Token around
Token ID1088
Feature activation-8.95e-03

Pos logit contributions
 okay+0.017
free+0.016
inky+0.016
 clean+0.015
 warm+0.015
Neg logit contributions
 islands-0.039
 cities-0.038
 abrupt-0.037
 obstacles-0.036
 obstacle-0.036
Token,
Token ID11
Feature activation+1.22e-02

Pos logit contributions
free+0.006
 okay+0.006
 clean+0.006
 warm+0.005
 safe+0.005
Neg logit contributions
 islands-0.022
 cities-0.022
 abrupt-0.022
 obstacle-0.021
 obstacles-0.021
Token one
Token ID530
Feature activation+4.78e-02

Pos logit contributions
 islands+0.025
 cities+0.025
 obstacles+0.024
 oceans+0.024
 whales+0.023
Neg logit contributions
inky-0.014
 okay-0.014
free-0.013
etsy-0.013
uffy-0.012
Token
Token ID198
Feature activation-1.90e-02

Pos logit contributions
 okay+0.018
free+0.017
inky+0.016
etsy+0.015
 clean+0.015
Neg logit contributions
 islands-0.038
 cities-0.037
 obstacles-0.036
 oceans-0.035
 abrupt-0.035
TokenFinally
Token ID11158
Feature activation-1.59e-02

Pos logit contributions
 okay+0.024
free+0.023
inky+0.023
etsy+0.022
 clean+0.021
Neg logit contributions
 islands-0.037
 cities-0.036
 obstacles-0.035
 oceans-0.034
 obstacle-0.034
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
inky+0.021
free+0.021
etsy+0.019
 okay+0.019
un+0.018
Neg logit contributions
 islands-0.031
 cities-0.030
 obstacles-0.028
 oceans-0.028
 heights-0.028
Token after
Token ID706
Feature activation-5.55e-02

Pos logit contributions
inky+0.024
 okay+0.022
free+0.022
etsy+0.022
uffy+0.021
Neg logit contributions
 cities-0.032
 islands-0.032
 oceans-0.030
 obstacles-0.030
 heights-0.029
Token days
Token ID1528
Feature activation-4.78e-02

Pos logit contributions
 okay+0.063
inky+0.060
free+0.059
etsy+0.058
 clean+0.053
Neg logit contributions
 islands-0.118
 cities-0.115
 obstacles-0.113
 oceans-0.110
 obstacle-0.108
Token of
Token ID286
Feature activation-1.29e-01

Pos logit contributions
 okay+0.076
inky+0.072
free+0.071
 warm+0.067
 OK+0.065
Neg logit contributions
 cities-0.077
 islands-0.077
 oceans-0.070
 obstacles-0.069
 villages-0.068
Token walking
Token ID6155
Feature activation-1.29e-02

Pos logit contributions
 okay+0.172
free+0.159
 clean+0.153
inky+0.152
 warm+0.151
Neg logit contributions
 islands-0.250
 cities-0.243
 abrupt-0.231
 obstacles-0.231
 oceans-0.229
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
 okay+0.012
free+0.011
inky+0.011
etsy+0.010
 clean+0.010
Neg logit contributions
 islands-0.030
 cities-0.029
 obstacles-0.028
 abrupt-0.028
 obstacle-0.027
Token waving
Token ID25849
Feature activation+1.07e-02

Pos logit contributions
 okay+0.019
free+0.018
inky+0.017
etsy+0.017
 clean+0.016
Neg logit contributions
 islands-0.028
 cities-0.028
 obstacles-0.027
 oceans-0.026
 obstacle-0.026
Token,
Token ID11
Feature activation+8.49e-03

Pos logit contributions
 islands+0.025
 cities+0.024
 obstacles+0.024
 abrupt+0.023
 obstacle+0.023
Neg logit contributions
 okay-0.010
free-0.009
inky-0.009
 clean-0.008
etsy-0.008
Token the
Token ID262
Feature activation+2.64e-03

Pos logit contributions
 cities+0.018
 islands+0.018
 oceans+0.017
 obstacles+0.017
 cliffs+0.016
Neg logit contributions
inky-0.010
 okay-0.009
free-0.009
etsy-0.009
uffy-0.008
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
 okay+0.031
free+0.029
inky+0.028
 clean+0.027
etsy+0.027
Neg logit contributions
 islands-0.056
 cities-0.055
 obstacles-0.053
 oceans-0.052
 abrupt-0.052
Token Even
Token ID3412
Feature activation+9.21e-03

Pos logit contributions
 okay+0.021
free+0.020
inky+0.020
etsy+0.018
 clean+0.018
Neg logit contributions
 islands-0.042
 cities-0.041
 obstacles-0.040
 oceans-0.039
 obstacle-0.038
Token though
Token ID996
Feature activation-9.37e-02

Pos logit contributions
 islands+0.018
 cities+0.018
 obstacles+0.017
 oceans+0.017
 abrupt+0.017
Neg logit contributions
 okay-0.011
free-0.011
inky-0.010
 clean-0.010
etsy-0.010
Token they
Token ID484
Feature activation-2.31e-02

Pos logit contributions
 okay+0.114
free+0.107
inky+0.106
etsy+0.099
 clean+0.098
Neg logit contributions
 islands-0.189
 cities-0.186
 obstacles-0.179
 oceans-0.175
 obstacle-0.174
Token were
Token ID547
Feature activation-3.50e-02

Pos logit contributions
 okay+0.024
free+0.022
inky+0.021
 clean+0.020
etsy+0.020
Neg logit contributions
 islands-0.051
 cities-0.050
 obstacles-0.049
 oceans-0.048
 abrupt-0.048
Token of
Token ID286
Feature activation-1.28e-01

Pos logit contributions
inky+0.041
free+0.039
etsy+0.039
he+0.038
uffy+0.038
Neg logit contributions
 islands-0.071
 obstacles-0.069
 cities-0.069
 obstacle-0.065
 oceans-0.065
Token different
Token ID1180
Feature activation-3.27e-02

Pos logit contributions
 okay+0.171
 clean+0.150
free+0.150
 warm+0.148
 safe+0.147
Neg logit contributions
 islands-0.241
 cities-0.239
 abrupt-0.230
 oceans-0.224
rob-0.223
Token species
Token ID4693
Feature activation-2.11e-03

Pos logit contributions
 okay+0.051
free+0.048
 clean+0.046
inky+0.045
 warm+0.045
Neg logit contributions
 islands-0.053
 cities-0.052
 abrupt-0.051
 obstacles-0.049
 oceans-0.049
Token,
Token ID11
Feature activation+2.84e-03

Pos logit contributions
 okay+0.002
free+0.002
inky+0.002
etsy+0.002
 clean+0.002
Neg logit contributions
 islands-0.005
 cities-0.005
 obstacles-0.004
 oceans-0.004
 obstacle-0.004
Token they
Token ID484
Feature activation-1.53e-02

Pos logit contributions
 islands+0.006
 cities+0.006
 obstacles+0.005
 oceans+0.005
 obstacle+0.005
Neg logit contributions
 okay-0.003
free-0.003
inky-0.003
etsy-0.003
 clean-0.003
Token shared
Token ID4888
Feature activation+1.91e-02

Pos logit contributions
 okay+0.018
free+0.017
inky+0.017
etsy+0.016
 clean+0.016
Neg logit contributions
 islands-0.031
 cities-0.030
 obstacles-0.029
 oceans-0.029
 obstacle-0.029
Token made
Token ID925
Feature activation-2.39e-02

Pos logit contributions
inky+0.047
 okay+0.047
free+0.046
etsy+0.043
uffy+0.041
Neg logit contributions
 islands-0.060
 cities-0.059
 obstacles-0.056
 oceans-0.054
 abrupt-0.054
Token fun
Token ID1257
Feature activation-3.34e-02

Pos logit contributions
inky+0.039
etsy+0.038
uffy+0.035
free+0.034
 okay+0.034
Neg logit contributions
 obstacles-0.037
 islands-0.036
 cities-0.036
 whales-0.035
 abrupt-0.033
Token of
Token ID286
Feature activation-1.05e-01

Pos logit contributions
 okay+0.040
inky+0.038
free+0.038
etsy+0.036
 clean+0.034
Neg logit contributions
 islands-0.068
 cities-0.067
 obstacles-0.064
 oceans-0.063
 obstacle-0.063
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
 okay+0.147
inky+0.145
free+0.143
etsy+0.138
uffy+0.129
Neg logit contributions
 islands-0.192
 cities-0.187
 obstacles-0.181
 oceans-0.174
 abrupt-0.174
Token m
Token ID285
Feature activation+1.87e-02

Pos logit contributions
inky+0.020
 okay+0.020
free+0.019
etsy+0.019
un+0.018
Neg logit contributions
 islands-0.030
 cities-0.030
 obstacles-0.029
 abrupt-0.028
 whales-0.028
Tokenittens
Token ID34978
Feature activation-1.27e-01

Pos logit contributions
 islands+0.040
 cities+0.040
rob+0.039
 whales+0.039
 abrupt+0.038
Neg logit contributions
 okay-0.017
 clean-0.015
free-0.015
 safe-0.014
etsy-0.014
Token she
Token ID673
Feature activation-8.59e-03

Pos logit contributions
etsy+0.167
inky+0.166
 okay+0.158
free+0.153
un+0.150
Neg logit contributions
 islands-0.229
 cities-0.223
 obstacles-0.218
 cliffs-0.213
 whales-0.212
Token was
Token ID373
Feature activation+5.91e-04

Pos logit contributions
 okay+0.011
free+0.010
 clean+0.010
inky+0.010
etsy+0.010
Neg logit contributions
 islands-0.017
 cities-0.017
 obstacles-0.016
 oceans-0.016
 obstacle-0.016
Token wearing
Token ID5762
Feature activation-2.60e-02

Pos logit contributions
 islands+0.001
 cities+0.001
 whales+0.001
 obstacles+0.001
 abrupt+0.001
Neg logit contributions
inky-0.001
free-0.001
etsy-0.001
isc-0.001
uffy-0.001
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
 okay+0.028
inky+0.028
free+0.027
etsy+0.026
uffy+0.025
Neg logit contributions
 islands-0.054
 cities-0.053
 obstacles-0.051
 oceans-0.050
 obstacle-0.048
Token They
Token ID1119
Feature activation-6.35e-03

Pos logit contributions
 okay+0.014
free+0.013
inky+0.012
less+0.012
 safe+0.012
Neg logit contributions
 cities-0.020
 islands-0.019
 oceans-0.019
 obstacles-0.018
 abrupt-0.018
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
free+0.012
 okay+0.012
inky+0.012
less+0.011
 clean+0.011
Neg logit contributions
 cities-0.021
 islands-0.021
 obstacles-0.020
 abrupt-0.020
 oceans-0.020
Token
Token ID198
Feature activation-1.94e-02

Pos logit contributions
 okay+0.019
free+0.019
inky+0.018
etsy+0.017
 clean+0.016
Neg logit contributions
 islands-0.037
 cities-0.036
 obstacles-0.035
 abrupt-0.034
 oceans-0.034
TokenAfter
Token ID3260
Feature activation-5.38e-02

Pos logit contributions
 okay+0.028
free+0.027
inky+0.027
uffy+0.027
etsy+0.025
Neg logit contributions
 islands-0.032
 obstacles-0.031
 cities-0.031
 abrupt-0.031
 obstacle-0.029
Token many
Token ID867
Feature activation-1.78e-02

Pos logit contributions
inky+0.059
 okay+0.057
free+0.055
etsy+0.055
uffy+0.053
Neg logit contributions
 islands-0.117
 cities-0.114
 obstacles-0.110
 oceans-0.108
 whales-0.106
Token hours
Token ID2250
Feature activation-5.03e-02

Pos logit contributions
 okay+0.025
 safe+0.023
 warm+0.023
 clean+0.023
free+0.021
Neg logit contributions
 abrupt-0.031
 islands-0.031
 cities-0.030
smith-0.028
 oceans-0.028
Token of
Token ID286
Feature activation-1.27e-01

Pos logit contributions
 okay+0.069
inky+0.067
free+0.065
uffy+0.060
etsy+0.060
Neg logit contributions
 islands-0.095
 cities-0.093
 obstacles-0.088
 oceans-0.087
 cliffs-0.085
Token running
Token ID2491
Feature activation-2.58e-02

Pos logit contributions
 okay+0.180
 warm+0.167
 clean+0.167
 safe+0.165
free+0.163
Neg logit contributions
 islands-0.222
 cities-0.217
 abrupt-0.213
 obstacle-0.205
 whales-0.203
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
 okay+0.026
free+0.024
inky+0.024
etsy+0.023
uffy+0.021
Neg logit contributions
 islands-0.058
 cities-0.057
 obstacles-0.055
 oceans-0.054
 obstacle-0.053
Token searching
Token ID10342
Feature activation-7.79e-03

Pos logit contributions
 okay+0.019
inky+0.019
free+0.019
etsy+0.018
uffy+0.017
Neg logit contributions
 islands-0.022
 cities-0.022
 obstacles-0.021
 oceans-0.020
 obstacle-0.020
Token,
Token ID11
Feature activation+4.54e-03

Pos logit contributions
 okay+0.006
 safe+0.006
free+0.006
 clean+0.005
 warm+0.005
Neg logit contributions
 islands-0.019
 abrupt-0.019
 cities-0.019
 obstacles-0.018
 obstacle-0.018
Token they
Token ID484
Feature activation-1.41e-02

Pos logit contributions
 islands+0.011
 cities+0.010
 obstacles+0.010
 oceans+0.010
 cliffs+0.010
Neg logit contributions
inky-0.004
 okay-0.004
free-0.004
etsy-0.004
uffy-0.003
Token lost
Token ID2626
Feature activation-3.12e-02

Pos logit contributions
 okay+0.005
free+0.005
inky+0.005
etsy+0.005
 clean+0.005
Neg logit contributions
 islands-0.006
 cities-0.006
 obstacles-0.006
 oceans-0.006
 obstacle-0.006
Token one
Token ID530
Feature activation+4.16e-02

Pos logit contributions
inky+0.030
 okay+0.030
free+0.029
etsy+0.027
uffy+0.025
Neg logit contributions
 islands-0.070
 cities-0.068
 obstacles-0.068
 oceans-0.066
 abrupt-0.066
Token of
Token ID286
Feature activation-9.00e-02

Pos logit contributions
 islands+0.078
 obstacle+0.076
 obstacles+0.075
 cities+0.075
 abrupt+0.073
Neg logit contributions
 okay-0.051
free-0.051
inky-0.050
etsy-0.047
uffy-0.043
Token her
Token ID607
Feature activation-6.52e-02

Pos logit contributions
 okay+0.091
free+0.091
inky+0.090
etsy+0.085
uffy+0.079
Neg logit contributions
 islands-0.197
 cities-0.192
 obstacles-0.190
 abrupt-0.184
 oceans-0.184
Token m
Token ID285
Feature activation+2.91e-02

Pos logit contributions
free+0.085
he+0.078
 okay+0.076
etsy+0.076
inky+0.073
Neg logit contributions
 islands-0.124
 obstacles-0.116
 cities-0.113
 whales-0.112
 heights-0.111
Tokenittens
Token ID34978
Feature activation-1.26e-01

Pos logit contributions
 islands+0.070
 cities+0.069
 obstacles+0.068
 whales+0.066
 abrupt+0.066
Neg logit contributions
 okay-0.021
free-0.020
 clean-0.018
inky-0.017
 safe-0.017
Token in
Token ID287
Feature activation+5.01e-02

Pos logit contributions
inky+0.172
etsy+0.172
free+0.168
 okay+0.165
uffy+0.153
Neg logit contributions
 islands-0.202
 cliffs-0.200
 cities-0.197
 obstacles-0.197
 oceans-0.190
Token the
Token ID262
Feature activation+1.98e-03

Pos logit contributions
 islands+0.096
 obstacles+0.095
 cities+0.095
 obstacle+0.090
 oceans+0.090
Neg logit contributions
 okay-0.066
inky-0.065
free-0.064
etsy-0.062
uffy-0.057
Token snow
Token ID6729
Feature activation-6.33e-02

Pos logit contributions
 islands+0.004
 cities+0.004
 obstacles+0.004
 obstacle+0.004
 oceans+0.004
Neg logit contributions
 okay-0.002
free-0.002
inky-0.002
 clean-0.002
 warm-0.002
Token.
Token ID13
Feature activation-9.13e-03

Pos logit contributions
 okay+0.080
free+0.075
etsy+0.074
inky+0.074
 clean+0.064
Neg logit contributions
 islands-0.125
 cities-0.120
 oceans-0.117
 obstacles-0.114
 abrupt-0.114
Token
Token ID220
Feature activation+2.07e-02

Pos logit contributions
free+0.014
less+0.013
icking+0.013
 okay+0.013
inky+0.013
Neg logit contributions
 cities-0.013
Which-0.013
 hears-0.012
Really-0.012
 oceans-0.012
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
 islands+0.046
 cities+0.046
 obstacles+0.045
 abrupt+0.044
 oceans+0.043
Neg logit contributions
free-0.029
 okay-0.028
 warm-0.027
etsy-0.027
inky-0.027
TokenFinally
Token ID11158
Feature activation-9.40e-03

Pos logit contributions
 okay+0.018
inky+0.018
free+0.017
uffy+0.016
etsy+0.016
Neg logit contributions
 islands-0.022
 cities-0.022
 obstacles-0.021
 abrupt-0.020
 oceans-0.020
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
inky+0.012
free+0.012
etsy+0.011
 okay+0.011
he+0.010
Neg logit contributions
 islands-0.019
 cities-0.018
 obstacles-0.018
 oceans-0.017
 obstacle-0.017
Token after
Token ID706
Feature activation-5.38e-02

Pos logit contributions
inky+0.024
etsy+0.022
free+0.022
 okay+0.022
uffy+0.021
Neg logit contributions
 islands-0.028
 cities-0.027
 obstacles-0.026
 oceans-0.026
 whales-0.025
Token hours
Token ID2250
Feature activation-4.76e-02

Pos logit contributions
 okay+0.057
inky+0.055
free+0.054
etsy+0.052
 clean+0.048
Neg logit contributions
 islands-0.118
 cities-0.115
 obstacles-0.112
 oceans-0.109
 obstacle-0.108
Token of
Token ID286
Feature activation-1.26e-01

Pos logit contributions
inky+0.070
 okay+0.068
free+0.066
etsy+0.060
uffy+0.059
Neg logit contributions
 islands-0.083
 cities-0.081
 obstacles-0.077
 cliffs-0.076
 oceans-0.075
Token swimming
Token ID14899
Feature activation-2.16e-02

Pos logit contributions
 okay+0.146
 clean+0.138
 warm+0.138
 safe+0.132
free+0.123
Neg logit contributions
 islands-0.256
 cities-0.253
 abrupt-0.249
 MacDonald-0.240
smith-0.240
Token,
Token ID11
Feature activation-3.78e-03

Pos logit contributions
 okay+0.018
free+0.017
 clean+0.016
inky+0.016
 warm+0.015
Neg logit contributions
 islands-0.052
 cities-0.051
 obstacles-0.050
 abrupt-0.049
 oceans-0.048
Token the
Token ID262
Feature activation-2.20e-03

Pos logit contributions
inky+0.005
etsy+0.005
 okay+0.005
free+0.005
uffy+0.005
Neg logit contributions
 islands-0.007
 cities-0.007
 obstacles-0.007
 oceans-0.007
 cliffs-0.007
Token bug
Token ID5434
Feature activation+5.95e-03

Pos logit contributions
 okay+0.003
inky+0.003
free+0.003
etsy+0.003
uffy+0.003
Neg logit contributions
 islands-0.004
 cities-0.004
 obstacles-0.004
 oceans-0.004
 obstacle-0.004
Token reached
Token ID4251
Feature activation+2.85e-02

Pos logit contributions
 islands+0.011
 cities+0.011
 obstacles+0.011
 oceans+0.010
 abrupt+0.010
Neg logit contributions
 okay-0.008
free-0.007
inky-0.007
 clean-0.007
etsy-0.007
Token Finally
Token ID9461
Feature activation-7.78e-03

Pos logit contributions
 okay+0.005
free+0.005
inky+0.005
 clean+0.005
 warm+0.005
Neg logit contributions
 islands-0.008
 cities-0.008
 obstacles-0.008
 oceans-0.008
 abrupt-0.008
Token,
Token ID11
Feature activation-1.21e-02

Pos logit contributions
free+0.010
inky+0.009
 okay+0.009
etsy+0.009
he+0.008
Neg logit contributions
 islands-0.015
 cities-0.015
 obstacles-0.014
 oceans-0.014
 whales-0.014
Token after
Token ID706
Feature activation-5.12e-02

Pos logit contributions
inky+0.018
free+0.017
 okay+0.017
etsy+0.016
uffy+0.016
Neg logit contributions
 islands-0.021
 cities-0.020
 oceans-0.019
 obstacles-0.019
 whales-0.019
Token many
Token ID867
Feature activation-1.23e-02

Pos logit contributions
 okay+0.054
free+0.051
inky+0.051
etsy+0.048
 clean+0.046
Neg logit contributions
 islands-0.112
 cities-0.109
 obstacles-0.106
 oceans-0.103
 obstacle-0.103
Token hours
Token ID2250
Feature activation-5.12e-02

Pos logit contributions
 okay+0.020
 clean+0.018
 safe+0.018
 warm+0.018
 little+0.017
Neg logit contributions
 islands-0.019
 cities-0.018
 abrupt-0.018
 oceans-0.017
smith-0.017
Token of
Token ID286
Feature activation-1.26e-01

Pos logit contributions
 okay+0.069
inky+0.066
free+0.065
etsy+0.058
uffy+0.058
Neg logit contributions
 islands-0.099
 cities-0.096
 obstacles-0.091
 oceans-0.091
 cliffs-0.088
Token searching
Token ID10342
Feature activation-2.28e-02

Pos logit contributions
 okay+0.170
 clean+0.162
 safe+0.161
 warm+0.161
free+0.151
Neg logit contributions
 islands-0.227
 cities-0.221
 abrupt-0.213
 whales-0.210
 obstacle-0.208
Token,
Token ID11
Feature activation-4.82e-05

Pos logit contributions
 okay+0.016
 safe+0.014
free+0.014
 clean+0.014
 warm+0.013
Neg logit contributions
 islands-0.057
 cities-0.056
 abrupt-0.056
 obstacles-0.055
 obstacle-0.055
Token they
Token ID484
Feature activation-2.06e-02

Pos logit contributions
inky+0.000
 okay+0.000
free+0.000
etsy+0.000
uffy+0.000
Neg logit contributions
 islands-0.000
 cities-0.000
 obstacles-0.000
 oceans-0.000
 whales-0.000
Token found
Token ID1043
Feature activation-2.92e-02

Pos logit contributions
 okay+0.028
free+0.027
inky+0.026
etsy+0.025
 clean+0.025
Neg logit contributions
 islands-0.039
 cities-0.038
 obstacles-0.036
 oceans-0.035
 obstacle-0.035
Token it
Token ID340
Feature activation-2.16e-02

Pos logit contributions
 okay+0.034
free+0.032
inky+0.032
etsy+0.030
 clean+0.029
Neg logit contributions
 islands-0.061
 cities-0.059
 obstacles-0.058
 oceans-0.056
 obstacle-0.056
Token Todd
Token ID14377
Feature activation+1.40e-02

Pos logit contributions
 islands+0.040
 cities+0.039
 obstacles+0.039
 obstacle+0.039
 abrupt+0.038
Neg logit contributions
 okay-0.018
inky-0.016
free-0.015
 warm-0.015
 clean-0.014
Token took
Token ID1718
Feature activation+7.76e-03

Pos logit contributions
 islands+0.027
 cities+0.026
 obstacles+0.025
 obstacle+0.025
 oceans+0.024
Neg logit contributions
 okay-0.019
inky-0.018
free-0.017
etsy-0.017
 clean-0.017
Token off
Token ID572
Feature activation-1.44e-02

Pos logit contributions
 islands+0.016
 cities+0.015
 obstacles+0.015
 oceans+0.015
 whales+0.014
Neg logit contributions
 okay-0.009
inky-0.009
free-0.009
etsy-0.009
uffy-0.008
Token the
Token ID262
Feature activation-7.86e-03

Pos logit contributions
inky+0.022
 okay+0.022
free+0.022
etsy+0.021
uffy+0.021
Neg logit contributions
 islands-0.024
 cities-0.024
 oceans-0.022
 obstacles-0.022
 whales-0.022
Token m
Token ID285
Feature activation+2.76e-02

Pos logit contributions
 okay+0.012
inky+0.012
free+0.012
etsy+0.012
he+0.011
Neg logit contributions
 islands-0.012
 cities-0.012
 obstacles-0.012
 abrupt-0.012
 obstacle-0.012
Tokenittens
Token ID34978
Feature activation-1.25e-01

Pos logit contributions
 islands+0.061
 cities+0.060
 obstacles+0.058
 abrupt+0.057
 obstacle+0.056
Neg logit contributions
 okay-0.028
free-0.026
inky-0.024
 clean-0.024
etsy-0.023
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 okay+0.208
free+0.208
etsy+0.200
inky+0.199
uffy+0.195
Neg logit contributions
 islands-0.180
 cities-0.173
 obstacles-0.161
 oceans-0.160
 cliffs-0.158
Token put
Token ID1234
Feature activation-6.61e-03

Pos logit contributions
 islands+0.022
 cities+0.022
 obstacles+0.021
 obstacle+0.021
 oceans+0.021
Neg logit contributions
 okay-0.015
free-0.014
inky-0.014
etsy-0.013
 clean-0.013
Token them
Token ID606
Feature activation-5.46e-02

Pos logit contributions
 okay+0.010
inky+0.010
free+0.009
etsy+0.009
uffy+0.009
Neg logit contributions
 islands-0.011
 cities-0.011
 obstacles-0.011
 oceans-0.010
 abrupt-0.010
Token under
Token ID739
Feature activation+2.13e-02

Pos logit contributions
inky+0.074
free+0.071
 okay+0.070
etsy+0.070
uffy+0.066
Neg logit contributions
 islands-0.102
 cities-0.099
 oceans-0.094
 obstacles-0.092
 cliffs-0.086
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
 islands+0.048
 cities+0.047
 obstacles+0.046
 obstacle+0.045
 abrupt+0.045
Neg logit contributions
 okay-0.020
free-0.019
inky-0.018
 clean-0.018
 warm-0.017
Token dropped
Token ID5710
Feature activation-4.31e-02

Pos logit contributions
 okay+0.035
inky+0.033
free+0.033
etsy+0.031
he+0.029
Neg logit contributions
 islands-0.062
 cities-0.060
 obstacles-0.059
 abrupt-0.057
 oceans-0.057
Token one
Token ID530
Feature activation+4.01e-02

Pos logit contributions
 okay+0.047
inky+0.045
free+0.045
etsy+0.042
uffy+0.040
Neg logit contributions
 islands-0.092
 cities-0.090
 obstacles-0.087
 oceans-0.085
 abrupt-0.084
Token of
Token ID286
Feature activation-8.58e-02

Pos logit contributions
 islands+0.080
 cities+0.077
 obstacles+0.076
 obstacle+0.076
 abrupt+0.075
Neg logit contributions
 okay-0.047
free-0.045
inky-0.044
etsy-0.043
uffy-0.039
Token her
Token ID607
Feature activation-5.73e-02

Pos logit contributions
 okay+0.109
inky+0.107
free+0.105
etsy+0.101
uffy+0.097
Neg logit contributions
 islands-0.167
 cities-0.161
 obstacles-0.158
 whales-0.154
 oceans-0.153
Token m
Token ID285
Feature activation+3.53e-02

Pos logit contributions
free+0.072
 okay+0.072
inky+0.069
etsy+0.068
he+0.065
Neg logit contributions
 islands-0.109
 cities-0.104
 obstacles-0.102
 oceans-0.099
 whales-0.099
Tokenittens
Token ID34978
Feature activation-1.24e-01

Pos logit contributions
 islands+0.094
 cities+0.092
 obstacles+0.090
 oceans+0.088
 abrupt+0.088
Neg logit contributions
 okay-0.020
free-0.017
inky-0.016
etsy-0.015
 clean-0.014
Token in
Token ID287
Feature activation+5.54e-02

Pos logit contributions
 okay+0.171
etsy+0.170
free+0.169
inky+0.166
uffy+0.155
Neg logit contributions
 islands-0.199
 obstacles-0.192
 cities-0.191
 cliffs-0.189
 abrupt-0.186
Token the
Token ID262
Feature activation+1.01e-02

Pos logit contributions
 islands+0.101
 obstacles+0.100
 cities+0.100
 abrupt+0.097
 hears+0.096
Neg logit contributions
 okay-0.076
inky-0.074
free-0.071
etsy-0.070
uffy-0.067
Token snow
Token ID6729
Feature activation-6.15e-02

Pos logit contributions
 islands+0.020
 cities+0.020
 obstacles+0.019
 oceans+0.019
 obstacle+0.018
Neg logit contributions
 okay-0.013
free-0.012
inky-0.012
etsy-0.011
 clean-0.011
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 okay+0.083
free+0.076
etsy+0.076
inky+0.074
 clean+0.067
Neg logit contributions
 islands-0.114
 cities-0.108
 oceans-0.106
 abrupt-0.106
rob-0.104
Token
Token ID220
Feature activation+1.88e-02

Pos logit contributions
less+0.022
icking+0.021
 okay+0.021
free+0.021
inky+0.020
Neg logit contributions
 cities-0.022
 oceans-0.021
Which-0.021
 hears-0.021
Does-0.021
Token�
Token ID241
Feature activation+1.15e-02

Pos logit contributions
 islands+0.039
 cities+0.038
 obstacles+0.037
 abrupt+0.036
 obstacle+0.035
Neg logit contributions
 okay-0.051
free-0.050
etsy-0.048
inky-0.048
uffy-0.047
TokenNow
Token ID3844
Feature activation-9.00e-03

Pos logit contributions
 islands+0.020
 cities+0.019
 obstacles+0.019
 abrupt+0.018
 oceans+0.018
Neg logit contributions
 okay-0.017
inky-0.017
free-0.016
etsy-0.015
uffy-0.015
Token I
Token ID314
Feature activation+1.37e-02

Pos logit contributions
inky+0.011
free+0.010
etsy+0.010
 okay+0.009
he+0.009
Neg logit contributions
 islands-0.017
 cities-0.017
 obstacles-0.017
 abrupt-0.016
 obstacle-0.016
Token have
Token ID423
Feature activation-2.55e-02

Pos logit contributions
 islands+0.029
 cities+0.028
 obstacles+0.027
 oceans+0.027
 obstacle+0.026
Neg logit contributions
 okay-0.016
free-0.015
inky-0.015
 clean-0.014
etsy-0.014
Token m
Token ID285
Feature activation+3.61e-02

Pos logit contributions
 okay+0.028
free+0.025
inky+0.024
 clean+0.023
etsy+0.023
Neg logit contributions
 islands-0.056
 cities-0.055
 obstacles-0.053
 obstacle-0.052
 abrupt-0.052
Tokenittens
Token ID34978
Feature activation-1.24e-01

Pos logit contributions
 islands+0.092
 cities+0.090
 obstacles+0.088
 oceans+0.086
 obstacle+0.085
Neg logit contributions
 okay-0.024
free-0.022
inky-0.021
etsy-0.019
 clean-0.019
Token too
Token ID1165
Feature activation-2.00e-02

Pos logit contributions
 okay+0.166
inky+0.162
free+0.160
etsy+0.154
 clean+0.146
Neg logit contributions
 islands-0.225
 cities-0.220
 obstacles-0.212
 oceans-0.205
 obstacle-0.204
Token.
Token ID13
Feature activation-4.77e-03

Pos logit contributions
 okay+0.024
free+0.023
inky+0.023
 clean+0.021
etsy+0.021
Neg logit contributions
 islands-0.041
 cities-0.040
 obstacles-0.039
 oceans-0.038
 obstacle-0.038
Token We
Token ID775
Feature activation-1.30e-03

Pos logit contributions
 okay+0.007
free+0.006
inky+0.006
etsy+0.006
 clean+0.006
Neg logit contributions
 islands-0.009
 cities-0.008
 obstacles-0.008
 oceans-0.008
 abrupt-0.008
Token should
Token ID815
Feature activation+8.95e-03

Pos logit contributions
 okay+0.002
inky+0.002
free+0.002
etsy+0.002
uffy+0.002
Neg logit contributions
 islands-0.002
 cities-0.002
 obstacles-0.002
 obstacle-0.002
 oceans-0.002
Token have
Token ID423
Feature activation-7.90e-04

Pos logit contributions
 islands+0.018
 cities+0.017
 obstacles+0.017
 abrupt+0.016
 whales+0.016
Neg logit contributions
free-0.010
 okay-0.010
inky-0.010
etsy-0.009
uffy-0.009
Token inside
Token ID2641
Feature activation-2.71e-02

Pos logit contributions
 islands+0.018
 cities+0.018
 obstacles+0.017
 oceans+0.017
 abrupt+0.016
Neg logit contributions
 okay-0.009
inky-0.009
free-0.009
etsy-0.008
uffy-0.007
Token to
Token ID284
Feature activation+2.66e-03

Pos logit contributions
inky+0.038
 okay+0.037
free+0.036
etsy+0.035
uffy+0.033
Neg logit contributions
 islands-0.049
 cities-0.048
 obstacles-0.046
 abrupt-0.045
 oceans-0.045
Token get
Token ID651
Feature activation-3.44e-02

Pos logit contributions
 islands+0.005
 cities+0.005
 abrupt+0.005
 obstacles+0.005
 hears+0.005
Neg logit contributions
inky-0.003
free-0.003
etsy-0.003
 okay-0.003
uffy-0.003
Token her
Token ID607
Feature activation-5.79e-02

Pos logit contributions
inky+0.050
etsy+0.045
uffy+0.044
he+0.042
free+0.040
Neg logit contributions
 islands-0.060
 cities-0.057
 obstacles-0.057
 oceans-0.056
 whales-0.056
Token m
Token ID285
Feature activation+3.71e-02

Pos logit contributions
free+0.096
etsy+0.095
he+0.094
inky+0.091
 okay+0.090
Neg logit contributions
 islands-0.081
 whales-0.076
 cities-0.073
 obstacles-0.073
 hears-0.071
Tokenittens
Token ID34978
Feature activation-1.24e-01

Pos logit contributions
 islands+0.086
 cities+0.085
 obstacles+0.083
 abrupt+0.082
 whales+0.082
Neg logit contributions
 okay-0.028
free-0.026
 clean-0.024
etsy-0.023
inky-0.023
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
etsy+0.184
inky+0.180
free+0.175
un+0.163
uffy+0.163
Neg logit contributions
 cliffs-0.165
 islands-0.165
 obstacles-0.157
 cities-0.154
 hears-0.153
Token but
Token ID475
Feature activation+1.58e-02

Pos logit contributions
 islands+0.020
 cities+0.020
 obstacles+0.019
 oceans+0.019
 abrupt+0.019
Neg logit contributions
inky-0.017
free-0.017
 okay-0.016
etsy-0.016
uffy-0.016
Token she
Token ID673
Feature activation-1.04e-03

Pos logit contributions
 islands+0.027
 cities+0.027
 obstacles+0.026
 abrupt+0.025
 oceans+0.025
Neg logit contributions
inky-0.024
free-0.022
un-0.020
etsy-0.020
he-0.019
Token couldn
Token ID3521
Feature activation-3.29e-02

Pos logit contributions
free+0.002
inky+0.001
 okay+0.001
etsy+0.001
un+0.001
Neg logit contributions
 islands-0.002
 hears-0.002
 obstacles-0.002
 abrupt-0.002
 cities-0.002
Token't
Token ID470
Feature activation-5.08e-02

Pos logit contributions
 okay+0.029
free+0.027
etsy+0.026
inky+0.026
less+0.023
Neg logit contributions
 abrupt-0.071
 obstacles-0.070
 cities-0.070
 islands-0.069
 obstacle-0.069
Token today
Token ID1909
Feature activation-1.49e-02

Pos logit contributions
 okay+0.030
free+0.028
 clean+0.027
 warm+0.027
inky+0.026
Neg logit contributions
 islands-0.055
 cities-0.054
 obstacles-0.052
 obstacle-0.051
 oceans-0.050
Token was
Token ID373
Feature activation+3.03e-03

Pos logit contributions
inky+0.020
 okay+0.020
free+0.020
etsy+0.019
uffy+0.018
Neg logit contributions
 islands-0.027
 cities-0.027
 obstacles-0.026
 oceans-0.025
 obstacle-0.025
Token his
Token ID465
Feature activation-4.30e-02

Pos logit contributions
 islands+0.007
 cities+0.007
 obstacles+0.006
 oceans+0.006
 obstacle+0.006
Neg logit contributions
 okay-0.003
free-0.003
inky-0.003
etsy-0.003
 clean-0.003
Token first
Token ID717
Feature activation+2.76e-02

Pos logit contributions
 okay+0.039
free+0.038
inky+0.036
etsy+0.034
 clean+0.032
Neg logit contributions
 islands-0.099
 cities-0.097
 obstacles-0.094
 oceans-0.092
 abrupt-0.092
Token day
Token ID1110
Feature activation-2.23e-02

Pos logit contributions
 islands+0.065
 cities+0.064
 obstacles+0.062
 oceans+0.060
 abrupt+0.060
Neg logit contributions
 okay-0.025
free-0.022
inky-0.022
 clean-0.021
etsy-0.020
Token of
Token ID286
Feature activation-1.23e-01

Pos logit contributions
 okay+0.024
inky+0.024
free+0.024
etsy+0.022
uffy+0.021
Neg logit contributions
 islands-0.047
 cities-0.046
 obstacles-0.045
 oceans-0.043
 obstacle-0.043
Token school
Token ID1524
Feature activation-6.46e-03

Pos logit contributions
 okay+0.111
 clean+0.108
 warm+0.107
 safe+0.096
 dry+0.087
Neg logit contributions
 cities-0.276
 islands-0.274
 obstacle-0.272
 obstacles-0.269
 abrupt-0.265
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
 okay+0.005
free+0.005
 clean+0.005
inky+0.004
 warm+0.004
Neg logit contributions
 islands-0.016
 cities-0.015
 obstacles-0.015
 obstacle-0.015
 oceans-0.015
Token He
Token ID679
Feature activation+2.35e-02

Pos logit contributions
 okay+0.019
free+0.018
inky+0.018
 clean+0.017
etsy+0.017
Neg logit contributions
 islands-0.036
 cities-0.035
 obstacles-0.033
 obstacle-0.033
 oceans-0.033
Token had
Token ID550
Feature activation-1.09e-02

Pos logit contributions
 islands+0.055
 cities+0.053
 heights+0.053
 obstacle+0.052
 obstacles+0.052
Neg logit contributions
 clean-0.022
 warm-0.019
 okay-0.019
he-0.017
free-0.017
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
 okay+0.010
 clean+0.009
 warm+0.009
free+0.008
 safe+0.008
Neg logit contributions
 islands-0.025
 obstacle-0.025
 cities-0.025
 obstacles-0.024
 cliffs-0.023
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
 islands+0.144
 cities+0.141
 obstacles+0.137
 oceans+0.134
 obstacle+0.133
Neg logit contributions
 okay-0.060
free-0.056
inky-0.055
etsy-0.051
 clean-0.049
Token And
Token ID843
Feature activation-2.86e-02

Pos logit contributions
 okay+0.025
inky+0.025
free+0.023
etsy+0.023
uffy+0.022
Neg logit contributions
 cities-0.042
 islands-0.040
 abrupt-0.040
 obstacles-0.039
 hears-0.039
Token whenever
Token ID8797
Feature activation-6.55e-02

Pos logit contributions
inky+0.035
free+0.032
etsy+0.032
uffy+0.032
 okay+0.030
Neg logit contributions
 islands-0.056
 cities-0.055
 obstacles-0.053
 oceans-0.052
 obstacle-0.051
Token it
Token ID340
Feature activation-4.73e-02

Pos logit contributions
inky+0.074
free+0.063
etsy+0.062
un+0.059
he+0.056
Neg logit contributions
 islands-0.144
 cities-0.144
 obstacles-0.137
 oceans-0.135
 hears-0.133
Token r
Token ID374
Feature activation-5.72e-02

Pos logit contributions
inky+0.069
 okay+0.069
free+0.068
etsy+0.066
un+0.060
Neg logit contributions
 islands-0.081
 cities-0.080
 obstacles-0.077
 hears-0.076
 oceans-0.076
Tokenained
Token ID1328
Feature activation-1.23e-01

Pos logit contributions
 okay+0.064
free+0.064
 warm+0.062
 safe+0.061
 clean+0.059
Neg logit contributions
 islands-0.114
 cities-0.113
 oceans-0.108
 obstacles-0.107
 cliffs-0.106
Token,
Token ID11
Feature activation-1.18e-02

Pos logit contributions
 okay+0.116
free+0.108
inky+0.107
etsy+0.099
 clean+0.096
Neg logit contributions
 islands-0.282
 cities-0.277
 obstacles-0.269
 oceans-0.262
 obstacle-0.261
Token she
Token ID673
Feature activation-6.91e-03

Pos logit contributions
inky+0.014
etsy+0.011
un+0.011
free+0.011
isc+0.011
Neg logit contributions
 cities-0.025
 islands-0.025
 obstacles-0.025
 oceans-0.024
 obstacle-0.023
Token made
Token ID925
Feature activation-5.45e-02

Pos logit contributions
 okay+0.008
 clean+0.008
 warm+0.007
 safe+0.006
uffy+0.006
Neg logit contributions
 cities-0.015
 islands-0.015
 obstacles-0.014
 obstacle-0.014
 oceans-0.014
Token sure
Token ID1654
Feature activation-2.65e-02

Pos logit contributions
inky+0.075
etsy+0.073
free+0.071
uffy+0.069
he+0.066
Neg logit contributions
 islands-0.102
 cities-0.100
 obstacles-0.098
 whales-0.096
 abrupt-0.095
Token to
Token ID284
Feature activation+5.86e-03

Pos logit contributions
inky+0.037
free+0.035
 okay+0.035
etsy+0.033
uffy+0.032
Neg logit contributions
 islands-0.049
 cities-0.048
 obstacles-0.046
 oceans-0.044
 abrupt-0.044
Token went
Token ID1816
Feature activation-1.79e-02

Pos logit contributions
 okay+0.004
free+0.004
inky+0.004
etsy+0.004
 clean+0.004
Neg logit contributions
 islands-0.005
 cities-0.005
 obstacles-0.005
 oceans-0.005
 obstacle-0.005
Token outside
Token ID2354
Feature activation-6.31e-02

Pos logit contributions
 okay+0.020
free+0.018
inky+0.018
etsy+0.017
 clean+0.017
Neg logit contributions
 islands-0.038
 cities-0.038
 obstacles-0.036
 oceans-0.035
 obstacle-0.035
Token without
Token ID1231
Feature activation-4.07e-03

Pos logit contributions
 okay+0.064
free+0.061
 clean+0.057
inky+0.056
 warm+0.055
Neg logit contributions
 islands-0.140
 cities-0.137
 obstacles-0.134
 obstacle-0.131
 oceans-0.130
Token her
Token ID607
Feature activation-5.69e-02

Pos logit contributions
inky+0.005
etsy+0.004
free+0.004
he+0.004
uffy+0.004
Neg logit contributions
 islands-0.009
 cities-0.009
 obstacles-0.009
 heights-0.008
 oceans-0.008
Token m
Token ID285
Feature activation+3.33e-02

Pos logit contributions
etsy+0.077
free+0.077
inky+0.074
he+0.073
 okay+0.069
Neg logit contributions
 islands-0.108
 obstacles-0.100
 cities-0.100
 whales-0.099
 heights-0.098
Tokenittens
Token ID34978
Feature activation-1.23e-01

Pos logit contributions
 islands+0.074
 cities+0.073
 obstacles+0.072
 heights+0.071
 abrupt+0.070
Neg logit contributions
 okay-0.029
free-0.029
etsy-0.026
inky-0.025
 clean-0.025
Token on
Token ID319
Feature activation+8.74e-03

Pos logit contributions
etsy+0.175
inky+0.172
free+0.158
uffy+0.149
 okay+0.149
Neg logit contributions
 cliffs-0.195
 islands-0.193
 obstacles-0.192
 hears-0.188
 cities-0.188
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 islands+0.019
 cities+0.019
 obstacles+0.018
 oceans+0.018
 abrupt+0.017
Neg logit contributions
 okay-0.009
inky-0.009
free-0.009
etsy-0.009
uffy-0.008
Token Her
Token ID2332
Feature activation-5.32e-02

Pos logit contributions
free+0.018
less+0.017
 okay+0.017
icking+0.017
inky+0.017
Neg logit contributions
Which-0.017
 hears-0.017
 cities-0.017
 oceans-0.016
Does-0.015
Token mom
Token ID1995
Feature activation+1.59e-02

Pos logit contributions
free+0.092
he+0.090
etsy+0.086
inky+0.078
 Betsy+0.074
Neg logit contributions
 islands-0.075
 hears-0.072
 obstacles-0.072
 heights-0.070
 whales-0.069
Tokenmy
Token ID1820
Feature activation-1.11e-02

Pos logit contributions
 hears+0.026
 cities+0.026
 islands+0.025
 abrupt+0.025
 whales+0.024
Neg logit contributions
inky-0.024
free-0.023
 okay-0.022
etsy-0.021
uffy-0.020
Token the
Token ID262
Feature activation+6.25e-04

Pos logit contributions
inky+0.009
etsy+0.008
 okay+0.008
free+0.008
uffy+0.007
Neg logit contributions
 cities-0.017
 islands-0.017
 obstacles-0.016
 oceans-0.016
 obstacle-0.015
Token day
Token ID1110
Feature activation-2.26e-02

Pos logit contributions
 cities+0.001
 islands+0.001
 obstacles+0.001
 oceans+0.001
 obstacle+0.001
Neg logit contributions
inky-0.001
etsy-0.001
free-0.001
 okay-0.001
un-0.001
Token was
Token ID373
Feature activation-1.12e-02

Pos logit contributions
inky+0.037
free+0.034
etsy+0.033
 okay+0.032
uffy+0.029
Neg logit contributions
 obstacles-0.036
 cities-0.036
 islands-0.035
 hears-0.035
 oceans-0.033
Token still
Token ID991
Feature activation-4.19e-02

Pos logit contributions
inky+0.015
free+0.013
etsy+0.013
uffy+0.012
isc+0.012
Neg logit contributions
 islands-0.021
 cities-0.021
 obstacles-0.020
 oceans-0.020
 abrupt-0.020
Token lots
Token ID6041
Feature activation+1.25e-03

Pos logit contributions
inky+0.049
free+0.045
etsy+0.044
 okay+0.041
uffy+0.039
Neg logit contributions
 islands-0.088
 cities-0.085
 obstacles-0.084
 abrupt-0.082
 oceans-0.082
Token of
Token ID286
Feature activation-1.23e-01

Pos logit contributions
 islands+0.003
 cities+0.003
 obstacles+0.003
 obstacle+0.003
 oceans+0.002
Neg logit contributions
inky-0.001
uffy-0.001
etsy-0.001
 okay-0.001
free-0.001
Token fun
Token ID1257
Feature activation-6.02e-02

Pos logit contributions
 okay+0.128
inky+0.121
free+0.120
etsy+0.112
 clean+0.106
Neg logit contributions
 islands-0.270
 cities-0.265
 obstacles-0.257
 oceans-0.251
 obstacle-0.249
Token.
Token ID13
Feature activation-1.92e-02

Pos logit contributions
 okay+0.062
free+0.058
inky+0.058
etsy+0.054
 clean+0.052
Neg logit contributions
 islands-0.132
 cities-0.130
 obstacles-0.126
 oceans-0.123
 obstacle-0.122
Token She
Token ID1375
Feature activation-3.54e-03

Pos logit contributions
 okay+0.023
free+0.022
etsy+0.022
inky+0.022
 clean+0.020
Neg logit contributions
 cities-0.037
 islands-0.036
 hears-0.036
 obstacles-0.036
 oceans-0.035
Token could
Token ID714
Feature activation-4.99e-02

Pos logit contributions
 okay+0.004
 clean+0.004
 warm+0.004
 safe+0.004
etsy+0.003
Neg logit contributions
 islands-0.007
 cities-0.007
 obstacles-0.007
 obstacle-0.007
 oceans-0.007
Token not
Token ID407
Feature activation-5.30e-02

Pos logit contributions
free+0.066
inky+0.065
etsy+0.061
 okay+0.059
less+0.057
Neg logit contributions
 islands-0.092
 cities-0.089
 hears-0.084
 abrupt-0.083
 heights-0.083
Token old
Token ID1468
Feature activation+5.29e-03

Pos logit contributions
 okay+0.037
free+0.035
inky+0.033
etsy+0.033
uffy+0.031
Neg logit contributions
 islands-0.042
 cities-0.042
 obstacles-0.039
 abrupt-0.039
 oceans-0.039
Token and
Token ID290
Feature activation-6.61e-04

Pos logit contributions
 islands+0.011
 cities+0.010
 obstacles+0.010
 oceans+0.010
 abrupt+0.010
Neg logit contributions
 okay-0.006
free-0.006
inky-0.006
etsy-0.006
 clean-0.005
Token she
Token ID673
Feature activation+1.65e-02

Pos logit contributions
inky+0.001
etsy+0.001
uffy+0.001
 okay+0.001
icking+0.001
Neg logit contributions
 cities-0.001
 abrupt-0.001
 islands-0.001
 hears-0.001
 whales-0.001
Token loved
Token ID6151
Feature activation-2.53e-02

Pos logit contributions
 hears+0.021
 islands+0.021
 cities+0.020
 obstacles+0.020
 abrupt+0.019
Neg logit contributions
 okay-0.028
inky-0.027
etsy-0.026
less-0.025
icking-0.025
Token m
Token ID285
Feature activation+3.84e-02

Pos logit contributions
 okay+0.026
free+0.025
 clean+0.024
inky+0.023
 warm+0.023
Neg logit contributions
 islands-0.055
 cities-0.054
 obstacles-0.052
 obstacle-0.052
 abrupt-0.051
Tokenittens
Token ID34978
Feature activation-1.22e-01

Pos logit contributions
 cities+0.071
 islands+0.070
 whales+0.069
 heights+0.068
 cliffs+0.067
Neg logit contributions
 okay-0.047
free-0.043
 safe-0.042
 clean-0.041
 Midnight-0.041
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
etsy+0.167
inky+0.165
 okay+0.161
he+0.150
free+0.149
Neg logit contributions
 islands-0.202
 whales-0.197
 cities-0.196
 cliffs-0.190
 obstacles-0.190
Token One
Token ID1881
Feature activation+3.02e-02

Pos logit contributions
 okay+0.011
 Her+0.009
inky+0.009
 clean+0.008
free+0.008
Neg logit contributions
 islands-0.039
 obstacles-0.038
 cities-0.038
 obstacle-0.037
 oceans-0.036
Token day
Token ID1110
Feature activation+2.53e-03

Pos logit contributions
 whales+0.038
 abrupt+0.036
 hears+0.035
 islands+0.035
 whale+0.035
Neg logit contributions
inky-0.053
etsy-0.049
 okay-0.047
free-0.046
 bleed-0.045
Token she
Token ID673
Feature activation-5.77e-03

Pos logit contributions
 islands+0.006
 cities+0.006
 obstacles+0.006
 oceans+0.005
 obstacle+0.005
Neg logit contributions
 okay-0.002
free-0.002
inky-0.002
etsy-0.002
 clean-0.002
Token was
Token ID373
Feature activation-1.18e-02

Pos logit contributions
 okay+0.007
free+0.007
inky+0.007
etsy+0.007
 clean+0.007
Neg logit contributions
 islands-0.011
 cities-0.011
 obstacles-0.011
 oceans-0.010
 obstacle-0.010
Token dropped
Token ID5710
Feature activation-4.43e-02

Pos logit contributions
 okay+0.050
free+0.048
inky+0.047
etsy+0.045
 clean+0.043
Neg logit contributions
 islands-0.067
 cities-0.065
 obstacles-0.063
 abrupt-0.061
 oceans-0.061
Token one
Token ID530
Feature activation+4.15e-02

Pos logit contributions
 okay+0.048
free+0.044
 clean+0.042
inky+0.042
 warm+0.041
Neg logit contributions
 islands-0.097
 cities-0.096
 obstacles-0.092
 obstacle-0.090
 oceans-0.090
Token of
Token ID286
Feature activation-8.79e-02

Pos logit contributions
 islands+0.084
 cities+0.081
 obstacles+0.080
 obstacle+0.079
 abrupt+0.078
Neg logit contributions
 okay-0.048
free-0.047
inky-0.045
etsy-0.045
uffy-0.041
Token his
Token ID465
Feature activation-4.08e-02

Pos logit contributions
 okay+0.104
free+0.100
inky+0.100
etsy+0.094
uffy+0.090
Neg logit contributions
 islands-0.181
 cities-0.176
 obstacles-0.171
 oceans-0.166
 abrupt-0.166
Token m
Token ID285
Feature activation+3.43e-02

Pos logit contributions
 okay+0.053
free+0.051
inky+0.050
etsy+0.048
 clean+0.046
Neg logit contributions
 islands-0.078
 cities-0.076
 obstacles-0.074
 oceans-0.072
 obstacle-0.071
Tokenittens
Token ID34978
Feature activation-1.22e-01

Pos logit contributions
 islands+0.087
 cities+0.085
 obstacles+0.083
 oceans+0.081
 abrupt+0.081
Neg logit contributions
 okay-0.024
free-0.022
inky-0.020
etsy-0.019
 clean-0.019
Token on
Token ID319
Feature activation+1.68e-02

Pos logit contributions
free+0.168
 okay+0.168
etsy+0.166
inky+0.160
un+0.151
Neg logit contributions
 islands-0.202
 cities-0.192
 obstacles-0.192
 cliffs-0.190
 hears-0.187
Token the
Token ID262
Feature activation+4.86e-03

Pos logit contributions
 islands+0.034
 cities+0.033
 obstacles+0.032
 oceans+0.032
 abrupt+0.031
Neg logit contributions
 okay-0.021
free-0.020
inky-0.019
etsy-0.018
uffy-0.018
Token way
Token ID835
Feature activation-7.83e-03

Pos logit contributions
 islands+0.009
 cities+0.008
 obstacles+0.008
 oceans+0.008
 abrupt+0.008
Neg logit contributions
 okay-0.007
free-0.007
inky-0.007
etsy-0.007
uffy-0.006
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
inky+0.009
 okay+0.009
free+0.009
etsy+0.009
uffy+0.009
Neg logit contributions
 islands-0.015
 cities-0.015
 abrupt-0.014
 obstacles-0.014
 oceans-0.014
Token The
Token ID383
Feature activation-4.63e-03

Pos logit contributions
 okay+0.024
free+0.024
inky+0.023
uffy+0.023
etsy+0.022
Neg logit contributions
 islands-0.034
 cities-0.034
 obstacles-0.034
 oceans-0.033
 abrupt-0.033
Token
Token ID198
Feature activation-7.71e-03

Pos logit contributions
 okay+0.006
free+0.006
inky+0.006
etsy+0.005
 clean+0.005
Neg logit contributions
 islands-0.012
 cities-0.012
 obstacles-0.011
 obstacle-0.011
 oceans-0.011
TokenFinally
Token ID11158
Feature activation-4.57e-03

Pos logit contributions
 okay+0.009
free+0.009
inky+0.009
etsy+0.008
 clean+0.008
Neg logit contributions
 islands-0.016
 cities-0.015
 obstacles-0.015
 oceans-0.014
 obstacle-0.014
Token,
Token ID11
Feature activation-8.55e-03

Pos logit contributions
 okay+0.005
inky+0.005
free+0.004
etsy+0.004
uffy+0.004
Neg logit contributions
 islands-0.010
 cities-0.010
 obstacles-0.010
 oceans-0.009
 obstacle-0.009
Token after
Token ID706
Feature activation-4.56e-02

Pos logit contributions
 okay+0.011
inky+0.010
free+0.010
etsy+0.010
uffy+0.009
Neg logit contributions
 islands-0.017
 cities-0.017
 obstacles-0.016
 oceans-0.016
 obstacle-0.016
Token hours
Token ID2250
Feature activation-4.48e-02

Pos logit contributions
 okay+0.043
free+0.040
inky+0.039
 clean+0.036
 warm+0.035
Neg logit contributions
 islands-0.104
 cities-0.102
 obstacles-0.099
 abrupt-0.097
 obstacle-0.097
Token of
Token ID286
Feature activation-1.22e-01

Pos logit contributions
 okay+0.061
inky+0.061
free+0.057
etsy+0.055
uffy+0.054
Neg logit contributions
 islands-0.085
 cities-0.082
 obstacles-0.078
 oceans-0.077
 whales-0.075
Token waiting
Token ID4953
Feature activation-7.59e-02

Pos logit contributions
 okay+0.139
 clean+0.129
 warm+0.128
free+0.125
 safe+0.121
Neg logit contributions
 islands-0.254
 abrupt-0.247
 cities-0.247
 obstacle-0.237
rob-0.235
Token,
Token ID11
Feature activation-1.03e-03

Pos logit contributions
 okay+0.081
free+0.074
 clean+0.070
inky+0.068
 warm+0.067
Neg logit contributions
 islands-0.167
 cities-0.163
 obstacles-0.159
 obstacle-0.156
 abrupt-0.155
Token it
Token ID340
Feature activation-2.42e-02

Pos logit contributions
 okay+0.001
free+0.001
inky+0.001
etsy+0.001
 clean+0.001
Neg logit contributions
 islands-0.002
 cities-0.002
 obstacles-0.002
 oceans-0.002
 obstacle-0.002
Token happened
Token ID3022
Feature activation-3.06e-02

Pos logit contributions
 okay+0.029
free+0.028
inky+0.027
 clean+0.026
etsy+0.025
Neg logit contributions
 islands-0.049
 cities-0.048
 obstacles-0.046
 oceans-0.045
 obstacle-0.045
Token.
Token ID13
Feature activation-8.93e-03

Pos logit contributions
 okay+0.025
free+0.021
 clean+0.018
 safe+0.018
 warm+0.017
Neg logit contributions
 islands-0.078
 cities-0.077
 obstacle-0.075
 obstacles-0.074
rob-0.073
Token to
Token ID284
Feature activation+1.34e-02

Pos logit contributions
 islands+0.027
 cities+0.026
 obstacles+0.025
 oceans+0.025
 obstacle+0.025
Neg logit contributions
 okay-0.015
free-0.014
inky-0.014
etsy-0.013
 clean-0.013
Token do
Token ID466
Feature activation+1.65e-02

Pos logit contributions
 islands+0.031
 cities+0.030
 obstacles+0.029
 oceans+0.029
 obstacle+0.028
Neg logit contributions
 okay-0.013
 clean-0.012
free-0.011
 warm-0.011
inky-0.011
Token.
Token ID13
Feature activation-8.46e-03

Pos logit contributions
 islands+0.040
 cities+0.039
 obstacles+0.038
 obstacle+0.038
 oceans+0.038
Neg logit contributions
 okay-0.014
free-0.013
 clean-0.012
 safe-0.011
 warm-0.011
Token Lucy
Token ID22162
Feature activation-3.47e-03

Pos logit contributions
 okay+0.009
free+0.009
inky+0.008
 clean+0.008
etsy+0.008
Neg logit contributions
 islands-0.018
 cities-0.018
 obstacles-0.017
 oceans-0.017
 abrupt-0.017
Token said
Token ID531
Feature activation+5.04e-04

Pos logit contributions
 okay+0.005
free+0.004
inky+0.004
etsy+0.004
 clean+0.004
Neg logit contributions
 islands-0.007
 cities-0.006
 obstacles-0.006
 oceans-0.006
 obstacle-0.006
Token yes
Token ID3763
Feature activation+9.09e-02

Pos logit contributions
 islands+0.001
 cities+0.001
 obstacles+0.001
 oceans+0.001
 obstacle+0.001
Neg logit contributions
 okay-0.001
inky-0.001
free-0.001
etsy-0.000
uffy-0.000
Token and
Token ID290
Feature activation-2.49e-03

Pos logit contributions
 islands+0.145
 cities+0.145
 obstacles+0.142
 abrupt+0.139
 obstacle+0.135
Neg logit contributions
free-0.131
etsy-0.124
inky-0.124
uffy-0.121
 okay-0.121
Token followed
Token ID3940
Feature activation+4.91e-02

Pos logit contributions
inky+0.003
 okay+0.003
free+0.003
etsy+0.003
uffy+0.003
Neg logit contributions
 islands-0.005
 cities-0.005
 obstacles-0.004
 whales-0.004
 oceans-0.004
Token Jack
Token ID3619
Feature activation-1.09e-03

Pos logit contributions
 islands+0.078
 cities+0.078
 obstacles+0.075
 heights+0.073
 oceans+0.073
Neg logit contributions
 okay-0.075
inky-0.075
etsy-0.072
free-0.072
uffy-0.072
Token.
Token ID13
Feature activation-4.40e-03

Pos logit contributions
 okay+0.002
inky+0.002
etsy+0.001
uffy+0.001
free+0.001
Neg logit contributions
 islands-0.002
 cities-0.002
 obstacles-0.002
 oceans-0.002
 hears-0.002
Token
Token ID198
Feature activation-3.14e-03

Pos logit contributions
 okay+0.005
free+0.005
inky+0.005
 clean+0.005
 safe+0.005
Neg logit contributions
 cities-0.008
 islands-0.008
 oceans-0.008
 obstacles-0.008
 abrupt-0.008
Token
Token ID198
Feature activation-2.04e-03

Pos logit contributions
 islands+0.004
 cities+0.004
 obstacles+0.004
 oceans+0.004
 abrupt+0.004
Neg logit contributions
 okay-0.002
free-0.002
inky-0.002
etsy-0.001
 clean-0.001
TokenThe
Token ID464
Feature activation+2.06e-02

Pos logit contributions
 okay+0.002
free+0.002
inky+0.002
etsy+0.002
 clean+0.002
Neg logit contributions
 islands-0.004
 cities-0.004
 obstacles-0.004
 oceans-0.004
 abrupt-0.004
Token white
Token ID2330
Feature activation-1.63e-02

Pos logit contributions
 islands+0.036
 cities+0.035
 obstacles+0.033
 abrupt+0.033
 whales+0.033
Neg logit contributions
etsy-0.030
inky-0.029
free-0.029
 okay-0.029
uffy-0.027
Token twin
Token ID15203
Feature activation-1.36e-02

Pos logit contributions
 okay+0.023
free+0.022
etsy+0.021
inky+0.020
 safe+0.019
Neg logit contributions
 islands-0.030
 cities-0.029
 whales-0.028
 oceans-0.027
 obstacles-0.027
Token said
Token ID531
Feature activation+1.36e-02

Pos logit contributions
 okay+0.019
free+0.018
inky+0.018
etsy+0.017
 clean+0.016
Neg logit contributions
 islands-0.025
 cities-0.024
 obstacles-0.023
 oceans-0.023
 obstacle-0.023
Token "
Token ID366
Feature activation+8.90e-02

Pos logit contributions
 islands+0.031
 cities+0.030
 obstacles+0.029
 oceans+0.028
 obstacle+0.028
Neg logit contributions
 okay-0.013
free-0.012
inky-0.012
etsy-0.011
 clean-0.011
TokenSure
Token ID19457
Feature activation+9.54e-02

Pos logit contributions
 islands+0.171
 cities+0.167
 obstacles+0.160
 obstacle+0.157
 abrupt+0.154
Neg logit contributions
free-0.121
 okay-0.120
inky-0.114
etsy-0.112
 clean-0.107
Token!
Token ID0
Feature activation+4.49e-02

Pos logit contributions
 islands+0.188
 cities+0.184
 obstacles+0.178
 oceans+0.173
 obstacle+0.172
Neg logit contributions
 okay-0.121
free-0.115
inky-0.114
etsy-0.108
 clean-0.106
Token That
Token ID1320
Feature activation+4.66e-02

Pos logit contributions
 islands+0.070
 cities+0.069
 obstacles+0.064
 oceans+0.064
 abrupt+0.062
Neg logit contributions
 okay-0.073
free-0.070
inky-0.070
 clean-0.067
etsy-0.066
Token sounds
Token ID5238
Feature activation+3.31e-02

Pos logit contributions
 islands+0.087
 cities+0.086
 obstacles+0.082
 oceans+0.080
 abrupt+0.080
Neg logit contributions
 okay-0.066
 warm-0.061
free-0.060
 clean-0.060
inky-0.059
Token like
Token ID588
Feature activation-1.09e-02

Pos logit contributions
 islands+0.079
 cities+0.078
 obstacles+0.075
 abrupt+0.074
 oceans+0.074
Neg logit contributions
 okay-0.030
free-0.027
 clean-0.025
inky-0.025
 warm-0.024
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 islands+0.016
 cities+0.016
 obstacles+0.015
 obstacle+0.015
rob+0.015
Neg logit contributions
 okay-0.008
 warm-0.007
 clean-0.007
free-0.007
inky-0.007
Token saw
Token ID2497
Feature activation-9.44e-03

Pos logit contributions
 islands+0.021
 cities+0.021
 obstacles+0.020
 obstacle+0.020
 abrupt+0.020
Neg logit contributions
 okay-0.015
free-0.014
inky-0.013
 clean-0.013
 warm-0.013
Token a
Token ID257
Feature activation+3.16e-03

Pos logit contributions
 okay+0.009
free+0.008
inky+0.008
 clean+0.007
etsy+0.007
Neg logit contributions
 islands-0.022
 cities-0.022
 obstacles-0.021
 obstacle-0.020
 oceans-0.020
Token sleepy
Token ID42023
Feature activation-1.04e-02

Pos logit contributions
 islands+0.007
 cities+0.007
 obstacles+0.007
 oceans+0.007
 abrupt+0.006
Neg logit contributions
 okay-0.003
inky-0.003
free-0.003
 clean-0.003
 warm-0.003
Token rabbit
Token ID22746
Feature activation+1.85e-02

Pos logit contributions
 okay+0.015
free+0.015
inky+0.014
etsy+0.013
he+0.013
Neg logit contributions
 islands-0.018
 cities-0.018
 obstacles-0.017
 obstacle-0.017
 oceans-0.017
Token pe
Token ID613
Feature activation+8.30e-02

Pos logit contributions
 islands+0.035
 cities+0.033
 obstacles+0.032
 oceans+0.031
 abrupt+0.031
Neg logit contributions
 okay-0.025
inky-0.024
free-0.024
etsy-0.023
uffy-0.022
Tokeneking
Token ID18754
Feature activation-1.37e-02

Pos logit contributions
 islands+0.131
 cities+0.128
 obstacles+0.122
 oceans+0.118
 obstacle+0.117
Neg logit contributions
 okay-0.138
free-0.132
inky-0.131
etsy-0.125
 clean-0.124
Token down
Token ID866
Feature activation+3.53e-02

Pos logit contributions
inky+0.017
uffy+0.017
etsy+0.017
 okay+0.017
free+0.016
Neg logit contributions
 islands-0.026
 cities-0.026
 obstacles-0.025
 oceans-0.025
 obstacle-0.024
Token from
Token ID422
Feature activation+2.74e-02

Pos logit contributions
 islands+0.081
 cities+0.080
 obstacles+0.078
 obstacle+0.076
 oceans+0.075
Neg logit contributions
 okay-0.033
free-0.032
inky-0.029
 clean-0.028
 warm-0.028
Token the
Token ID262
Feature activation+2.23e-02

Pos logit contributions
 cities+0.064
 islands+0.063
rob+0.062
 obstacles+0.061
 feats+0.061
Neg logit contributions
 warm-0.023
 safe-0.022
 okay-0.022
 clean-0.020
free-0.020
Token tree
Token ID5509
Feature activation+5.12e-02

Pos logit contributions
 islands+0.036
 cities+0.035
 obstacles+0.034
 oceans+0.033
 obstacle+0.032
Neg logit contributions
 okay-0.036
free-0.034
inky-0.034
etsy-0.033
 clean-0.032
Token asks
Token ID7893
Feature activation+4.07e-02

Pos logit contributions
 okay+0.014
 clean+0.013
 Her+0.013
 warm+0.012
 Lily+0.012
Neg logit contributions
 islands-0.011
 cities-0.010
 obstacles-0.010
rob-0.009
 oceans-0.009
Token.
Token ID13
Feature activation+8.64e-03

Pos logit contributions
 islands+0.082
 cities+0.079
 obstacles+0.077
 whales+0.075
 oceans+0.074
Neg logit contributions
inky-0.048
free-0.046
etsy-0.045
 okay-0.045
he-0.042
Token
Token ID198
Feature activation-3.01e-03

Pos logit contributions
 islands+0.022
 cities+0.021
 obstacles+0.021
 obstacle+0.021
 oceans+0.020
Neg logit contributions
 okay-0.007
free-0.006
inky-0.006
etsy-0.005
 clean-0.005
Token
Token ID198
Feature activation-7.25e-03

Pos logit contributions
free+0.003
 okay+0.003
inky+0.003
etsy+0.003
less+0.003
Neg logit contributions
 islands-0.006
 cities-0.006
 obstacles-0.006
 abrupt-0.006
 oceans-0.006
Token"
Token ID1
Feature activation+2.66e-02

Pos logit contributions
 okay+0.012
free+0.011
inky+0.011
etsy+0.011
he+0.010
Neg logit contributions
 islands-0.012
 cities-0.012
 obstacles-0.011
 oceans-0.011
 abrupt-0.011
TokenOK
Token ID11380
Feature activation+8.60e-02

Pos logit contributions
 islands+0.052
 cities+0.051
 obstacle+0.048
 obstacles+0.047
 oceans+0.047
Neg logit contributions
 okay-0.038
free-0.036
etsy-0.034
 clean-0.034
 warm-0.034
Token,
Token ID11
Feature activation+2.88e-02

Pos logit contributions
 islands+0.192
 cities+0.188
 obstacles+0.182
 oceans+0.178
 obstacle+0.177
Neg logit contributions
 okay-0.086
free-0.082
inky-0.080
etsy-0.075
 clean-0.072
Token let
Token ID1309
Feature activation-1.21e-03

Pos logit contributions
 islands+0.054
 cities+0.053
 obstacles+0.051
 oceans+0.049
 obstacle+0.049
Neg logit contributions
 okay-0.038
free-0.038
inky-0.038
etsy-0.036
uffy-0.034
Token's
Token ID338
Feature activation-1.59e-03

Pos logit contributions
free+0.002
etsy+0.002
inky+0.002
 okay+0.002
less+0.002
Neg logit contributions
 islands-0.002
 cities-0.002
 hears-0.002
 obstacles-0.002
 obstacle-0.002
Token play
Token ID711
Feature activation-4.30e-02

Pos logit contributions
free+0.002
etsy+0.002
inky+0.002
less+0.002
uffy+0.002
Neg logit contributions
 islands-0.003
 cities-0.003
 abrupt-0.002
 obstacles-0.002
 hears-0.002
Token inside
Token ID2641
Feature activation-1.24e-02

Pos logit contributions
 okay+0.047
free+0.041
 clean+0.041
inky+0.039
 warm+0.039
Neg logit contributions
 islands-0.095
 cities-0.095
 obstacles-0.092
 oceans-0.090
 abrupt-0.088
Token pig
Token ID12967
Feature activation-3.80e-02

Pos logit contributions
 okay+0.004
inky+0.003
free+0.003
 clean+0.003
etsy+0.003
Neg logit contributions
 islands-0.006
 cities-0.006
 obstacles-0.006
 oceans-0.006
 abrupt-0.006
Token.
Token ID13
Feature activation-9.07e-03

Pos logit contributions
 okay+0.062
free+0.055
etsy+0.054
inky+0.053
he+0.053
Neg logit contributions
 islands-0.062
 whales-0.058
 cities-0.057
 oceans-0.054
 abrupt-0.053
Token He
Token ID679
Feature activation+3.51e-02

Pos logit contributions
 okay+0.011
inky+0.010
etsy+0.010
uffy+0.010
free+0.010
Neg logit contributions
 islands-0.019
 cities-0.018
 oceans-0.018
 obstacles-0.017
 abrupt-0.017
Token wanted
Token ID2227
Feature activation-1.35e-02

Pos logit contributions
 islands+0.077
 cities+0.076
 obstacles+0.074
 obstacle+0.073
 abrupt+0.072
Neg logit contributions
 okay-0.036
free-0.035
 clean-0.033
inky-0.033
 warm-0.033
Token to
Token ID284
Feature activation+1.19e-02

Pos logit contributions
 okay+0.013
 clean+0.012
free+0.012
 warm+0.012
 safe+0.011
Neg logit contributions
 islands-0.030
 cities-0.030
 obstacles-0.029
 obstacle-0.029
 abrupt-0.029
Token succeed
Token ID6758
Feature activation+1.10e-01

Pos logit contributions
 islands+0.022
 cities+0.021
 obstacles+0.020
 abrupt+0.020
 whales+0.020
Neg logit contributions
 okay-0.016
free-0.016
inky-0.016
etsy-0.015
uffy-0.015
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
 cities+0.300
 abrupt+0.300
 islands+0.298
 obstacle+0.293
smith+0.288
Neg logit contributions
 okay-0.076
 clean-0.055
 warm-0.054
 safe-0.053
free-0.053
Token but
Token ID475
Feature activation+2.70e-02

Pos logit contributions
 islands+0.042
 cities+0.041
 obstacles+0.040
 obstacle+0.039
 abrupt+0.039
Neg logit contributions
 okay-0.019
 clean-0.016
free-0.016
 warm-0.016
 safe-0.015
Token he
Token ID339
Feature activation+2.34e-02

Pos logit contributions
 islands+0.050
 cities+0.049
 obstacles+0.047
 oceans+0.046
 obstacle+0.045
Neg logit contributions
 okay-0.037
inky-0.037
free-0.036
etsy-0.035
uffy-0.033
Token was
Token ID373
Feature activation-3.13e-04

Pos logit contributions
 islands+0.050
 cities+0.049
 obstacles+0.047
 oceans+0.046
 obstacle+0.046
Neg logit contributions
 okay-0.027
inky-0.025
free-0.025
etsy-0.023
 clean-0.023
Token too
Token ID1165
Feature activation-4.70e-02

Pos logit contributions
 okay+0.000
free+0.000
inky+0.000
etsy+0.000
 clean+0.000
Neg logit contributions
 islands-0.001
 cities-0.001
 obstacles-0.001
 oceans-0.001
 obstacle-0.001
TokenThere
Token ID1858
Feature activation-6.81e-02

Pos logit contributions
 islands+0.092
 cities+0.090
 oceans+0.085
 obstacles+0.085
 abrupt+0.082
Neg logit contributions
 okay-0.065
inky-0.062
free-0.060
etsy-0.058
 clean-0.056
Token was
Token ID373
Feature activation-1.79e-02

Pos logit contributions
 okay+0.095
free+0.094
inky+0.093
etsy+0.086
 clean+0.083
Neg logit contributions
 islands-0.123
 cities-0.119
 obstacles-0.113
 oceans-0.112
 abrupt-0.109
Token a
Token ID257
Feature activation-3.24e-02

Pos logit contributions
inky+0.023
etsy+0.022
free+0.022
uffy+0.021
 okay+0.020
Neg logit contributions
 islands-0.035
 cities-0.034
 oceans-0.033
 abrupt-0.032
 obstacles-0.032
Token big
Token ID1263
Feature activation-1.99e-03

Pos logit contributions
 okay+0.030
free+0.029
inky+0.028
etsy+0.026
 clean+0.024
Neg logit contributions
 islands-0.074
 cities-0.073
 obstacles-0.070
 obstacle-0.069
 oceans-0.069
Token tree
Token ID5509
Feature activation+2.15e-02

Pos logit contributions
 clean+0.002
 okay+0.002
 warm+0.002
 safe+0.002
inky+0.002
Neg logit contributions
 islands-0.004
 obstacles-0.004
 cities-0.004
 abrupt-0.004
 oceans-0.004
Token in
Token ID287
Feature activation+4.44e-02

Pos logit contributions
 islands+0.046
 cities+0.045
 obstacles+0.043
 oceans+0.043
 abrupt+0.043
Neg logit contributions
 okay-0.023
free-0.022
inky-0.021
etsy-0.020
uffy-0.019
Token the
Token ID262
Feature activation+4.54e-03

Pos logit contributions
 islands+0.099
 cities+0.097
 obstacles+0.094
 obstacle+0.092
 oceans+0.092
Neg logit contributions
 okay-0.044
free-0.041
inky-0.039
 clean-0.039
 warm-0.037
Token backyard
Token ID24296
Feature activation-3.20e-02

Pos logit contributions
 islands+0.007
 cities+0.007
 obstacles+0.006
 oceans+0.006
 obstacle+0.006
Neg logit contributions
 okay-0.008
free-0.008
inky-0.008
etsy-0.007
uffy-0.007
Token.
Token ID13
Feature activation-5.46e-03

Pos logit contributions
 okay+0.032
free+0.030
inky+0.029
etsy+0.027
 clean+0.026
Neg logit contributions
 islands-0.072
 cities-0.070
 obstacles-0.068
 oceans-0.066
 obstacle-0.066
Token Every
Token ID3887
Feature activation-6.58e-03

Pos logit contributions
 okay+0.007
inky+0.006
free+0.006
etsy+0.006
uffy+0.006
Neg logit contributions
 islands-0.011
 cities-0.011
 obstacles-0.010
 oceans-0.010
 abrupt-0.010
Token day
Token ID1110
Feature activation+4.66e-03

Pos logit contributions
 okay+0.003
free+0.002
 warm+0.002
 clean+0.002
inky+0.002
Neg logit contributions
 islands-0.019
 cities-0.018
 obstacles-0.018
 oceans-0.018
 abrupt-0.017
Token they
Token ID484
Feature activation+4.38e-02

Pos logit contributions
 islands+0.004
 cities+0.004
 whales+0.003
 oceans+0.003
 obstacles+0.003
Neg logit contributions
inky-0.003
he-0.003
free-0.003
etsy-0.003
un-0.003
Token still
Token ID991
Feature activation+2.08e-02

Pos logit contributions
 islands+0.083
 cities+0.082
 obstacles+0.079
 oceans+0.076
 abrupt+0.076
Neg logit contributions
 okay-0.057
inky-0.055
free-0.054
etsy-0.052
uffy-0.050
Token liked
Token ID8288
Feature activation+6.04e-03

Pos logit contributions
 islands+0.045
 cities+0.044
 obstacles+0.043
 oceans+0.042
 obstacle+0.042
Neg logit contributions
 okay-0.024
free-0.023
 clean-0.021
inky-0.020
 warm-0.020
Token to
Token ID284
Feature activation+3.18e-03

Pos logit contributions
 islands+0.013
 cities+0.013
 obstacles+0.012
 obstacle+0.012
 abrupt+0.012
Neg logit contributions
 okay-0.007
free-0.006
 clean-0.006
 warm-0.006
inky-0.006
Token play
Token ID711
Feature activation-4.49e-02

Pos logit contributions
 islands+0.007
 cities+0.006
 obstacles+0.006
 oceans+0.006
 abrupt+0.006
Neg logit contributions
 okay-0.004
free-0.003
inky-0.003
etsy-0.003
uffy-0.003
Token with
Token ID351
Feature activation+1.44e-02

Pos logit contributions
 okay+0.035
 clean+0.033
free+0.031
 warm+0.029
 safe+0.029
Neg logit contributions
 islands-0.112
 cities-0.112
 obstacles-0.110
 obstacle-0.107
 oceans-0.107
Token their
Token ID511
Feature activation-5.00e-02

Pos logit contributions
 islands+0.029
 cities+0.028
 obstacles+0.027
 oceans+0.027
 obstacle+0.027
Neg logit contributions
 okay-0.018
free-0.017
inky-0.016
etsy-0.016
 clean-0.015
Token toy
Token ID13373
Feature activation+6.12e-03

Pos logit contributions
 okay+0.058
free+0.055
inky+0.054
etsy+0.051
 clean+0.050
Neg logit contributions
 islands-0.104
 cities-0.102
 obstacles-0.098
 oceans-0.096
 obstacle-0.095
Token cars
Token ID5006
Feature activation-4.46e-02

Pos logit contributions
 obstacles+0.011
heit+0.011
 cities+0.011
 islands+0.011
 abrupt+0.011
Neg logit contributions
 clean-0.008
 okay-0.008
free-0.007
inky-0.007
 warm-0.007
Token.
Token ID13
Feature activation-1.97e-03

Pos logit contributions
 okay+0.053
free+0.050
inky+0.050
etsy+0.047
 clean+0.046
Neg logit contributions
 islands-0.091
 cities-0.089
 obstacles-0.086
 oceans-0.084
 obstacle-0.083
Token They
Token ID1119
Feature activation+4.28e-03

Pos logit contributions
free+0.002
inky+0.002
 okay+0.002
etsy+0.002
he+0.002
Neg logit contributions
 islands-0.004
 cities-0.004
 abrupt-0.004
 obstacles-0.004
 oceans-0.004
Token about
Token ID546
Feature activation-3.28e-02

Pos logit contributions
 islands+0.053
 cities+0.051
 obstacles+0.050
 whales+0.048
 obstacle+0.048
Neg logit contributions
 okay-0.028
inky-0.028
free-0.027
etsy-0.027
he-0.025
Token the
Token ID262
Feature activation+1.63e-03

Pos logit contributions
 okay+0.037
 clean+0.033
free+0.032
 warm+0.032
inky+0.030
Neg logit contributions
 islands-0.068
 cities-0.067
 obstacle-0.064
 oceans-0.064
 obstacles-0.064
Token tide
Token ID20013
Feature activation+1.17e-02

Pos logit contributions
rob+0.003
 cities+0.003
 islands+0.003
heit+0.003
 obstacles+0.002
Neg logit contributions
 okay-0.002
 clean-0.002
 warm-0.002
 rest-0.002
 whole-0.002
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
 islands+0.025
 cities+0.024
 obstacles+0.023
 oceans+0.023
 obstacle+0.023
Neg logit contributions
 okay-0.013
inky-0.012
free-0.012
etsy-0.012
un-0.011
Token She
Token ID1375
Feature activation+1.62e-02

Pos logit contributions
 okay+0.014
inky+0.014
free+0.014
etsy+0.013
 clean+0.013
Neg logit contributions
 islands-0.019
 cities-0.019
 obstacles-0.018
 abrupt-0.018
 oceans-0.018
Token decided
Token ID3066
Feature activation+1.26e-02

Pos logit contributions
 cities+0.039
 islands+0.038
 oceans+0.037
 obstacle+0.037
 heights+0.036
Neg logit contributions
 okay-0.015
 clean-0.014
 warm-0.013
 little-0.012
 Lily-0.012
Token to
Token ID284
Feature activation+1.41e-02

Pos logit contributions
 islands+0.028
 cities+0.028
 obstacles+0.027
 obstacle+0.026
 oceans+0.026
Neg logit contributions
 okay-0.013
free-0.012
inky-0.011
 clean-0.011
etsy-0.011
Token go
Token ID467
Feature activation-5.96e-04

Pos logit contributions
 islands+0.026
 cities+0.024
 obstacles+0.024
 abrupt+0.023
 obstacle+0.023
Neg logit contributions
free-0.019
 okay-0.018
inky-0.018
etsy-0.018
uffy-0.017
Token and
Token ID290
Feature activation-4.58e-03

Pos logit contributions
 okay+0.001
 clean+0.001
free+0.001
 warm+0.001
 safe+0.001
Neg logit contributions
 islands-0.001
 cities-0.001
 obstacles-0.001
 obstacle-0.001
 oceans-0.001
Token get
Token ID651
Feature activation-3.15e-02

Pos logit contributions
 okay+0.005
free+0.005
 clean+0.005
inky+0.005
 warm+0.005
Neg logit contributions
 islands-0.009
 cities-0.009
 obstacles-0.009
 oceans-0.009
 obstacle-0.009
Token help
Token ID1037
Feature activation+8.74e-03

Pos logit contributions
inky+0.043
etsy+0.040
 okay+0.040
free+0.039
uffy+0.038
Neg logit contributions
 islands-0.059
 cities-0.057
 obstacles-0.055
 oceans-0.054
 whales-0.053
Token "
Token ID366
Feature activation+7.38e-02

Pos logit contributions
 islands+0.071
 cities+0.069
 obstacles+0.067
 obstacle+0.066
 abrupt+0.065
Neg logit contributions
 okay-0.049
free-0.047
inky-0.045
etsy-0.043
 clean-0.043
TokenThank
Token ID10449
Feature activation+3.94e-02

Pos logit contributions
 islands+0.129
 cities+0.126
 obstacles+0.121
 oceans+0.119
 abrupt+0.116
Neg logit contributions
 okay-0.110
inky-0.106
free-0.103
etsy-0.098
 clean-0.098
Token you
Token ID345
Feature activation-2.52e-02

Pos logit contributions
 islands+0.081
 cities+0.079
 obstacles+0.077
 oceans+0.075
 obstacle+0.075
Neg logit contributions
 okay-0.047
free-0.044
inky-0.044
etsy-0.041
 clean-0.040
Token,
Token ID11
Feature activation+2.86e-02

Pos logit contributions
free+0.028
 okay+0.028
inky+0.028
etsy+0.027
he+0.026
Neg logit contributions
 islands-0.050
 cities-0.049
 obstacles-0.047
 obstacle-0.046
 whales-0.046
Token little
Token ID1310
Feature activation-1.55e-02

Pos logit contributions
 islands+0.056
 cities+0.055
 obstacles+0.053
 obstacle+0.052
 abrupt+0.051
Neg logit contributions
 okay-0.035
free-0.035
inky-0.034
etsy-0.033
uffy-0.031
Token bird
Token ID6512
Feature activation+2.48e-02

Pos logit contributions
 okay+0.019
free+0.017
inky+0.016
 warm+0.016
 clean+0.016
Neg logit contributions
 islands-0.033
 cities-0.032
 obstacles-0.031
 oceans-0.030
 abrupt-0.030
Token.
Token ID13
Feature activation+5.75e-03

Pos logit contributions
 islands+0.045
 cities+0.044
 obstacles+0.042
 abrupt+0.041
 obstacle+0.041
Neg logit contributions
 okay-0.034
inky-0.032
free-0.032
etsy-0.031
 clean-0.030
Token You
Token ID921
Feature activation+2.29e-02

Pos logit contributions
 islands+0.009
 cities+0.009
 obstacles+0.008
 oceans+0.008
 abrupt+0.008
Neg logit contributions
 okay-0.009
inky-0.009
free-0.009
 clean-0.008
he-0.008
Token are
Token ID389
Feature activation+3.48e-03

Pos logit contributions
 islands+0.038
 hears+0.037
 cities+0.037
 obstacles+0.035
 oceans+0.034
Neg logit contributions
etsy-0.033
inky-0.032
uffy-0.032
ittens-0.031
less-0.030
Token so
Token ID523
Feature activation-2.46e-02

Pos logit contributions
 islands+0.006
 cities+0.006
 oceans+0.005
 obstacles+0.005
 whales+0.005
Neg logit contributions
etsy-0.005
uffy-0.005
inky-0.005
ittens-0.005
isc-0.005
Token wise
Token ID10787
Feature activation+1.67e-02

Pos logit contributions
etsy+0.033
uffy+0.033
isc+0.033
ibi+0.033
inky+0.032
Neg logit contributions
 islands-0.037
 cities-0.037
 oceans-0.036
 whales-0.035
 abrupt-0.034
Token smiled
Token ID13541
Feature activation+4.67e-02

Pos logit contributions
 islands+0.016
 cities+0.015
 obstacles+0.015
 oceans+0.014
 whales+0.014
Neg logit contributions
inky-0.014
 okay-0.014
free-0.014
etsy-0.013
uffy-0.013
Token.
Token ID13
Feature activation-1.02e-03

Pos logit contributions
 islands+0.091
 cities+0.089
 oceans+0.085
 obstacles+0.085
 abrupt+0.083
Neg logit contributions
inky-0.056
etsy-0.054
uffy-0.054
 okay-0.052
free-0.052
Token "
Token ID366
Feature activation+5.17e-02

Pos logit contributions
 okay+0.001
free+0.001
inky+0.001
etsy+0.001
 clean+0.001
Neg logit contributions
 islands-0.002
 cities-0.002
 obstacles-0.002
 obstacle-0.002
 whales-0.002
TokenMe
Token ID5308
Feature activation+2.03e-02

Pos logit contributions
 islands+0.094
 cities+0.091
 obstacles+0.088
 oceans+0.085
 obstacle+0.085
Neg logit contributions
 okay-0.073
free-0.070
inky-0.070
etsy-0.066
 clean-0.065
Token too
Token ID1165
Feature activation-7.82e-03

Pos logit contributions
 islands+0.040
 cities+0.039
 obstacles+0.038
 oceans+0.037
 obstacle+0.037
Neg logit contributions
 okay-0.026
free-0.025
inky-0.024
etsy-0.023
 clean-0.023
Token!"
Token ID2474
Feature activation+2.36e-02

Pos logit contributions
 okay+0.011
free+0.010
inky+0.010
etsy+0.010
 clean+0.010
Neg logit contributions
 islands-0.014
 cities-0.014
 obstacles-0.013
 oceans-0.013
 obstacle-0.013
Token he
Token ID339
Feature activation+3.15e-02

Pos logit contributions
 islands+0.044
 cities+0.044
 obstacles+0.042
 oceans+0.041
 abrupt+0.040
Neg logit contributions
 okay-0.031
inky-0.031
free-0.030
etsy-0.029
uffy-0.028
Token said
Token ID531
Feature activation+4.53e-03

Pos logit contributions
 islands+0.059
 cities+0.058
 obstacles+0.057
 whales+0.054
 abrupt+0.054
Neg logit contributions
free-0.039
 okay-0.039
inky-0.038
etsy-0.035
uffy-0.035
Token,
Token ID11
Feature activation+3.50e-02

Pos logit contributions
 cities+0.008
 islands+0.008
 obstacles+0.007
 seconds+0.007
 oceans+0.007
Neg logit contributions
inky-0.006
uffy-0.006
etsy-0.006
 okay-0.005
free-0.005
Token happy
Token ID3772
Feature activation-2.23e-02

Pos logit contributions
 islands+0.063
 cities+0.062
 obstacles+0.059
 oceans+0.058
 abrupt+0.057
Neg logit contributions
inky-0.047
 okay-0.047
free-0.045
uffy-0.045
etsy-0.044
Token to
Token ID284
Feature activation+1.67e-02

Pos logit contributions
 okay+0.026
inky+0.025
free+0.025
etsy+0.024
uffy+0.023
Neg logit contributions
 islands-0.046
 cities-0.045
 obstacles-0.043
 oceans-0.042
 obstacle-0.042
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
inky+0.006
etsy+0.005
uffy+0.005
free+0.005
isc+0.005
Neg logit contributions
 islands-0.008
 cities-0.007
 abrupt-0.007
 oceans-0.007
 hears-0.007
Token time
Token ID640
Feature activation-4.35e-02

Pos logit contributions
free+0.018
 okay+0.017
inky+0.016
etsy+0.015
un+0.014
Neg logit contributions
 islands-0.051
 cities-0.050
 obstacles-0.049
 obstacle-0.048
 abrupt-0.048
Token,
Token ID11
Feature activation-3.52e-03

Pos logit contributions
free+0.056
inky+0.054
 okay+0.051
etsy+0.050
 clean+0.049
Neg logit contributions
 islands-0.079
 cities-0.079
 obstacles-0.074
 oceans-0.073
 hears-0.073
Token there
Token ID612
Feature activation-4.39e-02

Pos logit contributions
isc+0.006
inky+0.005
ibi+0.005
etsy+0.005
age+0.005
Neg logit contributions
 islands-0.005
 hears-0.005
 cities-0.005
 oceans-0.005
 obstacles-0.005
Token was
Token ID373
Feature activation+4.44e-03

Pos logit contributions
inky+0.075
free+0.074
 okay+0.073
etsy+0.071
un+0.064
Neg logit contributions
 islands-0.064
 cities-0.061
 hears-0.060
 obstacles-0.059
 abrupt-0.056
Token a
Token ID257
Feature activation-2.00e-02

Pos logit contributions
 islands+0.007
 oceans+0.006
 whales+0.006
 cities+0.006
 abrupt+0.006
Neg logit contributions
inky-0.008
etsy-0.008
uffy-0.007
less-0.007
free-0.007
Token boy
Token ID2933
Feature activation+1.09e-02

Pos logit contributions
 okay+0.015
free+0.015
inky+0.014
etsy+0.013
he+0.012
Neg logit contributions
 islands-0.048
 cities-0.047
 obstacles-0.045
 abrupt-0.045
 obstacle-0.045
Token named
Token ID3706
Feature activation-6.13e-02

Pos logit contributions
 islands+0.018
 cities+0.017
 abrupt+0.016
 whales+0.016
 obstacles+0.016
Neg logit contributions
 okay-0.017
inky-0.016
free-0.016
etsy-0.016
uffy-0.015
Token Tim
Token ID5045
Feature activation-2.24e-02

Pos logit contributions
age+0.087
etsy+0.082
free+0.080
aged+0.080
inky+0.077
Neg logit contributions
 islands-0.107
 whales-0.104
 sharks-0.102
 motorcycles-0.101
 cities-0.100
Tokenmy
Token ID1820
Feature activation-2.88e-02

Pos logit contributions
inky+0.025
 okay+0.025
etsy+0.024
free+0.024
uffy+0.022
Neg logit contributions
 islands-0.046
 cities-0.044
 abrupt-0.044
 hears-0.043
 obstacles-0.043
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
 okay+0.038
inky+0.036
etsy+0.036
free+0.033
uffy+0.033
Neg logit contributions
 islands-0.054
 cities-0.052
 abrupt-0.051
 obstacles-0.050
 whales-0.050
Token ocean
Token ID9151
Feature activation-7.32e-04

Pos logit contributions
 islands+0.027
 cities+0.026
 obstacles+0.025
 oceans+0.025
 obstacle+0.024
Neg logit contributions
inky-0.022
 okay-0.021
free-0.021
etsy-0.021
he-0.020
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
 okay+0.001
inky+0.001
free+0.001
etsy+0.001
un+0.001
Neg logit contributions
 islands-0.001
 cities-0.001
 obstacles-0.001
 oceans-0.001
 whales-0.001
Token
Token ID198
Feature activation-2.00e-02

Pos logit contributions
 okay+0.018
free+0.017
inky+0.016
etsy+0.015
 clean+0.015
Neg logit contributions
 islands-0.028
 cities-0.027
 obstacles-0.026
 hears-0.026
 oceans-0.026
Token
Token ID198
Feature activation-2.18e-02

Pos logit contributions
 okay+0.024
free+0.023
inky+0.022
etsy+0.021
less+0.020
Neg logit contributions
 islands-0.040
 cities-0.039
 hears-0.037
 obstacles-0.037
 whales-0.037
TokenM
Token ID44
Feature activation+2.76e-02

Pos logit contributions
 okay+0.033
inky+0.031
free+0.031
etsy+0.030
uffy+0.028
Neg logit contributions
 islands-0.038
 cities-0.037
 obstacles-0.036
 oceans-0.035
 abrupt-0.034
Tokenia
Token ID544
Feature activation+4.95e-04

Pos logit contributions
 islands+0.038
 cities+0.037
 obstacle+0.036
 obstacles+0.036
 hears+0.036
Neg logit contributions
free-0.051
 okay-0.048
etsy-0.046
 Betsy-0.046
 safe-0.045
Token picked
Token ID6497
Feature activation-1.74e-02

Pos logit contributions
 islands+0.001
 cities+0.001
 obstacles+0.001
 obstacle+0.001
 oceans+0.001
Neg logit contributions
 okay-0.001
 clean-0.001
inky-0.000
free-0.000
etsy-0.000
Token up
Token ID510
Feature activation+1.27e-02

Pos logit contributions
 okay+0.020
free+0.019
inky+0.019
etsy+0.018
uffy+0.017
Neg logit contributions
 islands-0.035
 cities-0.035
 obstacles-0.034
 oceans-0.033
 obstacle-0.033
Token the
Token ID262
Feature activation+5.37e-03

Pos logit contributions
 islands+0.024
 cities+0.024
 obstacles+0.023
 oceans+0.022
 whales+0.022
Neg logit contributions
inky-0.017
free-0.017
 okay-0.017
etsy-0.016
uffy-0.015
Token blue
Token ID4171
Feature activation-1.55e-02

Pos logit contributions
 islands+0.007
 acts+0.007
 oceans+0.007
 cities+0.007
 obstacles+0.007
Neg logit contributions
 okay-0.009
etsy-0.008
free-0.008
he-0.008
inky-0.008
Token shampoo
Token ID47259
Feature activation-2.50e-02

Pos logit contributions
 okay+0.020
free+0.018
inky+0.017
etsy+0.017
 safe+0.016
Neg logit contributions
 islands-0.031
 cities-0.030
 obstacles-0.029
 oceans-0.029
 obstacle-0.028
TokenI
Token ID40
Feature activation+3.57e-02

Pos logit contributions
 islands+0.117
 cities+0.113
 obstacles+0.109
 obstacle+0.104
 oceans+0.104
Neg logit contributions
 okay-0.125
free-0.120
inky-0.119
etsy-0.115
 clean-0.113
Token'm
Token ID1101
Feature activation+9.22e-03

Pos logit contributions
 islands+0.062
 cities+0.059
 hears+0.059
 obstacles+0.058
 whales+0.056
Neg logit contributions
inky-0.049
 okay-0.049
free-0.048
etsy-0.046
uffy-0.044
Token here
Token ID994
Feature activation-5.46e-02

Pos logit contributions
 cities+0.015
 whales+0.015
 islands+0.015
 oceans+0.014
 abrupt+0.014
Neg logit contributions
inky-0.014
etsy-0.014
free-0.013
uffy-0.013
he-0.013
Token to
Token ID284
Feature activation+7.77e-03

Pos logit contributions
 okay+0.063
free+0.059
inky+0.057
etsy+0.054
 clean+0.054
Neg logit contributions
 islands-0.115
 cities-0.112
 obstacles-0.109
 obstacle-0.106
 oceans-0.106
Token make
Token ID787
Feature activation-1.47e-02

Pos logit contributions
 islands+0.015
 cities+0.015
 obstacles+0.014
 oceans+0.014
 obstacle+0.014
Neg logit contributions
 okay-0.010
inky-0.009
free-0.009
etsy-0.009
uffy-0.009
Token your
Token ID534
Feature activation-3.39e-02

Pos logit contributions
 okay+0.017
free+0.017
inky+0.017
etsy+0.017
uffy+0.015
Neg logit contributions
 islands-0.030
 cities-0.029
 obstacles-0.029
 obstacle-0.028
 oceans-0.028
Token day
Token ID1110
Feature activation-6.03e-03

Pos logit contributions
inky+0.050
etsy+0.050
 okay+0.050
free+0.050
he+0.046
Neg logit contributions
 islands-0.059
 cities-0.058
 obstacles-0.054
 oceans-0.054
 obstacle-0.054
Token even
Token ID772
Feature activation+7.27e-02

Pos logit contributions
inky+0.007
etsy+0.007
 okay+0.007
free+0.007
uffy+0.006
Neg logit contributions
 islands-0.013
 cities-0.012
 oceans-0.012
 obstacles-0.012
 obstacle-0.012
Token more
Token ID517
Feature activation-2.34e-02

Pos logit contributions
 islands+0.137
 cities+0.134
 obstacles+0.130
 oceans+0.125
 cliffs+0.125
Neg logit contributions
 okay-0.103
 clean-0.095
free-0.095
 warm-0.092
 safe-0.089
Token magical
Token ID10883
Feature activation+6.09e-03

Pos logit contributions
inky+0.027
 okay+0.027
etsy+0.026
free+0.026
uffy+0.024
Neg logit contributions
 islands-0.049
 cities-0.048
 obstacles-0.046
 oceans-0.045
 obstacle-0.045
Token than
Token ID621
Feature activation+8.20e-04

Pos logit contributions
 islands+0.012
 abrupt+0.012
 cities+0.012
 obstacles+0.012
 obstacle+0.011
Neg logit contributions
 okay-0.007
free-0.007
 clean-0.007
 safe-0.007
 warm-0.006
Token You
Token ID921
Feature activation+1.81e-02

Pos logit contributions
 okay+0.013
free+0.013
inky+0.013
etsy+0.012
 clean+0.012
Neg logit contributions
 islands-0.018
 cities-0.017
 obstacles-0.017
 oceans-0.016
 obstacle-0.016
Token can
Token ID460
Feature activation-1.36e-02

Pos logit contributions
 islands+0.030
 cities+0.029
 obstacles+0.027
 oceans+0.026
 hears+0.026
Neg logit contributions
 okay-0.028
etsy-0.027
free-0.027
inky-0.027
uffy-0.026
Token try
Token ID1949
Feature activation-8.07e-03

Pos logit contributions
 okay+0.015
inky+0.014
free+0.014
 clean+0.013
etsy+0.013
Neg logit contributions
 islands-0.030
 cities-0.029
 obstacles-0.028
 oceans-0.027
 obstacle-0.027
Token it
Token ID340
Feature activation-1.15e-02

Pos logit contributions
 okay+0.008
 clean+0.007
 warm+0.007
 safe+0.006
 Lily+0.006
Neg logit contributions
 islands-0.019
roads-0.018
 abrupt-0.018
 cities-0.018
 obstacle-0.018
Token."
Token ID526
Feature activation-4.62e-03

Pos logit contributions
 okay+0.013
free+0.011
inky+0.011
 clean+0.011
 warm+0.010
Neg logit contributions
 islands-0.025
 cities-0.025
 obstacles-0.024
 obstacle-0.024
 abrupt-0.024
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
 okay+0.005
free+0.005
inky+0.004
etsy+0.004
 clean+0.004
Neg logit contributions
 islands-0.010
 cities-0.010
 obstacles-0.010
 oceans-0.009
 abrupt-0.009
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
 okay+0.011
free+0.011
inky+0.010
etsy+0.010
 clean+0.009
Neg logit contributions
 islands-0.028
 cities-0.027
 obstacles-0.027
 abrupt-0.026
 oceans-0.026
TokenL
Token ID43
Feature activation-3.47e-02

Pos logit contributions
 okay+0.010
free+0.009
inky+0.009
 clean+0.008
etsy+0.008
Neg logit contributions
 islands-0.039
 cities-0.038
 obstacles-0.037
 obstacle-0.036
 oceans-0.036
Tokenily
Token ID813
Feature activation-1.21e-02

Pos logit contributions
 okay+0.028
free+0.027
inky+0.026
etsy+0.024
 warm+0.023
Neg logit contributions
 islands-0.083
 cities-0.081
 obstacles-0.080
 abrupt-0.080
 obstacle-0.078
Token pushed
Token ID7121
Feature activation+1.48e-03

Pos logit contributions
 okay+0.015
inky+0.015
free+0.014
 clean+0.014
etsy+0.014
Neg logit contributions
 islands-0.024
 cities-0.023
 obstacles-0.023
 obstacle-0.022
 oceans-0.022
Token the
Token ID262
Feature activation+1.23e-03

Pos logit contributions
 islands+0.003
 cities+0.003
 obstacles+0.003
 obstacle+0.003
 oceans+0.003
Neg logit contributions
 okay-0.002
inky-0.002
free-0.002
etsy-0.002
uffy-0.001
Token did
Token ID750
Feature activation+4.84e-03

Pos logit contributions
 okay+0.017
free+0.016
inky+0.016
etsy+0.015
 clean+0.014
Neg logit contributions
 islands-0.024
 cities-0.023
 obstacles-0.023
 hears-0.022
 abrupt-0.022
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
 islands+0.011
 cities+0.011
 obstacles+0.011
 oceans+0.010
 obstacle+0.010
Neg logit contributions
 okay-0.005
free-0.004
inky-0.004
 clean-0.004
etsy-0.004
Token but
Token ID475
Feature activation+2.51e-02

Pos logit contributions
 islands+0.027
 cities+0.026
 obstacles+0.025
 obstacle+0.025
 oceans+0.025
Neg logit contributions
 okay-0.015
 clean-0.012
free-0.012
 warm-0.012
 safe-0.011
Token he
Token ID339
Feature activation+3.49e-02

Pos logit contributions
 islands+0.055
 cities+0.054
 obstacles+0.052
 oceans+0.051
 obstacle+0.050
Neg logit contributions
 okay-0.027
free-0.025
inky-0.025
etsy-0.023
 clean-0.023
Token did
Token ID750
Feature activation-2.35e-02

Pos logit contributions
 islands+0.080
 cities+0.079
 oceans+0.076
 obstacles+0.074
 obstacle+0.072
Neg logit contributions
 okay-0.037
inky-0.033
 clean-0.032
uffy-0.031
etsy-0.031
Token not
Token ID407
Feature activation-4.56e-02

Pos logit contributions
 okay+0.015
free+0.012
inky+0.012
 clean+0.011
etsy+0.010
Neg logit contributions
 islands-0.063
 cities-0.062
 obstacles-0.060
 oceans-0.059
 obstacle-0.059
Token know
Token ID760
Feature activation-2.79e-02

Pos logit contributions
 okay+0.058
inky+0.056
free+0.056
etsy+0.053
uffy+0.051
Neg logit contributions
 islands-0.088
 cities-0.087
 obstacles-0.084
 obstacle-0.082
 abrupt-0.081
Token how
Token ID703
Feature activation+1.79e-03

Pos logit contributions
 okay+0.044
free+0.040
 clean+0.039
 warm+0.038
inky+0.037
Neg logit contributions
 islands-0.048
 cities-0.047
 obstacles-0.044
 oceans-0.044
 obstacle-0.044
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
 islands+0.004
 cities+0.004
 obstacles+0.003
 obstacle+0.003
 abrupt+0.003
Neg logit contributions
 okay-0.002
inky-0.002
free-0.002
etsy-0.002
uffy-0.002
Token
Token ID198
Feature activation-1.40e-02

Pos logit contributions
 okay+0.016
free+0.016
inky+0.015
etsy+0.014
 clean+0.014
Neg logit contributions
 islands-0.031
 cities-0.031
 obstacles-0.030
 oceans-0.029
 abrupt-0.029
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
 okay+0.013
free+0.013
inky+0.012
etsy+0.012
 clean+0.011
Neg logit contributions
 islands-0.031
 cities-0.031
 obstacles-0.030
 abrupt-0.029
 oceans-0.029
Token a
Token ID257
Feature activation-3.46e-03

Pos logit contributions
 okay+0.010
free+0.008
 clean+0.008
inky+0.008
 warm+0.008
Neg logit contributions
 islands-0.026
 cities-0.025
 obstacles-0.024
 oceans-0.024
 obstacle-0.024
Token big
Token ID1263
Feature activation+2.06e-02

Pos logit contributions
 okay+0.003
inky+0.003
 clean+0.003
free+0.003
 warm+0.003
Neg logit contributions
 islands-0.008
 cities-0.008
 obstacles-0.007
 oceans-0.007
 abrupt-0.007
Token,
Token ID11
Feature activation+9.03e-03

Pos logit contributions
 islands+0.040
 cities+0.039
 obstacles+0.038
 oceans+0.037
 abrupt+0.037
Neg logit contributions
 clean-0.025
 warm-0.025
 okay-0.025
 safe-0.025
 dry-0.022
Token scary
Token ID14343
Feature activation+2.49e-03

Pos logit contributions
 islands+0.020
 cities+0.019
 obstacles+0.019
 oceans+0.018
 whales+0.018
Neg logit contributions
 okay-0.010
 clean-0.009
free-0.009
 warm-0.009
 safe-0.008
Token wolf
Token ID17481
Feature activation+2.40e-02

Pos logit contributions
 islands+0.005
 cities+0.005
 obstacles+0.004
 oceans+0.004
 abrupt+0.004
Neg logit contributions
 okay-0.003
 warm-0.003
 clean-0.003
inky-0.003
 safe-0.003
Token coming
Token ID2406
Feature activation-8.10e-02

Pos logit contributions
 islands+0.046
 cities+0.044
 obstacles+0.043
 oceans+0.042
 abrupt+0.041
Neg logit contributions
 okay-0.031
inky-0.030
etsy-0.030
free-0.029
 clean-0.027
Token towards
Token ID3371
Feature activation+5.17e-02

Pos logit contributions
 okay+0.114
free+0.110
inky+0.108
etsy+0.102
 clean+0.101
Neg logit contributions
 islands-0.149
 cities-0.146
 obstacles-0.140
 oceans-0.136
 obstacle-0.136
Token them
Token ID606
Feature activation-5.41e-02

Pos logit contributions
 islands+0.088
 cities+0.086
 obstacles+0.082
 oceans+0.080
 obstacle+0.080
Neg logit contributions
 okay-0.079
free-0.076
inky-0.073
 clean-0.072
 warm-0.071
Token.
Token ID13
Feature activation-3.23e-03

Pos logit contributions
etsy+0.071
inky+0.069
free+0.067
 okay+0.067
uffy+0.065
Neg logit contributions
 islands-0.100
 cities-0.096
 obstacles-0.095
 abrupt-0.093
 obstacle-0.092
Token They
Token ID1119
Feature activation-1.33e-03

Pos logit contributions
 okay+0.003
inky+0.002
free+0.002
 clean+0.002
etsy+0.002
Neg logit contributions
 islands-0.008
 cities-0.008
 obstacles-0.008
 obstacle-0.007
 oceans-0.007
Token were
Token ID547
Feature activation-9.50e-03

Pos logit contributions
 okay+0.002
free+0.001
inky+0.001
etsy+0.001
 clean+0.001
Neg logit contributions
 islands-0.003
 cities-0.003
 obstacles-0.003
 oceans-0.003
 obstacle-0.003
Token hill
Token ID12788
Feature activation+4.40e-02

Pos logit contributions
 okay+0.032
 clean+0.032
 warm+0.031
free+0.031
 safe+0.030
Neg logit contributions
 islands-0.021
 cities-0.020
 obstacles-0.019
 abrupt-0.019
 whales-0.018
Token and
Token ID290
Feature activation+7.84e-03

Pos logit contributions
 islands+0.107
 cities+0.105
 obstacles+0.103
 obstacle+0.101
 oceans+0.100
Neg logit contributions
 okay-0.034
free-0.031
 clean-0.030
inky-0.030
he-0.028
Token flies
Token ID17607
Feature activation-2.19e-02

Pos logit contributions
 islands+0.018
 cities+0.018
 obstacles+0.018
 abrupt+0.018
 obstacle+0.017
Neg logit contributions
 okay-0.007
 clean-0.007
 warm-0.006
 safe-0.006
 Lily-0.006
Token in
Token ID287
Feature activation+4.89e-02

Pos logit contributions
 okay+0.027
free+0.026
inky+0.025
etsy+0.024
 clean+0.023
Neg logit contributions
 islands-0.044
 cities-0.043
 obstacles-0.042
 oceans-0.041
 obstacle-0.040
Token the
Token ID262
Feature activation+3.92e-03

Pos logit contributions
 islands+0.111
 cities+0.109
 obstacles+0.105
 obstacle+0.104
 oceans+0.103
Neg logit contributions
 okay-0.047
free-0.043
 clean-0.041
inky-0.041
 warm-0.040
Token air
Token ID1633
Feature activation-6.37e-02

Pos logit contributions
 islands+0.008
 cities+0.008
 obstacles+0.008
 abrupt+0.007
 whales+0.007
Neg logit contributions
 okay-0.005
free-0.004
inky-0.004
 clean-0.004
 warm-0.004
Token.
Token ID13
Feature activation-7.34e-03

Pos logit contributions
 okay+0.055
free+0.051
inky+0.049
 clean+0.046
 warm+0.045
Neg logit contributions
 islands-0.150
 cities-0.148
 obstacles-0.144
 obstacle-0.140
 oceans-0.140
Token Tom
Token ID4186
Feature activation-3.18e-02

Pos logit contributions
free+0.009
 okay+0.009
inky+0.008
etsy+0.008
un+0.008
Neg logit contributions
 islands-0.015
 cities-0.014
 obstacles-0.014
 abrupt-0.014
 oceans-0.013
Token and
Token ID290
Feature activation-1.96e-02

Pos logit contributions
 clean+0.030
 okay+0.029
inky+0.028
 warm+0.027
uffy+0.026
Neg logit contributions
 islands-0.070
 cities-0.068
 obstacles-0.067
 obstacle-0.067
 abrupt-0.067
Token Lily
Token ID20037
Feature activation-4.23e-02

Pos logit contributions
 okay+0.013
inky+0.013
free+0.012
etsy+0.011
he+0.010
Neg logit contributions
 islands-0.050
 cities-0.049
 obstacles-0.048
 abrupt-0.048
 obstacle-0.047
Token feel
Token ID1254
Feature activation+9.13e-03

Pos logit contributions
 okay+0.058
inky+0.056
free+0.054
etsy+0.054
uffy+0.051
Neg logit contributions
 islands-0.079
 cities-0.078
 obstacles-0.074
 hears-0.073
 oceans-0.072
Token turn
Token ID1210
Feature activation-1.04e-02

Pos logit contributions
 islands+0.037
 cities+0.036
 abrupt+0.034
 obstacles+0.034
 hears+0.033
Neg logit contributions
inky-0.028
 okay-0.028
free-0.027
etsy-0.027
uffy-0.026
Token off
Token ID572
Feature activation-4.76e-03

Pos logit contributions
 okay+0.012
inky+0.011
free+0.011
etsy+0.011
uffy+0.010
Neg logit contributions
 islands-0.022
 cities-0.021
 obstacles-0.021
 oceans-0.020
 obstacle-0.020
Token the
Token ID262
Feature activation+5.54e-03

Pos logit contributions
inky+0.006
 okay+0.006
etsy+0.006
free+0.006
uffy+0.005
Neg logit contributions
 islands-0.009
 cities-0.009
 obstacles-0.009
 oceans-0.009
 abrupt-0.009
Token alarm
Token ID10436
Feature activation-3.51e-03

Pos logit contributions
 islands+0.010
 cities+0.010
 obstacles+0.010
 oceans+0.009
 obstacle+0.009
Neg logit contributions
 okay-0.008
free-0.007
inky-0.007
etsy-0.007
 clean-0.007
Token.
Token ID13
Feature activation-8.58e-03

Pos logit contributions
 okay+0.004
inky+0.004
etsy+0.004
free+0.004
uffy+0.004
Neg logit contributions
 islands-0.007
 cities-0.007
 obstacles-0.006
 oceans-0.006
 obstacle-0.006
Token When
Token ID1649
Feature activation-8.56e-02

Pos logit contributions
free+0.011
 okay+0.011
inky+0.010
less+0.010
he+0.010
Neg logit contributions
 cities-0.015
 abrupt-0.014
 hears-0.014
 islands-0.014
 oceans-0.014
Token she
Token ID673
Feature activation-9.21e-03

Pos logit contributions
inky+0.131
free+0.119
etsy+0.116
 okay+0.109
un+0.108
Neg logit contributions
 cities-0.145
 islands-0.140
 hears-0.135
 oceans-0.135
 obstacles-0.135
Token got
Token ID1392
Feature activation-8.15e-02

Pos logit contributions
 okay+0.012
free+0.012
inky+0.012
etsy+0.011
uffy+0.010
Neg logit contributions
 islands-0.017
 cities-0.017
 obstacles-0.016
 abrupt-0.016
 oceans-0.016
Token there
Token ID612
Feature activation-4.72e-02

Pos logit contributions
inky+0.132
free+0.127
etsy+0.121
isc+0.112
uffy+0.110
Neg logit contributions
 islands-0.134
 cities-0.133
 oceans-0.127
 hears-0.127
 obstacles-0.121
Token,
Token ID11
Feature activation-9.73e-03

Pos logit contributions
inky+0.064
free+0.059
 okay+0.055
etsy+0.053
un+0.053
Neg logit contributions
 cities-0.089
 islands-0.088
 obstacles-0.083
 oceans-0.083
 hears-0.082
Token she
Token ID673
Feature activation+3.63e-03

Pos logit contributions
inky+0.014
free+0.011
etsy+0.011
age+0.011
un+0.010
Neg logit contributions
 cities-0.018
 obstacles-0.017
 islands-0.017
 cliffs-0.016
 hears-0.016
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
 islands+0.019
 cities+0.019
 obstacles+0.018
 obstacle+0.017
 abrupt+0.017
Neg logit contributions
 okay-0.022
inky-0.020
free-0.020
 clean-0.019
etsy-0.019
Token very
Token ID845
Feature activation-1.36e-02

Pos logit contributions
inky+0.014
free+0.014
etsy+0.013
uffy+0.012
 okay+0.012
Neg logit contributions
 islands-0.020
 cities-0.020
 obstacles-0.019
 oceans-0.019
 abrupt-0.018
Token scared
Token ID12008
Feature activation-2.74e-02

Pos logit contributions
inky+0.013
free+0.013
etsy+0.012
 okay+0.012
he+0.011
Neg logit contributions
 islands-0.030
 cities-0.030
 obstacles-0.029
 abrupt-0.028
 oceans-0.028
Token.
Token ID13
Feature activation-8.23e-03

Pos logit contributions
inky+0.034
 okay+0.034
free+0.033
etsy+0.033
uffy+0.030
Neg logit contributions
 islands-0.054
 cities-0.052
 obstacles-0.052
 oceans-0.050
 abrupt-0.050
Token Ahead
Token ID40344
Feature activation+5.58e-02

Pos logit contributions
 okay+0.011
free+0.010
inky+0.010
etsy+0.010
 clean+0.009
Neg logit contributions
 islands-0.016
 cities-0.015
 obstacles-0.015
 oceans-0.015
 obstacle-0.014
Token of
Token ID286
Feature activation-6.81e-02

Pos logit contributions
 islands+0.122
 cities+0.121
 obstacles+0.115
 hears+0.114
 abrupt+0.113
Neg logit contributions
 okay-0.060
free-0.057
inky-0.057
etsy-0.055
uffy-0.047
Token him
Token ID683
Feature activation-3.63e-02

Pos logit contributions
 okay+0.064
 clean+0.055
 warm+0.055
free+0.055
 safe+0.053
Neg logit contributions
 islands-0.154
 cities-0.151
 obstacle-0.147
 obstacles-0.147
 abrupt-0.146
Token everything
Token ID2279
Feature activation-1.19e-02

Pos logit contributions
 okay+0.048
free+0.047
inky+0.047
etsy+0.044
 clean+0.042
Neg logit contributions
 islands-0.068
 cities-0.067
 obstacles-0.064
 oceans-0.062
 hears-0.062
Token was
Token ID373
Feature activation+2.26e-02

Pos logit contributions
 okay+0.015
free+0.014
inky+0.014
 clean+0.013
etsy+0.013
Neg logit contributions
 islands-0.024
 cities-0.023
 obstacles-0.022
 oceans-0.022
 abrupt-0.022
Token dark
Token ID3223
Feature activation-4.10e-02

Pos logit contributions
 islands+0.042
 obstacles+0.041
 cities+0.041
 abrupt+0.039
 oceans+0.038
Neg logit contributions
inky-0.029
free-0.029
etsy-0.028
 okay-0.026
he-0.026
Token.
Token ID13
Feature activation-3.97e-03

Pos logit contributions
 okay+0.049
inky+0.047
free+0.046
etsy+0.045
 clean+0.042
Neg logit contributions
 islands-0.084
 cities-0.082
 obstacles-0.079
 oceans-0.077
 abrupt-0.076
Token "
Token ID366
Feature activation+6.29e-02

Pos logit contributions
 islands+0.080
 cities+0.077
 obstacles+0.076
 obstacle+0.074
 abrupt+0.072
Neg logit contributions
 okay-0.052
free-0.049
inky-0.046
 clean-0.045
etsy-0.044
TokenThat
Token ID2504
Feature activation+3.18e-02

Pos logit contributions
 islands+0.109
 cities+0.107
 obstacles+0.102
 oceans+0.099
 abrupt+0.098
Neg logit contributions
 okay-0.094
inky-0.090
free-0.090
etsy-0.085
 clean-0.084
Token's
Token ID338
Feature activation+1.69e-02

Pos logit contributions
 islands+0.068
 cities+0.067
 obstacles+0.065
 oceans+0.063
 obstacle+0.063
Neg logit contributions
 okay-0.035
free-0.033
inky-0.032
etsy-0.030
 clean-0.030
Token okay
Token ID8788
Feature activation-3.07e-02

Pos logit contributions
 islands+0.033
 cities+0.033
 obstacles+0.033
 obstacle+0.032
 abrupt+0.032
Neg logit contributions
inky-0.020
etsy-0.019
free-0.019
uffy-0.017
he-0.017
Token!
Token ID0
Feature activation+2.57e-02

Pos logit contributions
inky+0.030
 okay+0.030
free+0.030
etsy+0.028
he+0.027
Neg logit contributions
 islands-0.066
 cities-0.065
 obstacles-0.063
 obstacle-0.061
 abrupt-0.061
Token When
Token ID1649
Feature activation-8.46e-02

Pos logit contributions
 cities+0.036
 islands+0.035
 obstacles+0.033
 oceans+0.033
 abrupt+0.032
Neg logit contributions
inky-0.043
 okay-0.042
he-0.041
free-0.041
 warm-0.040
Token we
Token ID356
Feature activation-8.57e-03

Pos logit contributions
 okay+0.101
free+0.090
 clean+0.087
inky+0.084
 warm+0.083
Neg logit contributions
 islands-0.177
 cities-0.173
 obstacles-0.168
 abrupt-0.165
 obstacle-0.164
Token are
Token ID389
Feature activation-1.41e-02

Pos logit contributions
inky+0.011
 okay+0.011
etsy+0.011
free+0.011
uffy+0.010
Neg logit contributions
 islands-0.016
 cities-0.015
 obstacles-0.015
 hears-0.014
 obstacle-0.014
Token anxious
Token ID18116
Feature activation-3.70e-02

Pos logit contributions
inky+0.017
free+0.017
etsy+0.017
uffy+0.015
he+0.015
Neg logit contributions
 islands-0.027
 cities-0.027
 obstacles-0.026
 whales-0.026
 oceans-0.025
Token about
Token ID546
Feature activation-6.12e-02

Pos logit contributions
 okay+0.039
free+0.036
inky+0.035
etsy+0.033
 clean+0.032
Neg logit contributions
 islands-0.081
 cities-0.080
 obstacles-0.077
 oceans-0.075
 obstacle-0.075
Token something
Token ID1223
Feature activation-1.13e-02

Pos logit contributions
 okay+0.076
free+0.067
 clean+0.065
 warm+0.063
inky+0.063
Neg logit contributions
 islands-0.125
 cities-0.122
 obstacles-0.119
 obstacle-0.117
 abrupt-0.116